CsaV3_4G009620 (gene) Cucumber (Chinese Long) v3

Overview
NameCsaV3_4G009620
Typegene
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
DescriptionUnknown protein
Locationchr4: 7344320 .. 7349225 (+)
RNA-Seq ExpressionCsaV3_4G009620
SyntenyCsaV3_4G009620
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CAATTTCATGGTTTTGGTAAGTCACAAAGTTGAAATTCTTCCCACCTTTTTCAATAAAGTAGTGGAAAGAGAGAGAAGAAAGAAAAGCACAAAAGCTCATCAAAACCCCAAATGTGACCCATTATACATCCCATTATCATTTTTGTTCCAAAAATTTCCTTCTCTTTCCCAATTCATTGAAAGAAGAAAACATCATGGTTAAAGCTTTGCAACAATCAAAATCGCAATCAAGAACAACAAAAACAACCAATAATCTTGTTTCTCCAAAGCTTTTCCTCTATCTCCTCTCCATCTCAGCTCTCCTTTTCATCCTCTTCCACATCCACTCCCTCCACCACCATGTCCCTCCACCACCTTCCTCCATCGTTGCTGCGAAGCTCCGCCGTTCCGTTACGTTTCTTCCCCTCAAGGACTTGCGTTACTCCAACAAAGCCCTTGTAGGCCATACATGGTTCATGAGCTCCTTGTATGACATCCAGGAGGAAGGTGAGGTTCAATACCAGCAGTTTCCATCACCGGTGGTGGACGGCGACGAGCGGATGCTTTGCCTGAAAGGGCGTGACACTCACGACGGGTCTTGGAATTACTATGGGTTGGCGTGGCCTGAAGGTTTGCCGGAAAATGCGAGGGTCAAGAAAGGTGTGAGCTTTGTGTCTTACAATCATTATGATTATCAAAATATTTGGCATGGCTTGTCCGCTCTCATGCCTTTCGTTGCTTGGCATCAGATTCAAGGTAACGCCATTTGTTTTTTTTTTGGATTTATTCATCTCATTACTCATTTTTTTGCTTCATTTCTGGGCTTAGCGCTACTTGGAAATTGTGTCAAGTCTTAAAACCTTCATTTTATATATATATATAACTTGATTTGGAAGTAAAAGTAATTTATTATTAATTTAAGCATTTTTTTATATAGTTTTTGTTGTCTATAAAAATACTTGTTGTATCAATACTCTATTATATATTCATTTATTCAAAACGATATATTACTATAAATTATATCAATATCTTAATAAATAATTACATTGCCTAACACTCTAATTAGGAATAATAATTGAGAGGTTTACTAAACAAAACAAAATAATAATAATGATAGACGATACGGTAATTTTTAATTTAATTATCTTAGTTTAACGTTAACATTAAACTAAAATAATTAAATTAAATTATCTTAAAAATTATACATTTTGTAATTTTATACTATTTTGGTAATCAATATGAGAAAACAAAAAGTTGTATATTTTGCTTTAATTTTGATATGAATTTATGATACAACAAAGGAAAGTGTGAAGTACCAGAGAGATGGATATTATACCACTGGGGGGAACTGAGATTGAGAATGGGAAAATGGGTTTCCACATTAATGGAGGCCACATTCGGAGCCCCGCTTCAGTTCGAAGCTTTCGAAGATATCAGCGAAGGGCAGCCAGTTTGCTTCGAGAAGGCGGTGGTGATGAGACACAACGAGGGCGGAATGTCGAGGCAGCGCCGGATGGAGACCTACGACTTTATGAGGTGCAAGGCACGGTTGTTCTGCAACTTAACCTCGCCTGAGCCGTTGTCAGCGGCGGTGGGTATGACGATGTTAATGAGAACGGGGCCCAGGTCGTTCAGGAATGAGACGACGGTGGTGGAGATTTTTGGGAAGGAATGCGCAAAAGTCGCCGGCTGTCGCCTCACGGTGGCTTATTCCAATAATCTCACCTTCTGTGAACAGGTAACCCGTCCTTTAATCTACCTTCTCTTCTTTCTTTTTTATTTATTGAGATTTGGAATATTAAATTTTGTTGTTATTCTTTTTAAATTATTGGTAAATTGAAATTATCACTTTGAAAGTTTTACACCATCAAGGTTTTAGCCTACAAATTTTAAAATTTATATAACAATCACATCACATGTGCATATTATATAATGGGAGAGTATCCTATTTATCTGATTAAGATTATTATAAAATCTTTTATTTCTCCAATTTGATCTAGCAATACATATCAACAACTTTTCTATTATCTTTCTTAAAAAAAAAATGAAAAAAAACAACTTTTCTATTAATAAATTATATGATATAATCGTATTTAGAGAATATTTATATTTGAGAATTTTCGATAAATTTAAGTGTAAATTTGAGAATTTTGAAATATTAAAATCAAAATTTAGAAAACTCTAAAAGATTTAGGATCGTTATTTATATTTCTAAACTTTCATTAAAATTAATATATTAGGAAAAATCAACTTTTTTAAATATAGAAAAATAGGTTAAATTGTATGTAATAGAAATCAATACTTAATCTTAGACGTGAATAGAAACCTATTAATGTCTTTTAATAATAGAAATTGGTAAAAATCAATAATTGATCAAGCCCATAGATATATTATATATTATGATTAGAGATTTATATAAGTTTATCATTGTCTACAAGTTTTTTTTTCTATTTCTATAAATAGTTTAACGTTTTTCTACAAAAATTTATCCAAGAAAAAACTTAAATTTGATGGTGTCGAGAGGTTGCAAATTTTACATTAAAAAAACTAAACAAACTCATACGTCTGATAAGGTAGGTTAAGCATTAATTACTATTCTTATTACCAATGACGAGATCCATAGATTTTCGTTAATGTTTTTCTTCAAAGCTTTTAATATCATGGAAGATATGTATCTATTTATATTCTAAATGCAGGTTGGGAGGTGCATGTTATGACATTAAAATTTATTATTGGGAATAGTCCAAATGACTTTATTAAAGCATATCTTATAAAATACTTGAACAACAACCCACATTATATTGGATAATAAAGTGATGAATCCAGTACTCTGGTCACCCACTCAAATTTTCATAAACAAGTTTATTTGTGTTTGGTATTTCTTTTTTCTAAGTATGACGGCCTTGGTCATAAATCCAACCTTTGATCTTTATATAGAAAAAGTCTCTCTTTTAGTTATTATTAAGCTAAACTAACTTTGATGTACTTGTTTGGTATTCATATATTTTTTATTTTGCAACCTATAATATAGAAAATTAAAATTTTAGATTCTCACTAAAATAAAATTTGATTTTATATCAAAATATCACTAAATTAGCTTTAAATTTTTAAATACAATAAAGGTATAATCGAAACAATTTTAGTTAGTTTATGAGAGTAGTTGAAGAACTTGTAAGAGTTTCTAAAAATTGTTTAAAACAAATAAACATTATAATATAATATAGAACCTTAAGGGAAGTCTAGGAAGAAAATAGGCAAAACATTTCGGGAACATTTTAACCCTTTAGAACGTAGAATTAAGTTTTAAAGTCTAAAGTCAATATGTTGAAGTTAGGAAGTCTACGTTTTGGAGTATTGTTATTTAACGTGGAGCTAAGAAACTCTGAAAAGAAAAGAGAAAAAGATCAAATTTCTTGATAAATATGGTTTAATAGTAAAGAATGTTGAAGTTAATGAAATTAAGTTATTTATTATCTATTTTTAAATTGGATCAGTCAAACACACTTGTTGTTTATTTCCTTTCACAATATACAAAACATGTATATCAAAGTTTGTATTTTGATGATTAACTAATGAAAAATGTGAATTAATTAACAGGTGAGTTTGATGGGGAAGACGGACATATTGATATCCCCACATGGAGCACAACTGACAAACATGATTCTAATGAACAGAAACAGTAGCGTAATGGAATTCTTTCCCAAAGGGTGGTTGGAACTTGCGGGCATTGGCCAATATGTGTACCATTGGCTCGCTAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATAGCACTCTTCCCTGTCCTTATTCTCCCGGCGATCGTCGATGCATGTCCATTTACAAAGCTGGCACTATTGGGTAGTACATTTCATATCGTTCTATTTCTGTTTTAATTTGTTTTTTCAAAAATTATCTTGTTCTTGATTGAATTTTCGACTATAGTTTATTGGAAATTAACTTTTTTAGTCCATTAGTTTTTGTACCGATTTAAATGGTTTTTTGAAGTTAATTTTGTCATTTGATATCAAATACATCCTACTTAAAAAAAAGGTAGGAGCCATTTCAATTTGTTTCTCAAAATTTAGGAGACTTGTGAAGTCTATTTCAAAATTTACTTTTTTCATATTTTTCGAATTTGCTCCTTGAGTTTGCACAGCGATTTCAAGTTTGGGGTCTTTAAACTTTTAGAGTTGTTATGTTTGATGTATTCGTAAACTTTGAATATTGTGCTCATGGATTGTTGATAGATTTCTTGTAAACTTTTGATATGATACTTAATATTTATCAAAACTTTCGTCAGACCCTGTAGATAGCTCCACTTTCAATTTTCGATCCATGACCTTGTAGACCTATTTTTGACCCATTTTCAATTTTGAATACAAAAGAGTCCATTGTATTTTGTACAAACTTATATGATTTCTTGTTGGATAGTCTAGATATGCAGAAGTAGGCTTTTCCTAATTAAAATTGAAGTTCAGATGCGTTTGGTGTGTTAGATTTCTGTTGAATATAGTATTAGACACATCCCAACGGATAACATTTCCTACCCATGCATGATGAACTAAAAGAATTATTAATCCTAATTCATACTAATAATGTTACTATTTTTTTATTATGTGTTAACAGATACAACAGAACACACTTTTCTGAGTGGGCTAAGAGTGTTCTGAATGAGGTGAAGATGAGAAAGATGGAGGAAGCAACAAAGGTCACTACAAATCAAATTCATGAGTGTTCTTGTATCTAATTTTTCCCCATCTTCTAGGCACATTTCTTTAGTAATTTATATACCCCATTGACTCTTTTTTTTTCCGGAAAAAGAGAATAATAATCATTGTATAAAAATATACTGGAATTTATTGACTTTTTTGGTAAAATGATTTACACATCCATCCAAAAATTCTATTTGTTAG

mRNA sequence

ATGGTTAAAGCTTTGCAACAATCAAAATCGCAATCAAGAACAACAAAAACAACCAATAATCTTGTTTCTCCAAAGCTTTTCCTCTATCTCCTCTCCATCTCAGCTCTCCTTTTCATCCTCTTCCACATCCACTCCCTCCACCACCATGTCCCTCCACCACCTTCCTCCATCGTTGCTGCGAAGCTCCGCCGTTCCGTTACGTTTCTTCCCCTCAAGGACTTGCGTTACTCCAACAAAGCCCTTGTAGGCCATACATGGTTCATGAGCTCCTTGTATGACATCCAGGAGGAAGGTGAGGTTCAATACCAGCAGTTTCCATCACCGGTGGTGGACGGCGACGAGCGGATGCTTTGCCTGAAAGGGCGTGACACTCACGACGGGTCTTGGAATTACTATGGGTTGGCGTGGCCTGAAGGTTTGCCGGAAAATGCGAGGGTCAAGAAAGGTGTGAGCTTTGTGTCTTACAATCATTATGATTATCAAAATATTTGGCATGGCTTGTCCGCTCTCATGCCTTTCGTTGCTTGGCATCAGATTCAAGGAAAGTGTGAAGTACCAGAGAGATGGATATTATACCACTGGGGGGAACTGAGATTGAGAATGGGAAAATGGGTTTCCACATTAATGGAGGCCACATTCGGAGCCCCGCTTCAGTTCGAAGCTTTCGAAGATATCAGCGAAGGGCAGCCAGTTTGCTTCGAGAAGGCGGTGGTGATGAGACACAACGAGGGCGGAATGTCGAGGCAGCGCCGGATGGAGACCTACGACTTTATGAGGTGCAAGGCACGGTTGTTCTGCAACTTAACCTCGCCTGAGCCGTTGTCAGCGGCGGTGGGTATGACGATGTTAATGAGAACGGGGCCCAGGTCGTTCAGGAATGAGACGACGGTGGTGGAGATTTTTGGGAAGGAATGCGCAAAAGTCGCCGGCTGTCGCCTCACGGTGGCTTATTCCAATAATCTCACCTTCTGTGAACAGGTGAGTTTGATGGGGAAGACGGACATATTGATATCCCCACATGGAGCACAACTGACAAACATGATTCTAATGAACAGAAACAGTAGCGTAATGGAATTCTTTCCCAAAGGGTGGTTGGAACTTGCGGGCATTGGCCAATATGTGTACCATTGGCTCGCTAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATAGCACTCTTCCCTGTCCTTATTCTCCCGGCGATCGTCGATGCATGTCCATTTACAAAGCTGGCACTATTGGATACAACAGAACACACTTTTCTGAGTGGGCTAAGAGTGTTCTGAATGAGGTGAAGATGAGAAAGATGGAGGAAGCAACAAAGGTCACTACAAATCAAATTCATGAGTGTTCTTGTATCTAA

Coding sequence (CDS)

ATGGTTAAAGCTTTGCAACAATCAAAATCGCAATCAAGAACAACAAAAACAACCAATAATCTTGTTTCTCCAAAGCTTTTCCTCTATCTCCTCTCCATCTCAGCTCTCCTTTTCATCCTCTTCCACATCCACTCCCTCCACCACCATGTCCCTCCACCACCTTCCTCCATCGTTGCTGCGAAGCTCCGCCGTTCCGTTACGTTTCTTCCCCTCAAGGACTTGCGTTACTCCAACAAAGCCCTTGTAGGCCATACATGGTTCATGAGCTCCTTGTATGACATCCAGGAGGAAGGTGAGGTTCAATACCAGCAGTTTCCATCACCGGTGGTGGACGGCGACGAGCGGATGCTTTGCCTGAAAGGGCGTGACACTCACGACGGGTCTTGGAATTACTATGGGTTGGCGTGGCCTGAAGGTTTGCCGGAAAATGCGAGGGTCAAGAAAGGTGTGAGCTTTGTGTCTTACAATCATTATGATTATCAAAATATTTGGCATGGCTTGTCCGCTCTCATGCCTTTCGTTGCTTGGCATCAGATTCAAGGAAAGTGTGAAGTACCAGAGAGATGGATATTATACCACTGGGGGGAACTGAGATTGAGAATGGGAAAATGGGTTTCCACATTAATGGAGGCCACATTCGGAGCCCCGCTTCAGTTCGAAGCTTTCGAAGATATCAGCGAAGGGCAGCCAGTTTGCTTCGAGAAGGCGGTGGTGATGAGACACAACGAGGGCGGAATGTCGAGGCAGCGCCGGATGGAGACCTACGACTTTATGAGGTGCAAGGCACGGTTGTTCTGCAACTTAACCTCGCCTGAGCCGTTGTCAGCGGCGGTGGGTATGACGATGTTAATGAGAACGGGGCCCAGGTCGTTCAGGAATGAGACGACGGTGGTGGAGATTTTTGGGAAGGAATGCGCAAAAGTCGCCGGCTGTCGCCTCACGGTGGCTTATTCCAATAATCTCACCTTCTGTGAACAGGTGAGTTTGATGGGGAAGACGGACATATTGATATCCCCACATGGAGCACAACTGACAAACATGATTCTAATGAACAGAAACAGTAGCGTAATGGAATTCTTTCCCAAAGGGTGGTTGGAACTTGCGGGCATTGGCCAATATGTGTACCATTGGCTCGCTAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATAGCACTCTTCCCTGTCCTTATTCTCCCGGCGATCGTCGATGCATGTCCATTTACAAAGCTGGCACTATTGGATACAACAGAACACACTTTTCTGAGTGGGCTAAGAGTGTTCTGAATGAGGTGAAGATGAGAAAGATGGAGGAAGCAACAAAGGTCACTACAAATCAAATTCATGAGTGTTCTTGTATCTAA

Protein sequence

MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI*
Homology
BLAST of CsaV3_4G009620 vs. NCBI nr
Match: XP_011653390.1 (uncharacterized protein LOC101219216 [Cucumis sativus] >KGN53729.1 hypothetical protein Csa_014798 [Cucumis sativus])

HSP 1 Score: 958.0 bits (2475), Expect = 3.0e-275
Identity = 457/457 (100.00%), Postives = 457/457 (100.00%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60
           MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA
Sbjct: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60

Query: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120
           KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK
Sbjct: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120

Query: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180
           GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ
Sbjct: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180

Query: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240
           GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR
Sbjct: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240

Query: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300
           HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI
Sbjct: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300

Query: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360
           FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF
Sbjct: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360

Query: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420
           PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN
Sbjct: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420

Query: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 458
           RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI
Sbjct: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 457

BLAST of CsaV3_4G009620 vs. NCBI nr
Match: TYK07539.1 (uncharacterized protein E5676_scaffold544G00050 [Cucumis melo var. makuwa])

HSP 1 Score: 890.2 bits (2299), Expect = 7.6e-255
Identity = 423/450 (94.00%), Postives = 431/450 (95.78%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60
           +VK LQQSKSQSR TKTTNNLV PKLFLYLLSISALL ILFHIHSLHHHV PPP S + A
Sbjct: 2   VVKGLQQSKSQSRATKTTNNLVCPKLFLYLLSISALLSILFHIHSLHHHVLPPPPSSIVA 61

Query: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120
           KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK
Sbjct: 62  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 121

Query: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180
           GRDTHDGSWNYYGLAWPEGLPENA V KGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ
Sbjct: 122 GRDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 181

Query: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240
           GKCEVPERWILYHWGELRLRMGKWV+TLMEATFGAP++ EAFE ISEGQPVCFEKAVVMR
Sbjct: 182 GKCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISEGQPVCFEKAVVMR 241

Query: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300
           HNEGGMSRQRRMETYDFMRCKARL CNLTSPEPLS AVGMTMLMRTGPRSFRNETTV EI
Sbjct: 242 HNEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTGPRSFRNETTVAEI 301

Query: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360
           FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF
Sbjct: 302 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 361

Query: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420
           PKGWLELAGIGQYVYHWLASWSGM+HQGAWRDPNSTLPCPYSP DRRCMS YK GTIGYN
Sbjct: 362 PKGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRRCMSFYKGGTIGYN 421

Query: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQ 451
           RT+FSEWAKSVLNEVKMRK+EEATK TTNQ
Sbjct: 422 RTYFSEWAKSVLNEVKMRKIEEATKFTTNQ 451

BLAST of CsaV3_4G009620 vs. NCBI nr
Match: XP_008462883.2 (PREDICTED: uncharacterized protein LOC103501161, partial [Cucumis melo])

HSP 1 Score: 815.1 bits (2104), Expect = 3.1e-232
Identity = 378/396 (95.45%), Postives = 385/396 (97.22%), Query Frame = 0

Query: 62  LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG 121
           LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG
Sbjct: 1   LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG 60

Query: 122 RDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG 181
           RDTHDGSWNYYGLAWPEGLPENA V KGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG
Sbjct: 61  RDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG 120

Query: 182 KCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMRH 241
           KCEVPERWILYHWGELRLRMGKWV+TLMEATFGAP++ EAFE ISEGQPVCFEKAVVMRH
Sbjct: 121 KCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISEGQPVCFEKAVVMRH 180

Query: 242 NEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEIF 301
           NEGGMSRQRRMETYDFMRCKARL CNLTSPEPLS AVGMTMLMRTGPRSFRNETTV EIF
Sbjct: 181 NEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTGPRSFRNETTVAEIF 240

Query: 302 GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP 361
           GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP
Sbjct: 241 GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP 300

Query: 362 KGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYNR 421
           KGWLELAGIGQYVYHWLASWSGM+HQGAWRDPNSTLPCPYSP DRRCMS YK GTIGYNR
Sbjct: 301 KGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRRCMSFYKGGTIGYNR 360

Query: 422 THFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 458
           T+FSEWAKSVLNEVKMRK+EEATK TTNQ+HECSCI
Sbjct: 361 TYFSEWAKSVLNEVKMRKIEEATKFTTNQVHECSCI 396

BLAST of CsaV3_4G009620 vs. NCBI nr
Match: XP_023541716.1 (uncharacterized protein LOC111801789 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 768.1 bits (1982), Expect = 4.4e-218
Identity = 363/461 (78.74%), Postives = 395/461 (85.68%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPP-----S 60
           MVK    SK   RTT     L SPKLF+YLLSISA+LFI FHI SLH HV PPP     S
Sbjct: 1   MVKPSHTSKPHQRTTSI---LFSPKLFIYLLSISAVLFIFFHIQSLHRHVLPPPQNPSSS 60

Query: 61  SIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDER 120
           S  AAKLRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  DGD R
Sbjct: 61  SSAAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDAR 120

Query: 121 MLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVA 180
           +LCLKG DTHDGSWNYY +AWPE LPENA V KG+SFVSYNHY+Y NIWHGLSALMPFVA
Sbjct: 121 LLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVA 180

Query: 181 WHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEK 240
           WHQIQGKCE+PERWILYHWGELRL+MG WV T+ME TFG P + EAFE I EGQPVCFEK
Sbjct: 181 WHQIQGKCEIPERWILYHWGELRLKMGTWVRTIMEVTFGGPPKIEAFEGIGEGQPVCFEK 240

Query: 241 AVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNET 300
           AVVMRHNEGGMSRQRRMETYD MRCKARLFCN TSPEP    VGMT+ MRTG RSF+NET
Sbjct: 241 AVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPEPSVTTVGMTLFMRTGARSFKNET 300

Query: 301 TVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSS 360
            V+EIFG ECAKVAGCRL VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+RNSS
Sbjct: 301 AVMEIFGAECAKVAGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSS 360

Query: 361 VMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAG 420
           VMEFFPKGWL+LAGIGQ+VY W+ASWSGMRHQGAWRDP+  L CPY+  DRRCMSI+K G
Sbjct: 361 VMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPHG-LTCPYNEDDRRCMSIFKGG 420

Query: 421 TIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 457
           TIGYNRT+FSEWAK+VL+EVKMRKM+EA + TTN +HECSC
Sbjct: 421 TIGYNRTYFSEWAKNVLDEVKMRKMDEAAQATTNHVHECSC 457

BLAST of CsaV3_4G009620 vs. NCBI nr
Match: XP_022942991.1 (uncharacterized protein LOC111447859 [Cucurbita moschata])

HSP 1 Score: 766.5 bits (1978), Expect = 1.3e-217
Identity = 362/462 (78.35%), Postives = 394/462 (85.28%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPP------ 60
           MVK    SK   RTT     L SPKLF+YLLSISA+LFI FHI SLH HVPP P      
Sbjct: 1   MVKPSHTSKPHQRTTSI---LFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSS 60

Query: 61  SSIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDE 120
           SS  AAKLRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  DGD 
Sbjct: 61  SSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDA 120

Query: 121 RMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFV 180
           R+LCLKG DTHDGSWNYY +AWPE LPENA V KG+SFVSYNHY+Y NIWHGLSALMPFV
Sbjct: 121 RLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFV 180

Query: 181 AWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFE 240
           AWHQIQGKCE+PERWILYHWGELRL+MG WVST+ME TFG P + EAF+ ISEGQPVCFE
Sbjct: 181 AWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFE 240

Query: 241 KAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNE 300
           KAVVMRHNEGGMSRQRRMETYD MRCKARLFCN TSP+P  A VGMT+ MRTG RSF+NE
Sbjct: 241 KAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNE 300

Query: 301 TTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNS 360
           T VVEIFG EC KV GCRL VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+RNS
Sbjct: 301 TAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNS 360

Query: 361 SVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKA 420
           SVMEFFPKGWL+LAGIGQ+VY W+ASWSGMRHQGAWRDPN  L CPY+  DRRCMSI+K 
Sbjct: 361 SVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNG-LTCPYNEDDRRCMSIFKG 420

Query: 421 GTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 457
           GTIGYNRT+FSEWAK+VLNEVK RKM+EA + T N +H+CSC
Sbjct: 421 GTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSC 458

BLAST of CsaV3_4G009620 vs. ExPASy TrEMBL
Match: A0A0A0KXZ9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G112630 PE=4 SV=1)

HSP 1 Score: 958.0 bits (2475), Expect = 1.4e-275
Identity = 457/457 (100.00%), Postives = 457/457 (100.00%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60
           MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA
Sbjct: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60

Query: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120
           KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK
Sbjct: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120

Query: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180
           GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ
Sbjct: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180

Query: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240
           GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR
Sbjct: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240

Query: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300
           HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI
Sbjct: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300

Query: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360
           FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF
Sbjct: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360

Query: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420
           PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN
Sbjct: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420

Query: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 458
           RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI
Sbjct: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 457

BLAST of CsaV3_4G009620 vs. ExPASy TrEMBL
Match: A0A5D3CB36 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold544G00050 PE=4 SV=1)

HSP 1 Score: 890.2 bits (2299), Expect = 3.7e-255
Identity = 423/450 (94.00%), Postives = 431/450 (95.78%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60
           +VK LQQSKSQSR TKTTNNLV PKLFLYLLSISALL ILFHIHSLHHHV PPP S + A
Sbjct: 2   VVKGLQQSKSQSRATKTTNNLVCPKLFLYLLSISALLSILFHIHSLHHHVLPPPPSSIVA 61

Query: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120
           KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK
Sbjct: 62  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 121

Query: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180
           GRDTHDGSWNYYGLAWPEGLPENA V KGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ
Sbjct: 122 GRDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 181

Query: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240
           GKCEVPERWILYHWGELRLRMGKWV+TLMEATFGAP++ EAFE ISEGQPVCFEKAVVMR
Sbjct: 182 GKCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISEGQPVCFEKAVVMR 241

Query: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300
           HNEGGMSRQRRMETYDFMRCKARL CNLTSPEPLS AVGMTMLMRTGPRSFRNETTV EI
Sbjct: 242 HNEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTGPRSFRNETTVAEI 301

Query: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360
           FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF
Sbjct: 302 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 361

Query: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420
           PKGWLELAGIGQYVYHWLASWSGM+HQGAWRDPNSTLPCPYSP DRRCMS YK GTIGYN
Sbjct: 362 PKGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRRCMSFYKGGTIGYN 421

Query: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQ 451
           RT+FSEWAKSVLNEVKMRK+EEATK TTNQ
Sbjct: 422 RTYFSEWAKSVLNEVKMRKIEEATKFTTNQ 451

BLAST of CsaV3_4G009620 vs. ExPASy TrEMBL
Match: A0A1S3CIF4 (uncharacterized protein LOC103501161 OS=Cucumis melo OX=3656 GN=LOC103501161 PE=4 SV=1)

HSP 1 Score: 815.1 bits (2104), Expect = 1.5e-232
Identity = 378/396 (95.45%), Postives = 385/396 (97.22%), Query Frame = 0

Query: 62  LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG 121
           LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG
Sbjct: 1   LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG 60

Query: 122 RDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG 181
           RDTHDGSWNYYGLAWPEGLPENA V KGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG
Sbjct: 61  RDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG 120

Query: 182 KCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMRH 241
           KCEVPERWILYHWGELRLRMGKWV+TLMEATFGAP++ EAFE ISEGQPVCFEKAVVMRH
Sbjct: 121 KCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISEGQPVCFEKAVVMRH 180

Query: 242 NEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEIF 301
           NEGGMSRQRRMETYDFMRCKARL CNLTSPEPLS AVGMTMLMRTGPRSFRNETTV EIF
Sbjct: 181 NEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTGPRSFRNETTVAEIF 240

Query: 302 GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP 361
           GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP
Sbjct: 241 GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP 300

Query: 362 KGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYNR 421
           KGWLELAGIGQYVYHWLASWSGM+HQGAWRDPNSTLPCPYSP DRRCMS YK GTIGYNR
Sbjct: 301 KGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRRCMSFYKGGTIGYNR 360

Query: 422 THFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 458
           T+FSEWAKSVLNEVKMRK+EEATK TTNQ+HECSCI
Sbjct: 361 TYFSEWAKSVLNEVKMRKIEEATKFTTNQVHECSCI 396

BLAST of CsaV3_4G009620 vs. ExPASy TrEMBL
Match: A0A6J1FQH3 (uncharacterized protein LOC111447859 OS=Cucurbita moschata OX=3662 GN=LOC111447859 PE=4 SV=1)

HSP 1 Score: 766.5 bits (1978), Expect = 6.1e-218
Identity = 362/462 (78.35%), Postives = 394/462 (85.28%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPP------ 60
           MVK    SK   RTT     L SPKLF+YLLSISA+LFI FHI SLH HVPP P      
Sbjct: 1   MVKPSHTSKPHQRTTSI---LFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSS 60

Query: 61  SSIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDE 120
           SS  AAKLRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  DGD 
Sbjct: 61  SSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDA 120

Query: 121 RMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFV 180
           R+LCLKG DTHDGSWNYY +AWPE LPENA V KG+SFVSYNHY+Y NIWHGLSALMPFV
Sbjct: 121 RLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFV 180

Query: 181 AWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFE 240
           AWHQIQGKCE+PERWILYHWGELRL+MG WVST+ME TFG P + EAF+ ISEGQPVCFE
Sbjct: 181 AWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFE 240

Query: 241 KAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNE 300
           KAVVMRHNEGGMSRQRRMETYD MRCKARLFCN TSP+P  A VGMT+ MRTG RSF+NE
Sbjct: 241 KAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNE 300

Query: 301 TTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNS 360
           T VVEIFG EC KV GCRL VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+RNS
Sbjct: 301 TAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNS 360

Query: 361 SVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKA 420
           SVMEFFPKGWL+LAGIGQ+VY W+ASWSGMRHQGAWRDPN  L CPY+  DRRCMSI+K 
Sbjct: 361 SVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNG-LTCPYNEDDRRCMSIFKG 420

Query: 421 GTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 457
           GTIGYNRT+FSEWAK+VLNEVK RKM+EA + T N +H+CSC
Sbjct: 421 GTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSC 458

BLAST of CsaV3_4G009620 vs. ExPASy TrEMBL
Match: A0A6J1J255 (uncharacterized protein LOC111482727 OS=Cucurbita maxima OX=3661 GN=LOC111482727 PE=4 SV=1)

HSP 1 Score: 760.0 bits (1961), Expect = 5.7e-216
Identity = 359/462 (77.71%), Postives = 390/462 (84.42%), Query Frame = 0

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPP------ 60
           MVK    SK   RTT     L SPKLF+YLLSISA+LFI FHI SLH HVPPPP      
Sbjct: 1   MVKPSHTSKPHQRTTSI---LFSPKLFIYLLSISAVLFIFFHIQSLHRHVPPPPQNNPSS 60

Query: 61  SSIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDE 120
           SS   AKLRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  DGD 
Sbjct: 61  SSSAVAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDA 120

Query: 121 RMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFV 180
           R+LCLKG DTHDGSWNYY +AWPE LPENA V KG+SFVSYNHY+Y NIWHGLSALMPFV
Sbjct: 121 RLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFV 180

Query: 181 AWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFE 240
           AWHQIQGKCE+PERWILYHWGELRL+MG WV T+ME TFG P + EAF+ I EGQPVCFE
Sbjct: 181 AWHQIQGKCEIPERWILYHWGELRLKMGTWVRTIMEVTFGGPPKIEAFDGIGEGQPVCFE 240

Query: 241 KAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNE 300
           KAVVMRHNEGGMSRQRRMETYD MRCKARLFCN TS EP  A VGMT+ MRTG RSF+NE
Sbjct: 241 KAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSSEPSVATVGMTLFMRTGARSFKNE 300

Query: 301 TTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNS 360
           T VVEIFG EC KV GC+L VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+RNS
Sbjct: 301 TAVVEIFGAECNKVTGCQLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNS 360

Query: 361 SVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKA 420
           SVMEFFPKGWL+LAGIGQ+VY W+ASWSGMRHQGAWRDP+  L CPY+  DRRCMSI+K 
Sbjct: 361 SVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPHG-LTCPYNEDDRRCMSIFKG 420

Query: 421 GTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 457
           GTIGYNRT+FSEWAK+VLNEVK+RKM EA   T N +HECSC
Sbjct: 421 GTIGYNRTYFSEWAKNVLNEVKIRKMNEAAHATANHVHECSC 458

BLAST of CsaV3_4G009620 vs. TAIR 10
Match: AT4G33600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4 anthesis, C globular stage, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33590.1); Has 131 Blast hits to 131 proteins in 40 species: Archae - 0; Bacteria - 9; Metazoa - 12; Fungi - 24; Plants - 58; Viruses - 0; Other Eukaryotes - 28 (source: NCBI BLink). )

HSP 1 Score: 548.5 bits (1412), Expect = 5.1e-156
Identity = 269/475 (56.63%), Postives = 335/475 (70.53%), Query Frame = 0

Query: 9   KSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPP---PPS---------- 68
           +S SR   T   L SPK  L +L +   +F+L  I S H    P   PPS          
Sbjct: 4   RSVSRNLVTC--LASPKFSLNVLCLVVTVFVLLQIWSFHITQQPILLPPSLFTYLKEQQQ 63

Query: 69  -----------SIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQ 128
                      + +  KLR SVTFLPLKDLR+SNK L GHTWFMSSLYD Q +GEVQYQ+
Sbjct: 64  EPEQIKSENETAYLVEKLRESVTFLPLKDLRFSNKPLEGHTWFMSSLYDNQTKGEVQYQE 123

Query: 129 FPSPVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIW 188
           FPS    G  R+LCLKG D HDGSWNYY LAWP+ LP NA +++G++FVSYNHYDY N+W
Sbjct: 124 FPSESSKG--RLLCLKGVDEHDGSWNYYALAWPQALPVNASLQEGLTFVSYNHYDYGNMW 183

Query: 189 HGLSALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFED 248
           HGLSA++PFVAW  ++ +CE P+RW+LYHWGELR +MG W++ ++ AT+G   +F  F D
Sbjct: 184 HGLSAMVPFVAW-SLRHQCENPQRWVLYHWGELRFKMGNWLNEIITATYGQNTEFLRFRD 243

Query: 249 ISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLM 308
             + +PVCFEKAVVMRHNEGGMSR+RRME +D +RCKAR +CN++  E   + +GMT+LM
Sbjct: 244 --KNRPVCFEKAVVMRHNEGGMSRERRMEVFDLIRCKARHYCNISLSETSKSRIGMTLLM 303

Query: 309 RTGPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQL 368
           RTGPRSF+NE+ V++IF +EC  V GC L V+YSNNLTFCEQV LM  TD+L+SPHGAQL
Sbjct: 304 RTGPRSFKNESAVIDIFKRECKNVEGCELKVSYSNNLTFCEQVELMRMTDVLVSPHGAQL 363

Query: 369 TNMILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPG 428
           TN++LM+RNSSVMEF PKGW +LAG+GQ VY W   WSGMRH+G+W DP+  + C +   
Sbjct: 364 TNLVLMDRNSSVMEFLPKGWRKLAGVGQLVYQWGTRWSGMRHEGSWHDPDGEI-CQFPDT 423

Query: 429 DRRCM-SIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEEAT--KVTTNQIHECSC 457
           DRRCM S+YK G IGYN T+F EWAKSVL + K RKM      K +   +  C C
Sbjct: 424 DRRCMSSVYKNGRIGYNETYFGEWAKSVLGKFKERKMANVVGRKHSYGSLDGCWC 470

BLAST of CsaV3_4G009620 vs. TAIR 10
Match: AT4G33590.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33600.1); Has 126 Blast hits to 126 proteins in 35 species: Archae - 0; Bacteria - 12; Metazoa - 0; Fungi - 21; Plants - 62; Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink). )

HSP 1 Score: 530.8 bits (1366), Expect = 1.1e-150
Identity = 243/411 (59.12%), Postives = 312/411 (75.91%), Query Frame = 0

Query: 32  SISALLFILFHIHSLHHHVPPPPSSIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSL 91
           S+S    +L ++   H  V    ++ +  KLR SVTFLPLKD R+SNK L GHTWFMSSL
Sbjct: 46  SLSLPPALLTYLKHNHEEVSENKTASLVEKLRESVTFLPLKDYRFSNKPLEGHTWFMSSL 105

Query: 92  YDIQEEGEVQYQQFPSPVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVS 151
           YD Q +GE QYQ+FPS    G  R+LCLKG D HDGSWN Y LAWPE LP NA ++ G++
Sbjct: 106 YDNQTKGEAQYQEFPSDSSKG--RLLCLKGVDEHDGSWNSYALAWPEALPTNAILQDGLT 165

Query: 152 FVSYNHYDYQNIWHGLSALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEA 211
           FVSYN YDY N+WHGL+A++PF+AW  ++ +CE P++W+LYHWGELR  MG W+S ++ A
Sbjct: 166 FVSYNQYDYGNLWHGLTAVVPFIAW-SLRNQCEKPQKWVLYHWGELRFGMGHWLSEIVTA 225

Query: 212 TFGAPLQFEAFEDISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSP 271
           T+G    F  F D  + +PVCFEKAVVMRHNEGGMSR+RRME +D +RCKAR +CN++S 
Sbjct: 226 TYGQEPDFLRFVD--DDKPVCFEKAVVMRHNEGGMSRERRMEAFDLIRCKARNYCNISSS 285

Query: 272 EPLSAAVGMTMLMRTGPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMG 331
                 +GMT+L+RTG RSFRNE+ V+++F KEC +V GC ++V+YSNNL+FCEQV LM 
Sbjct: 286 VASKPRIGMTLLLRTGARSFRNESMVIDVFKKECKRVDGCEISVSYSNNLSFCEQVELMK 345

Query: 332 KTDILISPHGAQLTNMILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWR 391
           KTD+L+SPHGAQLTN+ LM++NSSVMEFFPKGWL+LAG+GQ V+ W A+WSGMRH+G+W 
Sbjct: 346 KTDVLVSPHGAQLTNLFLMDKNSSVMEFFPKGWLKLAGVGQLVFQWGANWSGMRHEGSWH 405

Query: 392 DPNSTLPCPYSPGDRRCMSIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEE 443
           DP   + C +   DRRCMSIYK   IGYN T+F EWA+ VL +  +R+M+E
Sbjct: 406 DPVGEI-CQFPDTDRRCMSIYKNAMIGYNETYFGEWARRVLGKFSIREMKE 450

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011653390.13.0e-275100.00uncharacterized protein LOC101219216 [Cucumis sativus] >KGN53729.1 hypothetical ... [more]
TYK07539.17.6e-25594.00uncharacterized protein E5676_scaffold544G00050 [Cucumis melo var. makuwa][more]
XP_008462883.23.1e-23295.45PREDICTED: uncharacterized protein LOC103501161, partial [Cucumis melo][more]
XP_023541716.14.4e-21878.74uncharacterized protein LOC111801789 [Cucurbita pepo subsp. pepo][more]
XP_022942991.11.3e-21778.35uncharacterized protein LOC111447859 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KXZ91.4e-275100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G112630 PE=4 SV=1[more]
A0A5D3CB363.7e-25594.00Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3CIF41.5e-23295.45uncharacterized protein LOC103501161 OS=Cucumis melo OX=3656 GN=LOC103501161 PE=... [more]
A0A6J1FQH36.1e-21878.35uncharacterized protein LOC111447859 OS=Cucurbita moschata OX=3662 GN=LOC1114478... [more]
A0A6J1J2555.7e-21677.71uncharacterized protein LOC111482727 OS=Cucurbita maxima OX=3661 GN=LOC111482727... [more]
Match NameE-valueIdentityDescription
AT4G33600.15.1e-15656.63unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G33590.11.1e-15059.12unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007657Glycosyltransferase 61PFAMPF04577DUF563coord: 225..386
e-value: 1.8E-18
score: 67.4
IPR007657Glycosyltransferase 61PANTHERPTHR20961GLYCOSYLTRANSFERASEcoord: 20..447
NoneNo IPR availablePANTHERPTHR20961:SF94TRANSMEMBRANE PROTEINcoord: 20..447

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_4G009620.1CsaV3_4G009620.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016757 glycosyltransferase activity