HG10017575 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10017575
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionO-fucosyltransferase family protein
LocationChr03: 15991636 .. 15995441 (-)
RNA-Seq ExpressionHG10017575
SyntenyHG10017575
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCATTTCCCAGAACCCACAAGCCAAAACCCAAACCCAGATCCCCACTCATCTTCTTCTTCGTTGCCCTCGCCGCCATTGCCTTTCTTTTCCTCTTTTCCTCACTGATTTCTACCAATGGGGCTTCTTCTTTTCCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGAATCTGACCCAGAAACAGAGACGTAATCGGCATGTTTTTAGTGTAAATGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAAGCCATGTTCCTCCAAAGGTAATTTTCTTGAAGTTCTTCTGAATTCTGGGGTTTAATCCTTACTTCAAGGTGTTTTGTCCTTGTTTATTTTGTTGGATTTACATGATAAGGTTTCTGTATGAAAGGGATTGGAATTGGGTGTGCAGTAACTCATCAGCATTGTTGTAGTTCTATCACCAAATGGAGAATAGTCATATTTTGAAACTTCTGTTTTCCCCTTGCTTGATTTTAGTTATGTGTTATTGCTATGAGGTGCCATTTAGTTTTGTCTCGCTTTCTCTTTGAAATGGTAATGGATTGGTCCAGTTCTCTTAAGTTCAGAATAATTTATTGGTAGATTGCCTTTGCATAACCTACCTAGGCAATTAACTACTCTACTCTGTCTTTTTTGGTGCAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGCTTCTTCATCAGTCCACCAATGCAAGCTCAGAGGAAAGGTGTTTCTTCCCTCTTTTCTTCTCCAATAATGTTTTTCTTGTATGCTCTCTCTCTTGCAACTGTGTTGATCCATGTTGTTTCTAATCATATAAATAGTGTACCTATTGAATGTTGCATTTTCTAACAGGCATATACTTGATGGTAGTTGAAACTGTATTTAAATTGGTTCTCCTTGGGCTTCAAACTAGGATGCAAACTTCATTGCAAAAAAAGTTCACTGTTCTTTCATATATTTGTGATCTTTCCTAGAAAAAACGTAGAACCTGAATTTGTTCTTTGCATTGCAGTTGGGAAGCAAACTCTTGTGCCATGGATTCTTTGTACGATATGGACCTTATATCTGATACCGTGCCAGTGATTTTAGACAACTCAAAATTGTGGTATCAGGTGCTGTCAACTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCTCTTTCATGGTAAATCAAACTTGGGGAAATAAGATTATTTTTACCACTTTTGTGTGTGTCTAGATAGATATAAGCATCATTGTATAGCAAATCAGTAAAAGCTCAATTAGGCTTCTCTTTTCAATGTTTTGTTTGTATGCTTTTCCTTGCAACTTAGTATGACAGGTGTTTTGATTTAAGATCCCAAGTTATGAATTGTTTGGGTGTTGATGTCATGATAACTCTTTAGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGAGATGCAGCTGATAAGGTATATTTTGAAATCACATTAGGTTCTTTATTTTGCCATCCATTGCATTGCATACATAGATTTTTGTACTCTTCTGTAAATCATGACAGCCTAGCAAAGTTGCAAGCTATTATCTCATTGCTTCTGAAACCTAGTTTAAGTGTATTTCCATATTTCGAGGTTGGTGTTTTATAGTTTTAGTAGATGGTCAAGTTTAGTGCTACAAGATTAGGTATTCGGTTTATTCAGACTTCGCAGATGCTTCTTAGCATAAACTTGCCAGTTGCCACCTTAGAAAATATAATTATTGAACTCTGGGTCGAATCTAAATACTCTACTTTAGATACTCTTTAGATACTCTTTGACACGTATAGCAAACCCTAATTAAAGACTCATTATTATTATCTATTTATTTATTTATAATTTGCGACAACTTGGATATCAAGGGGTGCCACTCTTTGTTTAGTGCTCATGGTCCACCCTTACGGTTTCCTAAGAAAATTTTGCAACCTTTCTATAGAAGACAACAAGGGTTTTAAATTGTTGAAACACCTGATAGTCAATTTGTTATGGTTTTATAAATATTTACACTATGTGATTCTTTCTTTGTCTTCTACTTTTTAAAAGTATTTTCAAACTCTAAGCCAAATTTTAAAAACCAGAAAAAGGTCTTCTTTACTTTTTTATTTTATTTTTATTTTATTTTTTATTTTTTTGGAATTTGTCGGAGAATTCATATCCATTTGCAAAAATAGTGAAAATATACTATTAAAATTGAGGGCAGTAGACATAATTTTTTGAAACAAAAAACTAAAAAAGGAATGGTTATTAAACAGGGGCACTCCTAGGATGACCTTAGTTTAAGACCTTCCACTCCTAGGAATTCAATTTATTCCACCATCCAAAGTACTAGGAAAGTCCTTCTTTTTCTGTCTTTGTCGAACTTTGAAGTTCTGTCGTCAGGCCATGGTTGCATAATAGTGAAAGACAAAGAATAAGAAAGAAGGGGCTGTATACAATACACAACCATTTGCAGAAATAAGCTAGTTGTAACCATTATATTTGGATAGGTTTGTAGAAAATTTTGAAGTACCTGAAACTGAATGGAAGGCACACCCTTGATTAGGGATCCTCACATCCTTCATACAGCCAGGTTGAGACAGGTGTTCGTTTAACCCCTTAGCAGCATTAGATAAATTCGAAAAGAGAAATTAGCTATGGCGGGTGGGTGGTAAGGCTTGCATTGGCTAGAGACGAAGTTTGCTTATACTGTGCATAAAAACATTGAGAAAAGATTTACAAGTTGTCTCTTTTTCTTTTTTGGATAGTTCTATACGTAACCGTGTCTTCCGAAATCTTATTCCTTGAATTTTGTGTCATTTGATATTTCTAAATGGTGTTGATAATGCACTTCATCAATGAAAAATCTCATTTTATTGATAATCCTGAAAAGAACATCTGATGAACAGATTAAAGGACTACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCGCATCTCGACAGGGATACACGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCGCCCCTCTCTGCTCGGTGAGCCCATCTCACTCACTTTTTTAGTTCACTAGCGTTAGAGATGAAGTATTGGTTCTTTACTCAAAGGATCTTCTTGTGTTCTGATAGCTATAATATAAATCATCAAACCGTTTTTCATTTAGTTACATATCCCCTTGAGGATTCCACTTTTAACCAAGAATAAAGAACGAAGTATTTTGCGCTTTGTGCTTCAATGTTAGTCCACACCACATTGATTCATGATACCCTGATTTTGCAGGTACAAGTTGGCTTATTCCTCGAACTATAGCGATATTCTGGATCCTGTGGTTAAGAACAATTATCAGTTATTCATGATCGAAAGGCTCATTATGGCGGGTGCTAAGACATTCATCAGAACATTCAAAGAAGACGATACGGATCTAAGCCTCACCGACGACCCAAAGAAGAACATGAAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGCTGA

mRNA sequence

ATGGCATTTCCCAGAACCCACAAGCCAAAACCCAAACCCAGATCCCCACTCATCTTCTTCTTCGTTGCCCTCGCCGCCATTGCCTTTCTTTTCCTCTTTTCCTCACTGATTTCTACCAATGGGGCTTCTTCTTTTCCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGAATCTGACCCAGAAACAGAGACGTAATCGGCATGTTTTTAGTGTAAATGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAAGCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGCTTCTTCATCAGTCCACCAATGCAAGCTCAGAGGAAAGTTGGGAAGCAAACTCTTGTGCCATGGATTCTTTGTACGATATGGACCTTATATCTGATACCGTGCCAGTGATTTTAGACAACTCAAAATTGTGGTATCAGGTGCTGTCAACTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGAGATGCAGCTGATAAGATTAAAGGACTACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCGCATCTCGACAGGGATACACGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCGCCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCGATATTCTGGATCCTGTGGTTAAGAACAATTATCAGTTATTCATGATCGAAAGGCTCATTATGGCGGGTGCTAAGACATTCATCAGAACATTCAAAGAAGACGATACGGATCTAAGCCTCACCGACGACCCAAAGAAGAACATGAAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGCTGA

Coding sequence (CDS)

ATGGCATTTCCCAGAACCCACAAGCCAAAACCCAAACCCAGATCCCCACTCATCTTCTTCTTCGTTGCCCTCGCCGCCATTGCCTTTCTTTTCCTCTTTTCCTCACTGATTTCTACCAATGGGGCTTCTTCTTTTCCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGAATCTGACCCAGAAACAGAGACGTAATCGGCATGTTTTTAGTGTAAATGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAAGCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGCTTCTTCATCAGTCCACCAATGCAAGCTCAGAGGAAAGTTGGGAAGCAAACTCTTGTGCCATGGATTCTTTGTACGATATGGACCTTATATCTGATACCGTGCCAGTGATTTTAGACAACTCAAAATTGTGGTATCAGGTGCTGTCAACTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGAGACAGCAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGAGATGCAGCTGATAAGATTAAAGGACTACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCGCATCTCGACAGGGATACACGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCGCCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCGATATTCTGGATCCTGTGGTTAAGAACAATTATCAGTTATTCATGATCGAAAGGCTCATTATGGCGGGTGCTAAGACATTCATCAGAACATTCAAAGAAGACGATACGGATCTAAGCCTCACCGACGACCCAAAGAAGAACATGAAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGCTGA

Protein sequence

MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS
Homology
BLAST of HG10017575 vs. NCBI nr
Match: XP_038881641.1 (uncharacterized protein LOC120073097 [Benincasa hispida])

HSP 1 Score: 795.8 bits (2054), Expect = 1.7e-226
Identity = 393/408 (96.32%), Postives = 399/408 (97.79%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNL 60
           MAFPRT KPKPKPRSPLIFFFVALAAIAFLFLFSSL+STNGASSF SSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKPRSPLIFFFVALAAIAFLFLFSSLVSTNGASSFSSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FVM
Sbjct: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRIFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLSTGMKLGARAVAHVEQVSR+ELRD+SRYS+LLLINRTASPLSWFMECKDRNNRSAILL
Sbjct: 181 VLSTGMKLGARAVAHVEQVSRLELRDNSRYSDLLLINRTASPLSWFMECKDRNNRSAILL 240

Query: 241 PYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAEN+RDAA+KIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENMRDAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNER PGFFSPLS RYKLAYS NYS ILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERTPGFFSPLSDRYKLAYSLNYSSILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKN KIWQIPVYTADEER
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQIPVYTADEER 408

BLAST of HG10017575 vs. NCBI nr
Match: XP_008455718.1 (PREDICTED: uncharacterized protein LOC103495824 [Cucumis melo] >KAA0025926.1 uncharacterized protein E6C27_scaffold34G002270 [Cucumis melo var. makuwa] >TYK27788.1 uncharacterized protein E5676_scaffold749G00060 [Cucumis melo var. makuwa])

HSP 1 Score: 793.5 bits (2048), Expect = 8.6e-226
Identity = 390/408 (95.59%), Postives = 398/408 (97.55%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNL 60
           MAFPRT KPKPK RSPLIFFFV+LAAIAFLFLFSSLISTNG+SSFPSSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLST MKLGARAVAHVEQVSR+ELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA++L
Sbjct: 181 VLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVML 240

Query: 241 PYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAA+KIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKN K+WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of HG10017575 vs. NCBI nr
Match: XP_004144331.1 (uncharacterized protein LOC101219097 [Cucumis sativus] >KGN54704.1 hypothetical protein Csa_012613 [Cucumis sativus])

HSP 1 Score: 781.9 bits (2018), Expect = 2.6e-222
Identity = 385/408 (94.36%), Postives = 396/408 (97.06%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNL 60
           MAFPRT KPKPKPRSPLIFFFV+L+AIAFLFLFSSLISTNG+SSFPSSNSIQKIFR KNL
Sbjct: 1   MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLSTGMKLGARAV HVE+VSR+ELRDSSRYSNLLLINRTASPLSWFMECKDRNN SA++L
Sbjct: 181 VLSTGMKLGARAVGHVEKVSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVML 240

Query: 241 PYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAA+KIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVV+NNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER 409
           MIERLIMAGAKT IRTFKEDDTDLSLTDDPKKN K WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTDEERR 407

BLAST of HG10017575 vs. NCBI nr
Match: XP_022967807.1 (uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima])

HSP 1 Score: 760.0 bits (1961), Expect = 1.1e-215
Identity = 374/410 (91.22%), Postives = 387/410 (94.39%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG-ASSFPSSNSIQKIFRFKN 60
           MA  +T K KPKPRSP +FFFVALA IAFLFLFSSLISTNG +SSFPSSNSI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           L QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA++KIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQL 360
           RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS 410
           FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKN K+WQ P+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of HG10017575 vs. NCBI nr
Match: XP_023545162.1 (uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 759.2 bits (1959), Expect = 1.8e-215
Identity = 373/411 (90.75%), Postives = 387/411 (94.16%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG--ASSFPSSNSIQKIFRFK 60
           MA  +T K KPKPRSP +FFFVALA IAFLFLFSSLISTNG  +SSFPSSNSI++IFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60

Query: 61  NLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           NL QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR F
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240
           YQV STGMKLG+R VAHV+QVSR+ELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDA++KIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQ 360
           TRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS 410
           LFMIERL+MAGAKTFIRTFKEDDTDLSLTDDPKKN K+WQ P+YT DEE S
Sbjct: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 411

BLAST of HG10017575 vs. ExPASy TrEMBL
Match: A0A5D3DWC1 (O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold749G00060 PE=3 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 4.2e-226
Identity = 390/408 (95.59%), Postives = 398/408 (97.55%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNL 60
           MAFPRT KPKPK RSPLIFFFV+LAAIAFLFLFSSLISTNG+SSFPSSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLST MKLGARAVAHVEQVSR+ELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA++L
Sbjct: 181 VLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVML 240

Query: 241 PYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAA+KIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKN K+WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of HG10017575 vs. ExPASy TrEMBL
Match: A0A1S3C1H9 (O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 4.2e-226
Identity = 390/408 (95.59%), Postives = 398/408 (97.55%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNL 60
           MAFPRT KPKPK RSPLIFFFV+LAAIAFLFLFSSLISTNG+SSFPSSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLST MKLGARAVAHVEQVSR+ELRDSS YSNLLLINRTASPLSWFMECKDRNNRSA++L
Sbjct: 181 VLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVML 240

Query: 241 PYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAA+KIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKN K+WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of HG10017575 vs. ExPASy TrEMBL
Match: A0A0A0L0X3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G430860 PE=4 SV=1)

HSP 1 Score: 781.9 bits (2018), Expect = 1.3e-222
Identity = 385/408 (94.36%), Postives = 396/408 (97.06%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNL 60
           MAFPRT KPKPKPRSPLIFFFV+L+AIAFLFLFSSLISTNG+SSFPSSNSIQKIFR KNL
Sbjct: 1   MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLSTGMKLGARAV HVE+VSR+ELRDSSRYSNLLLINRTASPLSWFMECKDRNN SA++L
Sbjct: 181 VLSTGMKLGARAVGHVEKVSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVML 240

Query: 241 PYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAA+KIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVV+NNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER 409
           MIERLIMAGAKT IRTFKEDDTDLSLTDDPKKN K WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTDEERR 407

BLAST of HG10017575 vs. ExPASy TrEMBL
Match: A0A6J1HRU4 (O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 PE=3 SV=1)

HSP 1 Score: 760.0 bits (1961), Expect = 5.1e-216
Identity = 374/410 (91.22%), Postives = 387/410 (94.39%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNG-ASSFPSSNSIQKIFRFKN 60
           MA  +T K KPKPRSP +FFFVALA IAFLFLFSSLISTNG +SSFPSSNSI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           L QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD SRYSNLLLINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA++KIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQL 360
           RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS 410
           FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKN K+WQ P+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of HG10017575 vs. ExPASy TrEMBL
Match: A0A6J1DKB9 (O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC111021332 PE=3 SV=1)

HSP 1 Score: 758.8 bits (1958), Expect = 1.1e-215
Identity = 375/411 (91.24%), Postives = 389/411 (94.65%), Query Frame = 0

Query: 1   MAFPRTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASS--FPSSNSIQKIFRFK 60
           MAFPR  K KPKPRSPL FFFVALAAIAFLFLFSSLISTNGASS  F SSNSIQKIFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           N+ +K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKG+LHQS NASSEESWEA SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240
           YQVLSTGMKLGARAVAHVE+VSR EL+D++RYSNLLLINRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDAA+KIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQ 360
           TRPEFMLKR+AKWV  GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDP+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEERS 410
           LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKN KIWQ PVYT DEE+S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of HG10017575 vs. TAIR 10
Match: AT2G41150.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 127 Blast hits to 127 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 536.6 bits (1381), Expect = 1.8e-152
Identity = 265/402 (65.92%), Postives = 322/402 (80.10%), Query Frame = 0

Query: 5   RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQ 64
           + HK K  P S  +   + + A+AFL LF+S+IST G  + P   ++   F       + 
Sbjct: 6   KPHKLKATPGSQRL-VLLCIVAVAFLLLFTSVISTGGL-ALPYRTTLIGYF------VRS 65

Query: 65  RRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRM 124
            RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMPSRM
Sbjct: 66  TRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSRM 125

Query: 125 CINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLST 184
           CINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST
Sbjct: 126 CINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIMLST 185

Query: 185 GMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKF 244
            MKL  R  AHV   +R EL DSS ++NLLLINRTASPL+WF+ECKDR NRS ++LPY F
Sbjct: 186 SMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPYSF 245

Query: 245 LPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM 304
           L +MAA  LRDAA+KIK  LGDYDAIHVRRGDK+KTRKDRF V+RS  PHLDRDTRPEF+
Sbjct: 246 LQTMAASRLRDAAEKIKAKLGDYDAIHVRRGDKLKTRKDRFRVERSQFPHLDRDTRPEFI 305

Query: 305 LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIER 364
           + RI K +P GRTLFI SNER P FFSPL+ RYK+AYSSN+S+ILDP+++NNYQLFM+ER
Sbjct: 306 IGRIQKQIPPGRTLFIGSNERTPDFFSPLAIRYKVAYSSNFSEILDPIIENNYQLFMVER 365

Query: 365 LIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADE 407
           LIM GAKTF +TF+E +TDL+LTDDPKKN K W+IPVYT DE
Sbjct: 366 LIMMGAKTFFKTFREYETDLTLTDDPKKN-KNWEIPVYTMDE 398

BLAST of HG10017575 vs. TAIR 10
Match: AT3G56750.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41150.2); Has 128 Blast hits to 128 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 536.2 bits (1380), Expect = 2.3e-152
Identity = 265/404 (65.59%), Postives = 323/404 (79.95%), Query Frame = 0

Query: 5   RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQ 64
           +  + KP   S  +  F  +   +FL LFSS+IST G    P   ++   F +       
Sbjct: 6   KAQRTKPTSGSQRLVLF-CIVVFSFLLLFSSVIST-GKLGLPYQQTLIDYFVW------S 65

Query: 65  RRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRM 124
            R +   S+++K+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMPS M
Sbjct: 66  PRGKRQHSLSEKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSGM 125

Query: 125 CINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLST 184
           CINPIHNKKG+L++S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK W+ VLST
Sbjct: 126 CINPIHNKKGILNRSDNKTTEEGWLGSSCAMDSLYDIDLISEKIPVILDDSKTWHIVLST 185

Query: 185 GMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKF 244
            MKLG R +AHV  V+R  L++ S YSNLL+INRTASPL+WF+ECKDR+NRSA++LPY F
Sbjct: 186 SMKLGERGIAHVSGVTRHRLKE-SHYSNLLIINRTASPLAWFVECKDRSNRSAVMLPYSF 245

Query: 245 LPSMAAENLRDAADKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM 304
           LP+MAA  LR+AA+KIK  LGDYDAIHVRRGDK+KTRKDRFGV+R   PHLDRDTRPEF+
Sbjct: 246 LPNMAAAKLRNAAEKIKAQLGDYDAIHVRRGDKLKTRKDRFGVERIQFPHLDRDTRPEFI 305

Query: 305 LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLFMIER 364
           L+RI K +P GRTLFI SNER PGFFSPL+ RYKLAYSSN+S+ILDP+++NNYQLFM+ER
Sbjct: 306 LRRIEKRIPRGRTLFIGSNERKPGFFSPLAVRYKLAYSSNFSEILDPIIENNYQLFMMER 365

Query: 365 LIMAGAKTFIRTFKEDDTDLSLTDDPKKNMKIWQIPVYTADEER 409
           L+M GAKT+ +TFKE +TDL+LTDDPKKN K W+IPVYT DE R
Sbjct: 366 LVMMGAKTYFKTFKEYETDLTLTDDPKKN-KNWEIPVYTMDERR 399

BLAST of HG10017575 vs. TAIR 10
Match: AT2G41150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 57 Blast hits to 57 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 312.0 bits (798), Expect = 7.1e-85
Identity = 157/259 (60.62%), Postives = 197/259 (76.06%), Query Frame = 0

Query: 5   RTHKPKPKPRSPLIFFFVALAAIAFLFLFSSLISTNGASSFPSSNSIQKIFRFKNLTQKQ 64
           + HK K  P S  +   + + A+AFL LF+S+IST G  + P   ++   F       + 
Sbjct: 6   KPHKLKATPGSQRL-VLLCIVAVAFLLLFTSVISTGGL-ALPYRTTLIGYF------VRS 65

Query: 65  RRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRM 124
            RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMPSRM
Sbjct: 66  TRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSRM 125

Query: 125 CINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLST 184
           CINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST
Sbjct: 126 CINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIMLST 185

Query: 185 GMKLGARAVAHVEQVSRVELRDSSRYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKF 244
            MKL  R  AHV   +R EL DSS ++NLLLINRTASPL+WF+ECKDR NRS ++LPY F
Sbjct: 186 SMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPYSF 245

Query: 245 LPSMAAENLRDAADKIKGL 264
           L +MAA  LRDAA+K+K L
Sbjct: 246 LQTMAASRLRDAAEKVKEL 256

BLAST of HG10017575 vs. TAIR 10
Match: AT4G12700.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G04280.1); Has 136 Blast hits to 136 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 92.4 bits (228), Expect = 8.9e-19
Identity = 77/311 (24.76%), Postives = 133/311 (42.77%), Query Frame = 0

Query: 92  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEAN 151
           + C+ + H   S  CAL EA +L RT VM   +C++ ++   G   +  +      +E  
Sbjct: 264 DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSVYTLSGQNEEGKDFRFYFDFE-- 323

Query: 152 SCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDSSRYS 211
                 L +   + D V    D  K WY+    G+KL       V  +  V+++D+    
Sbjct: 324 -----HLKEAASMLDQVQFWADWGK-WYK--KNGLKLHLVEDFRVTPMKLVDVKDT---- 383

Query: 212 NLLLINR--TASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKIKGLLG-DYD 271
             L++ +  T  P +++    +    S +  P+  L    ++ L +    I   L  DYD
Sbjct: 384 --LIMRKFGTVEPDNYWYRVCEGETESVVQRPWNLL--WKSKRLMEIVSAIASRLNWDYD 443

Query: 272 AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPG 331
           AIH+ RGDK +        ++ + P+L++DT P  +L  +   +  GR L+IA+NE    
Sbjct: 444 AIHIERGDKAR--------NKEVWPNLEKDTSPSSILSTLQDKIEQGRNLYIATNEPELS 503

Query: 332 FFSPLSARYKLAYSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT 384
           FF+PL  +YK  +   + D+ D                PV  + Y    ++  +    K 
Sbjct: 504 FFNPLKDKYKPHFLDEFKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGKK 548

BLAST of HG10017575 vs. TAIR 10
Match: AT2G04280.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G12700.1); Has 130 Blast hits to 130 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 91.3 bits (225), Expect = 2.0e-18
Identity = 76/311 (24.44%), Postives = 131/311 (42.12%), Query Frame = 0

Query: 92  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEAN 151
           + C+ + H   S  CAL EA +L RT VM   +C++ I+   G         +EE  +  
Sbjct: 269 DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSIYTSSG--------QNEEGKD-- 328

Query: 152 SCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDSSRYS 211
                  +D + + +   V LD ++ W Q      K   R   H+ +  RV     +   
Sbjct: 329 ---FRFYFDFEHLKEAASV-LDEAQFWAQWGKLRKKRRNRLNLHLVEDFRVTPMKLAAVK 388

Query: 212 NLLLINRTAS--PLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAADKIKGLLG-DYD 271
           + L++ +  S  P +++    + +  S +  P+  L    +  L +    I   L  DYD
Sbjct: 389 DTLIMRKFGSVEPDNYWYRVCEGDAESVVKRPWHLL--WKSRRLMEIVSAIASRLNWDYD 448

Query: 272 AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPG 331
           A+H+ RG+K +        ++ + P+L+ DT P  +L  +   V  GR L+IA+NE    
Sbjct: 449 AVHIERGEKAR--------NKEVWPNLEADTSPSALLSTLQDKVEEGRHLYIATNEGELS 508

Query: 332 FFSPLSARYKLAYSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT 384
           FF+PL  +Y   +  +Y D+ D                PV  + Y    ++  +    K 
Sbjct: 509 FFNPLKDKYATHFLYDYKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGKK 555

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038881641.11.7e-22696.32uncharacterized protein LOC120073097 [Benincasa hispida][more]
XP_008455718.18.6e-22695.59PREDICTED: uncharacterized protein LOC103495824 [Cucumis melo] >KAA0025926.1 unc... [more]
XP_004144331.12.6e-22294.36uncharacterized protein LOC101219097 [Cucumis sativus] >KGN54704.1 hypothetical ... [more]
XP_022967807.11.1e-21591.22uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima][more]
XP_023545162.11.8e-21590.75uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3DWC14.2e-22695.59O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A1S3C1H94.2e-22695.59O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3... [more]
A0A0A0L0X31.3e-22294.36Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G430860 PE=4 SV=1[more]
A0A6J1HRU45.1e-21691.22O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 ... [more]
A0A6J1DKB91.1e-21591.24O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC1110213... [more]
Match NameE-valueIdentityDescription
AT2G41150.21.8e-15265.92unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT3G56750.12.3e-15265.59unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G41150.17.1e-8560.62unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G12700.18.9e-1924.76unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G04280.12.0e-1824.44unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.40.50.11350coord: 240..378
e-value: 2.0E-6
score: 29.6
NoneNo IPR availablePANTHERPTHR31469:SF8PLANT/PROTEINcoord: 3..407
NoneNo IPR availablePANTHERPTHR31469OS07G0633600 PROTEINcoord: 3..407
IPR019378GDP-fucose protein O-fucosyltransferasePFAMPF10250O-FucTcoord: 92..377
e-value: 1.5E-8
score: 34.8

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10017575.1HG10017575.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006004 fucose metabolic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016740 transferase activity