Spg012297 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg012297
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: O-fucosyltransferase family protein
Location: scaffold1: 19434566 .. 19439219 (-)
RNA-Seq Expression: Spg012297
Synteny: Spg012297
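The Location field above packs the scaffold, start, end and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be parsed and the genomic span computed as follows. The names `Location`, `parse_location` and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed form of a 'scaffold: start .. end (strand)' string."""
    scaffold: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "scaffold1: 19434566 .. 19439219 (-)".
_LOCATION_RE = re.compile(r"^([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end, strand = match.groups()
    return Location(scaffold, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("scaffold1: 19434566 .. 19439219 (-)")
print(loc.scaffold, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the span is 19439219 - 19434566 + 1 bases. The "(-)" strand means the gene is encoded on the reverse strand of scaffold1.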
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACTCACGGATATCAAAAAAGAAAAAAAAAATCTTATATCAGTTTGAACTAAGCTCATGTTGGCTAAACTGATATAAATCTTGTATCGGTAACGAAGAGATTTATAATTGTGACATGTCGTAAGAAAAAGTATAATGTTGTTGAAATTAAAAAAAAAAAAACAATAAAAAAAGGGGGAAAATCAGATCACTCTCCTCTCTCTTCGCGTCTCTGTTTTTTATTTTTCTTCCCCTCACCTGATCTCACTCTGTCGTGCAATCCGGCGACACGCAGACCAGCGCCGGCCGCCAGCCGCAGCCGCAGCCGCAGCCGCACGCCCATTCCACTCGGAGCCCTGCTCGCGCCGTCGCTCATAGCCGCAACCCACTGCTCCGCCGCGTGTTCAGCCACCATTTCGACGTCTGTACAGCCGCCGCCGGCTCCGTTCCTCGCCGCCACCGGCGGGAGTTAGTTCCAGCAGCCCGTGATCGCGTTTGGGGTCGGATTAGGTAGATTACGGTAAGATTCTTCGTCCGGTTGAGGTTATTAGACAATTTGCTTAGGAATTAGATTTAACCCATAATGTATAACTCGTTTCAGACTCGTTGGCCACGTGTGGGGTTTGTTTTTGCATCAATTATACTCTGGTAATTTATAGATGATAAAGTTCCATGATCCATCACTTAAACATTTCCTTGTCCGAATGAATGAAATTTGAATCACTGTTGTGCTGAAGGTTTCAATCCCTCCCTCTTCCACTCTGGAATTTCGATGGCGCTTCACAAAACCCAGAAGGCAAAATCCAAACCCAGATCTCCACTCCTCTTCTTCTTCGTTGCCCTCGCCGCCATTGCGCTTATTTTCCTCTTTTCCTCTCTCATTTCTACCAATGGGGCTTCTTCTTCTTTTCCATCCTCAGATTCAATTCAGAAAATCTTCAGATTCAAGAATCTGAACCAGAAACAGAGACGTAATCGTCACGTTTTTAGTGTGAACGATAAGTTCTTGTACTGGGGCAACCGAATCGACTGCCCTGGGAAGCACTGCGAGTCTTGTGAGGGTTTAGGTCACCAGGAGTCCAGCTTGAGGTGCGCTCTTGAGGAAGCCATGTTCCTTCAGAGGTAATTTTCGTGTAATTTTCGTGAAGTTCTGGATTCTGGGGTTTAGTACGTACTTGAAGATGTTTTCTCCTTGTTTAATTTGTTGGATTGACATGATAAGGTTTCTGTATGGAAGGGATTTGGATTTGGATTAGTAGTGACTCATCAGCATTGCTGTGGTTCTAAAACTAAATGGAAAATTCTCATATTCTGAATAGATTTAAAGCAGCTAATTGTTTTTGCATGCAACATTCGCCGCTTGCTTGATTTTTATGCTATAAGGTGCCATTTAGTTTTGTCTCACTTTCTCTTTGTAAGTGGAGCATAATTTATTGGTAGAACACTTTTGCATAAACTACTCAGGCAATTTACTACTCTTTCTTTTTGGTGCAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACATAATAAGAAGGGGATTCTTCATCAGTCGAAGAATGCAAGCTCAGAGGAAAGGTGTTTCTTCCCTCTTTTCTTCTCTAATAATGTTTTTCCGATATGCTTTTTCTCCTGCTACTGTGTTGATCCATGTCGTTTCTTAATCATATACGTAGAATACCTATTAATTTCGCATGAAACATATACTTGATGATAATTGAAACTGTCTTACATTGATTCTCTTCGTGCTTCAAACTAAGATGCAAACTGCCTTGCATTAAAAAATTCACTGTTTTTTTCATACCTTTGTGATCTTTTCTAGAAAAACTCAAGACCTGAATTTGTTTCTCTGCATTGCAGTTGGGAAGCAAACTCTTGTGCGATGGACTCTTTGTACGATGTGGACCTTATATCTGACACTGTACCAGTTATTTTAGACAACTCGAAACTATGGTATCAGGTGCTGTCAACTGGTATGAAATTGGGAGCTAGAGCAGTTGCCCACGTTGAACA
AGTTAGTCGTGTTGAACTCAGGGACAACAGCCGTTACTCCAATCTTTTCATAATAAACCGAACTGCCAGCCCTCTTTCATGGTAAATCAAACTTGGGGAAACAAGATTTTCTTTTCGTCTTTTGTGTGTGTGAGTTTGTTTTAGATAGATCCAAGCATCATCATATAATAAATCCGTAAAAGCTCAATTAGGCTTATCTTTTTAAAACTTAGTATAACAGATATTTTGATTTAAGATCCCAAGTTATGGATTATTTGGGTGTTAAATATCTTGATAACTCTTTAGGTTTATGGAGTGCAAGGACAGAAACAATCGTAGTGCCATATTGCTGCCGTATAAATTTCTTCCTTCAATGGCTGCAGAAAACTTGAGGGATGCAGCTGAGAAGGTACTTTTCAAATCATATTAGGCTCTTCCTTTTGTTATCCATTGCATTGTATGCATAGATATTTGCACTCTTCTAAGATATCAGGACATAGAGCCTCTCAAAGTTGCAAACTGTTACCTCATTACAATTCCGTATTTCTAGGTTGGTGTTTTATAGTTGTAGTAGATGCTCAACTTCATTGTTATAAGATTATTTATTCAGTTCTATCAAAAAGAGTATTTATTCAGACTTCCTGGATGAGAAAATATAATTTTTGAATTCTTGTTCAATTGTAAAGACTTGACTTTAGATACTCTTGGACTCATTAAGTAGTGTCTTTTTTTTTTCCTCTCTTTTTAACGATAACTGGATATCGAGGGGTGCCCTCATGACCCACCCTACTAATTCCTATAAAGCTTTTGTGCATTCCCTGCTGGAGACAACAAGGGTTTTAGATTGTTTAGTTTTCCACTCCTAGGATAACCCAAATTTAAGACTTTCCACCCCTAGGATATTAATTAAGTCCACCATCCAAAGTGCTTGGAAAATCCCATCTTTTTCTGTCTTTGATACTTTGAAGTTTTGTCATCAGGCCATGGTTGTATAAAGTATAATATGGTGGTGTTTAAAGAATAATATAACAATTGGAATATAATGGTGAAAGACAGAATAAGAAAGAAGGGGCTAAATACAACCATTTACAGAAGCAAGCCAGTTGTAATCTAACCATTATATTTGGACAGGTTTGTAGAAAATTTTGAAGTGCCTGAAACTTTATGGTTTTGGGTGCCTTCTTGGAAACTGAATAGAAGGACATTAATGATTAAGGATCTGCATGTGCTTCATAAACCCAGGTTGAGCACACGGGTGTTACTTCGACTTTTTAGCAGCACTAGATAAATTCAAAAGGAGTAATTTAGCTATGGCGGGTAGTTGGTGAGGCTTGCATTGGTTAGAGATAGAGACTAAGTTTGTATGCAGTTCTCTGTTTCTGTTTGGTTTACTGATATGCTATACTCTGTGTAATAACTTTGAGAAAAGATTTACAAGTTTTCTTTTTGGAATAGTCCTTTAATTTATCTTAACGTGTTGGTAATTCACTTCATCAGTCTCTCATTTTATTGATAATCCTGAAAAGAATATCTGATCGACAATAGATTAAGGGGCTACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGAGATACACGGCCCGAGTTTTTGCTAAAGAGAATTGCAAAGTGGGTTCCACCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAACTCCTGGGTTCTTTTCGCCCCTCTCTGCTCGGTGAGCCCTTCTAATTCACCTTCTTAGTTCGGTAGAGTTAGAGCTGAAGTATTGGTTCCTTGCCATTTGGTAATAAAAAGACTTTTAAAGGATCTTCTTGTGTTCCGATAGCTGTGATATAGATCATCAAACCGTCTCTCAGTTAGTGGCATGTCCCATTATGCACAATGATTCCATTTTTAACTTAGAAAGAATGAAGTGTTTTGTCCTTAGATGCTTCGTTATTAGGTGCCTTCATTCATGATA
CGCCAATTTTGCAGGTACAAGTTGGCATATTCCTCAAACTATAGCCATATTCTGGATCCCGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGGTCATCATGGCGGGTGCTAAGACGTTCATTAGAACATTCAAAGAAGATGATACAGATCTCAGCCTCACCGATGACCCGAAGAAGAACACAAAGATATGGCAAAAACCTGTCTACACAACTGATGAAGAAAGAAGCTGAGTTTTTGTCCAGTTAACTGGGAAAATGTCTTGGAGAGATCTTTTTGATCCATGTTAATGCCAATCAACAAAAGCCCCTCATTGCTTATTGAAAAGCTTCATAGTGAAGATAAGAAATGTTGGAGGAGAGGGAGATGCTTGGCCAATGTTGTAAATGTGGTATAAATTACGATGCAGGTTTTTTTTTCATTAGTTCCATTTGTTGAGTGTTACCACTGCCATTTCTTTCTTGTTTAGATTTTATTTTATTTTATTGGTTTAAATACTATTTTGGTCTCTACACAGATTTGGTTCATTTTGGTCCCATGTACTTTAAAAACGTCCATTTTAGTTCCTATGCTTTTTAAAAGTTTCATTTTGGTCCTTGCTGTAAATTTACATTTAAACAACTAACGGATTGATGATGTGTCGTTTTTCT

mRNA sequence

ATGGCGCTTCACAAAACCCAGAAGGCAAAATCCAAACCCAGATCTCCACTCCTCTTCTTCTTCGTTGCCCTCGCCGCCATTGCGCTTATTTTCCTCTTTTCCTCTCTCATTTCTACCAATGGGGCTTCTTCTTCTTTTCCATCCTCAGATTCAATTCAGAAAATCTTCAGATTCAAGAATCTGAACCAGAAACAGAGACGTAATCGTCACGTTTTTAGTGTGAACGATAAGTTCTTGTACTGGGGCAACCGAATCGACTGCCCTGGGAAGCACTGCGAGTCTTGTGAGGGTTTAGGTCACCAGGAGTCCAGCTTGAGGTGCGCTCTTGAGGAAGCCATGTTCCTTCAGAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACATAATAAGAAGGGGATTCTTCATCAGTCGAAGAATGCAAGCTCAGAGGAAAGTTGGGAAGCAAACTCTTGTGCGATGGACTCTTTGTACGATGTGGACCTTATATCTGACACTGTACCAGTTATTTTAGACAACTCGAAACTATGGTATCAGGTGCTGTCAACTGGTATGAAATTGGGAGCTAGAGCAGTTGCCCACGTTGAACAAGTTAGTCGTGTTGAACTCAGGGACAACAGCCGTTACTCCAATCTTTTCATAATAAACCGAACTGCCAGCCCTCTTTCATGGTTTATGGAGTGCAAGGACAGAAACAATCGTAGTGCCATATTGCTGCCGTATAAATTTCTTCCTTCAATGGCTGCAGAAAACTTGAGGGATGCAGCTGAGAAGATTAAGGGGCTACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGAGATACACGGCCCGAGTTTTTGCTAAAGAGAATTGCAAAGTGGGTTCCACCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAACTCCTGGGTTCTTTTCGCCCCTCTCTGCTCGGTACAAGTTGGCATATTCCTCAAACTATAGCCATATTCTGGATCCCGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGGTCATCATGGCGGGTGCTAAGACGTTCATTAGAACATTCAAAGAAGATGATACAGATCTCAGCCTCACCGATGACCCGAAGAAGAACACAAAGATATGGCAAAAACCTGTCTACACAACTGATGAAGAAAGAAGCTGA

Coding sequence (CDS)

ATGGCGCTTCACAAAACCCAGAAGGCAAAATCCAAACCCAGATCTCCACTCCTCTTCTTCTTCGTTGCCCTCGCCGCCATTGCGCTTATTTTCCTCTTTTCCTCTCTCATTTCTACCAATGGGGCTTCTTCTTCTTTTCCATCCTCAGATTCAATTCAGAAAATCTTCAGATTCAAGAATCTGAACCAGAAACAGAGACGTAATCGTCACGTTTTTAGTGTGAACGATAAGTTCTTGTACTGGGGCAACCGAATCGACTGCCCTGGGAAGCACTGCGAGTCTTGTGAGGGTTTAGGTCACCAGGAGTCCAGCTTGAGGTGCGCTCTTGAGGAAGCCATGTTCCTTCAGAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACATAATAAGAAGGGGATTCTTCATCAGTCGAAGAATGCAAGCTCAGAGGAAAGTTGGGAAGCAAACTCTTGTGCGATGGACTCTTTGTACGATGTGGACCTTATATCTGACACTGTACCAGTTATTTTAGACAACTCGAAACTATGGTATCAGGTGCTGTCAACTGGTATGAAATTGGGAGCTAGAGCAGTTGCCCACGTTGAACAAGTTAGTCGTGTTGAACTCAGGGACAACAGCCGTTACTCCAATCTTTTCATAATAAACCGAACTGCCAGCCCTCTTTCATGGTTTATGGAGTGCAAGGACAGAAACAATCGTAGTGCCATATTGCTGCCGTATAAATTTCTTCCTTCAATGGCTGCAGAAAACTTGAGGGATGCAGCTGAGAAGATTAAGGGGCTACTTGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGAGATACACGGCCCGAGTTTTTGCTAAAGAGAATTGCAAAGTGGGTTCCACCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAACTCCTGGGTTCTTTTCGCCCCTCTCTGCTCGGTACAAGTTGGCATATTCCTCAAACTATAGCCATATTCTGGATCCCGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGGTCATCATGGCGGGTGCTAAGACGTTCATTAGAACATTCAAAGAAGATGATACAGATCTCAGCCTCACCGATGACCCGAAGAAGAACACAAAGATATGGCAAAAACCTGTCTACACAACTGATGAAGAAAGAAGCTGA

Protein sequence

MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKNLNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQLFMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS
Homology
BLAST of Spg012297 vs. NCBI nr
Match: XP_022967807.1 (uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima])

HSP 1 Score: 775.8 bits (2002), Expect = 1.9e-220
Identity = 378/410 (92.20%), Positives = 394/410 (96.10%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKN 60
           MA+HKTQKAK KPRSP LFFFVALA IA +FLFSSLISTNG SSSFPSS+SI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           LNQKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILHQS NASSEE WE NSCAMDSLYD+DLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD+SRYSNL +INRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360
           RPEF+LKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           FMIER+IMAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQKP+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410
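
The percentages in the HSP header above are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for this hit (378 identical and 394 positive-scoring positions over 410 aligned columns); the function name `percent` is a hypothetical helper, not part of any BLAST tool:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header of the first NCBI nr hit.
identity = percent(378, 410)
positives = percent(394, 410)
print(f"Identity = {identity:.2f}%, positive matches = {positives:.2f}%")
```

The same calculation applies to every HSP header in this record; only the match counts and alignment length change.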

BLAST of Spg012297 vs. NCBI nr
Match: XP_023545162.1 (uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 771.2 bits (1990), Expect = 4.6e-219
Identity = 377/411 (91.73%), Positives = 395/411 (96.11%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNG-ASSSFPSSDSIQKIFRFK 60
           MA+HKTQKAK KPRSP LFFFVALA IA +FLFSSLISTNG +SSSFPSS+SI++IFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           NLNQKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR F
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQS NASSEE WE NSCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAI 240
           YQV STGMKLG+R VAHV+QVSR+ELRD+SRYSNL +INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEF+LKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           LFMIER++MAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQKP+YT DEE S
Sbjct: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 411

BLAST of Spg012297 vs. NCBI nr
Match: KAG7033584.1 (hypothetical protein SDJN02_03308 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 767.3 bits (1980), Expect = 6.7e-218
Identity = 376/411 (91.48%), Positives = 393/411 (95.62%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNG-ASSSFPSSDSIQKIFRFK 60
           MA+HKTQKAK KPRSP LFFFVALA IA +FLFSSLISTNG +SSSFPSS+SI++IFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           NLNQKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR F
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILH S NASSEE WE NSCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAI 240
           YQV STGMKLG+R VAHV+QVSR+ELRD+SRYSNL +INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEF+LKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           LFMIER+IMAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQKP+YT DEE S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 411

BLAST of Spg012297 vs. NCBI nr
Match: XP_022932889.1 (uncharacterized protein LOC111439414 isoform X1 [Cucurbita moschata])

HSP 1 Score: 764.2 bits (1972), Expect = 5.6e-217
Identity = 374/410 (91.22%), Positives = 390/410 (95.12%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKN 60
           MA+HKTQKAK KPRSP LFFFVALA IA +FLFSSLIST G SSSFPSS+SI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           LNQKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILH S NASSEE WE NSCAMDSLYD+DLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD+SRYSNL IINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360
           RPEF+LKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL PVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360

Query: 361 FMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           FMIER+IMAGAKTFIRTFKED+TDLSLTDDPKKNTK+WQKP+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of Spg012297 vs. NCBI nr
Match: XP_022153942.1 (uncharacterized protein LOC111021332 [Momordica charantia])

HSP 1 Score: 763.5 bits (1970), Expect = 9.6e-217
Identity = 376/411 (91.48%), Positives = 393/411 (95.62%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSS-FPSSDSIQKIFRFK 60
           MA  + QKAK KPRSPL FFFVALAAIA +FLFSSLISTNGASSS F SS+SIQKIFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           N+N+K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQS NASSEESWEA SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAI 240
           YQVLSTGMKLGARAVAHVE+VSR EL+DN+RYSNL +INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEF+LKR+AKWV PGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDP+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           LFMIER+IMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYT DEE+S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of Spg012297 vs. ExPASy TrEMBL
Match: A0A6J1HRU4 (O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 PE=3 SV=1)

HSP 1 Score: 775.8 bits (2002), Expect = 9.1e-221
Identity = 378/410 (92.20%), Positives = 394/410 (96.10%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKN 60
           MA+HKTQKAK KPRSP LFFFVALA IA +FLFSSLISTNG SSSFPSS+SI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           LNQKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILHQS NASSEE WE NSCAMDSLYD+DLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD+SRYSNL +INRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360
           RPEF+LKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           FMIER+IMAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQKP+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of Spg012297 vs. ExPASy TrEMBL
Match: A0A6J1EY95 (uncharacterized protein LOC111439414 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439414 PE=4 SV=1)

HSP 1 Score: 764.2 bits (1972), Expect = 2.7e-217
Identity = 374/410 (91.22%), Positives = 390/410 (95.12%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKN 60
           MA+HKTQKAK KPRSP LFFFVALA IA +FLFSSLIST G SSSFPSS+SI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           LNQKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILH S NASSEE WE NSCAMDSLYD+DLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD+SRYSNL IINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360
           RPEF+LKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL PVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360

Query: 361 FMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           FMIER+IMAGAKTFIRTFKED+TDLSLTDDPKKNTK+WQKP+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of Spg012297 vs. ExPASy TrEMBL
Match: A0A6J1DKB9 (O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC111021332 PE=3 SV=1)

HSP 1 Score: 763.5 bits (1970), Expect = 4.7e-217
Identity = 376/411 (91.48%), Positives = 393/411 (95.62%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSS-FPSSDSIQKIFRFK 60
           MA  + QKAK KPRSPL FFFVALAAIA +FLFSSLISTNGASSS F SS+SIQKIFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           N+N+K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQS NASSEESWEA SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAI 240
           YQVLSTGMKLGARAVAHVE+VSR EL+DN+RYSNL +INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEF+LKR+AKWV PGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDP+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEERS 411
           LFMIER+IMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYT DEE+S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of Spg012297 vs. ExPASy TrEMBL
Match: A0A5D3DWC1 (O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold749G00060 PE=3 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 1.1e-213
Identity = 372/409 (90.95%), Positives = 388/409 (94.87%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKN 60
           MA  +TQK K K RSPL+FFFV+LAAIA +FLFSSLISTNG SSSFPSS+SIQKIFRFKN
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKN 60

Query: 61  LNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           L QKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV
Sbjct: 61  LTQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120

Query: 121 MPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LHQS NASSEESWEANSCAMDSLYD+DLISDTVPVILDNSK WY
Sbjct: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAIL 240
           QVLST MKLGARAVAHVEQVSR+ELRD+S YSNL +INRTASPLSWFMECKDRNNRSA++
Sbjct: 181 QVLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVM 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360
           RPEF+LKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQL 360

Query: 361 FMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEER 410
           FMIER+IMAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQ PVYT +E R
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of Spg012297 vs. ExPASy TrEMBL
Match: A0A1S3C1H9 (O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 1.1e-213
Identity = 372/409 (90.95%), Positives = 388/409 (94.87%), Query Frame = 0

Query: 1   MALHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKN 60
           MA  +TQK K K RSPL+FFFV+LAAIA +FLFSSLISTNG SSSFPSS+SIQKIFRFKN
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKN 60

Query: 61  LNQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           L QKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV
Sbjct: 61  LTQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120

Query: 121 MPSRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LHQS NASSEESWEANSCAMDSLYD+DLISDTVPVILDNSK WY
Sbjct: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAIL 240
           QVLST MKLGARAVAHVEQVSR+ELRD+S YSNL +INRTASPLSWFMECKDRNNRSA++
Sbjct: 181 QVLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVM 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360
           RPEF+LKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQL 360

Query: 361 FMIERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEER 410
           FMIER+IMAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQ PVYT +E R
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of Spg012297 vs. TAIR 10
Match: AT2G41150.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 127 Blast hits to 127 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 531.6 bits (1368), Expect = 5.7e-151
Identity = 263/405 (64.94%), Positives = 321/405 (79.26%), Query Frame = 0

Query: 3   LHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKNLN 62
           + K  K K+ P S  L   + + A+A + LF+S+IST G   + P   ++   F      
Sbjct: 4   MSKPHKLKATPGSQRL-VLLCIVAVAFLLLFTSVISTGGL--ALPYRTTLIGYF------ 63

Query: 63  QKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMP 122
            +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMP
Sbjct: 64  VRSTRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMP 123

Query: 123 SRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWYQV 182
           SRMCINPIHNKKGIL++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +
Sbjct: 124 SRMCINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIM 183

Query: 183 LSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAILLP 242
           LST MKL  R  AHV   +R EL D+S ++NL +INRTASPL+WF+ECKDR NRS ++LP
Sbjct: 184 LSTSMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLP 243

Query: 243 YKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP 302
           Y FL +MAA  LRDAAEKIK  LGDYDAIHVRRGDK+KTRKDRF V+RS  PHLDRDTRP
Sbjct: 244 YSFLQTMAASRLRDAAEKIKAKLGDYDAIHVRRGDKLKTRKDRFRVERSQFPHLDRDTRP 303

Query: 303 EFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQLFM 362
           EF++ RI K +PPGRTLFI SNERTP FFSPL+ RYK+AYSSN+S ILDP+++NNYQLFM
Sbjct: 304 EFIIGRIQKQIPPGRTLFIGSNERTPDFFSPLAIRYKVAYSSNFSEILDPIIENNYQLFM 363

Query: 363 IERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDE 408
           +ER+IM GAKTF +TF+E +TDL+LTDDPKKN K W+ PVYT DE
Sbjct: 364 VERLIMMGAKTFFKTFREYETDLTLTDDPKKN-KNWEIPVYTMDE 398

BLAST of Spg012297 vs. TAIR 10
Match: AT3G56750.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41150.2); Has 128 Blast hits to 128 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 528.1 bits (1359), Expect = 6.3e-150
Identity = 264/407 (64.86%), Positives = 320/407 (78.62%), Query Frame = 0

Query: 3   LHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKNLN 62
           + K Q+ K    S  L  F  +   + + LFSS+IST       P   ++   F +    
Sbjct: 4   MSKAQRTKPTSGSQRLVLF-CIVVFSFLLLFSSVIST--GKLGLPYQQTLIDYFVWSPRG 63

Query: 63  QKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMP 122
           ++Q       S+++K+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMP
Sbjct: 64  KRQH------SLSEKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMP 123

Query: 123 SRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWYQV 182
           S MCINPIHNKKGIL++S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK W+ V
Sbjct: 124 SGMCINPIHNKKGILNRSDNKTTEEGWLGSSCAMDSLYDIDLISEKIPVILDDSKTWHIV 183

Query: 183 LSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAILLP 242
           LST MKLG R +AHV  V+R  L++ S YSNL IINRTASPL+WF+ECKDR+NRSA++LP
Sbjct: 184 LSTSMKLGERGIAHVSGVTRHRLKE-SHYSNLLIINRTASPLAWFVECKDRSNRSAVMLP 243

Query: 243 YKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRP 302
           Y FLP+MAA  LR+AAEKIK  LGDYDAIHVRRGDK+KTRKDRFGV+R   PHLDRDTRP
Sbjct: 244 YSFLPNMAAAKLRNAAEKIKAQLGDYDAIHVRRGDKLKTRKDRFGVERIQFPHLDRDTRP 303

Query: 303 EFLLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQLFM 362
           EF+L+RI K +P GRTLFI SNER PGFFSPL+ RYKLAYSSN+S ILDP+++NNYQLFM
Sbjct: 304 EFILRRIEKRIPRGRTLFIGSNERKPGFFSPLAVRYKLAYSSNFSEILDPIIENNYQLFM 363

Query: 363 IERVIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTTDEER 410
           +ER++M GAKT+ +TFKE +TDL+LTDDPKKN K W+ PVYT DE R
Sbjct: 364 MERLVMMGAKTYFKTFKEYETDLTLTDDPKKN-KNWEIPVYTMDERR 399

BLAST of Spg012297 vs. TAIR 10
Match: AT2G41150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 57 Blast hits to 57 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 308.1 bits (788), Expect = 1.0e-83
Identity = 155/262 (59.16%), Positives = 196/262 (74.81%), Query Frame = 0

Query: 3   LHKTQKAKSKPRSPLLFFFVALAAIALIFLFSSLISTNGASSSFPSSDSIQKIFRFKNLN 62
           + K  K K+ P S  L   + + A+A + LF+S+IST G   + P   ++   F      
Sbjct: 4   MSKPHKLKATPGSQRL-VLLCIVAVAFLLLFTSVISTGGL--ALPYRTTLIGYF------ 63

Query: 63  QKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMP 122
            +  RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMP
Sbjct: 64  VRSTRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMP 123

Query: 123 SRMCINPIHNKKGILHQSKNASSEESWEANSCAMDSLYDVDLISDTVPVILDNSKLWYQV 182
           SRMCINPIHNKKGIL++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +
Sbjct: 124 SRMCINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIM 183

Query: 183 LSTGMKLGARAVAHVEQVSRVELRDNSRYSNLFIINRTASPLSWFMECKDRNNRSAILLP 242
           LST MKL  R  AHV   +R EL D+S ++NL +INRTASPL+WF+ECKDR NRS ++LP
Sbjct: 184 LSTSMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLP 243

Query: 243 YKFLPSMAAENLRDAAEKIKGL 265
           Y FL +MAA  LRDAAEK+K L
Sbjct: 244 YSFLQTMAASRLRDAAEKVKEL 256

BLAST of Spg012297 vs. TAIR 10
Match: AT4G12700.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G04280.1); Has 136 Blast hits to 136 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 92.0 bits (227), Expect = 1.2e-18
Identity = 77/311 (24.76%), Positives = 131/311 (42.12%), Query Frame = 0

Query: 93  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGILHQSKNASSEESWEAN 152
           + C+ + H   S  CAL EA +L RT VM   +C++ ++   G   + K+      +E  
Sbjct: 264 DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSVYTLSGQNEEGKDFRFYFDFE-- 323

Query: 153 SCAMDSLYDVDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDNSRYS 212
                 L +   + D V    D  K WY+    G+KL       V  +  V+++D     
Sbjct: 324 -----HLKEAASMLDQVQFWADWGK-WYK--KNGLKLHLVEDFRVTPMKLVDVKDT---- 383

Query: 213 NLFIINR--TASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAAEKIKGLLG-DYD 272
              I+ +  T  P +++    +    S +  P+  L    ++ L +    I   L  DYD
Sbjct: 384 --LIMRKFGTVEPDNYWYRVCEGETESVVQRPWNLL--WKSKRLMEIVSAIASRLNWDYD 443

Query: 273 AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFLLKRIAKWVPPGRTLFIASNERTPG 332
           AIH+ RGDK +        ++ + P+L++DT P  +L  +   +  GR L+IA+NE    
Sbjct: 444 AIHIERGDKAR--------NKEVWPNLEKDTSPSSILSTLQDKIEQGRNLYIATNEPELS 503

Query: 333 FFSPLSARYKLAYSSNYSHILD----------------PVVKNNYQLFMIERVIMAGAKT 385
           FF+PL  +YK  +   +  + D                PV  + Y    ++  +    K 
Sbjct: 504 FFNPLKDKYKPHFLDEFKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGKK 548

BLAST of Spg012297 vs. TAIR 10
Match: AT2G04280.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G12700.1); Has 130 Blast hits to 130 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 90.1 bits (222), Expect = 4.4e-18
Identity = 75/311 (24.12%), Positives = 128/311 (41.16%), Query Frame = 0

Query: 93  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGILHQSKNASSEESWEAN 152
           + C+ + H   S  CAL EA +L RT VM   +C++ I+   G   + K+          
Sbjct: 269 DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSIYTSSGQNEEGKD---------- 328

Query: 153 SCAMDSLYDVDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDNSRYS 212
                  +D + + +   V LD ++ W Q      K   R   H+ +  RV     +   
Sbjct: 329 ---FRFYFDFEHLKEAASV-LDEAQFWAQWGKLRKKRRNRLNLHLVEDFRVTPMKLAAVK 388

Query: 213 NLFIINRTAS--PLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAAEKIKGLLG-DYD 272
           +  I+ +  S  P +++    + +  S +  P+  L    +  L +    I   L  DYD
Sbjct: 389 DTLIMRKFGSVEPDNYWYRVCEGDAESVVKRPWHLL--WKSRRLMEIVSAIASRLNWDYD 448

Query: 273 AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFLLKRIAKWVPPGRTLFIASNERTPG 332
           A+H+ RG+K +        ++ + P+L+ DT P  LL  +   V  GR L+IA+NE    
Sbjct: 449 AVHIERGEKAR--------NKEVWPNLEADTSPSALLSTLQDKVEEGRHLYIATNEGELS 508

Query: 333 FFSPLSARYKLAYSSNYSHILD----------------PVVKNNYQLFMIERVIMAGAKT 385
           FF+PL  +Y   +  +Y  + D                PV  + Y    ++  +    K 
Sbjct: 509 FFNPLKDKYATHFLYDYKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGKK 555
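
The percentages on each HSP summary line can be rechecked from the raw counts. The following is a minimal Python sketch, not part of the original record; the function name parse_hsp_stats and the returned field names are illustrative assumptions. It reads one "Identity = ... Positives = ..." line and recomputes both percentages from the counts.

```python
import re

# Pattern for the per-HSP summary line. "Posi?tives" also accepts the
# "Postives" spelling that some exports of this page use.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper: returns the raw counts, the percentages
    recomputed from those counts, and the percentages as reported.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 155/262 (59.16%), Positives = 196/262 (74.81%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

For the AT2G41150.1 hit above, the recomputed values (155/262 and 196/262) agree with the reported 59.16% and 74.81%.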

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022967807.1  1.9e-220  92.20     uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima]
XP_023545162.1  4.6e-219  91.73     uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo]
KAG7033584.1    6.7e-218  91.48     hypothetical protein SDJN02_03308 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022932889.1  5.6e-217  91.22     uncharacterized protein LOC111439414 isoform X1 [Cucurbita moschata]
XP_022153942.1  9.6e-217  91.48     uncharacterized protein LOC111021332 [Momordica charantia]

Match Name  E-value   Identity  Description
A0A6J1HRU4  9.1e-221  92.20     O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 ...
A0A6J1EY95  2.7e-217  91.22     uncharacterized protein LOC111439414 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1DKB9  4.7e-217  91.48     O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC1110213...
A0A5D3DWC1  1.1e-213  90.95     O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5...
A0A1S3C1H9  1.1e-213  90.95     O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3...

Match Name   E-value   Identity  Description
AT2G41150.2  5.7e-151  64.94     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G56750.1  6.3e-150  64.86     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G41150.1  1.0e-83   59.16     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G12700.1  1.2e-18   24.76     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G04280.1  4.4e-18   24.12     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                          Source   Source Term    Source Description    Alignment
IPR019378  GDP-fucose protein O-fucosyltransferase  PFAM     PF10250        O-FucT                coord: 93..378; e-value: 5.8E-9; score: 36.1
None       No IPR available                         GENE3D   3.40.50.11350                        coord: 237..379; e-value: 9.0E-7; score: 30.8
None       No IPR available                         PANTHER  PTHR31469:SF8  PLANT/PROTEIN         coord: 4..408
None       No IPR available                         PANTHER  PTHR31469      OS07G0633600 PROTEIN  coord: 4..408

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Spg012297.1   Spg012297.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006004      fucose metabolic process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0016740      transferase activity