Spg039072 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg039072
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHexosyltransferase
Locationscaffold12: 1316298 .. 1322013 (+)
RNA-Seq ExpressionSpg039072
SyntenySpg039072
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AACAAAAAAAAAAAAAAAATAGAGATGTTTTAACGGATTAAAAGTGTATTTGTTTTCGTGCTTAAGCAATTAATTACGTTTTGGTCACTTACCAAATTTGATGATGAGATTTTGAATACATAGTTTCAATCTGTTCATCTCCATCTCTGAAATGGACTGAAAATTTTCACCAACCGCAAATTCAACCTCTTCTTCTTCTTTCATCGAATTTCTCGTCCAGTCTCAGTCTTCTCCACTTCTCAGCTCTCAAAATCCAATCGATTCTGCAGATCAATTTCAATCCGAAACAAGTTTCGGAGTCGATTTTGTGATCTTCAATATCTTAAATGCATCAGGTTTGATTTTTTTGGTTTTTCAAGGTTGAAGACGAAGCCTATTAGTTTCTGTGAAGATGCGGAGTAAGGGATCGAATGCTCGACTGTCGAGCATGCCTATTCGATCCCGAATTCCCACCCTTTTGCTCTCCATGTTCGCTACCTTCGCTTCAATCTACGTCGCCGGCCGGTTTGTTCTCTTTGATTTCATCGTTTCCACTATTTCTTCTTCCCACAGTGTTTGATTATGGATGATTCTGAATTCTCCGTGATTTGCAGGCTATGGCAGGATGCGGAGAACAGGGTTTACTTGATTAAAGAGTTGGATAGGCTAACTGGTCAGGTGGGTTAGCCGCTTTAGCTATTAATGGTGCTTATTGATTTGTGGAATTTTTATTTTTTAGTTAACATTATTTGTAAGCCACAATTGAGCATCTCCATTTCGTATTGTACTGAGCTTAAGCTGAATTTCAGCTACTATTTTGCGACTAGCGGTTCAATTATTACATTGATTGGGACGGTTTTGCTGAATCTAATTAATATAAATGTCCGTCAACGGTAATGCAGTATGCATAGGGAAGAAACGGTGGTTGATAGTCCCATGGAGGTCCTTGAAATATGTAACATTTTTTGTTTCTTTGCTTCAAGATTTTTCCCCTCTTAAAGGGACACTCGTGAATGGGAAAACTCTGTATGAGTTTGTTGATTTCTTGGATTTTTTTTATAACTGGCATATTGTTATAATTTCTTTGCTAATTATCTAACAAGTTCAGACTGTTCTTGTTATTGTTTGTTTGTTCTTTTCTTTACAAAGGCTTGAGTCGATACCTGCATTCAGATTCTGGATGGAAATTTAAATCAACCAAAATTTGGTAGTTGATGTTGAATCGTCACTATGAAATAGCATAGGATGATTTAAATGGAACACATTTGCATCTTACTTGCAGTGAGGCTATGGCTTAGCCAAAAAATGCAATAGTGGCTATCAGTTCTTGGCTTTGAATTCGTTACTTTTACACCCCTTCTTTTAGGTGCAGATCCGGTTCCAATTTTTGTTCATTTCATATCCACTTGTTTACTTTTTGTCTTCTCTGGATTATTCAAATTGATCAGTGGTACTTGTTCAGGGGCAATCTGCCATTTCAGTGGATGATACATTAAAAATCATAGCCTGCAGGTAGTTCTTATTTTTATTTGGTTGAATTGAAACCATCTAATTTTATGTATGCAGCTCCAATCTCTTGTAATATATTTATATATTTTTGTAGGGAACAACAGAAGAAGTTGTTGGCGCTTGAGATGGAGTTGGCTGTTGCTAGACAGGAGGGTTTTTTGGTGAAGCATTCAAGAGAGACTAATGAAACAAAGATCCCCTTGGTTGTAATTGGAGTCATCACTAGATTTGGCCGCAAAAACAACAGAGATGCAATTCGTAAAGCGTGGATGGGAACTGGTAATGCATCATTCTTCCCTGTGGTCAAAGTTATTCACATACCCAATTGCCAGGATTGAGGTTGTGTATGCCTCAAAGGTTAAGCAACGAAAAATCAGGGTTAGGTCTGGAAAAATGTTGCTTTTTGTGACTTTGCAATTCATTCTTGTACTTTCCCATGGCGGAGAAACAAATGTCTTCCCTCCTTTCTGATTGCTACATTTTGAAAATATAGTTCGTGCTTATGTATATTCAATCATTTCTTAAGGTTGTTTTAATGTTTAGAACTAAAGAGCTTCATTTGCAGGTGCTTCTTTGAGAAAAATGGAGAATCAGAAGGGTATAATTGCTAGATTTGTCATCGGAAGAAGGTTCTATTTCTAGTCATTTATCAACTTTATGTCTAATATTCTAATTTTCATCTAAATGCCTTTCTTAACATCTAGTTTTATGTACTTTCTTACAGTCCAAACCGTGGGGACAGTTTAGACAGAGCCATTGACGATGAAAATGGACAATATAATGATTTCATTATACATGTATGTTTTCTTAATGGTTATAGATGATTTGCTTCAACAGCAAAATCATTTCTTGCTAAGGGATGCAATTTTGGAGAATGATACACAATCCGTGTTACTCTGAGTAAAATTTAGTTGACTGGCTTTAGCCTACTTCTTTACAGAATGACCATGTGGAGGCGCCTGAGGAGCTTTCAAAGAAGGCCAAGCTTTTTTTTGCTTATGCTGTTGATAAATGGGATGCCGAATTTTATGCCAAAGTCAATGATGACGTTTATATAAACATTGGTTAGTACATCTATTGAGAGGAAATATTTTCCTGTCTTTGCATTTAGCTTTGCATGTAAAACATCCACCTTGAGAAAGTAAGAAACTTCATGCCTCATGCTAAACAATCACTTTCTTGATATGCTAGCAGTGTATCTTTTTGTATTGAAAGTGAGATAATGCAGTGCAATACAACATTGGTTGAGACTGTCGCACTATAATTGTAACGCCCCAAGCCCAGGAGTCGGAATCCGGATTCGGCACTTGACAGCCCCGGCATTCCCCTGCGACATGACCACGTTACCATACTCGTCTTAAACCGCTTCTAAGATCGAAGATTGTCCCCACAAACCAACATGGGATCCTTTTAGCATGCTTTGTCCTCACTCACATGCTTCCTAAGAAAATTCCCAGGAGGTCACCCAACATAGAACTTCTCCAAGCTAAGCACGCTTAACTTCAAAGTTCCTATGATTGAGCCACCGAAAAGGAAGGTGCACCTTGTTGGTATAGGTAGTATCTATCAATTCTTTTAAGCTTATCTTGACTATGCTTTCATCTCCTCAGGATCCCTCTCATTCGGATTTGGTATCGGTTCATTCATGTACCCTTCCTAACCCGGGCGTCACATGCCCACCAGCTTCCGCCTTGGTTCGTCCCCGAACCACATCCTACTGGGAGAGGTTCCGCTCTGATACCATCTGTAACGCCCCAAGCCCAGGAGTCGGAATCCGGATTCGGCACTTGATAGCCCCGGCATTCCCCTGCGACCTGACCACGTTACCATACTCGTCTTAAACCGTTTCTAAGATTGAAGATTGTCCCCACAAACCAACACGGGGTCCTTTTAGCATGCTTTGTCCTCACTCACATGCTTCCTAAGAAAATTCCCAGGAGGTCACCCAACATAGAACTTCTCCAAGCTAAGCACGCTTAACTTCAAAGTTCCTATGATTGAGCCACCGAAAAGGAAGGTGCACCTTGTTGGTATAGGTAGTATCTATCAATTCTTTTAAGCTTATCTTGACTATGCTTTCATCTCCTCAGGATCCCTCTCATTCGGATGTGGTATCGGTTCATTCATGTACCCTTCCTAACCCGGGCGTCACAATAATGGAGGATGTTCTGCATGTTCTGTTTAAAGATAGAGGGAGGACTCTTTAGTAGACTGTCTCTTTGCTTTTCTTTGGAGGATTGGGTTAGAAATAAAAAGGGAATTTCTAAGAGGGTAGAGAAATCTTGAGAAGTTTGAAGTCTCGTGCATGGCAATACCTTCTTTTGGGTCTTTGTGGTTATCATTACTTTTGATTCTTAGCAAGGATTCTCTCCTTGTACCGTTTCTTTCCTAGGGTTCATTTTGTTTGGATCTGGCCTGTGTTTTCATGACCTTATTTTTTTAAAGGGAAAAAAAAAACTCTCGTTCCTTTTTTTCTTATAAAGCACAATTATTACTCTACCCTCAACTGCATGGTTCATAATCTTGTTTGCGTTATTTTCCCCCATTGAAGTGAAGCTATTCTCAACTGCTTCTCATTGTGCTTATGGCACTTGTGAGTAAAAATGTATACATGGGCTAATCATAACTGTACTTTATTTTTGTTGTGTACAGATGCTCTAGGGAGCACACTTGCTTCTTACTTGGACAAACCTCGTGTCTATGTTGGGTGCATGAAATCTGGTGAAGTATTCTCAGAACCGTGAGTATGGCGTATCACCAAAAACATTTAACATAGGTAGTGGACATTTGACCTTGAAATTAAATCTCTCATCGTTCTCCTTTTGGACCTTTTCAGGAGCCATAAATGGTACGAACCAGATTGGTGGAAATTTGGTGACAAAAAAACGTGAGTGACACATTGCATGTGTATATGCTTTATTGCAAAAATTAATGTTACTAATTCGTCCTGCTAAATTCTTTCCATTGCCGAACATGCTAAGACTATGTTTGGGGTGTTCTTGCACTTCTGCTATGGTCAAGAACACGTGGAGTAATATAAAAGTACTCTAGAGAAAGTAAAAAAAATACTTGAAAAGTGTTCTTTTTGATTAATGAAAGTGTTATTATTATTATTATTTTTTAAAAAACTTTTCAGAAAGTATTTGTGTTTGGGCTCAAATACTTGATTGAAGAACACTTCAAAAACTATTTTAAAGGTGTAGAAACAAAGTTCATTGAGTACTTTAGTAAAAGAACACTTCTGCTAAAAGCACTTATTTTAGAAGCATTTTCGTACGCACCTTGACTCTGTGATCCTCCGTGGATTTGTGAACTATCTTTTATCTTTTCAGCTACACATTTTGAAAGTTATCTATACTCCTTTATAATTGTGTTTCATATGTTTGTATTCTTTTTAATAAAAATTGTCATCTTTGGCAAACAGATACTTCCGTCATGCTTCTGGCGAAATGTATGTCATATCAAAAGCTCTGGCCAAGTTTATTTCAATAAACAGGCAAGTTTCTACATGTAATCCATTGTCTTGTCTGCATATTACATGGCTCAACACAAATGATTAGATCGGGTTCATCAAATCTGCAGATCTCTTCTCCGTTCTTACGCCCATGACGATGTCACGACCGGTTCCTGGTTTATTGGGCTTGATGTCACATACATTGACGAGGGAAAGTTTTGCTGCTCATCTTGGTCTGCAGGTCTGTCTCTCCCTGTAGCTAATAGAACAGATCAGAGTCGGAAATCTAGCAACCCAAACTTTCAACATTAACACCAAAATTACGAATATGGTAGCAAGTTGATTTTTCTTTTTCTCCCTTTTCAGGAGCTATTTGTGCAGGTGTCTGATTAGTTTGCTTGAAGATCCTTGATGGGAAGGGAATACAAGCCTGAAGAGAACACAGAATTAATTTACATCTTTTTGATGCCCTTGTGTGTGGCCAGACGAAAGACGATAGCTCTTCCAACACTGCCTTCGGATTCTTACGCTAACATCGATTTCTCGAGAGACTGGCCAGAGGTTCAACAGTTTGTAAGATGTGCAACACAGAAACATTAACAGTTGAAGGGGAACAGTACTTTCTTCATTCTTTGGACTTCATCTTACTGATTCTTATATGCAATTTTAAGTTCATTTAGCACATCGGAAATGTTAATTCTTTCAGGCAGGTTGGAACTTGAGAAATGTAGATTTAAAGGCACAGTTCTATTAAAAGGGCACCTATTATCGAGG

mRNA sequence

ATGCGGAGTAAGGGATCGAATGCTCGACTGTCGAGCATGCCTATTCGATCCCGAATTCCCACCCTTTTGCTCTCCATGTTCGCTACCTTCGCTTCAATCTACGTCGCCGGCCGGCTATGGCAGGATGCGGAGAACAGGGTTTACTTGATTAAAGAGTTGGATAGGCTAACTGGTCAGGGGCAATCTGCCATTTCAGTGGATGATACATTAAAAATCATAGCCTGCAGGGAACAACAGAAGAAGTTGTTGGCGCTTGAGATGGAGTTGGCTGTTGCTAGACAGGAGGGTTTTTTGGTGAAGCATTCAAGAGAGACTAATGAAACAAAGATCCCCTTGGTTGTAATTGGAGTCATCACTAGATTTGGCCGCAAAAACAACAGAGATGCAATTCGTAAAGCGTGGATGGGAACTGGTGCTTCTTTGAGAAAAATGGAGAATCAGAAGGGTATAATTGCTAGATTTGTCATCGGAAGAAGTCCAAACCGTGGGGACAGTTTAGACAGAGCCATTGACGATGAAAATGGACAATATAATGATTTCATTATACATAATGACCATGTGGAGGCGCCTGAGGAGCTTTCAAAGAAGGCCAAGCTTTTTTTTGCTTATGCTGTTGATAAATGGGATGCCGAATTTTATGCCAAAGTCAATGATGACGTTTATATAAACATTGATGCTCTAGGGAGCACACTTGCTTCTTACTTGGACAAACCTCGTGTCTATGTTGGGTGCATGAAATCTGGTGAAGTATTCTCAGAACCGAGCCATAAATGGTACGAACCAGATTGGTGGAAATTTGGTGACAAAAAAACATACTTCCGTCATGCTTCTGGCGAAATGTATGTCATATCAAAAGCTCTGGCCAAGTTTATTTCAATAAACAGGCAAGTTTCTACATATCTCTTCTCCGTTCTTACGCCCATGACGATGTCACGACCGGTTCCTGGTTTATTGGGCTTGATGTCACATACATTGACGAGGGAAAGTTTTGCTGCTCATCTTGGTCTGCAGGAGCTATTTGTGCAGGTGTCTGATTAG

Coding sequence (CDS)

ATGCGGAGTAAGGGATCGAATGCTCGACTGTCGAGCATGCCTATTCGATCCCGAATTCCCACCCTTTTGCTCTCCATGTTCGCTACCTTCGCTTCAATCTACGTCGCCGGCCGGCTATGGCAGGATGCGGAGAACAGGGTTTACTTGATTAAAGAGTTGGATAGGCTAACTGGTCAGGGGCAATCTGCCATTTCAGTGGATGATACATTAAAAATCATAGCCTGCAGGGAACAACAGAAGAAGTTGTTGGCGCTTGAGATGGAGTTGGCTGTTGCTAGACAGGAGGGTTTTTTGGTGAAGCATTCAAGAGAGACTAATGAAACAAAGATCCCCTTGGTTGTAATTGGAGTCATCACTAGATTTGGCCGCAAAAACAACAGAGATGCAATTCGTAAAGCGTGGATGGGAACTGGTGCTTCTTTGAGAAAAATGGAGAATCAGAAGGGTATAATTGCTAGATTTGTCATCGGAAGAAGTCCAAACCGTGGGGACAGTTTAGACAGAGCCATTGACGATGAAAATGGACAATATAATGATTTCATTATACATAATGACCATGTGGAGGCGCCTGAGGAGCTTTCAAAGAAGGCCAAGCTTTTTTTTGCTTATGCTGTTGATAAATGGGATGCCGAATTTTATGCCAAAGTCAATGATGACGTTTATATAAACATTGATGCTCTAGGGAGCACACTTGCTTCTTACTTGGACAAACCTCGTGTCTATGTTGGGTGCATGAAATCTGGTGAAGTATTCTCAGAACCGAGCCATAAATGGTACGAACCAGATTGGTGGAAATTTGGTGACAAAAAAACATACTTCCGTCATGCTTCTGGCGAAATGTATGTCATATCAAAAGCTCTGGCCAAGTTTATTTCAATAAACAGGCAAGTTTCTACATATCTCTTCTCCGTTCTTACGCCCATGACGATGTCACGACCGGTTCCTGGTTTATTGGGCTTGATGTCACATACATTGACGAGGGAAAGTTTTGCTGCTCATCTTGGTCTGCAGGAGCTATTTGTGCAGGTGTCTGATTAG

Protein sequence

MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDFIIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQVSTYLFSVLTPMTMSRPVPGLLGLMSHTLTRESFAAHLGLQELFVQVSD
Homology
BLAST of Spg039072 vs. NCBI nr
Match: KAG7016837.1 (Hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 580.9 bits (1496), Expect = 7.4e-162
Identity = 289/297 (97.31%), Postives = 293/297 (98.65%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFMVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKTNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINID LGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDVLGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           Y+GCMKSGEVFSEP+HKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINRQV
Sbjct: 241 YIGCMKSGEVFSEPNHKWYEPDWWKFGDKKAYFRHASGEMYVISKALAKFISINRQV 297

BLAST of Spg039072 vs. NCBI nr
Match: KAA0042571.1 (hydroxyproline O-galactosyltransferase HPGT1 [Cucumis melo var. makuwa])

HSP 1 Score: 577.8 bits (1488), Expect = 6.3e-161
Identity = 292/305 (95.74%), Postives = 297/305 (97.38%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLS MPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSGMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFTVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRS NRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKNNRDAIRKAWMGTGVSLRKMESQKGIIARFVIGRSSNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVDKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWNAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQVSTY 300
           YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV  Y
Sbjct: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV--Y 300

Query: 301 LFSVL 306
           + S+L
Sbjct: 301 VLSLL 303

BLAST of Spg039072 vs. NCBI nr
Match: XP_022922391.1 (hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita moschata] >KAG6579334.1 Hydroxyproline O-galactosyltransferase HPGT1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 577.8 bits (1488), Expect = 6.3e-161
Identity = 287/297 (96.63%), Postives = 292/297 (98.32%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFMVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKTNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINID LGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDVLGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           Y+GCMKSGEVFSEP+HKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINR +
Sbjct: 241 YIGCMKSGEVFSEPNHKWYEPDWWKFGDKKAYFRHASGEMYVISKALAKFISINRSL 297

BLAST of Spg039072 vs. NCBI nr
Match: XP_023551313.1 (hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 577.4 bits (1487), Expect = 8.2e-161
Identity = 287/297 (96.63%), Postives = 292/297 (98.32%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACRE+QKKLLALEM+LA ARQEGF+VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACRERQKKLLALEMDLAAARQEGFMVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKTNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINID LGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDVLGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           Y+GCMKSGEVFSEPSHKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINR +
Sbjct: 241 YIGCMKSGEVFSEPSHKWYEPDWWKFGDKKAYFRHASGEMYVISKALAKFISINRSL 297

BLAST of Spg039072 vs. NCBI nr
Match: XP_008437561.1 (PREDICTED: LOW QUALITY PROTEIN: hydroxyproline O-galactosyltransferase HPGT1 [Cucumis melo])

HSP 1 Score: 575.1 bits (1481), Expect = 4.1e-160
Identity = 287/297 (96.63%), Postives = 291/297 (97.98%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLS MPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSGMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFTVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKNNRDAIRKAWMGTGVSLRKMESQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAK FFAYAVDKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKXFFAYAVDKWNAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINR +
Sbjct: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL 297

BLAST of Spg039072 vs. ExPASy Swiss-Prot
Match: Q94F27 (Hydroxyproline O-galactosyltransferase HPGT1 OS=Arabidopsis thaliana OX=3702 GN=HPTG1 PE=1 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 4.0e-118
Identity = 210/300 (70.00%), Postives = 252/300 (84.00%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           M  KGS+ RLSS    SRI TLLL MFATFAS YVAGRLWQ+++ RV+LI ELDR+TGQG
Sbjct: 1   MARKGSSIRLSS----SRISTLLLFMFATFASFYVAGRLWQESQTRVHLINELDRVTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSR---ETNETKIPLVVIGV 120
           +SAISVDDTLKIIACREQ+K L ALEMEL+ ARQEGF+ K  +    T   K PLVVIG+
Sbjct: 61  KSAISVDDTLKIIACREQKKTLAALEMELSSARQEGFVSKSPKLADGTETKKRPLVVIGI 120

Query: 121 ITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQY 180
           +T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIGRS N+GDS+D++ID EN Q 
Sbjct: 121 MTSLGNKKKRDAVRQAWMGTGASLKKLESEKGVIARFVIGRSANKGDSMDKSIDTENSQT 180

Query: 181 NDFIIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDK 240
           +DFII +D VEAPEE SKK KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLA++L+ 
Sbjct: 181 DDFIILDDVVEAPEEASKKVKLFFAYAADRWDAQFYAKAIDNIYVNIDALGTTLAAHLEN 240

Query: 241 PRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Sbjct: 241 PRAYIGCMKSGEVFSEPNHKWYEPEWWKFGDKKAYFRHAYGEMYVITHALARFVSINRDI 296

BLAST of Spg039072 vs. ExPASy Swiss-Prot
Match: Q5XEZ1 (Hydroxyproline O-galactosyltransferase HPGT3 OS=Arabidopsis thaliana OX=3702 GN=HPGT3 PE=2 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 1.8e-73
Identity = 139/296 (46.96%), Postives = 200/296 (67.57%), Query Frame = 0

Query: 8   ARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVD 67
           AR S     S  P+++++ F+  A +YVAGRLWQDAENRV L   L +   Q    ++VD
Sbjct: 16  ARSSKFSQSSSKPSVIMAFFSCVAWLYVAGRLWQDAENRVVLNNILKKSYDQKPKVLTVD 75

Query: 68  DTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNNR 127
           D L ++ C++ +++++  EMEL +A+ +G+L      ++  K  L VIGV + FG    R
Sbjct: 76  DKLMVLGCKDLERRIVETEMELTLAKSQGYLKNLKSGSSSGKKLLAVIGVYSGFGSHLRR 135

Query: 128 DAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDFIIHNDHV 187
           +  R ++M  G +LRK+E ++GI+ RFVIGRSPNRGDSLDR ID+EN    DF+I  +H 
Sbjct: 136 NTFRGSYMPQGDALRKLE-ERGIVIRFVIGRSPNRGDSLDRKIDEENQARKDFLILENHE 195

Query: 188 EAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKS 247
           EA EEL+KK K FF+ AV  WDAEFY KV+D++ ++++ L   L S   +   Y+GCMKS
Sbjct: 196 EAQEELAKKVKFFFSAAVQNWDAEFYIKVDDNIDLDLEGLIGLLESRRGQDAAYIGCMKS 255

Query: 248 GEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINR-QVSTYLF 303
           GEV +E   KWYEP+WWKFGD+K+YFRHA+G + ++SK LA++++IN   + TY F
Sbjct: 256 GEVVAEEGGKWYEPEWWKFGDEKSYFRHAAGSLLILSKTLAQYVNINSGSLKTYAF 310

BLAST of Spg039072 vs. ExPASy Swiss-Prot
Match: Q94A05 (Hydroxyproline O-galactosyltransferase HPGT2 OS=Arabidopsis thaliana OX=3702 GN=HPGT2 PE=1 SV=1)

HSP 1 Score: 264.6 bits (675), Expect = 1.5e-69
Identity = 131/288 (45.49%), Postives = 192/288 (66.67%), Query Frame = 0

Query: 20  PTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQ 79
           P+L+L+ F+  A +YVAGRLWQDA+ R  L   L     Q    ++V+D L ++ C++ +
Sbjct: 27  PSLILAFFSCLAWLYVAGRLWQDAQYRAALNTVLKMNYDQRPKVLTVEDKLVVLGCKDLE 86

Query: 80  KKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGA 139
           ++++  EMELA A+ +G+L K    ++  K  L VIGV T FG    R+  R +WM    
Sbjct: 87  RRIVETEMELAQAKSQGYLKKQKSVSSSGKKMLAVIGVYTGFGSHLKRNKFRGSWMPRDD 146

Query: 140 SLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDFIIHNDHVEAPEELSKKAKL 199
           +L+K+E ++G++ RFVIGRS NRGDSLDR ID+EN    DF+I  +H EA EEL KK K 
Sbjct: 147 ALKKLE-ERGVVIRFVIGRSANRGDSLDRKIDEENRATKDFLILENHEEAQEELPKKVKF 206

Query: 200 FFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWY 259
           F++ AV  WDAEFY KV+D+V ++++ + + L S   +   Y+GCMKSG+V +E   +WY
Sbjct: 207 FYSAAVQNWDAEFYVKVDDNVDLDLEGMIALLESRRSQDGAYIGCMKSGDVITEEGSQWY 266

Query: 260 EPDWWKFGDKKTYFRHASGEMYVISKALAKFISINR-QVSTYLFSVLT 307
           EP+WWKFGD K+YFRHA+G + ++SK LA++++IN   + TY F   T
Sbjct: 267 EPEWWKFGDDKSYFRHATGSLVILSKNLAQYVNINSGLLKTYAFDDTT 313

BLAST of Spg039072 vs. ExPASy Swiss-Prot
Match: Q9MAP8 (Beta-1,6-galactosyltransferase GALT31A OS=Arabidopsis thaliana OX=3702 GN=GALT31A PE=1 SV=1)

HSP 1 Score: 204.1 bits (518), Expect = 2.5e-51
Identity = 119/299 (39.80%), Postives = 177/299 (59.20%), Query Frame = 0

Query: 10  LSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAIS-VDD 69
           +SS  +   +   LL+ F T   I  A     D    +  + + +   G   S +S   D
Sbjct: 24  ISSFLLGVLVVNRLLASFETVDGIERASPEQNDQSRSLNPLVDCESKEGDILSRVSHTHD 83

Query: 70  TLKIIACREQQKKLLALEMELAVAR------QEGFLVKHSRETNETKI---PLVVIGVIT 129
            +K +      K + +LE+ELA AR      ++G         +++KI      V+G++T
Sbjct: 84  VIKTL-----DKTISSLEVELATARAARSDGRDGSPAVAKTVADQSKIRPRMFFVMGIMT 143

Query: 130 RFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYND 189
            F  +  RD+IR  W+  G  L+++E +KGII RFVIG S + G  LD  I+ E  Q+ D
Sbjct: 144 AFSSRKRRDSIRGTWLPKGDELKRLETEKGIIMRFVIGHSSSPGGVLDHTIEAEEEQHKD 203

Query: 190 FIIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPR 249
           F   N H+E   ELS K +++F+ AV KWDA+FY KV+DDV++N+  LGSTLA +  KPR
Sbjct: 204 FFRLN-HIEGYHELSSKTQIYFSSAVAKWDADFYIKVDDDVHVNLGMLGSTLARHRSKPR 263

Query: 250 VYVGCMKSGEVFSEPSHKWYEPDWWKFGDK-KTYFRHASGEMYVISKALAKFISINRQV 298
           VY+GCMKSG V ++   K++EP++WKFG++   YFRHA+G++Y ISK LA +IS+NRQ+
Sbjct: 264 VYIGCMKSGPVLAQKGVKYHEPEYWKFGEEGNKYFRHATGQIYAISKDLATYISVNRQL 316

BLAST of Spg039072 vs. ExPASy Swiss-Prot
Match: Q9C809 (Probable beta-1,3-galactosyltransferase 8 OS=Arabidopsis thaliana OX=3702 GN=B3GALT8 PE=2 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 1.2e-50
Identity = 101/230 (43.91%), Postives = 152/230 (66.09%), Query Frame = 0

Query: 74  ACREQQKKLLALEMELAVAR-----QEGFLVKHSRETNETKIPLVVIGVITRFGRKNNRD 133
           A +  ++ +  LEMELA AR      E +  + ++  +  +    VIG+ T F  K  RD
Sbjct: 82  AVKSLERTMSTLEMELAAARTSDRSSEFWSERSAKNQSRLQKVFAVIGINTAFSSKKRRD 141

Query: 134 AIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDFIIHNDHVE 193
           ++R+ WM TG  L+K+E +KGI+ RFVIG S   G  LD+AID+E+ ++ DF +   H+E
Sbjct: 142 SVRQTWMPTGEKLKKIEKEKGIVVRFVIGHSATPGGVLDKAIDEEDSEHKDF-LRLKHIE 201

Query: 194 APEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSG 253
              +LS K +L+F+ A   +DAEFY KV+DDV++N+  L +TLA Y  +PR+Y+GCMKSG
Sbjct: 202 GYHQLSTKTRLYFSTATAMYDAEFYVKVDDDVHVNLGMLVTTLARYQSRPRIYIGCMKSG 261

Query: 254 EVFSEPSHKWYEPDWWKFGDK-KTYFRHASGEMYVISKALAKFISINRQV 298
            V S+   K++EP++WKFG++   YFRHA+G++Y ISK LA +IS N+ +
Sbjct: 262 PVLSQKGVKYHEPEFWKFGEEGNKYFRHATGQIYAISKDLATYISTNQGI 310

BLAST of Spg039072 vs. ExPASy TrEMBL
Match: A0A5A7TLW6 (Hexosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold44G00660 PE=3 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 3.0e-161
Identity = 292/305 (95.74%), Postives = 297/305 (97.38%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLS MPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSGMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFTVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRS NRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKNNRDAIRKAWMGTGVSLRKMESQKGIIARFVIGRSSNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVDKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWNAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQVSTY 300
           YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV  Y
Sbjct: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV--Y 300

Query: 301 LFSVL 306
           + S+L
Sbjct: 301 VLSLL 303

BLAST of Spg039072 vs. ExPASy TrEMBL
Match: A0A6J1E406 (Hexosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111430398 PE=3 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 3.0e-161
Identity = 287/297 (96.63%), Postives = 292/297 (98.32%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFMVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRK NRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKTNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINID LGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDVLGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           Y+GCMKSGEVFSEP+HKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINR +
Sbjct: 241 YIGCMKSGEVFSEPNHKWYEPDWWKFGDKKAYFRHASGEMYVISKALAKFISINRSL 297

BLAST of Spg039072 vs. ExPASy TrEMBL
Match: A0A1S3ATZ9 (Hexosyltransferase OS=Cucumis melo OX=3656 GN=LOC103482939 PE=3 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 2.0e-160
Identity = 287/297 (96.63%), Postives = 291/297 (97.98%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLS MPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSGMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFTVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKNNRDAIRKAWMGTGVSLRKMESQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAK FFAYAVDKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKXFFAYAVDKWNAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINR +
Sbjct: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL 297

BLAST of Spg039072 vs. ExPASy TrEMBL
Match: A0A6J1IBW9 (Hexosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111471553 PE=3 SV=1)

HSP 1 Score: 574.7 bits (1480), Expect = 2.6e-160
Identity = 284/297 (95.62%), Postives = 291/297 (97.98%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF+VKHSRETNETK+PLVVIG+ITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFMVKHSRETNETKVPLVVIGIITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRK NRDAIRKAWMGTGASLRKME QKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKTNRDAIRKAWMGTGASLRKMEKQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVD+WDAEFYAKVNDDVYINID LGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDRWDAEFYAKVNDDVYINIDVLGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           Y+GCMKSGEVFSEPSHKWYEPDWWKFGDKK YFRHASGEMYVISKALAKFISINR +
Sbjct: 241 YIGCMKSGEVFSEPSHKWYEPDWWKFGDKKAYFRHASGEMYVISKALAKFISINRSL 297

BLAST of Spg039072 vs. ExPASy TrEMBL
Match: A0A5D3C3M1 (Hexosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold376G00710 PE=3 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 4.4e-160
Identity = 287/297 (96.63%), Postives = 291/297 (97.98%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           MRSKGSNARLS MPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG
Sbjct: 1   MRSKGSNARLSGMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITR 120
           QSAISVDDTLKIIACREQQKKLLALEM+LA ARQEGF VKHSRETNETKIPLVVIGVITR
Sbjct: 61  QSAISVDDTLKIIACREQQKKLLALEMDLAAARQEGFTVKHSRETNETKIPLVVIGVITR 120

Query: 121 FGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDF 180
           FGRKNNRDAIRKAWMGTG SLRKME+QKGIIARFVIGRS NRGDSLDRAIDDENGQYNDF
Sbjct: 121 FGRKNNRDAIRKAWMGTGVSLRKMESQKGIIARFVIGRSSNRGDSLDRAIDDENGQYNDF 180

Query: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240
           IIHNDHVEAPEELSKKAKLFFAYAVDKW+AEFYAKVNDDVYINIDALGSTLASYLDKPRV
Sbjct: 181 IIHNDHVEAPEELSKKAKLFFAYAVDKWNAEFYAKVNDDVYINIDALGSTLASYLDKPRV 240

Query: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINR +
Sbjct: 241 YVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRSL 297

BLAST of Spg039072 vs. TAIR 10
Match: AT5G53340.1 (Galactosyltransferase family protein )

HSP 1 Score: 426.0 bits (1094), Expect = 2.9e-119
Identity = 210/300 (70.00%), Postives = 252/300 (84.00%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           M  KGS+ RLSS    SRI TLLL MFATFAS YVAGRLWQ+++ RV+LI ELDR+TGQG
Sbjct: 1   MARKGSSIRLSS----SRISTLLLFMFATFASFYVAGRLWQESQTRVHLINELDRVTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSR---ETNETKIPLVVIGV 120
           +SAISVDDTLKIIACREQ+K L ALEMEL+ ARQEGF+ K  +    T   K PLVVIG+
Sbjct: 61  KSAISVDDTLKIIACREQKKTLAALEMELSSARQEGFVSKSPKLADGTETKKRPLVVIGI 120

Query: 121 ITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQY 180
           +T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIGRS N+GDS+D++ID EN Q 
Sbjct: 121 MTSLGNKKKRDAVRQAWMGTGASLKKLESEKGVIARFVIGRSANKGDSMDKSIDTENSQT 180

Query: 181 NDFIIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDK 240
           +DFII +D VEAPEE SKK KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLA++L+ 
Sbjct: 181 DDFIILDDVVEAPEEASKKVKLFFAYAADRWDAQFYAKAIDNIYVNIDALGTTLAAHLEN 240

Query: 241 PRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Sbjct: 241 PRAYIGCMKSGEVFSEPNHKWYEPEWWKFGDKKAYFRHAYGEMYVITHALARFVSINRDI 296

BLAST of Spg039072 vs. TAIR 10
Match: AT5G53340.2 (Galactosyltransferase family protein )

HSP 1 Score: 426.0 bits (1094), Expect = 2.9e-119
Identity = 210/300 (70.00%), Postives = 252/300 (84.00%), Query Frame = 0

Query: 1   MRSKGSNARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQG 60
           M  KGS+ RLSS    SRI TLLL MFATFAS YVAGRLWQ+++ RV+LI ELDR+TGQG
Sbjct: 1   MARKGSSIRLSS----SRISTLLLFMFATFASFYVAGRLWQESQTRVHLINELDRVTGQG 60

Query: 61  QSAISVDDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSR---ETNETKIPLVVIGV 120
           +SAISVDDTLKIIACREQ+K L ALEMEL+ ARQEGF+ K  +    T   K PLVVIG+
Sbjct: 61  KSAISVDDTLKIIACREQKKTLAALEMELSSARQEGFVSKSPKLADGTETKKRPLVVIGI 120

Query: 121 ITRFGRKNNRDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQY 180
           +T  G K  RDA+R+AWMGTGASL+K+E++KG+IARFVIGRS N+GDS+D++ID EN Q 
Sbjct: 121 MTSLGNKKKRDAVRQAWMGTGASLKKLESEKGVIARFVIGRSANKGDSMDKSIDTENSQT 180

Query: 181 NDFIIHNDHVEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDK 240
           +DFII +D VEAPEE SKK KLFFAYA D+WDA+FYAK  D++Y+NIDALG+TLA++L+ 
Sbjct: 181 DDFIILDDVVEAPEEASKKVKLFFAYAADRWDAQFYAKAIDNIYVNIDALGTTLAAHLEN 240

Query: 241 PRVYVGCMKSGEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINRQV 298
           PR Y+GCMKSGEVFSEP+HKWYEP+WWKFGDKK YFRHA GEMYVI+ ALA+F+SINR +
Sbjct: 241 PRAYIGCMKSGEVFSEPNHKWYEPEWWKFGDKKAYFRHAYGEMYVITHALARFVSINRDI 296

BLAST of Spg039072 vs. TAIR 10
Match: AT2G25300.1 (Galactosyltransferase family protein )

HSP 1 Score: 277.7 bits (709), Expect = 1.3e-74
Identity = 139/296 (46.96%), Postives = 200/296 (67.57%), Query Frame = 0

Query: 8   ARLSSMPIRSRIPTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVD 67
           AR S     S  P+++++ F+  A +YVAGRLWQDAENRV L   L +   Q    ++VD
Sbjct: 16  ARSSKFSQSSSKPSVIMAFFSCVAWLYVAGRLWQDAENRVVLNNILKKSYDQKPKVLTVD 75

Query: 68  DTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNNR 127
           D L ++ C++ +++++  EMEL +A+ +G+L      ++  K  L VIGV + FG    R
Sbjct: 76  DKLMVLGCKDLERRIVETEMELTLAKSQGYLKNLKSGSSSGKKLLAVIGVYSGFGSHLRR 135

Query: 128 DAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDFIIHNDHV 187
           +  R ++M  G +LRK+E ++GI+ RFVIGRSPNRGDSLDR ID+EN    DF+I  +H 
Sbjct: 136 NTFRGSYMPQGDALRKLE-ERGIVIRFVIGRSPNRGDSLDRKIDEENQARKDFLILENHE 195

Query: 188 EAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKS 247
           EA EEL+KK K FF+ AV  WDAEFY KV+D++ ++++ L   L S   +   Y+GCMKS
Sbjct: 196 EAQEELAKKVKFFFSAAVQNWDAEFYIKVDDNIDLDLEGLIGLLESRRGQDAAYIGCMKS 255

Query: 248 GEVFSEPSHKWYEPDWWKFGDKKTYFRHASGEMYVISKALAKFISINR-QVSTYLF 303
           GEV +E   KWYEP+WWKFGD+K+YFRHA+G + ++SK LA++++IN   + TY F
Sbjct: 256 GEVVAEEGGKWYEPEWWKFGDEKSYFRHAAGSLLILSKTLAQYVNINSGSLKTYAF 310

BLAST of Spg039072 vs. TAIR 10
Match: AT4G32120.1 (Galactosyltransferase family protein )

HSP 1 Score: 264.6 bits (675), Expect = 1.1e-70
Identity = 131/288 (45.49%), Postives = 192/288 (66.67%), Query Frame = 0

Query: 20  PTLLLSMFATFASIYVAGRLWQDAENRVYLIKELDRLTGQGQSAISVDDTLKIIACREQQ 79
           P+L+L+ F+  A +YVAGRLWQDA+ R  L   L     Q    ++V+D L ++ C++ +
Sbjct: 27  PSLILAFFSCLAWLYVAGRLWQDAQYRAALNTVLKMNYDQRPKVLTVEDKLVVLGCKDLE 86

Query: 80  KKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNNRDAIRKAWMGTGA 139
           ++++  EMELA A+ +G+L K    ++  K  L VIGV T FG    R+  R +WM    
Sbjct: 87  RRIVETEMELAQAKSQGYLKKQKSVSSSGKKMLAVIGVYTGFGSHLKRNKFRGSWMPRDD 146

Query: 140 SLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDFIIHNDHVEAPEELSKKAKL 199
           +L+K+E ++G++ RFVIGRS NRGDSLDR ID+EN    DF+I  +H EA EEL KK K 
Sbjct: 147 ALKKLE-ERGVVIRFVIGRSANRGDSLDRKIDEENRATKDFLILENHEEAQEELPKKVKF 206

Query: 200 FFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMKSGEVFSEPSHKWY 259
           F++ AV  WDAEFY KV+D+V ++++ + + L S   +   Y+GCMKSG+V +E   +WY
Sbjct: 207 FYSAAVQNWDAEFYVKVDDNVDLDLEGMIALLESRRSQDGAYIGCMKSGDVITEEGSQWY 266

Query: 260 EPDWWKFGDKKTYFRHASGEMYVISKALAKFISINR-QVSTYLFSVLT 307
           EP+WWKFGD K+YFRHA+G + ++SK LA++++IN   + TY F   T
Sbjct: 267 EPEWWKFGDDKSYFRHATGSLVILSKNLAQYVNINSGLLKTYAFDDTT 313

BLAST of Spg039072 vs. TAIR 10
Match: AT1G77810.2 (Galactosyltransferase family protein )

HSP 1 Score: 205.7 bits (522), Expect = 6.1e-53
Identity = 107/232 (46.12%), Postives = 151/232 (65.09%), Query Frame = 0

Query: 67  DDTLKIIACREQQKKLLALEMELAVARQEGFLVKHSRETNETKIPLVVIGVITRFGRKNN 126
           D T +++   E  + L      L+  R    +V  S ETN  K   +V+G+ T F  +  
Sbjct: 72  DVTGEVLRTHEAIQSLDKSVSTLSSTRSSQEMVDGS-ETNPRKKVFMVMGINTAFSSRKR 131

Query: 127 RDAIRKAWMGTGASLRKMENQKGIIARFVIGRSPNRGDSLDRAIDDENGQYNDFIIHNDH 186
           RD++R+ WM  G  L ++E +KGI+ +F+IG S      LDRAID E+ Q+ DF +  +H
Sbjct: 132 RDSVRETWMPQGEKLERLEQEKGIVIKFMIGHSATSNSILDRAIDSEDAQHKDF-LRLEH 191

Query: 187 VEAPEELSKKAKLFFAYAVDKWDAEFYAKVNDDVYINIDALGSTLASYLDKPRVYVGCMK 246
           VE   ELS K K+FF+ AV KWDAEFY KV+DDV++N+  L STLA +  KPRVY+GCMK
Sbjct: 192 VEGYHELSAKTKIFFSTAVAKWDAEFYIKVDDDVHVNLGMLASTLARHRSKPRVYIGCMK 251

Query: 247 SGEVFSEPSHKWYEPDWWKFG-DKKTYFRHASGEMYVISKALAKFISINRQV 298
           SG V ++ + K++EP++WKFG D   YFRHA+G++Y ISK LA +ISIN+ +
Sbjct: 252 SGPVLAQKTVKYHEPEYWKFGEDGNKYFRHATGQIYAISKDLANYISINQPI 301

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7016837.17.4e-16297.31Hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita argyrosperma subsp. argy... [more]
KAA0042571.16.3e-16195.74hydroxyproline O-galactosyltransferase HPGT1 [Cucumis melo var. makuwa][more]
XP_022922391.16.3e-16196.63hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita moschata] >KAG6579334.1 ... [more]
XP_023551313.18.2e-16196.63hydroxyproline O-galactosyltransferase HPGT1 [Cucurbita pepo subsp. pepo][more]
XP_008437561.14.1e-16096.63PREDICTED: LOW QUALITY PROTEIN: hydroxyproline O-galactosyltransferase HPGT1 [Cu... [more]
Match NameE-valueIdentityDescription
Q94F274.0e-11870.00Hydroxyproline O-galactosyltransferase HPGT1 OS=Arabidopsis thaliana OX=3702 GN=... [more]
Q5XEZ11.8e-7346.96Hydroxyproline O-galactosyltransferase HPGT3 OS=Arabidopsis thaliana OX=3702 GN=... [more]
Q94A051.5e-6945.49Hydroxyproline O-galactosyltransferase HPGT2 OS=Arabidopsis thaliana OX=3702 GN=... [more]
Q9MAP82.5e-5139.80Beta-1,6-galactosyltransferase GALT31A OS=Arabidopsis thaliana OX=3702 GN=GALT31... [more]
Q9C8091.2e-5043.91Probable beta-1,3-galactosyltransferase 8 OS=Arabidopsis thaliana OX=3702 GN=B3G... [more]
Match NameE-valueIdentityDescription
A0A5A7TLW63.0e-16195.74Hexosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold44G00... [more]
A0A6J1E4063.0e-16196.63Hexosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111430398 PE=3 SV=1[more]
A0A1S3ATZ92.0e-16096.63Hexosyltransferase OS=Cucumis melo OX=3656 GN=LOC103482939 PE=3 SV=1[more]
A0A6J1IBW92.6e-16095.62Hexosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111471553 PE=3 SV=1[more]
A0A5D3C3M14.4e-16096.63Hexosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold376G0... [more]
Match NameE-valueIdentityDescription
AT5G53340.12.9e-11970.00Galactosyltransferase family protein [more]
AT5G53340.22.9e-11970.00Galactosyltransferase family protein [more]
AT2G25300.11.3e-7446.96Galactosyltransferase family protein [more]
AT4G32120.11.1e-7045.49Galactosyltransferase family protein [more]
AT1G77810.26.1e-5346.12Galactosyltransferase family protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 75..95
NoneNo IPR availableGENE3D3.90.550.50coord: 100..304
e-value: 1.5E-12
score: 49.3
NoneNo IPR availablePANTHERPTHR11214:SF74HYDROXYPROLINE O-GALACTOSYLTRANSFERASE HPGT1coord: 1..297
IPR002659Glycosyl transferase, family 31PFAMPF01762Galactosyl_Tcoord: 126..300
e-value: 1.0E-32
score: 113.5
IPR002659Glycosyl transferase, family 31PANTHERPTHR11214BETA-1,3-N-ACETYLGLUCOSAMINYLTRANSFERASEcoord: 1..297
IPR025298Domain of unknown function DUF4094PFAMPF13334DUF4094coord: 18..94
e-value: 8.7E-7
score: 29.3

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg039072.1Spg039072.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010405 arabinogalactan protein metabolic process
biological_process GO:0018258 protein O-linked glycosylation via hydroxyproline
biological_process GO:0006486 protein glycosylation
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:1990714 hydroxyproline O-galactosyltransferase activity
molecular_function GO:0008194 UDP-glycosyltransferase activity
molecular_function GO:0016758 hexosyltransferase activity