MS009770 (gene) Bitter gourd (TR) v1

Overview
Name: MS009770
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: peptidyl serine alpha-galactosyltransferase
Location: scaffold173: 125737 .. 132000 (-)
RNA-Seq Expression: MS009770
Synteny: MS009770
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGGGAATTATTGGTTTTTGTGGCGATTTCTTTGGTGGGGTTTGTGGCCGGCAATGGGCGGAGCAACAACACTGGCACGGCGGCGCCGTGGCGGATTCACACTCTGTTCTCGGTCGAGTGCCAGAACTACTTCGATTGGCAAACTGTTGGGCTGATGAACAGCTTCAAGAAGTCGCAGCAGCCGGGGCCAATCACCCGCTTGCTAAGTTGCACCGATGAGGAGAAGAAGAATTACAGAGGGATGAATTTGGCTCCCACTTTTGAGGTCCCATCCATGAGTAGGCATCCAAAAACTGGGGACTGGTTAGAGTTTCTTCACCTTTTCCCTTTCATTTCTAGAACTTTGGAGTTGAGCATTTGGGTTTTGAATTGCTTTTGGCGTGCTTCTTGGTAGTAATATGGTTGTTTTGATTGCTATCTGTGTTTGTTTAATTTGATTTCAGTTATGAGATTGTTCATTTGTTGACTAGATTGATCAGTTTGTGTACAGTTTTTTCCCCTATGCTTCTGGTTAGTAATCTTCTTGTGTTTTGATTGCTATCTTTATTAGTTTGTTTGGTTTTAGTTCTGAAATTGTTCATCTGCTGACTAGATCGATCAGTGTGCTTACAGTTTCTTCTTCCCTTTTTCCTTTTTTTTTGGCTGTTTTTGCGTTGACCTGTTGGATTAGGATCAGAAAATGATGAAATTCTGATTGGTTTATGTGCTCTGTTTAACCGTTTGCTGCTATTCAGTGATCAAAATTTAGCTTGTTTACTAATCAGTAACTGCGAAGCCAGTGGAAGAGGCTTCCTGTTTAGAAAATATATCTCAAGGGCTGTAGAGAGCTGTGATATTCTGGTTTGAAAGATAATGTTGTTCAATCATCAATGATGTAGGGGTGATTAGAGGAGAAGATATTGAAATGGAACGTGAAATTGTAAACCGGATAAGGGGAAATGAAATGGGAGAAAATGGGAAGAGTTATTTGGAGAAGGTTGGGGACTGGCTTGCATTTTGACCTTGTTCAATCTTGATGGAAGAGATCGTACCCATATTTGTGTATCAATAACTCTCCACGATCCAACTTTCAAGAATATGGATAGAGTAGCTACAGGAAGAAGAATATTTGTGTAAATTGCTCGAACAAGAGGCTTTAGAATTACTTCGGAATATGTATTCTTTGTTGAGAAAAATTGATTGAATTTTGCACTGATGAACAGGGGGGATGTTTTCCGTTGATAATTAATATCTGAGGAGAAAGCTTGTTTTCGCCTTTGTTTGACTTGATAATTTATGGATGTTCTTTTTCATACTCACTCATCAAAGTAGCAATTAGGATCCCAAATTTTTTCTTTTCATTCTTGTCTTTTTGATGATTTGAACAACTCTCTCTTACTATATTTCTTCCGACTTCACAACTTGTTAATTTTTCTTTTTCCTCCTCTTGAATCCTGGCGACAAATGAAAAGAAGAATGATACCAAAATAAAATATTTACATGTTTTTGCAGGTATCCTGCAATAAATAAACCTGCAGGGGTTGTCCACTGGCTTAAACATAGCAAAGAAGCAGAGAATGTTGATTGGGTTGTGATTTTGGATGCAGACATGATCATAAGAGGCCCCATAATACCTTGGGAACTTGGTGCTGAGAAGGGCAGACCTGTTGCAGCATATTATGGGTTTGTTTCTTCTCTATTTTAGTTCAGTCTCTTATAAATGGAAACTTCAATCAAAGTTACTTGGATTGATGTTGCTGGTCTTCTGCAGTTAAAATAAAAAAGGAAAGAAACGGTTTAAATCTTGTTTTGGGACCTATTTAATACTTCTTTGTGGGCCTCGGTTACTAAGGAATTCTGTACTTATCAGTTTGGTGTTCTTATTGTGGATTGGAGCCCCTTTTTTAGACTTCGCTTCCTTTTTTGGGCTGAATTTTTTCTTTAACGCCCTTGTACATCCTTTCATTTGTCTAAATGAAAGCTTAGTTTCTTATTAAAAGAGAAAAAAAATACCTAAAG
TTTTGTTTTAGTCCTTATACTTTCACTATTATTATTCCCTCAACTTTTACATAAAAAAGTGGTTTTAGTCCTTATCATTAATTTTCTACTAATCATTCAGAAAAACCTAAATCTATTTTGAAATTCAACTCTTCCCTACTACATGTGACTTAAAATACCTGTCCATATAGATACTTCTTGATTAAGTAAAAATGTAGGGGCATTTTGTTGAATGTTCAGAGGAAAACAAAATGTTTGAAAGTTTAAGGTCCAAAATAGAATAAGCATGAAGTTTAAGGGCTAGAATAGGATTAGAAAAAACTAACATATGGTATTTCTGTTGTGAGATTTGTATGGCAATAAGGAACTTCTTGCATTTATATCTTTCCTAGTTGTTACTGAGTTTGAATGATGTAAATAATATTTCTCTCAGTTACTTGGTTGGATGTGACAATATTCTTGCTAAACTGCACACTAAGCACCCAGAGCTCTGTGACAAAGTTGGTGGGTTGTTAGCAATGCATATAGATGATCTTCGAGTGTTCGCACCAATGTGGCTTTCGAAGACGGAAGAAGTACGTGAAGATAGAGATCACTGGGCGACCAACATAACTGGTGATATCTATGGGAAAGGGTGGATAAGTGAGATGTATGGTTACTCGTTTGGAGCAGCAGAAGTAAGCTTTATTTTTATTTCAACTTTTCCAAAGGGCAATATGAAAAGCTTTTGCTATCTCGCCGTGTCTATTGACTGCTCGGTATGTTTCTGTTATTCTATAGATTTATGTTTTGCAACTTCTTTGCTTATTTCAGGTTGGTCTCCGACACAAAATCAATGACAATTTGATGATCTACCCCGGTTATATTCCTCGTCCAGACATCGAGCCTATACTTCTTCACTATGGGTTGCCATTTAGTGTGGGAAATTGGTCCTTTAGTAAATTAAATCACCATGAAGATGGTATTGTCTATGACTGTAACCGGCTTTTCCCAGAACCTCCATATCCTCGAGAGGTATGTGCACAAATGATTGGTTCCTTAGATATTTCGATGCGTTTTTATGAAAAACATTTGGAGAAGGTCTTCACTGAACACATTTTTTTGCATTGAATTCGTTATATTCTAGTTTGTTACTTAAGATTGAACTCTGTGCAGATACAACAAATGGAATCTGATCCAAATAAGAAGCGGGCGCTACTTATAAATATAGAGTGTATCAACTTATTGAATGAGGGCCTATTGTTGCAACATAAGCGAAACGAATGCCCGAAGCCACAGTGGTCAAAATATTTAAGCTTTTTAAAGAGTAAAACTTTTACTGACCTAACTAAACCCAAGTATCCAACCCCTGCTACTCTAGTGATGAGGGAAGATCATGTTCAGAAACAACCGATGAAGGAAGATCATGGTCAGAAACAACCGGTGAAGGAAAATCATGTTCAGAAGCAACCGGTGTTTGATGAGCCGAATAAATCACATCCAAAGATTCACACACTTTTCTCGACGGAGTGTACTCCTTATTTCGACTGGCAGACTGTAGGCCTTATGCATAGTTTCCGCCTGAGCGGCCAACCTGGAAACATTACGCGACTTCTCAGCTGTACCAGTGAGGACTTGAAAGAATACAAAGGTCACGATCTGGCTCCGACGCATTATGTTCCTTCCATGAATCGACATCCACTAACCGGCGACTGGTAATCTCTTTTTTGTATTCACCCATGAGCCTGCTTTTACTCTTTTGCTTATCGCTCAGCTTTTTCGGTTTCCCAATTTTATTAGGTATCACATACTATTTTGGAGAGGAATGGAATATTTCTGTGTATCATTATATGGAGTTTTTTTGGGACCTCATAAGGTTTAATGTTTTCTTTACGGGGTCTGTTTCGAAATTGTTTTGTAACTATCAGTTAGGTCATATTCTTTTGGATCGGAGGCCCTTTCTTTAGTTTGGCTTTCCTCTTCAGGGCTTTGTTTTTGTATGCCGTACGTCCGTTTTGTTTTCATTATTCTCAATGAAAAAT
TTGGTTTCTTATAAAAAAAAAAGAAAATTTATATGGATATAAAGCAAGAAACCATCTATTTTTTTTCTTTATCTTCTTCCTATTTTCTACAAATTTCTCTCTCTGCTATCTAATGTGCATTATATTTACCCTTCAAAACTACAATCTTACAAATTTTTCAATCCTGCCTATGTTGACCCAAAAAAAAAAAAGATGTTCAAAGATTTGGGAGGAAAATTAAATCTCGAATCTTTCTGTTCCGTCTTAGGTATCCGGCCATAAACAAGCCAGCTGCAGTGCTTCATTGGCTCAATCATGTAAACACTGATGCAGAATTTATCGTTATTCTTGATGCTGATATGATCTTGAGAGGACCAATTACGCCGTGGGAGTTCAAAGCAGCTCGTGGACGTCCTGTTTCAACTCCCTATGAGTAAGAATCTATTCAAGTATTTCTCTCTTTGATAAATATGCAGAACCCCTCTCTCCCATATATATGTGTGCATGTAGGCGTGTAAATGACATAAGAATATGCATCTTTTGTAATTATCCTATTGGTCTTATTCGTTTGAAGTGGAGCTCTTTTTCTATCTCGTTAGGTCTAGGTGTCGTTTTGTCTCTCCCTTCTCTAGGCTGTGGTTTTTGTAAGCCTCTTTTGTATTCTTCCGCTTTTCTCAATGAAAGTCCGGTTTCATTCTTTTAAGAAAAAAAAAAATGCTATTGTTTGCATATGAACTTGATTTCTCTTTTTAAGTTTTAGTAGGAAATATTTTGACTTGTGTCAAATATATGTTTGTAGTTACCTTATTGGCTGTGACAATGTGCTTGCCAAACTCCATACAAGCCATCCTGAAGCTTGTGACAAAGTTGGGGGTGTTATTATCATGCACGTAGATGATCTCAGGAAATTTGCCATGCTATGGTTGCATAAAACCGAGGAGGTCCGAGCCGACCGAGCTCATTATGCAACGAATATCACGGGAGATATATATGAATCTGGCTGGATCAGTGAGATGTATGGTTACTCATTCGGTGCTGCCGAGGTACTTAGGCTATGGTTTAGCTAGTTGTTCTTTTTTAGTTTAAATCTTCGATTGGTCTTTGAACTTTCATGCTTATTTCGGCTCATAGTTATATTTTAGTTCCTCAACTTAATGTTTATAAAAAAAAGAAAGAAAGAAAAAACACTTTAAAATGTTTTGCTTTAGCCCTTATAGCTTTGCATAAAAGACTGATTTAGTCCTTATTGTTGAATTTCAGTTAATTAGTTTTTGAAAAAAATTACAAACCATTTCAAAACCCGTTTTCAATTTGAACACGTTTCAAGTGACTTTAAATACTTATCTTTAGACGTATCTGCTCATTAGGTAGTAAACAAAAGACATTCTTATGTCAAATTTTTAATGCAAGATCGAGGAACTAGAATAGAACATTTTGAAAGTTTACGGACCAAAACAAATCAGCCTGAAAGTTGAGGTTATTGTTGTATAGGTTGCATGTTTCTTCACCATTCAATCATTTCCAGGTCTTTATCGTTAACCAAGCTGAAATCAGTATTGAATATCCTCAAACCATTACAATGATACTGAAACTGAAAACATTTCTTTTTCACTCTCGAACAGCTGCAATTACGGCACATTCGAAACAGCAAGATCTTGATATACCCGGGATACGTTCCTGAATCTGGAGTTCATTACAGAGTGTTTCACTATGGACTTGAATTTAAAGTGGGGAATTGGAGCTTTGATAAGGCAAATTGGAGGGAAACTGATGTGGTGAACAAATGCTGGGCTCAATTTCCTGATCCACCAGATACTTCCACACTTGATCCAACTGACAAGGAAGCTTTTGACAGGGACTTGCTTAGCATAGAGTGTATAAGAACTCTGAATGAAGCTCTGAATTTGCATCATAAGAAGATGAACTGCCCCGATCCTAACTCATTGGCCAACTCGAACCCGGAATATGAAAGTGAAGCTGTGGTTTCGAGGAAAGTTGGCAAGCTTGATGAAAGCTATACT
GAAAAAGTTGACAATTTGTCTCGGGAATTGTCGGAGGAGGCGAAGGATGATGGGATGTTTAGTTCTCTGAGGTTGTGGATAATTGCTCTGTGGGTGATATCTGGTTTGGTGTTCTTGGTAGTGATCGTATCTAGGTTTTCGGGTCGAAAAGGGAAGGGGATGAGAGGCAAACATCACAGGAACAAGAGGAGAACCGCTTCATATTCAGGTTTCATGGATCGGAACGGGCATGAGAAGTATGTTCGAGATCTCGATGCCTCCTTG

mRNA sequence

ATGAGGGAATTATTGGTTTTTGTGGCGATTTCTTTGGTGGGGTTTGTGGCCGGCAATGGGCGGAGCAACAACACTGGCACGGCGGCGCCGTGGCGGATTCACACTCTGTTCTCGGTCGAGTGCCAGAACTACTTCGATTGGCAAACTGTTGGGCTGATGAACAGCTTCAAGAAGTCGCAGCAGCCGGGGCCAATCACCCGCTTGCTAAGTTGCACCGATGAGGAGAAGAAGAATTACAGAGGGATGAATTTGGCTCCCACTTTTGAGGTCCCATCCATGAGTAGGCATCCAAAAACTGGGGACTGGTATCCTGCAATAAATAAACCTGCAGGGGTTGTCCACTGGCTTAAACATAGCAAAGAAGCAGAGAATGTTGATTGGGTTGTGATTTTGGATGCAGACATGATCATAAGAGGCCCCATAATACCTTGGGAACTTGGTGCTGAGAAGGGCAGACCTGTTGCAGCATATTATGGTTACTTGGTTGGATGTGACAATATTCTTGCTAAACTGCACACTAAGCACCCAGAGCTCTGTGACAAAGTTGGTGGGTTGTTAGCAATGCATATAGATGATCTTCGAGTGTTCGCACCAATGTGGCTTTCGAAGACGGAAGAAGTACGTGAAGATAGAGATCACTGGGCGACCAACATAACTGGTGATATCTATGGGAAAGGGTGGATAAGTGAGATGTATGGTTACTCGTTTGGAGCAGCAGAAGTTGGTCTCCGACACAAAATCAATGACAATTTGATGATCTACCCCGGTTATATTCCTCGTCCAGACATCGAGCCTATACTTCTTCACTATGGGTTGCCATTTAGTGTGGGAAATTGGTCCTTTAGTAAATTAAATCACCATGAAGATGGTATTGTCTATGACTGTAACCGGCTTTTCCCAGAACCTCCATATCCTCGAGAGATACAACAAATGGAATCTGATCCAAATAAGAAGCGGGCGCTACTTATAAATATAGAGTGTATCAACTTATTGAATGAGGGCCTATTGTTGCAACATAAGCGAAACGAATGCCCGAAGCCACAGTGGTCAAAATATTTAAGCTTTTTAAAGAGTAAAACTTTTACTGACCTAACTAAACCCAAGTATCCAACCCCTGCTACTCTAGTGATGAGGGAAGATCATGTTCAGAAACAACCGATGAAGGAAGATCATGGTCAGAAACAACCGGTGAAGGAAAATCATGTTCAGAAGCAACCGGTGTTTGATGAGCCGAATAAATCACATCCAAAGATTCACACACTTTTCTCGACGGAGTGTACTCCTTATTTCGACTGGCAGACTGTAGGCCTTATGCATAGTTTCCGCCTGAGCGGCCAACCTGGAAACATTACGCGACTTCTCAGCTGTACCAGTGAGGACTTGAAAGAATACAAAGGTCACGATCTGGCTCCGACGCATTATGTTCCTTCCATGAATCGACATCCACTAACCGGCGACTGGTATCCGGCCATAAACAAGCCAGCTGCAGTGCTTCATTGGCTCAATCATGTAAACACTGATGCAGAATTTATCGTTATTCTTGATGCTGATATGATCTTGAGAGGACCAATTACGCCGTGGGAGTTCAAAGCAGCTCGTGGACGTCCTGTTTCAACTCCCTATGATTACCTTATTGGCTGTGACAATGTGCTTGCCAAACTCCATACAAGCCATCCTGAAGCTTGTGACAAAGTTGGGGGTGTTATTATCATGCACGTAGATGATCTCAGGAAATTTGCCATGCTATGGTTGCATAAAACCGAGGAGGTCCGAGCCGACCGAGCTCATTATGCAACGAATATCACGGGAGATATATATGAATCTGGCTGGATCAGTGAGATGTATGGTTACTCATTCGGTGCTGCCGAGCTGCAATTACGGCACATTCGAAACAGCAAGATCTTGATATACCCGGGATACGTTCCTGAATCTGGAGTTCATTACAGAGTGTTTCACTATGGACTTGAATTTAAAGTGGGGAATTGGAGCTTTGATAAGGC
AAATTGGAGGGAAACTGATGTGGTGAACAAATGCTGGGCTCAATTTCCTGATCCACCAGATACTTCCACACTTGATCCAACTGACAAGGAAGCTTTTGACAGGGACTTGCTTAGCATAGAGTGTATAAGAACTCTGAATGAAGCTCTGAATTTGCATCATAAGAAGATGAACTGCCCCGATCCTAACTCATTGGCCAACTCGAACCCGGAATATGAAAGTGAAGCTGTGGTTTCGAGGAAAGTTGGCAAGCTTGATGAAAGCTATACTGAAAAAGTTGACAATTTGTCTCGGGAATTGTCGGAGGAGGCGAAGGATGATGGGATGTTTAGTTCTCTGAGGTTGTGGATAATTGCTCTGTGGGTGATATCTGGTTTGGTGTTCTTGGTAGTGATCGTATCTAGGTTTTCGGGTCGAAAAGGGAAGGGGATGAGAGGCAAACATCACAGGAACAAGAGGAGAACCGCTTCATATTCAGGTTTCATGGATCGGAACGGGCATGAGAAGTATGTTCGAGATCTCGATGCCTCCTTG

Coding sequence (CDS)

ATGAGGGAATTATTGGTTTTTGTGGCGATTTCTTTGGTGGGGTTTGTGGCCGGCAATGGGCGGAGCAACAACACTGGCACGGCGGCGCCGTGGCGGATTCACACTCTGTTCTCGGTCGAGTGCCAGAACTACTTCGATTGGCAAACTGTTGGGCTGATGAACAGCTTCAAGAAGTCGCAGCAGCCGGGGCCAATCACCCGCTTGCTAAGTTGCACCGATGAGGAGAAGAAGAATTACAGAGGGATGAATTTGGCTCCCACTTTTGAGGTCCCATCCATGAGTAGGCATCCAAAAACTGGGGACTGGTATCCTGCAATAAATAAACCTGCAGGGGTTGTCCACTGGCTTAAACATAGCAAAGAAGCAGAGAATGTTGATTGGGTTGTGATTTTGGATGCAGACATGATCATAAGAGGCCCCATAATACCTTGGGAACTTGGTGCTGAGAAGGGCAGACCTGTTGCAGCATATTATGGTTACTTGGTTGGATGTGACAATATTCTTGCTAAACTGCACACTAAGCACCCAGAGCTCTGTGACAAAGTTGGTGGGTTGTTAGCAATGCATATAGATGATCTTCGAGTGTTCGCACCAATGTGGCTTTCGAAGACGGAAGAAGTACGTGAAGATAGAGATCACTGGGCGACCAACATAACTGGTGATATCTATGGGAAAGGGTGGATAAGTGAGATGTATGGTTACTCGTTTGGAGCAGCAGAAGTTGGTCTCCGACACAAAATCAATGACAATTTGATGATCTACCCCGGTTATATTCCTCGTCCAGACATCGAGCCTATACTTCTTCACTATGGGTTGCCATTTAGTGTGGGAAATTGGTCCTTTAGTAAATTAAATCACCATGAAGATGGTATTGTCTATGACTGTAACCGGCTTTTCCCAGAACCTCCATATCCTCGAGAGATACAACAAATGGAATCTGATCCAAATAAGAAGCGGGCGCTACTTATAAATATAGAGTGTATCAACTTATTGAATGAGGGCCTATTGTTGCAACATAAGCGAAACGAATGCCCGAAGCCACAGTGGTCAAAATATTTAAGCTTTTTAAAGAGTAAAACTTTTACTGACCTAACTAAACCCAAGTATCCAACCCCTGCTACTCTAGTGATGAGGGAAGATCATGTTCAGAAACAACCGATGAAGGAAGATCATGGTCAGAAACAACCGGTGAAGGAAAATCATGTTCAGAAGCAACCGGTGTTTGATGAGCCGAATAAATCACATCCAAAGATTCACACACTTTTCTCGACGGAGTGTACTCCTTATTTCGACTGGCAGACTGTAGGCCTTATGCATAGTTTCCGCCTGAGCGGCCAACCTGGAAACATTACGCGACTTCTCAGCTGTACCAGTGAGGACTTGAAAGAATACAAAGGTCACGATCTGGCTCCGACGCATTATGTTCCTTCCATGAATCGACATCCACTAACCGGCGACTGGTATCCGGCCATAAACAAGCCAGCTGCAGTGCTTCATTGGCTCAATCATGTAAACACTGATGCAGAATTTATCGTTATTCTTGATGCTGATATGATCTTGAGAGGACCAATTACGCCGTGGGAGTTCAAAGCAGCTCGTGGACGTCCTGTTTCAACTCCCTATGATTACCTTATTGGCTGTGACAATGTGCTTGCCAAACTCCATACAAGCCATCCTGAAGCTTGTGACAAAGTTGGGGGTGTTATTATCATGCACGTAGATGATCTCAGGAAATTTGCCATGCTATGGTTGCATAAAACCGAGGAGGTCCGAGCCGACCGAGCTCATTATGCAACGAATATCACGGGAGATATATATGAATCTGGCTGGATCAGTGAGATGTATGGTTACTCATTCGGTGCTGCCGAGCTGCAATTACGGCACATTCGAAACAGCAAGATCTTGATATACCCGGGATACGTTCCTGAATCTGGAGTTCATTACAGAGTGTTTCACTATGGACTTGAATTTAAAGTGGGGAATTGGAGCTTTGATAAGGC
AAATTGGAGGGAAACTGATGTGGTGAACAAATGCTGGGCTCAATTTCCTGATCCACCAGATACTTCCACACTTGATCCAACTGACAAGGAAGCTTTTGACAGGGACTTGCTTAGCATAGAGTGTATAAGAACTCTGAATGAAGCTCTGAATTTGCATCATAAGAAGATGAACTGCCCCGATCCTAACTCATTGGCCAACTCGAACCCGGAATATGAAAGTGAAGCTGTGGTTTCGAGGAAAGTTGGCAAGCTTGATGAAAGCTATACTGAAAAAGTTGACAATTTGTCTCGGGAATTGTCGGAGGAGGCGAAGGATGATGGGATGTTTAGTTCTCTGAGGTTGTGGATAATTGCTCTGTGGGTGATATCTGGTTTGGTGTTCTTGGTAGTGATCGTATCTAGGTTTTCGGGTCGAAAAGGGAAGGGGATGAGAGGCAAACATCACAGGAACAAGAGGAGAACCGCTTCATATTCAGGTTTCATGGATCGGAACGGGCATGAGAAGTATGTTCGAGATCTCGATGCCTCCTTG

Protein sequence

MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQQPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKEAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHTLFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVGNWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHHKKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKYVRDLDASL
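
The protein sequence above can be checked against the coding sequence by translating it with the standard genetic code. The following is a minimal sketch, not the database's own tooling: the names `CODON_TABLE`, `translate`, and `cds_prefix` are illustrative, and only the first 42 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases of the CDS listed above (illustrative excerpt).
cds_prefix = "ATGAGGGAATTATTGGTTTTTGTGGCGATTTCTTTGGTGGGG"
print(translate(cds_prefix))  # MRELLVFVAISLVG
```

The output matches the first 14 residues of the protein sequence. Passing the full CDS to the same function should reproduce the complete protein, assuming the sequence is rejoined into a single string first.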
Homology
BLAST of MS009770 vs. NCBI nr
Match: XP_022154175.1 (peptidyl serine alpha-galactosyltransferase [Momordica charantia])

HSP 1 Score: 1783.8 bits (4619), Expect = 0.0e+00
Identity = 843/844 (99.88%), Positives = 843/844 (99.88%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ
Sbjct: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT
Sbjct: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR
Sbjct: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH
Sbjct: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLR 780
           KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLR
Sbjct: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLR 780

Query: 781 LWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKYVRDL 840
           LWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRT SYSGFMDRNGHEKYVRDL
Sbjct: 781 LWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTTSYSGFMDRNGHEKYVRDL 840

Query: 841 DASL 845
           DASL
Sbjct: 841 DASL 844
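
The percentages on each HSP summary line are the matched positions divided by the alignment length. As a rough sketch of that arithmetic, the snippet below parses a summary line and recomputes the values; the function name `parse_fractions` and the sample string are illustrative assumptions, not part of any BLAST tool.

```python
import re


def parse_fractions(line):
    """Return {label: (matches, length, percent)} for each 'Label = a/b' pair."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        result[label] = (matches, length, round(100 * matches / length, 2))
    return result


# Summary line for the first hit above (XP_022154175.1).
hsp_line = "Identity = 843/844 (99.88%), Positives = 843/844 (99.88%), Query Frame = 0"
stats = parse_fractions(hsp_line)
print(stats["Identity"])  # (843, 844, 99.88)
```

The `Query Frame = 0` field is skipped because it carries no `a/b` fraction.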

BLAST of MS009770 vs. NCBI nr
Match: XP_038899299.1 (peptidyl serine alpha-galactosyltransferase [Benincasa hispida])

HSP 1 Score: 1611.7 bits (4172), Expect = 0.0e+00
Identity = 765/848 (90.21%), Positives = 800/848 (94.34%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           M+E L+FVAI LVGFVAG+G SNN+G A P RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAPPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNY+GM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYKGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAE+VDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAEDVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKL+HHEDGIVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLDHHEDGIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLLLQHKRN CPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           FTDLTKPKYPTPATLVM+ED VQKQP+K+D  QKQPVKE+ VQKQPV DE  + +PKIHT
Sbjct: 361 FTDLTKPKYPTPATLVMKEDRVQKQPVKKDLVQKQPVKEDLVQKQPVLDELQEPYPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTECT YFDWQTVGLMHSF LSGQPGNITRLLSCT EDLKEYKGH+LAPTHYVPSM+R
Sbjct: 421 LFSTECTTYFDWQTVGLMHSFHLSGQPGNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMI+RG ITPWEFKAARG PVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGSITPWEFKAARGHPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMH+DDLRKFAMLWLHKTEEVRADRAHYA N
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYAKN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIY+SGWISEMYGYSFGAAELQLRHIRN++IL+YPGYVP+ GVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYQSGWISEMYGYSFGAAELQLRHIRNNEILLYPGYVPDPGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSFDKANWRETD+VN CWA FP PPD STLD TDK+AF RDLLSIECIRTLNEAL LHH
Sbjct: 661 NWSFDKANWRETDLVNTCWAHFPVPPDPSTLDQTDKDAFARDLLSIECIRTLNEALYLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNL----SRELSEEAKDDGMF 780
           KK NC DPN+L NS  EYESEA VSRK+GKLDESY  K D+L    S+E SEEAK+DG+F
Sbjct: 721 KKRNCSDPNALTNSKSEYESEAGVSRKIGKLDESYIGKDDHLSTESSQESSEEAKEDGIF 780

Query: 781 SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKY 840
           SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKG+RGKHHR KRRTASYSGF+DRNG EKY
Sbjct: 781 SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKY 840

Query: 841 VRDLDASL 845
            RDLDASL
Sbjct: 841 ARDLDASL 848

BLAST of MS009770 vs. NCBI nr
Match: KGN58321.2 (hypothetical protein Csa_017560 [Cucumis sativus])

HSP 1 Score: 1597.0 bits (4134), Expect = 0.0e+00
Identity = 756/848 (89.15%), Positives = 797/848 (93.99%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MRE L+FVAI LVGFVA +G +NN+G AAP RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MREFLLFVAIFLVGFVASDGWTNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKK YRGM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKIN+NLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP
Sbjct: 241 VGLRHKINENLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLL QHKRN CPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLWQHKRNGCPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           FTDLTKPKYPTPA+LVM+ED VQKQP+K D  QKQPVKE+ VQKQPV DE  + +PKIHT
Sbjct: 361 FTDLTKPKYPTPASLVMKEDCVQKQPVKVDRVQKQPVKEDLVQKQPVLDELQEPYPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTECT YFDWQTVGLMHSFRLSGQPGNITRLLSCT EDLK+YKGH+LAPTHYVPSM+R
Sbjct: 421 LFSTECTTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAE+IVILDADMI+RG ITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEYIVILDADMIMRGSITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMH+DDLRKF+MLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFSMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIY+SGWISEMYGYSFGAAELQLRHIR+S+IL+YPGY P+ GVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYQSGWISEMYGYSFGAAELQLRHIRSSEILLYPGYAPDPGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSFDKANWRETD+VN+CWAQFP PPD STLD +DK+ F RDLLSIECIRTLNEAL LHH
Sbjct: 661 NWSFDKANWRETDLVNRCWAQFPAPPDPSTLDQSDKDGFARDLLSIECIRTLNEALYLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEE----AKDDGMF 780
           KK NC DPN LAN N + ESE  VSRK+GKLDESYT K D+LS + S+E    AK+DG+F
Sbjct: 721 KKRNCSDPNLLANPNLDDESEVGVSRKIGKLDESYTGKEDHLSTDSSQESSQAAKEDGIF 780

Query: 781 SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKY 840
            SLRLWIIALWVISGLVFLVVI+S+FSGRK KG+RGKHHR KRRTASYSGF+DRNG EKY
Sbjct: 781 GSLRLWIIALWVISGLVFLVVIISKFSGRKAKGVRGKHHRIKRRTASYSGFVDRNGQEKY 840

Query: 841 VRDLDASL 845
           VRDLDASL
Sbjct: 841 VRDLDASL 848

BLAST of MS009770 vs. NCBI nr
Match: KAG6581066.1 (Peptidyl serine alpha-galactosyltransferase, partial [Cucurbita argyrosperma subsp. sororia] >KAG7017795.1 Peptidyl serine alpha-galactosyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1596.6 bits (4133), Expect = 0.0e+00
Identity = 759/848 (89.50%), Positives = 794/848 (93.63%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MR  LVFVA+ L+GFVAG+GRS N+  AAP RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MRGFLVFVAVCLMGFVAGDGRSINSDMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMDLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKL HHED IVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLYHHEDDIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLLLQHKRN CPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           F DLTKPKYPTPATLVM+ED VQKQP+KED  QKQPVKE  VQKQPV DE  + +PKIHT
Sbjct: 361 FADLTKPKYPTPATLVMKEDRVQKQPVKEDRVQKQPVKEELVQKQPVLDELQEPYPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTEC+ YFDWQTVGLMHSFRLSGQPGNITRLLSCT EDLK+YKGH+LAPTHYVPSM+R
Sbjct: 421 LFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMI+RGPITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGPITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMH+DDLRKFAMLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIYESGWISEMYGYSFGAAELQLRHIRN++ILIYPGY P+ GVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNTEILIYPGYYPDPGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSF KANWR+TD+VN CWAQFP PPD STLD TDK AF RDLLSIECIRTLNEAL LHH
Sbjct: 661 NWSFGKANWRDTDLVNTCWAQFPAPPDASTLDQTDKNAFARDLLSIECIRTLNEALYLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNL----SRELSEEAKDDGMF 780
           KK NC DP+SL NSN E ESEA VSRK+GKLDESYT K D+L    S+E SEE K+D MF
Sbjct: 721 KKSNCSDPSSLTNSNSENESEAGVSRKIGKLDESYTGKGDHLSTESSQESSEEVKEDAMF 780

Query: 781 SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKY 840
           SSLRLWII++WVISGL+FLV+I+S+FSGRK K +RGKH R KRRTASYSGF+DRNG EKY
Sbjct: 781 SSLRLWIISIWVISGLLFLVLIISKFSGRKVKVVRGKHQRIKRRTASYSGFVDRNGQEKY 840

Query: 841 VRDLDASL 845
           VRDLDASL
Sbjct: 841 VRDLDASL 848

BLAST of MS009770 vs. NCBI nr
Match: XP_022934960.1 (peptidyl serine alpha-galactosyltransferase-like [Cucurbita moschata])

HSP 1 Score: 1593.6 bits (4125), Expect = 0.0e+00
Identity = 757/848 (89.27%), Positives = 792/848 (93.40%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MR  LVFVA+ L+GFV G+GRS N+  AAP RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MRGFLVFVAVCLMGFVVGDGRSINSDMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMDLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEK RPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKSRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKL HHED IVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLYHHEDDIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLLLQHKRN CPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           F DLTKPKYPTPATLVM+EDHV KQP+KED  QKQPVKE  VQKQPV DE  + +PKIHT
Sbjct: 361 FADLTKPKYPTPATLVMKEDHVPKQPVKEDRVQKQPVKEELVQKQPVLDELQEPYPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTEC+ YFDWQTVGLMHSFRLSGQPGNITRLLSCT EDLK+YKGH+LAPTHYVPSM+R
Sbjct: 421 LFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMI+RGPITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGPITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMH+DDLRKFAMLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIYESGWISEMYGYSFGAAELQLRHIRN++ILIYPGY P+ GVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNTEILIYPGYYPDPGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSF KANWR+TD+VN CWAQFP PPD STLD TDK AF RDLLSIECIRTLNEAL LHH
Sbjct: 661 NWSFGKANWRDTDLVNTCWAQFPAPPDASTLDQTDKNAFARDLLSIECIRTLNEALYLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNL----SRELSEEAKDDGMF 780
           KK NC DP+SL NSN E ESEA VSRK+GKLDESYT K D+L    S+E SEE K+D MF
Sbjct: 721 KKSNCSDPSSLTNSNSENESEAGVSRKIGKLDESYTGKGDHLSTESSQESSEEVKEDAMF 780

Query: 781 SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKY 840
           SSLRLWII++WVISGL+FLV+I+S+FSGRK K +RGKH R KRRTASYSGF+DRNG EKY
Sbjct: 781 SSLRLWIISIWVISGLLFLVLIISKFSGRKVKVVRGKHQRIKRRTASYSGFVDRNGQEKY 840

Query: 841 VRDLDASL 845
           VRDLDASL
Sbjct: 841 VRDLDASL 848

BLAST of MS009770 vs. ExPASy Swiss-Prot
Match: Q8VYF9 (Peptidyl serine alpha-galactosyltransferase OS=Arabidopsis thaliana OX=3702 GN=SERGT1 PE=2 SV=1)

HSP 1 Score: 1263.1 bits (3267), Expect = 0.0e+00
Identity = 584/810 (72.10%), Positives = 679/810 (83.83%), Query Frame = 0

Query: 22  SNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQQPGPITRLLSCTDEEKKNYRG 81
           ++ +G  AP+RIHTLFSVECQNYFDWQTVGLM+SF KS QPGPITRLLSCTD++KK YRG
Sbjct: 19  ADESGQMAPYRIHTLFSVECQNYFDWQTVGLMHSFLKSGQPGPITRLLSCTDDQKKTYRG 78

Query: 82  MNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKEAENVDWVVILDADMIIRGPI 141
           MNLAPTFEVPS SRHPKTGDWYPAINKP GV++WL+HS+EA++VDWVVILDADMIIRGPI
Sbjct: 79  MNLAPTFEVPSWSRHPKTGDWYPAINKPVGVLYWLQHSEEAKHVDWVVILDADMIIRGPI 138

Query: 142 IPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIDDLRVFAPMWL 201
           IPWELGAE+GRP AA+YGYLVGCDN+L +LHTKHPELCDKVGGLLAMHIDDLRV AP+WL
Sbjct: 139 IPWELGAERGRPFAAHYGYLVGCDNLLVRLHTKHPELCDKVGGLLAMHIDDLRVLAPLWL 198

Query: 202 SKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRP 261
           SKTE+VR+D  HW TN+TGDIYGKGWISEMYGYSFGAAE GL+HKIND+LMIYPGY+PR 
Sbjct: 199 SKTEDVRQDTAHWTTNLTGDIYGKGWISEMYGYSFGAAEAGLKHKINDDLMIYPGYVPRE 258

Query: 262 DIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDPNKKRAL 321
            +EP+L+HYGLPFS+GNWSF+KL+HHED IVYDCNRLFPEPPYPRE++ ME DP+K+R L
Sbjct: 259 GVEPVLMHYGLPFSIGNWSFTKLDHHEDNIVYDCNRLFPEPPYPREVKIMEPDPSKRRGL 318

Query: 322 LINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMREDH 381
           ++++EC+N LNEGL+L+H  N CPKP+W+KYLSFLKSKTF +LT+PK   P ++ +  D 
Sbjct: 319 ILSLECMNTLNEGLILRHAENGCPKPKWTKYLSFLKSKTFMELTRPKLLAPGSVHILPD- 378

Query: 382 VQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHTLFSTECTPYFDWQTVGLMHSF 441
                      Q +P         P  DE   ++PKIHTLFSTECT YFDWQTVG MHSF
Sbjct: 379 -----------QHEP---------PPIDEFKGTYPKIHTLFSTECTTYFDWQTVGFMHSF 438

Query: 442 RLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPAINKPAAVLHWL 501
           R SGQPGNITRLLSCT E LK YKGHDLAPTHYVPSM+RHPLTGDWYPAINKPAAV+HWL
Sbjct: 439 RQSGQPGNITRLLSCTDEALKNYKGHDLAPTHYVPSMSRHPLTGDWYPAINKPAAVVHWL 498

Query: 502 NHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEA 561
           +H N DAE++VILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDN LA+LHT +PEA
Sbjct: 499 HHTNIDAEYVVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNDLARLHTRNPEA 558

Query: 562 CDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGA 621
           CDKVGGVIIMH++DLRKFAM WL KT+EVRAD+ HY   +TGDIYESGWISEMYGYSFGA
Sbjct: 559 CDKVGGVIIMHIEDLRKFAMYWLLKTQEVRADKEHYGKELTGDIYESGWISEMYGYSFGA 618

Query: 622 AELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVGNWSFDKANWRETDVVNKCWAQ 681
           AEL LRH  N +I+IYPGYVPE G  YRVFHYGLEFKVGNWSFDKANWR TD++NKCWA+
Sbjct: 619 AELNLRHSINKEIMIYPGYVPEPGADYRVFHYGLEFKVGNWSFDKANWRNTDLINKCWAK 678

Query: 682 FPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHHKKMNCPDPNSLANSNPEYESE 741
           FPDPP  S +  TD +   RDLLSIEC + LNEAL LHHK+ NCP+P S      E   +
Sbjct: 679 FPDPPSPSAVHQTDNDLRQRDLLSIECGQKLNEALFLHHKRRNCPEPGS------ESTEK 738

Query: 742 AVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLRLWIIALWVISGLVFLVVIVSR 801
             VSRKVG ++   T+  D  ++E S  ++ +G FS+L+LW+IALW+ISG+ FLVV++  
Sbjct: 739 ISVSRKVGNIETKQTQGSDE-TKESSGSSESEGRFSTLKLWVIALWLISGVGFLVVMLLV 798

Query: 802 FSGRKGKG-MRGKHHRNKRRTA-SYSGFMD 830
           FS R+G+G  RGK +RNKRRT+ S +GF+D
Sbjct: 799 FSTRRGRGTTRGKGYRNKRRTSYSNTGFLD 800
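
Each HSP summary line in these reports gives identity and positives as a count over the alignment length, followed by a percentage. The following is a minimal Python sketch, not part of the database output, that parses such a line and recomputes the percentages as a consistency check. The function name `parse_hsp_stats` and the returned dictionary keys are assumptions made for illustration; the pattern accepts both the "Postives" spelling found in some reports and the corrected "Positives".

```python
import re

# Pattern for lines such as:
#   Identity = 584/810 (72.10%), Positives = 679/810 (83.83%), Query Frame = 0
# "Posi?tives" also matches the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line (hypothetical helper, for illustration).

    Returns the raw counts, the percentages printed in the report, and
    the percentages recomputed from the counts.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    ident, ident_len = int(ident), int(ident_len)
    pos, pos_len = int(pos), int(pos_len)
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        "computed_identity_pct": 100.0 * ident / ident_len,
        "computed_positives_pct": 100.0 * pos / pos_len,
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 584/810 (72.10%), Positives = 679/810 (83.83%), Query Frame = 0"
    )
    print(stats)
```

Comparing the reported and recomputed values to within rounding (0.005 percentage points) is one way to catch a line that was corrupted during extraction.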

BLAST of MS009770 vs. ExPASy Swiss-Prot
Match: H3JU05 (Peptidyl serine alpha-galactosyltransferase OS=Chlamydomonas reinhardtii OX=3055 GN=SGT1 PE=1 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 4.8e-56
Identity = 127/339 (37.46%), Positives = 181/339 (53.39%), Query Frame = 0

Query: 4   LLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQQPG 63
           LL+ +A+         G +N TG      +H  F  +CQ Y DWQ+VG   SFK S QPG
Sbjct: 14  LLLLLALQHGASAEEPGFANRTG------VHVAFLTDCQMYSDWQSVGAAFSFKMSGQPG 73

Query: 64  PITRLLSCTDEEKKNYRG--MNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKE 123
            + R++ C++E+ KNY    + +  T+  P  +   +TGD Y A NKP  V+ WL H+  
Sbjct: 74  SVIRVMCCSEEQAKNYNKGLLGMVDTWVAPDATHSKRTGDRYAAYNKPEAVIDWLDHN-- 133

Query: 124 AENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKH------ 183
               D+V++LD+DM++R P     +G  KG  V A Y Y++G  N LA  H  H      
Sbjct: 134 VPKHDYVLVLDSDMVLRRPFFVENMGPRKGLAVGARYTYMIGVANELAVRHIPHVPPRND 193

Query: 184 ------PELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYG----- 243
                     D+VGG   +H DDL+  +  WL  +E+VR D    A  ++GD+Y      
Sbjct: 194 TLAGPFGRRADQVGGFFFIHKDDLKAMSHDWLKFSEDVRVDDQ--AYRLSGDVYAIHPGD 253

Query: 244 KGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVG-NWSFSK 303
           + WISEMYGY+FGAA   + HK +   MIYPGY PR  I P L+HYGL F +G N+SF K
Sbjct: 254 RPWISEMYGYAFGAANHNVWHKWDTFSMIYPGYEPREGI-PKLMHYGLLFEIGKNYSFDK 313

Query: 304 LNHHEDGI-------VYDCNR----LFPEPPYPREIQQM 312
             H++  +       + D  R    +FPEPP P  ++++
Sbjct: 314 HWHYDFDVTVCPPWDLKDPKRRTHGIFPEPPRPSSLRKV 341

BLAST of MS009770 vs. ExPASy Swiss-Prot
Match: Q9FY51 (Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT3 PE=1 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 7.2e-12
Identity = 67/288 (23.26%), Positives = 120/288 (41.67%), Query Frame = 0

Query: 402 VQKQPVFDEPNKSHP-KIHTLFSTECTPYFDWQTVGLMHSFR----LSGQP-GNITRLL- 461
           V + P+     KS P   H   +    PY  WQ   + + ++    L G   G  TR+L 
Sbjct: 46  VVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILH 105

Query: 462 SCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVIL 521
           S  S++L      D  PT  V  +   P     Y  +N+P A + WL       +++++ 
Sbjct: 106 SGNSDNLM-----DEIPTFVVDPL--PPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMA 165

Query: 522 DADMILRGPITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKVGGV---- 581
           + D +    + P    A  G P + P+ Y+     +N++ K + +       +  +    
Sbjct: 166 EPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSP 225

Query: 582 IIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLRH 641
           +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +A   +RH
Sbjct: 226 VIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIASAIHGVRH 285

Query: 642 IRNSKILIYPGY-VPESGVHYRVFHYGLEF---------KVGNWSFDK 667
           I     ++ P + +   G     + YG ++         K+G W FDK
Sbjct: 286 ILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDK 315

BLAST of MS009770 vs. ExPASy Swiss-Prot
Match: E9KID3 (Hydroxyproline O-arabinosyltransferase NOD3 (Fragment) OS=Pisum sativum OX=3888 GN=NOD3 PE=2 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 2.1e-11
Identity = 59/277 (21.30%), Positives = 111/277 (40.07%), Query Frame = 0

Query: 417 KIHTLFSTECTPYFDWQTVGLMHSFRLS-----GQPGNITRLLSCTSEDLKEYKGHDLAP 476
           K H   +     Y  WQ   + + ++ +        G  TR+L    ED           
Sbjct: 43  KFHVAVTATDAAYSQWQCRIMYYWYKKAKDMPGSAMGKFTRILHSGKED---------QL 102

Query: 477 THYVPSMNRHPLTGD---WYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWE 536
            + +P+    PL       Y  +N+P A + WL     D E+I++ + D I    + P  
Sbjct: 103 MNEIPTFVVDPLPDGLDRGYIVLNRPWAFVQWLEKAVIDEEYILMAEPDHIF---VNPLP 162

Query: 537 FKAARGRPVSTPYDYLIGCDN--VLAKLHTSHPEACDKVGGV----IIMHVDDLRKFAML 596
             A+   P   P+ Y+   +N  ++ K +         V  +    +I+H   L + A  
Sbjct: 163 NLASENEPAGYPFFYIKPAENEKIMRKFYPKEKGPVTDVDPIGNSPVIIHKYLLEEIAPT 222

Query: 597 WLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVP 656
           W++ +  ++ D        T  ++  GW+ EMY Y+  +A   ++H      ++ P +  
Sbjct: 223 WVNVSLRMKDDPE------TDKVF--GWVLEMYAYAVASALHGIKHTLRKDFMLQPPWDL 282

Query: 657 ESGVHYRV-FHYGLEF---------KVGNWSFDKANW 670
           E G  + + + YG ++         K+G W FDK ++
Sbjct: 283 EVGKTFIIHYTYGCDYNLKGKLTYGKIGEWRFDKRSY 299

BLAST of MS009770 vs. ExPASy Swiss-Prot
Match: E9KID2 (Hydroxyproline O-arabinosyltransferase RDN1 OS=Medicago truncatula OX=3880 GN=RDN1 PE=2 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 8.0e-11
Identity = 61/288 (21.18%), Positives = 111/288 (38.54%), Query Frame = 0

Query: 410 EPNKSHPKIHTLFSTECTPYFDWQTVGLMHSFRLS-----GQPGNITRLLSCTSEDLKEY 469
           E   ++ K H   +     Y  WQ   + + ++ +        G  TR+L         +
Sbjct: 51  EIRNTNSKYHVAVTATDAAYSQWQCRIMYYWYKKTKDMPGSAMGKFTRIL---------H 110

Query: 470 KGHDLAPTHYVPSMNRHPL---TGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILR 529
            G      + +P+    PL       Y  +N+P A + WL     D E+I++ + D I  
Sbjct: 111 SGRGDQLMNEIPTFVVDPLPEGLDRGYIVLNRPWAFVQWLEKAVIDEEYILMAEPDHIF- 170

Query: 530 GPITPWEFKAARGRPVSTPYDYLIGCDN--VLAKLHTSHPEACDKVGGV----IIMHVDD 589
             + P    A    P   P+ Y+   +N  ++ K +         V  +    +I+H   
Sbjct: 171 --VNPLPNLATENEPAGYPFFYIKPAENEKIMRKFYPKENGPVTDVDPIGNSPVIIHKYM 230

Query: 590 LRKFAMLW----LHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLRHIRN 649
           L + A  W    L   ++   D+A             GW+ EMY Y+  +A   ++HI  
Sbjct: 231 LEEIAPTWVNISLRMKDDPETDKAF------------GWVLEMYAYAVASALHGIKHILR 290

Query: 650 SKILIYPGYVPESGVHYRV-FHYGLEF---------KVGNWSFDKANW 670
              ++ P +  + G  + + F YG ++         K+G W FDK ++
Sbjct: 291 KDFMLQPPWDLDVGKKFIIHFTYGCDYNLKGKLTYGKIGEWRFDKRSY 314

BLAST of MS009770 vs. ExPASy TrEMBL
Match: A0A6J1DIW2 (peptidyl serine alpha-galactosyltransferase OS=Momordica charantia OX=3673 GN=LOC111021491 PE=4 SV=1)

HSP 1 Score: 1783.8 bits (4619), Expect = 0.0e+00
Identity = 843/844 (99.88%), Positives = 843/844 (99.88%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ
Sbjct: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT
Sbjct: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR
Sbjct: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH
Sbjct: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLR 780
           KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLR
Sbjct: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLR 780

Query: 781 LWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKYVRDL 840
           LWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRT SYSGFMDRNGHEKYVRDL
Sbjct: 781 LWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTTSYSGFMDRNGHEKYVRDL 840

Query: 841 DASL 845
           DASL
Sbjct: 841 DASL 844

BLAST of MS009770 vs. ExPASy TrEMBL
Match: A0A6J1F984 (peptidyl serine alpha-galactosyltransferase-like OS=Cucurbita moschata OX=3662 GN=LOC111441973 PE=4 SV=1)

HSP 1 Score: 1593.6 bits (4125), Expect = 0.0e+00
Identity = 757/848 (89.27%), Positives = 792/848 (93.40%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MR  LVFVA+ L+GFV G+GRS N+  AAP RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MRGFLVFVAVCLMGFVVGDGRSINSDMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMDLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEK RPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKSRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKL HHED IVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLYHHEDDIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLLLQHKRN CPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           F DLTKPKYPTPATLVM+EDHV KQP+KED  QKQPVKE  VQKQPV DE  + +PKIHT
Sbjct: 361 FADLTKPKYPTPATLVMKEDHVPKQPVKEDRVQKQPVKEELVQKQPVLDELQEPYPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTEC+ YFDWQTVGLMHSFRLSGQPGNITRLLSCT EDLK+YKGH+LAPTHYVPSM+R
Sbjct: 421 LFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMI+RGPITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGPITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMH+DDLRKFAMLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIYESGWISEMYGYSFGAAELQLRHIRN++ILIYPGY P+ GVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNTEILIYPGYYPDPGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSF KANWR+TD+VN CWAQFP PPD STLD TDK AF RDLLSIECIRTLNEAL LHH
Sbjct: 661 NWSFGKANWRDTDLVNTCWAQFPAPPDASTLDQTDKNAFARDLLSIECIRTLNEALYLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNL----SRELSEEAKDDGMF 780
           KK NC DP+SL NSN E ESEA VSRK+GKLDESYT K D+L    S+E SEE K+D MF
Sbjct: 721 KKSNCSDPSSLTNSNSENESEAGVSRKIGKLDESYTGKGDHLSTESSQESSEEVKEDAMF 780

Query: 781 SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKY 840
           SSLRLWII++WVISGL+FLV+I+S+FSGRK K +RGKH R KRRTASYSGF+DRNG EKY
Sbjct: 781 SSLRLWIISIWVISGLLFLVLIISKFSGRKVKVVRGKHQRIKRRTASYSGFVDRNGQEKY 840

Query: 841 VRDLDASL 845
           VRDLDASL
Sbjct: 841 VRDLDASL 848

BLAST of MS009770 vs. ExPASy TrEMBL
Match: A0A6J1J567 (peptidyl serine alpha-galactosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111481857 PE=4 SV=1)

HSP 1 Score: 1588.5 bits (4112), Expect = 0.0e+00
Identity = 755/848 (89.03%), Positives = 793/848 (93.51%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MR  L+FVAI ++GFVAG+GRS N+  AAP RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MRGFLMFVAIFVMGFVAGDGRSINSDMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMDLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKL HHED IVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLYHHEDDIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLLLQHKRN CPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           F DLTKPKYPTPATLVM+EDHV KQP+K D  QKQPVKE  VQKQPV DE  + +PKIHT
Sbjct: 361 FADLTKPKYPTPATLVMKEDHVPKQPVKGDRVQKQPVKEELVQKQPVLDELQEPYPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTEC+ YFDWQTVGLMHSFRLSGQPGNITRLLSCT E+LK+YKGH+LAPTHYVPSM+R
Sbjct: 421 LFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDENLKKYKGHNLAPTHYVPSMSR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMI+RGPITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGPITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMH+DDLRKFAMLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIYESGWISEMYGYSFGAAELQLRHIRN++ILIYPGY P+ GVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNTEILIYPGYYPDPGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSF KANWR+TD+VN CWAQFP PPD STLD TDK AF RDLLSIECIRTLNEAL LHH
Sbjct: 661 NWSFGKANWRDTDLVNTCWAQFPAPPDASTLDQTDKNAFARDLLSIECIRTLNEALYLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEKVDNL----SRELSEEAKDDGMF 780
           KK NC DP+SL NSN E ESEA VSRK+GKLDESYT K ++L    S+E SEE K+D MF
Sbjct: 721 KKSNCSDPSSLTNSNSENESEAGVSRKIGKLDESYTGKGNHLSTESSQESSEEVKEDAMF 780

Query: 781 SSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKY 840
           SSLRLWII++WVISGL+FLV+I+S+FSGRK K +RGKH R KRRTASYSGF+DRNG EKY
Sbjct: 781 SSLRLWIISIWVISGLLFLVLIISKFSGRKVKVVRGKHQRIKRRTASYSGFVDRNGQEKY 840

Query: 841 VRDLDASL 845
           VRDLDASL
Sbjct: 841 VRDLDASL 848

BLAST of MS009770 vs. ExPASy TrEMBL
Match: A0A1S3BNB4 (uncharacterized protein LOC103491714 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491714 PE=4 SV=1)

HSP 1 Score: 1576.2 bits (4080), Expect = 0.0e+00
Identity = 750/847 (88.55%), Positives = 787/847 (92.92%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MRE L+FVAI LV FVA +G +NN+  AAP RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MREFLLFVAIFLVRFVASDGWTNNSSMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTD+EKK YRGM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDDEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKINDNLMIYPGYIPRP+IEPILLHYGLPFSVGNWSFSKLNHHED IVYDCNRLFP
Sbjct: 241 VGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVGNWSFSKLNHHEDDIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLL QHKRN CPKP+WSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLWQHKRNGCPKPEWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMREDHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHT 420
           FTDLTKPKYPTP+TLVM+ED VQKQP+K    QKQPVKE+ VQKQPV DE  + +PKIHT
Sbjct: 361 FTDLTKPKYPTPSTLVMKEDRVQKQPVKVYRVQKQPVKEDLVQKQPVLDELQEPYPKIHT 420

Query: 421 LFSTECTPYFDWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNR 480
           LFSTECT YFDWQTVGLMHSFRLSGQPGNITRLLSCT EDLK+YKGH+LAPTHYVPSM+R
Sbjct: 421 LFSTECTTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSR 480

Query: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTP 540
           HPLTGDWYPAINKPAAVLHWLNHVNTDAE+IVILDADMI+RG ITPWEFKAARGRPVSTP
Sbjct: 481 HPLTGDWYPAINKPAAVLHWLNHVNTDAEYIVILDADMIMRGSITPWEFKAARGRPVSTP 540

Query: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATN 600
           YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMH+DDLRKFAMLWLHKTEEVRADRAHYATN
Sbjct: 541 YDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYATN 600

Query: 601 ITGDIYESGWISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVG 660
           ITGDIY+SGWISEMYGYSFGAAELQLRHIRNS+IL+YPGYVP+ GVHYRVFHYGLEFKVG
Sbjct: 601 ITGDIYQSGWISEMYGYSFGAAELQLRHIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVG 660

Query: 661 NWSFDKANWRETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHH 720
           NWSFDKANWRETD+VN+CWAQFP PPD STLD TDK  F RDLLSIECIRTLNEAL LHH
Sbjct: 661 NWSFDKANWRETDLVNRCWAQFPAPPDPSTLDQTDKGGFARDLLSIECIRTLNEALYLHH 720

Query: 721 KKMNCPDPNSLANSNPEYESEAVVSRKVGKLDESYTEK---VDNLSRELSEEAKDDGMFS 780
           KK NC DPN L N N E ESE  VS K+GKLDESYT K       S+E S EAK+DG+FS
Sbjct: 721 KKRNCSDPNLLTNLNSEDESETGVSWKIGKLDESYTGKGHLSTESSQESSVEAKEDGIFS 780

Query: 781 SLRLWIIALWVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKYV 840
           SLR WIIALWVISGLVFLVVI+S+FSGRK KG+RGKHHR KRRTASYS F+DRNG EKYV
Sbjct: 781 SLRSWIIALWVISGLVFLVVIISKFSGRKAKGVRGKHHRIKRRTASYSVFVDRNGQEKYV 840

Query: 841 RDLDASL 845
           +DLDASL
Sbjct: 841 KDLDASL 847

BLAST of MS009770 vs. ExPASy TrEMBL
Match: A0A0A0LDQ3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G597250 PE=4 SV=1)

HSP 1 Score: 1574.7 bits (4076), Expect = 0.0e+00
Identity = 756/898 (84.19%), Positives = 797/898 (88.75%), Query Frame = 0

Query: 1   MRELLVFVAISLVGFVAGNGRSNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQ 60
           MRE L+FVAI LVGFVA +G +NN+G AAP RIHTLFSVECQNYFDWQTVGLM+SFKKS+
Sbjct: 1   MREFLLFVAIFLVGFVASDGWTNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKK YRGM+LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCD 180

Query: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240
           KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE
Sbjct: 181 KVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAE 240

Query: 241 VGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300
           VGLRHKIN+NLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP
Sbjct: 241 VGLRHKINENLMIYPGYIPRPDIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFP 300

Query: 301 EPPYPREIQQMESDPNKKRALLINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKT 360
           EPPYPREIQQMESD NKKR LLINIECINLLNEGLL QHKRN CPKPQWSKYLSFLKSKT
Sbjct: 301 EPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLWQHKRNGCPKPQWSKYLSFLKSKT 360

Query: 361 FTDLTKPKYPTPATLVMRE----------------------------------------- 420
           FTDLTKPKYPTPA+LVM+E                                         
Sbjct: 361 FTDLTKPKYPTPASLVMKEDCVQKQPVKVDRVQKQPVKVDRVQKQPVKVDRVQKQPVKVD 420

Query: 421 ---------DHVQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHTLFSTECTPYF 480
                    D VQKQP+K D  QKQPVKE+ VQKQPV DE  + +PKIHTLFSTECT YF
Sbjct: 421 RVQKQPVKVDRVQKQPVKVDRVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECTTYF 480

Query: 481 DWQTVGLMHSFRLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPA 540
           DWQTVGLMHSFRLSGQPGNITRLLSCT EDLK+YKGH+LAPTHYVPSM+RHPLTGDWYPA
Sbjct: 481 DWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDWYPA 540

Query: 541 INKPAAVLHWLNHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNV 600
           INKPAAVLHWLNHVNTDAE+IVILDADMI+RG ITPWEFKAARGRPVSTPYDYLIGCDNV
Sbjct: 541 INKPAAVLHWLNHVNTDAEYIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNV 600

Query: 601 LAKLHTSHPEACDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGW 660
           LAKLHTSHPEACDKVGGVIIMH+DDLRKF+MLWLHKTEEVRADRAHYATNITGDIY+SGW
Sbjct: 601 LAKLHTSHPEACDKVGGVIIMHIDDLRKFSMLWLHKTEEVRADRAHYATNITGDIYQSGW 660

Query: 661 ISEMYGYSFGAAELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVGNWSFDKANWR 720
           ISEMYGYSFGAAELQLRHIR+S+IL+YPGY P+ GVHYRVFHYGLEFKVGNWSFDKANWR
Sbjct: 661 ISEMYGYSFGAAELQLRHIRSSEILLYPGYAPDPGVHYRVFHYGLEFKVGNWSFDKANWR 720

Query: 721 ETDVVNKCWAQFPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHHKKMNCPDPNS 780
           ETD+VN+CWAQFP PPD STLD +DK+ F RDLLSIECIRTLNEAL LHHKK NC DPN 
Sbjct: 721 ETDLVNRCWAQFPAPPDPSTLDQSDKDGFARDLLSIECIRTLNEALYLHHKKRNCSDPNL 780

Query: 781 LANSNPEYESEAVVSRKVGKLDESYTEKVDNLSRELSEE----AKDDGMFSSLRLWIIAL 840
           LAN N + ESE  VSRK+GKLDESYT K D+LS + S+E    AK+DG+F SLRLWIIAL
Sbjct: 781 LANPNLDDESEVGVSRKIGKLDESYTGKEDHLSTDSSQESSQAAKEDGIFGSLRLWIIAL 840

Query: 841 WVISGLVFLVVIVSRFSGRKGKGMRGKHHRNKRRTASYSGFMDRNGHEKYVRDLDASL 845
           WVISGLVFLVVI+S+FSGRK KG+RGKHHR KRRTASYSGF+DRNG EKYVRDLDASL
Sbjct: 841 WVISGLVFLVVIISKFSGRKAKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 898

BLAST of MS009770 vs. TAIR 10
Match: AT3G01720.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 374 Blast hits to 211 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316; Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink). )

HSP 1 Score: 1263.1 bits (3267), Expect = 0.0e+00
Identity = 584/810 (72.10%), Positives = 679/810 (83.83%), Query Frame = 0

Query: 22  SNNTGTAAPWRIHTLFSVECQNYFDWQTVGLMNSFKKSQQPGPITRLLSCTDEEKKNYRG 81
           ++ +G  AP+RIHTLFSVECQNYFDWQTVGLM+SF KS QPGPITRLLSCTD++KK YRG
Sbjct: 19  ADESGQMAPYRIHTLFSVECQNYFDWQTVGLMHSFLKSGQPGPITRLLSCTDDQKKTYRG 78

Query: 82  MNLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKEAENVDWVVILDADMIIRGPI 141
           MNLAPTFEVPS SRHPKTGDWYPAINKP GV++WL+HS+EA++VDWVVILDADMIIRGPI
Sbjct: 79  MNLAPTFEVPSWSRHPKTGDWYPAINKPVGVLYWLQHSEEAKHVDWVVILDADMIIRGPI 138

Query: 142 IPWELGAEKGRPVAAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIDDLRVFAPMWL 201
           IPWELGAE+GRP AA+YGYLVGCDN+L +LHTKHPELCDKVGGLLAMHIDDLRV AP+WL
Sbjct: 139 IPWELGAERGRPFAAHYGYLVGCDNLLVRLHTKHPELCDKVGGLLAMHIDDLRVLAPLWL 198

Query: 202 SKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRP 261
           SKTE+VR+D  HW TN+TGDIYGKGWISEMYGYSFGAAE GL+HKIND+LMIYPGY+PR 
Sbjct: 199 SKTEDVRQDTAHWTTNLTGDIYGKGWISEMYGYSFGAAEAGLKHKINDDLMIYPGYVPRE 258

Query: 262 DIEPILLHYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDPNKKRAL 321
            +EP+L+HYGLPFS+GNWSF+KL+HHED IVYDCNRLFPEPPYPRE++ ME DP+K+R L
Sbjct: 259 GVEPVLMHYGLPFSIGNWSFTKLDHHEDNIVYDCNRLFPEPPYPREVKIMEPDPSKRRGL 318

Query: 322 LINIECINLLNEGLLLQHKRNECPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMREDH 381
           ++++EC+N LNEGL+L+H  N CPKP+W+KYLSFLKSKTF +LT+PK   P ++ +  D 
Sbjct: 319 ILSLECMNTLNEGLILRHAENGCPKPKWTKYLSFLKSKTFMELTRPKLLAPGSVHILPD- 378

Query: 382 VQKQPMKEDHGQKQPVKENHVQKQPVFDEPNKSHPKIHTLFSTECTPYFDWQTVGLMHSF 441
                      Q +P         P  DE   ++PKIHTLFSTECT YFDWQTVG MHSF
Sbjct: 379 -----------QHEP---------PPIDEFKGTYPKIHTLFSTECTTYFDWQTVGFMHSF 438

Query: 442 RLSGQPGNITRLLSCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPAINKPAAVLHWL 501
           R SGQPGNITRLLSCT E LK YKGHDLAPTHYVPSM+RHPLTGDWYPAINKPAAV+HWL
Sbjct: 439 RQSGQPGNITRLLSCTDEALKNYKGHDLAPTHYVPSMSRHPLTGDWYPAINKPAAVVHWL 498

Query: 502 NHVNTDAEFIVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEA 561
           +H N DAE++VILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDN LA+LHT +PEA
Sbjct: 499 HHTNIDAEYVVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNDLARLHTRNPEA 558

Query: 562 CDKVGGVIIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGA 621
           CDKVGGVIIMH++DLRKFAM WL KT+EVRAD+ HY   +TGDIYESGWISEMYGYSFGA
Sbjct: 559 CDKVGGVIIMHIEDLRKFAMYWLLKTQEVRADKEHYGKELTGDIYESGWISEMYGYSFGA 618

Query: 622 AELQLRHIRNSKILIYPGYVPESGVHYRVFHYGLEFKVGNWSFDKANWRETDVVNKCWAQ 681
           AEL LRH  N +I+IYPGYVPE G  YRVFHYGLEFKVGNWSFDKANWR TD++NKCWA+
Sbjct: 619 AELNLRHSINKEIMIYPGYVPEPGADYRVFHYGLEFKVGNWSFDKANWRNTDLINKCWAK 678

Query: 682 FPDPPDTSTLDPTDKEAFDRDLLSIECIRTLNEALNLHHKKMNCPDPNSLANSNPEYESE 741
           FPDPP  S +  TD +   RDLLSIEC + LNEAL LHHK+ NCP+P S      E   +
Sbjct: 679 FPDPPSPSAVHQTDNDLRQRDLLSIECGQKLNEALFLHHKRRNCPEPGS------ESTEK 738

Query: 742 AVVSRKVGKLDESYTEKVDNLSRELSEEAKDDGMFSSLRLWIIALWVISGLVFLVVIVSR 801
             VSRKVG ++   T+  D  ++E S  ++ +G FS+L+LW+IALW+ISG+ FLVV++  
Sbjct: 739 ISVSRKVGNIETKQTQGSDE-TKESSGSSESEGRFSTLKLWVIALWLISGVGFLVVMLLV 798

Query: 802 FSGRKGKG-MRGKHHRNKRRTA-SYSGFMD 830
           FS R+G+G  RGK +RNKRRT+ S +GF+D
Sbjct: 799 FSTRRGRGTTRGKGYRNKRRTSYSNTGFLD 800

BLAST of MS009770 vs. TAIR 10
Match: AT5G13500.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 74.3 bits (181), Expect = 5.2e-13
Identity = 67/288 (23.26%), Positives = 120/288 (41.67%), Query Frame = 0

Query: 402 VQKQPVFDEPNKSHP-KIHTLFSTECTPYFDWQTVGLMHSFR----LSGQP-GNITRLL- 461
           V + P+     KS P   H   +    PY  WQ   + + ++    L G   G  TR+L 
Sbjct: 46  VVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILH 105

Query: 462 SCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVIL 521
           S  S++L      D  PT  V  +   P     Y  +N+P A + WL       +++++ 
Sbjct: 106 SGNSDNLM-----DEIPTFVVDPL--PPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMA 165

Query: 522 DADMILRGPITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKVGGV---- 581
           + D +    + P    A  G P + P+ Y+     +N++ K + +       +  +    
Sbjct: 166 EPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSP 225

Query: 582 IIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLRH 641
           +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +A   +RH
Sbjct: 226 VIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIASAIHGVRH 285

Query: 642 IRNSKILIYPGY-VPESGVHYRVFHYGLEF---------KVGNWSFDK 667
           I     ++ P + +   G     + YG ++         K+G W FDK
Sbjct: 286 ILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDK 315

BLAST of MS009770 vs. TAIR 10
Match: AT5G13500.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 228 Blast hits to 200 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213; Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink). )

HSP 1 Score: 74.3 bits (181), Expect = 5.2e-13
Identity = 67/288 (23.26%), Positives = 120/288 (41.67%), Query Frame = 0

Query: 402 VQKQPVFDEPNKSHP-KIHTLFSTECTPYFDWQTVGLMHSFR----LSGQP-GNITRLL- 461
           V + P+     KS P   H   +    PY  WQ   + + ++    L G   G  TR+L 
Sbjct: 46  VVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILH 105

Query: 462 SCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVIL 521
           S  S++L      D  PT  V  +   P     Y  +N+P A + WL       +++++ 
Sbjct: 106 SGNSDNLM-----DEIPTFVVDPL--PPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMA 165

Query: 522 DADMILRGPITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKVGGV---- 581
           + D +    + P    A  G P + P+ Y+     +N++ K + +       +  +    
Sbjct: 166 EPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSP 225

Query: 582 IIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLRH 641
           +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +A   +RH
Sbjct: 226 VIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIASAIHGVRH 285

Query: 642 IRNSKILIYPGY-VPESGVHYRVFHYGLEF---------KVGNWSFDK 667
           I     ++ P + +   G     + YG ++         K+G W FDK
Sbjct: 286 ILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDK 315

BLAST of MS009770 vs. TAIR 10
Match: AT5G13500.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 74.3 bits (181), Expect = 5.2e-13
Identity = 67/288 (23.26%), Positives = 120/288 (41.67%), Query Frame = 0

Query: 402 VQKQPVFDEPNKSHP-KIHTLFSTECTPYFDWQTVGLMHSFR----LSGQP-GNITRLL- 461
           V + P+     KS P   H   +    PY  WQ   + + ++    L G   G  TR+L 
Sbjct: 46  VVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILH 105

Query: 462 SCTSEDLKEYKGHDLAPTHYVPSMNRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVIL 521
           S  S++L      D  PT  V  +   P     Y  +N+P A + WL       +++++ 
Sbjct: 106 SGNSDNLM-----DEIPTFVVDPL--PPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMA 165

Query: 522 DADMILRGPITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKVGGV---- 581
           + D +    + P    A  G P + P+ Y+     +N++ K + +       +  +    
Sbjct: 166 EPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSP 225

Query: 582 IIMHVDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLRH 641
           +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +A   +RH
Sbjct: 226 VIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIASAIHGVRH 285

Query: 642 IRNSKILIYPGY-VPESGVHYRVFHYGLEF---------KVGNWSFDK 667
           I     ++ P + +   G     + YG ++         K+G W FDK
Sbjct: 286 ILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDK 315
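
The percentages in the HSP header above are the identical and positive-scoring column counts divided by the alignment length (288 columns, gaps included). A minimal sketch that reproduces them follows; the function name `percent` is illustrative and is not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP header (Identity = 67/288, Positives = 120/288).
identity = percent(67, 288)
positives = percent(120, 288)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

Both values are computed over the aligned region only (query residues 402 to 667), not over the full-length protein.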

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_022154175.1  0.0e+00  99.88     peptidyl serine alpha-galactosyltransferase [Momordica charantia]
XP_038899299.1  0.0e+00  90.21     peptidyl serine alpha-galactosyltransferase [Benincasa hispida]
KGN58321.2      0.0e+00  89.15     hypothetical protein Csa_017560 [Cucumis sativus]
KAG6581066.1    0.0e+00  89.50     Peptidyl serine alpha-galactosyltransferase, partial [Cucurbita argyrosperma sub...
XP_022934960.1  0.0e+00  89.27     peptidyl serine alpha-galactosyltransferase-like [Cucurbita moschata]

Match Name      E-value  Identity  Description
Q8VYF9          0.0e+00  72.10     Peptidyl serine alpha-galactosyltransferase OS=Arabidopsis thaliana OX=3702 GN=S...
H3JU05          4.8e-56  37.46     Peptidyl serine alpha-galactosyltransferase OS=Chlamydomonas reinhardtii OX=3055...
Q9FY51          7.2e-12  23.26     Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT...
E9KID3          2.1e-11  21.30     Hydroxyproline O-arabinosyltransferase NOD3 (Fragment) OS=Pisum sativum OX=3888 ...
E9KID2          8.0e-11  21.18     Hydroxyproline O-arabinosyltransferase RDN1 OS=Medicago truncatula OX=3880 GN=RD...

Match Name      E-value  Identity  Description
A0A6J1DIW2      0.0e+00  99.88     peptidyl serine alpha-galactosyltransferase OS=Momordica charantia OX=3673 GN=LO...
A0A6J1F984      0.0e+00  89.27     peptidyl serine alpha-galactosyltransferase-like OS=Cucurbita moschata OX=3662 G...
A0A6J1J567      0.0e+00  89.03     peptidyl serine alpha-galactosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A1S3BNB4      0.0e+00  88.55     uncharacterized protein LOC103491714 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0LDQ3      0.0e+00  84.19     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G597250 PE=4 SV=1

Match Name      E-value  Identity  Description
AT3G01720.1     0.0e+00  72.10     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13500.1     5.2e-13  23.26     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13500.2     5.2e-13  23.26     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13500.3     5.2e-13  23.26     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                      Source   Source Term     Source Description                           Alignment
IPR044845  Glycosyltransferase HPAT/SRGT1-like  PANTHER  PTHR31485       PEPTIDYL SERINE ALPHA-GALACTOSYLTRANSFERASE  coord: 25..827
None       No IPR available                     PANTHER  PTHR31485:SF25  PEPTIDYL SERINE ALPHA-GALACTOSYLTRANSFERASE  coord: 25..827

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MS009770.1    MS009770.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0016757      glycosyltransferase activity