Clc05G15090 (gene) Watermelon (cordophanus) v2

Overview
NameClc05G15090
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Descriptionpeptidyl serine alpha-galactosyltransferase
LocationClcChr05: 16981853 .. 16988863 (-)
RNA-Seq ExpressionClc05G15090
SyntenyClc05G15090
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAAAAATGGCATTGGCAATAATTACCCCAGGAGAGAAGAAGCGGTTCTCGAAAATGAAGGTCGGACTTAGAAAAGTCATTCCCAACTGCTCCCATGTTCTGATCAGAGAAGTAAATTCCAACCCCAAGCTTCTAGTTTTTGTAGAAATTAAAAAATAGCTTCTCATTTTTTTAGAATTTTTTTCTGAATTATTTTATTAGCAACCAATTGATTGCTTTTTTAAGCTAATTTTTCGATCAAACTTGTCCCCACATCACTCCCTCTCCATGTCAAAGAAAACAAAAAGAGTAAGAAAAATGGAAGTTTGACCTGTGGCTGCGATCCGGGGAATGCAACCCAGATCTCTAAAACCAGTACCCAATTGAAATCTAGTCCAGGATTGAACCAAAATGAAAGAATTCTTGCTGTTCGTGGCGATATTTTTGGTGGGGTTTGTGGCCGGCGATGGGTGGAGCAATAATTCCGGCATGGCGGCGCCGCGGCGGATTCATACTCTGTTTTCGGTGGAGTGTCAGAATTACTTCGATTGGCAAACTGTTGGGTTGATGCATAGCTTCAAGAAGTCGAAGCAACCGGGGCCGATCACCCGTTTGCTTAGTTGCACCGATGAGGAGAAGAAGAACTATAGAGGGATGCATTTGGCTCCCACTTTTGAGGTTCCATCCATGAGTAGGCACCCCAAAACTGGCGACTGGTGAGAGTTTCTTCTTCTTTTCCCTTTCATTTCATTGAACTTTATGGAGTATAGGATTTGTGTTTTGTGTTGCTTTGGGTGTGCTTCTGCTAAGTGATTGGGTTGTTTTCAATTTGATTGCTACCTGTGTTGGTTTTGTTTGATTTTATTTTTGAAATTGTTCCTCTGTTGGCTAGATTAGTTCCTTAACTTTGTAGGAGCATGCTTCCCAAAACTGGCGACTGGTTAGACTTGCTTCATCTTTTCCCTTTCATTGCATGAACTTTTGGAGTTGAGGATTTGGGTTTCTTCTATTGCTTTTAGTATGCTTCTGGTTAGTAATCTGGTTGAGTTTTGATTGGTTATCTTCCTTAGCTTTGTTTGAATTTAGTTCTGAAATTGTTAGTCTGTTTGACTACATTGATCGGTTTGTATACAGTTTTTTCTTCCCTTCTTTTCATAAGATGGGTTGAGCTCTTGGACTAGGAGCACAGAATGATGATATTGTGATTGGTTTATGTGCTCTGTTTAACTGTTTGACGCTAACTCACTGCTCCAAGTTTAACTCATTTAGCAATTGATAGCTGTGAAGCCAGAGGAAGAGGTTTCTTGTTTTGAAAATATATCTCAATTTTGAAGGATTGTGGAAAGATGTAAGATTCTGGTTGAAAAGCTTAGTGATGTATAAATGTAAATGGTAGTGGTAGTTGAGGAGAGAATATTGGAATGGTGTGTAAATCAAACCGGACAAGGGGGAAGTAACTAAATGGGAGAAACTGGAAGAGTCAATTGCAGAGGGTTGAGGAAGGGGGGGTGGGAACGGCTTTCATTTTGACCTTGGTCAATCTTCATTAAGGGGTCTGGCCCGAATTTTGTATATCACTGCAACTCCAAAGAATGTATACATCAGTCAACGTTCAAGAATATGGATAATTCAGCTGCACAAAGAAGAGGATTTGTGTAGGTTGCATGCTCGAGAGGCTTCAGGATTTCTTTGGACTATACATATCTCACTCGAGAAACTCTATCGAATGTTGCATCGTTGAACAGAGGAGATGTTTCCCATTGATGCTTAATACCAGAGGAAAAAACTTGTTTTCACTTTTGTTTGACTTGATGCTTTACAGATGTTTTTTTCTTACTCATTCATCAAAATAACCATTAGATATATTTCTTCTGACTTCATACTCGTTAGTCTTTCTTTTTCCTCCTGTTGAAAAATGGCTACAAATGAAAAAAAATAGGATATAAGAAGAAAATATTACTTGTTTTGCAGGTATCCTGCAATAAATAAACCTGCAGGGGTTGTCCACTGGCTTAAACATAGCAAAGAAGCAGAGAATGTTGATTGGGTTGTTATTCTGGATGCAGACATGATCATTAGAGGCCCAATAATACCTTGGGAACTTGGTGCAGAGAAGGGCAGACCTGTTGCAGCCTATTATGGGTTACATTCTTCTCTCTTCTCCCTCCTTCAGTCCCTTATGGCGTCGAAGCTTATTATTCTCCTTAAATTGAGATTTCAACAAAGTTACTTGGATAGATGCTTTTGGCTTTCTACAGTAATAAAGAAAAAATTGGTTTAAATATTCTTTGGTACTTGAACTTTTATTTTAGTCTCTATACTGTCAACTATCAAGTGTTGTGTTTTAGTCGCTTAATTTTCTAATTTTATAACTTTAGTTACTAAACGTGGTATCAAAAACTGTTTTAATCTTGCTGATAAATTTTCTTTATCATTTAGAAAAGTCTAAGTCCATTTTGAAATTGGACATATTCTACTATGTAGTACGTGAGACTTCAAATATTTGTCTGTATAGACATTTATTGATCAACGAAATGAAAAAAATGTGATTAGGTTAGACTTACATTGTATAATAAAAAATTAACTATAAGAACCATAATGCTTTAATATATATATATATATATTTAAAGTTCTAAGACAAATAGAATGTTTAAAAGTTTAGGGATCAAAATAAGTAAAAAAGTTACTGGACTAGAATAAGATTTAACAGATAGTAAACTTTTGTTGTGAGACTTGCAGTGCAATTGGGAATATTTTGTGTTTATATTTAAATTTCTTCTTAATGTTGTTGTGACTGCGTTCAAATGATGAATAATTATTTCTTCAGATACTTGGTTGGATGTGACAACATTCTTGCTAAATTGCACACCAAGCACCCAGAGCTCTGTGACAAAGTTGGTGGCCTCTTAGCAATGCATATAGATGATCTTCGAGTATTTGCACCAATGTGGCTTTCAAAGACTGAAGAAGTGCGTGAAGATAGAGATCACTGGGCGACCAATATAACTGGTGATATCTATGGCAAAGGGTGGATAAGTGAGATGTACGGTTACTCATTCGGAGCTGCGGAAGTAAGCGTTATTTTTCCTCTCATCTTCTCCAAAGGACAATTTGAACAGCTTTTTGCTATCATACTTTGTTTTTTGGCTGCTTGAGATATGTCCTTTATTCTCTTGATTTGTGTTTTGCTTATGTGACTTCTTTGCCCAACCCAGGTTGGTCTCCGGCACAAAATTAATGACAATTTGATGATATACCCGGGTTATATTCCTCGTCCCGAGATTGAGCCTATACTTCTTCACTATGGTTTGCCATTTAGTGTTGGAAATTGGTCCTTTAGTAAATTAAATCACCATGAAGATGATATTGTCTATGACTGTAACCGGCTTTTCCCTGAGCCTCCTTATCCTCGAGAGGTATGTGTCCAAATTAGTTCCTTGGATATTTTGATACGCTTTTATGAAAAACATCGGGACAAGGTCTTCAACAATTGCATTGTTTTTATGCATTGAGTTTGTTGCAATTCTAGTTCATAAGTTGTTATTTGTTGCTAAAAAGTCGAATTTCATGCAGATACAACAAATGGAATCTGATTCAAATAAGAAGCGAGGGCTATTTATAAATATAGAGTGTATCAACCTGTTGAATGAGGGCCTATTGTTGCAACACAAACGAAATGGATGCCCGAAGCCACAGTGGTCAAAATATTTAAGCTTTTTAAAGAGTAAAACTTTTACTGACTTAACTAAACCCAAGTATCCAACCCCTGCTACTCTAGTAATGAAGGAAGATCGTGTTCAGAAACAACCAGTGAAGGGAGATCATGCTCAGAAACAACCGGTGAAGGAAGATCTTGTCCAGAAACAACCAGTGAAGGAAGATCTTGTTCAGAAACAACCGGTGCTTGATGAACTGCAGGAACCATATCCGAAAATCCACACACTCTTCTCAACGGAGTGCAGTACTTATTTCGATTGGCAAACTGTAGGCCTTATGCATAGTTTCCGCCTGAGCGGCCAGCCTGGGAACATTACACGACTTCTCAGCTGTACCGACGAGGATTTAAAAGAATACAAAGGTCACAATCTGGCTCCAACCCATTATGTTCCTTCCATGAGCCGACATCCACTGACAGGCGACTGGTAATTTCTTTCTGTCTTCATTGCATGAGCTATCTTCACTCTTTTGCTTATCACTCTGCCTTTTTGATAATTCTTCGGTTCAGTTCATCATGTACTCCTCAATTTTATTTGTTACCGCACATTATTTAGGGAAGGAATAGAATCATTCTATGCGTCATTATACAGATGTAAAGCAAGAAACCATCTAGTAATTTCTTTATCTTCTTCCAATTTTCTGCAAATTTCTCTGCCTGAGATCTAATGTTTGCATCATGATATGGATGTAAAGCAAGAAACCATCTTGTAATTTCTTTATCTTCTTCCGATTTTCAGCAAATATTTCTACCTGAGATCTAATGTGTTATTTACCCTTGAGTCCTACAATCTTCCAATAGATTCCTAATCTTGCTTGTGAAGAGCCAAAAAAGGGTAAAGATTTGGTGAAGAAAAGTTTTATTAAAGGCAGAGGAAGACTAAAATCTTTGATCTATTCTGTCTTAGGTATCCGGCAATTAATAAGCCAGCTGCAGTGCTTCATTGGCTCAATCATGTGAACACTGATGCAGAATTTATAGTTATTCTTGATGCTGATATGATCATGAGAGGATCAATTACGCCGTGGGAGTTCAAAGCAGCTCGTGGACGTCCTGTTTCTACACCCTATGAGTAAGAAGTTAATGTGATACTTCTCTCCAACTCTTCTCTCTCATGCATCTGTCATGCACAAGAATATATGGTTGCTCCATAAGTATGTATTTTGCAATTACTTGATTCCTTCTTTTTTAGGAAATTTTTTGACCTGTTTTCATATATGTTTTGTAGTTACCTTATTGGCTGTGACAATGTGCTTGCAAAACTTCACACAAGCCATCCTGAAGCTTGTGACAAGGTTGGCGGTGTTATTATCATGCACATAGATGATCTCAGGAAATTTGCAATGCTATGGCTGCATAAAACTGAGGAGGTCCGAGCGGATCGAGCTCATTATGCAACGAATATCACAGGAGATATATATCAATCTGGCTGGATCAGTGAGATGTATGGTTACTCATTCGGTGCTGCCGAGGTACTGGGTTATGATGAAGCAAGTAGTCTTTATTTTAAACAATTTTTTTAATCTTACTTTGCTCGTGAACTTTCATACTTATTTTTGTCTATGATTCTATTGTCTCTTACTTTCAAATCTCTTGTTTTAGTCCTTGTCATTGGAATAATTTGAGACATCTACTCATCAAATAACTAATCAAAAGGTTAGCTTATTTCAAGTCTTTATATAGTTAGATGAGGGACTAAAATGGGTCATTTGAAAGTATTTGAATTTTTAGGGGATAACACAAAACATGTATGGAAGTTAAGTTTATTGTTAGACAGATTGCATGTCTCCAGTATACAATCATTTCAATGTCTATATTGTGAACCAAGTTAGAATATACTCGAACAAATATAATGAAACTGAAAATGTTTCTCTTTTCATGTTTGAACAGTTGCAATTACGGCATATTCGAAACAGTGAGATACTATTATACCCAGGATATGTTCCTGATCCTGGAGTTCATTACAGAGTTTTTCATTATGGACTTGAATTTAAAGTTGGGAATTGGAGCTTTGACAAGGCAAATTGGAGGGAAACTGATTTGGTCAACACATGCTGGGCCCATTTTCCTGGCCCACCAGATCCTTCCACACTTGATCAAACTGATAAGGATGCTTTTGCAAGGGATTTGCTTAGCATAGAGTGTATAAGAACACTTAATGAAGCTTTGTATCTGCATCATAAGAAAAGGAACTGCTCAGATCCTAACTCGTTGACCAACTCGAACTCAGAAGATGAAACTGAAGCTGGGATTTCTAGGAAAATCGGCAAGCTTGATGAAAGCTATACTGGAAAAGACGATCAATCAACAGAAAGTTCTCAGGAATCATCAGAGGAGGCAAAAGAGGATGGGATATTTAGTTCTTTGAGGTTGTGGATTATTGCTTTGTGGGTGATTTCTGGTTTGGTGTTCTTGGTAGTGATTGTATCAAGGTTTTCGGGTCGGAAAGGGAAGGGAGTGAGAGGCAAACATCACAGGATCAAGAGAAGAACTGCTTCTTATTCAGGTTTCGTGGATCGGAATGGGCAGGAGAAGTATGTTCGAGATCTTGATGCCTCCTTGTAATGTTTTTTGGCAAAGTGAATTCAGAAGTTTGTTGTAGACAAAAGACAGTTGCAAAGCAACCTGGGAAACAACTCCTCGTGCTGATTGAACATGGCGAGTTCTTGATTTCTGATGTTCGTCGTCTGTTAACTTCTGCAGATTTTTCAAAAGTTGAACAGGAAAAGGAGTTAAAGGGGCTTTGAATCTTGGATGGATTTGTTCTTGGTGAAGTAGCCACCTAATACAGGTTTGTGTAGCCTCTTCTTTTCTGCAAATATTATAGACATTTTGACATTAGATGAGAAAAGTATCTTGTTTACTAAGTAAAAGATGAGATATTTCCACATTTCTCCCTTTAAGCCTGTATTTTTTTTAGTCTTTTCAACAAGAGCAGAACAGAGAAAAGAACCGATGAAATGGTTGCCATCAATTCATTGAAGAGGAAAATAATCTTTTTGAAGGTTTTGTCTTGTATTTGAATTGATCCTAACTTTATAGTTACATTCTCATTGACTAGTCAACACTTGATTGCTTATTGTTTCTCGTATAATGTACATTGAGGTCTTGTCCTTTTAATTATCAGCTTTTTCAAATGGGGGCGTTGCGGTTATCTTTTCTTAGTTCATCACTTGTAAGAACAGGATATTGAGTTATTCACATTTGAGATGGTGATAAATGTTTTGTTTTTTAAGCTATACTTAAAAGGATACCTTATTATTTGTATCATTGAATTTTTACGTACATAATATGCTCAAAATATTATAATGATTTAG

mRNA sequence

CAAAAAATGGCATTGGCAATAATTACCCCAGGAGAGAAGAAGCGGTTCTCGAAAATGAAGGTCGGACTTAGAAAAGTCATTCCCAACTGCTCCCATGTTCTGATCAGAGAAGTAAATTCCAACCCCAAGCTTCTAGTTTTTGTAGAAATTAAAAAATAGCTTCTCATTTTTTTAGAATTTTTTTCTGAATTATTTTATTAGCAACCAATTGATTGCTTTTTTAAGCTAATTTTTCGATCAAACTTGTCCCCACATCACTCCCTCTCCATGTCAAAGAAAACAAAAAGAGTAAGAAAAATGGAAGTTTGACCTGTGGCTGCGATCCGGGGAATGCAACCCAGATCTCTAAAACCAGTACCCAATTGAAATCTAGTCCAGGATTGAACCAAAATGAAAGAATTCTTGCTGTTCGTGGCGATATTTTTGGTGGGGTTTGTGGCCGGCGATGGGTGGAGCAATAATTCCGGCATGGCGGCGCCGCGGCGGATTCATACTCTGTTTTCGGTGGAGTGTCAGAATTACTTCGATTGGCAAACTGTTGGGTTGATGCATAGCTTCAAGAAGTCGAAGCAACCGGGGCCGATCACCCGTTTGCTTAGTTGCACCGATGAGGAGAAGAAGAACTATAGAGGGATGCATTTGGCTCCCACTTTTGAGGTTCCATCCATGAGTAGGCACCCCAAAACTGGCGACTGGTATCCTGCAATAAATAAACCTGCAGGGGTTGTCCACTGGCTTAAACATAGCAAAGAAGCAGAGAATGTTGATTGGGTTGTTATTCTGGATGCAGACATGATCATTAGAGGCCCAATAATACCTTGGGAACTTGGTGCAGAGAAGGGCAGACCTGTTGCAGCCTATTATGGGTTACATTCTTCTCTCTTCTCCCTCCTTCAGTCCCTTATGGCGTCGAAGCTTATTATTCTCCTTAAATTGAGATTTCAACAAAGTTACTTGGATAGATGCTTTTGGCTTTCTACACACCCAGAGCTCTGTGACAAAGTTGGTGGCCTCTTAGCAATGCATATAGATGATCTTCGAGTATTTGCACCAATGTGGCTTTCAAAGACTGAAGAAGTGCGTGAAGATAGAGATCACTGGGCGACCAATATAACTGGTGATATCTATGGCAAAGGGTGGATAAGTGAGATGTACGGTTACTCATTCGGAGCTGCGGAAGTTGGTCTCCGGCACAAAATTAATGACAATTTGATGATATACCCGGGTTATATTCCTCGTCCCGAGATTGAGCCTATACTTCTTCACTATGGTTTGCCATTTAGTGTTGGAAATTGGTCCTTTAGTAAATTAAATCACCATGAAGATGATATTGTCTATGACTGTAACCGGCTTTTCCCTGAGCCTCCTTATCCTCGAGAGATACAACAAATGGAATCTGATTCAAATAAGAAGCGAGGGCTATTTATAAATATAGAGTGTATCAACCTGTTGAATGAGGGCCTATTGTTGCAACACAAACGAAATGGATGCCCGAAGCCACAGTGGTCAAAATATTTAAGCTTTTTAAAGAGTAAAACTTTTACTGACTTAACTAAACCCAAGTATCCAACCCCTGCTACTCTAGTAATGAAGGAAGATCGTGTTCAGAAACAACCAGTGAAGGGAGATCATGCTCAGAAACAACCGGTGAAGGAAGATCTTGTCCAGAAACAACCAGTGAAGGAAGATCTTGTTCAGAAACAACCGGTGCTTGATGAACTGCAGGAACCATATCCGAAAATCCACACACTCTTCTCAACGGAGTGCAGTACTTATTTCGATTGGCAAACTGTAGGCCTTATGCATAGTTTCCGCCTGAGCGGCCAGCCTGGGAACATTACACGACTTCTCAGCTGTACCGACGAGGATTTAAAAGAATACAAAGGTCACAATCTGGCTCCAACCCATTATGTTCCTTCCATGAGCCGACATCCACTGACAGGCGACTGGTATCCGGCAATTAATAAGCCAGCTGCAGTGCTTCATTGGCTCAATCATGTGAACACTGATGCAGAATTTATAGTTATTCTTGATGCTGATATGATCATGAGAGGATCAATTACGCCGTGGGAGTTCAAAGCAGCTCGTGGACGTCCTGTTTCTACACCCTATGATTACCTTATTGGCTGTGACAATGTGCTTGCAAAACTTCACACAAGCCATCCTGAAGCTTGTGACAAGGTTGGCGGTGTTATTATCATGCACATAGATGATCTCAGGAAATTTGCAATGCTATGGCTGCATAAAACTGAGGAGGTCCGAGCGGATCGAGCTCATTATGCAACGAATATCACAGGAGATATATATCAATCTGGCTGGATCAGTGAGATGTATGGTTACTCATTCGGTGCTGCCGAGTTGCAATTACGGCATATTCGAAACAGTGAGATACTATTATACCCAGGATATGTTCCTGATCCTGGAGTTCATTACAGAGTTTTTCATTATGGACTTGAATTTAAAGTTGGGAATTGGAGCTTTGACAAGGCAAATTGGAGGGAAACTGATTTGGTCAACACATGCTGGGCCCATTTTCCTGGCCCACCAGATCCTTCCACACTTGATCAAACTGATAAGGATGCTTTTGCAAGGGATTTGCTTAGCATAGAGTGTATAAGAACACTTAATGAAGCTTTGTATCTGCATCATAAGAAAAGGAACTGCTCAGATCCTAACTCGTTGACCAACTCGAACTCAGAAGATGAAACTGAAGCTGGGATTTCTAGGAAAATCGGCAAGCTTGATGAAAGCTATACTGGAAAAGACGATCAATCAACAGAAAGTTCTCAGGAATCATCAGAGGAGGCAAAAGAGGATGGGATATTTAGTTCTTTGAGGTTGTGGATTATTGCTTTGTGGGTGATTTCTGGTTTGGTGTTCTTGGTAGTGATTGTATCAAGGTTTTCGGGTCGGAAAGGGAAGGGAGTGAGAGGCAAACATCACAGGATCAAGAGAAGAACTGCTTCTTATTCAGGTTTCGTGGATCGGAATGGGCAGGAGAAGTATGTTCGAGATCTTGATGCCTCCTTGTAATGTTTTTTGGCAAAGTGAATTCAGAAGTTTGTTGTAGACAAAAGACAGTTGCAAAGCAACCTGGGAAACAACTCCTCGTGCTGATTGAACATGGCGAGTTCTTGATTTCTGATGTTCGTCGTCTGTTAACTTCTGCAGATTTTTCAAAAGTTGAACAGGAAAAGGAGTTAAAGGGGCTTTGAATCTTGGATGGATTTGTTCTTGGTGAAGTAGCCACCTAATACAGGTTTGTGTAGCCTCTTCTTTTCTGCAAATATTATAGACATTTTGACATTAGATGAGAAAAGTATCTTGTTTACTAAGTAAAAGATGAGATATTTCCACATTTCTCCCTTTAAGCCTGTATTTTTTTTAGTCTTTTCAACAAGAGCAGAACAGAGAAAAGAACCGATGAAATGGTTGCCATCAATTCATTGAAGAGGAAAATAATCTTTTTGAAGGTTTTGTCTTGTATTTGAATTGATCCTAACTTTATAGTTACATTCTCATTGACTAGTCAACACTTGATTGCTTATTGTTTCTCGTATAATGTACATTGAGGTCTTGTCCTTTTAATTATCAGCTTTTTCAAATGGGGGCGTTGCGGTTATCTTTTCTTAGTTCATCACTTGTAAGAACAGGATATTGAGTTATTCACATTTGAGATGGTGATAAATGTTTTGTTTTTTAAGCTATACTTAAAAGGATACCTTATTATTTGTATCATTGAATTTTTACGTACATAATATGCTCAAAATATTATAATGATTTAG

Coding sequence (CDS)

ATGAAAGAATTCTTGCTGTTCGTGGCGATATTTTTGGTGGGGTTTGTGGCCGGCGATGGGTGGAGCAATAATTCCGGCATGGCGGCGCCGCGGCGGATTCATACTCTGTTTTCGGTGGAGTGTCAGAATTACTTCGATTGGCAAACTGTTGGGTTGATGCATAGCTTCAAGAAGTCGAAGCAACCGGGGCCGATCACCCGTTTGCTTAGTTGCACCGATGAGGAGAAGAAGAACTATAGAGGGATGCATTTGGCTCCCACTTTTGAGGTTCCATCCATGAGTAGGCACCCCAAAACTGGCGACTGGTATCCTGCAATAAATAAACCTGCAGGGGTTGTCCACTGGCTTAAACATAGCAAAGAAGCAGAGAATGTTGATTGGGTTGTTATTCTGGATGCAGACATGATCATTAGAGGCCCAATAATACCTTGGGAACTTGGTGCAGAGAAGGGCAGACCTGTTGCAGCCTATTATGGGTTACATTCTTCTCTCTTCTCCCTCCTTCAGTCCCTTATGGCGTCGAAGCTTATTATTCTCCTTAAATTGAGATTTCAACAAAGTTACTTGGATAGATGCTTTTGGCTTTCTACACACCCAGAGCTCTGTGACAAAGTTGGTGGCCTCTTAGCAATGCATATAGATGATCTTCGAGTATTTGCACCAATGTGGCTTTCAAAGACTGAAGAAGTGCGTGAAGATAGAGATCACTGGGCGACCAATATAACTGGTGATATCTATGGCAAAGGGTGGATAAGTGAGATGTACGGTTACTCATTCGGAGCTGCGGAAGTTGGTCTCCGGCACAAAATTAATGACAATTTGATGATATACCCGGGTTATATTCCTCGTCCCGAGATTGAGCCTATACTTCTTCACTATGGTTTGCCATTTAGTGTTGGAAATTGGTCCTTTAGTAAATTAAATCACCATGAAGATGATATTGTCTATGACTGTAACCGGCTTTTCCCTGAGCCTCCTTATCCTCGAGAGATACAACAAATGGAATCTGATTCAAATAAGAAGCGAGGGCTATTTATAAATATAGAGTGTATCAACCTGTTGAATGAGGGCCTATTGTTGCAACACAAACGAAATGGATGCCCGAAGCCACAGTGGTCAAAATATTTAAGCTTTTTAAAGAGTAAAACTTTTACTGACTTAACTAAACCCAAGTATCCAACCCCTGCTACTCTAGTAATGAAGGAAGATCGTGTTCAGAAACAACCAGTGAAGGGAGATCATGCTCAGAAACAACCGGTGAAGGAAGATCTTGTCCAGAAACAACCAGTGAAGGAAGATCTTGTTCAGAAACAACCGGTGCTTGATGAACTGCAGGAACCATATCCGAAAATCCACACACTCTTCTCAACGGAGTGCAGTACTTATTTCGATTGGCAAACTGTAGGCCTTATGCATAGTTTCCGCCTGAGCGGCCAGCCTGGGAACATTACACGACTTCTCAGCTGTACCGACGAGGATTTAAAAGAATACAAAGGTCACAATCTGGCTCCAACCCATTATGTTCCTTCCATGAGCCGACATCCACTGACAGGCGACTGGTATCCGGCAATTAATAAGCCAGCTGCAGTGCTTCATTGGCTCAATCATGTGAACACTGATGCAGAATTTATAGTTATTCTTGATGCTGATATGATCATGAGAGGATCAATTACGCCGTGGGAGTTCAAAGCAGCTCGTGGACGTCCTGTTTCTACACCCTATGATTACCTTATTGGCTGTGACAATGTGCTTGCAAAACTTCACACAAGCCATCCTGAAGCTTGTGACAAGGTTGGCGGTGTTATTATCATGCACATAGATGATCTCAGGAAATTTGCAATGCTATGGCTGCATAAAACTGAGGAGGTCCGAGCGGATCGAGCTCATTATGCAACGAATATCACAGGAGATATATATCAATCTGGCTGGATCAGTGAGATGTATGGTTACTCATTCGGTGCTGCCGAGTTGCAATTACGGCATATTCGAAACAGTGAGATACTATTATACCCAGGATATGTTCCTGATCCTGGAGTTCATTACAGAGTTTTTCATTATGGACTTGAATTTAAAGTTGGGAATTGGAGCTTTGACAAGGCAAATTGGAGGGAAACTGATTTGGTCAACACATGCTGGGCCCATTTTCCTGGCCCACCAGATCCTTCCACACTTGATCAAACTGATAAGGATGCTTTTGCAAGGGATTTGCTTAGCATAGAGTGTATAAGAACACTTAATGAAGCTTTGTATCTGCATCATAAGAAAAGGAACTGCTCAGATCCTAACTCGTTGACCAACTCGAACTCAGAAGATGAAACTGAAGCTGGGATTTCTAGGAAAATCGGCAAGCTTGATGAAAGCTATACTGGAAAAGACGATCAATCAACAGAAAGTTCTCAGGAATCATCAGAGGAGGCAAAAGAGGATGGGATATTTAGTTCTTTGAGGTTGTGGATTATTGCTTTGTGGGTGATTTCTGGTTTGGTGTTCTTGGTAGTGATTGTATCAAGGTTTTCGGGTCGGAAAGGGAAGGGAGTGAGAGGCAAACATCACAGGATCAAGAGAAGAACTGCTTCTTATTCAGGTTTCGTGGATCGGAATGGGCAGGAGAAGTATGTTCGAGATCTTGATGCCTCCTTGTAA

Protein sequence

MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSKQPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKEAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILLKLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVGNWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPVKEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLRHIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPDPSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRKIGKLDESYTGKDDQSTESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL
Homology
BLAST of Clc05G15090 vs. NCBI nr
Match: XP_038899299.1 (peptidyl serine alpha-galactosyltransferase [Benincasa hispida])

HSP 1 Score: 1683.7 bits (4359), Expect = 0.0e+00
Identity = 812/881 (92.17%), Postives = 825/881 (93.64%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           MKEFLLFVAIFLVGFVAGDGWSNNSGMA PRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAPPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNY+GMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYKGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAE+VDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAEDVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRP+IEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKL+HHED IVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLLL
Sbjct: 301 NWSFSKLDHHEDGIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLL 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRV          QKQPV
Sbjct: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRV----------QKQPV 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
           K+DLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTEC+TYFDWQTVGLMHSF LSGQP
Sbjct: 421 KKDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECTTYFDWQTVGLMHSFHLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AEFIVILDADMIMRGSITPWEFKAARG PVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEFIVILDADMIMRGSITPWEFKAARGHPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKFAMLWLHKTEEVRADRAHYA NITGDIYQSGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYAKNITGDIYQSGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIRN+EILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFP PPD
Sbjct: 661 HIRNNEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPVPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
           PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPN+LTNS SE E+EAG+SRK
Sbjct: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNALTNSKSEYESEAGVSRK 780

Query: 781 IGKLDESYTGKDDQ-STESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFS 840
           IGKLDESY GKDD  STESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFS
Sbjct: 781 IGKLDESYIGKDDHLSTESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFS 840

Query: 841 GRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           GRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKY RDLDASL
Sbjct: 841 GRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYARDLDASL 848

BLAST of Clc05G15090 vs. NCBI nr
Match: XP_011651582.2 (peptidyl serine alpha-galactosyltransferase [Cucumis sativus])

HSP 1 Score: 1663.7 bits (4307), Expect = 0.0e+00
Identity = 801/901 (88.90%), Postives = 823/901 (91.34%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+EFLLFVAIFLVGFVA DGW+NNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MREFLLFVAIFLVGFVASDGWTNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKK YRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKIN+NLMIYPGYIPRP+IEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINENLMIYPGYIPRPDIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKLNHHED IVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLL 
Sbjct: 301 NWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLW 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKE------------------ 420
           QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPA+LVMKE                  
Sbjct: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPASLVMKEDCVQKQPVKVDHVQKQPV 420

Query: 421 --DRVQKQPVKGDHAQKQPVKEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECS 480
             DRVQKQPVK D  QKQPVK D VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTEC+
Sbjct: 421 KVDRVQKQPVKVDRVQKQPVKVDRVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECT 480

Query: 481 TYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDW 540
           TYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLK+YKGHNLAPTHYVPSMSRHPLTGDW
Sbjct: 481 TYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDW 540

Query: 541 YPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGC 600
           YPAINKPAAVLHWLNHVNTDAE+IVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGC
Sbjct: 541 YPAINKPAAVLHWLNHVNTDAEYIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGC 600

Query: 601 DNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQ 660
           DNVLAKLHTSHPEACDKVGGVIIMHIDDLRKF+MLWLHKTEEVRADRAHYATNITGDIYQ
Sbjct: 601 DNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFSMLWLHKTEEVRADRAHYATNITGDIYQ 660

Query: 661 SGWISEMYGYSFGAAELQLRHIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKA 720
           SGWISEMYGYSFGAAELQLRHIR+SEILLYPGY PDPGVHYRVFHYGLEFKVGNWSFDKA
Sbjct: 661 SGWISEMYGYSFGAAELQLRHIRSSEILLYPGYAPDPGVHYRVFHYGLEFKVGNWSFDKA 720

Query: 721 NWRETDLVNTCWAHFPGPPDPSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSD 780
           NWRETDLVN CWA FP PPDPSTLDQ+DKD FARDLLSIECIRTLNEALYLHHKKRNCSD
Sbjct: 721 NWRETDLVNRCWAQFPAPPDPSTLDQSDKDGFARDLLSIECIRTLNEALYLHHKKRNCSD 780

Query: 781 PNSLTNSNSEDETEAGISRKIGKLDESYTGKDDQ-STESSQESSEEAKEDGIFSSLRLWI 840
           PN L N N +DE+E G+SRKIGKLDESYTGK+D  ST+SSQESS+ AKEDGIF SLRLWI
Sbjct: 781 PNLLANPNLDDESEVGVSRKIGKLDESYTGKEDHLSTDSSQESSQAAKEDGIFGSLRLWI 840

Query: 841 IALWVISGLVFLVVIVSRFSGRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDAS 881
           IALWVISGLVFLVVI+S+FSGRK KGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDAS
Sbjct: 841 IALWVISGLVFLVVIISKFSGRKAKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDAS 878

BLAST of Clc05G15090 vs. NCBI nr
Match: KGN58321.2 (hypothetical protein Csa_017560 [Cucumis sativus])

HSP 1 Score: 1654.0 bits (4282), Expect = 0.0e+00
Identity = 793/881 (90.01%), Postives = 815/881 (92.51%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+EFLLFVAIFLVGFVA DGW+NNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MREFLLFVAIFLVGFVASDGWTNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKK YRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKIN+NLMIYPGYIPRP+IEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINENLMIYPGYIPRPDIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKLNHHED IVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLL 
Sbjct: 301 NWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLW 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPA+LVMKED VQKQPVK D       
Sbjct: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPASLVMKEDCVQKQPVKVDR------ 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
               VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTEC+TYFDWQTVGLMHSFRLSGQP
Sbjct: 421 ----VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECTTYFDWQTVGLMHSFRLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDEDLK+YKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AE+IVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEYIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKF+MLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFSMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIR+SEILLYPGY PDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVN CWA FP PPD
Sbjct: 661 HIRSSEILLYPGYAPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNRCWAQFPAPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
           PSTLDQ+DKD FARDLLSIECIRTLNEALYLHHKKRNCSDPN L N N +DE+E G+SRK
Sbjct: 721 PSTLDQSDKDGFARDLLSIECIRTLNEALYLHHKKRNCSDPNLLANPNLDDESEVGVSRK 780

Query: 781 IGKLDESYTGKDDQ-STESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFS 840
           IGKLDESYTGK+D  ST+SSQESS+ AKEDGIF SLRLWIIALWVISGLVFLVVI+S+FS
Sbjct: 781 IGKLDESYTGKEDHLSTDSSQESSQAAKEDGIFGSLRLWIIALWVISGLVFLVVIISKFS 840

Query: 841 GRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           GRK KGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL
Sbjct: 841 GRKAKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 848

BLAST of Clc05G15090 vs. NCBI nr
Match: XP_008449998.1 (PREDICTED: uncharacterized protein LOC103491714 isoform X1 [Cucumis melo])

HSP 1 Score: 1653.6 bits (4281), Expect = 0.0e+00
Identity = 796/880 (90.45%), Postives = 812/880 (92.27%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+EFLLFVAIFLV FVA DGW+NNS MAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MREFLLFVAIFLVRFVASDGWTNNSSMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTD+EKK YRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDDEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLL 
Sbjct: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLW 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKP+WSKYLSFLKSKTFTDLTKPKYPTP+TLVMKEDRVQKQPVK         
Sbjct: 361 QHKRNGCPKPEWSKYLSFLKSKTFTDLTKPKYPTPSTLVMKEDRVQKQPVKVYR------ 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
               VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTEC+TYFDWQTVGLMHSFRLSGQP
Sbjct: 421 ----VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECTTYFDWQTVGLMHSFRLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDEDLK+YKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AE+IVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEYIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVN CWA FP PPD
Sbjct: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNRCWAQFPAPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
           PSTLDQTDK  FARDLLSIECIRTLNEALYLHHKKRNCSDPN LTN NSEDE+E G+S K
Sbjct: 721 PSTLDQTDKGGFARDLLSIECIRTLNEALYLHHKKRNCSDPNLLTNLNSEDESETGVSWK 780

Query: 781 IGKLDESYTGKDDQSTESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSG 840
           IGKLDESYTGK   STESSQESS EAKEDGIFSSLR WIIALWVISGLVFLVVI+S+FSG
Sbjct: 781 IGKLDESYTGKGHLSTESSQESSVEAKEDGIFSSLRSWIIALWVISGLVFLVVIISKFSG 840

Query: 841 RKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           RK KGVRGKHHRIKRRTASYS FVDRNGQEKYV+DLDASL
Sbjct: 841 RKAKGVRGKHHRIKRRTASYSVFVDRNGQEKYVKDLDASL 847

BLAST of Clc05G15090 vs. NCBI nr
Match: XP_016900856.1 (PREDICTED: uncharacterized protein LOC103491714 isoform X2 [Cucumis melo])

HSP 1 Score: 1643.2 bits (4254), Expect = 0.0e+00
Identity = 794/880 (90.23%), Postives = 810/880 (92.05%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+EFLLFVAIFLV FVA DGW+NNS MAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MREFLLFVAIFLVRFVASDGWTNNSSMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTD+EKK YRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDDEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLL 
Sbjct: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLW 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKP+WSKYLSFLKSKTFTDLTKPKYPTP+TLVMKEDRVQKQPVK         
Sbjct: 361 QHKRNGCPKPEWSKYLSFLKSKTFTDLTKPKYPTPSTLVMKEDRVQKQPVKVYR------ 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
               VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTEC+TYFDWQTVGLMHSFRLSGQP
Sbjct: 421 ----VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECTTYFDWQTVGLMHSFRLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDEDLK+YKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AE+IVILDADMIMRGSITPWEFKAARGRPVSTP  YLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEYIVILDADMIMRGSITPWEFKAARGRPVSTP--YLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVN CWA FP PPD
Sbjct: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNRCWAQFPAPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
           PSTLDQTDK  FARDLLSIECIRTLNEALYLHHKKRNCSDPN LTN NSEDE+E G+S K
Sbjct: 721 PSTLDQTDKGGFARDLLSIECIRTLNEALYLHHKKRNCSDPNLLTNLNSEDESETGVSWK 780

Query: 781 IGKLDESYTGKDDQSTESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSG 840
           IGKLDESYTGK   STESSQESS EAKEDGIFSSLR WIIALWVISGLVFLVVI+S+FSG
Sbjct: 781 IGKLDESYTGKGHLSTESSQESSVEAKEDGIFSSLRSWIIALWVISGLVFLVVIISKFSG 840

Query: 841 RKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           RK KGVRGKHHRIKRRTASYS FVDRNGQEKYV+DLDASL
Sbjct: 841 RKAKGVRGKHHRIKRRTASYSVFVDRNGQEKYVKDLDASL 845

BLAST of Clc05G15090 vs. ExPASy Swiss-Prot
Match: Q8VYF9 (Peptidyl serine alpha-galactosyltransferase OS=Arabidopsis thaliana OX=3702 GN=SERGT1 PE=2 SV=1)

HSP 1 Score: 1225.7 bits (3170), Expect = 0.0e+00
Identity = 577/846 (68.20%), Postives = 674/846 (79.67%), Query Frame = 0

Query: 22  SNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSKQPGPITRLLSCTDEEKKNYRG 81
           ++ SG  AP RIHTLFSVECQNYFDWQTVGLMHSF KS QPGPITRLLSCTD++KK YRG
Sbjct: 19  ADESGQMAPYRIHTLFSVECQNYFDWQTVGLMHSFLKSGQPGPITRLLSCTDDQKKTYRG 78

Query: 82  MHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKEAENVDWVVILDADMIIRGPI 141
           M+LAPTFEVPS SRHPKTGDWYPAINKP GV++WL+HS+EA++VDWVVILDADMIIRGPI
Sbjct: 79  MNLAPTFEVPSWSRHPKTGDWYPAINKPVGVLYWLQHSEEAKHVDWVVILDADMIIRGPI 138

Query: 142 IPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILLKLRFQQSYLDRCFWLSTHPEL 201
           IPWELGAE+GRP AA+YG      +LL  L                        + HPEL
Sbjct: 139 IPWELGAERGRPFAAHYGYLVGCDNLLVRLH-----------------------TKHPEL 198

Query: 202 CDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGA 261
           CDKVGGLLAMHIDDLRV AP+WLSKTE+VR+D  HW TN+TGDIYGKGWISEMYGYSFGA
Sbjct: 199 CDKVGGLLAMHIDDLRVLAPLWLSKTEDVRQDTAHWTTNLTGDIYGKGWISEMYGYSFGA 258

Query: 262 AEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVGNWSFSKLNHHEDDIVYDCNRL 321
           AE GL+HKIND+LMIYPGY+PR  +EP+L+HYGLPFS+GNWSF+KL+HHED+IVYDCNRL
Sbjct: 259 AEAGLKHKINDDLMIYPGYVPREGVEPVLMHYGLPFSIGNWSFTKLDHHEDNIVYDCNRL 318

Query: 322 FPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKS 381
           FPEPPYPRE++ ME D +K+RGL +++EC+N LNEGL+L+H  NGCPKP+W+KYLSFLKS
Sbjct: 319 FPEPPYPREVKIMEPDPSKRRGLILSLECMNTLNEGLILRHAENGCPKPKWTKYLSFLKS 378

Query: 382 KTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPVKEDLVQKQPVKEDLVQKQPVL 441
           KTF +LT+PK   P ++ +  D                      Q +P         P +
Sbjct: 379 KTFMELTRPKLLAPGSVHILPD----------------------QHEP---------PPI 438

Query: 442 DELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKEYKGHN 501
           DE +  YPKIHTLFSTEC+TYFDWQTVG MHSFR SGQPGNITRLLSCTDE LK YKGH+
Sbjct: 439 DEFKGTYPKIHTLFSTECTTYFDWQTVGFMHSFRQSGQPGNITRLLSCTDEALKNYKGHD 498

Query: 502 LAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGSITPWE 561
           LAPTHYVPSMSRHPLTGDWYPAINKPAAV+HWL+H N DAE++VILDADMI+RG ITPWE
Sbjct: 499 LAPTHYVPSMSRHPLTGDWYPAINKPAAVVHWLHHTNIDAEYVVILDADMILRGPITPWE 558

Query: 562 FKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTE 621
           FKAARGRPVSTPYDYLIGCDN LA+LHT +PEACDKVGGVIIMHI+DLRKFAM WL KT+
Sbjct: 559 FKAARGRPVSTPYDYLIGCDNDLARLHTRNPEACDKVGGVIIMHIEDLRKFAMYWLLKTQ 618

Query: 622 EVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLRHIRNSEILLYPGYVPDPGVHY 681
           EVRAD+ HY   +TGDIY+SGWISEMYGYSFGAAEL LRH  N EI++YPGYVP+PG  Y
Sbjct: 619 EVRADKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPGYVPEPGADY 678

Query: 682 RVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPDPSTLDQTDKDAFARDLLSIEC 741
           RVFHYGLEFKVGNWSFDKANWR TDL+N CWA FP PP PS + QTD D   RDLLSIEC
Sbjct: 679 RVFHYGLEFKVGNWSFDKANWRNTDLINKCWAKFPDPPSPSAVHQTDNDLRQRDLLSIEC 738

Query: 742 IRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRKIGKLDESYTGKDDQSTESSQE 801
            + LNEAL+LHHK+RNC +P       SE   +  +SRK+G ++     K  Q ++ ++E
Sbjct: 739 GQKLNEALFLHHKRRNCPEP------GSESTEKISVSRKVGNIET----KQTQGSDETKE 798

Query: 802 SSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSGRKGKG-VRGKHHRIKRRTA-S 861
           SS  ++ +G FS+L+LW+IALW+ISG+ FLVV++  FS R+G+G  RGK +R KRRT+ S
Sbjct: 799 SSGSSESEGRFSTLKLWVIALWLISGVGFLVVMLLVFSTRRGRGTTRGKGYRNKRRTSYS 800

Query: 862 YSGFVD 866
            +GF+D
Sbjct: 859 NTGFLD 800

BLAST of Clc05G15090 vs. ExPASy Swiss-Prot
Match: H3JU05 (Peptidyl serine alpha-galactosyltransferase OS=Chlamydomonas reinhardtii OX=3055 GN=SGT1 PE=1 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 7.0e-50
Identity = 124/350 (35.43%), Postives = 182/350 (52.00%), Query Frame = 0

Query: 5   LLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSKQPGP 64
           L+  A+ L+  +     +   G A    +H  F  +CQ Y DWQ+VG   SFK S QPG 
Sbjct: 9   LVLGALLLLLALQHGASAEEPGFANRTGVHVAFLTDCQMYSDWQSVGAAFSFKMSGQPGS 68

Query: 65  ITRLLSCTDEEKKNYRG--MHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKEA 124
           + R++ C++E+ KNY    + +  T+  P  +   +TGD Y A NKP  V+ WL H+   
Sbjct: 69  VIRVMCCSEEQAKNYNKGLLGMVDTWVAPDATHSKRTGDRYAAYNKPEAVIDWLDHN--V 128

Query: 125 ENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILLKL 184
              D+V++LD+DM++R P     +G  KG  V A Y             +A++L +    
Sbjct: 129 PKHDYVLVLDSDMVLRRPFFVENMGPRKGLAVGARYTYMIG--------VANELAV---- 188

Query: 185 RFQQSYLDRCFWLS-THPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNI 244
           R       R   L+       D+VGG   +H DDL+  +  WL  +E+VR D    A  +
Sbjct: 189 RHIPHVPPRNDTLAGPFGRRADQVGGFFFIHKDDLKAMSHDWLKFSEDVRVDDQ--AYRL 248

Query: 245 TGDIYG-----KGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLP 304
           +GD+Y      + WISEMYGY+FGAA   + HK +   MIYPGY PR  I P L+HYGL 
Sbjct: 249 SGDVYAIHPGDRPWISEMYGYAFGAANHNVWHKWDTFSMIYPGYEPREGI-PKLMHYGLL 308

Query: 305 FSVG-NWSFSKLNHHEDDI-------VYDCNR----LFPEPPYPREIQQM 335
           F +G N+SF K  H++ D+       + D  R    +FPEPP P  ++++
Sbjct: 309 FEIGKNYSFDKHWHYDFDVTVCPPWDLKDPKRRTHGIFPEPPRPSSLRKV 341

BLAST of Clc05G15090 vs. ExPASy Swiss-Prot
Match: G7LG31 (Hydroxyproline O-arabinosyltransferase RDN2 OS=Medicago truncatula OX=3880 GN=RDN2 PE=3 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 3.8e-11
Identity = 51/215 (23.72%), Postives = 93/215 (43.26%), Query Frame = 0

Query: 521 YPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGC 580
           Y  +N+P A + WL   N + E+I++ + D +    + P    A    P + P+ Y+   
Sbjct: 134 YVVLNRPWAFVQWLEKANIEEEYILMAEPDHVF---VRPLPNLAFGENPAAFPFFYIKPK 193

Query: 581 DN--VLAKLHTSHPEACDKVGGV----IIMHIDDLRKFAMLWLHKTEEVRADRAHYATNI 640
           +N  ++ K +         V  +    +I+  D + K A  W++ + +++ D        
Sbjct: 194 ENEKIVRKYYPEENGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISMKMKEDPE------ 253

Query: 641 TGDIYQSGWISEMYGYSFGAAELQLRHIRNSEILLYPGYVPDPGVHYRV-FHYGLEF--- 700
           T   +  GW+ EMYGY+  +A   +RHI   + +L P +  +    Y + + YG ++   
Sbjct: 254 TDKAF--GWVLEMYGYAVASALHGVRHILRKDFMLQPPWDTETFNKYIIHYTYGCDYNLK 313

Query: 701 ------KVGNWSFDKANWRETDLVNTCWAHFPGPP 720
                 K+G W FDK             +H  GPP
Sbjct: 314 GELTYGKIGEWRFDKR------------SHLRGPP 325

BLAST of Clc05G15090 vs. ExPASy Swiss-Prot
Match: Q9FY51 (Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT3 PE=1 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 4.9e-11
Identity = 68/315 (21.59%), Postives = 126/315 (40.00%), Query Frame = 0

Query: 429 PVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFR----LSGQP-GNI 488
           P+ + +VQ    + + +      H   +   + Y  WQ   + + ++    L G   G  
Sbjct: 41  PLLDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGF 100

Query: 489 TRLLSCTDEDLKEYKGHNL---APTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 548
           TR+L   + D       NL    PT  V  +   P     Y  +N+P A + WL      
Sbjct: 101 TRILHSGNSD-------NLMDEIPTFVVDPLP--PGLDRGYVVLNRPWAFVQWLERATIK 160

Query: 549 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKV 608
            +++++ + D +    + P    A  G P + P+ Y+     +N++ K + +       +
Sbjct: 161 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 220

Query: 609 GGV----IIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGA 668
             +    +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +
Sbjct: 221 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 280

Query: 669 AELQLRHIRNSEILLYPGY-VPDPGVHYRVFHYGLEF---------KVGNWSFDKANWRE 720
           A   +RHI   + +L P + +   G     + YG ++         K+G W FDK     
Sbjct: 281 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKR---- 323

BLAST of Clc05G15090 vs. ExPASy TrEMBL
Match: A0A0A0LDQ3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G597250 PE=4 SV=1)

HSP 1 Score: 1656.0 bits (4287), Expect = 0.0e+00
Identity = 801/921 (86.97%), Postives = 823/921 (89.36%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+EFLLFVAIFLVGFVA DGW+NNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MREFLLFVAIFLVGFVASDGWTNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKK YRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKIN+NLMIYPGYIPRP+IEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINENLMIYPGYIPRPDIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKLNHHED IVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLL 
Sbjct: 301 NWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLW 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKE------------------ 420
           QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPA+LVMKE                  
Sbjct: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPASLVMKEDCVQKQPVKVDRVQKQPV 420

Query: 421 ----------------------DRVQKQPVKGDHAQKQPVKEDLVQKQPVKEDLVQKQPV 480
                                 DRVQKQPVK D  QKQPVK D VQKQPVKEDLVQKQPV
Sbjct: 421 KVDRVQKQPVKVDRVQKQPVKVDRVQKQPVKVDRVQKQPVKVDRVQKQPVKEDLVQKQPV 480

Query: 481 LDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKEYKGH 540
           LDELQEPYPKIHTLFSTEC+TYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLK+YKGH
Sbjct: 481 LDELQEPYPKIHTLFSTECTTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKKYKGH 540

Query: 541 NLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGSITPW 600
           NLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTDAE+IVILDADMIMRGSITPW
Sbjct: 541 NLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTDAEYIVILDADMIMRGSITPW 600

Query: 601 EFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKT 660
           EFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKF+MLWLHKT
Sbjct: 601 EFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFSMLWLHKT 660

Query: 661 EEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLRHIRNSEILLYPGYVPDPGVH 720
           EEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLRHIR+SEILLYPGY PDPGVH
Sbjct: 661 EEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLRHIRSSEILLYPGYAPDPGVH 720

Query: 721 YRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPDPSTLDQTDKDAFARDLLSIE 780
           YRVFHYGLEFKVGNWSFDKANWRETDLVN CWA FP PPDPSTLDQ+DKD FARDLLSIE
Sbjct: 721 YRVFHYGLEFKVGNWSFDKANWRETDLVNRCWAQFPAPPDPSTLDQSDKDGFARDLLSIE 780

Query: 781 CIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRKIGKLDESYTGKDDQ-STESS 840
           CIRTLNEALYLHHKKRNCSDPN L N N +DE+E G+SRKIGKLDESYTGK+D  ST+SS
Sbjct: 781 CIRTLNEALYLHHKKRNCSDPNLLANPNLDDESEVGVSRKIGKLDESYTGKEDHLSTDSS 840

Query: 841 QESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSGRKGKGVRGKHHRIKRRTAS 881
           QESS+ AKEDGIF SLRLWIIALWVISGLVFLVVI+S+FSGRK KGVRGKHHRIKRRTAS
Sbjct: 841 QESSQAAKEDGIFGSLRLWIIALWVISGLVFLVVIISKFSGRKAKGVRGKHHRIKRRTAS 898

BLAST of Clc05G15090 vs. ExPASy TrEMBL
Match: A0A1S3BNB4 (uncharacterized protein LOC103491714 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491714 PE=4 SV=1)

HSP 1 Score: 1653.6 bits (4281), Expect = 0.0e+00
Identity = 796/880 (90.45%), Postives = 812/880 (92.27%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+EFLLFVAIFLV FVA DGW+NNS MAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MREFLLFVAIFLVRFVASDGWTNNSSMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTD+EKK YRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDDEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLL 
Sbjct: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLW 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKP+WSKYLSFLKSKTFTDLTKPKYPTP+TLVMKEDRVQKQPVK         
Sbjct: 361 QHKRNGCPKPEWSKYLSFLKSKTFTDLTKPKYPTPSTLVMKEDRVQKQPVKVYR------ 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
               VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTEC+TYFDWQTVGLMHSFRLSGQP
Sbjct: 421 ----VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECTTYFDWQTVGLMHSFRLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDEDLK+YKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AE+IVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEYIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVN CWA FP PPD
Sbjct: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNRCWAQFPAPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
           PSTLDQTDK  FARDLLSIECIRTLNEALYLHHKKRNCSDPN LTN NSEDE+E G+S K
Sbjct: 721 PSTLDQTDKGGFARDLLSIECIRTLNEALYLHHKKRNCSDPNLLTNLNSEDESETGVSWK 780

Query: 781 IGKLDESYTGKDDQSTESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSG 840
           IGKLDESYTGK   STESSQESS EAKEDGIFSSLR WIIALWVISGLVFLVVI+S+FSG
Sbjct: 781 IGKLDESYTGKGHLSTESSQESSVEAKEDGIFSSLRSWIIALWVISGLVFLVVIISKFSG 840

Query: 841 RKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           RK KGVRGKHHRIKRRTASYS FVDRNGQEKYV+DLDASL
Sbjct: 841 RKAKGVRGKHHRIKRRTASYSVFVDRNGQEKYVKDLDASL 847

BLAST of Clc05G15090 vs. ExPASy TrEMBL
Match: A0A1S4DXZ6 (uncharacterized protein LOC103491714 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491714 PE=4 SV=1)

HSP 1 Score: 1643.2 bits (4254), Expect = 0.0e+00
Identity = 794/880 (90.23%), Postives = 810/880 (92.05%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+EFLLFVAIFLV FVA DGW+NNS MAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MREFLLFVAIFLVRFVASDGWTNNSSMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTD+EKK YRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDDEKKKYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLL 
Sbjct: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLW 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKP+WSKYLSFLKSKTFTDLTKPKYPTP+TLVMKEDRVQKQPVK         
Sbjct: 361 QHKRNGCPKPEWSKYLSFLKSKTFTDLTKPKYPTPSTLVMKEDRVQKQPVKVYR------ 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
               VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTEC+TYFDWQTVGLMHSFRLSGQP
Sbjct: 421 ----VQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECTTYFDWQTVGLMHSFRLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDEDLK+YKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AE+IVILDADMIMRGSITPWEFKAARGRPVSTP  YLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEYIVILDADMIMRGSITPWEFKAARGRPVSTP--YLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVN CWA FP PPD
Sbjct: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNRCWAQFPAPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
           PSTLDQTDK  FARDLLSIECIRTLNEALYLHHKKRNCSDPN LTN NSEDE+E G+S K
Sbjct: 721 PSTLDQTDKGGFARDLLSIECIRTLNEALYLHHKKRNCSDPNLLTNLNSEDESETGVSWK 780

Query: 781 IGKLDESYTGKDDQSTESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSG 840
           IGKLDESYTGK   STESSQESS EAKEDGIFSSLR WIIALWVISGLVFLVVI+S+FSG
Sbjct: 781 IGKLDESYTGKGHLSTESSQESSVEAKEDGIFSSLRSWIIALWVISGLVFLVVIISKFSG 840

Query: 841 RKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           RK KGVRGKHHRIKRRTASYS FVDRNGQEKYV+DLDASL
Sbjct: 841 RKAKGVRGKHHRIKRRTASYSVFVDRNGQEKYVKDLDASL 845

BLAST of Clc05G15090 vs. ExPASy TrEMBL
Match: A0A6J1J567 (peptidyl serine alpha-galactosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111481857 PE=4 SV=1)

HSP 1 Score: 1628.6 bits (4216), Expect = 0.0e+00
Identity = 784/881 (88.99%), Postives = 812/881 (92.17%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+ FL+FVAIF++GFVAGDG S NS MAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MRGFLMFVAIFVMGFVAGDGRSINSDMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGM LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMDLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRP+IEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKL HHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLLL
Sbjct: 301 NWSFSKLYHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLL 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKPQWSKYLSFLKSKTF DLTKPKYPTPATLVMKED V KQPVKGD       
Sbjct: 361 QHKRNGCPKPQWSKYLSFLKSKTFADLTKPKYPTPATLVMKEDHVPKQPVKGDR------ 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
               VQKQPVKE+LVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP
Sbjct: 421 ----VQKQPVKEELVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDE+LK+YKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDENLKKYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AEFIVILDADMIMRG ITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEFIVILDADMIMRGPITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIY+SGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIRN+EIL+YPGY PDPGVHYRVFHYGLEFKVGNWSF KANWR+TDLVNTCWA FP PPD
Sbjct: 661 HIRNTEILIYPGYYPDPGVHYRVFHYGLEFKVGNWSFGKANWRDTDLVNTCWAQFPAPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
            STLDQTDK+AFARDLLSIECIRTLNEALYLHHKK NCSDP+SLTNSNSE+E+EAG+SRK
Sbjct: 721 ASTLDQTDKNAFARDLLSIECIRTLNEALYLHHKKSNCSDPSSLTNSNSENESEAGVSRK 780

Query: 781 IGKLDESYTGKDDQ-STESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFS 840
           IGKLDESYTGK +  STESSQESSEE KED +FSSLRLWII++WVISGL+FLV+I+S+FS
Sbjct: 781 IGKLDESYTGKGNHLSTESSQESSEEVKEDAMFSSLRLWIISIWVISGLLFLVLIISKFS 840

Query: 841 GRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           GRK K VRGKH RIKRRTASYSGFVDRNGQEKYVRDLDASL
Sbjct: 841 GRKVKVVRGKHQRIKRRTASYSGFVDRNGQEKYVRDLDASL 848

BLAST of Clc05G15090 vs. ExPASy TrEMBL
Match: A0A6J1F984 (peptidyl serine alpha-galactosyltransferase-like OS=Cucurbita moschata OX=3662 GN=LOC111441973 PE=4 SV=1)

HSP 1 Score: 1625.5 bits (4208), Expect = 0.0e+00
Identity = 783/881 (88.88%), Postives = 809/881 (91.83%), Query Frame = 0

Query: 1   MKEFLLFVAIFLVGFVAGDGWSNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60
           M+ FL+FVA+ L+GFV GDG S NS MAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK
Sbjct: 1   MRGFLVFVAVCLMGFVVGDGRSINSDMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSK 60

Query: 61  QPGPITRLLSCTDEEKKNYRGMHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120
           QPGPITRLLSCTDEEKKNYRGM LAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK
Sbjct: 61  QPGPITRLLSCTDEEKKNYRGMDLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSK 120

Query: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILL 180
           EAENVDWVVILDADMIIRGPIIPWELGAEK RPVAAYYG      ++L  L         
Sbjct: 121 EAENVDWVVILDADMIIRGPIIPWELGAEKSRPVAAYYGYLVGCDNILAKLH-------- 180

Query: 181 KLRFQQSYLDRCFWLSTHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240
                          + HPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN
Sbjct: 181 ---------------TKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATN 240

Query: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVG 300
           ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRP+IEPILLHYGLPFSVG
Sbjct: 241 ITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPRPDIEPILLHYGLPFSVG 300

Query: 301 NWSFSKLNHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLL 360
           NWSFSKL HHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGL INIECINLLNEGLLL
Sbjct: 301 NWSFSKLYHHEDDIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECINLLNEGLLL 360

Query: 361 QHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPV 420
           QHKRNGCPKPQWSKYLSFLKSKTF DLTKPKYPTPATLVMKE          DH  KQPV
Sbjct: 361 QHKRNGCPKPQWSKYLSFLKSKTFADLTKPKYPTPATLVMKE----------DHVPKQPV 420

Query: 421 KEDLVQKQPVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480
           KED VQKQPVKE+LVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP
Sbjct: 421 KEDRVQKQPVKEELVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQP 480

Query: 481 GNITRLLSCTDEDLKEYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540
           GNITRLLSCTDEDLK+YKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD
Sbjct: 481 GNITRLLSCTDEDLKKYKGHNLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 540

Query: 541 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600
           AEFIVILDADMIMRG ITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG
Sbjct: 541 AEFIVILDADMIMRGPITPWEFKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGG 600

Query: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLR 660
           VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIY+SGWISEMYGYSFGAAELQLR
Sbjct: 601 VIIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYESGWISEMYGYSFGAAELQLR 660

Query: 661 HIRNSEILLYPGYVPDPGVHYRVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPD 720
           HIRN+EIL+YPGY PDPGVHYRVFHYGLEFKVGNWSF KANWR+TDLVNTCWA FP PPD
Sbjct: 661 HIRNTEILIYPGYYPDPGVHYRVFHYGLEFKVGNWSFGKANWRDTDLVNTCWAQFPAPPD 720

Query: 721 PSTLDQTDKDAFARDLLSIECIRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRK 780
            STLDQTDK+AFARDLLSIECIRTLNEALYLHHKK NCSDP+SLTNSNSE+E+EAG+SRK
Sbjct: 721 ASTLDQTDKNAFARDLLSIECIRTLNEALYLHHKKSNCSDPSSLTNSNSENESEAGVSRK 780

Query: 781 IGKLDESYTGKDDQ-STESSQESSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFS 840
           IGKLDESYTGK D  STESSQESSEE KED +FSSLRLWII++WVISGL+FLV+I+S+FS
Sbjct: 781 IGKLDESYTGKGDHLSTESSQESSEEVKEDAMFSSLRLWIISIWVISGLLFLVLIISKFS 840

Query: 841 GRKGKGVRGKHHRIKRRTASYSGFVDRNGQEKYVRDLDASL 881
           GRK K VRGKH RIKRRTASYSGFVDRNGQEKYVRDLDASL
Sbjct: 841 GRKVKVVRGKHQRIKRRTASYSGFVDRNGQEKYVRDLDASL 848

BLAST of Clc05G15090 vs. TAIR 10
Match: AT3G01720.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 374 Blast hits to 211 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316; Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink). )

HSP 1 Score: 1225.7 bits (3170), Expect = 0.0e+00
Identity = 577/846 (68.20%), Postives = 674/846 (79.67%), Query Frame = 0

Query: 22  SNNSGMAAPRRIHTLFSVECQNYFDWQTVGLMHSFKKSKQPGPITRLLSCTDEEKKNYRG 81
           ++ SG  AP RIHTLFSVECQNYFDWQTVGLMHSF KS QPGPITRLLSCTD++KK YRG
Sbjct: 19  ADESGQMAPYRIHTLFSVECQNYFDWQTVGLMHSFLKSGQPGPITRLLSCTDDQKKTYRG 78

Query: 82  MHLAPTFEVPSMSRHPKTGDWYPAINKPAGVVHWLKHSKEAENVDWVVILDADMIIRGPI 141
           M+LAPTFEVPS SRHPKTGDWYPAINKP GV++WL+HS+EA++VDWVVILDADMIIRGPI
Sbjct: 79  MNLAPTFEVPSWSRHPKTGDWYPAINKPVGVLYWLQHSEEAKHVDWVVILDADMIIRGPI 138

Query: 142 IPWELGAEKGRPVAAYYGLHSSLFSLLQSLMASKLIILLKLRFQQSYLDRCFWLSTHPEL 201
           IPWELGAE+GRP AA+YG      +LL  L                        + HPEL
Sbjct: 139 IPWELGAERGRPFAAHYGYLVGCDNLLVRLH-----------------------TKHPEL 198

Query: 202 CDKVGGLLAMHIDDLRVFAPMWLSKTEEVREDRDHWATNITGDIYGKGWISEMYGYSFGA 261
           CDKVGGLLAMHIDDLRV AP+WLSKTE+VR+D  HW TN+TGDIYGKGWISEMYGYSFGA
Sbjct: 199 CDKVGGLLAMHIDDLRVLAPLWLSKTEDVRQDTAHWTTNLTGDIYGKGWISEMYGYSFGA 258

Query: 262 AEVGLRHKINDNLMIYPGYIPRPEIEPILLHYGLPFSVGNWSFSKLNHHEDDIVYDCNRL 321
           AE GL+HKIND+LMIYPGY+PR  +EP+L+HYGLPFS+GNWSF+KL+HHED+IVYDCNRL
Sbjct: 259 AEAGLKHKINDDLMIYPGYVPREGVEPVLMHYGLPFSIGNWSFTKLDHHEDNIVYDCNRL 318

Query: 322 FPEPPYPREIQQMESDSNKKRGLFINIECINLLNEGLLLQHKRNGCPKPQWSKYLSFLKS 381
           FPEPPYPRE++ ME D +K+RGL +++EC+N LNEGL+L+H  NGCPKP+W+KYLSFLKS
Sbjct: 319 FPEPPYPREVKIMEPDPSKRRGLILSLECMNTLNEGLILRHAENGCPKPKWTKYLSFLKS 378

Query: 382 KTFTDLTKPKYPTPATLVMKEDRVQKQPVKGDHAQKQPVKEDLVQKQPVKEDLVQKQPVL 441
           KTF +LT+PK   P ++ +  D                      Q +P         P +
Sbjct: 379 KTFMELTRPKLLAPGSVHILPD----------------------QHEP---------PPI 438

Query: 442 DELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFRLSGQPGNITRLLSCTDEDLKEYKGHN 501
           DE +  YPKIHTLFSTEC+TYFDWQTVG MHSFR SGQPGNITRLLSCTDE LK YKGH+
Sbjct: 439 DEFKGTYPKIHTLFSTECTTYFDWQTVGFMHSFRQSGQPGNITRLLSCTDEALKNYKGHD 498

Query: 502 LAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTDAEFIVILDADMIMRGSITPWE 561
           LAPTHYVPSMSRHPLTGDWYPAINKPAAV+HWL+H N DAE++VILDADMI+RG ITPWE
Sbjct: 499 LAPTHYVPSMSRHPLTGDWYPAINKPAAVVHWLHHTNIDAEYVVILDADMILRGPITPWE 558

Query: 562 FKAARGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTE 621
           FKAARGRPVSTPYDYLIGCDN LA+LHT +PEACDKVGGVIIMHI+DLRKFAM WL KT+
Sbjct: 559 FKAARGRPVSTPYDYLIGCDNDLARLHTRNPEACDKVGGVIIMHIEDLRKFAMYWLLKTQ 618

Query: 622 EVRADRAHYATNITGDIYQSGWISEMYGYSFGAAELQLRHIRNSEILLYPGYVPDPGVHY 681
           EVRAD+ HY   +TGDIY+SGWISEMYGYSFGAAEL LRH  N EI++YPGYVP+PG  Y
Sbjct: 619 EVRADKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPGYVPEPGADY 678

Query: 682 RVFHYGLEFKVGNWSFDKANWRETDLVNTCWAHFPGPPDPSTLDQTDKDAFARDLLSIEC 741
           RVFHYGLEFKVGNWSFDKANWR TDL+N CWA FP PP PS + QTD D   RDLLSIEC
Sbjct: 679 RVFHYGLEFKVGNWSFDKANWRNTDLINKCWAKFPDPPSPSAVHQTDNDLRQRDLLSIEC 738

Query: 742 IRTLNEALYLHHKKRNCSDPNSLTNSNSEDETEAGISRKIGKLDESYTGKDDQSTESSQE 801
            + LNEAL+LHHK+RNC +P       SE   +  +SRK+G ++     K  Q ++ ++E
Sbjct: 739 GQKLNEALFLHHKRRNCPEP------GSESTEKISVSRKVGNIET----KQTQGSDETKE 798

Query: 802 SSEEAKEDGIFSSLRLWIIALWVISGLVFLVVIVSRFSGRKGKG-VRGKHHRIKRRTA-S 861
           SS  ++ +G FS+L+LW+IALW+ISG+ FLVV++  FS R+G+G  RGK +R KRRT+ S
Sbjct: 799 SSGSSESEGRFSTLKLWVIALWLISGVGFLVVMLLVFSTRRGRGTTRGKGYRNKRRTSYS 800

Query: 862 YSGFVD 866
            +GF+D
Sbjct: 859 NTGFLD 800

BLAST of Clc05G15090 vs. TAIR 10
Match: AT5G13500.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 71.6 bits (174), Expect = 3.5e-12
Identity = 68/315 (21.59%), Postives = 126/315 (40.00%), Query Frame = 0

Query: 429 PVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFR----LSGQP-GNI 488
           P+ + +VQ    + + +      H   +   + Y  WQ   + + ++    L G   G  
Sbjct: 41  PLLDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGF 100

Query: 489 TRLLSCTDEDLKEYKGHNL---APTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 548
           TR+L   + D       NL    PT  V  +   P     Y  +N+P A + WL      
Sbjct: 101 TRILHSGNSD-------NLMDEIPTFVVDPLP--PGLDRGYVVLNRPWAFVQWLERATIK 160

Query: 549 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKV 608
            +++++ + D +    + P    A  G P + P+ Y+     +N++ K + +       +
Sbjct: 161 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 220

Query: 609 GGV----IIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGA 668
             +    +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +
Sbjct: 221 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 280

Query: 669 AELQLRHIRNSEILLYPGY-VPDPGVHYRVFHYGLEF---------KVGNWSFDKANWRE 720
           A   +RHI   + +L P + +   G     + YG ++         K+G W FDK     
Sbjct: 281 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKR---- 323

BLAST of Clc05G15090 vs. TAIR 10
Match: AT5G13500.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 228 Blast hits to 200 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213; Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink). )

HSP 1 Score: 71.6 bits (174), Expect = 3.5e-12
Identity = 68/315 (21.59%), Postives = 126/315 (40.00%), Query Frame = 0

Query: 429 PVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFR----LSGQP-GNI 488
           P+ + +VQ    + + +      H   +   + Y  WQ   + + ++    L G   G  
Sbjct: 41  PLLDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGF 100

Query: 489 TRLLSCTDEDLKEYKGHNL---APTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 548
           TR+L   + D       NL    PT  V  +   P     Y  +N+P A + WL      
Sbjct: 101 TRILHSGNSD-------NLMDEIPTFVVDPLP--PGLDRGYVVLNRPWAFVQWLERATIK 160

Query: 549 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKV 608
            +++++ + D +    + P    A  G P + P+ Y+     +N++ K + +       +
Sbjct: 161 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 220

Query: 609 GGV----IIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGA 668
             +    +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +
Sbjct: 221 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 280

Query: 669 AELQLRHIRNSEILLYPGY-VPDPGVHYRVFHYGLEF---------KVGNWSFDKANWRE 720
           A   +RHI   + +L P + +   G     + YG ++         K+G W FDK     
Sbjct: 281 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKR---- 323

BLAST of Clc05G15090 vs. TAIR 10
Match: AT5G13500.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 71.6 bits (174), Expect = 3.5e-12
Identity = 68/315 (21.59%), Postives = 126/315 (40.00%), Query Frame = 0

Query: 429 PVKEDLVQKQPVLDELQEPYPKIHTLFSTECSTYFDWQTVGLMHSFR----LSGQP-GNI 488
           P+ + +VQ    + + +      H   +   + Y  WQ   + + ++    L G   G  
Sbjct: 41  PLLDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGF 100

Query: 489 TRLLSCTDEDLKEYKGHNL---APTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHVNTD 548
           TR+L   + D       NL    PT  V  +   P     Y  +N+P A + WL      
Sbjct: 101 TRILHSGNSD-------NLMDEIPTFVVDPLP--PGLDRGYVVLNRPWAFVQWLERATIK 160

Query: 549 AEFIVILDADMIMRGSITPWEFKAARGRPVSTPYDYLI--GCDNVLAKLHTSHPEACDKV 608
            +++++ + D +    + P    A  G P + P+ Y+     +N++ K + +       +
Sbjct: 161 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 220

Query: 609 GGV----IIMHIDDLRKFAMLWLHKTEEVRADRAHYATNITGDIYQSGWISEMYGYSFGA 668
             +    +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +
Sbjct: 221 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 280

Query: 669 AELQLRHIRNSEILLYPGY-VPDPGVHYRVFHYGLEF---------KVGNWSFDKANWRE 720
           A   +RHI   + +L P + +   G     + YG ++         K+G W FDK     
Sbjct: 281 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKR---- 323

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038899299.10.0e+0092.17peptidyl serine alpha-galactosyltransferase [Benincasa hispida][more]
XP_011651582.20.0e+0088.90peptidyl serine alpha-galactosyltransferase [Cucumis sativus][more]
KGN58321.20.0e+0090.01hypothetical protein Csa_017560 [Cucumis sativus][more]
XP_008449998.10.0e+0090.45PREDICTED: uncharacterized protein LOC103491714 isoform X1 [Cucumis melo][more]
XP_016900856.10.0e+0090.23PREDICTED: uncharacterized protein LOC103491714 isoform X2 [Cucumis melo][more]
Match NameE-valueIdentityDescription
Q8VYF90.0e+0068.20Peptidyl serine alpha-galactosyltransferase OS=Arabidopsis thaliana OX=3702 GN=S... [more]
H3JU057.0e-5035.43Peptidyl serine alpha-galactosyltransferase OS=Chlamydomonas reinhardtii OX=3055... [more]
G7LG313.8e-1123.72Hydroxyproline O-arabinosyltransferase RDN2 OS=Medicago truncatula OX=3880 GN=RD... [more]
Q9FY514.9e-1121.59Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT... [more]
Match NameE-valueIdentityDescription
A0A0A0LDQ30.0e+0086.97Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G597250 PE=4 SV=1[more]
A0A1S3BNB40.0e+0090.45uncharacterized protein LOC103491714 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S4DXZ60.0e+0090.23uncharacterized protein LOC103491714 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1J5670.0e+0088.99peptidyl serine alpha-galactosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC11... [more]
A0A6J1F9840.0e+0088.88peptidyl serine alpha-galactosyltransferase-like OS=Cucurbita moschata OX=3662 G... [more]
Match NameE-valueIdentityDescription
AT3G01720.10.0e+0068.20unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G13500.13.5e-1221.59unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G13500.23.5e-1221.59unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G13500.33.5e-1221.59unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31485:SF25PEPTIDYL SERINE ALPHA-GALACTOSYLTRANSFERASEcoord: 433..863
coord: 28..394
IPR044845Glycosyltransferase HPAT/SRGT1-likePANTHERPTHR31485PEPTIDYL SERINE ALPHA-GALACTOSYLTRANSFERASEcoord: 28..394
IPR044845Glycosyltransferase HPAT/SRGT1-likePANTHERPTHR31485PEPTIDYL SERINE ALPHA-GALACTOSYLTRANSFERASEcoord: 433..863

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc05G15090.2Clc05G15090.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071704 organic substance metabolic process
biological_process GO:0016310 phosphorylation
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity
molecular_function GO:0016301 kinase activity
molecular_function GO:0016773 phosphotransferase activity, alcohol group as acceptor