Cla97C02G028650 (gene) Watermelon (97103) v2

NameCla97C02G028650
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionGlycosyltransferase
LocationCla97Chr02 : 2142883 .. 2144258 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAGCCACTTCTAGCCTACACATAGCAATGTACCCTTGGTTTGCTTTTGGCCACTTCACTCCATATCTCCAAATTGCCAACAAATTAGCCAAAAAAGGCCATAAAATCTCCTTCTTCATCCCATCAAAAACTCAACCCAAATTGCAGCCTTTCAATCACTTTCCAAGTCTCATTACCTTTGTCCCCATCATTGTTCCTCATGTTGATGGTCTCCCAGAAGCTGCTGAAACTACTGCTGATGTTTCTCATCCTTCACTGTTCAATCTCATCATGACTGCAATGGATCTCACTCAACCACAAATCAAATGCATTCTCCAAAACATCAAACCCCATTTCATCTTCTTTGATTTCACCTTCTGGATGCCAAAATTAGCATCTCAACTGGGCATTAAGTCAATTTATCATAGTGTAATTAGTGCAACAACATTTGCTTATGTTTTCCCCCCATCAAGACAACTCTGTGGAGATAATTGCACTGAGGCTGACTTTATGAAGCCACCTCCTGGCTATCCAAGTTCCACCATCAAACTTCATTCTCATGAGGCCAAATTTTATGCTTCCATGAGCCATATGAGATTTGGAAGTGATGTTCTTTTCTTTCATCGCCATTTCACTGGTCTTTGTGAATCTGATGCTATAGCATTCAAGTCATGTAGGGAGATTGAAGGGCCTTTTGTAGACTATCTCATAAGTGAATTCAAAAAGCCTGTTCTGCTTTCAGGACCTGATGGGAACATACAAGAACCAGCAACAACTTTTGAACATAGATGGGCAGCATGGCTATCAAGGTTCAAAGCTGGTTCAGTCATATACTGTGCATTTGGAAGTGAATGTACCTTAACAAAACACCAATTCCAAGAATTGGTATTGGGTTTTGAGCTTACAAATTTACCATTCTTTGCTGTACTCAAACTGCCTCACAGCATCGATATGGTCGGTGCCGCCTTACCGGAAGGTTTCGAACAGAGAGTTCAGGGGAGAGGGATAGTGTGTGGAGGATGGGTTCAACAACAACAAATTTCAGGGGAGAGGGATGCTTTGTTACACATTGTGGGGCAGGGTCTATATCTGAGGCTTTGGTGAAGAAGTGTCAATTGGTGTTTTTACCTCATAATGGTGACCATTTTTTTAGAGCAAGAACAATGAGCAGGTGTTTGAAGGTTGGTGTGGAGGTGGAAAGAAGGGAAGAAGATGGAATTTTCAACAAAGAAAGTGTGTGTAAGGCAGTGAAGACAGTGATGGATGAAGAGAGTGAAATTGGGAAAGAGATCAGAGCAAACCGTGCAAAGTTAAGGGAATTATTGGTTGACAAAGATTTGGAAGAGTCTTATATCAACAATTTCATTCACAGTCTCCATGGTTTGATTGTATAA

mRNA sequence

ATGGCAGCCACTTCTAGCCTACACATAGCAATGTACCCTTGGTTTGCTTTTGGCCACTTCACTCCATATCTCCAAATTGCCAACAAATTAGCCAAAAAAGGCCATAAAATCTCCTTCTTCATCCCATCAAAAACTCAACCCAAATTGCAGCCTTTCAATCACTTTCCAAGTCTCATTACCTTTGTCCCCATCATTGTTCCTCATGTTGATGGTCTCCCAGAAGCTGCTGAAACTACTGCTGATGTTTCTCATCCTTCACTGTTCAATCTCATCATGACTGCAATGGATCTCACTCAACCACAAATCAAATGCATTCTCCAAAACATCAAACCCCATTTCATCTTCTTTGATTTCACCTTCTGGATGCCAAAATTAGCATCTCAACTGGGCATTAAGTCAATTTATCATAGTGTAATTAGTGCAACAACATTTGCTTATGTTTTCCCCCCATCAAGACAACTCTGTGGAGATAATTGCACTGAGGCTGACTTTATGAAGCCACCTCCTGGCTATCCAAGTTCCACCATCAAACTTCATTCTCATGAGGCCAAATTTTATGCTTCCATGAGCCATATGAGATTTGGAAGTGATGTTCTTTTCTTTCATCGCCATTTCACTGGTCTTTGTGAATCTGATGCTATAGCATTCAAGTCATGTAGGGAGATTGAAGGGCCTTTTGTAGACTATCTCATAAGTGAATTCAAAAAGCCTGTTCTGCTTTCAGGACCTGATGGGAACATACAAGAACCAGCAACAACTTTTGAACATAGATGGGCAGCATGGCTATCAAGGTTCAAAGCTGGTTCAGTCATATACTGTGCATTTGGAAGTGAATGTACCTTAACAAAACACCAATTCCAAGAATTGGTATTGGGTTTTGAGCTTACAAATTTACCATTCTTTGCTGTACTCAAACTGCCTCACAGCATCGATATGGTCGGTGCCGCCTTACCGGAAGGGTCTATATCTGAGGCTTTGGTGAAGAAGTGTCAATTGGTGTTTTTACCTCATAATGGTGACCATTTTTTTAGAGCAAGAACAATGAGCAGGTGTTTGAAGGTTGGTGTGGAGGTGGAAAGAAGGGAAGAAGATGGAATTTTCAACAAAGAAAGTGTGTGTAAGGCAGTGAAGACAGTGATGGATGAAGAGAGTGAAATTGGGAAAGAGATCAGAGCAAACCGTGCAAAGTTAAGGGAATTATTGGTTGACAAAGATTTGGAAGAGTCTTATATCAACAATTTCATTCACAGTCTCCATGGTTTGATTGTATAA

Coding sequence (CDS)

ATGGCAGCCACTTCTAGCCTACACATAGCAATGTACCCTTGGTTTGCTTTTGGCCACTTCACTCCATATCTCCAAATTGCCAACAAATTAGCCAAAAAAGGCCATAAAATCTCCTTCTTCATCCCATCAAAAACTCAACCCAAATTGCAGCCTTTCAATCACTTTCCAAGTCTCATTACCTTTGTCCCCATCATTGTTCCTCATGTTGATGGTCTCCCAGAAGCTGCTGAAACTACTGCTGATGTTTCTCATCCTTCACTGTTCAATCTCATCATGACTGCAATGGATCTCACTCAACCACAAATCAAATGCATTCTCCAAAACATCAAACCCCATTTCATCTTCTTTGATTTCACCTTCTGGATGCCAAAATTAGCATCTCAACTGGGCATTAAGTCAATTTATCATAGTGTAATTAGTGCAACAACATTTGCTTATGTTTTCCCCCCATCAAGACAACTCTGTGGAGATAATTGCACTGAGGCTGACTTTATGAAGCCACCTCCTGGCTATCCAAGTTCCACCATCAAACTTCATTCTCATGAGGCCAAATTTTATGCTTCCATGAGCCATATGAGATTTGGAAGTGATGTTCTTTTCTTTCATCGCCATTTCACTGGTCTTTGTGAATCTGATGCTATAGCATTCAAGTCATGTAGGGAGATTGAAGGGCCTTTTGTAGACTATCTCATAAGTGAATTCAAAAAGCCTGTTCTGCTTTCAGGACCTGATGGGAACATACAAGAACCAGCAACAACTTTTGAACATAGATGGGCAGCATGGCTATCAAGGTTCAAAGCTGGTTCAGTCATATACTGTGCATTTGGAAGTGAATGTACCTTAACAAAACACCAATTCCAAGAATTGGTATTGGGTTTTGAGCTTACAAATTTACCATTCTTTGCTGTACTCAAACTGCCTCACAGCATCGATATGGTCGGTGCCGCCTTACCGGAAGGGTCTATATCTGAGGCTTTGGTGAAGAAGTGTCAATTGGTGTTTTTACCTCATAATGGTGACCATTTTTTTAGAGCAAGAACAATGAGCAGGTGTTTGAAGGTTGGTGTGGAGGTGGAAAGAAGGGAAGAAGATGGAATTTTCAACAAAGAAAGTGTGTGTAAGGCAGTGAAGACAGTGATGGATGAAGAGAGTGAAATTGGGAAAGAGATCAGAGCAAACCGTGCAAAGTTAAGGGAATTATTGGTTGACAAAGATTTGGAAGAGTCTTATATCAACAATTTCATTCACAGTCTCCATGGTTTGATTGTATAA

Protein sequence

MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVLKLPHSIDMVGAALPEGSISEALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLIV
BLAST of Cla97C02G028650 vs. NCBI nr
Match: XP_008455479.1 (PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis melo] >XP_016901697.1 PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis melo])

HSP 1 Score: 728.0 bits (1878), Expect = 1.8e-206
Identity = 361/460 (78.48%), Postives = 385/460 (83.70%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAMYPWFAFGH  PYLQIANKLAKKGH+ISF IPS TQPKLQPFNHFP+LIT
Sbjct: 1   MAATSSLHIAMYPWFAFGHLIPYLQIANKLAKKGHRISFLIPSNTQPKLQPFNHFPNLIT 60

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           F+PIIVPHV+GLP+ AETTADVS+   F+LIMTAMDLTQPQI+  LQ++KPHF FFDFT+
Sbjct: 61  FLPIIVPHVNGLPQGAETTADVSNLQQFSLIMTAMDLTQPQIESFLQHVKPHFFFFDFTY 120

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           WMPKLASQ GIKSIYHSVISATTFAYV+PPSRQLCG + T  DFMKPP G+PSS IKLHS
Sbjct: 121 WMPKLASQFGIKSIYHSVISATTFAYVYPPSRQLCGHDFTVDDFMKPPLGFPSSVIKLHS 180

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAKFYASMSHM+FGSDVLFFHRHFTGLCESDAIAFKS REIEGPF+DYL +EFKKPVLL
Sbjct: 181 HEAKFYASMSHMKFGSDVLFFHRHFTGLCESDAIAFKSSREIEGPFIDYLETEFKKPVLL 240

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDGNIQEP TT E RWA  LS FKAGSVIYCAFGSECTLTK Q QELVLGFELTNLPF
Sbjct: 241 SGPDGNIQEPTTTLEQRWAECLSEFKAGSVIYCAFGSECTLTKDQLQELVLGFELTNLPF 300

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
           FAVLK PH +D + AALPE                                     GS+S
Sbjct: 301 FAVLKPPHGMDTINAALPEGFEQRIQGRGVVYGGWVQQQHILEHPSIGCFVTHCGAGSLS 360

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLVFLPH GDHFFRART+S CLKVGVEVERR+EDG+FNKESVCKAVKTVMDEE
Sbjct: 361 EALVKKCQLVFLPHIGDHFFRARTLSSCLKVGVEVERRQEDGVFNKESVCKAVKTVMDEE 420

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLIV 424
           +E GKEIRAN AKLRELLVDKDLEESYINNFIH LH LIV
Sbjct: 421 NESGKEIRANLAKLRELLVDKDLEESYINNFIHKLHCLIV 460

BLAST of Cla97C02G028650 vs. NCBI nr
Match: XP_022952333.1 (UDP-glycosyltransferase 79B30-like [Cucurbita moschata])

HSP 1 Score: 691.0 bits (1782), Expect = 2.5e-195
Identity = 346/459 (75.38%), Postives = 370/459 (80.61%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAMYPWFAFGH  PYLQIANKLAKKGHKISFF+PSKT  KLQPFNHFP LIT
Sbjct: 1   MAATSSLHIAMYPWFAFGHLVPYLQIANKLAKKGHKISFFVPSKTHLKLQPFNHFPHLIT 60

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           F+PI VPHVDGLP+AAETTADVSHPS F+ IMTAMDLTQPQIK +LQ+++PHFIFFDFTF
Sbjct: 61  FLPITVPHVDGLPQAAETTADVSHPSQFSHIMTAMDLTQPQIKRLLQDLQPHFIFFDFTF 120

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           WMPKLASQLGI SIY+SVISATTF Y++ PSRQLCG + TEADFM+PPP YP STIKLH+
Sbjct: 121 WMPKLASQLGITSIYYSVISATTFGYIYTPSRQLCGHDLTEADFMQPPPKYPISTIKLHA 180

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAK  ASM  MRFGSDVLF HRHFTGLCE+DAIAFKSCREIEGP VDYLISE KKPVLL
Sbjct: 181 HEAKNIASMGRMRFGSDVLFSHRHFTGLCEADAIAFKSCREIEGPSVDYLISELKKPVLL 240

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDG+IQ+P T  EHRWA WLS F AGSVIYCAFGSEC LTK Q  ELVLGFEL+NLPF
Sbjct: 241 SGPDGDIQQPTTALEHRWAEWLSGFDAGSVIYCAFGSECYLTKDQLHELVLGFELSNLPF 300

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
           FA LK P  ++ VGAALPE                                     GS+S
Sbjct: 301 FAALKPPLGVESVGAALPEGFEQRVQGRGVVYGGWVQQQLILEHPSIGCFVTHCGAGSLS 360

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLV LPH GDH FRARTMS  LKVGVEVE+REEDG F KESVCKAVKTVMDEE
Sbjct: 361 EALVKKCQLVLLPHVGDHIFRARTMSSHLKVGVEVEKREEDGFFTKESVCKAVKTVMDEE 420

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLI 423
           +E GK IRANR KLREL +DKDLEESYINNFIH L  LI
Sbjct: 421 NETGKAIRANREKLRELFLDKDLEESYINNFIHDLQVLI 459

BLAST of Cla97C02G028650 vs. NCBI nr
Match: XP_022969084.1 (UDP-glycosyltransferase 79B30-like [Cucurbita maxima])

HSP 1 Score: 683.3 bits (1762), Expect = 5.1e-193
Identity = 342/459 (74.51%), Postives = 367/459 (79.96%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAMYPWFAFGH  PYL IANKLAKKGHKISFF+PSKT  KLQPFNHFP LIT
Sbjct: 1   MAATSSLHIAMYPWFAFGHLVPYLHIANKLAKKGHKISFFVPSKTHLKLQPFNHFPHLIT 60

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           FVPI VPHVDGLPEAAETTADVSHPS F+ IMTAMDLTQPQIK +LQ+++PHFIFFDFTF
Sbjct: 61  FVPITVPHVDGLPEAAETTADVSHPSRFSHIMTAMDLTQPQIKRLLQDLQPHFIFFDFTF 120

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           WMPKLASQLGI S+Y+SVISATTF Y++ PSRQLCG + TEADFM+PPP YP STIKLH+
Sbjct: 121 WMPKLASQLGITSLYYSVISATTFGYIYTPSRQLCGHDLTEADFMQPPPKYPISTIKLHA 180

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAK  ASM  M FGSDVLF HRHFTGLCE+DAIAFKSCREIEGP VDYLISE KKPVLL
Sbjct: 181 HEAKNVASMGRMSFGSDVLFSHRHFTGLCEADAIAFKSCREIEGPSVDYLISELKKPVLL 240

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDG+IQ+P T  EHRWA WLS F AGSVIYCAFGSEC LTK +  ELVLGFEL+NLPF
Sbjct: 241 SGPDGDIQQPTTALEHRWAEWLSGFDAGSVIYCAFGSECYLTKDRLHELVLGFELSNLPF 300

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
           FA LK P  ++ VGAALPE                                     GS+S
Sbjct: 301 FAALKPPLGVESVGAALPEGFEQRVQGRGVVYGGWVQQQLILEHPSIGCFVTHCGAGSLS 360

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLV LPH GDH FRARTMS  LKVGVEVE+REEDG F KESVCKAVKTVMDEE
Sbjct: 361 EALVKKCQLVLLPHVGDHIFRARTMSSHLKVGVEVEKREEDGFFTKESVCKAVKTVMDEE 420

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLI 423
           +E GK IR NR KLREL +DK+LEESYINNFIH L  LI
Sbjct: 421 NETGKAIRENREKLRELFLDKELEESYINNFIHDLQVLI 459

BLAST of Cla97C02G028650 vs. NCBI nr
Match: XP_023554470.1 (UDP-glycosyltransferase 79B30-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 682.6 bits (1760), Expect = 8.8e-193
Identity = 342/459 (74.51%), Postives = 367/459 (79.96%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAMYPWFAFGH  PYLQIANKLAKKGHKISFF+PSKT  KLQPFNHFP LIT
Sbjct: 1   MAATSSLHIAMYPWFAFGHLVPYLQIANKLAKKGHKISFFVPSKTHLKLQPFNHFPHLIT 60

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           F+PI VPHVDGLP+AAETTADVSHPS F+ IMTAMDLTQPQIK +LQ+++PHFIFFDFTF
Sbjct: 61  FLPITVPHVDGLPQAAETTADVSHPSQFSHIMTAMDLTQPQIKRLLQDLQPHFIFFDFTF 120

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           WMPKLASQLGI SIY+SVISATTF Y++ PSRQLCG + TEADFM+PPP YP STIKLH+
Sbjct: 121 WMPKLASQLGITSIYYSVISATTFGYIYTPSRQLCGHDLTEADFMQPPPKYPISTIKLHA 180

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAK  ASM  M+FGSDVLF HRHFTGLCESDAIAFKSCREIEGP VDYL SE KKPVLL
Sbjct: 181 HEAKNVASMGRMKFGSDVLFSHRHFTGLCESDAIAFKSCREIEGPCVDYLTSELKKPVLL 240

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDG+IQ+P T  EHRWA WLS F AGSVIYCAFGSEC LTK Q  ELVLGFEL+NLPF
Sbjct: 241 SGPDGDIQQPTTALEHRWAEWLSGFNAGSVIYCAFGSECYLTKDQLHELVLGFELSNLPF 300

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
           FA LK P  ++ V AALPE                                     GS+S
Sbjct: 301 FAALKPPLGVESVCAALPEGFEQRVQGRGVVYGGWVQQQLILEHPSIGCFVTHCGAGSLS 360

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLV LPH GDH FRARTMS  LKVGVEVE+REEDG F KESVCKAVKTVMDEE
Sbjct: 361 EALVKKCQLVLLPHVGDHIFRARTMSSHLKVGVEVEKREEDGFFTKESVCKAVKTVMDEE 420

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLI 423
           +E GK IR NR KLREL +DK+LEESYINNFIH L  LI
Sbjct: 421 NETGKAIRENREKLRELFLDKELEESYINNFIHDLQVLI 459

BLAST of Cla97C02G028650 vs. NCBI nr
Match: XP_004144602.1 (PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis sativus] >XP_011658688.1 PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis sativus] >KGN43484.1 hypothetical protein Csa_7G041300 [Cucumis sativus])

HSP 1 Score: 676.4 bits (1744), Expect = 6.3e-191
Identity = 341/459 (74.29%), Postives = 371/459 (80.83%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAM+PWFAFGH  PYLQIANKLAKKGHKISF IPSKTQ KLQPFNHFP+LIT
Sbjct: 1   MAATSSLHIAMFPWFAFGHLAPYLQIANKLAKKGHKISFLIPSKTQVKLQPFNHFPNLIT 60

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           FVPIIVPHVDGLPE AE TADVS+   FNLIMTAMDLTQPQIK +LQ IKPH IFFDFTF
Sbjct: 61  FVPIIVPHVDGLPEGAEITADVSNLHEFNLIMTAMDLTQPQIKTLLQLIKPHVIFFDFTF 120

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           W+PKLASQLGIKSIY+SVISATTF+YVF P+RQLCG + T  +FM+PP G   S IKLHS
Sbjct: 121 WIPKLASQLGIKSIYYSVISATTFSYVFTPTRQLCGPDFTVDEFMQPPLGLAISAIKLHS 180

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAK    MS+M FGSDV FFHRHFTGLCE+DAIAFK+C EIEGPFVD+LISEFKKPVLL
Sbjct: 181 HEAKNVTFMSNMIFGSDVRFFHRHFTGLCEADAIAFKACGEIEGPFVDFLISEFKKPVLL 240

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDG+IQEP TT EHRW  WLS+FK+GSVIYCAFGSECTLTK QFQELVLGFELTNLPF
Sbjct: 241 SGPDGDIQEPKTTLEHRWQEWLSKFKSGSVIYCAFGSECTLTKDQFQELVLGFELTNLPF 300

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
            AVLK P  +D V AALP+                                     GS+S
Sbjct: 301 LAVLKPPVGVDTVTAALPDGFEERVEGRGVVYGGWVQQQHILEHPSIGCFVTHCGAGSLS 360

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLV LPH GDHFFRART+S CLKVGVEVE+RE+DG F KESVC+AVKT+MDE 
Sbjct: 361 EALVKKCQLVLLPHVGDHFFRARTLSSCLKVGVEVEKREDDGFFTKESVCEAVKTLMDEG 420

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLI 423
           +E GKEIRA RAKLRELL+DKDLEESYI NFIH+L  L+
Sbjct: 421 NERGKEIRATRAKLRELLLDKDLEESYIINFIHNLQSLV 459

BLAST of Cla97C02G028650 vs. TrEMBL
Match: tr|A0A1S4E0D1|A0A1S4E0D1_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103495633 PE=3 SV=1)

HSP 1 Score: 728.0 bits (1878), Expect = 1.2e-206
Identity = 361/460 (78.48%), Postives = 385/460 (83.70%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAMYPWFAFGH  PYLQIANKLAKKGH+ISF IPS TQPKLQPFNHFP+LIT
Sbjct: 1   MAATSSLHIAMYPWFAFGHLIPYLQIANKLAKKGHRISFLIPSNTQPKLQPFNHFPNLIT 60

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           F+PIIVPHV+GLP+ AETTADVS+   F+LIMTAMDLTQPQI+  LQ++KPHF FFDFT+
Sbjct: 61  FLPIIVPHVNGLPQGAETTADVSNLQQFSLIMTAMDLTQPQIESFLQHVKPHFFFFDFTY 120

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           WMPKLASQ GIKSIYHSVISATTFAYV+PPSRQLCG + T  DFMKPP G+PSS IKLHS
Sbjct: 121 WMPKLASQFGIKSIYHSVISATTFAYVYPPSRQLCGHDFTVDDFMKPPLGFPSSVIKLHS 180

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAKFYASMSHM+FGSDVLFFHRHFTGLCESDAIAFKS REIEGPF+DYL +EFKKPVLL
Sbjct: 181 HEAKFYASMSHMKFGSDVLFFHRHFTGLCESDAIAFKSSREIEGPFIDYLETEFKKPVLL 240

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDGNIQEP TT E RWA  LS FKAGSVIYCAFGSECTLTK Q QELVLGFELTNLPF
Sbjct: 241 SGPDGNIQEPTTTLEQRWAECLSEFKAGSVIYCAFGSECTLTKDQLQELVLGFELTNLPF 300

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
           FAVLK PH +D + AALPE                                     GS+S
Sbjct: 301 FAVLKPPHGMDTINAALPEGFEQRIQGRGVVYGGWVQQQHILEHPSIGCFVTHCGAGSLS 360

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLVFLPH GDHFFRART+S CLKVGVEVERR+EDG+FNKESVCKAVKTVMDEE
Sbjct: 361 EALVKKCQLVFLPHIGDHFFRARTLSSCLKVGVEVERRQEDGVFNKESVCKAVKTVMDEE 420

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLIV 424
           +E GKEIRAN AKLRELLVDKDLEESYINNFIH LH LIV
Sbjct: 421 NESGKEIRANLAKLRELLVDKDLEESYINNFIHKLHCLIV 460

BLAST of Cla97C02G028650 vs. TrEMBL
Match: tr|A0A0A0K3E5|A0A0A0K3E5_CUCSA (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G041300 PE=3 SV=1)

HSP 1 Score: 676.4 bits (1744), Expect = 4.2e-191
Identity = 341/459 (74.29%), Postives = 371/459 (80.83%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAM+PWFAFGH  PYLQIANKLAKKGHKISF IPSKTQ KLQPFNHFP+LIT
Sbjct: 1   MAATSSLHIAMFPWFAFGHLAPYLQIANKLAKKGHKISFLIPSKTQVKLQPFNHFPNLIT 60

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           FVPIIVPHVDGLPE AE TADVS+   FNLIMTAMDLTQPQIK +LQ IKPH IFFDFTF
Sbjct: 61  FVPIIVPHVDGLPEGAEITADVSNLHEFNLIMTAMDLTQPQIKTLLQLIKPHVIFFDFTF 120

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           W+PKLASQLGIKSIY+SVISATTF+YVF P+RQLCG + T  +FM+PP G   S IKLHS
Sbjct: 121 WIPKLASQLGIKSIYYSVISATTFSYVFTPTRQLCGPDFTVDEFMQPPLGLAISAIKLHS 180

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAK    MS+M FGSDV FFHRHFTGLCE+DAIAFK+C EIEGPFVD+LISEFKKPVLL
Sbjct: 181 HEAKNVTFMSNMIFGSDVRFFHRHFTGLCEADAIAFKACGEIEGPFVDFLISEFKKPVLL 240

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDG+IQEP TT EHRW  WLS+FK+GSVIYCAFGSECTLTK QFQELVLGFELTNLPF
Sbjct: 241 SGPDGDIQEPKTTLEHRWQEWLSKFKSGSVIYCAFGSECTLTKDQFQELVLGFELTNLPF 300

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
            AVLK P  +D V AALP+                                     GS+S
Sbjct: 301 LAVLKPPVGVDTVTAALPDGFEERVEGRGVVYGGWVQQQHILEHPSIGCFVTHCGAGSLS 360

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLV LPH GDHFFRART+S CLKVGVEVE+RE+DG F KESVC+AVKT+MDE 
Sbjct: 361 EALVKKCQLVLLPHVGDHFFRARTLSSCLKVGVEVEKREDDGFFTKESVCEAVKTLMDEG 420

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLI 423
           +E GKEIRA RAKLRELL+DKDLEESYI NFIH+L  L+
Sbjct: 421 NERGKEIRATRAKLRELLLDKDLEESYIINFIHNLQSLV 459

BLAST of Cla97C02G028650 vs. TrEMBL
Match: tr|A0A1S3C1R0|A0A1S3C1R0_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103495635 PE=3 SV=1)

HSP 1 Score: 674.1 bits (1738), Expect = 2.1e-190
Identity = 339/459 (73.86%), Postives = 371/459 (80.83%), Query Frame = 0

Query: 1   MAATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLIT 60
           MAATSSLHIAM+PWFAFGH +PYLQIANKLAKKGHKISF IPSK + KLQPFNHFP+LIT
Sbjct: 4   MAATSSLHIAMFPWFAFGHLSPYLQIANKLAKKGHKISFLIPSKIRAKLQPFNHFPNLIT 63

Query: 61  FVPIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTF 120
           FVPIIVPHVDGLPE AETTADVS+   FNLIMTAMDLTQPQIK +LQ+IKPH IFFDFTF
Sbjct: 64  FVPIIVPHVDGLPEGAETTADVSNLHQFNLIMTAMDLTQPQIKSLLQHIKPHVIFFDFTF 123

Query: 121 WMPKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHS 180
           W+PKLASQLGIKSIY+SVISATTF+YVF P+RQLCG + T  +F+KPP G  +S IKLHS
Sbjct: 124 WIPKLASQLGIKSIYYSVISATTFSYVFTPTRQLCGPDFTVDEFIKPPLGLATSAIKLHS 183

Query: 181 HEAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLL 240
           HEAK    MS+M FGSDV FFHRHFTGLCESDAIAFK+CREIEGPFVD+LISEFKKP+LL
Sbjct: 184 HEAKNVTFMSNMIFGSDVRFFHRHFTGLCESDAIAFKACREIEGPFVDHLISEFKKPILL 243

Query: 241 SGPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPF 300
           SGPDG+IQEP TT EHRW  WLSRFK GSVIYCAFGSECTLTK Q QELVLGFELTN PF
Sbjct: 244 SGPDGDIQEPKTTLEHRWQEWLSRFKYGSVIYCAFGSECTLTKDQLQELVLGFELTNQPF 303

Query: 301 FAVLKLPHSIDMVGAALPE-------------------------------------GSIS 360
           FAV K P  +D V AALP+                                     GS+S
Sbjct: 304 FAVFKPPVGVDTVTAALPDGFEERVEGRGVVYGGWVQQQHILEHPSIGCFVTHCGAGSLS 363

Query: 361 EALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEE 420
           EALVKKCQLV LPH GDH FRART+S  LKVGVEVE+RE+DG F KESVC+AVKTVMDE 
Sbjct: 364 EALVKKCQLVLLPHVGDHIFRARTLSSYLKVGVEVEKREDDGFFTKESVCEAVKTVMDEG 423

Query: 421 SEIGKEIRANRAKLRELLVDKDLEESYINNFIHSLHGLI 423
           +E GKEIRA RAKLRELL+DKDLEESYI+NFIH+L  L+
Sbjct: 424 NERGKEIRATRAKLRELLLDKDLEESYISNFIHNLQSLV 462

BLAST of Cla97C02G028650 vs. TrEMBL
Match: tr|A0A0A0K5E3|A0A0A0K5E3_CUCSA (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G041320 PE=3 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 1.1e-154
Identity = 282/457 (61.71%), Postives = 336/457 (73.52%), Query Frame = 0

Query: 5   SSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPI 64
           SSLHIAMYPWFAFGH  P+LQIANKLA KGH+IS FIPSKT P+LQ FNHFP+LITFV I
Sbjct: 18  SSLHIAMYPWFAFGHMIPFLQIANKLANKGHRISIFIPSKTLPELQHFNHFPNLITFVLI 77

Query: 65  IVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPK 124
            VPHVDGLP  A+TTAD+SHPS   L+M +MDLT+P+I   LQ+IKP+ IF+DF +W+ K
Sbjct: 78  TVPHVDGLPPGAQTTADISHPSQLPLLMISMDLTEPEIASCLQDIKPNVIFYDFAYWVTK 137

Query: 125 LASQLGIKSIYHSVISATTFAYVFPPSRQLCG-DNCTEADFMKPPPGYPSSTIKLHSHEA 184
           LA Q+GI SIY++V+SA T  YV     +L G D  T+ DFM+PPPG+PSS+IKLH+HEA
Sbjct: 138 LADQMGITSIYYNVVSAVTVGYVQGKIWELSGHDTLTQDDFMQPPPGFPSSSIKLHAHEA 197

Query: 185 KFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGP 244
           + +AS+SH+RF + +  F +  T     +A+A KSCREIEGPF+ Y+ +E KK VLLSG 
Sbjct: 198 QNFASLSHLRFSNGIALFDQFSTSFTNCNALALKSCREIEGPFIGYIENELKKHVLLSGA 257

Query: 245 DGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAV 304
             +++   T+ E RW  WL++F +GSVIYCAFGSEC LTK QFQEL+LG EL+NLPF AV
Sbjct: 258 -VDLEPLTTSLEERWEKWLAKFHSGSVIYCAFGSECILTKIQFQELLLGLELSNLPFLAV 317

Query: 305 LKLPHSIDMVGAALPE-------------------------------------GSISEAL 364
           LK P  ID V AALPE                                     GS++EAL
Sbjct: 318 LKPPEGIDTVEAALPEGFEQRIEGRGVVYGGWVQQQQILEHPSIGCFVTHCGAGSLNEAL 377

Query: 365 VKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEI 424
           V+KCQLV LPH  DHFFRART+S  LKVGVEVE+REEDG F+KESVCKAVKTVMDEE+E 
Sbjct: 378 VRKCQLVLLPHVSDHFFRARTLSSHLKVGVEVEKREEDGFFSKESVCKAVKTVMDEENES 437

BLAST of Cla97C02G028650 vs. TrEMBL
Match: tr|A0A1S3C1Q5|A0A1S3C1Q5_CUCME (anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like OS=Cucumis melo OX=3656 GN=LOC103495632 PE=4 SV=1)

HSP 1 Score: 547.4 bits (1409), Expect = 2.9e-152
Identity = 277/458 (60.48%), Postives = 327/458 (71.40%), Query Frame = 0

Query: 3   ATSSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFV 62
           A S LHIAMYPWFA GH   +LQI NKLA KGH+ISFFIPSKTQPKLQPFNHFP+LITFV
Sbjct: 4   APSGLHIAMYPWFALGHLIAFLQIGNKLANKGHRISFFIPSKTQPKLQPFNHFPNLITFV 63

Query: 63  PIIVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWM 122
           PI VPHVDGLP  AETTADVSHPS   LIMT+MD T+P+I   LQ+IKP  IF+D  +W+
Sbjct: 64  PITVPHVDGLPLGAETTADVSHPSQIPLIMTSMDRTEPEIASRLQDIKPEVIFYDLAYWV 123

Query: 123 PKLASQLGIKSIYHSVISATTFAYVFPPSRQLCGD-NCTEADFMKPPPGYPSSTIKLHSH 182
           PKLA  LGIKS+Y + +SA T +Y+     +  G  N T  D + PPP +P S+IKLH+H
Sbjct: 124 PKLAHPLGIKSVYFTAVSAVTMSYIQCKLWKFPGHYNLTRDDLLHPPPDFPCSSIKLHAH 183

Query: 183 EAKFYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLS 242
           EA++ AS   M+FG D+ FF R    L + +AIA KSCREIEGPF+DYL S  ++P+LL 
Sbjct: 184 EAQYLASFGRMKFGGDITFFERISNALSQCNAIALKSCREIEGPFIDYLESIVERPILLP 243

Query: 243 GPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFF 302
           G   N++   T+ E RWA WLS FK+GSVIYCAFGSEC LTK+QFQEL+LG EL+NLPFF
Sbjct: 244 G-TVNLEPLTTSLEERWANWLSEFKSGSVIYCAFGSECILTKNQFQELLLGLELSNLPFF 303

Query: 303 AVLKLPHSIDMVGAALPEG-------------------------------------SISE 362
             LK P  ID V AALP+G                                     S+SE
Sbjct: 304 VALKPPDGIDTVEAALPKGFKQRIEGRGIVYGGWVQQQQILDHPSIGCFITHCGAESLSE 363

Query: 363 ALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEES 422
           A+VKKCQLV      D  FRAR+MS+ LKVGVE+E+ EEDG+F+KESVCKAVKTVMDEE+
Sbjct: 364 AVVKKCQLVLFSRTTDQLFRARSMSKFLKVGVEIEKGEEDGVFSKESVCKAVKTVMDEEN 423

BLAST of Cla97C02G028650 vs. Swiss-Prot
Match: sp|Q53UH5|DUSKY_IPOPU (Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea purpurea OX=4121 GN=3GGT PE=2 SV=1)

HSP 1 Score: 450.7 bits (1158), Expect = 1.8e-125
Identity = 225/455 (49.45%), Postives = 297/455 (65.27%), Query Frame = 0

Query: 5   SSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPI 64
           ++ H+AMYPWF  GH T + ++ANKLA KGH+ISF IP  TQ KL+ FN  P LI+FVPI
Sbjct: 6   TTYHMAMYPWFGVGHLTGFFRLANKLAGKGHRISFLIPKNTQSKLESFNLHPHLISFVPI 65

Query: 65  IVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPK 124
           +VP + GLP  AETT+DV  PS  +L+M AMD TQ  I+ IL+++K   +F+DFT W+P 
Sbjct: 66  VVPSIPGLPPGAETTSDVPFPST-HLLMEAMDKTQNDIEIILKDLKVDVVFYDFTHWLPS 125

Query: 125 LASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAK 184
           LA ++GIKS+++S IS     Y   P R++ G   TEAD MK P  +P  +IKLH+HEA+
Sbjct: 126 LARKIGIKSVFYSTISPLMHGYALSPERRVVGKQLTEADMMKAPASFPDPSIKLHAHEAR 185

Query: 185 FYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPD 244
            + + + M+FG D+ FF R FT + ESD +A+ +CREIEG F DY+ ++F+KPVLL+GP 
Sbjct: 186 GFTARTVMKFGGDITFFDRIFTAVSESDGLAYSTCREIEGQFCDYIETQFQKPVLLAGPA 245

Query: 245 GNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVL 304
             +    +T E +W+ WL +FK GSVIYCAFGSECTL K +FQEL+ G ELT +PFFA L
Sbjct: 246 LPVPS-KSTMEQKWSDWLGKFKEGSVIYCAFGSECTLRKDKFQELLWGLELTGMPFFAAL 305

Query: 305 KLPHSIDMVGAALPE-------------------------------------GSISEALV 364
           K P   + V AA+PE                                      S+SEALV
Sbjct: 306 KPPFETESVEAAIPEELKEKIQGRGIVHGEWVQQQLFLQHPSVGCFVSHCGWASLSEALV 365

Query: 365 KKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIG 423
             CQ+V LP  GD    AR MS  LKVGVEVE+ EEDG+F++ESVCKAVK VMDE+SEIG
Sbjct: 366 NDCQIVLLPQVGDQIINARIMSVSLKVGVEVEKGEEDGVFSRESVCKAVKAVMDEKSEIG 425

BLAST of Cla97C02G028650 vs. Swiss-Prot
Match: sp|Q53UH4|DUSKY_IPONI (Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea nil OX=35883 GN=3GGT PE=1 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 2.4e-125
Identity = 224/455 (49.23%), Postives = 297/455 (65.27%), Query Frame = 0

Query: 5   SSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPI 64
           ++ H+AMYPWF  GH T + ++ANKLA KGH+ISF IP  TQ KL+ FN  P LI+FVPI
Sbjct: 6   TTYHMAMYPWFGVGHLTGFFRLANKLAGKGHRISFLIPKNTQSKLESFNLHPHLISFVPI 65

Query: 65  IVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPK 124
           +VP + GLP  AETT+DV  PS  +L+M AMD TQ  I+ IL+++K   +F+DFT W+P 
Sbjct: 66  VVPSIPGLPPGAETTSDVPFPST-HLLMEAMDKTQNDIEIILKDLKVDVVFYDFTHWLPS 125

Query: 125 LASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAK 184
           LA ++GIKS+++S IS     Y   P R++ G   TEAD MK P  +P  +IKLH+HEA+
Sbjct: 126 LARKIGIKSVFYSTISPLMHGYALSPERRVVGKQLTEADMMKAPASFPDPSIKLHAHEAR 185

Query: 185 FYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPD 244
            + + + M+FG D+ FF R FT + ESD +A+ +CREIEG F DY+ ++F+KPVLL+GP 
Sbjct: 186 GFTARTVMKFGGDITFFDRIFTAVSESDGLAYSTCREIEGQFCDYIETQFQKPVLLAGPA 245

Query: 245 GNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVL 304
             +    +T E +W+ WL +FK GSVIYCAFGSECTL K +FQEL+ G ELT +PFFA L
Sbjct: 246 LPVPS-KSTMEQKWSDWLGKFKEGSVIYCAFGSECTLRKDKFQELLWGLELTGMPFFAAL 305

Query: 305 KLPHSIDMVGAALPE-------------------------------------GSISEALV 364
           K P   + + AA+PE                                      S+SEALV
Sbjct: 306 KPPFEAESIEAAIPEELKEKIQGRGIVHGEWVQQQLFLQHPSVGCFVSHCGWASLSEALV 365

Query: 365 KKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIG 423
             CQ+V LP  GD    AR MS  LKVGVEVE+ EEDG+F++ESVCKAVK VMDE+SEIG
Sbjct: 366 NDCQIVLLPQVGDQIINARIMSVSLKVGVEVEKGEEDGVFSRESVCKAVKAVMDEKSEIG 425

BLAST of Cla97C02G028650 vs. Swiss-Prot
Match: sp|I1KEV6|FG3H_SOYBN (UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=2)

HSP 1 Score: 435.6 bits (1119), Expect = 6.1e-121
Identity = 221/453 (48.79%), Postives = 293/453 (64.68%), Query Frame = 0

Query: 7   LHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPIIV 66
           LHIAMYPW A GH T +L + NKLA +GHKISF  P K Q KL+PFN  P+ ITFV I V
Sbjct: 6   LHIAMYPWLAMGHQTAFLHLCNKLAIRGHKISFITPPKAQAKLEPFNLHPNSITFVTINV 65

Query: 67  PHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPKLA 126
           PHV+GLP  A+TTADV++P L   IMTAMDLT+  I+ +L  +KP  +F+DFT WMP LA
Sbjct: 66  PHVEGLPPDAQTTADVTYP-LQPQIMTAMDLTKDDIETLLTGLKPDLVFYDFTHWMPALA 125

Query: 127 SQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAKFY 186
            +LGIK++++   S+    Y   PSR   G +  E+D M+PP GYP S+IKL +HEA+ +
Sbjct: 126 KRLGIKAVHYCTASSVMVGYTLTPSRFHQGTDLMESDLMEPPEGYPDSSIKLQTHEARTF 185

Query: 187 ASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPDGN 246
           A+     FGS+VLF+ R F  L E+D +A+++CREIEGP++DY+  +F KPV+ +GP   
Sbjct: 186 AAKRKDTFGSNVLFYDRQFIALNEADLLAYRTCREIEGPYMDYIGKQFNKPVVATGP-VI 245

Query: 247 IQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVLKL 306
           +  P    E +++ WL  F+ GSV+YC FGSECTL  +QF ELVLG ELT +PF A +K 
Sbjct: 246 LDPPTLDLEEKFSTWLGGFEPGSVVYCCFGSECTLRPNQFLELVLGLELTGMPFLAAVKA 305

Query: 307 PHSIDMVGAALPE-------------------------------------GSISEALVKK 366
           P   + V +A+PE                                     GS+SEALV K
Sbjct: 306 PLGFETVESAMPEGFQERVKGRGFVYGGWVQQQLILAHPSVGCFITHCGSGSLSEALVNK 365

Query: 367 CQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIGKE 423
           CQLV LP+ GD    AR M   L+VGVEVE+ +EDG++ KESVCKAV  VMD E+E  K 
Sbjct: 366 CQLVLLPNVGDQILNARMMGTNLEVGVEVEKGDEDGMYTKESVCKAVSIVMDCENETSKR 425

BLAST of Cla97C02G028650 vs. Swiss-Prot
Match: sp|A0A0G4DBR5|FG3N_SOYBN (UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 4.0e-120
Identity = 219/453 (48.34%), Postives = 292/453 (64.46%), Query Frame = 0

Query: 7   LHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPIIV 66
           LHIAMYPW A GH   +L + NKLA +GHKISF  P K Q KL+PFN  P+ ITFV I V
Sbjct: 6   LHIAMYPWLAMGHQIAFLHLCNKLAIRGHKISFITPPKAQAKLEPFNLHPNSITFVTINV 65

Query: 67  PHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPKLA 126
           PHV+GLP  A+TTADV++P L   IMTAMDLT+  I+ +L  +KP  +F+DFT WMP LA
Sbjct: 66  PHVEGLPPDAQTTADVTYP-LQPQIMTAMDLTKDDIETLLTGLKPDLVFYDFTHWMPALA 125

Query: 127 SQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAKFY 186
            +LGIK++++   S+    Y   P+R   G +  E+D M+PP GYP S+IKL +HEA+ +
Sbjct: 126 KRLGIKAVHYCTASSVMIGYTLTPARFHQGTDLMESDLMEPPEGYPDSSIKLQTHEARVF 185

Query: 187 ASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPDGN 246
           A+     FGS+VLF+ R F  L E+D +A+++CREIEGP++DY+  +F KPV+ +GP   
Sbjct: 186 AAKRKDTFGSNVLFYDRQFIALNEADLLAYRTCREIEGPYMDYIGKQFNKPVVATGP-VI 245

Query: 247 IQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVLKL 306
           +  P    E +++ WL  F+ GSV+YC FGSECTL  +QF ELVLG ELT +PF A +K 
Sbjct: 246 LDPPTLDLEEKFSTWLGGFEPGSVVYCCFGSECTLRPNQFLELVLGLELTGMPFLAAVKA 305

Query: 307 PHSIDMVGAALPE-------------------------------------GSISEALVKK 366
           P   + V +A+PE                                     GS+SEALV K
Sbjct: 306 PLGFETVESAMPEGFQERVKGRGFVYGGWVQQQLILAHPSVGCFITHCGSGSLSEALVNK 365

Query: 367 CQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIGKE 423
           CQLV LP+ GD    AR M   L+VGVEVE+ +EDG++ KESVCKAV  VMD E+E  K 
Sbjct: 366 CQLVLLPNVGDQILNARMMGTNLEVGVEVEKGDEDGMYTKESVCKAVSIVMDCENETSKR 425

BLAST of Cla97C02G028650 vs. Swiss-Prot
Match: sp|Q9FN26|U79B6_ARATH (UDP-glycosyltransferase 79B6 OS=Arabidopsis thaliana OX=3702 GN=UGT79B6 PE=2 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 6.6e-91
Identity = 184/455 (40.44%), Postives = 254/455 (55.82%), Query Frame = 0

Query: 5   SSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPI 64
           S  H  M+PWF FGH T +L +ANKLA+K HKI+F +P K + +L+  N FP  I F  +
Sbjct: 3   SKFHAFMFPWFGFGHMTAFLHLANKLAEKDHKITFLLPKKARKQLESLNLFPDCIVFQTL 62

Query: 65  IVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPK 124
            +P VDGLP+ AETT+D+   SL + + +AMD T+ Q+K  +   KP  IFFDF  W+P+
Sbjct: 63  TIPSVDGLPDGAETTSDIP-ISLGSFLASAMDRTRIQVKEAVSVGKPDLIFFDFAHWIPE 122

Query: 125 LASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAK 184
           +A + G+KS+    ISA   A  F P R       ++ D    PPGYPSS + L  HE  
Sbjct: 123 IAREYGVKSVNFITISAACVAISFVPGR-------SQDDLGSTPPGYPSSKVLLRGHETN 182

Query: 185 FYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPD 244
             + +S+  FG    F+ R   GL   D I+ ++C+E+EG F D++ ++F++ VLL+GP 
Sbjct: 183 SLSFLSY-PFGDGTSFYERIMIGLKNCDVISIRTCQEMEGKFCDFIENQFQRKVLLTGPM 242

Query: 245 GNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVL 304
               + +   E +W  WLS+F  GSVIYCA GS+  L K QFQEL LG ELT LPF   +
Sbjct: 243 LPEPDNSKPLEDQWRQWLSKFDPGSVIYCALGSQIILEKDQFQELCLGMELTGLPFLVAV 302

Query: 305 KLPHSIDMVGAALPE-------------------------------------GSISEALV 364
           K P     +  ALP+                                     GS+ EALV
Sbjct: 303 KPPKGSSTIQEALPKGFEERVKARGVVWGGWVQQPLILAHPSIGCFVSHCGFGSMWEALV 362

Query: 365 KKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIG 423
             CQ+VF+PH G+     R MS  LKV VEV +REE G F+KES+  AV++VMD +SE+G
Sbjct: 363 NDCQIVFIPHLGEQILNTRLMSEELKVSVEV-KREETGWFSKESLSGAVRSVMDRDSELG 422

BLAST of Cla97C02G028650 vs. TAIR10
Match: AT5G54010.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 335.9 bits (860), Expect = 3.6e-92
Identity = 184/455 (40.44%), Postives = 254/455 (55.82%), Query Frame = 0

Query: 5   SSLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPI 64
           S  H  M+PWF FGH T +L +ANKLA+K HKI+F +P K + +L+  N FP  I F  +
Sbjct: 3   SKFHAFMFPWFGFGHMTAFLHLANKLAEKDHKITFLLPKKARKQLESLNLFPDCIVFQTL 62

Query: 65  IVPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPK 124
            +P VDGLP+ AETT+D+   SL + + +AMD T+ Q+K  +   KP  IFFDF  W+P+
Sbjct: 63  TIPSVDGLPDGAETTSDIP-ISLGSFLASAMDRTRIQVKEAVSVGKPDLIFFDFAHWIPE 122

Query: 125 LASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAK 184
           +A + G+KS+    ISA   A  F P R       ++ D    PPGYPSS + L  HE  
Sbjct: 123 IAREYGVKSVNFITISAACVAISFVPGR-------SQDDLGSTPPGYPSSKVLLRGHETN 182

Query: 185 FYASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPD 244
             + +S+  FG    F+ R   GL   D I+ ++C+E+EG F D++ ++F++ VLL+GP 
Sbjct: 183 SLSFLSY-PFGDGTSFYERIMIGLKNCDVISIRTCQEMEGKFCDFIENQFQRKVLLTGPM 242

Query: 245 GNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVL 304
               + +   E +W  WLS+F  GSVIYCA GS+  L K QFQEL LG ELT LPF   +
Sbjct: 243 LPEPDNSKPLEDQWRQWLSKFDPGSVIYCALGSQIILEKDQFQELCLGMELTGLPFLVAV 302

Query: 305 KLPHSIDMVGAALPE-------------------------------------GSISEALV 364
           K P     +  ALP+                                     GS+ EALV
Sbjct: 303 KPPKGSSTIQEALPKGFEERVKARGVVWGGWVQQPLILAHPSIGCFVSHCGFGSMWEALV 362

Query: 365 KKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIG 423
             CQ+VF+PH G+     R MS  LKV VEV +REE G F+KES+  AV++VMD +SE+G
Sbjct: 363 NDCQIVFIPHLGEQILNTRLMSEELKVSVEV-KREETGWFSKESLSGAVRSVMDRDSELG 422

BLAST of Cla97C02G028650 vs. TAIR10
Match: AT5G53990.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 332.4 bits (851), Expect = 4.0e-91
Identity = 183/458 (39.96%), Postives = 259/458 (56.55%), Query Frame = 0

Query: 6   SLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPII 65
           + H  M+PWFAFGH TPYL +ANKLA KGH+++F +P K Q +L+  N FP  I F  + 
Sbjct: 4   NFHAFMFPWFAFGHMTPYLHLANKLAAKGHRVTFLLPKKAQKQLEHHNLFPDRIIFHSLT 63

Query: 66  VPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPKL 125
           +PHVDGLP  AET +D+   SL   +  AMDLT+ Q++  ++ ++P  IFFD  +W+P++
Sbjct: 64  IPHVDGLPAGAETASDIP-ISLGKFLTAAMDLTRDQVEAAVRALRPDLIFFDTAYWVPEM 123

Query: 126 ASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAKF 185
           A +  +KS+ + VISA + A+   P  +L            PPPGYPSS +    H+A  
Sbjct: 124 AKEHRVKSVIYFVISANSIAHELVPGGEL----------GVPPPGYPSSKVLYRGHDAHA 183

Query: 186 YASMSHMRFGSDVLFFHR-HF---TGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLS 245
             + S        +F+ R H+   TGL   D I+ ++C+EIEG F DY+  ++++ VLL+
Sbjct: 184 LLTFS--------IFYERLHYRITTGLKNCDFISIRTCKEIEGKFCDYIERQYQRKVLLT 243

Query: 246 GPDGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFF 305
           GP     + +   E RW  WL++FK GSVIYCA GS+ TL K QFQEL LG ELT LPF 
Sbjct: 244 GPMLPEPDNSRPLEDRWNHWLNQFKPGSVIYCALGSQITLEKDQFQELCLGMELTGLPFL 303

Query: 306 AVLKLPHSIDMVGAALPE-------------------------------------GSISE 365
             +K P     +  ALPE                                     GS+ E
Sbjct: 304 VAVKPPKGAKTIQEALPEGFEERVKNHGVVWGEWVQQPLILAHPSVGCFVTHCGFGSMWE 363

Query: 366 ALVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEES 423
           +LV  CQ+V LP+  D     R MS  L+V VEV +REE G F+KES+  A+ +VMD++S
Sbjct: 364 SLVSDCQIVLLPYLCDQILNTRLMSEELEVSVEV-KREETGWFSKESLSVAITSVMDKDS 423

BLAST of Cla97C02G028650 vs. TAIR10
Match: AT1G64920.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 330.9 bits (847), Expect = 1.2e-90
Identity = 179/457 (39.17%), Postives = 256/457 (56.02%), Query Frame = 0

Query: 7   LHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPIIV 66
           +H  M+PWFAFGH TPYL + NKLA+KGH+++F +P K Q +L+  N FP  I F P+++
Sbjct: 5   IHAFMFPWFAFGHMTPYLHLGNKLAEKGHRVTFLLPKKAQKQLEHQNLFPHGIVFHPLVI 64

Query: 67  PHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPKLA 126
           PHVDGLP  AET +D+   SL   +  AMDLT+ QI+  +  ++P  I FD   W+P++A
Sbjct: 65  PHVDGLPAGAETASDIP-ISLVKFLSIAMDLTRDQIEAAIGALRPDLILFDLAHWVPEMA 124

Query: 127 SQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAKFY 186
             L +KS+ ++V+SAT+ A+   P  +L             PPGYPSS      H+A   
Sbjct: 125 KALKVKSMLYNVMSATSIAHDLVPGGEL----------GVAPPGYPSSKALYREHDAHAL 184

Query: 187 ASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPDGN 246
            + S    G    F+HR  TGL   D I+ ++C EIEG F DY+ S++KK VLL+GP   
Sbjct: 185 LTFS----GFYKRFYHRFTTGLMNCDFISIRTCEEIEGKFCDYIESQYKKKVLLTGPMLP 244

Query: 247 IQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVLKL 306
             + +   E +W+ WLS F  GSV++CA GS+  L K+QFQEL LG ELT LPF   +K 
Sbjct: 245 EPDKSKPLEDQWSHWLSGFGQGSVVFCALGSQTILEKNQFQELCLGIELTGLPFLVAVKP 304

Query: 307 PHSIDMVGAALPE-----------------------------------------GSISEA 366
           P   + +  ALPE                                         GS+ E+
Sbjct: 305 PKGANTIHEALPEGFEERVKGRGIVWGEWVQQPSWQPLILAHPSVGCFVSHCGFGSMWES 364

Query: 367 LVKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESE 423
           L+  CQ+VF+P   D     R M+  L+V VEV+ REE G F+KE++  A+ ++MD++SE
Sbjct: 365 LMSDCQIVFIPVLNDQVLTTRVMTEELEVSVEVQ-REETGWFSKENLSGAIMSLMDQDSE 424

BLAST of Cla97C02G028650 vs. TAIR10
Match: AT1G64910.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 330.1 bits (845), Expect = 2.0e-90
Identity = 181/456 (39.69%), Postives = 253/456 (55.48%), Query Frame = 0

Query: 6   SLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPII 65
           + H  M+PWFAFGH TPYL +ANKLA++GH+I+F IP K Q +L+  N FP  I F  + 
Sbjct: 4   TFHAFMFPWFAFGHMTPYLHLANKLAERGHRITFLIPKKAQKQLEHLNLFPDSIVFHSLT 63

Query: 66  VPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPKL 125
           +PHVDGLP  AET +D+  P L+  +  A+DLT+ Q++  +  + P  I FD   W+P++
Sbjct: 64  IPHVDGLPAGAETFSDIPMP-LWKFLPPAIDLTRDQVEAAVSALSPDLILFDIASWVPEV 123

Query: 126 ASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAKF 185
           A +  +KS+ +++ISAT+ A+ F P  +L            PPPGYPSS +    H+A  
Sbjct: 124 AKEYRVKSMLYNIISATSIAHDFVPGGEL----------GVPPPGYPSSKLLYRKHDAHA 183

Query: 186 YASMS--HMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGP 245
             S S  + RF       HR  TGL   D I+ ++C+EIEG F +YL  ++ K V L+GP
Sbjct: 184 LLSFSVYYKRFS------HRLITGLMNCDFISIRTCKEIEGKFCEYLERQYHKKVFLTGP 243

Query: 246 DGNIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAV 305
                      E RW+ WL+ F+ GSV++CA GS+ TL K QFQEL LG ELT LPFF  
Sbjct: 244 MLPEPNKGKPLEDRWSHWLNGFEQGSVVFCALGSQVTLEKDQFQELCLGIELTGLPFFVA 303

Query: 306 LKLPHSIDMVGAALPE-------------------------------------GSISEAL 365
           +  P     +  ALPE                                     GS+ E++
Sbjct: 304 VTPPKGAKTIQDALPEGFEERVKDRGVVLGEWVQQPLLLAHPSVGCFLSHCGFGSMWESI 363

Query: 366 VKKCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEI 423
           +  CQ+V LP   D     R M+  LKV VEV+ REE G F+KES+  A+ +VMD+ SEI
Sbjct: 364 MSDCQIVLLPFLADQVLNTRLMTEELKVSVEVQ-REETGWFSKESLSVAITSVMDQASEI 423

BLAST of Cla97C02G028650 vs. TAIR10
Match: AT2G22930.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 329.3 bits (843), Expect = 3.4e-90
Identity = 181/455 (39.78%), Postives = 257/455 (56.48%), Query Frame = 0

Query: 6   SLHIAMYPWFAFGHFTPYLQIANKLAKKGHKISFFIPSKTQPKLQPFNHFPSLITFVPII 65
           + H  M+PWFAFGH  P+L +ANKLA+KGH+I+F +P K Q +L+  N FP  I F P+ 
Sbjct: 4   TFHAFMFPWFAFGHMIPFLHLANKLAEKGHQITFLLPKKAQKQLEHHNLFPDSIVFHPLT 63

Query: 66  VPHVDGLPEAAETTADVSHPSLFNLIMTAMDLTQPQIKCILQNIKPHFIFFDFTFWMPKL 125
           +PHV+GLP  AETT+D+S  S+ NL+  A+DLT+ Q++  ++ ++P  IFFDF  W+P++
Sbjct: 64  IPHVNGLPAGAETTSDIS-ISMDNLLSEALDLTRDQVEAAVRALRPDLIFFDFAHWIPEI 123

Query: 126 ASQLGIKSIYHSVISATTFAYVFPPSRQLCGDNCTEADFMKPPPGYPSSTIKLHSHEAKF 185
           A +  IKS+ + ++SATT AY F P   L            PPPGYPSS +    ++A  
Sbjct: 124 AKEHMIKSVSYMIVSATTIAYTFAPGGVL----------GVPPPGYPSSKVLYRENDAHA 183

Query: 186 YASMSHMRFGSDVLFFHRHFTGLCESDAIAFKSCREIEGPFVDYLISEFKKPVLLSGPDG 245
            A++S          +H+  TG    D IA ++C EIEG F DY+ S++ K VLL+GP  
Sbjct: 184 LATLSIFY----KRLYHQITTGFKSCDIIALRTCNEIEGKFCDYISSQYHKKVLLTGPML 243

Query: 246 NIQEPATTFEHRWAAWLSRFKAGSVIYCAFGSECTLTKHQFQELVLGFELTNLPFFAVLK 305
             Q+ +   E + + +LSRF   SV++CA GS+  L K QFQEL LG ELT LPF   +K
Sbjct: 244 PEQDTSKPLEEQLSHFLSRFPPRSVVFCALGSQIVLEKDQFQELCLGMELTGLPFLIAVK 303

Query: 306 LPHSIDMVGAALPE-------------------------------------GSISEALVK 365
            P     V   LPE                                     G+I E L+ 
Sbjct: 304 PPRGSSTVEEGLPEGFQERVKGRGVVWGGWVQQPLILDHPSIGCFVNHCGPGTIWECLMT 363

Query: 366 KCQLVFLPHNGDHFFRARTMSRCLKVGVEVERREEDGIFNKESVCKAVKTVMDEESEIGK 424
            CQ+V LP  GD     R M+   KV VEV  RE+ G F+KES+  A+K+VMD++S++GK
Sbjct: 364 DCQMVLLPFLGDQVLFTRLMTEEFKVSVEVS-REKTGWFSKESLSDAIKSVMDKDSDLGK 423

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008455479.11.8e-20678.48PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis m... [more]
XP_022952333.12.5e-19575.38UDP-glycosyltransferase 79B30-like [Cucurbita moschata][more]
XP_022969084.15.1e-19374.51UDP-glycosyltransferase 79B30-like [Cucurbita maxima][more]
XP_023554470.18.8e-19374.51UDP-glycosyltransferase 79B30-like [Cucurbita pepo subsp. pepo][more]
XP_004144602.16.3e-19174.29PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis s... [more]
Match NameE-valueIdentityDescription
tr|A0A1S4E0D1|A0A1S4E0D1_CUCME1.2e-20678.48Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103495633 PE=3 SV=1[more]
tr|A0A0A0K3E5|A0A0A0K3E5_CUCSA4.2e-19174.29Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G041300 PE=3 SV=1[more]
tr|A0A1S3C1R0|A0A1S3C1R0_CUCME2.1e-19073.86Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103495635 PE=3 SV=1[more]
tr|A0A0A0K5E3|A0A0A0K5E3_CUCSA1.1e-15461.71Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G041320 PE=3 SV=1[more]
tr|A0A1S3C1Q5|A0A1S3C1Q5_CUCME2.9e-15260.48anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like OS=Cucumis melo OX=36... [more]
Match NameE-valueIdentityDescription
sp|Q53UH5|DUSKY_IPOPU1.8e-12549.45Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea purpurea OX=412... [more]
sp|Q53UH4|DUSKY_IPONI2.4e-12549.23Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea nil OX=35883 GN... [more]
sp|I1KEV6|FG3H_SOYBN6.1e-12148.79UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=2[more]
sp|A0A0G4DBR5|FG3N_SOYBN4.0e-12048.34UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=1[more]
sp|Q9FN26|U79B6_ARATH6.6e-9140.44UDP-glycosyltransferase 79B6 OS=Arabidopsis thaliana OX=3702 GN=UGT79B6 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
AT5G54010.13.6e-9240.44UDP-Glycosyltransferase superfamily protein[more]
AT5G53990.14.0e-9139.96UDP-Glycosyltransferase superfamily protein[more]
AT1G64920.11.2e-9039.17UDP-Glycosyltransferase superfamily protein[more]
AT1G64910.12.0e-9039.69UDP-Glycosyltransferase superfamily protein[more]
AT2G22930.13.4e-9039.78UDP-Glycosyltransferase superfamily protein[more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016740 transferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G028650.1Cla97C02G028650.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 5..255
e-value: 1.5E-31
score: 111.7
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 320..401
e-value: 8.8E-11
score: 43.6
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 256..319
e-value: 1.2E-8
score: 36.7
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 4..320
NoneNo IPR availablePANTHERPTHR11926:SF356SUBFAMILY NOT NAMEDcoord: 320..419
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 320..419
NoneNo IPR availablePANTHERPTHR11926:SF356SUBFAMILY NOT NAMEDcoord: 4..320
NoneNo IPR availableSUPERFAMILYSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 7..407

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C02G028650CsGy7G003040Cucumber (Gy14) v2cgybwmbB502
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C02G028650Silver-seed gourdcarwmbB0202
Cla97C02G028650Silver-seed gourdcarwmbB0348
Cla97C02G028650Silver-seed gourdcarwmbB1124
Cla97C02G028650Cucumber (Gy14) v2cgybwmbB118
Cla97C02G028650Cucumber (Gy14) v2cgybwmbB500
Cla97C02G028650Cucumber (Gy14) v1cgywmbB248
Cla97C02G028650Cucurbita maxima (Rimu)cmawmbB105
Cla97C02G028650Cucurbita maxima (Rimu)cmawmbB531
Cla97C02G028650Cucurbita maxima (Rimu)cmawmbB582
Cla97C02G028650Cucurbita maxima (Rimu)cmawmbB618
Cla97C02G028650Cucurbita moschata (Rifu)cmowmbB094
Cla97C02G028650Cucurbita moschata (Rifu)cmowmbB512
Cla97C02G028650Cucurbita moschata (Rifu)cmowmbB552
Cla97C02G028650Wild cucumber (PI 183967)cpiwmbB126
Cla97C02G028650Wild cucumber (PI 183967)cpiwmbB552
Cla97C02G028650Wild cucumber (PI 183967)cpiwmbB553
Cla97C02G028650Cucumber (Chinese Long) v3cucwmbB127
Cla97C02G028650Cucumber (Chinese Long) v3cucwmbB548
Cla97C02G028650Cucumber (Chinese Long) v3cucwmbB549
Cla97C02G028650Cucumber (Chinese Long) v2cuwmbB124
Cla97C02G028650Cucumber (Chinese Long) v2cuwmbB527
Cla97C02G028650Cucumber (Chinese Long) v2cuwmbB529
Cla97C02G028650Bottle gourd (USVL1VR-Ls)lsiwmbB055
Cla97C02G028650Bottle gourd (USVL1VR-Ls)lsiwmbB086
Cla97C02G028650Melon (DHL92) v3.6.1medwmbB090
Cla97C02G028650Melon (DHL92) v3.6.1medwmbB164
Cla97C02G028650Melon (DHL92) v3.5.1mewmbB094
Cla97C02G028650Melon (DHL92) v3.5.1mewmbB178
Cla97C02G028650Watermelon (Charleston Gray)wcgwmbB138
Cla97C02G028650Watermelon (Charleston Gray)wcgwmbB150
Cla97C02G028650Watermelon (97103) v1wmwmbB317
Cla97C02G028650Wax gourdwgowmbB241
Cla97C02G028650Wax gourdwgowmbB376