Cla97C02G045300 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C02G045300
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: UDP-Glycosyltransferase superfamily protein
Location: Cla97Chr02: 33367256 .. 33373061 (+)
RNA-Seq Expression: Cla97C02G045300
Synteny: Cla97C02G045300
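
The Location field above can be split into its parts and turned into a feature length. The following is a minimal sketch, not part of the database record; it assumes the coordinates are 1-based and inclusive (as is common for GFF-derived annotations), and the names `parse_location` and `feature_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Pattern for strings such as "Cla97Chr02: 33367256 .. 33373061 (+)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> tuple[str, int, int, str]:
    """Split a location string into (sequence id, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


seqid, start, end, strand = parse_location("Cla97Chr02: 33367256 .. 33373061 (+)")
print(seqid, strand, feature_length(start, end))
```

Under the inclusive-coordinate assumption, the span of this gene on Cla97Chr02 works out to 5806 bp.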
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCCTACTCCGCTCCACTCTACAAATTTTGATTCCCTAAAGATATCGCCGCCATAGCTCAAGCTTCCGATAACCCAAGATGGATCCGATCAGTGGTTCTGTAGCACCCAGACGGATTCATCTGGCGGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTTCTCTCTCTCAAAAACCCCAACATTCTCATCTCCTTCATCGTCACCGACGAGTGGCTCACCTTCCTCGCCGCCGATCCCAAACCCCAAAACATCCGTTTCGCCACTTTCCCCAATGTTATCCCCTCTGAGCTCGGCCGCGCCAACGACTTCACCGGTTTCCTCCGATCCATCCACACCCATATGGAGGCTCCCGTTGAGACTCTACTCCATCGCCTCGACCCGCCGCCGACTGCCATTCTCGCCGATGCCTTCGTCACTTGGGCTGTCCAGTTGGGGAAACGCCTCAATGTTCCGGTCGCTTCACTCTGGCCCATGTCGGCTACGGTTTTCTCCATCCTTTACCATTTCGACCTTCTCAAGGAAAATGGGCATTTTCCAGCAGATCTCTCAGGTAACTTTAAAATCTAATTGATGGAAATTACATTTTTGGAATCAGGATTTAAGTTCTTTCTTACTCTGTTTTGAAGAGCGGGGAGAAGAGATTGTCGATTACTTCCCCGGAGTGTCGAAGATTTGTCTTGCAGATTTGCCGTCTTTCTTTTCTGGCGATGGTCTCCAAAGCGTCGAATCCGCCGTGAACTCCGCCCGTTCCGTCGACAAATCCCAATTTTTCATCTCCACCTCTGTTTACGAGCTTGAATCCTCTGTTATCGACGCCTTAAAAGCAAAATTTCCCTTCCCGATTTACACCATCGGACCCAGTACTCCATATTTCGAGCTAGAATGCTCCGCCCCAAATGGCGGCACCGACGACTATTTCCGGTGGCTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACATTTCACAGGGCAGTTATCTTTCAGTTTCTAGCGCCCAAATAGACGAGATCGTCGCCGGGGTGAAAGCCAGCGGCGTTCGGTTCTTGTGGGTGGTGCGTGGAGATGACGGCCGGTTGAAGGACGTGGACAGAGAAACTGGGATGGTGGTTGGATGGTGCGATCAATTGAAGGTTCTGTGCCACAGATCCGTGGGAGGGTTTTGGACTCACGGCGGTTGGAATTCAACTCTGGAAGGGGTTTTCGCCGGCGTTCCGATGCTTGCTTGGCCGATATTCTGGGATCAATTTCCGAACAGTAAGAAGATTGCGGAGGATTGGAAAGTTGGGGTCCGATTTAAAGCAGTTGGGGGTAAGGATTTGGTGAGGAGAGTGGAAATTGCCGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGATGAAGGGAGGGAGTTGAGGAACAGAGTATCGGAATTTCAAGAGATTTGCCGGCGAGCGGTGGCGAAAGGTGGTTCCTCTGATTCCAACATTGATGCATTTCTCAAACATATTTCAGGAGAGTTATGAATGAATTTGAGATGTTTAGCAGTTTGGGTGATTATGAATTTTGATTGGGAGGTGGTTTGATCCGAGTTGGTGATTTTAAAACACCAGAATAATATTCCATAACAAGATGAAAAATCCCATCTTTTGTGCTATCTATCAAATGGTTAAATTCATAGTATTAAACATTTCTTGGTATATATTTATACTTCTATCAATATTTTCACCTTTAATTAGATGATTTTGATTTGTTCTTTTGGACAATCATGGGTTGTAACATGTTCCCACACTTCAAATAAAATAAAGATCATTTCAAACCCTAAACCTAAAATTCATGTTTTTTTTTCCTTTTCATTATATCATAAAAAATATTCACAAGACACAAAAATATACCACAAAATATCATAAACCTAAATATGATATAAATAATGAATGTGTTAATTTGTTTAAAAATCATAATATGTTTATTCACTCATTTAAATTTATTTTTTGAGGTTTTT
TTTTTTATAATTTTTTTTAGAAACATTCATTAATATTGAGATTTTCATATTCTTACACTTTTCTCGTAAATTTAATTTTTTTTACACTTTTTATCAACAACTAGATTTTAAAGCTTGTTTCAATCATAATTTGTTGCGACAATTCTCCTTGCTTCTTTTCTAAAAGATTTTAATTTATTACTCATGTCGAATGAAATTGAGATGCATAAATATTGGTCCATTTTAAGGTTTATTTTACTTTTTCTCAAAGTAGTCATCCAATATTTAACAAAACAAATCACAAATTGATTCATTCCTCCTTTCTTTCTTTCTTTTTTTTTTTTTTTTCCTATTTTGATAATGTATATGAGTACAATCCACTAAAATAAATGCTTTCAACTACTTAATTGTTTTTTTTTAGGAAAAAACAAGAAAAAAGGTTAAAGTATTTCAAATATGAATCTAAATTGGTCATTTCTTTTTATAAAACTAAAACAATATAAATGAACCTATGTTGGAAAAAAAAATGTGAAATTTTGTTGGAATAAATATCTCCATGACTATCTAAGATAAATTGTTATTTCTAACACAAAAAGTTTTTTTTATGGTACATAACTAACATATTTTCTAAAATTTCTTTTTGTAACTCTATTTTCAACTTTTTTGAGTTTTAGAGATGGGTCTCACATGAAGAAATGAAGTTAATACCCAAAAACCATCTTCCTAAACACTCAAACCATTGTTTTCTTCCTGAAACCATCTTCTTCCTTAGAGTCCAGTTTTTTTATCGTACTCTCTAATTTATGTCATTTATGTAAAACTCATCCTATGTGACCCGATTTCTGGAAATAAACCCACAAAGAACTTCTTTGAAACGCTTATTTATGGAAATTTAGGTTTTCGAATAGTTGGTGGTAAAAGAGGAAAAGAAAATCCAAAATTTTCAGGTAGAGTAGATTTAATATGTGCAGGAAATGTCTAAAATCATATAAAAAAATAATTTGGTATGTGTTTTTTTGCTCATTATCATCTTTCCCGTAAGTTGTATGAGTGTACGAGAAAATTGACAAAATGCCCAAATCTCATAAATTACACAGATTGCCCAAATCTCAGAAGATTTGAACAAAATTAACGAGAACAAGAGCGATACTAGTGAGACGATATTCACCCGAGAAACATGTGAGAATTTTGAGTTTGGTTGATGGTGGTATAAAGTAGGCAACAAGCTTAAGTCTCCTTCTCTCCTTGAGGTGAGATAACGAAGGTGTTTTTGAAATCTCGTTCATGCCTTTTTATCGTCCTTCTTACTCTTTTAAGCCCTATTTATATGACATGTGTTTATGCTATTTTTTATTTTTCTCACTTTCAATCATATTTAATTGATTGCTATTTACGTTTATGTGCATCTATTTATGTTCAACTCCTTATTATTTATTGTTATTTTCATCAATGTCATTATTTTTCTTATTTTATAATTTTTTCTCTCTTTTCTTACCCTTTTTTTTTTCTTTGGGTCCCATTTTCCCTTCCCTTTTTTCCTTCTCTCCCTCTGTCTCACATCTTCTAGTGTCCTGCTGCTCTTATTAATTTTATTTCTATTGTTTTGTTTCTAAAAAAAGAAAAGAAAAAAAAAAATAAGTATGTGTGTTGAGTAAATAAATAAAAAGCAAGAAAGACAAACTTTCTTCTACCCGTTTGGATTGATTTGAGAAAATTGTTTTTTTATTTAAAATACACCTCAAAATCTATTTTGAATTGTTGGTAAACACTTCAAATTTTTTCAAAACAATTTATTTTTTAAATTAAACACTTAAAAATGTAATACAATTACACTCATTTTCTTTGTAGGCCATAATAACAGTTAAATTTGGAAATGTAAATAGCTTGAAGTAAAAAGCCAAACAAGATAAAGAGAAGAGAAAAATTGGGTTTTGTCGCTAGACAATTAATTTTTTAAAAAATTACTTTTAGATTATTTTTTATGTAAATAGAAAAAAGAGAGTATAAACTAAAGTATTGGG
TAGACTACAAAAGAGTTTTGGTGGAGATTTTTTTGGGTTGGAGAGATATGCTCTGTTAAATGGCTTTTTTTTTTTTTTTTTTCAAGGAAATAAAAGAAATTTCGAAGAAAATCTTCTCAAACTATTATTATCATCCAATCAAATTTTTATTATTATTATTAATTTATTTATGTTTCTTTCAGCTTTTCTTTACTAAGTATTCCTTAGTCATATTTTCATATTTTCAATCATATGTTATATTGTGTGTTTTGTCCGCATGGTATTTAACTGGGAAGGGAGTATTCGCTTTAATAATCGTTTTACTCTATTTCATTCCCTCAGAGAGATCGCTGCCGTAGCTCAAGCTTCCGATAACCCCAAAATGGATCCGATCAGTGGTCCTGCAGCACCCAGACGGATTCATCTGGCTGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTTCTCTCTCTCAGAAACCCCAACATTCTCATCTCCTTCATCGTCACCGACAAGTGGCTCACCTTCCTCGCCGCCGACCCCAAGCCCCAAAACATCCATTTTGCCACTTTCCCCAATGTTATCCCCTCTGAACTCCGCCGCGCCAACGACTTCCTCGGTTTCTTCCGGTCCATCCAAACTCATATGTTGCCTCCCGTTGAGACTCTTCTCCGCCGCCTCGACCCGCCGCTGACTGCCATAAGTGCCGATTCCTTCCTCACTTGGGCTGTCCAGTTGAGCAAACGCCTCAATGTTCCGGTCGCTTCACTCTGGCCCATGTCCGCTACGGTTTTCTCCATTCTTTACCATTTCGACTTTCTCAAGGAAAATAGGCATTTCCCAGCCGATCTCTCAGGTACTGCAAGAATTTCCCCATTTCCGAAATTTGTAATTTGTGGGTAAATTTTTTTTACGGAAAAAGCTGTTGTTCTTGTTTTAAAGAGCGTGGTGAAGAGATTGTCGATTACATTTCCGGAGTTTCCAAGATTCGTCTTGCAGATCTTCCCACTTTCTTCTCCGGCGTCGGACTTGAAGTCCTCGGTTCAACATTGGAAGCGGCGCGTTCTGTTGACAAAGCTCAATTTCTCATCTCCACCTCTGTTTACGAACTTGAAACCTCTGTAATCGACGTTTTGAAACCGAAATTTCCCTTTCCAGTTTACACAATCAGGCCCTGTACGCCATATTTCGAGGCCTTAAACGGCTGCACCAATGACTATCTTCGGTGGCTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACGTTTCGGAGGGGAGTTATCTTTCAGTTTCAAGCTCCCAAATGGACGAGATCGTCGCTGGTGTGAAAGCTAGCGGCGTTCGGTTCTTGTGGGTGGCGCGTGGAGATGACGCTCGGTTTAAGGACGTGGACAGAGAAACTGGGATGGTAGTTTAATGGTGCGACCAATTGAGGGTTCTGTGCCATAGCGCCGTGGGGGGATTTTGGACTCACGGCGGTAGGAATTCTACTTTGGAAGGGGTTTTCGCCGGCGTCCCGATGCTTGCTTGGCCAATACTTTGGGATCAGTTTCCGAACAGTAAGAAGATTGCTGAGGATTGGAAAGTTGGAGTCCGATTTAAAGCAGTTGGGGGTAGGGATTTGGTGAGGAGAGAGGAAATTACAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGGAGGGAGATGAGGAACAGAGTGTCGGACTTACAAGAGATTTGCCAGCGAGCGGTGGTGAAAGGTGGTTCTTTTGATTCCAACATTGATGCATTTCTGAACCATATTTCTAGAGAGTTATGA

mRNA sequence

TCCTACTCCGCTCCACTCTACAAATTTTGATTCCCTAAAGATATCGCCGCCATAGCTCAAGCTTCCGATAACCCAAGATGGATCCGATCAGTGGTTCTGTAGCACCCAGACGGATTCATCTGGCGGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTTCTCTCTCTCAAAAACCCCAACATTCTCATCTCCTTCATCGTCACCGACGAGTGGCTCACCTTCCTCGCCGCCGATCCCAAACCCCAAAACATCCGTTTCGCCACTTTCCCCAATGTTATCCCCTCTGAGCTCGGCCGCGCCAACGACTTCACCGGTTTCCTCCGATCCATCCACACCCATATGGAGGCTCCCGTTGAGACTCTACTCCATCGCCTCGACCCGCCGCCGACTGCCATTCTCGCCGATGCCTTCGTCACTTGGGCTGTCCAGTTGGGGAAACGCCTCAATGTTCCGGTCGCTTCACTCTGGCCCATGTCGGCTACGGTTTTCTCCATCCTTTACCATTTCGACCTTCTCAAGGAAAATGGGCATTTTCCAGCAGATCTCTCAGAGCGGGGAGAAGAGATTGTCGATTACTTCCCCGGAGTGTCGAAGATTTGTCTTGCAGATTTGCCGTCTTTCTTTTCTGGCGATGGTCTCCAAAGCGTCGAATCCGCCGTGAACTCCGCCCGTTCCGTCGACAAATCCCAATTTTTCATCTCCACCTCTGTTTACGAGCTTGAATCCTCTGTTATCGACGCCTTAAAAGCAAAATTTCCCTTCCCGATTTACACCATCGGACCCAGTACTCCATATTTCGAGCTAGAATGCTCCGCCCCAAATGGCGGCACCGACGACTATTTCCGGTGGCTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACATTTCACAGGGCAGTTATCTTTCAGTTTCTAGCGCCCAAATAGACGAGATCGTCGCCGGGGTGAAAGCCAGCGGCGTTCGGTTCTTGTGGGTGGTGCGTGGAGATGACGGCCGGTTGAAGGACGTGGACAGAGAAACTGGGATGGTGGTTGGATGGTGCGATCAATTGAAGGTTCTGTGCCACAGATCCGTGGGAGGGTTTTGGACTCACGGCGGTTGGAATTCAACTCTGGAAGGGGTTTTCGCCGGCGTTCCGATGCTTGCTTGGCCGATATTCTGGGATCAATTTCCGAACAGTAAGAAGATTGCGGAGGATTGGAAAGTTGGGGTCCGATTTAAAGCAGTTGGGGGTAAGGATTTGGTGAGGAGAGTGGAAATTGCCGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGATGAAGGGAGGGAGTTGAGGAACAGAGTATCGGAATTTCAAGAGATTTGCCGGCGAGCGGTGGCGAAAGGTGGTTCCTCTGATTCCAACATTGATGCATTTCTCAAACATATTTCAGGAGAAGAGATCGCTGCCGTAGCTCAAGCTTCCGATAACCCCAAAATGGATCCGATCAGTGGTCCTGCAGCACCCAGACGGATTCATCTGGCTGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTTCTCTCTCTCAGAAACCCCAACATTCTCATCTCCTTCATCGTCACCGACAAGTGGCTCACCTTCCTCGCCGCCGACCCCAAGCCCCAAAACATCCATTTTGCCACTTTCCCCAATGTTATCCCCTCTGAACTCCGCCGCGCCAACGACTTCCTCGGTTTCTTCCGGTCCATCCAAACTCATATGTTGCCTCCCGTTGAGACTCTTCTCCGCCGCCTCGACCCGCCGCTGACTGCCATAAGTGCCGATTCCTTCCTCACTTGGGCTGTCCAGTTGAGCAAACGCCTCAATGTTCCGGTCGCTTCACTCTGGCCCATGTCCGCTACGGTTTTCTCCATTCTTTACCATTTCGACTTTCTCAAGGAAAATAGGCATTTCCCAGCCGATCTCTCAGAGCGTGGTGAAGAGATTGTCGAT
TACATTTCCGGAGTTTCCAAGATTCGTCTTGCAGATCTTCCCACTTTCTTCTCCGGCGTCGGACTTGAAGTCCTCGGTTCAACATTGGAAGCGGCGCGTTCTGTTGACAAAGCTCAATTTCTCATCTCCACCTCTGTTTACGAACTTGAAACCTCTGTAATCGACGTTTTGAAACCGAAATTTCCCTTTCCAGTTTACACAATCAGGCCCTGTACGCCATATTTCGAGGCCTTAAACGGCTGCACCAATGACTATCTTCGGTGGCTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACGTTTCGGAGGGGAGTTATCTTTCAGTTTCAAGCTCCCAAATGGACGAGATCGTCGCTGGTGTGAAAGCTAGCGGCGTTCGGTTCTTGTGGGTGGCGCGTGGAGATGACGCTCGGGTTCTGTGCCATAGCGCCGTGGGGGGATTTTGGACTCACGGCGGTAGGAATTCTACTTTGGAAGGGGTTTTCGCCGGCGTCCCGATGCTTGCTTGGCCAATACTTTGGGATCAGTTTCCGAACAGTAAGAAGATTGCTGAGGATTGGAAAGTTGGAGTCCGATTTAAAGCAGTTGGGGGTAGGGATTTGGTGAGGAGAGAGGAAATTACAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGGAGGGAGATGAGGAACAGAGTGTCGGACTTACAAGAGATTTGCCAGCGAGCGGTGGTGAAAGGTGGTTCTTTTGATTCCAACATTGATGCATTTCTGAACCATATTTCTAGAGAGTTATGA

Coding sequence (CDS)

ATGGATCCGATCAGTGGTTCTGTAGCACCCAGACGGATTCATCTGGCGGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTTCTCTCTCTCAAAAACCCCAACATTCTCATCTCCTTCATCGTCACCGACGAGTGGCTCACCTTCCTCGCCGCCGATCCCAAACCCCAAAACATCCGTTTCGCCACTTTCCCCAATGTTATCCCCTCTGAGCTCGGCCGCGCCAACGACTTCACCGGTTTCCTCCGATCCATCCACACCCATATGGAGGCTCCCGTTGAGACTCTACTCCATCGCCTCGACCCGCCGCCGACTGCCATTCTCGCCGATGCCTTCGTCACTTGGGCTGTCCAGTTGGGGAAACGCCTCAATGTTCCGGTCGCTTCACTCTGGCCCATGTCGGCTACGGTTTTCTCCATCCTTTACCATTTCGACCTTCTCAAGGAAAATGGGCATTTTCCAGCAGATCTCTCAGAGCGGGGAGAAGAGATTGTCGATTACTTCCCCGGAGTGTCGAAGATTTGTCTTGCAGATTTGCCGTCTTTCTTTTCTGGCGATGGTCTCCAAAGCGTCGAATCCGCCGTGAACTCCGCCCGTTCCGTCGACAAATCCCAATTTTTCATCTCCACCTCTGTTTACGAGCTTGAATCCTCTGTTATCGACGCCTTAAAAGCAAAATTTCCCTTCCCGATTTACACCATCGGACCCAGTACTCCATATTTCGAGCTAGAATGCTCCGCCCCAAATGGCGGCACCGACGACTATTTCCGGTGGCTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACATTTCACAGGGCAGTTATCTTTCAGTTTCTAGCGCCCAAATAGACGAGATCGTCGCCGGGGTGAAAGCCAGCGGCGTTCGGTTCTTGTGGGTGGTGCGTGGAGATGACGGCCGGTTGAAGGACGTGGACAGAGAAACTGGGATGGTGGTTGGATGGTGCGATCAATTGAAGGTTCTGTGCCACAGATCCGTGGGAGGGTTTTGGACTCACGGCGGTTGGAATTCAACTCTGGAAGGGGTTTTCGCCGGCGTTCCGATGCTTGCTTGGCCGATATTCTGGGATCAATTTCCGAACAGTAAGAAGATTGCGGAGGATTGGAAAGTTGGGGTCCGATTTAAAGCAGTTGGGGGTAAGGATTTGGTGAGGAGAGTGGAAATTGCCGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGATGAAGGGAGGGAGTTGAGGAACAGAGTATCGGAATTTCAAGAGATTTGCCGGCGAGCGGTGGCGAAAGGTGGTTCCTCTGATTCCAACATTGATGCATTTCTCAAACATATTTCAGGAGAAGAGATCGCTGCCGTAGCTCAAGCTTCCGATAACCCCAAAATGGATCCGATCAGTGGTCCTGCAGCACCCAGACGGATTCATCTGGCTGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTTCTCTCTCTCAGAAACCCCAACATTCTCATCTCCTTCATCGTCACCGACAAGTGGCTCACCTTCCTCGCCGCCGACCCCAAGCCCCAAAACATCCATTTTGCCACTTTCCCCAATGTTATCCCCTCTGAACTCCGCCGCGCCAACGACTTCCTCGGTTTCTTCCGGTCCATCCAAACTCATATGTTGCCTCCCGTTGAGACTCTTCTCCGCCGCCTCGACCCGCCGCTGACTGCCATAAGTGCCGATTCCTTCCTCACTTGGGCTGTCCAGTTGAGCAAACGCCTCAATGTTCCGGTCGCTTCACTCTGGCCCATGTCCGCTACGGTTTTCTCCATTCTTTACCATTTCGACTTTCTCAAGGAAAATAGGCATTTCCCAGCCGATCTCTCAGAGCGTGGTGAAGAGATTGTCGATTACATTTCCGGAGTTTCCAAGATTCGTCTTGCAGATCTTCCCACTTTCTTCTCCGGCGTCGGACTTGAAGTCCTCGG
TTCAACATTGGAAGCGGCGCGTTCTGTTGACAAAGCTCAATTTCTCATCTCCACCTCTGTTTACGAACTTGAAACCTCTGTAATCGACGTTTTGAAACCGAAATTTCCCTTTCCAGTTTACACAATCAGGCCCTGTACGCCATATTTCGAGGCCTTAAACGGCTGCACCAATGACTATCTTCGGTGGCTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACGTTTCGGAGGGGAGTTATCTTTCAGTTTCAAGCTCCCAAATGGACGAGATCGTCGCTGGTGTGAAAGCTAGCGGCGTTCGGTTCTTGTGGGTGGCGCGTGGAGATGACGCTCGGGTTCTGTGCCATAGCGCCGTGGGGGGATTTTGGACTCACGGCGGTAGGAATTCTACTTTGGAAGGGGTTTTCGCCGGCGTCCCGATGCTTGCTTGGCCAATACTTTGGGATCAGTTTCCGAACAGTAAGAAGATTGCTGAGGATTGGAAAGTTGGAGTCCGATTTAAAGCAGTTGGGGGTAGGGATTTGGTGAGGAGAGAGGAAATTACAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGGAGGGAGATGAGGAACAGAGTGTCGGACTTACAAGAGATTTGCCAGCGAGCGGTGGTGAAAGGTGGTTCTTTTGATTCCAACATTGATGCATTTCTGAACCATATTTCTAGAGAGTTATGA

Protein sequence

MDPISGSVAPRRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPNVIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFVTWAVQLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFSGDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELECSAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRGDDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRFKAVGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHISGEEIAAVAQASDNPKMDPISGPAAPRRIHLAALPYPGRGHINALMNLCKLLSLRNPNILISFIVTDKWLTFLAADPKPQNIHFATFPNVIPSELRRANDFLGFFRSIQTHMLPPVETLLRRLDPPLTAISADSFLTWAVQLSKRLNVPVASLWPMSATVFSILYHFDFLKENRHFPADLSERGEEIVDYISGVSKIRLADLPTFFSGVGLEVLGSTLEAARSVDKAQFLISTSVYELETSVIDVLKPKFPFPVYTIRPCTPYFEALNGCTNDYLRWLDSQAEGSVLYVSEGSYLSVSSSQMDEIVAGVKASGVRFLWVARGDDARVLCHSAVGGFWTHGGRNSTLEGVFAGVPMLAWPILWDQFPNSKKIAEDWKVGVRFKAVGGRDLVRREEITEFVKRFMNSESVEGREMRNRVSDLQEICQRAVVKGGSFDSNIDAFLNHISREL
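
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 12 codons and last 6 codons of the CDS listed above.
print(translate("ATGGATCCGATCAGTGGTTCTGTAGCACCCAGACGG"))
print(translate("ATTTCTAGAGAGTTATGA"))
```

The first call yields the residues that open the protein sequence (MDPISGSVAPRR), and the second yields its closing residues followed by the stop codon.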
Homology
BLAST of Cla97C02G045300 vs. NCBI nr
Match: KAF4349973.1 (hypothetical protein G4B88_024484 [Cannabis sativa])

HSP 1 Score: 923.3 bits (2385), Expect = 1.6e-264
Identity = 474/950 (49.89%), Positives = 622/950 (65.47%), Query Frame = 0

Query: 8   VAPRRIHLAALPYPGRGHINALMNLCKLLSLK-NPNILISFIVTDEWLTFLAADPKPQNI 67
           +A    H+ A+PYPGR HINA+MNLCK LS +   +ILI+F+VT EW   L +DPKP NI
Sbjct: 1   MATMTCHVVAMPYPGRSHINAVMNLCKQLSARVKYDILITFVVTQEWFGLLRSDPKPDNI 60

Query: 68  RFATFPNVIPSELGRANDFTGFLRSIHTHMEAPVETLLHRL-DPPPTAILADAFVTWAVQ 127
           +FAT PNV+PSE  RA D +GFL ++ T +EAP E LL  L +PP   I+AD+ + W + 
Sbjct: 61  QFATIPNVVPSEHVRAEDLSGFLEAVSTKLEAPFEELLDGLHEPPVKVIIADSIMAWPIH 120

Query: 128 LGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFP--ADLSERGEEIVDYFPGVSKICL 187
           +G R N+PVAS WP+SA++FS+ YHFDLLK+ GH+P  A   E G++IVDY PG+S   L
Sbjct: 121 VGNRRNIPVASFWPLSASMFSVFYHFDLLKQRGHYPIQAPDVESGDKIVDYIPGISTTPL 180

Query: 188 ADLPS-FFSGDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIG 247
           +DL    F  +  +  E  + +   V K Q+ +STSVYELES V D LK KF FP+Y++G
Sbjct: 181 SDLSDRLFHSNNEKMAELIIEAVSKVTKVQYLLSTSVYELESQVFDVLKLKFSFPLYSMG 240

Query: 248 P--STPYFELECSAPN----GGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAG 307
           P   +P  +LE +  +        +Y +WLDSQ E SVLYIS GS+LSVS  Q+DEIVAG
Sbjct: 241 PISISPQIQLENTFDDTTNISTVVEYIQWLDSQPEASVLYISFGSFLSVSDTQLDEIVAG 300

Query: 308 VKASGVRFLWVVRGDDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGV 367
           ++  GVR +WV R +  ++KD   + G VV WCDQL+VLCH S+GGFWTH GWNSTLE +
Sbjct: 301 IRTGGVRHMWVARENVSKIKDGCGDVGFVVPWCDQLRVLCHPSIGGFWTHCGWNSTLEAI 360

Query: 368 FAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRFK-----AVGGKD---LVRRVEIAEFVKR 427
           FAGVPML +PI  DQ  NSK+I E+WK+G +          G D   LVRR EI+  V+R
Sbjct: 361 FAGVPMLTFPITADQHSNSKQIVEEWKIGCKVNDKKKIICAGTDQISLVRRDEISVLVER 420

Query: 428 FMNSESDEGRELRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHISGEEIAAVAQASDNPK 487
           FM+ +S   + ++NRV E Q+ C+ A  K  S                ++ V +A   P 
Sbjct: 421 FMDPDSIGMKVMKNRVKELQKSCQLAFRKTTS----------------LSPVPKA---PS 480

Query: 488 MDPISGPAAPRRIHLAALPYPGRGHINALMNLCKLLSLRNPNILISFIVTDKWLTFLAAD 547
           M  +         H+ A+PYPGRGHIN LMN+CK L  RN  +L++F++T++WL  L +D
Sbjct: 481 MGTLQTVEPTTDCHVVAMPYPGRGHINPLMNICKELVSRNARLLVTFVITEEWLGLLGSD 540

Query: 548 PKPQNIHFATFPNVIPSELRRANDFLGFFRSIQTHMLPPVETLLRRLDPPLTAISADSFL 607
           PKP  + F T PNVIPSE  RA +F GFF+S+  ++  P E LL RLD P+  I AD++L
Sbjct: 541 PKPDRVRFRTVPNVIPSEHGRAKNFSGFFQSVTNNLKAPFEELLDRLDTPVNVIIADTYL 600

Query: 608 TWAVQLSKRLNVPVASLWPMSATVFSILYHFDFLKENRHFPADLSERGEEIVDYISGVSK 667
            W   +  R N+PVASLWPMSA+VFS+  HFD +++N HFP DLS RG EIVDYI G+  
Sbjct: 601 IWMTDVGNRRNIPVASLWPMSASVFSVFRHFDLVEQNGHFPIDLSVRGHEIVDYIPGIPT 660

Query: 668 IRLADLPTFFSGVGLEVLGSTLEAARSVDKAQFLISTSVYELETSVIDVLKPKFPFPVYT 727
           IR+ DLPT F G G +VL    EA   V KAQFL+STSVYELE+ V D LK KFPFPVY 
Sbjct: 661 IRVEDLPTIFEGEGRKVLKWAKEATSKVSKAQFLLSTSVYELESQVFDALKAKFPFPVYP 720

Query: 728 IRPCTPYFEALNGCTN-----DYLRWLDSQAEGSVLYVSEGSYLSVSSSQMDEIVAGVKA 787
           + P  P+ + L  C+N     DY +WLDSQ +GSVLY+S GS+LSVS++Q+DE+VAG++ 
Sbjct: 721 LGPSIPHSQ-LQTCSNYSDTADYFKWLDSQPQGSVLYISLGSFLSVSAAQLDELVAGIRG 780

Query: 788 SGVRFLWVAR-----------GDD------------ARVLCHSAVGGFWTHGGRNSTLEG 847
           SG R+LWVAR           GDD             RVLCH+++GGFWTH G NSTLE 
Sbjct: 781 SGTRYLWVARDNVSKIKEYGTGDDDELGFVVPWCDQLRVLCHASIGGFWTHCGWNSTLEA 840

Query: 848 VFAGVPMLAWPILWDQFPNSKKIAEDWKVGVRFKAVGG------------RDLVRREEIT 899
           +++GVPML  PI WDQ P+SK+I EDWK+G      GG            + LV+RE+I 
Sbjct: 841 IYSGVPMLTCPIFWDQVPDSKQIVEDWKIGYNVIKKGGTTMSSRTTDDDDQGLVKREKIA 900
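
The percentages in the HSP header for KAF4349973.1 are plain ratios of matching columns to alignment length. The sketch below recomputes them from the counts reported in that header (474 identical and 622 positive-scoring columns out of 950); `percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to match the displayed values.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage (2 decimals)."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


identity = percent(474, 950)   # identical residues / aligned columns
positives = percent(622, 950)  # positive-scoring substitutions / aligned columns
print(identity, positives)
```

The same calculation applies to the other hits, for example 474 identities over 1057 columns for the Malus domestica match.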

BLAST of Cla97C02G045300 vs. NCBI nr
Match: RXH84926.1 (hypothetical protein DVH24_041694 [Malus domestica])

HSP 1 Score: 901.0 bits (2327), Expect = 8.5e-258
Identity = 474/1057 (44.84%), Positives = 624/1057 (59.04%), Query Frame = 0

Query: 14   HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
            H+ ALPYPGRGHIN +MN CKLLS K P+ILI+F++T+EW  F+ +D KP NIRF+T PN
Sbjct: 138  HVMALPYPGRGHINPMMNFCKLLSSKKPDILITFVITEEWQGFIGSDAKPDNIRFSTLPN 197

Query: 74   VIPSELGRANDFTGFLRSIHTHMEAPVETLL--HRLDPPPTAILADAFVTWAVQLGKRLN 133
            VIPSEL RA +F GF+ +++T +E P + LL    L  P   I+AD F+ WAV++G   N
Sbjct: 198  VIPSELVRATNFPGFVEAVNTELEGPFDQLLLERDLQQPVNVIVADPFLVWAVRVGNGRN 257

Query: 134  VPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFS 193
            +PVAS WPMSA+VF++ +HF+LLK+NGHFP D+ ERG+E++DY PG+S   +ADLP+   
Sbjct: 258  IPVASFWPMSASVFTVFHHFELLKQNGHFPVDVLERGDEVIDYIPGISTTRIADLPTILY 317

Query: 194  GDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFEL- 253
            G+  Q +  A+ +  S+ K+Q+ +STS+YELES V D LKAK P P+Y IGP+ PYF+L 
Sbjct: 318  GNDRQLLHRAMETISSMYKAQYILSTSIYELESQVFDNLKAKLPIPVYPIGPTIPYFQLS 377

Query: 254  ECSAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRG 313
            E S+       Y  WLDSQ + SVLYIS GS+LSVS  Q+DE+V GV+ SGVRFLWV RG
Sbjct: 378  ESSSILHDGLSYLHWLDSQPKASVLYISMGSFLSVSQTQMDELVFGVRDSGVRFLWVARG 437

Query: 314  DDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWD 373
            D  RLK+   + G+VV WCDQL+V CH S+GGFW+H GW+ST+E V+AG+P+L  PIFWD
Sbjct: 438  DASRLKESVGDVGLVVPWCDQLRVFCHDSIGGFWSHCGWSSTIEAVYAGLPVLTCPIFWD 497

Query: 374  QFPNSKKIAEDWKVGVRFKA-VGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQE 433
            Q PNS++I +DWK+G R K   G + LV R EIA+ V+RFM+ ES+EG+++R R  + Q+
Sbjct: 498  QVPNSRQIVDDWKIGYRVKKNEGAEHLVTREEIAQLVRRFMDLESNEGKKMRKRAKQLQK 557

Query: 434  ICRRAVAKGG-------------------------------------------------- 493
             C+ A+AK                                                    
Sbjct: 558  TCQEAIAKASIFSLQSSREKWAQRKLRQSVCHVVALPYQGRGHINPMMNLCKLLSSKNPL 617

Query: 494  ---------------SSDSNID----------------------AFLKHISGEEIAAVAQ 553
                            SD  +D                      AFL+ +  +  A V Q
Sbjct: 618  LLITFVVTEEWHGFIESDRKLDNIRLVTIPNVIPSENGRAKDFAAFLEAVWTKMEAPVEQ 677

Query: 554  ASD----------------------NPKMDPISGPAAPRR-------------------- 613
              D                      N +  P++    P                      
Sbjct: 678  LLDGLEPPVTAIVADTFLVWALRVGNRRNIPVASLWTPSPTLFSMLHHFELFKENGHFPL 737

Query: 614  ------------------------IHLAALPYPGRGHINALMNLCKLLSLRNPNILISFI 673
                                     HL ALPYPGRGHIN +MNLCK LS +NP + I+F+
Sbjct: 738  DVSGALQTSYKREMGTMKVEPITVCHLVALPYPGRGHINPMMNLCKQLSSKNPQLFITFV 797

Query: 674  VTDKWLTFLAADPKPQNIHFATFPNVIPSELRRANDFLGFFRSIQTHMLPPVETLLRRLD 733
            VT++W  F+ ++PKP+NI  AT PNVIPSE  RA DF  F  ++ T +  PVE L+  L+
Sbjct: 798  VTEEWRGFIESNPKPENIRLATIPNVIPSEHGRAKDFAAFVEAVWTKLEAPVEQLMDGLE 857

Query: 734  PPLTAISADSFLTWAVQLSKRLNVPVASLWPMSATVFSILYHFDFLKENRHFPADLSERG 793
             P+TAI AD+FL WA+++  R N+PVASLW  S T+FS+L+HF+  KEN HF  D+SERG
Sbjct: 858  QPVTAIVADTFLVWALRIGNRRNIPVASLWTQSPTMFSVLHHFELFKENGHFALDVSERG 917

Query: 794  EEIVDYISGVSKIRLADLPTFFSGVGLEVLGSTLEAARSVDKAQFLISTSVYELETSVID 853
            +EIV+YI GVS   +ADLP  F     +VL   +E     +KA++L+ TSVYEL+  V +
Sbjct: 918  DEIVEYIPGVSTTCIADLPAIFFTDDPKVLHKAIEVISEAEKAKYLLFTSVYELDPQVFE 977

Query: 854  VLKPKFPFPVYTIRPCTPYFE-----ALNGCTNDYLRWLDSQAEGSVLYVSEGSYLSVSS 889
             LK KF FP+Y I P  P+FE       N    DYL WLDSQ + SVLY+S GS+LSVS 
Sbjct: 978  ALKAKFAFPIYPIGPSIPHFELSKTLPTNQNDIDYLHWLDSQPKKSVLYISMGSFLSVSK 1037

BLAST of Cla97C02G045300 vs. NCBI nr
Match: CAE6021044.1 (unnamed protein product [Arabidopsis arenosa])

HSP 1 Score: 879.8 bits (2272), Expect = 2.0e-251
Identity = 454/915 (49.62%), Positives = 603/915 (65.90%), Query Frame = 0

Query: 15  LAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPNV 74
           + A+PYPGRGHIN +MNLCK L  + PN+ ++F+VT+EWL F+ +DPKP  I FAT PN+
Sbjct: 1   MVAMPYPGRGHINPMMNLCKRLVHRYPNLHVTFVVTEEWLGFIGSDPKPDRIHFATLPNL 60

Query: 75  IPSELGRANDFTGFLRSIHTHMEAPVETLLHRL-DPPPTAILADAFVTWAVQLGKRLNVP 134
           IPSEL RA DF GF+ +++T +E P E LL  L  PPP+AI+AD +V WAV++G+R N+P
Sbjct: 61  IPSELVRAKDFIGFIDAVYTRLEEPFEKLLDGLTSPPPSAIIADTYVIWAVRVGRRRNIP 120

Query: 135 VASLWPMSATVFSILYHFDLLKENGHFPADLSE-RGEEIVDYFPGVSKICLADLPSFFSG 194
           V SLW MSAT+ S   H DLL  +GH   + SE + EE+VDY PG+    L DLP  F G
Sbjct: 121 VVSLWTMSATILSFFLHADLLISHGHALFEQSESKEEEVVDYVPGLPPTKLRDLPPIFDG 180

Query: 195 DGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELEC 254
              +  + A      +  ++  + T+ YELE   IDA  +K   P+Y  GP  P+ EL  
Sbjct: 181 YSHRLFKKAKLCFDELLGAKCLVFTTAYELEHKAIDAFTSKLDIPVYATGPLIPFEELSV 240

Query: 255 SAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRGDD 314
              N    DY RWLD Q E SVLYISQGS+LSVS  Q++EI+ GV+ SGVRFLWV RG +
Sbjct: 241 RNDN-KEPDYIRWLDEQPESSVLYISQGSFLSVSEVQMEEIIVGVRESGVRFLWVARGGE 300

Query: 315 GRLKD-VDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQ 374
            +LK+ ++  +G+VV WCDQL+VLCH +VGGFWTH G+NSTLEG+++GVPMLA+P+FWDQ
Sbjct: 301 LKLKEALEGSSGVVVSWCDQLRVLCHAAVGGFWTHCGFNSTLEGIYSGVPMLAFPLFWDQ 360

Query: 375 FPNSKKIAEDWKVGVRFKAVGGKD-LVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEI 434
             N+K I EDW+VG+R +     + L+ R EI E VKRFM+ ES+EG+E+R R  +  EI
Sbjct: 361 ILNAKMIVEDWRVGMRIERTKNTELLIGREEIKEVVKRFMDRESEEGKEMRRRACDLSEI 420

Query: 435 CRRAVAKGGSSDSNIDAFLKHISGEEIAAVAQASDNPKMDPISGPAAPRRIHLAALPYPG 494
            R AVAK        +A + H+                M+PI       R H+ A+P+PG
Sbjct: 421 SRGAVAKS-------EAPVAHL----------------MNPIKPEPLGVR-HVVAMPWPG 480

Query: 495 RGHINALMNLCKLLSLRNPNILISFIVTDKWLTFLAADPKPQNIHFATFPNVIPSELRRA 554
           RGHIN ++NLCK L  R+PN++++F+VT++WL F+ +DPKP  IHFAT PN+IPSEL RA
Sbjct: 481 RGHINPMLNLCKRLVRRDPNLIVTFVVTEEWLGFIGSDPKPNRIHFATLPNLIPSELVRA 540

Query: 555 NDFLGFFRSIQTHMLPPVETLLRRLDPPL-TAISADSFLTWAVQLSKRLNVPVASLWPMS 614
           NDF+GF  ++ T +  P E LL RL+ PL T I AD+++ WAV++  + N+PVAS W  S
Sbjct: 541 NDFIGFVDAVLTRLEQPFEQLLDRLNSPLPTVIIADTYIIWAVRVGTKRNIPVASFWTTS 600

Query: 615 ATVFSILYHFDFLKENRHFPADLSE-RGEEIVDYISGVSKIRLADLPTFFSGVGLEVLGS 674
           AT+ S+  H D L  + HFP + SE + +EIVDYI G+S  RL DL   F G  L+V   
Sbjct: 601 ATILSLFIHTDLLASHGHFPIEPSESKLDEIVDYIPGLSPTRLRDL-QIFHGYSLQVFNI 660

Query: 675 TLEAARSVDKAQFLISTSVYELETSVIDVLKPKFPFPVYTIRPCTPYFEALNGCTN---D 734
              +   + KA++L+ +S YE+E   ID    KF FPVY+  P  P+ E   G  N   D
Sbjct: 661 FKTSFGELSKAKYLLFSSAYEIEPKAIDFFTSKFDFPVYSTGPLIPFEELSVGNENRELD 720

Query: 735 YLRWLDSQAEGSVLYVSEGSYLSVSSSQMDEIVAGVKASGVRFLWVARG----------- 794
           Y+RWLD Q E SVLY+S+GS+LSVS +QM+EIV GV+ SGVRFLWVARG           
Sbjct: 721 YIRWLDEQPESSVLYISQGSFLSVSDAQMEEIVVGVRESGVRFLWVARGGELKLKEALEG 780

Query: 795 ---------DDARVLCHSAVGGFWTHGGRNSTLEGVFAGVPMLAWPILWDQFPNSKKIAE 854
                    D  RVLCH+AVGGFWTH G NSTLEG+++GVPML +P+  DQF N+K I E
Sbjct: 781 SLGVVVSWCDQLRVLCHAAVGGFWTHCGFNSTLEGIYSGVPMLTFPLFRDQFLNAKMIVE 840

Query: 855 DWKVGVRFKAVGGRD-LVRREEITEFVKRFMNSESVEGREMRNRVSDLQEICQRAVVKGG 900
           +W+VG+R ++    + L+   EI   VK+FM+ ES EG+EMR R  DL EIC+ AV + G
Sbjct: 841 EWRVGMRIESKKQTELLIVSNEIKGLVKKFMDGESEEGKEMRRRTCDLSEICRGAVAETG 889

BLAST of Cla97C02G045300 vs. NCBI nr
Match: KAG5603112.1 (hypothetical protein H5410_034482 [Solanum commersonii])

HSP 1 Score: 870.9 bits (2249), Expect = 9.4e-249
Identity = 438/914 (47.92%), Positives = 600/914 (65.65%), Query Frame = 0

Query: 14   HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
            H+ A+PYPGRGHIN ++N CK++  K  NI ++FIVT+EWL+ ++++  P+NI++AT PN
Sbjct: 173  HIVAMPYPGRGHINPMINFCKIIVTKYSNIFVTFIVTEEWLSLISSENLPENIKYATIPN 232

Query: 74   VIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFVTWAVQLGKRLNVP 133
            VIPSE  RANDFT F ++  T+ME PVE L+  L   P  I+ D +++W ++LG R N+P
Sbjct: 233  VIPSEFDRANDFTAFFKATLTNMEGPVEKLIDALTMKPIVIVYDTYLSWVIRLGNRRNIP 292

Query: 134  VASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFSGD 193
            VAS + MSATVFSI YH DLL +NGH  A+LS +  E VDY PG+  I + DLP+ F G 
Sbjct: 293  VASFFTMSATVFSIGYHMDLLAQNGHLRANLSGKMHEQVDYIPGIPSIRILDLPTPFYGK 352

Query: 194  GLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELE-- 253
            G + ++  ++   +V K+Q+ + TSVYELESSVI+ALK KFP P+Y+IGP+ PYF  E  
Sbjct: 353  GQELLDVVMDIFSTVSKAQYLLFTSVYELESSVINALKQKFPIPVYSIGPAIPYFTCEKN 412

Query: 254  -CSAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRG 313
              S  +    +Y +WL++Q  GSVLYISQGS+LSVS  ++DEIVAGV+ SGVRF WV R 
Sbjct: 413  PSSTTSIDEPEYIKWLNAQPNGSVLYISQGSFLSVSRDELDEIVAGVQDSGVRFFWVARD 472

Query: 314  DDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWD 373
            +  R +      G+VV WCDQLKVL H S+GGFW+H GWNST E  F+G+PML +PIFWD
Sbjct: 473  ETVRFQKNGCSVGLVVPWCDQLKVLSHPSIGGFWSHCGWNSTKEAAFSGLPMLTFPIFWD 532

Query: 374  QFPNSKKIAEDWKVGVRFKAVGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEI 433
            Q  NSK+IAEDWK+G R K    + ++R  EI+  +K  M+S ++E  E R R  E Q+I
Sbjct: 533  QRTNSKQIAEDWKIGNRVKKHDQRSILRE-EISSLLKWSMDSGNEEVMETRRRAKEIQKI 592

Query: 434  CRRAVAKGGSSDSNIDAFLKHISGEEIAAVAQASDNPKMDPISGPAAPRRIHLAALPYPG 493
            C+ + A GGSS+ NI+AF+K++                             H   +PYPG
Sbjct: 593  CQCSTANGGSSEINIEAFIKNV---------------------------LCHKYTMPYPG 652

Query: 494  RGHINALMNLCKLLSLRNPNILISFIVTDKWLTFLAADPKPQNIHFATFPNVIPSELRRA 553
            RGHIN +MN CK++  + P+I I+FIVT++W + ++++  P+NI +AT PNVIPSE  RA
Sbjct: 653  RGHINPMMNFCKIIVTKYPSIFITFIVTEEWYSLISSENLPENIKYATIPNVIPSEFGRA 712

Query: 554  NDFLGFFRSIQTHMLPPVETLLRRLDPPLTAISADSFLTWAVQLSKRLNVPVASLWPMSA 613
             DF+GF ++  T M  PVE L+  L    + I  D++L+W V L  R N+PVAS + MSA
Sbjct: 713  KDFVGFVKATLTKMEGPVEKLIDELMMKPSVIVYDTYLSWVVGLGNRRNIPVASFFTMSA 772

Query: 614  TVFSILYHFDFLKENRHFPADLSERGEEIVDYISGVSKIRLADLP-TFFSGVGLEVLGST 673
            T+FSI YH D L +N H   +LS +  E VDYI G+  IR+ DLP + + G G E+L   
Sbjct: 773  TMFSIGYHMDLLAQNAHLRGNLSGKMHEQVDYIPGIPSIRVLDLPISSYDGKGQELLDIV 832

Query: 674  LEAARSVDKAQFLISTSVYELETSVIDVLKPKFPFPVYTIRPCTPYFEALNGCTN----- 733
            ++   +V KAQ+L+ TSVYELE+SVI+ LK KFP PVY+I P  PYF +    ++     
Sbjct: 833  MDIFSTVSKAQYLLFTSVYELESSVINALKQKFPIPVYSIGPAIPYFTSEKNPSSTTSID 892

Query: 734  --DYLRWLDSQAEGSVLYVSEGSYLSVSSSQMDEIVAGVKASGVRFLWVARGDDAR---- 793
              +Y++WL++Q  GSVLY+S+GS+LSVS  ++DEI+AGV  SGVRF WVAR + AR    
Sbjct: 893  EPEYIKWLNAQPNGSVLYISQGSFLSVSRDELDEIIAGVHDSGVRFFWVARDETARFQKY 952

Query: 794  ---------------VLCHSAVGGFWTHGGRNSTLEGVFAGVPMLAWPILWDQFPNSKKI 853
                           VL H ++GGFW+H G NST E  F+G+PML +PI WDQ  NSK+I
Sbjct: 953  GCSVGLVVPWCDQLKVLSHPSIGGFWSHCGWNSTKEAAFSGLPMLTFPIFWDQRTNSKQI 1012

Query: 854  AEDWKVGVRFKAVGGRDLVRREEITEFVKRFMNSESVEGREMRNRVSDLQEICQRAVVKG 898
             EDWK+G R K    +  + REEI+  +K FM+S + E  E R R  ++Q+ICQ +   G
Sbjct: 1013 VEDWKIGYRVKK-HDQCSITREEISSLLKWFMDSGNEEVMETRRRTEEIQKICQFSTANG 1057

BLAST of Cla97C02G045300 vs. NCBI nr
Match: KAG6761213.1 (hypothetical protein POTOM_034414 [Populus tomentosa])

HSP 1 Score: 869.8 bits (2246), Expect = 2.1e-248
Identity = 461/975 (47.28%), Positives = 605/975 (62.05%), Query Frame = 0

Query: 14  HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
           H+ A+PYPGRGHIN +M LCK L  +  +I I+F+VT+EWL+ + +DPKP  I F+T PN
Sbjct: 13  HVVAMPYPGRGHINPMMELCKSLVRRKDDIRITFVVTEEWLSLIGSDPKPDQISFSTIPN 72

Query: 74  VIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFVTWAVQLGKRLNVP 133
           V+PSEL RA++   F+ ++ T+MEAP E  L  L  PP  I+AD F+ WAV++G R N+P
Sbjct: 73  VVPSELVRASNMLEFIEALMTNMEAPFERFLDHLVQPPAVIIADTFLLWAVRVGNRKNIP 132

Query: 134 VASLWPMSATVFSILYHFDLLKENGHFPADL--SERGEEIVDYFPGVSKICLADLPSFFS 193
           VAS WPMS  VF + ++ DLL+ENG F  DL   ERG E  DY PGVS   L D PSF +
Sbjct: 133 VASFWPMSVNVFLMFHYLDLLRENGQFIVDLLGLERGHERADYIPGVSSTSLVDFPSFIN 192

Query: 194 GDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELE 253
           G     +   V     V K+Q+ +  S+YELE   IDA+KA F FP+YT+GPS PY +LE
Sbjct: 193 GSNPYMLGRIVEVFSWVPKAQYLLFPSIYELEPQAIDAIKAGFSFPVYTVGPSIPYSKLE 252

Query: 254 CSAPN---GGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVV 313
             +      G  DY RWLD Q   S+LYIS GS+LS SSAQ+DEI  G+  SGVR+LWV 
Sbjct: 253 DGSHTITAHGDVDYLRWLDDQPSKSILYISMGSFLSFSSAQMDEIAGGLHDSGVRYLWVA 312

Query: 314 RGDDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIF 373
           RG+  RLK+V  + G+VV WCDQL+VLCH SVGGFWTH GWNS  EGVFAGVP L +PI 
Sbjct: 313 RGETSRLKEVCGDKGLVVPWCDQLRVLCHPSVGGFWTHCGWNSVREGVFAGVPFLTYPIS 372

Query: 374 WDQFPNSKKIAEDWKVGVRF-KAVGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEF 433
            DQ PNSK I EDWKVG R  K    ++LVRR EI   V+ FM+ +S+EG+E+R RV  F
Sbjct: 373 ADQRPNSKLIVEDWKVGWRVEKEYRVENLVRREEIGGLVRDFMDLDSNEGKEMRRRVKGF 432

Query: 434 QEICRRAVAKGGSSDSNIDAFLKHISGEEIAAVA-------------------------- 493
           QEIC++A++K GSS++NI +F++ IS  E  ++                           
Sbjct: 433 QEICQQAISKDGSSETNIKSFIREISIGEFLSLCGWNSVQQGIYSDSMTLEISLCPLHHL 492

Query: 494 -------------------QASDNPKMDP--ISGPAA-----PRRIHLAALPYPGRGHIN 553
                              QA +    DP  +S   A         H+ A+PYPGRGH+N
Sbjct: 493 LRGGIVVIYASQITYSFLLQAWEFTAKDPCLVSDMDAVTVKPTYSCHVVAIPYPGRGHVN 552

Query: 554 ALMNLCKLLSLRNPNILISFIVTDKWLTFL--AADPKPQNIHFATFPNVIPSELRRANDF 613
            LMN C +L+ + P+ LI+F+VT++WL F+  +++  P N+ F + PNVIPSEL R  D 
Sbjct: 553 PLMNFCTILASKKPDTLITFVVTEEWLGFISSSSNSSPSNLQFRSIPNVIPSELVRNADP 612

Query: 614 LGFFRSIQTHMLPPVETLLRRLDPPL--TAISADSFLTWAVQLSKRLNVPVASLWPMSAT 673
           +GF  +  T M  P E LL     PL  T I  D+FL WA+ +  R N+PVAS +PMS+T
Sbjct: 613 IGFIEAAMTKMETPFEELLDSFHQPLRPTLIVTDAFLFWAIGVGNRRNIPVASFFPMSST 672

Query: 674 VFSILYHFDFLKENRHFPADLSERGEEIVDYISGVSKIRLADLPTFFSGVGLEVLGSTLE 733
           VFS+ YH D L ++ HFP DLSE+G EIVDYI GVS +RL DLP+F        L   L+
Sbjct: 673 VFSVFYHLDLLAQHGHFPVDLSEKGNEIVDYIPGVSPLRLLDLPSFIFASNQYTLHRILD 732

Query: 734 AARSVDKAQFLISTSVYELETSVIDVLKPKFPFPVYTIRPCTPYFEALNGCTN------- 793
               + KA++L+  S+YELE+ VI  LK K   PVYTI P  P  +  +  ++       
Sbjct: 733 LISWIPKARYLLLPSIYELESQVIKALKYKISIPVYTIGPAIPDLKLRDNSSSSSNNNEL 792

Query: 794 DYLRWLDSQAEGSVLYVSEGSYLSVSSSQMDEIVAGVKASGVRFLWVARG---------- 853
           + L+WLD Q E SVLYV+ GS+++VSS+QMDEI AG+  SGVRFLWV R           
Sbjct: 793 NILQWLDCQPESSVLYVTLGSHVAVSSAQMDEIAAGLCDSGVRFLWVVRDETSRLRQVCG 852

Query: 854 ---------DDARVLCHSAVGGFWTHGGRNSTLEGVFAGVPMLAWPILWDQFPNSKKIAE 900
                    D  +VLCHS+VGGFWTH G NS  EG+FAGVP L +PI  DQ  +SK I E
Sbjct: 853 DMGLVEPWCDQLKVLCHSSVGGFWTHCGWNSVKEGIFAGVPFLTFPIFADQLTHSKVIVE 912

BLAST of Cla97C02G045300 vs. ExPASy Swiss-Prot
Match: O64733 (UDP-glycosyltransferase 87A2 OS=Arabidopsis thaliana OX=3702 GN=UGT87A2 PE=1 SV=1)

HSP 1 Score: 475.7 bits (1223), Expect = 1.2e-132
Identity = 232/443 (52.37%), Positives = 311/443 (70.20%), Query Frame = 0

Query: 14  HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
           H+ A+PYPGRGHIN +MNLCK L  + PN+ ++F+VT+EWL F+  DPKP  I F+T PN
Sbjct: 13  HVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGPDPKPDRIHFSTLPN 72

Query: 74  VIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLD-PPPTAILADAFVTWAVQLGKRLNV 133
           +IPSEL RA DF GF+ +++T +E P E LL  L+ PPP+ I AD +V WAV++G++ N+
Sbjct: 73  LIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADTYVIWAVRVGRKRNI 132

Query: 134 PVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFSG 193
           PV SLW MSAT+ S   H DLL  +GH   + SE  EE+VDY PG+S   L DLP  F G
Sbjct: 133 PVVSLWTMSATILSFFLHSDLLISHGHALFEPSE--EEVVDYVPGLSPTKLRDLPPIFDG 192

Query: 194 DGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELEC 253
              +  ++A      +  ++  + T+ YELE   IDA  +K   P+Y IGP  P+ EL  
Sbjct: 193 YSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPVYAIGPLIPFEELSV 252

Query: 254 SAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRGDD 313
              N    +Y +WL+ Q EGSVLYISQGS+LSVS AQ++EIV G++ SGVRFLWV RG +
Sbjct: 253 QNDN-KEPNYIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLRESGVRFLWVARGGE 312

Query: 314 GRLKD-VDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQ 373
            +LK+ ++   G+VV WCDQL+VLCH++VGGFWTH G+NSTLEG+++GVPMLA+P+FWDQ
Sbjct: 313 LKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIYSGVPMLAFPLFWDQ 372

Query: 374 FPNSKKIAEDWKVGVRFKAVGGKD-LVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEI 433
             N+K I EDW+VG+R +     + L+ R EI E VKRFM+ ES+EG+E+R R  +  EI
Sbjct: 373 ILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEEGKEMRRRACDLSEI 432

Query: 434 CRRAVAKGGSSDSNIDAFLKHIS 454
            R AVAK GSS+ NID F++HI+
Sbjct: 433 SRGAVAKSGSSNVNIDEFVRHIT 452

BLAST of Cla97C02G045300 vs. ExPASy Swiss-Prot
Match: O64732 (UDP-glycosyltransferase 87A1 OS=Arabidopsis thaliana OX=3702 GN=UGT87A1 PE=2 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 4.4e-132
Identity = 229/439 (52.16%), Positives = 309/439 (70.39%), Query Frame = 0

Query: 18  LPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPNVIPS 77
           +P+PGRGHIN ++NLCK L  ++PN+ ++F+VT+EWL F+ +DPKP  I FAT PN+IPS
Sbjct: 1   MPWPGRGHINPMLNLCKSLVRRDPNLTVTFVVTEEWLGFIGSDPKPNRIHFATLPNIIPS 60

Query: 78  ELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFVTWAVQLGKRLNVPVASL 137
           EL RANDF  F+ ++ T +E P E LL RL+ PPTAI+AD ++ WAV++G + N+PVAS 
Sbjct: 61  ELVRANDFIAFIDAVLTRLEEPFEQLLDRLNSPPTAIIADTYIIWAVRVGTKRNIPVASF 120

Query: 138 WPMSATVFSILYHFDLLKENGHFPADLSE-RGEEIVDYFPGVSKICLADLPSFFSGDGLQ 197
           W  SAT+ S+  + DLL  +GHFP + SE + +EIVDY PG+S   L+DL     G   Q
Sbjct: 121 WTTSATILSLFINSDLLASHGHFPIEPSESKLDEIVDYIPGLSPTRLSDL-QILHGYSHQ 180

Query: 198 SVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELECSAPN 257
                  S   + K+++ +  S YELE   ID   +KF FP+Y+ GP  P  EL     N
Sbjct: 181 VFNIFKKSFGELYKAKYLLFPSAYELEPKAIDFFTSKFDFPVYSTGPLIPLEELSVGNEN 240

Query: 258 GGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRGDDGRLK 317
               DYF+WLD Q E SVLYISQGS+LSVS AQ++EIV GV+ +GV+F WV RG + +LK
Sbjct: 241 REL-DYFKWLDEQPESSVLYISQGSFLSVSEAQMEEIVVGVREAGVKFFWVARGGELKLK 300

Query: 318 D-VDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNS 377
           + ++   G+VV WCDQL+VLCH ++GGFWTH G+NSTLEG+ +GVP+L +P+FWDQF N+
Sbjct: 301 EALEGSLGVVVSWCDQLRVLCHAAIGGFWTHCGYNSTLEGICSGVPLLTFPVFWDQFLNA 360

Query: 378 KKIAEDWKVGVRFKAVGGKD-LVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRA 437
           K I E+W+VG+  +     + L+   EI E VKRFM+ ES+EG+E+R R  +  EICR A
Sbjct: 361 KMIVEEWRVGMGIERKKQMELLIVSDEIKELVKRFMDGESEEGKEMRRRTCDLSEICRGA 420

Query: 438 VAKGGSSDSNIDAFLKHIS 454
           VAKGGSSD+NIDAF+K I+
Sbjct: 421 VAKGGSSDANIDAFIKDIT 437

BLAST of Cla97C02G045300 vs. ExPASy Swiss-Prot
Match: Q9SJL0 (UDP-glycosyltransferase 86A1 OS=Arabidopsis thaliana OX=3702 GN=UGT86A1 PE=2 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 6.7e-64
Identity = 154/478 (32.22%), Positives = 239/478 (50.00%), Query Frame = 0

Query: 11  RRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDE-------------WLTFL 70
           R+ H+  +PYP +GH+   ++L   + L +    I+F+ TD                 F 
Sbjct: 7   RKPHIMMIPYPLQGHVIPFVHLA--IKLASHGFTITFVNTDSIHHHISTAHQDDAGDIFS 66

Query: 71  AADPKPQ-NIRFATFPNVIPSELGRAND----FTGFLRSIHTHMEAPVETLLHRLDPPPT 130
           AA    Q +IR+ T  +  P +  R+ +    F G L     H++  +  L  R DPP T
Sbjct: 67  AARSSGQHDIRYTTVSDGFPLDFDRSLNHDQFFEGILHVFSAHVDDLIAKLSRRDDPPVT 126

Query: 131 AILADAFVTWAVQLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIV 190
            ++AD F  W+  +  + N+   S W   A V ++ YH DLL  NGHF +   +  ++++
Sbjct: 127 CLIADTFYVWSSMICDKHNLVNVSFWTEPALVLNLYYHMDLLISNGHFKS--LDNRKDVI 186

Query: 191 DYFPGVSKICLADLPSFFSGDGLQSVESAV------NSARSVDKSQFFISTSVYELESSV 250
           DY PGV  I   DL S+          + V       + + V ++ F +  +V ELE   
Sbjct: 187 DYVPGVKAIEPKDLMSYLQVSDKDVDTNTVVYRILFKAFKDVKRADFVVCNTVQELEPDS 246

Query: 251 IDALKAKFPFPIYTIGPSTPYFELECSAPNG--GTDDYFRWLDSQAEGSVLYISQGSYLS 310
           + AL+AK   P+Y IG   P F  +   P       D   WL  +  GSVLY+S GSY  
Sbjct: 247 LSALQAK--QPVYAIG---PVFSTDSVVPTSLWAESDCTEWLKGRPTGSVLYVSFGSYAH 306

Query: 311 VSSAQIDEIVAGVKASGVRFLWVVRGD----------DGRLKDVDRETGMVVGWCDQLKV 370
           V   +I EI  G+  SG+ F+WV+R D               D  ++ G+VV WC Q++V
Sbjct: 307 VGKKEIVEIAHGLLLSGISFIWVLRPDIVGSNVPDFLPAGFVDQAQDRGLVVQWCCQMEV 366

Query: 371 LCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRFKAVGGK 430
           + + +VGGF+TH GWNS LE V+ G+P+L +P+  DQF N K + +DW +G+    +  K
Sbjct: 367 ISNPAVGGFFTHCGWNSILESVWCGLPLLCYPLLTDQFTNRKLVVDDWCIGIN---LCEK 426

Query: 431 DLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHI 453
             + R +++  VKR MN E+    ELRN V + +   + AV   GSS++N + F+  +
Sbjct: 427 KTITRDQVSANVKRLMNGETSS--ELRNNVEKVKRHLKDAVTTVGSSETNFNLFVSEV 470

BLAST of Cla97C02G045300 vs. ExPASy Swiss-Prot
Match: Q9M9E7 (UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana OX=3702 GN=UGT85A4 PE=2 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 1.3e-56
Identity = 151/487 (31.01%), Positives = 244/487 (50.10%), Query Frame = 0

Query: 6   GSVAPRRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQ- 65
           G  + ++ H   +PYP +GHIN ++ L KLL  +     ++F+ TD     +     P  
Sbjct: 5   GGSSSQKPHAMCIPYPAQGHINPMLKLAKLLHAR--GFHVTFVNTDYNHRRILQSRGPHA 64

Query: 66  -----NIRFATFPNVIP-SELGRANDFTGFLRSIHTHMEAPVETLLHRLD-----PPPTA 125
                + RF T P+ +P +++    D    + S   +  AP + L+ RL+     PP + 
Sbjct: 65  LNGLPSFRFETIPDGLPWTDVDAKQDMLKLIDSTINNCLAPFKDLILRLNSGSDIPPVSC 124

Query: 126 ILADAFVTWAVQLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFP----ADLSERGE 185
           I++DA +++ +   + L +PV  LW  SAT   +  H+  L E    P    +DL +  E
Sbjct: 125 IISDASMSFTIDAAEELKIPVVLLWTNSATALILYLHYQKLIEKEIIPLKDSSDLKKHLE 184

Query: 186 EIVDYFPGVSKICLADLPSFFSGDGLQS--VESAVNSARSVDKSQFFISTSVYELESSVI 245
             +D+ P + KI L D P F +    Q   +   ++    + ++      +  +LE +V+
Sbjct: 185 TEIDWIPSMKKIKLKDFPDFVTTTNPQDPMISFILHVTGRIKRASAIFINTFEKLEHNVL 244

Query: 246 DALKAKFPFPIYTIGP-----------STPYFELECSAPNGGTDDYFRWLDSQAEGSVLY 305
            +L++  P  IY++GP           ++   +L  +     T+    WLD++AE +V+Y
Sbjct: 245 LSLRSLLP-QIYSVGPFQILENREIDKNSEIRKLGLNLWEEETES-LDWLDTKAEKAVIY 304

Query: 306 ISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVR-----GDDGRLK----DVDRETGMVV- 365
           ++ GS   ++S QI E   G+  SG  FLWVVR     GDD  L        +  GM++ 
Sbjct: 305 VNFGSLTVLTSEQILEFAWGLARSGKEFLWVVRSGMVDGDDSILPAEFLSETKNRGMLIK 364

Query: 366 GWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNSKKIAEDWKVGV 425
           GWC Q KVL H ++GGF TH GWNSTLE ++AGVPM+ WP F DQ  N K   EDW +G+
Sbjct: 365 GWCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPMICWPFFADQLTNRKFCCEDWGIGM 424

Query: 426 RFKAVGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRAVAKG-GSSDSNI 453
                 G++ V+R  +   VK  M+ E  +G+ LR +V E++ +   A A   GSS  N 
Sbjct: 425 EI----GEE-VKRERVETVVKELMDGE--KGKRLREKVVEWRRLAEEASAPPLGSSYVNF 480

BLAST of Cla97C02G045300 vs. ExPASy Swiss-Prot
Match: Q9ZUV0 (UDP-glycosyltransferase 86A2 OS=Arabidopsis thaliana OX=3702 GN=UGT86A2 PE=2 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 3.7e-54
Identity = 144/480 (30.00%), Positives = 223/480 (46.46%), Query Frame = 0

Query: 2   DPISGSVAPRRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVT----------- 61
           +P         +H   +PYP +GH+N  ++L   + L +  I ++F+ T           
Sbjct: 6   NPTKNHHGHHHLHALLIPYPFQGHVNPFVHLA--IKLASQGITVTFVNTHYIHHQITNGS 65

Query: 62  DEWLTFLAADPKPQNIRFATFPNVIPSELGRANDFTGFLRSIHTHMEAPVETLLHRL--- 121
           D  +          +IR+AT  + +P    R+ +   +  S+     A VE L+  L   
Sbjct: 66  DGDIFAGVRSESGLDIRYATVSDGLPVGFDRSLNHDTYQSSLLHVFYAHVEELVASLVGG 125

Query: 122 DPPPTAILADAFVTWAVQLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFPADLSER 181
           D     ++AD F  W   + ++  +   S W  +A VFS+ YH DLL+ +GHF A   E 
Sbjct: 126 DGGVNVMIADTFFVWPSVVARKFGLVCVSFWTEAALVFSLYYHMDLLRIHGHFGA--QET 185

Query: 182 GEEIVDYFPGVSKICLADLPSFFSGDGLQSVESAV--NSARSVDKSQFFISTSVYELESS 241
             +++DY PGV+ I   D  S+       SV   +   +   V K  F +  ++ + E  
Sbjct: 186 RSDLIDYIPGVAAINPKDTASYLQETDTSSVVHQIIFKAFEDVKKVDFVLCNTIQQFEDK 245

Query: 242 VIDALKAKFPFPIYTIGPSTPYFELECSAPNG--GTDDYFRWLDSQAEGSVLYISQGSYL 301
            I AL  K PF  Y IGP  P+     S         D  +WL+++ + SVLYIS GSY 
Sbjct: 246 TIKALNTKIPF--YAIGPIIPFNNQTGSVTTSLWSESDCTQWLNTKPKSSVLYISFGSYA 305

Query: 302 SVSSAQIDEIVAGVKASGVRFLWVVRGDDGRLKDVD----------RETGMVVGWCDQLK 361
            V+   + EI  G+  S V F+WVVR D     + +           + G+V+ WC Q+ 
Sbjct: 306 HVTKKDLVEIAHGILLSKVNFVWVVRPDIVSSDETNPLPEGFETEAGDRGIVIPWCCQMT 365

Query: 362 VLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRF---KA 421
           VL H SVGGF TH GWNS LE ++  VP+L +P+  DQ  N K + +DW++G+     K+
Sbjct: 366 VLSHESVGGFLTHCGWNSILETIWCEVPVLCFPLLTDQVTNRKLVVDDWEIGINLCEDKS 425

Query: 422 VGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRAVA-KGGSSDSNIDAFL 450
             G+D     E+   + R M   S E      ++   +     AV   G SS+ N+  F+
Sbjct: 426 DFGRD-----EVGRNINRLMCGVSKE------KIGRVKMSLEGAVRNSGSSSEMNLGLFI 468

BLAST of Cla97C02G045300 vs. ExPASy TrEMBL
Match: A0A7J6DV67 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_024484 PE=4 SV=1)

HSP 1 Score: 923.3 bits (2385), Expect = 7.7e-265
Identity = 474/950 (49.89%), Positives = 622/950 (65.47%), Query Frame = 0

Query: 8   VAPRRIHLAALPYPGRGHINALMNLCKLLSLK-NPNILISFIVTDEWLTFLAADPKPQNI 67
           +A    H+ A+PYPGR HINA+MNLCK LS +   +ILI+F+VT EW   L +DPKP NI
Sbjct: 1   MATMTCHVVAMPYPGRSHINAVMNLCKQLSARVKYDILITFVVTQEWFGLLRSDPKPDNI 60

Query: 68  RFATFPNVIPSELGRANDFTGFLRSIHTHMEAPVETLLHRL-DPPPTAILADAFVTWAVQ 127
           +FAT PNV+PSE  RA D +GFL ++ T +EAP E LL  L +PP   I+AD+ + W + 
Sbjct: 61  QFATIPNVVPSEHVRAEDLSGFLEAVSTKLEAPFEELLDGLHEPPVKVIIADSIMAWPIH 120

Query: 128 LGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFP--ADLSERGEEIVDYFPGVSKICL 187
           +G R N+PVAS WP+SA++FS+ YHFDLLK+ GH+P  A   E G++IVDY PG+S   L
Sbjct: 121 VGNRRNIPVASFWPLSASMFSVFYHFDLLKQRGHYPIQAPDVESGDKIVDYIPGISTTPL 180

Query: 188 ADLPS-FFSGDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIG 247
           +DL    F  +  +  E  + +   V K Q+ +STSVYELES V D LK KF FP+Y++G
Sbjct: 181 SDLSDRLFHSNNEKMAELIIEAVSKVTKVQYLLSTSVYELESQVFDVLKLKFSFPLYSMG 240

Query: 248 P--STPYFELECSAPN----GGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAG 307
           P   +P  +LE +  +        +Y +WLDSQ E SVLYIS GS+LSVS  Q+DEIVAG
Sbjct: 241 PISISPQIQLENTFDDTTNISTVVEYIQWLDSQPEASVLYISFGSFLSVSDTQLDEIVAG 300

Query: 308 VKASGVRFLWVVRGDDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGV 367
           ++  GVR +WV R +  ++KD   + G VV WCDQL+VLCH S+GGFWTH GWNSTLE +
Sbjct: 301 IRTGGVRHMWVARENVSKIKDGCGDVGFVVPWCDQLRVLCHPSIGGFWTHCGWNSTLEAI 360

Query: 368 FAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRFK-----AVGGKD---LVRRVEIAEFVKR 427
           FAGVPML +PI  DQ  NSK+I E+WK+G +          G D   LVRR EI+  V+R
Sbjct: 361 FAGVPMLTFPITADQHSNSKQIVEEWKIGCKVNDKKKIICAGTDQISLVRRDEISVLVER 420

Query: 428 FMNSESDEGRELRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHISGEEIAAVAQASDNPK 487
           FM+ +S   + ++NRV E Q+ C+ A  K  S                ++ V +A   P 
Sbjct: 421 FMDPDSIGMKVMKNRVKELQKSCQLAFRKTTS----------------LSPVPKA---PS 480

Query: 488 MDPISGPAAPRRIHLAALPYPGRGHINALMNLCKLLSLRNPNILISFIVTDKWLTFLAAD 547
           M  +         H+ A+PYPGRGHIN LMN+CK L  RN  +L++F++T++WL  L +D
Sbjct: 481 MGTLQTVEPTTDCHVVAMPYPGRGHINPLMNICKELVSRNARLLVTFVITEEWLGLLGSD 540

Query: 548 PKPQNIHFATFPNVIPSELRRANDFLGFFRSIQTHMLPPVETLLRRLDPPLTAISADSFL 607
           PKP  + F T PNVIPSE  RA +F GFF+S+  ++  P E LL RLD P+  I AD++L
Sbjct: 541 PKPDRVRFRTVPNVIPSEHGRAKNFSGFFQSVTNNLKAPFEELLDRLDTPVNVIIADTYL 600

Query: 608 TWAVQLSKRLNVPVASLWPMSATVFSILYHFDFLKENRHFPADLSERGEEIVDYISGVSK 667
            W   +  R N+PVASLWPMSA+VFS+  HFD +++N HFP DLS RG EIVDYI G+  
Sbjct: 601 IWMTDVGNRRNIPVASLWPMSASVFSVFRHFDLVEQNGHFPIDLSVRGHEIVDYIPGIPT 660

Query: 668 IRLADLPTFFSGVGLEVLGSTLEAARSVDKAQFLISTSVYELETSVIDVLKPKFPFPVYT 727
           IR+ DLPT F G G +VL    EA   V KAQFL+STSVYELE+ V D LK KFPFPVY 
Sbjct: 661 IRVEDLPTIFEGEGRKVLKWAKEATSKVSKAQFLLSTSVYELESQVFDALKAKFPFPVYP 720

Query: 728 IRPCTPYFEALNGCTN-----DYLRWLDSQAEGSVLYVSEGSYLSVSSSQMDEIVAGVKA 787
           + P  P+ + L  C+N     DY +WLDSQ +GSVLY+S GS+LSVS++Q+DE+VAG++ 
Sbjct: 721 LGPSIPHSQ-LQTCSNYSDTADYFKWLDSQPQGSVLYISLGSFLSVSAAQLDELVAGIRG 780

Query: 788 SGVRFLWVAR-----------GDD------------ARVLCHSAVGGFWTHGGRNSTLEG 847
           SG R+LWVAR           GDD             RVLCH+++GGFWTH G NSTLE 
Sbjct: 781 SGTRYLWVARDNVSKIKEYGTGDDDELGFVVPWCDQLRVLCHASIGGFWTHCGWNSTLEA 840

Query: 848 VFAGVPMLAWPILWDQFPNSKKIAEDWKVGVRFKAVGG------------RDLVRREEIT 899
           +++GVPML  PI WDQ P+SK+I EDWK+G      GG            + LV+RE+I 
Sbjct: 841 IYSGVPMLTCPIFWDQVPDSKQIVEDWKIGYNVIKKGGTTMSSRTTDDDDQGLVKREKIA 900

BLAST of Cla97C02G045300 vs. ExPASy TrEMBL
Match: A0A498INJ2 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_041694 PE=4 SV=1)

HSP 1 Score: 901.0 bits (2327), Expect = 4.1e-258
Identity = 474/1057 (44.84%), Positives = 624/1057 (59.04%), Query Frame = 0

Query: 14   HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
            H+ ALPYPGRGHIN +MN CKLLS K P+ILI+F++T+EW  F+ +D KP NIRF+T PN
Sbjct: 138  HVMALPYPGRGHINPMMNFCKLLSSKKPDILITFVITEEWQGFIGSDAKPDNIRFSTLPN 197

Query: 74   VIPSELGRANDFTGFLRSIHTHMEAPVETLL--HRLDPPPTAILADAFVTWAVQLGKRLN 133
            VIPSEL RA +F GF+ +++T +E P + LL    L  P   I+AD F+ WAV++G   N
Sbjct: 198  VIPSELVRATNFPGFVEAVNTELEGPFDQLLLERDLQQPVNVIVADPFLVWAVRVGNGRN 257

Query: 134  VPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFS 193
            +PVAS WPMSA+VF++ +HF+LLK+NGHFP D+ ERG+E++DY PG+S   +ADLP+   
Sbjct: 258  IPVASFWPMSASVFTVFHHFELLKQNGHFPVDVLERGDEVIDYIPGISTTRIADLPTILY 317

Query: 194  GDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFEL- 253
            G+  Q +  A+ +  S+ K+Q+ +STS+YELES V D LKAK P P+Y IGP+ PYF+L 
Sbjct: 318  GNDRQLLHRAMETISSMYKAQYILSTSIYELESQVFDNLKAKLPIPVYPIGPTIPYFQLS 377

Query: 254  ECSAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRG 313
            E S+       Y  WLDSQ + SVLYIS GS+LSVS  Q+DE+V GV+ SGVRFLWV RG
Sbjct: 378  ESSSILHDGLSYLHWLDSQPKASVLYISMGSFLSVSQTQMDELVFGVRDSGVRFLWVARG 437

Query: 314  DDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWD 373
            D  RLK+   + G+VV WCDQL+V CH S+GGFW+H GW+ST+E V+AG+P+L  PIFWD
Sbjct: 438  DASRLKESVGDVGLVVPWCDQLRVFCHDSIGGFWSHCGWSSTIEAVYAGLPVLTCPIFWD 497

Query: 374  QFPNSKKIAEDWKVGVRFKA-VGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQE 433
            Q PNS++I +DWK+G R K   G + LV R EIA+ V+RFM+ ES+EG+++R R  + Q+
Sbjct: 498  QVPNSRQIVDDWKIGYRVKKNEGAEHLVTREEIAQLVRRFMDLESNEGKKMRKRAKQLQK 557

Query: 434  ICRRAVAKGG-------------------------------------------------- 493
             C+ A+AK                                                    
Sbjct: 558  TCQEAIAKASIFSLQSSREKWAQRKLRQSVCHVVALPYQGRGHINPMMNLCKLLSSKNPL 617

Query: 494  ---------------SSDSNID----------------------AFLKHISGEEIAAVAQ 553
                            SD  +D                      AFL+ +  +  A V Q
Sbjct: 618  LLITFVVTEEWHGFIESDRKLDNIRLVTIPNVIPSENGRAKDFAAFLEAVWTKMEAPVEQ 677

Query: 554  ASD----------------------NPKMDPISGPAAPRR-------------------- 613
              D                      N +  P++    P                      
Sbjct: 678  LLDGLEPPVTAIVADTFLVWALRVGNRRNIPVASLWTPSPTLFSMLHHFELFKENGHFPL 737

Query: 614  ------------------------IHLAALPYPGRGHINALMNLCKLLSLRNPNILISFI 673
                                     HL ALPYPGRGHIN +MNLCK LS +NP + I+F+
Sbjct: 738  DVSGALQTSYKREMGTMKVEPITVCHLVALPYPGRGHINPMMNLCKQLSSKNPQLFITFV 797

Query: 674  VTDKWLTFLAADPKPQNIHFATFPNVIPSELRRANDFLGFFRSIQTHMLPPVETLLRRLD 733
            VT++W  F+ ++PKP+NI  AT PNVIPSE  RA DF  F  ++ T +  PVE L+  L+
Sbjct: 798  VTEEWRGFIESNPKPENIRLATIPNVIPSEHGRAKDFAAFVEAVWTKLEAPVEQLMDGLE 857

Query: 734  PPLTAISADSFLTWAVQLSKRLNVPVASLWPMSATVFSILYHFDFLKENRHFPADLSERG 793
             P+TAI AD+FL WA+++  R N+PVASLW  S T+FS+L+HF+  KEN HF  D+SERG
Sbjct: 858  QPVTAIVADTFLVWALRIGNRRNIPVASLWTQSPTMFSVLHHFELFKENGHFALDVSERG 917

Query: 794  EEIVDYISGVSKIRLADLPTFFSGVGLEVLGSTLEAARSVDKAQFLISTSVYELETSVID 853
            +EIV+YI GVS   +ADLP  F     +VL   +E     +KA++L+ TSVYEL+  V +
Sbjct: 918  DEIVEYIPGVSTTCIADLPAIFFTDDPKVLHKAIEVISEAEKAKYLLFTSVYELDPQVFE 977

Query: 854  VLKPKFPFPVYTIRPCTPYFE-----ALNGCTNDYLRWLDSQAEGSVLYVSEGSYLSVSS 889
             LK KF FP+Y I P  P+FE       N    DYL WLDSQ + SVLY+S GS+LSVS 
Sbjct: 978  ALKAKFAFPIYPIGPSIPHFELSKTLPTNQNDIDYLHWLDSQPKKSVLYISMGSFLSVSK 1037

BLAST of Cla97C02G045300 vs. ExPASy TrEMBL
Match: A0A445AMU1 (UDPGT domain-containing protein OS=Arachis hypogaea OX=3818 GN=Ahy_B01g051711 PE=3 SV=1)

HSP 1 Score: 865.9 bits (2236), Expect = 1.5e-247
Identity = 438/946 (46.30%), Positives = 613/946 (64.80%), Query Frame = 0

Query: 14  HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
           H+ A+PYPGRGH+N +MNLC +L+ + P+I++SF+VT+EW  F+  + KP N+RFAT PN
Sbjct: 13  HVVAMPYPGRGHVNPMMNLCSMLATRKPDIIVSFVVTEEWHGFIKNEAKPDNVRFATIPN 72

Query: 74  VIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFVTWAVQLGKRLNVP 133
           VIPSEL RA DF  F+ ++ T ME P E LL RLDPP TAI+AD  + W++ +  R N+P
Sbjct: 73  VIPSELDRAKDFPAFMNAVSTKMEGPFEDLLDRLDPPVTAIIADTKLVWSIGVANRKNIP 132

Query: 134 VASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFSGD 193
           VASLWPMSATV+S+LYHFD+LKENGHFP  +SE G+E+VDY PG+S   + DLP+   G+
Sbjct: 133 VASLWPMSATVYSLLYHFDMLKENGHFPIQISEIGDEVVDYIPGMSPTQIRDLPTVLHGE 192

Query: 194 GLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELE-- 253
            L+ +E A+N+   V K+Q  + T+ YELE   +DAL++K+  P+Y +GPS P+F+L+  
Sbjct: 193 DLRLLERALNTVNLVSKAQCLMFTTAYELEPQAVDALRSKYKIPVYPVGPSVPFFKLKPK 252

Query: 254 ---CSAPNGGT---------------------DDYFRWLDSQAEGSVLYISQGSYLSVSS 313
               +  NG +                      +YF+WLD Q  GSVLYISQGS+LSVSS
Sbjct: 253 NTTLAQSNGDSFASNANLISNGYAHNHEEEQAPEYFKWLDCQPHGSVLYISQGSFLSVSS 312

Query: 314 AQIDEIVAGVKASGVRFLWVVRGDDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHG 373
           AQ+DEIVAG++ SGV +LWV RG+  +L     E G+VV W +QLKVLCH ++GGFW+H 
Sbjct: 313 AQMDEIVAGIRDSGVSYLWVARGETTKLNGCLGEKGIVVPWVEQLKVLCHPAIGGFWSHC 372

Query: 374 GWNSTLEGVFAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRF-KAVGGKDLVRRVEIAEFV 433
           GWNS+LE  F+GVP+L +PIFWDQ PNSK+  EDW+ G R  K +G ++ V R EI + V
Sbjct: 373 GWNSSLEAAFSGVPVLTYPIFWDQVPNSKRFVEDWRAGWRVRKKIGKENFVSREEICDLV 432

Query: 434 KRFMNSESDEGRELRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHISGEEIAAVAQASDN 493
           +RFM+ E++E +E+R R  E +E C++A++ GGS+++N+D+F+ ++S  +    AQ  +N
Sbjct: 433 RRFMDGENNEIKEMRKRALELREACQKAISPGGSTETNLDSFIDYVSQIQ----AQEKNN 492

Query: 494 PKMDPISGPAAPRRIHLAALPYPGRGHINALMNLCKLLSLRNPN------ILISFIVTDK 553
            K +  +  +A    H+ A+PYPGRGH+N +MNLCK L + + N      ILI+F+VT +
Sbjct: 493 FKKEN-NFDSATMVYHIVAVPYPGRGHVNPMMNLCKFLLISSKNHHHNHEILITFVVTQE 552

Query: 554 WLTFLAADPK---PQNIHFATFPNVIPSELRRANDFLGFFRSIQTHMLPPVETLLRRLDP 613
           W T +  +       +I   + PNV+PSE  R NDF GF+ ++   M  P E ++  +D 
Sbjct: 553 WQTLINKNTNHAHADSIRICSIPNVLPSEEVRGNDFPGFYEAVMRKMEAPFEAVIDEIDS 612

Query: 614 PLTA--ISADSFLTWAVQLSKRLNVPVASLWPMSATVFSILYHFDFLKENRHFPADLSER 673
            +    I AD+ L WA  ++ R  +P+A LW  SA+VFS+  H     EN  F     E 
Sbjct: 613 NVNVDLIIADTELLWAHAVATRRLLPLALLWTASASVFSMFLHHQLFLENHSF-----EN 672

Query: 674 GEEIVDYISGVSKIRLADLPTFFSGVGLEVLGSTLEAARSVDKAQFLISTSVYELETSVI 733
           GE+ VDYI GVS +R++DLP+ F     +VL   L+    V  A++L+  S+YE+E + I
Sbjct: 673 GEKRVDYIPGVSPLRISDLPSIFHDKNGKVLQLYLQCISKVQYAKYLLFNSIYEIEPNAI 732

Query: 734 DVLKPKFPFPVYTIRPCTPYFEALNGCTNDYLRWLDSQAEGSVLYVSEGSYLSVSSSQMD 793
           D LK K+ FP+YTI P  P++EA N    +Y++WLDSQ +GSVLY+S GSYLS+S  QM+
Sbjct: 733 DTLKSKYSFPLYTIGPLIPFYEATNNEHINYMQWLDSQPKGSVLYISLGSYLSISKEQME 792

Query: 794 EIVAGVKASGVRFLWVARG-------------------DDARVLCHSAVGGFWTHGGRNS 853
           E+V G+  SGVRFL V RG                   D  RVL H ++GGF +H G NS
Sbjct: 793 ELVVGLCDSGVRFLMVYRGPNKTLLSQNSNSGLFVPWCDQLRVLSHKSIGGFLSHCGWNS 852

Query: 854 TLEGVFAGVPMLAWPILWDQFPNSKKIAEDWKVGVRFKAVGG-----RDLVRREEITEFV 898
            LE +F GVP+L +PI  DQ PNSK+I ED KVGV+ K   G      D+V R+EI   V
Sbjct: 853 VLEAMFCGVPVLTFPISIDQVPNSKQIVEDLKVGVKMKERLGTNNKENDVVMRKEIARIV 912

BLAST of Cla97C02G045300 vs. ExPASy TrEMBL
Match: A0A6N2M6F7 (CCT domain-containing protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS282673 PE=4 SV=1)

HSP 1 Score: 849.0 bits (2192), Expect = 1.9e-242
Identity = 454/926 (49.03%), Positives = 580/926 (62.63%), Query Frame = 0

Query: 6    GSVAPRRI-HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQ 65
            G++ P  I H+ ALPYPGRGHIN +MNLC+ L+ K P++LI+F+VT+EWL  + ++PKP 
Sbjct: 277  GNLEPTTICHVMALPYPGRGHINPMMNLCRSLASKKPDVLITFVVTEEWLGLIGSEPKPG 336

Query: 66   NIRFATFPNVIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFVTWAV 125
            +IRF T PNV+PSE  RA     F + + T MEAPVE LL +L PP + I+AD ++ WA 
Sbjct: 337  SIRFGTIPNVLPSERARAGKLPAFFQEVLTKMEAPVERLLGQLKPPVSTIVADTYLMWAF 396

Query: 126  QLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLA 185
            ++  R N+P ASLW MS  V+S+  HFDLL +NGHFP +L ERGEE+V+Y PG+S   + 
Sbjct: 397  EMANRNNIPAASLWTMSVPVYSVFQHFDLLVQNGHFPIELKERGEELVEYIPGISSTRIV 456

Query: 186  DLPSFFSGDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPS 245
            DLP+   G G   +               F + ++YELES VIDALKA    PIY IGP+
Sbjct: 457  DLPTCVYGHGRDILHRG------------FEAIAIYELESQVIDALKASISLPIYHIGPT 516

Query: 246  TPYFELECS--APNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGV 305
             PYF+LE        G  DYFRWLD Q  GSVLY+SQGS  S  SAQ+DEI AG++ SGV
Sbjct: 517  IPYFKLEQEEIITGPGETDYFRWLDMQPRGSVLYVSQGSTHSAPSAQLDEIAAGLRDSGV 576

Query: 306  RFLWVVRGDDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPM 365
            RFL    G  GR   V R    ++ W                          G+ + + +
Sbjct: 577  RFL---VGGKGRGLFVQR----ILWW-------------------------RGLGSAMKL 636

Query: 366  LAWPIFWDQFPNSKKIAEDWKVGVRFKAVGGKD---LVRRVEIAEFVKRFMNSESDEGRE 425
            L +    +  PNSK I +DWK+G  ++   G D   L++R EIA  VKRFM+SESDE +E
Sbjct: 637  LLYHAITES-PNSKIIVDDWKIG--WRVTSGLDVGRLIKRDEIAGLVKRFMDSESDEVKE 696

Query: 426  LRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHISGEEIAAVAQASDNPKMDPISGPAAPR 485
            +R R  E  EICRRA+ K                        +   NP+M  +    A  
Sbjct: 697  MRRRAREISEICRRAIRK------------------------EKKQNPRMGTLRTEPA-T 756

Query: 486  RIHLAALPYPGRGHINALMNLCKLLSLRNPNILISFIVTDKWLTFLAAD-PKPQNIHFAT 545
             +H+ A+PYPGRGH+N +MNLCKL+S R P+IL +F+VT++W  F+ +D  KP NIHFAT
Sbjct: 757  TLHVVAMPYPGRGHVNPMMNLCKLMSSRKPDILFTFVVTEEWYDFIHSDTKKPDNIHFAT 816

Query: 546  FPNVIPSELRRANDFLGFFRSIQTHMLPPVETLLRRLDPPLTAISADSFLTWAVQLSKRL 605
             PN IPSE+ RA DF GF +++ T M  P E LL RL+ P+  I AD++L W V +  R 
Sbjct: 817  IPNCIPSEVGRAKDFPGFLKAVATKMEAPFEQLLDRLELPVGVIIADTYLDWVVHVGNRR 876

Query: 606  NVPVASLWPMSATVFSILYHFDFLKENRHFPADLSERGEEIVDYISGVSKIRLADLPTFF 665
            N+PVASLW MSA VFS+L HF+ L++N HFP +LSERGEE VDYI G+   RL D PT F
Sbjct: 877  NIPVASLWTMSAYVFSLLRHFELLEQNGHFPVELSERGEERVDYIPGIPPTRLVDFPTLF 936

Query: 666  SGVGLEVLGSTLEAARSVDKAQFLISTSVYELETSVIDVLKPKFPFPVYTIRPCTPYFE- 725
             G G ++L   LE    V KAQ+L+ TS Y LE  VI  LKPKFPFPVY I P  PYFE 
Sbjct: 937  HGTGRQILPRALEPVSLVSKAQYLLFTSFYGLEAQVISALKPKFPFPVYPIGPSIPYFEI 996

Query: 726  ----ALNGCTNDYLRWLDSQAEGSVLYVSEGSYLSVSSSQMDEIVAGVKASGVRFLWVAR 785
                +++G  + Y+ WLDSQ +GSVLYVS GS+LSVSSSQ+DEIVAGV  SGVRFLWV R
Sbjct: 997  EDHSSVSGNVSGYIEWLDSQPKGSVLYVSMGSFLSVSSSQLDEIVAGVHDSGVRFLWVCR 1056

Query: 786  G-------------------DDARVLCHSAVGGFWTHGGRNSTLEGVFAGVPMLAWPILW 845
            G                      RVLCH AVGGFWTH G NSTLE VFAGVPMLA PI W
Sbjct: 1057 GKTTLFKDGCGDMGLVVPWCHQLRVLCHPAVGGFWTHCGWNSTLEAVFAGVPMLASPIFW 1116

Query: 846  DQFPNSKKIAEDWKVGVRFKAVGGRD-LVRREEITEFVKRFMNSESVEGREMRNRVSDLQ 900
            DQ P+SK I EDW++G R K     + LV REEI++ VK FM++E++E + MR R  +LQ
Sbjct: 1117 DQIPDSKMIVEDWQIGWRVKRDERSEILVTREEISKLVKSFMDAENIEVKAMRKRAKELQ 1130

BLAST of Cla97C02G045300 vs. ExPASy TrEMBL
Match: A0A6J0ZY01 (LOW QUALITY PROTEIN: uncharacterized protein LOC110413059 OS=Herrania umbratica OX=108875 GN=LOC110413059 PE=3 SV=1)

HSP 1 Score: 835.5 bits (2157), Expect = 2.1e-238
Identity = 452/958 (47.18%), Positives = 600/958 (62.63%), Query Frame = 0

Query: 1   MDPISGSVAPRRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAAD 60
           M PIS +      H+ A+PYPGRGHIN +MNLCKL++ +  ++ I+F+VT EWL F+ + 
Sbjct: 6   MKPISST------HVVAIPYPGRGHINPMMNLCKLIASRKHDLEITFVVTKEWLGFIGSC 65

Query: 61  PKPQNIRFATFPNVIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFV 120
            KP N+ FA+ PNV+PSEL R  DF GF  ++ T MEAP E LL  L  P TAI+AD  +
Sbjct: 66  AKPDNLHFASIPNVLPSELVRGADFPGFYEAVMTTMEAPFEELLDNLKLPVTAIIADTEL 125

Query: 121 TWAVQLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSK 180
            WA+++G R N PVASL   SATVFSIL   D L +N HF  DL ++  E+V++ PG+S 
Sbjct: 126 QWAIRVGNRRNFPVASLCTTSATVFSILQSID-LAQNCHFLVDLLDKSSELVEHSPGISP 185

Query: 181 ICLADLPSFFSGDGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYT 240
             LADL     G+  + +E  +     V K+++ + TSVYELE  V+DALK+KF  PIY 
Sbjct: 186 GHLADLQVLLEGNAPRVIELTLECISWVPKAKYLLFTSVYELERHVMDALKSKFNIPIYP 245

Query: 241 IGPSTPYFEL-----ECSAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVA 300
           +GP+ PYF+L     E + PN    +Y +WLDSQ   SVLY+S GS+LSVS+ Q+DEI A
Sbjct: 246 VGPAIPYFDLHENSSESTFPN--VPNYMQWLDSQPPCSVLYVSLGSFLSVSNEQMDEIAA 305

Query: 301 GVKASGVRFLWVVRGDDGRLKDVDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEG 360
           G++ SGV ++WV RG+  RL+D     G+VV WCDQLKVLCH SVGGF TH GWNSTLE 
Sbjct: 306 GLQDSGVPYVWVARGETSRLRDSCDGVGLVVPWCDQLKVLCHSSVGGFLTHCGWNSTLEA 365

Query: 361 VFAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRFKAV-GGKDLVRRVEIAEFVKRFMNSES 420
           +FAG+PML +PI +DQ PNSK+I +DWK+G R K     + LV R  IAE V+ FM+ E+
Sbjct: 366 IFAGIPMLTFPIIFDQAPNSKQIVDDWKIGWRVKEQHRDESLVTRARIAELVRSFMDPEN 425

Query: 421 DEGRELRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHISGEEIAA-----VAQASDNPK- 480
           +E + +R    E +E CR+++AKGGSS  N+DAF+ HIS    A+     V    D P+ 
Sbjct: 426 NEVKNMRRSAGELKEKCRKSIAKGGSSQMNLDAFINHISQATFASYAIYVVQDFLDEPRS 485

Query: 481 -----------MDPISGP---------------AAPRRI-HLAALPYPGRGHINALMNLC 540
                        P S                 A P  + H+ ALP+PGRGHIN +MNLC
Sbjct: 486 LTNIFILCESHFIPFSSELIKSFGQVSNMESTNAQPTTVCHVVALPFPGRGHINPMMNLC 545

Query: 541 KLLSLRNPNILISFIVTDKWLTFLAADPKPQNIHFATFPNVIPSELRRANDFLGFFRSIQ 600
           KLL  +  +ILI+F+VT++WL  + +DPKP NI F   PNVI  E  RA +F GF+ ++ 
Sbjct: 546 KLLVSKRQDILITFVVTEEWLGCIGSDPKPDNIRFEAIPNVITPERLRAANFPGFYEAMM 605

Query: 601 THMLPPVETLLRRLDPPLTAISADSFLTWAVQLSKRLNVPVASLWPMSATVFSILYHFDF 660
           T M  P E LL RL+ P+T I  D  + W   +  R N+PVA +W MSA+VFS+ +HFD 
Sbjct: 606 TKMEAPFEQLLDRLELPVTVIIGDIEVRWGSCVGNRRNIPVALVWTMSASVFSMFHHFDL 665

Query: 661 LKENRHFPADLSERGEEIVDYISGVSKIRLADLPTFFSGVGLEVLGSTLEAARSVDKAQF 720
             ++ H   +L+E+    VD I G+S   +A+L T F      VL   L+    V KAQ+
Sbjct: 666 HIKHSHAKVNLTEQ----VDNIPGISSYDVAELRTIFYRDNETVLELALDCISGVXKAQY 725

Query: 721 LISTSVYELETSVIDVLKPKFPFPVYTIRPCTPYFEALNGC--TNDYLRWLDSQAEGSVL 780
           L+ TSVYE E  V+D L   F FPVY I P  PY E  +G   T+ YL+WLDSQ   SVL
Sbjct: 726 LLFTSVYEFEPQVLDSLSATFSFPVYPIGPAIPYLELKDGSCKTSSYLQWLDSQQVASVL 785

Query: 781 YVSEGSYLSVSSSQMDEIVAGVKASGVRFLWVARG-------------------DDARVL 840
           Y+S GS+LSVS++QM EI+AG++   VR+LWVAR                    D  +VL
Sbjct: 786 YISLGSFLSVSNTQMHEIIAGLQICDVRYLWVAREEPSRLQDRCGDMGLVIPWCDQLKVL 845

Query: 841 CHSAVGGFWTHGGRNSTLEGVFAGVPMLAWPILWDQFPNSKKIAEDWKVGVRFKA-VGGR 898
           CH ++GGFWTH G NS LE  FAGVPML +P+  DQ  NS++IAEDWK G R K+ V   
Sbjct: 846 CHPSIGGFWTHCGWNSILEAAFAGVPMLTFPLFLDQDTNSRQIAEDWKNGWRVKSTVRAE 905

BLAST of Cla97C02G045300 vs. TAIR 10
Match: AT2G30140.2 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 476.5 bits (1225), Expect = 4.8e-134
Identity = 232/443 (52.37%), Positives = 310/443 (69.98%), Query Frame = 0

Query: 14  HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
           H+ A+PYPGRGHIN +MNLCK L  + PN+ ++F+VT+EWL F+  DPKP  I F+T PN
Sbjct: 13  HVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGPDPKPDRIHFSTLPN 72

Query: 74  VIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLD-PPPTAILADAFVTWAVQLGKRLNV 133
           +IPSEL RA DF GF+ +++T +E P E LL  L+ PPP+ I AD +V WAV++G++ N+
Sbjct: 73  LIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADTYVIWAVRVGRKRNI 132

Query: 134 PVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFSG 193
           PV SLW MSAT+ S   H DLL  +GH    L E  EE+VDY PG+S   L DLP  F G
Sbjct: 133 PVVSLWTMSATILSFFLHSDLLISHGH---ALFEPSEEVVDYVPGLSPTKLRDLPPIFDG 192

Query: 194 DGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELEC 253
              +  ++A      +  ++  + T+ YELE   IDA  +K   P+Y IGP  P+ EL  
Sbjct: 193 YSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPVYAIGPLIPFEELSV 252

Query: 254 SAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRGDD 313
              N    +Y +WL+ Q EGSVLYISQGS+LSVS AQ++EIV G++ SGVRFLWV RG +
Sbjct: 253 QNDN-KEPNYIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLRESGVRFLWVARGGE 312

Query: 314 GRLKD-VDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQ 373
            +LK+ ++   G+VV WCDQL+VLCH++VGGFWTH G+NSTLEG+++GVPMLA+P+FWDQ
Sbjct: 313 LKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIYSGVPMLAFPLFWDQ 372

Query: 374 FPNSKKIAEDWKVGVRFKAVGGKD-LVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEI 433
             N+K I EDW+VG+R +     + L+ R EI E VKRFM+ ES+EG+E+R R  +  EI
Sbjct: 373 ILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEEGKEMRRRACDLSEI 432

Query: 434 CRRAVAKGGSSDSNIDAFLKHIS 454
            R AVAK GSS+ NID F++HI+
Sbjct: 433 SRGAVAKSGSSNVNIDEFVRHIT 451

BLAST of Cla97C02G045300 vs. TAIR 10
Match: AT2G30140.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 475.7 bits (1223), Expect = 8.2e-134
Identity = 232/443 (52.37%), Positives = 311/443 (70.20%), Query Frame = 0

Query: 14  HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPN 73
           H+ A+PYPGRGHIN +MNLCK L  + PN+ ++F+VT+EWL F+  DPKP  I F+T PN
Sbjct: 13  HVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGPDPKPDRIHFSTLPN 72

Query: 74  VIPSELGRANDFTGFLRSIHTHMEAPVETLLHRLD-PPPTAILADAFVTWAVQLGKRLNV 133
           +IPSEL RA DF GF+ +++T +E P E LL  L+ PPP+ I AD +V WAV++G++ N+
Sbjct: 73  LIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADTYVIWAVRVGRKRNI 132

Query: 134 PVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKICLADLPSFFSG 193
           PV SLW MSAT+ S   H DLL  +GH   + SE  EE+VDY PG+S   L DLP  F G
Sbjct: 133 PVVSLWTMSATILSFFLHSDLLISHGHALFEPSE--EEVVDYVPGLSPTKLRDLPPIFDG 192

Query: 194 DGLQSVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELEC 253
              +  ++A      +  ++  + T+ YELE   IDA  +K   P+Y IGP  P+ EL  
Sbjct: 193 YSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPVYAIGPLIPFEELSV 252

Query: 254 SAPNGGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRGDD 313
              N    +Y +WL+ Q EGSVLYISQGS+LSVS AQ++EIV G++ SGVRFLWV RG +
Sbjct: 253 QNDN-KEPNYIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLRESGVRFLWVARGGE 312

Query: 314 GRLKD-VDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQ 373
            +LK+ ++   G+VV WCDQL+VLCH++VGGFWTH G+NSTLEG+++GVPMLA+P+FWDQ
Sbjct: 313 LKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIYSGVPMLAFPLFWDQ 372

Query: 374 FPNSKKIAEDWKVGVRFKAVGGKD-LVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEI 433
             N+K I EDW+VG+R +     + L+ R EI E VKRFM+ ES+EG+E+R R  +  EI
Sbjct: 373 ILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEEGKEMRRRACDLSEI 432

Query: 434 CRRAVAKGGSSDSNIDAFLKHIS 454
            R AVAK GSS+ NID F++HI+
Sbjct: 433 SRGAVAKSGSSNVNIDEFVRHIT 452

BLAST of Cla97C02G045300 vs. TAIR 10
Match: AT2G30150.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 473.8 bits (1218), Expect = 3.1e-133
Identity = 229/439 (52.16%), Positives = 309/439 (70.39%), Query Frame = 0

Query: 18  LPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQNIRFATFPNVIPS 77
           +P+PGRGHIN ++NLCK L  ++PN+ ++F+VT+EWL F+ +DPKP  I FAT PN+IPS
Sbjct: 1   MPWPGRGHINPMLNLCKSLVRRDPNLTVTFVVTEEWLGFIGSDPKPNRIHFATLPNIIPS 60

Query: 78  ELGRANDFTGFLRSIHTHMEAPVETLLHRLDPPPTAILADAFVTWAVQLGKRLNVPVASL 137
           EL RANDF  F+ ++ T +E P E LL RL+ PPTAI+AD ++ WAV++G + N+PVAS 
Sbjct: 61  ELVRANDFIAFIDAVLTRLEEPFEQLLDRLNSPPTAIIADTYIIWAVRVGTKRNIPVASF 120

Query: 138 WPMSATVFSILYHFDLLKENGHFPADLSE-RGEEIVDYFPGVSKICLADLPSFFSGDGLQ 197
           W  SAT+ S+  + DLL  +GHFP + SE + +EIVDY PG+S   L+DL     G   Q
Sbjct: 121 WTTSATILSLFINSDLLASHGHFPIEPSESKLDEIVDYIPGLSPTRLSDL-QILHGYSHQ 180

Query: 198 SVESAVNSARSVDKSQFFISTSVYELESSVIDALKAKFPFPIYTIGPSTPYFELECSAPN 257
                  S   + K+++ +  S YELE   ID   +KF FP+Y+ GP  P  EL     N
Sbjct: 181 VFNIFKKSFGELYKAKYLLFPSAYELEPKAIDFFTSKFDFPVYSTGPLIPLEELSVGNEN 240

Query: 258 GGTDDYFRWLDSQAEGSVLYISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVRGDDGRLK 317
               DYF+WLD Q E SVLYISQGS+LSVS AQ++EIV GV+ +GV+F WV RG + +LK
Sbjct: 241 REL-DYFKWLDEQPESSVLYISQGSFLSVSEAQMEEIVVGVREAGVKFFWVARGGELKLK 300

Query: 318 D-VDRETGMVVGWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNS 377
           + ++   G+VV WCDQL+VLCH ++GGFWTH G+NSTLEG+ +GVP+L +P+FWDQF N+
Sbjct: 301 EALEGSLGVVVSWCDQLRVLCHAAIGGFWTHCGYNSTLEGICSGVPLLTFPVFWDQFLNA 360

Query: 378 KKIAEDWKVGVRFKAVGGKD-LVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRA 437
           K I E+W+VG+  +     + L+   EI E VKRFM+ ES+EG+E+R R  +  EICR A
Sbjct: 361 KMIVEEWRVGMGIERKKQMELLIVSDEIKELVKRFMDGESEEGKEMRRRTCDLSEICRGA 420

Query: 438 VAKGGSSDSNIDAFLKHIS 454
           VAKGGSSD+NIDAF+K I+
Sbjct: 421 VAKGGSSDANIDAFIKDIT 437
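
The percentages in an HSP summary line follow directly from the match counts: each is the number of matching columns divided by the aligned length (229/439 and 309/439 for the AT2G30150.1 hit). The sketch below recomputes them as a consistency check. The helper names `parse_hsp_stats` and `percent` are hypothetical and not part of any BLAST toolkit, and the regular expression assumes the summary layout used on this page; it also accepts the variant spelling "Postives".

```python
import re

# Hypothetical helper (an assumption, not a BLAST API): extracts the
# matches / aligned-length / reported-percent triples from an HSP summary line.
HSP_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    stats = {}
    for label, matches, length, pct in HSP_STAT.findall(line):
        # Normalise the variant spelling to a single key.
        key = "Identity" if label == "Identity" else "Positives"
        stats[key] = (int(matches), int(length), float(pct))
    return stats


def percent(matches, length):
    """Share of aligned columns as a percentage, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 229/439 (52.16%), Positives = 309/439 (70.39%), Query Frame = 0"
for label, (matches, length, reported) in parse_hsp_stats(line).items():
    # Print the recomputed value next to the reported one for comparison.
    print(label, percent(matches, length), reported)
```

The same check can be run on the other HSP summary lines in this section; a mismatch between the recomputed and reported values would point to a transcription error rather than a biological difference.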

BLAST of Cla97C02G045300 vs. TAIR 10
Match: AT2G36970.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 247.3 bits (630), Expect = 4.7e-65
Identity = 154/478 (32.22%), Positives = 239/478 (50.00%), Query Frame = 0

Query: 11  RRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDE-------------WLTFL 70
           R+ H+  +PYP +GH+   ++L   + L +    I+F+ TD                 F 
Sbjct: 7   RKPHIMMIPYPLQGHVIPFVHLA--IKLASHGFTITFVNTDSIHHHISTAHQDDAGDIFS 66

Query: 71  AADPKPQ-NIRFATFPNVIPSELGRAND----FTGFLRSIHTHMEAPVETLLHRLDPPPT 130
           AA    Q +IR+ T  +  P +  R+ +    F G L     H++  +  L  R DPP T
Sbjct: 67  AARSSGQHDIRYTTVSDGFPLDFDRSLNHDQFFEGILHVFSAHVDDLIAKLSRRDDPPVT 126

Query: 131 AILADAFVTWAVQLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIV 190
            ++AD F  W+  +  + N+   S W   A V ++ YH DLL  NGHF +   +  ++++
Sbjct: 127 CLIADTFYVWSSMICDKHNLVNVSFWTEPALVLNLYYHMDLLISNGHFKS--LDNRKDVI 186

Query: 191 DYFPGVSKICLADLPSFFSGDGLQSVESAV------NSARSVDKSQFFISTSVYELESSV 250
           DY PGV  I   DL S+          + V       + + V ++ F +  +V ELE   
Sbjct: 187 DYVPGVKAIEPKDLMSYLQVSDKDVDTNTVVYRILFKAFKDVKRADFVVCNTVQELEPDS 246

Query: 251 IDALKAKFPFPIYTIGPSTPYFELECSAPNG--GTDDYFRWLDSQAEGSVLYISQGSYLS 310
           + AL+AK   P+Y IG   P F  +   P       D   WL  +  GSVLY+S GSY  
Sbjct: 247 LSALQAK--QPVYAIG---PVFSTDSVVPTSLWAESDCTEWLKGRPTGSVLYVSFGSYAH 306

Query: 311 VSSAQIDEIVAGVKASGVRFLWVVRGD----------DGRLKDVDRETGMVVGWCDQLKV 370
           V   +I EI  G+  SG+ F+WV+R D               D  ++ G+VV WC Q++V
Sbjct: 307 VGKKEIVEIAHGLLLSGISFIWVLRPDIVGSNVPDFLPAGFVDQAQDRGLVVQWCCQMEV 366

Query: 371 LCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNSKKIAEDWKVGVRFKAVGGK 430
           + + +VGGF+TH GWNS LE V+ G+P+L +P+  DQF N K + +DW +G+    +  K
Sbjct: 367 ISNPAVGGFFTHCGWNSILESVWCGLPLLCYPLLTDQFTNRKLVVDDWCIGIN---LCEK 426

Query: 431 DLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRAVAKGGSSDSNIDAFLKHI 453
             + R +++  VKR MN E+    ELRN V + +   + AV   GSS++N + F+  +
Sbjct: 427 KTITRDQVSANVKRLMNGETSS--ELRNNVEKVKRHLKDAVTTVGSSETNFNLFVSEV 470

BLAST of Cla97C02G045300 vs. TAIR 10
Match: AT1G78270.1 (UDP-glucosyl transferase 85A4 )

HSP 1 Score: 223.0 bits (567), Expect = 9.6e-58
Identity = 151/487 (31.01%), Positives = 244/487 (50.10%), Query Frame = 0

Query: 6   GSVAPRRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWLTFLAADPKPQ- 65
           G  + ++ H   +PYP +GHIN ++ L KLL  +     ++F+ TD     +     P  
Sbjct: 5   GGSSSQKPHAMCIPYPAQGHINPMLKLAKLLHAR--GFHVTFVNTDYNHRRILQSRGPHA 64

Query: 66  -----NIRFATFPNVIP-SELGRANDFTGFLRSIHTHMEAPVETLLHRLD-----PPPTA 125
                + RF T P+ +P +++    D    + S   +  AP + L+ RL+     PP + 
Sbjct: 65  LNGLPSFRFETIPDGLPWTDVDAKQDMLKLIDSTINNCLAPFKDLILRLNSGSDIPPVSC 124

Query: 126 ILADAFVTWAVQLGKRLNVPVASLWPMSATVFSILYHFDLLKENGHFP----ADLSERGE 185
           I++DA +++ +   + L +PV  LW  SAT   +  H+  L E    P    +DL +  E
Sbjct: 125 IISDASMSFTIDAAEELKIPVVLLWTNSATALILYLHYQKLIEKEIIPLKDSSDLKKHLE 184

Query: 186 EIVDYFPGVSKICLADLPSFFSGDGLQS--VESAVNSARSVDKSQFFISTSVYELESSVI 245
             +D+ P + KI L D P F +    Q   +   ++    + ++      +  +LE +V+
Sbjct: 185 TEIDWIPSMKKIKLKDFPDFVTTTNPQDPMISFILHVTGRIKRASAIFINTFEKLEHNVL 244

Query: 246 DALKAKFPFPIYTIGP-----------STPYFELECSAPNGGTDDYFRWLDSQAEGSVLY 305
            +L++  P  IY++GP           ++   +L  +     T+    WLD++AE +V+Y
Sbjct: 245 LSLRSLLP-QIYSVGPFQILENREIDKNSEIRKLGLNLWEEETES-LDWLDTKAEKAVIY 304

Query: 306 ISQGSYLSVSSAQIDEIVAGVKASGVRFLWVVR-----GDDGRLK----DVDRETGMVV- 365
           ++ GS   ++S QI E   G+  SG  FLWVVR     GDD  L        +  GM++ 
Sbjct: 305 VNFGSLTVLTSEQILEFAWGLARSGKEFLWVVRSGMVDGDDSILPAEFLSETKNRGMLIK 364

Query: 366 GWCDQLKVLCHRSVGGFWTHGGWNSTLEGVFAGVPMLAWPIFWDQFPNSKKIAEDWKVGV 425
           GWC Q KVL H ++GGF TH GWNSTLE ++AGVPM+ WP F DQ  N K   EDW +G+
Sbjct: 365 GWCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPMICWPFFADQLTNRKFCCEDWGIGM 424

Query: 426 RFKAVGGKDLVRRVEIAEFVKRFMNSESDEGRELRNRVSEFQEICRRAVAKG-GSSDSNI 453
                 G++ V+R  +   VK  M+ E  +G+ LR +V E++ +   A A   GSS  N 
Sbjct: 425 EI----GEE-VKRERVETVVKELMDGE--KGKRLREKVVEWRRLAEEASAPPLGSSYVNF 480

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
KAF4349973.1  1.6e-264   49.89     hypothetical protein G4B88_024484 [Cannabis sativa]
RXH84926.1    8.5e-258   44.84     hypothetical protein DVH24_041694 [Malus domestica]
CAE6021044.1  2.0e-251   49.62     unnamed protein product [Arabidopsis arenosa]
KAG5603112.1  9.4e-249   47.92     hypothetical protein H5410_034482 [Solanum commersonii]
KAG6761213.1  2.1e-248   47.28     hypothetical protein POTOM_034414 [Populus tomentosa]

Match Name    E-value    Identity  Description
O64733        1.2e-132   52.37     UDP-glycosyltransferase 87A2 OS=Arabidopsis thaliana OX=3702 GN=UGT87A2 PE=1 SV=...
O64732        4.4e-132   52.16     UDP-glycosyltransferase 87A1 OS=Arabidopsis thaliana OX=3702 GN=UGT87A1 PE=2 SV=...
Q9SJL0        6.7e-64    32.22     UDP-glycosyltransferase 86A1 OS=Arabidopsis thaliana OX=3702 GN=UGT86A1 PE=2 SV=...
Q9M9E7        1.3e-56    31.01     UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana OX=3702 GN=UGT85A4 PE=2 SV=...
Q9ZUV0        3.7e-54    30.00     UDP-glycosyltransferase 86A2 OS=Arabidopsis thaliana OX=3702 GN=UGT86A2 PE=2 SV=...

Match Name    E-value    Identity  Description
A0A7J6DV67    7.7e-265   49.89     Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_024484 PE=4 SV=1
A0A498INJ2    4.1e-258   44.84     Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_041694 PE=4 SV=1
A0A445AMU1    1.5e-247   46.30     UDPGT domain-containing protein OS=Arachis hypogaea OX=3818 GN=Ahy_B01g051711 PE...
A0A6N2M6F7    1.9e-242   49.03     CCT domain-containing protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS282673 PE...
A0A6J0ZY01    2.1e-238   47.18     LOW QUALITY PROTEIN: uncharacterized protein LOC110413059 OS=Herrania umbratica ...

Match Name    E-value    Identity  Description
AT2G30140.2   4.8e-134   52.37     UDP-Glycosyltransferase superfamily protein
AT2G30140.1   8.2e-134   52.37     UDP-Glycosyltransferase superfamily protein
AT2G30150.1   3.1e-133   52.16     UDP-Glycosyltransferase superfamily protein
AT2G36970.1   4.7e-65    32.22     UDP-Glycosyltransferase superfamily protein
AT1G78270.1   9.6e-58    31.01     UDP-glucosyl transferase 85A4

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                           Source       Source Term     Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM         PF00201         UDPGT                                           coord: 247..398; e-value: 7.7E-20; score: 71.2
                                                                                                                                  coord: 778..841; e-value: 1.7E-8; score: 33.7
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  CDD          cd03784         GT1_Gtf-like                                    coord: 482..874; e-value: 2.76616E-50; score: 180.825
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  CDD          cd03784         GT1_Gtf-like                                    coord: 14..429; e-value: 6.30618E-66; score: 224.738
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 475..720; e-value: 2.9E-49; score: 169.9
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 18..445; e-value: 1.5E-114; score: 385.7
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 778..877; e-value: 9.7E-30; score: 105.4
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 254..433; e-value: 1.5E-114; score: 385.7
                                                                                                                                  coord: 721..777; e-value: 1.7E-12; score: 49.2
None       No IPR available                          PANTHER      PTHR48047       GLYCOSYLTRANSFERASE                             coord: 479..900
None       No IPR available                          PANTHER      PTHR48047:SF80  UDP-GLYCOSYLTRANSFERASE 87A1-LIKE               coord: 11..455
None       No IPR available                          PANTHER      PTHR48047       GLYCOSYLTRANSFERASE                             coord: 11..455
None       No IPR available                          PANTHER      PTHR48047:SF80  UDP-GLYCOSYLTRANSFERASE 87A1-LIKE               coord: 479..900
None       No IPR available                          SUPERFAMILY  53756           UDP-Glycosyltransferase/glycogen phosphorylase  coord: 12..452
None       No IPR available                          SUPERFAMILY  53756           UDP-Glycosyltransferase/glycogen phosphorylase  coord: 480..898

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G045300.2  Cla97C02G045300.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession   Term Name
molecular_function  GO:0008194       UDP-glycosyltransferase activity