CmaCh14G020970 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G020970
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: UDP-Glycosyltransferase superfamily protein
Location: Cma_Chr14 : 14507459 .. 14511063 (+)
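
The location string for this feature can be split into its parts programmatically. The following is a minimal sketch, not part of the original record: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into named fields.

    Hypothetical helper; assumes 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Span on the chromosome under the inclusive-coordinate assumption.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr14 : 14507459 .. 14511063 (+)")
print(loc)
```

Under that assumption the feature spans 3605 positions on the plus strand of Cma_Chr14.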
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGTTTCCCAGACAGACGGCGGAGCTCAATCTCCATCAGTCCACATAGTGATGCTTCCAAGTCCAGGTATGGGCCATTTAATCCCTCTCCTTGAGTTTGCCAAACGCCTCCTCACCTTCAACCACCATTTCACCATCACCTTCGCCATCCCTTCCGATGGCCCTCCTACCACCGCCCAAATTTCCGTCCTCTGTTCCCTCCCTCCCCAGATCCGCCACGTCTTTCTCCCACCCGTTTCCCTCAACGATCTTCCCCTCGATTCGAGAATGGAAACCATCATTACCCTCACCGTCGCTCGCTCTGTTCCTTCTCTTCGAGATCTCTTGAAATCCATGGTGGCCGATACTCAAACTAACCTCGTTGCCTTGGTTGTCGACCATTTTTGCATCGACGCCCTCGATGTGGGTAAGGAATTTAACCTCTCCTCTTGTATTTTTTTCCCCTCTACTGCCATGGCTCTCTCCGCCAATCTCTGCTTAGCGGAGCTCGACGAAATGGTCACTGGAGAGTACAGAGACCATCCCGACCTGATTCGAATTCCAGGATGCACTCCGATTCATGGGAGAGATCTATTCGAGCCGACTCAAGATAGGCAAAACCAAGCCTATAAATTATTTCTCCAAAATGCAAAAAGATTTCGATTCGCAGATGGCATTTTTGTGAATAGCTTCCCGGAGTTCGAGCCGGGCGCCATTAGTGCTCTGAAATTGGCAGAACCCCCGATTTACCCCATTGGTCCAGTGGTGAAAATGGATGAAAATGGCAGTGGTGAAGGTGCAAAATGTTTGAATTGGTTGGATGAACAACCACATGGGTCTGTCCTGTACGTGTCGTTTGGGAGTGGGGGGACTCTATCCAGCAAACAAACCGTGGAGTTGGCGATGGGATTGGAAATGAGTGGGGAAAGATTCCTATGGATCGTCAGAAGTCCCAACGACGAGTTATCAAATGCATCCTATTTCAGCGTGCATTCACGAAATGATCCATTGAGTTATCTGCCGGAGGGGTTCGTGGAGAGAGTGAAAGGGAGGGGGCTGTTGGTGCCATCATGGGCGCCGCAAACTCGAATCCTGAAGCACCGCTCCACCGGCGGGTTTTTGAGCCATTGCGGGAACAATTCAGTGTTGGAGAGCGTAGTAAATGGGGTTCCTCTGATCGCTTGGCCGCTTTATGCAGAACAGAGAATGAACGCTGTGACGCTAACAGAGGAGATCAAGGTGGCGCTGAGGCCGAAGGTGAATGAGGAAAATGGATTTGTGGAGAAGGAAGAGATTGCTAAAGTGGTGAAGTCGCTTTTCAAAGGTGAAGAGGGGAAAAAAGTGAGTGCTAGAATGAAGCAATTGCAAGACGCGGCCATAAGAGCCGTCGGAGAGGATGGGTCTTCTACAAAAGCCCTGCGCCAAGCGCTTCTCAAGTGGAAAACACCTTTTTAATCATTTCCATATTTTCTCACTTTTTAATGATAAAATAAAATAAAATTATATGTTATTTTAAATTTAGAAGTTTATATTTTTATATAAACAGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGTGTTGGAATTTTAGATAATTTGTTTGTGAAGGAGTGTGGTTGTCCGACCCACAGCAGCACCAGAGCAGAGCCACCCACCGCCATGGAAGAAGCTCAATCTCAAACGCCCCACGTTCTAATGATGCCAAGTCCGGGAATGGGTCATCTTATCCCACTCATCGAATTTGCCAAACGGCTCGTCTTACTGCACCGCTTCACTGTCACTTTCGCCATTCCTTCCGGCGACGCTCCTTCCAAAGCTCAAATCTCGGTCCTAAATTCCCTACCCTCTGCTATCGACCACCTCTTCCTCCCGCCTGCTCCATTGAAGGACCTTCCAAGCAACACCAAAGCCGAAACCATCATCGTCCTTGCCGTTAGTCGCTCTCTTCCCTCTCTTCGCGACCTCTTCAAATCCATCGTGACCCAACGCAACCTTGTCGCCCTCGTTGTCGACCAATTCGGCACCGTGGCCTTCGAAGTCGCTAAGGAATTCAGCGTCTCGCCTTACATTTACTTTCCTTGCGCCGCCACGACTCTCTCGCTTATTCTCCACATGCCGAAGTTGGACGAGTCGGTCACCGGCGAGTACAGAGTCCTCACCGAACCTATTAGACTTCCGGGGTGCACTCCAATTCCAGGGAAGGAATTGCCGGATCCGTTTCTAGACAGGGAAAATGATTCCTACAAGTTTTTTCTCGAAACCATGAAGGGGTTTGTGTTAGCAGAGGGGATTTTCCTAAACAGCTTTCTGGAATTGGAGTCCAGTGCCATAAATGCTCTGCAATTGAGCGGATCCGGCAACCCCCCAATTTACCCAGTTGGTCCATTGGTGAAAGTTGATTCAAGTGTGACTGAGGAAGGGGTTGAGTGTTTGAATTGGCTGGATGAACAACCACGTGGGTCTGTTCTGTTCGTGTCGTTTGGAAGTGGGGGAACTCTGTCGAGTGTTCAACTGAACGAATTGGCTTTGGGATTGGAAATGAGTGGGCAAAAATTCATATGGGTTGTTAGAAGTCCGAGCGATAAGGAAGCCAGTGCATCATTTTTCAGTGTCCATAGCCAGGATGATCCATTGAGGTACTTGCCGGAGGGGTTCGTGGAGAGAAACAGGGGAAGGGGATTAATGGTGCCGTCGTGGGCTCCGCAGGCACAAATACTGAAGCATGGTTCGACCGGGGGGTTCCTGAGCCACTGCGGGTGGAATTCGACATTGGAGAGCTTGGTTAGTGGGGTTCCTCTGATTGCTTGGCCACTGTATGCAGAACAGAGAGTGAACGCCATCATTTTAACAGAAGAGATTAAGGCGGCGCTGAGGCCGAAGATGAACGAGGAAAGTGGGGTTATTGAGAAGGAAGAGATAGCAAAGGTCGTGAAGTGTCTGTTTGAAGGGGAAGAGGGGAAGAAAGTGCGTGCGAAAATGGAGGAGTTGAGAGTTGCAGGGGAAAGGGCCACTGGAGACGGGGGATCTTCTTCAAGAACGCTCCTGGAAGTAGTTCAGAAATGGAGCAGCAGCAACGTTTCGGGATAG

mRNA sequence

ATGGAAGTTTCCCAGACAGACGGCGGAGCTCAATCTCCATCAGTCCACATAGTGATGCTTCCAAGTCCAGGTATGGGCCATTTAATCCCTCTCCTTGAGTTTGCCAAACGCCTCCTCACCTTCAACCACCATTTCACCATCACCTTCGCCATCCCTTCCGATGGCCCTCCTACCACCGCCCAAATTTCCGTCCTCTGTTCCCTCCCTCCCCAGATCCGCCACGTCTTTCTCCCACCCGTTTCCCTCAACGATCTTCCCCTCGATTCGAGAATGGAAACCATCATTACCCTCACCGTCGCTCGCTCTGTTCCTTCTCTTCGAGATCTCTTGAAATCCATGGTGGCCGATACTCAAACTAACCTCGTTGCCTTGGTTGTCGACCATTTTTGCATCGACGCCCTCGATGTGGGTAAGGAATTTAACCTCTCCTCTTGTATTTTTTTCCCCTCTACTGCCATGGCTCTCTCCGCCAATCTCTGCTTAGCGGAGCTCGACGAAATGGTCACTGGAGAGTACAGAGACCATCCCGACCTGATTCGAATTCCAGGATGCACTCCGATTCATGGGAGAGATCTATTCGAGCCGACTCAAGATAGGCAAAACCAAGCCTATAAATTATTTCTCCAAAATGCAAAAAGATTTCGATTCGCAGATGGCATTTTTGTGAATAGCTTCCCGGAGTTCGAGCCGGGCGCCATTAGTGCTCTGAAATTGGCAGAACCCCCGATTTACCCCATTGGTCCAGTGGTGAAAATGGATGAAAATGGCAGTGGTGAAGGTGCAAAATGTTTGAATTGGTTGGATGAACAACCACATGGGTCTGTCCTGTACGTGTCGTTTGGGAGTGGGGGGACTCTATCCAGCAAACAAACCGTGGAGTTGGCGATGGGATTGGAAATGAGTGGGGAAAGATTCCTATGGATCGTCAGAAGTCCCAACGACGAGTTATCAAATGCATCCTATTTCAGCGTGCATTCACGAAATGATCCATTGAGTTATCTGCCGGAGGGGTTCGTGGAGAGAGTGAAAGGGAGGGGGCTGTTGGTGCCATCATGGGCGCCGCAAACTCGAATCCTGAAGCACCGCTCCACCGGCGGGTTTTTGAGCCATTGCGGGAACAATTCAGTGTTGGAGAGCGTAGTAAATGGGGTTCCTCTGATCGCTTGGCCGCTTTATGCAGAACAGAGAATGAACGCTGTGACGCTAACAGAGGAGATCAAGGTGGCGCTGAGGCCGAAGGTGAATGAGGAAAATGGATTTGTGGAGAAGGAAGAGATTGCTAAAGTGGTGAAGTCGCTTTTCAAAGGTGAAGAGGGGAAAAAAGAGTGTGGTTGTCCGACCCACAGCAGCACCAGAGCAGAGCCACCCACCGCCATGGAAGAAGCTCAATCTCAAACGCCCCACGTTCTAATGATGCCAAGTCCGGGAATGGGTCATCTTATCCCACTCATCGAATTTGCCAAACGGCTCGTCTTACTGCACCGCTTCACTGTCACTTTCGCCATTCCTTCCGGCGACGCTCCTTCCAAAGCTCAAATCTCGGTCCTAAATTCCCTACCCTCTGCTATCGACCACCTCTTCCTCCCGCCTGCTCCATTGAAGGACCTTCCAAGCAACACCAAAGCCGAAACCATCATCGTCCTTGCCGTTAGTCGCTCTCTTCCCTCTCTTCGCGACCTCTTCAAATCCATCGTGACCCAACGCAACCTTGTCGCCCTCGTTGTCGACCAATTCGGCACCGTGGCCTTCGAAGTCGCTAAGGAATTCAGCGTCTCGCCTTACATTTACTTTCCTTGCGCCGCCACGACTCTCTCGCTTATTCTCCACATGCCGAAGTTGGACGAGTCGGTCACCGGCGAGTACAGAGTCCTCACCGAACCTATTAGACTTCCGGGGTGCACTCCAATTCCAGGGAAGGAATTGCCGGATCCGTTTCTAGACAGGGAAAATGATTCCTACAAGTTTTTTCTCGAAACCATGAAGGGGTTTGTGTTAGCAGA
GGGGATTTTCCTAAACAGCTTTCTGGAATTGGAGTCCAGTGCCATAAATGCTCTGCAATTGAGCGGATCCGGCAACCCCCCAATTTACCCAGTTGGTCCATTGGTGAAAGTTGATTCAAGTGTGACTGAGGAAGGGGTTGAGTGTTTGAATTGGCTGGATGAACAACCACGTGGGTCTGTTCTGTTCGTGTCGTTTGGAAGTGGGGGAACTCTGTCGAGTGTTCAACTGAACGAATTGGCTTTGGGATTGGAAATGAGTGGGCAAAAATTCATATGGGTTGTTAGAAGTCCGAGCGATAAGGAAGCCAGTGCATCATTTTTCAGTGTCCATAGCCAGGATGATCCATTGAGGTACTTGCCGGAGGGGTTCGTGGAGAGAAACAGGGGAAGGGGATTAATGGTGCCGTCGTGGGCTCCGCAGGCACAAATACTGAAGCATGGTTCGACCGGGGGGTTCCTGAGCCACTGCGGGTGGAATTCGACATTGGAGAGCTTGGTTAGTGGGGTTCCTCTGATTGCTTGGCCACTGTATGCAGAACAGAGAGTGAACGCCATCATTTTAACAGAAGAGATTAAGGCGGCGCTGAGGCCGAAGATGAACGAGGAAAGTGGGGTTATTGAGAAGGAAGAGATAGCAAAGGTCGTGAAGTGTCTGTTTGAAGGGGAAGAGGGGAAGAAAGTGCGTGCGAAAATGGAGGAGTTGAGAGTTGCAGGGGAAAGGGCCACTGGAGACGGGGGATCTTCTTCAAGAACGCTCCTGGAAGTAGTTCAGAAATGGAGCAGCAGCAACGTTTCGGGATAG

Coding sequence (CDS)

ATGGAAGTTTCCCAGACAGACGGCGGAGCTCAATCTCCATCAGTCCACATAGTGATGCTTCCAAGTCCAGGTATGGGCCATTTAATCCCTCTCCTTGAGTTTGCCAAACGCCTCCTCACCTTCAACCACCATTTCACCATCACCTTCGCCATCCCTTCCGATGGCCCTCCTACCACCGCCCAAATTTCCGTCCTCTGTTCCCTCCCTCCCCAGATCCGCCACGTCTTTCTCCCACCCGTTTCCCTCAACGATCTTCCCCTCGATTCGAGAATGGAAACCATCATTACCCTCACCGTCGCTCGCTCTGTTCCTTCTCTTCGAGATCTCTTGAAATCCATGGTGGCCGATACTCAAACTAACCTCGTTGCCTTGGTTGTCGACCATTTTTGCATCGACGCCCTCGATGTGGGTAAGGAATTTAACCTCTCCTCTTGTATTTTTTTCCCCTCTACTGCCATGGCTCTCTCCGCCAATCTCTGCTTAGCGGAGCTCGACGAAATGGTCACTGGAGAGTACAGAGACCATCCCGACCTGATTCGAATTCCAGGATGCACTCCGATTCATGGGAGAGATCTATTCGAGCCGACTCAAGATAGGCAAAACCAAGCCTATAAATTATTTCTCCAAAATGCAAAAAGATTTCGATTCGCAGATGGCATTTTTGTGAATAGCTTCCCGGAGTTCGAGCCGGGCGCCATTAGTGCTCTGAAATTGGCAGAACCCCCGATTTACCCCATTGGTCCAGTGGTGAAAATGGATGAAAATGGCAGTGGTGAAGGTGCAAAATGTTTGAATTGGTTGGATGAACAACCACATGGGTCTGTCCTGTACGTGTCGTTTGGGAGTGGGGGGACTCTATCCAGCAAACAAACCGTGGAGTTGGCGATGGGATTGGAAATGAGTGGGGAAAGATTCCTATGGATCGTCAGAAGTCCCAACGACGAGTTATCAAATGCATCCTATTTCAGCGTGCATTCACGAAATGATCCATTGAGTTATCTGCCGGAGGGGTTCGTGGAGAGAGTGAAAGGGAGGGGGCTGTTGGTGCCATCATGGGCGCCGCAAACTCGAATCCTGAAGCACCGCTCCACCGGCGGGTTTTTGAGCCATTGCGGGAACAATTCAGTGTTGGAGAGCGTAGTAAATGGGGTTCCTCTGATCGCTTGGCCGCTTTATGCAGAACAGAGAATGAACGCTGTGACGCTAACAGAGGAGATCAAGGTGGCGCTGAGGCCGAAGGTGAATGAGGAAAATGGATTTGTGGAGAAGGAAGAGATTGCTAAAGTGGTGAAGTCGCTTTTCAAAGGTGAAGAGGGGAAAAAAGAGTGTGGTTGTCCGACCCACAGCAGCACCAGAGCAGAGCCACCCACCGCCATGGAAGAAGCTCAATCTCAAACGCCCCACGTTCTAATGATGCCAAGTCCGGGAATGGGTCATCTTATCCCACTCATCGAATTTGCCAAACGGCTCGTCTTACTGCACCGCTTCACTGTCACTTTCGCCATTCCTTCCGGCGACGCTCCTTCCAAAGCTCAAATCTCGGTCCTAAATTCCCTACCCTCTGCTATCGACCACCTCTTCCTCCCGCCTGCTCCATTGAAGGACCTTCCAAGCAACACCAAAGCCGAAACCATCATCGTCCTTGCCGTTAGTCGCTCTCTTCCCTCTCTTCGCGACCTCTTCAAATCCATCGTGACCCAACGCAACCTTGTCGCCCTCGTTGTCGACCAATTCGGCACCGTGGCCTTCGAAGTCGCTAAGGAATTCAGCGTCTCGCCTTACATTTACTTTCCTTGCGCCGCCACGACTCTCTCGCTTATTCTCCACATGCCGAAGTTGGACGAGTCGGTCACCGGCGAGTACAGAGTCCTCACCGAACCTATTAGACTTCCGGGGTGCACTCCAATTCCAGGGAAGGAATTGCCGGATCCGTTTCTAGACAGGGAAAATGATTCCTACAAGTTTTTTCTCGAAACCATGAAGGGGTTTGTGTTAGCAGA
GGGGATTTTCCTAAACAGCTTTCTGGAATTGGAGTCCAGTGCCATAAATGCTCTGCAATTGAGCGGATCCGGCAACCCCCCAATTTACCCAGTTGGTCCATTGGTGAAAGTTGATTCAAGTGTGACTGAGGAAGGGGTTGAGTGTTTGAATTGGCTGGATGAACAACCACGTGGGTCTGTTCTGTTCGTGTCGTTTGGAAGTGGGGGAACTCTGTCGAGTGTTCAACTGAACGAATTGGCTTTGGGATTGGAAATGAGTGGGCAAAAATTCATATGGGTTGTTAGAAGTCCGAGCGATAAGGAAGCCAGTGCATCATTTTTCAGTGTCCATAGCCAGGATGATCCATTGAGGTACTTGCCGGAGGGGTTCGTGGAGAGAAACAGGGGAAGGGGATTAATGGTGCCGTCGTGGGCTCCGCAGGCACAAATACTGAAGCATGGTTCGACCGGGGGGTTCCTGAGCCACTGCGGGTGGAATTCGACATTGGAGAGCTTGGTTAGTGGGGTTCCTCTGATTGCTTGGCCACTGTATGCAGAACAGAGAGTGAACGCCATCATTTTAACAGAAGAGATTAAGGCGGCGCTGAGGCCGAAGATGAACGAGGAAAGTGGGGTTATTGAGAAGGAAGAGATAGCAAAGGTCGTGAAGTGTCTGTTTGAAGGGGAAGAGGGGAAGAAAGTGCGTGCGAAAATGGAGGAGTTGAGAGTTGCAGGGGAAAGGGCCACTGGAGACGGGGGATCTTCTTCAAGAACGCTCCTGGAAGTAGTTCAGAAATGGAGCAGCAGCAACGTTTCGGGATAG

Protein sequence

MEVSQTDGGAQSPSVHIVMLPSPGMGHLIPLLEFAKRLLTFNHHFTITFAIPSDGPPTTAQISVLCSLPPQIRHVFLPPVSLNDLPLDSRMETIITLTVARSVPSLRDLLKSMVADTQTNLVALVVDHFCIDALDVGKEFNLSSCIFFPSTAMALSANLCLAELDEMVTGEYRDHPDLIRIPGCTPIHGRDLFEPTQDRQNQAYKLFLQNAKRFRFADGIFVNSFPEFEPGAISALKLAEPPIYPIGPVVKMDENGSGEGAKCLNWLDEQPHGSVLYVSFGSGGTLSSKQTVELAMGLEMSGERFLWIVRSPNDELSNASYFSVHSRNDPLSYLPEGFVERVKGRGLLVPSWAPQTRILKHRSTGGFLSHCGNNSVLESVVNGVPLIAWPLYAEQRMNAVTLTEEIKVALRPKVNEENGFVEKEEIAKVVKSLFKGEEGKKECGCPTHSSTRAEPPTAMEEAQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPSAIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFGTVAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIWVVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIAKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSSSNVSG
BLAST of CmaCh14G020970 vs. Swiss-Prot
Match: HQGT_RAUSE (Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1 SV=1)

HSP 1 Score: 552.7 bits (1423), Expect = 7.4e-156
Identity = 269/463 (58.10%), Positives = 348/463 (75.16%), Query Frame = 1

Query: 466 TPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPSAIDH 525
           TPH+ M+P+PGMGHLIPL+EFAKRLVL H F VTF IP+     KAQ S L++LP+ +++
Sbjct: 4   TPHIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLPKAQKSFLDALPAGVNY 63

Query: 526 LFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFGTVAFEV 585
           + LPP    DLP++ + ET I L ++RSLP +RD  K+++    L ALVVD FGT AF+V
Sbjct: 64  VLLPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDAFDV 123

Query: 586 AKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKELPDPF 645
           A EF VSPYI++P  A  LSL  H+PKLD+ V+ EYR + EP+++PGC PI GK+  DP 
Sbjct: 124 AIEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFLDPA 183

Query: 646 LDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPLVKVD 705
            DR+ND+YK  L   K + LAEGI +N+F +LE   + ALQ    G PP+YP+GPL++ D
Sbjct: 184 QDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIGPLIRAD 243

Query: 706 SSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIWVVRSPS 765
           SS   +  ECL WLD+QPRGSVLF+SFGSGG +S  Q  ELALGLEMS Q+F+WVVRSP+
Sbjct: 244 SSSKVDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVVRSPN 303

Query: 766 DKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGFLSHCGW 825
           DK A+A++FS+ +Q+D L YLPEGF+ER +GR L+VPSWAPQ +IL HGSTGGFL+HCGW
Sbjct: 304 DKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLTHCGW 363

Query: 826 NSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIAKVVKCL 885
           NS LES+V+GVPLIAWPLYAEQ++NA++LTE +K ALRPK   E+G+I + EIA  VK L
Sbjct: 364 NSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAG-ENGLIGRVEIANAVKGL 423

Query: 886 FEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSS 929
            EGEEGKK R+ M++L+ A  RA  D GSS++ L E+  KW +
Sbjct: 424 MEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAELACKWEN 465
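
The percentages in an HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them for the first hit; it is an illustration only, the function name `parse_hsp_stats` is hypothetical, and the regular expression is written for the summary-line layout used on this page rather than for BLAST output in general.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from an HSP summary line.

    Hypothetical helper; tolerates the 'Postives' misspelling that some
    report renderers emit as well as the correct 'Positives'.
    """
    pattern = r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*tives\s*=\s*(\d+)/(\d+)"
    match = re.search(pattern, line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length_a, positive, length_b = map(int, match.groups())
    if length_a != length_b:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length_a,
        "pct_identity": 100.0 * identical / length_a,
        "pct_positive": 100.0 * positive / length_a,
    }

stats = parse_hsp_stats(
    "Identity = 269/463 (58.10%), Positives = 348/463 (75.16%), Query Frame = 1"
)
print(f"{stats['pct_identity']:.2f}% identical, {stats['pct_positive']:.2f}% positive")
```

Positives count the identical residues plus the conservative substitutions (marked `+` in the alignment midline), so the positive percentage is never lower than the identity percentage.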

BLAST of CmaCh14G020970 vs. Swiss-Prot
Match: U72B1_ARATH (UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1 PE=1 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 8.5e-152
Identity = 278/470 (59.15%), Positives = 351/470 (74.68%), Query Frame = 1

Query: 463 QSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPSA 522
           +S+TPHV ++PSPGMGHLIPL+EFAKRLV LH  TVTF I     PSKAQ +VL+SLPS+
Sbjct: 3   ESKTPHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSS 62

Query: 523 IDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNL-VALVVDQFGTV 582
           I  +FLPP  L DL S+T+ E+ I L V+RS P LR +F S V    L  ALVVD FGT 
Sbjct: 63  ISSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTD 122

Query: 583 AFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKEL 642
           AF+VA EF V PYI++P  A  LS  LH+PKLDE+V+ E+R LTEP+ LPGC P+ GK+ 
Sbjct: 123 AFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDF 182

Query: 643 PDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPL 702
            DP  DR++D+YK+ L   K +  AEGI +N+F ELE +AI ALQ  G   PP+YPVGPL
Sbjct: 183 LDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPL 242

Query: 703 V---KVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFI 762
           V   K ++  TEE  ECL WLD QP GSVL+VSFGSGGTL+  QLNELALGL  S Q+F+
Sbjct: 243 VNIGKQEAKQTEES-ECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFL 302

Query: 763 WVVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGG 822
           WV+RSPS   A++S+F  HSQ DPL +LP GF+ER + RG ++P WAPQAQ+L H STGG
Sbjct: 303 WVIRSPSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGG 362

Query: 823 FLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEI 882
           FL+HCGWNSTLES+VSG+PLIAWPLYAEQ++NA++L+E+I+AALRP+  ++ G++ +EE+
Sbjct: 363 FLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGDD-GLVRREEV 422

Query: 883 AKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSS 929
           A+VVK L EGEEGK VR KM+EL+ A  R   D G+S++ L  V  KW +
Sbjct: 423 ARVVKGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWKA 469

BLAST of CmaCh14G020970 vs. Swiss-Prot
Match: U72B3_ARATH (UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3 PE=2 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 4.2e-143
Identity = 265/471 (56.26%), Positives = 348/471 (73.89%), Query Frame = 1

Query: 462 AQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPS 521
           A   TPHV ++PSPG+GHLIPL+E AKRL+  H FTVTF IP    PSKAQ SVLNSLPS
Sbjct: 2   ADGNTPHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPS 61

Query: 522 AIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVA-LVVDQFGT 581
           +I  +FLPPA L D+PS  + ET I L V+RS P+LR+LF S+  ++ L A LVVD FGT
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGT 121

Query: 582 VAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKE 641
            AF+VA EF VSPYI++   A  L+ +LH+PKLDE+V+ E+R LTEP+ +PGC PI GK+
Sbjct: 122 DAFDVAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKD 181

Query: 642 LPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGP 701
             DP  DR+++SYK+ L  +K F  AEGI +NSF++LE + I  +Q      PP+Y +GP
Sbjct: 182 FVDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGP 241

Query: 702 LVKV---DSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKF 761
           LV     D+ V +E  +CLNWLD QP GSVL+VSFGSGGTL+  Q  ELALGL  SG++F
Sbjct: 242 LVNSGSHDADVNDE-YKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRF 301

Query: 762 IWVVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTG 821
           +WV+RSPS   AS+S+F+  S++DP  +LP+GF++R + +GL+V SWAPQAQIL H S G
Sbjct: 302 LWVIRSPSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIG 361

Query: 822 GFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEE 881
           GFL+HCGWNS+LES+V+GVPLIAWPLYAEQ++NA++L  ++ AALR ++ E+ GV+ +EE
Sbjct: 362 GFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLV-DVGAALRARLGED-GVVGREE 421

Query: 882 IAKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSS 929
           +A+VVK L EGEEG  VR KM+EL+    R   D G S+++L EV  KW +
Sbjct: 422 VARVVKGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWKA 468

BLAST of CmaCh14G020970 vs. Swiss-Prot
Match: U72B2_ARATH (UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2 PE=2 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 9.4e-143
Identity = 257/470 (54.68%), Positives = 343/470 (72.98%), Query Frame = 1

Query: 462 AQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPS 521
           A++ TPH+ +MPSPGMGHLIP +E AKRLV    FTVT  I    +PSKAQ SVLNSLPS
Sbjct: 2   AEANTPHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPS 61

Query: 522 AIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVA-LVVDQFGT 581
           +I  +FLPPA L D+PS  + ET  +L ++RS P+LR+LF S+ T+++L A LVVD FG 
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGA 121

Query: 582 VAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKE 641
            AF+VA +F VSPYI++   A  LS  LH+PKLD++V+ E+R LTEP+++PGC PI GK+
Sbjct: 122 DAFDVAVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKD 181

Query: 642 LPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGP 701
             D   DR +D+YK  L   K +  A+GI +NSF++LES+AI ALQ      P +YP+GP
Sbjct: 182 FLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGP 241

Query: 702 LVKVDSSVT--EEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFI 761
           LV   SS    E+   CL+WLD QP GSVL++SFGSGGTL+  Q NELA+GL  SG++FI
Sbjct: 242 LVNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFI 301

Query: 762 WVVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGG 821
           WV+RSPS+   S+S+F+ HS+ DP  +LP GF++R + +GL+VPSWAPQ QIL H ST G
Sbjct: 302 WVIRSPSE-IVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCG 361

Query: 822 FLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEI 881
           FL+HCGWNSTLES+V+GVPLIAWPL+AEQ++N ++L E++ AALR    E+ G++ +EE+
Sbjct: 362 FLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGED-GIVRREEV 421

Query: 882 AKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSS 929
            +VVK L EGEEGK +  K++EL+    R  GD G SS++  EV+ KW +
Sbjct: 422 VRVVKALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWKT 469

BLAST of CmaCh14G020970 vs. Swiss-Prot
Match: UFOG5_MANES (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 4.1e-98
Identity = 197/474 (41.56%), Positives = 289/474 (60.97%), Query Frame = 1

Query: 467 PHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSL--PSAID 526
           PH++++ SPG+GHLIP++E  KR+V L  F VT  +   D  S A+  VL S   P   +
Sbjct: 10  PHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDT-SAAEPQVLRSAMTPKLCE 69

Query: 527 HLFLPPAPLKDL--PSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNL--VALVVDQFGT 586
            + LPP  +  L  P  T    + VL     +  +R  F++ V+       A++VD FGT
Sbjct: 70  IIQLPPPNISCLIDPEATVCTRLFVL-----MREIRPAFRAAVSALKFRPAAIIVDLFGT 129

Query: 587 VAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKE 646
            + EVAKE  ++ Y+Y    A  L+L +++P LD+ V GE+ +  EP+++PGC P+  +E
Sbjct: 130 ESLEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEE 189

Query: 647 LPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQ----LSGSGNPPIY 706
           + DP LDR N  Y  +         A+GI +N++  LE +   AL+    L      P++
Sbjct: 190 VVDPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVF 249

Query: 707 PVGPLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQK 766
           P+GPL +  +       E L+WLD+QP+ SV++VSFGSGGTLS  Q+ ELA GLE S Q+
Sbjct: 250 PIGPLRR-QAGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQR 309

Query: 767 FIWVVRSPSDKEASASFFSV-HSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGS 826
           FIWVVR P+ K   A+FF+     DD   Y PEGF+ R +  GL+VP W+PQ  I+ H S
Sbjct: 310 FIWVVRQPTVKTGDAAFFTQGDGADDMSGYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPS 369

Query: 827 TGGFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEK 886
            G FLSHCGWNS LES+ +GVP+IAWP+YAEQR+NA +LTEE+  A+RPK      V+++
Sbjct: 370 VGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKR 429

Query: 887 EEIAKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSSS 930
           EEI ++++ +   EEG ++R ++ EL+ +GE+A  +GGSS   +  +  +W  S
Sbjct: 430 EEIERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNEWEKS 476

BLAST of CmaCh14G020970 vs. TrEMBL
Match: A0A0A0L8M3_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_3G119710 PE=3 SV=1)

HSP 1 Score: 745.3 bits (1923), Expect = 8.7e-212
Identity = 365/471 (77.49%), Positives = 416/471 (88.32%), Query Frame = 1

Query: 460 EEAQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSL 519
           +E +S TPHV+MM SPGMGHLIPL+EFAKRLVLLHRFTVTF IPSG  P KAQIS+L+SL
Sbjct: 11  QEFESSTPHVVMMVSPGMGHLIPLVEFAKRLVLLHRFTVTFVIPSGGPPPKAQISLLSSL 70

Query: 520 PSAIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFG 579
           PSAIDH+FLPP  L DLP  TK ETIIVL V+RSLPSLRD FKS++TQRN VA VVDQF 
Sbjct: 71  PSAIDHVFLPPVSLNDLPPQTKGETIIVLTVTRSLPSLRDQFKSMLTQRNPVAFVVDQFC 130

Query: 580 TVAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGK 639
           T+A ++A+EF+V PY+Y PC+ATTLSL+LHMP+LD+SV GEY  LTEPI+LP C+P P K
Sbjct: 131 TIAIDLAREFNVPPYVYLPCSATTLSLVLHMPELDKSVVGEYTDLTEPIKLPACSPFPAK 190

Query: 640 ELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVG 699
            LPDPFLDR++DSYK+FLE+M  F LA+GIF+NSF ELE   INAL+L  SG PPIYPVG
Sbjct: 191 ALPDPFLDRKDDSYKYFLESMSRFGLADGIFVNSFPELEPDPINALKLEESGYPPIYPVG 250

Query: 700 PLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIW 759
           P+VK+DSS +EE +ECL WLDEQP GSVLFVSFGSGGTLSS+Q NELA+GLEMSGQKFIW
Sbjct: 251 PIVKMDSSGSEEEIECLKWLDEQPHGSVLFVSFGSGGTLSSIQNNELAMGLEMSGQKFIW 310

Query: 760 VVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGF 819
           VVRSP DKEA+ASFFSVHSQ+DPL++LPEGFVERN+GRGL++PSWAPQAQIL HGSTGGF
Sbjct: 311 VVRSPHDKEANASFFSVHSQNDPLKFLPEGFVERNKGRGLLLPSWAPQAQILSHGSTGGF 370

Query: 820 LSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIA 879
           LSHCGWNSTLESLV+GVP+IAWPLYAEQR+NA+IL EEIK AL+ KMNEESG+IEKEEIA
Sbjct: 371 LSHCGWNSTLESLVNGVPMIAWPLYAEQRLNAVILIEEIKVALKVKMNEESGIIEKEEIA 430

Query: 880 KVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSSSN 931
           KVVK LFE EEGKKVR KMEELRVAGER  G+GGSSSRT+LEVVQKW + N
Sbjct: 431 KVVKSLFESEEGKKVREKMEELRVAGERVVGEGGSSSRTVLEVVQKWRNRN 481

BLAST of CmaCh14G020970 vs. TrEMBL
Match: C5XYZ7_SORBI (Putative uncharacterized protein Sb04g008700 OS=Sorghum bicolor GN=Sb04g008700 PE=4 SV=1)

HSP 1 Score: 720.7 bits (1859), Expect = 2.3e-204
Identity = 420/984 (42.68%), Positives = 571/984 (58.03%), Query Frame = 1

Query: 9   GAQSPSVHIVMLPSPGMGHLIPLLEFAKRLLTFNHHFT----ITFA-IPSDGPPTTAQIS 68
           G +    H+V++ SPG GHL+P+ E A+RL+   HH      +TFA + +D    +A  +
Sbjct: 12  GPRPDRPHVVLVSSPGAGHLMPMAELARRLVA--HHAVAATLVTFADLSADSDAHSA--A 71

Query: 69  VLCSL-PPQIRHVFLPPVSLNDLPLDSRMETIITLTVARSVPSLRDLLKSMVADTQTNLV 128
           VL SL    +    LP V  +DLP D+R+ET++   + RS+P LR LL+ +  D+   L 
Sbjct: 72  VLSSLRAANVSTATLPAVPHDDLPADARIETVLLEVIGRSIPHLRALLRDV--DSTAPLA 131

Query: 129 ALVVDHFCIDALDVGKEFNLSSCIFFPSTAMALSANLCLAELDEMV-TGEYRDHPDLIRI 188
           ALV D FC  AL +  E  +   IFFPS    LS      E+++    GEYRD PD +++
Sbjct: 132 ALVPDFFCTAALPLASELGVPGYIFFPSNLTVLSVMRSAVEVNDGAGAGEYRDLPDPLQL 191

Query: 189 PGCTPIHGRDLFEPTQDRQNQAYKLFLQNAKRFRFADGIFVNSFPEFEPGAISALKLAE- 248
           PG   +   DL +  +D +   Y   +   +R+R A G   N+F   +P  +   K A  
Sbjct: 192 PGGVSLRREDLPDGFRDGKEPVYAHLVGEGRRYRAAAGFLANTFHGMDPATVEEFKKAAE 251

Query: 249 ----PPIYPIGPVVKMDENGSGEGAKCLNWLDEQPHGSVLYVSFGSGGTLSSKQTVELAM 308
               PP YP+GP V+   +  G  + C+ WLD QP GSV+YVSFGS GTLS +QT ELA 
Sbjct: 252 QIRFPPAYPVGPFVRSSSDEGGASSPCIEWLDRQPTGSVVYVSFGSAGTLSVEQTAELAA 311

Query: 309 GLEMSGERFLWIVRSPN------DELSNASYFSVHSRNDPLSYLPEGFVERVKGRGLLVP 368
           GLE SG RFLWIVR P+      D++   S       NDPL++LP+GF+ER +GRGL V 
Sbjct: 312 GLEDSGHRFLWIVRMPSLDGEHSDDMGRKSRGGGGDENDPLAWLPDGFLERTRGRGLAVA 371

Query: 369 SWAPQTRILKHRSTGGFLSHCGNNSVLESVVNGVPLIAWPLYAEQRMNAVTLTEEIKVAL 428
           SWAPQ R+L H +T  F+SHCG NS LESV +GVP++AWPLYAEQRMNAV L+E + VAL
Sbjct: 372 SWAPQVRVLSHPATAAFVSHCGWNSALESVTSGVPMVAWPLYAEQRMNAVVLSENVGVAL 431

Query: 429 RPKVN-EENGFVEKEEIAKVVKSLFKGEEG---KKECG----------CPTHSSTRA--- 488
           R +V  ++ G V +EEIA  V+ L +GE G   ++  G           P  SS RA   
Sbjct: 432 RLRVRPDDGGLVGREEIAAAVRELMEGEHGRAMRRRTGDLQQAADMAWAPDGSSRRALGE 491

Query: 489 ------------------EPPTAMEEAQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHR 548
                              PP  ME   SQ   V++  SPG GHLIPL+E A+RL + H 
Sbjct: 492 VVGRWKAATTTGASTEWLSPPELMENLPSQ-QQVVLFASPGAGHLIPLVELARRLAMDHG 551

Query: 549 FTVTFAIPSGDAPSKAQISVLNSLPSAIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLP 608
           F VT  + +G +      +VL+SLPS++    LP   L DLP +    T++   V RSLP
Sbjct: 552 FAVTLVMLTGMSDPANDAAVLSSLPSSVATAVLPAVSLDDLPPDVGFGTLMFELVRRSLP 611

Query: 609 SLRDLFKSIVTQRNLVALVVDQFGTVAFEVAKEFSVSPYIYFPCAATTLSLILHMPKL-D 668
            LR L      +  + ALV D FGT A  +A E     Y++FP +   +S++ H+ ++  
Sbjct: 612 HLRALMDGASGRGPVTALVCDFFGTAALPLAAELGALGYVFFPNSFAMISIMRHIVEIHG 671

Query: 669 ESVTGEYRVLTEPIRLPGCTPIPGKELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSF 728
           ++  GEYR L +P+ LPG   +   +LPD F + E+  Y + +E  + +  A+G  +NSF
Sbjct: 672 DAAPGEYRDLPDPLPLPGGPLLRHADLPDGFRESEDPVYAYLVEEARRYGRADGFLVNSF 731

Query: 729 LELESSAINALQLSGSGN--PPIYPVGPLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSF 788
            ELE +  +  +        PP+YPVGP V+  S    +   CL WLD QP GSV++VSF
Sbjct: 732 EELEVAMADMFKRDAEDGAFPPVYPVGPFVRSSSGDEADESGCLEWLDRQPEGSVVYVSF 791

Query: 789 GSGGTLSSVQLNELALGLEMSGQKFIWVVRSPS-DKEASASFFSVHSQDDPLRYLPEGFV 848
           G+GG LS  Q  ELA GLEMSG +F+WVVR PS D    A       +DDPL +LPEGFV
Sbjct: 792 GTGGALSVEQTAELAAGLEMSGHRFLWVVRMPSLDGNPCALGTIPGDKDDPLAWLPEGFV 851

Query: 849 ERNRGRGLMVPSWAPQAQILKHGSTGGFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNA 908
           +R  GRGL V +WAPQ ++L H +T  F+SHCGWNSTLES+ +GVP++AWPLYAEQ+ NA
Sbjct: 852 QRTSGRGLAVVAWAPQVRVLSHPATASFVSHCGWNSTLESVAAGVPMVAWPLYAEQKTNA 911

Query: 909 IILTEEIKAALRP--KMNEESGVIEKEEIAKVVKCLFEGEEGKKVRAKMEELRVAGERAT 934
            ILTE    ALRP  + + + G++ +E IA  V+ L EGEEG  VR +  ELR A +RA 
Sbjct: 912 AILTEVTGVALRPAARGHGQYGLVTREVIAAAVRELMEGEEGSAVRGRARELREASKRAW 971

BLAST of CmaCh14G020970 vs. TrEMBL
Match: K7NBX4_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG2 PE=2 SV=1)

HSP 1 Score: 689.9 bits (1779), Expect = 4.3e-195
Identity = 345/465 (74.19%), Positives = 395/465 (84.95%), Query Frame = 1

Query: 462 AQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPS 521
           AQS TPHV+M+PSPGMGHLIPL+EFAKRL+ LHRFTVTFAIPSGD PSKAQIS+L+SLPS
Sbjct: 10  AQSPTPHVVMLPSPGMGHLIPLLEFAKRLLFLHRFTVTFAIPSGDPPSKAQISILSSLPS 69

Query: 522 AIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFGTV 581
            ID++FLPP    DLP +TKA   IVLAV+RSLPS RDLFKS+V   NLVALVVDQFGT 
Sbjct: 70  GIDYVFLPPVNFHDLPKDTKAGVFIVLAVARSLPSFRDLFKSMVANTNLVALVVDQFGTD 129

Query: 582 AFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKEL 641
           AF+VA+EF+VSPYI+FPCAA TLS +L +P+ DE+V GEYR L EPIRL GC PIPGK+L
Sbjct: 130 AFDVAREFNVSPYIFFPCAAMTLSFLLRLPEFDETVAGEYRELPEPIRLSGCAPIPGKDL 189

Query: 642 PDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPL 701
             PF DREND+YK FL   K + LA+GIFLNSF ELE  AI AL    S  P ++PVGPL
Sbjct: 190 AGPFHDRENDAYKLFLHNAKRYALADGIFLNSFPELEPGAIKALLEEESRKPLVHPVGPL 249

Query: 702 VKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIWVV 761
           V++DSS +EEG ECL WL+EQP GSVLFVSFGSGG LSS Q+NELALGLEMSG +FIWVV
Sbjct: 250 VQIDSSGSEEGAECLKWLEEQPHGSVLFVSFGSGGALSSDQINELALGLEMSGHRFIWVV 309

Query: 762 RSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGFLS 821
           RSPSD+ A+ASFFSVHSQ+DPL +LPEGF+E  RGR ++VPSWAPQAQIL H STGGFLS
Sbjct: 310 RSPSDEAANASFFSVHSQNDPLSFLPEGFLEGTRGRSVVVPSWAPQAQILSHSSTGGFLS 369

Query: 822 HCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIAKV 881
           HCGWNSTLES+V GVPLIAWPLYAEQ++NAI+LTE+IKAALRPK+NEESG+IEKEEIA+V
Sbjct: 370 HCGWNSTLESVVYGVPLIAWPLYAEQKMNAILLTEDIKAALRPKINEESGLIEKEEIAEV 429

Query: 882 VKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKW 927
           VK LFEGE+GK+VRAKMEEL+ A  R  G+ GSSS TL EVVQKW
Sbjct: 430 VKELFEGEDGKRVRAKMEELKDAAVRVLGEDGSSS-TLSEVVQKW 473

BLAST of CmaCh14G020970 vs. TrEMBL
Match: K7NBR5_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG3 PE=2 SV=1)

HSP 1 Score: 681.4 bits (1757), Expect = 1.5e-192
Identity = 338/472 (71.61%), Positives = 395/472 (83.69%), Query Frame = 1

Query: 462 AQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPS 521
           AQS TPHV+M+PSPGMGHLIPL+EFAKRL+ LHRFTVTFAIPSGD PSKAQIS+L+SLPS
Sbjct: 10  AQSPTPHVVMLPSPGMGHLIPLLEFAKRLLFLHRFTVTFAIPSGDPPSKAQISILSSLPS 69

Query: 522 AIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFGTV 581
            ID++FLPP    DLP +TKAE  IVLAV+RSLPS RDLFKS+V   NLVALVVDQFGT 
Sbjct: 70  GIDYVFLPPVNFHDLPKDTKAEVFIVLAVARSLPSFRDLFKSMVANTNLVALVVDQFGTD 129

Query: 582 AFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKEL 641
           AF+VA+EF+VSPYI+FPCAA TLS +L +P+ DE+V  EYR L EPIRL GC PIPGK+L
Sbjct: 130 AFDVAREFNVSPYIFFPCAAMTLSFLLRLPEFDETVAEEYRELPEPIRLSGCAPIPGKDL 189

Query: 642 PDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPL 701
            DPF DREND+YK FL   K + LA+GIFLNSF ELE  AI AL    S  P ++PVGPL
Sbjct: 190 ADPFHDRENDAYKLFLHNAKRYALADGIFLNSFPELEPGAIKALLEEESRKPLVHPVGPL 249

Query: 702 VKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIWVV 761
           V++DSS +EEG ECL WL+EQP GSVLFVSFGSGGTLSS Q+NELALGLEMSG +FIWVV
Sbjct: 250 VQIDSSGSEEGAECLKWLEEQPHGSVLFVSFGSGGTLSSDQINELALGLEMSGHRFIWVV 309

Query: 762 RSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGFLS 821
           RSPSD+ A+ASFFSVHSQ+DPL +LPEGF+E  RGR ++VPSWAPQAQIL H STGGFLS
Sbjct: 310 RSPSDEAANASFFSVHSQNDPLSFLPEGFLEGTRGRSVVVPSWAPQAQILSHSSTGGFLS 369

Query: 822 HCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIAKV 881
           HCGWNSTLES+V GVPLIAWPLYAEQ++NAI+LTE+IK ALRPK NE++G++EKEEIA+ 
Sbjct: 370 HCGWNSTLESVVYGVPLIAWPLYAEQKMNAILLTEDIKVALRPKTNEKTGIVEKEEIAEA 429

Query: 882 VKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSSSNVSG 934
           VK L EGE+GKK+R+KM+ LR A ER   + GSSS+ L ++V KW  S +SG
Sbjct: 430 VKTLMEGEDGKKLRSKMKYLRNAAERVLEEDGSSSKALSQMVLKW-KSKISG 480

BLAST of CmaCh14G020970 vs. TrEMBL
Match: A0A0R0IWD3_SOYBN (Uncharacterized protein (Fragment) OS=Glycine max GN=GLYMA_08G338900 PE=4 SV=1)

HSP 1 Score: 679.5 bits (1752), Expect = 5.9e-192
Identity = 367/755 (48.61%), Positives = 499/755 (66.09%), Query Frame = 1

Query: 191 DLFEPTQDRQNQAYKLFLQNAKRFRFADGIFVNSFPEFEPGAISALKLAEPPIYPIGPVV 250
           DL +P +DR +Q Y  FLQ +K    ADGI VNSF E E G I AL+             
Sbjct: 1   DLPKPFRDRTSQMYSFFLQRSKTLHVADGILVNSFKEIEAGPIRALR------------- 60

Query: 251 KMDENGSGEGAKCLNWLDEQPHGSVLYVSFGSGGTLSSKQTVELAMGLEMSGERFLWIVR 310
              E G  E   CL WL++Q   SVLYVSFGSGGTLS  Q  ELA+GLE+SG++FLW+VR
Sbjct: 61  ---EEGRCE---CLRWLEKQVPNSVLYVSFGSGGTLSQDQFNELALGLELSGKKFLWVVR 120

Query: 311 SPNDELSNASYFSVHSRNDPLSYLPEGFVERVKGR--GLLVPSWAPQTRILKHRSTGGFL 370
           +P+ E  N+ +    S N PL +LPE F+ER KG+  GL+ PSWAPQ ++L H  TGGFL
Sbjct: 121 APS-ESQNSVHLGCESDN-PLRFLPERFIERTKGKEHGLVAPSWAPQVQVLSHNVTGGFL 180

Query: 371 SHCGNNSVLESVVNGVPLIAWPLYAEQRMNAVTLTEEIKVALRPKVNEENGFVEKEEIAK 430
           +H G NS LES+VNGVPLIAWPLYAEQ MNAV LT ++KVALRPK NE+ G VE+E++AK
Sbjct: 181 THFGWNSTLESIVNGVPLIAWPLYAEQGMNAVMLTNDLKVALRPKDNEK-GLVEREQVAK 240

Query: 431 VVKSLFKGEEGKKECGCPTHSSTRAEPPTAMEEAQS--------------------QTPH 490
           V++ L + +EG+ E G    +S  A   T  EE  S                    +  H
Sbjct: 241 VIRRLMEDQEGR-EIGERMQNSKNAAAETQQEEGSSTKTLIQLGVYLLVMILIPMEKPTH 300

Query: 491 VLMMPSPGMGHLIPLIEFAKRLVL-LHRFTVTFAIPSGDAPSKAQISVLNSLPSAIDHLF 550
           ++++PSPG  HL+ LIEF+KRL+   +   VT  IP+ D+PS+   ++L +LPS I  +F
Sbjct: 301 IVIVPSPGFSHLLSLIEFSKRLIHHSNGLQVTCMIPTLDSPSEPSQAILQTLPSTIHSIF 360

Query: 551 LPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFGTVAFEVAK 610
           LP        + T     + LAV+ SLP +R+  K+I     LVA+  D F + A   AK
Sbjct: 361 LPSIHFNK-ETQTPIAVQVQLAVTHSLPFIREALKTISLSSRLVAMFADMFASDALICAK 420

Query: 611 EFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKELPDPFLD 670
           E ++  ++YFP +A TLS   ++PKLD++   E++ LTEPI +PGC PI GK+LP P  D
Sbjct: 421 ELNLLSFVYFPSSAMTLSFCFYLPKLDQTFPSEFKDLTEPIEIPGCVPIYGKDLPKPVQD 480

Query: 671 RENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPLVKVDSS 730
           R    Y+FFL+  K     +G+ +NSF  +E   I AL   G+G P +YP+GP+++    
Sbjct: 481 RTGQMYEFFLKRCKQLHETDGVLVNSFKGIEEGPIRALVEEGNGYPNVYPIGPIMQTGLG 540

Query: 731 VTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIWVVRSPSDK 790
               G E L WL+ Q   SVL+VSFGSGGTLS  QLNELA GLE+SG+KF+WVVR+PS+ 
Sbjct: 541 NLRNGSESLRWLENQVPNSVLYVSFGSGGTLSKDQLNELAFGLELSGEKFLWVVRAPSE- 600

Query: 791 EASASFFSVHSQDDPLRYLPEGFVERNR-GRGLMVPSWAPQAQILKHGSTGGFLSHCGWN 850
            A++S+ +  S DD LR+LPEGF+ER +  +GL+VPSWAPQ Q+L H +TGGFL+HCGWN
Sbjct: 601 SANSSYLNSQS-DDSLRFLPEGFIERTKEEQGLVVPSWAPQVQVLAHKATGGFLTHCGWN 660

Query: 851 STLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIAKVVKCLF 910
           STLES+++GVPLI WPL+AEQR+NA+ LT+++K ALRPK N E+G++ +EE+AKVV+ L 
Sbjct: 661 STLESIMNGVPLIVWPLFAEQRMNAVTLTDDLKVALRPKAN-ENGLVGREEVAKVVRKLI 720

Query: 911 EGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLE 922
           +GEEG+++  +M++L+ A   A  + GSS++TL++
Sbjct: 721 KGEEGREIGGRMQKLKNAAAEALEEEGSSTKTLIQ 728

BLAST of CmaCh14G020970 vs. TAIR10
Match: AT4G01070.1 (AT4G01070.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 539.3 bits (1388), Expect = 4.8e-153
Identity = 278/470 (59.15%), Positives = 351/470 (74.68%), Query Frame = 1

Query: 463 QSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPSA 522
           +S+TPHV ++PSPGMGHLIPL+EFAKRLV LH  TVTF I     PSKAQ +VL+SLPS+
Sbjct: 3   ESKTPHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSS 62

Query: 523 IDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNL-VALVVDQFGTV 582
           I  +FLPP  L DL S+T+ E+ I L V+RS P LR +F S V    L  ALVVD FGT 
Sbjct: 63  ISSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTD 122

Query: 583 AFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKEL 642
           AF+VA EF V PYI++P  A  LS  LH+PKLDE+V+ E+R LTEP+ LPGC P+ GK+ 
Sbjct: 123 AFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDF 182

Query: 643 PDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPL 702
            DP  DR++D+YK+ L   K +  AEGI +N+F ELE +AI ALQ  G   PP+YPVGPL
Sbjct: 183 LDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPL 242

Query: 703 V---KVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFI 762
           V   K ++  TEE  ECL WLD QP GSVL+VSFGSGGTL+  QLNELALGL  S Q+F+
Sbjct: 243 VNIGKQEAKQTEES-ECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFL 302

Query: 763 WVVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGG 822
           WV+RSPS   A++S+F  HSQ DPL +LP GF+ER + RG ++P WAPQAQ+L H STGG
Sbjct: 303 WVIRSPSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGG 362

Query: 823 FLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEI 882
           FL+HCGWNSTLES+VSG+PLIAWPLYAEQ++NA++L+E+I+AALRP+  ++ G++ +EE+
Sbjct: 363 FLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGDD-GLVRREEV 422

Query: 883 AKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSS 929
           A+VVK L EGEEGK VR KM+EL+ A  R   D G+S++ L  V  KW +
Sbjct: 423 ARVVKGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWKA 469

BLAST of CmaCh14G020970 vs. TAIR10
Match: AT1G01420.1 (AT1G01420.1 UDP-glucosyl transferase 72B3)

HSP 1 Score: 510.4 bits (1313), Expect = 2.4e-144
Identity = 265/471 (56.26%), Positives = 348/471 (73.89%), Query Frame = 1

Query: 462 AQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPS 521
           A   TPHV ++PSPG+GHLIPL+E AKRL+  H FTVTF IP    PSKAQ SVLNSLPS
Sbjct: 2   ADGNTPHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPS 61

Query: 522 AIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVA-LVVDQFGT 581
           +I  +FLPPA L D+PS  + ET I L V+RS P+LR+LF S+  ++ L A LVVD FGT
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGT 121

Query: 582 VAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKE 641
            AF+VA EF VSPYI++   A  L+ +LH+PKLDE+V+ E+R LTEP+ +PGC PI GK+
Sbjct: 122 DAFDVAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKD 181

Query: 642 LPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGP 701
             DP  DR+++SYK+ L  +K F  AEGI +NSF++LE + I  +Q      PP+Y +GP
Sbjct: 182 FVDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGP 241

Query: 702 LVKV---DSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKF 761
           LV     D+ V +E  +CLNWLD QP GSVL+VSFGSGGTL+  Q  ELALGL  SG++F
Sbjct: 242 LVNSGSHDADVNDE-YKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRF 301

Query: 762 IWVVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTG 821
           +WV+RSPS   AS+S+F+  S++DP  +LP+GF++R + +GL+V SWAPQAQIL H S G
Sbjct: 302 LWVIRSPSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIG 361

Query: 822 GFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEE 881
           GFL+HCGWNS+LES+V+GVPLIAWPLYAEQ++NA++L  ++ AALR ++ E+ GV+ +EE
Sbjct: 362 GFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLV-DVGAALRARLGED-GVVGREE 421

Query: 882 IAKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSS 929
           +A+VVK L EGEEG  VR KM+EL+    R   D G S+++L EV  KW +
Sbjct: 422 VARVVKGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWKA 468

BLAST of CmaCh14G020970 vs. TAIR10
Match: AT1G01390.1 (AT1G01390.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 509.2 bits (1310), Expect = 5.3e-144
Identity = 257/470 (54.68%), Positives = 343/470 (72.98%), Query Frame = 1

Query: 462 AQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPS 521
           A++ TPH+ +MPSPGMGHLIP +E AKRLV    FTVT  I    +PSKAQ SVLNSLPS
Sbjct: 2   AEANTPHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPS 61

Query: 522 AIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVA-LVVDQFGT 581
           +I  +FLPPA L D+PS  + ET  +L ++RS P+LR+LF S+ T+++L A LVVD FG 
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGA 121

Query: 582 VAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKE 641
            AF+VA +F VSPYI++   A  LS  LH+PKLD++V+ E+R LTEP+++PGC PI GK+
Sbjct: 122 DAFDVAVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKD 181

Query: 642 LPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGP 701
             D   DR +D+YK  L   K +  A+GI +NSF++LES+AI ALQ      P +YP+GP
Sbjct: 182 FLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGP 241

Query: 702 LVKVDSSVT--EEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFI 761
           LV   SS    E+   CL+WLD QP GSVL++SFGSGGTL+  Q NELA+GL  SG++FI
Sbjct: 242 LVNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFI 301

Query: 762 WVVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGG 821
           WV+RSPS+   S+S+F+ HS+ DP  +LP GF++R + +GL+VPSWAPQ QIL H ST G
Sbjct: 302 WVIRSPSE-IVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCG 361

Query: 822 FLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEI 881
           FL+HCGWNSTLES+V+GVPLIAWPL+AEQ++N ++L E++ AALR    E+ G++ +EE+
Sbjct: 362 FLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGED-GIVRREEV 421

Query: 882 AKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSS 929
            +VVK L EGEEGK +  K++EL+    R  GD G SS++  EV+ KW +
Sbjct: 422 VRVVKALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWKT 469

BLAST of CmaCh14G020970 vs. TAIR10
Match: AT3G50740.1 (AT3G50740.1 UDP-glucosyl transferase 72E1)

HSP 1 Score: 333.6 bits (854), Expect = 4.0e-91
Identity = 196/472 (41.53%), Positives = 283/472 (59.96%), Query Frame = 1

Query: 467 PHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSL---PSAI 526
           PHV M  SPGMGH+IP+IE  KRL   H F VT  +   DA S AQ   LNS     + +
Sbjct: 6   PHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAAS-AQSQFLNSPGCDAALV 65

Query: 527 DHLFLPPAPLKDLPSNTKAETIIVLAVSR-SLPSLRDLFKSIVTQRNLVALVVDQFGTVA 586
           D + LP   +  L   +    I +L + R ++P++R   + +  Q    AL+VD FG  A
Sbjct: 66  DIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEM--QHKPTALIVDLFGLDA 125

Query: 587 FEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKELP 646
             +  EF++  YI+    A  L++ L  P LD+ +  E+ +  +P+ +PGC P+  ++  
Sbjct: 126 IPLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTL 185

Query: 647 DPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQ----LSGSGNPPIYPV 706
           + FLD  +  Y+ F+     F   +GI +N++ ++E   + +LQ    L      P+YP+
Sbjct: 186 ETFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPI 245

Query: 707 GPLVK-VDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKF 766
           GPL + VD S T   V  L+WL++QP  SVL++SFGSGG+LS+ QL ELA GLEMS Q+F
Sbjct: 246 GPLSRPVDPSKTNHPV--LDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRF 305

Query: 767 IWVVRSPSDKEASASFFSVHS---QDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHG 826
           +WVVR P D  A +++ S +S   +D    YLPEGFV R   RG MV SWAPQA+IL H 
Sbjct: 306 VWVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQ 365

Query: 827 STGGFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIE 886
           + GGFL+HCGWNS LES+V GVP+IAWPL+AEQ +NA +L EE+  A+R K     GVI 
Sbjct: 366 AVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVIT 425

Query: 887 KEEIAKVVKCLFEGEEGKKVRAKMEELR-VAGERATGDGGSSSRTLLEVVQK 926
           + EI  +V+ +   EEG ++R K+++L+  A E  + DGG +  +L  +  +
Sbjct: 426 RAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRIADE 472

BLAST of CmaCh14G020970 vs. TAIR10
Match: AT2G18570.1 (AT2G18570.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 328.9 bits (842), Expect = 9.8e-90
Identity = 196/476 (41.18%), Positives = 282/476 (59.24%), Query Frame = 1

Query: 467 PHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTF-AIPSGDA-PSKAQ---------ISV 526
           PH L++ SPG+GHLIP++E   RL  +    VT  A+ SG + P++ +         I  
Sbjct: 4   PHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTICQ 63

Query: 527 LNSLPSA-IDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALV 586
           +  +PS  +D+L  P A +          T +V+ +    P++RD  K  + +R    ++
Sbjct: 64  ITEIPSVDVDNLVEPDATIF---------TKMVVKMRAMKPAVRDAVK--LMKRKPTVMI 123

Query: 587 VDQFGTVAFEVAKEFSVSP-YIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGC 646
           VD  GT    VA +  ++  Y+Y P  A  L++++++P LD  V GEY  + EP+++PGC
Sbjct: 124 VDFLGTELMSVADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGC 183

Query: 647 TPIPGKELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINAL----QLSG 706
            P+  KEL +  LDR    YK  +       +++G+ +N++ EL+ + + AL    +LS 
Sbjct: 184 KPVGPKELMETMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSR 243

Query: 707 SGNPPIYPVGPLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALG 766
               P+YP+GP+V+ +  V +       WLDEQ   SV+FV  GSGGTL+  Q  ELALG
Sbjct: 244 VMKVPVYPIGPIVRTNQHVDKPN-SIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALG 303

Query: 767 LEMSGQKFIWVVRSPSDKEASASFFSVHSQDDPL--RYLPEGFVERNRGRGLMVPSWAPQ 826
           LE+SGQ+F+WV+R P      AS+    S DD      LPEGF++R RG G++V  WAPQ
Sbjct: 304 LELSGQRFVWVLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQ 363

Query: 827 AQILKHGSTGGFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMN 886
            +IL H S GGFLSHCGW+S LESL  GVP+IAWPLYAEQ +NA +LTEEI  A+R    
Sbjct: 364 VEILSHRSIGGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSEL 423

Query: 887 EESGVIEKEEIAKVVKCLF--EGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLE 922
               VI +EE+A +V+ +   E EEG+K+RAK EE+RV+ ERA    GSS  +L E
Sbjct: 424 PSERVIGREEVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNSLFE 461

BLAST of CmaCh14G020970 vs. NCBI nr
Match: gi|828339687|ref|XP_012567789.1| (PREDICTED: uncharacterized protein LOC101504804 [Cicer arietinum])

HSP 1 Score: 852.0 bits (2200), Expect = 9.5e-244
Identity = 452/955 (47.33%), Positives = 619/955 (64.82%), Query Frame = 1

Query: 15  VHIVMLPSPGMGHLIPLLEFAKRLLTFNHHFTITFAIPSDGPPTTAQISVLCSLPPQIRH 74
           +HI ++P  G  HL P+L+F+K L+  + HF +T  IPS G   T   ++L +LP  I  
Sbjct: 5   IHIAVVPGVGYSHLNPILQFSKLLVHLHPHFHVTCFIPSLGSLPTDSKTILQTLPSNIHC 64

Query: 75  VFLPPVSLNDLPLDSRMETIITLTVARSVPSLRDLLKSMVADTQTNLVALVVDHFCIDAL 134
            FLPP+   +LPL   +E  +  TV  S+PSL  +LK++    +T  VA++VD F ++AL
Sbjct: 65  YFLPPLDPKNLPLQLPLELQLQFTVNHSLPSLHQVLKTLTL--KTPFVAMIVDSFAVEAL 124

Query: 135 DVGKEFNLSSCIFFPSTAMALSANLCLAELDEMVTGEYRDHPDLIRIPGCTPIHGRDLFE 194
           D+ KEFN+ S ++FPS    LS+   L +LD++ + EYRD P+ ++IPGC PIHGRDL  
Sbjct: 125 DLAKEFNMLSYVYFPSAVTTLSSYFHLIKLDKVTSCEYRDLPEPVKIPGCVPIHGRDLVV 184

Query: 195 PTQDRQNQAYKLFLQNAKRFRFADGIFVNSFPEFEPGAISALK---LAEPPIYPIGPVVK 254
             QDR +Q+YK  L+  +RFR  DG+ +NSF E E G I AL       P +YP+GP+++
Sbjct: 185 QAQDRLSQSYKFLLKRVERFRLVDGVIINSFLEMEIGVIRALVEEGSGNPVVYPVGPIIQ 244

Query: 255 MDENGSGEGAKCLNWLDEQPHGSVLYVSFGSGGTLSSKQTVELAMGLEMSGERFLWIVRS 314
            D    G   +CL WLD+Q   SVL+VSFGSGGTLS +Q  ELA+GLE+S  RFLW++R+
Sbjct: 245 QDTQ-QGHDLECLAWLDKQQPCSVLFVSFGSGGTLSQEQIFELALGLELSDHRFLWVMRA 304

Query: 315 PNDELSNASYFSVHSRN-DPLSYLPEGFVERVKGRGLLVPSWAPQTRILKHRSTGGFLSH 374
           P++ L+NA+Y S      DPL +LP GF++R K +GL++P WAPQ +IL H S GGFLSH
Sbjct: 305 PSN-LANAAYLSGGKDGVDPLQFLPSGFLDRTKEKGLVIPLWAPQIQILSHSSVGGFLSH 364

Query: 375 CGNNSVLESVVNGVPLIAWPLYAEQRMNAVTLTEEIKVALRPKVNEENGFVEKEEIAKVV 434
           CG NSVLESV++GVPLI WPL+AEQRMNAV L+E +KV +RP+VN ENG VE+EEI KV+
Sbjct: 365 CGWNSVLESVMHGVPLITWPLFAEQRMNAVVLSEGLKVGVRPRVN-ENGIVEREEIVKVI 424

Query: 435 KSLFKGEEG------KKECGCPTHSSTRAEPPTAMEEAQ--------------------- 494
           K L +GEEG       KE     +++ + +  +    +Q                     
Sbjct: 425 KCLMEGEEGGTMRDRMKELKNAANNAIKEDGSSIKTLSQLALKLRNLYSPWFWNVLNKVS 484

Query: 495 ------SQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFT-VTFAIPSGDAPSKAQISVL 554
                  +T H+ ++P  G GHL+P+++F K LV LH F  VT  IP+  +P  A  ++L
Sbjct: 485 FMNLDMEKTIHIAVVPGVGFGHLVPILQFTKLLVHLHPFIHVTCLIPTLGSPPSALKTIL 544

Query: 555 NSLPSAIDHLFLPPAPLKDLPSNT-KAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVV 614
            +LPS I++ FL P    DLP  T   E    L V+ SLP L    KS+  +  LVALV 
Sbjct: 545 QTLPSNINYTFLLPVDPNDLPQETLTLEMKSQLIVTLSLPYLHQALKSLALRTPLVALVA 604

Query: 615 DQFGTVAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTP 674
           D F   A   AK+F++  YIYF  AATTLS   + PKLDE  + EYR L EPI++PGC P
Sbjct: 605 DSFAVEALNFAKDFNMLSYIYFTSAATTLSFSFYFPKLDEETSCEYRDLPEPIKIPGCIP 664

Query: 675 IPGKELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPI 734
           + G +L  P  DR + +YK FL+  K    A+G+ +NSFLE+E   I AL   GSGNP +
Sbjct: 665 LHGSDLLTPAQDRSSQAYKHFLQHSKSLCFADGVLVNSFLEMEMGPIKALTEEGSGNPAV 724

Query: 735 YPVGPLVKV---DSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEM 794
           YP+GP+++      S    G ECL WLD+Q   SVL+VSFGSGGTLS  Q  ELALGLE+
Sbjct: 725 YPIGPIIQTGTKSGSDVGNGKECLTWLDKQKPCSVLYVSFGSGGTLSQEQTVELALGLEL 784

Query: 795 SGQKFIWVVRSPSDKEASASFFSVHSQD-DPLRYLPEGFVERNRGRGLMVPSWAPQAQIL 854
           S  +F+WVVR+P++  A+A++FS    D DPL++LP GF+ER + +GL++PSWAPQ QIL
Sbjct: 785 SNHRFLWVVRAPNN-SANAAYFSTQDDDVDPLKFLPSGFLERTKEKGLVIPSWAPQIQIL 844

Query: 855 KHGSTGGFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESG 914
            H S GGFLSHCGWNS LES++ GVPLI WPL+AEQR+NA +L+E +K  +RP++N E+G
Sbjct: 845 SHSSVGGFLSHCGWNSCLESVMHGVPLITWPLFAEQRMNAALLSEGLKVGVRPRVN-ENG 904

Query: 915 VIEKEEIAKVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKW 927
           ++E+ EI KV+KCL E EEG+ +   M+EL+ A   A  + G S++T+ ++  KW
Sbjct: 905 IVERVEIVKVIKCLMEEEEGRNLCNNMKELKDAAINALKENGPSTKTIYQLTLKW 952

BLAST of CmaCh14G020970 vs. NCBI nr
Match: gi|659075011|ref|XP_008437917.1| (PREDICTED: hydroquinone glucosyltransferase-like [Cucumis melo])

HSP 1 Score: 756.5 bits (1952), Expect = 5.4e-215
Identity = 372/467 (79.66%), Positives = 422/467 (90.36%), Query Frame = 1

Query: 460 EEAQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSL 519
           +E +S TPHV+MMPSPGMGHLIPL+EFAKRLVLLHRFTVTF IPSG  PSKAQISVL+SL
Sbjct: 11  QEVESPTPHVVMMPSPGMGHLIPLVEFAKRLVLLHRFTVTFVIPSGGPPSKAQISVLSSL 70

Query: 520 PSAIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFG 579
           PSAIDH+FLPP  L DLP  TKAETIIVL+V+RSLPSLRD FKS+VTQRNLVA VVDQFG
Sbjct: 71  PSAIDHVFLPPPSLNDLPPQTKAETIIVLSVTRSLPSLRDQFKSMVTQRNLVAFVVDQFG 130

Query: 580 TVAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGK 639
           T+AF++ +EF+V PY+Y PC+ATTLSLILHM +LD+SV G+Y  LTEPIRLP C+PIP K
Sbjct: 131 TIAFDLVREFNVPPYVYLPCSATTLSLILHMSELDKSVVGDYTDLTEPIRLPACSPIPAK 190

Query: 640 ELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVG 699
            LPDPFLDR++DSYK+FLE+M  F LAEGIF+NSF ELE + INAL+ S    PPI+PVG
Sbjct: 191 ALPDPFLDRKDDSYKYFLESMSRFGLAEGIFVNSFPELEPNPINALK-SEESYPPIHPVG 250

Query: 700 PLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIW 759
           P+VK+DSS +EEG+ECLNWLDEQP GSVLFVSFGSGGTLSS+Q NELA+GLEMSGQKFIW
Sbjct: 251 PIVKIDSSGSEEGIECLNWLDEQPHGSVLFVSFGSGGTLSSIQNNELAMGLEMSGQKFIW 310

Query: 760 VVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGF 819
           VVRSP DKEA+ASFFSVHS++DPL++LPEGFVERNRGRGL++PSWAPQAQIL HGSTGGF
Sbjct: 311 VVRSPHDKEANASFFSVHSENDPLQFLPEGFVERNRGRGLVLPSWAPQAQILSHGSTGGF 370

Query: 820 LSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIA 879
           LSHCGWNSTLESLV+GVPLIAWPLYAEQ++N++ILTEEIK AL+ KMNEESG+IEKEEIA
Sbjct: 371 LSHCGWNSTLESLVNGVPLIAWPLYAEQKLNSVILTEEIKVALKLKMNEESGIIEKEEIA 430

Query: 880 KVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKW 927
           KVVK LFE EEG+KVR KMEELR AGERA G+GGSSSRTLLEVVQKW
Sbjct: 431 KVVKSLFESEEGQKVREKMEELRAAGERAVGEGGSSSRTLLEVVQKW 476

BLAST of CmaCh14G020970 vs. NCBI nr
Match: gi|449432064|ref|XP_004133820.1| (PREDICTED: hydroquinone glucosyltransferase-like [Cucumis sativus])

HSP 1 Score: 745.3 bits (1923), Expect = 1.2e-211
Identity = 365/471 (77.49%), Positives = 416/471 (88.32%), Query Frame = 1

Query: 460 EEAQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSL 519
           +E +S TPHV+MM SPGMGHLIPL+EFAKRLVLLHRFTVTF IPSG  P KAQIS+L+SL
Sbjct: 11  QEFESSTPHVVMMVSPGMGHLIPLVEFAKRLVLLHRFTVTFVIPSGGPPPKAQISLLSSL 70

Query: 520 PSAIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFG 579
           PSAIDH+FLPP  L DLP  TK ETIIVL V+RSLPSLRD FKS++TQRN VA VVDQF 
Sbjct: 71  PSAIDHVFLPPVSLNDLPPQTKGETIIVLTVTRSLPSLRDQFKSMLTQRNPVAFVVDQFC 130

Query: 580 TVAFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGK 639
           T+A ++A+EF+V PY+Y PC+ATTLSL+LHMP+LD+SV GEY  LTEPI+LP C+P P K
Sbjct: 131 TIAIDLAREFNVPPYVYLPCSATTLSLVLHMPELDKSVVGEYTDLTEPIKLPACSPFPAK 190

Query: 640 ELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVG 699
            LPDPFLDR++DSYK+FLE+M  F LA+GIF+NSF ELE   INAL+L  SG PPIYPVG
Sbjct: 191 ALPDPFLDRKDDSYKYFLESMSRFGLADGIFVNSFPELEPDPINALKLEESGYPPIYPVG 250

Query: 700 PLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIW 759
           P+VK+DSS +EE +ECL WLDEQP GSVLFVSFGSGGTLSS+Q NELA+GLEMSGQKFIW
Sbjct: 251 PIVKMDSSGSEEEIECLKWLDEQPHGSVLFVSFGSGGTLSSIQNNELAMGLEMSGQKFIW 310

Query: 760 VVRSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGF 819
           VVRSP DKEA+ASFFSVHSQ+DPL++LPEGFVERN+GRGL++PSWAPQAQIL HGSTGGF
Sbjct: 311 VVRSPHDKEANASFFSVHSQNDPLKFLPEGFVERNKGRGLLLPSWAPQAQILSHGSTGGF 370

Query: 820 LSHCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIA 879
           LSHCGWNSTLESLV+GVP+IAWPLYAEQR+NA+IL EEIK AL+ KMNEESG+IEKEEIA
Sbjct: 371 LSHCGWNSTLESLVNGVPMIAWPLYAEQRLNAVILIEEIKVALKVKMNEESGIIEKEEIA 430

Query: 880 KVVKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKWSSSN 931
           KVVK LFE EEGKKVR KMEELRVAGER  G+GGSSSRT+LEVVQKW + N
Sbjct: 431 KVVKSLFESEEGKKVREKMEELRVAGERVVGEGGSSSRTVLEVVQKWRNRN 481

BLAST of CmaCh14G020970 vs. NCBI nr
Match: gi|242064612|ref|XP_002453595.1| (hypothetical protein SORBIDRAFT_04g008700 [Sorghum bicolor])

HSP 1 Score: 720.7 bits (1859), Expect = 3.3e-204
Identity = 420/984 (42.68%), Positives = 571/984 (58.03%), Query Frame = 1

Query: 9   GAQSPSVHIVMLPSPGMGHLIPLLEFAKRLLTFNHHFT----ITFA-IPSDGPPTTAQIS 68
           G +    H+V++ SPG GHL+P+ E A+RL+   HH      +TFA + +D    +A  +
Sbjct: 12  GPRPDRPHVVLVSSPGAGHLMPMAELARRLVA--HHAVAATLVTFADLSADSDAHSA--A 71

Query: 69  VLCSL-PPQIRHVFLPPVSLNDLPLDSRMETIITLTVARSVPSLRDLLKSMVADTQTNLV 128
           VL SL    +    LP V  +DLP D+R+ET++   + RS+P LR LL+ +  D+   L 
Sbjct: 72  VLSSLRAANVSTATLPAVPHDDLPADARIETVLLEVIGRSIPHLRALLRDV--DSTAPLA 131

Query: 129 ALVVDHFCIDALDVGKEFNLSSCIFFPSTAMALSANLCLAELDEMV-TGEYRDHPDLIRI 188
           ALV D FC  AL +  E  +   IFFPS    LS      E+++    GEYRD PD +++
Sbjct: 132 ALVPDFFCTAALPLASELGVPGYIFFPSNLTVLSVMRSAVEVNDGAGAGEYRDLPDPLQL 191

Query: 189 PGCTPIHGRDLFEPTQDRQNQAYKLFLQNAKRFRFADGIFVNSFPEFEPGAISALKLAE- 248
           PG   +   DL +  +D +   Y   +   +R+R A G   N+F   +P  +   K A  
Sbjct: 192 PGGVSLRREDLPDGFRDGKEPVYAHLVGEGRRYRAAAGFLANTFHGMDPATVEEFKKAAE 251

Query: 249 ----PPIYPIGPVVKMDENGSGEGAKCLNWLDEQPHGSVLYVSFGSGGTLSSKQTVELAM 308
               PP YP+GP V+   +  G  + C+ WLD QP GSV+YVSFGS GTLS +QT ELA 
Sbjct: 252 QIRFPPAYPVGPFVRSSSDEGGASSPCIEWLDRQPTGSVVYVSFGSAGTLSVEQTAELAA 311

Query: 309 GLEMSGERFLWIVRSPN------DELSNASYFSVHSRNDPLSYLPEGFVERVKGRGLLVP 368
           GLE SG RFLWIVR P+      D++   S       NDPL++LP+GF+ER +GRGL V 
Sbjct: 312 GLEDSGHRFLWIVRMPSLDGEHSDDMGRKSRGGGGDENDPLAWLPDGFLERTRGRGLAVA 371

Query: 369 SWAPQTRILKHRSTGGFLSHCGNNSVLESVVNGVPLIAWPLYAEQRMNAVTLTEEIKVAL 428
           SWAPQ R+L H +T  F+SHCG NS LESV +GVP++AWPLYAEQRMNAV L+E + VAL
Sbjct: 372 SWAPQVRVLSHPATAAFVSHCGWNSALESVTSGVPMVAWPLYAEQRMNAVVLSENVGVAL 431

Query: 429 RPKVN-EENGFVEKEEIAKVVKSLFKGEEG---KKECG----------CPTHSSTRA--- 488
           R +V  ++ G V +EEIA  V+ L +GE G   ++  G           P  SS RA   
Sbjct: 432 RLRVRPDDGGLVGREEIAAAVRELMEGEHGRAMRRRTGDLQQAADMAWAPDGSSRRALGE 491

Query: 489 ------------------EPPTAMEEAQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHR 548
                              PP  ME   SQ   V++  SPG GHLIPL+E A+RL + H 
Sbjct: 492 VVGRWKAATTTGASTEWLSPPELMENLPSQ-QQVVLFASPGAGHLIPLVELARRLAMDHG 551

Query: 549 FTVTFAIPSGDAPSKAQISVLNSLPSAIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLP 608
           F VT  + +G +      +VL+SLPS++    LP   L DLP +    T++   V RSLP
Sbjct: 552 FAVTLVMLTGMSDPANDAAVLSSLPSSVATAVLPAVSLDDLPPDVGFGTLMFELVRRSLP 611

Query: 609 SLRDLFKSIVTQRNLVALVVDQFGTVAFEVAKEFSVSPYIYFPCAATTLSLILHMPKL-D 668
            LR L      +  + ALV D FGT A  +A E     Y++FP +   +S++ H+ ++  
Sbjct: 612 HLRALMDGASGRGPVTALVCDFFGTAALPLAAELGALGYVFFPNSFAMISIMRHIVEIHG 671

Query: 669 ESVTGEYRVLTEPIRLPGCTPIPGKELPDPFLDRENDSYKFFLETMKGFVLAEGIFLNSF 728
           ++  GEYR L +P+ LPG   +   +LPD F + E+  Y + +E  + +  A+G  +NSF
Sbjct: 672 DAAPGEYRDLPDPLPLPGGPLLRHADLPDGFRESEDPVYAYLVEEARRYGRADGFLVNSF 731

Query: 729 LELESSAINALQLSGSGN--PPIYPVGPLVKVDSSVTEEGVECLNWLDEQPRGSVLFVSF 788
            ELE +  +  +        PP+YPVGP V+  S    +   CL WLD QP GSV++VSF
Sbjct: 732 EELEVAMADMFKRDAEDGAFPPVYPVGPFVRSSSGDEADESGCLEWLDRQPEGSVVYVSF 791

Query: 789 GSGGTLSSVQLNELALGLEMSGQKFIWVVRSPS-DKEASASFFSVHSQDDPLRYLPEGFV 848
           G+GG LS  Q  ELA GLEMSG +F+WVVR PS D    A       +DDPL +LPEGFV
Sbjct: 792 GTGGALSVEQTAELAAGLEMSGHRFLWVVRMPSLDGNPCALGTIPGDKDDPLAWLPEGFV 851

Query: 849 ERNRGRGLMVPSWAPQAQILKHGSTGGFLSHCGWNSTLESLVSGVPLIAWPLYAEQRVNA 908
           +R  GRGL V +WAPQ ++L H +T  F+SHCGWNSTLES+ +GVP++AWPLYAEQ+ NA
Sbjct: 852 QRTSGRGLAVVAWAPQVRVLSHPATASFVSHCGWNSTLESVAAGVPMVAWPLYAEQKTNA 911

Query: 909 IILTEEIKAALRP--KMNEESGVIEKEEIAKVVKCLFEGEEGKKVRAKMEELRVAGERAT 934
            ILTE    ALRP  + + + G++ +E IA  V+ L EGEEG  VR +  ELR A +RA 
Sbjct: 912 AILTEVTGVALRPAARGHGQYGLVTREVIAAAVRELMEGEEGSAVRGRARELREASKRAW 971

BLAST of CmaCh14G020970 vs. NCBI nr
Match: gi|343466213|gb|AEM43000.1| (UDP-glucosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 689.9 bits (1779), Expect = 6.2e-195
Identity = 345/465 (74.19%), Positives = 395/465 (84.95%), Query Frame = 1

Query: 462 AQSQTPHVLMMPSPGMGHLIPLIEFAKRLVLLHRFTVTFAIPSGDAPSKAQISVLNSLPS 521
           AQS TPHV+M+PSPGMGHLIPL+EFAKRL+ LHRFTVTFAIPSGD PSKAQIS+L+SLPS
Sbjct: 10  AQSPTPHVVMLPSPGMGHLIPLLEFAKRLLFLHRFTVTFAIPSGDPPSKAQISILSSLPS 69

Query: 522 AIDHLFLPPAPLKDLPSNTKAETIIVLAVSRSLPSLRDLFKSIVTQRNLVALVVDQFGTV 581
            ID++FLPP    DLP +TKA   IVLAV+RSLPS RDLFKS+V   NLVALVVDQFGT 
Sbjct: 70  GIDYVFLPPVNFHDLPKDTKAGVFIVLAVARSLPSFRDLFKSMVANTNLVALVVDQFGTD 129

Query: 582 AFEVAKEFSVSPYIYFPCAATTLSLILHMPKLDESVTGEYRVLTEPIRLPGCTPIPGKEL 641
           AF+VA+EF+VSPYI+FPCAA TLS +L +P+ DE+V GEYR L EPIRL GC PIPGK+L
Sbjct: 130 AFDVAREFNVSPYIFFPCAAMTLSFLLRLPEFDETVAGEYRELPEPIRLSGCAPIPGKDL 189

Query: 642 PDPFLDRENDSYKFFLETMKGFVLAEGIFLNSFLELESSAINALQLSGSGNPPIYPVGPL 701
             PF DREND+YK FL   K + LA+GIFLNSF ELE  AI AL    S  P ++PVGPL
Sbjct: 190 AGPFHDRENDAYKLFLHNAKRYALADGIFLNSFPELEPGAIKALLEEESRKPLVHPVGPL 249

Query: 702 VKVDSSVTEEGVECLNWLDEQPRGSVLFVSFGSGGTLSSVQLNELALGLEMSGQKFIWVV 761
           V++DSS +EEG ECL WL+EQP GSVLFVSFGSGG LSS Q+NELALGLEMSG +FIWVV
Sbjct: 250 VQIDSSGSEEGAECLKWLEEQPHGSVLFVSFGSGGALSSDQINELALGLEMSGHRFIWVV 309

Query: 762 RSPSDKEASASFFSVHSQDDPLRYLPEGFVERNRGRGLMVPSWAPQAQILKHGSTGGFLS 821
           RSPSD+ A+ASFFSVHSQ+DPL +LPEGF+E  RGR ++VPSWAPQAQIL H STGGFLS
Sbjct: 310 RSPSDEAANASFFSVHSQNDPLSFLPEGFLEGTRGRSVVVPSWAPQAQILSHSSTGGFLS 369

Query: 822 HCGWNSTLESLVSGVPLIAWPLYAEQRVNAIILTEEIKAALRPKMNEESGVIEKEEIAKV 881
           HCGWNSTLES+V GVPLIAWPLYAEQ++NAI+LTE+IKAALRPK+NEESG+IEKEEIA+V
Sbjct: 370 HCGWNSTLESVVYGVPLIAWPLYAEQKMNAILLTEDIKAALRPKINEESGLIEKEEIAEV 429

Query: 882 VKCLFEGEEGKKVRAKMEELRVAGERATGDGGSSSRTLLEVVQKW 927
           VK LFEGE+GK+VRAKMEEL+ A  R  G+ GSSS TL EVVQKW
Sbjct: 430 VKELFEGEDGKRVRAKMEELKDAAVRVLGEDGSSS-TLSEVVQKW 473
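
Each hit in the listings above opens with a two-line HSP header giving the bit score, raw score, E-value, and the identity and positives counts over the aligned length. As a minimal sketch, assuming that header layout holds for every hit, the statistics can be pulled out and the percentages recomputed from the counts. The helper name `parse_hsp_header` and the regular expression are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Assumed layout of the two-line HSP header used in the listings above:
#   HSP 1 Score: 689.9 bits (1779), Expect = 6.2e-195
#   Identity = 345/465 (74.19%), Postives = 395/465 (84.95%), Query Frame = 1
# "Pos\w+" accepts either spelling of the positives label.
HSP_RE = re.compile(
    r"Score: (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Pos\w+ = (?P<pos>\d+)/(?P=length) \([\d.]+%\)"
)


def parse_hsp_header(text):
    """Return the HSP statistics found in `text` as a dict (hypothetical helper)."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP header found")
    ident, pos, length = int(m["ident"]), int(m["pos"]), int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "aligned_length": length,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
    }


header = (
    "HSP 1 Score: 689.9 bits (1779), Expect = 6.2e-195\n"
    "Identity = 345/465 (74.19%), Postives = 395/465 (84.95%), Query Frame = 1"
)
stats = parse_hsp_header(header)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentages from the match counts gives a quick consistency check against the values printed in each header.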

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match Name   E-value   Identity  Description
HQGT_RAUSE   7.4e-156  58.10     Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1 SV=1
U72B1_ARATH  8.5e-152  59.15     UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1 PE=1 SV=1
U72B3_ARATH  4.2e-143  56.26     UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3 PE=2 SV=1
U72B2_ARATH  9.4e-143  54.68     UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2 PE=2 SV=1
UFOG5_MANES  4.1e-98   41.56     Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0L8M3_CUCSA  8.7e-212  77.49     Glycosyltransferase OS=Cucumis sativus GN=Csa_3G119710 PE=3 SV=1
C5XYZ7_SORBI      2.3e-204  42.68     Putative uncharacterized protein Sb04g008700 OS=Sorghum bicolor GN=Sb04g008700 P...
K7NBX4_SIRGR      4.3e-195  74.19     Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG2 PE=2 SV=1
K7NBR5_SIRGR      1.5e-192  71.61     Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG3 PE=2 SV=1
A0A0R0IWD3_SOYBN  5.9e-192  48.61     Uncharacterized protein (Fragment) OS=Glycine max GN=GLYMA_08G338900 PE=4 SV=1

Match Name   E-value   Identity  Description
AT4G01070.1  4.8e-153  59.15     UDP-Glycosyltransferase superfamily protein
AT1G01420.1  2.4e-144  56.26     UDP-glucosyl transferase 72B3
AT1G01390.1  5.3e-144  54.68     UDP-Glycosyltransferase superfamily protein
AT3G50740.1  4.0e-91   41.53     UDP-glucosyl transferase 72E1
AT2G18570.1  9.8e-90   41.18     UDP-Glycosyltransferase superfamily protein

Match Name                        E-value   Identity  Description
gi|828339687|ref|XP_012567789.1|  9.5e-244  47.33     PREDICTED: uncharacterized protein LOC101504804 [Cicer arietinum]
gi|659075011|ref|XP_008437917.1|  5.4e-215  79.66     PREDICTED: hydroquinone glucosyltransferase-like [Cucumis melo]
gi|449432064|ref|XP_004133820.1|  1.2e-211  77.49     PREDICTED: hydroquinone glucosyltransferase-like [Cucumis sativus]
gi|242064612|ref|XP_002453595.1|  3.3e-204  42.68     hypothetical protein SORBIDRAFT_04g008700 [Sorghum bicolor]
gi|343466213|gb|AEM43000.1|       6.2e-195  74.19     UDP-glucosyltransferase [Siraitia grosvenorii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002213  UDP_glucos_trans
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh14G020970.1  CmaCh14G020970.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                           Source   Source Term         Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 464..926 score: 2.1E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                           coord: 274..405 score: 2.2E-18; coord: 726..857 score: 1.6
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PROSITE  PS00375             UDPGT                                           coord: 804..847 score: -; coord: 352..395 scor
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000                                                  coord: 727..854 score: 1.3E-5; coord: 272..403 score: 2.
None       No IPR available                          PANTHER  PTHR11926:SF189     UDP-GLYCOSYLTRANSFERASE 72B2-RELATED            coord: 464..926 score: 2.1E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 467..926 score: 1.79E-113; coord: 16..440 score: 1.33E

The following gene(s) are orthologous to this gene:
Gene            Orthologue      Organism                   Block
CmaCh14G020970  CmoCh06G010510  Cucurbita moschata (Rifu)  cmacmoB271
CmaCh14G020970  Carg27349       Silver-seed gourd          carcmaB0297
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CmaCh14G020970  Cucumber (Gy14) v2            cgybcmaB318
CmaCh14G020970  Cucumber (Gy14) v2            cgybcmaB597
CmaCh14G020970  Melon (DHL92) v3.6.1          cmamedB235
CmaCh14G020970  Melon (DHL92) v3.6.1          cmamedB273
CmaCh14G020970  Silver-seed gourd             carcmaB1364
CmaCh14G020970  Silver-seed gourd             carcmaB1373
CmaCh14G020970  Cucumber (Chinese Long) v3    cmacucB0291
CmaCh14G020970  Cucumber (Chinese Long) v3    cmacucB0305
CmaCh14G020970  Watermelon (97103) v2         cmawmbB247
CmaCh14G020970  Watermelon (97103) v2         cmawmbB262
CmaCh14G020970  Watermelon (97103) v2         cmawmbB263
CmaCh14G020970  Wax gourd                     cmawgoB0308
CmaCh14G020970  Wax gourd                     cmawgoB0315
CmaCh14G020970  Cucurbita maxima (Rimu)       cmacmaB008
CmaCh14G020970  Cucurbita maxima (Rimu)       cmacmaB242
CmaCh14G020970  Cucurbita maxima (Rimu)       cmacmaB243
CmaCh14G020970  Cucurbita maxima (Rimu)       cmacmaB253
CmaCh14G020970  Cucurbita maxima (Rimu)       cmacmaB263
CmaCh14G020970  Cucurbita maxima (Rimu)       cmacmaB279
CmaCh14G020970  Cucumber (Gy14) v1            cgycmaB0709
CmaCh14G020970  Cucurbita moschata (Rifu)     cmacmoB245
CmaCh14G020970  Cucurbita moschata (Rifu)     cmacmoB255
CmaCh14G020970  Cucurbita moschata (Rifu)     cmacmoB260
CmaCh14G020970  Wild cucumber (PI 183967)     cmacpiB255
CmaCh14G020970  Cucumber (Chinese Long) v2    cmacuB253
CmaCh14G020970  Melon (DHL92) v3.5.1          cmameB242
CmaCh14G020970  Watermelon (Charleston Gray)  cmawcgB233
CmaCh14G020970  Watermelon (97103) v1         cmawmB242
CmaCh14G020970  Cucurbita pepo (Zucchini)     cmacpeB253
CmaCh14G020970  Cucurbita pepo (Zucchini)     cmacpeB283
CmaCh14G020970  Cucurbita pepo (Zucchini)     cmacpeB291
CmaCh14G020970  Cucurbita pepo (Zucchini)     cmacpeB292
CmaCh14G020970  Bottle gourd (USVL1VR-Ls)     cmalsiB226
CmaCh14G020970  Bottle gourd (USVL1VR-Ls)     cmalsiB245