Tan0003950 (gene) Snake gourd v1

Overview
NameTan0003950
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGlycosyltransferase
LocationLG01: 12992600 .. 12995229 (-)
RNA-Seq ExpressionTan0003950
SyntenyTan0003950
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GATCAATACACCATTTAATTTATCAACTTCATTATAGTTTAATTAGTGAATATGTCGTTGGCCAAGTGATTGAAAGTCTCGATTGTCACTTTTGAATTTTTTTTTTTTTTTAAATAGTTAATACACATCCCATACTTTGGCTTTATATACATGCATATACAAGCTACAGTTTGTTCACTCTTAATTATTATCGCTTCTTCATCTTCTTCTCCATGGACTCCCAAACTCACGTCGCTCTCATTTCTAGCCCTGGAATGGGCCATCTCTTCCCCTGTCTCGAGCTCGCCACGCGCCTCTCCACGTGCCACCACCTCACAGTCACTGTCTTCCTCGTCACCTCCCACTCCTCCTCCGCCGAAAACAAAATCATCTCCGCCGCTGAGGCCACCGGCCTTTTCACAGTCGTCGAACTCCCCCCTGCCGACATGTCCGACGTCACCGACTCCACCGTTGTCGGCCGCCTCGCCATCACCATGCGCTACCACATCCCGGCCCTCCGCTCCGCTGTCTCCGCCCTCACCTCTCGCCCCTCCGTCCTCATCGCCGACATCTTCGCCACGGAGTCCTTCGTCGTCGCCGATGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAATGCATGGTTCTTAGCCTTGACCGTTTACGCCCAGGTTCTCGACAAGCAAATTGTCGGCCAGTACGTGGACCAGAAAGAACCGCTCCAAATTCCCGGATGCGAACCGGTCCGGCCCTGCGACGTCGTGGACCCGATGCTGGATCGGACCGAGCCCCAGTATTACGAATACGTCAAAGTCGGGATGGGAATAGCGTCGAGCGACGGCGTTTTGGTTAACACGTGGGACGAGTTGCAAGGTCGCACGCTCGCATCGTTCAAAGATCGAAGTCTGTTGGGGCGAGTGATGAAGCCGCCGGTTTACTCGATCGGACCGATCGTTCGACCATCGGGTTCGGGGAAAGGCGGTTCGAGCGAGCTGACGTTCAACTGGTTGAGGAAGCAGCCCGTAGAGTCGGTGATATATGTGTCGTTCGGGAGCGGTGGAACGCTGTCGTTTGAGCAAATGACAGAAGTGGCTCACGGATTGGAAATGAGTCGCCAGAGATTTGTTTGGGTGGTTCGAACGCCCAAGGTCAGGATTAAGACATTAAATAATTAATCTTCTTTAATAATTTAATTAGTTTCCGATCTTATGTTTTCTTTTTTAAATTATGTTTGTTCTAATATCTAATCTAATAGGTTATTAAACTTTAAAACATATTTCACGCTTTCAATTTTACATGTAATAGGTCTTTTACACTTTTAAAATTTTAAAATTAATGGATCTATTAGACATAAAATTGAATTTTGTATTTAAAAATATATTAAACTTCTAATTTCGTGTCTCAATAATTTTTTAAAATGTCGAGTAGGTCAATTATCTATTAGACAAATTCATGAGTAGATCAATTATCTATTAAACACAAAATTAAAAGTTAAAACACTTAAAAATACAAAATAAATTACAAATCGATGTAGATTTATTATTCAGGATATAGGGAATGTCAGATCAAATCACAAAAAAACAAGTATACTGGTTAGTATGTCATCAATCTCACTTAAATTTCAAGTTTAAATTAATGTCATATGCCTCCCACCAAAGTTTGAATATTTAGGAGTTGTTTGAGGCGCTAAATGAATTATAATAACATGATATTATAATAATCTGTGGGTTATTATACTATGTGAAACATATAACATTATTTAAAATGCAGAGTAATATAGTCTAAGATTATAATATGAGGTAGAGTATTTCACATGTAATAATAACGAGTGGCCAAACATACCCTTAAAAAGTTGAAAGTTACTTCTTTTTTTTTTTTTTACATCAGGTGAGATCGGATGCGGCGTTTTTCACAACGGGGGATGGGAGTGAGGAGCAATCAGAGGCGAGATTTTTGCCGAAGGGGTTTTTGGAGCGGACGAGCGAGGTGGGGTTTGTGGTGTCAATGTGGGCGGACCAGACGGCTGTGTTGGGAAGTCCGGCGGTGGGGGGATTTTTCACACACGGCGGATGGAACTCGTCATTGGAAGGGATTACGAACGGAGTTCCGATGATAGTGTGGCCGTTGTACGCGGAGCAGCGACTGAACGCCACCATGCTGGCAGAGGAGGTGGGAGTGGCGGTCCGACCAAAGGAACTGCCGACGAAGGCAGTGATCGGAAGGGAGGAGATTGCGGCGATGGTAAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCGAAGGCTAAGGAACTTCAACGGAGTGCAGAAAAGGCCTCTACCACAGGTGGCTCGTCGTACGAGAACTTTGCTCGAGTTGTGAAATTGTTTGGCCGTGATGGATAAGTTTTATTTATTTTTAAATCTATATATAATATAATATAGGTTTAAATTACAAGTTTAGTCCATAAATTTTGAGTTTTGTGTTGTGTCAAATAGGTCATTGACTTAGTTGACGTTTTTTTAAGATTCATGAACTTACTAGCTACAAAATTAAAAACTTAACTTCTATTGGGCAGAAAATTCAATTTTATATAAAATAGATCGATTAATTTTAAAAAAATTGTGAATATGTTATAACTTATTAAAAATTCAAG

mRNA sequence

GATCAATACACCATTTAATTTATCAACTTCATTATAGTTTAATTAGTGAATATGTCGTTGGCCAAGTGATTGAAAGTCTCGATTGTCACTTTTGAATTTTTTTTTTTTTTTAAATAGTTAATACACATCCCATACTTTGGCTTTATATACATGCATATACAAGCTACAGTTTGTTCACTCTTAATTATTATCGCTTCTTCATCTTCTTCTCCATGGACTCCCAAACTCACGTCGCTCTCATTTCTAGCCCTGGAATGGGCCATCTCTTCCCCTGTCTCGAGCTCGCCACGCGCCTCTCCACGTGCCACCACCTCACAGTCACTGTCTTCCTCGTCACCTCCCACTCCTCCTCCGCCGAAAACAAAATCATCTCCGCCGCTGAGGCCACCGGCCTTTTCACAGTCGTCGAACTCCCCCCTGCCGACATGTCCGACGTCACCGACTCCACCGTTGTCGGCCGCCTCGCCATCACCATGCGCTACCACATCCCGGCCCTCCGCTCCGCTGTCTCCGCCCTCACCTCTCGCCCCTCCGTCCTCATCGCCGACATCTTCGCCACGGAGTCCTTCGTCGTCGCCGATGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAATGCATGGTTCTTAGCCTTGACCGTTTACGCCCAGGTTCTCGACAAGCAAATTGTCGGCCAGTACGTGGACCAGAAAGAACCGCTCCAAATTCCCGGATGCGAACCGGTCCGGCCCTGCGACGTCGTGGACCCGATGCTGGATCGGACCGAGCCCCAGTATTACGAATACGTCAAAGTCGGGATGGGAATAGCGTCGAGCGACGGCGTTTTGGTTAACACGTGGGACGAGTTGCAAGGTCGCACGCTCGCATCGTTCAAAGATCGAAGTCTGTTGGGGCGAGTGATGAAGCCGCCGGTTTACTCGATCGGACCGATCGTTCGACCATCGGGTTCGGGGAAAGGCGGTTCGAGCGAGCTGACGTTCAACTGGTTGAGGAAGCAGCCCGTAGAGTCGGTGATATATGTGTCGTTCGGGAGCGGTGGAACGCTGTCGTTTGAGCAAATGACAGAAGTGGCTCACGGATTGGAAATGAGTCGCCAGAGATTTGTTTGGGTGGTTCGAACGCCCAAGGTGAGATCGGATGCGGCGTTTTTCACAACGGGGGATGGGAGTGAGGAGCAATCAGAGGCGAGATTTTTGCCGAAGGGGTTTTTGGAGCGGACGAGCGAGGTGGGGTTTGTGGTGTCAATGTGGGCGGACCAGACGGCTGTGTTGGGAAGTCCGGCGGTGGGGGGATTTTTCACACACGGCGGATGGAACTCGTCATTGGAAGGGATTACGAACGGAGTTCCGATGATAGTGTGGCCGTTGTACGCGGAGCAGCGACTGAACGCCACCATGCTGGCAGAGGAGGTGGGAGTGGCGGTCCGACCAAAGGAACTGCCGACGAAGGCAGTGATCGGAAGGGAGGAGATTGCGGCGATGGTAAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCGAAGGCTAAGGAACTTCAACGGAGTGCAGAAAAGGCCTCTACCACAGGTGGCTCGTCGTACGAGAACTTTGCTCGAGTTGTGAAATTGTTTGGCCGTGATGGATAAGTTTTATTTATTTTTAAATCTATATATAATATAATATAGGTTTAAATTACAAGTTTAGTCCATAAATTTTGAGTTTTGTGTTGTGTCAAATAGGTCATTGACTTAGTTGACGTTTTTTTAAGATTCATGAACTTACTAGCTACAAAATTAAAAACTTAACTTCTATTGGGCAGAAAATTCAATTTTATATAAAATAGATCGATTAATTTTAAAAAAATTGTGAATATGTTATAACTTATTAAAAATTCAAG

Coding sequence (CDS)

ATGGACTCCCAAACTCACGTCGCTCTCATTTCTAGCCCTGGAATGGGCCATCTCTTCCCCTGTCTCGAGCTCGCCACGCGCCTCTCCACGTGCCACCACCTCACAGTCACTGTCTTCCTCGTCACCTCCCACTCCTCCTCCGCCGAAAACAAAATCATCTCCGCCGCTGAGGCCACCGGCCTTTTCACAGTCGTCGAACTCCCCCCTGCCGACATGTCCGACGTCACCGACTCCACCGTTGTCGGCCGCCTCGCCATCACCATGCGCTACCACATCCCGGCCCTCCGCTCCGCTGTCTCCGCCCTCACCTCTCGCCCCTCCGTCCTCATCGCCGACATCTTCGCCACGGAGTCCTTCGTCGTCGCCGATGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAATGCATGGTTCTTAGCCTTGACCGTTTACGCCCAGGTTCTCGACAAGCAAATTGTCGGCCAGTACGTGGACCAGAAAGAACCGCTCCAAATTCCCGGATGCGAACCGGTCCGGCCCTGCGACGTCGTGGACCCGATGCTGGATCGGACCGAGCCCCAGTATTACGAATACGTCAAAGTCGGGATGGGAATAGCGTCGAGCGACGGCGTTTTGGTTAACACGTGGGACGAGTTGCAAGGTCGCACGCTCGCATCGTTCAAAGATCGAAGTCTGTTGGGGCGAGTGATGAAGCCGCCGGTTTACTCGATCGGACCGATCGTTCGACCATCGGGTTCGGGGAAAGGCGGTTCGAGCGAGCTGACGTTCAACTGGTTGAGGAAGCAGCCCGTAGAGTCGGTGATATATGTGTCGTTCGGGAGCGGTGGAACGCTGTCGTTTGAGCAAATGACAGAAGTGGCTCACGGATTGGAAATGAGTCGCCAGAGATTTGTTTGGGTGGTTCGAACGCCCAAGGTGAGATCGGATGCGGCGTTTTTCACAACGGGGGATGGGAGTGAGGAGCAATCAGAGGCGAGATTTTTGCCGAAGGGGTTTTTGGAGCGGACGAGCGAGGTGGGGTTTGTGGTGTCAATGTGGGCGGACCAGACGGCTGTGTTGGGAAGTCCGGCGGTGGGGGGATTTTTCACACACGGCGGATGGAACTCGTCATTGGAAGGGATTACGAACGGAGTTCCGATGATAGTGTGGCCGTTGTACGCGGAGCAGCGACTGAACGCCACCATGCTGGCAGAGGAGGTGGGAGTGGCGGTCCGACCAAAGGAACTGCCGACGAAGGCAGTGATCGGAAGGGAGGAGATTGCGGCGATGGTAAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCGAAGGCTAAGGAACTTCAACGGAGTGCAGAAAAGGCCTCTACCACAGGTGGCTCGTCGTACGAGAACTTTGCTCGAGTTGTGAAATTGTTTGGCCGTGATGGATAA

Protein sequence

MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATGLFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFVVADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDPMLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGPIVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGRDG
Homology
BLAST of Tan0003950 vs. ExPASy Swiss-Prot
Match: Q40287 (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 2.1e-133
Identity = 248/464 (53.45%), Postives = 333/464 (71.77%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           ++S+ H+ L+SSPG+GHL P LEL  R+ T  +  VT+F+V S +S+AE +++ +A    
Sbjct: 6   LNSKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPK 65

Query: 61  LFTVVELPPADMSDVTD--STVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATES 120
           L  +++LPP ++S + D  +TV  RL + MR   PA R+AVSAL  RP+ +I D+F TES
Sbjct: 66  LCEIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIVDLFGTES 125

Query: 121 FVVADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVV 180
             VA E  +AKYV++ASNAWFLALT+Y  +LDK++ G++V QKEP++IPGC PVR  +VV
Sbjct: 126 LEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVV 185

Query: 181 DPMLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSI 240
           DPMLDRT  QY EY ++G+ I ++DG+L+NTW+ L+  T  + +D   LGRV K PV+ I
Sbjct: 186 DPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVFPI 245

Query: 241 GPIVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRF 300
           GP+ R +G   G + EL  +WL +QP ESV+YVSFGSGGTLS EQM E+A GLE S+QRF
Sbjct: 246 GPLRRQAGP-CGSNCEL-LDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRF 305

Query: 301 VWVVRTPKVRS-DAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSP 360
           +WVVR P V++ DAAFFT GDG+++ S   + P+GFL R   VG VV  W+ Q  ++  P
Sbjct: 306 IWVVRQPTVKTGDAAFFTQGDGADDMS--GYFPEGFLTRIQNVGLVVPQWSPQIHIMSHP 365

Query: 361 AVGGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIG 420
           +VG F +H GWNS LE IT GVP+I WP+YAEQR+NAT+L EE+GVAVRPK LP K V+ 
Sbjct: 366 SVGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVK 425

Query: 421 REEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSY 462
           REEI  M+R+IM   DEEG  IR + +EL+ S EKA   GGSS+
Sbjct: 426 REEIERMIRRIMV--DEEGSEIRKRVRELKDSGEKALNEGGSSF 463

BLAST of Tan0003950 vs. ExPASy Swiss-Prot
Match: Q9ZU72 (UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 1.8e-121
Identity = 235/464 (50.65%), Postives = 321/464 (69.18%), Query Frame = 0

Query: 4   QTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSS-AENKIISAAEATGLF 63
           Q H  L++SPG+GHL P LEL  RLS+  ++ VT+  VTS SSS  E + I AA A  + 
Sbjct: 3   QPHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTIC 62

Query: 64  TVVELPPADMSDVT--DSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 123
            + E+P  D+ ++   D+T+  ++ + MR   PA+R AV  +  +P+V+I D   TE   
Sbjct: 63  QITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMS 122

Query: 124 VADEFHM-AKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVD 183
           VAD+  M AKYV+V ++AWFLA+ VY  VLD  + G+YVD KEPL+IPGC+PV P ++++
Sbjct: 123 VADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELME 182

Query: 184 PMLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIG 243
            MLDR+  QY E V+ G+ +  SDGVLVNTW+ELQG TLA+ ++   L RVMK PVY IG
Sbjct: 183 TMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIG 242

Query: 244 PIVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFV 303
           PIVR +      +S   F WL +Q   SV++V  GSGGTL+FEQ  E+A GLE+S QRFV
Sbjct: 243 PIVRTNQHVDKPNS--IFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFV 302

Query: 304 WVVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAV 363
           WV+R P      A +     S+++  +  LP+GFL+RT  VG VV+ WA Q  +L   ++
Sbjct: 303 WVLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSI 362

Query: 364 GGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGRE 423
           GGF +H GW+S+LE +T GVP+I WPLYAEQ +NAT+L EE+GVAVR  ELP++ VIGRE
Sbjct: 363 GGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGRE 422

Query: 424 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYEN 464
           E+A++VRKIMAEEDEEG+ IRAKA+E++ S+E+A +  GSSY +
Sbjct: 423 EVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of Tan0003950 vs. ExPASy Swiss-Prot
Match: Q94A84 (UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 3.2e-110
Identity = 213/470 (45.32%), Postives = 309/470 (65.74%), Query Frame = 0

Query: 3   SQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEA-TGL 62
           ++ HVA+ +SPGMGH+ P +EL  RL+  H   VT+F++ + ++SA+++ +++      L
Sbjct: 4   TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63

Query: 63  FTVVELPPADMSDVTD-STVVG-RLAITMRYHIPALRSAVSALTSRPSVLIADIFATESF 122
             +V LP  D+S + D S   G +L + MR  IP +RS +  +  +P+ LI D+F  ++ 
Sbjct: 64  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123

Query: 123 VVADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVD 182
            +  EF+M  Y+F+ASNA FLA+ ++   LDK +  +++ +K+P+ +PGCEPVR  D ++
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183

Query: 183 PMLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIG 242
             LD     Y E+V  G    + DG++VNTWD+++ +TL S +D  LLGR+   PVY IG
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIG 243

Query: 243 PIVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFV 302
           P+ RP    K  ++    +WL KQP ESV+Y+SFGSGG+LS +Q+TE+A GLEMS+QRFV
Sbjct: 244 PLSRPVDPSK--TNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFV 303

Query: 303 WVVRTPKVRSD-AAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPA 362
           WVVR P   S  +A+ +   G        +LP+GF+ RT E GF+VS WA Q  +L   A
Sbjct: 304 WVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQA 363

Query: 363 VGGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGR 422
           VGGF TH GWNS LE +  GVPMI WPL+AEQ +NAT+L EE+GVAVR K+LP++ VI R
Sbjct: 364 VGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITR 423

Query: 423 EEIAAMVRKIMAEEDEEGKAIRAKAKEL-QRSAEKASTTGGSSYENFARV 468
            EI A+VRKIM E  EEG  +R K K+L + +AE  S  GG ++E+ +R+
Sbjct: 424 AEIEALVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of Tan0003950 vs. ExPASy Swiss-Prot
Match: O81498 (UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 1.2e-107
Identity = 208/472 (44.07%), Postives = 316/472 (66.95%), Query Frame = 0

Query: 3   SQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATGLF 62
           ++ H A+ SSPGMGH+ P +ELA RLS  H   VTVF++ + ++S ++K+++   +TG+ 
Sbjct: 4   TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLLN---STGV- 63

Query: 63  TVVELPPADMSDVTD--STVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 122
            +V LP  D+S + D  + VV ++ + MR  +P LRS + A+   P+ LI D+F T++  
Sbjct: 64  DIVNLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 182
           +A E +M  YVF+ASNA +L +++Y   LD+ I  ++  Q++PL IPGCEPVR  D++D 
Sbjct: 124 LAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDA 183

Query: 183 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 242
            L   EP Y++ V+  +    +DG+LVNTW+E++ ++L S +D  LLGRV + PVY +GP
Sbjct: 184 YLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGP 243

Query: 243 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 302
           + RP  S    +    F+WL KQP ESV+Y+SFGSGG+L+ +Q+TE+A GLE S+QRF+W
Sbjct: 244 LCRPIQSST--TDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIW 303

Query: 303 VVRTPKVRSDAA-FFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VVR P   S  + +F+   G  + +   +LP+GF+ RT + GF++  WA Q  +L   AV
Sbjct: 304 VVRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAV 363

Query: 363 GGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGRE 422
           GGF TH GW+S+LE +  GVPMI WPL+AEQ +NA +L++E+G++VR  +   K  I R 
Sbjct: 364 GGFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRS 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTT--GGSSYENFARVVK 470
           +I AMVRK+MAE  +EG+ +R K K+L+ +AE + +   GGS++E+  RV K
Sbjct: 424 KIEAMVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of Tan0003950 vs. ExPASy Swiss-Prot
Match: Q9LVR1 (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 4.4e-107
Identity = 212/472 (44.92%), Postives = 313/472 (66.31%), Query Frame = 0

Query: 3   SQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATGLF 62
           ++ H A+ SSPGMGH+ P +EL  RLS  +   VTVF++ + ++SA++K ++   +TG+ 
Sbjct: 4   TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFLN---STGV- 63

Query: 63  TVVELPPADMSDVT--DSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 122
            +V+LP  D+  +   D  VV ++ + MR  +PALRS ++A+  +P+ LI D+F T++  
Sbjct: 64  DIVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 182
           +A EF+M  YVF+ +NA FL +++Y   LDK I  ++  Q+ PL IPGCEPVR  D +D 
Sbjct: 124 LAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDA 183

Query: 183 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 242
            L   EP Y ++V+ G+    +DG+LVNTW+E++ ++L S  +  LLGRV + PVY IGP
Sbjct: 184 YLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGP 243

Query: 243 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 302
           + RP  S +  +     +WL +QP ESV+Y+SFGSGG LS +Q+TE+A GLE S+QRFVW
Sbjct: 244 LCRPIQSSE--TDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVW 303

Query: 303 VVRTPKVRSDAA-FFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VVR P   S  + + +   G  E +   +LP+GF+ RTS+ GFVV  WA Q  +L   AV
Sbjct: 304 VVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAV 363

Query: 363 GGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGRE 422
           GGF TH GW+S+LE +  GVPMI WPL+AEQ +NA +L++E+G+AVR  +   K  I R 
Sbjct: 364 GGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRW 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTT--GGSSYENFARVVK 470
           +I A+VRK+M E  +EG+A+R K K+L+ SAE + +   GG ++E+  RV K
Sbjct: 424 KIEALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of Tan0003950 vs. NCBI nr
Match: XP_038880693.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida])

HSP 1 Score: 838.2 bits (2164), Expect = 3.6e-239
Identity = 427/472 (90.47%), Postives = 447/472 (94.70%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDSQTHVALISSPGMGHLFP LELATRLST HHLTVTVF+V SHSS+AENK+I+AAEA G
Sbjct: 1   MDSQTHVALISSPGMGHLFPSLELATRLSTRHHLTVTVFIVPSHSSNAENKVIAAAEAAG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTDS+VVGRLAITMR H+P LRSAVSALTS PSVLIADIFATESF 
Sbjct: 61  LFTVVELPPADMSDVTDSSVVGRLAITMRRHVPILRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALTVYAQV DKQIVGQYVDQKEPLQIPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTVYAQVWDKQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           +LDRT+PQY+E VKVGMGIAS DGVLVNTWD+LQGRTLASF+DR+LLG++MKPPVYSIGP
Sbjct: 181 LLDRTQPQYFEIVKVGMGIASCDGVLVNTWDDLQGRTLASFRDRNLLGKIMKPPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR SGS KGGSSEL FNWL KQP ESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW
Sbjct: 241 IVRQSGSKKGGSSEL-FNWLSKQPTESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR PKVRSD A+FTTGDGSEEQS  +FLP+GFLERTSEVGFVVSMWADQTAVLGSPAVG
Sbjct: 301 VVRAPKVRSDGAYFTTGDGSEEQSAGKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNS+LEGITNGVPM+VWPLYAEQRLNATMLAEEV VAVRPKELPTKAVIGREE
Sbjct: 361 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEVRVAVRPKELPTKAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFG 473
           IAAMVRKIMAEEDEEGKAIRAKAKELQRSAE AS   GSSYENFARVVKLFG
Sbjct: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAENASAEDGSSYENFARVVKLFG 471

BLAST of Tan0003950 vs. NCBI nr
Match: XP_023550260.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 833.2 bits (2151), Expect = 1.1e-237
Identity = 425/475 (89.47%), Postives = 447/475 (94.11%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDS THVALISSPGMGHLFP LELATRLST HHLT+TVFLVTSHSSSAEN +++AAEATG
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTDSTVVGRLAITMR H+PALRSA+SALTSRPS LIADIF+TE+F 
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALT+YAQVLDKQIVGQYVDQK+PLQIPGCEPVRPCDVVDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKKPLQIPGCEPVRPCDVVDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           MLDRTE QYYEYVKVGM IASS GVLVN+WDELQGRTLASFKDRSLLGRVMK PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKVGMAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMKAPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR  GSGK GSSEL FNWLRKQP +SVIYVSFGSGGTLSFEQMTE+AHGLE+SRQRFVW
Sbjct: 241 IVRHFGSGKDGSSEL-FNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR P VRSDA FFTTGDGSE+QSEAR+LP+GFLERTSEVGF+VSMWA+QTAVLGSPAVG
Sbjct: 301 VVRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNSSLEGIT GVPMIVWPLYAEQR+NATMLA+E+GVAVRPKELP  AVIGREE
Sbjct: 361 GFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGRDG 476
           IAAMVRKIMAEEDEEGKAIR K KELQRSAEKA   GGSSYENFARVVKLFGR G
Sbjct: 421 IAAMVRKIMAEEDEEGKAIRTKVKELQRSAEKACAQGGSSYENFARVVKLFGRTG 474

BLAST of Tan0003950 vs. NCBI nr
Match: XP_022941908.1 (anthocyanidin 3-O-glucosyltransferase 5-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 828.9 bits (2140), Expect = 2.2e-236
Identity = 423/475 (89.05%), Postives = 446/475 (93.89%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDS THVALISSPGMGHLFP LELATRLST HHLT+TVFLVTSHSSSAEN +++AAEATG
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTDSTVVGRLAITMR H+PALRSA+SALTSRPS LIADIF+TE+F 
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALT+YAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           MLDRTE QYYEYVK+G  IASS GVLVN+WDELQGRTLASFKDRSLLGRVM  PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR  GSGK GSSEL FNWLRKQP +SVIYVSFGSGGTLSFEQMTE+AHGLE+SRQRFVW
Sbjct: 241 IVRHFGSGKDGSSEL-FNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR P VRSDA FFTTGDGSE+QSEAR+LP+GFLERTSEVGF+VSMWA+QTAVLGSPAVG
Sbjct: 301 VVRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNSSLEGIT GVPMIVWPLYAEQR+NATMLA+E+GVAVRPKELP  AVIGREE
Sbjct: 361 GFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGRDG 476
           IAAMVRKIMAEEDEEG+AIRAKA ELQRSAEKA   GGSSYENFARVVKLFGR G
Sbjct: 421 IAAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVKLFGRTG 474

BLAST of Tan0003950 vs. NCBI nr
Match: XP_022941909.1 (anthocyanidin 3-O-glucosyltransferase 5-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 827.4 bits (2136), Expect = 6.3e-236
Identity = 422/473 (89.22%), Postives = 445/473 (94.08%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDS THVALISSPGMGHLFP LELATRLST HHLT+TVFLVTSHSSSAEN +++AAEATG
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTDSTVVGRLAITMR H+PALRSA+SALTSRPS LIADIF+TE+F 
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALT+YAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           MLDRTE QYYEYVK+G  IASS GVLVN+WDELQGRTLASFKDRSLLGRVM  PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR  GSGK GSSEL FNWLRKQP +SVIYVSFGSGGTLSFEQMTE+AHGLE+SRQRFVW
Sbjct: 241 IVRHFGSGKDGSSEL-FNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR P VRSDA FFTTGDGSE+QSEAR+LP+GFLERTSEVGF+VSMWA+QTAVLGSPAVG
Sbjct: 301 VVRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNSSLEGIT GVPMIVWPLYAEQR+NATMLA+E+GVAVRPKELP  AVIGREE
Sbjct: 361 GFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGR 474
           IAAMVRKIMAEEDEEG+AIRAKA ELQRSAEKA   GGSSYENFARVVKLFGR
Sbjct: 421 IAAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVKLFGR 472

BLAST of Tan0003950 vs. NCBI nr
Match: XP_022980430.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita maxima])

HSP 1 Score: 817.0 bits (2109), Expect = 8.5e-233
Identity = 417/473 (88.16%), Postives = 442/473 (93.45%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDS THVALISSPGMGHLFP LELATRLST HHLT+TVFLVTSHSSSAEN +++AAEA G
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEAAG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTD TVVGRLAITMR H+PALRSA+SALTSRPS LIADIF+TE+F 
Sbjct: 61  LFTVVELPPADMSDVTDFTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALT+YAQVLDKQIVGQYVDQK+PLQIPGCEPVRPCDVVDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKKPLQIPGCEPVRPCDVVDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           MLDRTE QYYEYVKVGM IASS GVLVN+WDELQGR LASFKDRSLLGRVMK PVYSIGP
Sbjct: 181 MLDRTEFQYYEYVKVGMAIASSHGVLVNSWDELQGRALASFKDRSLLGRVMKAPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR  GSGK GSSEL FNWLRKQP +SVIYVSFGSGGTLSFEQMTE+AHGLE+SRQRFVW
Sbjct: 241 IVRHFGSGKDGSSEL-FNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR P VRSDA FFT GDGS++QS+ARFLP+GFLERTSEVGF+VSMWA+QTAVLGSPAVG
Sbjct: 301 VVRPPTVRSDAMFFTIGDGSDDQSKARFLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNSSLEGIT GVPMIVWPLYAEQR+NATMLA+E+GVAVRPKELP  AVIGREE
Sbjct: 361 GFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGR 474
           IAAMVRKIMA EDEEGKAIRAK +ELQRSAEKA   GGSSY+NFARVVKLFGR
Sbjct: 421 IAAMVRKIMAAEDEEGKAIRAKVEELQRSAEKACAQGGSSYQNFARVVKLFGR 472

BLAST of Tan0003950 vs. ExPASy TrEMBL
Match: A0A6J1FTD7 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1)

HSP 1 Score: 828.9 bits (2140), Expect = 1.0e-236
Identity = 423/475 (89.05%), Postives = 446/475 (93.89%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDS THVALISSPGMGHLFP LELATRLST HHLT+TVFLVTSHSSSAEN +++AAEATG
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTDSTVVGRLAITMR H+PALRSA+SALTSRPS LIADIF+TE+F 
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALT+YAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           MLDRTE QYYEYVK+G  IASS GVLVN+WDELQGRTLASFKDRSLLGRVM  PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR  GSGK GSSEL FNWLRKQP +SVIYVSFGSGGTLSFEQMTE+AHGLE+SRQRFVW
Sbjct: 241 IVRHFGSGKDGSSEL-FNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR P VRSDA FFTTGDGSE+QSEAR+LP+GFLERTSEVGF+VSMWA+QTAVLGSPAVG
Sbjct: 301 VVRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNSSLEGIT GVPMIVWPLYAEQR+NATMLA+E+GVAVRPKELP  AVIGREE
Sbjct: 361 GFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGRDG 476
           IAAMVRKIMAEEDEEG+AIRAKA ELQRSAEKA   GGSSYENFARVVKLFGR G
Sbjct: 421 IAAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVKLFGRTG 474

BLAST of Tan0003950 vs. ExPASy TrEMBL
Match: A0A6J1FPT8 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1)

HSP 1 Score: 827.4 bits (2136), Expect = 3.0e-236
Identity = 422/473 (89.22%), Postives = 445/473 (94.08%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDS THVALISSPGMGHLFP LELATRLST HHLT+TVFLVTSHSSSAEN +++AAEATG
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTDSTVVGRLAITMR H+PALRSA+SALTSRPS LIADIF+TE+F 
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALT+YAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           MLDRTE QYYEYVK+G  IASS GVLVN+WDELQGRTLASFKDRSLLGRVM  PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR  GSGK GSSEL FNWLRKQP +SVIYVSFGSGGTLSFEQMTE+AHGLE+SRQRFVW
Sbjct: 241 IVRHFGSGKDGSSEL-FNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR P VRSDA FFTTGDGSE+QSEAR+LP+GFLERTSEVGF+VSMWA+QTAVLGSPAVG
Sbjct: 301 VVRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNSSLEGIT GVPMIVWPLYAEQR+NATMLA+E+GVAVRPKELP  AVIGREE
Sbjct: 361 GFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGR 474
           IAAMVRKIMAEEDEEG+AIRAKA ELQRSAEKA   GGSSYENFARVVKLFGR
Sbjct: 421 IAAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVKLFGR 472

BLAST of Tan0003950 vs. ExPASy TrEMBL
Match: A0A6J1IW98 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111479797 PE=3 SV=1)

HSP 1 Score: 817.0 bits (2109), Expect = 4.1e-233
Identity = 417/473 (88.16%), Postives = 442/473 (93.45%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           MDS THVALISSPGMGHLFP LELATRLST HHLT+TVFLVTSHSSSAEN +++AAEA G
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEAAG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTVVELPPADMSDVTD TVVGRLAITMR H+PALRSA+SALTSRPS LIADIF+TE+F 
Sbjct: 61  LFTVVELPPADMSDVTDFTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLALT+YAQVLDKQIVGQYVDQK+PLQIPGCEPVRPCDVVDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKKPLQIPGCEPVRPCDVVDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           MLDRTE QYYEYVKVGM IASS GVLVN+WDELQGR LASFKDRSLLGRVMK PVYSIGP
Sbjct: 181 MLDRTEFQYYEYVKVGMAIASSHGVLVNSWDELQGRALASFKDRSLLGRVMKAPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR  GSGK GSSEL FNWLRKQP +SVIYVSFGSGGTLSFEQMTE+AHGLE+SRQRFVW
Sbjct: 241 IVRHFGSGKDGSSEL-FNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR P VRSDA FFT GDGS++QS+ARFLP+GFLERTSEVGF+VSMWA+QTAVLGSPAVG
Sbjct: 301 VVRPPTVRSDAMFFTIGDGSDDQSKARFLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNSSLEGIT GVPMIVWPLYAEQR+NATMLA+E+GVAVRPKELP  AVIGREE
Sbjct: 361 GFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLFGR 474
           IAAMVRKIMA EDEEGKAIRAK +ELQRSAEKA   GGSSY+NFARVVKLFGR
Sbjct: 421 IAAMVRKIMAAEDEEGKAIRAKVEELQRSAEKACAQGGSSYQNFARVVKLFGR 472

BLAST of Tan0003950 vs. ExPASy TrEMBL
Match: A0A6J1EP22 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1)

HSP 1 Score: 798.1 bits (2060), Expect = 2.0e-227
Identity = 403/469 (85.93%), Postives = 433/469 (92.32%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           M+SQ HVAL+SSPGMGHLFP LELATRLS  HHL+VTVF+V S SSSAENK+I+AA+A G
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTV+ELPPADMSDVT+S VVGRL ITMR H+PALRSAVS LT+ PSVLIADIFATESF 
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWFLA T+Y  VLDKQI GQYVDQKEPL IPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           +LDRTEPQY+EYV++GM I SSDGVLVNTWD+LQGRTLASF+DR+LLGR+M  PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMNSPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR +G  KGGSSEL FNWL KQP ESVIYVSFGSGGTLS EQMTEVAHGLEMS QRFVW
Sbjct: 241 IVRQTGGKKGGSSEL-FNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR PKVRSDA FFTTGDGSE+QSEA+FLP GFLERTSEVGFVVSMWADQTAVLGSPAVG
Sbjct: 301 VVRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNS+LEGITNGVPM+VWPLYAEQR+NATMLAEEV VAVRPKELPTKAVIGREE
Sbjct: 361 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVK 470
           IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEK++  GGSS+ENFARVVK
Sbjct: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468

BLAST of Tan0003950 vs. ExPASy TrEMBL
Match: A0A6J1J726 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 2.9e-226
Identity = 403/471 (85.56%), Postives = 435/471 (92.36%), Query Frame = 0

Query: 1   MDSQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATG 60
           M+SQ HVALISSPGMGHLFP LELATRLS  HHL+VTVF+V S SSSAE K+I+AA+A G
Sbjct: 1   MESQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAG 60

Query: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 120
           LFTV+ELPPADMSDVTDSTVVGRL+ITMR H+PALRSAVSALTS PSVLIADIFATESF 
Sbjct: 61  LFTVIELPPADMSDVTDSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180
           VADEFHMAKYVFVASNAWF ALT+Y  VLDKQI GQYVDQKEP  IPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 180

Query: 181 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 240
           +LDRTEPQY+EYV++G  I SSDGVLVNTWD+L+GRTLASF+D +LLGR+MK PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 240

Query: 241 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 300
           IVR +G  KGG+SEL FNWL KQP ESVIYVSFGSGGTLS EQMTEVAHGLEMS QRFVW
Sbjct: 241 IVRQTGGKKGGASEL-FNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 300

Query: 301 VVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAVG 360
           VVR PKVRSDA FFTTGDG+E+QSEA+FLP GFLERTSEVGFVVSMWADQTAVLGSPAVG
Sbjct: 301 VVRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 360

Query: 361 GFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGREE 420
           GFFTHGGWNS+LEGITNGVPM+VWPLYAEQR+NATMLAEEV VAVRPKELPTKAVIGREE
Sbjct: 361 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 420

Query: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYENFARVVKLF 472
           IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEK++  GGSS+ENFARVVKL+
Sbjct: 421 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKLW 470

BLAST of Tan0003950 vs. TAIR 10
Match: AT2G18570.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 437.6 bits (1124), Expect = 1.3e-122
Identity = 235/464 (50.65%), Postives = 321/464 (69.18%), Query Frame = 0

Query: 4   QTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSS-AENKIISAAEATGLF 63
           Q H  L++SPG+GHL P LEL  RLS+  ++ VT+  VTS SSS  E + I AA A  + 
Sbjct: 3   QPHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTIC 62

Query: 64  TVVELPPADMSDVT--DSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 123
            + E+P  D+ ++   D+T+  ++ + MR   PA+R AV  +  +P+V+I D   TE   
Sbjct: 63  QITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMS 122

Query: 124 VADEFHM-AKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVD 183
           VAD+  M AKYV+V ++AWFLA+ VY  VLD  + G+YVD KEPL+IPGC+PV P ++++
Sbjct: 123 VADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELME 182

Query: 184 PMLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIG 243
            MLDR+  QY E V+ G+ +  SDGVLVNTW+ELQG TLA+ ++   L RVMK PVY IG
Sbjct: 183 TMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIG 242

Query: 244 PIVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFV 303
           PIVR +      +S   F WL +Q   SV++V  GSGGTL+FEQ  E+A GLE+S QRFV
Sbjct: 243 PIVRTNQHVDKPNS--IFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFV 302

Query: 304 WVVRTPKVRSDAAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAV 363
           WV+R P      A +     S+++  +  LP+GFL+RT  VG VV+ WA Q  +L   ++
Sbjct: 303 WVLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSI 362

Query: 364 GGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGRE 423
           GGF +H GW+S+LE +T GVP+I WPLYAEQ +NAT+L EE+GVAVR  ELP++ VIGRE
Sbjct: 363 GGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGRE 422

Query: 424 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTTGGSSYEN 464
           E+A++VRKIMAEEDEEG+ IRAKA+E++ S+E+A +  GSSY +
Sbjct: 423 EVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of Tan0003950 vs. TAIR 10
Match: AT3G50740.1 (UDP-glucosyl transferase 72E1 )

HSP 1 Score: 400.2 bits (1027), Expect = 2.3e-111
Identity = 213/470 (45.32%), Postives = 309/470 (65.74%), Query Frame = 0

Query: 3   SQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEA-TGL 62
           ++ HVA+ +SPGMGH+ P +EL  RL+  H   VT+F++ + ++SA+++ +++      L
Sbjct: 4   TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63

Query: 63  FTVVELPPADMSDVTD-STVVG-RLAITMRYHIPALRSAVSALTSRPSVLIADIFATESF 122
             +V LP  D+S + D S   G +L + MR  IP +RS +  +  +P+ LI D+F  ++ 
Sbjct: 64  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123

Query: 123 VVADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVD 182
            +  EF+M  Y+F+ASNA FLA+ ++   LDK +  +++ +K+P+ +PGCEPVR  D ++
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183

Query: 183 PMLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIG 242
             LD     Y E+V  G    + DG++VNTWD+++ +TL S +D  LLGR+   PVY IG
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIG 243

Query: 243 PIVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFV 302
           P+ RP    K  ++    +WL KQP ESV+Y+SFGSGG+LS +Q+TE+A GLEMS+QRFV
Sbjct: 244 PLSRPVDPSK--TNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFV 303

Query: 303 WVVRTPKVRSD-AAFFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPA 362
           WVVR P   S  +A+ +   G        +LP+GF+ RT E GF+VS WA Q  +L   A
Sbjct: 304 WVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQA 363

Query: 363 VGGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGR 422
           VGGF TH GWNS LE +  GVPMI WPL+AEQ +NAT+L EE+GVAVR K+LP++ VI R
Sbjct: 364 VGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITR 423

Query: 423 EEIAAMVRKIMAEEDEEGKAIRAKAKEL-QRSAEKASTTGGSSYENFARV 468
            EI A+VRKIM E  EEG  +R K K+L + +AE  S  GG ++E+ +R+
Sbjct: 424 AEIEALVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of Tan0003950 vs. TAIR 10
Match: AT5G26310.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 391.7 bits (1005), Expect = 8.2e-109
Identity = 208/472 (44.07%), Postives = 316/472 (66.95%), Query Frame = 0

Query: 3   SQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATGLF 62
           ++ H A+ SSPGMGH+ P +ELA RLS  H   VTVF++ + ++S ++K+++   +TG+ 
Sbjct: 4   TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLLN---STGV- 63

Query: 63  TVVELPPADMSDVTD--STVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 122
            +V LP  D+S + D  + VV ++ + MR  +P LRS + A+   P+ LI D+F T++  
Sbjct: 64  DIVNLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 182
           +A E +M  YVF+ASNA +L +++Y   LD+ I  ++  Q++PL IPGCEPVR  D++D 
Sbjct: 124 LAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDA 183

Query: 183 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 242
            L   EP Y++ V+  +    +DG+LVNTW+E++ ++L S +D  LLGRV + PVY +GP
Sbjct: 184 YLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGP 243

Query: 243 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 302
           + RP  S    +    F+WL KQP ESV+Y+SFGSGG+L+ +Q+TE+A GLE S+QRF+W
Sbjct: 244 LCRPIQSST--TDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIW 303

Query: 303 VVRTPKVRSDAA-FFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VVR P   S  + +F+   G  + +   +LP+GF+ RT + GF++  WA Q  +L   AV
Sbjct: 304 VVRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAV 363

Query: 363 GGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGRE 422
           GGF TH GW+S+LE +  GVPMI WPL+AEQ +NA +L++E+G++VR  +   K  I R 
Sbjct: 364 GGFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRS 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTT--GGSSYENFARVVK 470
           +I AMVRK+MAE  +EG+ +R K K+L+ +AE + +   GGS++E+  RV K
Sbjct: 424 KIEAMVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of Tan0003950 vs. TAIR 10
Match: AT5G66690.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 389.8 bits (1000), Expect = 3.1e-108
Identity = 212/472 (44.92%), Postives = 313/472 (66.31%), Query Frame = 0

Query: 3   SQTHVALISSPGMGHLFPCLELATRLSTCHHLTVTVFLVTSHSSSAENKIISAAEATGLF 62
           ++ H A+ SSPGMGH+ P +EL  RLS  +   VTVF++ + ++SA++K ++   +TG+ 
Sbjct: 4   TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFLN---STGV- 63

Query: 63  TVVELPPADMSDVT--DSTVVGRLAITMRYHIPALRSAVSALTSRPSVLIADIFATESFV 122
            +V+LP  D+  +   D  VV ++ + MR  +PALRS ++A+  +P+ LI D+F T++  
Sbjct: 64  DIVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTVYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 182
           +A EF+M  YVF+ +NA FL +++Y   LDK I  ++  Q+ PL IPGCEPVR  D +D 
Sbjct: 124 LAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDA 183

Query: 183 MLDRTEPQYYEYVKVGMGIASSDGVLVNTWDELQGRTLASFKDRSLLGRVMKPPVYSIGP 242
            L   EP Y ++V+ G+    +DG+LVNTW+E++ ++L S  +  LLGRV + PVY IGP
Sbjct: 184 YLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGP 243

Query: 243 IVRPSGSGKGGSSELTFNWLRKQPVESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVW 302
           + RP  S +  +     +WL +QP ESV+Y+SFGSGG LS +Q+TE+A GLE S+QRFVW
Sbjct: 244 LCRPIQSSE--TDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVW 303

Query: 303 VVRTPKVRSDAA-FFTTGDGSEEQSEARFLPKGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VVR P   S  + + +   G  E +   +LP+GF+ RTS+ GFVV  WA Q  +L   AV
Sbjct: 304 VVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAV 363

Query: 363 GGFFTHGGWNSSLEGITNGVPMIVWPLYAEQRLNATMLAEEVGVAVRPKELPTKAVIGRE 422
           GGF TH GW+S+LE +  GVPMI WPL+AEQ +NA +L++E+G+AVR  +   K  I R 
Sbjct: 364 GGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRW 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKASTT--GGSSYENFARVVK 470
           +I A+VRK+M E  +EG+A+R K K+L+ SAE + +   GG ++E+  RV K
Sbjct: 424 KIEALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of Tan0003950 vs. TAIR 10
Match: AT2G18560.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 369.8 bits (948), Expect = 3.3e-102
Identity = 192/385 (49.87%), Postives = 265/385 (68.83%), Query Frame = 0

Query: 88  MRYHIPALRSAVSALTSRPSVLIADIFATESFVVADEFHMAKYVFVASNAWFLALTVYAQ 147
           MR     +R AV ++  +P+V+I D F T    + D    +KYV++ S+AWFLAL VY  
Sbjct: 1   MREMKSTVRDAVKSMKQKPTVMIVDFFGTALLSITDVGVTSKYVYIPSHAWFLALIVYLP 60

Query: 148 VLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDPMLDRTEPQYYEYVKVGMGIASSDGVLV 207
           VLDK + G+YVD KEP++IPGC+PV P +++D MLDR++ QY + V++G+ I  SDGVLV
Sbjct: 61  VLDKVMEGEYVDIKEPMKIPGCKPVGPKELLDTMLDRSDQQYRDCVQIGLEIPMSDGVLV 120

Query: 208 NTWDELQGRTLASFKDRSLLGRVMKPPVYSIGPIVRPSGSGKGGSSELTFNWLRKQPVES 267
           NTW ELQG+TLA+ ++   L RV+K PVY IGPIVR +   +  +S  TF WL KQ   S
Sbjct: 121 NTWGELQGKTLAALREDIDLNRVIKVPVYPIGPIVRTNVLIEKPNS--TFEWLDKQEERS 180

Query: 268 VIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRTPKVRSDAAFFTTGDGSEEQSEAR 327
           V+YV  GSGGTLSFEQ  E+A GLE+S Q F+WV+R P        +      ++   + 
Sbjct: 181 VVYVCLGSGGTLSFEQTMELAWGLELSCQSFLWVLRKP------PSYLGASSKDDDQVSD 240

Query: 328 FLPKGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFTHGGWNSSLEGITNGVPMIVWPLY 387
            LP+GFL+RT  VG VV+ WA Q  +L   ++GGF +H GW+S LE +T GVP+I WPLY
Sbjct: 241 GLPEGFLDRTRGVGLVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIIAWPLY 300

Query: 388 AEQRLNATMLAEEVGVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQ 447
           AEQ +NAT+L EE+G+A+R  ELP+K VI REE+A++V+KI+AEED+EG+ I+ KA+E++
Sbjct: 301 AEQWMNATLLTEEIGMAIRTSELPSKKVISREEVASLVKKIVAEEDKEGRKIKTKAEEVR 360

Query: 448 RSAEKASTTGGSSYENFARVVKLFG 473
            S+E+A T GGSS+ +     K  G
Sbjct: 361 VSSERAWTHGGSSHSSLFEWAKRCG 377

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q402872.1e-13353.45Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2... [more]
Q9ZU721.8e-12150.65UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=... [more]
Q94A843.2e-11045.32UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=... [more]
O814981.2e-10744.07UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=... [more]
Q9LVR14.4e-10744.92UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=... [more]
Match NameE-valueIdentityDescription
XP_038880693.13.6e-23990.47anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida][more]
XP_023550260.11.1e-23789.47anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita pepo subsp. pepo][more]
XP_022941908.12.2e-23689.05anthocyanidin 3-O-glucosyltransferase 5-like isoform X1 [Cucurbita moschata][more]
XP_022941909.16.3e-23689.22anthocyanidin 3-O-glucosyltransferase 5-like isoform X2 [Cucurbita moschata][more]
XP_022980430.18.5e-23388.16anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1FTD71.0e-23689.05Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1[more]
A0A6J1FPT83.0e-23689.22Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1[more]
A0A6J1IW984.1e-23388.16Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111479797 PE=3 SV=1[more]
A0A6J1EP222.0e-22785.93Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1[more]
A0A6J1J7262.9e-22685.56Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G18570.11.3e-12250.65UDP-Glycosyltransferase superfamily protein [more]
AT3G50740.12.3e-11145.32UDP-glucosyl transferase 72E1 [more]
AT5G26310.18.2e-10944.07UDP-Glycosyltransferase superfamily protein [more]
AT5G66690.13.1e-10844.92UDP-Glycosyltransferase superfamily protein [more]
AT2G18560.13.3e-10249.87UDP-Glycosyltransferase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 429..449
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 6..460
e-value: 9.5E-133
score: 445.4
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 254..455
e-value: 9.5E-133
score: 445.4
NoneNo IPR availablePANTHERPTHR48046:SF2GLYCOSYLTRANSFERASEcoord: 2..469
NoneNo IPR availablePANTHERPTHR48046UDP-GLYCOSYLTRANSFERASE 72E1coord: 2..469
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 5..470
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 266..406
e-value: 4.0E-19
score: 68.8
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 6..454
e-value: 1.79609E-63
score: 209.33
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 347..390

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0003950.1Tan0003950.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008194 UDP-glycosyltransferase activity