Tan0018419 (gene) Snake gourd v1

Overview
NameTan0018419
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGlycosyltransferase
LocationLG02: 91400318 .. 91404886 (+)
RNA-Seq ExpressionTan0018419
SyntenyTan0018419
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGAGACGACGGCGAACGGAGGAGAGAAGACGAAACAGAATCACGTAATCATCTTCCCTTTCCCAAGGCACGGCCACATGAGTCCAATTCTCCAATTCGCCAAACGATTAATCTCCAAAGGCCTTATCGTCACTTTCCTCACCGCTTCCTCTGCAACTCAATCCCTAATTATCAATCTCCCTCCCTCTCCCTCTCTCCGTCTCAAAATCATCTCCGATATCCCCGAATCCAACGACATCGCTACTCTCGACGCCTATCTTCGAGCCTTCAGAGCCGCCGTTACCAAATCCTTGGCCAATTTCATCGACGAAACCCTAATTTCAAGTTCCGATGAGGTTCGTCCTAGTCTCATTGTTTACGACTCTGTTATGCCTTGGGTGCAGAGCATCGCTGCAGAGCGAGGTCTCGATGCGGCGCCGTTTTTCACTCAATCGGCCGCTGTTAATCACATTCTCGATCTCGTCTATGGCGGATCTTTGAGGCTTCCGCCGCCGGAGAATGTGGCAGTTTCGCTTCCGGCGTTGCCGATGGATCTTCAGCCGGAGGATCTGCCGGCCTTCCCTGACGATACTGAAGTGGTAGTGAAGTTCATGACCAGTCAGTTCTCGAATTTGGAGAAAGTGAAGTGGATTTTCTTCAACACGTTTGATCACCTCGAGTGCAAGGTAATTGATGCTTGTTGAATTGAATCTGATTTTTTTTTTCTCTCTTTTTGAGCAATTGAATGAGATTTATCTTTGATCAAAAAAAAAAAAATCAATCTCTAATGTTGTTGGAGTAAAAGAAAAACTAGTTAATTACTTGCGTTATAAAATATTTTGTAAAAATAATATCAAGCTAAAAATTCCCTATGAGTATATTTTTTTTAGTACATTAATAATAGGGATTGGGTGAATTCGAATTCATGACCTCTTAGGGTTGTTTGGAGTATATGTTATAATAACATGTGGAATAATTTGCACCCCAAACACAGACTATTATAACCGTTGATTTACTACTCTACATTTTAAATAATATTATATGATTCCATAGACTATAGTAACAAACTATTATAACTCATGTTATTGTAAGGTTAAAATACCATTTTGGTCTTCGTACTTTGATAATTGTTCTATTTTGGTCCCTGTACTTTCAAATGTCCAATTTTATTCCATGTATTTTCAATAAAACTTAAAATTGATCCCTACCATTAGTTTATTGTTAATTTTTTTTTAAAAAAATAGTCTCTTTTAGTTTTTTCCCTACAAAATTTGAAAGTATGTTTACATTGTGTCTTTTCTTGTATTGAAATTATTATTATTTTGTAACTAATTTTGACAAAAATTAACTTCAAACTTATGGTAGGTACCAATTTTAAGTTTTATTAAAAGTACAGGAACTAAAATTAGACATTTGAAAGTACAAGGACCAAAATAAAACAATTCTCAAAGTACAAGGACCAAAATGGTATTTTAATCTTATTATAATTCACTCAGTGCCCCGAACAACCCCTTAGTCATAAGTATTACTCGATGTCAGTTGAGTTATACTCTTGTTGGCGACTATAAGTAGATTAAATATGTTAATATGTGCCTTAACTCAATCATAATTGATATCTATTCATTGTTTGAGGTTGGGATTTAAATTATCATATCCTATATGTTGTACTAAAAAAGTGATAATAGGTTAAATATATTACTTATATATATTTTATTATTTTATTTTGTTATATTAGTGATATATTTTATATTAAATGTATTGATGATATATAGTAAAAAAATGTTACAATAAGTTAAATTAAAATTTTAGTTTAGATTTAGTTCTGGAAGGATCTATTAAACACAGAATTGATTAGATATGCAATCAAACCTATATATAATAAGTTAATTCATTTTTTTAAAAAAAATGAATTTTTTAAGGACTTATTAGATACAAAATTGAAAATTTAGGTCTCGTTTGATAACCATTTGGTTTTTTATTTTTAGTTTTTGAAAATCAAGTCTATAAAAATACTACTTTGGTCCATCACTTTCCATGTTTTGTTTTCTTATCTACTTTTTAGATATGTTTTCAAAAACTAAGTCAAGTTTTGAAAACTAAACAAAGCGCGCAGTTTCCAAAAACTTGTTTTTGTTTTTGGAATTTAGCTAAGAGTACAAGTGTTTTCTTAATAAAAATGGAAACCATTGTATGTAAAATGAAAATCATTATAGAGAAGTCGTGAGAAAACATGTACGGTTTTCAAAAACCAAACATGATTATCAAATTGTGGGACTAAACTTGTAATTTAATCATTTAATTTTGTCTTCCTTAGTATTTTTCAATACAATTTGGTTAATTTTTTTTTAAAAAAAATAAAACAGTTTTTTAGTAGAACAAATGTAGAGTATGAGAATTTAAACTTCAAACTTCGAGAGAATGATCGAATATATGTCAATTGTCTTTGGAAAAATACTTGGTTGCATGTCAGTGCATAGGTATTTTCGTACACGCCACATGAGAGAGGAAGGTGGAAATTGAGAAATAGGAACTTTTTTACTTTGCTTAGTTGTAAAAGATTTAATAGCCATGCGTAAGAATACTACACAATAATATGCAGCCAATTATTGTTCTACCCTTGTGTTTTGCTCACGTTGACAAAGAAATATCTACAACTTAGGTTAAATTGCAAGTTTAATCTTTGAACTTTGAGACTGTGTATATTTGGTCCTTTAATTTTCAAAAGTGTCAAATAAATCCCTAAACTTTTAATTTTGTGTCTAATAAATTCTTAAACTTTCAATTTTGTATTTAATAGGTTCCTGAGAAATTTTAATTTTCTTAAAATTAATTGATCTATTAGATATAAATTTAAATTTTACGTCTAATGGAACTCTAAAATTTCAATTTTGTGTCTAGTAAATCTGTGAATTTTACAAAATTTAAAAATAAAAAAACTAAGGTTTCGTTTGATAACCATTTGATTTTTGGTTTAAGCTTAAAAATACAACTTCTACCAATGAGTTTCTATGTTTTTTTATCTACTTTGTACCTATATTTTCAAAAATCAAGCTAAGTTTTGAAAAACTAAAAAAAAAATAGTTTTCAAAATTTTGTGTGTTTTTGAAATTTGGCTAAAAACTTAGATAGTGTTGAAGATACCACATTCGAAAAATGTAGGGTGACTTCACAACTTATAAGATACATGAGTTATTCTTCTCATTGTCAATTAATTTTGAAATGAAATCTCATGTAATCTAATATGGCATCAAAAATGGTAAATAAAAAAATGTACTATATTGAGAAGCATATTGAGGATTTCACATTATTTATACAAAAAACCGACACTTCAGTTGGAGCGGTCGGAAAAGTGCTATTCCGACCGACCGATTTATATTCACATGATTCACGAGAAAACCATCCACGTGAATCCTTCTCTCCTTTCCCACCAGTTTTTCTTAATCTCTCTCATTTCTTCCCTTTCCAAAATGTCGAGTTATGAATTAGCAGTATATTGAAAAGATTTTAATTACACTCCTGGGATAAACTATAGTGGAAATTATGAATGGTTTGCTTATGTGGGATTACAATGAATTTTCACGATTATTGATAATAAGAACTTAATCTCAATTGAAAGTTTGTAATTGAAAGGCATAGGTTCTCGAAAGGTTATAACTTCAAAGTTTCTCAATTCATATGTTGTTAAATTTTAAAAAAAAAAAAATGTACTCAAATAATAATAATCACGAAAATATTTGGACTTGGATTTTGATTGTGTTGTATTCGCAATTATTTTTTCTGCTACTTTTGCAATATTTTATGTAAATTACGCCAGAATTTCAAATATCCTTGATTATTTCACTAATATCAAACAATATTAGGTTGTTAATTGGATGGCCAATAGATTGCCCATCAAGACAATTGGACCGACCATTCCATCGGCATATTTGGACGGTCGCTTGAAGGATGACAAAGCCTATGGTTTGAATGTCTTAAGACTCGATGATGGGAAGAAGCCTATGCAATGGTTAGACTCAAAAGAAACAGGTTCAGTTGTTTATATTTCATTTGGAAGTTTGGTTATCTTATTTGAAGAACAAGTAAAGGAACTGACATGTTTCCTTAAAGACACCAATCTTTCATTTTTATGGGTCCTAAGAGAATCAGAACTGAAAAAACTTCCTAACAACTTTGAACAAGAGACATCAGAACGAGGCCTAATTGTCAACTGGTGTTGTCAACTAGAAGTTCTGTCTCATAAGGCTGTAAGTTGTTTTGTGACTCATTGTGGTTGGAATTCGACGTTAGAAGCTTTGAGCTTGGGAGTGCCGATGGTTGCAATTCCGCAATGGGTCGATCAAACGACGAACGCTAAGTTCGTTGCTAATGTTTGGGAAGTCGGAGTTAGAGTGAAGAAGAATGATAAAGGCATTGCAACAAAGGAAGAACTAGAAGCCTCCATCTTAATGATTGTTCAAGGAGAGAGGTCAAATAAGTTTAAGAATAACTCAATCAAGTGGAAGAAACTAGCTAAAGATGCAGTAGATGAAGGAGGCAGTTCTGATAAAAACATTGAAGAATTTGTCAAAGCGATTGCTTCAATCTAA

mRNA sequence

ATGGAGGAGACGACGGCGAACGGAGGAGAGAAGACGAAACAGAATCACGTAATCATCTTCCCTTTCCCAAGGCACGGCCACATGAGTCCAATTCTCCAATTCGCCAAACGATTAATCTCCAAAGGCCTTATCGTCACTTTCCTCACCGCTTCCTCTGCAACTCAATCCCTAATTATCAATCTCCCTCCCTCTCCCTCTCTCCGTCTCAAAATCATCTCCGATATCCCCGAATCCAACGACATCGCTACTCTCGACGCCTATCTTCGAGCCTTCAGAGCCGCCGTTACCAAATCCTTGGCCAATTTCATCGACGAAACCCTAATTTCAAGTTCCGATGAGGTTCGTCCTAGTCTCATTGTTTACGACTCTGTTATGCCTTGGGTGCAGAGCATCGCTGCAGAGCGAGGTCTCGATGCGGCGCCGTTTTTCACTCAATCGGCCGCTGTTAATCACATTCTCGATCTCGTCTATGGCGGATCTTTGAGGCTTCCGCCGCCGGAGAATGTGGCAGTTTCGCTTCCGGCGTTGCCGATGGATCTTCAGCCGGAGGATCTGCCGGCCTTCCCTGACGATACTGAAGTGGTAGTGAAGTTCATGACCAGTCAGTTCTCGAATTTGGAGAAAGTGAAGTGGATTTTCTTCAACACGTTTGATCACCTCGAGTGCAAGGTTGTTAATTGGATGGCCAATAGATTGCCCATCAAGACAATTGGACCGACCATTCCATCGGCATATTTGGACGGTCGCTTGAAGGATGACAAAGCCTATGGTTTGAATGTCTTAAGACTCGATGATGGGAAGAAGCCTATGCAATGGTTAGACTCAAAAGAAACAGGTTCAGTTGTTTATATTTCATTTGGAAGTTTGGTTATCTTATTTGAAGAACAAGTAAAGGAACTGACATGTTTCCTTAAAGACACCAATCTTTCATTTTTATGGGTCCTAAGAGAATCAGAACTGAAAAAACTTCCTAACAACTTTGAACAAGAGACATCAGAACGAGGCCTAATTGTCAACTGGTGTTGTCAACTAGAAGTTCTGTCTCATAAGGCTGTAAGTTGTTTTGTGACTCATTGTGGTTGGAATTCGACGTTAGAAGCTTTGAGCTTGGGAGTGCCGATGGTTGCAATTCCGCAATGGGTCGATCAAACGACGAACGCTAAGTTCGTTGCTAATGTTTGGGAAGTCGGAGTTAGAGTGAAGAAGAATGATAAAGGCATTGCAACAAAGGAAGAACTAGAAGCCTCCATCTTAATGATTGTTCAAGGAGAGAGGTCAAATAAGTTTAAGAATAACTCAATCAAGTGGAAGAAACTAGCTAAAGATGCAGTAGATGAAGGAGGCAGTTCTGATAAAAACATTGAAGAATTTGTCAAAGCGATTGCTTCAATCTAA

Coding sequence (CDS)

ATGGAGGAGACGACGGCGAACGGAGGAGAGAAGACGAAACAGAATCACGTAATCATCTTCCCTTTCCCAAGGCACGGCCACATGAGTCCAATTCTCCAATTCGCCAAACGATTAATCTCCAAAGGCCTTATCGTCACTTTCCTCACCGCTTCCTCTGCAACTCAATCCCTAATTATCAATCTCCCTCCCTCTCCCTCTCTCCGTCTCAAAATCATCTCCGATATCCCCGAATCCAACGACATCGCTACTCTCGACGCCTATCTTCGAGCCTTCAGAGCCGCCGTTACCAAATCCTTGGCCAATTTCATCGACGAAACCCTAATTTCAAGTTCCGATGAGGTTCGTCCTAGTCTCATTGTTTACGACTCTGTTATGCCTTGGGTGCAGAGCATCGCTGCAGAGCGAGGTCTCGATGCGGCGCCGTTTTTCACTCAATCGGCCGCTGTTAATCACATTCTCGATCTCGTCTATGGCGGATCTTTGAGGCTTCCGCCGCCGGAGAATGTGGCAGTTTCGCTTCCGGCGTTGCCGATGGATCTTCAGCCGGAGGATCTGCCGGCCTTCCCTGACGATACTGAAGTGGTAGTGAAGTTCATGACCAGTCAGTTCTCGAATTTGGAGAAAGTGAAGTGGATTTTCTTCAACACGTTTGATCACCTCGAGTGCAAGGTTGTTAATTGGATGGCCAATAGATTGCCCATCAAGACAATTGGACCGACCATTCCATCGGCATATTTGGACGGTCGCTTGAAGGATGACAAAGCCTATGGTTTGAATGTCTTAAGACTCGATGATGGGAAGAAGCCTATGCAATGGTTAGACTCAAAAGAAACAGGTTCAGTTGTTTATATTTCATTTGGAAGTTTGGTTATCTTATTTGAAGAACAAGTAAAGGAACTGACATGTTTCCTTAAAGACACCAATCTTTCATTTTTATGGGTCCTAAGAGAATCAGAACTGAAAAAACTTCCTAACAACTTTGAACAAGAGACATCAGAACGAGGCCTAATTGTCAACTGGTGTTGTCAACTAGAAGTTCTGTCTCATAAGGCTGTAAGTTGTTTTGTGACTCATTGTGGTTGGAATTCGACGTTAGAAGCTTTGAGCTTGGGAGTGCCGATGGTTGCAATTCCGCAATGGGTCGATCAAACGACGAACGCTAAGTTCGTTGCTAATGTTTGGGAAGTCGGAGTTAGAGTGAAGAAGAATGATAAAGGCATTGCAACAAAGGAAGAACTAGAAGCCTCCATCTTAATGATTGTTCAAGGAGAGAGGTCAAATAAGTTTAAGAATAACTCAATCAAGTGGAAGAAACTAGCTAAAGATGCAGTAGATGAAGGAGGCAGTTCTGATAAAAACATTGAAGAATTTGTCAAAGCGATTGCTTCAATCTAA

Protein sequence

MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDLQPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPTIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKELTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIASI
Homology
BLAST of Tan0018419 vs. ExPASy Swiss-Prot
Match: K7NBW3 (Mogroside IE synthase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC1 PE=1 SV=1)

HSP 1 Score: 460.3 bits (1183), Expect = 2.6e-128
Identity = 226/448 (50.45%), Postives = 329/448 (73.44%), Query Frame = 0

Query: 16  HVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLKIISDI 75
           H+++FPFP  GH++P+LQ +KRLI+KG+ V+ +T    +  L +    S S+++++ISD 
Sbjct: 7   HILVFPFPSQGHINPLLQLSKRLIAKGIKVSLVTTLHVSNHLQLQGAYSNSVKIEVISDG 66

Query: 76  PESN-DIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVMPWVQSIAAE 135
            E   +  T+   L  FR  +TK+L +F+ + ++SS+    P  I+YDS MPWV  +A E
Sbjct: 67  SEDRLETDTMRQTLDRFRQKMTKNLEDFLQKAMVSSNP---PKFILYDSTMPWVLEVAKE 126

Query: 136 RGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDLQPEDLPAF---PDD 195
            GLD APF+TQS A+N I   V  G L+L PPE   +SLP++P+ L+P DLPA+   P  
Sbjct: 127 FGLDRAPFYTQSCALNSINYHVLHGQLKL-PPETPTISLPSMPL-LRPSDLPAYDFDPAS 186

Query: 196 TEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMAN-RLPIKTIGPTIPSAYLDGRL 255
           T+ ++  +TSQ+SN++    +F NTFD LE +++ WM     P+KT+GPT+PSAYLD R+
Sbjct: 187 TDTIIDLLTSQYSNIQDANLLFCNTFDKLEGEIIQWMETLGRPVKTVGPTVPSAYLDKRV 246

Query: 256 KDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKELTCFLKDTNLS 315
           ++DK YGL++ + ++    ++WLDSK +GSV+Y+S+GSLV + EEQ+KEL   +K+T   
Sbjct: 247 ENDKHYGLSLFKPNE-DVCLKWLDSKPSGSVLYVSYGSLVEMGEEQLKELALGIKETGKF 306

Query: 316 FLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTLEALSL 375
           FLWV+R++E +KLP NF +  +E+GL+V+WC QLEVL+H +V CF THCGWNSTLEAL L
Sbjct: 307 FLWVVRDTEAEKLPPNFVESVAEKGLVVSWCSQLEVLAHPSVGCFFTHCGWNSTLEALCL 366

Query: 376 GVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMIVQGERSNKFK 435
           GVP+VA PQW DQ TNAKF+ +VW+VG RVK+N++ +A+KEE+ + I  +++GER+++FK
Sbjct: 367 GVPVVAFPQWADQVTNAKFLEDVWKVGKRVKRNEQRLASKEEVRSCIWEVMEGERASEFK 426

Query: 436 NNSIKWKKLAKDAVDEGGSSDKNIEEFV 459
           +NS++WKK AK+AVDEGGSSDKNIEEFV
Sbjct: 427 SNSMEWKKWAKEAVDEGGSSDKNIEEFV 448

BLAST of Tan0018419 vs. ExPASy Swiss-Prot
Match: W8JMV4 (UDP glycosyltransferase 9 OS=Catharanthus roseus OX=4058 GN=UGT9 PE=2 SV=1)

HSP 1 Score: 376.7 bits (966), Expect = 3.8e-103
Identity = 198/464 (42.67%), Postives = 304/464 (65.52%), Query Frame = 0

Query: 16  HVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLKIISD- 75
           H++ FPFP  GH++P+L    RL SKG  +T +T  S  +S  +    +  + ++ I D 
Sbjct: 14  HILAFPFPAKGHINPLLHLCNRLASKGFKITLITTVSTLKS--VKTSKANGIDIESIPDG 73

Query: 76  IP--ESNDIAT-----LDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVMPW 135
           IP  +++ I T     ++ Y + F+A+  ++    I +     +    P +++YDS MPW
Sbjct: 74  IPQEQNHQIITVMEMNMELYFKQFKASAIENTTKLIQKL---KTKNPPPKVLIYDSSMPW 133

Query: 136 VQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLP--PPENVAVSLPALPMDLQPEDL 195
           +  +A E+GL  A FFTQ  +V+ I   +  G+++LP    EN  VSLP LP+ L+ +DL
Sbjct: 134 ILEVAHEQGLLGASFFTQPCSVSAIYYHMLQGTIKLPLENSENGMVSLPYLPL-LEKKDL 193

Query: 196 PA---FPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPTIP 255
           P    F D++E + + +  QFSN++ V ++ FNTFD LE +VVNWM ++ PI T+GPT P
Sbjct: 194 PGVQQFEDNSEALAELLADQFSNIDDVDYVLFNTFDALEIEVVNWMGSKWPILTVGPTAP 253

Query: 256 SA--YLDGRLKD-DKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKE 315
           ++   LD + K+ +    +N L   + +  M+WLD +E  +V+Y+SFGSL  L EEQ+++
Sbjct: 254 TSMFLLDKKQKNYEDGRSINYLFETNTEVCMKWLDQREIDTVIYVSFGSLASLTEEQMEQ 313

Query: 316 LTCFLKDTNLSFLWVLRESELKKLPNNFEQETS-ERGLIVNWCCQLEVLSHKAVSCFVTH 375
           ++  L  +N  FLWV+RE E  KLP +F++ TS ++GL++NWC QL+VL+HK+V+CF+TH
Sbjct: 314 VSQALIRSNCYFLWVVREEEENKLPKDFKETTSKKKGLVINWCPQLDVLAHKSVACFMTH 373

Query: 376 CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDK-GIATKEELEASI 435
           CGWNSTLEAL  GVPM+ +PQW DQTTNAK + +VW++GV V K+D+ GI  +E++E  I
Sbjct: 374 CGWNSTLEALCSGVPMICMPQWADQTTNAKLIEHVWKIGVGVNKSDENGIVKREDIEDCI 433

Query: 436 LMIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAI 462
             +++ ER  + K N+IKWK+LAK+AV EGGSS  NI+EF  ++
Sbjct: 434 RQVIESERGKELKRNAIKWKELAKEAVSEGGSSYNNIQEFSSSL 471

BLAST of Tan0018419 vs. ExPASy Swiss-Prot
Match: Q9SYK9 (UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 8.4e-103
Identity = 200/459 (43.57%), Postives = 284/459 (61.87%), Query Frame = 0

Query: 15  NHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLK--II 74
           +H+I+ PFP  GH++P+ QF KRL SKGL +T +  S           PSP  + +   I
Sbjct: 5   SHLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSD---------KPSPPYKTEHDSI 64

Query: 75  SDIPESN-------DIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVM 134
           +  P SN        +  LD Y+     ++  +L   +++  +S +    P  IVYDS M
Sbjct: 65  TVFPISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNP---PRAIVYDSTM 124

Query: 135 PWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPE---NVAVSLPALPMDLQP 194
           PW+  +A   GL  A FFTQ   V  I   V+ GS  +P  +   +   S P+ PM L  
Sbjct: 125 PWLLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPM-LTA 184

Query: 195 EDLPAFPDDTEV---VVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGP 254
            DLP+F  ++     +++ +  Q SN+++V  +  NTFD LE K++ W+ +  P+  IGP
Sbjct: 185 NDLPSFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLNIGP 244

Query: 255 TIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKE 314
           T+PS YLD RL +DK YG ++       + M+WL+SKE  SVVY+SFGSLVIL E+Q+ E
Sbjct: 245 TVPSMYLDKRLSEDKNYGFSLFNAKVA-ECMEWLNSKEPNSVVYLSFGSLVILKEDQMLE 304

Query: 315 LTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHC 374
           L   LK +   FLWV+RE+E  KLP N+ +E  E+GLIV+W  QL+VL+HK++ CF+THC
Sbjct: 305 LAAGLKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHC 364

Query: 375 GWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILM 434
           GWNSTLE LSLGVPM+ +P W DQ TNAKF+ +VW+VGVRVK    G   +EE+  S+  
Sbjct: 365 GWNSTLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEE 424

Query: 435 IVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFV 459
           +++GE+  + + N+ KWK LA++AV EGGSSDK+I EFV
Sbjct: 425 VMEGEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

BLAST of Tan0018419 vs. ExPASy Swiss-Prot
Match: P0C7P7 (UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=1)

HSP 1 Score: 369.0 bits (946), Expect = 7.8e-101
Identity = 197/459 (42.92%), Postives = 283/459 (61.66%), Query Frame = 0

Query: 15  NHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLK--II 74
           +HVI+ PFP  GH++P+ QF KRL SK L +T +  S           PSP  + +   I
Sbjct: 5   SHVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSD---------KPSPPYKTEHDTI 64

Query: 75  SDIPESNDI-------ATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVM 134
           + +P SN           LD Y+    +++   L   I++  +S +    P  +VYDS M
Sbjct: 65  TVVPISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGNP---PRALVYDSTM 124

Query: 135 PWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPE---NVAVSLPALPMDLQP 194
           PW+  +A   GL  A FFTQ   V+ I   V+ GS  +P  +   +   S P+LP+ L  
Sbjct: 125 PWLLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPI-LNA 184

Query: 195 EDLPAFPDDTE---VVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGP 254
            DLP+F  ++     +++ +  Q SN+++V  +  NTFD LE K++ W+ +  P+  IGP
Sbjct: 185 NDLPSFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVLNIGP 244

Query: 255 TIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKE 314
           T+PS YLD RL +DK YG ++       + M+WL+SK+  SVVY+SFGSLV+L ++Q+ E
Sbjct: 245 TVPSMYLDKRLAEDKNYGFSLFGAKIA-ECMEWLNSKQPSSVVYVSFGSLVVLKKDQLIE 304

Query: 315 LTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHC 374
           L   LK +   FLWV+RE+E +KLP N+ +E  E+GL V+W  QLEVL+HK++ CFVTHC
Sbjct: 305 LAAGLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHC 364

Query: 375 GWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILM 434
           GWNSTLE LSLGVPM+ +P W DQ TNAKF+ +VW+VGVRVK +  G   +EE    +  
Sbjct: 365 GWNSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEE 424

Query: 435 IVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFV 459
           +++ E+  + + N+ KWK LA++AV EGGSSDKNI EFV
Sbjct: 425 VMEAEQGKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFV 449

BLAST of Tan0018419 vs. ExPASy Swiss-Prot
Match: Q9SKC5 (UDP-glycosyltransferase 74D1 OS=Arabidopsis thaliana OX=3702 GN=UGT74D1 PE=1 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 4.0e-97
Identity = 199/465 (42.80%), Postives = 294/465 (63.23%), Query Frame = 0

Query: 9   GEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLR 68
           GEK K N V++F FP  GH++P+LQF+KRL+SK + VTFLT SS   S++       +  
Sbjct: 2   GEKAKAN-VLVFSFPIQGHINPLLQFSKRLLSKNVNVTFLTTSSTHNSILRRAITGGATA 61

Query: 69  LKI----ISDIPESNDIATLDA--YLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYD 128
           L +    I D  E +  +T  +  Y   F+  V++SL+  I      SS + +P+ +VYD
Sbjct: 62  LPLSFVPIDDGFEEDHPSTDTSPDYFAKFQENVSRSLSELI------SSMDPKPNAVVYD 121

Query: 129 SVMPWVQSIAAER-GLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDLQ 188
           S +P+V  +  +  G+ AA FFTQS+ VN        G  +    +   V LPA+P  L+
Sbjct: 122 SCLPYVLDVCRKHPGVAAASFFTQSSTVNATYIHFLRGEFKEFQND---VVLPAMP-PLK 181

Query: 189 PEDLPAFPDDTEV---VVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIG 248
             DLP F  D  +   + + ++SQF N++ + +   N+FD LE +V+ WM N+ P+K IG
Sbjct: 182 GNDLPVFLYDNNLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQWPVKNIG 241

Query: 249 PTIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVK 308
           P IPS YLD RL  DK YG+N+       + + WLDSK  GSV+Y+SFGSL +L ++Q+ 
Sbjct: 242 PMIPSMYLDKRLAGDKDYGINLFNA-QVNECLDWLDSKPPGSVIYVSFGSLAVLKDDQMI 301

Query: 309 ELTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTH 368
           E+   LK T  +FLWV+RE+E KKLP+N+ ++  ++GLIVNW  QL+VL+HK++ CF+TH
Sbjct: 302 EVAAGLKQTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMTH 361

Query: 369 CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASI- 428
           CGWNSTLEALSLGV ++ +P + DQ TNAKF+ +VW+VGVRVK +  G   KEE+   + 
Sbjct: 362 CGWNSTLEALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCVG 421

Query: 429 -LMIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAI 462
            +M    E+  + + N+ +  + A++A+ +GG+SDKNI+EFV  I
Sbjct: 422 EVMEDMSEKGKEIRKNARRLMEFAREALSDGGNSDKNIDEFVAKI 454

BLAST of Tan0018419 vs. NCBI nr
Match: XP_022997132.1 (UDP-glycosyltransferase 74E2-like [Cucurbita maxima] >XP_022997133.1 UDP-glycosyltransferase 74E2-like [Cucurbita maxima])

HSP 1 Score: 758.8 bits (1958), Expect = 2.7e-215
Identity = 374/463 (80.78%), Postives = 419/463 (90.50%), Query Frame = 0

Query: 1   MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIIN 60
           ME+TT NGG + KQNHVI+FPFPRHGHM+P+LQFAKRL+SKG ++TFLT SSA+QSLI++
Sbjct: 1   MEKTTVNGGGEMKQNHVIVFPFPRHGHMNPMLQFAKRLVSKGFLLTFLTTSSASQSLILD 60

Query: 61  LPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIV 120
           LPPSP +  K+ISD+PESN+I +LDAYLR+FRAA +KSLANFIDE+LIS S+EV PSLIV
Sbjct: 61  LPPSP-IHHKVISDVPESNNIDSLDAYLRSFRAAASKSLANFIDESLISDSNEVLPSLIV 120

Query: 121 YDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDL 180
           YDSVMPWVQS+AAERGLDAAPFFTQSAAVNHILDLVY GSL +PPPE+VAVSLP+  + L
Sbjct: 121 YDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPS-EIVL 180

Query: 181 QPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPT 240
           QP DLPA PDD  VV+ FMTSQF NLEKVKWIFFNTFD LECKVVNWM   LPIKT+GPT
Sbjct: 181 QPADLPALPDDGVVVLDFMTSQFINLEKVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPT 240

Query: 241 IPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKEL 300
           IPSAYLDGRL DDKAYGLNVL  +DGKK +QWLDSKET S++YISFGSLV L  EQV EL
Sbjct: 241 IPSAYLDGRLVDDKAYGLNVLNPNDGKKAIQWLDSKETASIIYISFGSLVNLKIEQVNEL 300

Query: 301 TCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCG 360
           TCFL+DTNLSFLWVLRESEL KLPNNF Q+TSE GLIVNWCCQL+VLSHKAVSCFVTHCG
Sbjct: 301 TCFLEDTNLSFLWVLRESELGKLPNNFVQDTSEHGLIVNWCCQLQVLSHKAVSCFVTHCG 360

Query: 361 WNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMI 420
           WNST+EALSLGVPMVAIPQWVDQTTNAKFVA+VWEVGVRVKKNDKGIATKEELEASI  +
Sbjct: 361 WNSTIEALSLGVPMVAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIATKEELEASIRKV 420

Query: 421 VQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           VQGE+ N+ K NSIKWKKLAK+A+DEGGSSDKNI+EFV+A+A+
Sbjct: 421 VQGEKPNEIKQNSIKWKKLAKEAMDEGGSSDKNIDEFVQAMAA 461

BLAST of Tan0018419 vs. NCBI nr
Match: XP_022962392.1 (UDP-glycosyltransferase 74E2-like [Cucurbita moschata] >XP_022962393.1 UDP-glycosyltransferase 74E2-like [Cucurbita moschata])

HSP 1 Score: 751.1 bits (1938), Expect = 5.6e-213
Identity = 369/463 (79.70%), Postives = 420/463 (90.71%), Query Frame = 0

Query: 1   MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIIN 60
           ME+TT +GG + KQ+HVI+FPFPRHGHM+P+LQFAKRL+SKGL++TFLT SSA++SLI++
Sbjct: 1   MEKTTVDGGGEMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILD 60

Query: 61  LPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIV 120
           LPPSP +R K+ISD PESN+I +LDAYLR+FRAA +KSLANFIDE LIS S+EV PSLIV
Sbjct: 61  LPPSP-IRHKVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIV 120

Query: 121 YDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDL 180
           YDSVMPWVQS+AAERGLDAAPFFTQSAAVNHILDLVY GSL +PPPE+VAVSLP+  + L
Sbjct: 121 YDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPS-EIVL 180

Query: 181 QPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPT 240
           QP DLP  PDD +VV++FMTSQF NLE VKWIFFNTFD LECKVVNWM   LPIKT+GPT
Sbjct: 181 QPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPT 240

Query: 241 IPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKEL 300
           IPSAYLDGRL DDKAYGLNVL  +DGKK ++WLDSKET SV+YISFGSLV L +EQV EL
Sbjct: 241 IPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTEL 300

Query: 301 TCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCG 360
           TCFL++TNLSFLWVLRESEL KLPNNF Q+TSE+GLIVNWCCQLEVLSHKAVSCFVTHCG
Sbjct: 301 TCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCG 360

Query: 361 WNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMI 420
           WNST+EALSLGVPM+AIPQWVDQTTNAKFVA+VWEVGVRVKKNDKGI TKEELEASI  I
Sbjct: 361 WNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKI 420

Query: 421 VQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           VQGE+ N+ K NSIKWKK+AK+A+DEGGSSDKNI+EFV+A+A+
Sbjct: 421 VQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAA 461

BLAST of Tan0018419 vs. NCBI nr
Match: KAG6598621.1 (UDP-glycosyltransferase 74E2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 749.6 bits (1934), Expect = 1.6e-212
Identity = 367/463 (79.27%), Postives = 417/463 (90.06%), Query Frame = 0

Query: 1   MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIIN 60
           ME+TT +GG + KQ+HVI+FPFPRHGHM+P+LQFAKRL+SKGL++TFLT SSA++SLI++
Sbjct: 1   MEKTTVDGGGEMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILD 60

Query: 61  LPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIV 120
           LPPSP +  K+ISD+PESN+I +LDAYLR+FRAA +KSLANFIDE LIS S+EV PSLIV
Sbjct: 61  LPPSP-IHHKVISDVPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIV 120

Query: 121 YDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDL 180
           YDSVMPWVQS+AAERGLDAAPFFTQSAAVNHILDLVY GSL +PPPE+VAVSLP+  + L
Sbjct: 121 YDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPS-EIVL 180

Query: 181 QPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPT 240
           QP DLP  PDD +VV++FMTSQF NLE VKWIFFNTFD LECKVVNWM   LPIKT+GPT
Sbjct: 181 QPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPT 240

Query: 241 IPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKEL 300
           IPSAYLDGRL DDKAYGLNVL  +DGKK +QWLDSKET SV+YISFGSLV L  EQV EL
Sbjct: 241 IPSAYLDGRLVDDKAYGLNVLNPNDGKKAIQWLDSKETASVIYISFGSLVNLENEQVTEL 300

Query: 301 TCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCG 360
           TCFL+DTNLSFLWVLRESEL KLPNNF Q+TSE+GLIVNWCCQLEVLSHK VSCFVTHCG
Sbjct: 301 TCFLRDTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKTVSCFVTHCG 360

Query: 361 WNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMI 420
           WNST+EALSLGVPM+AIPQWVDQTTNAKFVA+VWEVGVRVKKNDKGI TKEEL ASI  +
Sbjct: 361 WNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELAASIRKV 420

Query: 421 VQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           V+GE+ N+ K NSIKWKKLAK+A+DEGGSSDKNI+EFV+A+A+
Sbjct: 421 VRGEKPNEIKQNSIKWKKLAKEAMDEGGSSDKNIDEFVQAMAA 461

BLAST of Tan0018419 vs. NCBI nr
Match: XP_038885149.1 (mogroside IE synthase-like [Benincasa hispida] >XP_038885150.1 mogroside IE synthase-like [Benincasa hispida])

HSP 1 Score: 748.8 bits (1932), Expect = 2.8e-212
Identity = 375/467 (80.30%), Postives = 421/467 (90.15%), Query Frame = 0

Query: 1   MEETTANG-GEK--TKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSL 60
           MEETT NG G +   KQNHVI+FPFPRHGH+SP+LQF+KRLISKGL++TFLT SSA+QSL
Sbjct: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60

Query: 61  IINLPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSD-EVRP 120
           I+NLPPSPS  LKIISD+ ESN +A+L AYL++FRAAVTKSLANFID+ LISSSD E+ P
Sbjct: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120

Query: 121 SLIVYDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPAL 180
           +LIVYDSVMPWVQ++AAERGLD APFFTQSAAVNH+L LVYGGSL +PPPENVAVSLPA 
Sbjct: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPA- 180

Query: 181 PMDLQPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKT 240
            + LQP DLPAFPDD+EVV+KFMTSQF NLE VKWIF NTFD LE KVVNWMA  LPIKT
Sbjct: 181 EIALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKT 240

Query: 241 IGPTIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQ 300
           +GPTIPSAYLDGRL+DDKAYGLNV + + GK P++WLDSKET SVVYISFGSLVIL EEQ
Sbjct: 241 VGPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQ 300

Query: 301 VKELTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFV 360
           VKELT  L+DT+ SFLWVLRESEL+KLPNNF Q+TSERGLIVNWCCQ +VLSHKAVSCFV
Sbjct: 301 VKELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFV 360

Query: 361 THCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEAS 420
           THCGWNSTLEALSLGVPMVAIPQWVDQTTNAKF+A+VW VG+RVKKN+KGIATKEELEAS
Sbjct: 361 THCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEAS 420

Query: 421 ILMIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           I  IVQGER+N+FK NSIKWK LAK+AVDEGG+SDK+IEEFV+AI +
Sbjct: 421 IRKIVQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVA 466

BLAST of Tan0018419 vs. NCBI nr
Match: XP_023546480.1 (UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 746.5 bits (1926), Expect = 1.4e-211
Identity = 365/463 (78.83%), Postives = 418/463 (90.28%), Query Frame = 0

Query: 1   MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIIN 60
           ME+TT +GG + KQ+HVI+FPFPRHGHM+P+LQFAKRL+SKGL++TFLT SSA++SLI++
Sbjct: 1   MEKTTVDGGGEMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILD 60

Query: 61  LPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIV 120
           LPPSP +  K+ISD+PES++I +LDAYLR+FRAA +KSLANFIDE LIS S+EV PSLIV
Sbjct: 61  LPPSP-IHHKVISDVPESSNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIV 120

Query: 121 YDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDL 180
           YDSVMPWVQS+AAERGLDAAPFFTQSAAVNHILDLVY GSL +PPPE+VA+SLP+  + L
Sbjct: 121 YDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAISLPS-EIVL 180

Query: 181 QPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPT 240
           QP DLP  PDD +VV++FMTSQF NLE VKWIFFNTFD LECKVVNWM   LPIKT+GPT
Sbjct: 181 QPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPT 240

Query: 241 IPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKEL 300
           IPSAYLDGRL  DKAYGLNVL  +DGKK +QWLDSKET S++YISFGSLV L +EQV EL
Sbjct: 241 IPSAYLDGRLAYDKAYGLNVLNPNDGKKAIQWLDSKETASIIYISFGSLVNLEKEQVTEL 300

Query: 301 TCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCG 360
           TCFLKDTNLSFLWVLRESEL KLPNNF Q+T E+GLIVNWCCQL+VLSHKAVSCFVTHCG
Sbjct: 301 TCFLKDTNLSFLWVLRESELGKLPNNFVQDTLEQGLIVNWCCQLQVLSHKAVSCFVTHCG 360

Query: 361 WNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMI 420
           WNST+EALSLGVPMVAIPQWVDQTTNAKFVA+VWEVGVRVKKNDKGI TKEELEASI  +
Sbjct: 361 WNSTIEALSLGVPMVAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKV 420

Query: 421 VQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           VQGE+ N+ K NSIKWKK+AK+A+DEGGSSDKNI+EFV+A+A+
Sbjct: 421 VQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAA 461

BLAST of Tan0018419 vs. ExPASy TrEMBL
Match: A0A6J1KD05 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111492133 PE=3 SV=1)

HSP 1 Score: 758.8 bits (1958), Expect = 1.3e-215
Identity = 374/463 (80.78%), Postives = 419/463 (90.50%), Query Frame = 0

Query: 1   MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIIN 60
           ME+TT NGG + KQNHVI+FPFPRHGHM+P+LQFAKRL+SKG ++TFLT SSA+QSLI++
Sbjct: 1   MEKTTVNGGGEMKQNHVIVFPFPRHGHMNPMLQFAKRLVSKGFLLTFLTTSSASQSLILD 60

Query: 61  LPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIV 120
           LPPSP +  K+ISD+PESN+I +LDAYLR+FRAA +KSLANFIDE+LIS S+EV PSLIV
Sbjct: 61  LPPSP-IHHKVISDVPESNNIDSLDAYLRSFRAAASKSLANFIDESLISDSNEVLPSLIV 120

Query: 121 YDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDL 180
           YDSVMPWVQS+AAERGLDAAPFFTQSAAVNHILDLVY GSL +PPPE+VAVSLP+  + L
Sbjct: 121 YDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPS-EIVL 180

Query: 181 QPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPT 240
           QP DLPA PDD  VV+ FMTSQF NLEKVKWIFFNTFD LECKVVNWM   LPIKT+GPT
Sbjct: 181 QPADLPALPDDGVVVLDFMTSQFINLEKVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPT 240

Query: 241 IPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKEL 300
           IPSAYLDGRL DDKAYGLNVL  +DGKK +QWLDSKET S++YISFGSLV L  EQV EL
Sbjct: 241 IPSAYLDGRLVDDKAYGLNVLNPNDGKKAIQWLDSKETASIIYISFGSLVNLKIEQVNEL 300

Query: 301 TCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCG 360
           TCFL+DTNLSFLWVLRESEL KLPNNF Q+TSE GLIVNWCCQL+VLSHKAVSCFVTHCG
Sbjct: 301 TCFLEDTNLSFLWVLRESELGKLPNNFVQDTSEHGLIVNWCCQLQVLSHKAVSCFVTHCG 360

Query: 361 WNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMI 420
           WNST+EALSLGVPMVAIPQWVDQTTNAKFVA+VWEVGVRVKKNDKGIATKEELEASI  +
Sbjct: 361 WNSTIEALSLGVPMVAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIATKEELEASIRKV 420

Query: 421 VQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           VQGE+ N+ K NSIKWKKLAK+A+DEGGSSDKNI+EFV+A+A+
Sbjct: 421 VQGEKPNEIKQNSIKWKKLAKEAMDEGGSSDKNIDEFVQAMAA 461

BLAST of Tan0018419 vs. ExPASy TrEMBL
Match: A0A6J1HCL4 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111462850 PE=3 SV=1)

HSP 1 Score: 751.1 bits (1938), Expect = 2.7e-213
Identity = 369/463 (79.70%), Postives = 420/463 (90.71%), Query Frame = 0

Query: 1   MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIIN 60
           ME+TT +GG + KQ+HVI+FPFPRHGHM+P+LQFAKRL+SKGL++TFLT SSA++SLI++
Sbjct: 1   MEKTTVDGGGEMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILD 60

Query: 61  LPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIV 120
           LPPSP +R K+ISD PESN+I +LDAYLR+FRAA +KSLANFIDE LIS S+EV PSLIV
Sbjct: 61  LPPSP-IRHKVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIV 120

Query: 121 YDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDL 180
           YDSVMPWVQS+AAERGLDAAPFFTQSAAVNHILDLVY GSL +PPPE+VAVSLP+  + L
Sbjct: 121 YDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPS-EIVL 180

Query: 181 QPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPT 240
           QP DLP  PDD +VV++FMTSQF NLE VKWIFFNTFD LECKVVNWM   LPIKT+GPT
Sbjct: 181 QPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPT 240

Query: 241 IPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKEL 300
           IPSAYLDGRL DDKAYGLNVL  +DGKK ++WLDSKET SV+YISFGSLV L +EQV EL
Sbjct: 241 IPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTEL 300

Query: 301 TCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHCG 360
           TCFL++TNLSFLWVLRESEL KLPNNF Q+TSE+GLIVNWCCQLEVLSHKAVSCFVTHCG
Sbjct: 301 TCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCG 360

Query: 361 WNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILMI 420
           WNST+EALSLGVPM+AIPQWVDQTTNAKFVA+VWEVGVRVKKNDKGI TKEELEASI  I
Sbjct: 361 WNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKI 420

Query: 421 VQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           VQGE+ N+ K NSIKWKK+AK+A+DEGGSSDKNI+EFV+A+A+
Sbjct: 421 VQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAA 461

BLAST of Tan0018419 vs. ExPASy TrEMBL
Match: A0A1S3BCU2 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488485 PE=3 SV=1)

HSP 1 Score: 728.4 bits (1879), Expect = 1.9e-206
Identity = 361/466 (77.47%), Postives = 415/466 (89.06%), Query Frame = 0

Query: 1   MEETTAN-GGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLII 60
           ME T AN GGE+ KQ+HVI+FPFPRHGHMSP+LQF+KRLISKGL++TFL  SSA+QSL I
Sbjct: 1   MEMTAANGGGERIKQSHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLITSSASQSLTI 60

Query: 61  NLPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSS-DEVRPSL 120
           N+PPSPS   KIISD+PES+D+ATLDAYLR+FRAAVTKSL+NFIDE L SSS +EV P+L
Sbjct: 61  NIPPSPSFHFKIISDLPESDDVATLDAYLRSFRAAVTKSLSNFIDEVLTSSSNEEVPPTL 120

Query: 121 IVYDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPM 180
           IVYDSVMPWVQS+AAERGLD+APFFT+SAAVNH+L LVYGGSL +PPP+NV VSLP+  +
Sbjct: 121 IVYDSVMPWVQSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPPPDNVVVSLPS-EI 180

Query: 181 DLQPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIG 240
            LQPEDLP+FPDD EVV+ FMTSQFS+LE VKWIF NTFD LE KVVNWMA  LPIKT+G
Sbjct: 181 VLQPEDLPSFPDDPEVVLDFMTSQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTVG 240

Query: 241 PTIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVK 300
           PTIPSAYLDGRL+ DKAYGLNV + ++GK P++WLDSKET SV+YISFGSLVIL EEQVK
Sbjct: 241 PTIPSAYLDGRLEKDKAYGLNVSKSNNGKCPIKWLDSKETASVIYISFGSLVILSEEQVK 300

Query: 301 ELTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTH 360
           ELT  L+DT+ SFLWVLRESE+ KLP NF Q+TS+RGLIVNWCCQL+VLSHKAVSCFVTH
Sbjct: 301 ELTNLLRDTDFSFLWVLRESEMVKLPKNFVQDTSDRGLIVNWCCQLQVLSHKAVSCFVTH 360

Query: 361 CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASI- 420
           CGWNSTLEALSLGVPMVAIPQW+DQTTNAKFVA+VW VGVRVKKN+K +A KEELEASI 
Sbjct: 361 CGWNSTLEALSLGVPMVAIPQWIDQTTNAKFVADVWRVGVRVKKNEKSVAIKEELEASIR 420

Query: 421 LMIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
            ++VQG  +N+FK N+IKWK LAK+AVDE GSSDKNIEEFV+A+ +
Sbjct: 421 KIVVQGNGTNEFKQNAIKWKNLAKEAVDERGSSDKNIEEFVQALVA 465

BLAST of Tan0018419 vs. ExPASy TrEMBL
Match: A0A0A0KD63 (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G366280 PE=3 SV=1)

HSP 1 Score: 724.9 bits (1870), Expect = 2.1e-205
Identity = 358/466 (76.82%), Postives = 414/466 (88.84%), Query Frame = 0

Query: 1   MEETTAN-GGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLII 60
           ME+  AN GG + KQNHVI+FPFPRHGHMSP+LQF+KRLISKGL++TFL  SSA+QSL I
Sbjct: 1   MEKAMANGGGGRIKQNHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLVTSSASQSLTI 60

Query: 61  NLPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSS-DEVRPSL 120
           N+PPSPS  +KIISD+PES+D+AT DAY+R+F+AAVTKSL+NFIDE LISSS +EV P+L
Sbjct: 61  NIPPSPSFHIKIISDLPESDDVATFDAYIRSFQAAVTKSLSNFIDEALISSSYEEVSPTL 120

Query: 121 IVYDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPM 180
           IVYDS+MPWV S+AAERGLD+APFFT+SAAVNH+L LVYGGSL +P PENV VSLP+  +
Sbjct: 121 IVYDSIMPWVHSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPAPENVVVSLPS-EI 180

Query: 181 DLQPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIG 240
            LQP DLP+FPDD EVV+ FM +QFS+LE VKWIF NTFD LE KVVNWMA  LPIKT+G
Sbjct: 181 VLQPGDLPSFPDDPEVVLDFMINQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTVG 240

Query: 241 PTIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVK 300
           PTIPSAYLDGRL++DKAYGLNV + ++GK P++WLDSKET SV+YISFGSLV+L EEQVK
Sbjct: 241 PTIPSAYLDGRLENDKAYGLNVSKSNNGKSPIKWLDSKETASVIYISFGSLVMLSEEQVK 300

Query: 301 ELTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTH 360
           ELT  L+DT+ SFLWVLRESEL KLPNNF Q+TS+ GLIVNWCCQL+VLSHKAVSCFVTH
Sbjct: 301 ELTNLLRDTDFSFLWVLRESELVKLPNNFVQDTSDHGLIVNWCCQLQVLSHKAVSCFVTH 360

Query: 361 CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASI- 420
           CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVA+VW VGVRVKKN+KG+A KEELEASI 
Sbjct: 361 CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVADVWRVGVRVKKNEKGVAIKEELEASIR 420

Query: 421 LMIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
            ++VQG R N+FK NSIKWK LAK+AVDE GSSDKNIEEFV+A+A+
Sbjct: 421 KIVVQGNRPNEFKQNSIKWKNLAKEAVDERGSSDKNIEEFVQALAA 465

BLAST of Tan0018419 vs. ExPASy TrEMBL
Match: A0A6J1GJP1 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111454485 PE=3 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 1.3e-162
Identity = 296/463 (63.93%), Postives = 362/463 (78.19%), Query Frame = 0

Query: 1   MEETTANGGEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIIN 60
           M  TTANGG ++  +HV++F +P+ GH+SP+LQFAKRL SKGL +TFLT +SAT+SL I+
Sbjct: 1   MSNTTANGGRRS--SHVVLFAYPKQGHLSPMLQFAKRLASKGLRITFLTTTSATKSLEID 60

Query: 61  LPPSPSLRLKIISDIPESNDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIV 120
           LP S  + L+ ISD+  +  I +L     +F A V+KS  +FID TL SS  +  P  +V
Sbjct: 61  LPASYQIDLRFISDV-RTEPILSLKDEHESFEAVVSKSFGDFIDGTLRSSGYD-PPRFVV 120

Query: 121 YDSVMPWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVA--VSLPALPM 180
           +DSVMPW   +A  RG+ +APFFT+S  VNHIL+ VY GS  +PP ENVA  +S+P LP+
Sbjct: 121 FDSVMPWAMDVARVRGIGSAPFFTESCVVNHILNQVYKGSFSIPPVENVAAGISIPPLPV 180

Query: 181 DLQPEDLPAFPDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIG 240
            LQ EDLP F  + E+V+KFMT QFS+ +  KWIF NTFD LE KVVNWM  + PIKTIG
Sbjct: 181 -LQTEDLPYFSYEPELVLKFMTDQFSSFKNAKWIFVNTFDQLEMKVVNWMTQKWPIKTIG 240

Query: 241 PTIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVK 300
           P+IPSAYLDGRLKDDK YGLN   L++  K  QWL+SKE  SV+YISFGSLVIL ++QV 
Sbjct: 241 PSIPSAYLDGRLKDDKTYGLNHQNLNN-CKIFQWLNSKEIASVIYISFGSLVILPDKQVN 300

Query: 301 ELTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTH 360
           EL  FLK+TNLSFLWVLRESE +KLPNNF Q+TS +GL+V WCCQL+VLSHKAVSCFVTH
Sbjct: 301 ELASFLKNTNLSFLWVLRESEQEKLPNNFVQQTSHKGLVVKWCCQLQVLSHKAVSCFVTH 360

Query: 361 CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASIL 420
           CGWNST+EALSLGVPMVA+PQW+DQTTNAKF+A+VW+VG RVK NDKGIATK ELEAS+ 
Sbjct: 361 CGWNSTIEALSLGVPMVAVPQWIDQTTNAKFIADVWKVGARVKMNDKGIATKLELEASLR 420

Query: 421 MIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAI 462
            + QG R N+ K NSIK + LAK+A+DEGGSSDKNIE+FVK +
Sbjct: 421 HVSQGYRQNEIKQNSIKLRNLAKEAMDEGGSSDKNIEQFVKEL 457

BLAST of Tan0018419 vs. TAIR 10
Match: AT1G05680.1 (Uridine diphosphate glycosyltransferase 74E2 )

HSP 1 Score: 375.6 bits (963), Expect = 5.9e-104
Identity = 200/459 (43.57%), Postives = 284/459 (61.87%), Query Frame = 0

Query: 15  NHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLK--II 74
           +H+I+ PFP  GH++P+ QF KRL SKGL +T +  S           PSP  + +   I
Sbjct: 5   SHLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSD---------KPSPPYKTEHDSI 64

Query: 75  SDIPESN-------DIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVM 134
           +  P SN        +  LD Y+     ++  +L   +++  +S +    P  IVYDS M
Sbjct: 65  TVFPISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNP---PRAIVYDSTM 124

Query: 135 PWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPE---NVAVSLPALPMDLQP 194
           PW+  +A   GL  A FFTQ   V  I   V+ GS  +P  +   +   S P+ PM L  
Sbjct: 125 PWLLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPM-LTA 184

Query: 195 EDLPAFPDDTEV---VVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGP 254
            DLP+F  ++     +++ +  Q SN+++V  +  NTFD LE K++ W+ +  P+  IGP
Sbjct: 185 NDLPSFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLNIGP 244

Query: 255 TIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKE 314
           T+PS YLD RL +DK YG ++       + M+WL+SKE  SVVY+SFGSLVIL E+Q+ E
Sbjct: 245 TVPSMYLDKRLSEDKNYGFSLFNAKVA-ECMEWLNSKEPNSVVYLSFGSLVILKEDQMLE 304

Query: 315 LTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHC 374
           L   LK +   FLWV+RE+E  KLP N+ +E  E+GLIV+W  QL+VL+HK++ CF+THC
Sbjct: 305 LAAGLKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHC 364

Query: 375 GWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILM 434
           GWNSTLE LSLGVPM+ +P W DQ TNAKF+ +VW+VGVRVK    G   +EE+  S+  
Sbjct: 365 GWNSTLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEE 424

Query: 435 IVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFV 459
           +++GE+  + + N+ KWK LA++AV EGGSSDK+I EFV
Sbjct: 425 VMEGEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

BLAST of Tan0018419 vs. TAIR 10
Match: AT1G05675.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 369.0 bits (946), Expect = 5.6e-102
Identity = 197/459 (42.92%), Postives = 283/459 (61.66%), Query Frame = 0

Query: 15  NHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLK--II 74
           +HVI+ PFP  GH++P+ QF KRL SK L +T +  S           PSP  + +   I
Sbjct: 5   SHVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSD---------KPSPPYKTEHDTI 64

Query: 75  SDIPESNDI-------ATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVM 134
           + +P SN           LD Y+    +++   L   I++  +S +    P  +VYDS M
Sbjct: 65  TVVPISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGNP---PRALVYDSTM 124

Query: 135 PWVQSIAAERGLDAAPFFTQSAAVNHILDLVYGGSLRLPPPE---NVAVSLPALPMDLQP 194
           PW+  +A   GL  A FFTQ   V+ I   V+ GS  +P  +   +   S P+LP+ L  
Sbjct: 125 PWLLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPI-LNA 184

Query: 195 EDLPAFPDDTE---VVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGP 254
            DLP+F  ++     +++ +  Q SN+++V  +  NTFD LE K++ W+ +  P+  IGP
Sbjct: 185 NDLPSFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVLNIGP 244

Query: 255 TIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKE 314
           T+PS YLD RL +DK YG ++       + M+WL+SK+  SVVY+SFGSLV+L ++Q+ E
Sbjct: 245 TVPSMYLDKRLAEDKNYGFSLFGAKIA-ECMEWLNSKQPSSVVYVSFGSLVVLKKDQLIE 304

Query: 315 LTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTHC 374
           L   LK +   FLWV+RE+E +KLP N+ +E  E+GL V+W  QLEVL+HK++ CFVTHC
Sbjct: 305 LAAGLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHC 364

Query: 375 GWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASILM 434
           GWNSTLE LSLGVPM+ +P W DQ TNAKF+ +VW+VGVRVK +  G   +EE    +  
Sbjct: 365 GWNSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEE 424

Query: 435 IVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFV 459
           +++ E+  + + N+ KWK LA++AV EGGSSDKNI EFV
Sbjct: 425 VMEAEQGKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFV 449

BLAST of Tan0018419 vs. TAIR 10
Match: AT2G31750.1 (UDP-glucosyl transferase 74D1 )

HSP 1 Score: 356.7 bits (914), Expect = 2.9e-98
Identity = 199/465 (42.80%), Postives = 294/465 (63.23%), Query Frame = 0

Query: 9   GEKTKQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLR 68
           GEK K N V++F FP  GH++P+LQF+KRL+SK + VTFLT SS   S++       +  
Sbjct: 2   GEKAKAN-VLVFSFPIQGHINPLLQFSKRLLSKNVNVTFLTTSSTHNSILRRAITGGATA 61

Query: 69  LKI----ISDIPESNDIATLDA--YLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYD 128
           L +    I D  E +  +T  +  Y   F+  V++SL+  I      SS + +P+ +VYD
Sbjct: 62  LPLSFVPIDDGFEEDHPSTDTSPDYFAKFQENVSRSLSELI------SSMDPKPNAVVYD 121

Query: 129 SVMPWVQSIAAER-GLDAAPFFTQSAAVNHILDLVYGGSLRLPPPENVAVSLPALPMDLQ 188
           S +P+V  +  +  G+ AA FFTQS+ VN        G  +    +   V LPA+P  L+
Sbjct: 122 SCLPYVLDVCRKHPGVAAASFFTQSSTVNATYIHFLRGEFKEFQND---VVLPAMP-PLK 181

Query: 189 PEDLPAFPDDTEV---VVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIG 248
             DLP F  D  +   + + ++SQF N++ + +   N+FD LE +V+ WM N+ P+K IG
Sbjct: 182 GNDLPVFLYDNNLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQWPVKNIG 241

Query: 249 PTIPSAYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVK 308
           P IPS YLD RL  DK YG+N+       + + WLDSK  GSV+Y+SFGSL +L ++Q+ 
Sbjct: 242 PMIPSMYLDKRLAGDKDYGINLFNA-QVNECLDWLDSKPPGSVIYVSFGSLAVLKDDQMI 301

Query: 309 ELTCFLKDTNLSFLWVLRESELKKLPNNFEQETSERGLIVNWCCQLEVLSHKAVSCFVTH 368
           E+   LK T  +FLWV+RE+E KKLP+N+ ++  ++GLIVNW  QL+VL+HK++ CF+TH
Sbjct: 302 EVAAGLKQTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMTH 361

Query: 369 CGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVKKNDKGIATKEELEASI- 428
           CGWNSTLEALSLGV ++ +P + DQ TNAKF+ +VW+VGVRVK +  G   KEE+   + 
Sbjct: 362 CGWNSTLEALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCVG 421

Query: 429 -LMIVQGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAI 462
            +M    E+  + + N+ +  + A++A+ +GG+SDKNI+EFV  I
Sbjct: 422 EVMEDMSEKGKEIRKNARRLMEFAREALSDGGNSDKNIDEFVAKI 454

BLAST of Tan0018419 vs. TAIR 10
Match: AT2G43840.2 (UDP-glycosyltransferase 74 F1 )

HSP 1 Score: 345.9 bits (886), Expect = 5.0e-95
Identity = 197/460 (42.83%), Postives = 281/460 (61.09%), Query Frame = 0

Query: 14  QNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLI--INLPPSPSLRLKI 73
           + HV+  PFP  GH++PI QF KRL SKG    F T  + T  +   I+L PS  + +  
Sbjct: 5   RGHVLAVPFPSQGHITPIRQFCKRLHSKG----FKTTHTLTTFIFNTIHLDPSSPISIAT 64

Query: 74  ISDIPES---NDIATLDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVMPWV 133
           ISD  +    +   ++  YL+ F+   +K++A+ I +     S +   + IVYDS MPW 
Sbjct: 65  ISDGYDQGGFSSAGSVPEYLQNFKTFGSKTVADIIRK---HQSTDNPITCIVYDSFMPWA 124

Query: 134 QSIAAERGLDAAPFFTQSAAVNHI--LDLVYGGSLRLPPPENVAVSLPALPMDLQPEDLP 193
             +A + GL AAPFFTQS AVN+I  L  +  GSL LP        +  LP+ L+ +DLP
Sbjct: 125 LDLAMDFGLAAAPFFTQSCAVNYINYLSYINNGSLTLP--------IKDLPL-LELQDLP 184

Query: 194 AFPDDTE---VVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPTIPS 253
            F   T       + +  QF+N +K  ++  N+F  L+  V   ++   P+ TIGPT+PS
Sbjct: 185 TFVTPTGSHLAYFEMVLQQFTNFDKADFVLVNSFHDLDLHVKELLSKVCPVLTIGPTVPS 244

Query: 254 AYLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKELTCF 313
            YLD ++K D  Y LN+  L +      WLD +  GSVVYI+FGS+  L  EQ++E+   
Sbjct: 245 MYLDQQIKSDNDYDLNLFDLKEAALCTDWLDKRPEGSVVYIAFGSMAKLSSEQMEEIASA 304

Query: 314 LKDTNLSFLWVLRESELKKLPNNF-EQETSERGLIVNWCCQLEVLSHKAVSCFVTHCGWN 373
           +  +N S+LWV+R SE  KLP  F E    ++ L++ W  QL+VLS+KA+ CF+THCGWN
Sbjct: 305 I--SNFSYLWVVRASEESKLPPGFLETVDKDKSLVLKWSPQLQVLSNKAIGCFMTHCGWN 364

Query: 374 STLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVK-KNDKGIATKEELEASILMIV 433
           ST+E LSLGVPMVA+PQW DQ  NAK++ +VW+VGVRVK + + GI  +EE+E SI  ++
Sbjct: 365 STMEGLSLGVPMVAMPQWTDQPMNAKYIQDVWKVGVRVKAEKESGICKREEIEFSIKEVM 424

Query: 434 QGERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAI 462
           +GE+S + K N+ KW+ LA  ++ EGGS+D NI EFV  I
Sbjct: 425 EGEKSKEMKENAGKWRDLAVKSLSEGGSTDININEFVSKI 446

BLAST of Tan0018419 vs. TAIR 10
Match: AT2G43820.1 (UDP-glucosyltransferase 74F2 )

HSP 1 Score: 344.7 bits (883), Expect = 1.1e-94
Identity = 196/461 (42.52%), Postives = 276/461 (59.87%), Query Frame = 0

Query: 13  KQNHVIIFPFPRHGHMSPILQFAKRLISKGLIVTFLTASSATQSLIINLPPSPSLRLKII 72
           K+ HV+  P+P  GH++P  QF KRL  KGL  T    +    S  IN   S  + +  I
Sbjct: 4   KRGHVLAVPYPTQGHITPFRQFCKRLHFKGLKTTLALTTFVFNS--INPDLSGPISIATI 63

Query: 73  SDIPESNDIAT---LDAYLRAFRAAVTKSLANFIDETLISSSDEVRPSLIVYDSVMPWVQ 132
           SD  +     T   +D YL+ F+ + +K++A+ I +   S +     + IVYD+ +PW  
Sbjct: 64  SDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDNP---ITCIVYDAFLPWAL 123

Query: 133 SIAAERGLDAAPFFTQSAAVNHILDLVY--GGSLRLPPPENVAVSLPALPMDLQPEDLPA 192
            +A E GL A PFFTQ  AVN++  L Y   GSL+LP  E        LP  L+ +DLP+
Sbjct: 124 DVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQLPIEE--------LPF-LELQDLPS 183

Query: 193 F---PDDTEVVVKFMTSQFSNLEKVKWIFFNTFDHLECKVVNWMANRLPIKTIGPTIPSA 252
           F           + +  QF N EK  ++  N+F  LE       +   P+ TIGPTIPS 
Sbjct: 184 FFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGPTIPSI 243

Query: 253 YLDGRLKDDKAYGLNVLRLDDGKKPMQWLDSKETGSVVYISFGSLVILFEEQVKELTCFL 312
           YLD R+K D  Y LN+    D    + WLD++  GSVVY++FGS+  L   Q++EL   +
Sbjct: 244 YLDQRIKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQMEELASAV 303

Query: 313 KDTNLSFLWVLRESELKKLPNNF-EQETSERGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 372
             +N SFLWV+R SE +KLP+ F E    E+ L++ W  QL+VLS+KA+ CF+THCGWNS
Sbjct: 304 --SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLTHCGWNS 363

Query: 373 TLEALSLGVPMVAIPQWVDQTTNAKFVANVWEVGVRVK-KNDKGIATKEELEASILMIVQ 432
           T+EAL+ GVPMVA+PQW DQ  NAK++ +VW+ GVRVK + + GIA +EE+E SI  +++
Sbjct: 364 TMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFSIKEVME 423

Query: 433 GERSNKFKNNSIKWKKLAKDAVDEGGSSDKNIEEFVKAIAS 464
           GERS + K N  KW+ LA  +++EGGS+D NI+ FV  + S
Sbjct: 424 GERSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFVSRVQS 448

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
K7NBW32.6e-12850.45Mogroside IE synthase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC1 PE=1 SV=1[more]
W8JMV43.8e-10342.67UDP glycosyltransferase 9 OS=Catharanthus roseus OX=4058 GN=UGT9 PE=2 SV=1[more]
Q9SYK98.4e-10343.57UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=... [more]
P0C7P77.8e-10142.92UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=... [more]
Q9SKC54.0e-9742.80UDP-glycosyltransferase 74D1 OS=Arabidopsis thaliana OX=3702 GN=UGT74D1 PE=1 SV=... [more]
Match NameE-valueIdentityDescription
XP_022997132.12.7e-21580.78UDP-glycosyltransferase 74E2-like [Cucurbita maxima] >XP_022997133.1 UDP-glycosy... [more]
XP_022962392.15.6e-21379.70UDP-glycosyltransferase 74E2-like [Cucurbita moschata] >XP_022962393.1 UDP-glyco... [more]
KAG6598621.11.6e-21279.27UDP-glycosyltransferase 74E2, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_038885149.12.8e-21280.30mogroside IE synthase-like [Benincasa hispida] >XP_038885150.1 mogroside IE synt... [more]
XP_023546480.11.4e-21178.83UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1KD051.3e-21580.78Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111492133 PE=3 SV=1[more]
A0A6J1HCL42.7e-21379.70Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111462850 PE=3 SV=1[more]
A0A1S3BCU21.9e-20677.47Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488485 PE=3 SV=1[more]
A0A0A0KD632.1e-20576.82Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G366280 PE=3 SV=1[more]
A0A6J1GJP11.3e-16263.93Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111454485 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G05680.15.9e-10443.57Uridine diphosphate glycosyltransferase 74E2 [more]
AT1G05675.15.6e-10242.92UDP-Glycosyltransferase superfamily protein [more]
AT2G31750.12.9e-9842.80UDP-glucosyl transferase 74D1 [more]
AT2G43840.25.0e-9542.83UDP-glycosyltransferase 74 F1 [more]
AT2G43820.11.1e-9442.52UDP-glucosyltransferase 74F2 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 275..442
e-value: 7.3E-26
score: 91.0
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 16..450
e-value: 2.21595E-77
score: 245.153
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 255..442
e-value: 1.7E-133
score: 448.0
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 19..454
e-value: 1.7E-133
score: 448.0
NoneNo IPR availablePANTHERPTHR11926:SF1330GLYCOSYLTRANSFERASEcoord: 14..462
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 14..462
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 15..461
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 340..383

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0018419.1Tan0018419.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0008194 UDP-glycosyltransferase activity