Tan0017414 (gene) Snake gourd v1

Overview
Name: Tan0017414
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Nucleotide-diphospho-sugar transferase family protein
Location: LG11: 16400372 .. 16404671 (-)
RNA-Seq Expression: Tan0017414
Synteny: Tan0017414
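
The Location field gives a linkage group, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention, not stated on this page), the genomic span of the gene can be computed as below. The function names `parse_location` and `span_length` are illustrative and are not part of any database API.

```python
import re


def parse_location(text):
    """Split a location such as 'LG11: 16400372 .. 16404671 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both end points are counted.
    return end - start + 1


chrom, start, end, strand = parse_location("LG11: 16400372 .. 16404671 (-)")
print(chrom, strand, span_length(start, end))  # LG11 - 4300
```

Under that assumption the feature spans 4300 bp on the minus strand of LG11.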
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTAGCTACTCCTCCTTCCTCCCCTTCCGTCGTACCGTTCAGATCATTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCCTTGTTGTTTTAAGAGAACTCCACTCCCTTCGCTACTTCCCTCTCTTCTCCTTCACTACTTTCTCTGAATCTCCTCCTGTTTCGCCCTTCCTCCCCTCCCTCGGTGATGACGACGAGCCTTCTGCGGTGAGCTCATTCGGTTTCATTATTTAATCTTCCGTTCATGCTTTGCTTTATTATTAATTGATTAGTTTTTGTGTGTTTTTGTTCTTTTGGCGTTTCTGTTTATCATTCCAGTTTCCTTAATCGGGTTTCTTGATTCTTGTTCGTTTCGAATTGTGAGATTGAGTTTCATTGATGGTGAAAGTGGACTTCGATGAATACGCCTTGATGATGACGGGTGTGATAAATTATCTTTTTATTCGGCTTGCATGCCTGCACTCTGTGTGTTCTTCTATTTCTTATCTCTCCCTCTCTGTCTCTTTCAGACGTGAGCGCAGAGTTGATTTGTTTTCAACTTTCAATTTCAACCTTCAAATCTTTGCTCCATTCTGCTTATTTTGCTTGCCTCTGGGTCTGGTATTGAAGGTGCTAATGTTATTTGGAGGGACAATTCTCGGTTGTCGGTTCCCAATTGCAGAATCCCTTTGGGCGGATAATTTCATATTGCATTTAGGTTGTGTCTAATATCGTTATCTTCGAAGCTTTTTTTGTTTTCCTACTTAAAATTCTAGCATTTTAGTTCGAAATAACCAATGTGTCGTCCATTTTCTTTCCTTTTTTTAAATAAGAGTTGCGTGATTTTCCATGTAGTCAAGGATATGATGTTATGTTGTGAATCTCTGAAGAGTAAAAAGCTTACTGTTGTTTTGGAAGATGAATTTACCTGCCAGGATAACATATAGTAGCACGGTGTGGCTGATATGTACAATGTATGGGAATGTAGCAAGACCTTCTTATTTTATTACTACTTGCCTATAAACTATTATGCTCGAGCATTGAAATGTTATGGACTGGACTTTTCTTTTATGCTTCAATTTATACTTTTAGTGCCTCTTTTGGCTGATTATAGGACGCTGATGAGTATGGACTGGACAAGGTCTTAAACGATGCTGCAACAGAAGATAGAACTGTTATTTTAACTACTTTGAATGAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCTTTAGAATTGGAAATCGAACTCGCCAACTGTTAAACCATTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGCTTGGCTATCCATATCCATTGCTTTGCTCTTGTTACTGAAGGAGTTGATTTTCATTCAGAGGCATATTTTATGACACCTGACTACTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTATTCTTGAGACGGGGTACAATTTTGTTTTCACGGTATCATACTCTCTCCTCTCACCTTGAGATTGTTGCTTCCATCTTTTGATTTTCTTTTGAAGGTTGAGATGAGTAATATCATGTCGAGGTAAATCGTTTACTACCAGGCAACTGCATATTGACTTTTACTATCATGACCAATATGGCACAAAAAAGTTTCAGTTTTGAAGAATGCTATCATGGATTTTTTTTAGTACAACAAATATGGAGGGGTGAGAATTTGAACCTCCGAGCTCAAGGAAAGGAGTAGATATTGACACTATCATAGATATTTGTTGAGAAAAGATCCAGATTGTGGAGAAATGAGAAATGTTTGACTATAATAATCTAATGCTTAGGAAGGATGAATAAACAGTATGCTAGTAAATCGATAATGGGTGGGCAATTAATCTATTTCAGTGAATTTTTATATTACTAGTGAGGAAATTTAGAGTTTGCCTTGTGTAGGAATCTTTTTGGAATTATATAATTTTTTTTAATATATATGGTATACAATAAACAACCACATAGATATATGATACATCTCGAGTGTGGTAGGGGC
AACTCTTAAAATATAGCTGTACGCTTGGACACGCTAAATCTATTCTTTTTTACTTAAGAGTTTCAAGGGAAAGTTACGATATTTGCATTTACAATATAAATGAAAGTAGTGACAAGGCTTAGGTTACCTGTGAGGAAGTTGAAGATGCCAATAATTTATTTCTATGTTACACAGGTGTATCAACGGGTCTTTCCAGTTGATACTCTAATTTGTGTATTAAACATGTATCGGGGTGTCTTAAGTCTTATTGGGGTTATCCAACATATAACAGGATGTCTCGAGTCTAAAGTGTTGGACAGCATACACTATCAAAACAAGTGTTTGTACTTCATAACTATATGACATGGCGAATAGGTATCAAACTGGGTGTCAATGTATCTTTTTTCTGGTTTGCTTAAGACGTCCATTTTCCAATGGATGAACCTATTCTCTTTGTGAACTCAATTTCCTCGGGGAGTTCAGGGAGAGGCAGGGAAGGAAAAACTCCTTTATTAAATTTATCTCACCTCCGCTCCACAAGTCCATTTTAAATTATGAATTTACAGTCACTCCAGTGACCACTTCGATGTCTTCCTCTTTCACTATTGCAACATGCTCGGGTGGTTTCTTCAGGGGGATATTTTTTGGATGGGGCAAAATTTCTTTATCAATCAAATGTCACCTCTGCACTCTAGGTTGATTTAAGATAAGGAATTAGGATCATTTCATTGCCCATTTGGTTGTCTTCCTCCTCCACTATCAGCAGTAACCCCCATGTTGGGAAACATAGAAACCCCACTATTTTCTCCTTTGCCTTTCATTTCAGTAGTCAAATTTCTGCGTACGGTAGCAGGACATGCCTTTCTTAAACTTTTGTGGTTTGAAATCTGCTGTTTAGCTGGTCAACGTCTCATACTTTCACCACTCTTGGCTTGTTTACCATTTACTTTCTTTAGGATATAACCAGTTTTTACTTTCATGTATGTTGAAGGATGGCAATTCACATGCTTAGATCAATATATAATATTACTAGTTTTTATTTTTGTATCTCTTTTTAATTCCCCTACACGTGTTTCTTCCTTCAGGATGCTGATGTTATGTGGTTCAGGGATCCATTCCCATTCTTTGATATGGATGCAGATTTCCAAATTGCTTGTGATCAATACCTTGGCCTCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTGAAGTCCAATAATCGGTCAATTGAGTTTTACAAATACTGGTACTCATCTCGAGAAACTTATCCAGAATACCATGATCAGGATGTTCTTAATAAGATCAAATACGATTCTTTCATTGACGACATTGGGTTAAAGATTAGGTTCTTGGATACTGCTTATTTTGGTGGGTTCTGTCAACCCAGCAAAGATTTGAATCGTGTATTAACCATGCATGCAAACTGCTGTATTGGAATGGAAAGTAAGCTTCATGATCTTAGAATTATGCTCGAGGATTGGAAACGATACATGTCCATGCCGCCATATGTTAAGGGATCATCAACTTGGACAGTTCCTCAGAACTGCAGGTATGGTACCATTTCTAGCTCTTGAACAATTTTGCGCTTTAATTAGATATGTTTCTGTATTCTTAAATTTCAATCTTTTATATTGGATCCTTTTATTCCTCTTGTTGTTCCAGTGTCTGAGTTTTGTCCCTATGATTCTCCAAGCAAAGAATTGATGAATTCTACCCAACTCGGAGTTACATATTGAGCATCATCTTTTCACTTCTCATTTCGAGTTTTGTGGCCGTTAAAGATATTTCTACATTACAACTTACACCAAAGTGAATGTTAAAGAAGGCATATCTGGTGTAGCAGCAGTATTCTTCGTTTGCTAGTTTCTGTAAATTTATACCAAATAAATTTTGAAGTCCAGCAATTGTACATCACAAAATCATGAAAGCCTGAAGAACACAGCCTATAACCAACATTAGGTAGCTTGCTTGCTAGGAAATTTGGTCAAATGATCAATTTTGCTTCAAGG
TATAGCTAATTTTCCTTAGATTATAATGGGTTATGTAGAAGTATTCTATCCCGTAGACATGAATCCCCAATTATATTATATTATATATATCATAAGAGTTTACTACAGAAGCATATGACCATATACAACCAATACACACAAACATGGGAGATAAAACAAGCTACAATATTGTCTCGTCTCACTCTCCTAAGAGACAAGGAAATCAATTGCTTTTCGTGACAAGAATGACTCGATCCTAACACACTCATTACACATCATTTCTACGATAGTGTCAAAGAATTCGATGCATTAAAAGCATAG

mRNA sequence

ATGTTTAGCTACTCCTCCTTCCTCCCCTTCCGTCGTACCGTTCAGATCATTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCCTTGTTGTTTTAAGAGAACTCCACTCCCTTCGCTACTTCCCTCTCTTCTCCTTCACTACTTTCTCTGAATCTCCTCCTGTTTCGCCCTTCCTCCCCTCCCTCGGTGATGACGACGAGCCTTCTGCGGACGCTGATGAGTATGGACTGGACAAGGTCTTAAACGATGCTGCAACAGAAGATAGAACTGTTATTTTAACTACTTTGAATGAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCTTTAGAATTGGAAATCGAACTCGCCAACTGTTAAACCATTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGCTTGGCTATCCATATCCATTGCTTTGCTCTTGTTACTGAAGGAGTTGATTTTCATTCAGAGGCATATTTTATGACACCTGACTACTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTATTCTTGAGACGGGGTACAATTTTGTTTTCACGGATGCTGATGTTATGTGGTTCAGGGATCCATTCCCATTCTTTGATATGGATGCAGATTTCCAAATTGCTTGTGATCAATACCTTGGCCTCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTGAAGTCCAATAATCGGTCAATTGAGTTTTACAAATACTGGTACTCATCTCGAGAAACTTATCCAGAATACCATGATCAGGATGTTCTTAATAAGATCAAATACGATTCTTTCATTGACGACATTGGGTTAAAGATTAGGTTCTTGGATACTGCTTATTTTGGTGGGTTCTGTCAACCCAGCAAAGATTTGAATCGTGTATTAACCATGCATGCAAACTGCTGTATTGGAATGGAAAGTAAGCTTCATGATCTTAGAATTATGCTCGAGGATTGGAAACGATACATGTCCATGCCGCCATATGTTAAGGGATCATCAACTTGGACAGTTCCTCAGAACTGCAGTGTCAAAGAATTCGATGCATTAAAAGCATAG

Coding sequence (CDS)

ATGTTTAGCTACTCCTCCTTCCTCCCCTTCCGTCGTACCGTTCAGATCATTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCCTTGTTGTTTTAAGAGAACTCCACTCCCTTCGCTACTTCCCTCTCTTCTCCTTCACTACTTTCTCTGAATCTCCTCCTGTTTCGCCCTTCCTCCCCTCCCTCGGTGATGACGACGAGCCTTCTGCGGACGCTGATGAGTATGGACTGGACAAGGTCTTAAACGATGCTGCAACAGAAGATAGAACTGTTATTTTAACTACTTTGAATGAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCTTTAGAATTGGAAATCGAACTCGCCAACTGTTAAACCATTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGCTTGGCTATCCATATCCATTGCTTTGCTCTTGTTACTGAAGGAGTTGATTTTCATTCAGAGGCATATTTTATGACACCTGACTACTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTATTCTTGAGACGGGGTACAATTTTGTTTTCACGGATGCTGATGTTATGTGGTTCAGGGATCCATTCCCATTCTTTGATATGGATGCAGATTTCCAAATTGCTTGTGATCAATACCTTGGCCTCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTGAAGTCCAATAATCGGTCAATTGAGTTTTACAAATACTGGTACTCATCTCGAGAAACTTATCCAGAATACCATGATCAGGATGTTCTTAATAAGATCAAATACGATTCTTTCATTGACGACATTGGGTTAAAGATTAGGTTCTTGGATACTGCTTATTTTGGTGGGTTCTGTCAACCCAGCAAAGATTTGAATCGTGTATTAACCATGCATGCAAACTGCTGTATTGGAATGGAAAGTAAGCTTCATGATCTTAGAATTATGCTCGAGGATTGGAAACGATACATGTCCATGCCGCCATATGTTAAGGGATCATCAACTTGGACAGTTCCTCAGAACTGCAGTGTCAAAGAATTCGATGCATTAAAAGCATAG

Protein sequence

MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPSLGDDDEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRTRQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLRTILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDLNRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGSSTWTVPQNCSVKEFDALKA
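
The protein sequence above corresponds to the CDS read three nucleotides at a time. The following is a small sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is written here for illustration and is not the tool the database used. It is exercised only on the first and last 18 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTTAGCTACTCCTCC"))  # first 18 nt of the CDS -> MFSYSS
print(translate("GATGCATTAAAAGCATAG"))  # last 18 nt of the CDS  -> DALKA
```

Both fragments agree with the start (MFSYSS...) and end (...DALKA) of the protein sequence listed above.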
Homology
BLAST of Tan0017414 vs. ExPASy Swiss-Prot
Match: P0C042 (Uncharacterized protein At4g15970 OS=Arabidopsis thaliana OX=3702 GN=At4g15970 PE=2 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 6.5e-87
Identity = 151/276 (54.71%), Positives = 197/276 (71.38%), Query Frame = 0

Query: 76  LDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRTRQLLNHLVIIALDKKA 135
           L K+L +AATED+TVI+TTLN+AW+ PNS  DLFL SF +G  T+ LL HLV+  LD++A
Sbjct: 42  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 101

Query: 136 FVRCLAIHIH-CFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLRTILETGYNFVFTDAD 195
           + RC  +H H C+ + T G+DF  +  FMTPDYLKMMWRRI+FL T+L+  YNF+FT   
Sbjct: 102 YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFT--- 161

Query: 196 VMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSSRE 255
                 PFP    + DFQIACD+Y G   D+ N  NGGF +VK+N R+I+FY YWY SR 
Sbjct: 162 -----IPFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFAFVKANQRTIDFYNYWYMSRL 221

Query: 256 TYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDLNRVLTMHANCCIGME 315
            YP+ HDQDVL++IK   +   IGLK+RFLDT YFGGFC+PS+DL++V TMHANCC+G+E
Sbjct: 222 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 281

Query: 316 SKLHDLRIMLEDWKRYMSMPPYVKGS-STWTVPQNC 350
           +K+ DLR ++ DW+ Y+S      G   TW  P+NC
Sbjct: 282 NKIKDLRQVIVDWENYVSAAKTTDGQIMTWRDPENC 309
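
The percentages in an HSP header are the match counts divided by the alignment length. The sketch below recomputes them from the header of the alignment above; the helper name `check_hsp_header` is hypothetical, and rounding to two decimals is an assumption about how the report was formatted.

```python
import re


def check_hsp_header(line):
    """Return {label: (recomputed_percent, reported_percent)} for each 'x/y (z%)' field."""
    results = {}
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    for label, hits, length, reported in re.findall(pattern, line):
        # Assumption: the reported value is the ratio rounded to two decimals.
        recomputed = round(100 * int(hits) / int(length), 2)
        results[label] = (recomputed, float(reported))
    return results


header = "Identity = 151/276 (54.71%), Positives = 197/276 (71.38%), Query Frame = 0"
for label, (recomputed, reported) in check_hsp_header(header).items():
    print(label, recomputed, reported)
```

The same check applies to any other HSP header in this report, since they all follow the same layout.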

BLAST of Tan0017414 vs. ExPASy Swiss-Prot
Match: Q3E6Y3 (Uncharacterized protein At1g28695 OS=Arabidopsis thaliana OX=3702 GN=At1g28695 PE=2 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 1.8e-52
Identity = 108/272 (39.71%), Positives = 162/272 (59.56%), Query Frame = 0

Query: 83  AATEDRTVILTTLNEAWASP----NSVIDLFLESFRIGNRTRQLLNHLVIIALDKKAFVR 142
           AA  ++TVI+T +N+A+       ++++DLFLESF  G  T  LL+HL+++A+D+ A+ R
Sbjct: 53  AAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAVDQTAYDR 112

Query: 143 CLAIHIHCFALVTE-GVDFHSEAYFMTPDYLKMMWRRIDFLRTILETGYNFVFTDADVMW 202
           C    +HC+ + TE GVD   E  FM+ D+++MMWRR   +  +L  GYN +FTD DVMW
Sbjct: 113 CRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIFTDTDVMW 172

Query: 203 FRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSSRETYP 262
            R P    +M  D QI+ D+ + +   L N    GF +V+SNN++I  ++ WY  R    
Sbjct: 173 LRSPLSRLNMSLDMQISVDR-INVGGQLINT---GFYHVRSNNKTISLFQKWYDMRLNST 232

Query: 263 EYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDLNRVLTMHANCCIGMESKL 322
              +QDVL  +    F + +GL + FL T  F GFCQ S  +  V T+HANCC+ + +K+
Sbjct: 233 GMKEQDVLKNLLDSGFFNQLGLNVGFLSTTEFSGFCQDSPHMGVVTTVHANCCLHIPAKV 292

Query: 323 HDLRIMLEDWKRYMS------MPPYVKGSSTW 344
            DL  +L DWKRY +        P++K S +W
Sbjct: 293 FDLTRVLRDWKRYKASHVNSKWSPHLKCSRSW 320

BLAST of Tan0017414 vs. ExPASy Swiss-Prot
Match: Q54RP0 (UDP-galactose:fucoside alpha-3-galactosyltransferase OS=Dictyostelium discoideum OX=44689 GN=agtA PE=1 SV=1)

HSP 1 Score: 46.6 bits (109), Expect = 6.9e-04
Identity = 25/68 (36.76%), Positives = 41/68 (60.29%), Query Frame = 0

Query: 181 ILETGYNFVFTDADVMWFRDPFPFF--DMDADFQIACDQYLGL-PDDLDNRPNGGFNYVK 240
           +L+ GYN ++TD D++W RDPF  F  D++ + Q   D  + L     D+    GF +++
Sbjct: 121 VLKKGYNVLWTDTDIVWKRDPFIHFYQDINQENQFTNDDDIDLYVQQDDDDICAGFYFIR 180

Query: 241 SNNRSIEF 246
           SN R+I+F
Sbjct: 181 SNQRTIKF 188

BLAST of Tan0017414 vs. NCBI nr
Match: XP_022990900.1 (uncharacterized protein At4g15970-like [Cucurbita maxima])

HSP 1 Score: 636.7 bits (1641), Expect = 1.2e-178
Identity = 311/352 (88.35%), Positives = 328/352 (93.18%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 60
           M SYSS  PFRRT+QI LLFAAISLSC+VVLRE+ SLR FPLFS TTFS+S PVS FLPS
Sbjct: 6   MLSYSSSFPFRRTLQIFLLFAAISLSCVVVLREVDSLRDFPLFSLTTFSDSSPVSLFLPS 65

Query: 61  LGDD-DEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           L DD +EP +D DE+GLD VL DAATEDRTVILTTLN+AWASPNSVIDLFLESFRIGNRT
Sbjct: 66  LDDDYNEPFSDTDEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 125

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
            QLLNHLVIIALDKKAFVRCL IHIHCFALVTEGVDFHSEA+FMTPDYLKMMWRRIDFLR
Sbjct: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 185

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDM+ADFQIACDQYLG+PDDL NRPNGGFNYVKSN
Sbjct: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPDDLSNRPNGGFNYVKSN 245

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETYP+YHDQDVLNKIKY+ FIDDIGLKIRFLDTAYFGGFC+PSKDL
Sbjct: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGS--STWTVPQNC 350
           NRVLTMHANCC+GM+SKLHDLRIMLEDWKRYMSMPPYVKGS  S W VPQNC
Sbjct: 306 NRVLTMHANCCVGMKSKLHDLRIMLEDWKRYMSMPPYVKGSSISVWRVPQNC 357

BLAST of Tan0017414 vs. NCBI nr
Match: KAG6601909.1 (hypothetical protein SDJN03_07142, partial [Cucurbita argyrosperma subsp. sororia] >KAG7032607.1 hypothetical protein SDJN02_06657, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 631.3 bits (1627), Expect = 5.0e-177
Identity = 309/362 (85.36%), Positives = 330/362 (91.16%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 60
           M SYSS   FRRT+QI LLFAAISLSCLVVLREL SLR FPLFS TTFS+S P S FLPS
Sbjct: 1   MLSYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 60

Query: 61  LGDD-DEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           L DD  EP ADADE+GLD VL DAATEDRTVILTTLN+AWASPNSVIDLFLESFRIGNRT
Sbjct: 61  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 120

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
            QLLNHLVIIALDKKAFVRCL IHIHCFALVTEGVDFHSEA+FMTPDYLKMMWRRIDFLR
Sbjct: 121 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 180

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDM+ADFQIACDQYLG+P+DL NRPNGGFNYVKSN
Sbjct: 181 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 240

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETYP++HDQDVLNKIKY+ FIDDIGLKIRFLDTAYFGGFC+PSKDL
Sbjct: 241 NRSIEFYKYWYSSRETYPKFHDQDVLNKIKYEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 300

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGSST--WTVPQNCSVKEFDAL 360
           NRVLTMHANCC+G++SKLHDLRIMLEDWKRYMSMPPYVKGSS+  W VPQ C      ++
Sbjct: 301 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSSSSVWRVPQYCRFGNISSI 360

BLAST of Tan0017414 vs. NCBI nr
Match: XP_022939876.1 (uncharacterized protein At4g15970-like [Cucurbita moschata])

HSP 1 Score: 626.7 bits (1615), Expect = 1.2e-175
Identity = 303/354 (85.59%), Positives = 325/354 (91.81%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVS-PFLP 60
           MFSY S + FRRT+QI+LLF AISL+CLV+ REL S RYFPLFSF+TFS SPP + PF P
Sbjct: 14  MFSYHSSISFRRTLQILLLFTAISLACLVIFRELDSFRYFPLFSFSTFSASPPPAFPFFP 73

Query: 61  SLGDDDEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           SL DDDEPSADADEY L KVL DAATE+RTVILTTLNEAWA+PNSVIDLFLESFRIGN+T
Sbjct: 74  SLADDDEPSADADEYELGKVLKDAATENRTVILTTLNEAWATPNSVIDLFLESFRIGNQT 133

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
           RQLLNHLVIIA DKKAF+RCLAIH+HCF+LVTEGVDFHSEAYFM+PDYLKMMWRRIDFLR
Sbjct: 134 RQLLNHLVIIAFDKKAFIRCLAIHVHCFSLVTEGVDFHSEAYFMSPDYLKMMWRRIDFLR 193

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDMDADFQIACD YLG+PDDLDNRPNGGFNYVKSN
Sbjct: 194 TVLEMGYNFVFTDADVMWFRDPFPFFDMDADFQIACDHYLGIPDDLDNRPNGGFNYVKSN 253

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETY  YHDQDVLNKIKYD FI +IGLKI FLDTAYFGGFC+PSKDL
Sbjct: 254 NRSIEFYKYWYSSRETYLGYHDQDVLNKIKYDFFIYEIGLKIIFLDTAYFGGFCEPSKDL 313

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGS--STWTVPQNCSV 352
           NRVLTMHANCCIGM +KLHDLRIMLEDWK YMSMPPY+K S  S+W VPQNCS+
Sbjct: 314 NRVLTMHANCCIGMNNKLHDLRIMLEDWKHYMSMPPYLKASSNSSWRVPQNCSI 367

BLAST of Tan0017414 vs. NCBI nr
Match: XP_022923665.1 (uncharacterized protein At4g15970-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 625.9 bits (1613), Expect = 2.1e-175
Identity = 307/362 (84.81%), Positives = 326/362 (90.06%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 60
           M  YSS   FRRT+QI LLFAAISLSCLVVLREL SLR FPLFS TTFS+S P S FLPS
Sbjct: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65

Query: 61  LGDD-DEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           L DD  EP ADADE+GLD VL DAATEDRTVILTTLN+AWASPNSVIDLFLES RIGNRT
Sbjct: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
            QLLNHLVIIALDKKAFVRCL IHIHCFALVTEGVDFHSEA FMTPDYLKMMWRRIDFLR
Sbjct: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDM+ADFQIACDQYLG+P+DL NRPNGGFNYVKSN
Sbjct: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETYP+YHDQDVLNKIK++ FIDDIGLKIRFLDTAYFGGFC+PSKDL
Sbjct: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKG--SSTWTVPQNCSVKEFDAL 360
           NRVLTMHANCC+G++SKLHDLRIMLEDWKRYMSMPPYVKG  SS W VPQ C      ++
Sbjct: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYCRFGNISSI 365

BLAST of Tan0017414 vs. NCBI nr
Match: XP_022923668.1 (uncharacterized protein At4g15970-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 625.9 bits (1613), Expect = 2.1e-175
Identity = 307/362 (84.81%), Positives = 326/362 (90.06%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 60
           M  YSS   FRRT+QI LLFAAISLSCLVVLREL SLR FPLFS TTFS+S P S FLPS
Sbjct: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65

Query: 61  LGDD-DEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           L DD  EP ADADE+GLD VL DAATEDRTVILTTLN+AWASPNSVIDLFLES RIGNRT
Sbjct: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
            QLLNHLVIIALDKKAFVRCL IHIHCFALVTEGVDFHSEA FMTPDYLKMMWRRIDFLR
Sbjct: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDM+ADFQIACDQYLG+P+DL NRPNGGFNYVKSN
Sbjct: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETYP+YHDQDVLNKIK++ FIDDIGLKIRFLDTAYFGGFC+PSKDL
Sbjct: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKG--SSTWTVPQNCSVKEFDAL 360
           NRVLTMHANCC+G++SKLHDLRIMLEDWKRYMSMPPYVKG  SS W VPQ C      ++
Sbjct: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYCRFGNISSI 365

BLAST of Tan0017414 vs. ExPASy TrEMBL
Match: A0A6J1JRD3 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111487650 PE=3 SV=1)

HSP 1 Score: 636.7 bits (1641), Expect = 5.8e-179
Identity = 311/352 (88.35%), Positives = 328/352 (93.18%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 60
           M SYSS  PFRRT+QI LLFAAISLSC+VVLRE+ SLR FPLFS TTFS+S PVS FLPS
Sbjct: 6   MLSYSSSFPFRRTLQIFLLFAAISLSCVVVLREVDSLRDFPLFSLTTFSDSSPVSLFLPS 65

Query: 61  LGDD-DEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           L DD +EP +D DE+GLD VL DAATEDRTVILTTLN+AWASPNSVIDLFLESFRIGNRT
Sbjct: 66  LDDDYNEPFSDTDEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 125

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
            QLLNHLVIIALDKKAFVRCL IHIHCFALVTEGVDFHSEA+FMTPDYLKMMWRRIDFLR
Sbjct: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 185

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDM+ADFQIACDQYLG+PDDL NRPNGGFNYVKSN
Sbjct: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPDDLSNRPNGGFNYVKSN 245

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETYP+YHDQDVLNKIKY+ FIDDIGLKIRFLDTAYFGGFC+PSKDL
Sbjct: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGS--STWTVPQNC 350
           NRVLTMHANCC+GM+SKLHDLRIMLEDWKRYMSMPPYVKGS  S W VPQNC
Sbjct: 306 NRVLTMHANCCVGMKSKLHDLRIMLEDWKRYMSMPPYVKGSSISVWRVPQNC 357

BLAST of Tan0017414 vs. ExPASy TrEMBL
Match: A0A6J1FH13 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111445609 PE=3 SV=1)

HSP 1 Score: 626.7 bits (1615), Expect = 6.0e-176
Identity = 303/354 (85.59%), Positives = 325/354 (91.81%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVS-PFLP 60
           MFSY S + FRRT+QI+LLF AISL+CLV+ REL S RYFPLFSF+TFS SPP + PF P
Sbjct: 14  MFSYHSSISFRRTLQILLLFTAISLACLVIFRELDSFRYFPLFSFSTFSASPPPAFPFFP 73

Query: 61  SLGDDDEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           SL DDDEPSADADEY L KVL DAATE+RTVILTTLNEAWA+PNSVIDLFLESFRIGN+T
Sbjct: 74  SLADDDEPSADADEYELGKVLKDAATENRTVILTTLNEAWATPNSVIDLFLESFRIGNQT 133

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
           RQLLNHLVIIA DKKAF+RCLAIH+HCF+LVTEGVDFHSEAYFM+PDYLKMMWRRIDFLR
Sbjct: 134 RQLLNHLVIIAFDKKAFIRCLAIHVHCFSLVTEGVDFHSEAYFMSPDYLKMMWRRIDFLR 193

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDMDADFQIACD YLG+PDDLDNRPNGGFNYVKSN
Sbjct: 194 TVLEMGYNFVFTDADVMWFRDPFPFFDMDADFQIACDHYLGIPDDLDNRPNGGFNYVKSN 253

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETY  YHDQDVLNKIKYD FI +IGLKI FLDTAYFGGFC+PSKDL
Sbjct: 254 NRSIEFYKYWYSSRETYLGYHDQDVLNKIKYDFFIYEIGLKIIFLDTAYFGGFCEPSKDL 313

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGS--STWTVPQNCSV 352
           NRVLTMHANCCIGM +KLHDLRIMLEDWK YMSMPPY+K S  S+W VPQNCS+
Sbjct: 314 NRVLTMHANCCIGMNNKLHDLRIMLEDWKHYMSMPPYLKASSNSSWRVPQNCSI 367

BLAST of Tan0017414 vs. ExPASy TrEMBL
Match: A0A6J1ECI2 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431304 PE=3 SV=1)

HSP 1 Score: 625.9 bits (1613), Expect = 1.0e-175
Identity = 307/362 (84.81%), Positives = 326/362 (90.06%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 60
           M  YSS   FRRT+QI LLFAAISLSCLVVLREL SLR FPLFS TTFS+S P S FLPS
Sbjct: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65

Query: 61  LGDD-DEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           L DD  EP ADADE+GLD VL DAATEDRTVILTTLN+AWASPNSVIDLFLES RIGNRT
Sbjct: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
            QLLNHLVIIALDKKAFVRCL IHIHCFALVTEGVDFHSEA FMTPDYLKMMWRRIDFLR
Sbjct: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDM+ADFQIACDQYLG+P+DL NRPNGGFNYVKSN
Sbjct: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETYP+YHDQDVLNKIK++ FIDDIGLKIRFLDTAYFGGFC+PSKDL
Sbjct: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKG--SSTWTVPQNCSVKEFDAL 360
           NRVLTMHANCC+G++SKLHDLRIMLEDWKRYMSMPPYVKG  SS W VPQ C      ++
Sbjct: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYCRFGNISSI 365

BLAST of Tan0017414 vs. ExPASy TrEMBL
Match: A0A6J1EA97 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431304 PE=3 SV=1)

HSP 1 Score: 625.9 bits (1613), Expect = 1.0e-175
Identity = 307/362 (84.81%), Positives = 326/362 (90.06%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 60
           M  YSS   FRRT+QI LLFAAISLSCLVVLREL SLR FPLFS TTFS+S P S FLPS
Sbjct: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65

Query: 61  LGDD-DEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 120
           L DD  EP ADADE+GLD VL DAATEDRTVILTTLN+AWASPNSVIDLFLES RIGNRT
Sbjct: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125

Query: 121 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 180
            QLLNHLVIIALDKKAFVRCL IHIHCFALVTEGVDFHSEA FMTPDYLKMMWRRIDFLR
Sbjct: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185

Query: 181 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 240
           T+LE GYNFVFTDADVMWFRDPFPFFDM+ADFQIACDQYLG+P+DL NRPNGGFNYVKSN
Sbjct: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245

Query: 241 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 300
           NRSIEFYKYWYSSRETYP+YHDQDVLNKIK++ FIDDIGLKIRFLDTAYFGGFC+PSKDL
Sbjct: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305

Query: 301 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKG--SSTWTVPQNCSVKEFDAL 360
           NRVLTMHANCC+G++SKLHDLRIMLEDWKRYMSMPPYVKG  SS W VPQ C      ++
Sbjct: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYCRFGNISSI 365

BLAST of Tan0017414 vs. ExPASy TrEMBL
Match: A0A6J1CZL5 (Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111015662 PE=3 SV=1)

HSP 1 Score: 618.6 bits (1594), Expect = 1.6e-173
Identity = 298/355 (83.94%), Positives = 326/355 (91.83%), Query Frame = 0

Query: 1   MFSYSSFLPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSP--FL 60
           MF+YSS LPFRRT+QI+LL AAISLSCLV+LRE HSL Y  LFSF+TFS++PP+S   F 
Sbjct: 28  MFAYSSSLPFRRTLQILLLSAAISLSCLVLLRESHSLPYLHLFSFSTFSDTPPLSSPYFA 87

Query: 61  PSLGDDDEPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNR 120
            SLGD DEPS++ADEYGL++VL DAATEDRT+ILTTLNEAWASP+SVIDLFLESFRIGN 
Sbjct: 88  SSLGDVDEPSSEADEYGLERVLKDAATEDRTIILTTLNEAWASPSSVIDLFLESFRIGNH 147

Query: 121 TRQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFL 180
           TRQLLNHLVIIALDKKAF+RCLA+HIHCFALVTEGVDFH EAYFMTPDYLKMMWRRIDFL
Sbjct: 148 TRQLLNHLVIIALDKKAFIRCLAVHIHCFALVTEGVDFHLEAYFMTPDYLKMMWRRIDFL 207

Query: 181 RTILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKS 240
           RT+LE GYNFVFTDADVMWFRDPFP FDMDADFQIACD YLG+P+DLDNRPNGGF +VKS
Sbjct: 208 RTVLEMGYNFVFTDADVMWFRDPFPIFDMDADFQIACDHYLGIPEDLDNRPNGGFAFVKS 267

Query: 241 NNRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKD 300
           NNRSIEFYKYWYSSRETYP YHDQDVLNKIKYD  IDDIGLK RFLDTAYFGGFC+PSKD
Sbjct: 268 NNRSIEFYKYWYSSRETYPGYHDQDVLNKIKYDPLIDDIGLKFRFLDTAYFGGFCEPSKD 327

Query: 301 LNRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGSS--TWTVPQNCSV 352
           LN V+TMHANCCIGM SKLHDLRIM+EDWK++MS+PPYVK SS  +W VPQNCS+
Sbjct: 328 LNLVITMHANCCIGMNSKLHDLRIMIEDWKQFMSLPPYVKSSSILSWRVPQNCSI 382

BLAST of Tan0017414 vs. TAIR 10
Match: AT1G14590.1 (Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 435.6 bits (1119), Expect = 3.7e-122
Identity = 222/346 (64.16%), Positives = 262/346 (75.72%), Query Frame = 0

Query: 8   LPFRRTVQIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPSLGDDDEP 67
           +P RR     L  AAIS+SC V+ R   SL +           SPP+   L S  D++EP
Sbjct: 39  IPLRRAA---LFLAAISISCFVLYRAADSLSF-----------SPPIFD-LSSYLDNEEP 98

Query: 68  SADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRTRQLLNHLV 127
                   L+ VL+ AAT DRTV+LTTLN AWA+P SVIDLF ESFRIG  T Q+L+HLV
Sbjct: 99  K-------LEDVLSKAATRDRTVVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLV 158

Query: 128 IIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLRTILETGYN 187
           I+ALD KA+ RCL +H HCF+LVTEGVDF  EAYFMT  YLKMMWRRID LR++LE GYN
Sbjct: 159 IVALDAKAYSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYN 218

Query: 188 FVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSNNRSIEFYK 247
           FVFTDADVMWFR+PFP F M ADFQIACD YLG  +DL NRPNGGFN+V+SNNR+I FYK
Sbjct: 219 FVFTDADVMWFRNPFPRFYMYADFQIACDHYLGRSNDLHNRPNGGFNFVRSNNRTILFYK 278

Query: 248 YWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDLNRVLTMHA 307
           YWY+SR  +P YHDQDVLN +K + F+  IGLK+RFL+TAYFGG C+PS+DLN V TMHA
Sbjct: 279 YWYASRLRFPGYHDQDVLNFLKAEPFVFRIGLKMRFLNTAYFGGLCEPSRDLNLVRTMHA 338

Query: 308 NCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGSS--TWTVPQNCSV 352
           NCC GMESKLHDLRIML+DWK +MS+P ++K SS  +W VPQNCS+
Sbjct: 339 NCCYGMESKLHDLRIMLQDWKDFMSLPLHLKQSSGFSWKVPQNCSL 362

BLAST of Tan0017414 vs. TAIR 10
Match: AT2G02061.1 (Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 403.7 bits (1036), Expect = 1.6e-112
Identity = 202/317 (63.72%), Positives = 240/317 (75.71%), Query Frame = 0

Query: 38  RYFPLFSFTTFSESPPVSPFLPSLGDDDEPSADADEYGLDKVLNDAATEDRTVILTTLNE 97
           R FP  + ++ S SP      PSL  +     + +E  L++VL  AAT+D TVILTTLNE
Sbjct: 80  RIFPSVNDSSSSPSPS-----PSLSPE-----EIEEPKLEEVLRRAATKDGTVILTTLNE 139

Query: 98  AWASPNSVIDLFLESFRIGNRTRQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFH 157
           AWA+P SVIDLF ESFRIG  TR+LL HLVIIALD KA+ RC  +H HCF L TEGVDF 
Sbjct: 140 AWAAPGSVIDLFFESFRIGKGTRRLLKHLVIIALDAKAYSRCQELHKHCFRLETEGVDFS 199

Query: 158 -SEAYFMTPDYLKMMWRRIDFLRTILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACD 217
             EAYFMTP YL MMWRRI FLR++LE GYNFVFTDADVMWFR+PF  F  D DFQIACD
Sbjct: 200 GGEAYFMTPSYLTMMWRRISFLRSVLEKGYNFVFTDADVMWFRNPFRRFYEDGDFQIACD 259

Query: 218 QYLGLPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDD 277
            Y+G P+D  NRPNGGF +V++NNRSI FYK+WY SR  YP+ HDQDVLN IK D F+  
Sbjct: 260 HYIGRPNDFRNRPNGGFTFVRANNRSIGFYKFWYDSRTKYPKNHDQDVLNFIKTDPFLWK 319

Query: 278 IGLKIRFLDTAYFGGFCQPSKDLNRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPY 337
           + ++IRFL+T YFGGFC+PSKDLN V TMHANCC G++SKLHDLRIML+DW+ + S+P +
Sbjct: 320 LRIRIRFLNTVYFGGFCEPSKDLNLVCTMHANCCFGLDSKLHDLRIMLQDWRDFKSLPLH 379

Query: 338 VKGSS--TWTVPQNCSV 352
              SS  TW+VPQNCS+
Sbjct: 380 SNQSSGFTWSVPQNCSL 386

BLAST of Tan0017414 vs. TAIR 10
Match: AT5G44820.1 (Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 382.9 bits (982), Expect = 2.9e-106
Identity = 181/350 (51.71%), Positives = 250/350 (71.43%), Query Frame = 0

Query: 2   FSYSSFLPFRRTV-QIILLFAAISLSCLVVLRELHSLRYFPLFSFTTFSESPPVSPFLPS 61
           F  S F+  R+ + +I++LF  ++ SCLV+ +  + L+   + + T+   SP  SP LP+
Sbjct: 18  FMDSGFIIGRKELTRILILFLGLTASCLVLYKTAYPLQRLNVSNLTSLQASP--SPLLPN 77

Query: 62  LGDDD-EPSADADEYGLDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRT 121
           L   +  P     +    ++L +A+T++ TVI+TTLN+AWA PNS+ DLFLESFRIG  T
Sbjct: 78  LNSSEISPETTKPKLSFKEILENASTKNNTVIITTLNQAWAEPNSLFDLFLESFRIGQGT 137

Query: 122 RQLLNHLVIIALDKKAFVRCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLR 181
           +QLL H+V++ LD KAF RC  +H +C+ + T   DF  E  + TPDYLKMMW RID L 
Sbjct: 138 QQLLKHVVVVCLDIKAFERCSQLHTNCYHIETSETDFSGEKVYNTPDYLKMMWARIDLLT 197

Query: 182 TILETGYNFVFTDADVMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSN 241
            +LE G+NF+FTDAD+MW RDPFP    D DFQ+ACD++ G P D DN  NGGF YV+SN
Sbjct: 198 QVLEMGFNFIFTDADIMWLRDPFPRLYPDGDFQMACDRFFGNPYDSDNWVNGGFTYVRSN 257

Query: 242 NRSIEFYKYWYSSRETYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDL 301
           NRSIEFYK+W+ SR  YP+ HDQDV N+IK++ FI +IG+++RF DT YFGGFCQ S+D+
Sbjct: 258 NRSIEFYKFWHKSRLDYPDLHDQDVFNRIKHEPFISEIGIQMRFFDTVYFGGFCQTSRDI 317

Query: 302 NRVLTMHANCCIGMESKLHDLRIMLEDWKRYMSMPPYVKGSSTWTVPQNC 350
           N V TMHANCCIG++ KLHDL ++L+DW++Y+S+   V+ ++TW+VP  C
Sbjct: 318 NLVCTMHANCCIGLDKKLHDLNLVLDDWRKYLSLSEPVQ-NTTWSVPMKC 364

BLAST of Tan0017414 vs. TAIR 10
Match: AT4G19970.1 (CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069); BEST Arabidopsis thaliana protein match is: Nucleotide-diphospho-sugar transferase family protein (TAIR:AT5G44820.1); Has 801 Blast hits to 466 proteins in 35 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 750; Viruses - 0; Other Eukaryotes - 49 (source: NCBI BLink).)

HSP 1 Score: 372.9 bits (956), Expect = 3.0e-103
Identity = 165/273 (60.44%), Positives = 213/273 (78.02%), Query Frame = 0

Query: 78  KVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRTRQLLNHLVIIALDKKAFV 137
           +VL +A+TE+RTVI+TTLN+AWA PNS+ DLFLESFRIG  T++LL H+V++ LD KAF 
Sbjct: 444 EVLENASTENRTVIVTTLNQAWAEPNSLFDLFLESFRIGQGTKKLLQHVVVVCLDSKAFA 503

Query: 138 RCLAIHIHCFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLRTILETGYNFVFTDADVMW 197
           RC  +H +C+ L T G DF  E  F TPDYLKMMWRRI+ L  +LE GYNF+FTDAD+MW
Sbjct: 504 RCSQLHPNCYYLKTTGTDFSGEKLFATPDYLKMMWRRIELLTQVLEMGYNFIFTDADIMW 563

Query: 198 FRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSSRETYP 257
            RDPFP    D DFQ+ACD++ G P D DN  NGGF YVKSN+RSIEFYK+WY+SR  YP
Sbjct: 564 LRDPFPRLYPDGDFQMACDRFFGDPHDSDNWVNGGFTYVKSNHRSIEFYKFWYNSRLDYP 623

Query: 258 EYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDLNRVLTMHANCCIGMESKL 317
           + HDQDV N+IK+ + + +IG+++RF DT YFGGFCQ S+D+N V TMHANCC+G+  KL
Sbjct: 624 KMHDQDVFNQIKHKALVSEIGIQMRFFDTVYFGGFCQTSRDINLVCTMHANCCVGLAKKL 683

Query: 318 HDLRIMLEDWKRYMSMPPYVKGSSTWTVPQNCS 351
           HDL ++L+DW+ Y+S+   VK ++TW+VP  C+
Sbjct: 684 HDLNLVLDDWRNYLSLSEPVK-NTTWSVPMKCT 715

BLAST of Tan0017414 vs. TAIR 10
Match: AT4G15970.1 (Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 323.2 bits (827), Expect = 2.7e-88
Identity = 151/276 (54.71%), Positives = 197/276 (71.38%), Query Frame = 0

Query: 76  LDKVLNDAATEDRTVILTTLNEAWASPNSVIDLFLESFRIGNRTRQLLNHLVIIALDKKA 135
           L K+L +AATED+TVI+TTLN+AW+ PNS  DLFL SF +G  T+ LL HLV+  LD++A
Sbjct: 33  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 92

Query: 136 FVRCLAIHIH-CFALVTEGVDFHSEAYFMTPDYLKMMWRRIDFLRTILETGYNFVFTDAD 195
           + RC  +H H C+ + T G+DF  +  FMTPDYLKMMWRRI+FL T+L+  YNF+FT   
Sbjct: 93  YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFT--- 152

Query: 196 VMWFRDPFPFFDMDADFQIACDQYLGLPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSSRE 255
                 PFP    + DFQIACD+Y G   D+ N  NGGF +VK+N R+I+FY YWY SR 
Sbjct: 153 -----IPFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFTFVKANQRTIDFYNYWYMSRL 212

Query: 256 TYPEYHDQDVLNKIKYDSFIDDIGLKIRFLDTAYFGGFCQPSKDLNRVLTMHANCCIGME 315
            YP+ HDQDVL++IK   +   IGLK+RFLDT YFGGFC+PS+DL++V TMHANCC+G+E
Sbjct: 213 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 272

Query: 316 SKLHDLRIMLEDWKRYMSMPPYVKGS-STWTVPQNC 350
           +K+ DLR ++ DW+ Y+S      G   TW  P+NC
Sbjct: 273 NKIKDLRQVIVDWENYVSAAKTTDGQIMTWRDPENC 300
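
The percentages on each HSP statistics line follow directly from the reported residue counts (matches divided by alignment length). The sketch below recomputes them from such a line as a consistency check. The function name `recompute_hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern assumes the line layout shown in the HSP blocks above.

```python
import re

# Assumed layout of the HSP statistics line, e.g.
# "Identity = 165/273 (60.44%), Positives = 213/273 (78.02%), Query Frame = 0".
# "Posi?tives" also tolerates the "Postives" spelling found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


if __name__ == "__main__":
    # Counts taken from the two TAIR 10 HSPs above.
    print(recompute_hsp_percentages(
        "Identity = 165/273 (60.44%), Positives = 213/273 (78.02%), Query Frame = 0"))
    print(recompute_hsp_percentages(
        "Identity = 151/276 (54.71%), Positives = 197/276 (71.38%), Query Frame = 0"))
```

Both percentages are taken over the alignment length (273 and 276 columns here, gaps included), not over the full length of the query protein.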

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
P0C042	6.5e-87	54.71	Uncharacterized protein At4g15970 OS=Arabidopsis thaliana OX=3702 GN=At4g15970 P...
Q3E6Y3	1.8e-52	39.71	Uncharacterized protein At1g28695 OS=Arabidopsis thaliana OX=3702 GN=At1g28695 P...
Q54RP0	6.9e-04	36.76	UDP-galactose:fucoside alpha-3-galactosyltransferase OS=Dictyostelium discoideum...
Match Name	E-value	Identity	Description
XP_022990900.1	1.2e-178	88.35	uncharacterized protein At4g15970-like [Cucurbita maxima]
KAG6601909.1	5.0e-177	85.36	hypothetical protein SDJN03_07142, partial [Cucurbita argyrosperma subsp. sorori...
XP_022939876.1	1.2e-175	85.59	uncharacterized protein At4g15970-like [Cucurbita moschata]
XP_022923665.1	2.1e-175	84.81	uncharacterized protein At4g15970-like isoform X1 [Cucurbita moschata]
XP_022923668.1	2.1e-175	84.81	uncharacterized protein At4g15970-like isoform X2 [Cucurbita moschata]
Match Name	E-value	Identity	Description
A0A6J1JRD3	5.8e-179	88.35	Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111487650 PE=3 SV=1
A0A6J1FH13	6.0e-176	85.59	Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111445609 PE=3 SV=1
A0A6J1ECI2	1.0e-175	84.81	Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431304 PE=3 SV=1
A0A6J1EA97	1.0e-175	84.81	Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431304 PE=3 SV=1
A0A6J1CZL5	1.6e-173	83.94	Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111015662 PE=3 SV=1
Match Name	E-value	Identity	Description
AT1G14590.1	3.7e-122	64.16	Nucleotide-diphospho-sugar transferase family protein
AT2G02061.1	1.6e-112	63.72	Nucleotide-diphospho-sugar transferase family protein
AT5G44820.1	2.9e-106	51.71	Nucleotide-diphospho-sugar transferase family protein
AT4G19970.1	3.0e-103	60.44	CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (I...
AT4G15970.1	2.7e-88	54.71	Nucleotide-diphospho-sugar transferase family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005069	Nucleotide-diphospho-sugar transferase	PFAM	PF03407	Nucleotid_trans	coord: 122..320; e-value: 5.4E-65; score: 219.3
None	No IPR available	PANTHER	PTHR46038:SF34	GLYCOSYLTRANSFERASE	coord: 6..353
IPR044821	Putative nucleotide-diphospho-sugar transferase At1g28695/At4g15970-like	PANTHER	PTHR46038	EXPRESSED PROTEIN-RELATED	coord: 6..353
IPR029044	Nucleotide-diphospho-sugar transferases	SUPERFAMILY	53448	Nucleotide-diphospho-sugar transferases	coord: 154..275
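
The `coord` values in the InterPro table give the residue range each signature covers on the protein. As a rough illustration, the sketch below converts a `coord: start..end` entry into a span length. It assumes the coordinates are 1-based and inclusive (this record does not state the convention), and `span_length` is a hypothetical helper name.

```python
import re

# Assumed entry format, as printed in the table: "coord: 122..320".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(entry):
    """Length in residues of a 'coord: start..end' range.

    Assumes 1-based, inclusive coordinates, hence the +1.
    """
    match = COORD.search(entry)
    if match is None:
        raise ValueError("no coord range found in %r" % entry)
    start, end = map(int, match.groups())
    if end < start:
        raise ValueError("end precedes start in %r" % entry)
    return end - start + 1


if __name__ == "__main__":
    # Ranges copied from the InterPro table above.
    for entry in ("coord: 122..320", "coord: 6..353", "coord: 154..275"):
        print(entry, "->", span_length(entry), "residues")
```

Under that assumption, the Pfam PF03407 match spans 199 residues, the PANTHER matches 348, and the SUPERFAMILY match 122.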

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0017414.1	Tan0017414.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0071555	cell wall organization
cellular_component	GO:0000139	Golgi membrane
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0016757	glycosyltransferase activity