Tan0001707 (gene) Snake gourd v1

Overview
Name: Tan0001707
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: glutelin type-A 2-like
Location: LG01: 7228944 .. 7230809 (+)
RNA-Seq Expression: Tan0001707
Synteny: Tan0001707
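
The Location field can be converted into a feature length with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the helper names parse_location and span_length are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Parse a string such as 'LG01: 7228944 .. 7230809 (+)'.

    Returns (seqid, start, end, strand). The format is inferred from
    this page only; other pages may differ.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive,
    # so both endpoints count toward the length.
    return end - start + 1


seqid, start, end, strand = parse_location("LG01: 7228944 .. 7230809 (+)")
print(seqid, strand, span_length(start, end))  # LG01 + 1866
```

Under that assumption the gene spans 1866 bp on the forward strand of LG01.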
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAACCGATGAATCCCAAGCCCTTCACTGAGGGAGATGCTGGATCGTATCACAAATGGCTTCCTTCTGAATATCCCTTACTTGCTCAGACCAAGGTCGCCGCCGGCCGCCTTCTCCTCCGCCCTCGCGGCTTCGTCGTTCCCCACTACGCCGATTGCTCTAAAGTGGGCTATGTTCTTCAAGGTTAACCTCTCTATCTCTACTACTTTTAGTTCTCTTATGTCAATTTATAGTTTAAGATTCGAACAAGAAATTTTCTCATAATAATTGATAAGGACAACGGATCCTCTACCTCGTTCGTAATTTTGCTTTTTTTTTTTGTTTAAGTCGAAATTAGACTTCATATAATGTCCCTTACTTAATTTATGATTTTTTCTTCCCTAATAGAATTTGCTTCAATTCATACAAAAATTTATGAAGGCGACAAATTTTATCATTCTCTCAACAATTTCGGTTGATTATTCCATATAAACTTTCTCTTGAAGATCTTCGTCATGAAAAAGACAAAAGAATTAAATAGATGTAAGGATCACAAATTTAAGACAACGAATTAATTAGAAAGAGAAATTTTTAAAGCTTTTAAATATTTTTGCTCAATACTTTCAAAGTTGTTTTGGAATAATTTTTCAAATCTTTAGAAAATGGGTTTTTCATATGTAAAACAAAAAAAAAAGATTTTTGAGTATTTGAAAATTCATTTTAAACATGTTTGTGTAATCCATGCACGAGGGCGGGAGCTCTCTTCAAATTTTTTATCTTTATATAGAAAAAGTAAAGTCAAATAATAGTAAATATGTTATTAACCTCACCCAAGTCCCAAATTCAAGTTTAGTTTGTACAATGTTAATAATTTTTCCAATGTCATATTGAATCTATGTGTTTGCAGGTGAAGATGGGGTTGCAGGATTGGTGTTTCCAAACAAGTCCGATGAAGTGGTAGTGAAACTTAAGAAAGGAGATCTGATTCCGGTGCCGGAAGGAGTCACGTCGTGGTGGTTCAACGACGGAGACTCCGATTTCGAGATTATCTTTTTGGGTGAAACCAAAACCGCTCATGTCGCCGGTGACATCTCTTACTTCATTCTCTCCGGCCCTCTTGGCTTCCTGCAAGGCTTCTCGCCGGAGTACGTCGGAAAACCTACTCTTTAAACGAAGAACAAACAACCACACTTCTCAAAAGCCAATCCAACGCCCTTATCTTCGCCCTTGCACAACCCCAATCCCTCCCCAAACCCCAAAAACACAGCAAACTAGTTTACAACATTGACGCCGCCGCGCCGGACACCACACCCAAGCCTAGCGGCGGCGGCGCCGTCACGACGGTGACGGAATCCAAATTTCCCTCCATTGGCCAATCTGGGTTGACGGCAATTCTTGAAAAGCTTGACGCCAACGCCGTTCGATCGCCGGTGTACGTTGCTGAGCCGTCCGATCAACTGATCTATGTGGCTAAAGGATTCGGGAAGATTCAGATTGTTGGATTTTCGAGTAAAGTTGATGCAGAGGTGAAAATGGGTCAGCTTATTTTAGTCCCCAAATACTTCGTCGCCGGAAAAATCGCCGGAGAAGAAGGCTTGGAGTGCTTCTCCATTATCACAGCTACACAGTAAAAACTAAAAAGCTTCAATTTTATTTTTTTTTTACTTTTTTATTTTTGGGGTTTTGTTTCTGAATCTTAAAATTTTTACATTTCTGATTTTGAATTGAACAGTCCTCTGGTGGAAGAATTGGCCGGAAAGACGTCGGTTTTCGAGGCATTGTCGCCGGAGATTCTTCAAGTTTCGTTCAACGTCACGGCGGAGTTCGAAAAGCTTCTTAGATCGAAGATCACAAAAACTTCACCAGTGATTCCACCTTCAGATTGA

mRNA sequence

ATGGAACCGATGAATCCCAAGCCCTTCACTGAGGGAGATGCTGGATCGTATCACAAATGGCTTCCTTCTGAATATCCCTTACTTGCTCAGACCAAGGTCGCCGCCGGCCGCCTTCTCCTCCGCCCTCGCGGCTTCGTCGTTCCCCACTACGCCGATTGCTCTAAAGTGGGCTATGTTCTTCAAGGTGAAGATGGGGTTGCAGGATTGGTGTTTCCAAACAAGTCCGATGAAGTGGTAGTGAAACTTAAGAAAGGAGATCTGATTCCGGTGCCGGAAGGAGTCACGTCGTGGTGGTTCAACGACGGAGACTCCGATTTCGAGATTATCTTTTTGGGTGAAACCAAAACCGCTCATGTCGCCGGTGACATCTCTTACTTCATTCTCTCCGGCCCTCTTGGCTTCCTGCAAGGCTTCTCGCCGGACCAATCCAACGCCCTTATCTTCGCCCTTGCACAACCCCAATCCCTCCCCAAACCCCAAAAACACAGCAAACTAGTTTACAACATTGACGCCGCCGCGCCGGACACCACACCCAAGCCTAGCGGCGGCGGCGCCGTCACGACGGTGACGGAATCCAAATTTCCCTCCATTGGCCAATCTGGGTTGACGGCAATTCTTGAAAAGCTTGACGCCAACGCCGTTCGATCGCCGGTGTACGTTGCTGAGCCGTCCGATCAACTGATCTATGTGGCTAAAGGATTCGGGAAGATTCAGATTGTTGGATTTTCGAGTAAAGTTGATGCAGAGGTGAAAATGGGTCAGCTTATTTTAGTCCCCAAATACTTCGTCGCCGGAAAAATCGCCGGAGAAGAAGGCTTGGAGTGCTTCTCCATTATCACAGCTACACATCCTCTGGTGGAAGAATTGGCCGGAAAGACGTCGGTTTTCGAGGCATTGTCGCCGGAGATTCTTCAAGTTTCGTTCAACGTCACGGCGGAGTTCGAAAAGCTTCTTAGATCGAAGATCACAAAAACTTCACCAGTGATTCCACCTTCAGATTGA

Coding sequence (CDS)

ATGGAACCGATGAATCCCAAGCCCTTCACTGAGGGAGATGCTGGATCGTATCACAAATGGCTTCCTTCTGAATATCCCTTACTTGCTCAGACCAAGGTCGCCGCCGGCCGCCTTCTCCTCCGCCCTCGCGGCTTCGTCGTTCCCCACTACGCCGATTGCTCTAAAGTGGGCTATGTTCTTCAAGGTGAAGATGGGGTTGCAGGATTGGTGTTTCCAAACAAGTCCGATGAAGTGGTAGTGAAACTTAAGAAAGGAGATCTGATTCCGGTGCCGGAAGGAGTCACGTCGTGGTGGTTCAACGACGGAGACTCCGATTTCGAGATTATCTTTTTGGGTGAAACCAAAACCGCTCATGTCGCCGGTGACATCTCTTACTTCATTCTCTCCGGCCCTCTTGGCTTCCTGCAAGGCTTCTCGCCGGACCAATCCAACGCCCTTATCTTCGCCCTTGCACAACCCCAATCCCTCCCCAAACCCCAAAAACACAGCAAACTAGTTTACAACATTGACGCCGCCGCGCCGGACACCACACCCAAGCCTAGCGGCGGCGGCGCCGTCACGACGGTGACGGAATCCAAATTTCCCTCCATTGGCCAATCTGGGTTGACGGCAATTCTTGAAAAGCTTGACGCCAACGCCGTTCGATCGCCGGTGTACGTTGCTGAGCCGTCCGATCAACTGATCTATGTGGCTAAAGGATTCGGGAAGATTCAGATTGTTGGATTTTCGAGTAAAGTTGATGCAGAGGTGAAAATGGGTCAGCTTATTTTAGTCCCCAAATACTTCGTCGCCGGAAAAATCGCCGGAGAAGAAGGCTTGGAGTGCTTCTCCATTATCACAGCTACACATCCTCTGGTGGAAGAATTGGCCGGAAAGACGTCGGTTTTCGAGGCATTGTCGCCGGAGATTCTTCAAGTTTCGTTCAACGTCACGGCGGAGTTCGAAAAGCTTCTTAGATCGAAGATCACAAAAACTTCACCAGTGATTCCACCTTCAGATTGA

Protein sequence

MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPDQSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
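
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly; the translate helper is hypothetical. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence to a protein string.

    A single trailing stop codon, if present, is dropped.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First eight codons of the CDS shown above
print(translate("ATGGAACCGATGAATCCCAAGCCC"))  # MEPMNPKP
# Last four codons of the CDS, including the TGA stop
print(translate("CCTTCAGATTGA"))  # PSD
```

Both fragments agree with the start (MEPMNPKP) and end (PSD) of the listed protein sequence.
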
Homology
BLAST of Tan0001707 vs. ExPASy Swiss-Prot
Match: Q09151 (Glutelin type-A 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA3 PE=2 SV=2)

HSP 1 Score: 94.0 bits (232), Expect = 3.5e-18
Identity = 92/408 (22.55%), Positives = 163/408 (39.95%), Query Frame = 0

Query: 31  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP------------------ 90
           T V   R ++ PRG ++PHY++ + + YV+QG  G+ G  FP                  
Sbjct: 79  TGVFVVRRVIEPRGLLLPHYSNGATLVYVIQGR-GITGPTFPGCPETYQQQFQQSEQDQQ 138

Query: 91  ----------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFL----------- 150
                         + + + ++GD++ +P GV  W +NDGD+    I++           
Sbjct: 139 LEGQSQSHKFRDEHQKIHRFQQGDVVALPAGVAHWCYNDGDAPIVAIYVTDIYNSANQLD 198

Query: 151 ---------GETK-------------TAHVAGDISYFILSGPLGFLQGFS------PDQS 210
                    G  K             + +V G  S  +LS  LG   G +       DQ 
Sbjct: 199 PRHRDFFLAGNNKIGQQLYRYEARDNSKNVFGGFSVELLSEALGISSGVARQLQCQNDQR 258

Query: 211 NALI-----FALAQP-QSLPKPQKHS-------------------------------KLV 270
             ++      +L QP  SL + Q+                                 ++ 
Sbjct: 259 GEIVRVEHGLSLLQPYASLQEQQQEQVQSRDYGQTQYQQKQLQGSCSNGLDETFCTMRVR 318

Query: 271 YNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ 330
            NID      T  P   G +T +   KFP +    ++A+   L  NA+ SP +    +  
Sbjct: 319 QNIDNPNLADTYNPR-AGRITYLNGQKFPILNLVQMSAVKVNLYQNALLSPFWNIN-AHS 378

Query: 331 LIYVAKGFGKIQIVGFSSKV--DAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHP 333
           ++Y+ +G  ++Q+V  + K   D E++ GQL+++P++ V  K A  EG    ++ T    
Sbjct: 379 VVYITQGRARVQVVNNNGKTVFDGELRRGQLLIIPQHHVVIKKAQREGCSYIALKTNPDS 438
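
The percentages in the HSP summary lines are ratios over the alignment length (gaps included), not over the 334-residue query. As a small illustration, using the figures reported for the Q09151 hit above, the values can be reproduced as follows; the percent helper is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Figures from the Q09151 HSP: 92 identical and 163 positive-scoring
# positions over an alignment of 408 columns (gaps included).
print(percent(92, 408))   # 22.55
print(percent(163, 408))  # 39.95
```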

BLAST of Tan0001707 vs. ExPASy Swiss-Prot
Match: P07730 (Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1)

HSP 1 Score: 88.2 bits (217), Expect = 1.9e-16
Identity = 84/395 (21.27%), Positives = 155/395 (39.24%), Query Frame = 0

Query: 31  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP------------------ 90
           T V+  R ++ PRG ++PHY + + + Y++QG  G+ G  FP                  
Sbjct: 80  TGVSVVRRVIEPRGLLLPHYTNGASLVYIIQGR-GITGPTFPGCPETYQQQFQQSGQAQL 139

Query: 91  ----------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAGD 150
                         + + + ++GD+I +P GV  W +NDG+     I++ +        D
Sbjct: 140 TESQSQSHKFKDEHQKIHRFRQGDVIALPAGVAHWCYNDGEVPVVAIYVTDINNGANQLD 199

Query: 151 ISY--FILSG---------------PLGFLQGFSP---------------------DQSN 210
                F+L+G                     GFS                      DQ  
Sbjct: 200 PRQRDFLLAGNKRNPQAYRREVEEWSQNIFSGFSTELLSEAFGISNQVARQLQCQNDQRG 259

Query: 211 ALI-----FALAQPQSLPKPQKHSKLV--------------------------------- 270
            ++      +L QP +  + Q+  ++                                  
Sbjct: 260 EIVRVERGLSLLQPYASLQEQEQGQMQSREHYQEGGYQQSQYGSGCPNGLDETFCTMRVR 319

Query: 271 YNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ 320
            NID      T  P   G VT +    FP +    ++A+   L  NA+ SP +    +  
Sbjct: 320 QNIDNPNRADTYNPR-AGRVTNLNSQNFPILNLVQMSAVKVNLYQNALLSPFWNIN-AHS 379

BLAST of Tan0001707 vs. ExPASy Swiss-Prot
Match: P07728 (Glutelin type-A 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA1 PE=1 SV=2)

HSP 1 Score: 86.7 bits (213), Expect = 5.6e-16
Identity = 81/395 (20.51%), Positives = 156/395 (39.49%), Query Frame = 0

Query: 31  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP------------------ 90
           T V+  R ++ PRG ++PHY + + + Y++QG  G+ G  FP                  
Sbjct: 80  TGVSVVRRVIEPRGLLLPHYTNGASLVYIIQGR-GITGPTFPGCPESYQQQFQQSGQAQL 139

Query: 91  ----------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLG---------- 150
                         + + + ++GD+I +P GV  W +NDG+     I++           
Sbjct: 140 TESQSQSQKFKDEHQKIHRFRQGDVIALPAGVAHWCYNDGEVPVVAIYVTDLNNGANQLD 199

Query: 151 ----------------------ETKTAHVAGDISYFILSGPLGFLQGFS------PDQSN 210
                                 E ++ ++    S  +LS  LG     +       DQ  
Sbjct: 200 PRQRDFLLAGNKRNPQAYRREVEERSQNIFSGFSTELLSEALGVSSQVARQLQCQNDQRG 259

Query: 211 ALI-----FALAQPQSLPKPQKHS---------------------------------KLV 270
            ++      +L QP +  + Q+                                   ++ 
Sbjct: 260 EIVRVEHGLSLLQPYASLQEQEQGQVQSRERYQEGQYQQSQYGSGCSNGLDETFCTLRVR 319

Query: 271 YNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ 320
            NID      T  P   G VT +    FP +    ++A+   L  NA+ SP +    +  
Sbjct: 320 QNIDNPNRADTYNPR-AGRVTNLNTQNFPILSLVQMSAVKVNLYQNALLSPFWNIN-AHS 379

BLAST of Tan0001707 vs. ExPASy Swiss-Prot
Match: A0A222NNM9 (Cocosin 1 OS=Cocos nucifera OX=13894 GN=COS-1 PE=1 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 8.9e-14
Identity = 82/378 (21.69%), Positives = 152/378 (40.21%), Query Frame = 0

Query: 33  VAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP-------------------- 92
           V+  R ++ PRG ++P  ++  ++ Y++QG  G+ GLV P                    
Sbjct: 81  VSTIRRVIEPRGLLLPSMSNAPRLVYIVQGR-GIVGLVMPGCPETFQSFQRSEREEGERH 140

Query: 93  ---NKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAGDISY--FI 152
                  + V + ++GD++ VP G   W +N+G++    I + +T       D S+  F+
Sbjct: 141 RWSRDEHQKVYQFQEGDVLAVPNGFAYWCYNNGENPVVAITVLDTSNDANQLDRSHRQFL 200

Query: 153 LSG---------------PLGFLQGFSPDQSNALI------------------------- 212
           L+G                   L+GFS +   A                           
Sbjct: 201 LAGRQEQGRQRYGREGSIKENILRGFSTELLAAAFGVNMELARKLQCRDDTRGEIVRAEN 260

Query: 213 -FALAQPQSLPKPQKHS--------------KLVYNIDAAAPDTTPKPSGGGAVTTVTES 272
              + +P  + + ++                K+  NI          P  GG +TT+   
Sbjct: 261 GLQVLRPSGMEEEEREEGRSINGFEETYCSMKIKQNIGDPRRADVFNPR-GGRITTLNSE 320

Query: 273 KFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKV--DAEV 329
           K P +    ++A    L  NA+ SP +    +  ++Y   G G++++     +   D E+
Sbjct: 321 KLPILRFIQMSAERVVLYRNAMVSPHWNIN-AHSIMYCTGGRGRVEVADDRGETVFDGEL 380

BLAST of Tan0001707 vs. ExPASy Swiss-Prot
Match: Q9ZWA9 (12S seed storage protein CRD OS=Arabidopsis thaliana OX=3702 GN=CRD PE=1 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 2.0e-13
Identity = 96/408 (23.53%), Positives = 158/408 (38.73%), Query Frame = 0

Query: 6   PKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDG 65
           P   T+ +AG    W     P L    V   R+ L+P    +P +     + YV+QGE G
Sbjct: 46  PAQATKFEAGQMEVW-DHMSPELRCAGVTVARITLQPNSIFLPAFFSPPALAYVVQGE-G 105

Query: 66  VAGLV---FPNKSDEV----------------------VVKLKKGDLIPVPEGVTSWWFN 125
           V G +    P    EV                      +   ++GD+     GV+ WW+N
Sbjct: 106 VMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVSQWWYN 165

Query: 126 DGDSDFEI-IFLGETKTAHVAGDI-SYFILSG--------PLGF------LQGFSPD--- 185
            GDSD  I I L  T   +    +   F L+G        PL +        GF P+   
Sbjct: 166 RGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFDPNIIA 225

Query: 186 -------------------------QSNALIFALAQPQ---------SLPKPQKHSKLVY 245
                                     +  L F +  P+          + +    +K+  
Sbjct: 226 EAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCTAKIHE 285

Query: 246 NIDAAAPDTTPK-PSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ 305
           NID   P+ +    +  G ++T+     P +    L A+   L +  +  P + A  +  
Sbjct: 286 NID--DPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTAN-AHT 345

Query: 306 LIYVAKGFGKIQIV--GFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHP 333
           ++YV  G  KIQ+V     S  + +V  GQ+I++P+ F   K AGE G E  S  T  + 
Sbjct: 346 VLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTNDNA 405

BLAST of Tan0001707 vs. NCBI nr
Match: XP_022922755.1 (legumin J-like [Cucurbita moschata])

HSP 1 Score: 549.3 bits (1414), Expect = 2.3e-152
Identity = 283/351 (80.63%), Positives = 299/351 (85.19%), Query Frame = 0

Query: 2   EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 61
           +PMNPKPFTE +AGSYHKWLPSEYPLLAQ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLAQNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAG 121
           GE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFNDGDSD EIIFLGE+K AHV G
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKH 181
           DISYF+LSGPL  L GFSP+                   QSNALIF++ Q QSLPKP K+
Sbjct: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPSKY 182

Query: 182 SKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAE 241
           SK VYNIDAAAPD   K  G GAVTTVTESKFP IGQSGLTAILEKL+ANAVRSPVYVAE
Sbjct: 183 SKFVYNIDAAAPDGRVK-GGAGAVTTVTESKFPFIGQSGLTAILEKLNANAVRSPVYVAE 242

Query: 242 PSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT 301
           P DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Sbjct: 243 PYDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITAT 302

Query: 302 HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           HP+VEELAGKTSV EALSPEI QVSFNVTAEFEKLLRSKIT  SPVI  SD
Sbjct: 303 HPVVEELAGKTSVLEALSPEIFQVSFNVTAEFEKLLRSKITNASPVIGSSD 352

BLAST of Tan0001707 vs. NCBI nr
Match: XP_022985328.1 (12S seed storage protein CRD-like [Cucurbita maxima])

HSP 1 Score: 547.7 bits (1410), Expect = 6.7e-152
Identity = 282/351 (80.34%), Positives = 298/351 (84.90%), Query Frame = 0

Query: 2   EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 61
           +PMNPKPFTE +AGSYHKWLPSEYPLLA  KVAAGRLLLRPRGFVVPHYADCSKVGYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLAHNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAG 121
           GE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFNDGDSD EIIFLGE+K AHV G
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKH 181
           DISYF+LSG L  L GFSP+                   QSNALIF++ Q QSLPKP K+
Sbjct: 123 DISYFVLSGILSLLNGFSPEYVGETYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPPKY 182

Query: 182 SKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAE 241
           SK VYNIDAAAPD   K  G GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAE
Sbjct: 183 SKFVYNIDAAAPDGRVK-GGAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAE 242

Query: 242 PSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT 301
           P DQLIYVAKG GKIQIVGFSSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Sbjct: 243 PYDQLIYVAKGRGKIQIVGFSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITAT 302

Query: 302 HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           HP+VEELAGKTSV EALSPE+ QVSFNVTAEFEKLLRSKIT  SPVI  SD
Sbjct: 303 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIRSSD 352

BLAST of Tan0001707 vs. NCBI nr
Match: KAG6576976.1 (12S seed storage protein CRD, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 547.0 bits (1408), Expect = 1.1e-151
Identity = 282/351 (80.34%), Positives = 298/351 (84.90%), Query Frame = 0

Query: 2   EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 61
           +PMNPKPFTE +AGSYHKWLPSEYPLLA+ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAG 121
           GE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFN+GDSD EIIFLGE+K AHV G
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNEGDSDLEIIFLGESKNAHVPG 122

Query: 122 DISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKH 181
           DISYF+LSGPL  L GFSP+                   QSNALIF++ Q QSLPKP K 
Sbjct: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPSKF 182

Query: 182 SKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAE 241
           SK VYNIDAAAPD   K  G GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAE
Sbjct: 183 SKFVYNIDAAAPDGRVK-GGAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAE 242

Query: 242 PSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT 301
           P DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Sbjct: 243 PYDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITAT 302

Query: 302 HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           HP+VEELAGKTSV EALSPEI QVSFNVTAEFEKLLRSKIT  SPVI  SD
Sbjct: 303 HPVVEELAGKTSVLEALSPEIFQVSFNVTAEFEKLLRSKITNASPVIGSSD 352

BLAST of Tan0001707 vs. NCBI nr
Match: XP_023552908.1 (12S seed storage globulin 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 540.0 bits (1390), Expect = 1.4e-149
Identity = 279/351 (79.49%), Positives = 295/351 (84.05%), Query Frame = 0

Query: 2   EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 61
           +PMNPKPFTE +AGSYHKWLPSEYPLLA+ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAG 121
           GE+GV GLVFP+KSDEVVV LKKGDLIPVP GV+SWWFNDGDSD EIIFLGE+K AHV G
Sbjct: 63  GENGVVGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKH 181
           DISYF+LSGPL  L GFSP+                   QSNALI ++ Q QSLPKP K 
Sbjct: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKF 182

Query: 182 SKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAE 241
           SK VYNIDAAAPD   K S  GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAE
Sbjct: 183 SKFVYNIDAAAPDGRVKGS-AGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAE 242

Query: 242 PSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT 301
           P DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GK AGE+GLEC SIITAT
Sbjct: 243 PYDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITAT 302

Query: 302 HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           HP+VEELAGKTSV EALSPE+ QVSFNVTAEFEKLLRSKIT  SPVI  SD
Sbjct: 303 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIGSSD 352

BLAST of Tan0001707 vs. NCBI nr
Match: XP_008456076.1 (PREDICTED: glutelin type-A 2-like [Cucumis melo] >KAA0039043.1 glutelin type-A 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 510.4 bits (1313), Expect = 1.2e-140
Identity = 255/341 (74.78%), Positives = 277/341 (81.23%), Query Frame = 0

Query: 1   MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVL 60
           ME MNPKPF EG+ GSY KWLPS+YPLLAQT VA GRLLLRPRGF VPHYADCSK GYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYLKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYADCSKFGYVL 60

Query: 61  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVA 120
           QGEDGV G VFPNK +EVV+KLKKGDLIPVP G+TSWWFNDGDSD EIIFLGETK AHV 
Sbjct: 61  QGEDGVTGFVFPNKCNEVVMKLKKGDLIPVPSGITSWWFNDGDSDLEIIFLGETKNAHVP 120

Query: 121 GDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQK 180
           GDI+YFILSGP G LQGF+P+                   QSN LIF +   QSLPKP K
Sbjct: 121 GDITYFILSGPRGLLQGFAPEYVQKSYSLSQEETNKFLKSQSNVLIFTVQPSQSLPKPHK 180

Query: 181 HSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVA 240
           HSKLVYNIDAA PD   K  G  AVT VTES FP IGQ+GLTA+LEKLDANA+RSPVY+A
Sbjct: 181 HSKLVYNIDAAVPDNRAK-VGAAAVTMVTESTFPFIGQTGLTAVLEKLDANAIRSPVYIA 240

Query: 241 EPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA 300
           EPSDQLIYV KG GKIQ+VGFSSK DA+VK+GQLILVP+YF  GK+AGEEGLEC S+I A
Sbjct: 241 EPSDQLIYVTKGSGKIQVVGFSSKFDADVKIGQLILVPRYFAVGKMAGEEGLECISMIVA 300

Query: 301 THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI 323
           THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Sbjct: 301 THPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Tan0001707 vs. ExPASy TrEMBL
Match: A0A6J1E9P2 (legumin J-like OS=Cucurbita moschata OX=3662 GN=LOC111430654 PE=4 SV=1)

HSP 1 Score: 549.3 bits (1414), Expect = 1.1e-152
Identity = 283/351 (80.63%), Positives = 299/351 (85.19%), Query Frame = 0

Query: 2   EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 61
           +PMNPKPFTE +AGSYHKWLPSEYPLLAQ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLAQNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAG 121
           GE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFNDGDSD EIIFLGE+K AHV G
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKH 181
           DISYF+LSGPL  L GFSP+                   QSNALIF++ Q QSLPKP K+
Sbjct: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPSKY 182

Query: 182 SKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAE 241
           SK VYNIDAAAPD   K  G GAVTTVTESKFP IGQSGLTAILEKL+ANAVRSPVYVAE
Sbjct: 183 SKFVYNIDAAAPDGRVK-GGAGAVTTVTESKFPFIGQSGLTAILEKLNANAVRSPVYVAE 242

Query: 242 PSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT 301
           P DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Sbjct: 243 PYDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITAT 302

Query: 302 HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           HP+VEELAGKTSV EALSPEI QVSFNVTAEFEKLLRSKIT  SPVI  SD
Sbjct: 303 HPVVEELAGKTSVLEALSPEIFQVSFNVTAEFEKLLRSKITNASPVIGSSD 352

BLAST of Tan0001707 vs. ExPASy TrEMBL
Match: A0A6J1JDB2 (12S seed storage protein CRD-like OS=Cucurbita maxima OX=3661 GN=LOC111483370 PE=4 SV=1)

HSP 1 Score: 547.7 bits (1410), Expect = 3.3e-152
Identity = 282/351 (80.34%), Positives = 298/351 (84.90%), Query Frame = 0

Query: 2   EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 61
           +PMNPKPFTE +AGSYHKWLPSEYPLLA  KVAAGRLLLRPRGFVVPHYADCSKVGYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLAHNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVAG 121
           GE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFNDGDSD EIIFLGE+K AHV G
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKH 181
           DISYF+LSG L  L GFSP+                   QSNALIF++ Q QSLPKP K+
Sbjct: 123 DISYFVLSGILSLLNGFSPEYVGETYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPPKY 182

Query: 182 SKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAE 241
           SK VYNIDAAAPD   K  G GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAE
Sbjct: 183 SKFVYNIDAAAPDGRVK-GGAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAE 242

Query: 242 PSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT 301
           P DQLIYVAKG GKIQIVGFSSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Sbjct: 243 PYDQLIYVAKGRGKIQIVGFSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITAT 302

Query: 302 HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           HP+VEELAGKTSV EALSPE+ QVSFNVTAEFEKLLRSKIT  SPVI  SD
Sbjct: 303 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIRSSD 352

BLAST of Tan0001707 vs. ExPASy TrEMBL
Match: A0A5A7T7U8 (Glutelin type-A 2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold84G001330 PE=3 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 5.8e-141
Identity = 255/341 (74.78%), Positives = 277/341 (81.23%), Query Frame = 0

Query: 1   MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVL 60
           ME MNPKPF EG+ GSY KWLPS+YPLLAQT VA GRLLLRPRGF VPHYADCSK GYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYLKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYADCSKFGYVL 60

Query: 61  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVA 120
           QGEDGV G VFPNK +EVV+KLKKGDLIPVP G+TSWWFNDGDSD EIIFLGETK AHV 
Sbjct: 61  QGEDGVTGFVFPNKCNEVVMKLKKGDLIPVPSGITSWWFNDGDSDLEIIFLGETKNAHVP 120

Query: 121 GDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQK 180
           GDI+YFILSGP G LQGF+P+                   QSN LIF +   QSLPKP K
Sbjct: 121 GDITYFILSGPRGLLQGFAPEYVQKSYSLSQEETNKFLKSQSNVLIFTVQPSQSLPKPHK 180

Query: 181 HSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVA 240
           HSKLVYNIDAA PD   K  G  AVT VTES FP IGQ+GLTA+LEKLDANA+RSPVY+A
Sbjct: 181 HSKLVYNIDAAVPDNRAK-VGAAAVTMVTESTFPFIGQTGLTAVLEKLDANAIRSPVYIA 240

Query: 241 EPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA 300
           EPSDQLIYV KG GKIQ+VGFSSK DA+VK+GQLILVP+YF  GK+AGEEGLEC S+I A
Sbjct: 241 EPSDQLIYVTKGSGKIQVVGFSSKFDADVKIGQLILVPRYFAVGKMAGEEGLECISMIVA 300

Query: 301 THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI 323
           THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Sbjct: 301 THPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Tan0001707 vs. ExPASy TrEMBL
Match: A0A1S3C2D5 (glutelin type-A 2-like OS=Cucumis melo OX=3656 GN=LOC103496119 PE=3 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 5.8e-141
Identity = 255/341 (74.78%), Positives = 277/341 (81.23%), Query Frame = 0

Query: 1   MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVL 60
           ME MNPKPF EG+ GSY KWLPS+YPLLAQT VA GRLLLRPRGF VPHYADCSK GYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYLKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYADCSKFGYVL 60

Query: 61  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVA 120
           QGEDGV G VFPNK +EVV+KLKKGDLIPVP G+TSWWFNDGDSD EIIFLGETK AHV 
Sbjct: 61  QGEDGVTGFVFPNKCNEVVMKLKKGDLIPVPSGITSWWFNDGDSDLEIIFLGETKNAHVP 120

Query: 121 GDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQK 180
           GDI+YFILSGP G LQGF+P+                   QSN LIF +   QSLPKP K
Sbjct: 121 GDITYFILSGPRGLLQGFAPEYVQKSYSLSQEETNKFLKSQSNVLIFTVQPSQSLPKPHK 180

Query: 181 HSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVA 240
           HSKLVYNIDAA PD   K  G  AVT VTES FP IGQ+GLTA+LEKLDANA+RSPVY+A
Sbjct: 181 HSKLVYNIDAAVPDNRAK-VGAAAVTMVTESTFPFIGQTGLTAVLEKLDANAIRSPVYIA 240

Query: 241 EPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA 300
           EPSDQLIYV KG GKIQ+VGFSSK DA+VK+GQLILVP+YF  GK+AGEEGLEC S+I A
Sbjct: 241 EPSDQLIYVTKGSGKIQVVGFSSKFDADVKIGQLILVPRYFAVGKMAGEEGLECISMIVA 300

Query: 301 THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI 323
           THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Sbjct: 301 THPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Tan0001707 vs. ExPASy TrEMBL
Match: A0A0A0L6K0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218160 PE=4 SV=1)

HSP 1 Score: 506.9 bits (1304), Expect = 6.4e-140
Identity = 254/341 (74.49%), Positives = 275/341 (80.65%), Query Frame = 0

Query: 1   MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVL 60
           ME MNPKPF EG+ GSYHKWLPS+YPLLAQT VA GRLLLRPRGF VPHY+DCSK GYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60

Query: 61  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVA 120
           QGEDGV G VFP K +EVV+KLKKGDLIPVP GVTSWWFNDGDSD EIIFLGETK AHV 
Sbjct: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120

Query: 121 GDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQK 180
           GDI+YFILSGP G LQGF+P+                   Q N LIF +   QSLPKP K
Sbjct: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180

Query: 181 HSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVA 240
           +SKLVYNIDAAAPD   K  G  AVT VTES FP IGQ+GLT +LEKLDANA+RSPVY+A
Sbjct: 181 YSKLVYNIDAAAPDNRAK-VGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIA 240

Query: 241 EPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA 300
           EPSDQLIYV KG GKIQ+VGFSSK DA+VK GQLILVP+YF  GKIAGEEGLEC S+I A
Sbjct: 241 EPSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVA 300

Query: 301 THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI 323
           THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Sbjct: 301 THPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Tan0001707 vs. TAIR 10
Match: AT1G07750.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 240.7 bits (613), Expect = 1.6e-63
Identity = 131/354 (37.01%), Positives = 193/354 (54.52%), Query Frame = 0

Query: 1   MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVL 60
           + P  PK    GD GSY  W P E P+L Q  + A +L L   GF VP Y+D SKV YVL
Sbjct: 5   LTPKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKVAYVL 64

Query: 61  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVA 120
           QG  G AG+V P K +E V+ +K+GD I +P GV +WWFN+ D +  I+FLGET   H A
Sbjct: 65  QG-SGTAGIVLPEK-EEKVIAIKQGDSIALPFGVVTWWFNNEDPELVILFLGETHKGHKA 124

Query: 121 GDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQK 180
           G  + F L+G  G   GFS +                   Q+   I  L     +P+P++
Sbjct: 125 GQFTEFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKMPQPKE 184

Query: 181 HSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVA 240
            ++  + ++            GG V  +     P +G+ G  A L ++DA+++ SP +  
Sbjct: 185 ENRAGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDAHSMCSPGFSC 244

Query: 241 EPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSII 300
           + + Q+ Y+  G G++Q+VG   K  ++  +K G L +VP++FV  KIA  +G+  FSI+
Sbjct: 245 DSALQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMSWFSIV 304

Query: 301 TATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           T   P+   LAG TSV+++LSPE+LQ +F V  E EK  RS  T ++   PPS+
Sbjct: 305 TTPDPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSSAIFFPPSN 356

BLAST of Tan0001707 vs. TAIR 10
Match: AT2G28680.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 234.2 bits (596), Expect = 1.5e-61
Identity = 130/354 (36.72%), Positives = 189/354 (53.39%), Query Frame = 0

Query: 1   MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVL 60
           + P  PK    GD GSY  W P E P+L    + A +L L   G  +P Y+D  KV YVL
Sbjct: 5   LSPRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKVAYVL 64

Query: 61  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGETKTAHVA 120
           QG  G AG+V P K +E V+ +KKGD I +P GV +WWFN+ D++  ++FLGET   H A
Sbjct: 65  QGA-GTAGIVLPEK-EEKVIAIKKGDSIALPFGVVTWWFNNEDTELVVLFLGETHKGHKA 124

Query: 121 GDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQK 180
           G  + F L+G  G   GFS +                   Q+   I  +     +P+P+K
Sbjct: 125 GQFTDFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKMPEPKK 184

Query: 181 HSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVA 240
             +  + ++            GG V  +     P +G+ G  A L ++D +++ SP +  
Sbjct: 185 GDRKGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDGHSMCSPGFSC 244

Query: 241 EPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSII 300
           + + Q+ Y+  G G++QIVG   K  ++  VK G L +VP++FV  KIA  +GL  FSI+
Sbjct: 245 DSALQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLSWFSIV 304

Query: 301 TATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD 334
           T   P+   LAG+TSV++ALSPE+LQ +F V  E EK  RSK T  +    PS+
Sbjct: 305 TTPDPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTSDAIFFSPSN 356

BLAST of Tan0001707 vs. TAIR 10
Match: AT1G03890.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 78.2 bits (191), Expect = 1.4e-14
Identity = 96/408 (23.53%), Positives = 158/408 (38.73%), Query Frame = 0

Query: 6   PKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDG 65
           P   T+ +AG    W     P L    V   R+ L+P    +P +     + YV+QGE G
Sbjct: 46  PAQATKFEAGQMEVW-DHMSPELRCAGVTVARITLQPNSIFLPAFFSPPALAYVVQGE-G 105

Query: 66  VAGLV---FPNKSDEV----------------------VVKLKKGDLIPVPEGVTSWWFN 125
           V G +    P    EV                      +   ++GD+     GV+ WW+N
Sbjct: 106 VMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVSQWWYN 165

Query: 126 DGDSDFEI-IFLGETKTAHVAGDI-SYFILSG--------PLGF------LQGFSPD--- 185
            GDSD  I I L  T   +    +   F L+G        PL +        GF P+   
Sbjct: 166 RGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFDPNIIA 225

Query: 186 -------------------------QSNALIFALAQPQ---------SLPKPQKHSKLVY 245
                                     +  L F +  P+          + +    +K+  
Sbjct: 226 EAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCTAKIHE 285

Query: 246 NIDAAAPDTTPK-PSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ 305
           NID   P+ +    +  G ++T+     P +    L A+   L +  +  P + A  +  
Sbjct: 286 NID--DPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTAN-AHT 345

Query: 306 LIYVAKGFGKIQIV--GFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHP 333
           ++YV  G  KIQ+V     S  + +V  GQ+I++P+ F   K AGE G E  S  T  + 
Sbjct: 346 VLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTNDNA 405

BLAST of Tan0001707 vs. TAIR 10
Match: AT1G03880.1 (cruciferin 2 )

HSP 1 Score: 70.5 bits (171), Expect = 2.9e-12
Identity = 89/411 (21.65%), Positives = 160/411 (38.93%), Query Frame = 0

Query: 1   MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVL 60
           +  + P    + + G    W     P L  +  A  R ++ P+G  +P + +  K+ +V+
Sbjct: 35  LNALEPSQIIKSEGGRIEVW-DHHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFVV 94

Query: 61  QGEDGVAGLVFPNKSD------------------------EVVVKLKKGDLIPVPEGVTS 120
            G  G+ G V P  ++                        + V  L+ GD I  P GV  
Sbjct: 95  HGR-GLMGRVIPGCAETFMESPVFGEGQGQGQSQGFRDMHQKVEHLRCGDTIATPSGVAQ 154

Query: 121 WWFNDGDSDFEIIFLGE--TKTAHVAGDISYFILSG--PLG--FLQGFSPDQSNALI--F 180
           W++N+G+    ++   +  +    +  ++  F+++G  P G  +LQG    + N +   F
Sbjct: 155 WFYNNGNEPLILVAAADLASNQNQLDRNLRPFLIAGNNPQGQEWLQGRKQQKQNNIFNGF 214

Query: 181 A---LAQ---------------------------PQSLPKP---------QKHS------ 240
           A   LAQ                           P  + +P         Q H       
Sbjct: 215 APEILAQAFKINVETAQQLQNQQDNRGNIVKVNGPFGVIRPPLRRGEGGQQPHEIANGLE 274

Query: 241 ------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSP 300
                 +   N+D  +     KPS  G ++T+     P +    L+A+   +  NA+  P
Sbjct: 275 ETLCTMRCTENLDDPSDADVYKPS-LGYISTLNSYNLPILRLLRLSALRGSIRKNAMVLP 334

Query: 301 VYVAEPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLEC 327
            +    ++  +YV  G   IQ+V  + +   D E+  GQL++VP+ F   K A  E  E 
Sbjct: 335 QWNVN-ANAALYVTNGKAHIQMVNDNGERVFDQEISSGQLLVVPQGFSVMKHAIGEQFEW 394

BLAST of Tan0001707 vs. TAIR 10
Match: AT4G28520.3 (cruciferin 3 )

HSP 1 Score: 64.7 bits (156), Expect = 1.6e-10
Identity = 64/279 (22.94%), Positives = 115/279 (41.22%), Query Frame = 0

Query: 61  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFL--------- 120
           QG+ G  G        + V  +++GD+     G   W +N G+    II L         
Sbjct: 181 QGQQGQQGF---RDMHQKVEHVRRGDVFANTPGSAHWIYNSGEQPLVIIALLDIANYQNQ 240

Query: 121 --GETKTAHVAGDISYFILSGPLGFLQGFSPDQSNALIFALAQPQSLPKPQKHSKLVYNI 180
                +  H+AG       +   G   G    Q    +++    Q + +  K       I
Sbjct: 241 LDRNPRVFHLAG-------NNQQGGFGGSQQQQEQKNLWSGFDAQVIAQALK-------I 300

Query: 181 DAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIY 240
           D   P         G VT+V     P +    L+A    L  NA+  P Y    +++++Y
Sbjct: 301 DVYKPSL-------GRVTSVNSYTLPILEYVRLSATRGVLQGNAMVLPKYNMN-ANEILY 360

Query: 241 VAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVE 300
              G G+IQ+V  + +  +D +V+ GQL+++P+ F     +     E  S  T  + ++ 
Sbjct: 361 CTGGQGRIQVVNDNGQNVLDQQVQKGQLVVIPQGFAYVVQSHGNKFEWISFKTNENAMIS 420

Query: 301 ELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTS 327
            LAG+TS+  AL  E++   F ++ E  + ++    +T+
Sbjct: 421 TLAGRTSLLRALPLEVISNGFQISPEEARKIKFNTLETT 434
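
The percentages on each HSP statistics line are the match counts divided by the alignment length. A minimal Python sketch for reading such a line and recomputing the percentages is given here; `parse_hsp_stats` and `percent` are hypothetical helper names introduced for illustration and are not part of this database or of BLAST.

```python
import re

# Pattern for an HSP statistics line such as
# "Identity = 96/408 (23.53%), Positives = 158/408 (38.73%), Query Frame = 0".
# The "Positives" label is matched loosely in case its spelling differs.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(matches, length):
    """Recompute a percentage the way the report rounds it (two decimals)."""
    return round(100.0 * matches / length, 2)
```

For the AT1G03890.1 hit, `percent(96, 408)` gives 23.53 and `percent(158, 408)` gives 38.73, in agreement with the values reported for that HSP.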

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q09151          3.5e-18   22.55     Glutelin type-A 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA3 PE=2 SV=2
P07730          1.9e-16   21.27     Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1
P07728          5.6e-16   20.51     Glutelin type-A 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA1 PE=1 SV=2
A0A222NNM9      8.9e-14   21.69     Cocosin 1 OS=Cocos nucifera OX=13894 GN=COS-1 PE=1 SV=1
Q9ZWA9          2.0e-13   23.53     12S seed storage protein CRD OS=Arabidopsis thaliana OX=3702 GN=CRD PE=1 SV=1

Match Name      E-value   Identity  Description
XP_022922755.1  2.3e-152  80.63     legumin J-like [Cucurbita moschata]
XP_022985328.1  6.7e-152  80.34     12S seed storage protein CRD-like [Cucurbita maxima]
KAG6576976.1    1.1e-151  80.34     12S seed storage protein CRD, partial [Cucurbita argyrosperma subsp. sororia]
XP_023552908.1  1.4e-149  79.49     12S seed storage globulin 1-like [Cucurbita pepo subsp. pepo]
XP_008456076.1  1.2e-140  74.78     PREDICTED: glutelin type-A 2-like [Cucumis melo] >KAA0039043.1 glutelin type-A 2...

Match Name      E-value   Identity  Description
A0A6J1E9P2      1.1e-152  80.63     legumin J-like OS=Cucurbita moschata OX=3662 GN=LOC111430654 PE=4 SV=1
A0A6J1JDB2      3.3e-152  80.34     12S seed storage protein CRD-like OS=Cucurbita maxima OX=3661 GN=LOC111483370 PE...
A0A5A7T7U8      5.8e-141  74.78     Glutelin type-A 2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold8...
A0A1S3C2D5      5.8e-141  74.78     glutelin type-A 2-like OS=Cucumis melo OX=3656 GN=LOC103496119 PE=3 SV=1
A0A0A0L6K0      6.4e-140  74.49     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218160 PE=4 SV=1

Match Name      E-value   Identity  Description
AT1G07750.1     1.6e-63   37.01     RmlC-like cupins superfamily protein
AT2G28680.1     1.5e-61   36.72     RmlC-like cupins superfamily protein
AT1G03890.1     1.4e-14   23.53     RmlC-like cupins superfamily protein
AT1G03880.1     2.9e-12   21.65     cruciferin 2
AT4G28520.3     1.6e-10   22.94     cruciferin 3

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term     Source Description                        Alignment
IPR006045  Cupin 1                             SMART        SM00835         Cupin_1_3                                 coord: 2..148, e-value: 2.1E-18, score: 77.2
IPR006045  Cupin 1                             SMART        SM00835         Cupin_1_3                                 coord: 167..316, e-value: 5.5E-12, score: 55.8
IPR006045  Cupin 1                             PFAM         PF00190         Cupin_1                                   coord: 181..314, e-value: 1.3E-15, score: 57.3
IPR006045  Cupin 1                             PFAM         PF00190         Cupin_1                                   coord: 5..117, e-value: 1.0E-18, score: 67.4
IPR014710  RmlC-like jelly roll fold           GENE3D       2.60.120.10     Jelly Rolls                               coord: 1..138, e-value: 3.9E-25, score: 90.5
IPR014710  RmlC-like jelly roll fold           GENE3D       2.60.120.10     Jelly Rolls                               coord: 175..333, e-value: 2.6E-35, score: 123.2
None       No IPR available                    PANTHER      PTHR31189:SF45  11S GLOBULIN SEED STORAGE PROTEIN 2-LIKE  coord: 1..324
None       No IPR available                    PANTHER      PTHR31189       OS03G0336100 PROTEIN-RELATED              coord: 1..324
None       No IPR available                    CDD          cd02243         cupin_11S_legumin_C                       coord: 182..330, e-value: 2.8429E-59, score: 185.755
None       No IPR available                    CDD          cd02242         cupin_11S_legumin_N                       coord: 3..141, e-value: 9.05029E-51, score: 165.836
IPR011051  RmlC-like cupin domain superfamily  SUPERFAMILY  51182           RmlC-like cupins                          coord: 4..319


Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0001707.1   Tan0001707.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0045735      nutrient reservoir activity