Tan0021704 (gene) Snake gourd v1

Overview
NameTan0021704
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionvicilin-like seed storage protein At2g18540
LocationLG03: 74741442 .. 74743978 (+)
RNA-Seq ExpressionTan0021704
SyntenyTan0021704
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGAAATCCGCAGCAATTTCAGGATCGCCATTTTCATCGTCCGTTCTCATACCCATCTTCTTCCTTCTCCTCTCTCTGCCTTCAAATGCCGACGACGGACGGTGGGAAGGGGCCAGGCCTGAGGTGAAGAGAGCCAGCGAGAGAATATCACTCCTTAAAACAGAGTACGGCGAGATCTCCGCCATTGATTTCAACGATGGCTCAAGATTTGGACCTTACCATCTCCAGTTCATCACAATGGAACCCAACTCCCTGTTTCTCCCTGTTCTTCTTCATGCTGACATGGTGTTCTATATCCACACTGGTATGCTCTGTTTCTTGAAAAAAAAATAAAAGGACGACGAAATTTTGAATGATGTGGATGTTTGTGATTATTGTAGGAAGTGGGAGATTGAGTTGGTTTGATGATGTTGATTTGAGGGAAGTGGATTTACGGCGGGGAGATATTTACAGGCTCCATCCAGGTTCCATTTTTTACTTGCAGAGCAGCTTAGAGACCGAATGTGAAAAGTTTCGGATTTATGCTCTGTTCTCAAGCACAGATGATGATTCATACGTATGTTCTAATATATATCCTTAATCAATCGAAGTTTGTAAATGGGTTCGATTCTAAGATAAATTTGATGCATGTTTTTTGAAGGACCCGTCCATTGGAGCCTACTCGAGTGTCACTGATCTGGTTCGCGGCTTCGACAAGAAAGTTCTCCGTGAAGCTTTCAAGGTATGATTTTAGCTGAGATTCAAGTTTTTCAAGCTGATCCAATTTTGTCCATAACTTTCTAAAATTCTCAATTTTCAATAAGCTAATGTGTCATTCTATTGATACTTGTCAATTCGTATTAGTTCGTTAACATGTCGTGTCTTTCGCAAATGGGGATAGATTTTGCATATTTAGATTTGATTTTATGAACTGTTGTGATAATGTGTCAAAAGGCCGCTGAGGAAGTAATAGACGAACTAATGAATGGCACAAAGCCGCCATTGATCATGCACGCCGAGGCGGCGACAAAAATCAAGAAGCCATCCACGACGACATCGACGTGGGAGTTAGAAGCTCGGTTCTTGAAAGGCTTTCTAGGAGGAGGCGCAGGAGGGATGGGATTCAATAAGAAGAAGAAGAAAAGCATATACAACGTTTATGAAGCAGACCCAGATTTTGAAAACTGCAATGGATGGAGTTTGACTGTAACCAAGAAAGTCTCCCATCAGTTAAAGGGCTCCAATGTCGGCCTCTTCGTCGTCAACCTCACAGCGGTTAGTACTTCGATTTCGAATTTCTAATTCCTTTAATATGATTGATTCATGGAATTCACTTGAAATGATGTGTTGTTTAATTGTAGGGTTCGATGATGGGTCCGCATTGGAATCCAAGGGCGTGGGAGATTGGGATCGTGACGTCGGAGGAGGCGGGGGTGGTTCGGGTGGGTTGTTCGAGCAGCAGGACGAATGGTTCCGCGTGCAAGAATTGGAGTTATGTAGTAGGGGAAGGGGACGTGTTTGTGGTGCCAAAGTTCAATCCAATGGCGCAGATGTCATTCAACAATGGATCGTTTGTGTTTGTGGGATTTAGCACGGCCAACAGATATAATGTGCCGCAGTTCTTGGCGGGAAGTAGCTCGGTGTTGCAAATTATGGACAGGGAAGTGTTGGCGTGGTCGTTTGATGTCAATGTGACGACGGTTGATCGGTTGTTGGGAGCTCGAGTTGAGTCGGTCATATTGGAGTGTACTTCTTGTGCTGAGGAGGAAGTGAGGAAAATGGAAGAGGAAGCTGAGAGAAAGAGACAAGAGGAAGAGGAAGAGAGAAAAAGAAGAGAGGAGGAAGAAGAAGAGAGAAAAAGAGAGGAAGAAGAGAGAAAAAGAAGAGAAGAGGAAGAACAGAGGGAGAGAGAAGAGGAGGAGGAAGGAAGGAGAGAGAGGAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAGAGAGAGAAAGAGAGGAAGAGAGAGAGAGAGAGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAAGAGAGGGAAGAAGAAGAAGAGAGAAAGAGAAGAGAAGAGGAAGAAGAGAGGGAGAGAGAGGAAGAGAGGAAGAGAGAAGAGGAGGAAGCAAGGAGAGAGGAAGAGGAAGAGAGAGAAAGAAAGGAAGAGGAGGAGAGAAAAAGAGAGAAAGAAGAAGAGAGAAAAAGAAGAGGAGAGGAAGAACAGAGGGAGAGAGAGGAAGAGAGAAAAAGAGAAGGAGGAGAAGCAGCAGCTAGAAAAAGGGAAGAGGAGAGGGAGAGAGAGGCGGAGAGAGAGGAGGAGGAAGCAAGGGAGAGAGAGGAAACCTATCAAAAAGAGAGAAGGAAAAGACGGAGAGAAGCAGAGAGGGAGGCAGAAGAAAAACAGAGAAGAGCTTGGGAGGAGGAGGAGGAAAAAGAAGAAGAAGCAGAAACAGAGCCAGTGATAAGGATTTTGAGACAGTGGACATAACTCGAAACTTTCAACCACTATGCTTTCTTTTAAGCAACAGTATGCTTTTGCTTCACTCTCATGAATAA

mRNA sequence

ATGAAGAAATCCGCAGCAATTTCAGGATCGCCATTTTCATCGTCCGTTCTCATACCCATCTTCTTCCTTCTCCTCTCTCTGCCTTCAAATGCCGACGACGGACGGTGGGAAGGGGCCAGGCCTGAGGTGAAGAGAGCCAGCGAGAGAATATCACTCCTTAAAACAGAGTACGGCGAGATCTCCGCCATTGATTTCAACGATGGCTCAAGATTTGGACCTTACCATCTCCAGTTCATCACAATGGAACCCAACTCCCTGTTTCTCCCTGTTCTTCTTCATGCTGACATGGTGTTCTATATCCACACTGGAAGTGGGAGATTGAGTTGGTTTGATGATGTTGATTTGAGGGAAGTGGATTTACGGCGGGGAGATATTTACAGGCTCCATCCAGGTTCCATTTTTTACTTGCAGAGCAGCTTAGAGACCGAATGTGAAAAGTTTCGGATTTATGCTCTGTTCTCAAGCACAGATGATGATTCATACGACCCGTCCATTGGAGCCTACTCGAGTGTCACTGATCTGGTTCGCGGCTTCGACAAGAAAGTTCTCCGTGAAGCTTTCAAGGCCGCTGAGGAAGTAATAGACGAACTAATGAATGGCACAAAGCCGCCATTGATCATGCACGCCGAGGCGGCGACAAAAATCAAGAAGCCATCCACGACGACATCGACGTGGGAGTTAGAAGCTCGGTTCTTGAAAGGCTTTCTAGGAGGAGGCGCAGGAGGGATGGGATTCAATAAGAAGAAGAAGAAAAGCATATACAACGTTTATGAAGCAGACCCAGATTTTGAAAACTGCAATGGATGGAGTTTGACTGTAACCAAGAAAGTCTCCCATCAGTTAAAGGGCTCCAATGTCGGCCTCTTCGTCGTCAACCTCACAGCGGGTTCGATGATGGGTCCGCATTGGAATCCAAGGGCGTGGGAGATTGGGATCGTGACGTCGGAGGAGGCGGGGGTGGTTCGGGTGGGTTGTTCGAGCAGCAGGACGAATGGTTCCGCGTGCAAGAATTGGAGTTATGTAGTAGGGGAAGGGGACGTGTTTGTGGTGCCAAAGTTCAATCCAATGGCGCAGATGTCATTCAACAATGGATCGTTTGTGTTTGTGGGATTTAGCACGGCCAACAGATATAATGTGCCGCAGTTCTTGGCGGGAAGTAGCTCGGTGTTGCAAATTATGGACAGGGAAGTGTTGGCGTGGTCGTTTGATGTCAATGTGACGACGGTTGATCGGTTGTTGGGAGCTCGAGTTGAGTCGGTCATATTGGAGTGTACTTCTTGTGCTGAGGAGGAAGTGAGGAAAATGGAAGAGGAAGCTGAGAGAAAGAGACAAGAGGAAGAGGAAGAGAGAAAAAGAAGAGAGGAGGAAGAAGAAGAGAGAAAAAGAGAGGAAGAAGAGAGAAAAAGAAGAGAAGAGGAAGAACAGAGGGAGAGAGAAGAGGAGGAGGAAGGAAGGAGAGAGAGGAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAGAGAGAGAAAGAGAGGAAGAGAGAGAGAGAGAGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAAGAGAGGGAAGAAGAAGAAGAGAGAAAGAGAAGAGAAGAGGAAGAAGAGAGGGAGAGAGAGGAAGAGAGGAAGAGAGAAGAGGAGGAAGCAAGGAGAGAGGAAGAGGAAGAGAGAGAAAGAAAGGAAGAGGAGGAGAGAAAAAGAGAGAAAGAAGAAGAGAGAAAAAGAAGAGGAGAGGAAGAACAGAGGGAGAGAGAGGAAGAGAGAAAAAGAGAAGGAGGAGAAGCAAGAGAGGAGGAGGAAGCAAGGGAGAGAGAGGAAACCTATCAAAAAGAGAGAAGGAAAAGACGGAGAGAAGCAGAGAGGGAGGCAGAAGAAAAACAGAGAAGAGCTTGGGAGGAGGAGGAGGAAAAAGAAGAAGAAGCAGAAACAGAGCCAGTGATAAGGATTTTGAGACAGTGGACATAACTCGAAACTTTCAACCACTATGCTTTCTTTTAAGCAACAGTATGCTTTTGCTTCACTCTCATGAATAA

Coding sequence (CDS)

ATGAAGAAATCCGCAGCAATTTCAGGATCGCCATTTTCATCGTCCGTTCTCATACCCATCTTCTTCCTTCTCCTCTCTCTGCCTTCAAATGCCGACGACGGACGGTGGGAAGGGGCCAGGCCTGAGGTGAAGAGAGCCAGCGAGAGAATATCACTCCTTAAAACAGAGTACGGCGAGATCTCCGCCATTGATTTCAACGATGGCTCAAGATTTGGACCTTACCATCTCCAGTTCATCACAATGGAACCCAACTCCCTGTTTCTCCCTGTTCTTCTTCATGCTGACATGGTGTTCTATATCCACACTGGAAGTGGGAGATTGAGTTGGTTTGATGATGTTGATTTGAGGGAAGTGGATTTACGGCGGGGAGATATTTACAGGCTCCATCCAGGTTCCATTTTTTACTTGCAGAGCAGCTTAGAGACCGAATGTGAAAAGTTTCGGATTTATGCTCTGTTCTCAAGCACAGATGATGATTCATACGACCCGTCCATTGGAGCCTACTCGAGTGTCACTGATCTGGTTCGCGGCTTCGACAAGAAAGTTCTCCGTGAAGCTTTCAAGGCCGCTGAGGAAGTAATAGACGAACTAATGAATGGCACAAAGCCGCCATTGATCATGCACGCCGAGGCGGCGACAAAAATCAAGAAGCCATCCACGACGACATCGACGTGGGAGTTAGAAGCTCGGTTCTTGAAAGGCTTTCTAGGAGGAGGCGCAGGAGGGATGGGATTCAATAAGAAGAAGAAGAAAAGCATATACAACGTTTATGAAGCAGACCCAGATTTTGAAAACTGCAATGGATGGAGTTTGACTGTAACCAAGAAAGTCTCCCATCAGTTAAAGGGCTCCAATGTCGGCCTCTTCGTCGTCAACCTCACAGCGGGTTCGATGATGGGTCCGCATTGGAATCCAAGGGCGTGGGAGATTGGGATCGTGACGTCGGAGGAGGCGGGGGTGGTTCGGGTGGGTTGTTCGAGCAGCAGGACGAATGGTTCCGCGTGCAAGAATTGGAGTTATGTAGTAGGGGAAGGGGACGTGTTTGTGGTGCCAAAGTTCAATCCAATGGCGCAGATGTCATTCAACAATGGATCGTTTGTGTTTGTGGGATTTAGCACGGCCAACAGATATAATGTGCCGCAGTTCTTGGCGGGAAGTAGCTCGGTGTTGCAAATTATGGACAGGGAAGTGTTGGCGTGGTCGTTTGATGTCAATGTGACGACGGTTGATCGGTTGTTGGGAGCTCGAGTTGAGTCGGTCATATTGGAGTGTACTTCTTGTGCTGAGGAGGAAGTGAGGAAAATGGAAGAGGAAGCTGAGAGAAAGAGACAAGAGGAAGAGGAAGAGAGAAAAAGAAGAGAGGAGGAAGAAGAAGAGAGAAAAAGAGAGGAAGAAGAGAGAAAAAGAAGAGAAGAGGAAGAACAGAGGGAGAGAGAAGAGGAGGAGGAAGGAAGGAGAGAGAGGAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAGAGAGAGAAAGAGAGGAAGAGAGAGAGAGAGAGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAAGAGAGGGAAGAAGAAGAAGAGAGAAAGAGAAGAGAAGAGGAAGAAGAGAGGGAGAGAGAGGAAGAGAGGAAGAGAGAAGAGGAGGAAGCAAGGAGAGAGGAAGAGGAAGAGAGAGAAAGAAAGGAAGAGGAGGAGAGAAAAAGAGAGAAAGAAGAAGAGAGAAAAAGAAGAGGAGAGGAAGAACAGAGGGAGAGAGAGGAAGAGAGAAAAAGAGAAGGAGGAGAAGCAAGAGAGGAGGAGGAAGCAAGGGAGAGAGAGGAAACCTATCAAAAAGAGAGAAGGAAAAGACGGAGAGAAGCAGAGAGGGAGGCAGAAGAAAAACAGAGAAGAGCTTGGGAGGAGGAGGAGGAAAAAGAAGAAGAAGCAGAAACAGAGCCAGTGATAAGGATTTTGAGACAGTGGACATAACTCGAAACTTTCAACCACTATGCTTTCTTTTAAGCAACAGTATGCTTTTGCTTCACTCTCATGAATAA

Protein sequence

MKKSAAISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYGEISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGFDKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFLGGGAGGMGFNKKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMAQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARVESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREEEEEERKREEEERKRREEEEQREREEEEEGRRERKRERERERERRERKRGRERERGEREREREREKREGRRRREKEKRRGRREGERGREEERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGERGREKKRRRRSKRGGGSKGERGNLSKREKEKTERSREGGRRKTEKSLGGGGGKRRRSRNRASDKDFETVDITRNFQPLCFLLSNSMLLLHSHE
Homology
BLAST of Tan0021704 vs. ExPASy Swiss-Prot
Match: F4JQG6 (Vicilin-like seed storage protein At4g36700 OS=Arabidopsis thaliana OX=3702 GN=At4g36700 PE=3 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 1.7e-91
Identity = 244/533 (45.78%), Postives = 342/533 (64.17%), Query Frame = 0

Query: 11  PFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLK--------TEYGEISA 70
           P S  +L+ +F    SL  + +   ++ A P     S  + + K        T++G+IS 
Sbjct: 8   PLSVLLLVLLFLCTESLAKSEESEEYDVAVPSCCGFSSPLLIKKDQWKPIFETKFGQIST 67

Query: 71  IDFNDG-SRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDLR 130
           +   +G    GPY +  IT+EPN++ LP+LLH+DMVF++ +GSG L+W D+ + +  ++R
Sbjct: 68  VQIGNGCGGMGPYKIHSITLEPNTILLPLLLHSDMVFFVDSGSGILNWVDE-EAKSTEIR 127

Query: 131 RGDIYRLHPGSIFYLQSS-----LETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVR 190
            GD+YRL PGS+FYLQS      L T   K ++YA+FS+ D+  +DP  GAYSS+TDL+ 
Sbjct: 128 LGDVYRLRPGSVFYLQSKPVDIFLGT---KLKLYAIFSNNDECLHDPCFGAYSSITDLMF 187

Query: 191 GFDKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFL 250
           GFD+ +L+ AF   E +I+ + N TKPPLI+     T         +TW+L+ R LK F 
Sbjct: 188 GFDETILQSAFGVPEGIIELMRNRTKPPLIVSETLCT-----PGVANTWQLQPRLLKLF- 247

Query: 251 GGGAGGMGFNKKKKK-----------SIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSN 310
             G+  +  NKKKK+             +NV+E++PDFE+  G ++T+ +K    LKGS 
Sbjct: 248 -AGSADLVDNKKKKEKKEKKEKVKKAKTFNVFESEPDFESPYGRTITINRKDLKVLKGSM 307

Query: 311 VGLFVVNLTAGSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTN-GSACKNWSYVVGE 370
           VG+ +VNLT GSMMGPHWNP A EI IV  + AG+VRV  SS  +N  S CKN  + V E
Sbjct: 308 VGVSMVNLTQGSMMGPHWNPWACEISIVL-KGAGMVRVLRSSISSNTSSECKNVRFKVEE 367

Query: 371 GDVFVVPKFNPMAQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDV 430
           GD+F VP+ +PMAQMSFNN S VFVGF+T+ + N PQFLAG  S L+++DR+VLA S +V
Sbjct: 368 GDIFAVPRLHPMAQMSFNNDSLVFVGFTTSAKNNEPQFLAGEDSALRMLDRQVLAASLNV 427

Query: 431 NVTTVDRLLGARVESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREEEEEERKREE 490
           +  T+D LLGA+ E+VILEC SCAE E+ K++ E ERK+   ++ERKRR    +ERK+EE
Sbjct: 428 SSVTIDGLLGAQKEAVILECHSCAEGEIEKLKVEIERKK--IDDERKRR---HDERKKEE 487

Query: 491 EERKRREEEEQREREEEEEGRRERKRERERERERRERKRGRERERGERERERE 518
           EE K REEEE+R+REEEEE +R   ++  +E E RER+   E+E  E E E E
Sbjct: 488 EEAK-REEEERRKREEEEEKKRWPPQQPPQEEELRERQLPMEKE-WEMEGEEE 521

BLAST of Tan0021704 vs. ExPASy Swiss-Prot
Match: F4IQK5 (Vicilin-like seed storage protein At2g18540 OS=Arabidopsis thaliana OX=3702 GN=At2g18540 PE=3 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 5.5e-87
Identity = 286/642 (44.55%), Postives = 401/642 (62.46%), Query Frame = 0

Query: 41  PEVKRASERISLLKTEYGEISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYI 100
           P + +  +R S++ TE+G ISA+   DG     YH+QFIT+EPN+L LP+LLH+DMVF++
Sbjct: 42  PLLVKKDQRTSVVATEFGNISAVQIGDG-----YHIQFITLEPNALLLPLLLHSDMVFFV 101

Query: 101 HTGSGRLSWFDDVDLREVDLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDS 160
           HTG+G L+W D+   R+++LRRGD++RL  G++FY+ S+     EK R+YA+F +     
Sbjct: 102 HTGTGILNWIDEESERKLELRRGDVFRLRSGTVFYVHSN-----EKLRVYAIF-NVGKCL 161

Query: 161 YDPSIGAYSSVTDLVRGFDKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPST 220
            DP +GAYSSV DL+ GFD + LR AF   E+++ ++ + TKPPLI++  A  + +    
Sbjct: 162 NDPCLGAYSSVRDLLLGFDDRTLRSAFAVPEDILRKIRDATKPPLIVN--ALPRNRTQGL 221

Query: 221 TTSTWELEARFLKGFLGG-------GAGGMGFNKKKKKSIYNVYEADPDFENCNGWSLTV 280
               W  ++R ++ F+             +    KKK   +NV+E DPDFEN NG S+ V
Sbjct: 222 EEDKW--QSRLVRLFVSVEDVTDHLAMKPIVDTNKKKSRTFNVFEEDPDFENNNGRSIVV 281

Query: 281 TKKVSHQLKGSNVGLFVVNLTAGSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGS 340
            +K    LKGS  G+F+VNLT GSM+GPHWNP A EI IV   E G+VRV    ++ + S
Sbjct: 282 DEKDLDALKGSRFGVFMVNLTKGSMIGPHWNPSACEISIVLEGE-GMVRV---VNQQSLS 341

Query: 341 ACKN----WSYVVGEGDVFVVPKFNPMAQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSV 400
           +CKN     S++V EGDVFVVPKF+PMAQMSF N SFVF+GFST+ + N PQFL G SSV
Sbjct: 342 SCKNDRKSESFMVEEGDVFVVPKFHPMAQMSFENSSFVFMGFSTSAKTNHPQFLVGQSSV 401

Query: 401 LQIMDREVLAWSFDVNVTTVDRLLGARVESVILECTSCAEEEVRK-MEEEAERKRQEEEE 460
           L+++DR+V+A SF+++  T+  LL A+ ESVI EC SCAE E+ K M E  ERKR+EEEE
Sbjct: 402 LKVLDRDVVAVSFNLSNETIKGLLKAQKESVIFECASCAEGELSKLMREIEERKRREEEE 461

Query: 461 ERKRREEEEEERK------REEEERKRREEEE-QREREEEEEGRR---ERKRERERERER 520
             +RR+EEEE RK      REEEE KRREEEE +R++ EEEE R+   ERKRE E  + R
Sbjct: 462 IERRRKEEEEARKREEAKRREEEEAKRREEEETERKKREEEEARKREEERKREEEEAKRR 521

Query: 521 RERKRGRERERGE---REREREREKREGRRRREKEKRRGRREGERGREEERRGGSKERGR 580
            E ++ RE E  +   RE ERE+E+   ++R E+ +R+ R E ER R EE     +ER R
Sbjct: 522 EEERKKREEEAEQARKREEEREKEEEMAKKREEERQRKEREEVERKRREE-----QERKR 581

Query: 581 GRERKKGRGGEKKRE-----RRREKKKRRGRTEGERGREKKRRRRSKRGGGSKGERGNLS 640
             E  + R  E+KRE     RR ++++R+ R E ER   +++ R+ +     + E+    
Sbjct: 582 REEEARKREEERKREEEMAKRREQERQRKEREEVERKIREEQERKREEEMAKRREQERQK 641

Query: 641 KREKEKTERSREGGRRKTEKSLGGGGGKRRRSRNRASDKDFE 653
           K  +E   + RE   RK E+ +      R   R R   +D E
Sbjct: 642 KEREEMERKKREEEARKREEEM---AKIREEERQRKEREDVE 656

BLAST of Tan0021704 vs. ExPASy Swiss-Prot
Match: Q9SK09 (Vicilin-like seed storage protein At2g28490 OS=Arabidopsis thaliana OX=3702 GN=At2g28490 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 8.9e-37
Identity = 128/432 (29.63%), Postives = 202/432 (46.76%), Query Frame = 0

Query: 48  ERISLLKTEYGEISAIDFNDGSRF-GPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGR 107
           E   ++K+E GE+  +    G     P H+ F+TMEP +LF+P  L + ++ +I  G   
Sbjct: 90  ESRQVIKSEGGEMRVVLSPRGRIIEKPMHIGFLTMEPKTLFVPQYLDSSLLIFIRQGEAT 149

Query: 108 LSWFDDVDLREVDLRRGDIYRLHPGSIFYLQSS-----LETECEKFRIYALFSSTDDDSY 167
           L      +  E  L+ GDIY +  GS+FYL ++     L   C      +L   T    Y
Sbjct: 150 LGVICKDEFGERKLKAGDIYWIPAGSVFYLHNTGLGQRLHVICSIDPTQSLGFETFQPFY 209

Query: 168 DPSIGAYSSVTDLVRGFDKKVLREAFKAA-EEVIDELMNGTKPPLIMHAEAATKIKKPST 227
              IG   S   ++ GFD   L  AF  +  E+   +M+  + P++      T+  +P  
Sbjct: 210 ---IGGGPS--SVLAGFDPHTLTSAFNVSLPELQQMMMSQFRGPIVY----VTEGPQPQP 269

Query: 228 TTSTW--------ELEARFLKGFLGGGAG---------------------------GMGF 287
            ++ W        E + + LK  L    G                             G 
Sbjct: 270 QSTVWTQFLGLRGEEKHKQLKKLLETKQGSPQDQQYSSGWSWRNIVRSILDLTEEKNKGS 329

Query: 288 NKKKKKSIYNVYEA--DPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGSMMGPHW 347
              + +  YN+Y+    P F+N  GWS+ +       LK S +G+++VNLTAG+MM PH 
Sbjct: 330 GSSECEDSYNIYDKKDKPSFDNKYGWSIALDYDDYKPLKHSGIGVYLVNLTAGAMMAPHM 389

Query: 348 NPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMAQMSFNN 407
           NP A E GIV +  +G ++V       NG++  N    V  GDVF +P++    Q++   
Sbjct: 390 NPTATEYGIVLA-GSGEIQV----VFPNGTSAMNTR--VSVGDVFWIPRYFAFCQIASRT 449

Query: 408 GSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARVESVILE 436
           G F FVGF+T+   N PQFL GS+S+L+ ++   L+ +F V+  T+ R + A+ E+VIL 
Sbjct: 450 GPFEFVGFTTSAHKNRPQFLVGSNSLLRTLNLTSLSIAFGVDEETMRRFIEAQREAVILP 505

BLAST of Tan0021704 vs. NCBI nr
Match: XP_038882657.1 (vicilin-like seed storage protein At2g18540 [Benincasa hispida])

HSP 1 Score: 712.2 bits (1837), Expect = 4.2e-201
Identity = 435/603 (72.14%), Postives = 491/603 (81.43%), Query Frame = 0

Query: 1   MKKSAAISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYGEI 60
           MKK  AISGSP S S LI I FL LSLP+NADDG WE   P VKRA+ERI LLKTEYGEI
Sbjct: 1   MKKCTAISGSPSSPSFLISILFLFLSLPTNADDGWWETDSPVVKRANERIPLLKTEYGEI 60

Query: 61  SAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDL 120
           S +DF DGSRFG YHLQFIT+EPNSLFLPVLLHADMVFY HTGSGRLSWFDD DLREVD+
Sbjct: 61  STVDFADGSRFGHYHLQFITLEPNSLFLPVLLHADMVFYTHTGSGRLSWFDDDDLREVDI 120

Query: 121 RRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGFDK 180
           RRGDIYRLHPGSIFYLQS+LETE EK RIYALFSSTD+DS++PSIGAYS VTDLVRGFDK
Sbjct: 121 RRGDIYRLHPGSIFYLQSNLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGFDK 180

Query: 181 KVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFLGGGA 240
           +VLR+AF A EEVI+E+MN  +PPLI+HA      KK       WELEAR LK F+ GGA
Sbjct: 181 EVLRKAFMAPEEVIEEIMNAKRPPLIVHAATTPSKKKKKVAAVAWELEARLLKTFI-GGA 240

Query: 241 GGMGFNKKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGSMMG 300
            GM FNKKKKK +YNVYE DPDFENCNGWSLTVTKK SHQLKGSN+G  VVNLT+GSMMG
Sbjct: 241 SGMEFNKKKKKGVYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTSGSMMG 300

Query: 301 PHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMAQMS 360
           PHWNP AWEIGIVTS+E GVVRVGCSS++ N S CKNWS+VVG+GDVFVVP+F+PMAQMS
Sbjct: 301 PHWNPWAWEIGIVTSDEPGVVRVGCSSTK-NSSKCKNWSFVVGKGDVFVVPRFHPMAQMS 360

Query: 361 FNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARVESV 420
           FNNG+FVFVGFST N +N+PQFLAGSSSVLQI+DREVLAWSFDVNVTTVDRLLGARVES+
Sbjct: 361 FNNGTFVFVGFSTTNGHNMPQFLAGSSSVLQIVDREVLAWSFDVNVTTVDRLLGARVESI 420

Query: 421 ILECTSCAEEEVRKMEEEAERKRQEEEEERKRRE-------------EEEEERKREEEER 480
           ILECTSCAEEEVRKMEEEAER+R+EEEEERKR E             EEEEERKREEEER
Sbjct: 421 ILECTSCAEEEVRKMEEEAEREREEEEEERKREEEEERKREEEERKREEEEERKREEEER 480

Query: 481 KRREEEEQREREEEEEGRRERKRERERERER-RERKRGRERERGEREREREREKREGRRR 540
           KR EEEE+R+REEEEE +RE + ER+RE ER +E +R RE E  +RE ER R + E  RR
Sbjct: 481 KREEEEEERKREEEEERKREEEEERKREEEREQEEERRREEEEAKREEERRRREEE-ERR 540

Query: 541 REKEKRRGRREGERGREEER-RGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGE 589
            EKE+ RG RE +R  EE R R  + +R RGR R++     ++R+RRR ++++    E E
Sbjct: 541 EEKEEERGEREAKREEEEAREREETHQRERGRRRRETEREAEERQRRRWEEEKEEEEEEE 600

BLAST of Tan0021704 vs. NCBI nr
Match: XP_011658490.2 (vicilin-like seed storage protein At2g18540 [Cucumis sativus] >KAE8647587.1 hypothetical protein Csa_003555 [Cucumis sativus])

HSP 1 Score: 704.1 bits (1816), Expect = 1.1e-198
Identity = 437/656 (66.62%), Postives = 520/656 (79.27%), Query Frame = 0

Query: 1   MKKSAA--ISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYG 60
           MKK +A  +SGS FS S+LIPIFFL LSLP+ ADDG WEG  P VKRA+ERI +LKTEYG
Sbjct: 1   MKKHSAVSVSGSSFSPSILIPIFFLFLSLPAYADDGWWEGDTPVVKRANERIPILKTEYG 60

Query: 61  EISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREV 120
           EISA+DF+DG+RFG YHLQFIT+EPNSLFLPVLLH+DMVFY+HTGSGRL+WFDD DL+EV
Sbjct: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120

Query: 121 DLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGF 180
           DLRRGD+YRLHPGSIFYLQSSLETE EK RIYALFSSTD+DS++PSIGAYS VTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180

Query: 181 DKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFLGG 240
            K+VLR+AF A +EVI+E+MN  +PPLI+HA A T   K + ++S WE EAR LK FLGG
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIK-AKSSSPWEFEARLLKSFLGG 240

Query: 241 GAGGMGFN-KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGS 300
            A  + FN KKKKK IYNVYE DPDFENCNGWSLTVTKK SHQLKGSN+G  VVNLTAGS
Sbjct: 241 DASAIEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGS 300

Query: 301 MMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMA 360
           MMGPHWNPRAWEIGIVTS+E GV+RVGCSS+  N S CKNWS+VV +GDVFVVP+F+PMA
Sbjct: 301 MMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMA 360

Query: 361 QMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARV 420
           QMSFNNG+FVFVGFST N +N+PQF AGSSSVL+I+DREVLAWSFDVNVTT+DRLL ARV
Sbjct: 361 QMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARV 420

Query: 421 ESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREEEEEERKREEEERKRREEEEQRE 480
           ES++LECTSCAEEEVRKMEEEAER+R+EEEE   R+ EEEE RKREEEE ++REEEE+R+
Sbjct: 421 ESIVLECTSCAEEEVRKMEEEAEREREEEEE---RKREEEERRKREEEEERKREEEERRK 480

Query: 481 REEEEEGRRERKRERERERERRERKRGRERERGEREREREREKREGRRRREKEKRRGRRE 540
           REEEEE +RE +  R+RE E  +++   E  + E E E++RE+ E R+R E+E+R+   E
Sbjct: 481 REEEEERKREEEERRKREEEEEKKREEEEERKREEEEEKKREEEEERKREEEERRKREEE 540

Query: 541 GERGREEERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGERGREKKRRRRSK 600
            ER REEE     +E  R RE ++    +K+ ERRRE+++RR R E ER RE++ R R K
Sbjct: 541 EERKREEE----EEEEEREREEEE---AQKEEERRREEEERRRREEEERKREEEEREREK 600

Query: 601 RGGGSKGERGNLSKREKEKTERSREGGR-RKTEKSLGGGGGKRRRSRNRASDKDFE 653
             G  +  R    + E+E+ E  RE    RK E+      GKRRR       + +E
Sbjct: 601 ERGEEEQRRREEEEEEEEEREAEREEEEARKREEEHQRERGKRRREGEERQRRRWE 645

BLAST of Tan0021704 vs. NCBI nr
Match: KAG6603989.1 (Vicilin-like seed storage protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 695.7 bits (1794), Expect = 4.1e-196
Identity = 429/607 (70.68%), Postives = 492/607 (81.05%), Query Frame = 0

Query: 1   MKKSAAISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYGEI 60
           MKKS AISGSPFS S L  +FFL +SLPSNADD  WEGA P VKRA+ER SLLKTEYGEI
Sbjct: 1   MKKSTAISGSPFSLSFLFTVFFLFVSLPSNADDKWWEGACPVVKRANERRSLLKTEYGEI 60

Query: 61  SAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDL 120
           SA+D +D S+FGPYHLQFITMEPNSLFLPVLLHADMV Y+HTGSGRL+WFDD DLREVDL
Sbjct: 61  SAVDLHDASQFGPYHLQFITMEPNSLFLPVLLHADMVLYMHTGSGRLNWFDDDDLREVDL 120

Query: 121 RRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGFDK 180
           RRGDI+RL PG+IFY+ SSLETE EK R+YALFSSTD+D ++P+IGAYS VTD VRGFDK
Sbjct: 121 RRGDIFRLQPGAIFYIHSSLETEREKLRMYALFSSTDEDPFEPAIGAYSRVTDHVRGFDK 180

Query: 181 KVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTS-TWELEARFLKGFLGGG 240
           +VL +AF   EEVI+E+M+  +PPLI+HA A T  KKP+++ S + ELEARFLK F+GG 
Sbjct: 181 EVLCKAFMVPEEVIEEIMDAKRPPLIVHA-ATTLSKKPTSSLSMSLELEARFLKSFIGGR 240

Query: 241 AGGMGFN--KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGS 300
             GM FN  KKKKK +YNV+EADPDFENCNGWSLTVTKKVSHQLKGSN+G FVVNLTAGS
Sbjct: 241 GSGMDFNKKKKKKKGLYNVFEADPDFENCNGWSLTVTKKVSHQLKGSNIGFFVVNLTAGS 300

Query: 301 MMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMA 360
           MMGPHWNPRAWEIGIVTSEEAGVVRVGC SS TN S CK WS+VVG+GDVFVVP+F+PMA
Sbjct: 301 MMGPHWNPRAWEIGIVTSEEAGVVRVGC-SSMTNSSICKKWSFVVGKGDVFVVPRFHPMA 360

Query: 361 QMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARV 420
           QMSFNNGSFVFVGFST NR N+PQFLAG SSVLQ +DREVLAWSFDVNVTT+DRLLGARV
Sbjct: 361 QMSFNNGSFVFVGFSTTNRNNLPQFLAGRSSVLQTVDREVLAWSFDVNVTTIDRLLGARV 420

Query: 421 ESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREEEEEERKREEEERKRREEEEQRE 480
           ESVILECTSCAEEEVRKM EEAER+RQEEEE ++  EEEE +R+ EEEERKR+EEEE+R+
Sbjct: 421 ESVILECTSCAEEEVRKMVEEAERERQEEEERKREEEEEERKREEEEEERKRKEEEEERK 480

Query: 481 REEEEEGRRERKRERERERERRERKRGRERERGERERERE--REKREGRRRREKEKRRGR 540
           REEEE  R E +R RE E   RE +  R+RE  ERERE E  R++ E  R RE+E+ R R
Sbjct: 481 REEEEAKREEERRRREEEEREREEEEARKREEEEREREEEEARKREEEEREREEEEARKR 540

Query: 541 REGERGREE----ERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGERGREKK 599
            E ER REE    ERR G + R +    +  R  E++ E  RE ++   R  G R RE +
Sbjct: 541 EEEEREREEEAERERREGEEARRKEEAERGEREAEREAEEARESEEAHRRERGRRRREAE 600

BLAST of Tan0021704 vs. NCBI nr
Match: XP_023545029.1 (vicilin-like seed storage protein At2g18540 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 690.6 bits (1781), Expect = 1.3e-194
Identity = 450/683 (65.89%), Postives = 520/683 (76.13%), Query Frame = 0

Query: 1   MKKSAAISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYGEI 60
           MKKS  ISGSPFS S L  +FFL +SLPSNADD  WE A P VKRA+ER SLLKTEYGEI
Sbjct: 1   MKKSTVISGSPFSLSFLFTVFFLFVSLPSNADDKWWEAACPVVKRANERKSLLKTEYGEI 60

Query: 61  SAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDL 120
           SA+D  D S+FGPYHLQFITMEPNSLFLPVLLHADMV Y+HTGSGRL+WFDD DLREVDL
Sbjct: 61  SAVDLYDASQFGPYHLQFITMEPNSLFLPVLLHADMVLYMHTGSGRLNWFDDDDLREVDL 120

Query: 121 RRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGFDK 180
           RRGDI+RL PG+IFY+ SSLETE EK R+YALFSSTD+D ++P+IGAYS VTDLVRGFDK
Sbjct: 121 RRGDIFRLQPGAIFYIHSSLETEREKLRMYALFSSTDEDPFEPAIGAYSRVTDLVRGFDK 180

Query: 181 KVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKP-STTTSTWELEARFLKGFLGGG 240
           +VL +AF   EEVI+E+M+  +PPLI+HA A T  KKP S+   + ELEARFLK F+GGG
Sbjct: 181 EVLCKAFMVPEEVIEEIMDAKRPPLIVHA-ATTLSKKPRSSLLMSLELEARFLKSFIGGG 240

Query: 241 AGGMGFN---KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAG 300
             GM FN   KKKKK +YNV+EADPDFENCNGWSLTVTKKVSHQLKGSN+G FVVNLTAG
Sbjct: 241 GSGMDFNKKKKKKKKGLYNVFEADPDFENCNGWSLTVTKKVSHQLKGSNIGFFVVNLTAG 300

Query: 301 SMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPM 360
           SMMGPHWNPRAWEIGIVTSEEAGVVRVGC SS TN S CK WS+VVG+GDVFVVP+F+PM
Sbjct: 301 SMMGPHWNPRAWEIGIVTSEEAGVVRVGC-SSMTNSSICKKWSFVVGKGDVFVVPRFHPM 360

Query: 361 AQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGAR 420
           AQMSFNNGSFVFVGFST NR N+PQFLAG SSVLQ +DREVLAWSFDVNVTT+DRLLGAR
Sbjct: 361 AQMSFNNGSFVFVGFSTTNRNNLPQFLAGRSSVLQTVDREVLAWSFDVNVTTIDRLLGAR 420

Query: 421 VESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREE-----EEEERKREEEERKRRE 480
           VESVILECTSCAEEEVRKMEEEAER+RQEEEE +++ EE     EEEERKREEEE ++RE
Sbjct: 421 VESVILECTSCAEEEVRKMEEEAERERQEEEERKRKEEEERKRKEEEERKREEEEERKRE 480

Query: 481 EEEQREREEEEEGRRERKRERERERERRERKRGRERERGER---------EREREREKRE 540
           EEE+R+REEEE  R E ++  E E +R E +R RE E  ER         E ERERE+ E
Sbjct: 481 EEEERKREEEEAKREEERKREEEEAKREEERRRREEEEREREEEEARKREEEEREREEEE 540

Query: 541 GRRR------REKEKRRGRREGERGREEERRGGSKERGRG-----RERKKGRGGEKKRER 600
            R+R      RE+E+ R R E ER REE  R   +E  RG     RE ++ R  E+ R R
Sbjct: 541 ARKREDEEREREEEEARKREEEEREREERERREEEEAERGEREAEREAEEARESEETRRR 600

Query: 601 RREKKKR----RGRTEGERGREKKRRRRSKRGGGSKGERGNLSKREKEKTERSREGGRRK 651
            R +++R    R R E ER RE++    ++R G  + ER     RE E+T R   G RR+
Sbjct: 601 ERGRRRREEEARKREEEERERERREEEEAER-GEREAEREAEEARESEETRRRERGRRRR 660

BLAST of Tan0021704 vs. NCBI nr
Match: TYK12638.1 (vicilin-like seed storage protein [Cucumis melo var. makuwa])

HSP 1 Score: 687.2 bits (1772), Expect = 1.4e-193
Identity = 444/683 (65.01%), Postives = 531/683 (77.75%), Query Frame = 0

Query: 1   MKKSAAI--SGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYG 60
           MKK  AI  SGSPFS S LI IFFL  SLP+ ADDG WEG  P VKRA+ERI LLKTEYG
Sbjct: 1   MKKPTAISVSGSPFSLSFLISIFFLFFSLPAYADDGWWEGDSPVVKRANERIQLLKTEYG 60

Query: 61  EISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREV 120
           +ISA+DF+DGSRFGPYHLQFIT+EPNSLFLPVLLH+DMVFYIHTGSGRL+WFD+ DL+EV
Sbjct: 61  DISAVDFDDGSRFGPYHLQFITLEPNSLFLPVLLHSDMVFYIHTGSGRLNWFDENDLKEV 120

Query: 121 DLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGF 180
           DLRRGD+YRLHPGSIFYLQSSLE E EK RIYALFSSTD+DS++PS+GAYS VTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLEIEREKLRIYALFSSTDEDSFNPSLGAYSRVTDLVRGF 180

Query: 181 DKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFLGG 240
            K+VLR+AF A +EVI+E+M   +PPLI+HA A T   + + ++S WE EAR LK FLGG
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMTAKRPPLIVHAAAPTPSIR-AKSSSPWEFEARLLKAFLGG 240

Query: 241 GAGGMGFN--KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAG 300
            A G+ FN  KKKKK IYNVYE DPDFENCNGWSLTVTKK SHQLKGSN+G  VVNLTAG
Sbjct: 241 DASGIEFNKKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAG 300

Query: 301 SMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPM 360
           SMMGPHWNPRAWEIGIVTS+E GVV VGCSS+  N S CKNWS+VV +GD+FVVP+F+PM
Sbjct: 301 SMMGPHWNPRAWEIGIVTSDEPGVVHVGCSSTSANSSKCKNWSFVVEKGDIFVVPRFHPM 360

Query: 361 AQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGAR 420
           AQMSFNNG+FVFVGFST N +N+PQF  GSSSVLQ++DREVLAWSFDVNVTTVDRLL AR
Sbjct: 361 AQMSFNNGTFVFVGFSTTNGHNMPQFFVGSSSVLQLVDREVLAWSFDVNVTTVDRLLKAR 420

Query: 421 VESVILECTSCAEEEVRKMEEEAERKRQEE-----EEERKRREEEEEERKREEEERKRRE 480
           VES+ILECTSCAEEEVRKMEEEAER+R+EE     EEE +R+ EEEE+RKREEEER++RE
Sbjct: 421 VESIILECTSCAEEEVRKMEEEAEREREEEEERKREEEEQRKREEEEQRKREEEERRKRE 480

Query: 481 EEEQREREEEEEGRRE----RKRERERERER-RERKRGRERERGEREREREREKREGR-R 540
           EEEQR+REEEE+ +RE    RKRE E E ER  ERKR  E  + E ER R RE+ E R +
Sbjct: 481 EEEQRKREEEEQRKREEEERRKREEEEEAEREEERKREEEEAQREEERRRRREEEEKREK 540

Query: 541 RREKEKRRGRREGERGREEERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGE 600
            RE+E++R R E E+ +EEE     +ER R RE ++ +  E++ E+ RE+++++   E +
Sbjct: 541 EREEEEQRRREEEEQQQEEEEAQREEERRRRREEEERKREEEEGEKEREEEQQQEEEEAK 600

Query: 601 RGREKKRRRRSKRGGGSKGERGNLSKREKEKTERSREGGRRKTEKSLGGGGGKRRRSRNR 660
           R  E++R+R  +R    + +R     RE+E+ E  RE GRR+ E       G+RRR    
Sbjct: 601 REEEEERKREEER----EAKREEEEAREREE-EHQRERGRRRREAE----EGQRRRWEEE 660

Query: 661 ASDKDFETVDITRNFQPLCFLLS 669
             + + E  +     QP+  +LS
Sbjct: 661 EGEGEEEEEE-----QPVLRILS 668

BLAST of Tan0021704 vs. ExPASy TrEMBL
Match: A0A0A0KGF2 (PreproMP73 OS=Cucumis sativus OX=3659 GN=Csa_6G502040 PE=4 SV=1)

HSP 1 Score: 701.4 bits (1809), Expect = 3.6e-198
Identity = 438/658 (66.57%), Postives = 521/658 (79.18%), Query Frame = 0

Query: 1   MKKSAA--ISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYG 60
           MKK +A  +SGS FS S+LIPIFFL LSLP+ ADDG WEG  P VKRA+ERI +LKTEYG
Sbjct: 1   MKKHSAVSVSGSSFSPSILIPIFFLFLSLPAYADDGWWEGDTPVVKRANERIPILKTEYG 60

Query: 61  EISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREV 120
           EISA+DF+DG+RFG YHLQFIT+EPNSLFLPVLLH+DMVFY+HTGSGRL+WFDD DL+EV
Sbjct: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120

Query: 121 DLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGF 180
           DLRRGD+YRLHPGSIFYLQSSLETE EK RIYALFSSTD+DS++PSIGAYS VTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180

Query: 181 DKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFLGG 240
            K+VLR+AF A +EVI+E+MN  +PPLI+HA A T   K + ++S WE EAR LK FLGG
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIK-AKSSSPWEFEARLLKSFLGG 240

Query: 241 GAGGMGFN-KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGS 300
            A  + FN KKKKK IYNVYE DPDFENCNGWSLTVTKK SHQLKGSN+G  VVNLTAGS
Sbjct: 241 DASAIEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGS 300

Query: 301 MMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMA 360
           MMGPHWNPRAWEIGIVTS+E GV+RVGCSS+  N S CKNWS+VV +GDVFVVP+F+PMA
Sbjct: 301 MMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMA 360

Query: 361 QMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARV 420
           QMSFNNG+FVFVGFST N +N+PQF AGSSSVL+I+DREVLAWSFDVNVTT+DRLL ARV
Sbjct: 361 QMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARV 420

Query: 421 ESVILECTSCAEEEVRKMEEEAERKRQEE-----EEERKRREEEEEERKREEEERKRREE 480
           ES++LECTSCAEEEVRKMEEEAER+R+EE     EEE +R+ EEEEERKREEEER++REE
Sbjct: 421 ESIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEERKREEEERRKREE 480

Query: 481 EEQREREEEEEGRRERKRERERERERRERKRGRERERGEREREREREKREGRRRREKEKR 540
           EE+R+REEEE    ERKRE E E E R+R+   E++R E E E ER KRE    +++E+ 
Sbjct: 481 EEERKREEEER-EEERKREEEEEEEERKREEEEEKKREEEEEEEER-KREEEEEKKREEE 540

Query: 541 RGRREGERGREEERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGE----RGR 600
               E +R  EEE++   +E    R+RK+ RGG +KR+R +E+K+R    + E    RG 
Sbjct: 541 EEEEERKREEEEEKKREEEEEEEERKRKRKRGGGEKRKREKERKRRERERKREERRNRGG 600

Query: 601 EKKRRRRSKRGGGSKGERGNLSKREKEKTERSREGGRRKTEKSLGGGGGKRRRSRNRA 647
           EKK+R++ KR G  +  R  L K  K   ER   GG R  +    GG  ++++   R+
Sbjct: 601 EKKKRKKKKR-GRQRERRRKLGKERKNIKEREGRGGERGRKDKEDGGRKRKKKVEERS 654

BLAST of Tan0021704 vs. ExPASy TrEMBL
Match: A0A5D3CNA8 (Vicilin-like seed storage protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002250 PE=4 SV=1)

HSP 1 Score: 687.2 bits (1772), Expect = 7.0e-194
Identity = 444/683 (65.01%), Postives = 531/683 (77.75%), Query Frame = 0

Query: 1   MKKSAAI--SGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYG 60
           MKK  AI  SGSPFS S LI IFFL  SLP+ ADDG WEG  P VKRA+ERI LLKTEYG
Sbjct: 1   MKKPTAISVSGSPFSLSFLISIFFLFFSLPAYADDGWWEGDSPVVKRANERIQLLKTEYG 60

Query: 61  EISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREV 120
           +ISA+DF+DGSRFGPYHLQFIT+EPNSLFLPVLLH+DMVFYIHTGSGRL+WFD+ DL+EV
Sbjct: 61  DISAVDFDDGSRFGPYHLQFITLEPNSLFLPVLLHSDMVFYIHTGSGRLNWFDENDLKEV 120

Query: 121 DLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGF 180
           DLRRGD+YRLHPGSIFYLQSSLE E EK RIYALFSSTD+DS++PS+GAYS VTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLEIEREKLRIYALFSSTDEDSFNPSLGAYSRVTDLVRGF 180

Query: 181 DKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFLGG 240
            K+VLR+AF A +EVI+E+M   +PPLI+HA A T   + + ++S WE EAR LK FLGG
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMTAKRPPLIVHAAAPTPSIR-AKSSSPWEFEARLLKAFLGG 240

Query: 241 GAGGMGFN--KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAG 300
            A G+ FN  KKKKK IYNVYE DPDFENCNGWSLTVTKK SHQLKGSN+G  VVNLTAG
Sbjct: 241 DASGIEFNKKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAG 300

Query: 301 SMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPM 360
           SMMGPHWNPRAWEIGIVTS+E GVV VGCSS+  N S CKNWS+VV +GD+FVVP+F+PM
Sbjct: 301 SMMGPHWNPRAWEIGIVTSDEPGVVHVGCSSTSANSSKCKNWSFVVEKGDIFVVPRFHPM 360

Query: 361 AQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGAR 420
           AQMSFNNG+FVFVGFST N +N+PQF  GSSSVLQ++DREVLAWSFDVNVTTVDRLL AR
Sbjct: 361 AQMSFNNGTFVFVGFSTTNGHNMPQFFVGSSSVLQLVDREVLAWSFDVNVTTVDRLLKAR 420

Query: 421 VESVILECTSCAEEEVRKMEEEAERKRQEE-----EEERKRREEEEEERKREEEERKRRE 480
           VES+ILECTSCAEEEVRKMEEEAER+R+EE     EEE +R+ EEEE+RKREEEER++RE
Sbjct: 421 VESIILECTSCAEEEVRKMEEEAEREREEEEERKREEEEQRKREEEEQRKREEEERRKRE 480

Query: 481 EEEQREREEEEEGRRE----RKRERERERER-RERKRGRERERGEREREREREKREGR-R 540
           EEEQR+REEEE+ +RE    RKRE E E ER  ERKR  E  + E ER R RE+ E R +
Sbjct: 481 EEEQRKREEEEQRKREEEERRKREEEEEAEREEERKREEEEAQREEERRRRREEEEKREK 540

Query: 541 RREKEKRRGRREGERGREEERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGE 600
            RE+E++R R E E+ +EEE     +ER R RE ++ +  E++ E+ RE+++++   E +
Sbjct: 541 EREEEEQRRREEEEQQQEEEEAQREEERRRRREEEERKREEEEGEKEREEEQQQEEEEAK 600

Query: 601 RGREKKRRRRSKRGGGSKGERGNLSKREKEKTERSREGGRRKTEKSLGGGGGKRRRSRNR 660
           R  E++R+R  +R    + +R     RE+E+ E  RE GRR+ E       G+RRR    
Sbjct: 601 REEEEERKREEER----EAKREEEEAREREE-EHQRERGRRRREAE----EGQRRRWEEE 660

Query: 661 ASDKDFETVDITRNFQPLCFLLS 669
             + + E  +     QP+  +LS
Sbjct: 661 EGEGEEEEEE-----QPVLRILS 668

BLAST of Tan0021704 vs. ExPASy TrEMBL
Match: A0A6J1GFV7 (vicilin-like seed storage protein At2g18540 OS=Cucurbita moschata OX=3662 GN=LOC111453792 PE=4 SV=1)

HSP 1 Score: 686.0 bits (1769), Expect = 1.6e-193
Identity = 431/606 (71.12%), Postives = 487/606 (80.36%), Query Frame = 0

Query: 1   MKKSAAISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYGEI 60
           MKKS AISGSPFS S L  +FFL +SLPSNADD  WEGA P VKRA+ER SLLKTEYGEI
Sbjct: 1   MKKSTAISGSPFSLSFLFTVFFLFVSLPSNADDKWWEGACPVVKRANERRSLLKTEYGEI 60

Query: 61  SAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDL 120
           SAID +D S+FGPYHLQFITMEPNSLFLPVLLHADMV Y+HTGSGRL+WFDD DLREVDL
Sbjct: 61  SAIDLHDASQFGPYHLQFITMEPNSLFLPVLLHADMVLYMHTGSGRLNWFDDDDLREVDL 120

Query: 121 RRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGFDK 180
           RRGDI+RL PG+IFY+ SSLETE EK R+YALFSSTD+D ++P+IGAYS VTD VRGFDK
Sbjct: 121 RRGDIFRLQPGAIFYIHSSLETEREKLRMYALFSSTDEDPFEPAIGAYSRVTDHVRGFDK 180

Query: 181 KVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTS-TWELEARFLKGFLGGG 240
           +VL +AF   EEVI+E+M+  +PPLI+HA A T  KKP+++ S + ELEARFLK F+GGG
Sbjct: 181 EVLCKAFMVPEEVIEEIMDAKRPPLIVHA-ATTLSKKPTSSLSMSLELEARFLKSFIGGG 240

Query: 241 AGGMGFN--KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGS 300
             GM FN  KKKKK +YNV+EADPDFENCNGWSLTVTKKVSHQLKGSN+G FVVNLTAGS
Sbjct: 241 GSGMDFNKKKKKKKGLYNVFEADPDFENCNGWSLTVTKKVSHQLKGSNIGFFVVNLTAGS 300

Query: 301 MMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMA 360
           MMGPHWNPRAWEIGIVTSEEAGVVRVGC SS TN S CK WS+VVG+GDVFVVP+F+PMA
Sbjct: 301 MMGPHWNPRAWEIGIVTSEEAGVVRVGC-SSMTNSSICKKWSFVVGKGDVFVVPRFHPMA 360

Query: 361 QMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARV 420
           QMSFNNGSFVFVGFST NR N+PQFLAG SSVLQ +DREVLAWSFDVNVTT+DRLLGARV
Sbjct: 361 QMSFNNGSFVFVGFSTTNRNNLPQFLAGRSSVLQTVDREVLAWSFDVNVTTIDRLLGARV 420

Query: 421 ESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREEEEEERKR---EEEERKRREE-- 480
           ESVILECTSCAEEEVRKM EEAER+RQ EEEERKR EEEEEERKR   EEEERKR+EE  
Sbjct: 421 ESVILECTSCAEEEVRKMVEEAERERQ-EEEERKREEEEEEERKRKEEEEEERKRKEEEA 480

Query: 481 --EEQREREEEEEGRRERKRERERERERRERKRGRERERGEREREREREKREGRRRREKE 540
             EE+R R EEEE  RE +  R+RE E RER+   ER R E E  R+RE+ E R R E+E
Sbjct: 481 KREEERRRREEEEREREEEEARKREEEEREREEEAERGRREEEEARKREEEEEREREEEE 540

Query: 541 KRRGRREGERGREEERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGERGREK 597
            RR   E  RG  E  R   + R      ++ RG   +R R  E+++RR R E E    +
Sbjct: 541 ARREEEEAVRGEREAEREAEEARESEEAHRRERG---RRRREAEERQRRRREEEEEPTLR 600

BLAST of Tan0021704 vs. ExPASy TrEMBL
Match: A0A1S3B2F9 (vicilin-like seed storage protein At2g18540 OS=Cucumis melo OX=3656 GN=LOC103485029 PE=4 SV=1)

HSP 1 Score: 685.6 bits (1768), Expect = 2.0e-193
Identity = 433/649 (66.72%), Postives = 513/649 (79.04%), Query Frame = 0

Query: 1   MKKSAAI--SGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYG 60
           MKK  AI  SGSPFS S LI IFFL  SLP+ ADDG WEG  P VKRA+ERI LLKTEYG
Sbjct: 1   MKKPTAISVSGSPFSLSFLISIFFLFFSLPAYADDGWWEGDSPVVKRANERIQLLKTEYG 60

Query: 61  EISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREV 120
           +ISA+DF+DGSRFGPYHLQFIT+EPNSLFLPVLLH+DMVFYIHTGSGRL+WFD+ DL+EV
Sbjct: 61  DISAVDFDDGSRFGPYHLQFITLEPNSLFLPVLLHSDMVFYIHTGSGRLNWFDENDLKEV 120

Query: 121 DLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGF 180
           DLRRGD+YRLHPGSIFYLQSSLE E EK RIYALFSSTD+DS++PS+GAYS VTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLEIEREKLRIYALFSSTDEDSFNPSLGAYSRVTDLVRGF 180

Query: 181 DKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFLGG 240
            K+VLR+AF A +EVI+E+M   +PPLI+HA A T   + + ++S WE EAR LK FLGG
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMTAKRPPLIVHAAAPTPSIR-AKSSSPWEFEARLLKAFLGG 240

Query: 241 GAGGMGFN--KKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAG 300
            A G+ FN  KKKKK IYNVYE DPDFENCNGWSLTVTKK SHQLKGSN+G  VVNLTAG
Sbjct: 241 DASGIEFNKKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAG 300

Query: 301 SMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPM 360
           SMMGPHWNPRAWEIGIVTS+E GVV VGCSS+  N S CKNWS+VV +GD+FVVP+F+PM
Sbjct: 301 SMMGPHWNPRAWEIGIVTSDEPGVVHVGCSSTSANSSKCKNWSFVVEKGDIFVVPRFHPM 360

Query: 361 AQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGAR 420
           AQMSFNNG+FVFVGFST N +N+PQF  GSSSVLQ++DREVLAWSFDVNVTTVDRLL AR
Sbjct: 361 AQMSFNNGTFVFVGFSTTNGHNMPQFFVGSSSVLQLVDREVLAWSFDVNVTTVDRLLKAR 420

Query: 421 VESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREEEEEERKREEEERK-------- 480
           VES+ILECTSCAEEEVRKMEEEAER+R+EEEE   R+ EEEE RKREEEE++        
Sbjct: 421 VESIILECTSCAEEEVRKMEEEAEREREEEEE---RKREEEERRKREEEEQRKRXXXXXX 480

Query: 481 -RREEEEQREREEEEEGRRERKRERERERERRERKRGRERERGEREREREREKREGRRRR 540
            +REEEE+R+REEEEE  RE +R+RE E  +RE +R R RE  E +RE+ERE+ E RRR 
Sbjct: 481 XKREEEERRKREEEEEAEREEERKREEEEAQREEERRRRREE-EEKREKEREEEEQRRRE 540

Query: 541 EKEKRRGRREGERGREEERRGGSKERGRGRERKKGRGGEKKRERRREKKKRRGRTEGERG 600
           E+E++  + E E  REEERR   +E  R RE ++   GEK+RE  +++++   + E E  
Sbjct: 541 EEEQQ--QEEEEAQREEERRRRREEEERKREEEE---GEKEREEEQQQEEEEAKREEEEE 600

Query: 601 REKKRRRRSKRGGGSKGERGNLSKREKEKTER-SREGGRRKTEKSLGGG 636
           R+++  R +KR      ER    +RE+ +  R + EG RR+ E+  G G
Sbjct: 601 RKREEEREAKREEEEAREREEEHQRERGRRRREAEEGQRRRWEEEEGEG 639

BLAST of Tan0021704 vs. ExPASy TrEMBL
Match: Q8W3X8 (PreproMP73 OS=Cucurbita maxima OX=3661 GN=CmMP73 PE=2 SV=1)

HSP 1 Score: 671.0 bits (1730), Expect = 5.2e-189
Identity = 444/689 (64.44%), Postives = 512/689 (74.31%), Query Frame = 0

Query: 1   MKKSAAISGSPFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLKTEYGEI 60
           MKK  AISGSPFS S L  +FFL LSLPSNADD  WE A P VKRA+ER SLLKTEYGEI
Sbjct: 1   MKKCTAISGSPFSLSFLFTVFFLFLSLPSNADDKWWEAACP-VKRANERKSLLKTEYGEI 60

Query: 61  SAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDL 120
           SA+D +D S+FGPYHLQFITMEPNSLFLPVLLHADMV Y+HTGSGRL+WFDD DLREVDL
Sbjct: 61  SAVDLHDASQFGPYHLQFITMEPNSLFLPVLLHADMVLYMHTGSGRLNWFDDDDLREVDL 120

Query: 121 RRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVRGFDK 180
           RRGDI+RL PG+IFY+ SSLETE EK R+YALFSSTD+D ++P+IGAYS VTD VRGFDK
Sbjct: 121 RRGDIFRLQPGAIFYIHSSLETEREKLRMYALFSSTDEDPFEPAIGAYSRVTDHVRGFDK 180

Query: 181 KVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKI---KKPSTTTSTWELEARFLKGFLG 240
           ++L +AF   EEVI+E+M+  +PPLI+HA         K+ S+ + + ELEARFLK F+G
Sbjct: 181 EILCKAFMVPEEVIEEIMDAKRPPLIVHAATTLSTLSKKQRSSLSMSLELEARFLKSFIG 240

Query: 241 GGAGGMGF--NKKKKKSIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTA 300
           GG  GM F   KKKKK +YNV+EADPDFENCNGWSLTVTKKVSHQLKGSN+G FVVNLTA
Sbjct: 241 GGGIGMDFKKKKKKKKGLYNVFEADPDFENCNGWSLTVTKKVSHQLKGSNIGFFVVNLTA 300

Query: 301 GSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNP 360
           GSMMGPHWNPRAWEIGIVTSEEAGVVRVGC SS TN S CK WS+VVG+GDVFVVP+F+P
Sbjct: 301 GSMMGPHWNPRAWEIGIVTSEEAGVVRVGC-SSMTNSSKCKKWSFVVGKGDVFVVPRFHP 360

Query: 361 MAQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGA 420
           MAQMSFNNGSF FVGFST NR N+PQFLAG SSVLQ ++R+VLAWSFDVNVTT+DRLL A
Sbjct: 361 MAQMSFNNGSFAFVGFSTTNRNNLPQFLAGRSSVLQTVERQVLAWSFDVNVTTIDRLLEA 420

Query: 421 RVESVILECTSCAEEEVRKMEEEAERKRQEEEEERK----RREEEEEERKREEEERKRRE 480
           RVESVILECTSCAEEEV KMEEEAER+RQEEEE R+    R  EEEE RKREEEER+R E
Sbjct: 421 RVESVILECTSCAEEEVMKMEEEAERERQEEEERRREEEEREREEEEARKREEEEREREE 480

Query: 481 ------EEEQREREEEEEGRRE-----------RKRERERERERRERKRGRERERGERER 540
                 EEE+REREEEE  +RE           RKRE ERERE  E +R  E ER   E 
Sbjct: 481 EEARKREEEEREREEEEARKREEEEREREEEEARKREEEREREEEEERRREEEEREREEE 540

Query: 541 E-REREKREGRRRREKEKRRGRREGERGREEERRGGSKERGRGRERKKGRGGEKKRERRR 600
           E R+RE+ E R+R E+E+ R   E  +  EEE R   +E  R RE ++ R  EK+  R+R
Sbjct: 541 EARKREEEEARKREEEEREREEEEARKREEEEARKREEEEARKREEEEARKREKEEARKR 600

Query: 601 EKKKRRGRTEGERGR----EKKRRRRSKRGGGSKGERGNLSKREKEKTERSREGGRRKTE 659
           E+++R    E ER R    E+ RRR     G  + ER     RE E+  R RE GRR+ E
Sbjct: 601 EEEEREREEEAERERREEEEEARRREEAERGEREAEREAEEARESEEAHR-RERGRRRRE 660

BLAST of Tan0021704 vs. TAIR 10
Match: AT4G36700.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 338.6 bits (867), Expect = 1.2e-92
Identity = 244/533 (45.78%), Postives = 342/533 (64.17%), Query Frame = 0

Query: 11  PFSSSVLIPIFFLLLSLPSNADDGRWEGARPEVKRASERISLLK--------TEYGEISA 70
           P S  +L+ +F    SL  + +   ++ A P     S  + + K        T++G+IS 
Sbjct: 8   PLSVLLLVLLFLCTESLAKSEESEEYDVAVPSCCGFSSPLLIKKDQWKPIFETKFGQIST 67

Query: 71  IDFNDG-SRFGPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGRLSWFDDVDLREVDLR 130
           +   +G    GPY +  IT+EPN++ LP+LLH+DMVF++ +GSG L+W D+ + +  ++R
Sbjct: 68  VQIGNGCGGMGPYKIHSITLEPNTILLPLLLHSDMVFFVDSGSGILNWVDE-EAKSTEIR 127

Query: 131 RGDIYRLHPGSIFYLQSS-----LETECEKFRIYALFSSTDDDSYDPSIGAYSSVTDLVR 190
            GD+YRL PGS+FYLQS      L T   K ++YA+FS+ D+  +DP  GAYSS+TDL+ 
Sbjct: 128 LGDVYRLRPGSVFYLQSKPVDIFLGT---KLKLYAIFSNNDECLHDPCFGAYSSITDLMF 187

Query: 191 GFDKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPSTTTSTWELEARFLKGFL 250
           GFD+ +L+ AF   E +I+ + N TKPPLI+     T         +TW+L+ R LK F 
Sbjct: 188 GFDETILQSAFGVPEGIIELMRNRTKPPLIVSETLCT-----PGVANTWQLQPRLLKLF- 247

Query: 251 GGGAGGMGFNKKKKK-----------SIYNVYEADPDFENCNGWSLTVTKKVSHQLKGSN 310
             G+  +  NKKKK+             +NV+E++PDFE+  G ++T+ +K    LKGS 
Sbjct: 248 -AGSADLVDNKKKKEKKEKKEKVKKAKTFNVFESEPDFESPYGRTITINRKDLKVLKGSM 307

Query: 311 VGLFVVNLTAGSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTN-GSACKNWSYVVGE 370
           VG+ +VNLT GSMMGPHWNP A EI IV  + AG+VRV  SS  +N  S CKN  + V E
Sbjct: 308 VGVSMVNLTQGSMMGPHWNPWACEISIVL-KGAGMVRVLRSSISSNTSSECKNVRFKVEE 367

Query: 371 GDVFVVPKFNPMAQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDV 430
           GD+F VP+ +PMAQMSFNN S VFVGF+T+ + N PQFLAG  S L+++DR+VLA S +V
Sbjct: 368 GDIFAVPRLHPMAQMSFNNDSLVFVGFTTSAKNNEPQFLAGEDSALRMLDRQVLAASLNV 427

Query: 431 NVTTVDRLLGARVESVILECTSCAEEEVRKMEEEAERKRQEEEEERKRREEEEEERKREE 490
           +  T+D LLGA+ E+VILEC SCAE E+ K++ E ERK+   ++ERKRR    +ERK+EE
Sbjct: 428 SSVTIDGLLGAQKEAVILECHSCAEGEIEKLKVEIERKK--IDDERKRR---HDERKKEE 487

Query: 491 EERKRREEEEQREREEEEEGRRERKRERERERERRERKRGRERERGERERERE 518
           EE K REEEE+R+REEEEE +R   ++  +E E RER+   E+E  E E E E
Sbjct: 488 EEAK-REEEERRKREEEEEKKRWPPQQPPQEEELRERQLPMEKE-WEMEGEEE 521

BLAST of Tan0021704 vs. TAIR 10
Match: AT2G18540.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 323.6 bits (828), Expect = 3.9e-88
Identity = 286/642 (44.55%), Postives = 401/642 (62.46%), Query Frame = 0

Query: 41  PEVKRASERISLLKTEYGEISAIDFNDGSRFGPYHLQFITMEPNSLFLPVLLHADMVFYI 100
           P + +  +R S++ TE+G ISA+   DG     YH+QFIT+EPN+L LP+LLH+DMVF++
Sbjct: 42  PLLVKKDQRTSVVATEFGNISAVQIGDG-----YHIQFITLEPNALLLPLLLHSDMVFFV 101

Query: 101 HTGSGRLSWFDDVDLREVDLRRGDIYRLHPGSIFYLQSSLETECEKFRIYALFSSTDDDS 160
           HTG+G L+W D+   R+++LRRGD++RL  G++FY+ S+     EK R+YA+F +     
Sbjct: 102 HTGTGILNWIDEESERKLELRRGDVFRLRSGTVFYVHSN-----EKLRVYAIF-NVGKCL 161

Query: 161 YDPSIGAYSSVTDLVRGFDKKVLREAFKAAEEVIDELMNGTKPPLIMHAEAATKIKKPST 220
            DP +GAYSSV DL+ GFD + LR AF   E+++ ++ + TKPPLI++  A  + +    
Sbjct: 162 NDPCLGAYSSVRDLLLGFDDRTLRSAFAVPEDILRKIRDATKPPLIVN--ALPRNRTQGL 221

Query: 221 TTSTWELEARFLKGFLGG-------GAGGMGFNKKKKKSIYNVYEADPDFENCNGWSLTV 280
               W  ++R ++ F+             +    KKK   +NV+E DPDFEN NG S+ V
Sbjct: 222 EEDKW--QSRLVRLFVSVEDVTDHLAMKPIVDTNKKKSRTFNVFEEDPDFENNNGRSIVV 281

Query: 281 TKKVSHQLKGSNVGLFVVNLTAGSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSSRTNGS 340
            +K    LKGS  G+F+VNLT GSM+GPHWNP A EI IV   E G+VRV    ++ + S
Sbjct: 282 DEKDLDALKGSRFGVFMVNLTKGSMIGPHWNPSACEISIVLEGE-GMVRV---VNQQSLS 341

Query: 341 ACKN----WSYVVGEGDVFVVPKFNPMAQMSFNNGSFVFVGFSTANRYNVPQFLAGSSSV 400
           +CKN     S++V EGDVFVVPKF+PMAQMSF N SFVF+GFST+ + N PQFL G SSV
Sbjct: 342 SCKNDRKSESFMVEEGDVFVVPKFHPMAQMSFENSSFVFMGFSTSAKTNHPQFLVGQSSV 401

Query: 401 LQIMDREVLAWSFDVNVTTVDRLLGARVESVILECTSCAEEEVRK-MEEEAERKRQEEEE 460
           L+++DR+V+A SF+++  T+  LL A+ ESVI EC SCAE E+ K M E  ERKR+EEEE
Sbjct: 402 LKVLDRDVVAVSFNLSNETIKGLLKAQKESVIFECASCAEGELSKLMREIEERKRREEEE 461

Query: 461 ERKRREEEEEERK------REEEERKRREEEE-QREREEEEEGRR---ERKRERERERER 520
             +RR+EEEE RK      REEEE KRREEEE +R++ EEEE R+   ERKRE E  + R
Sbjct: 462 IERRRKEEEEARKREEAKRREEEEAKRREEEETERKKREEEEARKREEERKREEEEAKRR 521

Query: 521 RERKRGRERERGE---REREREREKREGRRRREKEKRRGRREGERGREEERRGGSKERGR 580
            E ++ RE E  +   RE ERE+E+   ++R E+ +R+ R E ER R EE     +ER R
Sbjct: 522 EEERKKREEEAEQARKREEEREKEEEMAKKREEERQRKEREEVERKRREE-----QERKR 581

Query: 581 GRERKKGRGGEKKRE-----RRREKKKRRGRTEGERGREKKRRRRSKRGGGSKGERGNLS 640
             E  + R  E+KRE     RR ++++R+ R E ER   +++ R+ +     + E+    
Sbjct: 582 REEEARKREEERKREEEMAKRREQERQRKEREEVERKIREEQERKREEEMAKRREQERQK 641

Query: 641 KREKEKTERSREGGRRKTEKSLGGGGGKRRRSRNRASDKDFE 653
           K  +E   + RE   RK E+ +      R   R R   +D E
Sbjct: 642 KEREEMERKKREEEARKREEEM---AKIREEERQRKEREDVE 656

BLAST of Tan0021704 vs. TAIR 10
Match: AT2G28490.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 156.8 bits (395), Expect = 6.3e-38
Identity = 128/432 (29.63%), Postives = 202/432 (46.76%), Query Frame = 0

Query: 48  ERISLLKTEYGEISAIDFNDGSRF-GPYHLQFITMEPNSLFLPVLLHADMVFYIHTGSGR 107
           E   ++K+E GE+  +    G     P H+ F+TMEP +LF+P  L + ++ +I  G   
Sbjct: 90  ESRQVIKSEGGEMRVVLSPRGRIIEKPMHIGFLTMEPKTLFVPQYLDSSLLIFIRQGEAT 149

Query: 108 LSWFDDVDLREVDLRRGDIYRLHPGSIFYLQSS-----LETECEKFRIYALFSSTDDDSY 167
           L      +  E  L+ GDIY +  GS+FYL ++     L   C      +L   T    Y
Sbjct: 150 LGVICKDEFGERKLKAGDIYWIPAGSVFYLHNTGLGQRLHVICSIDPTQSLGFETFQPFY 209

Query: 168 DPSIGAYSSVTDLVRGFDKKVLREAFKAA-EEVIDELMNGTKPPLIMHAEAATKIKKPST 227
              IG   S   ++ GFD   L  AF  +  E+   +M+  + P++      T+  +P  
Sbjct: 210 ---IGGGPS--SVLAGFDPHTLTSAFNVSLPELQQMMMSQFRGPIVY----VTEGPQPQP 269

Query: 228 TTSTW--------ELEARFLKGFLGGGAG---------------------------GMGF 287
            ++ W        E + + LK  L    G                             G 
Sbjct: 270 QSTVWTQFLGLRGEEKHKQLKKLLETKQGSPQDQQYSSGWSWRNIVRSILDLTEEKNKGS 329

Query: 288 NKKKKKSIYNVYEA--DPDFENCNGWSLTVTKKVSHQLKGSNVGLFVVNLTAGSMMGPHW 347
              + +  YN+Y+    P F+N  GWS+ +       LK S +G+++VNLTAG+MM PH 
Sbjct: 330 GSSECEDSYNIYDKKDKPSFDNKYGWSIALDYDDYKPLKHSGIGVYLVNLTAGAMMAPHM 389

Query: 348 NPRAWEIGIVTSEEAGVVRVGCSSSRTNGSACKNWSYVVGEGDVFVVPKFNPMAQMSFNN 407
           NP A E GIV +  +G ++V       NG++  N    V  GDVF +P++    Q++   
Sbjct: 390 NPTATEYGIVLA-GSGEIQV----VFPNGTSAMNTR--VSVGDVFWIPRYFAFCQIASRT 449

Query: 408 GSFVFVGFSTANRYNVPQFLAGSSSVLQIMDREVLAWSFDVNVTTVDRLLGARVESVILE 436
           G F FVGF+T+   N PQFL GS+S+L+ ++   L+ +F V+  T+ R + A+ E+VIL 
Sbjct: 450 GPFEFVGFTTSAHKNRPQFLVGSNSLLRTLNLTSLSIAFGVDEETMRRFIEAQREAVILP 505

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
F4JQG61.7e-9145.78Vicilin-like seed storage protein At4g36700 OS=Arabidopsis thaliana OX=3702 GN=A... [more]
F4IQK55.5e-8744.55Vicilin-like seed storage protein At2g18540 OS=Arabidopsis thaliana OX=3702 GN=A... [more]
Q9SK098.9e-3729.63Vicilin-like seed storage protein At2g28490 OS=Arabidopsis thaliana OX=3702 GN=A... [more]
Match NameE-valueIdentityDescription
XP_038882657.14.2e-20172.14vicilin-like seed storage protein At2g18540 [Benincasa hispida][more]
XP_011658490.21.1e-19866.62vicilin-like seed storage protein At2g18540 [Cucumis sativus] >KAE8647587.1 hypo... [more]
KAG6603989.14.1e-19670.68Vicilin-like seed storage protein, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023545029.11.3e-19465.89vicilin-like seed storage protein At2g18540 [Cucurbita pepo subsp. pepo][more]
TYK12638.11.4e-19365.01vicilin-like seed storage protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
A0A0A0KGF23.6e-19866.57PreproMP73 OS=Cucumis sativus OX=3659 GN=Csa_6G502040 PE=4 SV=1[more]
A0A5D3CNA87.0e-19465.01Vicilin-like seed storage protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A6J1GFV71.6e-19371.12vicilin-like seed storage protein At2g18540 OS=Cucurbita moschata OX=3662 GN=LOC... [more]
A0A1S3B2F92.0e-19366.72vicilin-like seed storage protein At2g18540 OS=Cucumis melo OX=3656 GN=LOC103485... [more]
Q8W3X85.2e-18964.44PreproMP73 OS=Cucurbita maxima OX=3661 GN=CmMP73 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT4G36700.11.2e-9245.78RmlC-like cupins superfamily protein [more]
AT2G18540.13.9e-8844.55RmlC-like cupins superfamily protein [more]
AT2G28490.16.3e-3829.63RmlC-like cupins superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 425..498
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 553..579
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 485..524
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 485..651
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 532..552
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 601..651
NoneNo IPR availablePANTHERPTHR31189OS03G0336100 PROTEIN-RELATEDcoord: 19..473
NoneNo IPR availablePANTHERPTHR31189:SF7PREPROMP73coord: 19..473
NoneNo IPR availableCDDcd02245cupin_7S_vicilin-like_Ccoord: 261..419
e-value: 1.69263E-63
score: 206.214
NoneNo IPR availableCDDcd02244cupin_7S_vicilin-like_Ncoord: 52..209
e-value: 2.6851E-62
score: 203.509
IPR006045Cupin 1SMARTSM00835Cupin_1_3coord: 254..410
e-value: 5.7E-47
score: 172.0
coord: 44..195
e-value: 0.0086
score: 0.8
IPR006045Cupin 1PFAMPF00190Cupin_1coord: 254..409
e-value: 4.7E-28
score: 97.7
IPR014710RmlC-like jelly roll foldGENE3D2.60.120.10Jelly Rollscoord: 245..427
e-value: 5.8E-42
score: 145.2
IPR014710RmlC-like jelly roll foldGENE3D2.60.120.10Jelly Rollscoord: 39..220
e-value: 4.4E-39
score: 135.8
IPR011051RmlC-like cupin domain superfamilySUPERFAMILY51182RmlC-like cupinscoord: 21..416

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0021704.1Tan0021704.1mRNA