Tan0006949 (gene) Snake gourd v1

Overview
NameTan0006949
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionaspartic proteinase CDR1-like
LocationLG01: 10231651 .. 10232946 (+)
RNA-Seq ExpressionTan0006949
SyntenyTan0006949
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGCCATTTCAATCTTCTTCTATCTCCTCCTTTTGATCTCTTTCTCCAAAGCAACCACCTATGGCGGCGGCCGCGGCGGAAATGGCTTCACCACCACTCTCTTCCACCGCGATTCTCCACTCTCCCCTCTCCATAACTCATCTCTCACTCACTACGACCGCCTCAATAATGCCTTCCGCCGCTCCTTCTCCCGCGCCGCCGCACTATTCCACCATGCCGCTGCCACCCCCGCCGCCGCCATTATCCAATCTCCGATCTCCCTCGGCAGCGGCAAGTATCTAATGTCCGTCTCCATTGGAACACCGCCGGTGGCTTACGTGGCCATCGCCGACACCGGCAGTGATCTGACGTGGACTCAATGCTTGCCATGTCAGAAATGCTACGACCAATCACAACCCATTTTCAACCCCTTCAAATCCTCCTCCTTCCGTCACGTGCCTTGCACGTCCCAATTATGTCACGCCATCGACGACGCCCTCTGTGGGGTCCAGGGGGTTTGCGATTACCGTTACACGTACGGAGATCGAAGCTACACGAAGGGAGATTTGGGATTCGATAAGATCACCTTCGGGTCATCGTCCGTGAACACGGTCATCGGATGTGGCCACGAGAGTGGCGGCGGGTTCGGCCATGCCTCGGGTCTCATCGGTCTCGGCGGCGGCGAACTCTCTTTAGTCTCTCAAATGAGCCGAAACGCCGCCGTCAGCCGGCGATTCTCCTATTGCTTACCGACCGTATTCAGTCAAACAAATGGCAAAATTAACTTCGGCCAAAACGCCGTCGTTTCCGGCCCTGGTGTCGTTTCAACGCCACTTGTCCCTAAAATTCCCAAGACGTATTATTACATGACTCTCGAAGCCGTTTCCGTTGGCAACGAACGTCACGCGGCCGACATGTCGTCCGCACGGGGAAACATGATTATAGACTCCGGGACGACATTGACGATCCTTCCGAAGGAGTTGTATGACGGCGTCGTTTCGTCGTTGGTGAAGGTCGTTCGAGGGAGGCGGGTGAACGATCCCCGCGGGCTTTTTGGACTCTGCTATGCTGCAGAAGGCAACGGCGTGGATATTCCGGTCATCACCGCCCATTTCGCCGGCGGCGCCGACGTGAAGTTGATGCCGGTGAATACGTTTAAGAAAGTGGCCGATGATGTGAGTTGCTTGGCGTTGGCGCCGACGTCGCCGAAACATGGTTTTGGGATTCTGGGGAATTTGGCGCAGTCGAATTTCTTGATCGGATATGATTTGGAGATGAGAACATTGTCGTTCAAACCAGCTGTCTGTGCTTAG

mRNA sequence

ATGGCTGCCATTTCAATCTTCTTCTATCTCCTCCTTTTGATCTCTTTCTCCAAAGCAACCACCTATGGCGGCGGCCGCGGCGGAAATGGCTTCACCACCACTCTCTTCCACCGCGATTCTCCACTCTCCCCTCTCCATAACTCATCTCTCACTCACTACGACCGCCTCAATAATGCCTTCCGCCGCTCCTTCTCCCGCGCCGCCGCACTATTCCACCATGCCGCTGCCACCCCCGCCGCCGCCATTATCCAATCTCCGATCTCCCTCGGCAGCGGCAAGTATCTAATGTCCGTCTCCATTGGAACACCGCCGGTGGCTTACGTGGCCATCGCCGACACCGGCAGTGATCTGACGTGGACTCAATGCTTGCCATGTCAGAAATGCTACGACCAATCACAACCCATTTTCAACCCCTTCAAATCCTCCTCCTTCCGTCACGTGCCTTGCACGTCCCAATTATGTCACGCCATCGACGACGCCCTCTGTGGGGTCCAGGGGGTTTGCGATTACCGTTACACGTACGGAGATCGAAGCTACACGAAGGGAGATTTGGGATTCGATAAGATCACCTTCGGGTCATCGTCCGTGAACACGGTCATCGGATGTGGCCACGAGAGTGGCGGCGGGTTCGGCCATGCCTCGGGTCTCATCGGTCTCGGCGGCGGCGAACTCTCTTTAGTCTCTCAAATGAGCCGAAACGCCGCCGTCAGCCGGCGATTCTCCTATTGCTTACCGACCGTATTCAGTCAAACAAATGGCAAAATTAACTTCGGCCAAAACGCCGTCGTTTCCGGCCCTGGTGTCGTTTCAACGCCACTTGTCCCTAAAATTCCCAAGACGTATTATTACATGACTCTCGAAGCCGTTTCCGTTGGCAACGAACGTCACGCGGCCGACATGTCGTCCGCACGGGGAAACATGATTATAGACTCCGGGACGACATTGACGATCCTTCCGAAGGAGTTGTATGACGGCGTCGTTTCGTCGTTGGTGAAGGTCGTTCGAGGGAGGCGGGTGAACGATCCCCGCGGGCTTTTTGGACTCTGCTATGCTGCAGAAGGCAACGGCGTGGATATTCCGGTCATCACCGCCCATTTCGCCGGCGGCGCCGACGTGAAGTTGATGCCGGTGAATACGTTTAAGAAAGTGGCCGATGATGTGAGTTGCTTGGCGTTGGCGCCGACGTCGCCGAAACATGGTTTTGGGATTCTGGGGAATTTGGCGCAGTCGAATTTCTTGATCGGATATGATTTGGAGATGAGAACATTGTCGTTCAAACCAGCTGTCTGTGCTTAG

Coding sequence (CDS)

ATGGCTGCCATTTCAATCTTCTTCTATCTCCTCCTTTTGATCTCTTTCTCCAAAGCAACCACCTATGGCGGCGGCCGCGGCGGAAATGGCTTCACCACCACTCTCTTCCACCGCGATTCTCCACTCTCCCCTCTCCATAACTCATCTCTCACTCACTACGACCGCCTCAATAATGCCTTCCGCCGCTCCTTCTCCCGCGCCGCCGCACTATTCCACCATGCCGCTGCCACCCCCGCCGCCGCCATTATCCAATCTCCGATCTCCCTCGGCAGCGGCAAGTATCTAATGTCCGTCTCCATTGGAACACCGCCGGTGGCTTACGTGGCCATCGCCGACACCGGCAGTGATCTGACGTGGACTCAATGCTTGCCATGTCAGAAATGCTACGACCAATCACAACCCATTTTCAACCCCTTCAAATCCTCCTCCTTCCGTCACGTGCCTTGCACGTCCCAATTATGTCACGCCATCGACGACGCCCTCTGTGGGGTCCAGGGGGTTTGCGATTACCGTTACACGTACGGAGATCGAAGCTACACGAAGGGAGATTTGGGATTCGATAAGATCACCTTCGGGTCATCGTCCGTGAACACGGTCATCGGATGTGGCCACGAGAGTGGCGGCGGGTTCGGCCATGCCTCGGGTCTCATCGGTCTCGGCGGCGGCGAACTCTCTTTAGTCTCTCAAATGAGCCGAAACGCCGCCGTCAGCCGGCGATTCTCCTATTGCTTACCGACCGTATTCAGTCAAACAAATGGCAAAATTAACTTCGGCCAAAACGCCGTCGTTTCCGGCCCTGGTGTCGTTTCAACGCCACTTGTCCCTAAAATTCCCAAGACGTATTATTACATGACTCTCGAAGCCGTTTCCGTTGGCAACGAACGTCACGCGGCCGACATGTCGTCCGCACGGGGAAACATGATTATAGACTCCGGGACGACATTGACGATCCTTCCGAAGGAGTTGTATGACGGCGTCGTTTCGTCGTTGGTGAAGGTCGTTCGAGGGAGGCGGGTGAACGATCCCCGCGGGCTTTTTGGACTCTGCTATGCTGCAGAAGGCAACGGCGTGGATATTCCGGTCATCACCGCCCATTTCGCCGGCGGCGCCGACGTGAAGTTGATGCCGGTGAATACGTTTAAGAAAGTGGCCGATGATGTGAGTTGCTTGGCGTTGGCGCCGACGTCGCCGAAACATGGTTTTGGGATTCTGGGGAATTTGGCGCAGTCGAATTTCTTGATCGGATATGATTTGGAGATGAGAACATTGTCGTTCAAACCAGCTGTCTGTGCTTAG

Protein sequence

MAAISIFFYLLLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYTKGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRFSYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADMSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCYAAEGNGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGYDLEMRTLSFKPAVCA
Homology
BLAST of Tan0006949 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 7.8e-95
Identity = 200/450 (44.44%), Postives = 275/450 (61.11%), Query Frame = 0

Query: 10  LLLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFSRAAA 69
           LL    F   T    G   N F+  L HRDSPLSP++N  +T  DRLN AF RS SR+  
Sbjct: 6   LLCFFLFFSVTLSSSGHPKN-FSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 65

Query: 70  LFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKCY 129
             H  + T     +QS +    G++ MS++IGTPP+   AIADTGSDLTW QC PCQ+CY
Sbjct: 66  FNHQLSQTD----LQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCY 125

Query: 130 DQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGV---QGVCDYRYTYGDRSYTKGDLGF 189
            ++ PIF+  KSS+++  PC S+ C A+     G      +C YRY+YGD+S++KGD+  
Sbjct: 126 KENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVAT 185

Query: 190 DKITFGSSS------VNTVIGCGHESGGGFGH-ASGLIGLGGGELSLVSQMSRNAAVSRR 249
           + ++  S+S        TV GCG+ +GG F    SG+IGLGGG LSL+SQ+   +++S++
Sbjct: 186 ETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQL--GSSISKK 245

Query: 250 FSYCLPTVFSQTNGK--INFGQNAVVSG----PGVVSTPLVPKIPKTYYYMTLEAVSVG- 309
           FSYCL    + TNG   IN G N++ S      GVVSTPLV K P TYYY+TLEA+SVG 
Sbjct: 246 FSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGK 305

Query: 310 ----------NERHAADMSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRG-RRVND 369
                     N      +S   GN+IIDSGTTLT+L    +D   S++ + V G +RV+D
Sbjct: 306 KKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSD 365

Query: 370 PRGLFGLCYAAEGNGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGF 429
           P+GL   C+ +    + +P IT HF  GADV+L P+N F K+++D+ CL++ PT+     
Sbjct: 366 PQGLLSHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTE---V 425

Query: 430 GILGNLAQSNFLIGYDLEMRTLSFKPAVCA 432
            I GN AQ +FL+GYDLE RT+SF+   C+
Sbjct: 426 AIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of Tan0006949 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 1.2e-92
Identity = 195/419 (46.54%), Postives = 262/419 (62.53%), Query Frame = 0

Query: 30  GFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFSRAAALFHHAAA--TPAAAIIQSPI 89
           GFT  L HRDSP SP +N   T   RL NA  RS +R   +FH      TP     Q  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQP---QIDL 89

Query: 90  SLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKCYDQSQPIFNPFKSSSFRHV 149
           +  SG+YLM+VSIGTPP   +AIADTGSDL WTQC PC  CY Q  P+F+P  SS+++ V
Sbjct: 90  TSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDV 149

Query: 150 PCTSQLCHAIDD-ALCGV-QGVCDYRYTYGDRSYTKGDLGFDKITFGSSSV------NTV 209
            C+S  C A+++ A C      C Y  +YGD SYTKG++  D +T GSS        N +
Sbjct: 150 SCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 209

Query: 210 IGCGHESGGGFG-HASGLIGLGGGELSLVSQMSRNAAVSRRFSYCLPTVFSQTN--GKIN 269
           IGCGH + G F    SG++GLGGG +SL+ Q+    ++  +FSYCL  + S+ +   KIN
Sbjct: 210 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQL--GDSIDGKFSYCLVPLTSKKDQTSKIN 269

Query: 270 FGQNAVVSGPGVVSTPLVPKI-PKTYYYMTLEAVSVGNER---HAADMSSARGNMIIDSG 329
           FG NA+VSG GVVSTPL+ K   +T+YY+TL+++SVG+++     +D  S+ GN+IIDSG
Sbjct: 270 FGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSG 329

Query: 330 TTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCYAAEGNGVDIPVITAHFAGGADV 389
           TTLT+LP E Y  +  ++   +   +  DP+    LCY+A G+ + +PVIT HF  GADV
Sbjct: 330 TTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGD-LKVPVITMHF-DGADV 389

Query: 390 KLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGYDLEMRTLSFKPAVCA 432
           KL   N F +V++D+ C A    SP   F I GN+AQ NFL+GYD   +T+SFKP  CA
Sbjct: 390 KLDSSNAFVQVSEDLVCFAFR-GSP--SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of Tan0006949 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 242.3 bits (617), Expect = 1.0e-62
Identity = 156/414 (37.68%), Postives = 217/414 (52.42%), Query Frame = 0

Query: 30  GFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFSRAAALFHHAAATPAAAIIQSPISL 89
           GF   L H DS        +LT +  L  A  R   R   L    A     + +++ +  
Sbjct: 40  GFQIMLEHVDS------GKNLTKFQLLERAIERGSRRLQRL---EAMLNGPSGVETSVYA 99

Query: 90  GSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKCYDQSQPIFNPFKSSSFRHVPC 149
           G G+YLM++SIGTP   + AI DTGSDL WTQC PC +C++QS PIFNP  SSSF  +PC
Sbjct: 100 GDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPC 159

Query: 150 TSQLCHAIDDALCGVQGVCDYRYTYGDRSYTKGDLGFDKITFGSSSV-NTVIGCGHESGG 209
           +SQLC A+    C     C Y Y YGD S T+G +G + +TFGS S+ N   GCG  + G
Sbjct: 160 SSQLCQALSSPTCS-NNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQG 219

Query: 210 -GFGHASGLIGLGGGELSLVSQMSRNAAVSRRFSYCLPTVFSQTNGKINFGQ--NAVVSG 269
            G G+ +GL+G+G G LSL SQ+        +FSYC+  + S T   +  G   N+V +G
Sbjct: 220 FGQGNGAGLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSTPSNLLLGSLANSVTAG 279

Query: 270 PGVVSTPLVPKIPKTYYYMTLEAVSVGNER-------HAADMSSARGNMIIDSGTTLTIL 329
               +     +IP T+YY+TL  +SVG+ R        A + ++  G +IIDSGTTLT  
Sbjct: 280 SPNTTLIQSSQIP-TFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYF 339

Query: 330 PKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY--AAEGNGVDIPVITAHFAGGADVKLMP 389
               Y  V    +  +    VN     F LC+   ++ + + IP    HF GG D++L  
Sbjct: 340 VNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPS 399

Query: 390 VNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGYDLEMRTLSFKPAVC 431
            N F   ++ + CLA+  +S   G  I GN+ Q N L+ YD     +SF  A C
Sbjct: 400 ENYFISPSNGLICLAMGSSS--QGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Tan0006949 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 234.2 bits (596), Expect = 2.8e-60
Identity = 146/393 (37.15%), Postives = 212/393 (53.94%), Query Frame = 0

Query: 49  SLTHYDRLNNAFRRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYV 108
           +LT Y+ +  A +R   R  ++    A   +++ I++P+  G G+YLM+V+IGTP  ++ 
Sbjct: 54  NLTKYELIKRAIKRGERRMRSI---NAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFS 113

Query: 109 AIADTGSDLTWTQCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVC 168
           AI DTGSDL WTQC PC +C+ Q  PIFNP  SSSF  +PC SQ C  +    C     C
Sbjct: 114 AIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCN-NNEC 173

Query: 169 DYRYTYGDRSYTKGDLGFDKITFGSSSV-NTVIGCGHESGG-GFGHASGLIGLGGGELSL 228
            Y Y YGD S T+G +  +  TF +SSV N   GCG ++ G G G+ +GLIG+G G LSL
Sbjct: 174 QYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSL 233

Query: 229 VSQMSRNAAVSRRFSYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLV-PKIPKTYYYMT 288
            SQ+        +FSYC+ +  S +   +  G  A     G  ST L+   +  TYYY+T
Sbjct: 234 PSQLGVG-----QFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYIT 293

Query: 289 LEAVSVGNERHAADMSS------ARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRV 348
           L+ ++VG +      S+        G MIIDSGTTLT LP++ Y+ V  +    +    V
Sbjct: 294 LQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTV 353

Query: 349 NDPRGLFGLCY--AAEGNGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSP 408
           ++       C+   ++G+ V +P I+  F GG  + L   N     A+ V CLA+  +S 
Sbjct: 354 DESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLAMG-SSS 413

Query: 409 KHGFGILGNLAQSNFLIGYDLEMRTLSFKPAVC 431
           + G  I GN+ Q    + YDL+   +SF P  C
Sbjct: 414 QLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Tan0006949 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 2.1e-52
Identity = 137/434 (31.57%), Postives = 209/434 (48.16%), Query Frame = 0

Query: 21  TYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFSRAAALFHHAA----- 80
           T+      + +T  L HRD   S  + +   H+ RL+   RR   R +A+    +     
Sbjct: 49  THFSDESSSKYTLRLLHRDRFPSVTYRN---HHHRLHARMRRDTDRVSAILRRISGKVIP 108

Query: 81  -------ATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKC 140
                       + I S +  GSG+Y + + +G+PP     + D+GSD+ W QC PC+ C
Sbjct: 109 SSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC 168

Query: 141 YDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYTKGDLGFDK 200
           Y QS P+F+P KS S+  V C S +C  I+++ C   G C Y   YGD SYTKG L  + 
Sbjct: 169 YKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCH-SGGCRYEVMYGDGSYTKGTLALET 228

Query: 201 ITFGSSSV-NTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRFSYCLPTV 260
           +TF  + V N  +GCGH + G F  A+GL+G+GGG +S V Q+S        F YCL + 
Sbjct: 229 LTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLS--GQTGGAFGYCLVSR 288

Query: 261 FSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNER-----HAADMS- 320
            + + G + FG+ A+  G   V     P+ P ++YY+ L+ + VG  R        D++ 
Sbjct: 289 GTDSTGSLVFGREALPVGASWVPLVRNPRAP-SFYYVGLKGLGVGGVRIPLPDGVFDLTE 348

Query: 321 SARGNMIIDSGTTLTILPKELY----DGVVSSLVKVVRGRRVNDPRGLFGLCYAAEG-NG 380
           +  G +++D+GT +T LP   Y    DG  S    + R   V+    +F  CY   G   
Sbjct: 349 TGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVS----IFDTCYDLSGFVS 408

Query: 381 VDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGY 431
           V +P ++ +F  G  V  +P   F    DD      A  +   G  I+GN+ Q    + +
Sbjct: 409 VRVPTVSFYFTEG-PVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSF 468

BLAST of Tan0006949 vs. NCBI nr
Match: XP_038889229.1 (aspartic proteinase CDR1-like [Benincasa hispida])

HSP 1 Score: 637.9 bits (1644), Expect = 6.4e-179
Identity = 320/435 (73.56%), Postives = 364/435 (83.68%), Query Frame = 0

Query: 1   MAAISIFFYLLLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAF 60
           MA IS+F YLLLLISFS+AT        NGFTT+LFHRD   S    SSL+HYDRL NAF
Sbjct: 1   MATISLFSYLLLLISFSQATI----NDDNGFTTSLFHRD---SLFQISSLSHYDRLTNAF 60

Query: 61  RRSFSRAAALFHHAA--ATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLT 120
           +RSFSR+ AL +H A  AT  AA++QSPI  GSG+YLM VSIGTPPV Y+ IADTGSDL 
Sbjct: 61  QRSFSRSTALINHVATVATTTAAVVQSPIGPGSGEYLMYVSIGTPPVDYIGIADTGSDLI 120

Query: 121 WTQCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRS 180
           WTQCLPCQKC++QS+PIFNP KSSS+  V CTSQ CHA++ A CGVQG+C+Y YTYGD++
Sbjct: 121 WTQCLPCQKCFNQSRPIFNPLKSSSYHRVSCTSQSCHALNVAHCGVQGICNYSYTYGDQT 180

Query: 181 YTKGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSR 240
           YTKGDLGFDKIT GSSSVN+VIGCGHESGGGFG+ASG+IGLGGGELSLVSQMS+ AA+S+
Sbjct: 181 YTKGDLGFDKITIGSSSVNSVIGCGHESGGGFGYASGVIGLGGGELSLVSQMSQTAAISQ 240

Query: 241 RFSYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAA 300
           +FSYCLPT+ S  NGKINFGQNA++SGPGVVSTPLVPK PKTYYYMTLEA+S+GNERH A
Sbjct: 241 QFSYCLPTLLSHANGKINFGQNAIISGPGVVSTPLVPKNPKTYYYMTLEAISIGNERHVA 300

Query: 301 DMSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCYAAEGN--G 360
           DMSS +GNMIIDSGTTLTILPKELYDGVVSSL+KVVR RRV DP    GLC+A + N  G
Sbjct: 301 DMSSKQGNMIIDSGTTLTILPKELYDGVVSSLLKVVRARRVEDPGHFLGLCFADDSNSGG 360

Query: 361 VDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGY 420
           + IP+ITAHFAGGADVKL+P NTF KVA +VSC  L    P+ GFGILGNLAQ+NFLIGY
Sbjct: 361 LGIPIITAHFAGGADVKLLPENTFMKVAKNVSCSTLTSAEPRDGFGILGNLAQANFLIGY 420

Query: 421 DLEMRTLSFKPAVCA 432
           DLE R LSFKP +CA
Sbjct: 421 DLEARRLSFKPTICA 428

BLAST of Tan0006949 vs. NCBI nr
Match: KAA0044967.1 (putative aspartic protease [Cucumis melo var. makuwa])

HSP 1 Score: 601.7 bits (1550), Expect = 5.1e-168
Identity = 306/436 (70.18%), Postives = 358/436 (82.11%), Query Frame = 0

Query: 1   MAAISIFFYL-LLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNA 60
           +A ISIFF L LLLISFS+ T      G NGFTT+LFHRDS LSPL  S+L+HYDRL+NA
Sbjct: 2   VATISIFFLLFLLLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSTLSHYDRLSNA 61

Query: 61  FRRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTW 120
           FRRS SR+AAL +  AAT  A  +QSPI+ GSG+YLMSVSIGTPPV Y+ +ADTGSDLTW
Sbjct: 62  FRRSLSRSAALLNR-AATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTW 121

Query: 121 TQCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSY 180
            QCLPC KC+ QS+PIFNP KS+SF HVPC SQ+C AIDDA CGVQGVCDY YTYGD++Y
Sbjct: 122 AQCLPCVKCFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTY 181

Query: 181 TKGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRR 240
           TKGDLG +KIT GSSSV +VIGCGHESGGGFG ASG+IGLGGG+LSLVSQMS+ + +SRR
Sbjct: 182 TKGDLGLEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRR 241

Query: 241 FSYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAAD 300
           FSYCLPT+ S  NGKINFGQNAVVSGPGVVSTPL+ K P TYYY+TLEA+S+GNERH A 
Sbjct: 242 FSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA- 301

Query: 301 MSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY-----AAEG 360
            S+ +GN+IIDSGTTLT+LPKELYDGVVSSL+KVV+ +RV DP   + LC+      A  
Sbjct: 302 -SAKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAAS 361

Query: 361 NGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLI 420
           +G  IP+IT HF+GGA+V L+PVNTF+KVA++V+CL L   SP   FGI+GNLAQ+NFLI
Sbjct: 362 SG--IPIITTHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLI 421

Query: 421 GYDLEMRTLSFKPAVC 431
           GYDLE + LSFKP VC
Sbjct: 422 GYDLEAKRLSFKPTVC 429

BLAST of Tan0006949 vs. NCBI nr
Match: XP_008452153.1 (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo] >TYK16501.1 putative aspartic protease [Cucumis melo var. makuwa])

HSP 1 Score: 597.8 bits (1540), Expect = 7.3e-167
Identity = 305/434 (70.28%), Postives = 356/434 (82.03%), Query Frame = 0

Query: 3   AISIFFYLLL-LISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFR 62
           A SIF  L+L LISFS+ T      G NGFTT+LFHRDS LSPL  SSL+HYDRL+NAFR
Sbjct: 2   AASIFCRLILFLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLSNAFR 61

Query: 63  RSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQ 122
           RS SR+AAL +  AAT  A  +QSPI+ GSG+YLMSVSIGTPPV Y+ +ADTGSDLTW Q
Sbjct: 62  RSLSRSAALLNR-AATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTWAQ 121

Query: 123 CLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYTK 182
           CLPC KC+ QS+PIFNP KS+SF HVPC SQ+C AIDDA CGVQGVCDY YTYGD++YTK
Sbjct: 122 CLPCVKCFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYTK 181

Query: 183 GDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRFS 242
           GDLG +KIT GSSSV +VIGCGHESGGGFG ASG+IGLGGG+LSLVSQMS+ + +SRRFS
Sbjct: 182 GDLGLEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFS 241

Query: 243 YCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADMS 302
           YCLPT+ S  NGKINFGQNAVVSGPGVVSTPL+ K P TYYY+TLEA+S+GNERH A  S
Sbjct: 242 YCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA--S 301

Query: 303 SARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY-----AAEGNG 362
           + +GN+IIDSGTTLT+LPKELYDGVVSSL+KVV+ +RV DP   + LC+      A  +G
Sbjct: 302 AKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSG 361

Query: 363 VDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGY 422
             IP+ITAHF+GGA+V L+PVNTF+KVA++V+CL L   SP   FGI+GNLAQ+NFLIGY
Sbjct: 362 --IPIITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGY 421

Query: 423 DLEMRTLSFKPAVC 431
           DLE + LSFKP VC
Sbjct: 422 DLEAKRLSFKPTVC 427

BLAST of Tan0006949 vs. NCBI nr
Match: XP_004149005.3 (probable aspartic protease At2g35615 [Cucumis sativus] >KAE8649217.1 hypothetical protein Csa_015005 [Cucumis sativus])

HSP 1 Score: 594.7 bits (1532), Expect = 6.2e-166
Identity = 303/434 (69.82%), Postives = 351/434 (80.88%), Query Frame = 0

Query: 2   AAISIFFYLLL-LISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAF 61
           A IS+FF+L+L LISFS+ T      G NGFTT+LFHRDS LSPL  SSL+HYDRL NAF
Sbjct: 3   ATISLFFHLILFLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAF 62

Query: 62  RRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWT 121
           RRS SR+AAL +  AAT  A  +QS I  GSG+YLMSVSIGTPPV Y+ IADTGSDLTW 
Sbjct: 63  RRSLSRSAALLNR-AATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWA 122

Query: 122 QCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYT 181
           QCLPC KCY Q +PIFNP KS+SF HVPC +Q CHA+DD  CGVQGVCDY YTYGDR+Y+
Sbjct: 123 QCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYS 182

Query: 182 KGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRF 241
           KGDLGF+KIT GSSSV +VIGCGH S GGFG ASG+IGLGGG+LSLVSQMS+ + +SRRF
Sbjct: 183 KGDLGFEKITIGSSSVKSVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 242

Query: 242 SYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADM 301
           SYCLPT+ S  NGKINFG+NAVVSGPGVVSTPL+ K   TYYY+TLEA+S+GNERH A  
Sbjct: 243 SYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMA-- 302

Query: 302 SSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCYAAEGN---GV 361
            + +GN+IIDSGTTLTILPKELYDGVVSSL+KVV+ +RV DP G   LC+    N    +
Sbjct: 303 FAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASL 362

Query: 362 DIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGYD 421
            IPVITAHF+GGA+V L+P+NTF+KVAD+V+CL L   SP   FGI+GNLAQ+NFLIGYD
Sbjct: 363 GIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYD 422

Query: 422 LEMRTLSFKPAVCA 432
           LE + LSFKP VCA
Sbjct: 423 LEAKRLSFKPTVCA 430

BLAST of Tan0006949 vs. NCBI nr
Match: XP_008452152.1 (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo] >TYK16500.1 putative aspartic protease [Cucumis melo var. makuwa])

HSP 1 Score: 590.5 bits (1521), Expect = 1.2e-164
Identity = 301/435 (69.20%), Postives = 353/435 (81.15%), Query Frame = 0

Query: 2   AAISIFFYL-LLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAF 61
           A ISIFF L LLLISFS+ T      G NGFTT+LFHRDS LSPL  S+L+HYDRL+NAF
Sbjct: 3   ATISIFFLLFLLLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSTLSHYDRLSNAF 62

Query: 62  RRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWT 121
           RRS SR+AAL +   AT  A  +QSPI+ GSG+YLM VSIGTPPV Y+ + DTGSDLTW 
Sbjct: 63  RRSLSRSAALLNR-TATSGAVGLQSPIAPGSGEYLMYVSIGTPPVDYIGMIDTGSDLTWA 122

Query: 122 QCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYT 181
           QCLPC+KC+ Q +PIFNP KS+SF HVPC SQ+C AIDDA CGVQGVCDY YTYGD++YT
Sbjct: 123 QCLPCRKCFLQLRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYT 182

Query: 182 KGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRF 241
           KGDLGF+KIT GSSSV +VIGCGHESGGGFG ASG+IGLGGG+LSLVSQMS+ + +SRRF
Sbjct: 183 KGDLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 242

Query: 242 SYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADM 301
           SYCLP +    NGKINF QNAVVSGPGVVSTPL+ K P TYYY+TLEA+S+GNERH A  
Sbjct: 243 SYCLPPLLGHANGKINFAQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA-- 302

Query: 302 SSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY-----AAEGN 361
           S+ +GN+IIDSGTTLT+LPKELYDGVVSSL+KVV+ +RV DP   + LC+      A  +
Sbjct: 303 SAKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASS 362

Query: 362 GVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIG 421
           G  IP+ITAHF+GGA+V L+PVNTF+KVA++V+CL L   SP   FGI+GNLAQ+NFLIG
Sbjct: 363 G--IPIITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIG 422

Query: 422 YDLEMRTLSFKPAVC 431
           YDLE + LSFKP VC
Sbjct: 423 YDLEAKRLSFKPTVC 429

BLAST of Tan0006949 vs. ExPASy TrEMBL
Match: A0A5A7TSV3 (Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G002730 PE=3 SV=1)

HSP 1 Score: 601.7 bits (1550), Expect = 2.5e-168
Identity = 306/436 (70.18%), Postives = 358/436 (82.11%), Query Frame = 0

Query: 1   MAAISIFFYL-LLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNA 60
           +A ISIFF L LLLISFS+ T      G NGFTT+LFHRDS LSPL  S+L+HYDRL+NA
Sbjct: 2   VATISIFFLLFLLLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSTLSHYDRLSNA 61

Query: 61  FRRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTW 120
           FRRS SR+AAL +  AAT  A  +QSPI+ GSG+YLMSVSIGTPPV Y+ +ADTGSDLTW
Sbjct: 62  FRRSLSRSAALLNR-AATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTW 121

Query: 121 TQCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSY 180
            QCLPC KC+ QS+PIFNP KS+SF HVPC SQ+C AIDDA CGVQGVCDY YTYGD++Y
Sbjct: 122 AQCLPCVKCFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTY 181

Query: 181 TKGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRR 240
           TKGDLG +KIT GSSSV +VIGCGHESGGGFG ASG+IGLGGG+LSLVSQMS+ + +SRR
Sbjct: 182 TKGDLGLEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRR 241

Query: 241 FSYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAAD 300
           FSYCLPT+ S  NGKINFGQNAVVSGPGVVSTPL+ K P TYYY+TLEA+S+GNERH A 
Sbjct: 242 FSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA- 301

Query: 301 MSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY-----AAEG 360
            S+ +GN+IIDSGTTLT+LPKELYDGVVSSL+KVV+ +RV DP   + LC+      A  
Sbjct: 302 -SAKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAAS 361

Query: 361 NGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLI 420
           +G  IP+IT HF+GGA+V L+PVNTF+KVA++V+CL L   SP   FGI+GNLAQ+NFLI
Sbjct: 362 SG--IPIITTHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLI 421

Query: 421 GYDLEMRTLSFKPAVC 431
           GYDLE + LSFKP VC
Sbjct: 422 GYDLEAKRLSFKPTVC 429

BLAST of Tan0006949 vs. ExPASy TrEMBL
Match: A0A5D3CX41 (Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G003090 PE=3 SV=1)

HSP 1 Score: 597.8 bits (1540), Expect = 3.6e-167
Identity = 305/434 (70.28%), Postives = 356/434 (82.03%), Query Frame = 0

Query: 3   AISIFFYLLL-LISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFR 62
           A SIF  L+L LISFS+ T      G NGFTT+LFHRDS LSPL  SSL+HYDRL+NAFR
Sbjct: 2   AASIFCRLILFLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLSNAFR 61

Query: 63  RSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQ 122
           RS SR+AAL +  AAT  A  +QSPI+ GSG+YLMSVSIGTPPV Y+ +ADTGSDLTW Q
Sbjct: 62  RSLSRSAALLNR-AATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTWAQ 121

Query: 123 CLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYTK 182
           CLPC KC+ QS+PIFNP KS+SF HVPC SQ+C AIDDA CGVQGVCDY YTYGD++YTK
Sbjct: 122 CLPCVKCFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYTK 181

Query: 183 GDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRFS 242
           GDLG +KIT GSSSV +VIGCGHESGGGFG ASG+IGLGGG+LSLVSQMS+ + +SRRFS
Sbjct: 182 GDLGLEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFS 241

Query: 243 YCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADMS 302
           YCLPT+ S  NGKINFGQNAVVSGPGVVSTPL+ K P TYYY+TLEA+S+GNERH A  S
Sbjct: 242 YCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA--S 301

Query: 303 SARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY-----AAEGNG 362
           + +GN+IIDSGTTLT+LPKELYDGVVSSL+KVV+ +RV DP   + LC+      A  +G
Sbjct: 302 AKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSG 361

Query: 363 VDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGY 422
             IP+ITAHF+GGA+V L+PVNTF+KVA++V+CL L   SP   FGI+GNLAQ+NFLIGY
Sbjct: 362 --IPIITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGY 421

Query: 423 DLEMRTLSFKPAVC 431
           DLE + LSFKP VC
Sbjct: 422 DLEAKRLSFKPTVC 427

BLAST of Tan0006949 vs. ExPASy TrEMBL
Match: A0A1S3BUB0 (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493257 PE=3 SV=1)

HSP 1 Score: 597.8 bits (1540), Expect = 3.6e-167
Identity = 305/434 (70.28%), Postives = 356/434 (82.03%), Query Frame = 0

Query: 3   AISIFFYLLL-LISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFR 62
           A SIF  L+L LISFS+ T      G NGFTT+LFHRDS LSPL  SSL+HYDRL+NAFR
Sbjct: 2   AASIFCRLILFLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLSNAFR 61

Query: 63  RSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQ 122
           RS SR+AAL +  AAT  A  +QSPI+ GSG+YLMSVSIGTPPV Y+ +ADTGSDLTW Q
Sbjct: 62  RSLSRSAALLNR-AATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTWAQ 121

Query: 123 CLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYTK 182
           CLPC KC+ QS+PIFNP KS+SF HVPC SQ+C AIDDA CGVQGVCDY YTYGD++YTK
Sbjct: 122 CLPCVKCFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYTK 181

Query: 183 GDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRFS 242
           GDLG +KIT GSSSV +VIGCGHESGGGFG ASG+IGLGGG+LSLVSQMS+ + +SRRFS
Sbjct: 182 GDLGLEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFS 241

Query: 243 YCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADMS 302
           YCLPT+ S  NGKINFGQNAVVSGPGVVSTPL+ K P TYYY+TLEA+S+GNERH A  S
Sbjct: 242 YCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA--S 301

Query: 303 SARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY-----AAEGNG 362
           + +GN+IIDSGTTLT+LPKELYDGVVSSL+KVV+ +RV DP   + LC+      A  +G
Sbjct: 302 AKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSG 361

Query: 363 VDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGY 422
             IP+ITAHF+GGA+V L+PVNTF+KVA++V+CL L   SP   FGI+GNLAQ+NFLIGY
Sbjct: 362 --IPIITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGY 421

Query: 423 DLEMRTLSFKPAVC 431
           DLE + LSFKP VC
Sbjct: 422 DLEAKRLSFKPTVC 427

BLAST of Tan0006949 vs. ExPASy TrEMBL
Match: A0A0A0KV20 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 2.3e-166
Identity = 303/434 (69.82%), Postives = 351/434 (80.88%), Query Frame = 0

Query: 2   AAISIFFYLLL-LISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAF 61
           A IS+FF+L+L LISFS+ T      G NGFTT+LFHRDS LSPL  SSL+HYDRL NAF
Sbjct: 3   ATISLFFHLILFLISFSQTTII---NGNNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAF 62

Query: 62  RRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWT 121
           RRS SR+AAL +  AAT  A  +QS I  GSG+YLMSVSIGTPPV Y+ IADTGSDLTW 
Sbjct: 63  RRSLSRSAALLNR-AATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWA 122

Query: 122 QCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYT 181
           QCLPC KCY Q +PIFNP KS+SF HVPC +Q CHA+DD  CGVQGVCDY YTYGDR+Y+
Sbjct: 123 QCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYS 182

Query: 182 KGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRF 241
           KGDLGF+KIT GSSSV +VIGCGH S GGFG ASG+IGLGGG+LSLVSQMS+ + +SRRF
Sbjct: 183 KGDLGFEKITIGSSSVKSVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 242

Query: 242 SYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADM 301
           SYCLPT+ S  NGKINFG+NAVVSGPGVVSTPL+ K   TYYY+TLEA+S+GNERH A  
Sbjct: 243 SYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMA-- 302

Query: 302 SSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCYAAEGN---GV 361
            + +GN+IIDSGTTLTILPKELYDGVVSSL+KVV+ +RV DP G   LC+    N    +
Sbjct: 303 FAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASL 362

Query: 362 DIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGYD 421
            IPVITAHF+GGA+V L+P+NTF+KVAD+V+CL L   SP   FGI+GNLAQ+NFLIGYD
Sbjct: 363 GIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYD 422

Query: 422 LEMRTLSFKPAVCA 432
           LE + LSFKP VCA
Sbjct: 423 LEAKRLSFKPTVCA 430

BLAST of Tan0006949 vs. ExPASy TrEMBL
Match: A0A5D3CYR2 (Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G003080 PE=3 SV=1)

HSP 1 Score: 590.5 bits (1521), Expect = 5.7e-165
Identity = 301/435 (69.20%), Postives = 353/435 (81.15%), Query Frame = 0

Query: 2   AAISIFFYL-LLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAF 61
           A ISIFF L LLLISFS+ T      G NGFTT+LFHRDS LSPL  S+L+HYDRL+NAF
Sbjct: 3   ATISIFFLLFLLLISFSQTTII---NGDNGFTTSLFHRDSLLSPLEFSTLSHYDRLSNAF 62

Query: 62  RRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWT 121
           RRS SR+AAL +   AT  A  +QSPI+ GSG+YLM VSIGTPPV Y+ + DTGSDLTW 
Sbjct: 63  RRSLSRSAALLNR-TATSGAVGLQSPIAPGSGEYLMYVSIGTPPVDYIGMIDTGSDLTWA 122

Query: 122 QCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGVQGVCDYRYTYGDRSYT 181
           QCLPC+KC+ Q +PIFNP KS+SF HVPC SQ+C AIDDA CGVQGVCDY YTYGD++YT
Sbjct: 123 QCLPCRKCFLQLRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYT 182

Query: 182 KGDLGFDKITFGSSSVNTVIGCGHESGGGFGHASGLIGLGGGELSLVSQMSRNAAVSRRF 241
           KGDLGF+KIT GSSSV +VIGCGHESGGGFG ASG+IGLGGG+LSLVSQMS+ + +SRRF
Sbjct: 183 KGDLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 242

Query: 242 SYCLPTVFSQTNGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNERHAADM 301
           SYCLP +    NGKINF QNAVVSGPGVVSTPL+ K P TYYY+TLEA+S+GNERH A  
Sbjct: 243 SYCLPPLLGHANGKINFAQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA-- 302

Query: 302 SSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCY-----AAEGN 361
           S+ +GN+IIDSGTTLT+LPKELYDGVVSSL+KVV+ +RV DP   + LC+      A  +
Sbjct: 303 SAKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASS 362

Query: 362 GVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIG 421
           G  IP+ITAHF+GGA+V L+PVNTF+KVA++V+CL L   SP   FGI+GNLAQ+NFLIG
Sbjct: 363 G--IPIITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIG 422

Query: 422 YDLEMRTLSFKPAVC 431
           YDLE + LSFKP VC
Sbjct: 423 YDLEAKRLSFKPTVC 429

BLAST of Tan0006949 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 368.2 bits (944), Expect = 8.8e-102
Identity = 210/439 (47.84%), Postives = 275/439 (62.64%), Query Frame = 0

Query: 6   IFFYLLLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFS 65
           IF  LL L+  S    Y      +GFT  L HRDSP SP +NS+ T   R+ NA RRS +
Sbjct: 5   IFATLLSLLLLSNVNAY----PKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRS-A 64

Query: 66  RAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPC 125
           R+   F +  A+P +   QS I+   G+YLM++SIGTPPV  +AIADTGSDL WTQC PC
Sbjct: 65  RSTLQFSNDDASPNSP--QSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC 124

Query: 126 QKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGV-QGVCDYRYTYGDRSYTKGDL 185
           + CY Q+ P+F+P +SS++R V C+S  C A++DA C   +  C Y  TYGD SYTKGD+
Sbjct: 125 EDCYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDV 184

Query: 186 GFDKITFGSSS------VNTVIGCGHESGGGFGHA-SGLIGLGGGELSLVSQMSRNAAVS 245
             D +T GSS        N +IGCGHE+ G F  A SG+IGLGGG  SLVSQ+ +  +++
Sbjct: 185 AVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRK--SIN 244

Query: 246 RRFSYCLPTVFSQT--NGKINFGQNAVVSGPGVVSTPLVPKIPKTYYYMTLEAVSVGNER 305
            +FSYCL    S+T    KINFG N +VSG GVVST +V K P TYY++ LEA+SVG+++
Sbjct: 245 GKFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKK 304

Query: 306 ---HAADMSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCYAA 365
               +    +  GN++IDSGTTLT+LP   Y  + S +   ++  RV DP G+  LCY  
Sbjct: 305 IQFTSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCY-R 364

Query: 366 EGNGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNF 425
           + +   +P IT HF GG DVKL  +NTF  V++DVSC A A         I GNLAQ NF
Sbjct: 365 DSSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAFAANEQ---LTIFGNLAQMNF 424

Query: 426 LIGYDLEMRTLSFKPAVCA 432
           L+GYD    T+SFK   C+
Sbjct: 425 LVGYDTVSGTVSFKKTDCS 429

BLAST of Tan0006949 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 349.0 bits (894), Expect = 5.5e-96
Identity = 200/450 (44.44%), Postives = 275/450 (61.11%), Query Frame = 0

Query: 10  LLLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFSRAAA 69
           LL    F   T    G   N F+  L HRDSPLSP++N  +T  DRLN AF RS SR+  
Sbjct: 6   LLCFFLFFSVTLSSSGHPKN-FSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 65

Query: 70  LFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKCY 129
             H  + T     +QS +    G++ MS++IGTPP+   AIADTGSDLTW QC PCQ+CY
Sbjct: 66  FNHQLSQTD----LQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCY 125

Query: 130 DQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGV---QGVCDYRYTYGDRSYTKGDLGF 189
            ++ PIF+  KSS+++  PC S+ C A+     G      +C YRY+YGD+S++KGD+  
Sbjct: 126 KENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVAT 185

Query: 190 DKITFGSSS------VNTVIGCGHESGGGFGH-ASGLIGLGGGELSLVSQMSRNAAVSRR 249
           + ++  S+S        TV GCG+ +GG F    SG+IGLGGG LSL+SQ+   +++S++
Sbjct: 186 ETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQL--GSSISKK 245

Query: 250 FSYCLPTVFSQTNGK--INFGQNAVVSG----PGVVSTPLVPKIPKTYYYMTLEAVSVG- 309
           FSYCL    + TNG   IN G N++ S      GVVSTPLV K P TYYY+TLEA+SVG 
Sbjct: 246 FSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGK 305

Query: 310 ----------NERHAADMSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVRG-RRVND 369
                     N      +S   GN+IIDSGTTLT+L    +D   S++ + V G +RV+D
Sbjct: 306 KKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSD 365

Query: 370 PRGLFGLCYAAEGNGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAPTSPKHGF 429
           P+GL   C+ +    + +P IT HF  GADV+L P+N F K+++D+ CL++ PT+     
Sbjct: 366 PQGLLSHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTE---V 425

Query: 430 GILGNLAQSNFLIGYDLEMRTLSFKPAVCA 432
            I GN AQ +FL+GYDLE RT+SF+   C+
Sbjct: 426 AIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of Tan0006949 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 341.7 bits (875), Expect = 8.8e-94
Identity = 195/419 (46.54%), Postives = 262/419 (62.53%), Query Frame = 0

Query: 30  GFTTTLFHRDSPLSPLHNSSLTHYDRLNNAFRRSFSRAAALFHHAAA--TPAAAIIQSPI 89
           GFT  L HRDSP SP +N   T   RL NA  RS +R   +FH      TP     Q  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQP---QIDL 89

Query: 90  SLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKCYDQSQPIFNPFKSSSFRHV 149
           +  SG+YLM+VSIGTPP   +AIADTGSDL WTQC PC  CY Q  P+F+P  SS+++ V
Sbjct: 90  TSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDV 149

Query: 150 PCTSQLCHAIDD-ALCGV-QGVCDYRYTYGDRSYTKGDLGFDKITFGSSSV------NTV 209
            C+S  C A+++ A C      C Y  +YGD SYTKG++  D +T GSS        N +
Sbjct: 150 SCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 209

Query: 210 IGCGHESGGGFG-HASGLIGLGGGELSLVSQMSRNAAVSRRFSYCLPTVFSQTN--GKIN 269
           IGCGH + G F    SG++GLGGG +SL+ Q+    ++  +FSYCL  + S+ +   KIN
Sbjct: 210 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQL--GDSIDGKFSYCLVPLTSKKDQTSKIN 269

Query: 270 FGQNAVVSGPGVVSTPLVPKI-PKTYYYMTLEAVSVGNER---HAADMSSARGNMIIDSG 329
           FG NA+VSG GVVSTPL+ K   +T+YY+TL+++SVG+++     +D  S+ GN+IIDSG
Sbjct: 270 FGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSG 329

Query: 330 TTLTILPKELYDGVVSSLVKVVRGRRVNDPRGLFGLCYAAEGNGVDIPVITAHFAGGADV 389
           TTLT+LP E Y  +  ++   +   +  DP+    LCY+A G+ + +PVIT HF  GADV
Sbjct: 330 TTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGD-LKVPVITMHF-DGADV 389

Query: 390 KLMPVNTFKKVADDVSCLALAPTSPKHGFGILGNLAQSNFLIGYDLEMRTLSFKPAVCA 432
           KL   N F +V++D+ C A    SP   F I GN+AQ NFL+GYD   +T+SFKP  CA
Sbjct: 390 KLDSSNAFVQVSEDLVCFAFR-GSP--SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of Tan0006949 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 330.9 bits (847), Expect = 1.6e-90
Identity = 197/457 (43.11%), Postives = 269/457 (58.86%), Query Frame = 0

Query: 1   MAAISIFFYLLLLISFSKATTYGGGRGGNGFTTTLFHRDSPLSPLHNSSLTHYDRLNNAF 60
           MA  +  +  LL ISF  A+     R     T  L HRDSP SPL+N   T  DRLN AF
Sbjct: 1   MATKTFLYCSLLAISFFFASNSSANR--ENLTVELIHRDSPHSPLYNPHHTVSDRLNAAF 60

Query: 61  RRSFSRAAALFHHAAATPAAAIIQSPISLGSGKYLMSVSIGTPPVAYVAIADTGSDLTWT 120
            RS SR+               +QS +    G+Y MS+SIGTPP    AIADTGSDLTW 
Sbjct: 61  LRSISRSRRF-------TTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWV 120

Query: 121 QCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQLCHAIDDALCGV---QGVCDYRYTYGDR 180
           QC PCQ+CY Q+ P+F+  KSS+++   C S+ C A+ +   G    + +C YRY+YGD 
Sbjct: 121 QCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDN 180

Query: 181 SYTKGDLGFDKITFGSSS------VNTVIGCGHESGGGFGH-ASGLIGLGGGELSLVSQM 240
           S+TKGD+  + I+  SSS        TV GCG+ +GG F    SG+IGLGGG LSLVSQ+
Sbjct: 181 SFTKGDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQL 240

Query: 241 SRNAAVSRRFSYCLPTVFSQTNGK--INFGQNAVVSGP----GVVSTPLVPKIPKTYYYM 300
              +++ ++FSYCL    + TNG   IN G N++ S P      ++TPL+ K P+TYY++
Sbjct: 241 --GSSIGKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFL 300

Query: 301 TLEAVSVGNERHA---------ADMSSARGNMIIDSGTTLTILPKELYDGVVSSLVKVVR 360
           TLEAV+VG  +              S   GN+IIDSGTTLT+L    YD   +++ + V 
Sbjct: 301 TLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVT 360

Query: 361 G-RRVNDPRGLFGLCYAAEGNGVDIPVITAHFAGGADVKLMPVNTFKKVADDVSCLALAP 420
           G +RV+DP+GL   C+ +    + +P IT HF   ADVKL P+N F K+ +D  CL++ P
Sbjct: 361 GAKRVSDPQGLLTHCFKSGDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIP 420

Query: 421 TSPKHGFGILGNLAQSNFLIGYDLEMRTLSFKPAVCA 432
           T+      I GN+ Q +FL+GYDLE +T+SF+   C+
Sbjct: 421 TTE---VAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442

BLAST of Tan0006949 vs. TAIR 10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 229.2 bits (583), Expect = 6.4e-60
Identity = 144/350 (41.14%), Postives = 186/350 (53.14%), Query Frame = 0

Query: 94  YLMSVSIGTPPVAYVAIADTGSDLTWTQCLPCQKCYDQSQPIFNPFKSSSFRHVPCTSQL 153
           YLM + +GTPP    AI DTGS++TWTQCLPC  CY+Q+ PIF+P KSS+F+   C    
Sbjct: 65  YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCDGH- 124

Query: 154 CHAIDDALCGVQGVCDYRYTYGDRSYTKGDLGFDKITFGSSS------VNTVIGCGHESG 213
                         C Y   Y D +YT G L  + IT  S+S        T+IGCGH + 
Sbjct: 125 -------------SCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNS 184

Query: 214 GGFGHASGLIGLGGGELSLVSQMSRNAAVSRRFSYCLPTVFSQTNGKINFGQNAVVSGPG 273
                 SG++GL  G  SL++QM          SYC      Q   KINFG NA+V+G G
Sbjct: 185 WFKPSFSGMVGLNWGPSSLITQM--GGEYPGLMSYCFS---GQGTSKINFGANAIVAGDG 244

Query: 274 VVSTPLVPKIPKT-YYYMTLEAVSVGN---ERHAADMSSARGNMIIDSGTTLTILPKELY 333
           VVST +     K  +YY+ L+AVSVGN   E       +  GN++IDSGTTLT  P    
Sbjct: 245 VVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTYFPVSYC 304

Query: 334 DGVVSSLVKVVRGRRVNDPRGLFGLCYAAEGNGVDI-PVITAHFAGGADVKLMPVNTFKK 393
           + V  ++  VV   R  DP G   LCY    + +DI PVIT HF+GG D+ L   N + +
Sbjct: 305 NLVRQAVEHVVTAVRAADPTGNDMLCY--NSDTIDIFPVITMHFSGGVDLVLDKYNMYME 364

Query: 394 VAD-DVSCLALAPTSPKHGFGILGNLAQSNFLIGYDLEMRTLSFKPAVCA 432
             +  V CLA+   SP     I GN AQ+NFL+GYD     +SF P  C+
Sbjct: 365 SNNGGVFCLAIICNSPTQE-AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3EBM57.8e-9544.44Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q6XBF81.2e-9246.54Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q766C31.0e-6237.68Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C22.8e-6037.15Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LHE32.1e-5231.57Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
XP_038889229.16.4e-17973.56aspartic proteinase CDR1-like [Benincasa hispida][more]
KAA0044967.15.1e-16870.18putative aspartic protease [Cucumis melo var. makuwa][more]
XP_008452153.17.3e-16770.28PREDICTED: probable aspartic protease At2g35615 [Cucumis melo] >TYK16501.1 putat... [more]
XP_004149005.36.2e-16669.82probable aspartic protease At2g35615 [Cucumis sativus] >KAE8649217.1 hypothetica... [more]
XP_008452152.11.2e-16469.20PREDICTED: probable aspartic protease At2g35615 [Cucumis melo] >TYK16500.1 putat... [more]
Match NameE-valueIdentityDescription
A0A5A7TSV32.5e-16870.18Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff... [more]
A0A5D3CX413.6e-16770.28Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... [more]
A0A1S3BUB03.6e-16770.28probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493257 PE=... [more]
A0A0A0KV202.3e-16669.82Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G05540... [more]
A0A5D3CYR25.7e-16569.20Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... [more]
Match NameE-valueIdentityDescription
AT1G64830.18.8e-10247.84Eukaryotic aspartyl protease family protein [more]
AT2G35615.15.5e-9644.44Eukaryotic aspartyl protease family protein [more]
AT5G33340.18.8e-9446.54Eukaryotic aspartyl protease family protein [more]
AT1G31450.11.6e-9043.11Eukaryotic aspartyl protease family protein [more]
AT2G28010.16.4e-6041.14Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 307..318
score: 42.36
coord: 100..120
score: 41.86
coord: 402..417
score: 25.92
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 263..431
e-value: 4.5E-43
score: 149.1
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 72..259
e-value: 1.7E-50
score: 173.7
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 87..430
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 282..426
e-value: 4.7E-26
score: 91.5
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 94..259
e-value: 2.2E-49
score: 168.1
NoneNo IPR availablePANTHERPTHR47967:SF66ASPARTIC PROTEINASE CDR1-RELATEDcoord: 6..430
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 6..430
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 307..318
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 94..426
score: 40.485886
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 94..430
e-value: 2.44473E-75
score: 234.079

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0006949.1Tan0006949.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity