Tan0000871 (gene) Snake gourd v1

Overview
Name: Tan0000871
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: aspartic proteinase CDR1-like
Location: LG01: 10226584 .. 10227885 (+)
RNA-Seq Expression: Tan0000871
Synteny: Tan0000871
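
The Location field packs the linkage group, start, end, and strand into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` and the regular expression are illustrative assumptions rather than anything the database provides, and the span calculation assumes 1-based inclusive coordinates.

```python
import re

def parse_location(text):
    """Split a location such as 'LG01: 10226584 .. 10227885 (+)' into its parts.

    Hypothetical helper: the field layout is inferred from this page only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("LG01: 10226584 .. 10227885 (+)")
span = end - start + 1  # assumes both coordinates are inclusive
print(seqid, start, end, strand, span)
```

Under that inclusive-coordinate assumption the feature spans 1302 positions on the plus strand of LG01.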
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCCATTTCAATTTTCTTCTATATTCTCCTCTTCTCCTTCACTGAAGCAACCACCAATGGCGGCGGCAATGGCTTCACCACTTCTCTATTCCACCGCGATTCTGTTCTCTCCCCTCTCTATAACCCATCTCTCTCCCGCTACAACCGCCTCACCAACGCCTTCCGCCGCTCCATCTCCCGCTCCGCCACCCTCCTCCACCGCGCCGCCGCCGTCTCCACCACCGGCATCCAATCCCCGATCATCCCCGACAGCGGCGAGTTCCTGATGTCCGTTTCCCTCGGAACCCCGCCGGTGGATGTCATCGCCATCGCCGACACCGGCAGCGACCTGACGTGGACCCAGTGCTTGCCATGTCACCAATGCTTCAACCAATCACTTCCTATTTTTAATCCACGTCAATCCTCCTCCTACCGTCGCGTGTCTTGCACGTCCGATGCCTGCCGCTCCCTCGACGACTCCAGCTGTGGGGCCGACCTCCGAACCTGCAGCTACGGCTACAGCTACGGAGACCGATCCTTCACGTACGGTGACCTAGCATCTGACAAAATTACCATCGGGTCCTTCAAACTCCGCAAGACAATCATCGGATGCGGCCACAATAACGGCGGCACTTTCGGCGGAGTTACCTCGGGAATTATCGGACTCGGCGGCGGCGCTCTCTCGTTGGTCTCTCAAATGAGAAAAATCGCCGCCGTCCAACGGCGGTTCTCATATTGCTTGCCACCCTTCTTCAGCGACACGAATGTCACAGGCAAAATAAACTTCGGTCGAAACGCCGTCGTATTGGGGCGTAAAGTCGTTTCTACCCCTCTTGTATCAAAATTTCCCGATACCTTCTATTATTTGACTCTTGAAGCAATCTCCGTCGCAAACAAGCGGATTGAAGCCGCGGACGACATGTCAGCCGCAGCAGAAAAAGGGAATATCATTATCGATTCTGGCACTACATTGACGCTTCTACCCTCGAAGTTGTACGACGGTGTCGTTTCGACTTTGGTGAGTGTTGTAAAAGCGAAGCGGGTGGATGATCCGACTGGAATTTTAGAACTCTGCTACGCCGCGGGAGGGGAAGATGATTTGGATATTCCGGTGATTACGGCACATTTTGCCGGCGGCGCGGACGTGAAGTTGTTGCCGTTGAATACATTTGCGATGGTGGCTGATAATGTGACTTGTTTGACATTGGCGCCGTCGTCGGATTTGGCCATTTTTGGGAACTTGGCGCAGATGAACTTTTTAGTCGGATATGATCTCGAACGGAAGAGGTTGTCGTTTAAAAATACCGTTTGTGCATAG

mRNA sequence

ATGGCTGCCATTTCAATTTTCTTCTATATTCTCCTCTTCTCCTTCACTGAAGCAACCACCAATGGCGGCGGCAATGGCTTCACCACTTCTCTATTCCACCGCGATTCTGTTCTCTCCCCTCTCTATAACCCATCTCTCTCCCGCTACAACCGCCTCACCAACGCCTTCCGCCGCTCCATCTCCCGCTCCGCCACCCTCCTCCACCGCGCCGCCGCCGTCTCCACCACCGGCATCCAATCCCCGATCATCCCCGACAGCGGCGAGTTCCTGATGTCCGTTTCCCTCGGAACCCCGCCGGTGGATGTCATCGCCATCGCCGACACCGGCAGCGACCTGACGTGGACCCAGTGCTTGCCATGTCACCAATGCTTCAACCAATCACTTCCTATTTTTAATCCACGTCAATCCTCCTCCTACCGTCGCGTGTCTTGCACGTCCGATGCCTGCCGCTCCCTCGACGACTCCAGCTGTGGGGCCGACCTCCGAACCTGCAGCTACGGCTACAGCTACGGAGACCGATCCTTCACGTACGGTGACCTAGCATCTGACAAAATTACCATCGGGTCCTTCAAACTCCGCAAGACAATCATCGGATGCGGCCACAATAACGGCGGCACTTTCGGCGGAGTTACCTCGGGAATTATCGGACTCGGCGGCGGCGCTCTCTCGTTGGTCTCTCAAATGAGAAAAATCGCCGCCGTCCAACGGCGGTTCTCATATTGCTTGCCACCCTTCTTCAGCGACACGAATGTCACAGGCAAAATAAACTTCGGTCGAAACGCCGTCGTATTGGGGCGTAAAGTCGTTTCTACCCCTCTTGTATCAAAATTTCCCGATACCTTCTATTATTTGACTCTTGAAGCAATCTCCGTCGCAAACAAGCGGATTGAAGCCGCGGACGACATGTCAGCCGCAGCAGAAAAAGGGAATATCATTATCGATTCTGGCACTACATTGACGCTTCTACCCTCGAAGTTGTACGACGGTGTCGTTTCGACTTTGGTGAGTGTTGTAAAAGCGAAGCGGGTGGATGATCCGACTGGAATTTTAGAACTCTGCTACGCCGCGGGAGGGGAAGATGATTTGGATATTCCGGTGATTACGGCACATTTTGCCGGCGGCGCGGACGTGAAGTTGTTGCCGTTGAATACATTTGCGATGGTGGCTGATAATGTGACTTGTTTGACATTGGCGCCGTCGTCGGATTTGGCCATTTTTGGGAACTTGGCGCAGATGAACTTTTTAGTCGGATATGATCTCGAACGGAAGAGGTTGTCGTTTAAAAATACCGTTTGTGCATAG

Coding sequence (CDS)

ATGGCTGCCATTTCAATTTTCTTCTATATTCTCCTCTTCTCCTTCACTGAAGCAACCACCAATGGCGGCGGCAATGGCTTCACCACTTCTCTATTCCACCGCGATTCTGTTCTCTCCCCTCTCTATAACCCATCTCTCTCCCGCTACAACCGCCTCACCAACGCCTTCCGCCGCTCCATCTCCCGCTCCGCCACCCTCCTCCACCGCGCCGCCGCCGTCTCCACCACCGGCATCCAATCCCCGATCATCCCCGACAGCGGCGAGTTCCTGATGTCCGTTTCCCTCGGAACCCCGCCGGTGGATGTCATCGCCATCGCCGACACCGGCAGCGACCTGACGTGGACCCAGTGCTTGCCATGTCACCAATGCTTCAACCAATCACTTCCTATTTTTAATCCACGTCAATCCTCCTCCTACCGTCGCGTGTCTTGCACGTCCGATGCCTGCCGCTCCCTCGACGACTCCAGCTGTGGGGCCGACCTCCGAACCTGCAGCTACGGCTACAGCTACGGAGACCGATCCTTCACGTACGGTGACCTAGCATCTGACAAAATTACCATCGGGTCCTTCAAACTCCGCAAGACAATCATCGGATGCGGCCACAATAACGGCGGCACTTTCGGCGGAGTTACCTCGGGAATTATCGGACTCGGCGGCGGCGCTCTCTCGTTGGTCTCTCAAATGAGAAAAATCGCCGCCGTCCAACGGCGGTTCTCATATTGCTTGCCACCCTTCTTCAGCGACACGAATGTCACAGGCAAAATAAACTTCGGTCGAAACGCCGTCGTATTGGGGCGTAAAGTCGTTTCTACCCCTCTTGTATCAAAATTTCCCGATACCTTCTATTATTTGACTCTTGAAGCAATCTCCGTCGCAAACAAGCGGATTGAAGCCGCGGACGACATGTCAGCCGCAGCAGAAAAAGGGAATATCATTATCGATTCTGGCACTACATTGACGCTTCTACCCTCGAAGTTGTACGACGGTGTCGTTTCGACTTTGGTGAGTGTTGTAAAAGCGAAGCGGGTGGATGATCCGACTGGAATTTTAGAACTCTGCTACGCCGCGGGAGGGGAAGATGATTTGGATATTCCGGTGATTACGGCACATTTTGCCGGCGGCGCGGACGTGAAGTTGTTGCCGTTGAATACATTTGCGATGGTGGCTGATAATGTGACTTGTTTGACATTGGCGCCGTCGTCGGATTTGGCCATTTTTGGGAACTTGGCGCAGATGAACTTTTTAGTCGGATATGATCTCGAACGGAAGAGGTTGTCGTTTAAAAATACCGTTTGTGCATAG

Protein sequence

MAAISIFFYILLFSFTEATTNGGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPCHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGDLASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFSYCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYDLERKRLSFKNTVCA
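
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below shows that relationship for the first and last few codons of the CDS, assuming the standard genetic code; the function name `translate` and the table construction are illustrative and not taken from the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last six codons of the CDS listed above.
print(translate("ATGGCTGCCATTTCAATTTTCTTCTATATT"))
print(translate("AATACCGTTTGTGCATAG"))
```

The two fragments translate to the start (MAAISIFFYI) and the end (NTVCA) of the protein sequence; passing the full CDS string to the same function would be the natural way to check the whole record.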
Homology
BLAST of Tan0000871 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 1.2e-103
Identity = 207/415 (49.88%), Positives = 275/415 (66.27%), Query Frame = 0

Query: 26  GFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSISRSATLLHRAAAVSTTGIQSPIIPD 85
           GFT  L HRDS  SP YNP  +   RL NA  RS++R   + H     +T   Q  +  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQPQIDLTSN 89

Query: 86  SGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPCHQCFNQSLPIFNPRQSSSYRRVSCT 145
           SGE+LM+VS+GTPP  ++AIADTGSDL WTQC PC  C+ Q  P+F+P+ SS+Y+ VSC+
Sbjct: 90  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 149

Query: 146 SDACRSLDD-SSCGADLRTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLRKTIIGC 205
           S  C +L++ +SC  +  TCSY  SYGD S+T G++A D +T+GS      +L+  IIGC
Sbjct: 150 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 209

Query: 206 GHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFSYCLPPFFSDTNVTGKINFGR 265
           GHNN GTF    SGI+GLGGG +SL+ Q+    ++  +FSYCL P  S  + T KINFG 
Sbjct: 210 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGT 269

Query: 266 NAVVLGRKVVSTPLVSK-FPDTFYYLTLEAISVANKRIEAADDMSAAAEKGNIIIDSGTT 325
           NA+V G  VVSTPL++K   +TFYYLTL++ISV +K+I+ +   S ++E GNIIIDSGTT
Sbjct: 270 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSE-GNIIIDSGTT 329

Query: 326 LTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGEDDLDIPVITAHFAGGADVK 385
           LTLLP++ Y  +   + S + A++  DP   L LCY+A G  DL +PVIT HF  GADVK
Sbjct: 330 LTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKVPVITMHF-DGADVK 389

Query: 386 LLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYDLERKRLSFKNTVCA 434
           L   N F  V++++ C     S   +I+GN+AQMNFLVGYD   K +SFK T CA
Sbjct: 390 LDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
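
The percentages in each HSP summary are the matching-column counts divided by the alignment length. As a small sketch using the figures reported for the Q6XBF8 hit above (the helper name `percent` is an illustrative assumption):

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of alignment columns, to two decimals."""
    return f"{100 * matches / alignment_length:.2f}"

# Counts reported for the Q6XBF8 (Arabidopsis thaliana CDR1) HSP.
identity = percent(207, 415)
positives = percent(275, 415)
print(identity, positives)
```

Both values agree with the percentages printed in the summary line for that hit.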

BLAST of Tan0000871 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.1e-100
Identity = 209/452 (46.24%), Positives = 290/452 (64.16%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNGGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSI 60
           MA   +  + L FS T  +++G    F+  L HRDS LSP+YNP ++  +RL  AF RS+
Sbjct: 1   MATQILLCFFLFFSVT-LSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSV 60

Query: 61  SRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPC 120
           SRS    H+   +S T +QS +I   GEF MS+++GTPP+ V AIADTGSDLTW QC PC
Sbjct: 61  SRSRRFNHQ---LSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC 120

Query: 121 HQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGAD--LRTCSYGYSYGDRSFTYG 180
            QC+ ++ PIF+ ++SS+Y+   C S  C++L  +  G D     C Y YSYGD+SF+ G
Sbjct: 121 QQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKG 180

Query: 181 DLASDKITIGS-----FKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAA 240
           D+A++ ++I S          T+ GCG+NNGGTF    SGIIGLGGG LSL+SQ+   ++
Sbjct: 181 DVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SS 240

Query: 241 VQRRFSYCLPPFFSDTNVTGKINFGRNAVVLGRK----VVSTPLVSKFPDTFYYLTLEAI 300
           + ++FSYCL    + TN T  IN G N++         VVSTPLV K P T+YYLTLEAI
Sbjct: 241 ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAI 300

Query: 301 SVANKRI-------EAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTL-VSVVKAK 360
           SV  K+I          DD   +   GNIIIDSGTTLTLL +  +D   S +  SV  AK
Sbjct: 301 SVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAK 360

Query: 361 RVDDPTGILELCYAAGGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSS 420
           RV DP G+L  C+ +G   ++ +P IT HF  GADV+L P+N F  +++++ CL++ P++
Sbjct: 361 RVSDPQGLLSHCFKSGSA-EIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTT 420

Query: 421 DLAIFGNLAQMNFLVGYDLERKRLSFKNTVCA 434
           ++AI+GN AQM+FLVGYDLE + +SF++  C+
Sbjct: 421 EVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of Tan0000871 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 2.2e-60
Identity = 151/414 (36.47%), Positives = 219/414 (52.90%), Query Frame = 0

Query: 26  GFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSISRSATLLHRAAAV--STTGIQSPII 85
           GF   L H DS        +L+++  L     R+I R +  L R  A+    +G+++ + 
Sbjct: 40  GFQIMLEHVDS------GKNLTKFQLL----ERAIERGSRRLQRLEAMLNGPSGVETSVY 99

Query: 86  PDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPCHQCFNQSLPIFNPRQSSSYRRVS 145
              GE+LM++S+GTP     AI DTGSDL WTQC PC QCFNQS PIFNP+ SSS+  + 
Sbjct: 100 AGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLP 159

Query: 146 CTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGDLASDKITIGSFKLRKTIIGCGHNN 205
           C+S  C++L   +C  +   C Y Y YGD S T G + ++ +T GS  +     GCG NN
Sbjct: 160 CSSQLCQALSSPTCSNNF--CQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENN 219

Query: 206 GGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFSYCLPPFFSDTNVTGKINFGRNAVV 265
            G   G  +G++G+G G LSL SQ+        +FSYC+ P  S T     +    N+V 
Sbjct: 220 QGFGQGNGAGLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSTPSNLLLGSLANSVT 279

Query: 266 LGRKVVSTPLVSKFPDTFYYLTLEAISVANKRI---EAADDMSAAAEKGNIIIDSGTTLT 325
            G    +    S+ P TFYY+TL  +SV + R+    +A  +++    G IIIDSGTTLT
Sbjct: 280 AGSPNTTLIQSSQIP-TFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLT 339

Query: 326 LLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGE-DDLDIPVITAHFAGGADVKL 385
              +  Y  V    +S +    V+  +   +LC+    +  +L IP    HF GG D++L
Sbjct: 340 YFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLEL 399

Query: 386 LPLNTFAMVADNVTCLTLAPSSD-LAIFGNLAQMNFLVGYDLERKRLSFKNTVC 433
              N F   ++ + CL +  SS  ++IFGN+ Q N LV YD     +SF +  C
Sbjct: 400 PSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Tan0000871 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 3.6e-55
Identity = 142/394 (36.04%), Positives = 205/394 (52.03%), Query Frame = 0

Query: 45  SLSRYNRLTNAFRRSISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIA 104
           +L++Y  +  A +R   R  ++   A   S++GI++P+    GE+LM+V++GTP     A
Sbjct: 54  NLTKYELIKRAIKRGERRMRSI--NAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSA 113

Query: 105 IADTGSDLTWTQCLPCHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTC 164
           I DTGSDL WTQC PC QCF+Q  PIFNP+ SSS+  + C S  C+ L   +C  +   C
Sbjct: 114 IMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN--EC 173

Query: 165 SYGYSYGDRSFTYGDLASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSL 224
            Y Y YGD S T G +A++  T  +  +     GCG +N G   G  +G+IG+G G LSL
Sbjct: 174 QYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSL 233

Query: 225 VSQMRKIAAVQRRFSYCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLV-SKFPDTFYY 284
            SQ+        +FSYC+  + S +  T  +    + V  G    ST L+ S    T+YY
Sbjct: 234 PSQLG-----VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTYYY 293

Query: 285 LTLEAISVA--NKRIEAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAK 344
           +TL+ I+V   N  I ++         G +IIDSGTTLT LP   Y+ V       +   
Sbjct: 294 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLP 353

Query: 345 RVDDPTGILELCYAAGGE-DDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPS 404
            VD+ +  L  C+    +   + +P I+  F GG  + L   N     A+ V CL +  S
Sbjct: 354 TVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLAMGSS 413

Query: 405 SDL--AIFGNLAQMNFLVGYDLERKRLSFKNTVC 433
           S L  +IFGN+ Q    V YDL+   +SF  T C
Sbjct: 414 SQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Tan0000871 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.8e-54
Identity = 130/366 (35.52%), Positives = 200/366 (54.64%), Query Frame = 0

Query: 75  TTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPCHQCFNQSLPIFNPR 134
           TT + S     SGE+   + +GTP  ++  + DTGSD+ W QC PC  C+ QS P+FNP 
Sbjct: 148 TTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPT 207

Query: 135 QSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGDLASDKITIG-SFKLR 194
            SS+Y+ ++C++  C  L+ S+C ++   C Y  SYGD SFT G+LA+D +T G S K+ 
Sbjct: 208 SSSTYKSLTCSAPQCSLLETSACRSN--KCLYQVSYGDGSFTVGELATDTVTFGNSGKIN 267

Query: 195 KTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFSYCLPPFFSDTNVTG 254
              +GCGH+N G F G  +G++GLGGG LS+ +QM+  +     FSYCL     D+  + 
Sbjct: 268 NVALGCGHDNEGLFTG-AAGLLGLGGGVLSITNQMKATS-----FSYCLVD--RDSGKSS 327

Query: 255 KINFGRNAVVLGRKVVSTPLV-SKFPDTFYYLTLEAISVANKRI---EAADDMSAAAEKG 314
            ++F  N+V LG    + PL+ +K  DTFYY+ L   SV  +++   +A  D+ A+   G
Sbjct: 328 SLDF--NSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASG-SG 387

Query: 315 NIIIDSGTTLTLLPSKLYDGVVSTLVSV-VKAKRVDDPTGILELCYAAGGEDDLDIPVIT 374
            +I+D GT +T L ++ Y+ +    + + V  K+      + + CY       + +P + 
Sbjct: 388 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVA 447

Query: 375 AHFAGGADVKLLPLNTFAMVADNVT-CLTLAP-SSDLAIFGNLAQMNFLVGYDLERKRLS 433
            HF GG  + L   N    V D+ T C   AP SS L+I GN+ Q    + YDL +  + 
Sbjct: 448 FHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIG 500

BLAST of Tan0000871 vs. NCBI nr
Match: XP_038889220.1 (aspartic proteinase CDR1-like [Benincasa hispida])

HSP 1 Score: 699.9 bits (1805), Expect = 1.4e-197
Identity = 355/433 (81.99%), Positives = 387/433 (89.38%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTN-GGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRS 60
           MAAISIFFY LLFSF EATTN GGGNGFTTSLFHRDS+LSPL+N SLS ++R TNAFRRS
Sbjct: 1   MAAISIFFYFLLFSFAEATTNRGGGNGFTTSLFHRDSLLSPLHNSSLSCHDRRTNAFRRS 60

Query: 61  ISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLP 120
            SRSATLL    AVST  I SPIIP+SGEFLMSVS+GTPPVD IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLSHVNAVSTACIHSPIIPNSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLP 120

Query: 121 CHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGD 180
           C +CFNQS P+FNPR+SSSYR VSCTSD CRSLD   CG DL+TCSYGYSYGDRSFTYGD
Sbjct: 121 CQKCFNQSHPMFNPRRSSSYRNVSCTSDTCRSLDSYHCGTDLQTCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFS 240
           LASDKITI SFKL KT+IGCGH NGGTFGGVTSGIIGLGGGALSLVSQM  IAA++R+FS
Sbjct: 181 LASDKITIESFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFS 240

Query: 241 YCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAA 300
           YCLP FFSD N+TGKI+FG+NAVV G KV+STPLVS+ PDTFY+LTLEAISVANKR++AA
Sbjct: 241 YCLPTFFSDENITGKISFGQNAVVSGPKVISTPLVSRSPDTFYFLTLEAISVANKRLKAA 300

Query: 301 DDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGE 360
           DD SA   +GNIIIDSGTTLT LP  LY+ +VSTLVSV+KAKRVDDP+GILELCYAAGG 
Sbjct: 301 DDTSALTRRGNIIIDSGTTLTFLPRNLYEDLVSTLVSVIKAKRVDDPSGILELCYAAGGG 360

Query: 361 DDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYD 420
           DDL IPVI AHFAGGADVKLLPLNTFA+VA+NVTCLTLAP+SDLAIFGNLAQ+NF+VGYD
Sbjct: 361 DDLHIPVIIAHFAGGADVKLLPLNTFALVAENVTCLTLAPASDLAIFGNLAQINFIVGYD 420

Query: 421 LERKRLSFKNTVC 433
           LE KRLSFK TVC
Sbjct: 421 LENKRLSFKPTVC 433

BLAST of Tan0000871 vs. NCBI nr
Match: XP_023543528.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 674.1 bits (1738), Expect = 8.1e-190
Identity = 339/437 (77.57%), Positives = 381/437 (87.19%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNG----GGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAF 60
           MAAISIFF   L SF++AT +G    GG+GFTTSLFHRDS LSPLYNPSLS Y+RLTNAF
Sbjct: 1   MAAISIFFCFFLISFSQATVHGGVGDGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60

Query: 61  RRSISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQ 120
           RRS SRS TLL+RAAAVSTTGI S IIPD GEFLMS+S+GTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSDTLLNRAAAVSTTGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120

Query: 121 CLPCHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFT 180
           C+PCH+CFNQS PIFNPR+S SYR VSCTS+ACRSLDD  CG D RTCSYGYSYGD+SFT
Sbjct: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180

Query: 181 YGDLASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQR 240
           YGDLAS+KIT+GSFKL KT+IGCGH NGGTF G TSGIIGLGGG LSL+SQMRKIAAV+R
Sbjct: 181 YGDLASEKITVGSFKLYKTVIGCGHVNGGTFSGDTSGIIGLGGGPLSLISQMRKIAAVKR 240

Query: 241 RFSYCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRI 300
           RFSYCLP FFSD NVTGKI+FG+ A+V GRKV+STPLV K P+TFYY+TL+A+SVANKR 
Sbjct: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVISTPLVLKEPNTFYYVTLKAMSVANKRF 300

Query: 301 EAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAA 360
           +AA++MSAA E+GNI+IDSGTTLT+LP  LY GV STL  VVKAKRV+DPTG+L+LC+A 
Sbjct: 301 KAANNMSAAVERGNILIDSGTTLTILPPNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAT 360

Query: 361 GGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLV 420
              D L+IPVITAHFAGGADVKLLPLNTFAMVADNV CL   PS++ AIFGNLAQ+NFLV
Sbjct: 361 RSVDHLNIPVITAHFAGGADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420

Query: 421 GYDLERKRLSFKNTVCA 434
           GYDLERKRLSFK  VCA
Sbjct: 421 GYDLERKRLSFKYNVCA 437

BLAST of Tan0000871 vs. NCBI nr
Match: XP_022942027.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 669.8 bits (1727), Expect = 1.5e-188
Identity = 340/437 (77.80%), Positives = 379/437 (86.73%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTN----GGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAF 60
           MAAISIFF + L SF++AT +    GGG+GFTTSLFHRDS LSPLYNPSLS Y+RLTNAF
Sbjct: 1   MAAISIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60

Query: 61  RRSISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQ 120
           RRS SRS TLL+RAAAVS TGI S IIPD GEFLMS+S+GTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120

Query: 121 CLPCHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFT 180
           C+PCH+CFNQS PIFNPR+S SYR VSCTS+ACRSLDD  CG D RTCSYGYSYGD+SFT
Sbjct: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180

Query: 181 YGDLASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQR 240
           YGDLAS+KITIGSFKL KT+IGCGH NGGTF   TSGIIGLGGG LSL+SQMRKIAAV+R
Sbjct: 181 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKR 240

Query: 241 RFSYCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRI 300
           RFSYCLP FFSD NVTGKI+FG+ A+V GRKVVSTPLV K P+TFYYLTLEA+SVANKR 
Sbjct: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 300

Query: 301 EAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAA 360
           +AA++MS A E+GNI+IDSGTTLT+LP  LY GV STL  VVKAKRV+DPTG+L+LC+AA
Sbjct: 301 KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAA 360

Query: 361 GGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLV 420
              D L+IPVITAHFAG ADVKLLPLNTFAMVADNV CL   PS++ AIFGNLAQ+NFLV
Sbjct: 361 CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420

Query: 421 GYDLERKRLSFKNTVCA 434
           GYDLERKRLSFK  VCA
Sbjct: 421 GYDLERKRLSFKYNVCA 437

BLAST of Tan0000871 vs. NCBI nr
Match: KAG6600420.1 (putative aspartic protease, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 669.5 bits (1726), Expect = 2.0e-188
Identity = 339/437 (77.57%), Positives = 382/437 (87.41%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTN----GGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAF 60
           MAAISIFF + L SF++AT +    GGG+GFTTSLFHRDS LSPLYNPSLS Y+RLTNAF
Sbjct: 1   MAAISIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60

Query: 61  RRSISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQ 120
           RRS SRS TLL+RAAAVS TGI S IIPD+GEFLMS+S+GTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSDTLLNRAAAVSITGIHSRIIPDNGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120

Query: 121 CLPCHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFT 180
           C+PCH+CFNQS PIFNPR+S SYR VSCTS+ACRSLDD  CG + RTCSYGYSYGD+SFT
Sbjct: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPNNRTCSYGYSYGDQSFT 180

Query: 181 YGDLASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQR 240
           YGDLAS+KITIGSFKL KT+IGCGH NGGTF   TSGIIGLGGG LSL+SQMRKIAAV+R
Sbjct: 181 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSKDTSGIIGLGGGPLSLISQMRKIAAVKR 240

Query: 241 RFSYCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRI 300
           RFSYCLP FFSD NVTGKI+FG+ A+VLGRKVVSTPLV K P+TFYYLTLEA+SVANKR 
Sbjct: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVLGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 300

Query: 301 EAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAA 360
           +AA++MSAA E+GNI+IDSGTTLT+LP  LY GV STL  VVKAKRV+DPTG+L+LC+AA
Sbjct: 301 KAANNMSAAVEQGNILIDSGTTLTILPQNLYKGVASTLARVVKAKRVNDPTGVLDLCFAA 360

Query: 361 GGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLV 420
              D L+IPVITAHFAG ADVKLLPLNTFAMVADNV CL   PS++ AIFGNLAQ+NFLV
Sbjct: 361 CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420

Query: 421 GYDLERKRLSFKNTVCA 434
           GYDLERKR+SF+  VCA
Sbjct: 421 GYDLERKRVSFEYNVCA 437

BLAST of Tan0000871 vs. NCBI nr
Match: KAA0044968.1 (putative aspartic protease [Cucumis melo var. makuwa])

HSP 1 Score: 655.2 bits (1689), Expect = 3.9e-184
Identity = 331/434 (76.27%), Positives = 377/434 (86.87%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNGGG-NGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRS 60
           M  ISIFFY LLF  ++AT +GGG +GFTTSLFHRDS+LSPL+NPSLSRY+ L  +FRRS
Sbjct: 1   MPVISIFFYFLLFFSSKATAHGGGHHGFTTSLFHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  ISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLP 120
            SRSATLL+   +VST  I+SPIIPDSGEFLMS+ +GTP V+ IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGD 180
           C +CFNQS PIFNPR+SSSYR+VSC+SD CRSL+ S CG DL++CSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFS 240
           LASDKITIGSFKL KT+IGCGH NGGTFGGVTSGIIGLGGG+LSLVSQM  IA V+ +FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAA 300
           YCLP FFS+ N+TGKI+FGR AVV GR+VVSTPLV + PDTFY+LTLEAISV NKR +AA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 DDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGE 360
            DMSA   +GNIIIDSGTTLTLLP  LYDGVVSTL  V+KAKRVDDP+GILELCY+AG  
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKAKRVDDPSGILELCYSAGQL 360

Query: 361 DDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYD 420
           +DL+IP+ITAHF+G ADVKLLP+NTFA VADNV CLTLAP++++AIFGNLAQ+NF VGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LERKRLSFKNTVCA 434
           L  KRLSFK T CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of Tan0000871 vs. ExPASy TrEMBL
Match: A0A6J1FP39 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111447216 PE=3 SV=1)

HSP 1 Score: 669.8 bits (1727), Expect = 7.4e-189
Identity = 340/437 (77.80%), Positives = 379/437 (86.73%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTN----GGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAF 60
           MAAISIFF + L SF++AT +    GGG+GFTTSLFHRDS LSPLYNPSLS Y+RLTNAF
Sbjct: 1   MAAISIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60

Query: 61  RRSISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQ 120
           RRS SRS TLL+RAAAVS TGI S IIPD GEFLMS+S+GTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120

Query: 121 CLPCHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFT 180
           C+PCH+CFNQS PIFNPR+S SYR VSCTS+ACRSLDD  CG D RTCSYGYSYGD+SFT
Sbjct: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180

Query: 181 YGDLASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQR 240
           YGDLAS+KITIGSFKL KT+IGCGH NGGTF   TSGIIGLGGG LSL+SQMRKIAAV+R
Sbjct: 181 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKR 240

Query: 241 RFSYCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRI 300
           RFSYCLP FFSD NVTGKI+FG+ A+V GRKVVSTPLV K P+TFYYLTLEA+SVANKR 
Sbjct: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 300

Query: 301 EAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAA 360
           +AA++MS A E+GNI+IDSGTTLT+LP  LY GV STL  VVKAKRV+DPTG+L+LC+AA
Sbjct: 301 KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAA 360

Query: 361 GGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLV 420
              D L+IPVITAHFAG ADVKLLPLNTFAMVADNV CL   PS++ AIFGNLAQ+NFLV
Sbjct: 361 CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420

Query: 421 GYDLERKRLSFKNTVCA 434
           GYDLERKRLSFK  VCA
Sbjct: 421 GYDLERKRLSFKYNVCA 437

BLAST of Tan0000871 vs. ExPASy TrEMBL
Match: A0A5A7TPZ5 (Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G002740 PE=3 SV=1)

HSP 1 Score: 655.2 bits (1689), Expect = 1.9e-184
Identity = 331/434 (76.27%), Positives = 377/434 (86.87%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNGGG-NGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRS 60
           M  ISIFFY LLF  ++AT +GGG +GFTTSLFHRDS+LSPL+NPSLSRY+ L  +FRRS
Sbjct: 1   MPVISIFFYFLLFFSSKATAHGGGHHGFTTSLFHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  ISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLP 120
            SRSATLL+   +VST  I+SPIIPDSGEFLMS+ +GTP V+ IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGD 180
           C +CFNQS PIFNPR+SSSYR+VSC+SD CRSL+ S CG DL++CSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFS 240
           LASDKITIGSFKL KT+IGCGH NGGTFGGVTSGIIGLGGG+LSLVSQM  IA V+ +FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAA 300
           YCLP FFS+ N+TGKI+FGR AVV GR+VVSTPLV + PDTFY+LTLEAISV NKR +AA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 DDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGE 360
            DMSA   +GNIIIDSGTTLTLLP  LYDGVVSTL  V+KAKRVDDP+GILELCY+AG  
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKAKRVDDPSGILELCYSAGQL 360

Query: 361 DDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYD 420
           +DL+IP+ITAHF+G ADVKLLP+NTFA VADNV CLTLAP++++AIFGNLAQ+NF VGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LERKRLSFKNTVCA 434
           L  KRLSFK T CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of Tan0000871 vs. ExPASy TrEMBL
Match: A0A5D3D1Z7 (Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G003070 PE=3 SV=1)

HSP 1 Score: 654.1 bits (1686), Expect = 4.2e-184
Identity = 330/434 (76.04%), Positives = 377/434 (86.87%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNGGG-NGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRS 60
           M AISIFFY LLF  ++AT +GGG +GFTTSL+HRDS+LSPL+NPSLSRY+ L  +FRRS
Sbjct: 1   MPAISIFFYFLLFFSSKATAHGGGHHGFTTSLYHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  ISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLP 120
            SRSATLL+   +VST  I+SPIIPDSGEFLMS+ +GTP V+ IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGD 180
           C +CFNQS PIFNPR+SSSYR+VSC+SD CRSL+ S CG DL++CSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFS 240
           LASDKITIGSFKL KT+IGCGH NGGTFGGVTSGIIGLGGG+LSLVSQM  IA V+ +FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAA 300
           YCLP FFS+ N+TGKI+FGR AVV GR+VVSTPLV + PDTFY+LTLEAISV NKR +AA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 DDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGE 360
            DMSA   +GNIIIDSGTTLTLLP  LYDGVVSTL  V+K KRVDDP+GILELCY+AG  
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKTKRVDDPSGILELCYSAGQL 360

Query: 361 DDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYD 420
           +DL+IP+ITAHF+G ADVKLLP+NTFA VADNV CLTLAP++++AIFGNLAQ+NF VGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LERKRLSFKNTVCA 434
           L  KRLSFK T CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of Tan0000871 vs. ExPASy TrEMBL
Match: A0A1S3BT75 (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=3 SV=1)

HSP 1 Score: 654.1 bits (1686), Expect = 4.2e-184
Identity = 330/434 (76.04%), Positives = 377/434 (86.87%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNGGG-NGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRS 60
           M AISIFFY LLF  ++AT +GGG +GFTTSL+HRDS+LSPL+NPSLSRY+ L  +FRRS
Sbjct: 1   MPAISIFFYFLLFFSSKATAHGGGHHGFTTSLYHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  ISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLP 120
            SRSATLL+   +VST  I+SPIIPDSGEFLMS+ +GTP V+ IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGD 180
           C +CFNQS PIFNPR+SSSYR+VSC+SD CRSL+ S CG DL++CSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFS 240
           LASDKITIGSFKL KT+IGCGH NGGTFGGVTSGIIGLGGG+LSLVSQM  IA V+ +FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAA 300
           YCLP FFS+ N+TGKI+FGR AVV GR+VVSTPLV + PDTFY+LTLEAISV NKR +AA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 DDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGE 360
            DMSA   +GNIIIDSGTTLTLLP  LYDGVVSTL  V+K KRVDDP+GILELCY+AG  
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKTKRVDDPSGILELCYSAGQL 360

Query: 361 DDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYD 420
           +DL+IP+ITAHF+G ADVKLLP+NTFA VADNV CLTLAP++++AIFGNLAQ+NF VGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LERKRLSFKNTVCA 434
           L  KRLSFK T CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of Tan0000871 vs. ExPASy TrEMBL
Match: A0A0A0KZZ3 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 647.9 bits (1670), Expect = 3.0e-182
Identity = 329/434 (75.81%), Positives = 372/434 (85.71%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNGGG-NGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRS 60
           MAAISIFFY LLF  ++ T +GGG +GFTTSLF RDS LSPL+NPSLSRY+ L +AFRRS
Sbjct: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  ISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLP 120
            SRSATLL    +VST  I+SPIIPDSGEFLMS+ +GTPPV+VIAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGD 180
           C +CFNQS PIFNPR+SSSYR+VSC SD CRSL+   CG DL++CSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFS 240
           LASD+ITIGSFKL KT+IGCGH NGGTFGGVTSGIIGLGGG+LSLVSQMR IA V+ RFS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAA 300
           YCLP FFS+ N+TG I+FGR AVV GR+VVSTPLV + PDTFY+LTLEAISV  KR +AA
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 DDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGE 360
           + +SA    GNIIIDSGTTLTLLP  LY GV STL  V+KAKRVDDP+GILELCY+AG  
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 DDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYD 420
           DDL+IP+ITAHFAGGADVKLLP+NTFA VADNVTCLT AP++ +AIFGNLAQ+NF VGYD
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420

Query: 421 LERKRLSFKNTVCA 434
           L  KRLSF+  +CA
Sbjct: 421 LGNKRLSFEPKLCA 434
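
The percentages in each HSP summary line follow directly from the raw counts: matches divided by alignment length, where the alignment length counts gap columns as well. The following is a minimal sketch that recomputes them for the A0A0A0KZZ3 hit. The function name `recompute_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tooling; the pattern accepts both "Positives" and the "Postives" spelling that some exports use.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP summary line.
# "Posi?tives" tolerates the misspelling "Postives" found in some exports.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


line = "Identity = 329/434 (75.81%), Positives = 372/434 (85.71%), Query Frame = 0"
print(recompute_percentages(line))  # (75.81, 85.71)
```

The same check applies to the other hits, for example 220/414 for AT1G64830.1.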

BLAST of Tan0000871 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 409.5 bits (1051), Expect = 3.5e-114
Identity = 220/414 (53.14%), Positives = 277/414 (66.91%), Query Frame = 0

Query: 25  NGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSISRSATLLHRAAAVSTTGIQSPIIP 84
           +GFT  L HRDS  SP YN + +   R+ NA RR  S  +TL       S    QS I  
Sbjct: 24  DGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRR--SARSTLQFSNDDASPNSPQSFITS 83

Query: 85  DSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPCHQCFNQSLPIFNPRQSSSYRRVSC 144
           + GE+LM++S+GTPPV ++AIADTGSDL WTQC PC  C+ Q+ P+F+P++SS+YR+VSC
Sbjct: 84  NRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSC 143

Query: 145 TSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLRKTIIGC 204
           +S  CR+L+D+SC  D  TCSY  +YGD S+T GD+A D +T+GS       LR  IIGC
Sbjct: 144 SSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGC 203

Query: 205 GHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFSYCLPPFFSDTNVTGKINFGR 264
           GH N GTF    SGIIGLGGG+ SLVSQ+RK  ++  +FSYCL PF S+T +T KINFG 
Sbjct: 204 GHENTGTFDPAGSGIIGLGGGSTSLVSQLRK--SINGKFSYCLVPFTSETGLTSKINFGT 263

Query: 265 NAVVLGRKVVSTPLVSKFPDTFYYLTLEAISVANKRIEAADDMSAAAEKGNIIIDSGTTL 324
           N +V G  VVST +V K P T+Y+L LEAISV +K+I+    +    E GNI+IDSGTTL
Sbjct: 264 NGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGE-GNIVIDSGTTL 323

Query: 325 TLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGEDDLDIPVITAHFAGGADVKL 384
           TLLPS  Y  + S + S +KA+RV DP GIL LCY         +P IT HF GG DVKL
Sbjct: 324 TLLPSNFYYELESVVASTIKAERVQDPDGILSLCYR--DSSSFKVPDITVHFKGG-DVKL 383

Query: 385 LPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYDLERKRLSFKNTVCA 434
             LNTF  V+++V+C   A +  L IFGNLAQMNFLVGYD     +SFK T C+
Sbjct: 384 GNLNTFVAVSEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429

BLAST of Tan0000871 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 378.3 bits (970), Expect = 8.6e-105
Identity = 207/415 (49.88%), Positives = 275/415 (66.27%), Query Frame = 0

Query: 26  GFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSISRSATLLHRAAAVSTTGIQSPIIPD 85
           GFT  L HRDS  SP YNP  +   RL NA  RS++R   + H     +T   Q  +  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQPQIDLTSN 89

Query: 86  SGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPCHQCFNQSLPIFNPRQSSSYRRVSCT 145
           SGE+LM+VS+GTPP  ++AIADTGSDL WTQC PC  C+ Q  P+F+P+ SS+Y+ VSC+
Sbjct: 90  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 149

Query: 146 SDACRSLDD-SSCGADLRTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLRKTIIGC 205
           S  C +L++ +SC  +  TCSY  SYGD S+T G++A D +T+GS      +L+  IIGC
Sbjct: 150 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 209

Query: 206 GHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRFSYCLPPFFSDTNVTGKINFGR 265
           GHNN GTF    SGI+GLGGG +SL+ Q+    ++  +FSYCL P  S  + T KINFG 
Sbjct: 210 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGT 269

Query: 266 NAVVLGRKVVSTPLVSK-FPDTFYYLTLEAISVANKRIEAADDMSAAAEKGNIIIDSGTT 325
           NA+V G  VVSTPL++K   +TFYYLTL++ISV +K+I+ +   S ++E GNIIIDSGTT
Sbjct: 270 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSE-GNIIIDSGTT 329

Query: 326 LTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAGGEDDLDIPVITAHFAGGADVK 385
           LTLLP++ Y  +   + S + A++  DP   L LCY+A G  DL +PVIT HF  GADVK
Sbjct: 330 LTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKVPVITMHF-DGADVK 389

Query: 386 LLPLNTFAMVADNVTCLTLAPSSDLAIFGNLAQMNFLVGYDLERKRLSFKNTVCA 434
           L   N F  V++++ C     S   +I+GN+AQMNFLVGYD   K +SFK T CA
Sbjct: 390 LDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of Tan0000871 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 367.5 bits (942), Expect = 1.5e-101
Identity = 209/452 (46.24%), Positives = 290/452 (64.16%), Query Frame = 0

Query: 1   MAAISIFFYILLFSFTEATTNGGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSI 60
           MA   +  + L FS T  +++G    F+  L HRDS LSP+YNP ++  +RL  AF RS+
Sbjct: 1   MATQILLCFFLFFSVT-LSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSV 60

Query: 61  SRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPC 120
           SRS    H+   +S T +QS +I   GEF MS+++GTPP+ V AIADTGSDLTW QC PC
Sbjct: 61  SRSRRFNHQ---LSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC 120

Query: 121 HQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGAD--LRTCSYGYSYGDRSFTYG 180
            QC+ ++ PIF+ ++SS+Y+   C S  C++L  +  G D     C Y YSYGD+SF+ G
Sbjct: 121 QQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKG 180

Query: 181 DLASDKITIGS-----FKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAA 240
           D+A++ ++I S          T+ GCG+NNGGTF    SGIIGLGGG LSL+SQ+   ++
Sbjct: 181 DVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SS 240

Query: 241 VQRRFSYCLPPFFSDTNVTGKINFGRNAVVLGRK----VVSTPLVSKFPDTFYYLTLEAI 300
           + ++FSYCL    + TN T  IN G N++         VVSTPLV K P T+YYLTLEAI
Sbjct: 241 ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAI 300

Query: 301 SVANKRI-------EAADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTL-VSVVKAK 360
           SV  K+I          DD   +   GNIIIDSGTTLTLL +  +D   S +  SV  AK
Sbjct: 301 SVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAK 360

Query: 361 RVDDPTGILELCYAAGGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSS 420
           RV DP G+L  C+ +G   ++ +P IT HF  GADV+L P+N F  +++++ CL++ P++
Sbjct: 361 RVSDPQGLLSHCFKSGSA-EIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTT 420

Query: 421 DLAIFGNLAQMNFLVGYDLERKRLSFKNTVCA 434
           ++AI+GN AQM+FLVGYDLE + +SF++  C+
Sbjct: 421 EVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of Tan0000871 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 348.2 bits (892), Expect = 9.5e-96
Identity = 199/451 (44.12%), Positives = 276/451 (61.20%), Query Frame = 0

Query: 3   AISIFFYILLFS---FTEATTNGGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRS 62
           A   F Y  L +   F  + ++      T  L HRDS  SPLYNP  +  +RL  AF RS
Sbjct: 2   ATKTFLYCSLLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRS 61

Query: 63  ISRSATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLP 122
           ISRS          + T +QS +I + GE+ MS+S+GTPP  V AIADTGSDLTW QC P
Sbjct: 62  ISRSRRF------TTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKP 121

Query: 123 CHQCFNQSLPIFNPRQSSSYRRVSCTSDACRSLD--DSSCGADLRTCSYGYSYGDRSFTY 182
           C QC+ Q+ P+F+ ++SS+Y+  SC S  C++L   +  C      C Y YSYGD SFT 
Sbjct: 122 CQQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTK 181

Query: 183 GDLASDKITI-----GSFKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIA 242
           GD+A++ I+I      S     T+ GCG+NNGGTF    SGIIGLGGG LSLVSQ+   +
Sbjct: 182 GDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLG--S 241

Query: 243 AVQRRFSYCLPPFFSDTNVTGKINFGRNAVVLG----RKVVSTPLVSKFPDTFYYLTLEA 302
           ++ ++FSYCL    + TN T  IN G N++          ++TPL+ K P+T+Y+LTLEA
Sbjct: 242 SIGKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEA 301

Query: 303 ISVANKRIEAAD-----DMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTL-VSVVKAKR 362
           ++V   ++         +  ++   GNIIIDSGTTLTLL S  YD   + +  SV  AKR
Sbjct: 302 VTVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKR 361

Query: 363 VDDPTGILELCYAAGGEDDLDIPVITAHFAGGADVKLLPLNTFAMVADNVTCLTLAPSSD 422
           V DP G+L  C+ + G+ ++ +P IT HF   ADVKL P+N F  + ++  CL++ P+++
Sbjct: 362 VSDPQGLLTHCFKS-GDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTTE 421

Query: 423 LAIFGNLAQMNFLVGYDLERKRLSFKNTVCA 434
           +AI+GN+ QM+FLVGYDLE K +SF+   C+
Sbjct: 422 VAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442

BLAST of Tan0000871 vs. TAIR 10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 234.6 bits (597), Expect = 1.5e-61
Identity = 174/441 (39.46%), Positives = 228/441 (51.70%), Query Frame = 0

Query: 4   ISIFFYILLFSFTEATTNGGGNGFTTSLFHRDSVLSPLYNPSLSRYNRLTNAFRRSISRS 63
           I + F  +   F   TT    +GFT  L HR S  S          +R++N    S   +
Sbjct: 7   IIVLFLQISLCFLFTTTASPPHGFTMDLIHRRSNAS----------SRVSNTQSGSSPYA 66

Query: 64  ATLLHRAAAVSTTGIQSPIIPDSGEFLMSVSLGTPPVDVIAIADTGSDLTWTQCLPCHQC 123
            T+                  D+  +LM + +GTPP ++ AI DTGS++TWTQCLPC  C
Sbjct: 67  NTVF-----------------DNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHC 126

Query: 124 FNQSLPIFNPRQSSSYRRVSCTSDACRSLDDSSCGADLRTCSYGYSYGDRSFTYGDLASD 183
           + Q+ PIF+P +SS+++   C               D  +C Y   Y D ++T G LA++
Sbjct: 127 YEQNAPIFDPSKSSTFKEKRC---------------DGHSCPYEVDYFDHTYTMGTLATE 186

Query: 184 KITIGS-----FKLRKTIIGCGHNNGGTFGGVTSGIIGLGGGALSLVSQMRKIAAVQRRF 243
            IT+ S     F + +TIIGCGHNN   F    SG++GL  G  SL++QM          
Sbjct: 187 TITLHSTSGEPFVMPETIIGCGHNN-SWFKPSFSGMVGLNWGPSSLITQMG--GEYPGLM 246

Query: 244 SYCLPPFFSDTNVTGKINFGRNAVVLGRKVVSTPL-VSKFPDTFYYLTLEAISVANKRIE 303
           SYC    FS    T KINFG NA+V G  VVST + ++     FYYL L+A+SV N RIE
Sbjct: 247 SYC----FSGQG-TSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIE 306

Query: 304 AADDMSAAAEKGNIIIDSGTTLTLLPSKLYDGVVSTLVSVVKAKRVDDPTGILELCYAAG 363
                  A E GNI+IDSGTTLT  P    + V   +  VV A R  DPTG   LCY   
Sbjct: 307 TMGTTFHALE-GNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCY--- 366

Query: 364 GEDDLDI-PVITAHFAGGADVKLLPLNTFAMVADN--VTCLTLAPSSDL--AIFGNLAQM 423
             D +DI PVIT HF+GG D+ L   N + M ++N  V CL +  +S    AIFGN AQ 
Sbjct: 367 NSDTIDIFPVITMHFSGGVDLVLDKYNMY-MESNNGGVFCLAIICNSPTQEAIFGNRAQN 392

Query: 424 NFLVGYDLERKRLSFKNTVCA 434
           NFLVGYD     +SF  T C+
Sbjct: 427 NFLVGYDSSSLLVSFSPTNCS 392

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q6XBF8 | 1.2e-103 | 49.88 | Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
Q3EBM5 | 2.1e-100 | 46.24 | Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561...
Q766C3 | 2.2e-60 | 36.47 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2 | 3.6e-55 | 36.04 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LS40 | 1.8e-54 | 35.52 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...

Match Name | E-value | Identity (%) | Description
XP_038889220.1 | 1.4e-197 | 81.99 | aspartic proteinase CDR1-like [Benincasa hispida]
XP_023543528.1 | 8.1e-190 | 77.57 | aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]
XP_022942027.1 | 1.5e-188 | 77.80 | aspartic proteinase CDR1-like [Cucurbita moschata]
KAG6600420.1 | 2.0e-188 | 77.57 | putative aspartic protease, partial [Cucurbita argyrosperma subsp. sororia]
KAA0044968.1 | 3.9e-184 | 76.27 | putative aspartic protease [Cucumis melo var. makuwa]

Match Name | E-value | Identity (%) | Description
A0A6J1FP39 | 7.4e-189 | 77.80 | aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111447216 PE=3...
A0A5A7TPZ5 | 1.9e-184 | 76.27 | Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff...
A0A5D3D1Z7 | 4.2e-184 | 76.04 | Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff...
A0A1S3BT75 | 4.2e-184 | 76.04 | probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=...
A0A0A0KZZ3 | 3.0e-182 | 75.81 | Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G05541...

Match Name | E-value | Identity (%) | Description
AT1G64830.1 | 3.5e-114 | 53.14 | Eukaryotic aspartyl protease family protein
AT5G33340.1 | 8.6e-105 | 49.88 | Eukaryotic aspartyl protease family protein
AT2G35615.1 | 1.5e-101 | 46.24 | Eukaryotic aspartyl protease family protein
AT1G31450.1 | 9.5e-96 | 44.12 | Eukaryotic aspartyl protease family protein
AT2G28010.1 | 1.5e-61 | 39.46 | Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 89..258; e-value: 2.2E-51; score: 174.6
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 70..257; e-value: 3.4E-50; score: 172.7
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 258..433; e-value: 1.2E-45; score: 157.4
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 83..432
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 281..428; e-value: 5.7E-28; score: 97.7
None | No IPR available | PANTHER | PTHR47967:SF66 | ASPARTIC PROTEINASE CDR1-RELATED | coord: 7..432
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 7..432
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 311..322
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 89..428; score: 43.419895
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 88..432; e-value: 1.10595E-75; score: 235.235
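
To make the InterPro coordinates easier to compare, the sketch below computes the length of each hit and the fraction of the protein it covers. It rests on two assumptions that the page does not state explicitly: that `coord` ranges are 1-based and inclusive, and that the protein is 433 residues long, inferred from the 1302-nt gene span (LG01: 10226584..10227885) read as a single uninterrupted coding sequence of 434 codons including the stop. The names `HITS`, `span_length` and `PROTEIN_LENGTH` are illustrative only.

```python
# (source, source term, start, end) copied from the InterPro rows above.
HITS = [
    ("PFAM", "PF14543", 89, 258),
    ("GENE3D", "2.40.70.10", 70, 257),
    ("GENE3D", "2.40.70.10", 258, 433),
    ("SUPERFAMILY", "50630", 83, 432),
    ("PFAM", "PF14541", 281, 428),
    ("PANTHER", "PTHR47967:SF66", 7, 432),
    ("PANTHER", "PTHR47967", 7, 432),
    ("PROSITE", "PS00141", 311, 322),
    ("PROSITE", "PS51767", 89, 428),
    ("CDD", "cd05476", 88, 432),
]

# Assumption: single-exon CDS, so codons = gene length / 3, minus one stop codon.
GENE_START, GENE_END = 10226584, 10227885
PROTEIN_LENGTH = (GENE_END - GENE_START + 1) // 3 - 1


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


for source, term, start, end in HITS:
    n = span_length(start, end)
    coverage = 100 * n / PROTEIN_LENGTH
    print(f"{source:<12}{term:<16}{start:>4}..{end:<4}{n:>5} aa {coverage:5.1f}%")
```

Under these assumptions the PROSITE active-site hit spans 12 residues, and the two GENE3D segments (70..257 and 258..433) are contiguous and end at the last residue.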

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0000871.1 | Tan0000871.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity