Tan0017014 (gene) Snake gourd v1

Overview
Name: Tan0017014
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: aspartic proteinase CDR1-like
Location: LG06: 72383174 .. 72384613 (+)
RNA-Seq Expression: Tan0017014
Synteny: Tan0017014
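
The Location field packs the linkage group, start, end, and strand into one string. As a minimal sketch, the snippet below splits that string and derives the genomic span. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated in this record; `Location` and `parse_location` are hypothetical helper names.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Location:
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "LG06: 72383174 .. 72384613 (+)".
_LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("LG06: 72383174 .. 72384613 (+)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the span works out to 1440 bp on the plus strand of LG06.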
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCACTACTTTTCTCTCTAATTTTCTTATTCTCCGCGGCCGTCTCAGCCGCCACCAGCGGTGGCTATGGCTTCACCGTCGAACTCATGCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCGGAGACCCACTACCAACGCCTCGCCAACACCCTCCGCCGATCCATCCGCCGTAACAAGGCGGCGGCGCTGGCAGACACTGCGGCGGCGCCAATGTACAACAACAGAGGAGAATATCTCATGAAAATCTCCCTCGGAACGCCGCCGTTTCCAATTCTAGCCATTGCTGATACAGGAAGCGACGTCGTTTGGACCCAATGCCAACCATGCCCAAATTGCTACCAGCAAAACGCGCCGATGTTTAACCCGAGTGAATCGTCGACTTACAAGAAAGTGCCGTGTTCCTCGCCGATTTGCTCGTATGCAGGAGAGGAACGTTCTTGCTCTGATCGGTCTGAGTGTTTGTACTCGATTACTTACGGCGATAGGTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGTTACTATGGGGTCCACCTCCGGTCGCCCCGTGGCTTTTCCTCGTACTGTGATTGGTTGTGGCCATGACAATGCTGGTACTTTCGATGCTAATGTTTCCGGCATTGTTGGTCTAGGTCAAGGTCCGGCCTCGCTCGTCCCACAAATGGGTCCTGCCTCGGGTGGAAAATTTTCTTATTGTTTGGCTCCAATTGGTGCTGGTCTCGAGTCGAGCAAACTTAACTTTGGTTCTAATGCTGATGTATCTGGCTCTGAAGCTGTCTCAACCCCAATTTATACTAGTGGTAATTAATTAAATTATCACTCAACTCAAAAGTTTTTTATAAATTTTTTATTTTTTTCTCTTTGACTACAATTATATATGTGGATGGTGGGGTTGGGATTTAATATTTCAATTTAATATTAATGTAATTTTTTGTTCATATTTTTCACAGATAGATTCAATAGCTTCTACTCACTCAACCTAGAAGCCATCAGCGTAGGGGAGGACAAGTTCGATCTTCCGACCTCTTCACCATTGGGCGATGGACCAAACATCATCATTGACTCTGGCACCACGCTTACGCTCCTTCCACGGAACGTCTACACCGATGTTGCCACGGCGATTTCTAACTCGACCAACCTCCAACGCACCGACGACCCGAACCAATTCTTGGAGTACTGCTTCGAGACCACGACCGACGACTTCGAAGCACCGGCAATCACCGTGCACTTCGAAGGCGCCGACGTGCCCCTGCACCGAGAAAACGTGTTCATTAGGGTGGCGGATAATGTCGTCTGCTTGGCGTTGGCCGCCGGCCAGGACGATGGCATTTTCATCTACGGCAACATTGCTCAGAACAACTTCTTGGTTGGTTATGATATTAAGAACAAGTTGGTTTCCTTCAAGCCGGCCGATTGCGCTGCCATGTGA

mRNA sequence

ATGGCACTACTTTTCTCTCTAATTTTCTTATTCTCCGCGGCCGTCTCAGCCGCCACCAGCGGTGGCTATGGCTTCACCGTCGAACTCATGCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCGGAGACCCACTACCAACGCCTCGCCAACACCCTCCGCCGATCCATCCGCCGTAACAAGGCGGCGGCGCTGGCAGACACTGCGGCGGCGCCAATGTACAACAACAGAGGAGAATATCTCATGAAAATCTCCCTCGGAACGCCGCCGTTTCCAATTCTAGCCATTGCTGATACAGGAAGCGACGTCGTTTGGACCCAATGCCAACCATGCCCAAATTGCTACCAGCAAAACGCGCCGATGTTTAACCCGAGTGAATCGTCGACTTACAAGAAAGTGCCGTGTTCCTCGCCGATTTGCTCGTATGCAGGAGAGGAACGTTCTTGCTCTGATCGGTCTGAGTGTTTGTACTCGATTACTTACGGCGATAGGTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGTTACTATGGGGTCCACCTCCGGTCGCCCCGTGGCTTTTCCTCGTACTGTGATTGGTTGTGGCCATGACAATGCTGGTACTTTCGATGCTAATGTTTCCGGCATTGTTGGTCTAGGTCAAGGTCCGGCCTCGCTCGTCCCACAAATGGGTCCTGCCTCGGGTGGAAAATTTTCTTATTGTTTGGCTCCAATTGGTGCTGGTCTCGAGTCGAGCAAACTTAACTTTGGTTCTAATGCTGATGTATCTGGCTCTGAAGCTGTCTCAACCCCAATTTATACTAGTGATAGATTCAATAGCTTCTACTCACTCAACCTAGAAGCCATCAGCGTAGGGGAGGACAAGTTCGATCTTCCGACCTCTTCACCATTGGGCGATGGACCAAACATCATCATTGACTCTGGCACCACGCTTACGCTCCTTCCACGGAACGTCTACACCGATGTTGCCACGGCGATTTCTAACTCGACCAACCTCCAACGCACCGACGACCCGAACCAATTCTTGGAGTACTGCTTCGAGACCACGACCGACGACTTCGAAGCACCGGCAATCACCGTGCACTTCGAAGGCGCCGACGTGCCCCTGCACCGAGAAAACGTGTTCATTAGGGTGGCGGATAATGTCGTCTGCTTGGCGTTGGCCGCCGGCCAGGACGATGGCATTTTCATCTACGGCAACATTGCTCAGAACAACTTCTTGGTTGGTTATGATATTAAGAACAAGTTGGTTTCCTTCAAGCCGGCCGATTGCGCTGCCATGTGA
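
The gene sequence differs from the mRNA sequence only by the intron it carries. The sketch below shows one simple way to locate a single intron by comparing the two strings. `find_single_intron` is a hypothetical helper, not part of any database tool. It assumes exactly one intron and no other differences, and the reported boundary can shift by a few bases when the first bases of the intron repeat those of the downstream exon, so a splice-aware aligner is the better choice for real annotation work.

```python
def find_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return 0-based half-open (start, end) of the one intron in `gene`.

    Assumes `mrna` equals `gene` with a single contiguous segment removed.
    """
    if len(gene) < len(mrna):
        raise ValueError("gene sequence is shorter than the mRNA sequence")

    # Length of the shared prefix, i.e. the upstream exon.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Whatever remains of the mRNA must be the downstream exon.
    suffix = len(mrna) - prefix
    intron_end = len(gene) - suffix
    if gene[intron_end:] != mrna[prefix:]:
        raise ValueError("sequences differ by more than one inserted segment")
    return prefix, intron_end


# Toy sequences for illustration only; they are not taken from this record.
toy_gene = "ATGGCC" + "GTAAGTTTCAG" + "ATCTAA"
toy_mrna = "ATGGCC" + "ATCTAA"
start, end = find_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

Passing the gene and mRNA strings from this record to the same function would return the intron coordinates relative to the start of the gene sequence.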

Coding sequence (CDS)

ATGGCACTACTTTTCTCTCTAATTTTCTTATTCTCCGCGGCCGTCTCAGCCGCCACCAGCGGTGGCTATGGCTTCACCGTCGAACTCATGCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCGGAGACCCACTACCAACGCCTCGCCAACACCCTCCGCCGATCCATCCGCCGTAACAAGGCGGCGGCGCTGGCAGACACTGCGGCGGCGCCAATGTACAACAACAGAGGAGAATATCTCATGAAAATCTCCCTCGGAACGCCGCCGTTTCCAATTCTAGCCATTGCTGATACAGGAAGCGACGTCGTTTGGACCCAATGCCAACCATGCCCAAATTGCTACCAGCAAAACGCGCCGATGTTTAACCCGAGTGAATCGTCGACTTACAAGAAAGTGCCGTGTTCCTCGCCGATTTGCTCGTATGCAGGAGAGGAACGTTCTTGCTCTGATCGGTCTGAGTGTTTGTACTCGATTACTTACGGCGATAGGTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGTTACTATGGGGTCCACCTCCGGTCGCCCCGTGGCTTTTCCTCGTACTGTGATTGGTTGTGGCCATGACAATGCTGGTACTTTCGATGCTAATGTTTCCGGCATTGTTGGTCTAGGTCAAGGTCCGGCCTCGCTCGTCCCACAAATGGGTCCTGCCTCGGGTGGAAAATTTTCTTATTGTTTGGCTCCAATTGGTGCTGGTCTCGAGTCGAGCAAACTTAACTTTGGTTCTAATGCTGATGTATCTGGCTCTGAAGCTGTCTCAACCCCAATTTATACTAGTGATAGATTCAATAGCTTCTACTCACTCAACCTAGAAGCCATCAGCGTAGGGGAGGACAAGTTCGATCTTCCGACCTCTTCACCATTGGGCGATGGACCAAACATCATCATTGACTCTGGCACCACGCTTACGCTCCTTCCACGGAACGTCTACACCGATGTTGCCACGGCGATTTCTAACTCGACCAACCTCCAACGCACCGACGACCCGAACCAATTCTTGGAGTACTGCTTCGAGACCACGACCGACGACTTCGAAGCACCGGCAATCACCGTGCACTTCGAAGGCGCCGACGTGCCCCTGCACCGAGAAAACGTGTTCATTAGGGTGGCGGATAATGTCGTCTGCTTGGCGTTGGCCGCCGGCCAGGACGATGGCATTTTCATCTACGGCAACATTGCTCAGAACAACTTCTTGGTTGGTTATGATATTAAGAACAAGTTGGTTTCCTTCAAGCCGGCCGATTGCGCTGCCATGTGA

Protein sequence

MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRRNKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAPMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIGAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVSFKPADCAAM
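
The protein sequence is the conceptual translation of the CDS. As a small illustration, the sketch below translates a DNA string with the standard genetic code and stops at the first stop codon. `translate` is a hypothetical helper written for this example, and it assumes an uppercase sequence that starts in frame.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five and last three codons of the CDS listed in this record.
print(translate("ATGGCACTACTTTTC"))
print(translate("GCCATGTGA"))
```

The two fragments translate to `MALLF` and `AM`, which agree with the start and end of the protein sequence shown here.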
Homology
BLAST of Tan0017014 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 402.9 bits (1034), Expect = 4.5e-111
Identity = 214/434 (49.31%), Positives = 286/434 (65.90%), Query Frame = 0

Query: 2   ALLFSLIFLFSAAVSAATS-GGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 61
           ++L SL  L S  +S A +    GFT +L+HRDSPKSP YNP ET  QRL N + RS+ R
Sbjct: 7   SVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR 66

Query: 62  NKAAALADTAAAP---MYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQ 121
                  D    P   + +N GEYLM +S+GTPPFPI+AIADTGSD++WTQC PC +CY 
Sbjct: 67  VFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYT 126

Query: 122 QNAPMFNPSESSTYKKVPCSSPICSYAGEERSCS-DRSECLYSITYGDRSHSQGDLAVDT 181
           Q  P+F+P  SSTYK V CSS  C+    + SCS + + C YS++YGD S+++G++AVDT
Sbjct: 127 QVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 186

Query: 182 VTMGSTSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYC 241
           +T+GS+  RP+     +IGCGH+NAGTF+   SGIVGLG GP SL+ Q+G +  GKFSYC
Sbjct: 187 LTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYC 246

Query: 242 LAPIGAGL-ESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPT 301
           L P+ +   ++SK+NFG+NA VSGS  VSTP+       +FY L L++ISVG  +     
Sbjct: 247 LVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSG 306

Query: 302 SSPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDF 361
           S       NIIIDSGTTLTLLP   Y+++  A+++S + ++  DP   L  C+ + T D 
Sbjct: 307 SDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDL 366

Query: 362 EAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIK 421
           + P IT+HF+GADV L   N F++V++++VC A          IYGN+AQ NFLVGYD  
Sbjct: 367 KVPVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRG--SPSFSIYGNVAQMNFLVGYDTV 426

Query: 422 NKLVSFKPADCAAM 430
           +K VSFKP DCA M
Sbjct: 427 SKTVSFKPTDCAKM 437
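
Each hit's summary line reports identical and positive-scoring positions as a fraction of the alignment length, followed by a percentage. The following sketch recomputes those percentages from the fractions so they can be checked or reused downstream. `recompute_percentages` is a hypothetical helper, and the regular expression is an assumption based on the layout of the summary lines in this report.

```python
import re

# Captures "<label> = <count>/<alignment length>" pairs.
_FRACTION_RE = re.compile(r"(\w+) = (\d+)/(\d+)")


def recompute_percentages(summary: str) -> dict[str, float]:
    """Map each labelled fraction in a summary line to a percentage."""
    return {
        label: round(100 * int(count) / int(total), 2)
        for label, count, total in _FRACTION_RE.findall(summary)
    }


summary = "Identity = 214/434 (49.31%), Positives = 286/434 (65.90%), Query Frame = 0"
percentages = recompute_percentages(summary)
print(percentages)
```

Fields without a fraction, such as the query frame, are ignored by the pattern.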

BLAST of Tan0017014 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 4.9e-89
Identity = 191/439 (43.51%), Positives = 270/439 (61.50%), Query Frame = 0

Query: 9   FLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRRNKA--AAL 68
           FLF +   +++     F+VEL+HRDSP SP+YNP  T   RL     RS+ R++     L
Sbjct: 10  FLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQL 69

Query: 69  ADT-AAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAPMFNP 128
           + T   + +    GE+ M I++GTPP  + AIADTGSD+ W QC+PC  CY++N P+F+ 
Sbjct: 70  SQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDK 129

Query: 129 SESSTYKKVPCSSPIC-SYAGEERSCSDRSE-CLYSITYGDRSHSQGDLAVDTVTMGSTS 188
            +SSTYK  PC S  C + +  ER C + +  C Y  +YGD+S S+GD+A +TV++ S S
Sbjct: 130 KKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSAS 189

Query: 189 GRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIGAG 248
           G PV+FP TV GCG++N GTFD   SGI+GLG G  SL+ Q+G +   KFSYCL+   A 
Sbjct: 190 GSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSAT 249

Query: 249 LE-SSKLNFGSNADVSG----SEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSS- 308
              +S +N G+N+  S     S  VSTP+   +   ++Y L LEAISVG+ K     SS 
Sbjct: 250 TNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAISVGKKKIPYTGSSY 309

Query: 309 -PLGDG------PNIIIDSGTTLTLLPRNVYTDVATAISNS-TNLQRTDDPNQFLEYCFE 368
            P  DG       NIIIDSGTTLTLL    +   ++A+  S T  +R  DP   L +CF+
Sbjct: 310 NPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFK 369

Query: 369 TTTDDFEAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFL 428
           + + +   P ITVHF GADV L   N F+++++++VCL++    +  + IYGN AQ +FL
Sbjct: 370 SGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTE--VAIYGNFAQMDFL 429

BLAST of Tan0017014 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 5.2e-67
Identity = 163/446 (36.55%), Positives = 238/446 (53.36%), Query Frame = 0

Query: 3   LLFSLIFLFSAAVSAATSGGY---------GFTVELMHRDSPKSPMYNPSETHYQRLANT 62
           L  S++++F A   + +             GF + L H DS K      + T +Q L   
Sbjct: 10  LALSIVYIFVAPTHSTSRTALNHRHEAKVTGFQIMLEHVDSGK------NLTKFQLLERA 69

Query: 63  LRRSIRR-NKAAALADTAA---APMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQC 122
           + R  RR  +  A+ +  +     +Y   GEYLM +S+GTP  P  AI DTGSD++WTQC
Sbjct: 70  IERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQC 129

Query: 123 QPCPNCYQQNAPMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQ 182
           QPC  C+ Q+ P+FNP  SS++  +PCSS +C  A    +CS+ + C Y+  YGD S +Q
Sbjct: 130 QPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQ-ALSSPTCSN-NFCQYTYGYGDGSETQ 189

Query: 183 GDLAVDTVTMGSTSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPAS 242
           G +  +T+T GS     V+ P    GCG +N G    N +G+VG+G+GP SL  Q+    
Sbjct: 190 GSMGTETLTFGS-----VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT- 249

Query: 243 GGKFSYCLAPIGAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGED 302
             KFSYC+ PIG+   S+ L  GS A+   + + +T +  S +  +FY + L  +SVG  
Sbjct: 250 --KFSYCMTPIGSSTPSNLL-LGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGST 309

Query: 303 KFDLPTS-----SPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFL 362
           +  +  S     S  G G  IIIDSGTTLT    N Y  V     +  NL   +  +   
Sbjct: 310 RLPIDPSAFALNSNNGTG-GIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGF 369

Query: 363 EYCFETTTD--DFEAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGN 422
           + CF+T +D  + + P   +HF+G D+ L  EN FI  ++ ++CLA+ +    G+ I+GN
Sbjct: 370 DLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPSNGLICLAMGS-SSQGMSIFGN 429

Query: 423 IAQNNFLVGYDIKNKLVSFKPADCAA 429
           I Q N LV YD  N +VSF  A C A
Sbjct: 430 IQQQNMLVVYDTGNSVVSFASAQCGA 436

BLAST of Tan0017014 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 2.6e-66
Identity = 156/414 (37.68%), Positives = 231/414 (55.80%), Query Frame = 0

Query: 24  GFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRRNKA--AALADTAA--APMYNNRG 83
           G  V+L   DS K      + T Y+ +   ++R  RR ++  A L  ++    P+Y   G
Sbjct: 41  GLRVDLEQVDSGK------NLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDG 100

Query: 84  EYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAPMFNPSESSTYKKVPCSSP 143
           EYLM +++GTP     AI DTGSD++WTQC+PC  C+ Q  P+FNP +SS++  +PC S 
Sbjct: 101 EYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQ 160

Query: 144 ICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVAFPRTVIGCGHD 203
            C     E +C++ +EC Y+  YGD S +QG +A +T T  ++S   +AF     GCG D
Sbjct: 161 YCQDLPSE-TCNN-NECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAF-----GCGED 220

Query: 204 NAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIGAGLESSKLNFGSNADVSG 263
           N G    N +G++G+G GP SL  Q+G    G+FSYC+   G+    S L  GS A    
Sbjct: 221 NQGFGQGNGAGLIGMGWGPLSLPSQLGV---GQFSYCMTSYGSS-SPSTLALGSAASGVP 280

Query: 264 SEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSS--PLGDGP-NIIIDSGTTLTLL 323
             + ST +  S    ++Y + L+ I+VG D   +P+S+     DG   +IIDSGTTLT L
Sbjct: 281 EGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 340

Query: 324 PRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTD--DFEAPAITVHFEGADVPLHRE 383
           P++ Y  VA A ++  NL   D+ +  L  CF+  +D    + P I++ F+G  + L  +
Sbjct: 341 PQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQ 400

Query: 384 NVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVSFKPADCAA 429
           N+ I  A+ V+CLA+ +    GI I+GNI Q    V YD++N  VSF P  C A
Sbjct: 401 NILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCGA 437

BLAST of Tan0017014 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 7.8e-55
Identity = 136/373 (36.46%), Positives = 196/373 (52.55%), Query Frame = 0

Query: 62  KAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAP 121
           +    + +  + +    GEY  ++ +GTP   +  + DTGSD+VW QC PC  CY Q+ P
Sbjct: 123 RPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP 182

Query: 122 MFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMGS 181
           +F+P +S TY  +PCSSP C         + R  CLY ++YGD S + GD + +T+T   
Sbjct: 183 IFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRR 242

Query: 182 TSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIG 241
              + VA     +GCGHDN G F    +G++GLG+G  S   Q G     KFSYCL    
Sbjct: 243 NRVKGVA-----LGCGHDNEGLF-VGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS 302

Query: 242 AGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSS---- 301
           A  + S + FG NA VS   A  TP+ ++ + ++FY + L  ISVG  +    T+S    
Sbjct: 303 ASSKPSSVVFG-NAAVS-RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKL 362

Query: 302 -PLGDGPNIIIDSGTTLTLLPRNVYTDVATAIS-NSTNLQRTDDPNQFLEYCFE-TTTDD 361
             +G+G  +IIDSGT++T L R  Y  +  A    +  L+R  D + F + CF+ +  ++
Sbjct: 363 DQIGNG-GVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLF-DTCFDLSNMNE 422

Query: 362 FEAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDI 421
            + P + +HF GADV L   N  I V  N       AG   G+ I GNI Q  F V YD+
Sbjct: 423 VKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDL 482

Query: 422 KNKLVSFKPADCA 428
            +  V F P  CA
Sbjct: 483 ASSRVGFAPGGCA 485
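
The Swiss-Prot match lines above carry UniProt-style header fields: `OS` (organism name), `OX` (taxonomy identifier), `GN` (gene name), `PE` (protein existence level), and `SV` (sequence version). As a rough sketch, the snippet below splits such a line into a dictionary. `parse_match_line` is a hypothetical helper, and the `Match: ACCESSION (description)` layout is assumed from the lines in this report rather than from any formal specification.

```python
import re

# "Match: <accession> (<description>)" as printed in this report.
_MATCH_RE = re.compile(r"^Match:\s*(\S+)\s*\((.*)\)\s*$")
# Split the description just before each UniProt field tag.
_FIELD_SPLIT_RE = re.compile(r"\s+(?=(?:OS|OX|GN|PE|SV)=)")


def parse_match_line(line: str) -> dict[str, str]:
    """Split a match line into accession, protein name, and tagged fields."""
    match = _MATCH_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised match line: {line!r}")
    accession, description = match.groups()
    name, *fields = _FIELD_SPLIT_RE.split(description)
    record = {"accession": accession, "name": name}
    for field in fields:
        key, _, value = field.partition("=")
        record[key] = value
    return record


hit = parse_match_line(
    "Match: Q9LNJ3 (Aspartyl protease family protein 2 "
    "OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)"
)
print(hit["accession"], hit["OS"], hit["GN"])
```

Match lines without tagged fields, such as the NCBI nr hits, would yield only the accession and the full description as the name.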

BLAST of Tan0017014 vs. NCBI nr
Match: XP_022964067.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 667.9 bits (1722), Expect = 5.7e-188
Identity = 325/429 (75.76%), Positives = 369/429 (86.01%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 60
           MAL+FSLIFL S+AV AA +G YGF+VE++HRDSPKSPMYNPSETHY RLANTLRRSI  
Sbjct: 1   MALIFSLIFLISSAVFAAVNGEYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSILL 60

Query: 61  NKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNA 120
           NKA AL DTA APM+N+RGEYL+++SLGTPPFPILAIADTGSD+VWTQCQPCP CY+Q A
Sbjct: 61  NKAVALLDTAEAPMFNDRGEYLVEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQTA 120

Query: 121 PMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMG 180
           PMF+PS+SSTYK +PCSSP C+ AG+ERSCSDRS C YSI+YGD SHS GD AVDTVTMG
Sbjct: 121 PMFDPSKSSTYKIIPCSSPSCALAGQERSCSDRSGCQYSISYGDGSHSNGDFAVDTVTMG 180

Query: 181 STSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPI 240
           STSGRPVAFPRTV+GCGHD+AGTF  NVSGIVGLG+GPASLVPQMG ASGGKFSYCL PI
Sbjct: 181 STSGRPVAFPRTVVGCGHDSAGTFSTNVSGIVGLGRGPASLVPQMGAASGGKFSYCLTPI 240

Query: 241 GAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLG 300
           G   ESSKLNFGSNA V+GS  VSTPI TSDRFNSFYSLN+EA+SVG  +F+ P +S LG
Sbjct: 241 GDSAESSKLNFGSNAQVAGSGTVSTPIKTSDRFNSFYSLNIEAMSVGGKRFEFPAASALG 300

Query: 301 DGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAI 360
           DG N+IIDSGTTLT+LP   Y+  ATAIS+S +L+RT+DPNQFL++CF+TT  DFE P++
Sbjct: 301 DGANVIIDSGTTLTILPTEFYSTFATAISDSISLERTEDPNQFLDFCFKTTNLDFEVPSV 360

Query: 361 TVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 420
           TVHFEGADVPL RENVF+ VA+NVVCLA   G    I IYGNIAQNNFLVGYD+    VS
Sbjct: 361 TVHFEGADVPLRRENVFVMVAENVVCLAFRGGDGQSISIYGNIAQNNFLVGYDVTRNSVS 420

Query: 421 FKPADCAAM 430
           FKPADC+AM
Sbjct: 421 FKPADCSAM 429

BLAST of Tan0017014 vs. NCBI nr
Match: XP_023514471.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 664.8 bits (1714), Expect = 4.9e-187
Identity = 323/429 (75.29%), Positives = 367/429 (85.55%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 60
           MAL+F LIFL S+AV AA SG YGF+VE++HRDSPKSPMYNPSETHY RLANTLRRSI  
Sbjct: 1   MALIFCLIFLISSAVFAAASGEYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSILL 60

Query: 61  NKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNA 120
           NKA AL DTA APM+N+RGEYL+++SLGTPPFPILAIADTGSD+VWTQCQPCP CY+Q A
Sbjct: 61  NKAVALLDTAEAPMFNDRGEYLVEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQTA 120

Query: 121 PMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMG 180
           PMF+PS+SSTYK +PCSSP C+ AG+ERSCSDRSEC YS++YGD SHS GD AVDTVTMG
Sbjct: 121 PMFDPSKSSTYKIIPCSSPSCTLAGQERSCSDRSECQYSVSYGDGSHSNGDFAVDTVTMG 180

Query: 181 STSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPI 240
           STSGRPVAFPRTV+GCGHD+ GTF  NVSGIVGLG+GPASLVPQMG ASGGKFSYCL PI
Sbjct: 181 STSGRPVAFPRTVVGCGHDSGGTFSTNVSGIVGLGRGPASLVPQMGAASGGKFSYCLTPI 240

Query: 241 GAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLG 300
           G   ESSKLNFGSNA V+GS  VSTPI TSDRFNSFYSLN+EA+SVG  +F+ P +S LG
Sbjct: 241 GDSAESSKLNFGSNAQVAGSGTVSTPIKTSDRFNSFYSLNIEAMSVGGKRFEFPAASALG 300

Query: 301 DGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAI 360
           DG N+IIDSGTTLT+LP   Y+  ATAIS+S +L+RT+DPNQFL++CF+TT  DFE P++
Sbjct: 301 DGANVIIDSGTTLTILPTEFYSTFATAISDSISLERTEDPNQFLDFCFKTTNLDFEVPSV 360

Query: 361 TVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 420
           TVHFEGADVPL RENVF+ VA+NVVCLA   G    I IYGNIAQNNFLVGYD+    VS
Sbjct: 361 TVHFEGADVPLRRENVFVMVAENVVCLAFRGGDGQSISIYGNIAQNNFLVGYDVTRNSVS 420

Query: 421 FKPADCAAM 430
           FK ADC+AM
Sbjct: 421 FKQADCSAM 429

BLAST of Tan0017014 vs. NCBI nr
Match: XP_038876324.1 (aspartic proteinase CDR1-like [Benincasa hispida])

HSP 1 Score: 660.6 bits (1703), Expect = 9.2e-186
Identity = 335/436 (76.83%), Positives = 378/436 (86.70%), Query Frame = 0

Query: 1   MALLFSL-IFLFSAA--VSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRS 60
           MA +FSL IFL S+A  ++AAT   +GFTVEL+HRDSPKSPMYNPSETHY RLAN LRRS
Sbjct: 1   MASVFSLIIFLISSAAVLAAATGREFGFTVELIHRDSPKSPMYNPSETHYHRLANALRRS 60

Query: 61  IRRNKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQ 120
           I RN  AA+ DTA AP+YN RG+YLMKISLGTPPF I+A+ADTGSDV+WTQC+PCPNCY+
Sbjct: 61  ISRN-TAAVTDTAVAPIYNYRGQYLMKISLGTPPFSIIAVADTGSDVIWTQCEPCPNCYE 120

Query: 121 QNAPMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTV 180
           Q+APMFNPS+S+TYK VPCSSPICSYAGE+ SCS  SECLYSI+YGDRSHSQGD AVDTV
Sbjct: 121 QSAPMFNPSKSTTYKNVPCSSPICSYAGEDSSCSAHSECLYSISYGDRSHSQGDFAVDTV 180

Query: 181 TMGSTSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCL 240
           TMGSTSG PV FP   IGCGHDNAGTFDA+VSGIVGLGQG ASLV QMGPA+GGKFSYCL
Sbjct: 181 TMGSTSGSPVTFPHMAIGCGHDNAGTFDASVSGIVGLGQGSASLVSQMGPATGGKFSYCL 240

Query: 241 APIG-AGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLP-T 300
           APIG +  ESSKLNFGSNADVSGSEAVSTPIYTS ++ +FYSL LEA+SVGE+KFD P  
Sbjct: 241 APIGNSSAESSKLNFGSNADVSGSEAVSTPIYTSVKYKTFYSLKLEAVSVGENKFDFPIV 300

Query: 301 SSPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDF 360
           SS LG   NIIIDSGTTLT LP ++Y + AT IS+S NLQRTDDPNQFL+YCF TTTDD+
Sbjct: 301 SSRLGGEGNIIIDSGTTLTFLPVDLYNNFATTISDSINLQRTDDPNQFLDYCFATTTDDY 360

Query: 361 EAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDG--IFIYGNIAQNNFLVGYD 420
           EAP++T+HFEGADVPL+RENVFIR++D++VCLA  A QDD   IFIYGNI+QNNFLVGYD
Sbjct: 361 EAPSVTMHFEGADVPLNRENVFIRISDDIVCLAFKASQDDQEMIFIYGNISQNNFLVGYD 420

Query: 421 IKNKLVSFKPADCAAM 430
           IKN +VSFK ADC AM
Sbjct: 421 IKNMVVSFKQADCVAM 435

BLAST of Tan0017014 vs. NCBI nr
Match: KAG6593735.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 660.6 bits (1703), Expect = 9.2e-186
Identity = 320/429 (74.59%), Positives = 366/429 (85.31%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 60
           MAL+FSLIFL S+AV AA SG YGF+VE++HRDSPKSPMYNPSETHY RLANTLRRSI  
Sbjct: 1   MALIFSLIFLISSAVFAAASGEYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSILL 60

Query: 61  NKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNA 120
           NKA  L DTA APM+N+RGEYL+++SLGTPPFPILAIADTGSD+VWTQCQPCP CY+Q A
Sbjct: 61  NKAVGLLDTAEAPMFNDRGEYLVEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQTA 120

Query: 121 PMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMG 180
           PMF+P++SSTYK +PCSSP C+ AG+ERSCSDRSEC YSI+YGD SHS GD AVDT+TMG
Sbjct: 121 PMFDPNKSSTYKIIPCSSPSCALAGQERSCSDRSECQYSISYGDGSHSNGDFAVDTLTMG 180

Query: 181 STSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPI 240
           STSGRPVAFPRTV+GCGHD+AGTF  NVSGIVGLG+GPASLVPQMG ASGGKFSYCL PI
Sbjct: 181 STSGRPVAFPRTVVGCGHDSAGTFSTNVSGIVGLGRGPASLVPQMGAASGGKFSYCLTPI 240

Query: 241 GAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLG 300
           G   +SSKLNFGSNA V GS  VSTPI TSDRFNSFYSLN+EA+SVG  +F+ P +S LG
Sbjct: 241 GDSAKSSKLNFGSNAQVVGSGTVSTPIKTSDRFNSFYSLNIEAMSVGGKRFEFPAASALG 300

Query: 301 DGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAI 360
           DG N+IIDSGTTLT+LP   Y+   TAIS+S +L+RT+DPNQFL++CF+TT  DFE P++
Sbjct: 301 DGANVIIDSGTTLTILPTEFYSTFVTAISDSISLERTEDPNQFLDFCFKTTNLDFEVPSV 360

Query: 361 TVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 420
           TVHFEGADVPL RENVF+ VA+NV CLA   G    I IYGNIAQNNFLVGYD+    VS
Sbjct: 361 TVHFEGADVPLRRENVFVMVAENVACLAFRGGDGQSISIYGNIAQNNFLVGYDVTRNSVS 420

Query: 421 FKPADCAAM 430
           FKPADC+A+
Sbjct: 421 FKPADCSAV 429

BLAST of Tan0017014 vs. NCBI nr
Match: XP_022964064.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 659.4 bits (1700), Expect = 2.0e-185
Identity = 320/429 (74.59%), Positives = 367/429 (85.55%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 60
           MAL+FSLIFL S+AV AA SG YGF+VE++HRDSPKSPMYNPSETHY RLANTLRRSI  
Sbjct: 1   MALIFSLIFLISSAVFAAASGEYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSILL 60

Query: 61  NKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNA 120
           N+A AL DTA APM+N+RGEYL+++SLGTPPFPILAIADTGSD+VWTQCQPCP CY+Q A
Sbjct: 61  NRAVALLDTAEAPMFNDRGEYLVEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQTA 120

Query: 121 PMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMG 180
           PMF+PS+SSTYK +PCSSP C+ AG+ERSCSDRSEC YSI+YGD SHS GD AVDTVTMG
Sbjct: 121 PMFDPSKSSTYKIIPCSSPSCALAGQERSCSDRSECQYSISYGDGSHSNGDFAVDTVTMG 180

Query: 181 STSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPI 240
           STSGRPVAFPRTV+GCGHD+AGTF  NVSGIVGLG+GPASLVPQMG ASG KFSYCL PI
Sbjct: 181 STSGRPVAFPRTVVGCGHDSAGTFSTNVSGIVGLGRGPASLVPQMGAASGNKFSYCLTPI 240

Query: 241 GAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLG 300
           G   ESSKLNFGSNA V+GS  VSTPI TSDRF+S+YSLN+EA+SVG  +F+ P +S LG
Sbjct: 241 GDSAESSKLNFGSNAQVAGSGTVSTPIKTSDRFDSYYSLNIEAMSVGGKRFEFPAASALG 300

Query: 301 DGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAI 360
           DG N+IIDSGTTLT+LP   Y+  ATAIS S +L+RT+DPNQFL++CF+TT  DFE P++
Sbjct: 301 DGSNVIIDSGTTLTILPTEFYSTFATAISESISLERTEDPNQFLDFCFKTTNLDFEVPSV 360

Query: 361 TVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 420
           TVHFEGADVPL RENVF+ VA+NVVCLA   G    I IYGNIAQ NF+VGYD+    VS
Sbjct: 361 TVHFEGADVPLRRENVFVMVAENVVCLAFRGGDGQSISIYGNIAQINFIVGYDVTRNFVS 420

Query: 421 FKPADCAAM 430
           FKPA+C+AM
Sbjct: 421 FKPANCSAM 429

BLAST of Tan0017014 vs. ExPASy TrEMBL
Match: A0A6J1HGT9 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464204 PE=3 SV=1)

HSP 1 Score: 667.9 bits (1722), Expect = 2.8e-188
Identity = 325/429 (75.76%), Positives = 369/429 (86.01%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 60
           MAL+FSLIFL S+AV AA +G YGF+VE++HRDSPKSPMYNPSETHY RLANTLRRSI  
Sbjct: 1   MALIFSLIFLISSAVFAAVNGEYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSILL 60

Query: 61  NKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNA 120
           NKA AL DTA APM+N+RGEYL+++SLGTPPFPILAIADTGSD+VWTQCQPCP CY+Q A
Sbjct: 61  NKAVALLDTAEAPMFNDRGEYLVEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQTA 120

Query: 121 PMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMG 180
           PMF+PS+SSTYK +PCSSP C+ AG+ERSCSDRS C YSI+YGD SHS GD AVDTVTMG
Sbjct: 121 PMFDPSKSSTYKIIPCSSPSCALAGQERSCSDRSGCQYSISYGDGSHSNGDFAVDTVTMG 180

Query: 181 STSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPI 240
           STSGRPVAFPRTV+GCGHD+AGTF  NVSGIVGLG+GPASLVPQMG ASGGKFSYCL PI
Sbjct: 181 STSGRPVAFPRTVVGCGHDSAGTFSTNVSGIVGLGRGPASLVPQMGAASGGKFSYCLTPI 240

Query: 241 GAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLG 300
           G   ESSKLNFGSNA V+GS  VSTPI TSDRFNSFYSLN+EA+SVG  +F+ P +S LG
Sbjct: 241 GDSAESSKLNFGSNAQVAGSGTVSTPIKTSDRFNSFYSLNIEAMSVGGKRFEFPAASALG 300

Query: 301 DGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAI 360
           DG N+IIDSGTTLT+LP   Y+  ATAIS+S +L+RT+DPNQFL++CF+TT  DFE P++
Sbjct: 301 DGANVIIDSGTTLTILPTEFYSTFATAISDSISLERTEDPNQFLDFCFKTTNLDFEVPSV 360

Query: 361 TVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 420
           TVHFEGADVPL RENVF+ VA+NVVCLA   G    I IYGNIAQNNFLVGYD+    VS
Sbjct: 361 TVHFEGADVPLRRENVFVMVAENVVCLAFRGGDGQSISIYGNIAQNNFLVGYDVTRNSVS 420

Query: 421 FKPADCAAM 430
           FKPADC+AM
Sbjct: 421 FKPADCSAM 429

BLAST of Tan0017014 vs. ExPASy TrEMBL
Match: A0A6J1HM14 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464202 PE=3 SV=1)

HSP 1 Score: 659.4 bits (1700), Expect = 9.9e-186
Identity = 320/429 (74.59%), Positives = 367/429 (85.55%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 60
           MAL+FSLIFL S+AV AA SG YGF+VE++HRDSPKSPMYNPSETHY RLANTLRRSI  
Sbjct: 1   MALIFSLIFLISSAVFAAASGEYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSILL 60

Query: 61  NKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNA 120
           N+A AL DTA APM+N+RGEYL+++SLGTPPFPILAIADTGSD+VWTQCQPCP CY+Q A
Sbjct: 61  NRAVALLDTAEAPMFNDRGEYLVEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQTA 120

Query: 121 PMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMG 180
           PMF+PS+SSTYK +PCSSP C+ AG+ERSCSDRSEC YSI+YGD SHS GD AVDTVTMG
Sbjct: 121 PMFDPSKSSTYKIIPCSSPSCALAGQERSCSDRSECQYSISYGDGSHSNGDFAVDTVTMG 180

Query: 181 STSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPI 240
           STSGRPVAFPRTV+GCGHD+AGTF  NVSGIVGLG+GPASLVPQMG ASG KFSYCL PI
Sbjct: 181 STSGRPVAFPRTVVGCGHDSAGTFSTNVSGIVGLGRGPASLVPQMGAASGNKFSYCLTPI 240

Query: 241 GAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLG 300
           G   ESSKLNFGSNA V+GS  VSTPI TSDRF+S+YSLN+EA+SVG  +F+ P +S LG
Sbjct: 241 GDSAESSKLNFGSNAQVAGSGTVSTPIKTSDRFDSYYSLNIEAMSVGGKRFEFPAASALG 300

Query: 301 DGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAI 360
           DG N+IIDSGTTLT+LP   Y+  ATAIS S +L+RT+DPNQFL++CF+TT  DFE P++
Sbjct: 301 DGSNVIIDSGTTLTILPTEFYSTFATAISESISLERTEDPNQFLDFCFKTTNLDFEVPSV 360

Query: 361 TVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 420
           TVHFEGADVPL RENVF+ VA+NVVCLA   G    I IYGNIAQ NF+VGYD+    VS
Sbjct: 361 TVHFEGADVPLRRENVFVMVAENVVCLAFRGGDGQSISIYGNIAQINFIVGYDVTRNFVS 420

Query: 421 FKPADCAAM 430
           FKPA+C+AM
Sbjct: 421 FKPANCSAM 429

BLAST of Tan0017014 vs. ExPASy TrEMBL
Match: A0A6J1KIW1 (aspartic proteinase CDR1 OS=Cucurbita maxima OX=3661 GN=LOC111494868 PE=3 SV=1)

HSP 1 Score: 657.1 bits (1694), Expect = 4.9e-185
Identity = 320/429 (74.59%), Positives = 362/429 (84.38%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 60
           MAL+FSLIFL S+ V AA  G YGF+VE++HRDSPKSPMYNPSETHY RLANTLRRSI  
Sbjct: 1   MALIFSLIFLISSTVFAAARGEYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSILL 60

Query: 61  NKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNA 120
           NKA AL DTA APM+N+RGEYLM++SLGTPPFPILAIADTGSD+VWTQCQPCP CY+Q A
Sbjct: 61  NKAVALLDTAEAPMFNDRGEYLMEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQTA 120

Query: 121 PMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMG 180
           PMF+PS+SSTYK +PCSSP C+ AG+ERSCSDRS C YSI+YGD SHS GD AVDT+TMG
Sbjct: 121 PMFDPSKSSTYKIIPCSSPSCALAGQERSCSDRSVCQYSISYGDGSHSNGDFAVDTLTMG 180

Query: 181 STSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPI 240
           STSGRPVAFPRTV+GCGHD+AGTF  NVSGIVGLG+GPASLVPQMG AS GKFSYCL PI
Sbjct: 181 STSGRPVAFPRTVVGCGHDSAGTFSTNVSGIVGLGRGPASLVPQMGAASDGKFSYCLTPI 240

Query: 241 GAGLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLG 300
           G   ESSKLNFGSNA V+GS  VSTPI  SDRFNSFYSLN+EA+SVG  +F  P +S LG
Sbjct: 241 GDSAESSKLNFGSNAQVAGSGTVSTPIKISDRFNSFYSLNIEAMSVGGKRFQFPAASALG 300

Query: 301 DGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAI 360
           DG N+IIDSGTTLT++P   Y+  ATAIS+S +L RT+DPNQFL++CF+TT  DFE P++
Sbjct: 301 DGANVIIDSGTTLTIVPTEFYSTFATAISDSISLDRTEDPNQFLDFCFKTTNFDFEVPSV 360

Query: 361 TVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 420
           TVHFEGADVPL RENVF+ VA+NVVCLA   G    I IYGNIAQNNFLVGYD+    VS
Sbjct: 361 TVHFEGADVPLRRENVFVMVAENVVCLAFRGGDGQSISIYGNIAQNNFLVGYDVTRNSVS 420

Query: 421 FKPADCAAM 430
           FKP DC+AM
Sbjct: 421 FKPVDCSAM 429

BLAST of Tan0017014 vs. ExPASy TrEMBL
Match: A0A1S4E2N4 (aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC107991689 PE=3 SV=1)

HSP 1 Score: 637.1 bits (1642), Expect = 5.3e-179
Identity = 315/433 (72.75%), Positives = 372/433 (85.91%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAV-SAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIR 60
           MA +FS++FL S AV SA T+  YGFTVEL+HRDS KSPMYN SETHY R+AN LRRSI 
Sbjct: 1   MAPIFSILFLISTAVFSATTARDYGFTVELIHRDSTKSPMYNSSETHYDRIANALRRSIN 60

Query: 61  RNKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQN 120
           RNKA   +DTA AP+YNN GEYL++IS+GTPPF ILA+ADTGSDV+WTQC+PC NCYQQ+
Sbjct: 61  RNKAVLTSDTAEAPIYNNGGEYLVEISIGTPPFSILAVADTGSDVIWTQCEPCSNCYQQS 120

Query: 121 APMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTM 180
           APMF+PS+S+TYK VPCSSP+CSY+G+  SCSD SECLYSI YGD+SHS G+LAVDTVTM
Sbjct: 121 APMFDPSKSATYKNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVTM 180

Query: 181 GSTSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAP 240
            STSGRPVAFPRTVIGCGHDNAGTF+ANVSGIVGLG+GPASLV Q+GPA+GGKFSYCL P
Sbjct: 181 QSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLMP 240

Query: 241 IG-AGLE-SSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLP-TS 300
           IG A +E S+KLNFGSNADVSGS AVSTPIYTSD++ +FYSL LEA+SVG++KFD P  S
Sbjct: 241 IGNASMEDSTKLNFGSNADVSGSGAVSTPIYTSDQYKTFYSLKLEAVSVGDNKFDFPEVS 300

Query: 301 SPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFE 360
           S LG   NIIIDSGTTLT LP ++ ++  +AI++S NL R +DP+QFL+YCF TTTDD+E
Sbjct: 301 SKLGGEANIIIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTTDDYE 360

Query: 361 APAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKN 420
            P++T+HFEGADVPL REN+FIR++++ +CLA  A  DD IFIYGNIAQ+NFLVGYDIKN
Sbjct: 361 VPSVTMHFEGADVPLQRENMFIRLSEDTICLAFGAFSDDNIFIYGNIAQSNFLVGYDIKN 420

Query: 421 KLVSFKPADCAAM 430
             VSF+PADC AM
Sbjct: 421 LAVSFQPADCNAM 433

BLAST of Tan0017014 vs. ExPASy TrEMBL
Match: A0A5D3DLM9 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G00670 PE=3 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 3.4e-178
Identity = 313/433 (72.29%), Positives = 372/433 (85.91%), Query Frame = 0

Query: 1   MALLFSLIFLFSAAV-SAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIR 60
           MA +FS++FL S AV SA T+  YGFTVEL+HRDS KSPMYN SETHY R+AN LRRSI 
Sbjct: 1   MAPIFSILFLISTAVFSATTARDYGFTVELIHRDSTKSPMYNSSETHYDRIANALRRSIN 60

Query: 61  RNKAAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQN 120
           RNKA   +DTA AP+YNN GEYL++IS+GTPPF ILA+ADTGSDV+WTQC+PC NCYQQ+
Sbjct: 61  RNKAVLTSDTAEAPIYNNGGEYLVEISIGTPPFSILAVADTGSDVIWTQCEPCSNCYQQS 120

Query: 121 APMFNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTM 180
           APMF+PS+S+TYK VPCSSP+CSY+G+  SCSD SECLYSI YGD+SHS G+LAVDTVTM
Sbjct: 121 APMFDPSKSATYKNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVTM 180

Query: 181 GSTSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAP 240
            STSGRPV+FPRTVIGCGHDNAGTF+ANVSGIVGLG+GPASLV Q+GPA+GGKFSYCL P
Sbjct: 181 QSTSGRPVSFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLMP 240

Query: 241 IG-AGLE-SSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLP-TS 300
           IG A +E S+KLNFGSNADVSGS AVSTPIYTSD++ +FYSL LEA+SVG++KFD P  S
Sbjct: 241 IGNASMEDSTKLNFGSNADVSGSGAVSTPIYTSDQYKTFYSLKLEAVSVGDNKFDFPEVS 300

Query: 301 SPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFE 360
           S LG   NIIIDSGTTLT LP ++ ++  +AI++S NL R +DP+QFL+YCF TTTDD+E
Sbjct: 301 SKLGGEANIIIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTTDDYE 360

Query: 361 APAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKN 420
            P++T+HFEGADVPL REN+FIR++++ +CLA  A  DD IFIYGNIAQ+NFLVGYDIKN
Sbjct: 361 VPSVTMHFEGADVPLQRENMFIRLSEDTICLAFGAFSDDNIFIYGNIAQSNFLVGYDIKN 420

Query: 421 KLVSFKPADCAAM 430
             VSF+PA+C AM
Sbjct: 421 LAVSFQPAECNAM 433

BLAST of Tan0017014 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 402.9 bits (1034), Expect = 3.2e-112
Identity = 214/434 (49.31%), Positives = 286/434 (65.90%), Query Frame = 0

Query: 2   ALLFSLIFLFSAAVSAATS-GGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR 61
           ++L SL  L S  +S A +    GFT +L+HRDSPKSP YNP ET  QRL N + RS+ R
Sbjct: 7   SVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR 66

Query: 62  NKAAALADTAAAP---MYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQ 121
                  D    P   + +N GEYLM +S+GTPPFPI+AIADTGSD++WTQC PC +CY 
Sbjct: 67  VFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYT 126

Query: 122 QNAPMFNPSESSTYKKVPCSSPICSYAGEERSCS-DRSECLYSITYGDRSHSQGDLAVDT 181
           Q  P+F+P  SSTYK V CSS  C+    + SCS + + C YS++YGD S+++G++AVDT
Sbjct: 127 QVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 186

Query: 182 VTMGSTSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYC 241
           +T+GS+  RP+     +IGCGH+NAGTF+   SGIVGLG GP SL+ Q+G +  GKFSYC
Sbjct: 187 LTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYC 246

Query: 242 LAPIGAGL-ESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPT 301
           L P+ +   ++SK+NFG+NA VSGS  VSTP+       +FY L L++ISVG  +     
Sbjct: 247 LVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSG 306

Query: 302 SSPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDF 361
           S       NIIIDSGTTLTLLP   Y+++  A+++S + ++  DP   L  C+ + T D 
Sbjct: 307 SDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDL 366

Query: 362 EAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIK 421
           + P IT+HF+GADV L   N F++V++++VC A          IYGN+AQ NFLVGYD  
Sbjct: 367 KVPVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRG--SPSFSIYGNVAQMNFLVGYDTV 426

Query: 422 NKLVSFKPADCAAM 430
           +K VSFKP DCA M
Sbjct: 427 SKTVSFKPTDCAKM 437

BLAST of Tan0017014 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 399.8 bits (1026), Expect = 2.7e-111
Identity = 217/414 (52.42%), Positives = 284/414 (68.60%), Query Frame = 0

Query: 24  GFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRR----NKAAALADTAAAPMYNNRG 83
           GFT++L+HRDSPKSP YN +ET  QR+ N +RRS R     +   A  ++  + + +NRG
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNRG 84

Query: 84  EYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAPMFNPSESSTYKKVPCSSP 143
           EYLM IS+GTPP PILAIADTGSD++WTQC PC +CYQQ +P+F+P ESSTY+KV CSS 
Sbjct: 85  EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSS 144

Query: 144 ICSYAGEERSCS-DRSECLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVAFPRTVIGCGH 203
            C  A E+ SCS D + C Y+ITYGD S+++GD+AVDTVTMGS+  RPV+    +IGCGH
Sbjct: 145 QCR-ALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGH 204

Query: 204 DNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIGA--GLESSKLNFGSNAD 263
           +N GTFD   SGI+GLG G  SLV Q+  +  GKFSYCL P  +  GL +SK+NFG+N  
Sbjct: 205 ENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGL-TSKINFGTNGI 264

Query: 264 VSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLGDGP-NIIIDSGTTLTL 323
           VSG   VST +   D   ++Y LNLEAISVG  K    TS+  G G  NI+IDSGTTLTL
Sbjct: 265 VSGDGVVSTSMVKKDP-ATYYFLNLEAISVGSKKIQF-TSTIFGTGEGNIVIDSGTTLTL 324

Query: 324 LPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAITVHFEGADVPLHREN 383
           LP N Y ++ + ++++   +R  DP+  L  C+  ++  F+ P ITVHF+G DV L   N
Sbjct: 325 LPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSS-SFKVPDITVHFKGGDVKLGNLN 384

Query: 384 VFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVSFKPADCAAM 430
            F+ V+++V C A AA +   + I+GN+AQ NFLVGYD  +  VSFK  DC+ M
Sbjct: 385 TFVAVSEDVSCFAFAANEQ--LTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQM 431

BLAST of Tan0017014 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 332.4 bits (851), Expect = 5.3e-91
Identity = 190/439 (43.28%), Positives = 263/439 (59.91%), Query Frame = 0

Query: 4   LFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRRNKA 63
           L ++ F F++  SA        TVEL+HRDSP SP+YNP  T   RL     RSI R++ 
Sbjct: 11  LLAISFFFASNSSANRE---NLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRR 70

Query: 64  AALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAPMF 123
                   + + +N GEY M IS+GTPP  + AIADTGSD+ W QC+PC  CY+QN+P+F
Sbjct: 71  FTTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLF 130

Query: 124 NPSESSTYKKVPCSSPICSYAGEERSCSDRSE--CLYSITYGDRSHSQGDLAVDTVTMGS 183
           +  +SSTYK   C S  C    E     D S+  C Y  +YGD S ++GD+A +T+++ S
Sbjct: 131 DKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDS 190

Query: 184 TSGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIG 243
           +SG  V+FP TV GCG++N GTF+   SGI+GLG GP SLV Q+G + G KFSYCL+   
Sbjct: 191 SSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTA 250

Query: 244 AGLE-SSKLNFGSNADVSG----SEAVSTPIYTSDRFNSFYSLNLEAISVGEDK------ 303
           A    +S +N G+N+  S     S  ++TP+   D   ++Y L LEA++VG+ K      
Sbjct: 251 ATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDP-ETYYFLTLEAVTVGKTKLPYTGG 310

Query: 304 -FDLPTSSPLGDGPNIIIDSGTTLTLLPRNVYTDVATAISNS-TNLQRTDDPNQFLEYCF 363
            + L   S    G NIIIDSGTTLTLL    Y D  TA+  S T  +R  DP   L +CF
Sbjct: 311 GYGLNGKSSKRTG-NIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCF 370

Query: 364 ETTTDDFEAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNF 423
           ++   +   PAIT+HF  ADV L   N F+++ ++ VCL++    +  + IYGN+ Q +F
Sbjct: 371 KSGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTE--VAIYGNMVQMDF 430

Query: 424 LVGYDIKNKLVSFKPADCA 428
           LVGYD++ K VSF+  DC+
Sbjct: 431 LVGYDLETKTVSFQRMDCS 442

BLAST of Tan0017014 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 329.7 bits (844), Expect = 3.5e-90
Identity = 191/439 (43.51%), Positives = 270/439 (61.50%), Query Frame = 0

Query: 9   FLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRRNKA--AAL 68
           FLF +   +++     F+VEL+HRDSP SP+YNP  T   RL     RS+ R++     L
Sbjct: 10  FLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQL 69

Query: 69  ADT-AAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAPMFNP 128
           + T   + +    GE+ M I++GTPP  + AIADTGSD+ W QC+PC  CY++N P+F+ 
Sbjct: 70  SQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDK 129

Query: 129 SESSTYKKVPCSSPIC-SYAGEERSCSDRSE-CLYSITYGDRSHSQGDLAVDTVTMGSTS 188
            +SSTYK  PC S  C + +  ER C + +  C Y  +YGD+S S+GD+A +TV++ S S
Sbjct: 130 KKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSAS 189

Query: 189 GRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIGAG 248
           G PV+FP TV GCG++N GTFD   SGI+GLG G  SL+ Q+G +   KFSYCL+   A 
Sbjct: 190 GSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSAT 249

Query: 249 LE-SSKLNFGSNADVSG----SEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSS- 308
              +S +N G+N+  S     S  VSTP+   +   ++Y L LEAISVG+ K     SS 
Sbjct: 250 TNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAISVGKKKIPYTGSSY 309

Query: 309 -PLGDG------PNIIIDSGTTLTLLPRNVYTDVATAISNS-TNLQRTDDPNQFLEYCFE 368
            P  DG       NIIIDSGTTLTLL    +   ++A+  S T  +R  DP   L +CF+
Sbjct: 310 NPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFK 369

Query: 369 TTTDDFEAPAITVHFEGADVPLHRENVFIRVADNVVCLALAAGQDDGIFIYGNIAQNNFL 428
           + + +   P ITVHF GADV L   N F+++++++VCL++    +  + IYGN AQ +FL
Sbjct: 370 SGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTE--VAIYGNFAQMDFL 429

BLAST of Tan0017014 vs. TAIR 10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 276.6 bits (706), Expect = 3.5e-74
Identity = 169/429 (39.39%), Positives = 239/429 (55.71%), Query Frame = 0

Query: 3   LLFSLIFLFSAAVSAATSGGYGFTVELMHRDSPKSPMYNPSETHYQRLANTLRRSIRRNK 62
           L  SL FLF+   S      +GFT++L+HR S  S   + +++     ANT         
Sbjct: 12  LQISLCFLFTTTASPP----HGFTMDLIHRRSNASSRVSNTQSGSSPYANT--------- 71

Query: 63  AAALADTAAAPMYNNRGEYLMKISLGTPPFPILAIADTGSDVVWTQCQPCPNCYQQNAPM 122
                      +++N   YLMK+ +GTPPF I AI DTGS++ WTQC PC +CY+QNAP+
Sbjct: 72  -----------VFDN-SVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPI 131

Query: 123 FNPSESSTYKKVPCSSPICSYAGEERSCSDRSECLYSITYGDRSHSQGDLAVDTVTMGST 182
           F+PS+SST+K+  C               D   C Y + Y D +++ G LA +T+T+ ST
Sbjct: 132 FDPSKSSTFKEKRC---------------DGHSCPYEVDYFDHTYTMGTLATETITLHST 191

Query: 183 SGRPVAFPRTVIGCGHDNAGTFDANVSGIVGLGQGPASLVPQMGPASGGKFSYCLAPIGA 242
           SG P   P T+IGCGH+N+  F  + SG+VGL  GP+SL+ QMG    G  SYC     +
Sbjct: 192 SGEPFVMPETIIGCGHNNS-WFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCF----S 251

Query: 243 GLESSKLNFGSNADVSGSEAVSTPIYTSDRFNSFYSLNLEAISVGEDKFDLPTSSPLGDG 302
           G  +SK+NFG+NA V+G   VST ++ +     FY LNL+A+SVG  + +   ++     
Sbjct: 252 GQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALE 311

Query: 303 PNIIIDSGTTLTLLPRNVYTDVATAISNSTNLQRTDDPNQFLEYCFETTTDDFEAPAITV 362
            NI+IDSGTTLT  P +    V  A+ +     R  DP      C+ + T D   P IT+
Sbjct: 312 GNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDI-FPVITM 371

Query: 363 HFE-GADVPLHRENVFIRVAD-NVVCLALAAGQDDGIFIYGNIAQNNFLVGYDIKNKLVS 422
           HF  G D+ L + N+++   +  V CLA+         I+GN AQNNFLVGYD  + LVS
Sbjct: 372 HFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVS 394

Query: 423 FKPADCAAM 430
           F P +C+A+
Sbjct: 432 FSPTNCSAL 394

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q6XBF8	4.5e-111	49.31	Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
Q3EBM5	4.9e-89	43.51	Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561...
Q766C3	5.2e-67	36.55	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2	2.6e-66	37.68	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LNJ3	7.8e-55	36.46	Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...

Match Name	E-value	Identity	Description
XP_022964067.1	5.7e-188	75.76	aspartic proteinase CDR1-like [Cucurbita moschata]
XP_023514471.1	4.9e-187	75.29	aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]
XP_038876324.1	9.2e-186	76.83	aspartic proteinase CDR1-like [Benincasa hispida]
KAG6593735.1	9.2e-186	74.59	Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia]
XP_022964064.1	2.0e-185	74.59	aspartic proteinase CDR1-like [Cucurbita moschata]

Match Name	E-value	Identity	Description
A0A6J1HGT9	2.8e-188	75.76	aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464204 PE=3...
A0A6J1HM14	9.9e-186	74.59	aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464202 PE=3...
A0A6J1KIW1	4.9e-185	74.59	aspartic proteinase CDR1 OS=Cucurbita maxima OX=3661 GN=LOC111494868 PE=3 SV=1
A0A1S4E2N4	5.3e-179	72.75	aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC107991689 PE=3 SV=1
A0A5D3DLM9	3.4e-178	72.29	Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc...

Match Name	E-value	Identity	Description
AT5G33340.1	3.2e-112	49.31	Eukaryotic aspartyl protease family protein
AT1G64830.1	2.7e-111	52.42	Eukaryotic aspartyl protease family protein
AT1G31450.1	5.3e-91	43.28	Eukaryotic aspartyl protease family protein
AT2G35615.1	3.5e-90	43.51	Eukaryotic aspartyl protease family protein
AT2G28010.1	3.5e-74	39.39	Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 305..316, score: 42.89; coord: 87..107, score: 39.76; coord: 398..413, score: 22.25
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 59..250, e-value: 8.1E-56, score: 191.1
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 251..429, e-value: 4.2E-43, score: 149.1
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 76..426
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 276..422, e-value: 3.3E-25, score: 88.7
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 81..253, e-value: 1.2E-55, score: 188.5
None	No IPR available	PANTHER	PTHR47967	OS07G0603500 PROTEIN-RELATED	coord: 5..427
None	No IPR available	PANTHER	PTHR47967:SF66	ASPARTIC PROTEINASE CDR1-RELATED	coord: 5..427
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 96..107
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 305..316
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 81..422, score: 43.402004
IPR034161	Pepsin-like domain, plant	CDD	cd05476	pepsin_A_like_plant	coord: 80..426, e-value: 1.1071E-98, score: 293.785
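
The alignment entries above give each domain hit as a `coord: start..end` pair. A minimal sketch of extracting those ranges and computing their lengths is given here. It assumes the coordinates are 1-based, inclusive residue positions on the predicted protein, which the page does not state explicitly; the names `parse_coords` and `span_length` are hypothetical.

```python
import re

# Matches entries such as "coord: 96..107" in the InterPro alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in an alignment field."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Example: the three PRINTS PR00792 motif hits listed for IPR001461.
prints_hits = parse_coords(
    "coord: 305..316 score: 42.89 coord: 87..107 score: 39.76 "
    "coord: 398..413 score: 22.25"
)
motif_lengths = [span_length(s, e) for s, e in prints_hits]
```

Under the same assumption, the two PROSITE PS00141 active-site hits (96..107 and 305..316) each span 12 residues.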

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0017014.1	Tan0017014.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
molecular_function	GO:0004190	aspartic-type endopeptidase activity