CSPI04G06740 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI04G06740
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionaspartic proteinase CDR1-like
LocationChr4: 4715632 .. 4716936 (-)
RNA-Seq ExpressionCSPI04G06740
SyntenyCSPI04G06740
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGCCATTTCAATCTTCTTCTACTTTCTCCTCTTCTTCTCCTCCAAAGTAACCGCTCATGGCGGTGGCCACCATGGCTTCACTACCTCTCTATTCCGCCGCGATTCTCCTCTCTCCCCTCTCCACAACCCATCTCTCTCCCGCTATGACAGCCTTATCGACGCCTTTCGTCGCTCCTTCTCCCGCTCCGCCACCCTCCTCACCCACCTCACTTCTGTCTCCACTGCATGCATACGTTCTCCCATCATCCCCGACAGCGGCGAGTTTCTAATGTCTATCTTTATCGGAACCCCTCCAGTGAATGTCATAGCCATTGCCGATACTGGCAGCGACCTAACGTGGACTCAATGCTTGCCATGTCGGGAATGCTTCAACCAATCACAGCCTATTTTTAATCCACGTCGGTCATCTTCCTACCGCAAAGTGTCTTGTGCATCGGATACGTGTCGCTCCTTGGAAAGTTACCATTGTGGACCGGACCTTCAAAGCTGCAGCTACGGCTATAGCTATGGAGATCGATCATTTACGTATGGTGACCTAGCATCTGATCAAATTACCATTGGGTCCTTCAAACTCCCCAAGACAGTTATCGGATGCGGTCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCCGGAATCATTGGACTTGGTGGTGGATCTCTCTCTTTGGTGTCGCAAATGAGAACAATTGCCGGCGTCAAACCACGATTCTCATATTGCTTGCCCACTTTCTTTAGCAACGCTAATATCACAGGTACAATAAGCTTTGGCCGAAAGGCTGTCGTTTCAGGGCGTCAAGTCGTTTCTACCCCTCTCGTTCCGAGATCTCCCGATACTTTTTATTTCTTGACTCTTGAGGCAATCTCTGTTGGAAAGAAGCGATTTAAAGCCGCAAACGGCATATCGGCCATGACCAACCATGGGAATATCATTATAGATTCCGGTACGACATTGACTCTTCTACCTCGGAGTCTGTACTACGGCGTCCTTTCGACTTTGGCGAGAGTCATTAAAGCAAAGCGGGTGGACGATCCTTCGGGGATTTTGGAGCTTTGCTATTCTGCGGGGCAAGTTGACGATTTAAATATTCCAATAATCACGGCACATTTTGCAGGTGGTGCCGACGTGAAGTTGCTGCCGGTGAACACATTTGCACCGGTGGCTGATAATGTGACTTGTTTGACTTTTGCACCGGCAACGCAGGTCGCCATTTTTGGAAACTTGGCACAGATAAACTTTGAAGTTGGATATGATCTTGGGAATAAGAGATTATCGTTTGAACCTAAACTTTGTGCTTAG

mRNA sequence

ATGGCTGCCATTTCAATCTTCTTCTACTTTCTCCTCTTCTTCTCCTCCAAAGTAACCGCTCATGGCGGTGGCCACCATGGCTTCACTACCTCTCTATTCCGCCGCGATTCTCCTCTCTCCCCTCTCCACAACCCATCTCTCTCCCGCTATGACAGCCTTATCGACGCCTTTCGTCGCTCCTTCTCCCGCTCCGCCACCCTCCTCACCCACCTCACTTCTGTCTCCACTGCATGCATACGTTCTCCCATCATCCCCGACAGCGGCGAGTTTCTAATGTCTATCTTTATCGGAACCCCTCCAGTGAATGTCATAGCCATTGCCGATACTGGCAGCGACCTAACGTGGACTCAATGCTTGCCATGTCGGGAATGCTTCAACCAATCACAGCCTATTTTTAATCCACGTCGGTCATCTTCCTACCGCAAAGTGTCTTGTGCATCGGATACGTGTCGCTCCTTGGAAAGTTACCATTGTGGACCGGACCTTCAAAGCTGCAGCTACGGCTATAGCTATGGAGATCGATCATTTACGTATGGTGACCTAGCATCTGATCAAATTACCATTGGGTCCTTCAAACTCCCCAAGACAGTTATCGGATGCGGTCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCCGGAATCATTGGACTTGGTGGTGGATCTCTCTCTTTGGTGTCGCAAATGAGAACAATTGCCGGCGTCAAACCACGATTCTCATATTGCTTGCCCACTTTCTTTAGCAACGCTAATATCACAGGTACAATAAGCTTTGGCCGAAAGGCTGTCGTTTCAGGGCGTCAAGTCGTTTCTACCCCTCTCGTTCCGAGATCTCCCGATACTTTTTATTTCTTGACTCTTGAGGCAATCTCTGTTGGAAAGAAGCGATTTAAAGCCGCAAACGGCATATCGGCCATGACCAACCATGGGAATATCATTATAGATTCCGGTACGACATTGACTCTTCTACCTCGGAGTCTGTACTACGGCGTCCTTTCGACTTTGGCGAGAGTCATTAAAGCAAAGCGGGTGGACGATCCTTCGGGGATTTTGGAGCTTTGCTATTCTGCGGGGCAAGTTGACGATTTAAATATTCCAATAATCACGGCACATTTTGCAGGTGGTGCCGACGTGAAGTTGCTGCCGGTGAACACATTTGCACCGGTGGCTGATAATGTGACTTGTTTGACTTTTGCACCGGCAACGCAGGTCGCCATTTTTGGAAACTTGGCACAGATAAACTTTGAAGTTGGATATGATCTTGGGAATAAGAGATTATCGTTTGAACCTAAACTTTGTGCTTAG

Coding sequence (CDS)

ATGGCTGCCATTTCAATCTTCTTCTACTTTCTCCTCTTCTTCTCCTCCAAAGTAACCGCTCATGGCGGTGGCCACCATGGCTTCACTACCTCTCTATTCCGCCGCGATTCTCCTCTCTCCCCTCTCCACAACCCATCTCTCTCCCGCTATGACAGCCTTATCGACGCCTTTCGTCGCTCCTTCTCCCGCTCCGCCACCCTCCTCACCCACCTCACTTCTGTCTCCACTGCATGCATACGTTCTCCCATCATCCCCGACAGCGGCGAGTTTCTAATGTCTATCTTTATCGGAACCCCTCCAGTGAATGTCATAGCCATTGCCGATACTGGCAGCGACCTAACGTGGACTCAATGCTTGCCATGTCGGGAATGCTTCAACCAATCACAGCCTATTTTTAATCCACGTCGGTCATCTTCCTACCGCAAAGTGTCTTGTGCATCGGATACGTGTCGCTCCTTGGAAAGTTACCATTGTGGACCGGACCTTCAAAGCTGCAGCTACGGCTATAGCTATGGAGATCGATCATTTACGTATGGTGACCTAGCATCTGATCAAATTACCATTGGGTCCTTCAAACTCCCCAAGACAGTTATCGGATGCGGTCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCCGGAATCATTGGACTTGGTGGTGGATCTCTCTCTTTGGTGTCGCAAATGAGAACAATTGCCGGCGTCAAACCACGATTCTCATATTGCTTGCCCACTTTCTTTAGCAACGCTAATATCACAGGTACAATAAGCTTTGGCCGAAAGGCTGTCGTTTCAGGGCGTCAAGTCGTTTCTACCCCTCTCGTTCCGAGATCTCCCGATACTTTTTATTTCTTGACTCTTGAGGCAATCTCTGTTGGAAAGAAGCGATTTAAAGCCGCAAACGGCATATCGGCCATGACCAACCATGGGAATATCATTATAGATTCCGGTACGACATTGACTCTTCTACCTCGGAGTCTGTACTACGGCGTCCTTTCGACTTTGGCGAGAGTCATTAAAGCAAAGCGGGTGGACGATCCTTCGGGGATTTTGGAGCTTTGCTATTCTGCGGGGCAAGTTGACGATTTAAATATTCCAATAATCACGGCACATTTTGCAGGTGGTGCCGACGTGAAGTTGCTGCCGGTGAACACATTTGCACCGGTGGCTGATAATGTGACTTGTTTGACTTTTGCACCGGCAACGCAGGTCGCCATTTTTGGAAACTTGGCACAGATAAACTTTGAAGTTGGATATGATCTTGGGAATAAGAGATTATCGTTTGAACCTAAACTTTGTGCTTAG

Protein sequence

MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA*
Homology
BLAST of CSPI04G06740 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 353.6 bits (906), Expect = 3.2e-96
Identity = 207/453 (45.70%), Postives = 278/453 (61.37%), Query Frame = 0

Query: 3   AISIFFYFLLFFSSKVTAHGGGH-HGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSF 62
           A  I   F LFFS  VT    GH   F+  L  RDSPLSP++NP ++  D L  AF RS 
Sbjct: 2   ATQILLCFFLFFS--VTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSV 61

Query: 63  SRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPC 122
           SRS      L+      ++S +I   GEF MSI IGTPP+ V AIADTGSDLTW QC PC
Sbjct: 62  SRSRRFNHQLSQTD---LQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC 121

Query: 123 RECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQS--CSYGYSYGDRSFTYG 182
           ++C+ ++ PIF+ ++SS+Y+   C S  C++L S   G D  +  C Y YSYGD+SF+ G
Sbjct: 122 QQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKG 181

Query: 183 DLASDQITIGS-----FKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAG 242
           D+A++ ++I S        P TV GCG+ NGGTF    SGIIGLGGG LSL+SQ+   + 
Sbjct: 182 DVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SS 241

Query: 243 VKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQ----VVSTPLVPRSPDTFYFLTLEAI 302
           +  +FSYCL    +  N T  I+ G  ++ S       VVSTPLV + P T+Y+LTLEAI
Sbjct: 242 ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAI 301

Query: 303 SVGKKRFKAA--------NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLAR-VIKA 362
           SVGKK+            +GI + T+ GNIIIDSGTTLTLL    +    S +   V  A
Sbjct: 302 SVGKKKIPYTGSSYNPNDDGILSETS-GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGA 361

Query: 363 KRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPA 422
           KRV DP G+L  C+ +G   ++ +P IT HF  GADV+L P+N F  +++++ CL+  P 
Sbjct: 362 KRVSDPQGLLSHCFKSGSA-EIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPT 421

Query: 423 TQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           T+VAI+GN AQ++F VGYDL  + +SF+   C+
Sbjct: 422 TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of CSPI04G06740 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 6.0e-95
Identity = 192/416 (46.15%), Postives = 264/416 (63.46%), Query Frame = 0

Query: 27  GFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSV-STACIRSPIIP 86
           GFT  L  RDSP SP +NP  +    L +A  RS +R    + H T   +T   +  +  
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR----VFHFTEKDNTPQPQIDLTS 89

Query: 87  DSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSC 146
           +SGE+LM++ IGTPP  ++AIADTGSDL WTQC PC +C+ Q  P+F+P+ SS+Y+ VSC
Sbjct: 90  NSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSC 149

Query: 147 ASDTCRSLESY-HCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIG 206
           +S  C +LE+   C  +  +CSY  SYGD S+T G++A D +T+GS      +L   +IG
Sbjct: 150 SSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIG 209

Query: 207 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFG 266
           CGH N GTF    SGI+GLGGG +SL+ Q+     +  +FSYCL    S  + T  I+FG
Sbjct: 210 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFG 269

Query: 267 RKAVVSGRQVVSTPLVPR-SPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGT 326
             A+VSG  VVSTPL+ + S +TFY+LTL++ISVG K+ +  +G  + ++ GNIIIDSGT
Sbjct: 270 TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQ-YSGSDSESSEGNIIIDSGT 329

Query: 327 TLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADV 386
           TLTLLP   Y  +   +A  I A++  DP   L LCYSA    DL +P+IT HF  GADV
Sbjct: 330 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHF-DGADV 389

Query: 387 KLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           KL   N F  V++++ C  F  +   +I+GN+AQ+NF VGYD  +K +SF+P  CA
Sbjct: 390 KLDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CSPI04G06740 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 5.0e-57
Identity = 156/444 (35.14%), Postives = 221/444 (49.77%), Query Frame = 0

Query: 1   MAAISIFFYFLL-FFSSKVTAHGGGHH----GFTTSLFRRDSPLSPLHNPSLSRYDSLID 60
           + A+SI + F+    S+  TA    H     GF   L   DS        +L+++  L  
Sbjct: 9   LLALSIVYIFVAPTHSTSRTALNHRHEAKVTGFQIMLEHVDS------GKNLTKFQLLER 68

Query: 61  AFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTW 120
           A  R   R   L   L   S   + + +    GE+LM++ IGTP     AI DTGSDL W
Sbjct: 69  AIERGSRRLQRLEAMLNGPSG--VETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIW 128

Query: 121 TQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRS 180
           TQC PC +CFNQS PIFNP+ SSS+  + C+S  C++L S  C  +   C Y Y YGD S
Sbjct: 129 TQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNF--CQYTYGYGDGS 188

Query: 181 FTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGV 240
            T G + ++ +T GS  +P    GCG  N G   G  +G++G+G G LSL SQ+      
Sbjct: 189 ETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV---- 248

Query: 241 KPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSP-DTFYFLTLEAISVGK 300
             +FSYC+    S+      +     +V +G    +T L+  S   TFY++TL  +SVG 
Sbjct: 249 -TKFSYCMTPIGSSTPSNLLLGSLANSVTAGSP--NTTLIQSSQIPTFYYITLNGLSVGS 308

Query: 301 KRF---KAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGIL 360
            R     +A  +++    G IIIDSGTTLT    + Y  V       I    V+  S   
Sbjct: 309 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGF 368

Query: 361 ELCY-SAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQ-VAIFGN 420
           +LC+ +     +L IP    HF GG D++L   N F   ++ + CL    ++Q ++IFGN
Sbjct: 369 DLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLICLAMGSSSQGMSIFGN 428

Query: 421 LAQINFEVGYDLGNKRLSFEPKLC 434
           + Q N  V YD GN  +SF    C
Sbjct: 429 IQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI04G06740 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 1.4e-56
Identity = 144/394 (36.55%), Postives = 207/394 (52.54%), Query Frame = 0

Query: 46  SLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIA 105
           +L++Y+ +  A +R   R  ++   L S S   I +P+    GE+LM++ IGTP  +  A
Sbjct: 54  NLTKYELIKRAIKRGERRMRSINAMLQSSSG--IETPVYAGDGEYLMNVAIGTPDSSFSA 113

Query: 106 IADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSC 165
           I DTGSDL WTQC PC +CF+Q  PIFNP+ SSS+  + C S  C+ L S  C  +   C
Sbjct: 114 IMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN--EC 173

Query: 166 SYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSL 225
            Y Y YGD S T G +A++  T  +  +P    GCG  N G   G  +G+IG+G G LSL
Sbjct: 174 QYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSL 233

Query: 226 VSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPD-TFYF 285
            SQ+    GV  +FSYC+ ++ S++  T  +      V  G    ST L+  S + T+Y+
Sbjct: 234 PSQL----GV-GQFSYCMTSYGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTYYY 293

Query: 286 LTLEAISVGKKRFKAANGISAMTNH--GNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAK 345
           +TL+ I+VG       +    + +   G +IIDSGTTLT LP+  Y  V       I   
Sbjct: 294 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLP 353

Query: 346 RVDDPSGILELCY-SAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPA 405
            VD+ S  L  C+        + +P I+  F GG  + L   N     A+ V CL    +
Sbjct: 354 TVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLAMGSS 413

Query: 406 TQ--VAIFGNLAQINFEVGYDLGNKRLSFEPKLC 434
           +Q  ++IFGN+ Q   +V YDL N  +SF P  C
Sbjct: 414 SQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI04G06740 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 3.9e-54
Identity = 133/368 (36.14%), Postives = 192/368 (52.17%), Query Frame = 0

Query: 76  TACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPR 135
           ++ + S +   SGE+   + +GTP   V  + DTGSD+ W QC PCR C++QS PIF+PR
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 187

Query: 136 RSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPK 195
           +S +Y  + C+S  CR L+S  C    ++C Y  SYGD SFT GD +++ +T    ++  
Sbjct: 188 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG 247

Query: 196 TVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGT 255
             +GCGH N G F G  +G++GLG G LS   Q  T      +FSYCL    S ++   +
Sbjct: 248 VALGCGHDNEGLFVG-AAGLLGLGKGKLSFPGQ--TGHRFNQKFSYCL-VDRSASSKPSS 307

Query: 256 ISFGRKAVVSGRQVVSTPLVPRSP-DTFYFLTLEAISVGKKRFKAANGISA------MTN 315
           + FG  AV   R    TPL+     DTFY++ L  ISVG  R     G++A         
Sbjct: 308 VVFGNAAV--SRIARFTPLLSNPKLDTFYYVGLLGISVGGTR---VPGVTASLFKLDQIG 367

Query: 316 HGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPII 375
           +G +IIDSGT++T L R  Y  +        K  +      + + C+    ++++ +P +
Sbjct: 368 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTV 427

Query: 376 TAHFAGGADVKLLPVNTFAPVADN-VTCLTFAPAT-QVAIFGNLAQINFEVGYDLGNKRL 435
             HF  GADV L   N   PV  N   C  FA     ++I GN+ Q  F V YDL + R+
Sbjct: 428 VLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 485

BLAST of CSPI04G06740 vs. ExPASy TrEMBL
Match: A0A0A0KZZ3 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 1.9e-248
Identity = 433/434 (99.77%), Postives = 433/434 (99.77%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS
Sbjct: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
           NGISAMTNHGNIIIDSGTTLTLLPRSLYYGV STLARVIKAKRVDDPSGILELCYSAGQV
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSFEPKLCA
Sbjct: 421 LGNKRLSFEPKLCA 434

BLAST of CSPI04G06740 vs. ExPASy TrEMBL
Match: A0A5A7TPZ5 (Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G002740 PE=3 SV=1)

HSP 1 Score: 792.7 bits (2046), Expect = 7.6e-226
Identity = 397/434 (91.47%), Postives = 410/434 (94.47%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           M  ISIFFYFLLFFSSK TAHGGGHHGFTTSLF RDS LSPLHNPSLSRYDSL+++FRRS
Sbjct: 1   MPVISIFFYFLLFFSSKATAHGGGHHGFTTSLFHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLL HLTSVSTACIRSPIIPDSGEFLMSIFIGTP VN IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSC+SDTCRSLES HCG DL+SCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASD+ITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQM TIAGVKP+FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSN NITG ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVG KRFKAA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
             +SAMTN GNIIIDSGTTLTLLPRSLY GV+STLARVIKAKRVDDPSGILELCYSAGQ+
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKAKRVDDPSGILELCYSAGQL 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           +DLNIPIITAHF+G ADVKLLPVNTFAPVADNV CLT APAT VAIFGNLAQINFEVGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSF+P  CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of CSPI04G06740 vs. ExPASy TrEMBL
Match: A0A5D3D1Z7 (Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G003070 PE=3 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 1.7e-225
Identity = 396/434 (91.24%), Postives = 410/434 (94.47%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           M AISIFFYFLLFFSSK TAHGGGHHGFTTSL+ RDS LSPLHNPSLSRYDSL+++FRRS
Sbjct: 1   MPAISIFFYFLLFFSSKATAHGGGHHGFTTSLYHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLL HLTSVSTACIRSPIIPDSGEFLMSIFIGTP VN IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSC+SDTCRSLES HCG DL+SCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASD+ITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQM TIAGVKP+FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSN NITG ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVG KRFKAA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
             +SAMTN GNIIIDSGTTLTLLPRSLY GV+STLARVIK KRVDDPSGILELCYSAGQ+
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKTKRVDDPSGILELCYSAGQL 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           +DLNIPIITAHF+G ADVKLLPVNTFAPVADNV CLT APAT VAIFGNLAQINFEVGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSF+P  CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of CSPI04G06740 vs. ExPASy TrEMBL
Match: A0A1S3BT75 (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=3 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 1.7e-225
Identity = 396/434 (91.24%), Postives = 410/434 (94.47%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           M AISIFFYFLLFFSSK TAHGGGHHGFTTSL+ RDS LSPLHNPSLSRYDSL+++FRRS
Sbjct: 1   MPAISIFFYFLLFFSSKATAHGGGHHGFTTSLYHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLL HLTSVSTACIRSPIIPDSGEFLMSIFIGTP VN IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSC+SDTCRSLES HCG DL+SCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASD+ITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQM TIAGVKP+FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSN NITG ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVG KRFKAA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
             +SAMTN GNIIIDSGTTLTLLPRSLY GV+STLARVIK KRVDDPSGILELCYSAGQ+
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKTKRVDDPSGILELCYSAGQL 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           +DLNIPIITAHF+G ADVKLLPVNTFAPVADNV CLT APAT VAIFGNLAQINFEVGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSF+P  CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of CSPI04G06740 vs. ExPASy TrEMBL
Match: A0A6J1FP39 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111447216 PE=3 SV=1)

HSP 1 Score: 624.8 bits (1610), Expect = 2.7e-175
Identity = 314/437 (71.85%), Postives = 358/437 (81.92%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAH---GGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAF 60
           MAAISIFF   L   S+ TAH   GGG HGFTTSLF RDS LSPL+NPSLS YD L +AF
Sbjct: 1   MAAISIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60

Query: 61  RRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQ 120
           RRSFSRS TLL    +VS   I S IIPD GEFLMSI IGTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120

Query: 121 CLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFT 180
           C+PC +CFNQS PIFNPRRS SYR VSC S+ CRSL+ Y CGPD ++CSYGYSYGD+SFT
Sbjct: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180

Query: 181 YGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKP 240
           YGDLAS++ITIGSFKL KT+IGCGH NGGTF   TSGIIGLGGG LSL+SQMR IA VK 
Sbjct: 181 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKR 240

Query: 241 RFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRF 300
           RFSYCLPTFFS+ N+TG ISFG+KA+VSGR+VVSTPLV + P+TFY+LTLEA+SV  KRF
Sbjct: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 300

Query: 301 KAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSA 360
           KAAN +S     GNI+IDSGTTLT+LP++LY GV STLA V+KAKRV+DP+G+L+LC++A
Sbjct: 301 KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAA 360

Query: 361 GQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEV 420
             VD LNIP+ITAHFAG ADVKLLP+NTFA VADNV CL F P+   AIFGNLAQ+NF V
Sbjct: 361 CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420

Query: 421 GYDLGNKRLSFEPKLCA 435
           GYDL  KRLSF+  +CA
Sbjct: 421 GYDLERKRLSFKYNVCA 437

BLAST of CSPI04G06740 vs. NCBI nr
Match: XP_004149004.1 (probable aspartic protease At2g35615 [Cucumis sativus] >KGN53446.1 hypothetical protein Csa_015341 [Cucumis sativus])

HSP 1 Score: 867.8 bits (2241), Expect = 3.8e-248
Identity = 433/434 (99.77%), Postives = 433/434 (99.77%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS
Sbjct: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
           NGISAMTNHGNIIIDSGTTLTLLPRSLYYGV STLARVIKAKRVDDPSGILELCYSAGQV
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSFEPKLCA
Sbjct: 421 LGNKRLSFEPKLCA 434

BLAST of CSPI04G06740 vs. NCBI nr
Match: KAA0044968.1 (putative aspartic protease [Cucumis melo var. makuwa])

HSP 1 Score: 792.7 bits (2046), Expect = 1.6e-225
Identity = 397/434 (91.47%), Postives = 410/434 (94.47%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           M  ISIFFYFLLFFSSK TAHGGGHHGFTTSLF RDS LSPLHNPSLSRYDSL+++FRRS
Sbjct: 1   MPVISIFFYFLLFFSSKATAHGGGHHGFTTSLFHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLL HLTSVSTACIRSPIIPDSGEFLMSIFIGTP VN IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSC+SDTCRSLES HCG DL+SCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASD+ITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQM TIAGVKP+FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSN NITG ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVG KRFKAA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
             +SAMTN GNIIIDSGTTLTLLPRSLY GV+STLARVIKAKRVDDPSGILELCYSAGQ+
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKAKRVDDPSGILELCYSAGQL 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           +DLNIPIITAHF+G ADVKLLPVNTFAPVADNV CLT APAT VAIFGNLAQINFEVGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSF+P  CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of CSPI04G06740 vs. NCBI nr
Match: XP_008452150.1 (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo] >TYK16499.1 putative aspartic protease [Cucumis melo var. makuwa])

HSP 1 Score: 791.6 bits (2043), Expect = 3.5e-225
Identity = 396/434 (91.24%), Postives = 410/434 (94.47%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           M AISIFFYFLLFFSSK TAHGGGHHGFTTSL+ RDS LSPLHNPSLSRYDSL+++FRRS
Sbjct: 1   MPAISIFFYFLLFFSSKATAHGGGHHGFTTSLYHRDSLLSPLHNPSLSRYDSLVESFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLL HLTSVSTACIRSPIIPDSGEFLMSIFIGTP VN IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSC+SDTCRSLES HCG DL+SCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASD+ITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQM TIAGVKP+FS
Sbjct: 181 LASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKPQFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSN NITG ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVG KRFKAA
Sbjct: 241 YCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
             +SAMTN GNIIIDSGTTLTLLPRSLY GV+STLARVIK KRVDDPSGILELCYSAGQ+
Sbjct: 301 KDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKTKRVDDPSGILELCYSAGQL 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           +DLNIPIITAHF+G ADVKLLPVNTFAPVADNV CLT APAT VAIFGNLAQINFEVGYD
Sbjct: 361 EDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSF+P  CA
Sbjct: 421 LGNKRLSFKPTRCA 434

BLAST of CSPI04G06740 vs. NCBI nr
Match: XP_038889220.1 (aspartic proteinase CDR1-like [Benincasa hispida])

HSP 1 Score: 686.8 bits (1771), Expect = 1.2e-193
Identity = 343/433 (79.21%), Postives = 382/433 (88.22%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           MAAISIFFYFLLF  ++ T + GG +GFTTSLF RDS LSPLHN SLS +D   +AFRRS
Sbjct: 1   MAAISIFFYFLLFSFAEATTNRGGGNGFTTSLFHRDSLLSPLHNSSLSCHDRRTNAFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLL+H+ +VSTACI SPIIP+SGEFLMS+ IGTPPV+ IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLSHVNAVSTACIHSPIIPNSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           C++CFNQS P+FNPRRSSSYR VSC SDTCRSL+SYHCG DLQ+CSYGYSYGDRSFTYGD
Sbjct: 121 CQKCFNQSHPMFNPRRSSSYRNVSCTSDTCRSLDSYHCGTDLQTCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASD+ITI SFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGG+LSLVSQM TIA +K +FS
Sbjct: 181 LASDKITIESFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFS+ NITG ISFG+ AVVSG +V+STPLV RSPDTFYFLTLEAISV  KR KAA
Sbjct: 241 YCLPTFFSDENITGKISFGQNAVVSGPKVISTPLVSRSPDTFYFLTLEAISVANKRLKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQV 360
           +  SA+T  GNIIIDSGTTLT LPR+LY  ++STL  VIKAKRVDDPSGILELCY+AG  
Sbjct: 301 DDTSALTRRGNIIIDSGTTLTFLPRNLYEDLVSTLVSVIKAKRVDDPSGILELCYAAGGG 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           DDL+IP+I AHFAGGADVKLLP+NTFA VA+NVTCLT APA+ +AIFGNLAQINF VGYD
Sbjct: 361 DDLHIPVIIAHFAGGADVKLLPLNTFALVAENVTCLTLAPASDLAIFGNLAQINFIVGYD 420

Query: 421 LGNKRLSFEPKLC 434
           L NKRLSF+P +C
Sbjct: 421 LENKRLSFKPTVC 433

BLAST of CSPI04G06740 vs. NCBI nr
Match: XP_023543528.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 630.2 bits (1624), Expect = 1.3e-176
Identity = 314/437 (71.85%), Postives = 360/437 (82.38%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGG---GHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAF 60
           MAAISIFF F L   S+ T HGG   G HGFTTSLF RDS LSPL+NPSLS YD L +AF
Sbjct: 1   MAAISIFFCFFLISFSQATVHGGVGDGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60

Query: 61  RRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQ 120
           RRSFSRS TLL    +VST  I S IIPD GEFLMSI IGTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSDTLLNRAAAVSTTGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120

Query: 121 CLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFT 180
           C+PC +CFNQS PIFNPRRS SYR VSC S+ CRSL+ Y CGPD ++CSYGYSYGD+SFT
Sbjct: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180

Query: 181 YGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKP 240
           YGDLAS++IT+GSFKL KTVIGCGH NGGTF G TSGIIGLGGG LSL+SQMR IA VK 
Sbjct: 181 YGDLASEKITVGSFKLYKTVIGCGHVNGGTFSGDTSGIIGLGGGPLSLISQMRKIAAVKR 240

Query: 241 RFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRF 300
           RFSYCLPTFFS+ N+TG ISFG+KA+VSGR+V+STPLV + P+TFY++TL+A+SV  KRF
Sbjct: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVISTPLVLKEPNTFYYVTLKAMSVANKRF 300

Query: 301 KAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSA 360
           KAAN +SA    GNI+IDSGTTLT+LP +LY GV STLA V+KAKRV+DP+G+L+LC++ 
Sbjct: 301 KAANNMSAAVERGNILIDSGTTLTILPPNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAT 360

Query: 361 GQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEV 420
             VD LNIP+ITAHFAGGADVKLLP+NTFA VADNV CL F P+   AIFGNLAQ+NF V
Sbjct: 361 RSVDHLNIPVITAHFAGGADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420

Query: 421 GYDLGNKRLSFEPKLCA 435
           GYDL  KRLSF+  +CA
Sbjct: 421 GYDLERKRLSFKYNVCA 437

BLAST of CSPI04G06740 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 381.3 bits (978), Expect = 1.0e-105
Identity = 214/439 (48.75%), Postives = 275/439 (62.64%), Query Frame = 0

Query: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           MA++       L   S V A+     GFT  L  RDSP SP +N + +    + +A RR 
Sbjct: 1   MASLIFATLLSLLLLSNVNAY--PKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRR- 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
            S  +TL       S    +S I  + GE+LM+I IGTPPV ++AIADTGSDL WTQC P
Sbjct: 61  -SARSTLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           C +C+ Q+ P+F+P+ SS+YRKVSC+S  CR+LE   C  D  +CSY  +YGD S+T GD
Sbjct: 121 CEDCYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGD 180

Query: 181 LASDQITIGS-----FKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGV 240
           +A D +T+GS       L   +IGCGH+N GTF    SGIIGLGGGS SLVSQ+R    +
Sbjct: 181 VAVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLR--KSI 240

Query: 241 KPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKK 300
             +FSYCL  F S   +T  I+FG   +VSG  VVST +V + P T+YFL LEAISVG K
Sbjct: 241 NGKFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSK 300

Query: 301 RFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCY 360
           + +  + I   T  GNI+IDSGTTLTLLP + YY + S +A  IKA+RV DP GIL LCY
Sbjct: 301 KIQFTSTIFG-TGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCY 360

Query: 361 SAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINF 420
                    +P IT HF GG DVKL  +NTF  V+++V+C  FA   Q+ IFGNLAQ+NF
Sbjct: 361 R--DSSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAFAANEQLTIFGNLAQMNF 420

Query: 421 EVGYDLGNKRLSFEPKLCA 435
            VGYD  +  +SF+   C+
Sbjct: 421 LVGYDTVSGTVSFKKTDCS 429

BLAST of CSPI04G06740 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 353.6 bits (906), Expect = 2.3e-97
Identity = 207/453 (45.70%), Postives = 278/453 (61.37%), Query Frame = 0

Query: 3   AISIFFYFLLFFSSKVTAHGGGH-HGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSF 62
           A  I   F LFFS  VT    GH   F+  L  RDSPLSP++NP ++  D L  AF RS 
Sbjct: 2   ATQILLCFFLFFS--VTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSV 61

Query: 63  SRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPC 122
           SRS      L+      ++S +I   GEF MSI IGTPP+ V AIADTGSDLTW QC PC
Sbjct: 62  SRSRRFNHQLSQTD---LQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC 121

Query: 123 RECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQS--CSYGYSYGDRSFTYG 182
           ++C+ ++ PIF+ ++SS+Y+   C S  C++L S   G D  +  C Y YSYGD+SF+ G
Sbjct: 122 QQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKG 181

Query: 183 DLASDQITIGS-----FKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAG 242
           D+A++ ++I S        P TV GCG+ NGGTF    SGIIGLGGG LSL+SQ+   + 
Sbjct: 182 DVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SS 241

Query: 243 VKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQ----VVSTPLVPRSPDTFYFLTLEAI 302
           +  +FSYCL    +  N T  I+ G  ++ S       VVSTPLV + P T+Y+LTLEAI
Sbjct: 242 ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAI 301

Query: 303 SVGKKRFKAA--------NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVLSTLAR-VIKA 362
           SVGKK+            +GI + T+ GNIIIDSGTTLTLL    +    S +   V  A
Sbjct: 302 SVGKKKIPYTGSSYNPNDDGILSETS-GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGA 361

Query: 363 KRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPA 422
           KRV DP G+L  C+ +G   ++ +P IT HF  GADV+L P+N F  +++++ CL+  P 
Sbjct: 362 KRVSDPQGLLSHCFKSGSA-EIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPT 421

Query: 423 TQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           T+VAI+GN AQ++F VGYDL  + +SF+   C+
Sbjct: 422 TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of CSPI04G06740 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 349.4 bits (895), Expect = 4.3e-96
Identity = 192/416 (46.15%), Postives = 264/416 (63.46%), Query Frame = 0

Query: 27  GFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSV-STACIRSPIIP 86
           GFT  L  RDSP SP +NP  +    L +A  RS +R    + H T   +T   +  +  
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR----VFHFTEKDNTPQPQIDLTS 89

Query: 87  DSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSC 146
           +SGE+LM++ IGTPP  ++AIADTGSDL WTQC PC +C+ Q  P+F+P+ SS+Y+ VSC
Sbjct: 90  NSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSC 149

Query: 147 ASDTCRSLESY-HCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIG 206
           +S  C +LE+   C  +  +CSY  SYGD S+T G++A D +T+GS      +L   +IG
Sbjct: 150 SSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIG 209

Query: 207 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFG 266
           CGH N GTF    SGI+GLGGG +SL+ Q+     +  +FSYCL    S  + T  I+FG
Sbjct: 210 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFG 269

Query: 267 RKAVVSGRQVVSTPLVPR-SPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGT 326
             A+VSG  VVSTPL+ + S +TFY+LTL++ISVG K+ +  +G  + ++ GNIIIDSGT
Sbjct: 270 TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQ-YSGSDSESSEGNIIIDSGT 329

Query: 327 TLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADV 386
           TLTLLP   Y  +   +A  I A++  DP   L LCYSA    DL +P+IT HF  GADV
Sbjct: 330 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHF-DGADV 389

Query: 387 KLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           KL   N F  V++++ C  F  +   +I+GN+AQ+NF VGYD  +K +SF+P  CA
Sbjct: 390 KLDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CSPI04G06740 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 342.8 bits (878), Expect = 4.0e-94
Identity = 197/440 (44.77%), Postives = 262/440 (59.55%), Query Frame = 0

Query: 13  FFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLT 72
           FF+S  +A+       T  L  RDSP SPL+NP  +  D L  AF RS SRS    T   
Sbjct: 17  FFASNSSAN---RENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRRFTTKTD 76

Query: 73  SVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIF 132
                 ++S +I + GE+ MSI IGTPP  V AIADTGSDLTW QC PC++C+ Q+ P+F
Sbjct: 77  ------LQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLF 136

Query: 133 NPRRSSSYRKVSCASDTCRSLESYH--CGPDLQSCSYGYSYGDRSFTYGDLASDQITI-- 192
           + ++SS+Y+  SC S TC++L  +   C      C Y YSYGD SFT GD+A++ I+I  
Sbjct: 137 DKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDS 196

Query: 193 ---GSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPT 252
               S   P TV GCG+ NGGTF    SGIIGLGGG LSLVSQ+ +  G K  FSYCL  
Sbjct: 197 SSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKK--FSYCLSH 256

Query: 253 FFSNANITGTISFGRKAVVSG----RQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAAN 312
             +  N T  I+ G  ++ S        ++TPL+ + P+T+YFLTLEA++VGK +     
Sbjct: 257 TAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTG 316

Query: 313 GISAMTNH-----GNIIIDSGTTLTLLPRSLY--YGVLSTLARVIKAKRVDDPSGILELC 372
           G   +        GNIIIDSGTTLTLL    Y  +G  +    V  AKRV DP G+L  C
Sbjct: 317 GGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGT-AVEESVTGAKRVSDPQGLLTHC 376

Query: 373 YSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQIN 432
           + +G   ++ +P IT HF   ADVKL P+N F  + ++  CL+  P T+VAI+GN+ Q++
Sbjct: 377 FKSGD-KEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTTEVAIYGNMVQMD 436

Query: 433 FEVGYDLGNKRLSFEPKLCA 435
           F VGYDL  K +SF+   C+
Sbjct: 437 FLVGYDLETKTVSFQRMDCS 442

BLAST of CSPI04G06740 vs. TAIR 10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 228.8 bits (582), Expect = 8.4e-60
Identity = 146/359 (40.67%), Postives = 199/359 (55.43%), Query Frame = 0

Query: 86  DSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSC 145
           D+  +LM + +GTPP  + AI DTGS++TWTQCLPC  C+ Q+ PIF+P +SS++++  C
Sbjct: 61  DNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRC 120

Query: 146 ASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIGC 205
                          D  SC Y   Y D ++T G LA++ IT+ S     F +P+T+IGC
Sbjct: 121 ---------------DGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGC 180

Query: 206 GHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKP-RFSYCLPTFFSNANITGTISFG 265
           GH N   F    SG++GL  G  SL++QM    G  P   SYC    FS    T  I+FG
Sbjct: 181 GH-NNSWFKPSFSGMVGLNWGPSSLITQM---GGEYPGLMSYC----FSGQG-TSKINFG 240

Query: 266 RKAVVSGRQVVSTPL-VPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGT 325
             A+V+G  VVST + +  +   FY+L L+A+SVG  R +   G +     GNI+IDSGT
Sbjct: 241 ANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETM-GTTFHALEGNIVIDSGT 300

Query: 326 TLTLLPRSLYYGVLSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADV 385
           TLT  P S    V   +  V+ A R  DP+G   LCY++  +D    P+IT HF+GG D+
Sbjct: 301 TLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTID--IFPVITMHFSGGVDL 360

Query: 386 KLLPVNTFAPVAD-NVTCLTFA--PATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
            L   N +    +  V CL       TQ AIFGN AQ NF VGYD  +  +SF P  C+
Sbjct: 361 VLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3EBM53.2e-9645.70Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q6XBF86.0e-9546.15Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q766C35.0e-5735.14Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C21.4e-5636.55Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LNJ33.9e-5436.14Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A0A0KZZ31.9e-24899.77Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G05541... [more]
A0A5A7TPZ57.6e-22691.47Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff... [more]
A0A5D3D1Z71.7e-22591.24Putative aspartic protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... [more]
A0A1S3BT751.7e-22591.24probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=... [more]
A0A6J1FP392.7e-17571.85aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111447216 PE=3... [more]
Match NameE-valueIdentityDescription
XP_004149004.13.8e-24899.77probable aspartic protease At2g35615 [Cucumis sativus] >KGN53446.1 hypothetical ... [more]
KAA0044968.11.6e-22591.47putative aspartic protease [Cucumis melo var. makuwa][more]
XP_008452150.13.5e-22591.24PREDICTED: probable aspartic protease At2g35615 [Cucumis melo] >TYK16499.1 putat... [more]
XP_038889220.11.2e-19379.21aspartic proteinase CDR1-like [Benincasa hispida][more]
XP_023543528.11.3e-17671.85aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
AT1G64830.11.0e-10548.75Eukaryotic aspartyl protease family protein [more]
AT2G35615.12.3e-9745.70Eukaryotic aspartyl protease family protein [more]
AT5G33340.14.3e-9646.15Eukaryotic aspartyl protease family protein [more]
AT1G31450.14.0e-9444.77Eukaryotic aspartyl protease family protein [more]
AT2G28010.18.4e-6040.67Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 90..259
e-value: 1.6E-51
score: 175.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 71..257
e-value: 1.1E-49
score: 171.1
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 258..434
e-value: 2.8E-45
score: 156.2
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 85..433
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 282..429
e-value: 1.7E-25
score: 89.7
NoneNo IPR availablePANTHERPTHR47967:SF66ASPARTIC PROTEINASE CDR1-RELATEDcoord: 6..433
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 6..433
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 312..323
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 90..429
score: 42.865295
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 89..433
e-value: 4.12508E-67
score: 213.279

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G06740.1CSPI04G06740.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity