CSPI01G34590 (gene) Wild cucumber (PI 183967)

Name: CSPI01G34590
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Eukaryotic aspartyl protease family protein
Location: Chr1 : 29547554 .. 29549104 (+)
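
The location field can be turned into a feature length with a few lines of Python. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the page does not state its convention).

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr1 : 29547554 .. 29549104 (+)'."""
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: both ends are inclusive, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 29547554 .. 29549104 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr1 + 1551
```

Under that assumption the feature spans 1551 bp, which is a multiple of three (517 codons).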
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTTCGATATCATCATCAATCTCCACTGCAACAAAATTCTTGAGCCTATTCCTTCTTCTTGTACATGTCTCAACACAAACCCTAGCAACAAACCCTAAAACCAATTTCCCCAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACATCCCTCCTTACACCCAAAAAAGGCTATAATTTCATTTCAAAGAAGAGAATGAAGGCAATGGATCAGACGGATGGTGATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTAATGTCCTTATCAATAGGGACACCCCCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGATCTCACGTGGGTTCCTTGTGGTAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATATTTCTGGCCCAAGATTGGCCGCCTTTTTGCCTACTCATTCTTCTACTTCTATTCGAGACACGTGTGGTAGTTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCTTAGCTAGCCTTGTGAAGGGCACTTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGAGCAAGTGGGGTTGTAACTGGAAGTTTAACAAGAGATGTTCTTTTTACGCATGGAAATTATAATAACAATAATAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGAGCAACTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGAAGAGGCTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTTTTACCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAATCTTGCCATTTCTTCAAAAGATGAAAATTTGCAATTTACCCCTTTGTTGAAAAGTCCAATGTACCCTAACTATTACTATATTGGGCTTGAGTCAATTACCATTGGGAATGGGGATAATAATTTTAGATTTGGGGTTTCTTTTAAATTGAGAGAGATTGATACAAAGGGTAATGGAGGGATGTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAACTTATTTCTAATCTTGAATTAGTGATAGGTTATCCAAGAGCCAAACAAGTTGAACTCAATACTGGGTTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTCTTCTTTTGTTGATGACGCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTCAGTGTTGTTTTGCCTCAAGGCAATAACTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGGTTAAATGTTTGTTGTATCAAAGCATGGACGGTGTTGGTGACGATAACGACAGTGACGACAATGGGCCGGCGGGTATTTTCGGAAGCTTTCAACAGCAAAATATAGAGGTCGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGTTTCTGTTGCTGCCAAACAGGGACTTCACAAGAATGTTAGAAGGAATGAAAGTTGA

mRNA sequence

ATGCCTTCGATATCATCATCAATCTCCACTGCAACAAAATTCTTGAGCCTATTCCTTCTTCTTGTACATGTCTCAACACAAACCCTAGCAACAAACCCTAAAACCAATTTCCCCAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACATCCCTCCTTACACCCAAAAAAGGCTATAATTTCATTTCAAAGAAGAGAATGAAGGCAATGGATCAGACGGATGGTGATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTAATGTCCTTATCAATAGGGACACCCCCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGATCTCACGTGGGTTCCTTGTGGTAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATATTTCTGGCCCAAGATTGGCCGCCTTTTTGCCTACTCATTCTTCTACTTCTATTCGAGACACGTGTGGTAGTTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCTTAGCTAGCCTTGTGAAGGGCACTTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGAGCAAGTGGGGTTGTAACTGGAAGTTTAACAAGAGATGTTCTTTTTACGCATGGAAATTATAATAACAATAATAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGAGCAACTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGAAGAGGCTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTTTTACCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAATCTTGCCATTTCTTCAAAAGATGAAAATTTGCAATTTACCCCTTTGTTGAAAAGTCCAATGTACCCTAACTATTACTATATTGGGCTTGAGTCAATTACCATTGGGAATGGGGATAATAATTTTAGATTTGGGGTTTCTTTTAAATTGAGAGAGATTGATACAAAGGGTAATGGAGGGATGTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAACTTATTTCTAATCTTGAATTAGTGATAGGTTATCCAAGAGCCAAACAAGTTGAACTCAATACTGGGTTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTCTTCTTTTGTTGATGACGCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTCAGTGTTGTTTTGCCTCAAGGCAATAACTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGGTTAAATGTTTGTTGTATCAAAGCATGGACGGTGTTGGTGACGATAACGACAGTGACGACAATGGGCCGGCGGGTATTTTCGGAAGCTTTCAACAGCAAAATATAGAGGTCGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGTTTCTGTTGCTGCCAAACAGGGACTTCACAAGAATGTTAGAAGGAATGAAAGTTGA

Coding sequence (CDS)

ATGCCTTCGATATCATCATCAATCTCCACTGCAACAAAATTCTTGAGCCTATTCCTTCTTCTTGTACATGTCTCAACACAAACCCTAGCAACAAACCCTAAAACCAATTTCCCCAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACATCCCTCCTTACACCCAAAAAAGGCTATAATTTCATTTCAAAGAAGAGAATGAAGGCAATGGATCAGACGGATGGTGATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTAATGTCCTTATCAATAGGGACACCCCCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGATCTCACGTGGGTTCCTTGTGGTAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATATTTCTGGCCCAAGATTGGCCGCCTTTTTGCCTACTCATTCTTCTACTTCTATTCGAGACACGTGTGGTAGTTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCTTAGCTAGCCTTGTGAAGGGCACTTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGAGCAAGTGGGGTTGTAACTGGAAGTTTAACAAGAGATGTTCTTTTTACGCATGGAAATTATAATAACAATAATAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGAGCAACTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGAAGAGGCTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTTTTACCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAATCTTGCCATTTCTTCAAAAGATGAAAATTTGCAATTTACCCCTTTGTTGAAAAGTCCAATGTACCCTAACTATTACTATATTGGGCTTGAGTCAATTACCATTGGGAATGGGGATAATAATTTTAGATTTGGGGTTTCTTTTAAATTGAGAGAGATTGATACAAAGGGTAATGGAGGGATGTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAACTTATTTCTAATCTTGAATTAGTGATAGGTTATCCAAGAGCCAAACAAGTTGAACTCAATACTGGGTTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTCTTCTTTTGTTGATGACGCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTCAGTGTTGTTTTGCCTCAAGGCAATAACTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGGTTAAATGTTTGTTGTATCAAAGCATGGACGGTGTTGGTGACGATAACGACAGTGACGACAATGGGCCGGCGGGTATTTTCGGAAGCTTTCAACAGCAAAATATAGAGGTCGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGTTTCTGTTGCTGCCAAACAGGGACTTCACAAGAATGTTAGAAGGAATGAAAGTTGA
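
As a quick consistency check, the start of the CDS can be translated and compared with the query protein shown in the BLAST alignments. The following is a minimal sketch, not part of the database record: `translate` is a hypothetical helper, and it assumes the standard genetic code with translation starting in frame 1.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 27 nt and last 9 nt of the CDS listed above.
print(translate("ATGCCTTCGATATCATCATCAATCTCC"))  # MPSISSSIS
print(translate("GAAAGTTGA"))                    # ES*
```

The translated start (`MPSISSSIS`) and end (`ES` followed by a stop) agree with the first and last residues of the query in the TrEMBL self-hit alignment.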
BLAST of CSPI01G34590 vs. Swiss-Prot
Match: ASP63_ARATH (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 5.9e-54
Identity = 150/444 (33.78%), Positives = 204/444 (45.95%), Query Frame = 1

Query: 91  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 150
           YL+SLS+G+    V +Y+DTGSDL W PC    F C  CE      S P       + SS
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPCR--PFTCILCESKPLPPSPPS------SLSS 142

Query: 151 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPR---PCPSFAYTYGASGVV 210
           ++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 SATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 211 TGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGF--SH 270
              L  D L      +  +  +  F FGC   T  EPIG+AGFGRG LSLP QL     H
Sbjct: 203 VAKLYSDSL------SLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPH 262

Query: 271 KG--FSHCFLPFKF-SNNPNFSSPLILGNLA------------------ISSKDENLQFT 330
            G  FS+C +   F S+     SPLILG                        K     FT
Sbjct: 263 LGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFT 322

Query: 331 PLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 390
            +L++P +P +Y + L+ I+IG  +          LR ID  G GG+++DSGTT+T LP 
Sbjct: 323 EMLENPKHPYFYSVSLQGISIGKRN----IPAPAMLRRIDKNGGGGVVVDSGTTFTMLPA 382

Query: 391 PLYSQLISNLELVIG--YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFL- 450
             Y+ ++   +  +G  + RA +VE ++G   CY +             ++P++  HF  
Sbjct: 383 KFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN---------QTVKVPALVLHFAG 442

Query: 451 NNVSVVLPQGNNFYAMA----APINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ 502
           N  SV LP+ N FY              + CL+  +    G D      G   I G++QQ
Sbjct: 443 NRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMN----GGDESELRGGTGAILGNYQQ 494
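
The percentages in each HSP header are the match counts divided by the alignment length. A small sketch for recomputing them from a header line is given below; the function name `hsp_fractions` is hypothetical, and the regular expression assumes the `count/length (percent%)` layout used on this page.

```python
import re

def hsp_fractions(header):
    """Return (count, length, reported %, recomputed %) for each fraction in an HSP header."""
    results = []
    for count, length, reported in re.findall(r"(\d+)/(\d+) \(([\d.]+)%\)", header):
        count, length = int(count), int(length)
        recomputed = 100.0 * count / length
        results.append((count, length, float(reported), recomputed))
    return results

header = "Identity = 150/444 (33.78%), 204/444 (45.95%)"
for count, length, reported, recomputed in hsp_fractions(header):
    print(f"{count}/{length}: reported {reported:.2f}%, recomputed {recomputed:.2f}%")
```

The same function can be applied to the header of any hit in this report, since it relies only on the fraction pattern and not on the field labels.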

BLAST of CSPI01G34590 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 3.6e-35
Identity = 131/439 (29.84%), Positives = 183/439 (41.69%), Query Frame = 1

Query: 65  KKRMKAMDQTDGDDNVIEPLREIRDG-YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLS 124
           ++RM++++      + IE      DG YLM+++IGTP       MDTGSDL W  C    
Sbjct: 69  ERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE--- 128

Query: 125 FDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA 184
             C  C      I       F P  SS+     C S +C D+     P + C    C   
Sbjct: 129 -PCTQCFSQPTPI-------FNPQDSSSFSTLPCESQYCQDL-----PSETCNNNECQ-- 188

Query: 185 SLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGC----VGA 244
                          YTYG      GS T+  + T   +      +P   FGC     G 
Sbjct: 189 ---------------YTYGYGD---GSTTQGYMATE-TFTFETSSVPNIAFGCGEDNQGF 248

Query: 245 TYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDE 304
                 G+ G G G LSLP QLG     FS+C   +  S+     S L LG+ A S   E
Sbjct: 249 GQGNGAGLIGMGWGPLSLPSQLGVGQ--FSYCMTSYGSSS----PSTLALGSAA-SGVPE 308

Query: 305 NLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTY 364
               T L+ S + P YYYI L+ IT+G GDN    G+     ++   G GGM+IDSGTT 
Sbjct: 309 GSPSTTLIHSSLNPTYYYITLQGITVG-GDN---LGIPSSTFQLQDDGTGGMIIDSGTTL 368

Query: 365 THLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFH 424
           T+LP+  Y+ +       I  P     E ++G   C++ P   +        Q+P I+  
Sbjct: 369 TYLPQDAYNAVAQAFTDQINLPTVD--ESSSGLSTCFQQPSDGST------VQVPEISMQ 428

Query: 425 FLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQN 484
           F   V  +  Q      + +P    +  CL   S   +G            IFG+ QQQ 
Sbjct: 429 FDGGVLNLGEQN----ILISPAEGVI--CLAMGSSSQLG----------ISIFGNIQQQE 435

Query: 485 IEVVYDLEKERLGFQPMDC 499
            +V+YDL+   + F P  C
Sbjct: 489 TQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI01G34590 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 4.4e-33
Identity = 122/430 (28.37%), Positives = 183/430 (42.56%), Query Frame = 1

Query: 79  NVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISG 138
           +V+  L +    Y   L +GTP + V + +DTGSD+ W+ C      C+ C    + I  
Sbjct: 130 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA----PCRRCYSQSDPIFD 189

Query: 139 PRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFA 198
           PR        S T     C S  C  + S          AGC+     + TC      + 
Sbjct: 190 PR-------KSKTYATIPCSSPHCRRLDS----------AGCNTR---RKTC-----LYQ 249

Query: 199 YTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGC--------VGATYREPIGIAGF 258
            +YG      G  + + L    N      ++     GC        VGA      G+ G 
Sbjct: 250 VSYGDGSFTVGDFSTETLTFRRN------RVKGVALGCGHDNEGLFVGAA-----GLLGL 309

Query: 259 GRGLLSLPFQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLK 318
           G+G LS P Q G  F+ K FS+C +    S+ P   S ++ GN A+S      +FTPLL 
Sbjct: 310 GKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP---SSVVFGNAAVS---RIARFTPLLS 369

Query: 319 SPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYS 378
           +P    +YY+GL  I++G        GV+  L ++D  GNGG++IDSGT+ T L  P Y 
Sbjct: 370 NPKLDTFYYVGLLGISVGGTRVP---GVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYI 429

Query: 379 QLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVL 438
            +       +G    K+    + FD C+ +       S +++ ++P++  HF     V L
Sbjct: 430 AMRDAFR--VGAKTLKRAPDFSLFDTCFDL-------SNMNEVKVPTVVLHF-RGADVSL 484

Query: 439 PQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEK 498
           P  N       P+++    C  +           +   G   I G+ QQQ   VVYDL  
Sbjct: 490 PATN----YLIPVDTNGKFCFAF-----------AGTMGGLSIIGNIQQQGFRVVYDLAS 484

BLAST of CSPI01G34590 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 1.1e-31
Identity = 127/415 (30.60%), Positives = 171/415 (41.20%), Query Frame = 1

Query: 91  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 150
           YLM+LSIGTP Q     MDTGSDL W         CQ C +  N  +      F P  SS
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWT-------QCQPCTQCFNQST----PIFNPQGSS 154

Query: 151 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 210
           +     C S  C  + S                     TC      + Y YG      GS
Sbjct: 155 SFSTLPCSSQLCQALSSP--------------------TCSNNFCQYTYGYGDGSETQGS 214

Query: 211 LTRDVLFTHGNYNNNNKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPFQLGFSHK 270
           +  + L T G+ +     IP   FGC     G       G+ G GRG LSLP QL  +  
Sbjct: 215 MGTETL-TFGSVS-----IPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTK- 274

Query: 271 GFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESITIG 330
            FS+C  P   S   N    L+LG+LA +S       T L++S   P +YYI L  +++G
Sbjct: 275 -FSYCMTPIGSSTPSN----LLLGSLA-NSVTAGSPNTTLIQSSQIPTFYYITLNGLSVG 334

Query: 331 NGDNNFRFGV---SFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRA 390
               + R  +   +F L      G GG++IDSGTT T+     Y  +       I  P  
Sbjct: 335 ----STRLPIDPSAFALN--SNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVV 394

Query: 391 KQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINS 450
                ++GFDLC++ P   +N       Q+P+   HF +   + LP  N F    +P N 
Sbjct: 395 N--GSSSGFDLCFQTPSDPSN------LQIPTFVMHF-DGGDLELPSENYF---ISPSNG 434

Query: 451 TVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 499
            +  CL   S            +    IFG+ QQQN+ VVYD     + F    C
Sbjct: 455 LI--CLAMGS-----------SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI01G34590 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 3.5e-30
Identity = 123/432 (28.47%), Positives = 182/432 (42.13%), Query Frame = 1

Query: 78  DNVIEPLREIRDG---YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQN 137
           DN  +P  ++      YLM++SIGTPP  +    DTGSDL W  C      C DC    +
Sbjct: 74  DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCA----PCDDCYTQVD 133

Query: 138 NISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPC 197
            +       F P  SST    +C SS C  + +          A CS       TC    
Sbjct: 134 PL-------FDPKTSSTYKDVSCSSSQCTALENQ---------ASCSTND---NTC---- 193

Query: 198 PSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGC----VGATYREPIGIAGF 257
            S++ +YG +    G++  D L T G+ +    Q+     GC     G   ++  GI G 
Sbjct: 194 -SYSLSYGDNSYTKGNIAVDTL-TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGL 253

Query: 258 GRGLLSLPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKS 317
           G G +SL  QLG S  G FS+C +P   ++  + +S +  G  AI S    +  TPL+  
Sbjct: 254 GGGPVSLIKQLGDSIDGKFSYCLVP--LTSKKDQTSKINFGTNAIVS-GSGVVSTPLIAK 313

Query: 318 PMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQ 377
                +YY+ L+SI++G+    +    S           G ++IDSGTT T LP   YS+
Sbjct: 314 ASQETFYYLTLKSISVGSKQIQYSGSDS-------ESSEGNIIIDSGTTLTLLPTEFYSE 373

Query: 378 LISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLP 437
           L   +   I     K+ +  +G  LCY         S   D ++P IT HF +   V L 
Sbjct: 374 LEDAVASSI--DAEKKQDPQSGLSLCY---------SATGDLKVPVITMHF-DGADVKLD 433

Query: 438 QGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKE 497
             N F  +     S  + C  ++                  I+G+  Q N  V YD   +
Sbjct: 434 SSNAFVQV-----SEDLVCFAFRGSPSF------------SIYGNVAQMNFLVGYDTVSK 437

Query: 498 RLGFQPMDCVSV 502
            + F+P DC  +
Sbjct: 494 TVSFKPTDCAKM 437

BLAST of CSPI01G34590 vs. TrEMBL
Match: A0A0A0LYP0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G704590 PE=3 SV=1)

HSP 1 Score: 1051.6 bits (2718), Expect = 3.1e-304
Identity = 515/519 (99.23%), Positives = 515/519 (99.23%), Query Frame = 1

Query: 1   MPSISSSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY 60
           MPSISS ISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY
Sbjct: 1   MPSISS-ISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120
           NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNY---NNNNKQIPRFCFGCV 240
           SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNY   NNNNKQIPRFCFGCV
Sbjct: 181 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCV 240

Query: 241 GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300
           GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
Sbjct: 241 GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300

Query: 301 DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT 360
           DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT
Sbjct: 301 DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT 360

Query: 361 TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT 420
           TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT
Sbjct: 361 TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT 420

Query: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ 480
           FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ
Sbjct: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ 480

Query: 481 QNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 517
           QNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES
Sbjct: 481 QNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 518

BLAST of CSPI01G34590 vs. TrEMBL
Match: V4TN99_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10017863mg PE=3 SV=1)

HSP 1 Score: 652.1 bits (1681), Expect = 5.6e-184
Identity = 331/510 (64.90%), Positives = 398/510 (78.04%), Query Frame = 1

Query: 4   ISSSISTATKFLSLFLLLVHVST-QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNF 63
           ++ + S     + LFLL + ++  QTLAT  + N  K SLVLGL +SR SLL P    + 
Sbjct: 1   MAKAYSNIATIILLFLLSMSLTFHQTLAT--QKNNGKHSLVLGLTNSRASLLIPSASKSS 60

Query: 64  ISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNL 123
           I KK  + +D       ++EPLRE+RDGYL+SL+IGTP QV+QVYMDTGSDLTWVPCGNL
Sbjct: 61  I-KKPSETLD-------MMEPLREVRDGYLISLNIGTPTQVIQVYMDTGSDLTWVPCGNL 120

Query: 124 SFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSL 183
           SFDC DC++Y+NN     ++ F P+ SS+S RDTC SSFC++IHSSDNPFDPCT++GCSL
Sbjct: 121 SFDCVDCDDYRNN---KLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSL 180

Query: 184 ASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYR 243
           ++L+K TC RPCPSFAYTYG  G+VTG LTRD L  HG+     ++IP+FCFGCVG+TYR
Sbjct: 181 STLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYR 240

Query: 244 EPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQ 303
           EPIGIAGFGRG LS+P QLGF  KGFSHCFL FK++N+PN SSPL+LG++AISSKD NLQ
Sbjct: 241 EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVLGDVAISSKD-NLQ 300

Query: 304 FTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHL 363
           FTP+LKSPMYPNYYYIGLE+ITIGN        V   LRE D++GNGG+L+DSGTTYTHL
Sbjct: 301 FTPMLKSPMYPNYYYIGLEAITIGNSSLT---EVPLSLREFDSQGNGGLLVDSGTTYTHL 360

Query: 364 PEPLYSQLISNLELVIG-YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFL 423
           PEP YSQL+S L+  I  YPRAK+VE  TGFDLCY+VPC NN  +F DD   PSITFHFL
Sbjct: 361 PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN--TFTDDL-FPSITFHFL 420

Query: 424 NNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIE 483
           NNVS+VLPQGN+FYAM+AP NS+ VKCLL+QSM       D  D GP+G+FGSFQQQN+E
Sbjct: 421 NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-------DDGDYGPSGVFGSFQQQNVE 480

Query: 484 VVYDLEKERLGFQPMDCVSVAAKQGLHKNV 512
           VVYDLEKER+GFQPMDC S A+ QGLHK +
Sbjct: 481 VVYDLEKERIGFQPMDCASTASAQGLHKKL 483

BLAST of CSPI01G34590 vs. TrEMBL
Match: K4B7R8_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=3 SV=1)

HSP 1 Score: 636.7 bits (1641), Expect = 2.4e-179
Identity = 320/505 (63.37%), Positives = 381/505 (75.45%), Query Frame = 1

Query: 7   SISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKK 66
           S+ST   F  L   L+H + Q  A   K +    SLVL L H++TSL  PK  YN + KK
Sbjct: 6   SLSTYFIFFFLSSALLHFN-QCYAKEKKPS--SYSLVLSLTHTKTSLTIPKSSYNLV-KK 65

Query: 67  RMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDC 126
             + +D       + EPLRE+RDGYL+SL+IGTPPQ++QVYMDTGSDLTWVPCGNLSFDC
Sbjct: 66  NSETLD-------IREPLREVRDGYLISLNIGTPPQIIQVYMDTGSDLTWVPCGNLSFDC 125

Query: 127 QDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLV 186
            DC++Y+++     +++F P+ SS+S RD C SS C+DIHSSDNPFD CTIAGCSL SL+
Sbjct: 126 IDCDDYRDH---KLMSSFSPSFSSSSYRDLCTSSSCIDIHSSDNPFDQCTIAGCSLNSLL 185

Query: 187 KGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNN--KQIPRFCFGCVGATYREP 246
           KGTC RPCPSFAYTYG  G+V+G+LTRD L  HG  +N N  +++P+F FGCVG TYREP
Sbjct: 186 KGTCSRPCPSFAYTYG-EGIVSGTLTRDTLRVHGTSSNPNSIREVPKFVFGCVGTTYREP 245

Query: 247 IGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFT 306
           IGI GFG+G LSLP QLGF  KGFSHCFLPFKF+NNPN SSPL++G+ AISSK EN QFT
Sbjct: 246 IGIVGFGKGPLSLPSQLGFLKKGFSHCFLPFKFANNPNISSPLVVGDQAISSK-ENFQFT 305

Query: 307 PLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 366
           P+LKSPMYPN+YYIGLE+IT+GNG       V   LRE D+ GNGGMLIDSGTTYTHLPE
Sbjct: 306 PMLKSPMYPNFYYIGLEAITVGNGATT---QVPLTLREFDSLGNGGMLIDSGTTYTHLPE 365

Query: 367 PLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNV 426
           P YS L++ L   I YPRA+ +E  TGFDLCY++PC NNN + +     PSITFHFLNNV
Sbjct: 366 PFYSSLLTALRSSINYPRAEDIEARTGFDLCYRLPCPNNNLNSLVTDDFPSITFHFLNNV 425

Query: 427 SVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVY 486
           S+ LP GN+FYAM AP NSTVVKCLL+QSM+G        + GPAGIFG+FQQQN+EVVY
Sbjct: 426 SLFLPNGNDFYAMGAPRNSTVVKCLLFQSMEG-------SEEGPAGIFGNFQQQNVEVVY 484

Query: 487 DLEKERLGFQPMDCVSVAAKQGLHK 510
           DLEKER+GFQ  DC S A  QGLHK
Sbjct: 486 DLEKERIGFQTTDCASAATSQGLHK 484

BLAST of CSPI01G34590 vs. TrEMBL
Match: M5X3K9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015155mg PE=3 SV=1)

HSP 1 Score: 635.2 bits (1637), Expect = 7.0e-179
Identity = 321/497 (64.59%), Positives = 376/497 (75.65%), Query Frame = 1

Query: 17  LFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKRMKAMDQTDG 76
           LF L + +      T  K      SLVLGL +S TSL  PK   N      +K M     
Sbjct: 11  LFFLAIAIFLNFNQTLAKHKPSSTSLVLGLTNSYTSLPIPKASAN------LKKMPSQVS 70

Query: 77  DDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNI 136
           D  ++EPLR +RDGYL+SL++GTPPQV+QVYMDTGSDLTWVPCGNLSF C DC++Y+NN 
Sbjct: 71  D--MMEPLRGVRDGYLISLNLGTPPQVIQVYMDTGSDLTWVPCGNLSFVCMDCDDYRNNR 130

Query: 137 SGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPS 196
             P    F P+ SS+S+RD CGSSFC+DIHSS+N  DPCTIAGCSL +L+K TCPRPCPS
Sbjct: 131 LMP---TFSPSASSSSLRDLCGSSFCLDIHSSENSIDPCTIAGCSLTTLLKATCPRPCPS 190

Query: 197 FAYTYGASGVVTGSLTRDVLFTHGNYNNNN----KQIPRFCFGCVGATYREPIGIAGFGR 256
           FAYTYG  GVVTG+L+RD L  HG  +  +    +++P+FCFGC+G+TYREPIGIAGFGR
Sbjct: 191 FAYTYGGGGVVTGTLSRDTLRVHGISSTPDNVVTREVPKFCFGCIGSTYREPIGIAGFGR 250

Query: 257 GLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMY 316
           G LSLP QLGF  KGFSHCFLPFK++NNPN SSPL++G++AISSK ENLQFTP+LKSPMY
Sbjct: 251 GSLSLPSQLGFLQKGFSHCFLPFKYANNPNISSPLVVGDVAISSK-ENLQFTPMLKSPMY 310

Query: 317 PNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLIS 376
           PN YYIGLE+ITIGN     +  +S  LRE D +GNGGMLIDSGTTYTHLPEPLYS L+S
Sbjct: 311 PNNYYIGLEAITIGNATAITQMPLS--LREFDAQGNGGMLIDSGTTYTHLPEPLYSNLLS 370

Query: 377 NLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGN 436
            L  VI YPRAK++E  T FDLCY VP   N  +   D   PSITFHFL NVS+VLPQGN
Sbjct: 371 LLHSVISYPRAKEMETKTSFDLCYVVPYTINTLTKPGDL-FPSITFHFLKNVSLVLPQGN 430

Query: 437 NFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLG 496
           +FYAM AP NSTVVKCLL+Q+MD        +D GPAG+FGSFQQQN+EVVYDLEKER+G
Sbjct: 431 HFYAMGAPANSTVVKCLLFQAMD-------DEDYGPAGVFGSFQQQNVEVVYDLEKERIG 485

Query: 497 FQPMDCVSVAAKQGLHK 510
           FQPMDC S +A QGLHK
Sbjct: 491 FQPMDCASASASQGLHK 485

BLAST of CSPI01G34590 vs. TrEMBL
Match: B9RRP1_RICCO (Pepsin A, putative OS=Ricinus communis GN=RCOM_1425140 PE=3 SV=1)

HSP 1 Score: 626.3 bits (1614), Expect = 3.3e-176
Identity = 296/430 (68.84%), Positives = 346/430 (80.47%), Query Frame = 1

Query: 80  VIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGP 139
           ++E LRE+RDGYL+SL+IGTPPQV+QVYMDTGSDLTWVPCGNLSFDC DC++Y+N+    
Sbjct: 1   MVEQLREVRDGYLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNS---K 60

Query: 140 RLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAY 199
            ++AF P+HSS+S RD+C S +C DIHSSDN FDPCT+AGCSL++L+K TC RPCPSFAY
Sbjct: 61  LMSAFSPSHSSSSYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAY 120

Query: 200 TYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPF 259
           TYGA GVVTG+LTRD L  H       K IP+FCFGCVG+TY EPIGIAGF RG LS P 
Sbjct: 121 TYGAGGVVTGTLTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHEPIGIAGFVRGTLSFPS 180

Query: 260 QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIG 319
           QLG   KGFSHCFL FK++NNPN SSPL++G+ A+SSKD N+QFTP+LKSPMYPNYYYIG
Sbjct: 181 QLGLLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKD-NMQFTPMLKSPMYPNYYYIG 240

Query: 320 LESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIG 379
           LE+IT+GN        V   LRE D++GNGGMLIDSGTTYTHLPEP YSQL+S  + +I 
Sbjct: 241 LEAITVGNVSAT---TVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIIT 300

Query: 380 YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAA 439
           YPRA +VE+  GFDLCYKVPC NN  +  DD   PSITFHFLNNVS VLPQGN+FYAM+A
Sbjct: 301 YPRATEVEMRAGFDLCYKVPCPNNRLT-DDDNLFPSITFHFLNNVSFVLPQGNHFYAMSA 360

Query: 440 PINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDCV 499
           P NSTVVKCLL+QSM          D GPAG+FGSFQQQN+++VYDLEKER+GFQPMDC 
Sbjct: 361 PSNSTVVKCLLFQSM-------ADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCA 415

Query: 500 SVAAKQGLHK 510
           S A  QGLH+
Sbjct: 421 SAAVSQGLHR 415

BLAST of CSPI01G34590 vs. TAIR10
Match: AT5G45120.1 (AT5G45120.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 598.2 bits (1541), Expect = 4.8e-171
Identity = 308/512 (60.16%), Positives = 375/512 (73.24%), Query Frame = 1

Query: 8   ISTATKFLSLFLL---LVHVSTQTLATNPKTNFPKDS--LVLGLVHSRTSLLTPKKGYNF 67
           + T T  L LFLL   L++ + +T A   K      S  LVL L  S  SL TPK     
Sbjct: 1   METQTHVLFLFLLITLLLNTTNKTQARQHKNPSSSSSSFLVLTLTKSSVSLPTPKSQTQE 60

Query: 68  ISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNL 127
             KK + ++D       V+EPLRE+RDGYL++L+IGTPPQ VQVY+DTGSDLTWVPCGNL
Sbjct: 61  RIKKPLSSVDV------VMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNL 120

Query: 128 SFDCQDCEEYQNN-ISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 187
           SFDC +C + +NN +  P  + F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS
Sbjct: 121 SFDCIECYDLKNNDLKSP--SVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCS 180

Query: 188 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATY 247
           ++ L+K TC RPCPSFAYTYG  G+++G LTRD+L          + +PRF FGCV +TY
Sbjct: 181 VSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDIL------KARTRDVPRFSFGCVTSTY 240

Query: 248 REPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDEN 307
           REPIGIAGFGRGLLSLP QLGF  KGFSHCFLPFKF NNPN SSPLILG  A+S +  ++
Sbjct: 241 REPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDS 300

Query: 308 LQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYT 367
           LQFTP+L +PMYPN YYIGLESITIG   N     V   LR+ D++GNGGML+DSGTTYT
Sbjct: 301 LQFTPMLNTPMYPNSYYIGLESITIGT--NITPTQVPLTLRQFDSQGNGGMLVDSGTTYT 360

Query: 368 HLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNN-SSFVDDAQL--PSIT 427
           HLPEP YSQL++ L+  I YPRA + E  TGFDLCYKVPC NNN +S  +D  +  PSIT
Sbjct: 361 HLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSIT 420

Query: 428 FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ 487
           FHFLNN +++LPQGN+FYAM+AP + +VV+CLL+Q+M+         D GPAG+FGSFQQ
Sbjct: 421 FHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNME-------DGDYGPAGVFGSFQQ 480

Query: 488 QNIEVVYDLEKERLGFQPMDCVSVAAKQGLHK 510
           QN++VVYDLEKER+GFQ MDCV  AA  GL++
Sbjct: 481 QNVKVVYDLEKERIGFQAMDCVLEAASHGLNQ 489

BLAST of CSPI01G34590 vs. TAIR10
Match: AT4G16563.1 (AT4G16563.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 213.4 bits (542), Expect = 3.3e-55
Identity = 150/444 (33.78%), Positives = 204/444 (45.95%), Query Frame = 1

Query: 91  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 150
           YL+SLS+G+    V +Y+DTGSDL W PC    F C  CE      S P       + SS
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPCR--PFTCILCESKPLPPSPPS------SLSS 142

Query: 151 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPR---PCPSFAYTYGASGVV 210
           ++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 SATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 211 TGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGF--SH 270
              L  D L      +  +  +  F FGC   T  EPIG+AGFGRG LSLP QL     H
Sbjct: 203 VAKLYSDSL------SLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPH 262

Query: 271 KG--FSHCFLPFKF-SNNPNFSSPLILGNLA------------------ISSKDENLQFT 330
            G  FS+C +   F S+     SPLILG                        K     FT
Sbjct: 263 LGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFT 322

Query: 331 PLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 390
            +L++P +P +Y + L+ I+IG  +          LR ID  G GG+++DSGTT+T LP 
Sbjct: 323 EMLENPKHPYFYSVSLQGISIGKRN----IPAPAMLRRIDKNGGGGVVVDSGTTFTMLPA 382

Query: 391 PLYSQLISNLELVIG--YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFL- 450
             Y+ ++   +  +G  + RA +VE ++G   CY +             ++P++  HF  
Sbjct: 383 KFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN---------QTVKVPALVLHFAG 442

Query: 451 NNVSVVLPQGNNFYAMA----APINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ 502
           N  SV LP+ N FY              + CL+  +    G D      G   I G++QQ
Sbjct: 443 NRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMN----GGDESELRGGTGAILGNYQQ 494

BLAST of CSPI01G34590 vs. TAIR10
Match: AT3G52500.1 (AT3G52500.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 181.4 bits (459), Expect = 1.4e-45
Identity = 133/418 (31.82%), Positives = 188/418 (44.98%), Query Frame = 1

Query: 90  GYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHS 149
           GY +SLS GTP Q +    DTGS L W+PC +  + C  C+   + +    +  F+P +S
Sbjct: 89  GYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTS-RYLCSGCDF--SGLDPTLIPRFIPKNS 148

Query: 150 STSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTG 209
           S+S    C S  C  ++    P   C   GC   +     C   CP +   YG       
Sbjct: 149 SSSKIIGCQSPKCQFLYG---PNVQCR--GCDPNTR---NCTVGCPPYILQYGLGSTAGV 208

Query: 210 SLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSHKGFS 269
            +T  + F        +  +P F  GC   + R+P GIAGFGRG +SLP Q+    K FS
Sbjct: 209 LITEKLDFP-------DLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNL--KRFS 268

Query: 270 HCFLPFKFSNNPNFSSPLILGNLA---ISSKDENLQFTPLLKSPMYPN-----YYYIGLE 329
           HC +  +F +  N ++ L L   +     SK   L +TP  K+P   N     YYY+ L 
Sbjct: 269 HCLVSRRFDDT-NVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLR 328

Query: 330 SITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNL-ELVIGY 389
            I +G         + +K     T G+GG ++DSG+T+T +  P++  +       +  Y
Sbjct: 329 RIYVGRK----HVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNY 388

Query: 390 PRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAP 449
            R K +E  TG   C+ +  K        D  +P + F F     + LP  N F      
Sbjct: 389 TREKDLEKETGLGPCFNISGKG-------DVTVPELIFEFKGGAKLELPLSNYFTF---- 448

Query: 450 INSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 499
           + +T   CL   S   V   N S   GPA I GSFQQQN  V YDLE +R GF    C
Sbjct: 449 VGNTDTVCLTVVSDKTV---NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467

BLAST of CSPI01G34590 vs. TAIR10
Match: AT2G03200.1 (AT2G03200.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 156.4 bits (394), Expect = 4.8e-38
Identity = 161/526 (30.61%), Positives = 222/526 (42.21%), Query Frame = 1

Query: 1   MPSISSSISTATKFLSLFLLLVHVSTQTLATNPKT---NFPKDSLVLGLVH-----SRTS 60
           M S SSS      FL LF  L+ VS+   +   +T   N P+    L L H     + T 
Sbjct: 1   MASSSSSSLLFPFFLILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTK 60

Query: 61  LLTPKKGYN--FISKKRMKAM------DQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVV 120
           +   ++G N  F    R+ A+       + D  +N+  P       +LM LSIG P    
Sbjct: 61  IQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKY 120

Query: 121 QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMD 180
              +DTGSDL W  C      C +C +    I       F P  SS+  +  C S  C  
Sbjct: 121 SAIVDTGSDLIWTQCK----PCTECFDQPTPI-------FDPEKSSSYSKVGCSSGLCNA 180

Query: 181 IHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNN 240
           +  S+   D             K  C      + YTYG       S TR +L T      
Sbjct: 181 LPRSNCNED-------------KDAC-----EYLYTYGDY-----SSTRGLLATETFTFE 240

Query: 241 NNKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNN 300
           +   I    FGC     G  + +  G+ G GRG LSL  QL      FS+C    + S  
Sbjct: 241 DENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQL--KETKFSYCLTSIEDSEA 300

Query: 301 PNFSSPLILGNLA--------ISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNF 360
              SS L +G+LA         S   E  +   LL++P  P++YY+ L+ IT+G      
Sbjct: 301 ---SSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK---- 360

Query: 361 RFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGF 420
           R  V     E+   G GGM+IDSGTT T+L E  +  L       +  P       +TG 
Sbjct: 361 RLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG--STGL 420

Query: 421 DLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQ 480
           DLC+K+P    N +      +P + FHF     + LP G N+  M A  +ST V CL   
Sbjct: 421 DLCFKLPDAAKNIA------VPKMIFHF-KGADLELP-GENY--MVAD-SSTGVLCLAMG 458

Query: 481 SMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 499
           S +G+             IFG+ QQQN  V++DLEKE + F P +C
Sbjct: 481 SSNGMS------------IFGNVQQQNFNVLHDLEKETVSFVPTEC 458

BLAST of CSPI01G34590 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 149.4 bits (376), Expect = 5.9e-36
Identity = 140/510 (27.45%), Positives = 204/510 (40.00%), Query Frame = 1

Query: 2   PSISSSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLT-PKKGY 61
           PS  +++S   K+L L LL             K+ FP  +  L L   R   L+  +K  
Sbjct: 19  PSNIAAVSNHNKYLKLPLLR------------KSPFPSPTQALALDTRRLHFLSLRRKPI 78

Query: 62  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 121
            F+    +       G              Y + L IG PPQ + +  DTGSDL WV C 
Sbjct: 79  PFVKSPVVSGAASGSGQ-------------YFVDLRIGQPPQSLLLIADTGSDLVWVKCS 138

Query: 122 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 181
                C++C  +           F P HSST     C    C  +   D        A  
Sbjct: 139 ----ACRNCSHHS------PATVFFPRHSSTFSPAHCYDPVCRLVPKPDR-------API 198

Query: 182 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGC---- 241
              + +  TC      + Y Y    + +G   R+      + +    ++    FGC    
Sbjct: 199 CNHTRIHSTC-----HYEYGYADGSLTSGLFARETTSLKTS-SGKEARLKSVAFGCGFRI 258

Query: 242 -----VGATYREPIGIAGFGRGLLSLPFQLG--FSHKGFSHCFLPFKFSNNPNFSSPLIL 301
                 G ++    G+ G GRG +S   QLG  F +K FS+C + +  S  P  +S LI+
Sbjct: 259 SGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK-FSYCLMDYTLSPPP--TSYLII 318

Query: 302 GNLAISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNG 361
           GN         L FTPLL +P+ P +YY+ L+S+ +    N  +  +   + EID  GNG
Sbjct: 319 GN--GGDGISKLFFTPLLTNPLSPTFYYVKLKSVFV----NGAKLRIDPSIWEIDDSGNG 378

Query: 362 GMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVD 421
           G ++DSGTT   L EP Y  +I+ +   +  P A    L  GFDLC  V     +     
Sbjct: 379 GTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD--ALTPGFDLCVNV-----SGVTKP 438

Query: 422 DAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMD-GVGDDNDSDDNGP 481
           +  LP + F F      V P  N F           ++CL  QS+D  VG          
Sbjct: 439 EKILPRLKFEFSGGAVFVPPPRNYFIE-----TEEQIQCLAIQSVDPKVG---------- 449

Query: 482 AGIFGSFQQQNIEVVYDLEKERLGFQPMDC 499
             + G+  QQ     +D ++ RLGF    C
Sbjct: 499 FSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449

BLAST of CSPI01G34590 vs. NCBI nr
Match: gi|778665454|ref|XP_004145478.2| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus])

HSP 1 Score: 1051.6 bits (2718), Expect = 4.5e-304
Identity = 515/519 (99.23%), Positives = 515/519 (99.23%), Query Frame = 1

Query: 1   MPSISSSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY 60
           MPSISS ISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY
Sbjct: 1   MPSISS-ISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120
           NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNY---NNNNKQIPRFCFGCV 240
           SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNY   NNNNKQIPRFCFGCV
Sbjct: 181 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCV 240

Query: 241 GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300
           GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
Sbjct: 241 GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300

Query: 301 DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT 360
           DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT
Sbjct: 301 DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT 360

Query: 361 TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT 420
           TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT
Sbjct: 361 TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT 420

Query: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ 480
           FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ
Sbjct: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQ 480

Query: 481 QNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 517
           QNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES
Sbjct: 481 QNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 518

BLAST of CSPI01G34590 vs. NCBI nr
Match: gi|659118383|ref|XP_008459091.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo])

HSP 1 Score: 1008.8 bits (2607), Expect = 3.4e-291
Identity = 492/521 (94.43%), Positives = 503/521 (96.55%), Query Frame = 1

Query: 1   MPSISSSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY 60
           MPSISS+ S ATKFLSLFLLLVH S QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY
Sbjct: 1   MPSISST-SIATKFLSLFLLLVHASKQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120
           NFISKKRMKAMDQ DGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNNISGP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNN-------KQIPRFC 240
           SLA+LVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLF HGNY+NNN       KQ+PRFC
Sbjct: 181 SLATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRFC 240

Query: 241 FGCVGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA 300
           FGCVGATYREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LA
Sbjct: 241 FGCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLA 300

Query: 301 ISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLI 360
           ISSKDENLQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLI
Sbjct: 301 ISSKDENLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI 360

Query: 361 DSGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQL 420
           DSGTTYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYKVPCKNNNSSFVDD+QL
Sbjct: 361 DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQL 420

Query: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFG 480
           PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFG
Sbjct: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFG 480

Query: 481 SFQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRN 515
           SFQQQN++VVYDLEKERLGFQ MDCVSVAA QGLHKNVRRN
Sbjct: 481 SFQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKNVRRN 520

BLAST of CSPI01G34590 vs. NCBI nr
Match: gi|567912849|ref|XP_006448738.1| (hypothetical protein CICLE_v10017863mg [Citrus clementina])

HSP 1 Score: 652.1 bits (1681), Expect = 8.0e-184
Identity = 331/510 (64.90%), Positives = 398/510 (78.04%), Query Frame = 1

Query: 4   ISSSISTATKFLSLFLLLVHVST-QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNF 63
           ++ + S     + LFLL + ++  QTLAT  + N  K SLVLGL +SR SLL P    + 
Sbjct: 1   MAKAYSNIATIILLFLLSMSLTFHQTLAT--QKNNGKHSLVLGLTNSRASLLIPSASKSS 60

Query: 64  ISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNL 123
           I KK  + +D       ++EPLRE+RDGYL+SL+IGTP QV+QVYMDTGSDLTWVPCGNL
Sbjct: 61  I-KKPSETLD-------MMEPLREVRDGYLISLNIGTPTQVIQVYMDTGSDLTWVPCGNL 120

Query: 124 SFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSL 183
           SFDC DC++Y+NN     ++ F P+ SS+S RDTC SSFC++IHSSDNPFDPCT++GCSL
Sbjct: 121 SFDCVDCDDYRNN---KLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSL 180

Query: 184 ASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYR 243
           ++L+K TC RPCPSFAYTYG  G+VTG LTRD L  HG+     ++IP+FCFGCVG+TYR
Sbjct: 181 STLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYR 240

Query: 244 EPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQ 303
           EPIGIAGFGRG LS+P QLGF  KGFSHCFL FK++N+PN SSPL+LG++AISSKD NLQ
Sbjct: 241 EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVLGDVAISSKD-NLQ 300

Query: 304 FTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHL 363
           FTP+LKSPMYPNYYYIGLE+ITIGN        V   LRE D++GNGG+L+DSGTTYTHL
Sbjct: 301 FTPMLKSPMYPNYYYIGLEAITIGNSSLT---EVPLSLREFDSQGNGGLLVDSGTTYTHL 360

Query: 364 PEPLYSQLISNLELVIG-YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFL 423
           PEP YSQL+S L+  I  YPRAK+VE  TGFDLCY+VPC NN  +F DD   PSITFHFL
Sbjct: 361 PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN--TFTDDL-FPSITFHFL 420

Query: 424 NNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIE 483
           NNVS+VLPQGN+FYAM+AP NS+ VKCLL+QSM       D  D GP+G+FGSFQQQN+E
Sbjct: 421 NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-------DDGDYGPSGVFGSFQQQNVE 480

Query: 484 VVYDLEKERLGFQPMDCVSVAAKQGLHKNV 512
           VVYDLEKER+GFQPMDC S A+ QGLHK +
Sbjct: 481 VVYDLEKERIGFQPMDCASTASAQGLHKKL 483

BLAST of CSPI01G34590 vs. NCBI nr
Match: gi|985434164|ref|XP_006468472.2| (PREDICTED: aspartic proteinase nepenthesin-1 [Citrus sinensis])

HSP 1 Score: 651.4 bits (1679), Expect = 1.4e-183
Identity = 330/510 (64.71%), Positives = 397/510 (77.84%), Query Frame = 1

Query: 4   ISSSISTATKFLSLFLLLVHVST-QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNF 63
           ++ + S     + LFLL + +   QTLAT  + N  K SLVLGL +SR SLL P    + 
Sbjct: 1   MAKAYSNIATIILLFLLSMSLRFHQTLAT--QKNNGKHSLVLGLTNSRVSLLIPSASKSS 60

Query: 64  ISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNL 123
           I KK  + +D       ++EPLRE+RDGYL+SL+IGTP QV+QVYMDTGSDLTWVPCGNL
Sbjct: 61  I-KKPSETLD-------MMEPLREVRDGYLISLNIGTPTQVIQVYMDTGSDLTWVPCGNL 120

Query: 124 SFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSL 183
           SFDC DC++Y+NN     ++ F P+ SS+S RDTC SSFC++IHSSDNPFDPCT++GCSL
Sbjct: 121 SFDCMDCDDYRNN---KLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSL 180

Query: 184 ASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYR 243
           ++L+K TC RPCPSFAYTYG  G+VTG LTRD L  HG+     ++IP+FCFGCVG+TYR
Sbjct: 181 STLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYR 240

Query: 244 EPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQ 303
           EPIGIAGFGRG LS+P QLGF  KGFSHCFL FK++N+PN SSPL++G++AISSKD NLQ
Sbjct: 241 EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKD-NLQ 300

Query: 304 FTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHL 363
           FTP+LKSPMYPNYYYIGLE+ITIGN        V   LRE D++GNGG+L+DSGTTYTHL
Sbjct: 301 FTPMLKSPMYPNYYYIGLEAITIGNSSLT---EVPLSLREFDSQGNGGLLVDSGTTYTHL 360

Query: 364 PEPLYSQLISNLELVIG-YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFL 423
           PEP YSQL+S L+  I  YPRAK+VE  TGFDLCY+VPC NN  +F DD   PSITFHFL
Sbjct: 361 PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN--TFTDDL-FPSITFHFL 420

Query: 424 NNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIE 483
           NNVS+VLPQGN+FYAM+AP NS+ VKCLL+QSM       D  D GP+G+FGSFQQQN+E
Sbjct: 421 NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-------DDGDYGPSGVFGSFQQQNVE 480

Query: 484 VVYDLEKERLGFQPMDCVSVAAKQGLHKNV 512
           VVYDLEKER+GFQPMDC S A+ QGLHK +
Sbjct: 481 VVYDLEKERIGFQPMDCASTASAQGLHKKL 483

BLAST of CSPI01G34590 vs. NCBI nr
Match: gi|1000972698|ref|XP_002516410.2| (PREDICTED: aspartic proteinase nepenthesin-2 [Ricinus communis])

HSP 1 Score: 647.1 bits (1668), Expect = 2.6e-182
Identity = 317/493 (64.30%), Positives = 375/493 (76.06%), Query Frame = 1

Query: 17  LFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKRMKAMDQTDG 76
           LFLLL++    +  +  K   P   L+LGL HSR SL  P    N  +  R +  D+ D 
Sbjct: 11  LFLLLINPLLNSNQSLAKRRNPNYILILGLTHSRASLPLP----NASTSSRKRQTDELD- 70

Query: 77  DDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNI 136
              ++E LRE+RDGYL+SL+IGTPPQV+QVYMDTGSDLTWVPCGNLSFDC DC++Y+N+ 
Sbjct: 71  ---MVEQLREVRDGYLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNS- 130

Query: 137 SGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPS 196
               ++AF P+HSS+S RD+C S +C DIHSSDN FDPCT+AGCSL++L+K TC RPCPS
Sbjct: 131 --KLMSAFSPSHSSSSYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPS 190

Query: 197 FAYTYGASGVVTGSLTRDVLFTHGNYNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLS 256
           FAYTYGA GVVTG+LTRD L  H       K IP+FCFGCVG+TY EPIGIAGF RG LS
Sbjct: 191 FAYTYGAGGVVTGTLTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHEPIGIAGFVRGTLS 250

Query: 257 LPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYY 316
            P QLG   KGFSHCFL FK++NNPN SSPL++G+ A+SSKD N+QFTP+LKSPMYPNYY
Sbjct: 251 FPSQLGLLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKD-NMQFTPMLKSPMYPNYY 310

Query: 317 YIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLEL 376
           YIGLE+IT+GN        V   LRE D++GNGGMLIDSGTTYTHLPEP YSQL+S  + 
Sbjct: 311 YIGLEAITVGNVSAT---TVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKA 370

Query: 377 VIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYA 436
           +I YPRA +VE+  GFDLCYKVPC NN  +  DD   PSITFHFLNNVS VLPQGN+FYA
Sbjct: 371 IITYPRATEVEMRAGFDLCYKVPCPNNRLT-DDDNLFPSITFHFLNNVSFVLPQGNHFYA 430

Query: 437 MAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPM 496
           M+AP NSTVVKCLL+QSM          D GPAG+FGSFQQQN+++VYDLEKER+GFQPM
Sbjct: 431 MSAPSNSTVVKCLLFQSM-------ADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPM 480

Query: 497 DCVSVAAKQGLHK 510
           DC S A  QGLH+
Sbjct: 491 DCASAAVSQGLHR 480

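Each HSP header above reports identity and positives as a count over the aligned length, followed by a percentage. The sketch below is a minimal, unofficial way to parse such a line and recompute the percentages from the counts. The function name `parse_hsp_summary` and the returned field names are hypothetical and not part of any BLAST tool; the pattern assumes the exact line layout printed on this page.

```python
import re

# Layout of the HSP summary line as it appears on this page (an assumption).
# The optional "i" accepts both the spellings "Postives" and "Positives".
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper: returns the raw counts, the percentages as
    reported in the text, and the percentages recomputed from the counts.
    """
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, ident_pct, positives, pos_length, pos_pct = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Percentage = matching columns / aligned columns (gaps included).
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 161/526 (30.61%), Positives = 222/526 (42.21%), Query Frame = 1"
    for key, value in parse_hsp_summary(example).items():
        print(key, value)
```

The denominator is the alignment length including gap columns, which is why it can exceed the 518-residue query length (for example 526 for AT2G03200.1).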
The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
ASP63_ARATH  5.9e-54  33.78     Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 S...
NEP2_NEPGR   3.6e-35  29.84     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
APF2_ARATH   4.4e-33  28.37     Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP1_NEPGR   1.1e-31  30.60     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
CDR1_ARATH   3.5e-30  28.47     Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0LYP0_CUCSA  3.1e-304  99.23     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G704590 PE=3 SV=1
V4TN99_9ROSI      5.6e-184  64.90     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10017863mg PE=3 SV=1
K4B7R8_SOLLC      2.4e-179  63.37     Uncharacterized protein OS=Solanum lycopersicum PE=3 SV=1
M5X3K9_PRUPE      7.0e-179  64.59     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015155mg PE=3 SV=1
B9RRP1_RICCO      3.3e-176  68.84     Pepsin A, putative OS=Ricinus communis GN=RCOM_1425140 PE=3 SV=1

Match Name   E-value   Identity  Description
AT5G45120.1  4.8e-171  60.16     Eukaryotic aspartyl protease family protein
AT4G16563.1  3.3e-55   33.78     Eukaryotic aspartyl protease family protein
AT3G52500.1  1.4e-45   31.82     Eukaryotic aspartyl protease family protein
AT2G03200.1  4.8e-38   30.61     Eukaryotic aspartyl protease family protein
AT3G25700.1  5.9e-36   27.45     Eukaryotic aspartyl protease family protein

Match Name                         E-value   Identity  Description
gi|778665454|ref|XP_004145478.2|   4.5e-304  99.23     PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus]
gi|659118383|ref|XP_008459091.1|   3.4e-291  94.43     PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]
gi|567912849|ref|XP_006448738.1|   8.0e-184  64.90     hypothetical protein CICLE_v10017863mg [Citrus clementina]
gi|985434164|ref|XP_006468472.2|   1.4e-183  64.71     PREDICTED: aspartic proteinase nepenthesin-1 [Citrus sinensis]
gi|1000972698|ref|XP_002516410.2|  2.6e-182  64.30     PREDICTED: aspartic proteinase nepenthesin-2 [Ricinus communis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR001969   Aspartic_peptidase_AS
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016020 membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G34590.1  CSPI01G34590.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                  Source   Source Term       Source Description               Alignment
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES               coord: 190..506, score: 4.9E-219; coord: 7..162, score: 4.9E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE                     coord: 351..362, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10  -                                coord: 298..499, score: 3.6E-41; coord: 89..291, score: 2.3
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases                   coord: 90..500, score: 2.76
None       No IPR available                 PANTHER  PTHR13683:SF264   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 190..506, score: 4.9E-219; coord: 7..162, score: 4.9E

The following gene(s) are paralogous to this gene:

None