CSPI05G27660 (gene) Wild cucumber (PI 183967)

NameCSPI05G27660
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionAspartic proteinase nepenthesin-1
LocationChr5 : 26106153 .. 26109737 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTGTAGGCATTGAAGTTCATCACATATATTCAATATAGGGTTATTATATTTAATTTGATAGTGTAATTGAATTTAATTAGTGTTAAACTACAAATTATTTGGCAGGGAATGGGGAGTACGAATGTTGTATAAATATGGATGGGGGTGTGAGTGAGTCTTCCCCACAATCATCATACCCCCTTTAATATTCCTTTCAATTAATTACTTCCCACACCACACTCGAAGGAAGCCATGAACAACACCAAATCCCCCTTCCTTCTCCTCCTCCTCCTCCTACTCCTCCACCTATTCTCCATTTCCACCGCCAAATCCCATATCCCCTCCAACTGCAACCCCGCCGCTGACCGGAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGCTCCCCATTCCGCCCATCAAAGCCCCTTTCTTGGGCCGACAATGTACTCCAAATGCAGGCCAAGGACCAAGCCCGTCTCCAGTTCCTATCCTCCCTGGTGGCTCGAAGGTCGTTCGTCCCCATCGCCTCCGCCCGGCAGCTGATCCAGAGCCCCACCTTCGTGGTCCGGGCCAAGATCGGGACCCCTGCCCAGACCCTCCTCTTGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCTTGCTCCGGCTGCATCGGTTGCCCTTCCACCACCGTCTTCTCATCCGACAAGTCGTCCTCCTTCCGTCCCCTCCCCTGCCAATCCTCTCAATGCAACCAGGTACTCATCTTTTTTATTTTCCTACCCCTTCCTCTACTCACAAATCATTTAAGGCATTTAAATATATTTAAGATACCTAATCTTTTTTTTTTTTTTTCTCTTCTTAATTGAGAGTTATAGTTGGTTTGGAATCTCAATTAGCTTCCTATATATATACATTGAACAAAAAACAACCTGATTCATATGCAAAAGAAGAAAAAGGAATTTTAAGTGTTGAAAAGTTGCAAGAGCAAGTAGAGGGAAGTAGTGTTTAATTTTAAGATTTTTGTAGGAGCGGTGTGATATTAATTAACTATACAAGTGTGCGCCGCTAGCATTAGATTAGTGGAGAGGCCACATATCCACATATCCCTTGGAATATTCAATCACTATAGCAACATCCAATAAATCAATAATACTTCACTTACATAACATTCTCCTTTTGCTTCCATCCCCACTTTATGCTTTTCTTTCCTTTTTTTTCTTCTTCTTCTTTAATATTAAAAATGTAATGCCTCACACTTTTATAACTAATTATTTTAGCAAACATTTTACAATAATTACGAATATAACCAAATTTATTCATTATGGGCATTTCTTTAATAATATACTCTAAAAATTGTAGTGTTCCTTAGTCTTTTGTTTCAATATATTATATTTGATTCTTGTATTTGCAATTGCTCTGTATTTGCATCTGATCTATTTGATCTATTTTTACCACCCTGTAAAATTAAACAATTTGTCATTGGGTATCAACTTTTTTCTTTTTTAATTTGGTTTTTAAATTTAGAGTGTATTACTAAATTTTGTATTTTGTGTTATGTGAGTTTCAAGGTTCTGTATTTTGTTTTCAATAGATCAGTAACTTATTCAACAAATTGTAAAAAAATGTACATATGTTTTAACATACAAAATTGAAAGTTTGAAGTTTTGCATTAAGCAGCTTATCTATATATATAATTTGTATGGTTTGAAAACTTTAATTACGACATAAACTTGGTAGGTATGAAATTTTTTGAAGTCTTAAAAGTTATGAGACATAAAACAAAACTTATAAAATTGAGCAGTAGATGAGATGTAATTGTCAGGTTTAGATAGTATTTTATTTTAAATATAGAATGTGGTGACCAACTTAAAATTAAAAAGAAGAAGTGATTCTTTCTAATTAAATTGTAACCAATTTAGATTTGCATAGAAAGTTTTAGATGCTTTTGAAGAATCCAATCCAAGTTCCCACGTGCCAAGCAATCGTTTCCTTCGAGGCAACACGTGTGACAGGAAAAAGCCTGATTTAAATTATTTCCGTACGAGTTATTTCCTTTTAATTTTAATAATATTTTGGGTGTCTCTAGAATCCACGAGTGTCAGTGAGTCACCGACGGGGGAGGGTTGAGTCACGGGCCTAAAGAAAAACTTTTCCCAAAAAAATGAAAAAGTCCTTGTCGTATTGGAGAATGGAGGTATCACGGTGACACTTTTTATTTGTCAGGTACCGAACCCGAGTTGCAGCGGCAGCGCTTGTGGTTTCAACCTCACGTACGGGAGCTCGACGGTGGCGGCGGATCTGGTTCAAGACAACCTAACTCTGGCCACAGACTCAGTCCCCTCCTACACATTCGGTTGCATCAGGAAGGCCACGGGTAGTTCAGTACCGCCTCAGGGGCTATTGGGGTTGGGCCGAGGCCCATTATCACTTTTGGGCCAGAGCCAGAGCTTGTACCAATCCACATTCTCCTACTGTCTTCCCAGCTTCAAATCCGTCAACTTTTCTGGCTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCATTAGAATTAAGTACACCCCGCTGCTCCGCAACCCCCGACGGTCCTCGTTGTATTACGTGAATTTAATTTCCATCAGAGTTGGCCGCAAAATCGTCGATATTCCTCCCTCTGCTCTCGCTTTTAACTCCGCCACCGGCGCCGGCACTGTCATCGATTCCGGTAATTTAATGTGTTTTGAAAAACAATAGCCAAAATTTGAATGTATAAAAATGGTTGTTGAGTTGTTGAGTTGTTGTTGTTGTTCAGGGACGACGTTCACAAGGCTGGTGGCACCGGCGTACACGGCGGTGAGGGACGAATTCCGGCGAAGAGTGGGGAGAAACGTAACAGTTTCATCCTTGGGGGGGTTTGACACGTGCTATACAGTCCCAATCATCTCTCCCACCATAACCTTTATGTTCGCGGGAATGAACGTGACTCTTCCACCGGACAACTTCCTGATACACAGCACGGCGGGATCAACGACGTGCCTGGCCATGGCCGCGGCCCCGGATAACGTGAACTCTGTACTGAACGTGATTGCTAGCATGCAGCAGCAAAACCATCGCATACTGTTCGACATACCGAATTCCCGCGTGGGAGTTGCCCGTGAATCCTGCTCCTCTTAATTAATATAACACAAGTTGGTGGTGGTGGTTGTTGGTTTTTAGAAATTAGAATAAGAATATGTGTGTGGTAAAAGTAAAAGTAAAAGTAGTAATTTTGAATTATTGGTGGTTTGGTTTTCAAAATTTAGTAGTACATTTGAATGGAGTAAGAAGGGCGGTAAAGGAGAGAGAGAAGTATAAAAAAGAAAATGGTGTCATTTGGGATTTGGAATGTTTTAGGGTTTATTATATTATTCAATATGTCTCTCTCTTGACCTAAAGCAAACGGCTTTTTTTGGTTTTCTATTCTCCGCCTTGTTTCTGACTTTGTGTGTCGTAACCCTACTTTTACTACTCAACAGAACACTTCTTTTTATTTTTATATTTATAATTTTATATTATATGAGAAAAGACAGAGTATACGTACCTGAGCTTTCCACTTATATATATATATCTTCTCTTCTTTTACTACAGTTTTCCATTTTCCC

mRNA sequence

ATGAACAACACCAAATCCCCCTTCCTTCTCCTCCTCCTCCTCCTACTCCTCCACCTATTCTCCATTTCCACCGCCAAATCCCATATCCCCTCCAACTGCAACCCCGCCGCTGACCGGAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGCTCCCCATTCCGCCCATCAAAGCCCCTTTCTTGGGCCGACAATGTACTCCAAATGCAGGCCAAGGACCAAGCCCGTCTCCAGTTCCTATCCTCCCTGGTGGCTCGAAGGTCGTTCGTCCCCATCGCCTCCGCCCGGCAGCTGATCCAGAGCCCCACCTTCGTGGTCCGGGCCAAGATCGGGACCCCTGCCCAGACCCTCCTCTTGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCTTGCTCCGGCTGCATCGGTTGCCCTTCCACCACCGTCTTCTCATCCGACAAGTCGTCCTCCTTCCGTCCCCTCCCCTGCCAATCCTCTCAATGCAACCAGGTACCGAACCCGAGTTGCAGCGGCAGCGCTTGTGGTTTCAACCTCACGTACGGGAGCTCGACGGTGGCGGCGGATCTGGTTCAAGACAACCTAACTCTGGCCACAGACTCAGTCCCCTCCTACACATTCGGTTGCATCAGGAAGGCCACGGGTAGTTCAGTACCGCCTCAGGGGCTATTGGGGTTGGGCCGAGGCCCATTATCACTTTTGGGCCAGAGCCAGAGCTTGTACCAATCCACATTCTCCTACTGTCTTCCCAGCTTCAAATCCGTCAACTTTTCTGGCTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCATTAGAATTAAGTACACCCCGCTGCTCCGCAACCCCCGACGGTCCTCGTTGTATTACGTGAATTTAATTTCCATCAGAGTTGGCCGCAAAATCGTCGATATTCCTCCCTCTGCTCTCGCTTTTAACTCCGCCACCGGCGCCGGCACTGTCATCGATTCCGGGACGACGTTCACAAGGCTGGTGGCACCGGCGTACACGGCGGTGAGGGACGAATTCCGGCGAAGAGTGGGGAGAAACGTAACAGTTTCATCCTTGGGGGGGTTTGACACGTGCTATACAGTCCCAATCATCTCTCCCACCATAACCTTTATGTTCGCGGGAATGAACGTGACTCTTCCACCGGACAACTTCCTGATACACAGCACGGCGGGATCAACGACGTGCCTGGCCATGGCCGCGGCCCCGGATAACGTGAACTCTGTACTGAACGTGATTGCTAGCATGCAGCAGCAAAACCATCGCATACTGTTCGACATACCGAATTCCCGCGTGGGAGTTGCCCGTGAATCCTGCTCCTCTTAA

Coding sequence (CDS)

ATGAACAACACCAAATCCCCCTTCCTTCTCCTCCTCCTCCTCCTACTCCTCCACCTATTCTCCATTTCCACCGCCAAATCCCATATCCCCTCCAACTGCAACCCCGCCGCTGACCGGAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGCTCCCCATTCCGCCCATCAAAGCCCCTTTCTTGGGCCGACAATGTACTCCAAATGCAGGCCAAGGACCAAGCCCGTCTCCAGTTCCTATCCTCCCTGGTGGCTCGAAGGTCGTTCGTCCCCATCGCCTCCGCCCGGCAGCTGATCCAGAGCCCCACCTTCGTGGTCCGGGCCAAGATCGGGACCCCTGCCCAGACCCTCCTCTTGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCTTGCTCCGGCTGCATCGGTTGCCCTTCCACCACCGTCTTCTCATCCGACAAGTCGTCCTCCTTCCGTCCCCTCCCCTGCCAATCCTCTCAATGCAACCAGGTACCGAACCCGAGTTGCAGCGGCAGCGCTTGTGGTTTCAACCTCACGTACGGGAGCTCGACGGTGGCGGCGGATCTGGTTCAAGACAACCTAACTCTGGCCACAGACTCAGTCCCCTCCTACACATTCGGTTGCATCAGGAAGGCCACGGGTAGTTCAGTACCGCCTCAGGGGCTATTGGGGTTGGGCCGAGGCCCATTATCACTTTTGGGCCAGAGCCAGAGCTTGTACCAATCCACATTCTCCTACTGTCTTCCCAGCTTCAAATCCGTCAACTTTTCTGGCTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCATTAGAATTAAGTACACCCCGCTGCTCCGCAACCCCCGACGGTCCTCGTTGTATTACGTGAATTTAATTTCCATCAGAGTTGGCCGCAAAATCGTCGATATTCCTCCCTCTGCTCTCGCTTTTAACTCCGCCACCGGCGCCGGCACTGTCATCGATTCCGGGACGACGTTCACAAGGCTGGTGGCACCGGCGTACACGGCGGTGAGGGACGAATTCCGGCGAAGAGTGGGGAGAAACGTAACAGTTTCATCCTTGGGGGGGTTTGACACGTGCTATACAGTCCCAATCATCTCTCCCACCATAACCTTTATGTTCGCGGGAATGAACGTGACTCTTCCACCGGACAACTTCCTGATACACAGCACGGCGGGATCAACGACGTGCCTGGCCATGGCCGCGGCCCCGGATAACGTGAACTCTGTACTGAACGTGATTGCTAGCATGCAGCAGCAAAACCATCGCATACTGTTCGACATACCGAATTCCCGCGTGGGAGTTGCCCGTGAATCCTGCTCCTCTTAA
BLAST of CSPI05G27660 vs. Swiss-Prot
Match: AED3_ARATH (Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 5.3e-112
Identity = 227/456 (49.78%), Postives = 304/456 (66.67%), Query Frame = 1

Query: 1   MNNTKSPFLLLLLLLLLHLFSISTAKSHIPSNCNPAA-DRSSTLQVFHIFSPCSPFRPSK 60
           M ++   F   L LLL   F+ +T  +     C  AA D S  L +  I + CSPF P+ 
Sbjct: 1   MASSSLHFFFFLTLLLPFTFTTATRDT-----CATAAPDGSDDLSIIPINAKCSPFAPTH 60

Query: 61  -PLSWADNVLQMQAKDQARLQFLSSLVARR---SFVPIASARQLIQSPTFVVRAKIGTPA 120
              S  D VL M + D  RL +LSSLVA +   + VP+AS  QL     +VVRAK+GTP 
Sbjct: 61  VSASVIDTVLHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQL-HIGNYVVRAKLGTPP 120

Query: 121 QTLLLALDTSNDAAWIPCSGCIGCPS-TTVFSSDKSSSFRPLPCQSSQCNQV-----PNP 180
           Q + + LDTSNDA W+PCSGC GC + +T F+++ SS++  + C ++QC Q      P+ 
Sbjct: 121 QLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSS 180

Query: 181 SCSGSACGFNLTYGS-STVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLG 240
           S   S C FN +YG  S+ +A LVQD LTLA D +P+++FGCI  A+G+S+PPQGL+GLG
Sbjct: 181 SPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLG 240

Query: 241 RGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLY 300
           RGP+SL+ Q+ SLY   FSYCLPSF+S  FSGSL+LG + QP  I+YTPLLRNPRR SLY
Sbjct: 241 RGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLY 300

Query: 301 YVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGR 360
           YVNL  + VG   V + P  L F++ +GAGT+IDSGT  TR   P Y A+RDEFR++V  
Sbjct: 301 YVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNV 360

Query: 361 NVTVSSLGGFDTCYTV--PIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPD 420
           + + S+LG FDTC++     ++P IT     +++ LP +N LIHS+AG+ TCL+MA    
Sbjct: 361 S-SFSTLGAFDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQ 420

Query: 421 NVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCS 443
           N N+VLNVIA++QQQN RILFD+PNSR+G+A E C+
Sbjct: 421 NANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CSPI05G27660 vs. Swiss-Prot
Match: AP25_ORYSJ (Aspartyl protease 25 OS=Oryza sativa subsp. japonica GN=AP25 PE=2 SV=1)

HSP 1 Score: 363.2 bits (931), Expect = 4.0e-99
Identity = 207/430 (48.14%), Postives = 274/430 (63.72%), Query Frame = 1

Query: 37  ADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVARR--SFVPIA 96
           A  ++ L V+H   P SP     PL   ++++ +   D ARL FLSS  A    S  P+A
Sbjct: 19  AAAAAELSVYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVA 78

Query: 97  SARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFR 156
           S +     P++VVRA +G+P+Q LLLALDTS DA W  CS C  CPS+++F+   SSS+ 
Sbjct: 79  SGQA---PPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYA 138

Query: 157 PLPCQSSQC-----NQVPNPSCSGSA---------CGFNLTYGSSTVAADLVQDNLTLAT 216
            LPC SS C        P P   G A         C F+  +  ++  A L  D L L  
Sbjct: 139 SLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGK 198

Query: 217 DSVPSYTFGCIRKATG--SSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNF 276
           D++P+YTFGC+   TG  +++P QGLLGLGRGP++LL Q+ SLY   FSYCLPS++S  F
Sbjct: 199 DAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYF 258

Query: 277 SGSLRLGPVA-QPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGA 336
           SGSLRLG    QP  ++YTP+LRNP RSSLYYVN+  + VG   V +P  + AF++ATGA
Sbjct: 259 SGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGA 318

Query: 337 GTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPII----SPTIT- 396
           GTV+DSGT  TR  AP Y A+R+EFRR+V      +SLG FDTC+    +    +P +T 
Sbjct: 319 GTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTV 378

Query: 397 FMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNS 443
            M  G+++ LP +N LIHS+A    CLAMA AP NVNSV+NVIA++QQQN R++FD+ NS
Sbjct: 379 HMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANS 438

BLAST of CSPI05G27660 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 3.9e-46
Identity = 132/441 (29.93%), Postives = 208/441 (47.17%), Query Frame = 1

Query: 21  SISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQF 80
           S S  +S   S  +  +  S TL + HI +  S   P +  S   + LQ  ++    +  
Sbjct: 52  SESLLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFS---SRLQRDSRRVKSIAT 111

Query: 81  LSSLVARRSFV----PIASARQLIQ-----SPTFVVRAKIGTPAQTLLLALDTSNDAAWI 140
           L++ +  R+      P   +  ++      S  +  R  +GTPA+ + + LDT +D  W+
Sbjct: 112 LAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWL 171

Query: 141 PCSGCIGCPSTT--VFSSDKSSSFRPLPCQSSQCNQVPNPSCSG--SACGFNLTYGS-ST 200
            C+ C  C S +  +F   KS ++  +PC S  C ++ +  C+     C + ++YG  S 
Sbjct: 172 QCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSF 231

Query: 201 VAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTF 260
              D   + LT   + V     GC     G  V   GLLGLG+G LS  GQ+   +   F
Sbjct: 232 TVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKF 291

Query: 261 SYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVG-RKIVDIP 320
           SYCL    + +   S+  G  A     ++TPLL NP+  + YYV L+ I VG  ++  + 
Sbjct: 292 SYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVT 351

Query: 321 PSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVP 380
            S    +     G +IDSGT+ TRL+ PAY A+RD FR              FDTC+ + 
Sbjct: 352 ASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLS 411

Query: 381 IIS----PTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQ 440
            ++    PT+   F G +V+LP  N+LI        C A A         L++I ++QQQ
Sbjct: 412 NMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG----LSIIGNIQQQ 471

Query: 441 NHRILFDIPNSRVGVARESCS 443
             R+++D+ +SRVG A   C+
Sbjct: 472 GFRVVYDLASSRVGFAPGGCA 485

BLAST of CSPI05G27660 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 1.1e-45
Identity = 117/348 (33.62%), Postives = 167/348 (47.99%), Query Frame = 1

Query: 102 SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 161
           S  + VR  +G+P +   + +D+ +D  W+ C  C  C   S  VF   KS S+  + C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 162 SSQCNQVPNPSCSGSACGFNLTYGS-STVAADLVQDNLTLATDSVPSYTFGCIRKATGSS 221
           SS C+++ N  C    C + + YG  S     L  + LT A   V +   GC  +  G  
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247

Query: 222 VPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPL 281
           +   GLLG+G G +S +GQ        F YCL S +  + +GSL  G  A P+   + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307

Query: 282 LRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAV 341
           +RNPR  S YYV L  + VG   + +P            G V+D+GT  TRL   AY A 
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 367

Query: 342 RDEFRRRVGRNVTVSSLGGFDTCYT----VPIISPTITFMFA-GMNVTLPPDNFLIHSTA 401
           RD F+ +       S +  FDTCY     V +  PT++F F  G  +TLP  NFL+    
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427

Query: 402 GSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 442
             T C A AA+P      L++I ++QQ+  ++ FD  N  VG     C
Sbjct: 428 SGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CSPI05G27660 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.3e-44
Identity = 123/351 (35.04%), Postives = 177/351 (50.43%), Query Frame = 1

Query: 105 FVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQSSQ 164
           +++   IGTPAQ     +DT +D  W  C  C  C   ST +F+   SSSF  LPC S  
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 165 CNQVPNPSCSGSACGFNLTYGS-STVAADLVQDNLTLATDSVPSYTFGCIRKATG-SSVP 224
           C  + +P+CS + C +   YG  S     +  + LT  + S+P+ TFGC     G     
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGN 214

Query: 225 PQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRI--KYTPL 284
             GL+G+GRGPLSL  Q   L  + FSYC+    S   S +L LG +A  +      T L
Sbjct: 215 GAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGSSTPS-NLLLGSLANSVTAGSPNTTL 274

Query: 285 LRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAG-TVIDSGTTFTRLVAPAYTA 344
           +++ +  + YY+ L  + VG   + I PSA A NS  G G  +IDSGTT T  V  AY +
Sbjct: 275 IQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQS 334

Query: 345 VRDEFRRRVGRNVTVSSLGGFDTCYTVP-----IISPTITFMFAGMNVTLPPDNFLIHST 404
           VR EF  ++   V   S  GFD C+  P     +  PT    F G ++ LP +N+ I S 
Sbjct: 335 VRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFI-SP 394

Query: 405 AGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCSS 444
           +    CLAM ++       +++  ++QQQN  +++D  NS V  A   C +
Sbjct: 395 SNGLICLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQCGA 436

BLAST of CSPI05G27660 vs. TrEMBL
Match: A0A0A0KTR5_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_5G623870 PE=3 SV=1)

HSP 1 Score: 858.2 bits (2216), Expect = 4.4e-246
Identity = 440/443 (99.32%), Postives = 440/443 (99.32%), Query Frame = 1

Query: 1   MNNTKSPFLLLLLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKP 60
           MNNTKSPFL  LLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKP
Sbjct: 1   MNNTKSPFL--LLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKP 60

Query: 61  LSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLL 120
           LSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLL
Sbjct: 61  LSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLL 120

Query: 121 ALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNL 180
           ALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQS QCNQVPNPSCSGSACGFNL
Sbjct: 121 ALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNL 180

Query: 181 TYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS 240
           TYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS
Sbjct: 181 TYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS 240

Query: 241 LYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK 300
           LYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK
Sbjct: 241 LYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK 300

Query: 301 IVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDT 360
           IVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDT
Sbjct: 301 IVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDIPNSRVGVARESCSS 444
           QNHRILFDIPNSRVGVARESCSS
Sbjct: 421 QNHRILFDIPNSRVGVARESCSS 441

BLAST of CSPI05G27660 vs. TrEMBL
Match: A0A067L7Q3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24539 PE=3 SV=1)

HSP 1 Score: 666.0 bits (1717), Expect = 3.2e-188
Identity = 335/430 (77.91%), Postives = 381/430 (88.60%), Query Frame = 1

Query: 13  LLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQA 72
           LL L  LF       H+   C+ + D+ STLQVFH++SPCSPFRPSKPLSW ++VLQMQA
Sbjct: 5   LLSLAFLFFSLAQGLHLNPKCS-SQDQGSTLQVFHVYSPCSPFRPSKPLSWEESVLQMQA 64

Query: 73  KDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIP 132
           KDQARLQFLSSLVA RSFVPIAS RQ+IQSPT++VRAKIGTPAQTLLLA+DTSNDAAWIP
Sbjct: 65  KDQARLQFLSSLVAGRSFVPIASGRQIIQSPTYIVRAKIGTPAQTLLLAVDTSNDAAWIP 124

Query: 133 CSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNLTYGSSTVAADLV 192
           CSGC GC S+TVF S KS+SF+ + C + QC QVPNP+CSGSAC FN TYGSS++AA+L 
Sbjct: 125 CSGCDGC-SSTVFDSVKSTSFQTVGCGAPQCKQVPNPTCSGSACTFNTTYGSSSIAANLS 184

Query: 193 QDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPS 252
           QD ++LATDSVP YTFGCI KATGSSVPPQGLLGLGRGPLSLL Q+Q+LYQSTFSYCLPS
Sbjct: 185 QDTVSLATDSVPGYTFGCIAKATGSSVPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS 244

Query: 253 FKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFN 312
           F+S+NFSG+LRLGP  QP RIK TPLLRNPRRSSLYYVNL++IRVGR++VDIPPSALAFN
Sbjct: 245 FRSLNFSGTLRLGPNGQPKRIKTTPLLRNPRRSSLYYVNLVAIRVGRRVVDIPPSALAFN 304

Query: 313 SATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIISPTIT 372
             TGAGT+ DSGT FTRLV PAYTAVRD FR+RVG N TV+SLGGFDTCY+VPI++PTIT
Sbjct: 305 PTTGAGTIFDSGTVFTRLVTPAYTAVRDAFRKRVG-NATVTSLGGFDTCYSVPIVAPTIT 364

Query: 373 FMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNS 432
           FMF+GMNVTLPP+N LIHSTAGST+CLA+AAAPDNVNSVLNVIA+MQQQNHRILFD+PNS
Sbjct: 365 FMFSGMNVTLPPENLLIHSTAGSTSCLAIAAAPDNVNSVLNVIANMQQQNHRILFDVPNS 424

Query: 433 RVGVARESCS 443
           R+GVARE C+
Sbjct: 425 RLGVAREQCT 431

BLAST of CSPI05G27660 vs. TrEMBL
Match: I1JW44_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_04G136000 PE=3 SV=1)

HSP 1 Score: 658.3 bits (1697), Expect = 6.7e-186
Identity = 328/436 (75.23%), Postives = 381/436 (87.39%), Query Frame = 1

Query: 11  LLLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQM 70
           L  L  L LF + +    +   C+   D  STL+VFH+FSPCSPFRP KPLSWA++VLQ+
Sbjct: 5   LFSLSPLFLFLLFSLVEGLTPKCD-TQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVLQL 64

Query: 71  QAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAW 130
           QAKDQARLQFL+S+VA RS VPIAS RQ+IQSPT++VRAKIG+P QTLLLA+DTSNDAAW
Sbjct: 65  QAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAW 124

Query: 131 IPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNLTYGSSTVAAD 190
           IPC+ C GC ST +F+ +KS++F+ + C S QCNQVPNPSC  SAC FNLTYGSS++AA+
Sbjct: 125 IPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIAAN 184

Query: 191 LVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCL 250
           +VQD +TLATD +P YTFGC+ K TG+S PPQGLLGLGRGPLSLL Q+Q+LYQSTFSYCL
Sbjct: 185 VVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL 244

Query: 251 PSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALA 310
           PSFKS+NFSGSLRLGPVAQPIRIKYTPLL+NPRRSSLYYVNL++IRVGRK+VDIPP ALA
Sbjct: 245 PSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALA 304

Query: 311 FNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVG----RNVTVSSLGGFDTCYTVPI 370
           FN+ATGAGTV DSGT FTRLVAPAYTAVRDEF+RRV      N+TV+SLGGFDTCYTVPI
Sbjct: 305 FNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPI 364

Query: 371 ISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRIL 430
           ++PTITFMF+GMNVTLP DN LIHSTAGSTTCLAMA+APDNVNSVLNVIA+MQQQNHR+L
Sbjct: 365 VAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVL 424

Query: 431 FDIPNSRVGVARESCS 443
           +D+PNSR+GVARE C+
Sbjct: 425 YDVPNSRLGVARELCT 438

BLAST of CSPI05G27660 vs. TrEMBL
Match: A0A0B2PGF3_GLYSO (Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_045617 PE=3 SV=1)

HSP 1 Score: 658.3 bits (1697), Expect = 6.7e-186
Identity = 328/436 (75.23%), Postives = 381/436 (87.39%), Query Frame = 1

Query: 11  LLLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQM 70
           L  L  L LF + +    +   C+   D  STL+VFH+FSPCSPFRP KPLSWA++VLQ+
Sbjct: 5   LFSLSPLFLFLLFSLVEGLTPKCD-TQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVLQL 64

Query: 71  QAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAW 130
           QAKDQARLQFL+S+VA RS VPIAS RQ+IQSPT++VRAKIG+P QTLLLA+DTSNDAAW
Sbjct: 65  QAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAW 124

Query: 131 IPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNLTYGSSTVAAD 190
           IPC+ C GC ST +F+ +KS++F+ + C S QCNQVPNPSC  SAC FNLTYGSS++AA+
Sbjct: 125 IPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIAAN 184

Query: 191 LVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCL 250
           +VQD +TLATD +P YTFGC+ K TG+S PPQGLLGLGRGPLSLL Q+Q+LYQSTFSYCL
Sbjct: 185 VVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL 244

Query: 251 PSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALA 310
           PSFKS+NFSGSLRLGPVAQPIRIKYTPLL+NPRRSSLYYVNL++IRVGRK+VDIPP ALA
Sbjct: 245 PSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALA 304

Query: 311 FNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVG----RNVTVSSLGGFDTCYTVPI 370
           FN+ATGAGTV DSGT FTRLVAPAYTAVRDEF+RRV      N+TV+SLGGFDTCYTVPI
Sbjct: 305 FNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPI 364

Query: 371 ISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRIL 430
           ++PTITFMF+GMNVTLP DN LIHSTAGSTTCLAMA+APDNVNSVLNVIA+MQQQNHR+L
Sbjct: 365 VAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVL 424

Query: 431 FDIPNSRVGVARESCS 443
           +D+PNSR+GVARE C+
Sbjct: 425 YDVPNSRLGVARELCT 438

BLAST of CSPI05G27660 vs. TrEMBL
Match: V4V1N5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10001212mg PE=3 SV=1)

HSP 1 Score: 656.8 bits (1693), Expect = 1.9e-185
Identity = 328/434 (75.58%), Postives = 384/434 (88.48%), Query Frame = 1

Query: 10  LLLLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQ 69
           L+  L  L LFS+S   + I   C+   D SSTLQVFH+FSPCSPF+PSKPLSW ++VL+
Sbjct: 5   LVFFLAFLFLFSLSEGLNPI---CD-TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 64

Query: 70  MQAKDQARLQFLSSL-VARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDA 129
           M AKDQARLQFLSSL VAR+S VPIAS RQ+ QSPT++VRAKIGTPAQTLL+A+DTSNDA
Sbjct: 65  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 124

Query: 130 AWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNLTYGSSTVA 189
           AW+PC+GC+GC S+TVF+S +S++F+ L CQ++QC QVPNP+C G AC FNLTYGSST+A
Sbjct: 125 AWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 184

Query: 190 ADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSY 249
           A+L QD ++LATD VP YTFGCI+KATG+SVPPQGLLGLGRG LSLL Q+Q+LYQSTFSY
Sbjct: 185 ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 244

Query: 250 CLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSA 309
           CLPSFK+++FSGSLRLGP+ QP RIKYTPLL+NPRRSSLYYVNL++IRVGR++VDIPP A
Sbjct: 245 CLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 304

Query: 310 LAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIIS 369
           L FN  TGAGT+IDSGT FTRLVAPAYTAVRD FRRRVG N+TV+SLGGFDTCY+VPI++
Sbjct: 305 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA 364

Query: 370 PTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 429
           PTIT MF+GMNVTLP DN LIHSTAGS TCLAMAAAPDNVNSVLNVIA+MQQQNHRIL+D
Sbjct: 365 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 424

Query: 430 IPNSRVGVARESCS 443
           +PNSR+GVARE C+
Sbjct: 425 VPNSRLGVARELCT 433

BLAST of CSPI05G27660 vs. TAIR10
Match: AT5G07030.1 (AT5G07030.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 577.8 bits (1488), Expect = 5.8e-165
Identity = 286/447 (63.98%), Postives = 354/447 (79.19%), Query Frame = 1

Query: 2   NNTKSPFLLLLLLLLLHLFSI---STAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPS 61
           +N K+   +  L+L L LFSI   +   +H   +     D+ STL++FHI SPCSPF+ S
Sbjct: 9   SNPKAYNTMSTLVLFLQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSS 68

Query: 62  KPLSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTL 121
            PLSW   VLQ  A+DQARLQ+LSSLVA RS VPIAS RQ++QS T++V+A IGTPAQ L
Sbjct: 69  SPLSWEARVLQTLAQDQARLQYLSSLVAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPL 128

Query: 122 LLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGF 181
           LLA+DTS+D AWIPCSGC+GCPS T FS  KS+SF+ + C + QC QVPNP+C   AC F
Sbjct: 129 LLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSF 188

Query: 182 NLTYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSV--PPQGLLGLGRGPLSLLG 241
           NLTYGSS++AA+L QD + LA D + ++TFGC+ K  G     PPQGLLGLGRGPLSL+ 
Sbjct: 189 NLTYGSSSIAANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMS 248

Query: 242 QSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIR 301
           Q+QS+Y+STFSYCLPSF+S+ FSGSLRLGP +QP R+KYT LLRNPRRSSLYYVNL++IR
Sbjct: 249 QAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIR 308

Query: 302 VGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRV-GRNVTVSSL 361
           VGRK+VD+PP+A+AFN +TGAGT+ DSGT +TRL  P Y AVR+EFR+RV      V+SL
Sbjct: 309 VGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSL 368

Query: 362 GGFDTCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVI 421
           GGFDTCY+  +  PTITFMF G+N+T+P DN ++HSTAGST+CLAMAAAP+NVNSV+NVI
Sbjct: 369 GGFDTCYSGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVI 428

Query: 422 ASMQQQNHRILFDIPNSRVGVARESCS 443
           ASMQQQNHR+L D+PN R+G+ARE CS
Sbjct: 429 ASMQQQNHRVLIDVPNGRLGLARERCS 455

BLAST of CSPI05G27660 vs. TAIR10
Match: AT3G54400.1 (AT3G54400.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 563.5 bits (1451), Expect = 1.1e-160
Identity = 295/436 (67.66%), Postives = 353/436 (80.96%), Query Frame = 1

Query: 9   LLLLLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVL 68
           LL+LL+ LL L S S        NCN  +  SS L+VFHI S CSPF+ S  +SWAD +L
Sbjct: 5   LLILLISLLILKSESI-------NCNEKS-HSSDLRVFHINSLCSPFKTS--VSWADTLL 64

Query: 69  QMQAKDQARLQFLSSLVA-RRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSND 128
           Q    D+AR  +LSSL   R+S VPIAS R ++QSPT++VRA IGTPAQ +L+ALDTSND
Sbjct: 65  Q----DKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSND 124

Query: 129 AAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGS-ACGFNLTYGSST 188
           AAWIPCSGC+GC S+ +F   KSSS R L C++ QC Q PNPSC+ S +CGFN+TYG ST
Sbjct: 125 AAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGST 184

Query: 189 VAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTF 248
           + A L QD LTLA+D +P+YTFGCI KA+G+S+P QGL+GLGRGPLSL+ QSQ+LYQSTF
Sbjct: 185 IEAYLTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTF 244

Query: 249 SYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPP 308
           SYCLP+ KS NFSGSLRLGP  QPIRIK TPLL+NPRRSSLYYVNL+ IRVG KIVDIP 
Sbjct: 245 SYCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPT 304

Query: 309 SALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPI 368
           SALAF+ ATGAGT+ DSGT +TRLV PAY AVR+EFRRRV +N   +SLGGFDTCY+  +
Sbjct: 305 SALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRV-KNANATSLGGFDTCYSGSV 364

Query: 369 ISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRIL 428
           + P++TFMFAGMNVTLPPDN LIHS+AG+ +CLAMAAAP NVNSVLNVIASMQQQNHR+L
Sbjct: 365 VFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVL 424

Query: 429 FDIPNSRVGVARESCS 443
            D+PNSR+G++RE+C+
Sbjct: 425 IDVPNSRLGISRETCT 425

BLAST of CSPI05G27660 vs. TAIR10
Match: AT1G09750.1 (AT1G09750.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 406.0 bits (1042), Expect = 3.0e-113
Identity = 227/456 (49.78%), Postives = 304/456 (66.67%), Query Frame = 1

Query: 1   MNNTKSPFLLLLLLLLLHLFSISTAKSHIPSNCNPAA-DRSSTLQVFHIFSPCSPFRPSK 60
           M ++   F   L LLL   F+ +T  +     C  AA D S  L +  I + CSPF P+ 
Sbjct: 1   MASSSLHFFFFLTLLLPFTFTTATRDT-----CATAAPDGSDDLSIIPINAKCSPFAPTH 60

Query: 61  -PLSWADNVLQMQAKDQARLQFLSSLVARR---SFVPIASARQLIQSPTFVVRAKIGTPA 120
              S  D VL M + D  RL +LSSLVA +   + VP+AS  QL     +VVRAK+GTP 
Sbjct: 61  VSASVIDTVLHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQL-HIGNYVVRAKLGTPP 120

Query: 121 QTLLLALDTSNDAAWIPCSGCIGCPS-TTVFSSDKSSSFRPLPCQSSQCNQV-----PNP 180
           Q + + LDTSNDA W+PCSGC GC + +T F+++ SS++  + C ++QC Q      P+ 
Sbjct: 121 QLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSS 180

Query: 181 SCSGSACGFNLTYGS-STVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLG 240
           S   S C FN +YG  S+ +A LVQD LTLA D +P+++FGCI  A+G+S+PPQGL+GLG
Sbjct: 181 SPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLG 240

Query: 241 RGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLY 300
           RGP+SL+ Q+ SLY   FSYCLPSF+S  FSGSL+LG + QP  I+YTPLLRNPRR SLY
Sbjct: 241 RGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLY 300

Query: 301 YVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGR 360
           YVNL  + VG   V + P  L F++ +GAGT+IDSGT  TR   P Y A+RDEFR++V  
Sbjct: 301 YVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNV 360

Query: 361 NVTVSSLGGFDTCYTV--PIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPD 420
           + + S+LG FDTC++     ++P IT     +++ LP +N LIHS+AG+ TCL+MA    
Sbjct: 361 S-SFSTLGAFDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQ 420

Query: 421 NVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCS 443
           N N+VLNVIA++QQQN RILFD+PNSR+G+A E C+
Sbjct: 421 NANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CSPI05G27660 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 187.2 bits (474), Expect = 2.2e-47
Identity = 132/441 (29.93%), Postives = 208/441 (47.17%), Query Frame = 1

Query: 21  SISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQF 80
           S S  +S   S  +  +  S TL + HI +  S   P +  S   + LQ  ++    +  
Sbjct: 52  SESLLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFS---SRLQRDSRRVKSIAT 111

Query: 81  LSSLVARRSFV----PIASARQLIQ-----SPTFVVRAKIGTPAQTLLLALDTSNDAAWI 140
           L++ +  R+      P   +  ++      S  +  R  +GTPA+ + + LDT +D  W+
Sbjct: 112 LAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWL 171

Query: 141 PCSGCIGCPSTT--VFSSDKSSSFRPLPCQSSQCNQVPNPSCSG--SACGFNLTYGS-ST 200
            C+ C  C S +  +F   KS ++  +PC S  C ++ +  C+     C + ++YG  S 
Sbjct: 172 QCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSF 231

Query: 201 VAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTF 260
              D   + LT   + V     GC     G  V   GLLGLG+G LS  GQ+   +   F
Sbjct: 232 TVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKF 291

Query: 261 SYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVG-RKIVDIP 320
           SYCL    + +   S+  G  A     ++TPLL NP+  + YYV L+ I VG  ++  + 
Sbjct: 292 SYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVT 351

Query: 321 PSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVP 380
            S    +     G +IDSGT+ TRL+ PAY A+RD FR              FDTC+ + 
Sbjct: 352 ASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLS 411

Query: 381 IIS----PTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQ 440
            ++    PT+   F G +V+LP  N+LI        C A A         L++I ++QQQ
Sbjct: 412 NMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG----LSIIGNIQQQ 471

Query: 441 NHRILFDIPNSRVGVARESCS 443
             R+++D+ +SRVG A   C+
Sbjct: 472 GFRVVYDLASSRVGFAPGGCA 485

BLAST of CSPI05G27660 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 185.7 bits (470), Expect = 6.4e-47
Identity = 117/348 (33.62%), Postives = 167/348 (47.99%), Query Frame = 1

Query: 102 SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 161
           S  + VR  +G+P +   + +D+ +D  W+ C  C  C   S  VF   KS S+  + C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 162 SSQCNQVPNPSCSGSACGFNLTYGS-STVAADLVQDNLTLATDSVPSYTFGCIRKATGSS 221
           SS C+++ N  C    C + + YG  S     L  + LT A   V +   GC  +  G  
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247

Query: 222 VPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPL 281
           +   GLLG+G G +S +GQ        F YCL S +  + +GSL  G  A P+   + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307

Query: 282 LRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAV 341
           +RNPR  S YYV L  + VG   + +P            G V+D+GT  TRL   AY A 
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 367

Query: 342 RDEFRRRVGRNVTVSSLGGFDTCYT----VPIISPTITFMFA-GMNVTLPPDNFLIHSTA 401
           RD F+ +       S +  FDTCY     V +  PT++F F  G  +TLP  NFL+    
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427

Query: 402 GSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 442
             T C A AA+P      L++I ++QQ+  ++ FD  N  VG     C
Sbjct: 428 SGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CSPI05G27660 vs. NCBI nr
Match: gi|449449334|ref|XP_004142420.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 858.2 bits (2216), Expect = 6.3e-246
Identity = 440/443 (99.32%), Postives = 440/443 (99.32%), Query Frame = 1

Query: 1   MNNTKSPFLLLLLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKP 60
           MNNTKSPFL  LLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKP
Sbjct: 1   MNNTKSPFL--LLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKP 60

Query: 61  LSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLL 120
           LSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLL
Sbjct: 61  LSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLL 120

Query: 121 ALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNL 180
           ALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQS QCNQVPNPSCSGSACGFNL
Sbjct: 121 ALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNL 180

Query: 181 TYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS 240
           TYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS
Sbjct: 181 TYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS 240

Query: 241 LYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK 300
           LYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK
Sbjct: 241 LYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK 300

Query: 301 IVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDT 360
           IVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDT
Sbjct: 301 IVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDIPNSRVGVARESCSS 444
           QNHRILFDIPNSRVGVARESCSS
Sbjct: 421 QNHRILFDIPNSRVGVARESCSS 441

BLAST of CSPI05G27660 vs. NCBI nr
Match: gi|659092233|ref|XP_008446966.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo])

HSP 1 Score: 834.7 bits (2155), Expect = 7.4e-239
Identity = 427/444 (96.17%), Postives = 436/444 (98.20%), Query Frame = 1

Query: 1   MNNTKSPFLLL-LLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSK 60
           MNNTKSPFL L LLLLLL LFSISTAKSHIP NCNPA DRSSTL+VFHIFSPCSPFRPSK
Sbjct: 1   MNNTKSPFLPLPLLLLLLLLFSISTAKSHIPLNCNPADDRSSTLKVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVARRS VPIASARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVARRSVVPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 LALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFN 180
           LALDTSNDAAWIPCSGC+GCPSTTVFSSDKSSSFRPLPCQS QCNQVPNPSCSG+ACGFN
Sbjct: 121 LALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGTACGFN 180

Query: 181 LTYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQ 240
           LTYGSSTVAADLVQDN+TLATDSVP+YTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQ
Sbjct: 181 LTYGSSTVAADLVQDNVTLATDSVPAYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGR 300
           SLY+STFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLI+IRVGR
Sbjct: 241 SLYRSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 KIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFD 360
           +IVDIPPSALAFNSATGAGT+IDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFD
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFD 360

Query: 361 TCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQ 420
           TCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQ
Sbjct: 361 TCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQ 420

Query: 421 QQNHRILFDIPNSRVGVARESCSS 444
           QQNHRILFDIPNSRVGVARE CSS
Sbjct: 421 QQNHRILFDIPNSRVGVAREPCSS 444

BLAST of CSPI05G27660 vs. NCBI nr
Match: gi|802574128|ref|XP_012068673.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas])

HSP 1 Score: 666.0 bits (1717), Expect = 4.6e-188
Identity = 335/430 (77.91%), Postives = 381/430 (88.60%), Query Frame = 1

Query: 13  LLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQA 72
           LL L  LF       H+   C+ + D+ STLQVFH++SPCSPFRPSKPLSW ++VLQMQA
Sbjct: 5   LLSLAFLFFSLAQGLHLNPKCS-SQDQGSTLQVFHVYSPCSPFRPSKPLSWEESVLQMQA 64

Query: 73  KDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIP 132
           KDQARLQFLSSLVA RSFVPIAS RQ+IQSPT++VRAKIGTPAQTLLLA+DTSNDAAWIP
Sbjct: 65  KDQARLQFLSSLVAGRSFVPIASGRQIIQSPTYIVRAKIGTPAQTLLLAVDTSNDAAWIP 124

Query: 133 CSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNLTYGSSTVAADLV 192
           CSGC GC S+TVF S KS+SF+ + C + QC QVPNP+CSGSAC FN TYGSS++AA+L 
Sbjct: 125 CSGCDGC-SSTVFDSVKSTSFQTVGCGAPQCKQVPNPTCSGSACTFNTTYGSSSIAANLS 184

Query: 193 QDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPS 252
           QD ++LATDSVP YTFGCI KATGSSVPPQGLLGLGRGPLSLL Q+Q+LYQSTFSYCLPS
Sbjct: 185 QDTVSLATDSVPGYTFGCIAKATGSSVPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS 244

Query: 253 FKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFN 312
           F+S+NFSG+LRLGP  QP RIK TPLLRNPRRSSLYYVNL++IRVGR++VDIPPSALAFN
Sbjct: 245 FRSLNFSGTLRLGPNGQPKRIKTTPLLRNPRRSSLYYVNLVAIRVGRRVVDIPPSALAFN 304

Query: 313 SATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIISPTIT 372
             TGAGT+ DSGT FTRLV PAYTAVRD FR+RVG N TV+SLGGFDTCY+VPI++PTIT
Sbjct: 305 PTTGAGTIFDSGTVFTRLVTPAYTAVRDAFRKRVG-NATVTSLGGFDTCYSVPIVAPTIT 364

Query: 373 FMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNS 432
           FMF+GMNVTLPP+N LIHSTAGST+CLA+AAAPDNVNSVLNVIA+MQQQNHRILFD+PNS
Sbjct: 365 FMFSGMNVTLPPENLLIHSTAGSTSCLAIAAAPDNVNSVLNVIANMQQQNHRILFDVPNS 424

Query: 433 RVGVARESCS 443
           R+GVARE C+
Sbjct: 425 RLGVAREQCT 431

BLAST of CSPI05G27660 vs. NCBI nr
Match: gi|1009171231|ref|XP_015866632.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Ziziphus jujuba])

HSP 1 Score: 662.5 bits (1708), Expect = 5.1e-187
Identity = 323/421 (76.72%), Postives = 372/421 (88.36%), Query Frame = 1

Query: 22  ISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFL 81
           I+T    +  NC    D+ STLQV H++SPCSPFRPSKPLSW ++VLQMQAKDQARLQFL
Sbjct: 17  ITTIVQGLNLNCQNQ-DKGSTLQVLHVYSPCSPFRPSKPLSWEESVLQMQAKDQARLQFL 76

Query: 82  SSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPS 141
           SSLVAR+S VPIAS RQ+IQSPT++VRAKIGTP QTLLLALDTSNDAAWIPC+GC+GC S
Sbjct: 77  SSLVARKSVVPIASGRQIIQSPTYIVRAKIGTPVQTLLLALDTSNDAAWIPCAGCVGC-S 136

Query: 142 TTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNLTYGSSTVAADLVQDNLTLATD 201
           +  F+  KS+SF+ L CQ+ QC QVPNP+C+G+ C FNLTYG S++AADL QD +TLA D
Sbjct: 137 SAAFAPIKSTSFKSLGCQAPQCRQVPNPTCTGTTCSFNLTYGGSSIAADLSQDTITLAND 196

Query: 202 SVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGS 261
           +VP+YTFGCI+KATGSS+PPQGLLGLGRGPLSLL Q+Q+LYQSTFSYCLPSFKS+NFSGS
Sbjct: 197 AVPAYTFGCIKKATGSSLPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGS 256

Query: 262 LRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVI 321
           LRLGP+ QPIRIKYTPLL+NPRRSSLYYVNL +IRVGRK+VDIPP+ LAFN  TGAGT+ 
Sbjct: 257 LRLGPIGQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKVVDIPPADLAFNPTTGAGTIF 316

Query: 322 DSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIISPTITFMFAGMNVT 381
           DSGT FTRLVA AYTAVRDEFR+RV  N  +++LGGFDTCY+VPI +PT+TFMF GMNVT
Sbjct: 317 DSGTVFTRLVASAYTAVRDEFRKRVKPNAPITTLGGFDTCYSVPITAPTVTFMFTGMNVT 376

Query: 382 LPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 441
           LPPDN LIHSTAGS TCLAMAAAPDNVNSVLNVIA+MQQQNHRIL+D+PNSR+GVARE C
Sbjct: 377 LPPDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREPC 435

Query: 442 S 443
           +
Sbjct: 437 T 435

BLAST of CSPI05G27660 vs. NCBI nr
Match: gi|356508308|ref|XP_003522900.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Glycine max])

HSP 1 Score: 658.3 bits (1697), Expect = 9.5e-186
Identity = 328/436 (75.23%), Postives = 381/436 (87.39%), Query Frame = 1

Query: 11  LLLLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQM 70
           L  L  L LF + +    +   C+   D  STL+VFH+FSPCSPFRP KPLSWA++VLQ+
Sbjct: 5   LFSLSPLFLFLLFSLVEGLTPKCD-TQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVLQL 64

Query: 71  QAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAW 130
           QAKDQARLQFL+S+VA RS VPIAS RQ+IQSPT++VRAKIG+P QTLLLA+DTSNDAAW
Sbjct: 65  QAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAW 124

Query: 131 IPCSGCIGCPSTTVFSSDKSSSFRPLPCQSSQCNQVPNPSCSGSACGFNLTYGSSTVAAD 190
           IPC+ C GC ST +F+ +KS++F+ + C S QCNQVPNPSC  SAC FNLTYGSS++AA+
Sbjct: 125 IPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIAAN 184

Query: 191 LVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCL 250
           +VQD +TLATD +P YTFGC+ K TG+S PPQGLLGLGRGPLSLL Q+Q+LYQSTFSYCL
Sbjct: 185 VVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL 244

Query: 251 PSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALA 310
           PSFKS+NFSGSLRLGPVAQPIRIKYTPLL+NPRRSSLYYVNL++IRVGRK+VDIPP ALA
Sbjct: 245 PSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALA 304

Query: 311 FNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVG----RNVTVSSLGGFDTCYTVPI 370
           FN+ATGAGTV DSGT FTRLVAPAYTAVRDEF+RRV      N+TV+SLGGFDTCYTVPI
Sbjct: 305 FNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPI 364

Query: 371 ISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRIL 430
           ++PTITFMF+GMNVTLP DN LIHSTAGSTTCLAMA+APDNVNSVLNVIA+MQQQNHR+L
Sbjct: 365 VAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVL 424

Query: 431 FDIPNSRVGVARESCS 443
           +D+PNSR+GVARE C+
Sbjct: 425 YDVPNSRLGVARELCT 438

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AED3_ARATH5.3e-11249.78Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1[more]
AP25_ORYSJ4.0e-9948.14Aspartyl protease 25 OS=Oryza sativa subsp. japonica GN=AP25 PE=2 SV=1[more]
APF2_ARATH3.9e-4629.93Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
ASPG2_ARATH1.1e-4533.62Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
NEP1_NEPGR1.3e-4435.04Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KTR5_CUCSA4.4e-24699.32Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_5G623870 PE=3 SV=1[more]
A0A067L7Q3_JATCU3.2e-18877.91Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24539 PE=3 SV=1[more]
I1JW44_SOYBN6.7e-18675.23Uncharacterized protein OS=Glycine max GN=GLYMA_04G136000 PE=3 SV=1[more]
A0A0B2PGF3_GLYSO6.7e-18675.23Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_045617 PE=3 SV=1[more]
V4V1N5_9ROSI1.9e-18575.58Uncharacterized protein OS=Citrus clementina GN=CICLE_v10001212mg PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G07030.15.8e-16563.98 Eukaryotic aspartyl protease family protein[more]
AT3G54400.11.1e-16067.66 Eukaryotic aspartyl protease family protein[more]
AT1G09750.13.0e-11349.78 Eukaryotic aspartyl protease family protein[more]
AT1G01300.12.2e-4729.93 Eukaryotic aspartyl protease family protein[more]
AT3G20015.16.4e-4733.62 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|449449334|ref|XP_004142420.1|6.3e-24699.32PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus][more]
gi|659092233|ref|XP_008446966.1|7.4e-23996.17PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo][more]
gi|802574128|ref|XP_012068673.1|4.6e-18877.91PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas][more]
gi|1009171231|ref|XP_015866632.1|5.1e-18776.72PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Ziziphus jujuba][more]
gi|356508308|ref|XP_003522900.1|9.5e-18675.23PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Glycine max][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0048046 apoplast
cellular_component GO:0009507 chloroplast
cellular_component GO:0009505 plant-type cell wall
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI05G27660.1CSPI05G27660.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 10..442
score: 4.5E
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 272..442
score: 2.2E-41coord: 102..265
score: 2.5
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 104..441
score: 4.55
NoneNo IPR availablePANTHERPTHR13683:SF258ASPARTYL PROTEASE FAMILY PROTEIN-RELATEDcoord: 10..442
score: 4.5E