CSPI07G19920 (gene) Wild cucumber (PI 183967)

NameCSPI07G19920
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionAspartic proteinase nepenthesin-1, putative
LocationChr7 : 17407167 .. 17410225 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTTAATTTTCTCTCATAGCCATGGAGATTTCGAAATCTCTCCATTTCCCCCTTTCTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTCTCCATTGGCGTCGATGCTCGTTCGAGCTCGTTCAACCTCGGTAATGGTGATAACCATGAGAAGGGTCTTCTTCAGCTTTTTCAGAATTTTCCATGGAAGGAGCATGGAGAAGCTGTTGTTAATTGCATCTTTCAGAAGCCAAGTGAGTGTTTCTCTTTTTTTACACTAATCTTCATTGTTGTTGTTGTTAATTATCTTTGGGATCTGTTTTTTCTTTTAATGAAAAATGTGACCTTTTTGCTTTTGCTTTTTTGTAAGGATATAGGATTTGCCATTTTTTTTCTTGCTTTAAAATGTGAACTCTCATCATTAATTGCTTTTTTCTTTTCTTTATAAAGTTAATTAAATTTGGGTTTGTGTAATTTTCTAGTAAAAGCTAATCTTTTTTTGGGCTCTTGGGTTATGTTTAAATCTCTTTTTAAAGGGTTTTTTTTTTCTTTTAAAAGAAAAAACATATCAAACTATTAACTCACATTATTACATTTTGAATTTCTTAAACAGAAATTTGTTTTTCCTCTTTTAGGTTTTTTAGCAAAAAATACTTTGTATTTTTTTTCTCTCTGTTGGGCTATTTCTCTCTTCTGTTTTCTTTTTTTCCTTTTTTTTTTGGGAAATTACATTCACCAATTAATAATGGATTTTTCTTTCCCTTATTGTGCAAATTTCAATCAAGTTATTAATGTACCTTTTTCTTTTTTTTAACATTTTATTAGAATCATAAGACTTAAAATGATAGTAAAAAAAACATTATTATAATTTTTCACCAAAGTATTCATTCTAGATTCTATTTCTAAAGAATACAAATGAAATATATACACACACCCACATTATCATAGTAGTTTGCTAAATGAGGGTTTGAAGATTTAACTTCTTCTTTTTTTGTGTTAAATTAAATAATACTTCAAAATTTAATAAATCAAATTCCGACATGCTCCATATTGAAACTATATGAAAACTATCTCTAATGGTAAAATTTCCAAGTTACCCTTGATTAAAGATTTGGGTGAGTAGGTAACATCGTTTTACATTAAAGGTTATATTGGTAATTAAGAAATTTGAGAGGGATGTCGATATATATATATATATATGTATTATAATTTTGTTTAAAAAATTTAAATATTCTAATTATTGTCAGATATGTAATTTGTTGTTTTGTTAAGGAGATAACTATAAATTCTATTATATTTTTTATACTGTTTATCTTGTGAAGCCTATGGGAAAAAAGAAAAAAAAATGGTGTTTTACAATACAAGTAGTATTTTCACGAATATTCATGTGGAGGGAAATCCACAATTGAATTAGCTTCAATATAATTAATTTTGTGTCTCAATTAATTATGGAATTTGAATTTCAGAGATAACGAAGGGAATAACGACACTGGAAATGAAACAGAGAGATTACTGTTCAGGCAAAATAACAGACTGGGAAAAGATTTTTCAAAACCGTATCATCCTCGACGCCATTAACGTCAATTCCCTTTTTTCACATTTCAAATCAGCCATTTTTCCCGGCCAGACTCACCAACTTTCCGATTCCCAAATCCCCATTTCCTCCGGCGCCAGGCTCCAAACACTCAATTACATCGTCACCGTCGGCATCGGCGGCCAGAATTCCACTCTCATCGTTGACACCGGCAGCGATCTCACTTGGGTTCAATGCCTCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCAATCCCTCTAATTCCTCTTCCTTCCTCTCCCTTCCTTGTAATTCCCCTACCTGTGTGGCTCTTCAACCCACCGCCGGAAGCTCCGGCCTCTGTTCTAACAAAAACTCAACTTCCTGCGACTACCAGATCGACTACGGCGATGGATCTTACTCCCGTGGGGAACTCGGATTCGAGAAGCTGACTTTAGGTAAAACTGAGATTGATAGTTTCATATTCGGATGTGGCCGGAATAACAAAGGATTATTTGGAGGAGCTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTGGTTTCTCAAACGTCCTCTCTTTTTGGTAGTGTTTTTTCTTACTGTTTACCAACAACTGGAGTTGGATCTTCAGGTTCCTTAACATTGGGGGGAGCCGATTTCTCAAATTTCAAGAACATTTCACCAATTTCCTACACAAGAATGATTCAAAACCCACAAATGTCGAATTTCTACTTTCTGAATCTGACTGGAATTTCAATCGGTGGGGTTAATTTGAATGTGCCTCGTTTATCTTCAAACGAAGGGGTTTTGAGTTTACTCGATTCGGGAACAGTGATAACAAGGCTATCTCCATCGATTTACAAAGCTTTCAAAGCAGAATTTGAGAAACAATTTTCTGGGTATCGAACTACACCAGGATTTTCGATTCTTAATACTTGTTTTAATCTAACAGGGTACGAAGAAGTGAATATTCCTACTGTGAAATTTATCTTTGAAGGCAATGCAGAGATGGTTGTTGATGTTGAAGGGGTTTTTTATTTTGTGAAATCTGATGCTTCACAGATTTGTTTAGCGTTTGCGAGTTTGGGTTATGAAGATCAGACGATGATAATTGGGAATTATCAGCAGAAGAATCAAAGGGTTATTTATAATTCTAAAGAATCTAAAGTGGGTTTTGCAGGGGAGCCTTGCAGTTTTTAAGAATTTTTCCCGGAAAATGGGTGATTTTCCCGGAAAATTTGATTGGATCTCGAGAAATATTGAGGCTGATTATTTTAGTGTAATGGGTTGGGTTGGATTTGACATTGGGTTAAAACGAAATGGAGTTCTTGTGATGAATTAATCTCGTTTCGTCCAAGAAGAAGAAGAAATTTGGATTTGTATATTTGATTTTTACTTGTTTTGTAATCTGTATATTTATTTCAATTCAATTGTTACAATCAATTCATACATCTTTTGTTCAAGTTCACATTTTTGTGTGTAAAGCTTTGATGGGATCTTTTTTGTACTTATCATTCAATTCAACGCGAAA

mRNA sequence

ATGGAGATTTCGAAATCTCTCCATTTCCCCCTTTCTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTCTCCATTGGCGTCGATGCTCGTTCGAGCTCGTTCAACCTCGGTAATGGTGATAACCATGAGAAGGGTCTTCTTCAGCTTTTTCAGAATTTTCCATGGAAGGAGCATGGAGAAGCTGTTGTTAATTGCATCTTTCAGAAGCCAAAGATAACGAAGGGAATAACGACACTGGAAATGAAACAGAGAGATTACTGTTCAGGCAAAATAACAGACTGGGAAAAGATTTTTCAAAACCGTATCATCCTCGACGCCATTAACGTCAATTCCCTTTTTTCACATTTCAAATCAGCCATTTTTCCCGGCCAGACTCACCAACTTTCCGATTCCCAAATCCCCATTTCCTCCGGCGCCAGGCTCCAAACACTCAATTACATCGTCACCGTCGGCATCGGCGGCCAGAATTCCACTCTCATCGTTGACACCGGCAGCGATCTCACTTGGGTTCAATGCCTCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCAATCCCTCTAATTCCTCTTCCTTCCTCTCCCTTCCTTGTAATTCCCCTACCTGTGTGGCTCTTCAACCCACCGCCGGAAGCTCCGGCCTCTGTTCTAACAAAAACTCAACTTCCTGCGACTACCAGATCGACTACGGCGATGGATCTTACTCCCGTGGGGAACTCGGATTCGAGAAGCTGACTTTAGGTAAAACTGAGATTGATAGTTTCATATTCGGATGTGGCCGGAATAACAAAGGATTATTTGGAGGAGCTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTGGTTTCTCAAACGTCCTCTCTTTTTGGTAGTGTTTTTTCTTACTGTTTACCAACAACTGGAGTTGGATCTTCAGGTTCCTTAACATTGGGGGGAGCCGATTTCTCAAATTTCAAGAACATTTCACCAATTTCCTACACAAGAATGATTCAAAACCCACAAATGTCGAATTTCTACTTTCTGAATCTGACTGGAATTTCAATCGGTGGGGTTAATTTGAATGTGCCTCGTTTATCTTCAAACGAAGGGGTTTTGAGTTTACTCGATTCGGGAACAGTGATAACAAGGCTATCTCCATCGATTTACAAAGCTTTCAAAGCAGAATTTGAGAAACAATTTTCTGGGTATCGAACTACACCAGGATTTTCGATTCTTAATACTTGTTTTAATCTAACAGGGTACGAAGAAGTGAATATTCCTACTGTGAAATTTATCTTTGAAGGCAATGCAGAGATGGTTGTTGATGTTGAAGGGGTTTTTTATTTTGTGAAATCTGATGCTTCACAGATTTGTTTAGCGTTTGCGAGTTTGGGTTATGAAGATCAGACGATGATAATTGGGAATTATCAGCAGAAGAATCAAAGGGTTATTTATAATTCTAAAGAATCTAAAGTGGGTTTTGCAGGGGAGCCTTGCAGTTTTTAA

Coding sequence (CDS)

ATGGAGATTTCGAAATCTCTCCATTTCCCCCTTTCTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTCTCCATTGGCGTCGATGCTCGTTCGAGCTCGTTCAACCTCGGTAATGGTGATAACCATGAGAAGGGTCTTCTTCAGCTTTTTCAGAATTTTCCATGGAAGGAGCATGGAGAAGCTGTTGTTAATTGCATCTTTCAGAAGCCAAAGATAACGAAGGGAATAACGACACTGGAAATGAAACAGAGAGATTACTGTTCAGGCAAAATAACAGACTGGGAAAAGATTTTTCAAAACCGTATCATCCTCGACGCCATTAACGTCAATTCCCTTTTTTCACATTTCAAATCAGCCATTTTTCCCGGCCAGACTCACCAACTTTCCGATTCCCAAATCCCCATTTCCTCCGGCGCCAGGCTCCAAACACTCAATTACATCGTCACCGTCGGCATCGGCGGCCAGAATTCCACTCTCATCGTTGACACCGGCAGCGATCTCACTTGGGTTCAATGCCTCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCAATCCCTCTAATTCCTCTTCCTTCCTCTCCCTTCCTTGTAATTCCCCTACCTGTGTGGCTCTTCAACCCACCGCCGGAAGCTCCGGCCTCTGTTCTAACAAAAACTCAACTTCCTGCGACTACCAGATCGACTACGGCGATGGATCTTACTCCCGTGGGGAACTCGGATTCGAGAAGCTGACTTTAGGTAAAACTGAGATTGATAGTTTCATATTCGGATGTGGCCGGAATAACAAAGGATTATTTGGAGGAGCTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTGGTTTCTCAAACGTCCTCTCTTTTTGGTAGTGTTTTTTCTTACTGTTTACCAACAACTGGAGTTGGATCTTCAGGTTCCTTAACATTGGGGGGAGCCGATTTCTCAAATTTCAAGAACATTTCACCAATTTCCTACACAAGAATGATTCAAAACCCACAAATGTCGAATTTCTACTTTCTGAATCTGACTGGAATTTCAATCGGTGGGGTTAATTTGAATGTGCCTCGTTTATCTTCAAACGAAGGGGTTTTGAGTTTACTCGATTCGGGAACAGTGATAACAAGGCTATCTCCATCGATTTACAAAGCTTTCAAAGCAGAATTTGAGAAACAATTTTCTGGGTATCGAACTACACCAGGATTTTCGATTCTTAATACTTGTTTTAATCTAACAGGGTACGAAGAAGTGAATATTCCTACTGTGAAATTTATCTTTGAAGGCAATGCAGAGATGGTTGTTGATGTTGAAGGGGTTTTTTATTTTGTGAAATCTGATGCTTCACAGATTTGTTTAGCGTTTGCGAGTTTGGGTTATGAAGATCAGACGATGATAATTGGGAATTATCAGCAGAAGAATCAAAGGGTTATTTATAATTCTAAAGAATCTAAAGTGGGTTTTGCAGGGGAGCCTTGCAGTTTTTAA
BLAST of CSPI07G19920 vs. Swiss-Prot
Match: ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 8.4e-82
Identity = 172/398 (43.22%), Postives = 237/398 (59.55%), Query Frame = 1

Query: 104 LDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNS--TLIV 163
           LD   VNS+ S     +      +   + +P   G+ L + NYIVTVG+G   +  +LI 
Sbjct: 90  LDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIF 149

Query: 164 DTGSDLTWVQCLPC-RLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 223
           DTGSDLTW QC PC R CY+Q+EP+FNPS S+S+ ++ C+S  C +L    G++G CS  
Sbjct: 150 DTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS 209

Query: 224 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DSFIFGCGRNNKGLFGGASGLMGLAR 283
           N   C Y I YGD S+S G L  EK TL  +++ D   FGCG NN+GLF G +GL+GL R
Sbjct: 210 N---CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGR 269

Query: 284 SELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMS 343
            +LS  SQT++ +  +FSYCLP++    +G LT G A  S     +PIS          +
Sbjct: 270 DKLSFPSQTATAYNKIFSYCLPSSA-SYTGHLTFGSAGISRSVKFTPISTIT-----DGT 329

Query: 344 NFYFLNLTGISIGGVNLNVPR-LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSG 403
           +FY LN+  I++GG  L +P  + S  G  +L+DSGTVITRL P  Y A ++ F+ + S 
Sbjct: 330 SFYGLNIVAITVGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSK 389

Query: 404 YRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFA 463
           Y TT G SIL+TCF+L+G++ V IP V F F G A + +  +G+FY  K   SQ+CLAFA
Sbjct: 390 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFA 449

Query: 464 SLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 497
               +    I GN QQ+   V+Y+    +VGFA   CS
Sbjct: 450 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of CSPI07G19920 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 7.4e-70
Identity = 160/414 (38.65%), Postives = 235/414 (56.76%), Query Frame = 1

Query: 87  CSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNY 146
           CS   +D        I  D   V S++S   S     +  +   +++P  SG  L + NY
Sbjct: 74  CSHLSSDARVDHDEIIRRDQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNY 133

Query: 147 IVTVGIGG--QNSTLIVDTGSDLTWVQCLPCR-LCYNQQEPLFNPSNSSSFLSLPCNSPT 206
           IVT+GIG    + +L+ DTGSDLTW QC PC   CY+Q+EP FNPS+SS++ ++ C+SP 
Sbjct: 134 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 193

Query: 207 CVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DSFIFGCGR 266
           C   +        CS  N   C Y I YGD S+++G L  EK TL  +++ +   FGCG 
Sbjct: 194 CEDAES-------CSASN---CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGE 253

Query: 267 NNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFK 326
           NN+GLF G +GL+GL   +LSL +QT++ + ++FSYCLP+    S+G LT G A  S   
Sbjct: 254 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 313

Query: 327 NISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNV-PRLSSNEGVLSLLDSGTVITRLS 386
             +PIS       P   N Y +++ GIS+G   L + P   S EG  +++DSGTV TRL 
Sbjct: 314 KFTPIS-----SFPSAFN-YGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFTRLP 373

Query: 387 PSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEG 446
             +Y   ++ F+++ S Y++T G+ + +TC++ TG + V  PT+ F F G+  + +D  G
Sbjct: 374 TKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSG 433

Query: 447 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 496
           +   +K   SQ+CLAFA  G +D   I GN QQ    V+Y+    +VGFA   C
Sbjct: 434 ISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of CSPI07G19920 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 3.7e-61
Identity = 133/410 (32.44%), Postives = 212/410 (51.71%), Query Frame = 1

Query: 101 RIILDAINVNSLFSHFKSAIFPGQT--HQLSDSQIPISSGARLQTLNYIVTVGIGG--QN 160
           R+  D   V+++       + P     ++++D    I SG    +  Y V +G+G   ++
Sbjct: 84  RMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRD 143

Query: 161 STLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGL 220
             +++D+GSD+ WVQC PC+LCY Q +P+F+P+ S S+  + C S  C  ++ +   SG 
Sbjct: 144 QYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSG- 203

Query: 221 CSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMG 280
                   C Y++ YGDGSY++G L  E LT  KT + +   GCG  N+G+F GA+GL+G
Sbjct: 204 -------GCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLG 263

Query: 281 LARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYTRMIQ 340
           +    +S V Q S   G  F YCL + G  S+GSL  G       +   P+  S+  +++
Sbjct: 264 IGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFG-------REALPVGASWVPLVR 323

Query: 341 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSL---------LDSGTVITRLSPSIY 400
           NP+  +FY++ L G+ +GGV + +P     +GV  L         +D+GT +TRL  + Y
Sbjct: 324 NPRAPSFYYVGLKGLGVGGVRIPLP-----DGVFDLTETGDGGVVMDTGTAVTRLPTAAY 383

Query: 401 KAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEGVFYF 460
            AF+  F+ Q +      G SI +TC++L+G+  V +PTV F F     + +     F  
Sbjct: 384 VAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARN-FLM 443

Query: 461 VKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 496
              D+   C AFA+        IIGN QQ+  +V ++     VGF    C
Sbjct: 444 PVDDSGTYCFAFAA--SPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CSPI07G19920 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.2e-59
Identity = 152/436 (34.86%), Postives = 227/436 (52.06%), Query Frame = 1

Query: 78  TLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQ--THQLSDSQIPI 137
           TL +   D  S   T  +++F +R+  D+  V S+ +   +A  PG+  TH         
Sbjct: 73  TLNLDHIDALSSNKTP-DELFSSRLQRDSRRVKSIAT--LAAQIPGRNVTHAPRPGGFSS 132

Query: 138 S--SGARLQTLNYIVTVGIG--GQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNS 197
           S  SG    +  Y   +G+G   +   +++DTGSD+ W+QC PCR CY+Q +P+F+P  S
Sbjct: 133 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 192

Query: 198 SSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT 257
            ++ ++PC+SP C  L  +AG    C+ +  T C YQ+ YGDGS++ G+   E LT  + 
Sbjct: 193 KTYATIPCSSPHCRRLD-SAG----CNTRRKT-CLYQVSYGDGSFTVGDFSTETLTFRRN 252

Query: 258 EIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS-GS 317
            +     GCG +N+GLF GA+GL+GL + +LS   QT   F   FSYCL      S   S
Sbjct: 253 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS 312

Query: 318 LTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSS------- 377
           +  G A  S     +P     ++ NP++  FY++ L GIS+GG    VP +++       
Sbjct: 313 VVFGNAAVSRIARFTP-----LLSNPKLDTFYYVGLLGISVGGT--RVPGVTASLFKLDQ 372

Query: 378 --NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 437
             N GV  ++DSGT +TRL    Y A +  F       +  P FS+ +TCF+L+   EV 
Sbjct: 373 IGNGGV--IIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVK 432

Query: 438 IPTVKFIFEGNAEMVVDVEGVFYFVKSDAS-QICLAFASLGYEDQTMIIGNYQQKNQRVI 497
           +PTV   F G     V +    Y +  D + + C AFA  G      IIGN QQ+  RV+
Sbjct: 433 VPTVVLHFRG---ADVSLPATNYLIPVDTNGKFCFAFA--GTMGGLSIIGNIQQQGFRVV 485

BLAST of CSPI07G19920 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 1.8e-55
Identity = 137/442 (31.00%), Postives = 225/442 (50.90%), Query Frame = 1

Query: 78  TLEMKQRD-YCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF------------PGQ 137
           +LE+  RD + + +  D++ +  +R+  D+  V  + +  + A+                
Sbjct: 81  SLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDT 140

Query: 138 THQLSDSQIPISSGARLQTLNYIVTVGIG--GQNSTLIVDTGSDLTWVQCLPCRLCYNQQ 197
            +Q  D   P+ SGA   +  Y   +G+G   +   L++DTGSD+ W+QC PC  CY Q 
Sbjct: 141 RYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQS 200

Query: 198 EPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELG 257
           +P+FNP++SS++ SL C++P C  L+ +A     C    S  C YQ+ YGDGS++ GEL 
Sbjct: 201 DPVFNPTSSSTYKSLTCSAPQCSLLETSA-----C---RSNKCLYQVSYGDGSFTVGELA 260

Query: 258 FEKLTLGKT-EIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 317
            + +T G + +I++   GCG +N+GLF GA+GL+GL    LS+ +Q  +   + FSYCL 
Sbjct: 261 TDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKA---TSFSYCLV 320

Query: 318 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP-- 377
               G S SL             +P     +++N ++  FY++ L+G S+GG  + +P  
Sbjct: 321 DRDSGKSSSLDFNSVQLGGGDATAP-----LLRNKKIDTFYYVGLSGFSVGGEKVVLPDA 380

Query: 378 ----RLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRT-TPGFSILNTCFNL 437
                 S + GV  +LD GT +TRL    Y + +  F K     +  +   S+ +TC++ 
Sbjct: 381 IFDVDASGSGGV--ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 440

Query: 438 TGYEEVNIPTVKFIFEGNAEMVVDVEGVFYFVK-SDASQICLAFASLGYEDQTMIIGNYQ 496
           +    V +PTV F F G   +  D+    Y +   D+   C AFA         IIGN Q
Sbjct: 441 SSLSTVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDSGTFCFAFAPT--SSSLSIIGNVQ 500

BLAST of CSPI07G19920 vs. TrEMBL
Match: A0A0A0K8J2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431320 PE=3 SV=1)

HSP 1 Score: 986.9 bits (2550), Expect = 9.2e-285
Identity = 494/497 (99.40%), Postives = 496/497 (99.80%), Query Frame = 1

Query: 1   MEISKSLHFPLSLLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE 60
           MEISKSLHFPLSLLLLLLL PLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE
Sbjct: 1   MEISKSLHFPLSLLLLLLL-PLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE 60

Query: 61  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120
           AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI
Sbjct: 61  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120

Query: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180
           FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN
Sbjct: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180

Query: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240
           QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE
Sbjct: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240

Query: 241 LGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300
           LGFEKLTLGKTEID+FIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL
Sbjct: 241 LGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300

Query: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360
           PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR
Sbjct: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360

Query: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420
           LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV
Sbjct: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420

Query: 421 NIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480
           NIPTVKFIFEGNAEM+VDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI
Sbjct: 421 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480

Query: 481 YNSKESKVGFAGEPCSF 498
           YNSKESKVGFAGEPCSF
Sbjct: 481 YNSKESKVGFAGEPCSF 496

BLAST of CSPI07G19920 vs. TrEMBL
Match: M5WTF1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005040mg PE=3 SV=1)

HSP 1 Score: 581.3 bits (1497), Expect = 1.2e-162
Identity = 299/457 (65.43%), Postives = 363/457 (79.43%), Query Frame = 1

Query: 43  EKGLLQLFQNFPWKEHGEAVVN-CIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNR 102
           EK +L+L Q F W++HG      C+ QK +  KG T LE+K RDYCSGKI DW+K  Q R
Sbjct: 30  EKKVLKL-QEFRWRQHGGTRSTVCLSQKSRKEKGATILEIKHRDYCSGKIVDWDKKQQKR 89

Query: 103 IILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIV 162
           +I D ++V SL S FK+ +  G+   LS++QIP++SG RLQTLNYIVTV +GG+N T+IV
Sbjct: 90  LIFDDLHVRSLQSQFKNRV-SGRIKDLSEAQIPLTSGIRLQTLNYIVTVELGGRNMTVIV 149

Query: 163 DTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKN 222
           DTGSDLTWVQC PC+LCYNQQEPLFN S S S+ S+ CNS TC ALQ   G+SG C + N
Sbjct: 150 DTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKSVLCNSSTCQALQFDTGNSGACGS-N 209

Query: 223 STSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMGLARSE 282
            TSC+Y ++YGDGSY+RGELG + L+LG T +++F+FGCGRNNKGLFGGASGLMGL RSE
Sbjct: 210 PTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNFVFGCGRNNKGLFGGASGLMGLGRSE 269

Query: 283 -LSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSN 342
            +SLVSQTS+LFG VFSYCLPTT   +SGSL +GG D S +KN +PISYTRM+ NP++S 
Sbjct: 270 SVSLVSQTSALFGGVFSYCLPTTEATASGSLIMGG-DASIYKNSTPISYTRMVPNPELST 329

Query: 343 FYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYR 402
           FYFLNLTGISIGGV L     +S  G+L  +DSGTVI+RL+PS+YKA KAEF KQFSGY 
Sbjct: 330 FYFLNLTGISIGGVALQNQSFASG-GIL--IDSGTVISRLAPSVYKAVKAEFLKQFSGYP 389

Query: 403 TTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFASL 462
             PGF+IL+TCFNL+ Y+EV+IPT+KF FEGNAE+ VDV G+FY VK+DASQICLA ASL
Sbjct: 390 PAPGFAILDTCFNLSAYQEVSIPTLKFHFEGNAELNVDVTGIFYLVKTDASQICLALASL 449

Query: 463 GYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 498
            YED+  IIGNYQQKNQRVIYN+K+SK+GFA E CSF
Sbjct: 450 SYEDEIGIIGNYQQKNQRVIYNTKDSKLGFAEESCSF 479

BLAST of CSPI07G19920 vs. TrEMBL
Match: A0A061GAK7_THECC (Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_027560 PE=3 SV=1)

HSP 1 Score: 552.0 bits (1421), Expect = 7.5e-154
Identity = 291/488 (59.63%), Postives = 365/488 (74.80%), Query Frame = 1

Query: 14  LLLLLLLPLLSIGVDARSSSFNLGNGDNH---EKGLLQLFQNFPWKEHGEAVVNCIFQKP 73
           + LLL   +LS+ V     S  L NG  H   EK LL+L Q+F WK+  +A   C+ QK 
Sbjct: 5   MALLLFPSMLSLLV----FSLILHNGVVHCFGEKKLLKL-QHFQWKQKWDAST-CLSQKS 64

Query: 74  KITKGITTLEMKQRDYC-SGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLS 133
           +  KG T LEMK RDYC  G + DW K+ Q R+ILD + V SL +  K+  F G+T  +S
Sbjct: 65  RKEKGATILEMKHRDYCYGGGVKDWNKLLQKRLILDDLRVQSLQARIKNKAF-GKTEGVS 124

Query: 134 DSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPS 193
           D++IP++SG  L TLNYIVTV +GG+   +IVDTGSDLTWVQC PC+ CYNQ+EPLFNPS
Sbjct: 125 DTRIPLTSGVELGTLNYIVTVELGGRKMRVIVDTGSDLTWVQCQPCKSCYNQKEPLFNPS 184

Query: 194 NSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLG 253
            S S+ ++ CNS TC +L    G++G+C N N  +C+Y + YGDGSY+RGEL  + L+LG
Sbjct: 185 ASPSYQTVSCNSSTCQSLAFATGNTGICGN-NPPTCNYIVSYGDGSYTRGELAHDHLSLG 244

Query: 254 KTEIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSG 313
           KT +D+F+FGCGRNNKGLFGGASGLMGL RS +SLVSQT+ +FG  FSYCLP+T  G+SG
Sbjct: 245 KTPVDNFVFGCGRNNKGLFGGASGLMGLGRSSISLVSQTTDIFGGFFSYCLPSTQAGASG 304

Query: 314 SLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLS 373
           SL LGG + S +K  S ISYTRMI NPQ+S FYFLNLTG+S+GGV L  P  +  +G + 
Sbjct: 305 SLVLGG-NSSVYKTSSAISYTRMIPNPQLSTFYFLNLTGVSVGGVTL--PDSTFGKGAM- 364

Query: 374 LLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIF 433
           L+DSGTVITRL PSIYKA KAEF KQFSG+ + P FSIL+TCFNL+ Y+EV++PT+K  F
Sbjct: 365 LIDSGTVITRLPPSIYKALKAEFMKQFSGFPSAPAFSILDTCFNLSAYQEVDVPTIKMQF 424

Query: 434 EGNAEMVVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVG 493
           EGNAEM VDV GVFYFVK+DASQ+CLA ASL +ED+  II NYQQ+NQRVIY++K++K+G
Sbjct: 425 EGNAEMNVDVTGVFYFVKTDASQVCLALASLSFEDEIGIIANYQQRNQRVIYDTKKAKLG 480

Query: 494 FAGEPCSF 498
           FA E CSF
Sbjct: 485 FAHESCSF 480

BLAST of CSPI07G19920 vs. TrEMBL
Match: A0A067EW14_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011482mg PE=3 SV=1)

HSP 1 Score: 547.4 bits (1409), Expect = 1.9e-152
Identity = 288/487 (59.14%), Postives = 358/487 (73.51%), Query Frame = 1

Query: 13  LLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQL-FQNFPWKEHGEAVVNCI-FQKP 72
           L +L LLLPL+        S F L  G +  +G  +L      W++   +  +C+  QK 
Sbjct: 8   LTILSLLLPLMV-------SLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKS 67

Query: 73  KITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSD 132
           +I  G  TLE+K ++YCSGKI DW +  QNR+ILD ++V  L S  K+ I  G    +S+
Sbjct: 68  RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDVSN 127

Query: 133 SQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSN 192
           ++IP++SG RLQTLNYI T+ +GG+N T+IVDTGSDLTWVQC PC+ CYNQQ+P+F+PS 
Sbjct: 128 TEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 187

Query: 193 SSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGK 252
           S S+  + CNS TC AL+   G+SG+CS+ +   C+Y + YGDGSY+RGELG E L LGK
Sbjct: 188 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 247

Query: 253 TEIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT-GVGSSG 312
             ++ FIFGCGRNNKGLFGG SGLMGL RS+LSLVSQTS +FG +FSYCLP+T   G+SG
Sbjct: 248 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 307

Query: 313 SLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLS 372
           SL LGG + S FKN +PI+YT MI NPQ++ FY LNLTGISIGG  L     +   G+  
Sbjct: 308 SLILGG-NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GI-- 367

Query: 373 LLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIF 432
           L+DSGTVITRL PSIY A KAEF KQFSG+ + PGFSIL+TCFNL+ Y+EVNIP VK  F
Sbjct: 368 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 427

Query: 433 EGNAEMVVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVG 492
           EGNAEM VDV G+ YFVKSDASQ+CLA ASL YED+T IIGNYQQKNQRVIY++K S++G
Sbjct: 428 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 482

Query: 493 FAGEPCS 497
           FAGE CS
Sbjct: 488 FAGEDCS 482

BLAST of CSPI07G19920 vs. TrEMBL
Match: B9RGP5_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1442500 PE=3 SV=1)

HSP 1 Score: 545.4 bits (1404), Expect = 7.1e-152
Identity = 275/419 (65.63%), Postives = 329/419 (78.52%), Query Frame = 1

Query: 81  MKQRDYC--SGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSG 140
           MK RD+C  SGK TDW K  Q  +ILD   V SL S  KS IF G      DSQIP+SSG
Sbjct: 1   MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKS-IFSGNNIDALDSQIPLSSG 60

Query: 141 ARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLP 200
            RLQTLNYIVTV IGG+N T+IVDTGSDLTWVQC PCRLCYNQQ+PLFNPS S S+ ++ 
Sbjct: 61  VRLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTIL 120

Query: 201 CNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDSFIF 260
           CNS TC +LQ   G+ G+C + N+ +C+Y ++YGDGSY+RG+LG E+L LG T + +FIF
Sbjct: 121 CNSSTCQSLQYATGNLGVCGS-NTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIF 180

Query: 261 GCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADF 320
           GCGRNNKGLFGGASGLMGL +S+LSLVSQTS++F  VFSYCLPTT   +SGSL LGG + 
Sbjct: 181 GCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLILGG-NS 240

Query: 321 SNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVIT 380
           S +KN +PISYTRMI NPQ+  FYFLNLTGISIGGV L  P    + G+L  +DSGTVIT
Sbjct: 241 SVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQS-GIL--IDSGTVIT 300

Query: 381 RLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVD 440
           RL P +Y+  KAEF KQFSG+ + P FSIL+TCFNL GY+EV+IPT++  FEGNAE+ VD
Sbjct: 301 RLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAELTVD 360

Query: 441 VEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 498
           V G+FYFVK+DASQ+CLA ASL ++D+  IIGNYQQ+NQRVIYN+KESK+GFA E CSF
Sbjct: 361 VTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACSF 413

BLAST of CSPI07G19920 vs. TAIR10
Match: AT1G79720.1 (AT1G79720.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 491.1 bits (1263), Expect = 8.0e-139
Identity = 263/488 (53.89%), Postives = 334/488 (68.44%), Query Frame = 1

Query: 13  LLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPW--KEHGEAVVNCIFQKP 72
           LL+ L LL  +  GVD              EK +L +  N  W  K+  EA  +C  +  
Sbjct: 15  LLVFLFLLSCVVHGVD--------------EKKILSVHNNI-WSPKKSYEASTSCFSRSL 74

Query: 73  KITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSD 132
              +  TTLEMK R+ CSGK  D  K  +  ++LD I V SL    K+         +S+
Sbjct: 75  GKGRESTTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSE 134

Query: 133 SQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSN 192
           +QIP++SG +L++LNYIVTV +GG+N +LIVDTGSDLTWVQC PCR CYNQQ PL++PS 
Sbjct: 135 TQIPLTSGIKLESLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV 194

Query: 193 SSSFLSLPCNSPTCVALQPTAGSSGLCSNKNS---TSCDYQIDYGDGSYSRGELGFEKLT 252
           SSS+ ++ CNS TC  L     +SG C   N    T C+Y + YGDGSY+RG+L  E + 
Sbjct: 195 SSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESIL 254

Query: 253 LGKTEIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGS 312
           LG T++++F+FGCGRNNKGLFGG+SGLMGL RS +SLVSQT   F  VFSYCLP+   G+
Sbjct: 255 LGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGA 314

Query: 313 SGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGV 372
           SGSL+ G  D S + N + +SYT ++QNPQ+ +FY LNLTG SIGGV L     SS+ G 
Sbjct: 315 SGSLSFGN-DSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK----SSSFGR 374

Query: 373 LSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKF 432
             L+DSGTVITRL PSIYKA K EF KQFSG+ T PG+SIL+TCFNLT YE+++IP +K 
Sbjct: 375 GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKM 434

Query: 433 IFEGNAEMVVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESK 492
           IF+GNAE+ VDV GVFYFVK DAS +CLA ASL YE++  IIGNYQQKNQRVIY++ + +
Sbjct: 435 IFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQER 482

Query: 493 VGFAGEPC 496
           +G  GE C
Sbjct: 495 LGIVGENC 482

BLAST of CSPI07G19920 vs. TAIR10
Match: AT5G10770.1 (AT5G10770.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 305.8 bits (782), Expect = 4.8e-83
Identity = 172/398 (43.22%), Postives = 237/398 (59.55%), Query Frame = 1

Query: 104 LDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNS--TLIV 163
           LD   VNS+ S     +      +   + +P   G+ L + NYIVTVG+G   +  +LI 
Sbjct: 90  LDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIF 149

Query: 164 DTGSDLTWVQCLPC-RLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 223
           DTGSDLTW QC PC R CY+Q+EP+FNPS S+S+ ++ C+S  C +L    G++G CS  
Sbjct: 150 DTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS 209

Query: 224 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DSFIFGCGRNNKGLFGGASGLMGLAR 283
           N   C Y I YGD S+S G L  EK TL  +++ D   FGCG NN+GLF G +GL+GL R
Sbjct: 210 N---CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGR 269

Query: 284 SELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMS 343
            +LS  SQT++ +  +FSYCLP++    +G LT G A  S     +PIS          +
Sbjct: 270 DKLSFPSQTATAYNKIFSYCLPSSA-SYTGHLTFGSAGISRSVKFTPISTIT-----DGT 329

Query: 344 NFYFLNLTGISIGGVNLNVPR-LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSG 403
           +FY LN+  I++GG  L +P  + S  G  +L+DSGTVITRL P  Y A ++ F+ + S 
Sbjct: 330 SFYGLNIVAITVGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSK 389

Query: 404 YRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFA 463
           Y TT G SIL+TCF+L+G++ V IP V F F G A + +  +G+FY  K   SQ+CLAFA
Sbjct: 390 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFA 449

Query: 464 SLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 497
               +    I GN QQ+   V+Y+    +VGFA   CS
Sbjct: 450 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of CSPI07G19920 vs. TAIR10
Match: AT5G10760.1 (AT5G10760.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 266.2 bits (679), Expect = 4.2e-71
Identity = 160/414 (38.65%), Postives = 235/414 (56.76%), Query Frame = 1

Query: 87  CSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNY 146
           CS   +D        I  D   V S++S   S     +  +   +++P  SG  L + NY
Sbjct: 74  CSHLSSDARVDHDEIIRRDQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNY 133

Query: 147 IVTVGIGG--QNSTLIVDTGSDLTWVQCLPCR-LCYNQQEPLFNPSNSSSFLSLPCNSPT 206
           IVT+GIG    + +L+ DTGSDLTW QC PC   CY+Q+EP FNPS+SS++ ++ C+SP 
Sbjct: 134 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 193

Query: 207 CVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DSFIFGCGR 266
           C   +        CS  N   C Y I YGD S+++G L  EK TL  +++ +   FGCG 
Sbjct: 194 CEDAES-------CSASN---CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGE 253

Query: 267 NNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFK 326
           NN+GLF G +GL+GL   +LSL +QT++ + ++FSYCLP+    S+G LT G A  S   
Sbjct: 254 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 313

Query: 327 NISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNV-PRLSSNEGVLSLLDSGTVITRLS 386
             +PIS       P   N Y +++ GIS+G   L + P   S EG  +++DSGTV TRL 
Sbjct: 314 KFTPIS-----SFPSAFN-YGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFTRLP 373

Query: 387 PSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEG 446
             +Y   ++ F+++ S Y++T G+ + +TC++ TG + V  PT+ F F G+  + +D  G
Sbjct: 374 TKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSG 433

Query: 447 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 496
           +   +K   SQ+CLAFA  G +D   I GN QQ    V+Y+    +VGFA   C
Sbjct: 434 ISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of CSPI07G19920 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 237.3 bits (604), Expect = 2.1e-62
Identity = 133/410 (32.44%), Postives = 212/410 (51.71%), Query Frame = 1

Query: 101 RIILDAINVNSLFSHFKSAIFPGQT--HQLSDSQIPISSGARLQTLNYIVTVGIGG--QN 160
           R+  D   V+++       + P     ++++D    I SG    +  Y V +G+G   ++
Sbjct: 84  RMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRD 143

Query: 161 STLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGL 220
             +++D+GSD+ WVQC PC+LCY Q +P+F+P+ S S+  + C S  C  ++ +   SG 
Sbjct: 144 QYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSG- 203

Query: 221 CSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMG 280
                   C Y++ YGDGSY++G L  E LT  KT + +   GCG  N+G+F GA+GL+G
Sbjct: 204 -------GCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLG 263

Query: 281 LARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYTRMIQ 340
           +    +S V Q S   G  F YCL + G  S+GSL  G       +   P+  S+  +++
Sbjct: 264 IGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFG-------REALPVGASWVPLVR 323

Query: 341 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSL---------LDSGTVITRLSPSIY 400
           NP+  +FY++ L G+ +GGV + +P     +GV  L         +D+GT +TRL  + Y
Sbjct: 324 NPRAPSFYYVGLKGLGVGGVRIPLP-----DGVFDLTETGDGGVVMDTGTAVTRLPTAAY 383

Query: 401 KAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEGVFYF 460
            AF+  F+ Q +      G SI +TC++L+G+  V +PTV F F     + +     F  
Sbjct: 384 VAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARN-FLM 443

Query: 461 VKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 496
              D+   C AFA+        IIGN QQ+  +V ++     VGF    C
Sbjct: 444 PVDDSGTYCFAFAA--SPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CSPI07G19920 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 232.3 bits (591), Expect = 6.7e-61
Identity = 152/436 (34.86%), Postives = 227/436 (52.06%), Query Frame = 1

Query: 78  TLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQ--THQLSDSQIPI 137
           TL +   D  S   T  +++F +R+  D+  V S+ +   +A  PG+  TH         
Sbjct: 73  TLNLDHIDALSSNKTP-DELFSSRLQRDSRRVKSIAT--LAAQIPGRNVTHAPRPGGFSS 132

Query: 138 S--SGARLQTLNYIVTVGIG--GQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNS 197
           S  SG    +  Y   +G+G   +   +++DTGSD+ W+QC PCR CY+Q +P+F+P  S
Sbjct: 133 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 192

Query: 198 SSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT 257
            ++ ++PC+SP C  L  +AG    C+ +  T C YQ+ YGDGS++ G+   E LT  + 
Sbjct: 193 KTYATIPCSSPHCRRLD-SAG----CNTRRKT-CLYQVSYGDGSFTVGDFSTETLTFRRN 252

Query: 258 EIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS-GS 317
            +     GCG +N+GLF GA+GL+GL + +LS   QT   F   FSYCL      S   S
Sbjct: 253 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS 312

Query: 318 LTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSS------- 377
           +  G A  S     +P     ++ NP++  FY++ L GIS+GG    VP +++       
Sbjct: 313 VVFGNAAVSRIARFTP-----LLSNPKLDTFYYVGLLGISVGGT--RVPGVTASLFKLDQ 372

Query: 378 --NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 437
             N GV  ++DSGT +TRL    Y A +  F       +  P FS+ +TCF+L+   EV 
Sbjct: 373 IGNGGV--IIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVK 432

Query: 438 IPTVKFIFEGNAEMVVDVEGVFYFVKSDAS-QICLAFASLGYEDQTMIIGNYQQKNQRVI 497
           +PTV   F G     V +    Y +  D + + C AFA  G      IIGN QQ+  RV+
Sbjct: 433 VPTVVLHFRG---ADVSLPATNYLIPVDTNGKFCFAFA--GTMGGLSIIGNIQQQGFRVV 485

BLAST of CSPI07G19920 vs. NCBI nr
Match: gi|778728858|ref|XP_004135889.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 986.9 bits (2550), Expect = 1.3e-284
Identity = 494/497 (99.40%), Postives = 496/497 (99.80%), Query Frame = 1

Query: 1   MEISKSLHFPLSLLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE 60
           MEISKSLHFPLSLLLLLLL PLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE
Sbjct: 1   MEISKSLHFPLSLLLLLLL-PLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE 60

Query: 61  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120
           AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI
Sbjct: 61  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120

Query: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180
           FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN
Sbjct: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180

Query: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240
           QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE
Sbjct: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240

Query: 241 LGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300
           LGFEKLTLGKTEID+FIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL
Sbjct: 241 LGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300

Query: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360
           PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR
Sbjct: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360

Query: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420
           LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV
Sbjct: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420

Query: 421 NIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480
           NIPTVKFIFEGNAEM+VDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI
Sbjct: 421 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480

Query: 481 YNSKESKVGFAGEPCSF 498
           YNSKESKVGFAGEPCSF
Sbjct: 481 YNSKESKVGFAGEPCSF 496

BLAST of CSPI07G19920 vs. NCBI nr
Match: gi|659122560|ref|XP_008461208.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo])

HSP 1 Score: 939.1 bits (2426), Expect = 3.1e-270
Identity = 474/498 (95.18%), Postives = 485/498 (97.39%), Query Frame = 1

Query: 1   MEISKSLHFPLSLLLLLLLLPLLSIGVDARSSSFNLGNGDN-HEKGLLQLFQNFPWKEHG 60
           ME+SKSLHFPLSLL LLLL PLL I VDARSS   +GNG N HEKGLLQLFQNFPWKEHG
Sbjct: 3   MEVSKSLHFPLSLLFLLLL-PLLFIIVDARSS---VGNGGNYHEKGLLQLFQNFPWKEHG 62

Query: 61  EAVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSA 120
           EAVVNCIFQKPKITKGITTLEMKQRDYCSGKITD EKIFQNRIILDAINVNSL SH KSA
Sbjct: 63  EAVVNCIFQKPKITKGITTLEMKQRDYCSGKITDLEKIFQNRIILDAINVNSLLSHVKSA 122

Query: 121 IFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCY 180
           IFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCY
Sbjct: 123 IFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCY 182

Query: 181 NQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRG 240
           NQQEPLFNPSNSSSFLSLPC+SPTC+ALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRG
Sbjct: 183 NQQEPLFNPSNSSSFLSLPCSSPTCLALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRG 242

Query: 241 ELGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYC 300
           ELG+EKLTLGKTEID+FIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSS+FGS+FSYC
Sbjct: 243 ELGYEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSVFGSIFSYC 302

Query: 301 LPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP 360
           LPTTGVGSSGSLTLGG DFS+FKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP
Sbjct: 303 LPTTGVGSSGSLTLGGTDFSSFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP 362

Query: 361 RLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEE 420
           RLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEE
Sbjct: 363 RLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEE 422

Query: 421 VNIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRV 480
           VNIPTVKFIFEGNAEM+VDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRV
Sbjct: 423 VNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRV 482

Query: 481 IYNSKESKVGFAGEPCSF 498
           +YNSKESKVGFAGEPCSF
Sbjct: 483 VYNSKESKVGFAGEPCSF 496

BLAST of CSPI07G19920 vs. NCBI nr
Match: gi|595931085|ref|XP_007215297.1| (hypothetical protein PRUPE_ppa005040mg [Prunus persica])

HSP 1 Score: 581.3 bits (1497), Expect = 1.7e-162
Identity = 299/457 (65.43%), Postives = 363/457 (79.43%), Query Frame = 1

Query: 43  EKGLLQLFQNFPWKEHGEAVVN-CIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNR 102
           EK +L+L Q F W++HG      C+ QK +  KG T LE+K RDYCSGKI DW+K  Q R
Sbjct: 30  EKKVLKL-QEFRWRQHGGTRSTVCLSQKSRKEKGATILEIKHRDYCSGKIVDWDKKQQKR 89

Query: 103 IILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIV 162
           +I D ++V SL S FK+ +  G+   LS++QIP++SG RLQTLNYIVTV +GG+N T+IV
Sbjct: 90  LIFDDLHVRSLQSQFKNRV-SGRIKDLSEAQIPLTSGIRLQTLNYIVTVELGGRNMTVIV 149

Query: 163 DTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKN 222
           DTGSDLTWVQC PC+LCYNQQEPLFN S S S+ S+ CNS TC ALQ   G+SG C + N
Sbjct: 150 DTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKSVLCNSSTCQALQFDTGNSGACGS-N 209

Query: 223 STSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMGLARSE 282
            TSC+Y ++YGDGSY+RGELG + L+LG T +++F+FGCGRNNKGLFGGASGLMGL RSE
Sbjct: 210 PTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNFVFGCGRNNKGLFGGASGLMGLGRSE 269

Query: 283 -LSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSN 342
            +SLVSQTS+LFG VFSYCLPTT   +SGSL +GG D S +KN +PISYTRM+ NP++S 
Sbjct: 270 SVSLVSQTSALFGGVFSYCLPTTEATASGSLIMGG-DASIYKNSTPISYTRMVPNPELST 329

Query: 343 FYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYR 402
           FYFLNLTGISIGGV L     +S  G+L  +DSGTVI+RL+PS+YKA KAEF KQFSGY 
Sbjct: 330 FYFLNLTGISIGGVALQNQSFASG-GIL--IDSGTVISRLAPSVYKAVKAEFLKQFSGYP 389

Query: 403 TTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFASL 462
             PGF+IL+TCFNL+ Y+EV+IPT+KF FEGNAE+ VDV G+FY VK+DASQICLA ASL
Sbjct: 390 PAPGFAILDTCFNLSAYQEVSIPTLKFHFEGNAELNVDVTGIFYLVKTDASQICLALASL 449

Query: 463 GYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 498
            YED+  IIGNYQQKNQRVIYN+K+SK+GFA E CSF
Sbjct: 450 SYEDEIGIIGNYQQKNQRVIYNTKDSKLGFAEESCSF 479

BLAST of CSPI07G19920 vs. NCBI nr
Match: gi|645245370|ref|XP_008228848.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Prunus mume])

HSP 1 Score: 573.5 bits (1477), Expect = 3.5e-160
Identity = 295/457 (64.55%), Postives = 361/457 (78.99%), Query Frame = 1

Query: 43  EKGLLQLFQNFPWKEHGEAVVN-CIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNR 102
           EK +L+L Q F W++ G      C+ QK +  KG T LE+K RDYCSGKI DW K  Q R
Sbjct: 30  EKKVLKL-QEFRWRQRGGTRSTVCLSQKSRKEKGATILEIKHRDYCSGKIVDWNKKQQKR 89

Query: 103 IILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIV 162
           +I D ++V SL S FK+ ++ G+    S++QIP++SG RLQTLNYIVT+ +GG+N T+IV
Sbjct: 90  LIFDDLHVRSLQSQFKNRVY-GRIKDASEAQIPLTSGIRLQTLNYIVTLELGGRNMTVIV 149

Query: 163 DTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKN 222
           DTGSDLTWVQC PC+LCYNQQEPLFN S S S+ S+ CNS TC ALQ   G+SG C + N
Sbjct: 150 DTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKSVLCNSSTCQALQFDTGNSGACGS-N 209

Query: 223 STSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDSFIFGCGRNNKGLFGGASGLMGLARSE 282
            TSC+Y ++YGDGSY+RGELG + L+LG T +++F+FGCGRNNKGLFGGASGLMGL RSE
Sbjct: 210 PTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNFVFGCGRNNKGLFGGASGLMGLGRSE 269

Query: 283 -LSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSN 342
            +SLVSQTS+LFG VFSYCLPTT   +SGSL +GG D S +KN +PISYTRM+ NP++S+
Sbjct: 270 SVSLVSQTSALFGGVFSYCLPTTEATASGSLIMGG-DASIYKNSTPISYTRMVPNPELSS 329

Query: 343 FYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYR 402
           FYFLNLTGISIGGV L     +S  G+L  +DSGTVI+RL+P +Y+A KAEF KQFSGY 
Sbjct: 330 FYFLNLTGISIGGVALQAQSFASG-GIL--IDSGTVISRLAPLVYEAVKAEFLKQFSGYP 389

Query: 403 TTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMVVDVEGVFYFVKSDASQICLAFASL 462
             PGFSIL+TCFNL+ Y+EV+IPT+KF FEGNAE+ VDV GVFY VK+DASQICLA ASL
Sbjct: 390 PAPGFSILDTCFNLSAYQEVSIPTLKFHFEGNAELNVDVTGVFYLVKTDASQICLALASL 449

Query: 463 GYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 498
            YED+  IIGNYQQKN+RVIYN+K+SK+GFA E CSF
Sbjct: 450 SYEDEIGIIGNYQQKNRRVIYNTKDSKLGFAEESCSF 479

BLAST of CSPI07G19920 vs. NCBI nr
Match: gi|1000979824|ref|XP_015570839.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis])

HSP 1 Score: 565.8 bits (1457), Expect = 7.2e-158
Identity = 299/487 (61.40%), Postives = 362/487 (74.33%), Query Frame = 1

Query: 15  LLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPW--KEHGEAVVNCIFQKPKI 74
           LL LL+ LL++         N G     EK +L L Q + W  K + +   +C+ QK K 
Sbjct: 19  LLSLLVFLLTV--------VNGGAQSLQEKKVLSL-QEYQWQLKSNTDTNSSCLSQKSKR 78

Query: 75  TKGITTLEMKQRDYC--SGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSD 134
            KG T LEMK RD+C  SGK TDW K  Q  +ILD   V SL S  KS IF G      D
Sbjct: 79  EKGATILEMKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKS-IFSGNNIDALD 138

Query: 135 SQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSN 194
           SQIP+SSG RLQTLNYIVTV IGG+N T+IVDTGSDLTWVQC PCRLCYNQQ+PLFNPS 
Sbjct: 139 SQIPLSSGVRLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSG 198

Query: 195 SSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGK 254
           S S+ ++ CNS TC +LQ   G+ G+C + N+ +C+Y ++YGDGSY+RG+LG E+L LG 
Sbjct: 199 SPSYQTILCNSSTCQSLQYATGNLGVCGS-NTPTCNYVVNYGDGSYTRGDLGMEQLNLGT 258

Query: 255 TEIDSFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGS 314
           T + +FIFGCGRNNKGLFGGASGLMGL +S+LSLVSQTS++F  VFSYCLPTT   +SGS
Sbjct: 259 THVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGS 318

Query: 315 LTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSL 374
           L LGG + S +KN +PISYTRMI NPQ+  FYFLNLTGISIGGV L  P    + G+L  
Sbjct: 319 LILGG-NSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQS-GIL-- 378

Query: 375 LDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFE 434
           +DSGTVITRL P +Y+  KAEF KQFSG+ + P FSIL+TCFNL GY+EV+IPT++  FE
Sbjct: 379 IDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFE 438

Query: 435 GNAEMVVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGF 494
           GNAE+ VDV G+FYFVK+DASQ+CLA ASL ++D+  IIGNYQQ+NQRVIYN+KESK+GF
Sbjct: 439 GNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGF 490

Query: 495 AGEPCSF 498
           A E CSF
Sbjct: 499 AAEACSF 490

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASPA_ARATH8.4e-8243.22Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 ... [more]
AED1_ARATH7.4e-7038.65Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1[more]
ASPG2_ARATH3.7e-6132.44Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
APF2_ARATH1.2e-5934.86Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
ASPG1_ARATH1.8e-5531.00Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
Match NameE-valueIdentityDescription
A0A0A0K8J2_CUCSA9.2e-28599.40Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431320 PE=3 SV=1[more]
M5WTF1_PRUPE1.2e-16265.43Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005040mg PE=3 SV=1[more]
A0A061GAK7_THECC7.5e-15459.63Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_027560 PE=... [more]
A0A067EW14_CITSI1.9e-15259.14Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011482mg PE=3 SV=1[more]
B9RGP5_RICCO7.1e-15265.63Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1442500 ... [more]
Match NameE-valueIdentityDescription
AT1G79720.18.0e-13953.89 Eukaryotic aspartyl protease family protein[more]
AT5G10770.14.8e-8343.22 Eukaryotic aspartyl protease family protein[more]
AT5G10760.14.2e-7138.65 Eukaryotic aspartyl protease family protein[more]
AT3G20015.12.1e-6232.44 Eukaryotic aspartyl protease family protein[more]
AT1G01300.16.7e-6134.86 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|778728858|ref|XP_004135889.2|1.3e-28499.40PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus][more]
gi|659122560|ref|XP_008461208.1|3.1e-27095.18PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo][more]
gi|595931085|ref|XP_007215297.1|1.7e-16265.43hypothetical protein PRUPE_ppa005040mg [Prunus persica][more]
gi|645245370|ref|XP_008228848.1|3.5e-16064.55PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Prunus mume][more]
gi|1000979824|ref|XP_015570839.1|7.2e-15861.40PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0048046 apoplast
cellular_component GO:0005575 cellular_component
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G19920.1CSPI07G19920.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 150..170
score: 3.0E-6coord: 369..380
score: 3.0E-6coord: 309..322
score: 3.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 67..496
score: 1.1E-222coord: 7..29
score: 1.1E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 159..170
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 326..496
score: 2.3E-38coord: 133..315
score: 2.0
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 141..495
score: 1.73
NoneNo IPR availablePANTHERPTHR13683:SF263SUBFAMILY NOT NAMEDcoord: 67..496
score: 1.1E-222coord: 7..29
score: 1.1E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
CSPI07G19920CSPI04G26020Wild cucumber (PI 183967)cpicpiB168
The following block(s) are covering this gene:
GeneOrganismBlock
CSPI07G19920Wild cucumber (PI 183967)cpicpiB142
CSPI07G19920Cucurbita maxima (Rimu)cmacpiB501
CSPI07G19920Cucurbita moschata (Rifu)cmocpiB497
CSPI07G19920Cucumber (Chinese Long) v2cpicuB338
CSPI07G19920Watermelon (Charleston Gray)cpiwcgB560
CSPI07G19920Cucurbita pepo (Zucchini)cpecpiB344