CSPI01G03690 (gene) Wild cucumber (PI 183967)

NameCSPI01G03690
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionEukaryotic aspartyl protease family protein
LocationChr1 : 2313591 .. 2315060 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCACTTTCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACGACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTTGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGGGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTTCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCAACATTCTAATTCCATCCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTATTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGACGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTTCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCAAATGTGGAGGTGCCAACCATAGCATTTATTTTGCCGGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTCCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA

mRNA sequence

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCACTTTCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACGACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTTGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGGGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTTCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCAACATTCTAATTCCATCCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTATTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGACGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTTCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCAAATGTGGAGGTGCCAACCATAGCATTTATTTTGCCGGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTCCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA

Coding sequence (CDS)

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCACTTTCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACGACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTTGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGGGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTTCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCAACATTCTAATTCCATCCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTATTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGACGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTTCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCAAATGTGGAGGTGCCAACCATAGCATTTATTTTGCCGGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTCCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA
BLAST of CSPI01G03690 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 3.2e-126
Identity = 243/493 (49.29%), Postives = 329/493 (66.73%), Query Frame = 1

Query: 12  LTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSN------ 71
           +T   FL      SR L+T  P  TN  DV +S+ Q    LS+ P     T +       
Sbjct: 13  VTLSLFLTTTDASSRSLST--PPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSD 72

Query: 72  ---YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQF 131
              ++SSSPLSL LH R T     ++DY SL  +RL R ++R   +  K+  +++G  + 
Sbjct: 73  PVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRS 132

Query: 132 GRR-INGSDS---TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 191
             + +   D+   T  LT PV SGASQG+GEYF+RIGVG P +  + V DTGSDV+W+QC
Sbjct: 133 DLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQC 192

Query: 192 QPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 251
           +PC     CY+Q  P+F+P SSS+Y SL+C + QC LL+ +AC +N C+Y+V YGDGSFT
Sbjct: 193 EPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 252

Query: 252 VGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYC 311
           VGELAT+T +F +S  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q++ATSFSYC
Sbjct: 253 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYC 312

Query: 312 LVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFE 371
           LVD DS  SS+LDFN+ Q      T+PL++N +  TF YV + G SVGG+ + +  + F+
Sbjct: 313 LVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFD 372

Query: 372 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPP-APGVSPFDTCYDLSSQSN 431
           +D SGSGG+I+D GT +T + +  Y+ LRDAF+ LT NL   +  +S FDTCYD SS S 
Sbjct: 373 VDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST 432

Query: 432 VEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYD 490
           V+VPT+AF   G  SL LPAKN L  VD +GTFC AF P++  LSIIGNVQQQG R++YD
Sbjct: 433 VKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYD 492

BLAST of CSPI01G03690 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 5.4e-97
Identity = 202/483 (41.82%), Postives = 280/483 (57.97%), Query Frame = 1

Query: 14  FFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSSSPLS 73
           FF FLH    L   L++ S  S   F +   +   L   +  P  F  TH +  SSS  +
Sbjct: 6   FFFFLH----LHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPD-FNNTHFSDESSSKYT 65

Query: 74  LSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRRINGSDS-- 133
           L L  R    + +Y ++   + AR+ R   R  ++ R++          G+ I  SDS  
Sbjct: 66  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRIS---------GKVIPSSDSRY 125

Query: 134 -TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 193
             N   + + SG  QG+GEYF RIGVG P +  + V D+GSD+ W+QCQPC     CYKQ
Sbjct: 126 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC---KLCYKQ 185

Query: 194 IGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFQ 253
             P+FDP  S SY+ +SC S  C  ++ + C +  C YEV YGDGS+T G LA ET +F 
Sbjct: 186 SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 245

Query: 254 HSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESS 313
            +  + N+ +GCGH N G+FIGAAGL+G+GGG++S   QL   +   F YCLV   ++S+
Sbjct: 246 KT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST 305

Query: 314 STLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGI 373
            +L F  +  P  +   PLV+N R P+F YV + G+ VGG  +P+    F++ E+G GG+
Sbjct: 306 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGV 365

Query: 374 IVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFIL 433
           ++D+GT +T +P+  Y   RD F   T NLP A GVS FDTCYDLS   +V VPT++F  
Sbjct: 366 VMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYF 425

Query: 434 PGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFST 490
                L LPA+N L  VD +GT+C AF  S   LSIIGN+QQ+GI+VS+D AN  VGF  
Sbjct: 426 TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 470

BLAST of CSPI01G03690 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 330.9 bits (847), Expect = 2.4e-89
Identity = 205/492 (41.67%), Postives = 286/492 (58.13%), Query Frame = 1

Query: 8   AFLFLTFFTFLHFPSILSR-KLTTQSPYSTNTFDVS-ASINQALNALSIKPKPFQTTHSN 67
           A LF   F FL  PS  S     T  P S +    S  S     ++ S+    F++  S+
Sbjct: 7   ALLFSLCFFFLSLPSFSSLPSFQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESG-SD 66

Query: 68  YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR 127
             SSS ++L+L     + +    D   L  +RL R + R +S+      +    +  GR 
Sbjct: 67  SESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSI------ATLAAQIPGRN 126

Query: 128 INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGEN 187
           +  +      ++ V SG SQG+GEYF R+GVG P +  + V DTGSD+ WLQC PC    
Sbjct: 127 VTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---R 186

Query: 188 GCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDAN--SCIYEVEYGDGSFTVGELA 247
            CY Q  PIFDP+ S +Y+++ C S  C  LD A C+    +C+Y+V YGDGSFTVG+ +
Sbjct: 187 RCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFS 246

Query: 248 TETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT---SFSYCLV 307
           TET +F+  N +  + +GCGHDNEGLF+GAAGL+GLG G +S   Q        FSYCLV
Sbjct: 247 TETLTFRR-NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLV 306

Query: 308 DLDSES--SSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP-ISSSSFE 367
           D  + S  SS +  NA     +  +PL+ N +  TF YV ++G+SVGG  +P +++S F+
Sbjct: 307 DRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFK 366

Query: 368 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNV 427
           +D+ G+GG+I+DSGT++T +    Y  +RDAF    K L  AP  S FDTC+DLS+ + V
Sbjct: 367 LDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEV 426

Query: 428 EVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 487
           +VPT+     G + + LPA N L  VD+ G FC AF  +   LSIIGN+QQQG RV YDL
Sbjct: 427 KVPTVVLHFRGAD-VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDL 484

Query: 488 ANSLVGFSTDKC 490
           A+S VGF+   C
Sbjct: 487 ASSRVGFAPGGC 484

BLAST of CSPI01G03690 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 2.3e-68
Identity = 154/386 (39.90%), Postives = 222/386 (57.51%), Query Frame = 1

Query: 111 KLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTG 170
           K +L  +  ++  RR+   ++  +  + V +    G GEY   + +G P Q +  + DTG
Sbjct: 56  KFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTG 115

Query: 171 SDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEV 230
           SD+ W QCQPC     C+ Q  PIF+P+ SSS+S+L C S+ C  L    C  N C Y  
Sbjct: 116 SDLIWTQCQPC---TQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTY 175

Query: 231 EYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIG-AAGLIGLGGGAISLSSQ 290
            YGDGS T G + TET +F  S SIPN+  GCG +N+G   G  AGL+G+G G +SL SQ
Sbjct: 176 GYGDGSETQGSMGTETLTF-GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQ 235

Query: 291 LEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSP---LVKNDRFPTFRYVKVIGMSVGG 350
           L+ T FSYC+  + S + S L   +   S +  SP   L+++ + PTF Y+ + G+SVG 
Sbjct: 236 LDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 295

Query: 351 KPLPISSSSFEID-ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSP- 410
             LPI  S+F ++  +G+GGII+DSGTT+T   ++ Y  +R  F+    NLP   G S  
Sbjct: 296 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSG 355

Query: 411 FDTCYDL-SSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSII 470
           FD C+   S  SN+++PT      G   L+LP++N  F   S G  CLA   S+  +SI 
Sbjct: 356 FDLCFQTPSDPSNLQIPTFVMHFDG-GDLELPSEN-YFISPSNGLICLAMGSSSQGMSIF 415

Query: 471 GNVQQQGIRVSYDLANSLVGFSTDKC 490
           GN+QQQ + V YD  NS+V F++ +C
Sbjct: 416 GNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI01G03690 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 7.6e-67
Identity = 155/386 (40.16%), Postives = 220/386 (56.99%), Query Frame = 1

Query: 112 LELSLKGGKQFGRRINGS-DSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTG 171
           ++ ++K G++  R IN    S++ +  PV +G     GEY   + +G P  S+  + DTG
Sbjct: 61  IKRAIKRGERRMRSINAMLQSSSGIETPVYAGD----GEYLMNVAIGTPDSSFSAIMDTG 120

Query: 172 SDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEV 231
           SD+ W QC+PC     C+ Q  PIF+P+ SSS+S+L C+S+ C  L    C+ N C Y  
Sbjct: 121 SDLIWTQCEPC---TQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTY 180

Query: 232 EYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIG-AAGLIGLGGGAISLSSQ 291
            YGDGS T G +ATETF+F+ ++S+PN+  GCG DN+G   G  AGLIG+G G +SL SQ
Sbjct: 181 GYGDGSTTQGYMATETFTFE-TSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ 240

Query: 292 LEATSFSYCLVDLDSESSSTLDFN---ADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGG 351
           L    FSYC+    S S STL      +  P  S ++ L+ +   PT+ Y+ + G++VGG
Sbjct: 241 LGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGG 300

Query: 352 KPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPA-PGVSPF 411
             L I SS+F++ + G+GG+I+DSGTT+T +P D Y+ +  AF     NLP      S  
Sbjct: 301 DNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI-NLPTVDESSSGL 360

Query: 412 DTCYDLSSQ-SNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF-LPSTFPLSII 471
            TC+   S  S V+VP I+    G   L L  +N L    + G  CLA    S   +SI 
Sbjct: 361 STCFQQPSDGSTVQVPEISMQFDG-GVLNLGEQNILIS-PAEGVICLAMGSSSQLGISIF 420

Query: 472 GNVQQQGIRVSYDLANSLVGFSTDKC 490
           GN+QQQ  +V YDL N  V F   +C
Sbjct: 421 GNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI01G03690 vs. TrEMBL
Match: A0A0A0LS14_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1)

HSP 1 Score: 961.4 bits (2484), Expect = 4.1e-277
Identity = 484/489 (98.98%), Postives = 486/489 (99.39%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYAFLFLTFF  LHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           CDGENGCYKQIGPIFDPKSSSSYS LSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG
Sbjct: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240

Query: 241 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATETFSF+HSNSIPNLPIGCGHDNEGLF+GAAGLIGLGGGAISLSSQLEATSFSYCLV
Sbjct: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE
Sbjct: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP
Sbjct: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
           TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS
Sbjct: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480

Query: 481 LVGFSTDKC 490
           LVGFSTDKC
Sbjct: 481 LVGFSTDKC 489

BLAST of CSPI01G03690 vs. TrEMBL
Match: A0A0A0LPJ3_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 6.9e-192
Identity = 352/490 (71.84%), Postives = 407/490 (83.06%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLS  FLFLT FT L FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQ  PIFDPKSSSSYS LSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of CSPI01G03690 vs. TrEMBL
Match: A0A0A0LPR7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1)

HSP 1 Score: 579.3 bits (1492), Expect = 4.3e-162
Identity = 280/334 (83.83%), Postives = 306/334 (91.62%), Query Frame = 1

Query: 156 VGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHL 215
           VGQP Q  FFV DTGSDV+WLQC PC G+NGCY+QI PIFDP+ SSSY+ +SCDSEQC L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 216 LDEAACDANSCIYEVEYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAG 275
           LDEA C+ NSCIY+VEYGDGSFT+GELATET +F HSNSIPN+ IGCGHDNEGLF+GA G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 276 LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFR 335
           LIGLGGGAIS+SSQL+A+SFSYCLVD+DS S STLDFN D PSDSL SPLVKNDRFP+FR
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 336 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 395
           YVKVIGMSVGGKPLPISSS FEIDESG GGIIVDSGTTIT++PSDVY+VLR+AF+GLT N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 396 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 455
           LPPAP +SPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 456 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
           +TFPLSIIGN QQQGIRVSYDL NSLVGFST+KC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

BLAST of CSPI01G03690 vs. TrEMBL
Match: M5WPB5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1)

HSP 1 Score: 501.9 bits (1291), Expect = 8.8e-139
Identity = 271/496 (54.64%), Postives = 351/496 (70.77%), Query Frame = 1

Query: 8   AFLFLTF---FTFLH-FPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKP---KPF- 67
           AFL+L     FT    FPS  SR L+ ++   T   DVSAS+ QA + LS  P   KP  
Sbjct: 5   AFLYLAILSAFTLTSLFPSTHSRSLSEET---TTLLDVSASLTQAHDVLSFNPQTLKPLD 64

Query: 68  -QTTHSNYHSSSPL----SLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLEL 127
            Q T +  H+ +PL    SL L PR  +HN  ++DY SLV++RL R +AR  SL+ KL+L
Sbjct: 65  RQETQAQAHTLTPLNSSFSLQLLPRDALHNSQHKDYESLVQSRLGRDSARVNSLHTKLQL 124

Query: 128 SLKGGKQFGRR-INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDV 187
            ++  K+     ++       L+ PV SG SQG+GEYF RIGVG P +S + V DTGSD+
Sbjct: 125 VVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSGEYFTRIGVGTPAKSLYMVLDTGSDI 184

Query: 188 SWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYG 247
           +WLQC+PC   + CY+Q  P+F+P  SS+Y  ++CDS QCH L  +AC A+ C+Y+V YG
Sbjct: 185 NWLQCEPC---SDCYQQTDPVFNPTGSSTYRPVTCDSAQCHSLHVSACRADKCLYQVSYG 244

Query: 248 DGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT 307
           DGS+TVG+  TET SF +S +I N+ +GCGHDNEGLF+GAAGL+GLGGGA+SL SQ +AT
Sbjct: 245 DGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKAT 304

Query: 308 SFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISS 367
           SFSYCLV+ DS +SSTL+FN+  PSDS+T+PL+K+ R  TF YV + G SVGG+P+ +  
Sbjct: 305 SFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVPP 364

Query: 368 SSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSS 427
           S FE+DESG+GGIIVDSGT IT + ++ Y+ LRDAF  LT++LP A G + FDTCYDLSS
Sbjct: 365 SVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLSS 424

Query: 428 QSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRV 487
           +S V+VPT++F+     SL LPAKN L  VDSAGTFC AF P++   SIIGNVQQQG RV
Sbjct: 425 RSRVQVPTVSFLFADGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTRV 484

Query: 488 SYDLANSLVGFSTDKC 490
           SYDLAN+ VGFS +KC
Sbjct: 485 SYDLANNRVGFSPNKC 494

BLAST of CSPI01G03690 vs. TrEMBL
Match: A0A0A0LQD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1)

HSP 1 Score: 497.7 bits (1280), Expect = 1.7e-137
Identity = 261/487 (53.59%), Postives = 338/487 (69.40%), Query Frame = 1

Query: 9   FLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYH- 68
           FLFL F +   FP  LSR  +  SP+S+ + DVSAS+ QA   L   P    +     H 
Sbjct: 10  FLFLFFLSL--FPFTLSRSSSHLSPHSSASLDVSASLQQANQVLKFDPTASISFQQQVHL 69

Query: 69  ----SSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFG 128
               SS   SL LHPR ++HN  ++DY SLV +RL+R ++R +S+  +LE +L   K+  
Sbjct: 70  VPSNSSFSFSLQLHPRDSLHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSD 129

Query: 129 RR-INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCD 188
              +        L+ P+ SG SQG+GEYF+R+GVGQP + ++ V DTGSD++WLQCQPC 
Sbjct: 130 LEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC- 189

Query: 189 GENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGEL 248
               CY+Q  PIFDP+SSSS++SL C+S+QC  L+ + C A+ C+Y+V YGDGSFTVGE 
Sbjct: 190 --TDCYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEF 249

Query: 249 ATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLVDL 308
             ET +F +S  I N+ +GCGHDNEGLF+G+AGL+GLGGG++SL+SQ++A+SFSYCLVD 
Sbjct: 250 VIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLVDR 309

Query: 309 DSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESG 368
           DS SSS L+FN+  PSDS+ +PL+K+ +  TF YV + GMSVGG+ L I  + F++D+SG
Sbjct: 310 DSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSG 369

Query: 369 SGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTI 428
            GGIIVDSGT IT + +  Y+ LRDAFV  T  L    G + FDTCYDLSSQS V +PT+
Sbjct: 370 YGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTV 429

Query: 429 AFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLV 488
           +F   G  SLQLP KN L  VDS GTFC AF P+T  LSIIGNVQQQG RV YDLANS+V
Sbjct: 430 SFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVV 489

Query: 489 GFSTDKC 490
           GFS  KC
Sbjct: 490 GFSPHKC 491

BLAST of CSPI01G03690 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 461.5 bits (1186), Expect = 6.7e-130
Identity = 246/489 (50.31%), Postives = 333/489 (68.10%), Query Frame = 1

Query: 4   SLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTH 63
           S +Y+F F  FF   H  S+ SR L   S  +T+  +V+ SI++     S +    Q   
Sbjct: 2   SPNYSFFFFIFFLTSH-SSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLN--QQEE 61

Query: 64  SNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKG-GKQF 123
             + +SS  SL LH R++V    + DY SL  ARL R  AR +SL  +L+L++    K  
Sbjct: 62  QTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKAD 121

Query: 124 GRRINGSDSTNS--LTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 183
            + I+   +T    + AP+ SG +QG+GEYF R+G+G+P +  + V DTGSDV+WLQC P
Sbjct: 122 LKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP 181

Query: 184 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 243
           C     CY Q  PIF+P SSSSY  LSCD+ QC+ L+ + C   +C+YEV YGDGS+TVG
Sbjct: 182 CAD---CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVG 241

Query: 244 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 303
           + ATET +   S  + N+ +GCGH NEGLF+GAAGL+GLGGG ++L SQL  TSFSYCLV
Sbjct: 242 DFATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLV 301

Query: 304 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 363
           D DS+S+ST+DF      D++ +PL++N +  TF Y+ + G+SVGG+ L I  SSFE+DE
Sbjct: 302 DRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDE 361

Query: 364 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 423
           SGSGGII+DSGT +T + +++Y+ LRD+FV  T +L  A GV+ FDTCY+LS+++ VEVP
Sbjct: 362 SGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVP 421

Query: 424 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 483
           T+AF  PG   L LPAKN +  VDS GTFCLAF P+   L+IIGNVQQQG RV++DLANS
Sbjct: 422 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 481

Query: 484 LVGFSTDKC 490
           L+GFS++KC
Sbjct: 482 LIGFSSNKC 483

BLAST of CSPI01G03690 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 453.4 bits (1165), Expect = 1.8e-127
Identity = 243/493 (49.29%), Postives = 329/493 (66.73%), Query Frame = 1

Query: 12  LTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSN------ 71
           +T   FL      SR L+T  P  TN  DV +S+ Q    LS+ P     T +       
Sbjct: 13  VTLSLFLTTTDASSRSLST--PPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSD 72

Query: 72  ---YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQF 131
              ++SSSPLSL LH R T     ++DY SL  +RL R ++R   +  K+  +++G  + 
Sbjct: 73  PVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRS 132

Query: 132 GRR-INGSDS---TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 191
             + +   D+   T  LT PV SGASQG+GEYF+RIGVG P +  + V DTGSDV+W+QC
Sbjct: 133 DLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQC 192

Query: 192 QPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 251
           +PC     CY+Q  P+F+P SSS+Y SL+C + QC LL+ +AC +N C+Y+V YGDGSFT
Sbjct: 193 EPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 252

Query: 252 VGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYC 311
           VGELAT+T +F +S  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q++ATSFSYC
Sbjct: 253 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYC 312

Query: 312 LVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFE 371
           LVD DS  SS+LDFN+ Q      T+PL++N +  TF YV + G SVGG+ + +  + F+
Sbjct: 313 LVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFD 372

Query: 372 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPP-APGVSPFDTCYDLSSQSN 431
           +D SGSGG+I+D GT +T + +  Y+ LRDAF+ LT NL   +  +S FDTCYD SS S 
Sbjct: 373 VDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST 432

Query: 432 VEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYD 490
           V+VPT+AF   G  SL LPAKN L  VD +GTFC AF P++  LSIIGNVQQQG R++YD
Sbjct: 433 VKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYD 492

BLAST of CSPI01G03690 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 356.3 bits (913), Expect = 3.0e-98
Identity = 202/483 (41.82%), Postives = 280/483 (57.97%), Query Frame = 1

Query: 14  FFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSSSPLS 73
           FF FLH    L   L++ S  S   F +   +   L   +  P  F  TH +  SSS  +
Sbjct: 6   FFFFLH----LHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPD-FNNTHFSDESSSKYT 65

Query: 74  LSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRRINGSDS-- 133
           L L  R    + +Y ++   + AR+ R   R  ++ R++          G+ I  SDS  
Sbjct: 66  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRIS---------GKVIPSSDSRY 125

Query: 134 -TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 193
             N   + + SG  QG+GEYF RIGVG P +  + V D+GSD+ W+QCQPC     CYKQ
Sbjct: 126 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC---KLCYKQ 185

Query: 194 IGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFQ 253
             P+FDP  S SY+ +SC S  C  ++ + C +  C YEV YGDGS+T G LA ET +F 
Sbjct: 186 SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 245

Query: 254 HSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESS 313
            +  + N+ +GCGH N G+FIGAAGL+G+GGG++S   QL   +   F YCLV   ++S+
Sbjct: 246 KT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST 305

Query: 314 STLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGI 373
            +L F  +  P  +   PLV+N R P+F YV + G+ VGG  +P+    F++ E+G GG+
Sbjct: 306 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGV 365

Query: 374 IVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFIL 433
           ++D+GT +T +P+  Y   RD F   T NLP A GVS FDTCYDLS   +V VPT++F  
Sbjct: 366 VMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYF 425

Query: 434 PGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFST 490
                L LPA+N L  VD +GT+C AF  S   LSIIGN+QQ+GI+VS+D AN  VGF  
Sbjct: 426 TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 470

BLAST of CSPI01G03690 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 336.7 bits (862), Expect = 2.5e-92
Identity = 197/439 (44.87%), Postives = 267/439 (60.82%), Query Frame = 1

Query: 68  SSSPLSLSLHPRLTVHNPSYEDYG--SLVRARLARGAARAQSLNRKLELSLKGGKQFGRR 127
           S S  SLS+H        S+ D     L   RL R + R +S+     +S   G+   +R
Sbjct: 55  SESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVST--GRNATKR 114

Query: 128 INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGEN 187
                +    +  V SG SQG+GEYF R+GVG P  + + V DTGSDV WLQC PC    
Sbjct: 115 T--PRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC---K 174

Query: 188 GCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAA-C---DANSCIYEVEYGDGSFTVGE 247
            CY Q   IFDPK S +++++ C S  C  LD+++ C    + +C+Y+V YGDGSFT G+
Sbjct: 175 ACYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGD 234

Query: 248 LATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT---SFSYC 307
            +TET +F H   + ++P+GCGHDNEGLF+GAAGL+GLG G +S  SQ +      FSYC
Sbjct: 235 FSTETLTF-HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYC 294

Query: 308 LVDLDSESSS-----TLDF-NADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP-I 367
           LVD  S  SS     T+ F NA  P  S+ +PL+ N +  TF Y++++G+SVGG  +P +
Sbjct: 295 LVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGV 354

Query: 368 SSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAF-VGLTKNLPPAPGVSPFDTCYD 427
           S S F++D +G+GG+I+DSGT++T +    Y  LRDAF +G TK L  AP  S FDTC+D
Sbjct: 355 SESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATK-LKRAPSYSLFDTCFD 414

Query: 428 LSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQG 487
           LS  + V+VPT+ F   G   + LPA N L  V++ G FC AF  +   LSIIGN+QQQG
Sbjct: 415 LSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 474

Query: 488 IRVSYDLANSLVGFSTDKC 490
            RV+YDL  S VGF +  C
Sbjct: 475 FRVAYDLVGSRVGFLSRAC 483

BLAST of CSPI01G03690 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 330.9 bits (847), Expect = 1.4e-90
Identity = 205/492 (41.67%), Postives = 286/492 (58.13%), Query Frame = 1

Query: 8   AFLFLTFFTFLHFPSILSR-KLTTQSPYSTNTFDVS-ASINQALNALSIKPKPFQTTHSN 67
           A LF   F FL  PS  S     T  P S +    S  S     ++ S+    F++  S+
Sbjct: 7   ALLFSLCFFFLSLPSFSSLPSFQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESG-SD 66

Query: 68  YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR 127
             SSS ++L+L     + +    D   L  +RL R + R +S+      +    +  GR 
Sbjct: 67  SESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSI------ATLAAQIPGRN 126

Query: 128 INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGEN 187
           +  +      ++ V SG SQG+GEYF R+GVG P +  + V DTGSD+ WLQC PC    
Sbjct: 127 VTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---R 186

Query: 188 GCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDAN--SCIYEVEYGDGSFTVGELA 247
            CY Q  PIFDP+ S +Y+++ C S  C  LD A C+    +C+Y+V YGDGSFTVG+ +
Sbjct: 187 RCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFS 246

Query: 248 TETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT---SFSYCLV 307
           TET +F+  N +  + +GCGHDNEGLF+GAAGL+GLG G +S   Q        FSYCLV
Sbjct: 247 TETLTFRR-NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLV 306

Query: 308 DLDSES--SSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP-ISSSSFE 367
           D  + S  SS +  NA     +  +PL+ N +  TF YV ++G+SVGG  +P +++S F+
Sbjct: 307 DRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFK 366

Query: 368 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNV 427
           +D+ G+GG+I+DSGT++T +    Y  +RDAF    K L  AP  S FDTC+DLS+ + V
Sbjct: 367 LDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEV 426

Query: 428 EVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 487
           +VPT+     G + + LPA N L  VD+ G FC AF  +   LSIIGN+QQQG RV YDL
Sbjct: 427 KVPTVVLHFRGAD-VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDL 484

Query: 488 ANSLVGFSTDKC 490
           A+S VGF+   C
Sbjct: 487 ASSRVGFAPGGC 484

BLAST of CSPI01G03690 vs. NCBI nr
Match: gi|778664722|ref|XP_004138237.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 961.4 bits (2484), Expect = 5.8e-277
Identity = 484/489 (98.98%), Postives = 486/489 (99.39%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYAFLFLTFF  LHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           CDGENGCYKQIGPIFDPKSSSSYS LSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG
Sbjct: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240

Query: 241 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATETFSF+HSNSIPNLPIGCGHDNEGLF+GAAGLIGLGGGAISLSSQLEATSFSYCLV
Sbjct: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE
Sbjct: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP
Sbjct: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
           TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS
Sbjct: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480

Query: 481 LVGFSTDKC 490
           LVGFSTDKC
Sbjct: 481 LVGFSTDKC 489

BLAST of CSPI01G03690 vs. NCBI nr
Match: gi|659106557|ref|XP_008453383.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo])

HSP 1 Score: 839.0 bits (2166), Expect = 4.4e-240
Identity = 427/489 (87.32%), Postives = 452/489 (92.43%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYA LFLT FTFL FPSILSRKLT QSPYST TFDVSASINQALNALSIKPKPFQ
Sbjct: 2   MNTSLSYALLFLTIFTFLQFPSILSRKLTAQSPYSTTTFDVSASINQALNALSIKPKPFQ 61

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           T HS YHS+SPLSLSLHPRLTVHNPSY+DYG+LVRARLAR A R QSLNRKLELSL G K
Sbjct: 62  T-HS-YHSNSPLSLSLHPRLTVHNPSYKDYGTLVRARLARHATRVQSLNRKLELSLNGAK 121

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFG+RINGS STNSLTAPVTSGAS G GEYFARIGVGQPVQS+F VPDTGSDV+WLQC+P
Sbjct: 122 QFGKRINGSASTNSLTAPVTSGASHGDGEYFARIGVGQPVQSFFLVPDTGSDVTWLQCKP 181

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           C  EN C+KQ+ PIFDPKSSSSYSSLSC+SEQC LLDEA C +NSCIYEVEYGDGSFT+G
Sbjct: 182 CANENACFKQLDPIFDPKSSSSYSSLSCNSEQCQLLDEAGCSSNSCIYEVEYGDGSFTIG 241

Query: 241 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATET SF +SNSIPNLPIGCGHDNEGLF  AAGLIGLGGGAISLSSQL+A+SFSYCLV
Sbjct: 242 ELATETLSFGNSNSIPNLPIGCGHDNEGLFDAAAGLIGLGGGAISLSSQLQASSFSYCLV 301

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDS+SSSTLDFNADQPSDSLTSPLVKN+RFP+FRYVKVIGMSVGGK LPISSS FEIDE
Sbjct: 302 DLDSDSSSTLDFNADQPSDSLTSPLVKNNRFPSFRYVKVIGMSVGGKRLPISSSRFEIDE 361

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTIT++PSDVYDVLRDAFVGLT NLP APGVSPFDTCYDLSSQS+VEVP
Sbjct: 362 SGSGGIIVDSGTTITQLPSDVYDVLRDAFVGLTTNLPTAPGVSPFDTCYDLSSQSSVEVP 421

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
            IAFILPG  SL+LPAKNCL QVDSAGTFCLAFLP TFPLSIIGNVQQQGIRVSYDL NS
Sbjct: 422 IIAFILPGGKSLKLPAKNCLIQVDSAGTFCLAFLPGTFPLSIIGNVQQQGIRVSYDLDNS 481

Query: 481 LVGFSTDKC 490
           +VGF+T+KC
Sbjct: 482 IVGFATNKC 488

BLAST of CSPI01G03690 vs. NCBI nr
Match: gi|659106559|ref|XP_008453384.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo])

HSP 1 Score: 687.6 bits (1773), Expect = 1.6e-194
Identity = 359/490 (73.27%), Postives = 413/490 (84.29%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           M TSLS  FLFLT FT L F SILSRKLT QSPYST+ FDV AS NQALNALSIKPK  Q
Sbjct: 1   MKTSLSSVFLFLTIFTSLQFSSILSRKLT-QSPYSTSIFDVLASTNQALNALSIKPKHLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           T HS+  +SS LSL L+PRL++HNPSY+DY SLVRARLAR AAR Q LNR LE SL GGK
Sbjct: 61  T-HSHLPNSS-LSLPLYPRLSLHNPSYKDYDSLVRARLARDAARVQFLNRNLEHSLNGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG   NGS   +S+TAPV SG S+G+G EY A++GVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 DFGEVTNGSLIGDSITAPVVSGQSKGSGAEYLAQVGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQI PIFDPKSSSSY+ LSC+S+QC LLD   C++ +CIY+V YGDGSFT 
Sbjct: 181 PCATENACYKQIDPIFDPKSSSSYTPLSCNSQQCGLLDRPNCNSGTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V++DS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKVIG+SVGGK LPISS+ FEID
Sbjct: 301 VNMDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVIGISVGGKTLPISSTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+LSSQSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNLSSQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L G  SL+LPA+N L +VD+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSGGTSLRLPARNYLIRVDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of CSPI01G03690 vs. NCBI nr
Match: gi|449440933|ref|XP_004138238.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 678.3 bits (1749), Expect = 9.8e-192
Identity = 352/490 (71.84%), Postives = 407/490 (83.06%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLS  FLFLT FT L FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQ  PIFDPKSSSSYS LSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of CSPI01G03690 vs. NCBI nr
Match: gi|778664719|ref|XP_004138341.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 579.3 bits (1492), Expect = 6.2e-162
Identity = 280/334 (83.83%), Postives = 306/334 (91.62%), Query Frame = 1

Query: 156 VGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHL 215
           VGQP Q  FFV DTGSDV+WLQC PC G+NGCY+QI PIFDP+ SSSY+ +SCDSEQC L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 216 LDEAACDANSCIYEVEYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAG 275
           LDEA C+ NSCIY+VEYGDGSFT+GELATET +F HSNSIPN+ IGCGHDNEGLF+GA G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 276 LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFR 335
           LIGLGGGAIS+SSQL+A+SFSYCLVD+DS S STLDFN D PSDSL SPLVKNDRFP+FR
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 336 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 395
           YVKVIGMSVGGKPLPISSS FEIDESG GGIIVDSGTTIT++PSDVY+VLR+AF+GLT N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 396 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 455
           LPPAP +SPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 456 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
           +TFPLSIIGN QQQGIRVSYDL NSLVGFST+KC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASPG1_ARATH3.2e-12649.29Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
ASPG2_ARATH5.4e-9741.82Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
APF2_ARATH2.4e-8941.67Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
NEP1_NEPGR2.3e-6839.90Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
NEP2_NEPGR7.6e-6740.16Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LS14_CUCSA4.1e-27798.98Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1[more]
A0A0A0LPJ3_CUCSA6.9e-19271.84Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1[more]
A0A0A0LPR7_CUCSA4.3e-16283.83Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1[more]
M5WPB5_PRUPE8.8e-13954.64Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1[more]
A0A0A0LQD2_CUCSA1.7e-13753.59Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G25510.16.7e-13050.31 Eukaryotic aspartyl protease family protein[more]
AT3G18490.11.8e-12749.29 Eukaryotic aspartyl protease family protein[more]
AT3G20015.13.0e-9841.82 Eukaryotic aspartyl protease family protein[more]
AT3G61820.12.5e-9244.87 Eukaryotic aspartyl protease family protein[more]
AT1G01300.11.4e-9041.67 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|778664722|ref|XP_004138237.2|5.8e-27798.98PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus][more]
gi|659106557|ref|XP_008453383.1|4.4e-24087.32PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo][more]
gi|659106559|ref|XP_008453384.1|1.6e-19473.27PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo][more]
gi|449440933|ref|XP_004138238.1|9.8e-19271.84PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus][more]
gi|778664719|ref|XP_004138341.2|6.2e-16283.83PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0050896 response to stimulus
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G03690.1CSPI01G03690.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 2..24
score: 4.3E-200coord: 62..489
score: 4.3E
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 320..489
score: 3.2E-44coord: 134..308
score: 6.2
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 145..489
score: 1.54
NoneNo IPR availablePANTHERPTHR13683:SF274ASPARTYL PROTEASE FAMILY PROTEINcoord: 2..24
score: 4.3E-200coord: 62..489
score: 4.3E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CSPI01G03690Cucumber (Gy14) v1cgycpiB310
CSPI01G03690Cucumber (Gy14) v1cgycpiB428
CSPI01G03690Cucurbita maxima (Rimu)cmacpiB062
CSPI01G03690Cucurbita maxima (Rimu)cmacpiB086
CSPI01G03690Cucurbita maxima (Rimu)cmacpiB223
CSPI01G03690Cucurbita maxima (Rimu)cmacpiB397
CSPI01G03690Cucurbita maxima (Rimu)cmacpiB408
CSPI01G03690Cucurbita maxima (Rimu)cmacpiB695
CSPI01G03690Cucurbita moschata (Rifu)cmocpiB053
CSPI01G03690Cucurbita moschata (Rifu)cmocpiB079
CSPI01G03690Cucurbita moschata (Rifu)cmocpiB209
CSPI01G03690Cucurbita moschata (Rifu)cmocpiB394
CSPI01G03690Cucumber (Chinese Long) v2cpicuB000
CSPI01G03690Cucumber (Chinese Long) v2cpicuB006
CSPI01G03690Cucumber (Chinese Long) v2cpicuB038
CSPI01G03690Melon (DHL92) v3.5.1cpimeB006
CSPI01G03690Melon (DHL92) v3.5.1cpimeB029
CSPI01G03690Melon (DHL92) v3.5.1cpimeB056
CSPI01G03690Watermelon (Charleston Gray)cpiwcgB061
CSPI01G03690Watermelon (Charleston Gray)cpiwcgB067
CSPI01G03690Watermelon (Charleston Gray)cpiwcgB084
CSPI01G03690Watermelon (97103) v1cpiwmB040
CSPI01G03690Watermelon (97103) v1cpiwmB068
CSPI01G03690Watermelon (97103) v1cpiwmB100
CSPI01G03690Cucurbita pepo (Zucchini)cpecpiB023
CSPI01G03690Cucurbita pepo (Zucchini)cpecpiB353
CSPI01G03690Cucurbita pepo (Zucchini)cpecpiB381
CSPI01G03690Cucurbita pepo (Zucchini)cpecpiB509
CSPI01G03690Cucurbita pepo (Zucchini)cpecpiB657
CSPI01G03690Bottle gourd (USVL1VR-Ls)cpilsiB025
CSPI01G03690Bottle gourd (USVL1VR-Ls)cpilsiB054
CSPI01G03690Bottle gourd (USVL1VR-Ls)cpilsiB071
CSPI01G03690Melon (DHL92) v3.6.1cpimedB006
CSPI01G03690Melon (DHL92) v3.6.1cpimedB029
CSPI01G03690Melon (DHL92) v3.6.1cpimedB050
CSPI01G03690Cucumber (Gy14) v2cgybcpiB001
CSPI01G03690Cucumber (Gy14) v2cgybcpiB005
CSPI01G03690Cucumber (Gy14) v2cgybcpiB201
CSPI01G03690Silver-seed gourdcarcpiB0419
CSPI01G03690Silver-seed gourdcarcpiB0497
CSPI01G03690Silver-seed gourdcarcpiB0550
CSPI01G03690Silver-seed gourdcarcpiB0628
CSPI01G03690Silver-seed gourdcarcpiB0876
CSPI01G03690Silver-seed gourdcarcpiB1078
CSPI01G03690Cucumber (Chinese Long) v3cpicucB001
CSPI01G03690Cucumber (Chinese Long) v3cpicucB004
CSPI01G03690Cucumber (Chinese Long) v3cpicucB043
CSPI01G03690Watermelon (97103) v2cpiwmbB057
CSPI01G03690Watermelon (97103) v2cpiwmbB064
CSPI01G03690Watermelon (97103) v2cpiwmbB080
CSPI01G03690Wax gourdcpiwgoB039
CSPI01G03690Wax gourdcpiwgoB012
CSPI01G03690Wax gourdcpiwgoB072
CSPI01G03690Wild cucumber (PI 183967)cpicpiB001
CSPI01G03690Wild cucumber (PI 183967)cpicpiB030