CSPI01G03690.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI01G03690.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Eukaryotic aspartyl protease family protein
Location: Chr1:2313591..2315060 (+)
Sequence length: 1470
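
The reported sequence length can be cross-checked against the location. The sketch below assumes the coordinates are 1-based and inclusive, which is common for genome feature annotations but is not stated on this page; the helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length = feature_length(2313591, 2315060)
print(length)        # 1470, matching the reported sequence length
print(length % 3)    # 0, so the span divides evenly into codons
```

Under that assumption the span is 1470 bases, a whole number of codons (490 including the stop codon), which is consistent with the gene, mRNA and CDS sequences listed for this feature being identical.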
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCACTTTCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACGACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTTGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGGGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTTCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCAACATTCTAATTCCATCCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTATTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGACGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTTCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCAAATGTGGAGGTGCCAACCATAGCATTTATTTTGCCGGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTCCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA

mRNA sequence

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCACTTTCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACGACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTTGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGGGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTTCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCAACATTCTAATTCCATCCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTATTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGACGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTTCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCAAATGTGGAGGTGCCAACCATAGCATTTATTTTGCCGGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTCCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA

Coding sequence (CDS)

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCACTTTCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACGACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTTGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGGGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTTCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCAACATTCTAATTCCATCCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTATTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGACGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTTCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCAAATGTGGAGGTGCCAACCATAGCATTTATTTTGCCGGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTCCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA
BLAST of CSPI01G03690.1 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 3.2e-126
Identity = 243/493 (49.29%), Positives = 329/493 (66.73%), Query Frame = 1

Query: 12  LTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSN------ 71
           +T   FL      SR L+T  P  TN  DV +S+ Q    LS+ P     T +       
Sbjct: 13  VTLSLFLTTTDASSRSLST--PPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSD 72

Query: 72  ---YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQF 131
              ++SSSPLSL LH R T     ++DY SL  +RL R ++R   +  K+  +++G  + 
Sbjct: 73  PVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRS 132

Query: 132 GRR-INGSDS---TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 191
             + +   D+   T  LT PV SGASQG+GEYF+RIGVG P +  + V DTGSDV+W+QC
Sbjct: 133 DLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQC 192

Query: 192 QPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 251
           +PC     CY+Q  P+F+P SSS+Y SL+C + QC LL+ +AC +N C+Y+V YGDGSFT
Sbjct: 193 EPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 252

Query: 252 VGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYC 311
           VGELAT+T +F +S  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q++ATSFSYC
Sbjct: 253 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYC 312

Query: 312 LVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFE 371
           LVD DS  SS+LDFN+ Q      T+PL++N +  TF YV + G SVGG+ + +  + F+
Sbjct: 313 LVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFD 372

Query: 372 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPP-APGVSPFDTCYDLSSQSN 431
           +D SGSGG+I+D GT +T + +  Y+ LRDAF+ LT NL   +  +S FDTCYD SS S 
Sbjct: 373 VDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST 432

Query: 432 VEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYD 490
           V+VPT+AF   G  SL LPAKN L  VD +GTFC AF P++  LSIIGNVQQQG R++YD
Sbjct: 433 VKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYD 492
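
The identity and positive-match percentages in an HSP header are the match counts divided by the alignment length. A minimal sketch that reproduces the figures reported for this first hit follows; the function name `percent` is hypothetical and the counts are copied from the header above.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts from the ASPG1_ARATH HSP: 243 identical and 329 positive columns out of 493.
print(percent(243, 493))  # 49.29
print(percent(329, 493))  # 66.73
```

The same calculation applies to every other HSP header in this record, since each reports its counts as a fraction of the alignment length.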

BLAST of CSPI01G03690.1 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 5.4e-97
Identity = 202/483 (41.82%), Positives = 280/483 (57.97%), Query Frame = 1

Query: 14  FFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSSSPLS 73
           FF FLH    L   L++ S  S   F +   +   L   +  P  F  TH +  SSS  +
Sbjct: 6   FFFFLH----LHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPD-FNNTHFSDESSSKYT 65

Query: 74  LSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRRINGSDS-- 133
           L L  R    + +Y ++   + AR+ R   R  ++ R++          G+ I  SDS  
Sbjct: 66  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRIS---------GKVIPSSDSRY 125

Query: 134 -TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 193
             N   + + SG  QG+GEYF RIGVG P +  + V D+GSD+ W+QCQPC     CYKQ
Sbjct: 126 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC---KLCYKQ 185

Query: 194 IGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFQ 253
             P+FDP  S SY+ +SC S  C  ++ + C +  C YEV YGDGS+T G LA ET +F 
Sbjct: 186 SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 245

Query: 254 HSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESS 313
            +  + N+ +GCGH N G+FIGAAGL+G+GGG++S   QL   +   F YCLV   ++S+
Sbjct: 246 KT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST 305

Query: 314 STLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGI 373
            +L F  +  P  +   PLV+N R P+F YV + G+ VGG  +P+    F++ E+G GG+
Sbjct: 306 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGV 365

Query: 374 IVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFIL 433
           ++D+GT +T +P+  Y   RD F   T NLP A GVS FDTCYDLS   +V VPT++F  
Sbjct: 366 VMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYF 425

Query: 434 PGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFST 490
                L LPA+N L  VD +GT+C AF  S   LSIIGN+QQ+GI+VS+D AN  VGF  
Sbjct: 426 TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 470

BLAST of CSPI01G03690.1 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 330.9 bits (847), Expect = 2.4e-89
Identity = 205/492 (41.67%), Positives = 286/492 (58.13%), Query Frame = 1

Query: 8   AFLFLTFFTFLHFPSILSR-KLTTQSPYSTNTFDVS-ASINQALNALSIKPKPFQTTHSN 67
           A LF   F FL  PS  S     T  P S +    S  S     ++ S+    F++  S+
Sbjct: 7   ALLFSLCFFFLSLPSFSSLPSFQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESG-SD 66

Query: 68  YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR 127
             SSS ++L+L     + +    D   L  +RL R + R +S+      +    +  GR 
Sbjct: 67  SESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSI------ATLAAQIPGRN 126

Query: 128 INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGEN 187
           +  +      ++ V SG SQG+GEYF R+GVG P +  + V DTGSD+ WLQC PC    
Sbjct: 127 VTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---R 186

Query: 188 GCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDAN--SCIYEVEYGDGSFTVGELA 247
            CY Q  PIFDP+ S +Y+++ C S  C  LD A C+    +C+Y+V YGDGSFTVG+ +
Sbjct: 187 RCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFS 246

Query: 248 TETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT---SFSYCLV 307
           TET +F+  N +  + +GCGHDNEGLF+GAAGL+GLG G +S   Q        FSYCLV
Sbjct: 247 TETLTFRR-NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLV 306

Query: 308 DLDSES--SSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP-ISSSSFE 367
           D  + S  SS +  NA     +  +PL+ N +  TF YV ++G+SVGG  +P +++S F+
Sbjct: 307 DRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFK 366

Query: 368 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNV 427
           +D+ G+GG+I+DSGT++T +    Y  +RDAF    K L  AP  S FDTC+DLS+ + V
Sbjct: 367 LDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEV 426

Query: 428 EVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 487
           +VPT+     G + + LPA N L  VD+ G FC AF  +   LSIIGN+QQQG RV YDL
Sbjct: 427 KVPTVVLHFRGAD-VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDL 484

Query: 488 ANSLVGFSTDKC 490
           A+S VGF+   C
Sbjct: 487 ASSRVGFAPGGC 484

BLAST of CSPI01G03690.1 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 2.3e-68
Identity = 154/386 (39.90%), Positives = 222/386 (57.51%), Query Frame = 1

Query: 111 KLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTG 170
           K +L  +  ++  RR+   ++  +  + V +    G GEY   + +G P Q +  + DTG
Sbjct: 56  KFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTG 115

Query: 171 SDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEV 230
           SD+ W QCQPC     C+ Q  PIF+P+ SSS+S+L C S+ C  L    C  N C Y  
Sbjct: 116 SDLIWTQCQPC---TQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTY 175

Query: 231 EYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIG-AAGLIGLGGGAISLSSQ 290
            YGDGS T G + TET +F  S SIPN+  GCG +N+G   G  AGL+G+G G +SL SQ
Sbjct: 176 GYGDGSETQGSMGTETLTF-GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQ 235

Query: 291 LEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSP---LVKNDRFPTFRYVKVIGMSVGG 350
           L+ T FSYC+  + S + S L   +   S +  SP   L+++ + PTF Y+ + G+SVG 
Sbjct: 236 LDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 295

Query: 351 KPLPISSSSFEID-ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSP- 410
             LPI  S+F ++  +G+GGII+DSGTT+T   ++ Y  +R  F+    NLP   G S  
Sbjct: 296 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSG 355

Query: 411 FDTCYDL-SSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSII 470
           FD C+   S  SN+++PT      G   L+LP++N  F   S G  CLA   S+  +SI 
Sbjct: 356 FDLCFQTPSDPSNLQIPTFVMHFDG-GDLELPSEN-YFISPSNGLICLAMGSSSQGMSIF 415

Query: 471 GNVQQQGIRVSYDLANSLVGFSTDKC 490
           GN+QQQ + V YD  NS+V F++ +C
Sbjct: 416 GNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI01G03690.1 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 7.6e-67
Identity = 155/386 (40.16%), Positives = 220/386 (56.99%), Query Frame = 1

Query: 112 LELSLKGGKQFGRRINGS-DSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTG 171
           ++ ++K G++  R IN    S++ +  PV +G     GEY   + +G P  S+  + DTG
Sbjct: 61  IKRAIKRGERRMRSINAMLQSSSGIETPVYAGD----GEYLMNVAIGTPDSSFSAIMDTG 120

Query: 172 SDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEV 231
           SD+ W QC+PC     C+ Q  PIF+P+ SSS+S+L C+S+ C  L    C+ N C Y  
Sbjct: 121 SDLIWTQCEPC---TQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTY 180

Query: 232 EYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIG-AAGLIGLGGGAISLSSQ 291
            YGDGS T G +ATETF+F+ ++S+PN+  GCG DN+G   G  AGLIG+G G +SL SQ
Sbjct: 181 GYGDGSTTQGYMATETFTFE-TSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ 240

Query: 292 LEATSFSYCLVDLDSESSSTLDFN---ADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGG 351
           L    FSYC+    S S STL      +  P  S ++ L+ +   PT+ Y+ + G++VGG
Sbjct: 241 LGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGG 300

Query: 352 KPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPA-PGVSPF 411
             L I SS+F++ + G+GG+I+DSGTT+T +P D Y+ +  AF     NLP      S  
Sbjct: 301 DNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI-NLPTVDESSSGL 360

Query: 412 DTCYDLSSQ-SNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF-LPSTFPLSII 471
            TC+   S  S V+VP I+    G   L L  +N L    + G  CLA    S   +SI 
Sbjct: 361 STCFQQPSDGSTVQVPEISMQFDG-GVLNLGEQNILIS-PAEGVICLAMGSSSQLGISIF 420

Query: 472 GNVQQQGIRVSYDLANSLVGFSTDKC 490
           GN+QQQ  +V YDL N  V F   +C
Sbjct: 421 GNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI01G03690.1 vs. TrEMBL
Match: A0A0A0LS14_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1)

HSP 1 Score: 961.4 bits (2484), Expect = 4.1e-277
Identity = 484/489 (98.98%), Positives = 486/489 (99.39%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYAFLFLTFF  LHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           CDGENGCYKQIGPIFDPKSSSSYS LSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG
Sbjct: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240

Query: 241 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATETFSF+HSNSIPNLPIGCGHDNEGLF+GAAGLIGLGGGAISLSSQLEATSFSYCLV
Sbjct: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE
Sbjct: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP
Sbjct: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
           TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS
Sbjct: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480

Query: 481 LVGFSTDKC 490
           LVGFSTDKC
Sbjct: 481 LVGFSTDKC 489

BLAST of CSPI01G03690.1 vs. TrEMBL
Match: A0A0A0LPJ3_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 6.9e-192
Identity = 352/490 (71.84%), Positives = 407/490 (83.06%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLS  FLFLT FT L FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQ  PIFDPKSSSSYS LSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of CSPI01G03690.1 vs. TrEMBL
Match: A0A0A0LPR7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1)

HSP 1 Score: 579.3 bits (1492), Expect = 4.3e-162
Identity = 280/334 (83.83%), Positives = 306/334 (91.62%), Query Frame = 1

Query: 156 VGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHL 215
           VGQP Q  FFV DTGSDV+WLQC PC G+NGCY+QI PIFDP+ SSSY+ +SCDSEQC L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 216 LDEAACDANSCIYEVEYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAG 275
           LDEA C+ NSCIY+VEYGDGSFT+GELATET +F HSNSIPN+ IGCGHDNEGLF+GA G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 276 LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFR 335
           LIGLGGGAIS+SSQL+A+SFSYCLVD+DS S STLDFN D PSDSL SPLVKNDRFP+FR
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 336 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 395
           YVKVIGMSVGGKPLPISSS FEIDESG GGIIVDSGTTIT++PSDVY+VLR+AF+GLT N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 396 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 455
           LPPAP +SPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 456 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
           +TFPLSIIGN QQQGIRVSYDL NSLVGFST+KC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

BLAST of CSPI01G03690.1 vs. TrEMBL
Match: M5WPB5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1)

HSP 1 Score: 501.9 bits (1291), Expect = 8.8e-139
Identity = 271/496 (54.64%), Positives = 351/496 (70.77%), Query Frame = 1

Query: 8   AFLFLTF---FTFLH-FPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKP---KPF- 67
           AFL+L     FT    FPS  SR L+ ++   T   DVSAS+ QA + LS  P   KP  
Sbjct: 5   AFLYLAILSAFTLTSLFPSTHSRSLSEET---TTLLDVSASLTQAHDVLSFNPQTLKPLD 64

Query: 68  -QTTHSNYHSSSPL----SLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLEL 127
            Q T +  H+ +PL    SL L PR  +HN  ++DY SLV++RL R +AR  SL+ KL+L
Sbjct: 65  RQETQAQAHTLTPLNSSFSLQLLPRDALHNSQHKDYESLVQSRLGRDSARVNSLHTKLQL 124

Query: 128 SLKGGKQFGRR-INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDV 187
            ++  K+     ++       L+ PV SG SQG+GEYF RIGVG P +S + V DTGSD+
Sbjct: 125 VVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSGEYFTRIGVGTPAKSLYMVLDTGSDI 184

Query: 188 SWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYG 247
           +WLQC+PC   + CY+Q  P+F+P  SS+Y  ++CDS QCH L  +AC A+ C+Y+V YG
Sbjct: 185 NWLQCEPC---SDCYQQTDPVFNPTGSSTYRPVTCDSAQCHSLHVSACRADKCLYQVSYG 244

Query: 248 DGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT 307
           DGS+TVG+  TET SF +S +I N+ +GCGHDNEGLF+GAAGL+GLGGGA+SL SQ +AT
Sbjct: 245 DGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKAT 304

Query: 308 SFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISS 367
           SFSYCLV+ DS +SSTL+FN+  PSDS+T+PL+K+ R  TF YV + G SVGG+P+ +  
Sbjct: 305 SFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVPP 364

Query: 368 SSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSS 427
           S FE+DESG+GGIIVDSGT IT + ++ Y+ LRDAF  LT++LP A G + FDTCYDLSS
Sbjct: 365 SVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLSS 424

Query: 428 QSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRV 487
           +S V+VPT++F+     SL LPAKN L  VDSAGTFC AF P++   SIIGNVQQQG RV
Sbjct: 425 RSRVQVPTVSFLFADGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTRV 484

Query: 488 SYDLANSLVGFSTDKC 490
           SYDLAN+ VGFS +KC
Sbjct: 485 SYDLANNRVGFSPNKC 494

BLAST of CSPI01G03690.1 vs. TrEMBL
Match: A0A0A0LQD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1)

HSP 1 Score: 497.7 bits (1280), Expect = 1.7e-137
Identity = 261/487 (53.59%), Positives = 338/487 (69.40%), Query Frame = 1

Query: 9   FLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYH- 68
           FLFL F +   FP  LSR  +  SP+S+ + DVSAS+ QA   L   P    +     H 
Sbjct: 10  FLFLFFLSL--FPFTLSRSSSHLSPHSSASLDVSASLQQANQVLKFDPTASISFQQQVHL 69

Query: 69  ----SSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFG 128
               SS   SL LHPR ++HN  ++DY SLV +RL+R ++R +S+  +LE +L   K+  
Sbjct: 70  VPSNSSFSFSLQLHPRDSLHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSD 129

Query: 129 RR-INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCD 188
              +        L+ P+ SG SQG+GEYF+R+GVGQP + ++ V DTGSD++WLQCQPC 
Sbjct: 130 LEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC- 189

Query: 189 GENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGEL 248
               CY+Q  PIFDP+SSSS++SL C+S+QC  L+ + C A+ C+Y+V YGDGSFTVGE 
Sbjct: 190 --TDCYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEF 249

Query: 249 ATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLVDL 308
             ET +F +S  I N+ +GCGHDNEGLF+G+AGL+GLGGG++SL+SQ++A+SFSYCLVD 
Sbjct: 250 VIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLVDR 309

Query: 309 DSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESG 368
           DS SSS L+FN+  PSDS+ +PL+K+ +  TF YV + GMSVGG+ L I  + F++D+SG
Sbjct: 310 DSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSG 369

Query: 369 SGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTI 428
            GGIIVDSGT IT + +  Y+ LRDAFV  T  L    G + FDTCYDLSSQS V +PT+
Sbjct: 370 YGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTV 429

Query: 429 AFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLV 488
           +F   G  SLQLP KN L  VDS GTFC AF P+T  LSIIGNVQQQG RV YDLANS+V
Sbjct: 430 SFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVV 489

Query: 489 GFSTDKC 490
           GFS  KC
Sbjct: 490 GFSPHKC 491

BLAST of CSPI01G03690.1 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 461.5 bits (1186), Expect = 6.7e-130
Identity = 246/489 (50.31%), Positives = 333/489 (68.10%), Query Frame = 1

Query: 4   SLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTH 63
           S +Y+F F  FF   H  S+ SR L   S  +T+  +V+ SI++     S +    Q   
Sbjct: 2   SPNYSFFFFIFFLTSH-SSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLN--QQEE 61

Query: 64  SNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKG-GKQF 123
             + +SS  SL LH R++V    + DY SL  ARL R  AR +SL  +L+L++    K  
Sbjct: 62  QTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKAD 121

Query: 124 GRRINGSDSTNS--LTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 183
            + I+   +T    + AP+ SG +QG+GEYF R+G+G+P +  + V DTGSDV+WLQC P
Sbjct: 122 LKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP 181

Query: 184 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 243
           C     CY Q  PIF+P SSSSY  LSCD+ QC+ L+ + C   +C+YEV YGDGS+TVG
Sbjct: 182 CAD---CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVG 241

Query: 244 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 303
           + ATET +   S  + N+ +GCGH NEGLF+GAAGL+GLGGG ++L SQL  TSFSYCLV
Sbjct: 242 DFATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLV 301

Query: 304 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 363
           D DS+S+ST+DF      D++ +PL++N +  TF Y+ + G+SVGG+ L I  SSFE+DE
Sbjct: 302 DRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDE 361

Query: 364 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 423
           SGSGGII+DSGT +T + +++Y+ LRD+FV  T +L  A GV+ FDTCY+LS+++ VEVP
Sbjct: 362 SGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVP 421

Query: 424 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 483
           T+AF  PG   L LPAKN +  VDS GTFCLAF P+   L+IIGNVQQQG RV++DLANS
Sbjct: 422 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 481

Query: 484 LVGFSTDKC 490
           L+GFS++KC
Sbjct: 482 LIGFSSNKC 483

BLAST of CSPI01G03690.1 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 453.4 bits (1165), Expect = 1.8e-127
Identity = 243/493 (49.29%), Positives = 329/493 (66.73%), Query Frame = 1

Query: 12  LTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSN------ 71
           +T   FL      SR L+T  P  TN  DV +S+ Q    LS+ P     T +       
Sbjct: 13  VTLSLFLTTTDASSRSLST--PPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSD 72

Query: 72  ---YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQF 131
              ++SSSPLSL LH R T     ++DY SL  +RL R ++R   +  K+  +++G  + 
Sbjct: 73  PVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRS 132

Query: 132 GRR-INGSDS---TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 191
             + +   D+   T  LT PV SGASQG+GEYF+RIGVG P +  + V DTGSDV+W+QC
Sbjct: 133 DLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQC 192

Query: 192 QPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 251
           +PC     CY+Q  P+F+P SSS+Y SL+C + QC LL+ +AC +N C+Y+V YGDGSFT
Sbjct: 193 EPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 252

Query: 252 VGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYC 311
           VGELAT+T +F +S  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q++ATSFSYC
Sbjct: 253 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYC 312

Query: 312 LVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFE 371
           LVD DS  SS+LDFN+ Q      T+PL++N +  TF YV + G SVGG+ + +  + F+
Sbjct: 313 LVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFD 372

Query: 372 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPP-APGVSPFDTCYDLSSQSN 431
           +D SGSGG+I+D GT +T + +  Y+ LRDAF+ LT NL   +  +S FDTCYD SS S 
Sbjct: 373 VDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST 432

Query: 432 VEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYD 490
           V+VPT+AF   G  SL LPAKN L  VD +GTFC AF P++  LSIIGNVQQQG R++YD
Sbjct: 433 VKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYD 492

BLAST of CSPI01G03690.1 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 356.3 bits (913), Expect = 3.0e-98
Identity = 202/483 (41.82%), Positives = 280/483 (57.97%), Query Frame = 1

Query: 14  FFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSSSPLS 73
           FF FLH    L   L++ S  S   F +   +   L   +  P  F  TH +  SSS  +
Sbjct: 6   FFFFLH----LHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPD-FNNTHFSDESSSKYT 65

Query: 74  LSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRRINGSDS-- 133
           L L  R    + +Y ++   + AR+ R   R  ++ R++          G+ I  SDS  
Sbjct: 66  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRIS---------GKVIPSSDSRY 125

Query: 134 -TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 193
             N   + + SG  QG+GEYF RIGVG P +  + V D+GSD+ W+QCQPC     CYKQ
Sbjct: 126 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC---KLCYKQ 185

Query: 194 IGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFQ 253
             P+FDP  S SY+ +SC S  C  ++ + C +  C YEV YGDGS+T G LA ET +F 
Sbjct: 186 SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 245

Query: 254 HSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESS 313
            +  + N+ +GCGH N G+FIGAAGL+G+GGG++S   QL   +   F YCLV   ++S+
Sbjct: 246 KT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST 305

Query: 314 STLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGI 373
            +L F  +  P  +   PLV+N R P+F YV + G+ VGG  +P+    F++ E+G GG+
Sbjct: 306 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGV 365

Query: 374 IVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFIL 433
           ++D+GT +T +P+  Y   RD F   T NLP A GVS FDTCYDLS   +V VPT++F  
Sbjct: 366 VMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYF 425

Query: 434 PGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFST 490
                L LPA+N L  VD +GT+C AF  S   LSIIGN+QQ+GI+VS+D AN  VGF  
Sbjct: 426 TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 470

BLAST of CSPI01G03690.1 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 336.7 bits (862), Expect = 2.5e-92
Identity = 197/439 (44.87%), Positives = 267/439 (60.82%), Query Frame = 1

Query: 68  SSSPLSLSLHPRLTVHNPSYEDYG--SLVRARLARGAARAQSLNRKLELSLKGGKQFGRR 127
           S S  SLS+H        S+ D     L   RL R + R +S+     +S   G+   +R
Sbjct: 55  SESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVST--GRNATKR 114

Query: 128 INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGEN 187
                +    +  V SG SQG+GEYF R+GVG P  + + V DTGSDV WLQC PC    
Sbjct: 115 T--PRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC---K 174

Query: 188 GCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAA-C---DANSCIYEVEYGDGSFTVGE 247
            CY Q   IFDPK S +++++ C S  C  LD+++ C    + +C+Y+V YGDGSFT G+
Sbjct: 175 ACYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGD 234

Query: 248 LATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT---SFSYC 307
            +TET +F H   + ++P+GCGHDNEGLF+GAAGL+GLG G +S  SQ +      FSYC
Sbjct: 235 FSTETLTF-HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYC 294

Query: 308 LVDLDSESSS-----TLDF-NADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP-I 367
           LVD  S  SS     T+ F NA  P  S+ +PL+ N +  TF Y++++G+SVGG  +P +
Sbjct: 295 LVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGV 354

Query: 368 SSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAF-VGLTKNLPPAPGVSPFDTCYD 427
           S S F++D +G+GG+I+DSGT++T +    Y  LRDAF +G TK L  AP  S FDTC+D
Sbjct: 355 SESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATK-LKRAPSYSLFDTCFD 414

Query: 428 LSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQG 487
           LS  + V+VPT+ F   G   + LPA N L  V++ G FC AF  +   LSIIGN+QQQG
Sbjct: 415 LSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 474

Query: 488 IRVSYDLANSLVGFSTDKC 490
            RV+YDL  S VGF +  C
Sbjct: 475 FRVAYDLVGSRVGFLSRAC 483

BLAST of CSPI01G03690.1 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 330.9 bits (847), Expect = 1.4e-90
Identity = 205/492 (41.67%), Positives = 286/492 (58.13%), Query Frame = 1

Query: 8   AFLFLTFFTFLHFPSILSR-KLTTQSPYSTNTFDVS-ASINQALNALSIKPKPFQTTHSN 67
           A LF   F FL  PS  S     T  P S +    S  S     ++ S+    F++  S+
Sbjct: 7   ALLFSLCFFFLSLPSFSSLPSFQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESG-SD 66

Query: 68  YHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR 127
             SSS ++L+L     + +    D   L  +RL R + R +S+      +    +  GR 
Sbjct: 67  SESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSI------ATLAAQIPGRN 126

Query: 128 INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGEN 187
           +  +      ++ V SG SQG+GEYF R+GVG P +  + V DTGSD+ WLQC PC    
Sbjct: 127 VTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---R 186

Query: 188 GCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDAN--SCIYEVEYGDGSFTVGELA 247
            CY Q  PIFDP+ S +Y+++ C S  C  LD A C+    +C+Y+V YGDGSFTVG+ +
Sbjct: 187 RCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFS 246

Query: 248 TETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEAT---SFSYCLV 307
           TET +F+  N +  + +GCGHDNEGLF+GAAGL+GLG G +S   Q        FSYCLV
Sbjct: 247 TETLTFRR-NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLV 306

Query: 308 DLDSES--SSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP-ISSSSFE 367
           D  + S  SS +  NA     +  +PL+ N +  TF YV ++G+SVGG  +P +++S F+
Sbjct: 307 DRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFK 366

Query: 368 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNV 427
           +D+ G+GG+I+DSGT++T +    Y  +RDAF    K L  AP  S FDTC+DLS+ + V
Sbjct: 367 LDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEV 426

Query: 428 EVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 487
           +VPT+     G + + LPA N L  VD+ G FC AF  +   LSIIGN+QQQG RV YDL
Sbjct: 427 KVPTVVLHFRGAD-VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDL 484

Query: 488 ANSLVGFSTDKC 490
           A+S VGF+   C
Sbjct: 487 ASSRVGFAPGGC 484

BLAST of CSPI01G03690.1 vs. NCBI nr
Match: gi|778664722|ref|XP_004138237.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 961.4 bits (2484), Expect = 5.8e-277
Identity = 484/489 (98.98%), Positives = 486/489 (99.39%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYAFLFLTFF  LHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           CDGENGCYKQIGPIFDPKSSSSYS LSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG
Sbjct: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240

Query: 241 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATETFSF+HSNSIPNLPIGCGHDNEGLF+GAAGLIGLGGGAISLSSQLEATSFSYCLV
Sbjct: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE
Sbjct: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP
Sbjct: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
           TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS
Sbjct: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480

Query: 481 LVGFSTDKC 490
           LVGFSTDKC
Sbjct: 481 LVGFSTDKC 489
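
The percentages on each HSP statistics line are the identical or positive column counts divided by the aligned length. The sketch below is a minimal illustration of that arithmetic, not part of the database output: the names `parse_hsp_stats`, `percent`, and `is_consistent` are hypothetical helpers, and the regular expression assumes the line layout used on this page.

```python
import re

# Pattern for the per-HSP statistics line as laid out on this page (assumed).
# "Posi?tives" also accepts the "Postives" spelling seen in some renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    if ident_len != pos_len:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


def is_consistent(stats, tolerance=0.01):
    """Check that the reported percentages match the reported counts."""
    length = stats["aligned_length"]
    ident_ok = abs(percent(stats["identical"], length) - stats["identity_pct"]) <= tolerance
    pos_ok = abs(percent(stats["positive"], length) - stats["positive_pct"]) <= tolerance
    return ident_ok and pos_ok


# Statistics line of the XP_004138237.2 hit, copied from this page.
stats = parse_hsp_stats(
    "Identity = 484/489 (98.98%), Positives = 486/489 (99.39%), Query Frame = 1"
)
print(stats["aligned_length"], is_consistent(stats))
```

The same check can be run on any other HSP statistics line in this record, for example the TAIR10 hits, to confirm that counts and percentages agree.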

BLAST of CSPI01G03690.1 vs. NCBI nr
Match: gi|659106557|ref|XP_008453383.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo])

HSP 1 Score: 839.0 bits (2166), Expect = 4.4e-240
Identity = 427/489 (87.32%), Positives = 452/489 (92.43%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYA LFLT FTFL FPSILSRKLT QSPYST TFDVSASINQALNALSIKPKPFQ
Sbjct: 2   MNTSLSYALLFLTIFTFLQFPSILSRKLTAQSPYSTTTFDVSASINQALNALSIKPKPFQ 61

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           T HS YHS+SPLSLSLHPRLTVHNPSY+DYG+LVRARLAR A R QSLNRKLELSL G K
Sbjct: 62  T-HS-YHSNSPLSLSLHPRLTVHNPSYKDYGTLVRARLARHATRVQSLNRKLELSLNGAK 121

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFG+RINGS STNSLTAPVTSGAS G GEYFARIGVGQPVQS+F VPDTGSDV+WLQC+P
Sbjct: 122 QFGKRINGSASTNSLTAPVTSGASHGDGEYFARIGVGQPVQSFFLVPDTGSDVTWLQCKP 181

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           C  EN C+KQ+ PIFDPKSSSSYSSLSC+SEQC LLDEA C +NSCIYEVEYGDGSFT+G
Sbjct: 182 CANENACFKQLDPIFDPKSSSSYSSLSCNSEQCQLLDEAGCSSNSCIYEVEYGDGSFTIG 241

Query: 241 ELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATET SF +SNSIPNLPIGCGHDNEGLF  AAGLIGLGGGAISLSSQL+A+SFSYCLV
Sbjct: 242 ELATETLSFGNSNSIPNLPIGCGHDNEGLFDAAAGLIGLGGGAISLSSQLQASSFSYCLV 301

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDS+SSSTLDFNADQPSDSLTSPLVKN+RFP+FRYVKVIGMSVGGK LPISSS FEIDE
Sbjct: 302 DLDSDSSSTLDFNADQPSDSLTSPLVKNNRFPSFRYVKVIGMSVGGKRLPISSSRFEIDE 361

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTIT++PSDVYDVLRDAFVGLT NLP APGVSPFDTCYDLSSQS+VEVP
Sbjct: 362 SGSGGIIVDSGTTITQLPSDVYDVLRDAFVGLTTNLPTAPGVSPFDTCYDLSSQSSVEVP 421

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
            IAFILPG  SL+LPAKNCL QVDSAGTFCLAFLP TFPLSIIGNVQQQGIRVSYDL NS
Sbjct: 422 IIAFILPGGKSLKLPAKNCLIQVDSAGTFCLAFLPGTFPLSIIGNVQQQGIRVSYDLDNS 481

Query: 481 LVGFSTDKC 490
           +VGF+T+KC
Sbjct: 482 IVGFATNKC 488

BLAST of CSPI01G03690.1 vs. NCBI nr
Match: gi|659106559|ref|XP_008453384.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo])

HSP 1 Score: 687.6 bits (1773), Expect = 1.6e-194
Identity = 359/490 (73.27%), Positives = 413/490 (84.29%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           M TSLS  FLFLT FT L F SILSRKLT QSPYST+ FDV AS NQALNALSIKPK  Q
Sbjct: 1   MKTSLSSVFLFLTIFTSLQFSSILSRKLT-QSPYSTSIFDVLASTNQALNALSIKPKHLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           T HS+  +SS LSL L+PRL++HNPSY+DY SLVRARLAR AAR Q LNR LE SL GGK
Sbjct: 61  T-HSHLPNSS-LSLPLYPRLSLHNPSYKDYDSLVRARLARDAARVQFLNRNLEHSLNGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG   NGS   +S+TAPV SG S+G+G EY A++GVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 DFGEVTNGSLIGDSITAPVVSGQSKGSGAEYLAQVGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQI PIFDPKSSSSY+ LSC+S+QC LLD   C++ +CIY+V YGDGSFT 
Sbjct: 181 PCATENACYKQIDPIFDPKSSSSYTPLSCNSQQCGLLDRPNCNSGTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V++DS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKVIG+SVGGK LPISS+ FEID
Sbjct: 301 VNMDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVIGISVGGKTLPISSTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+LSSQSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNLSSQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L G  SL+LPA+N L +VD+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSGGTSLRLPARNYLIRVDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of CSPI01G03690.1 vs. NCBI nr
Match: gi|449440933|ref|XP_004138238.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 678.3 bits (1749), Expect = 9.8e-192
Identity = 352/490 (71.84%), Positives = 407/490 (83.06%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFTFLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLS  FLFLT FT L FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQ  PIFDPKSSSSYS LSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of CSPI01G03690.1 vs. NCBI nr
Match: gi|778664719|ref|XP_004138341.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 579.3 bits (1492), Expect = 6.2e-162
Identity = 280/334 (83.83%), Positives = 306/334 (91.62%), Query Frame = 1

Query: 156 VGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSSLSCDSEQCHL 215
           VGQP Q  FFV DTGSDV+WLQC PC G+NGCY+QI PIFDP+ SSSY+ +SCDSEQC L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 216 LDEAACDANSCIYEVEYGDGSFTVGELATETFSFQHSNSIPNLPIGCGHDNEGLFIGAAG 275
           LDEA C+ NSCIY+VEYGDGSFT+GELATET +F HSNSIPN+ IGCGHDNEGLF+GA G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 276 LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFR 335
           LIGLGGGAIS+SSQL+A+SFSYCLVD+DS S STLDFN D PSDSL SPLVKNDRFP+FR
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 336 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 395
           YVKVIGMSVGGKPLPISSS FEIDESG GGIIVDSGTTIT++PSDVY+VLR+AF+GLT N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 396 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 455
           LPPAP +SPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 456 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
           +TFPLSIIGN QQQGIRVSYDL NSLVGFST+KC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
ASPG1_ARATH  3.2e-126  49.29         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
ASPG2_ARATH  5.4e-97   41.82         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
APF2_ARATH   2.4e-89   41.67         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP1_NEPGR   2.3e-68   39.90         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR   7.6e-67   40.16         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
Match Name        E-value   Identity (%)  Description
A0A0A0LS14_CUCSA  4.1e-277  98.98         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1
A0A0A0LPJ3_CUCSA  6.9e-192  71.84         Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1
A0A0A0LPR7_CUCSA  4.3e-162  83.83         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1
M5WPB5_PRUPE      8.8e-139  54.64         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1
A0A0A0LQD2_CUCSA  1.7e-137  53.59         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1
Match Name   E-value   Identity (%)  Description
AT1G25510.1  6.7e-130  50.31         Eukaryotic aspartyl protease family protein
AT3G18490.1  1.8e-127  49.29         Eukaryotic aspartyl protease family protein
AT3G20015.1  3.0e-98   41.82         Eukaryotic aspartyl protease family protein
AT3G61820.1  2.5e-92   44.87         Eukaryotic aspartyl protease family protein
AT1G01300.1  1.4e-90   41.67         Eukaryotic aspartyl protease family protein
Match Name                       E-value   Identity (%)  Description
gi|778664722|ref|XP_004138237.2|  5.8e-277  98.98         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|659106557|ref|XP_008453383.1|  4.4e-240  87.32         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo]
gi|659106559|ref|XP_008453384.1|  1.6e-194  73.27         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo]
gi|449440933|ref|XP_004138238.1|  9.8e-192  71.84         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|778664719|ref|XP_004138341.2|  6.2e-162  83.83         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR021109  Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
biological_process  GO:0050896      response to stimulus
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CSPI01G03690  CSPI01G03690  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name    Unique Name             Type
CSPI01G03690.1  CSPI01G03690.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
CSPI01G03690.1.cds1  CSPI01G03690.1.cds1  CDS


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description               Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 2..24, score: 4.3E-200
                                                                                                      coord: 62..489, score: 4.3E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10                                    coord: 320..489, score: 3.2E-44
                                                                                                      coord: 134..308, score: 6.2
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                    coord: 145..489, score: 1.54
None       No IPR available              PANTHER  PTHR13683:SF274   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 2..24, score: 4.3E-200
                                                                                                      coord: 62..489, score: 4.3E