CSPI05G08550 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI05G08550
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Eukaryotic aspartyl protease family protein
Location: Chr5: 7262823 .. 7264646 (+)
RNA-Seq Expression: CSPI05G08550
Synteny: CSPI05G08550
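
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a string like 'Chr5: 7262823 .. 7264646 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr5: 7262823 .. 7264646 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 1824 bases on the plus strand of Chr5.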
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AATGAAAAATGAGATTGGATTGGGCCCAAAAAGAGAGCCCAACAGTTGAAAGAAGAAGAAGCAAAGTTGTTCCCTCGTGTGATCCGTTTTCATAGTCTTACAGATTTTGATTTGAGTTCGAGTGTTCCTTCCAAAACAACAATGCCAGTCCTCTCCATTTCCCCATTCTTCCTTCTCATTCTTCTCTTCTCCTTCTTTCTCACACATCTCCCCAACCCCAATGCCACCGCCGTCGCTGCCCCCGCCGACTTCCTGAAGCTCCCCCTTCTTCACAAACCCCCCTTCTCCTCCCCTTCCCAATCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCTCGCCCCAACCCCACTCTCAAATCCCCTCTCATCTCCGGCGCTTCCACCGGTTCCGGCCAATACTTCGTCGACATCCGCCTCGGTACTCCTCCCCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTTAAATGCTCCGCCTGCCGCAACTGCTCTCACCATCCTCCTTCCTCCGCCTTCCTCCCCCGCCATTCCTCCTCCTTCTCCCCTTTCCATTGCTTCGACCCCCACTGCCGTCTCCTCCCCCACGCTCCTCACCATCTCTGTAACCACACGCGCCTCCACTCCCCTTGTCGCTTCCTCTACTCCTATGCCGATGGCTCCCTCTCCTCCGGCTTCTTCTCCAAAGAAACCACCACATTGAAGTCGCTCTCCGGGTCCGAAATCCATCTTAAAGGCCTCTCGTTCGGCTGCGGATTTCGGATCTCCGGTCCCAGCGTTTCGGGGGCTCAGTTCAATGGTGCACGTGGCGTCATGGGATTGGGTAGAGGCTCCATTTCCTTCTCTTCTCAACTCGGCCGCCGATTCGGCAACAAATTTTCTTACTGTCTTATGGATTACACTCTCTCTCCGCCGCCTACCAGCTTCTTAATGATCGGCGGCGGCCTCCACAGCCTCCCTCTCACCAATGCCACAAAAATCAGCTATACCCCTTTGCAGATTAACCCTCTTTCCCCCACATTCTACTACATTACCATCCACAGCATCACCATCGACGGCGTGAAATTACCCATCAACCCCGCCGTTTGGGAAATCGACGAACAGGGCAATGGCGGCACGGTGGTGGATTCAGGGACAACGCTAACCTACCTAACGAAGACAGCGTACGAGGAGGTGCTGAAGTCAGTAAGACGGCGAGTGAAACTACCAAATGCTGCAGAGTTGACACCGGGATTCGATCTATGCGTGAATGCGTCGGGAGAGTCGCGGCGGCCGAGTCTGCCGCGACTGAGATTCCGACTGGGAGGTGGGGCGGTGTTTGCTCCACCGCCGAGGAACTATTTTCTGGAAACAGAGGAGGGAGTGATGTGTTTGGCGATCCGAGCGGTGGAATCGGGAAATGGGTTTTCGGTGATCGGAAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAAGGAGGAATCGAGGCTGGGTTTTACAAGGCGGGGATGTGGGCTTCCATGAACGAACATCATCAACTTTTGGAACTTTGTTTCTTTATTTTTGTTAACTCGACTCACTGAGTTTGGAACTCGCAATCAATTAATTTTCGATTTGGTAAAATTATTATTATTATCATTATCATTATTATTATTATTATTAAAAAAGAGGTTTTTAAATTTTTAATAATAATAAGTTTGTACTTTGGATTGGATTCTCTTTTGTATACTACAAAAACTCATTGATTTTTGGACATTATGTTTTATTTGTATCACACCTTCCTTTTCTTTCCCCCCAACCTTACTTATATTTATATAAATCAATCCA

mRNA sequence

AATGAAAAATGAGATTGGATTGGGCCCAAAAAGAGAGCCCAACAGTTGAAAGAAGAAGAAGCAAAGTTGTTCCCTCGTGTGATCCGTTTTCATAGTCTTACAGATTTTGATTTGAGTTCGAGTGTTCCTTCCAAAACAACAATGCCAGTCCTCTCCATTTCCCCATTCTTCCTTCTCATTCTTCTCTTCTCCTTCTTTCTCACACATCTCCCCAACCCCAATGCCACCGCCGTCGCTGCCCCCGCCGACTTCCTGAAGCTCCCCCTTCTTCACAAACCCCCCTTCTCCTCCCCTTCCCAATCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCTCGCCCCAACCCCACTCTCAAATCCCCTCTCATCTCCGGCGCTTCCACCGGTTCCGGCCAATACTTCGTCGACATCCGCCTCGGTACTCCTCCCCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTTAAATGCTCCGCCTGCCGCAACTGCTCTCACCATCCTCCTTCCTCCGCCTTCCTCCCCCGCCATTCCTCCTCCTTCTCCCCTTTCCATTGCTTCGACCCCCACTGCCGTCTCCTCCCCCACGCTCCTCACCATCTCTGTAACCACACGCGCCTCCACTCCCCTTGTCGCTTCCTCTACTCCTATGCCGATGGCTCCCTCTCCTCCGGCTTCTTCTCCAAAGAAACCACCACATTGAAGTCGCTCTCCGGGTCCGAAATCCATCTTAAAGGCCTCTCGTTCGGCTGCGGATTTCGGATCTCCGGTCCCAGCGTTTCGGGGGCTCAGTTCAATGGTGCACGTGGCGTCATGGGATTGGGTAGAGGCTCCATTTCCTTCTCTTCTCAACTCGGCCGCCGATTCGGCAACAAATTTTCTTACTGTCTTATGGATTACACTCTCTCTCCGCCGCCTACCAGCTTCTTAATGATCGGCGGCGGCCTCCACAGCCTCCCTCTCACCAATGCCACAAAAATCAGCTATACCCCTTTGCAGATTAACCCTCTTTCCCCCACATTCTACTACATTACCATCCACAGCATCACCATCGACGGCGTGAAATTACCCATCAACCCCGCCGTTTGGGAAATCGACGAACAGGGCAATGGCGGCACGGTGGTGGATTCAGGGACAACGCTAACCTACCTAACGAAGACAGCGTACGAGGAGGTGCTGAAGTCAGTAAGACGGCGAGTGAAACTACCAAATGCTGCAGAGTTGACACCGGGATTCGATCTATGCGTGAATGCGTCGGGAGAGTCGCGGCGGCCGAGTCTGCCGCGACTGAGATTCCGACTGGGAGGTGGGGCGGTGTTTGCTCCACCGCCGAGGAACTATTTTCTGGAAACAGAGGAGGGAGTGATGTGTTTGGCGATCCGAGCGGTGGAATCGGGAAATGGGTTTTCGGTGATCGGAAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAAGGAGGAATCGAGGCTGGGTTTTACAAGGCGGGGATGTGGGCTTCCATGAACGAACATCATCAACTTTTGGAACTTTGTTTCTTTATTTTTGTTAACTCGACTCACTGAGTTTGGAACTCGCAATCAATTAATTTTCGATTTGGTAAAATTATTATTATTATCATTATCATTATTATTATTATTATTAAAAAAGAGGTTTTTAAATTTTTAATAATAATAAGTTTGTACTTTGGATTGGATTCTCTTTTGTATACTACAAAAACTCATTGATTTTTGGACATTATGTTTTATTTGTATCACACCTTCCTTTTCTTTCCCCCCAACCTTACTTATATTTATATAAATCAATCCA

Coding sequence (CDS)

ATGCCAGTCCTCTCCATTTCCCCATTCTTCCTTCTCATTCTTCTCTTCTCCTTCTTTCTCACACATCTCCCCAACCCCAATGCCACCGCCGTCGCTGCCCCCGCCGACTTCCTGAAGCTCCCCCTTCTTCACAAACCCCCCTTCTCCTCCCCTTCCCAATCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCTCGCCCCAACCCCACTCTCAAATCCCCTCTCATCTCCGGCGCTTCCACCGGTTCCGGCCAATACTTCGTCGACATCCGCCTCGGTACTCCTCCCCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTTAAATGCTCCGCCTGCCGCAACTGCTCTCACCATCCTCCTTCCTCCGCCTTCCTCCCCCGCCATTCCTCCTCCTTCTCCCCTTTCCATTGCTTCGACCCCCACTGCCGTCTCCTCCCCCACGCTCCTCACCATCTCTGTAACCACACGCGCCTCCACTCCCCTTGTCGCTTCCTCTACTCCTATGCCGATGGCTCCCTCTCCTCCGGCTTCTTCTCCAAAGAAACCACCACATTGAAGTCGCTCTCCGGGTCCGAAATCCATCTTAAAGGCCTCTCGTTCGGCTGCGGATTTCGGATCTCCGGTCCCAGCGTTTCGGGGGCTCAGTTCAATGGTGCACGTGGCGTCATGGGATTGGGTAGAGGCTCCATTTCCTTCTCTTCTCAACTCGGCCGCCGATTCGGCAACAAATTTTCTTACTGTCTTATGGATTACACTCTCTCTCCGCCGCCTACCAGCTTCTTAATGATCGGCGGCGGCCTCCACAGCCTCCCTCTCACCAATGCCACAAAAATCAGCTATACCCCTTTGCAGATTAACCCTCTTTCCCCCACATTCTACTACATTACCATCCACAGCATCACCATCGACGGCGTGAAATTACCCATCAACCCCGCCGTTTGGGAAATCGACGAACAGGGCAATGGCGGCACGGTGGTGGATTCAGGGACAACGCTAACCTACCTAACGAAGACAGCGTACGAGGAGGTGCTGAAGTCAGTAAGACGGCGAGTGAAACTACCAAATGCTGCAGAGTTGACACCGGGATTCGATCTATGCGTGAATGCGTCGGGAGAGTCGCGGCGGCCGAGTCTGCCGCGACTGAGATTCCGACTGGGAGGTGGGGCGGTGTTTGCTCCACCGCCGAGGAACTATTTTCTGGAAACAGAGGAGGGAGTGATGTGTTTGGCGATCCGAGCGGTGGAATCGGGAAATGGGTTTTCGGTGATCGGAAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAAGGAGGAATCGAGGCTGGGTTTTACAAGGCGGGGATGTGGGCTTCCATGA

Protein sequence

MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP*
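
The protein sequence above is the conceptual translation of the CDS. A minimal sketch of that relationship follows, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is shown on the first nine and last four codons of the CDS rather than on the full sequence.

```python
# Standard genetic code (assumed: NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 27 bases and last 12 bases of the CDS listed above.
print(translate("ATGCCAGTCCTCTCCATTTCCCCATTC"))  # start of the protein
print(translate("GGGCTTCCATGA"))                 # end of the protein, including the stop
```

The two fragments translate to `MPVLSISPF` and `GLP*`, which agree with the start and end of the listed protein sequence.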
Homology
BLAST of CSPI05G08550 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 1.3e-58
Identity = 154/460 (33.48%), Positives = 233/460 (50.65%), Query Frame = 0

Query: 14  LLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS------ 73
           LL S F +   + +++++    D +     +K P    S  L  D+ R+  + +      
Sbjct: 55  LLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIP 114

Query: 74  ------RPNP-TLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 133
                  P P    S ++SG S GSG+YF  + +GTP + + +V DTGSD+VW++C+ CR
Sbjct: 115 GRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCR 174

Query: 134 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 193
            C +      F PR S +++   C  PHCR L  A    CN  R    C +  SY DGS 
Sbjct: 175 RC-YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG---CNTRR--KTCLYQVSYGDGSF 234

Query: 194 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 253
           + G FS ET T +        +KG++ GCG    G       F GA G++GLG+G +SF 
Sbjct: 235 TVGDFSTETLTFR-----RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLSFP 294

Query: 254 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 313
            Q G RF  KFSYCL+D + S  P+S +     +  +         +TPL  NP   TFY
Sbjct: 295 GQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDTFY 354

Query: 314 YITIHSITIDGVKLP-INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK 373
           Y+ +  I++ G ++P +  +++++D+ GNGG ++DSGT++T L + AY  +  + R   K
Sbjct: 355 YVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAK 414

Query: 374 LPNAAELTPGFDLCVNAS--GESRRPSLPRLRFRLGGGAVFAPPPRNYFLETE-EGVMCL 433
               A     FD C + S   E + P++  L FR   GA  + P  NY +  +  G  C 
Sbjct: 415 TLKRAPDFSLFDTCFDLSNMNEVKVPTVV-LHFR---GADVSLPATNYLIPVDTNGKFCF 474

Query: 434 AIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           A     +  G S+IGN+ QQGF + +D   SR+GF   GC
Sbjct: 475 AFAG--TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
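
The percentages in each HSP summary line are the stated match counts divided by the alignment length. The sketch below recomputes them from the counts; the `hsp_fractions` helper and its regular expression are assumptions made for illustration and are not part of any BLAST tool.

```python
import re

def hsp_fractions(summary_line):
    """Return {label: (matches, length, percent_string)} for each 'X = a/b (p%)' field."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+) \(", summary_line):
        pct = 100.0 * int(matches) / int(length)
        result[label] = (int(matches), int(length), f"{pct:.2f}")
    return result

# Counts taken from the first Swiss-Prot hit (Q9LNJ3) reported above.
line = "Identity = 154/460 (33.48%), Positives = 233/460 (50.65%), Query Frame = 0"
for label, (matches, length, pct) in hsp_fractions(line).items():
    print(f"{label}: {matches}/{length} = {pct}%")
```

For this hit the recomputed values are 33.48% identity and 50.65% positives over an alignment length of 460 columns, gaps included.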

BLAST of CSPI05G08550 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 7.9e-53
Identity = 127/374 (33.96%), Positives = 194/374 (51.87%), Query Frame = 0

Query: 84  GSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFH 143
           G G+Y +++ +GTP  S   + DTGSDL+W +C  C  C    P+  F P+ SSSFS   
Sbjct: 92  GDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQC-FSQPTPIFNPQDSSSFSTLP 151

Query: 144 CFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLK 203
           C   +C+ L   P   CN    ++ C++ Y Y DGS + G+ + ET T ++ S     + 
Sbjct: 152 CESQYCQDL---PSETCN----NNECQYTYGYGDGSTTQGYMATETFTFETSS-----VP 211

Query: 204 GLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPP 263
            ++FGCG    G      Q NGA G++G+G G +S  SQLG     +FSYC+  Y  S P
Sbjct: 212 NIAFGCGEDNQG----FGQGNGA-GLIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSP 271

Query: 264 PTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEI 323
            T  L +G     +P  + +    T L  + L+PT+YYIT+  IT+ G  L I  + +++
Sbjct: 272 ST--LALGSAASGVPEGSPS----TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQL 331

Query: 324 DEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRP 383
            + G GG ++DSGTTLTYL + AY  V ++   ++ LP   E + G   C     +    
Sbjct: 332 QDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTV 391

Query: 384 SLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFD 443
            +P +  +  GG V     +N  +   EGV+CLA+ +  S  G S+ GN+ QQ   + +D
Sbjct: 392 QVPEISMQFDGG-VLNLGEQNILISPAEGVICLAMGS-SSQLGISIFGNIQQQETQVLYD 436

Query: 444 KEESRLGFTRRGCG 458
            +   + F    CG
Sbjct: 452 LQNLAVSFVPTQCG 436

BLAST of CSPI05G08550 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 8.7e-52
Identity = 138/410 (33.66%), Positives = 204/410 (49.76%), Query Frame = 0

Query: 53  QSLSSDTHRLSLLFSRPNPTLKSPLISGAST----GSGQYFVDIRLGTPPQSLLLVADTG 112
           Q L     R S    R    L  P  SG  T    G G+Y +++ +GTP Q    + DTG
Sbjct: 58  QLLERAIERGSRRLQRLEAMLNGP--SGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTG 117

Query: 113 SDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSP 172
           SDL+W +C  C  C  +  +  F P+ SSSFS   C    C+        L + T  ++ 
Sbjct: 118 SDLIWTQCQPCTQC-FNQSTPIFNPQGSSSFSTLPCSSQLCQA-------LSSPTCSNNF 177

Query: 173 CRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARG 232
           C++ Y Y DGS + G    ET T  S+S     +  ++FGCG    G      Q NGA G
Sbjct: 178 CQYTYGYGDGSETQGSMGTETLTFGSVS-----IPNITFGCGENNQG----FGQGNGA-G 237

Query: 233 VMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYT 292
           ++G+GRG +S  SQL      KFSYC+     S P  S L++G   +S+    A   + T
Sbjct: 238 LVGMGRGPLSLPSQLD---VTKFSYCMTPIGSSTP--SNLLLGSLANSV---TAGSPNTT 297

Query: 293 PLQINPLSPTFYYITIHSITIDGVKLPINPAVWEID-EQGNGGTVVDSGTTLTYLTKTAY 352
            +Q + + PTFYYIT++ +++   +LPI+P+ + ++   G GG ++DSGTTLTY    AY
Sbjct: 298 LIQSSQI-PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAY 357

Query: 353 EEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFL 412
           + V +    ++ LP     + GFDLC     +     +P       GG +   P  NYF+
Sbjct: 358 QSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDL-ELPSENYFI 417

Query: 413 ETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCG 458
               G++CLA+ +  S  G S+ GN+ QQ  L+ +D   S + F    CG
Sbjct: 418 SPSNGLICLAMGS--SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435

BLAST of CSPI05G08550 vs. ExPASy Swiss-Prot
Match: Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)

HSP 1 Score: 203.8 bits (517), Expect = 4.3e-51
Identity = 145/431 (33.64%), Positives = 207/431 (48.03%), Query Frame = 0

Query: 38  LKLPLLHK-----PPFSSPSQSLSSDTHRLSLLFSRPNPT--LKSPLISGASTGSGQYFV 97
           ++L L H+      P S     + +D  R SL+  + N T  +K  L SG   G+ QYF 
Sbjct: 49  VRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFT 108

Query: 98  DIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCR 157
           +IR+GTP +   +V DTGS+L WV C              F    S SF    C    C+
Sbjct: 109 EIRVGTPAKKFRVVVDTGSELTWVNCR--YRARGKDNRRVFRADESKSFKTVGCLTQTCK 168

Query: 158 LLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCG 217
           +       L       +PC + Y YADGS + G F+KET T+   +G    L G   GC 
Sbjct: 169 VDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGC- 228

Query: 218 FRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMI 277
                 S +G  F GA GV+GL     SF+S     +G KFSYCL+D+  +   +++L+ 
Sbjct: 229 ----SSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIF 288

Query: 278 GGGLHSLPLTNATKISY---TPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQG 337
           G        + +TK ++   TPL +  + P FY I +  I++    L I   VW  D   
Sbjct: 289 GS-------SRSTKTAFRRTTPLDLTRI-PPFYAINVIGISLGYDMLDIPSQVW--DATS 348

Query: 338 NGGTVVDSGTTLTYLTKTAYEEVLKSVRR-RVKLPNAAELTPGFDLCVNASGESRRPSLP 397
            GGT++DSGT+LT L   AY++V+  + R  V+L          + C + +       LP
Sbjct: 349 GGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLP 408

Query: 398 RLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGN-GFSVIGNLMQQGFLLEFDKE 457
           +L F L GGA F P  ++Y ++   GV CL    V +G    +VIGN+MQQ +L EFD  
Sbjct: 409 QLTFHLKGGARFEPHRKSYLVDAAPGVKCLGF--VSAGTPATNVIGNIMQQNYLWEFDLM 460

BLAST of CSPI05G08550 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 5.8e-48
Identity = 125/384 (32.55%), Positives = 198/384 (51.56%), Query Frame = 0

Query: 75  SPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPR 134
           S ++SG   GSG+YFV I +G+PP+   +V D+GSD+VWV+C  C+ C +      F P 
Sbjct: 118 SDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC-YKQSDPVFDPA 177

Query: 135 HSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKS 194
            S S++   C    C  + ++  H          CR+   Y DGS     ++K T  L++
Sbjct: 178 KSGSYTGVSCGSSVCDRIENSGCH-------SGGCRYEVMYGDGS-----YTKGTLALET 237

Query: 195 LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYC 254
           L+ ++  ++ ++ GCG R  G       F GA G++G+G GS+SF  QL  + G  F YC
Sbjct: 238 LTFAKTVVRNVAMGCGHRNRG------MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYC 297

Query: 255 LMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314
           L+  +     T  L+   G  +LP+      S+ PL  NP +P+FYY+ +  + + GV++
Sbjct: 298 LV--SRGTDSTGSLVF--GREALPV----GASWVPLVRNPRAPSFYYVGLKGLGVGGVRI 357

Query: 315 PINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVR-RRVKLPNAAELTPGFDLC 374
           P+   V+++ E G+GG V+D+GT +T L   AY       + +   LP A+ ++  FD C
Sbjct: 358 PLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FDTC 417

Query: 375 VNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE-GVMCLAIRAVESGNGFSVIGN 434
            + SG      +P + F    G V   P RN+ +  ++ G  C A  A  S  G S+IGN
Sbjct: 418 YDLSG-FVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAA--SPTGLSIIGN 470

Query: 435 LMQQGFLLEFDKEESRLGFTRRGC 457
           + Q+G  + FD     +GF    C
Sbjct: 478 IQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CSPI05G08550 vs. ExPASy TrEMBL
Match: A0A0A0KNH6 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G174650 PE=3 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 3.8e-268
Identity = 459/459 (100.00%), Positives = 459/459 (100.00%), Query Frame = 0

Query: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60
           MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH
Sbjct: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60

Query: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120
           RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR
Sbjct: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120

Query: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180
           NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL
Sbjct: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180

Query: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240
           SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS
Sbjct: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240

Query: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
           SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY
Sbjct: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300

Query: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
           YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL
Sbjct: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420
           PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CSPI05G08550 vs. ExPASy TrEMBL
Match: A0A1S3CSZ8 (aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103504612 PE=3 SV=1)

HSP 1 Score: 903.3 bits (2333), Expect = 4.2e-259
Identity = 446/459 (97.17%), Positives = 450/459 (98.04%), Query Frame = 0

Query: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60
           MP+LSISPFFLLI LF FFLTHL NPNATAVAA ADFLKLPLLHKPPFSSPSQSLSSDTH
Sbjct: 1   MPMLSISPFFLLIPLFFFFLTHLSNPNATAVAAAADFLKLPLLHKPPFSSPSQSLSSDTH 60

Query: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120
           RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR
Sbjct: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120

Query: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180
           NCSHHPPSSAF PRHSSSFSPFHCFDPHCRLLPHAP H CNHT LHSPCRFLYSYADGSL
Sbjct: 121 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHAPPHHCNHTLLHSPCRFLYSYADGSL 180

Query: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240
           SSGFFSKETTTLK+LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS
Sbjct: 181 SSGFFSKETTTLKTLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240

Query: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
           SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLP+ NATKISYTPLQINPLSPTFY
Sbjct: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPVNNATKISYTPLQINPLSPTFY 300

Query: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
           YITI+SITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL
Sbjct: 301 YITINSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420
           PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CSPI05G08550 vs. ExPASy TrEMBL
Match: A0A6J1HXS2 (aspartyl protease family protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111469001 PE=3 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 2.8e-226
Identity = 398/461 (86.33%), Positives = 419/461 (90.89%), Query Frame = 0

Query: 3   VLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRL 62
           +LS+S FF LILL  F L  L N      AA AD+LKLPLLHK PFSSPSQ+LSSDTHRL
Sbjct: 5   MLSVSTFFHLILLL-FSLADLLN-----AAAAADYLKLPLLHKNPFSSPSQALSSDTHRL 64

Query: 63  SLLFS----RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA 122
           SLLFS    RPNPTLKSPLISGASTGSGQYFVD+R+GTPPQSLLLVADTGSDLVWVKCSA
Sbjct: 65  SLLFSALRRRPNPTLKSPLISGASTGSGQYFVDLRIGTPPQSLLLVADTGSDLVWVKCSA 124

Query: 123 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADG 182
           CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAP HLCNHTRLHSPCRFLY+YADG
Sbjct: 125 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPPHLCNHTRLHSPCRFLYTYADG 184

Query: 183 SLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSIS 242
           S SSGFFSKETTTLK+LSGSE  LK LSFGCGFRISGPSVSGAQFNGARGVMGLGRG IS
Sbjct: 185 STSSGFFSKETTTLKTLSGSETRLKDLSFGCGFRISGPSVSGAQFNGARGVMGLGRGPIS 244

Query: 243 FSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPT 302
           FS+QLGRRFGNKFSYCLMDYTLSPPPTS+LMIGGGL SLP+TNATKISYTPL INPLSPT
Sbjct: 245 FSTQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNATKISYTPLLINPLSPT 304

Query: 303 FYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRV 362
           FYYI + SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AY+EVLK+VR+RV
Sbjct: 305 FYYIAVKSITVDGVKLPINPTVWAIDEQGNGGTVVDSGTTLTYLAEEAYKEVLKAVRQRV 364

Query: 363 KLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAI 422
           KLP AAELTPGFDLCVN S ES+RPSLPR+RFR+G GAVFAPP RNYFLET EGVMCLAI
Sbjct: 365 KLPAAAELTPGFDLCVNVSKESQRPSLPRVRFRVGNGAVFAPPARNYFLETVEGVMCLAI 424

Query: 423 RAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           RAVE GNGFSVIGNLMQQGFLLEFDKE SRLGF+RRGCGLP
Sbjct: 425 RAVEGGNGFSVIGNLMQQGFLLEFDKEASRLGFSRRGCGLP 459

BLAST of CSPI05G08550 vs. ExPASy TrEMBL
Match: A0A6J1ELQ4 (aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111435697 PE=3 SV=1)

HSP 1 Score: 793.1 bits (2047), Expect = 6.2e-226
Identity = 395/461 (85.68%), Positives = 419/461 (90.89%), Query Frame = 0

Query: 3   VLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRL 62
           +LS+SPFF LILL  F L  L N      AA AD+LKLPLLHK PFSSPSQ+LSSDTHRL
Sbjct: 5   MLSVSPFFHLILLL-FSLADLLN-----AAAAADYLKLPLLHKNPFSSPSQALSSDTHRL 64

Query: 63  SLLFS----RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA 122
           SLLFS    RPNPTLKSPLISGASTGSGQYFVD+R+GTPPQSLLLVADTGSDLVWVKCSA
Sbjct: 65  SLLFSALRRRPNPTLKSPLISGASTGSGQYFVDLRIGTPPQSLLLVADTGSDLVWVKCSA 124

Query: 123 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADG 182
           CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAP HLCNHTRLHSPCRFLY+YADG
Sbjct: 125 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPPHLCNHTRLHSPCRFLYTYADG 184

Query: 183 SLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSIS 242
           S SSGFFSKETTTLK+L+GSE  LK LSFGCGFRISGPSVSGAQFNGARGVMGLGRG IS
Sbjct: 185 STSSGFFSKETTTLKTLTGSETRLKDLSFGCGFRISGPSVSGAQFNGARGVMGLGRGPIS 244

Query: 243 FSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPT 302
           FS+QLGRRFGNKFSYCLMDYTLSPPPTS+LMIGGGL  LP+TNATKISYTPL INPLSPT
Sbjct: 245 FSTQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGGLRRLPVTNATKISYTPLLINPLSPT 304

Query: 303 FYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRV 362
           FYYI + SIT+DGVKLPINP +W IDEQGNGGTVVDSGTTLTYL + AY+EVLK++R+RV
Sbjct: 305 FYYIAVKSITVDGVKLPINPTLWAIDEQGNGGTVVDSGTTLTYLAEEAYKEVLKAMRQRV 364

Query: 363 KLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAI 422
           KLP AAELTPGFDLCVN S ES+RPSLPR+RF+LG GAVF PP RNYFLETEEGVMCLAI
Sbjct: 365 KLPAAAELTPGFDLCVNVSNESQRPSLPRVRFQLGNGAVFPPPARNYFLETEEGVMCLAI 424

Query: 423 RAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           RAVE GNGFSVIGNLMQQGFLLEFDKE SRLGF+RRGCGLP
Sbjct: 425 RAVEGGNGFSVIGNLMQQGFLLEFDKEASRLGFSRRGCGLP 459

BLAST of CSPI05G08550 vs. ExPASy TrEMBL
Match: A0A6J1EPZ3 (aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111436585 PE=3 SV=1)

HSP 1 Score: 735.7 bits (1898), Expect = 1.2e-208
Identity = 363/458 (79.26%), Positives = 395/458 (86.24%), Query Frame = 0

Query: 6   ISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLL 65
           +  F LL+LL    L  L N      A P+ +LK PLLH  PFSSPSQ+LSSDTHRLSLL
Sbjct: 4   VPQFLLLLLLLLSSLADLSN------AIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLL 63

Query: 66  FS--RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCS 125
           FS  R +PTLKSPLISGASTGSGQYFV++ LGTPPQSLLLV DTGSDLVWVKCS CRNCS
Sbjct: 64  FSAHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCS 123

Query: 126 HHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSG 185
           HHPPSSAF PRHSSSFSPFHCFDPHCRLLPH P H CNHT LHSPC FLYSYAD SLSSG
Sbjct: 124 HHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSG 183

Query: 186 FFSKETTTLKSLSG--SEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSS 245
           FFSK+ TT  + SG  ++  L  LSFGCGFRISGPSVSGA+F GARGVMGLGRG ISFSS
Sbjct: 184 FFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSS 243

Query: 246 QLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYY 305
           QLG RFGN FSYCLMDYTLSPPPTS+LMIGGGL SLP+TNA+KISYTPLQINPLSPTFYY
Sbjct: 244 QLGHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYY 303

Query: 306 ITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLP 365
           I + SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AYEEVLK++RRRVKLP
Sbjct: 304 IVVKSITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLP 363

Query: 366 NAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV 425
            A +L+PGFDLCVNAS ESR  SLP++RFR+GGG VFAPP RNYF+ETEEGVMCLAIR V
Sbjct: 364 RALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPV 423

Query: 426 ESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           +SGNGFSVIGNLMQQGFLLEFD+E+SR+GF+RRGCGLP
Sbjct: 424 DSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 455

BLAST of CSPI05G08550 vs. NCBI nr
Match: XP_004143702.1 (aspartyl protease family protein 2 [Cucumis sativus] >KGN50439.1 hypothetical protein Csa_000435 [Cucumis sativus])

HSP 1 Score: 933.3 bits (2411), Expect = 7.9e-268
Identity = 459/459 (100.00%), Positives = 459/459 (100.00%), Query Frame = 0

Query: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60
           MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH
Sbjct: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60

Query: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120
           RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR
Sbjct: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120

Query: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180
           NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL
Sbjct: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180

Query: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240
           SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS
Sbjct: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240

Query: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
           SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY
Sbjct: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300

Query: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
           YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL
Sbjct: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420
           PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CSPI05G08550 vs. NCBI nr
Match: XP_008467208.1 (PREDICTED: aspartyl protease family protein 2 [Cucumis melo])

HSP 1 Score: 903.3 bits (2333), Expect = 8.7e-259
Identity = 446/459 (97.17%), Positives = 450/459 (98.04%), Query Frame = 0

Query: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60
           MP+LSISPFFLLI LF FFLTHL NPNATAVAA ADFLKLPLLHKPPFSSPSQSLSSDTH
Sbjct: 1   MPMLSISPFFLLIPLFFFFLTHLSNPNATAVAAAADFLKLPLLHKPPFSSPSQSLSSDTH 60

Query: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120
           RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR
Sbjct: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120

Query: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180
           NCSHHPPSSAF PRHSSSFSPFHCFDPHCRLLPHAP H CNHT LHSPCRFLYSYADGSL
Sbjct: 121 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHAPPHHCNHTLLHSPCRFLYSYADGSL 180

Query: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240
           SSGFFSKETTTLK+LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS
Sbjct: 181 SSGFFSKETTTLKTLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240

Query: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
           SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLP+ NATKISYTPLQINPLSPTFY
Sbjct: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPVNNATKISYTPLQINPLSPTFY 300

Query: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
           YITI+SITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL
Sbjct: 301 YITINSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420
           PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CSPI05G08550 vs. NCBI nr
Match: XP_038907006.1 (aspartyl protease family protein 2 [Benincasa hispida])

HSP 1 Score: 829.7 bits (2142), Expect = 1.2e-236
Identity = 411/457 (89.93%), Positives = 431/457 (94.31%), Query Frame = 0

Query: 3   VLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRL 62
           +L ISPFF  I LF FFLT   N NATA    AD+LKLPLLHKPPFSSPSQ+LSSDTHRL
Sbjct: 1   MLPISPFFHFIFLF-FFLT---NFNATA----ADYLKLPLLHKPPFSSPSQALSSDTHRL 60

Query: 63  SLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNC 122
           SLLFSRPNPTLKSPLISGASTGSGQYFVD+RLGTPPQSLLLVADTGSDLVWVKCSACRNC
Sbjct: 61  SLLFSRPNPTLKSPLISGASTGSGQYFVDLRLGTPPQSLLLVADTGSDLVWVKCSACRNC 120

Query: 123 SHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSS 182
           SHHPPS+AFLPRHSSSFSPFHCFDPHCRLLPHAP HLCNHTR HSPCRFLYSYADGSLSS
Sbjct: 121 SHHPPSTAFLPRHSSSFSPFHCFDPHCRLLPHAPPHLCNHTRFHSPCRFLYSYADGSLSS 180

Query: 183 GFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQ 242
           GFFSKETTTLK+LSGSEIHLKGLSFGCGFRISGPSVSGAQF+GARGVMGLGRGSISFSSQ
Sbjct: 181 GFFSKETTTLKTLSGSEIHLKGLSFGCGFRISGPSVSGAQFSGARGVMGLGRGSISFSSQ 240

Query: 243 LGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYI 302
           LGRRFGNKFSYCLMDYTLSPPPTS+LMIGGG  SLP+TNATKISYTPLQINPLSPTFYYI
Sbjct: 241 LGRRFGNKFSYCLMDYTLSPPPTSYLMIGGGHRSLPVTNATKISYTPLQINPLSPTFYYI 300

Query: 303 TIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPN 362
            IHSIT+DGVKLPINPAVW +D+QGNGGTVVDSGTTLTYL K AY+EVLK+VRRRVKLP+
Sbjct: 301 AIHSITVDGVKLPINPAVWAMDKQGNGGTVVDSGTTLTYLAKAAYDEVLKAVRRRVKLPS 360

Query: 363 AAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVE 422
           A+ELTPGFDLCVNAS  SRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE VMCLAIR V+
Sbjct: 361 ASELTPGFDLCVNASDSSRRPSLPRLRFRLGGGAVFAPPPRNYFLETEERVMCLAIRPVD 420

Query: 423 SGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           SGNGFSVIGNLMQQGFLLEFDK+ +RLGF+RRGCGLP
Sbjct: 421 SGNGFSVIGNLMQQGFLLEFDKDAARLGFSRRGCGLP 449

BLAST of CSPI05G08550 vs. NCBI nr
Match: XP_023549997.1 (aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 797.7 bits (2059), Expect = 5.2e-227
Identity = 398/461 (86.33%), Positives = 420/461 (91.11%), Query Frame = 0

Query: 3   VLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRL 62
           +LS+SPFF  ILL  F L  L N      AA AD+LKLPLLHK PFSSPSQ+LSSDTHRL
Sbjct: 5   MLSVSPFFHFILLL-FSLADLLN-----AAAAADYLKLPLLHKNPFSSPSQALSSDTHRL 64

Query: 63  SLLFS----RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA 122
           SLLFS    RPNPTLKSPLISGASTGSGQYFVD+R+GTPPQSLLLVADTGSDLVWVKCSA
Sbjct: 65  SLLFSALRRRPNPTLKSPLISGASTGSGQYFVDLRIGTPPQSLLLVADTGSDLVWVKCSA 124

Query: 123 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADG 182
           CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAP HLCNHTRLHSPCRFLY+YADG
Sbjct: 125 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPPHLCNHTRLHSPCRFLYTYADG 184

Query: 183 SLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSIS 242
           S SSGFFSKETTTLK+LSGSE  LK LSFGCGFRISGPSVSGAQFNGARGVMGLGRG IS
Sbjct: 185 STSSGFFSKETTTLKTLSGSETRLKDLSFGCGFRISGPSVSGAQFNGARGVMGLGRGPIS 244

Query: 243 FSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPT 302
           FS+QLGRRFGNKFSYCLMDYTLSPPPTS+LMIGGGL SLP+TNATKISYTPL INPLSPT
Sbjct: 245 FSTQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNATKISYTPLLINPLSPT 304

Query: 303 FYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRV 362
           FYYI + SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AY+EVLK+VR+RV
Sbjct: 305 FYYIAVKSITVDGVKLPINPTVWAIDEQGNGGTVVDSGTTLTYLAEEAYKEVLKAVRQRV 364

Query: 363 KLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAI 422
           KLP AAELTPGFDLCVN S ES+RPSLPR+RF+LG GAVFAPP RNYFLETEEGVMCL+I
Sbjct: 365 KLPAAAELTPGFDLCVNVSNESQRPSLPRVRFQLGNGAVFAPPARNYFLETEEGVMCLSI 424

Query: 423 RAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           RAVE GNGFSVIGNLMQQGFLLEFDKE SRLGF+RRGCGLP
Sbjct: 425 RAVEGGNGFSVIGNLMQQGFLLEFDKEASRLGFSRRGCGLP 459

BLAST of CSPI05G08550 vs. NCBI nr
Match: XP_022969957.1 (aspartyl protease family protein 2 [Cucurbita maxima])

HSP 1 Score: 794.3 bits (2050), Expect = 5.7e-226
Identity = 398/461 (86.33%), Positives = 419/461 (90.89%), Query Frame = 0

Query: 3   VLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRL 62
           +LS+S FF LILL  F L  L N      AA AD+LKLPLLHK PFSSPSQ+LSSDTHRL
Sbjct: 5   MLSVSTFFHLILLL-FSLADLLN-----AAAAADYLKLPLLHKNPFSSPSQALSSDTHRL 64

Query: 63  SLLFS----RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA 122
           SLLFS    RPNPTLKSPLISGASTGSGQYFVD+R+GTPPQSLLLVADTGSDLVWVKCSA
Sbjct: 65  SLLFSALRRRPNPTLKSPLISGASTGSGQYFVDLRIGTPPQSLLLVADTGSDLVWVKCSA 124

Query: 123 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADG 182
           CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAP HLCNHTRLHSPCRFLY+YADG
Sbjct: 125 CRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPPHLCNHTRLHSPCRFLYTYADG 184

Query: 183 SLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSIS 242
           S SSGFFSKETTTLK+LSGSE  LK LSFGCGFRISGPSVSGAQFNGARGVMGLGRG IS
Sbjct: 185 STSSGFFSKETTTLKTLSGSETRLKDLSFGCGFRISGPSVSGAQFNGARGVMGLGRGPIS 244

Query: 243 FSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPT 302
           FS+QLGRRFGNKFSYCLMDYTLSPPPTS+LMIGGGL SLP+TNATKISYTPL INPLSPT
Sbjct: 245 FSTQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNATKISYTPLLINPLSPT 304

Query: 303 FYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRV 362
           FYYI + SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AY+EVLK+VR+RV
Sbjct: 305 FYYIAVKSITVDGVKLPINPTVWAIDEQGNGGTVVDSGTTLTYLAEEAYKEVLKAVRQRV 364

Query: 363 KLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAI 422
           KLP AAELTPGFDLCVN S ES+RPSLPR+RFR+G GAVFAPP RNYFLET EGVMCLAI
Sbjct: 365 KLPAAAELTPGFDLCVNVSKESQRPSLPRVRFRVGNGAVFAPPARNYFLETVEGVMCLAI 424

Query: 423 RAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           RAVE GNGFSVIGNLMQQGFLLEFDKE SRLGF+RRGCGLP
Sbjct: 425 RAVEGGNGFSVIGNLMQQGFLLEFDKEASRLGFSRRGCGLP 459

BLAST of CSPI05G08550 vs. TAIR 10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 596.7 bits (1537), Expect = 1.6e-170
Identity = 296/456 (64.91%), Positives = 355/456 (77.85%), Query Frame = 0

Query: 10  FLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSR- 69
           F L    S FL  LP  N  AV+    +LKLPLL K PF SP+Q+L+ DT RL  L  R 
Sbjct: 6   FFLCSFLSLFL--LPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRR 65

Query: 70  -PNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 129
            P P +KSP++SGA++GSGQYFVD+R+G PPQSLLL+ADTGSDLVWVKCSACRNCSHH P
Sbjct: 66  KPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSP 125

Query: 130 SSAFLPRHSSSFSPFHCFDPHCRLLP---HAPHHLCNHTRLHSPCRFLYSYADGSLSSGF 189
           ++ F PRHSS+FSP HC+DP CRL+P    AP  +CNHTR+HS C + Y YADGSL+SG 
Sbjct: 126 ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAP--ICNHTRIHSTCHYEYGYADGSLTSGL 185

Query: 190 FSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLG 249
           F++ETT+LK+ SG E  LK ++FGCGFRISG SVSG  FNGA GVMGLGRG ISF+SQLG
Sbjct: 186 FARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLG 245

Query: 250 RRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITI 309
           RRFGNKFSYCLMDYTLSPPPTS+L+IG G   +     +K+ +TPL  NPLSPTFYY+ +
Sbjct: 246 RRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGI-----SKLFFTPLLTNPLSPTFYYVKL 305

Query: 310 HSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAA 369
            S+ ++G KL I+P++WEID+ GNGGTVVDSGTTL +L + AY  V+ +VRRRVKLP A 
Sbjct: 306 KSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD 365

Query: 370 ELTPGFDLCVNASGESR-RPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVES 429
            LTPGFDLCVN SG ++    LPRL+F   GGAVF PPPRNYF+ETEE + CLAI++V+ 
Sbjct: 366 ALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDP 425

Query: 430 GNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
             GFSVIGNLMQQGFL EFD++ SRLGF+RRGC LP
Sbjct: 426 KVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of CSPI05G08550 vs. TAIR 10
Match: AT3G25700.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 407.9 bits (1047), Expect = 1.1e-113
Identity = 227/456 (49.78%), Positives = 270/456 (59.21%), Query Frame = 0

Query: 10  FLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSR- 69
           F L    S FL  LP  N  AV+    +LKLPLL K PF SP+Q+L+ DT RL  L  R 
Sbjct: 6   FFLCSFLSLFL--LPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRR 65

Query: 70  -PNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 129
            P P +KSP++SGA++GSGQYFVD+R+G PPQSLLL+ADTGSDLVWVKCSACRNCSHH P
Sbjct: 66  KPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSP 125

Query: 130 SSAFLPRHSSSFSPFHCFDPHCRLLP---HAPHHLCNHTRLHSPCRFLYSYADGSLSSGF 189
           ++ F PRHSS+FSP HC+DP CRL+P    AP  +CNHTR+HS C + Y YADGSL+SG 
Sbjct: 126 ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAP--ICNHTRIHSTCHYEYGYADGSLTSGL 185

Query: 190 FSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLG 249
           F++ETT+LK+ SG E  LK ++FGCGFRISG SVS                         
Sbjct: 186 FARETTSLKTSSGKEARLKSVAFGCGFRISGQSVS------------------------- 245

Query: 250 RRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITI 309
                                                                       
Sbjct: 246 ------------------------------------------------------------ 305

Query: 310 HSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAA 369
                                 GNGGTVVDSGTTL +L + AY  V+ +VRRRVKLP A 
Sbjct: 306 ----------------------GNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD 350

Query: 370 ELTPGFDLCVNASGESR-RPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVES 429
            LTPGFDLCVN SG ++    LPRL+F   GGAVF PPPRNYF+ETEE + CLAI++V+ 
Sbjct: 366 ALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDP 350

Query: 430 GNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
             GFSVIGNLMQQGFL EFD++ SRLGF+RRGC LP
Sbjct: 426 KVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 350

BLAST of CSPI05G08550 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 232.3 bits (591), Expect = 8.1e-61
Identity = 148/391 (37.85%), Positives = 204/391 (52.17%), Query Frame = 0

Query: 73  LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL 132
           L + L SG + GSG+YF+D+ +GTPP+   L+ DTGSDL W++C  C +C  H     + 
Sbjct: 145 LIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDC-FHQNGMFYD 204

Query: 133 PRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTL 192
           P+ S+SF    C DP C L+  +P         +  C + Y Y D S ++G F+ ET T+
Sbjct: 205 PKTSASFKNITCNDPRCSLI-SSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTV 264

Query: 193 KSLS----GSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFG 252
              +     SE  +  + FGCG    G       F+GA G++GLGRG +SFSSQL   +G
Sbjct: 265 NLTTTEGGSSEYKVGNMMFGCGHWNRG------LFSGASGLLGLGRGPLSFSSQLQSLYG 324

Query: 253 NKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSIT 312
           + FSYCL+D   +   +S L+ G     L  TN    S+   + N +  TFYYI I SI 
Sbjct: 325 HSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVE-TFYYIQIKSIL 384

Query: 313 IDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK--LPNAAEL 372
           + G  L I    W I   G+GGT++DSGTTL+Y  + AYE +      ++K   P   + 
Sbjct: 385 VGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF 444

Query: 373 TPGFDLCVNASG-ESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGN 432
            P  D C N SG E     LP L      G V+  P  N F+   E ++CLAI       
Sbjct: 445 -PVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKST 504

Query: 433 GFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
            FS+IGN  QQ F + +D + SRLGFT   C
Sbjct: 505 -FSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

BLAST of CSPI05G08550 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 228.8 bits (582), Expect = 8.9e-60
Identity = 154/460 (33.48%), Positives = 233/460 (50.65%), Query Frame = 0

Query: 14  LLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS------ 73
           LL S F +   + +++++    D +     +K P    S  L  D+ R+  + +      
Sbjct: 55  LLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIP 114

Query: 74  ------RPNP-TLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 133
                  P P    S ++SG S GSG+YF  + +GTP + + +V DTGSD+VW++C+ CR
Sbjct: 115 GRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCR 174

Query: 134 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 193
            C +      F PR S +++   C  PHCR L  A    CN  R    C +  SY DGS 
Sbjct: 175 RC-YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG---CNTRR--KTCLYQVSYGDGSF 234

Query: 194 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 253
           + G FS ET T +        +KG++ GCG    G       F GA G++GLG+G +SF 
Sbjct: 235 TVGDFSTETLTFR-----RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLSFP 294

Query: 254 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 313
            Q G RF  KFSYCL+D + S  P+S +     +  +         +TPL  NP   TFY
Sbjct: 295 GQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDTFY 354

Query: 314 YITIHSITIDGVKLP-INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK 373
           Y+ +  I++ G ++P +  +++++D+ GNGG ++DSGT++T L + AY  +  + R   K
Sbjct: 355 YVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAK 414

Query: 374 LPNAAELTPGFDLCVNAS--GESRRPSLPRLRFRLGGGAVFAPPPRNYFLETE-EGVMCL 433
               A     FD C + S   E + P++  L FR   GA  + P  NY +  +  G  C 
Sbjct: 415 TLKRAPDFSLFDTCFDLSNMNEVKVPTVV-LHFR---GADVSLPATNYLIPVDTNGKFCF 474

Query: 434 AIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           A     +  G S+IGN+ QQGF + +D   SR+GF   GC
Sbjct: 475 AFAG--TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CSPI05G08550 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 222.6 bits (566), Expect = 6.4e-58
Identity = 154/454 (33.92%), Positives = 232/454 (51.10%), Query Frame = 0

Query: 17  SFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSRPNPTLKSP 76
           S  L+H+   ++ + A+PAD   L L      S   +S++S    L+ + +  N T ++P
Sbjct: 62  SVHLSHVDALSSFSDASPADLFNLRLQRD---SLRVKSITS----LAAVSTGRNATKRTP 121

Query: 77  ---------LISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 136
                    +ISG S GSG+YF+ + +GTP  ++ +V DTGSD+VW++CS C+ C ++  
Sbjct: 122 RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKAC-YNQT 181

Query: 137 SSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSK 196
            + F P+ S +F+   C    CR L  +   +   TR    C +  SY DGS + G FS 
Sbjct: 182 DAIFDPKKSKTFATVPCGSRLCRRLDDSSECV---TRRSKTCLYQVSYGDGSFTEGDFST 241

Query: 197 ETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRF 256
           ET T        + L     GCG    G       F GA G++GLGRG +SF SQ   R+
Sbjct: 242 ETLTFHGARVDHVPL-----GCGHDNEG------LFVGAAGLLGLGRGGLSFPSQTKNRY 301

Query: 257 GNKFSYCLMDYT----LSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYIT 316
             KFSYCL+D T     S PP++ +    G  ++P T+     +TPL  NP   TFYY+ 
Sbjct: 302 NGKFSYCLVDRTSSGSSSKPPSTIVF---GNAAVPKTSV----FTPLLTNPKLDTFYYLQ 361

Query: 317 IHSITIDGVKLP-INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPN 376
           +  I++ G ++P ++ + +++D  GNGG ++DSGT++T LT+ AY  +  + R       
Sbjct: 362 LLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLK 421

Query: 377 AAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVE 436
            A     FD C + SG +    +P + F  GGG V  P          EG  C A     
Sbjct: 422 RAPSYSLFDTCFDLSGMT-TVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAG-- 481

Query: 437 SGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           +    S+IGN+ QQGF + +D   SR+GF  R C
Sbjct: 482 TMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
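
The percentage in each HSP header above is the number of identical positions divided by the aligned length. The following is a minimal Python sketch, not part of the database output, that parses one pair of header lines and recomputes that percentage. The function name `parse_hsp` and the regular expressions are hypothetical and assume the line layout shown on this page.

```python
import re

# Hypothetical patterns, written for the header layout used on this page.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)"
)
IDENT_RE = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \((?P<pct>[\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Extract the HSP statistics and recompute percent identity."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP header lines")
    ident, length = int(i["ident"]), int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "identical": ident,
        "aligned_length": length,
        "reported_pct": float(i["pct"]),
        # identical positions / aligned length, rounded as on the page
        "recomputed_pct": round(100.0 * ident / length, 2),
    }


# Header values copied from the XP_023549997.1 hit above.
hsp = parse_hsp(
    "HSP 1 Score: 797.7 bits (2059), Expect = 5.2e-227",
    "Identity = 398/461 (86.33%)",
)
print(hsp["reported_pct"], hsp["recomputed_pct"])
```

Comparing `reported_pct` with `recomputed_pct` is a quick consistency check when header lines are copied out of a page like this one.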

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q9LNJ3          1.3e-58   33.48         Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q766C2          7.9e-53   33.96         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q766C3          8.7e-52   33.66         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LTW4          4.3e-51   33.64         Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE...
Q9LHE3          5.8e-48   32.55         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...

Match Name      E-value   Identity (%)  Description
A0A0A0KNH6      3.8e-268  100.00        Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G17465...
A0A1S3CSZ8      4.2e-259  97.17         aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103504612 PE=3 ...
A0A6J1HXS2      2.8e-226  86.33         aspartyl protease family protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111469001 P...
A0A6J1ELQ4      6.2e-226  85.68         aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A6J1EPZ3      1.2e-208  79.26         aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC1114...

Match Name      E-value   Identity (%)  Description
XP_004143702.1  7.9e-268  100.00        aspartyl protease family protein 2 [Cucumis sativus] >KGN50439.1 hypothetical pr...
XP_008467208.1  8.7e-259  97.17         PREDICTED: aspartyl protease family protein 2 [Cucumis melo]
XP_038907006.1  1.2e-236  89.93         aspartyl protease family protein 2 [Benincasa hispida]
XP_023549997.1  5.2e-227  86.33         aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo]
XP_022969957.1  5.7e-226  86.33         aspartyl protease family protein 2 [Cucurbita maxima]

Match Name      E-value   Identity (%)  Description
AT3G25700.1     1.6e-170  64.91         Eukaryotic aspartyl protease family protein
AT3G25700.2     1.1e-113  49.78         Eukaryotic aspartyl protease family protein
AT2G42980.1     8.1e-61   37.85         Eukaryotic aspartyl protease family protein
AT1G01300.1     8.9e-60   33.48         Eukaryotic aspartyl protease family protein
AT3G61820.1     6.4e-58   33.92         Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term     Source Description            Alignment
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792         PEPSIN                        coord: 94..114, score: 49.53; coord: 428..443, score: 18.99; coord: 331..342, score: 41.24
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                        coord: 88..272, e-value: 2.2E-48, score: 164.8
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                coord: 67..271, e-value: 1.1E-45, score: 158.0
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                coord: 277..457, e-value: 3.2E-50, score: 172.4
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                coord: 82..456
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                        coord: 299..452, e-value: 1.7E-36, score: 125.4
None       No IPR available                       PANTHER      PTHR47967       OS07G0603500 PROTEIN-RELATED  coord: 12..457
None       No IPR available                       PANTHER      PTHR47967:SF72  BNAA02G27600D PROTEIN         coord: 12..457
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                  coord: 88..452, score: 40.986813
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant           coord: 87..456, e-value: 1.40021E-84, score: 258.732
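
Each `coord:` entry above gives a start and end position on the protein. The following is a small Python sketch, not part of the annotation pipeline, that turns those ranges into span lengths. It assumes the coordinates are 1-based and inclusive, which is the usual convention for InterProScan output but is not stated on this page. The helper name `span_length` is hypothetical.

```python
import re

# Matches entries such as "coord: 88..272".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    m = COORD_RE.search(alignment)
    if m is None:
        raise ValueError("no coord range found")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# Coordinate strings copied from the InterPro annotations above.
hits = {
    "PF14543 TAXi_N": "coord: 88..272",
    "PF14541 TAXi_C": "coord: 299..452",
    "PS51767 PEPTIDASE_A1": "coord: 88..452",
}
lengths = {name: span_length(coord) for name, coord in hits.items()}
for name, n in lengths.items():
    print(f"{name}: {n} residues")
```

Under that assumption, the two Pfam hits (TAXi_N and TAXi_C) together fall inside the single PROSITE Peptidase A1 span of 88..452.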

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI05G08550.1  CSPI05G08550.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005576      extracellular region
molecular_function  GO:0004190      aspartic-type endopeptidase activity