CSPI05G08550.1 (mRNA) Wild cucumber (PI 183967)

NameCSPI05G08550.1
TypemRNA
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionEukaryotic aspartyl protease family protein
LocationChr5 : 7262823 .. 7264646 (+)
Sequence length1380
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATGAAAAATGAGATTGGATTGGGCCCAAAAAGAGAGCCCAACAGTTGAAAGAAGAAGAAGCAAAGTTGTTCCCTCGTGTGATCCGTTTTCATAGTCTTACAGATTTTGATTTGAGTTCGAGTGTTCCTTCCAAAACAACAATGCCAGTCCTCTCCATTTCCCCATTCTTCCTTCTCATTCTTCTCTTCTCCTTCTTTCTCACACATCTCCCCAACCCCAATGCCACCGCCGTCGCTGCCCCCGCCGACTTCCTGAAGCTCCCCCTTCTTCACAAACCCCCCTTCTCCTCCCCTTCCCAATCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCTCGCCCCAACCCCACTCTCAAATCCCCTCTCATCTCCGGCGCTTCCACCGGTTCCGGCCAATACTTCGTCGACATCCGCCTCGGTACTCCTCCCCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTTAAATGCTCCGCCTGCCGCAACTGCTCTCACCATCCTCCTTCCTCCGCCTTCCTCCCCCGCCATTCCTCCTCCTTCTCCCCTTTCCATTGCTTCGACCCCCACTGCCGTCTCCTCCCCCACGCTCCTCACCATCTCTGTAACCACACGCGCCTCCACTCCCCTTGTCGCTTCCTCTACTCCTATGCCGATGGCTCCCTCTCCTCCGGCTTCTTCTCCAAAGAAACCACCACATTGAAGTCGCTCTCCGGGTCCGAAATCCATCTTAAAGGCCTCTCGTTCGGCTGCGGATTTCGGATCTCCGGTCCCAGCGTTTCGGGGGCTCAGTTCAATGGTGCACGTGGCGTCATGGGATTGGGTAGAGGCTCCATTTCCTTCTCTTCTCAACTCGGCCGCCGATTCGGCAACAAATTTTCTTACTGTCTTATGGATTACACTCTCTCTCCGCCGCCTACCAGCTTCTTAATGATCGGCGGCGGCCTCCACAGCCTCCCTCTCACCAATGCCACAAAAATCAGCTATACCCCTTTGCAGATTAACCCTCTTTCCCCCACATTCTACTACATTACCATCCACAGCATCACCATCGACGGCGTGAAATTACCCATCAACCCCGCCGTTTGGGAAATCGACGAACAGGGCAATGGCGGCACGGTGGTGGATTCAGGGACAACGCTAACCTACCTAACGAAGACAGCGTACGAGGAGGTGCTGAAGTCAGTAAGACGGCGAGTGAAACTACCAAATGCTGCAGAGTTGACACCGGGATTCGATCTATGCGTGAATGCGTCGGGAGAGTCGCGGCGGCCGAGTCTGCCGCGACTGAGATTCCGACTGGGAGGTGGGGCGGTGTTTGCTCCACCGCCGAGGAACTATTTTCTGGAAACAGAGGAGGGAGTGATGTGTTTGGCGATCCGAGCGGTGGAATCGGGAAATGGGTTTTCGGTGATCGGAAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAAGGAGGAATCGAGGCTGGGTTTTACAAGGCGGGGATGTGGGCTTCCATGAACGAACATCATCAACTTTTGGAACTTTGTTTCTTTATTTTTGTTAACTCGACTCACTGAGTTTGGAACTCGCAATCAATTAATTTTCGATTTGGTAAAATTATTATTATTATCATTATCATTATTATTATTATTATTAAAAAAGAGGTTTTTAAATTTTTAATAATAATAAGTTTGTACTTTGGATTGGATTCTCTTTTGTATACTACAAAAACTCATTGATTTTTGGACATTATGTTTTATTTGTATCACACCTTCCTTTTCTTTCCCCCCAACCTTACTTATATTTATATAAATCAATCCA

mRNA sequence

ATGCCAGTCCTCTCCATTTCCCCATTCTTCCTTCTCATTCTTCTCTTCTCCTTCTTTCTCACACATCTCCCCAACCCCAATGCCACCGCCGTCGCTGCCCCCGCCGACTTCCTGAAGCTCCCCCTTCTTCACAAACCCCCCTTCTCCTCCCCTTCCCAATCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCTCGCCCCAACCCCACTCTCAAATCCCCTCTCATCTCCGGCGCTTCCACCGGTTCCGGCCAATACTTCGTCGACATCCGCCTCGGTACTCCTCCCCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTTAAATGCTCCGCCTGCCGCAACTGCTCTCACCATCCTCCTTCCTCCGCCTTCCTCCCCCGCCATTCCTCCTCCTTCTCCCCTTTCCATTGCTTCGACCCCCACTGCCGTCTCCTCCCCCACGCTCCTCACCATCTCTGTAACCACACGCGCCTCCACTCCCCTTGTCGCTTCCTCTACTCCTATGCCGATGGCTCCCTCTCCTCCGGCTTCTTCTCCAAAGAAACCACCACATTGAAGTCGCTCTCCGGGTCCGAAATCCATCTTAAAGGCCTCTCGTTCGGCTGCGGATTTCGGATCTCCGGTCCCAGCGTTTCGGGGGCTCAGTTCAATGGTGCACGTGGCGTCATGGGATTGGGTAGAGGCTCCATTTCCTTCTCTTCTCAACTCGGCCGCCGATTCGGCAACAAATTTTCTTACTGTCTTATGGATTACACTCTCTCTCCGCCGCCTACCAGCTTCTTAATGATCGGCGGCGGCCTCCACAGCCTCCCTCTCACCAATGCCACAAAAATCAGCTATACCCCTTTGCAGATTAACCCTCTTTCCCCCACATTCTACTACATTACCATCCACAGCATCACCATCGACGGCGTGAAATTACCCATCAACCCCGCCGTTTGGGAAATCGACGAACAGGGCAATGGCGGCACGGTGGTGGATTCAGGGACAACGCTAACCTACCTAACGAAGACAGCGTACGAGGAGGTGCTGAAGTCAGTAAGACGGCGAGTGAAACTACCAAATGCTGCAGAGTTGACACCGGGATTCGATCTATGCGTGAATGCGTCGGGAGAGTCGCGGCGGCCGAGTCTGCCGCGACTGAGATTCCGACTGGGAGGTGGGGCGGTGTTTGCTCCACCGCCGAGGAACTATTTTCTGGAAACAGAGGAGGGAGTGATGTGTTTGGCGATCCGAGCGGTGGAATCGGGAAATGGGTTTTCGGTGATCGGAAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAAGGAGGAATCGAGGCTGGGTTTTACAAGGCGGGGATGTGGGCTTCCATGA

Coding sequence (CDS)

ATGCCAGTCCTCTCCATTTCCCCATTCTTCCTTCTCATTCTTCTCTTCTCCTTCTTTCTCACACATCTCCCCAACCCCAATGCCACCGCCGTCGCTGCCCCCGCCGACTTCCTGAAGCTCCCCCTTCTTCACAAACCCCCCTTCTCCTCCCCTTCCCAATCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCTCGCCCCAACCCCACTCTCAAATCCCCTCTCATCTCCGGCGCTTCCACCGGTTCCGGCCAATACTTCGTCGACATCCGCCTCGGTACTCCTCCCCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTTAAATGCTCCGCCTGCCGCAACTGCTCTCACCATCCTCCTTCCTCCGCCTTCCTCCCCCGCCATTCCTCCTCCTTCTCCCCTTTCCATTGCTTCGACCCCCACTGCCGTCTCCTCCCCCACGCTCCTCACCATCTCTGTAACCACACGCGCCTCCACTCCCCTTGTCGCTTCCTCTACTCCTATGCCGATGGCTCCCTCTCCTCCGGCTTCTTCTCCAAAGAAACCACCACATTGAAGTCGCTCTCCGGGTCCGAAATCCATCTTAAAGGCCTCTCGTTCGGCTGCGGATTTCGGATCTCCGGTCCCAGCGTTTCGGGGGCTCAGTTCAATGGTGCACGTGGCGTCATGGGATTGGGTAGAGGCTCCATTTCCTTCTCTTCTCAACTCGGCCGCCGATTCGGCAACAAATTTTCTTACTGTCTTATGGATTACACTCTCTCTCCGCCGCCTACCAGCTTCTTAATGATCGGCGGCGGCCTCCACAGCCTCCCTCTCACCAATGCCACAAAAATCAGCTATACCCCTTTGCAGATTAACCCTCTTTCCCCCACATTCTACTACATTACCATCCACAGCATCACCATCGACGGCGTGAAATTACCCATCAACCCCGCCGTTTGGGAAATCGACGAACAGGGCAATGGCGGCACGGTGGTGGATTCAGGGACAACGCTAACCTACCTAACGAAGACAGCGTACGAGGAGGTGCTGAAGTCAGTAAGACGGCGAGTGAAACTACCAAATGCTGCAGAGTTGACACCGGGATTCGATCTATGCGTGAATGCGTCGGGAGAGTCGCGGCGGCCGAGTCTGCCGCGACTGAGATTCCGACTGGGAGGTGGGGCGGTGTTTGCTCCACCGCCGAGGAACTATTTTCTGGAAACAGAGGAGGGAGTGATGTGTTTGGCGATCCGAGCGGTGGAATCGGGAAATGGGTTTTCGGTGATCGGAAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAAGGAGGAATCGAGGCTGGGTTTTACAAGGCGGGGATGTGGGCTTCCATGA
BLAST of CSPI05G08550.1 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 1.2e-58
Identity = 154/460 (33.48%), Postives = 233/460 (50.65%), Query Frame = 1

Query: 14  LLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS------ 73
           LL S F +   + +++++    D +     +K P    S  L  D+ R+  + +      
Sbjct: 55  LLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIP 114

Query: 74  ------RPNPT-LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 133
                  P P    S ++SG S GSG+YF  + +GTP + + +V DTGSD+VW++C+ CR
Sbjct: 115 GRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCR 174

Query: 134 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 193
            C +      F PR S +++   C  PHCR L  A    CN  R    C +  SY DGS 
Sbjct: 175 RC-YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG---CNTRR--KTCLYQVSYGDGSF 234

Query: 194 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 253
           + G FS ET T +        +KG++ GCG    G       F GA G++GLG+G +SF 
Sbjct: 235 TVGDFSTETLTFR-----RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLSFP 294

Query: 254 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 313
            Q G RF  KFSYCL+D + S  P+S +     +  +         +TPL  NP   TFY
Sbjct: 295 GQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDTFY 354

Query: 314 YITIHSITIDGVKLP-INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK 373
           Y+ +  I++ G ++P +  +++++D+ GNGG ++DSGT++T L + AY  +  + R   K
Sbjct: 355 YVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAK 414

Query: 374 LPNAAELTPGFDLCVNAS--GESRRPSLPRLRFRLGGGAVFAPPPRNYFLETE-EGVMCL 433
               A     FD C + S   E + P++  L FR   GA  + P  NY +  +  G  C 
Sbjct: 415 TLKRAPDFSLFDTCFDLSNMNEVKVPTVV-LHFR---GADVSLPATNYLIPVDTNGKFCF 474

Query: 434 AIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           A     +  G S+IGN+ QQGF + +D   SR+GF   GC
Sbjct: 475 AFAG--TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CSPI05G08550.1 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 7.6e-53
Identity = 127/374 (33.96%), Postives = 193/374 (51.60%), Query Frame = 1

Query: 84  GSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFH 143
           G G+Y +++ +GTP  S   + DTGSDL+W +C  C  C   P +  F P+ SSSFS   
Sbjct: 92  GDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQP-TPIFNPQDSSSFSTLP 151

Query: 144 CFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLK 203
           C   +C+ LP      CN+      C++ Y Y DGS + G+ + ET T ++ S     + 
Sbjct: 152 CESQYCQDLPS---ETCNNNE----CQYTYGYGDGSTTQGYMATETFTFETSS-----VP 211

Query: 204 GLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPP 263
            ++FGCG    G      Q NGA G++G+G G +S  SQLG     +FSYC+  Y  S P
Sbjct: 212 NIAFGCGEDNQG----FGQGNGA-GLIGMGWGPLSLPSQLGV---GQFSYCMTSYGSSSP 271

Query: 264 PTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEI 323
            T  L +G     +P  + +    T L  + L+PT+YYIT+  IT+ G  L I  + +++
Sbjct: 272 ST--LALGSAASGVPEGSPS----TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQL 331

Query: 324 DEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRP 383
            + G GG ++DSGTTLTYL + AY  V ++   ++ LP   E + G   C     +    
Sbjct: 332 QDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTV 391

Query: 384 SLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFD 443
            +P +  +  GG V     +N  +   EGV+CLA+ +  S  G S+ GN+ QQ   + +D
Sbjct: 392 QVPEISMQFDGG-VLNLGEQNILISPAEGVICLAMGS-SSQLGISIFGNIQQQETQVLYD 436

Query: 444 KEESRLGFTRRGCG 458
            +   + F    CG
Sbjct: 452 LQNLAVSFVPTQCG 436

BLAST of CSPI05G08550.1 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 1.1e-51
Identity = 137/410 (33.41%), Postives = 200/410 (48.78%), Query Frame = 1

Query: 53  QSLSSDTHRLSLLFSRPNPTLKSPLISGAST----GSGQYFVDIRLGTPPQSLLLVADTG 112
           Q L     R S    R    L  P  SG  T    G G+Y +++ +GTP Q    + DTG
Sbjct: 58  QLLERAIERGSRRLQRLEAMLNGP--SGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTG 117

Query: 113 SDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSP 172
           SDL+W +C  C  C +   +  F P+ SSSFS   C    C+ L        + T  ++ 
Sbjct: 118 SDLIWTQCQPCTQCFNQS-TPIFNPQGSSSFSTLPCSSQLCQALS-------SPTCSNNF 177

Query: 173 CRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARG 232
           C++ Y Y DGS + G    ET T  S+S     +  ++FGCG    G      Q NGA G
Sbjct: 178 CQYTYGYGDGSETQGSMGTETLTFGSVS-----IPNITFGCGENNQG----FGQGNGA-G 237

Query: 233 VMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYT 292
           ++G+GRG +S  SQL      KFSYC+     S P  S L++G   +S+   +       
Sbjct: 238 LVGMGRGPLSLPSQLDV---TKFSYCMTPIGSSTP--SNLLLGSLANSVTAGSPNTTLIQ 297

Query: 293 PLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQ-GNGGTVVDSGTTLTYLTKTAY 352
             QI    PTFYYIT++ +++   +LPI+P+ + ++   G GG ++DSGTTLTY    AY
Sbjct: 298 SSQI----PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAY 357

Query: 353 EEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFL 412
           + V +    ++ LP     + GFDLC     +     +P       GG +   P  NYF+
Sbjct: 358 QSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDL-ELPSENYFI 417

Query: 413 ETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCG 458
               G++CLA+ +  S  G S+ GN+ QQ  L+ +D   S + F    CG
Sbjct: 418 SPSNGLICLAMGS--SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435

BLAST of CSPI05G08550.1 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 5.6e-48
Identity = 125/384 (32.55%), Postives = 198/384 (51.56%), Query Frame = 1

Query: 75  SPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPR 134
           S ++SG   GSG+YFV I +G+PP+   +V D+GSD+VWV+C  C+ C +      F P 
Sbjct: 118 SDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC-YKQSDPVFDPA 177

Query: 135 HSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKS 194
            S S++   C    C  + ++  H          CR+   Y DGS     ++K T  L++
Sbjct: 178 KSGSYTGVSCGSSVCDRIENSGCH-------SGGCRYEVMYGDGS-----YTKGTLALET 237

Query: 195 LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYC 254
           L+ ++  ++ ++ GCG R  G       F GA G++G+G GS+SF  QL  + G  F YC
Sbjct: 238 LTFAKTVVRNVAMGCGHRNRG------MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYC 297

Query: 255 LMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314
           L+  +     T  L+ G    +LP+      S+ PL  NP +P+FYY+ +  + + GV++
Sbjct: 298 LV--SRGTDSTGSLVFGR--EALPVG----ASWVPLVRNPRAPSFYYVGLKGLGVGGVRI 357

Query: 315 PINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVR-RRVKLPNAAELTPGFDLC 374
           P+   V+++ E G+GG V+D+GT +T L   AY       + +   LP A+ ++  FD C
Sbjct: 358 PLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FDTC 417

Query: 375 VNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE-GVMCLAIRAVESGNGFSVIGN 434
            + SG      +P + F    G V   P RN+ +  ++ G  C A  A  S  G S+IGN
Sbjct: 418 YDLSG-FVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAA--SPTGLSIIGN 470

Query: 435 LMQQGFLLEFDKEESRLGFTRRGC 457
           + Q+G  + FD     +GF    C
Sbjct: 478 IQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CSPI05G08550.1 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.5e-45
Identity = 119/389 (30.59%), Postives = 189/389 (48.59%), Query Frame = 1

Query: 73  LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL 132
           L +P++SGAS GSG+YF  I +GTP + + LV DTGSD+ W++C  C +C +      F 
Sbjct: 147 LTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADC-YQQSDPVFN 206

Query: 133 PRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTL 192
           P  SS++    C  P C LL  +    C   +    C +  SY DGS + G  + +T T 
Sbjct: 207 PTSSSTYKSLTCSAPQCSLLETS---ACRSNK----CLYQVSYGDGSFTVGELATDTVTF 266

Query: 193 KSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFS 252
               G+   +  ++ GCG    G       F GA G++GLG G +S ++Q+       FS
Sbjct: 267 ----GNSGKINNVALGCGHDNEG------LFTGAAGLLGLGGGVLSITNQMK---ATSFS 326

Query: 253 YCLMDYTLSPPPT---SFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITI 312
           YCL+D       +   + + +GGG  + PL    KI            TFYY+ +   ++
Sbjct: 327 YCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKID-----------TFYYVGLSGFSV 386

Query: 313 DGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKS-VRRRVKLPNAAELTP 372
            G K+ +  A++++D  G+GG ++D GT +T L   AY  +  + ++  V L   +    
Sbjct: 387 GGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSIS 446

Query: 373 GFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE-GVMCLAIRAVESGNGF 432
            FD C + S  S    +P + F   GG     P +NY +  ++ G  C A     S    
Sbjct: 447 LFDTCYDFSSLS-TVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS--SL 500

Query: 433 SVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           S+IGN+ QQG  + +D  ++ +G +   C
Sbjct: 507 SIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CSPI05G08550.1 vs. TrEMBL
Match: A0A0A0KNH6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 1.1e-268
Identity = 459/459 (100.00%), Postives = 459/459 (100.00%), Query Frame = 1

Query: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60
           MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH
Sbjct: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60

Query: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120
           RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR
Sbjct: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120

Query: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180
           NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL
Sbjct: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180

Query: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240
           SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS
Sbjct: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240

Query: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
           SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY
Sbjct: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300

Query: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
           YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL
Sbjct: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420
           PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CSPI05G08550.1 vs. TrEMBL
Match: F6HF17_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=1)

HSP 1 Score: 610.9 bits (1574), Expect = 1.3e-171
Identity = 299/458 (65.28%), Postives = 361/458 (78.82%), Query Frame = 1

Query: 4   LSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLS 63
           L  S  F L+LL  FF T + N    A     ++LKL LLH  PF++PSQ+LS D+HRLS
Sbjct: 3   LPFSSLFSLLLLLIFFFTDICNALPIAQNGTVEYLKLRLLHIKPFTTPSQALSFDSHRLS 62

Query: 64  LLFSRPNP--TLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRN 123
             FS  +   +LKSP++SGASTGSGQYFVD+RLGTPPQ LLLVADTGSDLVWVKCSACRN
Sbjct: 63  FFFSALHTPQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRN 122

Query: 124 CSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLS 183
           C+ H P SAFL RHS++FSP HC+D  C+L+P   HH CNH RLHSPCR+ YSY DGS +
Sbjct: 123 CTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKT 182

Query: 184 SGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSS 243
           SGFFSKETTTL + SG E  LKG++FGC FRISGPSVSGA FNGA GVMGLGRG IS SS
Sbjct: 183 SGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSS 242

Query: 244 QLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYY 303
           QLG RFGNKFSYCLMD+ +SP PTS+L+IG   + +      ++ +TPL INPLSPTFYY
Sbjct: 243 QLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDV-APGKRRMRFTPLHINPLSPTFYY 302

Query: 304 ITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLP 363
           I I S+++DG+KLPINP+VW +DE GNGGT+VDSGTTLT+L + AY ++L  ++RRV+LP
Sbjct: 303 IGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLP 362

Query: 364 NAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV 423
           + AE TPGFDLCVN S E   P LP+L F+LGG +VF+PPPRNYF++T+E V CLA++AV
Sbjct: 363 SPAEPTPGFDLCVNVS-EIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAV 422

Query: 424 ESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
            + +GFSVIGNLMQQGFLLEFDK+ +RLGF+R GC LP
Sbjct: 423 MTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGCALP 458

BLAST of CSPI05G08550.1 vs. TrEMBL
Match: A0A067K2U7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 4.1e-170
Identity = 304/463 (65.66%), Postives = 364/463 (78.62%), Query Frame = 1

Query: 1   MPVLSISPFFLLILL----FSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLS 60
           M ++S  PF LL+LL    +S  L    N  AT      ++LKLPLLH+ PF SP+Q+L 
Sbjct: 1   MVLVSSLPFLLLLLLTDLCYSISLRTTVNSTATK-----EYLKLPLLHRTPFKSPAQALP 60

Query: 61  SDTHRLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKC 120
            D  RLSLL  R   +LKSP+ISGASTGSGQYFV +RLG+P Q+LLLVADTGSDLVWVKC
Sbjct: 61  FDIRRLSLLH-RQRTSLKSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSDLVWVKC 120

Query: 121 SACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYA 180
           SAC+NCS++ P SAFL RHSS+FS  HCF+  CRL+PH   + CN TRLHSPCR+ YSYA
Sbjct: 121 SACKNCSNYSPGSAFLARHSSTFSLIHCFNSQCRLVPHPRPNPCNRTRLHSPCRYEYSYA 180

Query: 181 DGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGS 240
           DGS +SGFFSKETTTL + +G E  LK L+FGCGFRISGPS++GA F GA GV+GLGR  
Sbjct: 181 DGSSTSGFFSKETTTLNTSAGREKKLKNLAFGCGFRISGPSLTGASFAGAHGVIGLGRAP 240

Query: 241 ISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLS 300
           ISFSSQLGRRFGNKFSYCLMDYTLSPPPTS+LMIGG  +S  ++    +++TPL +N LS
Sbjct: 241 ISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSA-VSRKRILNFTPLLVNSLS 300

Query: 301 PTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRR 360
           PTFYYI I S+++DGVKLPINP+VW ID+ GNGGT++DSGTTLT+L + AY E+L +++R
Sbjct: 301 PTFYYIGIKSVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYREILSAIKR 360

Query: 361 RVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCL 420
           RVKLP   ELTPGFDLCVN SG  RRP  PR+   L G +VF+PPPRNYF++T EGV CL
Sbjct: 361 RVKLPGPGELTPGFDLCVNVSG-VRRPVFPRMSLELAGNSVFSPPPRNYFIDTSEGVKCL 420

Query: 421 AIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           AI+ V SG+GFSVIGNLMQQG+LLEFD++ SRLGF R GC LP
Sbjct: 421 AIQPVNSGSGFSVIGNLMQQGYLLEFDRDRSRLGFARSGCALP 455

BLAST of CSPI05G08550.1 vs. TrEMBL
Match: V4KZL9_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10004188mg PE=3 SV=1)

HSP 1 Score: 600.1 bits (1546), Expect = 2.2e-168
Identity = 298/454 (65.64%), Postives = 356/454 (78.41%), Query Frame = 1

Query: 12  LILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSR--P 71
           LI+L SF    L  P   A     ++LKLPLL K PF SP+QSL+ DT RL  L  R  P
Sbjct: 4   LIVLCSFLSLFLLPPVNLAAVNDDEYLKLPLLRKSPFPSPTQSLALDTRRLHFLSLRRKP 63

Query: 72  NPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSS 131
            P +KSP++SGAS+GSGQYFVD+R+G PPQSLLL+ADTGSDLVWVKCSACRNCS H P +
Sbjct: 64  VPFVKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSLHSPGT 123

Query: 132 AFLPRHSSSFSPFHCFDPHCRLLPH---APHHLCNHTRLHSPCRFLYSYADGSLSSGFFS 191
            F PRHSS+FSP HC+DP CRL+P    AP   CNHTR+HS C + Y+YADGSL+SG F+
Sbjct: 124 VFFPRHSSTFSPAHCYDPICRLVPEPGRAPK--CNHTRIHSTCPYEYAYADGSLTSGLFA 183

Query: 192 KETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR 251
           +ETTTLK+ SG E +LK ++FGCGFRISG SVSG  FNGA GVMGLGRG ISF+SQLGRR
Sbjct: 184 RETTTLKTSSGREAYLKSVAFGCGFRISGQSVSGTSFNGAHGVMGLGRGPISFASQLGRR 243

Query: 252 FGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHS 311
           FGNKFSYCLMDYTLSPPPTS+L+IG G   +     +K+S+TPL  NPLSPTFYY+ + S
Sbjct: 244 FGNKFSYCLMDYTLSPPPTSYLIIGDGGGGVRSDAVSKLSFTPLLTNPLSPTFYYVRLKS 303

Query: 312 ITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAEL 371
           I ++G KL I+P+VWEID+ GNGGTVVDSGTTL +L + AY  V+ +VRRR++LP AAE+
Sbjct: 304 IFVNGAKLRIDPSVWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRIRLPIAAEV 363

Query: 372 TPGFDLCVNASGESR-RPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGN 431
           TPGFDLCVN SG S+    +PRL+F L GGA+F PPPRNYF+ETEE + CLAI++V    
Sbjct: 364 TPGFDLCVNISGVSKPEKIMPRLKFELAGGALFVPPPRNYFIETEEQIQCLAIQSVNPKV 423

Query: 432 GFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           GFSVIGNLMQQGFL EFD++ SRLGF+RRGC LP
Sbjct: 424 GFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 455

BLAST of CSPI05G08550.1 vs. TrEMBL
Match: Q9LI73_ARATH (Aspartyl protease family protein OS=Arabidopsis thaliana GN=At3g25700 PE=2 SV=1)

HSP 1 Score: 596.7 bits (1537), Expect = 2.5e-167
Identity = 296/456 (64.91%), Postives = 355/456 (77.85%), Query Frame = 1

Query: 10  FLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSR- 69
           F L    S FL  LP  N  AV+    +LKLPLL K PF SP+Q+L+ DT RL  L  R 
Sbjct: 6   FFLCSFLSLFL--LPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRR 65

Query: 70  -PNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 129
            P P +KSP++SGA++GSGQYFVD+R+G PPQSLLL+ADTGSDLVWVKCSACRNCSHH P
Sbjct: 66  KPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSP 125

Query: 130 SSAFLPRHSSSFSPFHCFDPHCRLLP---HAPHHLCNHTRLHSPCRFLYSYADGSLSSGF 189
           ++ F PRHSS+FSP HC+DP CRL+P    AP  +CNHTR+HS C + Y YADGSL+SG 
Sbjct: 126 ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAP--ICNHTRIHSTCHYEYGYADGSLTSGL 185

Query: 190 FSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLG 249
           F++ETT+LK+ SG E  LK ++FGCGFRISG SVSG  FNGA GVMGLGRG ISF+SQLG
Sbjct: 186 FARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLG 245

Query: 250 RRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITI 309
           RRFGNKFSYCLMDYTLSPPPTS+L+IG G   +     +K+ +TPL  NPLSPTFYY+ +
Sbjct: 246 RRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGI-----SKLFFTPLLTNPLSPTFYYVKL 305

Query: 310 HSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAA 369
            S+ ++G KL I+P++WEID+ GNGGTVVDSGTTL +L + AY  V+ +VRRRVKLP A 
Sbjct: 306 KSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD 365

Query: 370 ELTPGFDLCVNASGESR-RPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVES 429
            LTPGFDLCVN SG ++    LPRL+F   GGAVF PPPRNYF+ETEE + CLAI++V+ 
Sbjct: 366 ALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDP 425

Query: 430 GNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
             GFSVIGNLMQQGFL EFD++ SRLGF+RRGC LP
Sbjct: 426 KVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of CSPI05G08550.1 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 596.7 bits (1537), Expect = 1.2e-170
Identity = 296/456 (64.91%), Postives = 355/456 (77.85%), Query Frame = 1

Query: 10  FLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSR- 69
           F L    S FL  LP  N  AV+    +LKLPLL K PF SP+Q+L+ DT RL  L  R 
Sbjct: 6   FFLCSFLSLFL--LPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRR 65

Query: 70  -PNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 129
            P P +KSP++SGA++GSGQYFVD+R+G PPQSLLL+ADTGSDLVWVKCSACRNCSHH P
Sbjct: 66  KPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSP 125

Query: 130 SSAFLPRHSSSFSPFHCFDPHCRLLP---HAPHHLCNHTRLHSPCRFLYSYADGSLSSGF 189
           ++ F PRHSS+FSP HC+DP CRL+P    AP  +CNHTR+HS C + Y YADGSL+SG 
Sbjct: 126 ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAP--ICNHTRIHSTCHYEYGYADGSLTSGL 185

Query: 190 FSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLG 249
           F++ETT+LK+ SG E  LK ++FGCGFRISG SVSG  FNGA GVMGLGRG ISF+SQLG
Sbjct: 186 FARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLG 245

Query: 250 RRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITI 309
           RRFGNKFSYCLMDYTLSPPPTS+L+IG G   +     +K+ +TPL  NPLSPTFYY+ +
Sbjct: 246 RRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGI-----SKLFFTPLLTNPLSPTFYYVKL 305

Query: 310 HSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAA 369
            S+ ++G KL I+P++WEID+ GNGGTVVDSGTTL +L + AY  V+ +VRRRVKLP A 
Sbjct: 306 KSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD 365

Query: 370 ELTPGFDLCVNASGESR-RPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVES 429
            LTPGFDLCVN SG ++    LPRL+F   GGAVF PPPRNYF+ETEE + CLAI++V+ 
Sbjct: 366 ALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDP 425

Query: 430 GNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
             GFSVIGNLMQQGFL EFD++ SRLGF+RRGC LP
Sbjct: 426 KVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of CSPI05G08550.1 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 232.3 bits (591), Expect = 6.2e-61
Identity = 148/391 (37.85%), Postives = 204/391 (52.17%), Query Frame = 1

Query: 73  LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL 132
           L + L SG + GSG+YF+D+ +GTPP+   L+ DTGSDL W++C  C +C  H     + 
Sbjct: 145 LIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDC-FHQNGMFYD 204

Query: 133 PRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTL 192
           P+ S+SF    C DP C L+  +P         +  C + Y Y D S ++G F+ ET T+
Sbjct: 205 PKTSASFKNITCNDPRCSLI-SSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTV 264

Query: 193 KSLS----GSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFG 252
              +     SE  +  + FGCG    G       F+GA G++GLGRG +SFSSQL   +G
Sbjct: 265 NLTTTEGGSSEYKVGNMMFGCGHWNRG------LFSGASGLLGLGRGPLSFSSQLQSLYG 324

Query: 253 NKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSIT 312
           + FSYCL+D   +   +S L+ G     L  TN    S+   + N +  TFYYI I SI 
Sbjct: 325 HSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVE-TFYYIQIKSIL 384

Query: 313 IDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK--LPNAAEL 372
           + G  L I    W I   G+GGT++DSGTTL+Y  + AYE +      ++K   P   + 
Sbjct: 385 VGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF 444

Query: 373 TPGFDLCVNASG-ESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGN 432
            P  D C N SG E     LP L      G V+  P  N F+   E ++CLAI       
Sbjct: 445 -PVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKST 504

Query: 433 GFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
            FS+IGN  QQ F + +D + SRLGFT   C
Sbjct: 505 -FSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

BLAST of CSPI05G08550.1 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 228.8 bits (582), Expect = 6.8e-60
Identity = 154/460 (33.48%), Postives = 233/460 (50.65%), Query Frame = 1

Query: 14  LLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS------ 73
           LL S F +   + +++++    D +     +K P    S  L  D+ R+  + +      
Sbjct: 55  LLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIP 114

Query: 74  ------RPNPT-LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 133
                  P P    S ++SG S GSG+YF  + +GTP + + +V DTGSD+VW++C+ CR
Sbjct: 115 GRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCR 174

Query: 134 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 193
            C +      F PR S +++   C  PHCR L  A    CN  R    C +  SY DGS 
Sbjct: 175 RC-YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG---CNTRR--KTCLYQVSYGDGSF 234

Query: 194 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 253
           + G FS ET T +        +KG++ GCG    G       F GA G++GLG+G +SF 
Sbjct: 235 TVGDFSTETLTFR-----RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLSFP 294

Query: 254 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 313
            Q G RF  KFSYCL+D + S  P+S +     +  +         +TPL  NP   TFY
Sbjct: 295 GQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDTFY 354

Query: 314 YITIHSITIDGVKLP-INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK 373
           Y+ +  I++ G ++P +  +++++D+ GNGG ++DSGT++T L + AY  +  + R   K
Sbjct: 355 YVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAK 414

Query: 374 LPNAAELTPGFDLCVNAS--GESRRPSLPRLRFRLGGGAVFAPPPRNYFLETE-EGVMCL 433
               A     FD C + S   E + P++  L FR   GA  + P  NY +  +  G  C 
Sbjct: 415 TLKRAPDFSLFDTCFDLSNMNEVKVPTVV-LHFR---GADVSLPATNYLIPVDTNGKFCF 474

Query: 434 AIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           A     +  G S+IGN+ QQGF + +D   SR+GF   GC
Sbjct: 475 AFAG--TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CSPI05G08550.1 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 222.6 bits (566), Expect = 4.9e-58
Identity = 154/454 (33.92%), Postives = 232/454 (51.10%), Query Frame = 1

Query: 17  SFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSRPNPTLKSP 76
           S  L+H+   ++ + A+PAD   L L      S   +S++S    L+ + +  N T ++P
Sbjct: 62  SVHLSHVDALSSFSDASPADLFNLRLQRD---SLRVKSITS----LAAVSTGRNATKRTP 121

Query: 77  ---------LISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 136
                    +ISG S GSG+YF+ + +GTP  ++ +V DTGSD+VW++CS C+ C ++  
Sbjct: 122 RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKAC-YNQT 181

Query: 137 SSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSK 196
            + F P+ S +F+   C    CR L  +   +   TR    C +  SY DGS + G FS 
Sbjct: 182 DAIFDPKKSKTFATVPCGSRLCRRLDDSSECV---TRRSKTCLYQVSYGDGSFTEGDFST 241

Query: 197 ETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRF 256
           ET T        + L     GCG    G       F GA G++GLGRG +SF SQ   R+
Sbjct: 242 ETLTFHGARVDHVPL-----GCGHDNEG------LFVGAAGLLGLGRGGLSFPSQTKNRY 301

Query: 257 GNKFSYCLMDYT----LSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYIT 316
             KFSYCL+D T     S PP++ +    G  ++P T+     +TPL  NP   TFYY+ 
Sbjct: 302 NGKFSYCLVDRTSSGSSSKPPSTIVF---GNAAVPKTSV----FTPLLTNPKLDTFYYLQ 361

Query: 317 IHSITIDGVKLP-INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPN 376
           +  I++ G ++P ++ + +++D  GNGG ++DSGT++T LT+ AY  +  + R       
Sbjct: 362 LLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLK 421

Query: 377 AAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVE 436
            A     FD C + SG +    +P + F  GGG V  P          EG  C A     
Sbjct: 422 RAPSYSLFDTCFDLSGMT-TVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAG-- 481

Query: 437 SGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           +    S+IGN+ QQGF + +D   SR+GF  R C
Sbjct: 482 TMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CSPI05G08550.1 vs. TAIR10
Match: AT3G59080.1 (AT3G59080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 218.0 bits (554), Expect = 1.2e-56
Identity = 143/392 (36.48%), Postives = 204/392 (52.04%), Query Frame = 1

Query: 73  LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL 132
           L + L SG + GSG+YF+D+ +G+PP+   L+ DTGSDL W++C  C +C     + AF 
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQ--NGAFY 214

Query: 133 -PRHSSSFSPFHCFDPHCRLLPHA-PHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETT 192
            P+ S+S+    C D  C L+    P   C     +  C + Y Y D S ++G F+ ET 
Sbjct: 215 DPKASASYKNITCNDQRCNLVSSPDPPMPCKSD--NQSCPYYYWYGDSSNTTGDFAVETF 274

Query: 193 TLKSLSG---SEIH-LKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR 252
           T+   +    SE++ ++ + FGCG    G       F+GA G++GLGRG +SFSSQL   
Sbjct: 275 TVNLTTNGGSSELYNVENMMFGCGHWNRG------LFHGAAGLLGLGRGPLSFSSQLQSL 334

Query: 253 FGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHS 312
           +G+ FSYCL+D       +S L+ G     L   N    S+   + N L  TFYY+ I S
Sbjct: 335 YGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKEN-LVDTFYYVQIKS 394

Query: 313 ITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRV--KLPNAA 372
           I + G  L I    W I   G GGT++DSGTTL+Y  + AYE +   +  +   K P   
Sbjct: 395 ILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYR 454

Query: 373 ELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESG 432
           +  P  D C N SG      LP L      GAV+  P  N F+   E ++CLA+      
Sbjct: 455 DF-PILDPCFNVSG-IHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPK- 514

Query: 433 NGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 457
           + FS+IGN  QQ F + +D + SRLG+    C
Sbjct: 515 SAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532

BLAST of CSPI05G08550.1 vs. NCBI nr
Match: gi|449451908|ref|XP_004143702.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus])

HSP 1 Score: 933.3 bits (2411), Expect = 1.6e-268
Identity = 459/459 (100.00%), Postives = 459/459 (100.00%), Query Frame = 1

Query: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60
           MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH
Sbjct: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60

Query: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120
           RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR
Sbjct: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120

Query: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180
           NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL
Sbjct: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180

Query: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240
           SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS
Sbjct: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240

Query: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
           SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY
Sbjct: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300

Query: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
           YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL
Sbjct: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420
           PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CSPI05G08550.1 vs. NCBI nr
Match: gi|659073000|ref|XP_008467208.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo])

HSP 1 Score: 903.3 bits (2333), Expect = 1.8e-259
Identity = 446/459 (97.17%), Postives = 450/459 (98.04%), Query Frame = 1

Query: 1   MPVLSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTH 60
           MP+LSISPFFLLI LF FFLTHL NPNATAVAA ADFLKLPLLHKPPFSSPSQSLSSDTH
Sbjct: 1   MPMLSISPFFLLIPLFFFFLTHLSNPNATAVAAAADFLKLPLLHKPPFSSPSQSLSSDTH 60

Query: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120
           RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR
Sbjct: 61  RLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 120

Query: 121 NCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSL 180
           NCSHHPPSSAF PRHSSSFSPFHCFDPHCRLLPHAP H CNHT LHSPCRFLYSYADGSL
Sbjct: 121 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHAPPHHCNHTLLHSPCRFLYSYADGSL 180

Query: 181 SSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240
           SSGFFSKETTTLK+LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS
Sbjct: 181 SSGFFSKETTTLKTLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFS 240

Query: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
           SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLP+ NATKISYTPLQINPLSPTFY
Sbjct: 241 SQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPVNNATKISYTPLQINPLSPTFY 300

Query: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
           YITI+SITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL
Sbjct: 301 YITINSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420
           PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CSPI05G08550.1 vs. NCBI nr
Match: gi|359473000|ref|XP_002278677.2| (PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera])

HSP 1 Score: 610.9 bits (1574), Expect = 1.8e-171
Identity = 299/458 (65.28%), Postives = 361/458 (78.82%), Query Frame = 1

Query: 4   LSISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLS 63
           L  S  F L+LL  FF T + N    A     ++LKL LLH  PF++PSQ+LS D+HRLS
Sbjct: 3   LPFSSLFSLLLLLIFFFTDICNALPIAQNGTVEYLKLRLLHIKPFTTPSQALSFDSHRLS 62

Query: 64  LLFSRPNP--TLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRN 123
             FS  +   +LKSP++SGASTGSGQYFVD+RLGTPPQ LLLVADTGSDLVWVKCSACRN
Sbjct: 63  FFFSALHTPQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRN 122

Query: 124 CSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLS 183
           C+ H P SAFL RHS++FSP HC+D  C+L+P   HH CNH RLHSPCR+ YSY DGS +
Sbjct: 123 CTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKT 182

Query: 184 SGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSS 243
           SGFFSKETTTL + SG E  LKG++FGC FRISGPSVSGA FNGA GVMGLGRG IS SS
Sbjct: 183 SGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSS 242

Query: 244 QLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYY 303
           QLG RFGNKFSYCLMD+ +SP PTS+L+IG   + +      ++ +TPL INPLSPTFYY
Sbjct: 243 QLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDV-APGKRRMRFTPLHINPLSPTFYY 302

Query: 304 ITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLP 363
           I I S+++DG+KLPINP+VW +DE GNGGT+VDSGTTLT+L + AY ++L  ++RRV+LP
Sbjct: 303 IGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLP 362

Query: 364 NAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV 423
           + AE TPGFDLCVN S E   P LP+L F+LGG +VF+PPPRNYF++T+E V CLA++AV
Sbjct: 363 SPAEPTPGFDLCVNVS-EIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAV 422

Query: 424 ESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
            + +GFSVIGNLMQQGFLLEFDK+ +RLGF+R GC LP
Sbjct: 423 MTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGCALP 458

BLAST of CSPI05G08550.1 vs. NCBI nr
Match: gi|802680767|ref|XP_012082020.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas])

HSP 1 Score: 605.9 bits (1561), Expect = 5.8e-170
Identity = 304/463 (65.66%), Postives = 364/463 (78.62%), Query Frame = 1

Query: 1   MPVLSISPFFLLILL----FSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLS 60
           M ++S  PF LL+LL    +S  L    N  AT      ++LKLPLLH+ PF SP+Q+L 
Sbjct: 1   MVLVSSLPFLLLLLLTDLCYSISLRTTVNSTATK-----EYLKLPLLHRTPFKSPAQALP 60

Query: 61  SDTHRLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKC 120
            D  RLSLL  R   +LKSP+ISGASTGSGQYFV +RLG+P Q+LLLVADTGSDLVWVKC
Sbjct: 61  FDIRRLSLLH-RQRTSLKSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSDLVWVKC 120

Query: 121 SACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYA 180
           SAC+NCS++ P SAFL RHSS+FS  HCF+  CRL+PH   + CN TRLHSPCR+ YSYA
Sbjct: 121 SACKNCSNYSPGSAFLARHSSTFSLIHCFNSQCRLVPHPRPNPCNRTRLHSPCRYEYSYA 180

Query: 181 DGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGS 240
           DGS +SGFFSKETTTL + +G E  LK L+FGCGFRISGPS++GA F GA GV+GLGR  
Sbjct: 181 DGSSTSGFFSKETTTLNTSAGREKKLKNLAFGCGFRISGPSLTGASFAGAHGVIGLGRAP 240

Query: 241 ISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLS 300
           ISFSSQLGRRFGNKFSYCLMDYTLSPPPTS+LMIGG  +S  ++    +++TPL +N LS
Sbjct: 241 ISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSA-VSRKRILNFTPLLVNSLS 300

Query: 301 PTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRR 360
           PTFYYI I S+++DGVKLPINP+VW ID+ GNGGT++DSGTTLT+L + AY E+L +++R
Sbjct: 301 PTFYYIGIKSVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYREILSAIKR 360

Query: 361 RVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCL 420
           RVKLP   ELTPGFDLCVN SG  RRP  PR+   L G +VF+PPPRNYF++T EGV CL
Sbjct: 361 RVKLPGPGELTPGFDLCVNVSG-VRRPVFPRMSLELAGNSVFSPPPRNYFIDTSEGVKCL 420

Query: 421 AIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           AI+ V SG+GFSVIGNLMQQG+LLEFD++ SRLGF R GC LP
Sbjct: 421 AIQPVNSGSGFSVIGNLMQQGYLLEFDRDRSRLGFARSGCALP 455

BLAST of CSPI05G08550.1 vs. NCBI nr
Match: gi|567142517|ref|XP_006395632.1| (hypothetical protein EUTSA_v10004188mg [Eutrema salsugineum])

HSP 1 Score: 600.1 bits (1546), Expect = 3.2e-168
Identity = 298/454 (65.64%), Postives = 356/454 (78.41%), Query Frame = 1

Query: 12  LILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSR--P 71
           LI+L SF    L  P   A     ++LKLPLL K PF SP+QSL+ DT RL  L  R  P
Sbjct: 4   LIVLCSFLSLFLLPPVNLAAVNDDEYLKLPLLRKSPFPSPTQSLALDTRRLHFLSLRRKP 63

Query: 72  NPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSS 131
            P +KSP++SGAS+GSGQYFVD+R+G PPQSLLL+ADTGSDLVWVKCSACRNCS H P +
Sbjct: 64  VPFVKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSLHSPGT 123

Query: 132 AFLPRHSSSFSPFHCFDPHCRLLPH---APHHLCNHTRLHSPCRFLYSYADGSLSSGFFS 191
            F PRHSS+FSP HC+DP CRL+P    AP   CNHTR+HS C + Y+YADGSL+SG F+
Sbjct: 124 VFFPRHSSTFSPAHCYDPICRLVPEPGRAPK--CNHTRIHSTCPYEYAYADGSLTSGLFA 183

Query: 192 KETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR 251
           +ETTTLK+ SG E +LK ++FGCGFRISG SVSG  FNGA GVMGLGRG ISF+SQLGRR
Sbjct: 184 RETTTLKTSSGREAYLKSVAFGCGFRISGQSVSGTSFNGAHGVMGLGRGPISFASQLGRR 243

Query: 252 FGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHS 311
           FGNKFSYCLMDYTLSPPPTS+L+IG G   +     +K+S+TPL  NPLSPTFYY+ + S
Sbjct: 244 FGNKFSYCLMDYTLSPPPTSYLIIGDGGGGVRSDAVSKLSFTPLLTNPLSPTFYYVRLKS 303

Query: 312 ITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAEL 371
           I ++G KL I+P+VWEID+ GNGGTVVDSGTTL +L + AY  V+ +VRRR++LP AAE+
Sbjct: 304 IFVNGAKLRIDPSVWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRIRLPIAAEV 363

Query: 372 TPGFDLCVNASGESR-RPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGN 431
           TPGFDLCVN SG S+    +PRL+F L GGA+F PPPRNYF+ETEE + CLAI++V    
Sbjct: 364 TPGFDLCVNISGVSKPEKIMPRLKFELAGGALFVPPPRNYFIETEEQIQCLAIQSVNPKV 423

Query: 432 GFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 460
           GFSVIGNLMQQGFL EFD++ SRLGF+RRGC LP
Sbjct: 424 GFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 455

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
APF2_ARATH1.2e-5833.48Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
NEP2_NEPGR7.6e-5333.96Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
NEP1_NEPGR1.1e-5133.41Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
ASPG2_ARATH5.6e-4832.55Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
ASPG1_ARATH1.5e-4530.59Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
Match NameE-valueIdentityDescription
A0A0A0KNH6_CUCSA1.1e-268100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1[more]
F6HF17_VITVI1.3e-17165.28Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=... [more]
A0A067K2U7_JATCU4.1e-17065.66Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1[more]
V4KZL9_EUTSA2.2e-16865.64Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10004188mg PE=3 SV=1[more]
Q9LI73_ARATH2.5e-16764.91Aspartyl protease family protein OS=Arabidopsis thaliana GN=At3g25700 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT3G25700.11.2e-17064.91 Eukaryotic aspartyl protease family protein[more]
AT2G42980.16.2e-6137.85 Eukaryotic aspartyl protease family protein[more]
AT1G01300.16.8e-6033.48 Eukaryotic aspartyl protease family protein[more]
AT3G61820.14.9e-5833.92 Eukaryotic aspartyl protease family protein[more]
AT3G59080.11.2e-5636.48 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|449451908|ref|XP_004143702.1|1.6e-268100.00PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus][more]
gi|659073000|ref|XP_008467208.1|1.8e-25997.17PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo][more]
gi|359473000|ref|XP_002278677.2|1.8e-17165.28PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera][more]
gi|802680767|ref|XP_012082020.1|5.8e-17065.66PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas][more]
gi|567142517|ref|XP_006395632.1|3.2e-16865.64hypothetical protein EUTSA_v10004188mg [Eutrema salsugineum][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CSPI05G08550CSPI05G08550gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CSPI05G08550.1CSPI05G08550.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI05G08550.1.utr5p1CSPI05G08550.1.utr5p1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI05G08550.1.cds1CSPI05G08550.1.cds1CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI05G08550.1.utr3p1CSPI05G08550.1.utr3p1three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 94..114
score: 1.8E-7coord: 428..443
score: 1.8E-7coord: 331..342
score: 1.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 4..457
score: 4.6E
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 282..456
score: 2.3E-46coord: 76..271
score: 2.5
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 82..456
score: 4.12
NoneNo IPR availablePANTHERPTHR13683:SF354ASPARTYL PROTEASE FAMILY PROTEINcoord: 4..457
score: 4.6E