CSPI03G17800 (gene) Wild cucumber (PI 183967)

NameCSPI03G17800
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionEukaryotic aspartyl protease family protein
LocationChr3 : 13365470 .. 13366942 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CATCAGCAACCCCCTCCCCACAATCAACAATGCTTCTGATTCTCTTCTCTCTCTCCTTATTCACTCTCTCCTTCTCTCAATCCAACTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCTCTGGAAAACCCTCCAATACTATCCCATCATACTCTTCCCAGCTTTACGCCAAGAGGCCATCCTCCTACGGCTCCTTCAAGCTTCCTTTCAAATACTCCTCCACTGCCCTCGTCGTCTCTCTACCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAAATCCCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGCCTCCACCGAAAACAGGGGTATTTTGGGAATGAACCGTGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGTCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCTGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGCTAAACATCCCCCCAGCCGCTTTCAAACCGGACGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAAGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATACGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTCACGGCGGAGGTGGGCCGCAGGATTGGCGGCATCTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGATCGGACGGTCAGAAAGGCTTGGGATTGGGAGTAATATAATCGGCACTGTTCATCAACAGAATATGTGGGTGGAGTATGATCTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGATGATGGGCGGTAAAGATTTATACACGTGTGTGGTTTTTGGATGTTTATATAATCATATTTGATATTGTGTTATTGTGTAAATGTGTGTATAGTTATTTCATTTCATATATATACATTCTATATCAATAAAACAAATTTTGTTCTATCCTGT

mRNA sequence

ATGCTTCTGATTCTCTTCTCTCTCTCCTTATTCACTCTCTCCTTCTCTCAATCCAACTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCTCTGGAAAACCCTCCAATACTATCCCATCATACTCTTCCCAGCTTTACGCCAAGAGGCCATCCTCCTACGGCTCCTTCAAGCTTCCTTTCAAATACTCCTCCACTGCCCTCGTCGTCTCTCTACCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAAATCCCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGCCTCCACCGAAAACAGGGGTATTTTGGGAATGAACCGTGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGTCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCTGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGCTAAACATCCCCCCAGCCGCTTTCAAACCGGACGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAAGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATACGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTCACGGCGGAGGTGGGCCGCAGGATTGGCGGCATCTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGATCGGACGGTCAGAAAGGCTTGGGATTGGGAGTAATATAATCGGCACTGTTCATCAACAGAATATGTGGGTGGAGTATGATCTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGA

Coding sequence (CDS)

ATGCTTCTGATTCTCTTCTCTCTCTCCTTATTCACTCTCTCCTTCTCTCAATCCAACTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCTCTGGAAAACCCTCCAATACTATCCCATCATACTCTTCCCAGCTTTACGCCAAGAGGCCATCCTCCTACGGCTCCTTCAAGCTTCCTTTCAAATACTCCTCCACTGCCCTCGTCGTCTCTCTACCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAAATCCCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGCCTCCACCGAAAACAGGGGTATTTTGGGAATGAACCGTGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGTCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCTGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGCTAAACATCCCCCCAGCCGCTTTCAAACCGGACGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAAGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATACGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTCACGGCGGAGGTGGGCCGCAGGATTGGCGGCATCTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGATCGGACGGTCAGAAAGGCTTGGGATTGGGAGTAATATAATCGGCACTGTTCATCAACAGAATATGTGGGTGGAGTATGATCTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGA
BLAST of CSPI03G17800 vs. Swiss-Prot
Match: PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 1.7e-62
Identity = 160/446 (35.87%), Postives = 231/446 (51.79%), Query Frame = 1

Query: 6   FSLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSYSSQLYAKRPSSYGSFKLPFKYSSTA 65
           FS S F+   S S+S +L  PL     P++            RP+     KL F ++ T 
Sbjct: 32  FSFSSFS---SSSSSQTLVLPLKTRITPTD-----------HRPTD----KLHFHHNVT- 91

Query: 66  LVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLL 125
           L V+L +GTPPQ   +V+DTGS+LSW++C+           P P    FDP+ SSS+S +
Sbjct: 92  LTVTLTVGTPPQNISMVIDTGSELSWLRCNRSS-------NPNP-VNNFDPTRSSSYSPI 151

Query: 126 PCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVIL 185
           PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F  S +   +I 
Sbjct: 152 PCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIF 211

Query: 186 GC--------AQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDN 245
           GC         +  T+  G+LGMNRG LSFISQ    KFSYC+ S T   P G   LGD 
Sbjct: 212 GCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-SGTDDFP-GFLLLGD- 271

Query: 246 PNSSKFKYVTMLTFPE----SQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 305
              S F ++T L +      S   P  D +AYT+ +  IK+ GK L IP +   PD  G+
Sbjct: 272 ---SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGA 331

Query: 306 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMM----KKGYVYAAVADMCF---DAGVT 365
           GQTM+DSG+  T+L+   Y  ++   +     ++       +V+    D+C+      + 
Sbjct: 332 GQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIR 391

Query: 366 AEVGRRIGGISFEFDNGVEIFVGRGEGVLTEV------EKGVKCVGIGRSERLGIGSNII 425
           + +  R+  +S  F+ G EI V  G+ +L  V         V C   G S+ +G+ + +I
Sbjct: 392 SGILHRLPTVSLVFE-GAEIAVS-GQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVI 442

Query: 426 GTVHQQNMWVEYDLANKRVGFGGAEC 427
           G  HQQNMW+E+DL   R+G    EC
Sbjct: 452 GHHHQQNMWIEFDLQRSRIGLAPVEC 442

BLAST of CSPI03G17800 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 1.6e-41
Identity = 118/367 (32.15%), Postives = 174/367 (47.41%), Query Frame = 1

Query: 67  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLP 126
           +++L IGTP QP   ++DTGS L W QC                T  F+P  SSSFS LP
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQP------CTQCFNQSTPIFNPQGSSSFSTLP 155

Query: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILG 186
           C+  +C+       L +    N  C Y+Y Y DG+  +G++  E  TF  S+S P +  G
Sbjct: 156 CSSQLCQ------ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFG-SVSIPNITFG 215

Query: 187 CAQAST-----ENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSS 246
           C + +         G++GM RG LS  SQ  ++KFSYC+     S P+ L  LG   NS 
Sbjct: 216 CGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLL-LGSLANSV 275

Query: 247 KFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDA-GGSGQTMIDS 306
                       SQ      P  Y + +  + +   RL I P+AF  ++  G+G  +IDS
Sbjct: 276 TAGSPNTTLIQSSQI-----PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDS 335

Query: 307 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGISFEF 366
           G+ LTY V+ AY+ V++E +  +   +  G   ++  D+CF    +     +I      F
Sbjct: 336 GTTLTYFVNNAYQSVRQEFISQINLPVVNG--SSSGFDLCFQT-PSDPSNLQIPTFVMHF 395

Query: 367 DNG-VEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 426
           D G +E+     E        G+ C+ +G S +   G +I G + QQNM V YD  N  V
Sbjct: 396 DGGDLEL---PSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVV 434

BLAST of CSPI03G17800 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.0e-38
Identity = 120/373 (32.17%), Postives = 174/373 (46.65%), Query Frame = 1

Query: 67  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLP 126
           ++++ IGTP      ++DTGS L W QC         P      T  F+P  SSSFS LP
Sbjct: 97  LMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQP------TPIFNPQDSSSFSTLP 156

Query: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILG 186
           C    C+       LP+    N  C Y+Y Y DG+  +G +  E FTF  S S P +  G
Sbjct: 157 CESQYCQD------LPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIAFG 216

Query: 187 CAQAST-----ENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSS 246
           C + +         G++GM  G LS  SQ  + +FSYC+ S   S+P+ L  LG   +  
Sbjct: 217 CGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLA-LGSAASG- 276

Query: 247 KFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQT 306
                     PE   S      +L+P  Y + ++ I + G  L IP + F+    G+G  
Sbjct: 277 ---------VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGM 336

Query: 307 MIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDA---GVTAEVGRRI 366
           +IDSG+ LTYL  +AY  V +     +   +      ++    CF     G T +V    
Sbjct: 337 IIDSGTTLTYLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTCFQQPSDGSTVQVPE-- 396

Query: 367 GGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYD 426
             IS +FD GV + +G  + +L    +GV C+ +G S +LGI  +I G + QQ   V YD
Sbjct: 397 --ISMQFDGGV-LNLGE-QNILISPAEGVICLAMGSSSQLGI--SIFGNIQQQETQVLYD 435

BLAST of CSPI03G17800 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 5.3e-32
Identity = 99/363 (27.27%), Postives = 162/363 (44.63%), Query Frame = 1

Query: 72  IGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPI 131
           +GTP +   LVLDTGS ++WIQC             +     F+P+ SS++  L C+ P 
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEP------CADCYQQSDPVFNPTSSSTYKSLTCSAPQ 227

Query: 132 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGCAQAS 191
           C        L TS  ++  C Y   Y DG+   G L  +  TF  S     V LGC   +
Sbjct: 228 CS------LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDN 287

Query: 192 ----TENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKYVT 251
               T   G+LG+  G LS  +Q K + FSYC+  R            D+  SS   + +
Sbjct: 288 EGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDR------------DSGKSSSLDFNS 347

Query: 252 MLTFPESQSSPNLD----PLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSD 311
           +       ++P L        Y + +    + G+++ +P A F  DA GSG  ++D G+ 
Sbjct: 348 VQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTA 407

Query: 312 LTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGISFEFDNG 371
           +T L  +AY  +++  ++L    +KKG    ++ D C+D    + V  ++  ++F F  G
Sbjct: 408 VTRLQTQAYNSLRDAFLKLT-VNLKKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGG 467

Query: 372 VEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVGFGG 427
             + +     ++   + G  C     +       +IIG V QQ   + YDL+   +G  G
Sbjct: 468 KSLDLPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSG 500

BLAST of CSPI03G17800 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 1.1e-29
Identity = 112/393 (28.50%), Postives = 180/393 (45.80%), Query Frame = 1

Query: 45  YAKRPSSYGSFKLP-FKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKR 104
           +A RP  + S  +      S      L +GTP +   +VLDTGS + W+QC   ++   +
Sbjct: 120 HAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQ 179

Query: 105 LPPLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTL 164
             P+       FDP  S +++ +PC+ P C+ R+      T   + + C Y   Y DG+ 
Sbjct: 180 SDPI-------FDPRKSKTYATIPCSSPHCR-RLDSAGCNT---RRKTCLYQVSYGDGSF 239

Query: 165 AEGNLVREKFTFSKSLSTPPVILGCAQAS----TENRGILGMNRGRLSFISQAK---ISK 224
             G+   E  TF ++     V LGC   +        G+LG+ +G+LSF  Q       K
Sbjct: 240 TVGDFSTETLTFRRN-RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQK 299

Query: 225 FSYCVPSRTGSNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIA 284
           FSYC+  R+ S+       G+   S   ++  +L      S+P LD   Y + +  I + 
Sbjct: 300 FSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLL------SNPKLDTF-YYVGLLGISVG 359

Query: 285 GKRL-NIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYA 344
           G R+  +  + FK D  G+G  +IDSG+ +T L+  AY  +++     VGA   K     
Sbjct: 360 GTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVGAKTLKRAPDF 419

Query: 345 AVADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLG 404
           ++ D CFD     EV  ++  +   F  G ++ +     ++     G  C     +  +G
Sbjct: 420 SLFDTCFDLSNMNEV--KVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGT--MG 479

Query: 405 IGSNIIGTVHQQNMWVEYDLANKRVGFGGAECS 428
            G +IIG + QQ   V YDLA+ RVGF    C+
Sbjct: 480 -GLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CSPI03G17800 vs. TrEMBL
Match: Q9FGI3_ARATH (AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1)

HSP 1 Score: 551.2 bits (1419), Expect = 1.1e-153
Identity = 282/436 (64.68%), Postives = 338/436 (77.52%), Query Frame = 1

Query: 2   LLILFSLSLFTLSFSQSNSLSLPFPL-SLSGKPSNTIPSYSSQLYAKR----PSSYGSFK 61
           LL +F    +++S S S+SLSL FPL SL   P+    S+ + L ++R    PSS  +F+
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDP 121
              KYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FDP
Sbjct: 72  SNIKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFDP 131

Query: 122 SLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSK 181
           SLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFS 
Sbjct: 132 SLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSN 191

Query: 182 SLSTPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRT---GSNPTGLFY 241
           S +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG FY
Sbjct: 192 SQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFY 251

Query: 242 LGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 301
           LGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + F+PDAGGS
Sbjct: 252 LGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGS 311

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   + E+GR I
Sbjct: 312 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 371

Query: 362 GGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYD 421
           G + FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D
Sbjct: 372 GDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 431

Query: 422 LANKRVGFGGAECSRL 430
           + N+RVGF  AEC  L
Sbjct: 432 VTNRRVGFSKAECRLL 441

BLAST of CSPI03G17800 vs. TrEMBL
Match: D7MID8_ARALL (Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_493732 PE=3 SV=1)

HSP 1 Score: 546.2 bits (1406), Expect = 3.6e-152
Identity = 283/437 (64.76%), Postives = 336/437 (76.89%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPL-SLSGKPSNTIPSYSSQLYAKRPSSYGS----F 60
           +L I F     ++S S S+SLSL FPL SL   P+    S+ + L ++R  S  S    F
Sbjct: 12  LLYIFFFFFCNSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPSSSPYTF 71

Query: 61  KLPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFD 120
           +  FKYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FD
Sbjct: 72  RSNFKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFD 131

Query: 121 PSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS 180
           PSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFS
Sbjct: 132 PSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFS 191

Query: 181 KSLSTPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRT---GSNPTGLF 240
            S +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG F
Sbjct: 192 NSQTTPPLILGCAKESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSF 251

Query: 241 YLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGG 300
           YLG+NPNS  FKYV++LTFP+SQ  PNLDPLAYT+P+  I+I  KRLNIP + F+PDAGG
Sbjct: 252 YLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGG 311

Query: 301 SGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRR 360
           SGQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD      +GR 
Sbjct: 312 SGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRL 371

Query: 361 IGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEY 420
           IG + FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+
Sbjct: 372 IGDLVFEFGRGVEILVEK-QRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEF 431

Query: 421 DLANKRVGFGGAECSRL 430
           D+AN+RVGF  AECSRL
Sbjct: 432 DVANRRVGFSKAECSRL 442

BLAST of CSPI03G17800 vs. TrEMBL
Match: A0A061EL58_THECC (Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=3 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 8.8e-151
Identity = 284/428 (66.36%), Postives = 334/428 (78.04%), Query Frame = 1

Query: 4   ILFSLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSYSSQLYAKRPSSYGSFKLPFKYSS 63
           I FS  L +L FS+ N  +L   L +S KP++T+          RPSSY ++K  FKYS 
Sbjct: 35  ISFSFPLTSLRFSRDNVQTLYRSL-VSTKPNSTVQP--------RPSSY-NYKTTFKYSM 94

Query: 64  TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFS 123
            AL+V+LPIGTPPQ   +VLDTGSQLSWIQCH KKV ++ PP P     +FDPSLSSSFS
Sbjct: 95  -ALIVALPIGTPPQTQQMVLDTGSQLSWIQCH-KKVARKPPPPP----TSFDPSLSSSFS 154

Query: 124 LLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPV 183
           +LPC HP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS+S STPP+
Sbjct: 155 VLPCTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRSQSTPPL 214

Query: 184 ILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSR---TGSNPTGLFYLGDNPNS 243
           ILGCA  ++E++GILGMN GRLSF SQAKISKFSYCVP+R    G +PTG FYLG+NP+S
Sbjct: 215 ILGCATDTSEDKGILGMNLGRLSFASQAKISKFSYCVPTRRTQPGFSPTGSFYLGENPSS 274

Query: 244 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 303
             F+YV ++ FPES + PN+DPLAYTLPM+ I+I  K+L IP + F+PDAGGSGQTMIDS
Sbjct: 275 RGFQYVNLMIFPESGTRPNMDPLAYTLPMQGIRIGAKKLPIPTSVFRPDAGGSGQTMIDS 334

Query: 304 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGISFEF 363
           GS+ TYLVD+AY KV+EEVVRLVG  +KKGYVY  VADMCFD G   E+GR IG +  EF
Sbjct: 335 GSEFTYLVDDAYNKVREEVVRLVGPRIKKGYVYGGVADMCFD-GNPIEIGRLIGDMVLEF 394

Query: 364 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 423
           + GVEI V + E VL +VE GV C+GIGRS  LG  SNIIG  HQQN+WVEYDL N+RVG
Sbjct: 395 EKGVEITVEK-ERVLADVEGGVHCLGIGRSSMLGAASNIIGNFHQQNLWVEYDLVNRRVG 444

Query: 424 FGGAECSR 429
           FG A+CSR
Sbjct: 455 FGKADCSR 444

BLAST of CSPI03G17800 vs. TrEMBL
Match: A0A0R0L0W4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_02G243600 PE=3 SV=1)

HSP 1 Score: 540.8 bits (1392), Expect = 1.5e-150
Identity = 270/407 (66.34%), Postives = 321/407 (78.87%), Query Frame = 1

Query: 26  PLSLSGKPSNTIPSYSSQLYAKRPSSYGSFKLPFKYSSTALVVSLPIGTPPQPTDLVLDT 85
           PLS  GKP N  P+    L     SS  + K  FKYS  ALVV+LPIGTPPQ   +VLDT
Sbjct: 4   PLSPKGKPLNRNPN----LRTLSSSSSYNIKSSFKYSM-ALVVTLPIGTPPQHQQMVLDT 63

Query: 86  GSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSC 145
           GSQLSWIQCH+K           P TA+FDPSLSSSF +LPC HP+CKPR+PDFTLPT+C
Sbjct: 64  GSQLSWIQCHNKT----------PPTASFDPSLSSSFYILPCTHPLCKPRVPDFTLPTTC 123

Query: 146 DQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGCAQASTENRGILGMNRGRL 205
           DQNRLCHYSYFYADGT AEGNLVREK TFS S +TPP+ILGCA  S++ RGILGMN GRL
Sbjct: 124 DQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLILGCATESSDARGILGMNLGRL 183

Query: 206 SFISQAKISKFSYCVPSRTGSN----PTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLD 265
           SF SQAK++KFSYCVP+R  +N    PTG FYLG+NPNS++F+YV+MLTFP+SQ  PNLD
Sbjct: 184 SFPSQAKVTKFSYCVPTRQAANDNNLPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLD 243

Query: 266 PLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVR 325
           PLAYT+PM+ I+I GK+LNIPP+ F+P+AGGSGQTM+DSGS+ T+LVD AY+ V+EEV+R
Sbjct: 244 PLAYTVPMQGIRIGGKKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDAAYDAVREEVIR 303

Query: 326 LVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKG 385
           +VG  +KKGYVY  VADMCFD G   E+GR IG ++FEF+ GVEI V + E VL +V  G
Sbjct: 304 VVGPRVKKGYVYGGVADMCFD-GSVMEIGRLIGDVAFEFEKGVEIVVPK-ERVLADVGGG 363

Query: 386 VKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSR 429
           V C+GIGRSERLG  SNIIG  HQQN+WVE+DLAN+R+GFG A+CSR
Sbjct: 364 VHCLGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCSR 393

BLAST of CSPI03G17800 vs. TrEMBL
Match: I1MBU0_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_14G212700 PE=3 SV=1)

HSP 1 Score: 537.0 bits (1382), Expect = 2.2e-149
Identity = 278/446 (62.33%), Postives = 339/446 (76.01%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNS---------LSLPFPLSL----SGKPSNTIPSYSSQLYAK 60
           + ++LFS   F  SFS S+S         LSL FPL+     + KP NT P    +L   
Sbjct: 16  LCMLLFSFFFF-FSFSTSSSAKLNPTTDSLSLSFPLTSLPLSTAKPLNTNP----KLRTL 75

Query: 61  RPSSYGSFKLPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLP 120
             SS  + K  FKYS  ALVV+LPIGTPPQP  +VLDTGSQLSWIQCH+K          
Sbjct: 76  SSSSSYNIKSSFKYSM-ALVVTLPIGTPPQPQQMVLDTGSQLSWIQCHNKT--------- 135

Query: 121 KPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNL 180
            P TA+FDPSLSSSF +LPC HP+CKPR+PDFTLPT+CDQNRLCHYSYFYADGT AEGNL
Sbjct: 136 -PPTASFDPSLSSSFYVLPCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNL 195

Query: 181 VREKFTFSKSLSTPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSN 240
           VREK  FS S +TPP+ILGC+  S + RGILGMN GRLSF  QAK++KFSYCVP+R  +N
Sbjct: 196 VREKLAFSPSQTTPPLILGCSSESRDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPAN 255

Query: 241 ----PTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPP 300
               PTG FYLG+NPNS++F+YV+MLTFP+SQ  PNLDPLAYT+PM+ I+I G++LNIPP
Sbjct: 256 NNNFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPP 315

Query: 301 AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDA 360
           + F+P+AGGSGQTM+DSGS+ T+LVD AY++V+EE++R++G  +KKGYVY  VADMCFD 
Sbjct: 316 SVFRPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFD- 375

Query: 361 GVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTV 420
           G   E+GR +G ++FEF+ GVEI V + E VL +V  GV CVGIGRSERLG  SNIIG  
Sbjct: 376 GNAMEIGRLLGDVAFEFEKGVEIVVPK-ERVLADVGGGVHCVGIGRSERLGAASNIIGNF 435

Query: 421 HQQNMWVEYDLANKRVGFGGAECSRL 430
           HQQN+WVE+DLAN+R+GFG A+CSRL
Sbjct: 436 HQQNLWVEFDLANRRIGFGVADCSRL 443

BLAST of CSPI03G17800 vs. TAIR10
Match: AT5G37540.1 (AT5G37540.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 551.2 bits (1419), Expect = 5.6e-157
Identity = 282/436 (64.68%), Postives = 338/436 (77.52%), Query Frame = 1

Query: 2   LLILFSLSLFTLSFSQSNSLSLPFPL-SLSGKPSNTIPSYSSQLYAKR----PSSYGSFK 61
           LL +F    +++S S S+SLSL FPL SL   P+    S+ + L ++R    PSS  +F+
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDP 121
              KYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FDP
Sbjct: 72  SNIKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFDP 131

Query: 122 SLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSK 181
           SLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFS 
Sbjct: 132 SLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSN 191

Query: 182 SLSTPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRT---GSNPTGLFY 241
           S +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG FY
Sbjct: 192 SQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFY 251

Query: 242 LGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 301
           LGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + F+PDAGGS
Sbjct: 252 LGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGS 311

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   + E+GR I
Sbjct: 312 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 371

Query: 362 GGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYD 421
           G + FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D
Sbjct: 372 GDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 431

Query: 422 LANKRVGFGGAECSRL 430
           + N+RVGF  AEC  L
Sbjct: 432 VTNRRVGFSKAECRLL 441

BLAST of CSPI03G17800 vs. TAIR10
Match: AT1G66180.1 (AT1G66180.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 512.3 bits (1318), Expect = 2.9e-145
Identity = 270/432 (62.50%), Postives = 319/432 (73.84%), Query Frame = 1

Query: 4   ILFSLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSYSSQLYAKRPSSYG---SFKLPFK 63
           + F   L  +S S S SL LP         +N+    +S L  K PS      +F+  FK
Sbjct: 8   LFFFFFLNYVSLSTSLSLHLPLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNFRSRFK 67

Query: 64  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSS 123
           YS  AL++SLPIGTPPQ   +VLDTGSQLSWIQCH    +K+LPP  KPKT+ FDPSLSS
Sbjct: 68  YSM-ALIISLPIGTPPQAQQMVLDTGSQLSWIQCH----RKKLPP--KPKTS-FDPSLSS 127

Query: 124 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLST 183
           SFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EK TFS +  T
Sbjct: 128 SFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEIT 187

Query: 184 PPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVP---SRTGSNPTGLFYLGDN 243
           PP+ILGCA  S+++RGILGMNRGRLSF+SQAKISKFSYC+P   +R G  PTG FYLGDN
Sbjct: 188 PPLILGCATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDN 247

Query: 244 PNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTM 303
           PNS  FKYV++LTFPESQ  PNLDPLAYT+PM  I+   K+LNI  + F+PDAGGSGQTM
Sbjct: 248 PNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTM 307

Query: 304 IDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGIS 363
           +DSGS+ T+LVD AY+KV+ E++  VG  +KKGYVY   ADMCFD  V A + R IG + 
Sbjct: 308 VDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNV-AMIPRLIGDLV 367

Query: 364 FEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANK 423
           F F  GVEI V + E VL  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D+ N+
Sbjct: 368 FVFTRGVEILVPK-ERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNR 427

Query: 424 RVGFGGAECSRL 430
           RVGF  A+CSR+
Sbjct: 428 RVGFAKADCSRV 429

BLAST of CSPI03G17800 vs. TAIR10
Match: AT2G39710.1 (AT2G39710.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 242.7 bits (618), Expect = 4.3e-64
Identity = 158/439 (35.99%), Postives = 241/439 (54.90%), Query Frame = 1

Query: 7   SLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSYSSQLYAKRPSSYGSFKLPFKYSSTAL 66
           SLSL + +F + + L L FPL+   K S+T  +    L  ++     S KL F+++ T L
Sbjct: 9   SLSL-SKNFLRISVLLLIFPLTFC-KTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVT-L 68

Query: 67  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLP 126
            V+L +G PPQ   +VLDTGS+LSW+ C      K+ P L     + F+P  SS++S +P
Sbjct: 69  TVTLAVGDPPQNISMVLDTGSELSWLHC------KKSPNLG----SVFNPVSSSTYSPVP 128

Query: 127 CNHPICKPRIPDFTLPTSCD-QNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVIL 186
           C+ PIC+ R  D  +P SCD +  LCH +  YAD T  EGNL  E F    S++ P  + 
Sbjct: 129 CSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLF 188

Query: 187 GC--------AQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDN 246
           GC        ++   ++ G++GMNRG LSF++Q   SKFSYC+   +GS+ +G   LGD 
Sbjct: 189 GCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI---SGSDSSGFLLLGDA 248

Query: 247 PNS--SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQ 306
             S     +Y  ++   +S   P  D +AYT+ ++ I++  K L++P + F PD  G+GQ
Sbjct: 249 SYSWLGPIQYTPLVL--QSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQ 308

Query: 307 TMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK----KGYVYAAVADMCFDAGVTAEVG- 366
           TM+DSG+  T+L+   Y  +K E +    ++++      +V+    D+C+  G T     
Sbjct: 309 TMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNF 368

Query: 367 RRIGGISFEFDNGVEIFVG------RGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVH 424
             +  +S  F  G E+ V       R  G  +E ++ V C   G S+ LGI + +IG  H
Sbjct: 369 SGLPMVSLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHH 427

BLAST of CSPI03G17800 vs. TAIR10
Match: AT5G02190.1 (AT5G02190.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 241.5 bits (615), Expect = 9.5e-64
Identity = 160/446 (35.87%), Postives = 231/446 (51.79%), Query Frame = 1

Query: 6   FSLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSYSSQLYAKRPSSYGSFKLPFKYSSTA 65
           FS S F+   S S+S +L  PL     P++            RP+     KL F ++ T 
Sbjct: 32  FSFSSFS---SSSSSQTLVLPLKTRITPTD-----------HRPTD----KLHFHHNVT- 91

Query: 66  LVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLL 125
           L V+L +GTPPQ   +V+DTGS+LSW++C+           P P    FDP+ SSS+S +
Sbjct: 92  LTVTLTVGTPPQNISMVIDTGSELSWLRCNRSS-------NPNP-VNNFDPTRSSSYSPI 151

Query: 126 PCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVIL 185
           PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F  S +   +I 
Sbjct: 152 PCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIF 211

Query: 186 GC--------AQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDN 245
           GC         +  T+  G+LGMNRG LSFISQ    KFSYC+ S T   P G   LGD 
Sbjct: 212 GCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-SGTDDFP-GFLLLGD- 271

Query: 246 PNSSKFKYVTMLTFPE----SQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 305
              S F ++T L +      S   P  D +AYT+ +  IK+ GK L IP +   PD  G+
Sbjct: 272 ---SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGA 331

Query: 306 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMM----KKGYVYAAVADMCF---DAGVT 365
           GQTM+DSG+  T+L+   Y  ++   +     ++       +V+    D+C+      + 
Sbjct: 332 GQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIR 391

Query: 366 AEVGRRIGGISFEFDNGVEIFVGRGEGVLTEV------EKGVKCVGIGRSERLGIGSNII 425
           + +  R+  +S  F+ G EI V  G+ +L  V         V C   G S+ +G+ + +I
Sbjct: 392 SGILHRLPTVSLVFE-GAEIAVS-GQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVI 442

Query: 426 GTVHQQNMWVEYDLANKRVGFGGAEC 427
           G  HQQNMW+E+DL   R+G    EC
Sbjct: 452 GHHHQQNMWIEFDLQRSRIGLAPVEC 442

BLAST of CSPI03G17800 vs. TAIR10
Match: AT2G03200.1 (AT2G03200.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 154.1 bits (388), Expect = 2.0e-37
Identity = 119/398 (29.90%), Postives = 183/398 (45.98%), Query Frame = 1

Query: 46  AKRPSSYGSFKLPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPP 105
           A +P    + K P    S   ++ L IG P      ++DTGS L W QC     K     
Sbjct: 87  ASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQC-----KPCTEC 146

Query: 106 LPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTS-CDQNR-LCHYSYFYADGTLA 165
             +P T  FDP  SSS+S + C+  +C        LP S C++++  C Y Y Y D +  
Sbjct: 147 FDQP-TPIFDPEKSSSYSKVGCSSGLCN------ALPRSNCNEDKDACEYLYTYGDYSST 206

Query: 166 EGNLVREKFTFSKSLSTPPVILGCAQAS-----TENRGILGMNRGRLSFISQAKISKFSY 225
            G L  E FTF    S   +  GC   +     ++  G++G+ RG LS ISQ K +KFSY
Sbjct: 207 RGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSY 266

Query: 226 CVPSRTGSNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNL----DPLAYTLPMKAIKI 285
           C+ S   S  +   ++G   +    K    L    +++   L     P  Y L ++ I +
Sbjct: 267 CLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITV 326

Query: 286 AGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYA 345
             KRL++  + F+    G+G  +IDSG+ +TYL + A++ +KEE    +   +      +
Sbjct: 327 GAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDS--GS 386

Query: 346 AVADMCF---DAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSE 405
              D+CF   DA     V + I    F F  G ++ +     ++ +   GV C+ +G S 
Sbjct: 387 TGLDLCFKLPDAAKNIAVPKMI----FHF-KGADLELPGENYMVADSSTGVLCLAMGSSN 446

Query: 406 RLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRL 430
               G +I G V QQN  V +DL  + V F   EC +L
Sbjct: 447 ----GMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

BLAST of CSPI03G17800 vs. NCBI nr
Match: gi|778679913|ref|XP_004140731.2| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 853.6 bits (2204), Expect = 1.5e-244
Identity = 427/430 (99.30%), Postives = 428/430 (99.53%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSYSSQLYAKRPSSYGSFKLPFK 60
           MLLILFSLSLFTLSFSQSNSLSLPFPLSLS KPSNTIPSYSSQLYAKRPSSYGSFKLPFK
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSEKPSNTIPSYSSQLYAKRPSSYGSFKLPFK 60

Query: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSS 120
           YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTA+FDPSLSS
Sbjct: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSS 120

Query: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLST 180
           SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLST
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLST 180

Query: 181 PPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNS 240
           PPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNS
Sbjct: 181 PPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNS 240

Query: 241 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 300
           SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS
Sbjct: 241 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 300

Query: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGISFEF 360
           GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYA VADMCFDAGVTAEVGRRIGGISFEF
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEF 360

Query: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420
           DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG
Sbjct: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420

Query: 421 FGGAECSRLK 431
           FGGAECSRLK
Sbjct: 421 FGGAECSRLK 430

BLAST of CSPI03G17800 vs. NCBI nr
Match: gi|778679910|ref|XP_011651212.1| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 822.0 bits (2122), Expect = 4.8e-235
Identity = 412/431 (95.59%), Postives = 418/431 (96.98%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSY-SSQLYAKRPSSYGSFKLPF 60
           MLLILFSLSLFTLSFSQSNSLSLPFPLSL+ KPSN  P Y SSQLY K+PSS+G FKLPF
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60

Query: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSS+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS SLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180

Query: 181 TPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240
           TPPVILGCAQ STENRGILGMN GRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGISFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVT EVGRRIG +SFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS RLGIGSNIIGTVHQQNMWVEYDLANKRV
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420

Query: 421 GFGGAECSRLK 431
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 431

BLAST of CSPI03G17800 vs. NCBI nr
Match: gi|659114575|ref|XP_008457122.1| (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])

HSP 1 Score: 807.4 bits (2084), Expect = 1.2e-230
Identity = 403/430 (93.72%), Postives = 414/430 (96.28%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSGKPSNTIPSYSSQLYAKRPSSYGSFKLPFK 60
           MLLILFSLSLFTL FSQSNS+SLPFPLSLS KPSN  P Y SQLYAK+PSS+GSFKLPFK
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKPSNISPIYGSQLYAKKPSSHGSFKLPFK 60

Query: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSS 120
           YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD KVKK+LPPLPKPKTA+FDPSLSS
Sbjct: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KVKKKLPPLPKPKTASFDPSLSS 120

Query: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLST 180
           SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ S SLST
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSLST 180

Query: 181 PPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNS 240
           PPVILGCAQASTENRGILGMN+GRLSFISQAKISKFSYCVP+RTGSNPTGLFYLGDNPNS
Sbjct: 181 PPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNPNS 240

Query: 241 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 300
           S+FKYVTMLTFPESQSSPNLDPLAYTLPMK IKIAGKRLNI PAAFKPDAGGSGQTMIDS
Sbjct: 241 SRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMIDS 300

Query: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRIGGISFEF 360
           GSDLTYLVDEAYEKVKEEVVRLVGA MKKGYVYAAVADMCFDA VTAEVGRRIGGISFEF
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISFEF 360

Query: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420
           DNGVEI VGRGEGVLTEVEKGVKCVG GRSERLGIGSNIIGTVHQQNMWVEYDL N+R+G
Sbjct: 361 DNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRRIG 420

Query: 421 FGGAECSRLK 431
           FGGAECSRLK
Sbjct: 421 FGGAECSRLK 429

BLAST of CSPI03G17800 vs. NCBI nr
Match: gi|18421660|ref|NP_568551.1| (aspartyl protease family protein [Arabidopsis thaliana])

HSP 1 Score: 551.2 bits (1419), Expect = 1.6e-153
Identity = 282/436 (64.68%), Postives = 338/436 (77.52%), Query Frame = 1

Query: 2   LLILFSLSLFTLSFSQSNSLSLPFPL-SLSGKPSNTIPSYSSQLYAKR----PSSYGSFK 61
           LL +F    +++S S S+SLSL FPL SL   P+    S+ + L ++R    PSS  +F+
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDP 121
              KYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FDP
Sbjct: 72  SNIKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFDP 131

Query: 122 SLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSK 181
           SLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFS 
Sbjct: 132 SLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSN 191

Query: 182 SLSTPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRT---GSNPTGLFY 241
           S +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG FY
Sbjct: 192 SQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFY 251

Query: 242 LGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 301
           LGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + F+PDAGGS
Sbjct: 252 LGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGS 311

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   + E+GR I
Sbjct: 312 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 371

Query: 362 GGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYD 421
           G + FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D
Sbjct: 372 GDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 431

Query: 422 LANKRVGFGGAECSRL 430
           + N+RVGF  AEC  L
Sbjct: 432 VTNRRVGFSKAECRLL 441

BLAST of CSPI03G17800 vs. NCBI nr
Match: gi|297801286|ref|XP_002868527.1| (aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata])

HSP 1 Score: 546.2 bits (1406), Expect = 5.1e-152
Identity = 283/437 (64.76%), Postives = 336/437 (76.89%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPL-SLSGKPSNTIPSYSSQLYAKRPSSYGS----F 60
           +L I F     ++S S S+SLSL FPL SL   P+    S+ + L ++R  S  S    F
Sbjct: 12  LLYIFFFFFCNSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPSSSPYTF 71

Query: 61  KLPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFD 120
           +  FKYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FD
Sbjct: 72  RSNFKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFD 131

Query: 121 PSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS 180
           PSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFS
Sbjct: 132 PSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFS 191

Query: 181 KSLSTPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRT---GSNPTGLF 240
            S +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG F
Sbjct: 192 NSQTTPPLILGCAKESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSF 251

Query: 241 YLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGG 300
           YLG+NPNS  FKYV++LTFP+SQ  PNLDPLAYT+P+  I+I  KRLNIP + F+PDAGG
Sbjct: 252 YLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGG 311

Query: 301 SGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTAEVGRR 360
           SGQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD      +GR 
Sbjct: 312 SGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRL 371

Query: 361 IGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEY 420
           IG + FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+
Sbjct: 372 IGDLVFEFGRGVEILVEK-QRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEF 431

Query: 421 DLANKRVGFGGAECSRL 430
           D+AN+RVGF  AECSRL
Sbjct: 432 DVANRRVGFSKAECSRL 442

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PCS1L_ARATH1.7e-6235.87Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1[more]
NEP1_NEPGR1.6e-4132.15Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
NEP2_NEPGR1.0e-3832.17Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
ASPG1_ARATH5.3e-3227.27Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
APF2_ARATH1.1e-2928.50Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
Q9FGI3_ARATH1.1e-15364.68AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1[more]
D7MID8_ARALL3.6e-15264.76Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA... [more]
A0A061EL58_THECC8.8e-15166.36Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=... [more]
A0A0R0L0W4_SOYBN1.5e-15066.34Uncharacterized protein OS=Glycine max GN=GLYMA_02G243600 PE=3 SV=1[more]
I1MBU0_SOYBN2.2e-14962.33Uncharacterized protein OS=Glycine max GN=GLYMA_14G212700 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G37540.15.6e-15764.68 Eukaryotic aspartyl protease family protein[more]
AT1G66180.12.9e-14562.50 Eukaryotic aspartyl protease family protein[more]
AT2G39710.14.3e-6435.99 Eukaryotic aspartyl protease family protein[more]
AT5G02190.19.5e-6435.87 Eukaryotic aspartyl protease family protein[more]
AT2G03200.12.0e-3729.90 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|778679913|ref|XP_004140731.2|1.5e-24499.30PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|778679910|ref|XP_011651212.1|4.8e-23595.59PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|659114575|ref|XP_008457122.1|1.2e-23093.72PREDICTED: aspartic proteinase PCS1 [Cucumis melo][more]
gi|18421660|ref|NP_568551.1|1.6e-15364.68aspartyl protease family protein [Arabidopsis thaliana][more]
gi|297801286|ref|XP_002868527.1|5.1e-15264.76aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G17800.1CSPI03G17800.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 398..413
score: 3.1E-5coord: 296..307
score: 3.1E-5coord: 72..92
score: 3.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 1..429
score: 4.1E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 81..92
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 67..236
score: 1.6E-31coord: 238..428
score: 9.6
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 63..428
score: 1.06
NoneNo IPR availablePANTHERPTHR13683:SF327ASPARTYL PROTEASE FAMILY PROTEINcoord: 1..429
score: 4.1E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CSPI03G17800CmoCh01G003510Cucurbita moschata (Rifu)cmocpiB449
The following gene(s) are paralogous to this gene:

None