CSPI03G17780 (gene) Wild cucumber (PI 183967)

Name: CSPI03G17780
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Eukaryotic aspartyl protease family protein
Location: Chr3 : 13350541 .. 13351999 (-)
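
The location field can be read as chromosome, start, end and strand. The sketch below parses it and computes the span length. The function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr3 : 13350541 .. 13351999 (-)' into its parts."""
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 13350541 .. 13351999 (-)")
print(chrom, strand, span_length(start, end))  # Chr3 - 1459
```

Under that assumption the feature spans 1459 bp on the minus strand of Chr3.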
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CCCCCCCAACACAATCAACAATGCTTCTAATTCTCTTCTCTCTCTCATTATTCACTCTCTCCTTCTCTCAATCCAATTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCACTGAAAAACCCTCCAATATTACCCCATTATACTACTCTTCCCAGCTTTACGTCAAAAAGCCATCATCCCATGGCCCCTTCAAGCTTCCTTTCAAATACTCCTCCTCTGCCCTCGTCGTCTCTCTTCCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAATTCTCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGGCTCCACCGAAGACAGGGGTATTTTGGGAATGAATCATGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGTCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCTGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGCTAAACATCCCCCCAGCCGCTTTCAAACCGGACGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAGGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATATGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTGACGGTGGAGGTGGGCCGCAGGATTGGCGACATGTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAGGTGGAAAAAGGAGTGAAGTGTGTGGGGATCGGACGGTCAGGAAGGCTTGGGATTGGAAGTAATATAATCGGTACCGTTCATCAACAGAATATGTGGGTGGAGTATGATTTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGATGATGGACAGTAAAGATTTATACACGTGTGTGGGTTTTGGAATGTTTATATAATCATATTTGATATTGTGTTATTGTGTAAATGTGTGTATAGTTATTTCATTTCACATATATACTTTATATATAAATAAAACAAAAAAATTT

mRNA sequence

ATGCTTCTAATTCTCTTCTCTCTCTCATTATTCACTCTCTCCTTCTCTCAATCCAATTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCACTGAAAAACCCTCCAATATTACCCCATTATACTACTCTTCCCAGCTTTACGTCAAAAAGCCATCATCCCATGGCCCCTTCAAGCTTCCTTTCAAATACTCCTCCTCTGCCCTCGTCGTCTCTCTTCCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAATTCTCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGGCTCCACCGAAGACAGGGGTATTTTGGGAATGAATCATGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGTCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCTGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGCTAAACATCCCCCCAGCCGCTTTCAAACCGGACGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAGGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATATGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTGACGGTGGAGGTGGGCCGCAGGATTGGCGACATGTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAGGTGGAAAAAGGAGTGAAGTGTGTGGGGATCGGACGGTCAGGAAGGCTTGGGATTGGAAGTAATATAATCGGTACCGTTCATCAACAGAATATGTGGGTGGAGTATGATTTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGA

Coding sequence (CDS)

ATGCTTCTAATTCTCTTCTCTCTCTCATTATTCACTCTCTCCTTCTCTCAATCCAATTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCACTGAAAAACCCTCCAATATTACCCCATTATACTACTCTTCCCAGCTTTACGTCAAAAAGCCATCATCCCATGGCCCCTTCAAGCTTCCTTTCAAATACTCCTCCTCTGCCCTCGTCGTCTCTCTTCCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAATTCTCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGGCTCCACCGAAGACAGGGGTATTTTGGGAATGAATCATGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGTCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCTGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGCTAAACATCCCCCCAGCCGCTTTCAAACCGGACGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAGGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATATGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTGACGGTGGAGGTGGGCCGCAGGATTGGCGACATGTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAGGTGGAAAAAGGAGTGAAGTGTGTGGGGATCGGACGGTCAGGAAGGCTTGGGATTGGAAGTAATATAATCGGTACCGTTCATCAACAGAATATGTGGGTGGAGTATGATTTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGA
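
As a quick consistency check, the start and end of the CDS can be translated and compared with the query protein shown in the BLAST alignments. This is a minimal sketch that assumes the standard nuclear genetic code. The `translate` helper is hypothetical and only the first 18 and last 12 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

print(translate("ATGCTTCTAATTCTCTTC"))  # first 18 nt of the CDS -> MLLILF
print(translate("AGATTGAAGTGA"))        # last 12 nt of the CDS  -> RLK*
```

The translated start (MLLILF) and end (RLK followed by a stop codon) agree with the query sequence in the alignments.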
BLAST of CSPI03G17780 vs. Swiss-Prot
Match: PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1)

HSP 1 Score: 240.4 bits (612), Expect = 3.8e-62
Identity = 159/452 (35.18%), Positives = 232/452 (51.33%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           +LL+L   +   +S S S+S S  F    +   S    L   +++    P+ H P     
Sbjct: 10  LLLVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRI---TPTDHRPTDKLH 69

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
            + +  L V+L +GTPPQ   +V+DTGS+LSW++C+           P P    FDP+ S
Sbjct: 70  FHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSN-------PNPVN-NFDPTRS 129

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SS+S +PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS +
Sbjct: 130 SSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN 189

Query: 181 TPPVILGC--------AQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGL 240
              +I GC         +  T+  G+LGMN G LSFISQ    KFSYC+ S T   P G 
Sbjct: 190 DSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-SGTDDFP-GF 249

Query: 241 FYLGDNPNSSKFKYVTMLTFPE----SQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFK 300
             LGD    S F ++T L +      S   P  D +AYT+ +  IK+ GK L IP +   
Sbjct: 250 LLLGD----SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLV 309

Query: 301 PDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMM----KKGYVYAAVADMCFD- 360
           PD  G+GQTM+DSG+  T+L+   Y  ++   +     ++       +V+    D+C+  
Sbjct: 310 PDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRI 369

Query: 361 AGVTVEVG--RRIGDMSFEFDNGVEIFVGRGEGVLTEV------EKGVKCVGIGRSGRLG 420
           + V +  G   R+  +S  F+ G EI V  G+ +L  V         V C   G S  +G
Sbjct: 370 SPVRIRSGILHRLPTVSLVFE-GAEIAVS-GQPLLYRVPHLTVGNDSVYCFTFGNSDLMG 429

Query: 421 IGSNIIGTVHQQNMWVEYDLANKRVGFGGAEC 428
           + + +IG  HQQNMW+E+DL   R+G    EC
Sbjct: 430 MEAYVIGHHHQQNMWIEFDLQRSRIGLAPVEC 442

BLAST of CSPI03G17780 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 9.7e-42
Identity = 118/367 (32.15%), Positives = 173/367 (47.14%), Query Frame = 1

Query: 68  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLP 127
           +++L IGTP QP   ++DTGS L W QC                T  F+P  SSSFS LP
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQP------CTQCFNQSTPIFNPQGSSSFSTLP 155

Query: 128 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILG 187
           C+  +C+       L +    N  C Y+Y Y DG+  +G++  E  TF  S+S P +  G
Sbjct: 156 CSSQLCQ------ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFG-SVSIPNITFG 215

Query: 188 CAQ-----GSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSS 247
           C +     G     G++GM  G LS  SQ  ++KFSYC+     S P+ L  LG   NS 
Sbjct: 216 CGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLL-LGSLANSV 275

Query: 248 KFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDA-GGSGQTMIDS 307
                       SQ      P  Y + +  + +   RL I P+AF  ++  G+G  +IDS
Sbjct: 276 TAGSPNTTLIQSSQI-----PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDS 335

Query: 308 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEF 367
           G+ LTY V+ AY+ V++E +  +   +  G   ++  D+CF    +     +I      F
Sbjct: 336 GTTLTYFVNNAYQSVRQEFISQINLPVVNG--SSSGFDLCFQT-PSDPSNLQIPTFVMHF 395

Query: 368 DNG-VEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 427
           D G +E+     E        G+ C+ +G S +   G +I G + QQNM V YD  N  V
Sbjct: 396 DGGDLEL---PSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVV 434

BLAST of CSPI03G17780 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 4.1e-40
Identity = 123/389 (31.62%), Positives = 179/389 (46.02%), Query Frame = 1

Query: 52  SHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPK 111
           S    + P        ++++ IGTP      ++DTGS L W QC         P      
Sbjct: 81  SSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQP------ 140

Query: 112 TATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVRE 171
           T  F+P  SSSFS LPC    C+       LP+    N  C Y+Y Y DG+  +G +  E
Sbjct: 141 TPIFNPQDSSSFSTLPCESQYCQD------LPSETCNNNECQYTYGYGDGSTTQGYMATE 200

Query: 172 KFTFSNSLSTPPVILGCAQ-----GSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRTG 231
            FTF  S S P +  GC +     G     G++GM  G LS  SQ  + +FSYC+ S   
Sbjct: 201 TFTFETS-SVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGS 260

Query: 232 SNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIAGKRLN 291
           S+P+ L  LG   +            PE   S      +L+P  Y + ++ I + G  L 
Sbjct: 261 SSPSTLA-LGSAASG----------VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLG 320

Query: 292 IPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMC 351
           IP + F+    G+G  +IDSG+ LTYL  +AY  V +     +   +      ++    C
Sbjct: 321 IPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTC 380

Query: 352 FDA---GVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGS 411
           F     G TV+V     ++S +FD GV + +G  + +L    +GV C+ +G S +LGI  
Sbjct: 381 FQQPSDGSTVQVP----EISMQFDGGV-LNLGE-QNILISPAEGVICLAMGSSSQLGI-- 435

Query: 412 NIIGTVHQQNMWVEYDLANKRVGFGGAEC 428
           +I G + QQ   V YDL N  V F   +C
Sbjct: 441 SIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI03G17780 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 1.4e-32
Identity = 100/363 (27.55%), Positives = 162/363 (44.63%), Query Frame = 1

Query: 73  IGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPI 132
           +GTP +   LVLDTGS ++WIQC             +     F+P+ SS++  L C+ P 
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEP------CADCYQQSDPVFNPTSSSTYKSLTCSAPQ 227

Query: 133 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGS 192
           C        L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +
Sbjct: 228 CS------LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDN 287

Query: 193 ----TEDRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKYVT 252
               T   G+LG+  G LS  +Q K + FSYC+  R            D+  SS   + +
Sbjct: 288 EGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDR------------DSGKSSSLDFNS 347

Query: 253 MLTFPESQSSPNLD----PLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSD 312
           +       ++P L        Y + +    + G+++ +P A F  DA GSG  ++D G+ 
Sbjct: 348 VQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTA 407

Query: 313 LTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNG 372
           +T L  +AY  +++  ++L    +KKG    ++ D C+D      V  ++  ++F F  G
Sbjct: 408 VTRLQTQAYNSLRDAFLKLT-VNLKKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGG 467

Query: 373 VEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGG 428
             + +     ++   + G  C     +       +IIG V QQ   + YDL+   +G  G
Sbjct: 468 KSLDLPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSG 500

BLAST of CSPI03G17780 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 6.5e-30
Identity = 108/367 (29.43%), Positives = 171/367 (46.59%), Query Frame = 1

Query: 71  LPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTATFDPSLSSSFSLLPCN 130
           L +GTP +   +VLDTGS + W+QC   ++   +  P+       FDP  S +++ +PC+
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPI-------FDPRKSKTYATIPCS 205

Query: 131 HPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCA 190
            P C+ R+      T   + + C Y   Y DG+   G+   E  TF  +     V LGC 
Sbjct: 206 SPHCR-RLDSAGCNT---RRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCG 265

Query: 191 QGS----TEDRGILGMNHGRLSFISQAK---ISKFSYCVPSRTGSNPTGLFYLGDNPNSS 250
             +        G+LG+  G+LSF  Q       KFSYC+  R+ S+       G+   S 
Sbjct: 266 HDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 325

Query: 251 KFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRL-NIPPAAFKPDAGGSGQTMIDS 310
             ++  +L      S+P LD   Y + +  I + G R+  +  + FK D  G+G  +IDS
Sbjct: 326 IARFTPLL------SNPKLDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDS 385

Query: 311 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEF 370
           G+ +T L+  AY  +++     VGA   K     ++ D CFD     EV  ++  +   F
Sbjct: 386 GTSVTRLIRPAYIAMRDAF--RVGAKTLKRAPDFSLFDTCFDLSNMNEV--KVPTVVLHF 445

Query: 371 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVG 429
             G ++ +     ++     G  C     +G +G G +IIG + QQ   V YDLA+ RVG
Sbjct: 446 -RGADVSLPATNYLIPVDTNGKFCFAF--AGTMG-GLSIIGNIQQQGFRVVYDLASSRVG 485

BLAST of CSPI03G17780 vs. TrEMBL
Match: Q9FGI3_ARATH (AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1)

HSP 1 Score: 553.5 bits (1425), Expect = 2.2e-154
Identity = 283/436 (64.91%), Positives = 340/436 (77.98%), Query Frame = 1

Query: 2   LLILFSLSLFTLSFSQSNSLSLPFPL-SLTEKPSNITPLYYSSQLYVKKPS---SHGPFK 61
           LL +F    +++S S S+SLSL FPL SL   P+  +  + +S L  + PS   S   F+
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDP 121
              KYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FDP
Sbjct: 72  SNIKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFDP 131

Query: 122 SLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSN 181
           SLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSN
Sbjct: 132 SLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSN 191

Query: 182 SLSTPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRT---GSNPTGLFY 241
           S +TPP+ILGCA+ ST+++GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG FY
Sbjct: 192 SQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFY 251

Query: 242 LGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 301
           LGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + F+PDAGGS
Sbjct: 252 LGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGS 311

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   ++E+GR I
Sbjct: 312 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 371

Query: 362 GDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYD 421
           GD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D
Sbjct: 372 GDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 431

Query: 422 LANKRVGFGGAECSRL 431
           + N+RVGF  AEC  L
Sbjct: 432 VTNRRVGFSKAECRLL 441

BLAST of CSPI03G17780 vs. TrEMBL
Match: D7MID8_ARALL (Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_493732 PE=3 SV=1)

HSP 1 Score: 551.2 bits (1419), Expect = 1.1e-153
Identity = 286/437 (65.45%), Positives = 340/437 (77.80%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPL-SLTEKPSNITPLYYSSQLYVKKPS-SHGP--F 60
           +L I F     ++S S S+SLSL FPL SL   P+  +  + +S L  + PS S  P  F
Sbjct: 12  LLYIFFFFFCNSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPSSSPYTF 71

Query: 61  KLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFD 120
           +  FKYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FD
Sbjct: 72  RSNFKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFD 131

Query: 121 PSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS 180
           PSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFS
Sbjct: 132 PSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFS 191

Query: 181 NSLSTPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRT---GSNPTGLF 240
           NS +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG F
Sbjct: 192 NSQTTPPLILGCAKESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSF 251

Query: 241 YLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGG 300
           YLG+NPNS  FKYV++LTFP+SQ  PNLDPLAYT+P+  I+I  KRLNIP + F+PDAGG
Sbjct: 252 YLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGG 311

Query: 301 SGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRR 360
           SGQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD    + +GR 
Sbjct: 312 SGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRL 371

Query: 361 IGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEY 420
           IGD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+
Sbjct: 372 IGDLVFEFGRGVEILVEK-QRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEF 431

Query: 421 DLANKRVGFGGAECSRL 431
           D+AN+RVGF  AECSRL
Sbjct: 432 DVANRRVGFSKAECSRL 442

BLAST of CSPI03G17780 vs. TrEMBL
Match: A0A061EL58_THECC (Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=3 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 2.1e-152
Identity = 284/429 (66.20%), Positives = 331/429 (77.16%), Query Frame = 1

Query: 4   ILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPFKYS 63
           I FS  L +L FS+ N  +L   L  T+  S + P          +PSS+  +K  FKYS
Sbjct: 35  ISFSFPLTSLRFSRDNVQTLYRSLVSTKPNSTVQP----------RPSSYN-YKTTFKYS 94

Query: 64  SSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSF 123
             AL+V+LPIGTPPQ   +VLDTGSQLSWIQCH KKV ++ PP P     +FDPSLSSSF
Sbjct: 95  M-ALIVALPIGTPPQTQQMVLDTGSQLSWIQCH-KKVARKPPPPP----TSFDPSLSSSF 154

Query: 124 SLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPP 183
           S+LPC HP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS S STPP
Sbjct: 155 SVLPCTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRSQSTPP 214

Query: 184 VILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSR---TGSNPTGLFYLGDNPN 243
           +ILGCA  ++ED+GILGMN GRLSF SQAKISKFSYCVP+R    G +PTG FYLG+NP+
Sbjct: 215 LILGCATDTSEDKGILGMNLGRLSFASQAKISKFSYCVPTRRTQPGFSPTGSFYLGENPS 274

Query: 244 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 303
           S  F+YV ++ FPES + PN+DPLAYTLPM+ I+I  K+L IP + F+PDAGGSGQTMID
Sbjct: 275 SRGFQYVNLMIFPESGTRPNMDPLAYTLPMQGIRIGAKKLPIPTSVFRPDAGGSGQTMID 334

Query: 304 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 363
           SGS+ TYLVD+AY KV+EEVVRLVG  +KKGYVY  VADMCFD G  +E+GR IGDM  E
Sbjct: 335 SGSEFTYLVDDAYNKVREEVVRLVGPRIKKGYVYGGVADMCFD-GNPIEIGRLIGDMVLE 394

Query: 364 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 423
           F+ GVEI V + E VL +VE GV C+GIGRS  LG  SNIIG  HQQN+WVEYDL N+RV
Sbjct: 395 FEKGVEITVEK-ERVLADVEGGVHCLGIGRSSMLGAASNIIGNFHQQNLWVEYDLVNRRV 444

Query: 424 GFGGAECSR 430
           GFG A+CSR
Sbjct: 455 GFGKADCSR 444

BLAST of CSPI03G17780 vs. TrEMBL
Match: R0GII4_9BRAS (Uncharacterized protein OS=Capsella rubella GN=CARUB_v10004834mg PE=3 SV=1)

HSP 1 Score: 537.7 bits (1384), Expect = 1.3e-149
Identity = 278/432 (64.35%), Positives = 330/432 (76.39%), Query Frame = 1

Query: 2   LLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPFK 61
           ++  F  +  +L+ S S+SLSL FPL+           + +S L  + PSS   F+   K
Sbjct: 19  IIFFFLCNSVSLTCS-SSSLSLHFPLTSLHLSPTTNSSFTTSLLSRRNPSSSYTFRSNVK 78

Query: 62  YSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSS 121
           YS  ALV+SLPIGTP Q  +LVLDTGSQLSWI+CH KK+KK       P T +FDPSLSS
Sbjct: 79  YSM-ALVISLPIGTPSQSQELVLDTGSQLSWIKCHPKKIKK-------PITTSFDPSLSS 138

Query: 122 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLST 181
           SFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS  T
Sbjct: 139 SFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQIT 198

Query: 182 PPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSR---TGSNPTGLFYLGDN 241
           PP+ILGCA+ ST+++GILGMN GRLSFISQAKISKFSYC+P+R   +G   TG FYLGDN
Sbjct: 199 PPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSDQSGLASTGSFYLGDN 258

Query: 242 PNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTM 301
           PNS  FKYV++LTFP+SQ  PNLDPLAYT+P+  I+I  KRLNIP +AF+PDAGGSGQTM
Sbjct: 259 PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPASAFRPDAGGSGQTM 318

Query: 302 IDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMS 361
           +DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD      +GR IGD+ 
Sbjct: 319 VDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDE----NIGRLIGDLV 378

Query: 362 FEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANK 421
           FEF  GVEI V R + +L  V  GV CVGIGRS  LG  SNIIG VHQQN+WVE+D+ N+
Sbjct: 379 FEFGRGVEILVER-QRLLVNVGGGVHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNR 436

Query: 422 RVGFGGAECSRL 431
           RVGF  A CSR+
Sbjct: 439 RVGFSKAVCSRI 436

BLAST of CSPI03G17780 vs. TrEMBL
Match: A0A0R0L0W4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_02G243600 PE=3 SV=1)

HSP 1 Score: 537.3 bits (1383), Expect = 1.7e-149
Identity = 269/408 (65.93%), Positives = 319/408 (78.19%), Query Frame = 1

Query: 26  PLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLD 85
           PLS   KP N  P      L     SS    K  FKYS  ALVV+LPIGTPPQ   +VLD
Sbjct: 4   PLSPKGKPLNRNP-----NLRTLSSSSSYNIKSSFKYSM-ALVVTLPIGTPPQHQQMVLD 63

Query: 86  TGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTS 145
           TGSQLSWIQCH+K           P TA+FDPSLSSSF +LPC HP+CKPR+PDFTLPT+
Sbjct: 64  TGSQLSWIQCHNKT----------PPTASFDPSLSSSFYILPCTHPLCKPRVPDFTLPTT 123

Query: 146 CDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGSTEDRGILGMNHGR 205
           CDQNRLCHYSYFYADGT AEGNLVREK TFS S +TPP+ILGCA  S++ RGILGMN GR
Sbjct: 124 CDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLILGCATESSDARGILGMNLGR 183

Query: 206 LSFISQAKISKFSYCVPSRTGSN----PTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNL 265
           LSF SQAK++KFSYCVP+R  +N    PTG FYLG+NPNS++F+YV+MLTFP+SQ  PNL
Sbjct: 184 LSFPSQAKVTKFSYCVPTRQAANDNNLPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNL 243

Query: 266 DPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVV 325
           DPLAYT+PM+ I+I GK+LNIPP+ F+P+AGGSGQTM+DSGS+ T+LVD AY+ V+EEV+
Sbjct: 244 DPLAYTVPMQGIRIGGKKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDAAYDAVREEVI 303

Query: 326 RLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEK 385
           R+VG  +KKGYVY  VADMCFD G  +E+GR IGD++FEF+ GVEI V + E VL +V  
Sbjct: 304 RVVGPRVKKGYVYGGVADMCFD-GSVMEIGRLIGDVAFEFEKGVEIVVPK-ERVLADVGG 363

Query: 386 GVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSR 430
           GV C+GIGRS RLG  SNIIG  HQQN+WVE+DLAN+R+GFG A+CSR
Sbjct: 364 GVHCLGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCSR 393

BLAST of CSPI03G17780 vs. TAIR10
Match: AT5G37540.1 (AT5G37540.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 553.5 bits (1425), Expect = 1.1e-157
Identity = 283/436 (64.91%), Positives = 340/436 (77.98%), Query Frame = 1

Query: 2   LLILFSLSLFTLSFSQSNSLSLPFPL-SLTEKPSNITPLYYSSQLYVKKPS---SHGPFK 61
           LL +F    +++S S S+SLSL FPL SL   P+  +  + +S L  + PS   S   F+
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDP 121
              KYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FDP
Sbjct: 72  SNIKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFDP 131

Query: 122 SLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSN 181
           SLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSN
Sbjct: 132 SLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSN 191

Query: 182 SLSTPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRT---GSNPTGLFY 241
           S +TPP+ILGCA+ ST+++GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG FY
Sbjct: 192 SQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFY 251

Query: 242 LGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 301
           LGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + F+PDAGGS
Sbjct: 252 LGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGS 311

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   ++E+GR I
Sbjct: 312 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 371

Query: 362 GDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYD 421
           GD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D
Sbjct: 372 GDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 431

Query: 422 LANKRVGFGGAECSRL 431
           + N+RVGF  AEC  L
Sbjct: 432 VTNRRVGFSKAECRLL 441

BLAST of CSPI03G17780 vs. TAIR10
Match: AT1G66180.1 (AT1G66180.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 522.3 bits (1344), Expect = 2.8e-148
Identity = 275/436 (63.07%), Positives = 322/436 (73.85%), Query Frame = 1

Query: 4   ILFSLSLFTLSFSQSNSLSLPF---PLSLTEKPSNITPLYYSSQLYVKKPSSHGP---FK 63
           + F   L  +S S S SL LP    P+S T      T    +S L  K PS   P   F+
Sbjct: 8   LFFFFFLNYVSLSTSLSLHLPLTSLPISTTTNSHRFT----TSLLSRKNPSPSSPPYNFR 67

Query: 64  LPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDP 123
             FKYS  AL++SLPIGTPPQ   +VLDTGSQLSWIQCH    +K+LPP  KPKT+ FDP
Sbjct: 68  SRFKYSM-ALIISLPIGTPPQAQQMVLDTGSQLSWIQCH----RKKLPP--KPKTS-FDP 127

Query: 124 SLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSN 183
           SLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EK TFSN
Sbjct: 128 SLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSN 187

Query: 184 SLSTPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVP---SRTGSNPTGLFY 243
           +  TPP+ILGCA  S++DRGILGMN GRLSF+SQAKISKFSYC+P   +R G  PTG FY
Sbjct: 188 TEITPPLILGCATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFY 247

Query: 244 LGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 303
           LGDNPNS  FKYV++LTFPESQ  PNLDPLAYT+PM  I+   K+LNI  + F+PDAGGS
Sbjct: 248 LGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 307

Query: 304 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRI 363
           GQTM+DSGS+ T+LVD AY+KV+ E++  VG  +KKGYVY   ADMCFD  V + + R I
Sbjct: 308 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAM-IPRLI 367

Query: 364 GDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYD 423
           GD+ F F  GVEI V + E VL  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D
Sbjct: 368 GDLVFVFTRGVEILVPK-ERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 427

Query: 424 LANKRVGFGGAECSRL 431
           + N+RVGF  A+CSR+
Sbjct: 428 VTNRRVGFAKADCSRV 429

BLAST of CSPI03G17780 vs. TAIR10
Match: AT2G39710.1 (AT2G39710.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 241.1 bits (614), Expect = 1.2e-63
Identity = 158/439 (35.99%), Positives = 235/439 (53.53%), Query Frame = 1

Query: 7   SLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPFKYSSSA 66
           SLSL + +F + + L L FPL+  +  S    L +S  L  +K       KL F+++ + 
Sbjct: 9   SLSL-SKNFLRISVLLLIFPLTFCKTSSTNQTLLFS--LKTQKLPQSSSDKLSFRHNVT- 68

Query: 67  LVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLL 126
           L V+L +G PPQ   +VLDTGS+LSW+ C      K+ P L     + F+P  SS++S +
Sbjct: 69  LTVTLAVGDPPQNISMVLDTGSELSWLHC------KKSPNL----GSVFNPVSSSTYSPV 128

Query: 127 PCNHPICKPRIPDFTLPTSCD-QNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVI 186
           PC+ PIC+ R  D  +P SCD +  LCH +  YAD T  EGNL  E F    S++ P  +
Sbjct: 129 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI-GSVTRPGTL 188

Query: 187 LGC-----AQGSTED---RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGD 246
            GC     +  S ED    G++GMN G LSF++Q   SKFSYC+   +GS+ +G   LGD
Sbjct: 189 FGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI---SGSDSSGFLLLGD 248

Query: 247 NPNS--SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSG 306
              S     +Y  ++   +S   P  D +AYT+ ++ I++  K L++P + F PD  G+G
Sbjct: 249 ASYSWLGPIQYTPLVL--QSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAG 308

Query: 307 QTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK----KGYVYAAVADMCFDAGVTVEVG 366
           QTM+DSG+  T+L+   Y  +K E +    ++++      +V+    D+C+  G T    
Sbjct: 309 QTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPN 368

Query: 367 RRIGDMSFEFDNGVEIFVG------RGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVH 425
                M      G E+ V       R  G  +E ++ V C   G S  LGI + +IG  H
Sbjct: 369 FSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHH 427

BLAST of CSPI03G17780 vs. TAIR10
Match: AT5G02190.1 (AT5G02190.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 240.4 bits (612), Expect = 2.1e-63
Identity = 159/452 (35.18%), Positives = 232/452 (51.33%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           +LL+L   +   +S S S+S S  F    +   S    L   +++    P+ H P     
Sbjct: 10  LLLVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRI---TPTDHRPTDKLH 69

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
            + +  L V+L +GTPPQ   +V+DTGS+LSW++C+           P P    FDP+ S
Sbjct: 70  FHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSN-------PNPVN-NFDPTRS 129

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SS+S +PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS +
Sbjct: 130 SSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN 189

Query: 181 TPPVILGC--------AQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGL 240
              +I GC         +  T+  G+LGMN G LSFISQ    KFSYC+ S T   P G 
Sbjct: 190 DSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-SGTDDFP-GF 249

Query: 241 FYLGDNPNSSKFKYVTMLTFPE----SQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFK 300
             LGD    S F ++T L +      S   P  D +AYT+ +  IK+ GK L IP +   
Sbjct: 250 LLLGD----SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLV 309

Query: 301 PDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMM----KKGYVYAAVADMCFD- 360
           PD  G+GQTM+DSG+  T+L+   Y  ++   +     ++       +V+    D+C+  
Sbjct: 310 PDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRI 369

Query: 361 AGVTVEVG--RRIGDMSFEFDNGVEIFVGRGEGVLTEV------EKGVKCVGIGRSGRLG 420
           + V +  G   R+  +S  F+ G EI V  G+ +L  V         V C   G S  +G
Sbjct: 370 SPVRIRSGILHRLPTVSLVFE-GAEIAVS-GQPLLYRVPHLTVGNDSVYCFTFGNSDLMG 429

Query: 421 IGSNIIGTVHQQNMWVEYDLANKRVGFGGAEC 428
           + + +IG  HQQNMW+E+DL   R+G    EC
Sbjct: 430 MEAYVIGHHHQQNMWIEFDLQRSRIGLAPVEC 442

BLAST of CSPI03G17780 vs. TAIR10
Match: AT2G03200.1 (AT2G03200.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 156.8 bits (395), Expect = 3.1e-38
Identity = 119/396 (30.05%), Positives = 182/396 (45.96%), Query Frame = 1

Query: 49  KPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLP 108
           KP      K P    S   ++ L IG P      ++DTGS L W QC     K       
Sbjct: 89  KPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQC-----KPCTECFD 148

Query: 109 KPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTS-CDQNR-LCHYSYFYADGTLAEG 168
           +P T  FDP  SSS+S + C+  +C        LP S C++++  C Y Y Y D +   G
Sbjct: 149 QP-TPIFDPEKSSSYSKVGCSSGLCN------ALPRSNCNEDKDACEYLYTYGDYSSTRG 208

Query: 169 NLVREKFTFSNSLSTPPVILGCA-----QGSTEDRGILGMNHGRLSFISQAKISKFSYCV 228
            L  E FTF +  S   +  GC       G ++  G++G+  G LS ISQ K +KFSYC+
Sbjct: 209 LLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCL 268

Query: 229 PSRTGSNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNL----DPLAYTLPMKAIKIAG 288
            S   S  +   ++G   +    K    L    +++   L     P  Y L ++ I +  
Sbjct: 269 TSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGA 328

Query: 289 KRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAV 348
           KRL++  + F+    G+G  +IDSG+ +TYL + A++ +KEE    +   +      +  
Sbjct: 329 KRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDS--GSTG 388

Query: 349 ADMCF---DAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRL 408
            D+CF   DA   + V +    M F F  G ++ +     ++ +   GV C+ +G S   
Sbjct: 389 LDLCFKLPDAAKNIAVPK----MIFHF-KGADLELPGENYMVADSSTGVLCLAMGSSN-- 448

Query: 409 GIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRL 431
             G +I G V QQN  V +DL  + V F   EC +L
Sbjct: 449 --GMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

BLAST of CSPI03G17780 vs. NCBI nr
Match: gi|778679910|ref|XP_011651212.1| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 866.3 bits (2237), Expect = 2.2e-248
Identity = 429/431 (99.54%), Positives = 431/431 (100.00%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180

Query: 181 TPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240
           TPPVILGCAQGSTE+RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 431

BLAST of CSPI03G17780 vs. NCBI nr
Match: gi|778679913|ref|XP_004140731.2| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 822.8 bits (2124), Expect = 2.8e-235
Identity = 411/431 (95.36%), Positives = 418/431 (96.98%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFSLSLFTLSFSQSNSLSLPFPLSL+EKPSN  P Y SSQLY K+PSS+G FKLPF
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSEKPSNTIPSY-SSQLYAKRPSSYGSFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSS+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS SLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS 180

Query: 181 TPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240
           TPPVILGCAQ STE+RGILGMN GRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYA VADMCFDAGVT EVGRRIG +SFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS RLGIGSNIIGTVHQQNMWVEYDLANKRV
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 420

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 430

BLAST of CSPI03G17780 vs. NCBI nr
Match: gi|659114575|ref|XP_008457122.1| (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])

HSP 1 Score: 799.3 bits (2063), Expect = 3.4e-228
Identity = 398/431 (92.34%), Positives = 412/431 (95.59%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFSLSLFTL FSQSNS+SLPFPLSL+EKPSNI+P+Y  SQLY KKPSSHG FKLPF
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKPSNISPIY-GSQLYAKKPSSHGSFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSS+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD KVKK+LPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KVKKKLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ SNSLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSLS 180

Query: 181 TPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240
           TPPVILGCAQ STE+RGILGMN GRLSFISQAKISKFSYCVP+RTGSNPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SS+FKYVTMLTFPESQSSPNLDPLAYTLPMK IKIAGKRLNI PAAFKPDAGGSGQTMID
Sbjct: 241 SSRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGA MKKGYVYAAVADMCFDA VT EVGRRIG +SFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEI VGRGEGVLTEVEKGVKCVG GRS RLGIGSNIIGTVHQQNMWVEYDL N+R+
Sbjct: 361 FDNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRRI 420

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 429

BLAST of CSPI03G17780 vs. NCBI nr
Match: gi|18421660|ref|NP_568551.1| (aspartyl protease family protein [Arabidopsis thaliana])

HSP 1 Score: 553.5 bits (1425), Expect = 3.2e-154
Identity = 283/436 (64.91%), Positives = 340/436 (77.98%), Query Frame = 1

Query: 2   LLILFSLSLFTLSFSQSNSLSLPFPL-SLTEKPSNITPLYYSSQLYVKKPS---SHGPFK 61
           LL +F    +++S S S+SLSL FPL SL   P+  +  + +S L  + PS   S   F+
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDP 121
              KYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FDP
Sbjct: 72  SNIKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFDP 131

Query: 122 SLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSN 181
           SLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSN
Sbjct: 132 SLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSN 191

Query: 182 SLSTPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRT---GSNPTGLFY 241
           S +TPP+ILGCA+ ST+++GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG FY
Sbjct: 192 SQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFY 251

Query: 242 LGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 301
           LGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + F+PDAGGS
Sbjct: 252 LGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGS 311

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   ++E+GR I
Sbjct: 312 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 371

Query: 362 GDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYD 421
           GD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+D
Sbjct: 372 GDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 431

Query: 422 LANKRVGFGGAECSRL 431
           + N+RVGF  AEC  L
Sbjct: 432 VTNRRVGFSKAECRLL 441

BLAST of CSPI03G17780 vs. NCBI nr
Match: gi|297801286|ref|XP_002868527.1| (aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata])

HSP 1 Score: 551.2 bits (1419), Expect = 1.6e-153
Identity = 286/437 (65.45%), Positives = 340/437 (77.80%), Query Frame = 1

Query: 1   MLLILFSLSLFTLSFSQSNSLSLPFPL-SLTEKPSNITPLYYSSQLYVKKPS-SHGP--F 60
           +L I F     ++S S S+SLSL FPL SL   P+  +  + +S L  + PS S  P  F
Sbjct: 12  LLYIFFFFFCNSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPSSSPYTF 71

Query: 61  KLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFD 120
           +  FKYS  AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LPP     T +FD
Sbjct: 72  RSNFKYSM-ALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP----PTTSFD 131

Query: 121 PSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS 180
           PSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFS
Sbjct: 132 PSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFS 191

Query: 181 NSLSTPPVILGCAQGSTEDRGILGMNHGRLSFISQAKISKFSYCVPSRT---GSNPTGLF 240
           NS +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G   TG F
Sbjct: 192 NSQTTPPLILGCAKESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSF 251

Query: 241 YLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGG 300
           YLG+NPNS  FKYV++LTFP+SQ  PNLDPLAYT+P+  I+I  KRLNIP + F+PDAGG
Sbjct: 252 YLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGG 311

Query: 301 SGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRR 360
           SGQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD    + +GR 
Sbjct: 312 SGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRL 371

Query: 361 IGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEY 420
           IGD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQQN+WVE+
Sbjct: 372 IGDLVFEFGRGVEILVEK-QRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEF 431

Query: 421 DLANKRVGFGGAECSRL 431
           D+AN+RVGF  AECSRL
Sbjct: 432 DVANRRVGFSKAECSRL 442
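
The bit scores and Expect values in the HSP headers are linked by the standard BLAST relation E = N * 2^(-S'), where S' is the bit score and N is the effective search space (query length times database length, after edge corrections). The sketch below assumes this report was produced with those standard statistics. The function names are hypothetical, and because the printed scores and E-values are rounded, the implied search space is only approximate.

```python
def implied_search_space(bit_score: float, evalue: float) -> float:
    """Effective search space N implied by E = N * 2**(-bit_score)."""
    return evalue * 2.0 ** bit_score

def evalue_from_bits(bit_score: float, search_space: float) -> float:
    """Expect value for a given bit score and effective search space."""
    return search_space * 2.0 ** (-bit_score)

# Values taken from the AT2G03200.1 HSP header (156.8 bits, Expect = 3.1e-38)
space = implied_search_space(156.8, 3.1e-38)
print(f"implied effective search space: {space:.2e}")

# For a fixed search space, each additional bit of score halves the E-value
print(evalue_from_bits(157.8, space) / evalue_from_bits(156.8, space))
```

This is why E-values are comparable only within one database search: the same bit score gives a larger E-value against a larger database such as NCBI nr than against TAIR10.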

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PCS1L_ARATH	3.8e-62	35.18	Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
NEP1_NEPGR	9.7e-42	32.15	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR	4.1e-40	31.62	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
ASPG1_ARATH	1.4e-32	27.55	Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
APF2_ARATH	6.5e-30	29.43	Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1

Match Name	E-value	Identity	Description
Q9FGI3_ARATH	2.2e-154	64.91	AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1
D7MID8_ARALL	1.1e-153	65.45	Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA...
A0A061EL58_THECC	2.1e-152	66.20	Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=...
R0GII4_9BRAS	1.3e-149	64.35	Uncharacterized protein OS=Capsella rubella GN=CARUB_v10004834mg PE=3 SV=1
A0A0R0L0W4_SOYBN	1.7e-149	65.93	Uncharacterized protein OS=Glycine max GN=GLYMA_02G243600 PE=3 SV=1

Match Name	E-value	Identity	Description
AT5G37540.1	1.1e-157	64.91	Eukaryotic aspartyl protease family protein
AT1G66180.1	2.8e-148	63.07	Eukaryotic aspartyl protease family protein
AT2G39710.1	1.2e-63	35.99	Eukaryotic aspartyl protease family protein
AT5G02190.1	2.1e-63	35.18	Eukaryotic aspartyl protease family protein
AT2G03200.1	3.1e-38	30.05	Eukaryotic aspartyl protease family protein

Match Name	E-value	Identity	Description
gi|778679910|ref|XP_011651212.1|	2.2e-248	99.54	PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus]
gi|778679913|ref|XP_004140731.2|	2.8e-235	95.36	PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus]
gi|659114575|ref|XP_008457122.1|	3.4e-228	92.34	PREDICTED: aspartic proteinase PCS1 [Cucumis melo]
gi|18421660|ref|NP_568551.1|	3.2e-154	64.91	aspartyl protease family protein [Arabidopsis thaliana]
gi|297801286|ref|XP_002868527.1|	1.6e-153	65.45	aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
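
The E-value and identity columns can be used together to separate near-identical hits from more distant family members. The sketch below screens the five NCBI nr hits listed above. The cut-offs (`max_evalue`, `min_identity`) are arbitrary illustration values and the function name `filter_hits` is hypothetical; neither comes from this record.

```python
# (accession, E-value, percent identity) copied from the NCBI nr hits above
hits = [
    ("XP_011651212.1", 2.2e-248, 99.54),
    ("XP_004140731.2", 2.8e-235, 95.36),
    ("XP_008457122.1", 3.4e-228, 92.34),
    ("NP_568551.1", 3.2e-154, 64.91),
    ("XP_002868527.1", 1.6e-153, 65.45),
]

def filter_hits(hits, max_evalue=1e-100, min_identity=90.0):
    """Keep hits at or below the E-value cut-off and at or above the identity cut-off.

    Both thresholds are assumptions chosen for illustration only.
    """
    return [h for h in hits if h[1] <= max_evalue and h[2] >= min_identity]

for accession, evalue, identity in filter_hits(hits):
    print(f"{accession}\t{evalue:.1e}\t{identity:.2f}")
```

With these cut-offs only the three Cucumis entries remain; the two Arabidopsis entries pass the E-value cut-off but not the identity cut-off.
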
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001461	Aspartic_peptidase_A1
IPR001969	Aspartic_peptidase_AS
IPR021109	Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0004190	aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
biological_process	GO:0015992	proton transport
biological_process	GO:0006511	ubiquitin-dependent protein catabolic process
biological_process	GO:0006744	ubiquinone biosynthetic process
biological_process	GO:0006814	sodium ion transport
biological_process	GO:0051788	response to misfolded protein
biological_process	GO:0006979	response to oxidative stress
biological_process	GO:0080129	proteasome core complex assembly
biological_process	GO:0009853	photorespiration
biological_process	GO:0006120	mitochondrial electron transport, NADH to ubiquinone
biological_process	GO:0009630	gravitropism
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0009507	chloroplast
cellular_component	GO:0005747	mitochondrial respiratory chain complex I
molecular_function	GO:0016740	transferase activity
molecular_function	GO:0004190	aspartic-type endopeptidase activity
molecular_function	GO:0051537	2 iron, 2 sulfur cluster binding
molecular_function	GO:0051539	4 iron, 4 sulfur cluster binding
molecular_function	GO:0009055	electron carrier activity
molecular_function	GO:0046872	metal ion binding
molecular_function	GO:0008137	NADH dehydrogenase (ubiquinone) activity
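
Each GO assignment row consists of a category, an accession, and a free-text term name that may itself contain spaces and commas. The sketch below groups such rows by category; it assumes whitespace-separated fields in the order shown above, and the function name `group_go_terms` is hypothetical.

```python
from collections import defaultdict

def group_go_terms(lines):
    """Group 'category GO:accession term name' rows by category.

    Splits on the first two runs of whitespace only, so term names such as
    '2 iron, 2 sulfur cluster binding' stay intact. Rows whose second field
    is not a GO accession (for example the header row) are skipped.
    """
    groups = defaultdict(list)
    for line in lines:
        parts = line.split(None, 2)
        if len(parts) == 3 and parts[1].startswith("GO:"):
            category, accession, name = parts
            groups[category].append((accession, name.strip()))
    return dict(groups)

# A few rows copied from the listing above
rows = [
    "Category Term Accession Term Name",
    "biological_process GO:0006508 proteolysis",
    "cellular_component GO:0009507 chloroplast",
    "molecular_function GO:0004190 aspartic-type endopeptidase activity",
    "molecular_function GO:0051537 2 iron, 2 sulfur cluster binding",
]
for category, terms in group_go_terms(rows).items():
    print(category, len(terms), terms)
```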

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI03G17780.1	CSPI03G17780.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 297..308, score: 3.1E-5; coord: 73..93, score: 3.1E-5; coord: 399..414, score: 3.
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 1..430, score: 1.8E
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 82..93, scor
IPR021109	Aspartic peptidase domain	GENE3D	G3DSA:2.40.70.10		coord: 65..237, score: 9.4E-31; coord: 239..429, score: 9.1
IPR021109	Aspartic peptidase domain	unknown	SSF50630	Acid proteases	coord: 64..429, score: 3.27
None	No IPR available	PANTHER	PTHR13683:SF327	ASPARTYL PROTEASE FAMILY PROTEIN	coord: 1..430, score: 1.8E
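
The `coord: start..end` entries in the Alignment column give the position of each domain match on the protein. The sketch below extracts them and computes span lengths. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function names are hypothetical.

```python
import re

def coord_spans(text: str):
    """Return (start, end) tuples for every 'coord: start..end' entry in the text."""
    return [(int(a), int(b)) for a, b in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)]

def span_length(span) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (an assumption)."""
    start, end = span
    return end - start + 1

# The two GENE3D matches for IPR021109 listed above
gene3d = "coord: 65..237 score: 9.4E-31 coord: 239..429"
spans = coord_spans(gene3d)
print(spans)                            # [(65, 237), (239, 429)]
print([span_length(s) for s in spans])  # [173, 191]
```

Under that assumption the two GENE3D matches cover residues 65 to 237 and 239 to 429, consistent with the two-lobed fold typical of aspartic peptidases.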