CmoCh14G008870 (gene) Cucurbita moschata (Rifu)

NameCmoCh14G008870
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionEukaryotic aspartyl protease family protein
LocationCmo_Chr14 : 4701231 .. 4702502 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTCTCTCTCTCTTCTATCTCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCTCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTTCCTCTGTTTCTCCTCTCTTCTCTTCTCTCTCCGCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGGAAAGTTCATAAGAAATTGCTCAAGCCCAAAACGACGTCGTTCGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGTACCTTGGCTGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAATACTGGACGTCTCTCCTTCGTCTCCCAAGCTAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACTGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCATTGAAGGGCATAAAAATAGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGACAGGAGTGGGTCCGGTCAAACCATGATTGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACGGCGGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAACACGTGGATAGAATATGATTTGGCCAATAGGAGAATAGGTTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

mRNA sequence

ATGCTTCTCTCTCTCTTCTATCTCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCTCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTTCCTCTGTTTCTCCTCTCTTCTCTTCTCTCTCCGCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGGAAAGTTCATAAGAAATTGCTCAAGCCCAAAACGACGTCGTTCGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGTACCTTGGCTGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAATACTGGACGTCTCTCCTTCGTCTCCCAAGCTAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACTGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCATTGAAGGGCATAAAAATAGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGACAGGAGTGGGTCCGGTCAAACCATGATTGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACGGCGGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAACACGTGGATAGAATATGATTTGGCCAATAGGAGAATAGGTTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

Coding sequence (CDS)

ATGCTTCTCTCTCTCTTCTATCTCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCTCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTTCCTCTGTTTCTCCTCTCTTCTCTTCTCTCTCCGCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGGAAAGTTCATAAGAAATTGCTCAAGCCCAAAACGACGTCGTTCGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGTACCTTGGCTGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAATACTGGACGTCTCTCCTTCGTCTCCCAAGCTAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACTGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCATTGAAGGGCATAAAAATAGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGACAGGAGTGGGTCCGGTCAAACCATGATTGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACGGCGGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAACACGTGGATAGAATATGATTTGGCCAATAGGAGAATAGGTTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA
BLAST of CmoCh14G008870 vs. Swiss-Prot
Match: PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 1.8e-64
Identity = 159/448 (35.49%), Postives = 234/448 (52.23%), Query Frame = 1

Query: 4   SLFYLSLLSLS----FSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG---SVKLP 63
           +LF L +LS+      S S S S SF  S     S   + V PL + ++      + KL 
Sbjct: 7   ALFLLLVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLH 66

Query: 64  FKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFS 123
           F ++  L V+L +GTPPQ   +V+DTGS+LSW++C+R  +         +FDP+ SSS+S
Sbjct: 67  FHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPN----PVNNFDPTRSSSYS 126

Query: 124 LLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPV 183
            +PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS     +
Sbjct: 127 PIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNL 186

Query: 184 ILGC--------AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLG 243
           I GC         +  T+  G+LGMN G LSF+SQ    KFSYC+ G    D  G   LG
Sbjct: 187 IFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLG 246

Query: 244 DNPNSANFKYISLLTFPK----SQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRS 303
           D    +NF +++ L +      S   P  D +AYT+ L GIK+ G  L I  +V  PD +
Sbjct: 247 D----SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHT 306

Query: 304 GSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLM----KKGYEYAAVADMCFNDGET- 363
           G+GQTM+DSG+  T+L+   Y  ++   +     ++       + +    D+C+      
Sbjct: 307 GAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVR 366

Query: 364 --AEVGRRIRDMSFEFENGVEISVGKGEGVLTEV------EKGVKCVGFGRSGRLGIASN 420
             + +  R+  +S  FE G EI+V  G+ +L  V         V C  FG S  +G+ + 
Sbjct: 367 IRSGILHRLPTVSLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAY 426

BLAST of CmoCh14G008870 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 6.1e-41
Identity = 115/364 (31.59%), Postives = 175/364 (48.08%), Query Frame = 1

Query: 64  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHP 123
           ++++ IGTP      ++DTGS L W QC      +     T  F+P  SSSFS LPC   
Sbjct: 97  LMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCT--QCFSQPTPIFNPQDSSSFSTLPCESQ 156

Query: 124 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQA 183
            C+       LP+    N  C Y+Y Y DG+  +G +  E FTF  S + P +  GC + 
Sbjct: 157 YCQD------LPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIAFGCGED 216

Query: 184 ST-----ENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKY 243
           +         G++GM  G LS  SQ  + +FSYC+    G  S     LG   +      
Sbjct: 217 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSY-GSSSPSTLALGSAASGVPEGS 276

Query: 244 ISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLT 303
            S      S     L+P  Y + L+GI + G+ L I S+ F+    G+G  +IDSG+ LT
Sbjct: 277 PSTTLIHSS-----LNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 336

Query: 304 YLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCF---NDGETAEVGRRIRDMSFEFEN 363
           YL  +AY  V +     +   +    E ++    CF   +DG T +V     ++S +F+ 
Sbjct: 337 YLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTCFQQPSDGSTVQV----PEISMQFDG 396

Query: 364 GVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFG 420
           GV +++G+ + +L    +GV C+  G S +LGI  +I G + Q+ T + YDL N  + F 
Sbjct: 397 GV-LNLGE-QNILISPAEGVICLAMGSSSQLGI--SIFGNIQQQETQVLYDLQNLAVSFV 435

BLAST of CmoCh14G008870 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 2.4e-37
Identity = 109/363 (30.03%), Postives = 166/363 (45.73%), Query Frame = 1

Query: 64  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHP 123
           +++L IGTP QP   ++DTGS L W QC      +     T  F+P  SSSFS LPC+  
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCT--QCFNQSTPIFNPQGSSSFSTLPCSSQ 155

Query: 124 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQA 183
           LC+       L +    N  C Y+Y Y DG+  +G++  E  TF  S + P +  GC + 
Sbjct: 156 LCQ------ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGEN 215

Query: 184 ST-----ENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKY 243
           +         G++GM  G LS  SQ  ++KFSYC+    G  +     LG   NS     
Sbjct: 216 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTP-IGSSTPSNLLLGSLANSVTAGS 275

Query: 244 ISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPD-RSGSGQTMIDSGSDL 303
            +      SQ      P  Y + L G+ +   RL I  + F  +  +G+G  +IDSG+ L
Sbjct: 276 PNTTLIQSSQI-----PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 335

Query: 304 TYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENG- 363
           TY V+ AY+ V++E +  +   +  G   ++  D+CF          +I      F+ G 
Sbjct: 336 TYFVNNAYQSVRQEFISQINLPVVNG--SSSGFDLCFQTPSDPS-NLQIPTFVMHFDGGD 395

Query: 364 VEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGG 420
           +E+     E        G+ C+  G S +     +I G + Q+N  + YD  N  + F  
Sbjct: 396 LEL---PSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVVSFAS 434

BLAST of CmoCh14G008870 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 2.7e-36
Identity = 107/359 (29.81%), Postives = 166/359 (46.24%), Query Frame = 1

Query: 69  IGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHPLCKPR 128
           +GTP +   LVLDTGS ++WIQC         +     F+P+ SS++  L C+ P C   
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCAD--CYQQSDPVFNPTSSSTYKSLTCSAPQCS-- 227

Query: 129 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS---- 188
                L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +    
Sbjct: 228 ----LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLF 287

Query: 189 TENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFY----LGDNPNSANFKYIS 248
           T   G+LG+  G LS  +Q K + FSYC+  R    S+ L +    LG    +A      
Sbjct: 288 TGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLL--- 347

Query: 249 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 308
                   R+  +D   Y + L G  + G ++ +  A+F  D SGSG  ++D G+ +T L
Sbjct: 348 --------RNKKIDTFYY-VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 407

Query: 309 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 368
             +AY  +++  +KL   L KKG    ++ D C++    + V  ++  ++F F  G  + 
Sbjct: 408 QTQAYNSLRDAFLKLTVNL-KKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLD 467

Query: 369 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADC 420
           +     ++   + G  C  F  +     + +IIG V Q+ T I YDL+   IG  G  C
Sbjct: 468 LPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh14G008870 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 8.9e-32
Identity = 104/362 (28.73%), Postives = 176/362 (48.62%), Query Frame = 1

Query: 67  LPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHPLCK 126
           L +GTP +   +VLDTGS + W+QC     ++        FDP  S +++ +PC+ P C+
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQC--APCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 205

Query: 127 PRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS-- 186
            R+      T   + + C Y   Y DG+   G+   E  TF  +R    V LGC   +  
Sbjct: 206 -RLDSAGCNT---RRKTCLYQVSYGDGSFTVGDFSTETLTFRRNR-VKGVALGCGHDNEG 265

Query: 187 --TENRGILGMNTGRLSFVSQAK---ISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYI 246
                 G+LG+  G+LSF  Q       KFSYC+  R+          G+   S   ++ 
Sbjct: 266 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 325

Query: 247 SLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRL-NISSAVFKPDRSGSGQTMIDSGSDLT 306
            LL+ PK      LD   Y + L GI + G R+  +++++FK D+ G+G  +IDSG+ +T
Sbjct: 326 PLLSNPK------LDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVT 385

Query: 307 YLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVE 366
            L+  AY  +++   ++    +K+  ++ ++ D CF+     EV  ++  +   F  G +
Sbjct: 386 RLIRPAYIAMRDAF-RVGAKTLKRAPDF-SLFDTCFDLSNMNEV--KVPTVVLHF-RGAD 445

Query: 367 ISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGAD 421
           +S+     ++     G  C  F  +G +G   +IIG + Q+   + YDLA+ R+GF    
Sbjct: 446 VSLPATNYLIPVDTNGKFCFAF--AGTMG-GLSIIGNIQQQGFRVVYDLASSRVGFAPGG 485

BLAST of CmoCh14G008870 vs. TrEMBL
Match: Q9FGI3_ARATH (AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1)

HSP 1 Score: 543.1 bits (1398), Expect = 3.0e-151
Identity = 281/432 (65.05%), Postives = 329/432 (76.16%), Query Frame = 1

Query: 2   LLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG--------SV 61
           LL +F+    S+S S S SLSL FPL+ S R +P ++S S   S LS           + 
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLT-SLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTF 71

Query: 62  KLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSS 121
           +   KYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P TTSFDPSLSS
Sbjct: 72  RSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSS 131

Query: 122 SFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTT 181
           SFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TT
Sbjct: 132 SFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT 191

Query: 182 PPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDN 241
           PP+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLGDN
Sbjct: 192 PPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDN 251

Query: 242 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 301
           PNS  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNI  +VF+PD  GSGQTM
Sbjct: 252 PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTM 311

Query: 302 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 361
           +DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+   + E+GR I D+ 
Sbjct: 312 VDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLV 371

Query: 362 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANR 421
           FEF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N W+E+D+ NR
Sbjct: 372 FEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNR 431

Query: 422 RIGFGGADCSRL 423
           R+GF  A+C  L
Sbjct: 432 RVGFSKAECRLL 441

BLAST of CmoCh14G008870 vs. TrEMBL
Match: D7MID8_ARALL (Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_493732 PE=3 SV=1)

HSP 1 Score: 542.0 bits (1395), Expect = 6.6e-151
Identity = 283/433 (65.36%), Postives = 330/433 (76.21%), Query Frame = 1

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG--------S 60
           +L   F+    S+S S S SLSL FPL+ S R +P ++S S   S LS           +
Sbjct: 12  LLYIFFFFFCNSVSLSWSSSLSLHFPLT-SLRLTPTTNSSSFKTSLLSRRNPSPSSSPYT 71

Query: 61  VKLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLS 120
            +  FKYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P TTSFDPSLS
Sbjct: 72  FRSNFKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLS 131

Query: 121 SSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRT 180
           SSFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+T
Sbjct: 132 SSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQT 191

Query: 181 TPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGD 240
           TPP+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLG+
Sbjct: 192 TPPLILGCAKESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGE 251

Query: 241 NPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQT 300
           NPNS  FKY+SLLTFP+SQR PNLDPLAYT+PL GI+I   RLNI S+VF+PD  GSGQT
Sbjct: 252 NPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQT 311

Query: 301 MIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDM 360
           M+DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+      +GR I D+
Sbjct: 312 MVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDL 371

Query: 361 SFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLAN 420
            FEF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N W+E+D+AN
Sbjct: 372 VFEFGRGVEILVEK-QRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVAN 431

Query: 421 RRIGFGGADCSRL 423
           RR+GF  A+CSRL
Sbjct: 432 RRVGFSKAECSRL 442

BLAST of CmoCh14G008870 vs. TrEMBL
Match: V4LXR6_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10028350mg PE=3 SV=1)

HSP 1 Score: 538.9 bits (1387), Expect = 5.6e-150
Identity = 284/439 (64.69%), Postives = 336/439 (76.54%), Query Frame = 1

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG--------- 60
           + ++  Y+ L + S S S SLSL FPL  S R +P ++S S   SS S++          
Sbjct: 3   LFVTFLYIFLCN-SVSLSSSLSLHFPLK-SLRLTPTTNSSSSSSSSSSSFQTSLASRRTP 62

Query: 61  -----SVKLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTS 120
                S +  FKYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK  KP TTS
Sbjct: 63  SSLPYSFRSNFKYSMALILSLPIGTPAQTQELVLDTGSQLSWIQCHPKKKKK--KP-TTS 122

Query: 121 FDPSLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFT 180
           FDPSLSSSFS LPC+HPLCKPRIPDFTLPT+CD NRLCHYSYFYADGT AEGNLV+EKFT
Sbjct: 123 FDPSLSSSFSDLPCSHPLCKPRIPDFTLPTTCDSNRLCHYSYFYADGTFAEGNLVKEKFT 182

Query: 181 FSNSRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRT---GPDSTG 240
           FSN++ TPP+ILGCA  ST+++GILGMN GRLSFVSQAKISKFSYC+P R+   G  STG
Sbjct: 183 FSNTQITPPLILGCAAESTDDKGILGMNLGRLSFVSQAKISKFSYCIPTRSNQPGLSSTG 242

Query: 241 LFYLGDNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDR 300
            FYLG+NP+S  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNIS++VF+PD 
Sbjct: 243 SFYLGENPSSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNISASVFRPDA 302

Query: 301 SGSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVG 360
            GSGQTM+DSGS+ T+LVD AY+KVKEEIV+LVGP +KKGY Y A ADMCF+     E+G
Sbjct: 303 GGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGPRLKKGYVYGATADMCFDGNNPVEIG 362

Query: 361 RRIRDMSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWI 420
           R I D+ FEF  GVEI V K + +L  V  GV C+G GRS  LG ASNIIG VHQ+N W+
Sbjct: 363 RLIGDLVFEFGRGVEILVEK-QRLLVNVGGGVHCLGIGRSSMLGAASNIIGNVHQQNLWV 422

Query: 421 EYDLANRRIGFGGADCSRL 423
           E+D+ANRR+GF  ADCSRL
Sbjct: 423 EFDVANRRVGFSKADCSRL 435

BLAST of CmoCh14G008870 vs. TrEMBL
Match: B9T2R1_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 PE=3 SV=1)

HSP 1 Score: 535.8 bits (1379), Expect = 4.7e-149
Identity = 284/437 (64.99%), Postives = 330/437 (75.51%), Query Frame = 1

Query: 1   MLLSLF----YLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAY-GSVKL 60
           MLL LF    ++S  S +   +H  + SF  S   R  P SS   P     S++    K 
Sbjct: 6   MLLFLFLFFTFVSSSSPNPERTHLNTSSFSFSFPLRSLPASSPSKPSSPFRSSFVAQTKQ 65

Query: 61  P-------FKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRK-VHKKLLKPKTTSF 120
           P       FKYS AL+VSLPIGTPPQ   +VLDTGSQLSWIQCH+K V KK   P TTSF
Sbjct: 66  PSYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKK--PPPTTSF 125

Query: 121 DPSLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTF 180
           DPSLSSSFS+LPCNHPLCKPRIPDFTLPT+CDQNRLCHYSYFYADGT AEG+LVREK TF
Sbjct: 126 DPSLSSSFSVLPCNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITF 185

Query: 181 SNSRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGL 240
           S+S++TPP+ILGCA+AST+ +GILGMN GR SF SQAKISKFSYCVP    R G  STG 
Sbjct: 186 SSSQSTPPLILGCAEASTDEKGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGS 245

Query: 241 FYLGDNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRS 300
           FYLG+NPNS  F+YI+LLTF  SQRSPNLDPLAYT+P++GI++   RLNIS+ +F+PD S
Sbjct: 246 FYLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPS 305

Query: 301 GSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGR 360
           G+GQT+IDSGS+ TYLVDEAY KV+EE+V+LVGP +KKGY Y  V+DMCF DG   E+GR
Sbjct: 306 GAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCF-DGNPMEIGR 365

Query: 361 RIRDMSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIE 420
            I +M FEFE GVEI + K   VL +V  GV C+G GRS  LG ASNIIG  HQ+N W+E
Sbjct: 366 LIGNMVFEFEKGVEIVIDKWR-VLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVE 425

Query: 421 YDLANRRIGFGGADCSR 422
           YDLANRRIG G ADCSR
Sbjct: 426 YDLANRRIGLGKADCSR 438

BLAST of CmoCh14G008870 vs. TrEMBL
Match: A0A067JPK4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22524 PE=3 SV=1)

HSP 1 Score: 534.6 bits (1376), Expect = 1.1e-148
Identity = 261/363 (71.90%), Postives = 298/363 (82.09%), Query Frame = 1

Query: 62  ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCN 121
           AL+VSLPIGTPPQ   +VLDTGSQLSWIQCH+K  +KL  P TTSFDPSLSSSFS+LPCN
Sbjct: 2   ALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKAPRKL--PPTTSFDPSLSSSFSVLPCN 61

Query: 122 HPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCA 181
           HPLCKPRIPDFTLPT+CDQNRLCHYSYFYADGTLAEG+LVREKFTFSN+++TPP+ILGCA
Sbjct: 62  HPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTLAEGSLVREKFTFSNTQSTPPLILGCA 121

Query: 182 QASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDNPNSANFKY 241
           + S +++GILGMN GR SF SQAKISKFSYCVP    R G   TGLFYLGDNPNS  F Y
Sbjct: 122 EDSGDDKGILGMNLGRRSFASQAKISKFSYCVPTRGNRAGLSPTGLFYLGDNPNSGGFHY 181

Query: 242 ISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLT 301
           I+LLTF  SQRSPNLDPLAYT+P++GI+I   RLNI ++VF+PD SGSGQTM+DSGS+ T
Sbjct: 182 INLLTFTPSQRSPNLDPLAYTVPMQGIRIGNTRLNIPASVFRPDPSGSGQTMVDSGSEFT 241

Query: 302 YLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVE 361
           YLVDEAY KV+EEIV++ G  +KK Y Y  V+DMCF DG   E+GR I +M FEFE GVE
Sbjct: 242 YLVDEAYNKVREEIVRVAGTKLKKNYVYGGVSDMCF-DGNPVEIGRLIGNMVFEFEKGVE 301

Query: 362 ISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGAD 421
           I V + E VL  V  GV CVG GRS  LG ASNIIG  HQ+N W+E+DLANRR+GFG AD
Sbjct: 302 IVVDR-ERVLANVGNGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDLANRRVGFGKAD 360

BLAST of CmoCh14G008870 vs. TAIR10
Match: AT5G37540.1 (AT5G37540.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 543.1 bits (1398), Expect = 1.5e-154
Identity = 281/432 (65.05%), Postives = 329/432 (76.16%), Query Frame = 1

Query: 2   LLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG--------SV 61
           LL +F+    S+S S S SLSL FPL+ S R +P ++S S   S LS           + 
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLT-SLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTF 71

Query: 62  KLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSS 121
           +   KYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P TTSFDPSLSS
Sbjct: 72  RSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSS 131

Query: 122 SFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTT 181
           SFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TT
Sbjct: 132 SFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT 191

Query: 182 PPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDN 241
           PP+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLGDN
Sbjct: 192 PPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDN 251

Query: 242 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 301
           PNS  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNI  +VF+PD  GSGQTM
Sbjct: 252 PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTM 311

Query: 302 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 361
           +DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+   + E+GR I D+ 
Sbjct: 312 VDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLV 371

Query: 362 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANR 421
           FEF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N W+E+D+ NR
Sbjct: 372 FEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNR 431

Query: 422 RIGFGGADCSRL 423
           R+GF  A+C  L
Sbjct: 432 RVGFSKAECRLL 441

BLAST of CmoCh14G008870 vs. TAIR10
Match: AT1G66180.1 (AT1G66180.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 508.8 bits (1309), Expect = 3.1e-144
Identity = 274/436 (62.84%), Postives = 314/436 (72.02%), Query Frame = 1

Query: 2   LLSLFYLSLLSLSFSHSHSLSLS------------FPLSLSKRPSPVSSSVSPLFSSLSA 61
           L   F+L+ +SLS S S  L L+            F  SL  R +P  SS    F S   
Sbjct: 8   LFFFFFLNYVSLSTSLSLHLPLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNFRSR-- 67

Query: 62  YGSVKLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDP 121
                  FKYS AL++SLPIGTPPQ   +VLDTGSQLSWIQCHRK  K   KPKT SFDP
Sbjct: 68  -------FKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRK--KLPPKPKT-SFDP 127

Query: 122 SLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSN 181
           SLSSSFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EK TFSN
Sbjct: 128 SLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSN 187

Query: 182 SRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFY 241
           +  TPP+ILGCA  S+++RGILGMN GRLSFVSQAKISKFSYC+P    R G   TG FY
Sbjct: 188 TEITPPLILGCATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFY 247

Query: 242 LGDNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGS 301
           LGDNPNS  FKY+SLLTFP+SQR PNLDPLAYT+P+ GI+    +LNIS +VF+PD  GS
Sbjct: 248 LGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 307

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KV+ EI+  VG  +KKGY Y   ADMCF DG  A + R I
Sbjct: 308 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCF-DGNVAMIPRLI 367

Query: 362 RDMSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYD 421
            D+ F F  GVEI V K E VL  V  G+ CVG GRS  LG ASNIIG VHQ+N W+E+D
Sbjct: 368 GDLVFVFTRGVEILVPK-ERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 427

Query: 422 LANRRIGFGGADCSRL 423
           + NRR+GF  ADCSR+
Sbjct: 428 VTNRRVGFAKADCSRV 429

BLAST of CmoCh14G008870 vs. TAIR10
Match: AT5G02190.1 (AT5G02190.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 248.1 bits (632), Expect = 1.0e-65
Identity = 159/448 (35.49%), Postives = 234/448 (52.23%), Query Frame = 1

Query: 4   SLFYLSLLSLS----FSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG---SVKLP 63
           +LF L +LS+      S S S S SF  S     S   + V PL + ++      + KL 
Sbjct: 7   ALFLLLVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLH 66

Query: 64  FKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFS 123
           F ++  L V+L +GTPPQ   +V+DTGS+LSW++C+R  +         +FDP+ SSS+S
Sbjct: 67  FHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPN----PVNNFDPTRSSSYS 126

Query: 124 LLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPV 183
            +PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS     +
Sbjct: 127 PIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNL 186

Query: 184 ILGC--------AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLG 243
           I GC         +  T+  G+LGMN G LSF+SQ    KFSYC+ G    D  G   LG
Sbjct: 187 IFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLG 246

Query: 244 DNPNSANFKYISLLTFPK----SQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRS 303
           D    +NF +++ L +      S   P  D +AYT+ L GIK+ G  L I  +V  PD +
Sbjct: 247 D----SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHT 306

Query: 304 GSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLM----KKGYEYAAVADMCFNDGET- 363
           G+GQTM+DSG+  T+L+   Y  ++   +     ++       + +    D+C+      
Sbjct: 307 GAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVR 366

Query: 364 --AEVGRRIRDMSFEFENGVEISVGKGEGVLTEV------EKGVKCVGFGRSGRLGIASN 420
             + +  R+  +S  FE G EI+V  G+ +L  V         V C  FG S  +G+ + 
Sbjct: 367 IRSGILHRLPTVSLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAY 426

BLAST of CmoCh14G008870 vs. TAIR10
Match: AT2G39710.1 (AT2G39710.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 244.6 bits (623), Expect = 1.1e-64
Identity = 152/431 (35.27%), Postives = 228/431 (52.90%), Query Frame = 1

Query: 11  LSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSS-LSAYGSVKLPFKYSTALVVSLPI 70
           LS +F     L L FPL+  K  S   + +  L +  L    S KL F+++  L V+L +
Sbjct: 12  LSKNFLRISVLLLIFPLTFCKTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVTLTVTLAV 71

Query: 71  GTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTS-FDPSLSSSFSLLPCNHPLCKPR 130
           G PPQ   +VLDTGS+LSW+ C +        P   S F+P  SS++S +PC+ P+C+ R
Sbjct: 72  GDPPQNISMVLDTGSELSWLHCKKS-------PNLGSVFNPVSSSTYSPVPCSSPICRTR 131

Query: 131 IPDFTLPTSCD-QNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC------- 190
             D  +P SCD +  LCH +  YAD T  EGNL  E F    S T P  + GC       
Sbjct: 132 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCMDSGLSS 191

Query: 191 -AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYI 250
            ++   ++ G++GMN G LSFV+Q   SKFSYC+   +G DS+G   LGD    A++ ++
Sbjct: 192 NSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI---SGSDSSGFLLLGD----ASYSWL 251

Query: 251 SLLTFP----KSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGS 310
             + +     +S   P  D +AYT+ L+GI++    L++  +VF PD +G+GQTM+DSG+
Sbjct: 252 GPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGT 311

Query: 311 DLTYLVDEAYEKVKEEIVKLVGPLMK----KGYEYAAVADMCFNDGETAEVGRRIRDMSF 370
             T+L+   Y  +K E +     +++      + +    D+C+  G T         M  
Sbjct: 312 QFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVS 371

Query: 371 EFENGVEISVG------KGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEY 417
               G E+SV       +  G  +E ++ V C  FG S  LGI + +IG  HQ+N W+E+
Sbjct: 372 LMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEF 427

BLAST of CmoCh14G008870 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 154.5 bits (389), Expect = 1.5e-37
Identity = 107/359 (29.81%), Postives = 166/359 (46.24%), Query Frame = 1

Query: 69  IGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHPLCKPR 128
           +GTP +   LVLDTGS ++WIQC         +     F+P+ SS++  L C+ P C   
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCAD--CYQQSDPVFNPTSSSTYKSLTCSAPQCS-- 227

Query: 129 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS---- 188
                L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +    
Sbjct: 228 ----LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLF 287

Query: 189 TENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFY----LGDNPNSANFKYIS 248
           T   G+LG+  G LS  +Q K + FSYC+  R    S+ L +    LG    +A      
Sbjct: 288 TGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLL--- 347

Query: 249 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 308
                   R+  +D   Y + L G  + G ++ +  A+F  D SGSG  ++D G+ +T L
Sbjct: 348 --------RNKKIDTFYY-VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 407

Query: 309 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 368
             +AY  +++  +KL   L KKG    ++ D C++    + V  ++  ++F F  G  + 
Sbjct: 408 QTQAYNSLRDAFLKLTVNL-KKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLD 467

Query: 369 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADC 420
           +     ++   + G  C  F  +     + +IIG V Q+ T I YDL+   IG  G  C
Sbjct: 468 LPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh14G008870 vs. NCBI nr
Match: gi|659114575|ref|XP_008457122.1| (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])

HSP 1 Score: 685.3 bits (1767), Expect = 6.9e-194
Identity = 349/433 (80.60%), Postives = 380/433 (87.76%), Query Frame = 1

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSL------SAYGSVK 60
           MLL LF LSL +L FS S+S+SL FPLSLS++PS    ++SP++ S       S++GS K
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKPS----NISPIYGSQLYAKKPSSHGSFK 60

Query: 61  LPFKYS-TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLL---KPKTTSFDPS 120
           LPFKYS TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH KV KKL    KPKT SFDPS
Sbjct: 61  LPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKVKKKLPPLPKPKTASFDPS 120

Query: 121 LSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNS 180
           LSSSFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ SNS
Sbjct: 121 LSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNS 180

Query: 181 RTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDN 240
            +TPPVILGCAQASTENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLGDN
Sbjct: 181 LSTPPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDN 240

Query: 241 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 300
           PNS+ FKY+++LTFP+SQ SPNLDPLAYTLP+KGIKIAG RLNIS A FKPD  GSGQTM
Sbjct: 241 PNSSRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTM 300

Query: 301 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 360
           IDSGSDLTYLVDEAYEKVKEE+V+LVG  MKKGY YAAVADMCF+   TAEVGRRI  +S
Sbjct: 301 IDSGSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGIS 360

Query: 361 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANR 420
           FEF+NGVEI VG+GEGVLTEVEKGVKCVGFGRS RLGI SNIIGTVHQ+N W+EYDL NR
Sbjct: 361 FEFDNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNR 420

Query: 421 RIGFGGADCSRLK 424
           RIGFGGA+CSRLK
Sbjct: 421 RIGFGGAECSRLK 429

BLAST of CmoCh14G008870 vs. NCBI nr
Match: gi|778679910|ref|XP_011651212.1| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 684.9 bits (1766), Expect = 9.1e-194
Identity = 347/435 (79.77%), Postives = 380/435 (87.36%), Query Frame = 1

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSL-------SAYGSV 60
           MLL LF LSL +LSFS S+SLSL FPLSL+++PS    +++PL+ S        S++G  
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPS----NITPLYYSSQLYVKKPSSHGPF 60

Query: 61  KLPFKYST-ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLL----KPKTTSFD 120
           KLPFKYS+ ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH K  KK L    KPKT SFD
Sbjct: 61  KLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFD 120

Query: 121 PSLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS 180
           PSLSSSFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS
Sbjct: 121 PSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS 180

Query: 181 NSRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLG 240
           NS +TPPVILGCAQ STENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLG
Sbjct: 181 NSLSTPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLG 240

Query: 241 DNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQ 300
           DNPNS+ FKY+++LTFP+SQ SPNLDPLAYTLP+K IKIAG RLNI  A FKPD  GSGQ
Sbjct: 241 DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQ 300

Query: 301 TMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRD 360
           TMIDSGSDLTYLVDEAYEKVKEE+V+LVG +MKKGY YAAVADMCF+ G T EVGRRI D
Sbjct: 301 TMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGD 360

Query: 361 MSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLA 420
           MSFEF+NGVEI VG+GEGVLTEVEKGVKCVG GRSGRLGI SNIIGTVHQ+N W+EYDLA
Sbjct: 361 MSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLA 420

Query: 421 NRRIGFGGADCSRLK 424
           N+R+GFGGA+CSRLK
Sbjct: 421 NKRVGFGGAECSRLK 431

BLAST of CmoCh14G008870 vs. NCBI nr
Match: gi|778679913|ref|XP_004140731.2| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 682.2 bits (1759), Expect = 5.9e-193
Identity = 348/430 (80.93%), Postives = 378/430 (87.91%), Query Frame = 1

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPS-PVSSSVSPLFSSL-SAYGSVKLPFK 60
           MLL LF LSL +LSFS S+SLSL FPLSLS++PS  + S  S L++   S+YGS KLPFK
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSEKPSNTIPSYSSQLYAKRPSSYGSFKLPFK 60

Query: 61  YS-TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLL----KPKTTSFDPSLSS 120
           YS TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH K  KK L    KPKT SFDPSLSS
Sbjct: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSS 120

Query: 121 SFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTT 180
           SFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS S +T
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLST 180

Query: 181 PPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNS 240
           PPVILGCAQASTENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLGDNPNS
Sbjct: 181 PPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNS 240

Query: 241 ANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDS 300
           + FKY+++LTFP+SQ SPNLDPLAYTLP+K IKIAG RLNI  A FKPD  GSGQTMIDS
Sbjct: 241 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 300

Query: 301 GSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEF 360
           GSDLTYLVDEAYEKVKEE+V+LVG +MKKGY YA VADMCF+ G TAEVGRRI  +SFEF
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEF 360

Query: 361 ENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIG 420
           +NGVEI VG+GEGVLTEVEKGVKCVG GRS RLGI SNIIGTVHQ+N W+EYDLAN+R+G
Sbjct: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420

Query: 421 FGGADCSRLK 424
           FGGA+CSRLK
Sbjct: 421 FGGAECSRLK 430

BLAST of CmoCh14G008870 vs. NCBI nr
Match: gi|729306436|ref|XP_010528567.1| (PREDICTED: aspartic proteinase PCS1-like [Tarenaya hassleriana])

HSP 1 Score: 543.5 bits (1399), Expect = 3.3e-151
Identity = 281/426 (65.96%), Postives = 329/426 (77.23%), Query Frame = 1

Query: 5   LFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYGSVKLP-------- 64
           LF+ + +SLS +HS  LSL+     S R SP ++S SP  +SL A      P        
Sbjct: 16  LFFCNAISLSSTHSLHLSLT-----SLRLSPATNS-SPFATSLVAGRKTPPPSYPYSFRS 75

Query: 65  -FKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSF 124
            FKYS AL+VSLPIGTPPQ   +VLDTGSQLSWIQCH K  +K   P TTSFDPSLSSSF
Sbjct: 76  NFKYSMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHGK--RKPSPPPTTSFDPSLSSSF 135

Query: 125 SLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPP 184
           S++PC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLVREKFTFS + +TPP
Sbjct: 136 SVVPCSHPLCKPRIPDFTLPTSCDTNRLCHYSYFYADGTFAEGNLVREKFTFSKTESTPP 195

Query: 185 VILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSAN 244
           + LGCA  S+++RGILGMN GRLSFVSQA++SKFSYCVP R GP+STG FYLG+NPNS  
Sbjct: 196 LTLGCATESSDDRGILGMNRGRLSFVSQARVSKFSYCVPTRHGPNSTGSFYLGENPNSRG 255

Query: 245 FKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGS 304
           FKY+SLLTF +SQR PNLDPLAYT+PL+GI+I   RLNIS++VF+PD  GSGQTM+DSGS
Sbjct: 256 FKYVSLLTFRQSQRMPNLDPLAYTVPLQGIRIGRKRLNISASVFRPDSGGSGQTMVDSGS 315

Query: 305 DLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFEN 364
           + TYLVDEAY+KV+EEIV+LVG  +KK Y Y   ADMCF  G   ++GR I D+ FEF  
Sbjct: 316 EFTYLVDEAYDKVREEIVRLVGARLKKDYVYGGSADMCF-VGNPIQIGRSIGDLVFEFGR 375

Query: 365 GVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFG 422
           GVEI V K E VL  VE GV C+G GRS  LG ASNIIG +HQ+N W+E+D+ NRR+GFG
Sbjct: 376 GVEILVEK-ERVLVHVEGGVHCIGIGRSSMLGAASNIIGNIHQQNLWVEFDVTNRRMGFG 431

BLAST of CmoCh14G008870 vs. NCBI nr
Match: gi|18421660|ref|NP_568551.1| (aspartyl protease family protein [Arabidopsis thaliana])

HSP 1 Score: 543.1 bits (1398), Expect = 4.3e-151
Identity = 281/432 (65.05%), Postives = 329/432 (76.16%), Query Frame = 1

Query: 2   LLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG--------SV 61
           LL +F+    S+S S S SLSL FPL+ S R +P ++S S   S LS           + 
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLT-SLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTF 71

Query: 62  KLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSS 121
           +   KYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P TTSFDPSLSS
Sbjct: 72  RSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSS 131

Query: 122 SFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTT 181
           SFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TT
Sbjct: 132 SFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT 191

Query: 182 PPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDN 241
           PP+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLGDN
Sbjct: 192 PPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDN 251

Query: 242 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 301
           PNS  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNI  +VF+PD  GSGQTM
Sbjct: 252 PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTM 311

Query: 302 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 361
           +DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+   + E+GR I D+ 
Sbjct: 312 VDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLV 371

Query: 362 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANR 421
           FEF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N W+E+D+ NR
Sbjct: 372 FEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNR 431

Query: 422 RIGFGGADCSRL 423
           R+GF  A+C  L
Sbjct: 432 RVGFSKAECRLL 441

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PCS1L_ARATH1.8e-6435.49Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1[more]
NEP2_NEPGR6.1e-4131.59Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
NEP1_NEPGR2.4e-3730.03Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
ASPG1_ARATH2.7e-3629.81Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
APF2_ARATH8.9e-3228.73Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
Q9FGI3_ARATH3.0e-15165.05AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1[more]
D7MID8_ARALL6.6e-15165.36Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA... [more]
V4LXR6_EUTSA5.6e-15064.69Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10028350mg PE=3 SV=1[more]
B9T2R1_RICCO4.7e-14964.99Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 ... [more]
A0A067JPK4_JATCU1.1e-14871.90Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22524 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G37540.11.5e-15465.05 Eukaryotic aspartyl protease family protein[more]
AT1G66180.13.1e-14462.84 Eukaryotic aspartyl protease family protein[more]
AT5G02190.11.0e-6535.49 Eukaryotic aspartyl protease family protein[more]
AT2G39710.11.1e-6435.27 Eukaryotic aspartyl protease family protein[more]
AT3G18490.11.5e-3729.81 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659114575|ref|XP_008457122.1|6.9e-19480.60PREDICTED: aspartic proteinase PCS1 [Cucumis melo][more]
gi|778679910|ref|XP_011651212.1|9.1e-19479.77PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|778679913|ref|XP_004140731.2|5.9e-19380.93PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|729306436|ref|XP_010528567.1|3.3e-15165.96PREDICTED: aspartic proteinase PCS1-like [Tarenaya hassleriana][more]
gi|18421660|ref|NP_568551.1|4.3e-15165.05aspartyl protease family protein [Arabidopsis thaliana][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh14G008870.1CmoCh14G008870.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 391..406
score: 3.0E-5coord: 289..300
score: 3.0E-5coord: 69..89
score: 3.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 5..422
score: 2.1E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 78..89
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 232..421
score: 7.0E-31coord: 64..229
score: 1.1
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 64..421
score: 8.71
NoneNo IPR availablePANTHERPTHR13683:SF327ASPARTYL PROTEASE FAMILY PROTEINcoord: 5..422
score: 2.1E