CmoCh14G008870 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh14G008870
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: aspartic proteinase PCS1-like
Location: Cmo_Chr14: 4701231 .. 4702502 (-)
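The location above can be sanity-checked against the sequence lengths listed further down. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr14: 4701231 .. 4702502 (-)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the span is 1272 bp, which is 424 codons: 423 residues plus a stop codon, consistent with the 423-residue protein reported in the TrEMBL self-match.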
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCTCTCTCTCTTCTATCTCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCTCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTTCCTCTGTTTCTCCTCTCTTCTCTTCTCTCTCCGCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGGAAAGTTCATAAGAAATTGCTCAAGCCCAAAACGACGTCGTTCGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGTACCTTGGCTGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAATACTGGACGTCTCTCCTTCGTCTCCCAAGCTAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACTGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCATTGAAGGGCATAAAAATAGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGACAGGAGTGGGTCCGGTCAAACCATGATTGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACGGCGGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAACACGTGGATAGAATATGATTTGGCCAATAGGAGAATAGGTTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

mRNA sequence

ATGCTTCTCTCTCTCTTCTATCTCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCTCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTTCCTCTGTTTCTCCTCTCTTCTCTTCTCTCTCCGCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGGAAAGTTCATAAGAAATTGCTCAAGCCCAAAACGACGTCGTTCGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGTACCTTGGCTGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAATACTGGACGTCTCTCCTTCGTCTCCCAAGCTAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACTGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCATTGAAGGGCATAAAAATAGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGACAGGAGTGGGTCCGGTCAAACCATGATTGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACGGCGGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAACACGTGGATAGAATATGATTTGGCCAATAGGAGAATAGGTTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

Coding sequence (CDS)

ATGCTTCTCTCTCTCTTCTATCTCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCTCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTTCCTCTGTTTCTCCTCTCTTCTCTTCTCTCTCCGCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGGAAAGTTCATAAGAAATTGCTCAAGCCCAAAACGACGTCGTTCGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGTACCTTGGCTGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAATACTGGACGTCTCTCCTTCGTCTCCCAAGCTAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACTGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCATTGAAGGGCATAAAAATAGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGACAGGAGTGGGTCCGGTCAAACCATGATTGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACGGCGGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAACACGTGGATAGAATATGATTTGGCCAATAGGAGAATAGGTTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

Protein sequence

MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYGSVKLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADCSRLK
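The protein sequence can be cross-checked against the CDS by translating it with the standard genetic code. The sketch below is illustrative only: it assumes the standard code (NCBI translation table 1) applies, uses only the first 30 and last 27 nucleotides of the CDS shown above rather than the full sequence, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 27 nucleotides of the CDS on this page (both in frame).
cds_head = "ATGCTTCTCTCTCTCTTCTATCTCTCTCTA"
cds_tail = "GGAGCCGACTGTAGCAGATTGAAGTGA"

head_protein = translate(cds_head)
tail_protein = translate(cds_tail)
print(head_protein, tail_protein)
```

The two fragments translate to the first ten residues of the protein and to its last eight residues followed by the stop codon.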
Homology
BLAST of CmoCh14G008870 vs. ExPASy Swiss-Prot
Match: Q9LZL3 (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 1.8e-64
Identity = 159/448 (35.49%), Positives = 234/448 (52.23%), Query Frame = 0

Query: 4   SLFYLSLLSL----SFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG---SVKLP 63
           +LF L +LS+      S S S S SF  S     S   + V PL + ++      + KL 
Sbjct: 7   ALFLLLVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLH 66

Query: 64  FKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFS 123
           F ++  L V+L +GTPPQ   +V+DTGS+LSW++C+R  +         +FDP+ SSS+S
Sbjct: 67  FHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPN----PVNNFDPTRSSSYS 126

Query: 124 LLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPV 183
            +PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS     +
Sbjct: 127 PIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNL 186

Query: 184 ILGC--------AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLG 243
           I GC         +  T+  G+LGMN G LSF+SQ    KFSYC+ G    D  G   LG
Sbjct: 187 IFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLG 246

Query: 244 DNPNSANFKYISLLTFPK----SQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRS 303
           D    +NF +++ L +      S   P  D +AYT+ L GIK+ G  L I  +V  PD +
Sbjct: 247 D----SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHT 306

Query: 304 GSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLM----KKGYEYAAVADMCFNDGET- 363
           G+GQTM+DSG+  T+L+   Y  ++   +     ++       + +    D+C+      
Sbjct: 307 GAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVR 366

Query: 364 --AEVGRRIRDMSFEFENGVEISVGKGEGVLTEV------EKGVKCVGFGRSGRLGIASN 420
             + +  R+  +S  FE G EI+V  G+ +L  V         V C  FG S  +G+ + 
Sbjct: 367 IRSGILHRLPTVSLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAY 426

BLAST of CmoCh14G008870 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 6.4e-41
Identity = 115/364 (31.59%), Positives = 175/364 (48.08%), Query Frame = 0

Query: 64  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHP 123
           ++++ IGTP      ++DTGS L W QC  +   +     T  F+P  SSSFS LPC   
Sbjct: 97  LMNVAIGTPDSSFSAIMDTGSDLIWTQC--EPCTQCFSQPTPIFNPQDSSSFSTLPCESQ 156

Query: 124 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQ- 183
            C+       LP+    N  C Y+Y Y DG+  +G +  E FTF  S + P +  GC + 
Sbjct: 157 YCQ------DLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIAFGCGED 216

Query: 184 ----ASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKY 243
                     G++GM  G LS  SQ  + +FSYC+    G  S     LG   +      
Sbjct: 217 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTS-YGSSSPSTLALGSAASGVPEGS 276

Query: 244 ISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLT 303
            S      S     L+P  Y + L+GI + G+ L I S+ F+    G+G  +IDSG+ LT
Sbjct: 277 PSTTLIHSS-----LNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 336

Query: 304 YLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCF---NDGETAEVGRRIRDMSFEFEN 363
           YL  +AY  V +     +   +    E ++    CF   +DG T +V     ++S +F+ 
Sbjct: 337 YLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTCFQQPSDGSTVQV----PEISMQFDG 396

Query: 364 GVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFG 420
           GV +++G+ + +L    +GV C+  G S +LGI  +I G + Q+ T + YDL N  + F 
Sbjct: 397 GV-LNLGE-QNILISPAEGVICLAMGSSSQLGI--SIFGNIQQQETQVLYDLQNLAVSFV 435

BLAST of CmoCh14G008870 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 158.3 bits (399), Expect = 1.9e-37
Identity = 109/363 (30.03%), Positives = 165/363 (45.45%), Query Frame = 0

Query: 64  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHP 123
           +++L IGTP QP   ++DTGS L W QC  +   +     T  F+P  SSSFS LPC+  
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQC--QPCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 155

Query: 124 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQ- 183
           LC+       L +    N  C Y+Y Y DG+  +G++  E  TF  S + P +  GC + 
Sbjct: 156 LCQ------ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTF-GSVSIPNITFGCGEN 215

Query: 184 ----ASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKY 243
                     G++GM  G LS  SQ  ++KFSYC+    G  +     LG   NS     
Sbjct: 216 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMT-PIGSSTPSNLLLGSLANSVTAGS 275

Query: 244 ISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFK-PDRSGSGQTMIDSGSDL 303
            +      SQ      P  Y + L G+ +   RL I  + F     +G+G  +IDSG+ L
Sbjct: 276 PNTTLIQSSQ-----IPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 335

Query: 304 TYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENG- 363
           TY V+ AY+ V++E +  +   +  G   ++  D+CF          +I      F+ G 
Sbjct: 336 TYFVNNAYQSVRQEFISQINLPVVNG--SSSGFDLCFQTPSDPS-NLQIPTFVMHFDGGD 395

Query: 364 VEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGG 420
           +E+     E        G+ C+  G S +     +I G + Q+N  + YD  N  + F  
Sbjct: 396 LEL---PSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVVSFAS 434

BLAST of CmoCh14G008870 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 3.6e-36
Identity = 107/359 (29.81%), Positives = 167/359 (46.52%), Query Frame = 0

Query: 69  IGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHPLCKPR 128
           +GTP +   LVLDTGS ++WIQC  +      +     F+P+ SS++  L C+ P C   
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQC--EPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCS-- 227

Query: 129 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS---- 188
                L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +    
Sbjct: 228 ----LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLF 287

Query: 189 TENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFY----LGDNPNSANFKYIS 248
           T   G+LG+  G LS  +Q K + FSYC+  R    S+ L +    LG    +A      
Sbjct: 288 TGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPL---- 347

Query: 249 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 308
                   R+  +D   Y + L G  + G ++ +  A+F  D SGSG  ++D G+ +T L
Sbjct: 348 -------LRNKKIDTF-YYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 407

Query: 309 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 368
             +AY  +++  +KL   L KKG    ++ D C++    + V  ++  ++F F  G  + 
Sbjct: 408 QTQAYNSLRDAFLKLTVNL-KKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLD 467

Query: 369 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADC 420
           +     ++   + G  C  F  +     + +IIG V Q+ T I YDL+   IG  G  C
Sbjct: 468 LPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh14G008870 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 9.2e-32
Identity = 104/362 (28.73%), Positives = 176/362 (48.62%), Query Frame = 0

Query: 67  LPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHPLCK 126
           L +GTP +   +VLDTGS + W+QC     ++        FDP  S +++ +PC+ P C+
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQC--APCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 205

Query: 127 PRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS-- 186
            R+      T   + + C Y   Y DG+   G+   E  TF  +R    V LGC   +  
Sbjct: 206 -RLDSAGCNT---RRKTCLYQVSYGDGSFTVGDFSTETLTFRRNR-VKGVALGCGHDNEG 265

Query: 187 --TENRGILGMNTGRLSFVSQAK---ISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYI 246
                 G+LG+  G+LSF  Q       KFSYC+  R+          G+   S   ++ 
Sbjct: 266 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 325

Query: 247 SLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRL-NISSAVFKPDRSGSGQTMIDSGSDLT 306
            LL+ PK      LD   Y + L GI + G R+  +++++FK D+ G+G  +IDSG+ +T
Sbjct: 326 PLLSNPK------LDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVT 385

Query: 307 YLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVE 366
            L+  AY  +++   ++    +K+  ++ ++ D CF+     EV  ++  +   F  G +
Sbjct: 386 RLIRPAYIAMRDAF-RVGAKTLKRAPDF-SLFDTCFDLSNMNEV--KVPTVVLHF-RGAD 445

Query: 367 ISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGAD 421
           +S+     ++     G  C  F  +G +G   +IIG + Q+   + YDLA+ R+GF    
Sbjct: 446 VSLPATNYLIPVDTNGKFCFAF--AGTMG-GLSIIGNIQQQGFRVVYDLASSRVGFAPGG 485

BLAST of CmoCh14G008870 vs. ExPASy TrEMBL
Match: A0A6J1F8Z2 (aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111441919 PE=3 SV=1)

HSP 1 Score: 844.0 bits (2179), Expect = 2.8e-241
Identity = 423/423 (100.00%), Positives = 423/423 (100.00%), Query Frame = 0

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYGSVKLPFKYS 60
           MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYGSVKLPFKYS
Sbjct: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYGSVKLPFKYS 60

Query: 61  TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPC 120
           TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPC
Sbjct: 61  TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPC 120

Query: 121 NHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC 180
           NHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC
Sbjct: 121 NHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC 180

Query: 181 AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYIS 240
           AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYIS
Sbjct: 181 AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYIS 240

Query: 241 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 300
           LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL
Sbjct: 241 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 300

Query: 301 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 360
           VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS
Sbjct: 301 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 360

Query: 361 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADCS 420
           VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADCS
Sbjct: 361 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADCS 420

Query: 421 RLK 424
           RLK
Sbjct: 421 RLK 423

BLAST of CmoCh14G008870 vs. ExPASy TrEMBL
Match: A0A6J1J2U3 (aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111482170 PE=3 SV=1)

HSP 1 Score: 825.5 bits (2131), Expect = 1.0e-235
Identity = 413/423 (97.64%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYGSVKLPFKYS 60
           MLLSLFY+SLLSLSFSHSHSL LSFPLSLSKRPSP    VSPLFSSLS+YGSVKLPFKYS
Sbjct: 1   MLLSLFYISLLSLSFSHSHSLPLSFPLSLSKRPSP----VSPLFSSLSSYGSVKLPFKYS 60

Query: 61  TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPC 120
           TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRK+HKKLLKPKT SFDPSLSSSFSLLPC
Sbjct: 61  TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPC 120

Query: 121 NHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC 180
           NHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC
Sbjct: 121 NHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC 180

Query: 181 AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYIS 240
           AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYIS
Sbjct: 181 AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYIS 240

Query: 241 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 300
           LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL
Sbjct: 241 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 300

Query: 301 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 360
           VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS
Sbjct: 301 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 360

Query: 361 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADCS 420
           VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTW+EYDLANRRIGFGGADCS
Sbjct: 361 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGADCS 419

Query: 421 RLK 424
           RLK
Sbjct: 421 RLK 419

BLAST of CmoCh14G008870 vs. ExPASy TrEMBL
Match: A0A5A7U1M2 (Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold363G00330 PE=3 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 1.7e-193
Identity = 349/433 (80.60%), Positives = 380/433 (87.76%), Query Frame = 0

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSL------SAYGSVK 60
           MLL LF LSL +L FS S+S+SL FPLSLS++P    S++SP++ S       S++GS K
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKP----SNISPIYGSQLYAKKPSSHGSFK 60

Query: 61  LPFKY-STALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKK---LLKPKTTSFDPS 120
           LPFKY STALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH KV KK   L KPKT SFDPS
Sbjct: 61  LPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKVKKKLPPLPKPKTASFDPS 120

Query: 121 LSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNS 180
           LSSSFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ SNS
Sbjct: 121 LSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNS 180

Query: 181 RTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDN 240
            +TPPVILGCAQASTENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLGDN
Sbjct: 181 LSTPPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDN 240

Query: 241 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 300
           PNS+ FKY+++LTFP+SQ SPNLDPLAYTLP+KGIKIAG RLNIS A FKPD  GSGQTM
Sbjct: 241 PNSSRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTM 300

Query: 301 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 360
           IDSGSDLTYLVDEAYEKVKEE+V+LVG  MKKGY YAAVADMCF+   TAEVGRRI  +S
Sbjct: 301 IDSGSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGIS 360

Query: 361 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANR 420
           FEF+NGVEI VG+GEGVLTEVEKGVKCVGFGRS RLGI SNIIGTVHQ+N W+EYDL NR
Sbjct: 361 FEFDNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNR 420

Query: 421 RIGFGGADCSRLK 424
           RIGFGGA+CSRLK
Sbjct: 421 RIGFGGAECSRLK 429

BLAST of CmoCh14G008870 vs. ExPASy TrEMBL
Match: A0A1S3C4D2 (aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103496869 PE=3 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 1.7e-193
Identity = 349/433 (80.60%), Positives = 380/433 (87.76%), Query Frame = 0

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSL------SAYGSVK 60
           MLL LF LSL +L FS S+S+SL FPLSLS++P    S++SP++ S       S++GS K
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKP----SNISPIYGSQLYAKKPSSHGSFK 60

Query: 61  LPFKY-STALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKK---LLKPKTTSFDPS 120
           LPFKY STALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH KV KK   L KPKT SFDPS
Sbjct: 61  LPFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKVKKKLPPLPKPKTASFDPS 120

Query: 121 LSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNS 180
           LSSSFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ SNS
Sbjct: 121 LSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNS 180

Query: 181 RTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDN 240
            +TPPVILGCAQASTENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLGDN
Sbjct: 181 LSTPPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDN 240

Query: 241 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 300
           PNS+ FKY+++LTFP+SQ SPNLDPLAYTLP+KGIKIAG RLNIS A FKPD  GSGQTM
Sbjct: 241 PNSSRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTM 300

Query: 301 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 360
           IDSGSDLTYLVDEAYEKVKEE+V+LVG  MKKGY YAAVADMCF+   TAEVGRRI  +S
Sbjct: 301 IDSGSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGIS 360

Query: 361 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANR 420
           FEF+NGVEI VG+GEGVLTEVEKGVKCVGFGRS RLGI SNIIGTVHQ+N W+EYDL NR
Sbjct: 361 FEFDNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNR 420

Query: 421 RIGFGGADCSRLK 424
           RIGFGGA+CSRLK
Sbjct: 421 RIGFGGAECSRLK 429

BLAST of CmoCh14G008870 vs. ExPASy TrEMBL
Match: A0A6J1F1P3 (aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111438856 PE=3 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 4.4e-170
Identity = 309/423 (73.05%), Positives = 346/423 (81.80%), Query Frame = 0

Query: 1   MLLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYGSVKLPFKYS 60
           M LSL  LSLL L FS S+SLSLSFPL+       VSS  + L  S+      K PF+YS
Sbjct: 1   MPLSLLLLSLLGLLFSPSNSLSLSFPLT----SHSVSSQEASLSLSIKTKSHGKFPFQYS 60

Query: 61  TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPC 120
            ALVVS+PIG+PPQ  D+V+DTGSQLSWIQCH KV +K +KP    FDP LSSSFS LPC
Sbjct: 61  NALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPC 120

Query: 121 NHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC 180
           N  LC+PRIPDFTLPTSCD  R CHYSYFYADGTLAEGNLV EKFTFSNS TT  ++LGC
Sbjct: 121 NTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGC 180

Query: 181 AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYIS 240
           A AST+NRG+LGMNTGRLSF+SQAKISKFSYCVP RTG D TGLFYLGDNPNSA FKY++
Sbjct: 181 ATASTQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNSAKFKYVN 240

Query: 241 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 300
           +LTFPKS+ SPNLD  AYTLP+KGI+I  N+LNIS AVFKPD SG+GQTMIDSGSDLTYL
Sbjct: 241 MLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYL 300

Query: 301 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 360
           VDEAY KV+EE+V+LVGP+MKKGYEYAAVADMCF+    A VGRRI DM F+FENGVEI 
Sbjct: 301 VDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEIL 360

Query: 361 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADCS 420
           VGKGEG+LTEVE+GVKCVG GRS RL   SNIIG VHQ+N W+EYDL+N+RIGFG A CS
Sbjct: 361 VGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKCS 419

Query: 421 RLK 424
            LK
Sbjct: 421 GLK 419

BLAST of CmoCh14G008870 vs. TAIR 10
Match: AT5G37540.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 543.1 bits (1398), Expect = 2.0e-154
Identity = 281/432 (65.05%), Positives = 329/432 (76.16%), Query Frame = 0

Query: 2   LLSLFYLSLLSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG--------SV 61
           LL +F+    S+S S S SLSL FPL+ S R +P ++S S   S LS           + 
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLT-SLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTF 71

Query: 62  KLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSS 121
           +   KYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P TTSFDPSLSS
Sbjct: 72  RSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSS 131

Query: 122 SFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTT 181
           SFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TT
Sbjct: 132 SFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT 191

Query: 182 PPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDN 241
           PP+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLGDN
Sbjct: 192 PPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDN 251

Query: 242 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 301
           PNS  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNI  +VF+PD  GSGQTM
Sbjct: 252 PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTM 311

Query: 302 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 361
           +DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+   + E+GR I D+ 
Sbjct: 312 VDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLV 371

Query: 362 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANR 421
           FEF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N W+E+D+ NR
Sbjct: 372 FEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNR 431

Query: 422 RIGFGGADCSRL 423
           R+GF  A+C  L
Sbjct: 432 RVGFSKAECRLL 441

BLAST of CmoCh14G008870 vs. TAIR 10
Match: AT1G66180.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 508.8 bits (1309), Expect = 4.1e-144
Identity = 274/436 (62.84%), Positives = 314/436 (72.02%), Query Frame = 0

Query: 2   LLSLFYLSLLSLSFSHSHSLSLS------------FPLSLSKRPSPVSSSVSPLFSSLSA 61
           L   F+L+ +SLS S S  L L+            F  SL  R +P  SS    F S   
Sbjct: 8   LFFFFFLNYVSLSTSLSLHLPLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNFRS--- 67

Query: 62  YGSVKLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDP 121
                  FKYS AL++SLPIGTPPQ   +VLDTGSQLSWIQCHRK  K   KPK TSFDP
Sbjct: 68  ------RFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRK--KLPPKPK-TSFDP 127

Query: 122 SLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSN 181
           SLSSSFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EK TFSN
Sbjct: 128 SLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSN 187

Query: 182 SRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFY 241
           +  TPP+ILGCA  S+++RGILGMN GRLSFVSQAKISKFSYC+P    R G   TG FY
Sbjct: 188 TEITPPLILGCATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFY 247

Query: 242 LGDNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGS 301
           LGDNPNS  FKY+SLLTFP+SQR PNLDPLAYT+P+ GI+    +LNIS +VF+PD  GS
Sbjct: 248 LGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 307

Query: 302 GQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRI 361
           GQTM+DSGS+ T+LVD AY+KV+ EI+  VG  +KKGY Y   ADMCF DG  A + R I
Sbjct: 308 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCF-DGNVAMIPRLI 367

Query: 362 RDMSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYD 421
            D+ F F  GVEI V K E VL  V  G+ CVG GRS  LG ASNIIG VHQ+N W+E+D
Sbjct: 368 GDLVFVFTRGVEILVPK-ERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 427

Query: 422 LANRRIGFGGADCSRL 423
           + NRR+GF  ADCSR+
Sbjct: 428 VTNRRVGFAKADCSRV 429

BLAST of CmoCh14G008870 vs. TAIR 10
Match: AT5G02190.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 248.1 bits (632), Expect = 1.3e-65
Identity = 159/448 (35.49%), Positives = 234/448 (52.23%), Query Frame = 0

Query: 4   SLFYLSLLSL----SFSHSHSLSLSFPLSLSKRPSPVSSSVSPLFSSLSAYG---SVKLP 63
           +LF L +LS+      S S S S SF  S     S   + V PL + ++      + KL 
Sbjct: 7   ALFLLLVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLH 66

Query: 64  FKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFS 123
           F ++  L V+L +GTPPQ   +V+DTGS+LSW++C+R  +         +FDP+ SSS+S
Sbjct: 67  FHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPN----PVNNFDPTRSSSYS 126

Query: 124 LLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPV 183
            +PC+ P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS     +
Sbjct: 127 PIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNL 186

Query: 184 ILGC--------AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLG 243
           I GC         +  T+  G+LGMN G LSF+SQ    KFSYC+ G    D  G   LG
Sbjct: 187 IFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLG 246

Query: 244 DNPNSANFKYISLLTFPK----SQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRS 303
           D    +NF +++ L +      S   P  D +AYT+ L GIK+ G  L I  +V  PD +
Sbjct: 247 D----SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHT 306

Query: 304 GSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLM----KKGYEYAAVADMCFNDGET- 363
           G+GQTM+DSG+  T+L+   Y  ++   +     ++       + +    D+C+      
Sbjct: 307 GAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVR 366

Query: 364 --AEVGRRIRDMSFEFENGVEISVGKGEGVLTEV------EKGVKCVGFGRSGRLGIASN 420
             + +  R+  +S  FE G EI+V  G+ +L  V         V C  FG S  +G+ + 
Sbjct: 367 IRSGILHRLPTVSLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAY 426

BLAST of CmoCh14G008870 vs. TAIR 10
Match: AT2G39710.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 244.6 bits (623), Expect = 1.4e-64
Identity = 152/431 (35.27%), Positives = 227/431 (52.67%), Query Frame = 0

Query: 11  LSLSFSHSHSLSLSFPLSLSKRPSPVSSSVSPL-FSSLSAYGSVKLPFKYSTALVVSLPI 70
           LS +F     L L FPL+  K  S   + +  L    L    S KL F+++  L V+L +
Sbjct: 12  LSKNFLRISVLLLIFPLTFCKTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVTLTVTLAV 71

Query: 71  GTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTS-FDPSLSSSFSLLPCNHPLCKPR 130
           G PPQ   +VLDTGS+LSW+ C +        P   S F+P  SS++S +PC+ P+C+ R
Sbjct: 72  GDPPQNISMVLDTGSELSWLHCKK-------SPNLGSVFNPVSSSTYSPVPCSSPICRTR 131

Query: 131 IPDFTLPTSCD-QNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC------- 190
             D  +P SCD +  LCH +  YAD T  EGNL  E F    S T P  + GC       
Sbjct: 132 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI-GSVTRPGTLFGCMDSGLSS 191

Query: 191 -AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYI 250
            ++   ++ G++GMN G LSFV+Q   SKFSYC+   +G DS+G   LGD    A++ ++
Sbjct: 192 NSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI---SGSDSSGFLLLGD----ASYSWL 251

Query: 251 SLLTFP----KSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGS 310
             + +     +S   P  D +AYT+ L+GI++    L++  +VF PD +G+GQTM+DSG+
Sbjct: 252 GPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGT 311

Query: 311 DLTYLVDEAYEKVKEEIVKLVGPLMK----KGYEYAAVADMCFNDGETAEVGRRIRDMSF 370
             T+L+   Y  +K E +     +++      + +    D+C+  G T         M  
Sbjct: 312 QFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVS 371

Query: 371 EFENGVEISVG------KGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEY 417
               G E+SV       +  G  +E ++ V C  FG S  LGI + +IG  HQ+N W+E+
Sbjct: 372 LMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEF 427

BLAST of CmoCh14G008870 vs. TAIR 10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 154.1 bits (388), Expect = 2.6e-37
Identity = 107/359 (29.81%), Positives = 167/359 (46.52%), Query Frame = 0

Query: 69  IGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPCNHPLCKPR 128
           +GTP +   LVLDTGS ++WIQC  +      +     F+P+ SS++  L C+ P C   
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQC--EPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCS-- 227

Query: 129 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS---- 188
                L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +    
Sbjct: 228 ----LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLF 287

Query: 189 TENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFY----LGDNPNSANFKYIS 248
           T   G+LG+  G LS  +Q K + FSYC+  R    S+ L +    LG    +A      
Sbjct: 288 TGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPL---- 347

Query: 249 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 308
                   R+  +D   Y + L G  + G ++ +  A+F  D SGSG  ++D G+ +T L
Sbjct: 348 -------LRNKKIDTF-YYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 407

Query: 309 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 368
             +AY  +++  +KL   L KKG    ++ D C++    + V  ++  ++F F  G  + 
Sbjct: 408 QTQAYNSLRDAFLKLTVNL-KKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLD 467

Query: 369 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWIEYDLANRRIGFGGADC 420
           +     ++   + G  C  F  +     + +IIG V Q+ T I YDL+   IG  G  C
Sbjct: 468 LPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

The following BLAST results are available for this feature:
vs. ExPASy Swiss-Prot
Match Name   E-value    Identity (%)  Description
Q9LZL3       1.8e-64    35.49         Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1
Q766C2       6.4e-41    31.59         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q766C3       1.9e-37    30.03         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LS40       3.6e-36    29.81         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LNJ3       9.2e-32    28.73         Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...

vs. ExPASy TrEMBL
Match Name   E-value    Identity (%)  Description
A0A6J1F8Z2   2.8e-241   100.00        aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111441919 PE=3...
A0A6J1J2U3   1.0e-235   97.64         aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111482170 PE=3 S...
A0A5A7U1M2   1.7e-193   80.60         Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A1S3C4D2   1.7e-193   80.60         aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103496869 PE=3 SV=1
A0A6J1F1P3   4.4e-170   73.05         aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111438856 PE=3...

vs. TAIR 10
Match Name   E-value    Identity (%)  Description
AT5G37540.1  2.0e-154   65.05         Eukaryotic aspartyl protease family protein
AT1G66180.1  4.1e-144   62.84         Eukaryotic aspartyl protease family protein
AT5G02190.1  1.3e-65    35.49         Eukaryotic aspartyl protease family protein
AT2G39710.1  1.4e-64    35.27         Eukaryotic aspartyl protease family protein
AT3G18490.1  2.6e-37    29.81         Eukaryotic aspartyl protease family protein
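The identity percentages summarized above are the matched-residue counts divided by the alignment length given in each HSP header. As a hedged illustration, the sketch below parses one such header line and recomputes the percentages; the regular expression and the function name `parse_hsp_stats` are assumptions made for this example, not part of any BLAST tooling.

```python
import re

# Matches the counts in an HSP summary line; "Posi?tives" also tolerates
# the "Postives" misspelling that some report renderings emit.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line: str) -> dict:
    """Return identity and positives percentages recomputed from the counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "alignment_length": ident_len,
    }

stats = parse_hsp_stats(
    "Identity = 159/448 (35.49%), Positives = 234/448 (52.23%), Query Frame = 0"
)
print(stats)
```

For the Q9LZL3 hit this reproduces the 35.49% identity over a 448-column alignment. Note that the alignment length includes gap columns, so it can exceed the query length.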
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term     Source Description                           Alignment
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792         PEPSIN                                       coord: 391..406, score: 24.96; coord: 289..300, score: 28.66; coord: 69..89, score: 47.5
IPR001461  Aspartic peptidase A1 family           PANTHER      PTHR47965       ASPARTYL PROTEASE-RELATED                    coord: 23..422
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                                       coord: 257..414, e-value: 1.5E-32, score: 112.7
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                               coord: 229..423, e-value: 1.9E-38, score: 133.9
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                               coord: 49..228, e-value: 2.9E-36, score: 127.2
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                               coord: 64..421
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                                       coord: 64..229, e-value: 2.8E-36, score: 125.4
None       No IPR available                       PANTHER      PTHR47965:SF51  EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN  coord: 23..422
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141         ASP_PROTEASE                                 coord: 78..89
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                                 coord: 63..415, score: 32.417362
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant                          coord: 64..419, e-value: 9.68168E-74, score: 229.842
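The `coord` values in the InterPro annotations are residue positions on the protein, so a feature can be pulled out by slicing the sequence. The sketch below is a minimal example, assuming the coordinates are 1-based and inclusive; it uses only residues 61 to 120 of the protein (as printed in the TrEMBL self-alignment) rather than the full sequence, and the names `parse_coord` and `slice_region` are hypothetical.

```python
import re

# Residues 61..120 of the protein, copied from the alignment on this page.
FRAGMENT_START = 61
FRAGMENT = "TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKVHKKLLKPKTTSFDPSLSSSFSLLPC"

def parse_coord(text: str):
    """Extract (start, end) from a 'coord: start..end' annotation."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinates in {text!r}")
    return int(m.group(1)), int(m.group(2))

def slice_region(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position fragment_start."""
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or hi > len(fragment):
        raise ValueError("region lies outside the fragment")
    return fragment[lo:hi]

# PROSITE PS00141 (ASP_PROTEASE) hit reported at coord: 78..89.
start, end = parse_coord("coord: 78..89")
active_site = slice_region(FRAGMENT, FRAGMENT_START, start, end)
print(active_site)
```

Under these assumptions the 12-residue window contains an Asp-Thr-Gly (DTG) motif, the kind of motif typically found at aspartic peptidase active sites.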

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G008870.1  CmoCh14G008870.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity