CmoCh01G009790 (gene) Cucurbita moschata (Rifu)

NameCmoCh01G009790
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionEukaryotic aspartyl protease family protein
LocationCmo_Chr01 : 6394663 .. 6395925 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTCTCTCTCTCCTACTGCTCTCCCTATTGGGTCTCTTATTCTCCCCATCAAACTCCCTCTCCCTCTCCTTCCCTCTGACCTCCCATTCTGTCTCCTCCCAAGAAGCCTCCCTTTCCCTCTCCTCCAAAACCAAATCCCATGGGAAGTTCCCCTTCCAGTACTCCAACGCCCTTGTGGTGTCTGTTCCCATTGGATCGCCGCCGCAGCAGATGGACATGGTCGTAGACACCGGCAGCCAACTGTCCTGGATTCAATGCCACGGCAAAGTAAGGAGAAAATCAGTGAAGCCCATGATTAATTGGTTTGACCCTTATCTCTCCTCCTCTTTCTCTTTCCTTCCCTGTAACACCACTCTGTGCAGACCCCGAATTCCCGATTTTACCCTTCCTACTTCTTGTGACCCAACTCGCCACTGCCACTACTCCTACTTCTACGCGGATGGGACCCTGGCCGAGGGTAATCTCGTCACTGAAAAATTCACCTTCTCCAATTCCTTAACTACCCGCTCCCTCCTTCTGGGCTGCGCTACGGCCTCCACCCAAAACAGGGGTATGTTGGGAATGAATACTGGACGCCTCTCCTTCATCTCCCAGGCCAAGATATCCAAGTTTTCCTATTGCGTTCCGGATCGAACCGGGTCGGATCTAACCGGCTTGTTCTACCTTGGAGACAACCCCAACTCCGCTAAATTCAAATACGTCAACATGTTGACTTTCCCCAAAAGTCGACTCTCCCCGAATCTTGACAAGTCGGCCTACACCCTCCCGATGAAGGGGATAAGAATAGGCAACAACAAACTCAACATCTCGCCCGCCGTTTTCAAACCGGACCCATCTGGGGCCGGTCAGACCATGATCGACTCCGGCTCCGATTTGACGTATTTGGTAGATGAGGCCTACAGCAAGGTCAGAGAAGAGATGGTCAGATTAGTGGGGCCCATGATGAAGAAAGGATACGAATACGCCGCCGTCGCCGACATGTGTTTCGACGGCGCAGTGGCGGCGGCGGTGGGTCGGAGGATCGGCGACATGTGGTTCCAGTTTGAGAATGGGGTGGAGATATTGGTCGGGAAAGGGGAAGGGTTATTGACGGAAGTGGAGGAAGGGGTGAAGTGCGTGGGGATCGGACGGTCAGATAGACTTGTGACTGAGAGTAATATAATCGGGAACGTTCATCAGCAGAATATGTGGGTGGAGTACGATCTGAGCAATAAGAGAATTGGGTTTGGTGTTGCCAAGTGTAGTGGATTGAAGGCATGA

mRNA sequence

ATGCCTCTCTCTCTCCTACTGCTCTCCCTATTGGGTCTCTTATTCTCCCCATCAAACTCCCTCTCCCTCTCCTTCCCTCTGACCTCCCATTCTGTCTCCTCCCAAGAAGCCTCCCTTTCCCTCTCCTCCAAAACCAAATCCCATGGGAAGTTCCCCTTCCAGTACTCCAACGCCCTTGTGGTGTCTGTTCCCATTGGATCGCCGCCGCAGCAGATGGACATGGTCGTAGACACCGGCAGCCAACTGTCCTGGATTCAATGCCACGGCAAAGTAAGGAGAAAATCAGTGAAGCCCATGATTAATTGGTTTGACCCTTATCTCTCCTCCTCTTTCTCTTTCCTTCCCTGTAACACCACTCTGTGCAGACCCCGAATTCCCGATTTTACCCTTCCTACTTCTTGTGACCCAACTCGCCACTGCCACTACTCCTACTTCTACGCGGATGGGACCCTGGCCGAGGGTAATCTCGTCACTGAAAAATTCACCTTCTCCAATTCCTTAACTACCCGCTCCCTCCTTCTGGGCTGCGCTACGGCCTCCACCCAAAACAGGGGTATGTTGGGAATGAATACTGGACGCCTCTCCTTCATCTCCCAGGCCAAGATATCCAAGTTTTCCTATTGCGTTCCGGATCGAACCGGGTCGGATCTAACCGGCTTGTTCTACCTTGGAGACAACCCCAACTCCGCTAAATTCAAATACGTCAACATGTTGACTTTCCCCAAAAGTCGACTCTCCCCGAATCTTGACAAGTCGGCCTACACCCTCCCGATGAAGGGGATAAGAATAGGCAACAACAAACTCAACATCTCGCCCGCCGTTTTCAAACCGGACCCATCTGGGGCCGGTCAGACCATGATCGACTCCGGCTCCGATTTGACGTATTTGGTAGATGAGGCCTACAGCAAGGTCAGAGAAGAGATGGTCAGATTAGTGGGGCCCATGATGAAGAAAGGATACGAATACGCCGCCGTCGCCGACATGTGTTTCGACGGCGCAGTGGCGGCGGCGGTGGGTCGGAGGATCGGCGACATGTGGTTCCAGTTTGAGAATGGGGTGGAGATATTGGTCGGGAAAGGGGAAGGGTTATTGACGGAAGTGGAGGAAGGGGTGAAGTGCGTGGGGATCGGACGGTCAGATAGACTTGTGACTGAGAGTAATATAATCGGGAACGTTCATCAGCAGAATATGTGGGTGGAGTACGATCTGAGCAATAAGAGAATTGGGTTTGGTGTTGCCAAGTGTAGTGGATTGAAGGCATGA

Coding sequence (CDS)

ATGCCTCTCTCTCTCCTACTGCTCTCCCTATTGGGTCTCTTATTCTCCCCATCAAACTCCCTCTCCCTCTCCTTCCCTCTGACCTCCCATTCTGTCTCCTCCCAAGAAGCCTCCCTTTCCCTCTCCTCCAAAACCAAATCCCATGGGAAGTTCCCCTTCCAGTACTCCAACGCCCTTGTGGTGTCTGTTCCCATTGGATCGCCGCCGCAGCAGATGGACATGGTCGTAGACACCGGCAGCCAACTGTCCTGGATTCAATGCCACGGCAAAGTAAGGAGAAAATCAGTGAAGCCCATGATTAATTGGTTTGACCCTTATCTCTCCTCCTCTTTCTCTTTCCTTCCCTGTAACACCACTCTGTGCAGACCCCGAATTCCCGATTTTACCCTTCCTACTTCTTGTGACCCAACTCGCCACTGCCACTACTCCTACTTCTACGCGGATGGGACCCTGGCCGAGGGTAATCTCGTCACTGAAAAATTCACCTTCTCCAATTCCTTAACTACCCGCTCCCTCCTTCTGGGCTGCGCTACGGCCTCCACCCAAAACAGGGGTATGTTGGGAATGAATACTGGACGCCTCTCCTTCATCTCCCAGGCCAAGATATCCAAGTTTTCCTATTGCGTTCCGGATCGAACCGGGTCGGATCTAACCGGCTTGTTCTACCTTGGAGACAACCCCAACTCCGCTAAATTCAAATACGTCAACATGTTGACTTTCCCCAAAAGTCGACTCTCCCCGAATCTTGACAAGTCGGCCTACACCCTCCCGATGAAGGGGATAAGAATAGGCAACAACAAACTCAACATCTCGCCCGCCGTTTTCAAACCGGACCCATCTGGGGCCGGTCAGACCATGATCGACTCCGGCTCCGATTTGACGTATTTGGTAGATGAGGCCTACAGCAAGGTCAGAGAAGAGATGGTCAGATTAGTGGGGCCCATGATGAAGAAAGGATACGAATACGCCGCCGTCGCCGACATGTGTTTCGACGGCGCAGTGGCGGCGGCGGTGGGTCGGAGGATCGGCGACATGTGGTTCCAGTTTGAGAATGGGGTGGAGATATTGGTCGGGAAAGGGGAAGGGTTATTGACGGAAGTGGAGGAAGGGGTGAAGTGCGTGGGGATCGGACGGTCAGATAGACTTGTGACTGAGAGTAATATAATCGGGAACGTTCATCAGCAGAATATGTGGGTGGAGTACGATCTGAGCAATAAGAGAATTGGGTTTGGTGTTGCCAAGTGTAGTGGATTGAAGGCATGA
BLAST of CmoCh01G009790 vs. Swiss-Prot
Match: PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 1.0e-64
Identity = 158/432 (36.57%), Postives = 229/432 (53.01%), Query Frame = 1

Query: 16  SPSNSLSLSFPLTSHSVSSQEASLSLSSKTK----SH---GKFPFQYSNALVVSVPIGSP 75
           S S+S S SF  +S S SS   +L L  KT+     H    K  F ++  L V++ +G+P
Sbjct: 23  SSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTP 82

Query: 76  PQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCRPRIPDF 135
           PQ + MV+DTGS+LSW++C+    R S    +N FDP  SSS+S +PC++  CR R  DF
Sbjct: 83  PQNISMVIDTGSELSWLRCN----RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDF 142

Query: 136 TLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATA--------S 195
            +P SCD  + CH +  YAD + +EGNL  E F F NS    +L+ GC  +         
Sbjct: 143 LIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEED 202

Query: 196 TQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNSAKFKYVNMLTF 255
           T+  G+LGMN G LSFISQ    KFSYC+      D  G   LGD    + F ++  L +
Sbjct: 203 TKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLGD----SNFTWLTPLNY 262

Query: 256 -PKSRLS---PNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYL 315
            P  R+S   P  D+ AYT+ + GI++    L I  +V  PD +GAGQTM+DSG+  T+L
Sbjct: 263 TPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFL 322

Query: 316 VDEAYSKVREEMVRLVGPMM----KKGYEYAAVADMCF---DGAVAAAVGRRIGDMWFQF 375
           +   Y+ +R   +     ++       + +    D+C+      + + +  R+  +   F
Sbjct: 323 LGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVF 382

Query: 376 ENGVEILVGKGEGLLTEV------EEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDL 416
           E G EI V  G+ LL  V       + V C   G SD +  E+ +IG+ HQQNMW+E+DL
Sbjct: 383 E-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 442

BLAST of CmoCh01G009790 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 2.7e-36
Identity = 111/362 (30.66%), Postives = 174/362 (48.07%), Query Frame = 1

Query: 60  VVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTT 119
           ++++ IG+P Q    ++DTGS L W QC    +  +    I  F+P  SSSFS LPC++ 
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPI--FNPQGSSSFSTLPCSSQ 155

Query: 120 LCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATA 179
           LC+      + PT  +    C Y+Y Y DG+  +G++ TE  TF  S++  ++  GC   
Sbjct: 156 LCQA----LSSPTCSN--NFCQYTYGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGEN 215

Query: 180 ST-----QNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNSAKFKY 239
           +         G++GM  G LS  SQ  ++KFSYC+    GS       LG   NS     
Sbjct: 216 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTP-IGSSTPSNLLLGSLANSVTAGS 275

Query: 240 VNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPS-GAGQTMIDSGSDL 299
            N      S++      + Y + + G+ +G+ +L I P+ F  + + G G  +IDSG+ L
Sbjct: 276 PNTTLIQSSQIP-----TFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 335

Query: 300 TYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGV 359
           TY V+ AY  VR+E +  +   +  G   ++  D+CF    +     +I      F+ G 
Sbjct: 336 TYFVNNAYQSVRQEFISQINLPVVNG--SSSGFDLCFQ-TPSDPSNLQIPTFVMHFDGG- 395

Query: 360 EILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVA 416
             L    E        G+ C+ +G S +     +I GN+ QQNM V YD  N  + F  A
Sbjct: 396 -DLELPSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVVSFASA 434

BLAST of CmoCh01G009790 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 2.9e-35
Identity = 112/365 (30.68%), Postives = 173/365 (47.40%), Query Frame = 1

Query: 60  VVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTT 119
           +++V IG+P      ++DTGS L W QC    +  S    I  F+P  SSSFS LPC + 
Sbjct: 97  LMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPI--FNPQDSSSFSTLPCESQ 156

Query: 120 LCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCAT- 179
            C+       LP+       C Y+Y Y DG+  +G + TE FTF  S +  ++  GC   
Sbjct: 157 YCQ------DLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIAFGCGED 216

Query: 180 ----ASTQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNSAKFKY 239
                     G++GM  G LS  SQ  + +FSYC+    GS       LG   +    + 
Sbjct: 217 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTS-YGSSSPSTLALGSAASGVP-EG 276

Query: 240 VNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLT 299
               T   S L+P    + Y + ++GI +G + L I  + F+    G G  +IDSG+ LT
Sbjct: 277 SPSTTLIHSSLNP----TYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 336

Query: 300 YLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCF----DGAVAAAVGRRIGDMWFQFE 359
           YL  +AY+ V +     +   +    E ++    CF    DG+       ++ ++  QF+
Sbjct: 337 YLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTCFQQPSDGSTV-----QVPEISMQFD 396

Query: 360 NGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGF 416
            GV + +G+ + +L    EGV C+ +G S +L    +I GN+ QQ   V YDL N  + F
Sbjct: 397 GGV-LNLGE-QNILISPAEGVICLAMGSSSQL--GISIFGNIQQQETQVLYDLQNLAVSF 435

BLAST of CmoCh01G009790 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 7.2e-34
Identity = 105/357 (29.41%), Postives = 171/357 (47.90%), Query Frame = 1

Query: 63  VPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCR 122
           + +G+P ++M +V+DTGS ++WIQC            +  F+P  SS++  L C+   C 
Sbjct: 166 IGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPV--FNPTSSSTYKSLTCSAPQCS 225

Query: 123 PRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATAS-- 182
                  L TS   +  C Y   Y DG+   G L T+  TF NS    ++ LGC   +  
Sbjct: 226 ------LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEG 285

Query: 183 --TQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNSAKFKYVNML 242
             T   G+LG+  G LS  +Q K + FSYC+ DR     + L +     NS +    +  
Sbjct: 286 LFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDF-----NSVQLGGGDA- 345

Query: 243 TFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYLVD 302
           T P  R +  +D + Y + + G  +G  K+ +  A+F  D SG+G  ++D G+ +T L  
Sbjct: 346 TAPLLR-NKKID-TFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQT 405

Query: 303 EAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEILVG 362
           +AY+ +R+  ++L    +KKG    ++ D C+D +  + V  ++  + F F  G  + + 
Sbjct: 406 QAYNSLRDAFLKLT-VNLKKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLDLP 465

Query: 363 KGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKC 416
               L+   + G  C     +    +  +IIGNV QQ   + YDLS   IG    KC
Sbjct: 466 AKNYLIPVDDSGTFCFAFAPTS---SSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh01G009790 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 8.3e-30
Identity = 102/373 (27.35%), Postives = 172/373 (46.11%), Query Frame = 1

Query: 54  QYSNALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHG-KVRRKSVKPMINWFDPYLSSSFS 113
           Q S    V + +GSPP+   MV+D+GS + W+QC   K+  K   P+   FDP  S S++
Sbjct: 126 QGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPV---FDPAKSGSYT 185

Query: 114 FLPCNTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSL 173
            + C +++C     D    + C  +  C Y   Y DG+  +G L  E  TF+ ++  R++
Sbjct: 186 GVSCGSSVC-----DRIENSGCH-SGGCRYEVMYGDGSYTKGTLALETLTFAKTV-VRNV 245

Query: 174 LLGCATASTQNRGM-------LGMNTGRLSFISQAK---ISKFSYCVPDRTGSDLTGLFY 233
            +GC     +NRGM       LG+  G +SF+ Q        F YC+  R G+D TG   
Sbjct: 246 AMGC---GHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSR-GTDSTGSLV 305

Query: 234 LGDNPNSAKFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGA 293
            G         +V ++  P++        S Y + +KG+ +G  ++ +   VF    +G 
Sbjct: 306 FGREALPVGASWVPLVRNPRA-------PSFYYVGLKGLGVGGVRIPLPDGVFDLTETGD 365

Query: 294 GQTMIDSGSDLTYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRI 353
           G  ++D+G+ +T L   AY   R+        + +      ++ D C+D  ++  V  R+
Sbjct: 366 GGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRA--SGVSIFDTCYD--LSGFVSVRV 425

Query: 354 GDMWFQFENGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYD 413
             + F F  G  + +     L+   + G  C     S    T  +IIGN+ Q+ + V +D
Sbjct: 426 PTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASP---TGLSIIGNIQQEGIQVSFD 470

Query: 414 LSNKRIGFGVAKC 416
            +N  +GFG   C
Sbjct: 486 GANGFVGFGPNVC 470

BLAST of CmoCh01G009790 vs. TrEMBL
Match: A0A061EL58_THECC (Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=3 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 5.4e-137
Identity = 259/416 (62.26%), Postives = 308/416 (74.04%), Query Frame = 1

Query: 17  PSNSLSLSFPLTS------------HSVSSQEASLSLSSKTKSHG-KFPFQYSNALVVSV 76
           P+NS+S SFPLTS             S+ S + + ++  +  S+  K  F+YS AL+V++
Sbjct: 31  PNNSISFSFPLTSLRFSRDNVQTLYRSLVSTKPNSTVQPRPSSYNYKTTFKYSMALIVAL 90

Query: 77  PIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCRP 136
           PIG+PPQ   MV+DTGSQLSWIQCH KV RK   P  + FDP LSSSFS LPC   LC+P
Sbjct: 91  PIGTPPQTQQMVLDTGSQLSWIQCHKKVARKPPPPPTS-FDPSLSSSFSVLPCTHPLCKP 150

Query: 137 RIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATASTQN 196
           RIPDFTLPTSCD  R CHYSYFYADGTLAEGNLV EKFTFS S +T  L+LGCAT ++++
Sbjct: 151 RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRSQSTPPLILGCATDTSED 210

Query: 197 RGMLGMNTGRLSFISQAKISKFSYCVPDR---TGSDLTGLFYLGDNPNSAKFKYVNMLTF 256
           +G+LGMN GRLSF SQAKISKFSYCVP R    G   TG FYLG+NP+S  F+YVN++ F
Sbjct: 211 KGILGMNLGRLSFASQAKISKFSYCVPTRRTQPGFSPTGSFYLGENPSSRGFQYVNLMIF 270

Query: 257 PKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYLVDEA 316
           P+S   PN+D  AYTLPM+GIRIG  KL I  +VF+PD  G+GQTMIDSGS+ TYLVD+A
Sbjct: 271 PESGTRPNMDPLAYTLPMQGIRIGAKKLPIPTSVFRPDAGGSGQTMIDSGSEFTYLVDDA 330

Query: 317 YSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEILVGKG 376
           Y+KVREE+VRLVGP +KKGY Y  VADMCFDG     +GR IGDM  +FE GVEI V K 
Sbjct: 331 YNKVREEVVRLVGPRIKKGYVYGGVADMCFDGN-PIEIGRLIGDMVLEFEKGVEITVEK- 390

Query: 377 EGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKCS 417
           E +L +VE GV C+GIGRS  L   SNIIGN HQQN+WVEYDL N+R+GFG A CS
Sbjct: 391 ERVLADVEGGVHCLGIGRSSMLGAASNIIGNFHQQNLWVEYDLVNRRVGFGKADCS 443

BLAST of CmoCh01G009790 vs. TrEMBL
Match: A0A0B2S2W1_GLYSO (Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_013398 PE=3 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 7.8e-136
Identity = 255/410 (62.20%), Postives = 311/410 (75.85%), Query Frame = 1

Query: 18  SNSLSLSFPLTSHSVSSQEAS------LSLSSKTKSHG-KFPFQYSNALVVSVPIGSPPQ 77
           ++SLSLSFPLTS  +S+ +         +LSS + S+  K  F+YS ALVV++PIG+PPQ
Sbjct: 24  TDSLSLSFPLTSLPLSTAKPLNRNPNLRTLSSSSSSYNIKSSFKYSMALVVTLPIGTPPQ 83

Query: 78  QMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCRPRIPDFTL 137
              MV+DTGSQLSWIQCH K       P    FDP LSSSF  LPC   LC+PR+PDFTL
Sbjct: 84  HQQMVLDTGSQLSWIQCHNKT------PPTASFDPSLSSSFYILPCTHPLCKPRVPDFTL 143

Query: 138 PTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATASTQNRGMLGMN 197
           PT+CD  R CHYSYFYADGT AEGNLV EK TFS S TT  L+LGCAT S+  RG+LGMN
Sbjct: 144 PTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLILGCATESSDARGILGMN 203

Query: 198 TGRLSFISQAKISKFSYCVPDRTGSD----LTGLFYLGDNPNSAKFKYVNMLTFPKSRLS 257
            GRLSF SQAK++KFSYCVP R  ++     TG FYLG+NPNSA+F+YV+MLTFP+S+  
Sbjct: 204 LGRLSFPSQAKVTKFSYCVPTRQAANDNNLPTGSFYLGNNPNSARFRYVSMLTFPQSQRM 263

Query: 258 PNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYLVDEAYSKVRE 317
           PNLD  AYT+PM+GIRIG  KLNI P+VF+P+  G+GQTM+DSGS+ T+LVD AY  VRE
Sbjct: 264 PNLDPLAYTVPMQGIRIGGKKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDAAYDAVRE 323

Query: 318 EMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEILVGKGEGLLTE 377
           E++R+VGP +KKGY Y  VADMCF+G+ A  +GR IGD+ F+FE GVEI+V K E +L +
Sbjct: 324 EVIRVVGPRVKKGYVYGGVADMCFNGS-AMEIGRLIGDVAFEFEKGVEIVVPK-ERVLAD 383

Query: 378 VEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKCS 417
           V  GV C+GIGRS+RL   SNIIGN HQQN+WVE+DL+N+RIGFGVA CS
Sbjct: 384 VGGGVHCLGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCS 425

BLAST of CmoCh01G009790 vs. TrEMBL
Match: B9T2R1_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 PE=3 SV=1)

HSP 1 Score: 491.5 bits (1264), Expect = 1.0e-135
Identity = 255/410 (62.20%), Postives = 305/410 (74.39%), Query Frame = 1

Query: 18  SNSLSLSFPLTSHSVSSQE-------ASLSLSSKTKSHG-KFPFQYSNALVVSVPIGSPP 77
           ++S S SFPL S   SS         +S    +K  S+  +  F+YS AL+VS+PIG+PP
Sbjct: 31  TSSFSFSFPLRSLPASSPSKPSSPFRSSFVAQTKQPSYNYRSSFKYSMALIVSLPIGTPP 90

Query: 78  QQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCRPRIPDFT 137
           Q   MV+DTGSQLSWIQCH K   K   P  + FDP LSSSFS LPCN  LC+PRIPDFT
Sbjct: 91  QTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTS-FDPSLSSSFSVLPCNHPLCKPRIPDFT 150

Query: 138 LPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATASTQNRGMLGM 197
           LPT+CD  R CHYSYFYADGT AEG+LV EK TFS+S +T  L+LGCA AST  +G+LGM
Sbjct: 151 LPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAEASTDEKGILGM 210

Query: 198 NTGRLSFISQAKISKFSYCVPDR---TGSDLTGLFYLGDNPNSAKFKYVNMLTFPKSRLS 257
           N GR SF SQAKISKFSYCVP R    G   TG FYLG+NPNS +F+Y+N+LTF  S+ S
Sbjct: 211 NLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFTPSQRS 270

Query: 258 PNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYLVDEAYSKVRE 317
           PNLD  AYT+PM+GIR+GN +LNIS  +F+PDPSGAGQT+IDSGS+ TYLVDEAY+KVRE
Sbjct: 271 PNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVRE 330

Query: 318 EMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEILVGKGEGLLTE 377
           E+VRLVGP +KKGY Y  V+DMCFDG     +GR IG+M F+FE GVEI++ K   +L +
Sbjct: 331 EVVRLVGPKLKKGYVYGGVSDMCFDGN-PMEIGRLIGNMVFEFEKGVEIVIDKWR-VLAD 390

Query: 378 VEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKCS 417
           V  GV C+GIGRS+ L   SNIIGN HQQN+WVEYDL+N+RIG G A CS
Sbjct: 391 VGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGKADCS 437

BLAST of CmoCh01G009790 vs. TrEMBL
Match: A0A0S3RKY8_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.03G096000 PE=3 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 5.1e-135
Identity = 261/428 (60.98%), Postives = 314/428 (73.36%), Query Frame = 1

Query: 2   PLSLLLLSLLGLLFSPSNS-----LSLSFPLTSHSVSSQEA-----SLSLSSKTKSHGKF 61
           P+SL  L    LLFS S+S     +S SFPL S  +S+ +       L   S    + K 
Sbjct: 6   PVSLFSLLCTVLLFSASSSAKHDSVSFSFPLRSLPISAGKPLKTNPKLRSLSSASYNVKL 65

Query: 62  PFQYSNALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSF 121
           PF+YS ALVVS+PIG+PPQ   MV+DTGSQLSWIQC  K       P ++ FDP LSSSF
Sbjct: 66  PFKYSMALVVSLPIGTPPQHQQMVLDTGSQLSWIQCRNKT-----PPTVS-FDPSLSSSF 125

Query: 122 SFLPCNTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRS 181
             +PC   LC+PR+PDFTLPT+CD  R CHYSYFYADGT AEGNLV EK TFS S TT  
Sbjct: 126 YVIPCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPP 185

Query: 182 LLLGCATASTQNRGMLGMNTGRLSFISQAKISKFSYCVPDR---TGSDLTGLFYLGDNPN 241
           L LGCAT S    G+LGMN GRLSF SQAKI+KFSYCVP R   +G+  TG FYLG+NPN
Sbjct: 186 LTLGCATESRDAGGILGMNLGRLSFPSQAKITKFSYCVPTRVSGSGNLPTGSFYLGNNPN 245

Query: 242 SAKFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMID 301
           SA+F+YV+MLTF +S+  PNLD  AYT+PM+GIRIG  +LNI+P+VF+PD  G+GQTMID
Sbjct: 246 SARFRYVSMLTFSQSQRMPNLDPLAYTVPMQGIRIGGKRLNINPSVFRPDAGGSGQTMID 305

Query: 302 SGSDLTYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQ 361
           SGS+ T+LVDEAY +VREE+VR+VGP +KKGY Y  VADMCFDG+   ++GR IGD+  +
Sbjct: 306 SGSEFTFLVDEAYDRVREEVVRVVGPRIKKGYVYGGVADMCFDGSARESIGRLIGDVVLE 365

Query: 362 FENGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRI 417
           FE GVEI+V K E +L +V  GV CVGIGRS+RL   SNIIGN+HQQN+WVE+DL+N RI
Sbjct: 366 FEKGVEIVVPK-ERVLADVGGGVHCVGIGRSERLGAASNIIGNIHQQNLWVEFDLANHRI 425

BLAST of CmoCh01G009790 vs. TrEMBL
Match: A0A0L9UW35_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan07g076900 PE=3 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 5.1e-135
Identity = 261/428 (60.98%), Postives = 314/428 (73.36%), Query Frame = 1

Query: 2   PLSLLLLSLLGLLFSPSNS-----LSLSFPLTSHSVSSQEA-----SLSLSSKTKSHGKF 61
           P+SL  L    LLFS S+S     +S SFPL S  +S+ +       L   S    + K 
Sbjct: 6   PVSLFSLLCTVLLFSASSSAKHDSVSFSFPLRSLPISAGKPLKTNPKLRSLSSASYNVKL 65

Query: 62  PFQYSNALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSF 121
           PF+YS ALVVS+PIG+PPQ   MV+DTGSQLSWIQC  K       P ++ FDP LSSSF
Sbjct: 66  PFKYSMALVVSLPIGTPPQHQQMVLDTGSQLSWIQCRNKT-----PPTVS-FDPSLSSSF 125

Query: 122 SFLPCNTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRS 181
             +PC   LC+PR+PDFTLPT+CD  R CHYSYFYADGT AEGNLV EK TFS S TT  
Sbjct: 126 YVIPCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPP 185

Query: 182 LLLGCATASTQNRGMLGMNTGRLSFISQAKISKFSYCVPDR---TGSDLTGLFYLGDNPN 241
           L LGCAT S    G+LGMN GRLSF SQAKI+KFSYCVP R   +G+  TG FYLG+NPN
Sbjct: 186 LTLGCATESRDAGGILGMNLGRLSFPSQAKITKFSYCVPTRVSGSGNLPTGSFYLGNNPN 245

Query: 242 SAKFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMID 301
           SA+F+YV+MLTF +S+  PNLD  AYT+PM+GIRIG  +LNI+P+VF+PD  G+GQTMID
Sbjct: 246 SARFRYVSMLTFSQSQRMPNLDPLAYTVPMQGIRIGGKRLNINPSVFRPDAGGSGQTMID 305

Query: 302 SGSDLTYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQ 361
           SGS+ T+LVDEAY +VREE+VR+VGP +KKGY Y  VADMCFDG+   ++GR IGD+  +
Sbjct: 306 SGSEFTFLVDEAYDRVREEVVRVVGPRIKKGYVYGGVADMCFDGSARESIGRLIGDVVLE 365

Query: 362 FENGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRI 417
           FE GVEI+V K E +L +V  GV CVGIGRS+RL   SNIIGN+HQQN+WVE+DL+N RI
Sbjct: 366 FEKGVEIVVPK-ERVLADVGGGVHCVGIGRSERLGAASNIIGNIHQQNLWVEFDLANHRI 425

BLAST of CmoCh01G009790 vs. TAIR10
Match: AT5G37540.1 (AT5G37540.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 486.5 bits (1251), Expect = 1.7e-137
Identity = 258/432 (59.72%), Postives = 312/432 (72.22%), Query Frame = 1

Query: 6   LLLSLLGLLF--------SPSNSLSLSFPLTSHSVSSQEASLS-----LSSKTKSHGKFP 65
           L L LL + F        S S+SLSL FPLTS  ++    S S     LS +  S    P
Sbjct: 8   LFLKLLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSP 67

Query: 66  F------QYSNALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPY 125
           +      +YS AL++S+PIG+P Q  ++V+DTGSQLSWIQCH K  +K + P    FDP 
Sbjct: 68  YTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPS 127

Query: 126 LSSSFSFLPCNTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNS 185
           LSSSFS LPC+  LC+PRIPDFTLPTSCD  R CHYSYFYADGT AEGNLV EKFTFSNS
Sbjct: 128 LSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNS 187

Query: 186 LTTRSLLLGCATASTQNRGMLGMNTGRLSFISQAKISKFSYCVP---DRTGSDLTGLFYL 245
            TT  L+LGCA  ST  +G+LGMN GRLSFISQAKISKFSYC+P   +R G   TG FYL
Sbjct: 188 QTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYL 247

Query: 246 GDNPNSAKFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAG 305
           GDNPNS  FKYV++LTFP+S+  PNLD  AYT+P++GIRIG  +LNI  +VF+PD  G+G
Sbjct: 248 GDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSG 307

Query: 306 QTMIDSGSDLTYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIG 365
           QTM+DSGS+ T+LVD AY KV+EE+VRLVG  +KKGY Y + ADMCFDG  +  +GR IG
Sbjct: 308 QTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIG 367

Query: 366 DMWFQFENGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDL 416
           D+ F+F  GVEILV K + LL  V  G+ CVGIGRS  L   SNIIGNVHQQN+WVE+D+
Sbjct: 368 DLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDV 427

BLAST of CmoCh01G009790 vs. TAIR10
Match: AT1G66180.1 (AT1G66180.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 470.7 bits (1210), Expect = 9.4e-133
Identity = 253/415 (60.96%), Postives = 298/415 (71.81%), Query Frame = 1

Query: 16  SPSNSLSLSFPLTSHSVS----SQEASLSLSSKTKSHGKFP-------FQYSNALVVSVP 75
           S S SLSL  PLTS  +S    S   + SL S+       P       F+YS AL++S+P
Sbjct: 18  SLSTSLSLHLPLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNFRSRFKYSMALIISLP 77

Query: 76  IGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCRPR 135
           IG+PPQ   MV+DTGSQLSWIQCH K  +   KP  + FDP LSSSFS LPC+  LC+PR
Sbjct: 78  IGTPPQAQQMVLDTGSQLSWIQCHRK--KLPPKPKTS-FDPSLSSSFSTLPCSHPLCKPR 137

Query: 136 IPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATASTQNR 195
           IPDFTLPTSCD  R CHYSYFYADGT AEGNLV EK TFSN+  T  L+LGCAT S+ +R
Sbjct: 138 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSDDR 197

Query: 196 GMLGMNTGRLSFISQAKISKFSYCVP---DRTGSDLTGLFYLGDNPNSAKFKYVNMLTFP 255
           G+LGMN GRLSF+SQAKISKFSYC+P   +R G   TG FYLGDNPNS  FKYV++LTFP
Sbjct: 198 GILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFP 257

Query: 256 KSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYLVDEAY 315
           +S+  PNLD  AYT+PM GIR G  KLNIS +VF+PD  G+GQTM+DSGS+ T+LVD AY
Sbjct: 258 ESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAY 317

Query: 316 SKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEILVGKGE 375
            KVR E++  VG  +KKGY Y   ADMCFDG V A + R IGD+ F F  GVEILV K E
Sbjct: 318 DKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNV-AMIPRLIGDLVFVFTRGVEILVPK-E 377

Query: 376 GLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKCS 417
            +L  V  G+ CVGIGRS  L   SNIIGNVHQQN+WVE+D++N+R+GF  A CS
Sbjct: 378 RVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCS 427

BLAST of CmoCh01G009790 vs. TAIR10
Match: AT5G02190.1 (AT5G02190.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 248.8 bits (634), Expect = 5.8e-66
Identity = 158/432 (36.57%), Postives = 229/432 (53.01%), Query Frame = 1

Query: 16  SPSNSLSLSFPLTSHSVSSQEASLSLSSKTK----SH---GKFPFQYSNALVVSVPIGSP 75
           S S+S S SF  +S S SS   +L L  KT+     H    K  F ++  L V++ +G+P
Sbjct: 23  SSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTP 82

Query: 76  PQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCRPRIPDF 135
           PQ + MV+DTGS+LSW++C+    R S    +N FDP  SSS+S +PC++  CR R  DF
Sbjct: 83  PQNISMVIDTGSELSWLRCN----RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDF 142

Query: 136 TLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATA--------S 195
            +P SCD  + CH +  YAD + +EGNL  E F F NS    +L+ GC  +         
Sbjct: 143 LIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEED 202

Query: 196 TQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNSAKFKYVNMLTF 255
           T+  G+LGMN G LSFISQ    KFSYC+      D  G   LGD    + F ++  L +
Sbjct: 203 TKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLGD----SNFTWLTPLNY 262

Query: 256 -PKSRLS---PNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYL 315
            P  R+S   P  D+ AYT+ + GI++    L I  +V  PD +GAGQTM+DSG+  T+L
Sbjct: 263 TPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFL 322

Query: 316 VDEAYSKVREEMVRLVGPMM----KKGYEYAAVADMCF---DGAVAAAVGRRIGDMWFQF 375
           +   Y+ +R   +     ++       + +    D+C+      + + +  R+  +   F
Sbjct: 323 LGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVF 382

Query: 376 ENGVEILVGKGEGLLTEV------EEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDL 416
           E G EI V  G+ LL  V       + V C   G SD +  E+ +IG+ HQQNMW+E+DL
Sbjct: 383 E-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 442

BLAST of CmoCh01G009790 vs. TAIR10
Match: AT2G39710.1 (AT2G39710.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 239.2 bits (609), Expect = 4.6e-63
Identity = 156/442 (35.29%), Postives = 233/442 (52.71%), Query Frame = 1

Query: 16  SPSNSLSLS------------FPLTSHSVSSQEASLSLSSKTK-----SHGKFPFQYSNA 75
           S S+SLSLS            FPLT    SS   +L  S KT+     S  K  F+++  
Sbjct: 5   SSSSSLSLSKNFLRISVLLLIFPLTFCKTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVT 64

Query: 76  LVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNT 135
           L V++ +G PPQ + MV+DTGS+LSW+ C       SV      F+P  SS++S +PC++
Sbjct: 65  LTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSV------FNPVSSSTYSPVPCSS 124

Query: 136 TLCRPRIPDFTLPTSCDPTRH-CHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCA 195
            +CR R  D  +P SCDP  H CH +  YAD T  EGNL  E F    S+T    L GC 
Sbjct: 125 PICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCM 184

Query: 196 TAS--------TQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNS 255
            +          ++ G++GMN G LSF++Q   SKFSYC+   +GSD +G   LGD    
Sbjct: 185 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI---SGSDSSGFLLLGD---- 244

Query: 256 AKFKYVNMLTFP----KSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQT 315
           A + ++  + +     +S   P  D+ AYT+ ++GIR+G+  L++  +VF PD +GAGQT
Sbjct: 245 ASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQT 304

Query: 316 MIDSGSDLTYLVDEAYSKVREEMVRLVGPMMK----KGYEYAAVADMCFDGAVAAAVGRR 375
           M+DSG+  T+L+   Y+ ++ E +     +++      + +    D+C+           
Sbjct: 305 MVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFS 364

Query: 376 IGDMWFQFENGVEILVGKGEGLL-------TEVEEGVKCVGIGRSDRLVTESNIIGNVHQ 416
              M      G E+ V  G+ LL       +E +E V C   G SD L  E+ +IG+ HQ
Sbjct: 365 GLPMVSLMFRGAEMSV-SGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQ 424

BLAST of CmoCh01G009790 vs. TAIR10
Match: AT2G03200.1 (AT2G03200.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 150.2 bits (378), Expect = 2.8e-36
Identity = 118/390 (30.26%), Postives = 184/390 (47.18%), Query Frame = 1

Query: 44  KTKSHGKFPFQYSNALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWF 103
           K  +HG      S   ++ + IG+P  +   +VDTGS L W QC  K   +        F
Sbjct: 97  KAPTHGG-----SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQC--KPCTECFDQPTPIF 156

Query: 104 DPYLSSSFSFLPCNTTLCRPRIPDFTLPTS-CDPTRH-CHYSYFYADGTLAEGNLVTEKF 163
           DP  SSS+S + C++ LC        LP S C+  +  C Y Y Y D +   G L TE F
Sbjct: 157 DPEKSSSYSKVGCSSGLCN------ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETF 216

Query: 164 TFSNSLTTRSLLLGCATAS-----TQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSD 223
           TF +  +   +  GC   +     +Q  G++G+  G LS ISQ K +KFSYC+     S+
Sbjct: 217 TFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSE 276

Query: 224 LTGLFYLGDNP----NSAKFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISP 283
            +   ++G       N         +T   S L      S Y L ++GI +G  +L++  
Sbjct: 277 ASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEK 336

Query: 284 AVFKPDPSGAGQTMIDSGSDLTYLVDEAYSKVREEMV-RLVGPMMKKGYEYAAVADMCF- 343
           + F+    G G  +IDSG+ +TYL + A+  ++EE   R+  P+   G   +   D+CF 
Sbjct: 337 STFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG---STGLDLCFK 396

Query: 344 --DGAVAAAVGRRIGDMWFQFENGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNI 403
             D A   AV + I    F F+ G ++ +     ++ +   GV C+ +G S+ +    +I
Sbjct: 397 LPDAAKNIAVPKMI----FHFK-GADLELPGENYMVADSSTGVLCLAMGSSNGM----SI 456

Query: 404 IGNVHQQNMWVEYDLSNKRIGFGVAKCSGL 419
            GNV QQN  V +DL  + + F   +C  L
Sbjct: 457 FGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

BLAST of CmoCh01G009790 vs. NCBI nr
Match: gi|778679910|ref|XP_011651212.1| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 588.2 bits (1515), Expect = 1.1e-164
Identity = 309/431 (71.69%), Postives = 342/431 (79.35%), Query Frame = 1

Query: 1   MPLSLLLLSLLGLLFSPSNSLSLSFPLTSHSVSSQEASLSLSS-----KTKSHGKF--PF 60
           M L L  LSL  L FS SNSLSL FPL+     S    L  SS     K  SHG F  PF
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60

Query: 61  QYSN-ALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSV----KPMINWFDPYLS 120
           +YS+ ALVVS+PIG+PPQ  D+V+DTGSQLSWIQCH K  +K +    KP    FDP LS
Sbjct: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSFLPCNTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLT 180
           SSFS LPCN  +C+PRIPDFTLPTSCD  R CHYSYFYADGTLAEGNLV EKFTFSNSL+
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180

Query: 181 TRSLLLGCATASTQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPN 240
           T  ++LGCA  ST+NRG+LGMN GRLSFISQAKISKFSYCVP RTGS+ TGLFYLGDNPN
Sbjct: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SAKFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMID 300
           S+KFKYV MLTFP+S+ SPNLD  AYTLPMK I+I   +LNI PA FKPD  G+GQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQ 360
           SGSDLTYLVDEAY KV+EE+VRLVG MMKKGY YAAVADMCFD  V   VGRRIGDM F+
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360

Query: 361 FENGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRI 420
           F+NGVEI VG+GEG+LTEVE+GVKCVGIGRS RL   SNIIG VHQQNMWVEYDL+NKR+
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420

BLAST of CmoCh01G009790 vs. NCBI nr
Match: gi|659114575|ref|XP_008457122.1| (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])

HSP 1 Score: 585.1 bits (1507), Expect = 9.7e-164
Identity = 305/429 (71.10%), Postives = 346/429 (80.65%), Query Frame = 1

Query: 1   MPLSLLLLSLLGLLFSPSNSLSLSFPLT----SHSVSSQEASLSLSSKTKSHGKF--PFQ 60
           M L L  LSL  L FS SNS+SL FPL+      ++S    S   + K  SHG F  PF+
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKPSNISPIYGSQLYAKKPSSHGSFKLPFK 60

Query: 61  YSN-ALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSV---KPMINWFDPYLSSS 120
           YS+ ALVVS+PIG+PPQ  D+V+DTGSQLSWIQCH KV++K     KP    FDP LSSS
Sbjct: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKVKKKLPPLPKPKTASFDPSLSSS 120

Query: 121 FSFLPCNTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTR 180
           FS LPCN  +C+PRIPDFTLPTSCD  R CHYSYFYADGTLAEGNLV EKF+ SNSL+T 
Sbjct: 121 FSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSLSTP 180

Query: 181 SLLLGCATASTQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPNSA 240
            ++LGCA AST+NRG+LGMN GRLSFISQAKISKFSYCVP RTGS+ TGLFYLGDNPNS+
Sbjct: 181 PVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNPNSS 240

Query: 241 KFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSG 300
           +FKYV MLTFP+S+ SPNLD  AYTLPMKGI+I   +LNISPA FKPD  G+GQTMIDSG
Sbjct: 241 RFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMIDSG 300

Query: 301 SDLTYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFE 360
           SDLTYLVDEAY KV+EE+VRLVG  MKKGY YAAVADMCFD  V A VGRRIG + F+F+
Sbjct: 301 SDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISFEFD 360

Query: 361 NGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGF 420
           NGVEILVG+GEG+LTEVE+GVKCVG GRS+RL   SNIIG VHQQNMWVEYDL+N+RIGF
Sbjct: 361 NGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRRIGF 420

BLAST of CmoCh01G009790 vs. NCBI nr
Match: gi|778679913|ref|XP_004140731.2| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 581.6 bits (1498), Expect = 1.1e-162
Identity = 304/431 (70.53%), Postives = 346/431 (80.28%), Query Frame = 1

Query: 1   MPLSLLLLSLLGLLFSPSNSLSLSFPLT-----SHSVSSQEASLSLSSKTKSHGKF--PF 60
           M L L  LSL  L FS SNSLSL FPL+     S+++ S  + L  + +  S+G F  PF
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSEKPSNTIPSYSSQL-YAKRPSSYGSFKLPF 60

Query: 61  QYSN-ALVVSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSV----KPMINWFDPYLS 120
           +YS+ ALVVS+PIG+PPQ  D+V+DTGSQLSWIQCH K  +K +    KP    FDP LS
Sbjct: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSFLPCNTTLCRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLT 180
           SSFS LPCN  +C+PRIPDFTLPTSCD  R CHYSYFYADGTLAEGNLV EKFTFS SL+
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS 180

Query: 181 TRSLLLGCATASTQNRGMLGMNTGRLSFISQAKISKFSYCVPDRTGSDLTGLFYLGDNPN 240
           T  ++LGCA AST+NRG+LGMN GRLSFISQAKISKFSYCVP RTGS+ TGLFYLGDNPN
Sbjct: 181 TPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SAKFKYVNMLTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMID 300
           S+KFKYV MLTFP+S+ SPNLD  AYTLPMK I+I   +LNI PA FKPD  G+GQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQ 360
           SGSDLTYLVDEAY KV+EE+VRLVG MMKKGY YA VADMCFD  V A VGRRIG + F+
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 360

Query: 361 FENGVEILVGKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRI 420
           F+NGVEI VG+GEG+LTEVE+GVKCVGIGRS+RL   SNIIG VHQQNMWVEYDL+NKR+
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 420

BLAST of CmoCh01G009790 vs. NCBI nr
Match: gi|590648249|ref|XP_007032118.1| (Eukaryotic aspartyl protease family protein [Theobroma cacao])

HSP 1 Score: 495.7 bits (1275), Expect = 7.8e-137
Identity = 259/416 (62.26%), Postives = 308/416 (74.04%), Query Frame = 1

Query: 17  PSNSLSLSFPLTS------------HSVSSQEASLSLSSKTKSHG-KFPFQYSNALVVSV 76
           P+NS+S SFPLTS             S+ S + + ++  +  S+  K  F+YS AL+V++
Sbjct: 31  PNNSISFSFPLTSLRFSRDNVQTLYRSLVSTKPNSTVQPRPSSYNYKTTFKYSMALIVAL 90

Query: 77  PIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTLCRP 136
           PIG+PPQ   MV+DTGSQLSWIQCH KV RK   P  + FDP LSSSFS LPC   LC+P
Sbjct: 91  PIGTPPQTQQMVLDTGSQLSWIQCHKKVARKPPPPPTS-FDPSLSSSFSVLPCTHPLCKP 150

Query: 137 RIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATASTQN 196
           RIPDFTLPTSCD  R CHYSYFYADGTLAEGNLV EKFTFS S +T  L+LGCAT ++++
Sbjct: 151 RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRSQSTPPLILGCATDTSED 210

Query: 197 RGMLGMNTGRLSFISQAKISKFSYCVPDR---TGSDLTGLFYLGDNPNSAKFKYVNMLTF 256
           +G+LGMN GRLSF SQAKISKFSYCVP R    G   TG FYLG+NP+S  F+YVN++ F
Sbjct: 211 KGILGMNLGRLSFASQAKISKFSYCVPTRRTQPGFSPTGSFYLGENPSSRGFQYVNLMIF 270

Query: 257 PKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYLVDEA 316
           P+S   PN+D  AYTLPM+GIRIG  KL I  +VF+PD  G+GQTMIDSGS+ TYLVD+A
Sbjct: 271 PESGTRPNMDPLAYTLPMQGIRIGAKKLPIPTSVFRPDAGGSGQTMIDSGSEFTYLVDDA 330

Query: 317 YSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEILVGKG 376
           Y+KVREE+VRLVGP +KKGY Y  VADMCFDG     +GR IGDM  +FE GVEI V K 
Sbjct: 331 YNKVREEVVRLVGPRIKKGYVYGGVADMCFDGN-PIEIGRLIGDMVLEFEKGVEITVEK- 390

Query: 377 EGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKCS 417
           E +L +VE GV C+GIGRS  L   SNIIGN HQQN+WVEYDL N+R+GFG A CS
Sbjct: 391 ERVLADVEGGVHCLGIGRSSMLGAASNIIGNFHQQNLWVEYDLVNRRVGFGKADCS 443

BLAST of CmoCh01G009790 vs. NCBI nr
Match: gi|743915197|ref|XP_011001546.1| (PREDICTED: aspartic proteinase PCS1-like [Populus euphratica])

HSP 1 Score: 492.3 bits (1266), Expect = 8.6e-136
Identity = 261/419 (62.29%), Postives = 309/419 (73.75%), Query Frame = 1

Query: 18  SNSLSLSFPLTSHSVSSQEASLSLSS-----------KTKSHGKFP------FQYSNALV 77
           ++SLSLSFPLTS   S Q +    SS           K+ S    P      F+YS  L+
Sbjct: 24  NDSLSLSFPLTSLPRSPQASPSFYSSFISQTKKAPTVKSSSFSSSPYNYRSGFKYSMILL 83

Query: 78  VSVPIGSPPQQMDMVVDTGSQLSWIQCHGKVRRKSVKPMINWFDPYLSSSFSFLPCNTTL 137
           VS+PIG+PPQ   M++DTGSQLSWIQCH KV RK   P  + FDP LSSSFS LPCN  L
Sbjct: 84  VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKP--PPSSVFDPSLSSSFSVLPCNHPL 143

Query: 138 CRPRIPDFTLPTSCDPTRHCHYSYFYADGTLAEGNLVTEKFTFSNSLTTRSLLLGCATAS 197
           C+PRIPDFTLPTSCD  R CHYSYFYADGTLAEGNLV EK TFS S +T  L+LGCA  S
Sbjct: 144 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEES 203

Query: 198 TQNRGMLGMNTGRLSFISQAKISKFSYCVPDRT---GSDLTGLFYLGDNPNSAKFKYVNM 257
           +  +G+LGMN GRLSF SQAK++KFSYCVP R    G   TG FYLG+NPNS  F+Y+N+
Sbjct: 204 SDAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGENPNSGGFRYINL 263

Query: 258 LTFPKSRLSPNLDKSAYTLPMKGIRIGNNKLNISPAVFKPDPSGAGQTMIDSGSDLTYLV 317
           LTF +S+  PNLD  AYT+ M+GIRIGN KLNI  + F+PDPSGAGQTMIDSGS+ T+LV
Sbjct: 264 LTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTFLV 323

Query: 318 DEAYSKVREEMVRLVGPMMKKGYEYAAVADMCFDGAVAAAVGRRIGDMWFQFENGVEILV 377
           DEAY+KVREE+VRLVGP +KKGY Y  V+DMCF+G  A  +GR IG+M F+F+ GVEI+V
Sbjct: 324 DEAYNKVREEVVRLVGPRLKKGYVYGGVSDMCFNGN-AIEIGRLIGNMVFEFDKGVEIVV 383

Query: 378 GKGEGLLTEVEEGVKCVGIGRSDRLVTESNIIGNVHQQNMWVEYDLSNKRIGFGVAKCS 417
            K E +L +V  GV CVGIGRS+ L   SNIIGN HQQNMWVE+DL+N+R+GFG A CS
Sbjct: 384 EK-ERVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNMWVEFDLANRRVGFGKADCS 438

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PCS1L_ARATH1.0e-6436.57Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1[more]
NEP1_NEPGR2.7e-3630.66Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
NEP2_NEPGR2.9e-3530.68Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
ASPG1_ARATH7.2e-3429.41Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
ASPG2_ARATH8.3e-3027.35Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A061EL58_THECC5.4e-13762.26Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=... [more]
A0A0B2S2W1_GLYSO7.8e-13662.20Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_013398 PE=3 SV=1[more]
B9T2R1_RICCO1.0e-13562.20Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 ... [more]
A0A0S3RKY8_PHAAN5.1e-13560.98Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.03G096000 PE=... [more]
A0A0L9UW35_PHAAN5.1e-13560.98Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan07g076900 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G37540.11.7e-13759.72 Eukaryotic aspartyl protease family protein[more]
AT1G66180.19.4e-13360.96 Eukaryotic aspartyl protease family protein[more]
AT5G02190.15.8e-6636.57 Eukaryotic aspartyl protease family protein[more]
AT2G39710.14.6e-6335.29 Eukaryotic aspartyl protease family protein[more]
AT2G03200.12.8e-3630.26 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|778679910|ref|XP_011651212.1|1.1e-16471.69PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|659114575|ref|XP_008457122.1|9.7e-16471.10PREDICTED: aspartic proteinase PCS1 [Cucumis melo][more]
gi|778679913|ref|XP_004140731.2|1.1e-16270.53PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|590648249|ref|XP_007032118.1|7.8e-13762.26Eukaryotic aspartyl protease family protein [Theobroma cacao][more]
gi|743915197|ref|XP_011001546.1|8.6e-13662.29PREDICTED: aspartic proteinase PCS1-like [Populus euphratica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh01G009790.1CmoCh01G009790.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 285..296
score: 1.7E-5coord: 387..402
score: 1.7E-5coord: 65..85
score: 1.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 3..416
score: 4.2E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 74..85
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 228..416
score: 4.1E-31coord: 57..225
score: 1.8
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 60..416
score: 8.58
NoneNo IPR availablePANTHERPTHR13683:SF327ASPARTYL PROTEASE FAMILY PROTEINcoord: 3..416
score: 4.2E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None