CmaCh03G011230 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh03G011230
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr03: 7637591 .. 7638985 (-)
RNA-Seq Expression: CmaCh03G011230
Synteny: CmaCh03G011230
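The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, which is the usual convention for genome browsers but is not stated on this page; the function names `parse_location` and `feature_length` are illustrative only.

```python
import re


def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr03: 7637591 .. 7638985 (-)")
length = feature_length(start, end)
print(seqid, strand, length)
```

Under that assumption the feature spans 1395 bp, which is 465 codons. That is consistent with a 464-residue polypeptide plus a stop codon, the length reported in the self-matching BLAST hits further down.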
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGTTTTTTCCCATTCAATTTCTCCTTTCCATCGTTCTTCTTCTTTCCGCTTCATCTTCTTCCTCCAGCATCACCGTTACACTCCCCCTCACTGCCTTCCCTTCGCTTCCACTTACTCATCCATGGAAAAACATCAAGCATCTTGTCTCTGCTTCCCTCGCCAGAGCTCAACACCTCAAGACCCCAAAAACTAAATCAAATACTTCCATTCAGAATGTTGCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCTCTCGCCTTCGGAACTCCACCGCAGAGTTTATCCTTAGTTTTCGATACTGGAAGTAGTCTCGTCTGGTTCCCATGTACTGCCGGTTACCGCTGTTCCAATTGTTCGTTTCCCAATGTCGATGCTGCAACGATTCCGAAATTCATCCCCAAATTATCTTCCTCTGCGAGGATTATTGGTTGTCGAAATCGAAAATGTTCTTGGATTTTTGGCCCTAATTTGAAGTCCTCGTGTAGAAGTTGTAGCCCTAGATCGCGAAAATGCTCCGATACTTGTCCTGGATATGGCATTCAATACGGCTCTGGCGCAACTGCCGGGTTCCTCCTCTCTGAAACGCTCGATTTCCCCGAGAAACGAGTTCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCATCAACCAGCCGGCATTGCTGGATTCGGCCGCGGTCCCGAATCGTTGCCCTCGCAAATGGGACTGAAACGATTCTCCCATTGCCTTGTTCCACGCCAGTTCGACGACTCGCCAGTGAGTAGCCCTCTCGTACTAGACTCCAGTCCGGAATCCGGCGATTCGAAGACTAACAGTCTCATCTACGCACCGTTCCGAGAAAATCCATCAGGATCCAACGCCGCATTTCGAGAGTACTATTACCTTACTCTTCGGAGAATCCTCATCGGCAGAAAGCCGGTGAAGTTCCCGTACAAATACCTCGTGCCAAACTCCGCTGGAAACGGCGGTGCGATCATCGATTCCGGTTCGACGTTCACGTTTCTAGATAAACCAATTTTCGAAGCCGTAGCGGAAGAGTTGGAGAAACAACTGGTGAAATATCCCCGAGCTAAGGGCGTTGAAGCAGAGTCCGGTTTGAGGCCGTGCTTCGATATATCCAAGGAGGAATCAGTGGAGTTTCCGGAACTGATTTTGAAATTTAAAGGCGGAGCGACGCTGGCTTTGCCGCCGGCGAATTACTTGGCGTTGGTGACGGATACCGGCGTGGTGTGCTTAACGATGATAACGGATGTAAACTTCCTCGGCGGCGGCGGTGGGCCGGCGATTATATTCGGGGCGTTTCAGCAGCAGAATGTTTTGGTGCAGTATGATTTGGCGAAGGAGAGAATCGGATTTCGGAAGCAGAGATGCACCGGAAATTGA

mRNA sequence

ATGGAGTTTTTTCCCATTCAATTTCTCCTTTCCATCGTTCTTCTTCTTTCCGCTTCATCTTCTTCCTCCAGCATCACCGTTACACTCCCCCTCACTGCCTTCCCTTCGCTTCCACTTACTCATCCATGGAAAAACATCAAGCATCTTGTCTCTGCTTCCCTCGCCAGAGCTCAACACCTCAAGACCCCAAAAACTAAATCAAATACTTCCATTCAGAATGTTGCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCTCTCGCCTTCGGAACTCCACCGCAGAGTTTATCCTTAGTTTTCGATACTGGAAGTAGTCTCGTCTGGTTCCCATGTACTGCCGGTTACCGCTGTTCCAATTGTTCGTTTCCCAATGTCGATGCTGCAACGATTCCGAAATTCATCCCCAAATTATCTTCCTCTGCGAGGATTATTGGTTGTCGAAATCGAAAATGTTCTTGGATTTTTGGCCCTAATTTGAAGTCCTCGTGTAGAAGTTGTAGCCCTAGATCGCGAAAATGCTCCGATACTTGTCCTGGATATGGCATTCAATACGGCTCTGGCGCAACTGCCGGGTTCCTCCTCTCTGAAACGCTCGATTTCCCCGAGAAACGAGTTCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCATCAACCAGCCGGCATTGCTGGATTCGGCCGCGGTCCCGAATCGTTGCCCTCGCAAATGGGACTGAAACGATTCTCCCATTGCCTTGTTCCACGCCAGTTCGACGACTCGCCAGTGAGTAGCCCTCTCGTACTAGACTCCAGTCCGGAATCCGGCGATTCGAAGACTAACAGTCTCATCTACGCACCGTTCCGAGAAAATCCATCAGGATCCAACGCCGCATTTCGAGAGTACTATTACCTTACTCTTCGGAGAATCCTCATCGGCAGAAAGCCGGTGAAGTTCCCGTACAAATACCTCGTGCCAAACTCCGCTGGAAACGGCGGTGCGATCATCGATTCCGGTTCGACGTTCACGTTTCTAGATAAACCAATTTTCGAAGCCGTAGCGGAAGAGTTGGAGAAACAACTGGTGAAATATCCCCGAGCTAAGGGCGTTGAAGCAGAGTCCGGTTTGAGGCCGTGCTTCGATATATCCAAGGAGGAATCAGTGGAGTTTCCGGAACTGATTTTGAAATTTAAAGGCGGAGCGACGCTGGCTTTGCCGCCGGCGAATTACTTGGCGTTGGTGACGGATACCGGCGTGGTGTGCTTAACGATGATAACGGATGTAAACTTCCTCGGCGGCGGCGGTGGGCCGGCGATTATATTCGGGGCGTTTCAGCAGCAGAATGTTTTGGTGCAGTATGATTTGGCGAAGGAGAGAATCGGATTTCGGAAGCAGAGATGCACCGGAAATTGA

Coding sequence (CDS)

ATGGAGTTTTTTCCCATTCAATTTCTCCTTTCCATCGTTCTTCTTCTTTCCGCTTCATCTTCTTCCTCCAGCATCACCGTTACACTCCCCCTCACTGCCTTCCCTTCGCTTCCACTTACTCATCCATGGAAAAACATCAAGCATCTTGTCTCTGCTTCCCTCGCCAGAGCTCAACACCTCAAGACCCCAAAAACTAAATCAAATACTTCCATTCAGAATGTTGCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCTCTCGCCTTCGGAACTCCACCGCAGAGTTTATCCTTAGTTTTCGATACTGGAAGTAGTCTCGTCTGGTTCCCATGTACTGCCGGTTACCGCTGTTCCAATTGTTCGTTTCCCAATGTCGATGCTGCAACGATTCCGAAATTCATCCCCAAATTATCTTCCTCTGCGAGGATTATTGGTTGTCGAAATCGAAAATGTTCTTGGATTTTTGGCCCTAATTTGAAGTCCTCGTGTAGAAGTTGTAGCCCTAGATCGCGAAAATGCTCCGATACTTGTCCTGGATATGGCATTCAATACGGCTCTGGCGCAACTGCCGGGTTCCTCCTCTCTGAAACGCTCGATTTCCCCGAGAAACGAGTTCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCATCAACCAGCCGGCATTGCTGGATTCGGCCGCGGTCCCGAATCGTTGCCCTCGCAAATGGGACTGAAACGATTCTCCCATTGCCTTGTTCCACGCCAGTTCGACGACTCGCCAGTGAGTAGCCCTCTCGTACTAGACTCCAGTCCGGAATCCGGCGATTCGAAGACTAACAGTCTCATCTACGCACCGTTCCGAGAAAATCCATCAGGATCCAACGCCGCATTTCGAGAGTACTATTACCTTACTCTTCGGAGAATCCTCATCGGCAGAAAGCCGGTGAAGTTCCCGTACAAATACCTCGTGCCAAACTCCGCTGGAAACGGCGGTGCGATCATCGATTCCGGTTCGACGTTCACGTTTCTAGATAAACCAATTTTCGAAGCCGTAGCGGAAGAGTTGGAGAAACAACTGGTGAAATATCCCCGAGCTAAGGGCGTTGAAGCAGAGTCCGGTTTGAGGCCGTGCTTCGATATATCCAAGGAGGAATCAGTGGAGTTTCCGGAACTGATTTTGAAATTTAAAGGCGGAGCGACGCTGGCTTTGCCGCCGGCGAATTACTTGGCGTTGGTGACGGATACCGGCGTGGTGTGCTTAACGATGATAACGGATGTAAACTTCCTCGGCGGCGGCGGTGGGCCGGCGATTATATTCGGGGCGTTTCAGCAGCAGAATGTTTTGGTGCAGTATGATTTGGCGAAGGAGAGAATCGGATTTCGGAAGCAGAGATGCACCGGAAATTGA

Protein sequence

MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHLKTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGLKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMITDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN
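The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below builds the standard codon table without external libraries and translates only short excerpts copied from the start and end of the CDS above; the helper name `translate` is illustrative, and the code assumes an unambiguous DNA sequence (A, C, G, T only) already in frame.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGAGTTTTTTCCCATTCAA"))
print(translate("ACCGGAAATTGA"))
```

The two excerpts translate to the first seven residues of the protein (MEFFPIQ) and to its last three residues followed by the stop codon (TGN*).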
Homology
BLAST of CmaCh03G011230 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 3.2e-46
Identity = 153/509 (30.06%), Positives = 233/509 (45.78%), Query Frame = 0

Query: 6   IQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHLKTPKT 65
           I FL + +L      S SS++  L L    SL  +    +  HL+ +S +R+   +  + 
Sbjct: 6   IFFLYTTILQYYFHFSVSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSS-ARFRRH 65

Query: 66  KSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPN 125
                 Q ++L   S   Y ISL+ G+   ++SL  DTGS LVWFPC   + C  C    
Sbjct: 66  HHKQQQQQLSLPISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILC---- 125

Query: 126 VDAATIPKFIP-KLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKC---------- 185
            ++  +P   P  LSSSA  + C +  C         S+  S  P S  C          
Sbjct: 126 -ESKPLPPSPPSSLSSSATTVSCSSPSC---------SAAHSSLPSSDLCAISNCPLDFI 185

Query: 186 --------SDTCPGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAG 245
                   S  CP +   YG G+    L S++L  P   V +F  GC+  ++ +P G+AG
Sbjct: 186 ETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAG 245

Query: 246 FGRGPESLPSQMGL------KRFSHCLVPRQFDDSPV--SSPLVL--------------D 305
           FGRG  SLP+Q+ +        FS+CLV   FD   V   SPL+L              D
Sbjct: 246 FGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTD 305

Query: 306 SSPESGD--SKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIGRKPVKFPYKYLVPNS 365
              +  D   K N  ++    ENP         +Y ++L+ I IG++ +  P      + 
Sbjct: 306 DHDDGDDEKKKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDK 365

Query: 366 AGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVK-YPRAKGVEAESGLRPCFDISKEES 425
            G GG ++DSG+TFT L    + +V EE + ++ + + RA  VE  SG+ PC+ ++  ++
Sbjct: 366 NGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QT 425

Query: 426 VEFPELILKFKGG-ATLALPPANYLALVTDTG--------VVCLTMITDVNFLGGGGGPA 462
           V+ P L+L F G  +++ LP  NY     D G        + CL ++   +     GG  
Sbjct: 426 VKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTG 485
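The percentages on each HSP summary line are the matched-column counts divided by the alignment length. The following sketch parses such a line and recomputes the two percentages; the function names are illustrative, and the label for the second statistic is matched loosely so that either spelling of "Positives" is accepted.

```python
import re

# Matches 'Identity = 153/509 (30.06%)' and the analogous Positives entry.
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    stats = {}
    for label, num, den, pct in STAT_RE.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        stats[key] = (int(num), int(den), float(pct))
    return stats


def percent(matches, alignment_length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


line = "Identity = 153/509 (30.06%), Positives = 233/509 (45.78%), Query Frame = 0"
for label, (num, den, reported) in parse_hsp_stats(line).items():
    print(label, percent(num, den), reported)
```

For the hit against Q940R4, the recomputed values agree with the reported 30.06% identity and 45.78% positives over 509 alignment columns.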

BLAST of CmaCh03G011230 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 3.2e-38
Identity = 122/390 (31.28%), Positives = 177/390 (45.38%), Query Frame = 0

Query: 82  GAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPNVDAATIPKFIPKLSSS 141
           G Y ++L+ GTP Q  S + DTGS L+W  C    +C N S         P F P+ SSS
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQS--------TPIFNPQGSSS 152

Query: 142 ARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPGYGIQYGSGA-TAGFLLSETLD 201
              + C ++ C  +  P               CS+    Y   YG G+ T G + +ETL 
Sbjct: 153 FSTLPCSSQLCQALSSPT--------------CSNNFCQYTYGYGDGSETQGSMGTETLT 212

Query: 202 FPEKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMGLKRFSHCLVPRQFDDSP 261
           F    +P+   GC            AG+ G GRGP SLPSQ+ + +FS+C+ P     S 
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTP---IGSS 272

Query: 262 VSSPLVLDSSPES--GDSKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIG--RKPVK 321
             S L+L S   S    S   +LI           ++    +YY+TL  + +G  R P+ 
Sbjct: 273 TPSNLLLGSLANSVTAGSPNTTLI----------QSSQIPTFYYITLNGLSVGSTRLPID 332

Query: 322 FPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPRAKGVEAESGLRP 381
            P  + + ++ G GG IIDSG+T T+     +++V +E   Q +  P   G  + SG   
Sbjct: 333 -PSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQ-INLPVVNG--SSSGFDL 392

Query: 382 CFDISKEES-VEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMITDVNFLGGGGGP 441
           CF    + S ++ P  ++ F GG  L LP  NY  +    G++CL M       G     
Sbjct: 393 CFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYF-ISPSNGLICLAM-------GSSSQG 434

Query: 442 AIIFGAFQQQNVLVQYDLAKERIGFRKQRC 462
             IFG  QQQN+LV YD     + F   +C
Sbjct: 453 MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmaCh03G011230 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 3.0e-36
Identity = 124/387 (32.04%), Positives = 176/387 (45.48%), Query Frame = 0

Query: 82  GAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPNVDAATIPKFIPKLSSS 141
           G Y +++A GTP  S S + DTGS L+W  C     C+ C      +   P F P+ SSS
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP---CTQCF-----SQPTPIFNPQDSSS 153

Query: 142 ARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPGYGIQYGSGATA-GFLLSETLD 201
              + C ++ C  +               S  C++    Y   YG G+T  G++ +ET  
Sbjct: 154 FSTLPCESQYCQDL--------------PSETCNNNECQYTYGYGDGSTTQGYMATETFT 213

Query: 202 FPEKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMGLKRFSHCLVPRQFDDSP 261
           F    VP+   GC            AG+ G G GP SLPSQ+G+ +FS+C+       SP
Sbjct: 214 FETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMT-SYGSSSP 273

Query: 262 VSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIGRKPVKFPYK 321
            +  L   +S     S + +LI++    NP+        YYY+TL+ I +G   +  P  
Sbjct: 274 STLALGSAASGVPEGSPSTTLIHSSL--NPT--------YYYITLQGITVGGDNLGIPSS 333

Query: 322 YLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPRAKGVEAESGLRPCFDI 381
                  G GG IIDSG+T T+L +  + AVA+    Q +  P     E+ SGL  CF  
Sbjct: 334 TFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ-INLPTVD--ESSSGLSTCFQQ 393

Query: 382 SKEES-VEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMITDVNFLGGGGGPAI-I 441
             + S V+ PE+ ++F GG  L L   N L    + GV+CL M       G      I I
Sbjct: 394 PSDGSTVQVPEISMQFDGG-VLNLGEQNILISPAE-GVICLAM-------GSSSQLGISI 435

Query: 442 FGAFQQQNVLVQYDLAKERIGFRKQRC 462
           FG  QQQ   V YDL    + F   +C
Sbjct: 454 FGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmaCh03G011230 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 3.1e-33
Identity = 145/473 (30.66%), Positives = 198/473 (41.86%), Query Frame = 0

Query: 17  SASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLAR-AQHLKTPKT-KSNTSIQNV 76
           S S S SS ++TL L    +L      K    L S+ L R ++ +K+  T  +    +NV
Sbjct: 62  SGSDSESSSSITLNLDHIDALSSN---KTPDELFSSRLQRDSRRVKSIATLAAQIPGRNV 121

Query: 77  ALFPR--------------SYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 136
              PR                G Y   L  GTP + + +V DTGS +VW  C    RC +
Sbjct: 122 THAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYS 181

Query: 137 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 196
            S P  D        P+ S +   I C +  C        +     C+ R + C      
Sbjct: 182 QSDPIFD--------PRKSKTYATIPCSSPHCR-------RLDSAGCNTRRKTCL----- 241

Query: 197 YGIQYGSGA-TAGFLLSETLDFPEKRVPDFLVGCSVLSVHQ-------PAGIAGFGRGPE 256
           Y + YG G+ T G   +ETL F   RV    +GC     H         AG+ G G+G  
Sbjct: 242 YQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCG----HDNEGLFVGAAGLLGLGKGKL 301

Query: 257 SLPSQMGLK---RFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGS 316
           S P Q G +   +FS+CLV R     P        SS   G++  + +     R  P  S
Sbjct: 302 SFPGQTGHRFNQKFSYCLVDRSASSKP--------SSVVFGNAAVSRIA----RFTPLLS 361

Query: 317 NAAFREYYYLTLRRILIGRKPVKFPYKYLVP-NSAGNGGAIIDSGSTFTFLDKPIFEAVA 376
           N     +YY+ L  I +G   V      L   +  GNGG IIDSG++ T L +P + A+ 
Sbjct: 362 NPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMR 421

Query: 377 EELEKQLVKYPRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALV 436
           +          RA      S    CFD+S    V+ P ++L F+ GA ++LP  NYL  V
Sbjct: 422 DAFRVGAKTLKRAPDF---SLFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPV 481

Query: 437 TDTGVVCLTMITDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 462
              G  C        F G  GG +II G  QQQ   V YDLA  R+GF    C
Sbjct: 482 DTNGKFCFA------FAGTMGGLSII-GNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CmaCh03G011230 vs. ExPASy Swiss-Prot
Match: Q7XV21 (Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2)

HSP 1 Score: 132.5 bits (332), Expect = 1.2e-29
Identity = 119/411 (28.95%), Positives = 174/411 (42.34%), Query Frame = 0

Query: 82  GAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPNVDAATIPKFIPKLSSS 141
           G Y + L  GTPP   +   DT S L+W  C     C+ C +  VD    P F P++SS+
Sbjct: 87  GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQP---CTGC-YHQVD----PMFNPRVSST 146

Query: 142 ARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPGYGIQYGSGATAGFLLSETLDF 201
              + C +  C        +     C        ++C       G+  T G L  + L  
Sbjct: 147 YAALPCSSDTCD-------ELDVHRC---GHDDDESCQYTYTYSGNATTEGTLAVDKLVI 206

Query: 202 PEKRVPDFLVGCSVLSV-----HQPAGIAGFGRGPESLPSQMGLKRFSHCLVPRQFDDSP 261
            E        GCS  S       Q +G+ G GRGP SL SQ+ ++RF++CL P     S 
Sbjct: 207 GEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPP---PASR 266

Query: 262 VSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIGRKPVKFPYK 321
           +   LVL +  ++  + TN  I  P R +P      +  YYYL L  +LIG + +  P  
Sbjct: 267 IPGKLVLGADADAARNATNR-IAVPMRRDP-----RYPSYYYLNLDGLLIGDRAMSLPPT 326

Query: 322 YLV----------------PNS-------AGNGGAIIDSGSTFTFLDKPIFEAVAEELEK 381
                              PN+       A   G IID  ST TFL+  +++ +  +LE 
Sbjct: 327 TTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEV 386

Query: 382 QLVKYPRAKGVEAESGLRPCF---DISKEESVEFPELILKFKGGATLALPPANYLALVTD 441
           + ++ PR  G  +  GL  CF   D    + V  P + L F  G  L L  A   A   +
Sbjct: 387 E-IRLPRGTG--SSLGLDLCFILPDGVAFDRVYVPAVALAF-DGRWLRLDKARLFAEDRE 446

Query: 442 TGVVCLTMITDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 462
           +G++CL +           G   I G FQQQN+ V Y+L + R+ F +  C
Sbjct: 447 SGMMCLMVGR------AEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
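The "Match:" lines for the Swiss-Prot hits carry an accession followed by a description with key=value fields (OS for organism, OX for taxonomy identifier, GN for gene name, PE for protein existence level, SV for sequence version). The sketch below splits such a line into a dictionary; `parse_match` is a hypothetical helper, and it assumes the fields always appear as space-separated `KEY=` tokens inside one pair of parentheses, as they do in the lines above.

```python
import re

MATCH_RE = re.compile(r"^Match:\s*(?P<acc>\S+)\s*\((?P<desc>.*)\)\s*$")
FIELD_RE = re.compile(r"\s(OS|OX|GN|PE|SV)=")


def parse_match(line):
    """Split a 'Match: ACC (name OS=... OX=... GN=... PE=... SV=...)' line."""
    m = MATCH_RE.match(line)
    if m is None:
        raise ValueError(f"unrecognised match line: {line!r}")
    # re.split with a capturing group yields [name, key1, value1, key2, ...].
    parts = FIELD_RE.split(m.group("desc"))
    record = {"accession": m.group("acc"), "name": parts[0].strip()}
    for key, value in zip(parts[1::2], parts[2::2]):
        record[key] = value.strip()
    return record


hit = parse_match(
    "Match: Q940R4 (Probable aspartyl protease At4g16563 "
    "OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)"
)
print(hit["accession"], hit["OS"], hit["GN"])
```

The NCBI nr hits further down use a different description style (organism in square brackets), so this helper would not apply to them without changes.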

BLAST of CmaCh03G011230 vs. ExPASy TrEMBL
Match: A0A6J1IMR7 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813 PE=3 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 3.8e-268
Identity = 464/464 (100.00%), Positives = 464/464 (100.00%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL
Sbjct: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN
Sbjct: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420
           AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT
Sbjct: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420

Query: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 465
           DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN
Sbjct: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 464

BLAST of CmaCh03G011230 vs. ExPASy TrEMBL
Match: A0A6J1EDJ0 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111433208 PE=3 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 2.3e-257
Identity = 445/464 (95.91%), Positives = 451/464 (97.20%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEFF I FLLSIVLLLSASSSSSS TVTLPLT FPSLP  HPWKNIKHLVSASL RAQHL
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           KTP+TKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN
Sbjct: 61  KTPRTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFPNVDAATIPKFIPKLSSSA+IIGCRNRKCSWIFGPNLK+ CRSCSPRSRKCSDTCPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMGL
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCLVPRQFDDSPVSSPLVLDSS ESG+SK NSLIYAPFRENPSGSNAAFREYYYLT
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420
           AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPP+NYLALV DT VVCLTMIT
Sbjct: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTMIT 420

Query: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 465
           DV FLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN
Sbjct: 421 DVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 464

BLAST of CmaCh03G011230 vs. ExPASy TrEMBL
Match: A0A5A7SGF9 (Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold541G00670 PE=3 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 4.2e-214
Identity = 366/463 (79.05%), Positives = 410/463 (88.55%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEF PI FL SI LLL  SSSSS   +TLPL  FPS+P T P K I HL+SASL+RAQHL
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSS---ITLPLATFPSIPFTDPLKTINHLLSASLSRAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           K+P++KSNTS +NV+LFPRSYGAY++SLAFGTPPQ+LS +FDTGSSLVWFPCTAGYRC++
Sbjct: 61  KSPQSKSNTSTENVSLFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAH 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFP+VD ATI KF+PKLSSS +I+GCRN KC+WIFGPNLKS CR+C+P+SRKCSD+CPG
Sbjct: 121 CSFPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM L
Sbjct: 181 YGIQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCL+PR FDDSPVSSPLVLDS PES +SKT S IYAPF+ENPS SN AFREYYYL+
Sbjct: 241 KRFSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLS 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIG KPVKFPYKYLVP+S G GGAIIDSGSTFTFLDKPIFEA+A ELEKQLVKYPR
Sbjct: 301 LRRILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISK-EESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420
           AK +EA++GLRPCF+ISK EES EFPE+ LKFKGG  L+LPP NYL +VTD  VVCLTM+
Sbjct: 361 AKDIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMM 420

Query: 421 TDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 463
           T+   +G GGGPAIIFGAFQQQNVLV+YDLAK+RIGFRKQ+CT
Sbjct: 421 TNAEVVGVGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKCT 460

BLAST of CmaCh03G011230 vs. ExPASy TrEMBL
Match: A0A1S3CHV2 (aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 PE=3 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 4.2e-214
Identity = 366/463 (79.05%), Positives = 410/463 (88.55%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEF PI FL SI LLL  SSSSS   +TLPL  FPS+P T P K I HL+SASL+RAQHL
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSS---ITLPLATFPSIPFTDPLKTINHLLSASLSRAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           K+P++KSNTS +NV+LFPRSYGAY++SLAFGTPPQ+LS +FDTGSSLVWFPCTAGYRC++
Sbjct: 61  KSPQSKSNTSTENVSLFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAH 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFP+VD ATI KF+PKLSSS +I+GCRN KC+WIFGPNLKS CR+C+P+SRKCSD+CPG
Sbjct: 121 CSFPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM L
Sbjct: 181 YGIQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCL+PR FDDSPVSSPLVLDS PES +SKT S IYAPF+ENPS SN AFREYYYL+
Sbjct: 241 KRFSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLS 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIG KPVKFPYKYLVP+S G GGAIIDSGSTFTFLDKPIFEA+A ELEKQLVKYPR
Sbjct: 301 LRRILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISK-EESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420
           AK +EA++GLRPCF+ISK EES EFPE+ LKFKGG  L+LPP NYL +VTD  VVCLTM+
Sbjct: 361 AKDIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMM 420

Query: 421 TDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 463
           T+   +G GGGPAIIFGAFQQQNVLV+YDLAK+RIGFRKQ+CT
Sbjct: 421 TNAEVVGVGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKCT 460

BLAST of CmaCh03G011230 vs. ExPASy TrEMBL
Match: A0A5D3CAS4 (Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G001760 PE=3 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 9.3e-214
Identity = 366/463 (79.05%), Positives = 410/463 (88.55%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEF PI FL SI LLL  SSSSS   +TLPLT FPS+P T P K I HL+SASL+RAQHL
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSS---ITLPLTTFPSIPFTDPLKTINHLLSASLSRAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           K+P++KSNTS +NV+LFPRSYGAY++SLAFGTPPQ+LS +FDTGSSLVWFPCTAGYRC++
Sbjct: 61  KSPQSKSNTSTENVSLFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAH 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFP+VD ATI KF+PKLSSS +I+GCRN KC+WIFGPNLKS CR+C+P+SRKCSD+CPG
Sbjct: 121 CSFPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAG LLSETLD   KRVPDFLVGCSV+SV QPAGIAGFGRGPESLPSQM L
Sbjct: 181 YGIQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVRQPAGIAGFGRGPESLPSQMRL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCL+PR FDDSPVSSPLVLDS PES +SKT S IYAPF+ENPS SN AFREYYYL+
Sbjct: 241 KRFSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLS 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIG KPVKFPYKYLVP+S G GGAIIDSGSTFTFLDKPIFEA+A ELEKQLVKYPR
Sbjct: 301 LRRILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISK-EESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420
           AK +EA++GLRPCF+ISK EES EFPE+ LKFKGG  L+LPP NYL +VTD  VVCLTM+
Sbjct: 361 AKDIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMM 420

Query: 421 TDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 463
           T+   +G GGGPAIIFGAFQQQNVLV+YDLAK+RIGFRKQ+CT
Sbjct: 421 TNAEVVGVGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKCT 460

BLAST of CmaCh03G011230 vs. NCBI nr
Match: XP_022979057.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 933.3 bits (2411), Expect = 7.9e-268
Identity = 464/464 (100.00%), Positives = 464/464 (100.00%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL
Sbjct: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN
Sbjct: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420
           AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT
Sbjct: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420

Query: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 465
           DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN
Sbjct: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 464

BLAST of CmaCh03G011230 vs. NCBI nr
Match: XP_023543736.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 902.9 bits (2332), Expect = 1.1e-258
Identity = 448/464 (96.55%), Positives = 453/464 (97.63%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEFFPI FLLSIVLLLSASSSSSS TVTLPLT FPSLP THPWKNIKHLVSASL RAQHL
Sbjct: 1   MEFFPIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFTHPWKNIKHLVSASLTRAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           KTP+ KSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN
Sbjct: 61  KTPRIKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFPNVDAATIPKFIPKLSSSA+IIGCRNRKCSWIFGPNLKS CRSCSPRSRKCSDTCPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKSLCRSCSPRSRKCSDTCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMGL
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCLVPRQFDDSPVSSPLVLDSS ESG+SK NSLIYAPFRENPSGSNAAFREYYYLT
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420
           AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT
Sbjct: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420

Query: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 465
           DV FLGGGGGPAIIFGAFQQQNVLVQYDLAK+RIGFRKQRCT N
Sbjct: 421 DVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKDRIGFRKQRCTVN 464

BLAST of CmaCh03G011230 vs. NCBI nr
Match: XP_022925946.1 (probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 897.5 bits (2318), Expect = 4.8e-257
Identity = 445/464 (95.91%), Positives = 451/464 (97.20%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEFF I FLLSIVLLLSASSSSSS TVTLPLT FPSLP  HPWKNIKHLVSASL RAQHL
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           KTP+TKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN
Sbjct: 61  KTPRTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFPNVDAATIPKFIPKLSSSA+IIGCRNRKCSWIFGPNLK+ CRSCSPRSRKCSDTCPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMGL
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCLVPRQFDDSPVSSPLVLDSS ESG+SK NSLIYAPFRENPSGSNAAFREYYYLT
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420
           AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPP+NYLALV DT VVCLTMIT
Sbjct: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTMIT 420

Query: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 465
           DV FLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN
Sbjct: 421 DVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 464

BLAST of CmaCh03G011230 vs. NCBI nr
Match: KAG7034471.1 (Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 895.2 bits (2312), Expect = 2.4e-256
Identity = 444/464 (95.69%), Positives = 450/464 (96.98%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
           MEFF I FLLSIVLLLSASSSSSS TVTLPLT FPSLP  HPWKNIKHLVSASL RAQHL
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60

Query: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120
           K P+TKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN
Sbjct: 61  KIPRTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPG 180
           CSFPNVDAATIPKFIPKLSSSA+IIGCRNRKCSWIFGPNLK+ CRSCSPRSRKCSDTCPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTCPG 180

Query: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240
           YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMGL
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300
           KRFSHCLVPRQFDDSPVSSPLVLDSS ESG+SK NSLIYAPFRENPSGSNAAFREYYYLT
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360
           LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMIT 420
           AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPP+NYLALV DT VVCLTMIT
Sbjct: 361 AKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTMIT 420

Query: 421 DVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 465
           DV FLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN
Sbjct: 421 DVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCTGN 464

BLAST of CmaCh03G011230 vs. NCBI nr
Match: XP_038881211.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 768.8 bits (1984), Expect = 2.6e-218
Identity = 385/463 (83.15%), Positives = 412/463 (88.98%), Query Frame = 0

Query: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLT-HPWKNIKHLVSASLARAQH 60
           MEFFPI FL SI LLL  SSSSS  T+TLPLTAFPS+PLT  P K I +L+SASL RAQH
Sbjct: 1   MEFFPIPFLFSIFLLLPTSSSSSISTITLPLTAFPSIPLTDDPLKIINYLLSASLNRAQH 60

Query: 61  LKTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCS 120
           LK P+TK   SIQNV+LF RSYGAYSI+LAFGTPPQ+LS VFDTGSSLVWFPCTAGYRCS
Sbjct: 61  LKNPQTK---SIQNVSLFSRSYGAYSITLAFGTPPQNLSFVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCP 180
           NCSFPNVDAATIPKF+PKLSSSA+IIGCRN KC+WIFGPNL S CR+C+P+SR CS +CP
Sbjct: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLNSRCRNCNPKSRNCSGSCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMG 240
           GYGI YGSGATAGFLLSETLDFP+K VPDFLVGCSV SVHQPAGIAGFGR PESLPSQM 
Sbjct: 181 GYGILYGSGATAGFLLSETLDFPKKGVPDFLVGCSVSSVHQPAGIAGFGRAPESLPSQMR 240

Query: 241 LKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYL 300
           LKRFS+CLV R FDDSPVSSPLVLDS  ES DSKT S IYAPFRENPSGSNAAFREYYYL
Sbjct: 241 LKRFSYCLVSRGFDDSPVSSPLVLDSGSESDDSKTESFIYAPFRENPSGSNAAFREYYYL 300

Query: 301 TLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYP 360
           +LRRILIG KPVK PYKYL+P+SAG GGAIIDSGSTFTFLDKPIFEAVA ELEKQLVKYP
Sbjct: 301 SLRRILIGGKPVKIPYKYLMPDSAGKGGAIIDSGSTFTFLDKPIFEAVAGELEKQLVKYP 360

Query: 361 RAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420
           R K VE +SGLRPCFDISKE SVEFPEL+LKFKGGA L+LPP NYLALVTD GVVCLTM+
Sbjct: 361 RTKSVEVQSGLRPCFDISKEVSVEFPELVLKFKGGAKLSLPPVNYLALVTDAGVVCLTMM 420

Query: 421 TDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 463
           TDV  +GGG GPAIIFGAFQQQNVLV+YDLA+ RIGFRKQRCT
Sbjct: 421 TDVGVVGGGAGPAIIFGAFQQQNVLVEYDLARNRIGFRKQRCT 460

BLAST of CmaCh03G011230 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 486.1 bits (1250), Expect = 3.1e-137
Identity = 244/453 (53.86%), Positives = 318/453 (70.20%), Query Frame = 0

Query: 27  VTLPLTAFPSLPLT--HPWKNIKHLVSASLARAQHLK------------TPKTKSNTSIQ 86
           V LPL+ F     +   P+ +++ L  +S+ARA  LK            +  T ++ ++ 
Sbjct: 19  VKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVV 78

Query: 87  NVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPNVDAATIP 146
              L  +SYG YS+SL+FGTP Q++  VFDTGSSLVW PCT+ Y CS C F  +D   IP
Sbjct: 79  KSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIP 138

Query: 147 KFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPGYGIQYGSGATAG 206
           +FIPK SSS++IIGC++ KC +++GPN++  CR C P +R C+  CP Y +QYG G+TAG
Sbjct: 139 RFIPKNSSSSKIIGCQSPKCQFLYGPNVQ--CRGCDPNTRNCTVGCPPYILQYGLGSTAG 198

Query: 207 FLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGLKRFSHCLVPRQF 266
            L++E LDFP+  VPDF+VGCS++S  QPAGIAGFGRGP SLPSQM LKRFSHCLV R+F
Sbjct: 199 VLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRF 258

Query: 267 DDSPVSSPLVLDS-SPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIGRKPV 326
           DD+ V++ L LD+ S  +  SKT  L Y PFR+NP+ SN AF EYYYL LRRI +GRK V
Sbjct: 259 DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 318

Query: 327 KFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPRAKGVEAESGLR 386
           K PYKYL P + G+GG+I+DSGSTFTF+++P+FE VAEE   Q+  Y R K +E E+GL 
Sbjct: 319 KIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG 378

Query: 387 PCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMITD--VNFLGGGG 446
           PCF+IS +  V  PELI +FKGGA L LP +NY   V +T  VCLT+++D  VN   GG 
Sbjct: 379 PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVN-PSGGT 438

Query: 447 GPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 463
           GPAII G+FQQQN LV+YDL  +R GF K++C+
Sbjct: 439 GPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of CmaCh03G011230 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 211.8 bits (538), Expect = 1.1e-54
Identity = 152/435 (34.94%), Positives = 211/435 (48.51%), Query Frame = 0

Query: 60  LKTPKTKSNTSIQ------NVALFP--RSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFP 119
           L TPK+++   I+      +V + P       Y I+L  GTPPQ++ +  DTGS L W P
Sbjct: 51  LPTPKSQTQERIKKPLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVP 110

Query: 120 C-TAGYRCSNC-SFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWI------FGPNLKSS 179
           C    + C  C    N D  +   F P  SS++    C +  C  I      F P   + 
Sbjct: 111 CGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAG 170

Query: 180 CRSCSPRSRKCSDTCPGYGIQYGSGA-TAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPA 239
           C         C   CP +   YG G   +G L  + L    + VP F  GC   +  +P 
Sbjct: 171 CSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTSTYREPI 230

Query: 240 GIAGFGRGPESLPSQMGL--KRFSHCLVPRQFDDSP-VSSPLVLDSSPESGDSKTNSLIY 299
           GIAGFGRG  SLPSQ+G   K FSHC +P +F ++P +SSPL+L +S  S  + T+SL +
Sbjct: 231 GIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALS-INLTDSLQF 290

Query: 300 APFRENPSGSNAAFREYYYLTLRRILIGRK--PVKFPYKYLVPNSAGNGGAIIDSGSTFT 359
            P    P   N+     YY+ L  I IG    P + P      +S GNGG ++DSG+T+T
Sbjct: 291 TPMLNTPMYPNS-----YYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYT 350

Query: 360 FLDKPIFEAVAEELEKQLVKYPRAKGVEAESGLRPCFDI--------SKEESVE--FPEL 419
            L +P +  +   L+   + YPRA   E+ +G   C+ +        S E  V   FP +
Sbjct: 351 HLPEPFYSQLLTTLQ-STITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSI 410

Query: 420 ILKFKGGATLALPPAN-YLALVTDTGVVCLTMITDVNFLGGGGGPAIIFGAFQQQNVLVQ 462
              F   ATL LP  N + A+   +    +  +   N   G  GPA +FG+FQQQNV V 
Sbjct: 411 TFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVV 470

BLAST of CmaCh03G011230 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 187.6 bits (475), Expect = 2.3e-47
Identity = 153/509 (30.06%), Positives = 233/509 (45.78%), Query Frame = 0

Query: 6   IQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHLKTPKT 65
           I FL + +L      S SS++  L L    SL  +    +  HL+ +S +R+   +  + 
Sbjct: 6   IFFLYTTILQYYFHFSVSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSS-ARFRRH 65

Query: 66  KSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPN 125
                 Q ++L   S   Y ISL+ G+   ++SL  DTGS LVWFPC   + C  C    
Sbjct: 66  HHKQQQQQLSLPISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILC---- 125

Query: 126 VDAATIPKFIP-KLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKC---------- 185
            ++  +P   P  LSSSA  + C +  C         S+  S  P S  C          
Sbjct: 126 -ESKPLPPSPPSSLSSSATTVSCSSPSC---------SAAHSSLPSSDLCAISNCPLDFI 185

Query: 186 --------SDTCPGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAG 245
                   S  CP +   YG G+    L S++L  P   V +F  GC+  ++ +P G+AG
Sbjct: 186 ETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAG 245

Query: 246 FGRGPESLPSQMGL------KRFSHCLVPRQFDDSPV--SSPLVL--------------D 305
           FGRG  SLP+Q+ +        FS+CLV   FD   V   SPL+L              D
Sbjct: 246 FGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTD 305

Query: 306 SSPESGD--SKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIGRKPVKFPYKYLVPNS 365
              +  D   K N  ++    ENP         +Y ++L+ I IG++ +  P      + 
Sbjct: 306 DHDDGDDEKKKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDK 365

Query: 366 AGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVK-YPRAKGVEAESGLRPCFDISKEES 425
            G GG ++DSG+TFT L    + +V EE + ++ + + RA  VE  SG+ PC+ ++  ++
Sbjct: 366 NGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QT 425

Query: 426 VEFPELILKFKGG-ATLALPPANYLALVTDTG--------VVCLTMITDVNFLGGGGGPA 462
           V+ P L+L F G  +++ LP  NY     D G        + CL ++   +     GG  
Sbjct: 426 VKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTG 485

BLAST of CmaCh03G011230 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 154.8 bits (390), Expect = 1.6e-37
Identity = 134/431 (31.09%), Positives = 189/431 (43.85%), Query Frame = 0

Query: 44  KNIKHLVSASLARAQHLKTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDT 103
           K+I  L + S  R    +TP+T    S   ++   +  G Y + L  GTP  ++ +V DT
Sbjct: 95  KSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDT 154

Query: 104 GSSLVWFPCTAGYRCSNCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSS 163
           GS +VW  C+    C N      DA     F PK S +   + C +R C       L  S
Sbjct: 155 GSDVVWLQCSPCKACYN----QTDAI----FDPKKSKTFATVPCGSRLCR-----RLDDS 214

Query: 164 CRSCSPRSRKCSDTCPGYGIQYGSGA-TAGFLLSETLDFPEKRVPDFLVGCSVLSVHQ-- 223
               + RS+ C      Y + YG G+ T G   +ETL F   RV    +GC     H   
Sbjct: 215 SECVTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCG----HDNE 274

Query: 224 -----PAGIAGFGRGPESLPSQMGLK---RFSHCLVPRQFDDSPVSSPLVLDSSPESGDS 283
                 AG+ G GRG  S PSQ   +   +FS+CLV R    S    P  +     +   
Sbjct: 275 GLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFG-NAAVP 334

Query: 284 KTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIG--RKPVKFPYKYLVPNSAGNGGAII 343
           KT+  ++ P   NP         +YYL L  I +G  R P     ++ + ++ GNGG II
Sbjct: 335 KTS--VFTPLLTNPK-----LDTFYYLQLLGISVGGSRVPGVSESQFKL-DATGNGGVII 394

Query: 344 DSGSTFTFLDKPIFEAVAEELEKQLVKYPRAKGVEAESGLRPCFDISKEESVEFPELILK 403
           DSG++ T L +P + A+ +       K  RA    + S    CFD+S   +V+ P ++  
Sbjct: 395 DSGTSVTRLTQPAYVALRDAFRLGATKLKRA---PSYSLFDTCFDLSGMTTVKVPTVVFH 454

Query: 404 FKGGATLALPPANYLALVTDTGVVCLTMITDVNFLGGGGGPAIIFGAFQQQNVLVQYDLA 462
           F GG  ++LP +NYL  V   G  C           G  G   I G  QQQ   V YDL 
Sbjct: 455 F-GGGEVSLPASNYLIPVNTEGRFCFA-------FAGTMGSLSIIGNIQQQGFRVAYDLV 483

BLAST of CmaCh03G011230 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 154.5 bits (389), Expect = 2.2e-37
Identity = 117/384 (30.47%), Positives = 169/384 (44.01%), Query Frame = 0

Query: 82  GAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSNCSFPNVDAATIPKFIPKLSSS 141
           G Y   +  G P + + +V DTGS + W  CT    C++C        T P F P  SSS
Sbjct: 146 GEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP---CADCYH-----QTEPIFEPSSSSS 205

Query: 142 ARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCPGYGIQYGSGA-TAGFLLSETLD 201
              + C   +C+ +      S CR+          TC  Y + YG G+ T G   +ETL 
Sbjct: 206 YEPLSCDTPQCNAL----EVSECRNA---------TCL-YEVSYGDGSYTVGDFATETLT 265

Query: 202 FPEKRVPDFLVGCSVLS---VHQPAGIAGFGRGPESLPSQMGLKRFSHCLVPRQFDDSPV 261
                V +  VGC   +       AG+ G G G  +LPSQ+    FS+CLV R  D +  
Sbjct: 266 IGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA-- 325

Query: 262 SSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLTLRRILIGRKPVKFPYKY 321
                  S+ + G S +   + AP   N          +YYL L  I +G + ++ P   
Sbjct: 326 -------STVDFGTSLSPDAVVAPLLRNHQ-----LDTFYYLGLTGISVGGELLQIPQSS 385

Query: 322 LVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPRAKGVEAESGLRPCFDIS 381
              + +G+GG IIDSG+  T L   I+ ++ +   K  +   +A GV   +    C+++S
Sbjct: 386 FEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGV---AMFDTCYNLS 445

Query: 382 KEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMITDVNFLGGGGGPAIIFGA 441
            + +VE P +   F GG  LALP  NY+  V   G  CL      + L        I G 
Sbjct: 446 AKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLA-------IIGN 483

Query: 442 FQQQNVLVQYDLAKERIGFRKQRC 462
            QQQ   V +DLA   IGF   +C
Sbjct: 506 VQQQGTRVTFDLANSLIGFSSNKC 483

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q940R4	3.2e-46	30.06	Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q766C3	3.2e-38	31.28	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2	3.0e-36	32.04	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LNJ3	3.1e-33	30.66	Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q7XV21	1.2e-29	28.95	Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2

Match Name	E-value	Identity	Description
A0A6J1IMR7	3.8e-268	100.00	probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813...
A0A6J1EDJ0	2.3e-257	95.91	probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114332...
A0A5A7SGF9	4.2e-214	79.05	Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S3CHV2	4.2e-214	79.05	aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 P...
A0A5D3CAS4	9.3e-214	79.05	Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN...

Match Name	E-value	Identity	Description
XP_022979057.1	7.9e-268	100.00	probable aspartyl protease At4g16563 [Cucurbita maxima]
XP_023543736.1	1.1e-258	96.55	probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]
XP_022925946.1	4.8e-257	95.91	probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic...
KAG7034471.1	2.4e-256	95.69	Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyro...
XP_038881211.1	2.6e-218	83.15	probable aspartyl protease At4g16563 [Benincasa hispida]

Match Name	E-value	Identity	Description
AT3G52500.1	3.1e-137	53.86	Eukaryotic aspartyl protease family protein
AT5G45120.1	1.1e-54	34.94	Eukaryotic aspartyl protease family protein
AT4G16563.1	2.3e-47	30.06	Eukaryotic aspartyl protease family protein
AT3G61820.1	1.6e-37	31.09	Eukaryotic aspartyl protease family protein
AT1G25510.1	2.2e-37	30.47	Eukaryotic aspartyl protease family protein
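
The Identity column in the summary tables corresponds to the percentage on each HSP statistics line in the alignments above (for example, 444/464 gives 95.69%). As a minimal sketch, assuming the statistics lines follow the layout shown in this record, the percentages can be recomputed from the raw counts. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and are not part of BLAST or of this database.

```python
import re

# Matches an HSP statistics line as printed in this record.
# "Posi?tives" also accepts the "Postives" spelling some exports use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + line)
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = match.groups()
    return {
        # Percent identity = identical positions / alignment length
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        # Percent positives = conservative matches / alignment length
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking
        "reported_identity_pct": float(ident_rep),
        "reported_positives_pct": float(pos_rep),
    }


if __name__ == "__main__":
    line = "Identity = 444/464 (95.69%), Positives = 450/464 (96.98%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Comparing the recomputed values with the reported ones is a quick consistency check when copying figures between the alignment blocks and the summary tables.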
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 328..339, score: 36.31; coord: 90..110, score: 52.29
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 65..258, e-value: 1.6E-33, score: 118.3
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 266..464, e-value: 7.5E-49, score: 167.9
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 80..461
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 297..457, e-value: 2.7E-34, score: 118.3
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 84..258, e-value: 5.6E-29, score: 101.6
None	No IPR available	PANTHER	PTHR47967	OS07G0603500 PROTEIN-RELATED	coord: 7..461
None	No IPR available	PANTHER	PTHR47967:SF36	BNACNNG47670D PROTEIN	coord: 7..461
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 84..457, score: 35.029343
IPR034161	Pepsin-like domain, plant	CDD	cd05476	pepsin_A_like_plant	coord: 83..461, e-value: 1.52991E-81, score: 251.413

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh03G011230.1	CmaCh03G011230.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
molecular_function	GO:0004190	aspartic-type endopeptidase activity