MC01g0666 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC01g0666
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Eukaryotic aspartyl protease family protein
Location: MC01: 12885335 .. 12887443 (+)
RNA-Seq Expression: MC01g0666
Synteny: MC01g0666
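
The Location field above packs the sequence ID, start, end, and strand into one string. As a minimal sketch, it could be split into its parts as follows. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings such as "MC01: 12885335 .. 12887443 (+)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Split a location string into seqid, start, end, strand, and length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("MC01: 12885335 .. 12887443 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the 1-based inclusive assumption, the feature spans 2109 bp on the plus strand of MC01.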
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGCTACATTACAATGATAAAGACTGATGAGTCAATAAAATTCTTCAGGATTAGAATAAAATTTTATGCTCAATAAATAAGAAACCATTACTTTATTTTCGTACGCGTTCGTGTTTGGTTTAAATTCAAAAAAAAAAAAAAAAAGGCAAAATTTGGATCGAGCACAAATTCAAAGCATTTATGAAAACTGAAGGATGAGAAATTGAAAATACCAAGGTTATTTATGAAGGTGGGCCACGTGGCATGCAGTTGTCACTCGGTGGATTGCTTGCAGCGGCCACGCCATCGCCATTTTCCTCTGCTTCCATCAACGACTGAACAGAGCATCCATTAGCTTCAGAAGCAGAGCACATTCCAATTCCAATTCCAAATCCATGGAGTTTCTTCCAATTCCCTTTCTGTTTTCCATCTTCCTTCTTCTTTCCACTTCGTCTTCTTCCTCCATCACACTCCCCCTCACCGCCTTCCCTTCAACTCGAGCTCCAGATCCATGGAAGAATCTCAATTACCTTGCCTCTGCTTCGATCATCAGAGCTCATCACCTCAAGAACCGAAACAAATCAAGCGATTTCGTCCATAAATCCAAATCCGCACTCACTCCTCGTAGCTATGGCGCTTACTCGGTTTCTATCGGCTTCGGAACTCCTCCCCAGAATTTATCGTTCGTCTTCGATACTGGAAGTAGTCTCGTGTGGTTCCCCTGCACCGCCCGTTATCTCTGCTCCAGGTGTTCGTTTCCCAACGTGAATACTGCGACGATTACGAAATTCATCCCGAAATTATCTTCTTCTGCGAGGATTGTCGGTTGCGGAAACCGGAAATGTGCTTGGATTTTTGACCCTAATGTGAGATCTAGGTGTCGAAATTGTACCCCTAATTCTCGAAATTGTTCCGATGCTTGTCCTGGTTATGGAATTCAGTACGGTTCTGGATTAACCGCTGGATTTCTACTCTCTGAAACGCTTGATTTACCGGATAAACGAGTGCCAGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTCCACCAACCTGCCGGCATTGTCGGATTTGGCCGCGGTCCTCAATCGTTGCCGTCGCAAATGCGCCTGAAAAGATTCTCCTACTGCCTCGCTTCTCGCCAGTTCGACGACTCGCCGGTGAGCAGCCCTCTCGTGCTGGACTTCGGATCCAAATCCGGCGACACCAACACTAACGGCCTCATTTACTCGCCGTTCCGAGAGAATCCCTCCGCCTCCGACGCCGCATTTAGAGAATACTATTACCTTTCTCTTCGCAGAATCCTCATCGGCGGAAAACCGGTGAAATTTCCGTACAAGTATCTCTCACCGGACTCCACCGGTAACGGCGGCACGATCATCGATTCCGGCTCGACCTTTACGATTCTGGATAAGCCAATTTTCGAAGCCGTAGCGGAAGAATTCGAGAAGCAACTTATTAAATATCCCCGAGCTACCGGCGTGGAAGCTCGGTCCGGTCTAAGGCCGTGCTTCAATGTCTCGAAGGAGAAGACGGTGGAATTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGCCTGGAACTGGCTCTGCCGCCGGCTAATTACTTCGCGTTGGTGGCGGAGTCCGGCGTGGTGTGCATGACGATGTTGACGGATGACGTCGGCGGTGAGAAGGTCGGCGGTGGACCGGCGATTATACTCGGCGCGTTTCAGCAGCAGAATATATTGGTGGAGTATGACTTGGCGAAGGACAGAATCGGATTTCGAAAGCAGAGATGCGTATGATTTATGAACTATGAAATTAATAATTTAGTGGAAAATAAAAATCGAAAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGTAAATGTAAAACAAAGTTCACGAATTGAATGTGGTGGTAACTTCTAAGTTTCAATAGACTTCCAAATTTCAAGTTATTTGGTAAAAATTTTTTAATTACAATCAAATATGTATTGGTAAAAAAGAGAGAAACAATA
AAAAATAAAACTATTTTTCTGATTTTTTAAAATTAATTATTTAAATATAAGGGTAAAACTTTTGGGCGATCTAGGTCATTTAATTAAAGTGTCACTCTTTCAATTTGAG

mRNA sequence

GGCTACATTACAATGATAAAGACTGATGAGTCAATAAAATTCTTCAGGATTAGAATAAAATTTTATGCTCAATAAATAAGAAACCATTACTTTATTTTCGTACGCGTTCGTGTTTGGTTTAAATTCAAAAAAAAAAAAAAAAAGGCAAAATTTGGATCGAGCACAAATTCAAAGCATTTATGAAAACTGAAGGATGAGAAATTGAAAATACCAAGGTTATTTATGAAGGTGGGCCACGTGGCATGCAGTTGTCACTCGGTGGATTGCTTGCAGCGGCCACGCCATCGCCATTTTCCTCTGCTTCCATCAACGACTGAACAGAGCATCCATTAGCTTCAGAAGCAGAGCACATTCCAATTCCAATTCCAAATCCATGGAGTTTCTTCCAATTCCCTTTCTGTTTTCCATCTTCCTTCTTCTTTCCACTTCGTCTTCTTCCTCCATCACACTCCCCCTCACCGCCTTCCCTTCAACTCGAGCTCCAGATCCATGGAAGAATCTCAATTACCTTGCCTCTGCTTCGATCATCAGAGCTCATCACCTCAAGAACCGAAACAAATCAAGCGATTTCGTCCATAAATCCAAATCCGCACTCACTCCTCGTAGCTATGGCGCTTACTCGGTTTCTATCGGCTTCGGAACTCCTCCCCAGAATTTATCGTTCGTCTTCGATACTGGAAGTAGTCTCGTGTGGTTCCCCTGCACCGCCCGTTATCTCTGCTCCAGGTGTTCGTTTCCCAACGTGAATACTGCGACGATTACGAAATTCATCCCGAAATTATCTTCTTCTGCGAGGATTGTCGGTTGCGGAAACCGGAAATGTGCTTGGATTTTTGACCCTAATGTGAGATCTAGGTGTCGAAATTGTACCCCTAATTCTCGAAATTGTTCCGATGCTTGTCCTGGTTATGGAATTCAGTACGGTTCTGGATTAACCGCTGGATTTCTACTCTCTGAAACGCTTGATTTACCGGATAAACGAGTGCCAGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTCCACCAACCTGCCGGCATTGTCGGATTTGGCCGCGGTCCTCAATCGTTGCCGTCGCAAATGCGCCTGAAAAGATTCTCCTACTGCCTCGCTTCTCGCCAGTTCGACGACTCGCCGGTGAGCAGCCCTCTCGTGCTGGACTTCGGATCCAAATCCGGCGACACCAACACTAACGGCCTCATTTACTCGCCGTTCCGAGAGAATCCCTCCGCCTCCGACGCCGCATTTAGAGAATACTATTACCTTTCTCTTCGCAGAATCCTCATCGGCGGAAAACCGGTGAAATTTCCGTACAAGTATCTCTCACCGGACTCCACCGGTAACGGCGGCACGATCATCGATTCCGGCTCGACCTTTACGATTCTGGATAAGCCAATTTTCGAAGCCGTAGCGGAAGAATTCGAGAAGCAACTTATTAAATATCCCCGAGCTACCGGCGTGGAAGCTCGGTCCGGTCTAAGGCCGTGCTTCAATGTCTCGAAGGAGAAGACGGTGGAATTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGCCTGGAACTGGCTCTGCCGCCGGCTAATTACTTCGCGTTGGTGGCGGAGTCCGGCGTGGTGTGCATGACGATGTTGACGGATGACGTCGGCGGTGAGAAGGTCGGCGGTGGACCGGCGATTATACTCGGCGCGTTTCAGCAGCAGAATATATTGGTGGAGTATGACTTGGCGAAGGACAGAATCGGATTTCGAAAGCAGAGATGCGTATGATTTATGAACTATGAAATTAATAATTTAGTGGAAAATAAAAATCGAAAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGTAAATGTAAAACAAAGTTCACGAATTGAATGTGGTGGTAACTTCTAAGTTTCAATAGACTTCCAAATTTCAAGTTATTTGGTAAAAATTTTTTAATTACAATCAAATATGTATTGGTAAAAAAGAGAGAAACAATA
AAAAATAAAACTATTTTTCTGATTTTTTAAAATTAATTATTTAAATATAAGGGTAAAACTTTTGGGCGATCTAGGTCATTTAATTAAAGTGTCACTCTTTCAATTTGAG

Coding sequence (CDS)

ATGGAGTTTCTTCCAATTCCCTTTCTGTTTTCCATCTTCCTTCTTCTTTCCACTTCGTCTTCTTCCTCCATCACACTCCCCCTCACCGCCTTCCCTTCAACTCGAGCTCCAGATCCATGGAAGAATCTCAATTACCTTGCCTCTGCTTCGATCATCAGAGCTCATCACCTCAAGAACCGAAACAAATCAAGCGATTTCGTCCATAAATCCAAATCCGCACTCACTCCTCGTAGCTATGGCGCTTACTCGGTTTCTATCGGCTTCGGAACTCCTCCCCAGAATTTATCGTTCGTCTTCGATACTGGAAGTAGTCTCGTGTGGTTCCCCTGCACCGCCCGTTATCTCTGCTCCAGGTGTTCGTTTCCCAACGTGAATACTGCGACGATTACGAAATTCATCCCGAAATTATCTTCTTCTGCGAGGATTGTCGGTTGCGGAAACCGGAAATGTGCTTGGATTTTTGACCCTAATGTGAGATCTAGGTGTCGAAATTGTACCCCTAATTCTCGAAATTGTTCCGATGCTTGTCCTGGTTATGGAATTCAGTACGGTTCTGGATTAACCGCTGGATTTCTACTCTCTGAAACGCTTGATTTACCGGATAAACGAGTGCCAGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTCCACCAACCTGCCGGCATTGTCGGATTTGGCCGCGGTCCTCAATCGTTGCCGTCGCAAATGCGCCTGAAAAGATTCTCCTACTGCCTCGCTTCTCGCCAGTTCGACGACTCGCCGGTGAGCAGCCCTCTCGTGCTGGACTTCGGATCCAAATCCGGCGACACCAACACTAACGGCCTCATTTACTCGCCGTTCCGAGAGAATCCCTCCGCCTCCGACGCCGCATTTAGAGAATACTATTACCTTTCTCTTCGCAGAATCCTCATCGGCGGAAAACCGGTGAAATTTCCGTACAAGTATCTCTCACCGGACTCCACCGGTAACGGCGGCACGATCATCGATTCCGGCTCGACCTTTACGATTCTGGATAAGCCAATTTTCGAAGCCGTAGCGGAAGAATTCGAGAAGCAACTTATTAAATATCCCCGAGCTACCGGCGTGGAAGCTCGGTCCGGTCTAAGGCCGTGCTTCAATGTCTCGAAGGAGAAGACGGTGGAATTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGCCTGGAACTGGCTCTGCCGCCGGCTAATTACTTCGCGTTGGTGGCGGAGTCCGGCGTGGTGTGCATGACGATGTTGACGGATGACGTCGGCGGTGAGAAGGTCGGCGGTGGACCGGCGATTATACTCGGCGCGTTTCAGCAGCAGAATATATTGGTGGAGTATGACTTGGCGAAGGACAGAATCGGATTTCGAAAGCAGAGATGCGTATGA

Protein sequence

MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRCV
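
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates that relationship on the first and last few codons of the CDS. It assumes the standard genetic code (NCBI translation table 1), which this record does not state, and the helper name `translate` is hypothetical.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGAGTTTCTTCCAATTCCC"))
print(translate("AGATGCGTATGA"))
```

The two fragments translate to the first seven residues (MEFLPIP) and the last three residues (RCV) of the listed protein; the final TGA codon is a stop and contributes no residue.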
Homology
BLAST of MC01g0666 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 9.0e-49
Identity = 158/501 (31.54%), Positives = 226/501 (45.11%), Query Frame = 0

Query: 4   LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKS 63
           L  P L  +   LSTS  SS  L L    S+R           +SA   R HH + + + 
Sbjct: 25  LSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSR-----------SSARFRRHHHKQQQQQL 84

Query: 64  SDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFP 123
           S           P S G+ Y +S+  G+    +S   DTGS LVWFPC   + C  C   
Sbjct: 85  S----------LPISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 144

Query: 124 NVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCS 183
            +  +  +     LSSSA  V C +  C+        S      NC      T +    S
Sbjct: 145 PLPPSPPS----SLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 204

Query: 184 DACPGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLP 243
             CP +   YG G     L S++L LP   V +F  GC+  ++ +P G+ GFGRG  SLP
Sbjct: 205 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 264

Query: 244 SQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN--------- 303
           +Q+ +        FSYCL S  FD   V  P  L  G        + G T+         
Sbjct: 265 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 324

Query: 304 --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIID 363
              N  +++   ENP         +Y +SL+ I IG + +  P      D  G GG ++D
Sbjct: 325 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 384

Query: 364 SGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGLRPCFNVSKEKTVEFPELVLK 423
           SG+TFT+L    + +V EEF+ ++ + + RA  VE  SG+ PC+ ++  +TV+ P LVL 
Sbjct: 385 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 444

Query: 424 FKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQ 461
           F G    + LP  NYF    + G        + C+ ML +     ++ GG   ILG +QQ
Sbjct: 445 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCL-MLMNGGDESELRGGTGAILGNYQQ 491
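
Each HSP summary line reports counts over the alignment length together with the resulting percentage. A minimal sketch for recomputing those percentages from the fractions is shown below, using the values from the first HSP above. The function name `hsp_percentages` is hypothetical, and the regular expression is an assumption about the line layout seen in this record, not a general BLAST parser.

```python
import re

# Matches "<label> = <count>/<length>", e.g. "Identity = 158/501".
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+)")

def hsp_percentages(summary_line):
    """Return {label: percent} recomputed from each count/length fraction."""
    return {
        label: round(100 * int(count) / int(length), 2)
        for label, count, length in FRACTION.findall(summary_line)
    }

line = "Identity = 158/501 (31.54%), Positives = 226/501 (45.11%), Query Frame = 0"
print(hsp_percentages(line))
```

The denominator is the alignment length including gap columns (501 here), not the query length, which is why the same query gives different denominators against different subjects.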

BLAST of MC01g0666 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 5.1e-36
Identity = 118/390 (30.26%), Positives = 170/390 (43.59%), Query Frame = 0

Query: 80  GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSS 139
           G Y +++  GTP Q  S + DTGS L+W  C     C   S P  N        P+ SSS
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN--------PQGSSS 152

Query: 140 ARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGL-TAGFLLSETLD 199
              + C ++ C  +  P               CS+    Y   YG G  T G + +ETL 
Sbjct: 153 FSTLPCSSQLCQALSSP--------------TCSNNFCQYTYGYGDGSETQGSMGTETLT 212

Query: 200 LPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSP 259
                +P+   GC            AG+VG GRGP SLPSQ+ + +FSYC+       +P
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM-------TP 272

Query: 260 V--SSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKF- 319
           +  S+P  L  GS +          SP         +    +YY++L  + +G   +   
Sbjct: 273 IGSSTPSNLLLGSLANSVTAG----SP--NTTLIQSSQIPTFYYITLNGLSVGSTRLPID 332

Query: 320 PYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPC 379
           P  +    + G GG IIDSG+T T      +++V +EF  Q I  P   G  + SG   C
Sbjct: 333 PSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQ-INLPVVNG--SSSGFDLC 392

Query: 380 FNV-SKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGP 439
           F   S    ++ P  V+ F GG +L LP  NYF +   +G++C+ M +   G        
Sbjct: 393 FQTPSDPSNLQIPTFVMHFDGG-DLELPSENYF-ISPSNGLICLAMGSSSQG-------- 434

Query: 440 AIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
             I G  QQQN+LV YD     + F   +C
Sbjct: 453 MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of MC01g0666 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 1.1e-35
Identity = 132/417 (31.65%), Positives = 188/417 (45.08%), Query Frame = 0

Query: 58  KNRNKSSDFVHKSKSALTPRSY---GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARY 117
           + R +S + + +S S +    Y   G Y +++  GTP  + S + DTGS L+W  C    
Sbjct: 69  ERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP-- 128

Query: 118 LCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSD 177
            C++C      +     F P+ SSS   + C ++ C  +               S  C++
Sbjct: 129 -CTQCF-----SQPTPIFNPQDSSSFSTLPCESQYCQDL--------------PSETCNN 188

Query: 178 ACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGP 237
               Y   YG G  T G++ +ET       VP+   GC            AG++G G GP
Sbjct: 189 NECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGP 248

Query: 238 QSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKS-----GDTNTNGLIYSPFRENP 297
            SLPSQ+ + +FSYC+ S        SSP  L  GS +     G  +T  LI+S    NP
Sbjct: 249 LSLPSQLGVGQFSYCMTS-----YGSSSPSTLALGSAASGVPEGSPSTT-LIHSSL--NP 308

Query: 298 SASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEA 357
           +        YYY++L+ I +GG  +  P         G GG IIDSG+T T L +  + A
Sbjct: 309 T--------YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNA 368

Query: 358 VAEEFEKQLIKYPRATGVEARSGLRPCF-NVSKEKTVEFPELVLKFKGGLELALPPANYF 417
           VA+ F  Q I  P  T  E+ SGL  CF   S   TV+ PE+ ++F GG+ L L   N  
Sbjct: 369 VAQAFTDQ-INLP--TVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNIL 428

Query: 418 ALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
              AE GV+C+ M +    G         I G  QQQ   V YDL    + F   +C
Sbjct: 429 ISPAE-GVICLAMGSSSQLG-------ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of MC01g0666 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 4.8e-34
Identity = 137/468 (29.27%), Positives = 194/468 (41.45%), Query Frame = 0

Query: 18  TSSSSSITLPL---TAFPSTRAPDPW---------KNLNYLAS-ASIIRAHHLKNRNKSS 77
           + SSSSITL L    A  S + PD           + +  +A+ A+ I   ++ +  +  
Sbjct: 66  SESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPG 125

Query: 78  DFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNV 137
            F     S L+  S G Y   +G GTP + +  V DTGS +VW  C     C RC     
Sbjct: 126 GFSSSVVSGLSQGS-GEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP---CRRC----- 185

Query: 138 NTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYG 197
            + +   F P+ S +   + C +  C  +      +R + C             Y + YG
Sbjct: 186 YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCL------------YQVSYG 245

Query: 198 SG-LTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQ-------PAGIVGFGRGPQSLPSQM 257
            G  T G   +ETL     RV    +GC     H         AG++G G+G  S P Q 
Sbjct: 246 DGSFTVGDFSTETLTFRRNRVKGVALGCG----HDNEGLFVGAAGLLGLGKGKLSFPGQT 305

Query: 258 RLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFRE 317
             +   +FSYCL  R    S  S P  + FG        N  +    R  P  S+     
Sbjct: 306 GHRFNQKFSYCLVDR----SASSKPSSVVFG--------NAAVSRIARFTPLLSNPKLDT 365

Query: 318 YYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQ 377
           +YY+ L  I +GG  V          D  GNGG IIDSG++ T L +P + A+ + F   
Sbjct: 366 FYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVG 425

Query: 378 LIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVV 437
                RA      S    CF++S    V+ P +VL F+G  +++LP  NY   V  +G  
Sbjct: 426 AKTLKRAPDF---SLFDTCFDLSNMNEVKVPTVVLHFRGA-DVSLPATNYLIPVDTNGKF 484

Query: 438 CMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
           C       +GG         I+G  QQQ   V YDLA  R+GF    C
Sbjct: 486 CFA-FAGTMGG-------LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of MC01g0666 vs. ExPASy Swiss-Prot
Match: Q8S9J6 (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 4.1e-33
Identity = 131/431 (30.39%), Positives = 182/431 (42.23%), Query Frame = 0

Query: 46  LASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA------YSVSIGFGTPPQNLSFVF 105
           L  A +   H   ++  ++D V +SKS   P   G+      Y V++G GTP  +LS +F
Sbjct: 90  LDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIF 149

Query: 106 DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVR 165
           DTGS L W  C     C R  +          F P  S+S   V C +  C  +      
Sbjct: 150 DTGSDLTWTQCQP---CVRTCYDQKEPI----FNPSKSTSYYNVSCSSAACGSL------ 209

Query: 166 SRCRNCTPNSRNCSDACPGYGIQYG-SGLTAGFLLSETLDLPDKRVPD-FLVGCSVLS-- 225
               + T N+ +CS +   YGIQYG    + GFL  E   L +  V D    GC   +  
Sbjct: 210 ---SSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQG 269

Query: 226 -VHQPAGIVGFGRGPQSLPSQMRL---KRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTN 285
                AG++G GR   S PSQ      K FSYCL       S  S    L FGS      
Sbjct: 270 LFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL------PSSASYTGHLTFGSAG---- 329

Query: 286 TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSG 345
               I    +  P ++      +Y L++  I +GG+ +  P    S       G +IDSG
Sbjct: 330 ----ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSG 389

Query: 346 STFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKG 405
           +  T L    + A+   F+ ++ KYP  +GV   S L  CF++S  KTV  P++   F G
Sbjct: 390 TVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSFSG 449

Query: 406 GLELALPPANYFALVAESGVVCMTML--TDDVGGEKVGGGPAIILGAFQQQNILVEYDLA 461
           G  + L     F  V +   VC+     +DD          A I G  QQQ + V YD A
Sbjct: 450 GAVVELGSKGIF-YVFKISQVCLAFAGNSDD--------SNAAIFGNVQQQTLEVVYDGA 473

BLAST of MC01g0666 vs. NCBI nr
Match: XP_022979057.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 711 bits (1834), Expect = 1.11e-254
Identity = 354/466 (75.97%), Positives = 394/466 (84.55%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
           MEF PI FL SI LLLS SSSSS   +TLPLTAFPS     PWKN+ +L SAS+ RA HL
Sbjct: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60

Query: 61  KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
           K  + KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61  KTPKTKSNTSIQNV--ALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120

Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
           S CSFPNV+ ATI KFIPKLSSSARI+GC NRKC+WIF PN++S CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTC 180

Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
           PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSVLSVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM 240

Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
            LKRFS+CL  RQFDDSPVSSPLVLD   +SGD+ TN LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYY 300

Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
           L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360

Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
           PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LALPPANY ALV ++GVVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTM 420

Query: 421 LTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           +TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVNFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461

BLAST of MC01g0666 vs. NCBI nr
Match: XP_023543736.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 709 bits (1830), Expect = 4.52e-254
Identity = 353/466 (75.75%), Positives = 394/466 (84.55%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
           MEF PIPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HL
Sbjct: 1   MEFFPIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFTHPWKNIKHLVSASLTRAQHL 60

Query: 61  KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
           K  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61  KTPRIKSNTSIQNV--ALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120

Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
           S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN++S CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKSLCRSCSPRSRKCSDTC 180

Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
           PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240

Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
            LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300

Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
           L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360

Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
           PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LALPPANY ALV ++GVVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTM 420

Query: 421 LTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           +TD   +GG   GGGPAII GAFQQQN+LV+YDLAKDRIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKDRIGFRKQRC 461

BLAST of MC01g0666 vs. NCBI nr
Match: XP_022925946.1 (probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 702 bits (1811), Expect = 3.52e-251
Identity = 349/466 (74.89%), Positives = 393/466 (84.33%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
           MEF  IPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HL
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60

Query: 61  KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
           K  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61  KTPRTKSNTSIQNV--ALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120

Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
           S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTC 180

Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
           PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240

Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
            LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300

Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
           L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360

Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
           PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LALPP+NY ALVA++ VVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTM 420

Query: 421 LTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           +TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461

BLAST of MC01g0666 vs. NCBI nr
Match: KAG7034471.1 (Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 701 bits (1809), Expect = 7.09e-251
Identity = 349/466 (74.89%), Positives = 393/466 (84.33%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
           MEF  IPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HL
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60

Query: 61  K-NRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
           K  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61  KIPRTKSNTSIQNV--ALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120

Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
           S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTC 180

Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
           PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240

Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
            LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300

Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
           L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360

Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
           PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LALPP+NY ALVA++ VVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTM 420

Query: 421 LTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           +TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461

BLAST of MC01g0666 vs. NCBI nr
Match: XP_011657732.1 (probable aspartyl protease At4g16563 [Cucumis sativus] >KGN48299.1 hypothetical protein Csa_004059 [Cucumis sativus])

HSP 1 Score: 696 bits (1796), Expect = 6.04e-249
Identity = 348/464 (75.00%), Positives = 392/464 (84.48%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN 60
           MEFLPIPFLFSIFLLL TSSSSS T LPLT FPS    DP+K +N L SAS+ RA HLK 
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60

Query: 61  RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRC 120
               S+   ++ S L PRSYGAYSVS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y CSRC
Sbjct: 61  PQSKSNTSIQNVS-LFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRC 120

Query: 121 SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY 180
           SFP V+ ATI+KF+PKLSSS ++VGC N KCAWIF PN++SRCRNC   SR CSD+CPGY
Sbjct: 121 SFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGY 180

Query: 181 GIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLK 240
           G+QYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLK
Sbjct: 181 GLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLK 240

Query: 241 RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL 300
           RFS+CL SR FDDSPVSSPLVLD GS+S ++ T   IY+PFRENPS S+AAFREYYYLSL
Sbjct: 241 RFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSL 300

Query: 301 RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRA 360
           RRILIGGKPVKFPYKYL PDSTGNGG IIDSGSTFT LDKPIFEA+A+E EKQL+KYPRA
Sbjct: 301 RRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRA 360

Query: 361 TGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLT 420
             VEA+SGLRPCFN+ KE+ + EFP++VLKFKGG +L+L   NY A+V + GVVC+TM+T
Sbjct: 361 KDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMT 420

Query: 421 DD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           D+  VGG   GGGPAIILGAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 DEAVVGG---GGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 460

BLAST of MC01g0666 vs. ExPASy TrEMBL
Match: A0A6J1IMR7 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813 PE=3 SV=1)

HSP 1 Score: 711 bits (1834), Expect = 5.38e-255
Identity = 354/466 (75.97%), Positives = 394/466 (84.55%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
           MEF PI FL SI LLLS SSSSS   +TLPLTAFPS     PWKN+ +L SAS+ RA HL
Sbjct: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60

Query: 61  KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
           K  + KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61  KTPKTKSNTSIQNV--ALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120

Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
           S CSFPNV+ ATI KFIPKLSSSARI+GC NRKC+WIF PN++S CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTC 180

Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
           PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSVLSVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM 240

Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
            LKRFS+CL  RQFDDSPVSSPLVLD   +SGD+ TN LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYY 300

Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
           L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360

Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
           PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LALPPANY ALV ++GVVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTM 420

Query: 421 LTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           +TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVNFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461

BLAST of MC01g0666 vs. ExPASy TrEMBL
Match: A0A6J1EDJ0 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111433208 PE=3 SV=1)

HSP 1 Score: 702 bits (1811), Expect = 1.70e-251
Identity = 349/466 (74.89%), Positives = 393/466 (84.33%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
           MEF  IPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HL
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60

Query: 61  KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
           K  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61  KTPRTKSNTSIQNV--ALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120

Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
           S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTC 180

Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
           PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240

Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
            LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300

Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
           L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360

Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
           PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LALPP+NY ALVA++ VVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTM 420

Query: 421 LTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           +TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461

BLAST of MC01g0666 vs. ExPASy TrEMBL
Match: A0A0A0KHK2 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G454470 PE=3 SV=1)

HSP 1 Score: 696 bits (1796), Expect = 2.92e-249
Identity = 348/464 (75.00%), Positives = 392/464 (84.48%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN 60
           MEFLPIPFLFSIFLLL TSSSSS T LPLT FPS    DP+K +N L SAS+ RA HLK 
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60

Query: 61  RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRC 120
               S+   ++ S L PRSYGAYSVS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y CSRC
Sbjct: 61  PQSKSNTSIQNVS-LFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRC 120

Query: 121 SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY 180
           SFP V+ ATI+KF+PKLSSS ++VGC N KCAWIF PN++SRCRNC   SR CSD+CPGY
Sbjct: 121 SFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGY 180

Query: 181 GIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLK 240
           G+QYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLK
Sbjct: 181 GLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLK 240

Query: 241 RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL 300
           RFS+CL SR FDDSPVSSPLVLD GS+S ++ T   IY+PFRENPS S+AAFREYYYLSL
Sbjct: 241 RFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSL 300

Query: 301 RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRA 360
           RRILIGGKPVKFPYKYL PDSTGNGG IIDSGSTFT LDKPIFEA+A+E EKQL+KYPRA
Sbjct: 301 RRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRA 360

Query: 361 TGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLT 420
             VEA+SGLRPCFN+ KE+ + EFP++VLKFKGG +L+L   NY A+V + GVVC+TM+T
Sbjct: 361 KDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMT 420

Query: 421 DD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
           D+  VGG   GGGPAIILGAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 DEAVVGG---GGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 460
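
The percentages in an HSP summary line follow directly from the reported fractions: the count of matching columns divided by the aligned length. The sketch below recomputes them for the A0A0A0KHK2 hit as a consistency check. The helper names `parse_hsp_fractions` and `recomputed_percent` are hypothetical and not part of any BLAST tool, and the regular expression only assumes the `Label = n/m (p%)` layout seen in this report.

```python
import re

# Matches fields such as "Identity = 348/464 (75.00%)".
# The label is captured as written, so any spelling used in the report is accepted.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_fractions(line):
    """Return {label: (matches, aligned_length, reported_percent)} for one summary line."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FRACTION.findall(line)
    }


def recomputed_percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Summary line of the A0A0A0KHK2 hit, copied from the report above.
summary = "Identity = 348/464 (75.00%), Positives = 392/464 (84.48%), Query Frame = 0"
fields = parse_hsp_fractions(summary)

for label, (num, den, reported) in fields.items():
    # Print the label, the recomputed value and the value reported by BLAST.
    print(label, recomputed_percent(num, den), reported)
```

The aligned length (464) exceeds the query length because gap columns are counted, so these percentages are per alignment column rather than per query residue.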

BLAST of MC01g0666 vs. ExPASy TrEMBL
Match: A0A5A7SGF9 (Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold541G00670 PE=3 SV=1)

HSP 1 Score: 689 bits (1778), Expect = 1.55e-246
Identity = 340/462 (73.59%), Positives = 389/462 (84.20%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNR 60
           MEFLPIPFLFSIFLLL TSSSSSITLPL  FPS    DP K +N+L SAS+ RA HLK+ 
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSITLPLATFPSIPFTDPLKTINHLLSASLSRAQHLKSP 60

Query: 61  NKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCS 120
              S+   ++ S L PRSYGAY+VS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y C+ CS
Sbjct: 61  QSKSNTSTENVS-LFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAHCS 120

Query: 121 FPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYG 180
           FP+V+ ATI+KF+PKLSSS +IVGC N KCAWIF PN++SRCRNC P SR CSD+CPGYG
Sbjct: 121 FPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPGYG 180

Query: 181 IQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKR 240
           IQYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKR
Sbjct: 181 IQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240

Query: 241 FSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR 300
           FS+CL  R FDDSPVSSPLVLD G +S ++ T   IY+PF+ENPS S+ AFREYYYLSLR
Sbjct: 241 FSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLSLR 300

Query: 301 RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRAT 360
           RILIGGKPVKFPYKYL PDSTG GG IIDSGSTFT LDKPIFEA+A E EKQL+KYPRA 
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPRAK 360

Query: 361 GVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTD 420
            +EA++GLRPCFN+SKE+ + EFPE+ LKFKGG +L+LPP NY  +V ++ VVC+TM+T+
Sbjct: 361 DIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMMTN 420

Query: 421 -DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
            +V G  VGGGPAII GAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 AEVVG--VGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKC 459

BLAST of MC01g0666 vs. ExPASy TrEMBL
Match: A0A1S3CHV2 (aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 PE=3 SV=1)

HSP 1 Score: 689 bits (1778), Expect = 1.55e-246
Identity = 340/462 (73.59%), Positives = 389/462 (84.20%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNR 60
           MEFLPIPFLFSIFLLL TSSSSSITLPL  FPS    DP K +N+L SAS+ RA HLK+ 
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSITLPLATFPSIPFTDPLKTINHLLSASLSRAQHLKSP 60

Query: 61  NKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCS 120
              S+   ++ S L PRSYGAY+VS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y C+ CS
Sbjct: 61  QSKSNTSTENVS-LFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAHCS 120

Query: 121 FPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYG 180
           FP+V+ ATI+KF+PKLSSS +IVGC N KCAWIF PN++SRCRNC P SR CSD+CPGYG
Sbjct: 121 FPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPGYG 180

Query: 181 IQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKR 240
           IQYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKR
Sbjct: 181 IQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240

Query: 241 FSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR 300
           FS+CL  R FDDSPVSSPLVLD G +S ++ T   IY+PF+ENPS S+ AFREYYYLSLR
Sbjct: 241 FSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLSLR 300

Query: 301 RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRAT 360
           RILIGGKPVKFPYKYL PDSTG GG IIDSGSTFT LDKPIFEA+A E EKQL+KYPRA 
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPRAK 360

Query: 361 GVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTD 420
            +EA++GLRPCFN+SKE+ + EFPE+ LKFKGG +L+LPP NY  +V ++ VVC+TM+T+
Sbjct: 361 DIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMMTN 420

Query: 421 -DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 460
            +V G  VGGGPAII GAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 AEVVG--VGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKC 459

BLAST of MC01g0666 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 496.5 bits (1277), Expect = 2.3e-140
Identity = 248/462 (53.68%), Positives = 323/462 (69.91%), Query Frame = 0

Query: 13  FLLLSTSSSSSITLPLTAFP-STRAP-DPWKNLNYLASASIIRAHHLKNRNK---SSDFV 72
           F L+  S  S++ LPL+ F  S ++P DP+ +L  LA +SI RAH LK+        D +
Sbjct: 8   FFLIFLSVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDAL 67

Query: 73  HKS--------KSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRC 132
             +        KS L+ +SYG YSVS+ FGTP Q + FVFDTGSSLVW PCT+RYLCS C
Sbjct: 68  SSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGC 127

Query: 133 SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY 192
            F  ++   I +FIPK SSS++I+GC + KC +++ PNV  +CR C PN+RNC+  CP Y
Sbjct: 128 DFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCDPNTRNCTVGCPPY 187

Query: 193 GIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLK 252
            +QYG G TAG L++E LD PD  VPDF+VGCS++S  QPAGI GFGRGP SLPSQM LK
Sbjct: 188 ILQYGLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLK 247

Query: 253 RFSYCLASRQFDDSPVSSPLVLDFGS-KSGDTNTNGLIYSPFRENPSASDAAFREYYYLS 312
           RFS+CL SR+FDD+ V++ L LD GS  +  + T GL Y+PFR+NP+ S+ AF EYYYL+
Sbjct: 248 RFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLN 307

Query: 313 LRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPR 372
           LRRI +G K VK PYKYL+P + G+GG+I+DSGSTFT +++P+FE VAEEF  Q+  Y R
Sbjct: 308 LRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTR 367

Query: 373 ATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLT 432
              +E  +GL PCFN+S +  V  PEL+ +FKGG +L LP +NYF  V  +  VC+T+++
Sbjct: 368 EKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVS 427

Query: 433 DDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
           D       G GPAIILG+FQQQN LVEYDL  DR GF K++C
Sbjct: 428 DKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467

BLAST of MC01g0666 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 201.8 bits (512), Expect = 1.2e-51
Identity = 138/406 (33.99%), Positives = 196/406 (48.28%), Query Frame = 0

Query: 82  YSVSIGFGTPPQNLSFVFDTGSSLVWFPC-TARYLCSRC-SFPNVNTATITKFIPKLSSS 141
           Y +++  GTPPQ +    DTGS L W PC    + C  C    N +  + + F P  SS+
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142

Query: 142 ARIVGCGNRKCAWI------FDPNVRSRCRNCTPNSRNCSDACPGYGIQYG-SGLTAGFL 201
           +    C +  C  I      FDP   + C         C   CP +   YG  GL +G L
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202

Query: 202 LSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL--KRFSYCLASRQF 261
             + L    + VP F  GC   +  +P GI GFGRG  SLPSQ+    K FS+C    +F
Sbjct: 203 TRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKF 262

Query: 262 DDSP-VSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGK-- 321
            ++P +SSPL+L   + S +  T+ L ++P    P      +   YY+ L  I IG    
Sbjct: 263 VNNPNISSPLILGASALSINL-TDSLQFTPMLNTP-----MYPNSYYIGLESITIGTNIT 322

Query: 322 PVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSG 381
           P + P      DS GNGG ++DSG+T+T L +P +  +    +   I YPRAT  E+R+G
Sbjct: 323 PTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQ-STITYPRATETESRTG 382

Query: 382 LRPCFNV----------SKEKTVEFPELVLKFKGGLELALPPAN-YFALVAES-GVVCMT 441
              C+ V            +  + FP +   F     L LP  N ++A+ A S G V   
Sbjct: 383 FDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQC 442

Query: 442 MLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRCV 462
           +L  ++  E    GPA + G+FQQQN+ V YDL K+RIGF+   CV
Sbjct: 443 LLFQNM--EDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 479

BLAST of MC01g0666 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 196.1 bits (497), Expect = 6.4e-50
Identity = 158/501 (31.54%), Positives = 226/501 (45.11%), Query Frame = 0

Query: 4   LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKS 63
           L  P L  +   LSTS  SS  L L    S+R           +SA   R HH + + + 
Sbjct: 25  LSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSR-----------SSARFRRHHHKQQQQQL 84

Query: 64  SDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFP 123
           S           P S G+ Y +S+  G+    +S   DTGS LVWFPC   + C  C   
Sbjct: 85  S----------LPISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 144

Query: 124 NVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCS 183
            +  +  +     LSSSA  V C +  C+        S      NC      T +    S
Sbjct: 145 PLPPSPPS----SLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 204

Query: 184 DACPGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLP 243
             CP +   YG G     L S++L LP   V +F  GC+  ++ +P G+ GFGRG  SLP
Sbjct: 205 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 264

Query: 244 SQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN--------- 303
           +Q+ +        FSYCL S  FD   V  P  L  G        + G T+         
Sbjct: 265 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 324

Query: 304 --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIID 363
              N  +++   ENP         +Y +SL+ I IG + +  P      D  G GG ++D
Sbjct: 325 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 384

Query: 364 SGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGLRPCFNVSKEKTVEFPELVLK 423
           SG+TFT+L    + +V EEF+ ++ + + RA  VE  SG+ PC+ ++  +TV+ P LVL 
Sbjct: 385 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 444

Query: 424 FKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQ 461
           F G    + LP  NYF    + G        + C+ ML +     ++ GG   ILG +QQ
Sbjct: 445 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCL-MLMNGGDESELRGGTGAILGNYQQ 491

BLAST of MC01g0666 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 168.7 bits (426), Expect = 1.1e-41
Identity = 139/432 (32.18%), Positives = 189/432 (43.75%), Query Frame = 0

Query: 41  KNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFD 100
           K++  LA+ S  R    +    +  F     S L+  S G Y + +G GTP  N+  V D
Sbjct: 95  KSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGS-GEYFMRLGVGTPATNVYMVLD 154

Query: 101 TGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS 160
           TGS +VW  C+    C  C        T   F PK S +   V CG+R C  + D    S
Sbjct: 155 TGSDVVWLQCSP---CKAC-----YNQTDAIFDPKKSKTFATVPCGSRLCRRLDD---SS 214

Query: 161 RCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQ- 220
            C   T  S+ C      Y + YG G  T G   +ETL     RV    +GC     H  
Sbjct: 215 EC--VTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCG----HDN 274

Query: 221 ------PAGIVGFGRGPQSLPSQMRLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGD 280
                  AG++G GRG  S PSQ + +   +FSYCL  R    S    P  + FG+ +  
Sbjct: 275 EGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVP 334

Query: 281 TNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTII 340
             +   +++P   NP         +YYL L  I +GG  V          D+TGNGG II
Sbjct: 335 KTS---VFTPLLTNPKLD-----TFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 394

Query: 341 DSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLK 400
           DSG++ T L +P + A+ + F     K  RA    + S    CF++S   TV+ P +V  
Sbjct: 395 DSGTSVTRLTQPAYVALRDAFRLGATKLKRA---PSYSLFDTCFDLSGMTTVKVPTVVFH 454

Query: 401 FKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDL 460
           F GG E++LP +NY   V   G  C               G   I+G  QQQ   V YDL
Sbjct: 455 FGGG-EVSLPASNYLIPVNTEGRFCFAFAGT--------MGSLSIIGNIQQQGFRVAYDL 483

BLAST of MC01g0666 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 165.6 bits (418), Expect = 9.3e-41
Identity = 125/391 (31.97%), Positives = 169/391 (43.22%), Query Frame = 0

Query: 75  TPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIP 134
           T +  G Y   +G G P + +  V DTGS + W  CT    C+ C        T   F P
Sbjct: 141 TTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP---CADCYH-----QTEPIFEP 200

Query: 135 KLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLL 194
             SSS   + C   +C    +    S CRN T     C      Y + YG G  T G   
Sbjct: 201 SSSSSYEPLSCDTPQC----NALEVSECRNAT-----CL-----YEVSYGDGSYTVGDFA 260

Query: 195 SETLDLPDKRVPDFLVGCSVLS---VHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQF 254
           +ETL +    V +  VGC   +       AG++G G G  +LPSQ+    FSYCL  R  
Sbjct: 261 TETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDS 320

Query: 255 DDSPVSSPLVLDFG-SKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPV 314
           D     S   +DFG S S D      + +P   N          +YYL L  I +GG+ +
Sbjct: 321 D-----SASTVDFGTSLSPDA-----VVAPLLRNHQLD-----TFYYLGLTGISVGGELL 380

Query: 315 KFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLR 374
           + P      D +G+GG IIDSG+  T L   I+ ++ + F K  +   +A GV   +   
Sbjct: 381 QIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGV---AMFD 440

Query: 375 PCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGG 434
            C+N+S + TVE P +   F GG  LALP  NY   V   G  C+               
Sbjct: 441 TCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPT--------AS 483

Query: 435 PAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
              I+G  QQQ   V +DLA   IGF   +C
Sbjct: 501 SLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q940R4 | 9.0e-49 | 31.54 | Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q766C3 | 5.1e-36 | 30.26 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2 | 1.1e-35 | 31.65 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LNJ3 | 4.8e-34 | 29.27 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q8S9J6 | 4.1e-33 | 30.39 | Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At...
Match Name | E-value | Identity | Description
XP_022979057.1 | 1.11e-254 | 75.97 | probable aspartyl protease At4g16563 [Cucurbita maxima]
XP_023543736.1 | 4.52e-254 | 75.75 | probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]
XP_022925946.1 | 3.52e-251 | 74.89 | probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic...
KAG7034471.1 | 7.09e-251 | 74.89 | Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyro...
XP_011657732.1 | 6.04e-249 | 75.00 | probable aspartyl protease At4g16563 [Cucumis sativus] >KGN48299.1 hypothetical ...
Match Name | E-value | Identity | Description
A0A6J1IMR7 | 5.38e-255 | 75.97 | probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813...
A0A6J1EDJ0 | 1.70e-251 | 74.89 | probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114332...
A0A0A0KHK2 | 2.92e-249 | 75.00 | Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G45447...
A0A5A7SGF9 | 1.55e-246 | 73.59 | Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S3CHV2 | 1.55e-246 | 73.59 | aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 P...
Match Name | E-value | Identity | Description
AT3G52500.1 | 2.3e-140 | 53.68 | Eukaryotic aspartyl protease family protein
AT5G45120.1 | 1.2e-51 | 33.99 | Eukaryotic aspartyl protease family protein
AT4G16563.1 | 6.4e-50 | 31.54 | Eukaryotic aspartyl protease family protein
AT3G61820.1 | 1.1e-41 | 32.18 | Eukaryotic aspartyl protease family protein
AT1G25510.1 | 9.3e-41 | 31.97 | Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001461 | Aspartic peptidase A1 family | PRINTS | PR00792 | PEPSIN | coord: 326..337, score: 31.42; coord: 88..108, score: 51.74
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 295..456, e-value: 1.6E-37, score: 128.7
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 82..252, e-value: 9.4E-29, score: 100.9
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 67..264, e-value: 9.8E-33, score: 115.7
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 265..461, e-value: 7.8E-51, score: 174.4
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 78..460
None | No IPR available | PANTHER | PTHR47967:SF36 | BNACNNG47670D PROTEIN | coord: 6..460
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 6..460
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 97..108
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 82..456, score: 35.888077
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 81..460, e-value: 1.82727E-82, score: 253.339
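
Each `coord: start..end` entry above gives the residue range of a signature match on the predicted protein. As a rough illustration, the sketch below turns those ranges into span lengths and checks that the two Pfam matches (TAXi_N and TAXi_C) lie inside the PROSITE PEPTIDASE_A1 profile match. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names are hypothetical.

```python
import re

# "coord: 82..252" -> (82, 252)
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' annotation string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate pair found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Values copied from the InterPro annotations above.
peptidase_a1 = parse_coord("coord: 82..456")   # PROSITE PS51767
taxi_n = parse_coord("coord: 82..252")         # PFAM PF14543
taxi_c = parse_coord("coord: 295..456")        # PFAM PF14541

print("TAXi_N length:", span_length(taxi_n))
print("TAXi_C length:", span_length(taxi_c))
print("both inside PEPTIDASE_A1:",
      contains(peptidase_a1, taxi_n) and contains(peptidase_a1, taxi_c))
```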

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC01g0666.1 | MC01g0666.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0004190 | aspartic-type endopeptidase activity