CmaCh03G001060 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh03G001060
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Descriptionaspartic proteinase nepenthesin-1
LocationCma_Chr03: 1100733 .. 1103306 (-)
RNA-Seq ExpressionCmaCh03G001060
SyntenyCmaCh03G001060
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ACGTAGCTCCCATCTTACTCTAAATTCCTTCTTTTTTTCTATTTCATCTCCCTAATCTATTCTCAATTCTCTTCATTGCTCAAGGATTCAAGAACTCATGGCGGATTCGTTACGATATCTGATCGTCTCGGTTGTTCTATCGATTACCATGTTATTCATTCATACCTCGGCTTCAAGTTCGTCTCTCTCGAGGCGAGCTCTACGTCAACCTAAGTTGCCTAGTGATGGCTTTCGAGTGAGTCTTAACCATGTGGATCATGTCAAGAATTTGACGAGATTCGAGCGGTTGCAACGAGGAGTGGCGCGTGGGAAGACTAGATTGCATAGACTAAACGCCATGATGTTGGCTGCCAACATCGGTGTTGGTGGTCGAGTGCAGGCGCCGGTGGTGGCGGGTAATGGTGAGTTTCTTATGAAGTTGGCTATCGGATCTCCACCGAGAAGCTTCTCGGCGATCATGGACACGGGGAGTGACCTAATTTGGACACAGTGTAAGCCTTGTCAACAGTGTTTTGATCAAGCTACGCCTATTTTTGATCCGAAAGAATCTTCTTCTTTCTCTAAGATTTCTTGCTCCAGCGAACTCTGTGATGCTCTCCCGACATCGACATGTAGTAGCGATGAGTGCGAATATTTCTACACGTATGGTGATTATTCCTCTACCCATGGCGTTTTGGCTGCTGAGACCTTCACGTTTGGAGATTCAAGCCAAGACCAGGTTATTAACCACCATCAAACCAAAAAAAATTATAAAGTTTGATAAGTTATGGTTTACTTAACATAGTTGTTTTTAATAAAAATTCTCTAAATTTTGAGAATTTTTTCGTTCTAGTATAAACGATAGGGGGTTACAAGGATCGAGCTGGGTCAGGTTGATGAAATTTTTTGGATCAAATCAAAATTTCGAGTTGGTCGGATTGGTAACCCAAATAATTCGAAATTGTTTCACATCCTAACCCAACCCTATATTTTTAGGTCACCCTATATTTTTAGGTCAGGTTAGATTGGGTTGTCCGGTGGTTGGTGAGAGCGATGGTGAAGGCTGAGGAGCTGCATCTGTGAGAGGTGGCTTGTTTACATTTGACGACGATGGTGAGGATAAGGAGTGTGGCCAACGACGAGGAGCTGTGTCTGAGACCTAACTTTTGTCTAATTTTTGCTATATATATACTTATTTATTATTATTATTATTTTAATTTGGACTAATTTGGTTTGGATGGATAGAACCTTTAACCAATAAACTCGAAACTTAATCCAAACCAGCTTGATTCATGAAAATTAACCTAACTCAACTTTTACGATTTGGGTGGGTGGGTTGGTCGAGTCTTTTGGGTTCAATTTAAACTCCTAATTTCTCCCCTTTGTTTCAATAGCATGTTAAATCACCACTAAACTAAAAAAGTTTAAAATGGTATATAAACTCTATTCGATCTTATATACACGTTCTTAATTCAGGTATCGATCCCTGGACTTGGATTTGGATGTGGAGATGACAACGAAGGAGACGGGTTCAGCCAAGGTGAGGGGCTAGTGGGGCTTGGCCGAGGACCCTTATCGCTAGTTTCTCAACTAAAAGAACAGAAGTTTTCGTATTGTTTAACCGCCATTGATGACACGAAACCAAGCTCACTTTTGTTGGGATCTCTAGCAAATGTGAAACCTAAAGCATCCGATGGTGAAATCAAAACCACCCCATTGATAAGAAACCCATCTCAGCCATCTTTTTACTATCTTTCTCTACAAGGAATCTCGGTTGGTGGCACTCAATTACCAATACCAAAGAACACTTTTGAGCTCCATGATGATGGGAGTGGTGGCGTGATCATAGATTCAGGCACAACAATCACATACATTGAGAAAAATGCTTTCACTTTACTCAAAAAAGAGTTCGTTTCTCAAATGAAACTTCCCGTTGACGACTCGGGTACCAGTGGCCTCGACCTTTGCTTTAACTTGCCACCTAAGACAAATCAGGTACGTTAACCTAACAACTTAAGTTGATGAGGAGTCCCACGTTGGCTAATTTAGGAATGATCATGATTTTATAAGTAAGAAATACATCTCCATTAGTACGAGGCCTTTTGGGGAAACAAAAAGCAAAACAACGAGAGTTATGCTTAAAGTGGACAATATCATACCATTGTATAGTGTCGTGATTTCTAACATAAAATAATGTATTGATGTTTTTGTTTATGTTGTTAACAAAGGTGGAGGTTCCGAAGTTGACGTTCCATTTCAAAGGCGCCGATTTGGAGCTTCCGGGGGAGAACTACATGATCGGCGACTCGAAGGCGGAGTTGATATGTTTGACCATTGGGAGTTCGAACGGAATGTCCATCTTTGGGAATCTTCAGCAACAAAACATCATGGTTGTTCATGATCTTCAAGAAGAAACTGTGTCGTTTTTGCCTACTCAGTGTAGTGATATATGAAAAAGTTGAAGGGAATTTGTTCAATCAAAATGGAGTGAAATGATAGATTTAAGATTGTTTATATTATTAATTCAAGCTATTCAATTTGCAAGTTTATTAAGGGATTTTAGAACTTGTATCGAACAAGTTACATGCATGTTAG

mRNA sequence

ACGTAGCTCCCATCTTACTCTAAATTCCTTCTTTTTTTCTATTTCATCTCCCTAATCTATTCTCAATTCTCTTCATTGCTCAAGGATTCAAGAACTCATGGCGGATTCGTTACGATATCTGATCGTCTCGGTTGTTCTATCGATTACCATGTTATTCATTCATACCTCGGCTTCAAGTTCGTCTCTCTCGAGGCGAGCTCTACGTCAACCTAAGTTGCCTAGTGATGGCTTTCGAGTGAGTCTTAACCATGTGGATCATGTCAAGAATTTGACGAGATTCGAGCGGTTGCAACGAGGAGTGGCGCGTGGGAAGACTAGATTGCATAGACTAAACGCCATGATGTTGGCTGCCAACATCGGTGTTGGTGGTCGAGTGCAGGCGCCGGTGGTGGCGGGTAATGGTGAGTTTCTTATGAAGTTGGCTATCGGATCTCCACCGAGAAGCTTCTCGGCGATCATGGACACGGGGAGTGACCTAATTTGGACACAGTGTAAGCCTTGTCAACAGTGTTTTGATCAAGCTACGCCTATTTTTGATCCGAAAGAATCTTCTTCTTTCTCTAAGATTTCTTGCTCCAGCGAACTCTGTGATGCTCTCCCGACATCGACATGTAGTAGCGATGAGTGCGAATATTTCTACACGTATGGTGATTATTCCTCTACCCATGGCGTTTTGGCTGCTGAGACCTTCACGTTTGGAGATTCAAGCCAAGACCAGGTATCGATCCCTGGACTTGGATTTGGATGTGGAGATGACAACGAAGGAGACGGGTTCAGCCAAGGTGAGGGGCTAGTGGGGCTTGGCCGAGGACCCTTATCGCTAGTTTCTCAACTAAAAGAACAGAAGTTTTCGTATTGTTTAACCGCCATTGATGACACGAAACCAAGCTCACTTTTGTTGGGATCTCTAGCAAATGTGAAACCTAAAGCATCCGATGGTGAAATCAAAACCACCCCATTGATAAGAAACCCATCTCAGCCATCTTTTTACTATCTTTCTCTACAAGGAATCTCGGTTGGTGGCACTCAATTACCAATACCAAAGAACACTTTTGAGCTCCATGATGATGGGAGTGGTGGCGTGATCATAGATTCAGGCACAACAATCACATACATTGAGAAAAATGCTTTCACTTTACTCAAAAAAGAGTTCGTTTCTCAAATGAAACTTCCCGTTGACGACTCGGGTACCAGTGGCCTCGACCTTTGCTTTAACTTGCCACCTAAGACAAATCAGGTGGAGGTTCCGAAGTTGACGTTCCATTTCAAAGGCGCCGATTTGGAGCTTCCGGGGGAGAACTACATGATCGGCGACTCGAAGGCGGAGTTGATATGTTTGACCATTGGGAGTTCGAACGGAATGTCCATCTTTGGGAATCTTCAGCAACAAAACATCATGGTTGTTCATGATCTTCAAGAAGAAACTGTGTCGTTTTTGCCTACTCAGTGTAGTGATATATGAAAAAGTTGAAGGGAATTTGTTCAATCAAAATGGAGTGAAATGATAGATTTAAGATTGTTTATATTATTAATTCAAGCTATTCAATTTGCAAGTTTATTAAGGGATTTTAGAACTTGTATCGAACAAGTTACATGCATGTTAG

Coding sequence (CDS)

ATGGCGGATTCGTTACGATATCTGATCGTCTCGGTTGTTCTATCGATTACCATGTTATTCATTCATACCTCGGCTTCAAGTTCGTCTCTCTCGAGGCGAGCTCTACGTCAACCTAAGTTGCCTAGTGATGGCTTTCGAGTGAGTCTTAACCATGTGGATCATGTCAAGAATTTGACGAGATTCGAGCGGTTGCAACGAGGAGTGGCGCGTGGGAAGACTAGATTGCATAGACTAAACGCCATGATGTTGGCTGCCAACATCGGTGTTGGTGGTCGAGTGCAGGCGCCGGTGGTGGCGGGTAATGGTGAGTTTCTTATGAAGTTGGCTATCGGATCTCCACCGAGAAGCTTCTCGGCGATCATGGACACGGGGAGTGACCTAATTTGGACACAGTGTAAGCCTTGTCAACAGTGTTTTGATCAAGCTACGCCTATTTTTGATCCGAAAGAATCTTCTTCTTTCTCTAAGATTTCTTGCTCCAGCGAACTCTGTGATGCTCTCCCGACATCGACATGTAGTAGCGATGAGTGCGAATATTTCTACACGTATGGTGATTATTCCTCTACCCATGGCGTTTTGGCTGCTGAGACCTTCACGTTTGGAGATTCAAGCCAAGACCAGGTATCGATCCCTGGACTTGGATTTGGATGTGGAGATGACAACGAAGGAGACGGGTTCAGCCAAGGTGAGGGGCTAGTGGGGCTTGGCCGAGGACCCTTATCGCTAGTTTCTCAACTAAAAGAACAGAAGTTTTCGTATTGTTTAACCGCCATTGATGACACGAAACCAAGCTCACTTTTGTTGGGATCTCTAGCAAATGTGAAACCTAAAGCATCCGATGGTGAAATCAAAACCACCCCATTGATAAGAAACCCATCTCAGCCATCTTTTTACTATCTTTCTCTACAAGGAATCTCGGTTGGTGGCACTCAATTACCAATACCAAAGAACACTTTTGAGCTCCATGATGATGGGAGTGGTGGCGTGATCATAGATTCAGGCACAACAATCACATACATTGAGAAAAATGCTTTCACTTTACTCAAAAAAGAGTTCGTTTCTCAAATGAAACTTCCCGTTGACGACTCGGGTACCAGTGGCCTCGACCTTTGCTTTAACTTGCCACCTAAGACAAATCAGGTGGAGGTTCCGAAGTTGACGTTCCATTTCAAAGGCGCCGATTTGGAGCTTCCGGGGGAGAACTACATGATCGGCGACTCGAAGGCGGAGTTGATATGTTTGACCATTGGGAGTTCGAACGGAATGTCCATCTTTGGGAATCTTCAGCAACAAAACATCATGGTTGTTCATGATCTTCAAGAAGAAACTGTGTCGTTTTTGCCTACTCAGTGTAGTGATATATGA

Protein sequence

MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTRFERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSNGMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI
Homology
BLAST of CmaCh03G001060 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 1.3e-119
Identity = 235/454 (51.76%), Postives = 308/454 (67.84%), Query Frame = 0

Query: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRAL-RQPKLPSDGFRVSLNHVDHVKNLT 60
           MA SL   +++  LSI  +F+   A + S SR AL  + +    GF++ L HVD  KNLT
Sbjct: 1   MASSLYSFLLA--LSIVYIFV---APTHSTSRTALNHRHEAKVTGFQIMLEHVDSGKNLT 60

Query: 61  RFERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSA 120
           +F+ L+R + RG  RL RL AM+     G  G V+  V AG+GE+LM L+IG+P + FSA
Sbjct: 61  KFQLLERAIERGSRRLQRLEAML----NGPSG-VETSVYAGDGEYLMNLSIGTPAQPFSA 120

Query: 121 IMDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEY 180
           IMDTGSDLIWTQC+PC QCF+Q+TPIF+P+ SSSFS + CSS+LC AL + TCS++ C+Y
Sbjct: 121 IMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQY 180

Query: 181 FYTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGP 240
            Y YGD S T G +  ET TFG      VSIP + FGCG++N+G G   G GLVG+GRGP
Sbjct: 181 TYGYGDGSETQGSMGTETLTFG-----SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGP 240

Query: 241 LSLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYY 300
           LSL SQL   KFSYC+T I  + PS+LLLGSLAN     S      T LI++   P+FYY
Sbjct: 241 LSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGS----PNTTLIQSSQIPTFYY 300

Query: 301 LSLQGISVGGTQLPIPKNTFELH-DDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKL 360
           ++L G+SVG T+LPI  + F L+ ++G+GG+IIDSGTT+TY   NA+  +++EF+SQ+ L
Sbjct: 301 ITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINL 360

Query: 361 PVDDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGS 420
           PV +  +SG DLCF  P   + +++P    HF G DLELP ENY I  S   LICL +GS
Sbjct: 361 PVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPSNG-LICLAMGS 420

Query: 421 SN-GMSIFGNLQQQNIMVVHDLQEETVSFLPTQC 452
           S+ GMSIFGN+QQQN++VV+D     VSF   QC
Sbjct: 421 SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmaCh03G001060 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 1.6e-114
Identity = 216/447 (48.32%), Postives = 302/447 (67.56%), Query Frame = 0

Query: 9   IVSVVLSITMLFIHTSASSSSLSRRAL--RQPKLPSDGFRVSLNHVDHVKNLTRFERLQR 68
           + SVVL + ++     A +SS SR  L     K P  G RV L  VD  KNLT++E ++R
Sbjct: 5   LYSVVLGLAIVSA-IVAPTSSTSRGTLLHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKR 64

Query: 69  GVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSD 128
            + RG+ R+  +NAM+ +++      ++ PV AG+GE+LM +AIG+P  SFSAIMDTGSD
Sbjct: 65  AIKRGERRMRSINAMLQSSS-----GIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSD 124

Query: 129 LIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDY 188
           LIWTQC+PC QCF Q TPIF+P++SSSFS + C S+ C  LP+ TC+++EC+Y Y YGD 
Sbjct: 125 LIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGYGDG 184

Query: 189 SSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQL 248
           S+T G +A ETFTF  S     S+P + FGCG+DN+G G   G GL+G+G GPLSL SQL
Sbjct: 185 STTQGYMATETFTFETS-----SVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQL 244

Query: 249 KEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGIS 308
              +FSYC+T+   + PS+L LGS A+  P+ S     +T LI +   P++YY++LQGI+
Sbjct: 245 GVGQFSYCMTSYGSSSPSTLALGSAASGVPEGS----PSTTLIHSSLNPTYYYITLQGIT 304

Query: 309 VGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTS 368
           VGG  L IP +TF+L DDG+GG+IIDSGTT+TY+ ++A+  + + F  Q+ LP  D  +S
Sbjct: 305 VGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSS 364

Query: 369 GLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN--GMSI 428
           GL  CF  P   + V+VP+++  F G  L L  +N +I  ++  +ICL +GSS+  G+SI
Sbjct: 365 GLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEG-VICLAMGSSSQLGISI 424

Query: 429 FGNLQQQNIMVVHDLQEETVSFLPTQC 452
           FGN+QQQ   V++DLQ   VSF+PTQC
Sbjct: 425 FGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmaCh03G001060 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 1.2e-69
Identity = 168/454 (37.00%), Postives = 245/454 (53.96%), Query Frame = 0

Query: 8   LIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTRFERLQRG 67
           L  SV+LS+ +L       SS     A  +PKL   GF   L H D  K+   +  ++  
Sbjct: 4   LFSSVLLSLCLL-------SSLFLSNANAKPKL---GFTADLIHRDSPKS-PFYNPMETS 63

Query: 68  VARGKTRLHR-LNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSD 127
             R +  +HR +N +          + Q  + + +GE+LM ++IG+PP    AI DTGSD
Sbjct: 64  SQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSD 123

Query: 128 LIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPT-STCSSDE--CEYFYTY 187
           L+WTQC PC  C+ Q  P+FDPK SS++  +SCSS  C AL   ++CS+++  C Y  +Y
Sbjct: 124 LLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY 183

Query: 188 GDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLV 247
           GD S T G +A +T T G S    + +  +  GCG +N G    +G G+VGLG GP+SL+
Sbjct: 184 GDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLI 243

Query: 248 SQLKEQ---KFSYCLTAIDDTK--PSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFY 307
            QL +    KFSYCL  +   K   S +  G+ A V    S   + +TPLI   SQ +FY
Sbjct: 244 KQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV----SGSGVVSTPLIAKASQETFY 303

Query: 308 YLSLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKL 367
           YL+L+ ISVG  Q+    +  E      G +IIDSGTT+T +    ++ L+    S +  
Sbjct: 304 YLTLKSISVGSKQIQYSGSDSE---SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDA 363

Query: 368 PVDDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGS 427
                  SGL LC++    T  ++VP +T HF GAD++L   N  +  S+ +L+C     
Sbjct: 364 EKKQDPQSGLSLCYS---ATGDLKVPVITMHFDGADVKLDSSNAFVQVSE-DLVCFAFRG 423

Query: 428 SNGMSIFGNLQQQNIMVVHDLQEETVSFLPTQCS 453
           S   SI+GN+ Q N +V +D   +TVSF PT C+
Sbjct: 424 SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmaCh03G001060 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 259.2 bits (661), Expect = 8.5e-68
Identity = 163/431 (37.82%), Postives = 239/431 (55.45%), Query Frame = 0

Query: 42  SDGFRVSLNHVDHV-KNLTRFE----RLQRGVARGKTRLHRLNAMMLAANI-------GV 101
           S    ++L+H+D +  N T  E    RLQR   R K+ +  L A +   N+       G 
Sbjct: 69  SSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKS-IATLAAQIPGRNVTHAPRPGGF 128

Query: 102 GGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDPK 161
              V + +  G+GE+  +L +G+P R    ++DTGSD++W QC PC++C+ Q+ PIFDP+
Sbjct: 129 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 188

Query: 162 ESSSFSKISCSSELCDALPTSTCSS--DECEYFYTYGDYSSTHGVLAAETFTFGDSSQDQ 221
           +S +++ I CSS  C  L ++ C++    C Y  +YGD S T G  + ET TF  +    
Sbjct: 189 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN---- 248

Query: 222 VSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLK---EQKFSYCLT-AIDDTKP 281
             + G+  GCG DNEG  F    GL+GLG+G LS   Q      QKFSYCL      +KP
Sbjct: 249 -RVKGVALGCGHDNEG-LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKP 308

Query: 282 SSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLP-IPKNTFELH 341
           SS++ G+ A  +        + TPL+ NP   +FYY+ L GISVGGT++P +  + F+L 
Sbjct: 309 SSVVFGNAAVSR------IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLD 368

Query: 342 DDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFNLPPKTNQVE 401
             G+GGVIIDSGT++T + + A+  ++  F    K        S  D CF+L    N+V+
Sbjct: 369 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDL-SNMNEVK 428

Query: 402 VPKLTFHFKGADLELPGENYMIG-DSKAELICLTIGSSNGMSIFGNLQQQNIMVVHDLQE 453
           VP +  HF+GAD+ LP  NY+I  D+  +      G+  G+SI GN+QQQ   VV+DL  
Sbjct: 429 VPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLAS 485

BLAST of CmaCh03G001060 vs. ExPASy Swiss-Prot
Match: Q7XV21 (Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2)

HSP 1 Score: 258.1 bits (658), Expect = 1.9e-67
Identity = 158/458 (34.50%), Postives = 239/458 (52.18%), Query Frame = 0

Query: 30  LSRRALRQPKLPSDGFRVSLNHVD----HVKNLTRFERLQRGVARGKTRLHRLN-AMMLA 89
           L+  AL     P   FR+ L  VD       NLT  E L+R + R + RL  +  A   A
Sbjct: 10  LALAALPASCAPPRSFRLELASVDASAADAANLTEHELLRRAIQRSRYRLAGIGMARGEA 69

Query: 90  ANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQATP 149
           A+       + P++   GE+L+KL IG+PP  F+A +DT SDLIWTQC+PC  C+ Q  P
Sbjct: 70  ASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDP 129

Query: 150 IFDPKESSSFSKISCSSELCDALPTSTCSSDE---CEYFYTYGDYSSTHGVLAAETFTFG 209
           +F+P+ SS+++ + CSS+ CD L    C  D+   C+Y YTY   ++T G LA +    G
Sbjct: 130 MFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG 189

Query: 210 DSSQDQVSIPGLGFGCGDDNEGDG-FSQGEGLVGLGRGPLSLVSQLKEQKFSYCLTAIDD 269
           +      +  G+ FGC   + G     Q  G+VGLGRGPLSLVSQL  ++F+YCL     
Sbjct: 190 ED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPPAS 249

Query: 270 TKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLPIPKNT-- 329
             P  L+LG+ A+    A++      P+ R+P  PS+YYL+L G+ +G   + +P  T  
Sbjct: 250 RIPGKLVLGADADAARNATNR--IAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTT 309

Query: 330 ---------------------FELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMK 389
                                  + D    G+IID  +TIT++E + +  L  +   +++
Sbjct: 310 TATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIR 369

Query: 390 LPVDDSGTSGLDLCFNLPPKT--NQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLT 449
           LP     + GLDLCF LP     ++V VP +   F G  L L        D ++ ++CL 
Sbjct: 370 LPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLM 429

Query: 450 IG--SSNGMSIFGNLQQQNIMVVHDLQEETVSFLPTQC 452
           +G   +  +SI GN QQQN+ V+++L+   V+F+ + C
Sbjct: 430 VGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460

BLAST of CmaCh03G001060 vs. ExPASy TrEMBL
Match: A0A6J1HXS0 (aspartic proteinase nepenthesin-1 OS=Cucurbita maxima OX=3661 GN=LOC111467205 PE=3 SV=1)

HSP 1 Score: 904.4 bits (2336), Expect = 1.9e-259
Identity = 454/454 (100.00%), Postives = 454/454 (100.00%), Query Frame = 0

Query: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60
           MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR
Sbjct: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60

Query: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120
           FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120

Query: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180
           MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF
Sbjct: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180

Query: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240
           YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL
Sbjct: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240

Query: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300
           SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL
Sbjct: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300

Query: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360
           SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV
Sbjct: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360

Query: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420
           DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN
Sbjct: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420

Query: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI
Sbjct: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 454

BLAST of CmaCh03G001060 vs. ExPASy TrEMBL
Match: A0A6J1EPU6 (aspartic proteinase nepenthesin-1 OS=Cucurbita moschata OX=3662 GN=LOC111435534 PE=3 SV=1)

HSP 1 Score: 884.4 bits (2284), Expect = 2.0e-253
Identity = 442/454 (97.36%), Postives = 450/454 (99.12%), Query Frame = 0

Query: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60
           MADSLRYLI+SVVLSITMLFIHTSASSSSLSRRAL QPKLPSDGFRVSLNHVDHVKNLTR
Sbjct: 1   MADSLRYLIISVVLSITMLFIHTSASSSSLSRRALWQPKLPSDGFRVSLNHVDHVKNLTR 60

Query: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120
           FERLQRGVARGKTRLHRLNAM+LAAN+GVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 61  FERLQRGVARGKTRLHRLNAMVLAANVGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120

Query: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180
           MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF
Sbjct: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180

Query: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240
           YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL
Sbjct: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240

Query: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300
           SLVSQLKEQKFSYCLTAIDDTKPSSLL+GSLANVKPKAS+GEIKTTPLIRNPSQPSFYYL
Sbjct: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLMGSLANVKPKASEGEIKTTPLIRNPSQPSFYYL 300

Query: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360
           SLQGISVGGTQLPIPK TFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV
Sbjct: 301 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360

Query: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420
           DDSGTSGLDLCFNLPP+T QVEVPKLTFHFKGADLELPGENYMIGDSKAELICL IGSS+
Sbjct: 361 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLAIGSSS 420

Query: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCS+I
Sbjct: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSEI 454

BLAST of CmaCh03G001060 vs. ExPASy TrEMBL
Match: A0A0A0KYT9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G554680 PE=3 SV=1)

HSP 1 Score: 753.4 bits (1944), Expect = 5.3e-214
Identity = 377/445 (84.72%), Postives = 405/445 (91.01%), Query Frame = 0

Query: 12  VVLSITMLFIHTSASSSSLSRRALRQP-KLPSDGFRVSLNHVDHVKNLTRFERLQRGVAR 71
           +++ IT LFI+T A SSSLSRRAL++P KLPS GFRV L HVDHVKNLTRFERL+RGVAR
Sbjct: 17  LIILITTLFINTLAFSSSLSRRALQKPNKLPSHGFRVRLKHVDHVKNLTRFERLRRGVAR 76

Query: 72  GKTRLHRLNAMML-AANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIW 131
           GK RLHRLNAM+L AAN  VG +V+APVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIW
Sbjct: 77  GKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIW 136

Query: 132 TQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDYSST 191
           TQCKPCQQCFDQ+TPIFDPK+SSSF KISCSSELC ALPTSTCSSD CEY YTYGD SST
Sbjct: 137 TQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSST 196

Query: 192 HGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLKEQ 251
            GVLA ETFTFGDS++DQ+SIPGLGFGCG+DN GDGFSQG GLVGLGRGPLSLVSQLKEQ
Sbjct: 197 QGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ 256

Query: 252 KFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGISVGG 311
           KF+YCLTAIDD+KPSSLLLGSLAN+ PK S  E+KTTPLI+NPSQPSFYYLSLQGISVGG
Sbjct: 257 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGG 316

Query: 312 TQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLD 371
           TQL IPK+TFELHDDGSGGVIIDSGTTITY+E +AFT LK EF++QM LPVDDSGT GLD
Sbjct: 317 TQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLD 376

Query: 372 LCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSNGMSIFGNLQ 431
           LCFNLP  TNQVEVPKLTFHFKGADLELPGENYMIGDSKA L+CL IGSS GMSIFGNLQ
Sbjct: 377 LCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQ 436

Query: 432 QQNIMVVHDLQEETVSFLPTQCSDI 455
           QQN MVVHDLQEET+SFLPTQC  I
Sbjct: 437 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of CmaCh03G001060 vs. ExPASy TrEMBL
Match: A0A5A7TD10 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold128G001020 PE=3 SV=1)

HSP 1 Score: 745.3 bits (1923), Expect = 1.5e-211
Identity = 377/454 (83.04%), Postives = 405/454 (89.21%), Query Frame = 0

Query: 4   SLRYL-IVSVVLSITMLFIHTSASSSSLSRRALRQP-KLPSDGFRVSLNHVDHVKNLTRF 63
           S  YL ++ +++ IT LFI+T A SSSLS RAL++P KLPS GFRV L HVDHVKNLTRF
Sbjct: 8   SFGYLQLLLLIVFITTLFINTLAFSSSLSTRALQKPNKLPSHGFRVRLKHVDHVKNLTRF 67

Query: 64  ERLQRGVARGKTRLHRLNAMML-AANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 123
           ERL+RGVARGK RLHRLNAM+L AAN  VG +V+APVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 68  ERLRRGVARGKNRLHRLNAMVLAAANASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAI 127

Query: 124 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 183
           MDTGSDLIWTQCKPCQQCFDQATPIFDPK+SSSFSKISC SELC ALPTSTCSSD CEY 
Sbjct: 128 MDTGSDLIWTQCKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYL 187

Query: 184 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 243
           YTYGD SST GVLA ETFTFGDS++DQ+SIPGLGFGCG+DN GDGFSQG GLVGLGRGPL
Sbjct: 188 YTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPL 247

Query: 244 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 303
           SLVSQLKEQKF+YCLTAIDD+KPSSLLLGSLAN+ PK S  E+K TPLI+NPSQPSFYYL
Sbjct: 248 SLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYL 307

Query: 304 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 363
           SLQGISVGGTQL IPK+TFELHDDGSGGVIIDSGTTITYIE  AF+ LK EF++QM LPV
Sbjct: 308 SLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPV 367

Query: 364 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 423
           DDSGT GLDLCFNLP  T QVEVPKLTFHFKGADLELPGENYMIGDSK  L+CL IGSS 
Sbjct: 368 DDSGTGGLDLCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSR 427

Query: 424 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQN MVVHDLQEET+SFLPTQC  I
Sbjct: 428 GMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of CmaCh03G001060 vs. ExPASy TrEMBL
Match: A0A5D3BTY9 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1738G00580 PE=3 SV=1)

HSP 1 Score: 745.3 bits (1923), Expect = 1.5e-211
Identity = 377/454 (83.04%), Postives = 405/454 (89.21%), Query Frame = 0

Query: 4   SLRYL-IVSVVLSITMLFIHTSASSSSLSRRALRQP-KLPSDGFRVSLNHVDHVKNLTRF 63
           S  YL ++ +++ IT LFI+T A SSSLSRRAL++P KLPS GF V L HVDHVKNLTRF
Sbjct: 8   SFGYLQLLLLIVFITTLFINTLAFSSSLSRRALQKPNKLPSHGFMVRLKHVDHVKNLTRF 67

Query: 64  ERLQRGVARGKTRLHRLNAMML-AANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 123
           ERL+RGVARGK RLHRLNAM+L AAN  VG +V+APVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 68  ERLRRGVARGKNRLHRLNAMVLAAANASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAI 127

Query: 124 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 183
           MDTGSDLIWTQCKPCQQCFDQATPIFDPK+SSSFSKISC SELC ALPTSTCSSD CEY 
Sbjct: 128 MDTGSDLIWTQCKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYL 187

Query: 184 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 243
           YTYGD SST GVLA ETFTFGDS++DQ+SIPGLGFGCG+DN GDGFSQG GLVGLGRGPL
Sbjct: 188 YTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPL 247

Query: 244 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 303
           SLVSQLKEQKF+YCLTAIDD+KPSSLLLGSLAN+ PK S  E+K TPLI+NPSQPSFYYL
Sbjct: 248 SLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYL 307

Query: 304 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 363
           SLQGISVGGTQL IPK+TFELHDDGSGGVIIDSGTTITYIE  AF+ LK EF++QM LPV
Sbjct: 308 SLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPV 367

Query: 364 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 423
           DDSGT GLDLCFNLP  T QVEVPKLTFHFKGADLELPGENYMIGDSK  L+CL IGSS 
Sbjct: 368 DDSGTGGLDLCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSR 427

Query: 424 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQN MVVHDLQEET+SFLPTQC  I
Sbjct: 428 GMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of CmaCh03G001060 vs. NCBI nr
Match: XP_022967799.1 (aspartic proteinase nepenthesin-1 [Cucurbita maxima])

HSP 1 Score: 904.4 bits (2336), Expect = 3.9e-259
Identity = 454/454 (100.00%), Postives = 454/454 (100.00%), Query Frame = 0

Query: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60
           MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR
Sbjct: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60

Query: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120
           FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120

Query: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180
           MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF
Sbjct: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180

Query: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240
           YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL
Sbjct: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240

Query: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300
           SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL
Sbjct: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300

Query: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360
           SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV
Sbjct: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360

Query: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420
           DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN
Sbjct: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420

Query: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI
Sbjct: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 454

BLAST of CmaCh03G001060 vs. NCBI nr
Match: XP_023543561.1 (aspartic proteinase nepenthesin-1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 886.3 bits (2289), Expect = 1.1e-253
Identity = 444/454 (97.80%), Postives = 450/454 (99.12%), Query Frame = 0

Query: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60
           MADSLRYLIVSVVLSITMLFIHTSASSSS SRRALRQPKLPSDGFRVSLNHVDHVKNLTR
Sbjct: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSHSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60

Query: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120
           FE+LQRGVARGKTRLHRLNAMMLAAN+GVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 61  FEQLQRGVARGKTRLHRLNAMMLAANVGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120

Query: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180
           MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF
Sbjct: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180

Query: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240
           YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL
Sbjct: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240

Query: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300
           SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKAS+GEIKTTPLIRNPSQPSFYYL
Sbjct: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASEGEIKTTPLIRNPSQPSFYYL 300

Query: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360
           SLQGISVGGTQLPIPK TFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV
Sbjct: 301 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360

Query: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420
           DDSGTSGLDLCFNLPP+T QVEVPKLTFHFKGADLELPGENYMIGDS+AELICL IGSS+
Sbjct: 361 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSRAELICLAIGSSS 420

Query: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI
Sbjct: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 454

BLAST of CmaCh03G001060 vs. NCBI nr
Match: KAG7033562.1 (Aspartic proteinase nepenthesin-1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 884.8 bits (2285), Expect = 3.2e-253
Identity = 443/454 (97.58%), Postives = 449/454 (98.90%), Query Frame = 0

Query: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60
           MADSLRYLI+SVVLSITMLFIHTSASSSSLSRRAL QPKLPSDGFRVSLNHVDHVKNLTR
Sbjct: 1   MADSLRYLIISVVLSITMLFIHTSASSSSLSRRALWQPKLPSDGFRVSLNHVDHVKNLTR 60

Query: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120
           FERLQRGVARGKTRLHRLNAMMLAAN+GVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 61  FERLQRGVARGKTRLHRLNAMMLAANVGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120

Query: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180
           MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF
Sbjct: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180

Query: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240
           YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL
Sbjct: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240

Query: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300
           SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKAS+GEIKTTPLI NPSQPSFYYL
Sbjct: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASEGEIKTTPLISNPSQPSFYYL 300

Query: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360
           SLQGISVGGTQLPIPK TFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV
Sbjct: 301 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360

Query: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420
           DDSGTSGLDLCFNLPP+T QVEVPKLTFHFKGADLELPGENYMIGDS+AELICL IGSS+
Sbjct: 361 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSRAELICLAIGSSS 420

Query: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI
Sbjct: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 454

BLAST of CmaCh03G001060 vs. NCBI nr
Match: XP_022928703.1 (aspartic proteinase nepenthesin-1 [Cucurbita moschata])

HSP 1 Score: 884.4 bits (2284), Expect = 4.1e-253
Identity = 442/454 (97.36%), Postives = 450/454 (99.12%), Query Frame = 0

Query: 1   MADSLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTR 60
           MADSLRYLI+SVVLSITMLFIHTSASSSSLSRRAL QPKLPSDGFRVSLNHVDHVKNLTR
Sbjct: 1   MADSLRYLIISVVLSITMLFIHTSASSSSLSRRALWQPKLPSDGFRVSLNHVDHVKNLTR 60

Query: 61  FERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120
           FERLQRGVARGKTRLHRLNAM+LAAN+GVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI
Sbjct: 61  FERLQRGVARGKTRLHRLNAMVLAANVGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 120

Query: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180
           MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF
Sbjct: 121 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 180

Query: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240
           YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL
Sbjct: 181 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 240

Query: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 300
           SLVSQLKEQKFSYCLTAIDDTKPSSLL+GSLANVKPKAS+GEIKTTPLIRNPSQPSFYYL
Sbjct: 241 SLVSQLKEQKFSYCLTAIDDTKPSSLLMGSLANVKPKASEGEIKTTPLIRNPSQPSFYYL 300

Query: 301 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360
           SLQGISVGGTQLPIPK TFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV
Sbjct: 301 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 360

Query: 361 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 420
           DDSGTSGLDLCFNLPP+T QVEVPKLTFHFKGADLELPGENYMIGDSKAELICL IGSS+
Sbjct: 361 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLAIGSSS 420

Query: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCS+I
Sbjct: 421 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSEI 454

BLAST of CmaCh03G001060 vs. NCBI nr
Match: KAG6603264.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 847.8 bits (2189), Expect = 4.3e-242
Identity = 421/431 (97.68%), Postives = 426/431 (98.84%), Query Frame = 0

Query: 24   SASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTRFERLQRGVARGKTRLHRLNAMML 83
            SASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTRFERLQRGVARGKTRLHRLNAMML
Sbjct: 915  SASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTRFERLQRGVARGKTRLHRLNAMML 974

Query: 84   AANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQAT 143
            AAN+GVGGRVQAPVV GNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQAT
Sbjct: 975  AANVGVGGRVQAPVVVGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQAT 1034

Query: 144  PIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDYSSTHGVLAAETFTFGDS 203
            PIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDYSSTHGVLAAETFTFGDS
Sbjct: 1035 PIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDYSSTHGVLAAETFTFGDS 1094

Query: 204  SQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLKEQKFSYCLTAIDDTKP 263
            SQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLKEQKFSYCLTAIDDTKP
Sbjct: 1095 SQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLKEQKFSYCLTAIDDTKP 1154

Query: 264  SSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLPIPKNTFELHD 323
            SSLLLGSLANVKPKAS+GEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLPIPK TFELHD
Sbjct: 1155 SSLLLGSLANVKPKASEGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLPIPKATFELHD 1214

Query: 324  DGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFNLPPKTNQVEV 383
            DGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFNLPP+T QVEV
Sbjct: 1215 DGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFNLPPETTQVEV 1274

Query: 384  PKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSNGMSIFGNLQQQNIMVVHDLQEET 443
            PKLTFHFKGADLELPGENYMI DS+AELICL IGSS+GMSIFGNLQQQNIMVVHDLQEET
Sbjct: 1275 PKLTFHFKGADLELPGENYMIEDSRAELICLAIGSSSGMSIFGNLQQQNIMVVHDLQEET 1334

Query: 444  VSFLPTQCSDI 455
            VSFLPTQCSDI
Sbjct: 1335 VSFLPTQCSDI 1345

BLAST of CmaCh03G001060 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 543.5 bits (1399), Expect = 1.6e-154
Identity = 277/458 (60.48%), Postives = 348/458 (75.98%), Query Frame = 0

Query: 8   LIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTRFERLQRG 67
           L+    L +    I  S+S  SL  R L +  LP  GFR+SL HVD  KNLT+ +++QRG
Sbjct: 9   LLFPFFLILFSCLISVSSSRRSLIDRTLPK-NLPRSGFRLSLRHVDSGKNLTKIQKIQRG 68

Query: 68  VARGKTRLHRLNA---MMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTG 127
           + RG  RL+RL A   + +A+       ++AP   G+GEFLM+L+IG+P   +SAI+DTG
Sbjct: 69  INRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTG 128

Query: 128 SDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDE--CEYFYT 187
           SDLIWTQCKPC +CFDQ TPIFDP++SSS+SK+ CSS LC+ALP S C+ D+  CEY YT
Sbjct: 129 SDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYT 188

Query: 188 YGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSL 247
           YGDYSST G+LA ETFTF    +D+ SI G+GFGCG +NEGDGFSQG GLVGLGRGPLSL
Sbjct: 189 YGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSL 248

Query: 248 VSQLKEQKFSYCLTAIDDTK-PSSLLLGSLA----NVKPKASDGEI-KTTPLIRNPSQPS 307
           +SQLKE KFSYCLT+I+D++  SSL +GSLA    N    + DGE+ KT  L+RNP QPS
Sbjct: 249 ISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPS 308

Query: 308 FYYLSLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQM 367
           FYYL LQGI+VG  +L + K+TFEL +DG+GG+IIDSGTTITY+E+ AF +LK+EF S+M
Sbjct: 309 FYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM 368

Query: 368 KLPVDDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTI 427
            LPVDDSG++GLDLCF LP     + VPK+ FHFKGADLELPGENYM+ DS   ++CL +
Sbjct: 369 SLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAM 428

Query: 428 GSSNGMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 455
           GSSNGMSIFGN+QQQN  V+HDL++ETVSF+PT+C  +
Sbjct: 429 GSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

BLAST of CmaCh03G001060 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 265.4 bits (677), Expect = 8.5e-71
Identity = 168/454 (37.00%), Postives = 245/454 (53.96%), Query Frame = 0

Query: 8   LIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVDHVKNLTRFERLQRG 67
           L  SV+LS+ +L       SS     A  +PKL   GF   L H D  K+   +  ++  
Sbjct: 4   LFSSVLLSLCLL-------SSLFLSNANAKPKL---GFTADLIHRDSPKS-PFYNPMETS 63

Query: 68  VARGKTRLHR-LNAMMLAANIGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSD 127
             R +  +HR +N +          + Q  + + +GE+LM ++IG+PP    AI DTGSD
Sbjct: 64  SQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSD 123

Query: 128 LIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPT-STCSSDE--CEYFYTY 187
           L+WTQC PC  C+ Q  P+FDPK SS++  +SCSS  C AL   ++CS+++  C Y  +Y
Sbjct: 124 LLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY 183

Query: 188 GDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLV 247
           GD S T G +A +T T G S    + +  +  GCG +N G    +G G+VGLG GP+SL+
Sbjct: 184 GDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLI 243

Query: 248 SQLKEQ---KFSYCLTAIDDTK--PSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFY 307
            QL +    KFSYCL  +   K   S +  G+ A V    S   + +TPLI   SQ +FY
Sbjct: 244 KQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV----SGSGVVSTPLIAKASQETFY 303

Query: 308 YLSLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKL 367
           YL+L+ ISVG  Q+    +  E      G +IIDSGTT+T +    ++ L+    S +  
Sbjct: 304 YLTLKSISVGSKQIQYSGSDSE---SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDA 363

Query: 368 PVDDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGS 427
                  SGL LC++    T  ++VP +T HF GAD++L   N  +  S+ +L+C     
Sbjct: 364 EKKQDPQSGLSLCYS---ATGDLKVPVITMHFDGADVKLDSSNAFVQVSE-DLVCFAFRG 423

Query: 428 SNGMSIFGNLQQQNIMVVHDLQEETVSFLPTQCS 453
           S   SI+GN+ Q N +V +D   +TVSF PT C+
Sbjct: 424 SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmaCh03G001060 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 264.2 bits (674), Expect = 1.9e-70
Identity = 155/427 (36.30%), Postives = 234/427 (54.80%), Query Frame = 0

Query: 41  PSDGFRVSLNHVDHVKN------LTRFERLQRGVARGKTRLHRLNAMMLAANIGVGGRVQ 100
           P DGF + L H D  K+       T  +R++  + R        + +  + +       Q
Sbjct: 22  PKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSAR-----STLQFSNDDASPNSPQ 81

Query: 101 APVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSF 160
           + + +  GE+LM ++IG+PP    AI DTGSDLIWTQC PC+ C+ Q +P+FDPKESS++
Sbjct: 82  SFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTY 141

Query: 161 SKISCSSELCDALPTSTCSSDE--CEYFYTYGDYSSTHGVLAAETFTFGDSSQDQVSIPG 220
            K+SCSS  C AL  ++CS+DE  C Y  TYGD S T G +A +T T G S +  VS+  
Sbjct: 142 RKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRN 201

Query: 221 LGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLKEQ---KFSYCLTAI--DDTKPSSLL 280
           +  GCG +N G     G G++GLG G  SLVSQL++    KFSYCL     +    S + 
Sbjct: 202 MIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKIN 261

Query: 281 LGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLPIPKNTFELHDDGSG 340
            G+   V   + DG + T+ + ++P+  ++Y+L+L+ ISVG  ++      F     G G
Sbjct: 262 FGTNGIV---SGDGVVSTSMVKKDPA--TYYFLNLEAISVGSKKIQFTSTIF---GTGEG 321

Query: 341 GVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFNLPPKTNQVEVPKLT 400
            ++IDSGTT+T +  N +  L+    S +K          L LC+     ++  +VP +T
Sbjct: 322 NIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYR---DSSSFKVPDIT 381

Query: 401 FHFKGADLELPGENYMIGDSKAELICLTIGSSNGMSIFGNLQQQNIMVVHDLQEETVSFL 455
            HFKG D++L   N  +  S+ ++ C    ++  ++IFGNL Q N +V +D    TVSF 
Sbjct: 382 VHFKGGDVKLGNLNTFVAVSE-DVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFK 431

BLAST of CmaCh03G001060 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 261.5 bits (667), Expect = 1.2e-69
Identity = 168/451 (37.25%), Postives = 235/451 (52.11%), Query Frame = 0

Query: 22  HTSASSSSLSRRALRQPKLPSDGFRVSLNHVDH--VKNLTRFERLQRGVARGKTRLHRLN 81
           H+++SS SL   +           RVS+   +H   K+LT   RL R  AR K+ + RL+
Sbjct: 61  HSASSSFSLQLHS-----------RVSVRGTEHSDYKSLT-LARLNRDTARVKSLITRLD 120

Query: 82  AMMLAANIGVGG-------------RVQAPVVA----GNGEFLMKLAIGSPPRSFSAIMD 141
             +   NI                  ++AP+++    G+GE+  ++ IG P R    ++D
Sbjct: 121 --LAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLD 180

Query: 142 TGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYT 201
           TGSD+ W QC PC  C+ Q  PIF+P  SSS+  +SC +  C+AL  S C +  C Y  +
Sbjct: 181 TGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVS 240

Query: 202 YGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSL 261
           YGD S T G  A ET T G +    V++     GCG  NEG  F    GL+GLG G L+L
Sbjct: 241 YGDGSYTVGDFATETLTIGSTLVQNVAV-----GCGHSNEG-LFVGAAGLLGLGGGLLAL 300

Query: 262 VSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSL 321
            SQL    FSYCL   D    S++  G+  ++ P A        PL+RN    +FYYL L
Sbjct: 301 PSQLNTTSFSYCLVDRDSDSASTVDFGT--SLSPDA-----VVAPLLRNHQLDTFYYLGL 360

Query: 322 QGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDD 381
            GISVGG  L IP+++FE+ + GSGG+IIDSGT +T ++   +  L+  FV         
Sbjct: 361 TGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKA 420

Query: 382 SGTSGLDLCFNLPPKTNQVEVPKLTFHFKGAD-LELPGENYMIGDSKAELICLTIG-SSN 441
           +G +  D C+NL  KT  VEVP + FHF G   L LP +NYMI        CL    +++
Sbjct: 421 AGVAMFDTCYNLSAKTT-VEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTAS 480

Query: 442 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQC 452
            ++I GN+QQQ   V  DL    + F   +C
Sbjct: 481 SLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of CmaCh03G001060 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 259.2 bits (661), Expect = 6.1e-69
Identity = 163/431 (37.82%), Postives = 239/431 (55.45%), Query Frame = 0

Query: 42  SDGFRVSLNHVDHV-KNLTRFE----RLQRGVARGKTRLHRLNAMMLAANI-------GV 101
           S    ++L+H+D +  N T  E    RLQR   R K+ +  L A +   N+       G 
Sbjct: 69  SSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKS-IATLAAQIPGRNVTHAPRPGGF 128

Query: 102 GGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDPK 161
              V + +  G+GE+  +L +G+P R    ++DTGSD++W QC PC++C+ Q+ PIFDP+
Sbjct: 129 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 188

Query: 162 ESSSFSKISCSSELCDALPTSTCSS--DECEYFYTYGDYSSTHGVLAAETFTFGDSSQDQ 221
           +S +++ I CSS  C  L ++ C++    C Y  +YGD S T G  + ET TF  +    
Sbjct: 189 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN---- 248

Query: 222 VSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPLSLVSQLK---EQKFSYCLT-AIDDTKP 281
             + G+  GCG DNEG  F    GL+GLG+G LS   Q      QKFSYCL      +KP
Sbjct: 249 -RVKGVALGCGHDNEG-LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKP 308

Query: 282 SSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLP-IPKNTFELH 341
           SS++ G+ A  +        + TPL+ NP   +FYY+ L GISVGGT++P +  + F+L 
Sbjct: 309 SSVVFGNAAVSR------IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLD 368

Query: 342 DDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFNLPPKTNQVE 401
             G+GGVIIDSGT++T + + A+  ++  F    K        S  D CF+L    N+V+
Sbjct: 369 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDL-SNMNEVK 428

Query: 402 VPKLTFHFKGADLELPGENYMIG-DSKAELICLTIGSSNGMSIFGNLQQQNIMVVHDLQE 453
           VP +  HF+GAD+ LP  NY+I  D+  +      G+  G+SI GN+QQQ   VV+DL  
Sbjct: 429 VPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLAS 485

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q766C31.3e-11951.76Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C21.6e-11448.32Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q6XBF81.2e-6937.00Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q9LNJ38.5e-6837.82Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q7XV211.9e-6734.50Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2[more]
Match NameE-valueIdentityDescription
A0A6J1HXS01.9e-259100.00aspartic proteinase nepenthesin-1 OS=Cucurbita maxima OX=3661 GN=LOC111467205 PE... [more]
A0A6J1EPU62.0e-25397.36aspartic proteinase nepenthesin-1 OS=Cucurbita moschata OX=3662 GN=LOC111435534 ... [more]
A0A0A0KYT95.3e-21484.72Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G55468... [more]
A0A5A7TD101.5e-21183.04Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
A0A5D3BTY91.5e-21183.04Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
Match NameE-valueIdentityDescription
XP_022967799.13.9e-259100.00aspartic proteinase nepenthesin-1 [Cucurbita maxima][more]
XP_023543561.11.1e-25397.80aspartic proteinase nepenthesin-1 [Cucurbita pepo subsp. pepo][more]
KAG7033562.13.2e-25397.58Aspartic proteinase nepenthesin-1 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022928703.14.1e-25397.36aspartic proteinase nepenthesin-1 [Cucurbita moschata][more]
KAG6603264.14.3e-24297.68Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
AT2G03200.11.6e-15460.48Eukaryotic aspartyl protease family protein [more]
AT5G33340.18.5e-7137.00Eukaryotic aspartyl protease family protein [more]
AT1G64830.11.9e-7036.30Eukaryotic aspartyl protease family protein [more]
AT1G25510.11.2e-6937.25Eukaryotic aspartyl protease family protein [more]
AT1G01300.16.1e-6937.82Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 110..130
score: 43.19
coord: 329..340
score: 38.11
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 297..446
e-value: 1.9E-35
score: 122.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 92..269
e-value: 3.6E-56
score: 192.2
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 271..454
e-value: 1.7E-55
score: 189.6
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 97..452
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 104..270
e-value: 7.8E-53
score: 179.3
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 16..452
NoneNo IPR availablePANTHERPTHR47967:SF23OS08G0469000 PROTEINcoord: 16..452
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 329..340
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 104..447
score: 41.290951
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 103..451
e-value: 3.39579E-111
score: 326.913

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh03G001060.1CmaCh03G001060.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity