CmaCh17G000520 (gene) Cucurbita maxima (Rimu)

Name: CmaCh17G000520
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Eukaryotic aspartyl protease family protein, putative
Location: Cma_Chr17 : 225122 .. 226476 (+)
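As a small illustration of how the location field can be read, the sketch below parses the location string and computes the span it implies. It assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text: str) -> dict:
    """Parse a 'seqid : start .. end (strand)' location string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record itself).
    """
    match = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates, so both endpoints count.
        "length": end - start + 1,
    }

location = parse_location("Cma_Chr17 : 225122 .. 226476 (+)")
print(location["seqid"], location["strand"], location["length"])
```

Under that assumption the feature spans 1355 bases on the plus strand of Cma_Chr17.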
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTGATCCGAGTAAATGGTCGACTTACAAAACAGTCTCGAGTTCCTCGCCGACTTGCTCGATTACAGGGCCGGGAAATTCTTGTTCCTCGGATTCCGTGTGCGAGTACTCCATTTCATACGGCGATGGATCCCACAGCAACGGGGATATTGCCGTTGATACCCTTACGATGGACTCCACCTCCGGCCGCCCCGTGGCGTTTCCACGGATTGCGATTGGCTGTGGCCATGACAATGCTGGCTCTTTTGATTCTAAGGTTTCTGGGATTGTTGGGCTCAGTCATGGTTCAGCTTCGCTTGTCCAGCAGATGGGGCCGGCCACCGGCGGAAAATTCTCTTACTGTTTGGCACCCATCGGAAACTCTAACTACTCGAGCTATCTTAACTTCGGCTCTAATGCTGTCGGTCTCTGGCTCTAGAGCCGTCTCGACTCCAATTTATACTAGCGGTAAATACAAACGTTTTTTTATCGATAAAATGATTAGTGGGTAGTTAAACGTCTCATTAAGAAAAAATCATGGGTAAAAAAAAAATCATAGGTATATAACTAATAGATCGATAATACTAAAGTCATGCTCTTAATTTAATCGAATGTTTGGATGTTGAGCAAGCAAGTCGTGTGCTCTATAGAAAAGGAGTTAACTTGGATTTAGGGTTTAAAAGGGGATCACACTGGTAATGCGCTCCATAGAAAAGGAGTCAGACCAGTATTTAAAGTTTAAAAGTGGATAATAATGGTCGTGCTCTCGAAAAAAGGAGTTGACCTAAATTTAGGGTTTAAAAGTGGATGATATTATATTTTTACTGTGAGTTAGTTATACTAAGATGAATTGTATAAATAATAATTATTTTTTTGTTCATGTTTATAGAAGGTGACTACAAAATATTCTACCTCCTGAAAATAAAAGCAATGAGTGTTGGAAGCAACAAATTTAATTTTTTGAGATCTTCACCATTTGGAACAAACGGGAACATCATTATCGACTCTGGCACGACGCTTACATTCTTACAACTGGACATCTTTGCTAGCTTCTCTCAGGCAATTTCGGAGGTGATGGACCTCAAGTCCATGACTAGTCCAATTCAAACCTTGGAGTATTGCTATGAGAGAACCACCAACGACTATAAGGTGCCGCCTGTCACAGCGCACTTCAAAGACGACGACGTGAATCTCAAGCGAGAAAATCTGTTCATTAGGGTGGTGGACGACGTCGTTTGCTTGGCATTTGTTGGCAACAGCAGGGAAAACAACATGCAAATCTATGGCAACATTGCACCGACTAACTTCTTGGTTAGCTGTAATATCAAGAAATCATCTATTTTTTTCAAGCCAGCAAATTGTGCTGCCTCGTGA

mRNA sequence

ATGTTTGATCCGAGTAAATGGTCGACTTACAAAACAGTCTCGAGTTCCTCGCCGACTTGCTCGATTACAGGGCCGGGAAATTCTTGTTCCTCGGATTCCGTGTGCGAGTACTCCATTTCATACGGCGATGGATCCCACAGCAACGGGGATATTGCCGTTGATACCCTTACGATGGACTCCACCTCCGGCCGCCCCGTGGCGTTTCCACGGATTGCGATTGGCTGTGGCCATGACAATGCTGGCTCTTTTGATTCTAAGGTTTCTGGGATTGTTGGGCTCAGTCATGGTTCAGCTTCGCTTGTCCAGCAGATGGGGCCGGCCACCGGCGGAAAATTCTCTTACTGTTTGGCACCCATCGGAAACTCTAACTACTCGAGCTATCTTAACTTCGGCTCTAATGCTTGGGTAGTTAAACGTCTCATTAAGAAAAAATCATGGGGTTTAAAAGGGGATCACACTGAAGGTGACTACAAAATATTCTACCTCCTGAAAATAAAAGCAATGAGTGTTGGAAGCAACAAATTTAATTTTTTGAGATCTTCACCATTTGGAACAAACGGGAACATCATTATCGACTCTGGCACGACGCTTACATTCTTACAACTGGACATCTTTGCTAGCTTCTCTCAGGCAATTTCGGAGGTGATGGACCTCAAGTCCATGACTAGTCCAATTCAAACCTTGGAGTATTGCTATGAGAGAACCACCAACGACTATAAGGTGCCGCCTGTCACAGCGCACTTCAAAGACGACGACGTGAATCTCAAGCGAGAAAATCTGTTCATTAGGGTGGTGGACGACGTCGTTTGCTTGGCATTTGTTGGCAACAGCAGGGAAAACAACATGCAAATCTATGGCAACATTGCACCGACTAACTTCTTGGTTAGCTGTAATATCAAGAAATCATCTATTTTTTTCAAGCCAGCAAATTGTGCTGCCTCGTGA

Coding sequence (CDS)

ATGTTTGATCCGAGTAAATGGTCGACTTACAAAACAGTCTCGAGTTCCTCGCCGACTTGCTCGATTACAGGGCCGGGAAATTCTTGTTCCTCGGATTCCGTGTGCGAGTACTCCATTTCATACGGCGATGGATCCCACAGCAACGGGGATATTGCCGTTGATACCCTTACGATGGACTCCACCTCCGGCCGCCCCGTGGCGTTTCCACGGATTGCGATTGGCTGTGGCCATGACAATGCTGGCTCTTTTGATTCTAAGGTTTCTGGGATTGTTGGGCTCAGTCATGGTTCAGCTTCGCTTGTCCAGCAGATGGGGCCGGCCACCGGCGGAAAATTCTCTTACTGTTTGGCACCCATCGGAAACTCTAACTACTCGAGCTATCTTAACTTCGGCTCTAATGCTTGGGTAGTTAAACGTCTCATTAAGAAAAAATCATGGGGTTTAAAAGGGGATCACACTGAAGGTGACTACAAAATATTCTACCTCCTGAAAATAAAAGCAATGAGTGTTGGAAGCAACAAATTTAATTTTTTGAGATCTTCACCATTTGGAACAAACGGGAACATCATTATCGACTCTGGCACGACGCTTACATTCTTACAACTGGACATCTTTGCTAGCTTCTCTCAGGCAATTTCGGAGGTGATGGACCTCAAGTCCATGACTAGTCCAATTCAAACCTTGGAGTATTGCTATGAGAGAACCACCAACGACTATAAGGTGCCGCCTGTCACAGCGCACTTCAAAGACGACGACGTGAATCTCAAGCGAGAAAATCTGTTCATTAGGGTGGTGGACGACGTCGTTTGCTTGGCATTTGTTGGCAACAGCAGGGAAAACAACATGCAAATCTATGGCAACATTGCACCGACTAACTTCTTGGTTAGCTGTAATATCAAGAAATCATCTATTTTTTTCAAGCCAGCAAATTGTGCTGCCTCGTGA

Protein sequence

MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDSTSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIGNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFLRSSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSCNIKKSSIFFKPANCAAS
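The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly, and it only translates the first nine and last five codons of the CDS rather than the whole sequence. The helper name `translate` is hypothetical.

```python
# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Unknown codons (for example, those containing N) become 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGTTTGATCCGAGTAAATGGTCGACT"  # first nine codons of the CDS
cds_end = "TGTGCTGCCTCGTGA"                # last five codons, including the stop

print(translate(cds_start))  # MFDPSKWST
print(translate(cds_end))    # CAAS
```

Both fragments agree with the start (MFDPSKWST) and end (CAAS) of the listed protein sequence.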
BLAST of CmaCh17G000520 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 238.4 bits (607), Expect = 1.0e-61
Identity = 135/314 (42.99%), Positives = 185/314 (58.92%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSS-DSVCEYSISYGDGSHSNGDIAVDTLTMD 60
           +FDP   STYK VS SS  C+      SCS+ D+ C YS+SYGD S++ G+IAVDTLT+ 
Sbjct: 131 LFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLG 190

Query: 61  STSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI 120
           S+  RP+    I IGCGH+NAG+F+ K SGIVGL  G  SL++Q+G +  GKFSYCL P+
Sbjct: 191 SSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPL 250

Query: 121 -GNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFL 180
               + +S +NFG+NA V    +       K        + FY L +K++SVGS +  + 
Sbjct: 251 TSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQ-----ETFYYLTLKSISVGSKQIQYS 310

Query: 181 RSSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTND 240
            S    + GNIIIDSGTTLT L  + ++    A++  +D +    P   L  CY   T D
Sbjct: 311 GSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYS-ATGD 370

Query: 241 YKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSCN 300
            KVP +T HF   DV L   N F++V +D+VC AF G+    +  IYGN+A  NFLV  +
Sbjct: 371 LKVPVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGSP---SFSIYGNVAQMNFLVGYD 430

Query: 301 IKKSSIFFKPANCA 313
               ++ FKP +CA
Sbjct: 431 TVSKTVSFKPTDCA 435

BLAST of CmaCh17G000520 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 4.1e-50
Identity = 119/326 (36.50%), Positives = 187/326 (57.36%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTC-SITGPGNSCS-SDSVCEYSISYGDGSHSNGDIAVDTLTM 60
           +FD  K STYK+    S  C +++     C  S+++C+Y  SYGD S S GD+A +T+++
Sbjct: 126 IFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSI 185

Query: 61  DSTSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLA- 120
           DS SG PV+FP    GCG++N G+FD   SGI+GL  G  SL+ Q+G +   KFSYCL+ 
Sbjct: 186 DSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSH 245

Query: 121 PIGNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNF 180
               +N +S +N G+N+  +   + K S  +     + +   +Y L ++A+SVG  K  +
Sbjct: 246 KSATTNGTSVINLGTNS--IPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPY 305

Query: 181 LRSS--------PFGTNGNIIIDSGTTLTFLQLDIFASFSQAISE-VMDLKSMTSPIQTL 240
             SS           T+GNIIIDSGTTLT L+   F  FS A+ E V   K ++ P   L
Sbjct: 306 TGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLL 365

Query: 241 EYCYERTTNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNI 300
            +C++  + +  +P +T HF   DV L   N F+++ +D+VCL+ V  +    + IYGN 
Sbjct: 366 SHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTT---EVAIYGNF 425

Query: 301 APTNFLVSCNIKKSSIFFKPANCAAS 315
           A  +FLV  +++  ++ F+  +C+A+
Sbjct: 426 AQMDFLVGYDLETRTVSFQHMDCSAN 446

BLAST of CmaCh17G000520 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 5.6e-31
Identity = 104/322 (32.30%), Positives = 156/322 (48.45%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTC-SITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMD 60
           +F+P   S++ T+  SS  C +++ P  +CS++  C+Y+  YGDGS + G +  +TLT  
Sbjct: 136 IFNPQGSSSFSTLPCSSQLCQALSSP--TCSNN-FCQYTYGYGDGSETQGSMGTETLTFG 195

Query: 61  STSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI 120
           S     V+ P I  GCG +N G      +G+VG+  G  SL  Q+      KFSYC+ PI
Sbjct: 196 S-----VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPI 255

Query: 121 GNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKF---- 180
           G+S  S+ L  GS A  V       +              FY + +  +SVGS +     
Sbjct: 256 GSSTPSNLL-LGSLANSVTAGSPNTTL-----IQSSQIPTFYYITLNGLSVGSTRLPIDP 315

Query: 181 -NFLRSSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYER 240
             F  +S  GT G IIIDSGTTLT+   + + S  Q     ++L  +       + C++ 
Sbjct: 316 SAFALNSNNGT-GGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQT 375

Query: 241 TT--NDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTN 300
            +  ++ ++P    HF   D+ L  EN FI   + ++CLA    S    M I+GNI   N
Sbjct: 376 PSDPSNLQIPTFVMHFDGGDLELPSENYFISPSNGLICLAM--GSSSQGMSIFGNIQQQN 435

Query: 301 FLVSCNIKKSSIFFKPANCAAS 315
            LV  +   S + F  A C AS
Sbjct: 436 MLVVYDTGNSVVSFASAQCGAS 437

BLAST of CmaCh17G000520 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 2.8e-30
Identity = 100/321 (31.15%), Positives = 168/321 (52.34%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +F+P   S++ T+   S  C    P  +C+++  C+Y+  YGDGS + G +A +T T ++
Sbjct: 137 IFNPQDSSSFSTLPCESQYCQDL-PSETCNNNE-CQYTYGYGDGSTTQGYMATETFTFET 196

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
           +S      P IA GCG DN G      +G++G+  G  SL  Q+G    G+FSYC+   G
Sbjct: 197 SS-----VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV---GQFSYCMTSYG 256

Query: 121 NSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFLRS 180
           +S+ S+ L  GS A  V       S  L        Y   Y + ++ ++VG +    + S
Sbjct: 257 SSSPST-LALGSAASGVPE--GSPSTTLIHSSLNPTY---YYITLQGITVGGDNLG-IPS 316

Query: 181 SPF-----GTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERT 240
           S F     GT G +IIDSGTTLT+L  D + + +QA ++ ++L ++      L  C+++ 
Sbjct: 317 STFQLQDDGT-GGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQP 376

Query: 241 T--NDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNF 300
           +  +  +VP ++  F    +NL  +N+ I   + V+CLA +G+S +  + I+GNI     
Sbjct: 377 SDGSTVQVPEISMQFDGGVLNLGEQNILISPAEGVICLA-MGSSSQLGISIFGNIQQQET 436

Query: 301 LVSCNIKKSSIFFKPANCAAS 315
            V  +++  ++ F P  C AS
Sbjct: 437 QVLYDLQNLAVSFVPTQCGAS 438

BLAST of CmaCh17G000520 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 8.9e-29
Identity = 101/318 (31.76%), Positives = 142/318 (44.65%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +FDP K  TY T+  SSP C         +    C Y +SYGDGS + GD + +TLT   
Sbjct: 183 IFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-- 242

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
              R      +A+GCGHDN G F    +G++GL  G  S   Q G     KFSYCL    
Sbjct: 243 ---RRNRVKGVALGCGHDNEGLFVG-AAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS 302

Query: 121 NSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFLRS 180
            S+  S + FG+ A  V R+ +          +      FY + +  +SVG  +   + +
Sbjct: 303 ASSKPSSVVFGNAA--VSRIAR-----FTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTA 362

Query: 181 SPFGT----NGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTT 240
           S F      NG +IIDSGT++T L    + +   A                 + C++ + 
Sbjct: 363 SLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSN 422

Query: 241 -NDYKVPPVTAHFKDDDVNLKRENLFIRV-VDDVVCLAFVGNSRENNMQIYGNIAPTNFL 300
            N+ KVP V  HF+  DV+L   N  I V  +   C AF G      + I GNI    F 
Sbjct: 423 MNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTM--GGLSIIGNIQQQGFR 482

Query: 301 VSCNIKKSSIFFKPANCA 313
           V  ++  S + F P  CA
Sbjct: 483 VVYDLASSRVGFAPGGCA 485

BLAST of CmaCh17G000520 vs. TrEMBL
Match: A0A0A0K928_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078650 PE=3 SV=1)

HSP 1 Score: 339.3 bits (869), Expect = 4.8e-90
Identity = 180/316 (56.96%), Positives = 226/316 (71.52%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           MFDPSK +TYK V+ SSP CS +G G+SCS DS C YSI+YGD SHS G++AVDT+TM S
Sbjct: 124 MFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQS 183

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI- 120
           TSGRPVAFPR  IGCGHDNAG+F++ VSGIVGL  G ASLV Q+GPATGGKFSYCL PI 
Sbjct: 184 TSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIG 243

Query: 121 -GNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFL 180
            G++N S+ LNFGSNA      +          ++   YK FY LK++A+SVG  KFNF 
Sbjct: 244 TGSTNDSTKLNFGSNA-----NVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFP 303

Query: 181 R-SSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTN 240
             +S  G   NIIIDSGTTLT+L   +  SF  AIS+ M L     P + L+YC+  TT+
Sbjct: 304 EGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTD 363

Query: 241 DYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSC 300
           DY++PPVT HF+  DV L+RENLF+R+ DD +CLAF G+  ++N+ IYGNIA +NFLV  
Sbjct: 364 DYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAF-GSFPDDNIFIYGNIAQSNFLVGY 423

Query: 301 NIKKSSIFFKPANCAA 314
           +IK  ++ F+PA+C A
Sbjct: 424 DIKNLAVSFQPAHCGA 433

BLAST of CmaCh17G000520 vs. TrEMBL
Match: A0A0A0K9V4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078630 PE=3 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 5.5e-86
Identity = 180/316 (56.96%), Positives = 222/316 (70.25%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           MF+PSK +TY+ VS SSP CS TG  NSCS    C YSISYGD SHS GD AVDTLTM S
Sbjct: 126 MFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGS 185

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
           TSGR VAFPR AIGCGHDNAGSFD+ VSGIVGL  G ASL++QMG A GGKFSYCL PIG
Sbjct: 186 TSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIG 245

Query: 121 NSN-YSSYLNFGSNAWVVKRLIKKKSWGLKGD-HTEGDYKIFYLLKIKAMSVG-SNKFNF 180
           N +  S+ LNFGSNA V        S  +    +    +K FY LK+KA+SVG +N F  
Sbjct: 246 NDDGGSNKLNFGSNANV------SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYS 305

Query: 181 LRSSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTN 240
             +S  G   NIIIDSGTTLT L +D++ +F++AIS  ++L+    P Q LEYC+E TT+
Sbjct: 306 TANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTD 365

Query: 241 DYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSC 300
           DYKVP +  HF+  ++ L+REN+ IRV D+V+CLAF G +++N++ IYGNIA  NFLV  
Sbjct: 366 DYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAG-AQDNDISIYGNIAQINFLVGY 425

Query: 301 NIKKSSIFFKPANCAA 314
           ++   S+ FKP NC A
Sbjct: 426 DVTNMSLSFKPMNCVA 434

BLAST of CmaCh17G000520 vs. TrEMBL
Match: A0A059C519_EUCGR (Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_E01757 PE=3 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.1e-65
Identity = 139/323 (43.03%), Positives = 194/323 (60.06%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +FDPSK STYK VS  +  C +    +     S+CEYS +YGD S++ G++A DT T+ S
Sbjct: 100 LFDPSKSSTYKEVSCQTSQCEVVRQTSCGGGGSLCEYSYAYGDQSYTQGNLATDTFTLGS 159

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
           TSGRPV+FP++  GCGH N G+FD++V G+ GL  G ASLV Q+G ATGGKFSYCLAP  
Sbjct: 160 TSGRPVSFPKLVFGCGHSNGGTFDNRVDGLFGLGGGDASLVTQLGTATGGKFSYCLAPTS 219

Query: 121 NSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHT------EGDYKIFYLLKIKAMSVGSNK 180
               +S LNFG+NA            G+ GD        + D K FY L ++ +SVG  K
Sbjct: 220 PDEKTSKLNFGANA------------GVTGDGAVSTPLIQKDPKTFYYLSLEEVSVGETK 279

Query: 181 FNFLR--SSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCY 240
            +F    SS     GNIIIDSGTTLT L  D+++    A+++ +DL   + P Q L  C+
Sbjct: 280 IDFPSDGSSSSADEGNIIIDSGTTLTLLPQDLYSQIEDAVAKAVDLPKASDPTQLLSLCF 339

Query: 241 E-RTTNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPT 300
              +     +P VT HFK  DV L   N F++V D ++CL+F    R   + I+GN+A  
Sbjct: 340 RVESDAQLSLPTVTFHFKGADVELSPTNTFVQVADGIICLSF----RPEKVSIFGNLAQI 399

Query: 301 NFLVSCNIKKSSIFFKPANCAAS 315
           N+L+  +I+ S ++FKP +CA++
Sbjct: 400 NYLIGYDIQNSKLYFKPVDCASN 406

BLAST of CmaCh17G000520 vs. TrEMBL
Match: I1M0V7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_13G200600 PE=3 SV=2)

HSP 1 Score: 253.8 bits (647), Expect = 2.7e-64
Identity = 146/316 (46.20%), Positives = 195/316 (61.71%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +FDPSK  TYKT+  SS TC       +CSSD+VCEYSI YGDGSHS+GD++V+TLT+ S
Sbjct: 132 IFDPSKSKTYKTLPCSSNTCESLR-NTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGS 191

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI- 120
           T G  V FP+  IGCGH+N G+F  + SGIVGL  G  SL+ Q+  + GGKFSYCLAPI 
Sbjct: 192 TDGSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIF 251

Query: 121 GNSNYSSYLNFGSNAWVVKR-LIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFL 180
             SN SS LNFG  A V  R  +      L G       ++FY L ++A SVG N+  F 
Sbjct: 252 SESNSSSKLNFGDAAVVSGRGTVSTPLDPLNG-------QVFYFLTLEAFSVGDNRIEFS 311

Query: 181 RSSPFGT---NGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERT 240
            SS  G+   +GNIIIDSGTTLT L  + + +   A+S+V+ L+    P + L  CY+ T
Sbjct: 312 GSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTT 371

Query: 241 TNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLV 300
           +++  +P +TAHFK  DV L   + F+ V   VVC AF+ +       I+GN+A  N LV
Sbjct: 372 SDELDLPVITAHFKGADVELNPISTFVPVEKGVVCFAFISSKIG---AIFGNLAQQNLLV 431

Query: 301 SCNIKKSSIFFKPANC 312
             ++ K ++ FKP +C
Sbjct: 432 GYDLVKKTVSFKPTDC 436

BLAST of CmaCh17G000520 vs. TrEMBL
Match: A0A0B2RZL7_GLYSO (Putative aspartic protease OS=Glycine soja GN=glysoja_007342 PE=3 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 2.7e-64
Identity = 146/316 (46.20%), Positives = 195/316 (61.71%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +FDPSK  TYKT+  SS TC       +CSSD+VCEYSI YGDGSHS+GD++V+TLT+ S
Sbjct: 98  IFDPSKSKTYKTLPCSSNTCESLR-NTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGS 157

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI- 120
           T G  V FP+  IGCGH+N G+F  + SGIVGL  G  SL+ Q+  + GGKFSYCLAPI 
Sbjct: 158 TDGSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIF 217

Query: 121 GNSNYSSYLNFGSNAWVVKR-LIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFL 180
             SN SS LNFG  A V  R  +      L G       ++FY L ++A SVG N+  F 
Sbjct: 218 SESNSSSKLNFGDAAVVSGRGTVSTPLDPLNG-------QVFYFLTLEAFSVGDNRIEFS 277

Query: 181 RSSPFGT---NGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERT 240
            SS  G+   +GNIIIDSGTTLT L  + + +   A+S+V+ L+    P + L  CY+ T
Sbjct: 278 GSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTT 337

Query: 241 TNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLV 300
           +++  +P +TAHFK  DV L   + F+ V   VVC AF+ +       I+GN+A  N LV
Sbjct: 338 SDELDLPVITAHFKGADVELNPISTFVPVEKGVVCFAFISSKIG---AIFGNLAQQNLLV 397

Query: 301 SCNIKKSSIFFKPANC 312
             ++ K ++ FKP +C
Sbjct: 398 GYDLVKKTVSFKPTDC 402

BLAST of CmaCh17G000520 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 238.4 bits (607), Expect = 5.9e-63
Identity = 135/314 (42.99%), Positives = 185/314 (58.92%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSS-DSVCEYSISYGDGSHSNGDIAVDTLTMD 60
           +FDP   STYK VS SS  C+      SCS+ D+ C YS+SYGD S++ G+IAVDTLT+ 
Sbjct: 131 LFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLG 190

Query: 61  STSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI 120
           S+  RP+    I IGCGH+NAG+F+ K SGIVGL  G  SL++Q+G +  GKFSYCL P+
Sbjct: 191 SSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPL 250

Query: 121 -GNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFL 180
               + +S +NFG+NA V    +       K        + FY L +K++SVGS +  + 
Sbjct: 251 TSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQ-----ETFYYLTLKSISVGSKQIQYS 310

Query: 181 RSSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTND 240
            S    + GNIIIDSGTTLT L  + ++    A++  +D +    P   L  CY   T D
Sbjct: 311 GSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYS-ATGD 370

Query: 241 YKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSCN 300
            KVP +T HF   DV L   N F++V +D+VC AF G+    +  IYGN+A  NFLV  +
Sbjct: 371 LKVPVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGSP---SFSIYGNVAQMNFLVGYD 430

Query: 301 IKKSSIFFKPANCA 313
               ++ FKP +CA
Sbjct: 431 TVSKTVSFKPTDCA 435

BLAST of CmaCh17G000520 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 223.0 bits (567), Expect = 2.6e-58
Identity = 127/314 (40.45%), Positives = 181/314 (57.64%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +FDP + STY+ VS SS  C      +  + ++ C Y+I+YGD S++ GD+AVDT+TM S
Sbjct: 127 LFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGS 186

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI- 120
           +  RPV+   + IGCGH+N G+FD   SGI+GL  GS SLV Q+  +  GKFSYCL P  
Sbjct: 187 SGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFT 246

Query: 121 GNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFLR 180
             +  +S +NFG+N  V    +   S   K      D   +Y L ++A+SVGS K  F  
Sbjct: 247 SETGLTSKINFGTNGIVSGDGVVSTSMVKK------DPATYYFLNLEAISVGSKKIQF-T 306

Query: 181 SSPFGT-NGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTND 240
           S+ FGT  GNI+IDSGTTLT L  + +      ++  +  + +  P   L  CY R ++ 
Sbjct: 307 STIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCY-RDSSS 366

Query: 241 YKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSCN 300
           +KVP +T HFK  DV L   N F+ V +DV C AF  N +   + I+GN+A  NFLV  +
Sbjct: 367 FKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANEQ---LTIFGNLAQMNFLVGYD 426

Query: 301 IKKSSIFFKPANCA 313
               ++ FK  +C+
Sbjct: 427 TVSGTVSFKKTDCS 429

BLAST of CmaCh17G000520 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 199.9 bits (507), Expect = 2.3e-51
Identity = 119/326 (36.50%), Positives = 187/326 (57.36%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTC-SITGPGNSCS-SDSVCEYSISYGDGSHSNGDIAVDTLTM 60
           +FD  K STYK+    S  C +++     C  S+++C+Y  SYGD S S GD+A +T+++
Sbjct: 126 IFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSI 185

Query: 61  DSTSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLA- 120
           DS SG PV+FP    GCG++N G+FD   SGI+GL  G  SL+ Q+G +   KFSYCL+ 
Sbjct: 186 DSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSH 245

Query: 121 PIGNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNF 180
               +N +S +N G+N+  +   + K S  +     + +   +Y L ++A+SVG  K  +
Sbjct: 246 KSATTNGTSVINLGTNS--IPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPY 305

Query: 181 LRSS--------PFGTNGNIIIDSGTTLTFLQLDIFASFSQAISE-VMDLKSMTSPIQTL 240
             SS           T+GNIIIDSGTTLT L+   F  FS A+ E V   K ++ P   L
Sbjct: 306 TGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLL 365

Query: 241 EYCYERTTNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNI 300
            +C++  + +  +P +T HF   DV L   N F+++ +D+VCL+ V  +    + IYGN 
Sbjct: 366 SHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTT---EVAIYGNF 425

Query: 301 APTNFLVSCNIKKSSIFFKPANCAAS 315
           A  +FLV  +++  ++ F+  +C+A+
Sbjct: 426 AQMDFLVGYDLETRTVSFQHMDCSAN 446

BLAST of CmaCh17G000520 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 193.4 bits (490), Expect = 2.2e-49
Identity = 115/324 (35.49%), Positives = 181/324 (55.86%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTC-SITGPGNSCS-SDSVCEYSISYGDGSHSNGDIAVDTLTM 60
           +FD  K STYKT S  S TC +++     C  S  +C+Y  SYGD S + GD+A +T+++
Sbjct: 126 LFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISI 185

Query: 61  DSTSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLA- 120
           DS+SG  V+FP    GCG++N G+F+   SGI+GL  G  SLV Q+G + G KFSYCL+ 
Sbjct: 186 DSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSH 245

Query: 121 PIGNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNF 180
               +N +S +N G+N+  +     K S  L     + D + +Y L ++A++VG  K  +
Sbjct: 246 TAATTNGTSVINLGTNS--IPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPY 305

Query: 181 ------LRSSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISE-VMDLKSMTSPIQTLEY 240
                 L        GNIIIDSGTTLT L    +  F  A+ E V   K ++ P   L +
Sbjct: 306 TGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTH 365

Query: 241 CYERTTNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAP 300
           C++    +  +P +T HF + DV L   N F+++ +D VCL+ +  +    + IYGN+  
Sbjct: 366 CFKSGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTT---EVAIYGNMVQ 425

Query: 301 TNFLVSCNIKKSSIFFKPANCAAS 315
            +FLV  +++  ++ F+  +C+ +
Sbjct: 426 MDFLVGYDLETKTVSFQRMDCSGN 444

BLAST of CmaCh17G000520 vs. TAIR10
Match: AT2G28010.1 (AT2G28010.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 147.9 bits (372), Expect = 1.0e-35
Identity = 106/315 (33.65%), Positives = 152/315 (48.25%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +FDPSK ST+K                 C   S C Y + Y D +++ G +A +T+T+ S
Sbjct: 106 IFDPSKSSTFKE--------------KRCDGHS-CPYEVDYFDHTYTMGTLATETITLHS 165

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
           TSG P   P   IGCGH+N+  F    SG+VGL+ G +SL+ QMG    G  SYC +  G
Sbjct: 166 TSGEPFVMPETIIGCGHNNSW-FKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQG 225

Query: 121 NSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFLRS 180
            S     +NFG+NA V    +   +  +           FY L + A+SVG+ +   + +
Sbjct: 226 TSK----INFGANAIVAGDGVVSTTMFMTTAKPG-----FYYLNLDAVSVGNTRIETMGT 285

Query: 181 SPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTNDYK 240
           +     GNI+IDSGTTLT+  +       QA+  V+       P      CY   T D  
Sbjct: 286 TFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDI- 345

Query: 241 VPPVTAHFKDD-DVNLKRENLFIRVVD-DVVCLAFVGNSRENNMQIYGNIAPTNFLVSCN 300
            P +T HF    D+ L + N+++   +  V CLA + NS      I+GN A  NFLV  +
Sbjct: 346 FPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNS-PTQEAIFGNRAQNNFLVGYD 393

Query: 301 IKKSSIFFKPANCAA 314
                + F P NC+A
Sbjct: 406 SSSLLVSFSPTNCSA 393

BLAST of CmaCh17G000520 vs. NCBI nr
Match: gi|700191066|gb|KGN46270.1| (hypothetical protein Csa_6G078650 [Cucumis sativus])

HSP 1 Score: 339.3 bits (869), Expect = 6.9e-90
Identity = 180/316 (56.96%), Positives = 226/316 (71.52%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           MFDPSK +TYK V+ SSP CS +G G+SCS DS C YSI+YGD SHS G++AVDT+TM S
Sbjct: 124 MFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQS 183

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI- 120
           TSGRPVAFPR  IGCGHDNAG+F++ VSGIVGL  G ASLV Q+GPATGGKFSYCL PI 
Sbjct: 184 TSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIG 243

Query: 121 -GNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFL 180
            G++N S+ LNFGSNA      +          ++   YK FY LK++A+SVG  KFNF 
Sbjct: 244 TGSTNDSTKLNFGSNA-----NVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFP 303

Query: 181 R-SSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTN 240
             +S  G   NIIIDSGTTLT+L   +  SF  AIS+ M L     P + L+YC+  TT+
Sbjct: 304 EGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTD 363

Query: 241 DYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSC 300
           DY++PPVT HF+  DV L+RENLF+R+ DD +CLAF G+  ++N+ IYGNIA +NFLV  
Sbjct: 364 DYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAF-GSFPDDNIFIYGNIAQSNFLVGY 423

Query: 301 NIKKSSIFFKPANCAA 314
           +IK  ++ F+PA+C A
Sbjct: 424 DIKNLAVSFQPAHCGA 433

BLAST of CmaCh17G000520 vs. NCBI nr
Match: gi|778722025|ref|XP_004153020.2| (PREDICTED: aspartic proteinase CDR1-like [Cucumis sativus])

HSP 1 Score: 339.3 bits (869), Expect = 6.9e-90
Identity = 180/316 (56.96%), Positives = 226/316 (71.52%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           MFDPSK +TYK V+ SSP CS +G G+SCS DS C YSI+YGD SHS G++AVDT+TM S
Sbjct: 527 MFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQS 586

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPI- 120
           TSGRPVAFPR  IGCGHDNAG+F++ VSGIVGL  G ASLV Q+GPATGGKFSYCL PI 
Sbjct: 587 TSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIG 646

Query: 121 -GNSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHTEGDYKIFYLLKIKAMSVGSNKFNFL 180
            G++N S+ LNFGSNA      +          ++   YK FY LK++A+SVG  KFNF 
Sbjct: 647 TGSTNDSTKLNFGSNA-----NVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFP 706

Query: 181 R-SSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTN 240
             +S  G   NIIIDSGTTLT+L   +  SF  AIS+ M L     P + L+YC+  TT+
Sbjct: 707 EGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTD 766

Query: 241 DYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSC 300
           DY++PPVT HF+  DV L+RENLF+R+ DD +CLAF G+  ++N+ IYGNIA +NFLV  
Sbjct: 767 DYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAF-GSFPDDNIFIYGNIAQSNFLVGY 826

Query: 301 NIKKSSIFFKPANCAA 314
           +IK  ++ F+PA+C A
Sbjct: 827 DIKNLAVSFQPAHCGA 836

BLAST of CmaCh17G000520 vs. NCBI nr
Match: gi|659120454|ref|XP_008460202.1| (PREDICTED: uncharacterized protein LOC103499087 [Cucumis melo])

HSP 1 Score: 338.6 bits (867), Expect = 1.2e-89
Identity = 182/317 (57.41%), Positives = 230/317 (72.56%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           MFDPSK +TYK V  SSP CS +G G+SCS DS C YSI+YGD SHS+G++AVDT+TM S
Sbjct: 541 MFDPSKSATYKNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVTMQS 600

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
           TSGRPVAFPR  IGCGHDNAG+F++ VSGIVGL  G ASLV Q+GPATGGKFSYCL PIG
Sbjct: 601 TSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLMPIG 660

Query: 121 NSNY--SSYLNFGSNAWVVKRLIKKKSWGLKGD-HTEGDYKIFYLLKIKAMSVGSNKFNF 180
           N++   S+ LNFGSNA V        S  +    +T   YK FY LK++A+SVG NKF+F
Sbjct: 661 NASMEDSTKLNFGSNADV------SGSGAVSTPIYTSDQYKTFYSLKLEAVSVGDNKFDF 720

Query: 181 LR-SSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTT 240
              SS  G   NIIIDSGTTLT+L  D+ ++F  AI++ ++L     P Q L+YC+  TT
Sbjct: 721 PEVSSKLGGEANIIIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTT 780

Query: 241 NDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVS 300
           +DY+VP VT HF+  DV L+REN+FIR+ +D +CLAF G   ++N+ IYGNIA +NFLV 
Sbjct: 781 DDYEVPSVTMHFEGADVPLQRENMFIRLSEDTICLAF-GAFSDDNIFIYGNIAQSNFLVG 840

Query: 301 CNIKKSSIFFKPANCAA 314
            +IK  ++ F+PA+C A
Sbjct: 841 YDIKNLAVSFQPADCNA 850

BLAST of CmaCh17G000520 vs. NCBI nr
Match: gi|700191064|gb|KGN46268.1| (hypothetical protein Csa_6G078630 [Cucumis sativus])

HSP 1 Score: 325.9 bits (834), Expect = 7.9e-86
Identity = 180/316 (56.96%), Positives = 222/316 (70.25%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           MF+PSK +TY+ VS SSP CS TG  NSCS    C YSISYGD SHS GD AVDTLTM S
Sbjct: 126 MFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGS 185

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
           TSGR VAFPR AIGCGHDNAGSFD+ VSGIVGL  G ASL++QMG A GGKFSYCL PIG
Sbjct: 186 TSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIG 245

Query: 121 NSN-YSSYLNFGSNAWVVKRLIKKKSWGLKGD-HTEGDYKIFYLLKIKAMSVG-SNKFNF 180
           N +  S+ LNFGSNA V        S  +    +    +K FY LK+KA+SVG +N F  
Sbjct: 246 NDDGGSNKLNFGSNANV------SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYS 305

Query: 181 LRSSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCYERTTN 240
             +S  G   NIIIDSGTTLT L +D++ +F++AIS  ++L+    P Q LEYC+E TT+
Sbjct: 306 TANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTD 365

Query: 241 DYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPTNFLVSC 300
           DYKVP +  HF+  ++ L+REN+ IRV D+V+CLAF G +++N++ IYGNIA  NFLV  
Sbjct: 366 DYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAG-AQDNDISIYGNIAQINFLVGY 425

Query: 301 NIKKSSIFFKPANCAA 314
           ++   S+ FKP NC A
Sbjct: 426 DVTNMSLSFKPMNCVA 434

BLAST of CmaCh17G000520 vs. NCBI nr
Match: gi|629108150|gb|KCW73296.1| (hypothetical protein EUGRSUZ_E01757, partial [Eucalyptus grandis])

HSP 1 Score: 258.5 bits (659), Expect = 1.6e-65
Identity = 139/323 (43.03%), Positives = 194/323 (60.06%), Query Frame = 1

Query: 1   MFDPSKWSTYKTVSSSSPTCSITGPGNSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDS 60
           +FDPSK STYK VS  +  C +    +     S+CEYS +YGD S++ G++A DT T+ S
Sbjct: 100 LFDPSKSSTYKEVSCQTSQCEVVRQTSCGGGGSLCEYSYAYGDQSYTQGNLATDTFTLGS 159

Query: 61  TSGRPVAFPRIAIGCGHDNAGSFDSKVSGIVGLSHGSASLVQQMGPATGGKFSYCLAPIG 120
           TSGRPV+FP++  GCGH N G+FD++V G+ GL  G ASLV Q+G ATGGKFSYCLAP  
Sbjct: 160 TSGRPVSFPKLVFGCGHSNGGTFDNRVDGLFGLGGGDASLVTQLGTATGGKFSYCLAPTS 219

Query: 121 NSNYSSYLNFGSNAWVVKRLIKKKSWGLKGDHT------EGDYKIFYLLKIKAMSVGSNK 180
               +S LNFG+NA            G+ GD        + D K FY L ++ +SVG  K
Sbjct: 220 PDEKTSKLNFGANA------------GVTGDGAVSTPLIQKDPKTFYYLSLEEVSVGETK 279

Query: 181 FNFLR--SSPFGTNGNIIIDSGTTLTFLQLDIFASFSQAISEVMDLKSMTSPIQTLEYCY 240
            +F    SS     GNIIIDSGTTLT L  D+++    A+++ +DL   + P Q L  C+
Sbjct: 280 IDFPSDGSSSSADEGNIIIDSGTTLTLLPQDLYSQIEDAVAKAVDLPKASDPTQLLSLCF 339

Query: 241 E-RTTNDYKVPPVTAHFKDDDVNLKRENLFIRVVDDVVCLAFVGNSRENNMQIYGNIAPT 300
              +     +P VT HFK  DV L   N F++V D ++CL+F    R   + I+GN+A  
Sbjct: 340 RVESDAQLSLPTVTFHFKGADVELSPTNTFVQVADGIICLSF----RPEKVSIFGNLAQI 399

Query: 301 NFLVSCNIKKSSIFFKPANCAAS 315
           N+L+  +I+ S ++FKP +CA++
Sbjct: 400 NYLIGYDIQNSKLYFKPVDCASN 406

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
CDR1_ARATH	1.0e-61	42.99	Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
ASPR1_ARATH	4.1e-50	36.50	Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S...
NEP1_NEPGR	5.6e-31	32.30	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR	2.8e-30	31.15	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
APF2_ARATH	8.9e-29	31.76	Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A0A0K928_CUCSA	4.8e-90	56.96	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078650 PE=3 SV=1
A0A0A0K9V4_CUCSA	5.5e-86	56.96	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078630 PE=3 SV=1
A0A059C519_EUCGR	1.1e-65	43.03	Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_E01757 PE=3 ...
I1M0V7_SOYBN	2.7e-64	46.20	Uncharacterized protein OS=Glycine max GN=GLYMA_13G200600 PE=3 SV=2
A0A0B2RZL7_GLYSO	2.7e-64	46.20	Putative aspartic protease OS=Glycine soja GN=glysoja_007342 PE=3 SV=1

Match Name	E-value	Identity	Description
AT5G33340.1	5.9e-63	42.99	Eukaryotic aspartyl protease family protein
AT1G64830.1	2.6e-58	40.45	Eukaryotic aspartyl protease family protein
AT2G35615.1	2.3e-51	36.50	Eukaryotic aspartyl protease family protein
AT1G31450.1	2.2e-49	35.49	Eukaryotic aspartyl protease family protein
AT2G28010.1	1.0e-35	33.65	Eukaryotic aspartyl protease family protein

Match Name	E-value	Identity	Description
gi|700191066|gb|KGN46270.1|	6.9e-90	56.96	hypothetical protein Csa_6G078650 [Cucumis sativus]
gi|778722025|ref|XP_004153020.2|	6.9e-90	56.96	PREDICTED: aspartic proteinase CDR1-like [Cucumis sativus]
gi|659120454|ref|XP_008460202.1|	1.2e-89	57.41	PREDICTED: uncharacterized protein LOC103499087 [Cucumis melo]
gi|700191064|gb|KGN46268.1|	7.9e-86	56.96	hypothetical protein Csa_6G078630 [Cucumis sativus]
gi|629108150|gb|KCW73296.1|	1.6e-65	43.03	hypothetical protein EUGRSUZ_E01757, partial [Eucalyptus grandis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001461	Aspartic_peptidase_A1
IPR021109	Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0004190	aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh17G000520.1	CmaCh17G000520.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 1..314, score: 2.5E
IPR021109	Aspartic peptidase domain	GENE3D	G3DSA:2.40.70.10		coord: 8..132, score: 3.9E-25; coord: 157..312, score: 4.2
IPR021109	Aspartic peptidase domain	unknown	SSF50630	Acid proteases	coord: 1..311, score: 1.65
None	No IPR available	PANTHER	PTHR13683:SF298	ASPARTIC PROTEINASE CDR1-RELATED	coord: 1..314, score: 2.5E
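
The `coord: start..end` entries in the Alignment column give the position of each domain match on the protein. The Python sketch below extracts them and computes the length of each match. It assumes the coordinates are 1-based and inclusive; the page does not state this, so treat it as an assumption. The helper names `domain_spans` and `span_length` are hypothetical.

```python
import re

# Matches entries such as "coord: 8..132" and captures start and end.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text: str) -> list[tuple[int, int]]:
    """Return every (start, end) pair found in an Alignment field."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(start: int, end: int) -> int:
    """Length of a match, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Alignment field of the GENE3D row (score values omitted here).
gene3d = "coord: 8..132 coord: 157..312"
for start, end in domain_spans(gene3d):
    print(start, end, span_length(start, end))
```

Under the stated assumption, the two GENE3D matches cover 125 and 156 residues respectively.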

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
CmaCh17G000520	CmaCh08G009820	Cucurbita maxima (Rimu)	cmacmaB388