CmaCh17G000530 (gene) Cucurbita maxima (Rimu)

Name: CmaCh17G000530
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Eukaryotic aspartyl protease family protein, putative
Location: Cma_Chr17 : 228580 .. 230218 (+)
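The location field can be split into chromosome, start, end, and strand with a short script. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Hypothetical helper: parses a location string of the form
# "Cma_Chr17 : 228580 .. 230218 (+)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text: str) -> dict:
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr17 : 228580 .. 230218 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the feature spans 1639 bases on the plus strand.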
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCACTCATTTTCTCACTGATTTTGTTTGTCTCCTCCGCCGCTGCCGCTGCCGCAGACGGTGGCTATGGCTTCTCCGTCGAACTTGTCCACCGTGACTTCCCCAAGTTCCCACTTTTCAACGCATCAGAGACACACTACCAACGCATCGCCGACGCTCTCCGTCGCTCCATCAGCCGTGGGACGGTGTCGCTGACAGACACGGGGAGAGCCCCAATATACAACAGCGGAGGCGCATACGTTGTGAAAGTATCCCTCGGAACGCCGCCGTTTTCGATTGTAGCCGTTGCTGACACTGGAAGCGACATCATTTGGACTCAGTGCAAACCTTGCCCGAATTGCTACCAGCAAATCGACCCGATGTTTGATCCGAGTAAATCGTCGACTTACAAGACAGTCCCGTGTTCCTCGCCGACTTGCTCGTTTGCAGGGCCGAGAAGTTCTTGTTCCTCGGATTCCGTGTGCGAGTACTCCATTTCATACGGCGATGGATCCCACAGCAACGGGGATATTGCCGTTGATACCCTTACAATGGACTCCACCTCCGGGCGCCCCGTGGCGTTTCCACGGACTGCGATTGGCTGTGGCCATGACAATGCTGGCTCTTTTGATTCTAAAGTTTCTGGGATTGTCGGGCTCGGTCATGGTTCAGCTTCCCTTATCCAGCAGATGGGGCCGGCCACCGATGGGAAATTCTCTTACTGTTTGGCACCGGTTGGAAACTCTCACGACTCGAGCTATCTTAACTTTGGCTCTAATGCTATCGTCTCTGGCTCTGGAGTCGTCTCGACTCCGATTTATACTAGTGGTAAATACAAAATGTTTTCTTATCGATAAAATGATTTGTGGGTAGGTGAATAACTCAGTAAGGTAAAATTAGGAGTATACAAGTAAGAGATCGTAGAAGTTCATTGTCGTAACAAACAATATTAAAGTCACGCTCTTAACTTAGCCAGATACTGGTCGTGTGCTCTAGAGAAAATGAGTTGGCCTGGATTTAGGGTTTAAAAGTGGATAATACTGGTCGTATGCTCTAGAGAAAAGGAGTCGATTTTGATTTAGGGTTTAAAAGTGCATAATATTATAGTTTTACGTGAGTTTGTTCGTTATACTAAGATAAATTGTATAAGTAATTATTTTTTTTTTCATGTTTGTAGAAGGCGACTACGAAACTTTCTACGTGCTGAATATAGAAGCAATAAGTGTAGGAAGCAACAAATTTGATTTTTCAAGCTCTTCACCATTTGGAACAAACGGGAACATCATTATCGACTCCGGCACGACACTTACATTCTTGCCACCGGACACCTACACAAGCTTCTCCAAGGCGATTTCCGAGGCGATGGACCTGAAGCCCACGACTAGTCCAATTCAAGGCGTGGAGTATTGCTATACGACCACCACCGACGACTATAAGGTGCCACCTGTCACAGTGCATTTCGAAGGCGCCGACGTGTCTCTCAAGCGAGAAAACCTGTTCATTAGGGTGGATAACAACGTCGTTTGCTTGGCATTTATGGACAGTAACGGCGTCGGCCTACAGATCTACGGCAACATTGCACAGACTAACTTCTTAATTGGCTATGATATCAAAAAATTGACCGTTTCTTTCAAGCCACAAAATTGTGCTGCCTCGTAA

mRNA sequence

ATGGCACTCATTTTCTCACTGATTTTGTTTGTCTCCTCCGCCGCTGCCGCTGCCGCAGACGGTGGCTATGGCTTCTCCGTCGAACTTGTCCACCGTGACTTCCCCAAGTTCCCACTTTTCAACGCATCAGAGACACACTACCAACGCATCGCCGACGCTCTCCGTCGCTCCATCAGCCGTGGGACGGTGTCGCTGACAGACACGGGGAGAGCCCCAATATACAACAGCGGAGGCGCATACGTTGTGAAAGTATCCCTCGGAACGCCGCCGTTTTCGATTGTAGCCGTTGCTGACACTGGAAGCGACATCATTTGGACTCAGTGCAAACCTTGCCCGAATTGCTACCAGCAAATCGACCCGATGTTTGATCCGAGTAAATCGTCGACTTACAAGACAGTCCCGTGTTCCTCGCCGACTTGCTCGTTTGCAGGGCCGAGAAGTTCTTGTTCCTCGGATTCCGTGTGCGAGTACTCCATTTCATACGGCGATGGATCCCACAGCAACGGGGATATTGCCGTTGATACCCTTACAATGGACTCCACCTCCGGGCGCCCCGTGGCGTTTCCACGGACTGCGATTGGCTGTGGCCATGACAATGCTGGCTCTTTTGATTCTAAAGTTTCTGGGATTGTCGGGCTCGGTCATGGTTCAGCTTCCCTTATCCAGCAGATGGGGCCGGCCACCGATGGGAAATTCTCTTACTGTTTGGCACCGGTTGGAAACTCTCACGACTCGAGCTATCTTAACTTTGGCTCTAATGCTATCGTCTCTGGCTCTGGAGTCGTCTCGACTCCGATTTATACTAGTGTTTTACAAGGCGACTACGAAACTTTCTACGTGCTGAATATAGAAGCAATAAGTGTAGGAAGCAACAAATTTGATTTTTCAAGCTCTTCACCATTTGGAACAAACGGGAACATCATTATCGACTCCGGCACGACACTTACATTCTTGCCACCGGACACCTACACAAGCTTCTCCAAGGCGATTTCCGAGGCGATGGACCTGAAGCCCACGACTAGTCCAATTCAAGGCGTGGAGTATTGCTATACGACCACCACCGACGACTATAAGGTGCCACCTGTCACAGTGCATTTCGAAGGCGCCGACGTGTCTCTCAAGCGAGAAAACCTGTTCATTAGGGTGGATAACAACGTCGTTTGCTTGGCATTTATGGACAGTAACGGCGTCGGCCTACAGATCTACGGCAACATTGCACAGACTAACTTCTTAATTGGCTATGATATCAAAAAATTGACCGTTTCTTTCAAGCCACAAAATTGTGCTGCCTCGTAA

Coding sequence (CDS)

ATGGCACTCATTTTCTCACTGATTTTGTTTGTCTCCTCCGCCGCTGCCGCTGCCGCAGACGGTGGCTATGGCTTCTCCGTCGAACTTGTCCACCGTGACTTCCCCAAGTTCCCACTTTTCAACGCATCAGAGACACACTACCAACGCATCGCCGACGCTCTCCGTCGCTCCATCAGCCGTGGGACGGTGTCGCTGACAGACACGGGGAGAGCCCCAATATACAACAGCGGAGGCGCATACGTTGTGAAAGTATCCCTCGGAACGCCGCCGTTTTCGATTGTAGCCGTTGCTGACACTGGAAGCGACATCATTTGGACTCAGTGCAAACCTTGCCCGAATTGCTACCAGCAAATCGACCCGATGTTTGATCCGAGTAAATCGTCGACTTACAAGACAGTCCCGTGTTCCTCGCCGACTTGCTCGTTTGCAGGGCCGAGAAGTTCTTGTTCCTCGGATTCCGTGTGCGAGTACTCCATTTCATACGGCGATGGATCCCACAGCAACGGGGATATTGCCGTTGATACCCTTACAATGGACTCCACCTCCGGGCGCCCCGTGGCGTTTCCACGGACTGCGATTGGCTGTGGCCATGACAATGCTGGCTCTTTTGATTCTAAAGTTTCTGGGATTGTCGGGCTCGGTCATGGTTCAGCTTCCCTTATCCAGCAGATGGGGCCGGCCACCGATGGGAAATTCTCTTACTGTTTGGCACCGGTTGGAAACTCTCACGACTCGAGCTATCTTAACTTTGGCTCTAATGCTATCGTCTCTGGCTCTGGAGTCGTCTCGACTCCGATTTATACTAGTGTTTTACAAGGCGACTACGAAACTTTCTACGTGCTGAATATAGAAGCAATAAGTGTAGGAAGCAACAAATTTGATTTTTCAAGCTCTTCACCATTTGGAACAAACGGGAACATCATTATCGACTCCGGCACGACACTTACATTCTTGCCACCGGACACCTACACAAGCTTCTCCAAGGCGATTTCCGAGGCGATGGACCTGAAGCCCACGACTAGTCCAATTCAAGGCGTGGAGTATTGCTATACGACCACCACCGACGACTATAAGGTGCCACCTGTCACAGTGCATTTCGAAGGCGCCGACGTGTCTCTCAAGCGAGAAAACCTGTTCATTAGGGTGGATAACAACGTCGTTTGCTTGGCATTTATGGACAGTAACGGCGTCGGCCTACAGATCTACGGCAACATTGCACAGACTAACTTCTTAATTGGCTATGATATCAAAAAATTGACCGTTTCTTTCAAGCCACAAAATTGTGCTGCCTCGTAA

Protein sequence

MALIFSLILFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGTVSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPVGNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDFSSSSPFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTTDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKLTVSFKPQNCAAS
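The protein sequence corresponds to a translation of the CDS above. As a minimal sketch, assuming the standard genetic code (the record does not name the translation table used), the relationship can be checked in plain Python. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGGCACTCATTTTCTCACTGATT"))
```

The first eight codons yield `MALIFSLI`, matching the start of the listed protein, and the final codons `GCT GCC TCG TAA` yield `AAS` followed by a stop, matching its end.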
BLAST of CmaCh17G000530 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 410.2 bits (1053), Expect = 2.8e-113
Identity = 213/428 (49.77%), Positives = 289/428 (67.52%), Query Frame = 1

Query: 9   LFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGTVSLTDT 68
           LF+S+A A       GF+ +L+HRD PK P +N  ET  QR+ +A+ RS++R     T+ 
Sbjct: 18  LFLSNANAKPK---LGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR-VFHFTEK 77

Query: 69  GRAP-----IYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMFD 128
              P     + ++ G Y++ VS+GTPPF I+A+ADTGSD++WTQC PC +CY Q+DP+FD
Sbjct: 78  DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFD 137

Query: 129 PSKSSTYKTVPCSSPTCSFAGPRSSCSS-DSVCEYSISYGDGSHSNGDIAVDTLTMDSTS 188
           P  SSTYK V CSS  C+    ++SCS+ D+ C YS+SYGD S++ G+IAVDTLT+ S+ 
Sbjct: 138 PKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSD 197

Query: 189 GRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPVGNS 248
            RP+      IGCGH+NAG+F+ K SGIVGLG G  SLI+Q+G + DGKFSYCL P+ + 
Sbjct: 198 TRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSK 257

Query: 249 HD-SSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDFSSSSPF 308
            D +S +NFG+NAIVSGSGVVSTP+   + +   ETFY L +++ISVGS +  +S S   
Sbjct: 258 KDQTSKINFGTNAIVSGSGVVSTPL---IAKASQETFYYLTLKSISVGSKQIQYSGSDSE 317

Query: 309 GTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTTDDYKVPP 368
            + GNIIIDSGTTLT LP + Y+    A++ ++D +    P  G+  CY + T D KVP 
Sbjct: 318 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDLKVPV 377

Query: 369 VTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKLTV 428
           +T+HF+GADV L   N F++V  ++VC AF  S      IYGN+AQ NFL+GYD    TV
Sbjct: 378 ITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGS--PSFSIYGNVAQMNFLVGYDTVSKTV 435

Query: 429 SFKPQNCA 430
           SFKP +CA
Sbjct: 438 SFKPTDCA 435
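The percentages on each HSP summary line are the ratio of matching columns to alignment length, for example 213/428 for identity in the CDR1_ARATH hit. The sketch below recomputes them from the counts. The helper name `hsp_ratios` is hypothetical, and the regular expression accepts any label before the `=` sign, so it does not depend on how the label is spelled.

```python
import re

# Matches fragments such as "Identity = 213/428 (49.77%)".
RATIO_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def hsp_ratios(line: str) -> dict:
    """Return {label: (count, alignment_length, recomputed_percent, reported_percent)}."""
    result = {}
    for label, count, total, reported in RATIO_RE.findall(line):
        count, total = int(count), int(total)
        recomputed = round(100.0 * count / total, 2)
        result[label] = (count, total, recomputed, float(reported))
    return result

line = "Identity = 213/428 (49.77%), Positives = 289/428 (67.52%), Query Frame = 1"
for label, (count, total, recomputed, reported) in hsp_ratios(line).items():
    print(f"{label}: {count}/{total} = {recomputed}% (reported {reported}%)")
```

Applied to the other HSP summary lines, the same check flags any report whose percentage disagrees with its counts.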

BLAST of CmaCh17G000530 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 3.4e-87
Identity = 187/453 (41.28%), Positives = 274/453 (60.49%), Query Frame = 1

Query: 1   MALIFSLILFVSSAAAAAADGG-YGFSVELVHRDFPKFPLFNASETHYQRIADALRRSIS 60
           MA    L  F+  +   ++ G    FSVEL+HRD P  P++N   T   R+  A  RS+S
Sbjct: 1   MATQILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVS 60

Query: 61  RGT-----VSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNC 120
           R       +S TD  ++ +  + G + + +++GTPP  + A+ADTGSD+ W QCKPC  C
Sbjct: 61  RSRRFNHQLSQTDL-QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQC 120

Query: 121 YQQIDPMFDPSKSSTYKTVPCSSPTC-SFAGPRSSCS-SDSVCEYSISYGDGSHSNGDIA 180
           Y++  P+FD  KSSTYK+ PC S  C + +     C  S+++C+Y  SYGD S S GD+A
Sbjct: 121 YKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVA 180

Query: 181 VDTLTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKF 240
            +T+++DS SG PV+FP T  GCG++N G+FD   SGI+GLG G  SLI Q+G +   KF
Sbjct: 181 TETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKF 240

Query: 241 SYCLA-PVGNSHDSSYLNFGSNAIVSG----SGVVSTPIYTSVLQGDYETFYVLNIEAIS 300
           SYCL+     ++ +S +N G+N+I S     SGVVSTP+    +  +  T+Y L +EAIS
Sbjct: 241 SYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPL----VDKEPLTYYYLTLEAIS 300

Query: 301 VGSNKFDFSSSS--------PFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPT 360
           VG  K  ++ SS           T+GNIIIDSGTTLT L    +  FS A+ E++     
Sbjct: 301 VGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKR 360

Query: 361 TSPIQG-VEYCYTTTTDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVG 420
            S  QG + +C+ + + +  +P +TVHF GADV L   N F+++  ++VCL+ + +  V 
Sbjct: 361 VSDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTEVA 420

Query: 421 LQIYGNIAQTNFLIGYDIKKLTVSFKPQNCAAS 432
             IYGN AQ +FL+GYD++  TVSF+  +C+A+
Sbjct: 421 --IYGNFAQMDFLVGYDLETRTVSFQHMDCSAN 446

BLAST of CmaCh17G000530 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 2.9e-62
Identity = 154/423 (36.41%), Positives = 220/423 (52.01%), Query Frame = 1

Query: 24  GFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGTVSLTDTGRAP------IYNSG 83
           GF + L H D  K      + T +Q +  A+ R  SR    L      P      +Y   
Sbjct: 40  GFQIMLEHVDSGK------NLTKFQLLERAIERG-SRRLQRLEAMLNGPSGVETSVYAGD 99

Query: 84  GAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMFDPSKSSTYKTVPCSS 143
           G Y++ +S+GTP     A+ DTGSD+IWTQC+PC  C+ Q  P+F+P  SS++ T+PCSS
Sbjct: 100 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSS 159

Query: 144 PTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDSTSGRPVAFPRTAIGCGH 203
             C      S   S++ C+Y+  YGDGS + G +  +TLT  S     V+ P    GCG 
Sbjct: 160 QLCQALS--SPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNITFGCGE 219

Query: 204 DNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPVGNSHDSSYLNFGS--NAI 263
           +N G      +G+VG+G G  SL  Q+      KFSYC+ P+G+S  S+ L  GS  N++
Sbjct: 220 NNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPSNLL-LGSLANSV 279

Query: 264 VSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFD-----FSSSSPFGTNGNIIID 323
            +GS     P  T +      TFY + +  +SVGS +       F+ +S  GT G IIID
Sbjct: 280 TAGS-----PNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGT-GGIIID 339

Query: 324 SGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTTD--DYKVPPVTVHFEG 383
           SGTTLT+   + Y S  +     ++L        G + C+ T +D  + ++P   +HF+G
Sbjct: 340 SGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDG 399

Query: 384 ADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKLTVSFKPQNC 432
            D+ L  EN FI   N ++CLA M S+  G+ I+GNI Q N L+ YD     VSF    C
Sbjct: 400 GDLELPSENYFISPSNGLICLA-MGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 437

BLAST of CmaCh17G000530 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 4.9e-62
Identity = 144/398 (36.18%), Positives = 222/398 (55.78%), Query Frame = 1

Query: 45  THYQRIADALRRSISR----GTVSLTDTG-RAPIYNSGGAYVVKVSLGTPPFSIVAVADT 104
           T Y+ I  A++R   R      +  + +G   P+Y   G Y++ V++GTP  S  A+ DT
Sbjct: 56  TKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDT 115

Query: 105 GSDIIWTQCKPCPNCYQQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSI 164
           GSD+IWTQC+PC  C+ Q  P+F+P  SS++ T+PC S  C    P  +C+++  C+Y+ 
Sbjct: 116 GSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDL-PSETCNNNE-CQYTY 175

Query: 165 SYGDGSHSNGDIAVDTLTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSAS 224
            YGDGS + G +A +T T +++S      P  A GCG DN G      +G++G+G G  S
Sbjct: 176 GYGDGSTTQGYMATETFTFETSS-----VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLS 235

Query: 225 LIQQMGPATDGKFSYCLAPVGNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFY 284
           L  Q+G    G+FSYC+   G+S  S+ L  GS A     G  ST +  S L     T+Y
Sbjct: 236 LPSQLGV---GQFSYCMTSYGSSSPST-LALGSAASGVPEGSPSTTLIHSSLN---PTYY 295

Query: 285 VLNIEAISVGSNKFDFSSSS----PFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMD 344
            + ++ I+VG +     SS+      GT G +IIDSGTTLT+LP D Y + ++A ++ ++
Sbjct: 296 YITLQGITVGGDNLGIPSSTFQLQDDGT-GGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN 355

Query: 345 LKPTTSPIQGVEYCYTTTTD--DYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMD 404
           L        G+  C+   +D    +VP +++ F+G  ++L  +N+ I     V+CLA   
Sbjct: 356 LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEGVICLAMGS 415

Query: 405 SNGVGLQIYGNIAQTNFLIGYDIKKLTVSFKPQNCAAS 432
           S+ +G+ I+GNI Q    + YD++ L VSF P  C AS
Sbjct: 416 SSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCGAS 438

BLAST of CmaCh17G000530 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 2.9e-54
Identity = 141/358 (39.39%), Positives = 182/358 (50.84%), Query Frame = 1

Query: 78  GAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMFDPSKSSTYKTVPCSS 137
           G Y  ++ +GTP   +  V DTGSDI+W QC PC  CY Q DP+FDP KS TY T+PCSS
Sbjct: 140 GEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSS 199

Query: 138 PTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDSTSGRPVAFPRTAIGCGH 197
           P C         +    C Y +SYGDGS + GD + +TLT      + V     A+GCGH
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGV-----ALGCGH 259

Query: 198 DNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPVGNSHDSSYLNFGSNAIVS 257
           DN G F    +G++GLG G  S   Q G   + KFSYCL     S   S + FG NA VS
Sbjct: 260 DNEGLFVG-AAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFG-NAAVS 319

Query: 258 GSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDFSSSSPFGT----NGNIIIDSGT 317
                 TP+ ++      +TFY + +  ISVG  +    ++S F      NG +IIDSGT
Sbjct: 320 RIARF-TPLLSN---PKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 379

Query: 318 TLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCY-TTTTDDYKVPPVTVHFEGADVS 377
           ++T L    Y +   A                 + C+  +  ++ KVP V +HF GADVS
Sbjct: 380 SVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVS 439

Query: 378 LKRENLFIRVD-NNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKLTVSFKPQNCA 430
           L   N  I VD N   C AF  + G GL I GNI Q  F + YD+    V F P  CA
Sbjct: 440 LPATNYLIPVDTNGKFCFAFAGTMG-GLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmaCh17G000530 vs. TrEMBL
Match: A0A0A0K928_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078650 PE=3 SV=1)

HSP 1 Score: 547.4 bits (1409), Expect = 1.6e-152
Identity = 275/436 (63.07%), Positives = 340/436 (77.98%), Query Frame = 1

Query: 1   MALIFSLILFVSSAA--AAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSI 60
           MA +FSL+  +S+A+  +A     YGF+VEL+HRD PK P++N+SETH+ RI +ALRRS 
Sbjct: 1   MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60

Query: 61  SRGTVSL-TDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQ 120
            R TV L +DT  API+N+GG Y+V++S+GTPPFSIVAVADTGSD+IWTQCKPC NCYQQ
Sbjct: 61  HRNTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQ 120

Query: 121 IDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLT 180
             PMFDPSKS+TYK V CSSP CS++G  SSCS DS C YSI+YGD SHS G++AVDT+T
Sbjct: 121 NAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVT 180

Query: 181 MDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLA 240
           M STSGRPVAFPRT IGCGHDNAG+F++ VSGIVGLG G ASL+ Q+GPAT GKFSYCL 
Sbjct: 181 MQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLI 240

Query: 241 PV--GNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDF 300
           P+  G+++DS+ LNFGSNA VSGSG VSTPIY+S     Y+TFY L +EA+SVG  KF+F
Sbjct: 241 PIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSS---AQYKTFYSLKLEAVSVGDTKFNF 300

Query: 301 -SSSSPFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTT 360
              +S  G   NIIIDSGTTLT+LP     SF  AIS++M L     P + ++YC+ TTT
Sbjct: 301 PEGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTT 360

Query: 361 DDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGY 420
           DDY++PPVT+HFEGADV L+RENLF+R+ ++ +CLAF       + IYGNIAQ+NFL+GY
Sbjct: 361 DDYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGY 420

Query: 421 DIKKLTVSFKPQNCAA 431
           DIK L VSF+P +C A
Sbjct: 421 DIKNLAVSFQPAHCGA 433

BLAST of CmaCh17G000530 vs. TrEMBL
Match: A0A0A0K9V4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078630 PE=3 SV=1)

HSP 1 Score: 525.8 bits (1353), Expect = 5.0e-146
Identity = 272/437 (62.24%), Positives = 332/437 (75.97%), Query Frame = 1

Query: 1   MALIFSLIL----FVSSAAAAAADG-GYGFSVELVHRDFPKFPLFNASETHYQRIADALR 60
           MA IFSL++     +S+A  +AA G  YGF+VEL+HRD PK P++N  E HY R+AD LR
Sbjct: 1   MAPIFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLR 60

Query: 61  RSISRGTVSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCY 120
           RSIS  T  +T+T  APIYN+ G Y++K+S+GTPPF I+AVADTGSDIIWTQC+PC NCY
Sbjct: 61  RSISHNTGLVTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCY 120

Query: 121 QQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDT 180
           QQ  PMF+PSKS+TY+ V CSSP CSF G  +SCS    C YSISYGD SHS GD AVDT
Sbjct: 121 QQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDT 180

Query: 181 LTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYC 240
           LTM STSGR VAFPRTAIGCGHDNAGSFD+ VSGIVGLG G ASLI+QMG A  GKFSYC
Sbjct: 181 LTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYC 240

Query: 241 LAPVGNSH-DSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVG-SNKF 300
           L P+GN    S+ LNFGSNA VSGSG VSTPIY S     +++FY L ++A+SVG +N F
Sbjct: 241 LTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYIS---DKFKSFYSLKLKAVSVGRNNTF 300

Query: 301 DFSSSSPFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTT 360
             +++S  G   NIIIDSGTTLT LP D Y +F+KAIS +++L+ T  P Q +EYC+ TT
Sbjct: 301 YSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETT 360

Query: 361 TDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIG 420
           TDDYKVP + +HFEGA++ L+REN+ IRV +NV+CLAF  +    + IYGNIAQ NFL+G
Sbjct: 361 TDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVG 420

Query: 421 YDIKKLTVSFKPQNCAA 431
           YD+  +++SFKP NC A
Sbjct: 421 YDVTNMSLSFKPMNCVA 434

BLAST of CmaCh17G000530 vs. TrEMBL
Match: I1M0V7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_13G200600 PE=3 SV=2)

HSP 1 Score: 417.9 bits (1073), Expect = 1.5e-113
Identity = 217/420 (51.67%), Positives = 291/420 (69.29%), Query Frame = 1

Query: 18  AADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGT-----VSLTDTGRAP 77
           A DGG  FSVE++HRD  + PL+  +ET +QR+A+A+RRSI+RG         TD+  + 
Sbjct: 26  ANDGG--FSVEMIHRDSSRSPLYRPTETPFQRVANAVRRSINRGNHFKKAFVSTDSAEST 85

Query: 78  IYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMFDPSKSSTYKT 137
           +  S G Y+++ S+G+PPF ++ + DTGSDI+W QC+PC +CY+Q  P+FDPSKS TYKT
Sbjct: 86  VVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKT 145

Query: 138 VPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDSTSGRPVAFPRTA 197
           +PCSS TC      ++CSSD+VCEYSI YGDGSHS+GD++V+TLT+ ST G  V FP+T 
Sbjct: 146 LPCSSNTCESLR-NTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTV 205

Query: 198 IGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPV-GNSHDSSYLNFG 257
           IGCGH+N G+F  + SGIVGLG G  SLI Q+  +  GKFSYCLAP+   S+ SS LNFG
Sbjct: 206 IGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFG 265

Query: 258 SNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDFSSSSPFGT---NGNII 317
             A+VSG G VSTP+    L G  + FY L +EA SVG N+ +FS SS  G+   +GNII
Sbjct: 266 DAAVVSGRGTVSTPL--DPLNG--QVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNII 325

Query: 318 IDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTTDDYKVPPVTVHFEG 377
           IDSGTTLT LP + Y +   A+S+ + L+    P + +  CY TT+D+  +P +T HF+G
Sbjct: 326 IDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDELDLPVITAHFKG 385

Query: 378 ADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKLTVSFKPQNC 429
           ADV L   + F+ V+  VVC AF+ S  +G  I+GN+AQ N L+GYD+ K TVSFKP +C
Sbjct: 386 ADVELNPISTFVPVEKGVVCFAFISSK-IG-AIFGNLAQQNLLVGYDLVKKTVSFKPTDC 436

BLAST of CmaCh17G000530 vs. TrEMBL
Match: I1LVB5_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_12G235400 PE=3 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 4.7e-112
Identity = 223/437 (51.03%), Positives = 292/437 (66.82%), Query Frame = 1

Query: 4   IFSLILFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGT- 63
           I  L L+++ +   A DGG GFSVE++HRD  + P +  +ET +QR+A+ALRRSI+R   
Sbjct: 9   IVLLCLYINISFLNALDGG-GFSVEIIHRDSSRSPYYRPTETQFQRVANALRRSINRANH 68

Query: 64  ------VSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQ 123
                 V+ T+T  + +  S G Y++  S+GTPPF I+ + DTGSDIIW QC+PC +CY 
Sbjct: 69  FNKPNLVASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYN 128

Query: 124 QIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDS-VCEYSISYGDGSHSNGDIAVDT 183
           Q  P+FDPS+S TYKT+PCSS  C      +SCSS++  CEY+I+YGD SHS GD++V+T
Sbjct: 129 QTTPIFDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVET 188

Query: 184 LTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYC 243
           LT+ ST G  V FP+T IGCGH+N G+F  + SGIVGLG G  SLI Q+  +  GKFSYC
Sbjct: 189 LTLGSTDGSSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYC 248

Query: 244 LAPV-GNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFD 303
           LAP+   S+ SS LNFG  A+VSG G VSTPI      G    FY L +EA SVG N+ +
Sbjct: 249 LAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLG----FYFLTLEAFSVGDNRIE 308

Query: 304 FSSSS--PFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCY-T 363
           F SSS    G  GNIIIDSGTTLT LP D Y +   A+++A++L+    P + +  CY T
Sbjct: 309 FGSSSFESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRT 368

Query: 364 TTTDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFL 423
           T++D+  VP +T HF+GADV L   + FI VD  VVC AF  S  +G  I+GN+AQ N L
Sbjct: 369 TSSDELNVPVITAHFKGADVELNPISTFIEVDEGVVCFAFRSSK-IG-PIFGNLAQQNLL 428

Query: 424 IGYDIKKLTVSFKPQNC 429
           +GYD+ K TVSFKP +C
Sbjct: 429 VGYDLVKQTVSFKPTDC 438

BLAST of CmaCh17G000530 vs. TrEMBL
Match: A0A0D2W503_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G112800 PE=3 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 2.3e-111
Identity = 225/441 (51.02%), Positives = 287/441 (65.08%), Query Frame = 1

Query: 1   MALIFSLILFV------SSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADAL 60
           MA I SL  FV      S+ +   A  G GFSVEL+HRD PK PL+N  +T Y R+ +AL
Sbjct: 1   MAAIVSLFAFVFAIVGLSNLSLIQAQKG-GFSVELIHRDSPKSPLYNHLDTTYNRVTNAL 60

Query: 61  RRSISR-----GTVSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCK 120
           RRS +R      T   T    A +    G Y++ +S+GTP F IVA+ADTGSD+IWTQCK
Sbjct: 61  RRSFNRVHRFKPTSVSTMEAEADVIADSGEYLMNISIGTPAFDIVAIADTGSDLIWTQCK 120

Query: 121 PCPNCYQQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNG 180
           PC  C+ Q  P+FDP+ SSTYKT  C +  C      +SCSS+  C+YS+SYGDGS+SNG
Sbjct: 121 PCSQCFPQNAPLFDPTASSTYKTFSCRTSQCGDV-EGTSCSSNGSCQYSVSYGDGSYSNG 180

Query: 181 DIAVDTLTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATD 240
           ++A DTLT+DST+G PV  P   +GCGHDN GSFD   SGI+GLG G +SLI Q+G   D
Sbjct: 181 EVAADTLTLDSTTGSPVVIPNVIMGCGHDNDGSFDENTSGIIGLGGGDSSLISQLGSTID 240

Query: 241 GKFSYCLAPVGNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVG 300
           GKFSYCL P   + +SS +NFGS+AIVSG+GVVSTP+     +   +TFY L +EAISVG
Sbjct: 241 GKFSYCLLPFSEAGNSSKMNFGSDAIVSGNGVVSTPL----TKQSPQTFYFLTLEAISVG 300

Query: 301 SNKFDFSSSSPFGTN-GNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEY 360
           +N+ +F +  PFGT+ GNIIIDSGTTLT LP D Y+    A+S  ++      P +G+  
Sbjct: 301 TNRINF-TDKPFGTDQGNIIIDSGTTLTLLPDDFYSELESAVSSMINATKVNGP-EGLNL 360

Query: 361 CYTTTTDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQT 420
           CY  T  ++ VP +TVHF GADV L+  N F+ +   V C  F  S      IYGN+AQ 
Sbjct: 361 CYDATI-EFAVPDITVHFSGADVKLQPLNTFVLISETVACFTF--SPLPNFAIYGNLAQM 420

Query: 421 NFLIGYDIKKLTVSFKPQNCA 430
           NFL+GYD  K TVSFK  +C+
Sbjct: 421 NFLVGYDTIKQTVSFKSTDCS 430

BLAST of CmaCh17G000530 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 410.2 bits (1053), Expect = 1.6e-114
Identity = 213/428 (49.77%), Positives = 289/428 (67.52%), Query Frame = 1

Query: 9   LFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGTVSLTDT 68
           LF+S+A A       GF+ +L+HRD PK P +N  ET  QR+ +A+ RS++R     T+ 
Sbjct: 18  LFLSNANAKPK---LGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR-VFHFTEK 77

Query: 69  GRAP-----IYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMFD 128
              P     + ++ G Y++ VS+GTPPF I+A+ADTGSD++WTQC PC +CY Q+DP+FD
Sbjct: 78  DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFD 137

Query: 129 PSKSSTYKTVPCSSPTCSFAGPRSSCSS-DSVCEYSISYGDGSHSNGDIAVDTLTMDSTS 188
           P  SSTYK V CSS  C+    ++SCS+ D+ C YS+SYGD S++ G+IAVDTLT+ S+ 
Sbjct: 138 PKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSD 197

Query: 189 GRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPVGNS 248
            RP+      IGCGH+NAG+F+ K SGIVGLG G  SLI+Q+G + DGKFSYCL P+ + 
Sbjct: 198 TRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSK 257

Query: 249 HD-SSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDFSSSSPF 308
            D +S +NFG+NAIVSGSGVVSTP+   + +   ETFY L +++ISVGS +  +S S   
Sbjct: 258 KDQTSKINFGTNAIVSGSGVVSTPL---IAKASQETFYYLTLKSISVGSKQIQYSGSDSE 317

Query: 309 GTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTTDDYKVPP 368
            + GNIIIDSGTTLT LP + Y+    A++ ++D +    P  G+  CY + T D KVP 
Sbjct: 318 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDLKVPV 377

Query: 369 VTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKLTV 428
           +T+HF+GADV L   N F++V  ++VC AF  S      IYGN+AQ NFL+GYD    TV
Sbjct: 378 ITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGS--PSFSIYGNVAQMNFLVGYDTVSKTV 435

Query: 429 SFKPQNCA 430
           SFKP +CA
Sbjct: 438 SFKPTDCA 435

BLAST of CmaCh17G000530 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 368.6 bits (945), Expect = 5.2e-102
Identity = 200/436 (45.87%), Positives = 282/436 (64.68%), Query Frame = 1

Query: 2   ALIFSLILFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRG 61
           +LIF+ +L +   +   A    GF+++L+HRD PK P +N++ET  QR+ +A+RRS +R 
Sbjct: 3   SLIFATLLSLLLLSNVNAYPKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRS-ARS 62

Query: 62  TVSLTDTGRAP------IYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCY 121
           T+  ++   +P      I ++ G Y++ +S+GTPP  I+A+ADTGSD+IWTQC PC +CY
Sbjct: 63  TLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCY 122

Query: 122 QQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDT 181
           QQ  P+FDP +SSTY+ V CSS  C      S  + ++ C Y+I+YGD S++ GD+AVDT
Sbjct: 123 QQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDT 182

Query: 182 LTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYC 241
           +TM S+  RPV+     IGCGH+N G+FD   SGI+GLG GS SL+ Q+  + +GKFSYC
Sbjct: 183 VTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYC 242

Query: 242 LAP-VGNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFD 301
           L P    +  +S +NFG+N IVSG GVVS    TS+++ D  T+Y LN+EAISVGS K  
Sbjct: 243 LVPFTSETGLTSKINFGTNGIVSGDGVVS----TSMVKKDPATYYFLNLEAISVGSKKIQ 302

Query: 302 FSSSSPFGT-NGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTT 361
           F+S+  FGT  GNI+IDSGTTLT LP + Y      ++  +  +    P   +  CY  +
Sbjct: 303 FTSTI-FGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDS 362

Query: 362 TDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIG 421
           +  +KVP +TVHF+G DV L   N F+ V  +V C AF  +    L I+GN+AQ NFL+G
Sbjct: 363 S-SFKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANE--QLTIFGNLAQMNFLVG 422

Query: 422 YDIKKLTVSFKPQNCA 430
           YD    TVSFK  +C+
Sbjct: 423 YDTVSGTVSFKKTDCS 429

BLAST of CmaCh17G000530 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 323.6 bits (828), Expect = 1.9e-88
Identity = 187/453 (41.28%), Positives = 274/453 (60.49%), Query Frame = 1

Query: 1   MALIFSLILFVSSAAAAAADGG-YGFSVELVHRDFPKFPLFNASETHYQRIADALRRSIS 60
           MA    L  F+  +   ++ G    FSVEL+HRD P  P++N   T   R+  A  RS+S
Sbjct: 1   MATQILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVS 60

Query: 61  RGT-----VSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNC 120
           R       +S TD  ++ +  + G + + +++GTPP  + A+ADTGSD+ W QCKPC  C
Sbjct: 61  RSRRFNHQLSQTDL-QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQC 120

Query: 121 YQQIDPMFDPSKSSTYKTVPCSSPTC-SFAGPRSSCS-SDSVCEYSISYGDGSHSNGDIA 180
           Y++  P+FD  KSSTYK+ PC S  C + +     C  S+++C+Y  SYGD S S GD+A
Sbjct: 121 YKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVA 180

Query: 181 VDTLTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKF 240
            +T+++DS SG PV+FP T  GCG++N G+FD   SGI+GLG G  SLI Q+G +   KF
Sbjct: 181 TETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKF 240

Query: 241 SYCLA-PVGNSHDSSYLNFGSNAIVSG----SGVVSTPIYTSVLQGDYETFYVLNIEAIS 300
           SYCL+     ++ +S +N G+N+I S     SGVVSTP+    +  +  T+Y L +EAIS
Sbjct: 241 SYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPL----VDKEPLTYYYLTLEAIS 300

Query: 301 VGSNKFDFSSSS--------PFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPT 360
           VG  K  ++ SS           T+GNIIIDSGTTLT L    +  FS A+ E++     
Sbjct: 301 VGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKR 360

Query: 361 TSPIQG-VEYCYTTTTDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVG 420
            S  QG + +C+ + + +  +P +TVHF GADV L   N F+++  ++VCL+ + +  V 
Sbjct: 361 VSDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTEVA 420

Query: 421 LQIYGNIAQTNFLIGYDIKKLTVSFKPQNCAAS 432
             IYGN AQ +FL+GYD++  TVSF+  +C+A+
Sbjct: 421 --IYGNFAQMDFLVGYDLETRTVSFQHMDCSAN 446

BLAST of CmaCh17G000530 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 322.0 bits (824), Expect = 5.6e-88
Identity = 176/440 (40.00%), Positives = 265/440 (60.23%), Query Frame = 1

Query: 4   IFSLILFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGTV 63
           + ++  F +S ++A  +     +VEL+HRD P  PL+N   T   R+  A  RSISR   
Sbjct: 11  LLAISFFFASNSSANREN---LTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRR 70

Query: 64  SLTDTG-RAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMF 123
             T T  ++ + ++GG Y + +S+GTPP  + A+ADTGSD+ W QCKPC  CY+Q  P+F
Sbjct: 71  FTTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLF 130

Query: 124 DPSKSSTYKTVPCSSPTC-SFAGPRSSCS-SDSVCEYSISYGDGSHSNGDIAVDTLTMDS 183
           D  KSSTYKT  C S TC + +     C  S  +C+Y  SYGD S + GD+A +T+++DS
Sbjct: 131 DKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDS 190

Query: 184 TSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLA-PV 243
           +SG  V+FP T  GCG++N G+F+   SGI+GLG G  SL+ Q+G +   KFSYCL+   
Sbjct: 191 SSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTA 250

Query: 244 GNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDFSSSS 303
             ++ +S +N G+N+I S     S  + T ++Q D ET+Y L +EA++VG  K  ++   
Sbjct: 251 ATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGG 310

Query: 304 PFGTN-------GNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQG-VEYCYT 363
            +G N       GNIIIDSGTTLT L    Y  F  A+ E++      S  QG + +C+ 
Sbjct: 311 -YGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFK 370

Query: 364 TTTDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFL 423
           +   +  +P +T+HF  ADV L   N F++++ + VCL+ + +  V   IYGN+ Q +FL
Sbjct: 371 SGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTEVA--IYGNMVQMDFL 430

Query: 424 IGYDIKKLTVSFKPQNCAAS 432
           +GYD++  TVSF+  +C+ +
Sbjct: 431 VGYDLETKTVSFQRMDCSGN 444

BLAST of CmaCh17G000530 vs. TAIR10
Match: AT2G28010.1 (AT2G28010.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 268.1 bits (684), Expect = 9.5e-72
Identity = 154/381 (40.42%), Positives = 218/381 (57.22%), Query Frame = 1

Query: 55  RRSISRGTVSLTDTGRAPIYNS---GGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPC 114
           RRS +   VS T +G +P  N+      Y++K+ +GTPPF I A+ DTGS+I WTQC PC
Sbjct: 37  RRSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPC 96

Query: 115 PNCYQQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDI 174
            +CY+Q  P+FDPSKSST+K   C   +               C Y + Y D +++ G +
Sbjct: 97  VHCYEQNAPIFDPSKSSTFKEKRCDGHS---------------CPYEVDYFDHTYTMGTL 156

Query: 175 AVDTLTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGK 234
           A +T+T+ STSG P   P T IGCGH+N+  F    SG+VGL  G +SLI QMG    G 
Sbjct: 157 ATETITLHSTSGEPFVMPETIIGCGHNNSW-FKPSFSGMVGLNWGPSSLITQMGGEYPGL 216

Query: 235 FSYCLAPVGNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSN 294
            SYC +  G    +S +NFG+NAIV+G GVVST ++ +  +     FY LN++A+SVG+ 
Sbjct: 217 MSYCFSGQG----TSKINFGANAIVAGDGVVSTTMFMTTAK---PGFYYLNLDAVSVGNT 276

Query: 295 KFDFSSSSPFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYT 354
           + +   ++     GNI+IDSGTTLT+ P        +A+   +       P      CY 
Sbjct: 277 RIETMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN 336

Query: 355 TTTDDYKVPPVTVHFE-GADVSLKRENLFIRVDN-NVVCLAFMDSNGVGLQIYGNIAQTN 414
           + T D   P +T+HF  G D+ L + N+++  +N  V CLA + ++     I+GN AQ N
Sbjct: 337 SDTIDI-FPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNN 393

Query: 415 FLIGYDIKKLTVSFKPQNCAA 431
           FL+GYD   L VSF P NC+A
Sbjct: 397 FLVGYDSSSLLVSFSPTNCSA 393
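
The percentages in an HSP header can be rechecked from the raw counts next to them. The following is a minimal sketch, not part of the database output: the function name `check_hsp_stats` and the rounding tolerance are assumptions made for illustration, and the input line is the statistics line of the AT2G28010.1 hit above.

```python
import re

# Matches each "matches/length (percent%)" group in an HSP statistics line.
# The pattern ignores the label in front of the group, so it does not depend
# on how the "Identity" or "Positives" label is spelled.
RATIO = re.compile(r"(\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def check_hsp_stats(line, tolerance=0.005):
    """Return (matches, length, reported_percent, agrees) for each ratio.

    `agrees` is True when the reported percentage equals the recomputed one
    to within the rounding error of two decimal places (hypothetical check).
    """
    results = []
    for matches, length, reported in RATIO.findall(line):
        computed = 100.0 * int(matches) / int(length)
        agrees = abs(computed - float(reported)) <= tolerance + 1e-9
        results.append((int(matches), int(length), float(reported), agrees))
    return results


line = "Identity = 154/381 (40.42%), Positives = 218/381 (57.22%), Query Frame = 1"
for row in check_hsp_stats(line):
    print(row)
```

The same function can be applied to the statistics line of any other hit on this page, since all of them follow the same layout.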

BLAST of CmaCh17G000530 vs. NCBI nr
Match: gi|659120454|ref|XP_008460202.1| (PREDICTED: uncharacterized protein LOC103499087 [Cucumis melo])

HSP 1 Score: 550.8 bits (1418), Expect = 2.1e-153
Identity = 274/423 (64.78%), Positives = 335/423 (79.20%), Query Frame = 1

Query: 12  SSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGTVSLT-DTGR 71
           +S  +A     YGF+VEL+HRD  K P++N+SETHY RIA+ALRRSI+R    LT DT  
Sbjct: 431 NSVFSATTARDYGFTVELIHRDSTKSPMYNSSETHYDRIANALRRSINRNKAVLTSDTAE 490

Query: 72  APIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMFDPSKSSTY 131
           APIYN+GG Y+V++S+GTPPFSI+AVADTGSD+IWTQC+PC NCYQQ  PMFDPSKS+TY
Sbjct: 491 APIYNNGGEYLVEISIGTPPFSILAVADTGSDVIWTQCEPCSNCYQQSAPMFDPSKSATY 550

Query: 132 KTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDSTSGRPVAFPR 191
           K VPCSSP CS++G  SSCS DS C YSI+YGD SHS+G++AVDT+TM STSGRPVAFPR
Sbjct: 551 KNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVTMQSTSGRPVAFPR 610

Query: 192 TAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPVGNS--HDSSYL 251
           T IGCGHDNAG+F++ VSGIVGLG G ASL+ Q+GPAT GKFSYCL P+GN+   DS+ L
Sbjct: 611 TVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLMPIGNASMEDSTKL 670

Query: 252 NFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDFSS-SSPFGTNGNI 311
           NFGSNA VSGSG VSTPIYTS     Y+TFY L +EA+SVG NKFDF   SS  G   NI
Sbjct: 671 NFGSNADVSGSGAVSTPIYTS---DQYKTFYSLKLEAVSVGDNKFDFPEVSSKLGGEANI 730

Query: 312 IIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTTDDYKVPPVTVHFE 371
           IIDSGTTLT+LP D  ++F  AI+++++L     P Q ++YC++TTTDDY+VP VT+HFE
Sbjct: 731 IIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTTDDYEVPSVTMHFE 790

Query: 372 GADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKLTVSFKPQN 431
           GADV L+REN+FIR+  + +CLAF   +   + IYGNIAQ+NFL+GYDIK L VSF+P +
Sbjct: 791 GADVPLQRENMFIRLSEDTICLAFGAFSDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAD 850

BLAST of CmaCh17G000530 vs. NCBI nr
Match: gi|700191066|gb|KGN46270.1| (hypothetical protein Csa_6G078650 [Cucumis sativus])

HSP 1 Score: 547.4 bits (1409), Expect = 2.3e-152
Identity = 275/436 (63.07%), Positives = 340/436 (77.98%), Query Frame = 1

Query: 1   MALIFSLILFVSSAA--AAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSI 60
           MA +FSL+  +S+A+  +A     YGF+VEL+HRD PK P++N+SETH+ RI +ALRRS 
Sbjct: 1   MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60

Query: 61  SRGTVSL-TDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQ 120
            R TV L +DT  API+N+GG Y+V++S+GTPPFSIVAVADTGSD+IWTQCKPC NCYQQ
Sbjct: 61  HRNTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQ 120

Query: 121 IDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLT 180
             PMFDPSKS+TYK V CSSP CS++G  SSCS DS C YSI+YGD SHS G++AVDT+T
Sbjct: 121 NAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVT 180

Query: 181 MDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLA 240
           M STSGRPVAFPRT IGCGHDNAG+F++ VSGIVGLG G ASL+ Q+GPAT GKFSYCL 
Sbjct: 181 MQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLI 240

Query: 241 PV--GNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDF 300
           P+  G+++DS+ LNFGSNA VSGSG VSTPIY+S     Y+TFY L +EA+SVG  KF+F
Sbjct: 241 PIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSS---AQYKTFYSLKLEAVSVGDTKFNF 300

Query: 301 -SSSSPFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTT 360
              +S  G   NIIIDSGTTLT+LP     SF  AIS++M L     P + ++YC+ TTT
Sbjct: 301 PEGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTT 360

Query: 361 DDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGY 420
           DDY++PPVT+HFEGADV L+RENLF+R+ ++ +CLAF       + IYGNIAQ+NFL+GY
Sbjct: 361 DDYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGY 420

Query: 421 DIKKLTVSFKPQNCAA 431
           DIK L VSF+P +C A
Sbjct: 421 DIKNLAVSFQPAHCGA 433

BLAST of CmaCh17G000530 vs. NCBI nr
Match: gi|778722025|ref|XP_004153020.2| (PREDICTED: aspartic proteinase CDR1-like [Cucumis sativus])

HSP 1 Score: 541.6 bits (1394), Expect = 1.3e-150
Identity = 271/431 (62.88%), Positives = 333/431 (77.26%), Query Frame = 1

Query: 4   IFSLILFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGTV 63
           I  +   V+S  +A     YGF+VEL+HRD PK P++N+SETH+ RI +ALRRS  R TV
Sbjct: 409 IAQINFLVASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHRNTV 468

Query: 64  SL-TDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCYQQIDPMF 123
            L +DT  API+N+GG Y+V++S+GTPPFSIVAVADTGSD+IWTQCKPC NCYQQ  PMF
Sbjct: 469 VLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMF 528

Query: 124 DPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDTLTMDSTS 183
           DPSKS+TYK V CSSP CS++G  SSCS DS C YSI+YGD SHS G++AVDT+TM STS
Sbjct: 529 DPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTS 588

Query: 184 GRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYCLAPV--G 243
           GRPVAFPRT IGCGHDNAG+F++ VSGIVGLG G ASL+ Q+GPAT GKFSYCL P+  G
Sbjct: 589 GRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTG 648

Query: 244 NSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDF-SSSS 303
           +++DS+ LNFGSNA VSGSG VSTPIY+S     Y+TFY L +EA+SVG  KF+F   +S
Sbjct: 649 STNDSTKLNFGSNANVSGSGTVSTPIYSS---AQYKTFYSLKLEAVSVGDTKFNFPEGAS 708

Query: 304 PFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTTTDDYKV 363
             G   NIIIDSGTTLT+LP     SF  AIS++M L     P + ++YC+ TTTDDY++
Sbjct: 709 KLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEM 768

Query: 364 PPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIGYDIKKL 423
           PPVT+HFEGADV L+RENLF+R+ ++ +CLAF       + IYGNIAQ+NFL+GYDIK L
Sbjct: 769 PPVTMHFEGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNL 828

Query: 424 TVSFKPQNCAA 431
            VSF+P +C A
Sbjct: 829 AVSFQPAHCGA 836

BLAST of CmaCh17G000530 vs. NCBI nr
Match: gi|700191064|gb|KGN46268.1| (hypothetical protein Csa_6G078630 [Cucumis sativus])

HSP 1 Score: 525.8 bits (1353), Expect = 7.2e-146
Identity = 272/437 (62.24%), Positives = 332/437 (75.97%), Query Frame = 1

Query: 1   MALIFSLIL----FVSSAAAAAADG-GYGFSVELVHRDFPKFPLFNASETHYQRIADALR 60
           MA IFSL++     +S+A  +AA G  YGF+VEL+HRD PK P++N  E HY R+AD LR
Sbjct: 1   MAPIFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLR 60

Query: 61  RSISRGTVSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCY 120
           RSIS  T  +T+T  APIYN+ G Y++K+S+GTPPF I+AVADTGSDIIWTQC+PC NCY
Sbjct: 61  RSISHNTGLVTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCY 120

Query: 121 QQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDT 180
           QQ  PMF+PSKS+TY+ V CSSP CSF G  +SCS    C YSISYGD SHS GD AVDT
Sbjct: 121 QQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDT 180

Query: 181 LTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYC 240
           LTM STSGR VAFPRTAIGCGHDNAGSFD+ VSGIVGLG G ASLI+QMG A  GKFSYC
Sbjct: 181 LTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYC 240

Query: 241 LAPVGNSH-DSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVG-SNKF 300
           L P+GN    S+ LNFGSNA VSGSG VSTPIY S     +++FY L ++A+SVG +N F
Sbjct: 241 LTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYIS---DKFKSFYSLKLKAVSVGRNNTF 300

Query: 301 DFSSSSPFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTT 360
             +++S  G   NIIIDSGTTLT LP D Y +F+KAIS +++L+ T  P Q +EYC+ TT
Sbjct: 301 YSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETT 360

Query: 361 TDDYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLIG 420
           TDDYKVP + +HFEGA++ L+REN+ IRV +NV+CLAF  +    + IYGNIAQ NFL+G
Sbjct: 361 TDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVG 420

Query: 421 YDIKKLTVSFKPQNCAA 431
           YD+  +++SFKP NC A
Sbjct: 421 YDVTNMSLSFKPMNCVA 434

BLAST of CmaCh17G000530 vs. NCBI nr
Match: gi|702355161|ref|XP_010058581.1| (PREDICTED: aspartic proteinase CDR1-like [Eucalyptus grandis])

HSP 1 Score: 419.1 bits (1076), Expect = 9.5e-114
Identity = 217/439 (49.43%), Positives = 292/439 (66.51%), Query Frame = 1

Query: 3   LIFSLILFVSSAAAAAADGGYGFSVELVHRDFPKFPLFNASETHYQRIADALRRSISRGT 62
           L F+LIL  S    +A+D  YGF+ EL+HRD P+ P +N ++T YQR+A+A+RRSISR  
Sbjct: 11  LSFTLILAFSILCESASD--YGFTTELIHRDSPRSPYYNPADTPYQRLANAIRRSISRAH 70

Query: 63  V-------SLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPCPNCY 122
           +       +  DT  A I  +GG Y++KVSLGTPP   + +ADTGSD+IWTQCKPC +C+
Sbjct: 71  LLSLNSGGATPDTPSAVITAAGGEYIMKVSLGTPPVDFLGIADTGSDLIWTQCKPCTDCF 130

Query: 123 QQIDPMFDPSKSSTYKTVPCSSPTCSFAGPRSSCSSDSVCEYSISYGDGSHSNGDIAVDT 182
           +Q  P+FDPSKSSTYK V C +  C      S     S+CEYS +YGD S++ G++A DT
Sbjct: 131 EQASPLFDPSKSSTYKEVSCQTSQCEVVRQTSCGGGGSLCEYSYAYGDQSYTQGNLATDT 190

Query: 183 LTMDSTSGRPVAFPRTAIGCGHDNAGSFDSKVSGIVGLGHGSASLIQQMGPATDGKFSYC 242
            T+ STSGRPV+FP+   GCGH N G+FD++V G+ GLG G ASL+ Q+G AT GKFSYC
Sbjct: 191 FTLGSTSGRPVSFPKLVFGCGHSNGGTFDNRVDGLFGLGGGDASLVTQLGTATGGKFSYC 250

Query: 243 LAPVGNSHDSSYLNFGSNAIVSGSGVVSTPIYTSVLQGDYETFYVLNIEAISVGSNKFDF 302
           LAP      +S LNFG+NA V+G G VSTP+    +Q D +TFY L++E +SVG  K DF
Sbjct: 251 LAPTSPDEKTSKLNFGANAGVTGDGAVSTPL----IQKDPKTFYYLSLEEVSVGETKIDF 310

Query: 303 SS--SSPFGTNGNIIIDSGTTLTFLPPDTYTSFSKAISEAMDLKPTTSPIQGVEYCYTTT 362
            S  SS     GNIIIDSGTTLT LP D Y+    A+++A+DL   + P Q +  C+   
Sbjct: 311 PSDGSSSSADEGNIIIDSGTTLTLLPQDLYSQIEDAVAKAVDLPKASDPTQLLSLCFRVE 370

Query: 363 TD-DYKVPPVTVHFEGADVSLKRENLFIRVDNNVVCLAFMDSNGVGLQIYGNIAQTNFLI 422
           +D    +P VT HF+GADV L   N F++V + ++CL+F       + I+GN+AQ N+LI
Sbjct: 371 SDAQLSLPTVTFHFKGADVELSPTNTFVQVADGIICLSFRPEK---VSIFGNLAQINYLI 430

Query: 423 GYDIKKLTVSFKPQNCAAS 432
           GYDI+   + FKP +CA++
Sbjct: 431 GYDIQNSKLYFKPVDCASN 440

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
CDR1_ARATH        2.8e-113  49.77         Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
ASPR1_ARATH       3.4e-87   41.28         Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S...
NEP1_NEPGR        2.9e-62   36.41         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR        4.9e-62   36.18         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
APF2_ARATH        2.9e-54   39.39         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0K928_CUCSA  1.6e-152  63.07         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078650 PE=3 SV=1
A0A0A0K9V4_CUCSA  5.0e-146  62.24         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G078630 PE=3 SV=1
I1M0V7_SOYBN      1.5e-113  51.67         Uncharacterized protein OS=Glycine max GN=GLYMA_13G200600 PE=3 SV=2
I1LVB5_SOYBN      4.7e-112  51.03         Uncharacterized protein OS=Glycine max GN=GLYMA_12G235400 PE=3 SV=1
A0A0D2W503_GOSRA  2.3e-111  51.02         Uncharacterized protein OS=Gossypium raimondii GN=B456_013G112800 PE=3 SV=1

Match Name        E-value   Identity (%)  Description
AT5G33340.1       1.6e-114  49.77         Eukaryotic aspartyl protease family protein
AT1G64830.1       5.2e-102  45.87         Eukaryotic aspartyl protease family protein
AT2G35615.1       1.9e-88   41.28         Eukaryotic aspartyl protease family protein
AT1G31450.1       5.6e-88   40.00         Eukaryotic aspartyl protease family protein
AT2G28010.1       9.5e-72   40.42         Eukaryotic aspartyl protease family protein

Match Name                       E-value   Identity (%)  Description
gi|659120454|ref|XP_008460202.1| 2.1e-153  64.78         PREDICTED: uncharacterized protein LOC103499087 [Cucumis melo]
gi|700191066|gb|KGN46270.1|      2.3e-152  63.07         hypothetical protein Csa_6G078650 [Cucumis sativus]
gi|778722025|ref|XP_004153020.2| 1.3e-150  62.88         PREDICTED: aspartic proteinase CDR1-like [Cucumis sativus]
gi|700191064|gb|KGN46268.1|      7.2e-146  62.24         hypothetical protein Csa_6G078630 [Cucumis sativus]
gi|702355161|ref|XP_010058581.1| 9.5e-114  49.43         PREDICTED: aspartic proteinase CDR1-like [Eucalyptus grandis]
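
E-values this small are easier to compare on a logarithmic scale. The sketch below is illustrative only: it ranks the five NCBI nr hits listed above by E-value and prints -log10(E) for each. The dictionary keys are the accessions from those rows, and the helper name `neg_log10` is an assumption, not part of any BLAST tool.

```python
import math

# E-values copied from the NCBI nr rows of the BLAST summary above.
hits = {
    "XP_008460202.1": 2.1e-153,
    "KGN46270.1": 2.3e-152,
    "XP_004153020.2": 1.3e-150,
    "KGN46268.1": 7.2e-146,
    "XP_010058581.1": 9.5e-114,
}


def neg_log10(e_value):
    """Return -log10(E); larger values indicate more significant hits."""
    return -math.log10(e_value)


# Sort from most significant (smallest E-value) to least significant.
ranked = sorted(hits.items(), key=lambda item: item[1])
for accession, e_value in ranked:
    print(f"{accession}\t{e_value:.1e}\t{neg_log10(e_value):.1f}")
```

On this scale the four Cucumis hits sit close together, while the Eucalyptus grandis hit is separated from them by more than 30 orders of magnitude.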
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR001969   Aspartic_peptidase_AS
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh17G000530.1  CmaCh17G000530.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                  Source   Source Term        Source Description                Alignment
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683          ASPARTYL PROTEASES                coord: 2..431, score: 2.7E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141            ASP_PROTEASE                      coord: 95..106, score: -; coord: 307..318, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10   -                                 coord: 266..429, score: 5.6E-39; coord: 76..252, score: 1.4
IPR021109  Aspartic peptidase domain        unknown  SSF50630           Acid proteases                    coord: 74..428, score: 1.67
None       No IPR available                 PANTHER  PTHR13683:SF298    ASPARTIC PROTEINASE CDR1-RELATED  coord: 2..431, score: 2.7E
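
The `coord` values in the annotation above are 1-based, inclusive residue ranges on the protein. As a minimal sketch of how such a range maps onto a sequence, the example below extracts the first PROSITE active-site range (`coord: 95..106`). Assumptions: `QUERY_SEGMENT` is the query row that opens the AT2G28010.1 alignment earlier on this page with its gap characters removed, its displayed start position of 55 is taken at face value, and the helper names are hypothetical.

```python
# Query residues from the first row of the AT2G28010.1 alignment, gaps removed.
# SEGMENT_START is the 1-based protein position of the first residue (assumed
# from the displayed start of that row).
QUERY_SEGMENT = "RRSISRGTVSLTDTGRAPIYNSGGAYVVKVSLGTPPFSIVAVADTGSDIIWTQCKPC"
SEGMENT_START = 55


def parse_coord(text):
    """Parse a 'start..end' string into a pair of integers."""
    start, end = text.split("..")
    return int(start), int(end)


def extract(segment, segment_start, coord):
    """Return the residues covered by a 1-based inclusive coord range."""
    start, end = parse_coord(coord)
    lo = start - segment_start      # 0-based index of the first residue
    hi = end - segment_start + 1    # slice end is exclusive, hence the +1
    if lo < 0 or hi > len(segment) or lo >= hi:
        raise ValueError(f"range {coord} is not covered by the segment")
    return segment[lo:hi]


motif = extract(QUERY_SEGMENT, SEGMENT_START, "95..106")
print(motif)  # AVADTGSDIIWT
```

The extracted window contains the Asp-Thr-Gly triplet typical of aspartic peptidase active sites, which fits the PS00141 assignment. The second range, `307..318`, lies outside this segment, so `extract` raises a `ValueError` for it rather than returning a wrong slice.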

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                   Block
CmaCh17G000530  Cp4.1LG12g00400  Cucurbita pepo (Zucchini)  cmacpeB371
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CmaCh17G000530  Cucumber (Chinese Long) v3    cmacucB0425
CmaCh17G000530  Cucumber (Chinese Long) v3    cmacucB0441
CmaCh17G000530  Watermelon (97103) v2         cmawmbB370
CmaCh17G000530  Watermelon (97103) v2         cmawmbB366
CmaCh17G000530  Wax gourd                     cmawgoB0465
CmaCh17G000530  Cucurbita maxima (Rimu)       cmacmaB249
CmaCh17G000530  Cucurbita maxima (Rimu)       cmacmaB373
CmaCh17G000530  Cucurbita maxima (Rimu)       cmacmaB370
CmaCh17G000530  Cucurbita maxima (Rimu)       cmacmaB388
CmaCh17G000530  Cucumber (Gy14) v1            cgycmaB0715
CmaCh17G000530  Cucumber (Gy14) v1            cgycmaB0998
CmaCh17G000530  Cucurbita moschata (Rifu)     cmacmoB350
CmaCh17G000530  Cucurbita moschata (Rifu)     cmacmoB353
CmaCh17G000530  Cucurbita moschata (Rifu)     cmacmoB362
CmaCh17G000530  Cucurbita moschata (Rifu)     cmacmoB365
CmaCh17G000530  Cucurbita moschata (Rifu)     cmacmoB381
CmaCh17G000530  Wild cucumber (PI 183967)     cmacpiB361
CmaCh17G000530  Wild cucumber (PI 183967)     cmacpiB373
CmaCh17G000530  Cucumber (Chinese Long) v2    cmacuB357
CmaCh17G000530  Cucumber (Chinese Long) v2    cmacuB369
CmaCh17G000530  Melon (DHL92) v3.5.1          cmameB316
CmaCh17G000530  Melon (DHL92) v3.5.1          cmameB343
CmaCh17G000530  Watermelon (Charleston Gray)  cmawcgB319
CmaCh17G000530  Watermelon (Charleston Gray)  cmawcgB323
CmaCh17G000530  Watermelon (97103) v1         cmawmB344
CmaCh17G000530  Watermelon (97103) v1         cmawmB353
CmaCh17G000530  Cucurbita pepo (Zucchini)     cmacpeB382
CmaCh17G000530  Cucurbita pepo (Zucchini)     cmacpeB402
CmaCh17G000530  Cucurbita pepo (Zucchini)     cmacpeB411
CmaCh17G000530  Bottle gourd (USVL1VR-Ls)     cmalsiB334
CmaCh17G000530  Bottle gourd (USVL1VR-Ls)     cmalsiB356
CmaCh17G000530  Cucumber (Gy14) v2            cgybcmaB345
CmaCh17G000530  Cucumber (Gy14) v2            cgybcmaB753
CmaCh17G000530  Melon (DHL92) v3.6.1          cmamedB368
CmaCh17G000530  Melon (DHL92) v3.6.1          cmamedB398
CmaCh17G000530  Silver-seed gourd             carcmaB0314
CmaCh17G000530  Silver-seed gourd             carcmaB0876
CmaCh17G000530  Silver-seed gourd             carcmaB1321
CmaCh17G000530  Silver-seed gourd             carcmaB1487