CmaCh00G002950 (gene) Cucurbita maxima (Rimu)

Name: CmaCh00G002950
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr00 : 22633785 .. 22635474 (-)
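
The location field above packs the sequence ID, the coordinates and the strand into one string. A minimal sketch of how it could be split apart, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly); `parse_location` and its regular expression are illustrative names, not part of the database:

```python
import re

# Hypothetical pattern for "<seqid> : <start> .. <end> (<strand>)".
LOCATION = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into its parts and compute the span length."""
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }


loc = parse_location("Cma_Chr00 : 22633785 .. 22635474 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 1690 bases on the minus strand.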
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATAATTCGACTCCCAAAAAAATTAAAAGTAAAAAATAAAACTTTTATAAAGTCCACTTTAATTCTTTATACTTCTTTTATTTATACTCTCACACCTGTGCCGAACTCGTACCTCCCGCACTCTCAAAAATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGAGTTCACTCAGTGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTTCGGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCGACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTGCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCGGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

mRNA sequence

ATGATAATTCGACTCCCAAAAAAATTAAAAGTAAAAAATAAAACTTTTATAAAGTCCACTTTAATTCTTTATACTTCTTTTATTTATACTCTCACACCTGTGCCGAACTCGTACCTCCCGCACTCTCAAAAATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGATGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTTCGGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCGACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTGCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCGGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

Coding sequence (CDS)

ATGATAATTCGACTCCCAAAAAAATTAAAAGTAAAAAATAAAACTTTTATAAAGTCCACTTTAATTCTTTATACTTCTTTTATTTATACTCTCACACCTGTGCCGAACTCGTACCTCCCGCACTCTCAAAAATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGATGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTTCGGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCGACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTGCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCGGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

Protein sequence

MIIRLPKKLKVKNKTFIKSTLILYTSFIYTLTPVPNSYLPHSQKWRQKPVHYPLSAFFSLFSLSPPPSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA
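
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below shows how such a translation could be reproduced, assuming the standard genetic code (the record does not state which translation table was used); `translate` and `CODON_TABLE` are illustrative names:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGATAATTCGACTCCCA"))
```

The first six codons give `MIIRLP`, which agrees with the start of the listed protein sequence; passing the full CDS string to `translate` would be the natural next check.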
BLAST of CmaCh00G002950 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 622.1 bits (1603), Expect = 6.0e-177
Identity = 332/495 (67.07%), Positives = 380/495 (76.77%), Query Frame = 1

Query: 79  FRLHLHSLSTAFSDFQTLIPKP--LP-ASPSLFSPESDTDSESFISSE----------AG 138
           F L L S S+  S FQTL P    LP ASP  F P  D+DSES + SE          + 
Sbjct: 15  FFLSLPSFSSLPS-FQTLFPNSHSLPCASPVSFQP--DSDSESLLESEFESGSDSESSSS 74

Query: 139 LELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGG 198
           + L L H+DALS N+TP+ELF  RLQRD+     S       +P      R+ +H    G
Sbjct: 75  ITLNLDHIDALSSNKTPDELFSSRLQRDSRRVK-SIATLAAQIPG-----RNVTHAPRPG 134

Query: 199 SQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQC 258
                          GFSSSV+SGL+QGSGEYFTR+GVGTP +Y+YMVLDTGSDIVWLQC
Sbjct: 135 ---------------GFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC 194

Query: 259 APCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTT 318
           APC+ CYSQ+DP+F+P KS++Y+ + C +P C RL+S GCN +++TCLYQVSYGDGS+T 
Sbjct: 195 APCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTV 254

Query: 319 GEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSY 378
           G+F TETLTFRR +V+ VALGCGHDNEGLFVGAAGLLGLG+G LSFP Q G  FNQKFSY
Sbjct: 255 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 314

Query: 379 CLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISAS 438
           CLVDRSASSKPSSVVFG++AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS
Sbjct: 315 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 374

Query: 439 HFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGK 498
            FKLD  GNGGVIID GTSVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS  
Sbjct: 375 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNM 434

Query: 499 TTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVY 558
             VKVPTVVLHFRGADVSLPA+NYLIPVD NG+FCFAFAGT  GLSIIGNIQQQGFRVVY
Sbjct: 435 NEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVY 485

Query: 559 DLAGSRVGFSPRGCA 560
           DLA SRVGF+P GCA
Sbjct: 495 DLASSRVGFAPGGCA 485
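
Each HSP summary line reports identity and positives as a count over the alignment length, followed by a percentage. A small sketch of how those figures could be parsed and recomputed as a sanity check; `parse_hsp_stats` and `percent` are hypothetical helper names, and the pattern only covers the line layout used in this record:

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)"; the optional "i"
# tolerates a missing letter in "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (count, alignment length, reported percent) for each statistic."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    """Percentage rounded to two decimals, as printed in the record."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 332/495 (67.07%), Positives = 380/495 (76.77%), Query Frame = 1"
)
for name, (count, length, reported) in stats.items():
    print(name, percent(count, length), reported)
```

For the APF2_ARATH hit the recomputed values (67.07 and 76.77) match the reported percentages.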

BLAST of CmaCh00G002950 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 7.7e-108
Identity = 236/503 (46.92%), Positives = 304/503 (60.44%), Query Frame = 1

Query: 62  SLSPPPSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIP-KPLPASPSLFSPESDTDSESFI 121
           SLS PP            + ++  + ++    QT++   P  +S +   PES +D   F 
Sbjct: 28  SLSTPP------------KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPV-FF 87

Query: 122 SSEAGLELQLHHLDAL--SLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQ 181
           +S + L L+LH  D    S ++  + L   RL+RD      S + AG       ++ R  
Sbjct: 88  NSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERD------SSRVAGIV-----AKIRFA 147

Query: 182 SHETDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGS 241
               D           T + T   ++ V+SG +QGSGEYF+RIGVGTP K +Y+VLDTGS
Sbjct: 148 VEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGS 207

Query: 242 DIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYG 301
           D+ W+QC PC +CY Q+DPVFNP  S +Y  + C  P C  LE+  C   + CLYQVSYG
Sbjct: 208 DVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNK-CLYQVSYG 267

Query: 302 DGSYTTGEFVTETLTFRRT-KVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRS 361
           DGS+T GE  T+T+TF  + K+  VALGCGHDNEGLF GAAGLLGLG G LS  +Q   +
Sbjct: 268 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 327

Query: 362 FNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTP 421
               FSYCLVDR  S K SS+ F    +       PLL N ++DTFYYV L G SVGG  
Sbjct: 328 ---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEK 387

Query: 422 VFGISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKL-APEFSLFD 481
           V  +  + F +D++G+GGVI+DCGT+VTRL   AY +LRDAF     +LK  +   SLFD
Sbjct: 388 VV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFD 447

Query: 482 TCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNI 541
           TCYD S  +TVKVPTV  HF G   + LPA NYLIPVDD+G FCFAFA T+S LSIIGN+
Sbjct: 448 TCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNV 500

Query: 542 QQQGFRVVYDLAGSRVGFSPRGC 559
           QQQG R+ YDL+ + +G S   C
Sbjct: 508 QQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh00G002950 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 387.1 bits (993), Expect = 3.2e-106
Identity = 220/493 (44.62%), Positives = 292/493 (59.23%), Query Frame = 1

Query: 73  PLYPNLFRLHLH---SLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQ 132
           PL+     LHLH   S S +F DFQ +     P + +   P+ +    S  SS     L+
Sbjct: 4   PLFFFFLHLHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPDFNNTHFSDESSSK-YTLR 63

Query: 133 LHHLDALS--LNRTPEELFHLRLQRDALGCS-VSEQNAGGALPSPPSERRSQSHETDGGS 192
           L H D       R      H R++RD    S +  + +G  +PS                
Sbjct: 64  LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS---------------- 123

Query: 193 QNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCA 252
                 S + +    F S ++SG+ QGSGEYF RIGVG+PP+  YMV+D+GSD+VW+QC 
Sbjct: 124 ------SDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ 183

Query: 253 PCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGE 312
           PCK CY Q+DPVF+P KS SY+ V C + +C R+E+ GC+    C Y+V YGDGSYT G 
Sbjct: 184 PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCH-SGGCRYEVMYGDGSYTKGT 243

Query: 313 FVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCL 372
              ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G +SF  Q        F YCL
Sbjct: 244 LALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCL 303

Query: 373 VDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHF 432
           V R   S   S+VFG  A+   A + PL+ NPR  +FYYV L G+ VGG  +  +    F
Sbjct: 304 VSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI-PLPDGVF 363

Query: 433 KLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTT 492
            L   G+GGV++D GT+VTRL   AY+A RD F++  ++L  A   S+FDTCYDLSG  +
Sbjct: 364 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 423

Query: 493 VKVPTVVLHF-RGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYD 552
           V+VPTV  +F  G  ++LPA N+L+PVDD+G +CFAFA + +GLSIIGNIQQ+G +V +D
Sbjct: 424 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 470

Query: 553 LAGSRVGFSPRGC 559
            A   VGF P  C
Sbjct: 484 GANGFVGFGPNVC 470

BLAST of CmaCh00G002950 vs. Swiss-Prot
Match: ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 1.2e-71
Identity = 156/381 (40.94%), Positives = 212/381 (55.64%), Query Frame = 1

Query: 189 LSQASGTSHGTTGFSSSVIS--GLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCA 248
           LS+   T H +   S+ + +  G   GSG Y   +G+GTP   + ++ DTGSD+ W QC 
Sbjct: 102 LSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQ 161

Query: 249 PC-KNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESP----GCNQKQTCLYQVSYGDGS 308
           PC + CY Q +P+FNP KS SY  V C +  C  L S     G      C+Y + YGD S
Sbjct: 162 PCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQS 221

Query: 309 YTTGEFVTETLTFRRTKV-ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQ 368
           ++ G    E  T   + V + V  GCG +N+GLF G AGLLGLGR  LSFPSQ   ++N+
Sbjct: 222 FSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNK 281

Query: 369 KFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFG 428
            FSYCL   S++S    + FG + +SR+ +FTP+ T     +FY + ++ I+VGG  +  
Sbjct: 282 IFSYCLP--SSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKL-P 341

Query: 429 ISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYD 488
           I ++ F        G +ID GT +TRL   AY ALR +F+A  S        S+ DTC+D
Sbjct: 342 IPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFD 401

Query: 489 LSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTT--SGLSIIGNIQQQ 548
           LSG  TV +P V   F G  V    S  +  V    + C AFAG +  S  +I GN+QQQ
Sbjct: 402 LSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQ 461

Query: 549 GFRVVYDLAGSRVGFSPRGCA 560
              VVYD AG RVGF+P GC+
Sbjct: 462 TLEVVYDGAGGRVGFAPNGCS 474

BLAST of CmaCh00G002950 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 2.0e-71
Identity = 150/354 (42.37%), Positives = 201/354 (56.78%), Query Frame = 1

Query: 208 SGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCK-NCYSQTDPVFNPVKSRS 267
           SG+  GSG Y   IG+GTP   + +V DTGSD+ W QC PC  +CYSQ +P FNP  S +
Sbjct: 123 SGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSST 182

Query: 268 YSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV-ERVALG 327
           Y  V C +P+C   ES   +    C+Y + YGD S+T G    E  T   + V E V  G
Sbjct: 183 YQNVSCSSPMCEDAESCSASN---CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFG 242

Query: 328 CGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAV 387
           CG +N+GLF G AGLLGLG G LS P+Q   ++N  FSYCL   +++S    + FG + +
Sbjct: 243 CGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS-TGHLTFGSAGI 302

Query: 388 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVT 447
           S + +FTP+ + P     Y ++++GISVG   +  I+ + F  +     G IID GT  T
Sbjct: 303 SESVKFTPISSFPSAFN-YGIDIIGISVGDKEL-AITPNSFSTE-----GAIIDSGTVFT 362

Query: 448 RLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD-VSLP 507
           RL    Y  LR  F+   SS K    + LFDTCYD +G  TV  PT+   F G+  V L 
Sbjct: 363 RLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELD 422

Query: 508 ASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 559
            S   +P+  + + C AFAG     +I GN+QQ    VVYD+AG RVGF+P GC
Sbjct: 423 GSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of CmaCh00G002950 vs. TrEMBL
Match: A0A0A0L8K0_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_3G119540 PE=3 SV=1)

HSP 1 Score: 780.0 bits (2013), Expect = 1.9e-222
Identity = 410/498 (82.33%), Positives = 432/498 (86.75%), Query Frame = 1

Query: 67  PSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA-- 126
           P+  S P     F L + SL+TAFSDFQTL    LP+SPS F P   +DS SF+SSEA  
Sbjct: 44  PNTISLPFI--FFLLTVLSLATAFSDFQTLPLTSLPSSPS-FLP---SDSNSFLSSEATQ 103

Query: 127 ---GLELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHE 186
              GLEL LHHLDALS NRTPEELFHLRLQRDA+   V + ++ GA              
Sbjct: 104 SELGLELHLHHLDALSFNRTPEELFHLRLQRDAI--RVKKLSSLGAT------------- 163

Query: 187 TDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIV 246
               S+NLS+  G    TTGFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIV
Sbjct: 164 ----SRNLSKPGG----TTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIV 223

Query: 247 WLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGS 306
           WLQCAPCKNCYSQTDPVFNPVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGS
Sbjct: 224 WLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGS 283

Query: 307 YTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQK 366
           YTTGEFVTETLTFRRTKVE+VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQK
Sbjct: 284 YTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQK 343

Query: 367 FSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGI 426
           FSYCLVDRSASSKPSSVVFG+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPV GI
Sbjct: 344 FSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI 403

Query: 427 SASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDL 486
           +ASHFKLD  GNGGVIIDCGTSVTRLN+PAYIALRDAFRAGASSLK APEFSLFDTCYDL
Sbjct: 404 TASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDL 463

Query: 487 SGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFR 546
           SGKTTVKVPTVVLHFRGADVSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFR
Sbjct: 464 SGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFR 512

Query: 547 VVYDLAGSRVGFSPRGCA 560
           VVYDLA SRVGFSPRGCA
Sbjct: 524 VVYDLASSRVGFSPRGCA 512

BLAST of CmaCh00G002950 vs. TrEMBL
Match: B9SBG8_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0717990 PE=3 SV=1)

HSP 1 Score: 667.5 bits (1721), Expect = 1.4e-188
Identity = 343/469 (73.13%), Positives = 383/469 (81.66%), Query Frame = 1

Query: 92  DFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQLHHLDALSLNRTPEELFHLRLQ 151
           ++QTL+  PL + P+L   +S++ +++  SS A   +QLHH+DALS N TPE LF  RLQ
Sbjct: 27  NYQTLVANPLRSQPTLSWTDSESPTDTAESS-ATFSVQLHHVDALSFNSTPETLFTTRLQ 86

Query: 152 RDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGTTGFSSSVISGLA 211
           RDA                   E  S   ET G  + +          TGFSSSVISGLA
Sbjct: 87  RDAARV----------------EAISYLAETAGTGKRVG---------TGFSSSVISGLA 146

Query: 212 QGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVL 271
           QGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPCK CY+Q+DPVF+P KSRS++ + 
Sbjct: 147 QGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIA 206

Query: 272 CRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDN 331
           CR+PLC RL+SPGCN QKQTC+YQVSYGDGS+T G+F TETLTFRRT+V RVALGCGHDN
Sbjct: 207 CRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHDN 266

Query: 332 EGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTAR 391
           EGLFVGAAGLLGLGRG LSFPSQ GR FN KFSYCLVDRSASSKPSS+VFGDSAVSRTAR
Sbjct: 267 EGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTAR 326

Query: 392 FTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVTRLNRP 451
           FTPL++NP+LDTFYYVELLGISVGGT V GI+AS FKLD  GNGGVIID GTSVTRL RP
Sbjct: 327 FTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTRP 386

Query: 452 AYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLI 511
           AYIA RDAFRAGAS+LK AP+FSLFDTC+DLSGKT VKVPTVVLHFRGADVSLPASNYLI
Sbjct: 387 AYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLI 446

Query: 512 PVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 560
           PVD +G FC AFAGT  GLSIIGNIQQQGFRVVYDLAGSRVGF+P GCA
Sbjct: 447 PVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469

BLAST of CmaCh00G002950 vs. TrEMBL
Match: A0A067JX70_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14253 PE=3 SV=1)

HSP 1 Score: 667.2 bits (1720), Expect = 1.8e-188
Identity = 343/483 (71.01%), Positives = 386/483 (79.92%), Query Frame = 1

Query: 85  SLSTAFS---DFQTLIPKPLPASPSLFSPESDTDSESF-----ISSEAGLELQLHHLDAL 144
           SLST  S   D+QTL+  PLP   +L  P +D+++E+       +      LQLHH+DAL
Sbjct: 19  SLSTTLSSPLDYQTLVLNPLPRQTALSWPAADSEAETLQTLTDTADSTTFSLQLHHIDAL 78

Query: 145 SLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTS 204
           S N+TP++LF  RLQRDA         A  A+ +             GG           
Sbjct: 79  SNNKTPQDLFGERLQRDAFRVEALSSVAASAVGA-------------GGRVG-------- 138

Query: 205 HGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTD 264
              TGFSSSVISGLAQGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPC  CYSQ+D
Sbjct: 139 ---TGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCNKCYSQSD 198

Query: 265 PVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFR 324
           PVF+P KSRS++ + C +PLC RL+SPGCN QK+TC+YQVSYGDGS+T G+F TETLTFR
Sbjct: 199 PVFDPRKSRSFAGIPCGSPLCNRLDSPGCNTQKRTCMYQVSYGDGSFTYGDFSTETLTFR 258

Query: 325 RTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKP 384
           RTKV RVA+GCGHDN+GLFVGAAGLLGLGRG LSFPSQ G  FN+KFSYCLVDRSASSKP
Sbjct: 259 RTKVRRVAIGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGAQFNRKFSYCLVDRSASSKP 318

Query: 385 SSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGG 444
           SSVVFGDSA+SRTARFTPL++NP+LDTFYYVELLGISVGGT V GI+AS FKLD  GNGG
Sbjct: 319 SSVVFGDSAISRTARFTPLISNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGG 378

Query: 445 VIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLH 504
           VIID GTSVTRL RPAY+ALR+AFR GAS+LK APEFSLFDTC+DLSGKT VKVPTV LH
Sbjct: 379 VIIDSGTSVTRLTRPAYVALRNAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVALH 438

Query: 505 FRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSP 559
           FRGADVSLPASNYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLAGSRVGF+P
Sbjct: 439 FRGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRVGFAP 477

BLAST of CmaCh00G002950 vs. TrEMBL
Match: V4TCM2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019938mg PE=3 SV=1)

HSP 1 Score: 666.4 bits (1718), Expect = 3.1e-188
Identity = 341/480 (71.04%), Positives = 385/480 (80.21%), Query Frame = 1

Query: 87  STAFSDFQTLIPKPLPASPSLFSPESDTDSESFIS-------SEAGLELQLHHLDALSLN 146
           + A   +QT +   LP   +L  PES + SES  S       +E+ L L+LHH+D+LS N
Sbjct: 19  AAASLQYQTFVLNSLPTQSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFN 78

Query: 147 RTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGT 206
           RTPE LF+LR+QRD L        A  A+  PP  R                + G ++G 
Sbjct: 79  RTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNR----------------SRGRANG- 138

Query: 207 TGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVF 266
            GFSSSVISGLAQGSGEYFTR+GVGTPP+Y+YMVLDTGSD+VW+QCAPCK CYSQTDPVF
Sbjct: 139 -GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 198

Query: 267 NPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV 326
           +P KSRS++ V CR+PLC +L+S GCN++ TCLYQVSYGDGS T G+F TETLTFR T+V
Sbjct: 199 DPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV 258

Query: 327 ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVV 386
            RVALGCGHDNEGLFV AAGLLGLGRG LSFP+Q GR FN+KFSYCLVDRS S+KPSS+V
Sbjct: 259 ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 318

Query: 387 FGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIID 446
           FGDSAVSRTARFTPLL NP+LDTFYYVEL+GISVGG  V GI+AS FKLD  GNGGVIID
Sbjct: 319 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 378

Query: 447 CGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 506
            GTSVTRL RPAYIALRDAFRAGASSLK AP+FSLFDTC+DLSGKT VKVPTVVLHFRGA
Sbjct: 379 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 438

Query: 507 DVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 560
           DVSLPA+NYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLA SR+GF+PRGCA
Sbjct: 439 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480

BLAST of CmaCh00G002950 vs. TrEMBL
Match: A0A067HET1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g040810mg PE=3 SV=1)

HSP 1 Score: 666.0 bits (1717), Expect = 4.0e-188
Identity = 341/480 (71.04%), Positives = 385/480 (80.21%), Query Frame = 1

Query: 87  STAFSDFQTLIPKPLPASPSLFSPESDTDSESFIS-------SEAGLELQLHHLDALSLN 146
           + A   +QT +   LP   +L  PES + SES  S       +E+ L L+LHH+D+LS N
Sbjct: 19  AAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFN 78

Query: 147 RTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGT 206
           RTPE LF+LR+QRD L        A  A+  PP  R                + G ++G 
Sbjct: 79  RTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNR----------------SRGRANG- 138

Query: 207 TGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVF 266
            GFSSSVISGLAQGSGEYFTR+GVGTPP+Y+YMVLDTGSD+VW+QCAPCK CYSQTDPVF
Sbjct: 139 -GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 198

Query: 267 NPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV 326
           +P KSRS++ V CR+PLC +L+S GCN++ TCLYQVSYGDGS T G+F TETLTFR T+V
Sbjct: 199 DPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV 258

Query: 327 ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVV 386
            RVALGCGHDNEGLFV AAGLLGLGRG LSFP+Q GR FN+KFSYCLVDRS S+KPSS+V
Sbjct: 259 ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 318

Query: 387 FGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIID 446
           FGDSAVSRTARFTPLL NP+LDTFYYVEL+GISVGG  V GI+AS FKLD  GNGGVIID
Sbjct: 319 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 378

Query: 447 CGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 506
            GTSVTRL RPAYIALRDAFRAGASSLK AP+FSLFDTC+DLSGKT VKVPTVVLHFRGA
Sbjct: 379 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 438

Query: 507 DVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 560
           DVSLPA+NYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLA SR+GF+PRGCA
Sbjct: 439 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480

BLAST of CmaCh00G002950 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 622.1 bits (1603), Expect = 3.4e-178
Identity = 332/495 (67.07%), Positives = 380/495 (76.77%), Query Frame = 1

Query: 79  FRLHLHSLSTAFSDFQTLIPKP--LP-ASPSLFSPESDTDSESFISSE----------AG 138
           F L L S S+  S FQTL P    LP ASP  F P  D+DSES + SE          + 
Sbjct: 15  FFLSLPSFSSLPS-FQTLFPNSHSLPCASPVSFQP--DSDSESLLESEFESGSDSESSSS 74

Query: 139 LELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGG 198
           + L L H+DALS N+TP+ELF  RLQRD+     S       +P      R+ +H    G
Sbjct: 75  ITLNLDHIDALSSNKTPDELFSSRLQRDSRRVK-SIATLAAQIPG-----RNVTHAPRPG 134

Query: 199 SQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQC 258
                          GFSSSV+SGL+QGSGEYFTR+GVGTP +Y+YMVLDTGSDIVWLQC
Sbjct: 135 ---------------GFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC 194

Query: 259 APCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTT 318
           APC+ CYSQ+DP+F+P KS++Y+ + C +P C RL+S GCN +++TCLYQVSYGDGS+T 
Sbjct: 195 APCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTV 254

Query: 319 GEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSY 378
           G+F TETLTFRR +V+ VALGCGHDNEGLFVGAAGLLGLG+G LSFP Q G  FNQKFSY
Sbjct: 255 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 314

Query: 379 CLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISAS 438
           CLVDRSASSKPSSVVFG++AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS
Sbjct: 315 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 374

Query: 439 HFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGK 498
            FKLD  GNGGVIID GTSVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS  
Sbjct: 375 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNM 434

Query: 499 TTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVY 558
             VKVPTVVLHFRGADVSLPA+NYLIPVD NG+FCFAFAGT  GLSIIGNIQQQGFRVVY
Sbjct: 435 NEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVY 485

Query: 559 DLAGSRVGFSPRGCA 560
           DLA SRVGF+P GCA
Sbjct: 495 DLASSRVGFAPGGCA 485

BLAST of CmaCh00G002950 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 559.3 bits (1440), Expect = 2.7e-159
Identity = 294/482 (61.00%), Positives = 353/482 (73.24%), Query Frame = 1

Query: 87  STAFSDFQTLIPKPLPASPSLFSPESDT-DSESFISSEAGLELQLHHLDALSL--NRTPE 146
           S+A S +QTL+   LP+S +L  PES++   ES   S   L + L H+DALS   + +P 
Sbjct: 21  SSASSQYQTLVVNTLPSSATLSWPESESLTDESLSESTTSLSVHLSHVDALSSFSDASPA 80

Query: 147 ELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGTTGFS 206
           +LF+LRLQRD+L        A  +     ++R                   T     GFS
Sbjct: 81  DLFNLRLQRDSLRVKSITSLAAVSTGRNATKR-------------------TPRTAGGFS 140

Query: 207 SSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVK 266
            +VISGL+QGSGEYF R+GVGTP   +YMVLDTGSD+VWLQC+PCK CY+QTD +F+P K
Sbjct: 141 GAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKK 200

Query: 267 SRSYSKVLCRTPLCLRLE-SPGC--NQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 326
           S++++ V C + LC RL+ S  C   + +TCLYQVSYGDGS+T G+F TETLTF   +V+
Sbjct: 201 SKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVD 260

Query: 327 RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDR----SASSKPS 386
            V LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQ    +N KFSYCLVDR    S+S  PS
Sbjct: 261 HVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 320

Query: 387 SVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGV 446
           ++VFG++AV +T+ FTPLLTNP+LDTFYY++LLGISVGG+ V G+S S FKLD+ GNGGV
Sbjct: 321 TIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 380

Query: 447 IIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHF 506
           IID GTSVTRL +PAY+ALRDAFR GA+ LK AP +SLFDTC+DLSG TTVKVPTVV HF
Sbjct: 381 IIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF 440

Query: 507 RGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPR 559
            G +VSLPASNYLIPV+  GRFCFAFAGT   LSIIGNIQQQGFRV YDL GSRVGF  R
Sbjct: 441 GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSR 483

BLAST of CmaCh00G002950 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 399.8 bits (1026), Expect = 2.7e-111
Identity = 206/354 (58.19%), Positives = 249/354 (70.34%), Query Frame = 1

Query: 206 VISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSR 265
           +ISG  QGSGEYFTR+G+G P + +YMVLDTGSD+ WLQC PC +CY QT+P+F P  S 
Sbjct: 137 LISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSS 196

Query: 266 SYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 325
           SY  + C TP C  LE   C +  TCLY+VSYGDGSYT G+F TETLT   T V+ VA+G
Sbjct: 197 SYEPLSCDTPQCNALEVSEC-RNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVG 256

Query: 326 CGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAV 385
           CGH NEGLFVGAAGLLGLG G L+ PSQ   +    FSYCLVDR + S  S+V FG S +
Sbjct: 257 CGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDS-ASTVDFGTS-L 316

Query: 386 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVT 445
           S  A   PLL N +LDTFYY+ L GISVGG  +  I  S F++D +G+GG+IID GT+VT
Sbjct: 317 SPDAVVAPLLRNHQLDTFYYLGLTGISVGG-ELLQIPQSSFEMDESGSGGIIIDSGTAVT 376

Query: 446 RLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD-VSLP 505
           RL    Y +LRD+F  G   L+ A   ++FDTCY+LS KTTV+VPTV  HF G   ++LP
Sbjct: 377 RLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALP 436

Query: 506 ASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 559
           A NY+IPVD  G FC AFA T S L+IIGN+QQQG RV +DLA S +GFS   C
Sbjct: 437 AKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of CmaCh00G002950 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 392.5 bits (1007), Expect = 4.3e-109
Identity = 236/503 (46.92%), Positives = 304/503 (60.44%), Query Frame = 1

Query: 62  SLSPPPSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIP-KPLPASPSLFSPESDTDSESFI 121
           SLS PP            + ++  + ++    QT++   P  +S +   PES +D   F 
Sbjct: 28  SLSTPP------------KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPV-FF 87

Query: 122 SSEAGLELQLHHLDAL--SLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQ 181
           +S + L L+LH  D    S ++  + L   RL+RD      S + AG       ++ R  
Sbjct: 88  NSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERD------SSRVAGIV-----AKIRFA 147

Query: 182 SHETDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGS 241
               D           T + T   ++ V+SG +QGSGEYF+RIGVGTP K +Y+VLDTGS
Sbjct: 148 VEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGS 207

Query: 242 DIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYG 301
           D+ W+QC PC +CY Q+DPVFNP  S +Y  + C  P C  LE+  C   + CLYQVSYG
Sbjct: 208 DVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNK-CLYQVSYG 267

Query: 302 DGSYTTGEFVTETLTFRRT-KVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRS 361
           DGS+T GE  T+T+TF  + K+  VALGCGHDNEGLF GAAGLLGLG G LS  +Q   +
Sbjct: 268 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 327

Query: 362 FNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTP 421
               FSYCLVDR  S K SS+ F    +       PLL N ++DTFYYV L G SVGG  
Sbjct: 328 ---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEK 387

Query: 422 VFGISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKL-APEFSLFD 481
           V  +  + F +D++G+GGVI+DCGT+VTRL   AY +LRDAF     +LK  +   SLFD
Sbjct: 388 VV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFD 447

Query: 482 TCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNI 541
           TCYD S  +TVKVPTV  HF G   + LPA NYLIPVDD+G FCFAFA T+S LSIIGN+
Sbjct: 448 TCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNV 500

Query: 542 QQQGFRVVYDLAGSRVGFSPRGC 559
           QQQG R+ YDL+ + +G S   C
Sbjct: 508 QQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh00G002950 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 387.1 bits (993), Expect = 1.8e-107
Identity = 220/493 (44.62%), Positives = 292/493 (59.23%), Query Frame = 1

Query: 73  PLYPNLFRLHLH---SLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQ 132
           PL+     LHLH   S S +F DFQ +     P + +   P+ +    S  SS     L+
Sbjct: 4   PLFFFFLHLHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPDFNNTHFSDESSSK-YTLR 63

Query: 133 LHHLDALS--LNRTPEELFHLRLQRDALGCS-VSEQNAGGALPSPPSERRSQSHETDGGS 192
           L H D       R      H R++RD    S +  + +G  +PS                
Sbjct: 64  LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS---------------- 123

Query: 193 QNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCA 252
                 S + +    F S ++SG+ QGSGEYF RIGVG+PP+  YMV+D+GSD+VW+QC 
Sbjct: 124 ------SDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ 183

Query: 253 PCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGE 312
           PCK CY Q+DPVF+P KS SY+ V C + +C R+E+ GC+    C Y+V YGDGSYT G 
Sbjct: 184 PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCH-SGGCRYEVMYGDGSYTKGT 243

Query: 313 FVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCL 372
              ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G +SF  Q        F YCL
Sbjct: 244 LALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCL 303

Query: 373 VDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHF 432
           V R   S   S+VFG  A+   A + PL+ NPR  +FYYV L G+ VGG  +  +    F
Sbjct: 304 VSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI-PLPDGVF 363

Query: 433 KLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTT 492
            L   G+GGV++D GT+VTRL   AY+A RD F++  ++L  A   S+FDTCYDLSG  +
Sbjct: 364 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 423

Query: 493 VKVPTVVLHF-RGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYD 552
           V+VPTV  +F  G  ++LPA N+L+PVDD+G +CFAFA + +GLSIIGNIQQ+G +V +D
Sbjct: 424 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 470

Query: 553 LAGSRVGFSPRGC 559
            A   VGF P  C
Sbjct: 484 GANGFVGFGPNVC 470
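
The percentages in each HSP header follow directly from the counts beside them: for AT1G25510.1, 206 identical positions over an alignment length of 354 gives 58.19%. The sketch below re-derives those figures from the two header lines. It is a minimal illustration only, assuming the header layout seen in the records above; the names `HSP_SCORE`, `HSP_IDENT` and `parse_hsp` are hypothetical and not part of any BLAST tool.

```python
import re

# Patterns for the two-line HSP header as it appears on this page.
# The layout is an assumption inferred from the records above.
HSP_SCORE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)"
)
# "Posi?tives" tolerates both spellings of the label that occur in exports.
HSP_IDENT = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \((?P<ident_pct>[\d.]+)%\), "
    r"Posi?tives = (?P<pos>\d+)/(?P<pos_len>\d+) \((?P<pos_pct>[\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse one HSP header and recompute its percentages from the counts."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("not an HSP header")
    identical = int(i["ident"])
    positives = int(i["pos"])
    length = int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
        "reported_identity_pct": float(i["ident_pct"]),
        "reported_positives_pct": float(i["pos_pct"]),
    }


hsp = parse_hsp(
    "HSP 1 Score: 399.8 bits (1026), Expect = 2.7e-111",
    "Identity = 206/354 (58.19%), Positives = 249/354 (70.34%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing the recomputed values with the reported ones is a cheap consistency check when records like these are copied between pages.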

BLAST of CmaCh00G002950 vs. NCBI nr
Match: gi|659074959|ref|XP_008437888.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis melo])

HSP 1 Score: 786.2 bits (2029), Expect = 3.8e-224
Identity = 409/484 (84.50%), Positives = 430/484 (88.84%), Query Frame = 1

Query: 81  LHLHSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA-----GLELQLHHLDA 140
           L + SLSTAFSDFQTLI + LP+SPS F P   +DS SF+SSEA     GLEL LHHLDA
Sbjct: 18  LAILSLSTAFSDFQTLILRSLPSSPS-FLP---SDSNSFLSSEATETELGLELHLHHLDA 77

Query: 141 LSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGT 200
           LS NRTPEELFHLRLQRDA+   V + ++ GA                  S+NLS+ SG 
Sbjct: 78  LSFNRTPEELFHLRLQRDAI--RVKKLSSLGAT-----------------SRNLSRPSG- 137

Query: 201 SHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQT 260
              TTGFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIVWLQCAPCKNCYSQT
Sbjct: 138 ---TTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQT 197

Query: 261 DPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFR 320
           DPVFNPVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGSYTTGEFVTETLTFR
Sbjct: 198 DPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFR 257

Query: 321 RTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKP 380
           RTKVE+VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQKFSYCLVDRSASSKP
Sbjct: 258 RTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKP 317

Query: 381 SSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGG 440
           SSVVFG+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPV GIS+SHFKLD  GNGG
Sbjct: 318 SSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISSSHFKLDRTGNGG 377

Query: 441 VIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLH 500
           VIIDCGTSVTRLN+PAYIALRDAFRAGASSLK APEFSLFDTCYDLSGKTTVKVPTVVLH
Sbjct: 378 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLH 437

Query: 501 FRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSP 560
           FRGADVSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSP
Sbjct: 438 FRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSP 474

BLAST of CmaCh00G002950 vs. NCBI nr
Match: gi|449432044|ref|XP_004133810.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus])

HSP 1 Score: 780.0 bits (2013), Expect = 2.7e-222
Identity = 410/498 (82.33%), Positives = 432/498 (86.75%), Query Frame = 1

Query: 67  PSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA-- 126
           P+  S P     F L + SL+TAFSDFQTL    LP+SPS F P   +DS SF+SSEA  
Sbjct: 3   PNTISLPFI--FFLLTVLSLATAFSDFQTLPLTSLPSSPS-FLP---SDSNSFLSSEATQ 62

Query: 127 ---GLELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHE 186
              GLEL LHHLDALS NRTPEELFHLRLQRDA+   V + ++ GA              
Sbjct: 63  SELGLELHLHHLDALSFNRTPEELFHLRLQRDAI--RVKKLSSLGAT------------- 122

Query: 187 TDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIV 246
               S+NLS+  G    TTGFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIV
Sbjct: 123 ----SRNLSKPGG----TTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIV 182

Query: 247 WLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGS 306
           WLQCAPCKNCYSQTDPVFNPVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGS
Sbjct: 183 WLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGS 242

Query: 307 YTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQK 366
           YTTGEFVTETLTFRRTKVE+VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQK
Sbjct: 243 YTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQK 302

Query: 367 FSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGI 426
           FSYCLVDRSASSKPSSVVFG+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPV GI
Sbjct: 303 FSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI 362

Query: 427 SASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDL 486
           +ASHFKLD  GNGGVIIDCGTSVTRLN+PAYIALRDAFRAGASSLK APEFSLFDTCYDL
Sbjct: 363 TASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDL 422

Query: 487 SGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFR 546
           SGKTTVKVPTVVLHFRGADVSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFR
Sbjct: 423 SGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFR 471

Query: 547 VVYDLAGSRVGFSPRGCA 560
           VVYDLA SRVGFSPRGCA
Sbjct: 483 VVYDLASSRVGFSPRGCA 471

BLAST of CmaCh00G002950 vs. NCBI nr
Match: gi|700201288|gb|KGN56421.1| (Aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 780.0 bits (2013), Expect = 2.7e-222
Identity = 410/498 (82.33%), Positives = 432/498 (86.75%), Query Frame = 1

Query: 67  PSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA-- 126
           P+  S P     F L + SL+TAFSDFQTL    LP+SPS F P   +DS SF+SSEA  
Sbjct: 44  PNTISLPFI--FFLLTVLSLATAFSDFQTLPLTSLPSSPS-FLP---SDSNSFLSSEATQ 103

Query: 127 ---GLELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHE 186
              GLEL LHHLDALS NRTPEELFHLRLQRDA+   V + ++ GA              
Sbjct: 104 SELGLELHLHHLDALSFNRTPEELFHLRLQRDAI--RVKKLSSLGAT------------- 163

Query: 187 TDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIV 246
               S+NLS+  G    TTGFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIV
Sbjct: 164 ----SRNLSKPGG----TTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIV 223

Query: 247 WLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGS 306
           WLQCAPCKNCYSQTDPVFNPVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGS
Sbjct: 224 WLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGS 283

Query: 307 YTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQK 366
           YTTGEFVTETLTFRRTKVE+VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQK
Sbjct: 284 YTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQK 343

Query: 367 FSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGI 426
           FSYCLVDRSASSKPSSVVFG+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPV GI
Sbjct: 344 FSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI 403

Query: 427 SASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDL 486
           +ASHFKLD  GNGGVIIDCGTSVTRLN+PAYIALRDAFRAGASSLK APEFSLFDTCYDL
Sbjct: 404 TASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDL 463

Query: 487 SGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFR 546
           SGKTTVKVPTVVLHFRGADVSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFR
Sbjct: 464 SGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFR 512

Query: 547 VVYDLAGSRVGFSPRGCA 560
           VVYDLA SRVGFSPRGCA
Sbjct: 524 VVYDLASSRVGFSPRGCA 512

BLAST of CmaCh00G002950 vs. NCBI nr
Match: gi|255564685|ref|XP_002523337.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis])

HSP 1 Score: 667.5 bits (1721), Expect = 2.0e-188
Identity = 343/469 (73.13%), Positives = 383/469 (81.66%), Query Frame = 1

Query: 92  DFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQLHHLDALSLNRTPEELFHLRLQ 151
           ++QTL+  PL + P+L   +S++ +++  SS A   +QLHH+DALS N TPE LF  RLQ
Sbjct: 27  NYQTLVANPLRSQPTLSWTDSESPTDTAESS-ATFSVQLHHVDALSFNSTPETLFTTRLQ 86

Query: 152 RDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGTTGFSSSVISGLA 211
           RDA                   E  S   ET G  + +          TGFSSSVISGLA
Sbjct: 87  RDAARV----------------EAISYLAETAGTGKRVG---------TGFSSSVISGLA 146

Query: 212 QGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVL 271
           QGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPCK CY+Q+DPVF+P KSRS++ + 
Sbjct: 147 QGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIA 206

Query: 272 CRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDN 331
           CR+PLC RL+SPGCN QKQTC+YQVSYGDGS+T G+F TETLTFRRT+V RVALGCGHDN
Sbjct: 207 CRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHDN 266

Query: 332 EGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTAR 391
           EGLFVGAAGLLGLGRG LSFPSQ GR FN KFSYCLVDRSASSKPSS+VFGDSAVSRTAR
Sbjct: 267 EGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTAR 326

Query: 392 FTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVTRLNRP 451
           FTPL++NP+LDTFYYVELLGISVGGT V GI+AS FKLD  GNGGVIID GTSVTRL RP
Sbjct: 327 FTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTRP 386

Query: 452 AYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLI 511
           AYIA RDAFRAGAS+LK AP+FSLFDTC+DLSGKT VKVPTVVLHFRGADVSLPASNYLI
Sbjct: 387 AYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLI 446

Query: 512 PVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 560
           PVD +G FC AFAGT  GLSIIGNIQQQGFRVVYDLAGSRVGF+P GCA
Sbjct: 447 PVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469

BLAST of CmaCh00G002950 vs. NCBI nr
Match: gi|643716856|gb|KDP28482.1| (hypothetical protein JCGZ_14253 [Jatropha curcas])

HSP 1 Score: 667.2 bits (1720), Expect = 2.6e-188
Identity = 343/483 (71.01%), Positives = 386/483 (79.92%), Query Frame = 1

Query: 85  SLSTAFS---DFQTLIPKPLPASPSLFSPESDTDSESF-----ISSEAGLELQLHHLDAL 144
           SLST  S   D+QTL+  PLP   +L  P +D+++E+       +      LQLHH+DAL
Sbjct: 19  SLSTTLSSPLDYQTLVLNPLPRQTALSWPAADSEAETLQTLTDTADSTTFSLQLHHIDAL 78

Query: 145 SLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTS 204
           S N+TP++LF  RLQRDA         A  A+ +             GG           
Sbjct: 79  SNNKTPQDLFGERLQRDAFRVEALSSVAASAVGA-------------GGRVG-------- 138

Query: 205 HGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTD 264
              TGFSSSVISGLAQGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPC  CYSQ+D
Sbjct: 139 ---TGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCNKCYSQSD 198

Query: 265 PVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFR 324
           PVF+P KSRS++ + C +PLC RL+SPGCN QK+TC+YQVSYGDGS+T G+F TETLTFR
Sbjct: 199 PVFDPRKSRSFAGIPCGSPLCNRLDSPGCNTQKRTCMYQVSYGDGSFTYGDFSTETLTFR 258

Query: 325 RTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKP 384
           RTKV RVA+GCGHDN+GLFVGAAGLLGLGRG LSFPSQ G  FN+KFSYCLVDRSASSKP
Sbjct: 259 RTKVRRVAIGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGAQFNRKFSYCLVDRSASSKP 318

Query: 385 SSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGG 444
           SSVVFGDSA+SRTARFTPL++NP+LDTFYYVELLGISVGGT V GI+AS FKLD  GNGG
Sbjct: 319 SSVVFGDSAISRTARFTPLISNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGG 378

Query: 445 VIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLH 504
           VIID GTSVTRL RPAY+ALR+AFR GAS+LK APEFSLFDTC+DLSGKT VKVPTV LH
Sbjct: 379 VIIDSGTSVTRLTRPAYVALRNAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVALH 438

Query: 505 FRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSP 559
           FRGADVSLPASNYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLAGSRVGF+P
Sbjct: 439 FRGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRVGFAP 477

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
APF2_ARATH   6.0e-177  67.07         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
ASPG1_ARATH  7.7e-108  46.92         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
ASPG2_ARATH  3.2e-106  44.62         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
ASPA_ARATH   1.2e-71   40.94         Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 ...
AED1_ARATH   2.0e-71   42.37         Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1
Match Name        E-value   Identity (%)  Description
A0A0A0L8K0_CUCSA  1.9e-222  82.33         Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_3G119540 PE=3 SV=1
B9SBG8_RICCO      1.4e-188  73.13         Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0717990 ...
A0A067JX70_JATCU  1.8e-188  71.01         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14253 PE=3 SV=1
V4TCM2_9ROSI      3.1e-188  71.04         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019938mg PE=3 SV=1
A0A067HET1_CITSI  4.0e-188  71.04         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g040810mg PE=3 SV=1
Match Name   E-value   Identity (%)  Description
AT1G01300.1  3.4e-178  67.07         Eukaryotic aspartyl protease family protein
AT3G61820.1  2.7e-159  61.00         Eukaryotic aspartyl protease family protein
AT1G25510.1  2.7e-111  58.19         Eukaryotic aspartyl protease family protein
AT3G18490.1  4.3e-109  46.92         Eukaryotic aspartyl protease family protein
AT3G20015.1  1.8e-107  44.62         Eukaryotic aspartyl protease family protein
Match Name                        E-value   Identity (%)  Description
gi|659074959|ref|XP_008437888.1|  3.8e-224  84.50         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis melo]
gi|449432044|ref|XP_004133810.1|  2.7e-222  82.33         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus]
gi|700201288|gb|KGN56421.1|       2.7e-222  82.33         Aspartic proteinase nepenthesin-1 [Cucumis sativus]
gi|255564685|ref|XP_002523337.1|  2.0e-188  73.13         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis]
gi|643716856|gb|KDP28482.1|       2.6e-188  71.01         hypothetical protein JCGZ_14253 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR001969  Aspartic_peptidase_AS
IPR021109  Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0080167 response to karrikin
cellular_component GO:0016020 membrane
cellular_component GO:0009505 plant-type cell wall
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
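
Each GO assignment row above has three fields: category, accession, and term name, where the term name may itself contain spaces. The sketch below groups the rows by category. It is a minimal illustration, assuming whitespace-separated rows exactly as listed above; `GO_ROWS` and `group_go_terms` are hypothetical names introduced here.

```python
from collections import defaultdict

# The GO assignment rows copied from the listing above, header included.
GO_ROWS = """\
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0080167 response to karrikin
cellular_component GO:0016020 membrane
cellular_component GO:0009505 plant-type cell wall
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
"""


def group_go_terms(text):
    """Return {category: [(accession, term_name), ...]} for GO rows in text."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        # Split on the first two runs of whitespace only, so that
        # multi-word term names stay intact in the third field.
        parts = line.split(None, 2)
        if len(parts) != 3 or not parts[1].startswith("GO:"):
            continue  # skips the header row and any blank lines
        category, accession, name = parts
        grouped[category].append((accession, name))
    return dict(grouped)


terms = group_go_terms(GO_ROWS)
for category, entries in terms.items():
    print(category, len(entries))
```

Limiting the split to two separators is the design choice that matters here: a plain `split()` would break names such as "plant-type cell wall" into separate fields.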

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh00G002950.1  CmaCh00G002950.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                  Source   Source Term       Source Description         Alignment
IPR001461  Aspartic peptidase A1 family     PRINTS   PR00792           PEPSIN                     coord: 530..545, score: 7.7E-6; coord: 436..447, score: 7.7E-6; coord: 223..243, score: 7.
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES         coord: 12..34, score: 4.8E-242; coord: 204..558, score: 4.8E-242; coord: 116..176, score: 4.8E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE               coord: 232..243, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10                             coord: 214..382, score: 9.1E-41; coord: 384..559, score: 2.3
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases             coord: 210..558, score: 2.38E
None       No IPR available                 PANTHER  PTHR13683:SF308   ASPARTYL PROTEASE-RELATED  coord: 12..34, score: 4.8E-242; coord: 116..176, score: 4.8E-242; coord: 204..558, score: 4.8E