CmaCh06G010150 (gene) Cucurbita maxima (Rimu)

Name: CmaCh06G010150
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr06 : 6866760 .. 6868184 (+)
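
The location value can be turned into a feature length with a few lines of code. The sketch below assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page; `parse_location` and `span_length` are hypothetical helper names introduced for illustration.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr06 : 6866760 .. 6868184 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the span is 1425 bp, a multiple of three (475 codons including a stop codon).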
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGAGTTCACTCAGTGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTCCCGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCAACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTTCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCAGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

mRNA sequence

ATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGAGTTCACTCAGTGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTCCCGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCAACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTTCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCAGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

Coding sequence (CDS)

ATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGAGTTCACTCAGTGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTCCCGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCAACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTTCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCAGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

Protein sequence

MEAKTSALPFIGFLLTLLSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA
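
As a consistency check, the protein sequence above can be compared with a translation of the CDS. The sketch below assumes the standard genetic code and uses only the first 36 and last 27 nucleotides of the CDS, copied from the record, rather than the full sequence; `translate` is a hypothetical helper, not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGC"  # first 36 nt of the CDS
cds_end = "GGATTCTCCCCTCGTGGTTGTGCCTGA"             # last 27 nt of the CDS

print(translate(cds_start))  # MEAKTSALPFIG
print(translate(cds_end))    # GFSPRGCA
```

Both fragments agree with the start (MEAKTSALPFIG) and end (GFSPRGCA) of the listed protein, and the final codon TGA is a stop codon.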
BLAST of CmaCh06G010150 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 630.2 bits (1624), Expect = 1.9e-179
Identity = 329/476 (69.12%), Positives = 381/476 (80.04%), Query Frame = 1

Query: 13  FLLTLLSLSTAFSDFQTLIPKP--LP-ASPSLFSPESDTDSESFISSE----------AG 72
           F L+L S S+  S FQTL P    LP ASP  F P  D+DSES + SE          + 
Sbjct: 15  FFLSLPSFSSLPS-FQTLFPNSHSLPCASPVSFQP--DSDSESLLESEFESGSDSESSSS 74

Query: 73  LELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTTGFSS 132
           + L L H+DALS N+TP+ELF  RLQRD+ RV  +++L+      +       G  GFSS
Sbjct: 75  ITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPG--GFSS 134

Query: 133 SVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKS 192
           SV+SGL+QGSGEYFTR+GVGTP +Y+YMVLDTGSDIVWLQCAPC+ CYSQ+DP+F+P KS
Sbjct: 135 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 194

Query: 193 RSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVA 252
           ++Y+ + C +P C RL+S GCN +++TCLYQVSYGDGS+T G+F TETLTFRR +V+ VA
Sbjct: 195 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVA 254

Query: 253 LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDS 312
           LGCGHDNEGLFVGAAGLLGLG+G LSFP Q G  FNQKFSYCLVDRSASSKPSSVVFG++
Sbjct: 255 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNA 314

Query: 313 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGTS 372
           AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V  ++AS FKLD  GNGGVIID GTS
Sbjct: 315 AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTS 374

Query: 373 VTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSL 432
           VTRL RPAYIA+R+AFR GA +LK AP+FSLFDTC+DLS    VKVPTVVLHFRGADVSL
Sbjct: 375 VTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSL 434

Query: 433 PASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
           PA+NYLIPVD NG+FCFAFAGT  GLSIIGNIQQQGFRVVYDLA SRVGF+P GCA
Sbjct: 435 PATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
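
The identity and positives percentages in each HSP header are the counts divided by the alignment length (gap columns included). A minimal sketch for extracting and recomputing them follows; the regular expression is an assumption about this page's line layout, accepts either spelling of the positives label, and `parse_hsp_stats` is a hypothetical name.

```python
import re

# Matches e.g. "Identity = 329/476 (69.12%), Positives = 381/476 (80.04%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, aligned, positives, aligned_again = map(int, match.groups())
    if aligned != aligned_again:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
    }

stats = parse_hsp_stats(
    "Identity = 329/476 (69.12%), Positives = 381/476 (80.04%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])  # 69.12 80.04
```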

BLAST of CmaCh06G010150 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 398.7 bits (1023), Expect = 9.1e-110
Identity = 228/471 (48.41%), Positives = 300/471 (63.69%), Query Frame = 1

Query: 17  LLSLSTAFSDFQTLIP-KPLPASPSLFSPESDTDSESFISSEAGLELQLHHLDAL--SLN 76
           +L + ++    QT++   P  +S +   PES +D   F +S + L L+LH  D    S +
Sbjct: 37  VLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPV-FFNSSSPLSLELHSRDTFVASQH 96

Query: 77  RTPEELFHLRLQRDALRVTKL-SSLSGGSQNLSQAS-------GTSHGTTGFSSSVISGL 136
           +  + L   RL+RD+ RV  + + +    + + ++         T + T   ++ V+SG 
Sbjct: 97  KDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGA 156

Query: 137 AQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKV 196
           +QGSGEYF+RIGVGTP K +Y+VLDTGSD+ W+QC PC +CY Q+DPVFNP  S +Y  +
Sbjct: 157 SQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSL 216

Query: 197 LCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVERVALGCGHD 256
            C  P C  LE+  C   + CLYQVSYGDGS+T GE  T+T+TF  + K+  VALGCGHD
Sbjct: 217 TCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHD 276

Query: 257 NEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTA 316
           NEGLF GAAGLLGLG G LS  +Q   +    FSYCLVDR  S K SS+ F    +    
Sbjct: 277 NEGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGD 336

Query: 317 RFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGTSVTRLNR 376
              PLL N ++DTFYYV L G SVGG  V  +  + F +D++G+GGVI+DCGT+VTRL  
Sbjct: 337 ATAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQT 396

Query: 377 PAYIALRNAFRAGASSLKL-APEFSLFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPASN 436
            AY +LR+AF     +LK  +   SLFDTCYD S  +TVKVPTV  HF G   + LPA N
Sbjct: 397 QAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKN 456

Query: 437 YLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 474
           YLIPVDD+G FCFAFA T+S LSIIGN+QQQG R+ YDL+ + +G S   C
Sbjct: 457 YLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh06G010150 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 385.2 bits (988), Expect = 1.0e-105
Identity = 217/474 (45.78%), Positives = 289/474 (60.97%), Query Frame = 1

Query: 8   LPFIGFLLTL-----LSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLEL 67
           LP   F L L      S S +F DFQ +     P + +   P+ +    S  SS     L
Sbjct: 3   LPLFFFFLHLHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPDFNNTHFSDESSSK-YTL 62

Query: 68  QLHHLDALS--LNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTTGFSSS 127
           +L H D       R      H R++RD  RV+ +     G   +  +S + +    F S 
Sbjct: 63  RLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGK--VIPSSDSRYEVNDFGSD 122

Query: 128 VISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSR 187
           ++SG+ QGSGEYF RIGVG+PP+  YMV+D+GSD+VW+QC PCK CY Q+DPVF+P KS 
Sbjct: 123 IVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSG 182

Query: 188 SYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 247
           SY+ V C + +C R+E+ GC+    C Y+V YGDGSYT G    ETLTF +T V  VA+G
Sbjct: 183 SYTGVSCGSSVCDRIENSGCHSGG-CRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMG 242

Query: 248 CGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAV 307
           CGH N G+F+GAAGLLG+G G +SF  Q        F YCLV R   S   S+VFG  A+
Sbjct: 243 CGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGREAL 302

Query: 308 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGTSVT 367
              A + PL+ NPR  +FYYV L G+ VGG  +  +    F L   G+GGV++D GT+VT
Sbjct: 303 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIP-LPDGVFDLTETGDGGVVMDTGTAVT 362

Query: 368 RLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLP 427
           RL   AY+A R+ F++  ++L  A   S+FDTCYDLSG  +V+VPTV  +F  G  ++LP
Sbjct: 363 RLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLP 422

Query: 428 ASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 474
           A N+L+PVDD+G +CFAFA + +GLSIIGNIQQ+G +V +D A   VGF P  C
Sbjct: 423 ARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmaCh06G010150 vs. Swiss-Prot
Match: ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 1.2e-72
Identity = 174/452 (38.50%), Positives = 242/452 (53.54%), Query Frame = 1

Query: 34  PLPASPSLFSPESDTDSESF-ISSEAGLELQLHHLDALSLNRTPEELFHLRLQRDALRVT 93
           P  +S  + SP + T   S  ++   G   +L++  A S    P+ +  LRL  D  RV 
Sbjct: 43  PSSSSSCVLSPRASTTKSSLHVTHRHGTCSRLNNGKATS----PDHVEILRL--DQARVN 102

Query: 94  KLSSLSGGSQNLSQASGTSHGTTGFSSSVIS--GLAQGSGEYFTRIGVGTPPKYIYMVLD 153
            + S       LS+   T H +   S+ + +  G   GSG Y   +G+GTP   + ++ D
Sbjct: 103 SIHS------KLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFD 162

Query: 154 TGSDIVWLQCAPC-KNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESP----GCNQKQT 213
           TGSD+ W QC PC + CY Q +P+FNP KS SY  V C +  C  L S     G      
Sbjct: 163 TGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASN 222

Query: 214 CLYQVSYGDGSYTTGEFVTETLTFRRTKV-ERVALGCGHDNEGLFVGAAGLLGLGRGGLS 273
           C+Y + YGD S++ G    E  T   + V + V  GCG +N+GLF G AGLLGLGR  LS
Sbjct: 223 CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLS 282

Query: 274 FPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELL 333
           FPSQ   ++N+ FSYCL   S++S    + FG + +SR+ +FTP+ T     +FY + ++
Sbjct: 283 FPSQTATAYNKIFSYCLP--SSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIV 342

Query: 334 GISVGGTPVSRISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLA 393
            I+VGG  +  I ++ F        G +ID GT +TRL   AY ALR++F+A  S     
Sbjct: 343 AITVGGQKLP-IPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTT 402

Query: 394 PEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTT--S 453
              S+ DTC+DLSG  TV +P V   F G  V    S  +  V    + C AFAG +  S
Sbjct: 403 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDS 462

Query: 454 GLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
             +I GN+QQQ   VVYD AG RVGF+P GC+
Sbjct: 463 NAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of CmaCh06G010150 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 273.9 bits (699), Expect = 3.4e-72
Identity = 164/420 (39.05%), Positives = 231/420 (55.00%), Query Frame = 1

Query: 59  GLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTTGFS 118
           G ++ L H+D+   N T  +L    ++R             GS+ L +     +G +G  
Sbjct: 40  GFQIMLEHVDS-GKNLTKFQLLERAIER-------------GSRRLQRLEAMLNGPSGVE 99

Query: 119 SSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVK 178
           +SV +G     GEY   + +GTP +    ++DTGSD++W QC PC  C++Q+ P+FNP  
Sbjct: 100 TSVYAG----DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQG 159

Query: 179 SRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVA 238
           S S+S + C + LC  L SP C+    C Y   YGDGS T G   TETLTF    +  + 
Sbjct: 160 SSSFSTLPCSSQLCQALSSPTCS-NNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNIT 219

Query: 239 LGCGHDNEGLFVG-AAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGD 298
            GCG +N+G   G  AGL+G+GRG LS PSQ   +   KFSYC+     SS PS+++ G 
Sbjct: 220 FGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTP-IGSSTPSNLLLGS 279

Query: 299 SAVSRTARF--TPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDS-NGNGGVIID 358
            A S TA    T L+ + ++ TFYY+ L G+SVG T +  I  S F L+S NG GG+IID
Sbjct: 280 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLP-IDPSAFALNSNNGTGGIIID 339

Query: 359 CGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDL-SGKTTVKVPTVVLHFRG 418
            GT++T     AY ++R  F +  +   +    S FD C+   S  + +++PT V+HF G
Sbjct: 340 SGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDG 399

Query: 419 ADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 474
            D+ LP+ NY I    NG  C A   ++ G+SI GNIQQQ   VVYD   S V F+   C
Sbjct: 400 GDLELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmaCh06G010150 vs. TrEMBL
Match: A0A0A0L8K0_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_3G119540 PE=3 SV=1)

HSP 1 Score: 817.4 bits (2110), Expect = 9.1e-234
Identity = 420/479 (87.68%), Positives = 440/479 (91.86%), Query Frame = 1

Query: 1   MEAKTSALPFIGFLLTLLSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA-- 60
           ME  T +LPFI FLLT+LSL+TAFSDFQTL    LP+SPS F P   +DS SF+SSEA  
Sbjct: 42  MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPS-FLP---SDSNSFLSSEATQ 101

Query: 61  ---GLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTT 120
              GLEL LHHLDALS NRTPEELFHLRLQRDA+RV KLSSL   S+NLS+  G    TT
Sbjct: 102 SELGLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGG----TT 161

Query: 121 GFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 180
           GFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIVWLQCAPCKNCYSQTDPVFN
Sbjct: 162 GFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 221

Query: 181 PVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 240
           PVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE
Sbjct: 222 PVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 281

Query: 241 RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVF 300
           +VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQKFSYCLVDRSASSKPSSVVF
Sbjct: 282 QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVF 341

Query: 301 GDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDC 360
           G+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVS I+ASHFKLD  GNGGVIIDC
Sbjct: 342 GNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDC 401

Query: 361 GTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 420
           GTSVTRLN+PAYIALR+AFRAGASSLK APEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD
Sbjct: 402 GTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 461

Query: 421 VSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
           VSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 462 VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 512

BLAST of CmaCh06G010150 vs. TrEMBL
Match: A0A067JX70_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14253 PE=3 SV=1)

HSP 1 Score: 683.7 bits (1763), Expect = 1.6e-193
Identity = 352/483 (72.88%), Positives = 397/483 (82.19%), Query Frame = 1

Query: 1   MEAKT-SALPFIGFLLTLLSLSTAFS---DFQTLIPKPLPASPSLFSPESDTDSESF--- 60
           ME K  +A  F  F +  LSLST  S   D+QTL+  PLP   +L  P +D+++E+    
Sbjct: 1   MEGKARNAFLFFSFTI-FLSLSTTLSSPLDYQTLVLNPLPRQTALSWPAADSEAETLQTL 60

Query: 61  --ISSEAGLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTS 120
              +      LQLHH+DALS N+TP++LF  RLQRDA RV  LSS++  +       GT 
Sbjct: 61  TDTADSTTFSLQLHHIDALSNNKTPQDLFGERLQRDAFRVEALSSVAASAVGAGGRVGT- 120

Query: 121 HGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTD 180
               GFSSSVISGLAQGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPC  CYSQ+D
Sbjct: 121 ----GFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCNKCYSQSD 180

Query: 181 PVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFR 240
           PVF+P KSRS++ + C +PLC RL+SPGCN QK+TC+YQVSYGDGS+T G+F TETLTFR
Sbjct: 181 PVFDPRKSRSFAGIPCGSPLCNRLDSPGCNTQKRTCMYQVSYGDGSFTYGDFSTETLTFR 240

Query: 241 RTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKP 300
           RTKV RVA+GCGHDN+GLFVGAAGLLGLGRG LSFPSQ G  FN+KFSYCLVDRSASSKP
Sbjct: 241 RTKVRRVAIGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGAQFNRKFSYCLVDRSASSKP 300

Query: 301 SSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGG 360
           SSVVFGDSA+SRTARFTPL++NP+LDTFYYVELLGISVGGT V  I+AS FKLD  GNGG
Sbjct: 301 SSVVFGDSAISRTARFTPLISNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGG 360

Query: 361 VIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLH 420
           VIID GTSVTRL RPAY+ALRNAFR GAS+LK APEFSLFDTC+DLSGKT VKVPTV LH
Sbjct: 361 VIIDSGTSVTRLTRPAYVALRNAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVALH 420

Query: 421 FRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSP 474
           FRGADVSLPASNYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLAGSRVGF+P
Sbjct: 421 FRGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRVGFAP 477

BLAST of CmaCh06G010150 vs. TrEMBL
Match: B9SBG8_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0717990 PE=3 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 1.5e-191
Identity = 349/477 (73.17%), Positives = 395/477 (82.81%), Query Frame = 1

Query: 1   MEAKTSALPFIGFLLTLLSLSTAFS-DFQTLIPKPLPASPSLFSPESDTDSESFISSEAG 60
           ME K     F+ F    +  S + S ++QTL+  PL + P+L   +S++ +++  SS A 
Sbjct: 1   MEGKAGRNAFLLFFSFTIFFSHSTSLNYQTLVANPLRSQPTLSWTDSESPTDTAESS-AT 60

Query: 61  LELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGT-TGFS 120
             +QLHH+DALS N TPE LF  RLQRDA RV  +S L+       + +GT     TGFS
Sbjct: 61  FSVQLHHVDALSFNSTPETLFTTRLQRDAARVEAISYLA-------ETAGTGKRVGTGFS 120

Query: 121 SSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVK 180
           SSVISGLAQGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPCK CY+Q+DPVF+P K
Sbjct: 121 SSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRK 180

Query: 181 SRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERV 240
           SRS++ + CR+PLC RL+SPGCN QKQTC+YQVSYGDGS+T G+F TETLTFRRT+V RV
Sbjct: 181 SRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARV 240

Query: 241 ALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGD 300
           ALGCGHDNEGLFVGAAGLLGLGRG LSFPSQ GR FN KFSYCLVDRSASSKPSS+VFGD
Sbjct: 241 ALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGD 300

Query: 301 SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGT 360
           SAVSRTARFTPL++NP+LDTFYYVELLGISVGGT V  I+AS FKLD  GNGGVIID GT
Sbjct: 301 SAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGT 360

Query: 361 SVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVS 420
           SVTRL RPAYIA R+AFRAGAS+LK AP+FSLFDTC+DLSGKT VKVPTVVLHFRGADVS
Sbjct: 361 SVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 420

Query: 421 LPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
           LPASNYLIPVD +G FC AFAGT  GLSIIGNIQQQGFRVVYDLAGSRVGF+P GCA
Sbjct: 421 LPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469

BLAST of CmaCh06G010150 vs. TrEMBL
Match: V7BRW0_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_006G113700g PE=3 SV=1)

HSP 1 Score: 669.8 bits (1727), Expect = 2.4e-189
Identity = 348/484 (71.90%), Positives = 392/484 (80.99%), Query Frame = 1

Query: 1   MEAK---TSALPFIGFLLTL--LSLSTAFSDFQ------TLIPKPLPASPSLFSPESDTD 60
           ME K   T+ L F+   LTL   + +T    FQ      TL    LP SP+L  P+++  
Sbjct: 57  MEVKKKTTNGLLFLSLALTLSAAAATTTSLPFQLQTQTLTLSLHSLPHSPTLSWPQAEAL 116

Query: 61  SESFISSEAGLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASG 120
           +E     E  L L LHH+DALS N+TPE+LFHLRLQRD  RV  L +L+  + + ++ SG
Sbjct: 117 AEP--DPEEALSLNLHHIDALSSNKTPEQLFHLRLQRDGKRVQSLLTLAALNSSHARRSG 176

Query: 121 TSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQ 180
           +     GFSSS+ISGLAQGSGEYFTRIGVGTP KY+YMVLDTGSD+VWLQCAPC+ CY+Q
Sbjct: 177 S-----GFSSSIISGLAQGSGEYFTRIGVGTPAKYVYMVLDTGSDVVWLQCAPCRKCYTQ 236

Query: 181 TDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTF 240
           TDPVF+P KSR+Y+ + C  PLC RL+SPGCN+ + C YQVSYGDGS+T G+F TETLTF
Sbjct: 237 TDPVFDPTKSRTYAGIPCGAPLCRRLDSPGCNKNKVCQYQVSYGDGSFTFGDFSTETLTF 296

Query: 241 RRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSK 300
           RRT+V RVALGCGHDNEGLF+GAAGLLGLGRG LSFP Q GR FNQKFSYCLVDRSAS+K
Sbjct: 297 RRTRVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAK 356

Query: 301 PSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNG 360
           PSSVVFGDSAVSRTARFTPLL NP+LDTFYYVELLGISVGGTPV  +SAS F+LDS GNG
Sbjct: 357 PSSVVFGDSAVSRTARFTPLLQNPKLDTFYYVELLGISVGGTPVRGLSASLFRLDSAGNG 416

Query: 361 GVIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVL 420
           GVIID GTSVTRL RPAYIALR+AFR GAS LK A EFSLFDTC+DLSG T VKVPTVVL
Sbjct: 417 GVIIDSGTSVTRLTRPAYIALRDAFRVGASRLKRASEFSLFDTCFDLSGLTEVKVPTVVL 476

Query: 421 HFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFS 474
           HFRGADVSLPASNYLIPVD++G FCFAFAGT SGLSIIGNIQQQGFRV YDL GSRVGF+
Sbjct: 477 HFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSYDLGGSRVGFA 533

BLAST of CmaCh06G010150 vs. TrEMBL
Match: B9GR19_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0002s17270g PE=3 SV=2)

HSP 1 Score: 669.1 bits (1725), Expect = 4.0e-189
Identity = 346/486 (71.19%), Positives = 395/486 (81.28%), Query Frame = 1

Query: 7   ALPFIGFLLTLLSLSTAF----SDFQTLIPKPLPASPSLF----SPESDTDSESFISSEA 66
           AL F  F    LSLST        FQTL   PLP  P+L      PES+ ++++   S +
Sbjct: 10  ALLFFSFTCVFLSLSTTTLSTSPQFQTLTVNPLPNKPTLSWADTGPESEPETQTLTDSTS 69

Query: 67  -------GLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSG--GSQNLSQASG 126
                   L +QLHHLDALS + TP++LF+ RL RDA RV  L+SL+   GS N ++A G
Sbjct: 70  TEASTTTSLSVQLHHLDALSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARG 129

Query: 127 TSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQ 186
                 GFSSSV SGLAQGSGEYFTR+GVGTP +Y++MVLDTGSD+VW+QCAPCK CYSQ
Sbjct: 130 P-----GFSSSVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQ 189

Query: 187 TDPVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLT 246
           TDPVFNP KSRS++ + C +PLC RL+SPGC+ +K  CLYQVSYGDGS+T GEF TETLT
Sbjct: 190 TDPVFNPTKSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLT 249

Query: 247 FRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASS 306
           FR T+V RVALGCGHDNEGLF+GAAGLLGLGRG LSFPSQ GR F++KFSYCLVDRSASS
Sbjct: 250 FRGTRVGRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASS 309

Query: 307 KPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGN 366
           KPS +VFGDSA+SRTARFTPL++NP+LDTFYYVELLG+SVGGT V  I+AS FKLDS GN
Sbjct: 310 KPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGN 369

Query: 367 GGVIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVV 426
           GGVIID GTSVTRL RPAY+ALR+AFR GAS+LK APEFSLFDTC+DLSGKT VKVPTVV
Sbjct: 370 GGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVV 429

Query: 427 LHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGF 475
           LHFRGADVSLPASNYLIPVD++G FCFAFAGT SGLSI+GNIQQQGFRVVYDLA SRVGF
Sbjct: 430 LHFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGF 489

BLAST of CmaCh06G010150 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 630.2 bits (1624), Expect = 1.0e-180
Identity = 329/476 (69.12%), Positives = 381/476 (80.04%), Query Frame = 1

Query: 13  FLLTLLSLSTAFSDFQTLIPKP--LP-ASPSLFSPESDTDSESFISSE----------AG 72
           F L+L S S+  S FQTL P    LP ASP  F P  D+DSES + SE          + 
Sbjct: 15  FFLSLPSFSSLPS-FQTLFPNSHSLPCASPVSFQP--DSDSESLLESEFESGSDSESSSS 74

Query: 73  LELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTTGFSS 132
           + L L H+DALS N+TP+ELF  RLQRD+ RV  +++L+      +       G  GFSS
Sbjct: 75  ITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPG--GFSS 134

Query: 133 SVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKS 192
           SV+SGL+QGSGEYFTR+GVGTP +Y+YMVLDTGSDIVWLQCAPC+ CYSQ+DP+F+P KS
Sbjct: 135 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 194

Query: 193 RSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVA 252
           ++Y+ + C +P C RL+S GCN +++TCLYQVSYGDGS+T G+F TETLTFRR +V+ VA
Sbjct: 195 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVA 254

Query: 253 LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDS 312
           LGCGHDNEGLFVGAAGLLGLG+G LSFP Q G  FNQKFSYCLVDRSASSKPSSVVFG++
Sbjct: 255 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNA 314

Query: 313 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGTS 372
           AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V  ++AS FKLD  GNGGVIID GTS
Sbjct: 315 AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTS 374

Query: 373 VTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSL 432
           VTRL RPAYIA+R+AFR GA +LK AP+FSLFDTC+DLS    VKVPTVVLHFRGADVSL
Sbjct: 375 VTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSL 434

Query: 433 PASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
           PA+NYLIPVD NG+FCFAFAGT  GLSIIGNIQQQGFRVVYDLA SRVGF+P GCA
Sbjct: 435 PATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
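
This HSP has the same bit score (630.2) as the APF2_ARATH hit in the Swiss-Prot section, yet the Expect values differ (1.9e-179 there, 1.0e-180 here). Under the standard Karlin-Altschul relation E = m·n·2^(-S'), where S' is the bit score and m·n is the effective search space, such a difference would be explained by database size alone. The sketch below works in log space and assumes the reported values follow that relation exactly; they are rounded on the page, so the results are only rough estimates. `log10_search_space` is a hypothetical helper.

```python
import math

def log10_search_space(bit_score, e_value):
    """log10 of the search space m*n implied by E = m*n * 2**(-bit_score)."""
    return math.log10(e_value) + bit_score * math.log10(2)

swissprot = log10_search_space(630.2, 1.9e-179)  # hit reported vs. Swiss-Prot
tair10 = log10_search_space(630.2, 1.0e-180)     # same HSP reported vs. TAIR10

print(f"Swiss-Prot: m*n ~ 10^{swissprot:.2f}")
print(f"TAIR10:     m*n ~ 10^{tair10:.2f}")
print(f"ratio of implied search spaces: {10 ** (swissprot - tair10):.1f}")
```

The implied Swiss-Prot search space comes out roughly 19 times larger than the TAIR10 one, which is the same factor as the ratio of the two Expect values.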

BLAST of CmaCh06G010150 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 573.9 bits (1478), Expect = 8.9e-164
Identity = 299/483 (61.90%), Positives = 360/483 (74.53%), Query Frame = 1

Query: 1   MEAKTSALPFIGFLLTLLSLSTAFSDFQTLIPKPLPASPSLFSPESDT-DSESFISSEAG 60
           ME K            L   S+A S +QTL+   LP+S +L  PES++   ES   S   
Sbjct: 1   MERKVLNTLAFSVFAVLFFTSSASSQYQTLVVNTLPSSATLSWPESESLTDESLSESTTS 60

Query: 61  LELQLHHLDALSL--NRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTTGF 120
           L + L H+DALS   + +P +LF+LRLQRD+LRV  ++SL+  S   +    T     GF
Sbjct: 61  LSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGF 120

Query: 121 SSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPV 180
           S +VISGL+QGSGEYF R+GVGTP   +YMVLDTGSD+VWLQC+PCK CY+QTD +F+P 
Sbjct: 121 SGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPK 180

Query: 181 KSRSYSKVLCRTPLCLRLE-SPGC--NQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV 240
           KS++++ V C + LC RL+ S  C   + +TCLYQVSYGDGS+T G+F TETLTF   +V
Sbjct: 181 KSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARV 240

Query: 241 ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDR----SASSKP 300
           + V LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQ    +N KFSYCLVDR    S+S  P
Sbjct: 241 DHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPP 300

Query: 301 SSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGG 360
           S++VFG++AV +T+ FTPLLTNP+LDTFYY++LLGISVGG+ V  +S S FKLD+ GNGG
Sbjct: 301 STIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGG 360

Query: 361 VIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLH 420
           VIID GTSVTRL +PAY+ALR+AFR GA+ LK AP +SLFDTC+DLSG TTVKVPTVV H
Sbjct: 361 VIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 420

Query: 421 FRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSP 474
           F G +VSLPASNYLIPV+  GRFCFAFAGT   LSIIGNIQQQGFRV YDL GSRVGF  
Sbjct: 421 FGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 480

BLAST of CmaCh06G010150 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 398.7 bits (1023), Expect = 5.1e-111
Identity = 228/471 (48.41%), Positives = 300/471 (63.69%), Query Frame = 1

Query: 17  LLSLSTAFSDFQTLIP-KPLPASPSLFSPESDTDSESFISSEAGLELQLHHLDAL--SLN 76
           +L + ++    QT++   P  +S +   PES +D   F +S + L L+LH  D    S +
Sbjct: 37  VLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPV-FFNSSSPLSLELHSRDTFVASQH 96

Query: 77  RTPEELFHLRLQRDALRVTKL-SSLSGGSQNLSQAS-------GTSHGTTGFSSSVISGL 136
           +  + L   RL+RD+ RV  + + +    + + ++         T + T   ++ V+SG 
Sbjct: 97  KDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGA 156

Query: 137 AQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKV 196
           +QGSGEYF+RIGVGTP K +Y+VLDTGSD+ W+QC PC +CY Q+DPVFNP  S +Y  +
Sbjct: 157 SQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSL 216

Query: 197 LCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVERVALGCGHD 256
            C  P C  LE+  C   + CLYQVSYGDGS+T GE  T+T+TF  + K+  VALGCGHD
Sbjct: 217 TCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHD 276

Query: 257 NEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTA 316
           NEGLF GAAGLLGLG G LS  +Q   +    FSYCLVDR  S K SS+ F    +    
Sbjct: 277 NEGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGD 336

Query: 317 RFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGTSVTRLNR 376
              PLL N ++DTFYYV L G SVGG  V  +  + F +D++G+GGVI+DCGT+VTRL  
Sbjct: 337 ATAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQT 396

Query: 377 PAYIALRNAFRAGASSLKL-APEFSLFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPASN 436
            AY +LR+AF     +LK  +   SLFDTCYD S  +TVKVPTV  HF G   + LPA N
Sbjct: 397 QAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKN 456

Query: 437 YLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 474
           YLIPVDD+G FCFAFA T+S LSIIGN+QQQG R+ YDL+ + +G S   C
Sbjct: 457 YLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh06G010150 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 397.1 bits (1019), Expect = 1.5e-110
Identity = 234/490 (47.76%), Positives = 298/490 (60.82%), Query Frame = 1

Query: 9   PFIGFLLTLLSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISS------------ 68
           P   F   +  L++  S F  ++P+    + S+ +         + SS            
Sbjct: 3   PNYSFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQTHS 62

Query: 69  -EAGLELQLHHLDALSLNRTP----EELFHLRLQRDALRVTKL-SSLSGGSQNLSQAS-- 128
             +   LQLH    +S+  T     + L   RL RD  RV  L + L     N+S+A   
Sbjct: 63  ASSSFSLQLH--SRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLK 122

Query: 129 --GTSHGTT--GFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCK 188
              T + T      + +ISG  QGSGEYFTR+G+G P + +YMVLDTGSD+ WLQC PC 
Sbjct: 123 PISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCA 182

Query: 189 NCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVT 248
           +CY QT+P+F P  S SY  + C TP C  LE   C +  TCLY+VSYGDGSYT G+F T
Sbjct: 183 DCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC-RNATCLYEVSYGDGSYTVGDFAT 242

Query: 249 ETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDR 308
           ETLT   T V+ VA+GCGH NEGLFVGAAGLLGLG G L+ PSQ   +    FSYCLVDR
Sbjct: 243 ETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDR 302

Query: 309 SASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLD 368
            + S  S+V FG S +S  A   PLL N +LDTFYY+ L GISVGG  + +I  S F++D
Sbjct: 303 DSDS-ASTVDFGTS-LSPDAVVAPLLRNHQLDTFYYLGLTGISVGG-ELLQIPQSSFEMD 362

Query: 369 SNGNGGVIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKV 428
            +G+GG+IID GT+VTRL    Y +LR++F  G   L+ A   ++FDTCY+LS KTTV+V
Sbjct: 363 ESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEV 422

Query: 429 PTVVLHFRGAD-VSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAG 474
           PTV  HF G   ++LPA NY+IPVD  G FC AFA T S L+IIGN+QQQG RV +DLA 
Sbjct: 423 PTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLAN 482

BLAST of CmaCh06G010150 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 385.2 bits (988), Expect = 5.9e-107
Identity = 217/474 (45.78%), Positives = 289/474 (60.97%), Query Frame = 1

Query: 8   LPFIGFLLTL-----LSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLEL 67
           LP   F L L      S S +F DFQ +     P + +   P+ +    S  SS     L
Sbjct: 3   LPLFFFFLHLHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPDFNNTHFSDESSSK-YTL 62

Query: 68  QLHHLDALS--LNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTTGFSSS 127
           +L H D       R      H R++RD  RV+ +     G   +  +S + +    F S 
Sbjct: 63  RLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGK--VIPSSDSRYEVNDFGSD 122

Query: 128 VISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSR 187
           ++SG+ QGSGEYF RIGVG+PP+  YMV+D+GSD+VW+QC PCK CY Q+DPVF+P KS 
Sbjct: 123 IVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSG 182

Query: 188 SYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 247
           SY+ V C + +C R+E+ GC+    C Y+V YGDGSYT G    ETLTF +T V  VA+G
Sbjct: 183 SYTGVSCGSSVCDRIENSGCHSGG-CRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMG 242

Query: 248 CGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAV 307
           CGH N G+F+GAAGLLG+G G +SF  Q        F YCLV R   S   S+VFG  A+
Sbjct: 243 CGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGREAL 302

Query: 308 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDCGTSVT 367
              A + PL+ NPR  +FYYV L G+ VGG  +  +    F L   G+GGV++D GT+VT
Sbjct: 303 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIP-LPDGVFDLTETGDGGVVMDTGTAVT 362

Query: 368 RLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLP 427
           RL   AY+A R+ F++  ++L  A   S+FDTCYDLSG  +V+VPTV  +F  G  ++LP
Sbjct: 363 RLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLP 422

Query: 428 ASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 474
           A N+L+PVDD+G +CFAFA + +GLSIIGNIQQ+G +V +D A   VGF P  C
Sbjct: 423 ARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmaCh06G010150 vs. NCBI nr
Match: gi|659074959|ref|XP_008437888.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis melo])

HSP 1 Score: 819.7 bits (2116), Expect = 2.6e-234
Identity = 423/480 (88.12%), Positives = 443/480 (92.29%), Query Frame = 1

Query: 1   MEAKTSALPFIGFLL-TLLSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA- 60
           MEA T +LPFI FLL  +LSLSTAFSDFQTLI + LP+SPS F P   +DS SF+SSEA 
Sbjct: 3   MEANTISLPFIFFLLLAILSLSTAFSDFQTLILRSLPSSPS-FLP---SDSNSFLSSEAT 62

Query: 61  ----GLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGT 120
               GLEL LHHLDALS NRTPEELFHLRLQRDA+RV KLSSL   S+NLS+ SG    T
Sbjct: 63  ETELGLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSRPSG----T 122

Query: 121 TGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVF 180
           TGFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIVWLQCAPCKNCYSQTDPVF
Sbjct: 123 TGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVF 182

Query: 181 NPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV 240
           NPVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGSYTTGEFVTETLTFRRTKV
Sbjct: 183 NPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV 242

Query: 241 ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVV 300
           E+VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQKFSYCLVDRSASSKPSSVV
Sbjct: 243 EQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVV 302

Query: 301 FGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIID 360
           FG+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVS IS+SHFKLD  GNGGVIID
Sbjct: 303 FGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISSSHFKLDRTGNGGVIID 362

Query: 361 CGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 420
           CGTSVTRLN+PAYIALR+AFRAGASSLK APEFSLFDTCYDLSGKTTVKVPTVVLHFRGA
Sbjct: 363 CGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 422

Query: 421 DVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
           DVSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 423 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 474

BLAST of CmaCh06G010150 vs. NCBI nr
Match: gi|449432044|ref|XP_004133810.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus])

HSP 1 Score: 817.4 bits (2110), Expect = 1.3e-233
Identity = 420/479 (87.68%), Positives = 440/479 (91.86%), Query Frame = 1

Query: 1   MEAKTSALPFIGFLLTLLSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA-- 60
           ME  T +LPFI FLLT+LSL+TAFSDFQTL    LP+SPS F P   +DS SF+SSEA  
Sbjct: 1   MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPS-FLP---SDSNSFLSSEATQ 60

Query: 61  ---GLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTT 120
              GLEL LHHLDALS NRTPEELFHLRLQRDA+RV KLSSL   S+NLS+  G    TT
Sbjct: 61  SELGLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGG----TT 120

Query: 121 GFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 180
           GFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIVWLQCAPCKNCYSQTDPVFN
Sbjct: 121 GFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 180

Query: 181 PVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 240
           PVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE
Sbjct: 181 PVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 240

Query: 241 RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVF 300
           +VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQKFSYCLVDRSASSKPSSVVF
Sbjct: 241 QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVF 300

Query: 301 GDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDC 360
           G+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVS I+ASHFKLD  GNGGVIIDC
Sbjct: 301 GNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDC 360

Query: 361 GTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 420
           GTSVTRLN+PAYIALR+AFRAGASSLK APEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD
Sbjct: 361 GTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 420

Query: 421 VSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
           VSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 421 VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471

BLAST of CmaCh06G010150 vs. NCBI nr
Match: gi|700201288|gb|KGN56421.1| (Aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 817.4 bits (2110), Expect = 1.3e-233
Identity = 420/479 (87.68%), Positives = 440/479 (91.86%), Query Frame = 1

Query: 1   MEAKTSALPFIGFLLTLLSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEA-- 60
           ME  T +LPFI FLLT+LSL+TAFSDFQTL    LP+SPS F P   +DS SF+SSEA  
Sbjct: 42  MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPS-FLP---SDSNSFLSSEATQ 101

Query: 61  ---GLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTT 120
              GLEL LHHLDALS NRTPEELFHLRLQRDA+RV KLSSL   S+NLS+  G    TT
Sbjct: 102 SELGLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGG----TT 161

Query: 121 GFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 180
           GFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIVWLQCAPCKNCYSQTDPVFN
Sbjct: 162 GFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 221

Query: 181 PVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 240
           PVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE
Sbjct: 222 PVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 281

Query: 241 RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVF 300
           +VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR+FNQKFSYCLVDRSASSKPSSVVF
Sbjct: 282 QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVF 341

Query: 301 GDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGGVIIDC 360
           G+SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVS I+ASHFKLD  GNGGVIIDC
Sbjct: 342 GNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDC 401

Query: 361 GTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 420
           GTSVTRLN+PAYIALR+AFRAGASSLK APEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD
Sbjct: 402 GTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 461

Query: 421 VSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 475
           VSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 462 VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 512

BLAST of CmaCh06G010150 vs. NCBI nr
Match: gi|643716856|gb|KDP28482.1| (hypothetical protein JCGZ_14253 [Jatropha curcas])

HSP 1 Score: 683.7 bits (1763), Expect = 2.3e-193
Identity = 352/483 (72.88%), Positives = 397/483 (82.19%), Query Frame = 1

Query: 1   MEAKT-SALPFIGFLLTLLSLSTAFS---DFQTLIPKPLPASPSLFSPESDTDSESF--- 60
           ME K  +A  F  F +  LSLST  S   D+QTL+  PLP   +L  P +D+++E+    
Sbjct: 1   MEGKARNAFLFFSFTI-FLSLSTTLSSPLDYQTLVLNPLPRQTALSWPAADSEAETLQTL 60

Query: 61  --ISSEAGLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTS 120
              +      LQLHH+DALS N+TP++LF  RLQRDA RV  LSS++  +       GT 
Sbjct: 61  TDTADSTTFSLQLHHIDALSNNKTPQDLFGERLQRDAFRVEALSSVAASAVGAGGRVGT- 120

Query: 121 HGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTD 180
               GFSSSVISGLAQGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPC  CYSQ+D
Sbjct: 121 ----GFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCNKCYSQSD 180

Query: 181 PVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEFVTETLTFR 240
           PVF+P KSRS++ + C +PLC RL+SPGCN QK+TC+YQVSYGDGS+T G+F TETLTFR
Sbjct: 181 PVFDPRKSRSFAGIPCGSPLCNRLDSPGCNTQKRTCMYQVSYGDGSFTYGDFSTETLTFR 240

Query: 241 RTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKP 300
           RTKV RVA+GCGHDN+GLFVGAAGLLGLGRG LSFPSQ G  FN+KFSYCLVDRSASSKP
Sbjct: 241 RTKVRRVAIGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGAQFNRKFSYCLVDRSASSKP 300

Query: 301 SSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFKLDSNGNGG 360
           SSVVFGDSA+SRTARFTPL++NP+LDTFYYVELLGISVGGT V  I+AS FKLD  GNGG
Sbjct: 301 SSVVFGDSAISRTARFTPLISNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGG 360

Query: 361 VIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLH 420
           VIID GTSVTRL RPAY+ALRNAFR GAS+LK APEFSLFDTC+DLSGKT VKVPTV LH
Sbjct: 361 VIIDSGTSVTRLTRPAYVALRNAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVALH 420

Query: 421 FRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSP 474
           FRGADVSLPASNYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLAGSRVGF+P
Sbjct: 421 FRGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRVGFAP 477

BLAST of CmaCh06G010150 vs. NCBI nr
Match: gi|802694547|ref|XP_012083203.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas])

HSP 1 Score: 682.6 bits (1760), Expect = 5.0e-193
Identity = 352/491 (71.69%), Positives = 399/491 (81.26%), Query Frame = 1

Query: 1   MEAKT-SALPFIGFLLTLLSLSTAFS---DFQTLIPKPLPASPSLFSPESDTDSESFISS 60
           ME K  +A  F  F +  LSLST  S   D+QTL+  PLP   +L  P +D+++E+ ++ 
Sbjct: 1   MEGKARNAFLFFSFTI-FLSLSTTLSSPLDYQTLVLNPLPRQTALSWPAADSEAETLLNQ 60

Query: 61  E-------------AGLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQN 120
           +                 LQLHH+DALS N+TP++LF  RLQRDA RV  LSS++  +  
Sbjct: 61  DPCHAETLTDTADSTTFSLQLHHIDALSNNKTPQDLFGERLQRDAFRVEALSSVAASAVG 120

Query: 121 LSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPC 180
                GT     GFSSSVISGLAQGSGEYFTRIGVGTPP+Y+YMVLDTGSDIVW+QCAPC
Sbjct: 121 AGGRVGT-----GFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPC 180

Query: 181 KNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTTGEF 240
             CYSQ+DPVF+P KSRS++ + C +PLC RL+SPGCN QK+TC+YQVSYGDGS+T G+F
Sbjct: 181 NKCYSQSDPVFDPRKSRSFAGIPCGSPLCNRLDSPGCNTQKRTCMYQVSYGDGSFTYGDF 240

Query: 241 VTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLV 300
            TETLTFRRTKV RVA+GCGHDN+GLFVGAAGLLGLGRG LSFPSQ G  FN+KFSYCLV
Sbjct: 241 STETLTFRRTKVRRVAIGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGAQFNRKFSYCLV 300

Query: 301 DRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSRISASHFK 360
           DRSASSKPSSVVFGDSA+SRTARFTPL++NP+LDTFYYVELLGISVGGT V  I+AS FK
Sbjct: 301 DRSASSKPSSVVFGDSAISRTARFTPLISNPKLDTFYYVELLGISVGGTRVPGITASLFK 360

Query: 361 LDSNGNGGVIIDCGTSVTRLNRPAYIALRNAFRAGASSLKLAPEFSLFDTCYDLSGKTTV 420
           LD  GNGGVIID GTSVTRL RPAY+ALRNAFR GAS+LK APEFSLFDTC+DLSGKT V
Sbjct: 361 LDQTGNGGVIIDSGTSVTRLTRPAYVALRNAFRVGASNLKRAPEFSLFDTCFDLSGKTEV 420

Query: 421 KVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA 474
           KVPTV LHFRGADVSLPASNYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLA
Sbjct: 421 KVPTVALHFRGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 480
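
The Identity and Positives percentages in the HSP headers above are the matched positions divided by the alignment length. A minimal Python sketch for cross-checking them, assuming the header line format shown in this report; the function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool:

```python
import re

# Matches e.g. "Identity = 234/490 (47.76%), Positives = 298/490 (60.82%)".
# "Posi?tives" also accepts the "Postives" spelling found in some report exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return reported and recomputed percentages for one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_recomputed": 100.0 * int(ident) / int(ident_len),
        "positives_reported": float(pos_pct),
        "positives_recomputed": 100.0 * int(pos) / int(pos_len),
    }


# Header line of the AT1G25510.1 hit from this report.
stats = parse_hsp_stats(
    "Identity = 234/490 (47.76%), Positives = 298/490 (60.82%), Query Frame = 1"
)
```

Comparing the reported and recomputed values with a small tolerance (for example 0.01) is enough to flag a header whose counts and percentages disagree.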

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
APF2_ARATH	1.9e-179	69.12	Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
ASPG1_ARATH	9.1e-110	48.41	Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
ASPG2_ARATH	1.0e-105	45.78	Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
ASPA_ARATH	1.2e-72	38.50	Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 ...
NEP1_NEPGR	3.4e-72	39.05	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
Match Name	E-value	Identity	Description
A0A0A0L8K0_CUCSA	9.1e-234	87.68	Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_3G119540 PE=3 SV=1
A0A067JX70_JATCU	1.6e-193	72.88	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14253 PE=3 SV=1
B9SBG8_RICCO	1.5e-191	73.17	Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0717990 ...
V7BRW0_PHAVU	2.4e-189	71.90	Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_006G113700g PE=3 SV=1
B9GR19_POPTR	4.0e-189	71.19	Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0002s17270g PE=...
Match Name	E-value	Identity	Description
AT1G01300.1	1.0e-180	69.12	Eukaryotic aspartyl protease family protein
AT3G61820.1	8.9e-164	61.90	Eukaryotic aspartyl protease family protein
AT3G18490.1	5.1e-111	48.41	Eukaryotic aspartyl protease family protein
AT1G25510.1	1.5e-110	47.76	Eukaryotic aspartyl protease family protein
AT3G20015.1	5.9e-107	45.78	Eukaryotic aspartyl protease family protein
Match Name	E-value	Identity	Description
gi|659074959|ref|XP_008437888.1|	2.6e-234	88.13	PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis melo]
gi|449432044|ref|XP_004133810.1|	1.3e-233	87.68	PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus]
gi|700201288|gb|KGN56421.1|	1.3e-233	87.68	Aspartic proteinase nepenthesin-1 [Cucumis sativus]
gi|643716856|gb|KDP28482.1|	2.3e-193	72.88	hypothetical protein JCGZ_14253 [Jatropha curcas]
gi|802694547|ref|XP_012083203.1|	5.0e-193	71.69	PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001461	Aspartic_peptidase_A1
IPR001969	Aspartic_peptidase_AS
IPR021109	Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0004190	aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0080167 response to karrikin
cellular_component GO:0016020 membrane
cellular_component GO:0009505 plant-type cell wall
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh06G010150.1	CmaCh06G010150.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 445..460 score: 5.0E-6; coord: 351..362 score: 5.0E-6; coord: 138..158 score: 5.
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 50..473 score: 3.0E-246; coord: 2..26 score: 3.0E
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 147..158 scor
IPR021109	Aspartic peptidase domain	GENE3D	G3DSA:2.40.70.10		coord: 129..297 score: 1.1E-41; coord: 299..474 score: 4.6
IPR021109	Aspartic peptidase domain	unknown	SSF50630	Acid proteases	coord: 125..473 score: 6.5E
None	No IPR available	PANTHER	PTHR13683:SF308	ASPARTYL PROTEASE-RELATED	coord: 50..473 score: 3.0E-246; coord: 2..26 score: 3.0E