CmaCh01G013150 (gene) Cucurbita maxima (Rimu)

Name: CmaCh01G013150
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr01 : 9386065 .. 9387741 (-)
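
The location string above can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(location: str):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(location: str) -> int:
    """Length of the feature in bp, assuming 1-based inclusive coordinates."""
    _, start, end, _ = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    loc = "Cma_Chr01 : 9386065 .. 9387741 (-)"
    print(parse_location(loc))
    print(span_length(loc))
```

Under that assumption the feature spans 1677 bp on the minus strand, which is a multiple of three.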
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATTTTCTCGGAAAACAATCAGGCAGTACTAGAGGTTTTCAAAATTGCAGTGTGTATCTTGCATTGATTTTCCTCTTGCTTTTCTCTAGCGTATTTGATACGATTGCTGAAGCGCATGTTCGTCAAGGATTCAACGAGTCCAATCGGTCTGGTGTTTTCGGAATCGAATTGCCGGAAAATATTAGCTCTGGTATTGCTACTTCTTCGGTGAGTGCTCCGTGTAGTTTCAGTAATGAAGATGAAGAAGAGGAAGAGAGGTTAATGGCGAAGTCGGTGAAGAAATCGGTGAAGCTTCACTTGAAAAAGCGGTCAACGAGTCGAGTGACCGAACCGAAGGAATCGATTACTGAATCTGCAGTTAGGGATTTGGCGAGAATCCAGACGCTTCATAAGAGAATCACGGAGAGGAAGAATCAAGATACGACTTCTAGACTGAAGAATGGCAATGCTGAGCGGAGGAAACCGGCGGAGGCGGTTTCTCCGGCCGCTTCGCCTGATTCTTACTCCGGCTACTTCTCCGGTCAGCTTATGGCGACTCTGGAATCCGGCGTTAGTCTTGGCTCTGGTGAGTACTTCATCGACGTCTTCGTCGGTTCTCCGCCCAAACATTTCTCTCTGATTCTCGATACCGGTAGCGATTTGAACTGGATTCAATGTGTTCCTTGCCATGATTGTTTCGAGCAAACCGGGCCTTATTACGACCCTAAAGATTCAATTTCTTTCAGAAACATAACCTGTAACGATCCTCGATGTCAATTAGTTTCGTCTCCAGATCCTCCGCAGCCGTGCAAATCCGAGACGCAATCGTGCCCTTACTTCTACTGGTACGGCGACTGTTCGAACACCACCGGCGATTTCGCGCTCGAGACGTTCACCGTCAATCTGACCTCTTCGACGACGGGGAAGTCGGAGTTTCGGCGAGTGGAGAATGTGATGTTCGGATGCGGCCATTGGAACAGAGGCCTCTTCCATGGCGCCGCCGGACTTTTAGGGCTTGGCCGAGGACCTCTCTCGTTTTCATCGCAGCTTCAATCTCTCTACGGCCATTCTTTCTCCTACTGTCTTGTTGATCGAAACAGTGATACGAGCGTGAGCAGCAAGCTGATTTTCGGCGAAGACAGAGATCTATTAACTCATCCGGAACTGAAATTCACATCGCTATTCGGCGGAAAGGAAAATCCAGTCGACACATTCTACTATCTGCAAATCAAATCGATCTTCGTCGGAGGAGAGAAGCTCCAAATCCCCGAGGAGAACTGGAAGATTTCCGCCGACGGCGCTGGCGGAACAATCATCGATTCCGGCACAACTCTCAGCTATTTCTCCGATCCGGCTTACCGAATCATCAAAGAAGCATTCCTGAGGAAAGTAAAGAACTATAAACTGGTTGAAGATTTTCCGATCTTACATCCTTGCTACAACGTCTCCAGCGCTGATAAACTTGAATTTCCAGAATTCGAAATCCAGTTCGCCGATGGCGCCGTATGGAAATTCCCGGTGGAGAATTACTTCATCAGAATCGAGCAATTCGATATGGTTTGCTTGGCGATGTTAGGGACTCCAAAATCGGCTCTGTCGATCATCGGAAATTACCAGCAGCAGAATTTTCACATACTGTACGATACGAAGAACTCGAGACTGGGGTTTGCGCCGATGAGATGCGCTGACGTTTAA

mRNA sequence

ATGAATTTTCTCGGAAAACAATCAGGCAGTACTAGAGGTTTTCAAAATTGCAGTGTGTATCTTGCATTGATTTTCCTCTTGCTTTTCTCTAGCGTATTTGATACGATTGCTGAAGCGCATGTTCGTCAAGGATTCAACGAGTCCAATCGGTCTGGTGTTTTCGGAATCGAATTGCCGGAAAATATTAGCTCTGGTATTGCTACTTCTTCGGTGAGTGCTCCGTGTAGTTTCAGTAATGAAGATGAAGAAGAGGAAGAGAGGTTAATGGCGAAGTCGGTGAAGAAATCGGTGAAGCTTCACTTGAAAAAGCGGTCAACGAGTCGAGTGACCGAACCGAAGGAATCGATTACTGAATCTGCAGTTAGGGATTTGGCGAGAATCCAGACGCTTCATAAGAGAATCACGGAGAGGAAGAATCAAGATACGACTTCTAGACTGAAGAATGGCAATGCTGAGCGGAGGAAACCGGCGGAGGCGGTTTCTCCGGCCGCTTCGCCTGATTCTTACTCCGGCTACTTCTCCGGTCAGCTTATGGCGACTCTGGAATCCGGCGTTAGTCTTGGCTCTGGTGAGTACTTCATCGACGTCTTCGTCGGTTCTCCGCCCAAACATTTCTCTCTGATTCTCGATACCGGTAGCGATTTGAACTGGATTCAATGTGTTCCTTGCCATGATTGTTTCGAGCAAACCGGGCCTTATTACGACCCTAAAGATTCAATTTCTTTCAGAAACATAACCTGTAACGATCCTCGATGTCAATTAGTTTCGTCTCCAGATCCTCCGCAGCCGTGCAAATCCGAGACGCAATCGTGCCCTTACTTCTACTGGTACGGCGACTGTTCGAACACCACCGGCGATTTCGCGCTCGAGACGTTCACCGTCAATCTGACCTCTTCGACGACGGGGAAGTCGGAGTTTCGGCGAGTGGAGAATGTGATGTTCGGATGCGGCCATTGGAACAGAGGCCTCTTCCATGGCGCCGCCGGACTTTTAGGGCTTGGCCGAGGACCTCTCTCGTTTTCATCGCAGCTTCAATCTCTCTACGGCCATTCTTTCTCCTACTGTCTTGTTGATCGAAACAGTGATACGAGCGTGAGCAGCAAGCTGATTTTCGGCGAAGACAGAGATCTATTAACTCATCCGGAACTGAAATTCACATCGCTATTCGGCGGAAAGGAAAATCCAGTCGACACATTCTACTATCTGCAAATCAAATCGATCTTCGTCGGAGGAGAGAAGCTCCAAATCCCCGAGGAGAACTGGAAGATTTCCGCCGACGGCGCTGGCGGAACAATCATCGATTCCGGCACAACTCTCAGCTATTTCTCCGATCCGGCTTACCGAATCATCAAAGAAGCATTCCTGAGGAAAGTAAAGAACTATAAACTGGTTGAAGATTTTCCGATCTTACATCCTTGCTACAACGTCTCCAGCGCTGATAAACTTGAATTTCCAGAATTCGAAATCCAGTTCGCCGATGGCGCCGTATGGAAATTCCCGGTGGAGAATTACTTCATCAGAATCGAGCAATTCGATATGGTTTGCTTGGCGATGTTAGGGACTCCAAAATCGGCTCTGTCGATCATCGGAAATTACCAGCAGCAGAATTTTCACATACTGTACGATACGAAGAACTCGAGACTGGGGTTTGCGCCGATGAGATGCGCTGACGTTTAA

Coding sequence (CDS)

ATGAATTTTCTCGGAAAACAATCAGGCAGTACTAGAGGTTTTCAAAATTGCAGTGTGTATCTTGCATTGATTTTCCTCTTGCTTTTCTCTAGCGTATTTGATACGATTGCTGAAGCGCATGTTCGTCAAGGATTCAACGAGTCCAATCGGTCTGGTGTTTTCGGAATCGAATTGCCGGAAAATATTAGCTCTGGTATTGCTACTTCTTCGGTGAGTGCTCCGTGTAGTTTCAGTAATGAAGATGAAGAAGAGGAAGAGAGGTTAATGGCGAAGTCGGTGAAGAAATCGGTGAAGCTTCACTTGAAAAAGCGGTCAACGAGTCGAGTGACCGAACCGAAGGAATCGATTACTGAATCTGCAGTTAGGGATTTGGCGAGAATCCAGACGCTTCATAAGAGAATCACGGAGAGGAAGAATCAAGATACGACTTCTAGACTGAAGAATGGCAATGCTGAGCGGAGGAAACCGGCGGAGGCGGTTTCTCCGGCCGCTTCGCCTGATTCTTACTCCGGCTACTTCTCCGGTCAGCTTATGGCGACTCTGGAATCCGGCGTTAGTCTTGGCTCTGGTGAGTACTTCATCGACGTCTTCGTCGGTTCTCCGCCCAAACATTTCTCTCTGATTCTCGATACCGGTAGCGATTTGAACTGGATTCAATGTGTTCCTTGCCATGATTGTTTCGAGCAAACCGGGCCTTATTACGACCCTAAAGATTCAATTTCTTTCAGAAACATAACCTGTAACGATCCTCGATGTCAATTAGTTTCGTCTCCAGATCCTCCGCAGCCGTGCAAATCCGAGACGCAATCGTGCCCTTACTTCTACTGGTACGGCGACTGTTCGAACACCACCGGCGATTTCGCGCTCGAGACGTTCACCGTCAATCTGACCTCTTCGACGACGGGGAAGTCGGAGTTTCGGCGAGTGGAGAATGTGATGTTCGGATGCGGCCATTGGAACAGAGGCCTCTTCCATGGCGCCGCCGGACTTTTAGGGCTTGGCCGAGGACCTCTCTCGTTTTCATCGCAGCTTCAATCTCTCTACGGCCATTCTTTCTCCTACTGTCTTGTTGATCGAAACAGTGATACGAGCGTGAGCAGCAAGCTGATTTTCGGCGAAGACAGAGATCTATTAACTCATCCGGAACTGAAATTCACATCGCTATTCGGCGGAAAGGAAAATCCAGTCGACACATTCTACTATCTGCAAATCAAATCGATCTTCGTCGGAGGAGAGAAGCTCCAAATCCCCGAGGAGAACTGGAAGATTTCCGCCGACGGCGCTGGCGGAACAATCATCGATTCCGGCACAACTCTCAGCTATTTCTCCGATCCGGCTTACCGAATCATCAAAGAAGCATTCCTGAGGAAAGTAAAGAACTATAAACTGGTTGAAGATTTTCCGATCTTACATCCTTGCTACAACGTCTCCAGCGCTGATAAACTTGAATTTCCAGAATTCGAAATCCAGTTCGCCGATGGCGCCGTATGGAAATTCCCGGTGGAGAATTACTTCATCAGAATCGAGCAATTCGATATGGTTTGCTTGGCGATGTTAGGGACTCCAAAATCGGCTCTGTCGATCATCGGAAATTACCAGCAGCAGAATTTTCACATACTGTACGATACGAAGAACTCGAGACTGGGGTTTGCGCCGATGAGATGCGCTGACGTTTAA

Protein sequence

MNFLGKQSGSTRGFQNCSVYLALIFLLLFSSVFDTIAEAHVRQGFNESNRSGVFGIELPENISSGIATSSVSAPCSFSNEDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPKESITESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAEAVSPAASPDSYSGYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGFAPMRCADV
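
The relationship between the coding sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch using the standard genetic code; the `translate` helper is hypothetical and only the first and last 15 to 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAATTTTCTCGGAAAACAATCAGGCAGT"))  # first 10 codons of the CDS
    print(translate("TGCGCTGACGTTTAA"))                 # last 5 codons, incl. stop
```

The two fragments translate to `MNFLGKQSGS` and `CADV`, matching the start and end of the protein sequence listed above.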
BLAST of CmaCh01G013150 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 1.4e-72
Identity = 157/403 (38.96%), Positives = 224/403 (55.58%), Query Frame = 1

Query: 156 PAEAVSPAASPDSYSGYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDL 215
           P   V+ A  P  +S        +++ SG+S GSGEYF  + VG+P ++  ++LDTGSD+
Sbjct: 114 PGRNVTHAPRPGGFS--------SSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDI 173

Query: 216 NWIQCVPCHDCFEQTGPYYDPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFY 275
            W+QC PC  C+ Q+ P +DP+ S ++  I C+ P C+ + S      C +  ++C Y  
Sbjct: 174 VWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG----CNTRRKTCLYQV 233

Query: 276 WYGDCSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGR 335
            YGD S T GDF+ ET T              RV+ V  GCGH N GLF GAAGLLGLG+
Sbjct: 234 SYGDGSFTVGDFSTETLTFRR----------NRVKGVALGCGHDNEGLFVGAAGLLGLGK 293

Query: 336 GPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENP 395
           G LSF  Q    +   FSYCLVDR++ +  SS ++FG   +       +FT L     NP
Sbjct: 294 GKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS-VVFG---NAAVSRIARFTPLL---SNP 353

Query: 396 -VDTFYYLQIKSIFVGGEKLQ-IPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEA 455
            +DTFYY+ +  I VGG ++  +    +K+   G GG IIDSGT+++    PAY  +++A
Sbjct: 354 KLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDA 413

Query: 456 FLRKVKNYKLVEDFPILHPCYNVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDM 515
           F    K  K   DF +   C+++S+ ++++ P   + F  GA    P  NY I ++    
Sbjct: 414 FRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGK 473

Query: 516 VCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGFAPMRCA 557
            C A  GT    LSIIGN QQQ F ++YD  +SR+GFAP  CA
Sbjct: 474 FCFAFAGT-MGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmaCh01G013150 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 3.5e-68
Identity = 160/433 (36.95%), Positives = 231/433 (53.35%), Query Frame = 1

Query: 131 HKRITERKNQDTTSRLKNGNAERRKPAEAVSPAASPDSYSG---YFSGQLMATLESGVSL 190
           +K +T  + +  +SR+    A+ R   E V  +     Y+    Y +  L   + SG S 
Sbjct: 98  YKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQ 157

Query: 191 GSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDPKDSISFRNITC 250
           GSGEYF  + VG+P K   L+LDTGSD+NWIQC PC DC++Q+ P ++P  S +++++TC
Sbjct: 158 GSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTC 217

Query: 251 NDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNLTSSTTGKSEFR 310
           + P+C L+ +      C+S    C Y   YGD S T G+ A +T    +T   +GK    
Sbjct: 218 SAPQCSLLET----SACRSN--KCLYQVSYGDGSFTVGELATDT----VTFGNSGK---- 277

Query: 311 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVS- 370
            + NV  GCGH N GLF GAAGLLGLG G LS ++Q+++    SFSYCLVDR+S  S S 
Sbjct: 278 -INNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRDSGKSSSL 337

Query: 371 ---SKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQIPEENWKI 430
              S  + G D    T P L        +   +DTFYY+ +    VGGEK+ +P+  + +
Sbjct: 338 DFNSVQLGGGD---ATAPLL--------RNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDV 397

Query: 431 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKL-VEDFPILHPCYNVSSADKL 490
            A G+GG I+D GT ++     AY  +++AFL+   N K       +   CY+ SS   +
Sbjct: 398 DASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTV 457

Query: 491 EFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQNFHILYD 550
           + P     F  G     P +NY I ++     C A   T  S+LSIIGN QQQ   I YD
Sbjct: 458 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPT-SSSLSIIGNVQQQGTRITYD 500

Query: 551 TKNSRLGFAPMRC 556
              + +G +  +C
Sbjct: 518 LSKNVIGLSGNKC 500

BLAST of CmaCh01G013150 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 242.7 bits (618), Expect = 9.8e-63
Identity = 135/396 (34.09%), Positives = 200/396 (50.51%), Query Frame = 1

Query: 160 VSPAASPDSYSGYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQ 219
           +S    P S S Y      + + SG+  GSGEYF+ + VGSPP+   +++D+GSD+ W+Q
Sbjct: 99  ISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQ 158

Query: 220 CVPCHDCFEQTGPYYDPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGD 279
           C PC  C++Q+ P +DP  S S+  ++C    C  + +      C S    C Y   YGD
Sbjct: 159 CQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIEN----SGCHS--GGCRYEVMYGD 218

Query: 280 CSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLS 339
            S T G  ALET T   T           V NV  GCGH NRG+F GAAGLLG+G G +S
Sbjct: 219 GSYTKGTLALETLTFAKTV----------VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMS 278

Query: 340 FSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTF 399
           F  QL    G +F YCLV R +D++ S  L+FG +          +  L      P  +F
Sbjct: 279 FVGQLSGQTGGAFGYCLVSRGTDSTGS--LVFGRE---ALPVGASWVPLVRNPRAP--SF 338

Query: 400 YYLQIKSIFVGGEKLQIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK 459
           YY+ +K + VGG ++ +P+  + ++  G GG ++D+GT ++     AY   ++ F  +  
Sbjct: 339 YYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTA 398

Query: 460 NYKLVEDFPILHPCYNVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAML 519
           N        I   CY++S    +  P     F +G V   P  N+ + ++     C A  
Sbjct: 399 NLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFA 458

Query: 520 GTPKSALSIIGNYQQQNFHILYDTKNSRLGFAPMRC 556
            +P + LSIIGN QQ+   + +D  N  +GF P  C
Sbjct: 459 ASP-TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmaCh01G013150 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 240.4 bits (612), Expect = 4.9e-62
Identity = 147/383 (38.38%), Positives = 199/383 (51.96%), Query Frame = 1

Query: 176 QLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYD 235
           Q  + +E+ V  G GEY ++V +G+P   FS I+DTGSDL W QC PC  CF Q  P ++
Sbjct: 80  QSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFN 139

Query: 236 PKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVN 295
           P+DS SF  + C    CQ +    P + C +    C Y Y YGD S T G  A ETFT  
Sbjct: 140 PQDSSSFSTLPCESQYCQDL----PSETCNN--NECQYTYGYGDGSTTQGYMATETFTFE 199

Query: 296 LTSSTTGKSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSY 355
            +S          V N+ FGCG  N+G   G  AGL+G+G GPLS  SQL       FSY
Sbjct: 200 TSS----------VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV---GQFSY 259

Query: 356 CLVDRNSDTSVSSKLIFGEDRDLLTHPE-LKFTSLFGGKENPVDTFYYLQIKSIFVGGEK 415
           C+    S  S  S L  G     +  PE    T+L     NP  T+YY+ ++ I VGG+ 
Sbjct: 260 CMTSYGS--SSPSTLALGSAASGV--PEGSPSTTLIHSSLNP--TYYYITLQGITVGGDN 319

Query: 416 LQIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPC 475
           L IP   +++  DG GG IIDSGTTL+Y    AY  + +AF  ++    + E    L  C
Sbjct: 320 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTC 379

Query: 476 Y-NVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNY 535
           +   S    ++ PE  +QF DG V     +N  I   +  ++CLAM  + +  +SI GN 
Sbjct: 380 FQQPSDGSTVQVPEISMQF-DGGVLNLGEQNILISPAE-GVICLAMGSSSQLGISIFGNI 435

Query: 536 QQQNFHILYDTKNSRLGFAPMRC 556
           QQQ   +LYD +N  + F P +C
Sbjct: 440 QQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmaCh01G013150 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 238.4 bits (607), Expect = 1.9e-61
Identity = 141/378 (37.30%), Positives = 199/378 (52.65%), Query Frame = 1

Query: 181 LESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDPKDSI 240
           +E+ V  G GEY +++ +G+P + FS I+DTGSDL W QC PC  CF Q+ P ++P+ S 
Sbjct: 84  VETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSS 143

Query: 241 SFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNLTSST 300
           SF  + C+   CQ +SSP            C Y Y YGD S T G    ET T    S  
Sbjct: 144 SFSTLPCSSQLCQALSSP------TCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVS-- 203

Query: 301 TGKSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR 360
                   + N+ FGCG  N+G   G  AGL+G+GRGPLS  SQL       FSYC+   
Sbjct: 204 --------IPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPI 263

Query: 361 NSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQIPEE 420
            S T   S L+ G   + +T      T +   + + + TFYY+ +  + VG  +L I   
Sbjct: 264 GSST--PSNLLLGSLANSVTAGSPNTTLI---QSSQIPTFYYITLNGLSVGSTRLPIDPS 323

Query: 421 NWKI-SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNV-S 480
            + + S +G GG IIDSGTTL+YF + AY+ +++ F+ ++    +         C+   S
Sbjct: 324 AFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPS 383

Query: 481 SADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQNF 540
               L+ P F + F DG   + P ENYFI      ++CLAM G+    +SI GN QQQN 
Sbjct: 384 DPSNLQIPTFVMHF-DGGDLELPSENYFISPSN-GLICLAM-GSSSQGMSIFGNIQQQNM 434

Query: 541 HILYDTKNSRLGFAPMRC 556
            ++YDT NS + FA  +C
Sbjct: 444 LVVYDTGNSVVSFASAQC 434

BLAST of CmaCh01G013150 vs. TrEMBL
Match: A0A0A0L2W1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G608130 PE=3 SV=1)

HSP 1 Score: 997.7 bits (2578), Expect = 5.8e-288
Identity = 491/562 (87.37%), Positives = 518/562 (92.17%), Query Frame = 1

Query: 1   MNFLGKQSGSTRGFQNCSVYLALIFLLLFSSVFDTI--AEAHVRQGFNESNRSGVFGIEL 60
           M+FLG Q+GS+RGFQN  ++L LIFLLLFS VF T+   EAH+ QGF++SNRSGVFGIEL
Sbjct: 1   MDFLGNQAGSSRGFQNWKLFLTLIFLLLFSGVFHTVFFVEAHIPQGFHKSNRSGVFGIEL 60

Query: 61  PENISSGIATSSVSAPCSFSNEDEE-EEERLMAKSVKKSVKLHLKKRSTSRVTEPKESIT 120
           PEN+SSGIA+SS SAPCSF NE EE E E LMA SVK+SVKLHLKKRST+   +PKESIT
Sbjct: 61  PENLSSGIASSSASAPCSFGNEGEEGERESLMADSVKQSVKLHLKKRSTNTANKPKESIT 120

Query: 121 ESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAEAVS-PAASPDSYSGYFSGQ 180
           ESAVRDLARIQTLH RITERKNQDTTSRLK  N ER+KP E VS PA SP+SY+ YFSGQ
Sbjct: 121 ESAVRDLARIQTLHTRITERKNQDTTSRLKKSNVERKKPMEEVSSPAESPESYADYFSGQ 180

Query: 181 LMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDP 240
           LMATLESGVSLGSGEYFIDVF+GSPPKHFSLILDTGSDLNWIQCVPC DCFEQ GPYYDP
Sbjct: 181 LMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDP 240

Query: 241 KDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNL 300
           KDSISFRNITCNDPRCQLVSSPDPP+PCK ETQSCPYFYWYGD SNTTGDFALETFTVNL
Sbjct: 241 KDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNL 300

Query: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360
           TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360

Query: 361 VDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQI 420
           VDR+SDTSVSSKLIFGED+DLLTHPEL FTSL  GKENPVDTFYYLQIKSIFVGGEKLQI
Sbjct: 361 VDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQI 420

Query: 421 PEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNV 480
           PEENW +SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK YKLVEDFPILHPCYNV
Sbjct: 421 PEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNV 480

Query: 481 SSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQN 540
           S  D+L FPEF IQFADGAVW FPVENYFIRI+Q D+VCLAMLGTPKSALSIIGNYQQQN
Sbjct: 481 SGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQN 540

Query: 541 FHILYDTKNSRLGFAPMRCADV 559
           FHILYDTKNSRLG+APMRCA+V
Sbjct: 541 FHILYDTKNSRLGYAPMRCAEV 562

BLAST of CmaCh01G013150 vs. TrEMBL
Match: V4TKP8_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031104mg PE=3 SV=1)

HSP 1 Score: 797.3 bits (2058), Expect = 1.2e-227
Identity = 389/564 (68.97%), Positives = 461/564 (81.74%), Query Frame = 1

Query: 19  VYLALIFLLLFSSVFDTIAEAHVRQGFNE--SNRSGVFGIELPENISSGIATSSVSAPCS 78
           V L L+ L + +  FD +A AH  +  N   SN S + GI+LP+++S    +SS ++ CS
Sbjct: 5   VSLVLVLLSISAGSFDAVARAHDHRRTNSFNSNTSSLAGIKLPDHMSFNAVSSSTNSGCS 64

Query: 79  FSN---------------------EDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPKES 138
           FS                      +D++ ++ L  K  K+ VKLHLK RS +R TEPK+S
Sbjct: 65  FSKSNKPTHPERIDTQEKDGDVALDDDDGDDLLTLKLSKQKVKLHLKHRSKNRETEPKKS 124

Query: 139 ITESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAE-AVSPAASPDSYSGYFS 198
           ++ES +RDL RIQ LH+RI E+KNQ+T SRLK  + + +K  +  V+PAASP+SY+   S
Sbjct: 125 VSESTIRDLTRIQALHRRIIEKKNQNTVSRLKKESQKSKKQIKPVVTPAASPESYASGVS 184

Query: 199 GQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYY 258
           GQL+ATLESGVSLG+GEYF+DVFVG+PPKH+  ILDTGSDLNWIQCVPC+DCFEQ GP+Y
Sbjct: 185 GQLVATLESGVSLGAGEYFMDVFVGTPPKHYYFILDTGSDLNWIQCVPCYDCFEQNGPHY 244

Query: 259 DPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTV 318
           DPKDS SF+NI+C+DPRC LVSSPDPP+PC++E Q+CPYFYWYGD SNTTGDFALETFTV
Sbjct: 245 DPKDSSSFKNISCHDPRCHLVSSPDPPRPCQAENQTCPYFYWYGDSSNTTGDFALETFTV 304

Query: 319 NLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 378
           NL S+ TGKSEFR+VENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY
Sbjct: 305 NL-STPTGKSEFRQVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 364

Query: 379 CLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKL 438
           CLVDRNSDT+VSSKLIFGED+DLL HP L FTSL  GKENPVDTFYYLQIKSI VGGE L
Sbjct: 365 CLVDRNSDTNVSSKLIFGEDKDLLNHPNLNFTSLVSGKENPVDTFYYLQIKSIIVGGEVL 424

Query: 439 QIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCY 498
            IP+E W++S +GAGGTIIDSGTTLSYF++PAY+IIK+AF++KVK Y LV+DFPIL PCY
Sbjct: 425 SIPDETWRLSPEGAGGTIIDSGTTLSYFAEPAYQIIKQAFMKKVKGYPLVKDFPILDPCY 484

Query: 499 NVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQ 558
           NVS  +K+E PEF IQFADG VW FPVENYFIR++  D+VCLA+LGTP+SALSIIGNYQQ
Sbjct: 485 NVSGIEKMELPEFGIQFADGGVWNFPVENYFIRLDPEDVVCLAILGTPRSALSIIGNYQQ 544

BLAST of CmaCh01G013150 vs. TrEMBL
Match: B9H837_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s22630g PE=3 SV=2)

HSP 1 Score: 774.6 bits (1999), Expect = 8.0e-221
Identity = 380/562 (67.62%), Positives = 447/562 (79.54%), Query Frame = 1

Query: 20  YLALIFLLLFSSVFDTIAEAHVRQGFNESNRSGVFGIELPENIS-SGIATSSVSAPCSFS 79
           ++ L+  LLFS  F+ IA  H      +SN S + GIELP+++S + +++S+ +  C+  
Sbjct: 6   FIVLVLFLLFSGAFEAIAGIHDHGKNVKSNISTLAGIELPDHMSFNAVSSSTTNTGCNLD 65

Query: 80  ---------------------NEDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPKESIT 139
                                ++D+++E     K  K++VKLHLK RS  R +E KES  
Sbjct: 66  TSKKVKQSQTIVSKEDFDLLEDDDDDDEGGEEEKEAKQTVKLHLKHRSKDRKSEGKESFV 125

Query: 140 ESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAEAV-SPAASPDSYSGYFSGQ 199
           ES  RDLARIQTLH RI E+KNQ+  SRLK       K  + V + AASP+SY    SGQ
Sbjct: 126 ESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKERPEKQIKTVVATAASPESYGTGLSGQ 185

Query: 200 LMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDP 259
           LMATLESGV+LGSGEYF+DVF+G+PPKH+SLILDTGSDLNWIQCVPCHDCFEQ GPYYDP
Sbjct: 186 LMATLESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDP 245

Query: 260 KDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNL 319
           K+S SFRNI C+DPRC LVSSPDPP PCK+E Q+CPYFYWYGD SNTTGDFA ETFTVNL
Sbjct: 246 KESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNL 305

Query: 320 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 379
           TS T GKSEF+RVENVMFGCGHWNRGLFHGA+GLLGLGRGPLSFSSQLQSLYGHSFSYCL
Sbjct: 306 TSPT-GKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCL 365

Query: 380 VDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQI 439
           VDRNSDT+VSSKLIFGED+DLL HPEL FT+L GGKENPVDTFYY+QIKSI VGGE L I
Sbjct: 366 VDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNI 425

Query: 440 PEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNV 499
           PE  W +++DG GGTI+DSGTTLSYF++PAY+IIK+AF +KVK Y +V+DFPIL PCYNV
Sbjct: 426 PESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFAKKVKGYPIVQDFPILDPCYNV 485

Query: 500 SSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQN 559
           S  +K++ P+F I FADGAVW FPVENYFIR++  ++VCLA+LGTP+SALSIIGNYQQQN
Sbjct: 486 SGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQN 545

BLAST of CmaCh01G013150 vs. TrEMBL
Match: A0A067JSN0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22200 PE=3 SV=1)

HSP 1 Score: 766.1 bits (1977), Expect = 2.8e-218
Identity = 383/557 (68.76%), Positives = 450/557 (80.79%), Query Frame = 1

Query: 21  LALIFLLLFSSVFDTIAEAHVRQGFNESNRSGVFGIELPENIS-SGIATSSVSAPCSFSN 80
           + L+ LLL +  F  + +   ++  N+ N S + GIELPE++S + +++S++   CS S 
Sbjct: 7   MILVLLLLIAGAFGGLCDQ--KKNVNQ-NISTLAGIELPEHMSFNAVSSSTIKTDCSLST 66

Query: 81  ----------------EDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPK-ESITESAVR 140
                           E+EEEE     +  KK+VK+ LK RST++ +E K ES   S  R
Sbjct: 67  SKKDQKPQSQRISVQEEEEEEEGGDGEEDAKKTVKVELKHRSTNKESEAKKESFITSTTR 126

Query: 141 DLARIQTLHKRITERKNQDTTSRLKNGNAERRKP-AEAVSPAASPDSYSGYFSGQLMATL 200
           DL RIQTLHKRI E+KNQ+  SRL   N +R +P   +  PAASP+SY    SG+LMATL
Sbjct: 127 DLTRIQTLHKRIIEKKNQNAISRL---NKDRNQPKVGSEPPAASPESYPAELSGKLMATL 186

Query: 201 ESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDPKDSIS 260
           ESGVSLGSGEYFIDVF+G+PPKHFSLILDTGSDLNWIQCVPC DCFEQ GPYYDPKDSIS
Sbjct: 187 ESGVSLGSGEYFIDVFIGTPPKHFSLILDTGSDLNWIQCVPCVDCFEQNGPYYDPKDSIS 246

Query: 261 FRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNLTSSTT 320
           F+NI+C+DPRC+LVSSPDPPQPCK++ Q+CPYFYWYGD SNTTGDFALETFTVNLTS+  
Sbjct: 247 FKNISCHDPRCRLVSSPDPPQPCKAQNQTCPYFYWYGDSSNTTGDFALETFTVNLTST-- 306

Query: 321 GKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS 380
              EFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS
Sbjct: 307 ---EFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS 366

Query: 381 DTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQIPEENW 440
           DT+VSSKLIFGED+DLL HP+L FTSL GGK NPVDTFYY+QIKSI VGGE L IPE  W
Sbjct: 367 DTNVSSKLIFGEDKDLLNHPKLNFTSLVGGKGNPVDTFYYVQIKSIIVGGEVLNIPENTW 426

Query: 441 KISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNVSSADK 500
            +S+DGAGGTI+DSGTTLSYF++PAY++IK+AF++KVK Y +++DFPIL PCYNVS  DK
Sbjct: 427 HLSSDGAGGTIVDSGTTLSYFAEPAYQMIKDAFVKKVKGYPVIKDFPILDPCYNVSGVDK 486

Query: 501 LEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQNFHILY 559
           +E PEF I F DGA+W FPVENYFIR+E  ++VCLA+LGTP+SALSIIGNYQQQNFHILY
Sbjct: 487 IELPEFTILFEDGAMWNFPVENYFIRLEPENVVCLAILGTPQSALSIIGNYQQQNFHILY 546

BLAST of CmaCh01G013150 vs. TrEMBL
Match: A0A061DT42_THECC (Eukaryotic aspartyl protease family protein isoform 1 OS=Theobroma cacao GN=TCM_001987 PE=3 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 4.1e-217
Identity = 381/579 (65.80%), Positives = 453/579 (78.24%), Query Frame = 1

Query: 3   FLGKQSGSTRGFQNCSVYLALIFLLLFSSVFDTIAEAH---VRQGFNESNRSGVFGIELP 62
           F+ KQ   +      S+ L L+ L + S  F+ IA  H   ++ G   SN S + GIELP
Sbjct: 24  FVTKQEKISAMLVKVSLSLLLLLLSISSGAFEAIARVHHDHIKNG--NSNISTLTGIELP 83

Query: 63  ENISSGIATSSVS-APCSFSNE------------------DEEEEERLMAKSVKKSVKLH 122
           +++S    +SS S + CS S +                  D+E+EE    K  KKSVKLH
Sbjct: 84  DHMSFNAVSSSTSNSGCSLSKQKKAKPSQKIASQEVSSYLDDEDEEDEQQKP-KKSVKLH 143

Query: 123 LKKRSTSRVTEPKESITESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAE-A 182
           LK R      EPK S+ ES +RDL RI+T H R+ E+KNQ+  SRL N   + ++  +  
Sbjct: 144 LKHRQIDGKAEPKNSVLESTMRDLTRIRTFHTRVIEKKNQNVISRLNNDRKQSKQHLKPV 203

Query: 183 VSPAASPDSYSGYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQ 242
           V  AA+P+ Y+    GQL+ATLESGVSLGSGEYFIDVFVG+PPKHFSLILDTGSDLNWIQ
Sbjct: 204 VEKAAAPEPYTSGVPGQLVATLESGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQ 263

Query: 243 CVPCHDCFEQTGPYYDPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGD 302
           CVPC+DCFEQ GP+YDP++S SFRNI+C+DPRCQLVSSPDPPQPCK+E Q+CPY+YWYGD
Sbjct: 264 CVPCYDCFEQNGPHYDPRESSSFRNISCHDPRCQLVSSPDPPQPCKAENQTCPYYYWYGD 323

Query: 303 CSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLS 362
            SNTTGDFA+ETFTVNLTS + GKSEFR+VENVMFGCGHWNRGLFHGAAGLLGLGRGPLS
Sbjct: 324 SSNTTGDFAVETFTVNLTSPS-GKSEFRQVENVMFGCGHWNRGLFHGAAGLLGLGRGPLS 383

Query: 363 FSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTF 422
           F+SQLQSLYGHSFSYCLVDRNSD +VSSKLIFGED+DLL+HP L FT+L  GKEN VDTF
Sbjct: 384 FASQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPNLNFTALVAGKENSVDTF 443

Query: 423 YYLQIKSIFVGGEKLQIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK 482
           YY+QIKS+ VGGE L IPEE W++SADGAGGTIIDSGTTLSYF+DP Y+IIK+AF++K K
Sbjct: 444 YYVQIKSVIVGGEVLNIPEETWQLSADGAGGTIIDSGTTLSYFADPTYQIIKDAFVKKTK 503

Query: 483 NYKLVEDFPILHPCYNVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAML 542
            Y +++DFP+L PCYNVS  + +E P+F IQF DGAVW FPVENYFI +E+ D+VCLA+L
Sbjct: 504 GYPVLKDFPVLDPCYNVSGVENVELPDFGIQFVDGAVWNFPVENYFIWLEE-DVVCLAIL 563

Query: 543 GTPKSALSIIGNYQQQNFHILYDTKNSRLGFAPMRCADV 559
           GTP+SALSIIGNYQQQNFHILYDTK SRLG+APM+CADV
Sbjct: 564 GTPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 597

BLAST of CmaCh01G013150 vs. TAIR10
Match: AT3G59080.1 (AT3G59080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 684.5 bits (1765), Expect = 5.5e-197
Identity = 351/548 (64.05%), Positives = 409/548 (74.64%), Query Frame = 1

Query: 14  FQNCSVYLALIFLLLFSSVFDTIAEAHVRQGFNES-NRSGVFGIELPENISSGIATSSVS 73
           F   S  L LIF  + +   D+ A A    G NE  N SG  GI+ P  +  G A+SS S
Sbjct: 2   FSKYSFILCLIFFFVTAFSGDSRALA----GNNEQKNISGFSGIDFPNPMRFGSASSSTS 61

Query: 74  APCSFSNEDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPK-ESITESAVRDLARIQTLH 133
             C FS+ ++E  +    ++  K+VK HLK+R T+   +    S+ E  +RDL RIQTLH
Sbjct: 62  NDCGFSSPEKEPTKERTGEN--KTVKFHLKRRETTTTEKATTNSVLELQIRDLTRIQTLH 121

Query: 134 KRITERKNQDTTSRLKNGNAERRKPAEAVSPAASPDSYSGYFSGQLMATLESGVSLGSGE 193
           KR+ E+ NQ+T S+ +  N    K     +P AS        +GQL+ATLESG++LGSGE
Sbjct: 122 KRVLEKNNQNTVSQKQKKND---KEVVTTTPVASSVEEQ---AGQLVATLESGMTLGSGE 181

Query: 194 YFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDPKDSISFRNITCNDPR 253
           YF+DV VGSPPKHFSLILDTGSDLNWIQC+PC+DCF+Q G +YDPK S S++NITCND R
Sbjct: 182 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQR 241

Query: 254 CQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 313
           C LVSSPDPP PCKS+ QSCPY+YWYGD SNTTGDFA+ETFTVNLT++  G SE   VEN
Sbjct: 242 CNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNG-GSSELYNVEN 301

Query: 314 VMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIF 373
           +MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDT+VSSKLIF
Sbjct: 302 MMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIF 361

Query: 374 GEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQIPEENWKISADGAGGT 433
           GED+DLL+HP L FTS   GKEN VDTFYY+QIKSI V GE L IPEE W IS+DGAGGT
Sbjct: 362 GEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGT 421

Query: 434 IIDSGTTLSYFSDPAYRIIKEAFLRKVK-NYKLVEDFPILHPCYNVSSADKLEFPEFEIQ 493
           IIDSGTTLSYF++PAY  IK     K K  Y +  DFPIL PC+NVS    ++ PE  I 
Sbjct: 422 IIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIA 481

Query: 494 FADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGF 553
           FADGAVW FP EN FI + + D+VCLAMLGTPKSA SIIGNYQQQNFHILYDTK SRLG+
Sbjct: 482 FADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGY 535

Query: 554 APMRCADV 559
           AP +CAD+
Sbjct: 542 APTKCADI 535

BLAST of CmaCh01G013150 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 630.2 bits (1624), Expect = 1.2e-180
Identity = 321/539 (59.55%), Positives = 391/539 (72.54%), Query Frame = 1

Query: 23  LIFLLLFSSVFDTIAEAHVRQGFNESNRSGVFGIELPENISSGIATSSVSAPCSFSNEDE 82
           L+ L+LFS V     +     G +E   S +      + +    A+SS S  C FS+++ 
Sbjct: 9   LLGLILFS-VSPFSGDCRTLSGKHEHYSSSLNMFNSQDTMRFSSASSSTSNDCGFSSKEH 68

Query: 83  EEEERLMAKSVKKSVKLHLKKRSTSRVTEPKESITESAVRDLARIQTLHKRITERKNQDT 142
           +  +    +SVK   ++   K+ T R T    S+ +  ++DL RI+TLH R  + K Q  
Sbjct: 69  DPSKEHTRESVKPQSRI---KQETKRTTH---SVVDLQIQDLTRIKTLHARFNKSKKQ-- 128

Query: 143 TSRLKNGNAERRKPAEAVSPAASPDSYSGYFSGQLMATLESGVSLGSGEYFIDVFVGSPP 202
               KN    R+K    +S   +P+       G+L+ATLESG++LGSGEYF+DV VG+PP
Sbjct: 129 ----KNEKV-RKKITSDISLVGAPE----VSPGKLIATLESGMTLGSGEYFMDVLVGTPP 188

Query: 203 KHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDPKDSISFRNITCNDPRCQLVSSPDPPQ 262
           KHFSLILDTGSDLNW+QC+PC+DCF Q G +YDPK S SF+NITCNDPRC L+SSPDPP 
Sbjct: 189 KHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPRCSLISSPDPPV 248

Query: 263 PCKSETQSCPYFYWYGDCSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRG 322
            C+S+ QSCPYFYWYGD SNTTGDFA+ETFTVNLT++  G SE++ V N+MFGCGHWNRG
Sbjct: 249 QCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK-VGNMMFGCGHWNRG 308

Query: 323 LFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPE 382
           LF GA+GLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS+T+VSSKLIFGED+DLL H  
Sbjct: 309 LFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTN 368

Query: 383 LKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQIPEENWKISADGAGGTIIDSGTTLSYF 442
           L FTS   GKEN V+TFYY+QIKSI VGG+ L IPEE W IS+DG GGTIIDSGTTLSYF
Sbjct: 369 LNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYF 428

Query: 443 SDPAYRIIKEAFLRKVK-NYKLVEDFPILHPCYNVSSADK--LEFPEFEIQFADGAVWKF 502
           ++PAY IIK  F  K+K NY +  DFP+L PC+NVS  ++  +  PE  I F DG VW F
Sbjct: 429 AEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNF 488

Query: 503 PVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGFAPMRCADV 559
           P EN FI + + D+VCLA+LGTPKS  SIIGNYQQQNFHILYDTK SRLGF P +CAD+
Sbjct: 489 PAENSFIWLSE-DLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCADI 527

BLAST of CmaCh01G013150 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 278.9 bits (712), Expect = 7.0e-75
Identity = 187/543 (34.44%), Positives = 274/543 (50.46%), Query Frame = 1

Query: 16  NCSVYLALIFLLLFSSVFDTIAEAHVRQGFNESNRSGVFGIELPENISSGIATSSVSAPC 75
           N S +  + FL   SSVF  I          E++ +    + + ++I     TSS     
Sbjct: 4   NYSFFFFIFFLTSHSSVFSRILP--------ETSTTTTSILNVADSIHRTKYTSSFRL-- 63

Query: 76  SFSNEDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPKE--SITESAV-RDLARIQTLHK 135
              N+ EE+       S   S  L L  R + R TE  +  S+T + + RD AR+++L  
Sbjct: 64  ---NQQEEQTH-----SASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLIT 123

Query: 136 RITERKNQDTTSRLKNGNAERRKPAEAVSPAASPDSYSGYFSGQLMATLESGVSLGSGEY 195
           R+    N  + + LK        P   +      D         + A L SG + GSGEY
Sbjct: 124 RLDLAINNISKADLK--------PISTMYTTEEQD---------IEAPLISGTTQGSGEY 183

Query: 196 FIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDPKDSISFRNITCNDPRC 255
           F  V +G P +   ++LDTGSD+NW+QC PC DC+ QT P ++P  S S+  ++C+ P+C
Sbjct: 184 FTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQC 243

Query: 256 QLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNLTSSTTGKSEFRRVENV 315
             +   +    C++ T  C Y   YGD S T GDFA ET T+  T           V+NV
Sbjct: 244 NALEVSE----CRNAT--CLYEVSYGDGSYTVGDFATETLTIGSTL----------VQNV 303

Query: 316 MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFG 375
             GCGH N GLF GAAGLLGLG G L+  SQL +    SFSYCLVDR+SD++ +      
Sbjct: 304 AVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFGTS 363

Query: 376 EDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQIPEENWKISADGAGGTI 435
              D +  P L+         + +DTFYYL +  I VGGE LQIP+ ++++   G+GG I
Sbjct: 364 LSPDAVVAPLLR--------NHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 423

Query: 436 IDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNVSSADKLEFPEFEIQFA 495
           IDSGT ++      Y  ++++F++   + +      +   CYN+S+   +E P     F 
Sbjct: 424 IDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFP 483

Query: 496 DGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGFAP 555
            G +   P +NY I ++     CLA   T  S+L+IIGN QQQ   + +D  NS +GF+ 
Sbjct: 484 GGKMLALPAKNYMIPVDSVGTFCLAFAPT-ASSLAIIGNVQQQGTRVTFDLANSLIGFSS 483

BLAST of CmaCh01G013150 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 275.4 bits (703), Expect = 7.7e-74
Identity = 157/403 (38.96%), Positives = 224/403 (55.58%), Query Frame = 1

Query: 156 PAEAVSPAASPDSYSGYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDL 215
           P   V+ A  P  +S        +++ SG+S GSGEYF  + VG+P ++  ++LDTGSD+
Sbjct: 114 PGRNVTHAPRPGGFS--------SSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDI 173

Query: 216 NWIQCVPCHDCFEQTGPYYDPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFY 275
            W+QC PC  C+ Q+ P +DP+ S ++  I C+ P C+ + S      C +  ++C Y  
Sbjct: 174 VWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG----CNTRRKTCLYQV 233

Query: 276 WYGDCSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGR 335
            YGD S T GDF+ ET T              RV+ V  GCGH N GLF GAAGLLGLG+
Sbjct: 234 SYGDGSFTVGDFSTETLTFRR----------NRVKGVALGCGHDNEGLFVGAAGLLGLGK 293

Query: 336 GPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENP 395
           G LSF  Q    +   FSYCLVDR++ +  SS ++FG   +       +FT L     NP
Sbjct: 294 GKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS-VVFG---NAAVSRIARFTPLL---SNP 353

Query: 396 -VDTFYYLQIKSIFVGGEKLQ-IPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEA 455
            +DTFYY+ +  I VGG ++  +    +K+   G GG IIDSGT+++    PAY  +++A
Sbjct: 354 KLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDA 413

Query: 456 FLRKVKNYKLVEDFPILHPCYNVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDM 515
           F    K  K   DF +   C+++S+ ++++ P   + F  GA    P  NY I ++    
Sbjct: 414 FRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGK 473

Query: 516 VCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGFAPMRCA 557
            C A  GT    LSIIGN QQQ F ++YD  +SR+GFAP  CA
Sbjct: 474 FCFAFAGT-MGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmaCh01G013150 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 268.9 bits (686), Expect = 7.2e-72
Identity = 159/386 (41.19%), Positives = 216/386 (55.96%), Query Frame = 1

Query: 183 SGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQT-GPYYDPKDSIS 242
           SG + GSG+YF+D+ +G PP+   LI DTGSDL W++C  C +C   +    + P+ S +
Sbjct: 75  SGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSST 134

Query: 243 FRNITCNDPRCQLVSSPDPPQPCKSET--QSCPYFYWYGDCSNTTGDFALETFTVNLTSS 302
           F    C DP C+LV  PD    C       +C Y Y Y D S T+G FA ET     TS 
Sbjct: 135 FSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARET-----TSL 194

Query: 303 TTGKSEFRRVENVMFGCGHWNRGL------FHGAAGLLGLGRGPLSFSSQLQSLYGHSFS 362
            T   +  R+++V FGCG    G       F+GA G++GLGRGP+SF+SQL   +G+ FS
Sbjct: 195 KTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFS 254

Query: 363 YCLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEK 422
           YCL+D       +S LI G   D ++  +L FT L     +P  TFYY+++KS+FV G K
Sbjct: 255 YCLMDYTLSPPPTSYLIIGNGGDGIS--KLFFTPLLTNPLSP--TFYYVKLKSVFVNGAK 314

Query: 423 LQIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPC 482
           L+I    W+I   G GGT++DSGTTL++ ++PAYR +  A  R+VK        P    C
Sbjct: 315 LRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLC 374

Query: 483 YNVSSADKLE--FPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGT-PKSALSIIG 542
            NVS   K E   P  + +F+ GAV+  P  NYFI  E+  + CLA+    PK   S+IG
Sbjct: 375 VNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEE-QIQCLAIQSVDPKVGFSVIG 434

Query: 543 NYQQQNFHILYDTKNSRLGFAPMRCA 557
           N  QQ F   +D   SRLGF+   CA
Sbjct: 435 NLMQQGFLFEFDRDRSRLGFSRRGCA 450

BLAST of CmaCh01G013150 vs. NCBI nr
Match: gi|778695350|ref|XP_011653979.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 997.7 bits (2578), Expect = 8.3e-288
Identity = 491/562 (87.37%), Positives = 518/562 (92.17%), Query Frame = 1

Query: 1   MNFLGKQSGSTRGFQNCSVYLALIFLLLFSSVFDTI--AEAHVRQGFNESNRSGVFGIEL 60
           M+FLG Q+GS+RGFQN  ++L LIFLLLFS VF T+   EAH+ QGF++SNRSGVFGIEL
Sbjct: 1   MDFLGNQAGSSRGFQNWKLFLTLIFLLLFSGVFHTVFFVEAHIPQGFHKSNRSGVFGIEL 60

Query: 61  PENISSGIATSSVSAPCSFSNEDEE-EEERLMAKSVKKSVKLHLKKRSTSRVTEPKESIT 120
           PEN+SSGIA+SS SAPCSF NE EE E E LMA SVK+SVKLHLKKRST+   +PKESIT
Sbjct: 61  PENLSSGIASSSASAPCSFGNEGEEGERESLMADSVKQSVKLHLKKRSTNTANKPKESIT 120

Query: 121 ESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAEAVS-PAASPDSYSGYFSGQ 180
           ESAVRDLARIQTLH RITERKNQDTTSRLK  N ER+KP E VS PA SP+SY+ YFSGQ
Sbjct: 121 ESAVRDLARIQTLHTRITERKNQDTTSRLKKSNVERKKPMEEVSSPAESPESYADYFSGQ 180

Query: 181 LMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDP 240
           LMATLESGVSLGSGEYFIDVF+GSPPKHFSLILDTGSDLNWIQCVPC DCFEQ GPYYDP
Sbjct: 181 LMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDP 240

Query: 241 KDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNL 300
           KDSISFRNITCNDPRCQLVSSPDPP+PCK ETQSCPYFYWYGD SNTTGDFALETFTVNL
Sbjct: 241 KDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNL 300

Query: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360
           TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360

Query: 361 VDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQI 420
           VDR+SDTSVSSKLIFGED+DLLTHPEL FTSL  GKENPVDTFYYLQIKSIFVGGEKLQI
Sbjct: 361 VDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQI 420

Query: 421 PEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNV 480
           PEENW +SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK YKLVEDFPILHPCYNV
Sbjct: 421 PEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNV 480

Query: 481 SSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQN 540
           S  D+L FPEF IQFADGAVW FPVENYFIRI+Q D+VCLAMLGTPKSALSIIGNYQQQN
Sbjct: 481 SGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQN 540

Query: 541 FHILYDTKNSRLGFAPMRCADV 559
           FHILYDTKNSRLG+APMRCA+V
Sbjct: 541 FHILYDTKNSRLGYAPMRCAEV 562

BLAST of CmaCh01G013150 vs. NCBI nr
Match: gi|659130979|ref|XP_008465452.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo])

HSP 1 Score: 993.4 bits (2567), Expect = 1.6e-286
Identity = 489/562 (87.01%), Positives = 518/562 (92.17%), Query Frame = 1

Query: 1   MNFLGKQSGSTRGFQNCSVYLALIFLLLFSSVFDTIA--EAHVRQGFNESNRSGVFGIEL 60
           M+FLG  + S+ GFQ+C ++L LIFLLLF+SVFDT+   EAH+ QGF++SNRS VFGIEL
Sbjct: 1   MDFLGIPARSSIGFQDCKLFLTLIFLLLFASVFDTVVVVEAHIPQGFHKSNRSSVFGIEL 60

Query: 61  PENISSGIATSSVSAPCSFSNEDEE-EEERLMAKSVKKSVKLHLKKRSTSRVTEPKESIT 120
           PEN+SSGIA+SS SAPCSF NE EE E E LMA SVK+SVKLHLKKRST+   EP+ESIT
Sbjct: 61  PENLSSGIASSSASAPCSFGNEGEEGETESLMADSVKQSVKLHLKKRSTNTANEPRESIT 120

Query: 121 ESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAEAVS-PAASPDSYSGYFSGQ 180
           ESAVRDLARIQTLH RI ERKNQDTTSRLK  N ER+KP E VS PA SP+SY+ YFSGQ
Sbjct: 121 ESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVERKKPMEKVSSPAESPESYADYFSGQ 180

Query: 181 LMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYDP 240
           LMATLESGVSLGSGEYFIDVF+GSPPKHFSLILDTGSDLNWIQCVPC+DCFEQ GPYYDP
Sbjct: 181 LMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDP 240

Query: 241 KDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVNL 300
           KDSISFRNITCNDPRCQLVSSPDPPQPCK E QSCPYFYWYGD SNTTGDFALETFTVNL
Sbjct: 241 KDSISFRNITCNDPRCQLVSSPDPPQPCKFEKQSCPYFYWYGDSSNTTGDFALETFTVNL 300

Query: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360
           TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360

Query: 361 VDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQI 420
           VDR+SDTSVSSKLIFGED+DLLTHPEL FTSL GGKENPVDTFYYLQIKSIFVGGEKLQI
Sbjct: 361 VDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEKLQI 420

Query: 421 PEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYNV 480
           PEENW +SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK YKLVEDFPILHPCYNV
Sbjct: 421 PEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNV 480

Query: 481 SSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQN 540
           SS D+L FPEF IQFADGAVW FPVENYFIRI+Q D+VCLAMLGTPKSALSIIGNYQQQN
Sbjct: 481 SSTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQN 540

Query: 541 FHILYDTKNSRLGFAPMRCADV 559
           FHILYDTKNSRLG+APMRCA+V
Sbjct: 541 FHILYDTKNSRLGYAPMRCAEV 562

BLAST of CmaCh01G013150 vs. NCBI nr
Match: gi|567889228|ref|XP_006437136.1| (hypothetical protein CICLE_v10031104mg [Citrus clementina])

HSP 1 Score: 797.3 bits (2058), Expect = 1.7e-227
Identity = 389/564 (68.97%), Positives = 461/564 (81.74%), Query Frame = 1

Query: 19  VYLALIFLLLFSSVFDTIAEAHVRQGFNE--SNRSGVFGIELPENISSGIATSSVSAPCS 78
           V L L+ L + +  FD +A AH  +  N   SN S + GI+LP+++S    +SS ++ CS
Sbjct: 5   VSLVLVLLSISAGSFDAVARAHDHRRTNSFNSNTSSLAGIKLPDHMSFNAVSSSTNSGCS 64

Query: 79  FSN---------------------EDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPKES 138
           FS                      +D++ ++ L  K  K+ VKLHLK RS +R TEPK+S
Sbjct: 65  FSKSNKPTHPERIDTQEKDGDVALDDDDGDDLLTLKLSKQKVKLHLKHRSKNRETEPKKS 124

Query: 139 ITESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAE-AVSPAASPDSYSGYFS 198
           ++ES +RDL RIQ LH+RI E+KNQ+T SRLK  + + +K  +  V+PAASP+SY+   S
Sbjct: 125 VSESTIRDLTRIQALHRRIIEKKNQNTVSRLKKESQKSKKQIKPVVTPAASPESYASGVS 184

Query: 199 GQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYY 258
           GQL+ATLESGVSLG+GEYF+DVFVG+PPKH+  ILDTGSDLNWIQCVPC+DCFEQ GP+Y
Sbjct: 185 GQLVATLESGVSLGAGEYFMDVFVGTPPKHYYFILDTGSDLNWIQCVPCYDCFEQNGPHY 244

Query: 259 DPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTV 318
           DPKDS SF+NI+C+DPRC LVSSPDPP+PC++E Q+CPYFYWYGD SNTTGDFALETFTV
Sbjct: 245 DPKDSSSFKNISCHDPRCHLVSSPDPPRPCQAENQTCPYFYWYGDSSNTTGDFALETFTV 304

Query: 319 NLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 378
           NL S+ TGKSEFR+VENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY
Sbjct: 305 NL-STPTGKSEFRQVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 364

Query: 379 CLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKL 438
           CLVDRNSDT+VSSKLIFGED+DLL HP L FTSL  GKENPVDTFYYLQIKSI VGGE L
Sbjct: 365 CLVDRNSDTNVSSKLIFGEDKDLLNHPNLNFTSLVSGKENPVDTFYYLQIKSIIVGGEVL 424

Query: 439 QIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCY 498
            IP+E W++S +GAGGTIIDSGTTLSYF++PAY+IIK+AF++KVK Y LV+DFPIL PCY
Sbjct: 425 SIPDETWRLSPEGAGGTIIDSGTTLSYFAEPAYQIIKQAFMKKVKGYPLVKDFPILDPCY 484

Query: 499 NVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQ 558
           NVS  +K+E PEF IQFADG VW FPVENYFIR++  D+VCLA+LGTP+SALSIIGNYQQ
Sbjct: 485 NVSGIEKMELPEFGIQFADGGVWNFPVENYFIRLDPEDVVCLAILGTPRSALSIIGNYQQ 544

BLAST of CmaCh01G013150 vs. NCBI nr
Match: gi|985457792|ref|XP_015387676.1| (PREDICTED: uncharacterized protein LOC102625748 [Citrus sinensis])

HSP 1 Score: 797.0 bits (2057), Expect = 2.2e-227
Identity = 389/564 (68.97%), Positives = 461/564 (81.74%), Query Frame = 1

Query: 19  VYLALIFLLLFSSVFDTIAEAHVRQGFNE--SNRSGVFGIELPENISSGIATSSVSAPCS 78
           V L L+ L + +  FD +A AH  +  N   SN S + GI+LP+++S    +SS ++ CS
Sbjct: 5   VSLVLVLLSISAGSFDAVARAHDHRRTNSFNSNTSSLAGIKLPDHMSFNAVSSSTNSGCS 64

Query: 79  FSN---------------------EDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPKES 138
           FS                      +D++ ++ L  K  K+ VKLHLK RS +R TEPK+S
Sbjct: 65  FSKSNKPTHPERIDTQEKDGDVALDDDDGDDLLTLKLSKQKVKLHLKHRSKNRETEPKKS 124

Query: 139 ITESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAEAV-SPAASPDSYSGYFS 198
           ++ES +RDL RIQ LH+RI E+KNQ+T SRLK  + + +K  + V +PAASP+SY+   S
Sbjct: 125 VSESTIRDLTRIQALHRRIIEKKNQNTVSRLKKESQKSKKQIKPVLTPAASPESYASGVS 184

Query: 199 GQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYY 258
           GQL+ATLESGVSLG+GEYF+DVFVG+PPKH+  ILDTGSDLNWIQCVPC+DCFEQ GP+Y
Sbjct: 185 GQLVATLESGVSLGAGEYFMDVFVGTPPKHYYFILDTGSDLNWIQCVPCYDCFEQNGPHY 244

Query: 259 DPKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTV 318
           DPKDS SF+NI+C+DPRC LVSSPDPP+PC++E Q+CPYFYWYGD SNTTGDFALETFTV
Sbjct: 245 DPKDSSSFKNISCHDPRCHLVSSPDPPRPCQAENQTCPYFYWYGDSSNTTGDFALETFTV 304

Query: 319 NLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 378
           NL S+ TGKSEFR+VENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY
Sbjct: 305 NL-STPTGKSEFRQVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 364

Query: 379 CLVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKL 438
           CLVDRNSDT+VSSKLIFGED+DLL HP L FTSL  GKENPVDTFYYLQIKSI VGGE L
Sbjct: 365 CLVDRNSDTNVSSKLIFGEDKDLLNHPNLNFTSLVSGKENPVDTFYYLQIKSIIVGGEVL 424

Query: 439 QIPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCY 498
            IP+E W++S +GAGGTIIDSGTTLSYF++PAY+IIK+AF++KVK Y LV+DFPIL PCY
Sbjct: 425 SIPDETWRLSPEGAGGTIIDSGTTLSYFAEPAYQIIKQAFMKKVKGYPLVKDFPILDPCY 484

Query: 499 NVSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQ 558
           NVS  +K+E PEF IQFADG VW FPVENYFIR++  D+VCLA+LGTP+SALSIIGNYQQ
Sbjct: 485 NVSGIEKMELPEFGIQFADGGVWNFPVENYFIRLDPEDVVCLAILGTPRSALSIIGNYQQ 544

BLAST of CmaCh01G013150 vs. NCBI nr
Match: gi|743902826|ref|XP_011044763.1| (PREDICTED: aspartic proteinase nepenthesin-2-like [Populus euphratica])

HSP 1 Score: 775.8 bits (2002), Expect = 5.2e-221
Identity = 381/563 (67.67%), Positives = 449/563 (79.75%), Query Frame = 1

Query: 19  VYLALIFLLLFSSVFDTIAEAHVRQGFNESNRSGVFGIELPENIS-SGIATSSVSAPCSF 78
           + L L+  LLFS  F+ IA  H  +   +SN S + GIELP+++S + +++S+ +  C+ 
Sbjct: 7   IVLVLVLSLLFSGAFEAIAGIHDHRKNVKSNISTLAGIELPDHMSFNAVSSSTTNTGCNL 66

Query: 79  S---------------------NEDEEEEERLMAKSVKKSVKLHLKKRSTSRVTEPKESI 138
                                 ++D+++E     K  K++VKLHLK RS  R +E KES 
Sbjct: 67  DTSKKVKQSQTIVSQEDFDLLEDDDDDDEGGEEGKEAKQTVKLHLKHRSKDRKSEGKESF 126

Query: 139 TESAVRDLARIQTLHKRITERKNQDTTSRLKNGNAERRKPAEAV-SPAASPDSYSGYFSG 198
            ES  RDLARIQTLH RI E+KNQ+  SRLK       K  + V + AASP+SY    SG
Sbjct: 127 VESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKKRPEKQIKTVVATAASPESYGTGLSG 186

Query: 199 QLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQTGPYYD 258
           QLMATLESGVSLGSGEYF+DVF+G+PPKH+SLILDTGSDLNWIQCVPCHDCFEQ GPYYD
Sbjct: 187 QLMATLESGVSLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYD 246

Query: 259 PKDSISFRNITCNDPRCQLVSSPDPPQPCKSETQSCPYFYWYGDCSNTTGDFALETFTVN 318
           PK+S SFRNI C+DPRC LVSSPDPP PCK+E Q+CPYFYWYGD SNTTGDFALETFTVN
Sbjct: 247 PKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFALETFTVN 306

Query: 319 LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYC 378
           LTS T GKSEF+RVENVMFGCGHWNRGLFHGA+GLLGLGRGPLSFSSQLQSLYGHSFSYC
Sbjct: 307 LTSPT-GKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 366

Query: 379 LVDRNSDTSVSSKLIFGEDRDLLTHPELKFTSLFGGKENPVDTFYYLQIKSIFVGGEKLQ 438
           LVDRNSD++VSSKLIFGED+DLL HPEL FT+L GGKENPVDTFYY+QIKSI VGGE L 
Sbjct: 367 LVDRNSDSNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLN 426

Query: 439 IPEENWKISADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKNYKLVEDFPILHPCYN 498
           IPE  W +++DG GGTI+DSGTTLSYF++PAY+IIK+AF++KVK Y  V+DFPIL PCYN
Sbjct: 427 IPEGTWNLTSDGVGGTIVDSGTTLSYFAEPAYQIIKDAFVKKVKGYPTVQDFPILDPCYN 486

Query: 499 VSSADKLEFPEFEIQFADGAVWKFPVENYFIRIEQFDMVCLAMLGTPKSALSIIGNYQQQ 558
           VS  +K+E P+F + FADGAVW FPVENYFIR++  ++VCLA+LGTP+SALSIIGNYQQQ
Sbjct: 487 VSGVEKIELPDFGLLFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQ 546
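
The percentages in each HSP header above follow directly from the reported counts: for example, `Identity = 187/543 (34.44%)` is 187 identical positions over an alignment length of 543. The sketch below is only an illustration of that arithmetic; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool. The pattern accepts both the spelling `Positives` and the `Postives` variant that some output uses.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP header.
# "Posi?tives" also accepts the "Postives" spelling seen in some output.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP header line and recompute its percentages (hypothetical helper)."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the header, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 187/543 (34.44%), Postives = 274/543 (50.46%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])  # 34.44 50.46
```

Comparing the recomputed values with the reported ones is a cheap consistency check when the headers have been copied or re-typed.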

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
APF2_ARATH   1.4e-72  38.96         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
ASPG1_ARATH  3.5e-68  36.95         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
ASPG2_ARATH  9.8e-63  34.09         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
NEP2_NEPGR   4.9e-62  38.38         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
NEP1_NEPGR   1.9e-61  37.30         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
Match Name        E-value   Identity (%)  Description
A0A0A0L2W1_CUCSA  5.8e-288  87.37         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G608130 PE=3 SV=1
V4TKP8_9ROSI      1.2e-227  68.97         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031104mg PE=3 SV=1
B9H837_POPTR      8.0e-221  67.62         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s22630g PE=3 SV=2
A0A067JSN0_JATCU  2.8e-218  68.76         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22200 PE=3 SV=1
A0A061DT42_THECC  4.1e-217  65.80         Eukaryotic aspartyl protease family protein isoform 1 OS=Theobroma cacao GN=TCM_...
Match Name   E-value   Identity (%)  Description
AT3G59080.1  5.5e-197  64.05         Eukaryotic aspartyl protease family protein
AT2G42980.1  1.2e-180  59.55         Eukaryotic aspartyl protease family protein
AT1G25510.1  7.0e-75   34.44         Eukaryotic aspartyl protease family protein
AT1G01300.1  7.7e-74   38.96         Eukaryotic aspartyl protease family protein
AT3G25700.1  7.2e-72   41.19         Eukaryotic aspartyl protease family protein
Match Name                        E-value   Identity (%)  Description
gi|778695350|ref|XP_011653979.1|  8.3e-288  87.37         PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis sativus]
gi|659130979|ref|XP_008465452.1|  1.6e-286  87.01         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo]
gi|567889228|ref|XP_006437136.1|  1.7e-227  68.97         hypothetical protein CICLE_v10031104mg [Citrus clementina]
gi|985457792|ref|XP_015387676.1|  2.2e-227  68.97         PREDICTED: uncharacterized protein LOC102625748 [Citrus sinensis]
gi|743902826|ref|XP_011044763.1|  5.2e-221  67.67         PREDICTED: aspartic proteinase nepenthesin-2-like [Populus euphratica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR001969  Aspartic_peptidase_AS
IPR021109  Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
biological_process GO:0006412 translation
biological_process GO:0042254 ribosome biogenesis
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005840 ribosome
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003735 structural constituent of ribosome

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh01G013150.1  CmaCh01G013150.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR001461  Aspartic peptidase A1 family  PRINTS  PR00792  PEPSIN
    coord: 527..542  score: 5.7E-7
    coord: 198..218  score: 5.7E-7
    coord: 431..442  score: 5.

IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683  ASPARTYL PROTEASES
    coord: 81..148  score: 3.8E-283
    coord: 179..558  score: 3.8E-283
    coord: 14..38  score: 3.8E

IPR001969  Aspartic peptidase, active site  PROSITE  PS00141  ASP_PROTEASE
    coord: 207..218  scor

IPR021109  Aspartic peptidase domain  GENE3D  G3DSA:2.40.70.10
    coord: 180..372  score: 1.8E-38
    coord: 393..556  score: 2.9

IPR021109  Aspartic peptidase domain  unknown  SSF50630  Acid proteases
    coord: 184..556  score: 2.82

None  No IPR available  PANTHER  PTHR13683:SF250  ASPARTYL PROTEASE-LIKE PROTEIN
    coord: 81..148  score: 3.8E-283
    coord: 179..558  score: 3.8E-283
    coord: 14..38  score: 3.8E
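
Each `coord: a..b` entry in the alignment column gives the residue range of one signature match on the protein. As a rough illustration, the sketch below extracts those ranges and computes their lengths, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page). The helper name `hit_spans` is hypothetical.

```python
import re

# Matches entries such as "coord: 180..372".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def hit_spans(text):
    """Return (start, end, length) for every 'coord: a..b' entry in text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    return [
        (int(start), int(end), int(end) - int(start) + 1)
        for start, end in COORD.findall(text)
    ]


# The two GENE3D G3DSA:2.40.70.10 matches listed above.
print(hit_spans("coord: 180..372 score: 1.8E-38 coord: 393..556"))
# [(180, 372, 193), (393, 556, 164)]
```

Because the pattern only looks at the `coord:` entries, it works whether or not the accompanying score values are complete.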

The following gene(s) are paralogous to this gene:

None