CmaCh04G013250 (gene) Cucurbita maxima (Rimu)

NameCmaCh04G013250
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionEukaryotic aspartyl protease family protein
LocationCma_Chr04 : 6753092 .. 6755618 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCCTTTCTTCACTTCCACACTCTCTCAGTCCCAAATTGAAGGCATGAAGACAAAATCCCCCTTCCTACTCTCTCTCCTCCTCCTTCTCCTCTCCATATCCGCCGAATCCCTCCACCACCACCATCACCCGAACTGTAACGCCGCCGCCGACCGCAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGTTCCCCCTTCCGCCCCTCCAAGCCGCTGTCATGGGCCGACAATGTGCTCCAAATGCAGGCCAAGGACCAAGCCCGCCTCCAGTTCCTGTCCTCCCTTGTCGCCTGCAAGTCCGTCGCCCCCATTGCCTCCGCCCGCCAACTCATCCAGAGCCCCACTTTCGTGGTCCGGGCCAAGATCGGCACCCCTGCTCAGACCCTCCTCATGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCCTGTTCCGGCTGTGTCGGCTGCCCTTCCACCACCGTCTTCTCCTCCGATAAGTCCACCTCCTTCCGCCCCCTCCCCTGCCGATCCCCTCAATGCAACCAGGTACATAGCCCTGTTCTATTTCAAATCAATTCCCCTTTTTCTTCTCATATTTCTTGAATTTTTCCAACTCACCAATGTGGGGGCATTTAAATTCTCATGATTCTACATTAAACCTTTATTATCGAAGTGCACGCGCTCACGCGCCGCCTGTATCGGTGGAGAAGCCAGATATCCGTTGGAATATTCAATGATTATAAACCTACCTTTATTCACCCACATAGCATTCCCCTTTTAGTTGGTAATATGTGGGGTTCCCCTCTTTATTATGCTTTCTTTGGATTTTTTTTTTTATAAAATAATACTTTTAATTTTATTTCTAAATCCAATAAAATAAATTAATTGAATATATTTTTTTTTGAAAATTAATAAGAACTTCGGATGCCTTTTGTAGAATCAAATCCATGTTCCCACGTGCCAGTGGTGACACGTGTGTCCAAAAACTCCATTCCTCGTGATACGAGCCCCTCTTTATCACGTCAATCATGTAACAAATCAACTATTTCTTAACATTATTTTAAAAGAAAGGAATTAATTTCGGTAAAAGTTAAATAATAAATAATTTATTTATTTTTTAATTTTTGGATTTCTCTAGAATCTATGACTGTCAGTGAGTCACCGACGGGAACGTTCAGTCACGGGCCAAAAAACTTCTTCCCAACAAGACTTATCGTATTATTAAACATCGCACGTTAACGGCACGTGACCCTTTTGTTTATCAGGTACCGAACCCCAGTTGCAGCAGTAGCGCGTGCGGCTTCAACTTGACGTACGGCAGCTCGACTGTGGCGGCGGATCTGGTTCAAGACAACATAACTCTGGCCAACGACTCAGTTCCGGCGTACACATTTGGGTGCATCACGAAGGCGACGGGTAGTTCAGTGCCACCGCAGGGACTATTGGGCCTGGGTCGAGGCCCATTATCGCTTTTGGGCCAGAGCCAGAGTCTGTACCGGTCCACATTCTCGTACTGCCTTCCGAGCTTCAAATCGACCAGCTTTTCTGGGTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCAAGAGGATTAAGTACACGCCGCTGCTCCGAAACCCGAGGCGGTCGTCTTTGTATTACGTGAATCTAATCGCGATCAGAGTCGGCCGGCGAATCGTCGACATTCCTCCCTCTGCTCTTGCCTTTAACTCCGCCACCGGCGCCGGCACCATCATCGATTCCGGTATCGATCTAAATAGATTTTAAAAATATTTTAGACTCAATTAATAAAAATACACTTCTAATTTTAAAAATATCTTTAATTCAGGGACGACATTCACAAGGCTGGTGGCACCGGCGTACACGGCGGTGAGAAACGAATTCCGGCGAAGAATAGGCAACGCAACCATCTCATCCCTCGGCGGCTTCGACACGTGCTACACAGTCCCCATCATCTCTCCCACCATAACCTTCATGTTCGCCGGAATGAACGTAACTCTTCCGCCGGACAACTTCCTCCTCCACAGCTCCGCCGGAACCACCACCTGCCTCGCCATGGCCGCCGCCCCGGATAACGTCAACTCCGTACTCAACGTTATTGCCAGCATGCAGCAGCAGAACCACCGCATTCTCTTCGACGTTCCAAATTCCCGAATGGGCGTCGCCCGCGAACCCTGCTCTTAAACGGCCGTCGTCCGCCGTATTATCTCCAGCTCCGGTGATATTTATTTTTGTTGGCTTTAGAATAATAAGAATGTGTGGTCATCATGGTTTGGTGGGGGGATGGGGAAAAGGGTAAAGTAGTAATTTTGTATGGCTGTTTTTATAGTTGTGTTGTTTGGGATTCTCAAGGTTTTAGCTTATTATATTATGGATGTCTGAAAAAGCAAAGCTTTTTTTCTTCAATTCCAGTGTTGTGTAAGTTTACTACTAATAACCTAAGTCTTAATGACAATTTAATCTTAGATCTCAAGTCCGTTAAAGTCGTTAAACCCTTTTTTAATAGACGCGTTTTAAAATTGTGAGGCCAACGGCGATATGTAAC

mRNA sequence

CCCTTTCTTCACTTCCACACTCTCTCAGTCCCAAATTGAAGGCATGAAGACAAAATCCCCCTTCCTACTCTCTCTCCTCCTCCTTCTCCTCTCCATATCCGCCGAATCCCTCCACCACCACCATCACCCGAACTGTAACGCCGCCGCCGACCGCAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGTTCCCCCTTCCGCCCCTCCAAGCCGCTGTCATGGGCCGACAATGTGCTCCAAATGCAGGCCAAGGACCAAGCCCGCCTCCAGTTCCTGTCCTCCCTTGTCGCCTGCAAGTCCGTCGCCCCCATTGCCTCCGCCCGCCAACTCATCCAGAGCCCCACTTTCGTGGTCCGGGCCAAGATCGGCACCCCTGCTCAGACCCTCCTCATGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCCTGTTCCGGCTGTGTCGGCTGCCCTTCCACCACCGTCTTCTCCTCCGATAAGTCCACCTCCTTCCGCCCCCTCCCCTGCCGATCCCCTCAATGCAACCAGGTACCGAACCCCAGTTGCAGCAGTAGCGCGTGCGGCTTCAACTTGACGTACGGCAGCTCGACTGTGGCGGCGGATCTGGTTCAAGACAACATAACTCTGGCCAACGACTCAGTTCCGGCGTACACATTTGGGTGCATCACGAAGGCGACGGGTAGTTCAGTGCCACCGCAGGGACTATTGGGCCTGGGTCGAGGCCCATTATCGCTTTTGGGCCAGAGCCAGAGTCTGTACCGGTCCACATTCTCGTACTGCCTTCCGAGCTTCAAATCGACCAGCTTTTCTGGGTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCAAGAGGATTAAGTACACGCCGCTGCTCCGAAACCCGAGGCGGTCGTCTTTGTATTACGTGAATCTAATCGCGATCAGAGTCGGCCGGCGAATCGTCGACATTCCTCCCTCTGCTCTTGCCTTTAACTCCGCCACCGGCGCCGGCACCATCATCGATTCCGGGACGACATTCACAAGGCTGGTGGCACCGGCGTACACGGCGGTGAGAAACGAATTCCGGCGAAGAATAGGCAACGCAACCATCTCATCCCTCGGCGGCTTCGACACGTGCTACACAGTCCCCATCATCTCTCCCACCATAACCTTCATGTTCGCCGGAATGAACGTAACTCTTCCGCCGGACAACTTCCTCCTCCACAGCTCCGCCGGAACCACCACCTGCCTCGCCATGGCCGCCGCCCCGGATAACGTCAACTCCGTACTCAACGTTATTGCCAGCATGCAGCAGCAGAACCACCGCATTCTCTTCGACGTTCCAAATTCCCGAATGGGCGTCGCCCGCGAACCCTGCTCTTAAACGGCCGTCGTCCGCCGTATTATCTCCAGCTCCGGTGATATTTATTTTTGTTGGCTTTAGAATAATAAGAATGTGTGGTCATCATGGTTTGGTGGGGGGATGGGGAAAAGGGTAAAGTAGTAATTTTGTATGGCTGTTTTTATAGTTGTGTTGTTTGGGATTCTCAAGGTTTTAGCTTATTATATTATGGATGTCTGAAAAAGCAAAGCTTTTTTTCTTCAATTCCAGTGTTGTGTAAGTTTACTACTAATAACCTAAGTCTTAATGACAATTTAATCTTAGATCTCAAGTCCGTTAAAGTCGTTAAACCCTTTTTTAATAGACGCGTTTTAAAATTGTGAGGCCAACGGCGATATGTAAC

Coding sequence (CDS)

ATGAAGACAAAATCCCCCTTCCTACTCTCTCTCCTCCTCCTTCTCCTCTCCATATCCGCCGAATCCCTCCACCACCACCATCACCCGAACTGTAACGCCGCCGCCGACCGCAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGTTCCCCCTTCCGCCCCTCCAAGCCGCTGTCATGGGCCGACAATGTGCTCCAAATGCAGGCCAAGGACCAAGCCCGCCTCCAGTTCCTGTCCTCCCTTGTCGCCTGCAAGTCCGTCGCCCCCATTGCCTCCGCCCGCCAACTCATCCAGAGCCCCACTTTCGTGGTCCGGGCCAAGATCGGCACCCCTGCTCAGACCCTCCTCATGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCCTGTTCCGGCTGTGTCGGCTGCCCTTCCACCACCGTCTTCTCCTCCGATAAGTCCACCTCCTTCCGCCCCCTCCCCTGCCGATCCCCTCAATGCAACCAGGTACCGAACCCCAGTTGCAGCAGTAGCGCGTGCGGCTTCAACTTGACGTACGGCAGCTCGACTGTGGCGGCGGATCTGGTTCAAGACAACATAACTCTGGCCAACGACTCAGTTCCGGCGTACACATTTGGGTGCATCACGAAGGCGACGGGTAGTTCAGTGCCACCGCAGGGACTATTGGGCCTGGGTCGAGGCCCATTATCGCTTTTGGGCCAGAGCCAGAGTCTGTACCGGTCCACATTCTCGTACTGCCTTCCGAGCTTCAAATCGACCAGCTTTTCTGGGTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCAAGAGGATTAAGTACACGCCGCTGCTCCGAAACCCGAGGCGGTCGTCTTTGTATTACGTGAATCTAATCGCGATCAGAGTCGGCCGGCGAATCGTCGACATTCCTCCCTCTGCTCTTGCCTTTAACTCCGCCACCGGCGCCGGCACCATCATCGATTCCGGGACGACATTCACAAGGCTGGTGGCACCGGCGTACACGGCGGTGAGAAACGAATTCCGGCGAAGAATAGGCAACGCAACCATCTCATCCCTCGGCGGCTTCGACACGTGCTACACAGTCCCCATCATCTCTCCCACCATAACCTTCATGTTCGCCGGAATGAACGTAACTCTTCCGCCGGACAACTTCCTCCTCCACAGCTCCGCCGGAACCACCACCTGCCTCGCCATGGCCGCCGCCCCGGATAACGTCAACTCCGTACTCAACGTTATTGCCAGCATGCAGCAGCAGAACCACCGCATTCTCTTCGACGTTCCAAATTCCCGAATGGGCGTCGCCCGCGAACCCTGCTCTTAA

Protein sequence

MKTKSPFLLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPCS
BLAST of CmaCh04G013250 vs. Swiss-Prot
Match: AED3_ARATH (Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1)

HSP 1 Score: 411.8 bits (1057), Expect = 9.6e-114
Identity = 220/420 (52.38%), Postives = 290/420 (69.05%), Query Frame = 1

Query: 33  AAADRSSTLQVFHIFSPCSPFRPSK-PLSWADNVLQMQAKDQARLQFLSSLVACK---SV 92
           AA D S  L +  I + CSPF P+    S  D VL M + D  RL +LSSLVA K   + 
Sbjct: 31  AAPDGSDDLSIIPINAKCSPFAPTHVSASVIDTVLHMASSDSHRLTYLSSLVAGKPKPTS 90

Query: 93  APIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPS-TTVFSSDK 152
            P+AS  QL     +VVRAK+GTP Q + M LDTSNDA W+PCSGC GC + +T F+++ 
Sbjct: 91  VPVASGNQL-HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNS 150

Query: 153 STSFRPLPCRSPQCNQVPNPSCSSSA-----CGFNLTYGS-STVAADLVQDNITLANDSV 212
           S+++  + C + QC Q    +C SS+     C FN +YG  S+ +A LVQD +TLA D +
Sbjct: 151 SSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVI 210

Query: 213 PAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFSGSLR 272
           P ++FGCI  A+G+S+PPQGL+GLGRGP+SL+ Q+ SLY   FSYCLPSF+S  FSGSL+
Sbjct: 211 PNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 270

Query: 273 LGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTIIDS 332
           LG + QPK I+YTPLLRNPRR SLYYVNL  + VG   V + P  L F++ +GAGTIIDS
Sbjct: 271 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDS 330

Query: 333 GTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTV--PIISPTITFMFAGMNVTL 392
           GT  TR   P Y A+R+EFR+++  ++ S+LG FDTC++     ++P IT     +++ L
Sbjct: 331 GTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAFDTCFSADNENVAPKITLHMTSLDLKL 390

Query: 393 PPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPCS 440
           P +N L+HSSAGT TCL+MA    N N+VLNVIA++QQQN RILFDVPNSR+G+A EPC+
Sbjct: 391 PMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CmaCh04G013250 vs. Swiss-Prot
Match: AP25_ORYSJ (Aspartyl protease 25 OS=Oryza sativa subsp. japonica GN=AP25 PE=2 SV=1)

HSP 1 Score: 351.7 bits (901), Expect = 1.2e-95
Identity = 206/456 (45.18%), Postives = 284/456 (62.28%), Query Frame = 1

Query: 9   LSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQM 68
           + LLLLLL+ +  +          AAA+    L V+H   P SP     PL   ++++ +
Sbjct: 7   IPLLLLLLAATVAA----------AAAE----LSVYHNVHPSSP----SPL---ESIIAL 66

Query: 69  QAKDQARLQFLSSLVACKSV--APIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDA 128
              D ARL FLSS  A   V  AP+AS +     P++VVRA +G+P+Q LL+ALDTS DA
Sbjct: 67  ARDDDARLLFLSSKAATAGVSSAPVASGQA---PPSYVVRAGLGSPSQQLLLALDTSADA 126

Query: 129 AWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSS-------------- 188
            W  CS C  CPS+++F+   S+S+  LPC S  C      +C +               
Sbjct: 127 TWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLP 186

Query: 189 ACGFNLTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATG--SSVPPQGLLGLGRGPL 248
            C F+  +  ++  A L  D + L  D++P YTFGC++  TG  +++P QGLLGLGRGP+
Sbjct: 187 TCAFSKPFADASFQAALASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPM 246

Query: 249 SLLGQSQSLYRSTFSYCLPSFKSTSFSGSLRLGPVA-QPKRIKYTPLLRNPRRSSLYYVN 308
           +LL Q+ SLY   FSYCLPS++S  FSGSLRLG    QP+ ++YTP+LRNP RSSLYYVN
Sbjct: 247 ALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVN 306

Query: 309 LIAIRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNAT- 368
           +  + VG   V +P  + AF++ATGAGT++DSGT  TR  AP Y A+R EFRR++   + 
Sbjct: 307 VTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSG 366

Query: 369 ISSLGGFDTCYTVPII----SPTIT-FMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPD 428
            +SLG FDTC+    +    +P +T  M  G+++ LP +N L+HSSA    CLAMA AP 
Sbjct: 367 YTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQ 426

Query: 429 NVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPCS 440
           NVNSV+NVIA++QQQN R++FDV NSR+G A+E C+
Sbjct: 427 NVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438

BLAST of CmaCh04G013250 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 2.7e-47
Identity = 118/353 (33.43%), Postives = 182/353 (51.56%), Query Frame = 1

Query: 100 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPSTT--VFSSDKSTSFRPLPCR 159
           S  +  R  +GTPA+ + M LDT +D  W+ C+ C  C S +  +F   KS ++  +PC 
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198

Query: 160 SPQCNQVPNPSCSS--SACGFNLTYGS-STVAADLVQDNITLANDSVPAYTFGCITKATG 219
           SP C ++ +  C++    C + ++YG  S    D   + +T   + V     GC     G
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEG 258

Query: 220 SSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYT 279
             V   GLLGLG+G LS  GQ+   +   FSYCL    ++S   S+  G  A  +  ++T
Sbjct: 259 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 318

Query: 280 PLLRNPRRSSLYYVNLIAIRVG-RRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAY 339
           PLL NP+  + YYV L+ I VG  R+  +  S    +     G IIDSGT+ TRL+ PAY
Sbjct: 319 PLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378

Query: 340 TAVRNEFRRRIGNATIS---SLGGFDTCYTVPIIS----PTITFMFAGMNVTLPPDNFLL 399
            A+R+ F  R+G  T+        FDTC+ +  ++    PT+   F G +V+LP  N+L+
Sbjct: 379 IAMRDAF--RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLI 438

Query: 400 HSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPCS 440
                   C A A         L++I ++QQQ  R+++D+ +SR+G A   C+
Sbjct: 439 PVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmaCh04G013250 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 5.2e-43
Identity = 116/348 (33.33%), Postives = 166/348 (47.70%), Query Frame = 1

Query: 100 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGC--PSTTVFSSDKSTSFRPLPCR 159
           S  + VR  +G+P +   M +D+ +D  W+ C  C  C   S  VF   KS S+  + C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 160 SPQCNQVPNPSCSSSACGFNLTYGS-STVAADLVQDNITLANDSVPAYTFGCITKATGSS 219
           S  C+++ N  C S  C + + YG  S     L  + +T A   V     GC  +  G  
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247

Query: 220 VPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPL 279
           +   GLLG+G G +S +GQ        F YCL S + T  +GSL  G  A P    + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307

Query: 280 LRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAV 339
           +RNPR  S YYV L  + VG   + +P            G ++D+GT  TRL   AY A 
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 367

Query: 340 RNEFRRRIGN-ATISSLGGFDTCYT----VPIISPTITFMFA-GMNVTLPPDNFLLHSSA 399
           R+ F+ +  N    S +  FDTCY     V +  PT++F F  G  +TLP  NFL+    
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427

Query: 400 GTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPC 439
             T C A AA+P      L++I ++QQ+  ++ FD  N  +G     C
Sbjct: 428 SGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmaCh04G013250 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 172.9 bits (437), Expect = 7.5e-42
Identity = 121/349 (34.67%), Postives = 178/349 (51.00%), Query Frame = 1

Query: 103 FVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGC--PSTTVFSSDKSTSFRPLPCRSPQ 162
           +++   IGTPAQ     +DT +D  W  C  C  C   ST +F+   S+SF  LPC S  
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 163 CNQVPNPSCSSSACGFNLTYGS-STVAADLVQDNITLANDSVPAYTFGCITKATG-SSVP 222
           C  + +P+CS++ C +   YG  S     +  + +T  + S+P  TFGC     G     
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGN 214

Query: 223 PQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFSGSLRLGPVAQPKRI--KYTPL 282
             GL+G+GRGPLSL  Q   L  + FSYC+    S++ S +L LG +A         T L
Sbjct: 215 GAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGSSTPS-NLLLGSLANSVTAGSPNTTL 274

Query: 283 LRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAG-TIIDSGTTFTRLVAPAYTA 342
           +++ +  + YY+ L  + VG   + I PSA A NS  G G  IIDSGTT T  V  AY +
Sbjct: 275 IQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQS 334

Query: 343 VRNEFRRRIGNATIS-SLGGFDTCYTVP-----IISPTITFMFAGMNVTLPPDNFLLHSS 402
           VR EF  +I    ++ S  GFD C+  P     +  PT    F G ++ LP +N+ +  S
Sbjct: 335 VRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPS 394

Query: 403 AGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPC 439
            G   CLAM ++       +++  ++QQQN  +++D  NS +  A   C
Sbjct: 395 NG-LICLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmaCh04G013250 vs. TrEMBL
Match: A0A0A0KTR5_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_5G623870 PE=3 SV=1)

HSP 1 Score: 771.2 bits (1990), Expect = 7.0e-220
Identity = 393/438 (89.73%), Postives = 414/438 (94.52%), Query Frame = 1

Query: 3   TKSPFLLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWA 62
           TKSPFLL LLLL L  S  +   H   NCN AADRSSTLQVFHIFSPCSPFRPSKPLSWA
Sbjct: 4   TKSPFLLLLLLLHL-FSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWA 63

Query: 63  DNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDT 122
           DNVLQMQAKDQARLQFLSSLVA +S  PIASARQLIQSPTFVVRAKIGTPAQTLL+ALDT
Sbjct: 64  DNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDT 123

Query: 123 SNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGS 182
           SNDAAWIPCSGC+GCPSTTVFSSDKS+SFRPLPC+SPQCNQVPNPSCS SACGFNLTYGS
Sbjct: 124 SNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLTYGS 183

Query: 183 STVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRS 242
           STVAADLVQDN+TLA DSVP+YTFGCI KATGSSVPPQGLLGLGRGPLSLLGQSQSLY+S
Sbjct: 184 STVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQS 243

Query: 243 TFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDI 302
           TFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLI+IRVGR+IVDI
Sbjct: 244 TFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDI 303

Query: 303 PPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIG-NATISSLGGFDTCYTV 362
           PPSALAFNSATGAGT+IDSGTTFTRLVAPAYTAVR+EFRRR+G N T+SSLGGFDTCYTV
Sbjct: 304 PPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV 363

Query: 363 PIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHR 422
           PIISPTITFMFAGMNVTLPPDNFL+HS+AG+TTCLAMAAAPDNVNSVLNVIASMQQQNHR
Sbjct: 364 PIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHR 423

Query: 423 ILFDVPNSRMGVAREPCS 440
           ILFD+PNSR+GVARE CS
Sbjct: 424 ILFDIPNSRVGVARESCS 440

BLAST of CmaCh04G013250 vs. TrEMBL
Match: A0A067L7Q3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24539 PE=3 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.2e-187
Identity = 333/432 (77.08%), Postives = 385/432 (89.12%), Query Frame = 1

Query: 8   LLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQ 67
           LLSL  L  S+ A+ LH +  P C++  D+ STLQVFH++SPCSPFRPSKPLSW ++VLQ
Sbjct: 5   LLSLAFLFFSL-AQGLHLN--PKCSSQ-DQGSTLQVFHVYSPCSPFRPSKPLSWEESVLQ 64

Query: 68  MQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAA 127
           MQAKDQARLQFLSSLVA +S  PIAS RQ+IQSPT++VRAKIGTPAQTLL+A+DTSNDAA
Sbjct: 65  MQAKDQARLQFLSSLVAGRSFVPIASGRQIIQSPTYIVRAKIGTPAQTLLLAVDTSNDAA 124

Query: 128 WIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGSSTVAA 187
           WIPCSGC GC S+TVF S KSTSF+ + C +PQC QVPNP+CS SAC FN TYGSS++AA
Sbjct: 125 WIPCSGCDGC-SSTVFDSVKSTSFQTVGCGAPQCKQVPNPTCSGSACTFNTTYGSSSIAA 184

Query: 188 DLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYC 247
           +L QD ++LA DSVP YTFGCI KATGSSVPPQGLLGLGRGPLSLL Q+Q+LY+STFSYC
Sbjct: 185 NLSQDTVSLATDSVPGYTFGCIAKATGSSVPPQGLLGLGRGPLSLLSQTQNLYQSTFSYC 244

Query: 248 LPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSAL 307
           LPSF+S +FSG+LRLGP  QPKRIK TPLLRNPRRSSLYYVNL+AIRVGRR+VDIPPSAL
Sbjct: 245 LPSFRSLNFSGTLRLGPNGQPKRIKTTPLLRNPRRSSLYYVNLVAIRVGRRVVDIPPSAL 304

Query: 308 AFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIISPT 367
           AFN  TGAGTI DSGT FTRLV PAYTAVR+ FR+R+GNAT++SLGGFDTCY+VPI++PT
Sbjct: 305 AFNPTTGAGTIFDSGTVFTRLVTPAYTAVRDAFRKRVGNATVTSLGGFDTCYSVPIVAPT 364

Query: 368 ITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVP 427
           ITFMF+GMNVTLPP+N L+HS+AG+T+CLA+AAAPDNVNSVLNVIA+MQQQNHRILFDVP
Sbjct: 365 ITFMFSGMNVTLPPENLLIHSTAGSTSCLAIAAAPDNVNSVLNVIANMQQQNHRILFDVP 424

Query: 428 NSRMGVAREPCS 440
           NSR+GVARE C+
Sbjct: 425 NSRLGVAREQCT 431

BLAST of CmaCh04G013250 vs. TrEMBL
Match: B9RG92_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1452350 PE=3 SV=1)

HSP 1 Score: 658.7 bits (1698), Expect = 5.0e-186
Identity = 324/432 (75.00%), Postives = 380/432 (87.96%), Query Frame = 1

Query: 8   LLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQ 67
           L SL  L  +++      H +P C    D+ S LQVFH++SPCSPF PSKPL W ++VLQ
Sbjct: 5   LFSLAFLFFTLAQGM---HLNPKCGIQ-DQGSNLQVFHVYSPCSPFWPSKPLKWEESVLQ 64

Query: 68  MQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAA 127
           MQAKDQARLQFLSSLVA KSV PIAS RQ++QSPT++VRAKIGTPAQT+L+A+DTSNDAA
Sbjct: 65  MQAKDQARLQFLSSLVARKSVVPIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAA 124

Query: 128 WIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGSSTVAA 187
           WIPCSGCVGC S+TVF++ KST+F+ + C +PQC QVPN  C  SAC FN+TYGSS++AA
Sbjct: 125 WIPCSGCVGC-SSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAA 184

Query: 188 DLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYC 247
           +L QD +TLA DS+P+YTFGC+T+ATGSS+PPQGLLGLGRGP+SLL Q+Q+LY+STFSYC
Sbjct: 185 NLSQDVVTLATDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYC 244

Query: 248 LPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSAL 307
           LPSF+S +FSGSLRLGPV QPKRIK TPLL+NPRRSSLYYVNL+AIRVGRR+VDIPPSAL
Sbjct: 245 LPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSAL 304

Query: 308 AFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIISPT 367
           AFN  TGAGTI DSGT FTRLVAPAYTAVR+ FR+R+GNAT++SLGGFDTCYT PI++PT
Sbjct: 305 AFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGGFDTCYTSPIVAPT 364

Query: 368 ITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVP 427
           ITFMF+GMNVTLPPDN L+HS+A + TCLAMAAAPDNVNSVLNVIA+MQQQNHRILFDVP
Sbjct: 365 ITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVP 424

Query: 428 NSRMGVAREPCS 440
           NSR+GVAREPC+
Sbjct: 425 NSRLGVAREPCT 431

BLAST of CmaCh04G013250 vs. TrEMBL
Match: A0A0B2PGF3_GLYSO (Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_045617 PE=3 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 4.0e-183
Identity = 324/444 (72.97%), Postives = 382/444 (86.04%), Query Frame = 1

Query: 1   MKTKSPFLLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLS 60
           MKT    L  L L LL    E L     P C+   D  STL+VFH+FSPCSPFRP KPLS
Sbjct: 1   MKTTLFSLSPLFLFLLFSLVEGLT----PKCDTQ-DHGSTLEVFHVFSPCSPFRPPKPLS 60

Query: 61  WADNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMAL 120
           WA++VLQ+QAKDQARLQFL+S+VA +SV PIAS RQ+IQSPT++VRAKIG+P QTLL+A+
Sbjct: 61  WAESVLQLQAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAM 120

Query: 121 DTSNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTY 180
           DTSNDAAWIPC+ C GC ST +F+ +KST+F+ + C SPQCNQVPNPSC +SAC FNLTY
Sbjct: 121 DTSNDAAWIPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTY 180

Query: 181 GSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLY 240
           GSS++AA++VQD +TLA D +P YTFGC+ K TG+S PPQGLLGLGRGPLSLL Q+Q+LY
Sbjct: 181 GSSSIAANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLY 240

Query: 241 RSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIV 300
           +STFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLL+NPRRSSLYYVNL+AIRVGR++V
Sbjct: 241 QSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVV 300

Query: 301 DIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRI-----GNATISSLGGF 360
           DIPP ALAFN+ATGAGT+ DSGT FTRLVAPAYTAVR+EF+RR+      N T++SLGGF
Sbjct: 301 DIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGF 360

Query: 361 DTCYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASM 420
           DTCYTVPI++PTITFMF+GMNVTLP DN L+HS+AG+TTCLAMA+APDNVNSVLNVIA+M
Sbjct: 361 DTCYTVPIVAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANM 420

Query: 421 QQQNHRILFDVPNSRMGVAREPCS 440
           QQQNHR+L+DVPNSR+GVARE C+
Sbjct: 421 QQQNHRVLYDVPNSRLGVARELCT 438

BLAST of CmaCh04G013250 vs. TrEMBL
Match: I1JW44_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_04G136000 PE=3 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 4.0e-183
Identity = 324/444 (72.97%), Postives = 382/444 (86.04%), Query Frame = 1

Query: 1   MKTKSPFLLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLS 60
           MKT    L  L L LL    E L     P C+   D  STL+VFH+FSPCSPFRP KPLS
Sbjct: 1   MKTTLFSLSPLFLFLLFSLVEGLT----PKCDTQ-DHGSTLEVFHVFSPCSPFRPPKPLS 60

Query: 61  WADNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMAL 120
           WA++VLQ+QAKDQARLQFL+S+VA +SV PIAS RQ+IQSPT++VRAKIG+P QTLL+A+
Sbjct: 61  WAESVLQLQAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAM 120

Query: 121 DTSNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTY 180
           DTSNDAAWIPC+ C GC ST +F+ +KST+F+ + C SPQCNQVPNPSC +SAC FNLTY
Sbjct: 121 DTSNDAAWIPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTY 180

Query: 181 GSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLY 240
           GSS++AA++VQD +TLA D +P YTFGC+ K TG+S PPQGLLGLGRGPLSLL Q+Q+LY
Sbjct: 181 GSSSIAANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLY 240

Query: 241 RSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIV 300
           +STFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLL+NPRRSSLYYVNL+AIRVGR++V
Sbjct: 241 QSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVV 300

Query: 301 DIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRI-----GNATISSLGGF 360
           DIPP ALAFN+ATGAGT+ DSGT FTRLVAPAYTAVR+EF+RR+      N T++SLGGF
Sbjct: 301 DIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGF 360

Query: 361 DTCYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASM 420
           DTCYTVPI++PTITFMF+GMNVTLP DN L+HS+AG+TTCLAMA+APDNVNSVLNVIA+M
Sbjct: 361 DTCYTVPIVAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANM 420

Query: 421 QQQNHRILFDVPNSRMGVAREPCS 440
           QQQNHR+L+DVPNSR+GVARE C+
Sbjct: 421 QQQNHRVLYDVPNSRLGVARELCT 438

BLAST of CmaCh04G013250 vs. TAIR10
Match: AT5G07030.1 (AT5G07030.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 583.6 bits (1503), Expect = 1.0e-166
Identity = 291/437 (66.59%), Postives = 355/437 (81.24%), Query Frame = 1

Query: 9   LSLLLLLLSISAESLHHHHHPNCNAAA--DRSSTLQVFHIFSPCSPFRPSKPLSWADNVL 68
           L L L L SI   +L  +H PNC+     D+ STL++FHI SPCSPF+ S PLSW   VL
Sbjct: 20  LVLFLQLFSILPLALGLNH-PNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVL 79

Query: 69  QMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDA 128
           Q  A+DQARLQ+LSSLVA +SV PIAS RQ++QS T++V+A IGTPAQ LL+A+DTS+D 
Sbjct: 80  QTLAQDQARLQYLSSLVAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDV 139

Query: 129 AWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGSSTVA 188
           AWIPCSGCVGCPS T FS  KSTSF+ + C +PQC QVPNP+C + AC FNLTYGSS++A
Sbjct: 140 AWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA 199

Query: 189 ADLVQDNITLANDSVPAYTFGCITKATGSSV--PPQGLLGLGRGPLSLLGQSQSLYRSTF 248
           A+L QD I LA D + A+TFGC+ K  G     PPQGLLGLGRGPLSL+ Q+QS+Y+STF
Sbjct: 200 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTF 259

Query: 249 SYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPP 308
           SYCLPSF+S +FSGSLRLGP +QP+R+KYT LLRNPRRSSLYYVNL+AIRVGR++VD+PP
Sbjct: 260 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 319

Query: 309 SALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRI--GNATISSLGGFDTCYTVP 368
           +A+AFN +TGAGTI DSGT +TRL  P Y AVRNEFR+R+    A ++SLGGFDTCY+  
Sbjct: 320 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQ 379

Query: 369 IISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRI 428
           +  PTITFMF G+N+T+P DN +LHS+AG+T+CLAMAAAP+NVNSV+NVIASMQQQNHR+
Sbjct: 380 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRV 439

Query: 429 LFDVPNSRMGVAREPCS 440
           L DVPN R+G+ARE CS
Sbjct: 440 LIDVPNGRLGLARERCS 455

BLAST of CmaCh04G013250 vs. TAIR10
Match: AT3G54400.1 (AT3G54400.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 565.1 bits (1455), Expect = 3.8e-161
Identity = 292/434 (67.28%), Postives = 349/434 (80.41%), Query Frame = 1

Query: 8   LLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQ 67
           LL LL+ LL + +ES+      NCN  +  SS L+VFHI S CSPF+ S  +SWAD +LQ
Sbjct: 5   LLILLISLLILKSESI------NCNEKS-HSSDLRVFHINSLCSPFKTS--VSWADTLLQ 64

Query: 68  MQAKDQARLQFLSSLVAC-KSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDA 127
               D+AR  +LSSL    KS  PIAS R ++QSPT++VRA IGTPAQ +L+ALDTSNDA
Sbjct: 65  ----DKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDA 124

Query: 128 AWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCS-SSACGFNLTYGSSTV 187
           AWIPCSGCVGC S+ +F   KS+S R L C +PQC Q PNPSC+ S +CGFN+TYG ST+
Sbjct: 125 AWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTI 184

Query: 188 AADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFS 247
            A L QD +TLA+D +P YTFGCI KA+G+S+P QGL+GLGRGPLSL+ QSQ+LY+STFS
Sbjct: 185 EAYLTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFS 244

Query: 248 YCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPS 307
           YCLP+ KS++FSGSLRLGP  QP RIK TPLL+NPRRSSLYYVNL+ IRVG +IVDIP S
Sbjct: 245 YCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 304

Query: 308 ALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIIS 367
           ALAF+ ATGAGTI DSGT +TRLV PAY AVRNEFRRR+ NA  +SLGGFDTCY+  ++ 
Sbjct: 305 ALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVVF 364

Query: 368 PTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 427
           P++TFMFAGMNVTLPPDN L+HSSAG  +CLAMAAAP NVNSVLNVIASMQQQNHR+L D
Sbjct: 365 PSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLID 424

Query: 428 VPNSRMGVAREPCS 440
           VPNSR+G++RE C+
Sbjct: 425 VPNSRLGISRETCT 425

BLAST of CmaCh04G013250 vs. TAIR10
Match: AT1G09750.1 (AT1G09750.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 411.8 bits (1057), Expect = 5.4e-115
Identity = 220/420 (52.38%), Postives = 290/420 (69.05%), Query Frame = 1

Query: 33  AAADRSSTLQVFHIFSPCSPFRPSK-PLSWADNVLQMQAKDQARLQFLSSLVACK---SV 92
           AA D S  L +  I + CSPF P+    S  D VL M + D  RL +LSSLVA K   + 
Sbjct: 31  AAPDGSDDLSIIPINAKCSPFAPTHVSASVIDTVLHMASSDSHRLTYLSSLVAGKPKPTS 90

Query: 93  APIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPS-TTVFSSDK 152
            P+AS  QL     +VVRAK+GTP Q + M LDTSNDA W+PCSGC GC + +T F+++ 
Sbjct: 91  VPVASGNQL-HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNS 150

Query: 153 STSFRPLPCRSPQCNQVPNPSCSSSA-----CGFNLTYGS-STVAADLVQDNITLANDSV 212
           S+++  + C + QC Q    +C SS+     C FN +YG  S+ +A LVQD +TLA D +
Sbjct: 151 SSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVI 210

Query: 213 PAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFSGSLR 272
           P ++FGCI  A+G+S+PPQGL+GLGRGP+SL+ Q+ SLY   FSYCLPSF+S  FSGSL+
Sbjct: 211 PNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 270

Query: 273 LGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTIIDS 332
           LG + QPK I+YTPLLRNPRR SLYYVNL  + VG   V + P  L F++ +GAGTIIDS
Sbjct: 271 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDS 330

Query: 333 GTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTV--PIISPTITFMFAGMNVTL 392
           GT  TR   P Y A+R+EFR+++  ++ S+LG FDTC++     ++P IT     +++ L
Sbjct: 331 GTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAFDTCFSADNENVAPKITLHMTSLDLKL 390

Query: 393 PPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPCS 440
           P +N L+HSSAGT TCL+MA    N N+VLNVIA++QQQN RILFDVPNSR+G+A EPC+
Sbjct: 391 PMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CmaCh04G013250 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 182.6 bits (462), Expect = 5.4e-46
Identity = 124/358 (34.64%), Postives = 179/358 (50.00%), Query Frame = 1

Query: 100 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPSTT--VFSSDKSTSFRPLPCR 159
           S  + +R  +GTPA  + M LDT +D  W+ CS C  C + T  +F   KS +F  +PC 
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCG 191

Query: 160 SPQCNQVPNPS-C---SSSACGFNLTYGS-STVAADLVQDNITLANDSVPAYTFGCITKA 219
           S  C ++ + S C    S  C + ++YG  S    D   + +T     V     GC    
Sbjct: 192 SRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDN 251

Query: 220 TGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFS----GSLRLGPVAQP 279
            G  V   GLLGLGRG LS   Q+++ Y   FSYCL    S+  S     ++  G  A P
Sbjct: 252 EGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVP 311

Query: 280 KRIKYTPLLRNPRRSSLYYVNLIAIRV-GRRIVDIPPSALAFNSATGAGTIIDSGTTFTR 339
           K   +TPLL NP+  + YY+ L+ I V G R+  +  S    ++    G IIDSGT+ TR
Sbjct: 312 KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTR 371

Query: 340 LVAPAYTAVRNEFR---RRIGNATISSLGGFDTCYTV----PIISPTITFMFAGMNVTLP 399
           L  PAY A+R+ FR    ++  A   SL  FDTC+ +     +  PT+ F F G  V+LP
Sbjct: 372 LTQPAYVALRDAFRLGATKLKRAPSYSL--FDTCFDLSGMTTVKVPTVVFHFGGGEVSLP 431

Query: 400 PDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPC 439
             N+L+  +     C A A    +    L++I ++QQQ  R+ +D+  SR+G     C
Sbjct: 432 ASNYLIPVNTEGRFCFAFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CmaCh04G013250 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 176.8 bits (447), Expect = 2.9e-44
Identity = 116/348 (33.33%), Postives = 166/348 (47.70%), Query Frame = 1

Query: 100 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGC--PSTTVFSSDKSTSFRPLPCR 159
           S  + VR  +G+P +   M +D+ +D  W+ C  C  C   S  VF   KS S+  + C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 160 SPQCNQVPNPSCSSSACGFNLTYGS-STVAADLVQDNITLANDSVPAYTFGCITKATGSS 219
           S  C+++ N  C S  C + + YG  S     L  + +T A   V     GC  +  G  
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247

Query: 220 VPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPL 279
           +   GLLG+G G +S +GQ        F YCL S + T  +GSL  G  A P    + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307

Query: 280 LRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAV 339
           +RNPR  S YYV L  + VG   + +P            G ++D+GT  TRL   AY A 
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 367

Query: 340 RNEFRRRIGN-ATISSLGGFDTCYT----VPIISPTITFMFA-GMNVTLPPDNFLLHSSA 399
           R+ F+ +  N    S +  FDTCY     V +  PT++F F  G  +TLP  NFL+    
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427

Query: 400 GTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRMGVAREPC 439
             T C A AA+P      L++I ++QQ+  ++ FD  N  +G     C
Sbjct: 428 SGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmaCh04G013250 vs. NCBI nr
Match: gi|659092233|ref|XP_008446966.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo])

HSP 1 Score: 777.3 bits (2006), Expect = 1.4e-221
Identity = 400/440 (90.91%), Postives = 418/440 (95.00%), Query Frame = 1

Query: 3   TKSPFL-LSLLLLLLSISAESLHHHHHP-NCNAAADRSSTLQVFHIFSPCSPFRPSKPLS 62
           TKSPFL L LLLLLL + + S    H P NCN A DRSSTL+VFHIFSPCSPFRPSKPLS
Sbjct: 4   TKSPFLPLPLLLLLLLLFSISTAKSHIPLNCNPADDRSSTLKVFHIFSPCSPFRPSKPLS 63

Query: 63  WADNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMAL 122
           WADNVLQMQAKDQARLQFLSSLVA +SV PIASARQLIQSPTFVVRAKIGTPAQTLL+AL
Sbjct: 64  WADNVLQMQAKDQARLQFLSSLVARRSVVPIASARQLIQSPTFVVRAKIGTPAQTLLLAL 123

Query: 123 DTSNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTY 182
           DTSNDAAWIPCSGCVGCPSTTVFSSDKS+SFRPLPC+SPQCNQVPNPSCS +ACGFNLTY
Sbjct: 124 DTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGTACGFNLTY 183

Query: 183 GSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLY 242
           GSSTVAADLVQDN+TLA DSVPAYTFGCI KATGSSVPPQGLLGLGRGPLSLLGQSQSLY
Sbjct: 184 GSSTVAADLVQDNVTLATDSVPAYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLY 243

Query: 243 RSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIV 302
           RSTFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLIAIRVGRRIV
Sbjct: 244 RSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIV 303

Query: 303 DIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIG-NATISSLGGFDTCY 362
           DIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVR+EFRRR+G N T+SSLGGFDTCY
Sbjct: 304 DIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCY 363

Query: 363 TVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQN 422
           TVPIISPTITFMFAGMNVTLPPDNFL+HS+AG+TTCLAMAAAPDNVNSVLNVIASMQQQN
Sbjct: 364 TVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQN 423

Query: 423 HRILFDVPNSRMGVAREPCS 440
           HRILFD+PNSR+GVAREPCS
Sbjct: 424 HRILFDIPNSRVGVAREPCS 443

BLAST of CmaCh04G013250 vs. NCBI nr
Match: gi|449449334|ref|XP_004142420.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 771.2 bits (1990), Expect = 1.0e-219
Identity = 393/438 (89.73%), Postives = 414/438 (94.52%), Query Frame = 1

Query: 3   TKSPFLLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWA 62
           TKSPFLL LLLL L  S  +   H   NCN AADRSSTLQVFHIFSPCSPFRPSKPLSWA
Sbjct: 4   TKSPFLLLLLLLHL-FSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPLSWA 63

Query: 63  DNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDT 122
           DNVLQMQAKDQARLQFLSSLVA +S  PIASARQLIQSPTFVVRAKIGTPAQTLL+ALDT
Sbjct: 64  DNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDT 123

Query: 123 SNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGS 182
           SNDAAWIPCSGC+GCPSTTVFSSDKS+SFRPLPC+SPQCNQVPNPSCS SACGFNLTYGS
Sbjct: 124 SNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLTYGS 183

Query: 183 STVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRS 242
           STVAADLVQDN+TLA DSVP+YTFGCI KATGSSVPPQGLLGLGRGPLSLLGQSQSLY+S
Sbjct: 184 STVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQS 243

Query: 243 TFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDI 302
           TFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLI+IRVGR+IVDI
Sbjct: 244 TFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDI 303

Query: 303 PPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIG-NATISSLGGFDTCYTV 362
           PPSALAFNSATGAGT+IDSGTTFTRLVAPAYTAVR+EFRRR+G N T+SSLGGFDTCYTV
Sbjct: 304 PPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV 363

Query: 363 PIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHR 422
           PIISPTITFMFAGMNVTLPPDNFL+HS+AG+TTCLAMAAAPDNVNSVLNVIASMQQQNHR
Sbjct: 364 PIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHR 423

Query: 423 ILFDVPNSRMGVAREPCS 440
           ILFD+PNSR+GVARE CS
Sbjct: 424 ILFDIPNSRVGVARESCS 440

BLAST of CmaCh04G013250 vs. NCBI nr
Match: gi|802574128|ref|XP_012068673.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas])

HSP 1 Score: 664.1 bits (1712), Expect = 1.7e-187
Identity = 333/432 (77.08%), Postives = 385/432 (89.12%), Query Frame = 1

Query: 8   LLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQ 67
           LLSL  L  S+ A+ LH +  P C++  D+ STLQVFH++SPCSPFRPSKPLSW ++VLQ
Sbjct: 5   LLSLAFLFFSL-AQGLHLN--PKCSSQ-DQGSTLQVFHVYSPCSPFRPSKPLSWEESVLQ 64

Query: 68  MQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAA 127
           MQAKDQARLQFLSSLVA +S  PIAS RQ+IQSPT++VRAKIGTPAQTLL+A+DTSNDAA
Sbjct: 65  MQAKDQARLQFLSSLVAGRSFVPIASGRQIIQSPTYIVRAKIGTPAQTLLLAVDTSNDAA 124

Query: 128 WIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGSSTVAA 187
           WIPCSGC GC S+TVF S KSTSF+ + C +PQC QVPNP+CS SAC FN TYGSS++AA
Sbjct: 125 WIPCSGCDGC-SSTVFDSVKSTSFQTVGCGAPQCKQVPNPTCSGSACTFNTTYGSSSIAA 184

Query: 188 DLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYC 247
           +L QD ++LA DSVP YTFGCI KATGSSVPPQGLLGLGRGPLSLL Q+Q+LY+STFSYC
Sbjct: 185 NLSQDTVSLATDSVPGYTFGCIAKATGSSVPPQGLLGLGRGPLSLLSQTQNLYQSTFSYC 244

Query: 248 LPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSAL 307
           LPSF+S +FSG+LRLGP  QPKRIK TPLLRNPRRSSLYYVNL+AIRVGRR+VDIPPSAL
Sbjct: 245 LPSFRSLNFSGTLRLGPNGQPKRIKTTPLLRNPRRSSLYYVNLVAIRVGRRVVDIPPSAL 304

Query: 308 AFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIISPT 367
           AFN  TGAGTI DSGT FTRLV PAYTAVR+ FR+R+GNAT++SLGGFDTCY+VPI++PT
Sbjct: 305 AFNPTTGAGTIFDSGTVFTRLVTPAYTAVRDAFRKRVGNATVTSLGGFDTCYSVPIVAPT 364

Query: 368 ITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVP 427
           ITFMF+GMNVTLPP+N L+HS+AG+T+CLA+AAAPDNVNSVLNVIA+MQQQNHRILFDVP
Sbjct: 365 ITFMFSGMNVTLPPENLLIHSTAGSTSCLAIAAAPDNVNSVLNVIANMQQQNHRILFDVP 424

Query: 428 NSRMGVAREPCS 440
           NSR+GVARE C+
Sbjct: 425 NSRLGVAREQCT 431

BLAST of CmaCh04G013250 vs. NCBI nr
Match: gi|255543963|ref|XP_002513044.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Ricinus communis])

HSP 1 Score: 658.7 bits (1698), Expect = 7.2e-186
Identity = 324/432 (75.00%), Postives = 380/432 (87.96%), Query Frame = 1

Query: 8   LLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQ 67
           L SL  L  +++      H +P C    D+ S LQVFH++SPCSPF PSKPL W ++VLQ
Sbjct: 5   LFSLAFLFFTLAQGM---HLNPKCGIQ-DQGSNLQVFHVYSPCSPFWPSKPLKWEESVLQ 64

Query: 68  MQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAA 127
           MQAKDQARLQFLSSLVA KSV PIAS RQ++QSPT++VRAKIGTPAQT+L+A+DTSNDAA
Sbjct: 65  MQAKDQARLQFLSSLVARKSVVPIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAA 124

Query: 128 WIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGSSTVAA 187
           WIPCSGCVGC S+TVF++ KST+F+ + C +PQC QVPN  C  SAC FN+TYGSS++AA
Sbjct: 125 WIPCSGCVGC-SSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAA 184

Query: 188 DLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYC 247
           +L QD +TLA DS+P+YTFGC+T+ATGSS+PPQGLLGLGRGP+SLL Q+Q+LY+STFSYC
Sbjct: 185 NLSQDVVTLATDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYC 244

Query: 248 LPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSAL 307
           LPSF+S +FSGSLRLGPV QPKRIK TPLL+NPRRSSLYYVNL+AIRVGRR+VDIPPSAL
Sbjct: 245 LPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSAL 304

Query: 308 AFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIISPT 367
           AFN  TGAGTI DSGT FTRLVAPAYTAVR+ FR+R+GNAT++SLGGFDTCYT PI++PT
Sbjct: 305 AFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGGFDTCYTSPIVAPT 364

Query: 368 ITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVP 427
           ITFMF+GMNVTLPPDN L+HS+A + TCLAMAAAPDNVNSVLNVIA+MQQQNHRILFDVP
Sbjct: 365 ITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVP 424

Query: 428 NSRMGVAREPCS 440
           NSR+GVAREPC+
Sbjct: 425 NSRLGVAREPCT 431

BLAST of CmaCh04G013250 vs. NCBI nr
Match: gi|1009171231|ref|XP_015866632.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Ziziphus jujuba])

HSP 1 Score: 649.8 bits (1675), Expect = 3.4e-183
Identity = 326/435 (74.94%), Postives = 376/435 (86.44%), Query Frame = 1

Query: 6   PFLLSLLLLLLSISAESLHHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNV 65
           P L S   L L +   ++    + NC    D+ STLQV H++SPCSPFRPSKPLSW ++V
Sbjct: 3   PLLFSATPLALVLIITTIVQGLNLNCQNQ-DKGSTLQVLHVYSPCSPFRPSKPLSWEESV 62

Query: 66  LQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLLMALDTSND 125
           LQMQAKDQARLQFLSSLVA KSV PIAS RQ+IQSPT++VRAKIGTP QTLL+ALDTSND
Sbjct: 63  LQMQAKDQARLQFLSSLVARKSVVPIASGRQIIQSPTYIVRAKIGTPVQTLLLALDTSND 122

Query: 126 AAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFNLTYGSSTV 185
           AAWIPC+GCVGC S+  F+  KSTSF+ L C++PQC QVPNP+C+ + C FNLTYG S++
Sbjct: 123 AAWIPCAGCVGC-SSAAFAPIKSTSFKSLGCQAPQCRQVPNPTCTGTTCSFNLTYGGSSI 182

Query: 186 AADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFS 245
           AADL QD ITLAND+VPAYTFGCI KATGSS+PPQGLLGLGRGPLSLL Q+Q+LY+STFS
Sbjct: 183 AADLSQDTITLANDAVPAYTFGCIKKATGSSLPPQGLLGLGRGPLSLLSQTQNLYQSTFS 242

Query: 246 YCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPS 305
           YCLPSFKS +FSGSLRLGP+ QP RIKYTPLL+NPRRSSLYYVNL AIRVGR++VDIPP+
Sbjct: 243 YCLPSFKSLNFSGSLRLGPIGQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKVVDIPPA 302

Query: 306 ALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRI-GNATISSLGGFDTCYTVPII 365
            LAFN  TGAGTI DSGT FTRLVA AYTAVR+EFR+R+  NA I++LGGFDTCY+VPI 
Sbjct: 303 DLAFNPTTGAGTIFDSGTVFTRLVASAYTAVRDEFRKRVKPNAPITTLGGFDTCYSVPIT 362

Query: 366 SPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILF 425
           +PT+TFMF GMNVTLPPDN L+HS+AG+ TCLAMAAAPDNVNSVLNVIA+MQQQNHRIL+
Sbjct: 363 APTVTFMFTGMNVTLPPDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 422

Query: 426 DVPNSRMGVAREPCS 440
           DVPNSR+GVAREPC+
Sbjct: 423 DVPNSRLGVAREPCT 435

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AED3_ARATH9.6e-11452.38Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1[more]
AP25_ORYSJ1.2e-9545.18Aspartyl protease 25 OS=Oryza sativa subsp. japonica GN=AP25 PE=2 SV=1[more]
APF2_ARATH2.7e-4733.43Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
ASPG2_ARATH5.2e-4333.33Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
NEP1_NEPGR7.5e-4234.67Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KTR5_CUCSA7.0e-22089.73Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_5G623870 PE=3 SV=1[more]
A0A067L7Q3_JATCU1.2e-18777.08Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24539 PE=3 SV=1[more]
B9RG92_RICCO5.0e-18675.00Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1452350 ... [more]
A0A0B2PGF3_GLYSO4.0e-18372.97Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_045617 PE=3 SV=1[more]
I1JW44_SOYBN4.0e-18372.97Uncharacterized protein OS=Glycine max GN=GLYMA_04G136000 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G07030.11.0e-16666.59 Eukaryotic aspartyl protease family protein[more]
AT3G54400.13.8e-16167.28 Eukaryotic aspartyl protease family protein[more]
AT1G09750.15.4e-11552.38 Eukaryotic aspartyl protease family protein[more]
AT3G61820.15.4e-4634.64 Eukaryotic aspartyl protease family protein[more]
AT3G20015.12.9e-4433.33 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659092233|ref|XP_008446966.1|1.4e-22190.91PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo][more]
gi|449449334|ref|XP_004142420.1|1.0e-21989.73PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus][more]
gi|802574128|ref|XP_012068673.1|1.7e-18777.08PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas][more]
gi|255543963|ref|XP_002513044.1|7.2e-18675.00PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Ricinus communis][more]
gi|1009171231|ref|XP_015866632.1|3.4e-18374.94PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0048046 apoplast
cellular_component GO:0009507 chloroplast
cellular_component GO:0009505 plant-type cell wall
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh04G013250.1CmaCh04G013250.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 1..439
score: 4.1E
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 102..263
score: 8.6E-29coord: 265..439
score: 8.9
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 102..438
score: 1.19
NoneNo IPR availablePANTHERPTHR13683:SF258ASPARTYL PROTEASE FAMILY PROTEIN-RELATEDcoord: 1..439
score: 4.1E