CmaCh00G002950.1 (mRNA) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh00G002950.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: aspartyl protease family protein 2-like
Location: Cma_Chr00: 22633785 .. 22635474 (-)
Sequence length: 1680
RNA-Seq Expression: CmaCh00G002950.1
Synteny: CmaCh00G002950.1
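
The genomic span implied by the Location field can be compared with the reported sequence length. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state; the helper name `span_length` is made up for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1

# Values copied from the Overview fields.
start, end, strand = 22633785, 22635474, "-"
mrna_length = 1680

genomic_span = span_length(start, end)
difference = genomic_span - mrna_length
print(f"genomic span: {genomic_span} bp, mRNA: {mrna_length} bp, difference: {difference} bp")
```

Under that assumption the span is 10 bp longer than the mRNA. That would be consistent with a short intron between the two exons listed under Relationships, although the page itself does not say so.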
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATAATTCGACTCCCAAAAAAATTAAAAGTAAAAAATAAAACTTTTATAAAGTCCACTTTAATTCTTTATACTTCTTTTATTTATACTCTCACACCTGTGCCGAACTCGTACCTCCCGCACTCTCAAAAATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGAGTTCACTCAGTGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTTCGGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCGACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTGCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCGGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

mRNA sequence

ATGATAATTCGACTCCCAAAAAAATTAAAAGTAAAAAATAAAACTTTTATAAAGTCCACTTTAATTCTTTATACTTCTTTTATTTATACTCTCACACCTGTGCCGAACTCGTACCTCCCGCACTCTCAAAAATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGATGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTTCGGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCGACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTGCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCGGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

Coding sequence (CDS)

ATGATAATTCGACTCCCAAAAAAATTAAAAGTAAAAAATAAAACTTTTATAAAGTCCACTTTAATTCTTTATACTTCTTTTATTTATACTCTCACACCTGTGCCGAACTCGTACCTCCCGCACTCTCAAAAATGGAGGCAAAAACCAGTGCATTACCCTTTATCGGCTTTCTTCTCACTCTTCTCTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTCTCTCCACCGCCTTCTCCGACTTCCAGACCCTTATACCCAAACCTCTTCCGGCTTCACCTTCACTCTTCTCGCCTGAATCCGACACCGATTCCGAGTCTTTCATCTCCTCGGAGGCCGGCTTAGAGTTGCAGCTTCACCATTTGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCGGATGCTCTGTCTCTGAACAGAACGCCGGAGGAGCTCTTCCATCTCCGCCTTCAGAGAGACGCTCTCAGAGTCACGAAACTGATGGTGGCTCTCAGAATCTTAGCCAAGCTAGTGGGACCAGCCACGGGACCACTGGGTTCAGTAGCTCAGTGATCTCGGGACTCGCTCAGGGTAGCGGCGAGTACTTCACGCGCATCGGCGTTGGCACGCCGCCCAAGTATATCTACATGGTTCTTGACACTGGTAGCGATATTGTTTGGCTACAGTGCGCTCCCTGTAAGAATTGCTACTCTCAGACCGACCCGGTTTTCAACCCGGTTAAGTCCAGATCCTACTCCAAGGTCCTTTGCCGAACGCCGCTTTGTCTCCGGCTCGAATCTCCGGGGTGCAACCAGAAGCAGACGTGTCTCTACCAGGTTTCTTACGGGGACGGTTCCTATACCACTGGTGAATTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGCGGCCACGATAATGAGGGCTTATTCGTTGGTGCGGCGGGGCTTTTAGGTCTCGGTCGGGGAGGATTGTCATTTCCGTCGCAAGCTGGCCGGAGTTTCAATCAGAAATTCTCCTACTGCTTGGTGGACCGATCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTGACTCCGCCGTATCTAGAACCGCCCGGTTCACTCCTCTTCTCACAAACCCTAGGCTGGATACATTTTACTATGTCGAACTGTTAGGGATCAGCGTCGGAGGCACGCCTGTTTTCGGCATCTCCGCTTCACATTTCAAGCTCGATTCGAACGGAAATGGTGGAGTCATCATCGATTGCGGTACCTCCGTCACTCGATTAAACCGACCGGCGTACATAGCCTTGCGCGACGCCTTCCGTGCTGGAGCCTCGAGTTTGAAATTGGCCCCTGAGTTTTCCCTTTTCGATACTTGCTACGACTTATCCGGGAAAACGACGGTGAAGGTGCCGACGGTGGTGCTACATTTTAGAGGCGCTGACGTGTCGTTACCGGCGTCCAATTATCTTATCCCGGTCGACGACAACGGGAGGTTCTGCTTCGCCTTCGCTGGAACGACCAGTGGGCTGTCCATCATCGGCAACATTCAGCAGCAAGGATTCCGGGTCGTGTACGATTTGGCGGGTTCCCGGGTCGGATTCTCCCCTCGTGGTTGTGCCTGA

Protein sequence

MIIRLPKKLKVKNKTFIKSTLILYTSFIYTLTPVPNSYLPHSQKWRQKPVHYPLSAFFSLFSLSPPPSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA
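
The relationship between the CDS and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code; the page does not state which table the annotation pipeline used. `translate` is an illustrative helper, not part of any database tooling. The two fragments are the first 21 and last 18 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGATAATTCGACTCCCAAAA"))  # start of the CDS
print(translate("CCTCGTGGTTGTGCCTGA"))     # end of the CDS, including the stop codon
```

The two fragments translate to the first seven and last five residues of the protein sequence shown above.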
Homology
BLAST of CmaCh00G002950.1 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 4.7e-177
Identity = 331/495 (66.87%), Positives = 379/495 (76.57%), Query Frame = 0

Query: 79  FRLHLHSLSTAFSDFQTLIPK--PLP-ASPSLFSPESDTDSESFISSE----------AG 138
           F L L S S +   FQTL P    LP ASP  F P  D+DSES + SE          + 
Sbjct: 15  FFLSLPSFS-SLPSFQTLFPNSHSLPCASPVSFQP--DSDSESLLESEFESGSDSESSSS 74

Query: 139 LELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGG 198
           + L L H+DALS N+TP+ELF  RLQRD+     S       +P      R+ +H    G
Sbjct: 75  ITLNLDHIDALSSNKTPDELFSSRLQRDSRRVK-SIATLAAQIPG-----RNVTHAPRPG 134

Query: 199 SQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQC 258
                          GFSSSV+SGL+QGSGEYFTR+GVGTP +Y+YMVLDTGSDIVWLQC
Sbjct: 135 ---------------GFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC 194

Query: 259 APCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTT 318
           APC+ CYSQ+DP+F+P KS++Y+ + C +P C RL+S GCN +++TCLYQVSYGDGS+T 
Sbjct: 195 APCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTV 254

Query: 319 GEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSY 378
           G+F TETLTFRR +V+ VALGCGHDNEGLFVGAAGLLGLG+G LSFP Q G  FNQKFSY
Sbjct: 255 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 314

Query: 379 CLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISAS 438
           CLVDRSASSKPSSVVFG++AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS
Sbjct: 315 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 374

Query: 439 HFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGK 498
            FKLD  GNGGVIID GTSVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS  
Sbjct: 375 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNM 434

Query: 499 TTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVY 558
             VKVPTVVLHFRGADVSLPA+NYLIPVD NG+FCFAFAGT  GLSIIGNIQQQGFRVVY
Sbjct: 435 NEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVY 485

Query: 559 DLAGSRVGFSPRGCA 560
           DLA SRVGF+P GCA
Sbjct: 495 DLASSRVGFAPGGCA 485
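
The percentages in the HSP header are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for this hit (`percent` is an illustrative helper name):

```python
def percent(matches: int, aligned_length: int) -> str:
    """Percentage of aligned columns, formatted to two decimals as in the report."""
    return f"{100 * matches / aligned_length:.2f}"

# Counts from the Q9LNJ3 HSP header: 331 identical and 379 positive columns out of 495.
print("Identity:  ", percent(331, 495))
print("Positives: ", percent(379, 495))
```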

BLAST of CmaCh00G002950.1 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 8.0e-108
Identity = 236/503 (46.92%), Positives = 304/503 (60.44%), Query Frame = 0

Query: 62  SLSPPPSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIP-KPLPASPSLFSPESDTDSESFI 121
           SLS PP            + ++  + ++    QT++   P  +S +   PES +D   F 
Sbjct: 28  SLSTPP------------KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSD-PVFF 87

Query: 122 SSEAGLELQLHHLDAL--SLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQ 181
           +S + L L+LH  D    S ++  + L   RL+RD      S + AG       ++ R  
Sbjct: 88  NSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERD------SSRVAGIV-----AKIRFA 147

Query: 182 SHETDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGS 241
               D           T + T   ++ V+SG +QGSGEYF+RIGVGTP K +Y+VLDTGS
Sbjct: 148 VEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGS 207

Query: 242 DIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYG 301
           D+ W+QC PC +CY Q+DPVFNP  S +Y  + C  P C  LE+  C   + CLYQVSYG
Sbjct: 208 DVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNK-CLYQVSYG 267

Query: 302 DGSYTTGEFVTETLTFRRT-KVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRS 361
           DGS+T GE  T+T+TF  + K+  VALGCGHDNEGLF GAAGLLGLG G LS  +Q   +
Sbjct: 268 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 327

Query: 362 FNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTP 421
               FSYCLVDR  S K SS+ F    +       PLL N ++DTFYYV L G SVGG  
Sbjct: 328 ---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEK 387

Query: 422 VFGISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKL-APEFSLFD 481
           V  +  + F +D++G+GGVI+DCGT+VTRL   AY +LRDAF     +LK  +   SLFD
Sbjct: 388 VV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFD 447

Query: 482 TCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNI 541
           TCYD S  +TVKVPTV  HF G   + LPA NYLIPVDD+G FCFAFA T+S LSIIGN+
Sbjct: 448 TCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNV 500

Query: 542 QQQGFRVVYDLAGSRVGFSPRGC 559
           QQQG R+ YDL+ + +G S   C
Sbjct: 508 QQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh00G002950.1 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 2.6e-106
Identity = 220/493 (44.62%), Positives = 293/493 (59.43%), Query Frame = 0

Query: 73  PLYPNLFRLHLH---SLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQ 132
           PL+     LHLH   S S +F DFQ +     P + +   P+ +    S  SS +   L+
Sbjct: 4   PLFFFFLHLHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPDFNNTHFSDESS-SKYTLR 63

Query: 133 LHHLDALS--LNRTPEELFHLRLQRDALGCS-VSEQNAGGALPSPPSERRSQSHETDGGS 192
           L H D       R      H R++RD    S +  + +G  +PS                
Sbjct: 64  LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS---------------- 123

Query: 193 QNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCA 252
                 S + +    F S ++SG+ QGSGEYF RIGVG+PP+  YMV+D+GSD+VW+QC 
Sbjct: 124 ------SDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ 183

Query: 253 PCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGE 312
           PCK CY Q+DPVF+P KS SY+ V C + +C R+E+ GC+    C Y+V YGDGSYT G 
Sbjct: 184 PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCH-SGGCRYEVMYGDGSYTKGT 243

Query: 313 FVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCL 372
              ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G +SF  Q        F YCL
Sbjct: 244 LALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCL 303

Query: 373 VDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHF 432
           V R   S   S+VFG  A+   A + PL+ NPR  +FYYV L G+ VGG  +  +    F
Sbjct: 304 VSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI-PLPDGVF 363

Query: 433 KLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTT 492
            L   G+GGV++D GT+VTRL   AY+A RD F++  ++L  A   S+FDTCYDLSG  +
Sbjct: 364 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 423

Query: 493 VKVPTVVLHF-RGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYD 552
           V+VPTV  +F  G  ++LPA N+L+PVDD+G +CFAFA + +GLSIIGNIQQ+G +V +D
Sbjct: 424 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 470

Query: 553 LAGSRVGFSPRGC 559
            A   VGF P  C
Sbjct: 484 GANGFVGFGPNVC 470

BLAST of CmaCh00G002950.1 vs. ExPASy Swiss-Prot
Match: Q9LEW3 (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 2.0e-71
Identity = 151/354 (42.66%), Positives = 202/354 (57.06%), Query Frame = 0

Query: 208 SGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPC-KNCYSQTDPVFNPVKSRS 267
           SG+  GSG Y   IG+GTP   + +V DTGSD+ W QC PC  +CYSQ +P FNP  S +
Sbjct: 123 SGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSST 182

Query: 268 YSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV-ERVALG 327
           Y  V C +P+C   ES  C+    C+Y + YGD S+T G    E  T   + V E V  G
Sbjct: 183 YQNVSCSSPMCEDAES--CS-ASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFG 242

Query: 328 CGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAV 387
           CG +N+GLF G AGLLGLG G LS P+Q   ++N  FSYCL   +++S    + FG + +
Sbjct: 243 CGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS-TGHLTFGSAGI 302

Query: 388 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVT 447
           S + +FTP+ + P     Y ++++GISVG   +  I+ + F  +     G IID GT  T
Sbjct: 303 SESVKFTPISSFPSAFN-YGIDIIGISVGDKEL-AITPNSFSTE-----GAIIDSGTVFT 362

Query: 448 RLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD-VSLP 507
           RL    Y  LR  F+   SS K    + LFDTCYD +G  TV  PT+   F G+  V L 
Sbjct: 363 RLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELD 422

Query: 508 ASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 559
            S   +P+  + + C AFAG     +I GN+QQ    VVYD+AG RVGF+P GC
Sbjct: 423 GSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of CmaCh00G002950.1 vs. ExPASy Swiss-Prot
Match: Q8S9J6 (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 2.0e-71
Identity = 156/381 (40.94%), Positives = 212/381 (55.64%), Query Frame = 0

Query: 189 LSQASGTSHGTTGFSSSVIS--GLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCA 248
           LS+   T H +   S+ + +  G   GSG Y   +G+GTP   + ++ DTGSD+ W QC 
Sbjct: 102 LSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQ 161

Query: 249 PC-KNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLES----PGCNQKQTCLYQVSYGDGS 308
           PC + CY Q +P+FNP KS SY  V C +  C  L S     G      C+Y + YGD S
Sbjct: 162 PCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQS 221

Query: 309 YTTGEFVTETLTFRRTKV-ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQ 368
           ++ G    E  T   + V + V  GCG +N+GLF G AGLLGLGR  LSFPSQ   ++N+
Sbjct: 222 FSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNK 281

Query: 369 KFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFG 428
            FSYCL   S++S    + FG + +SR+ +FTP+ T     +FY + ++ I+VGG  +  
Sbjct: 282 IFSYCL--PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKL-P 341

Query: 429 ISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYD 488
           I ++ F        G +ID GT +TRL   AY ALR +F+A  S        S+ DTC+D
Sbjct: 342 IPSTVF-----STPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFD 401

Query: 489 LSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTT--SGLSIIGNIQQQ 548
           LSG  TV +P V   F G  V    S  +  V    + C AFAG +  S  +I GN+QQQ
Sbjct: 402 LSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQ 461

Query: 549 GFRVVYDLAGSRVGFSPRGCA 560
              VVYD AG RVGF+P GC+
Sbjct: 462 TLEVVYDGAGGRVGFAPNGCS 474

BLAST of CmaCh00G002950.1 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 622.5 bits (1604), Expect = 3.4e-178
Identity = 331/495 (66.87%), Positives = 379/495 (76.57%), Query Frame = 0

Query: 79  FRLHLHSLSTAFSDFQTLIPK--PLP-ASPSLFSPESDTDSESFISSE----------AG 138
           F L L S S +   FQTL P    LP ASP  F P  D+DSES + SE          + 
Sbjct: 15  FFLSLPSFS-SLPSFQTLFPNSHSLPCASPVSFQP--DSDSESLLESEFESGSDSESSSS 74

Query: 139 LELQLHHLDALSLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGG 198
           + L L H+DALS N+TP+ELF  RLQRD+     S       +P      R+ +H    G
Sbjct: 75  ITLNLDHIDALSSNKTPDELFSSRLQRDSRRVK-SIATLAAQIPG-----RNVTHAPRPG 134

Query: 199 SQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQC 258
                          GFSSSV+SGL+QGSGEYFTR+GVGTP +Y+YMVLDTGSDIVWLQC
Sbjct: 135 ---------------GFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC 194

Query: 259 APCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCN-QKQTCLYQVSYGDGSYTT 318
           APC+ CYSQ+DP+F+P KS++Y+ + C +P C RL+S GCN +++TCLYQVSYGDGS+T 
Sbjct: 195 APCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTV 254

Query: 319 GEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSY 378
           G+F TETLTFRR +V+ VALGCGHDNEGLFVGAAGLLGLG+G LSFP Q G  FNQKFSY
Sbjct: 255 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 314

Query: 379 CLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISAS 438
           CLVDRSASSKPSSVVFG++AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS
Sbjct: 315 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 374

Query: 439 HFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGK 498
            FKLD  GNGGVIID GTSVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS  
Sbjct: 375 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNM 434

Query: 499 TTVKVPTVVLHFRGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVY 558
             VKVPTVVLHFRGADVSLPA+NYLIPVD NG+FCFAFAGT  GLSIIGNIQQQGFRVVY
Sbjct: 435 NEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVY 485

Query: 559 DLAGSRVGFSPRGCA 560
           DLA SRVGF+P GCA
Sbjct: 495 DLASSRVGFAPGGCA 485

BLAST of CmaCh00G002950.1 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 556.2 bits (1432), Expect = 3.0e-158
Identity = 294/482 (61.00%), Positives = 351/482 (72.82%), Query Frame = 0

Query: 87  STAFSDFQTLIPKPLPASPSLFSPESDT-DSESFISSEAGLELQLHHLDALS--LNRTPE 146
           S+A S +QTL+   LP+S +L  PES++   ES   S   L + L H+DALS   + +P 
Sbjct: 21  SSASSQYQTLVVNTLPSSATLSWPESESLTDESLSESTTSLSVHLSHVDALSSFSDASPA 80

Query: 147 ELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQSHETDGGSQNLSQASGTSHGTTGFS 206
           +LF+LRLQRD+L                   R          S   +    T     GFS
Sbjct: 81  DLFNLRLQRDSL-------------------RVKSITSLAAVSTGRNATKRTPRTAGGFS 140

Query: 207 SSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVK 266
            +VISGL+QGSGEYF R+GVGTP   +YMVLDTGSD+VWLQC+PCK CY+QTD +F+P K
Sbjct: 141 GAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKK 200

Query: 267 SRSYSKVLCRTPLCLRL-ESPGC--NQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 326
           S++++ V C + LC RL +S  C   + +TCLYQVSYGDGS+T G+F TETLTF   +V+
Sbjct: 201 SKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVD 260

Query: 327 RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDR----SASSKPS 386
            V LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQ    +N KFSYCLVDR    S+S  PS
Sbjct: 261 HVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 320

Query: 387 SVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGV 446
           ++VFG++AV +T+ FTPLLTNP+LDTFYY++LLGISVGG+ V G+S S FKLD+ GNGGV
Sbjct: 321 TIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 380

Query: 447 IIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHF 506
           IID GTSVTRL +PAY+ALRDAFR GA+ LK AP +SLFDTC+DLSG TTVKVPTVV HF
Sbjct: 381 IIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF 440

Query: 507 RGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPR 559
            G +VSLPASNYLIPV+  GRFCFAFAGT   LSIIGNIQQQGFRV YDL GSRVGF  R
Sbjct: 441 GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSR 483

BLAST of CmaCh00G002950.1 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 399.8 bits (1026), Expect = 3.5e-111
Identity = 206/354 (58.19%), Positives = 249/354 (70.34%), Query Frame = 0

Query: 206 VISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSR 265
           +ISG  QGSGEYFTR+G+G P + +YMVLDTGSD+ WLQC PC +CY QT+P+F P  S 
Sbjct: 137 LISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSS 196

Query: 266 SYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 325
           SY  + C TP C  LE   C +  TCLY+VSYGDGSYT G+F TETLT   T V+ VA+G
Sbjct: 197 SYEPLSCDTPQCNALEVSEC-RNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVG 256

Query: 326 CGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVFGDSAV 385
           CGH NEGLFVGAAGLLGLG G L+ PSQ   +    FSYCLVDR + S  S+V FG S +
Sbjct: 257 CGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDS-ASTVDFGTS-L 316

Query: 386 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHFKLDSNGNGGVIIDCGTSVT 445
           S  A   PLL N +LDTFYY+ L GISVGG  +  I  S F++D +G+GG+IID GT+VT
Sbjct: 317 SPDAVVAPLLRNHQLDTFYYLGLTGISVGG-ELLQIPQSSFEMDESGSGGIIIDSGTAVT 376

Query: 446 RLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD-VSLP 505
           RL    Y +LRD+F  G   L+ A   ++FDTCY+LS KTTV+VPTV  HF G   ++LP
Sbjct: 377 RLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALP 436

Query: 506 ASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGC 559
           A NY+IPVD  G FC AFA T S L+IIGN+QQQG RV +DLA S +GFS   C
Sbjct: 437 AKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of CmaCh00G002950.1 vs. TAIR 10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 392.5 bits (1007), Expect = 5.7e-109
Identity = 236/503 (46.92%), Positives = 304/503 (60.44%), Query Frame = 0

Query: 62  SLSPPPSPTSRPLYPNLFRLHLHSLSTAFSDFQTLIP-KPLPASPSLFSPESDTDSESFI 121
           SLS PP            + ++  + ++    QT++   P  +S +   PES +D   F 
Sbjct: 28  SLSTPP------------KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSD-PVFF 87

Query: 122 SSEAGLELQLHHLDAL--SLNRTPEELFHLRLQRDALGCSVSEQNAGGALPSPPSERRSQ 181
           +S + L L+LH  D    S ++  + L   RL+RD      S + AG       ++ R  
Sbjct: 88  NSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERD------SSRVAGIV-----AKIRFA 147

Query: 182 SHETDGGSQNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGS 241
               D           T + T   ++ V+SG +QGSGEYF+RIGVGTP K +Y+VLDTGS
Sbjct: 148 VEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGS 207

Query: 242 DIVWLQCAPCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYG 301
           D+ W+QC PC +CY Q+DPVFNP  S +Y  + C  P C  LE+  C   + CLYQVSYG
Sbjct: 208 DVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNK-CLYQVSYG 267

Query: 302 DGSYTTGEFVTETLTFRRT-KVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRS 361
           DGS+T GE  T+T+TF  + K+  VALGCGHDNEGLF GAAGLLGLG G LS  +Q   +
Sbjct: 268 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 327

Query: 362 FNQKFSYCLVDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTP 421
               FSYCLVDR  S K SS+ F    +       PLL N ++DTFYYV L G SVGG  
Sbjct: 328 ---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEK 387

Query: 422 VFGISASHFKLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKL-APEFSLFD 481
           V  +  + F +D++G+GGVI+DCGT+VTRL   AY +LRDAF     +LK  +   SLFD
Sbjct: 388 VV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFD 447

Query: 482 TCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNI 541
           TCYD S  +TVKVPTV  HF G   + LPA NYLIPVDD+G FCFAFA T+S LSIIGN+
Sbjct: 448 TCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNV 500

Query: 542 QQQGFRVVYDLAGSRVGFSPRGC 559
           QQQG R+ YDL+ + +G S   C
Sbjct: 508 QQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh00G002950.1 vs. TAIR 10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 387.5 bits (994), Expect = 1.8e-107
Identity = 220/493 (44.62%), Positives = 293/493 (59.43%), Query Frame = 0

Query: 73  PLYPNLFRLHLH---SLSTAFSDFQTLIPKPLPASPSLFSPESDTDSESFISSEAGLELQ 132
           PL+     LHLH   S S +F DFQ +     P + +   P+ +    S  SS +   L+
Sbjct: 4   PLFFFFLHLHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPDFNNTHFSDESS-SKYTLR 63

Query: 133 LHHLDALS--LNRTPEELFHLRLQRDALGCS-VSEQNAGGALPSPPSERRSQSHETDGGS 192
           L H D       R      H R++RD    S +  + +G  +PS                
Sbjct: 64  LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS---------------- 123

Query: 193 QNLSQASGTSHGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCA 252
                 S + +    F S ++SG+ QGSGEYF RIGVG+PP+  YMV+D+GSD+VW+QC 
Sbjct: 124 ------SDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ 183

Query: 253 PCKNCYSQTDPVFNPVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGE 312
           PCK CY Q+DPVF+P KS SY+ V C + +C R+E+ GC+    C Y+V YGDGSYT G 
Sbjct: 184 PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCH-SGGCRYEVMYGDGSYTKGT 243

Query: 313 FVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCL 372
              ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G +SF  Q        F YCL
Sbjct: 244 LALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCL 303

Query: 373 VDRSASSKPSSVVFGDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVFGISASHF 432
           V R   S   S+VFG  A+   A + PL+ NPR  +FYYV L G+ VGG  +  +    F
Sbjct: 304 VSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI-PLPDGVF 363

Query: 433 KLDSNGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTT 492
            L   G+GGV++D GT+VTRL   AY+A RD F++  ++L  A   S+FDTCYDLSG  +
Sbjct: 364 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 423

Query: 493 VKVPTVVLHF-RGADVSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYD 552
           V+VPTV  +F  G  ++LPA N+L+PVDD+G +CFAFA + +GLSIIGNIQQ+G +V +D
Sbjct: 424 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 470

Query: 553 LAGSRVGFSPRGC 559
            A   VGF P  C
Sbjct: 484 GANGFVGFGPNVC 470

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
Q9LNJ3       4.7e-177  66.87         Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q9LS40       8.0e-108  46.92         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LHE3       2.6e-106  44.62         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LEW3       2.0e-71   42.66         Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1
Q8S9J6       2.0e-71   40.94         Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At...

Match Name   E-value   Identity (%)  Description
AT1G01300.1  3.4e-178  66.87         Eukaryotic aspartyl protease family protein
AT3G61820.1  3.0e-158  61.00         Eukaryotic aspartyl protease family protein
AT1G25510.1  3.5e-111  58.19         Eukaryotic aspartyl protease family protein
AT3G18490.1  5.7e-109  46.92         Eukaryotic aspartyl protease family protein
AT3G20015.1  1.8e-107  44.62         Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term      Source Description                       Alignment
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792          PEPSIN                                   coord: 530..545, score: 26.83; coord: 436..447, score: 39.97; coord: 223..243, score: 40.67
IPR001461  Aspartic peptidase A1 family           PANTHER      PTHR13683        ASPARTYL PROTEASES                       coord: 116..558
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543          TAXi_N                                   coord: 217..382, e-value: 3.3E-55, score: 187.0
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541          TAXi_C                                   coord: 403..554, e-value: 2.6E-35, score: 121.6
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10       Acid Proteases                           coord: 380..559, e-value: 2.0E-53, score: 182.8
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10       Acid Proteases                           coord: 186..379, e-value: 5.2E-55, score: 188.4
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630            Acid proteases                           coord: 210..558
None       No IPR available                       MOBIDB_LITE  mobidb-lite      disorder_prediction                      coord: 182..200
None       No IPR available                       MOBIDB_LITE  mobidb-lite      disorder_prediction                      coord: 160..200
None       No IPR available                       PANTHER      PTHR13683:SF761  ASPARTYL PROTEASE FAMILY PROTEIN 2-LIKE  coord: 116..558
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141          ASP_PROTEASE                             coord: 232..243
IPR033121  Peptidase family A1 domain             PROSITE      PS51767          PEPTIDASE_A1                             coord: 217..554, score: 47.141079
IPR033873  CND41-like                             CDD          cd05472          cnd41_like                               coord: 216..558, e-value: 2.02598E-139, score: 403.961
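
The coordinates in the InterPro table can be compared to see how the predicted features nest. The sketch below is only an illustration: it treats each `coord` range as a closed interval on the protein sequence, and the helpers `contains` and `overlaps` are made-up names.

```python
# (start, end) residue ranges copied from the coord entries above.
features = {
    "PEPTIDASE_A1": (217, 554),  # PROSITE PS51767
    "TAXi_N": (217, 382),        # PFAM PF14543
    "TAXi_C": (403, 554),        # PFAM PF14541
    "ASP_PROTEASE": (232, 243),  # PROSITE PS00141 (active site)
}

def contains(outer, inner):
    """True if the closed interval `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

def overlaps(a, b):
    """True if two closed intervals share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]

domain = features["PEPTIDASE_A1"]
for name in ("TAXi_N", "TAXi_C", "ASP_PROTEASE"):
    print(name, "within PEPTIDASE_A1:", contains(domain, features[name]))
print("TAXi_N overlaps TAXi_C:", overlaps(features["TAXi_N"], features["TAXi_C"]))
```

With these ranges, both TAXi halves and the active-site motif fall inside the Peptidase A1 domain, and the two TAXi ranges do not overlap.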

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CmaCh00G002950  CmaCh00G002950  gene


The following exon feature(s) are a part of this mRNA:

Feature Name                Unique Name                 Type
CmaCh00G002950.1:exon:1608  CmaCh00G002950.1:exon:1608  exon
CmaCh00G002950.1:exon:1609  CmaCh00G002950.1:exon:1609  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name             Type
CmaCh00G002950.1:cds  CmaCh00G002950.1:cds_2  CDS
CmaCh00G002950.1:cds  CmaCh00G002950.1:cds    CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name               Type
CmaCh00G002950.1  CmaCh00G002950.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
molecular_function  GO:0004190      aspartic-type endopeptidase activity