CmaCh16G011400 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G011400
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr16: 8762211 .. 8763653 (+)
RNA-Seq Expression: CmaCh16G011400
Synteny: CmaCh16G011400
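
The location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, it can be split into its parts as shown below. The helper name `parse_location` and the regular expression are illustrative assumptions, not part of any database API, and the length calculation assumes the coordinates are 1-based and inclusive.

```python
import re


def parse_location(text):
    """Split 'Chr: start .. end (strand)' into its parts (hypothetical helper)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


loc = parse_location("Cma_Chr16: 8762211 .. 8763653 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Cma_Chr16 + 1443
```

Under that assumption the feature spans 1443 bases on the forward strand.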
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCCCCTGTTTTCCTCTTCCTCCTCTGTTTTCTCCTTCCTTCCCCTGTTTTCTCCTCACAGATTCTGCTCTTACCTCTCTCTAATTCCTTATCATCCTCATCCGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCACGCTCTTCCGCCCGCTTCCACCACCGCCGCCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTCTCCCCCGGCGGCGATTATACTCTCTCCTTCAACCTCGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCTCAAATCAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGCTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCACCGTCACCGGCGATCAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGTGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGATTCAACTCGCCACTTTCTCTCCCCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCATGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTTGGGTCGGTGATGATTCCGGCGCCGGAGTTTTTGAAAAAGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTGGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCGAGCCGGGCGAGTCAGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAAATCAGTGGAAGTGCCACGTGTCGTGTTACACTTCGTTGGGGAAAAATCCAGTGTGATGCTTCCTAGAAAAAATTATTTCTATGAGTTCTTGGACGGCGGAGATGGGGTGGGGAGAAAGATAAAAGTCGGGTGTTTAATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGGCAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA

mRNA sequence

ATGGCTTCCCCTGTTTTCCTCTTCCTCCTCTGTTTTCTCCTTCCTTCCCCTGTTTTCTCCTCACAGATTCTGCTCTTACCTCTCTCTAATTCCTTATCATCCTCATCCGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCACGCTCTTCCGCCCGCTTCCACCACCGCCGCCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTCTCCCCCGGCGGCGATTATACTCTCTCCTTCAACCTCGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCTCAAATCAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGCTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCACCGTCACCGGCGATCAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGTGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGATTCAACTCGCCACTTTCTCTCCCCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCATGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTTGGGTCGGTGATGATTCCGGCGCCGGAGTTTTTGAAAAAGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTGGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCGAGCCGGGCGAGTCAGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAAATCAGTGGAAGTGCCACGTGTCGTGTTACACTTCGTTGGGGAAAAATCCAGTGTGATGCTTCCTAGAAAAAATTATTTCTATGAGTTCTTGGACGGCGGAGATGGGGTGGGGAGAAAGATAAAAGTCGGGTGTTTAATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGGCAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA

Coding sequence (CDS)

ATGGCTTCCCCTGTTTTCCTCTTCCTCCTCTGTTTTCTCCTTCCTTCCCCTGTTTTCTCCTCACAGATTCTGCTCTTACCTCTCTCTAATTCCTTATCATCCTCATCCGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCACGCTCTTCCGCCCGCTTCCACCACCGCCGCCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTCTCCCCCGGCGGCGATTATACTCTCTCCTTCAACCTCGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCTCAAATCAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGCTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCACCGTCACCGGCGATCAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGTGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGATTCAACTCGCCACTTTCTCTCCCCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCATGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTTGGGTCGGTGATGATTCCGGCGCCGGAGTTTTTGAAAAAGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTGGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCGAGCCGGGCGAGTCAGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAAATCAGTGGAAGTGCCACGTGTCGTGTTACACTTCGTTGGGGAAAAATCCAGTGTGATGCTTCCTAGAAAAAATTATTTCTATGAGTTCTTGGACGGCGGAGATGGGGTGGGGAGAAAGATAAAAGTCGGGTGTTTAATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGGCAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA

Protein sequence

MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNRS
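
The protein sequence follows from the CDS by reading it three bases at a time under the standard genetic code. The sketch below shows that translation step. The names `CODON_TABLE` and `translate` are illustrative, and it assumes the standard nuclear code with translation stopping at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS above.
print(translate("ATGGCTTCCCCTGTTTTC"))  # MASPVF
# Last four codons of the CDS above, including the TAA stop.
print(translate("AACCGCAGTTAA"))  # NRS
```

The two fragments reproduce the start (MASPVF) and end (NRS) of the listed protein sequence.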
Homology
BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 4.7e-165
Identity = 292/477 (61.22%), Positives = 354/477 (74.21%), Query Frame = 0

Query: 24  LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFN 83
           LLL LS+SLS+S   ++  +LLKS+++RSSARF        +  LSLP+S G DY +S +
Sbjct: 29  LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88

Query: 84  LGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQ-KSVSCSAAACSA 143
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S+   +VSCS+ +CSA
Sbjct: 89  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148

Query: 144 AHGGSLSASHLCAISRCPLESIEVSEC--SSFSCPPFYYAYGDGSLIGRLYRDSLSLPAP 203
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+ +LY DSLSL   
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208

Query: 204 APSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSF 263
              P+++V NFTFGCAH+ L EPIGVAGFGRG LS+P QLA  SP LGN FSYCLVSHSF
Sbjct: 209 ---PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 268

Query: 264 AADRVRRPSPLILGRYYG--------------------SETEFIYTSMLENPKHPYFYSV 323
            +DRVRRPSPLILGR+                       + EF++T MLENPKHPYFYSV
Sbjct: 269 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 328

Query: 324 GLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVA 383
            L GIS+G   IPAP  L+++D+ G GGVVVDSGTTFTMLPA  YNSVV +F++R GRV 
Sbjct: 329 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 388

Query: 384 SRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKI 443
            RA ++E ++G+SPCYY  ++V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG   K 
Sbjct: 389 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 448

Query: 444 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 478
           K+GCLMLMNGGDE+EL GG GA LGNYQQQGFEV YDL N RVGFA+R+C++LWDSL
Sbjct: 449 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 3.8e-34
Identity = 120/405 (29.63%), Positives = 181/405 (44.69%), Query Frame = 0

Query: 76  GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSC 135
           G+Y ++ ++G+ +Q  S  MDTGSDL+W  C P  C  C          P  + Q S S 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP--CTQC-----FNQSTPIFNPQGSSSF 152

Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGS-LIGRLYRDS 195
           S   CS         S LC       +++    CS+  C  + Y YGDGS   G +  ++
Sbjct: 153 STLPCS---------SQLC-------QALSSPTCSNNFC-QYTYGYGDGSETQGSMGTET 212

Query: 196 LSLPAPAPSPAINVRNFTFGCAHS----ALGEPIGVAGFGRGLLSMPIQLATFSPQLGNR 255
           L+        ++++ N TFGC  +      G   G+ G GRG LS+P QL         +
Sbjct: 213 LTF------GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV------TK 272

Query: 256 FSYCLVSHSFAADRVRRPSPLILGRYYGSETE-FIYTSMLENPKHPYFYSVGLAGISVGS 315
           FSYC+     +      PS L+LG    S T     T+++++ + P FY + L G+SVGS
Sbjct: 273 FSYCMTPIGSST-----PSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 332

Query: 316 VMIPA-PEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEE 375
             +P  P         G+GG+++DSGTT T      Y SV  +F ++        S    
Sbjct: 333 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGS---- 392

Query: 376 NTGLSPCYYY---EKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCL 435
           ++G   C+       ++++P  V+HF G    + LP +NYF    +G         + CL
Sbjct: 393 SSGFDLCFQTPSDPSNLQIPTFVMHFDG--GDLELPSENYFISPSNG---------LICL 434

Query: 436 MLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 471
            + +      +        GN QQQ   V YD  N+ V FA  QC
Sbjct: 453 AMGSSSQGMSI-------FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.1e-33
Identity = 121/401 (30.17%), Positives = 169/401 (42.14%), Query Frame = 0

Query: 76  GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSC 135
           G+Y     +G+ ++++ L +DTGSD+ W  C P  C  C  +          S  KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219

Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGRLYRDS 195
           SA  CS                      +E S C S  C  +  +YGDGS  +G L  D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279

Query: 196 LSLPAPAPSPAINVRNFTFGCAHSALG---EPIGVAGFGRGLLSMPIQLATFSPQLGNRF 255
           ++        +  + N   GC H   G      G+ G G G+LS+  Q+   S      F
Sbjct: 280 VTF-----GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339

Query: 256 SYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGISVGSVM 315
           SYCLV            + + LG   G  T      +L N K   FY VGL+G SVG   
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLGG--GDAT----APLLRNKKIDTFYYVGLSGFSVGGEK 399

Query: 316 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTG 375
           +  P+ +  VD  GSGGV++D GT  T L    YNS+   F   T  +   +S I   + 
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSI---SL 459

Query: 376 LSPCYYYE--KSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMN 435
              CY +    +V+VP V  HF G K S+ LP KNY     D G          C     
Sbjct: 460 FDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFAP 500

Query: 436 GGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 471
                 +       +GN QQQG  + YDL  N +G +  +C
Sbjct: 520 TSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 1.6e-32
Identity = 130/410 (31.71%), Positives = 186/410 (45.37%), Query Frame = 0

Query: 72  LSPG-GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQ 131
           LS G G+Y     +G+ ++ + + +DTGSD+VW  C+P  C  C  +       P    +
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 132 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGR 191
           KS + +   CS+ H   L ++  C   R          C       +  +YGDGS  +G 
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254

Query: 192 LYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVA---GFGRGLLSMPIQLATFSPQ 251
              ++L+           V+    GC H   G  +G A   G G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---GHR 314

Query: 252 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGIS 311
              +FSYCLV  S ++    +PS ++ G    S     +T +L NPK   FY VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGIS 374

Query: 312 VGSVMIP-APEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQ 371
           VG   +P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A    +
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 372 IEENTGLSPCYYYE--KSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVG 431
             + +    C+       V+VP VVLHF G  + V LP  NY                  
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIP--------------- 485

Query: 432 CLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVAYDLENNRVGFARRQCS 472
             +  NG      AG  G  + +GN QQQGF V YDL ++RVGFA   C+
Sbjct: 495 --VDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 1.0e-31
Identity = 124/440 (28.18%), Positives = 189/440 (42.95%), Query Frame = 0

Query: 42  HNLLKSTAARSSARFHH-RRRTHHRSHLSLPLSPG-GDYTLSFNLGSESQKISLYMDTGS 101
           + L+K    R   R           S +  P+  G G+Y ++  +G+     S  MDTGS
Sbjct: 58  YELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGS 117

Query: 102 DLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRC 161
           DL+W  C P  C  C        P P  + Q S S S   C + +   L           
Sbjct: 118 DLIWTQCEP--CTQC-----FSQPTPIFNPQDSSSFSTLPCESQYCQDL----------- 177

Query: 162 PLESIEVSECSSFSCPPFYYAYGDGSLI-GRLYRDSLSLPAPAPSPAINVRNFTFGCAHS 221
           P E+   +EC       + Y YGDGS   G +  ++ +          +V N  FGC   
Sbjct: 178 PSETCNNNECQ------YTYGYGDGSTTQGYMATETFTFETS------SVPNIAFGCGED 237

Query: 222 ----ALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILG 281
                 G   G+ G G G LS+P QL         +FSYC+ S+  ++     PS L LG
Sbjct: 238 NQGFGQGNGAGLIGMGWGPLSLPSQLGV------GQFSYCMTSYGSSS-----PSTLALG 297

Query: 282 RYYGSETE-FIYTSMLENPKHPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDS 341
                  E    T+++ +  +P +Y + L GI+VG   +  P    ++ + G+GG+++DS
Sbjct: 298 SAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDS 357

Query: 342 GTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTGLSPCYYYE---KSVEVPRVVLH 401
           GTT T LP   YN+V   F ++     +  +  E ++GLS C+       +V+VP + + 
Sbjct: 358 GTTLTYLPQDAYNAVAQAFTDQ----INLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQ 417

Query: 402 FVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQ 461
           F G    + L  +N      +G         V CL +   G  ++L     +  GN QQQ
Sbjct: 418 FDG--GVLNLGEQNILISPAEG---------VICLAM---GSSSQLG---ISIFGNIQQQ 435

Query: 462 GFEVAYDLENNRVGFARRQC 471
             +V YDL+N  V F   QC
Sbjct: 478 ETQVLYDLQNLAVSFVPTQC 435

BLAST of CmaCh16G011400 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 582.4 bits (1500), Expect = 3.3e-166
Identity = 292/477 (61.22%), Positives = 354/477 (74.21%), Query Frame = 0

Query: 24  LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFN 83
           LLL LS+SLS+S   ++  +LLKS+++RSSARF        +  LSLP+S G DY +S +
Sbjct: 29  LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88

Query: 84  LGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQ-KSVSCSAAACSA 143
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S+   +VSCS+ +CSA
Sbjct: 89  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148

Query: 144 AHGGSLSASHLCAISRCPLESIEVSEC--SSFSCPPFYYAYGDGSLIGRLYRDSLSLPAP 203
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+ +LY DSLSL   
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208

Query: 204 APSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSF 263
              P+++V NFTFGCAH+ L EPIGVAGFGRG LS+P QLA  SP LGN FSYCLVSHSF
Sbjct: 209 ---PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 268

Query: 264 AADRVRRPSPLILGRYYG--------------------SETEFIYTSMLENPKHPYFYSV 323
            +DRVRRPSPLILGR+                       + EF++T MLENPKHPYFYSV
Sbjct: 269 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 328

Query: 324 GLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVA 383
            L GIS+G   IPAP  L+++D+ G GGVVVDSGTTFTMLPA  YNSVV +F++R GRV 
Sbjct: 329 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 388

Query: 384 SRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKI 443
            RA ++E ++G+SPCYY  ++V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG   K 
Sbjct: 389 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 448

Query: 444 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 478
           K+GCLMLMNGGDE+EL GG GA LGNYQQQGFEV YDL N RVGFA+R+C++LWDSL
Sbjct: 449 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of CmaCh16G011400 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 199.9 bits (507), Expect = 4.6e-51
Identity = 166/498 (33.33%), Positives = 239/498 (47.99%), Query Frame = 0

Query: 5   VFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHH 64
           +FLFLL  LL +    +Q       N  SSSS F     L KS+ +  + +   + R   
Sbjct: 8   LFLFLLITLLLNTTNKTQ--ARQHKNPSSSSSSF-LVLTLTKSSVSLPTPKSQTQERIKK 67

Query: 65  -RSHLSLPLSP----GGDYTLSFNLGSESQKISLYMDTGSDLVWFPCS--PFECILCEG- 124
             S + + + P       Y ++ N+G+  Q + +Y+DTGSDL W PC    F+CI C   
Sbjct: 68  PLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDL 127

Query: 125 ------KPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECS 184
                  P + SPL   ++ +  SC+++ C   H  S +    CA++ C +  +  S C 
Sbjct: 128 KNNDLKSPSVFSPLHSSTSFRD-SCASSFCVEIH-SSDNPFDPCAVAGCSVSMLLKSTCV 187

Query: 185 SFSCPPFYYAYGDGSLI-GRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGF 244
              CP F Y YG+G LI G L RD L       +   +V  F+FGC  S   EPIG+AGF
Sbjct: 188 R-PCPSFAYTYGEGGLISGILTRDILK------ARTRDVPRFSFGCVTSTYREPIGIAGF 247

Query: 245 GRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGS---ETEFIYT 304
           GRGLLS+P QL      L   FS+C +   F  +     SPLILG    S        +T
Sbjct: 248 GRGLLSLPSQLGF----LEKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQFT 307

Query: 305 SMLENPKHPYFYSVGLAGISVGSVMIP--APEFLKKVDEGGSGGVVVDSGTTFTMLPAGL 364
            ML  P +P  Y +GL  I++G+ + P   P  L++ D  G+GG++VDSGTT+T LP   
Sbjct: 308 PMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 367

Query: 365 YNSVVAQFENRTGRVASRASQIEENTGLSPCY----------YYEKSVEV--PRVVLHFV 424
           Y+ ++   ++       RA++ E  TG   CY            E  V +  P +  HF+
Sbjct: 368 YSQLLTTLQSTI--TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFL 427

Query: 425 GEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGF 471
              ++++LP+ N FY      DG      V CL+  N  D      GP    G++QQQ  
Sbjct: 428 -NNATLLLPQGNSFYAMSAPSDG----SVVQCLLFQNMEDGDY---GPAGVFGSFQQQNV 478

BLAST of CmaCh16G011400 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 178.3 bits (451), Expect = 1.4e-44
Identity = 156/502 (31.08%), Positives = 231/502 (46.02%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSD-FNNTHNLLKSTAARSSARFH-- 60
           MAS +F F L FL  S V + ++ L P S+S  S  D + +   L +S+ AR+    H  
Sbjct: 1   MASSIFFFFLIFL--SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGT 60

Query: 61  ---------HRRRTHHRSHLSLPLSPG--GDYTLSFNLGSESQKISLYMDTGSDLVWFPC 120
                        T   + +  PLS    G Y++S + G+ SQ I    DTGS LVW PC
Sbjct: 61  SIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPC 120

Query: 121 -SPFECILCEGKPKIQSPLPKI-----SNQKSVSCSAAACSAAHGGSLSASHLCAISRCP 180
            S + C  C+      + +P+      S+ K + C +  C   +G ++         +C 
Sbjct: 121 TSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV---------QCR 180

Query: 181 LESIEVSECSSFSCPPFYYAYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSAL 240
                   C +  CPP+   YG GS  G L  + L        P + V +F  GC+  + 
Sbjct: 181 GCDPNTRNC-TVGCPPYILQYGLGSTAGVLITEKLDF------PDLTVPDFVVGCSIIST 240

Query: 241 GEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY--G 300
            +P G+AGFGRG +S+P Q+         RFS+CLVS  F    V     L  G  +  G
Sbjct: 241 RQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSG 300

Query: 301 SETE-FIYTSMLENPK-----HPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVD 360
           S+T    YT   +NP         +Y + L  I VG   +  P         G GG +VD
Sbjct: 301 SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVD 360

Query: 361 SGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTGLSPCYYY--EKSVEVPRVVLH 420
           SG+TFT +   ++  V  +F ++     +R   +E+ TGL PC+    +  V VP ++  
Sbjct: 361 SGSTFTFMERPVFELVAEEFASQMSNY-TREKDLEKETGLGPCFNISGKGDVTVPELIFE 420

Query: 421 FVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAG-GPGATLGNYQQ 472
           F G  + + LP  NYF  F+   D V       CL +++        G GP   LG++QQ
Sbjct: 421 FKG-GAKLELPLSNYF-TFVGNTDTV-------CLTVVSDKTVNPSGGTGPAIILGSFQQ 468

BLAST of CmaCh16G011400 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 156.0 bits (393), Expect = 7.7e-38
Identity = 130/421 (30.88%), Positives = 190/421 (45.13%), Query Frame = 0

Query: 62  THHRSHLSLPLSPG-----GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEG 121
           T     +  PL  G     G+Y     +G  ++++ + +DTGSD+ W  C+P  C  C  
Sbjct: 127 TTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP--CADCYH 186

Query: 122 KPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPP 181
           + +        S+ + +SC    C+A                     +EVSEC + +C  
Sbjct: 187 QTEPIFEPSSSSSYEPLSCDTPQCNA---------------------LEVSECRNATC-L 246

Query: 182 FYYAYGDGS-LIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVA---GFGRG 241
           +  +YGDGS  +G    ++L++ +        V+N   GC HS  G  +G A   G G G
Sbjct: 247 YEVSYGDGSYTVGDFATETLTIGSTL------VQNVAVGCGHSNEGLFVGAAGLLGLGGG 306

Query: 242 LLSMPIQLATFSPQLGNRFSYCLVSH-SFAADRVRRPSPLILGRYYGSETEFIYTSMLEN 301
           LL++P QL T S      FSYCLV   S +A  V   + L          + +   +L N
Sbjct: 307 LLALPSQLNTTS------FSYCLVDRDSDSASTVDFGTSL--------SPDAVVAPLLRN 366

Query: 302 PKHPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQ 361
            +   FY +GL GISVG  ++  P+   ++DE GSGG+++DSGT  T L   +YNS+   
Sbjct: 367 HQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDS 426

Query: 362 FENRTGRVASRASQIEENTGLSPCYYY--EKSVEVPRVVLHFVGEKSSVMLPRKNYFYEF 421
           F   T  +   A     +T    CY    + +VEVP V  HF G K  + LP KNY    
Sbjct: 427 FVKGTLDLEKAAGVAMFDT----CYNLSAKTTVEVPTVAFHFPGGK-MLALPAKNYMIPV 483

Query: 422 LDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQ 471
               D VG      CL                A +GN QQQG  V +DL N+ +GF+  +
Sbjct: 487 ----DSVG----TFCLAFAPTASSL-------AIIGNVQQQGTRVTFDLANSLIGFSSNK 483

BLAST of CmaCh16G011400 vs. TAIR 10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 146.0 bits (367), Expect = 7.9e-35
Identity = 121/401 (30.17%), Positives = 169/401 (42.14%), Query Frame = 0

Query: 76  GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSC 135
           G+Y     +G+ ++++ L +DTGSD+ W  C P  C  C  +          S  KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219

Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGRLYRDS 195
           SA  CS                      +E S C S  C  +  +YGDGS  +G L  D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279

Query: 196 LSLPAPAPSPAINVRNFTFGCAHSALG---EPIGVAGFGRGLLSMPIQLATFSPQLGNRF 255
           ++        +  + N   GC H   G      G+ G G G+LS+  Q+   S      F
Sbjct: 280 VTF-----GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339

Query: 256 SYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGISVGSVM 315
           SYCLV            + + LG   G  T      +L N K   FY VGL+G SVG   
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLGG--GDAT----APLLRNKKIDTFYYVGLSGFSVGGEK 399

Query: 316 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTG 375
           +  P+ +  VD  GSGGV++D GT  T L    YNS+   F   T  +   +S I   + 
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSI---SL 459

Query: 376 LSPCYYYE--KSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMN 435
              CY +    +V+VP V  HF G K S+ LP KNY     D G          C     
Sbjct: 460 FDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFAP 500

Query: 436 GGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 471
                 +       +GN QQQG  + YDL  N +G +  +C
Sbjct: 520 TSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500

The following BLAST results are available for this feature:
BLAST vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q940R4 | 4.7e-165 | 61.22 | Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q766C3 | 3.8e-34 | 29.63 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LS40 | 1.1e-33 | 30.17 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LNJ3 | 1.6e-32 | 31.71 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q766C2 | 1.0e-31 | 28.18 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...

BLAST vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G16563.1 | 3.3e-166 | 61.22 | Eukaryotic aspartyl protease family protein
AT5G45120.1 | 4.6e-51 | 33.33 | Eukaryotic aspartyl protease family protein
AT3G52500.1 | 1.4e-44 | 31.08 | Eukaryotic aspartyl protease family protein
AT1G25510.1 | 7.7e-38 | 30.88 | Eukaryotic aspartyl protease family protein
AT3G18490.1 | 7.9e-35 | 30.17 | Eukaryotic aspartyl protease family protein
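
The identity percentages in these results are the number of identical positions divided by the alignment length, as given in each HSP statistics line. The sketch below recomputes both percentages from such a line. The name `hsp_percentages` and the regular expression are illustrative assumptions rather than part of any BLAST tooling, and the pattern accepts both the "Positives" and "Postives" spellings of the field label.

```python
import re

# Matches 'Identity = a/b (..%), Positives = c/d'; tolerates the 'Postives' spelling.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+) .*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP statistics line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


line = "Identity = 292/477 (61.22%), Positives = 354/477 (74.21%), Query Frame = 0"
print(hsp_percentages(line))  # (61.22, 74.21)
```

For the top hit this reproduces the reported 61.22% identity over a 477-column alignment.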
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 273..476; e-value: 1.1E-46; score: 160.8
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 57..263; e-value: 9.6E-34; score: 119.0
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 73..475
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 297..466; e-value: 2.5E-26; score: 92.4
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 78..261; e-value: 1.3E-28; score: 100.5
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 1..475
None | No IPR available | PANTHER | PTHR47967:SF26 | BNAA01G17170D PROTEIN | coord: 1..475
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 329..340
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 78..466; score: 31.361834
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 78..470; e-value: 1.04057E-69; score: 221.368
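
The `coord` values give the start and end of each match on the protein. As a rough sketch, the spans can be compared to see how the annotations nest. The helper names `span_length` and `contains` are hypothetical, only a few rows are copied in, and the coordinates are assumed to be 1-based and inclusive.

```python
# A few (start, end) spans copied from the InterPro annotations above.
domains = {
    "TAXi_N (PF14543)": (78, 261),
    "TAXi_C (PF14541)": (297, 466),
    "ASP_PROTEASE (PS00141)": (329, 340),
    "PEPTIDASE_A1 (PS51767)": (78, 466),
}


def span_length(coords):
    """Length of a span, assuming 1-based inclusive coordinates."""
    start, end = coords
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, coords in domains.items():
    print(name, span_length(coords))

# The PROSITE active-site pattern falls inside the C-terminal TAXi domain.
print(contains(domains["TAXi_C (PF14541)"], domains["ASP_PROTEASE (PS00141)"]))  # True
```

With these spans, the PS00141 active-site match (residues 329..340) sits within the TAXi_C match, and both Pfam domains lie within the PEPTIDASE_A1 profile match.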

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh16G011400.1 | CmaCh16G011400.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0005576 | extracellular region
molecular_function | GO:0004190 | aspartic-type endopeptidase activity