Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCTTCCCCTGTTTTCCTCTTCCTCCTCTGTTTTCTCCTTCCTTCCCCTGTTTTCTCCTCACAGATTCTGCTCTTACCTCTCTCTAATTCCTTATCATCCTCATCCGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCACGCTCTTCCGCCCGCTTCCACCACCGCCGCCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTCTCCCCCGGCGGCGATTATACTCTCTCCTTCAACCTCGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCTCAAATCAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGCTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCACCGTCACCGGCGATCAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGTGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGATTCAACTCGCCACTTTCTCTCCCCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCATGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTTGGGTCGGTGATGATTCCGGCGCCGGAGTTTTTGAAAAAGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTGGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCGAGCCGGGCGAGTCAGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAAATCAGTGGAAGTGCCACGTGTCGTGTTACACTTCGTTGGGGAAAAATCCAGTGTGATGCTTCCTAGAAAAAATTATTTCTATGAGTTCTTGGACGGCGGAGATGGGGTGGGGAGAAAGATAAAAGTCGGGTGTTTAATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGGCAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA
mRNA sequence
ATGGCTTCCCCTGTTTTCCTCTTCCTCCTCTGTTTTCTCCTTCCTTCCCCTGTTTTCTCCTCACAGATTCTGCTCTTACCTCTCTCTAATTCCTTATCATCCTCATCCGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCACGCTCTTCCGCCCGCTTCCACCACCGCCGCCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTCTCCCCCGGCGGCGATTATACTCTCTCCTTCAACCTCGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCTCAAATCAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGCTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCACCGTCACCGGCGATCAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGTGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGATTCAACTCGCCACTTTCTCTCCCCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCATGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTTGGGTCGGTGATGATTCCGGCGCCGGAGTTTTTGAAAAAGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTGGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCGAGCCGGGCGAGTCAGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAAATCAGTGGAAGTGCCACGTGTCGTGTTACACTTCGTTGGGGAAAAATCCAGTGTGATGCTTCCTAGAAAAAATTATTTCTATGAGTTCTTGGACGGCGGAGATGGGGTGGGGAGAAAGATAAAAGTCGGGTGTTTAATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGGCAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA
Coding sequence (CDS)
ATGGCTTCCCCTGTTTTCCTCTTCCTCCTCTGTTTTCTCCTTCCTTCCCCTGTTTTCTCCTCACAGATTCTGCTCTTACCTCTCTCTAATTCCTTATCATCCTCATCCGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCACGCTCTTCCGCCCGCTTCCACCACCGCCGCCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTCTCCCCCGGCGGCGATTATACTCTCTCCTTCAACCTCGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCTCAAATCAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGCTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCACCGTCACCGGCGATCAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGTGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGATTCAACTCGCCACTTTCTCTCCCCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCATGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTTGGGTCGGTGATGATTCCGGCGCCGGAGTTTTTGAAAAAGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTGGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCGAGCCGGGCGAGTCAGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAAATCAGTGGAAGTGCCACGTGTCGTGTTACACTTCGTTGGGGAAAAATCCAGTGTGATGCTTCCTAGAAAAAATTATTTCTATGAGTTCTTGGACGGCGGAGATGGGGTGGGGAGAAAGATAAAAGTCGGGTGTTTAATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGGCAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA
Protein sequence
MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNRS
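The protein sequence above can be checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` function and the table names are illustrative and not part of the database. It is applied here only to the first 24 bases and the last 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown above.
print(translate("ATGGCTTCCCCTGTTTTCCTCTTC"))  # MASPVFLF
# Last four codons of the CDS, ending in the TAA stop codon.
print(translate("AACCGCAGTTAA"))  # NRS
```

Both fragments agree with the start (MASPVFLF) and end (NRS) of the protein sequence listed above.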
Homology
BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match:
Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)
HSP 1 Score: 582.4 bits (1500), Expect = 4.7e-165
Identity = 292/477 (61.22%), Positives = 354/477 (74.21%), Query Frame = 0
Query: 24 LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFN 83
LLL LS+SLS+S ++ +LLKS+++RSSARF + LSLP+S G DY +S +
Sbjct: 29 LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88
Query: 84 LGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQ-KSVSCSAAACSA 143
+GS S +SLY+DTGSDLVWFPC PF CILCE KP SP +S+ +VSCS+ +CSA
Sbjct: 89 VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148
Query: 144 AHGGSLSASHLCAISRCPLESIEVSEC--SSFSCPPFYYAYGDGSLIGRLYRDSLSLPAP 203
AH SL +S LCAIS CPL+ IE +C SS+ CPPFYYAYGDGSL+ +LY DSLSL
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208
Query: 204 APSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSF 263
P+++V NFTFGCAH+ L EPIGVAGFGRG LS+P QLA SP LGN FSYCLVSHSF
Sbjct: 209 ---PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 268
Query: 264 AADRVRRPSPLILGRYYG--------------------SETEFIYTSMLENPKHPYFYSV 323
+DRVRRPSPLILGR+ + EF++T MLENPKHPYFYSV
Sbjct: 269 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 328
Query: 324 GLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVA 383
L GIS+G IPAP L+++D+ G GGVVVDSGTTFTMLPA YNSVV +F++R GRV
Sbjct: 329 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 388
Query: 384 SRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKI 443
RA ++E ++G+SPCYY ++V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG K
Sbjct: 389 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 448
Query: 444 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 478
K+GCLMLMNGGDE+EL GG GA LGNYQQQGFEV YDL N RVGFA+R+C++LWDSL
Sbjct: 449 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498
BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match:
Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)
HSP 1 Score: 147.5 bits (371), Expect = 3.8e-34
Identity = 120/405 (29.63%), Positives = 181/405 (44.69%), Query Frame = 0
Query: 76 GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSC 135
G+Y ++ ++G+ +Q S MDTGSDL+W C P C C P + Q S S
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP--CTQC-----FNQSTPIFNPQGSSSF 152
Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGS-LIGRLYRDS 195
S CS S LC +++ CS+ C + Y YGDGS G + ++
Sbjct: 153 STLPCS---------SQLC-------QALSSPTCSNNFC-QYTYGYGDGSETQGSMGTET 212
Query: 196 LSLPAPAPSPAINVRNFTFGCAHS----ALGEPIGVAGFGRGLLSMPIQLATFSPQLGNR 255
L+ ++++ N TFGC + G G+ G GRG LS+P QL +
Sbjct: 213 LTF------GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV------TK 272
Query: 256 FSYCLVSHSFAADRVRRPSPLILGRYYGSETE-FIYTSMLENPKHPYFYSVGLAGISVGS 315
FSYC+ + PS L+LG S T T+++++ + P FY + L G+SVGS
Sbjct: 273 FSYCMTPIGSST-----PSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 332
Query: 316 VMIPA-PEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEE 375
+P P G+GG+++DSGTT T Y SV +F ++ S
Sbjct: 333 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGS---- 392
Query: 376 NTGLSPCYYY---EKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCL 435
++G C+ ++++P V+HF G + LP +NYF +G + CL
Sbjct: 393 SSGFDLCFQTPSDPSNLQIPTFVMHFDG--GDLELPSENYFISPSNG---------LICL 434
Query: 436 MLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 471
+ + + GN QQQ V YD N+ V FA QC
Sbjct: 453 AMGSSSQGMSI-------FGNIQQQNMLVVYDTGNSVVSFASAQC 434
BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match:
Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)
HSP 1 Score: 146.0 bits (367), Expect = 1.1e-33
Identity = 121/401 (30.17%), Positives = 169/401 (42.14%), Query Frame = 0
Query: 76 GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSC 135
G+Y +G+ ++++ L +DTGSD+ W C P C C + S KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219
Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGRLYRDS 195
SA CS +E S C S C + +YGDGS +G L D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279
Query: 196 LSLPAPAPSPAINVRNFTFGCAHSALG---EPIGVAGFGRGLLSMPIQLATFSPQLGNRF 255
++ + + N GC H G G+ G G G+LS+ Q+ S F
Sbjct: 280 VTF-----GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339
Query: 256 SYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGISVGSVM 315
SYCLV + + LG G T +L N K FY VGL+G SVG
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLGG--GDAT----APLLRNKKIDTFYYVGLSGFSVGGEK 399
Query: 316 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTG 375
+ P+ + VD GSGGV++D GT T L YNS+ F T + +S I +
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSI---SL 459
Query: 376 LSPCYYYE--KSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMN 435
CY + +V+VP V HF G K S+ LP KNY D G C
Sbjct: 460 FDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFAP 500
Query: 436 GGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 471
+ +GN QQQG + YDL N +G + +C
Sbjct: 520 TSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500
BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match:
Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)
HSP 1 Score: 142.1 bits (357), Expect = 1.6e-32
Identity = 130/410 (31.71%), Positives = 186/410 (45.37%), Query Frame = 0
Query: 72 LSPG-GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQ 131
LS G G+Y +G+ ++ + + +DTGSD+VW C+P C C + P +
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194
Query: 132 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGR 191
KS + + CS+ H L ++ C R C + +YGDGS +G
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254
Query: 192 LYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVA---GFGRGLLSMPIQLATFSPQ 251
++L+ V+ GC H G +G A G G+G LS P Q +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---GHR 314
Query: 252 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGIS 311
+FSYCLV S ++ +PS ++ G S +T +L NPK FY VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGIS 374
Query: 312 VGSVMIP-APEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQ 371
VG +P L K+D+ G+GGV++DSGT+ T L Y ++ F R G A +
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434
Query: 372 IEENTGLSPCYYYE--KSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVG 431
+ + C+ V+VP VVLHF G + V LP NY
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIP--------------- 485
Query: 432 CLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVAYDLENNRVGFARRQCS 472
+ NG AG G + +GN QQQGF V YDL ++RVGFA C+
Sbjct: 495 --VDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
BLAST of CmaCh16G011400 vs. ExPASy Swiss-Prot
Match:
Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)
HSP 1 Score: 139.4 bits (350), Expect = 1.0e-31
Identity = 124/440 (28.18%), Positives = 189/440 (42.95%), Query Frame = 0
Query: 42 HNLLKSTAARSSARFHH-RRRTHHRSHLSLPLSPG-GDYTLSFNLGSESQKISLYMDTGS 101
+ L+K R R S + P+ G G+Y ++ +G+ S MDTGS
Sbjct: 58 YELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGS 117
Query: 102 DLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRC 161
DL+W C P C C P P + Q S S S C + + L
Sbjct: 118 DLIWTQCEP--CTQC-----FSQPTPIFNPQDSSSFSTLPCESQYCQDL----------- 177
Query: 162 PLESIEVSECSSFSCPPFYYAYGDGSLI-GRLYRDSLSLPAPAPSPAINVRNFTFGCAHS 221
P E+ +EC + Y YGDGS G + ++ + +V N FGC
Sbjct: 178 PSETCNNNECQ------YTYGYGDGSTTQGYMATETFTFETS------SVPNIAFGCGED 237
Query: 222 ----ALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILG 281
G G+ G G G LS+P QL +FSYC+ S+ ++ PS L LG
Sbjct: 238 NQGFGQGNGAGLIGMGWGPLSLPSQLGV------GQFSYCMTSYGSSS-----PSTLALG 297
Query: 282 RYYGSETE-FIYTSMLENPKHPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDS 341
E T+++ + +P +Y + L GI+VG + P ++ + G+GG+++DS
Sbjct: 298 SAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDS 357
Query: 342 GTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTGLSPCYYYE---KSVEVPRVVLH 401
GTT T LP YN+V F ++ + + E ++GLS C+ +V+VP + +
Sbjct: 358 GTTLTYLPQDAYNAVAQAFTDQ----INLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQ 417
Query: 402 FVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQ 461
F G + L +N +G V CL + G ++L + GN QQQ
Sbjct: 418 FDG--GVLNLGEQNILISPAEG---------VICLAM---GSSSQLG---ISIFGNIQQQ 435
Query: 462 GFEVAYDLENNRVGFARRQC 471
+V YDL+N V F QC
Sbjct: 478 ETQVLYDLQNLAVSFVPTQC 435
BLAST of CmaCh16G011400 vs. TAIR 10
Match:
AT4G16563.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 582.4 bits (1500), Expect = 3.3e-166
Identity = 292/477 (61.22%), Positives = 354/477 (74.21%), Query Frame = 0
Query: 24 LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFN 83
LLL LS+SLS+S ++ +LLKS+++RSSARF + LSLP+S G DY +S +
Sbjct: 29 LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88
Query: 84 LGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQ-KSVSCSAAACSA 143
+GS S +SLY+DTGSDLVWFPC PF CILCE KP SP +S+ +VSCS+ +CSA
Sbjct: 89 VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148
Query: 144 AHGGSLSASHLCAISRCPLESIEVSEC--SSFSCPPFYYAYGDGSLIGRLYRDSLSLPAP 203
AH SL +S LCAIS CPL+ IE +C SS+ CPPFYYAYGDGSL+ +LY DSLSL
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208
Query: 204 APSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSF 263
P+++V NFTFGCAH+ L EPIGVAGFGRG LS+P QLA SP LGN FSYCLVSHSF
Sbjct: 209 ---PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 268
Query: 264 AADRVRRPSPLILGRYYG--------------------SETEFIYTSMLENPKHPYFYSV 323
+DRVRRPSPLILGR+ + EF++T MLENPKHPYFYSV
Sbjct: 269 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 328
Query: 324 GLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVA 383
L GIS+G IPAP L+++D+ G GGVVVDSGTTFTMLPA YNSVV +F++R GRV
Sbjct: 329 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 388
Query: 384 SRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKI 443
RA ++E ++G+SPCYY ++V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG K
Sbjct: 389 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 448
Query: 444 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 478
K+GCLMLMNGGDE+EL GG GA LGNYQQQGFEV YDL N RVGFA+R+C++LWDSL
Sbjct: 449 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498
BLAST of CmaCh16G011400 vs. TAIR 10
Match:
AT5G45120.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 199.9 bits (507), Expect = 4.6e-51
Identity = 166/498 (33.33%), Positives = 239/498 (47.99%), Query Frame = 0
Query: 5 VFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHH 64
+FLFLL LL + +Q N SSSS F L KS+ + + + + R
Sbjct: 8 LFLFLLITLLLNTTNKTQ--ARQHKNPSSSSSSF-LVLTLTKSSVSLPTPKSQTQERIKK 67
Query: 65 -RSHLSLPLSP----GGDYTLSFNLGSESQKISLYMDTGSDLVWFPCS--PFECILCEG- 124
S + + + P Y ++ N+G+ Q + +Y+DTGSDL W PC F+CI C
Sbjct: 68 PLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDL 127
Query: 125 ------KPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECS 184
P + SPL ++ + SC+++ C H S + CA++ C + + S C
Sbjct: 128 KNNDLKSPSVFSPLHSSTSFRD-SCASSFCVEIH-SSDNPFDPCAVAGCSVSMLLKSTCV 187
Query: 185 SFSCPPFYYAYGDGSLI-GRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGF 244
CP F Y YG+G LI G L RD L + +V F+FGC S EPIG+AGF
Sbjct: 188 R-PCPSFAYTYGEGGLISGILTRDILK------ARTRDVPRFSFGCVTSTYREPIGIAGF 247
Query: 245 GRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGS---ETEFIYT 304
GRGLLS+P QL L FS+C + F + SPLILG S +T
Sbjct: 248 GRGLLSLPSQLGF----LEKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQFT 307
Query: 305 SMLENPKHPYFYSVGLAGISVGSVMIP--APEFLKKVDEGGSGGVVVDSGTTFTMLPAGL 364
ML P +P Y +GL I++G+ + P P L++ D G+GG++VDSGTT+T LP
Sbjct: 308 PMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 367
Query: 365 YNSVVAQFENRTGRVASRASQIEENTGLSPCY----------YYEKSVEV--PRVVLHFV 424
Y+ ++ ++ RA++ E TG CY E V + P + HF+
Sbjct: 368 YSQLLTTLQSTI--TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFL 427
Query: 425 GEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGF 471
++++LP+ N FY DG V CL+ N D GP G++QQQ
Sbjct: 428 -NNATLLLPQGNSFYAMSAPSDG----SVVQCLLFQNMEDGDY---GPAGVFGSFQQQNV 478
BLAST of CmaCh16G011400 vs. TAIR 10
Match:
AT3G52500.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 178.3 bits (451), Expect = 1.4e-44
Identity = 156/502 (31.08%), Positives = 231/502 (46.02%), Query Frame = 0
Query: 1 MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSD-FNNTHNLLKSTAARSSARFH-- 60
MAS +F F L FL S V + ++ L P S+S S D + + L +S+ AR+ H
Sbjct: 1 MASSIFFFFLIFL--SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGT 60
Query: 61 ---------HRRRTHHRSHLSLPLSPG--GDYTLSFNLGSESQKISLYMDTGSDLVWFPC 120
T + + PLS G Y++S + G+ SQ I DTGS LVW PC
Sbjct: 61 SIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPC 120
Query: 121 -SPFECILCEGKPKIQSPLPKI-----SNQKSVSCSAAACSAAHGGSLSASHLCAISRCP 180
S + C C+ + +P+ S+ K + C + C +G ++ +C
Sbjct: 121 TSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV---------QCR 180
Query: 181 LESIEVSECSSFSCPPFYYAYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSAL 240
C + CPP+ YG GS G L + L P + V +F GC+ +
Sbjct: 181 GCDPNTRNC-TVGCPPYILQYGLGSTAGVLITEKLDF------PDLTVPDFVVGCSIIST 240
Query: 241 GEPIGVAGFGRGLLSMPIQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY--G 300
+P G+AGFGRG +S+P Q+ RFS+CLVS F V L G + G
Sbjct: 241 RQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSG 300
Query: 301 SETE-FIYTSMLENPK-----HPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVD 360
S+T YT +NP +Y + L I VG + P G GG +VD
Sbjct: 301 SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVD 360
Query: 361 SGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTGLSPCYYY--EKSVEVPRVVLH 420
SG+TFT + ++ V +F ++ +R +E+ TGL PC+ + V VP ++
Sbjct: 361 SGSTFTFMERPVFELVAEEFASQMSNY-TREKDLEKETGLGPCFNISGKGDVTVPELIFE 420
Query: 421 FVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMNGGDEAELAG-GPGATLGNYQQ 472
F G + + LP NYF F+ D V CL +++ G GP LG++QQ
Sbjct: 421 FKG-GAKLELPLSNYF-TFVGNTDTV-------CLTVVSDKTVNPSGGTGPAIILGSFQQ 468
BLAST of CmaCh16G011400 vs. TAIR 10
Match:
AT1G25510.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 156.0 bits (393), Expect = 7.7e-38
Identity = 130/421 (30.88%), Positives = 190/421 (45.13%), Query Frame = 0
Query: 62 THHRSHLSLPLSPG-----GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEG 121
T + PL G G+Y +G ++++ + +DTGSD+ W C+P C C
Sbjct: 127 TTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP--CADCYH 186
Query: 122 KPKIQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPP 181
+ + S+ + +SC C+A +EVSEC + +C
Sbjct: 187 QTEPIFEPSSSSSYEPLSCDTPQCNA---------------------LEVSECRNATC-L 246
Query: 182 FYYAYGDGS-LIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVA---GFGRG 241
+ +YGDGS +G ++L++ + V+N GC HS G +G A G G G
Sbjct: 247 YEVSYGDGSYTVGDFATETLTIGSTL------VQNVAVGCGHSNEGLFVGAAGLLGLGGG 306
Query: 242 LLSMPIQLATFSPQLGNRFSYCLVSH-SFAADRVRRPSPLILGRYYGSETEFIYTSMLEN 301
LL++P QL T S FSYCLV S +A V + L + + +L N
Sbjct: 307 LLALPSQLNTTS------FSYCLVDRDSDSASTVDFGTSL--------SPDAVVAPLLRN 366
Query: 302 PKHPYFYSVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQ 361
+ FY +GL GISVG ++ P+ ++DE GSGG+++DSGT T L +YNS+
Sbjct: 367 HQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDS 426
Query: 362 FENRTGRVASRASQIEENTGLSPCYYY--EKSVEVPRVVLHFVGEKSSVMLPRKNYFYEF 421
F T + A +T CY + +VEVP V HF G K + LP KNY
Sbjct: 427 FVKGTLDLEKAAGVAMFDT----CYNLSAKTTVEVPTVAFHFPGGK-MLALPAKNYMIPV 483
Query: 422 LDGGDGVGRKIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQ 471
D VG CL A +GN QQQG V +DL N+ +GF+ +
Sbjct: 487 ----DSVG----TFCLAFAPTASSL-------AIIGNVQQQGTRVTFDLANSLIGFSSNK 483
BLAST of CmaCh16G011400 vs. TAIR 10
Match:
AT3G18490.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 146.0 bits (367), Expect = 7.9e-35
Identity = 121/401 (30.17%), Positives = 169/401 (42.14%), Query Frame = 0
Query: 76 GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNQKSVSC 135
G+Y +G+ ++++ L +DTGSD+ W C P C C + S KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219
Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGRLYRDS 195
SA CS +E S C S C + +YGDGS +G L D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279
Query: 196 LSLPAPAPSPAINVRNFTFGCAHSALG---EPIGVAGFGRGLLSMPIQLATFSPQLGNRF 255
++ + + N GC H G G+ G G G+LS+ Q+ S F
Sbjct: 280 VTF-----GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339
Query: 256 SYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVGLAGISVGSVM 315
SYCLV + + LG G T +L N K FY VGL+G SVG
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLGG--GDAT----APLLRNKKIDTFYYVGLSGFSVGGEK 399
Query: 316 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASQIEENTG 375
+ P+ + VD GSGGV++D GT T L YNS+ F T + +S I +
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSI---SL 459
Query: 376 LSPCYYYE--KSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRKIKVGCLMLMN 435
CY + +V+VP V HF G K S+ LP KNY D G C
Sbjct: 460 FDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFAP 500
Query: 436 GGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 471
+ +GN QQQG + YDL N +G + +C
Sbjct: 520 TSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q940R4 | 4.7e-165 | 61.22 | Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q766C3 | 3.8e-34 | 29.63 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LS40 | 1.1e-33 | 30.17 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LNJ3 | 1.6e-32 | 31.71 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q766C2 | 1.0e-31 | 28.18 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
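The identity values in the table match the HSP headers earlier in this section, where each percentage is the number of matching positions divided by the aligned length (for example, 292/477 for Q940R4). The following is a small sketch of that arithmetic, assuming the header layout shown above; `parse_fractions` and `percent` are hypothetical helper names, not part of any BLAST tool.

```python
import re

# Matches "<label> = <matches>/<length> (<percent>%)" as printed in the HSP headers.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def percent(matches: int, aligned_length: int) -> float:
    """Percentage of aligned positions that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


def parse_fractions(header: str):
    """Return (label, matches, aligned_length, reported_percent) for each fraction."""
    return [
        (label, int(m), int(n), float(p))
        for label, m, n, p in FRACTION.findall(header)
    ]


header = "Identity = 292/477 (61.22%), Positives = 354/477 (74.21%), Query Frame = 0"
for label, m, n, reported in parse_fractions(header):
    # Recompute each percentage from the raw counts and compare with the report.
    print(label, percent(m, n), reported)
```

Recomputing from the counts this way gives a quick consistency check between the alignment headers and the summary table.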