Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTATCAGTCTCAACATGAACAACAATAAACCCATTTTCTCCGCTTTCCTCTTCACCGCAATCCTCAATTCCCTTCTTTTCTCTTCTTCTCTCGCCCGCGTTTTCACGGAAACCACCACCGTCTTCGATGTCTCCGCTTCCTCTAACCGAGCTCAGAACGCCCTCTCCATCACCCCTCCTCAATTTCACTCCCATCACCTTTCAAATTCCTCTCTGTCTCTGTCGCTGCACTCCCGACTCGCCATTCATAAGCATAATTACAAGGACTACGAGAGCTTGGTCCGAGCTCGACTCGCCCGTGATGCCGCCCGTGTTCAATCCCTTAATCGGAATCTTAATCTGGCTTTGGCTGGTGATGCGGTCCGCCCCAACTCCTTAACCGCCCCTGTTGTTTCTGGGCAGAGTCAGGGGAGTGGGGAGTATTTTGCACGGATTGCTGTCGGGCAGCCGGCTCAATCGTTCTATTTGGTGCCCGACACCGGCAGTGACATCACTTGGCTACAGTGCCTGCCCTGTAGTATTGGAAATACCTGTTACCCACAAACCGACCCGATATTCAACCCGACATCCTCGTCCTCTTACAGACCCCTGTCTTGCGATTCGCAGCAATGTCAATCCCTCAACAGACCCGGATGTCAATCCGGCACGTGCGTGTACCAAGTCTGGTACGGCGACGGTTCGTTCACCACCGGTGATTTCGCCACCGAGACGCTGACCTTCGGAAATTCCAAATCCATCCCCAATCTCCCAATCGGCTGTGGCCACGACAATAAAGGCCTCTTCGTCGGAGCCGCCGGTTTGATCGGCCTCGGCGGTGGGGCTCTTTCCCTCTCCTCCCAACTTAAAGCGTCGTCGTTTTCCTACTGCCTCGTCGACCGCGACTCGGACTCGTCCTCCACTCTGGAATTCGACTCGGCGCGACCCAGTGACTCGATCACTACCCCACTTCTGAAAAACAACCGAATCGACTCGTACCGGTACGTGCAGGTGACCGGAATGAGCGTGGGGGGGAAGGCGCTGTCTATTTCTTCGACGAGATTTGAAATCGACGGGTCGGGAATGGGGGGAATAATCGTGGACTCGGGTACGTTTATAACTCGGTTGCCGACTGACGTGTACGAATCATTGAGAGAAGCGTTTGTGCAAGGGGCGCGGAGCCTGACAGCGGCGGGAGCGATATCACCGTTCGACACGTGTTACAATCTGGCGGGTCAGTCGAACGTGCAGGTGCCGACGGTGGCGTTTGAATTGTCAAAAGGTAACTTGCTGCAGCTGCCGGCGAGAAACTACTTAATACGAATGGACACGGCGGGAACTTATTGCTTGGCGTTTCTCAAATTGACAACGTCGTCGCTTTCCATAATAGGGAGCTTTCAACAGCAGGGAATGCGTGTCAGCTATGACCTGGTCAACTCCCTGGTCGGATTCTCGTCTAATAAATGCTAA
mRNA sequence
ATGCCTATCAGTCTCAACATGAACAACAATAAACCCATTTTCTCCGCTTTCCTCTTCACCGCAATCCTCAATTCCCTTCTTTTCTCTTCTTCTCTCGCCCGCGTTTTCACGGAAACCACCACCGTCTTCGATGTCTCCGCTTCCTCTAACCGAGCTCAGAACGCCCTCTCCATCACCCCTCCTCAATTTCACTCCCATCACCTTTCAAATTCCTCTCTGTCTCTGTCGCTGCACTCCCGACTCGCCATTCATAAGCATAATTACAAGGACTACGAGAGCTTGGTCCGAGCTCGACTCGCCCGTGATGCCGCCCGTGTTCAATCCCTTAATCGGAATCTTAATCTGGCTTTGGCTGGTGATGCGGTCCGCCCCAACTCCTTAACCGCCCCTGTTGTTTCTGGGCAGAGTCAGGGGAGTGGGGAGTATTTTGCACGGATTGCTGTCGGGCAGCCGGCTCAATCGTTCTATTTGGTGCCCGACACCGGCAGTGACATCACTTGGCTACAGTGCCTGCCCTGTAGTATTGGAAATACCTGTTACCCACAAACCGACCCGATATTCAACCCGACATCCTCGTCCTCTTACAGACCCCTGTCTTGCGATTCGCAGCAATGTCAATCCCTCAACAGACCCGGATGTCAATCCGGCACGTGCGTGTACCAAGTCTGGTACGGCGACGGTTCGTTCACCACCGGTGATTTCGCCACCGAGACGCTGACCTTCGGAAATTCCAAATCCATCCCCAATCTCCCAATCGGCTGTGGCCACGACAATAAAGGCCTCTTCGTCGGAGCCGCCGGTTTGATCGGCCTCGGCGGTGGGGCTCTTTCCCTCTCCTCCCAACTTAAAGCGTCGTCGTTTTCCTACTGCCTCGTCGACCGCGACTCGGACTCGTCCTCCACTCTGGAATTCGACTCGGCGCGACCCAGTGACTCGATCACTACCCCACTTCTGAAAAACAACCGAATCGACTCGTACCGGTACGTGCAGGTGACCGGAATGAGCGTGGGGGGGAAGGCGCTGTCTATTTCTTCGACGAGATTTGAAATCGACGGGTCGGGAATGGGGGGAATAATCGTGGACTCGGGTACGTTTATAACTCGGTTGCCGACTGACGTGTACGAATCATTGAGAGAAGCGTTTGTGCAAGGGGCGCGGAGCCTGACAGCGGCGGGAGCGATATCACCGTTCGACACGTGTTACAATCTGGCGGGTCAGTCGAACGTGCAGGTGCCGACGGTGGCGTTTGAATTGTCAAAAGGTAACTTGCTGCAGCTGCCGGCGAGAAACTACTTAATACGAATGGACACGGCGGGAACTTATTGCTTGGCGTTTCTCAAATTGACAACGTCGTCGCTTTCCATAATAGGGAGCTTTCAACAGCAGGGAATGCGTGTCAGCTATGACCTGGTCAACTCCCTGGTCGGATTCTCGTCTAATAAATGCTAA
Coding sequence (CDS)
ATGCCTATCAGTCTCAACATGAACAACAATAAACCCATTTTCTCCGCTTTCCTCTTCACCGCAATCCTCAATTCCCTTCTTTTCTCTTCTTCTCTCGCCCGCGTTTTCACGGAAACCACCACCGTCTTCGATGTCTCCGCTTCCTCTAACCGAGCTCAGAACGCCCTCTCCATCACCCCTCCTCAATTTCACTCCCATCACCTTTCAAATTCCTCTCTGTCTCTGTCGCTGCACTCCCGACTCGCCATTCATAAGCATAATTACAAGGACTACGAGAGCTTGGTCCGAGCTCGACTCGCCCGTGATGCCGCCCGTGTTCAATCCCTTAATCGGAATCTTAATCTGGCTTTGGCTGGTGATGCGGTCCGCCCCAACTCCTTAACCGCCCCTGTTGTTTCTGGGCAGAGTCAGGGGAGTGGGGAGTATTTTGCACGGATTGCTGTCGGGCAGCCGGCTCAATCGTTCTATTTGGTGCCCGACACCGGCAGTGACATCACTTGGCTACAGTGCCTGCCCTGTAGTATTGGAAATACCTGTTACCCACAAACCGACCCGATATTCAACCCGACATCCTCGTCCTCTTACAGACCCCTGTCTTGCGATTCGCAGCAATGTCAATCCCTCAACAGACCCGGATGTCAATCCGGCACGTGCGTGTACCAAGTCTGGTACGGCGACGGTTCGTTCACCACCGGTGATTTCGCCACCGAGACGCTGACCTTCGGAAATTCCAAATCCATCCCCAATCTCCCAATCGGCTGTGGCCACGACAATAAAGGCCTCTTCGTCGGAGCCGCCGGTTTGATCGGCCTCGGCGGTGGGGCTCTTTCCCTCTCCTCCCAACTTAAAGCGTCGTCGTTTTCCTACTGCCTCGTCGACCGCGACTCGGACTCGTCCTCCACTCTGGAATTCGACTCGGCGCGACCCAGTGACTCGATCACTACCCCACTTCTGAAAAACAACCGAATCGACTCGTACCGGTACGTGCAGGTGACCGGAATGAGCGTGGGGGGGAAGGCGCTGTCTATTTCTTCGACGAGATTTGAAATCGACGGGTCGGGAATGGGGGGAATAATCGTGGACTCGGGTACGTTTATAACTCGGTTGCCGACTGACGTGTACGAATCATTGAGAGAAGCGTTTGTGCAAGGGGCGCGGAGCCTGACAGCGGCGGGAGCGATATCACCGTTCGACACGTGTTACAATCTGGCGGGTCAGTCGAACGTGCAGGTGCCGACGGTGGCGTTTGAATTGTCAAAAGGTAACTTGCTGCAGCTGCCGGCGAGAAACTACTTAATACGAATGGACACGGCGGGAACTTATTGCTTGGCGTTTCTCAAATTGACAACGTCGTCGCTTTCCATAATAGGGAGCTTTCAACAGCAGGGAATGCGTGTCAGCTATGACCTGGTCAACTCCCTGGTCGGATTCTCGTCTAATAAATGCTAA
Protein sequence
MPISLNMNNNKPIFSAFLFTAILNSLLFSSSLARVFTETTTVFDVSASSNRAQNALSITPPQFHSHHLSNSSLSLSLHSRLAIHKHNYKDYESLVRARLARDAARVQSLNRNLNLALAGDAVRPNSLTAPVVSGQSQGSGEYFARIAVGQPAQSFYLVPDTGSDITWLQCLPCSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQSLNRPGCQSGTCVYQVWYGDGSFTTGDFATETLTFGNSKSIPNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLKASSFSYCLVDRDSDSSSTLEFDSARPSDSITTPLLKNNRIDSYRYVQVTGMSVGGKALSISSTRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLAGQSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLVGFSSNKC
Homology
BLAST of CmaCh11G002640 vs. ExPASy Swiss-Prot
Match:
Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)
HSP 1 Score: 436.8 bits (1122), Expect = 3.2e-121
Identity = 232/461 (50.33%), Postives = 311/461 (67.46%), Query Frame = 0
Query: 37 TETTTVFDVSASSNRAQNALSITPPQFHSHHLSNSSLSLSLHSRLAIHKHNYKDYESLVR 96
T+T D + SS S++ P F + S+S LSL LHSR +KDY+SL
Sbjct: 47 TQTILSLDPTRSSLTTTKPESLSDPVFFN---SSSPLSLELHSRDTFVASQHKDYKSLTL 106
Query: 97 ARLARDAARVQSLNRNLNLALAG-------------DAVRPNSLTAPVVSGQSQGSGEYF 156
+RL RD++RV + + A+ G + LT PVVSG SQGSGEYF
Sbjct: 107 SRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYF 166
Query: 157 ARIAVGQPAQSFYLVPDTGSDITWLQCLPCSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQ 216
+RI VG PA+ YLV DTGSD+ W+QC PC+ CY Q+DP+FNPTSSS+Y+ L+C +
Sbjct: 167 SRIGVGTPAKEMYLVLDTGSDVNWIQCEPCA---DCYQQSDPVFNPTSSSTYKSLTCSAP 226
Query: 217 QCQSLNRPGCQSGTCVYQVWYGDGSFTTGDFATETLTFGNSKSIPNLPIGCGHDNKGLFV 276
QC L C+S C+YQV YGDGSFT G+ AT+T+TFGNS I N+ +GCGHDN+GLF
Sbjct: 227 QCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFT 286
Query: 277 GAAGLIGLGGGALSLSSQLKASSFSYCLVDRDSDSSSTLEFDSAR-PSDSITTPLLKNNR 336
GAAGL+GLGGG LS+++Q+KA+SFSYCLVDRDS SS+L+F+S + T PLL+N +
Sbjct: 287 GAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKK 346
Query: 337 IDSYRYVQVTGMSVGGKALSISSTRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFV 396
ID++ YV ++G SVGG+ + + F++D SG GG+I+D GT +TRL T Y SLR+AF+
Sbjct: 347 IDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFL 406
Query: 397 QGARSL-TAAGAISPFDTCYNLAGQSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTY 456
+ +L + +IS FDTCY+ + S V+VPTVAF + G L LPA+NYLI +D +GT+
Sbjct: 407 KLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTF 466
Query: 457 CLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLVGFSSNKC 483
C AF T+SSLSIIG+ QQQG R++YDL +++G S NKC
Sbjct: 467 CFAFAP-TSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
BLAST of CmaCh11G002640 vs. ExPASy Swiss-Prot
Match:
Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)
HSP 1 Score: 346.7 bits (888), Expect = 4.3e-94
Identity = 186/437 (42.56%), Postives = 269/437 (61.56%), Query Frame = 0
Query: 56 LSITPPQFHSHHLSNSSLS---LSLHSRLAIHKHNYKDYESLVRARLARDAARVQSLNRN 115
++ T P F++ H S+ S S L L R Y+++ + AR+ RD RV ++ R
Sbjct: 39 VTATLPDFNNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRR 98
Query: 116 LN---LALAGDAVRPNSLTAPVVSGQSQGSGEYFARIAVGQPAQSFYLVPDTGSDITWLQ 175
++ + + N + +VSG QGSGEYF RI VG P + Y+V D+GSD+ W+Q
Sbjct: 99 ISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQ 158
Query: 176 CLPCSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQSLNRPGCQSGTCVYQVWYGDGSF 235
C PC + CY Q+DP+F+P S SY +SC S C + GC SG C Y+V YGDGS+
Sbjct: 159 CQPCKL---CYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSY 218
Query: 236 TTGDFATETLTFGNSKSIPNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLK---ASS 295
T G A ETLTF + + N+ +GCGH N+G+F+GAAGL+G+GGG++S QL +
Sbjct: 219 TKGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGA 278
Query: 296 FSYCLVDRDSDSSSTLEFD-SARPSDSITTPLLKNNRIDSYRYVQVTGMSVGGKALSISS 355
F YCLV R +DS+ +L F A P + PL++N R S+ YV + G+ VGG + +
Sbjct: 279 FGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPD 338
Query: 356 TRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLAG 415
F++ +G GG+++D+GT +TRLPT Y + R+ F +L A +S FDTCY+L+G
Sbjct: 339 GVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSG 398
Query: 416 QSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGMR 475
+V+VPTV+F ++G +L LPARN+L+ +D +GTYC AF T LSIIG+ QQ+G++
Sbjct: 399 FVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPT-GLSIIGNIQQEGIQ 458
Query: 476 VSYDLVNSLVGFSSNKC 483
VS+D N VGF N C
Sbjct: 459 VSFDGANGFVGFGPNVC 470
BLAST of CmaCh11G002640 vs. ExPASy Swiss-Prot
Match:
Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)
HSP 1 Score: 341.3 bits (874), Expect = 1.8e-92
Identity = 204/438 (46.58%), Postives = 275/438 (62.79%), Query Frame = 0
Query: 57 SITPPQFHSHHLSNSSLSLSLHSRLAIHKHNYKDYESLVRARLARDAARVQSLNRNLNLA 116
S+ +F S S SS S++L+ + K + L +RL RD+ RV+S+ L
Sbjct: 54 SLLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSI-ATLAAQ 113
Query: 117 LAG----DAVRPNSLTAPVVSGQSQGSGEYFARIAVGQPAQSFYLVPDTGSDITWLQCLP 176
+ G A RP ++ VVSG SQGSGEYF R+ VG PA+ Y+V DTGSDI WLQC P
Sbjct: 114 IPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP 173
Query: 177 CSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQSLNRPGCQS--GTCVYQVWYGDGSFT 236
C CY Q+DPIF+P S +Y + C S C+ L+ GC + TC+YQV YGDGSFT
Sbjct: 174 C---RRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFT 233
Query: 237 TGDFATETLTFGNSKSIPNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLK---ASSF 296
GDF+TETLTF ++ + + +GCGHDN+GLFVGAAGL+GLG G LS Q F
Sbjct: 234 VGDFSTETLTFRRNR-VKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKF 293
Query: 297 SYCLVDRDSDSS-STLEFDSARPSD-SITTPLLKNNRIDSYRYVQVTGMSVGG-KALSIS 356
SYCLVDR + S S++ F +A S + TPLL N ++D++ YV + G+SVGG + ++
Sbjct: 294 SYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVT 353
Query: 357 STRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLA 416
++ F++D G GG+I+DSGT +TRL Y ++R+AF GA++L A S FDTC++L+
Sbjct: 354 ASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLS 413
Query: 417 GQSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGM 476
+ V+VPTV +G + LPA NYLI +DT G +C AF T LSIIG+ QQQG
Sbjct: 414 NMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAG-TMGGLSIIGNIQQQGF 473
Query: 477 RVSYDLVNSLVGFSSNKC 483
RV YDL +S VGF+ C
Sbjct: 474 RVVYDLASSRVGFAPGGC 484
BLAST of CmaCh11G002640 vs. ExPASy Swiss-Prot
Match:
Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)
HSP 1 Score: 261.9 bits (668), Expect = 1.4e-68
Identity = 161/402 (40.05%), Postives = 227/402 (56.47%), Query Frame = 0
Query: 87 NYKDYESLVRARLARDAARVQSLNRNLNLALAGDAVRPNSLTAPVVSGQSQGSGEYFARI 146
N ++ L RA + R + R+Q L LN P+ + V + G GEY +
Sbjct: 53 NLTKFQLLERA-IERGSRRLQRLEAMLN--------GPSGVETSVYA----GDGEYLMNL 112
Query: 147 AVGQPAQSFYLVPDTGSDITWLQCLPCSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQ 206
++G PAQ F + DTGSD+ W QC PC+ C+ Q+ PIFNP SSS+ L C SQ CQ
Sbjct: 113 SIGTPAQPFSAIMDTGSDLIWTQCQPCT---QCFNQSTPIFNPQGSSSFSTLPCSSQLCQ 172
Query: 207 SLNRPGCQSGTCVYQVWYGDGSFTTGDFATETLTFGNSKSIPNLPIGCGHDNKGLFVG-A 266
+L+ P C + C Y YGDGS T G TETLTFG S SIPN+ GCG +N+G G
Sbjct: 173 ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGENNQGFGQGNG 232
Query: 267 AGLIGLGGGALSLSSQLKASSFSYCLVDRDSDSSSTLEFDSARPS---DSITTPLLKNNR 326
AGL+G+G G LSL SQL + FSYC+ S + S L S S S T L+++++
Sbjct: 233 AGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQ 292
Query: 327 IDSYRYVQVTGMSVGGKALSISSTRFEID-GSGMGGIIVDSGTFITRLPTDVYESLREAF 386
I ++ Y+ + G+SVG L I + F ++ +G GGII+DSGT +T + Y+S+R+ F
Sbjct: 293 IPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEF 352
Query: 387 VQGARSLTAAGAISPFDTCYNL-AGQSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGT 446
+ G+ S FD C+ + SN+Q+PT G+ L+LP+ NY I + G
Sbjct: 353 ISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGD-LELPSENYFI-SPSNGL 412
Query: 447 YCLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLVGFSSNKC 483
CLA + ++ +SI G+ QQQ M V YD NS+V F+S +C
Sbjct: 413 ICLA-MGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
BLAST of CmaCh11G002640 vs. ExPASy Swiss-Prot
Match:
Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)
HSP 1 Score: 257.7 bits (657), Expect = 2.6e-67
Identity = 160/401 (39.90%), Postives = 214/401 (53.37%), Query Frame = 0
Query: 87 NYKDYESLVRARLARDAARVQSLNRNLNLALAGDAVRPNSLTAPVVSGQSQGSGEYFARI 146
N YE L++ + R R++S+N L + + + PV + G GEY +
Sbjct: 54 NLTKYE-LIKRAIKRGERRMRSINAMLQSS--------SGIETPVYA----GDGEYLMNV 113
Query: 147 AVGQPAQSFYLVPDTGSDITWLQCLPCSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQ 206
A+G P SF + DTGSD+ W QC PC+ C+ Q PIFNP SSS+ L C+SQ CQ
Sbjct: 114 AIGTPDSSFSAIMDTGSDLIWTQCEPCT---QCFSQPTPIFNPQDSSSFSTLPCESQYCQ 173
Query: 207 SLNRPGCQSGTCVYQVWYGDGSFTTGDFATETLTFGNSKSIPNLPIGCGHDNKGLFVG-A 266
L C + C Y YGDGS T G ATET TF S S+PN+ GCG DN+G G
Sbjct: 174 DLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIAFGCGEDNQGFGQGNG 233
Query: 267 AGLIGLGGGALSLSSQLKASSFSYCLVDRDSDSSSTLEFDSAR---PSDSITTPLLKNNR 326
AGLIG+G G LSL SQL FSYC+ S S STL SA P S +T L+ ++
Sbjct: 234 AGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSL 293
Query: 327 IDSYRYVQVTGMSVGGKALSISSTRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFV 386
+Y Y+ + G++VGG L I S+ F++ G GG+I+DSGT +T LP D Y ++ +AF
Sbjct: 294 NPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFT 353
Query: 387 QGARSLTAAGAISPFDTCYNLAGQ-SNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTY 446
T + S TC+ S VQVP ++ + G +L L +N LI G
Sbjct: 354 DQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFD-GGVLNLGEQNILI-SPAEGVI 413
Query: 447 CLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLVGFSSNKC 483
CLA + +SI G+ QQQ +V YDL N V F +C
Sbjct: 414 CLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
BLAST of CmaCh11G002640 vs. TAIR 10
Match:
AT1G25510.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 477.2 bits (1227), Expect = 1.5e-134
Identity = 250/482 (51.87%), Postives = 342/482 (70.95%), Query Frame = 0
Query: 14 FSAFLFTAILNSLLFSSSLARVFTETTTVFDVSASSNRAQNALSI-TPPQFHSHHLSNSS 73
F F+F +S +FS L T TT++ +V+ S +R + S Q H ++SS
Sbjct: 7 FFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSASSS 66
Query: 74 LSLSLHSRLAIHKHNYKDYESLVRARLARDAARVQSLNRNLNLALAGDA---VRPNS--- 133
SL LHSR+++ + DY+SL ARL RD ARV+SL L+LA+ + ++P S
Sbjct: 67 FSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMY 126
Query: 134 ------LTAPVVSGQSQGSGEYFARIAVGQPAQSFYLVPDTGSDITWLQCLPCSIGNTCY 193
+ AP++SG +QGSGEYF R+ +G+PA+ Y+V DTGSD+ WLQC PC+ CY
Sbjct: 127 TTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCA---DCY 186
Query: 194 PQTDPIFNPTSSSSYRPLSCDSQQCQSLNRPGCQSGTCVYQVWYGDGSFTTGDFATETLT 253
QT+PIF P+SSSSY PLSCD+ QC +L C++ TC+Y+V YGDGS+T GDFATETLT
Sbjct: 187 HQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLT 246
Query: 254 FGNSKSIPNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLKASSFSYCLVDRDSDSSS 313
G S + N+ +GCGH N+GLFVGAAGL+GLGGG L+L SQL +SFSYCLVDRDSDS+S
Sbjct: 247 IG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSAS 306
Query: 314 TLEFDSARPSDSITTPLLKNNRIDSYRYVQVTGMSVGGKALSISSTRFEIDGSGMGGIIV 373
T++F ++ D++ PLL+N+++D++ Y+ +TG+SVGG+ L I + FE+D SG GGII+
Sbjct: 307 TVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 366
Query: 374 DSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLAGQSNVQVPTVAFELSK 433
DSGT +TRL T++Y SLR++FV+G L A ++ FDTCYNL+ ++ V+VPTVAF
Sbjct: 367 DSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPG 426
Query: 434 GNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLVGFSSN 483
G +L LPA+NY+I +D+ GT+CLAF T SSL+IIG+ QQQG RV++DL NSL+GFSSN
Sbjct: 427 GKMLALPAKNYMIPVDSVGTFCLAFAP-TASSLAIIGNVQQQGTRVTFDLANSLIGFSSN 483
BLAST of CmaCh11G002640 vs. TAIR 10
Match:
AT3G18490.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 436.8 bits (1122), Expect = 2.3e-122
Identity = 232/461 (50.33%), Postives = 311/461 (67.46%), Query Frame = 0
Query: 37 TETTTVFDVSASSNRAQNALSITPPQFHSHHLSNSSLSLSLHSRLAIHKHNYKDYESLVR 96
T+T D + SS S++ P F + S+S LSL LHSR +KDY+SL
Sbjct: 47 TQTILSLDPTRSSLTTTKPESLSDPVFFN---SSSPLSLELHSRDTFVASQHKDYKSLTL 106
Query: 97 ARLARDAARVQSLNRNLNLALAG-------------DAVRPNSLTAPVVSGQSQGSGEYF 156
+RL RD++RV + + A+ G + LT PVVSG SQGSGEYF
Sbjct: 107 SRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYF 166
Query: 157 ARIAVGQPAQSFYLVPDTGSDITWLQCLPCSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQ 216
+RI VG PA+ YLV DTGSD+ W+QC PC+ CY Q+DP+FNPTSSS+Y+ L+C +
Sbjct: 167 SRIGVGTPAKEMYLVLDTGSDVNWIQCEPCA---DCYQQSDPVFNPTSSSTYKSLTCSAP 226
Query: 217 QCQSLNRPGCQSGTCVYQVWYGDGSFTTGDFATETLTFGNSKSIPNLPIGCGHDNKGLFV 276
QC L C+S C+YQV YGDGSFT G+ AT+T+TFGNS I N+ +GCGHDN+GLF
Sbjct: 227 QCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFT 286
Query: 277 GAAGLIGLGGGALSLSSQLKASSFSYCLVDRDSDSSSTLEFDSAR-PSDSITTPLLKNNR 336
GAAGL+GLGGG LS+++Q+KA+SFSYCLVDRDS SS+L+F+S + T PLL+N +
Sbjct: 287 GAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKK 346
Query: 337 IDSYRYVQVTGMSVGGKALSISSTRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFV 396
ID++ YV ++G SVGG+ + + F++D SG GG+I+D GT +TRL T Y SLR+AF+
Sbjct: 347 IDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFL 406
Query: 397 QGARSL-TAAGAISPFDTCYNLAGQSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTY 456
+ +L + +IS FDTCY+ + S V+VPTVAF + G L LPA+NYLI +D +GT+
Sbjct: 407 KLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTF 466
Query: 457 CLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLVGFSSNKC 483
C AF T+SSLSIIG+ QQQG R++YDL +++G S NKC
Sbjct: 467 CFAFAP-TSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
BLAST of CmaCh11G002640 vs. TAIR 10
Match:
AT3G61820.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 359.0 bits (920), Expect = 6.0e-99
Identity = 220/485 (45.36%), Postives = 300/485 (61.86%), Query Frame = 0
Query: 22 ILNSLLFSSSLARVFTETTTVFDVSASSNRAQNALSITPPQFHS---HHLSNSSLSLSLH 81
+LN+L FS FT + + + N ++ +++ P+ S LS S+ SLS+H
Sbjct: 5 VLNTLAFSVFAVLFFTSSASSQYQTLVVNTLPSSATLSWPESESLTDESLSESTTSLSVH 64
Query: 82 SRLAIHKHNYKDYE--SLVRARLARDAARVQSLNRNLNLALAGDAVRPNSLTA-----PV 141
++ D L RL RD+ RV+S+ ++ +A + TA V
Sbjct: 65 LSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAV 124
Query: 142 VSGQSQGSGEYFARIAVGQPAQSFYLVPDTGSDITWLQCLPCSIGNTCYPQTDPIFNPTS 201
+SG SQGSGEYF R+ VG PA + Y+V DTGSD+ WLQC PC CY QTD IF+P
Sbjct: 125 ISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC---KACYNQTDAIFDPKK 184
Query: 202 SSSYRPLSCDSQQCQSLNRPG-C---QSGTCVYQVWYGDGSFTTGDFATETLTFGNSKSI 261
S ++ + C S+ C+ L+ C +S TC+YQV YGDGSFT GDF+TETLTF ++ +
Sbjct: 185 SKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR-V 244
Query: 262 PNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLK---ASSFSYCLVDRDSDSS----- 321
++P+GCGHDN+GLFVGAAGL+GLG G LS SQ K FSYCLVDR S S
Sbjct: 245 DHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPP 304
Query: 322 STLEF-DSARPSDSITTPLLKNNRIDSYRYVQVTGMSVGGKAL-SISSTRFEIDGSGMGG 381
ST+ F ++A P S+ TPLL N ++D++ Y+Q+ G+SVGG + +S ++F++D +G GG
Sbjct: 305 STIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGG 364
Query: 382 IIVDSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLAGQSNVQVPTVAFE 441
+I+DSGT +TRL Y +LR+AF GA L A + S FDTC++L+G + V+VPTV F
Sbjct: 365 VIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 424
Query: 442 LSKGNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLVGF 483
G + LPA NYLI ++T G +C AF T SLSIIG+ QQQG RV+YDLV S VGF
Sbjct: 425 FGGGE-VSLPASNYLIPVNTEGRFCFAFAG-TMGSLSIIGNIQQQGFRVAYDLVGSRVGF 483
BLAST of CmaCh11G002640 vs. TAIR 10
Match:
AT3G20015.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 346.7 bits (888), Expect = 3.1e-95
Identity = 186/437 (42.56%), Postives = 269/437 (61.56%), Query Frame = 0
Query: 56 LSITPPQFHSHHLSNSSLS---LSLHSRLAIHKHNYKDYESLVRARLARDAARVQSLNRN 115
++ T P F++ H S+ S S L L R Y+++ + AR+ RD RV ++ R
Sbjct: 39 VTATLPDFNNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRR 98
Query: 116 LN---LALAGDAVRPNSLTAPVVSGQSQGSGEYFARIAVGQPAQSFYLVPDTGSDITWLQ 175
++ + + N + +VSG QGSGEYF RI VG P + Y+V D+GSD+ W+Q
Sbjct: 99 ISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQ 158
Query: 176 CLPCSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQSLNRPGCQSGTCVYQVWYGDGSF 235
C PC + CY Q+DP+F+P S SY +SC S C + GC SG C Y+V YGDGS+
Sbjct: 159 CQPCKL---CYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSY 218
Query: 236 TTGDFATETLTFGNSKSIPNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLK---ASS 295
T G A ETLTF + + N+ +GCGH N+G+F+GAAGL+G+GGG++S QL +
Sbjct: 219 TKGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGA 278
Query: 296 FSYCLVDRDSDSSSTLEFD-SARPSDSITTPLLKNNRIDSYRYVQVTGMSVGGKALSISS 355
F YCLV R +DS+ +L F A P + PL++N R S+ YV + G+ VGG + +
Sbjct: 279 FGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPD 338
Query: 356 TRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLAG 415
F++ +G GG+++D+GT +TRLPT Y + R+ F +L A +S FDTCY+L+G
Sbjct: 339 GVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSG 398
Query: 416 QSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGMR 475
+V+VPTV+F ++G +L LPARN+L+ +D +GTYC AF T LSIIG+ QQ+G++
Sbjct: 399 FVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPT-GLSIIGNIQQEGIQ 458
Query: 476 VSYDLVNSLVGFSSNKC 483
VS+D N VGF N C
Sbjct: 459 VSFDGANGFVGFGPNVC 470
BLAST of CmaCh11G002640 vs. TAIR 10
Match:
AT1G01300.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 341.3 bits (874), Expect = 1.3e-93
Identity = 204/438 (46.58%), Postives = 275/438 (62.79%), Query Frame = 0
Query: 57 SITPPQFHSHHLSNSSLSLSLHSRLAIHKHNYKDYESLVRARLARDAARVQSLNRNLNLA 116
S+ +F S S SS S++L+ + K + L +RL RD+ RV+S+ L
Sbjct: 54 SLLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSI-ATLAAQ 113
Query: 117 LAG----DAVRPNSLTAPVVSGQSQGSGEYFARIAVGQPAQSFYLVPDTGSDITWLQCLP 176
+ G A RP ++ VVSG SQGSGEYF R+ VG PA+ Y+V DTGSDI WLQC P
Sbjct: 114 IPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP 173
Query: 177 CSIGNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQSLNRPGCQS--GTCVYQVWYGDGSFT 236
C CY Q+DPIF+P S +Y + C S C+ L+ GC + TC+YQV YGDGSFT
Sbjct: 174 C---RRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFT 233
Query: 237 TGDFATETLTFGNSKSIPNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLK---ASSF 296
GDF+TETLTF ++ + + +GCGHDN+GLFVGAAGL+GLG G LS Q F
Sbjct: 234 VGDFSTETLTFRRNR-VKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKF 293
Query: 297 SYCLVDRDSDSS-STLEFDSARPSD-SITTPLLKNNRIDSYRYVQVTGMSVGG-KALSIS 356
SYCLVDR + S S++ F +A S + TPLL N ++D++ YV + G+SVGG + ++
Sbjct: 294 SYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVT 353
Query: 357 STRFEIDGSGMGGIIVDSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLA 416
++ F++D G GG+I+DSGT +TRL Y ++R+AF GA++L A S FDTC++L+
Sbjct: 354 ASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLS 413
Query: 417 GQSNVQVPTVAFELSKGNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGM 476
+ V+VPTV +G + LPA NYLI +DT G +C AF T LSIIG+ QQQG
Sbjct: 414 NMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAG-TMGGLSIIGNIQQQGF 473
Query: 477 RVSYDLVNSLVGFSSNKC 483
RV YDL +S VGF+ C
Sbjct: 474 RVVYDLASSRVGFAPGGC 484
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9LS40 | 3.2e-121 | 50.33 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... | [more] |
Q9LHE3 | 4.3e-94 | 42.56 | Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... | [more] |
Q9LNJ3 | 1.8e-92 | 46.58 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... | [more] |
Q766C3 | 1.4e-68 | 40.05 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... | [more] |
Q766C2 | 2.6e-67 | 39.90 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... | [more] |