Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCAGCAGCATAAAAAGGCATGTTCAGATGCTCATATTCATAGTTTCTGATTCTTCTTCTTTTCCATTTTTTTCAACTTCAATTTTCAGCGGGAGAATGGTTAAGTGTTTCTCTCTCTTCGTCGCGCTCTCTCTCCTCGCCGTATCGGCGATCGGCGGGGATCATTTCTCCGGCGATGGTTATGGAGATTCCATAATCCGTCAGGTTGTCGACGATGGAGGAGTCAACGGAGGAGGAAGTAACGGCGACGATCTGCTACTCGGAGCTGAGCACCACTTTTCAGTCTTCAAGCAGAAGTTTGGGAAGTCATACGCCTCTAAGGAGGAGCATGATCATCGGTTCAGGGTCTTCAAAGCCAACCTGAGGCGAGCTCAGCGCCACCAGGCTCTTGATCCATCTGCTACTCATGGCGTCACTCAGTTCTCTGATTTGACACCATCGGAATTCCGAAGGTTGTTTCTAGGGCTTAGAGGTCGCCGTCTTGGACTCCCTGTGGACGCTAACAAAGCCCCTATTCTTCCTACTGATGGCCTTCCGACTGATTTCGATTGGAGAGATCATGGAGCTGTCACGGAAGTCAAGAATCAGGTTTGGTTCTTTTCTTTCTTCTTTTTTAATTTAGTCTTACGTTTATCTTTATTTTTCCTATAATTTTTTTTTTAATCTGGGGCGAAGAGTCATCTTCGTCTTATTTTAGTATGGAATTTAATGCTTTCAGATTCTCTTCCGTATCTTTTATATTTTTTTTCTTTATAATAAATAGTTCTTTTAGTTTCAACTCGTACTCTGTATCTGTTATTCCTTATGTTGACGATGACTTCCAATTAATTCTTTTAAAACCAAGATGACTAATATTTATTGAACGGAGGTTTTGATTGCTGTGATTGATATTGTTTCTGTTTTCTTATAAGGGTTCGTGTGGATCATGCTGGAGTTTCAGTTCAACCGGTGCTCTTGAGGGCGCTAACTTCCTTTCTTCCGGCGAACTTGTTAGCCTCAGCGAACAACAGCTTGTTGACTGTGATCATGAGGTTTGGATTTTAATTTTCATCCATCTTTGATTTGCAATTTATGAAGTGAGTTTTTCAGAATTTACGTTTTGATTATGATCGTTTATGATACTGTATTGGGTGTCAAATCTACCTTTCTGCACATGATTCCATTGTTTTAGGAAGCAAAATACCTGTTCTTGCGCTTGATGATCTATGTTTCCCGTTTATGTTGATGGTTGGTTGCTCTCTATTAGATTTAGTATCTCTATTGCAAACTTTGGAGACAATTGGTAAATATTTGCATCTGTTTAGTATTGGCTGTGCTCATAAATCATATGTACTAGCAAATTCGGCAGTCTTCAAAAGGATTCAAGTGCATGATTTAGCCTTACTTCAAGGGATGAATGATTTGGTTCACTTATCTACTCTGGAGCTTCATTAAGGAAGGTGTTTCTTTTTATGTTCATGCAATTTAATGCACTTCTATGGTGTTTGCCAGTGTGATCCAGAAGAAAAAGGTTCCTGTGATGCTGGCTGCAGTGGGGGGCTAATGAATAGTGCATTCGAGTACACATTGAAAGCCGGTGGACTAATGAAAGAGAAAGACTACCCATACACTGGTACAGATCGTGGAGCCTGCAAATTTGACAAGACCAAAATTGCAGCATCAGTAGCCAACTTTAGTGTGATCTCCCTTGATGAAGAACAAATTGCTGCCAATCTCGTGAAAAATGGCCCTCTTGCAAGTAATTACAGGCTCAACCATGCTATATCTTAAATCCTATTCTTTTCTTTATAGTTGCTCACTGCTCGATTTATCTTGAATGCAGTTGGTATCAATGCTGTATTCATGCAGACATATATTGGTGGGGTTTCTTGCCCTTACATTTGCTCAAAGCACTTAGATCATGGAGTTCTATTGGTTGGTTATGGATCAGCTGCCTATGCTCCCATCCGAATGAAAGAAAAACCTTACTGGATCATTAAGAACTCGTGGGGAGCCAAATGGGGGGAGAATGGATACTACAAACTCTGCAGGGGTCGCAATATCTGTGGTGTCGATTCCATGGTGTCGACAGTAGCTGCAGTTCATATCACCTCGAACTAGTATACGAACTGCTGGAGATTACTCACAGCTATAATTCCTGTATATATGGCAATATTATCACCTATGGAAGGAAGTTAAGTTATGAACTGCCATTTAAAGCGCGTAGAATTTGAACTCTCAGATTTCTGTTGGTTTAGAAACTTGAAAAAGTAGCTTATATACATGCATATTTATATTTAGCTCAAGCTGTTTTGGTTAAGTTTGGATTGTTTGTAAGCAAAAACAAGCTCTTTGCTATCTTTTTCTTCTGGTAAAGATTGCTTGTTGAGCTTGTGAACTCTCTCATGGTGAATCTTTTGAGTACAGTTGGGGGATATTGAAGCAATTGGCTATCTACTACAGCCCGCAACCTAAATTAAGTATGGAAAGAATATCAGATCAGCCGCAAAGGAAAGATCACCATTGCCAAGATCTTGCTTATGGGGTAAGTAATGAATCATATAGTTGGGAAAATAAGGTGTTTTGTTTCAATGGGTATTCTTAGTGACTTTCGTCATCAATTCGCAAGAAAGGTCATGCTCAAGTAAAAACTAGGACTCCTAAAATACCATTTCTTGCACGCCATATGTTCAAACTCTGAACCCCTCCCCCCAACAAAAAAAAAATAATAATAAAAAGATAAGGCCATCCCATTTCGACAAAAGACTCACCCAAGTTTAATAGCTTTGAGTCAGATATTTTGATCATGATCATGATTTTCCTACTTCAGAGGAAAAAGTCCTTCCCCTTTTTGGCCAGCTGTGGGTGAGGCGTTCAAACTATTTGATAGTATAGTTAAGCTCTATATGAAGCTGCCTCAGCCTAAAACATGATCAAATTAAGACTCCTCCTCAGCCTAAAACATGATCAAATTAAGACTCCTTGCTAGGACTTAAGAGCACACCTAGAATCTCATGTCTGACAAACATGCCTTAGCATGAACAATACTTGCCACCATCCTCATAAGCACATATCTTTTCAAAATCAT
mRNA sequence
GCAGCAGCATAAAAAGGCATGTTCAGATGCTCATATTCATAGTTTCTGATTCTTCTTCTTTTCCATTTTTTTCAACTTCAATTTTCAGCGGGAGAATGGTTAAGTGTTTCTCTCTCTTCGTCGCGCTCTCTCTCCTCGCCGTATCGGCGATCGGCGGGGATCATTTCTCCGGCGATGGTTATGGAGATTCCATAATCCGTCAGGTTGTCGACGATGGAGGAGTCAACGGAGGAGGAAGTAACGGCGACGATCTGCTACTCGGAGCTGAGCACCACTTTTCAGTCTTCAAGCAGAAGTTTGGGAAGTCATACGCCTCTAAGGAGGAGCATGATCATCGGTTCAGGGTCTTCAAAGCCAACCTGAGGCGAGCTCAGCGCCACCAGGCTCTTGATCCATCTGCTACTCATGGCGTCACTCAGTTCTCTGATTTGACACCATCGGAATTCCGAAGGTTGTTTCTAGGGCTTAGAGGTCGCCGTCTTGGACTCCCTGTGGACGCTAACAAAGCCCCTATTCTTCCTACTGATGGCCTTCCGACTGATTTCGATTGGAGAGATCATGGAGCTGTCACGGAAGTCAAGAATCAGGGTTCGTGTGGATCATGCTGGAGTTTCAGTTCAACCGGTGCTCTTGAGGGCGCTAACTTCCTTTCTTCCGGCGAACTTGTTAGCCTCAGCGAACAACAGCTTGTTGACTGTGATCATGAGTGTGATCCAGAAGAAAAAGGTTCCTGTGATGCTGGCTGCAGTGGGGGGCTAATGAATAGTGCATTCGAGTACACATTGAAAGCCGGTGGACTAATGAAAGAGAAAGACTACCCATACACTGGTACAGATCGTGGAGCCTGCAAATTTGACAAGACCAAAATTGCAGCATCAGTAGCCAACTTTAGTGTGATCTCCCTTGATGAAGAACAAATTGCTGCCAATCTCGTGAAAAATGGCCCTCTTGCAATTGGTATCAATGCTGTATTCATGCAGACATATATTGGTGGGGTTTCTTGCCCTTACATTTGCTCAAAGCACTTAGATCATGGAGTTCTATTGGTTGGTTATGGATCAGCTGCCTATGCTCCCATCCGAATGAAAGAAAAACCTTACTGGATCATTAAGAACTCGTGGGGAGCCAAATGGGGGGAGAATGGATACTACAAACTCTGCAGGGGTCGCAATATCTGTGGTGTCGATTCCATGGTGTCGACAGTAGCTGCAGTTCATATCACCTCGAACTAGTATACGAACTGCTGGAGATTACTCACAGCTATAATTCCTGTATATATGGCAATATTATCACCTATGGAAGGAAGTTAAGTTATGAACTGCCATTTAAAGCGCGTAGAATTTGAACTCTCAGATTTCTGTTGGTTTAGAAACTTGAAAAAGTAGCTTATATACATGCATATTTATATTTAGCTCAAGCTGTTTTGGTTAAGTTTGGATTGTTTGTAAGCAAAAACAAGCTCTTTGCTATCTTTTTCTTCTGGTAAAGATTGCTTGTTGAGCTTGTGAACTCTCTCATGGTGAATCTTTTGAGTACAGTTGGGGGATATTGAAGCAATTGGCTATCTACTACAGCCCGCAACCTAAATTAAGTATGGAAAGAATATCAGATCAGCCGCAAAGGAAAGATCACCATTGCCAAGATCTTGCTTATGGGGTAAGTAATGAATCATATAGTTGGGAAAATAAGGTGTTTTGTTTCAATGGGTATTCTTAGTGACTTTCGTCATCAATTCGCAAGAAAGGTCATGCTCAAGTAAAAACTAGGACTCCTAAAATACCATTTCTTGCACGCCATATGTTCAAACTCTGAACCCCTCCCCCCAACAAAAAAAAAATAATAATAAAAAGATAAGGCCATCCCATTTCGACAAAAGACTCACCCAAGTTTAATAGCTTTGAGTCAGATATTTTGATCATGATCATGATTTTCCTACTTCAGAGGAAAAAGTCCTTCCCCTTTTTGGCCAGCTGTGGGTGAGGCGTTCAAACTATTTGATAGTATAGTTAAGCTCTATATGAAGCTGCCTCAGCCTAAAACATGATCAAATTAAGACTCCTCCTCAGCCTAAAACATGATCAAATTAAGACTCCTTGCTAGGACTTAAGAGCACACCTAGAATCTCATGTCTGACAAACATGCCTTAGCATGAACAATACTTGCCACCATCCTCATAAGCACATATCTTTTCAAAATCAT
Coding sequence (CDS)
ATGCTCATATTCATAGTTTCTGATTCTTCTTCTTTTCCATTTTTTTCAACTTCAATTTTCAGCGGGAGAATGGTTAAGTGTTTCTCTCTCTTCGTCGCGCTCTCTCTCCTCGCCGTATCGGCGATCGGCGGGGATCATTTCTCCGGCGATGGTTATGGAGATTCCATAATCCGTCAGGTTGTCGACGATGGAGGAGTCAACGGAGGAGGAAGTAACGGCGACGATCTGCTACTCGGAGCTGAGCACCACTTTTCAGTCTTCAAGCAGAAGTTTGGGAAGTCATACGCCTCTAAGGAGGAGCATGATCATCGGTTCAGGGTCTTCAAAGCCAACCTGAGGCGAGCTCAGCGCCACCAGGCTCTTGATCCATCTGCTACTCATGGCGTCACTCAGTTCTCTGATTTGACACCATCGGAATTCCGAAGGTTGTTTCTAGGGCTTAGAGGTCGCCGTCTTGGACTCCCTGTGGACGCTAACAAAGCCCCTATTCTTCCTACTGATGGCCTTCCGACTGATTTCGATTGGAGAGATCATGGAGCTGTCACGGAAGTCAAGAATCAGGGTTCGTGTGGATCATGCTGGAGTTTCAGTTCAACCGGTGCTCTTGAGGGCGCTAACTTCCTTTCTTCCGGCGAACTTGTTAGCCTCAGCGAACAACAGCTTGTTGACTGTGATCATGAGTGTGATCCAGAAGAAAAAGGTTCCTGTGATGCTGGCTGCAGTGGGGGGCTAATGAATAGTGCATTCGAGTACACATTGAAAGCCGGTGGACTAATGAAAGAGAAAGACTACCCATACACTGGTACAGATCGTGGAGCCTGCAAATTTGACAAGACCAAAATTGCAGCATCAGTAGCCAACTTTAGTGTGATCTCCCTTGATGAAGAACAAATTGCTGCCAATCTCGTGAAAAATGGCCCTCTTGCAATTGGTATCAATGCTGTATTCATGCAGACATATATTGGTGGGGTTTCTTGCCCTTACATTTGCTCAAAGCACTTAGATCATGGAGTTCTATTGGTTGGTTATGGATCAGCTGCCTATGCTCCCATCCGAATGAAAGAAAAACCTTACTGGATCATTAAGAACTCGTGGGGAGCCAAATGGGGGGAGAATGGATACTACAAACTCTGCAGGGGTCGCAATATCTGTGGTGTCGATTCCATGGTGTCGACAGTAGCTGCAGTTCATATCACCTCGAACTAG
Protein sequence
MLIFIVSDSSSFPFFSTSIFSGRMVKCFSLFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHHFSVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRLFLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEGANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGGVSCPYICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRNICGVDSMVSTVAAVHITSN
Homology
BLAST of CmaCh02G003710 vs. ExPASy Swiss-Prot
Match:
P43296 (Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1)
HSP 1 Score: 552.4 bits (1422), Expect = 4.3e-156
Identity = 277/378 (73.28%), Postives = 311/378 (82.28%), Query Frame = 0
Query: 23 RMVKCFSLFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEH 82
R+ FS+FV LS VS D GD D +IRQVV GG+ + +L +E
Sbjct: 3 RLKLYFSVFV-LSFFIVSVSSSDVNDGD---DLVIRQVV-------GGA--EPQVLTSED 62
Query: 83 HFSVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRR 142
HFS+FK+KFGK YAS EEHD+RF VFKANLRRA+RHQ LDPSATHGVTQFSDLT SEFR+
Sbjct: 63 HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRK 122
Query: 143 LFLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGAL 202
LG+R LP DANKAPILPT+ LP DFDWRDHGAVT VKNQGSCGSCWSFS+TGAL
Sbjct: 123 KHLGVRS-GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGAL 182
Query: 203 EGANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEK 262
EGANFL++G+LVSLSEQQLVDCDHECDPEE SCD+GC+GGLMNSAFEYTLK GGLMKE+
Sbjct: 183 EGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEE 242
Query: 263 DYPYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIG 322
DYPYTG D CK DK+KI ASV+NFSVIS+DEEQIAANLVKNGPLA+ INA +MQTYIG
Sbjct: 243 DYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIG 302
Query: 323 GVSCPYICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRN 382
GVSCPYIC++ L+HGVLLVGYG+A YAP R KEKPYWIIKNSWG WGENG+YK+C+GRN
Sbjct: 303 GVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRN 362
Query: 383 ICGVDSMVSTVAAVHITS 401
ICGVDSMVSTVAA T+
Sbjct: 363 ICGVDSMVSTVAATVSTT 366
BLAST of CmaCh02G003710 vs. ExPASy Swiss-Prot
Match:
Q9SUL1 (Probable cysteine protease RD19C OS=Arabidopsis thaliana OX=3702 GN=RD19C PE=2 SV=1)
HSP 1 Score: 534.6 bits (1376), Expect = 9.3e-151
Identity = 263/374 (70.32%), Postives = 307/374 (82.09%), Query Frame = 0
Query: 28 FSLFVALSLLAVSAIGGDHFSG---DGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHHF 87
F +A +LLA ++G SG DG+ + IRQVV + D+ LL AEHHF
Sbjct: 6 FFFLIAATLLA-GSLGSTVISGEVTDGFVNP-IRQVVPE--------ENDEQLLNAEHHF 65
Query: 88 SVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRLF 147
++FK K+ K+YA++ EHDHRFRVFKANLRRA+R+Q LDPSA HGVTQFSDLTP EFRR F
Sbjct: 66 TLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKF 125
Query: 148 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEG 207
LGL+ R LP D APILPT LPT+FDWR+ GAVT VKNQG CGSCWSFS+ GALEG
Sbjct: 126 LGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEG 185
Query: 208 ANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDY 267
A+FL++ ELVSLSEQQLVDCDHECDP + SCD+GCSGGLMN+AFEY LKAGGLMKE+DY
Sbjct: 186 AHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDY 245
Query: 268 PYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGGV 327
PYTG D ACKFDK+KI ASV+NFSV+S DE+QIAANLV++GPLAI INA++MQTYIGGV
Sbjct: 246 PYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGV 305
Query: 328 SCPYICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRG-RNI 387
SCPY+CSK DHGVLLVG+GS+ YAPIR+KEKPYWIIKNSWGA WGE+GYYK+CRG N+
Sbjct: 306 SCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNM 365
Query: 388 CGVDSMVSTVAAVH 398
CG+D+MVSTVAAVH
Sbjct: 366 CGMDTMVSTVAAVH 369
BLAST of CmaCh02G003710 vs. ExPASy Swiss-Prot
Match:
P43295 (Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 SV=2)
HSP 1 Score: 529.6 bits (1363), Expect = 3.0e-149
Identity = 255/368 (69.29%), Postives = 303/368 (82.34%), Query Frame = 0
Query: 28 FSLFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHHFSVF 87
FS+ + ++VS G + D +IRQVVD+ + +L +E HF++F
Sbjct: 9 FSVSLIFVFVSVSVCGDE--------DVLIRQVVDE---------TEPKVLSSEDHFTLF 68
Query: 88 KQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRLFLGL 147
K+KFGK Y S EEH +RF VFKANL RA RHQ +DPSA HGVTQFSDLT SEFRR LG+
Sbjct: 69 KKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGV 128
Query: 148 RGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEGANF 207
+G LP DAN+APILPT LP +FDWRD GAVT VKNQGSCGSCWSFS+TGALEGA+F
Sbjct: 129 KG-GFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHF 188
Query: 208 LSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDYPYT 267
L++G+LVSLSEQQLVDCDHECDPEE+GSCD+GC+GGLMNSAFEYTLK GGLM+EKDYPYT
Sbjct: 189 LATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYT 248
Query: 268 GTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGGVSCP 327
GTD G+CK D++KI ASV+NFSV+S++E+QIAANL+KNGPLA+ INA +MQTYIGGVSCP
Sbjct: 249 GTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP 308
Query: 328 YICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRNICGVD 387
YICS+ L+HGVLLVGYGSA ++ R+KEKPYWIIKNSWG WGENG+YK+C+GRNICGVD
Sbjct: 309 YICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVD 358
Query: 388 SMVSTVAA 396
S+VSTVAA
Sbjct: 369 SLVSTVAA 358
BLAST of CmaCh02G003710 vs. ExPASy Swiss-Prot
Match:
P25804 (Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1)
HSP 1 Score: 516.9 bits (1330), Expect = 2.0e-145
Identity = 252/367 (68.66%), Postives = 300/367 (81.74%), Query Frame = 0
Query: 30 LFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHHFSVFKQ 89
LF AV+ D + D D IIRQVVD N +D LL AEHHF+ FK
Sbjct: 6 LFALFLFAAVATAVTDDTNND---DFIIRQVVD---------NEEDHLLNAEHHFTSFKS 65
Query: 90 KFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRLFLGLRG 149
KF KSYA+KEEHD+RF VFK+NL +A+ HQ DP+A HG+T+FSDLT SEFRR FLGL+
Sbjct: 66 KFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLK- 125
Query: 150 RRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEGANFLS 209
+RL LP A KAPILPT LP DFDWR+ GAVT VK+QGSCGSCW+FS+TGALEGA++L+
Sbjct: 126 KRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLA 185
Query: 210 SGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDYPYTGT 269
+G+LVSLSEQQLVDCDH CDPE+ GSCD+GC+GGLMN+AFEY L++GG+++EKDY YTG
Sbjct: 186 TGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGR 245
Query: 270 DRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGGVSCPYI 329
D G+CKFDK+K+ ASV+NFSV++LDE+QIAANLVKNGPLA+ INA +MQTY+ GVSCPY+
Sbjct: 246 D-GSCKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYV 305
Query: 330 CSK-HLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRNICGVDS 389
C+K LDHGVLLVG+G AYAPIR+KEKPYWIIKNSWG WGE GYYK+CRGRN+CGVDS
Sbjct: 306 CAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDS 358
Query: 390 MVSTVAA 396
MVSTVAA
Sbjct: 366 MVSTVAA 358
BLAST of CmaCh02G003710 vs. ExPASy Swiss-Prot
Match:
Q10716 (Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1)
HSP 1 Score: 503.8 bits (1296), Expect = 1.8e-141
Identity = 244/351 (69.52%), Postives = 283/351 (80.63%), Query Frame = 0
Query: 54 DSIIRQVVDDGGVNGGGSNGDDLLLGAEHHFSVFKQKFGKSYASKEEHDHRFRVFKANLR 113
D +IRQVV G + +DL L AE HF F Q+FGKSY +EH +R VFK NLR
Sbjct: 25 DPLIRQVVP-------GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLR 84
Query: 114 RAQRHQALDPSATHGVTQFSDLTPSEFRRLFLGLRGRRLG----LPVDANKAPILPTDGL 173
RA+RHQ LDPSA HGVT+FSDLTP+EFRR +LGLR R L A++AP+LPTDGL
Sbjct: 85 RARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGL 144
Query: 174 PTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEGANFLSSGELVSLSEQQLVDCDHECD 233
P DFDWRDHGAV VKNQGSCGSCWSFS++GALEGA++L++G+L LSEQQ VDCDHECD
Sbjct: 145 PDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECD 204
Query: 234 PEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGACKFDKTKIAASVANFS 293
E SCD+GC+GGLM +AF Y KAGGL EKDYPYTG+D G CKFDK+KI ASV NFS
Sbjct: 205 SSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFS 264
Query: 294 VISLDEEQIAANLVKNGPLAIGINAVFMQTYIGGVSCPYICSKHLDHGVLLVGYGSAAYA 353
V+S+DE QI+ANL+K+GPLAIGINA +MQTYIGGVSCPYIC +HLDHGVLLVGYG++ +A
Sbjct: 265 VVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFA 324
Query: 354 PIRMKEKPYWIIKNSWGAKWGENGYYKLCRG---RNICGVDSMVSTVAAVH 398
PIR+K+KPYWIIKNSWG WGENGYYK+CRG RN CGVDSMVSTV+AVH
Sbjct: 325 PIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSAVH 367
BLAST of CmaCh02G003710 vs. TAIR 10
Match:
AT4G39090.1 (Papain family cysteine protease )
HSP 1 Score: 552.4 bits (1422), Expect = 3.1e-157
Identity = 277/378 (73.28%), Postives = 311/378 (82.28%), Query Frame = 0
Query: 23 RMVKCFSLFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEH 82
R+ FS+FV LS VS D GD D +IRQVV GG+ + +L +E
Sbjct: 3 RLKLYFSVFV-LSFFIVSVSSSDVNDGD---DLVIRQVV-------GGA--EPQVLTSED 62
Query: 83 HFSVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRR 142
HFS+FK+KFGK YAS EEHD+RF VFKANLRRA+RHQ LDPSATHGVTQFSDLT SEFR+
Sbjct: 63 HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRK 122
Query: 143 LFLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGAL 202
LG+R LP DANKAPILPT+ LP DFDWRDHGAVT VKNQGSCGSCWSFS+TGAL
Sbjct: 123 KHLGVRS-GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGAL 182
Query: 203 EGANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEK 262
EGANFL++G+LVSLSEQQLVDCDHECDPEE SCD+GC+GGLMNSAFEYTLK GGLMKE+
Sbjct: 183 EGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEE 242
Query: 263 DYPYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIG 322
DYPYTG D CK DK+KI ASV+NFSVIS+DEEQIAANLVKNGPLA+ INA +MQTYIG
Sbjct: 243 DYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIG 302
Query: 323 GVSCPYICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRN 382
GVSCPYIC++ L+HGVLLVGYG+A YAP R KEKPYWIIKNSWG WGENG+YK+C+GRN
Sbjct: 303 GVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRN 362
Query: 383 ICGVDSMVSTVAAVHITS 401
ICGVDSMVSTVAA T+
Sbjct: 363 ICGVDSMVSTVAATVSTT 366
BLAST of CmaCh02G003710 vs. TAIR 10
Match:
AT4G16190.1 (Papain family cysteine protease )
HSP 1 Score: 534.6 bits (1376), Expect = 6.6e-152
Identity = 263/374 (70.32%), Postives = 307/374 (82.09%), Query Frame = 0
Query: 28 FSLFVALSLLAVSAIGGDHFSG---DGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHHF 87
F +A +LLA ++G SG DG+ + IRQVV + D+ LL AEHHF
Sbjct: 6 FFFLIAATLLA-GSLGSTVISGEVTDGFVNP-IRQVVPE--------ENDEQLLNAEHHF 65
Query: 88 SVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRLF 147
++FK K+ K+YA++ EHDHRFRVFKANLRRA+R+Q LDPSA HGVTQFSDLTP EFRR F
Sbjct: 66 TLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKF 125
Query: 148 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEG 207
LGL+ R LP D APILPT LPT+FDWR+ GAVT VKNQG CGSCWSFS+ GALEG
Sbjct: 126 LGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEG 185
Query: 208 ANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDY 267
A+FL++ ELVSLSEQQLVDCDHECDP + SCD+GCSGGLMN+AFEY LKAGGLMKE+DY
Sbjct: 186 AHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDY 245
Query: 268 PYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGGV 327
PYTG D ACKFDK+KI ASV+NFSV+S DE+QIAANLV++GPLAI INA++MQTYIGGV
Sbjct: 246 PYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGV 305
Query: 328 SCPYICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRG-RNI 387
SCPY+CSK DHGVLLVG+GS+ YAPIR+KEKPYWIIKNSWGA WGE+GYYK+CRG N+
Sbjct: 306 SCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNM 365
Query: 388 CGVDSMVSTVAAVH 398
CG+D+MVSTVAAVH
Sbjct: 366 CGMDTMVSTVAAVH 369
BLAST of CmaCh02G003710 vs. TAIR 10
Match:
AT2G21430.1 (Papain family cysteine protease )
HSP 1 Score: 529.6 bits (1363), Expect = 2.1e-150
Identity = 255/368 (69.29%), Postives = 303/368 (82.34%), Query Frame = 0
Query: 28 FSLFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHHFSVF 87
FS+ + ++VS G + D +IRQVVD+ + +L +E HF++F
Sbjct: 9 FSVSLIFVFVSVSVCGDE--------DVLIRQVVDE---------TEPKVLSSEDHFTLF 68
Query: 88 KQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRLFLGL 147
K+KFGK Y S EEH +RF VFKANL RA RHQ +DPSA HGVTQFSDLT SEFRR LG+
Sbjct: 69 KKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGV 128
Query: 148 RGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEGANF 207
+G LP DAN+APILPT LP +FDWRD GAVT VKNQGSCGSCWSFS+TGALEGA+F
Sbjct: 129 KG-GFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHF 188
Query: 208 LSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDYPYT 267
L++G+LVSLSEQQLVDCDHECDPEE+GSCD+GC+GGLMNSAFEYTLK GGLM+EKDYPYT
Sbjct: 189 LATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYT 248
Query: 268 GTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGGVSCP 327
GTD G+CK D++KI ASV+NFSV+S++E+QIAANL+KNGPLA+ INA +MQTYIGGVSCP
Sbjct: 249 GTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP 308
Query: 328 YICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRNICGVD 387
YICS+ L+HGVLLVGYGSA ++ R+KEKPYWIIKNSWG WGENG+YK+C+GRNICGVD
Sbjct: 309 YICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVD 358
Query: 388 SMVSTVAA 396
S+VSTVAA
Sbjct: 369 SLVSTVAA 358
BLAST of CmaCh02G003710 vs. TAIR 10
Match:
AT3G54940.2 (Papain family cysteine protease )
HSP 1 Score: 427.2 bits (1097), Expect = 1.5e-119
Identity = 203/323 (62.85%), Postives = 255/323 (78.95%), Query Frame = 0
Query: 77 LLG--AEHHFSVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSD 136
LLG E F +F +GK+Y+++EE+ HR +F N+ +A HQ +DPSA HGVTQFSD
Sbjct: 42 LLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSD 101
Query: 137 LTPSEFRRLFLGLR--GRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGS 196
LT EF+R++ G+ G G V A +AP++ DGLP DFDWR+ G VTEVKNQG+CGS
Sbjct: 102 LTEEEFKRMYTGVADVGGSRGGTVGA-EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGS 161
Query: 197 CWSFSSTGALEGANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYT 256
CW+FS+TGA EGA+F+S+G+L+SLSEQQLVDCD CDP++K +CD GC GGLM +A+EY
Sbjct: 162 CWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYL 221
Query: 257 LKAGGLMKEKDYPYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGI 316
++AGGL +E+ YPYTG RG CKFD K+A V NF+ I LDE QIAANLV++GPLA+G+
Sbjct: 222 MEAGGLEEERSYPYTG-KRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGL 281
Query: 317 NAVFMQTYIGGVSCPYICSK-HLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGE 376
NAVFMQTYIGGVSCP ICSK +++HGVLLVGYGS ++ +R+ KPYWIIKNSWG KWGE
Sbjct: 282 NAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGE 341
Query: 377 NGYYKLCRGRNICGVDSMVSTVA 395
NGYYKLCRG +ICG++SMVS VA
Sbjct: 342 NGYYKLCRGHDICGINSMVSAVA 362
BLAST of CmaCh02G003710 vs. TAIR 10
Match:
AT3G19390.1 (Granulin repeat cysteine protease family protein )
HSP 1 Score: 225.7 bits (574), Expect = 6.6e-59
Identity = 128/310 (41.29%), Postives = 178/310 (57.42%), Query Frame = 0
Query: 93 KSYASKEEHDHRFRVFKANLRRAQRHQALDPSATH--GVTQFSDLTPSEFRRLFLGLRGR 152
K+Y E + RF +FK NL+ + H ++ P+ T+ G+T+F+DLT EFR ++L +
Sbjct: 52 KNYNGLGEKERRFEIFKDNLKFVEEHSSI-PNRTYEVGLTRFADLTNDEFRAIYLRSKME 111
Query: 153 RLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALEGANFLSS 212
R +PV K D LP DWR GAV VK+QGSCGSCW+FS+ GA+EG N + +
Sbjct: 112 RTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKT 171
Query: 213 GELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKDYPYTGTD 272
GEL+SLSEQ+LVDCD S + GC GGLM+ AF++ ++ GG+ E+DYPY TD
Sbjct: 172 GELISLSEQELVDCD--------TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATD 231
Query: 273 RGACKFDKTKI-AASVANFSVISLDEEQIAANLVKNGPLAIGINA--VFMQTYIGGVSCP 332
C DK ++ + + ++E+ + N P+++ I A Q Y GV
Sbjct: 232 VNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTG 291
Query: 333 YICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRNI---- 392
C LDHGV+ VGYGS + YWI++NSWG+ WGE+GY+KL RNI
Sbjct: 292 -TCGTSLDHGVVAVGYGSEG-------GQDYWIVRNSWGSNWGESGYFKL--ERNIKESS 342
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
P43296 | 4.3e-156 | 73.28 | Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1 | [more] |
Q9SUL1 | 9.3e-151 | 70.32 | Probable cysteine protease RD19C OS=Arabidopsis thaliana OX=3702 GN=RD19C PE=2 S... | [more] |
P43295 | 3.0e-149 | 69.29 | Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 S... | [more] |
P25804 | 2.0e-145 | 68.66 | Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1 | [more] |
Q10716 | 1.8e-141 | 69.52 | Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1 | [more] |