Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GGCTCTGTTTCTCCCTCCGCGTAGTAAAAGTCCACTTTTCTCTCCACGTTGTCTGCATCACAGTGAAACCTACCATTGATAGAGAAACCTCACGACTTCTTCCATGCCCTCCTTCACTACCTTCCTCTAATTAAATATTCAAACCCAAATCTTCACGAAATCCATATGGTTCGAACCCGAATCAAACTCATCTCCGCCATTACTGTTTCTTAATTCCCAAAAAACAGAGCATCACCTCTGTTTCATTGAGCCCACCTGAAGCTTCCTCCATCGGTGACGTTTCTTCTCATTTTTTGTATATAAAATTGATATGGGAATGGAATAATTCAAGGATACCCTGGAATCATGGTCGATTAGTGTTGGCAATCTTGACTATTTCTTCTTTTTCCTTCAATCATGCCCTATTAGTGCTCTTAGTCCTCCTCACGATCAATCCTCATTCTCTTTTTCTCTGCTTTCCCGTTCTCAATGGGGGAATGGTGAAGCCCTTCAGTTCATCCATTTGGTGCAACATTTTTCTGGCTATCTGTTTCGATTGTTCTAGTTTTAGGCTTTTTTGAAGTGGGTTTCTCCTAATCTATCCATTTGAGCAAGATTTTTCTTGCTATCTGTTCCGATTGTGGCTAACTGTTCTAGTTTTAGGCTTCTCTGAAGTGGGTTTCTTCTAATCCATCCATTTGGGCAAGATTTTTCTTGCTATCTGTACCGATTCTTCCAGTTTTTGGCTTCTCTGAAGTGGGTTTCTTCTAATCCATCCATTTGGGGCAAGAACTTTCCGGCAATCTGCTCCGATTGTTCCAGTTTTAGGCTTCTCTGAAGTGGGCTTCGTCTACTCTTTGCTGATAAGGTCTGTTTATTGAATCCTTTTATAAACCTGTGTTTTCTTTTGGGGTTTGAGATTGTTCGATCATACAATGCATTATGAATCATTGTTTGATTTTGATCTGTTTGTGGCGTTTTCGGACATTAGATGTTGTTTTTTGTTATGCAGCTGCTGGATTTAACTTAAGAACTTGTTTTGAGTTTGACAATGGCATCAAAGTCATTCAAGCCAAACCGTTCAAATTTGTCCACAGCTTCTGATGCATCTGAAGCACAGAAGCCTCCTCTTCCACCTACTGTGACATTCGGTCGGAGAACCTCCTCCGGTCGCTATATTAGCTACTCGAGGGATGATCTCGATAGCGAGCTTGGGAGTGGTGACTTTATGAACTATACTGTACACATTCCTCCAACACCTGATAATCAACCAATGGATCCTTCAATCTCACAGAAGGTTGAAGAGCAATACGTATCGAATTCGCTGTTTACCGGTGGGTTCAATAACATAACACGAGCTCATTTAATGGACAAAGTGATTGAATCTGAAGCAACACATCCTCAAATGGCGGGTACGAAAGGATCTTCGTGTTCTATACCTGGCTGTGATGCAAAGGTTATGAGCGACGAACGTGGAAACGATATACTCCCTTGTGAATGCGATTTCAAGATATGTCGAGATTGCTATGTCGATGCTGTTAAACTAGGTGGTGGGATTTGTCCAGGCTGCAAAGAACCGTATAAAAACACAGATCTTGATGAAATTGCTGTTGAACATGGAAGACCGCTTCCGCTTCCTCCGCCAGCCACAATGTCGAAGATGGAGAGGAGGCTGTCGTTGATGAAGTCGACGAAATCTGCGTTGATGCGAAGCCACACGGGGGTTGGAGAATTTGATCATAATAAATGGCTATTCGAAACGAGAGGAACTTATGGATATGGGAATGCTATATGGCCAAAGGATGAGGGTTTTGAAAATGGTAATACTGATGAAGTTGAGCCTATGGAGTTTATGAATAAACCGTGGCGGCCCCTAACTCGAAAGTTGAAGATTCCTGCTGCTGTTCTTAGCCCGTATCGGTATGTTCTTTGTTTATAAGTGGTATTGATATTCTAGCATGCTATGGGATTTCATCCAACTTATTGTTGAATAACATTGTTGTGACAGACTTTTGATCGTC
GTTCGAATGGTCGTGCTCGGGTTCTTCTTGGCTTGGCGAGTGAGCCATCCGAACACCGATGCATACTGGTTGTGGGCTATGTCTATAGTTTGTGAGATTTGGTTTGCTTTTTCTTGGCTGCTTGATCAGCTGCCAAAGTTGTGCCCCATCAATAGAGCTACTGATCTTAACGTGTTGACGGAGAAATTCGAAACGCCTAGTCCGAGTAATCCTACCGGAAAATCTGATCTACCAGGCATAGATATCTTTGTTTCTACTGCAGATCCCGAGAAAGAACCACCTCTTGTAACTGCGAACACAATCCTTTCGATTCTAGCTGCAGATTATCCAGTTGAAAAGCTTGCTTGTTATGTTTCTGATGATGGAGGTGCGCTTTTAACTTTCGAGGCCATGGCTGAAGCTGCAAGTTTTGCTAATACTTGGGTTCCTTTCTGTCGAAAACATGGCATCGAACCGCGCAATCCTGAGTCTTATTTTAGTTTGAAAAGAGATCCATTCAAGAACAAAGTTAAGCCAGATTTTGTTAAGGATCGTAGACGTGTTAAGCGGGAGTATGACGAGTTCAAAGTTCGTATAAATGGACTTCCTGACTCTATTCGTCGTCGCTCGGATGCTTATCATGCACGAGAAGAAATCAAAGCTATGAAGCTTCAGAAACAGAACATTGGTGCTGATGAGCCGATAGAGAGTGTGAAAATCGCTAAAGCGACATGGATGGCTGATGGCACACATTGGCCAGGGACTTGGTTGCAGCCATCGTCTGAGCACTCGAAGGGTGACCATGCTGGTATCATACAGGTACAACGTCGAACGTACTCAAAGATCATTGTTTAAATAGGGATTTCAATCTGTTTTTTCACCGAGAGTCTATCTTGTTTTGTTATAGGTGATGTTGAAGCCACCTAGTGATGAACCTCTTCATGGAAATGTTGAAGATGAGAAACTTATCGACACTTCTGAGGTCGATATTCGTCTTCCTTTACTCGTTTATGTTTCTCGAGAGAAACGACCAGGCTATGACCACAACAAGAAGGCAGGAGCGATGAATGCTCTAGTTCGAGCCTCGGCAATCATGTCGAATGGTCCGTTCATTCTCAACCTCGATTGTGACCACTATATCTACAACTCTCAGGCAATGAGAGAAGGAATGTGCTTCATGATGGATCGTGGAGGCGATCGTCTTTGCTATGTCCAATTCCCTCAAAGGTTCGAGGGCATTGATCCTTCTGATCGATATGCAAATCACAACACTGTGTTTTTCGACGTTAACATGCGAGCTCTTGATGGGCTTCAAGGACCAGTTTACGTCGGAACAGGATGTCTCTTTAGAAGGGTTGCCCTATATGGTTTCGATCCACCTCGATCAAAAGAGCATCACCCTGGTTTTTGTAGTTGTTGTTGTGGCGGACGAAAAAAGCATACATCAGTTGCGAGCACACCGGAAGAGAGCAGAGCTTTGAGAATGGGTGATTCTGATGATGAAGAAATGAATCTCTCTTTGTTTCCTAAGAGATTTGGGAACTCTACTTTCCTTATTGATTCAATCCCGGTTGCTGAATTTCAAGGCCGCCCCTTGGCCGATCACCCTGCTGTGAAGAACGGACGTCCACCGGGTGCTCTTACGATCCCTCGTGATCTTCTCGATGCTTCAACAGTTGCAGAGGCAATCAGTGTCATTTCTTGCTGGTACGAAGATAAGACCGAATGGGGTAACCGTGTTGGATGGATTTATGGATCTGTTACTGAGGATGTGGTCACCGGATATAGGATGCATAATAGAGGATGGAAATCGGTGTACTGCGTAACGAAACGAGACGCTTTTCGTGGGACAGCTCCGATCAACCTAACAGATAGGCTGCATCAAGTCCTCCGATGGGCTACCGGGTCGGTCGAGATCTTCTTCTCCCGCAACAACGCCATCCTAGCTAGTCCAAGAATGAAACTTCTACAAAGAATAGCATACTTAAACGTGGGGATATATCCATTCACTTCAATCTTCC
TCATAGTATATTGCTTTCTACCAGCACTGTCACTGTTCTCCGGTCAGTTCATCGTCCAAACGCTTAACGTCACGTTCCTTACATACCTTCTGGTTATCACGTTAACGTTGTGCATGCTTGCGGTGCTCGAGATCCGATGGTCTGGTATTGAATTAGAAGAGTGGTGGAGGAATGAGCAGTTCTGGTTGATTGGTGGTACAAGTGCACATCTTGCTGCTGTACTTCAGGGTCTGCTAAAAGTCGTTGCTGGGATCGAAATATCGTTCACTTTGACGTCGAAATCGGGAGGTGACGACGTAGACGACGAGTTTGCTGATCTCTACATTGTGAAATGGACATCTCTAATGATACCACCAATCACGATCATGATAACGAACTTAATTGCAATAGCAGTCGGGTTTAGCCGAACGATATACAGTGTGATACCGCAATGGAGCCGACTGATCGGTGGCGTTTTCTTTAGCTTCTGGGTATTGGCTCATCTCTACCCTTTTGCCAAAGGGCTGATGGGAAGAAGAGGAAGGACACCTACCATTGTTTTTGTGTGGTCAGGGCTTATTGCTATCACCATATCTCTTCTTTGGGTAGCCATTAGTCCTCCATCAGGAACTAACCAAATTGGAGGTTCATTCACATTCCCTTAAACACTTCATTTTTTTTTTGTCTCCCAAAATTCTTTCACTTCATAAACTTGAATTAGGTACATTCTTCTGTTGTAATTCTTGCAAATTTTTACCATCTATAATAATTCACTTTTGGGTAACTTTTGGGTATCGTTGTATTTGAAAATGTGACACCCG
mRNA sequence
GGCTCTGTTTCTCCCTCCGCGTAGTAAAAGTCCACTTTTCTCTCCACGTTGTCTGCATCACAGTGAAACCTACCATTGATAGAGAAACCTCACGACTTCTTCCATGCCCTCCTTCACTACCTTCCTCTAATTAAATATTCAAACCCAAATCTTCACGAAATCCATATGGTTCGAACCCGAATCAAACTCATCTCCGCCATTACTGTTTCTTAATTCCCAAAAAACAGAGCATCACCTCTGTTTCATTGAGCCCACCTGAAGCTTCCTCCATCGGTGACGTTTCTTCTCATTTTTTGTATATAAAATTGATATGGGAATGGAATAATTCAAGGATACCCTGGAATCATGGTCGATTAGTGTTGGCAATCTTGACTATTTCTTCTTTTTCCTTCAATCATGCCCTATTAGTGCTCTTAGTCCTCCTCACGATCAATCCTCATTCTCTTTTTCTCTGCTTTCCCGTTCTCAATGGGGGAATGGTGAAGCCCTTCAGTTCATCCATTTGGTGCAACATTTTTCTGGCTATCTGTTTCGATTGTTCTAGTTTTAGGCTTTTTTGAAGTGGGTTTCTCCTAATCTATCCATTTGAGCAAGATTTTTCTTGCTATCTGTTCCGATTGTGGCTAACTGTTCTAGTTTTAGGCTTCTCTGAAGTGGGTTTCTTCTAATCCATCCATTTGGGCAAGATTTTTCTTGCTATCTGTACCGATTCTTCCAGTTTTTGGCTTCTCTGAAGTGGGTTTCTTCTAATCCATCCATTTGGGGCAAGAACTTTCCGGCAATCTGCTCCGATTGTTCCAGTTTTAGGCTTCTCTGAAGTGGGCTTCGTCTACTCTTTGCTGATAAGCTGCTGGATTTAACTTAAGAACTTGTTTTGAGTTTGACAATGGCATCAAAGTCATTCAAGCCAAACCGTTCAAATTTGTCCACAGCTTCTGATGCATCTGAAGCACAGAAGCCTCCTCTTCCACCTACTGTGACATTCGGTCGGAGAACCTCCTCCGGTCGCTATATTAGCTACTCGAGGGATGATCTCGATAGCGAGCTTGGGAGTGGTGACTTTATGAACTATACTGTACACATTCCTCCAACACCTGATAATCAACCAATGGATCCTTCAATCTCACAGAAGGTTGAAGAGCAATACGTATCGAATTCGCTGTTTACCGGTGGGTTCAATAACATAACACGAGCTCATTTAATGGACAAAGTGATTGAATCTGAAGCAACACATCCTCAAATGGCGGGTACGAAAGGATCTTCGTGTTCTATACCTGGCTGTGATGCAAAGGTTATGAGCGACGAACGTGGAAACGATATACTCCCTTGTGAATGCGATTTCAAGATATGTCGAGATTGCTATGTCGATGCTGTTAAACTAGGTGGTGGGATTTGTCCAGGCTGCAAAGAACCGTATAAAAACACAGATCTTGATGAAATTGCTGTTGAACATGGAAGACCGCTTCCGCTTCCTCCGCCAGCCACAATGTCGAAGATGGAGAGGAGGCTGTCGTTGATGAAGTCGACGAAATCTGCGTTGATGCGAAGCCACACGGGGGTTGGAGAATTTGATCATAATAAATGGCTATTCGAAACGAGAGGAACTTATGGATATGGGAATGCTATATGGCCAAAGGATGAGGGTTTTGAAAATGGTAATACTGATGAAGTTGAGCCTATGGAGTTTATGAATAAACCGTGGCGGCCCCTAACTCGAAAGTTGAAGATTCCTGCTGCTGTTCTTAGCCCGTATCGACTTTTGATCGTCGTTCGAATGGTCGTGCTCGGGTTCTTCTTGGCTTGGCGAGTGAGCCATCCGAACACCGATGCATACTGGTTGTGGGCTATGTCTATAGTTTGTGAGATTTGGTTTGCTTTTTCTTGGCTGCTTGATCAGCTGCCAAAGTTGTGCCCCATCAATAGAGCTACTGATCTTAACGTGTTGACGGAGAAATTCGAAACGCCTAGTCCGAGTAATCCTACCGGAAAATCTGATCTAC
CAGGCATAGATATCTTTGTTTCTACTGCAGATCCCGAGAAAGAACCACCTCTTGTAACTGCGAACACAATCCTTTCGATTCTAGCTGCAGATTATCCAGTTGAAAAGCTTGCTTGTTATGTTTCTGATGATGGAGGTGCGCTTTTAACTTTCGAGGCCATGGCTGAAGCTGCAAGTTTTGCTAATACTTGGGTTCCTTTCTGTCGAAAACATGGCATCGAACCGCGCAATCCTGAGTCTTATTTTAGTTTGAAAAGAGATCCATTCAAGAACAAAGTTAAGCCAGATTTTGTTAAGGATCGTAGACGTGTTAAGCGGGAGTATGACGAGTTCAAAGTTCGTATAAATGGACTTCCTGACTCTATTCGTCGTCGCTCGGATGCTTATCATGCACGAGAAGAAATCAAAGCTATGAAGCTTCAGAAACAGAACATTGGTGCTGATGAGCCGATAGAGAGTGTGAAAATCGCTAAAGCGACATGGATGGCTGATGGCACACATTGGCCAGGGACTTGGTTGCAGCCATCGTCTGAGCACTCGAAGGGTGACCATGCTGGTATCATACAGGTGATGTTGAAGCCACCTAGTGATGAACCTCTTCATGGAAATGTTGAAGATGAGAAACTTATCGACACTTCTGAGGTCGATATTCGTCTTCCTTTACTCGTTTATGTTTCTCGAGAGAAACGACCAGGCTATGACCACAACAAGAAGGCAGGAGCGATGAATGCTCTAGTTCGAGCCTCGGCAATCATGTCGAATGGTCCGTTCATTCTCAACCTCGATTGTGACCACTATATCTACAACTCTCAGGCAATGAGAGAAGGAATGTGCTTCATGATGGATCGTGGAGGCGATCGTCTTTGCTATGTCCAATTCCCTCAAAGGTTCGAGGGCATTGATCCTTCTGATCGATATGCAAATCACAACACTGTGTTTTTCGACGTTAACATGCGAGCTCTTGATGGGCTTCAAGGACCAGTTTACGTCGGAACAGGATGTCTCTTTAGAAGGGTTGCCCTATATGGTTTCGATCCACCTCGATCAAAAGAGCATCACCCTGGTTTTTGTAGTTGTTGTTGTGGCGGACGAAAAAAGCATACATCAGTTGCGAGCACACCGGAAGAGAGCAGAGCTTTGAGAATGGGTGATTCTGATGATGAAGAAATGAATCTCTCTTTGTTTCCTAAGAGATTTGGGAACTCTACTTTCCTTATTGATTCAATCCCGGTTGCTGAATTTCAAGGCCGCCCCTTGGCCGATCACCCTGCTGTGAAGAACGGACGTCCACCGGGTGCTCTTACGATCCCTCGTGATCTTCTCGATGCTTCAACAGTTGCAGAGGCAATCAGTGTCATTTCTTGCTGGTACGAAGATAAGACCGAATGGGGTAACCGTGTTGGATGGATTTATGGATCTGTTACTGAGGATGTGGTCACCGGATATAGGATGCATAATAGAGGATGGAAATCGGTGTACTGCGTAACGAAACGAGACGCTTTTCGTGGGACAGCTCCGATCAACCTAACAGATAGGCTGCATCAAGTCCTCCGATGGGCTACCGGGTCGGTCGAGATCTTCTTCTCCCGCAACAACGCCATCCTAGCTAGTCCAAGAATGAAACTTCTACAAAGAATAGCATACTTAAACGTGGGGATATATCCATTCACTTCAATCTTCCTCATAGTATATTGCTTTCTACCAGCACTGTCACTGTTCTCCGGTCAGTTCATCGTCCAAACGCTTAACGTCACGTTCCTTACATACCTTCTGGTTATCACGTTAACGTTGTGCATGCTTGCGGTGCTCGAGATCCGATGGTCTGGTATTGAATTAGAAGAGTGGTGGAGGAATGAGCAGTTCTGGTTGATTGGTGGTACAAGTGCACATCTTGCTGCTGTACTTCAGGGTCTGCTAAAAGTCGTTGCTGGGATCGAAATATCGTTCACTTTGACGTCGAAATCGGGAGGTGACGACGTAGACGACGAGTTTGCTGATCTC
TACATTGTGAAATGGACATCTCTAATGATACCACCAATCACGATCATGATAACGAACTTAATTGCAATAGCAGTCGGGTTTAGCCGAACGATATACAGTGTGATACCGCAATGGAGCCGACTGATCGGTGGCGTTTTCTTTAGCTTCTGGGTATTGGCTCATCTCTACCCTTTTGCCAAAGGGCTGATGGGAAGAAGAGGAAGGACACCTACCATTGTTTTTGTGTGGTCAGGGCTTATTGCTATCACCATATCTCTTCTTTGGGTAGCCATTAGTCCTCCATCAGGAACTAACCAAATTGGAGGTTCATTCACATTCCCTTAAACACTTCATTTTTTTTTTGTCTCCCAAAATTCTTTCACTTCATAAACTTGAATTAGGTACATTCTTCTGTTGTAATTCTTGCAAATTTTTACCATCTATAATAATTCACTTTTGGGTAACTTTTGGGTATCGTTGTATTTGAAAATGTGACACCCG
Coding sequence (CDS)
ATGGCATCAAAGTCATTCAAGCCAAACCGTTCAAATTTGTCCACAGCTTCTGATGCATCTGAAGCACAGAAGCCTCCTCTTCCACCTACTGTGACATTCGGTCGGAGAACCTCCTCCGGTCGCTATATTAGCTACTCGAGGGATGATCTCGATAGCGAGCTTGGGAGTGGTGACTTTATGAACTATACTGTACACATTCCTCCAACACCTGATAATCAACCAATGGATCCTTCAATCTCACAGAAGGTTGAAGAGCAATACGTATCGAATTCGCTGTTTACCGGTGGGTTCAATAACATAACACGAGCTCATTTAATGGACAAAGTGATTGAATCTGAAGCAACACATCCTCAAATGGCGGGTACGAAAGGATCTTCGTGTTCTATACCTGGCTGTGATGCAAAGGTTATGAGCGACGAACGTGGAAACGATATACTCCCTTGTGAATGCGATTTCAAGATATGTCGAGATTGCTATGTCGATGCTGTTAAACTAGGTGGTGGGATTTGTCCAGGCTGCAAAGAACCGTATAAAAACACAGATCTTGATGAAATTGCTGTTGAACATGGAAGACCGCTTCCGCTTCCTCCGCCAGCCACAATGTCGAAGATGGAGAGGAGGCTGTCGTTGATGAAGTCGACGAAATCTGCGTTGATGCGAAGCCACACGGGGGTTGGAGAATTTGATCATAATAAATGGCTATTCGAAACGAGAGGAACTTATGGATATGGGAATGCTATATGGCCAAAGGATGAGGGTTTTGAAAATGGTAATACTGATGAAGTTGAGCCTATGGAGTTTATGAATAAACCGTGGCGGCCCCTAACTCGAAAGTTGAAGATTCCTGCTGCTGTTCTTAGCCCGTATCGACTTTTGATCGTCGTTCGAATGGTCGTGCTCGGGTTCTTCTTGGCTTGGCGAGTGAGCCATCCGAACACCGATGCATACTGGTTGTGGGCTATGTCTATAGTTTGTGAGATTTGGTTTGCTTTTTCTTGGCTGCTTGATCAGCTGCCAAAGTTGTGCCCCATCAATAGAGCTACTGATCTTAACGTGTTGACGGAGAAATTCGAAACGCCTAGTCCGAGTAATCCTACCGGAAAATCTGATCTACCAGGCATAGATATCTTTGTTTCTACTGCAGATCCCGAGAAAGAACCACCTCTTGTAACTGCGAACACAATCCTTTCGATTCTAGCTGCAGATTATCCAGTTGAAAAGCTTGCTTGTTATGTTTCTGATGATGGAGGTGCGCTTTTAACTTTCGAGGCCATGGCTGAAGCTGCAAGTTTTGCTAATACTTGGGTTCCTTTCTGTCGAAAACATGGCATCGAACCGCGCAATCCTGAGTCTTATTTTAGTTTGAAAAGAGATCCATTCAAGAACAAAGTTAAGCCAGATTTTGTTAAGGATCGTAGACGTGTTAAGCGGGAGTATGACGAGTTCAAAGTTCGTATAAATGGACTTCCTGACTCTATTCGTCGTCGCTCGGATGCTTATCATGCACGAGAAGAAATCAAAGCTATGAAGCTTCAGAAACAGAACATTGGTGCTGATGAGCCGATAGAGAGTGTGAAAATCGCTAAAGCGACATGGATGGCTGATGGCACACATTGGCCAGGGACTTGGTTGCAGCCATCGTCTGAGCACTCGAAGGGTGACCATGCTGGTATCATACAGGTGATGTTGAAGCCACCTAGTGATGAACCTCTTCATGGAAATGTTGAAGATGAGAAACTTATCGACACTTCTGAGGTCGATATTCGTCTTCCTTTACTCGTTTATGTTTCTCGAGAGAAACGACCAGGCTATGACCACAACAAGAAGGCAGGAGCGATGAATGCTCTAGTTCGAGCCTCGGCAATCATGTCGAATGGTCCGTTCATTCTCAACCTCGATTGTGACCACTATATCTACAACTCTCAGGCAATGAGAGAAGGAATGTGCTTCATGATGGATCGTGGAGGCGATCGTCTTTGCTATGTCCAATTCCCTCAAAG
GTTCGAGGGCATTGATCCTTCTGATCGATATGCAAATCACAACACTGTGTTTTTCGACGTTAACATGCGAGCTCTTGATGGGCTTCAAGGACCAGTTTACGTCGGAACAGGATGTCTCTTTAGAAGGGTTGCCCTATATGGTTTCGATCCACCTCGATCAAAAGAGCATCACCCTGGTTTTTGTAGTTGTTGTTGTGGCGGACGAAAAAAGCATACATCAGTTGCGAGCACACCGGAAGAGAGCAGAGCTTTGAGAATGGGTGATTCTGATGATGAAGAAATGAATCTCTCTTTGTTTCCTAAGAGATTTGGGAACTCTACTTTCCTTATTGATTCAATCCCGGTTGCTGAATTTCAAGGCCGCCCCTTGGCCGATCACCCTGCTGTGAAGAACGGACGTCCACCGGGTGCTCTTACGATCCCTCGTGATCTTCTCGATGCTTCAACAGTTGCAGAGGCAATCAGTGTCATTTCTTGCTGGTACGAAGATAAGACCGAATGGGGTAACCGTGTTGGATGGATTTATGGATCTGTTACTGAGGATGTGGTCACCGGATATAGGATGCATAATAGAGGATGGAAATCGGTGTACTGCGTAACGAAACGAGACGCTTTTCGTGGGACAGCTCCGATCAACCTAACAGATAGGCTGCATCAAGTCCTCCGATGGGCTACCGGGTCGGTCGAGATCTTCTTCTCCCGCAACAACGCCATCCTAGCTAGTCCAAGAATGAAACTTCTACAAAGAATAGCATACTTAAACGTGGGGATATATCCATTCACTTCAATCTTCCTCATAGTATATTGCTTTCTACCAGCACTGTCACTGTTCTCCGGTCAGTTCATCGTCCAAACGCTTAACGTCACGTTCCTTACATACCTTCTGGTTATCACGTTAACGTTGTGCATGCTTGCGGTGCTCGAGATCCGATGGTCTGGTATTGAATTAGAAGAGTGGTGGAGGAATGAGCAGTTCTGGTTGATTGGTGGTACAAGTGCACATCTTGCTGCTGTACTTCAGGGTCTGCTAAAAGTCGTTGCTGGGATCGAAATATCGTTCACTTTGACGTCGAAATCGGGAGGTGACGACGTAGACGACGAGTTTGCTGATCTCTACATTGTGAAATGGACATCTCTAATGATACCACCAATCACGATCATGATAACGAACTTAATTGCAATAGCAGTCGGGTTTAGCCGAACGATATACAGTGTGATACCGCAATGGAGCCGACTGATCGGTGGCGTTTTCTTTAGCTTCTGGGTATTGGCTCATCTCTACCCTTTTGCCAAAGGGCTGATGGGAAGAAGAGGAAGGACACCTACCATTGTTTTTGTGTGGTCAGGGCTTATTGCTATCACCATATCTCTTCTTTGGGTAGCCATTAGTCCTCCATCAGGAACTAACCAAATTGGAGGTTCATTCACATTCCCTTAA
Protein sequence
MASKSFKPNRSNLSTASDASEAQKPPLPPTVTFGRRTSSGRYISYSRDDLDSELGSGDFMNYTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESEATHPQMAGTKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEPYKNTDLDEIAVEHGRPLPLPPPATMSKMERRLSLMKSTKSALMRSHTGVGEFDHNKWLFETRGTYGYGNAIWPKDEGFENGNTDEVEPMEFMNKPWRPLTRKLKIPAAVLSPYRLLIVVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDLNVLTEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLACYVSDDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVKDRRRVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADEPIESVKIAKATWMADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVEDEKLIDTSEVDIRLPLLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRVALYGFDPPRSKEHHPGFCSCCCGGRKKHTSVASTPEESRALRMGDSDDEEMNLSLFPKRFGNSTFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAEAISVISCWYEDKTEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHLAAVLQGLLKVVAGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLIAIAVGFSRTIYSVIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAITISLLWVAISPPSGTNQIGGSFTFP
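The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not taken from this page or from any particular library. Whitespace is stripped first because the CDS is wrapped across lines here.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Hypothetical helper: whitespace (including line breaks) is ignored,
    trailing bases that do not fill a codon are dropped, and codons with
    ambiguous bases are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGCATCAAAGTCATTCAAGCCAAACCGT"))  # MASKSFKPNR
```

Applied to the full CDS, the same routine should reproduce the protein sequence listed above, which begins MASKSFKPNR and ends at the TAA stop codon after ...SFTFP.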
Homology
BLAST of CmaCh02G004670 vs. ExPASy Swiss-Prot
Match:
Q9M9M4 (Cellulose synthase-like protein D3 OS=Arabidopsis thaliana OX=3702 GN=CSLD3 PE=1 SV=1)
HSP 1 Score: 1980.7 bits (5130), Expect = 0.0e+00
Identity = 963/1154 (83.45%), Positives = 1051/1154 (91.07%), Query Frame = 0
Query: 1 MASKS-FKPNRSNLSTASDASEAQK--PPLPPTVTFGRRTSSGRYISYSRDDLDSELGSG 60
MAS + F +RSNLST SDA+EA++ P+ +VTF RRT SGRY++YSRDDLDSELGS
Sbjct: 1 MASNNHFMNSRSNLSTNSDAAEAERHQQPVSNSVTFARRTPSGRYVNYSRDDLDSELGSV 60
Query: 61 DFMNYTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESEATHP 120
D Y+VHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFN++TRAHLM+KVI++E +HP
Sbjct: 61 DLTGYSVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNSVTRAHLMEKVIDTETSHP 120
Query: 121 QMAGTKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEPY 180
QMAG KGSSC++PGCD KVMSDERG D+LPCECDFKICRDC++DAVK GG+CPGCKEPY
Sbjct: 121 QMAGAKGSSCAVPGCDVKVMSDERGQDLLPCECDFKICRDCFMDAVKT-GGMCPGCKEPY 180
Query: 181 KNTDLDEIAVEHGRPLP-LPPPATMSKMERRLSLMKSTKSALMRSHTGVGEFDHNKWLFE 240
+NTDL + A + + P LPPPA SKM+RRLSLMKSTKS LMRS T G+FDHN+WLFE
Sbjct: 181 RNTDLADFADNNKQQRPMLPPPAGGSKMDRRLSLMKSTKSGLMRSQT--GDFDHNRWLFE 240
Query: 241 TRGTYGYGNAIWPKDEGF---ENGNTDEVEPMEFMNKPWRPLTRKLKIPAAVLSPYRLLI 300
T GTYG+GNA W KD F ++GN + P + M++PWRPLTRKL+IPAAV+SPYRLLI
Sbjct: 241 TSGTYGFGNAFWTKDGNFGSDKDGNGHGMGPQDLMSRPWRPLTRKLQIPAAVISPYRLLI 300
Query: 301 VVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDLNVL 360
++R+VVL FL WR+ H N DA WLW MS+VCE+WFA SWLLDQLPKLCPINRATDLNVL
Sbjct: 301 LIRIVVLALFLMWRIKHKNPDAIWLWGMSVVCELWFALSWLLDQLPKLCPINRATDLNVL 360
Query: 361 TEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLACYVS 420
EKFETP+PSNPTGKSDLPG+D+FVSTADPEKEPPLVT+NTILSILAADYPVEKLACYVS
Sbjct: 361 KEKFETPTPSNPTGKSDLPGLDMFVSTADPEKEPPLVTSNTILSILAADYPVEKLACYVS 420
Query: 421 DDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVKDRR 480
DDGGALLTFEAMAEAASFAN WVPFCRKH IEPRNP+SYFSLKRDP+KNKVK DFVKDRR
Sbjct: 421 DDGGALLTFEAMAEAASFANMWVPFCRKHNIEPRNPDSYFSLKRDPYKNKVKADFVKDRR 480
Query: 481 RVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADEPIESVKIAKATWM 540
RVKREYDEFKVRIN LPDSIRRRSDAYHAREEIKAMKLQ+QN +E +E VKI KATWM
Sbjct: 481 RVKREYDEFKVRINSLPDSIRRRSDAYHAREEIKAMKLQRQN-RDEEIVEPVKIPKATWM 540
Query: 541 ADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVEDEKLIDTSEVDIRLPLL 600
ADGTHWPGTW+ +HS+ DHAGIIQVMLKPPSDEPLHG E +D ++VDIRLPLL
Sbjct: 541 ADGTHWPGTWINSGPDHSRSDHAGIIQVMLKPPSDEPLHG--VSEGFLDLTDVDIRLPLL 600
Query: 601 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMCFMMD 660
VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQA+REGMCFMMD
Sbjct: 601 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQALREGMCFMMD 660
Query: 661 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRVALY 720
RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGL GPVYVGTGCLFRR+ALY
Sbjct: 661 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLMGPVYVGTGCLFRRIALY 720
Query: 721 GFDPPRSKEHHPGFCSCCCGGRKKHTSVASTPEESRALRMG--DSDDEEMNLSLFPKRFG 780
GFDPPR+KEHHPGFCSCC +KK + V PEE+R+LRMG DDEEMNLSL PK+FG
Sbjct: 721 GFDPPRAKEHHPGFCSCCFSRKKKKSRV---PEENRSLRMGGDSDDDEEMNLSLVPKKFG 780
Query: 781 NSTFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAEAISVISCWYEDK 840
NSTFLIDSIPVAEFQGRPLADHPAV+NGRPPGALTIPR+LLDASTVAEAI+VISCWYEDK
Sbjct: 781 NSTFLIDSIPVAEFQGRPLADHPAVQNGRPPGALTIPRELLDASTVAEAIAVISCWYEDK 840
Query: 841 TEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWA 900
TEWG+R+GWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWA
Sbjct: 841 TEWGSRIGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWA 900
Query: 901 TGSVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQ 960
TGSVEIFFSRNNA ASPRMK+LQRIAYLNVGIYPFTS FLIVYCFLPALSLFSGQFIVQ
Sbjct: 901 TGSVEIFFSRNNAFFASPRMKILQRIAYLNVGIYPFTSFFLIVYCFLPALSLFSGQFIVQ 960
Query: 961 TLNVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHLAAVLQGLLK 1020
TLNVTFL YLL+I++TLC+LA+LEI+WSGI LEEWWRNEQFWLIGGTSAHLAAV+QGLLK
Sbjct: 961 TLNVTFLVYLLIISITLCLLALLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVIQGLLK 1020
Query: 1021 VVAGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLIAIAVGFSRTIYS 1080
VVAGIEISFTLTSKSGG+DVDDEFADLYIVKWTSLMIPPITIM+ NLIAIAVGFSRTIYS
Sbjct: 1021 VVAGIEISFTLTSKSGGEDVDDEFADLYIVKWTSLMIPPITIMMVNLIAIAVGFSRTIYS 1080
Query: 1081 VIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAITISLLWVAISP 1140
VIPQWS+LIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIV+VWSGL+AITISLLWVAI+P
Sbjct: 1081 VIPQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWVAINP 1140
Query: 1141 PSGTNQIGGSFTFP 1146
P+G+ QIGGSFTFP
Sbjct: 1141 PAGSTQIGGSFTFP 1145
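The Identity and Positives percentages in a hit summary are ratios over the alignment length, and the identity count can be recovered from the aligned Query/Sbjct rows. The sketch below illustrates that arithmetic; `percent` and `count_identities` are hypothetical helper names, not BLAST functions. Counting positives would additionally require the scoring matrix used for the search, which this page does not state, so it is not attempted here.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage, to two decimals."""
    return round(100.0 * count / alignment_length, 2)


def count_identities(query: str, sbjct: str) -> int:
    """Count aligned columns in which both rows carry the same residue.

    Both strings must be rows of the same alignment block (equal length);
    gap characters ('-') never count as identities.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


# Summary line of the first hit: 963 identities and 1051 positives
# over an alignment of 1154 columns.
print(percent(963, 1154))   # 83.45
print(percent(1051, 1154))  # 91.07

# Last alignment block of the first hit (query 1141-1146).
print(count_identities("PSGTNQIGGSFTFP", "PAGSTQIGGSFTFP"))  # 11
```

Summing `count_identities` over every block of a hit should give the numerator reported on its Identity line.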
BLAST of CmaCh02G004670 vs. ExPASy Swiss-Prot
Match:
Q9LFL0 (Cellulose synthase-like protein D2 OS=Arabidopsis thaliana OX=3702 GN=CSLD2 PE=3 SV=1)
HSP 1 Score: 1956.0 bits (5066), Expect = 0.0e+00
Identity = 948/1152 (82.29%), Positives = 1036/1152 (89.93%), Query Frame = 0
Query: 2 ASKSFKPNRSNLSTASDASEAQKPPLPPTVTFGRRTSSGRYISYSRDDLDSELGSGDFMN 61
++K F +RSNLS SD E +PP +V F +RTSSGRYI+YSRDDLDSELG DFM+
Sbjct: 3 SNKHFDKSRSNLSNNSDIQEPGRPPAGHSVKFAQRTSSGRYINYSRDDLDSELGGQDFMS 62
Query: 62 YTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESEATHPQMAG 121
YTVHIPPTPDNQPMDPSISQKVEEQYV+NS+FTGGF + TRAHLM KVIE+E HPQMAG
Sbjct: 63 YTVHIPPTPDNQPMDPSISQKVEEQYVANSMFTGGFKSNTRAHLMHKVIETEPNHPQMAG 122
Query: 122 TKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEPYKNTD 181
+KGSSC+IPGCDAKVMSDERG D+LPCECDFKICRDC++DAVK GGGICPGCKEPYKNT
Sbjct: 123 SKGSSCAIPGCDAKVMSDERGQDLLPCECDFKICRDCFIDAVKTGGGICPGCKEPYKNTH 182
Query: 182 LDEIAVEHGRPLPLPPPATMSKMERRLSLMKST-KSALMRSHTGVGEFDHNKWLFETRGT 241
L + E+G+ P+ P SKMERRLS++KST KSALMRS T G+FDHN+WLFET GT
Sbjct: 183 LTDQVDENGQQRPMLPGGGGSKMERRLSMVKSTNKSALMRSQT--GDFDHNRWLFETTGT 242
Query: 242 YGYGNAIWPKDEGFENGNTDE-------VEPMEFMNKPWRPLTRKLKIPAAVLSPYRLLI 301
YGYGNA W KD F +G + +E + M++PWRPLTRKLKIPA V+SPYRLLI
Sbjct: 243 YGYGNAFWTKDGDFGSGKDGDGDGDGMGMEAQDLMSRPWRPLTRKLKIPAGVISPYRLLI 302
Query: 302 VVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDLNVL 361
+R+VVL FL WRV H N DA WLW MS+VCE+WFA SWLLDQLPKLCPINRATDL VL
Sbjct: 303 FIRIVVLALFLTWRVKHQNPDAVWLWGMSVVCELWFALSWLLDQLPKLCPINRATDLQVL 362
Query: 362 TEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLACYVS 421
EKFETP+ SNPTGKSDLPG D+FVSTADPEKEPPLVTANTILSILAA+YPVEKL+CYVS
Sbjct: 363 KEKFETPTASNPTGKSDLPGFDVFVSTADPEKEPPLVTANTILSILAAEYPVEKLSCYVS 422
Query: 422 DDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVKDRR 481
DDGGALLTFEAMAEAASFAN WVPFCRKH IEPRNP+SYFSLKRDP+KNKVK DFVKDRR
Sbjct: 423 DDGGALLTFEAMAEAASFANIWVPFCRKHAIEPRNPDSYFSLKRDPYKNKVKSDFVKDRR 482
Query: 482 RVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADEPIESVKIAKATWM 541
RVKRE+DEFKVR+N LPDSIRRRSDAYHAREEIKAMK+Q+QN DEP+E VKI KATWM
Sbjct: 483 RVKREFDEFKVRVNSLPDSIRRRSDAYHAREEIKAMKMQRQN-RDDEPMEPVKIPKATWM 542
Query: 542 ADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVEDEKLIDTSEVDIRLPLL 601
ADGTHWPGTWL +S+H+KGDHAGIIQVMLKPPSDEPLHG E +D ++VDIRLPLL
Sbjct: 543 ADGTHWPGTWLTSASDHAKGDHAGIIQVMLKPPSDEPLHG--VSEGFLDLTDVDIRLPLL 602
Query: 602 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMCFMMD 661
VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNS+A+REGMCFMMD
Sbjct: 603 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSEALREGMCFMMD 662
Query: 662 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRVALY 721
RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGL GPVYVGTGCLFRR+ALY
Sbjct: 663 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLMGPVYVGTGCLFRRIALY 722
Query: 722 GFDPPRSKEHHPGFCSCCCGGRKKHTSVASTPEESRALRMGDSDDEEMNLSLFPKRFGNS 781
GF+PPRSK+ P SCC KK + PEE+RALRM D DDEEMNLSL PK+FGNS
Sbjct: 723 GFNPPRSKDFSPSCWSCCFPRSKK----KNIPEENRALRMSDYDDEEMNLSLVPKKFGNS 782
Query: 782 TFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAEAISVISCWYEDKTE 841
TFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPR+LLDASTVAEAI+VISCWYEDKTE
Sbjct: 783 TFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRELLDASTVAEAIAVISCWYEDKTE 842
Query: 842 WGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATG 901
WG+R+GWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATG
Sbjct: 843 WGSRIGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATG 902
Query: 902 SVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTL 961
SVEIFFSRNNA+LAS +MK+LQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTL
Sbjct: 903 SVEIFFSRNNALLASSKMKILQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTL 962
Query: 962 NVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHLAAVLQGLLKVV 1021
NVTFL YLL+I++TLC+LA+LEI+WSGI LEEWWRNEQFWLIGGTSAHLAAVLQGLLKVV
Sbjct: 963 NVTFLVYLLIISITLCLLALLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVLQGLLKVV 1022
Query: 1022 AGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLIAIAVGFSRTIYSVI 1081
AG+EISFTLTSKSGGDD+DDEFADLY+VKWTSLMIPPITI++ NLIAIAVGFSRTIYSV+
Sbjct: 1023 AGVEISFTLTSKSGGDDIDDEFADLYMVKWTSLMIPPITIIMVNLIAIAVGFSRTIYSVV 1082
Query: 1082 PQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAITISLLWVAISPPS 1141
PQWS+LIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIV+VWSGL+AITISLLWVAI+PP+
Sbjct: 1083 PQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWVAINPPA 1142
Query: 1142 GTNQIGGSFTFP 1146
G +IGG+F+FP
Sbjct: 1143 GNTEIGGNFSFP 1145
BLAST of CmaCh02G004670 vs. ExPASy Swiss-Prot
Match:
A2YU42 (Cellulose synthase-like protein D2 OS=Oryza sativa subsp. indica OX=39946 GN=CSLD2 PE=3 SV=1)
HSP 1 Score: 1909.0 bits (4944), Expect = 0.0e+00
Identity = 940/1166 (80.62%), Positives = 1026/1166 (87.99%), Query Frame = 0
Query: 9 NRSNLSTASDASEAQKPPLP------PTVTFGRRTSSGRYISYSRDDLDSELG-SGD--- 68
N S LS S + E + P P VTF RRT SGRY+SYSRDDLDSELG SGD
Sbjct: 13 NSSRLSRMSYSGEDGRSQAPGGGGDRPMVTFARRTHSGRYVSYSRDDLDSELGNSGDMSP 72
Query: 69 -----FMNYTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESE 128
F+NY V IP TPDNQPMDP+IS +VEEQYVSNSLFTGGFN++TRAHLMDKVIESE
Sbjct: 73 ESGQEFLNYHVTIPATPDNQPMDPAISARVEEQYVSNSLFTGGFNSVTRAHLMDKVIESE 132
Query: 129 ATHPQMAGTKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGC 188
A+HPQMAG KGSSC+I GCDAKVMSDERG+DILPCECDFKIC DC+ DAVK GG CPGC
Sbjct: 133 ASHPQMAGAKGSSCAINGCDAKVMSDERGDDILPCECDFKICADCFADAVK-NGGACPGC 192
Query: 189 KEPYKNTDLDEIAVEHGRP-LPLPPP---ATMSKMERRLSLMKSTKSALMRSHTGVGEFD 248
K+PYK T+LD++ RP L LPPP S+MERRLS+M+S K A+ RS T G++D
Sbjct: 193 KDPYKATELDDVV--GARPTLSLPPPPGGLPASRMERRLSIMRSQK-AMTRSQT--GDWD 252
Query: 249 HNKWLFETRGTYGYGNAIWPKDEGFENG---------NTDEVEPMEFMNKPWRPLTRKLK 308
HN+WLFET+GTYGYGNAIWPK+ +NG + +P EF +KPWRPLTRKLK
Sbjct: 253 HNRWLFETKGTYGYGNAIWPKENEVDNGGGGGGGGGLGGGDGQPAEFTSKPWRPLTRKLK 312
Query: 309 IPAAVLSPYRLLIVVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPK 368
IPA VLSPYRLLI++RM VLG FLAWR+ H N DA WLW MS+VCE+WF SWLLDQLPK
Sbjct: 313 IPAGVLSPYRLLILIRMAVLGLFLAWRIKHKNEDAMWLWGMSVVCELWFGLSWLLDQLPK 372
Query: 369 LCPINRATDLNVLTEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILA 428
LCP+NRATDL VL +KFETP+PSNP G+SDLPG+DIFVSTADPEKEPPLVTANTILSILA
Sbjct: 373 LCPVNRATDLAVLKDKFETPTPSNPNGRSDLPGLDIFVSTADPEKEPPLVTANTILSILA 432
Query: 429 ADYPVEKLACYVSDDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPF 488
ADYPVEKL+CYVSDDGGALLTFEAMAEAASFAN WVPFCRKH IEPRNPESYF+LKRDP+
Sbjct: 433 ADYPVEKLSCYVSDDGGALLTFEAMAEAASFANMWVPFCRKHDIEPRNPESYFNLKRDPY 492
Query: 489 KNKVKPDFVKDRRRVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADE 548
KNKV+ DFVKDRRRVKREYDEFKVRIN LPDSIRRRSDAYHAREEIKAMK Q++ D+
Sbjct: 493 KNKVRSDFVKDRRRVKREYDEFKVRINSLPDSIRRRSDAYHAREEIKAMKRQRE-AALDD 552
Query: 549 PIESVKIAKATWMADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVEDE-K 608
+E+VKI KATWMADGTHWPGTW+QPS+EH++GDHAGIIQVMLKPPSD+PL+G +E +
Sbjct: 553 VVEAVKIPKATWMADGTHWPGTWIQPSAEHARGDHAGIIQVMLKPPSDDPLYGTSSEEGR 612
Query: 609 LIDTSEVDIRLPLLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIY 668
+D +EVDIRLP+LVYVSREKRPGYDHNKKAGAMNALVR+SA+MSNGPFILNLDCDHY+Y
Sbjct: 613 PLDFTEVDIRLPMLVYVSREKRPGYDHNKKAGAMNALVRSSAVMSNGPFILNLDCDHYVY 672
Query: 669 NSQAMREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPV 728
NSQA REGMCFMMDRGGDR+ YVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDG+ GPV
Sbjct: 673 NSQAFREGMCFMMDRGGDRIGYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGIMGPV 732
Query: 729 YVGTGCLFRRVALYGFDPPRSKEHHPGFCSCCCGGRKKHTSVASTPEESRALRMGDSDDE 788
YVGTGCLFRR+ALYGFDPPRSKE H G CSCC R+K + EE +ALRM D DDE
Sbjct: 733 YVGTGCLFRRIALYGFDPPRSKE-HSGCCSCCFPQRRKVKTSTVASEERQALRMADFDDE 792
Query: 789 EMNLSLFPKRFGNSTFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAE 848
EMN+S FPK+FGNS FLI+SIP+AEFQGRPLADHP VKNGRPPGALT+PRDLLDASTVAE
Sbjct: 793 EMNMSQFPKKFGNSNFLINSIPIAEFQGRPLADHPGVKNGRPPGALTVPRDLLDASTVAE 852
Query: 849 AISVISCWYEDKTEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPIN 908
AISVISCWYEDKTEWG RVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPIN
Sbjct: 853 AISVISCWYEDKTEWGQRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPIN 912
Query: 909 LTDRLHQVLRWATGSVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLP 968
LTDRLHQVLRWATGSVEIFFSRNNA+LAS +MK LQRIAYLNVGIYPFTSIFLIVYCFLP
Sbjct: 913 LTDRLHQVLRWATGSVEIFFSRNNALLASRKMKFLQRIAYLNVGIYPFTSIFLIVYCFLP 972
Query: 969 ALSLFSGQFIVQTLNVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTS 1028
ALSLFSGQFIV+TLNVTFLTYLLVITLT+CMLAVLEI+WSGI LEEWWRNEQFWLIGGTS
Sbjct: 973 ALSLFSGQFIVRTLNVTFLTYLLVITLTMCMLAVLEIKWSGISLEEWWRNEQFWLIGGTS 1032
Query: 1029 AHLAAVLQGLLKVVAGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLI 1088
AHLAAVLQGLLKV+AGIEISFTLTSKSGGD+ DDEFADLYIVKWTSLMIPPI IM+ NLI
Sbjct: 1033 AHLAAVLQGLLKVIAGIEISFTLTSKSGGDEADDEFADLYIVKWTSLMIPPIVIMMVNLI 1092
Query: 1089 AIAVGFSRTIYSVIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIA 1146
AIAVGFSRTIYS IPQWS+L+GGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGL+A
Sbjct: 1093 AIAVGFSRTIYSEIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLLA 1152
BLAST of CmaCh02G004670 vs. ExPASy Swiss-Prot
Match:
Q9LHZ7 (Cellulose synthase-like protein D2 OS=Oryza sativa subsp. japonica OX=39947 GN=CSLD2 PE=2 SV=1)
HSP 1 Score: 1908.3 bits (4942), Expect = 0.0e+00
Identity = 940/1166 (80.62%), Positives = 1027/1166 (88.08%), Query Frame = 0
Query: 9 NRSNLSTASDASEAQKPPLP------PTVTFGRRTSSGRYISYSRDDLDSELG-SGD--- 68
N S LS S + E + P P VTF RRT SGRY+SYSRDDLDSELG SGD
Sbjct: 13 NSSRLSRMSYSGEDGRAQAPGGGGDRPMVTFARRTHSGRYVSYSRDDLDSELGNSGDMSP 72
Query: 69 -----FMNYTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESE 128
F+NY V IP TPDNQPMDP+IS +VEEQYVSNSLFTGGFN++TRAHLMDKVIESE
Sbjct: 73 ESGQEFLNYHVTIPATPDNQPMDPAISARVEEQYVSNSLFTGGFNSVTRAHLMDKVIESE 132
Query: 129 ATHPQMAGTKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGC 188
A+HPQMAG KGSSC+I GCDAKVMSDERG+DILPCECDFKIC DC+ DAVK GG CPGC
Sbjct: 133 ASHPQMAGAKGSSCAINGCDAKVMSDERGDDILPCECDFKICADCFADAVK-NGGACPGC 192
Query: 189 KEPYKNTDLDEIAVEHGRP-LPLPPP---ATMSKMERRLSLMKSTKSALMRSHTGVGEFD 248
K+PYK T+LD++ RP L LPPP S+MERRLS+M+S K A+ RS T G++D
Sbjct: 193 KDPYKATELDDVV--GARPTLSLPPPPGGLPASRMERRLSIMRSQK-AMTRSQT--GDWD 252
Query: 249 HNKWLFETRGTYGYGNAIWPKDEGFENG---------NTDEVEPMEFMNKPWRPLTRKLK 308
HN+WLFET+GTYGYGNAIWPK+ +NG + +P EF +KPWRPLTRKLK
Sbjct: 253 HNRWLFETKGTYGYGNAIWPKENEVDNGGGGGGGGGLGGGDGQPAEFTSKPWRPLTRKLK 312
Query: 309 IPAAVLSPYRLLIVVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPK 368
IPA VLSPYRLLI++RM VLG FLAWR+ H N DA WLW MS+VCE+WF SWLLDQLPK
Sbjct: 313 IPAGVLSPYRLLILIRMAVLGLFLAWRIKHKNEDAMWLWGMSVVCELWFGLSWLLDQLPK 372
Query: 369 LCPINRATDLNVLTEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILA 428
LCP+NRATDL VL +KFETP+PSNP G+SDLPG+DIFVSTADPEKEPPLVTANTILSILA
Sbjct: 373 LCPVNRATDLAVLKDKFETPTPSNPNGRSDLPGLDIFVSTADPEKEPPLVTANTILSILA 432
Query: 429 ADYPVEKLACYVSDDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPF 488
ADYPVEKL+CYVSDDGGALLTFEAMAEAASFAN WVPFCRKH IEPRNPESYF+LKRDP+
Sbjct: 433 ADYPVEKLSCYVSDDGGALLTFEAMAEAASFANMWVPFCRKHDIEPRNPESYFNLKRDPY 492
Query: 489 KNKVKPDFVKDRRRVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADE 548
KNKV+ DFVKDRRRVKREYDEFKVRIN LPDSIRRRSDAYHAREEIKAMK Q++ D+
Sbjct: 493 KNKVRSDFVKDRRRVKREYDEFKVRINSLPDSIRRRSDAYHAREEIKAMKRQRE-AALDD 552
Query: 549 PIESVKIAKATWMADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHG-NVEDEK 608
+E+VKI KATWMADGTHWPGTW+QPS+EH++GDHAGIIQVMLKPPSD+PL+G + E+ +
Sbjct: 553 VVEAVKIPKATWMADGTHWPGTWIQPSAEHARGDHAGIIQVMLKPPSDDPLYGTSGEEGR 612
Query: 609 LIDTSEVDIRLPLLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIY 668
+D +EVDIRLP+LVYVSREKRPGYDHNKKAGAMNALVR+SA+MSNGPFILNLDCDHY+Y
Sbjct: 613 PLDFTEVDIRLPMLVYVSREKRPGYDHNKKAGAMNALVRSSAVMSNGPFILNLDCDHYVY 672
Query: 669 NSQAMREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPV 728
NSQA REGMCFMMDRGGDR+ YVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDG+ GPV
Sbjct: 673 NSQAFREGMCFMMDRGGDRIGYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGIMGPV 732
Query: 729 YVGTGCLFRRVALYGFDPPRSKEHHPGFCSCCCGGRKKHTSVASTPEESRALRMGDSDDE 788
YVGTGCLFRR+ALYGFDPPRSKE H G CSCC R+K + EE +ALRM D DDE
Sbjct: 733 YVGTGCLFRRIALYGFDPPRSKE-HSGCCSCCFPQRRKVKTSTVASEERQALRMADFDDE 792
Query: 789 EMNLSLFPKRFGNSTFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAE 848
EMN+S FPK+FGNS FLI+SIP+AEFQGRPLADHP VKNGRPPGALT+PRDLLDASTVAE
Sbjct: 793 EMNMSQFPKKFGNSNFLINSIPIAEFQGRPLADHPGVKNGRPPGALTVPRDLLDASTVAE 852
Query: 849 AISVISCWYEDKTEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPIN 908
AISVISCWYEDKTEWG RVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPIN
Sbjct: 853 AISVISCWYEDKTEWGQRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPIN 912
Query: 909 LTDRLHQVLRWATGSVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLP 968
LTDRLHQVLRWATGSVEIFFSRNNA+LAS +MK LQRIAYLNVGIYPFTSIFLIVYCFLP
Sbjct: 913 LTDRLHQVLRWATGSVEIFFSRNNALLASRKMKFLQRIAYLNVGIYPFTSIFLIVYCFLP 972
Query: 969 ALSLFSGQFIVQTLNVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTS 1028
ALSLFSGQFIV+TLNVTFLTYLLVITLT+CMLAVLEI+WSGI LEEWWRNEQFWLIGGTS
Sbjct: 973 ALSLFSGQFIVRTLNVTFLTYLLVITLTMCMLAVLEIKWSGISLEEWWRNEQFWLIGGTS 1032
Query: 1029 AHLAAVLQGLLKVVAGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLI 1088
AHLAAVLQGLLKV+AGIEISFTLTSKSGGD+ DDEFADLYIVKWTSLMIPPI IM+ NLI
Sbjct: 1033 AHLAAVLQGLLKVIAGIEISFTLTSKSGGDEADDEFADLYIVKWTSLMIPPIVIMMVNLI 1092
Query: 1089 AIAVGFSRTIYSVIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIA 1146
AIAVGFSRTIYS IPQWS+L+GGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGL+A
Sbjct: 1093 AIAVGFSRTIYSEIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLLA 1152
BLAST of CmaCh02G004670 vs. ExPASy Swiss-Prot
Match:
A2ZAK8 (Cellulose synthase-like protein D1 OS=Oryza sativa subsp. indica OX=39946 GN=CSLD1 PE=3 SV=2)
HSP 1 Score: 1785.4 bits (4623), Expect = 0.0e+00
Identity = 883/1160 (76.12%), Positives = 981/1160 (84.57%), Query Frame = 0
Query: 1 MASKSFKPNRSNLSTASDASEAQKPPLPPTVTFGRRTSSGRYISYSRDDLDSELGSGDFM 60
MASK N TA ++ PTV FGRRT SGR+ISYSRDDLDSE+ S DF
Sbjct: 1 MASKGILKNGGKPPTAPSSA-------APTVVFGRRTDSGRFISYSRDDLDSEISSVDFQ 60
Query: 61 NYTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESEATHPQMA 120
+Y VHIP TPDNQPMDP+ E+QYVS+SLFTGGFN++TRAH+M+K S A
Sbjct: 61 DYHVHIPMTPDNQPMDPAAGD--EQQYVSSSLFTGGFNSVTRAHVMEKQASS-------A 120
Query: 121 GTKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEPYKNT 180
S+C + GC +K+M + RG DILPCECDFKIC DC+ DAVK GGG+CPGCKEPYK+
Sbjct: 121 RATVSACMVQGCGSKIMRNGRGADILPCECDFKICVDCFTDAVKGGGGVCPGCKEPYKHA 180
Query: 181 DLDEI--AVEH---GRPLPLP-PPATMSKMERRLSLMKSTKSALMRSHTGVGEFDHNKWL 240
+ +E+ A H R L LP KMERRLSL+K A GEFDHN+WL
Sbjct: 181 EWEEVVSASNHDAINRALSLPHGHGHGPKMERRLSLVKQNGGA-------PGEFDHNRWL 240
Query: 241 FETRGTYGYGNAIWPKDEGFENGNTDEVEPMEFMNKPWRPLTRKLKIPAAVLSPYRLLIV 300
FET+GTYGYGNAIWP+D+G P E M+KPWRPLTRKL+I AAV+SPYRLL++
Sbjct: 241 FETKGTYGYGNAIWPEDDGVAG------HPKELMSKPWRPLTRKLRIQAAVISPYRLLVL 300
Query: 301 VRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDLNVLT 360
+R+V LG FL WR+ H N DA WLW MSIVCE+WFA SW+LDQLPKLCPINRATDL+VL
Sbjct: 301 IRLVALGLFLMWRIKHQNEDAIWLWGMSIVCELWFALSWVLDQLPKLCPINRATDLSVLK 360
Query: 361 EKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLACYVSD 420
+KFETP+PSNPTGKSDLPGIDIFVSTADPEKEP LVTANTILSILAADYPV+KLACYVSD
Sbjct: 361 DKFETPTPSNPTGKSDLPGIDIFVSTADPEKEPVLVTANTILSILAADYPVDKLACYVSD 420
Query: 421 DGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVKDRRR 480
DGGALLTFEAMAEAASFAN WVPFCRKH IEPRNP+SYF+LKRDPFKNKVK DFVKDRRR
Sbjct: 421 DGGALLTFEAMAEAASFANLWVPFCRKHEIEPRNPDSYFNLKRDPFKNKVKGDFVKDRRR 480
Query: 481 VKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNI---GADEPIESVKIAKAT 540
VKREYDEFKVR+NGLPD+IRRRSDAYHAREEI+AM LQ++ + G ++ +E +KI KAT
Sbjct: 481 VKREYDEFKVRVNGLPDAIRRRSDAYHAREEIQAMNLQREKMKAGGDEQQLEPIKIPKAT 540
Query: 541 WMADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVEDEKLIDTSEVDIRLP 600
WMADGTHWPGTWLQ S EH++GDHAGIIQVMLKPPS P + EK +D S VD RLP
Sbjct: 541 WMADGTHWPGTWLQASPEHARGDHAGIIQVMLKPPSPSPSSSGGDMEKRVDLSGVDTRLP 600
Query: 601 LLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMCFM 660
+LVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHY+YNS+A REGMCFM
Sbjct: 601 MLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYVYNSKAFREGMCFM 660
Query: 661 MDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRVA 720
MDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRR+A
Sbjct: 661 MDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRIA 720
Query: 721 LYGFDPPRSKEHHPGFCSCCCGGRKKHTSVASTP----EESRALRMGDSDDEEMNLSLFP 780
LYGFDPPRSK+H + CC R++ T P EE+ ALRM D MN++ FP
Sbjct: 721 LYGFDPPRSKDHTTPW--SCCLPRRRRTRSQPQPQEEEEETMALRM--DMDGAMNMASFP 780
Query: 781 KRFGNSTFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAEAISVISCW 840
K+FGNS+FLIDSIPVAEFQGRPLADHP+VKNGRPPGALTIPR+ LDAS VAEAISV+SCW
Sbjct: 781 KKFGNSSFLIDSIPVAEFQGRPLADHPSVKNGRPPGALTIPRETLDASIVAEAISVVSCW 840
Query: 841 YEDKTEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQV 900
YE+KTEWG RVGWIYGSVTEDVVTGYRMHNRGWKSVYCVT RDAFRGTAPINLTDRLHQV
Sbjct: 841 YEEKTEWGTRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTHRDAFRGTAPINLTDRLHQV 900
Query: 901 LRWATGSVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQ 960
LRWATGSVEIFFSRNNA+ AS +MK+LQRIAYLNVGIYPFTS+FLIVYCFLPALSLFSGQ
Sbjct: 901 LRWATGSVEIFFSRNNALFASSKMKVLQRIAYLNVGIYPFTSVFLIVYCFLPALSLFSGQ 960
Query: 961 FIVQTLNVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHLAAVLQ 1020
FIVQTLNVTFLTYLL+IT+TLC+LA+LEI+WSGI LEEWWRNEQFWLIGGTSAHLAAVLQ
Sbjct: 961 FIVQTLNVTFLTYLLIITITLCLLAMLEIKWSGIALEEWWRNEQFWLIGGTSAHLAAVLQ 1020
Query: 1021 GLLKVVAGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLIAIAVGFSR 1080
GLLKV+AGIEISFTLTSK GDDVDDEFA+LY VKWTSLMIPP+TI++ NL+AIAVGFSR
Sbjct: 1021 GLLKVIAGIEISFTLTSKQLGDDVDDEFAELYAVKWTSLMIPPLTIIMINLVAIAVGFSR 1080
Query: 1081 TIYSVIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAITISLLWV 1140
TIYS IPQWS+L+GGVFFSFWVLAHLYPFAKGLMGRRGRTPTIV+VWSGL+AITISLLW+
Sbjct: 1081 TIYSTIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWI 1127
Query: 1141 AISPPS--GTNQIGGSFTFP 1146
AI PPS +Q+GGSF+FP
Sbjct: 1141 AIKPPSAQANSQLGGSFSFP 1127
BLAST of CmaCh02G004670 vs. TAIR 10
Match:
AT3G03050.1 (cellulose synthase-like D3 )
HSP 1 Score: 1980.7 bits (5130), Expect = 0.0e+00
Identity = 963/1154 (83.45%), Positives = 1051/1154 (91.07%), Query Frame = 0
Query: 1 MASKS-FKPNRSNLSTASDASEAQK--PPLPPTVTFGRRTSSGRYISYSRDDLDSELGSG 60
MAS + F +RSNLST SDA+EA++ P+ +VTF RRT SGRY++YSRDDLDSELGS
Sbjct: 1 MASNNHFMNSRSNLSTNSDAAEAERHQQPVSNSVTFARRTPSGRYVNYSRDDLDSELGSV 60
Query: 61 DFMNYTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESEATHP 120
D Y+VHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFN++TRAHLM+KVI++E +HP
Sbjct: 61 DLTGYSVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNSVTRAHLMEKVIDTETSHP 120
Query: 121 QMAGTKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEPY 180
QMAG KGSSC++PGCD KVMSDERG D+LPCECDFKICRDC++DAVK GG+CPGCKEPY
Sbjct: 121 QMAGAKGSSCAVPGCDVKVMSDERGQDLLPCECDFKICRDCFMDAVKT-GGMCPGCKEPY 180
Query: 181 KNTDLDEIAVEHGRPLP-LPPPATMSKMERRLSLMKSTKSALMRSHTGVGEFDHNKWLFE 240
+NTDL + A + + P LPPPA SKM+RRLSLMKSTKS LMRS T G+FDHN+WLFE
Sbjct: 181 RNTDLADFADNNKQQRPMLPPPAGGSKMDRRLSLMKSTKSGLMRSQT--GDFDHNRWLFE 240
Query: 241 TRGTYGYGNAIWPKDEGF---ENGNTDEVEPMEFMNKPWRPLTRKLKIPAAVLSPYRLLI 300
T GTYG+GNA W KD F ++GN + P + M++PWRPLTRKL+IPAAV+SPYRLLI
Sbjct: 241 TSGTYGFGNAFWTKDGNFGSDKDGNGHGMGPQDLMSRPWRPLTRKLQIPAAVISPYRLLI 300
Query: 301 VVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDLNVL 360
++R+VVL FL WR+ H N DA WLW MS+VCE+WFA SWLLDQLPKLCPINRATDLNVL
Sbjct: 301 LIRIVVLALFLMWRIKHKNPDAIWLWGMSVVCELWFALSWLLDQLPKLCPINRATDLNVL 360
Query: 361 TEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLACYVS 420
EKFETP+PSNPTGKSDLPG+D+FVSTADPEKEPPLVT+NTILSILAADYPVEKLACYVS
Sbjct: 361 KEKFETPTPSNPTGKSDLPGLDMFVSTADPEKEPPLVTSNTILSILAADYPVEKLACYVS 420
Query: 421 DDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVKDRR 480
DDGGALLTFEAMAEAASFAN WVPFCRKH IEPRNP+SYFSLKRDP+KNKVK DFVKDRR
Sbjct: 421 DDGGALLTFEAMAEAASFANMWVPFCRKHNIEPRNPDSYFSLKRDPYKNKVKADFVKDRR 480
Query: 481 RVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADEPIESVKIAKATWM 540
RVKREYDEFKVRIN LPDSIRRRSDAYHAREEIKAMKLQ+QN +E +E VKI KATWM
Sbjct: 481 RVKREYDEFKVRINSLPDSIRRRSDAYHAREEIKAMKLQRQN-RDEEIVEPVKIPKATWM 540
Query: 541 ADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVEDEKLIDTSEVDIRLPLL 600
ADGTHWPGTW+ +HS+ DHAGIIQVMLKPPSDEPLHG E +D ++VDIRLPLL
Sbjct: 541 ADGTHWPGTWINSGPDHSRSDHAGIIQVMLKPPSDEPLHG--VSEGFLDLTDVDIRLPLL 600
Query: 601 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMCFMMD 660
VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQA+REGMCFMMD
Sbjct: 601 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQALREGMCFMMD 660
Query: 661 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRVALY 720
RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGL GPVYVGTGCLFRR+ALY
Sbjct: 661 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLMGPVYVGTGCLFRRIALY 720
Query: 721 GFDPPRSKEHHPGFCSCCCGGRKKHTSVASTPEESRALRMG--DSDDEEMNLSLFPKRFG 780
GFDPPR+KEHHPGFCSCC +KK + V PEE+R+LRMG DDEEMNLSL PK+FG
Sbjct: 721 GFDPPRAKEHHPGFCSCCFSRKKKKSRV---PEENRSLRMGGDSDDDEEMNLSLVPKKFG 780
Query: 781 NSTFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAEAISVISCWYEDK 840
NSTFLIDSIPVAEFQGRPLADHPAV+NGRPPGALTIPR+LLDASTVAEAI+VISCWYEDK
Sbjct: 781 NSTFLIDSIPVAEFQGRPLADHPAVQNGRPPGALTIPRELLDASTVAEAIAVISCWYEDK 840
Query: 841 TEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWA 900
TEWG+R+GWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWA
Sbjct: 841 TEWGSRIGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWA 900
Query: 901 TGSVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQ 960
TGSVEIFFSRNNA ASPRMK+LQRIAYLNVGIYPFTS FLIVYCFLPALSLFSGQFIVQ
Sbjct: 901 TGSVEIFFSRNNAFFASPRMKILQRIAYLNVGIYPFTSFFLIVYCFLPALSLFSGQFIVQ 960
Query: 961 TLNVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHLAAVLQGLLK 1020
TLNVTFL YLL+I++TLC+LA+LEI+WSGI LEEWWRNEQFWLIGGTSAHLAAV+QGLLK
Sbjct: 961 TLNVTFLVYLLIISITLCLLALLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVIQGLLK 1020
Query: 1021 VVAGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLIAIAVGFSRTIYS 1080
VVAGIEISFTLTSKSGG+DVDDEFADLYIVKWTSLMIPPITIM+ NLIAIAVGFSRTIYS
Sbjct: 1021 VVAGIEISFTLTSKSGGEDVDDEFADLYIVKWTSLMIPPITIMMVNLIAIAVGFSRTIYS 1080
Query: 1081 VIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAITISLLWVAISP 1140
VIPQWS+LIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIV+VWSGL+AITISLLWVAI+P
Sbjct: 1081 VIPQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWVAINP 1140
Query: 1141 PSGTNQIGGSFTFP 1146
P+G+ QIGGSFTFP
Sbjct: 1141 PAGSTQIGGSFTFP 1145
BLAST of CmaCh02G004670 vs. TAIR 10
Match:
AT5G16910.1 (cellulose-synthase like D2 )
HSP 1 Score: 1956.0 bits (5066), Expect = 0.0e+00
Identity = 948/1152 (82.29%), Positives = 1036/1152 (89.93%), Query Frame = 0
Query: 2 ASKSFKPNRSNLSTASDASEAQKPPLPPTVTFGRRTSSGRYISYSRDDLDSELGSGDFMN 61
++K F +RSNLS SD E +PP +V F +RTSSGRYI+YSRDDLDSELG DFM+
Sbjct: 3 SNKHFDKSRSNLSNNSDIQEPGRPPAGHSVKFAQRTSSGRYINYSRDDLDSELGGQDFMS 62
Query: 62 YTVHIPPTPDNQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESEATHPQMAG 121
YTVHIPPTPDNQPMDPSISQKVEEQYV+NS+FTGGF + TRAHLM KVIE+E HPQMAG
Sbjct: 63 YTVHIPPTPDNQPMDPSISQKVEEQYVANSMFTGGFKSNTRAHLMHKVIETEPNHPQMAG 122
Query: 122 TKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEPYKNTD 181
+KGSSC+IPGCDAKVMSDERG D+LPCECDFKICRDC++DAVK GGGICPGCKEPYKNT
Sbjct: 123 SKGSSCAIPGCDAKVMSDERGQDLLPCECDFKICRDCFIDAVKTGGGICPGCKEPYKNTH 182
Query: 182 LDEIAVEHGRPLPLPPPATMSKMERRLSLMKST-KSALMRSHTGVGEFDHNKWLFETRGT 241
L + E+G+ P+ P SKMERRLS++KST KSALMRS T G+FDHN+WLFET GT
Sbjct: 183 LTDQVDENGQQRPMLPGGGGSKMERRLSMVKSTNKSALMRSQT--GDFDHNRWLFETTGT 242
Query: 242 YGYGNAIWPKDEGFENGNTDE-------VEPMEFMNKPWRPLTRKLKIPAAVLSPYRLLI 301
YGYGNA W KD F +G + +E + M++PWRPLTRKLKIPA V+SPYRLLI
Sbjct: 243 YGYGNAFWTKDGDFGSGKDGDGDGDGMGMEAQDLMSRPWRPLTRKLKIPAGVISPYRLLI 302
Query: 302 VVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDLNVL 361
+R+VVL FL WRV H N DA WLW MS+VCE+WFA SWLLDQLPKLCPINRATDL VL
Sbjct: 303 FIRIVVLALFLTWRVKHQNPDAVWLWGMSVVCELWFALSWLLDQLPKLCPINRATDLQVL 362
Query: 362 TEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLACYVS 421
EKFETP+ SNPTGKSDLPG D+FVSTADPEKEPPLVTANTILSILAA+YPVEKL+CYVS
Sbjct: 363 KEKFETPTASNPTGKSDLPGFDVFVSTADPEKEPPLVTANTILSILAAEYPVEKLSCYVS 422
Query: 422 DDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVKDRR 481
DDGGALLTFEAMAEAASFAN WVPFCRKH IEPRNP+SYFSLKRDP+KNKVK DFVKDRR
Sbjct: 423 DDGGALLTFEAMAEAASFANIWVPFCRKHAIEPRNPDSYFSLKRDPYKNKVKSDFVKDRR 482
Query: 482 RVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADEPIESVKIAKATWM 541
RVKRE+DEFKVR+N LPDSIRRRSDAYHAREEIKAMK+Q+QN DEP+E VKI KATWM
Sbjct: 483 RVKREFDEFKVRVNSLPDSIRRRSDAYHAREEIKAMKMQRQN-RDDEPMEPVKIPKATWM 542
Query: 542 ADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVEDEKLIDTSEVDIRLPLL 601
ADGTHWPGTWL +S+H+KGDHAGIIQVMLKPPSDEPLHG E +D ++VDIRLPLL
Sbjct: 543 ADGTHWPGTWLTSASDHAKGDHAGIIQVMLKPPSDEPLHG--VSEGFLDLTDVDIRLPLL 602
Query: 602 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMCFMMD 661
VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNS+A+REGMCFMMD
Sbjct: 603 VYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSEALREGMCFMMD 662
Query: 662 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRVALY 721
RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGL GPVYVGTGCLFRR+ALY
Sbjct: 663 RGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLMGPVYVGTGCLFRRIALY 722
Query: 722 GFDPPRSKEHHPGFCSCCCGGRKKHTSVASTPEESRALRMGDSDDEEMNLSLFPKRFGNS 781
GF+PPRSK+ P SCC KK + PEE+RALRM D DDEEMNLSL PK+FGNS
Sbjct: 723 GFNPPRSKDFSPSCWSCCFPRSKK----KNIPEENRALRMSDYDDEEMNLSLVPKKFGNS 782
Query: 782 TFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAEAISVISCWYEDKTE 841
TFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPR+LLDASTVAEAI+VISCWYEDKTE
Sbjct: 783 TFLIDSIPVAEFQGRPLADHPAVKNGRPPGALTIPRELLDASTVAEAIAVISCWYEDKTE 842
Query: 842 WGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATG 901
WG+R+GWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATG
Sbjct: 843 WGSRIGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATG 902
Query: 902 SVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTL 961
SVEIFFSRNNA+LAS +MK+LQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTL
Sbjct: 903 SVEIFFSRNNALLASSKMKILQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTL 962
Query: 962 NVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHLAAVLQGLLKVV 1021
NVTFL YLL+I++TLC+LA+LEI+WSGI LEEWWRNEQFWLIGGTSAHLAAVLQGLLKVV
Sbjct: 963 NVTFLVYLLIISITLCLLALLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVLQGLLKVV 1022
Query: 1022 AGIEISFTLTSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLIAIAVGFSRTIYSVI 1081
AG+EISFTLTSKSGGDD+DDEFADLY+VKWTSLMIPPITI++ NLIAIAVGFSRTIYSV+
Sbjct: 1023 AGVEISFTLTSKSGGDDIDDEFADLYMVKWTSLMIPPITIIMVNLIAIAVGFSRTIYSVV 1082
Query: 1082 PQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAITISLLWVAISPPS 1141
PQWS+LIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIV+VWSGL+AITISLLWVAI+PP+
Sbjct: 1083 PQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWVAINPPA 1142
Query: 1142 GTNQIGGSFTFP 1146
G +IGG+F+FP
Sbjct: 1143 GNTEIGGNFSFP 1145
BLAST of CmaCh02G004670 vs. TAIR 10
Match:
AT4G38190.1 (cellulose synthase like D4 )
HSP 1 Score: 1662.5 bits (4304), Expect = 0.0e+00
Identity = 807/1127 (71.61%), Positives = 938/1127 (83.23%), Query Frame = 0
Query: 30 TVTFGRRTSSGRYISYSRD--DLDSELGSGDFMNYTVHIPPTPDNQPMDPSISQKVEEQY 89
TV F RRTSSGRY+S SRD +L EL SGD+ NYTVHIPPTPDNQPM + K EEQY
Sbjct: 21 TVKFARRTSSGRYVSLSRDNIELSGEL-SGDYSNYTVHIPPTPDNQPM----ATKAEEQY 80
Query: 90 VSNSLFTGGFNNITRAHLMDKVIESEATHPQMAGTKGSSCSIPGCDAKVMSDERGNDILP 149
VSNSLFTGGFN++TRAHLMDKVI+S+ THPQMAG KGSSC++P CD VM DERG D++P
Sbjct: 81 VSNSLFTGGFNSVTRAHLMDKVIDSDVTHPQMAGAKGSSCAMPACDGNVMKDERGKDVMP 140
Query: 150 CECDFKICRDCYVDAVKLGGGICPGCKEPYKNTDLDEIAVEHGR-PLPLPPPATMSK-ME 209
CEC FKICRDC++DA K G+CPGCKE YK DLD+ ++ LPLP P +
Sbjct: 141 CECRFKICRDCFMDAQK-ETGLCPGCKEQYKIGDLDDDTPDYSSGALPLPAPGKDQRGNN 200
Query: 210 RRLSLMKSTKSALMRSHTGVGEFDHNKWLFETRGTYGYGNAIWPKDEGFENGNTDEVE-- 269
+S+MK ++ GEFDHN+WLFET+GTYGYGNA WP+DE + + + +
Sbjct: 201 NNMSMMKRNQN---------GEFDHNRWLFETQGTYGYGNAYWPQDEMYGDDMDEGMRGG 260
Query: 270 PMEFMNKPWRPLTRKLKIPAAVLSPYRLLIVVRMVVLGFFLAWRVSHPNTDAYWLWAMSI 329
+E +KPWRPL+R++ IPAA++SPYRLLIV+R VVL FFL WR+ +PN DA WLW MSI
Sbjct: 261 MVETADKPWRPLSRRIPIPAAIISPYRLLIVIRFVVLCFFLTWRIRNPNEDAIWLWLMSI 320
Query: 330 VCEIWFAFSWLLDQLPKLCPINRATDLNVLTEKFETPSPSNPTGKSDLPGIDIFVSTADP 389
+CE+WF FSW+LDQ+PKLCPINR+TDL VL +KF+ PSPSNPTG+SDLPGID+FVSTADP
Sbjct: 321 ICELWFGFSWILDQIPKLCPINRSTDLEVLRDKFDMPSPSNPTGRSDLPGIDLFVSTADP 380
Query: 390 EKEPPLVTANTILSILAADYPVEKLACYVSDDGGALLTFEAMAEAASFANTWVPFCRKHG 449
EKEPPLVTANTILSILA DYPVEK++CY+SDDGGALL+FEAMAEAASFA+ WVPFCRKH
Sbjct: 381 EKEPPLVTANTILSILAVDYPVEKVSCYLSDDGGALLSFEAMAEAASFADLWVPFCRKHN 440
Query: 450 IEPRNPESYFSLKRDPFKNKVKPDFVKDRRRVKREYDEFKVRINGLPDSIRRRSDAYHAR 509
IEPRNP+SYFSLK DP KNK + DFVKDRR++KREYDEFKVRINGLPDSIRRRSDA++AR
Sbjct: 441 IEPRNPDSYFSLKIDPTKNKSRIDFVKDRRKIKREYDEFKVRINGLPDSIRRRSDAFNAR 500
Query: 510 EEIKAMKLQKQNIGADEPIESVKIAKATWMADGTHWPGTWLQPSSEHSKGDHAGIIQVML 569
EE+KA+K +++ G +P E VK+ KATWMADGTHWPGTW + EHSKGDHAGI+QVML
Sbjct: 501 EEMKALKQMRESGG--DPTEPVKVPKATWMADGTHWPGTWAASTREHSKGDHAGILQVML 560
Query: 570 KPPSDEPLHGNVEDEKLIDTSEVDIRLPLLVYVSREKRPGYDHNKKAGAMNALVRASAIM 629
KPPS +PL GN D+K+ID S+ D RLP+ VYVSREKRPGYDHNKKAGAMNALVRASAI+
Sbjct: 561 KPPSSDPLIGN-SDDKVIDFSDTDTRLPMFVYVSREKRPGYDHNKKAGAMNALVRASAIL 620
Query: 630 SNGPFILNLDCDHYIYNSQAMREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTV 689
SNGPFILNLDCDHYIYN +A+REGMCFMMDRGG+ +CY+QFPQRFEGIDPSDRYAN+NTV
Sbjct: 621 SNGPFILNLDCDHYIYNCKAVREGMCFMMDRGGEDICYIQFPQRFEGIDPSDRYANNNTV 680
Query: 690 FFDVNMRALDGLQGPVYVGTGCLFRRVALYGFDPPRSKEHHPGFCSCCCGGRKKHTSVAS 749
FFD NMRALDG+QGPVYVGTG +FRR ALYGFDPP + +
Sbjct: 681 FFDGNMRALDGVQGPVYVGTGTMFRRFALYGFDPPNPDK-----------------LLEK 740
Query: 750 TPEESRALRMGDSDDEEMNLSLFPKRFGNSTFLIDSIPVAEFQGRPLADHPAVKNGRPPG 809
E+ AL D D +++++ PKRFGNST L +SIP+AEFQGRPLADHPAVK GRPPG
Sbjct: 741 KESETEALTTSDF-DPDLDVTQLPKRFGNSTLLAESIPIAEFQGRPLADHPAVKYGRPPG 800
Query: 810 ALTIPRDLLDASTVAEAISVISCWYEDKTEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSV 869
AL +PRD LDA+TVAE++SVISCWYEDKTEWG+RVGWIYGSVTEDVVTGYRMHNRGW+SV
Sbjct: 801 ALRVPRDPLDATTVAESVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRSV 860
Query: 870 YCVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNAILASPRMKLLQRIAYLNVG 929
YC+TKRD+FRG+APINLTDRLHQVLRWATGSVEIFFSRNNAILAS R+K LQR+AYLNVG
Sbjct: 861 YCITKRDSFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAILASKRLKFLQRLAYLNVG 920
Query: 930 IYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLLVITLTLCMLAVLEIRWSGIEL 989
IYPFTS+FLI+YCFLPA SLFSGQFIV+TL+++FL YLL+IT+ L LAVLE++WSGI L
Sbjct: 921 IYPFTSLFLILYCFLPAFSLFSGQFIVRTLSISFLVYLLMITICLIGLAVLEVKWSGIGL 980
Query: 990 EEWWRNEQFWLIGGTSAHLAAVLQGLLKVVAGIEISFTLTSKSGGDDVDDEFADLYIVKW 1049
EEWWRNEQ+WLI GTS+HL AV+QG+LKV+AGIEISFTLT+KSGGDD +D +ADLYIVKW
Sbjct: 981 EEWWRNEQWWLISGTSSHLYAVVQGVLKVIAGIEISFTLTTKSGGDDNEDIYADLYIVKW 1040
Query: 1050 TSLMIPPITIMITNLIAIAVGFSRTIYSVIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGR 1109
+SLMIPPI I + N+IAI V F RTIY +PQWS+LIGG FFSFWVLAHLYPFAKGLMGR
Sbjct: 1041 SSLMIPPIVIAMVNIIAIVVAFIRTIYQAVPQWSKLIGGAFFSFWVLAHLYPFAKGLMGR 1100
Query: 1110 RGRTPTIVFVWSGLIAITISLLWVAISPPSGTNQI-----GGSFTFP 1146
RG+TPTIVFVW+GLIAITISLLW AI+P +G GG F FP
Sbjct: 1101 RGKTPTIVFVWAGLIAITISLLWTAINPNTGPAAAAEGVGGGGFQFP 1111
BLAST of CmaCh02G004670 vs. TAIR 10
Match:
AT1G02730.1 (cellulose synthase-like D5 )
HSP 1 Score: 1508.4 bits (3904), Expect = 0.0e+00
Identity = 751/1164 (64.52%), Positives = 900/1164 (77.32%), Query Frame = 0
Query: 10 RSNLSTASDASEAQKPPLPPTVTFGRRTSS---GRYISYSRDDLDSELGSGD-FMNYTVH 69
R+++ T ++ + + +++ G R S+ GRY S S +DL +E + + ++YTVH
Sbjct: 36 RASVITNQNSPLSSRATRRTSISSGNRRSNGDEGRYCSMSVEDLTAETTNSECVLSYTVH 95
Query: 70 IPPTPDNQPMDPSISQKVEE---------QYVSNSLFTGGFNNITRAHLMDKVIESEATH 129
IPPTPD+Q + S + +E ++S ++FTGGF ++TR H++D +
Sbjct: 96 IPPTPDHQTVFASQESEEDEMLKGNSNQKSFLSGTIFTGGFKSVTRGHVID--CSMDRAD 155
Query: 130 PQMAGTKGSSCSIPGCDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEP 189
P+ G C + GCD KV+ CEC F+ICRDCY D + GGG CPGCKEP
Sbjct: 156 PEK--KSGQICWLKGCDEKVVHGR-------CECGFRICRDCYFDCITSGGGNCPGCKEP 215
Query: 190 YKNTDLD---EIAVEHGRPLPLPPPATMSKMERRLSLMKSTKSALMRSHTGVGEFDHNKW 249
Y++ + D E E PLP SK+++RLS++KS K + G+FDH +W
Sbjct: 216 YRDINDDPETEEEDEEDEAKPLPQMGE-SKLDKRLSVVKSFK-----AQNQAGDFDHTRW 275
Query: 250 LFETRGTYGYGNAIWPKDE---GFENGNTDEVEPMEFMNKPWRPLTRKLKIPAAVLSPYR 309
LFET+GTYGYGNA+WPKD G G P EF + RPLTRK+ + AA++SPYR
Sbjct: 276 LFETKGTYGYGNAVWPKDGYGIGSGGGGNGYETPPEFGERSKRPLTRKVSVSAAIISPYR 335
Query: 310 LLIVVRMVVLGFFLAWRVSHPNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDL 369
LLI +R+V LG FL WRV HPN +A WLW MS CE+WFA SWLLDQLPKLCP+NR TDL
Sbjct: 336 LLIALRLVALGLFLTWRVRHPNREAMWLWGMSTTCELWFALSWLLDQLPKLCPVNRLTDL 395
Query: 370 NVLTEKFETPSPSNPTGKSDLPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLAC 429
VL E+FE+P+ NP G+SDLPGID+FVSTADPEKEPPLVTANTILSILA DYPVEKLAC
Sbjct: 396 GVLKERFESPNLRNPKGRSDLPGIDVFVSTADPEKEPPLVTANTILSILAVDYPVEKLAC 455
Query: 430 YVSDDGGALLTFEAMAEAASFANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVK 489
Y+SDDGGALLTFEA+A+ ASFA+TWVPFCRKH IEPRNPE+YF KR+ KNKV+ DFV+
Sbjct: 456 YLSDDGGALLTFEALAQTASFASTWVPFCRKHNIEPRNPEAYFGQKRNFLKNKVRLDFVR 515
Query: 490 DRRRVKREYDEFKVRINGLPDSIRRRSDAYHAREEIKAMKLQKQNIGADEPIESVKIAKA 549
+RRRVKREYDEFKVRIN LP++IRRRSDAY+ EE++A K Q + + + P E+V + KA
Sbjct: 516 ERRRVKREYDEFKVRINSLPEAIRRRSDAYNVHEELRAKKKQMEMMMGNNPQETVIVPKA 575
Query: 550 TWMADGTHWPGTWLQPSSEHSKGDHAGIIQVMLKPPSDEPLHGNVED-EKLIDTSEVDIR 609
TWM+DG+HWPGTW +++S+GDHAGIIQ ML PP+ EP++G D E LIDT++VDIR
Sbjct: 576 TWMSDGSHWPGTWSSGETDNSRGDHAGIIQAMLAPPNAEPVYGAEADAENLIDTTDVDIR 635
Query: 610 LPLLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMC 669
LP+LVYVSREKRPGYDHNKKAGAMNALVR SAIMSNGPFILNLDCDHYIYNS A+REGMC
Sbjct: 636 LPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILNLDCDHYIYNSMALREGMC 695
Query: 670 FMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRR 729
FM+DRGGDR+CYVQFPQRFEGIDP+DRYANHNTVFFDV+MRALDGLQGP+YVGTGC+FRR
Sbjct: 696 FMLDRGGDRICYVQFPQRFEGIDPNDRYANHNTVFFDVSMRALDGLQGPMYVGTGCIFRR 755
Query: 730 VALYGFDPPRSKEHHP--GFCSCCCGGRKKHTSVASTPEESRAL----RMGDSDDEEMNL 789
ALYGF PPR+ EHH G R+ + E S + ++DD ++
Sbjct: 756 TALYGFSPPRATEHHGWLGRRKVKISLRRPKAMMKKDDEVSLPINGEYNEEENDDGDIES 815
Query: 790 SLFPKRFGNSTFLIDSIPVAEFQGRPLAD-HPAVKNGRPPGALTIPRDLLDASTVAEAIS 849
L PKRFGNS + SIPVAE+QGR + D KN RP G+L +PR+ LDA+TVAEAIS
Sbjct: 816 LLLPKRFGNSNSFVASIPVAEYQGRLIQDLQGKGKNSRPAGSLAVPREPLDAATVAEAIS 875
Query: 850 VISCWYEDKTEWGNRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTD 909
VISC+YEDKTEWG RVGWIYGSVTEDVVTGYRMHNRGW+S+YCVTKRDAFRGTAPINLTD
Sbjct: 876 VISCFYEDKTEWGKRVGWIYGSVTEDVVTGYRMHNRGWRSIYCVTKRDAFRGTAPINLTD 935
Query: 910 RLHQVLRWATGSVEIFFSRNNAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALS 969
RLHQVLRWATGSVEIFFSRNNAI A+ RMK LQR+AY NVG+YPFTS+FLIVYC LPA+S
Sbjct: 936 RLHQVLRWATGSVEIFFSRNNAIFATRRMKFLQRVAYFNVGMYPFTSLFLIVYCILPAIS 995
Query: 970 LFSGQFIVQTLNVTFLTYLLVITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHL 1029
LFSGQFIVQ+L++TFL YLL ITLTLCML++LEI+WSGI L EWWRNEQFW+IGGTSAH
Sbjct: 996 LFSGQFIVQSLDITFLIYLLSITLTLCMLSLLEIKWSGITLHEWWRNEQFWVIGGTSAHP 1055
Query: 1030 AAVLQGLLKVVAGIEISFTLTSKSGG-DDVDDEFADLYIVKWTSLMIPPITIMITNLIAI 1089
AAVLQGLLKV+AG++ISFTLTSKS +D DDEFADLY+VKW+ LM+PP+TIM+ N+IAI
Sbjct: 1056 AAVLQGLLKVIAGVDISFTLTSKSSAPEDGDDEFADLYVVKWSFLMVPPLTIMMVNMIAI 1115
Query: 1090 AVGFSRTIYSVIPQWSRLIGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAIT 1146
AVG +RT+YS PQWS+L+GGVFFSFWVL HLYPFAKGLMGRRGR PTIVFVWSGL++I
Sbjct: 1116 AVGLARTLYSPFPQWSKLVGGVFFSFWVLCHLYPFAKGLMGRRGRVPTIVFVWSGLLSII 1175
BLAST of CmaCh02G004670 vs. TAIR 10
Match:
AT2G33100.1 (cellulose synthase-like D1 )
HSP 1 Score: 1423.3 bits (3683), Expect = 0.0e+00
Identity = 711/1138 (62.48%), Positives = 858/1138 (75.40%), Query Frame = 0
Query: 17 SDASEAQKPPLPPTVTFGRRTSSGRYISYSRDDLDSEL-----GSGDFMNYTVHIPPTPD 76
S +S +P P V FGRRTSSGR +S SRDD D ++ G D++NYTV +PPTPD
Sbjct: 12 SQSSSLSRP--PQAVKFGRRTSSGRIVSLSRDD-DMDVSGDYSGQNDYINYTVLMPPTPD 71
Query: 77 NQPMDPSISQKVEEQYVSNSLFTGGFNNITRAHLMDKVIESEATHPQMAGTKGSSCSIPG 136
NQP AG+ GS+
Sbjct: 72 NQP---------------------------------------------AGSSGST----- 131
Query: 137 CDAKVMSDERGNDILPCECDFKICRDCYVDAVKLGGGICPGCKEPYKNTDLDEIAVEHGR 196
S+ +G DA + GGG
Sbjct: 132 ------SESKG------------------DANRGGGG----------------------- 191
Query: 197 PLPLPPPATMSKMERRLSLMKS-TKSALMRSHTGVGEFDHNKWLFETRGTYGYGNAIWPK 256
P +K+ERRLS+MKS KS L+RS T G+FDHN+WLFE++G YG GNA W +
Sbjct: 192 ---GDGPKMGNKLERRLSVMKSNNKSMLLRSQT--GDFDHNRWLFESKGKYGIGNAFWSE 251
Query: 257 DEGFENGNTDEVEPMEFMNKPWRPLTRKLKIPAAVLSPYRLLIVVRMVVLGFFLAWRVSH 316
++ +G V +F++KPW+PLTRK++IPA +LSPYRLLIV+R+V++ FFL WR+++
Sbjct: 252 EDDTYDGG---VSKSDFLDKPWKPLTRKVQIPAKILSPYRLLIVIRLVIVFFFLWWRITN 311
Query: 317 PNTDAYWLWAMSIVCEIWFAFSWLLDQLPKLCPINRATDLNVLTEKFETPSPSNPTGKSD 376
PN DA WLW +SIVCEIWFAFSW+LD LPKL PINRATDL L +KFE PSPSNPTG+SD
Sbjct: 312 PNEDAMWLWGLSIVCEIWFAFSWILDILPKLNPINRATDLAALHDKFEQPSPSNPTGRSD 371
Query: 377 LPGIDIFVSTADPEKEPPLVTANTILSILAADYPVEKLACYVSDDGGALLTFEAMAEAAS 436
LPG+D+FVSTADPEKEPPLVTANT+LSILA DYP+EKL+ Y+SDDGGA+LTFEAMAEA
Sbjct: 372 LPGVDVFVSTADPEKEPPLVTANTLLSILAVDYPIEKLSAYISDDGGAILTFEAMAEAVR 431
Query: 437 FANTWVPFCRKHGIEPRNPESYFSLKRDPFKNKVKPDFVKDRRRVKREYDEFKVRINGLP 496
FA WVPFCRKH IEPRNP+SYFS+K+DP KNK + DFVKDRR +KREYDEFKVRINGLP
Sbjct: 432 FAEYWVPFCRKHDIEPRNPDSYFSIKKDPTKNKKRQDFVKDRRWIKREYDEFKVRINGLP 491
Query: 497 DSIRRRSDAYHAREEIKAMKLQKQNIGADEPIESVKIAKATWMADGTHWPGTWLQPSSEH 556
+ I++R++ ++ REE+K ++ ++ G P + V++ KATWMADGTHWPGTW +P +H
Sbjct: 492 EQIKKRAEQFNMREELKEKRIAREKNGGVLPPDGVEVVKATWMADGTHWPGTWFEPKPDH 551
Query: 557 SKGDHAGIIQVMLKPPSDEPLHGNVEDEKLIDTSEVDIRLPLLVYVSREKRPGYDHNKKA 616
SKGDHAGI+Q+M K P EP+ G +E +D + +DIR+P+ YVSREKRPG+DHNKKA
Sbjct: 552 SKGDHAGILQIMSKVPDLEPVMGG-PNEGALDFTGIDIRVPMFAYVSREKRPGFDHNKKA 611
Query: 617 GAMNALVRASAIMSNGPFILNLDCDHYIYNSQAMREGMCFMMDRGGDRLCYVQFPQRFEG 676
GAMN +VRASAI+SNG FILNLDCDHYIYNS+A++EGMCFMMDRGGDR+CY+QFPQRFEG
Sbjct: 612 GAMNGMVRASAILSNGAFILNLDCDHYIYNSKAIKEGMCFMMDRGGDRICYIQFPQRFEG 671
Query: 677 IDPSDRYANHNTVFFDVNMRALDGLQGPVYVGTGCLFRRVALYGFDPPRSKEHHPGFCSC 736
IDPSDRYANHNTVFFD NMRALDGLQGPVYVGTGC+FRR ALYGF+PPR+ E+ F
Sbjct: 672 IDPSDRYANHNTVFFDGNMRALDGLQGPVYVGTGCMFRRYALYGFNPPRANEYSGVF--- 731
Query: 737 CCGGRKK----HTSVASTPEESRALRMGDSDDEEMN----LSLFPKRFGNSTFLIDSIPV 796
G++K H S ++ +SD + +N L L PK+FGNST D+IPV
Sbjct: 732 ---GQEKAPAMHVRTQSQASQTSQASDLESDTQPLNDDPDLGL-PKKFGNSTMFTDTIPV 791
Query: 797 AEFQGRPLADHPAVKNGRPPGALTIPRDLLDASTVAEAISVISCWYEDKTEWGNRVGWIY 856
AE+QGRPLADH +VKNGRPPGAL +PR LDA TVAEAI+VISCWYED TEWG+R+GWIY
Sbjct: 792 AEYQGRPLADHMSVKNGRPPGALLLPRPPLDAPTVAEAIAVISCWYEDNTEWGDRIGWIY 851
Query: 857 GSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRN 916
GSVTEDVVTGYRMHNRGW+SVYC+TKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFS+N
Sbjct: 852 GSVTEDVVTGYRMHNRGWRSVYCITKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSKN 911
Query: 917 NAILASPRMKLLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLL 976
NA+ A+ R+K LQR+AYLNVGIYPFTSIFL+VYCFLPAL LFSG+FIVQ+L++ FL+YLL
Sbjct: 912 NAMFATRRLKFLQRVAYLNVGIYPFTSIFLVVYCFLPALCLFSGKFIVQSLDIHFLSYLL 971
Query: 977 VITLTLCMLAVLEIRWSGIELEEWWRNEQFWLIGGTSAHLAAVLQGLLKVVAGIEISFTL 1036
IT+TL ++++LE++WSGI LEEWWRNEQFWLIGGTSAHLAAV+QGLLKV+AGIEISFTL
Sbjct: 972 CITVTLTLISLLEVKWSGIGLEEWWRNEQFWLIGGTSAHLAAVVQGLLKVIAGIEISFTL 1031
Query: 1037 TSKSGGDDVDDEFADLYIVKWTSLMIPPITIMITNLIAIAVGFSRTIYSVIPQWSRLIGG 1096
TSK+ G+D DD FADLYIVKWT L I P+TI+I NL+AI +G SRTIYSVIPQW +L+GG
Sbjct: 1032 TSKASGEDEDDIFADLYIVKWTGLFIMPLTIIIVNLVAIVIGASRTIYSVIPQWGKLMGG 1033
Query: 1097 VFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLIAITISLLWVAISPPSGTNQIGG 1141
+FFS WVL H+YPFAKGLMGRRG+ PTIV+VWSGL++IT+SLLW+ ISPP + GG
Sbjct: 1092 IFFSLWVLTHMYPFAKGLMGRRGKVPTIVYVWSGLVSITVSLLWITISPPDDVSGSGG 1033
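The percentages on each HSP statistics line above follow from the residue counts: identical (or positive-scoring) positions divided by the alignment length. The following is a minimal Python sketch, not part of the database output, that parses one such line and recomputes both values. The function name `parse_hsp_stats` and the returned field names are hypothetical, and the pattern assumes the exact line layout used in this report (it accepts both the "Positives" and "Postives" spellings).

```python
import re

# Pattern for the HSP statistics line as printed in this report, e.g.
# "Identity = 883/1160 (76.12%), Positives = 981/1160 (84.57%), Query Frame = 0".
# "Posi?tives" also matches the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Extract residue counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": ident_len,
        # Percentage = matching positions / alignment length, rounded to two decimals.
        "identity_pct": round(100.0 * identical / ident_len, 2),
        "positives_pct": round(100.0 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 963/1154 (83.45%), Positives = 1051/1154 (91.07%), Query Frame = 0"
    )
    print(stats)
```

Recomputing the percentages from the counts, rather than reading the printed values, gives a quick consistency check when these lines are copied into other tables.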
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9M9M4 | 0.0e+00 | 83.45 | Cellulose synthase-like protein D3 OS=Arabidopsis thaliana OX=3702 GN=CSLD3 PE=1...
Q9LFL0 | 0.0e+00 | 82.29 | Cellulose synthase-like protein D2 OS=Arabidopsis thaliana OX=3702 GN=CSLD2 PE=3...
A2YU42 | 0.0e+00 | 80.62 | Cellulose synthase-like protein D2 OS=Oryza sativa subsp. indica OX=39946 GN=CSL...
Q9LHZ7 | 0.0e+00 | 80.62 | Cellulose synthase-like protein D2 OS=Oryza sativa subsp. japonica OX=39947 GN=C...
A2ZAK8 | 0.0e+00 | 76.12 | Cellulose synthase-like protein D1 OS=Oryza sativa subsp. indica OX=39946 GN=CSL...