Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTTATTCCATCACTTTGGCCCTCTACTTCATTTTGCCTTTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTACTCTACTGCTCTTCAGTATTCCATTCTCTTCTTTGAGGGACAGCGATCCGGGAAGCTTCCCTCGAACCAACGTCTCACATGGAGAGGAAATTCTGCCTTATCAGATGGCTCCTCTTATCATGTGCGTTGTTCCCCTTTTGCTTATTTTGATGTTCATTACTTCTATATAAGTCTAACCTAGTTGATATTGATAAATAACTCAGGCTCTAACTTGATAAGACGTGAATATGAACTTTCCAGGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTTACTACTACATTGTTGGCTTGGAGTGTCATTGAGTTTGGTGACTCGATGGGGAACGAGATTGAGAATGCAAGAGAAGCAGTCCGTTGGGGGTCGGATTATCTATTGAAGGCTGCCACCGACGCGCCTAATGGCTTATATGTTCAAGTGAGTAGAGTGACTATAAAGAAGAATAAGATGATTAAGATAATAATAATCACTATAATAAGGATATTTGTAACGTATGTAAACTTGTGGTGTTAAGGTGGGAGATCCAAACCTTGATCATAAATGTTGGGAAAGGCCAGAAGATATGGACACACCACGCACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCCGATGTGGCATCAGAGACTGCAGCTGCATTGGCTGCAGCTTCGATTGTGTTCAATACATCCGATCCTTCATATTCCAACAAATTGCTTGACGCGGCCTTAAAAGTAAATCATACTCCCGATCCCTCCTATTCCATTTTTCTTTCACATTAGACATAGCCATCGGACCTCATTATGCTCCAAACAAGGTAACAAGTTCCCATCAATGCAATGCAGGTATTCGATTTTGCAGACAAGCATAGAGGTGCTTACAGTGATTCCCTCCATTCAGTGGTCTGTCCATTTTACTGCTCTTACTCAGGATACAATGTAAGTAAGAAGATACACATAAACCTACTTGTTTTCTTTCTAGGTGTTGTTTTTTCTGAGTTTATATGACTTCTATCAGGATGAGCTTCTATGGGGTGCCTCATGGATATACAAAGCCTCAAAAAACAGCATACATTTGAACTATATCCAGTCCAATGGCCATATACTGGGAGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCGGGAACCAAAATCCTTCTCTCCCAGGTTTGTGTTTACTCCCTCTAAATAAAAATCAGTCCAAAATCTAATTCCTAAACTCAAGCTTGATGATCTTAACATCGCGTGGACCTTTGCCACAGGATTTCCTAGTGCAAAATTCAGAGGAGTTCCAAATCTATAAAGCACACTCAGATAACTACATATGCTCCCTCATTCCAGGAACTTCCAGTTCTAGTGGTCAATATACTCCTGGTTAGCAAAAACTAACTCTTCGCTTGTCATTACTTTATGTGTGTGCATATATGAGGTCTTAATCCTATTTTTTGTTTATGTCCTAAAACATTAAGCATTTTTTCATATTGATACTTGAATTTTTAAAATATTCATTTTGGTCCTTAAAACTTTTTTCTTTAACAAACGATCGTTCTAGTTCAAAACTTTCGAAATCTCTATTTTAGTACTTCAACTTCTAAAAAAGACAATTTTTGTCCAATACCATTAATTTGTGCTTGCTACCTAATATTTACCCACCCAATACTCACTTTCAACTTACCTATGTGGATTCATTTAAGTGCCAAAGTTCATACATTAATAAACGTATTAAAGATATACTACCCATTCATTTAAATCAAAATGGTAATTTTTAAAGTTTAGGAACAAATTAAAGACAAATATGAATGAAAGTTTAAAGTAAATAGGCCAAAATAGATATTTTGAAAGTAAAGTGACCAAAATGAACTTAAGTTGAAAAGAACATGAAACAAAATTAACTTAAATTGGAAGTATAGACGCTAAAATCACAGGGTAAAGGACTAAAATGTAGCTTAAACCTATACACATGAAACCAAAAGCAGAGCTAGTACACAAACAAGACAGAACAGATCCCCATTTTCAATGAAAACAAATCGTGATCATGAAATGATAAGTGCAGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCCTTCCTCCTTGTAACGTACGCGAAATACCTAAGCTCCAACGGAGGAGCAATTCGATGTGGAACTTCAAGGATTTCACCAGAAGAACTAATCGCAGAGGCAAAGAAGCAAGTTGATTACATATTAGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCACCATAGAGGCTCCTCTGTGCCGTCTCTTCATTCGCGCCCTAATCAAGTTTCTTGCAATGAAGGCTTCCAGTTTCTGTACTCTTCTTCCCCCAATCCGAACGTGCTCGCTGGCGCCATTGTTGGTGGACCTGATAATGGCGATAAGTTCTCCGACGATCGCAATAACTATCAGCAGTCGGAGCCCGCCACTTACATAAATGCCCCATTCGTTGGCGCCTTGGCCTATTTTTCAAAAACGCCCTAATGGATAATCCACCCTTTGGGCGGAGGCTAAGAAGGAAGGAGAGAGAGGTCATTTTGGACATTTGTCTTACGAGTCTCGGTTTAGGGTTTAGGTCGGGGTCGGTCGATCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAAAATATATATATTAACGTTTTAAGGATCCGATCTTACTGAACCCAATAAAAATTTTCTAGTTTATTTAGTTTTTTCTCTTACCCTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTATGATATATAAACGGAACAGTAAACCTAATCTCCAAATCTATTCTAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAAAATATATATATTAACGTTTTAAGGATCCGATCTTACTGAACCCAATAAAAATTTTCTAGTTTATTTAGTTTTTTCTCTTACCCTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTATGATATATAAACGGAACAGTAAACCTAATCTCCAAATCTATTCTAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAATAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAAAATATATATATTAACGTTTTAAGGATCCGATCTTGCTGAACCCAATAAAAATTTTCTAGTTTATTTAGTTTTTTCTCTTACCCTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTATGATATATAAACGGAACAGTAAACCTAATCTCCAAATCTATTCTAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAATATATATATATTAACGTTTTAAGGATCTGATCTTACCGAACCTAATAGAAATTTTCTAGTTTATTTAGTTTTTTTGTCGTTTCTCTTGCACTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTAAGATATATAAAATTCAAATCTCCACTTGTTATGTACATATATTAAAAAGTTAATTTAAATATAGTTCCGTGGATAGTTCGTCTTCAACACCAAGATTGGTGGAAGTATAAATTTCCAAATAAAATTAATTTTTTGATTCCGATATAACCAGGGAGATTTGCAGAAGCAGATCTGAGAGATAAGAAGAAAACTCCGATCGTCTTCTTCGTCGGCTTCAACTCTCGTCGCCGTGGCCGCCATGGATTTCTCCTCCATCTTTCTCCTCCTTGTTTTCCTCTTACCTATATCTTCTTTTGCAGAGATTCGCTTCACCGACATCAGAAATGACAATCGACCAATCATTCCCTTCGACGTCTTCGGCTTCAGCCATGGCGGTCGTCTCGAGCTCAACGTCTCTCATGTCTCACTCTCCGATACCAATCCAGATCTGGACCTCTCCAAGGCTGGATTCTTCCTCTGTACTCGAGAATCGTGGCTTCATGTGATCCAGCAATTAGAGGAAGCGGAAATCTCTTGCGCTCTTCAATCCGACCTCGTCAAACCGGTCTACACCTTCAATTCACTCAAAGGTCAGGACCGATTCGATGTTCTTTATTCCGAGAGTGATGCCGATCAATACACTCTCGTTTTTGCTAACTGTCTTCAGCAACTTAAGGTTTCTATGGACGTTCGATCTGCTATGTACAATCTCGAAGGCAAAAGCGGTCGCCGGGATTATCTTTCTGCTGGGAAAACCATCCTCCCGAGGATTTACTTCATTTTCTCTATGATTTATTTCTTGCTCGCTATCGTATGGATTCATGTTCTTTACAAAAAGCGATTGACGGTTTATGGTATTCATTTCTTTATGCTTGCTGTTGTGATCTTGAAGGCTTTGAATCTTATCTGTGAGGCTGAGGATAAATCGTACATTAAGCGCACTGGGAGCGCCCATGGTTGGGATATTCTGTTCTACATATTTAGTTTCTTAAAGGGCATCACCTTGTTCACTTTGATCGTCTTGATTGGCACTGGTTGGTCTTTCTTGAAACCATATTTGCAGGACAAGGAGAAGAAGGTCTTGATGATTGTGATTCCATTGCAAGTGGTGGCTAATATTGCCCAGGTTGTGACTGATGAAACTGGGCCATTTGAGCAAGAATGGGTCACTTGGAAACAGGTGTTTTTGCTTGTTGATGTGATCTGTTGCTGTGCTGTTTTGTTCCCCATTGTTTGGTCAATCAAGAACTTGCGTGAGGCTGCACGAACAGATGGAAAAGCAGCTGTGAATTTGATGAAATTGACCCTTTTTAGACAGTATTACATTGTGGTTATATGCTATATCTACTTCACTCGGGTTGTCGTTTATGCACTTGAGACAATTACATCGTATCGGTATCTTTGGACGAGTGTGGTGGCCGGGGAACTGGCAACGTTTGCATTTTATGCATTCACTGGTTACAAGTTCAAGCCTGAGGCTCATAATCCTTATTTTGTAGTTGATGATGAGGAGGAGGAAGCAGCTGCTGAGGCACTGAAGCTTGAAGATGAGTTTGAATTGTGA
mRNA sequence
ATGCCTTATTCCATCACTTTGGCCCTCTACTTCATTTTGCCTTTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTACTCTACTGCTCTTCAGTATTCCATTCTCTTCTTTGAGGGACAGCGATCCGGGAAGCTTCCCTCGAACCAACGTCTCACATGGAGAGGAAATTCTGCCTTATCAGATGGCTCCTCTTATCATGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTTACTACTACATTGTTGGCTTGGAGTGTCATTGAGTTTGGTGACTCGATGGGGAACGAGATTGAGAATGCAAGAGAAGCAGTCCGTTGGGGGTCGGATTATCTATTGAAGGCTGCCACCGACGCGCCTAATGGCTTATATGTTCAAGTGGGAGATCCAAACCTTGATCATAAATGTTGGGAAAGGCCAGAAGATATGGACACACCACGCACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCCGATGTGGCATCAGAGACTGCAGCTGCATTGGCTGCAGCTTCGATTGTGTTCAATACATCCGATCCTTCATATTCCAACAAATTGCTTGACGCGGCCTTAAAAGTATTCGATTTTGCAGACAAGCATAGAGGTGCTTACAGTGATTCCCTCCATTCAGTGGTCTGTCCATTTTACTGCTCTTACTCAGGATACAATGATGAGCTTCTATGGGGTGCCTCATGGATATACAAAGCCTCAAAAAACAGCATACATTTGAACTATATCCAGTCCAATGGCCATATACTGGGAGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCGGGAACCAAAATCCTTCTCTCCCAGGATTTCCTAGTGCAAAATTCAGAGGAGTTCCAAATCTATAAAGCACACTCAGATAACTACATATGCTCCCTCATTCCAGGAACTTCCAGTTCTAGTGGTCAATATACTCCTGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCCTTCCTCCTTGTAACGTACGCGAAATACCTAAGCTCCAACGGAGGAGCAATTCGATGTGGAACTTCAAGGATTTCACCAGAAGAACTAATCGCAGAGGCAAAGAAGCAAGTTGATTACATATTAGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCACCATAGAGGCTCCTCTGTGCCGTCTCTTCATTCGCGCCCTAATCAAGTTTCTTGCAATGAAGGCTTCCAGTTTCTGTACTCTTCTTCCCCCAATCCGAACGTGCTCGCTGGCGCCATTGTTGGTGGACCTGATAATGGCGATAAGTTCTCCGACGATCGCAATAACTATCAGCAGTCGGAGCCCGCCACTTACATAAATGCCCCATTCATAAGAAGAAAACTCCGATCGTCTTCTTCGTCGGCTTCAACTCTCGTCGCCGTGGCCGCCATGGATTTCTCCTCCATCTTTCTCCTCCTTGTTTTCCTCTTACCTATATCTTCTTTTGCAGAGATTCGCTTCACCGACATCAGAAATGACAATCGACCAATCATTCCCTTCGACGTCTTCGGCTTCAGCCATGGCGGTCGTCTCGAGCTCAACGTCTCTCATGTCTCACTCTCCGATACCAATCCAGATCTGGACCTCTCCAAGGCTGGATTCTTCCTCTGTACTCGAGAATCGTGGCTTCATGTGATCCAGCAATTAGAGGAAGCGGAAATCTCTTGCGCTCTTCAATCCGACCTCGTCAAACCGGTCTACACCTTCAATTCACTCAAAGGTCAGGACCGATTCGATGTTCTTTATTCCGAGAGTGATGCCGATCAATACACTCTCGTTTTTGCTAACTGTCTTCAGCAACTTAAGGTTTCTATGGACGTTCGATCTGCTATGTACAATCTCGAAGGCAAAAGCGGTCGCCGGGATTATCTTTCTGCTGGGAAAACCATCCTCCCGAGGATTTACTTCATTTTCTCTATGATTTATTTCTTGCTCGCTATCGTATGGATTCATGTTCTTTACAAAAAGCGATTGACGGTTTATGGTATTCATTTCTTTATGCTTGCTGTTGTGATCTTGAAGGCTTTGAATCTTATCTGTGAGGCTGAGGATAAATCGTACATTAAGCGCACTGGGAGCGCCCATGGTTGGGATATTCTGTTCTACATATTTAGTTTCTTAAAGGGCATCACCTTGTTCACTTTGATCGTCTTGATTGGCACTGGTTGGTCTTTCTTGAAACCATATTTGCAGGACAAGGAGAAGAAGGTCTTGATGATTGTGATTCCATTGCAAGTGGTGGCTAATATTGCCCAGGTTGTGACTGATGAAACTGGGCCATTTGAGCAAGAATGGGTCACTTGGAAACAGGTGTTTTTGCTTGTTGATGTGATCTGTTGCTGTGCTGTTTTGTTCCCCATTGTTTGGTCAATCAAGAACTTGCGTGAGGCTGCACGAACAGATGGAAAAGCAGCTGTGAATTTGATGAAATTGACCCTTTTTAGACAGTATTACATTGTGGTTATATGCTATATCTACTTCACTCGGGTTGTCGTTTATGCACTTGAGACAATTACATCGTATCGGTATCTTTGGACGAGTGTGGTGGCCGGGGAACTGGCAACGTTTGCATTTTATGCATTCACTGGTTACAAGTTCAAGCCTGAGGCTCATAATCCTTATTTTGTAGTTGATGATGAGGAGGAGGAAGCAGCTGCTGAGGCACTGAAGCTTGAAGATGAGTTTGAATTGTGA
Coding sequence (CDS)
ATGCCTTATTCCATCACTTTGGCCCTCTACTTCATTTTGCCTTTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTACTCTACTGCTCTTCAGTATTCCATTCTCTTCTTTGAGGGACAGCGATCCGGGAAGCTTCCCTCGAACCAACGTCTCACATGGAGAGGAAATTCTGCCTTATCAGATGGCTCCTCTTATCATGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTTACTACTACATTGTTGGCTTGGAGTGTCATTGAGTTTGGTGACTCGATGGGGAACGAGATTGAGAATGCAAGAGAAGCAGTCCGTTGGGGGTCGGATTATCTATTGAAGGCTGCCACCGACGCGCCTAATGGCTTATATGTTCAAGTGGGAGATCCAAACCTTGATCATAAATGTTGGGAAAGGCCAGAAGATATGGACACACCACGCACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCCGATGTGGCATCAGAGACTGCAGCTGCATTGGCTGCAGCTTCGATTGTGTTCAATACATCCGATCCTTCATATTCCAACAAATTGCTTGACGCGGCCTTAAAAGTATTCGATTTTGCAGACAAGCATAGAGGTGCTTACAGTGATTCCCTCCATTCAGTGGTCTGTCCATTTTACTGCTCTTACTCAGGATACAATGATGAGCTTCTATGGGGTGCCTCATGGATATACAAAGCCTCAAAAAACAGCATACATTTGAACTATATCCAGTCCAATGGCCATATACTGGGAGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCGGGAACCAAAATCCTTCTCTCCCAGGATTTCCTAGTGCAAAATTCAGAGGAGTTCCAAATCTATAAAGCACACTCAGATAACTACATATGCTCCCTCATTCCAGGAACTTCCAGTTCTAGTGGTCAATATACTCCTGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCCTTCCTCCTTGTAACGTACGCGAAATACCTAAGCTCCAACGGAGGAGCAATTCGATGTGGAACTTCAAGGATTTCACCAGAAGAACTAATCGCAGAGGCAAAGAAGCAAGTTGATTACATATTAGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCACCATAGAGGCTCCTCTGTGCCGTCTCTTCATTCGCGCCCTAATCAAGTTTCTTGCAATGAAGGCTTCCAGTTTCTGTACTCTTCTTCCCCCAATCCGAACGTGCTCGCTGGCGCCATTGTTGGTGGACCTGATAATGGCGATAAGTTCTCCGACGATCGCAATAACTATCAGCAGTCGGAGCCCGCCACTTACATAAATGCCCCATTCATAAGAAGAAAACTCCGATCGTCTTCTTCGTCGGCTTCAACTCTCGTCGCCGTGGCCGCCATGGATTTCTCCTCCATCTTTCTCCTCCTTGTTTTCCTCTTACCTATATCTTCTTTTGCAGAGATTCGCTTCACCGACATCAGAAATGACAATCGACCAATCATTCCCTTCGACGTCTTCGGCTTCAGCCATGGCGGTCGTCTCGAGCTCAACGTCTCTCATGTCTCACTCTCCGATACCAATCCAGATCTGGACCTCTCCAAGGCTGGATTCTTCCTCTGTACTCGAGAATCGTGGCTTCATGTGATCCAGCAATTAGAGGAAGCGGAAATCTCTTGCGCTCTTCAATCCGACCTCGTCAAACCGGTCTACACCTTCAATTCACTCAAAGGTCAGGACCGATTCGATGTTCTTTATTCCGAGAGTGATGCCGATCAATACACTCTCGTTTTTGCTAACTGTCTTCAGCAACTTAAGGTTTCTATGGACGTTCGATCTGCTATGTACAATCTCGAAGGCAAAAGCGGTCGCCGGGATTATCTTTCTGCTGGGAAAACCATCCTCCCGAGGATTTACTTCATTTTCTCTATGATTTATTTCTTGCTCGCTATCGTATGGATTCATGTTCTTTACAAAAAGCGATTGACGGTTTATGGTATTCATTTCTTTATGCTTGCTGTTGTGATCTTGAAGGCTTTGAATCTTATCTGTGAGGCTGAGGATAAATCGTACATTAAGCGCACTGGGAGCGCCCATGGTTGGGATATTCTGTTCTACATATTTAGTTTCTTAAAGGGCATCACCTTGTTCACTTTGATCGTCTTGATTGGCACTGGTTGGTCTTTCTTGAAACCATATTTGCAGGACAAGGAGAAGAAGGTCTTGATGATTGTGATTCCATTGCAAGTGGTGGCTAATATTGCCCAGGTTGTGACTGATGAAACTGGGCCATTTGAGCAAGAATGGGTCACTTGGAAACAGGTGTTTTTGCTTGTTGATGTGATCTGTTGCTGTGCTGTTTTGTTCCCCATTGTTTGGTCAATCAAGAACTTGCGTGAGGCTGCACGAACAGATGGAAAAGCAGCTGTGAATTTGATGAAATTGACCCTTTTTAGACAGTATTACATTGTGGTTATATGCTATATCTACTTCACTCGGGTTGTCGTTTATGCACTTGAGACAATTACATCGTATCGGTATCTTTGGACGAGTGTGGTGGCCGGGGAACTGGCAACGTTTGCATTTTATGCATTCACTGGTTACAAGTTCAAGCCTGAGGCTCATAATCCTTATTTTGTAGTTGATGATGAGGAGGAGGAAGCAGCTGCTGAGGCACTGAAGCTTGAAGATGAGTTTGAATTGTGA
Protein sequence
MPYSITLALYFILPFFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPFIRRKLRSSSSSASTLVAVAAMDFSSIFLLLVFLLPISSFAEIRFTDIRNDNRPIIPFDVFGFSHGGRLELNVSHVSLSDTNPDLDLSKAGFFLCTRESWLHVIQQLEEAEISCALQSDLVKPVYTFNSLKGQDRFDVLYSESDADQYTLVFANCLQQLKVSMDVRSAMYNLEGKSGRRDYLSAGKTILPRIYFIFSMIYFLLAIVWIHVLYKKRLTVYGIHFFMLAVVILKALNLICEAEDKSYIKRTGSAHGWDILFYIFSFLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVANIAQVVTDETGPFEQEWVTWKQVFLLVDVICCCAVLFPIVWSIKNLREAARTDGKAAVNLMKLTLFRQYYIVVICYIYFTRVVVYALETITSYRYLWTSVVAGELATFAFYAFTGYKFKPEAHNPYFVVDDEEEEAAAEALKLEDEFEL
Homology
BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match:
P05522 (Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1)
HSP 1 Score: 731.5 bits (1887), Expect = 1.2e-209
Identity = 347/481 (72.14%), Postives = 404/481 (83.99%), Query Frame = 0
Query: 1 MPYSITLALYFILPFFTLSSSAFTSQ--HYSTALQYSILFFEGQRSGKLPSNQRLTWRGN 60
M S L+L+ +L T+ ++ HYS AL+ SILFFEGQRSGKLP+NQRLTWRG+
Sbjct: 1 MDCSSPLSLFHLLLVCTVMVKCCSASDLHYSDALEKSILFFEGQRSGKLPTNQRLTWRGD 60
Query: 61 SALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVR 120
S LSDGSSYHVDLVGGYYDAGDN+KFGLPMAFTTT+LAW +IEFG M ++ENAR A+R
Sbjct: 61 SGLSDGSSYHVDLVGGYYDAGDNLKFGLPMAFTTTMLAWGIIEFGCLMPEQVENARAALR 120
Query: 121 WGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETA 180
W +DYLLKA+T N LYVQVG+PN DH+CWERPEDMDTPR VYK++ QNPGSDVA+ETA
Sbjct: 121 WSTDYLLKASTATSNSLYVQVGEPNADHRCWERPEDMDTPRNVYKVSTQNPGSDVAAETA 180
Query: 181 AALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDE 240
AALAAASIVF SD SYS KLL A+KVF+FAD++RG+YSDSL SVVCPFYCSYSGYNDE
Sbjct: 181 AALAAASIVFGDSDSSYSTKLLHTAVKVFEFADQYRGSYSDSLGSVVCPFYCSYSGYNDE 240
Query: 241 LLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSE 300
LLWGASW+++AS+N+ ++ YIQSNGH LGADDDDY+FSWDDKR GTK+LLS+ FL E
Sbjct: 241 LLWGASWLHRASQNASYMTYIQSNGHTLGADDDDYSFSWDDKRVGTKVLLSKGFLQDRIE 300
Query: 301 EFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSS 360
E Q+YK H+DNYICSLIPGTSS QYTPGGL +KGS SNLQYVTS AFLL+TYA YL+S
Sbjct: 301 ELQLYKVHTDNYICSLIPGTSSFQAQYTPGGLLYKGSASNLQYVTSTAFLLLTYANYLNS 360
Query: 361 NGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSL 420
+GG CGT+ ++ + LI+ AKKQVDYILG+NP KMSYMVGFGERYPQH+HHRGSS+PS+
Sbjct: 361 SGGHASCGTTTVTAKNLISLAKKQVDYILGQNPAKMSYMVGFGERYPQHVHHRGSSLPSV 420
Query: 421 HSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPF 480
PN + CN GFQ+LYSS PNPN+L GAI+GGPDN D FSDDRNNYQQSEPATYINAP
Sbjct: 421 QVHPNSIPCNAGFQYLYSSPPNPNILVGAILGGPDNRDSFSDDRNNYQQSEPATYINAPL 480
BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match:
Q6YXT7 (Endoglucanase 19 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0114200 PE=2 SV=1)
HSP 1 Score: 645.6 bits (1664), Expect = 8.7e-184
Identity = 303/481 (62.99%), Postives = 379/481 (78.79%), Query Frame = 0
Query: 5 ITLALYFILPFFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDG 64
I L L +L L S+ + +Y+ AL SI+FFEGQRSGKLP R+ WR +S L+DG
Sbjct: 32 IRLRLLVVLHLLLLVPSSAMAFNYADALAKSIIFFEGQRSGKLPPGNRMPWRADSGLTDG 91
Query: 65 SSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDYL 124
+ Y+VDLVGGYYDAGDNVKFGLPMAF+TT+LAWSV++FG MG E+ NAR AVRWG+DYL
Sbjct: 92 AQYNVDLVGGYYDAGDNVKFGLPMAFSTTMLAWSVLDFGKFMGAELPNARAAVRWGADYL 151
Query: 125 LKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAAA 184
LKAAT P LYVQV DPN DH+CWERPEDMDTPR+VY++TA PGSDVA ETAAALAA+
Sbjct: 152 LKAATATPGALYVQVADPNQDHRCWERPEDMDTPRSVYRVTADKPGSDVAGETAAALAAS 211
Query: 185 SIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGAS 244
S+VF +DP+YS +LL AA +VFDFAD+HRG+YSDSL S VCPFYCSYSGY+DELLWGAS
Sbjct: 212 SMVFRRADPAYSARLLHAATQVFDFADRHRGSYSDSLASSVCPFYCSYSGYHDELLWGAS 271
Query: 245 WIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIYK 304
W+++AS+N+ ++Y+++NG LGA DDDY+FSWDDKR GTK+LL++ FL ++YK
Sbjct: 272 WLHRASRNASFMSYVEANGMQLGAGDDDYSFSWDDKRVGTKVLLAKGFLRNRLHGLELYK 331
Query: 305 AHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAIR 364
AHSD+YICSL+PGT+S +YTPGGL ++ SN+QYVT+A FL++ YAKYL S+G
Sbjct: 332 AHSDSYICSLVPGTASFQSRYTPGGLLYREGSSNMQYVTTATFLMLAYAKYLRSSGATAS 391
Query: 365 CG------TSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSL 424
CG +S EL+A AK+QVDYILG+NP MSYMVGFG RYP+ HHRG+S+PS+
Sbjct: 392 CGDGGGGARGEVSAAELVAVAKRQVDYILGKNPAGMSYMVGFGCRYPRRAHHRGASMPSV 451
Query: 425 HSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPF 480
+ P ++SC+ GF +L+S PNPNVL GA+VGGPD+ D F+DDR N+ QSEPATYINAP
Sbjct: 452 RAHPGRISCDAGFGYLHSGEPNPNVLVGAVVGGPDSRDAFADDRGNFAQSEPATYINAPL 511
BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match:
Q6Z715 (Endoglucanase 4 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU14 PE=2 SV=1)
HSP 1 Score: 624.4 bits (1609), Expect = 2.1e-177
Identity = 306/489 (62.58%), Postives = 372/489 (76.07%), Query Frame = 0
Query: 9 LYFILPFFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDGSSYH 68
L ++ L+ + +Y+ AL +ILFFE QRSGKLP QR+ WR +S LSDGS+
Sbjct: 6 LLLVVAAVCLAGREAAAFNYADALDKAILFFEAQRSGKLPPGQRVAWRADSGLSDGSADG 65
Query: 69 VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG----------------NEIEN 128
VDL GGYYDAGDNVKFGLPMAFT T+L+WSVIEFGD M +++N
Sbjct: 66 VDLAGGYYDAGDNVKFGLPMAFTVTMLSWSVIEFGDMMPARRSSFLGGIFGGGGVAQLDN 125
Query: 129 AREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSD 188
AR AVRWG+DYLLKAAT P+ LYVQV DP DH+CWERPEDMDTPR+VYK+T Q+PGSD
Sbjct: 126 ARAAVRWGADYLLKAATATPDTLYVQVADPYQDHRCWERPEDMDTPRSVYKVTPQSPGSD 185
Query: 189 VASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSY 248
VA ETAAALAAASIVF SDPSYS KLLDAA VFDFADK+RG+YSDSL SVVCPFYCS+
Sbjct: 186 VAGETAAALAAASIVFRVSDPSYSAKLLDAAQLVFDFADKYRGSYSDSLSSVVCPFYCSH 245
Query: 249 SGYNDELLWGASWIYKAS--KNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQ 308
S Y+DELLW ASW++ AS K ++L+YI SNGH LGA+ DD+TFSWDDKR TK
Sbjct: 246 S-YHDELLWAASWLHLASPEKKDVYLSYIGSNGHALGAEQDDFTFSWDDKRVATK----- 305
Query: 309 DFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLV 368
FL ++ Q+YKAH+DNYICSL+PG + QYTPGGL FK +SN+QYVTS AFLL+
Sbjct: 306 GFLQSRADGLQLYKAHTDNYICSLVPGANGFQSQYTPGGLLFKEGDSNMQYVTSTAFLLL 365
Query: 369 TYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHH 428
TYAKYLSS+ + CG++ +SP LI+ AKKQVDYILG NP MSYMVGFG RYP+H+HH
Sbjct: 366 TYAKYLSSSAATVSCGSTAVSPSTLISLAKKQVDYILGANPAGMSYMVGFGARYPRHVHH 425
Query: 429 RGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEP 480
RG+S+PS+ P ++ C+EGF++L+S P+ N+LAGA+VGGPD GD F+D R+NY Q+EP
Sbjct: 426 RGASMPSVRDHPARIGCDEGFRYLHSPEPDRNLLAGAVVGGPDAGDAFADGRDNYAQAEP 485
BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match:
Q9SRX3 (Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1)
HSP 1 Score: 610.1 bits (1572), Expect = 4.1e-173
Identity = 298/492 (60.57%), Postives = 368/492 (74.80%), Query Frame = 0
Query: 3 YSITLALYFILPFFTLSSSAFTS---------------QHYSTALQYSILFFEGQRSGKL 62
Y + L L F L S+ F+S +Y AL SILFFEGQRSGKL
Sbjct: 4 YLSSSRLITFLSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKL 63
Query: 63 PSNQRLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG 122
P NQR+TWR NS LSDGS+ +VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG M
Sbjct: 64 PPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMK 123
Query: 123 NEIENAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQ 182
+E+ NA++A+RW +D+LLK AT P+ +YVQVGDPN+DH CWERPEDMDTPR+V+K+
Sbjct: 124 SELPNAKDAIRWATDFLLK-ATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKN 183
Query: 183 NPGSDVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCP 242
NPGSD+A E AAALAAASIVF DPSYSN LL A+ VF FADK+RG YS L VCP
Sbjct: 184 NPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCP 243
Query: 243 FYCSYSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKIL 302
FYCSYSGY DELLWGA+W+ KA+ N +LNYI++NG ILGAD+ D FSWD+K G +IL
Sbjct: 244 FYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARIL 303
Query: 303 LSQDFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAF 362
LS++FL+Q + + YK H+D++ICS++PG SSS QYTPGGL FK ESN+QYVTS +F
Sbjct: 304 LSKEFLIQKVKSLEEYKEHADSFICSVLPGASSS--QYTPGGLLFKMGESNMQYVTSTSF 363
Query: 363 LLVTYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQH 422
LL+TYAKYL+S CG S ++P L + AKKQVDY+LG NP KMSYMVG+G +YP+
Sbjct: 364 LLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRR 423
Query: 423 IHHRGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQ 480
IHHRGSS+PS+ P ++ C++GF S SPNPN L GA+VGGPD D+F D+R++Y +
Sbjct: 424 IHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGR 483
BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match:
O81416 (Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1)
HSP 1 Score: 604.7 bits (1558), Expect = 1.7e-171
Identity = 295/488 (60.45%), Postives = 370/488 (75.82%), Query Frame = 0
Query: 4 SITLALYFILP---FFTLSSSAFTSQH---------YSTALQYSILFFEGQRSGKLPSNQ 63
+I L+ +F L + +SS F + H Y AL SILFFEGQRSGKLPSNQ
Sbjct: 17 TIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQ 76
Query: 64 RLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIE 123
R++WR +S LSDGS+ HVDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG M +E++
Sbjct: 77 RMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQ 136
Query: 124 NAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGS 183
NA+ A+RW +DYLLK AT P+ +YVQVGD N DH CWERPEDMDT R+V+K+ PGS
Sbjct: 137 NAKIAIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGS 196
Query: 184 DVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCS 243
DVA+ETAAALAAA+IVF SDPSYS LL A+ VF FADK+RG YS L VCPFYCS
Sbjct: 197 DVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCS 256
Query: 244 YSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQD 303
YSGY DELLWGA+W+ KA+KN +LNYI+ NG ILGA + D TF WD+K G +ILL++
Sbjct: 257 YSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKA 316
Query: 304 FLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVT 363
FLVQN + YK H+DN+ICS+IPG SS QYTPGGL FK +++N+QYVTS +FLL+T
Sbjct: 317 FLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLT 376
Query: 364 YAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHR 423
YAKYL+S + CG S +P L + AK+QVDY+LG+NP +MSYMVG+G ++P+ IHHR
Sbjct: 377 YAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHR 436
Query: 424 GSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPA 480
GSS+P + S P ++ C++GF + S SPNPN L GA+VGGPD D+F D+R++Y+QSEPA
Sbjct: 437 GSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPA 496
BLAST of CmaCh16G012330 vs. TAIR 10
Match:
AT5G42090.1 (Lung seven transmembrane receptor family protein )
HSP 1 Score: 708.4 bits (1827), Expect = 7.8e-204
Identity = 351/434 (80.88%), Postives = 400/434 (92.17%), Query Frame = 0
Query: 501 FSSIFLLLVFLLPISSFAEIRFTDIRNDNRPIIPFDVFGFSHGGRLELNVSHVSLSDTNP 560
FSSI +LL+ + I+S AEIR ++IR+D+RPIIP D FGF+H GRLEL+ S + LS++NP
Sbjct: 7 FSSILILLLISISIAS-AEIRKSEIRSDDRPIIPLDEFGFTHSGRLELDASKIWLSNSNP 66
Query: 561 DLDLSKAGFFLCTRESWLHVIQQLEEAEISCALQSDLVKPVYTFNSLKGQD--RFDVLYS 620
DLDLSK GFFLCTR++W+HVIQQLEE EI+CALQSDLVK V+TFN+LKG D RF +++
Sbjct: 67 DLDLSKVGFFLCTRDAWVHVIQQLEEEEITCALQSDLVKHVFTFNNLKGGDKSRFSTVFT 126
Query: 621 ESDADQYTLVFANCLQQLKVSMDVRSAMYNLEGKSGRRDYLSAGKTILPRIYFIFSMIYF 680
E+DADQY+LVFANCLQQ+K+SMDVRSAMYNLEGK G RDYLSAG+T+LP++YF+FS+IYF
Sbjct: 127 ENDADQYSLVFANCLQQVKISMDVRSAMYNLEGKKGGRDYLSAGRTVLPKVYFLFSVIYF 186
Query: 681 LLAIVWIHVLYKKRLTVYGIHFFMLAVVILKALNLICEAEDKSYIKRTGSAHGWDILFYI 740
LA WI+VLYKKRLTV+ IHFFML VV+LKALNL+CEAEDKSYIK+TG+AHGWD+LFYI
Sbjct: 187 SLAATWIYVLYKKRLTVFAIHFFMLGVVVLKALNLLCEAEDKSYIKKTGTAHGWDVLFYI 246
Query: 741 FSFLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVANIAQVVTDETGPFEQ 800
F+FLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVAN AQVV DETGP+ Q
Sbjct: 247 FNFLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVANFAQVVIDETGPYGQ 306
Query: 801 EWVTWKQVFLLVDVICCCAVLFPIVWSIKNLREAARTDGKAAVNLMKLTLFRQYYIVVIC 860
+WVTWKQ+FLLVDV+CCCAVLFPIVWSIKNLREAA+TDGKAAVNL+KLTLFRQYYIVVIC
Sbjct: 307 DWVTWKQIFLLVDVVCCCAVLFPIVWSIKNLREAAKTDGKAAVNLVKLTLFRQYYIVVIC 366
Query: 861 YIYFTRVVVYALETITSYRYLWTSVVAGELATFAFYAFTGYKFKPEAHNPYFVVDDEEEE 920
YIYFTRVVVYALETITSY+Y+WTSVVA ELAT AFY FTGYKF+PE HNPYFVVDDEEEE
Sbjct: 367 YIYFTRVVVYALETITSYKYMWTSVVASELATLAFYLFTGYKFRPEVHNPYFVVDDEEEE 426
Query: 921 AAAEALKLEDEFEL 933
AAAEALKLEDEFEL
Sbjct: 427 AAAEALKLEDEFEL 439
BLAST of CmaCh16G012330 vs. TAIR 10
Match:
AT1G02800.1 (cellulase 2 )
HSP 1 Score: 610.1 bits (1572), Expect = 2.9e-174
Identity = 298/492 (60.57%), Postives = 368/492 (74.80%), Query Frame = 0
Query: 3 YSITLALYFILPFFTLSSSAFTS---------------QHYSTALQYSILFFEGQRSGKL 62
Y + L L F L S+ F+S +Y AL SILFFEGQRSGKL
Sbjct: 4 YLSSSRLITFLSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKL 63
Query: 63 PSNQRLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG 122
P NQR+TWR NS LSDGS+ +VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG M
Sbjct: 64 PPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMK 123
Query: 123 NEIENAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQ 182
+E+ NA++A+RW +D+LLK AT P+ +YVQVGDPN+DH CWERPEDMDTPR+V+K+
Sbjct: 124 SELPNAKDAIRWATDFLLK-ATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKN 183
Query: 183 NPGSDVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCP 242
NPGSD+A E AAALAAASIVF DPSYSN LL A+ VF FADK+RG YS L VCP
Sbjct: 184 NPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCP 243
Query: 243 FYCSYSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKIL 302
FYCSYSGY DELLWGA+W+ KA+ N +LNYI++NG ILGAD+ D FSWD+K G +IL
Sbjct: 244 FYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARIL 303
Query: 303 LSQDFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAF 362
LS++FL+Q + + YK H+D++ICS++PG SSS QYTPGGL FK ESN+QYVTS +F
Sbjct: 304 LSKEFLIQKVKSLEEYKEHADSFICSVLPGASSS--QYTPGGLLFKMGESNMQYVTSTSF 363
Query: 363 LLVTYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQH 422
LL+TYAKYL+S CG S ++P L + AKKQVDY+LG NP KMSYMVG+G +YP+
Sbjct: 364 LLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRR 423
Query: 423 IHHRGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQ 480
IHHRGSS+PS+ P ++ C++GF S SPNPN L GA+VGGPD D+F D+R++Y +
Sbjct: 424 IHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGR 483
BLAST of CmaCh16G012330 vs. TAIR 10
Match:
AT4G02290.1 (glycosyl hydrolase 9B13 )
HSP 1 Score: 604.7 bits (1558), Expect = 1.2e-172
Identity = 295/488 (60.45%), Postives = 370/488 (75.82%), Query Frame = 0
Query: 4 SITLALYFILP---FFTLSSSAFTSQH---------YSTALQYSILFFEGQRSGKLPSNQ 63
+I L+ +F L + +SS F + H Y AL SILFFEGQRSGKLPSNQ
Sbjct: 17 TIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQ 76
Query: 64 RLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIE 123
R++WR +S LSDGS+ HVDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG M +E++
Sbjct: 77 RMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQ 136
Query: 124 NAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGS 183
NA+ A+RW +DYLLK AT P+ +YVQVGD N DH CWERPEDMDT R+V+K+ PGS
Sbjct: 137 NAKIAIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGS 196
Query: 184 DVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCS 243
DVA+ETAAALAAA+IVF SDPSYS LL A+ VF FADK+RG YS L VCPFYCS
Sbjct: 197 DVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCS 256
Query: 244 YSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQD 303
YSGY DELLWGA+W+ KA+KN +LNYI+ NG ILGA + D TF WD+K G +ILL++
Sbjct: 257 YSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKA 316
Query: 304 FLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVT 363
FLVQN + YK H+DN+ICS+IPG SS QYTPGGL FK +++N+QYVTS +FLL+T
Sbjct: 317 FLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLT 376
Query: 364 YAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHR 423
YAKYL+S + CG S +P L + AK+QVDY+LG+NP +MSYMVG+G ++P+ IHHR
Sbjct: 377 YAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHR 436
Query: 424 GSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPA 480
GSS+P + S P ++ C++GF + S SPNPN L GA+VGGPD D+F D+R++Y+QSEPA
Sbjct: 437 GSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPA 496
BLAST of CmaCh16G012330 vs. TAIR 10
Match:
AT1G22880.1 (cellulase 5 )
HSP 1 Score: 562.8 bits (1449), Expect = 5.3e-160
Identity = 270/471 (57.32%), Postives = 343/471 (72.82%), Query Frame = 0
Query: 10 YFILPFFTLS-SSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDGSSYH 69
+F+ LS + + S +Y AL S+LFF+GQRSG+LPS+Q+L+WR +S LSDGSS H
Sbjct: 6 FFVFLLSALSLENTYASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSSSGLSDGSSAH 65
Query: 70 VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDYLLKAA 129
VDL GGYYDAGDNVKF PMAFTTT+L+WS +E+G MG E++N+R A+RW +DYLLK A
Sbjct: 66 VDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLKCA 125
Query: 130 TDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAAASIVF 189
P LYV VGDPN DHKCWERPEDMDTPRTVY ++ NPGSDVA+ETAAALAA+S+VF
Sbjct: 126 RATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSMVF 185
Query: 190 NTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGASWIYK 249
DP YS LL A KV FA ++RGAYS+SL S VCPFYCSYSGY DELLWGA+W+++
Sbjct: 186 RKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWLHR 245
Query: 250 ASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIYKAHSD 309
A+ + + N+I+S LG D FSWD+K G +LLS+ ++ F++YK ++
Sbjct: 246 ATNDPYYTNFIKS----LGGGDQPDIFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAE 305
Query: 310 NYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAIRCGTS 369
N++C ++P + SSS +YT GGL +K +SNLQYVTS FLL TYAKY+ S CG S
Sbjct: 306 NFMCKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNS 365
Query: 370 RISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSRPNQVSCN 429
I P LI +K+QVDY+LG NP KMSYMVGF +P+ IHHRGSS+PS R N + CN
Sbjct: 366 LIVPNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCN 425
Query: 430 EGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPFI 480
GFQ + +PNPN+L GAIVGGP+ D++ D R++Y +SEPATYINA F+
Sbjct: 426 GGFQSFRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAFV 472
BLAST of CmaCh16G012330 vs. TAIR 10
Match:
AT1G71380.1 (cellulase 3 )
HSP 1 Score: 560.5 bits (1443), Expect = 2.6e-159
Identity = 273/476 (57.35%), Postives = 342/476 (71.85%), Query Frame = 0
Query: 5 ITLALYFILPFFT-LSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSD 64
+T +F+L F + L S+ + +Y AL S+LFF+GQRSG LP Q+++WR +S LSD
Sbjct: 1 MTSLFFFVLLFSSLLISNGDANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSD 60
Query: 65 GSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDY 124
GS+ HVDL GGYYDAGDNVKF LPMAFTTT+L+WS +E+G MG E+ENAR +RW +DY
Sbjct: 61 GSAAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWSALEYGKRMGPELENARVNIRWATDY 120
Query: 125 LLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAA 184
LLK A P LYV VGDPN+DHKCWERPEDMDTPRTVY ++A NPGSDVA+ETAAALAA
Sbjct: 121 LLKCARATPGKLYVGVGDPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAA 180
Query: 185 ASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGA 244
AS+VF D YS LL A V FA +++GAYSDSL S VCPFYCSYSGY DEL+WGA
Sbjct: 181 ASMVFRKVDSKYSRLLLATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGA 240
Query: 245 SWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIY 304
SW+ +A+ N + N+I+S LG D FSWD+K G +LLS+ L+ F+ Y
Sbjct: 241 SWLLRATNNPYYANFIKS----LGGGDQPDIFSWDNKYAGAYVLLSRRALLNKDSNFEQY 300
Query: 305 KAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAI 364
K ++N+IC ++P + SSS QYT GGL +K +SNLQYVTS FLL TYAKY+ +
Sbjct: 301 KQAAENFICKILPDSPSSSTQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYMKATKHTF 360
Query: 365 RCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSRPN 424
CG+S I P LI+ +K+QVDYILG+NP KMSYMVGF +P+ IHHR SS+PS R
Sbjct: 361 NCGSSVIVPNALISLSKRQVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQ 420
Query: 425 QVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPFI 480
+ CN GFQ Y+ +PNPN+L GAIVGGP+ D + D R++Y +EPATYINA F+
Sbjct: 421 SLGCNGGFQSFYTQNPNPNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFV 472
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
P05522 | 1.2e-209 | 72.14 | Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1 | [more] |
Q6YXT7 | 8.7e-184 | 62.99 | Endoglucanase 19 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0114200 PE=2 S... | [more] |
Q6Z715 | 2.1e-177 | 62.58 | Endoglucanase 4 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU14 PE=2 SV=1 | [more] |
Q9SRX3 | 4.1e-173 | 60.57 | Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1 | [more] |
O81416 | 1.7e-171 | 60.45 | Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1 | [more] |