Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATCTCGCGCGAGGAGGCAATGGCGAACGACGAAGAGAGAGTTCTGGAGCACCAGCTAGAGGTTCAATTGCATGAGCAGAGAGAATCTCTCGCCGCCTTGCAAGATGCCTTAGCCTCCGATGCCTCCAATCCGGAGCTTCTCGAGGTACGTTCTTTTGTCCAATATGAATTTCCGTTTGGCTTTGAGAATATTCATGAGAGGAATGCGTGATCGCCTCTTATTGAATTTGATTATTATTTTTGAAGTTTTGAAGACCTCTGCTTGTTGATTTATTTGAATATAATCTGTTGATGTCAAGGTTCATGATGAGCTTGTCCAAGCAATTAAAGATGCCGAGGAAGGGCTGCTTCACCTTAAGCGTTCTAGATTACTAAGAGAAGCAGATTTGGTGTTGTGTGGTCGTGATAGTAACGCAGCGGAGGATGTTAAGGTGGAGCCTCTTCATTCTACGGACGTCGAACCTGAATCACCAGAGGATCAGAGTTTCGTCGTTGGATCGAAATGCAGATTTCGGCACACTGATGGACGTTGGTATGACGGTGAAATTGTTGGATTGGATGGTTCTAATTCTGCGAAAATTTCTTTCCTCACTCCTACAACTGAAAATATGTTGGTAAGCAATCTGGGACTCTATTCGTCGGCTTTTGCGAAACTGTTAGTTCTAATTTCAAACCACTCCTCATTTTGTCTTTTGGATTGTATTTATGCACATATTATTTTGACTTGAATTTACCGTTTTCTGTCTGATTGTGGCTGTCTCGGTCTTCTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGGGAATTCGATTAGCTTCTTCCCTTTAATAGTACCGAGGATCTGTTTGTGCCTACTTACTAGATGAGTTTTTAAGTGTTGGCAGTGACTTGGGTTTTGATTCACCTAACTAAACTATTGAGCAATCACTATGGTTTTTTCTCACATCATGATACTAATGCCATAACACCACCAAAGTGCAGATATGCAAGTTCTTCTTACAGCAAAGGTGTCGGTTTGGCACTAGCTGCCGCTTATCGCATGGTATGTCATATTCTTTCACGTGGACCATGTAGATCCAACTACTCATTAAGTAAAGCCTTCCCCTCGTTTCAATGGCTGTTCCCACCAGCAATACTATCTTGAGATGCTAGAATGGATTATAAACTCTTTAATAAAGGAACCTGCAGAATTAAAATAATGGTGAAGTTCCTGCATAGGTCTAGGACTAAAAATGTTTTCCTTCATAAAGAGTTCAGTATGAAAAAAATGCTGGTGCTGTTCGGTTTTCTTGTTGATGAATCTTCAATCATTTGTATTCCTAATTAGAAGAATGTGAGTAAAACTTAGATCTTCTACAACTCGTTACTTCTTATTCCCCGTTGAGACTAGAAACAGTTTTGTTTCTATTTAAAGAAGAATGCAAGAGGTAATGGGGGACTAGTTTACCAATCCTAGAACTAAAGAAAATGGAGATTACACTGAAGAAAAACTGTCAAGGAACTGTGAATTAAGCTATTTCACCTAAGAGGAGAGCAAAGAGAAACTGTCTTGAATTTCTCTTTCTAATATCTTGGAAAATTCAATTGCCACATTTCATGCATTGGGCAGATGTTTGTGAATATCGCATTTCTAGAACTCCCCTTCATTTCAGCTGTCTCTATGTGATACATTCTAAGACTGCCTTATGTAATGCAAGGTAGTCTTGCCTTTCAACATTTTTGGAAAATACATGTTGCATTGCATTTTACCTCCATTCCAATAGTACCACATGTATTTCTTTCCCCTTGATTTAATGTTTCCATTTTATTTAAAATGAACTTTCCCTAGTTTTTTATTCACCTTTTTAGAGACTGGTCTTTTTCCTTCCATGTAAAAGAACAGGCAATAGCTAAGCTGCTGTTCTCAACTATATTTGTATACTTCTGTATATCATTAAAGTCAATAGCTTTGCTGCTATTCTCCATTGGGTTTGTCGACCAATTGGGAGTTTTGAGGAGGAAAGAATTTTTACATTTATTTCTTTGATTAGGTTTTTGTCCTTCTGTATAAAGTATATCTTAACAGGCAATAGCTATGCTTTTATTCTCAATTATCTGAGAGAGTTTTTCATGCACAGGAGTTGATATCCCTTTAACCTCTCTTAGGAGATATGCGCCAACAATTTGGAATCAGTCACTGACAGGGTCCAGTATCTGGGCTCTCTCGTCCAGGAATGGCATTTGGAGGCATGCTGAACTTGAATCTTGGGATGATGCACTACAAATTGCACAAGTTGTTTTTAAAGGTGATGGATCCTCTCAAAAGCTTGGACCGGAGGACATAGCGTTATCTGTGCGTGCTCAAATTAGTGATGGAGAAGAAAGTGATTCCAGCTTGGAAAAGTCTGACTCAAGTGATTATGAAGATGATGATTTGCAGGGTTTGGGATTCCTCGAAAGCTCTACACAGCAGAGGGGCATTCAGATGGAAACCACCATATTTGCAAAATGGGAGAACCATACCCGGGGAATCGCCTCCAAGATGATGGCTAATATGGGCTATCGAGAAGGAATGGGTTTGGGTGCATCTGGGCAGGGGATGCTAAATCCTATCCCTGTCAAAGTTCTTCCAGCAAAACAATCTCTTGATCATGCTCTAGAATCACAAAAGGAGAATAATACTAACGACGAGAATAATGGCAAGAAACGAAGTAGAGGCGGTAAGAGGAAACGTGATAAGAAGTTTGCTGCAGCAATGCAGGCAGCTAAAGAGGAAGAAGACTCAAGACCTGATGTCTTTAATCTTATCAACAACCACCTTGCAATGCATAATGGAGCACCGAATGATGGATCTGTTAAGAAACAGAAAGATAAAGGTTCAGCAGATGGAAAGAAGGTAGATAGACGAACTCTAATCGCGTACGATGGTGAGGTGAAAGACCTGCGAGTACGAATAGAGAAGCTTGAAGAAATGGTGAACAGAAATAAGAATGAGAAGGTTGTTTTCGAGGCTGCCTTAAGAAAGCTGAACGAGACTCGGAAAGCTTTGGCCGAGGCCGAGGCAGCTCATGCGTCTGCATCAAATGCAGTTACCAGCAGAGAAAAGGAAAAAAAATGGTTGAAGTTTTAGGGCTTGACTTGATGTTATTGTATAATGATAGCTCCACCTGTAGAAGATTCATGTCAATCTCTGAAACTGAACCCCACGTGAATTATTGAAGGAAATCGTGTGATGATTTGTGTTTCATGGTGGCTGGCCGTTGGTGTATATTCATCCTAATATGAAGTCCAAATCTTAATGTAGGCGACAGGCTGATTCTTGTTCCTTTTTCTCAATTAGTACTTCTTATAGTTCGGTCGTTTTTGTTTCTTTTGTGATTTGCATACTTTTCCCTAATAATGTATAAATTACTTTTTGGGATAGGATGAGCATTCTAGATGGAATTGGTTAAATATGTCATTTCCTACTCGTGCAAATTAGAATTTCATTCAATTTTATCGGTTTGATCCTGTTTGAAAATTTTTGCCTTTTTCACAATCGCACTTTTCTTACCACGAACTCTGCGATTGTGTGGCACTCGCTGTTTTTCAATAGCAAGTGAGTCAAGTCGATTTGTATTTGCGTTGCAATGCTCAAGTCTCTGACTCCTATTTGAAAAGAAGGTTTAGAAAACAGGGGCAGAGAAAGATTGAGTGAGAGAATTTGAAAACAAGGTTATATGACATACAAGCAAAAGGTAATACAACGGCTTAAGTAAAATTCATCCAATTCTTCACGTTCAAACAATTTAATTTTTTCTTCTCGCGTATTCCATTGGGTATAACTCCTCATCTTCATCCAATTAAAATTCTGAAATGTTTAAAAAATATTTTTAAAAAATTGATGTAATTTAATAAGTTGGTAAAATTAGTACATATCCCCACCACCGGTGAAAAAACATGTATTTGAATGTTGCTTTCGGGGAAACTATGAATTGAAGCTTCATCTGTGCAGGTAGGAGTTAAATGTGCTATTTTCGGTGTCCTCTAGTTTGAGAAAGAGAGCTGATATCCCGTTCAGGTACTCATTCCTCTCAATTTGACTCGATTACTTTCTGTTTCTTTTCCTGGGAATAGAAAATTTGGCTACCTATGAATGAGGTCATTGTACTCGGTAACTGGGACATCGTTTCGAGCTTTGTCCAATCTTTTGTTAACCCATTCTCTGGCCATCAAGTTGGGTACCATAGCAGACGTTTACACTTGCAACAATATCCTAAGTGGGTATTGGAAATGCAAAGAGTTTCGATCTGCAGACGTACTGTTCGACGAAATGCCGCTGAGAGACTCTGTATCTTGGAACACGATGATCGCGGGGTATATTAACTCTGGAAACTTGGAGAATTCATGGGAAGTTCTTAAATGCATGAGAAGATTTGGTTTTGATCAAGATGAGTACACCTTTGGAAGCATGCTGAAGGGCATTGCTTGTGCTGGTATGCTTGATTTGGGTCAGCAAATACATTCTATGATCATTAAGATGGGTTTTGCTGGAAATGTATATGCAGGGAGTGCTCTTCTGGATATGTATGCGAAATGTGAGAGACTTGAGGATGCATATTTGACATTCCTAAATATATCTAAGCAGAACACTGTTTCGTGGAATGCAATGATTGCTGGATACGCACAAACGGGTGATCGCGAGACCGCGTTTTTGTTGTTAGATTGTATGGAGCAAGAAGGTGAGAAGGTTGATGATGGCTCATTTGCTCCTCTTTTGCCTTTACTAGATGATGCTGAGTTTTGTAGATTGACAAGGCAAGTTCATGGAAAAGTCATAAAACATGGATTGGAGTCTGCTAATACAATGTGTAATGCTTTGATCACTTCTTATTCAGAATGTGGATCCCTTGTCGATGCCAAAAGGGTTTTCAATTGTTCGGCGGGCGTTCGAGATTTGGTGTCGTGGAACTCCCTGTTGGGTGCTTTTTTGGTGCATAATCAGGAAGATCTTGCTTTTAAACTCTTGATTGATATGCAAGAACATGGTTTTGAACCAGATTTGTACTCTTACACAAGCATTATCAGTGCTTGTTTCAACAAAGAGCTTAGCAATAATGGGAAATCCCTGCATGGGATGGTCATTAAAAGAGGATTAGAACAATCAGTGCCAATTTCAAATGCATTGATATCTATGTATCTTAAATCAGACGGTGGTTCGATGAAGGAAGCTTTATGTATATTCGAATCCTTGGAGATTAAGGATCGTGTGTCGTGGAACTCGATCTTGACGGGATTATCACAAATGGGGTCGAGCGAAGATGCTGTGAAGTCGTTTCTGCATATGAGATCTTTAGCAATGGATATTGATCGGTATTCGTTTTCTGCTGTGCTCAAATCATGCTCAGATTTGGCCACCTTTCAATTGGGACAACAATTTCATGTCTTGGCGCTGAAATATGGTATGGATTCCAATGAGTTTGTTTCAAGTTCATTAATCTTCATGTATTCAAAGTGTGGGATTATGGAAGATGCTAAAAGATCATTTGAAGGAGCTTCAAAAAGCTCTTCAATCACCTGGAATGCACTCATGTTTGGCTATGCACAACATGGGCAATGCCATGTTGCATTAGACCTCTTCTTTCTAATGGAAGAGAAGAAGGTGAAAATGGATCACATAACATTCGTTGCAGTTCTGACCGCTTGTAGCCATATCGGTTTAGTCGAACGGGGCTGCGAATTCTTACGATGTATGGAATCTGATTATGGGGTTCCTCCACGAATGGAGCATTATGCTTGTGCAGTTGATCTATATGGCCGTTCTGGGCGTCTTGATGAAGCCAAGGCCTTGATTGAGGCAATGCCATTCAAGCCGAACGCGATGGTGTGGAAGACGTTCTTGGGGGCATGTCGTTCTTGTGGGAACGTTGAGTTAGCTTGTCAGGTTGCAAGGCATCTACTAGAGATGGAGCCTGAAGAGCATTGCACTTATGTTCTTCTCTCAAACATGTATGGAGATCTAATGAGATGGGAGGAGAAGGCTCAGGTGAAGAGGTTAATGAAGGAAAGAGGAGTTAAGAAAACGCCTGGTTGGAGTTGGATTGAAGTTAAGAACAAGGTTCATGCTTTCATTGCTGAAGATCGTTCTCATCCCAGTTGCCAACAGATATACTTTTTGCTGGAAGTTCTTATGGAGGAAATCACAAGAATTGAAGCTGCTGCTGATGGTTTTGAGAGTTTTTTGGAGCAGGAAGAGCTAAGTTATGCATATGCATAA
mRNA sequence
ATCTCGCGCGAGGAGGCAATGGCGAACGACGAAGAGAGAGTTCTGGAGCACCAGCTAGAGGTTCAATTGCATGAGCAGAGAGAATCTCTCGCCGCCTTGCAAGATGCCTTAGCCTCCGATGCCTCCAATCCGGAGCTTCTCGAGGTTCATGATGAGCTTGTCCAAGCAATTAAAGATGCCGAGGAAGGGCTGCTTCACCTTAAGCGTTCTAGATTACTAAGAGAAGCAGATTTGGTGTTGTGTGGTCGTGATAGTAACGCAGCGGAGGATGTTAAGGTGGAGCCTCTTCATTCTACGGACGTCGAACCTGAATCACCAGAGGATCAGAGTTTCGTCGTTGGATCGAAATGCAGATTTCGGCACACTGATGGACGTTGGTATGACGGTGAAATTGTTGGATTGGATGGTTCTAATTCTGCGAAAATTTCTTTCCTCACTCCTACAACTGAAAATATGTTGATATGCAAGTTCTTCTTACAGCAAAGGTGTCGGTTTGGCACTAGCTGCCGCTTATCGCATGGAGTTGATATCCCTTTAACCTCTCTTAGGAGATATGCGCCAACAATTTGGAATCAGTCACTGACAGGGTCCAGTATCTGGGCTCTCTCGTCCAGGAATGGCATTTGGAGGCATGCTGAACTTGAATCTTGGGATGATGCACTACAAATTGCACAAGTTGTTTTTAAAGGTGATGGATCCTCTCAAAAGCTTGGACCGGAGGACATAGCGTTATCTGTGCGTGCTCAAATTAGTGATGGAGAAGAAAGTGATTCCAGCTTGGAAAAGTCTGACTCAAGTGATTATGAAGATGATGATTTGCAGGGTTTGGGATTCCTCGAAAGCTCTACACAGCAGAGGGGCATTCAGATGGAAACCACCATATTTGCAAAATGGGAGAACCATACCCGGGGAATCGCCTCCAAGATGATGGCTAATATGGGCTATCGAGAAGGAATGGGTTTGGGTGCATCTGGGCAGGGGATGCTAAATCCTATCCCTGTCAAAGTTCTTCCAGCAAAACAATCTCTTGATCATGCTCTAGAATCACAAAAGGAGAATAATACTAACGACGAGAATAATGGCAAGAAACGAAGTAGAGGCGGTAAGAGGAAACGTGATAAGAAGTTTGCTGCAGCAATGCAGGCAGCTAAAGAGGAAGAAGACTCAAGACCTGATGTCTTTAATCTTATCAACAACCACCTTGCAATGCATAATGGAGCACCGAATGATGGATCTGTTAAGAAACAGAAAGATAAAGGTTCAGCAGATGGAAAGAAGGTAGATAGACGAACTCTAATCGCGTACGATGGTGAGGTGAAAGACCTGCGAGTACGAATAGAGAAGCTTGAAGAAATGGTGAACAGAAATAAGAATGAGAAGGTTGTTTTCGAGGCTGCCTTAAGAAAGCTGAACGAGACTCGGAAAGCTTTGGCCGAGGCCGAGGCAGCTCATGCGTCTGCATCAAATGCAGTTACCAGCAGAGAAAAGGAAAAAAAATGGTTGAAGTCATTGTACTCGGTAACTGGGACATCGTTTCGAGCTTTGTCCAATCTTTTGTTAACCCATTCTCTGGCCATCAAGTTGGGTACCATAGCAGACGTTTACACTTGCAACAATATCCTAAGTGGGTATTGGAAATGCAAAGAGTTTCGATCTGCAGACGTACTGTTCGACGAAATGCCGCTGAGAGACTCTGTATCTTGGAACACGATGATCGCGGGGTATATTAACTCTGGAAACTTGGAGAATTCATGGGAAGTTCTTAAATGCATGAGAAGATTTGGTTTTGATCAAGATGAGTACACCTTTGGAAGCATGCTGAAGGGCATTGCTTGTGCTGGTATGCTTGATTTGGGTCAGCAAATACATTCTATGATCATTAAGATGGGTTTTGCTGGAAATGTATATGCAGGGAGTGCTCTTCTGGATATGTATGCGAAATGTGAGAGACTTGAGGATGCATATTTGACATTCCTAAATATATCTAAGCAGAACACTGTTTCGTGGAATGCAATGATTGCTGGATACGCACAAACGGGTGATCGCGAGACCGCGTTTTTGTTGTTAGATTGTATGGAGCAAGAAGGTGAGAAGGTTGATGATGGCTCATTTGCTCCTCTTTTGCCTTTACTAGATGATGCTGAGTTTTGTAGATTGACAAGGCAAGTTCATGGAAAAGTCATAAAACATGGATTGGAGTCTGCTAATACAATGTGTAATGCTTTGATCACTTCTTATTCAGAATGTGGATCCCTTGTCGATGCCAAAAGGGTTTTCAATTGTTCGGCGGGCGTTCGAGATTTGGTGTCGTGGAACTCCCTGTTGGGTGCTTTTTTGGTGCATAATCAGGAAGATCTTGCTTTTAAACTCTTGATTGATATGCAAGAACATGGTTTTGAACCAGATTTGTACTCTTACACAAGCATTATCAGTGCTTGTTTCAACAAAGAGCTTAGCAATAATGGGAAATCCCTGCATGGGATGGTCATTAAAAGAGGATTAGAACAATCAGTGCCAATTTCAAATGCATTGATATCTATGTATCTTAAATCAGACGGTGGTTCGATGAAGGAAGCTTTATGTATATTCGAATCCTTGGAGATTAAGGATCGTGTGTCGTGGAACTCGATCTTGACGGGATTATCACAAATGGGGTCGAGCGAAGATGCTGTGAAGTCGTTTCTGCATATGAGATCTTTAGCAATGGATATTGATCGGTATTCGTTTTCTGCTGTGCTCAAATCATGCTCAGATTTGGCCACCTTTCAATTGGGACAACAATTTCATGTCTTGGCGCTGAAATATGGTATGGATTCCAATGAGTTTGTTTCAAGTTCATTAATCTTCATGTATTCAAAGTGTGGGATTATGGAAGATGCTAAAAGATCATTTGAAGGAGCTTCAAAAAGCTCTTCAATCACCTGGAATGCACTCATGTTTGGCTATGCACAACATGGGCAATGCCATGTTGCATTAGACCTCTTCTTTCTAATGGAAGAGAAGAAGGTGAAAATGGATCACATAACATTCGTTGCAGTTCTGACCGCTTGTAGCCATATCGGTTTAGTCGAACGGGGCTGCGAATTCTTACGATGTATGGAATCTGATTATGGGGTTCCTCCACGAATGGAGCATTATGCTTGTGCAGTTGATCTATATGGCCGTTCTGGGCGTCTTGATGAAGCCAAGGCCTTGATTGAGGCAATGCCATTCAAGCCGAACGCGATGGTGTGGAAGACGTTCTTGGGGGCATGTCGTTCTTGTGGGAACGTTGAGTTAGCTTGTCAGGTTGCAAGGCATCTACTAGAGATGGAGCCTGAAGAGCATTGCACTTATGTTCTTCTCTCAAACATGTATGGAGATCTAATGAGATGGGAGGAGAAGGCTCAGGTGAAGAGGTTAATGAAGGAAAGAGGAGTTAAGAAAACGCCTGGTTGGAGTTGGATTGAAGTTAAGAACAAGGTTCATGCTTTCATTGCTGAAGATCGTTCTCATCCCAGTTGCCAACAGATATACTTTTTGCTGGAAGTTCTTATGGAGGAAATCACAAGAATTGAAGCTGCTGCTGATGGTTTTGAGAGTTTTTTGGAGCAGGAAGAGCTAAGTTATGCATATGCATAA
Coding sequence (CDS)
ATGGCGAACGACGAAGAGAGAGTTCTGGAGCACCAGCTAGAGGTTCAATTGCATGAGCAGAGAGAATCTCTCGCCGCCTTGCAAGATGCCTTAGCCTCCGATGCCTCCAATCCGGAGCTTCTCGAGGTTCATGATGAGCTTGTCCAAGCAATTAAAGATGCCGAGGAAGGGCTGCTTCACCTTAAGCGTTCTAGATTACTAAGAGAAGCAGATTTGGTGTTGTGTGGTCGTGATAGTAACGCAGCGGAGGATGTTAAGGTGGAGCCTCTTCATTCTACGGACGTCGAACCTGAATCACCAGAGGATCAGAGTTTCGTCGTTGGATCGAAATGCAGATTTCGGCACACTGATGGACGTTGGTATGACGGTGAAATTGTTGGATTGGATGGTTCTAATTCTGCGAAAATTTCTTTCCTCACTCCTACAACTGAAAATATGTTGATATGCAAGTTCTTCTTACAGCAAAGGTGTCGGTTTGGCACTAGCTGCCGCTTATCGCATGGAGTTGATATCCCTTTAACCTCTCTTAGGAGATATGCGCCAACAATTTGGAATCAGTCACTGACAGGGTCCAGTATCTGGGCTCTCTCGTCCAGGAATGGCATTTGGAGGCATGCTGAACTTGAATCTTGGGATGATGCACTACAAATTGCACAAGTTGTTTTTAAAGGTGATGGATCCTCTCAAAAGCTTGGACCGGAGGACATAGCGTTATCTGTGCGTGCTCAAATTAGTGATGGAGAAGAAAGTGATTCCAGCTTGGAAAAGTCTGACTCAAGTGATTATGAAGATGATGATTTGCAGGGTTTGGGATTCCTCGAAAGCTCTACACAGCAGAGGGGCATTCAGATGGAAACCACCATATTTGCAAAATGGGAGAACCATACCCGGGGAATCGCCTCCAAGATGATGGCTAATATGGGCTATCGAGAAGGAATGGGTTTGGGTGCATCTGGGCAGGGGATGCTAAATCCTATCCCTGTCAAAGTTCTTCCAGCAAAACAATCTCTTGATCATGCTCTAGAATCACAAAAGGAGAATAATACTAACGACGAGAATAATGGCAAGAAACGAAGTAGAGGCGGTAAGAGGAAACGTGATAAGAAGTTTGCTGCAGCAATGCAGGCAGCTAAAGAGGAAGAAGACTCAAGACCTGATGTCTTTAATCTTATCAACAACCACCTTGCAATGCATAATGGAGCACCGAATGATGGATCTGTTAAGAAACAGAAAGATAAAGGTTCAGCAGATGGAAAGAAGGTAGATAGACGAACTCTAATCGCGTACGATGGTGAGGTGAAAGACCTGCGAGTACGAATAGAGAAGCTTGAAGAAATGGTGAACAGAAATAAGAATGAGAAGGTTGTTTTCGAGGCTGCCTTAAGAAAGCTGAACGAGACTCGGAAAGCTTTGGCCGAGGCCGAGGCAGCTCATGCGTCTGCATCAAATGCAGTTACCAGCAGAGAAAAGGAAAAAAAATGGTTGAAGTCATTGTACTCGGTAACTGGGACATCGTTTCGAGCTTTGTCCAATCTTTTGTTAACCCATTCTCTGGCCATCAAGTTGGGTACCATAGCAGACGTTTACACTTGCAACAATATCCTAAGTGGGTATTGGAAATGCAAAGAGTTTCGATCTGCAGACGTACTGTTCGACGAAATGCCGCTGAGAGACTCTGTATCTTGGAACACGATGATCGCGGGGTATATTAACTCTGGAAACTTGGAGAATTCATGGGAAGTTCTTAAATGCATGAGAAGATTTGGTTTTGATCAAGATGAGTACACCTTTGGAAGCATGCTGAAGGGCATTGCTTGTGCTGGTATGCTTGATTTGGGTCAGCAAATACATTCTATGATCATTAAGATGGGTTTTGCTGGAAATGTATATGCAGGGAGTGCTCTTCTGGATATGTATGCGAAATGTGAGAGACTTGAGGATGCATATTTGACATTCCTAAATATATCTAAGCAGAACACTGTTTCGTGGAATGCAATGATTGCTGGATACGCACAAACGGGTGATCGCGAGACCGCGTTTTTGTTGTTAGATTGTATGGAGCAAGAAGGTGAGAAGGTTGATGATGGCTCATTTGCTCCTCTTTTGCCTTTACTAGATGATGCTGAGTTTTGTAGATTGACAAGGCAAGTTCATGGAAAAGTCATAAAACATGGATTGGAGTCTGCTAATACAATGTGTAATGCTTTGATCACTTCTTATTCAGAATGTGGATCCCTTGTCGATGCCAAAAGGGTTTTCAATTGTTCGGCGGGCGTTCGAGATTTGGTGTCGTGGAACTCCCTGTTGGGTGCTTTTTTGGTGCATAATCAGGAAGATCTTGCTTTTAAACTCTTGATTGATATGCAAGAACATGGTTTTGAACCAGATTTGTACTCTTACACAAGCATTATCAGTGCTTGTTTCAACAAAGAGCTTAGCAATAATGGGAAATCCCTGCATGGGATGGTCATTAAAAGAGGATTAGAACAATCAGTGCCAATTTCAAATGCATTGATATCTATGTATCTTAAATCAGACGGTGGTTCGATGAAGGAAGCTTTATGTATATTCGAATCCTTGGAGATTAAGGATCGTGTGTCGTGGAACTCGATCTTGACGGGATTATCACAAATGGGGTCGAGCGAAGATGCTGTGAAGTCGTTTCTGCATATGAGATCTTTAGCAATGGATATTGATCGGTATTCGTTTTCTGCTGTGCTCAAATCATGCTCAGATTTGGCCACCTTTCAATTGGGACAACAATTTCATGTCTTGGCGCTGAAATATGGTATGGATTCCAATGAGTTTGTTTCAAGTTCATTAATCTTCATGTATTCAAAGTGTGGGATTATGGAAGATGCTAAAAGATCATTTGAAGGAGCTTCAAAAAGCTCTTCAATCACCTGGAATGCACTCATGTTTGGCTATGCACAACATGGGCAATGCCATGTTGCATTAGACCTCTTCTTTCTAATGGAAGAGAAGAAGGTGAAAATGGATCACATAACATTCGTTGCAGTTCTGACCGCTTGTAGCCATATCGGTTTAGTCGAACGGGGCTGCGAATTCTTACGATGTATGGAATCTGATTATGGGGTTCCTCCACGAATGGAGCATTATGCTTGTGCAGTTGATCTATATGGCCGTTCTGGGCGTCTTGATGAAGCCAAGGCCTTGATTGAGGCAATGCCATTCAAGCCGAACGCGATGGTGTGGAAGACGTTCTTGGGGGCATGTCGTTCTTGTGGGAACGTTGAGTTAGCTTGTCAGGTTGCAAGGCATCTACTAGAGATGGAGCCTGAAGAGCATTGCACTTATGTTCTTCTCTCAAACATGTATGGAGATCTAATGAGATGGGAGGAGAAGGCTCAGGTGAAGAGGTTAATGAAGGAAAGAGGAGTTAAGAAAACGCCTGGTTGGAGTTGGATTGAAGTTAAGAACAAGGTTCATGCTTTCATTGCTGAAGATCGTTCTCATCCCAGTTGCCAACAGATATACTTTTTGCTGGAAGTTCTTATGGAGGAAATCACAAGAATTGAAGCTGCTGCTGATGGTTTTGAGAGTTTTTTGGAGCAGGAAGAGCTAAGTTATGCATATGCATAA
Protein sequence
MANDEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLHLKRSRLLREADLVLCGRDSNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRWYDGEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYAPTIWNQSLTGSSIWALSSRNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALSVRAQISDGEESDSSLEKSDSSDYEDDDLQGLGFLESSTQQRGIQMETTIFAKWENHTRGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNTNDENNGKKRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHLAMHNGAPNDGSVKKQKDKGSADGKKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETRKALAEAEAAHASASNAVTSREKEKKWLKSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELSYAYA
Homology
BLAST of CmaCh05G003410 vs. ExPASy Swiss-Prot
Match:
Q9LU94 (Putative pentatricopeptide repeat-containing protein At3g25970 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E46 PE=3 SV=2)
HSP 1 Score: 796.6 bits (2056), Expect = 3.9e-229
Identity = 387/686 (56.41%), Postives = 505/686 (73.62%), Query Frame = 0
Query: 498 LYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLR 557
L S+ +S + L LTH AIK G+I+D+Y N IL Y K A++LFDEMP R
Sbjct: 5 LASLLESSLNSFQKLSLTHCYAIKCGSISDIYVSNRILDSYIKFGFLGYANMLFDEMPKR 64
Query: 558 DSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIH 617
DSVSWNTMI+GY + G LE++W + CM+R G D D Y+F +LKGIA DLG+Q+H
Sbjct: 65 DSVSWNTMISGYTSCGKLEDAWCLFTCMKRSGSDVDGYSFSRLLKGIASVKRFDLGEQVH 124
Query: 618 SMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRE 677
++IK G+ NVY GS+L+DMYAKCER+EDA+ F IS+ N+VSWNA+IAG+ Q D +
Sbjct: 125 GLVIKGGYECNVYVGSSLVDMYAKCERVEDAFEAFKEISEPNSVSWNALIAGFVQVRDIK 184
Query: 678 TAFLLLDCMEQEGE-KVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNAL 737
TAF LL ME + +D G+FAPLL LLDD FC L +QVH KV+K GL+ T+CNA+
Sbjct: 185 TAFWLLGLMEMKAAVTMDAGTFAPLLTLLDDPMFCNLLKQVHAKVLKLGLQHEITICNAM 244
Query: 738 ITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPD 797
I+SY++CGS+ DAKRVF+ G +DL+SWNS++ F H ++ AF+L I MQ H E D
Sbjct: 245 ISSYADCGSVSDAKRVFDGLGGSKDLISWNSMIAGFSKHELKESAFELFIQMQRHWVETD 304
Query: 798 LYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCI 857
+Y+YT ++SAC +E GKSLHGMVIK+GLEQ +NALISMY++ G+M++AL +
Sbjct: 305 IYTYTGLLSACSGEEHQIFGKSLHGMVIKKGLEQVTSATNALISMYIQFPTGTMEDALSL 364
Query: 858 FESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATF 917
FESL+ KD +SWNSI+TG +Q G SEDAVK F ++RS + +D Y+FSA+L+SCSDLAT
Sbjct: 365 FESLKSKDLISWNSIITGFAQKGLSEDAVKFFSYLRSSEIKVDDYAFSALLRSCSDLATL 424
Query: 918 QLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEG-ASKSSSITWNALMFG 977
QLGQQ H LA K G SNEFV SSLI MYSKCGI+E A++ F+ +SK S++ WNA++ G
Sbjct: 425 QLGQQIHALATKSGFVSNEFVISSLIVMYSKCGIIESARKCFQQISSKHSTVAWNAMILG 484
Query: 978 YAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPP 1037
YAQHG V+LDLF M + VK+DH+TF A+LTACSH GL++ G E L ME Y + P
Sbjct: 485 YAQHGLGQVSLDLFSQMCNQNVKLDHVTFTAILTACSHTGLIQEGLELLNLMEPVYKIQP 544
Query: 1038 RMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHL 1097
RMEHYA AVDL GR+G +++AK LIE+MP P+ MV KTFLG CR+CG +E+A QVA HL
Sbjct: 545 RMEHYAAAVDLLGRAGLVNKAKELIESMPLNPDPMVLKTFLGVCRACGEIEMATQVANHL 604
Query: 1098 LEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAE 1157
LE+EPE+H TYV LS+MY DL +WEEKA VK++MKERGVKK PGWSWIE++N+V AF AE
Sbjct: 605 LEIEPEDHFTYVSLSHMYSDLKKWEEKASVKKMMKERGVKKVPGWSWIEIRNQVKAFNAE 664
Query: 1158 DRSHPSCQQIYFLLEVLMEEITRIEA 1182
DRS+P CQ IY +++ L +E+ +++
Sbjct: 665 DRSNPLCQDIYMMIKDLTQEMQWLDS 690
BLAST of CmaCh05G003410 vs. ExPASy Swiss-Prot
Match:
Q9SK49 (Zinc finger CCCH domain-containing protein 22 OS=Arabidopsis thaliana OX=3702 GN=At2g24830 PE=2 SV=1)
HSP 1 Score: 536.6 bits (1381), Expect = 7.3e-151
Identity = 297/508 (58.46%), Postives = 377/508 (74.21%), Query Frame = 0
Query: 1 MANDEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLH 60
MA++E LE+ L++QL EQ+ESL+++ +AL SD SNPELL VH+EL+ AIK+ EEGLLH
Sbjct: 1 MASEENNDLENLLDIQLIEQKESLSSIDEALLSDPSNPELLSVHEELLSAIKEVEEGLLH 60
Query: 61 LKRSRLLREADLVLCGRDSNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRW 120
LKR+RLL EAD+VL G + D V+P H +EPE E++ + GSKCRFRHTDGRW
Sbjct: 61 LKRARLLEEADIVLNGLN----HDAGVKPEH---LEPEKTEEKKDLDGSKCRFRHTDGRW 120
Query: 121 YDGEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYA 180
Y+G I+G +GS+SAKISFLTPT+E+M+ICKFF+QQRCRFG+SCR SHG+D+P++SL+ Y
Sbjct: 121 YNGRIIGFEGSDSAKISFLTPTSESMMICKFFMQQRCRFGSSCRSSHGLDVPISSLKNYE 180
Query: 181 PTIWNQSLTGSSIWALS-SRNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALS 240
T W Q + GS IWA+S S+ IWR AELESWDD LQ+ VVF+ D SS KLG + +ALS
Sbjct: 181 QTEWKQLMVGSKIWAVSGSKYDIWRKAELESWDDELQVGGVVFRDDKSSAKLGSDSLALS 240
Query: 241 VRAQIS--DGEE--------SDSSLEKSDSSDYEDDDLQGLGFLESSTQQRGIQMETTIF 300
AQ++ DGEE S S E S SSDY++ QG+GFLES+ RG+Q +T +F
Sbjct: 241 EYAQMTDDDGEEEEEEDEQQSASDSEDSVSSDYDEGSPQGIGFLESTNLPRGVQTDTALF 300
Query: 301 AKWENHTRGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNT 360
AKWENHTRGIASKMMA+MGYREGMGLG SGQG+LNPI VKVLPAK+SLD+ALE +
Sbjct: 301 AKWENHTRGIASKMMASMGYREGMGLGVSGQGILNPILVKVLPAKRSLDYALEHIRNGEC 360
Query: 361 NDENNGKKRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHL-AMHNGAPNDGSVK 420
E KKRSRGGKRKR KKFA A +AAK+EE+S+PD+F+LIN + + + SVK
Sbjct: 361 KSEKQKKKRSRGGKRKRGKKFAEAAKAAKQEEESKPDLFSLINEQIFPTRHEKVHSESVK 420
Query: 421 KQKDKGSADGKKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETR 480
+++KG VDR+ L+ Y EV+DL++ + KLE+MVNRNK + VV EAA R+L E R
Sbjct: 421 NRQNKG-----PVDRKALVEYQDEVRDLKLEMLKLEQMVNRNKKDLVVSEAATRRLKEVR 480
Query: 481 KALAEAEAAHASASNAVTSREKEKKWLK 497
KALA A A+ASNA+ S+E EKKWLK
Sbjct: 481 KALASTLACQAAASNAIVSKENEKKWLK 496
BLAST of CmaCh05G003410 vs. ExPASy Swiss-Prot
Match:
Q6K687 (Zinc finger CCCH domain-containing protein 18 OS=Oryza sativa subsp. japonica OX=39947 GN=Os02g0793000 PE=2 SV=1)
HSP 1 Score: 527.3 bits (1357), Expect = 4.4e-148
Identity = 296/502 (58.96%), Postives = 372/502 (74.10%), Query Frame = 0
Query: 4 DEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLHLKR 63
DE +E QLE L EQR SL A+ +ALA+D SN +LLEVH+EL+ AIKDAEEGLLHLKR
Sbjct: 8 DEAASIELQLEHHLQEQRASLTAVDEALAADPSNADLLEVHEELLAAIKDAEEGLLHLKR 67
Query: 64 SRLLREADLVLCGRD-SNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRWYD 123
SRL+++ D + ++ ++ A +V V+P DVEPE E Q F VGSKCRFRH DGRWY+
Sbjct: 68 SRLVKQIDEIFPNQEPTSEAPEVAVDP--PDDVEPEPLEPQEFSVGSKCRFRHKDGRWYN 127
Query: 124 GEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYAPT 183
G ++GL+GS+ A+ISFLTPT+ENM +CKFFLQQRCRFG++CRLSHG+ IP+ SL+++ PT
Sbjct: 128 GCVIGLEGSSDARISFLTPTSENMSMCKFFLQQRCRFGSNCRLSHGIVIPILSLKQFTPT 187
Query: 184 IWNQSLTGSSIWALSS-RNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALSVR 243
W QSL GSSI A S +G+WR AELESWDD L++ QVVF+ DGSS +L + +++S
Sbjct: 188 RWQQSLVGSSILAASGHHSGLWRRAELESWDDDLKVGQVVFQDDGSSARLPSDSLSISEY 247
Query: 244 AQISD----GEESDSSLEKSDSSDYEDDDL-QGLGFLESSTQQRGIQMETTIFAKWENHT 303
A SD G SD + S+ D ED+ + QGLG LES G+Q ET IFAKWE+HT
Sbjct: 248 ADESDEDGEGSSSDEGSDFSEDGDQEDESVHQGLGLLESKNLS-GVQTETAIFAKWEHHT 307
Query: 304 RGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNTNDENNGK 363
RG+ASKMMA MGYREGMGLG SGQGML+PIPVKVLP KQSLDHA+ + + N++ GK
Sbjct: 308 RGVASKMMAKMGYREGMGLGVSGQGMLDPIPVKVLPPKQSLDHAVAASEVNDS--VGPGK 367
Query: 364 KRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHLAMHNGAPNDGSVKKQKDKGSA 423
KRSRGGKRKR+KKFA +AAK EE+ R VF+ IN+ L + A K+ G A
Sbjct: 368 KRSRGGKRKREKKFAEQARAAKAEEEER-SVFSFINSQLVGQDVAEGSAVKSKKDSSGEA 427
Query: 424 DG--KKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETRKALAEA 483
+G KK DRR+L+AYD EVK+LR R+EKLEEM+ RN+ +K +EAA +KL +TRKALA+A
Sbjct: 428 NGHAKKEDRRSLLAYDDEVKELRSRVEKLEEMMKRNRKDKAFYEAASKKLKQTRKALADA 487
Query: 484 EAAHASASNAVTSREKEKKWLK 497
EA HASA+NAV +EKEKKWLK
Sbjct: 488 EATHASATNAVARKEKEKKWLK 503
BLAST of CmaCh05G003410 vs. ExPASy Swiss-Prot
Match:
Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)
HSP 1 Score: 471.5 bits (1212), Expect = 2.9e-131
Identity = 239/671 (35.62%), Postives = 373/671 (55.59%), Query Frame = 0
Query: 516 HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
H L +KLG +D Y CN ++S Y+ SA+ +F M RD+V++NT+I G G
Sbjct: 311 HGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYG 370
Query: 576 ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
E + E+ K M G + D T S++ + G L GQQ+H+ K+GFA N AL
Sbjct: 371 EKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGAL 430
Query: 636 LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
L++YAKC +E A FL +N V WN M+ Y D +F + M+ E +
Sbjct: 431 LNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQ 490
Query: 696 GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
++ +L L Q+H ++IK + +C+ LI Y++ G L A +
Sbjct: 491 YTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIR 550
Query: 756 SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
AG +D+VSW +++ + +N +D A M + G D T+ +SAC +
Sbjct: 551 FAG-KDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKE 610
Query: 816 GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
G+ +H G +P NAL+++Y S G ++E+ FE E D ++WN++++G
Sbjct: 611 GQQIHAQACVSGFSSDLPFQNALVTLY--SRCGKIEESYLAFEQTEAGDNIAWNALVSGF 670
Query: 876 SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLGQQFHVLALKYGMDSNE 935
Q G++E+A++ F+ M +D + ++F + +K+ S+ A + G+Q H + K G DS
Sbjct: 671 QQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSET 730
Query: 936 FVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEEK 995
V ++LI MY+KCG + DA++ F S + ++WNA++ Y++HG ALD F M
Sbjct: 731 EVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHS 790
Query: 996 KVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLDE 1055
V+ +H+T V VL+ACSHIGLV++G + M S+YG+ P+ EHY C VD+ R+G L
Sbjct: 791 NVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSR 850
Query: 1056 AKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYGD 1115
AK I+ MP KP+A+VW+T L AC N+E+ A HLLE+EPE+ TYVLLSN+Y
Sbjct: 851 AKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAV 910
Query: 1116 LMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVLMEE 1175
+W+ + ++ MKE+GVKK PG SWIEVKN +H+F D++HP +I+ + L +
Sbjct: 911 SKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKR 970
Query: 1176 ITRIEAAADGF 1187
+ I D F
Sbjct: 971 ASEIGYVQDCF 978
BLAST of CmaCh05G003410 vs. ExPASy Swiss-Prot
Match:
Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)
HSP 1 Score: 450.7 bits (1158), Expect = 5.3e-125
Identity = 234/658 (35.56%), Postives = 384/658 (58.36%), Query Frame = 0
Query: 516 HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
H IK G + DV +++ Y K F+ +FDEM R+ V+W T+I+GY +
Sbjct: 116 HCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMN 175
Query: 576 ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
+ + M+ G + +TF + L +A G+ G Q+H++++K G + ++L
Sbjct: 176 DEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSL 235
Query: 636 LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
+++Y KC + A + F ++ V+WN+MI+GYA G A + M ++ +
Sbjct: 236 INLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSE 295
Query: 696 GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
SFA ++ L + + R T Q+H V+K+G + AL+ +YS+C +++DA R+F
Sbjct: 296 SSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKE 355
Query: 756 SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
V ++VSW +++ FL ++ ++ A L +M+ G P+ ++Y+ I++A +
Sbjct: 356 IGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTAL----PVIS 415
Query: 816 GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
+H V+K E+S + AL+ Y+K G ++EA +F ++ KD V+W+++L G
Sbjct: 416 PSEVHAQVVKTNYERSSTVGTALLDAYVKL--GKVEEAAKVFSGIDDKDIVAWSAMLAGY 475
Query: 876 SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDL-ATFQLGQQFHVLALKYGMDSN 935
+Q G +E A+K F + + + ++FS++L C+ A+ G+QFH A+K +DS+
Sbjct: 476 AQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSS 535
Query: 936 EFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEE 995
VSS+L+ MY+K G +E A+ F+ + ++WN+++ GYAQHGQ ALD+F M++
Sbjct: 536 LCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKK 595
Query: 996 KKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLD 1055
+KVKMD +TF+ V AC+H GLVE G ++ M D + P EH +C VDLY R+G+L+
Sbjct: 596 RKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLE 655
Query: 1056 EAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYG 1115
+A +IE MP + +W+T L ACR EL A ++ M+PE+ YVLLSNMY
Sbjct: 656 KAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYA 715
Query: 1116 DLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVL 1173
+ W+E+A+V++LM ER VKK PG+SWIEVKNK ++F+A DRSHP QIY LE L
Sbjct: 716 ESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDL 767
BLAST of CmaCh05G003410 vs. TAIR 10
Match:
AT3G25970.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 796.6 bits (2056), Expect = 2.8e-230
Identity = 387/686 (56.41%), Postives = 505/686 (73.62%), Query Frame = 0
Query: 498 LYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLR 557
L S+ +S + L LTH AIK G+I+D+Y N IL Y K A++LFDEMP R
Sbjct: 5 LASLLESSLNSFQKLSLTHCYAIKCGSISDIYVSNRILDSYIKFGFLGYANMLFDEMPKR 64
Query: 558 DSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIH 617
DSVSWNTMI+GY + G LE++W + CM+R G D D Y+F +LKGIA DLG+Q+H
Sbjct: 65 DSVSWNTMISGYTSCGKLEDAWCLFTCMKRSGSDVDGYSFSRLLKGIASVKRFDLGEQVH 124
Query: 618 SMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRE 677
++IK G+ NVY GS+L+DMYAKCER+EDA+ F IS+ N+VSWNA+IAG+ Q D +
Sbjct: 125 GLVIKGGYECNVYVGSSLVDMYAKCERVEDAFEAFKEISEPNSVSWNALIAGFVQVRDIK 184
Query: 678 TAFLLLDCMEQEGE-KVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNAL 737
TAF LL ME + +D G+FAPLL LLDD FC L +QVH KV+K GL+ T+CNA+
Sbjct: 185 TAFWLLGLMEMKAAVTMDAGTFAPLLTLLDDPMFCNLLKQVHAKVLKLGLQHEITICNAM 244
Query: 738 ITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPD 797
I+SY++CGS+ DAKRVF+ G +DL+SWNS++ F H ++ AF+L I MQ H E D
Sbjct: 245 ISSYADCGSVSDAKRVFDGLGGSKDLISWNSMIAGFSKHELKESAFELFIQMQRHWVETD 304
Query: 798 LYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCI 857
+Y+YT ++SAC +E GKSLHGMVIK+GLEQ +NALISMY++ G+M++AL +
Sbjct: 305 IYTYTGLLSACSGEEHQIFGKSLHGMVIKKGLEQVTSATNALISMYIQFPTGTMEDALSL 364
Query: 858 FESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATF 917
FESL+ KD +SWNSI+TG +Q G SEDAVK F ++RS + +D Y+FSA+L+SCSDLAT
Sbjct: 365 FESLKSKDLISWNSIITGFAQKGLSEDAVKFFSYLRSSEIKVDDYAFSALLRSCSDLATL 424
Query: 918 QLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEG-ASKSSSITWNALMFG 977
QLGQQ H LA K G SNEFV SSLI MYSKCGI+E A++ F+ +SK S++ WNA++ G
Sbjct: 425 QLGQQIHALATKSGFVSNEFVISSLIVMYSKCGIIESARKCFQQISSKHSTVAWNAMILG 484
Query: 978 YAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPP 1037
YAQHG V+LDLF M + VK+DH+TF A+LTACSH GL++ G E L ME Y + P
Sbjct: 485 YAQHGLGQVSLDLFSQMCNQNVKLDHVTFTAILTACSHTGLIQEGLELLNLMEPVYKIQP 544
Query: 1038 RMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHL 1097
RMEHYA AVDL GR+G +++AK LIE+MP P+ MV KTFLG CR+CG +E+A QVA HL
Sbjct: 545 RMEHYAAAVDLLGRAGLVNKAKELIESMPLNPDPMVLKTFLGVCRACGEIEMATQVANHL 604
Query: 1098 LEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAE 1157
LE+EPE+H TYV LS+MY DL +WEEKA VK++MKERGVKK PGWSWIE++N+V AF AE
Sbjct: 605 LEIEPEDHFTYVSLSHMYSDLKKWEEKASVKKMMKERGVKKVPGWSWIEIRNQVKAFNAE 664
Query: 1158 DRSHPSCQQIYFLLEVLMEEITRIEA 1182
DRS+P CQ IY +++ L +E+ +++
Sbjct: 665 DRSNPLCQDIYMMIKDLTQEMQWLDS 690
BLAST of CmaCh05G003410 vs. TAIR 10
Match:
AT2G24830.1 (zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein )
HSP 1 Score: 536.6 bits (1381), Expect = 5.2e-152
Identity = 297/508 (58.46%), Postives = 377/508 (74.21%), Query Frame = 0
Query: 1 MANDEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLH 60
MA++E LE+ L++QL EQ+ESL+++ +AL SD SNPELL VH+EL+ AIK+ EEGLLH
Sbjct: 1 MASEENNDLENLLDIQLIEQKESLSSIDEALLSDPSNPELLSVHEELLSAIKEVEEGLLH 60
Query: 61 LKRSRLLREADLVLCGRDSNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRW 120
LKR+RLL EAD+VL G + D V+P H +EPE E++ + GSKCRFRHTDGRW
Sbjct: 61 LKRARLLEEADIVLNGLN----HDAGVKPEH---LEPEKTEEKKDLDGSKCRFRHTDGRW 120
Query: 121 YDGEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYA 180
Y+G I+G +GS+SAKISFLTPT+E+M+ICKFF+QQRCRFG+SCR SHG+D+P++SL+ Y
Sbjct: 121 YNGRIIGFEGSDSAKISFLTPTSESMMICKFFMQQRCRFGSSCRSSHGLDVPISSLKNYE 180
Query: 181 PTIWNQSLTGSSIWALS-SRNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALS 240
T W Q + GS IWA+S S+ IWR AELESWDD LQ+ VVF+ D SS KLG + +ALS
Sbjct: 181 QTEWKQLMVGSKIWAVSGSKYDIWRKAELESWDDELQVGGVVFRDDKSSAKLGSDSLALS 240
Query: 241 VRAQIS--DGEE--------SDSSLEKSDSSDYEDDDLQGLGFLESSTQQRGIQMETTIF 300
AQ++ DGEE S S E S SSDY++ QG+GFLES+ RG+Q +T +F
Sbjct: 241 EYAQMTDDDGEEEEEEDEQQSASDSEDSVSSDYDEGSPQGIGFLESTNLPRGVQTDTALF 300
Query: 301 AKWENHTRGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNT 360
AKWENHTRGIASKMMA+MGYREGMGLG SGQG+LNPI VKVLPAK+SLD+ALE +
Sbjct: 301 AKWENHTRGIASKMMASMGYREGMGLGVSGQGILNPILVKVLPAKRSLDYALEHIRNGEC 360
Query: 361 NDENNGKKRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHL-AMHNGAPNDGSVK 420
E KKRSRGGKRKR KKFA A +AAK+EE+S+PD+F+LIN + + + SVK
Sbjct: 361 KSEKQKKKRSRGGKRKRGKKFAEAAKAAKQEEESKPDLFSLINEQIFPTRHEKVHSESVK 420
Query: 421 KQKDKGSADGKKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETR 480
+++KG VDR+ L+ Y EV+DL++ + KLE+MVNRNK + VV EAA R+L E R
Sbjct: 421 NRQNKG-----PVDRKALVEYQDEVRDLKLEMLKLEQMVNRNKKDLVVSEAATRRLKEVR 480
Query: 481 KALAEAEAAHASASNAVTSREKEKKWLK 497
KALA A A+ASNA+ S+E EKKWLK
Sbjct: 481 KALASTLACQAAASNAIVSKENEKKWLK 496
BLAST of CmaCh05G003410 vs. TAIR 10
Match:
AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 471.5 bits (1212), Expect = 2.1e-132
Identity = 239/671 (35.62%), Postives = 373/671 (55.59%), Query Frame = 0
Query: 516 HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
H L +KLG +D Y CN ++S Y+ SA+ +F M RD+V++NT+I G G
Sbjct: 311 HGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYG 370
Query: 576 ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
E + E+ K M G + D T S++ + G L GQQ+H+ K+GFA N AL
Sbjct: 371 EKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGAL 430
Query: 636 LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
L++YAKC +E A FL +N V WN M+ Y D +F + M+ E +
Sbjct: 431 LNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQ 490
Query: 696 GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
++ +L L Q+H ++IK + +C+ LI Y++ G L A +
Sbjct: 491 YTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIR 550
Query: 756 SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
AG +D+VSW +++ + +N +D A M + G D T+ +SAC +
Sbjct: 551 FAG-KDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKE 610
Query: 816 GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
G+ +H G +P NAL+++Y S G ++E+ FE E D ++WN++++G
Sbjct: 611 GQQIHAQACVSGFSSDLPFQNALVTLY--SRCGKIEESYLAFEQTEAGDNIAWNALVSGF 670
Query: 876 SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLGQQFHVLALKYGMDSNE 935
Q G++E+A++ F+ M +D + ++F + +K+ S+ A + G+Q H + K G DS
Sbjct: 671 QQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSET 730
Query: 936 FVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEEK 995
V ++LI MY+KCG + DA++ F S + ++WNA++ Y++HG ALD F M
Sbjct: 731 EVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHS 790
Query: 996 KVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLDE 1055
V+ +H+T V VL+ACSHIGLV++G + M S+YG+ P+ EHY C VD+ R+G L
Sbjct: 791 NVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSR 850
Query: 1056 AKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYGD 1115
AK I+ MP KP+A+VW+T L AC N+E+ A HLLE+EPE+ TYVLLSN+Y
Sbjct: 851 AKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAV 910
Query: 1116 LMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVLMEE 1175
+W+ + ++ MKE+GVKK PG SWIEVKN +H+F D++HP +I+ + L +
Sbjct: 911 SKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKR 970
Query: 1176 ITRIEAAADGF 1187
+ I D F
Sbjct: 971 ASEIGYVQDCF 978
BLAST of CmaCh05G003410 vs. TAIR 10
Match:
AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 450.7 bits (1158), Expect = 3.8e-126
Identity = 234/658 (35.56%), Postives = 384/658 (58.36%), Query Frame = 0
Query: 516 HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
H IK G + DV +++ Y K F+ +FDEM R+ V+W T+I+GY +
Sbjct: 116 HCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMN 175
Query: 576 ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
+ + M+ G + +TF + L +A G+ G Q+H++++K G + ++L
Sbjct: 176 DEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSL 235
Query: 636 LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
+++Y KC + A + F ++ V+WN+MI+GYA G A + M ++ +
Sbjct: 236 INLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSE 295
Query: 696 GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
SFA ++ L + + R T Q+H V+K+G + AL+ +YS+C +++DA R+F
Sbjct: 296 SSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKE 355
Query: 756 SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
V ++VSW +++ FL ++ ++ A L +M+ G P+ ++Y+ I++A +
Sbjct: 356 IGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTAL----PVIS 415
Query: 816 GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
+H V+K E+S + AL+ Y+K G ++EA +F ++ KD V+W+++L G
Sbjct: 416 PSEVHAQVVKTNYERSSTVGTALLDAYVKL--GKVEEAAKVFSGIDDKDIVAWSAMLAGY 475
Query: 876 SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDL-ATFQLGQQFHVLALKYGMDSN 935
+Q G +E A+K F + + + ++FS++L C+ A+ G+QFH A+K +DS+
Sbjct: 476 AQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSS 535
Query: 936 EFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEE 995
VSS+L+ MY+K G +E A+ F+ + ++WN+++ GYAQHGQ ALD+F M++
Sbjct: 536 LCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKK 595
Query: 996 KKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLD 1055
+KVKMD +TF+ V AC+H GLVE G ++ M D + P EH +C VDLY R+G+L+
Sbjct: 596 RKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLE 655
Query: 1056 EAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYG 1115
+A +IE MP + +W+T L ACR EL A ++ M+PE+ YVLLSNMY
Sbjct: 656 KAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYA 715
Query: 1116 DLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVL 1173
+ W+E+A+V++LM ER VKK PG+SWIEVKNK ++F+A DRSHP QIY LE L
Sbjct: 716 ESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDL 767
BLAST of CmaCh05G003410 vs. TAIR 10
Match:
AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 430.6 bits (1106), Expect = 4.0e-120
Identity = 251/692 (36.27%), Postives = 382/692 (55.20%), Query Frame = 0
Query: 492 KKWLKSLYSVTGTSFRAL---SNL---LLTHSLAIKLGTIADVYTCNNILSGYWKCKEFR 551
K +KS S G+ A+ +NL L+ H+ AIKLG +++Y ++++S Y KC++
Sbjct: 320 KSSVKSTRSTLGSVLSAIGIVANLDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKME 379
Query: 552 SADVLFDEMPLRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIA 611
+A +F+ + ++ V WN MI GY ++G E+ M+ G++ D++TF S+L A
Sbjct: 380 AAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCA 439
Query: 612 CAGMLDLGQQIHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNA 671
+ L++G Q HS+IIK A N++ G+AL+DMYAKC LEDA F + ++ V+WN
Sbjct: 440 ASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWNT 499
Query: 672 MIAGYAQTGDRETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHG 731
+I Y Q + AF L M G D A L +QVH +K G
Sbjct: 500 IIGSYVQDENESEAFDLFKRMNLCGIVSDGACLASTLKACTHVHGLYQGKQVHCLSVKCG 559
Query: 732 LESANTMCNALITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLL 791
L+ ++LI YS+CG + DA++VF+ S +VS N+L+ + +N E+ A L
Sbjct: 560 LDRDLHTGSSLIDMYSKCGIIKDARKVFS-SLPEWSVVSMNALIAGYSQNNLEE-AVVLF 619
Query: 792 IDMQEHGFEPDLYSYTSIISACFNKELSNNGKSLHGMVIKRGL-EQSVPISNALISMYLK 851
+M G P ++ +I+ AC E G HG + KRG + + +L+ MY+
Sbjct: 620 QEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRGFSSEGEYLGISLLGMYMN 679
Query: 852 SDGGSMKEALCIFESLEI-KDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSF 911
S G M EA +F L K V W +++G SQ G E+A+K + MR + D+ +F
Sbjct: 680 SRG--MTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKFYKEMRHDGVLPDQATF 739
Query: 912 SAVLKSCSDLATFQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSF-EGAS 971
VL+ CS L++ + G+ H L D +E S++LI MY+KCG M+ + + F E
Sbjct: 740 VTVLRVCSVLSSLREGRAIHSLIFHLAHDLDELTSNTLIDMYAKCGDMKGSSQVFDEMRR 799
Query: 972 KSSSITWNALMFGYAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCE 1031
+S+ ++WN+L+ GYA++G AL +F M + + D ITF+ VLTACSH G V G +
Sbjct: 800 RSNVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITFLGVLTACSHAGKVSDGRK 859
Query: 1032 FLRCMESDYGVPPRMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSC 1091
M YG+ R++H AC VDL GR G L EA IEA KP+A +W + LGACR
Sbjct: 860 IFEMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQNLKPDARLWSSLLGACRIH 919
Query: 1092 GNVELACQVARHLLEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSW 1151
G+ A L+E+EP+ YVLLSN+Y WE+ ++++M++RGVKK PG+SW
Sbjct: 920 GDDIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANALRKVMRDRGVKKVPGYSW 979
Query: 1152 IEVKNKVHAFIAEDRSHPSCQQIYFLLEVLME 1175
I+V+ + H F A D+SH +I LE L +
Sbjct: 980 IDVEQRTHIFAAGDKSHSEIGKIEMFLEDLYD 1007
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9LU94 | 3.9e-229 | 56.41 | Putative pentatricopeptide repeat-containing protein At3g25970 OS=Arabidopsis th... | [more] |
Q9SK49 | 7.3e-151 | 58.46 | Zinc finger CCCH domain-containing protein 22 OS=Arabidopsis thaliana OX=3702 GN... | [more] |
Q6K687 | 4.4e-148 | 58.96 | Zinc finger CCCH domain-containing protein 18 OS=Oryza sativa subsp. japonica OX... | [more] |
Q9SVP7 | 2.9e-131 | 35.62 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... | [more] |
Q9ZUW3 | 5.3e-125 | 35.56 | Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX... | [more] |
Match Name | E-value | Identity | Description | |
AT3G25970.1 | 2.8e-230 | 56.41 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT2G24830.1 | 5.2e-152 | 58.46 | zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein | [more] |
AT4G13650.1 | 2.1e-132 | 35.62 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT2G27610.1 | 3.8e-126 | 35.56 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT3G09040.1 | 4.0e-120 | 36.27 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |