Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGTTCATAGCCTCCCGCGCCCACGAATTAAACTTCTTCCGCCGCCCGCCTCCAGTTTTCTTCCCGCGGACCGCCGCCCGCCTCCAGTTTTCTTCCCGCGGACCGCCGCCGTCCTTACGAGCTACCTCCCCCTCTCTCTCATACGCCGGCGACTGCTCCGTTCATCTCTCTCGCTTCTATTCGTTGTTTCGTCTGGGTGAGAATTTCTTAAGCTCAGTGATTACAGCCATAATCTTCAATAAAAAGAACATGTTCTTTAATTACCTTAAATTTTTATTTTTATTATTAAACTTTCAAATTAATAGTTAATACCACCTCTCCACCTTCACTTCTCAAGCTTTCATCGCCTGTTAAGCTTAATGTTTTAAAATTTTAATTTCTTTGATTGTGTTCTACTGATTCTTGCAATTTCTAGCTTCAATCTGTCATCCCTTCCATTTATTTGGTTACATCTTCGTTGTTTATCACTTTTACAACTTACCAATTTCTTCATTGTAGAGAAAAAATCTCCCAACGAATTGTTGAGCTCTCGTGGAGTTATCGAGGAGGAAGTGGCTGCATCTGGTTTGTATGTTGTGGAACAACTCTGCTTTAGATGTCACTCCCCCTTATTTGGTTTCTGGAGTTATAACAGAGAAGGTAATCTTCTAATTCTTCCTTGAAGTCTGTTTATACAAGGTGAAATGTGTTCTTTATGATAGAACCCTAAAATGAAATTAGTTCCAAATGTTTTTGTTCTTCTTTAATTTGATGTATTACTGAATATAGTTTTACTGTAGGAGTAATTCTTTTGGTCTGAATTGATGGAACAGAACTTCATTGCGATGAGAATTCATGGTACTCCTCTTGGGTTCCAAAATCTGTTGATATCTAGTTGGTTACATAGTTCCTCTCAATGCCCAAACAAGTTTCAAATTACTACTAGGTCTTCGTTGTTTTCGATTCGACGGAGTAGTTTCAAAATACCACCAAAGCCCAGGTACCCTTCTGACTCTATTGGAATTTCAATGTCAAAAGATCAATTTGGTCATAAATTTAAGAATAGAGTACAGAATGTCCCACATAGATATAGCTTGGAACACCAAAAGACTGAAGATGTTATGGAAAATCGAGTATGCTTGAGTAGCAAGGAGAAGTTGAAATATTATTCATGGATGTTGCATGAATGTGCATCGAATCGATCTTTAGGTGCTGCAAAAGCGATCCATGGGCTTGTCGTCAAGGATGTGATTAATCCAGATTCCCATTTGTGGGTTTCGTTGGTGAATGTATATGCGAAGTGTAGGTACTCTGCATATGCTCGATTAGTGCTAGCTAAAATGCCTGATCGTGATGTTGTTTCTTGGACGGCGTTAATTCAAGGCCTTGTGGCAGAAGGATTTGTTAATGATAGTATTTATTTATTTCAGGAGATGCAAAATGAAGGAATCATGCCCAATGAGTTCACTCTAGCTACTGGATTAAAAGCATGTTCTTTGTGTATGGCCTTAGATCTTGGAAAACAGATGCATGCTCAAGCTTTTAAACTTGGATTATTACTAGATTTGTTTGTTGGATCTGCTCTTGTTGACCTTTATTCTAAATGTGGAGAGATGGAACTTGCGTCTAGAATGTTCTTTGGTATACCTGAGCAAAATGAAGTGACATGGAATGTGCTACTCAATGGTTATGCTCAAGCGGGCGATGGGATTGGAGTCTTGAAGTTATTTTGTTGTATGATGGAATCAGATGTGAAGTCTAGCAAGTTCACGTTAACTACGGTACTAAAGGGTTGTGCAAACTCCAAAAATTTACGACAGGGGCAGGTAATCCATTCCCTGATTATCAAATATGGGTATGAAGGCGATGAATTCTTAGGTTGTGGTTTGGTCGATACGTACTCGAAGTGTGGGATGGCAATTGATGCATTAGAAGTTTTCAAAAAGATTAAAAAGCCTGATATAGTTGTTTGGAGTGCCATGATTACATGCCTTGATCAGCAAGGACAAAGCGGCGAATCAATTAAGTTATTCCACTTAATGCGATCGAGTAGTACTAGACCAAACCATTATACTATTTGCAGCCTCGTAAGTGCCGCTACAAATATGGAAGATTATCGATATGGGCGAAGCATTCATGCTTGTGTTTGGAAATATGGATTTGAAACTGATATTTCAATCAACAATGCATTAGTCACAATGTACATGAAAAGTGGATGTGTGAATGAGGGTGCAAGGTTGTTTGAATCGATGATCGAACGAGATTTGGTTTCGTGGAATACATATTTATCTGGATTTCATGACTCTGGAATGTACGATCGTTCACTTACTATCTTTGGTCACCTGTTAGAGGACGGTTTTATACCGAACATGTATACTTTTATCGGTATTTTAAGATCGTGTTCTTGTTTTTTAGATGTGCACTTTGGGAGGCAAGTACATACCCATATCATCAAAAATGATCTGGATGATAACGATTTTGTTCAAACAGCTCTGATTGACATGTACGCCAAGTGTATGTGCATGGAAGATGCTGATGTAGCTTTCAACAGGTTAAGTTCTAGAGATCTTTTTACTTGGACAGTTATCATTACGAGTCATGCACAGACGAACCAGGGGGAAAAGGCTCTTAGTTATTTCAGGCAGATGCAACAGGAAGGTGTGAAGCCGAATGAGTTCACGCTTGCTGGCTGTTTGAGTGGTTGCTCGTCCCTCGCTTCTCTAGAAGGTGGACAACAACTACATTCCATGGCTTTTAAGAGTGGACACTTAAGTGATATGTTTGTTGGTAGTGCCCTTGTCGACATGTACGCAAAATGTGGTTGCATGGAAGAGGCTGAGATGTTATTCGAAGCTTTGATTTGTCGAGATACAGTCGCATGGAACACTATTATATGTGGATATTCACAAAACGGGCAAGGAAATAAAGCTCTCGAGGCGTTTCAGATGATGTTAGACGAAGGCATATCGCCCGACGAGGTCACCTTCATAGGCATTCTTTCTGCATGCAGTCACCAAGGCTTAGTTGAAGAGGGCAAAAAACATTTTAACTCTATGTATAGAGATTTTGGTATTTCTCTGACCGTGAACCACTGTGCTTGTATGGTTGATATTCTAGGCCGTGTGGGAAAATTCGATGAGCTCGAAGATTTCATTAAAAAAATGCAACTATCACAACATGCACTGATATGGGAGACTGTCCTTGGAGCTTGTAAAATGCATGGCAATTTGGCTTTGGGTGAGAAAGCTGGAAACAAACTCATTGACCTTCAACCGGAGAAGGAGACTAATTATATATTACTCTCGAATATTTTTGCGACAAAAGGAAAGTGGGACGATGTCAAAAGAGTTCGAACTTTGATGTCTAGTAAAGGAGTTAAAAAGGAGCCAGGGTGTAGCTGGGTCGAGGCTAATGGTCAAGCTCACACGTTCGTGTCTCATGATTGTTCACATCCACAAATTCAGGAAATACATCTAAAGCTAGAGGAGCTTGATAAAGAACTTACTGCCATAGGATATGTGCCCAAAACTGAATACGTGCTTCATAATGTAGAAGAAACTGAAAAAAGGGAATACCTTCGATTTCACAGTGAAAGATTGGCCCTTGCTTTTGCTCTTATAAATACCAGCGCAACAAAAAAAATTCGTATCTTAAAAAATCTACGTATTTGTGGAGATTGCCATGATGTTATGAAGTTTTTATCGAGTATCACCGATCGGGAAATAGTTATTCGTGATGTTCATAGGTTCCACCATTTTAAGAGTGGTGCTTGCTCATGTAATGATTTTTGGTAACGTGGCTTCGCCGTGGTATCTATTATGGAGTCGGTTTATTGCATTTGATACCAAGGAAGCTTGCTCCGGATATCACAGCTTCTGGACTAAAGAAACTTGGTTTCGTTTAAGGGTTTTGCCTATCCAGGTATGGTCTTGTATGGATTCATTTTTCTCGTACTATTTGCATTTGAATTCTCATCCATTTTTTCTTGTACTATTTGCATTTGAATTCTCATCCATAGAACGCGTCTGCTAGGGAGAGGTTTTCACACCCTTATAAAGAATGTTTCATTCCTCTCTCCAATCAATGTGGGATCTCACAATCCACCCCTCTTCAGGGCCCAGCGTTCTCGCTAGCACTCGTTCTCCTCTCCAATCGATATGGGTCCCCCCAATTCACTCTTCTTCGATGCGTAGCGTCCTTGCTGGCACATCGCCTCGTCTCCACCCCATCTTTCAGGCTCAACCTCCTCGCTGGCACATCACCCAGTGTCTGGCTCTGATACCATTTGTAGCGGCCCAAGCCCACCACTAACAAATATTGTCCTCTTTAGGTTAGAAAAAGGTTTTCATACCCTTATAAAGAATGTTCCGTTCCCCTTTCCAACCAATGTGGAATCTCACATGTTCAATATTTGCTCGCCTCTTGTTTCATCGGTCCCTTCCTCTTTGCCCATGGTTGTAATCTTATCCTCTCTTTACTCTCTTAGATTCCGAATTCACTGAATCCTACTTCTTTTGCAGCTTCAACTTGCTAGCATCACCGGGAGGATTGTAAATGTTGCAGGCTGTTTGGACGTGCTGTTGCAAACTGCCATCATACCCTTTCTCAACCTCTGCAACATATGCTTTTGCTCTTAACGTTAATATTAGTTTCCATGTTAGATTCGAGATTCTTGATTCAATGAGGTAGACATGTATACTGATGGGTTCGCAAGCATGTGTAGGCAAATCCGAGCACGGTCTATCACAAAGCTTAGAGGCTTCTTGCTAGGAATCCTATGCATACCATGACCATTCCCTGTAGAACCAGACTGGGTCGTTCCATTGTTAAAATCTTCTTTGAGCTGGTATGTTCTTGGATCATGCCTCAATACATCTTGTTCTTGCTGCTTATATGTCATTTTCCTAATTCTTTGCTGTTCTAACCACTACTTAGAGCCTCATTTTTGGTGTTGAGCCTGTGGTCTGTCTCTGATACCTTCTAAAATCAGAATTTCCATGACTTGGATAGAACACTCCCATCTCACTCACCATGAATAGATATGAATCCGAAACAGTAAGGTATGGTTCTTGTATTCAATCTGTTTGCCTTTTATTGTTTTATAAATGTTATGAATGTCCTTCATGCTTTGAGGCTGAGAAATTGACTTGGATGTAGTTTGTGTGTTTTGCAGTCCAAGATGTTTTCGTATGGCTATGTTCGTATTTGCACAATAGATTATAATCGTAGGATGTGATGATTTACGAGTCGTTGGTTGATCTGATCCTGACCCAAATTTGCAACTCGGGAGGGTGGAAAGGACAAGATATAAAACAATATAGCTTACTCTTAGGGTCGATCTCCCATTGCATACCAGATATCTCATATCTACTTGAGAAAGAGTCTTCGATATCACATATTGAAGCCACATGCCCCTCAAGCCCAAGTCTATCGCTAGAAAATATTGTCCGCTTTAGCTCGTTACGTATCGCCATCAGCCTCACGATTGTAAAATACATCTACTAGGGAAAGATTTTTACACCCTTGTAAGGAATATTGCCTTCCCTTGTAAGTTTGTTATAGCATGCATAGCTCGATATTACACAAAATTGAAGCCATAATCGATAATTTTGCTTTAAAATGAAAAATTATATCAGTTTAAGTTCGAGTTTGACGGAA
mRNA sequence
GAGTTCATAGCCTCCCGCGCCCACGAATTAAACTTCTTCCGCCGCCCGCCTCCAGTTTTCTTCCCGCGGACCGCCGCCCGCCTCCAGTTTTCTTCCCGCGGACCGCCGCCGTCCTTACGAGCTACCTCCCCCTCTCTCTCATACGCCGGCGACTGCTCCGTTCATCTCTCTCGCTTCTATTCGTTGTTTCGTCTGGAGAAAAAATCTCCCAACGAATTGTTGAGCTCTCGTGGAGTTATCGAGGAGGAAGTGGCTGCATCTGGTTTGTATGTTGTGGAACAACTCTGCTTTAGATGTCACTCCCCCTTATTTGGTTTCTGGAGTTATAACAGAGAAGAACTTCATTGCGATGAGAATTCATGGTCTTCGTTGTTTTCGATTCGACGGAGTAGTTTCAAAATACCACCAAAGCCCAGGTACCCTTCTGACTCTATTGGAATTTCAATGTCAAAAGATCAATTTGGTCATAAATTTAAGAATAGAGTACAGAATGTCCCACATAGATATAGCTTGGAACACCAAAAGACTGAAGATGTTATGGAAAATCGAGTATGCTTGAGTAGCAAGGAGAAGTTGAAATATTATTCATGGATGTTGCATGAATGTGCATCGAATCGATCTTTAGGTGCTGCAAAAGCGATCCATGGGCTTGTCGTCAAGGATGTGATTAATCCAGATTCCCATTTGTGGGTTTCGTTGGTGAATGTATATGCGAAGTGTAGGTACTCTGCATATGCTCGATTAGTGCTAGCTAAAATGCCTGATCGTGATGTTGTTTCTTGGACGGCGTTAATTCAAGGCCTTGTGGCAGAAGGATTTGTTAATGATAGTATTTATTTATTTCAGGAGATGCAAAATGAAGGAATCATGCCCAATGAGTTCACTCTAGCTACTGGATTAAAAGCATGTTCTTTGTGTATGGCCTTAGATCTTGGAAAACAGATGCATGCTCAAGCTTTTAAACTTGGATTATTACTAGATTTGTTTGTTGGATCTGCTCTTGTTGACCTTTATTCTAAATGTGGAGAGATGGAACTTGCGTCTAGAATGTTCTTTGGTATACCTGAGCAAAATGAAGTGACATGGAATGTGCTACTCAATGGTTATGCTCAAGCGGGCGATGGGATTGGAGTCTTGAAGTTATTTTGTTGTATGATGGAATCAGATGTGAAGTCTAGCAAGTTCACGTTAACTACGGTACTAAAGGGTTGTGCAAACTCCAAAAATTTACGACAGGGGCAGGTAATCCATTCCCTGATTATCAAATATGGGTATGAAGGCGATGAATTCTTAGGTTGTGGTTTGGTCGATACGTACTCGAAGTGTGGGATGGCAATTGATGCATTAGAAGTTTTCAAAAAGATTAAAAAGCCTGATATAGTTGTTTGGAGTGCCATGATTACATGCCTTGATCAGCAAGGACAAAGCGGCGAATCAATTAAGTTATTCCACTTAATGCGATCGAGTAGTACTAGACCAAACCATTATACTATTTGCAGCCTCGTAAGTGCCGCTACAAATATGGAAGATTATCGATATGGGCGAAGCATTCATGCTTGTGTTTGGAAATATGGATTTGAAACTGATATTTCAATCAACAATGCATTAGTCACAATGTACATGAAAAGTGGATGTGTGAATGAGGGTGCAAGGTTGTTTGAATCGATGATCGAACGAGATTTGGTTTCGTGGAATACATATTTATCTGGATTTCATGACTCTGGAATGTACGATCGTTCACTTACTATCTTTGGTCACCTGTTAGAGGACGGTTTTATACCGAACATGTATACTTTTATCGGTATTTTAAGATCGTGTTCTTGTTTTTTAGATGTGCACTTTGGGAGGCAAGTACATACCCATATCATCAAAAATGATCTGGATGATAACGATTTTGTTCAAACAGCTCTGATTGACATGTACGCCAAGTGTATGTGCATGGAAGATGCTGATGTAGCTTTCAACAGGTTAAGTTCTAGAGATCTTTTTACTTGGACAGTTATCATTACGAGTCATGCACAGACGAACCAGGGGGAAAAGGCTCTTAGTTATTTCAGGCAGATGCAACAGGAAGGTGTGAAGCCGAATGAGTTCACGCTTGCTGGCTGTTTGAGTGGTTGCTCGTCCCTCGCTTCTCTAGAAGGTGGACAACAACTACATTCCATGGCTTTTAAGAGTGGACACTTAAGTGATATGTTTGTTGGTAGTGCCCTTGTCGACATGTACGCAAAATGTGGTTGCATGGAAGAGGCTGAGATGTTATTCGAAGCTTTGATTTGTCGAGATACAGTCGCATGGAACACTATTATATGTGGATATTCACAAAACGGGCAAGGAAATAAAGCTCTCGAGGCGTTTCAGATGATGTTAGACGAAGGCATATCGCCCGACGAGGTCACCTTCATAGGCATTCTTTCTGCATGCAGTCACCAAGGCTTAGTTGAAGAGGGCAAAAAACATTTTAACTCTATGTATAGAGATTTTGGTATTTCTCTGACCGTGAACCACTGTGCTTGTATGGTTGATATTCTAGGCCGTGTGGGAAAATTCGATGAGCTCGAAGATTTCATTAAAAAAATGCAACTATCACAACATGCACTGATATGGGAGACTGTCCTTGGAGCTTGTAAAATGCATGGCAATTTGGCTTTGGGTGAGAAAGCTGGAAACAAACTCATTGACCTTCAACCGGAGAAGGAGACTAATTATATATTACTCTCGAATATTTTTGCGACAAAAGGAAAGTGGGACGATGTCAAAAGAGTTCGAACTTTGATGTCTAGTAAAGGAGTTAAAAAGGAGCCAGGGTGTAGCTGGGTCGAGGCTAATGGTCAAGCTCACACGTTCGTGTCTCATGATTGTTCACATCCACAAATTCAGGAAATACATCTAAAGCTAGAGGAGCTTGATAAAGAACTTACTGCCATAGGATATGTGCCCAAAACTGAATACGTGCTTCATAATGTAGAAGAAACTGAAAAAAGGGAATACCTTCGATTTCACAGTGAAAGATTGGCCCTTGCTTTTGCTCTTATAAATACCAGCGCAACAAAAAAAATTCGTTCCACCATTTTAAGAGTGGTGCTTGCTCATGTAATGATTTTTGGTAACGTGGCTTCGCCGTGGTATCTATTATGGAGTCGGTTTATTGCATTTGATACCAAGGAAGCTTGCTCCGGATATCACAGCTTCTGGACTAAAGAAACTTGGTTTCGTTTAAGGGTTTTGCCTATCCAGCTTCAACTTGCTAGCATCACCGGGAGGATTGTAAATGTTGCAGGCTGTTTGGACGTGCTGTTGCAAACTGCCATCATACCCTTTCTCAACCTCTGCAACATATGCTTTTGCTCTTAACGTTAATATTAGTTTCCATGTTAGATTCGAGATTCTTGATTCAATGAGGTAGACATGTATACTGATGGGTTCGCAAGCATGTGTAGGCAAATCCGAGCACGGTCTATCACAAAGCTTAGAGGCTTCTTGCTAGGAATCCTATGCATACCATGACCATTCCCTGTAGAACCAGACTGGGTCGTTCCATTGTTAAAATCTTCTTTGAGCTGAGCCTCATTTTTGGTGTTGAGCCTGTGGTCTGTCTCTGATACCTTCTAAAATCAGAATTTCCATGACTTGGATAGAACACTCCCATCTCACTCACCATGAATAGATATGAATCCGAAACAGTAAGTCCAAGATGTTTTCGTATGGCTATGTTCGTATTTGCACAATAGATTATAATCGTAGGATGTGATGATTTACGAGTCGTTGGTTGATCTGATCCTGACCCAAATTTGCAACTCGGGAGGGTGGAAAGGACAAGATATAAAACAATATAGCTTACTCTTAGGGTCGATCTCCCATTGCATACCAGATATCTCATATCTACTTGAGAAAGAGTCTTCGATATCACATATTGAAGCCACATGCCCCTCAAGCCCAAGTCTATCGCTAGAAAATATTGTCCGCTTTAGCTCGTTACGTATCGCCATCAGCCTCACGATTGTAAAATACATCTACTAGGGAAAGATTTTTACACCCTTGTAAGGAATATTGCCTTCCCTTGTAAGTTTGTTATAGCATGCATAGCTCGATATTACACAAAATTGAAGCCATAATCGATAATTTTGCTTTAAAATGAAAAATTATATCAGTTTAAGTTCGAGTTTGACGGAA
Coding sequence (CDS)
GAGTTCATAGCCTCCCGCGCCCACGAATTAAACTTCTTCCGCCGCCCGCCTCCAGTTTTCTTCCCGCGGACCGCCGCCCGCCTCCAGTTTTCTTCCCGCGGACCGCCGCCGTCCTTACGAGCTACCTCCCCCTCTCTCTCATACGCCGGCGACTGCTCCGTTCATCTCTCTCGCTTCTATTCGTTGTTTCGTCTGGAGAAAAAATCTCCCAACGAATTGTTGAGCTCTCGTGGAGTTATCGAGGAGGAAGTGGCTGCATCTGGTTTGTATGTTGTGGAACAACTCTGCTTTAGATGTCACTCCCCCTTATTTGGTTTCTGGAGTTATAACAGAGAAGAACTTCATTGCGATGAGAATTCATGGTCTTCGTTGTTTTCGATTCGACGGAGTAGTTTCAAAATACCACCAAAGCCCAGGTACCCTTCTGACTCTATTGGAATTTCAATGTCAAAAGATCAATTTGGTCATAAATTTAAGAATAGAGTACAGAATGTCCCACATAGATATAGCTTGGAACACCAAAAGACTGAAGATGTTATGGAAAATCGAGTATGCTTGAGTAGCAAGGAGAAGTTGAAATATTATTCATGGATGTTGCATGAATGTGCATCGAATCGATCTTTAGGTGCTGCAAAAGCGATCCATGGGCTTGTCGTCAAGGATGTGATTAATCCAGATTCCCATTTGTGGGTTTCGTTGGTGAATGTATATGCGAAGTGTAGGTACTCTGCATATGCTCGATTAGTGCTAGCTAAAATGCCTGATCGTGATGTTGTTTCTTGGACGGCGTTAATTCAAGGCCTTGTGGCAGAAGGATTTGTTAATGATAGTATTTATTTATTTCAGGAGATGCAAAATGAAGGAATCATGCCCAATGAGTTCACTCTAGCTACTGGATTAAAAGCATGTTCTTTGTGTATGGCCTTAGATCTTGGAAAACAGATGCATGCTCAAGCTTTTAAACTTGGATTATTACTAGATTTGTTTGTTGGATCTGCTCTTGTTGACCTTTATTCTAAATGTGGAGAGATGGAACTTGCGTCTAGAATGTTCTTTGGTATACCTGAGCAAAATGAAGTGACATGGAATGTGCTACTCAATGGTTATGCTCAAGCGGGCGATGGGATTGGAGTCTTGAAGTTATTTTGTTGTATGATGGAATCAGATGTGAAGTCTAGCAAGTTCACGTTAACTACGGTACTAAAGGGTTGTGCAAACTCCAAAAATTTACGACAGGGGCAGGTAATCCATTCCCTGATTATCAAATATGGGTATGAAGGCGATGAATTCTTAGGTTGTGGTTTGGTCGATACGTACTCGAAGTGTGGGATGGCAATTGATGCATTAGAAGTTTTCAAAAAGATTAAAAAGCCTGATATAGTTGTTTGGAGTGCCATGATTACATGCCTTGATCAGCAAGGACAAAGCGGCGAATCAATTAAGTTATTCCACTTAATGCGATCGAGTAGTACTAGACCAAACCATTATACTATTTGCAGCCTCGTAAGTGCCGCTACAAATATGGAAGATTATCGATATGGGCGAAGCATTCATGCTTGTGTTTGGAAATATGGATTTGAAACTGATATTTCAATCAACAATGCATTAGTCACAATGTACATGAAAAGTGGATGTGTGAATGAGGGTGCAAGGTTGTTTGAATCGATGATCGAACGAGATTTGGTTTCGTGGAATACATATTTATCTGGATTTCATGACTCTGGAATGTACGATCGTTCACTTACTATCTTTGGTCACCTGTTAGAGGACGGTTTTATACCGAACATGTATACTTTTATCGGTATTTTAAGATCGTGTTCTTGTTTTTTAGATGTGCACTTTGGGAGGCAAGTACATACCCATATCATCAAAAATGATCTGGATGATAACGATTTTGTTCAAACAGCTCTGATTGACATGTACGCCAAGTGTATGTGCATGGAAGATGCTGATGTAGCTTTCAACAGGTTAAGTTCTAGAGATCTTTTTACTTGGACAGTTATCATTACGAGTCATGCACAGACGAACCAGGGGGAAAAGGCTCTTAGTTATTTCAGGCAGATGCAACAGGAAGGTGTGAAGCCGAATGAGTTCACGCTTGCTGGCTGTTTGAGTGGTTGCTCGTCCCTCGCTTCTCTAGAAGGTGGACAACAACTACATTCCATGGCTTTTAAGAGTGGACACTTAAGTGATATGTTTGTTGGTAGTGCCCTTGTCGACATGTACGCAAAATGTGGTTGCATGGAAGAGGCTGAGATGTTATTCGAAGCTTTGATTTGTCGAGATACAGTCGCATGGAACACTATTATATGTGGATATTCACAAAACGGGCAAGGAAATAAAGCTCTCGAGGCGTTTCAGATGATGTTAGACGAAGGCATATCGCCCGACGAGGTCACCTTCATAGGCATTCTTTCTGCATGCAGTCACCAAGGCTTAGTTGAAGAGGGCAAAAAACATTTTAACTCTATGTATAGAGATTTTGGTATTTCTCTGACCGTGAACCACTGTGCTTGTATGGTTGATATTCTAGGCCGTGTGGGAAAATTCGATGAGCTCGAAGATTTCATTAAAAAAATGCAACTATCACAACATGCACTGATATGGGAGACTGTCCTTGGAGCTTGTAAAATGCATGGCAATTTGGCTTTGGGTGAGAAAGCTGGAAACAAACTCATTGACCTTCAACCGGAGAAGGAGACTAATTATATATTACTCTCGAATATTTTTGCGACAAAAGGAAAGTGGGACGATGTCAAAAGAGTTCGAACTTTGATGTCTAGTAAAGGAGTTAAAAAGGAGCCAGGGTGTAGCTGGGTCGAGGCTAATGGTCAAGCTCACACGTTCGTGTCTCATGATTGTTCACATCCACAAATTCAGGAAATACATCTAAAGCTAGAGGAGCTTGATAAAGAACTTACTGCCATAGGATATGTGCCCAAAACTGAATACGTGCTTCATAATGTAGAAGAAACTGAAAAAAGGGAATACCTTCGATTTCACAGTGAAAGATTGGCCCTTGCTTTTGCTCTTATAAATACCAGCGCAACAAAAAAAATTCGTTCCACCATTTTAAGAGTGGTGCTTGCTCATGTAATGATTTTTGGTAACGTGGCTTCGCCGTGGTATCTATTATGGAGTCGGTTTATTGCATTTGATACCAAGGAAGCTTGCTCCGGATATCACAGCTTCTGGACTAAAGAAACTTGGTTTCGTTTAAGGGTTTTGCCTATCCAGCTTCAACTTGCTAGCATCACCGGGAGGATTGTAAATGTTGCAGGCTGTTTGGACGTGCTGTTGCAAACTGCCATCATACCCTTTCTCAACCTCTGCAACATATGCTTTTGCTCTTAA
Protein sequence
EFIASRAHELNFFRRPPPVFFPRTAARLQFSSRGPPPSLRATSPSLSYAGDCSVHLSRFYSLFRLEKKSPNELLSSRGVIEEEVAASGLYVVEQLCFRCHSPLFGFWSYNREELHCDENSWSSLFSIRRSSFKIPPKPRYPSDSIGISMSKDQFGHKFKNRVQNVPHRYSLEHQKTEDVMENRVCLSSKEKLKYYSWMLHECASNRSLGAAKAIHGLVVKDVINPDSHLWVSLVNVYAKCRYSAYARLVLAKMPDRDVVSWTALIQGLVAEGFVNDSIYLFQEMQNEGIMPNEFTLATGLKACSLCMALDLGKQMHAQAFKLGLLLDLFVGSALVDLYSKCGEMELASRMFFGIPEQNEVTWNVLLNGYAQAGDGIGVLKLFCCMMESDVKSSKFTLTTVLKGCANSKNLRQGQVIHSLIIKYGYEGDEFLGCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWSAMITCLDQQGQSGESIKLFHLMRSSSTRPNHYTICSLVSAATNMEDYRYGRSIHACVWKYGFETDISINNALVTMYMKSGCVNEGARLFESMIERDLVSWNTYLSGFHDSGMYDRSLTIFGHLLEDGFIPNMYTFIGILRSCSCFLDVHFGRQVHTHIIKNDLDDNDFVQTALIDMYAKCMCMEDADVAFNRLSSRDLFTWTVIITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAGCLSGCSSLASLEGGQQLHSMAFKSGHLSDMFVGSALVDMYAKCGCMEEAEMLFEALICRDTVAWNTIICGYSQNGQGNKALEAFQMMLDEGISPDEVTFIGILSACSHQGLVEEGKKHFNSMYRDFGISLTVNHCACMVDILGRVGKFDELEDFIKKMQLSQHALIWETVLGACKMHGNLALGEKAGNKLIDLQPEKETNYILLSNIFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVEANGQAHTFVSHDCSHPQIQEIHLKLEELDKELTAIGYVPKTEYVLHNVEETEKREYLRFHSERLALAFALINTSATKKIRSTILRVVLAHVMIFGNVASPWYLLWSRFIAFDTKEACSGYHSFWTKETWFRLRVLPIQLQLASITGRIVNVAGCLDVLLQTAIIPFLNLCNICFCS
Homology
BLAST of CmaCh08G003410 vs. ExPASy Swiss-Prot
Match:
Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)
HSP 1 Score: 517.7 bits (1332), Expect = 3.3e-145
Identity = 311/843 (36.89%), Postives = 475/843 (56.35%), Query Frame = 0
Query: 198 MLHECASNRSLGA--AKAIHGLVVKDVINPDSHLWVSLVNVYAKCRYS-AYARLVLAKMP 257
+L C S+G + IHGL+ K D+ + L+++Y KC S YA +
Sbjct: 108 VLRACQEIGSVGILFGRQIHGLMFKLSYAVDAVVSNVLISMYWKCIGSVGYALCAFGDIE 167
Query: 258 DRDVVSWTALIQGLVAEGFVNDSIYLFQEMQNEGIMPNEFTLATGL-KACSLCMA-LDLG 317
++ VSW ++I G + +F MQ +G P E+T + + ACSL + L
Sbjct: 168 VKNSVSWNSIISVYSQAGDQRSAFRIFSSMQYDGSRPTEYTFGSLVTTACSLTEPDVRLL 227
Query: 318 KQMHAQAFKLGLLLDLFVGSALVDLYSKCGEMELASRMFFGIPEQNEVTWNVLLNGYAQA 377
+Q+ K GLL DLFVGS LV ++K G + A ++F + +N VT N L+ G +
Sbjct: 228 EQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSLSYARKVFNQMETRNAVTLNGLMVGLVRQ 287
Query: 378 GDGIGVLKLFC---CMMESDVKSSKFTLTTVLK-GCANSKNLRQGQVIHSLIIKYG-YEG 437
G KLF M++ +S L++ + A L++G+ +H +I G +
Sbjct: 288 KWGEEATKLFMDMNSMIDVSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITTGLVDF 347
Query: 438 DEFLGCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWSAMITCLDQQGQSGESIKLFHLMR 497
+G GLV+ Y+KCG DA VF + D V W++MIT LDQ G E+++ + MR
Sbjct: 348 MVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMR 407
Query: 498 SSSTRPNHYTICSLVSAATNMEDYRYGRSIHACVWKYGFETDISINNALVTMYMKSGCVN 557
P +T+ S +S+ +++ + G+ IH K G + ++S++NAL+T+Y ++G +N
Sbjct: 408 RHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLN 467
Query: 558 EGARLFESMIERDLVSWNTYLSGFHDSGMYDRSL----TIFGHLLEDGFIPNMYTFIGIL 617
E ++F SM E D VSWN+ + S +RSL F + G N TF +L
Sbjct: 468 ECRKIFSSMPEHDQVSWNSIIGALARS---ERSLPEAVVCFLNAQRAGQKLNRITFSSVL 527
Query: 618 RSCSCFLDVHFGRQVHTHIIKNDLDDNDFVQTALIDMYAKCMCMEDADVAFNRLSS-RDL 677
+ S G+Q+H +KN++ D + ALI Y KC M+ + F+R++ RD
Sbjct: 528 SAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDN 587
Query: 678 FTWTVIITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAGCLSGCSSLASLEGGQQLHSM 737
TW +I+ + KAL M Q G + + F A LS +S+A+LE G ++H+
Sbjct: 588 VTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHAC 647
Query: 738 AFKSGHLSDMFVGSALVDMYAKCGCMEEAEMLFEALICRDTVAWNTIICGYSQNGQGNKA 797
+ ++ SD+ VGSALVDMY+KCG ++ A F + R++ +WN++I GY+++GQG +A
Sbjct: 648 SVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEA 707
Query: 798 LEAFQ-MMLDEGISPDEVTFIGILSACSHQGLVEEGKKHFNSMYRDFGISLTVNHCACMV 857
L+ F+ M LD PD VTF+G+LSACSH GL+EEG KHF SM +G++ + H +CM
Sbjct: 708 LKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMA 767
Query: 858 DILGRVGKFDELEDFIKKMQLSQHALIWETVLGA-CKMHGNLA-LGEKAGNKLIDLQPEK 917
D+LGR G+ D+LEDFI+KM + + LIW TVLGA C+ +G A LG+KA L L+PE
Sbjct: 768 DVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPEN 827
Query: 918 ETNYILLSNIFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVEANGQAHTFVSHDCSHPQI 977
NY+LL N++A G+W+D+ + R M VKKE G SWV H FV+ D SHP
Sbjct: 828 AVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDA 887
Query: 978 QEIHLKLEELDKELTAIGYVPKTEYVLHNVEETEKREYLRFHSERLALAFAL-INTSATK 1022
I+ KL+EL++++ GYVP+T + L+++E+ K E L +HSE+LA+AF L S+T
Sbjct: 888 DVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEILSYHSEKLAVAFVLAAQRSSTL 947
BLAST of CmaCh08G003410 vs. ExPASy Swiss-Prot
Match:
Q9SS83 (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)
HSP 1 Score: 517.3 bits (1331), Expect = 4.3e-145
Identity = 265/748 (35.43%), Postives = 424/748 (56.68%), Query Frame = 0
Query: 225 PDSHLWVSLVNVYAKCRYSAYARLVLAKMPDRDVVSWTALIQGLVAEGFVNDSIYLFQEM 284
PD +V+++N Y + ARL+ +M DVV+W +I G G +I F M
Sbjct: 259 PDHLAFVTVINTYIRLGKLKDARLLFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNM 318
Query: 285 QNEGIMPNEFTLATGLKACSLCMALDLGKQMHAQAFKLGLLLDLFVGSALVDLYSKCGEM 344
+ + TL + L A + LDLG +HA+A KLGL +++VGS+LV +YSKC +M
Sbjct: 319 RKSSVKSTRSTLGSVLSAIGIVANLDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKM 378
Query: 345 ELASRMFFGIPEQNEVTWNVLLNGYAQAGDGIGVLKLFCCMMESDVKSSKFTLTTVLKGC 404
E A+++F + E+N+V WN ++ GYA G+ V++LF M S FT T++L C
Sbjct: 379 EAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTC 438
Query: 405 ANSKNLRQGQVIHSLIIKYGYEGDEFLGCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWS 464
A S +L G HS+IIK + F+G LVD Y+KCG DA ++F+++ D V W+
Sbjct: 439 AASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWN 498
Query: 465 AMITCLDQQGQSGESIKLFHLMRSSSTRPNHYTICSLVSAATNMEDYRYGRSIHACVWKY 524
+I Q E+ LF M + + S + A T++ G+ +H K
Sbjct: 499 TIIGSYVQDENESEAFDLFKRMNLCGIVSDGACLASTLKACTHVHGLYQGKQVHCLSVKC 558
Query: 525 GFETDISINNALVTMYMKSGCVNEGARLFESMIERDLVSWNTYLSGFHDSGMYDRSLTIF 584
G + D+ ++L+ MY K G + + ++F S+ E +VS N ++G+ + + + ++ +F
Sbjct: 559 GLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNALIAGYSQNNL-EEAVVLF 618
Query: 585 GHLLEDGFIPNMYTFIGILRSCSCFLDVHFGRQVHTHIIKNDL-DDNDFVQTALIDMYAK 644
+L G P+ TF I+ +C + G Q H I K + +++ +L+ MY
Sbjct: 619 QEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRGFSSEGEYLGISLLGMYMN 678
Query: 645 CMCMEDADVAFNRLSS-RDLFTWTVIITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAG 704
M +A F+ LSS + + WT +++ H+Q E+AL ++++M+ +GV P++ T
Sbjct: 679 SRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKFYKEMRHDGVLPDQATFVT 738
Query: 705 CLSGCSSLASLEGGQQLHSMAFKSGHLSDMFVGSALVDMYAKCGCMEEAEMLFEALICR- 764
L CS L+SL G+ +HS+ F H D + L+DMYAKCG M+ + +F+ + R
Sbjct: 739 VLRVCSVLSSLREGRAIHSLIFHLAHDLDELTSNTLIDMYAKCGDMKGSSQVFDEMRRRS 798
Query: 765 DTVAWNTIICGYSQNGQGNKALEAFQMMLDEGISPDEVTFIGILSACSHQGLVEEGKKHF 824
+ V+WN++I GY++NG AL+ F M I PDE+TF+G+L+ACSH G V +G+K F
Sbjct: 799 NVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITFLGVLTACSHAGKVSDGRKIF 858
Query: 825 NSMYRDFGISLTVNHCACMVDILGRVGKFDELEDFIKKMQLSQHALIWETVLGACKMHGN 884
M +GI V+H ACMVD+LGR G E +DFI+ L A +W ++LGAC++HG+
Sbjct: 859 EMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQNLKPDARLWSSLLGACRIHGD 918
Query: 885 LALGEKAGNKLIDLQPEKETNYILLSNIFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVE 944
GE + KLI+L+P+ + Y+LLSNI+A++G W+ +R +M +GVKK PG SW++
Sbjct: 919 DIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANALRKVMRDRGVKKVPGYSWID 978
Query: 945 ANGQAHTFVSHDCSHPQIQEIHLKLEEL 970
+ H F + D SH +I +I + LE+L
Sbjct: 979 VEQRTHIFAAGDKSHSEIGKIEMFLEDL 1005
BLAST of CmaCh08G003410 vs. ExPASy Swiss-Prot
Match:
Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)
HSP 1 Score: 507.7 bits (1306), Expect = 3.4e-142
Identity = 282/910 (30.99%), Postives = 470/910 (51.65%), Query Frame = 0
Query: 187 SSKEKLKYYSWMLHECASNRSLGAAKAIHGLVVKDVINPDSHLWVSLVNVYAKCRYSAYA 246
SS + L ++ L K H ++ NP+ L +L+++Y+KC YA
Sbjct: 34 SSSSSSSQWFGFLRNAITSSDLMLGKCTHARILTFEENPERFLINNLISMYSKCGSLTYA 93
Query: 247 RLVLAKMPDRDVVSWTALIQGLVAEG-----FVNDSIYLFQEMQNEGIMPNEFTLATGLK 306
R V KMPDRD+VSW +++ + + LF+ ++ + + + TL+ LK
Sbjct: 94 RRVFDKMPDRDLVSWNSILAAYAQSSECVVENIQQAFLLFRILRQDVVYTSRMTLSPMLK 153
Query: 307 ACSLCMALDLGKQMHAQAFKLGLLLDLFVGSALVDLYSKCGEMELASRMFFGIPEQNEVT 366
C + + H A K+GL D FV ALV++Y K G+++ +F +P ++ V
Sbjct: 154 LCLHSGYVWASESFHGYACKIGLDGDEFVAGALVNIYLKFGKVKEGKVLFEEMPYRDVVL 213
Query: 367 WNVLLNGYAQ-------------------------------------------------- 426
WN++L Y +
Sbjct: 214 WNLMLKAYLEMGFKEEAIDLSSAFHSSGLNPNEITLRLLARISGDDSDAGQVKSFANGND 273
Query: 427 -------------------AGDGIGVLKLFCCMMESDVKSSKFTLTTVLKGCANSKNLRQ 486
+G +LK F M+ESDV+ + T +L +L
Sbjct: 274 ASSVSEIIFRNKGLSEYLHSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLAL 333
Query: 487 GQVIHSLIIKYGYEGDEFLGCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWSAMITCLDQ 546
GQ +H + +K G + + L++ Y K A VF + + D++ W+++I + Q
Sbjct: 334 GQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQ 393
Query: 547 QGQSGESIKLFHLMRSSSTRPNHYTICSLVSAATNM-EDYRYGRSIHACVWKYGFETDIS 606
G E++ LF + +P+ YT+ S++ AA+++ E + +H K +D
Sbjct: 394 NGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSF 453
Query: 607 INNALVTMYMKSGCVNEGARLFESMIERDLVSWNTYLSGFHDSGMYDRSLTIFGHLLEDG 666
++ AL+ Y ++ C+ E LFE DLV+WN ++G+ S ++L +F + + G
Sbjct: 454 VSTALIDAYSRNRCMKEAEILFERH-NFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQG 513
Query: 667 FIPNMYTFIGILRSCSCFLDVHFGRQVHTHIIKNDLDDNDFVQTALIDMYAKCMCMEDAD 726
+ +T + ++C ++ G+QVH + IK+ D + +V + ++DMY KC M A
Sbjct: 514 ERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQ 573
Query: 727 VAFNRLSSRDLFTWTVIITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAGCLSGCSSLA 786
AF+ + D WT +I+ + + E+A F QM+ GV P+EFT+A S L
Sbjct: 574 FAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLT 633
Query: 787 SLEGGQQLHSMAFKSGHLSDMFVGSALVDMYAKCGCMEEAEMLFEALICRDTVAWNTIIC 846
+LE G+Q+H+ A K +D FVG++LVDMYAKCG +++A LF+ + + AWN ++
Sbjct: 634 ALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLV 693
Query: 847 GYSQNGQGNKALEAFQMMLDEGISPDEVTFIGILSACSHQGLVEEGKKHFNSMYRDFGIS 906
G +Q+G+G + L+ F+ M GI PD+VTFIG+LSACSH GLV E KH SM+ D+GI
Sbjct: 694 GLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIK 753
Query: 907 LTVNHCACMVDILGRVGKFDELEDFIKKMQLSQHALIWETVLGACKMHGNLALGEKAGNK 966
+ H +C+ D LGR G + E+ I+ M + A ++ T+L AC++ G+ G++ K
Sbjct: 754 PEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATK 813
Query: 967 LIDLQPEKETNYILLSNIFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVEANGQAHTFVS 1022
L++L+P + Y+LLSN++A KWD++K RT+M VKK+PG SW+E + H FV
Sbjct: 814 LLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVV 873
BLAST of CmaCh08G003410 vs. ExPASy Swiss-Prot
Match:
Q9SVA5 (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E52 PE=3 SV=1)
HSP 1 Score: 503.8 bits (1296), Expect = 4.9e-141
Identity = 257/781 (32.91%), Postives = 436/781 (55.83%), Query Frame = 0
Query: 195 YSWMLHECASNRSLGAAKAIHGLVVKDVINPDSHLWVSLVNVYAKCRYSAYARLVLAKMP 254
++ +L AS+ L +HG ++ + D++L L+N+Y++ YAR V KMP
Sbjct: 47 FARLLQLRASDDLLHYQNVVHGQIIVWGLELDTYLSNILINLYSRAGGMVYARKVFEKMP 106
Query: 255 DRDVVSWTALIQGLVAEGFVNDSIYLFQEM-QNEGIMPNEFTLATGLKACSLCMALDLGK 314
+R++VSW+ ++ G +S+ +F E + PNE+ L++ ++ACS
Sbjct: 107 ERNLVSWSTMVSACNHHGIYEESLVVFLEFWRTRKDSPNEYILSSFIQACSGLDGRGRWM 166
Query: 315 QMHAQAF--KLGLLLDLFVGSALVDLYSKCGEMELASRMFFGIPEQNEVTWNVLLNGYAQ 374
Q+F K G D++VG+ L+D Y K G ++ A +F +PE++ VTW +++G +
Sbjct: 167 VFQLQSFLVKSGFDRDVYVGTLLIDFYLKDGNIDYARLVFDALPEKSTVTWTTMISGCVK 226
Query: 375 AGDGIGVLKLFCCMMESDVKSSKFTLTTVLKGCANSKNLRQGQVIHSLIIKYGYEGDEFL 434
G L+LF +ME +V + L+TVL C+ L G+ IH+ I++YG E D L
Sbjct: 227 MGRSYVSLQLFYQLMEDNVVPDGYILSTVLSACSILPFLEGGKQIHAHILRYGLEMDASL 286
Query: 435 GCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWSAMITCLDQQGQSGESIKLFHLMRSSST 494
L+D+Y KCG I A ++F + +I+ W+ +++ Q E+++LF M
Sbjct: 287 MNVLIDSYVKCGRVIAAHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSMSKFGL 346
Query: 495 RPNHYTICSLVSAATNMEDYRYGRSIHACVWKYGFETDISINNALVTMYMKSGCVNEGAR 554
+P+ Y S++++ ++ +G +HA K D + N+L+ MY K C+ + +
Sbjct: 347 KPDMYACSSILTSCASLHALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARK 406
Query: 555 LFESMIERDLVSWNTYLSGFHDSGM---YDRSLTIFGHLLEDGFIPNMYTFIGILRSCSC 614
+F+ D+V +N + G+ G +L IF + P++ TF+ +LR+ +
Sbjct: 407 VFDIFAAADVVLFNAMIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASAS 466
Query: 615 FLDVHFGRQVHTHIIKNDLDDNDFVQTALIDMYAKCMCMEDADVAFNRLSSRDLFTWTVI 674
+ +Q+H + K L+ + F +ALID+Y+ C C++D+ + F+ + +DL W +
Sbjct: 467 LTSLGLSKQIHGLMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSM 526
Query: 675 ITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAGCLSGCSSLASLEGGQQLHSMAFKSGH 734
+ Q ++ E+AL+ F ++Q +P+EFT A ++ +LAS++ GQ+ H K G
Sbjct: 527 FAGYVQQSENEEALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGL 586
Query: 735 LSDMFVGSALVDMYAKCGCMEEAEMLFEALICRDTVAWNTIICGYSQNGQGNKALEAFQM 794
+ ++ +AL+DMYAKCG E+A F++ RD V WN++I Y+ +G+G KAL+ +
Sbjct: 587 ECNPYITNALLDMYAKCGSPEDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEK 646
Query: 795 MLDEGISPDEVTFIGILSACSHQGLVEEGKKHFNSMYRDFGISLTVNHCACMVDILGRVG 854
M+ EGI P+ +TF+G+LSACSH GLVE+G K F M R FGI H CMV +LGR G
Sbjct: 647 MMSEGIEPNYITFVGVLSACSHAGLVEDGLKQFELMLR-FGIEPETEHYVCMVSLLGRAG 706
Query: 855 KFDELEDFIKKMQLSQHALIWETVLGACKMHGNLALGEKAGNKLIDLQPEKETNYILLSN 914
+ ++ + I+KM A++W ++L C GN+ L E A I P+ ++ +LSN
Sbjct: 707 RLNKARELIEKMPTKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSN 766
Query: 915 IFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVEANGQAHTFVSHDCSHPQIQEIHLKLEE 970
I+A+KG W + K+VR M +GV KEPG SW+ N + H F+S D SH + +I+ L++
Sbjct: 767 IYASKGMWTEAKKVRERMKVEGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDD 826
BLAST of CmaCh08G003410 vs. ExPASy Swiss-Prot
Match:
Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)
HSP 1 Score: 502.3 bits (1292), Expect = 1.4e-140
Identity = 258/827 (31.20%), Postives = 448/827 (54.17%), Query Frame = 0
Query: 195 YSWMLHEC-ASNRSLGAAKAIHGLVVKDVINPDSHLWVSLVNVYAKCRYSAYARLVLAKM 254
+S +L C + + + IH ++ + + + L+++Y++ + AR V +
Sbjct: 189 FSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGL 248
Query: 255 PDRDVVSWTALIQGLVAEGFVNDSIYLFQEMQNEGIMPNEFTLATGLKACSLCMALDLGK 314
+D SW A+I GL ++I LF +M GIMP + ++ L AC +L++G+
Sbjct: 249 RLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGE 308
Query: 315 QMHAQAFKLGLLLDLFVGSALVDLYSKCGEMELASRMFFGIPEQNEVTWNVLLNGYAQAG 374
Q+H KLG D +V +ALV LY G + A +F + +++ VT+N L+NG +Q G
Sbjct: 309 QLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCG 368
Query: 375 DGIGVLKLFCCMMESDVKSSKFTLTTVLKGCANSKNLRQGQVIHSLIIKYGYEGDEFLGC 434
G ++LF M ++ TL +++ C+ L +GQ +H+ K G+ + +
Sbjct: 369 YGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEG 428
Query: 435 GLVDTYSKCGMAIDALEVFKKIKKPDIVVWSAMITCLDQQGQSGESIKLFHLMRSSSTRP 494
L++ Y+KC AL+ F + + ++V+W+ M+ S ++F M+ P
Sbjct: 429 ALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVP 488
Query: 495 NHYTICSLVSAATNMEDYRYGRSIHACVWKYGFETDISINNALVTMYMKSGCVNEGARLF 554
N YT S++ + D G IH+ + K F+ + + + L+ MY K G ++ +
Sbjct: 489 NQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDIL 548
Query: 555 ESMIERDLVSWNTYLSGFHDSGMYDRSLTIFGHLLEDGFIPNMYTFIGILRSCSCFLDVH 614
+D+VSW T ++G+ D++LT F +L+ G + + +C+ +
Sbjct: 549 IRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALK 608
Query: 615 FGRQVHTHIIKNDLDDNDFVQTALIDMYAKCMCMEDADVAFNRLSSRDLFTWTVIITSHA 674
G+Q+H + + Q AL+ +Y++C +E++ +AF + + D W +++
Sbjct: 609 EGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQ 668
Query: 675 QTNQGEKALSYFRQMQQEGVKPNEFTLAGCLSGCSSLASLEGGQQLHSMAFKSGHLSDMF 734
Q+ E+AL F +M +EG+ N FT + S A+++ G+Q+H++ K+G+ S+
Sbjct: 669 QSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETE 728
Query: 735 VGSALVDMYAKCGCMEEAEMLFEALICRDTVAWNTIICGYSQNGQGNKALEAFQMMLDEG 794
V +AL+ MYAKCG + +AE F + ++ V+WN II YS++G G++AL++F M+
Sbjct: 729 VCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSN 788
Query: 795 ISPDEVTFIGILSACSHQGLVEEGKKHFNSMYRDFGISLTVNHCACMVDILGRVGKFDEL 854
+ P+ VT +G+LSACSH GLV++G +F SM ++G+S H C+VD+L R G
Sbjct: 789 VRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRA 848
Query: 855 EDFIKKMQLSQHALIWETVLGACKMHGNLALGEKAGNKLIDLQPEKETNYILLSNIFATK 914
++FI++M + AL+W T+L AC +H N+ +GE A + L++L+PE Y+LLSN++A
Sbjct: 849 KEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVS 908
Query: 915 GKWDDVKRVRTLMSSKGVKKEPGCSWVEANGQAHTFVSHDCSHPQIQEIHLKLEELDKEL 974
KWD R M KGVKKEPG SW+E H+F D +HP EIH ++L K
Sbjct: 909 KKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRA 968
Query: 975 TAIGYVPKTEYVLHNVEETEKREYLRFHSERLALAFALINTSATKKI 1021
+ IGYV +L+ ++ +K + HSE+LA++F L++ AT I
Sbjct: 969 SEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPI 1015
BLAST of CmaCh08G003410 vs. TAIR 10
Match:
AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 517.7 bits (1332), Expect = 2.3e-146
Identity = 311/843 (36.89%), Postives = 475/843 (56.35%), Query Frame = 0
Query: 198 MLHECASNRSLGA--AKAIHGLVVKDVINPDSHLWVSLVNVYAKCRYS-AYARLVLAKMP 257
+L C S+G + IHGL+ K D+ + L+++Y KC S YA +
Sbjct: 108 VLRACQEIGSVGILFGRQIHGLMFKLSYAVDAVVSNVLISMYWKCIGSVGYALCAFGDIE 167
Query: 258 DRDVVSWTALIQGLVAEGFVNDSIYLFQEMQNEGIMPNEFTLATGL-KACSLCMA-LDLG 317
++ VSW ++I G + +F MQ +G P E+T + + ACSL + L
Sbjct: 168 VKNSVSWNSIISVYSQAGDQRSAFRIFSSMQYDGSRPTEYTFGSLVTTACSLTEPDVRLL 227
Query: 318 KQMHAQAFKLGLLLDLFVGSALVDLYSKCGEMELASRMFFGIPEQNEVTWNVLLNGYAQA 377
+Q+ K GLL DLFVGS LV ++K G + A ++F + +N VT N L+ G +
Sbjct: 228 EQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSLSYARKVFNQMETRNAVTLNGLMVGLVRQ 287
Query: 378 GDGIGVLKLFC---CMMESDVKSSKFTLTTVLK-GCANSKNLRQGQVIHSLIIKYG-YEG 437
G KLF M++ +S L++ + A L++G+ +H +I G +
Sbjct: 288 KWGEEATKLFMDMNSMIDVSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITTGLVDF 347
Query: 438 DEFLGCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWSAMITCLDQQGQSGESIKLFHLMR 497
+G GLV+ Y+KCG DA VF + D V W++MIT LDQ G E+++ + MR
Sbjct: 348 MVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMR 407
Query: 498 SSSTRPNHYTICSLVSAATNMEDYRYGRSIHACVWKYGFETDISINNALVTMYMKSGCVN 557
P +T+ S +S+ +++ + G+ IH K G + ++S++NAL+T+Y ++G +N
Sbjct: 408 RHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLN 467
Query: 558 EGARLFESMIERDLVSWNTYLSGFHDSGMYDRSL----TIFGHLLEDGFIPNMYTFIGIL 617
E ++F SM E D VSWN+ + S +RSL F + G N TF +L
Sbjct: 468 ECRKIFSSMPEHDQVSWNSIIGALARS---ERSLPEAVVCFLNAQRAGQKLNRITFSSVL 527
Query: 618 RSCSCFLDVHFGRQVHTHIIKNDLDDNDFVQTALIDMYAKCMCMEDADVAFNRLSS-RDL 677
+ S G+Q+H +KN++ D + ALI Y KC M+ + F+R++ RD
Sbjct: 528 SAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDN 587
Query: 678 FTWTVIITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAGCLSGCSSLASLEGGQQLHSM 737
TW +I+ + KAL M Q G + + F A LS +S+A+LE G ++H+
Sbjct: 588 VTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHAC 647
Query: 738 AFKSGHLSDMFVGSALVDMYAKCGCMEEAEMLFEALICRDTVAWNTIICGYSQNGQGNKA 797
+ ++ SD+ VGSALVDMY+KCG ++ A F + R++ +WN++I GY+++GQG +A
Sbjct: 648 SVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEA 707
Query: 798 LEAFQ-MMLDEGISPDEVTFIGILSACSHQGLVEEGKKHFNSMYRDFGISLTVNHCACMV 857
L+ F+ M LD PD VTF+G+LSACSH GL+EEG KHF SM +G++ + H +CM
Sbjct: 708 LKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMA 767
Query: 858 DILGRVGKFDELEDFIKKMQLSQHALIWETVLGA-CKMHGNLA-LGEKAGNKLIDLQPEK 917
D+LGR G+ D+LEDFI+KM + + LIW TVLGA C+ +G A LG+KA L L+PE
Sbjct: 768 DVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPEN 827
Query: 918 ETNYILLSNIFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVEANGQAHTFVSHDCSHPQI 977
NY+LL N++A G+W+D+ + R M VKKE G SWV H FV+ D SHP
Sbjct: 828 AVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDA 887
Query: 978 QEIHLKLEELDKELTAIGYVPKTEYVLHNVEETEKREYLRFHSERLALAFAL-INTSATK 1022
I+ KL+EL++++ GYVP+T + L+++E+ K E L +HSE+LA+AF L S+T
Sbjct: 888 DVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEILSYHSEKLAVAFVLAAQRSSTL 947
BLAST of CmaCh08G003410 vs. TAIR 10
Match:
AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 517.3 bits (1331), Expect = 3.0e-146
Identity = 265/748 (35.43%), Postives = 424/748 (56.68%), Query Frame = 0
Query: 225 PDSHLWVSLVNVYAKCRYSAYARLVLAKMPDRDVVSWTALIQGLVAEGFVNDSIYLFQEM 284
PD +V+++N Y + ARL+ +M DVV+W +I G G +I F M
Sbjct: 259 PDHLAFVTVINTYIRLGKLKDARLLFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNM 318
Query: 285 QNEGIMPNEFTLATGLKACSLCMALDLGKQMHAQAFKLGLLLDLFVGSALVDLYSKCGEM 344
+ + TL + L A + LDLG +HA+A KLGL +++VGS+LV +YSKC +M
Sbjct: 319 RKSSVKSTRSTLGSVLSAIGIVANLDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKM 378
Query: 345 ELASRMFFGIPEQNEVTWNVLLNGYAQAGDGIGVLKLFCCMMESDVKSSKFTLTTVLKGC 404
E A+++F + E+N+V WN ++ GYA G+ V++LF M S FT T++L C
Sbjct: 379 EAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTC 438
Query: 405 ANSKNLRQGQVIHSLIIKYGYEGDEFLGCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWS 464
A S +L G HS+IIK + F+G LVD Y+KCG DA ++F+++ D V W+
Sbjct: 439 AASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWN 498
Query: 465 AMITCLDQQGQSGESIKLFHLMRSSSTRPNHYTICSLVSAATNMEDYRYGRSIHACVWKY 524
+I Q E+ LF M + + S + A T++ G+ +H K
Sbjct: 499 TIIGSYVQDENESEAFDLFKRMNLCGIVSDGACLASTLKACTHVHGLYQGKQVHCLSVKC 558
Query: 525 GFETDISINNALVTMYMKSGCVNEGARLFESMIERDLVSWNTYLSGFHDSGMYDRSLTIF 584
G + D+ ++L+ MY K G + + ++F S+ E +VS N ++G+ + + + ++ +F
Sbjct: 559 GLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNALIAGYSQNNL-EEAVVLF 618
Query: 585 GHLLEDGFIPNMYTFIGILRSCSCFLDVHFGRQVHTHIIKNDL-DDNDFVQTALIDMYAK 644
+L G P+ TF I+ +C + G Q H I K + +++ +L+ MY
Sbjct: 619 QEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRGFSSEGEYLGISLLGMYMN 678
Query: 645 CMCMEDADVAFNRLSS-RDLFTWTVIITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAG 704
M +A F+ LSS + + WT +++ H+Q E+AL ++++M+ +GV P++ T
Sbjct: 679 SRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKFYKEMRHDGVLPDQATFVT 738
Query: 705 CLSGCSSLASLEGGQQLHSMAFKSGHLSDMFVGSALVDMYAKCGCMEEAEMLFEALICR- 764
L CS L+SL G+ +HS+ F H D + L+DMYAKCG M+ + +F+ + R
Sbjct: 739 VLRVCSVLSSLREGRAIHSLIFHLAHDLDELTSNTLIDMYAKCGDMKGSSQVFDEMRRRS 798
Query: 765 DTVAWNTIICGYSQNGQGNKALEAFQMMLDEGISPDEVTFIGILSACSHQGLVEEGKKHF 824
+ V+WN++I GY++NG AL+ F M I PDE+TF+G+L+ACSH G V +G+K F
Sbjct: 799 NVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITFLGVLTACSHAGKVSDGRKIF 858
Query: 825 NSMYRDFGISLTVNHCACMVDILGRVGKFDELEDFIKKMQLSQHALIWETVLGACKMHGN 884
M +GI V+H ACMVD+LGR G E +DFI+ L A +W ++LGAC++HG+
Sbjct: 859 EMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQNLKPDARLWSSLLGACRIHGD 918
Query: 885 LALGEKAGNKLIDLQPEKETNYILLSNIFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVE 944
GE + KLI+L+P+ + Y+LLSNI+A++G W+ +R +M +GVKK PG SW++
Sbjct: 919 DIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANALRKVMRDRGVKKVPGYSWID 978
Query: 945 ANGQAHTFVSHDCSHPQIQEIHLKLEEL 970
+ H F + D SH +I +I + LE+L
Sbjct: 979 VEQRTHIFAAGDKSHSEIGKIEMFLEDL 1005
BLAST of CmaCh08G003410 vs. TAIR 10
Match:
AT1G16480.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 515.4 bits (1326), Expect = 1.2e-145
Identity = 268/809 (33.13%), Postives = 454/809 (56.12%), Query Frame = 0
Query: 214 IHGLVVKDVINPDSHLWVSLVNVYAKCRYSAYARLVLAKMPDRDVVSWTALIQGLVAEGF 273
+HG V K + D ++ +++++Y + +R V +MPDR+VVSWT+L+ G +G
Sbjct: 81 VHGFVAKSGLLSDVYVSTAILHLYGVYGLVSCSRKVFEEMPDRNVVSWTSLMVGYSDKGE 140
Query: 274 VNDSIYLFQEMQNEGIMPNEFTLATGLKACSLCMALDLGKQMHAQAFKLGLLLDLFVGSA 333
+ I +++ M+ EG+ NE +++ + +C L LG+Q+ Q K GL L V ++
Sbjct: 141 PEEVIDIYKGMRGEGVGCNENSMSLVISSCGLLKDESLGRQIIGQVVKSGLESKLAVENS 200
Query: 334 LVDLYSKCGEMELASRMFFGIPEQNEVTWNVLLNGYAQAGDGIGVLKLFCCMMESDVKSS 393
L+ + G ++ A+ +F + E++ ++WN + YAQ G ++F M + +
Sbjct: 201 LISMLGSMGNVDYANYIFDQMSERDTISWNSIAAAYAQNGHIEESFRIFSLMRRFHDEVN 260
Query: 394 KFTLTTVLKGCANSKNLRQGQVIHSLIIKYGYEGDEFLGCGLVDTYSKCGMAIDALEVFK 453
T++T+L + + + G+ IH L++K G++ + L+ Y+ G +++A VFK
Sbjct: 261 STTVSTLLSVLGHVDHQKWGRGIHGLVVKMGFDSVVCVCNTLLRMYAGAGRSVEANLVFK 320
Query: 454 KIKKPDIVVWSAMITCLDQQGQSGESIKLFHLMRSSSTRPNHYTICSLVSAATNMEDYRY 513
++ D++ W++++ G+S +++ L M SS N+ T S ++A + +
Sbjct: 321 QMPTKDLISWNSLMASFVNDGRSLDALGLLCSMISSGKSVNYVTFTSALAACFTPDFFEK 380
Query: 514 GRSIHACVWKYGFETDISINNALVTMYMKSGCVNEGARLFESMIERDLVSWNTYLSGFHD 573
GR +H V G + I NALV+MY K G ++E R+ M RD+V+WN + G+ +
Sbjct: 381 GRILHGLVVVSGLFYNQIIGNALVSMYGKIGEMSESRRVLLQMPRRDVVAWNALIGGYAE 440
Query: 574 SGMYDRSLTIFGHLLEDGFIPNMYTFIGILRSCSCFLD-VHFGRQVHTHIIKNDLDDNDF 633
D++L F + +G N T + +L +C D + G+ +H +I+ + ++
Sbjct: 441 DEDPDKALAAFQTMRVEGVSSNYITVVSVLSACLLPGDLLERGKPLHAYIVSAGFESDEH 500
Query: 634 VQTALIDMYAKCMCMEDADVAFNRLSSRDLFTWTVIITSHAQTNQGEKALSYFRQMQQEG 693
V+ +LI MYAKC + + FN L +R++ TW ++ ++A GE+ L +M+ G
Sbjct: 501 VKNSLITMYAKCGDLSSSQDLFNGLDNRNIITWNAMLAANAHHGHGEEVLKLVSKMRSFG 560
Query: 694 VKPNEFTLAGCLSGCSSLASLEGGQQLHSMAFKSGHLSDMFVGSALVDMYAKCGCMEEAE 753
V ++F+ + LS + LA LE GQQLH +A K G D F+ +A DMY+KCG + E
Sbjct: 561 VSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFEHDSFIFNAAADMYSKCGEIGEVV 620
Query: 754 MLFEALICRDTVAWNTIICGYSQNGQGNKALEAFQMMLDEGISPDEVTFIGILSACSHQG 813
+ + R +WN +I ++G + F ML+ GI P VTF+ +L+ACSH G
Sbjct: 621 KMLPPSVNRSLPSWNILISALGRHGYFEEVCATFHEMLEMGIKPGHVTFVSLLTACSHGG 680
Query: 814 LVEEGKKHFNSMYRDFGISLTVNHCACMVDILGRVGKFDELEDFIKKMQLSQHALIWETV 873
LV++G +++ + RDFG+ + HC C++D+LGR G+ E E FI KM + + L+W ++
Sbjct: 681 LVDKGLAYYDMIARDFGLEPAIEHCICVIDLLGRSGRLAEAETFISKMPMKPNDLVWRSL 740
Query: 874 LGACKMHGNLALGEKAGNKLIDLQPEKETNYILLSNIFATKGKWDDVKRVRTLMSSKGVK 933
L +CK+HGNL G KA L L+PE ++ Y+L SN+FAT G+W+DV+ VR M K +K
Sbjct: 741 LASCKIHGNLDRGRKAAENLSKLEPEDDSVYVLSSNMFATTGRWEDVENVRKQMGFKNIK 800
Query: 934 KEPGCSWVEANGQAHTFVSHDCSHPQIQEIHLKLEELDKELTAIGYVPKTEYVLHNVEET 993
K+ CSWV+ + +F D +HPQ EI+ KLE++ K + GYV T L + +E
Sbjct: 801 KKQACSWVKLKDKVSSFGIGDRTHPQTMEIYAKLEDIKKLIKESGYVADTSQALQDTDEE 860
Query: 994 EKREYLRFHSERLALAFALINTSATKKIR 1022
+K L HSERLALA+AL++T +R
Sbjct: 861 QKEHNLWNHSERLALAYALMSTPEGSTVR 889
BLAST of CmaCh08G003410 vs. TAIR 10
Match:
AT1G16480.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 515.4 bits (1326), Expect = 1.2e-145
Identity = 268/809 (33.13%), Postives = 454/809 (56.12%), Query Frame = 0
Query: 214 IHGLVVKDVINPDSHLWVSLVNVYAKCRYSAYARLVLAKMPDRDVVSWTALIQGLVAEGF 273
+HG V K + D ++ +++++Y + +R V +MPDR+VVSWT+L+ G +G
Sbjct: 64 VHGFVAKSGLLSDVYVSTAILHLYGVYGLVSCSRKVFEEMPDRNVVSWTSLMVGYSDKGE 123
Query: 274 VNDSIYLFQEMQNEGIMPNEFTLATGLKACSLCMALDLGKQMHAQAFKLGLLLDLFVGSA 333
+ I +++ M+ EG+ NE +++ + +C L LG+Q+ Q K GL L V ++
Sbjct: 124 PEEVIDIYKGMRGEGVGCNENSMSLVISSCGLLKDESLGRQIIGQVVKSGLESKLAVENS 183
Query: 334 LVDLYSKCGEMELASRMFFGIPEQNEVTWNVLLNGYAQAGDGIGVLKLFCCMMESDVKSS 393
L+ + G ++ A+ +F + E++ ++WN + YAQ G ++F M + +
Sbjct: 184 LISMLGSMGNVDYANYIFDQMSERDTISWNSIAAAYAQNGHIEESFRIFSLMRRFHDEVN 243
Query: 394 KFTLTTVLKGCANSKNLRQGQVIHSLIIKYGYEGDEFLGCGLVDTYSKCGMAIDALEVFK 453
T++T+L + + + G+ IH L++K G++ + L+ Y+ G +++A VFK
Sbjct: 244 STTVSTLLSVLGHVDHQKWGRGIHGLVVKMGFDSVVCVCNTLLRMYAGAGRSVEANLVFK 303
Query: 454 KIKKPDIVVWSAMITCLDQQGQSGESIKLFHLMRSSSTRPNHYTICSLVSAATNMEDYRY 513
++ D++ W++++ G+S +++ L M SS N+ T S ++A + +
Sbjct: 304 QMPTKDLISWNSLMASFVNDGRSLDALGLLCSMISSGKSVNYVTFTSALAACFTPDFFEK 363
Query: 514 GRSIHACVWKYGFETDISINNALVTMYMKSGCVNEGARLFESMIERDLVSWNTYLSGFHD 573
GR +H V G + I NALV+MY K G ++E R+ M RD+V+WN + G+ +
Sbjct: 364 GRILHGLVVVSGLFYNQIIGNALVSMYGKIGEMSESRRVLLQMPRRDVVAWNALIGGYAE 423
Query: 574 SGMYDRSLTIFGHLLEDGFIPNMYTFIGILRSCSCFLD-VHFGRQVHTHIIKNDLDDNDF 633
D++L F + +G N T + +L +C D + G+ +H +I+ + ++
Sbjct: 424 DEDPDKALAAFQTMRVEGVSSNYITVVSVLSACLLPGDLLERGKPLHAYIVSAGFESDEH 483
Query: 634 VQTALIDMYAKCMCMEDADVAFNRLSSRDLFTWTVIITSHAQTNQGEKALSYFRQMQQEG 693
V+ +LI MYAKC + + FN L +R++ TW ++ ++A GE+ L +M+ G
Sbjct: 484 VKNSLITMYAKCGDLSSSQDLFNGLDNRNIITWNAMLAANAHHGHGEEVLKLVSKMRSFG 543
Query: 694 VKPNEFTLAGCLSGCSSLASLEGGQQLHSMAFKSGHLSDMFVGSALVDMYAKCGCMEEAE 753
V ++F+ + LS + LA LE GQQLH +A K G D F+ +A DMY+KCG + E
Sbjct: 544 VSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFEHDSFIFNAAADMYSKCGEIGEVV 603
Query: 754 MLFEALICRDTVAWNTIICGYSQNGQGNKALEAFQMMLDEGISPDEVTFIGILSACSHQG 813
+ + R +WN +I ++G + F ML+ GI P VTF+ +L+ACSH G
Sbjct: 604 KMLPPSVNRSLPSWNILISALGRHGYFEEVCATFHEMLEMGIKPGHVTFVSLLTACSHGG 663
Query: 814 LVEEGKKHFNSMYRDFGISLTVNHCACMVDILGRVGKFDELEDFIKKMQLSQHALIWETV 873
LV++G +++ + RDFG+ + HC C++D+LGR G+ E E FI KM + + L+W ++
Sbjct: 664 LVDKGLAYYDMIARDFGLEPAIEHCICVIDLLGRSGRLAEAETFISKMPMKPNDLVWRSL 723
Query: 874 LGACKMHGNLALGEKAGNKLIDLQPEKETNYILLSNIFATKGKWDDVKRVRTLMSSKGVK 933
L +CK+HGNL G KA L L+PE ++ Y+L SN+FAT G+W+DV+ VR M K +K
Sbjct: 724 LASCKIHGNLDRGRKAAENLSKLEPEDDSVYVLSSNMFATTGRWEDVENVRKQMGFKNIK 783
Query: 934 KEPGCSWVEANGQAHTFVSHDCSHPQIQEIHLKLEELDKELTAIGYVPKTEYVLHNVEET 993
K+ CSWV+ + +F D +HPQ EI+ KLE++ K + GYV T L + +E
Sbjct: 784 KKQACSWVKLKDKVSSFGIGDRTHPQTMEIYAKLEDIKKLIKESGYVADTSQALQDTDEE 843
Query: 994 EKREYLRFHSERLALAFALINTSATKKIR 1022
+K L HSERLALA+AL++T +R
Sbjct: 844 QKEHNLWNHSERLALAYALMSTPEGSTVR 872
BLAST of CmaCh08G003410 vs. TAIR 10
Match:
AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 507.7 bits (1306), Expect = 2.4e-143
Identity = 282/910 (30.99%), Postives = 470/910 (51.65%), Query Frame = 0
Query: 187 SSKEKLKYYSWMLHECASNRSLGAAKAIHGLVVKDVINPDSHLWVSLVNVYAKCRYSAYA 246
SS + L ++ L K H ++ NP+ L +L+++Y+KC YA
Sbjct: 34 SSSSSSSQWFGFLRNAITSSDLMLGKCTHARILTFEENPERFLINNLISMYSKCGSLTYA 93
Query: 247 RLVLAKMPDRDVVSWTALIQGLVAEG-----FVNDSIYLFQEMQNEGIMPNEFTLATGLK 306
R V KMPDRD+VSW +++ + + LF+ ++ + + + TL+ LK
Sbjct: 94 RRVFDKMPDRDLVSWNSILAAYAQSSECVVENIQQAFLLFRILRQDVVYTSRMTLSPMLK 153
Query: 307 ACSLCMALDLGKQMHAQAFKLGLLLDLFVGSALVDLYSKCGEMELASRMFFGIPEQNEVT 366
C + + H A K+GL D FV ALV++Y K G+++ +F +P ++ V
Sbjct: 154 LCLHSGYVWASESFHGYACKIGLDGDEFVAGALVNIYLKFGKVKEGKVLFEEMPYRDVVL 213
Query: 367 WNVLLNGYAQ-------------------------------------------------- 426
WN++L Y +
Sbjct: 214 WNLMLKAYLEMGFKEEAIDLSSAFHSSGLNPNEITLRLLARISGDDSDAGQVKSFANGND 273
Query: 427 -------------------AGDGIGVLKLFCCMMESDVKSSKFTLTTVLKGCANSKNLRQ 486
+G +LK F M+ESDV+ + T +L +L
Sbjct: 274 ASSVSEIIFRNKGLSEYLHSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLAL 333
Query: 487 GQVIHSLIIKYGYEGDEFLGCGLVDTYSKCGMAIDALEVFKKIKKPDIVVWSAMITCLDQ 546
GQ +H + +K G + + L++ Y K A VF + + D++ W+++I + Q
Sbjct: 334 GQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQ 393
Query: 547 QGQSGESIKLFHLMRSSSTRPNHYTICSLVSAATNM-EDYRYGRSIHACVWKYGFETDIS 606
G E++ LF + +P+ YT+ S++ AA+++ E + +H K +D
Sbjct: 394 NGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSF 453
Query: 607 INNALVTMYMKSGCVNEGARLFESMIERDLVSWNTYLSGFHDSGMYDRSLTIFGHLLEDG 666
++ AL+ Y ++ C+ E LFE DLV+WN ++G+ S ++L +F + + G
Sbjct: 454 VSTALIDAYSRNRCMKEAEILFERH-NFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQG 513
Query: 667 FIPNMYTFIGILRSCSCFLDVHFGRQVHTHIIKNDLDDNDFVQTALIDMYAKCMCMEDAD 726
+ +T + ++C ++ G+QVH + IK+ D + +V + ++DMY KC M A
Sbjct: 514 ERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQ 573
Query: 727 VAFNRLSSRDLFTWTVIITSHAQTNQGEKALSYFRQMQQEGVKPNEFTLAGCLSGCSSLA 786
AF+ + D WT +I+ + + E+A F QM+ GV P+EFT+A S L
Sbjct: 574 FAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLT 633
Query: 787 SLEGGQQLHSMAFKSGHLSDMFVGSALVDMYAKCGCMEEAEMLFEALICRDTVAWNTIIC 846
+LE G+Q+H+ A K +D FVG++LVDMYAKCG +++A LF+ + + AWN ++
Sbjct: 634 ALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLV 693
Query: 847 GYSQNGQGNKALEAFQMMLDEGISPDEVTFIGILSACSHQGLVEEGKKHFNSMYRDFGIS 906
G +Q+G+G + L+ F+ M GI PD+VTFIG+LSACSH GLV E KH SM+ D+GI
Sbjct: 694 GLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIK 753
Query: 907 LTVNHCACMVDILGRVGKFDELEDFIKKMQLSQHALIWETVLGACKMHGNLALGEKAGNK 966
+ H +C+ D LGR G + E+ I+ M + A ++ T+L AC++ G+ G++ K
Sbjct: 754 PEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATK 813
Query: 967 LIDLQPEKETNYILLSNIFATKGKWDDVKRVRTLMSSKGVKKEPGCSWVEANGQAHTFVS 1022
L++L+P + Y+LLSN++A KWD++K RT+M VKK+PG SW+E + H FV
Sbjct: 814 LLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVV 873
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FIB2 | 3.3e-145 | 36.89 | Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th... | [more] |
Q9SS83 | 4.3e-145 | 35.43 | Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop... | [more] |
Q9SMZ2 | 3.4e-142 | 30.99 | Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... | [more] |
Q9SVA5 | 4.9e-141 | 32.91 | Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX... | [more] |
Q9SVP7 | 1.4e-140 | 31.20 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... | [more] |
Match Name | E-value | Identity | Description | |
AT5G09950.1 | 2.3e-146 | 36.89 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT3G09040.1 | 3.0e-146 | 35.43 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G16480.1 | 1.2e-145 | 33.13 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G16480.2 | 1.2e-145 | 33.13 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT4G33170.1 | 2.4e-143 | 30.99 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |