Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAAGAAAAATTATGTTTCTGTCAAACTTTTCGAAGACGAAGATGTTTACACCTGGATGATGTGAAGGCTGTCTTTGATGATCCTGAGGTTGGTAAGGATTAGAGTACTATATTGGCATCTGTGAACTGGGATTTATTGAACTCTGGGTATCGTTAAATTTGAAGATTAGTTAGGGATTCTATTCTTATTTTCATATCATTTGTTGGCAAGCTCTTGTTGAGGATTATTGGGAGTGAGTCCCACATTGGTTAATTTAGTGGAAGATCATGGGTTTATAAGTGAGAAATACTGTCTCCATTGGTATGAGGTCTTTTGGAGAAGCCCAAAGCAAAGCTATATATAAACAAATATCATATCATCACGGCCATCTCACCTAGCTATAACTTTAGTTCCTATCCATTCATTACTTTCAAATAAATCCATGATAAAAATGATGAATTTACTATTATGGGTTAAATGTTTCCAGCCAACCTGTTGTCTACCAACATAGTTTCCCCATTATTGAGACATAGAAGCTCACATTATGTGCTTATGTCCTTTTACAAGGATTGTTTTTCCTTTTCTTTTGGTTTATATTTTTTGCCTATGCTAAGATTTTAGTCTCTAAATTTTAGTCTCTACAGACAGTTTATTTAAATCCTAATTGTTGTAAATACGTGGTTGAAGGTGGTGGCGTTCAAGTGTGGTGGTTATTATTATCTACTCCTGTCTCACAAAATTTTTCCATAGGAACAAAGGACCGAGGAACAACGTGGAAACGAGCAGCAAATGAGCCCTCTGATCCTTGATGCATTTGATTTAATAGTTCTATCTCAAGGCTTAAACCTGGGAGCGATGTTTGACCGTGGCCAGGTATGGTTGTGTTTTATTAAACTCCTTTTTTTCATCTAAAACATGGAGCCATATAGAGCCAATTTGCCTCCATTGATTCTGCTAAAAGCTACTACAAATTGTTAATTTGTTTTTATTCTTGTATTCCAACACAACATTAAGCATGTAAAGTCCTTATTTCAGACTCGCAATATCATGCATTTCAAAATATGTTTGTATAAGAAATCGTAATATGCACATCTTGCTCGAAAAGTGATTGTTGAATTTCTTCTGATCATGGTAGGATTCTATGAAGTACCCAACACGCTTTGTCAGCCAGAAGCCTGCAAAGGTTTTATTGTCAACTGTGGAAGTTGTAGCTCAATCAATGGGTTTCAAGACGCACATTCGCAATTACAAGGTAAACATTTGGTTAGTTCTTACCTGAATTCTTGCAATTCAACTTCATGCCCCTCCCTGTCTGGCTACCCATCTTATCGTTATAAACACATGGAATAAGCCAGCTTCCTCCTGTTCTGCATAATCCTAAGTTAAATTAGAAGTTCAGTATTCCATAATCTAACTTAAATTATAAGTATAGTCTAGGAGTTTTAAAAATCTCTCTAAATAACGTCTATTAGGTTCTTGATATATTTGTAACTTTTAAAAATTAATTGATTTATGTTTAATGGAACTGCATTTTCAATTCGCCTTAATAGATTCATGAATTTTGTAAAATATCAAATAGATAAATGACCTATTAGACACAAAATTCAAAGTTTAAGAACTTATTAGACACGTGTTAAAGTTTAAAGACTTATCAAACATTGTCTTGAAAGTTTGATAAACTATTTGACATATTTTAATGTTCAAAGATCAAATATACACGAACAAATCTGAACATTAAAGAATTAAACTCGTAATTTAACCTAATTTTTTCAAAGTTTCTGGTGCAAATATCAGATGAGAGTAGAAGGTCCATCAGCAAGCAAAACTTCGTATTTCTCAGTCATTATGGAAGTAAGCTGTATCTTTTTCTGCCAAAAAGTTGTACAATTTCCATTTTCTGTTAAGGATTTTGATCGGAATCATTTTTCCCATAGATTTTTGAAGTTGCTCCCACATAGTTTATGGTGGACATTAAGAAAGCAGCAGGGGAAACTAGCGAGTACATTTGTTAGGATCAATACCAATTGTTAGCTCTGATACCAATTGTTAAGATCGCTCAACAACTCACACTCATCAAATGAACACAAAGAACAGAGAGAGAAAATACAAGGAAGAATATTGGCTAATGTTTTATTATTGAGACTTCAGTGACGAGAGAATACCTAGGGGTATATATATGGTTTCAAATATATGTCTCTCAAATCTTTACCAAATATCTACCATATATATCTCTATCATACAAATATCTTCCAAATATATCTCTAACGCATCAGGCTCCATATCCCAACATCATTAACATCCAGAATTCTTTTACCCTACTTATTTGTTAAATGCAGTTCTACAAGAGCTTATATAGCAATCTTGAAGATATCATCTGGAAACCTTCAATTGACACCAGCAAATCAAGGATCGCCAAGAACAAGAGTAAGAAACGTTGATTTTGTATGAATTATCTAACCAAAGGAAGACCAAAGCAGTATGATTTTGAGATATGTTTACCGTTCTTTAATTAATTAGACCTTGCCTGCCCTTCTTTTACAGTAGTGAAGAGAACAAATCCTACACATTTGTATATTACCATCATCTAGTTTTATTAACATCTCTTTGCAAATTAGTAGATCGACGTCGTGCCGTTCTTTCGCCGAATAGAGAAGCAACAATGGCGTCGACATTAAACTGTCTTGGTATTTGCCTGAAGCTCATCGATATCATCTGGTTATTCTCTCAATCCTACTTTCTCAAATTAGCTTCATATGAGTATTTTGACAGTTGAATATGTTAATAGGCTTGTAAAGAATCTATGTTTATATCTCACCCAAGCACATAAAATATCTCCAAAATTTCCACCAGAACAGTACCGTTCAAAAACTTGTAAACCTGCTTGTTATCTCGGATGACTATTTCCCTACCTTATAAAAAGAAATGAATCCGTGTTGAGGATTGTTGGGAGTGAGTACGTGACTAATTTAGAGAATGGTCATCACTTTATAATTAAAGTAATAGATCTCCATTGGTATGAGACCTTTTGGGAAGCCCAAAGCAAAACCATGAGAATTTATGCTCAAAGTAAAACTATTGTAGAGATCTGTGATTCTTAAGGTGATGCTCTTAGAAATTAATTTCATAATTCCGATCTCTTGATGTTAATACAAAGAACAGGGGAGAGAAAATAGGAGAAATATTTGTTGGAGGTTTATATTGATGGGGAAGTACATCTTCAAACACCCCAACTATATATATACCTTTATCAGATTAGTCTATAACTGGTTTAGTTTACTATCATCTACTGTATATAGGTTTATATGTCATTTACCATATATAATAAATATATAACCAAATTATAAATAATCATTAGACTCCATAACCTACGCTGGATAGCTCAATATAAACTATATAATAAATTTAACTCGTTTTCCGTAGTTAGAAAACATACTAAATTTTTTGTGGATGATCAATAAGATTGGAGGGAGAATAAAAATGTGATAGTGTTAAAGATATTTAGTAATAAATTATAGCTTATCATATATATTATATTTGATATTTTATCTTTCCATTTATTGTTATTTTTATTTATTACTCTTTTCATATCCTTATAATTTATTTGATTATAAATAAGATATCTTTACACCTTATGTGTTATGTGTGGTGAATTAATCAAACATTCTTATGGTATCAGAGCCCTCTCGTTTTAACAACTCCGATTCTTTCTTTGAATCCTCTAATCTTCTTCTCCACCACCTACTGGCTTTGACGCCGCAGGGGTAGTCTTCATGGCTGCCGTAGGCGAAACTGATTGATTACTCTTCGGCTTTACCCTATTGACCTCCAAATCACCTTTAGTCATAATCTGTACATACAATCTCACATTTTTCAACCCAATCGGTTTTCCCATGGGCTAATATAGTCTCTCTCATTTCCTTCCCAATCAACCCTTTAAACCCTAATCTCAGGGAGTTCCTTCCCAATCAACCCTTTAAACCCTAATCTCAGGGATTCTCACCAATGTACAGAAACCCTAATCTCATGGATCCTCATCAATGTTCTCTTAAGGATCTTCATCAATGTTCTCTTAAGAAATATATGGAATCTAGACAAGACCATCAACCTTTTGCAGAGATTTACCTTCGGAATCTAGGTGCCGCACTAGGCTCTTCACCAACCAAGTTACAGCCGCCCTTCCTGCCAAAATTGGCCCCTTCGCCCATTGTGGCGAACTCTTCTAACTACCCTTCTCGCCAAAATTGGCTCCTTCGCCCAATGTGGTGACTCTTCTAGCTTCAACCTTGTCTGTCAGTTGCTAGAGGTTTCCCCTTCTACTTCTTGGGGTTAGACATATTCTCACCAGAGCCAAGGCAACATCGTCGCTAAGTGACAATCTCTCTTCATGGGTTAAACATATTCTAGCCGAAGCCAAGGCAGCATCACCGCTAACCCTTTGCAATTTTTGATGACCAAGAAAGTGTTTCTCTACGCCTCCTCTGAACCAGGTTATTGACATCTTCACGAAAAGTGTTTCTCAACCTCTCTTTGAATTTTTCAGATTCAAGCTTTAGGTTCGTTTAAATCCGACGGTTAGCTTGCGGTGGGGTGTTAAAAATATTTAGTAATAAATTATGGTTTACCATATATATTATATTTGATATTTTATTATAAGACAAATATTAGATATCTAACATTTGCCTTGTTACTCTAGGTATCTTTCCATTTATTGGGGTTAAGAATATTTAGTAATAAATTATAGTTTACTATATATATTATATTTGATATTTAATATTTGTCGGGTTACTATATCTTTCTATTTATTACTCCTCCTATATTCTTGTAATTTATTTGATTATATATAAAATAACTTTCAAATATATTAATCGATGATTTGACATAATAATTAATTTGATAGGTGATTTTTTTTTTCATTAAATAAACTAAAGGCATAAAAATAATTTTATCATCGTGCTTGCGTCATAATTATTTTCCTGGACGACGACAGGCCCAGAAACTGGATAAGCAAACACGCCGCCTCCCCACTCTCCCACCTCCATCTTTCCCTCTTCAATTTTTACACTCTGTAAGTTTCCCCTTCTCTGGTTCTCTCTTTTTGCAAAACCCTCAATCTCTGCCCTTTCATACACTGAAAAAACAAGGCCCCATTGGCCTCCGCTTCTTCAATTTCTTCACAGTCGATTAAAGTGGATCTCGAAGGATCAAAACCCTTTTGTTTCCTTCAAGTTGGAGGAGTATGGATATAACGTTGCATGCAAAGCATTACACGGATGGATTCTGTTTGTTTCAGCGTCGCAGCACCAAAAATAGCAGCCGTAATGTTAGGGCCGAAAGAAGGGCTACTCCGGAGACCAATTCTGTAAGTTTTTCTGTGGGAGGAAAAGGGAAACCTCGATTTCTTGTTATTCCTTCTGATTGTTCTGAAGAATCCTTTGTTCGTGCGGTTCTGAAGAATCCTTTGCAACAGGGGGAGAAAAATCTTCACACCCATTTGAATGGCTCTAGTTCTTCATCATTTTCTTCGAATCATTCGCAGAGTTTTGAGGAATTTGAGAACAATAATCATCTTCGTCGACTGGTAAGAAATGGGGAATTGGAAGAAGGGTTTAAATTCATAGAGGGTATGGTTTATCGTGGTGATATTCCTGATGCAATTGTGTGTACTAGTTTGATTCGTGGTTTGTGTAAAACTGGAAAAACTAGAAAGGCTGCTCGAGTTATGGAAATTTTGGAGGATTCTGGGGCTGTTCCTGATGTGATAACTTACAATGTTCTTATTAGTGGTTACTGTAAATCTGGTGAAATTGACAATGCTTTGAAACTTTTGGATCGAATGAGTATTTCTCCTGATGTTATTACTTATAATATAATCTTGCGTACGCTCTGTGACAGTGGGAAGTTGAAGGAAGCCATGGAAGTTCTTCATAGACAGTTGCAGAGGGAGTGTTATCCTGATGTAATAACTTATACTATATTGATTGAAGCAACTTGTAAGGAGAGTGGAGTTGGGCAGGCGATGAAATTGTTGGATGAAATGAGGGAGAAAGGATGTAAGCCTGATGTTGTCACTTACAATGTTCTTATAAATGGGATTTGCAAGGAAGGAAGGTTGGATGAAGCTATTGAGTTTTTGAATGAAATGTCTTCTTATGGCTGCCAACCTAATGTAATCACTCATAACATCATCTTGCGTAGTATGTGTAGTACGGGGAGGTGGACGGATGCCGAGAAGCTGTTGGCTGAAATGGTTCGTAAGGGATGTTCTCCTAGTGTTGTTACTTTCAATATCTTGATTAATTTCTTGTGTCGAAAGGGGTTGTTGGGTCGAGCTATTGATATTTTGGAGAAGATGCCACAGCATGGCTGTACTCCGAATTCCTCGAGCTACAACCCGTTGCTTCATGGGTTTTGTAAAGAGAAGAAGATGGAGCGGGCGATCGAGTATTTGGATATCATGGCTTCTAGAGGTTGTTACCCTGATATTGTGACGTATAATACTCTATTGACTGCATTGTGTAAAGATGGGAAGGTAGATGTTGCTGTTGAGATACTGAATCAACTTGGTACTAAAAGTTGCTCTCCTGTATTGATTACATACAACACAGTGATTGATGGGCTATCAAAGGCAGGCAAAACAGAAGATGCTGTTAAACTTCTAGACGAGATGAAGGAAAAAGGACTCAAACCCGATATAATTACATACTCGTCGCTCATTGGAGGACTGTGCAGGGAAGGAAAAGTTGATGAAGCAATTGCATTTTTCCATGATCTAGAAGAAGTTGGTGTGAGGCCAAATGTTATCACATACAACTCTATCATGTTAGGACTCTGTAAGGTTCAACAAACTGTTCGTGCCATCGATTTCTTGGCATCGATGGTTGCCAGAGGCTGTAAACCGAACGAGGCTTCGTACATGATTCTTATTGAGGGGTTGGCCTATGAAGGTTTAGCCAAGGAGGCATTGGAGTTGCTTGATGAATTGTGCTCTAGAGGAGTTATGAAGAAGAGTTCTGCTGAGAGGGTAGTGATCAAAAACTCTTTTTGA
mRNA sequence
ATGGTTCAAGAAAAATTATGTTTCTGTCAAACTTTTCGAAGACGAAGATGTTTACACCTGGATGATGTGAAGGCTGTCTTTGATGATCCTGAGGAACAAAGGACCGAGGAACAACGTGGAAACGAGCAGCAAATGAGCCCTCTGATCCTTGATGCATTTGATTTAATAGTTCTATCTCAAGGCTTAAACCTGGGAGCGATGTTTGACCGTGGCCAGGATTCTATGAAGTACCCAACACGCTTTGTCAGCCAGAAGCCTGCAAAGGTTTTATTGTCAACTGTGGAAGTTGTAGCTCAATCAATGGGTTTCAAGACGCACATTCGCAATTACAAGATGAGAGTAGAAGGTCCATCAGCAAGCAAAACTTCGTATTTCTCAGTCATTATGGAATTCTACAAGAGCTTATATAGCAATCTTGAAGATATCATCTGGAAACCTTCAATTGACACCAGCAAATCAAGGATCGCCAAGAACAAGAATCGACGTCGTGCCGTTCTTTCGCCGAATAGAGAAGCAACAATGGCGTCGACATTAAACTGTCTTGGTATTTGCCTGAAGCTCATCGATATCATCTGGCCCAGAAACTGGATAAGCAAACACGCCGCCTCCCCACTCTCCCACCTCCATCTTTCCCTCTTCAATTTTTACACTCTGATCAAAACCCTTTTGTTTCCTTCAAGTTGGAGGAGTATGGATATAACGTTGCATGCAAAGCATTACACGGATGGATTCTGTTTGTTTCAGCGTCGCAGCACCAAAAATAGCAGCCGTAATGTTAGGGCCGAAAGAAGGGCTACTCCGGAGACCAATTCTGTAAGTTTTTCTGTGGGAGGAAAAGGGAAACCTCGATTTCTTGTTATTCCTTCTGATTGTTCTGAAGAATCCTTTGTTCGTGCGGTTCTGAAGAATCCTTTGCAACAGGGGGAGAAAAATCTTCACACCCATTTGAATGGCTCTAGTTCTTCATCATTTTCTTCGAATCATTCGCAGAGTTTTGAGGAATTTGAGAACAATAATCATCTTCGTCGACTGGTAAGAAATGGGGAATTGGAAGAAGGGTTTAAATTCATAGAGGGTATGGTTTATCGTGGTGATATTCCTGATGCAATTGTGTGTACTAGTTTGATTCGTGGTTTGTGTAAAACTGGAAAAACTAGAAAGGCTGCTCGAGTTATGGAAATTTTGGAGGATTCTGGGGCTGTTCCTGATGTGATAACTTACAATGTTCTTATTAGTGGTTACTGTAAATCTGGTGAAATTGACAATGCTTTGAAACTTTTGGATCGAATGAGTATTTCTCCTGATGTTATTACTTATAATATAATCTTGCGTACGCTCTGTGACAGTGGGAAGTTGAAGGAAGCCATGGAAGTTCTTCATAGACAGTTGCAGAGGGAGTGTTATCCTGATGTAATAACTTATACTATATTGATTGAAGCAACTTGTAAGGAGAGTGGAGTTGGGCAGGCGATGAAATTGTTGGATGAAATGAGGGAGAAAGGATGTAAGCCTGATGTTGTCACTTACAATGTTCTTATAAATGGGATTTGCAAGGAAGGAAGGTTGGATGAAGCTATTGAGTTTTTGAATGAAATGTCTTCTTATGGCTGCCAACCTAATGTAATCACTCATAACATCATCTTGCGTAGTATGTGTAGTACGGGGAGGTGGACGGATGCCGAGAAGCTGTTGGCTGAAATGGTTCGTAAGGGATGTTCTCCTAGTGTTGTTACTTTCAATATCTTGATTAATTTCTTGTGTCGAAAGGGGTTGTTGGGTCGAGCTATTGATATTTTGGAGAAGATGCCACAGCATGGCTGTACTCCGAATTCCTCGAGCTACAACCCGTTGCTTCATGGGTTTTGTAAAGAGAAGAAGATGGAGCGGGCGATCGAGTATTTGGATATCATGGCTTCTAGAGGTTGTTACCCTGATATTGTGACGTATAATACTCTATTGACTGCATTGTGTAAAGATGGGAAGGTAGATGTTGCTGTTGAGATACTGAATCAACTTGGTACTAAAAGTTGCTCTCCTGTATTGATTACATACAACACAGTGATTGATGGGCTATCAAAGGCAGGCAAAACAGAAGATGCTGTTAAACTTCTAGACGAGATGAAGGAAAAAGGACTCAAACCCGATATAATTACATACTCGTCGCTCATTGGAGGACTGTGCAGGGAAGGAAAAGTTGATGAAGCAATTGCATTTTTCCATGATCTAGAAGAAGTTGGTGTGAGGCCAAATGTTATCACATACAACTCTATCATGTTAGGACTCTGTAAGGTTCAACAAACTGTTCGTGCCATCGATTTCTTGGCATCGATGGTTGCCAGAGGCTGTAAACCGAACGAGGCTTCGTACATGATTCTTATTGAGGGGTTGGCCTATGAAGGTTTAGCCAAGGAGGCATTGGAGTTGCTTGATGAATTGTGCTCTAGAGGAGTTATGAAGAAGAGTTCTGCTGAGAGGGTAGTGATCAAAAACTCTTTTTGA
Coding sequence (CDS)
ATGGTTCAAGAAAAATTATGTTTCTGTCAAACTTTTCGAAGACGAAGATGTTTACACCTGGATGATGTGAAGGCTGTCTTTGATGATCCTGAGGAACAAAGGACCGAGGAACAACGTGGAAACGAGCAGCAAATGAGCCCTCTGATCCTTGATGCATTTGATTTAATAGTTCTATCTCAAGGCTTAAACCTGGGAGCGATGTTTGACCGTGGCCAGGATTCTATGAAGTACCCAACACGCTTTGTCAGCCAGAAGCCTGCAAAGGTTTTATTGTCAACTGTGGAAGTTGTAGCTCAATCAATGGGTTTCAAGACGCACATTCGCAATTACAAGATGAGAGTAGAAGGTCCATCAGCAAGCAAAACTTCGTATTTCTCAGTCATTATGGAATTCTACAAGAGCTTATATAGCAATCTTGAAGATATCATCTGGAAACCTTCAATTGACACCAGCAAATCAAGGATCGCCAAGAACAAGAATCGACGTCGTGCCGTTCTTTCGCCGAATAGAGAAGCAACAATGGCGTCGACATTAAACTGTCTTGGTATTTGCCTGAAGCTCATCGATATCATCTGGCCCAGAAACTGGATAAGCAAACACGCCGCCTCCCCACTCTCCCACCTCCATCTTTCCCTCTTCAATTTTTACACTCTGATCAAAACCCTTTTGTTTCCTTCAAGTTGGAGGAGTATGGATATAACGTTGCATGCAAAGCATTACACGGATGGATTCTGTTTGTTTCAGCGTCGCAGCACCAAAAATAGCAGCCGTAATGTTAGGGCCGAAAGAAGGGCTACTCCGGAGACCAATTCTGTAAGTTTTTCTGTGGGAGGAAAAGGGAAACCTCGATTTCTTGTTATTCCTTCTGATTGTTCTGAAGAATCCTTTGTTCGTGCGGTTCTGAAGAATCCTTTGCAACAGGGGGAGAAAAATCTTCACACCCATTTGAATGGCTCTAGTTCTTCATCATTTTCTTCGAATCATTCGCAGAGTTTTGAGGAATTTGAGAACAATAATCATCTTCGTCGACTGGTAAGAAATGGGGAATTGGAAGAAGGGTTTAAATTCATAGAGGGTATGGTTTATCGTGGTGATATTCCTGATGCAATTGTGTGTACTAGTTTGATTCGTGGTTTGTGTAAAACTGGAAAAACTAGAAAGGCTGCTCGAGTTATGGAAATTTTGGAGGATTCTGGGGCTGTTCCTGATGTGATAACTTACAATGTTCTTATTAGTGGTTACTGTAAATCTGGTGAAATTGACAATGCTTTGAAACTTTTGGATCGAATGAGTATTTCTCCTGATGTTATTACTTATAATATAATCTTGCGTACGCTCTGTGACAGTGGGAAGTTGAAGGAAGCCATGGAAGTTCTTCATAGACAGTTGCAGAGGGAGTGTTATCCTGATGTAATAACTTATACTATATTGATTGAAGCAACTTGTAAGGAGAGTGGAGTTGGGCAGGCGATGAAATTGTTGGATGAAATGAGGGAGAAAGGATGTAAGCCTGATGTTGTCACTTACAATGTTCTTATAAATGGGATTTGCAAGGAAGGAAGGTTGGATGAAGCTATTGAGTTTTTGAATGAAATGTCTTCTTATGGCTGCCAACCTAATGTAATCACTCATAACATCATCTTGCGTAGTATGTGTAGTACGGGGAGGTGGACGGATGCCGAGAAGCTGTTGGCTGAAATGGTTCGTAAGGGATGTTCTCCTAGTGTTGTTACTTTCAATATCTTGATTAATTTCTTGTGTCGAAAGGGGTTGTTGGGTCGAGCTATTGATATTTTGGAGAAGATGCCACAGCATGGCTGTACTCCGAATTCCTCGAGCTACAACCCGTTGCTTCATGGGTTTTGTAAAGAGAAGAAGATGGAGCGGGCGATCGAGTATTTGGATATCATGGCTTCTAGAGGTTGTTACCCTGATATTGTGACGTATAATACTCTATTGACTGCATTGTGTAAAGATGGGAAGGTAGATGTTGCTGTTGAGATACTGAATCAACTTGGTACTAAAAGTTGCTCTCCTGTATTGATTACATACAACACAGTGATTGATGGGCTATCAAAGGCAGGCAAAACAGAAGATGCTGTTAAACTTCTAGACGAGATGAAGGAAAAAGGACTCAAACCCGATATAATTACATACTCGTCGCTCATTGGAGGACTGTGCAGGGAAGGAAAAGTTGATGAAGCAATTGCATTTTTCCATGATCTAGAAGAAGTTGGTGTGAGGCCAAATGTTATCACATACAACTCTATCATGTTAGGACTCTGTAAGGTTCAACAAACTGTTCGTGCCATCGATTTCTTGGCATCGATGGTTGCCAGAGGCTGTAAACCGAACGAGGCTTCGTACATGATTCTTATTGAGGGGTTGGCCTATGAAGGTTTAGCCAAGGAGGCATTGGAGTTGCTTGATGAATTGTGCTCTAGAGGAGTTATGAAGAAGAGTTCTGCTGAGAGGGTAGTGATCAAAAACTCTTTTTGA
Protein sequence
MVQEKLCFCQTFRRRRCLHLDDVKAVFDDPEEQRTEEQRGNEQQMSPLILDAFDLIVLSQGLNLGAMFDRGQDSMKYPTRFVSQKPAKVLLSTVEVVAQSMGFKTHIRNYKMRVEGPSASKTSYFSVIMEFYKSLYSNLEDIIWKPSIDTSKSRIAKNKNRRRAVLSPNREATMASTLNCLGICLKLIDIIWPRNWISKHAASPLSHLHLSLFNFYTLIKTLLFPSSWRSMDITLHAKHYTDGFCLFQRRSTKNSSRNVRAERRATPETNSVSFSVGGKGKPRFLVIPSDCSEESFVRAVLKNPLQQGEKNLHTHLNGSSSSSFSSNHSQSFEEFENNNHLRRLVRNGELEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVPDVITYNVLISGYCKSGEIDNALKLLDRMSISPDVITYNIILRTLCDSGKLKEAMEVLHRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRLDEAIEFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVKLLDEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEEVGVRPNVITYNSIMLGLCKVQQTVRAIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSSAERVVIKNSF
Homology
BLAST of CmaCh05G005600 vs. ExPASy Swiss-Prot
Match:
Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)
HSP 1 Score: 848.2 bits (2190), Expect = 7.9e-245
Identity = 422/600 (70.33%), Postives = 494/600 (82.33%), Query Frame = 0
Query: 231 MDITLHAKHYTDGFCLFQRRSTKNSSRNVRAERRATPETNSVSFSVGGKGKPRFLVIPSD 290
MD+ + +GFCL Q+ + N T + S +G + + R +++ +
Sbjct: 1 MDLMVSTSSAQEGFCLIQQFHREYKRGNKLDVSCRTSGSISSKIPLGSRKRNRLVLVSAA 60
Query: 291 CSEESFVRAVLKNPLQQGEKNLHTHLNGSSSSSFSS-NHSQSFEEFENNNHLRRLVRNGE 350
ES + L Q+ E + N + + +SS N S + E+ E+NNHLR++VR GE
Sbjct: 61 SKVES---SGLNGRAQKFETLSSGYSNSNGNGHYSSVNSSFALEDVESNNHLRQMVRTGE 120
Query: 351 LEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVPDVITYNV 410
LEEGFKF+E MVY G++PD I CT+LIRG C+ GKTRKAA+++EILE SGAVPDVITYNV
Sbjct: 121 LEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVPDVITYNV 180
Query: 411 LISGYCKSGEIDNALKLLDRMSISPDVITYNIILRTLCDSGKLKEAMEVLHRQLQRECYP 470
+ISGYCK+GEI+NAL +LDRMS+SPDV+TYN ILR+LCDSGKLK+AMEVL R LQR+CYP
Sbjct: 181 MISGYCKAGEINNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYP 240
Query: 471 DVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRLDEAIEFL 530
DVITYTILIEATC++SGVG AMKLLDEMR++GC PDVVTYNVL+NGICKEGRLDEAI+FL
Sbjct: 241 DVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFL 300
Query: 531 NEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNILINFLCRK 590
N+M S GCQPNVITHNIILRSMCSTGRW DAEKLLA+M+RKG SPSVVTFNILINFLCRK
Sbjct: 301 NDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRK 360
Query: 591 GLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGCYPDIVTY 650
GLLGRAIDILEKMPQHGC PNS SYNPLLHGFCKEKKM+RAIEYL+ M SRGCYPDIVTY
Sbjct: 361 GLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTY 420
Query: 651 NTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVKLLDEMKE 710
NT+LTALCKDGKV+ AVEILNQL +K CSPVLITYNTVIDGL+KAGKT A+KLLDEM+
Sbjct: 421 NTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEMRA 480
Query: 711 KGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEEVGVRPNVITYNSIMLGLCKVQQTVR 770
K LKPD ITYSSL+GGL REGKVDEAI FFH+ E +G+RPN +T+NSIMLGLCK +QT R
Sbjct: 481 KDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIMLGLCKSRQTDR 540
Query: 771 AIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSSAERVVIK 830
AIDFL M+ RGCKPNE SY ILIEGLAYEG+AKEALELL+ELC++G+MKKSSAE+V K
Sbjct: 541 AIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGLMKKSSAEQVAGK 597
BLAST of CmaCh05G005600 vs. ExPASy Swiss-Prot
Match:
Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)
HSP 1 Score: 460.3 bits (1183), Expect = 4.6e-128
Identity = 235/526 (44.68%), Postives = 333/526 (63.31%), Query Frame = 0
Query: 308 GEKNLHTHLNGSSS--SSFSSNHSQS--FEEFENNNHLRRLVRNGELEEGFKFIEGMVYR 367
G +NL T ++ + HSQS F + + R R+G E +E MV +
Sbjct: 59 GARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRK 118
Query: 368 GDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVPDVITYNVLISGYCKSGEIDNA 427
G PD I+CT LI+G KA RVMEILE G PDV YN LI+G+CK ID+A
Sbjct: 119 GYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQ-PDVFAYNALINGFCKMNRIDDA 178
Query: 428 LKLLDRM---SISPDVITYNIILRTLCDSGKLKEAMEVLHRQLQRECYPDVITYTILIEA 487
++LDRM SPD +TYNI++ +LC GKL A++VL++ L C P VITYTILIEA
Sbjct: 179 TRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEA 238
Query: 488 TCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRLDEAIEFLNEMSSYGCQPN 547
T E GV +A+KL+DEM +G KPD+ TYN +I G+CKEG +D A E + + GC+P+
Sbjct: 239 TMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPD 298
Query: 548 VITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDILE 607
VI++NI+LR++ + G+W + EKL+ +M + C P+VVT++ILI LCR G + A+++L+
Sbjct: 299 VISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLK 358
Query: 608 KMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGCYPDIVTYNTLLTALCKDG 667
M + G TP++ SY+PL+ FC+E +++ AIE+L+ M S GC PDIV YNT+L LCK+G
Sbjct: 359 LMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNG 418
Query: 668 KVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVKLLDEMKEKGLKPDIITYS 727
K D A+EI +LG CSP +YNT+ L +G A+ ++ EM G+ PD ITY+
Sbjct: 419 KADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYN 478
Query: 728 SLIGGLCREGKVDEAIAFFHDLEEVGVRPNVITYNSIMLGLCKVQQTVRAIDFLASMVAR 787
S+I LCREG VDEA D+ P+V+TYN ++LG CK + AI+ L SMV
Sbjct: 479 SMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGN 538
Query: 788 GCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSSAERV 827
GC+PNE +Y +LIEG+ + G EA+EL ++L + + S +R+
Sbjct: 539 GCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRL 583
BLAST of CmaCh05G005600 vs. ExPASy Swiss-Prot
Match:
A3KPF8 (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g79080 PE=2 SV=1)
HSP 1 Score: 345.1 bits (884), Expect = 2.2e-93
Identity = 185/486 (38.07%), Postives = 289/486 (59.47%), Query Frame = 0
Query: 350 LEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVPDVITYNV 409
L + F +E +V G P+ T L+ LCK + +KA RV+E++ SG +PD Y
Sbjct: 87 LSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTY 146
Query: 410 LISGYCKSGEIDNALKLLDRM---SISPDVITYNIILRTLCDSGKLKEAMEVLHRQLQRE 469
L++ CK G + A++L+++M + +TYN ++R LC G L ++++ + R +Q+
Sbjct: 147 LVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKG 206
Query: 470 CYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRLDEAI 529
P+ TY+ L+EA KE G +A+KLLDE+ KG +P++V+YNVL+ G CKEGR D+A+
Sbjct: 207 LAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAM 266
Query: 530 EFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNILINFL 589
E+ + G + NV+++NI+LR +C GRW +A LLAEM +PSVVT+NILIN L
Sbjct: 267 ALFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSL 326
Query: 590 CRKGLLGRAIDILEKMPQ--HGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGCYP 649
G +A+ +L++M + H ++SYNP++ CKE K++ ++ LD M R C P
Sbjct: 327 AFHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKP 386
Query: 650 DIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVKLL 709
+ TYN + + + KV A I+ L K Y +VI L + G T A +LL
Sbjct: 387 NEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 446
Query: 710 DEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEE-VGVRPNVITYNSIMLGLCK 769
EM G PD TYS+LI GLC EG A+ +EE +P V +N+++LGLCK
Sbjct: 447 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 506
Query: 770 VQQTVRAIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSSA 829
+++T A++ MV + PNE +Y IL+EG+A+E + A E+LDEL R V+ +++
Sbjct: 507 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 566
BLAST of CmaCh05G005600 vs. ExPASy Swiss-Prot
Match:
Q9FRS4 (Pentatricopeptide repeat-containing protein At1g08610 OS=Arabidopsis thaliana OX=3702 GN=At1g08610 PE=2 SV=1)
HSP 1 Score: 341.3 bits (874), Expect = 3.1e-92
Identity = 184/491 (37.47%), Postives = 275/491 (56.01%), Query Frame = 0
Query: 333 EEFENNNHLRRLVRNGELEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVM 392
+E NN L L NG+L + K +E M +P C++L+RGL + + KA ++
Sbjct: 103 DEETNNEILHNLCSNGKLTDACKLVEVMARHNQVPHFPSCSNLVRGLARIDQLDKAMCIL 162
Query: 393 EILEDSGAVPDVITYNVLISGYCKSGEIDNALKLLDRMSIS---PDVITYNIILRTLCDS 452
++ SG VPD ITYN++I CK G I AL LL+ MS+S PDVITYN ++R + D
Sbjct: 163 RVMVMSGGVPDTITYNMIIGNLCKKGHIRTALVLLEDMSLSGSPPDVITYNTVIRCMFDY 222
Query: 453 GKLKEAMEVLHRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTY 512
G ++A+ QLQ C P +ITYT+L+E C+ G +A+++L++M +GC PD+VTY
Sbjct: 223 GNAEQAIRFWKDQLQNGCPPFMITYTVLVELVCRYCGSARAIEVLEDMAVEGCYPDIVTY 282
Query: 513 NVLINGICKEGRLDEAIEFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVR 572
N L+N C+ G L+E + + S+G + N +T+N +L S+CS W + E++L M +
Sbjct: 283 NSLVNYNCRRGNLEEVASVIQHILSHGLELNTVTYNTLLHSLCSHEYWDEVEEILNIMYQ 342
Query: 573 KGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMER 632
P+V+T+NILIN LC+ LL RAID +
Sbjct: 343 TSYCPTVITYNILINGLCKARLLSRAIDFFYQ---------------------------- 402
Query: 633 AIEYLDIMASRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVID 692
M + C PDIVTYNT+L A+ K+G VD A+E+L L C P LITYN+VID
Sbjct: 403 -------MLEQKCLPDIVTYNTVLGAMSKEGMVDDAIELLGLLKNTCCPPGLITYNSVID 462
Query: 693 GLSKAGKTEDAVKLLDEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEEVGVRP 752
GL+K G + A++L +M + G+ PD IT SLI G CR V+EA + G
Sbjct: 463 GLAKKGLMKKALELYHQMLDAGIFPDDITRRSLIYGFCRANLVEEAGQVLKETSNRGNGI 522
Query: 753 NVITYNSIMLGLCKVQQTVRAIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELL 812
TY ++ GLCK ++ AI+ + M+ GCKP+E Y +++G+ G+ EA++L
Sbjct: 523 RGSTYRLVIQGLCKKKEIEMAIEVVEIMLTGGCKPDETIYTAIVKGVEEMGMGSEAVQLQ 558
Query: 813 DELCSRGVMKK 821
+L ++K+
Sbjct: 583 KKLKQWKLLKE 558
BLAST of CmaCh05G005600 vs. ExPASy Swiss-Prot
Match:
Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)
HSP 1 Score: 325.5 bits (833), Expect = 1.8e-87
Identity = 169/488 (34.63%), Postives = 272/488 (55.74%), Query Frame = 0
Query: 345 VRNGELEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAAR-VMEILEDSGAVPD 404
+ G+L+ + E MV G + ++ G CK G+ A + E+ G PD
Sbjct: 235 IEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPD 294
Query: 405 VITYNVLISGYCKSGEIDNALKLLDRM---SISPDVITYNIILRTLCDSGKLKEAMEVLH 464
T+N L++G CK+G + +A++++D M PDV TYN ++ LC G++KEA+EVL
Sbjct: 295 QYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLD 354
Query: 465 RQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEG 524
+ + R+C P+ +TY LI CKE+ V +A +L + KG PDV T+N LI G+C
Sbjct: 355 QMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTR 414
Query: 525 RLDEAIEFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFN 584
A+E EM S GC+P+ T+N+++ S+CS G+ +A +L +M GC+ SV+T+N
Sbjct: 415 NHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYN 474
Query: 585 ILINFLCRKGLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASR 644
LI+ C+ A +I ++M HG + NS +YN L+ G CK +++E A + +D M
Sbjct: 475 TLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIME 534
Query: 645 GCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDA 704
G PD TYN+LLT C+ G + A +I+ + + C P ++TY T+I GL KAG+ E A
Sbjct: 535 GQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVA 594
Query: 705 VKLLDEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHD-LEEVGVRPNVITYNSIML 764
KLL ++ KG+ Y+ +I GL R+ K EAI F + LE+ P+ ++Y +
Sbjct: 595 SKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFR 654
Query: 765 GLCKVQQTVR-AIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVM 824
GLC +R A+DFL ++ +G P +S +L EGL + + ++L++ + +
Sbjct: 655 GLCNGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARF 714
Query: 825 KKSSAERV 827
+ V
Sbjct: 715 SEEEVSMV 722
BLAST of CmaCh05G005600 vs. TAIR 10
Match:
AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )
HSP 1 Score: 848.2 bits (2190), Expect = 5.6e-246
Identity = 422/600 (70.33%), Postives = 494/600 (82.33%), Query Frame = 0
Query: 231 MDITLHAKHYTDGFCLFQRRSTKNSSRNVRAERRATPETNSVSFSVGGKGKPRFLVIPSD 290
MD+ + +GFCL Q+ + N T + S +G + + R +++ +
Sbjct: 1 MDLMVSTSSAQEGFCLIQQFHREYKRGNKLDVSCRTSGSISSKIPLGSRKRNRLVLVSAA 60
Query: 291 CSEESFVRAVLKNPLQQGEKNLHTHLNGSSSSSFSS-NHSQSFEEFENNNHLRRLVRNGE 350
ES + L Q+ E + N + + +SS N S + E+ E+NNHLR++VR GE
Sbjct: 61 SKVES---SGLNGRAQKFETLSSGYSNSNGNGHYSSVNSSFALEDVESNNHLRQMVRTGE 120
Query: 351 LEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVPDVITYNV 410
LEEGFKF+E MVY G++PD I CT+LIRG C+ GKTRKAA+++EILE SGAVPDVITYNV
Sbjct: 121 LEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVPDVITYNV 180
Query: 411 LISGYCKSGEIDNALKLLDRMSISPDVITYNIILRTLCDSGKLKEAMEVLHRQLQRECYP 470
+ISGYCK+GEI+NAL +LDRMS+SPDV+TYN ILR+LCDSGKLK+AMEVL R LQR+CYP
Sbjct: 181 MISGYCKAGEINNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYP 240
Query: 471 DVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRLDEAIEFL 530
DVITYTILIEATC++SGVG AMKLLDEMR++GC PDVVTYNVL+NGICKEGRLDEAI+FL
Sbjct: 241 DVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFL 300
Query: 531 NEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNILINFLCRK 590
N+M S GCQPNVITHNIILRSMCSTGRW DAEKLLA+M+RKG SPSVVTFNILINFLCRK
Sbjct: 301 NDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRK 360
Query: 591 GLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGCYPDIVTY 650
GLLGRAIDILEKMPQHGC PNS SYNPLLHGFCKEKKM+RAIEYL+ M SRGCYPDIVTY
Sbjct: 361 GLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTY 420
Query: 651 NTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVKLLDEMKE 710
NT+LTALCKDGKV+ AVEILNQL +K CSPVLITYNTVIDGL+KAGKT A+KLLDEM+
Sbjct: 421 NTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEMRA 480
Query: 711 KGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEEVGVRPNVITYNSIMLGLCKVQQTVR 770
K LKPD ITYSSL+GGL REGKVDEAI FFH+ E +G+RPN +T+NSIMLGLCK +QT R
Sbjct: 481 KDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIMLGLCKSRQTDR 540
Query: 771 AIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSSAERVVIK 830
AIDFL M+ RGCKPNE SY ILIEGLAYEG+AKEALELL+ELC++G+MKKSSAE+V K
Sbjct: 541 AIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGLMKKSSAEQVAGK 597
BLAST of CmaCh05G005600 vs. TAIR 10
Match:
AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )
HSP 1 Score: 460.3 bits (1183), Expect = 3.3e-129
Identity = 235/526 (44.68%), Postives = 333/526 (63.31%), Query Frame = 0
Query: 308 GEKNLHTHLNGSSS--SSFSSNHSQS--FEEFENNNHLRRLVRNGELEEGFKFIEGMVYR 367
G +NL T ++ + HSQS F + + R R+G E +E MV +
Sbjct: 59 GARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRK 118
Query: 368 GDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVPDVITYNVLISGYCKSGEIDNA 427
G PD I+CT LI+G KA RVMEILE G PDV YN LI+G+CK ID+A
Sbjct: 119 GYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQ-PDVFAYNALINGFCKMNRIDDA 178
Query: 428 LKLLDRM---SISPDVITYNIILRTLCDSGKLKEAMEVLHRQLQRECYPDVITYTILIEA 487
++LDRM SPD +TYNI++ +LC GKL A++VL++ L C P VITYTILIEA
Sbjct: 179 TRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEA 238
Query: 488 TCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRLDEAIEFLNEMSSYGCQPN 547
T E GV +A+KL+DEM +G KPD+ TYN +I G+CKEG +D A E + + GC+P+
Sbjct: 239 TMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPD 298
Query: 548 VITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDILE 607
VI++NI+LR++ + G+W + EKL+ +M + C P+VVT++ILI LCR G + A+++L+
Sbjct: 299 VISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLK 358
Query: 608 KMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGCYPDIVTYNTLLTALCKDG 667
M + G TP++ SY+PL+ FC+E +++ AIE+L+ M S GC PDIV YNT+L LCK+G
Sbjct: 359 LMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNG 418
Query: 668 KVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVKLLDEMKEKGLKPDIITYS 727
K D A+EI +LG CSP +YNT+ L +G A+ ++ EM G+ PD ITY+
Sbjct: 419 KADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYN 478
Query: 728 SLIGGLCREGKVDEAIAFFHDLEEVGVRPNVITYNSIMLGLCKVQQTVRAIDFLASMVAR 787
S+I LCREG VDEA D+ P+V+TYN ++LG CK + AI+ L SMV
Sbjct: 479 SMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGN 538
Query: 788 GCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSSAERV 827
GC+PNE +Y +LIEG+ + G EA+EL ++L + + S +R+
Sbjct: 539 GCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRL 583
BLAST of CmaCh05G005600 vs. TAIR 10
Match:
AT1G79080.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 345.1 bits (884), Expect = 1.5e-94
Identity = 185/486 (38.07%), Postives = 289/486 (59.47%), Query Frame = 0
Query: 350 LEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVPDVITYNV 409
L + F +E +V G P+ T L+ LCK + +KA RV+E++ SG +PD Y
Sbjct: 87 LSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTY 146
Query: 410 LISGYCKSGEIDNALKLLDRM---SISPDVITYNIILRTLCDSGKLKEAMEVLHRQLQRE 469
L++ CK G + A++L+++M + +TYN ++R LC G L ++++ + R +Q+
Sbjct: 147 LVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKG 206
Query: 470 CYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRLDEAI 529
P+ TY+ L+EA KE G +A+KLLDE+ KG +P++V+YNVL+ G CKEGR D+A+
Sbjct: 207 LAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAM 266
Query: 530 EFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNILINFL 589
E+ + G + NV+++NI+LR +C GRW +A LLAEM +PSVVT+NILIN L
Sbjct: 267 ALFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSL 326
Query: 590 CRKGLLGRAIDILEKMPQ--HGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGCYP 649
G +A+ +L++M + H ++SYNP++ CKE K++ ++ LD M R C P
Sbjct: 327 AFHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKP 386
Query: 650 DIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVKLL 709
+ TYN + + + KV A I+ L K Y +VI L + G T A +LL
Sbjct: 387 NEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 446
Query: 710 DEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEE-VGVRPNVITYNSIMLGLCK 769
EM G PD TYS+LI GLC EG A+ +EE +P V +N+++LGLCK
Sbjct: 447 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 506
Query: 770 VQQTVRAIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSSA 829
+++T A++ MV + PNE +Y IL+EG+A+E + A E+LDEL R V+ +++
Sbjct: 507 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 566
BLAST of CmaCh05G005600 vs. TAIR 10
Match:
AT1G08610.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 341.3 bits (874), Expect = 2.2e-93
Identity = 184/491 (37.47%), Postives = 275/491 (56.01%), Query Frame = 0
Query: 333 EEFENNNHLRRLVRNGELEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVM 392
+E NN L L NG+L + K +E M +P C++L+RGL + + KA ++
Sbjct: 103 DEETNNEILHNLCSNGKLTDACKLVEVMARHNQVPHFPSCSNLVRGLARIDQLDKAMCIL 162
Query: 393 EILEDSGAVPDVITYNVLISGYCKSGEIDNALKLLDRMSIS---PDVITYNIILRTLCDS 452
++ SG VPD ITYN++I CK G I AL LL+ MS+S PDVITYN ++R + D
Sbjct: 163 RVMVMSGGVPDTITYNMIIGNLCKKGHIRTALVLLEDMSLSGSPPDVITYNTVIRCMFDY 222
Query: 453 GKLKEAMEVLHRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTY 512
G ++A+ QLQ C P +ITYT+L+E C+ G +A+++L++M +GC PD+VTY
Sbjct: 223 GNAEQAIRFWKDQLQNGCPPFMITYTVLVELVCRYCGSARAIEVLEDMAVEGCYPDIVTY 282
Query: 513 NVLINGICKEGRLDEAIEFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVR 572
N L+N C+ G L+E + + S+G + N +T+N +L S+CS W + E++L M +
Sbjct: 283 NSLVNYNCRRGNLEEVASVIQHILSHGLELNTVTYNTLLHSLCSHEYWDEVEEILNIMYQ 342
Query: 573 KGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMER 632
P+V+T+NILIN LC+ LL RAID +
Sbjct: 343 TSYCPTVITYNILINGLCKARLLSRAIDFFYQ---------------------------- 402
Query: 633 AIEYLDIMASRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVID 692
M + C PDIVTYNT+L A+ K+G VD A+E+L L C P LITYN+VID
Sbjct: 403 -------MLEQKCLPDIVTYNTVLGAMSKEGMVDDAIELLGLLKNTCCPPGLITYNSVID 462
Query: 693 GLSKAGKTEDAVKLLDEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEEVGVRP 752
GL+K G + A++L +M + G+ PD IT SLI G CR V+EA + G
Sbjct: 463 GLAKKGLMKKALELYHQMLDAGIFPDDITRRSLIYGFCRANLVEEAGQVLKETSNRGNGI 522
Query: 753 NVITYNSIMLGLCKVQQTVRAIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELL 812
TY ++ GLCK ++ AI+ + M+ GCKP+E Y +++G+ G+ EA++L
Sbjct: 523 RGSTYRLVIQGLCKKKEIEMAIEVVEIMLTGGCKPDETIYTAIVKGVEEMGMGSEAVQLQ 558
Query: 813 DELCSRGVMKK 821
+L ++K+
Sbjct: 583 KKLKQWKLLKE 558
BLAST of CmaCh05G005600 vs. TAIR 10
Match:
AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 325.5 bits (833), Expect = 1.3e-88
Identity = 169/488 (34.63%), Postives = 272/488 (55.74%), Query Frame = 0
Query: 345 VRNGELEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAAR-VMEILEDSGAVPD 404
+ G+L+ + E MV G + ++ G CK G+ A + E+ G PD
Sbjct: 235 IEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPD 294
Query: 405 VITYNVLISGYCKSGEIDNALKLLDRM---SISPDVITYNIILRTLCDSGKLKEAMEVLH 464
T+N L++G CK+G + +A++++D M PDV TYN ++ LC G++KEA+EVL
Sbjct: 295 QYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLD 354
Query: 465 RQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEG 524
+ + R+C P+ +TY LI CKE+ V +A +L + KG PDV T+N LI G+C
Sbjct: 355 QMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTR 414
Query: 525 RLDEAIEFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFN 584
A+E EM S GC+P+ T+N+++ S+CS G+ +A +L +M GC+ SV+T+N
Sbjct: 415 NHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYN 474
Query: 585 ILINFLCRKGLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASR 644
LI+ C+ A +I ++M HG + NS +YN L+ G CK +++E A + +D M
Sbjct: 475 TLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIME 534
Query: 645 GCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDA 704
G PD TYN+LLT C+ G + A +I+ + + C P ++TY T+I GL KAG+ E A
Sbjct: 535 GQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVA 594
Query: 705 VKLLDEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHD-LEEVGVRPNVITYNSIML 764
KLL ++ KG+ Y+ +I GL R+ K EAI F + LE+ P+ ++Y +
Sbjct: 595 SKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFR 654
Query: 765 GLCKVQQTVR-AIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVM 824
GLC +R A+DFL ++ +G P +S +L EGL + + ++L++ + +
Sbjct: 655 GLCNGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARF 714
Query: 825 KKSSAERV 827
+ V
Sbjct: 715 SEEEVSMV 722
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q3EDF8 | 7.9e-245 | 70.33 | Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX... | [more] |
Q9SR00 | 4.6e-128 | 44.68 | Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... | [more] |
A3KPF8 | 2.2e-93 | 38.07 | Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop... | [more] |
Q9FRS4 | 3.1e-92 | 37.47 | Pentatricopeptide repeat-containing protein At1g08610 OS=Arabidopsis thaliana OX... | [more] |
Q9LFF1 | 1.8e-87 | 34.63 | Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... | [more] |
Match Name | E-value | Identity | Description | |
AT1G09900.1 | 5.6e-246 | 70.33 | Pentatricopeptide repeat (PPR-like) superfamily protein | [more] |
AT3G04760.1 | 3.3e-129 | 44.68 | Pentatricopeptide repeat (PPR-like) superfamily protein | [more] |
AT1G79080.1 | 1.5e-94 | 38.07 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G08610.1 | 2.2e-93 | 37.47 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT3G53700.1 | 1.3e-88 | 34.63 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |