MC09g0822 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC09g0822
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein
Location: MC09: 8554628 .. 8573661 (+)
RNA-Seq Expression: MC09g0822
Synteny: MC09g0822
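
The Location field above can be turned into a chromosome, coordinates, strand, and span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); that convention is not stated on this page, and the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re


def parse_location(text):
    """Split a 'CHROM: START .. END (STRAND)' string into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("MC09: 8554628 .. 8573661 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 19,034 bp on the plus strand of MC09; with a 0-based, half-open convention the result would differ by one.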
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTTTCTTAGACTTAGATGTGAAAAATTTTATACAATTTTCGCTTCCTTTTTTGCTTTTCTAAATTATAACGAAAAAACACAATGTACTCCGTCCTCCCGGCAAAAAACCCATAAACCTTCCAAAAACCCCTTCCAAAACGTTTCAGAGCTCTCCCAAATTAGCTTATGCTCTCGTTCACACACAGACTCCGGACCTTGTCCTCATCTCCATGCTCCAAATCTCATCTTCTTCGATCCCTCCGGACGGACGCCGGACCGCCCAGGCGCCGGTCGAAGCCGGCGGCATTTGCTGCAAAGAACGCGGACGAGAAGTCAGAGTGGTGGGCAGTTGACGGCGAAATGCACGAAATTGGCGAAAATGTGCCGCCTCGCGAGCGCTTCGTTATACCCAGAGAAAATATCCCCAATCGGCGCCGGAAGCAGCTCAGGGAACAGTTCATGCGCCGGACTCGCCTCGTTCTCAAGGAATCTGTTAGCCTCTTTTCTTCTTATTCTTTTGTTCTTTCTTCCCCTGCCTTTGATTGGACAACTTTTGGATATGACATATGTTATGATTTCCCTTGGGATTAATCTTGATTACTTTTCTCTTGAATTGGGTATGAAGGAGAAACTAGGTTGAGGTTCTTATAACTATTAGTTATTGATTTGATGTTAGAGCACCAACAAGTAAATATTGGAGATTGTTATTAGGATATTACTAGTAGCATAATGCGATTATCGGAGTAGAGTGGCTATACATAGAGAGAGTTAGAGAGATGGAGGATAGGCAAAATTTTGGAAAGCAGTTTTCTCTGGGAGAGTCCAGGGCTTCTCGAAGTGCTTGATGTTGTGTTTTGTAATTTTCATTTCACCCTAAAAATAGGATAAGTTTTATTGACAAAGTCATTTGGTCCTCGGAAAAAGATCTACTAAATTCATCGAGTTGTCCCAAAGAATCAAAGAAAGGTCGATTTCAGTGTTGTTTTCTTACTTCTATTCTCAATGTCGTCAATCCTTGGGTGTTTGAAGGAGCATGAGCCTTGGTGCAAAAAGTACATGGAGCTTTATCAGGAGCTTAGAGAAAACTGGGAGAGGCTGTACTGGGACGAGGGATACTCTAAAAAACTTGCCCAGGAACATGCAAATTATGAGTCTGCTGAAGATGAGGATTTCTCCCCTTATAGGTTGGATTTTTTTGTGATTTCATGTATGTTTGTTTTGTATGGCTGGTGAAATGATCTTCAATTAAAAGCATTGTTCTTCGTGAATTGAATGGTCGTAACCTCTATTCATCTTTCATGATTTAAAATACGGTTCAATTTTCTATTCTTTTAAGTTTGCATCAGAGTATATCTGTAATGTGACCGTAGTATCTTAGTTTCATGGCTTTTTCTCGTGTTTCAGGAATAGGCAATCCAGTGTTGATCAAAGTAAGGTAAATTTTCTTTGTTAATTTTTTTAACATGTTTTATTTCAAAGCTTGTTATATTCTGAGTTTTTGTAGACGAAGCAAGTATTGTAGAAGTTGATTTGCTATTTTAACTCTCAAGAGTCAAGACTTGTCATTTCAAGTATGACTTCTGTAATGCTGTATTTGGTACAATTTTGTTAATCCAGAGAATGCACGAGGATTGTGAGGTGAACTTAATAGAAAATTAATAACCTTGTGAATTTGCTTCCCTTAATGAAGAACAAAAATGCAGTTGCTTATCCTTGATCCTTCATTGTGATATTGATGATTTATAGAGGAGCAATATTTGGTCTAAAAGTAAGACTAGGAAAAACATTAGTCATCTCATGGTTTTATCTTCTTCACTAGTGTTTATTATGTCACACATTGCACAAACTCCGAATATGGGTTTTGCACATGCTTGTGGTTGTATCACTTTGAAGTTACAATGTCGTTCCAGCATTGTCCTTATAGAAAGTCAAATATTTTTCCTCCTTTTCATGCACTTCAAATTAAGTATTTTTTACGAGTTAAATGATTTTGGAAAATGGTTTGATTCTTTTAACTCTC
CTCCCCTATCTTTCCCTCTCTCCCCACTCATGGACTCCCTTTTGAGTCCAAGCATGCTTAATGAATTAAGGATTTGATGGTAACTTGGCAACCTTATTTCTGTTAGGGACAACCGGACCTCTCCATGTTTCACAATAGTGTGATATTGTCTGCTTTAATTTGAAATCCTCATTACTCTGTTTTTGGTTCACCCAAAAGGCTTCATACCAATGGAGATTTTGTCTCTCACTTACATATTTATTATCTTCTCCTCCTAACCGATATGGGACTTTGGTCGCATCCCCAATAGTTTCATCAAAAGTTTTTAAGAAATAACAATGTTAGTGAGGAAAGCTTCTTGAAGCTGCAAAAGATCACAATTCACAAATAAAAGAGGTTTCTTTGCTCCTATGGACATTGTTCTATAGAGGTACGAATACAGCATATAGAATTCAAAAGAAACTCCCCCACTTGGCGCTCTCCCCGACAATCTGCAATTTATGTTTGAGGAACTCAGAGAATGTGGACCATCTCTTTATGCACTGTACCTTCGCGTTCACAGCTTGGAATTATGTGGCTTCTATATTAGACTTATCCTTTTGTTTCCCTTAGCATATCGAAGATTTCATTTATGAAGGTTTTGGTGGTAAAGCACTCAAAGGAAAAGCTTTTATCATATGGAATTGTGCCGCTAGGGCGGTCCTTTGGCTTCTTTGGAAAGAAAGGAATAGTCGGATTTTTGAGGACAAAATGGTCTTGATAGATTATTTTTTTGTGAATATCAAGCATACGGCTTCTTGGTGGGCTTCTACACACAAAAAGGCTTTTTGTAATTACAGTTTTTTTTTTATTATGCAAGATTGGAAAGCTGTGATTGTATAGTCCTTTGGGGAGGGTTTTCTCTACCCTCCGCCTTTAGGTTGTTTTGGCCCTTTTTGGGTGTTTTCTCTAATATACCCTTTTTGTCTCTTATAAAAAAAAAGAGGTTTCTTTGCGAAAATCACAGAAGTTTAGAACGATGGAAAGAAGAGCAATATGATAGCTTCATGGAGGGTAGATCTGACATTTCAGGGGTTGGAAAGCTTTCACACTTTTAGTCAATGAGTTCTTGATCCTAGGAAGTAGGAAGGTGGAGTCAGACAAAGGAGAAAAATCTTATTGGAAGCATTTACCATTTAACGAAGTGTCCTCACTCCTAGGAGAAGTAAAGCTCTAGTTCAAATGTCATGCAACTAATACAAACCTTGCAGAGGTGGCTTCTACTGATAAGAATTTTGGTGGTGGCAAGATTATACTTTCTCAATGGATGGATCAAAATCCTAACTATTTTCACCCTGACAAAGGCCTTTAATGCCCAAATCTTAATGAAGCTCAATTAATTAGTTTTAATATGGGGTGGTGGTTTTAATGATTGACAAGATGTTTCTGGTGGTTTTAATGATTGATTTGATCTTTGAAGCTGGAGACATTTTCTGTTTCAGCTGTTGAAAAGTCTTTGAGATGGTTAAGTGTTTGATATTGCTGTTTTGGATTATTTCGCTAGGTATATAGGGGGTTTTTTAAATCTAGTTGTTTGGGTAAAATGGTTCGTTTGGTTGCTTTTGGATCTTTTAGTTGGTTTAGACCATTAGGACCTTGACCTTAGTGTCCCATGCTCATTTCACTGAAATTTGGGACGCGGATACGTGGCTTGCTTTTTTTTTCTTTTTTTTTTATAAGAAAACCTTTCATTAGAAAGAAAAAGAGCAATACAGCCAAAGGGCAGGGGGATGAGGAGACCCCCTCCCGAAACAACTAAACAAAAGCTCTCAAATCACAAATAATCGGAGAGAGGCTGAAGTTACAAAAGAATCTTCTATGTCTATGGCACCACCAAGGGAGGTGAGTTTTACACTGTCACAAAAAGAGACAAAAGAAGAAGACCTATCATAAAAAATCCTATGATTCCTTTCAAGCCAAATATTTCAAAGTAGAGCTCTAGCTATATACAAGACCACAGAACCTTCCATCTCCCTTTGAGC
TGCCACCATTGAAGAGTCTGCCTTTTGAACAATATACCTTTCATAAGGTTTACTTTCATATGTCTTTGGGATTCTATTAGCAACTCTTTTGAGTGAACTTATGATACTTGAGTAGATAGGAGGGATACATGAATGAATAGACACTACATCCAAATGGGAGAGATCCTGAGATCATGCAAGCATGGTTAATAAAGGTTTAAAAGAATCGATATAACTACCTATACCAATAAGGTGCACCTTCCTTTTTGGTTTTTTAATAAAAAAACTCCAAAGTTAAGCGTGTTTGGTTTGAAGCAATCTTGGGTTGGGTGACTTCCTAAGGGATTTTCCTAAGAAACATGTGAGTGAAGACAAATCATGTTAAAAAGGACTCGTGTTGGTTTGTAGGGACAATCTTCACTCTTCCCAACAGCTCAGGTTAAGAAGGTGGCGCAACCTTGTTAAAAGGACTCGTGTTGGGTTGGGTGACTTCCTAGGGATTTTCCTAGAAAGCATGTGAATGAGGACAAAACATCTTAAAAGGACTCATGTTGGTCTGCAAGGACAATCTTCTCTCTTCCAAGAAGCTCAGGTTAAGAAGGTGGTGCAACCAGGTGGAGGGGAATGCCAGGGCCACTAGGTGCCAAATCCAAATTTCAAATTATAGTTCGAATTTTGGGCCTAGAGCATTACAGAACTATTCCTTATGAGTGCATAAACTAGCTTATCATGTTCCATGCTCATAATCCTTCCGACTAATCTATGTTATTGTTGCTTACCCTTTCTCTTTTGTTGCTTGCAATTTCAAATTATGTTTTACAGGAGTTCTGTAATATATCTAGAAAGTAATTTAAAGTTTCAGAAATTTACTTAGAGTTTTTTCTTTAAAATAGTTAATCTGGAATATCAATCTGTTATTCCCAAAACTGTGATGACCTCCCTCCTCTCCCCACCCCAACCCACCTCCGGCCACCCTCAACCCCCCCTCATGACTTTCCTTTCCCACCAATGACCCCTGCAGCACTCCTCTGACCTACATCCCTGCAAGCTCACTCAAGATGTCCCGGCTGAAAATTGAAAACAAGGTCTTCGACTGTGACTTCGACAAGAAGAAACAAGGACGATGGCTTTGAATCACAGAGAAACATCAAAGCAGATTTTTCGTCCTCTCAATAGAAGAAATAGTGGAGTTTGGTTGATAGACATGATCTCCGATCTTCTGCTGACTCCAAATATGCAGAAATTCTTCCGGAATTGCAATGAAGGCTTCATATGGATGCAGAAAACCTCAAACAAATGGGGAAGCTTCTTGGAAATTACCAAGGTGTTCAGTGGAGAAGGGAAGAGAAACCTGGTAGTGCCGGCAGACAGAGAGCACAAAGGGTGGCTGGCTTTTAAGAATTTTCTACTCGACTTTATCAATGGGAAACAAAGCAACATAATGGAGAATAAACCAAAGACGCACCACAAGTTACACCCAAACCGTTCTTTTGCCAAGGTAGTGAAACATAAGAACGTCAATCCACGCCCTGAAAAGAATGAGGAACCCCCTTCCCCCGATAACAAAGTTGAACTCCCCAGCCCAGCCCAGCCCTTAGAACAGATGTTAGGGCCTTCAACTGGAAGGACACTCTGATCCTCACTTGCCGTGATTTCCATGATGACTGGGGACGTATTCATGAATGTATACAAGAACAAATAGAAGAAACTTTCATTATCAACTCCTTCCGGCCAGACAAAGCCCTCCTTAAATGCCCCTCCGAAGAGATTGCAAATCTGCTGGCCCAAAACAGGGGATGGGTAACCTTTGGTCCTATCACAGCGAAGGTAGAAAGATGGGATCCGTTGAAGCATGGGCGAATAAATGTAGTCCCGTCCTATGGGGGATGGATACGCTTTCGAAACATTCCCCTCCACTTATGGAGCTTAGCCACCTTTAAAGCCATTGGTGATTTTTTTGGGGGTTTCATTGATTACGCTGATTCAAACTCGAGCCTGGTGGATTGTGTAGAAGTGGCAAT
AAATGTTCAAGGAAATTACTGTGGTTTCATTCCCTCAGAAGTCCAGATCATTGACGAAGATAGGATCTTCATGACACAAGTGTTAACATATCAGAATAGCAATCTCCTGATCGGAAAGGTCGTCGGAATCCATGGAAGTTTCAAACGGGAAAGTGCGAAATGCTTTTACGACGAGAAAGAGGGAACATGGCTTTGTCCGATAGATCGTTGGCGACTTGAAACAGATAACTTATGCCCAACGGTCAAAATATTTTACCCTCCTTTTCTCGTTTGTCAGACAAGACAAAAAGCCCTAGAAGTCCGGAAAGGAGTTAATTACCCGTTTGATGATTTGACCTGCTCCGTGCGCCAGAACATTGTAGGGATGGGACCCACTCACGAGGTTGGACAAAAAAGAAAAACAGGGGAGAAGAAGATGGTGTCCTTTAACAAATCAAATAAGATACGAAATTTTGATAAAAAGGGGAAAGTTTGCTCGACCTCCAAAGAAAACCAAAACAGCCCCTTACGGGCAAAGGGACGACTGAAAGAGAAAGAAAAAGACCACGCGGAGTTGTCAGAGTTCTCTCTCTCCAGCTTCGAGAGCAGCAAATCCTTGGCCGCTTCCCTTCCCAACGTGCAGGACGAGGACTTTCCAGAGGATTATCACCTCTGTTTTAAAGAAGACGACCCAGAGGAAATTACCGTTTTTACAGAACCACATGGAACTAAAGAGTGAGACTTAGTCCCAAACAGGAGTAAGGAAAACAGAGTCGCACCCCGCCACACTCTACCTTTCCCTCATCCTTTCTCTAAGAGGATGGTGAAGGCCCTTAAAGGAGTGGAATATGTGTATAAGGGCCATTTCTACAAAAGGACAATCAGACAAAAAAAAGAAGACGAGGGAAATCATCAGTCTTCTAAAAACTTGGGAAAATGAATCGAGGGAAATCATCAGTCTTCTAAAAACTTGGGAAAATGAATCAAAAGAGGAAGAGGAGCAGGATGATGAAGGTGAAGAAGTAGATTTGAATGATTCGGGCACTGAGGTGGTCCCATGAATATCATCTCCTGGAATGTAAGAGGTTTAGGTGCTTCCTCCAAAAAAGCCTTGGTTAAAGACTCCCTCATTAAAGTGAGTCCTAATGTAGTCATCCTTCAAGAAACTAAGCTCTCAGTAGTTGACAGGAAAACCATAAAGGAAATTTGGAGTTCAAGACGGATAGGTCGGGTTACTCTTGAAGCTGAGGGTAGGAGGCATTCTCATTTTGTGGAAGGAAGATTTTGTTGAGGTACACGATGCTATCCTTGGCTCTTTTTCCATTTCCATCTCTTTTACCTTTAATAATTTAGTTAAAGGGTGGATCTCGGGGGTTTATCGGCCTTCTACCCCCTTTAGAAGAGACCTCTTTTGGCAAGAAATTGCAGATTTGGCGGATCTATGTTCAGATTGCTAGTGCTTGGGTGGAGATTTCAACGTGGTCCGGAACCCAAGCGAGAAATCCAGAGGTGGCAGAGTAACACAAAGTATGAAGGTTTTTAATTCCATAATCGATGATCTAAGGTTGATGGATATTTCCTTGAACAATGCCAGGTTCACTTGGTCAGACAACAGAGAACAGCCTTCTTTCTCTCGTTTGGATAGATTTATGATCACCCAGCTTCCAAGGAAGTTGTCCTTCAGAGGCTCACCAGTACTACCTCAGATCACTTCCCTTTACAGCTTTGTTCAGGTTTTCAAAAGTGGGGTCCCTCCCCCTTTCGCTTTGAGAATATGTGGCTCTCTCACCCGTTTCAAGGTTTATATTGAGGAGTGGTGGAAGAATATGAGTCCCCAGGGCTGGGCAGGCTATGGGTTTATGGAGACACTAAGAGGTCTTAAAACCAAGCTAAAAGCTTGGAATAGGGATACTTTTGGCGCCCTAGAGCAGATGAAAGTCGACCTTCAAGCTAGGGTCAACCACCTAGATAAGGAGGAAGAGAACGGCCCCCTGAGCTCAGCTTTGTTCACTGAACGCTTA
GATCTCAAAGCCTCTCTTCTCGAAGCCACTCTCAAAGACCAGAGGAGATGGGCAAAAAAATGTAGGCTGGATTGGCTTAAAGAGGGAGACGAGAATATAAGCTTCTTTCACAGATGGGCATCCCAACGAAAATGCAGAAATGTTATCTCTATCTTAGAGGATAAGCAAGGATTAACCCTCTCTTCGGCAGATGAAATCGAAAAGGAAATTATTAGCTTCTTCAATTCACTGTACAAAGAAAAACCTGGGCCTCGCTTCTGTTTTGAGGGGATGAACTGGAACCCTCTAACTCATGGAGACAGCCAAAGTTTGGAGACTCCTTTCAGTGAAGAAGAAATCCATCAAGCGATTCGGAACTTAGGCACCCTCAAGTCCCCAGGGTCGAATGGCATGACAAATGAATTTTATAAACAATCTTGGAACATCTTGAAATGTGACCTCATCAAAGTGTTCCATGATTTTTTTCAAAATGGGATTACCAATCGACGCACGAATGTGACGTACATTTGCCTAATTCCTAAGAAGAGGGTTGCCAACAAAGTTAGTGACTTTCGCGCAATCAGCTTGGTCACCTCTCTCTATAAAATTATTGCCAAGGTCCTTGCCGAAAGGCTTAGAAAGTCCCTCCCTGCTATTATCCATGACACTTAGGCTGCATTTGTGGAGGGTAGGAACATCTTGGATGCCATTTTGGCAGCCTCAAAAGGTGTAGGCGATTGGAGAGCTAAGAAGAAACAAGGCTTTGTGCTGAAGCTAGACTATGAAAAAGCCTATGACATGGTTGGCTGGAATTTTTTAATTGCTGTCTTAAAGAAGAAAGGGTTCGGTACAAGGTGGTGCAAATGGATTAGAGGCTGTATTACATCGATCAATTTCTCAGTTTTGATAAATGGAAGACCTAGAGGCAAAATTTCAGCCCAAAGAGGCCTATGACAAGGCTGTCCCCTCTCTTCCTTTTTTTTCACTCTTATTGGAGATGCCTTCAGCTGTTTAGTTCACTTTTGCCATGAGAAAAGGATTCTTAAAGGCTTCTCGGTGGGAGAAAAGAATGTAGCTCTAACCCACCTTCAATACGCAGATGATACGATAATCTTCAATCAAGGTAACATCGAAGATATCATGGATTGGTGGGGGCTCATCGCCCCGTTCATCACTGGGGCAGGCCTCTCCATCAACCTTGACAAAACTTCTCTAATTGGGGTCAACGTCGAAGAGTCCACCATTGGATTGGCTGTAGAATAGAGAAGCTTCCTTTTATGTATCTGGGTGTCCCCCTTGGGGGAAAAGCTAAATCTGTTGGGTTTTGGGAGCCAATGGTGGAAAAGTTTAGATGTAAGCTGTCCAAATGGAAAAACTTGAATCTTTCCAAGGGCGGACACCTCACTATGGCACAATCGGTGCTTAATAGCCTGCCAATCTACTACTTTTCCCTTTTGAAAACTCCCCTCACGCACGGAGGCCTTGGCATAGGCTCTTTCGAGCAGCGGAATAATGCCTTGTTGCTCGAATGGCTTTGGTGCTTTACCCAAGAAGAAGGAGTCCTTTGGAGAAGAATTGTGGATAGTATATATGGCACCCAATCTCACGGATGGGTGACCGCTATACCAAAGGGAGCCGTGAAAGGAAGGCCGTGGTTTGACATAGCTCGGAACATCAACCTTTTTCTTAAGTTCGTAACCTTCAAAGCTCGCAACGGGCAGAGAATCAGGTTTTGGAAGGATCACTAGGTGGGGACAGAACTCTAGAGGCAACTTTTCCTAATCTCTACATTATCTCAATGAAGAAAGATGATACAATAGCTAGCTGTTGAAATGCAGAGCATCAAGACTGGGACTTGGGTTTCAGAAGGGGGCTTTTTGACAGAGAATTTGAAAGCTGGTTGGGCCTCACAAACAAGCTAGACACGGTTAGTTTGGGAGATTCAAAAGATGGAGTCATCTGGAATATTGACAAAGGGGGTACTTTCTCAACTAAGGCTACCTTCATTCGCATCAATAAAC
CTTGTAGAACCCTCCAGCTCCCCCTTTTTAAGACCATTTGGAAATCTAAGATCCCAAAGAAGATAAAAGTTTTTCTTTGGTCGGTAGTTCACAGAAGCTTAAACACTCATGACCGCCTACAGAAGAAATGTCGAACCTGGTCGCTCTCCCGCTCAGCTTGCCCTCTCTACCTCAGTGAGGAAGAAACTCTAGATCACTTATTGCTCCACTGTCCCATAGCCTATAAGTGCTGCACACAAGCTTTTAATCTGTTTAACCTTGATTACTGCCTTCCCAACAAGGTTGATGAATGGCTCTTGCAAGCTCTAAATGGAGGATGCTTCAGTAATGCTAGAGGAACCTTGTGGTTTTTTGTGGTGGCTGCAATCTTGTGGAACACATGGAGGGAAAGAAACAATAGAATTTTTAGAACAAATCCTCAGATTTTCCTTTTTTGTGGAGTAATGTATAGCTTACCATGTCTAGGTGGTGCTACAGTTAACAGCAAATTCTTTTGTAATTACTCCTTAAGCTCTATACTCCGGAGGACTTTTCTGTAATAGCTCTTTTGGGGGGGGGGGGGCTACCCCTCGCCCTTAGGTTGTCTTCTTTTTTGTGATCAATACTTACATCTCATGTCTCTTATCAAAAAAAGAGATATTACATGTAAGGATAATGTCACCTCCCTAAGCAATCAAAATTGCACTTTTGTCATAGTGTTACAAATGGTGTTATCTCATCGGCTTGACTTTGAAGGAAATGGTCAAATTTCAATTATGCAACTTTGCCATTCTCAAATCCTTCATTTGAAATATCATGCATTGCTTTTGTGTTTTTACATTTATTTACTTCAAATTTCACTAGAAGGGATGGTTTTTTGCGTGCAGGACCAGGGTTTTAGAAGGAACACGCAAGGTGGTAACCGGGAGAAGGTCAGCCAGATTAGAGATAAGTTTGAATACGACAGGGACAGAAGAATGAGAGAAAGAGGTTAGTTTAAATGAATAAATTGTTGTTTTTTCCTTTGCTTTCTGCAATTCATTAATCGCTTGGTTCTTGATTTTACTATCCACCATGATGATATTCATTAGCCCAAGAAAATGGATGTTTGATCAAATTGTGCCTTCAAATGCAATCTGGCTTTCTCATTGCTTGAACAAACTAAATGAAAAATTACAATGAAGATTGAACGAGAGTAATAAATCTGAGCTAATTTATCCTTTTTATGTACTTTAGGTTTGATATTGCTTGATTGGAGCCCTTTTTTAGTTTCGCTCCCCTTTGTTGGGCTGTTTTTTCTGTTCATCCTTGCGTATTCTCTCATTTTTCTCAATAAAAGCTCGACTTTTCCTTAGGAAAAAAAGAATGTTAAGATTACTATAAGGCTAGATGTTGGAACTGTTATTAGGACAGTAAAAGTATGTTAATCAGCAACAATGAATGAGTATTGTGGGAGTTGGTTATAAACAGAATGAGTGAGGGGTGAAGGAAGCAGTTGTTTTGTCTTTAGAAAGAGTTGTCTTAAGTTTCTATAATTACTTGGGTATTTTTCTGTATTTTCTTACTCTAGAAGTTATTTTAATACAATTTTCTATTACAATTTTTTGGTGTCTATCAATATCTCGCTAGTGTAGCAATTCCTCATCCCGTTTACTTTTGTATTGCAAGTTAACTCTTAGGATTATGATTACTTGACTATTATTTAATTTAAGGTTAAATAATGCCCCACCCAGTTGGAGTCTCTTGGTCATACTGGCTTAGAGGTCTCAAGTTTGAATTTACCGGTGAGCTTAACACCAAAAACTCACCAAAAACTCCTGATGTCTCTCGGATCCGAGCTTGGGTGTCTTTGTGTATAGGGGAGCAAAGCTCCGACTCCCGATTTGAAAAAAAGAAAAGAAAAGTGAGTAGATACAATCCTAAATGTTTAGTAGGTCCTGAACTTTCAATTATGTGTCTAGTAAGTCTATTTTTTAAAATCCTGAAATCTATTGGACACAAAACTAAGTTTATGTCTCAAT
TTTGTGGTTCATGGTCAAGGATTTATTAGATTCGAGAATTACTAAACACCTTTTAAAATCCAGGGACGTATTAGACGCAACATTGAAAGATTAGGAATCGATTAAATACTTTTTAAAATTCAATGACCTATTAGACACAACTTTAAAAATTCAGAGAGTAAACTTATAATTTAACTTTAATTCACGTCGGCTCACTATTTAGTTTTCATCTACCTTTCGCTTTCATTTGGATTTATGAATTCACAAATATTAGTTGAAGTGCAAATACTAACAAATAGTACCGTAACTTTCAGCATTTGCACCCATACGTGGCGATCCCGCTTTTGTATCACAGAATCCAAATTCCCGGATGCAACGAAGAGATGTGGAGCCGCAAAGGTACTTTTCCGAGAGCGACAGTGACTGAAATGGACTAAGATTTATGTGCGTCTTTGTAAAGGCGAGGTTAGGAATGAGCTTACAGTTCATCTTTCCCTCTGCTGCCTGTTGTTACCAGTATCTGCAGCTGATAAGGTGTCTTGTCCATCTCAAGTTTTAAAACAACCAGAGTCGTTTTGGTTGTTTTGATAGATTCAACTCGAGATCCACGGTTGGTTTCGATTTTGTATTGATATGCTCCATAGAATGAACACAAATGAACTGACCTTTTATTACCGTTATATTCAGACGACGTTACCGATTGAGAGCTTGGTGATGGATATATTTAAATGTAGGGTCCATTCTCCCTAGCCTCTCGCAATTTTTGCCTTGTTTTTCAGTCTCCAAAGTAAGACTAACGAATGAGGGATCTTACCTTTAAATCTTAAGCTTTTGTAACAATTGCATCTCATGTGTGAACTTTGCAGTAAAAAGTACTTCAAGGAATTGAAACTACCCCTCATGCATATTCAAACACTGTTTTGGGAGTTTTTTTTGAAAGAACCCGAACCAGTAACAATATATTGAATTATAAAAGATAGAATAACTCAAATAACTCTCGAGTAATTTGAGATACTTGGACCTCTCCCTCTTTATAATCAAACTTCCTAACCCAATTACTAATATATCCTTAATACCTTAATAATATCCTCCATTTATCCCAATACTCACACGTTCAAGTTTTTTCATGTACTTTTAAATTTGTAATAATTTAATCTTTGATCTTTAGTATGTAACTTAGTACTTGTCGGGAAAAATGTTTGAAGGGCCAAATGTGTTTCTAGATGGTAAAGCTTACTATTACATAATGAAAATTGATATTTAATTTTGATGAAAATTAAATACTAAATTGCTATGCATTAAAGTTTATGGACTAAATTATTTGTCCTAATATGAAAGTATAGGGTACAAAAATGTGTTTTTAACCTTGATAAGTTTATGATTTTGGATGGTGGAAACTCTTTTTTCACTTCTTGGTCATGAATTTTTAAATGTTGCCGAAGTGAGCATTGCTCAGCGGTAATTGACATATATCTCCAGCCAAGAGGTTGTGAGATCGATCCCCACACCATGTGTTGTACTAAAATAATAAACGGTTAAGTGTTGCAATTTAACTAAATGTTCAAGCAAAGCTATTTATGTGTATATATTTACCACTTTTAATCTTAATTTTAACATAAATTTAATGGAATGATTGTTTAAAAAGATTTATTTAAAATGTATGAGTTAAAATAAAAATTTTAAAAATGATACACTAATTAAACTTATTCTAAAGCTTTTTAATTTTTATTATTCGACCACGTGTAGTTGGAGATTGGAATCTCTGATCTCTAATAGAGGTATAATCCATTTATATGACTTTATATTGATCGCTTTTTAAAAGTTTAGAAACTAATATAGACAAAATAAATGTAAAGCGAAGAGATTAGTGATTGAAGCTTTTTCATTTTTTAGTAAAACATGTAGGAGAGAGGATTCAAACTCACAATCTTATAGTTGGAGATAAATAGCTAAATTAATCGAGTTAAGCTCAGGTTGACGTGAGTCAATATACTTTAATTGAAGTAATGACTTAGTCCTAA
ATTGTAATGTATAATGATTCAATCTCTAAATTTTAATATGTAATAATTTAGTTCTATAATTTTAAAAATTTAATAATTTAGTCCCTATCACGTCAAATCTGATTAATGTTTAATGAGACTTCTTAGTCACCTATGTAGATGAATAAACTAATTAGAAACCAAACATGTTATAAACAAGACTAAATAGTTTCATAAAAAAAAAATCAATACATCACATTAATAAAAATTTTCACGGTAGAAACTAATTAAATTATTATAAAATTAAAAGTATAGAGACTAAATTGTTAAGAAAACTAAATTGTTAGCTAATAAAAAATATCATTTATAAAAAAAATGAAATTAAGAAAAAGAATCACACCTCAAATTTATGGTACTAAAAGAATCAACTTGTTAAAAAAAAAAACTAACCATTCATGCCACGGACCAAAGAGATTTAAATTTAAAAGCTACATTAATCTTTTCATTTTTAACTTTATTTTTTTTTTTTTTTGAGATTATGTTTCTTCTCTCACAATTTCTTTTATCGTAATTTCCACTTTTATATATGCTATAATAATGGACTTTCAAATCAATTTTCACAGGCATTATTAAACTTTTAAAAAATTTTAAAGTCTTTTATTTTGAAAACGTTTTTTTTTTTTAAATAAGAACAAAGAATAATAGTTATAGTAGCGGAATTTTATTAAAAAAAAAAAAAAGTAGTAGGGGTTATAAGCTTGGAGCTGGCTCGCGCCAGAAGCAGCAGTACGGCGGCGATGAAGGCGCACCACTGTCACCGTCACCTCCTAAAACCCAACTCCGCCGCCGCCGCCGCCTTCCCTTCCGCCATTCCGGCCACCGCGCATCGTCACTTCGCCACCAAATACACCGCCAAAATCACTTCCTCCTCTCCCACCGGACGCTCCGTTTCCGCAGAGGTCCGGCCGCCGGCTCCTCTCCCCGTCGACTCTCGCGGCTACACTCTACCCCGCAGAGATCTCATCTGCAGAGCCCTTCAGATACTCCTCAACCGCAAACCATCCGCCATTGACGATCGCTTCTCCGATTTATCCTCCTACTTCCAGTCTCTCTCCGTCTCCCTAACTCCAGCCGAAGCCTCTGAAATCCTTCGATCCTTGAACTGCCCCGACCTCGCTCTGCAATTTTTTCAGCTTTGCCCCTCCATTTGCCCTAAGTTTCGCCACGATGTCTTCACTTACACCCGCTTCCTTCTCATACTCTCCCAATCCTCTTCCCCGAAGCGGTTCGATCACGTTCGGGAGATTCTGTCGCGGATGGATCGAGATCAGATACGCGGTACCATTTCCACTGTTAATATCTTGATTGGGATTTTTGGTAGCAAGGAGGATTTGGAATTATGTTATGGGTTGATTAAGAAATGGGACTTGAGGCTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTACGGTCTCATGATTCAGATAAGGCTTTCAATTTGTATATGGAAATGAGGGGTCGAGGGTATAAGCTGGATATCTTTGCCTATAATATGCTATTGGATTCTCTGGCCAAGGATGAACAGGTTTGAATTTGATTCTAAAATGTTAGAAATTATGCTTGGTTCTATTTGAAAAATGGCTGTTCACTGGTTTCTTATGTTCCAAATTCTTTTGTTGTGTTCTATTTAACAGCTTGATCGAGCTTACAAAGTTTTTAAGGATATGAAATTGAAGCATTGTAACCCAGATGAGTATACGTATACTATTATGATTAGAATGACTGGAAAGATGGGTAGAACTGAAGAGTCTTTGGCTCTCTTTCAAGAAATGCTAGCAAAAGGATGTACTCCGAATGCAATTTCATATAATACAATGATCGAGACACTTTGCAAGAGTAGAATGGTTGACAAGGCAATTCTTCTCTTTTCTAATATGGTTAAGAACAATTGTAGGCCAAATGAGTTCACGTATAGTCTCATTTTGAATGTTTTGGTTGCAGAAGGACAGCTTGGTAAATTGGACGAAGTTCTGGA
AGTGTCCAATAAGTTTATGAACAAATCAATATATGGATATCTTGTAAGGACTCTAAGCAAACTAGGCCATGCGAGTGAAGCTCATCGTATTTTTTGCAATATGTGGAAATTTCATGATAGAGGCGATAGAGATGCTTACGTTTCCATGTTGGAGAGTTTATGCAGTGCAGGTAAAACTGTAGAAGCAATGGAACTGCTCGATAAGGTTCACGAGAAGGGAGTTAGTTCTGATACTATGATGCATAACACACTGTTATCTACTTTGGGGAAGTTAAAGCAAGTATCTCATCTTCATGATCTTTACGAGAAGATGAAACAAGATGGGCCGTTGCCTGATGTATTCACATATAATATTCTTATATCAAGCCTAGGACGTGTTGGAAAAGTTAAGGAGGCTGTTGAAGTTTTTGAAGAACTTGAGAATAGTAGTTGTAAACCAGATATTATATCTTACAATTCTTTGATCAATTGCCTTGGGAAAAATGGGGATGTTGATGAAGCTCACATGAGGTTTCTAGAGATGCAAGAGAAGGGATTGAATCCTGACGTTGTAACGTACAGCACGCTCATTGAATGTTTCGGGAAAACAGATAAAGTTGAGATGGCTCAGAGTTTGTTTGATAAAATGATGGCTCAAGGATGTTGTCCAAATATTATAACGTACAACATCCTGCTTGACTGTCTTGAGAGAGCTGGGAGAACTGCTGAAACTGTTGATCTGTATGCAAAACTTAAACAGCAGGGATTAACACCCGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCGACTCGAAAATTTAGAGTCCGAAGGCAAAATCCAATTACTGGCTGGGTTGTTAGTCCTTTAAGGTAATATTGTGCAAGTTTTTAAGGAAAAGAAAAGAAGAAGGATGCTGACAACTCTATCTATTTGTCTGATACGCATCTCTCTTATGTCTAGTTTCTTTTAGTATAAAAGCTTCCAGCGCAGCGCCAATTAAGGGAAAGGTTGTGTCTATCATGACGGTTCATCCAATGATTAGCTCGAGTTAAGTTCAAGCTTTGATTCGTGTTTGCCTAGTTCTTGGGCTAATAAGGGAATGTCCATTGTTGGCCGTGAGTCTCTCTTTGAGAAGACCGACTAGTGCTTCGTCAAGCTTTTGGTGTACCTTCTCCTCTTAGGGGTTTTCTGGCATACTTTTACCAAGTAGTTGTCAGAGACAACGTACAGGAAGTAACATAGTTTCTCAGATATCTAGAAGAAATCCATCATATTTTTTTATTCTTGGGTAAAGAAAAAAATGCCGAAGTTTTTATCTTCAAAATATTCGTTGAGGTTTCTTCAGAGGTTTAAGTGGTCTAGCAAAGAAAATGTAGAAGTTCTTGGGTTGTGCAGCGTGAATATCATTATAGCTTTCAGGAGAATGTAACTGAAGCCTGACTTGAGGAATCCTCCTGGTTTGAACCTGAAATCTTGGAATTGGATCCTTTTGTAATTATGTTTAGGAGGTTTGAACCTGAAATCTTGGAATTGGATCCTTTTGTAATTACGTTTGGGAGGTTTGAACCTCCAACCTCAAGGGAAGGAGTAGACGAGAATCACCGTTGATCTATGTTCATGTTGGCCTTTTCAATTATGTTAAGAAACTGTATTGGAATACATCTTTCTTGAGTCACACAAGTTCTCTTGTAAAGAGAAATTAAAATGGTCTATTTCAGGAGGTTTAATAACTGTCCAGCAGAAGCTTCATCCAATTAAAAGTCAGCTCAATTTTCCACTTGGGCTCGAATAGCCCGATGATTTTCGAAGGTATCTCGAAACATCCTGAGCTCTTACCATCTTGACCAACCTGTATGTAAGCAACGTCACCTGGTTAACTCTGTATGTTTCTCTTTTTCCTGTTTTTGTCAGTTAATCCTCATTGTTTTACATATATGTACACTCCCTATTAGACAAGGCTGTAGTTTTCTTCATTCTAATATTTCTTTCCTCAGAAGATAATTCACAA
AGGAAGGCTGTACTCTCAATGTAACTGAAGTGACTGGATTCAAATGTTATGTTAGCTTTATTGAGAAGTATAGTAAAATGAATCTTCCATTTTCATTCAGTTGATAAAACAGCACAAAAATCTGAAATCCCATGTTGGTCAATTAGAGAAGTTCATTGTAGAAGTTTGGGTATGAGCCTAATTTATGGTTCACCCCTTTCCAAGGTGACTGCTATGATAAGCTTCTGAAATCTTTTCAGAGAACAGCGATTATGTTACAGTCTCTGGTTCATGAAATGGTAGATCAGGGATTATTGTGAAGGAGCTGTATAGTTTAGCGTATATAGAGCTTTTCAACAAAAAAGTTGGATGTTCTCTGAAATTCATGTAATAGGTGGGTCTAATAAAGTCTCTAAAGGAATTGCTGAACAAAAACCAGAACCTGTGTGATGAAATGGAGATGGGCAAGTCTCCAAAATGATGGATGCAGAGCTCTCGCTCTCACTCTCAGTGAAGAAGATGTTGAGAAAATTGTGGGTTCCTTCTGTCAAAATGCAAATGAAACATTTAGCGAAGTGTACACAAATGATGAAGGTGAGCCAAATTTGAAGGGTCAAATGACACTCTGTGTGAGTTCAATTTGGGTGTATGGGTGTTTGATGAGAGAAACAATAGTGATGGAGAAAGAAGTGCTTCAACTGTTGAAACTGGAGAGTTCATCCACTCATTTGAATGTGCAAGAACTTCCAACAAAAGTAAATGCTTACCAGAAATAAATAAACTGAGCACAAATTAAACACTTGGTGATATGTTTATAGTTATACTTGGTTTCTGGTCTGAGTTGTGCAAGAGCATGAGATACAAAGAAGGATTTAAATGCCTTGATTGGTCTTTAAATCAAAAGGCCAGTAACTTACTGTACCTAAAATATCTTGATTTGTGCAAGTTTGCTTTATTGCAATAATTGTAATTTATTTCAAATACTTCAATGGGTTGTAAATGTAATACATTTCAAGTTTGCTTTTTCTCTGTCCTTACTTTAATTTAGTCCCCTA

mRNA sequence

CTTTTCTTAGACTTAGATGTGAAAAATTTTATACAATTTTCGCTTCCTTTTTTGCTTTTCTAAATTATAACGAAAAAACACAATGTACTCCGTCCTCCCGGCAAAAAACCCATAAACCTTCCAAAAACCCCTTCCAAAACGTTTCAGAGCTCTCCCAAATTAGCTTATGCTCTCGTTCACACACAGACTCCGGACCTTGTCCTCATCTCCATGCTCCAAATCTCATCTTCTTCGATCCCTCCGGACGGACGCCGGACCGCCCAGGCGCCGGTCGAAGCCGGCGGCATTTGCTGCAAAGAACGCGGACGAGAAGTCAGAGTGGTGGGCAGTTGACGGCGAAATGCACGAAATTGGCGAAAATGTGCCGCCTCGCGAGCGCTTCGTTATACCCAGAGAAAATATCCCCAATCGGCGCCGGAAGCAGCTCAGGGAACAGTTCATGCGCCGGACTCGCCTCGTTCTCAAGGAATCTGAGCATGAGCCTTGGTGCAAAAAGTACATGGAGCTTTATCAGGAGCTTAGAGAAAACTGGGAGAGGCTGTACTGGGACGAGGGATACTCTAAAAAACTTGCCCAGGAACATGCAAATTATGAGTCTGCTGAAGATGAGGATTTCTCCCCTTATAGGAATAGGCAATCCAGTGTTGATCAAAGTAAGGACCAGGGTTTTAGAAGGAACACGCAAGGTGGTAACCGGGAGAAGGTCAGCCAGATTAGAGATAAGTTTGAATACGACAGGGACAGAAGAATGAGAGAAAGAGGGGTTATAAGCTTGGAGCTGGCTCGCGCCAGAAGCAGCAGTACGGCGGCGATGAAGGCGCACCACTGTCACCGTCACCTCCTAAAACCCAACTCCGCCGCCGCCGCCGCCTTCCCTTCCGCCATTCCGGCCACCGCGCATCGTCACTTCGCCACCAAATACACCGCCAAAATCACTTCCTCCTCTCCCACCGGACGCTCCGTTTCCGCAGAGGTCCGGCCGCCGGCTCCTCTCCCCGTCGACTCTCGCGGCTACACTCTACCCCGCAGAGATCTCATCTGCAGAGCCCTTCAGATACTCCTCAACCGCAAACCATCCGCCATTGACGATCGCTTCTCCGATTTATCCTCCTACTTCCAGTCTCTCTCCGTCTCCCTAACTCCAGCCGAAGCCTCTGAAATCCTTCGATCCTTGAACTGCCCCGACCTCGCTCTGCAATTTTTTCAGCTTTGCCCCTCCATTTGCCCTAAGTTTCGCCACGATGTCTTCACTTACACCCGCTTCCTTCTCATACTCTCCCAATCCTCTTCCCCGAAGCGGTTCGATCACGTTCGGGAGATTCTGTCGCGGATGGATCGAGATCAGATACGCGGTACCATTTCCACTGTTAATATCTTGATTGGGATTTTTGGTAGCAAGGAGGATTTGGAATTATGTTATGGGTTGATTAAGAAATGGGACTTGAGGCTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTACGGTCTCATGATTCAGATAAGGCTTTCAATTTGTATATGGAAATGAGGGGTCGAGGGTATAAGCTGGATATCTTTGCCTATAATATGCTATTGGATTCTCTGGCCAAGGATGAACAGCTTGATCGAGCTTACAAAGTTTTTAAGGATATGAAATTGAAGCATTGTAACCCAGATGAGTATACGTATACTATTATGATTAGAATGACTGGAAAGATGGGTAGAACTGAAGAGTCTTTGGCTCTCTTTCAAGAAATGCTAGCAAAAGGATGTACTCCGAATGCAATTTCATATAATACAATGATCGAGACACTTTGCAAGAGTAGAATGGTTGACAAGGCAATTCTTCTCTTTTCTAATATGGTTAAGAACAATTGTAGGCCAAATGAGTTCACGTATAGTCTCATTTTGAATGTTTTGGTTGCAGAAGGACAGCTTGGTAAATTGGACGAAGTTCTGGAAGTGTCCAATAAGTTTATGAACAAATCAATATATGGATATCTTGTAAGGACTC
TAAGCAAACTAGGCCATGCGAGTGAAGCTCATCGTATTTTTTGCAATATGTGGAAATTTCATGATAGAGGCGATAGAGATGCTTACGTTTCCATGTTGGAGAGTTTATGCAGTGCAGGTAAAACTGTAGAAGCAATGGAACTGCTCGATAAGGTTCACGAGAAGGGAGTTAGTTCTGATACTATGATGCATAACACACTGTTATCTACTTTGGGGAAGTTAAAGCAAGTATCTCATCTTCATGATCTTTACGAGAAGATGAAACAAGATGGGCCGTTGCCTGATGTATTCACATATAATATTCTTATATCAAGCCTAGGACGTGTTGGAAAAGTTAAGGAGGCTGTTGAAGTTTTTGAAGAACTTGAGAATAGTAGTTGTAAACCAGATATTATATCTTACAATTCTTTGATCAATTGCCTTGGGAAAAATGGGGATGTTGATGAAGCTCACATGAGGTTTCTAGAGATGCAAGAGAAGGGATTGAATCCTGACGTTGTAACGTACAGCACGCTCATTGAATGTTTCGGGAAAACAGATAAAGTTGAGATGGCTCAGAGTTTGTTTGATAAAATGATGGCTCAAGGATGTTGTCCAAATATTATAACGTACAACATCCTGCTTGACTGTCTTGAGAGAGCTGGGAGAACTGCTGAAACTGTTGATCTGTATGCAAAACTTAAACAGCAGGGATTAACACCCGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCGACTCGAAAATTTAGAGTCCGAAGGCAAAATCCAATTACTGGCTGGGTTGTTAGTCCTTTAAGGTAATATTGTGCAAGTTTTTAAGGAAAAGAAAAGAAGAAGGATGCTGACAACTCTATCTATTTGTCTGATACGCATCTCTCTTATGTCTAGTTTCTTTTAGTATAAAAGCTTCCAGCGCAGCGCCAATTAAGGGAAAGGTTGTGTCTATCATGACGGTTCATCCAATGATTAGCTCGAGTTAAGTTCAAGCTTTGATTCGTGTTTGCCTAGTTCTTGGGCTAATAAGGGAATGTCCATTGTTGGCCGTGAGTCTCTCTTTGAGAAGACCGACTAGTGCTTCGTCAAGCTTTTGGTGTACCTTCTCCTCTTAGGGGTTTTCTGGCATACTTTTACCAAGTAGTTGTCAGAGACAACGTACAGGAAGTAACATAGTTTCTCAGATATCTAGAAGAAATCCATCATATTTTTTTATTCTTGGGTAAAGAAAAAAATGCCGAAGTTTTTATCTTCAAAATATTCGTTGAGGTTTCTTCAGAGGTTTAAGTGGTCTAGCAAAGAAAATGTAGAAGTTCTTGGGTTGTGCAGCGTGAATATCATTATAGCTTTCAGGAGAATGTAACTGAAGCCTGACTTGAGGAATCCTCCTGGTTTGAACCTGAAATCTTGGAATTGGATCCTTTTGTAATTATGTTTAGGAGGTTTGAACCTGAAATCTTGGAATTGGATCCTTTTGTAATTACGTTTGGGAGGTTTGAACCTCCAACCTCAAGGGAAGGAGTAGACGAGAATCACCGTTGATCTATGTTCATGTTGGCCTTTTCAATTATGTTAAGAAACTGTATTGGAATACATCTTTCTTGAGTCACACAAGTTCTCTTGTAAAGAGAAATTAAAATGGTCTATTTCAGGAGGTTTAATAACTGTCCAGCAGAAGCTTCATCCAATTAAAAGTCAGCTCAATTTTCCACTTGGGCTCGAATAGCCCGATGATTTTCGAAGGTATCTCGAAACATCCTGAGCTCTTACCATCTTGACCAACCTGTATGTAAGCAACGTCACCTGGTTAACTCTGTATGTTTCTCTTTTTCCTGTTTTTGTCAGTTAATCCTCATTGTTTTACATATATGTACACTCCCTATTAGACAAGGCTGTAGTTTTCTTCATTCTAATATTTCTTTCCTCAGAAGATAATTCACAAAGGAAGGCTGTACTCTCAATGTAACTGAAGTGACTGGATTCAAATGTTATGTT
AGCTTTATTGAGAAGTATAGTAAAATGAATCTTCCATTTTCATTCAGTTGATAAAACAGCACAAAAATCTGAAATCCCATGTTGGTCAATTAGAGAAGTTCATTGTAGAAGTTTGGGTATGAGCCTAATTTATGGTTCACCCCTTTCCAAGGTGACTGCTATGATAAGCTTCTGAAATCTTTTCAGAGAACAGCGATTATGTTACAGTCTCTGGTTCATGAAATGGTAGATCAGGGATTATTGTGAAGGAGCTGTATAGTTTAGCGTATATAGAGCTTTTCAACAAAAAAGTTGGATGTTCTCTGAAATTCATGTAATAGGTGGGTCTAATAAAGTCTCTAAAGGAATTGCTGAACAAAAACCAGAACCTGTGTGATGAAATGGAGATGGGCAAGTCTCCAAAATGATGGATGCAGAGCTCTCGCTCTCACTCTCAGTGAAGAAGATGTTGAGAAAATTGTGGGTTCCTTCTGTCAAAATGCAAATGAAACATTTAGCGAAGTGTACACAAATGATGAAGGTGAGCCAAATTTGAAGGGTCAAATGACACTCTGTGTGAGTTCAATTTGGGTGTATGGGTGTTTGATGAGAGAAACAATAGTGATGGAGAAAGAAGTGCTTCAACTGTTGAAACTGGAGAGTTCATCCACTCATTTGAATGTGCAAGAACTTCCAACAAAAGTAAATGCTTACCAGAAATAAATAAACTGAGCACAAATTAAACACTTGGTGATATGTTTATAGTTATACTTGGTTTCTGGTCTGAGTTGTGCAAGAGCATGAGATACAAAGAAGGATTTAAATGCCTTGATTGGTCTTTAAATCAAAAGGCCAGTAACTTACTGTACCTAAAATATCTTGATTTGTGCAAGTTTGCTTTATTGCAATAATTGTAATTTATTTCAAATACTTCAATGGGTTGTAAATGTAATACATTTCAAGTTTGCTTTTTCTCTGTCCTTACTTTAATTTAGTCCCCTA

Coding sequence (CDS)

ATGCTCTCGTTCACACACAGACTCCGGACCTTGTCCTCATCTCCATGCTCCAAATCTCATCTTCTTCGATCCCTCCGGACGGACGCCGGACCGCCCAGGCGCCGGTCGAAGCCGGCGGCATTTGCTGCAAAGAACGCGGACGAGAAGTCAGAGTGGTGGGCAGTTGACGGCGAAATGCACGAAATTGGCGAAAATGTGCCGCCTCGCGAGCGCTTCGTTATACCCAGAGAAAATATCCCCAATCGGCGCCGGAAGCAGCTCAGGGAACAGTTCATGCGCCGGACTCGCCTCGTTCTCAAGGAATCTGAGCATGAGCCTTGGTGCAAAAAGTACATGGAGCTTTATCAGGAGCTTAGAGAAAACTGGGAGAGGCTGTACTGGGACGAGGGATACTCTAAAAAACTTGCCCAGGAACATGCAAATTATGAGTCTGCTGAAGATGAGGATTTCTCCCCTTATAGGAATAGGCAATCCAGTGTTGATCAAAGTAAGGACCAGGGTTTTAGAAGGAACACGCAAGGTGGTAACCGGGAGAAGGTCAGCCAGATTAGAGATAAGTTTGAATACGACAGGGACAGAAGAATGAGAGAAAGAGGGGTTATAAGCTTGGAGCTGGCTCGCGCCAGAAGCAGCAGTACGGCGGCGATGAAGGCGCACCACTGTCACCGTCACCTCCTAAAACCCAACTCCGCCGCCGCCGCCGCCTTCCCTTCCGCCATTCCGGCCACCGCGCATCGTCACTTCGCCACCAAATACACCGCCAAAATCACTTCCTCCTCTCCCACCGGACGCTCCGTTTCCGCAGAGGTCCGGCCGCCGGCTCCTCTCCCCGTCGACTCTCGCGGCTACACTCTACCCCGCAGAGATCTCATCTGCAGAGCCCTTCAGATACTCCTCAACCGCAAACCATCCGCCATTGACGATCGCTTCTCCGATTTATCCTCCTACTTCCAGTCTCTCTCCGTCTCCCTAACTCCAGCCGAAGCCTCTGAAATCCTTCGATCCTTGAACTGCCCCGACCTCGCTCTGCAATTTTTTCAGCTTTGCCCCTCCATTTGCCCTAAGTTTCGCCACGATGTCTTCACTTACACCCGCTTCCTTCTCATACTCTCCCAATCCTCTTCCCCGAAGCGGTTCGATCACGTTCGGGAGATTCTGTCGCGGATGGATCGAGATCAGATACGCGGTACCATTTCCACTGTTAATATCTTGATTGGGATTTTTGGTAGCAAGGAGGATTTGGAATTATGTTATGGGTTGATTAAGAAATGGGACTTGAGGCTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTACGGTCTCATGATTCAGATAAGGCTTTCAATTTGTATATGGAAATGAGGGGTCGAGGGTATAAGCTGGATATCTTTGCCTATAATATGCTATTGGATTCTCTGGCCAAGGATGAACAGCTTGATCGAGCTTACAAAGTTTTTAAGGATATGAAATTGAAGCATTGTAACCCAGATGAGTATACGTATACTATTATGATTAGAATGACTGGAAAGATGGGTAGAACTGAAGAGTCTTTGGCTCTCTTTCAAGAAATGCTAGCAAAAGGATGTACTCCGAATGCAATTTCATATAATACAATGATCGAGACACTTTGCAAGAGTAGAATGGTTGACAAGGCAATTCTTCTCTTTTCTAATATGGTTAAGAACAATTGTAGGCCAAATGAGTTCACGTATAGTCTCATTTTGAATGTTTTGGTTGCAGAAGGACAGCTTGGTAAATTGGACGAAGTTCTGGAAGTGTCCAATAAGTTTATGAACAAATCAATATATGGATATCTTGTAAGGACTCTAAGCAAACTAGGCCATGCGAGTGAAGCTCATCGTATTTTTTGCAATATGTGGAAATTTCATGATAGAGGCGATAGAGATGCTTACGTTTCCATGTTGGAGAGTTTATGCAGTGCAGGTAAAACTGTAGAAGCAATGGAACTGCTCGATAAGGTTCACGAGAAGGG
AGTTAGTTCTGATACTATGATGCATAACACACTGTTATCTACTTTGGGGAAGTTAAAGCAAGTATCTCATCTTCATGATCTTTACGAGAAGATGAAACAAGATGGGCCGTTGCCTGATGTATTCACATATAATATTCTTATATCAAGCCTAGGACGTGTTGGAAAAGTTAAGGAGGCTGTTGAAGTTTTTGAAGAACTTGAGAATAGTAGTTGTAAACCAGATATTATATCTTACAATTCTTTGATCAATTGCCTTGGGAAAAATGGGGATGTTGATGAAGCTCACATGAGGTTTCTAGAGATGCAAGAGAAGGGATTGAATCCTGACGTTGTAACGTACAGCACGCTCATTGAATGTTTCGGGAAAACAGATAAAGTTGAGATGGCTCAGAGTTTGTTTGATAAAATGATGGCTCAAGGATGTTGTCCAAATATTATAACGTACAACATCCTGCTTGACTGTCTTGAGAGAGCTGGGAGAACTGCTGAAACTGTTGATCTGTATGCAAAACTTAAACAGCAGGGATTAACACCCGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCGACTCGAAAATTTAGAGTCCGAAGGCAAAATCCAATTACTGGCTGGGTTGTTAGTCCTTTAAGGTAA

Protein sequence

MLSFTHRLRTLSSSPCSKSHLLRSLRTDAGPPRRRSKPAAFAAKNADEKSEWWAVDGEMHEIGENVPPRERFVIPRENIPNRRRKQLREQFMRRTRLVLKESEHEPWCKKYMELYQELRENWERLYWDEGYSKKLAQEHANYESAEDEDFSPYRNRQSSVDQSKDQGFRRNTQGGNREKVSQIRDKFEYDRDRRMRERGVISLELARARSSSTAAMKAHHCHRHLLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYTLPRRDLICRALQILLNRKPSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR
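
The relationship between the coding sequence and the protein sequence listed above can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper, not part of the database's tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS shown above.
print(translate("ATGCTCTCGTTCACACACAGA"))
print(translate("CCTTTAAGGTAA"))
```

The two fragments translate to the first seven residues (MLSFTHR) and last three residues (PLR) of the protein sequence; passing the full CDS, with its line break removed, should reproduce the whole polypeptide under the same assumption.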
Homology
BLAST of MC09g0822 vs. ExPASy Swiss-Prot
Match: Q9ZU27 (Pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g51965 PE=2 SV=1)

HSP 1 Score: 889.0 bits (2296), Expect = 4.3e-257
Identity = 425/635 (66.93%), Positives = 527/635 (82.99%), Query Frame = 0

Query: 246 RHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYTLPRRDLICRALQILLNRKPSA 305
           RH+ATKY AK+TSSSP+GRS+SAEV  P PLP D RGY LPRR LICRA  ++     S 
Sbjct: 21  RHYATKYVAKVTSSSPSGRSLSAEVSLPNPLPADVRGYPLPRRHLICRATNLITG--ASN 80

Query: 306 IDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSICPKFRHDVFTYTR 365
           + D FSDLS Y  SLS+SLTP EASEIL+SLN P LA++FF+L PS+CP  ++D F Y R
Sbjct: 81  LSDAFSDLSDYLSSLSLSLTPDEASEILKSLNSPLLAVEFFKLVPSLCPYSQNDPFLYNR 140

Query: 366 FLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLIKKWD 425
            +LILS+S+ P RFD VR IL  M +  + G ISTVNILIG FG+ EDL++C  L+KKWD
Sbjct: 141 IILILSRSNLPDRFDRVRSILDSMVKSNVHGNISTVNILIGFFGNTEDLQMCLRLVKKWD 200

Query: 426 LRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQLDRAY 485
           L++N++TY+CLLQA++RS D  KAF++Y E+R  G+KLDIFAYNMLLD+LAKDE   +A 
Sbjct: 201 LKMNSFTYKCLLQAYLRSRDYSKAFDVYCEIRRGGHKLDIFAYNMLLDALAKDE---KAC 260

Query: 486 KVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETL 545
           +VF+DMK +HC  DEYTYTIMIR  G++G+ +E++ LF EM+ +G T N + YNT+++ L
Sbjct: 261 QVFEDMKKRHCRRDEYTYTIMIRTMGRIGKCDEAVGLFNEMITEGLTLNVVGYNTLMQVL 320

Query: 546 CKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNKFMNKSIY 605
            K +MVDKAI +FS MV+  CRPNE+TYSL+LN+LVAEGQL +LD V+E+S ++M + IY
Sbjct: 321 AKGKMVDKAIQVFSRMVETGCRPNEYTYSLLLNLLVAEGQLVRLDGVVEISKRYMTQGIY 380

Query: 606 GYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHE 665
            YLVRTLSKLGH SEAHR+FC+MW F  +G+RD+Y+SMLESLC AGKT+EA+E+L K+HE
Sbjct: 381 SYLVRTLSKLGHVSEAHRLFCDMWSFPVKGERDSYMSMLESLCGAGKTIEAIEMLSKIHE 440

Query: 666 KGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKE 725
           KGV +DTMM+NT+ S LGKLKQ+SH+HDL+EKMK+DGP PD+FTYNILI+S GRVG+V E
Sbjct: 441 KGVVTDTMMYNTVFSALGKLKQISHIHDLFEKMKKDGPSPDIFTYNILIASFGRVGEVDE 500

Query: 726 AVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIE 785
           A+ +FEELE S CKPDIISYNSLINCLGKNGDVDEAH+RF EMQEKGLNPDVVTYSTL+E
Sbjct: 501 AINIFEELERSDCKPDIISYNSLINCLGKNGDVDEAHVRFKEMQEKGLNPDVVTYSTLME 560

Query: 786 CFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTP 845
           CFGKT++VEMA SLF++M+ +GC PNI+TYNILLDCLE+ GRTAE VDLY+K+KQQGLTP
Sbjct: 561 CFGKTERVEMAYSLFEEMLVKGCQPNIVTYNILLDCLEKNGRTAEAVDLYSKMKQQGLTP 620

Query: 846 DSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPL 881
           DSITY +L+RLQS S  K R+RR+NPITGWVVSPL
Sbjct: 621 DSITYTVLERLQSVSHGKSRIRRKNPITGWVVSPL 650
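
The percentages in the HSP statistics for the Q9ZU27 match are simply the reported counts divided by the alignment length. A minimal sketch that recomputes them from the fractions in such a line follows; `hsp_percentages` is a hypothetical helper, and the regular expression only assumes the `count/length` layout seen on this page, not any official BLAST report grammar.

```python
import re


def hsp_percentages(stats_line):
    """Return (count, length, percent) for every 'N/M' fraction in the line."""
    results = []
    for count, length in re.findall(r"(\d+)/(\d+)", stats_line):
        count, length = int(count), int(length)
        results.append((count, length, round(100.0 * count / length, 2)))
    return results


line = "Identity = 425/635 (66.93%), Positives = 527/635 (82.99%)"
for count, length, percent in hsp_percentages(line):
    print(f"{count}/{length} = {percent}%")
```

For this HSP the recomputed values agree with the reported 66.93% identity and 82.99% positives over a 635-column alignment.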

BLAST of MC09g0822 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 4.4e-60
Identity = 150/539 (27.83%), Positives = 272/539 (50.46%), Query Frame = 0

Query: 320 LSVSLTPAEASE-ILRSLNCPDLALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKR 379
           LS + TP  AS  +L+S N   L L+F            H  FT     + L   +  K 
Sbjct: 42  LSANFTPEAASNLLLKSQNDQALILKFLNWANP------HQFFTLRCKCITLHILTKFKL 101

Query: 380 FDHVREILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQ 439
           +    +IL+    D    T+      +     +E  +LCY     +DL + +Y+   L+ 
Sbjct: 102 Y-KTAQILA---EDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLI- 161

Query: 440 AHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQ-LDRAYKVFKDMKLKHCN 499
                   DKA ++    +  G+   + +YN +LD+  + ++ +  A  VFK+M     +
Sbjct: 162 --------DKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVS 221

Query: 500 PDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILL 559
           P+ +TY I+IR     G  + +L LF +M  KGC PN ++YNT+I+  CK R +D    L
Sbjct: 222 PNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKL 281

Query: 560 FSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNK---FMNKSIYGYLVRTLSK 619
             +M      PN  +Y++++N L  EG++ ++  VL   N+    +++  Y  L++   K
Sbjct: 282 LRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCK 341

Query: 620 LGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMM 679
            G+  +A  +   M +         Y S++ S+C AG    AME LD++  +G+  +   
Sbjct: 342 EGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERT 401

Query: 680 HNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELE 739
           + TL+    +   ++  + +  +M  +G  P V TYN LI+     GK+++A+ V E+++
Sbjct: 402 YTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMK 461

Query: 740 NSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVE 799
                PD++SY+++++   ++ DVDEA     EM EKG+ PD +TYS+LI+ F +  + +
Sbjct: 462 EKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTK 521

Query: 800 MAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAIL 854
            A  L+++M+  G  P+  TY  L++     G   + + L+ ++ ++G+ PD +TY++L
Sbjct: 522 EACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVL 561

BLAST of MC09g0822 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 230.3 bits (586), Expect = 8.3e-59
Identity = 150/587 (25.55%), Positives = 265/587 (45.14%), Query Frame = 0

Query: 304 SAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSICPKFR--HDVF 363
           S +  + SD S      S     + + E+ R L         F    S+       H   
Sbjct: 60  SVVSMKSSDFSGSMIRKSSKPDLSSSEEVTRGLKSFPDTDSSFSYFKSVAGNLNLVHTTE 119

Query: 364 TYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLI 423
           T    L  L      +   +V +++ +    +   T  T+   + + G  +        +
Sbjct: 120 TCNYMLEALRVDGKLEEMAYVFDLMQKRIIKRDTNTYLTIFKSLSVKGGLKQAPYALRKM 179

Query: 424 KKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQL 483
           +++   LNAY+Y  L+   ++S    +A  +Y  M   G++  +  Y+ L+  L K   +
Sbjct: 180 REFGFVLNAYSYNGLIHLLLKSRFCTEAMEVYRRMILEGFRPSLQTYSSLMVGLGKRRDI 239

Query: 484 DRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTM 543
           D    + K+M+     P+ YT+TI IR+ G+ G+  E+  + + M  +GC P+ ++Y  +
Sbjct: 240 DSVMGLLKEMETLGLKPNVYTFTICIRVLGRAGKINEAYEILKRMDDEGCGPDVVTYTVL 299

Query: 544 IETLCKSRMVDKAILLF-----------------------------------SNMVKNNC 603
           I+ LC +R +D A  +F                                   S M K+  
Sbjct: 300 IDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGH 359

Query: 604 RPNEFTYSLILNVLVAEGQLGKLDEVLEVSNK---FMNKSIYGYLVRTLSKLGHASEAHR 663
            P+  T++++++ L   G  G+  + L+V        N   Y  L+  L ++    +A  
Sbjct: 360 VPDVVTFTILVDALCKAGNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALE 419

Query: 664 IFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLG 723
           +F NM     +     Y+  ++    +G +V A+E  +K+  KG++ + +  N  L +L 
Sbjct: 420 LFGNMESLGVKPTAYTYIVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLA 479

Query: 724 KLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDII 783
           K  +      ++  +K  G +PD  TYN+++    +VG++ EA+++  E+  + C+PD+I
Sbjct: 480 KAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVI 539

Query: 784 SYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKM 843
             NSLIN L K   VDEA   F+ M+E  L P VVTY+TL+   GK  K++ A  LF+ M
Sbjct: 540 VVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGM 599

Query: 844 MAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITY 851
           + +GC PN IT+N L DCL +       + +  K+   G  PD  TY
Sbjct: 600 VQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKMMDMGCVPDVFTY 646

BLAST of MC09g0822 vs. ExPASy Swiss-Prot
Match: Q9M907 (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX=3702 GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 222.6 bits (566), Expect = 1.7e-56
Identity = 124/429 (28.90%), Positives = 228/429 (53.15%), Query Frame = 0

Query: 431 YTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQLDRAYKVFKD 490
           Y Y  ++  +  +   D+A++L    R +G    + AYN +L  L K  ++D A KVF++
Sbjct: 309 YAYNTMIMGYGSAGKFDEAYSLLERQRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEE 368

Query: 491 MKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETLCKSRM 550
           MK K   P+  TY I+I M  + G+ + +  L   M   G  PN  + N M++ LCKS+ 
Sbjct: 369 MK-KDAAPNLSTYNILIDMLCRAGKLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCKSQK 428

Query: 551 VDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNKFM------NKSI 610
           +D+A  +F  M    C P+E T+  +++ L   G++G++D+  +V  K +      N  +
Sbjct: 429 LDEACAMFEEMDYKVCTPDEITFCSLIDGL---GKVGRVDDAYKVYEKMLDSDCRTNSIV 488

Query: 611 YGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVH 670
           Y  L++     G   + H+I+ +M   +   D     + ++ +  AG+  +   + +++ 
Sbjct: 489 YTSLIKNFFNHGRKEDGHKIYKDMINQNCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIK 548

Query: 671 EKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVK 730
            +    D   ++ L+  L K    +  ++L+  MK+ G + D   YNI+I    + GKV 
Sbjct: 549 ARRFVPDARSYSILIHGLIKAGFANETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVN 608

Query: 731 EAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLI 790
           +A ++ EE++    +P +++Y S+I+ L K   +DEA+M F E + K +  +VV YS+LI
Sbjct: 609 KAYQLLEEMKTKGFEPTVVTYGSVIDGLAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLI 668

Query: 791 ECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLT 850
           + FGK  +++ A  + +++M +G  PN+ T+N LLD L +A    E +  +  +K+   T
Sbjct: 669 DGFGKVGRIDEAYLILEELMQKGLTPNLYTWNSLLDALVKAEEINEALVCFQSMKELKCT 728

Query: 851 PDSITYAIL 854
           P+ +TY IL
Sbjct: 729 PNQVTYGIL 733

BLAST of MC09g0822 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 212.2 bits (539), Expect = 2.3e-53
Identity = 124/412 (30.10%), Positives = 214/412 (51.94%), Query Frame = 0

Query: 429 NAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQLDRAYKVF 488
           + + Y  L+    + +  D A  +   MR + +  D   YN+++ SL    +LD A KV 
Sbjct: 157 DVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVL 216

Query: 489 KDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETLCKS 548
             +   +C P   TYTI+I  T   G  +E+L L  EML++G  P+  +YNT+I  +CK 
Sbjct: 217 NQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKE 276

Query: 549 RMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNKFM------NK 608
            MVD+A  +  N+    C P+  +Y+++L  L+ +   GK +E  ++  K        N 
Sbjct: 277 GMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQ---GKWEEGEKLMTKMFSEKCDPNV 336

Query: 609 SIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDK 668
             Y  L+ TL + G   EA  +   M +     D  +Y  ++ + C  G+   A+E L+ 
Sbjct: 337 VTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLET 396

Query: 669 VHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGK 728
           +   G   D + +NT+L+TL K  +     +++ K+ + G  P+  +YN + S+L   G 
Sbjct: 397 MISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGD 456

Query: 729 VKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYST 788
              A+ +  E+ ++   PD I+YNS+I+CL + G VDEA    ++M+    +P VVTY+ 
Sbjct: 457 KIRALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNI 516

Query: 789 LIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETVDL 835
           ++  F K  ++E A ++ + M+  GC PN  TY +L++ +  AG  AE ++L
Sbjct: 517 VLLGFCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMEL 565

BLAST of MC09g0822 vs. NCBI nr
Match: XP_022156087.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial, partial [Momordica charantia])

HSP 1 Score: 1180 bits (3053), Expect = 0.0
Identity = 589/589 (100.00%), Positives = 589/589 (100.00%), Query Frame = 0

Query: 293 RALQILLNRKPSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSI 352
           RALQILLNRKPSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSI
Sbjct: 1   RALQILLNRKPSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSI 60

Query: 353 CPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKE 412
           CPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKE
Sbjct: 61  CPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKE 120

Query: 413 DLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLL 472
           DLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLL
Sbjct: 121 DLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLL 180

Query: 473 DSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCT 532
           DSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCT
Sbjct: 181 DSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCT 240

Query: 533 PNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEV 592
           PNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEV
Sbjct: 241 PNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEV 300

Query: 593 LEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGK 652
           LEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGK
Sbjct: 301 LEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGK 360

Query: 653 TVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNI 712
           TVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNI
Sbjct: 361 TVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNI 420

Query: 713 LISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKG 772
           LISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKG
Sbjct: 421 LISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKG 480

Query: 773 LNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETV 832
           LNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETV
Sbjct: 481 LNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETV 540

Query: 833 DLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR 881
           DLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR
Sbjct: 541 DLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR 589

BLAST of MC09g0822 vs. NCBI nr
Match: XP_038898111.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Benincasa hispida])

HSP 1 Score: 1165 bits (3014), Expect = 0.0
Identity = 574/649 (88.44%), Positives = 615/649 (94.76%), Query Frame = 0

Query: 237 PSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYTLPRRDLICRALQ 296
           P+A  AT +RHFATKYTAKITSSSPTGRSVS EV PPA LPVDSRGY+LPRRDLICRA+ 
Sbjct: 17  PTATAAT-YRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYSLPRRDLICRAVD 76

Query: 297 ILLNRKPSA----IDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSI 356
           ILL+RKP +    IDDRFSDL+SYFQSLSVSLTPAEASEIL+SLNCPDLALQFFQLCPS+
Sbjct: 77  ILLHRKPHSSSITIDDRFSDLASYFQSLSVSLTPAEASEILKSLNCPDLALQFFQLCPSL 136

Query: 357 CPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKE 416
           CPKFRHD FTY+R LL+LS SSS KRFD VREILS+MDRDQIRGTISTVNILI IFGSKE
Sbjct: 137 CPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTISTVNILIKIFGSKE 196

Query: 417 DLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLL 476
           DLE+C GLIKKWDLR NAYTYRCLLQAH+RSHDSD+AFN+YMEMRG+GY+LDIFAYNMLL
Sbjct: 197 DLEVCTGLIKKWDLRFNAYTYRCLLQAHLRSHDSDRAFNVYMEMRGKGYQLDIFAYNMLL 256

Query: 477 DSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCT 536
           D+LAK+EQLDR+Y+VFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL LF+EML KGCT
Sbjct: 257 DALAKNEQLDRSYRVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLVLFEEMLTKGCT 316

Query: 537 PNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEV 596
           PN I+YNTMI+ LCKSRMVDKAILLFSNM+KNNCRPNEFTYS+ILNVLVAEGQLG+LDEV
Sbjct: 317 PNLIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVLVAEGQLGRLDEV 376

Query: 597 LEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGK 656
           L VSNKFMNKSIY YLVRTLSKLGHASEAHR+FCNMW FHDRGDRDAY+SMLESLCS GK
Sbjct: 377 LGVSNKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSTGK 436

Query: 657 TVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNI 716
           TVEA++LL KVHE+G+SSDTMM+NT+LSTLGKLKQVSHLHDLYEKMK+DGP PD+FTYNI
Sbjct: 437 TVEAIDLLSKVHERGISSDTMMYNTVLSTLGKLKQVSHLHDLYEKMKRDGPFPDIFTYNI 496

Query: 717 LISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKG 776
           LISSLGRVGKVKEAVEVFEELENS CKPDIISYNSLINCLGKNGDVDEAHMRFLEMQ+KG
Sbjct: 497 LISSLGRVGKVKEAVEVFEELENSDCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQDKG 556

Query: 777 LNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETV 836
           LNPDVVTYSTLIECFGKTDKVEMA SLFDKM+ QGCCPNI+TYNILLDCLERAGRTAETV
Sbjct: 557 LNPDVVTYSTLIECFGKTDKVEMAHSLFDKMITQGCCPNIVTYNILLDCLERAGRTAETV 616

Query: 837 DLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR 881
           DLYAKLKQQGLTPDSITYAILDRLQSGS RKFRVRRQNPITGWVVSPLR
Sbjct: 617 DLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPLR 664

BLAST of MC09g0822 vs. NCBI nr
Match: XP_023550137.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1154 bits (2984), Expect = 0.0
Identity = 575/661 (86.99%), Positives = 614/661 (92.89%), Query Frame = 0

Query: 225 LLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYT 284
           LLKP++ AA+          HRHFATKYTAKITSSSPTGRSVS EV PPAPLP+D RGY+
Sbjct: 11  LLKPSATAAS----------HRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPIDPRGYS 70

Query: 285 LPRRDLICRALQILLNRKP----SAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPD 344
           LPRRDLICRA+QILL+RKP    S +DDRFSDLSSYFQSLSVSLTPAEASEILRSLN PD
Sbjct: 71  LPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLTPAEASEILRSLN-PD 130

Query: 345 LALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTIST 404
           LALQFFQLCPS+CPKFRHDVFTY+R LLILS SSSPKRFD VREILS+M+RDQIRGTIST
Sbjct: 131 LALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIST 190

Query: 405 VNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRG 464
           VNILIGIFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSHDSD AFN+YMEMR RG
Sbjct: 191 VNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDSDGAFNVYMEMRNRG 250

Query: 465 YKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 524
           +KLDIFAYNMLLD+LAKDEQLDRAYKVFKDMKLK CNPD YTYTIMIRMTGK GRTEESL
Sbjct: 251 FKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKQCNPDVYTYTIMIRMTGKRGRTEESL 310

Query: 525 ALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVL 584
           A F+EML  GCTPN I YNTMIE L KSRMVDKAILLFSNM+KNNCRPNEFTYS++LNVL
Sbjct: 311 AFFEEMLKNGCTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVVLNVL 370

Query: 585 VAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAY 644
           VAEGQ G+LDEVLE+SNKFMNKSIY YLVRTLSKLGH +EAHR+FCNMW FHDRGDR+AY
Sbjct: 371 VAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHVNEAHRLFCNMWSFHDRGDREAY 430

Query: 645 VSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQ 704
           +SMLESLCSAGKTVEA++LL KVHEKG+SSDTMM+N +LSTLGKLKQVSHLHDLYEKMKQ
Sbjct: 431 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMKQ 490

Query: 705 DGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDE 764
           DGPLPDVFTYNILISS GRVGKV+EAV+VFEELENSSCKPDIISYNSLINCLGKNGDVDE
Sbjct: 491 DGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCLGKNGDVDE 550

Query: 765 AHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLD 824
           AHMRFLEMQEKGL PDVVTYSTLIECFGKTDKVEMA+SLFDKM+AQGCCPNI+TYNILLD
Sbjct: 551 AHMRFLEMQEKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILLD 610

Query: 825 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPL 881
           CLER GRTAE VDLYA+LKQ+GLTPDSITYA+LDRLQSGST+KFRVRRQNPITGWVVSPL
Sbjct: 611 CLERTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSPL 660

BLAST of MC09g0822 vs. NCBI nr
Match: KAG7025459.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1153 bits (2983), Expect = 0.0
Identity = 579/677 (85.52%), Positives = 625/677 (92.32%), Query Frame = 0

Query: 210 SSSTAAMKAHHCHRH-LLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSA 269
           +++TAAMK    + + LLKP++ AA+          +RHFATKYTAKITSSSPTGRSVS 
Sbjct: 27  ATNTAAMKVLRLYYYSLLKPSATAAS----------YRHFATKYTAKITSSSPTGRSVSV 86

Query: 270 EVRPPAPLPVDSRGYTLPRRDLICRALQILLNRKP----SAIDDRFSDLSSYFQSLSVSL 329
           EV PPAPLP+D RGY+LPRRDLICRA+QILL+RKP    S +DDRFSDLSSYFQSLSVSL
Sbjct: 87  EVTPPAPLPIDPRGYSLPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSL 146

Query: 330 TPAEASEILRSLNCPDLALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVRE 389
           TPAEASEILRSLN PDLALQFFQLCPS+CPKFRHDVFTY+R LLILS SSSPKRFD VRE
Sbjct: 147 TPAEASEILRSLN-PDLALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVRE 206

Query: 390 ILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSH 449
           ILS+M+RDQIRGTISTVNILIGIFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSH
Sbjct: 207 ILSQMERDQIRGTISTVNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSH 266

Query: 450 DSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYT 509
           DSD AFN+YMEMR RG+KLDIFAYNMLLD+LAKDEQLDRAYKVFKDMKLKHCNPD YTYT
Sbjct: 267 DSDGAFNVYMEMRNRGFKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKHCNPDVYTYT 326

Query: 510 IMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKN 569
           IMIRMTGK GRTEESLA F+EML  G TPN I YNTMIE L KSRMVDKAILLFSNM+KN
Sbjct: 327 IMIRMTGKRGRTEESLAFFEEMLKNGFTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKN 386

Query: 570 NCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRI 629
           NCRPNEFTYS+ILNVLVAEGQ G+LDEVLE+SNKF+NKSIY YLVRTLSKLGHA+EAHR+
Sbjct: 387 NCRPNEFTYSVILNVLVAEGQCGRLDEVLEMSNKFLNKSIYAYLVRTLSKLGHANEAHRL 446

Query: 630 FCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGK 689
           FCNMW FHDRGDRDAY+SMLESLCSAGKTVEA++LL KVHEKG+SSDTMM+N +LSTLGK
Sbjct: 447 FCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGK 506

Query: 690 LKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIIS 749
           LKQVSHLHDLYEKMKQDGPLPDVFTYNILISS GRVGKV+EAV+VFEELENSSCKPDIIS
Sbjct: 507 LKQVSHLHDLYEKMKQDGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIIS 566

Query: 750 YNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMM 809
           YNSLINCLGKNGDVDEAHMRFLEM+EKGL PDVVTYSTLIECFGKTDKVEMA+SLFDKM+
Sbjct: 567 YNSLINCLGKNGDVDEAHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMI 626

Query: 810 AQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKF 869
           AQGCCPNI+TYNILLDCLE+ GRTAE VDLYA+LKQ+GLTPDSITYA+LDRLQSGST+KF
Sbjct: 627 AQGCCPNIVTYNILLDCLEKTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKF 686

Query: 870 RVRRQNPITGWVVSPLR 881
           RVRRQNPITGWVVSPLR
Sbjct: 687 RVRRQNPITGWVVSPLR 692

BLAST of MC09g0822 vs. NCBI nr
Match: XP_022960041.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita moschata])

HSP 1 Score: 1151 bits (2977), Expect = 0.0
Identity = 575/661 (86.99%), Positives = 614/661 (92.89%), Query Frame = 0

Query: 225 LLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYT 284
           LLKP++ AA+          HRHFATKYTAKITSSSPTGRSV  EV PPAPLP+D RGY+
Sbjct: 11  LLKPSATAAS----------HRHFATKYTAKITSSSPTGRSVYVEVTPPAPLPIDPRGYS 70

Query: 285 LPRRDLICRALQILLNRKP----SAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPD 344
           LPRRDLICRA+QILL+RKP    S +DDRFSDLSSYFQSLSVSLTPAEASEILR+LN PD
Sbjct: 71  LPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLTPAEASEILRALN-PD 130

Query: 345 LALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTIST 404
           LALQFFQLCPS+CPKFRHDVFTY+R LLILS SSSPKRFD VREILS+M+RDQIRGTIST
Sbjct: 131 LALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIST 190

Query: 405 VNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRG 464
           VNILIGIFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSHDSD AFN+YMEMR RG
Sbjct: 191 VNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDSDGAFNVYMEMRNRG 250

Query: 465 YKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 524
           +KLDIFAYNMLLD+LAKDEQLDRAYKVFKDMKLKHCNPD YTYTIMIRMTGK GRTEESL
Sbjct: 251 FKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKHCNPDVYTYTIMIRMTGKRGRTEESL 310

Query: 525 ALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVL 584
           A F+EML  G TPN I YNTMIE L KSRMVDKAILLFSNM+KNNCRPNEFTYS+ILNVL
Sbjct: 311 AFFEEMLKNGFTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 370

Query: 585 VAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAY 644
           VAEGQ G+LDEVLE+SNKFMNKSIY YLVRTLSKLGHA+EAHR+FCNMW FHDRGDRDAY
Sbjct: 371 VAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDAY 430

Query: 645 VSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQ 704
           +SMLESLCSAGKTVEA++LL KVHEKG+SSDTMM+N +LSTLGKLKQVSHLHDLYEKMKQ
Sbjct: 431 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMKQ 490

Query: 705 DGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDE 764
           DGPLPDVFTYNILISS GRVGKV+EAV+VFEELENSSCKPDIISYNSLINCLGKNGDVDE
Sbjct: 491 DGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCLGKNGDVDE 550

Query: 765 AHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLD 824
           AHMRFLEM+EKGL PDVVTYSTLIECFGKTDKVEMA+SLFDKM+AQGCCPNI+TYNILLD
Sbjct: 551 AHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILLD 610

Query: 825 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPL 881
           CLER GRTAE VDLYA+LKQ+GLTPDSITYA+LDRLQSGST+KFRVRRQNPITGWVVSPL
Sbjct: 611 CLERTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSPL 660

BLAST of MC09g0822 vs. ExPASy TrEMBL
Match: A0A6J1DR37 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111023058 PE=3 SV=1)

HSP 1 Score: 1180 bits (3053), Expect = 0.0
Identity = 589/589 (100.00%), Positives = 589/589 (100.00%), Query Frame = 0

Query: 293 RALQILLNRKPSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSI 352
           RALQILLNRKPSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSI
Sbjct: 1   RALQILLNRKPSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSI 60

Query: 353 CPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKE 412
           CPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKE
Sbjct: 61  CPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKE 120

Query: 413 DLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLL 472
           DLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLL
Sbjct: 121 DLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLL 180

Query: 473 DSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCT 532
           DSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCT
Sbjct: 181 DSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCT 240

Query: 533 PNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEV 592
           PNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEV
Sbjct: 241 PNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEV 300

Query: 593 LEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGK 652
           LEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGK
Sbjct: 301 LEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGK 360

Query: 653 TVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNI 712
           TVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNI
Sbjct: 361 TVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNI 420

Query: 713 LISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKG 772
           LISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKG
Sbjct: 421 LISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKG 480

Query: 773 LNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETV 832
           LNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETV
Sbjct: 481 LNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETV 540

Query: 833 DLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR 881
           DLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR
Sbjct: 541 DLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPLR 589

BLAST of MC09g0822 vs. ExPASy TrEMBL
Match: A0A6J1H6J4 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111460908 PE=3 SV=1)

HSP 1 Score: 1151 bits (2977), Expect = 0.0
Identity = 575/661 (86.99%), Positives = 614/661 (92.89%), Query Frame = 0

Query: 225 LLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYT 284
           LLKP++ AA+          HRHFATKYTAKITSSSPTGRSV  EV PPAPLP+D RGY+
Sbjct: 11  LLKPSATAAS----------HRHFATKYTAKITSSSPTGRSVYVEVTPPAPLPIDPRGYS 70

Query: 285 LPRRDLICRALQILLNRKP----SAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPD 344
           LPRRDLICRA+QILL+RKP    S +DDRFSDLSSYFQSLSVSLTPAEASEILR+LN PD
Sbjct: 71  LPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLTPAEASEILRALN-PD 130

Query: 345 LALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTIST 404
           LALQFFQLCPS+CPKFRHDVFTY+R LLILS SSSPKRFD VREILS+M+RDQIRGTIST
Sbjct: 131 LALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIST 190

Query: 405 VNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRG 464
           VNILIGIFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSHDSD AFN+YMEMR RG
Sbjct: 191 VNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDSDGAFNVYMEMRNRG 250

Query: 465 YKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 524
           +KLDIFAYNMLLD+LAKDEQLDRAYKVFKDMKLKHCNPD YTYTIMIRMTGK GRTEESL
Sbjct: 251 FKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKHCNPDVYTYTIMIRMTGKRGRTEESL 310

Query: 525 ALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVL 584
           A F+EML  G TPN I YNTMIE L KSRMVDKAILLFSNM+KNNCRPNEFTYS+ILNVL
Sbjct: 311 AFFEEMLKNGFTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 370

Query: 585 VAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAY 644
           VAEGQ G+LDEVLE+SNKFMNKSIY YLVRTLSKLGHA+EAHR+FCNMW FHDRGDRDAY
Sbjct: 371 VAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDAY 430

Query: 645 VSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQ 704
           +SMLESLCSAGKTVEA++LL KVHEKG+SSDTMM+N +LSTLGKLKQVSHLHDLYEKMKQ
Sbjct: 431 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMKQ 490

Query: 705 DGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDE 764
           DGPLPDVFTYNILISS GRVGKV+EAV+VFEELENSSCKPDIISYNSLINCLGKNGDVDE
Sbjct: 491 DGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCLGKNGDVDE 550

Query: 765 AHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLD 824
           AHMRFLEM+EKGL PDVVTYSTLIECFGKTDKVEMA+SLFDKM+AQGCCPNI+TYNILLD
Sbjct: 551 AHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILLD 610

Query: 825 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPL 881
           CLER GRTAE VDLYA+LKQ+GLTPDSITYA+LDRLQSGST+KFRVRRQNPITGWVVSPL
Sbjct: 611 CLERTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSPL 660

BLAST of MC09g0822 vs. ExPASy TrEMBL
Match: A0A6J1KR93 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111497945 PE=3 SV=1)

HSP 1 Score: 1148 bits (2969), Expect = 0.0
Identity = 572/661 (86.54%), Positives = 613/661 (92.74%), Query Frame = 0

Query: 225 LLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYT 284
           LLKP++ AA+          HRHFATKYTAKITSSSPTGRSVS EV  PAPLP+D RGY+
Sbjct: 11  LLKPSATAAS----------HRHFATKYTAKITSSSPTGRSVSVEVTSPAPLPIDPRGYS 70

Query: 285 LPRRDLICRALQILLNRK----PSAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPD 344
           LPRRDLICRA+QILL+RK     S +DDRF+DLSSYFQSLS+SLTPAEASEILRSLN PD
Sbjct: 71  LPRRDLICRAIQILLDRKRHSSSSTVDDRFTDLSSYFQSLSISLTPAEASEILRSLN-PD 130

Query: 345 LALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTIST 404
           LALQFFQLCPS+CPKFRHDVFTY+R LLILS SSSPKRFD VREILS+M+RDQIRGTIST
Sbjct: 131 LALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIST 190

Query: 405 VNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRG 464
           VNILIGIFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSH SD AFN+YMEMR RG
Sbjct: 191 VNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHHSDGAFNVYMEMRNRG 250

Query: 465 YKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 524
           +KLDIFAYNMLLD+LAKDEQLDRAYK+FKDMKLKHCNPD YTYT+MIRMTGK GRTEESL
Sbjct: 251 FKLDIFAYNMLLDALAKDEQLDRAYKIFKDMKLKHCNPDVYTYTVMIRMTGKRGRTEESL 310

Query: 525 ALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVL 584
           A F+EML  GCTPN I YNTMIE L KSRMVDKAILLFSNM+KNNCRPNEFTYS+ILNVL
Sbjct: 311 AFFEEMLKNGCTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 370

Query: 585 VAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAY 644
           VAEGQ G+LDEVLE+SNKFMNKSIY YLVRTLSKLGHA+EAHR+FCNMW FHDRGDRDAY
Sbjct: 371 VAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDAY 430

Query: 645 VSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQ 704
           +SMLESLCSAGKTVEA++LL KVHEKG+SSDTMM+N +LSTLGKLKQVSHLHDLYEKMKQ
Sbjct: 431 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMKQ 490

Query: 705 DGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNGDVDE 764
           DGPLPDVFTYNILISSLGRVGKV+EAV+VFEELENSSCKPDIISYNSLINC GKNGDVDE
Sbjct: 491 DGPLPDVFTYNILISSLGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCHGKNGDVDE 550

Query: 765 AHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLD 824
           AHMRFLEM+EKGL PDVVTYSTLIECFGKTDKVEMA+SLFDKM+AQGCCPNI+TYNILLD
Sbjct: 551 AHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILLD 610

Query: 825 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPL 881
           CLER GRTAE VDLYA+LKQQGLTPDSITYA+LDRLQSGST+KFRVRRQNPITGWVVSPL
Sbjct: 611 CLERTGRTAEAVDLYAELKQQGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSPL 660

BLAST of MC09g0822 vs. ExPASy TrEMBL
Match: A0A1S3BGX8 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103489714 PE=3 SV=1)

HSP 1 Score: 1130 bits (2923), Expect = 0.0
Identity = 562/665 (84.51%), Positives = 608/665 (91.43%), Query Frame = 0

Query: 221 CHRHLLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDS 280
           C+ HL KP + AAA          HRHFATKYTAKITSSSPTGRSV+  V PPA L VDS
Sbjct: 8   CYSHL-KPTATAAA----------HRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDS 67

Query: 281 RGYTLPRRDLICRALQILLNRKPSA----IDDRFSDLSSYFQSLSVSLTPAEASEILRSL 340
           RGY+LPRRDLICR + ILL+R P +    IDDRFSDLSSYFQSLSVSLTPAEASEIL+SL
Sbjct: 68  RGYSLPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSL 127

Query: 341 NCPDLALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRG 400
           NCPDLALQFF  CPS+C KFRHDVFTY+R LL+LS SSS KRFD VREILS+MDRDQIRG
Sbjct: 128 NCPDLALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRG 187

Query: 401 TISTVNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEM 460
           TISTVNILI IF S EDLELC GLIKKWDLR NAYTYRCLLQAHVRS DSD+AF++YMEM
Sbjct: 188 TISTVNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEM 247

Query: 461 RGRGYKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRT 520
             +GY+LDIFAYNMLLD+LAKDE+LDR+YKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRT
Sbjct: 248 WSKGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRT 307

Query: 521 EESLALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLI 580
           EESLALF+EML KGCTPN I+YNTMI+ LCKSRMVDKAILLFSNM+KNNCRPNEFTYS+I
Sbjct: 308 EESLALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVI 367

Query: 581 LNVLVAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGD 640
           LNVLVAEGQLG+LDEVLEVSNKF+NKSIY YLVRTLSKLGH+SEAHR+FCNMW FHD GD
Sbjct: 368 LNVLVAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGD 427

Query: 641 RDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYE 700
           RDAY+SMLESLC  GKTVEA+ELL KVHEKG+S++TMM+NT+LSTLGKLKQVSHLHDLYE
Sbjct: 428 RDAYISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYE 487

Query: 701 KMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNG 760
           KMK+DGP PD+FTYNILISSLGRVGKVKEAVEVFEELE+S CKPDIISYNSLINCLGKNG
Sbjct: 488 KMKRDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNG 547

Query: 761 DVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYN 820
           DVDEAHMRFLEMQ+KGLNPDVVTYSTLIECFGKTDKVEMA+SLFD+M+ Q CCPNI+TYN
Sbjct: 548 DVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYN 607

Query: 821 ILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWV 880
           ILLDCLERAGRTAE VDLYAKL++QGLTPDSITYAILDRLQSGS RKFRVRRQNPITGWV
Sbjct: 608 ILLDCLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV 661

BLAST of MC09g0822 vs. ExPASy TrEMBL
Match: A0A5A7TJ34 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold306G002280 PE=3 SV=1)

HSP 1 Score: 1127 bits (2916), Expect = 0.0
Identity = 561/665 (84.36%), Positives = 607/665 (91.28%), Query Frame = 0

Query: 221 CHRHLLKPNSAAAAAFPSAIPATAHRHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDS 280
           C+ HL KP + AAA          HRHFATKYTAKITSSSPTGRSV+  V PPA L VDS
Sbjct: 8   CYSHL-KPTATAAA----------HRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDS 67

Query: 281 RGYTLPRRDLICRALQILLNRKPSA----IDDRFSDLSSYFQSLSVSLTPAEASEILRSL 340
           RGY+LPRRDLICR + ILL+R P +    IDDRFSDLSSYFQSLSVSLTPAEASEIL+SL
Sbjct: 68  RGYSLPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSL 127

Query: 341 NCPDLALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRG 400
           NCPDLALQFF  CPS+C KFRHDVFTY+R LL+LS SSS KRFD VREILS+MDRDQIRG
Sbjct: 128 NCPDLALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRG 187

Query: 401 TISTVNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEM 460
           TISTVNILI IF S EDLELC GLIKKWDLR NAYTYRCLLQAHVRS DSD+AF++YMEM
Sbjct: 188 TISTVNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEM 247

Query: 461 RGRGYKLDIFAYNMLLDSLAKDEQLDRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRT 520
             +GY+LDIFAYNMLLD+LAKDE+LDR+YKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRT
Sbjct: 248 WSKGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRT 307

Query: 521 EESLALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILLFSNMVKNNCRPNEFTYSLI 580
           EESLALF+EML KGCTPN I+YNTMI+ LCKSRMVDKAILLFSNM+KNNCRPNEFTYS+I
Sbjct: 308 EESLALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVI 367

Query: 581 LNVLVAEGQLGKLDEVLEVSNKFMNKSIYGYLVRTLSKLGHASEAHRIFCNMWKFHDRGD 640
           LNVLVAEG LG+LDEVLEVSNKF+NKSIY YLVRTLSKLGH+SEAHR+FCNMW FHD GD
Sbjct: 368 LNVLVAEGLLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGD 427

Query: 641 RDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLGKLKQVSHLHDLYE 700
           RDAY+SMLESLC  GKTVEA+ELL KVHEKG+S++TMM+NT+LSTLGKLKQVSHLHDLYE
Sbjct: 428 RDAYISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYE 487

Query: 701 KMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDIISYNSLINCLGKNG 760
           KMK+DGP PD+FTYNILISSLGRVGKVKEAVEVFEELE+S CKPDIISYNSLINCLGKNG
Sbjct: 488 KMKRDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNG 547

Query: 761 DVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKMMAQGCCPNIITYN 820
           DVDEAHMRFLEMQ+KGLNPDVVTYSTLIECFGKTDKVEMA+SLFD+M+ Q CCPNI+TYN
Sbjct: 548 DVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYN 607

Query: 821 ILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSTRKFRVRRQNPITGWV 880
           ILLDCLERAGRTAE VDLYAKL++QGLTPDSITYAILDRLQSGS RKFRVRRQNPITGWV
Sbjct: 608 ILLDCLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV 661

BLAST of MC09g0822 vs. TAIR 10
Match: AT1G51965.1 (ABA Overly-Sensitive 5)

HSP 1 Score: 889.0 bits (2296), Expect = 3.0e-258
Identity = 425/635 (66.93%), Positives = 527/635 (82.99%), Query Frame = 0

Query: 246 RHFATKYTAKITSSSPTGRSVSAEVRPPAPLPVDSRGYTLPRRDLICRALQILLNRKPSA 305
           RH+ATKY AK+TSSSP+GRS+SAEV  P PLP D RGY LPRR LICRA  ++     S 
Sbjct: 21  RHYATKYVAKVTSSSPSGRSLSAEVSLPNPLPADVRGYPLPRRHLICRATNLITG--ASN 80

Query: 306 IDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSICPKFRHDVFTYTR 365
           + D FSDLS Y  SLS+SLTP EASEIL+SLN P LA++FF+L PS+CP  ++D F Y R
Sbjct: 81  LSDAFSDLSDYLSSLSLSLTPDEASEILKSLNSPLLAVEFFKLVPSLCPYSQNDPFLYNR 140

Query: 366 FLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLIKKWD 425
            +LILS+S+ P RFD VR IL  M +  + G ISTVNILIG FG+ EDL++C  L+KKWD
Sbjct: 141 IILILSRSNLPDRFDRVRSILDSMVKSNVHGNISTVNILIGFFGNTEDLQMCLRLVKKWD 200

Query: 426 LRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQLDRAY 485
           L++N++TY+CLLQA++RS D  KAF++Y E+R  G+KLDIFAYNMLLD+LAKDE   +A 
Sbjct: 201 LKMNSFTYKCLLQAYLRSRDYSKAFDVYCEIRRGGHKLDIFAYNMLLDALAKDE---KAC 260

Query: 486 KVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETL 545
           +VF+DMK +HC  DEYTYTIMIR  G++G+ +E++ LF EM+ +G T N + YNT+++ L
Sbjct: 261 QVFEDMKKRHCRRDEYTYTIMIRTMGRIGKCDEAVGLFNEMITEGLTLNVVGYNTLMQVL 320

Query: 546 CKSRMVDKAILLFSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNKFMNKSIY 605
            K +MVDKAI +FS MV+  CRPNE+TYSL+LN+LVAEGQL +LD V+E+S ++M + IY
Sbjct: 321 AKGKMVDKAIQVFSRMVETGCRPNEYTYSLLLNLLVAEGQLVRLDGVVEISKRYMTQGIY 380

Query: 606 GYLVRTLSKLGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHE 665
            YLVRTLSKLGH SEAHR+FC+MW F  +G+RD+Y+SMLESLC AGKT+EA+E+L K+HE
Sbjct: 381 SYLVRTLSKLGHVSEAHRLFCDMWSFPVKGERDSYMSMLESLCGAGKTIEAIEMLSKIHE 440

Query: 666 KGVSSDTMMHNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKE 725
           KGV +DTMM+NT+ S LGKLKQ+SH+HDL+EKMK+DGP PD+FTYNILI+S GRVG+V E
Sbjct: 441 KGVVTDTMMYNTVFSALGKLKQISHIHDLFEKMKKDGPSPDIFTYNILIASFGRVGEVDE 500

Query: 726 AVEVFEELENSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIE 785
           A+ +FEELE S CKPDIISYNSLINCLGKNGDVDEAH+RF EMQEKGLNPDVVTYSTL+E
Sbjct: 501 AINIFEELERSDCKPDIISYNSLINCLGKNGDVDEAHVRFKEMQEKGLNPDVVTYSTLME 560

Query: 786 CFGKTDKVEMAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTP 845
           CFGKT++VEMA SLF++M+ +GC PNI+TYNILLDCLE+ GRTAE VDLY+K+KQQGLTP
Sbjct: 561 CFGKTERVEMAYSLFEEMLVKGCQPNIVTYNILLDCLEKNGRTAEAVDLYSKMKQQGLTP 620

Query: 846 DSITYAILDRLQSGSTRKFRVRRQNPITGWVVSPL 881
           DSITY +L+RLQS S  K R+RR+NPITGWVVSPL
Sbjct: 621 DSITYTVLERLQSVSHGKSRIRRKNPITGWVVSPL 650

BLAST of MC09g0822 vs. TAIR 10
Match: AT3G07440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48530.1); Has 37 Blast hits to 37 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 35; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 278.1 bits (710), Expect = 2.5e-74
Identity = 140/187 (74.87%), Positives = 159/187 (85.03%), Query Frame = 0

Query: 22  LRSLRTDAGPPRRRSKPAAFAAKNADEKSEWWAVDGEMHEIGENVPPRERFVIPRENIPN 81
           +R  RT+AG PRRR+K  +   K  +EKSEWW VDGEMHEIG++VPPRERF IPR+NIPN
Sbjct: 23  IRLARTEAGQPRRRNKLPSLPLKKKEEKSEWWIVDGEMHEIGDHVPPRERFTIPRDNIPN 82

Query: 82  RRRKQLREQFMRRTRLVLKESEHEPWCKKYMELYQELRENWERLYWDEGYSKKLAQEHAN 141
           +RRKQLR+QFMRRTRLVLKESEHEPWCKKYMELY ELRENWERLYWDEGYSKKLA +HAN
Sbjct: 83  KRRKQLRDQFMRRTRLVLKESEHEPWCKKYMELYNELRENWERLYWDEGYSKKLASDHAN 142

Query: 142 YESAE--DEDFSPYRNRQSSVDQSKDQGFRRNTQGGNREKVSQIRDKFEYDRDRRMRERG 201
           YESAE  DEDF+PYRNR+S  DQ+K+QGF R TQG N EKVSQIRDKFEYDR+RRMR++ 
Sbjct: 143 YESAEEDDEDFNPYRNRRSFSDQTKEQGFNRTTQGDNWEKVSQIRDKFEYDRERRMRDKA 202

Query: 202 VISLELA 207
              +  A
Sbjct: 203 FAPMNAA 209

BLAST of MC09g0822 vs. TAIR 10
Match: AT5G48530.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G07440.1); Has 32 Blast hits to 32 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 32; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 251.9 bits (642), Expect = 1.9e-66
Identity = 138/195 (70.77%), Positives = 158/195 (81.03%), Query Frame = 0

Query: 1   MLSFTHRLRTLSSSPCSKSHLLRSLRTDAGPPRRRSK-PAAFAAKNADEKSEWWAVDGEM 60
           MLS   RL + + S  + + LLR +RT+A  PRRR+K P+    K  +EKSEWW VDGEM
Sbjct: 1   MLSVARRLGSATPSLQNGASLLRFMRTEASQPRRRNKFPSLSPLKKKEEKSEWWIVDGEM 60

Query: 61  HEIGENVPPRERFVIPRENIPNRRRKQLREQFMRRTRLVLKESEHEPWCKKYMELYQELR 120
           HEIG++VP RERF IPR+NIPN+RRKQLREQFMRRTRLVLKESEHEPWCKKYMELY E+R
Sbjct: 61  HEIGDHVPLRERFTIPRDNIPNKRRKQLREQFMRRTRLVLKESEHEPWCKKYMELYNEVR 120

Query: 121 ENWERLYWDEGYSKKLAQEHANYESAE--DEDFSPYRNRQSSVDQSK-DQGFRRNTQG-G 180
           ENWERLYWDEGYSKK+A++HANYESAE  DEDF+PYRNR+   D  K +QGF R TQG  
Sbjct: 121 ENWERLYWDEGYSKKIARDHANYESAEEDDEDFNPYRNRRPYNDSIKQEQGFNRTTQGDD 180

Query: 181 NREKVSQIRDKFEYD 191
           N EKVSQIRDKFEYD
Sbjct: 181 NWEKVSQIRDKFEYD 195

BLAST of MC09g0822 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 234.6 bits (597), Expect = 3.1e-61
Identity = 150/539 (27.83%), Positives = 272/539 (50.46%), Query Frame = 0

Query: 320 LSVSLTPAEASE-ILRSLNCPDLALQFFQLCPSICPKFRHDVFTYTRFLLILSQSSSPKR 379
           LS + TP  AS  +L+S N   L L+F            H  FT     + L   +  K 
Sbjct: 42  LSANFTPEAASNLLLKSQNDQALILKFLNWANP------HQFFTLRCKCITLHILTKFKL 101

Query: 380 FDHVREILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLIKKWDLRLNAYTYRCLLQ 439
           +    +IL+    D    T+      +     +E  +LCY     +DL + +Y+   L+ 
Sbjct: 102 Y-KTAQILA---EDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLI- 161

Query: 440 AHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQ-LDRAYKVFKDMKLKHCN 499
                   DKA ++    +  G+   + +YN +LD+  + ++ +  A  VFK+M     +
Sbjct: 162 --------DKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVS 221

Query: 500 PDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTMIETLCKSRMVDKAILL 559
           P+ +TY I+IR     G  + +L LF +M  KGC PN ++YNT+I+  CK R +D    L
Sbjct: 222 PNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKL 281

Query: 560 FSNMVKNNCRPNEFTYSLILNVLVAEGQLGKLDEVLEVSNK---FMNKSIYGYLVRTLSK 619
             +M      PN  +Y++++N L  EG++ ++  VL   N+    +++  Y  L++   K
Sbjct: 282 LRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCK 341

Query: 620 LGHASEAHRIFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMM 679
            G+  +A  +   M +         Y S++ S+C AG    AME LD++  +G+  +   
Sbjct: 342 EGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERT 401

Query: 680 HNTLLSTLGKLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELE 739
           + TL+    +   ++  + +  +M  +G  P V TYN LI+     GK+++A+ V E+++
Sbjct: 402 YTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMK 461

Query: 740 NSSCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVE 799
                PD++SY+++++   ++ DVDEA     EM EKG+ PD +TYS+LI+ F +  + +
Sbjct: 462 EKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTK 521

Query: 800 MAQSLFDKMMAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAIL 854
            A  L+++M+  G  P+  TY  L++     G   + + L+ ++ ++G+ PD +TY++L
Sbjct: 522 EACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVL 561

BLAST of MC09g0822 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3)

HSP 1 Score: 230.3 bits (586), Expect = 5.9e-60
Identity = 150/587 (25.55%), Positives = 265/587 (45.14%), Query Frame = 0

Query: 304 SAIDDRFSDLSSYFQSLSVSLTPAEASEILRSLNCPDLALQFFQLCPSICPKFR--HDVF 363
           S +  + SD S      S     + + E+ R L         F    S+       H   
Sbjct: 60  SVVSMKSSDFSGSMIRKSSKPDLSSSEEVTRGLKSFPDTDSSFSYFKSVAGNLNLVHTTE 119

Query: 364 TYTRFLLILSQSSSPKRFDHVREILSRMDRDQIRGTISTVNILIGIFGSKEDLELCYGLI 423
           T    L  L      +   +V +++ +    +   T  T+   + + G  +        +
Sbjct: 120 TCNYMLEALRVDGKLEEMAYVFDLMQKRIIKRDTNTYLTIFKSLSVKGGLKQAPYALRKM 179

Query: 424 KKWDLRLNAYTYRCLLQAHVRSHDSDKAFNLYMEMRGRGYKLDIFAYNMLLDSLAKDEQL 483
           +++   LNAY+Y  L+   ++S    +A  +Y  M   G++  +  Y+ L+  L K   +
Sbjct: 180 REFGFVLNAYSYNGLIHLLLKSRFCTEAMEVYRRMILEGFRPSLQTYSSLMVGLGKRRDI 239

Query: 484 DRAYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESLALFQEMLAKGCTPNAISYNTM 543
           D    + K+M+     P+ YT+TI IR+ G+ G+  E+  + + M  +GC P+ ++Y  +
Sbjct: 240 DSVMGLLKEMETLGLKPNVYTFTICIRVLGRAGKINEAYEILKRMDDEGCGPDVVTYTVL 299

Query: 544 IETLCKSRMVDKAILLF-----------------------------------SNMVKNNC 603
           I+ LC +R +D A  +F                                   S M K+  
Sbjct: 300 IDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGH 359

Query: 604 RPNEFTYSLILNVLVAEGQLGKLDEVLEVSNK---FMNKSIYGYLVRTLSKLGHASEAHR 663
            P+  T++++++ L   G  G+  + L+V        N   Y  L+  L ++    +A  
Sbjct: 360 VPDVVTFTILVDALCKAGNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALE 419

Query: 664 IFCNMWKFHDRGDRDAYVSMLESLCSAGKTVEAMELLDKVHEKGVSSDTMMHNTLLSTLG 723
           +F NM     +     Y+  ++    +G +V A+E  +K+  KG++ + +  N  L +L 
Sbjct: 420 LFGNMESLGVKPTAYTYIVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLA 479

Query: 724 KLKQVSHLHDLYEKMKQDGPLPDVFTYNILISSLGRVGKVKEAVEVFEELENSSCKPDII 783
           K  +      ++  +K  G +PD  TYN+++    +VG++ EA+++  E+  + C+PD+I
Sbjct: 480 KAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVI 539

Query: 784 SYNSLINCLGKNGDVDEAHMRFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMAQSLFDKM 843
             NSLIN L K   VDEA   F+ M+E  L P VVTY+TL+   GK  K++ A  LF+ M
Sbjct: 540 VVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGM 599

Query: 844 MAQGCCPNIITYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITY 851
           + +GC PN IT+N L DCL +       + +  K+   G  PD  TY
Sbjct: 600 VQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKMMDMGCVPDVFTY 646
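
The percentages in each HSP header above are the identical and positive counts divided by the alignment length. The following is a minimal Python sketch, not part of the original record, for pulling those numbers out of a statistics line and recomputing the percentages. The function names `parse_hsp_stats` and `recompute_pct` are hypothetical, and the pattern assumes the line layout used on this page (it accepts both the "Positives" and "Postives" spellings).

```python
import re

# Pattern for the HSP statistics lines as they appear in this record.
# "Posi?tives" also matches the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recompute_pct(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Example using the AT1G51965.1 hit reported above.
stats = parse_hsp_stats(
    "Identity = 425/635 (66.93%), Positives = 527/635 (82.99%), Query Frame = 0"
)
print(stats)
print(recompute_pct(stats["identical"], stats["aligned_length"]))
```

Comparing the recomputed value with the reported one is a quick consistency check when copying hits into another table.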

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9ZU27 | 4.3e-257 | 66.93 | Pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Arabidop...
Q9FIX3 | 4.4e-60 | 27.83 | Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9SZ52 | 8.3e-59 | 25.55 | Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop...
Q9M907 | 1.7e-56 | 28.90 | Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX...
Q9SR00 | 2.3e-53 | 30.10 | Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
XP_022156087.1 | 0.0 | 100.00 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial, partial [M...
XP_038898111.1 | 0.0 | 88.44 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Benincasa ...
XP_023550137.1 | 0.0 | 86.99 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita ...
KAG7025459.1 | 0.0 | 85.52 | Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_022960041.1 | 0.0 | 86.99 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita ...

Match Name | E-value | Identity | Description
A0A6J1DR37 | 0.0 | 100.00 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Momordic...
A0A6J1H6J4 | 0.0 | 86.99 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbit...
A0A6J1KR93 | 0.0 | 86.54 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbit...
A0A1S3BGX8 | 0.0 | 84.51 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucumis ...
A0A5A7TJ34 | 0.0 | 84.36 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity | Description
AT1G51965.1 | 3.0e-258 | 66.93 | ABA Overly-Sensitive 5
AT3G07440.1 | 2.5e-74 | 74.87 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G48530.1 | 1.9e-66 | 70.77 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G39710.1 | 3.1e-61 | 27.83 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G31850.1 | 5.9e-60 | 25.55 | proton gradient regulation 3
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 301..421; e-value: 1.6E-7; score: 32.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 478..587; e-value: 3.4E-34; score: 119.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 422..477; e-value: 6.9E-11; score: 43.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 765..862; e-value: 4.9E-26; score: 93.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 594..764; e-value: 3.3E-35; score: 123.9
IPR033443 | Pentacotripeptide-repeat region of PRORP | PFAM | PF17177 | PPR_long | coord: 433..594; e-value: 4.1E-19; score: 68.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 702..733; e-value: 5.7E-11; score: 42.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 536..570; e-value: 7.9E-10; score: 36.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 708..742; e-value: 9.9E-10; score: 36.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 743..777; e-value: 2.2E-10; score: 38.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 467..500; e-value: 4.5E-7; score: 27.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 674..707; e-value: 4.7E-4; score: 18.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 778..812; e-value: 8.8E-9; score: 33.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 813..847; e-value: 7.3E-6; score: 23.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 639..671; e-value: 1.2E-4; score: 20.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 431..464; e-value: 5.0E-5; score: 21.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 501..535; e-value: 7.8E-9; score: 33.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 639..668; e-value: 6.8E-4; score: 19.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 810..853; e-value: 3.8E-9; score: 36.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 740..787; e-value: 3.8E-15; score: 55.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 429..463; score: 10.413293
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 499..533; score: 12.75901
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 671..705; score: 9.218511
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 811..845; score: 11.355965
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 534..568; score: 12.627475
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 636..670; score: 10.248873
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 464..498; score: 11.454616
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 706..740; score: 13.361882
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 776..810; score: 12.474017
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 741..775; score: 12.978237
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 22..43
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 142..181
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 155..178
None | No IPR available | PANTHER | PTHR46128:SF166 | BNAA05G15090D PROTEIN | coord: 238..875
None | No IPR available | PANTHER | PTHR46128 | MITOCHONDRIAL GROUP I INTRON SPLICING FACTOR CCM1 | coord: 238..875
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 512..805
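
The alignment coordinates above are inclusive residue ranges on the predicted protein. As a rough illustration (not part of the original annotation), the Python sketch below extracts `coord: start..end` pairs from a line of this table, computes span lengths, and checks whether two ranges overlap. The helper names `parse_coords`, `span_length`, and `overlaps` are hypothetical. Applied to the PROSITE PS51375 rows, each span comes out at 35 residues, which matches the usual length of a PPR motif.

```python
import re

# Matches entries such as "coord: 429..463" anywhere in a table line.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in the text, as integers."""
    return [(int(a), int(b)) for a, b in COORD.findall(text)]


def span_length(start, end):
    """Length of an inclusive residue range."""
    return end - start + 1


def overlaps(a, b):
    """True when two inclusive (start, end) ranges share a residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Example: two adjacent PROSITE PPR hits from the table above.
hits = parse_coords("coord: 429..463 coord: 464..498")
print([span_length(s, e) for s, e in hits])
print(overlaps(hits[0], hits[1]))
```

The same overlap check can be used to see that the two longer MobiDB-lite disorder predictions (142..181 and 155..178) describe nested regions rather than separate ones.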

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC09g0822.1 | MC09g0822.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding