Clc09G10840 (gene) Watermelon (cordophanus) v2

Overview
NameClc09G10840
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
LocationClcChr09: 9602933 .. 9620670 (-)
RNA-Seq ExpressionClc09G10840
SyntenyClc09G10840
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCAGCAACTACCGAACAGAGGAAAAGGTGAAGAAGCCTTTAGTTTGTTCTAACAGAACGAACCCAATTTGTACCACAACTGTATTCAGCTGCAGTGCAAGCATATTGCACTCATGGCTTTGTTCACACTGCCATTTCATTCGGAATCTTGTCTTCAATAAGCTTTCGCCGCCGTCTCATCGGGGAAGTCAATGCACAGGCAAGTAAAATTCTCTCCGGGTCATGATAGCAGTTACAGATGCCCTACTAACCTAGTATCTCCTCTCCATTTCCTCAGGTTAAGCTCTCGTTTGGGATCAATAGCTGGCTCGATATGTAGATTCAGGCCCCACGAACATGTAAGCATCTTTAGTTCGGAAGTATGGCCGGATTTTATAAATTTGAATTTGTTATCTTGAAAGACGAAGAAAAGAAGGGAGGTCACTTAAGGGGCCAGCTGATTCAGTTAATGCATAATTTTCCTTTGGAGAATTGCTAAACATGGCAGTTATTCAATGAACTTATATTTAGTTCATTTGGTTTAGCATTTCTGCAAATTAGTTATTGTTAAATACTAGGTTGGCCATCGTGTAGGGTCAAGTAGTTGTATTGTGAGATTAGTCTTGCCTGAACATATAGGAATTCCCATTGGATTTTGTGAGTTCTTATTTATTCTCCTTACTACTATCAATAGGTGCGAAAAAAGGATGCTAGCAAATTGGTATTCCATCGGGCTCTTCTCATCTCCAAGGGTAATCTATGGAAAATAAGTTGTTTGAATATGTCATGTTTTTCAGGACGAGTGATTCGATGGTTTTTTTTGACATAGTATGTATAAATCGGTCTTCTTACATCTGTTTACAAATTACAACTTTGGCAAAGAGTTTGTTCTAGGCCATGTTTTCTGGTAGAATATACAATCAAATTTCATGTATGACATATGTTATAATACATATTTTACATGTTTATGTATCATTGTCCTTTATCGTAATTGTACATACCTAGTCTAGGGTTTTCCTTGTTCCCTGTATATGTGTAACTTTCTTGTGAATAATAGTACGAGAATATATTCTAAATTACTCAGTACATGGTATTAGAGTTTCTAGGGTTTAGAATATTCGTTTGATGACTACCCTCTCACAAAATTAGGGTTTTAGTTTAGGTCACAAAGTTCAAGTGGCTGTTTGTTTACATCGCTCACCTCAGGAGAAGTGTTGTTGTCATTCGTTGTCGTCTGTCTTCGTCGTTGGCTGAAAAACGCCGCCCGTGAGGCTCAATTGTCGGCGGAAGGAAACTGCTCCGTCGCCACGAGCTAGTGCGTGGAAGCTTCGTTTCGTATTCCAATGGTTGGGTTCATCTTGGGTCCTCTCCCCCGATCAGTTTGTGCATTTGGTTCCATCGAAAACTCTTTGGCCGTGAAGATATTTGAAGAATACATCTTTGTTTCTAGGGTTTGTGGTTGTTTTATTCAATTTTGTGGCTGTTTTGATCATGTGGAGTGATTTTACCCTAGTTTTGTATTATGGTTCTTTATTTTGGTTCTTTATTCTCGTTCCATCTGTGAGATCTACGGTTCGTGTGTAAATTCATTATGGATGCGATGAAGAATTTGGTAGTATCTAATGTGATTCCCCCAACCTCAAAGATCACATAACATAAGTTAAATAGATCTAATTACTATGATTGGCGTTTGACAGTTCTGTTTTATTTACGAAGTATGGATATGGGTAATCATATGACTGCAGATCCACCAGAAGATGATAAGAAGAAGAAGGATTGGCTTCGTGGTGATGTTCGACTTTATCTTTAGATCAAAAATTCCATTGAGAATGAGGTAATTGGTTTGGTCGATCATTGTAATTCTGTGAAGGAGCTCTTAAAATTTTTAGAATTTTTGTACTCGAGTAAAGAGCAAGTCCATAGATTGTTTGAAGTTTGTATGTAGTTCTTTCATGCCGAACAGATCGCAGAATCTATGACCAACTACTTTATGAGACTTAAGAGAATAGCTGCTGAGCTTGCTTTGTAACCTTTCAACCCAGATGTTAAAGTTCAACAAACTCAACGAGAAAAGATGGTTGTTATGATCTTTTTGAATGGACTTTTACTTGAATTTGGAAGGGCCAAAACACAAATTCTTTCTGAGTCTAAAATCCCATCATTAGAAGATGCTTTTAGTCGTGTTCTTCTCATTGAGAGTTCTTAATCCAGTTTGTCTGTTTCTCAACCCAACAGTGCTCTTATTAGCAAGAATAATAACCCAGAGTCTCTAGGGGATGGATAACAATTTTCAAGAACTAATTATGATAATCGAAAATCAGATTCTCAGGAGATTGTCTGTAACTATTACTGTAAGCCTGACCATTTGAAATGTGACTGTTGGAAGTTGTTGTATAAGAATCAACGATCTCTGCATGCTCAGATAGCTTCCAACAGTGATATGACAAAGAAGTCGGTTACCATTTCCGCAGATGAGTTTGCTAAATTTCAGATGTACTAGGACTCATTGCAAACATCACCTTCATTTAATCCTACTGCCACCATTGCTGACACAGGTAATACGAAATGTCTCCTTACATTATCTACCAAATGGGCCATAGATTTTGGTGTCACAGCTCATATGACAATTAATTCTAGCTTATTTTCTCCACTATTATCCCTTGGCTCTTCCCTATCTGTTACTTTGGTAGATGACTCAACATCCTCTGTTCTTGGTTTTGGCACCATTAACCTTTCCCATCTTTTTTTTTTTTTTTGTCTTCTGTTTTACATTTGCCTCAACTATCCTTTAATTTAATTTCTATTAGCCAACTTACTCATGACCTTAACTGTGTTGTCTCGTTTTCTCCTAGTCATTGCTTGTTTCAGGATCGTATGACGAAGAAGATTATTGGTAGAGGATATGAGTTAGGGGGCCTTTATTTTTTATCACCAAATGCGAAAGTTGTGGCTTGCTCAGGAGTTACATCTCCGTTTGAAGTTTATTGTCGTTTGGATCATCCGTCTTTGTTTGTGTTGAAGAAACTTTATCCAGAATTTTGGTCTTTGTCCTCTTTGAATTGTGATTCGTGTTAGTTCGCTAAATTTCATTGTCTTAGTTCTAGTCTTAGAGTCAATAAACGAGCAAGTTCTTCATTTGAATTAGTTCATTCTGATATTTTGGGTCCTTGCCCAATCGTATTCCGAACAGATTTTCGTTATTTTGTTACTTTTGTTGACAATCATTCTCGTTTGACTTGGTTATACTTAATGAAAAACCGTTATGAGTTGCTATCTCACTTTTGTGCTTTTCATGCTGAAATTCAAAATCAGTTCAATGTCTCTATTAAAACTTTGCGCACTAATAATGCTGGCAAATATTTTTCTCATGTGCTTGGATCTTACTTGTGTGAATATGATATCATTCATCAATTATCTTGTGCCGACACTCCATCCCAAAATGGAGTTGTTGAACGAAAGAATAGGCACTTACTTGAAACTGCACGTGTATTATCCTTTCAAATGCATGTTCCAAAGCAATTTTGGGCTGATCCTATTTCAACCCCTTGTTTTCTGATTAAGAGAATGCCTTTGTCTGTTCTTAATGGTGAGATTCCTTATCGTGTTCTTTTTCCTACCAAATCTTTATTTCCTATTGCTCCAAAGATATTTGGTTGTGTTTGTTTTGCTCGAGATGTTCATCCTCATCGTACCAAGTTAGATCTCAAATCCTTGAAATGCATATTCTTAGGTTATTCGCGTGTTCAAAAGGGGTATCGTTGTTATTGTCCTACTCTTAACAGGTACCTTGTCTCTCTTGGTGTTACGTTTTTCAAAGATATGCCTTTTAGTCCATCACCGACGAGTTCATGTCAGGGGGAGGATGACGATCTTTTTATCTATGAGATTACCTCTCCCACATCATCATCTACTCTGTCTTCATCTGTGTCTCTTCCTTCTCGCCCACCCACTACTCAAGTCTACTGCAGGAGACTTCCACAACAACCTTCAAGCACATGTCCTACACCAATGACTTCTTCGACAATTGATCTAGGACCCAGTGATGATCTTCCATCGCCCTTCGAAAAGGTAAACGCAATTGTACTTACCCTATTTCTTCGTTTGTTTTGCATCATCAGTTGTTCTCGCCCACATATTCTTTTCTAACATCCCTTGATTCCATCTCTATTCCTAACTTTGTTCATGAAGCTTTATCTCAACCTGGCTAGTATAGTGCAATAATTGAGGAGATGATTGCTTTAGATGATAATGGTACTTGGAATTTAGTCTCTCATTCTGTTGGAAAGAAGGCAATTGGATGTAAATGAGTGTTTGCTATCATGGTTAACCTTGATGGAACAGTGGCTCGATTGAAAGCTTATCTTGTTGCTAAAGGTTACGCCTAAATCTATGGGATTGATTATTTAGATACGTTTTCTCCAGTCTTCAAATTGAACTCCATCAGACTTTTTCTTTCCATGGTTGCTACCTATAATTGGCCTTTGTATCAACTTGACATAAAGAATGCCTTTTTGCATGGTGATCTCCAGGAGGAAGTTTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCACCTTCGACAATCTTTGTATGGTTTGAAACAGAGTCCACGTGCATGGTTTGGTAAGTTTAGGCAAACACTTGAACATTTTGGTATGAAGAAAAGTACGTCTGATTATTCTATCTTTTATCGGCAATCTGATAATGGTATTACTTTACTTGTTGTATATGTTAATGATATTGTTATTACTAGAAATAATGTATTGGGTATAACTTCTCTCAAGACTTTCCTTCAGGGTCAATTTCATACAAAAGATTTGGGACAATTGAAGTACTTTTTGGGTATTGAAGTAATGAGAAGCAAGAAAGGTATTTATTTGTCACAACGAAAATATGTACTTGATTTGTTGTTGGAGATAGGAAAATTAGGAGGCAAACCACGCAGTACCCCGATGATACCAAATCTGCAACTTACTAAAGAAGGAGAATTATTTAAAGATCCTGAGAGATATAGAAGATTAGTTGGGAAATTGAACTATTTAACAGTGACACGACTAGACATTGCTTATTCTGTAAGTATTATAAGTCAGTTTATGTCTTCTCCTATAGTGGACTATTGGGCTGTAGTAGAACAAATTCTATATTATTTGAAAGCTGCACCTGGACGTGAGATCTTATATAAAGATCATAACCATACAAGAGTTGAATGTTTTTCAGATGCTGATTGGCCTAGATCTTCGGAAGATAGGAGATCGACTTCTATATATTGTGTCTTTATAGGAGGAAACTTAGTATTGTGGAAGTGTAAGAAACATAATGTAGTTTCATGTTCGAGTGCTGAGTCAGAATATAGGGGCATGACACAATCTGTGTGTGAAATAGTGTGGATATACCAACTATTATCTGGGATTTAGTGTTACTGCCAGCTAAATTATGGCGTGATAATCAAGTTGCACTTCACATTGCATCTAACCTAGTGTTTCATGAACGGATGAAACATATGGAAGTGGATTGTCATTTCATTCGTGAGAAAATACAAGAAGGGTTGGTGTCCACAGGATATGTGAAGACTGGAGAACAATTGAGAGGTATTCTTACCAAAGCTGTAAATGGAGCAAGAATAAGCTATCAATACAACAAGCTTGGCATGATTGACATATTTGCTCCAGCTTGAGGGGGAGTGTTATAATACATATTTGACATATTTATGTATAATTGTCCTTTATTGTAATTGTACATACCTAGTCTAGGGTTTCCTTGTTCTCTATATATGTGTGACTTGATTCTGAATAATAGTGTGAGAATATATTCTAAATTACTCAGCGGATTATGATGGCGTAGTTTTCTAAAGTTCCTTAATTCATTTGGAAAATGATATATTAATGTTTCTGTGTAGGTAGCGAAAATTTGGGAAATGGAGCAAAGTCAACTATGTTCATGCAGAAGCAGATTGTCGATGCACTTCTTCTGGGTGATAGAAGTAGTGCATCCAACCTGCTTATGGATCTTGGCCAGGAAAAGCACTCTTTAACTGCGGATAGTTTTGTTCACATTTTGAGCTACTGTGCAATATCCCCTGATCCACTAGTAGGTTTTGTGTTCTTTGTTATCCTCTATTTTCTTTTTTTAGAATTTAAGGGTGTTGAGAAAGCAAGTTTTGTTTGCATTATGATTCAATATCACCATGAATAAGCTACAAACGGACAATGGTAGTTGGCATCTACCCATTACACGGGCTAGTCCAGGGAAAAAAATGAATTGAAGGTATTGTGTTTATACTGCACATCAGTATTTATTTGCGACTGTGTGCGTTGCATTGTAATTCTACTGTTATGTATTGTATGTACTTGAGAACATTGACTATAAATGGGAGCTGAAGCTCAGTATTCATAAGACAGACTTTACTCACTGATGTAATACCAAACGTGAGAAAAAATTCACAAATGAATTTATTGCTTGCTTACTGAAGAGTTCAGCCTATTATTTATAGGCAACAGGTTTAACAACCTAACTAATTGTTTAACAGTTTATCAACTAATATCTAAATAACTTTAGAAATTACAGACTTATTAAAAGTAATTAGTTAGTTGCTTATTTAATACATCATATTCCACCCCTCTGAAATTAAACTTGCCCTTGAGTTTTAATTGGCTAGCTAGAAGAAATCTGGGGCATTCTAAGGATGAATGTCAGCTACATTGAAGACAAGGTTGATGTGTAAATCTGATGGTAGTTCTAAGCGGTAGGCGTTGTTGCCATAAGCTCGGAGAATTTTGAAGGGGCCTAATTTCCTATTCTTCAATTTGTTGTATGTGACCATTGGAAATCTGCTCTTCCTTAGGTGTATCATTACTAAGTCTCCCTCTTGGAATGTTTGCAAACGTCTGTGCTTGTCAGCCGCTTTTTTGTGTGAAGCATTTGGTTGTTCAATGTTTGCTTGGACTTCTTGATGAAGGCGAATAATTTTTTCTGCCATTTCCTCGACTTCGTTACTAAGATCAGCAGAAGATGGAAGTTTAGCGAGGTCTACTGTTAAAGGCAAATTTAGTGTATACAATCCCAAATGGGCACTTCCCTGTGGATCGATTCTTCATGAAGTTGAAGGCAAATTCTGCTTGAGCTAGAATTAGGGCCCATTGTCTTGGATGATCTCCATATAAGCATCTAATTAAATTTCCCAATGTCCTGTTGGTGACCTTTGTTTGGCCATCAGTTTGAGGATGGCTTGTTGTGCTGTATTTGAGAGAAGTGTTGAACTTTCGCCATAATGTTTTCCAAAAATGACTTAGGAACTTAACGTCCCTATCTAAAACTATGGTTTTGGGTATACCATGGAGTCGAACTATTTCCCTAAAAAAAAGATTAGCAATATATGTAGCATCATGTGTTTTTCTACAAGGAAGGAAGTGTACCATTTTACTAAATCTGTCGACGACTACCATTACTGAGTCGTGTGCCTTTTGTGTTTTGGGTAGTCCAAGAACAAAATCCATGGAGTGGTCCTCCCAAATGGAGGTTGGAGTTGGGAGATGAGTGTAAAGTCCTGCATTGGTTGATGTCCCTTTTACCGTTTGGCAAATGAAGCAACGTTTGATGAAATTTTGGACATCTTTTCGAATTTGTGGCCAAAAGAAGCGAGTAGAAACGAGGTCCATAGTTTTTAGAGATACCAAAACGTCCGGCTAGGCCACCCGAGTGTAGGTCTTTGATTAATGTCTCTCGAAGAGATGTGTGAGGAATGCATAGACACTCTCCTTTGAACAAAAAATTACCGAATATGTGATATTCACACGAGTTAATGTTGTTGGAACAATGATGCCAGATTGTTCCAAAATCTTTGTCTAGTGAATAAAGTTTGGGTCGGTGGTGGATGCTGTGATTTCCCCTGCTAGAACAGTTAATAATTCTTCTTTCCTACTAAGTGCATCTGCTACTTTGTTGTCTTTCCCAACTTGATGTCCTATAACAAAGTCAACTCTCTGTAGAAAGGAGAGCCATCCGGTATGCATTCTATTAATTTGTTTCTGTGATTGTAAGTACTTCAAAGAAAAATGATCAGCGAGAAGTATAAATTCTTTACACAGTAGGTAGTGTTCCCATTGTTTTAAAGCCCGAACTAGAGCATAAAGTTCTTGTTCATAGATACTCAAATTTTGTCATGGTAAACTTAATTTCTCACTAAAGTACTCTATTGGATGGTTATTTTGGGATAACACTGCTCCTATTCCTACACCACATGCATCTACTGCTACTTGGAAGGGGATAGAAAAGTCAGGGAGTTTTAGAACAGGGTTGTTAGATAGAGAGTTTAATTTTGGAGAAACTTTCATCTTTTTCTTTATCCCATTTGAAGTTATTTTTTCGAAGGCAATCTGTAAGTGTTGCAGCTAATGTACCGAAGTTTTGAATAAATTTCCTATAAAAAGATGCAAGGCCTAGAAATGATTGGATCTGTTTTATAGAAGAGGGTTTTGGCCAATTAGAAATAGCTTCTATCTTTATAGGGTCAAGTTTAATTTGAGAAGTAAATTTTTGTAGTTAAAAAAACACATTTATTTTTCTGTTTATATGCAAGTGATTTGAAGACAAGGTGGACAAAACAGTGTGAAATGGTGCATATGGTCTTCTAAATTGTTGCTAAAAACAAGTATATCATCGAAATATATAACTACAAATTTATTAAGGAAAGGAAGAAAAACCTTATTCATCAACCTCATAAAGGTGCTTGGGGTGTTGGAGAGACCAAATGGCATAACTGACCATTCGAAGAGTCCTTCATTTGTTTTGAATGCGGTTTTCCATTCATCACCCGGTTTTATTCTGATTTGATGATATCCATTGCGAAGATCTAGCTTAGAAAAGATATTAGCACCACCCAGTTGGTCTAGATCATTAATTCGTGGATAAGGAATCTATACTTGACTGTGATTCTATTCATGGCATGGCTGTCCACACACACGCACCAACGACCATCCTTTTTTGGGGTGAGAAGTGTGGAAACTGTGCATGGACTGATGCTTGGTTGAATGCATCCTTTCTTTAGGAGTTCTTGAACTTGGTCGTGTATTATTTTATACTCCTGAGGGCTCATCTTGTGATGAGGGAGATTGGGTAATGTAGCTCCTGGGATGAGATCTATTCGGTGCTGTATGTCGTGCAATGGAGGTAATCCTGCAAGTGGTTCAAGAATGTTGGGGAAGTTTTTAAGTAATTCTTTCACCTGTTCTGGGAATTGTTCCTAAGGACTGTTTTCTGCATCTCCTTTTACAATTAATCCCCAAATCTCATAATCCTTCTCTTTGATGAATTGTTTTCCTTATATAATACTAAATAGTTGTCCCATGGTGTCTTTGGTAGCTGTAGAATATGGTTTATATAATGGTAGCAGGACAATTTTCCTTCCATGCCACGAAAATTCATAAGTGTTCTCCCTGCCTTTCTGTAGGACCTGCACATCATACTACCACAGCCGACCAAGTAGGATATGGCAGGCATCCATATCAAAAACATCGCAGACAATTTGGTCATTGTAATTGTTGCCAATGGAGAATTGAACGGTGCAAATATTGGATACTTGAGCCTCTCCACCCTTTTTTATCCAAACCTATTTTATAAGGATTTGGATGAGGTTCCATTGGAAGATTAAGGGCATTAACCAATTTTTTGAGACTATTCTCAGAACTTCCACTATCAATGATCACATTGCATATCTTTCCTTTAACTGTGCATCTTGTCTTAAACAATGCATGTCTCTAAGGATGAGAATCTGTCTTTGGTGTGAGAAGAATTTTTTGTAGAACACATGATAAATTTTCTCCCTCATTCGGTTCCAAAAAACCAACTCCTTCACAATCTTCCTCTGTTTCTTGGTCTTGAATTGTTCCTTCTTCTTCTATGGCTACTGTCCTTCAATTTAGGCAATTGTTGGATAAATGTCCATTTTTCCCACAATGAAAGCATTTTCCGAGCGTGGGACAGCCTTAAGGGTTGGCATTTTATTTTGCTGATCCAACAGTTGGGTCCTTTTGATTCATTGGGGCTTCTTCTTTGTTCTTGAGAAGTGGTTGCTCCTTTTTTTGTTCCACTCGTTGGCGCTTCGCAATGACACCGTCTGGTGTACTTTCTTGATTGAGATTCATTCATTTTTCTATTGCTTCAACTAGCGAAATGACATTGGCAAGGAGTCCTAAAGGCTGCAGCTTTACTTTTTCTTTAATGTCTGAACGTAGTTCTCGCACAAACCGTGCTATTAGATGTTGCTCACTTTCAGGTAAGTTGGTTCTTGCTCCTAGCCGGTAAAAATCCTCTGTATATTGATTCACTGTTTTGGTCCCTTGGCGCAGATTTTGGTATTGATTGTAAAGGATTTGCTCGTAATTGGTGGGCAAGAATCGATCCTTCATTAGTCCTTTCAATCTTTCCCAACTTGAAATGGGATTTTTCCAACAACAGTATCGATTGACCTCCAACTGTTCCCACCATGCAGAAGCACCAGCACGAAATTTCAAGGCCACCAACTGCACCTTTTTGTGTTCAGGTGTATTTGCATATTTGAAGAATCGTTCTACATTTTTAATCCATTCCAAGAATTCTTCTATATCTCTCTTACCATTGAACGTGGGAATGTCTAACTTGACTCTATACTCATTTGGTTCATGATGTATAGCAGCCCTCCAATGTCTGATTTGTTCTTCATCTTCTCCTTCTGCACTTGAGGAATCACTGACTTGTTGACTTCTCCTATGGTAAGCTTGAAATCTTGGACGATCTTGATTAACTTCTTGTAGTTCTTGGAGGTTTCTTGGATGATTGAACCGCCGCCCATTAAGATTCTGTCGTTCTTGTGGAAAATAATGAGTCTTGGCTGGTTTTCATGGTTTTGCGGTGGTAATCAAATGACTTGAACTTCGGTGGCAATCTCTTCCATTCTCACAGACATTTGCTCTAACAACCTATGAATACCACCAATGGACCCTTCAACAGATAGCATGCGTCTTGTTAAAGTTCTTGGTGATAGGGCGGTAGAAGAACCTGCCTCTTTGTTGGTAGTTTTGGGGTAGTTGCCATTGTTAGGCTTCTTACCTGCCATCAGATCCAAGGTTCTACTTACTCTGATACCACTTTGATGTAATACCAAACTTGTGAGAAAATTCACAAATGATTTTATTGCTTGCTTACTGAAGAGTCCAACCTTTTATTTATAGGCAACGGGTTTAACAGCCTAACTAATTATTTAACAGTTTATCAACTAATATCTAAATAACTTTAGAAATTACAGATTTATTAAAAGTAATTAGTTATTTGCTTATTTAATACATCACTCACCCAGTTCCTTAATGATACCTGTCCACTTTCCTCCTCTCTCTACATTTCTCTGCCACCACAACTAACTCACTCTCCGCCTTGAAAACCAAGAAATGAACCTTCTTCTAACCATAGCTAACTAATATTATAGTTTTCACTTTTCAATTTAGTCCTCATGTTTTTCTAGCTACAGTTTCACTCTCCTCAAATAAGTAAAACTGGTTGGGGTTGAATTTAAAATAATGTGCTGTCATTTTGCATCTATCTAGGAATGTAAAAAATCATTTGCTTATGACATGCTTGCATTTCACTTGCTTGTCTTATGCTAGTGTGGGAACGCAAGAATGATGACAACCTTATTACTTTCTATAGACTGATGAGTGTGCTGAAGCCTAAGGCTACACACACACACATATGTATATGTATATGTATATGTATATGTATATGTATTTATATATTTGTTTCTTCTTCTTGTAGTTTGTCATGGAGACTTGGAAAATAATGGAAGAAAGAGGAATTTTCCTAAATAACAAATGCTCCTTACTTATGGTAGAAGCACTCTGTAAAGGGGGTTACTTTGATGAGGTATAAGCATATTCTCTATGCTAATATTTTATGCAATCTTTTCCCACACATCCAGTTGCAGTAATAACAATAATTTTCTCCTGTTGTAATGATACAGGCATTTGGTCTAATAAGTTTCCTAGCAGAAAGTCATGTCATGTTCCCTGTTCTGCCTGTGTACAATTGTTTCTTGAGAGCCTGTGCCAAAAGTCAAAGTACGGTTCATGTTAGTCAATGTTTGGATCTTATGGATCACAGAATGGTTGGGAAGAATGAAGCTACATATTCTGTGCTACTCGAGGTCTGCAAGGAGCCTGATTCCCTGTCATTTTTATAAGCTTTATTTCAAGATCATTTATTTCTTAGATTTCTTTTGAACTGGTGATCATCATGAAGATACTTGAAAGTCCTCTTATGCATTTTATTTTCTACACCAATTGGATCAATCATAGGCATCTTAGTGTCAATCTTGTCGGTAAACATAAATTAAAATTTGTTGTGGATATGATTCTAAATTTCTAAGTGTATACTTAACTGTCCATGCACCCTTGTTTAGCGTGATAAGAACATATTGATTCATTAAAGAGTATCTTTATAGAATAAAATGGATCGTGGGATTGGTTCTACTAGTCATTGATCTCTTTTAAATACTCTCTCCTGTTTCAATCACGTGAGGCCTTTTGGCAGGAAGGTTTCTGTTTTAAGAGTATTGCCATATCATATAAGTACCTGTAATTTATGATGATCATCCTTGTATTGCTGTTTTGAGAATTTATTTTTGATGCGGTGGATCTGAACTCTCTTAATGTATTTATAGTTAATTAATATAATACTAAGCTTTCTTTTCTCAATCAGCTTGCGGTTTGTCAGAAGAGCTTGTCTTCTGTGCATGAAATCTGGACAGACTTTGTAAAAAATTACAGTCCAAGTGTGTTATCTCTAAGAAAGTTTATATGGTCTTACACAAGGCTGGGAGACGTGAAATCTGCATATACTGCACTGCAAAAGATGGTGGCTTTGGCCATTGGACCTGCAGGAGGAAAGTTACGATCTTTGGAATTGGACATTCCTATACCTTCAAGAACTGAATCCCATCGTAACGATTTTAATTTTGAGGAAAATGGATCTTCTACTGATGAGCTATTCTGTAAGAAAATGGTTCCCAGCAATGGTGACGTAGGGAGAATTTCTGGTAATGATATGAAATGTGGAGAAGTTGAAAGTGGTCCACTAACTTTGCCAAACAATTACAGAAGCAGTTTTGTTAAAAAGGTTTTGAGTTGGTCTTTCAATGATGCGATTCACGCATGTGCACTTACTAGGAACTGTGGTCTTGCAGAGCAGCTAATGCAACAGGTCTTATTTCTTTCCTTTGAGATCATTTTTATTTAATGATATAGCCAATTGGTACAAGAAATCTTGATTTAAAGATAATATTTGTCATAACGTTATTTCTTGTTACCAAAAATTTTTGTTATTCAGATTTACTTGTTGCTTTCATTGCAGTATCGACGTATTGGTGAATTAATATTTGTTATTTTATACATTGTCCTCATGTTCATATTTTTACACTTTATTTAGTTTAGTGTTTCATGTGTACTGACGACACTAGGAACCATGTCTTTGAAGATGCATGAACTCGGATTGCAACCTTCATGCCATACATTTGATGGTTTTGTTAGATCAGTTGTATCAGAGAGAGGTTTCAGTGATGGCATGAAAATAGTAAGTTATTTTGGCTTCGATTAGATTTTGATTATCATTACAAGCTAATCTTTTGCGTATATTTATTATTTATTTTATTTTTTTCGTTTTTGATTTTCATTATGAGGCCAAATATGTATTAATCCTCTAACTAAAAAATCCCTCTTTTCCAGTTAAAAATAATGCAACAGAGGAAATTGAAGCCATATGATTCAACTCTTGCTGCTGTTTCAATAAGCTGTAGCAAGGCGCTAGAACTTGACTTGGCTGAAGCCCTACTTGAAGAAATTTCAGCTTGTCCTCACGGAGGCCCCTTCCATGCATATTTTTCCGCATGTGACACGATGGTAAGCTGACATTTTTTTTAGTAAGAAATTGGGCTGATGAAAGAATTACAGGGGCATACAAAAAATATGTCCAGAAAAGGAGTCCTCAACTACAAGGAAAGACTTCACCCAAAAGAATGATAACAAGCTGATAATTACGAAAAAAATTCTGGTTAATAGAAGCTAGACACATCAAATCTCACAACCTTCCATGCCTCCTCCAAACTCCTCTCAACCCCTCTAAAAATCCTATGATTCCTCTTGATTCACACTCTCCACAAGATAGCAAAAACCCTAAACCCTTGCCACTAGACTCTCCCTTTATCTCAAAAAGGAAGATTCGTTAGGATCTTCCAATTTAATGATTTTAAGGCGTAACTTCTTACTGTAATTGCTTATGTTCTCAATTTCCAAGTTTACTGTAATTACGCTACTCCAGGCTGCAAGCACAAAGCTGATGCTATGCTACTCAGGCTCCTTTAAGTAGTTATAAAGTAGTGTCAAGTTTCTGACATAATTATTGCAAAAGCCCCTAAATATGTTTTATTCCAACTCATAATTTGGATTTTTTTTTTTTTTTTTAAAAAATTCTGGACGGGGTACCAGTGATTTGTGACATTGGAGGATATAAATTATTAATGCAATCAGGTTTATTCGAGTCCAACTCAGATTGTGGGATCCTTATTGCAGGATCAGCCAGAACGTGCCATGCGTATGTTTGTTAAAATGAAACAAATGGATGTGCTTCCAGATGTCAAGACCTATGAGCTTTTATTTTCTTTATTTGGTAATGTGAATGCTCCATATGAGGAGGGGAACAGATTGTCACAGGTGGATGCTGCTAAAAGGATACGCATGATAGAGATGGATATGGAAAAACATGGGATCCAACATAGTCATGCCTCTATGATGAACTTGGTAAGCCAACAACTATCTTCTCTCTCTCCCCACCACAAAAAGCGGGGGGGAAAAAAAAAAACCTTGCAGGACCCCATCTACTTCCCTCGCAAAATTTCAAGCAGAATTTGATTATCTATTTGAAGTTTCTTTCATTCTTTCCTCGTACTTGTTCAGCACTTGTTCAGCACTGTAATTTATTTTGTATCTGCGGTACACTGCTGATATTGCTGGAAGAACATTTTCTCTGGATCCTAGCAGATGTACTTTGAATGTATTTATATGCTCACGTTTCACTTGGAGAATTGTTTAACATTTAGGAAAAGGTTTTAATGAATTAGGATTGGATAAATCTGATTTAATGTTTTTTTAGATCCCATTTAATGTTGTAGTAGTACCATCAATGTTATTGTTGTATGTAGTCCTGATTATGTATACACTAGCTAAGAAAATTGGCACAAATAAAATGTAAGTCTATGGCCGCGGTACTCCACCTTTGTCCTTCAATCATTAGCAGTTAGCGCTAGAATGTAGCTAGAAGAGCATTAAGTTTACTCAATTTTATTGGGTGCACGATTAAGTTATTTTCACTTGTTACTATAATTGTTATATCTCTGTACTTGAACCTTAACGGCAAGTTGAAGAATTTTTTCTTTTGTTGGATAAGAAACATTATATTGATGAAGAATTATGATTCCATTTTTGGATAAGAAACAAACATTTCATTGAACTTGAATTATCTCCCCTTAGTGCTTATGGCATTATTGTTGTTTATATTTTCTCTTTGATTCCATTTTTTCTTTTAGTTGAAAGCTCTAGGCACAGAGGGGATGACGAAGGAGCTTCTTCAGTATTTAAATGTGGCAGAGAACCTCTTCTATTACAATAACACTTGTCTGGGGACGCATGTTTATGATGCAGTGTTGCATTCCTTAGTTAAATCCAAGGAAGTAAGTGCTTGTGGTGATGCACTCTCTTCTGTTAAGTTATAGAGGAGTTCCTCATTTGTGTTAAACAGAGGTCTATTCCATACTTTTGCAGATTCACATGGCAATAAAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGAAGCTGCAACATTTGAGATAATGCTTGACTGTTGTTGTGTTATGGGATGCTTGAAATCAGCTTTTGCTCTTCTCTCCATCATGATCCGCTCAGGGTTTTGTCCAGGGGTATTAACTTATACGAGTCTAATAAAGGTATTCAATGAATTTTTATTGGTAGGCATATACATCACTCACTGCAAGTATACTTCTAAAGCGTTCATCTTGAGTTAATAGATCATGGGCTTCTGATGAAATCACTTTGTGGACTATCAGCTATGCATTCATGTTTTCAGATTTATACGAGTTTGTTAACTTGAATGTTGCTGATGCCCTCATATACAAGTCTGACCACAAAAAGTCTTGTATTAAAGAAGAGGGACTGTAGACTTCAAGCTTAGAGAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTCCAATGGTGGTTTCTTTTGACAATCGTATAGTTTGGCCAATTTACAATTCATTGTATGGTTCAAAAAGCTGTCCATGAATTTTCTCTTGTGCCACGACACCCATGGAGGATGTAATATTAAAATTTTCCAGAAGTTTCCATGTCAATCATTGTCTATTAATACTCATTGTTAGTCTTTTCCTTTATTGTGTTATTACATCTTTCAAAATTTAACGTTGGGATTATCCTGATACAGATTGTGCTAAGATTTGAGAGATTTGATGATGCCTTGAATCTTTTGGATCAAGCCAGTTCCGAAGGGATTGAACTTGATGTAACTATTATGAATACAATCATGAAGGAAGCTTGTGTAAAGGTACCCCCCATGGCCTTGTGTCACACTTGTAGTAATTTTCTGAAAACTTTTCAAAAACAGCACGGTGTTAATAGCGAAATACATTATTAACAACAGTCCTCAATTTACTAAAGGATATATCCAGCTGTCCATCAACTGTTTAAATAAATGTAATTCCTTTCAAGTACCTTAGACCTCGACAAATTACCTGCATATTTCTTAGTCTTTTGCCAAAATATCATGAATATCCGCATATTCTCTGTGTTTGCTATGTATTTATCAACAAAATGTTTCTTTTGCTAGAAAAACAAATTTTGACATTCTTGGAAATCTTGGTATACGTCTAAGTCATCATTTGTTATTGCATGCGTTTGTGATTAGTGATGCATGAGCTGAAATTACTATTTCTGGGTTCTTTTTTGAAATGGATGTTAGTTAAGGATTGATGTGATTGAGTTTCTCGTTGAGAAGATGAACCGTGAAAAGATCCAACCCGACCCTTCAACTTGTTGTGCTGTCTTCTCCACGTATGTGAACCTTGGCTATCACAGCACTGCCATGGAAGCACTGCAAGTACTGAGCATGCGTATGTTATGCAACAACAACGACGCCTCTCCAGACATGACAGAATATGTCGAAAACTTCGTGGTTGCAGAAGATGCCGGAGCTGATTCACGTATTTTGGAATTCTTCAAAGGCTATGAAGAGTTCCTAAGTTTTGCCCTCTTCAACTTGAGATGGTCTGCCATGCTGGGATATTCACTTTGTTCCTCCCCTAGTCGTAGTCCATGGGCAATGAGACTTGCAAGTTCCTATGATAGGCCACAGAATCTCATAAGATGACATTCTTTGTTCTATTCTGCCTAAGTTGAATGCCCGATGTTTTCAACGTATTGGACAAATCATGTTTCGATGACCTTACTTGAACAGAATTTGTTCTATAGGTTGTACAATGTCCTTGAGTTCTAAAGACAAATCATCTTTCGTGTAAAAGATTACATTTTTTCTAGATGTAGAGCGTTTATTCGACATGGAATCGGAACACGAGGTATTATAATTGTTGCAACTTTTATTATTGGAGAGCATTTAGACATGAAG

mRNA sequence

ATGAGCAGCAACTACCGAACAGAGGAAAAGGTGAAGAAGCCTTTAGTTTGTTCTAACAGAACGAACCCAATTTGTACCACAACTGTATTCAGCTGCAGTGCAAGCATATTGCACTCATGGCTTTGTTCACACTGCCATTTCATTCGGAATCTTGTCTTCAATAAGCTTTCGCCGCCGTCTCATCGGGGAAGTCAATGCACAGGCAAGTTAAGCTCTCGTTTGGGATCAATAGCTGGCTCGATATGTAGATTCAGGCCCCACGAACATGTGCGAAAAAAGGATGCTAGCAAATTGGTATTCCATCGGGCTCTTCTCATCTCCAAGGGTAGCGAAAATTTGGGAAATGGAGCAAAGTCAACTATGTTCATGCAGAAGCAGATTGTCGATGCACTTCTTCTGGGTGATAGAAGTAGTGCATCCAACCTGCTTATGGATCTTGGCCAGGAAAAGCACTCTTTAACTGCGGATAGTTTTGTTCACATTTTGAGCTACTGTGCAATATCCCCTGATCCACTATTTGTCATGGAGACTTGGAAAATAATGGAAGAAAGAGGAATTTTCCTAAATAACAAATGCTCCTTACTTATGGTAGAAGCACTCTGTAAAGGGGGTTACTTTGATGAGGCATTTGGTCTAATAAGTTTCCTAGCAGAAAGTCATGTCATGTTCCCTGTTCTGCCTGTGTACAATTGTTTCTTGAGAGCCTGTGCCAAAAGTCAAAGTACGGTTCATGTTAGTCAATGTTTGGATCTTATGGATCACAGAATGGTTGGGAAGAATGAAGCTACATATTCTGTGCTACTCGAGCTTGCGGTTTGTCAGAAGAGCTTGTCTTCTGTGCATGAAATCTGGACAGACTTTGTAAAAAATTACAGTCCAAGTGTGTTATCTCTAAGAAAGTTTATATGGTCTTACACAAGGCTGGGAGACGTGAAATCTGCATATACTGCACTGCAAAAGATGGTGGCTTTGGCCATTGGACCTGCAGGAGGAAAGTTACGATCTTTGGAATTGGACATTCCTATACCTTCAAGAACTGAATCCCATCGTAACGATTTTAATTTTGAGGAAAATGGATCTTCTACTGATGAGCTATTCTGTAAGAAAATGGTTCCCAGCAATGGTGACGTAGGGAGAATTTCTGGTAATGATATGAAATGTGGAGAAGTTGAAAGTGGTCCACTAACTTTGCCAAACAATTACAGAAGCAGTTTTGTTAAAAAGGTTTTGAGTTGGTCTTTCAATGATGCGATTCACGCATGTGCACTTACTAGGAACTGTGGTCTTGCAGAGCAGCTAATGCAACAGTTTAGTGTTTCATGTGTACTGACGACACTAGGAACCATGTCTTTGAAGATGCATGAACTCGGATTGCAACCTTCATGCCATACATTTGATGGTTTTGTTAGATCAGTTGTATCAGAGAGAGGTTTCAGTGATGGCATGAAAATAGATCAGCCAGAACGTGCCATGCGTATGTTTGTTAAAATGAAACAAATGGATGTGCTTCCAGATGTCAAGACCTATGAGCTTTTATTTTCTTTATTTGGTAATGTGAATGCTCCATATGAGGAGGGGAACAGATTGTCACAGGTGGATGCTGCTAAAAGGATACGCATGATAGAGATGGATATGGAAAAACATGGGATCCAACATAGTCATGCCTCTATGATGAACTTGTTGAAAGCTCTAGGCACAGAGGGGATGACGAAGGAGCTTCTTCAGTATTTAAATGTGGCAGAGAACCTCTTCTATTACAATAACACTTGTCTGGGGACGCATGTTTATGATGCAGTGTTGCATTCCTTAGTTAAATCCAAGGAAATTCACATGGCAATAAAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGAAGCTGCAACATTTGAGATAATGCTTGACTGTTGTTGTGTTATGGGATGCTTGAAATCAGCTTTTGCTCTTCTCTCCATCATGATCCGCTCAGGGTTTTGTCCAGGGGTATTAACTTATACGAGTCTAATAAAGATTGTGCTAAGATTTGAGAGATTTGATGATGCCTTGAATCTTTTGGATCAAGCCAGTTCCGAAGGGATTGAACTTGATGTAACTATTATGAATACAATCATGAAGGAAGCTTGTGTAAAGTTAAGGATTGATGTGATTGAGTTTCTCGTTGAGAAGATGAACCGTGAAAAGATCCAACCCGACCCTTCAACTTGTTGTGCTGTCTTCTCCACGTATGTGAACCTTGGCTATCACAGCACTGCCATGGAAGCACTGCAAGTACTGAGCATGCGTATGTTATGCAACAACAACGACGCCTCTCCAGACATGACAGAATATGTCGAAAACTTCGTGGTTGCAGAAGATGCCGGAGCTGATTCACGTATTTTGGAATTCTTCAAAGGCTATGAAGAGTTCCTAAGTTTTGCCCTCTTCAACTTGAGATGGTCTGCCATGCTGGGATATTCACTTTGTTCCTCCCCTAGTCGTAGTCCATGGGCAATGAGACTTGCAAGTTCCTATGATAGGCCACAGAATCTCATAAGATGACATTCTTTGTTCTATTCTGCCTAAGTTGAATGCCCGATGTTTTCAACGTATTGGACAAATCATGTTTCGATGACCTTACTTGAACAGAATTTGTTCTATAGGTTGTACAATGTCCTTGAGTTCTAAAGACAAATCATCTTTCGTGTAAAAGATTACATTTTTTCTAGATGTAGAGCGTTTATTCGACATGGAATCGGAACACGAGGTATTATAATTGTTGCAACTTTTATTATTGGAGAGCATTTAGACATGAAG

Coding sequence (CDS)

ATGAGCAGCAACTACCGAACAGAGGAAAAGGTGAAGAAGCCTTTAGTTTGTTCTAACAGAACGAACCCAATTTGTACCACAACTGTATTCAGCTGCAGTGCAAGCATATTGCACTCATGGCTTTGTTCACACTGCCATTTCATTCGGAATCTTGTCTTCAATAAGCTTTCGCCGCCGTCTCATCGGGGAAGTCAATGCACAGGCAAGTTAAGCTCTCGTTTGGGATCAATAGCTGGCTCGATATGTAGATTCAGGCCCCACGAACATGTGCGAAAAAAGGATGCTAGCAAATTGGTATTCCATCGGGCTCTTCTCATCTCCAAGGGTAGCGAAAATTTGGGAAATGGAGCAAAGTCAACTATGTTCATGCAGAAGCAGATTGTCGATGCACTTCTTCTGGGTGATAGAAGTAGTGCATCCAACCTGCTTATGGATCTTGGCCAGGAAAAGCACTCTTTAACTGCGGATAGTTTTGTTCACATTTTGAGCTACTGTGCAATATCCCCTGATCCACTATTTGTCATGGAGACTTGGAAAATAATGGAAGAAAGAGGAATTTTCCTAAATAACAAATGCTCCTTACTTATGGTAGAAGCACTCTGTAAAGGGGGTTACTTTGATGAGGCATTTGGTCTAATAAGTTTCCTAGCAGAAAGTCATGTCATGTTCCCTGTTCTGCCTGTGTACAATTGTTTCTTGAGAGCCTGTGCCAAAAGTCAAAGTACGGTTCATGTTAGTCAATGTTTGGATCTTATGGATCACAGAATGGTTGGGAAGAATGAAGCTACATATTCTGTGCTACTCGAGCTTGCGGTTTGTCAGAAGAGCTTGTCTTCTGTGCATGAAATCTGGACAGACTTTGTAAAAAATTACAGTCCAAGTGTGTTATCTCTAAGAAAGTTTATATGGTCTTACACAAGGCTGGGAGACGTGAAATCTGCATATACTGCACTGCAAAAGATGGTGGCTTTGGCCATTGGACCTGCAGGAGGAAAGTTACGATCTTTGGAATTGGACATTCCTATACCTTCAAGAACTGAATCCCATCGTAACGATTTTAATTTTGAGGAAAATGGATCTTCTACTGATGAGCTATTCTGTAAGAAAATGGTTCCCAGCAATGGTGACGTAGGGAGAATTTCTGGTAATGATATGAAATGTGGAGAAGTTGAAAGTGGTCCACTAACTTTGCCAAACAATTACAGAAGCAGTTTTGTTAAAAAGGTTTTGAGTTGGTCTTTCAATGATGCGATTCACGCATGTGCACTTACTAGGAACTGTGGTCTTGCAGAGCAGCTAATGCAACAGTTTAGTGTTTCATGTGTACTGACGACACTAGGAACCATGTCTTTGAAGATGCATGAACTCGGATTGCAACCTTCATGCCATACATTTGATGGTTTTGTTAGATCAGTTGTATCAGAGAGAGGTTTCAGTGATGGCATGAAAATAGATCAGCCAGAACGTGCCATGCGTATGTTTGTTAAAATGAAACAAATGGATGTGCTTCCAGATGTCAAGACCTATGAGCTTTTATTTTCTTTATTTGGTAATGTGAATGCTCCATATGAGGAGGGGAACAGATTGTCACAGGTGGATGCTGCTAAAAGGATACGCATGATAGAGATGGATATGGAAAAACATGGGATCCAACATAGTCATGCCTCTATGATGAACTTGTTGAAAGCTCTAGGCACAGAGGGGATGACGAAGGAGCTTCTTCAGTATTTAAATGTGGCAGAGAACCTCTTCTATTACAATAACACTTGTCTGGGGACGCATGTTTATGATGCAGTGTTGCATTCCTTAGTTAAATCCAAGGAAATTCACATGGCAATAAAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGAAGCTGCAACATTTGAGATAATGCTTGACTGTTGTTGTGTTATGGGATGCTTGAAATCAGCTTTTGCTCTTCTCTCCATCATGATCCGCTCAGGGTTTTGTCCAGGGGTATTAACTTATACGAGTCTAATAAAGATTGTGCTAAGATTTGAGAGATTTGATGATGCCTTGAATCTTTTGGATCAAGCCAGTTCCGAAGGGATTGAACTTGATGTAACTATTATGAATACAATCATGAAGGAAGCTTGTGTAAAGTTAAGGATTGATGTGATTGAGTTTCTCGTTGAGAAGATGAACCGTGAAAAGATCCAACCCGACCCTTCAACTTGTTGTGCTGTCTTCTCCACGTATGTGAACCTTGGCTATCACAGCACTGCCATGGAAGCACTGCAAGTACTGAGCATGCGTATGTTATGCAACAACAACGACGCCTCTCCAGACATGACAGAATATGTCGAAAACTTCGTGGTTGCAGAAGATGCCGGAGCTGATTCACGTATTTTGGAATTCTTCAAAGGCTATGAAGAGTTCCTAAGTTTTGCCCTCTTCAACTTGAGATGGTCTGCCATGCTGGGATATTCACTTTGTTCCTCCCCTAGTCGTAGTCCATGGGCAATGAGACTTGCAAGTTCCTATGATAGGCCACAGAATCTCATAAGATGA

Protein sequence

MSSNYRTEEKVKKPLVCSNRTNPICTTTVFSCSASILHSWLCSHCHFIRNLVFNKLSPPSHRGSQCTGKLSSRLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIVDALLLGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFLNNKCSLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLMDHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVKSAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVPSNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQLMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKIDQPERAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPSTCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGADSRILEFFKGYEEFLSFALFNLRWSAMLGYSLCSSPSRSPWAMRLASSYDRPQNLIR
Homology
BLAST of Clc09G10840 vs. NCBI nr
Match: KAA0061245.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK09374.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1266.9 bits (3277), Expect = 0.0e+00
Identity = 672/906 (74.17%), Postives = 727/906 (80.24%), Query Frame = 0

Query: 1   MSSNYRTEEKVKKPLVCSNRTNPICTTTVFSCSASILHSWLCSHCHFIRNLVFNKLSPPS 60
           M SN RTEE+ +KPLV SNRTN I T+TVF CSA ILH WLCSHCHFI +LVFNKL PPS
Sbjct: 1   MCSNCRTEERGRKPLVYSNRTNSISTSTVFGCSARILHLWLCSHCHFIWSLVFNKLLPPS 60

Query: 61  HRG-SQCTGKLSSRLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKS 120
             G  QCTGK S RLGSIA SI RF+PHE VRK+DASKLVFHRALLISKGSE  GNGA+S
Sbjct: 61  LLGRQQCTGKASFRLGSIADSIYRFKPHELVRKQDASKLVFHRALLISKGSEIWGNGAES 120

Query: 121 TMFMQKQIVDALLLGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWK 180
           T FMQ QIVDAL LGDR+ ASNLLM LGQEK SLTAD+FV ILSYCA SPDPLFVMETWK
Sbjct: 121 TAFMQIQIVDALRLGDRNKASNLLMVLGQEKCSLTADNFVRILSYCAKSPDPLFVMETWK 180

Query: 181 IMEERGIFLNNKCSLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKS 240
           IMEERGIFLNN CSLLM+EALCKGGY DEAFGLI+FLAESHVMFPVLPVYNCFLRACA  
Sbjct: 181 IMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPVLPVYNCFLRACAIR 240

Query: 241 QSTVHVSQCLDLMDHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLR 300
           QSTVH SQCLDLMDHRMVGKNEATYS LL+LAVCQ++ SSVHEIWTDFVKNYSPSV SLR
Sbjct: 241 QSTVHASQCLDLMDHRMVGKNEATYSELLKLAVCQENSSSVHEIWTDFVKNYSPSVSSLR 300

Query: 301 KFIWSYTRLGDVKSAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENG 360
           KFIWS+ RLGD+ SAYTALQKMVALA G  G KL+S  LDIPIP RTE + N+FNFEE  
Sbjct: 301 KFIWSFARLGDLTSAYTALQKMVALATGATGRKLQS--LDIPIPLRTEFYHNNFNFEEKE 360

Query: 361 SSTDELFCKKMVPSNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIH 420
            S DE FCKKMVP NGDVG IS NDMKCG  E+GPLT+PNN+RSSFV+KVL WS ND + 
Sbjct: 361 PSIDEFFCKKMVPWNGDVGGISVNDMKCG--ETGPLTVPNNHRSSFVRKVLRWSSNDVMR 420

Query: 421 ACALTRNCGLAEQLMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFS 480
           +C+L  NCGLAEQLMQQ                MH+LGLQPS HTFDGFVRSVVSERGFS
Sbjct: 421 SCSLAGNCGLAEQLMQQ----------------MHKLGLQPSSHTFDGFVRSVVSERGFS 480

Query: 481 DGMKI------------------------------------------------------- 540
            GM+I                                                       
Sbjct: 481 AGMEILKVMQQRGLEPYDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFLSAC 540

Query: 541 ---DQPERAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMI 600
              DQPERAMRM VKMKQM V+PDV+TYELL+SLFGNVNAPYEEG++LSQVDAAKRIRMI
Sbjct: 541 GVMDQPERAMRMLVKMKQMKVVPDVRTYELLYSLFGNVNAPYEEGDKLSQVDAAKRIRMI 600

Query: 601 EMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLH 660
           EMDM KHGIQ+SH SMMNLLKALG EGM KE+LQYLN+AENLFYYNNT LG  VY+ VLH
Sbjct: 601 EMDMGKHGIQYSHFSMMNLLKALGAEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLH 660

Query: 661 SLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCP 720
            LV SKEI+MAI+LFNNMK+SGFFP+AATFEIMLDCC VMGCLKSAFALLS+MIRSGFCP
Sbjct: 661 FLVDSKEIYMAIELFNNMKNSGFFPDAATFEIMLDCCSVMGCLKSAFALLSLMIRSGFCP 720

Query: 721 GVLTYTSLIKIVLRFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLV 780
            +LTYTSL+KIVL F RFDDALNLLDQASSEGIELDV IMNTIM++AC K RIDVIEFLV
Sbjct: 721 QILTYTSLVKIVLGFGRFDDALNLLDQASSEGIELDVIIMNTIMRKACEKARIDVIEFLV 780

Query: 781 EKMNREKIQPDPSTCCAVFSTYVNLGYHSTAMEALQVLSMRM-LCNNNDASPDMTEYVEN 840
           EKMNREKIQPDPSTC  VFS YVNLGYHSTAMEALQVLSMRM LC  +D S  +TEY+EN
Sbjct: 781 EKMNREKIQPDPSTCHNVFSAYVNLGYHSTAMEALQVLSMRMLLCEEDDDS--VTEYMEN 840

Query: 841 FVVAEDAGADSRILEFFKGYEEFLSFALFNLRWSAMLGYSLCSSPSRSPWAMRLASSYDR 847
           FV+AED GADSRI EFFK   E+L FALFNLRW AMLGYS+C SP++SPWAMRLASSYD 
Sbjct: 841 FVLAEDTGADSRIAEFFKCSREYLGFALFNLRWCAMLGYSVCCSPNQSPWAMRLASSYDG 884

BLAST of Clc09G10840 vs. NCBI nr
Match: XP_038897387.1 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Benincasa hispida])

HSP 1 Score: 1231.9 bits (3186), Expect = 0.0e+00
Identity = 640/829 (77.20%), Postives = 686/829 (82.75%), Query Frame = 0

Query: 69  KLSSRLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIV 128
           + SSRLGSIA SI RFRPHEHVRK+DA KLVFHRA LIS GSE LGNGA ST FMQ QIV
Sbjct: 3   RASSRLGSIADSIYRFRPHEHVRKQDAGKLVFHRAFLISNGSEILGNGADSTTFMQMQIV 62

Query: 129 DALLLGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFL 188
           DAL LGD +SASNLLMDLGQEKHSLTADSFV ILSYCA SPDPLFVMETWKIMEERGIFL
Sbjct: 63  DALRLGDINSASNLLMDLGQEKHSLTADSFVPILSYCARSPDPLFVMETWKIMEERGIFL 122

Query: 189 NNKCSLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQC 248
           NN CSLL++EALCKGGY DEAFGLI+FLAESHVMFPVLPVYNCFLRAC K QS VHVSQC
Sbjct: 123 NNTCSLLIIEALCKGGYLDEAFGLINFLAESHVMFPVLPVYNCFLRACVKRQSMVHVSQC 182

Query: 249 LDLMDHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRL 308
           LDLMDHRMVGKNEATYS LLELAVCQK+LSSVHEIWT+ VKNYSPSVLSLRKFIWS TRL
Sbjct: 183 LDLMDHRMVGKNEATYSKLLELAVCQKNLSSVHEIWTELVKNYSPSVLSLRKFIWSCTRL 242

Query: 309 GDVKSAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCK 368
           GD+KSAYTALQKMVALA G  GGK  SL+LDIPIPSRTE + N+F FEENG STDELFCK
Sbjct: 243 GDLKSAYTALQKMVALATGATGGKSPSLKLDIPIPSRTELYCNNFTFEENGPSTDELFCK 302

Query: 369 KMVPSNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCG 428
           K+VP +G VG+IS N MKCGEVESGPL LPNN+RSSFV KVL WSFND IHACA TRNCG
Sbjct: 303 KLVPCSGVVGKISVNGMKCGEVESGPLALPNNHRSSFVMKVLRWSFNDVIHACARTRNCG 362

Query: 429 LAEQLMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKI---- 488
           LAEQLMQQ                MHELG+QPS HTFDGFVRSVVSERGFSDGMKI    
Sbjct: 363 LAEQLMQQ----------------MHELGVQPSRHTFDGFVRSVVSERGFSDGMKILKIM 422

Query: 489 ------------------------------------------------------DQPERA 548
                                                                 DQPERA
Sbjct: 423 QQRELEPYDSTLAAVSISCSKALELDLAEALLERISTCLYPHPFNAFLSACDTMDQPERA 482

Query: 549 MRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGI 608
           MRMF KM+QM+VLPD+KTY LL+SLFGNVNAPYEE +RLSQVDAAKRIRMIE+DMEKHGI
Sbjct: 483 MRMFAKMRQMEVLPDIKTYGLLYSLFGNVNAPYEEASRLSQVDAAKRIRMIEIDMEKHGI 542

Query: 609 QHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIH 668
           QHS  SMMNLLKALG EGMTKELLQYLNVAENLFYYN+TCLGT VY+ VLH LV+SKEIH
Sbjct: 543 QHSLVSMMNLLKALGAEGMTKELLQYLNVAENLFYYNHTCLGTPVYNTVLHFLVESKEIH 602

Query: 669 MAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLI 728
           MAI++FNNMKHSGFFP+A TFEIM+DCC VMGCLKSAF LLS+MIRSGFCP +LTYTSLI
Sbjct: 603 MAIEVFNNMKHSGFFPDAVTFEIMIDCCSVMGCLKSAFVLLSMMIRSGFCPQILTYTSLI 662

Query: 729 KIVLRFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQ 788
           KIVL FERFDDALNLLDQASSEGIELDV IMN I+++A  K+R+DVIEF+VEKMNR++IQ
Sbjct: 663 KIVLEFERFDDALNLLDQASSEGIELDVLIMNKILQKAREKVRVDVIEFVVEKMNRKRIQ 722

Query: 789 PDPSTCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGAD 840
           P+PSTC  VFSTYVNLGYHSTAMEALQVLSMRMLC  +D S  +TEY+ENFV+AEDAGAD
Sbjct: 723 PNPSTCHDVFSTYVNLGYHSTAMEALQVLSMRMLCKEDDTS--VTEYIENFVLAEDAGAD 782

BLAST of Clc09G10840 vs. NCBI nr
Match: XP_023536089.1 (pentatricopeptide repeat-containing protein At1g76280 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 627/825 (76.00%), Postives = 686/825 (83.15%), Query Frame = 0

Query: 73  RLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIVDALL 132
           RLGSIA S+ RFRPHEH RK+DA+K+VF RALLIS+G E LGN A+ST FMQ+QIVDAL 
Sbjct: 7   RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALR 66

Query: 133 LGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFLNNKC 192
           +GDRSSASNLLM+LGQEKHSLTAD+FV ILSYCA SPDPLFVMETWKIMEERG+FL+N C
Sbjct: 67  VGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGVFLDNTC 126

Query: 193 SLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLM 252
           +LLM++ALCKGGY DEAFGLISFLAES VMFPVLPVYN FLRAC K QSTVHVSQCLD+M
Sbjct: 127 TLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDMM 186

Query: 253 DHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVK 312
           D RMVGKNEATYS LL++AVCQK+LSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGD+K
Sbjct: 187 DRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDLK 246

Query: 313 SAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVP 372
           SAYTALQKMVAL IG AG KL SLELDIP+P RTES+  +FNFEENG STDEL+CKKMVP
Sbjct: 247 SAYTALQKMVALVIGAAGQKLPSLELDIPVPLRTESYHENFNFEENGPSTDELYCKKMVP 306

Query: 373 SNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQ 432
             GD+G+ S N MKCGEVESG LTLP+NYRS+FV KVL WSFND I ACALTRNCGLAEQ
Sbjct: 307 CEGDIGQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQ 366

Query: 433 LMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKI-------- 492
           LMQQ                MHELGLQPS HTFDGFVRSVVSERGFSDG+KI        
Sbjct: 367 LMQQ----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKILKIMQQRK 426

Query: 493 --------------------------------------------------DQPERAMRMF 552
                                                             DQPERAMRM 
Sbjct: 427 LKPYDSTLAAVSISCSKALELDLAEALLEQISACVFPHPFNAFLSACDMMDQPERAMRML 486

Query: 553 VKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 612
           VKMKQM+VLPDVKTYELL+SLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH
Sbjct: 487 VKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 546

Query: 613 ASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIK 672
            SMMNLLKALG EGMTKELLQYLNVAENLFYY+NTCLGT +Y+  LH LV+SKEIHMAI+
Sbjct: 547 FSMMNLLKALGAEGMTKELLQYLNVAENLFYYDNTCLGTPIYNTALHFLVESKEIHMAIE 606

Query: 673 LFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVL 732
           LFNNMKHSG FP+AATFE+M++CC V+GCLKSAFALLS+MIRSGFCP +LTYTSL+KIVL
Sbjct: 607 LFNNMKHSGLFPDAATFEMMINCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVL 666

Query: 733 RFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPS 792
            FERFDDALNLLDQASSEGIELDV IMNTI+++AC K+  DVIEF+VEKM REKIQPDPS
Sbjct: 667 GFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKVTXDVIEFVVEKMKREKIQPDPS 726

Query: 793 TCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGADSRIL 840
           TC +VFS YV+LGYHSTAMEALQVLSMRMLC  +D SP +TEYVE+FV+AED+ A+SRIL
Sbjct: 727 TCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDSEAESRIL 786

BLAST of Clc09G10840 vs. NCBI nr
Match: XP_022976056.1 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1205.3 bits (3117), Expect = 0.0e+00
Identity = 622/825 (75.39%), Postives = 680/825 (82.42%), Query Frame = 0

Query: 73  RLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIVDALL 132
           RLGSIA S+ RFRPHEH RK+DA+K+VF RALLIS+G E LGN A+ST FMQ+QIVDAL 
Sbjct: 7   RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALR 66

Query: 133 LGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFLNNKC 192
           +GDRSSASNLLM+LGQEKHSLTAD+FV ILSYCA SPDPLFVMETWKIMEERG+FL+N C
Sbjct: 67  VGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGVFLDNTC 126

Query: 193 SLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLM 252
           +LLM++ALCKGGY DEAFGLISFLAES VMFPVLPVYN FLRAC K QSTVHVSQCLD+M
Sbjct: 127 TLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDMM 186

Query: 253 DHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVK 312
           D RMVGKNEATYS LL++AV QK+LSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGD+K
Sbjct: 187 DRRMVGKNEATYSELLKVAVGQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDLK 246

Query: 313 SAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVP 372
           SAYTALQKMV L IG AG KL SLELDIP+P RTE + ++FNFEENG STDEL+CKK+VP
Sbjct: 247 SAYTALQKMVTLVIGAAGQKLSSLELDIPVPLRTEFYHDNFNFEENGPSTDELYCKKVVP 306

Query: 373 SNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQ 432
             GD+ + S N MKCGEVESG LTLP+NYRS+FV KVL WSFND I ACA TRNCGLAEQ
Sbjct: 307 CEGDIWQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACARTRNCGLAEQ 366

Query: 433 LMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKI-------- 492
           LMQQ                MHELGLQPS HTFDGFVRSVVSERGFSDG+KI        
Sbjct: 367 LMQQ----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKILKIMQQRK 426

Query: 493 --------------------------------------------------DQPERAMRMF 552
                                                             DQPERAMRM 
Sbjct: 427 LKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQPERAMRML 486

Query: 553 VKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 612
            KMKQM+VLPDVKTYELL+SLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH
Sbjct: 487 AKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 546

Query: 613 ASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIK 672
            SMMNLLKALG EGMTKELLQYLNVAENLFYYNNTCLGT +Y+  LH LV+SKEIHMA +
Sbjct: 547 FSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTCLGTPIYNTALHFLVESKEIHMATE 606

Query: 673 LFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVL 732
           LFNNMKHSG FP+AATFE+M+DCC V+GCLKSAFALLS+MIRSGFCP +LTYTSL+KIVL
Sbjct: 607 LFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVL 666

Query: 733 RFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPS 792
            FERFDDALNLLDQASSEGIELDV IMNTI+++AC K RIDVIEF+VEKM REKIQPDPS
Sbjct: 667 GFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKREKIQPDPS 726

Query: 793 TCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGADSRIL 840
           TC +VFS YV+LGYHSTAMEALQVLSMRMLC  +D SP +TEYVE+FV+AED+ A+SRIL
Sbjct: 727 TCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDSEAESRIL 786

BLAST of Clc09G10840 vs. NCBI nr
Match: XP_022937086.1 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1202.6 bits (3110), Expect = 0.0e+00
Identity = 622/825 (75.39%), Postives = 681/825 (82.55%), Query Frame = 0

Query: 73  RLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIVDALL 132
           RLGSIA S+ RFRPHEH RK+DA+K+VF RALLIS+G E LGN A+ST FMQ+QIVDAL 
Sbjct: 7   RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALR 66

Query: 133 LGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFLNNKC 192
           +GDRSSASNLLM+LGQEKHSLTAD+FV ILSYCA SPDPLFVMETWKIMEERG+FL+N C
Sbjct: 67  VGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGVFLDNTC 126

Query: 193 SLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLM 252
           +LLM++ALCKGGY DEAFGLISFLAES VMFPVLPVYN FLRAC K QSTVHVSQCLD+M
Sbjct: 127 TLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIM 186

Query: 253 DHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVK 312
           D RMVGKNEATYS LL++AVCQK+LSSVHEIWTDFVKNYSPSVLSLRKFIW YTRLGD+K
Sbjct: 187 DRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWCYTRLGDLK 246

Query: 313 SAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVP 372
           SA+TALQKMVAL IG AG KL SLELDIP+P RTE + ++FNFEENG STDE++CKKMVP
Sbjct: 247 SAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVP 306

Query: 373 SNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQ 432
             GD+ + S N MKCGEVESG  TLP+NYRS+FV KVL WSFND I ACALTRNCGLAEQ
Sbjct: 307 CEGDIEQFSVNGMKCGEVESG-RTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQ 366

Query: 433 LMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKI-------- 492
           LMQQ                MHELGLQPS HTFDGFVRSVVSERGFSDG+KI        
Sbjct: 367 LMQQ----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKILKIMQQRK 426

Query: 493 --------------------------------------------------DQPERAMRMF 552
                                                             DQPERAMRM 
Sbjct: 427 LKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQPERAMRML 486

Query: 553 VKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 612
            KMKQM+VLPDVKTYELL+SLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH
Sbjct: 487 AKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 546

Query: 613 ASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIK 672
            SMMNLLKALG EGMTKELLQYLNVAENLFYYNNT LGT +Y+  LH LV+SKEIHMAI+
Sbjct: 547 FSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIE 606

Query: 673 LFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVL 732
           LFNNMKHSG FP+AATFE+M+DCC V+GCLKSAFALLS+MIRSGFCP +LTYTSL+KIVL
Sbjct: 607 LFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVL 666

Query: 733 RFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPS 792
            FERFDDALNLLDQASSEGIELDV IMNTI+++AC K RIDVIEF+VEKM R+KIQPDPS
Sbjct: 667 GFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPS 726

Query: 793 TCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGADSRIL 840
           TC +VFS YV+LGYHSTAMEALQVLSMRMLC   D SP +TEYVE+FV+AED+ A+SRIL
Sbjct: 727 TCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRIL 786

BLAST of Clc09G10840 vs. ExPASy Swiss-Prot
Match: Q9SGQ6 (Pentatricopeptide repeat-containing protein At1g76280 OS=Arabidopsis thaliana OX=3702 GN=At1g76280 PE=2 SV=2)

HSP 1 Score: 611.3 bits (1575), Expect = 1.7e-173
Identity = 365/824 (44.30%), Postives = 495/824 (60.07%), Query Frame = 0

Query: 40  WLCSHCHFIRNLVFNKLSPPSHRGSQCTGKLSSRLGSIAGSICRFRPHEHVRKKDASKLV 99
           WL S   FI +   +KL+P   RG   +G  +    S + S+     +E +R +D SK+ 
Sbjct: 6   WL-SPLRFIVSSSSSKLTPYVSRGRGLSGIDNGAGCSCSRSVTTMIGNEFIRCQDESKI- 65

Query: 100 FHRALLISKGSENLGNGAKSTMFMQKQIVDALLLGDRSSASNLLMDLGQEKHSLTADSFV 159
                                  +Q QIVDAL  G+R  AS LL  L Q  +SL+AD F 
Sbjct: 66  -----------------------LQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFH 125

Query: 160 HILSYCAISPDPLFVMETWKIMEERGIFLNNKCSLLMVEALCKGGYFDEAFGLISFLAES 219
            IL YCA SPDP+FVMET+ +M ++ I L+++  L +V++LC GG+ D+A   I  + E 
Sbjct: 126 DILYYCARSPDPVFVMETYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVRED 185

Query: 220 HVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLMDHRMVGKNEATYSVLLELAVCQKSLSS 279
             + P+LP+YN FL ACA+++S  H S+CL+LMD R VGKN  TY  LL+LAV Q++LS+
Sbjct: 186 DRISPLLPIYNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLST 245

Query: 280 VHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVKSAYTALQKMVALA------IGPAGGKL 339
           V++IW  +V +Y+  +LSLR+FIWS+TRLGD+KSAY  LQ MV LA      +    GKL
Sbjct: 246 VNDIWKHYVNHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKL 305

Query: 340 RSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVPSNGDVGRISGNDMKCGEVESG 399
            S  L IP+PS+ E+    F F                   G   RI    + C    S 
Sbjct: 306 HSTRLYIPVPSKDETGSEKFAF-------------------GVTDRI----VDCN--SSS 365

Query: 400 PLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQLMQQFSV---------SCVLT 459
            + LP  +      +VL WSFND IHAC  ++N  LAEQLM Q  V            L 
Sbjct: 366 KVALPKGHNKILAIRVLRWSFNDVIHACGQSKNSELAEQLMLQLKVMQQQNLKPYDSTLA 425

Query: 460 TLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGM--------KIDQPERAMRMFVK 519
           T+     K  ++ L    H  D      +SE  +S            +DQPERA+R+  +
Sbjct: 426 TVAAYCSKALQVDLAE--HLLD-----QISECSYSYPFNNLLAAYDSLDQPERAVRVLAR 485

Query: 520 MKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHAS 579
           MK++ + PD++TYELLFSLFGNVNAPYEEGN LSQVD  KRI  IEMDM ++G QHS  S
Sbjct: 486 MKELKLRPDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPIS 545

Query: 580 MMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIKLF 639
            +N+L+ALG EGM  E++++L  AENL  ++N  LGT  Y+ VLHSL+++ E  M I +F
Sbjct: 546 RLNVLRALGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIF 605

Query: 640 NNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVLRF 699
             MK  G   + AT+ IM+DCC ++   KSA AL+S+MIR GF P  +T+T+L+KI+L  
Sbjct: 606 KRMKSCGCPADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKILLND 665

Query: 700 ERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPSTC 759
             F++ALNLLDQA+ E I LDV   NTI+++A  K  IDVIE++VE+M+REK+ PDP+TC
Sbjct: 666 ANFEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTC 725

Query: 760 CAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDAS--PDMTEYVENFVVAEDAGADSRIL 819
             VFS YV  GYH+TA+EAL VLS+RML   +  S      E  ENFV++ED  A+++I+
Sbjct: 726 HYVFSCYVEKGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFVMSEDPEAETKII 772

Query: 820 EFFKGYEEFLSFALFNLRWSAMLGYSLCSSPSRSPWAMRLASSY 839
           E F+  EE L+ AL NLRW AMLG  +  S  +SPWA  L++ Y
Sbjct: 786 ELFRKSEEHLAAALLNLRWCAMLGGRIIWSEDQSPWARALSNKY 772

BLAST of Clc09G10840 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 1.5e-12
Identity = 61/270 (22.59%), Postives = 115/270 (42.59%), Query Frame = 0

Query: 488 ERAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAA------------ 547
           + AM +FVKMK  +  P V+TY +L           E  N + +++              
Sbjct: 305 DEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVL 364

Query: 548 ----------KRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFY 607
                     ++ R +   M + G+  +  +   L+      GM ++ +  + + E+   
Sbjct: 365 IDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKL 424

Query: 608 YNNTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLK 667
             N    T  Y+ ++    KS  +H A+ + N M      P+  T+  ++D  C  G   
Sbjct: 425 SPN----TRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFD 484

Query: 668 SAFALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIELDVTIMNTIM 727
           SA+ LLS+M   G  P   TYTS+I  + + +R ++A +L D    +G+  +V +   ++
Sbjct: 485 SAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALI 544

Query: 728 KEACVKLRIDVIEFLVEKMNREKIQPDPST 736
              C   ++D    ++EKM  +   P+  T
Sbjct: 545 DGYCKAGKVDEAHLMLEKMLSKNCLPNSLT 569

BLAST of Clc09G10840 vs. ExPASy Swiss-Prot
Match: Q9SUD8 (Pentatricopeptide repeat-containing protein At4g28010 OS=Arabidopsis thaliana OX=3702 GN=At4g28010 PE=2 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 1.9e-12
Identity = 47/150 (31.33%), Postives = 74/150 (49.33%), Query Frame = 0

Query: 596 YDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMI 655
           Y+ +L SL K   +  A +LF  M+    FP+  +F IM+D     G +KSA +LL  M 
Sbjct: 532 YNCLLSSLCKEGSLDQAWRLFEEMQRDNNFPDVVSFNIMIDGSLKAGDIKSAESLLVGMS 591

Query: 656 RSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRID 715
           R+G  P + TY+ LI   L+    D+A++  D+    G E D  I ++++K    +   D
Sbjct: 592 RAGLSPDLFTYSKLINRFLKLGYLDEAISFFDKMVDSGFEPDAHICDSVLKYCISQGETD 651

Query: 716 VIEFLVEKMNREKIQPDPSTCCAVFSTYVN 746
            +  LV+K+  + I  D    C V     N
Sbjct: 652 KLTELVKKLVDKDIVLDKELTCTVMDYMCN 681

BLAST of Clc09G10840 vs. ExPASy Swiss-Prot
Match: Q9LMH5 (Putative pentatricopeptide repeat-containing protein At1g13800 OS=Arabidopsis thaliana OX=3702 GN=At1g13800 PE=3 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 1.9e-12
Identity = 60/228 (26.32%), Postives = 107/228 (46.93%), Query Frame = 0

Query: 483 KIDQPERAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEG---------------N 542
           ++++P++A  +F  MK+ DV PDV TY +L +    ++   E                 N
Sbjct: 647 RLNEPKQAYALFEDMKRRDVKPDVVTYSVLLNSDPELDMKREMEAFDVIPDVVYYTIMIN 706

Query: 543 RLSQVDAAKRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYN 602
           R   ++  K++  +  DM++  I     +   LLK      +++E+  + +V  ++FYY 
Sbjct: 707 RYCHLNDLKKVYALFKDMKRREIVPDVVTYTVLLKNKPERNLSREMKAF-DVKPDVFYYT 766

Query: 603 NTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSA 662
                      ++    K  ++  A ++F+ M  SG  P+AA +  ++ CCC MG LK A
Sbjct: 767 ----------VLIDWQCKIGDLGEAKRIFDQMIESGVDPDAAPYTALIACCCKMGYLKEA 826

Query: 663 FALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIE 696
             +   MI SG  P V+ YT+LI    R      A+ L+ +   +GI+
Sbjct: 827 KMIFDRMIESGVKPDVVPYTALIAGCCRNGFVLKAVKLVKEMLEKGIK 863

BLAST of Clc09G10840 vs. ExPASy Swiss-Prot
Match: Q9ZQF1 (Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g15630 PE=3 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 3.3e-12
Identity = 74/310 (23.87%), Postives = 129/310 (41.61%), Query Frame = 0

Query: 453 MHELGLQPSCHTFDGFVRSVVSERGFSDGMKIDQPERAMRMFVKMKQMDVLPDVKTYELL 512
           M   G++P+  T++  V      +GFS   +I   E A  +  +MK     PD++TY  +
Sbjct: 251 MEVFGIKPTIVTYNTLV------QGFSLRGRI---EGARLIISEMKSKGFQPDMQTYNPI 310

Query: 513 FSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKE 572
            S   N      EG          R   +  +M++ G+     S   L++     G  + 
Sbjct: 311 LSWMCN------EG----------RASEVLREMKEIGLVPDSVSYNILIRGCSNNGDLEM 370

Query: 573 LLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFE 632
              Y     +           + Y+ ++H L    +I  A  L   ++  G   ++ T+ 
Sbjct: 371 AFAY----RDEMVKQGMVPTFYTYNTLIHGLFMENKIEAAEILIREIREKGIVLDSVTYN 430

Query: 633 IMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSE 692
           I+++  C  G  K AFAL   M+  G  P   TYTSLI ++ R  +  +A  L ++   +
Sbjct: 431 ILINGYCQHGDAKKAFALHDEMMTDGIQPTQFTYTSLIYVLCRKNKTREADELFEKVVGK 490

Query: 693 GIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPSTCCAVFSTYVNLGYHSTA 752
           G++ D+ +MNT+M   C    +D    L+++M+   I PD  T   +       G    A
Sbjct: 491 GMKPDLVMMNTLMDGHCAIGNMDRAFSLLKEMDMMSINPDDVTYNCLMRGLCGEGKFEEA 531

Query: 753 MEALQVLSMR 763
            E +  +  R
Sbjct: 551 RELMGEMKRR 531

BLAST of Clc09G10840 vs. ExPASy TrEMBL
Match: A0A5A7V601 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold5463G00050 PE=4 SV=1)

HSP 1 Score: 1266.9 bits (3277), Expect = 0.0e+00
Identity = 672/906 (74.17%), Postives = 727/906 (80.24%), Query Frame = 0

Query: 1   MSSNYRTEEKVKKPLVCSNRTNPICTTTVFSCSASILHSWLCSHCHFIRNLVFNKLSPPS 60
           M SN RTEE+ +KPLV SNRTN I T+TVF CSA ILH WLCSHCHFI +LVFNKL PPS
Sbjct: 1   MCSNCRTEERGRKPLVYSNRTNSISTSTVFGCSARILHLWLCSHCHFIWSLVFNKLLPPS 60

Query: 61  HRG-SQCTGKLSSRLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKS 120
             G  QCTGK S RLGSIA SI RF+PHE VRK+DASKLVFHRALLISKGSE  GNGA+S
Sbjct: 61  LLGRQQCTGKASFRLGSIADSIYRFKPHELVRKQDASKLVFHRALLISKGSEIWGNGAES 120

Query: 121 TMFMQKQIVDALLLGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWK 180
           T FMQ QIVDAL LGDR+ ASNLLM LGQEK SLTAD+FV ILSYCA SPDPLFVMETWK
Sbjct: 121 TAFMQIQIVDALRLGDRNKASNLLMVLGQEKCSLTADNFVRILSYCAKSPDPLFVMETWK 180

Query: 181 IMEERGIFLNNKCSLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKS 240
           IMEERGIFLNN CSLLM+EALCKGGY DEAFGLI+FLAESHVMFPVLPVYNCFLRACA  
Sbjct: 181 IMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPVLPVYNCFLRACAIR 240

Query: 241 QSTVHVSQCLDLMDHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLR 300
           QSTVH SQCLDLMDHRMVGKNEATYS LL+LAVCQ++ SSVHEIWTDFVKNYSPSV SLR
Sbjct: 241 QSTVHASQCLDLMDHRMVGKNEATYSELLKLAVCQENSSSVHEIWTDFVKNYSPSVSSLR 300

Query: 301 KFIWSYTRLGDVKSAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENG 360
           KFIWS+ RLGD+ SAYTALQKMVALA G  G KL+S  LDIPIP RTE + N+FNFEE  
Sbjct: 301 KFIWSFARLGDLTSAYTALQKMVALATGATGRKLQS--LDIPIPLRTEFYHNNFNFEEKE 360

Query: 361 SSTDELFCKKMVPSNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIH 420
            S DE FCKKMVP NGDVG IS NDMKCG  E+GPLT+PNN+RSSFV+KVL WS ND + 
Sbjct: 361 PSIDEFFCKKMVPWNGDVGGISVNDMKCG--ETGPLTVPNNHRSSFVRKVLRWSSNDVMR 420

Query: 421 ACALTRNCGLAEQLMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFS 480
           +C+L  NCGLAEQLMQQ                MH+LGLQPS HTFDGFVRSVVSERGFS
Sbjct: 421 SCSLAGNCGLAEQLMQQ----------------MHKLGLQPSSHTFDGFVRSVVSERGFS 480

Query: 481 DGMKI------------------------------------------------------- 540
            GM+I                                                       
Sbjct: 481 AGMEILKVMQQRGLEPYDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFLSAC 540

Query: 541 ---DQPERAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMI 600
              DQPERAMRM VKMKQM V+PDV+TYELL+SLFGNVNAPYEEG++LSQVDAAKRIRMI
Sbjct: 541 GVMDQPERAMRMLVKMKQMKVVPDVRTYELLYSLFGNVNAPYEEGDKLSQVDAAKRIRMI 600

Query: 601 EMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLH 660
           EMDM KHGIQ+SH SMMNLLKALG EGM KE+LQYLN+AENLFYYNNT LG  VY+ VLH
Sbjct: 601 EMDMGKHGIQYSHFSMMNLLKALGAEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLH 660

Query: 661 SLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCP 720
            LV SKEI+MAI+LFNNMK+SGFFP+AATFEIMLDCC VMGCLKSAFALLS+MIRSGFCP
Sbjct: 661 FLVDSKEIYMAIELFNNMKNSGFFPDAATFEIMLDCCSVMGCLKSAFALLSLMIRSGFCP 720

Query: 721 GVLTYTSLIKIVLRFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLV 780
            +LTYTSL+KIVL F RFDDALNLLDQASSEGIELDV IMNTIM++AC K RIDVIEFLV
Sbjct: 721 QILTYTSLVKIVLGFGRFDDALNLLDQASSEGIELDVIIMNTIMRKACEKARIDVIEFLV 780

Query: 781 EKMNREKIQPDPSTCCAVFSTYVNLGYHSTAMEALQVLSMRM-LCNNNDASPDMTEYVEN 840
           EKMNREKIQPDPSTC  VFS YVNLGYHSTAMEALQVLSMRM LC  +D S  +TEY+EN
Sbjct: 781 EKMNREKIQPDPSTCHNVFSAYVNLGYHSTAMEALQVLSMRMLLCEEDDDS--VTEYMEN 840

Query: 841 FVVAEDAGADSRILEFFKGYEEFLSFALFNLRWSAMLGYSLCSSPSRSPWAMRLASSYDR 847
           FV+AED GADSRI EFFK   E+L FALFNLRW AMLGYS+C SP++SPWAMRLASSYD 
Sbjct: 841 FVLAEDTGADSRIAEFFKCSREYLGFALFNLRWCAMLGYSVCCSPNQSPWAMRLASSYDG 884

BLAST of Clc09G10840 vs. ExPASy TrEMBL
Match: A0A6J1IFV2 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476573 PE=4 SV=1)

HSP 1 Score: 1205.3 bits (3117), Expect = 0.0e+00
Identity = 622/825 (75.39%), Postives = 680/825 (82.42%), Query Frame = 0

Query: 73  RLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIVDALL 132
           RLGSIA S+ RFRPHEH RK+DA+K+VF RALLIS+G E LGN A+ST FMQ+QIVDAL 
Sbjct: 7   RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALR 66

Query: 133 LGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFLNNKC 192
           +GDRSSASNLLM+LGQEKHSLTAD+FV ILSYCA SPDPLFVMETWKIMEERG+FL+N C
Sbjct: 67  VGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGVFLDNTC 126

Query: 193 SLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLM 252
           +LLM++ALCKGGY DEAFGLISFLAES VMFPVLPVYN FLRAC K QSTVHVSQCLD+M
Sbjct: 127 TLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDMM 186

Query: 253 DHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVK 312
           D RMVGKNEATYS LL++AV QK+LSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGD+K
Sbjct: 187 DRRMVGKNEATYSELLKVAVGQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDLK 246

Query: 313 SAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVP 372
           SAYTALQKMV L IG AG KL SLELDIP+P RTE + ++FNFEENG STDEL+CKK+VP
Sbjct: 247 SAYTALQKMVTLVIGAAGQKLSSLELDIPVPLRTEFYHDNFNFEENGPSTDELYCKKVVP 306

Query: 373 SNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQ 432
             GD+ + S N MKCGEVESG LTLP+NYRS+FV KVL WSFND I ACA TRNCGLAEQ
Sbjct: 307 CEGDIWQFSVNGMKCGEVESGRLTLPSNYRSNFVMKVLRWSFNDVICACARTRNCGLAEQ 366

Query: 433 LMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKI-------- 492
           LMQQ                MHELGLQPS HTFDGFVRSVVSERGFSDG+KI        
Sbjct: 367 LMQQ----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKILKIMQQRK 426

Query: 493 --------------------------------------------------DQPERAMRMF 552
                                                             DQPERAMRM 
Sbjct: 427 LKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQPERAMRML 486

Query: 553 VKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 612
            KMKQM+VLPDVKTYELL+SLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH
Sbjct: 487 AKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 546

Query: 613 ASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIK 672
            SMMNLLKALG EGMTKELLQYLNVAENLFYYNNTCLGT +Y+  LH LV+SKEIHMA +
Sbjct: 547 FSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTCLGTPIYNTALHFLVESKEIHMATE 606

Query: 673 LFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVL 732
           LFNNMKHSG FP+AATFE+M+DCC V+GCLKSAFALLS+MIRSGFCP +LTYTSL+KIVL
Sbjct: 607 LFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVL 666

Query: 733 RFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPS 792
            FERFDDALNLLDQASSEGIELDV IMNTI+++AC K RIDVIEF+VEKM REKIQPDPS
Sbjct: 667 GFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKREKIQPDPS 726

Query: 793 TCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGADSRIL 840
           TC +VFS YV+LGYHSTAMEALQVLSMRMLC  +D SP +TEYVE+FV+AED+ A+SRIL
Sbjct: 727 TCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEHDTSPVVTEYVEDFVLAEDSEAESRIL 786

BLAST of Clc09G10840 vs. ExPASy TrEMBL
Match: A0A6J1F9C6 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443493 PE=4 SV=1)

HSP 1 Score: 1202.6 bits (3110), Expect = 0.0e+00
Identity = 622/825 (75.39%), Postives = 681/825 (82.55%), Query Frame = 0

Query: 73  RLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIVDALL 132
           RLGSIA S+ RFRPHEH RK+DA+K+VF RALLIS+G E LGN A+ST FMQ+QIVDAL 
Sbjct: 7   RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALR 66

Query: 133 LGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFLNNKC 192
           +GDRSSASNLLM+LGQEKHSLTAD+FV ILSYCA SPDPLFVMETWKIMEERG+FL+N C
Sbjct: 67  VGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLFVMETWKIMEERGVFLDNTC 126

Query: 193 SLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLM 252
           +LLM++ALCKGGY DEAFGLISFLAES VMFPVLPVYN FLRAC K QSTVHVSQCLD+M
Sbjct: 127 TLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIM 186

Query: 253 DHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVK 312
           D RMVGKNEATYS LL++AVCQK+LSSVHEIWTDFVKNYSPSVLSLRKFIW YTRLGD+K
Sbjct: 187 DRRMVGKNEATYSELLKVAVCQKNLSSVHEIWTDFVKNYSPSVLSLRKFIWCYTRLGDLK 246

Query: 313 SAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVP 372
           SA+TALQKMVAL IG AG KL SLELDIP+P RTE + ++FNFEENG STDE++CKKMVP
Sbjct: 247 SAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVP 306

Query: 373 SNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQ 432
             GD+ + S N MKCGEVESG  TLP+NYRS+FV KVL WSFND I ACALTRNCGLAEQ
Sbjct: 307 CEGDIEQFSVNGMKCGEVESG-RTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQ 366

Query: 433 LMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKI-------- 492
           LMQQ                MHELGLQPS HTFDGFVRSVVSERGFSDG+KI        
Sbjct: 367 LMQQ----------------MHELGLQPSSHTFDGFVRSVVSERGFSDGIKILKIMQQRK 426

Query: 493 --------------------------------------------------DQPERAMRMF 552
                                                             DQPERAMRM 
Sbjct: 427 LKPYDSTLAAVSISCSKALELDLAEALLEQISACVYPHPFNAFLSACDMMDQPERAMRML 486

Query: 553 VKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 612
            KMKQM+VLPDVKTYELL+SLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH
Sbjct: 487 AKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSH 546

Query: 613 ASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIK 672
            SMMNLLKALG EGMTKELLQYLNVAENLFYYNNT LGT +Y+  LH LV+SKEIHMAI+
Sbjct: 547 FSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIE 606

Query: 673 LFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVL 732
           LFNNMKHSG FP+AATFE+M+DCC V+GCLKSAFALLS+MIRSGFCP +LTYTSL+KIVL
Sbjct: 607 LFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVL 666

Query: 733 RFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPS 792
            FERFDDALNLLDQASSEGIELDV IMNTI+++AC K RIDVIEF+VEKM R+KIQPDPS
Sbjct: 667 GFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPS 726

Query: 793 TCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGADSRIL 840
           TC +VFS YV+LGYHSTAMEALQVLSMRMLC   D SP +TEYVE+FV+AED+ A+SRIL
Sbjct: 727 TCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRIL 786

BLAST of Clc09G10840 vs. ExPASy TrEMBL
Match: A0A1S3CEN8 (pentatricopeptide repeat-containing protein At1g76280 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103500041 PE=4 SV=1)

HSP 1 Score: 1199.5 bits (3102), Expect = 0.0e+00
Identity = 622/779 (79.85%), Postives = 673/779 (86.39%), Query Frame = 0

Query: 69  KLSSRLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIV 128
           + S RLGSIA SI RF+PHE VRK+DASKLVFHRALLISKGSE  GNGA+ST FMQ QIV
Sbjct: 3   RASFRLGSIADSIYRFKPHELVRKQDASKLVFHRALLISKGSEIWGNGAESTAFMQIQIV 62

Query: 129 DALLLGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFL 188
           DAL LGDRS ASNLLM LGQEK SLTAD+FV ILSYCA SPDPLFVMETWKIMEERGIFL
Sbjct: 63  DALRLGDRSKASNLLMVLGQEKCSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGIFL 122

Query: 189 NNKCSLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQC 248
           NN CSLLM+EALCKGGY DEAFGLI+FLAESHVMFPVLPVYNCFLRACA  QSTVH SQC
Sbjct: 123 NNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPVLPVYNCFLRACAIRQSTVHASQC 182

Query: 249 LDLMDHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRL 308
           LDLMDHRMVGKNEATYS LL+LAVCQ++ SSVHEIWTDFVKNYSPSV SLRKFIWS+ RL
Sbjct: 183 LDLMDHRMVGKNEATYSELLKLAVCQENSSSVHEIWTDFVKNYSPSVSSLRKFIWSFARL 242

Query: 309 GDVKSAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCK 368
           GD+ SAYTALQKMVALA G  G KL+S  LDIPIP RTE + N+FNFEE   S DE FCK
Sbjct: 243 GDLTSAYTALQKMVALATGATGRKLQS--LDIPIPLRTEFYHNNFNFEEKEPSIDEFFCK 302

Query: 369 KMVPSNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCG 428
           KMVP NGDVG IS NDMKCG  E+GPLT+PNN+RSSFV+KVL WS ND + +C+L  NCG
Sbjct: 303 KMVPWNGDVGGISVNDMKCG--ETGPLTVPNNHRSSFVRKVLRWSSNDVMRSCSLAGNCG 362

Query: 429 LAEQLMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKIDQPE 488
           LAEQLMQQ                MH+LGLQPS HTFDGFVRSVVSERGFS GM+IDQPE
Sbjct: 363 LAEQLMQQ----------------MHKLGLQPSSHTFDGFVRSVVSERGFSAGMEIDQPE 422

Query: 489 RAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKH 548
           RAMRM VKMKQM V+PDV+TYELL+SLFGNVNAPYEEG++LSQVDAAKRIRMIEMDM KH
Sbjct: 423 RAMRMLVKMKQMKVVPDVRTYELLYSLFGNVNAPYEEGDKLSQVDAAKRIRMIEMDMGKH 482

Query: 549 GIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKE 608
           GIQ+SH SMMNLLKALG EGM KE+LQYLN+AENLFYYNNT LG  VY+ VLH LV SKE
Sbjct: 483 GIQYSHFSMMNLLKALGAEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 542

Query: 609 IHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTS 668
           I+MAI+LFNNMK+SGFFP+AATFEIMLDCC VMGCLKSAFALLS+MIRSGFCP +LTYTS
Sbjct: 543 IYMAIELFNNMKNSGFFPDAATFEIMLDCCSVMGCLKSAFALLSLMIRSGFCPQILTYTS 602

Query: 669 LIKIVLRFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREK 728
           L+KIVL F RFDDALNLLDQASSEGIELDV IMNTIM++AC K RIDVIEFLVEKMNREK
Sbjct: 603 LVKIVLGFGRFDDALNLLDQASSEGIELDVIIMNTIMRKACEKARIDVIEFLVEKMNREK 662

Query: 729 IQPDPSTCCAVFSTYVNLGYHSTAMEALQVLSMRM-LCNNNDASPDMTEYVENFVVAEDA 788
           IQPDPSTC  VFS YVNLGYHSTAMEALQVLSMRM LC  +D S  +TEY+ENFV+AED 
Sbjct: 663 IQPDPSTCHNVFSAYVNLGYHSTAMEALQVLSMRMLLCEEDDDS--VTEYMENFVLAEDT 722

Query: 789 GADSRILEFFKGYEEFLSFALFNLRWSAMLGYSLCSSPSRSPWAMRLASSYDRPQNLIR 847
           GADSRI EFFK   E+L FALFNLRW AMLGYS+C SP++SPWAMRLASSYD  +NLIR
Sbjct: 723 GADSRIAEFFKCSREYLGFALFNLRWCAMLGYSVCCSPNQSPWAMRLASSYDGYKNLIR 759

BLAST of Clc09G10840 vs. ExPASy TrEMBL
Match: A0A6J1BT64 (pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111005486 PE=4 SV=1)

HSP 1 Score: 1179.1 bits (3049), Expect = 0.0e+00
Identity = 611/829 (73.70%), Postives = 674/829 (81.30%), Query Frame = 0

Query: 69  KLSSRLGSIAGSICRFRPHEHVRKKDASKLVFHRALLISKGSENLGNGAKSTMFMQKQIV 128
           + +SRLGSIA S+ RF+PHEH R++ +SKL FHR LLISK SE LGNGA++T FMQ QIV
Sbjct: 3   RATSRLGSIADSLYRFKPHEHGRQQASSKLAFHRTLLISKDSEFLGNGAETTKFMQMQIV 62

Query: 129 DALLLGDRSSASNLLMDLGQEKHSLTADSFVHILSYCAISPDPLFVMETWKIMEERGIFL 188
           DAL LGDRSSASNLLM+LGQEKHSLTAD+FV ILSYCA SPDPLFVMETW+IME+RGIFL
Sbjct: 63  DALRLGDRSSASNLLMELGQEKHSLTADNFVRILSYCAGSPDPLFVMETWRIMEDRGIFL 122

Query: 189 NNKCSLLMVEALCKGGYFDEAFGLISFLAESHVMFPVLPVYNCFLRACAKSQSTVHVSQC 248
           NN CSLLM+EALCKGGY DEAFGLI+FLAES VMFPVLPVYNCFLRAC K QSTVHV QC
Sbjct: 123 NNTCSLLMIEALCKGGYLDEAFGLINFLAESRVMFPVLPVYNCFLRACVKMQSTVHVGQC 182

Query: 249 LDLMDHRMVGKNEATYSVLLELAVCQKSLSSVHEIWTDFVKNYSPSVLSLRKFIWSYTRL 308
           LDLMDHRMVGKNEATYS LL+LAV Q++LSSVHEIWTDFVKNYSPSVLSLRKFIWSY RL
Sbjct: 183 LDLMDHRMVGKNEATYSELLKLAVFQQNLSSVHEIWTDFVKNYSPSVLSLRKFIWSYARL 242

Query: 309 GDVKSAYTALQKMVALAIGPAGGKLRSLELDIPIPSRTESHRNDFNFEENGSSTDELFCK 368
           GD+KSA  +LQKMVALA+G AGGKL SLELDIPIPS TE +RN+F+FE+N  S+DEL+ K
Sbjct: 243 GDLKSACISLQKMVALAVGAAGGKLPSLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRK 302

Query: 369 KMVPSNGDVGRISGNDMKCGEVESGPLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCG 428
           K+V  + D+G+ S N MKCG+ ESGPLT  NN RSSFV KVL WSFND IHACA TR+CG
Sbjct: 303 KLVTCDDDIGQFSVNGMKCGD-ESGPLTFQNNCRSSFVMKVLRWSFNDVIHACAFTRDCG 362

Query: 429 LAEQLMQQFSVSCVLTTLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGMKI---- 488
           LAEQLMQQ                M +LGLQPSCHTFDGFVRSVVSERGFSDGMKI    
Sbjct: 363 LAEQLMQQ----------------MLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIM 422

Query: 489 ------------------------------------------------------DQPERA 548
                                                                 DQPERA
Sbjct: 423 QQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERA 482

Query: 549 MRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGI 608
           MRM VKMKQ+ VLP+V TYE L+SLFGNVNAPYEEGNRLSQ DA KRIRMIEMDM KHGI
Sbjct: 483 MRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGI 542

Query: 609 QHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIH 668
           QHS+ SM NLLKALG EGMTKELLQYL+VAENLFYYNNT LGT VY+ VLH LV+SKEIH
Sbjct: 543 QHSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIH 602

Query: 669 MAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLI 728
           MAI+LFNNMKHSGFFP+AATFE+M+DCC VM CLKSAFALLS+M+R+GFCP +LTYTSL+
Sbjct: 603 MAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLV 662

Query: 729 KIVLRFERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQ 788
           KIVLR E FDDALNLLDQASSEGI+LDV IMNTI+ +AC K R+DVIEF++E+MNREKIQ
Sbjct: 663 KIVLRSEGFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQ 722

Query: 789 PDPSTCCAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDASPDMTEYVENFVVAEDAGAD 840
           PDPSTC +VFS YVNLGYHSTAMEALQVLSMRML    DASPD+TEYVENFV+AED GAD
Sbjct: 723 PDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGAD 782

BLAST of Clc09G10840 vs. TAIR 10
Match: AT1G76280.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 611.3 bits (1575), Expect = 1.2e-174
Identity = 365/824 (44.30%), Postives = 495/824 (60.07%), Query Frame = 0

Query: 40  WLCSHCHFIRNLVFNKLSPPSHRGSQCTGKLSSRLGSIAGSICRFRPHEHVRKKDASKLV 99
           WL S   FI +   +KL+P   RG   +G  +    S + S+     +E +R +D SK+ 
Sbjct: 6   WL-SPLRFIVSSSSSKLTPYVSRGRGLSGIDNGAGCSCSRSVTTMIGNEFIRCQDESKI- 65

Query: 100 FHRALLISKGSENLGNGAKSTMFMQKQIVDALLLGDRSSASNLLMDLGQEKHSLTADSFV 159
                                  +Q QIVDAL  G+R  AS LL  L Q  +SL+AD F 
Sbjct: 66  -----------------------LQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFH 125

Query: 160 HILSYCAISPDPLFVMETWKIMEERGIFLNNKCSLLMVEALCKGGYFDEAFGLISFLAES 219
            IL YCA SPDP+FVMET+ +M ++ I L+++  L +V++LC GG+ D+A   I  + E 
Sbjct: 126 DILYYCARSPDPVFVMETYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVRED 185

Query: 220 HVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLMDHRMVGKNEATYSVLLELAVCQKSLSS 279
             + P+LP+YN FL ACA+++S  H S+CL+LMD R VGKN  TY  LL+LAV Q++LS+
Sbjct: 186 DRISPLLPIYNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLST 245

Query: 280 VHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVKSAYTALQKMVALA------IGPAGGKL 339
           V++IW  +V +Y+  +LSLR+FIWS+TRLGD+KSAY  LQ MV LA      +    GKL
Sbjct: 246 VNDIWKHYVNHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKL 305

Query: 340 RSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVPSNGDVGRISGNDMKCGEVESG 399
            S  L IP+PS+ E+    F F                   G   RI    + C    S 
Sbjct: 306 HSTRLYIPVPSKDETGSEKFAF-------------------GVTDRI----VDCN--SSS 365

Query: 400 PLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQLMQQFSV---------SCVLT 459
            + LP  +      +VL WSFND IHAC  ++N  LAEQLM Q  V            L 
Sbjct: 366 KVALPKGHNKILAIRVLRWSFNDVIHACGQSKNSELAEQLMLQLKVMQQQNLKPYDSTLA 425

Query: 460 TLGTMSLKMHELGLQPSCHTFDGFVRSVVSERGFSDGM--------KIDQPERAMRMFVK 519
           T+     K  ++ L    H  D      +SE  +S            +DQPERA+R+  +
Sbjct: 426 TVAAYCSKALQVDLAE--HLLD-----QISECSYSYPFNNLLAAYDSLDQPERAVRVLAR 485

Query: 520 MKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHAS 579
           MK++ + PD++TYELLFSLFGNVNAPYEEGN LSQVD  KRI  IEMDM ++G QHS  S
Sbjct: 486 MKELKLRPDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPIS 545

Query: 580 MMNLLKALGTEGMTKELLQYLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIKLF 639
            +N+L+ALG EGM  E++++L  AENL  ++N  LGT  Y+ VLHSL+++ E  M I +F
Sbjct: 546 RLNVLRALGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIF 605

Query: 640 NNMKHSGFFPEAATFEIMLDCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVLRF 699
             MK  G   + AT+ IM+DCC ++   KSA AL+S+MIR GF P  +T+T+L+KI+L  
Sbjct: 606 KRMKSCGCPADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKILLND 665

Query: 700 ERFDDALNLLDQASSEGIELDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPSTC 759
             F++ALNLLDQA+ E I LDV   NTI+++A  K  IDVIE++VE+M+REK+ PDP+TC
Sbjct: 666 ANFEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTC 725

Query: 760 CAVFSTYVNLGYHSTAMEALQVLSMRMLCNNNDAS--PDMTEYVENFVVAEDAGADSRIL 819
             VFS YV  GYH+TA+EAL VLS+RML   +  S      E  ENFV++ED  A+++I+
Sbjct: 726 HYVFSCYVEKGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFVMSEDPEAETKII 772

Query: 820 EFFKGYEEFLSFALFNLRWSAMLGYSLCSSPSRSPWAMRLASSY 839
           E F+  EE L+ AL NLRW AMLG  +  S  +SPWA  L++ Y
Sbjct: 786 ELFRKSEEHLAAALLNLRWCAMLGGRIIWSEDQSPWARALSNKY 772

BLAST of Clc09G10840 vs. TAIR 10
Match: AT1G76280.3 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 609.0 bits (1569), Expect = 5.8e-174
Identity = 367/865 (42.43%), Postives = 497/865 (57.46%), Query Frame = 0

Query: 40  WLCSHCHFIRNLVFNKLSPPSHRGSQCTGKLSSRLGSIAGSICRFRPHEHVRKKDASKLV 99
           WL S   FI +   +KL+P   RG   +G  +    S + S+     +E +R +D SK+ 
Sbjct: 6   WL-SPLRFIVSSSSSKLTPYVSRGRGLSGIDNGAGCSCSRSVTTMIGNEFIRCQDESKI- 65

Query: 100 FHRALLISKGSENLGNGAKSTMFMQKQIVDALLLGDRSSASNLLMDLGQEKHSLTADSFV 159
                                  +Q QIVDAL  G+R  AS LL  L Q  +SL+AD F 
Sbjct: 66  -----------------------LQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFH 125

Query: 160 HILSYCAISPDPLFVMETWKIMEERGIFLNNKCSLLMVEALCKGGYFDEAFGLISFLAES 219
            IL YCA SPDP+    T+ +M ++ I L+++  L +V++LC GG+ D+A   I  + E 
Sbjct: 126 DILYYCARSPDPV----TYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVRED 185

Query: 220 HVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLMDHRMVGKNEATYSVLLELAVCQKSLSS 279
             + P+LP+YN FL ACA+++S  H S+CL+LMD R VGKN  TY  LL+LAV Q++LS+
Sbjct: 186 DRISPLLPIYNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLST 245

Query: 280 VHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVKSAYTALQKMVALA------IGPAGGKL 339
           V++IW  +V +Y+  +LSLR+FIWS+TRLGD+KSAY  LQ MV LA      +    GKL
Sbjct: 246 VNDIWKHYVNHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKL 305

Query: 340 RSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVPSNGDVGRISGNDMKCGEVESG 399
            S  L IP+PS+ E+    F F                   G   RI    + C    S 
Sbjct: 306 HSTRLYIPVPSKDETGSEKFAF-------------------GVTDRI----VDCN--SSS 365

Query: 400 PLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQLMQQFSVSCVLTTLGTMSLKM 459
            + LP  +      +VL WSFND IHAC  ++N  LAEQLM                L+M
Sbjct: 366 KVALPKGHNKILAIRVLRWSFNDVIHACGQSKNSELAEQLM----------------LQM 425

Query: 460 HELGLQPSCHTFDGFVRSVVSERGFSDGM------------------------------- 519
             LGL PS HT+DGF+R+V    G+  GM                               
Sbjct: 426 QNLGLLPSSHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLATVAAYCSKALQV 485

Query: 520 ---------------------------KIDQPERAMRMFVKMKQMDVLPDVKTYELLFSL 579
                                       +DQPERA+R+  +MK++ + PD++TYELLFSL
Sbjct: 486 DLAEHLLDQISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLRPDMRTYELLFSL 545

Query: 580 FGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQ 639
           FGNVNAPYEEGN LSQVD  KRI  IEMDM ++G QHS  S +N+L+ALG EGM  E+++
Sbjct: 546 FGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRALGAEGMVNEMIR 605

Query: 640 YLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIML 699
           +L  AENL  ++N  LGT  Y+ VLHSL+++ E  M I +F  MK  G   + AT+ IM+
Sbjct: 606 HLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCGCPADVATYNIMI 665

Query: 700 DCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIE 759
           DCC ++   KSA AL+S+MIR GF P  +T+T+L+KI+L    F++ALNLLDQA+ E I 
Sbjct: 666 DCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKILLNDANFEEALNLLDQAALEEIH 725

Query: 760 LDVTIMNTIMKEACVKLRIDVIEFLVEKMNREKIQPDPSTCCAVFSTYVNLGYHSTAMEA 819
           LDV   NTI+++A  K  IDVIE++VE+M+REK+ PDP+TC  VFS YV  GYH+TA+EA
Sbjct: 726 LDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHYVFSCYVEKGYHATAIEA 785

Query: 820 LQVLSMRMLCNNNDAS--PDMTEYVENFVVAEDAGADSRILEFFKGYEEFLSFALFNLRW 839
           L VLS+RML   +  S      E  ENFV++ED  A+++I+E F+  EE L+ AL NLRW
Sbjct: 786 LNVLSLRMLNEEDKESLQDKKIELEENFVMSEDPEAETKIIELFRKSEEHLAAALLNLRW 800

BLAST of Clc09G10840 vs. TAIR 10
Match: AT1G76280.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 507.3 bits (1305), Expect = 2.4e-143
Identity = 310/752 (41.22%), Postives = 424/752 (56.38%), Query Frame = 0

Query: 40  WLCSHCHFIRNLVFNKLSPPSHRGSQCTGKLSSRLGSIAGSICRFRPHEHVRKKDASKLV 99
           WL S   FI +   +KL+P   RG   +G  +    S + S+     +E +R +D SK+ 
Sbjct: 6   WL-SPLRFIVSSSSSKLTPYVSRGRGLSGIDNGAGCSCSRSVTTMIGNEFIRCQDESKI- 65

Query: 100 FHRALLISKGSENLGNGAKSTMFMQKQIVDALLLGDRSSASNLLMDLGQEKHSLTADSFV 159
                                  +Q QIVDAL  G+R  AS LL  L Q  +SL+AD F 
Sbjct: 66  -----------------------LQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFH 125

Query: 160 HILSYCAISPDPLFVMETWKIMEERGIFLNNKCSLLMVEALCKGGYFDEAFGLISFLAES 219
            IL YCA SPDP+FVMET+ +M ++ I L+++  L +V++LC GG+ D+A   I  + E 
Sbjct: 126 DILYYCARSPDPVFVMETYSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVRED 185

Query: 220 HVMFPVLPVYNCFLRACAKSQSTVHVSQCLDLMDHRMVGKNEATYSVLLELAVCQKSLSS 279
             + P+LP+YN FL ACA+++S  H S+CL+LMD R VGKN  TY  LL+LAV Q++LS+
Sbjct: 186 DRISPLLPIYNFFLGACARTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLST 245

Query: 280 VHEIWTDFVKNYSPSVLSLRKFIWSYTRLGDVKSAYTALQKMVALA------IGPAGGKL 339
           V++IW  +V +Y+  +LSLR+FIWS+TRLGD+KSAY  LQ MV LA      +    GKL
Sbjct: 246 VNDIWKHYVNHYNLDILSLRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRGKL 305

Query: 340 RSLELDIPIPSRTESHRNDFNFEENGSSTDELFCKKMVPSNGDVGRISGNDMKCGEVESG 399
            S  L IP+PS+ E+    F F                   G   RI    + C    S 
Sbjct: 306 HSTRLYIPVPSKDETGSEKFAF-------------------GVTDRI----VDCN--SSS 365

Query: 400 PLTLPNNYRSSFVKKVLSWSFNDAIHACALTRNCGLAEQLMQQFSVSCVLTTLGTMSLKM 459
            + LP  +      +VL WSFND IHAC  ++N  LAEQLM                L+M
Sbjct: 366 KVALPKGHNKILAIRVLRWSFNDVIHACGQSKNSELAEQLM----------------LQM 425

Query: 460 HELGLQPSCHTFDGFVRSVVSERGFSDGM------------------------------- 519
             LGL PS HT+DGF+R+V    G+  GM                               
Sbjct: 426 QNLGLLPSSHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLATVAAYCSKALQV 485

Query: 520 ---------------------------KIDQPERAMRMFVKMKQMDVLPDVKTYELLFSL 579
                                       +DQPERA+R+  +MK++ + PD++TYELLFSL
Sbjct: 486 DLAEHLLDQISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLRPDMRTYELLFSL 545

Query: 580 FGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQ 639
           FGNVNAPYEEGN LSQVD  KRI  IEMDM ++G QHS  S +N+L+ALG EGM  E+++
Sbjct: 546 FGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRALGAEGMVNEMIR 605

Query: 640 YLNVAENLFYYNNTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIML 699
           +L  AENL  ++N  LGT  Y+ VLHSL+++ E  M I +F  MK  G   + AT+ IM+
Sbjct: 606 HLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCGCPADVATYNIMI 665

Query: 700 DCCCVMGCLKSAFALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIE 727
           DCC ++   KSA AL+S+MIR GF P  +T+T+L+KI+L    F++ALNLLDQA+ E I 
Sbjct: 666 DCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKILLNDANFEEALNLLDQAALEEIH 691

BLAST of Clc09G10840 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 76.6 bits (187), Expect = 1.0e-13
Identity = 61/270 (22.59%), Postives = 115/270 (42.59%), Query Frame = 0

Query: 488 ERAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEGNRLSQVDAA------------ 547
           + AM +FVKMK  +  P V+TY +L           E  N + +++              
Sbjct: 305 DEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVL 364

Query: 548 ----------KRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFY 607
                     ++ R +   M + G+  +  +   L+      GM ++ +  + + E+   
Sbjct: 365 IDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKL 424

Query: 608 YNNTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLK 667
             N    T  Y+ ++    KS  +H A+ + N M      P+  T+  ++D  C  G   
Sbjct: 425 SPN----TRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFD 484

Query: 668 SAFALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIELDVTIMNTIM 727
           SA+ LLS+M   G  P   TYTS+I  + + +R ++A +L D    +G+  +V +   ++
Sbjct: 485 SAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALI 544

Query: 728 KEACVKLRIDVIEFLVEKMNREKIQPDPST 736
              C   ++D    ++EKM  +   P+  T
Sbjct: 545 DGYCKAGKVDEAHLMLEKMLSKNCLPNSLT 569

BLAST of Clc09G10840 vs. TAIR 10
Match: AT1G13800.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 76.3 bits (186), Expect = 1.4e-13
Identity = 60/228 (26.32%), Postives = 107/228 (46.93%), Query Frame = 0

Query: 483 KIDQPERAMRMFVKMKQMDVLPDVKTYELLFSLFGNVNAPYEEG---------------N 542
           ++++P++A  +F  MK+ DV PDV TY +L +    ++   E                 N
Sbjct: 647 RLNEPKQAYALFEDMKRRDVKPDVVTYSVLLNSDPELDMKREMEAFDVIPDVVYYTIMIN 706

Query: 543 RLSQVDAAKRIRMIEMDMEKHGIQHSHASMMNLLKALGTEGMTKELLQYLNVAENLFYYN 602
           R   ++  K++  +  DM++  I     +   LLK      +++E+  + +V  ++FYY 
Sbjct: 707 RYCHLNDLKKVYALFKDMKRREIVPDVVTYTVLLKNKPERNLSREMKAF-DVKPDVFYYT 766

Query: 603 NTCLGTHVYDAVLHSLVKSKEIHMAIKLFNNMKHSGFFPEAATFEIMLDCCCVMGCLKSA 662
                      ++    K  ++  A ++F+ M  SG  P+AA +  ++ CCC MG LK A
Sbjct: 767 ----------VLIDWQCKIGDLGEAKRIFDQMIESGVDPDAAPYTALIACCCKMGYLKEA 826

Query: 663 FALLSIMIRSGFCPGVLTYTSLIKIVLRFERFDDALNLLDQASSEGIE 696
             +   MI SG  P V+ YT+LI    R      A+ L+ +   +GI+
Sbjct: 827 KMIFDRMIESGVKPDVVPYTALIAGCCRNGFVLKAVKLVKEMLEKGIK 863

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0061245.10.0e+0074.17pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK09374... [more]
XP_038897387.10.0e+0077.20pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Benincasa hisp... [more]
XP_023536089.10.0e+0076.00pentatricopeptide repeat-containing protein At1g76280 [Cucurbita pepo subsp. pep... [more]
XP_022976056.10.0e+0075.39pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita maxi... [more]
XP_022937086.10.0e+0075.39pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita mosc... [more]
Match NameE-valueIdentityDescription
Q9SGQ61.7e-17344.30Pentatricopeptide repeat-containing protein At1g76280 OS=Arabidopsis thaliana OX... [more]
Q9LSL91.5e-1222.59Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX... [more]
Q9SUD81.9e-1231.33Pentatricopeptide repeat-containing protein At4g28010 OS=Arabidopsis thaliana OX... [more]
Q9LMH51.9e-1226.32Putative pentatricopeptide repeat-containing protein At1g13800 OS=Arabidopsis th... [more]
Q9ZQF13.3e-1223.87Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A5A7V6010.0e+0074.17Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1IFV20.0e+0075.39pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita ma... [more]
A0A6J1F9C60.0e+0075.39pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Cucurbita mo... [more]
A0A1S3CEN80.0e+0079.85pentatricopeptide repeat-containing protein At1g76280 isoform X3 OS=Cucumis melo... [more]
A0A6J1BT640.0e+0073.70pentatricopeptide repeat-containing protein At1g76280 isoform X1 OS=Momordica ch... [more]
Match NameE-valueIdentityDescription
AT1G76280.11.2e-17444.30Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G76280.35.8e-17442.43Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G76280.22.4e-14341.22Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G65560.11.0e-1322.59Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G13800.11.4e-1326.32Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 592..793
e-value: 2.4E-30
score: 108.0
coord: 122..328
e-value: 2.7E-14
score: 55.3
coord: 408..590
e-value: 2.1E-10
score: 42.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 665..694
e-value: 0.82
score: 10.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 593..639
e-value: 1.2E-10
score: 41.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 595..626
e-value: 1.8E-5
score: 22.6
coord: 630..662
e-value: 1.2E-4
score: 20.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 592..626
score: 9.96388
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 627..661
score: 9.788499
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 662..696
score: 8.714292
NoneNo IPR availablePANTHERPTHR47859PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 484..840
coord: 71..483
NoneNo IPR availablePROSITEPS51257PROKAR_LIPOPROTEINcoord: 1..32
score: 5.0

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc09G10840.2Clc09G10840.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding