ClCG01G008850.2 (mRNA) Watermelon (Charleston Gray) v2.5

Overview
NameClCG01G008850.2
TypemRNA
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCG_Chr01: 10837622 .. 10847787 (+)
Sequence length3414
RNA-Seq ExpressionClCG01G008850.2
SyntenyClCG01G008850.2
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTGCTATTGCAATTGTAAAGAAGAAACTCGCTTTCGTCTCTCCATCGGAGCGCCGCCGATAGCCGGAGAAGGTGTAGAAGGTCGAATAACGGCACGGAACTGCAGAGGTATAGTATCAACCAAATTTTCACGGACCCTTTGGTCGGTGCGAGAACCTAAAATGGCCGCTGGATCTGCCGATTGCCGGAGAAGATGTGAACAGTTCTATATTATCACGGCGCGGAATTGCAGAGCTTAAGTATCATCTGACTTTTCCGCGGAGAATTAGGTGGGTGCAACATGCTAAAATGCCTTATGAGCCGCAGATAGAACAGGCACAGCCGAGAATAATGATGTAATCGTGTCGCGATATTCAGGAAGAGGTTGCTCCTCCGTCATTCGGGAAAATACTTCAACTGGTGAAGTAAAAAGAGCAGTTGGGTTTTTTTTCCTTCTTTCTTTTACTTATGTGTTGAGATATTTACTCTGTGTTTCTACTTAGTGCCTGAGAGTTCGTTTGGAGTGTATTGGATTTGTTCATGAAATAGAGTAATTCTGAAATGGTGGGAGTAATAATGGCGAATGTAAATTTGTGCATCCCTAATTGTGAAAGAAATGGATTTCCGGCACTGCATTGTACCCAGAATTTCCATAATTTTTTCGGGTTTTCGTTCTTTCCTAGTTCAATTTCTGGAACTGACTTAAATATCGGCGACGTGAAGAATAGAGTTTTAAGGCACAGGGGACATAAATGTGGAGCAATTAAGGCTTCGTCAAATGGAGAATCTGATATTCGATTGCCAAGTGGGAATCTCCTCGAAAATGATTTTCCATTTAAGCCATCGTTCGATGAATATGTGAGGGTCATGGAGACTGTTAGAACTAGAAGGTACAAGAAGCAGTCGGACGATCTTAATAAACTAACGATGAAGGAAAATGCAAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAAATAGATAATGGAAAAAACAAAGGGACTGATGTTCAACGTGATGTGGATGTAAAGAACATGTTTAAACGTGTTGATCGTAAAGATTTGTTCAATAATACAGAGAGAATTACTCGTAGAAGAGATTTGTCAGGAAATAAATTTGATAGCAAAAGGAAAGGTGTTACAAGATCAAATGATGAGGTTAAAGGCAAGGTGACCCCATTTTACTCACAGGCTGATGATAAACAACATGAAAAAAAAAGGAATAGAAACTGGTCGAGTGACATTGAGCCAAAAGTACCAAGGTCGTACAATGAGAAACTAATTAATTTTAAGGCTAATACATTGGATGTCAAAAGAGAAAACCGCCGTGTATGTGATCAAAGTCCCATGAAAATATCAGAAAAGATTTGGGACGATGATGACACTAAACCAGCTAAGGATATTCTCAAGGCTGGAAAATTTGGTGTTCAGCTTGAAAGAAACTATATCCCAGGTGAAAAGGTTGGTAGAAAGAAAAATGAGCAGTCCTACAGAGGGCCGTCCAAAAGTGGCAAGCAGTTTCTTGAATTTCCTCAAGAGAGTAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTAGACATAATGGATAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGGTTTGCTGTCCCTTGCTCACTTCTCTGTCTTGCTGAAAGTTAAATTTGACTTGGTACAAGTTTGAGATCAGAACTAATACCTTATTCGATATGTCTTCTTTGTTTGTTTTGTTTTTTTTTTTTCTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGTTCTGTGAAATTAATACTCTGAATTAGTTAGGTTTCAATCTTTTTTACTAGCATTATTCTAGGTCGCGAGGTTGTTTTTTTGGGTACTAGTTTTTCTGGTCGTGTTGAATATTTGCCTAGCTTGGTCAGGATATTTTATTGTTAAGTCATGGTGCTTTCTTGAGGGTTGAGTTTAAGTATTTCTGTGGCTTGATTTTAGTGTTATGTTGATTGAACTTCAAGTAATTTATAATAAAATTTTAATTTAGTTTGAAGAGTTTTAAATTCACCTGCACCACTTATTCAATTTTAGACTTTAGAGTTCAATCTAAAAACTTAGAAGTCTACTAAGCTATATATTTGAGGGGTAAATATGAAATTTAGAAGTCTACTTGCATTGAGATAATATATGTTAATGTGTTCAAATGATGGTCATGAACTGCTATTAGCCTATTGTTTTATATTTTCTTCTCAATCACTCACACATATATTGCGGGCTCTTGAGTTATTCATTGTATGGAGACTTCTTTCATAATCGTCTCCAAAATCTTTTATTATTTGAAATTTAATTACTAATTTGCACTCAACTGTACTATGAGGGGATTTCAAGAATACTAATCAATTATCAACTATTTGAACTTTGAGGATTATCAATTAGTCTCTCCATATTGATTTGTTTGAAGGACTTCCTTGTTAACAGATTGAATGGTTCAAACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCACTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGGTTTTCCCCTATTTCTCACCTTTGCTACTTGATTAGTGTAGTGGAAGTTCTAGAAAAGTTCTTGGCCATGATGTTTTAAAAGACTATGAGTTGCCTGTGATGCCTTTCCTCTGTCTAGAAGTGCAATTTAGATTTTTAAAAATTCTTTAACAATGCATTGCTAAAAATAATGATTGAAGAGATGCAAAACATTTTAAAAAAATAGAAACAAACAAAAACAAAAAAAAGATCAATTAATTATTTTCTTAAATAATTTATAATATCATTCTTAGCATCATTGTCATTATCTCTAATAGGATCAACTACTTTTCGATTTTTTCTCCTGGTATTTCTTAATCCTTAGTGATAAAGTGGTGAAGGGCTAAGTTAGCACGCACTTATCTTTTAATCCATTGGATTAAATTGTTTACCATGCACATTGGAAATTCCATTTTCATTTCCTTGATAGAAAAATTGCATACCTTGTTCAAAAAGAAAATTCTGTTTTCATATAACCATAAACGTTGCTCATTGGTTGTCATATGAAAATTTATTCAGCAGCATCTAGGAGTTAAAATGTGAATGATAGTGTGAGGGAAACAAGAAAGTGTTCTGATTTCGTTTAGATGTAGAACACCTGAAGGAAGTTCAATAACGACAAGAGATGATAAAAGGGAAAGATGAAGAGGCGTGAGGAAGGGGAATATGCTCATCAGATTTTATATTTATTCACTTTTATAAATATAAAGGAAGTGTTAAAATATTGATAGAATTAAATTTAACCCATCATAAAAGCTTTTAGGTAGATTGGATTTTTTCCCCTCATAAAGCTTTTAGTTACTTTCATTAAGAGAGGGGAAAAAAAAAAAAAAGAGAGAGAATGTAAAGGCATACAAAAAAATCAAGTCCCCGAAAAGGAAACCCCACTAAAGAAAAGGGGTTCAATCAAGCAAAATAGTACCTATAGAATAATTAAAAAAAAAAAAAAACATTGAAGTTGAAGCCAAAAGAGAAATGTGAAACCTAATAAGGGCACAAATATCACTAGGCTCCTTTTTCAACCCTCCAAACACTCTAATAGAGCAGAATTAGGTAGATTGGTTATTTAACATGGTATTAGAGCAAGAAGGGCCCTGTGTTTAAACCCCTAAAATATCATTTCCTCTTCAATTAATATTGATTTCTACTTGATGGGCCTTTCGCAAATTTTCAAACTAGGGGAGTGTTAAAGTATTAATATAATTAAATTTATTGCAACCTATCAACTTAAAGCTTAAGTTGGTTAGTGAACAAAAAGTCTTCATCCTAACATTTTCTAAAGAATCTTGCACCCACTAAAAAATTTAATTAGAGAGAATTCAAATCTTAGTCTAATAATTTATATGCTAATCATTAATCGCTTGTTCTCCATCTTCAACCTATGTTGGGTGTTGGGCAACCGCTCTAGTGATAATGTGTTGCTACTTTTGATTGGTCCCTCTCTCTAGAAGGGTCCTAAGACACTATGGTATAACGCAGTAAAAGCCTTGCTATTTGAAATTTGGTTTGAAAGAAACCAACGAGTTTTCCATTATAAAGCGACTCCTTGGCTTGAACGTTTCAAGATTGCAAGGCTCAATGCTTCCACTTGGTGCACACAATCAAGATCATTTGTAGATTTTTCCATCCAGGATATAGTTTTAAATTGGAGGACTTCCATTTTTTCATCTTACTAGTTTTTTGTTCTTGAATCACCGTCTTAGTGTCTTTTGTTTTAGGTGTACTTTTTGTCCTCGTTTTTGGGACTTCTTTGTAGGCGTTGGTGTATGGATACTGGGCAGCTAGTAGTTCCATTTATTTTCTCATTTGACAATGTTGTAAGGATATGATGATGGTGCTAAGGTGTTGTCAACCTAGTTGAGATGCCCGGGTGCACCAACTGATCCATAGTTTATTTGTTTTTTATGTATTTGTCTCTTGTACTTTGATTGAGCATTAGTCTCATTTCATTAATTCAACAAAGAGACTTGTTTCTGTTTAAAAAAAAAATCATTAATCAATTGAATTGGAAACCAACAATAATGCTGAAAAATAACTCAAATATAATAGAAAACATCAGAAATTTTAAATGGAAAAGATATTTACTTTCTTGCATTGTCCTGTCACCAAGTATGTTCTCTGACTACCCAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCAAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGTAGGCAGATAACTAACTACCTTCATTAGGATTTGCAGTCTTGGTCTGTGCATGTTTATATAATTAGATGCCGAAATTGATAGAAATAGACTAACCATGGTTGATTCATTTCCATTTCCCTTCAGCAACACTTTTCCTCATATCCTGACTTAGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTTTCCTCCAAAGAAGAAGTTTAAAACAGGGGCACTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTGAGTCATAATTGATAGATTGTTTTATATTTTACAGGTAATATTTGGTTAATGTAAAATACTGAGCAATTTGGTGGAAGAATGTTAAAGAGCAATATAATCTTCTTGTGGATATTTTCTTGCATAATCAACTGTTTTGCTCATAACATATTTATTTCTAGATCTGGAAATTCAATTATGATAATGATCTAGATTCTAGTGAATATATTGTTATCATGGTTCCATTGTCGGGTAAAGATTTAGATTTTGTATACATGGTTGGGATTGTATTTATTTTTCTTTCCTTTTTCCTTTTGGATAATAGCACCACTTTTATTATGACAAATGAAAGAAATATAAGAAGAGGCCATTTGAAAAAACTAGGCTCTGCAACAAAAATGGGCAACCAAAAATGAGAGAAATTACACAAAGAACACTAATTAATAAAAAAAAAAAAAAAAAAAGGAGAAAAATCCCAAAAAAGTTGCAAAAACCTAAAAGGGTTTAGTGAAGGTGAGAGTCAGTGGCAATCTAAAATTTGGGTTTCAAGAGGGTTTTTATTTATTTATTTAGATGGTATGAAATTACAGAGTTTGAGTATCAAAGCATAGTTGATTTTCTTCACGTTATCATTTTCCATCCAACATACTCTTGCCATTTCCCATTCTATTTTGTTCATAATATTTTAATTTACTCAGTCAATAAGGTAAGTCTCCATCATTGCTTTCTCCATTTGGTGCAACTATATGTTTTTCTTTTAAAAGTAATATTTTTTCAATCTCTTCAAATATAACTTTAGCTCATTGTCATATATATTCATCACTGCTATGAAATGCTAATAGAAACAAAATGATTTCTTAATTCTTTATGGAAATATATCTTGCATGGGTTTCTTTCGAGTTGTTCATTTGTAGTAGTCTTTTTATATTTTTCTTTCAGGTTTTAAATGCTTGTGTTAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAGGTCTACAGCCTTCGACCTCAACATATGGATTGGTCATGGAGGTAGTTAGTTCTTTAATTTCTTTCTATTGTTCATGTGGTTTGCAAGTCTATTTTGAAATTCAGAATGATTTTTCTAATGCTTGAGATGCTGTGAATGAAAAATGCTTGATTTTTTGCACCCAATAAGGAATGTTCACTCATTAAATTGCACAGAATCCACTCTGATGTCAGTGTTAAATATTATTCTGACTTTTTTATTAGAAGAAAGTAGATGGTTTTGGGTATTGCATGGACATGGAGTGGATGTATTGGGTTCTGAGTCTTGGAAGAGGATTAGTCCTACTTGTAAAAGGTTACTATGAAGTAAGGCACTGAATGGTAAGCCTAGGAATAAAGAAGAAAATCGAGATAATTCTTTTTGAGATGTTGTTGAATAGCAATAAAGAACGGCAATCTTTCGTGCCGTTCGACTTATCCGTTCTCAATCAGCCTCATATAAGGAAGTGCATGCTTGGACTTGGTCAAGGATGCTTTTAATGGTTAGATTATGCCTTAACACTTTTCAGAAGCAGCTATTGAAGGATTACATTTGTCTATCCTTCATAAGGGGTATAAATAAGCTGTTTGGTCCAAGGTTGAGTTCATGTGTACTTAAAATTATTGTTCACAAGAAAATTTTAGTTCATGCCCTGTTGAGAATGAATTAAGGTAACGCAATTTCTAATACTAGTCACCTAATTATTTTTGAAAGTTTCTTTTTTCCTTATAAAACCAAGAAGTCAGTTCTTTAAGTCAACTCTTAAACTGCCCCTAGATTTTCAATGTAGGTGAAGACAAATAATTTGATCAGTTTGTACCTCTACTATTCTTGAAGGATTTGGGAAGCATTTGATGAAACTAGGCCATTAGCTTCTGATTCTACATCTCATGACTTGCTTTTGATGGCTCAAGTATTTTATGTTGTCTTCCAAAATTTTAACATTAGAAATTGATATCTCCTTAGTCATATTTTAGTAACATGTTTCCTAGTGTGATTTTGCTCAGCACTTTGGTCTGTATTCTTCTTCTTGTTATAGTGTCTGAACACATTTTAGTAAGCTCAACTTAGTTTATTTGATTGTGCTGAATTTCCATTTTCATATGCAAGTAAGCCTATAGTTGAAATATTCTATTTACTAAATGAAACGTTTTTCCAATGAATGATAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGGTAGTCGCAGTGTGTTTATTTGTTTCTAGTTATATATTTGCTTAATGCTTTAGCTTGTCAAACTTCCAGTTCTTGTCAATACACTTTGGAAAGAGGGAAAAACTGATGAGGCTGTGCTGGCCATTGAGAACATGGAAAGACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGGTATTTCATAGTAAATTTTTTTTGTTTCTTTCAGCCTTTTGTTTTTGTTCTTTTTCACTTCCACTTAACTTTTGATTTTTTTTTTTTTTAAAAAGGTTTCTTTTGGTTTTTTCTTGTGTGGATATTTTATATATTATTATTGATTATTTCCATGAGTTGGTTCAAAATGTTATCCCGAGTTTGAGCTCTATCTCCTAAGAAGGGAAAAGTAGAATTGGTTTATTAGTTAAGTAGAAATTAGTCGAATGACTCAATGATGACTGATAAGTCAAACAATATAAGGATTTGGCTTTTCTTTTTATCACTTCCTCTTTTTTGTTTAAGAAATATTGAATTTGAGGTTAACTAATCACCATATCTTAAAAACTCTCAAAACCTTTGAGCCTTCCTTAATCTTGCCTACAATTCCATGATTAGCTTTATTGTCCAATTGTAATGTTTTGGCCTCAGATATCTCTTTCTGAGTGAATTGCTGGCCTCTCCTTTGGGAGGTCAATTTTACTTCACAATTCCAAACTCTCTGGTTGCACTTCCTGATTGCAGTATGATACACGAATGATAATTAGTAAATAGACCTCACGACAGCCTTGCTATTGTTTACAGATGGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATATTCAACCACATGAAGACCTTTTGCTCCCCCAATCTTGTTACTTATAATATAGTGTTGAAAGGTTACTTGGAACATGGGATGTTTGAAGAGGCTAGAGAGCTGTTTCAAAATTTGTCAGAGCATGGACGAAATATTAGCACTGTATCTGACTATAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCCGAAAAAAGATGGGATGATTTTGGCTATTTCTTTAACCAGATGCTTCTTTATGGATATCACTTCAACCCGAAACGTCATCTGCGGATGATATTGGAGGCTGCTAGGGCTGGAAAGGTGGGCTTATTAAATTCAATTTCTTACGCTTTCCTTGTTTCTCCTTCCTTTATATGTTTTTTTTTTAAAGTTTATCCCACCACTAGCCCATTTTAATTTGGACAGTTTATTACTAAACCTGTATAGTTTAAGCTGGCTCTAGGAAAGTTGCTTATAATTGACGATAATCTTAAATATTCTTTTTATAATAAGAGATATAATATCATGAAAGTATTGGAGGGAGAATTCGACCATTTTCCCAGTGATCATTTGTTGAAGTATTGAGAAGACACGATGCTAAATCAATTTATTATGTAATCTGTCATGAAATTCAAGATTGAAGTGTGTGTTCGCTATTTCTACTGAATACATATGTATATCATGGGGTTCACATTCATTCCTGAAAGTCGAGTTCTGCTGCTGTTTGGAACTAGATTTCTTTATGATCTTTGGATGATTTGGTGTCAATTTCCATCTCTGGCAAAGGAGTAAACTAATAAAAGTTAATTGCTTAAACTTGTTATGGTTTTGCTTTCAATTAATCACGGACTTTCAATAAACAGGATGAGCTACTGGAAACAACATGGAAGCACCTATCTCAGGCTGACCGGATTCCGCCTCCGGCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGACTACTCTGATGCTCTCTCTTGCATTTCAAATCACAATAGTAGCGATGCACATCATTTCTCTGAGCCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAGGATACTGTCATTCAGTTAATTTATAAGGTTAGTATGCTTCTTACTAGAAATGACTCACCAAATCCAGTGTTTGAGAATCTGCTACTGAGTTGTAAAGAATTTTGTAGAACTAGAAATAGTGTAGCTGACCATAGACTTGAAGAAACTGTTTGTACAAATGAAACCCAATCTGCTGCTGTCATGCATATTTAGCGTAATTTGAGAGGA

mRNA sequence

TTTGCTATTGCAATTGTAAAGAAGAAACTCGCTTTCGTCTCTCCATCGGAGCGCCGCCGATAGCCGGAGAAGGTGTAGAAGGTCGAATAACGGCACGGAACTGCAGAGGTATAGTATCAACCAAATTTTCACGGACCCTTTGGTCGGTGCGAGAACCTAAAATGGCCGCTGGATCTGCCGATTGCCGGAGAAGATGTGAACAGTTCTATATTATCACGGCGCGGAATTGCAGAGCTTAAGTATCATCTGACTTTTCCGCGGAGAATTAGGTGGGTGCAACATGCTAAAATGCCTTATGAGCCGCAGATAGAACAGGCACAGCCGAGAATAATGATGTAATCGTGTCGCGATATTCAGGAAGAGGTTGCTCCTCCGTCATTCGGGAAAATACTTCAACTGGTGAAGTAAAAAGAGCAGTTGGGTTTTTTTTCCTTCTTTCTTTTACTTATGTGTTGAGATATTTACTCTGTGTTTCTACTTAGTGCCTGAGAGTTCGTTTGGAGTGTATTGGATTTGTTCATGAAATAGAGTAATTCTGAAATGGTGGGAGTAATAATGGCGAATGTAAATTTGTGCATCCCTAATTGTGAAAGAAATGGATTTCCGGCACTGCATTGTACCCAGAATTTCCATAATTTTTTCGGGTTTTCGTTCTTTCCTAGTTCAATTTCTGGAACTGACTTAAATATCGGCGACGTGAAGAATAGAGTTTTAAGGCACAGGGGACATAAATGTGGAGCAATTAAGGCTTCGTCAAATGGAGAATCTGATATTCGATTGCCAAGTGGGAATCTCCTCGAAAATGATTTTCCATTTAAGCCATCGTTCGATGAATATGTGAGGGTCATGGAGACTGTTAGAACTAGAAGGTACAAGAAGCAGTCGGACGATCTTAATAAACTAACGATGAAGGAAAATGCAAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAAATAGATAATGGAAAAAACAAAGGGACTGATGTTCAACGTGATGTGGATGTAAAGAACATGTTTAAACGTGTTGATCGTAAAGATTTGTTCAATAATACAGAGAGAATTACTCGTAGAAGAGATTTGTCAGGAAATAAATTTGATAGCAAAAGGAAAGGTGTTACAAGATCAAATGATGAGGTTAAAGGCAAGGTGACCCCATTTTACTCACAGGCTGATGATAAACAACATGAAAAAAAAAGGAATAGAAACTGGTCGAGTGACATTGAGCCAAAAGTACCAAGGTCGTACAATGAGAAACTAATTAATTTTAAGGCTAATACATTGGATGTCAAAAGAGAAAACCGCCGTGTATGTGATCAAAGTCCCATGAAAATATCAGAAAAGATTTGGGACGATGATGACACTAAACCAGCTAAGGATATTCTCAAGGCTGGAAAATTTGGTGTTCAGCTTGAAAGAAACTATATCCCAGGTGAAAAGGTTGGTAGAAAGAAAAATGAGCAGTCCTACAGAGGGCCGTCCAAAAGTGGCAAGCAGTTTCTTGAATTTCCTCAAGAGAGTAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTAGACATAATGGATAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTTCAAACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCACTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCAAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGCAACACTTTTCCTCATATCCTGACTTAGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTTTCCTCCAAAGAAGAAGTTTAAAACAGGGGCACTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAGGTCTACAGCCTTCGACCTCAACATATGGATTGGTCATGGAGAAGAAAGTAGATGGTTTTGGGTATTGCATGGACATGGAGTGGATGTATTGGGTTCTGAGTCTTGGAAGAGGATTAGTCCTACTTGTAAAAGGTTACTATGAAGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGTTCTTGTCAATACACTTTGGAAAGAGGGAAAAACTGATGAGGCTGTGCTGGCCATTGAGAACATGGAAAGACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATATTCAACCACATGAAGACCTTTTGCTCCCCCAATCTTGTTACTTATAATATAGTGTTGAAAGGTTACTTGGAACATGGGATGTTTGAAGAGGCTAGAGAGCTGTTTCAAAATTTGTCAGAGCATGGACGAAATATTAGCACTGTATCTGACTATAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCCGAAAAAAGATGGGATGATTTTGGCTATTTCTTTAACCAGATGCTTCTTTATGGATATCACTTCAACCCGAAACGTCATCTGCGGATGATATTGGAGGCTGCTAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTATCTCAGGCTGACCGGATTCCGCCTCCGGCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGACTACTCTGATGCTCTCTCTTGCATTTCAAATCACAATAGTAGCGATGCACATCATTTCTCTGAGCCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAGGATACTGTCATTCAGTTAATTTATAAGGTTAGTATGCTTCTTACTAGAAATGACTCACCAAATCCAGTGTTTGAGAATCTGCTACTGAGTTGTAAAGAATTTTGTAGAACTAGAAATAGTGTAGCTGACCATAGACTTGAAGAAACTGTTTGTACAAATGAAACCCAATCTGCTGCTGTCATGCATATTTAGCGTAATTTGAGAGGA

Coding sequence (CDS)

ATGGTGGGAGTAATAATGGCGAATGTAAATTTGTGCATCCCTAATTGTGAAAGAAATGGATTTCCGGCACTGCATTGTACCCAGAATTTCCATAATTTTTTCGGGTTTTCGTTCTTTCCTAGTTCAATTTCTGGAACTGACTTAAATATCGGCGACGTGAAGAATAGAGTTTTAAGGCACAGGGGACATAAATGTGGAGCAATTAAGGCTTCGTCAAATGGAGAATCTGATATTCGATTGCCAAGTGGGAATCTCCTCGAAAATGATTTTCCATTTAAGCCATCGTTCGATGAATATGTGAGGGTCATGGAGACTGTTAGAACTAGAAGGTACAAGAAGCAGTCGGACGATCTTAATAAACTAACGATGAAGGAAAATGCAAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAAATAGATAATGGAAAAAACAAAGGGACTGATGTTCAACGTGATGTGGATGTAAAGAACATGTTTAAACGTGTTGATCGTAAAGATTTGTTCAATAATACAGAGAGAATTACTCGTAGAAGAGATTTGTCAGGAAATAAATTTGATAGCAAAAGGAAAGGTGTTACAAGATCAAATGATGAGGTTAAAGGCAAGGTGACCCCATTTTACTCACAGGCTGATGATAAACAACATGAAAAAAAAAGGAATAGAAACTGGTCGAGTGACATTGAGCCAAAAGTACCAAGGTCGTACAATGAGAAACTAATTAATTTTAAGGCTAATACATTGGATGTCAAAAGAGAAAACCGCCGTGTATGTGATCAAAGTCCCATGAAAATATCAGAAAAGATTTGGGACGATGATGACACTAAACCAGCTAAGGATATTCTCAAGGCTGGAAAATTTGGTGTTCAGCTTGAAAGAAACTATATCCCAGGTGAAAAGGTTGGTAGAAAGAAAAATGAGCAGTCCTACAGAGGGCCGTCCAAAAGTGGCAAGCAGTTTCTTGAATTTCCTCAAGAGAGTAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTAGACATAATGGATAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTTCAAACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCACTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCAAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGCAACACTTTTCCTCATATCCTGACTTAGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTTTCCTCCAAAGAAGAAGTTTAAAACAGGGGCACTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAGGTCTACAGCCTTCGACCTCAACATATGGATTGGTCATGGAGAAGAAAGTAGATGGTTTTGGGTATTGCATGGACATGGAGTGGATGTATTGGGTTCTGAGTCTTGGAAGAGGATTAGTCCTACTTGTAAAAGGTTACTATGAAGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGTTCTTGTCAATACACTTTGGAAAGAGGGAAAAACTGATGAGGCTGTGCTGGCCATTGAGAACATGGAAAGACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATATTCAACCACATGAAGACCTTTTGCTCCCCCAATCTTGTTACTTATAATATAGTGTTGAAAGGTTACTTGGAACATGGGATGTTTGAAGAGGCTAGAGAGCTGTTTCAAAATTTGTCAGAGCATGGACGAAATATTAGCACTGTATCTGACTATAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCCGAAAAAAGATGGGATGATTTTGGCTATTTCTTTAACCAGATGCTTCTTTATGGATATCACTTCAACCCGAAACGTCATCTGCGGATGATATTGGAGGCTGCTAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTATCTCAGGCTGACCGGATTCCGCCTCCGGCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGACTACTCTGATGCTCTCTCTTGCATTTCAAATCACAATAGTAGCGATGCACATCATTTCTCTGAGCCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAGGATACTGTCATTCAGTTAATTTATAAGGTTAGTATGCTTCTTACTAGAAATGACTCACCAAATCCAGTGTTTGAGAATCTGCTACTGAGTTGTAAAGAATTTTGTAGAACTAGAAATAGTGTAGCTGACCATAGACTTGAAGAAACTGTTTGTACAAATGAAACCCAATCTGCTGCTGTCATGCATATTTAG

Protein sequence

MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPALLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVSMLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNETQSAAVMHI
Homology
BLAST of ClCG01G008850.2 vs. NCBI nr
Match: XP_038894404.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 1646.7 bits (4263), Expect = 0.0e+00
Identity = 830/952 (87.18%), Postives = 868/952 (91.18%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMANVNLCIP+CERNGFPALHCTQN HNFFGFSFFPSS+SG DLN GD K+RVLRH
Sbjct: 1   MVGVIMANVNLCIPSCERNGFPALHCTQNSHNFFGFSFFPSSVSGPDLNFGDAKHRVLRH 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           R HKCG+IKASSNGESDIRLPS NLLENDF FKPSFDEYVRVMETVRTRRYK+QSDD NK
Sbjct: 61  RVHKCGSIKASSNGESDIRLPSENLLENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
           LTMKENAS KSAE TSISKIDNGKNK TDVQ +VDVKNMFKRVDRKDLFNNTERITR RD
Sbjct: 121 LTMKENASVKSAEITSISKIDNGKNKVTDVQGNVDVKNMFKRVDRKDLFNNTERITRERD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           LSGNK DSKRKG++RSNDEVKGKVTPF SQ +DKQHE+KRN N S+  EPKVPR YNEK 
Sbjct: 181 LSGNKIDSKRKGISRSNDEVKGKVTPFDSQVNDKQHEEKRNINRSNYTEPKVPRLYNEKR 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGKFGVQLERNYIPGEK 300
           INFKANTLD+KRE+ R  + S M+IS KIW +DDTKPAKDIL A K+ VQLERNYI G+K
Sbjct: 241 INFKANTLDIKRESHRASNGSSMRISGKIWANDDTKPAKDILNAVKYSVQLERNYISGDK 300

Query: 301 VGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
           VGRKK EQSYR  SKSGK+FLEF ++SSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRESSKSGKRFLEFTEDSSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360

Query: 361 MLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
           ML KRLNG++IDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR
Sbjct: 361 MLCKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420

Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
           ERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHF+SYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNLFHAMQQHFTSYPDLVAYHSIAVTLGQAGY 480

Query: 481 MRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
           M+ELFDVIDSMR PPKKKFKTG LEKWDPRL+PDIVIYNAVLNACVKRKNLEGAFWVLQE
Sbjct: 481 MKELFDVIDSMRSPPKKKFKTGVLEKWDPRLEPDIVIYNAVLNACVKRKNLEGAFWVLQE 540

Query: 541 LKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECGKY 600
           LKKQGLQPSTSTYGLVM                                  EVMLECGKY
Sbjct: 541 LKKQGLQPSTSTYGLVM----------------------------------EVMLECGKY 600

Query: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660
           NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA
Sbjct: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660

Query: 661 RCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSP 720
           RCLCSAGRCKEALMQMEKICKVA KPLVVTYTGLIQACLDSKD++SAVYIFNHMKTFCSP
Sbjct: 661 RCLCSAGRCKEALMQMEKICKVATKPLVVTYTGLIQACLDSKDIRSAVYIFNHMKTFCSP 720

Query: 721 NLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780
           NLVTYN++LKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF
Sbjct: 721 NLVTYNMLLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780

Query: 781 AEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPA 840
           AEKRWDDFGYF++QMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHL+QADR PPP 
Sbjct: 781 AEKRWDDFGYFYDQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLAQADRTPPPP 840

Query: 841 LLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVS 900
           LLKERFCMKLARGDYS+ALSCISNH+SSD HHFSE  WLNLLKEKRFPKDTVIQLI KVS
Sbjct: 841 LLKERFCMKLARGDYSEALSCISNHDSSDVHHFSESGWLNLLKEKRFPKDTVIQLINKVS 900

Query: 901 MLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNETQSAAVMHI 953
           MLLTRND PNPVF+NLLLSCKEFCRTR SVADHRLEETVCTNETQSAAV+ I
Sbjct: 901 MLLTRNDLPNPVFKNLLLSCKEFCRTRISVADHRLEETVCTNETQSAAVVRI 918

BLAST of ClCG01G008850.2 vs. NCBI nr
Match: XP_031741862.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus] >XP_031741863.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus] >KGN65965.1 hypothetical protein Csa_023210 [Cucumis sativus])

HSP 1 Score: 1567.4 bits (4057), Expect = 0.0e+00
Identity = 792/941 (84.17%), Postives = 841/941 (89.37%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMAN+NLCIPNCER GFP LHCT N HN F  SFFPSS+SGTD ++ D KNRVLRH
Sbjct: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           R HKCG+IKA SNGESDI LPSGNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q DD NK
Sbjct: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
           LTMKEN SAKSAESTSISKIDNGKNK TDVQ +VDVKNMFKRVD+KDLFNNTERI   +D
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           LSGNKFD +RK VTRSND+VKGK+TPF S  +DKQHE+KRN NWSS IEP+V RS ++K 
Sbjct: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKP 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKI--WDDDDTKPAKDILKAGKFGVQLERNYIPG 300
           I+FKANTL+VK+E+ RV D + MK SEKI  W DDD KPAK +LKAGK+G+QLER+Y PG
Sbjct: 241 IHFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPG 300

Query: 301 EKVGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEER 360
           +KVGRKK EQSYRG S SGK+FLEF +++SLEVEHAAFNNFDA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360

Query: 361 IQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ 420
           IQMLSKRLNG++IDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420

Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQA 480
           MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480

Query: 481 GYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
           GYMRELFDVIDSMR PPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540

Query: 541 QELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECG 600
           QELKKQ LQPSTSTYGLVM                                  EVMLECG
Sbjct: 541 QELKKQSLQPSTSTYGLVM----------------------------------EVMLECG 600

Query: 601 KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYD 660
           KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENME RGIVGSAALYYD
Sbjct: 601 KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYD 660

Query: 661 FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFC 720
           FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMK FC
Sbjct: 661 FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFC 720

Query: 721 SPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDA 780
           SPNLVTYNI+LKGYLEHGMFEEARELFQNLSE  RNISTVSDYRDRVLPDIYMFNTMLDA
Sbjct: 721 SPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDA 780

Query: 781 SFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPP 840
           SFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+QADR PP
Sbjct: 781 SFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPP 840

Query: 841 PALLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYK 900
           P LLKERFCMKLARGDYS+ALS I +HNS DAHHFSE AWLNLLKEKRFP+DTVI+LI+K
Sbjct: 841 PPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHFSESAWLNLLKEKRFPRDTVIELIHK 900

Query: 901 VSMLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETV 940
           V M+LTRN+SPNPVF+NLLLSCKEFCRTR S+ADHRLEETV
Sbjct: 901 VGMVLTRNESPNPVFKNLLLSCKEFCRTRISLADHRLEETV 906

BLAST of ClCG01G008850.2 vs. NCBI nr
Match: XP_008459122.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis melo])

HSP 1 Score: 1550.4 bits (4013), Expect = 0.0e+00
Identity = 788/947 (83.21%), Postives = 834/947 (88.07%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSIS--GTDLNIGDVKNRVL 60
           MVGVIMANVNL IPNCER GFP LHCT N H  F  SFFPSS+S  GTDLN  D KNRVL
Sbjct: 1   MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60

Query: 61  RHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDL 120
           RHR HKCG+IKA SNGESDI LP+GNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q D  
Sbjct: 61  RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120

Query: 121 NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRR 180
           NKLTMKEN SAKSAESTSISKIDNGKNK TDVQ +V+VKNMFKRVD+KDLFNNTERI R 
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180

Query: 181 RDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNE 240
           + LSGNKFD + KGVTRSND+VKGK+TPF S  +DKQHE+K+N NWSS IEPKV RS  E
Sbjct: 181 KHLSGNKFD-RSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCE 240

Query: 241 KLINFKANTLDVKRENRRVCDQSPMKISEKI--WDDDDTKPAKDILKAGKFGVQLERNYI 300
           K I+FKAN L+ K+E  RV   + MK SEKI  W +DD KPAKD+LKAGK+G+QLER+Y 
Sbjct: 241 KPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYS 300

Query: 301 PGEKVGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEME 360
           PG+KVGRKK EQSYRG S SGK+FLEF +E+SLEVEHAAFNNFDALDIMDKPRVSKMEME
Sbjct: 301 PGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEME 360

Query: 361 ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420
           ERIQMLSKRLNG++IDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW
Sbjct: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420

Query: 421 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLG 480
           LQMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLG
Sbjct: 421 LQMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480

Query: 481 QAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540
           QAGYMRELFDVIDSMR PPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW
Sbjct: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540

Query: 541 VLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLE 600
           VLQELKKQGLQPSTSTYGLVM                                  EVMLE
Sbjct: 541 VLQELKKQGLQPSTSTYGLVM----------------------------------EVMLE 600

Query: 601 CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALY 660
           CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENME RG+VGSAALY
Sbjct: 601 CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEMRGVVGSAALY 660

Query: 661 YDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKT 720
           YDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVY+FN MK 
Sbjct: 661 YDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYVFNQMKA 720

Query: 721 FCSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTML 780
           FCSPNLVTYNI+LKGYLEHGMFEEAREL QNLSE  +NISTVSDYRDRVLPDIYMFNTML
Sbjct: 721 FCSPNLVTYNILLKGYLEHGMFEEARELLQNLSEQRQNISTVSDYRDRVLPDIYMFNTML 780

Query: 781 DASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRI 840
           DASFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+QADR 
Sbjct: 781 DASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARVGKDELLETTWKHLAQADRT 840

Query: 841 PPPALLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLI 900
           PPP LLKERFCMK+ARGDY++AL CISNHNS DAHHFSE AWLNLLKEKRFPKDTVI+LI
Sbjct: 841 PPPPLLKERFCMKVARGDYTEALRCISNHNSGDAHHFSESAWLNLLKEKRFPKDTVIELI 900

Query: 901 YKVSMLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNE 944
           +KV M+   N+SPNPVF+NLLLSCKEFCRTR SVADHRLEETV TNE
Sbjct: 901 HKVGMVFATNESPNPVFKNLLLSCKEFCRTRISVADHRLEETVHTNE 912

BLAST of ClCG01G008850.2 vs. NCBI nr
Match: XP_038894405.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X2 [Benincasa hispida])

HSP 1 Score: 1528.8 bits (3957), Expect = 0.0e+00
Identity = 784/952 (82.35%), Postives = 823/952 (86.45%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMANVNLCIP+CERNGFPALHCTQN HNFFGFSFFPSS+SG DLN GD K+RVLRH
Sbjct: 1   MVGVIMANVNLCIPSCERNGFPALHCTQNSHNFFGFSFFPSSVSGPDLNFGDAKHRVLRH 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           R HKCG+IKASSNGESDIRLPS NLLENDF FKPSFDEYVRVMETVRTRRYK+QSDD NK
Sbjct: 61  RVHKCGSIKASSNGESDIRLPSENLLENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
           LTMKENAS KSAE TSISKIDNGKNK TDVQ +VDVKNMFKRVDRKDLFNNTERITR RD
Sbjct: 121 LTMKENASVKSAEITSISKIDNGKNKVTDVQGNVDVKNMFKRVDRKDLFNNTERITRERD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           LSGNK DSKRKG++RSNDEVKGKVTPF SQ +DKQHE+KRN N S+  EPKVPR YNEK 
Sbjct: 181 LSGNKIDSKRKGISRSNDEVKGKVTPFDSQVNDKQHEEKRNINRSNYTEPKVPRLYNEKR 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGKFGVQLERNYIPGEK 300
           INFKANTLD+KRE+ R  + S M+IS KIW +DDTKPAKDIL A K+ VQLERNYI G+K
Sbjct: 241 INFKANTLDIKRESHRASNGSSMRISGKIWANDDTKPAKDILNAVKYSVQLERNYISGDK 300

Query: 301 VGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
           VGRKK EQSYR  SKSGK+FLEF ++SSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRESSKSGKRFLEFTEDSSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360

Query: 361 MLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
           ML KRLNG++IDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR
Sbjct: 361 MLCKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420

Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
           ERFKSHKL      + +  G  +                                  AGY
Sbjct: 421 ERFKSHKL------SEETCGGTQ-----------------------------FIPCNAGY 480

Query: 481 MRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
           M+ELFDVIDSMR PPKKKFKTG LEKWDPRL+PDIVIYNAVLNACVKRKNLEGAFWVLQE
Sbjct: 481 MKELFDVIDSMRSPPKKKFKTGVLEKWDPRLEPDIVIYNAVLNACVKRKNLEGAFWVLQE 540

Query: 541 LKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECGKY 600
           LKKQGLQPSTSTYGLVM                                  EVMLECGKY
Sbjct: 541 LKKQGLQPSTSTYGLVM----------------------------------EVMLECGKY 600

Query: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660
           NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA
Sbjct: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660

Query: 661 RCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSP 720
           RCLCSAGRCKEALMQMEKICKVA KPLVVTYTGLIQACLDSKD++SAVYIFNHMKTFCSP
Sbjct: 661 RCLCSAGRCKEALMQMEKICKVATKPLVVTYTGLIQACLDSKDIRSAVYIFNHMKTFCSP 720

Query: 721 NLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780
           NLVTYN++LKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF
Sbjct: 721 NLVTYNMLLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780

Query: 781 AEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPA 840
           AEKRWDDFGYF++QMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHL+QADR PPP 
Sbjct: 781 AEKRWDDFGYFYDQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLAQADRTPPPP 840

Query: 841 LLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVS 900
           LLKERFCMKLARGDYS+ALSCISNH+SSD HHFSE  WLNLLKEKRFPKDTVIQLI KVS
Sbjct: 841 LLKERFCMKLARGDYSEALSCISNHDSSDVHHFSESGWLNLLKEKRFPKDTVIQLINKVS 883

Query: 901 MLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNETQSAAVMHI 953
           MLLTRND PNPVF+NLLLSCKEFCRTR SVADHRLEETVCTNETQSAAV+ I
Sbjct: 901 MLLTRNDLPNPVFKNLLLSCKEFCRTRISVADHRLEETVCTNETQSAAVVRI 883

BLAST of ClCG01G008850.2 vs. NCBI nr
Match: KAG7019446.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1520.8 bits (3936), Expect = 0.0e+00
Identity = 771/952 (80.99%), Postives = 836/952 (87.82%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMAN NLCIP CE NGFPAL+CTQN H   GFSFFPSS+SG+ LN G  K+RVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           RGHKCGAIKASS GESDI+L SGNLLE DF FKPSFDEYVRVME+VR+RRYK+QSDD NK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
             MKENASAKSAESTSIS I       TDVQ ++DVKN    VD +DLF+N+E+ITR+ D
Sbjct: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           LSGNKFDSKRKGVTRS DE+KGKVTPF SQ +DKQHE+KRN NWS+ IEPK  RS ++K 
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHEEKRNGNWSNYIEPKATRSNHDKR 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGKFGVQLERNYIPGEK 300
           ++FKANTLDVK E+  V   S MKIS+KIW DDDTKP KD+LK GK+GVQLE NYIPG+K
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISDKIWADDDTKPTKDVLKVGKYGVQLEGNYIPGDK 300

Query: 301 VGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
           VGRKK EQSYRG SKSGK+F EF +ESSLEVEHAAFN+FDA DIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSFDAEDIMDKPRVSKMEMEERIQ 360

Query: 361 MLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
           MLSKRLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420

Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
           ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480

Query: 481 MRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
           MRELFDVIDSMR PPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540

Query: 541 LKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECGKY 600
           LK+QGLQPST+TYGLVM                                  EVML+CGKY
Sbjct: 541 LKEQGLQPSTTTYGLVM----------------------------------EVMLQCGKY 600

Query: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660
           NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAI+ ME+RGIVGSAALYYDFA
Sbjct: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFA 660

Query: 661 RCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSP 720
           RCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSP
Sbjct: 661 RCLCSAGRCEEALMQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSP 720

Query: 721 NLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780
           NLVT NI+LKGYL+HGMF+EA+ELFQN+SE+GRNIS VSDYRDRVLPDIY FNTMLDASF
Sbjct: 721 NLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASF 780

Query: 781 AEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPA 840
           AEKRWDDF +F+NQMLLYGYHFNPKRHLRMI+EAAR GKDELLETTWKHL+QADR  PP 
Sbjct: 781 AEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPP 840

Query: 841 LLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVS 900
           L+KERFC+ LARGDYS+ALSCIS H+SSD HHFS+ AWLNLLKEKRFPKD+VI+LI+KVS
Sbjct: 841 LIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVS 900

Query: 901 MLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNETQSAAVMHI 953
           MLL RNDSPNPV +NLLLS KEFCR+R +VAD RLEE VCTNE+QSA VMH+
Sbjct: 901 MLLARNDSPNPVLQNLLLSGKEFCRSRITVADPRLEEVVCTNESQSATVMHV 910

BLAST of ClCG01G008850.2 vs. ExPASy Swiss-Prot
Match: Q9SA76 (Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2279 PE=3 SV=1)

HSP 1 Score: 712.2 bits (1837), Expect = 7.7e-204
Identity = 420/912 (46.05%), Postives = 558/912 (61.18%), Query Frame = 0

Query: 62   GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNKL 121
            G    A+K S +GES + +P     +  F  + S  EY R  +T R      + D+L+ +
Sbjct: 172  GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLV 231

Query: 122  T--------MKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTE 181
                      K+   +KS ES+   K  N       + +D   +  + + +         
Sbjct: 232  VEERRVQRIAKDARWSKSRESSVAVKWSNSGESSVTMPKDESFRRRYSKQEHH------- 291

Query: 182  RITRRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVP 241
               R  D S        +G+ R +   KG       +    Q   K  R WS   E  VP
Sbjct: 292  ---RSSDTS--------RGIARGS---KGDELELVVEERRVQRIAKDVR-WSKSDESLVP 351

Query: 242  RSYNEKLINFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGK-FGVQLE 301
             S +E     + N        +RV D S                 + I +  K  G+ L 
Sbjct: 352  VSEDESF--RRGNPKQEMVRYQRVSDTS-----------------RGIERGSKGDGLDLL 411

Query: 302  RNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKP 361
                  E++  +++E       G  + G +  +   +S   +E  AF   D + DI+DKP
Sbjct: 412  AEERRIERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKP 471

Query: 362  RVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWR 421
              S++EME+RI+ L+K LNG++I+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWR
Sbjct: 472  ATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWR 531

Query: 422  RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAY 481
            RVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VAY
Sbjct: 532  RVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAY 591

Query: 482  HSIAVTLGQAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR 541
             SIAVTLGQAG+++ELF VID+MR PPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+R
Sbjct: 592  RSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQR 651

Query: 542  KNLEGAFWVLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVK 601
            K  EGAFWVLQ+LK++G +PS  TYGL+M                               
Sbjct: 652  KQWEGAFWVLQQLKQRGQKPSPVTYGLIM------------------------------- 711

Query: 602  GYYEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRG 661
               EVML C KYNLVHEFFRK+QKSSIPNAL Y+VLVNTLWKEGK+DEAV  +E+ME RG
Sbjct: 712  ---EVMLACEKYNLVHEFFRKMQKSSIPNALAYRVLVNTLWKEGKSDEAVHTVEDMESRG 771

Query: 662  IVGSAALYYDFARCLCSAGRCKEAL----------------------------MQMEKIC 721
            IVGSAALYYD ARCLCSAGRC E L                             Q++KIC
Sbjct: 772  IVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPVVLKLIENLIYKADLVHTIQFQLKKIC 831

Query: 722  KVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNIVLKGYLEHGMFEE 781
            +VANKPLVVTYTGLIQAC+DS ++++A YIF+ MK  CSPNLVT NI+LK YL+ G+FEE
Sbjct: 832  RVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKAYLQGGLFEE 891

Query: 782  ARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGY 841
            ARELFQ +SE G +I   SD+  RVLPD Y FNTMLD    +++WDDFGY + +ML +GY
Sbjct: 892  ARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYAYREMLRHGY 951

Query: 842  HFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPALLKERFCMKLARGDYSDALS 901
            HFN KRHLRM+LEA+RAGK+E++E TW+H+ +++RIPP  L+KERF  KL +GD+  A+S
Sbjct: 952  HFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLEKGDHISAIS 1002

Query: 902  CISNHN----SSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVSMLL-TRNDSPNPVFEN 928
             +++ N     ++   FS  AW  +L   RF +D+V++L+  V+  L +R++S + V  N
Sbjct: 1012 SLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRSESSDSVLGN 1002

BLAST of ClCG01G008850.2 vs. ExPASy Swiss-Prot
Match: Q9FJW6 (Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DG1 PE=1 SV=2)

HSP 1 Score: 383.3 bits (983), Expect = 8.2e-105
Identity = 217/589 (36.84%), Postives = 335/589 (56.88%), Query Frame = 0

Query: 357 ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 416
           E +++L  RL+G  I+   W F +MM  + +++++  +L+++  LG+  +W++   V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 417 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLG 476
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 477 QAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 536
           QAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV     +   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 537 VLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLE 596
           V  EL+K GL+P+ +TYGL M                                  EVMLE
Sbjct: 363 VFVELRKNGLRPNGATYGLAM----------------------------------EVMLE 422

Query: 597 CGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAAL 656
            GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+ +ME++G++G+ ++
Sbjct: 423 SGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALWREGKIEEAVEAVRDMEQKGVIGTGSV 482

Query: 657 YYDFARCLCSAGRCKEALMQMEKICKVAN-KPLVVTYTGLIQACLDSKDLQSAVYIFNHM 716
           YY+ A CLC+ GR  +A++++ ++ ++ N +PL +T+TGLI A L+   +   + IF +M
Sbjct: 483 YYELACCLCNNGRWCDAMLEVGRMKRLENCRPLEITFTGLIAASLNGGHVDDCMAIFQYM 542

Query: 717 KTFCSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNT 776
           K  C PN+ T N++LK Y  + MF EA+ELF+ +         VS     ++P+ Y ++ 
Sbjct: 543 KDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI---------VSRKETHLVPNEYTYSF 602

Query: 777 MLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQAD 836
           ML+AS    +W+ F + +  M+L GY  +  +H  M++EA+RAGK  LLE  +  + +  
Sbjct: 603 MLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHASMLIEASRAGKWSLLEHAFDAVLEDG 662

Query: 837 RIPPPALLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQ 896
            IP P    E  C   A+GD+  A++ I N  +  +   SE  W +L +E    +D + Q
Sbjct: 663 EIPHPLFFTELLCHATAKGDFQRAITLI-NTVALASFQISEEEWTDLFEEH---QDWLTQ 722

Query: 897 -LIYKVSMLLTRND-SPNPVFENLLLSCKEFCRTRNSVADHRLEETVCT 942
             ++K+S  L   D    P   NL  S K  C + +S A   L   V T
Sbjct: 723 DNLHKLSDHLIECDYVSEPTVSNLSKSLKSRCGSSSSSAQPLLAVDVTT 724

BLAST of ClCG01G008850.2 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 2.5e-21
Identity = 96/431 (22.27%), Postives = 180/431 (41.76%), Query Frame = 0

Query: 432 YTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSM 491
           Y+  +D L K  R +EA  +F +M +     P++  Y ++       G + E+  ++D M
Sbjct: 304 YSLLMDYLCKNGRCMEARKIFDSMTKR-GLKPEITTYGTLLQGYATKGALVEMHGLLDLM 363

Query: 492 RFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTS 551
                   + G        + PD  +++ ++ A  K+  ++ A  V  ++++QGL P+  
Sbjct: 364 -------VRNG--------IHPDHYVFSILICAYAKQGKVDQAMLVFSKMRQQGLNPNAV 423

Query: 552 TYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLV---LLVKGYYEVMLECGKYNLVHEFFR 611
           TYG V+       G   D   +Y+   +  GL    ++       +  C K+    E   
Sbjct: 424 TYGAVIGILCKS-GRVED-AMLYFEQMIDEGLSPGNIVYNSLIHGLCTCNKWERAEELIL 483

Query: 612 KVQKSSI-PNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAG 671
           ++    I  N + +  ++++  KEG+  E+    E M R G+  +   Y       C AG
Sbjct: 484 EMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFELMVRIGVKPNVITYNTLINGYCLAG 543

Query: 672 RCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTF-CSPNLVTYN 731
           +  EA+  +  +  V  KP  VTY+ LI        ++ A+ +F  M++   SP+++TYN
Sbjct: 544 KMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRMEDALVLFKEMESSGVSPDIITYN 603

Query: 732 IVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWD 791
           I+L+G  +      A+EL+  ++E G  I            ++  +N +L      K  D
Sbjct: 604 IILQGLFQTRRTAAAKELYVRITESGTQI------------ELSTYNIILHGLCKNKLTD 663

Query: 792 DFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDE-------------LLETTWKHLSQA 845
           D    F  + L       +    MI    + G+++             L+   W +   A
Sbjct: 664 DALQMFQNLCLMDLKLEARTFNIMIDALLKVGRNDEAKDLFVAFSSNGLVPNYWTYRLMA 704

BLAST of ClCG01G008850.2 vs. ExPASy Swiss-Prot
Match: Q9SSR4 (Pentatricopeptide repeat-containing protein At1g52620 OS=Arabidopsis thaliana OX=3702 GN=At1g52620 PE=2 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 2.4e-19
Identity = 96/407 (23.59%), Postives = 173/407 (42.51%), Query Frame = 0

Query: 447 EALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSM--RFPPKKKFKTGAL 506
           +A+ ++  + + + S PD++A +S+   L ++  + +   V D M  R      + T  L
Sbjct: 152 KAVEIYDYVVELYDSVPDVIACNSLLSLLVKSRRLGDARKVYDEMCDRGDSVDNYSTCIL 211

Query: 507 EK------------------WDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGL 566
            K                  W     P+IV YN ++    K  ++E A+ V +ELK +G 
Sbjct: 212 VKGMCNEGKVEVGRKLIEGRWGKGCIPNIVFYNTIIGGYCKLGDIENAYLVFKELKLKGF 271

Query: 567 QPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLG----RGL---VLLVKGYYEVMLECG- 626
            P+  T+G ++       G+C + +++     L     RGL   V  +    +     G 
Sbjct: 272 MPTLETFGTMIN------GFCKEGDFVASDRLLSEVKERGLRVSVWFLNNIIDAKYRHGY 331

Query: 627 KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYD 686
           K +        +     P+  TY +L+N L KEGK + AV  ++   ++G++ +   Y  
Sbjct: 332 KVDPAESIGWIIANDCKPDVATYNILINRLCKEGKKEVAVGFLDEASKKGLIPNNLSYAP 391

Query: 687 FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYI-FNHMKTF 746
             +  C +     A   + ++ +   KP +VTY  LI   + S  +  AV +    +   
Sbjct: 392 LIQAYCKSKEYDIASKLLLQMAERGCKPDIVTYGILIHGLVVSGHMDDAVNMKVKLIDRG 451

Query: 747 CSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLD 806
            SP+   YN+++ G  + G F  A+ LF  + +  RNI          LPD Y++ T++D
Sbjct: 452 VSPDAAIYNMLMSGLCKTGRFLPAKLLFSEMLD--RNI----------LPDAYVYATLID 511

Query: 807 ASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGK-DELL 824
                  +D+    F+  +  G   +   H  MI    R+G  DE L
Sbjct: 512 GFIRSGDFDEARKVFSLSVEKGVKVDVVHHNAMIKGFCRSGMLDEAL 540

BLAST of ClCG01G008850.2 vs. ExPASy Swiss-Prot
Match: P0C894 (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana OX=3702 GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 3.1e-19
Identity = 78/334 (23.35%), Postives = 140/334 (41.92%), Query Frame = 0

Query: 467 AYHSIAVTLGQAGYMRELFDVIDSMRFPPKKKFKTGALEKW-----------------DP 526
           A  S+ + LG      + F  +   R  PK +   G L ++                   
Sbjct: 197 ALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGA 256

Query: 527 RLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMD 586
             +P +  YN +++   K  ++E A  + +E+K +GL P T TY       +DGFG    
Sbjct: 257 GARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTY----NSMIDGFGKVGR 316

Query: 587 MEWMYWVLSLGRGL-----VLLVKGYYEVMLECGKYNLVHEFFRKVQKSSI-PNALTYKV 646
           ++         + +     V+          + GK  +  EF+R+++ + + PN ++Y  
Sbjct: 317 LDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLKPNVVSYST 376

Query: 647 LVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVA 706
           LV+   KEG   +A+    +M R G+V +   Y       C  G   +A     ++ +V 
Sbjct: 377 LVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLIDANCKIGNLSDAFRLGNEMLQVG 436

Query: 707 NKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTF-CSPNLVTYNIVLKGYLEHGMFEEAR 766
            +  VVTYT LI    D++ ++ A  +F  M T    PNL +YN ++ G+++    + A 
Sbjct: 437 VEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIPNLASYNALIHGFVKAKNMDRAL 496

Query: 767 ELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTML 777
           EL   L   G            + PD+ ++ T +
Sbjct: 497 ELLNELKGRG------------IKPDLLLYGTFI 514

BLAST of ClCG01G008850.2 vs. ExPASy TrEMBL
Match: A0A0A0LVN7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1)

HSP 1 Score: 1567.4 bits (4057), Expect = 0.0e+00
Identity = 792/941 (84.17%), Postives = 841/941 (89.37%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMAN+NLCIPNCER GFP LHCT N HN F  SFFPSS+SGTD ++ D KNRVLRH
Sbjct: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           R HKCG+IKA SNGESDI LPSGNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q DD NK
Sbjct: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
           LTMKEN SAKSAESTSISKIDNGKNK TDVQ +VDVKNMFKRVD+KDLFNNTERI   +D
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           LSGNKFD +RK VTRSND+VKGK+TPF S  +DKQHE+KRN NWSS IEP+V RS ++K 
Sbjct: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKP 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKI--WDDDDTKPAKDILKAGKFGVQLERNYIPG 300
           I+FKANTL+VK+E+ RV D + MK SEKI  W DDD KPAK +LKAGK+G+QLER+Y PG
Sbjct: 241 IHFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPG 300

Query: 301 EKVGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEER 360
           +KVGRKK EQSYRG S SGK+FLEF +++SLEVEHAAFNNFDA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360

Query: 361 IQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ 420
           IQMLSKRLNG++IDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420

Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQA 480
           MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480

Query: 481 GYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
           GYMRELFDVIDSMR PPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540

Query: 541 QELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECG 600
           QELKKQ LQPSTSTYGLVM                                  EVMLECG
Sbjct: 541 QELKKQSLQPSTSTYGLVM----------------------------------EVMLECG 600

Query: 601 KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYD 660
           KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENME RGIVGSAALYYD
Sbjct: 601 KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYD 660

Query: 661 FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFC 720
           FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMK FC
Sbjct: 661 FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFC 720

Query: 721 SPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDA 780
           SPNLVTYNI+LKGYLEHGMFEEARELFQNLSE  RNISTVSDYRDRVLPDIYMFNTMLDA
Sbjct: 721 SPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDA 780

Query: 781 SFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPP 840
           SFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+QADR PP
Sbjct: 781 SFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPP 840

Query: 841 PALLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYK 900
           P LLKERFCMKLARGDYS+ALS I +HNS DAHHFSE AWLNLLKEKRFP+DTVI+LI+K
Sbjct: 841 PPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHFSESAWLNLLKEKRFPRDTVIELIHK 900

Query: 901 VSMLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETV 940
           V M+LTRN+SPNPVF+NLLLSCKEFCRTR S+ADHRLEETV
Sbjct: 901 VGMVLTRNESPNPVFKNLLLSCKEFCRTRISLADHRLEETV 906

BLAST of ClCG01G008850.2 vs. ExPASy TrEMBL
Match: A0A1S3C8Z0 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498323 PE=4 SV=1)

HSP 1 Score: 1550.4 bits (4013), Expect = 0.0e+00
Identity = 788/947 (83.21%), Postives = 834/947 (88.07%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSIS--GTDLNIGDVKNRVL 60
           MVGVIMANVNL IPNCER GFP LHCT N H  F  SFFPSS+S  GTDLN  D KNRVL
Sbjct: 1   MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60

Query: 61  RHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDL 120
           RHR HKCG+IKA SNGESDI LP+GNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q D  
Sbjct: 61  RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120

Query: 121 NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRR 180
           NKLTMKEN SAKSAESTSISKIDNGKNK TDVQ +V+VKNMFKRVD+KDLFNNTERI R 
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180

Query: 181 RDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNE 240
           + LSGNKFD + KGVTRSND+VKGK+TPF S  +DKQHE+K+N NWSS IEPKV RS  E
Sbjct: 181 KHLSGNKFD-RSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCE 240

Query: 241 KLINFKANTLDVKRENRRVCDQSPMKISEKI--WDDDDTKPAKDILKAGKFGVQLERNYI 300
           K I+FKAN L+ K+E  RV   + MK SEKI  W +DD KPAKD+LKAGK+G+QLER+Y 
Sbjct: 241 KPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYS 300

Query: 301 PGEKVGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEME 360
           PG+KVGRKK EQSYRG S SGK+FLEF +E+SLEVEHAAFNNFDALDIMDKPRVSKMEME
Sbjct: 301 PGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEME 360

Query: 361 ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420
           ERIQMLSKRLNG++IDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW
Sbjct: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420

Query: 421 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLG 480
           LQMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLG
Sbjct: 421 LQMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480

Query: 481 QAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540
           QAGYMRELFDVIDSMR PPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW
Sbjct: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540

Query: 541 VLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLE 600
           VLQELKKQGLQPSTSTYGLVM                                  EVMLE
Sbjct: 541 VLQELKKQGLQPSTSTYGLVM----------------------------------EVMLE 600

Query: 601 CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALY 660
           CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENME RG+VGSAALY
Sbjct: 601 CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEMRGVVGSAALY 660

Query: 661 YDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKT 720
           YDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVY+FN MK 
Sbjct: 661 YDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYVFNQMKA 720

Query: 721 FCSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTML 780
           FCSPNLVTYNI+LKGYLEHGMFEEAREL QNLSE  +NISTVSDYRDRVLPDIYMFNTML
Sbjct: 721 FCSPNLVTYNILLKGYLEHGMFEEARELLQNLSEQRQNISTVSDYRDRVLPDIYMFNTML 780

Query: 781 DASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRI 840
           DASFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+QADR 
Sbjct: 781 DASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARVGKDELLETTWKHLAQADRT 840

Query: 841 PPPALLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLI 900
           PPP LLKERFCMK+ARGDY++AL CISNHNS DAHHFSE AWLNLLKEKRFPKDTVI+LI
Sbjct: 841 PPPPLLKERFCMKVARGDYTEALRCISNHNSGDAHHFSESAWLNLLKEKRFPKDTVIELI 900

Query: 901 YKVSMLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNE 944
           +KV M+   N+SPNPVF+NLLLSCKEFCRTR SVADHRLEETV TNE
Sbjct: 901 HKVGMVFATNESPNPVFKNLLLSCKEFCRTRISVADHRLEETVHTNE 912

BLAST of ClCG01G008850.2 vs. ExPASy TrEMBL
Match: A0A6J1KEH7 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111495096 PE=4 SV=1)

HSP 1 Score: 1513.8 bits (3918), Expect = 0.0e+00
Identity = 772/952 (81.09%), Postives = 833/952 (87.50%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMAN NLCIP CE NGF AL+CTQN H   G SFFPSS+SG+ LN G  K+RVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFSALYCTQNSHYLLGLSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           RGHKCGAIKASS GESDI+L SGNLLE DF FKPSFDEYVRVME+VR+RRYK+QSDD NK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
             MKENASAKSAESTSIS I       TDVQ ++DVKN    VD +DLF+N+ERITR+ D
Sbjct: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKNKVVYVDGEDLFDNSERITRKTD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           LSGNKFDSKRKGVTRS DE+KGKVTPF SQ +DKQHE+KRN NWS+ IEPKV RS ++K 
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQINDKQHEEKRNGNWSNYIEPKVTRSNHDKR 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGKFGVQLERNYIPGEK 300
           ++FKANTLDVK E+  V   S MKISEKIW DDD KP KD+LK GK+GVQL+ NYIPG+K
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISEKIWADDDIKPTKDVLKVGKYGVQLKGNYIPGDK 300

Query: 301 VGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
           VGRKK EQSYRG SKSGK+F EF +ESSLEVEHAAFN+ DA DIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAADIMDKPRVSKMEMEERIQ 360

Query: 361 MLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
           MLSKRLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420

Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
           ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480

Query: 481 MRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
           MRELFDVIDSMR PPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540

Query: 541 LKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECGKY 600
           LK+QGLQPST+TYGLVM                                  EVML+CGKY
Sbjct: 541 LKEQGLQPSTTTYGLVM----------------------------------EVMLQCGKY 600

Query: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660
           NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAI+ ME+RGIVGSAALYYDFA
Sbjct: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFA 660

Query: 661 RCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSP 720
           RCLCSAGR +EALMQMEKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSP
Sbjct: 661 RCLCSAGRWEEALMQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSP 720

Query: 721 NLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780
           NLVT NI+LKGYL+HGMF EA+ELFQN+SE+GRNIS VSDYRDRVLPDIY FNTMLDASF
Sbjct: 721 NLVTCNILLKGYLDHGMFNEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASF 780

Query: 781 AEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPA 840
           AEKRWDDF +F+NQMLLYGYHFNPKRHLRMI+EAAR GKDELLETTWKHL+QADRI PP 
Sbjct: 781 AEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRILPPP 840

Query: 841 LLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVS 900
           L+KERFC+ LARGDYS+ALSCIS H+SSD HHFS+ AWLNLLKEKRFPKD+VIQLI+KVS
Sbjct: 841 LIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLLKEKRFPKDSVIQLIHKVS 900

Query: 901 MLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNETQSAAVMHI 953
           MLL RNDSPNPV +NLLLS KEFCR+R SVAD RLEE VCTNE+QSAAVMH+
Sbjct: 901 MLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTNESQSAAVMHV 910

BLAST of ClCG01G008850.2 vs. ExPASy TrEMBL
Match: A0A6J1EH18 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111434226 PE=4 SV=1)

HSP 1 Score: 1510.7 bits (3910), Expect = 0.0e+00
Identity = 767/952 (80.57%), Postives = 832/952 (87.39%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMAN NLCIP CE NGFPAL+CTQN H   GFS FPSS+SG+ LN G  K+RVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSVFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           RGHKCGAIKASS GESDI+L SGNLLE DF FKPSFDEYVRVME+VR+RRYK+QSDD NK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
             MKENASAKSAEST IS I       TDVQ ++DVKN    VD +DLF+N+E+ITR+ D
Sbjct: 121 --MKENASAKSAESTFISNI------VTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           LSGNKFDSKRKGVTRS DE+KGKVTPF SQ +DKQHE+KRN NWS+ IEPK  RS ++K 
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFESQVNDKQHEEKRNGNWSNYIEPKATRSNHDKR 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGKFGVQLERNYIPGEK 300
           ++FKANTLDVK E+  V   S MKIS+KIW DDD+KP KD+LK GK+GVQLE NYIPG+K
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISDKIWADDDSKPTKDVLKVGKYGVQLEGNYIPGDK 300

Query: 301 VGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
           VGRKK EQSYRG SKSGK+F EF +ESSLEVEHAAFN+ DA DIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQ 360

Query: 361 MLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
           MLS RLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSNRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420

Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
           ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480

Query: 481 MRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
           MRELFDVIDSMR PPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540

Query: 541 LKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECGKY 600
           LK+QGLQPST+TYGLVM                                  EVML+CGKY
Sbjct: 541 LKEQGLQPSTTTYGLVM----------------------------------EVMLQCGKY 600

Query: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660
           NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAI+ ME+RGIVGSAALYYDFA
Sbjct: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFA 660

Query: 661 RCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSP 720
           RCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSP
Sbjct: 661 RCLCSAGRCEEALMQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSP 720

Query: 721 NLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780
           NLVT NI+LKGYL+HGMF+EA+ELFQN+SE+GRNIS VSDYRDRVLPDIY FNTMLDASF
Sbjct: 721 NLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASF 780

Query: 781 AEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPA 840
           AEKRWDDF +F+NQMLLYGYHFNPKRHLRMI+EAAR GKDELLETTWKHL+QADR  PP 
Sbjct: 781 AEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPP 840

Query: 841 LLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVS 900
           L+KERFC+ LARGDYS+ALSCIS H+SSD HHFS+ AWLNLLKEKRFPKD+VI+LI+KVS
Sbjct: 841 LIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVS 900

Query: 901 MLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNETQSAAVMHI 953
           MLL RNDSPNPV +NLLLS KEFCR+R SVAD RLEE VCTNE+QSA VMH+
Sbjct: 901 MLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTNESQSATVMHV 910

BLAST of ClCG01G008850.2 vs. ExPASy TrEMBL
Match: A0A6J1CLQ9 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111012614 PE=4 SV=1)

HSP 1 Score: 1504.6 bits (3894), Expect = 0.0e+00
Identity = 763/952 (80.15%), Postives = 825/952 (86.66%), Query Frame = 0

Query: 1   MVGVIMANVNLCIPNCERNGFPALHCTQNFHNFFGFSFFPSSISGTDLNIGDVKNRVLRH 60
           MVGVIMAN N+CIP CERNGF ALHCTQ+ HN FGFS FPS ISG  LN+G  KNR+ R+
Sbjct: 1   MVGVIMANANMCIPCCERNGFRALHCTQSSHNLFGFSLFPSPISGIGLNVGYEKNRIFRY 60

Query: 61  RGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNK 120
           RG+KCGAI+ SS GESDIRL +GN+LENDF FKPSFDEYVRVME+VRT RYKKQ DD NK
Sbjct: 61  RGNKCGAIRVSSKGESDIRLQNGNVLENDFLFKPSFDEYVRVMESVRTSRYKKQPDDPNK 120

Query: 121 LTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITRRRD 180
           L MKENASAKSAES+S+S+IDN K K TDVQ +VDVKNMFKRVD+K LFNN ER+TR++D
Sbjct: 121 LKMKENASAKSAESSSVSEIDNEKTKVTDVQGNVDVKNMFKRVDQKKLFNNAERVTRKKD 180

Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKL 240
           L  NKFD+KRKG+TR+ DE +GKVT F SQ +DKQHE++R RN    IEPKV R  NE L
Sbjct: 181 LLENKFDNKRKGITRTKDEFRGKVTHFDSQVNDKQHEEQRKRNRLDCIEPKVRRLNNEAL 240

Query: 241 INFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGKFGVQLERNYIPGEK 300
           +  KANTLD+KR+ +RVCD+S MK  E+IW D DTK AK  L+ GK GVQL RNY+PGEK
Sbjct: 241 VCSKANTLDIKRQRQRVCDESSMKTVERIWADGDTKLAKGDLEVGKSGVQLARNYVPGEK 300

Query: 301 VGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
           V  KK  QSY+G SKSGK F+E  +ESSLEVE AA NNFDALDIMDKPRVSKMEMEERIQ
Sbjct: 301 VSGKKTGQSYQGLSKSGKPFIESTEESSLEVERAALNNFDALDIMDKPRVSKMEMEERIQ 360

Query: 361 MLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
           MLSKRLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420

Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
           ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480

Query: 481 MRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
           MRELFDVIDSMR PPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540

Query: 541 LKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLECGKY 600
           LKKQGLQPSTSTYGLVM                                  EVMLECGKY
Sbjct: 541 LKKQGLQPSTSTYGLVM----------------------------------EVMLECGKY 600

Query: 601 NLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFA 660
           NLVHEFFRKVQ+SSIPNALTYKVLVNTL KEGKTDEAVLAI+NMERRGIVGSAALYYDFA
Sbjct: 601 NLVHEFFRKVQRSSIPNALTYKVLVNTLSKEGKTDEAVLAIQNMERRGIVGSAALYYDFA 660

Query: 661 RCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSP 720
           RCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGLIQACLDSK+L SAVYIFNHMK FCSP
Sbjct: 661 RCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLDSAVYIFNHMKAFCSP 720

Query: 721 NLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASF 780
           NLVTYNI+LKGYL+HGMFEEARELFQNLSE G++IST+SDY+DRVLPDIY FN MLDA F
Sbjct: 721 NLVTYNILLKGYLDHGMFEEARELFQNLSESGQSISTISDYKDRVLPDIYTFNIMLDAFF 780

Query: 781 AEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPA 840
           A KRWDDFGYF+NQM LYGYHFNPKRHLRMILEA RAGKDE+LETTWKHL+Q DR  PP 
Sbjct: 781 AVKRWDDFGYFYNQMFLYGYHFNPKRHLRMILEAGRAGKDEILETTWKHLAQTDRTLPPP 840

Query: 841 LLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVS 900
           L+KERFCMKLARGDYS+ALSCISNH+SSDAHHFSE AWLNLLKEK FPKDTVI LI+KVS
Sbjct: 841 LVKERFCMKLARGDYSEALSCISNHHSSDAHHFSESAWLNLLKEKGFPKDTVILLIHKVS 900

Query: 901 MLLTRNDSPNPVFENLLLSCKEFCRTRNSVADHRLEETVCTNETQSAAVMHI 953
           MLLT N  PNPVF+NLL SCKEFCRTR +VAD +LE+ VC +ETQSAAVMHI
Sbjct: 901 MLLTGNHPPNPVFQNLLSSCKEFCRTRITVADSKLEQIVCRDETQSAAVMHI 918

BLAST of ClCG01G008850.2 vs. TAIR 10
Match: AT1G30610.2 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 724.5 bits (1869), Expect = 1.1e-208
Identity = 419/884 (47.40%), Postives = 557/884 (63.01%), Query Frame = 0

Query: 62  GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNKL 121
           G    A+K S +GES + +P     +  F  + S  EY R  +T R      + D+L+ +
Sbjct: 172 GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLV 231

Query: 122 T--------MKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTE 181
                     K+   +KS ES+   K  N       + +D   +  + + +         
Sbjct: 232 VEERRVQRIAKDARWSKSRESSVAVKWSNSGESSVTMPKDESFRRRYSKQEHH------- 291

Query: 182 RITRRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVP 241
              R  D S        +G+ R +   KG       +    Q   K  R WS   E  VP
Sbjct: 292 ---RSSDTS--------RGIARGS---KGDELELVVEERRVQRIAKDVR-WSKSDESLVP 351

Query: 242 RSYNEKLINFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGK-FGVQLE 301
            S +E     + N        +RV D S                 + I +  K  G+ L 
Sbjct: 352 VSEDESF--RRGNPKQEMVRYQRVSDTS-----------------RGIERGSKGDGLDLL 411

Query: 302 RNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKP 361
                 E++  +++E       G  + G +  +   +S   +E  AF   D + DI+DKP
Sbjct: 412 AEERRIERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKP 471

Query: 362 RVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWR 421
             S++EME+RI+ L+K LNG++I+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWR
Sbjct: 472 ATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWR 531

Query: 422 RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAY 481
           RVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VAY
Sbjct: 532 RVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAY 591

Query: 482 HSIAVTLGQAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR 541
            SIAVTLGQAG+++ELF VID+MR PPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+R
Sbjct: 592 RSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQR 651

Query: 542 KNLEGAFWVLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVK 601
           K  EGAFWVLQ+LK++G +PS  TYGL+M                               
Sbjct: 652 KQWEGAFWVLQQLKQRGQKPSPVTYGLIM------------------------------- 711

Query: 602 GYYEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRG 661
              EVML C KYNLVHEFFRK+QKSSIPNAL Y+VLVNTLWKEGK+DEAV  +E+ME RG
Sbjct: 712 ---EVMLACEKYNLVHEFFRKMQKSSIPNALAYRVLVNTLWKEGKSDEAVHTVEDMESRG 771

Query: 662 IVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAV 721
           IVGSAALYYD ARCLCSAGRC E L  ++KIC+VANKPLVVTYTGLIQAC+DS ++++A 
Sbjct: 772 IVGSAALYYDLARCLCSAGRCNEGLNMLKKICRVANKPLVVTYTGLIQACVDSGNIKNAA 831

Query: 722 YIFNHMKTFCSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPD 781
           YIF+ MK  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE G +I   SD+  RVLPD
Sbjct: 832 YIFDQMKKVCSPNLVTCNIMLKAYLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPD 891

Query: 782 IYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWK 841
            Y FNTMLD    +++WDDFGY + +ML +GYHFN KRHLRM+LEA+RAGK+E++E TW+
Sbjct: 892 TYTFNTMLDTCAEQEKWDDFGYAYREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWE 951

Query: 842 HLSQADRIPPPALLKERFCMKLARGDYSDALSCISNHN----SSDAHHFSEPAWLNLLKE 901
           H+ +++RIPP  L+KERF  KL +GD+  A+S +++ N     ++   FS  AW  +L  
Sbjct: 952 HMRRSNRIPPSPLIKERFFRKLEKGDHISAISSLADLNGKIEETELRAFSTSAWSRVL-- 974

Query: 902 KRFPKDTVIQLIYKVSMLL-TRNDSPNPVFENLLLSCKEFCRTR 928
            RF +D+V++L+  V+  L +R++S + V  NLL SCK++ +TR
Sbjct: 1012 SRFEQDSVLRLMDDVNRRLGSRSESSDSVLGNLLSSCKDYLKTR 974

BLAST of ClCG01G008850.2 vs. TAIR 10
Match: AT1G30610.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 712.2 bits (1837), Expect = 5.5e-205
Identity = 420/912 (46.05%), Postives = 558/912 (61.18%), Query Frame = 0

Query: 62   GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQSDDLNKL 121
            G    A+K S +GES + +P     +  F  + S  EY R  +T R      + D+L+ +
Sbjct: 172  GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLV 231

Query: 122  T--------MKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTE 181
                      K+   +KS ES+   K  N       + +D   +  + + +         
Sbjct: 232  VEERRVQRIAKDARWSKSRESSVAVKWSNSGESSVTMPKDESFRRRYSKQEHH------- 291

Query: 182  RITRRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVP 241
               R  D S        +G+ R +   KG       +    Q   K  R WS   E  VP
Sbjct: 292  ---RSSDTS--------RGIARGS---KGDELELVVEERRVQRIAKDVR-WSKSDESLVP 351

Query: 242  RSYNEKLINFKANTLDVKRENRRVCDQSPMKISEKIWDDDDTKPAKDILKAGK-FGVQLE 301
             S +E     + N        +RV D S                 + I +  K  G+ L 
Sbjct: 352  VSEDESF--RRGNPKQEMVRYQRVSDTS-----------------RGIERGSKGDGLDLL 411

Query: 302  RNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKP 361
                  E++  +++E       G  + G +  +   +S   +E  AF   D + DI+DKP
Sbjct: 412  AEERRIERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKP 471

Query: 362  RVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWR 421
              S++EME+RI+ L+K LNG++I+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWR
Sbjct: 472  ATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWR 531

Query: 422  RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAY 481
            RVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VAY
Sbjct: 532  RVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAY 591

Query: 482  HSIAVTLGQAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR 541
             SIAVTLGQAG+++ELF VID+MR PPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+R
Sbjct: 592  RSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQR 651

Query: 542  KNLEGAFWVLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVK 601
            K  EGAFWVLQ+LK++G +PS  TYGL+M                               
Sbjct: 652  KQWEGAFWVLQQLKQRGQKPSPVTYGLIM------------------------------- 711

Query: 602  GYYEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRG 661
               EVML C KYNLVHEFFRK+QKSSIPNAL Y+VLVNTLWKEGK+DEAV  +E+ME RG
Sbjct: 712  ---EVMLACEKYNLVHEFFRKMQKSSIPNALAYRVLVNTLWKEGKSDEAVHTVEDMESRG 771

Query: 662  IVGSAALYYDFARCLCSAGRCKEAL----------------------------MQMEKIC 721
            IVGSAALYYD ARCLCSAGRC E L                             Q++KIC
Sbjct: 772  IVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPVVLKLIENLIYKADLVHTIQFQLKKIC 831

Query: 722  KVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNIVLKGYLEHGMFEE 781
            +VANKPLVVTYTGLIQAC+DS ++++A YIF+ MK  CSPNLVT NI+LK YL+ G+FEE
Sbjct: 832  RVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKAYLQGGLFEE 891

Query: 782  ARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGY 841
            ARELFQ +SE G +I   SD+  RVLPD Y FNTMLD    +++WDDFGY + +ML +GY
Sbjct: 892  ARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYAYREMLRHGY 951

Query: 842  HFNPKRHLRMILEAARAGKDELLETTWKHLSQADRIPPPALLKERFCMKLARGDYSDALS 901
            HFN KRHLRM+LEA+RAGK+E++E TW+H+ +++RIPP  L+KERF  KL +GD+  A+S
Sbjct: 952  HFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLEKGDHISAIS 1002

Query: 902  CISNHN----SSDAHHFSEPAWLNLLKEKRFPKDTVIQLIYKVSMLL-TRNDSPNPVFEN 928
             +++ N     ++   FS  AW  +L   RF +D+V++L+  V+  L +R++S + V  N
Sbjct: 1012 SLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRSESSDSVLGN 1002

BLAST of ClCG01G008850.2 vs. TAIR 10
Match: AT5G67570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 383.3 bits (983), Expect = 5.8e-106
Identity = 217/589 (36.84%), Postives = 335/589 (56.88%), Query Frame = 0

Query: 357 ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 416
           E +++L  RL+G  I+   W F +MM  + +++++  +L+++  LG+  +W++   V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 417 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLG 476
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 477 QAGYMRELFDVIDSMRFPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 536
           QAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV     +   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 537 VLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLGRGLVLLVKGYYEVMLE 596
           V  EL+K GL+P+ +TYGL M                                  EVMLE
Sbjct: 363 VFVELRKNGLRPNGATYGLAM----------------------------------EVMLE 422

Query: 597 CGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAAL 656
            GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+ +ME++G++G+ ++
Sbjct: 423 SGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALWREGKIEEAVEAVRDMEQKGVIGTGSV 482

Query: 657 YYDFARCLCSAGRCKEALMQMEKICKVAN-KPLVVTYTGLIQACLDSKDLQSAVYIFNHM 716
           YY+ A CLC+ GR  +A++++ ++ ++ N +PL +T+TGLI A L+   +   + IF +M
Sbjct: 483 YYELACCLCNNGRWCDAMLEVGRMKRLENCRPLEITFTGLIAASLNGGHVDDCMAIFQYM 542

Query: 717 KTFCSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNT 776
           K  C PN+ T N++LK Y  + MF EA+ELF+ +         VS     ++P+ Y ++ 
Sbjct: 543 KDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI---------VSRKETHLVPNEYTYSF 602

Query: 777 MLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQAD 836
           ML+AS    +W+ F + +  M+L GY  +  +H  M++EA+RAGK  LLE  +  + +  
Sbjct: 603 MLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHASMLIEASRAGKWSLLEHAFDAVLEDG 662

Query: 837 RIPPPALLKERFCMKLARGDYSDALSCISNHNSSDAHHFSEPAWLNLLKEKRFPKDTVIQ 896
            IP P    E  C   A+GD+  A++ I N  +  +   SE  W +L +E    +D + Q
Sbjct: 663 EIPHPLFFTELLCHATAKGDFQRAITLI-NTVALASFQISEEEWTDLFEEH---QDWLTQ 722

Query: 897 -LIYKVSMLLTRND-SPNPVFENLLLSCKEFCRTRNSVADHRLEETVCT 942
             ++K+S  L   D    P   NL  S K  C + +S A   L   V T
Sbjct: 723 DNLHKLSDHLIECDYVSEPTVSNLSKSLKSRCGSSSSSAQPLLAVDVTT 724

BLAST of ClCG01G008850.2 vs. TAIR 10
Match: AT1G52620.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 99.4 bits (246), Expect = 1.7e-20
Identity = 96/407 (23.59%), Postives = 173/407 (42.51%), Query Frame = 0

Query: 447 EALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSM--RFPPKKKFKTGAL 506
           +A+ ++  + + + S PD++A +S+   L ++  + +   V D M  R      + T  L
Sbjct: 152 KAVEIYDYVVELYDSVPDVIACNSLLSLLVKSRRLGDARKVYDEMCDRGDSVDNYSTCIL 211

Query: 507 EK------------------WDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGL 566
            K                  W     P+IV YN ++    K  ++E A+ V +ELK +G 
Sbjct: 212 VKGMCNEGKVEVGRKLIEGRWGKGCIPNIVFYNTIIGGYCKLGDIENAYLVFKELKLKGF 271

Query: 567 QPSTSTYGLVMEKKVDGFGYCMDMEWMYWVLSLG----RGL---VLLVKGYYEVMLECG- 626
            P+  T+G ++       G+C + +++     L     RGL   V  +    +     G 
Sbjct: 272 MPTLETFGTMIN------GFCKEGDFVASDRLLSEVKERGLRVSVWFLNNIIDAKYRHGY 331

Query: 627 KYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYD 686
           K +        +     P+  TY +L+N L KEGK + AV  ++   ++G++ +   Y  
Sbjct: 332 KVDPAESIGWIIANDCKPDVATYNILINRLCKEGKKEVAVGFLDEASKKGLIPNNLSYAP 391

Query: 687 FARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYI-FNHMKTF 746
             +  C +     A   + ++ +   KP +VTY  LI   + S  +  AV +    +   
Sbjct: 392 LIQAYCKSKEYDIASKLLLQMAERGCKPDIVTYGILIHGLVVSGHMDDAVNMKVKLIDRG 451

Query: 747 CSPNLVTYNIVLKGYLEHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLD 806
            SP+   YN+++ G  + G F  A+ LF  + +  RNI          LPD Y++ T++D
Sbjct: 452 VSPDAAIYNMLMSGLCKTGRFLPAKLLFSEMLD--RNI----------LPDAYVYATLID 511

Query: 807 ASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGK-DELL 824
                  +D+    F+  +  G   +   H  MI    R+G  DE L
Sbjct: 512 GFIRSGDFDEARKVFSLSVEKGVKVDVVHHNAMIKGFCRSGMLDEAL 540

BLAST of ClCG01G008850.2 vs. TAIR 10
Match: AT2G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 99.0 bits (245), Expect = 2.2e-20
Identity = 78/334 (23.35%), Postives = 140/334 (41.92%), Query Frame = 0

Query: 467 AYHSIAVTLGQAGYMRELFDVIDSMRFPPKKKFKTGALEKW-----------------DP 526
           A  S+ + LG      + F  +   R  PK +   G L ++                   
Sbjct: 197 ALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGA 256

Query: 527 RLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEKKVDGFGYCMD 586
             +P +  YN +++   K  ++E A  + +E+K +GL P T TY       +DGFG    
Sbjct: 257 GARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTY----NSMIDGFGKVGR 316

Query: 587 MEWMYWVLSLGRGL-----VLLVKGYYEVMLECGKYNLVHEFFRKVQKSSI-PNALTYKV 646
           ++         + +     V+          + GK  +  EF+R+++ + + PN ++Y  
Sbjct: 317 LDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLKPNVVSYST 376

Query: 647 LVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVA 706
           LV+   KEG   +A+    +M R G+V +   Y       C  G   +A     ++ +V 
Sbjct: 377 LVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLIDANCKIGNLSDAFRLGNEMLQVG 436

Query: 707 NKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTF-CSPNLVTYNIVLKGYLEHGMFEEAR 766
            +  VVTYT LI    D++ ++ A  +F  M T    PNL +YN ++ G+++    + A 
Sbjct: 437 VEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIPNLASYNALIHGFVKAKNMDRAL 496

Query: 767 ELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTML 777
           EL   L   G            + PD+ ++ T +
Sbjct: 497 ELLNELKGRG------------IKPDLLLYGTFI 514

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038894404.10.0e+0087.18pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 ... [more]
XP_031741862.10.0e+0084.17pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sa... [more]
XP_008459122.10.0e+0083.21PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic ... [more]
XP_038894405.10.0e+0082.35pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X2 ... [more]
KAG7019446.10.0e+0080.99Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
Q9SA767.7e-20446.05Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidop... [more]
Q9FJW68.2e-10536.84Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidop... [more]
Q76C992.5e-2122.27Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV... [more]
Q9SSR42.4e-1923.59Pentatricopeptide repeat-containing protein At1g52620 OS=Arabidopsis thaliana OX... [more]
P0C8943.1e-1923.35Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A0A0LVN70.0e+0084.17Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1[more]
A0A1S3C8Z00.0e+0083.21pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis ... [more]
A0A6J1KEH70.0e+0081.09pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbit... [more]
A0A6J1EH180.0e+0080.57LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chlo... [more]
A0A6J1CLQ90.0e+0080.15pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Momordic... [more]
Match NameE-valueIdentityDescription
AT1G30610.21.1e-20847.40pentatricopeptide (PPR) repeat-containing protein [more]
AT1G30610.15.5e-20546.05pentatricopeptide (PPR) repeat-containing protein [more]
AT5G67570.15.8e-10636.84Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G52620.11.7e-2023.59Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G02150.12.2e-2023.35Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 513..557
e-value: 6.9E-10
score: 39.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 620..649
e-value: 0.13
score: 12.6
coord: 431..457
e-value: 0.033
score: 14.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 516..549
e-value: 3.1E-5
score: 21.9
coord: 431..463
e-value: 6.5E-4
score: 17.7
coord: 723..752
e-value: 8.6E-7
score: 26.7
coord: 689..716
e-value: 0.0029
score: 15.7
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 687..732
e-value: 2.6E-9
score: 37.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 514..548
score: 11.750571
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 617..651
score: 9.415814
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 721..755
score: 11.366925
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 683..833
e-value: 1.0E-23
score: 86.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 357..504
e-value: 4.6E-14
score: 54.2
coord: 505..564
e-value: 4.4E-10
score: 41.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 584..682
e-value: 1.5E-8
score: 36.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 605..748
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 180..231
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 121..144
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 111..153
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 210..227
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 180..198
NoneNo IPR availablePANTHERPTHR46935:SF1OS01G0674700 PROTEINcoord: 4..558
coord: 592..932
IPR044645Pentatricopeptide repeat-containing protein DG1/EMB2279-likePANTHERPTHR46935OS01G0674700 PROTEINcoord: 4..558
coord: 592..932

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
ClCG01G008850ClCG01G008850gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10837622..10839255exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10840559..10840750exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10842899..10842980exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10843106..10843297exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10844216..10844329exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10844531..10844632exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10845476..10845566exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10845636..10845793exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10846397..10846828exon
ClCG01G008850.2-exonClCG01G008850.2-exon-CG_Chr01:10847371..10847787exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
ClCG01G008850.2-five_prime_utrClCG01G008850.2-five_prime_utr-CG_Chr01:10837622..10838161five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10838162..10839255CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10840559..10840750CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10842899..10842980CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10843106..10843297CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10844216..10844329CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10844531..10844632CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10845476..10845566CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10845636..10845793CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10846397..10846828CDS
ClCG01G008850.2-cdsClCG01G008850.2-cds-CG_Chr01:10847371..10847772CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
ClCG01G008850.2-three_prime_utrClCG01G008850.2-three_prime_utr-CG_Chr01:10847773..10847787three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
ClCG01G008850.2ClCG01G008850.2-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
molecular_function GO:0005515 protein binding