ClCG07G001810 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG07G001810
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: LAGLIDADG_2 domain-containing protein
Location: CG_Chr07: 1965022 .. 1973211 (+)
RNA-Seq Expression: ClCG07G001810
Synteny: ClCG07G001810
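
The Location field above can be turned into a genomic span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; this page does not state it). `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("CG_Chr07: 1965022 .. 1973211 (+)")

# Assumption: both endpoints are included, hence the +1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature covers 8190 bp on the plus strand of CG_Chr07.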
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGGTAAAGCCTTCAAAACACCTTGCTTTGCGAAAAGTTCTTTAAAGAAATGTGTTTTTTTGTGTTAAAAGCTTAATGGGTTTCTGTTGTTTTTATGAATTGTAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCTCAATCCACCAAGAAGGGGAAAAGTTGGGGTAAGAGAAAAATGCATAGTACCTGTTTGATAAAATGTCAAAGTGGCAATTATTTATTTCAAGTGTCATGAAACTTTTCTTTTTTGGCCAGATAAAGGAACTGCTAGATCAAAGGAGATAGTGGTGTCTAGTCCTTGTGGGAGTTCTCAAAAATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAAAGGATATTGCAAAGTCTTGCTTTTATAGTACCATTAATACAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCAATATTGGATCCAAAAGATGAAAAAGGAAGATATTGCTAAGGGAATTAGACAGATTGGTCAAAAGGTGGATTCTTCTTCTTCTGTTCATCCTGGCACCAAAGTTTTGCCAATAACTGATGCCACTTTAGTAGAGCAATATAGCAGTAAGCCTGAGAAGAAAGTTGTGGGCAATGGGGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAAATGGGCTATGGCTTCAAGATCAGAAAAAGGAGGGAAGTTCATCACTGGCAAGGTGATTTTCCTAACTTCTTAATAACCTTCCCCTTGTTTGGATATTTAAGAATCCTTAATTTTTACAAGTAATCAACAATATTTCTATTGGTTTGAATCATCTACTCCTTAAATCACCACATAACTCAATAGTTTAAGCTTATGAGTTCAGGTTCATATACTCTTACATTCTTAATACCATCTTTAAGTATGGGCTTGAGGTAATTTGACTTCTGGAACCCGAAACGTGAATTGTTCGAGTTAGGATCTCTTGTTCTGATACAATATTAAACTACTGCTAAATCCAAAAGAAGGTCATACTTCATAGGTCTAAATAAAATTTATTTTTTTCATAGGTATTTTTTTACATTACCAAATCCATATTGAATGGTGACAGTGATCTGTATCCATGCTGAATATGAAAGAAGCCAAAGCATAAGCCATTGGTGTTATTATCACCATTATGGCCACCAATATCCTTAATAATGAGCGAACCAATTTGAAGGGTGGATATAGAATGACTTCTTGAGTGCTTAAAACTCTTAAACGCTTTGAGAAGTGATTCCAAACAAGCTCGAAATATTTATGTTTCAATGATGCAATATTGAAGGGATACATAATAATGATGGCTGAAAATTGTGCAACTTCTTTCAGGTTTCACGACTGCGGAATCGAGGAACTCTTAAAGCAGGCTTGGACGATGATCGAGGGAGTAATGACTCCCCGAAGATCAGTTTCAGATGGGAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCTCATTCAAGAACTGTTCCATTACACTGAATTCCACTGCCATTCATGAAATCAATCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATGTAAGATTGTAATCTCCATTTTCTTTCACCAAACTGATCACTAATGAAAATTTGTTATAAAACAAAAGACATGAGGAACTGGCTTGACTTTTTCCTTCTTTACCTTGAATGAACAGTTGTTGTGCTAGAGCTTTGAAGGAGAAGGAAGACAAATGATCTGCAATGGCTTCATCAATTGTGGTATCTGCTTGCAATATGTGAATTGTAGATTCTATGCAAAATCAGAGGATTTGTTTGTAAAAATGCGTCAAATATTTATGCGAACACATGTATCTTAAATAACATTTTGATTGTAACTCAAATCATTATCTAACAAGTCTTTGGTGTAAGAGGTTAACTGAGCTTCACATCTGCATTC
CCTTTTCGCAATGCGATTTTTTTTCGAGTTTTTGGAGGTTGAAATTGAGATAAGAAAAGGCATCTTAGGATATCAATGAGTAGCTTTCCTGCATCAAATTTATCATAGGAAAGAAAAAACAAGAGTGGTTATCAAACAAATTTCTATTTCTTTTTTTCCATTAGAAGAAAAAAGAAAAAAGAAAAAAAACTATAAACCGTTACCATATGCAATTATATTTTTCTTCTATTAAAAAAGGTGAAAATGGTAAAATCAGTACTAAAAGAATGAGAAACGAGAAACACAAATAATTTAATCATCGTAACAAATTGGAGTAAGAATTTAGAAACACTTAGGATGAGATGTGAGGAGCATAAACTCTACAAAGTAGCCCTTGATCCTTAATTATCTTTGCACTATTTAAGAAGAAACAAATGATTTTTTTTTTTCATAATATAGAGGGATGTGGAATCTAATCTTTGACATCAAAGTTGATAGTACAGACATTATGTTAGTTGAGCTATATTCGTTTTGACATAAAATAATTACTTTGGCATCTTTTAAACCACTTCATATTGGGCCCAATATGCCATAAAACTCAGAAAGACCACACAAAATTATCCTGTTAAGTTAGTGTAGAATAAGACTTTATGTAATATTTGTAAACAAACTCTTAACTTTATTCACAATTTTTTTTTTTTTTTAAAAAAACCTTCATTCATTTTAAGATTTATGCAAAATGCTACTTTTAAAAGGACATATATATTTTTGAAATATGGCATCGATAGTTTTTATTCATATATTACTAATAAGAGTATTTTTAAGTTTAAGAGTATTAATGAAATACTTAAAAGATTAATTGTATATGTGTATATATATATGTATGCATATATGTGTATGTATGTATATAAAAGTTGTTTTTAAATATAGCAAGATGAGTGAAACTATTTACACATATAGAAAAATTTCAACAATTTTTTTTTATATTTTTTCTATTTATATTTTTATATTTTTTTATATATATATAAGAGATGAAATGATATAGGGATATGATTTATGAGGTGTAAATAAGCAATTTTTTGTCTACACATATTAACAGTAGAGTTTTTTTAAAAAAATCAAATATATTAATGCAACTTTTGAAAGGTCATAGATATTTTTTAAATGTGGTAGAGCAGTTTTCATTCATATATTAATAATAAGAGTATTTTTAATATTTTTTTAAAAAAATTTAAAAATATTAATGAAACTTTTAAAAGTTAAAGCAAGATTATGAAATAAAATAATATATATATTATCTTTGAAGTTTAAATGTATTATTTAAAATTTAAAAAGTTGAGCTGTATTTTGAAGAAAGTGGGAAAAGAAAAATAATGGGTTCTTTGGTAGGCAATTTGAAAACATAGACTAAAAATAATGGATTCAACAAAAATAGAATTATATTTAATATTTTCAGTTATGTGTTTGGTAACAGATTTATTAATTGAATACAAATTTAAATAGTTATTCAAAATCTCTTTGATAATATATTTCTAAACCATAGAACTAAGGGTAGTTACCCAATAACTGTTAGGTTGAATATAAATTATTAGGGAAAATTGTAAAGACCACCCATGAAATATGGTAGTAGTCACAATTACCCCCAAACTTTTAATTGTAGAAATTAGGGCTCTAAACTTCTAAAGTGTTAGAATTGAATCCTCAAGCTTTCAATGATTGTAGAAATTGGTCCCACAAATGATAAAAATTAAAATTTCAAACTTATACAAACATTGCAATTTATACACATATAAGTTTGAAGTTCCAATTTGACCCAAATTTAATGATTATATAAGTTGGATGACTCAATTTTTAAAATTAAAAGTTTAAGGGTATAACTACAATTACCACCAAGTGATTTTTGGTATGTCAAATCATTATTTATTTATTTACAAACTTGCTTTATATTAAAAACTTTGTATAATTACTATTTTATAATATCATATAATAAATAACACATTATATATTTCATATTTTGAATT
TAAAAAAATAATTGAATTTACAACACAAAGTATTGTAGTTATTAACATTTTTTATCTTTATTTTTAATATAATTATACTTTTGAAAATTTCAAATTCAAATTTTAAAAAGAAGTGAAAACATAAAAAATTGTTGTTTTCAAACCATTTTAAAGTTTAAAGGTTTGTTTGGTAGAGAATCTATATTTTATTTTATGTTTTCAGATTCATTCATTTAGGTAAATATGTGTTTGGTAGATAACTTTAATTCTTGAAAACTATTTAAAAATTTTATAATCAAAATCGTGCAAATTCTGAAAAATAATATTTTTATATTTTCATTGTATCTATATTTTAAATTTTAAATTTTAAAATATAAAATTATATTAAAATAAAGATACATATTATAAACTCAATTCATTTTATAAAATATATAATATATATTAGATAATGTATTATAAATTATATAAATTACAAAACAATAATGATATACATTGTTTTAACATAAATCATATTTATAAATAAATTAATAACAATTTATTTTCAACATAATATTAATAATATACTTCAACTAATTTTATAATTTATAAATATATAATATTAAATATATTTTAAAGTTGAAAATATAATTATTTTATTTTATTTATTGTATTTTTTTTTTCGCTCAAAACCACGCCATTAGGGTTTTCGTCATTCTTCTTCTTTTTCCCTCTCTTGCTTCCCGCCTTGTCAGAGAACCGCCATGGATAGGTACCAGAAGCTCGAGAAGCCTAAACCTGATTCTGCTCGGAATGAAAATTAAATCCGAAAAACTAGTCAAGGCGTTGCTCGGAACTAAATCTCGTATGCTTCCACCCTGCTCCAGGTATCATTTCACTGTTTCTTGTATCGTTTTCAACAGCTATTGTATCTCATTTCATTTTTTTCATTTTTGTGTATTCGTCTTATTCTTTCTGTTTATGTGTTGCGTTTGTTGATATTTTTCTTTAATGCTGAATCTGTAATAGTTGCTCGCTGGCTGATGACTTGAAGTCCTCCCAAATTCGGCTGTCACTCCCAATTCTGCATTTTATATCTATCCGTAATCCTCCTTCCGTTTTCTCCATGTCCATTCCTACTACCTCTGCATTTGCCACTGTGACCCTTTTCCGTTCTCTCACTCTTTCCCTCTCTCCATACCATCGCTACTTTCATTGTCCCAATCACATAGTCCGTACTCTCTTTATCCCAACATATTCTGTAAAAGGACAACTTCGGCGGATTCCGTCCTTTGCTTCCAGTTCTTTTGTTGAACAGCTGGTGCATGACCGGGATTCCCCGTTGGAGTCTGAAGAGCACGTATTTTCTTCATACAGTAATGAGGCTGATGGTTTTCATTTTGAAAATGGTTTTGCGTCGGCGGATTTGAAACATTTGGGAACGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAGGATGAAGCGACCTATCTCACTGTGCATTGTTTGCGTATTCGTGAAAACGAGACTGCATTTAGGGTTAGTGTCTTGTCTCTTTCTTCTTTATTCTATTATGTTCAAATACCATAGGTATTGCAACTTGTAATGCAATTATGGGTTTGTTGAATAATTGTGGATTATTTGGAAGTCTTCAATTTTGGCTATGTTGTGCTTTATATGTTTGGATTATTCGTTATATAATACTGTAAATAAGTTTGGAATGTAATGGTAGGACTCTGAAGATAACTGTAGCATGAACGTCATTTGGAATAGACTGAAATTTCTGGAATAAAAGTTAAATTTGATAGTTCTCTTTATTTGCTATTTTTTTTAACCTATGCTTTGTGATATAATTTTCTTTCCATTTTTTGATCTTCTCTATTTTGGAGATTAGGTGTACAAGTGGATGATGCAACAACGTTGGTACCGATTC
GATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAGTTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGCGTGCCTAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCGACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGGCTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTACCAGGATACTATAGACAAAGAAAGGATAGTGTTTCTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCGAGCTCAAAAATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAAACTTAAGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGATGGGTGAACCAATGAAAGCTTTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTGTACAAGTGCTGCAGCATATCAGACGATTATTGGTATTTTATGTAAATCTCAACAGATAGAACTTGCAGAATCGATCATGGCAGGCTTCATAAAGAGTAATTTGAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTAGTAAAAGTTGGTAATATCTACAAGGCCGAAGAAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCGTGCAACATCATTTTAAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTAAGTAGGAAGGAGGTTAAGAAGCCACTAAGCTTGAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCAATTTGAATTCCTCAAAAACCGGAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGTGACGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTTCAATACCTAATCTAATTCACCGGTGGCTTTCACCTTGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATATATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTACATGAAAGATAGTCTACGGGCAGACAATCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTAATACAAGAGTTTTAGTTGCTGGTCCATTATCAGTTGGATTCTTTACTTTGCCAAATGACGGAAGCCTTGAATGTTTCATGGTTTTGGGAGGGTTTGTTATTCAACTAGGATCTTCCTTGACCAATTCGGAAGATGTAAATACTTGAATGGATAAGTAATGTTTATGTTCTGATTCTGTGTATT
GTCGTAGCATTCTTGTTCACTTTAATATAGGTTAGTTAGTTTGGAAGTCATTAAAGTTTGTTGTTTTGTCGTTGTAAATATATATTTAAAAAGGATAAGTTACCTTCATAAGACTTGCGCTTACCTACTGATTGGTTAGTAGAAATCTTCTATTGATCACCATAAAGTTTTTGACTGGTAAGAAAGAGAG

mRNA sequence

ATGCAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCTCAATCCACCAAGAAGGGGAAAAGTTGGGATAAAGGAACTGCTAGATCAAAGGAGATAGTGGTGTCTAGTCCTTGTGGGAGTTCTCAAAAATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAAAGGATATTGCAAAGTCTTGCTTTTATAGTACCATTAATACAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCAATATTGGATCCAAAAGATGAAAAAGGAAGATATTGCTAAGGGAATTAGACAGATTGGTCAAAAGGTGGATTCTTCTTCTTCTGTTCATCCTGGCACCAAAGTTTTGCCAATAACTGATGCCACTTTAGTAGAGCAATATAGCAGTAAGCCTGAGAAGAAAGTTGTGGGCAATGGGGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAAATGGGCTATGGCTTCAAGATCAGAAAAAGGAGGGAAGTTCATCACTGGCAAGGTTTCACGACTGCGGAATCGAGGAACTCTTAAAGCAGGCTTGGACGATGATCGAGGGAGTAATGACTCCCCGAAGATCAGTTTCAGATGGGAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCTCATTCAAGAACTGTTCCATTACACTGAATTCCACTGCCATTCATGAAATCAATCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATGTACCAGAAGCTCGAGAAGCCTAAACCTGATTCTGCTCGGAATGAAAATTAAATCCGAAAAACTAGTCAAGGCGTTGCTCGGAACTAAATCTCGTATGCTTCCACCCTGCTCCAGTTGCTCGCTGGCTGATGACTTGAAGTCCTCCCAAATTCGGCTGTCACTCCCAATTCTGCATTTTATATCTATCCGTAATCCTCCTTCCGTTTTCTCCATGTCCATTCCTACTACCTCTGCATTTGCCACTGTGACCCTTTTCCGTTCTCTCACTCTTTCCCTCTCTCCATACCATCGCTACTTTCATTGTCCCAATCACATAGTCCGTACTCTCTTTATCCCAACATATTCTGTAAAAGGACAACTTCGGCGGATTCCGTCCTTTGCTTCCAGTTCTTTTGTTGAACAGCTGGTGCATGACCGGGATTCCCCGTTGGAGTCTGAAGAGCACGTATTTTCTTCATACAGTAATGAGGCTGATGGTTTTCATTTTGAAAATGGTTTTGCGTCGGCGGATTTGAAACATTTGGGAACGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAGGATGAAGCGACCTATCTCACTGTGCATTGTTTGCGTATTCGTGAAAACGAGACTGCATTTAGGGTGTACAAGTGGATGATGCAACAACGTTGGTACCGATTCGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAGTTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGCGTGCCTAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCGACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGGCTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTACCAGGATACTATAGACAAAGA
AAGGATAGTGTTTCTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCGAGCTCAAAAATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAAACTTAAGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGATGGGTGAACCAATGAAAGCTTTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTGTACAAGTGCTGCAGCATATCAGACGATTATTGGTATTTTATGTAAATCTCAACAGATAGAACTTGCAGAATCGATCATGGCAGGCTTCATAAAGAGTAATTTGAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTAGTAAAAGTTGGTAATATCTACAAGGCCGAAGAAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCGTGCAACATCATTTTAAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTAAGTAGGAAGGAGGTTAAGAAGCCACTAAGCTTGAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCAATTTGAATTCCTCAAAAACCGGAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGTGACGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTTCAATACCTAATCTAATTCACCGGTGGCTTTCACCTTGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATATATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTACATGAAAGATAGTCTACGGGCAGACAATCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTAATACAAGAGTTTTAGTTGCTGGTCCATTATCAGTTGGATTCTTTACTTTGCCAAATGACGGAAGCCTTGAATGTTTCATGGTTTTGGGAGGGTTTGTTATTCAACTAGGATCTTCCTTGACCAATTCGGAAGATGTAAATACTTGAATGGATAAGTAATGTTTATGTTCTGATTCTGTGTATTGTCGTAGCATTCTTGTTCACTTTAATATAGGTTAGTTAGTTTGGAAGTCATTAAAGTTTGTTGTTTTGTCGTTGTAAATATATATTTAAAAAGGATAAGTTACCTTCATAAGACTTGCGCTTACCTACTGATTGGTTAGTAGAAATCTTCTATTGATCACCATAAAGTTTTTGACTGGTAAGAAAGAGAG

Coding sequence (CDS)

ATGCAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCTCAATCCACCAAGAAGGGGAAAAGTTGGGATAAAGGAACTGCTAGATCAAAGGAGATAGTGGTGTCTAGTCCTTGTGGGAGTTCTCAAAAATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAAAGGATATTGCAAAGTCTTGCTTTTATAGTACCATTAATACAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCAATATTGGATCCAAAAGATGAAAAAGGAAGATATTGCTAAGGGAATTAGACAGATTGGTCAAAAGGTGGATTCTTCTTCTTCTGTTCATCCTGGCACCAAAGTTTTGCCAATAACTGATGCCACTTTAGTAGAGCAATATAGCAGTAAGCCTGAGAAGAAAGTTGTGGGCAATGGGGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAAATGGGCTATGGCTTCAAGATCAGAAAAAGGAGGGAAGTTCATCACTGGCAAGGTTTCACGACTGCGGAATCGAGGAACTCTTAAAGCAGGCTTGGACGATGATCGAGGGAGTAATGACTCCCCGAAGATCAGTTTCAGATGGGAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCTCATTCAAGAACTGTTCCATTACACTGAATTCCACTGCCATTCATGAAATCAATCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATGTACCAGAAGCTCGAGAAGCCTAAACCTGATTCTGCTCGGAATGAAAATTAAATCCGAAAAACTAGTCAAGGCGTTGCTCGGAACTAAATCTCGTATGCTTCCACCCTGCTCCAGTTGCTCGCTGGCTGATGACTTGAAGTCCTCCCAAATTCGGCTGTCACTCCCAATTCTGCATTTTATATCTATCCGTAATCCTCCTTCCGTTTTCTCCATGTCCATTCCTACTACCTCTGCATTTGCCACTGTGACCCTTTTCCGTTCTCTCACTCTTTCCCTCTCTCCATACCATCGCTACTTTCATTGTCCCAATCACATAGTCCGTACTCTCTTTATCCCAACATATTCTGTAAAAGGACAACTTCGGCGGATTCCGTCCTTTGCTTCCAGTTCTTTTGTTGAACAGCTGGTGCATGACCGGGATTCCCCGTTGGAGTCTGAAGAGCACGTATTTTCTTCATACAGTAATGAGGCTGATGGTTTTCATTTTGAAAATGGTTTTGCGTCGGCGGATTTGAAACATTTGGGAACGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAGGATGAAGCGACCTATCTCACTGTGCATTGTTTGCGTATTCGTGAAAACGAGACTGCATTTAGGGTGTACAAGTGGATGATGCAACAACGTTGGTACCGATTCGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAGTTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGCGTGCCTAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCGACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGGCTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTACCAGGATACTATAGACAAAGA
AAGGATAGTGTTTCTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCGAGCTCAAAAATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAAACTTAAGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGATGGGTGAACCAATGAAAGCTTTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTGTACAAGTGCTGCAGCATATCAGACGATTATTGGTATTTTATGTAAATCTCAACAGATAGAACTTGCAGAATCGATCATGGCAGGCTTCATAAAGAGTAATTTGAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTAGTAAAAGTTGGTAATATCTACAAGGCCGAAGAAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCGTGCAACATCATTTTAAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTAAGTAGGAAGGAGGTTAAGAAGCCACTAAGCTTGAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCAATTTGAATTCCTCAAAAACCGGAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGTGACGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTTCAATACCTAATCTAATTCACCGGTGGCTTTCACCTTGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATATATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTACATGAAAGATAGTCTACGGGCAGACAATCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTAA

Protein sequence

MQQVLQWVFKETNEKQGQRTSQSTKKGKSWDKGTARSKEIVVSSPCGSSQKSKGFEGRKFDLKKLKSFALLCRKDIAKSCFYSTINTKRPRTQGFDNMHQYWIQKMKKEDIAKGIRQIGQKVDSSSSVHPGTKVLPITDATLVEQYSSKPEKKVVGNGESKTKTKSRMKELLKWAMASRSEKGGKFITGKVSRLRNRGTLKAGLDDDRGSNDSPKISFRWEAESCSSISSAYSSVSAVSSFKNCSITLNSTAIHEINQYYPRRGSWITTDSECTRSSRSLNLILLGMKIKSEKLVKALLGTKSRMLPPCSSCSLADDLKSSQIRLSLPILHFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
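
The protein sequence is the conceptual translation of the CDS. As a rough check, the sketch below translates the first and last codons of the CDS listed above, assuming the standard nuclear genetic code (the page does not name the translation table). `translate` is an illustrative helper, not the tool the database used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCAGCAAGTTCTTCAATGG"))  # first 21 nt of the CDS
print(translate("GGGGAGGCTTCTAATTAA"))     # last 18 nt of the CDS, including the stop codon
```

The two fragments translate to `MQQVLQW` and `GEASN`, which match the start and end of the protein sequence shown here.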
Homology
BLAST of ClCG07G001810 vs. NCBI nr
Match: XP_008465080.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis melo])

HSP 1 Score: 1441.8 bits (3731), Expect = 0.0e+00
Identity = 716/797 (89.84%), Positives = 756/797 (94.86%), Query Frame = 0

Query: 341  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK-GQLRRIPSF 400
            VFSMSIP TSAF+TVTL RSLTLSLSPYH YFH PNHI+ TLFI +YSVK  QL RI +F
Sbjct: 2    VFSMSIP-TSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF 61

Query: 401  ASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDE 460
            AS SFV+QLV+DRDSP ESEEH+ S YSN  DGFHFENGFAS DLKHLGTPALEVKELDE
Sbjct: 62   ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 121

Query: 461  LPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRV 520
            LPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQ+KW+GQD+ATYLTVHCLRIRENETAFRV
Sbjct: 122  LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181

Query: 521  YKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 580
            YKWMMQQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL
Sbjct: 182  YKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 241

Query: 581  SAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL 640
            SAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Sbjct: 242  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL 301

Query: 641  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMG 700
            VTSGLELHKDIYGGLIWLHSYQDTIDKERIV LRKEMQQAGIKEE+EVLLSILRASSKMG
Sbjct: 302  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMG 361

Query: 701  DVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQT 760
            DVVEAER WQKLKY DG+MP QAFVYKMEVYAKMG+PMKALEIFREMEQLN T+AAAYQT
Sbjct: 362  DVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQT 421

Query: 761  IIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 820
            IIGILCK Q+IELAESIMAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKC
Sbjct: 422  IIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKC 481

Query: 821  KPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAE 880
            KPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIG+NARSCN+IL GYLLFGNY+KAE
Sbjct: 482  KPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAE 541

Query: 881  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES 940
            KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Sbjct: 542  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES 601

Query: 941  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYF 1000
            DEERKNHRIQFEF KN  +HS+LRRHIYEQYH+WLH ASKL+DGDIDIPYKFCTVSHSYF
Sbjct: 602  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYF 661

Query: 1001 GFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 1060
            GFYADQFWPRG  +IPNLIHRWLSP  LAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK
Sbjct: 662  GFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 721

Query: 1061 SLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNET 1120
            SLREKSMHCKVKRKG++YWIGLLGSNATWFWKLIEPFILD +K+S +AD+LNL  VLNET
Sbjct: 722  SLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNET 781

Query: 1121 ENINFDSQSDSVGEASN 1137
            ENINFDSQSDSV E SN
Sbjct: 782  ENINFDSQSDSVEETSN 796
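
The percentages reported for each hit are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts of the hit above; `percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to match the figures as displayed.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)

print(percent(716, 797))  # identical residues over alignment length
print(percent(756, 797))  # identical plus conservatively substituted residues
```

The same calculation applies to the other hits, each with its own alignment length.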

BLAST of ClCG07G001810 vs. NCBI nr
Match: XP_004152074.2 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sativus] >KGN58344.1 hypothetical protein Csa_017589 [Cucumis sativus])

HSP 1 Score: 1426.0 bits (3690), Expect = 0.0e+00
Identity = 702/798 (87.97%), Positives = 753/798 (94.36%), Query Frame = 0

Query: 341  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK--GQLRRIPS 400
            VFSMSIP TSAF+TVT  RSLTLSLSPYH YFHCPNHI+ TLF+P YSVK   QL RI +
Sbjct: 2    VFSMSIP-TSAFSTVTRLRSLTLSLSPYHHYFHCPNHIIPTLFLPAYSVKVRRQLPRIRA 61

Query: 401  FASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELD 460
            FAS SFV+QLV+D DSP ESEEH+ SS+SN  DGFHFENGFAS DLKHLGTP LEVKELD
Sbjct: 62   FASGSFVKQLVYDHDSPSESEEHLSSSFSNGGDGFHFENGFASVDLKHLGTPVLEVKELD 121

Query: 461  ELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFR 520
            ELPEQWRRSK+AWLCKELPAQKPGT+IRLLNAQKKW+GQD+ATYL VHCLRIRENETAFR
Sbjct: 122  ELPEQWRRSKVAWLCKELPAQKPGTVIRLLNAQKKWMGQDDATYLIVHCLRIRENETAFR 181

Query: 521  VYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY 580
            VYKWMMQQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY
Sbjct: 182  VYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY 241

Query: 581  LSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHN 640
            LSAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHN
Sbjct: 242  LSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHN 301

Query: 641  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKM 700
            LVTSGLELHKD+YGGLIWLHSYQDTID+ERIV LRKEMQQAGIKEEREVLLSILRASSKM
Sbjct: 302  LVTSGLELHKDMYGGLIWLHSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKM 361

Query: 701  GDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQ 760
            GDV+EAE+ WQ+LKY DG+MPSQAFVYKMEVYAKMG+PMKALEIFREMEQLN T+AAAYQ
Sbjct: 362  GDVMEAEKLWQELKYLDGNMPSQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQ 421

Query: 761  TIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK 820
            TIIGILCK Q IELAESIMAGFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEK
Sbjct: 422  TIIGILCKFQVIELAESIMAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEK 481

Query: 821  CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKA 880
            CKPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIGINARSCNIIL GYLL GNY+KA
Sbjct: 482  CKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKA 541

Query: 881  EKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIE 940
            EKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIE
Sbjct: 542  EKIYDLMCQKRYDIDPPLMEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE 601

Query: 941  SDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSY 1000
            SD+ERKNHRIQFEF +N  +HS+LRRHIYEQYH+WLH ASKL+DGD+DIPYKFCTVSHSY
Sbjct: 602  SDDERKNHRIQFEFHRNCKTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSY 661

Query: 1001 FGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIV 1060
            FGFYADQFWPRG  +IPNLIHRWLSP VLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIV
Sbjct: 662  FGFYADQFWPRGRRAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIV 721

Query: 1061 KSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNE 1120
            KSLREKS+HCKVKRKGN+YWIGLLGSNATWFWKLIEPFILDY+K+S +AD+LNL  VLN 
Sbjct: 722  KSLREKSIHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKESTQADSLNLVGVLNG 781

Query: 1121 TENINFDSQSDSVGEASN 1137
            +ENINFDS+SDSV E SN
Sbjct: 782  SENINFDSESDSVEETSN 798

BLAST of ClCG07G001810 vs. NCBI nr
Match: XP_038887990.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa hispida])

HSP 1 Score: 1421.0 bits (3677), Expect = 0.0e+00
Identity = 709/790 (89.75%), Positives = 746/790 (94.43%), Query Frame = 0

Query: 349  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVE 408
            TSAF++VTL RS +LSLSPYH YF CPNHIVRT+FIP YSVKG  QL RIPSFASSS VE
Sbjct: 5    TSAFSSVTLLRSPSLSLSPYHHYFRCPNHIVRTIFIPIYSVKGQQQLPRIPSFASSSSVE 64

Query: 409  QLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRR 468
            QLV+DRDS  ESEEH+ S YSN AD      GFASADLKHL  PALEVKELDELP+QWRR
Sbjct: 65   QLVYDRDSLFESEEHLSSPYSNGAD------GFASADLKHLEMPALEVKELDELPDQWRR 124

Query: 469  SKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQ 528
            SKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QD+ATYLTVHCLRIRENETAFRVYKWMMQQ
Sbjct: 125  SKLAWLCKELPAQKPGTLIRLLNAQRKWMRQDDATYLTVHCLRIRENETAFRVYKWMMQQ 184

Query: 529  RWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 588
            RWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC
Sbjct: 185  RWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 244

Query: 589  IEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL 648
            IEEASTIYNRMIQLGGY+PRLSLHNSLFRAL SKPGDLSKHHLKQAEFIYHNLVTSGLE+
Sbjct: 245  IEEASTIYNRMIQLGGYQPRLSLHNSLFRALTSKPGDLSKHHLKQAEFIYHNLVTSGLEV 304

Query: 649  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAER 708
            HKDI GGLIWLHSYQDTIDKERIV LRKEMQQAGIKEEREVLLSILRASSKMG+V+EAER
Sbjct: 305  HKDICGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGNVMEAER 364

Query: 709  SWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQTIIGILCK 768
            SWQKLK FDG+MPSQAFVYKMEVYAKMG+PMKALEIFREMEQLN  +AAAY+TIIGILCK
Sbjct: 365  SWQKLKDFDGNMPSQAFVYKMEVYAKMGKPMKALEIFREMEQLNSANAAAYRTIIGILCK 424

Query: 769  SQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 828
             Q IELAESIM GFIKSNLKPLMPAYVDLMNMFFNLSLH+KLEL FSQCLEKCKP+RTIY
Sbjct: 425  FQDIELAESIMKGFIKSNLKPLMPAYVDLMNMFFNLSLHNKLELIFSQCLEKCKPDRTIY 484

Query: 829  SIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMC 888
            SIYLDSLVKVGN+ +AEEIF+QMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMC
Sbjct: 485  SIYLDSLVKVGNLDRAEEIFSQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMC 544

Query: 889  QKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH 948
            QKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGG+EIESDEERKNH
Sbjct: 545  QKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGVEIESDEERKNH 604

Query: 949  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQF 1008
            RIQFEF +N N+HSLLRRHIYEQYHEWLH ASKL+DGDIDIPYKFCTVSHSYFGFYADQF
Sbjct: 605  RIQFEFQQNCNTHSLLRRHIYEQYHEWLHSASKLNDGDIDIPYKFCTVSHSYFGFYADQF 664

Query: 1009 WPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSM 1068
            WP+GHP+IPNLIHRWLSP VLAYWYMYGGCRTSSGDILLKLKGS EGVEKIVKSLREKSM
Sbjct: 665  WPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSREGVEKIVKSLREKSM 724

Query: 1069 HCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDS 1128
             CKVKRKG++YWIGLLG+NATWFWKL+EPFILDY+KDSL AD+ NL RVLNETENINFDS
Sbjct: 725  QCKVKRKGSMYWIGLLGNNATWFWKLVEPFILDYLKDSLEADSPNLGRVLNETENINFDS 784

Query: 1129 QSDSVGEASN 1137
            QSDSV EASN
Sbjct: 785  QSDSVEEASN 788

BLAST of ClCG07G001810 vs. NCBI nr
Match: XP_022998786.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1360.9 bits (3521), Expect = 0.0e+00
Identity = 675/790 (85.44%), Positives = 727/790 (92.03%), Query Frame = 0

Query: 349  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVE 408
            TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE
Sbjct: 5    TSAFATVTLLRSLTLPFSQCHNHFRCWNYVIRSLSIPTYSAKGRRQLPRIPAFASSSSVE 64

Query: 409  QLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRR 468
             LV+DRDSP ESEE + S YSN A+       FASADLKHLG PALEVKELDELPEQWRR
Sbjct: 65   ALVYDRDSPAESEEPLCSPYSNGAE------EFASADLKHLGAPALEVKELDELPEQWRR 124

Query: 469  SKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQ 528
            SKLAWLCKELPA KPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ
Sbjct: 125  SKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQ 184

Query: 529  RWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 588
             WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC
Sbjct: 185  HWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 244

Query: 589  IEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL 648
            IEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+GLEL
Sbjct: 245  IEEASTIYNRMIQLGGYPPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLVTTGLEL 304

Query: 649  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAER 708
            HKDIYGGLIWLHSYQDT+DKERI+ LRKEMQQAGI+EEREVL+SILRASSK+GDV+EAER
Sbjct: 305  HKDIYGGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAER 364

Query: 709  SWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQTIIGILCK 768
            SW K+K FDGSMPSQAFVYKMEVYAK+G PMKALEIFREMEQLN  S+AAYQTIIGILCK
Sbjct: 365  SWLKIKSFDGSMPSQAFVYKMEVYAKVGNPMKALEIFREMEQLNSISSAAYQTIIGILCK 424

Query: 769  SQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 828
             +++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY
Sbjct: 425  FEEVTLAESVMAGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 484

Query: 829  SIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMC 888
            SIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMC
Sbjct: 485  SIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMC 544

Query: 889  QKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH 948
            QKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Sbjct: 545  QKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNH 604

Query: 949  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQF 1008
            RIQFEF ++ ++HS LRRH+YEQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQF
Sbjct: 605  RIQFEFHEDCSTHSCLRRHVYEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQF 664

Query: 1009 WPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSM 1068
            WPRGHP+IPNLIHRWLSP VLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLREKSM
Sbjct: 665  WPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVVKIVKSLREKSM 724

Query: 1069 HCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDS 1128
             CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+ADNLNLE+ +NET NINFDS
Sbjct: 725  SCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADNLNLEKAVNETYNINFDS 784

Query: 1129 QSDSVGEASN 1137
            QSDS  EAS+
Sbjct: 785  QSDSDEEASS 788

BLAST of ClCG07G001810 vs. NCBI nr
Match: XP_022949171.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1352.8 bits (3500), Expect = 0.0e+00
Identity = 670/790 (84.81%), Positives = 723/790 (91.52%), Query Frame = 0

Query: 349  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVE 408
            TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE
Sbjct: 5    TSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVE 64

Query: 409  QLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRR 468
             LV+DRDSP ESEE + S YS  A+      GFASADLKHLG PALEVKELDELPEQWRR
Sbjct: 65   ALVYDRDSPAESEEPLCSPYSTGAE------GFASADLKHLGAPALEVKELDELPEQWRR 124

Query: 469  SKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQ 528
            SKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ
Sbjct: 125  SKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQ 184

Query: 529  RWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 588
             WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGC
Sbjct: 185  HWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGC 244

Query: 589  IEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL 648
            IEE+STIYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL T+GLEL
Sbjct: 245  IEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLEL 304

Query: 649  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAER 708
            HKDIYGGLIWLHSYQDT+DKERI+ LRKEM QAGI+EEREVL+SILRASSK+GDV+EAER
Sbjct: 305  HKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAER 364

Query: 709  SWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQTIIGILCK 768
            SW KLK FDGSMPSQAFVYKMEVYAK+G PMKA EIFREMEQLN  SAAAYQTIIGILCK
Sbjct: 365  SWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIGILCK 424

Query: 769  SQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 828
             +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY
Sbjct: 425  FEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 484

Query: 829  SIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMC 888
            SIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMC
Sbjct: 485  SIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMC 544

Query: 889  QKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH 948
            QKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Sbjct: 545  QKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNH 604

Query: 949  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQF 1008
            RIQFEF ++ ++HS LRRHI+EQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQF
Sbjct: 605  RIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQF 664

Query: 1009 WPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSM 1068
            WPRGHP IPNLIHRWLSP VLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLREKSM
Sbjct: 665  WPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSM 724

Query: 1069 HCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDS 1128
             CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+AD+LN+E+  NET NINFDS
Sbjct: 725  SCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDS 784

Query: 1129 QSDSVGEASN 1137
            QSDS  EAS+
Sbjct: 785  QSDSDEEASS 788

BLAST of ClCG07G001810 vs. ExPASy Swiss-Prot
Match: Q9XIL5 (Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=OTP51 PE=2 SV=3)

HSP 1 Score: 894.4 bits (2310), Expect = 1.3e-258
Identity = 462/844 (54.74%), Positives = 623/844 (73.82%), Query Frame = 0

Query: 310  SSCSLADDLKSSQIRLSLPILHFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYH 369
            SS SLA    SS   +S+   +  S+ + P++ + S          TLFRSL+ SL   H
Sbjct: 21   SSFSLA---SSSSSTVSVTTFNISSLSSNPNIINSS---------STLFRSLSFSLI-RH 80

Query: 370  RYFHCPNHIVRTLFIPTYSVKGQL------RRIPSFASSSFVEQ---LVHDRDSPLESEE 429
            R  +    + R      +  K Q       R  P F ++S  ++    V       ESEE
Sbjct: 81   RSSYSRRSLRRLSIHTVHGNKTQFFSHSSTRTPPLFTANSTAQRSGTFVEHLTGITESEE 140

Query: 430  HVFSSYSNEADGFHFENGFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKEL 489
             +     +EA+GF  +   A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+
Sbjct: 141  GI-----SEANGFG-DVESARNDIRNVATRRIETEFEVRELEELPEEWRRSKLAWLCKEV 200

Query: 490  PAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALA 549
            P  K  TL+RLLNAQKKW+ Q++ATY++VHC+RIRENET FRVY+WM QQ WYRFD+ L 
Sbjct: 201  PTHKAVTLVRLLNAQKKWVRQEDATYISVHCMRIRENETGFRVYRWMTQQNWYRFDFGLT 260

Query: 550  TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYN 609
            TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YN
Sbjct: 261  TKLAEYLGKERKFTKCREVFDDVLNQGRVPSESTFHILVVAYLSSLSVEGCLEEACSVYN 320

Query: 610  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLI 669
            RMIQLGGY+PRLSLHNSLFRAL+SK G +    LKQAEFI+HN+VT+GLE+ KDIY GLI
Sbjct: 321  RMIQLGGYKPRLSLHNSLFRALVSKQGGILNDQLKQAEFIFHNVVTTGLEVQKDIYSGLI 380

Query: 670  WLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFD 729
            WLHS QD +D  RI  LR+EM++AG +E +EV++S+LRA +K G V E ER+W +L   D
Sbjct: 381  WLHSCQDEVDIGRINSLREEMKKAGFQESKEVVVSLLRAYAKEGGVEEVERTWLELLDLD 440

Query: 730  GSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQ-LNCTSAAAYQTIIGILCKSQQIELAE 789
              +PSQAFVYK+E Y+K+G+  KA+EIFREME+ +   + + Y  II +LCK QQ+EL E
Sbjct: 441  CGIPSQAFVYKIEAYSKVGDFAKAMEIFREMEKHIGGATMSGYHKIIEVLCKVQQVELVE 500

Query: 790  SIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLV 849
            ++M  F +S  KPL+P+++++  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYLDSL 
Sbjct: 501  TLMKEFEESGKKPLLPSFIEIAKMYFDLGLHEKLEMAFVQCLEKCQPSQPIYNIYLDSLT 560

Query: 850  KVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP 909
            K+GN+ KA ++FN+M+ NG I ++ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+P
Sbjct: 561  KIGNLEKAGDVFNEMKNNGTINVSARSCNSLLKGYLDCGKQVQAERIYDLMRMKKYEIEP 620

Query: 910  PLMEKLDYVLSLSRKEVKK-PLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFL 969
            PLMEKLDY+LSL +KEVKK P S+KLSK+QRE+L+GLLLGGL+IESD+E+K+H I+FEF 
Sbjct: 621  PLMEKLDYILSLKKKEVKKRPFSMKLSKDQREVLVGLLLGGLQIESDKEKKSHMIKFEFR 680

Query: 970  KNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPS 1029
            +N  +H +L+++I++Q+ EWLHP S   + DI IP++F +V HSYFGFYA+ +WP+G P 
Sbjct: 681  ENSQAHLVLKQNIHDQFREWLHPLSNFQE-DI-IPFEFYSVPHSYFGFYAEHYWPKGQPE 740

Query: 1030 IPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRK 1089
            IP LIHRWLSP  LAYWYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+K
Sbjct: 741  IPKLIHRWLSPHSLAYWYMYSGVKTSSGDIILRLKGSLEGVEKVVKALQAKSMECRVKKK 800

Query: 1090 GNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLN-ETENINFDSQSDSVG 1137
            G ++WIGL G+N+  FWKLIEP +L+ +K+ L+  + +L+ V   E ++INF S SD   
Sbjct: 801  GKVFWIGLQGTNSALFWKLIEPHVLENLKEHLKPASESLDNVKEAEEQSINFKSNSDHSD 843

BLAST of ClCG07G001810 vs. ExPASy Swiss-Prot
Match: Q6ZHJ5 (Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=OTP51 PE=3 SV=1)

HSP 1 Score: 793.1 bits (2047), Expect = 4.1e-228
Identity = 392/737 (53.19%), Positives = 540/737 (73.27%), Query Frame = 0

Query: 396  IPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVK 455
            IP+ AS+  +E L+ D D   E E+        E   F  E   A+ + + + +P L V 
Sbjct: 53   IPAVASA--LESLILDLDDDEEDEDE-----ETEFGLFQGEAWAAADEREAVRSPELVVP 112

Query: 456  ELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENET 515
            EL+ELPEQWRRS++AWLCKELPA K  T  R+LNAQ+KW+ QD+ATY+ VHCLRIR N+ 
Sbjct: 113  ELEELPEQWRRSRIAWLCKELPAYKHSTFTRILNAQRKWITQDDATYVAVHCLRIRNNDA 172

Query: 516  AFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILI 575
            AFRVY WM++Q W+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+ESTFHILI
Sbjct: 173  AFRVYSWMVRQHWFRFNFALATRVADCLGRDGKVEKCREVFEAMVKQGRVPAESTFHILI 232

Query: 576  VAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFI 635
            VAYLS P   C+EEA TIYN+MIQ+GGY+PRLSLHNSLFRAL+SK G  +K++LKQAEF+
Sbjct: 233  VAYLSVPKGRCLEEACTIYNQMIQMGGYKPRLSLHNSLFRALVSKTGGTAKYNLKQAEFV 292

Query: 636  YHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRAS 695
            YHN+VT+ L++HKD+Y GLIWLHSYQD ID+ERI+ LRKEM+QAG  E  +VL+S++RA 
Sbjct: 293  YHNVVTTNLDVHKDVYAGLIWLHSYQDVIDRERIIALRKEMKQAGFDEGIDVLVSVMRAF 352

Query: 696  SKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNC-TSA 755
            SK G+V E E +W  +      +P QA+V +ME YA+ GEPMK+L++F+EM+  N   + 
Sbjct: 353  SKEGNVAETEATWHNILQSGSDLPVQAYVCRMEAYARTGEPMKSLDMFKEMKDKNIPPNV 412

Query: 756  AAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQ 815
            A+Y  II I+ K+ ++++ E +M  FI+S++K LMPA++DLM M+ +L +H+KLELTF +
Sbjct: 413  ASYHKIIEIMTKALEVDIVEQLMNEFIESDMKHLMPAFLDLMYMYMDLDMHEKLELTFLK 472

Query: 816  CLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGN 875
            C+ +C+PNR +Y+IYL+SLVKVGNI KAEE+F +M  NG IG N +SCNI+L GYL   +
Sbjct: 473  CIARCRPNRILYTIYLESLVKVGNIEKAEEVFGEMHNNGMIGTNTKSCNIMLRGYLSAED 532

Query: 876  YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVK-KPLSLKLSKEQREILIGLLLG 935
            Y KAEK+YD+M +KKYD+    +EKL   L L++K +K K +S+KL +EQREILIGLLLG
Sbjct: 533  YQKAEKVYDMMSKKKYDVQADSLEKLQSGLLLNKKVIKPKTVSMKLDQEQREILIGLLLG 592

Query: 936  GLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCT 995
            G  +ES  +R  H + F+F ++ N+HS+LR HI+E++ EWL  AS+  D    IPY+F T
Sbjct: 593  GTRMESYAQRGVHIVHFQFQEDSNAHSVLRVHIHERFFEWLSSASRSFDDGSKIPYQFST 652

Query: 996  VSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSH-E 1055
            + H +F F+ DQF+ +G P +P LIHRWL+P VLAYW+M+GG +  SGDI+LKL G + E
Sbjct: 653  IPHQHFSFFVDQFFLKGQPVLPKLIHRWLTPRVLAYWFMFGGSKLPSGDIVLKLSGGNSE 712

Query: 1056 GVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNL 1115
            GVE+IV SL  +S+  KVKRKG  +WIG  GSNA  FW++IEP +L+     +  +  ++
Sbjct: 713  GVERIVNSLHTQSLTSKVKRKGRFFWIGFQGSNAESFWRIIEPHVLNNFASLVTQEGSSI 772

Query: 1116 ERVLNETENINFDSQSD 1130
                + T++ + DS  D
Sbjct: 773  GS--DGTQDTDTDSDDD 780

BLAST of ClCG07G001810 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 1.7e-11
Identity = 82/341 (24.05%), Positives = 148/341 (43.40%), Query Frame = 0

Query: 545 KERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYR 604
           KE    K    + +++++G +P   T++ +I A   A     +++A  + N M++  G  
Sbjct: 208 KEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQ---AMDKAMEVLNTMVK-NGVM 267

Query: 605 PRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI 664
           P    +NS+        G  S    K+A      + + G+E     Y  L+         
Sbjct: 268 PDCMTYNSILH------GYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRC 327

Query: 665 DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQ-AF 724
            + R +F    M + G+K E     ++L+  +  G +VE       L   +G  P    F
Sbjct: 328 MEARKIF--DSMTKRGLKPEITTYGTLLQGYATKGALVEM-HGLLDLMVRNGIHPDHYVF 387

Query: 725 VYKMEVYAKMGEPMKALEIFREMEQLNCT-SAAAYQTIIGILCKSQQIELAESIMAGFIK 784
              +  YAK G+  +A+ +F +M Q     +A  Y  +IGILCKS ++E A       I 
Sbjct: 388 SILICAYAKQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMID 447

Query: 785 SNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK--CKPNRTIYSIYLDSLVKVGNIY 844
             L P    Y  L++     +  ++ E    + L++  C  N   ++  +DS  K G + 
Sbjct: 448 EGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGICL-NTIFFNSIIDSHCKEGRVI 507

Query: 845 KAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKI 882
           ++E++F  M   G +  N  + N +++GY L G   +A K+
Sbjct: 508 ESEKLFELMVRIG-VKPNVITYNTLINGYCLAGKMDEAMKL 533

BLAST of ClCG07G001810 vs. ExPASy Swiss-Prot
Match: O82178 (Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX=3702 GN=At2g35130 PE=3 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 2.2e-11
Identity = 89/433 (20.55%), Positives = 193/433 (44.57%), Query Frame = 0

Query: 469 LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQR 528
           L+++ KE    K   ++  L +    W   D+   ++V     ++ ++   V +W++++ 
Sbjct: 93  LSFIQKETDPDKVADVLGALPSTHASW---DDLINVSVQLRLNKKWDSIILVCEWILRKS 152

Query: 529 WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI 588
            ++ D      L D  G++ ++ +   ++  ++    VP+E T+ +LI AY  A   G I
Sbjct: 153 SFQPDVICFNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMA---GLI 212

Query: 589 EEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGL 648
           E A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     
Sbjct: 213 ERAEVVLVEM-QNHHVSPKTIGVTVYNAYIEGLMKRKG-----NTEEAIDVFQRMKRDRC 272

Query: 649 ELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEA 708
           +   + Y   + ++ Y           L  EM+    K       +++ A ++ G   +A
Sbjct: 273 KPTTETYN--LMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKA 332

Query: 709 ERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII 768
           E  +++L+  DG  P   +VY   ME Y++ G P  A EIF  M+ + C    A+Y  ++
Sbjct: 333 EEIFEQLQE-DGLEP-DVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 392

Query: 769 GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CK 828
               ++     AE++     +  + P M +++ L++ +       K E    +  E   +
Sbjct: 393 DAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVE 452

Query: 829 PNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEK 888
           P+  + +  L+   ++G   K E+I  +ME NG    +  + NI+++ Y   G   + E+
Sbjct: 453 PDTFVLNSMLNLYGRLGQFTKMEKILAEME-NGPCTADISTYNILINIYGKAGFLERIEE 508

Query: 889 IYDLMCQKKYDID 894
           ++  + +K +  D
Sbjct: 513 LFVELKEKNFRPD 508

BLAST of ClCG07G001810 vs. ExPASy Swiss-Prot
Match: Q5G1S8 (Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB1270 PE=2 SV=2)

HSP 1 Score: 71.6 bits (174), Expect = 6.3e-11
Identity = 68/309 (22.01%), Positives = 138/309 (44.66%), Query Frame = 0

Query: 548 KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRL 607
           KFSK +E+ D +  +GCVP   +F+ LI A L +   G     +     M++  G RP  
Sbjct: 240 KFSKAQELVDAMRQRGCVPDLISFNTLINARLKS--GGLTPNLAVELLDMVRNSGLRPDA 299

Query: 608 SLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKE 667
             +N+L  A           +L  A  ++ ++     +     Y  +I ++       + 
Sbjct: 300 ITYNTLLSACS------RDSNLDGAVKVFEDMEAHRCQPDLWTYNAMISVYGRCGLAAEA 359

Query: 668 RIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKM 727
             +F+  E++  G   +     S+L A ++  +  + +  +Q+++          +   +
Sbjct: 360 ERLFM--ELELKGFFPDAVTYNSLLYAFARERNTEKVKEVYQQMQKMGFGKDEMTYNTII 419

Query: 728 EVYAKMGEPMKALEIFREMEQLNCTS--AAAYQTIIGILCKSQQIELAESIMAGFIKSNL 787
            +Y K G+   AL+++++M+ L+  +  A  Y  +I  L K+ +   A ++M+  +   +
Sbjct: 420 HMYGKQGQLDLALQLYKDMKGLSGRNPDAITYTVLIDSLGKANRTVEAAALMSEMLDVGI 479

Query: 788 KPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE 847
           KP +  Y  L+  +      ++ E TFS  L    KP+   YS+ LD L++     KA  
Sbjct: 480 KPTLQTYSALICGYAKAGKREEAEDTFSCMLRSGTKPDNLAYSVMLDVLLRGNETRKAWG 538

Query: 848 IFNQMETNG 854
           ++  M ++G
Sbjct: 540 LYRDMISDG 538

BLAST of ClCG07G001810 vs. ExPASy TrEMBL
Match: A0A1S3CPK0 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502781 PE=4 SV=1)

HSP 1 Score: 1441.8 bits (3731), Expect = 0.0e+00
Identity = 716/797 (89.84%), Positives = 756/797 (94.86%), Query Frame = 0

Query: 341  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK-GQLRRIPSF 400
            VFSMSIP TSAF+TVTL RSLTLSLSPYH YFH PNHI+ TLFI +YSVK  QL RI +F
Sbjct: 2    VFSMSIP-TSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF 61

Query: 401  ASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDE 460
            AS SFV+QLV+DRDSP ESEEH+ S YSN  DGFHFENGFAS DLKHLGTPALEVKELDE
Sbjct: 62   ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 121

Query: 461  LPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRV 520
            LPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQ+KW+GQD+ATYLTVHCLRIRENETAFRV
Sbjct: 122  LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181

Query: 521  YKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 580
            YKWMMQQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL
Sbjct: 182  YKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 241

Query: 581  SAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL 640
            SAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Sbjct: 242  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL 301

Query: 641  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMG 700
            VTSGLELHKDIYGGLIWLHSYQDTIDKERIV LRKEMQQAGIKEE+EVLLSILRASSKMG
Sbjct: 302  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMG 361

Query: 701  DVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQT 760
            DVVEAER WQKLKY DG+MP QAFVYKMEVYAKMG+PMKALEIFREMEQLN T+AAAYQT
Sbjct: 362  DVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQT 421

Query: 761  IIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 820
            IIGILCK Q+IELAESIMAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKC
Sbjct: 422  IIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKC 481

Query: 821  KPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAE 880
            KPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIG+NARSCN+IL GYLLFGNY+KAE
Sbjct: 482  KPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAE 541

Query: 881  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES 940
            KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Sbjct: 542  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES 601

Query: 941  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYF 1000
            DEERKNHRIQFEF KN  +HS+LRRHIYEQYH+WLH ASKL+DGDIDIPYKFCTVSHSYF
Sbjct: 602  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYF 661

Query: 1001 GFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 1060
            GFYADQFWPRG  +IPNLIHRWLSP  LAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK
Sbjct: 662  GFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 721

Query: 1061 SLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNET 1120
            SLREKSMHCKVKRKG++YWIGLLGSNATWFWKLIEPFILD +K+S +AD+LNL  VLNET
Sbjct: 722  SLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNET 781

Query: 1121 ENINFDSQSDSVGEASN 1137
            ENINFDSQSDSV E SN
Sbjct: 782  ENINFDSQSDSVEETSN 796

BLAST of ClCG07G001810 vs. ExPASy TrEMBL
Match: A0A0A0LBL0 (LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100 PE=4 SV=1)

HSP 1 Score: 1426.0 bits (3690), Expect = 0.0e+00
Identity = 702/798 (87.97%), Positives = 753/798 (94.36%), Query Frame = 0

Query: 341  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK--GQLRRIPS 400
            VFSMSIP TSAF+TVT  RSLTLSLSPYH YFHCPNHI+ TLF+P YSVK   QL RI +
Sbjct: 2    VFSMSIP-TSAFSTVTRLRSLTLSLSPYHHYFHCPNHIIPTLFLPAYSVKVRRQLPRIRA 61

Query: 401  FASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELD 460
            FAS SFV+QLV+D DSP ESEEH+ SS+SN  DGFHFENGFAS DLKHLGTP LEVKELD
Sbjct: 62   FASGSFVKQLVYDHDSPSESEEHLSSSFSNGGDGFHFENGFASVDLKHLGTPVLEVKELD 121

Query: 461  ELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFR 520
            ELPEQWRRSK+AWLCKELPAQKPGT+IRLLNAQKKW+GQD+ATYL VHCLRIRENETAFR
Sbjct: 122  ELPEQWRRSKVAWLCKELPAQKPGTVIRLLNAQKKWMGQDDATYLIVHCLRIRENETAFR 181

Query: 521  VYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY 580
            VYKWMMQQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY
Sbjct: 182  VYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY 241

Query: 581  LSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHN 640
            LSAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHN
Sbjct: 242  LSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHN 301

Query: 641  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKM 700
            LVTSGLELHKD+YGGLIWLHSYQDTID+ERIV LRKEMQQAGIKEEREVLLSILRASSKM
Sbjct: 302  LVTSGLELHKDMYGGLIWLHSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKM 361

Query: 701  GDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQ 760
            GDV+EAE+ WQ+LKY DG+MPSQAFVYKMEVYAKMG+PMKALEIFREMEQLN T+AAAYQ
Sbjct: 362  GDVMEAEKLWQELKYLDGNMPSQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQ 421

Query: 761  TIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK 820
            TIIGILCK Q IELAESIMAGFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEK
Sbjct: 422  TIIGILCKFQVIELAESIMAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEK 481

Query: 821  CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKA 880
            CKPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIGINARSCNIIL GYLL GNY+KA
Sbjct: 482  CKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKA 541

Query: 881  EKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIE 940
            EKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIE
Sbjct: 542  EKIYDLMCQKRYDIDPPLMEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE 601

Query: 941  SDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSY 1000
            SD+ERKNHRIQFEF +N  +HS+LRRHIYEQYH+WLH ASKL+DGD+DIPYKFCTVSHSY
Sbjct: 602  SDDERKNHRIQFEFHRNCKTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSY 661

Query: 1001 FGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIV 1060
            FGFYADQFWPRG  +IPNLIHRWLSP VLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIV
Sbjct: 662  FGFYADQFWPRGRRAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIV 721

Query: 1061 KSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNE 1120
            KSLREKS+HCKVKRKGN+YWIGLLGSNATWFWKLIEPFILDY+K+S +AD+LNL  VLN 
Sbjct: 722  KSLREKSIHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKESTQADSLNLVGVLNG 781

Query: 1121 TENINFDSQSDSVGEASN 1137
            +ENINFDS+SDSV E SN
Sbjct: 782  SENINFDSESDSVEETSN 798

BLAST of ClCG07G001810 vs. ExPASy TrEMBL
Match: A0A6J1KB64 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493350 PE=4 SV=1)

HSP 1 Score: 1360.9 bits (3521), Expect = 0.0e+00
Identity = 675/790 (85.44%), Positives = 727/790 (92.03%), Query Frame = 0

Query: 349  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVE 408
            TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE
Sbjct: 5    TSAFATVTLLRSLTLPFSQCHNHFRCWNYVIRSLSIPTYSAKGRRQLPRIPAFASSSSVE 64

Query: 409  QLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRR 468
             LV+DRDSP ESEE + S YSN A+       FASADLKHLG PALEVKELDELPEQWRR
Sbjct: 65   ALVYDRDSPAESEEPLCSPYSNGAE------EFASADLKHLGAPALEVKELDELPEQWRR 124

Query: 469  SKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQ 528
            SKLAWLCKELPA KPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ
Sbjct: 125  SKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQ 184

Query: 529  RWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 588
             WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC
Sbjct: 185  HWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 244

Query: 589  IEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL 648
            IEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+GLEL
Sbjct: 245  IEEASTIYNRMIQLGGYPPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLVTTGLEL 304

Query: 649  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAER 708
            HKDIYGGLIWLHSYQDT+DKERI+ LRKEMQQAGI+EEREVL+SILRASSK+GDV+EAER
Sbjct: 305  HKDIYGGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAER 364

Query: 709  SWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQTIIGILCK 768
            SW K+K FDGSMPSQAFVYKMEVYAK+G PMKALEIFREMEQLN  S+AAYQTIIGILCK
Sbjct: 365  SWLKIKSFDGSMPSQAFVYKMEVYAKVGNPMKALEIFREMEQLNSISSAAYQTIIGILCK 424

Query: 769  SQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 828
             +++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY
Sbjct: 425  FEEVTLAESVMAGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 484

Query: 829  SIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMC 888
            SIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMC
Sbjct: 485  SIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMC 544

Query: 889  QKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH 948
            QKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Sbjct: 545  QKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNH 604

Query: 949  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQF 1008
            RIQFEF ++ ++HS LRRH+YEQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQF
Sbjct: 605  RIQFEFHEDCSTHSCLRRHVYEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQF 664

Query: 1009 WPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSM 1068
            WPRGHP+IPNLIHRWLSP VLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLREKSM
Sbjct: 665  WPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVVKIVKSLREKSM 724

Query: 1069 HCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDS 1128
             CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+ADNLNLE+ +NET NINFDS
Sbjct: 725  SCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADNLNLEKAVNETYNINFDS 784

Query: 1129 QSDSVGEASN 1137
            QSDS  EAS+
Sbjct: 785  QSDSDEEASS 788

BLAST of ClCG07G001810 vs. ExPASy TrEMBL
Match: A0A6J1GB98 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452602 PE=4 SV=1)

HSP 1 Score: 1352.8 bits (3500), Expect = 0.0e+00
Identity = 670/790 (84.81%), Positives = 723/790 (91.52%), Query Frame = 0

Query: 349  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVE 408
            TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE
Sbjct: 5    TSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVE 64

Query: 409  QLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRR 468
             LV+DRDSP ESEE + S YS  A+      GFASADLKHLG PALEVKELDELPEQWRR
Sbjct: 65   ALVYDRDSPAESEEPLCSPYSTGAE------GFASADLKHLGAPALEVKELDELPEQWRR 124

Query: 469  SKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQ 528
            SKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ
Sbjct: 125  SKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQ 184

Query: 529  RWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGC 588
             WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGC
Sbjct: 185  HWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGC 244

Query: 589  IEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL 648
            IEE+STIYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL T+GLEL
Sbjct: 245  IEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLEL 304

Query: 649  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAER 708
            HKDIYGGLIWLHSYQDT+DKERI+ LRKEM QAGI+EEREVL+SILRASSK+GDV+EAER
Sbjct: 305  HKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAER 364

Query: 709  SWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQTIIGILCK 768
            SW KLK FDGSMPSQAFVYKMEVYAK+G PMKA EIFREMEQLN  SAAAYQTIIGILCK
Sbjct: 365  SWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIGILCK 424

Query: 769  SQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 828
             +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY
Sbjct: 425  FEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY 484

Query: 829  SIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMC 888
            SIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMC
Sbjct: 485  SIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMC 544

Query: 889  QKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH 948
            QKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Sbjct: 545  QKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNH 604

Query: 949  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQF 1008
            RIQFEF ++ ++HS LRRHI+EQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQF
Sbjct: 605  RIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQF 664

Query: 1009 WPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSM 1068
            WPRGHP IPNLIHRWLSP VLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLREKSM
Sbjct: 665  WPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSM 724

Query: 1069 HCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDS 1128
             CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+AD+LN+E+  NET NINFDS
Sbjct: 725  SCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDS 784

Query: 1129 QSDSVGEASN 1137
            QSDS  EAS+
Sbjct: 785  QSDSDEEASS 788

BLAST of ClCG07G001810 vs. ExPASy TrEMBL
Match: A0A5N6QQ61 (LAGLIDADG_2 domain-containing protein OS=Carpinus fangiana OX=176857 GN=FH972_005424 PE=4 SV=1)

HSP 1 Score: 1087.8 bits (2812), Expect = 0.0e+00
Identity = 534/791 (67.51%), Positives = 651/791 (82.30%), Query Frame = 0

Query: 344  MSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQLRR---IPSFA 403
            +S+P  S+ +++   RSL+LSLS +HR     +H  R++F P +    + R+   + + +
Sbjct: 35   LSLPMRSSLSSL---RSLSLSLSHHHR-----SHYFRSIFAPAFCSFPKPRKFLSLRALS 94

Query: 404  SSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDEL 463
             S+ VE L  +   P   E   FS+ S+    F F+    S DLK L  PAL+VKEL +L
Sbjct: 95   RSTSVEHLACEVSRPETEELWNFSNNSDSEAAFDFDKNVGSLDLKRLEVPALDVKELGDL 154

Query: 464  PEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVY 523
            PEQWRRSKLAWLCKELPA K GTLIR+LNAQ+KW+ Q +ATY+ VHC+RIRENET F+VY
Sbjct: 155  PEQWRRSKLAWLCKELPAHKGGTLIRVLNAQRKWVRQQDATYVAVHCMRIRENETGFKVY 214

Query: 524  KWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLS 583
            KWMMQQ WY+FD+ALATKLADYMGKERKFSKCRE+FDDIINQG VP ESTFHILI+AYLS
Sbjct: 215  KWMMQQHWYQFDFALATKLADYMGKERKFSKCREIFDDIINQGRVPCESTFHILIIAYLS 274

Query: 584  APVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLV 643
            AP+Q C+EEA +IYNRMIQLGGY+P+LSLHN LFRAL+SKPG  SK +LKQAEFI+HNL+
Sbjct: 275  APIQVCLEEACSIYNRMIQLGGYQPQLSLHNCLFRALVSKPGASSKQYLKQAEFIFHNLL 334

Query: 644  TSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGD 703
            TSGLE+HKDIYGGLIWLHSYQDTID+ERI  L+KEM+ AGI+E REVLLSILRA SK  +
Sbjct: 335  TSGLEIHKDIYGGLIWLHSYQDTIDRERIASLKKEMEDAGIEEGREVLLSILRACSKESN 394

Query: 704  VVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM-EQLNCTSAAAYQT 763
            V EAER+W KL   DG +P  AFVYKMEVYAK+GEPMK+LEIFREM E+L+ TS AAY  
Sbjct: 395  VEEAERTWLKLLQLDGGIPHLAFVYKMEVYAKVGEPMKSLEIFREMQEKLSSTSIAAYHE 454

Query: 764  IIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 823
            II +LCK+Q++ELAES+M  FIKSNLKPL P+Y+D+MN++FNL+LHDKLEL FSQ L+KC
Sbjct: 455  IIQVLCKAQEVELAESLMIEFIKSNLKPLTPSYIDMMNLYFNLNLHDKLELAFSQSLDKC 514

Query: 824  KPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAE 883
            +PN T+YSIYLDSLV +GN+ KAEEIFNQM +NG IG+N+RSCN IL GYL  G+Y+KAE
Sbjct: 515  RPNCTMYSIYLDSLVTIGNLDKAEEIFNQMRSNGAIGVNSRSCNTILRGYLSSGDYVKAE 574

Query: 884  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES 943
            KIYDLMCQKKY ID PLMEK+DYVLSLSR+ VKKP+S+KLSKEQREIL+GLLLGGL+IES
Sbjct: 575  KIYDLMCQKKYQIDSPLMEKIDYVLSLSRQHVKKPVSMKLSKEQREILVGLLLGGLQIES 634

Query: 944  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYF 1003
            DEERK+H ++FEF +N ++H +L+RHI++QYHEWLHP+ K  +G  DIP +FCT+SHSYF
Sbjct: 635  DEERKSHMLRFEFRENSSTHYVLKRHIHDQYHEWLHPSCKPGEGADDIPCRFCTISHSYF 694

Query: 1004 GFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 1063
            GFYADQFWP+G P IP LIHRWLSP VLAYWYMYGG RTSSGDILL+LKG+HEGVEK+  
Sbjct: 695  GFYADQFWPKGRPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLRLKGNHEGVEKVAN 754

Query: 1064 SLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNET 1123
            +L EKS+ C++KRKG+++WIG LGSN+ WFWKLIEP++LD MKD L+A    LE    ET
Sbjct: 755  ALMEKSLDCRMKRKGSVFWIGFLGSNSLWFWKLIEPYVLDDMKDFLKAGVATLENSSVET 814

Query: 1124 ENINFDSQSDS 1131
            ++I++DS S+S
Sbjct: 815  QDIDYDSGSES 817

BLAST of ClCG07G001810 vs. TAIR 10
Match: AT2G15820.1 (endonucleases)

HSP 1 Score: 894.4 bits (2310), Expect = 9.3e-260
Identity = 462/844 (54.74%), Positives = 623/844 (73.82%), Query Frame = 0

Query: 310  SSCSLADDLKSSQIRLSLPILHFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYH 369
            SS SLA    SS   +S+   +  S+ + P++ + S          TLFRSL+ SL   H
Sbjct: 21   SSFSLA---SSSSSTVSVTTFNISSLSSNPNIINSS---------STLFRSLSFSLI-RH 80

Query: 370  RYFHCPNHIVRTLFIPTYSVKGQL------RRIPSFASSSFVEQ---LVHDRDSPLESEE 429
            R  +    + R      +  K Q       R  P F ++S  ++    V       ESEE
Sbjct: 81   RSSYSRRSLRRLSIHTVHGNKTQFFSHSSTRTPPLFTANSTAQRSGTFVEHLTGITESEE 140

Query: 430  HVFSSYSNEADGFHFENGFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKEL 489
             +     +EA+GF  +   A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+
Sbjct: 141  GI-----SEANGFG-DVESARNDIRNVATRRIETEFEVRELEELPEEWRRSKLAWLCKEV 200

Query: 490  PAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALA 549
            P  K  TL+RLLNAQKKW+ Q++ATY++VHC+RIRENET FRVY+WM QQ WYRFD+ L 
Sbjct: 201  PTHKAVTLVRLLNAQKKWVRQEDATYISVHCMRIRENETGFRVYRWMTQQNWYRFDFGLT 260

Query: 550  TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYN 609
            TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YN
Sbjct: 261  TKLAEYLGKERKFTKCREVFDDVLNQGRVPSESTFHILVVAYLSSLSVEGCLEEACSVYN 320

Query: 610  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLI 669
            RMIQLGGY+PRLSLHNSLFRAL+SK G +    LKQAEFI+HN+VT+GLE+ KDIY GLI
Sbjct: 321  RMIQLGGYKPRLSLHNSLFRALVSKQGGILNDQLKQAEFIFHNVVTTGLEVQKDIYSGLI 380

Query: 670  WLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFD 729
            WLHS QD +D  RI  LR+EM++AG +E +EV++S+LRA +K G V E ER+W +L   D
Sbjct: 381  WLHSCQDEVDIGRINSLREEMKKAGFQESKEVVVSLLRAYAKEGGVEEVERTWLELLDLD 440

Query: 730  GSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQ-LNCTSAAAYQTIIGILCKSQQIELAE 789
              +PSQAFVYK+E Y+K+G+  KA+EIFREME+ +   + + Y  II +LCK QQ+EL E
Sbjct: 441  CGIPSQAFVYKIEAYSKVGDFAKAMEIFREMEKHIGGATMSGYHKIIEVLCKVQQVELVE 500

Query: 790  SIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLV 849
            ++M  F +S  KPL+P+++++  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYLDSL 
Sbjct: 501  TLMKEFEESGKKPLLPSFIEIAKMYFDLGLHEKLEMAFVQCLEKCQPSQPIYNIYLDSLT 560

Query: 850  KVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP 909
            K+GN+ KA ++FN+M+ NG I ++ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+P
Sbjct: 561  KIGNLEKAGDVFNEMKNNGTINVSARSCNSLLKGYLDCGKQVQAERIYDLMRMKKYEIEP 620

Query: 910  PLMEKLDYVLSLSRKEVKK-PLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFL 969
            PLMEKLDY+LSL +KEVKK P S+KLSK+QRE+L+GLLLGGL+IESD+E+K+H I+FEF 
Sbjct: 621  PLMEKLDYILSLKKKEVKKRPFSMKLSKDQREVLVGLLLGGLQIESDKEKKSHMIKFEFR 680

Query: 970  KNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPS 1029
            +N  +H +L+++I++Q+ EWLHP S   + DI IP++F +V HSYFGFYA+ +WP+G P 
Sbjct: 681  ENSQAHLVLKQNIHDQFREWLHPLSNFQE-DI-IPFEFYSVPHSYFGFYAEHYWPKGQPE 740

Query: 1030 IPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRK 1089
            IP LIHRWLSP  LAYWYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+K
Sbjct: 741  IPKLIHRWLSPHSLAYWYMYSGVKTSSGDIILRLKGSLEGVEKVVKALQAKSMECRVKKK 800

Query: 1090 GNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLN-ETENINFDSQSDSVG 1137
            G ++WIGL G+N+  FWKLIEP +L+ +K+ L+  + +L+ V   E ++INF S SD   
Sbjct: 801  GKVFWIGLQGTNSALFWKLIEPHVLENLKEHLKPASESLDNVKEAEEQSINFKSNSDHSD 843

BLAST of ClCG07G001810 vs. TAIR 10
Match: AT2G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 73.2 bits (178), Expect = 1.5e-12
Identity = 89/433 (20.55%), Positives = 193/433 (44.57%), Query Frame = 0

Query: 469 LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQR 528
           L+++ KE    K   ++  L +    W   D+   ++V     ++ ++   V +W++++ 
Sbjct: 93  LSFIQKETDPDKVADVLGALPSTHASW---DDLINVSVQLRLNKKWDSIILVCEWILRKS 152

Query: 529 WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI 588
            ++ D      L D  G++ ++ +   ++  ++    VP+E T+ +LI AY  A   G I
Sbjct: 153 SFQPDVICFNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMA---GLI 212

Query: 589 EEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGL 648
           E A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     
Sbjct: 213 ERAEVVLVEM-QNHHVSPKTIGVTVYNAYIEGLMKRKG-----NTEEAIDVFQRMKRDRC 272

Query: 649 ELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEA 708
           +   + Y   + ++ Y           L  EM+    K       +++ A ++ G   +A
Sbjct: 273 KPTTETYN--LMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKA 332

Query: 709 ERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII 768
           E  +++L+  DG  P   +VY   ME Y++ G P  A EIF  M+ + C    A+Y  ++
Sbjct: 333 EEIFEQLQE-DGLEP-DVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 392

Query: 769 GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CK 828
               ++     AE++     +  + P M +++ L++ +       K E    +  E   +
Sbjct: 393 DAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVE 452

Query: 829 PNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEK 888
           P+  + +  L+   ++G   K E+I  +ME NG    +  + NI+++ Y   G   + E+
Sbjct: 453 PDTFVLNSMLNLYGRLGQFTKMEKILAEME-NGPCTADISTYNILINIYGKAGFLERIEE 508

Query: 889 IYDLMCQKKYDID 894
           ++  + +K +  D
Sbjct: 513 LFVELKEKNFRPD 508

BLAST of ClCG07G001810 vs. TAIR 10
Match: AT2G35130.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 73.2 bits (178), Expect = 1.5e-12
Identity = 89/433 (20.55%), Positives = 193/433 (44.57%), Query Frame = 0

Query: 469 LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQR 528
           L+++ KE    K   ++  L +    W   D+   ++V     ++ ++   V +W++++ 
Sbjct: 115 LSFIQKETDPDKVADVLGALPSTHASW---DDLINVSVQLRLNKKWDSIILVCEWILRKS 174

Query: 529 WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI 588
            ++ D      L D  G++ ++ +   ++  ++    VP+E T+ +LI AY  A   G I
Sbjct: 175 SFQPDVICFNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMA---GLI 234

Query: 589 EEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGL 648
           E A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     
Sbjct: 235 ERAEVVLVEM-QNHHVSPKTIGVTVYNAYIEGLMKRKG-----NTEEAIDVFQRMKRDRC 294

Query: 649 ELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEA 708
           +   + Y   + ++ Y           L  EM+    K       +++ A ++ G   +A
Sbjct: 295 KPTTETYN--LMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKA 354

Query: 709 ERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII 768
           E  +++L+  DG  P   +VY   ME Y++ G P  A EIF  M+ + C    A+Y  ++
Sbjct: 355 EEIFEQLQE-DGLEP-DVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 414

Query: 769 GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CK 828
               ++     AE++     +  + P M +++ L++ +       K E    +  E   +
Sbjct: 415 DAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVE 474

Query: 829 PNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEK 888
           P+  + +  L+   ++G   K E+I  +ME NG    +  + NI+++ Y   G   + E+
Sbjct: 475 PDTFVLNSMLNLYGRLGQFTKMEKILAEME-NGPCTADISTYNILINIYGKAGFLERIEE 530

Query: 889 IYDLMCQKKYDID 894
           ++  + +K +  D
Sbjct: 535 LFVELKEKNFRPD 530

BLAST of ClCG07G001810 vs. TAIR 10
Match: AT3G18110.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 71.6 bits (174), Expect = 4.5e-12
Identity = 68/309 (22.01%), Positives = 138/309 (44.66%), Query Frame = 0

Query: 548 KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRL 607
           KFSK +E+ D +  +GCVP   +F+ LI A L +   G     +     M++  G RP  
Sbjct: 240 KFSKAQELVDAMRQRGCVPDLISFNTLINARLKS--GGLTPNLAVELLDMVRNSGLRPDA 299

Query: 608 SLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKE 667
             +N+L  A           +L  A  ++ ++     +     Y  +I ++       + 
Sbjct: 300 ITYNTLLSACS------RDSNLDGAVKVFEDMEAHRCQPDLWTYNAMISVYGRCGLAAEA 359

Query: 668 RIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKM 727
             +F+  E++  G   +     S+L A ++  +  + +  +Q+++          +   +
Sbjct: 360 ERLFM--ELELKGFFPDAVTYNSLLYAFARERNTEKVKEVYQQMQKMGFGKDEMTYNTII 419

Query: 728 EVYAKMGEPMKALEIFREMEQLNCTS--AAAYQTIIGILCKSQQIELAESIMAGFIKSNL 787
            +Y K G+   AL+++++M+ L+  +  A  Y  +I  L K+ +   A ++M+  +   +
Sbjct: 420 HMYGKQGQLDLALQLYKDMKGLSGRNPDAITYTVLIDSLGKANRTVEAAALMSEMLDVGI 479

Query: 788 KPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE 847
           KP +  Y  L+  +      ++ E TFS  L    KP+   YS+ LD L++     KA  
Sbjct: 480 KPTLQTYSALICGYAKAGKREEAEDTFSCMLRSGTKPDNLAYSVMLDVLLRGNETRKAWG 538

Query: 848 IFNQMETNG 854
           ++  M ++G
Sbjct: 540 LYRDMISDG 538

BLAST of ClCG07G001810 vs. TAIR 10
Match: AT5G08310.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 67.0 bits (162), Expect = 1.1e-10
Identity = 62/274 (22.63%), Positives = 119/274 (43.43%), Query Frame = 0

Query: 516 AFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILI 575
           A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I
Sbjct: 89  AYLFFNWASKQEGYRNDMYAYNAMASILSRARQNASLKALVVDVLNSRCFMSPGAFGFFI 148

Query: 576 VAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFI 635
               +A   G ++EAS++++R+ ++G   P    +N L  A       +SK +    E +
Sbjct: 149 RCLGNA---GLVDEASSVFDRVREMGLCVPNAYTYNCLLEA-------ISKSNSSSVELV 208

Query: 636 YHNL-VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRA 695
              L        H D +     L  Y +T   ER + +  E+   G  +E  +   ++ +
Sbjct: 209 EARLKEMRDCGFHFDKFTLTPVLQVYCNTGKSERALSVFNEILSRGWLDE-HISTILVVS 268

Query: 696 SSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTS- 755
             K G V +A    + L+  D  +  + +   +  + K     KA ++F +M ++   + 
Sbjct: 269 FCKWGQVDKAFELIEMLEERDIRLNYKTYCVLIHGFVKESRIDKAFQLFEKMRRMGMNAD 328

Query: 756 AAAYQTIIGILCKSQQIELAESIMAGFIKSNLKP 788
            A Y  +IG LCK + +E+A S+     +S + P
Sbjct: 329 IALYDVLIGGLCKHKDLEMALSLYLEIKRSGIPP 351
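
Each HSP header above reports identity and positive counts as a fraction followed by a percentage. As a minimal sketch (the function names `parse_hsp_stats` and `recomputed_percent` are illustrative and not part of any BLAST tool), the statistics line can be parsed and the percentages recomputed from the raw counts, which is a quick consistency check when copying values out of a report:

```python
import re

# Matches "Label = matched/total (percent%)" fields in an HSP statistics line.
# The label is captured generically, so the field names are not hard-coded.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matched, total, reported_percent)} for one statistics line."""
    stats = {}
    for label, matched, total, pct in FIELD.findall(line):
        stats[label] = (int(matched), int(total), float(pct))
    return stats


def recomputed_percent(matched, total):
    """Percentage of matched columns, rounded to two decimals as in the report."""
    return round(100.0 * matched / total, 2)


# Statistics line of the AT2G15820.1 hit shown above.
line = "Identity = 462/844 (54.74%), Positives = 623/844 (73.82%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (matched, total, reported) in stats.items():
    print(label, reported, recomputed_percent(matched, total))
```

The "Query Frame" field carries no fraction, so the pattern skips it. For the hits listed here the recomputed percentages agree with the reported ones.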

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008465080.1  0.0e+00   89.84     PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic ...
XP_004152074.2  0.0e+00   87.97     pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sa...
XP_038887990.1  0.0e+00   89.75     pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa ...
XP_022998786.1  0.0e+00   85.44     pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita ...
XP_022949171.1  0.0e+00   84.81     pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita ...

Match Name      E-value   Identity  Description
Q9XIL5          1.3e-258  54.74     Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidop...
Q6ZHJ5          4.1e-228  53.19     Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa...
Q76C99          1.7e-11   24.05     Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...
O82178          2.2e-11   20.55     Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX...
Q5G1S8          6.3e-11   22.01     Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidop...

Match Name      E-value   Identity  Description
A0A1S3CPK0      0.0e+00   89.84     pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis ...
A0A0A0LBL0      0.0e+00   87.97     LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100...
A0A6J1KB64      0.0e+00   85.44     pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit...
A0A6J1GB98      0.0e+00   84.81     pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit...
A0A5N6QQ61      0.0e+00   67.51     LAGLIDADG_2 domain-containing protein OS=Carpinus fangiana OX=176857 GN=FH972_00...

Match Name      E-value   Identity  Description
AT2G15820.1     9.3e-260  54.74     endonucleases
AT2G35130.1     1.5e-12   20.55     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G35130.2     1.5e-12   20.55     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G18110.1     4.5e-12   22.01     Pentatricopeptide repeat (PPR) superfamily protein
AT5G08310.1     1.1e-10   22.63     Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                                    Source       Source Term  Source Description                                                Alignment
None       No IPR available                                   COILS        Coil         Coil                                                              coord: 1102..1122
None       No IPR available                                   MOBIDB_LITE  mobidb-lite  disorder_prediction                                               coord: 9..52
None       No IPR available                                   MOBIDB_LITE  mobidb-lite  disorder_prediction                                               coord: 9..24
None       No IPR available                                   PANTHER      PTHR47539    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN OTP51, CHLOROPLASTIC  coord: 355..1131
None       No IPR available                                   SUPERFAMILY  81901        HCP-like                                                          coord: 697..891
IPR027434  Homing endonuclease                                GENE3D       3.10.28.10   Homing endonucleases                                              coord: 1013..1110; e-value: 3.7E-11; score: 45.3
IPR027434  Homing endonuclease                                GENE3D       3.10.28.10   Homing endonucleases                                              coord: 907..1011; e-value: 5.1E-18; score: 67.1
IPR027434  Homing endonuclease                                SUPERFAMILY  55608        Homing endonucleases                                              coord: 915..1102
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535      PPR                                                               coord: 825..853; e-value: 0.0063; score: 16.7
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535      PPR                                                               coord: 728..751; e-value: 0.0049; score: 17.0
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535      PPR                                                               coord: 861..888; e-value: 0.033; score: 14.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                                                         coord: 825..853; e-value: 5.3E-4; score: 18.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                                                         coord: 540..568; e-value: 6.8E-4; score: 17.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                                                         coord: 861..892; e-value: 5.1E-4; score: 18.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain                                   coord: 473..625; e-value: 3.4E-16; score: 61.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain                                   coord: 673..806; e-value: 1.6E-8; score: 36.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain                                   coord: 808..906; e-value: 2.9E-12; score: 48.7
IPR004860  Homing endonuclease, LAGLIDADG                     PFAM         PF03161      LAGLIDADG_2                                                       coord: 924..1089; e-value: 2.3E-43; score: 148.1
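
The alignment coordinates above are inclusive residue ranges on the protein, so domain lengths and nesting can be worked out by simple arithmetic. The following is a minimal sketch, assuming the `coord: start..end` text form shown in the table; the helper names (`parse_coord`, `span_length`, `contains`) are hypothetical and do not belong to InterProScan or any other tool.

```python
import re

# "coord: 924..1089" -> (924, 1089); coordinates are 1-based and inclusive.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract the (start, end) residue range from an alignment string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Number of residues covered by an inclusive range."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Values taken from the table: the Pfam LAGLIDADG_2 hit and the
# SUPERFAMILY "Homing endonucleases" hit.
laglidadg = parse_coord("coord: 924..1089; e-value: 2.3E-43; score: 148.1")
homing_sf = parse_coord("coord: 915..1102")

print(span_length(laglidadg))          # residues covered by the Pfam hit
print(contains(homing_sf, laglidadg))  # is the Pfam hit nested in the superfamily hit?
```

With these values the Pfam LAGLIDADG_2 hit covers 166 residues and falls entirely inside the SUPERFAMILY homing endonuclease range.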

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG07G001810.1  ClCG07G001810.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0010239      chloroplast mRNA processing
biological_process  GO:0000373      Group II intron splicing
biological_process  GO:0045292      mRNA cis splicing, via spliceosome
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0048564      photosystem I assembly
biological_process  GO:0006388      tRNA splicing, via endonucleolytic cleavage and ligation
cellular_component  GO:0009507      chloroplast
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0005515      protein binding