Clc02G05770 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc02G05770
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Location: ClcChr02: 5360884 .. 5368595 (+)
RNA-Seq Expression: Clc02G05770
Synteny: Clc02G05770
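
As a quick check on the Location field, the length of the gene span can be derived from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for GFF-style annotations, but not stated on this page); the function name `span_length` is purely illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field: ClcChr02: 5360884 .. 5368595 (+)
gene_length = span_length(5360884, 5368595)
print(gene_length)
```

Under that assumption the gene region, introns included, spans 7712 bp.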
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTCTTCTTCTTCTTCTCTTTTGCATTCGTCTTCTTTCACTTCTTCTTGTTATCTGCTTTCTGTTGGTTTTGCATTGTTGTTTTCTTTATCTTCTTATTCTTTCATTTTTTTCCATGTTTTTCTTCTTCTTCTTCCTTTTTTTACTTTTGTTCGATAGTTTTTTTTTCCTCTCCCTTATTTGTTCAGTAGTTTTTTTACGAAGAAAAAAATTGCAGTTGTATCTAATGAAGAACTAGAAGAAGGAGTGAGAAGTTAGAGCGAGCATTGAAAAGTTAGAGCGTTAATCCAAACAGGGAGAAAAAATCAAAGAAAATATTGAAGTTGTGTGTTTGATACGTGGGCCATTTTTCATTTGGCTCCCGAAAAAAAATTCCCTTGACATGGCAGTTGGTAAGTTCGGCTGAGATGTACAAAGGCAAATACAACAGAGACAAAACCTTTGTCACTAGCAAGCTTCTAAAGTTGCCGTTGTATTGTTTCCTCCTCCTCCTCTTCTTCTTCCAATGAAGCCTTTACCCTGATTCAACCCACTGCCTGCCCTTTTAAGTTAAGCAGTATATTCATATGTAAATGAAGCATTTGAAACATGGGTTACTGTGCCATGCCCAAGCCATCAAATCCGGATTTACACCGACCATTTTCACGTCGAACCAACTCATAACCTTATATGCGAAACATGGTTTTCTTGCTGATGCCCAGAAATTGTTCGACGAAATGCCTGAACGAAATGTCTTCTCATGGAATGCTATAATAGCAGCGTATATTAAGTCTCAAAACTTGAGACAAGCGCGGGCGTTATTTGATTGTGCTGTCTATAGAGATTTGGTCACTTACAACTCTATGCTGTCTGGTTATGTTAGCTCTGATGGGTATGAGGCTCAAGCACTTGGGTTTTTTGTGGAAATGCAAACGGCCCCTGATTTGATTAGAATTGATGAATTCAGTCTCACAATCATGCTTAATTTAACTGCTAAGTTATGTGTGGTTTCTTATGGAAAGCAGTTGCATTCCTTCATGTTGAAGACTGCTAATGATTTAAGTGTGTTTGCTGCTAGTTCGTTGATTGATATGTACTCTAAGTGTGGGTTTTTTAAAGAAGCCTGTAGAGTTTATTATGGATGTGGTGAGGTAGTTGATTTGGTCTGTAGAAATGCCATGGTTGCAGCTTGTTGTAGAGAAGGGGAGATAGATGTGGCTATGAATCTTTTCTGGAAGGAATTGGAGCGAAATGACGTTGTAGCGTGGAACACAATGATTTCAGGTTTTGTTCAAAATGGTTATGAGGAAGAATCATTGCAGTTATTTGTTCGTATGGCGGATGAAAAGGTTGGATGGAATGAACACACTTTTGCAAGTATCTTGAGTGCTTGCTCCAATCTGAGGAACTTGAAGCTTGGAAAGGAAGTCCATGCTTATGTTTTGAAGAATGGGCTGATTCTCAATCCCTTCATTGGTAGTGGACTTGTTGATGTTTATTGCAAGTGCAGTAACATGAGGTATGCAGAGTCAGTTAATTCAGAATTGAGGACGCTGAATGTATATTCGATCACTTCAATGATTGTTGGCTATTCATCTCAAGGTAACATGGCAGAAGCCAGAAAGCTTTTTGATTCCTTGGATGAAAAGAATTCTGTGGTGTGGACTGCTTTATTTTCTGGGTATGTTAAGTTACAGCAGTGTGAAGCAGTTCTTGAACTTTTAAGTGAATATAGGAAGGAGGCAACAGTTCCTGATGTGCTACTTCTTATCAGCATAATTGGCGCTTGTGCTATACAAGCTGCTCTGGCTCCTGGGAAGCAGATACACGGTTACATGCTCCGAGCAGGCATCGAACTTGATACGAAACTGACCAGTTCATTGGTTGATATGTACTCAAAATGTGGAAGTATCATTTATGCAGAAAGGATGTTTAGAGAGGTTACTGATAAGGATTCCATTCTTTACAACATTATGATAGCTGGCTATGCTCACCATGGGTGGGAAAATGAAGCAGTCCAGCT
TTTCAAGGAAATGATGGAGAATGATCTCAGACCAGATGCAATCACTTTTGTTGCACTACTTTCTGCATGTCGACACGGCGGTTTGGTAGAACTAGGTGAACATTTTTTCGATTCTATGTCTAGGGATCACAATATTAGTCCTGAAATCGATCATTATGCTTGTATGATTGATTTGTATGGAAGGGCTAATCAACTAGATAAGGCATTGGAATTCATGAAAAAGATTCCCATACAGTTAGATGCTGTCATATGGGGATCATTTCTGAATGCTTGTAGGATCAATGGGAATGCTGAACTTGCAAGAAAAGCAGAAGATAAACTGTTGATAATCGAAGGAGAAAACGGAGCTCGATACGTGCAGTTAGCTAATGTCTATGCTGCAGAAGGAAACTGGGAGGAGATGGGAAGAATAAGGAAGAAAATGAAAGGAAAGGAGGTTAAGAAGAATGCTGGTTGTAGTTGGGTTTTTGTGGAAAATAAGTTCCATGTATTCACGTCTGGTGATAGATTTCACCCAAAAAATGAGGCTATATATTTAACCTTAGCCTCCTTGACTGATGAGCTACTTCACAAAGAGGAAGCATTTTGTTAATGTCAAGCTTCAGCCATTAATTTGTAGTCTGGAATGGCCATGTTTCAGATTTGGAATGGTTTATACATTTGCTTTATTCGAGGTATTTTCTTTTTAAACAAAACTCTCTTTTTTCTAGTAAATAGTTATCAAACTGAGGTTTGAATTCAATTTAAGATATTGATTCTTAAGTTAGATTCATATGTGAAAGATATCATACTTACAACATCACAATCACTTTATTTCGAAGTTGGAAGCATTTTTTAGGTTACATTACGAATTCAGCCCTTATGATTTGAATAAAAATTAGAATTTAGTCCATATGATTTTAAAAGTTAGAATTTGGTCCTTTATGGTTTGAAGAAAGCTAAAATTTAGTCCTTACTTTAGAGACTATTAATGAGAGTTTTAACAAACTATAAGGACTAAATTATAATTTTAAACCGTTCCAAATTATAGGAACTAAATTTATAATTTAACTCATTTTTTAATATATGATTATTGAATGCGATTATTTTTCTACTCATTTTAACAGCTGACTTTCCTTTTTCAGGTTACTATCTATACCAATTCAAAGTTTTGCAGTTTCATTTCCACACAATTTCCATCTACAATGTATGTTTTCTAACTACCTTTGTATTATTTATACTTAATGTTGGGTTTTTCATATATAAATTTATAAGCAATATTTTGTTCAATAGATTGCATCATACTAAATTTAGAATTAATTATTTTTAATTAAAATTAATAGAAATTAGATTAAAGAATTAATTTTTATATTAATCCTTTTTGTGGCTTAAGATGCTAATTTTCAATAGTATTCGAAACTCATTTTCTTTCTTTCTGGAAATCTGCCTCGGCATTGGTGTATAATTCATTCTTGGAGTTCGTAAAGTTCTCTCCCTTCCAAATGTGAGTGTATATTAATATTATGATTTTTTTTAGTAAATTTGTTTTTAAATATATTAAAATGAATCAAATATTTATAAATACGATCACATTTAGCTATATTTAAAAATATATATATATTTTAATATTTACTCATTTAATATGAACTTTATATGTCAAAAAAAATTTAGATATGTTTTTTTTTCAAATATCACTCGGTTAACGTTATACAAATAGAAAAAAGATTTAGATCATTTTTTTTTTTAATATCCACTCATTTAACATGAGACAAATATGGAAAAAATTTAGATAGTTTTTTCTTTTTTTTTTTATTTTAATATCTATCCATTTAACGTGAAACAAATACGAAAAAAAATTTAGGTAGATTAATTTTTAATATCCACTCATTTAATTTAATGTAGAACAAATATGCAAAAAATATTATCAATTTTGATTTTCTTTTCCTTAAAAAATTATGTGCGATGCTCAAAATTAGAGGAAAAATTATTTAATAAATATGAAATCACCTAAGATAACCATC
TAACCCAAAATCAAAAGTTTCTCTCTATAAGCCCTTTTAGATAATATAGAAGATATACATAGATTACCCTTTTTAATACTTATATTTTTTATTAAAAAATATTAAATTAAGTTGAATATTGTTTAAACTCTATAAATGTCAAAGTGAGCATGACTTAACTGAGATAAGATTGGTTTAACGACCATTCAAGATATTCGTGATTGGAATCTAGGTTTGGCAAAAAAACTCGCGGGGTCGAGAAATCCTCGATCAAACCGGGGATGGAGGTTAAATCGTTCGGGTACTCGCGACAGGGCAGGGATGGGGAGGGTCTCTCGACCCCGAATCCCTGATACCAACGTATTTAATTTATAGTTGACTCTGGCGCTCTCACATCATTTCCTCACTCACAATCACTCTCTCTCTTGCTCTCTCATTCTCTCTTGCTCTCGCTCTCGTCTCTGTCTCGCTCACTCTCTTGCTCTCACTCACTCTCTCTCTCGCCCCAACCCAAGTCCCAACCAAACCCAAAAGATTTCTTAGCTTGTGAATTTTGTTTATTTTTTGTGGAGTAGTCCCTTAACTTATTAATTGGTAGTAATTTGGTTCCTCTCCGAACAACCTCTCATAGTCTCGCTCTCGCTCTTGCCCTCTCAATGGCTCGCTCTCGCTCTCGCTCTCGCTCTATCTCGGGGAATGGGTCCCCACGGGGACCTGTTTCCCCGACAAAGAATTCCCGCCCCCGTCTCCGCCTATTATAAGCGGGAGACGGGGGCGGGGACGGGGAGCCCCCACCTGCCTCGCCTCGGCCCCGTTGCCAACCCTATTGGAATCTCCTATTCCTAATGTACTAAGAATACAAACAGAAGATCTACAATTATTTTATGTATTTTATAGTACATATTTTTGTAATTTGTAGGAAATAAGTGAACTTTTTGATAGAATAGGAAGAAAAATAGTTCAATTGATCAAGACTTGTACACCTAAATGTTTGAATCCTATTTATAATAAATACTGCATTTGTATTTTACAGCCACCCAGCACGGCACAGCTTGAAAATTCCACGGCTTGTGATCCTATTCCCCTTCATCAATTAATTAAACCATCAACGATTACTAATCAATGCTAATCTATGAATCTTTACCTGCCTCCCTGCTTCCCTCTACCAAGTCAACTCTGTAGAACCAGCAGCCTTTGAGTTCTATGAATCCCAGATTGGCCCAAATTTTATAGCACCGATCTGGTCCTTTTTTCTTCGTACTCCCTTACATTCCGGTGTTTAATGACTTGAGCTTTCAATTGAGGGTAATCGTCATTTTCGCCCTTCTGGGTCCCTCTGAAACTTTCGGTTTGTGCTGAACGGGAAGCCATGAATGGCCGTAGAGACGGGCCGTTGATGAGGAACTCCTCGCAAGGATCCACCAAATCCAAGATTGGGACTGCCATATTTATCGGCGTCCTTCTTGGATTCGCTTTTGCCTTCTTGTTCCCTGGTGGGATCTTCACATCGAGCTCTCTCTCAATTCGTGATTCTCGTTCCGTGAAAACCCAGGTTTTTGTTTACACTTGCCCTGTTTTCTGCTTTCTCGGTTTTCCCCCCCTTCCTTCATTACCCTTCTCGATTTGAGCCTCTTGTTGAATCTTCTTGTGTGCTTAATTCTTTTGTTCAGAATGGTGAAATTGGGTCTGTTGAATGCGTTGATTAAAACTTTGTTAATTTCAATTGATTTGGAGGTGGGGTGTATTAATGTTTACAAGTTCCAGGCTACCTAAAATAACTTACTGTACTGAGTTTAGGTGTTCGATTGCTGTGCATTTCCAATTCCATTAATATTTTGGTTCGTTATCTCGATGAGTTGGTTTCTGTGCATTATTTTTCATTTTGAGAATGTCACCATTTAAAAAACTGGTGGAGATTCAAATGTTTAAGTCAGTAATTGATGTGCATTATTTGGTTCTCCTTCATATATGCTTAAGTTCCATTTAGTGAATCTTCGGGATGATAGCCAGATTATTTGAAT
GGGTTAATATAAGCCTTTTATGCATCTCTGGTGTGCATAATTGTGACTTCATATGTTAGATGTTTTAAATTTTGTTCATCCTATGTTCTGTCTTCTGATACCATGATTTTGTAGAAGTATAGCCGATATTGACATGGTCTGATTGTAATTTTATACGCTGGTTATGCACTAGGCTGATTCAAGTTCATGCGATTCATCTGAACGGATTGACATGTTAAAATCTGAGTTCATAACAGCATCGGAGAAGAATGCTCAGCTGGAAAAACAGATCAGGGAATTAACTGAAAAGCTTAAGTTGGCTGAGCAAGGTAAAGATCATGCACAAAAGCAAGTTCTTGCTTTAGGTAAACGGTCAAAAGCCGGGCCTTTTGGTACTGTAAAAGGTTTAAGAACAAACCCACCTGTTATCCCTGATGAGTCTGTCAATCCAAGATTAGCAAAGATATTAGAGAAAGTTGCTGTTAATAGAGAACTTATAGTCGCGGTAGCGAATTCAAATGTGAAGGCGATGTTAGAATTATGGTTCACTAGCATCAAGAAAGCAGGCATACCTAATTATCTTGTGGTTGCTTTGGACGATGAAATAGTTCAATTCTGCAAAACAAATGATGTTCCAGTGTATAAGAGAGATCCTGATGAAAAAGTTGATTCAATCGGAAGAACAGGAGGGAACCATGCTGTCTCGGGGACAAAATTCCGCATCTTGAGGGAGTTTCTGCAGTTGGGATATGCTGTTTTACTATCTGATGTAGACATTATTTACTTGCAAAATCCTTTCAATCATCTATATCGGGACTCAGATGTAGAATCGATGACTGATGGTCATGACAATGCTACTGCTTACGGATACAACCATGTCTTTGAGGAACCTGCAATGGGTTGGGCTCGATTCGCGCACACGATGCGTATCTGGGTTTATAACTCAGGCTTCTTCTATATTAGACCAACTATTCCTGCAATTGAGCTTTTAGATCGTGTGGCTAGCCGGCTTTCACGAGAACAGAACTCCTGGGACCAGGCTGTTTTCAATGAGGAACTGTTTTTCCCTTCGCATTCAAACTACGAGGGGCTTTATGCCTCTAGGAGAACTATGGATTTCTATCTTTTCATGAATAGTAAGGTTCTATTCAAGACTGTAAGGAAGGATGATAATCTGAAAAAGTTGAAGCCAGTCATTGTTCATGTGAATTACCATCCCGATAAGTTTCCTCGAATGAAAGCCGTGGTCGACTTCTACGTCAATGGAAAGCAGGATGCACTGAATCCTTTCCCTGATGGTTCAGATTGGTGAACTTCTTAATTTATGCACTTCAGATCATAGGATTTTACTTGGACAAAACATTGGCTCAAAAAACATCAAAGTTGGATGTTTTCACAAACTTTCAGGTCAAATGAGACTTGAGAGGAGCCTGCTCTTTTCAAATCAAAGTGATTATTGTTGTCTTCCACATGATTTTTTGTACTGCTTTATTTTTAAAAAAGAAGGAAAAAGGAAAAAATTACTCAGGAAAGAAGAGGCCTTCTTGGTTGTAACTTGTATCTTTAAAAAGTTTTCTTAGTATGGCCACAGAATGAACAATGGATGTTAGACTCTAATGGCTTTCACTTTCTCTGGCTCAATTGGTTTTGGTTTGTTCCTTTGCCAATAGTGTTGTGTATAGTTGCATCTTGCATGGAAAGTCTGAATCTTGATGAATTTATTTCTTGCAGGC

mRNA sequence

TTCTTCTTCTTCTTCTCTTTTGCATTCGTCTTCTTTCACTTCTTCTTGTTATCTGCTTTCTGTTGGTTTTGCATTGTTGTTTTCTTTATCTTCTTATTCTTTCATTTTTTTCCATGTTTTTCTTCTTCTTCTTCCTTTTTTTACTTTTGTTCGATAGTTTTTTTTTCCTCTCCCTTATTTGTTCAGTAGTTTTTTTACGAAGAAAAAAATTGCAGTTGTATCTAATGAAGAACTAGAAGAAGGAGTGAGAAGTTAGAGCGAGCATTGAAAAGTTAGAGCGTTAATCCAAACAGGGAGAAAAAATCAAAGAAAATATTGAAGTTGTGTGTTTGATACGTGGGCCATTTTTCATTTGGCTCCCGAAAAAAAATTCCCTTGACATGGCAGTTGGTAAGTTCGGCTGAGATGTACAAAGGCAAATACAACAGAGACAAAACCTTTGTCACTAGCAAGCTTCTAAAGTTGCCGTTGTATTGTTTCCTCCTCCTCCTCTTCTTCTTCCAATGAAGCCTTTACCCTGATTCAACCCACTGCCTGCCCTTTTAAGTTAAGCAGTATATTCATATGTAAATGAAGCATTTGAAACATGGGTTACTGTGCCATGCCCAAGCCATCAAATCCGGATTTACACCGACCATTTTCACGTCGAACCAACTCATAACCTTATATGCGAAACATGGTTTTCTTGCTGATGCCCAGAAATTGTTCGACGAAATGCCTGAACGAAATGTCTTCTCATGGAATGCTATAATAGCAGCGTATATTAAGTCTCAAAACTTGAGACAAGCGCGGGCGTTATTTGATTGTGCTGTCTATAGAGATTTGGTCACTTACAACTCTATGCTGTCTGGTTATGTTAGCTCTGATGGGTATGAGGCTCAAGCACTTGGGTTTTTTGTGGAAATGCAAACGGCCCCTGATTTGATTAGAATTGATGAATTCAGTCTCACAATCATGCTTAATTTAACTGCTAAGTTATGTGTGGTTTCTTATGGAAAGCAGTTGCATTCCTTCATGTTGAAGACTGCTAATGATTTAAGTGTGTTTGCTGCTAGTTCGTTGATTGATATGTACTCTAAGTGTGGGTTTTTTAAAGAAGCCTGTAGAGTTTATTATGGATGTGGTGAGGTAGTTGATTTGGTCTGTAGAAATGCCATGGTTGCAGCTTGTTGTAGAGAAGGGGAGATAGATGTGGCTATGAATCTTTTCTGGAAGGAATTGGAGCGAAATGACGTTGTAGCGTGGAACACAATGATTTCAGGTTTTGTTCAAAATGGTTATGAGGAAGAATCATTGCAGTTATTTGTTCGTATGGCGGATGAAAAGGTTGGATGGAATGAACACACTTTTGCAAGTATCTTGAGTGCTTGCTCCAATCTGAGGAACTTGAAGCTTGGAAAGGAAGTCCATGCTTATGTTTTGAAGAATGGGCTGATTCTCAATCCCTTCATTGGTAGTGGACTTGTTGATGTTTATTGCAAGTGCAGTAACATGAGGTATGCAGAGTCAGTTAATTCAGAATTGAGGACGCTGAATGTATATTCGATCACTTCAATGATTGTTGGCTATTCATCTCAAGGTAACATGGCAGAAGCCAGAAAGCTTTTTGATTCCTTGGATGAAAAGAATTCTGTGGTGTGGACTGCTTTATTTTCTGGGTATGTTAAGTTACAGCAGTGTGAAGCAGTTCTTGAACTTTTAAGTGAATATAGGAAGGAGGCAACAGTTCCTGATGTGCTACTTCTTATCAGCATAATTGGCGCTTGTGCTATACAAGCTGCTCTGGCTCCTGGGAAGCAGATACACGGTTACATGCTCCGAGCAGGCATCGAACTTGATACGAAACTGACCAGTTCATTGGTTGATATGTACTCAAAATGTGGAAGTATCATTTATGCAGAAAGGATGTTTAGAGAGGTTACTGATAAGGATTCCATTCTTTACAACATTATGATAGCTGGCTATGCTCACCATGGGTGGGAAAATGAAGCAGTCCAGCT
TTTCAAGGAAATGATGGAGAATGATCTCAGACCAGATGCAATCACTTTTGTTGCACTACTTTCTGCATGTCGACACGGCGGTTTGGTAGAACTAGGTGAACATTTTTTCGATTCTATGTCTAGGGATCACAATATTAGTCCTGAAATCGATCATTATGCTTGTATGATTGATTTGTATGGAAGGGCTAATCAACTAGATAAGGCATTGGAATTCATGAAAAAGATTCCCATACAGTTAGATGCTGTCATATGGGGATCATTTCTGAATGCTTGTAGGATCAATGGGAATGCTGAACTTGCAAGAAAAGCAGAAGATAAACTGTTGATAATCGAAGGAGAAAACGGAGCTCGATACGTGCAGTTAGCTAATGTCTATGCTGCAGAAGGAAACTGGGAGGAGATGGGAAGAATAAGGAAGAAAATGAAAGGAAAGGAGGTTAAGAAGAATGCTGGTTGTAGTTGGGTTTTTGTGGAAAATAAGTTCCATGTATTCACGTCTGGTGATAGATTTCACCCAAAAAATGAGGCTATATATTTAACCTTAGCCTCCTTGACTGATGAGCTACTTCACAAAGAGGAAGCATTTTGTTAATGTCAAGCTTCAGCCATTAATTTGTAGTCTGGAATGGCCATGTTTCAGATTTGGAATGGTTTATACATTTGCTTTATTCGAGGTTACTATCTATACCAATTCAAAGTTTTGCAGTTTCATTTCCACACAATTTCCATCTACAATGCTGATTCAAGTTCATGCGATTCATCTGAACGGATTGACATGTTAAAATCTGAGTTCATAACAGCATCGGAGAAGAATGCTCAGCTGGAAAAACAGATCAGGGAATTAACTGAAAAGCTTAAGTTGGCTGAGCAAGGTAAAGATCATGCACAAAAGCAAGTTCTTGCTTTAGGTAAACGGTCAAAAGCCGGGCCTTTTGGTACTGTAAAAGGTTTAAGAACAAACCCACCTGTTATCCCTGATGAGTCTGTCAATCCAAGATTAGCAAAGATATTAGAGAAAGTTGCTGTTAATAGAGAACTTATAGTCGCGGTAGCGAATTCAAATGTGAAGGCGATGTTAGAATTATGGTTCACTAGCATCAAGAAAGCAGGCATACCTAATTATCTTGTGGTTGCTTTGGACGATGAAATAGTTCAATTCTGCAAAACAAATGATGTTCCAGTGTATAAGAGAGATCCTGATGAAAAAGTTGATTCAATCGGAAGAACAGGAGGGAACCATGCTGTCTCGGGGACAAAATTCCGCATCTTGAGGGAGTTTCTGCAGTTGGGATATGCTGTTTTACTATCTGATGTAGACATTATTTACTTGCAAAATCCTTTCAATCATCTATATCGGGACTCAGATGTAGAATCGATGACTGATGGTCATGACAATGCTACTGCTTACGGATACAACCATGTCTTTGAGGAACCTGCAATGGGTTGGGCTCGATTCGCGCACACGATGCGTATCTGGGTTTATAACTCAGGCTTCTTCTATATTAGACCAACTATTCCTGCAATTGAGCTTTTAGATCGTGTGGCTAGCCGGCTTTCACGAGAACAGAACTCCTGGGACCAGGCTGTTTTCAATGAGGAACTGTTTTTCCCTTCGCATTCAAACTACGAGGGGCTTTATGCCTCTAGGAGAACTATGGATTTCTATCTTTTCATGAATAGTAAGGTTCTATTCAAGACTGTAAGGAAGGATGATAATCTGAAAAAGTTGAAGCCAGTCATTGTTCATGTGAATTACCATCCCGATAAGTTTCCTCGAATGAAAGCCGTGGTCGACTTCTACGTCAATGGAAAGCAGGATGCACTGAATCCTTTCCCTGATGGTTCAGATTGGTGAACTTCTTAATTTATGCACTTCAGATCATAGGATTTTACTTGGACAAAACATTGGCTCAAAAAACATCAAAGTTGGATGTTTTCACAAACTTTCAGGTCAAATGAGACTTGAGAGGAGCCTGCTCTTTTCAAATCAAAGTGATTAT
TGTTGTCTTCCACATGATTTTTTGTACTGCTTTATTTTTAAAAAAGAAGGAAAAAGGAAAAAATTACTCAGGAAAGAAGAGGCCTTCTTGGTTGTAACTTGTATCTTTAAAAAGTTTTCTTAGTATGGCCACAGAATGAACAATGGATGTTAGACTCTAATGGCTTTCACTTTCTCTGGCTCAATTGGTTTTGGTTTGTTCCTTTGCCAATAGTGTTGTGTATAGTTGCATCTTGCATGGAAAGTCTGAATCTTGATGAATTTATTTCTTGCAGGC

Coding sequence (CDS)

ATGAAGCATTTGAAACATGGGTTACTGTGCCATGCCCAAGCCATCAAATCCGGATTTACACCGACCATTTTCACGTCGAACCAACTCATAACCTTATATGCGAAACATGGTTTTCTTGCTGATGCCCAGAAATTGTTCGACGAAATGCCTGAACGAAATGTCTTCTCATGGAATGCTATAATAGCAGCGTATATTAAGTCTCAAAACTTGAGACAAGCGCGGGCGTTATTTGATTGTGCTGTCTATAGAGATTTGGTCACTTACAACTCTATGCTGTCTGGTTATGTTAGCTCTGATGGGTATGAGGCTCAAGCACTTGGGTTTTTTGTGGAAATGCAAACGGCCCCTGATTTGATTAGAATTGATGAATTCAGTCTCACAATCATGCTTAATTTAACTGCTAAGTTATGTGTGGTTTCTTATGGAAAGCAGTTGCATTCCTTCATGTTGAAGACTGCTAATGATTTAAGTGTGTTTGCTGCTAGTTCGTTGATTGATATGTACTCTAAGTGTGGGTTTTTTAAAGAAGCCTGTAGAGTTTATTATGGATGTGGTGAGGTAGTTGATTTGGTCTGTAGAAATGCCATGGTTGCAGCTTGTTGTAGAGAAGGGGAGATAGATGTGGCTATGAATCTTTTCTGGAAGGAATTGGAGCGAAATGACGTTGTAGCGTGGAACACAATGATTTCAGGTTTTGTTCAAAATGGTTATGAGGAAGAATCATTGCAGTTATTTGTTCGTATGGCGGATGAAAAGGTTGGATGGAATGAACACACTTTTGCAAGTATCTTGAGTGCTTGCTCCAATCTGAGGAACTTGAAGCTTGGAAAGGAAGTCCATGCTTATGTTTTGAAGAATGGGCTGATTCTCAATCCCTTCATTGGTAGTGGACTTGTTGATGTTTATTGCAAGTGCAGTAACATGAGGTATGCAGAGTCAGTTAATTCAGAATTGAGGACGCTGAATGTATATTCGATCACTTCAATGATTGTTGGCTATTCATCTCAAGGTAACATGGCAGAAGCCAGAAAGCTTTTTGATTCCTTGGATGAAAAGAATTCTGTGGTGTGGACTGCTTTATTTTCTGGGTATGTTAAGTTACAGCAGTGTGAAGCAGTTCTTGAACTTTTAAGTGAATATAGGAAGGAGGCAACAGTTCCTGATGTGCTACTTCTTATCAGCATAATTGGCGCTTGTGCTATACAAGCTGCTCTGGCTCCTGGGAAGCAGATACACGGTTACATGCTCCGAGCAGGCATCGAACTTGATACGAAACTGACCAGTTCATTGGTTGATATGTACTCAAAATGTGGAAGTATCATTTATGCAGAAAGGATGTTTAGAGAGGTTACTGATAAGGATTCCATTCTTTACAACATTATGATAGCTGGCTATGCTCACCATGGGTGGGAAAATGAAGCAGTCCAGCTTTTCAAGGAAATGATGGAGAATGATCTCAGACCAGATGCAATCACTTTTGTTGCACTACTTTCTGCATGTCGACACGGCGGTTTGGTAGAACTAGGTGAACATTTTTTCGATTCTATGTCTAGGGATCACAATATTAGTCCTGAAATCGATCATTATGCTTGTATGATTGATTTGTATGGAAGGGCTAATCAACTAGATAAGGCATTGGAATTCATGAAAAAGATTCCCATACAGTTAGATGCTGTCATATGGGGATCATTTCTGAATGCTTGTAGGATCAATGGGAATGCTGAACTTGCAAGAAAAGCAGAAGATAAACTGTTGATAATCGAAGGAGAAAACGGAGCTCGATACGTGCAGTTAGCTAATGTCTATGCTGCAGAAGGAAACTGGGAGGAGATGGGAAGAATAAGGAAGAAAATGAAAGGAAAGGAGGTTAAGAAGAATGCTGGTTGTAGTTGGGTTTTTGTGGAAAATAAGTTCCATGTATTCACGTCTGGTGATAGATTTCACCCAAAAAATGAGGCTATATATTTAACCTTAGCCTCCTTGACTGATGAGCTACTTCA
CAAAGAGGAAGCATTTTGTTAA

Protein sequence

MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIRIDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLTDELLHKEEAFC
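
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not state which table was used. The helper name `translate` is hypothetical, and unknown codons are rendered as `X` by choice.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), ordered by first, second, third base over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGAAGCATTTGAAACATGGGTTACTGTGC"))
```

The printed peptide, MKHLKHGLLC, matches the first ten residues of the protein sequence. Passing the full CDS (with its line break) to the same function should reproduce the whole protein under the stated assumption.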
Homology
BLAST of Clc02G05770 vs. NCBI nr
Match: XP_038887152.1 (putative pentatricopeptide repeat-containing protein At3g18840 [Benincasa hispida])

HSP 1 Score: 1248.4 bits (3229), Expect = 0.0e+00
Identity = 612/671 (91.21%), Positives = 645/671 (96.13%), Query Frame = 0

Query: 3    HLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIA 62
            +L HGLL   QAIKSGFTPTIFTSNQLITLYAKHG L DAQK+FDEMPERNVFSWNAIIA
Sbjct: 416  YLIHGLLSLTQAIKSGFTPTIFTSNQLITLYAKHGLLNDAQKMFDEMPERNVFSWNAIIA 475

Query: 63   AYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIRID 122
            AY+KSQNL QARALFD AVYRDLVTYNSMLSGYVSSDGYEAQALG F +MQTAPD+IRID
Sbjct: 476  AYLKSQNLTQARALFDSAVYRDLVTYNSMLSGYVSSDGYEAQALGLFRKMQTAPDMIRID 535

Query: 123  EFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYY 182
            + SLTIMLNLTAKLCVVSYGKQLHSFM+KTANDLSVFAASSLIDMYSKCG+FK+ACRVYY
Sbjct: 536  DVSLTIMLNLTAKLCVVSYGKQLHSFMMKTANDLSVFAASSLIDMYSKCGYFKDACRVYY 595

Query: 183  GCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESL 242
            GCGEVVDLV RNAMVAACCREGEI+VAM+LFWKELERND VAWNTMISGFVQNGYE+ESL
Sbjct: 596  GCGEVVDLVSRNAMVAACCREGEINVAMDLFWKELERNDAVAWNTMISGFVQNGYEQESL 655

Query: 243  QLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVY 302
            +LFVRMADE VGWNEHTFAS+LSACSNLR+LK GKEVHAYVLKNGLI+NPFIGSGLVDVY
Sbjct: 656  KLFVRMADETVGWNEHTFASVLSACSNLRSLKYGKEVHAYVLKNGLIVNPFIGSGLVDVY 715

Query: 303  CKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFS 362
            CKC+NMRYAESVNSELR  NVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFS
Sbjct: 716  CKCNNMRYAESVNSELRMRNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFS 775

Query: 363  GYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIEL 422
            GYVKLQQCEAV ELL+EYRKEATVPDVL+L+SIIGACAIQAALAPGKQIHGY++RAGI+L
Sbjct: 776  GYVKLQQCEAVFELLTEYRKEATVPDVLILLSIIGACAIQAALAPGKQIHGYIVRAGIKL 835

Query: 423  DTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMM 482
            DTKLTSSLVDMYSKCGSIIYAER+FREVTDKDSI+YNIMIAGYAHHGWEN+AVQLFKEMM
Sbjct: 836  DTKLTSSLVDMYSKCGSIIYAERIFREVTDKDSIIYNIMIAGYAHHGWENDAVQLFKEMM 895

Query: 483  ENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQL 542
            ENDL+PDAITFVALLSACRHGG VELGEHFFDSMS DHNI PEIDHYACMIDLYGRANQL
Sbjct: 896  ENDLKPDAITFVALLSACRHGGSVELGEHFFDSMSSDHNIIPEIDHYACMIDLYGRANQL 955

Query: 543  DKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVY 602
            DKALEFMK IPIQLDAVIWG+FLNACRINGNAELARKAEDKLL+IEGENGARYVQLANVY
Sbjct: 956  DKALEFMKTIPIQLDAVIWGAFLNACRINGNAELARKAEDKLLVIEGENGARYVQLANVY 1015

Query: 603  AAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLT 662
            AAEG+WE+MGRIRKKMKGKEVKKNAGCSWVFVENKFHVF SGDRFH K+EAIY TLASLT
Sbjct: 1016 AAEGDWEQMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFISGDRFHSKSEAIYSTLASLT 1075

Query: 663  DELLHKEEAFC 674
            DELL KEE+FC
Sbjct: 1076 DELLDKEESFC 1086
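
The percentages in each HSP header are the match counts divided by the alignment length. A small sketch using the counts reported for the XP_038887152.1 hit (612 identical and 645 similar positions out of 671 aligned columns); the function name `percent` is illustrative and not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the HSP header of the first hit.
identity = percent(612, 671)
positives = percent(645, 671)
print(identity, positives)
```

This prints 91.21 and 96.13, the values shown in the header.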

BLAST of Clc02G05770 vs. NCBI nr
Match: XP_016900344.1 (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g18840 [Cucumis melo])

HSP 1 Score: 1244.2 bits (3218), Expect = 0.0e+00
Identity = 609/672 (90.62%), Positives = 641/672 (95.39%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MKHLKHGLLCH Q IKSGFTPTIFTSNQLI  YAKHG L DAQKLFDEMPERNVFSWNAI
Sbjct: 1   MKHLKHGLLCHVQGIKSGFTPTIFTSNQLINFYAKHGLLNDAQKLFDEMPERNVFSWNAI 60

Query: 61  IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
           I+AYIKSQNLR ARALFD AVY+DLVTYNSMLSGY  SDGYE +ALGFF+EMQTAPD+IR
Sbjct: 61  ISAYIKSQNLRTARALFDSAVYKDLVTYNSMLSGYARSDGYEGKALGFFMEMQTAPDMIR 120

Query: 121 IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
           IDEFSL IMLNLTAKLCV+SYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV
Sbjct: 121 IDEFSLIIMLNLTAKLCVISYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180

Query: 181 YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
           YYGCGEVVD V RNAMVAACCREGEIDVA++LFWKELE+NDVVAWNTMISGFVQNGYEEE
Sbjct: 181 YYGCGEVVDSVSRNAMVAACCREGEIDVALDLFWKELEQNDVVAWNTMISGFVQNGYEEE 240

Query: 241 SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
           SL+LFVRMADEKVGWNEHTFAS+LSACSNLR+LKLGKEVH YVLKN LI NPFI SGLVD
Sbjct: 241 SLKLFVRMADEKVGWNEHTFASVLSACSNLRSLKLGKEVHTYVLKNRLIANPFICSGLVD 300

Query: 301 VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
           VYCKC+NMRYAESV+SELR  NVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL
Sbjct: 301 VYCKCNNMRYAESVHSELRMQNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360

Query: 361 FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
           F GYVKLQQCEAV ELLSEYRKEA VPDVL+LISIIGACAIQAALAPGKQIH YMLRAGI
Sbjct: 361 FFGYVKLQQCEAVFELLSEYRKEAKVPDVLILISIIGACAIQAALAPGKQIHSYMLRAGI 420

Query: 421 ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
           +LDTKLTSSLVDMYSKCGSIIYAER+F   +DKDSI+YNIMIAGYAHHGWENEAVQLF+E
Sbjct: 421 KLDTKLTSSLVDMYSKCGSIIYAERIFXRSSDKDSIIYNIMIAGYAHHGWENEAVQLFEE 480

Query: 481 MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
           M+ENDL+PDAITFVALLSACRHGGLVELGEHFFDSMS D+NI+PEIDHYACMIDLYGRAN
Sbjct: 481 MVENDLKPDAITFVALLSACRHGGLVELGEHFFDSMSNDYNINPEIDHYACMIDLYGRAN 540

Query: 541 QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
           QLDKALEFM+KIPIQLDAVIWG+FLNACRINGNAELAR+AED+LL+IEGENGARYVQLAN
Sbjct: 541 QLDKALEFMRKIPIQLDAVIWGAFLNACRINGNAELAREAEDELLVIEGENGARYVQLAN 600

Query: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
           VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVE+KFHVF SGDRFH KNEAIY TLAS
Sbjct: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVESKFHVFISGDRFHSKNEAIYSTLAS 660

Query: 661 LTDELLHKEEAF 673
           LTDELL+K+EAF
Sbjct: 661 LTDELLYKDEAF 672

BLAST of Clc02G05770 vs. NCBI nr
Match: XP_031745080.1 (putative pentatricopeptide repeat-containing protein At3g18840 [Cucumis sativus])

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 606/673 (90.04%), Positives = 639/673 (94.95%), Query Frame = 0

Query: 1    MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
            MKHLKHGLLCH Q IKSGFTPTIF SNQLIT YAKHG L DAQKLFDEMPERNVFSWNAI
Sbjct: 424  MKHLKHGLLCHLQGIKSGFTPTIFMSNQLITFYAKHGLLNDAQKLFDEMPERNVFSWNAI 483

Query: 61   IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
            IAAY+KS NLRQARALFD AV +DLVTYNSMLSGY  SDGY+ QALGFF+EMQTAPD+IR
Sbjct: 484  IAAYVKSHNLRQARALFDSAVCKDLVTYNSMLSGYARSDGYQGQALGFFMEMQTAPDMIR 543

Query: 121  IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
            IDEF+L  MLNLTAKLCV+SYGKQLHSFMLKTANDL+VFAASSLIDMYSKCGFFKEACRV
Sbjct: 544  IDEFTLITMLNLTAKLCVISYGKQLHSFMLKTANDLTVFAASSLIDMYSKCGFFKEACRV 603

Query: 181  YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
            YYGCGEVVD V RNAMVAACCREGEIDVA++LFWKELE+NDVVAWNTMISGFVQNGYEEE
Sbjct: 604  YYGCGEVVDSVSRNAMVAACCREGEIDVALDLFWKELEQNDVVAWNTMISGFVQNGYEEE 663

Query: 241  SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
            SL+LFVRMADEKVGWNEHTFAS+LSACSNLR+LKLGKEVHAYVLKN LI NPFI SGLVD
Sbjct: 664  SLKLFVRMADEKVGWNEHTFASVLSACSNLRSLKLGKEVHAYVLKNRLIANPFICSGLVD 723

Query: 301  VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
            VYCKC+NMRYA+SVNSELR  NVYSITSMIVGYSSQGNMAEARKLFDSLDEKNS VWTAL
Sbjct: 724  VYCKCNNMRYAKSVNSELRMQNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSAVWTAL 783

Query: 361  FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
            F GYVKLQQCEAV ELLSEYRKEA VPDVL+LISIIGACAIQAAL PGKQIH YMLRAGI
Sbjct: 784  FFGYVKLQQCEAVFELLSEYRKEAKVPDVLILISIIGACAIQAALVPGKQIHSYMLRAGI 843

Query: 421  ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
            +LDTKLTSSLVDMYSKCGSIIYAER+FREVTDKDSI+YNIMIAGYAHHGWENEAVQLFKE
Sbjct: 844  KLDTKLTSSLVDMYSKCGSIIYAERIFREVTDKDSIIYNIMIAGYAHHGWENEAVQLFKE 903

Query: 481  MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
            M+++  +PDAITFVALLSACRHGGLVELGEHFFDSMS D+NI PEIDHYACMIDLYGRAN
Sbjct: 904  MVKHGFKPDAITFVALLSACRHGGLVELGEHFFDSMSNDYNICPEIDHYACMIDLYGRAN 963

Query: 541  QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
            QLDKALEFM+KIPIQLDAVIWG+FLNACRINGNAELARKAED+LL+IEGENG+RYVQLAN
Sbjct: 964  QLDKALEFMRKIPIQLDAVIWGAFLNACRINGNAELARKAEDELLVIEGENGSRYVQLAN 1023

Query: 601  VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
            VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVE+KFHVF SGDRFH KNEAIY TLAS
Sbjct: 1024 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVESKFHVFISGDRFHSKNEAIYSTLAS 1083

Query: 661  LTDELLHKEEAFC 674
            LTDELL++EEAFC
Sbjct: 1084 LTDELLYREEAFC 1096

BLAST of Clc02G05770 vs. NCBI nr
Match: KGN44321.1 (hypothetical protein Csa_015981 [Cucumis sativus])

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 606/673 (90.04%), Positives = 639/673 (94.95%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MKHLKHGLLCH Q IKSGFTPTIF SNQLIT YAKHG L DAQKLFDEMPERNVFSWNAI
Sbjct: 1   MKHLKHGLLCHLQGIKSGFTPTIFMSNQLITFYAKHGLLNDAQKLFDEMPERNVFSWNAI 60

Query: 61  IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
           IAAY+KS NLRQARALFD AV +DLVTYNSMLSGY  SDGY+ QALGFF+EMQTAPD+IR
Sbjct: 61  IAAYVKSHNLRQARALFDSAVCKDLVTYNSMLSGYARSDGYQGQALGFFMEMQTAPDMIR 120

Query: 121 IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
           IDEF+L  MLNLTAKLCV+SYGKQLHSFMLKTANDL+VFAASSLIDMYSKCGFFKEACRV
Sbjct: 121 IDEFTLITMLNLTAKLCVISYGKQLHSFMLKTANDLTVFAASSLIDMYSKCGFFKEACRV 180

Query: 181 YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
           YYGCGEVVD V RNAMVAACCREGEIDVA++LFWKELE+NDVVAWNTMISGFVQNGYEEE
Sbjct: 181 YYGCGEVVDSVSRNAMVAACCREGEIDVALDLFWKELEQNDVVAWNTMISGFVQNGYEEE 240

Query: 241 SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
           SL+LFVRMADEKVGWNEHTFAS+LSACSNLR+LKLGKEVHAYVLKN LI NPFI SGLVD
Sbjct: 241 SLKLFVRMADEKVGWNEHTFASVLSACSNLRSLKLGKEVHAYVLKNRLIANPFICSGLVD 300

Query: 301 VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
           VYCKC+NMRYA+SVNSELR  NVYSITSMIVGYSSQGNMAEARKLFDSLDEKNS VWTAL
Sbjct: 301 VYCKCNNMRYAKSVNSELRMQNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSAVWTAL 360

Query: 361 FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
           F GYVKLQQCEAV ELLSEYRKEA VPDVL+LISIIGACAIQAAL PGKQIH YMLRAGI
Sbjct: 361 FFGYVKLQQCEAVFELLSEYRKEAKVPDVLILISIIGACAIQAALVPGKQIHSYMLRAGI 420

Query: 421 ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
           +LDTKLTSSLVDMYSKCGSIIYAER+FREVTDKDSI+YNIMIAGYAHHGWENEAVQLFKE
Sbjct: 421 KLDTKLTSSLVDMYSKCGSIIYAERIFREVTDKDSIIYNIMIAGYAHHGWENEAVQLFKE 480

Query: 481 MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
           M+++  +PDAITFVALLSACRHGGLVELGEHFFDSMS D+NI PEIDHYACMIDLYGRAN
Sbjct: 481 MVKHGFKPDAITFVALLSACRHGGLVELGEHFFDSMSNDYNICPEIDHYACMIDLYGRAN 540

Query: 541 QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
           QLDKALEFM+KIPIQLDAVIWG+FLNACRINGNAELARKAED+LL+IEGENG+RYVQLAN
Sbjct: 541 QLDKALEFMRKIPIQLDAVIWGAFLNACRINGNAELARKAEDELLVIEGENGSRYVQLAN 600

Query: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
           VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVE+KFHVF SGDRFH KNEAIY TLAS
Sbjct: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVESKFHVFISGDRFHSKNEAIYSTLAS 660

Query: 661 LTDELLHKEEAFC 674
           LTDELL++EEAFC
Sbjct: 661 LTDELLYREEAFC 673

BLAST of Clc02G05770 vs. NCBI nr
Match: XP_022136238.1 (putative pentatricopeptide repeat-containing protein At3g18840 [Momordica charantia])

HSP 1 Score: 1199.5 bits (3102), Expect = 0.0e+00
Identity = 591/673 (87.82%), Positives = 628/673 (93.31%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MKHLKHG LCH QAIKSGFTPTIFTSNQLI+LYAKHG L +AQKLFDEMPERNVFSWNAI
Sbjct: 5   MKHLKHGFLCHIQAIKSGFTPTIFTSNQLISLYAKHGHLRNAQKLFDEMPERNVFSWNAI 64

Query: 61  IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
           IAAYIKS NL QARALFD A YRDLVTYNSMLSGYVSSDGYEA AL  FVEMQTAPD+IR
Sbjct: 65  IAAYIKSHNLGQARALFDSASYRDLVTYNSMLSGYVSSDGYEAHALELFVEMQTAPDMIR 124

Query: 121 IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
           IDEF+LTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCG FKEA RV
Sbjct: 125 IDEFTLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGCFKEAYRV 184

Query: 181 YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
           Y GCG+VVDLV RNAMVAACCREGEIDVA++LFWKELERND VAWNTMISGFVQNGYEEE
Sbjct: 185 YDGCGDVVDLVSRNAMVAACCREGEIDVAVDLFWKELERNDNVAWNTMISGFVQNGYEEE 244

Query: 241 SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
           SL+LFVRM DE V WNEHTFAS+LSACSNLR+LKLGKEVHAYVLKNGLI+NPFIGSGLVD
Sbjct: 245 SLKLFVRMGDENVRWNEHTFASVLSACSNLRSLKLGKEVHAYVLKNGLIVNPFIGSGLVD 304

Query: 301 VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
           VYCKC+NM YAESVN+ +   N YSITSMIVGYSSQGNMAEARKLFDSLDEKN+VVWTAL
Sbjct: 305 VYCKCNNMEYAESVNANM-ARNAYSITSMIVGYSSQGNMAEARKLFDSLDEKNTVVWTAL 364

Query: 361 FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
           FSGYVKLQQCEAV ELLSEYR+EA VPDVL+L+SIIGACAIQAALAPGKQIHGY+LRAG+
Sbjct: 365 FSGYVKLQQCEAVFELLSEYREEAAVPDVLILVSIIGACAIQAALAPGKQIHGYILRAGV 424

Query: 421 ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
           ELD KL SSLVDMYSKCGSII+A R+FR+VTDKDSILYNIMIAGYAHHGWEN+AV LFKE
Sbjct: 425 ELDKKLASSLVDMYSKCGSIIFAARIFRQVTDKDSILYNIMIAGYAHHGWENKAVLLFKE 484

Query: 481 MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
           MMEN L+PDA+TFVALLSACRH GLVELG+HFF+SM+ +HN+SPEIDHYACMIDLYGRAN
Sbjct: 485 MMENGLKPDAVTFVALLSACRHCGLVELGDHFFESMTNEHNMSPEIDHYACMIDLYGRAN 544

Query: 541 QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
           QL+KAL FMK IPI+LDAVIWG+FLNACRINGN ELA++AED+LLIIEGENGARYVQLAN
Sbjct: 545 QLEKALAFMKSIPIELDAVIWGAFLNACRINGNTELAKEAEDELLIIEGENGARYVQLAN 604

Query: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
           VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVF S DR H  NEAIY TLAS
Sbjct: 605 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFMSSDRCHSANEAIYSTLAS 664

Query: 661 LTDELLHKEEAFC 674
           LTDELL  EEAFC
Sbjct: 665 LTDELLDIEEAFC 676

BLAST of Clc02G05770 vs. ExPASy Swiss-Prot
Match: Q9LHN5 (Putative pentatricopeptide repeat-containing protein At3g18840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E92 PE=3 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 1.4e-215
Identity = 367/675 (54.37%), Positives = 499/675 (73.93%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MK LK G L H ++IKSG T T  +SNQL+ LY+K G L +A+ +FDEM ERNV+SWNA+
Sbjct: 1   MKCLKDGFLHHIRSIKSGSTLTAVSSNQLVNLYSKSGLLREARNVFDEMLERNVYSWNAV 60

Query: 61  IAAYIKSQNLRQARALFDC-AVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEM-QTAPDL 120
           IAAY+K  N+++AR LF+     RDL+TYN++LSG+  +DG E++A+  F EM +   D 
Sbjct: 61  IAAYVKFNNVKEARELFESDNCERDLITYNTLLSGFAKTDGCESEAIEMFGEMHRKEKDD 120

Query: 121 IRIDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEAC 180
           I ID+F++T M+ L+AKL  V YG+QLH  ++KT ND + FA SSLI MYSKCG FKE C
Sbjct: 121 IWIDDFTVTTMVKLSAKLTNVFYGEQLHGVLVKTGNDGTKFAVSSLIHMYSKCGKFKEVC 180

Query: 181 RVYYG-CGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGY 240
            ++ G C E VD V RNAM+AA CREG+ID A+++FW+  E ND ++WNT+I+G+ QNGY
Sbjct: 181 NIFNGSCVEFVDSVARNAMIAAYCREGDIDKALSVFWRNPELNDTISWNTLIAGYAQNGY 240

Query: 241 EEESLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSG 300
           EEE+L++ V M +  + W+EH+F ++L+  S+L++LK+GKEVHA VLKNG   N F+ SG
Sbjct: 241 EEEALKMAVSMEENGLKWDEHSFGAVLNVLSSLKSLKIGKEVHARVLKNGSYSNKFVSSG 300

Query: 301 LVDVYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVW 360
           +VDVYCKC NM+YAES +      N+YS +SMIVGYSSQG M EA++LFDSL EKN VVW
Sbjct: 301 IVDVYCKCGNMKYAESAHLLYGFGNLYSASSMIVGYSSQGKMVEAKRLFDSLSEKNLVVW 360

Query: 361 TALFSGYVKLQQCEAVLELLSEY-RKEATVPDVLLLISIIGACAIQAALAPGKQIHGYML 420
           TA+F GY+ L+Q ++VLEL   +   E   PD L+++S++GAC++QA + PGK+IHG+ L
Sbjct: 361 TAMFLGYLNLRQPDSVLELARAFIANETNTPDSLVMVSVLGACSLQAYMEPGKEIHGHSL 420

Query: 421 RAGIELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQ 480
           R GI +D KL ++ VDMYSKCG++ YAER+F    ++D+++YN MIAG AHHG E ++ Q
Sbjct: 421 RTGILMDKKLVTAFVDMYSKCGNVEYAERIFDSSFERDTVMYNAMIAGCAHHGHEAKSFQ 480

Query: 481 LFKEMMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLY 540
            F++M E   +PD ITF+ALLSACRH GLV  GE +F SM   +NISPE  HY CMIDLY
Sbjct: 481 HFEDMTEGGFKPDEITFMALLSACRHRGLVLEGEKYFKSMIEAYNISPETGHYTCMIDLY 540

Query: 541 GRANQLDKALEFMKKI-PIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARY 600
           G+A +LDKA+E M+ I  ++ DAVI G+FLNAC  N N EL ++ E+KLL+IEG NG+RY
Sbjct: 541 GKAYRLDKAIELMEGIDQVEKDAVILGAFLNACSWNKNTELVKEVEEKLLVIEGSNGSRY 600

Query: 601 VQLANVYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIY 660
           +Q+AN YA+ G W+EM RIR +M+GKE++  +GCSW  ++ +FH+FTS D  H + EAIY
Sbjct: 601 IQIANAYASSGRWDEMQRIRHQMRGKELEIFSGCSWANIDKQFHMFTSSDISHYETEAIY 660

Query: 661 LTLASLTDELLHKEE 671
             L  +T +L   +E
Sbjct: 661 AMLHFVTKDLSEIDE 675

BLAST of Clc02G05770 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 9.2e-127
Identity = 246/667 (36.88%), Positives = 386/667 (57.87%), Query Frame = 0

Query: 11  HAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAAYIKSQNL 70
           HA  IKSGF+  IF  N+LI  Y+K G L D +++FD+MP+RN+++WN+++    K   L
Sbjct: 43  HASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFL 102

Query: 71  RQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIRIDEFSLTIML 130
            +A +LF     RD  T+NSM+SG+   D  E +AL +F  M    +   ++E+S   +L
Sbjct: 103 DEADSLFRSMPERDQCTWNSMVSGFAQHDRCE-EALCYFAMMH--KEGFVLNEYSFASVL 162

Query: 131 NLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYGCGEVVDL 190
           +  + L  ++ G Q+HS + K+     V+  S+L+DMYSKCG   +A RV+         
Sbjct: 163 SACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVF--------- 222

Query: 191 VCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESLQLFVRMAD 250
                                    E+   +VV+WN++I+ F QNG   E+L +F  M +
Sbjct: 223 ------------------------DEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE 282

Query: 251 EKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFI-GSGLVDVYCKCSNMR 310
            +V  +E T AS++SAC++L  +K+G+EVH  V+KN  + N  I  +  VD+Y KCS ++
Sbjct: 283 SRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIK 342

Query: 311 YAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSGYVKLQQ 370
            A  +   +   NV + TSMI GY+   +   AR +F  + E+N V W AL +GY +  +
Sbjct: 343 EARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGE 402

Query: 371 CEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQI------HGYMLRAGIELD 430
            E  L L    ++E+  P      +I+ ACA  A L  G Q       HG+  ++G E D
Sbjct: 403 NEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDD 462

Query: 431 TKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMME 490
             + +SL+DMY KCG +     +FR++ ++D + +N MI G+A +G+ NEA++LF+EM+E
Sbjct: 463 IFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLE 522

Query: 491 NDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQLD 550
           +  +PD IT + +LSAC H G VE G H+F SM+RD  ++P  DHY CM+DL GRA  L+
Sbjct: 523 SGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLE 582

Query: 551 KALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVYA 610
           +A   ++++P+Q D+VIWGS L AC+++ N  L +   +KLL +E  N   YV L+N+YA
Sbjct: 583 EAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYA 642

Query: 611 AEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLTD 670
             G WE++  +RK M+ + V K  GCSW+ ++   HVF   D+ HP+ + I+  L  L  
Sbjct: 643 ELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIA 673
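
The percentages in each hit's statistics line are the identity and positive counts divided by the alignment length, e.g. 246/667 for the Q9SIT7 hit above. The following is a minimal sketch, not part of the database page or of any BLAST tool, that parses such a line and recomputes both values. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; the pattern accepts both the "Positives" spelling and the "Postives" spelling that this report format sometimes emits.

```python
import re

# Hypothetical pattern for the statistics line of this report format,
# e.g. "Identity = 246/667 (36.88%), Positives = 386/667 (57.87%), ...".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one statistics line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed as count / alignment length, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }
```

Calling `parse_hsp_stats` on the statistics line of the Q9SIT7 hit gives recomputed values that can be compared with the reported ones, which is a quick consistency check when such reports are scraped in bulk.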

BLAST of Clc02G05770 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 453.0 bits (1164), Expect = 6.0e-126
Identity = 243/663 (36.65%), Positives = 387/663 (58.37%), Query Frame = 0

Query: 9   LCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAAYIKSQ 68
           L H + IKSG   +++  N L+ +Y+K G+   A+KLFDEMP R  FSWN +++AY K  
Sbjct: 35  LVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMPLRTAFSWNTVLSAYSKRG 94

Query: 69  NLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYE--AQALGFFVEMQTAPDLIRIDEFSL 128
           ++      FD    RD V++ +M+ GY +   Y    + +G  V+    P      +F+L
Sbjct: 95  DMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEP-----TQFTL 154

Query: 129 TIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYGCGE 188
           T +L   A    +  GK++HSF++K     +V  ++SL++MY+KCG    A +  +    
Sbjct: 155 TNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMA-KFVFDRMV 214

Query: 189 VVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESLQLFV 248
           V D+   NAM+A   + G++D+AM  F +  ER D+V WN+MISGF Q GY+  +L +F 
Sbjct: 215 VRDISSWNAMIALHMQVGQMDLAMAQFEQMAER-DIVTWNSMISGFNQRGYDLRALDIFS 274

Query: 249 RM-ADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVYCKC 308
           +M  D  +  +  T AS+LSAC+NL  L +GK++H++++  G  ++  + + L+ +Y +C
Sbjct: 275 KMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRC 334

Query: 309 SNMRYAESVNSELRT--LNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSG 368
             +  A  +  +  T  L +   T+++ GY   G+M +A+ +F SL +++ V WTA+  G
Sbjct: 335 GGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVG 394

Query: 369 YVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIELD 428
           Y +       + L          P+   L +++   +  A+L+ GKQIHG  +++G    
Sbjct: 395 YEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYS 454

Query: 429 TKLTSSLVDMYSKCGSIIYAERMFREV-TDKDSILYNIMIAGYAHHGWENEAVQLFKEMM 488
             ++++L+ MY+K G+I  A R F  +  ++D++ +  MI   A HG   EA++LF+ M+
Sbjct: 455 VSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETML 514

Query: 489 ENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQL 548
              LRPD IT+V + SAC H GLV  G  +FD M     I P + HYACM+DL+GRA  L
Sbjct: 515 MEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLL 574

Query: 549 DKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVY 608
            +A EF++K+PI+ D V WGS L+ACR++ N +L + A ++LL++E EN   Y  LAN+Y
Sbjct: 575 QEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLY 634

Query: 609 AAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLT 666
           +A G WEE  +IRK MK   VKK  G SW+ V++K HVF   D  HP+   IY+T+  + 
Sbjct: 635 SACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIW 690

BLAST of Clc02G05770 vs. ExPASy Swiss-Prot
Match: Q9FWA6 (Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 380.2 bits (975), Expect = 4.9e-104
Identity = 237/834 (28.42%), Positives = 386/834 (46.28%), Query Frame = 0

Query: 4   LKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAA 63
           L+ G   HA  I SGF PT F  N L+ +Y        A  +FD+MP R+V SWN +I  
Sbjct: 64  LELGKQAHAHMIISGFRPTTFVLNCLLQVYTNSRDFVSASMVFDKMPLRDVVSWNKMING 123

Query: 64  YIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIRIDE 123
           Y KS ++ +A + F+    RD+V++NSMLSGY+  +G   +++  FV+M    + I  D 
Sbjct: 124 YSKSNDMFKANSFFNMMPVRDVVSWNSMLSGYL-QNGESLKSIEVFVDM--GREGIEFDG 183

Query: 124 FSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYG 183
            +  I+L + + L   S G Q+H  +++   D  V AAS+L+DMY+K   F E+ RV+ G
Sbjct: 184 RTFAIILKVCSFLEDTSLGMQIHGIVVRVGCDTDVVAASALLDMYAKGKRFVESLRVFQG 243

Query: 184 CGEVVDLVCRNAMVAACCREGEIDVAMNLF------------------------------ 243
             E  + V  +A++A C +   + +A+  F                              
Sbjct: 244 IPE-KNSVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLRSCAALSELRL 303

Query: 244 ------------------------------------------------------------ 303
                                                                       
Sbjct: 304 GGQLHAHALKSDFAADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQSYNAMITGYSQ 363

Query: 304 ------------------------------------------------------------ 363
                                                                       
Sbjct: 364 EEHGFKALLLFHRLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCV 423

Query: 364 --------------------WKELERNDVVAWNTMISGFVQNGYEEESLQLFVRMADEKV 423
                               + E+ R D V+WN +I+   QNG   E+L LFV M   ++
Sbjct: 424 ANAAIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRI 483

Query: 424 GWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVYCKCSNMRYAES 483
             +E TF SIL AC+   +L  G E+H+ ++K+G+  N  +G  L+D+Y KC  +  AE 
Sbjct: 484 EPDEFTFGSILKACTG-GSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEK 543

Query: 484 VNSE-LRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSGYVKLQQCEA 543
           ++S   +  NV             G M E  K+ +   ++  V W ++ SGYV  +Q E 
Sbjct: 544 IHSRFFQRANV------------SGTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSED 603

Query: 544 VLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIELDTKLTSSLVD 603
              L +   +    PD     +++  CA  A+   GKQIH  +++  ++ D  + S+LVD
Sbjct: 604 AQMLFTRMMEMGITPDKFTYATVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVD 663

Query: 604 MYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMMENDLRPDAIT 663
           MYSKCG +  +  MF +   +D + +N MI GYAHHG   EA+QLF+ M+  +++P+ +T
Sbjct: 664 MYSKCGDLHDSRLMFEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVT 723

Query: 664 FVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQLDKALEFMKKI 666
           F+++L AC H GL++ G  +F  M RD+ + P++ HY+ M+D+ G++ ++ +ALE ++++
Sbjct: 724 FISILRACAHMGLIDKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREM 783

BLAST of Clc02G05770 vs. ExPASy Swiss-Prot
Match: Q9CAA8 (Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H22 PE=3 SV=1)

HSP 1 Score: 379.8 bits (974), Expect = 6.4e-104
Identity = 209/659 (31.71%), Positives = 361/659 (54.78%), Query Frame = 0

Query: 9   LCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAAYIKSQ 68
           + H   I++   P  F  N ++  YA       A+++FD +P+ N+FSWN ++ AY K+ 
Sbjct: 27  MIHGNIIRALPYPETFLYNNIVHAYALMKSSTYARRVFDRIPQPNLFSWNNLLLAYSKAG 86

Query: 69  NLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQT-APDLIRIDEFSLT 128
            + +  + F+    RD VT+N ++ GY  S    A    +   M+  + +L R+   +L 
Sbjct: 87  LISEMESTFEKLPDRDGVTWNVLIEGYSLSGLVGAAVKAYNTMMRDFSANLTRV---TLM 146

Query: 129 IMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYGCGEV 188
            ML L++    VS GKQ+H  ++K   +  +   S L+ MY+  G   +A +V+YG  + 
Sbjct: 147 TMLKLSSSNGHVSLGKQIHGQVIKLGFESYLLVGSPLLYMYANVGCISDAKKVFYGLDD- 206

Query: 189 VDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESLQLFVR 248
            + V  N+++      G I+ A+ LF + +E+ D V+W  MI G  QNG  +E+++ F  
Sbjct: 207 RNTVMYNSLMGGLLACGMIEDALQLF-RGMEK-DSVSWAAMIKGLAQNGLAKEAIECFRE 266

Query: 249 MADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVYCKCSN 308
           M  + +  +++ F S+L AC  L  +  GK++HA +++     + ++GS L+D+YCKC  
Sbjct: 267 MKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVGSALIDMYCKCKC 326

Query: 309 MRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSGYVKL 368
           + YA++V   ++  NV S T+M+VGY   G   EA K+F  LD + S +           
Sbjct: 327 LHYAKTVFDRMKQKNVVSWTAMVVGYGQTGRAEEAVKIF--LDMQRSGI----------- 386

Query: 369 QQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIELDTKLT 428
                              PD   L   I ACA  ++L  G Q HG  + +G+     ++
Sbjct: 387 ------------------DPDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVS 446

Query: 429 SSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMMENDLR 488
           +SLV +Y KCG I  + R+F E+  +D++ +  M++ YA  G   E +QLF +M+++ L+
Sbjct: 447 NSLVTLYGKCGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLK 506

Query: 489 PDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQLDKALE 548
           PD +T   ++SAC   GLVE G+ +F  M+ ++ I P I HY+CMIDL+ R+ +L++A+ 
Sbjct: 507 PDGVTLTGVISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMR 566

Query: 549 FMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVYAAEGN 608
           F+  +P   DA+ W + L+ACR  GN E+ + A + L+ ++  + A Y  L+++YA++G 
Sbjct: 567 FINGMPFPPDAIGWTTLLSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGK 626

Query: 609 WEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLTDELL 667
           W+ + ++R+ M+ K VKK  G SW+  + K H F++ D   P  + IY  L  L ++++
Sbjct: 627 WDSVAQLRRGMREKNVKKEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNNKII 648

BLAST of Clc02G05770 vs. ExPASy TrEMBL
Match: A0A1S4DWI0 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g18840 OS=Cucumis melo OX=3656 GN=LOC107990873 PE=4 SV=1)

HSP 1 Score: 1244.2 bits (3218), Expect = 0.0e+00
Identity = 609/672 (90.62%), Positives = 641/672 (95.39%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MKHLKHGLLCH Q IKSGFTPTIFTSNQLI  YAKHG L DAQKLFDEMPERNVFSWNAI
Sbjct: 1   MKHLKHGLLCHVQGIKSGFTPTIFTSNQLINFYAKHGLLNDAQKLFDEMPERNVFSWNAI 60

Query: 61  IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
           I+AYIKSQNLR ARALFD AVY+DLVTYNSMLSGY  SDGYE +ALGFF+EMQTAPD+IR
Sbjct: 61  ISAYIKSQNLRTARALFDSAVYKDLVTYNSMLSGYARSDGYEGKALGFFMEMQTAPDMIR 120

Query: 121 IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
           IDEFSL IMLNLTAKLCV+SYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV
Sbjct: 121 IDEFSLIIMLNLTAKLCVISYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180

Query: 181 YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
           YYGCGEVVD V RNAMVAACCREGEIDVA++LFWKELE+NDVVAWNTMISGFVQNGYEEE
Sbjct: 181 YYGCGEVVDSVSRNAMVAACCREGEIDVALDLFWKELEQNDVVAWNTMISGFVQNGYEEE 240

Query: 241 SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
           SL+LFVRMADEKVGWNEHTFAS+LSACSNLR+LKLGKEVH YVLKN LI NPFI SGLVD
Sbjct: 241 SLKLFVRMADEKVGWNEHTFASVLSACSNLRSLKLGKEVHTYVLKNRLIANPFICSGLVD 300

Query: 301 VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
           VYCKC+NMRYAESV+SELR  NVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL
Sbjct: 301 VYCKCNNMRYAESVHSELRMQNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360

Query: 361 FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
           F GYVKLQQCEAV ELLSEYRKEA VPDVL+LISIIGACAIQAALAPGKQIH YMLRAGI
Sbjct: 361 FFGYVKLQQCEAVFELLSEYRKEAKVPDVLILISIIGACAIQAALAPGKQIHSYMLRAGI 420

Query: 421 ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
           +LDTKLTSSLVDMYSKCGSIIYAER+F   +DKDSI+YNIMIAGYAHHGWENEAVQLF+E
Sbjct: 421 KLDTKLTSSLVDMYSKCGSIIYAERIFXRSSDKDSIIYNIMIAGYAHHGWENEAVQLFEE 480

Query: 481 MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
           M+ENDL+PDAITFVALLSACRHGGLVELGEHFFDSMS D+NI+PEIDHYACMIDLYGRAN
Sbjct: 481 MVENDLKPDAITFVALLSACRHGGLVELGEHFFDSMSNDYNINPEIDHYACMIDLYGRAN 540

Query: 541 QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
           QLDKALEFM+KIPIQLDAVIWG+FLNACRINGNAELAR+AED+LL+IEGENGARYVQLAN
Sbjct: 541 QLDKALEFMRKIPIQLDAVIWGAFLNACRINGNAELAREAEDELLVIEGENGARYVQLAN 600

Query: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
           VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVE+KFHVF SGDRFH KNEAIY TLAS
Sbjct: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVESKFHVFISGDRFHSKNEAIYSTLAS 660

Query: 661 LTDELLHKEEAF 673
           LTDELL+K+EAF
Sbjct: 661 LTDELLYKDEAF 672

BLAST of Clc02G05770 vs. ExPASy TrEMBL
Match: A0A0A0K940 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G253760 PE=4 SV=1)

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 606/673 (90.04%), Positives = 639/673 (94.95%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MKHLKHGLLCH Q IKSGFTPTIF SNQLIT YAKHG L DAQKLFDEMPERNVFSWNAI
Sbjct: 1   MKHLKHGLLCHLQGIKSGFTPTIFMSNQLITFYAKHGLLNDAQKLFDEMPERNVFSWNAI 60

Query: 61  IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
           IAAY+KS NLRQARALFD AV +DLVTYNSMLSGY  SDGY+ QALGFF+EMQTAPD+IR
Sbjct: 61  IAAYVKSHNLRQARALFDSAVCKDLVTYNSMLSGYARSDGYQGQALGFFMEMQTAPDMIR 120

Query: 121 IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
           IDEF+L  MLNLTAKLCV+SYGKQLHSFMLKTANDL+VFAASSLIDMYSKCGFFKEACRV
Sbjct: 121 IDEFTLITMLNLTAKLCVISYGKQLHSFMLKTANDLTVFAASSLIDMYSKCGFFKEACRV 180

Query: 181 YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
           YYGCGEVVD V RNAMVAACCREGEIDVA++LFWKELE+NDVVAWNTMISGFVQNGYEEE
Sbjct: 181 YYGCGEVVDSVSRNAMVAACCREGEIDVALDLFWKELEQNDVVAWNTMISGFVQNGYEEE 240

Query: 241 SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
           SL+LFVRMADEKVGWNEHTFAS+LSACSNLR+LKLGKEVHAYVLKN LI NPFI SGLVD
Sbjct: 241 SLKLFVRMADEKVGWNEHTFASVLSACSNLRSLKLGKEVHAYVLKNRLIANPFICSGLVD 300

Query: 301 VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
           VYCKC+NMRYA+SVNSELR  NVYSITSMIVGYSSQGNMAEARKLFDSLDEKNS VWTAL
Sbjct: 301 VYCKCNNMRYAKSVNSELRMQNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSAVWTAL 360

Query: 361 FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
           F GYVKLQQCEAV ELLSEYRKEA VPDVL+LISIIGACAIQAAL PGKQIH YMLRAGI
Sbjct: 361 FFGYVKLQQCEAVFELLSEYRKEAKVPDVLILISIIGACAIQAALVPGKQIHSYMLRAGI 420

Query: 421 ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
           +LDTKLTSSLVDMYSKCGSIIYAER+FREVTDKDSI+YNIMIAGYAHHGWENEAVQLFKE
Sbjct: 421 KLDTKLTSSLVDMYSKCGSIIYAERIFREVTDKDSIIYNIMIAGYAHHGWENEAVQLFKE 480

Query: 481 MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
           M+++  +PDAITFVALLSACRHGGLVELGEHFFDSMS D+NI PEIDHYACMIDLYGRAN
Sbjct: 481 MVKHGFKPDAITFVALLSACRHGGLVELGEHFFDSMSNDYNICPEIDHYACMIDLYGRAN 540

Query: 541 QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
           QLDKALEFM+KIPIQLDAVIWG+FLNACRINGNAELARKAED+LL+IEGENG+RYVQLAN
Sbjct: 541 QLDKALEFMRKIPIQLDAVIWGAFLNACRINGNAELARKAEDELLVIEGENGSRYVQLAN 600

Query: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
           VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVE+KFHVF SGDRFH KNEAIY TLAS
Sbjct: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVESKFHVFISGDRFHSKNEAIYSTLAS 660

Query: 661 LTDELLHKEEAFC 674
           LTDELL++EEAFC
Sbjct: 661 LTDELLYREEAFC 673

BLAST of Clc02G05770 vs. ExPASy TrEMBL
Match: A0A6J1C740 (putative pentatricopeptide repeat-containing protein At3g18840 OS=Momordica charantia OX=3673 GN=LOC111007983 PE=4 SV=1)

HSP 1 Score: 1199.5 bits (3102), Expect = 0.0e+00
Identity = 591/673 (87.82%), Positives = 628/673 (93.31%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MKHLKHG LCH QAIKSGFTPTIFTSNQLI+LYAKHG L +AQKLFDEMPERNVFSWNAI
Sbjct: 5   MKHLKHGFLCHIQAIKSGFTPTIFTSNQLISLYAKHGHLRNAQKLFDEMPERNVFSWNAI 64

Query: 61  IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
           IAAYIKS NL QARALFD A YRDLVTYNSMLSGYVSSDGYEA AL  FVEMQTAPD+IR
Sbjct: 65  IAAYIKSHNLGQARALFDSASYRDLVTYNSMLSGYVSSDGYEAHALELFVEMQTAPDMIR 124

Query: 121 IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
           IDEF+LTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCG FKEA RV
Sbjct: 125 IDEFTLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGCFKEAYRV 184

Query: 181 YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
           Y GCG+VVDLV RNAMVAACCREGEIDVA++LFWKELERND VAWNTMISGFVQNGYEEE
Sbjct: 185 YDGCGDVVDLVSRNAMVAACCREGEIDVAVDLFWKELERNDNVAWNTMISGFVQNGYEEE 244

Query: 241 SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
           SL+LFVRM DE V WNEHTFAS+LSACSNLR+LKLGKEVHAYVLKNGLI+NPFIGSGLVD
Sbjct: 245 SLKLFVRMGDENVRWNEHTFASVLSACSNLRSLKLGKEVHAYVLKNGLIVNPFIGSGLVD 304

Query: 301 VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
           VYCKC+NM YAESVN+ +   N YSITSMIVGYSSQGNMAEARKLFDSLDEKN+VVWTAL
Sbjct: 305 VYCKCNNMEYAESVNANM-ARNAYSITSMIVGYSSQGNMAEARKLFDSLDEKNTVVWTAL 364

Query: 361 FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
           FSGYVKLQQCEAV ELLSEYR+EA VPDVL+L+SIIGACAIQAALAPGKQIHGY+LRAG+
Sbjct: 365 FSGYVKLQQCEAVFELLSEYREEAAVPDVLILVSIIGACAIQAALAPGKQIHGYILRAGV 424

Query: 421 ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
           ELD KL SSLVDMYSKCGSII+A R+FR+VTDKDSILYNIMIAGYAHHGWEN+AV LFKE
Sbjct: 425 ELDKKLASSLVDMYSKCGSIIFAARIFRQVTDKDSILYNIMIAGYAHHGWENKAVLLFKE 484

Query: 481 MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
           MMEN L+PDA+TFVALLSACRH GLVELG+HFF+SM+ +HN+SPEIDHYACMIDLYGRAN
Sbjct: 485 MMENGLKPDAVTFVALLSACRHCGLVELGDHFFESMTNEHNMSPEIDHYACMIDLYGRAN 544

Query: 541 QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
           QL+KAL FMK IPI+LDAVIWG+FLNACRINGN ELA++AED+LLIIEGENGARYVQLAN
Sbjct: 545 QLEKALAFMKSIPIELDAVIWGAFLNACRINGNTELAKEAEDELLIIEGENGARYVQLAN 604

Query: 601 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
           VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVF S DR H  NEAIY TLAS
Sbjct: 605 VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFMSSDRCHSANEAIYSTLAS 664

Query: 661 LTDELLHKEEAFC 674
           LTDELL  EEAFC
Sbjct: 665 LTDELLDIEEAFC 676

BLAST of Clc02G05770 vs. ExPASy TrEMBL
Match: A0A6J1GNG3 (putative pentatricopeptide repeat-containing protein At3g18840 OS=Cucurbita moschata OX=3662 GN=LOC111455543 PE=4 SV=1)

HSP 1 Score: 1189.9 bits (3077), Expect = 0.0e+00
Identity = 585/673 (86.92%), Positives = 626/673 (93.02%), Query Frame = 0

Query: 1    MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
            MKHLKHGLLCH QAIKSGFTPTIFTSNQLI+LYAKHG L DA K+FDEMPERNVFSWNAI
Sbjct: 427  MKHLKHGLLCHVQAIKSGFTPTIFTSNQLISLYAKHGLLGDAHKVFDEMPERNVFSWNAI 486

Query: 61   IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
            IAAYIKSQNLR+AR LFD A YRDLVTYNSMLSGYVSSDGYEAQALG FVEMQTA D+IR
Sbjct: 487  IAAYIKSQNLRKARELFDSAHYRDLVTYNSMLSGYVSSDGYEAQALGLFVEMQTASDMIR 546

Query: 121  IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
            IDEFSLTIMLNLTAKLCV+SYGKQLH+FMLKTANDLSVFAASSLIDMYSKCG FKEACRV
Sbjct: 547  IDEFSLTIMLNLTAKLCVLSYGKQLHTFMLKTANDLSVFAASSLIDMYSKCGCFKEACRV 606

Query: 181  YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
            Y GCGEV+DLV RNAMVAACCR GEID+A++LF +E +RNDVVAWNTMISGFVQNGY++E
Sbjct: 607  YDGCGEVIDLVSRNAMVAACCRAGEIDLAVDLFSRERDRNDVVAWNTMISGFVQNGYDKE 666

Query: 241  SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
            SL+LFV  ADE V WNEHTFAS+LSACSNL++LKLGKE+HAYVLKNGLI+NPFIGSG+VD
Sbjct: 667  SLKLFVNTADENVRWNEHTFASVLSACSNLKSLKLGKEIHAYVLKNGLIVNPFIGSGVVD 726

Query: 301  VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
            VYCKC+NMRYAESV+ EL T NVYSITSMIVGYSSQGNM EARKLFDSLDEKNSVVWTAL
Sbjct: 727  VYCKCNNMRYAESVHLELTTRNVYSITSMIVGYSSQGNMVEARKLFDSLDEKNSVVWTAL 786

Query: 361  FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
            F+ YVK QQ EAV ELLSEYRKEA VPDVL+L+SIIGACA QAALAPGKQIHGYMLRAGI
Sbjct: 787  FTEYVKSQQFEAVFELLSEYRKEAAVPDVLILVSIIGACARQAALAPGKQIHGYMLRAGI 846

Query: 421  ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
            E D KL SSLVDMYSKCGSIIYAER+FREV DKDSILYNIMIAGYAHHGWENEAV LFKE
Sbjct: 847  EFDVKLASSLVDMYSKCGSIIYAERIFREVLDKDSILYNIMIAGYAHHGWENEAVHLFKE 906

Query: 481  MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
            MMENDL PDAITF+ALLSACRH GLVELGE FF+SM+ D+NISPEIDHYACMIDLYGRAN
Sbjct: 907  MMENDLEPDAITFIALLSACRHSGLVELGERFFNSMTNDYNISPEIDHYACMIDLYGRAN 966

Query: 541  QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
            +LDKAL FMK+IPI+LDAVIWG+FLNACRINGN ELAR+AED+LL+IEGEN ARYVQLAN
Sbjct: 967  ELDKALAFMKRIPIELDAVIWGAFLNACRINGNTELAREAEDELLMIEGENSARYVQLAN 1026

Query: 601  VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
            VYAA+GNWEEMGRIRKKMKGK+VKKNAG SWVFVENKFHVF SGDRFH +NEAIY TLAS
Sbjct: 1027 VYAAKGNWEEMGRIRKKMKGKDVKKNAGFSWVFVENKFHVFISGDRFHLENEAIYSTLAS 1086

Query: 661  LTDELLHKEEAFC 674
            LTDELL  EEAFC
Sbjct: 1087 LTDELLAVEEAFC 1099

BLAST of Clc02G05770 vs. ExPASy TrEMBL
Match: A0A6J1I9U9 (putative pentatricopeptide repeat-containing protein At3g18840 OS=Cucurbita maxima OX=3661 GN=LOC111470970 PE=4 SV=1)

HSP 1 Score: 1189.5 bits (3076), Expect = 0.0e+00
Identity = 586/673 (87.07%), Positives = 625/673 (92.87%), Query Frame = 0

Query: 1    MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
            MKHLKHGLLCH QAIKSGFTPTIFTSNQLI+LYAKHG L DA K+FDEMPERNVFSWNAI
Sbjct: 425  MKHLKHGLLCHVQAIKSGFTPTIFTSNQLISLYAKHGLLGDAHKVFDEMPERNVFSWNAI 484

Query: 61   IAAYIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIR 120
            IAAYIKSQNLR+AR LFD A YRDLVTYNSMLSGYVSSDGYEAQALG FVEMQTA D+IR
Sbjct: 485  IAAYIKSQNLRKARELFDSAHYRDLVTYNSMLSGYVSSDGYEAQALGLFVEMQTASDMIR 544

Query: 121  IDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRV 180
            +DEFSLTIMLNLTAKLCV+SYGKQLH+FMLKTANDLSVFAASSLIDMYSKCG FKEACRV
Sbjct: 545  LDEFSLTIMLNLTAKLCVLSYGKQLHTFMLKTANDLSVFAASSLIDMYSKCGCFKEACRV 604

Query: 181  YYGCGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEE 240
            Y GCGEV+DLV RNAMVAACCR GEID+A++LF +E +RNDVVAWNTMISGFVQNGY+EE
Sbjct: 605  YDGCGEVIDLVSRNAMVAACCRAGEIDLAVDLFSRERDRNDVVAWNTMISGFVQNGYDEE 664

Query: 241  SLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVD 300
            SL+LFV MADE V WNEHTFAS+LSACSNL++LKLGKE+HAYVLKNGLI+NPFIGSGLVD
Sbjct: 665  SLKLFVNMADENVRWNEHTFASVLSACSNLKSLKLGKEIHAYVLKNGLIVNPFIGSGLVD 724

Query: 301  VYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTAL 360
            VYCKC+NMRYAESV+SEL T NVYSITSMIVGYSSQGNM EARKLFDSLDEKNSVVWTAL
Sbjct: 725  VYCKCNNMRYAESVHSELTTRNVYSITSMIVGYSSQGNMVEARKLFDSLDEKNSVVWTAL 784

Query: 361  FSGYVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGI 420
            F  YVK QQ EAV ELLSEYRKEA V DVL+L+SIIGACA QAALAPGKQIHGYMLRAGI
Sbjct: 785  FIEYVKSQQFEAVFELLSEYRKEAAVLDVLILVSIIGACARQAALAPGKQIHGYMLRAGI 844

Query: 421  ELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKE 480
            E   KL SSLVDMYSKCGSIIYAER+FREV DKDSILYNIMIAGYAHHGWENEAV LFKE
Sbjct: 845  EFKVKLASSLVDMYSKCGSIIYAERIFREVLDKDSILYNIMIAGYAHHGWENEAVHLFKE 904

Query: 481  MMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRAN 540
            M+END  PDAITF+ALLSACRH GLVELGE FFDSM+ D+NISPEIDHYACMIDLYGRAN
Sbjct: 905  MIENDFEPDAITFIALLSACRHSGLVELGERFFDSMTNDYNISPEIDHYACMIDLYGRAN 964

Query: 541  QLDKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLAN 600
            +L+KAL FMK+IPI+LDAVIWG+FLNACRINGN ELAR+AED+LL+IEGENGARYVQLAN
Sbjct: 965  ELNKALAFMKRIPIELDAVIWGAFLNACRINGNTELAREAEDELLMIEGENGARYVQLAN 1024

Query: 601  VYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLAS 660
            VYAAEGNWEEMGRIRKKMKGK+VKKNAG SWVFVENKFHVF SGDRFH +NEAIY TLAS
Sbjct: 1025 VYAAEGNWEEMGRIRKKMKGKDVKKNAGFSWVFVENKFHVFISGDRFHLQNEAIYSTLAS 1084

Query: 661  LTDELLHKEEAFC 674
            LTDELL  EEAFC
Sbjct: 1085 LTDELLAVEEAFC 1097

BLAST of Clc02G05770 vs. TAIR 10
Match: AT3G18840.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 750.7 bits (1937), Expect = 9.9e-217
Identity = 367/675 (54.37%), Positives = 499/675 (73.93%), Query Frame = 0

Query: 1   MKHLKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAI 60
           MK LK G L H ++IKSG T T  +SNQL+ LY+K G L +A+ +FDEM ERNV+SWNA+
Sbjct: 1   MKCLKDGFLHHIRSIKSGSTLTAVSSNQLVNLYSKSGLLREARNVFDEMLERNVYSWNAV 60

Query: 61  IAAYIKSQNLRQARALFDC-AVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEM-QTAPDL 120
           IAAY+K  N+++AR LF+     RDL+TYN++LSG+  +DG E++A+  F EM +   D 
Sbjct: 61  IAAYVKFNNVKEARELFESDNCERDLITYNTLLSGFAKTDGCESEAIEMFGEMHRKEKDD 120

Query: 121 IRIDEFSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEAC 180
           I ID+F++T M+ L+AKL  V YG+QLH  ++KT ND + FA SSLI MYSKCG FKE C
Sbjct: 121 IWIDDFTVTTMVKLSAKLTNVFYGEQLHGVLVKTGNDGTKFAVSSLIHMYSKCGKFKEVC 180

Query: 181 RVYYG-CGEVVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGY 240
            ++ G C E VD V RNAM+AA CREG+ID A+++FW+  E ND ++WNT+I+G+ QNGY
Sbjct: 181 NIFNGSCVEFVDSVARNAMIAAYCREGDIDKALSVFWRNPELNDTISWNTLIAGYAQNGY 240

Query: 241 EEESLQLFVRMADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSG 300
           EEE+L++ V M +  + W+EH+F ++L+  S+L++LK+GKEVHA VLKNG   N F+ SG
Sbjct: 241 EEEALKMAVSMEENGLKWDEHSFGAVLNVLSSLKSLKIGKEVHARVLKNGSYSNKFVSSG 300

Query: 301 LVDVYCKCSNMRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVW 360
           +VDVYCKC NM+YAES +      N+YS +SMIVGYSSQG M EA++LFDSL EKN VVW
Sbjct: 301 IVDVYCKCGNMKYAESAHLLYGFGNLYSASSMIVGYSSQGKMVEAKRLFDSLSEKNLVVW 360

Query: 361 TALFSGYVKLQQCEAVLELLSEY-RKEATVPDVLLLISIIGACAIQAALAPGKQIHGYML 420
           TA+F GY+ L+Q ++VLEL   +   E   PD L+++S++GAC++QA + PGK+IHG+ L
Sbjct: 361 TAMFLGYLNLRQPDSVLELARAFIANETNTPDSLVMVSVLGACSLQAYMEPGKEIHGHSL 420

Query: 421 RAGIELDTKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQ 480
           R GI +D KL ++ VDMYSKCG++ YAER+F    ++D+++YN MIAG AHHG E ++ Q
Sbjct: 421 RTGILMDKKLVTAFVDMYSKCGNVEYAERIFDSSFERDTVMYNAMIAGCAHHGHEAKSFQ 480

Query: 481 LFKEMMENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLY 540
            F++M E   +PD ITF+ALLSACRH GLV  GE +F SM   +NISPE  HY CMIDLY
Sbjct: 481 HFEDMTEGGFKPDEITFMALLSACRHRGLVLEGEKYFKSMIEAYNISPETGHYTCMIDLY 540

Query: 541 GRANQLDKALEFMKKI-PIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARY 600
           G+A +LDKA+E M+ I  ++ DAVI G+FLNAC  N N EL ++ E+KLL+IEG NG+RY
Sbjct: 541 GKAYRLDKAIELMEGIDQVEKDAVILGAFLNACSWNKNTELVKEVEEKLLVIEGSNGSRY 600

Query: 601 VQLANVYAAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIY 660
           +Q+AN YA+ G W+EM RIR +M+GKE++  +GCSW  ++ +FH+FTS D  H + EAIY
Sbjct: 601 IQIANAYASSGRWDEMQRIRHQMRGKELEIFSGCSWANIDKQFHMFTSSDISHYETEAIY 660

Query: 661 LTLASLTDELLHKEE 671
             L  +T +L   +E
Sbjct: 661 AMLHFVTKDLSEIDE 675

BLAST of Clc02G05770 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 455.7 bits (1171), Expect = 6.6e-128
Identity = 246/667 (36.88%), Positives = 386/667 (57.87%), Query Frame = 0

Query: 11  HAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAAYIKSQNL 70
           HA  IKSGF+  IF  N+LI  Y+K G L D +++FD+MP+RN+++WN+++    K   L
Sbjct: 43  HASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFL 102

Query: 71  RQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIRIDEFSLTIML 130
            +A +LF     RD  T+NSM+SG+   D  E +AL +F  M    +   ++E+S   +L
Sbjct: 103 DEADSLFRSMPERDQCTWNSMVSGFAQHDRCE-EALCYFAMMH--KEGFVLNEYSFASVL 162

Query: 131 NLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYGCGEVVDL 190
           +  + L  ++ G Q+HS + K+     V+  S+L+DMYSKCG   +A RV+         
Sbjct: 163 SACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVF--------- 222

Query: 191 VCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESLQLFVRMAD 250
                                    E+   +VV+WN++I+ F QNG   E+L +F  M +
Sbjct: 223 ------------------------DEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE 282

Query: 251 EKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFI-GSGLVDVYCKCSNMR 310
            +V  +E T AS++SAC++L  +K+G+EVH  V+KN  + N  I  +  VD+Y KCS ++
Sbjct: 283 SRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIK 342

Query: 311 YAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSGYVKLQQ 370
            A  +   +   NV + TSMI GY+   +   AR +F  + E+N V W AL +GY +  +
Sbjct: 343 EARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGE 402

Query: 371 CEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQI------HGYMLRAGIELD 430
            E  L L    ++E+  P      +I+ ACA  A L  G Q       HG+  ++G E D
Sbjct: 403 NEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDD 462

Query: 431 TKLTSSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMME 490
             + +SL+DMY KCG +     +FR++ ++D + +N MI G+A +G+ NEA++LF+EM+E
Sbjct: 463 IFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLE 522

Query: 491 NDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQLD 550
           +  +PD IT + +LSAC H G VE G H+F SM+RD  ++P  DHY CM+DL GRA  L+
Sbjct: 523 SGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLE 582

Query: 551 KALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVYA 610
           +A   ++++P+Q D+VIWGS L AC+++ N  L +   +KLL +E  N   YV L+N+YA
Sbjct: 583 EAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYA 642

Query: 611 AEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLTD 670
             G WE++  +RK M+ + V K  GCSW+ ++   HVF   D+ HP+ + I+  L  L  
Sbjct: 643 ELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIA 673

BLAST of Clc02G05770 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 453.0 bits (1164), Expect = 4.2e-127
Identity = 243/663 (36.65%), Positives = 387/663 (58.37%), Query Frame = 0

Query: 9   LCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAAYIKSQ 68
           L H + IKSG   +++  N L+ +Y+K G+   A+KLFDEMP R  FSWN +++AY K  
Sbjct: 35  LVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMPLRTAFSWNTVLSAYSKRG 94

Query: 69  NLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYE--AQALGFFVEMQTAPDLIRIDEFSL 128
           ++      FD    RD V++ +M+ GY +   Y    + +G  V+    P      +F+L
Sbjct: 95  DMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEP-----TQFTL 154

Query: 129 TIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYGCGE 188
           T +L   A    +  GK++HSF++K     +V  ++SL++MY+KCG    A +  +    
Sbjct: 155 TNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMA-KFVFDRMV 214

Query: 189 VVDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESLQLFV 248
           V D+   NAM+A   + G++D+AM  F +  ER D+V WN+MISGF Q GY+  +L +F 
Sbjct: 215 VRDISSWNAMIALHMQVGQMDLAMAQFEQMAER-DIVTWNSMISGFNQRGYDLRALDIFS 274

Query: 249 RM-ADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVYCKC 308
           +M  D  +  +  T AS+LSAC+NL  L +GK++H++++  G  ++  + + L+ +Y +C
Sbjct: 275 KMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRC 334

Query: 309 SNMRYAESVNSELRT--LNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSG 368
             +  A  +  +  T  L +   T+++ GY   G+M +A+ +F SL +++ V WTA+  G
Sbjct: 335 GGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVG 394

Query: 369 YVKLQQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIELD 428
           Y +       + L          P+   L +++   +  A+L+ GKQIHG  +++G    
Sbjct: 395 YEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYS 454

Query: 429 TKLTSSLVDMYSKCGSIIYAERMFREV-TDKDSILYNIMIAGYAHHGWENEAVQLFKEMM 488
             ++++L+ MY+K G+I  A R F  +  ++D++ +  MI   A HG   EA++LF+ M+
Sbjct: 455 VSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETML 514

Query: 489 ENDLRPDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQL 548
              LRPD IT+V + SAC H GLV  G  +FD M     I P + HYACM+DL+GRA  L
Sbjct: 515 MEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLL 574

Query: 549 DKALEFMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVY 608
            +A EF++K+PI+ D V WGS L+ACR++ N +L + A ++LL++E EN   Y  LAN+Y
Sbjct: 575 QEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLY 634

Query: 609 AAEGNWEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLT 666
           +A G WEE  +IRK MK   VKK  G SW+ V++K HVF   D  HP+   IY+T+  + 
Sbjct: 635 SACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIW 690

BLAST of Clc02G05770 vs. TAIR 10
Match: AT3G02330.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 380.2 bits (975), Expect = 3.5e-105
Identity = 237/834 (28.42%), Positives = 386/834 (46.28%), Query Frame = 0

Query: 4   LKHGLLCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAA 63
           L+ G   HA  I SGF PT F  N L+ +Y        A  +FD+MP R+V SWN +I  
Sbjct: 64  LELGKQAHAHMIISGFRPTTFVLNCLLQVYTNSRDFVSASMVFDKMPLRDVVSWNKMING 123

Query: 64  YIKSQNLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQTAPDLIRIDE 123
           Y KS ++ +A + F+    RD+V++NSMLSGY+  +G   +++  FV+M    + I  D 
Sbjct: 124 YSKSNDMFKANSFFNMMPVRDVVSWNSMLSGYL-QNGESLKSIEVFVDM--GREGIEFDG 183

Query: 124 FSLTIMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYG 183
            +  I+L + + L   S G Q+H  +++   D  V AAS+L+DMY+K   F E+ RV+ G
Sbjct: 184 RTFAIILKVCSFLEDTSLGMQIHGIVVRVGCDTDVVAASALLDMYAKGKRFVESLRVFQG 243

Query: 184 CGEVVDLVCRNAMVAACCREGEIDVAMNLF------------------------------ 243
             E  + V  +A++A C +   + +A+  F                              
Sbjct: 244 IPE-KNSVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLRSCAALSELRL 303

Query: 244 ------------------------------------------------------------ 303
                                                                       
Sbjct: 304 GGQLHAHALKSDFAADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQSYNAMITGYSQ 363

Query: 304 ------------------------------------------------------------ 363
                                                                       
Sbjct: 364 EEHGFKALLLFHRLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCV 423

Query: 364 --------------------WKELERNDVVAWNTMISGFVQNGYEEESLQLFVRMADEKV 423
                               + E+ R D V+WN +I+   QNG   E+L LFV M   ++
Sbjct: 424 ANAAIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRI 483

Query: 424 GWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVYCKCSNMRYAES 483
             +E TF SIL AC+   +L  G E+H+ ++K+G+  N  +G  L+D+Y KC  +  AE 
Sbjct: 484 EPDEFTFGSILKACTG-GSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEK 543

Query: 484 VNSE-LRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSGYVKLQQCEA 543
           ++S   +  NV             G M E  K+ +   ++  V W ++ SGYV  +Q E 
Sbjct: 544 IHSRFFQRANV------------SGTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSED 603

Query: 544 VLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIELDTKLTSSLVD 603
              L +   +    PD     +++  CA  A+   GKQIH  +++  ++ D  + S+LVD
Sbjct: 604 AQMLFTRMMEMGITPDKFTYATVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVD 663

Query: 604 MYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMMENDLRPDAIT 663
           MYSKCG +  +  MF +   +D + +N MI GYAHHG   EA+QLF+ M+  +++P+ +T
Sbjct: 664 MYSKCGDLHDSRLMFEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVT 723

Query: 664 FVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQLDKALEFMKKI 666
           F+++L AC H GL++ G  +F  M RD+ + P++ HY+ M+D+ G++ ++ +ALE ++++
Sbjct: 724 FISILRACAHMGLIDKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREM 783

BLAST of Clc02G05770 vs. TAIR 10
Match: AT1G68930.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 379.8 bits (974), Expect = 4.6e-105
Identity = 209/659 (31.71%), Positives = 361/659 (54.78%), Query Frame = 0

Query: 9   LCHAQAIKSGFTPTIFTSNQLITLYAKHGFLADAQKLFDEMPERNVFSWNAIIAAYIKSQ 68
           + H   I++   P  F  N ++  YA       A+++FD +P+ N+FSWN ++ AY K+ 
Sbjct: 27  MIHGNIIRALPYPETFLYNNIVHAYALMKSSTYARRVFDRIPQPNLFSWNNLLLAYSKAG 86

Query: 69  NLRQARALFDCAVYRDLVTYNSMLSGYVSSDGYEAQALGFFVEMQT-APDLIRIDEFSLT 128
            + +  + F+    RD VT+N ++ GY  S    A    +   M+  + +L R+   +L 
Sbjct: 87  LISEMESTFEKLPDRDGVTWNVLIEGYSLSGLVGAAVKAYNTMMRDFSANLTRV---TLM 146

Query: 129 IMLNLTAKLCVVSYGKQLHSFMLKTANDLSVFAASSLIDMYSKCGFFKEACRVYYGCGEV 188
            ML L++    VS GKQ+H  ++K   +  +   S L+ MY+  G   +A +V+YG  + 
Sbjct: 147 TMLKLSSSNGHVSLGKQIHGQVIKLGFESYLLVGSPLLYMYANVGCISDAKKVFYGLDD- 206

Query: 189 VDLVCRNAMVAACCREGEIDVAMNLFWKELERNDVVAWNTMISGFVQNGYEEESLQLFVR 248
            + V  N+++      G I+ A+ LF + +E+ D V+W  MI G  QNG  +E+++ F  
Sbjct: 207 RNTVMYNSLMGGLLACGMIEDALQLF-RGMEK-DSVSWAAMIKGLAQNGLAKEAIECFRE 266

Query: 249 MADEKVGWNEHTFASILSACSNLRNLKLGKEVHAYVLKNGLILNPFIGSGLVDVYCKCSN 308
           M  + +  +++ F S+L AC  L  +  GK++HA +++     + ++GS L+D+YCKC  
Sbjct: 267 MKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVGSALIDMYCKCKC 326

Query: 309 MRYAESVNSELRTLNVYSITSMIVGYSSQGNMAEARKLFDSLDEKNSVVWTALFSGYVKL 368
           + YA++V   ++  NV S T+M+VGY   G   EA K+F  LD + S +           
Sbjct: 327 LHYAKTVFDRMKQKNVVSWTAMVVGYGQTGRAEEAVKIF--LDMQRSGI----------- 386

Query: 369 QQCEAVLELLSEYRKEATVPDVLLLISIIGACAIQAALAPGKQIHGYMLRAGIELDTKLT 428
                              PD   L   I ACA  ++L  G Q HG  + +G+     ++
Sbjct: 387 ------------------DPDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVS 446

Query: 429 SSLVDMYSKCGSIIYAERMFREVTDKDSILYNIMIAGYAHHGWENEAVQLFKEMMENDLR 488
           +SLV +Y KCG I  + R+F E+  +D++ +  M++ YA  G   E +QLF +M+++ L+
Sbjct: 447 NSLVTLYGKCGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLK 506

Query: 489 PDAITFVALLSACRHGGLVELGEHFFDSMSRDHNISPEIDHYACMIDLYGRANQLDKALE 548
           PD +T   ++SAC   GLVE G+ +F  M+ ++ I P I HY+CMIDL+ R+ +L++A+ 
Sbjct: 507 PDGVTLTGVISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMR 566

Query: 549 FMKKIPIQLDAVIWGSFLNACRINGNAELARKAEDKLLIIEGENGARYVQLANVYAAEGN 608
           F+  +P   DA+ W + L+ACR  GN E+ + A + L+ ++  + A Y  L+++YA++G 
Sbjct: 567 FINGMPFPPDAIGWTTLLSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGK 626

Query: 609 WEEMGRIRKKMKGKEVKKNAGCSWVFVENKFHVFTSGDRFHPKNEAIYLTLASLTDELL 667
           W+ + ++R+ M+ K VKK  G SW+  + K H F++ D   P  + IY  L  L ++++
Sbjct: 627 WDSVAQLRRGMREKNVKKEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNNKII 648
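
The HSP summary lines in the BLAST reports above give identity and positives as counts over the alignment length, followed by a percentage. The following is a minimal sketch, not part of the database output, of how such a line could be parsed and the percentages recomputed. The helper name `parse_hsp_summary` and the returned dictionary keys are hypothetical, and the regular expression assumes the exact line layout shown on this page (it also accepts the "Postives" spelling).

```python
import re

# Matches the HSP summary line as printed on this page (assumed layout).
# "Posi?tives" tolerates the "Postives" spelling seen in the page output.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts plus recomputed percentages (hypothetical helper)."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        # Percentages recomputed from the counts, rounded as on the page.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as reported in the line.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


summary = parse_hsp_summary(
    "Identity = 243/663 (36.65%), Positives = 387/663 (58.37%), Query Frame = 0"
)
```

Comparing the recomputed and reported percentages is a quick consistency check on a copied or scraped report.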

The following BLAST results are available for this feature:
Match Name        E-value     Identity (%)   Description
XP_038887152.1    0.0e+00     91.21          putative pentatricopeptide repeat-containing protein At3g18840 [Benincasa hispid...
XP_016900344.1    0.0e+00     90.63          PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
XP_031745080.1    0.0e+00     90.04          putative pentatricopeptide repeat-containing protein At3g18840 [Cucumis sativus]
KGN44321.1        0.0e+00     90.04          hypothetical protein Csa_015981 [Cucumis sativus]
XP_022136238.1    0.0e+00     87.82          putative pentatricopeptide repeat-containing protein At3g18840 [Momordica charan...

Match Name        E-value     Identity (%)   Description
Q9LHN5            1.4e-215    54.37          Putative pentatricopeptide repeat-containing protein At3g18840 OS=Arabidopsis th...
Q9SIT7            9.2e-127    36.88          Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9SHZ8            6.0e-126    36.65          Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9FWA6            4.9e-104    28.42          Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidop...
Q9CAA8            6.4e-104    31.71          Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis th...

Match Name        E-value     Identity (%)   Description
A0A1S4DWI0        0.0e+00     90.63          LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g18...
A0A0A0K940        0.0e+00     90.04          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G253760 PE=4 SV=1
A0A6J1C740        0.0e+00     87.82          putative pentatricopeptide repeat-containing protein At3g18840 OS=Momordica char...
A0A6J1GNG3        0.0e+00     86.92          putative pentatricopeptide repeat-containing protein At3g18840 OS=Cucurbita mosc...
A0A6J1I9U9        0.0e+00     87.07          putative pentatricopeptide repeat-containing protein At3g18840 OS=Cucurbita maxi...

Match Name        E-value     Identity (%)   Description
AT3G18840.2       9.9e-217    54.37          Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G13600.1       6.6e-128    36.88          Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1       4.2e-127    36.65          pentatricopeptide (PPR) repeat-containing protein
AT3G02330.1       3.5e-105    28.42          Pentatricopeptide repeat (PPR) superfamily protein
AT1G68930.1       4.6e-105    31.71          pentatricopeptide (PPR) repeat-containing protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR011990   Tetratricopeptide-like helical domain superfamily   GENE3D   1.25.40.10   Tetratricopeptide repeat domain
    coord: 218..320   e-value: 2.6E-15   score: 58.7
    coord: 98..217    e-value: 1.0E-11   score: 46.9
    coord: 321..403   e-value: 3.9E-11   score: 44.9
IPR011990   Tetratricopeptide-like helical domain superfamily   GENE3D   1.25.40.10   Tetratricopeptide repeat domain
    coord: 504..634   e-value: 4.3E-14   score: 54.2
IPR011990   Tetratricopeptide-like helical domain superfamily   GENE3D   1.25.40.10   Tetratricopeptide repeat domain
    coord: 404..503   e-value: 7.0E-22   score: 79.6
    coord: 1..97      e-value: 1.7E-20   score: 75.1
IPR002885   Pentatricopeptide repeat   TIGRFAM   TIGR00756   TIGR00756
    coord: 324..353   e-value: 0.0017    score: 16.4
    coord: 457..490   e-value: 1.9E-9    score: 35.1
    coord: 194..223   e-value: 0.0024    score: 15.9
    coord: 223..256   e-value: 2.8E-5    score: 22.0
IPR002885   Pentatricopeptide repeat   PFAM   PF13041   PPR_2
    coord: 221..268   e-value: 1.8E-8    score: 34.5
    coord: 453..500   e-value: 4.2E-11   score: 42.9
IPR002885   Pentatricopeptide repeat   PFAM   PF01535   PPR
    coord: 355..381   e-value: 0.082     score: 13.2
    coord: 194..213   e-value: 0.051     score: 13.8
    coord: 428..452   e-value: 0.79      score: 10.1
    coord: 25..54     e-value: 5.1E-5    score: 23.3
    coord: 529..551   e-value: 0.025     score: 14.8
    coord: 327..353   e-value: 0.0036    score: 17.5
    coord: 55..78     e-value: 0.11      score: 12.8
    coord: 86..99     e-value: 0.42      score: 11.0
    coord: 162..181   e-value: 0.11      score: 12.8
IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 322..356   score: 8.999285
IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 22..56     score: 9.711769
IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 454..488   score: 12.528824
IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 221..255   score: 10.062531
IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 84..115    score: 8.549871
IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 189..220   score: 8.670445
None        No IPR available   PANTHER   PTHR47924   PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 220..645
    coord: 3..215
    coord: 66..349
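
The `coord:` entries in the InterPro listing are 1-based, inclusive residue ranges on the protein, and hits from one source can sit next to each other or overlap. As a rough illustration only, the sketch below merges the six PROSITE PS51375 (PPR) ranges copied from the listing and counts the residues they cover. The function name `merge_intervals` is hypothetical, and treating directly adjacent ranges as one block is an assumption made for this example.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # Adjacent ranges (e.g. 189..220 and 221..255) are joined as well;
        # this is a choice made for the sketch, not an InterPro convention.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(iv) for iv in merged]


# PROSITE PS51375 (PPR) coordinates copied from the InterPro listing.
prosite_ppr = [(322, 356), (22, 56), (454, 488), (221, 255), (84, 115), (189, 220)]

merged = merge_intervals(prosite_ppr)
# Inclusive coordinates, so each block spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in merged)
```

The same helper could be applied to the coordinate lists of the other sources (GENE3D, PFAM, TIGRFAM) to compare how much of the protein each one annotates.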

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Clc02G05770.1    Clc02G05770.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0071704       organic substance metabolic process
biological_process   GO:0016310       phosphorylation
molecular_function   GO:0016301       kinase activity
molecular_function   GO:0016773       phosphotransferase activity, alcohol group as acceptor
molecular_function   GO:0005515       protein binding