Clc07G10870 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc07G10870
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Location: ClcChr07: 25500767 .. 25507330 (+)
RNA-Seq Expression: Clc07G10870
Synteny: Clc07G10870
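The Location field can be turned into a gene span with a few lines of code. This is a minimal sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page.

```python
import re

def parse_location(text):
    """Parse a string such as 'ClcChr07: 25500767 .. 25507330 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("ClcChr07: 25500767 .. 25507330 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 6,564 bp on the plus strand of ClcChr07.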
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATTTGACAAGATTTAAAATCAATAAGACAGTTCCCGTATTGTTTCCCTTCTCTTGCCGGCTGGCCTGTGTGTTTTCAACTCAACCGCATGAACAACACCACCAGGACCCGCCATGGCAGCTCCAGGATCAGCTGCTCTATTGGGTATCTTCTATTCTCTCTAATTCGTCTCTCGACTCTTCCAAATGTAGAGCCCTCTTACCCCATTTGTCTCCTTTTCAGTTTGATCAGCTCTTCTTCTCCGTTGGATTGAAAGCCAACCCCCACACTTGTCTTAATTTCTTTTACTTTGCGTCTGATTCTTTCAAGTTTCGATTTACCATTCGTTCTTATTGTATATTGATTCTTTTGCTTGTTCATTCCAAGTTTTTACCCCCCGCGAGATTGGTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTCGGATACGAATAAGCTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCGGTTGTTGGACGGTTTGAATGGACACAGGCATTTGATTTGTTGATACATGTATACAGCACACAATTCAAAAATCTTGGCTTTAGTTGCGCTGTTGATGTGTTTTATTTGTTTGCTCGTAAGGGAATCTTTCCATCGTTAAAGACTTGTAGTTTTTTATTGAGCTCTTTGGTAAAGGCTAATGAACTTGAGAAATGTTGTGAAATATTTGAAGTGATGTCCCAAGGTGTTCGTCCAGATGTTTTCTTATTTACGAATGTGATTAATGCTTTGTGCAAGGGAGGGAAGATGGAAAAAGCTATTGAGTTATTCATGAAAATGGAGAAGTTGGGTGTTTCTCCCAATGATGTTACTTATAATATTATTATTCATGGTTTATGCCAGAATGGGAGAATAGACACTGCCTTTGAGCTCAAGGAGAAGATGACAATTAAAGGGGTAAAGCCAAGTCTTATAACTTATTGCGTGCTTATTAATGGTTTGATAAAACGGGAACATTTTGACAAAGTGAATCATGTTTTAGATGAAATGGTTGATGCGGGTTTTGTTCCGAATGATGCTGTCTACAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCAATGAAGCACTTAGGATTAAAGATTTGATGATATACAAAAATATAACTCCTACTTCAGTTACATTACATACTCTCATGCAAGGATTTTGCAAGAGCAATCAAATCGAGCAAGCAGAGAATGCTCTTGAGGAGATATTATCACATGGGCTATCTATAAACCCTGTTACTTGTTATTCGGTTGTCCACTGGTTATGTAAGAAGTCTAGGTTCCATTCTGCATTCCGATTTACTAAGGAAATATTATCAAAGAACTTCAAGCCTACAGATCAACTCTTAACCATATTGGTACGTGGGCTGTGTAAGGACGGTAAACATTTAGAAGCAACCGAACTTTGGTTTAGGCTATTGGAGAAAGGGTCTCCAGCAAGTACAGTAACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAATTTGCCAGGGGCTGTTAGAATAGTCAAAGAGATGTTGGAGAGGGGTTTTTCAATGGATCGGATCACATACAACACACTCATCTTAGGTTTTTGCAAAGTGGGAAAAGTTGAGGAATGCTTTAGACTTAAAGAAGAGATGACCAAACAAGGAATTCAACCAGACATCTACACTTACAATTTTCTATTGCATGGACTATGCAATGCAGGAAAGTTGGATTATGCTATTAAGCTTTGGGATGAATACAAAGCTAGTGGGCTGGTTTCTAATGTTCACACCTACGGAGTAATGATGGATGGTTATTGTAGAGCTAACAGAGTGGAAGATGTTGAAAAATTATTTAATGAATTGGTTGCTAAGAAAATAGAACTGAATACCATTGTCTACAATGTAATAATCAGAGCAAATTGCCAAAATGGAAATGTTGCCGCAGCTTTGCAACATCGTGATGATATGAAAAGCAAGGGAATTTTACCAACCTGTGCCACATATTC
TTCTCTAATACATGGCATGTGCAGCATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAGAGGAAGGGTTGTTGCCGAATGTTGTTTGCTATACTGCATTAATCGGTGGTTATTGTAAGCTGGGGCAAATGGATGCTGCCGAAGCTACTTGGCTTGAGATGACCTCGTTTAACATAGCACCTAACAAATTTACCTATACCGTCATGATTGATGGGTACTGTAAATTAGGGAATATGGAAGAAGCAAATAACCTTCTGAGCAAAATGAAAGAAAGTGGAATCGTTCCAGATGTTGTTACTTACAATGCCTTGACTAATGGATTTTGCAAGGGAAAGAACATGGATAAAGCTTTTGAAGTATGCAATCAAATGGCCACTGGAGGATTATCTTTAGATGAAATTACTTATACAACTCTCTTACATGGTTGGAATCGACCTACAATTATGAGCCAAGACTGATCTAATTTCTGCAGAGGTTCTTCGTTACCTCATTATCTGTTGATTTTGTGATTGTTTTTGTATGTTCAATTGATCTGAAAATTATTCTTTTGTTTTCTCTTTTGATTTTTCCATGAGTGACCAGATCCGTCAGTGTTCTTTTCTTTATGCATTGAGGCTATCCACCTTCTCTTGTTCAAATGAAAGAACCAAAAAGGAGGAGCCGTCCTTAGTCTCTGGAGGTTGGTAGGCTTTATCTTATAGTGTATACCATTTCTGTTTTGTGTATAATGTATCAATAGTTTATATGTGGGGTTGCTACATCTAAAGCATATATTCTGAAAACTTGGTATTACTGTTTAACAGATCATGGGAAGCAGAAGTGCTTAGCTCCTATTGATATAACATGTTGAGCATGTGAACTTAGGGCGCTCTATCAAGATCCTCAGAGAAGAAAATGATGGCATTTGATTTGATAATGATAGAGTGACATTGGTTCTCCGACTCAAAGCCAAGGATGAAAACTCGCAAATGTATTAGGTTGGGTATGTTATCTTTAGTGTAGTATATTTGGCATGGAAAGCTGGAACAGTTGCGGGTTTTTCATAACTTATAATGAACATGTATAGTGTTGATGCAGATTTTGCAGGCTTGAGTCTGTATGTAGCTGTGCTCAGGGAGCAATGCCGTATTTGAGAGTTGTTCATTGGCATATTGGAATTGGAGCATAAAAAGCGCTCAGTTTGAACTTTCATGTATGTGTAAGTTCATCTACTCTATCTTGTGTTCAATCTTATACTCAGAGAAATGACATTGATATGCTCTTTTGTTATTGTTTTTCTTTTACAACTGAAAATGCTGTAAGCACTGTATGCAAGTTTTTCTATAACCAAGGTGTTGGGCCCTACCGCTTGCATTCAGTGACCAATAAGCTTTTGAAGCTTTAGGCCTGAGGTCTCTGAACTGCACGTCCAAGTTCTTTCCTTCTTGTTCTTCTTGTTGTCGTTCTTCTGTTTAAATTATTTTTATTAGGAACTGCACACTTAAATGTTTAAGTTCCATCTAAAAAAAGTGCTGAAAAGTGAATATAATTTGGTTAATACTTTTGTTTTTGTTATAGAAAAGAAAATCTGGCTTGAAGCAAAAATATATAATTGACAACCTTATTTAACAGGCTCGGTTGTGTAGGCTTATTTTATTAATTTTATTTTCTTAAGTTGAACAATATATGGGGTGTGAACTCAAACTTTTGAACTTTAGGTCACGGGTACAACACCTTGACTAGTTGAGTTTGAGCTATGCTTATATTGGTTTTTGTTGTATAGATCGTACAAGTGTCTTTGAGTTCAACAAAAAAAACTTGTTGGATGATGCTATTTTATTTATTCATCTATTTTGGTCTCTAACTTTCAAAAGGTCTACTTAAGTCACTAGACTTAGAAAATACTCATTTTTGGTATTTGTCATTATTCTGCAGTTAATTATTGGGCAAAGGAATGATGAACTTTTAATGTGCGTACTAACCTGTTGAGGTGAGTATTGATTGGAATAT
AACCCAAATCATCGCTCAGAATTTGCTTGCGTCAATATGATATTTCACTTTTTTCTTTTGAACTGATGTGATATTTACACATTATTTAAAGTCATGTTATCCGTTTGTTAAATAATTTATGGCAATATAATGCCAAAGACAAAAACAATTTTTTTTTAAGTTTTGGTCCAAAGTAATCAATGAAAGTTAAAAGATGAAAATCAAATTAATTGGAAAAGTATCAAAATGGTATCCAAAACTATATTTGAGATGTTTTGTGTTCACCGTATATACATTTAGCAAATGGCCAACTAGGCATATACTCTTGACCTAGCTAGAGGGTTAGAGAATCAAATCGTTCCACACATGTAGTCGAACTAAAAAATATATATGTATTCAAATATACGCGCGGTCTGCTTATTTCTTTATGATTTTTGTTTGATGCATATCTGATTTATTTTGCCGCTTGAAAAGTTACTTGTTTGTTGATTTCCCATTCACTTTTCAAAGTTCTTCAGAATACTACATATCCAACTCATCTAATCTCTTTTCTGGTCTTCTTTATTGCCCATGTTAGCCAGATTATTCTCCACTGGTTTATTTATTTATTTGTTGCTATATATGGTAAGTGAAGATATTATATTTAAATATTATGAGATAAAATATTATTGAATTGAAATATTCATTTTAAAGGAAATCTACTTATACCAAAAGCAACCCACGGGTATATTCTCTTCTTATGTGGAGAAAAAATCAGGAAAGATTTAAATGGATCAATAAAATCCTTTCCTTCTGCAAATGGACATGATTCACTCTGGTTTCCTTGATTTTGCAGAGGTATTCAATCTACCCAGTTTGCGAATCAATTTCAGCCGCATGGAATTCGAATAAGCCCTTCACTCTGTGAGTGGCGTTACTTTCTCTATCCATACTTAGTCATTTAAAATGTGTGCTATAAATCCACCTTTTGGTTATAGTGTCGGTATATCTAATCTTTAATTTTCTTAAACTAGCTTGTCTCGGAGAAATTCGCTTCTTGTTTGTATACCACGTCAGCAAGCTTGTAAGCACAGTCGATTCCAGTTCATTATATATAACAAACACAACCACAAATTTTCTGAAATTTTCTGTGCTTGTTGTTGATTGAATCTTTGTATTTACCTCTAAATTGGACTTATTGAGGTTAATTGATACAATTGCTTTTTAAATGGATATTGGTGATTTAATCGGTTTAACAACAAAAGGGCCTGGCTCTGTTTGCTTTTTGTGAATTCAAAAGCAATAAGGTAATCTGGATGGAGCAACAGGCCTATGTTAAACACCCTTTTAAATGTTTTCATCGTTAAACCAATTGACCTGGGTGTTAACTACAGCCCCACCGCCATCGGGAATGCTCTCGAGGACCTTCTCGTGGCCTTTAGGGTAATAAATTCCAGTTCGCCCGTCAAGCACCCAAGGAGTTGCCTCACCCTCCACATGGCTCTTATTGCTCTTCAGTTCCTTGCTTTCTGTTTCAACTTTCTCCCTCATCATATCTGGAGAAGAGCTTTTCCTGTGGAAGCCCTTTCGCCCAATCGGTCTGCTAAAGAATGCAACCGAATCAGAATCAGAAAAATAGGTAACATTCTGTATCGATATTCCACGTGTTGTGTTATACATCTCAAGTAAATACAAATTTAGATAATGTAATAATTGAACCAGCCAACAACCGTACTTCAAACAGAACCTAGTTGAGTTCGGTTTCATTGCTGTTTTTGGTTTTGGTCTGCAGCATTCTTATCTTAGGTCATGTTTACTACTACATAATCAAAATTATTGCAATCGTCGAATTTTCTTCATTATTCACAATTCACGTGATGTTGTTAAACTAGTTTTTTGTGGACGTATTTAAATAAGCTTCACTTTTCATATTTTGATTGATTGATTGATTACGTTTGATGGAAAGTAAACGTGGCCTTAGATAAACGATAAGTTTATGTTGCAAATACAAATAAATGTTTCAAAATTTTCTGATTTTTTTG
AAAAATGTTTTCTTGCAGATAAACAGAATCATAAACATGAGAAATTAGTTGATGGCTGAGTTGAAGAGGGGTCTTACTTGAAGAACATGGTGTTCATTGTCTGAGCTCCTCTCACAAATGCTGTGGCCATACCTTCTCCCAATAAGGTAAGTGAAGATGTGATGGACAGTGGAATTTGCTGGATTTGTATTTATAGGTGAAAAGAGAGCCTATCCATTTCCTCCATCCCAATGAAGATTCTGAAGAATGGGATGAATTTGTCCTTTTACTCCTTTATGCTCCCCCCCACTCCTTGGAGTTGGTTTGTGAAAATGAACCTTCAAAATTCCTCTTCCCATACTTCATAAGCTGTATGTACATGAGCCTAATAGGCTTTGCTTTCTCTGAGGAGCTAAAGTTTGGGCTTGGACGTAGTGGGCCTAATGGGCTTTGCTTTGCCTCTCTTTTGGTCTGTACCTTCCTGGCCCATTAATGAAATTATTGATTCTTGTTGTCAGGAATATCAATACTTAACTGGTGAATTGATGATTTATTAAGGGAATTTGGAGTGGATAGCTAAAATTA

mRNA sequence

ATGCATTTGACAAGATTTAAAATCAATAAGACAGTTCCCGTATTGTTTCCCTTCTCTTGCCGGCTGGCCTGTGTGTTTTCAACTCAACCGCATGAACAACACCACCAGGACCCGCCATGGCAGCTCCAGGATCAGCTGCTCTATTGGGTATCTTCTATTCTCTCTAATTCGTCTCTCGACTCTTCCAAATGTAGAGCCCTCTTACCCCATTTGTCTCCTTTTCAGTTTGATCAGCTCTTCTTCTCCGTTGGATTGAAAGCCAACCCCCACACTTGTCTTAATTTCTTTTACTTTGCGTCTGATTCTTTCAAGTTTCGATTTACCATTCGTTCTTATTGTATATTGATTCTTTTGCTTGTTCATTCCAAGTTTTTACCCCCCGCGAGATTGGTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTCGGATACGAATAAGCTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCGGTTGTTGGACGGTTTGAATGGACACAGGCATTTGATTTGTTGATACATGTATACAGCACACAATTCAAAAATCTTGGCTTTAGTTGCGCTGTTGATGTGTTTTATTTGTTTGCTCGTAAGGGAATCTTTCCATCGTTAAAGACTTGTAGTTTTTTATTGAGCTCTTTGGTAAAGGCTAATGAACTTGAGAAATGTTGTGAAATATTTGAAGTGATGTCCCAAGGTGTTCGTCCAGATGTTTTCTTATTTACGAATGTGATTAATGCTTTGTGCAAGGGAGGGAAGATGGAAAAAGCTATTGAGTTATTCATGAAAATGGAGAAGTTGGGTGTTTCTCCCAATGATGTTACTTATAATATTATTATTCATGGTTTATGCCAGAATGGGAGAATAGACACTGCCTTTGAGCTCAAGGAGAAGATGACAATTAAAGGGGTAAAGCCAAGTCTTATAACTTATTGCGTGCTTATTAATGGTTTGATAAAACGGGAACATTTTGACAAAGTGAATCATGTTTTAGATGAAATGGTTGATGCGGGTTTTGTTCCGAATGATGCTGTCTACAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCAATGAAGCACTTAGGATTAAAGATTTGATGATATACAAAAATATAACTCCTACTTCAGTTACATTACATACTCTCATGCAAGGATTTTGCAAGAGCAATCAAATCGAGCAAGCAGAGAATGCTCTTGAGGAGATATTATCACATGGGCTATCTATAAACCCTGTTACTTGTTATTCGGTTGTCCACTGGTTATGTAAGAAGTCTAGGTTCCATTCTGCATTCCGATTTACTAAGGAAATATTATCAAAGAACTTCAAGCCTACAGATCAACTCTTAACCATATTGGTACGTGGGCTGTGTAAGGACGGTAAACATTTAGAAGCAACCGAACTTTGGTTTAGGCTATTGGAGAAAGGGTCTCCAGCAAGTACAGTAACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAATTTGCCAGGGGCTGTTAGAATAGTCAAAGAGATGTTGGAGAGGGGTTTTTCAATGGATCGGATCACATACAACACACTCATCTTAGGTTTTTGCAAAGTGGGAAAAGTTGAGGAATGCTTTAGACTTAAAGAAGAGATGACCAAACAAGGAATTCAACCAGACATCTACACTTACAATTTTCTATTGCATGGACTATGCAATGCAGGAAAGTTGGATTATGCTATTAAGCTTTGGGATGAATACAAAGCTAGTGGGCTGGTTTCTAATGTTCACACCTACGGAGTAATGATGGATGGTTATTGTAGAGCTAACAGAGTGGAAGATGTTGAAAAATTATTTAATGAATTGGTTGCTAAGAAAATAGAACTGAATACCATTGTCTACAATGTAATAATCAGAGCAAATTGCCAAAATGGAAATGTTGCCGCAGCTTTGCAACATCGTGATGATATGAAAAGCAAGGGAATTTTACCAACCTGTGCCACATATTC
TTCTCTAATACATGGCATGTGCAGCATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAGAGGAAGGGTTGTTGCCGAATGTTGTTTGCTATACTGCATTAATCGGTGGTTATTGTAAGCTGGGGCAAATGGATGCTGCCGAAGCTACTTGGCTTGAGATGACCTCGTTTAACATAGCACCTAACAAATTTACCTATACCGTCATGATTGATGGGTACTGTAAATTAGGGAATATGGAAGAAGCAAATAACCTTCTGAGCAAAATGAAAGAAAGTGGAATCGTTCCAGATGTTGTTACTTACAATGCCTTGACTAATGGATTTTGCAAGGGAAAGAACATGGATAAAGCTTTTGAAGTATGCAATCAAATGGCCACTGGAGGATTATCTTTAGATGAAATTACTTATACAACTCTCTTACATGCCAAGACTGATCTAATTTCTGCAGAGATCCGTCAGTGTTCTTTTCTTTATGCATTGAGGCTATCCACCTTCTCTTGTTCAAATGAAAGAACCAAAAAGGAGGAGCCGTCCTTAGTCTCTGGAGATCATGGGAAGCAGAAAGTGACATTGGTTCTCCGACTCAAAGCCAAGGATGAAAACTCGCAAATATTTTGCAGGCTTGAGTCTGTATGTAGCTGTGCTCAGGGAGCAATGCCGTATTTGAGAGTTTTAATTATTGGGCAAAGGAATGATGAACTTTTAATGAAATCTACTTATACCAAAAGCAACCCACGGAGGTATTCAATCTACCCAGTTTGCGAATCAATTTCAGCCGCATGGAATTCGAATAAGCCCTTCACTCTCCCCACCGCCATCGGGAATGCTCTCGAGGACCTTCTCGTGGCCTTTAGGGTAATAAATTCCAGTTCGCCCGTCAAGCACCCAAGGAGTTGCCTCACCCTCCACATGGCTCTTATTGCTCTTCAGTTCCTTGCTTTCTGTTTCAACTTTCTCCCTCATCATATCTGGAGAAGAGCTTTTCCTGTGGAAGCCCTTTCGCCCAATCGGTCTGCTAAAGAATGCAACCGAATCAGAATCAGAAAAATAGATAAACAGAATCATAAACATGAGAAATTAGTTGATGGCTGAGTTGAAGAGGGGTCTTACTTGAAGAACATGGTGTTCATTGTCTGAGCTCCTCTCACAAATGCTGTGGCCATACCTTCTCCCAATAAGGTAAGTGAAGATGTGATGGACAGTGGAATTTGCTGGATTTGTATTTATAGGTGAAAAGAGAGCCTATCCATTTCCTCCATCCCAATGAAGATTCTGAAGAATGGGATGAATTTGTCCTTTTACTCCTTTATGCTCCCCCCCACTCCTTGGAGTTGGTTTGTGAAAATGAACCTTCAAAATTCCTCTTCCCATACTTCATAAGCTGTATGTACATGAGCCTAATAGGCTTTGCTTTCTCTGAGGAGCTAAAGTTTGGGCTTGGACGTAGTGGGCCTAATGGGCTTTGCTTTGCCTCTCTTTTGGTCTGTACCTTCCTGGCCCATTAATGAAATTATTGATTCTTGTTGTCAGGAATATCAATACTTAACTGGTGAATTGATGATTTATTAAGGGAATTTGGAGTGGATAGCTAAAATTA

Coding sequence (CDS)

ATGCATTTGACAAGATTTAAAATCAATAAGACAGTTCCCGTATTGTTTCCCTTCTCTTGCCGGCTGGCCTGTGTGTTTTCAACTCAACCGCATGAACAACACCACCAGGACCCGCCATGGCAGCTCCAGGATCAGCTGCTCTATTGGGTATCTTCTATTCTCTCTAATTCGTCTCTCGACTCTTCCAAATGTAGAGCCCTCTTACCCCATTTGTCTCCTTTTCAGTTTGATCAGCTCTTCTTCTCCGTTGGATTGAAAGCCAACCCCCACACTTGTCTTAATTTCTTTTACTTTGCGTCTGATTCTTTCAAGTTTCGATTTACCATTCGTTCTTATTGTATATTGATTCTTTTGCTTGTTCATTCCAAGTTTTTACCCCCCGCGAGATTGGTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTCGGATACGAATAAGCTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCGGTTGTTGGACGGTTTGAATGGACACAGGCATTTGATTTGTTGATACATGTATACAGCACACAATTCAAAAATCTTGGCTTTAGTTGCGCTGTTGATGTGTTTTATTTGTTTGCTCGTAAGGGAATCTTTCCATCGTTAAAGACTTGTAGTTTTTTATTGAGCTCTTTGGTAAAGGCTAATGAACTTGAGAAATGTTGTGAAATATTTGAAGTGATGTCCCAAGGTGTTCGTCCAGATGTTTTCTTATTTACGAATGTGATTAATGCTTTGTGCAAGGGAGGGAAGATGGAAAAAGCTATTGAGTTATTCATGAAAATGGAGAAGTTGGGTGTTTCTCCCAATGATGTTACTTATAATATTATTATTCATGGTTTATGCCAGAATGGGAGAATAGACACTGCCTTTGAGCTCAAGGAGAAGATGACAATTAAAGGGGTAAAGCCAAGTCTTATAACTTATTGCGTGCTTATTAATGGTTTGATAAAACGGGAACATTTTGACAAAGTGAATCATGTTTTAGATGAAATGGTTGATGCGGGTTTTGTTCCGAATGATGCTGTCTACAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCAATGAAGCACTTAGGATTAAAGATTTGATGATATACAAAAATATAACTCCTACTTCAGTTACATTACATACTCTCATGCAAGGATTTTGCAAGAGCAATCAAATCGAGCAAGCAGAGAATGCTCTTGAGGAGATATTATCACATGGGCTATCTATAAACCCTGTTACTTGTTATTCGGTTGTCCACTGGTTATGTAAGAAGTCTAGGTTCCATTCTGCATTCCGATTTACTAAGGAAATATTATCAAAGAACTTCAAGCCTACAGATCAACTCTTAACCATATTGGTACGTGGGCTGTGTAAGGACGGTAAACATTTAGAAGCAACCGAACTTTGGTTTAGGCTATTGGAGAAAGGGTCTCCAGCAAGTACAGTAACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAATTTGCCAGGGGCTGTTAGAATAGTCAAAGAGATGTTGGAGAGGGGTTTTTCAATGGATCGGATCACATACAACACACTCATCTTAGGTTTTTGCAAAGTGGGAAAAGTTGAGGAATGCTTTAGACTTAAAGAAGAGATGACCAAACAAGGAATTCAACCAGACATCTACACTTACAATTTTCTATTGCATGGACTATGCAATGCAGGAAAGTTGGATTATGCTATTAAGCTTTGGGATGAATACAAAGCTAGTGGGCTGGTTTCTAATGTTCACACCTACGGAGTAATGATGGATGGTTATTGTAGAGCTAACAGAGTGGAAGATGTTGAAAAATTATTTAATGAATTGGTTGCTAAGAAAATAGAACTGAATACCATTGTCTACAATGTAATAATCAGAGCAAATTGCCAAAATGGAAATGTTGCCGCAGCTTTGCAACATCGTGATGATATGAAAAGCAAGGGAATTTTACCAACCTGTGCCACATATTC
TTCTCTAATACATGGCATGTGCAGCATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAGAGGAAGGGTTGTTGCCGAATGTTGTTTGCTATACTGCATTAATCGGTGGTTATTGTAAGCTGGGGCAAATGGATGCTGCCGAAGCTACTTGGCTTGAGATGACCTCGTTTAACATAGCACCTAACAAATTTACCTATACCGTCATGATTGATGGGTACTGTAAATTAGGGAATATGGAAGAAGCAAATAACCTTCTGAGCAAAATGAAAGAAAGTGGAATCGTTCCAGATGTTGTTACTTACAATGCCTTGACTAATGGATTTTGCAAGGGAAAGAACATGGATAAAGCTTTTGAAGTATGCAATCAAATGGCCACTGGAGGATTATCTTTAGATGAAATTACTTATACAACTCTCTTACATGCCAAGACTGATCTAATTTCTGCAGAGATCCGTCAGTGTTCTTTTCTTTATGCATTGAGGCTATCCACCTTCTCTTGTTCAAATGAAAGAACCAAAAAGGAGGAGCCGTCCTTAGTCTCTGGAGATCATGGGAAGCAGAAAGTGACATTGGTTCTCCGACTCAAAGCCAAGGATGAAAACTCGCAAATATTTTGCAGGCTTGAGTCTGTATGTAGCTGTGCTCAGGGAGCAATGCCGTATTTGAGAGTTTTAATTATTGGGCAAAGGAATGATGAACTTTTAATGAAATCTACTTATACCAAAAGCAACCCACGGAGGTATTCAATCTACCCAGTTTGCGAATCAATTTCAGCCGCATGGAATTCGAATAAGCCCTTCACTCTCCCCACCGCCATCGGGAATGCTCTCGAGGACCTTCTCGTGGCCTTTAGGGTAATAAATTCCAGTTCGCCCGTCAAGCACCCAAGGAGTTGCCTCACCCTCCACATGGCTCTTATTGCTCTTCAGTTCCTTGCTTTCTGTTTCAACTTTCTCCCTCATCATATCTGGAGAAGAGCTTTTCCTGTGGAAGCCCTTTCGCCCAATCGGTCTGCTAAAGAATGCAACCGAATCAGAATCAGAAAAATAGATAAACAGAATCATAAACATGAGAAATTAGTTGATGGCTGA

Protein sequence

MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFELKEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCKMGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVTCYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLLEKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGILPTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEATWLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKAFEVCNQMATGGLSLDEITYTTLLHAKTDLISAEIRQCSFLYALRLSTFSCSNERTKKEEPSLVSGDHGKQKVTLVLRLKAKDENSQIFCRLESVCSCAQGAMPYLRVLIIGQRNDELLMKSTYTKSNPRRYSIYPVCESISAAWNSNKPFTLPTAIGNALEDLLVAFRVINSSSPVKHPRSCLTLHMALIALQFLAFCFNFLPHHIWRRAFPVEALSPNRSAKECNRIRIRKIDKQNHKHEKLVDG
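The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and the helper name `translate` is hypothetical. Whitespace is stripped first so that sequences broken across lines can be pasted in directly.

```python
# Standard genetic code (NCBI table 1), built from the compact TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last three codons of the CDS listed above.
print(translate("ATGCATTTGACAAGATTTAAAATC"))
print(translate("GATGGCTGA"))
```

The two fragments translate to MHLTRFKI and DG, which match the start and end of the protein sequence shown above.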
Homology
BLAST of Clc07G10870 vs. NCBI nr
Match: XP_038884789.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884795.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884803.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884810.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884818.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida])

HSP 1 Score: 1463.4 bits (3787), Expect = 0.0e+00
Identity = 712/811 (87.79%), Positives = 761/811 (93.83%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           M LTRF INKTVPV FPFS RLAC+ STQPH++HHQDPP  +Q+QL YWVSS+LSNSSLD
Sbjct: 1   MRLTRFNINKTVPVFFPFSRRLACLLSTQPHKEHHQDPPRHIQEQLHYWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYCILILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPITCLNFFYFASDSFKFRFTIRSYCILILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDG LPVLNSD+NKLHIEIANALFGLTSVVGRFEWTQ FD LIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANALFGLTSVVGRFEWTQLFDFLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQFKN G +CAVDVFYLFARKGIFPS+KTC+FLLSSLVKANELEKCCE FEVMSQGVR
Sbjct: 181 YSTQFKNFGLNCAVDVFYLFARKGIFPSIKTCNFLLSSLVKANELEKCCEGFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVFLFTN INALCKGGKMEKAIEL MKMEKLG+SPN VTYN IIHGLCQNGR+D AFEL
Sbjct: 241 PDVFLFTNAINALCKGGKMEKAIELLMKMEKLGISPNVVTYNCIIHGLCQNGRLDNAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMT++GV+P+L TY  LINGL K ++FDKVNHVLDEMVDAG  PN  VYNNLIDGYCK
Sbjct: 301 KEKMTMEGVQPNLKTYGALINGLTKLKYFDKVNHVLDEMVDAGIDPNVIVYNNLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MGNINEALRIKD+M+ KNI+PTSVTL+TLMQGFCKS+QIEQAENALEEILS+GLSINP T
Sbjct: 361 MGNINEALRIKDVMMSKNISPTSVTLYTLMQGFCKSDQIEQAENALEEILSNGLSINPDT 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKKSR++SAF+FTK +L+KNF+P DQLLTILVRGLC+DGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKSRYYSAFQFTKVMLAKNFRPRDQLLTILVRGLCEDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAST+TSNALIHGLCGAGNLP AVRIVKEMLERG  MDR+TYN LILGFCK GKVE
Sbjct: 481 EKGSPASTLTSNALIHGLCGAGNLPEAVRIVKEMLERGIPMDRMTYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
           E F+LKE+MTKQGIQPDIYT NFLLHGLCNAGKLD AIKLWDE+KASGLVSNVHTYGVMM
Sbjct: 541 EGFKLKEKMTKQGIQPDIYTCNFLLHGLCNAGKLDDAIKLWDEFKASGLVSNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           D YC+ANR+EDVEKLFNELV+KK+E NTIVYN+ IRANC NGNVAAALQ  DDMKSKGIL
Sbjct: 601 DVYCKANRIEDVEKLFNELVSKKMEPNTIVYNLFIRANCHNGNVAAALQLCDDMKSKGIL 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           P CATYSSLIHGMC+IGLVE+AKHLIDEMR+EGLLPNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PNCATYSSLIHGMCNIGLVENAKHLIDEMRKEGLLPNVVCYTALIGGYCKLGQMDTAEAT 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEM SFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK
Sbjct: 721 WLEMISFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 812
           GK+MDKAF+VC+QMATGGLSLDEITYTTL+H
Sbjct: 781 GKDMDKAFKVCDQMATGGLSLDEITYTTLVH 811
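The percentages on each HSP statistics line are the matched positions divided by the alignment length. As a hedged illustration, the sketch below recomputes them from a statistics line in the format used on this page; the function name `hsp_percentages` is hypothetical, and the regular expression accepts any label before the `=` sign.

```python
import re

def hsp_percentages(stats_line):
    """Return {label: percent} for every 'Label = n/m' pair on the line."""
    pairs = re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stats_line)
    return {label: 100.0 * int(num) / int(den) for label, num, den in pairs}

line = "Identity = 712/811 (87.79%), Positives = 761/811 (93.83%), Query Frame = 0"
for label, pct in hsp_percentages(line).items():
    print(f"{label}: {pct:.2f}%")
```

Fields without a fraction, such as Query Frame, are skipped because the pattern requires an `n/m` value.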

BLAST of Clc07G10870 vs. NCBI nr
Match: XP_023552294.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023552295.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023552296.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1446.0 bits (3742), Expect = 0.0e+00
Identity = 703/811 (86.68%), Positives = 750/811 (92.48%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFS ++ACV ST+PH++HHQDPPWQLQDQLLY VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLID  LPVLNSD NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDRKLPVLNSDLNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NLGFS AVDVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGV 
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVS 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLG+SPN VTYN IIHGLCQNGR+  AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GVKPSLITY VLINGL K E FDK N VL+EMVDAGFVPN  VYN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MG INEAL+I+D+M+ KNITPTSVTL+TL+QGFCK+NQIEQAEN LEEILS G  INPVT
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKNNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSV+HWLC KSRFH A RFT  +LSKNF+P+DQLLTILV GLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAST TSNALIHGLCGAG +  AVRI+KEMLERGFS+DRITYNTLILG CK GKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYT N LL+GLCNAGKLD AIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           D YC+ANR+EDVEKLFNELV KK+ELN+IVYN+ IRA+C+NGNVAAALQ RDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           PTCATYSSLIHGMC+IG VEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PTCATYSSLIHGMCNIGRVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEMTSFNI PNK TYTVMIDGYCK+GNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSFNIRPNKITYTVMIDGYCKIGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 812
           GK+MDKAF+ C++MATGGLSLDEITYTTL+H
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVH 811

BLAST of Clc07G10870 vs. NCBI nr
Match: KAG6577115.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1446.0 bits (3742), Expect = 0.0e+00
Identity = 703/811 (86.68%), Positives = 750/811 (92.48%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFS ++ CV ST+PH++HHQDPPWQLQDQLLY VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVVCVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NLGFS AVDVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLG+SPN VTYN IIHGLCQNGR+  AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GVKPSLITY VLINGL K E FDK N VL+EMVD GFVPN  VYN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDVGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MG INEAL+I+D+M+ KNITPTSVTL+TL+QGFCKSNQIEQAEN LEEILS G  INPVT
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKSNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSV+HWLC KSRFH A RFT  +LSKNF+P+DQLLTILV GLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAST TSNALIHGLCGAG +  AVRI+KEMLERGFS+DRITYNTLILG CK GKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYT N LL+GLCNAGKLD AIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           D YC+ANR+EDVEKLF+ELV KK+ELN+IVYN+ IRA+C+NGNVAAALQ RDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFDELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           PTCATYSSLIHGMC+IGLVEDAK LIDEMREEGLLPNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PTCATYSSLIHGMCNIGLVEDAKRLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEMTS NI PNK TYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSLNIRPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 812
           GK+MDKAF+ C++MATGGLSLDEITYTTL+H
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVH 811

BLAST of Clc07G10870 vs. NCBI nr
Match: XP_022984601.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita maxima] >XP_022984602.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita maxima] >XP_022984603.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 701/811 (86.44%), Positives = 751/811 (92.60%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKT+PV+FPFS ++ACV ST+PH++HHQDPPWQLQDQLLY VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTLPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NL FS AVDVFYLFARKGIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLCFSYAVDVFYLFARKGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLG+SPN VTYN +IHGLCQNGR+  AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSVIHGLCQNGRLGDAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GVKPSLITY VLINGL K E FDK N VL+EMVDAGFVPN  VYN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MG INEAL+I+D+M+ KNITPTSVT +TL+QGFCKSNQIEQA+N LEEILS G  INPVT
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTFYTLLQGFCKSNQIEQAQNTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSV+HWLC K RFH A RFT  +L KNF+P+DQLLTILV GLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKFRFHYALRFTTVMLLKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAST TSNALIHGLCGAG +  AVRI+KEMLERGFS+DRITYNTLILG CK GKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYT NFLL+GLCNAGKLD AIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNFLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           D YC+ANR+EDVEKLFNELV KK+ELN+IVYN+ IRA+C+NGNVAAALQ RDDMKSKGI 
Sbjct: 601 DAYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           PTCATYSSLIHGMC+IG VEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PTCATYSSLIHGMCNIGRVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEMTS NI+PNK TYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSLNISPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 812
           GK+MDKAF+ C++MATGGLSLDEITYTTL+H
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVH 811

BLAST of Clc07G10870 vs. NCBI nr
Match: XP_022931380.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata] >XP_022931381.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata] >XP_022931382.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1436.8 bits (3718), Expect = 0.0e+00
Identity = 700/811 (86.31%), Positives = 747/811 (92.11%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFS ++ACV ST+PH++HHQDPPWQLQDQLLY VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFD LFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDHLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NLGFS AVDVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLG+SPN VTYN IIHGLCQNGR+  AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GVKPSLITY VLINGL K E FDK N VL+EMV AGFVPN  VYN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVGAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MG INEAL+I+D+M+ KNITPTSVTL+TL+QGFCKSNQIEQAEN LEEILS G  INPVT
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKSNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSV+HWLC KSRFH A RFT  +LSKNF+P+DQLLTILV GLCKDGKHLEATELWFRL 
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLF 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAST TSNALIHGLCGAG +  AVRI+KEMLERGFS+DRITYNT ILG CK GKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTFILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYT N LL+GLCNAGKLD AIKLW E+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWGEFKASGLISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           D YC+ANR+EDVEKLFNELV KK+ELN+IVYN+ IRA+C+NGNVAAALQ RDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           PTCATYSSLIHGMC+IGLVEDAK LIDEMREEGLLPNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PTCATYSSLIHGMCNIGLVEDAKRLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           +LEMTS NI PNK TYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK
Sbjct: 721 FLEMTSLNIRPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 811
           GK+MDKAF+ C++MATGGLSLDEITYTTL+H
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVH 811

BLAST of Clc07G10870 vs. ExPASy Swiss-Prot
Match: Q940A6 (Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g19440 PE=2 SV=2)

HSP 1 Score: 736.9 bits (1901), Expect = 3.2e-211
Identity = 375/740 (50.68%), Positives = 504/740 (68.11%), Query Frame = 0

Query: 50  VSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTI 109
           +SS+LS  SLD  +C+ L+  LSP +FD+LF     K NP T L+FF  ASDSF F F++
Sbjct: 80  LSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDSFSFSFSL 139

Query: 110 RSYCILILLLVHSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFE 169
           RSYC+LI LL+ +  L  AR+VLIRLI+GN+PVL        + IA+A+  L+       
Sbjct: 140 RSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEI 199

Query: 170 WTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCC 229
             +  DLLI VY TQFK  G   A+DVF + A KG+FPS  TC+ LL+SLV+ANE +KCC
Sbjct: 200 RRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCC 259

Query: 230 EIFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLC 289
           E F+V+ +GV PDV+LFT  INA CKGGK+E+A++LF KME+ GV+PN VT+N +I GL 
Sbjct: 260 EAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDGLG 319

Query: 290 QNGRIDTAFELKEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDA 349
             GR D AF  KEKM  +G++P+LITY +L+ GL + +       VL EM   GF PN  
Sbjct: 320 MCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKRIGDAYFVLKEMTKKGFPPNVI 379

Query: 350 VYNNLIDGYCKMGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEI 409
           VYNNLID + + G++N+A+ IKDLM+ K ++ TS T +TL++G+CK+ Q + AE  L+E+
Sbjct: 380 VYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNTLIKGYCKNGQADNAERLLKEM 439

Query: 410 LSHGLSINPVTCYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKH 469
           LS G ++N  +  SV+  LC    F SA RF  E+L +N  P   LLT L+ GLCK GKH
Sbjct: 440 LSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRNMSPGGGLLTTLISGLCKHGKH 499

Query: 470 LEATELWFRLLEKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTL 529
            +A ELWF+ L KG    T TSNAL+HGLC AG L  A RI KE+L RG  MDR++YNTL
Sbjct: 500 SKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAFRIQKEILGRGCVMDRVSYNTL 559

Query: 530 ILGFCKVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGL 589
           I G C   K++E F   +EM K+G++PD YTY+ L+ GL N  K++ AI+ WD+ K +G+
Sbjct: 560 ISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGLFNMNKVEEAIQFWDDCKRNGM 619

Query: 590 VSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQ 649
           + +V+TY VM+DG C+A R E+ ++ F+E+++K ++ NT+VYN +IRA C++G ++ AL+
Sbjct: 620 LPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNTVVYNHLIRAYCRSGRLSMALE 679

Query: 650 HRDDMKSKGILPTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYC 709
            R+DMK KGI P  ATY+SLI GM  I  VE+AK L +EMR EGL PNV  YTALI GY 
Sbjct: 680 LREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEEMRMEGLEPNVFHYTALIDGYG 739

Query: 710 KLGQMDAAEATWLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVV 769
           KLGQM   E    EM S N+ PNK TYTVMI GY + GN+ EA+ LL++M+E GIVPD +
Sbjct: 740 KLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGNVTEASRLLNEMREKGIVPDSI 799

Query: 770 TYNALTNGFCKGKNMDKAFE 789
           TY     G+ K   + +AF+
Sbjct: 800 TYKEFIYGYLKQGGVLEAFK 819

BLAST of Clc07G10870 vs. ExPASy Swiss-Prot
Match: Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 365.5 bits (937), Expect = 1.9e-99
Identity = 238/801 (29.71%), Positives = 378/801 (47.19%), Query Frame = 0

Query: 83  VGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLPPARLVLIRLIDGNLPV 142
           +G   +P   L FF F      F  +  S+CILI  LV +    PA  +L  L+   L  
Sbjct: 78  IGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLL---LRA 137

Query: 143 LNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFAR 202
           L         ++ N LF       +   + +FDLLI  Y    + L     V VF +   
Sbjct: 138 LKPS------DVFNVLFSCYEKC-KLSSSSSFDLLIQHYVRSRRVLD---GVLVFKMMIT 197

Query: 203 K-GIFPSLKTCSFLLSSLVKANELEKCCEIF-EVMSQGVRPDVFLFTNVINALCKGGKME 262
           K  + P ++T S LL  LVK        E+F +++S G+RPDV+++T VI +LC+   + 
Sbjct: 198 KVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLS 257

Query: 263 KAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFELKEKMTIKGVKPSLITYCVLI 322
           +A E+   ME  G   N V YN++I GLC+  ++  A  +K+ +  K +KP ++TYC L+
Sbjct: 258 RAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLV 317

Query: 323 NGLIKREHFDKVNHVLDEM-----------------------------------VDAGFV 382
            GL K + F+    ++DEM                                   VD G  
Sbjct: 318 YGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVS 377

Query: 383 PNDAVYNNLIDGYCKMGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENA 442
           PN  VYN LID  CK    +EA  + D M    + P  VT   L+  FC+  +++ A + 
Sbjct: 378 PNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSF 437

Query: 443 LEEILSHGLSINPVTCYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCK 502
           L E++  GL ++     S+++  CK     +A  F  E+++K  +PT    T L+ G C 
Sbjct: 438 LGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCS 497

Query: 503 DGKHLEATELWFRLLEKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRIT 562
            GK  +A  L+  +  KG   S  T   L+ GL  AG +  AV++  EM E     +R+T
Sbjct: 498 KGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVT 557

Query: 563 YNTLILGFCKVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYK 622
           YN +I G+C+ G + + F   +EMT++GI PD Y+Y  L+HGLC  G+   A    D   
Sbjct: 558 YNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLH 617

Query: 623 ASGLVSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVA 682
                 N   Y  ++ G+CR  ++E+   +  E+V + ++L+ + Y V+I  + ++ +  
Sbjct: 618 KGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRK 677

Query: 683 AALQHRDDMKSKGILPTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALI 742
                  +M  +G+ P    Y+S+I      G  ++A  + D M  EG +PN V YTA+I
Sbjct: 678 LFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVI 737

Query: 743 GGYCKLGQMDAAEATWLEMTSFNIAPNKF------------------------------- 802
            G CK G ++ AE    +M   +  PN+                                
Sbjct: 738 NGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLL 797

Query: 803 ----TYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKAFEV 812
               TY ++I G+C+ G +EEA+ L+++M   G+ PD +TY  + N  C+  ++ KA E+
Sbjct: 798 ANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIEL 857

BLAST of Clc07G10870 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 347.4 bits (890), Expect = 5.4e-94
Identity = 218/743 (29.34%), Positives = 349/743 (46.97%), Query Frame = 0

Query: 109 IRSYCILILLLVHSKFLPPARLVL--IRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVG 168
           ++  CI   +LV ++   PAR +L  + L+ G                ++ +FG      
Sbjct: 72  VQLVCITTHILVRARMYDPARHILKELSLMSGK---------------SSFVFGALMTTY 131

Query: 169 RF--EWTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANE 228
           R        +D+LI VY    +      ++++F L    G  PS+ TC+ +L S+VK+ E
Sbjct: 132 RLCNSNPSVYDILIRVY---LREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGE 191

Query: 229 -LEKCCEIFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNI 288
            +     + E++ + + PDV  F  +IN LC  G  EK+  L  KMEK G +P  VTYN 
Sbjct: 192 DVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNT 251

Query: 289 IIHGLCQNGRIDTAFELKEKMTIKGV---------------------------------- 348
           ++H  C+ GR   A EL + M  KGV                                  
Sbjct: 252 VLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRM 311

Query: 349 -KPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCKMGNINEAL 408
             P+ +TY  LING          + +L+EM+  G  PN   +N LIDG+   GN  EAL
Sbjct: 312 IHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEAL 371

Query: 409 RIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVTCYSVVHWL 468
           ++  +M  K +TP+ V+   L+ G CK+ + + A      +  +G+ +  +T   ++  L
Sbjct: 372 KMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGL 431

Query: 469 CKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLLEKGSPAST 528
           CK      A     E+      P     + L+ G CK G+   A E+  R+   G   + 
Sbjct: 432 CKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNG 491

Query: 529 VTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVEECFRLKEE 588
           +  + LI+  C  G L  A+RI + M+  G + D  T+N L+   CK GKV E       
Sbjct: 492 IIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRC 551

Query: 589 MTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMMDGYCRANR 648
           MT  GI P+  +++ L++G  N+G+   A  ++DE    G      TYG ++ G C+   
Sbjct: 552 MTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGH 611

Query: 649 VEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGILPTCATYSS 708
           + + EK    L A    ++T++YN ++ A C++GN+A A+    +M  + ILP   TY+S
Sbjct: 612 LREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTS 671

Query: 709 LIHGMCSIGLVEDAKHLIDEMREEG-LLPNVVCYTALIGGYCKLGQMDAAEATWLEMTSF 768
           LI G+C  G    A     E    G +LPN V YT  + G  K GQ  A      +M + 
Sbjct: 672 LISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNL 731

Query: 769 NIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKA 811
              P+  T   MIDGY ++G +E+ N+LL +M      P++ TYN L +G+ K K++  +
Sbjct: 732 GHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTS 791

BLAST of Clc07G10870 vs. ExPASy Swiss-Prot
Match: Q9LN69 (Putative pentatricopeptide repeat-containing protein At1g19290 OS=Arabidopsis thaliana OX=3702 GN=At1g19290 PE=3 SV=2)

HSP 1 Score: 314.7 bits (805), Expect = 3.9e-84
Identity = 216/780 (27.69%), Positives = 371/780 (47.56%), Query Frame = 0

Query: 77  DQLFFSV--GLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLPPARLVLIR 136
           D+L  S+   L+ NP  CL  F  AS   KFR   ++YC ++ +L  ++     +  L  
Sbjct: 70  DELLNSILRRLRLNPEACLEIFNLASKQQKFRPDYKAYCKMVHILSRARNYQQTKSYLCE 129

Query: 137 LIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWT-QAFDLLIHVYSTQ--FKNLGFS 196
           L+      LN     +  E       L  V   F ++   FD+++ VY+ +   KN    
Sbjct: 130 LV-----ALNHSGFVVWGE-------LVRVFKEFSFSPTVFDMILKVYAEKGLVKN---- 189

Query: 197 CAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVM-SQGVRPDVFLFTNVI 256
            A+ VF      G  PSL +C+ LLS+LV+  E      +++ M S  V PDVF  + V+
Sbjct: 190 -ALHVFDNMGNYGRIPSLLSCNSLLSNLVRKGENFVALHVYDQMISFEVSPDVFTCSIVV 249

Query: 257 NALCKGGKMEKAIELFMKME-KLGVSPNDVTYNIIIHGLCQNGRIDTAFELKEKMTIKGV 316
           NA C+ G ++KA+    + E  LG+  N VTYN +I+G    G ++    +   M+ +GV
Sbjct: 250 NAYCRSGNVDKAMVFAKETESSLGLELNVVTYNSLINGYAMIGDVEGMTRVLRLMSERGV 309

Query: 317 KPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCKMGNINEALR 376
             +++TY  LI G  K+   ++  HV + + +   V +  +Y  L+DGYC+ G I +A+R
Sbjct: 310 SRNVVTYTSLIKGYCKKGLMEEAEHVFELLKEKKLVADQHMYGVLMDGYCRTGQIRDAVR 369

Query: 377 IKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVTCYSVVHWLC 436
           + D MI   +   +   ++L+ G+CKS Q+ +AE     +    L  +  T  ++V   C
Sbjct: 370 VHDNMIEIGVRTNTTICNSLINGYCKSGQLVEAEQIFSRMNDWSLKPDHHTYNTLVDGYC 429

Query: 437 KKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLLEKGSPASTV 496
           +      A +   ++  K   PT     IL++G  + G   +   LW  +L++G  A  +
Sbjct: 430 RAGYVDEALKLCDQMCQKEVVPTVMTYNILLKGYSRIGAFHDVLSLWKMMLKRGVNADEI 489

Query: 497 TSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFC--------------- 556
           + + L+  L   G+   A+++ + +L RG   D IT N +I G C               
Sbjct: 490 SCSTLLEALFKLGDFNEAMKLWENVLARGLLTDTITLNVMISGLCKMEKVNEAKEILDNV 549

Query: 557 --------------------KVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKL 616
                               KVG ++E F +KE M ++GI P I  YN L+ G      L
Sbjct: 550 NIFRCKPAVQTYQALSHGYYKVGNLKEAFAVKEYMERKGIFPTIEMYNTLISGAFKYRHL 609

Query: 617 DYAIKLWDEYKASGLVSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVI 676
           +    L  E +A GL   V TYG ++ G+C    ++       E++ K I LN  + + I
Sbjct: 610 NKVADLVIELRARGLTPTVATYGALITGWCNIGMIDKAYATCFEMIEKGITLNVNICSKI 669

Query: 677 IRANCQNGNV-AAALQHRDDMKSKGILPTCATYSSLIHGMCSIGLVED--AKHLIDEMRE 736
             +  +   +  A L  +  +    +LP   +    +    +  L     A+ + +   +
Sbjct: 670 ANSLFRLDKIDEACLLLQKIVDFDLLLPGYQSLKEFLEASATTCLKTQKIAESVENSTPK 729

Query: 737 EGLLPNVVCYTALIGGYCKLGQMDAAEATWLE-MTSFNIAPNKFTYTVMIDGYCKLGNME 796
           + L+PN + Y   I G CK G+++ A   + + ++S    P+++TYT++I G    G++ 
Sbjct: 730 KLLVPNNIVYNVAIAGLCKAGKLEDARKLFSDLLSSDRFIPDEYTYTILIHGCAIAGDIN 789

Query: 797 EANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKAFEVCNQMATGGLSLDEITYTTLL 811
           +A  L  +M   GI+P++VTYNAL  G CK  N+D+A  + +++   G++ + ITY TL+
Sbjct: 790 KAFTLRDEMALKGIIPNIVTYNALIKGLCKLGNVDRAQRLLHKLPQKGITPNAITYNTLI 832

BLAST of Clc07G10870 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 1.9e-83
Identity = 164/558 (29.39%), Positives = 286/558 (51.25%), Query Frame = 0

Query: 258 KMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFELKEKMTIKGVKPSLITYC 317
           K+  AI+LF  M +    P  + +N +   + +  + D      + M + G++  + T  
Sbjct: 50  KVNDAIDLFESMIQSRPLPTPIDFNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYTMT 109

Query: 318 VLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCKMGNINEALRIKDLMIYK 377
           ++IN   +++       VL      G+ P+   ++ L++G+C  G ++EA+ + D M+  
Sbjct: 110 IMINCYCRKKKLLFAFSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEM 169

Query: 378 NITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVTCYSVVHWLCKKSRFHSA 437
              P  VT+ TL+ G C   ++ +A   ++ ++ +G   + VT   V++ LCK      A
Sbjct: 170 KQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALA 229

Query: 438 FRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLLEKGSPASTVTSNALIHG 497
               +++  +N K +    +I++  LCKDG   +A  L+  +  KG  A  VT ++LI G
Sbjct: 230 LDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGG 289

Query: 498 LCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVEECFRLKEEMTKQGIQPD 557
           LC  G      ++++EM+ R    D +T++ LI  F K GK+ E   L  EM  +GI PD
Sbjct: 290 LCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPD 349

Query: 558 IYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMMDGYCRANRVEDVEKLFN 617
             TYN L+ G C    L  A +++D   + G   ++ TY ++++ YC+A RV+D  +LF 
Sbjct: 350 TITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFR 409

Query: 618 ELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGILPTCATYSSLIHGMCSIG 677
           E+ +K +  NTI YN ++   CQ+G + AA +   +M S+G+ P+  TY  L+ G+C  G
Sbjct: 410 EISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNG 469

Query: 678 LVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEATWLEMTSFNIAPNKFTYT 737
            +  A  + ++M++  +   +  Y  +I G C   ++D A + +  ++   + P+  TY 
Sbjct: 470 ELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYN 529

Query: 738 VMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKAFEVCNQMATG 797
           VMI G CK G++ EA+ L  KMKE G  PD  TYN L      G  +  + E+  +M   
Sbjct: 530 VMIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEMKVC 589

Query: 798 GLSLDEITYTTLLHAKTD 815
           G S D  T   ++   +D
Sbjct: 590 GFSADSSTIKMVIDMLSD 607

BLAST of Clc07G10870 vs. ExPASy TrEMBL
Match: A0A6J1J2L6 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111482844 PE=4 SV=1)

HSP 1 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 701/811 (86.44%), Positives = 751/811 (92.60%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKT+PV+FPFS ++ACV ST+PH++HHQDPPWQLQDQLLY VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTLPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NL FS AVDVFYLFARKGIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLCFSYAVDVFYLFARKGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLG+SPN VTYN +IHGLCQNGR+  AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSVIHGLCQNGRLGDAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GVKPSLITY VLINGL K E FDK N VL+EMVDAGFVPN  VYN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MG INEAL+I+D+M+ KNITPTSVT +TL+QGFCKSNQIEQA+N LEEILS G  INPVT
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTFYTLLQGFCKSNQIEQAQNTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSV+HWLC K RFH A RFT  +L KNF+P+DQLLTILV GLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKFRFHYALRFTTVMLLKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAST TSNALIHGLCGAG +  AVRI+KEMLERGFS+DRITYNTLILG CK GKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYT NFLL+GLCNAGKLD AIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNFLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           D YC+ANR+EDVEKLFNELV KK+ELN+IVYN+ IRA+C+NGNVAAALQ RDDMKSKGI 
Sbjct: 601 DAYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           PTCATYSSLIHGMC+IG VEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PTCATYSSLIHGMCNIGRVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEMTS NI+PNK TYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSLNISPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 811
           GK+MDKAF+ C++MATGGLSLDEITYTTL+H
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVH 811

BLAST of Clc07G10870 vs. ExPASy TrEMBL
Match: A0A6J1ETG9 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111437580 PE=4 SV=1)

HSP 1 Score: 1436.8 bits (3718), Expect = 0.0e+00
Identity = 700/811 (86.31%), Positives = 747/811 (92.11%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFS ++ACV ST+PH++HHQDPPWQLQDQLLY VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFD LFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDHLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NLGFS AVDVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLG+SPN VTYN IIHGLCQNGR+  AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GVKPSLITY VLINGL K E FDK N VL+EMV AGFVPN  VYN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVGAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MG INEAL+I+D+M+ KNITPTSVTL+TL+QGFCKSNQIEQAEN LEEILS G  INPVT
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKSNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSV+HWLC KSRFH A RFT  +LSKNF+P+DQLLTILV GLCKDGKHLEATELWFRL 
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLF 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAST TSNALIHGLCGAG +  AVRI+KEMLERGFS+DRITYNT ILG CK GKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTFILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYT N LL+GLCNAGKLD AIKLW E+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWGEFKASGLISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           D YC+ANR+EDVEKLFNELV KK+ELN+IVYN+ IRA+C+NGNVAAALQ RDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           PTCATYSSLIHGMC+IGLVEDAK LIDEMREEGLLPNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PTCATYSSLIHGMCNIGLVEDAKRLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           +LEMTS NI PNK TYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK
Sbjct: 721 FLEMTSLNIRPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 811
           GK+MDKAF+ C++MATGGLSLDEITYTTL+H
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVH 811

BLAST of Clc07G10870 vs. ExPASy TrEMBL
Match: A0A1S4DYY2 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493057 PE=4 SV=1)

HSP 1 Score: 1428.7 bits (3697), Expect = 0.0e+00
Identity = 689/811 (84.96%), Positives = 747/811 (92.11%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKT+PVLFPFS RLACV STQPH++HHQDPPWQ QDQL  WVSS+LSNSSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKC ALLPHLSPFQFDQLFFS+GLKANP TCLNFFYFASDSFKFRFTI SYCILILLLV
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDGNLPVLNSD  K HIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NLGF CA+DVFYL ARKG FPSLKTC+FLLSSLVKANE EKCCE+F+VMS+GV 
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVF FTNVINALCKGGKMEKA ELFMKMEKLG+SPN VTYN II+GLCQNGR+D AFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNCIINGLCQNGRLDHAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GV+P+L TY  L+NGLIK + FDKVNH+LDEM+ AGF PN  V+NNLIDGYCK
Sbjct: 301 KEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAGFYPNVVVFNNLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MGNI EALRIKD+MI KNITPTSVTL+TL+QGFCKS+QIEQAENALEEILS+GLSI+P  
Sbjct: 361 MGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAENALEEILSNGLSIHPDK 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKK R+HSAFRFTK +LS+NF+P+D LLTILV GLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAS VTSNALIHGLC AGNLP A RIVKEMLERG  +DRITYN LILGFCK GKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDRITYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
            CFRLKEEMTK+GIQPDIYTYNFLL GLCNAGKLD AIKLWDE+KASG +SNVHTYGVMM
Sbjct: 541 GCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDEFKASGPISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           DGYC+ANR+EDVE LFNEL++KK+ELN+IVYN+IIRA+CQNGNVAAALQ R++MKSKGIL
Sbjct: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGNVAAALQLRENMKSKGIL 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           P CATYSSLIHGMC IGLVEDAKHLIDEMR+EG +PNVVCYTALIGGYCKLGQMD AE+T
Sbjct: 661 PNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEM SFNI PNKFTYTVMIDGYCKLGNME+A NLL+KMKESGIVPDVVTYN LTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAYNLLTKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 811
             +MD AF+VC+QMAT GLS+DEITYTTL+H
Sbjct: 781 ANDMDNAFKVCDQMATEGLSVDEITYTTLVH 811

BLAST of Clc07G10870 vs. ExPASy TrEMBL
Match: A0A5D3CYQ1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G002960 PE=4 SV=1)

HSP 1 Score: 1428.7 bits (3697), Expect = 0.0e+00
Identity = 689/811 (84.96%), Positives = 747/811 (92.11%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKT+PVLFPFS RLACV STQPH++HHQDPPWQ QDQL  WVSS+LSNSSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKC ALLPHLSPFQFDQLFFS+GLKANP TCLNFFYFASDSFKFRFTI SYCILILLLV
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDGNLPVLNSD  K HIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NLGF CA+DVFYL ARKG FPSLKTC+FLLSSLVKANE EKCCE+F+VMS+GV 
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVF FTNVINALCKGGKMEKA ELFMKMEKLG+SPN VTYN II+GLCQNGR+D AFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNCIINGLCQNGRLDHAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GV+P+L TY  L+NGLIK + FDKVNH+LDEM+ AGF PN  V+NNLIDGYCK
Sbjct: 301 KEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAGFYPNVVVFNNLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MGNI EALRIKD+MI KNITPTSVTL+TL+QGFCKS+QIEQAENALEEILS+GLSI+P  
Sbjct: 361 MGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAENALEEILSNGLSIHPDK 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKK R+HSAFRFTK +LS+NF+P+D LLTILV GLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAS VTSNALIHGLC AGNLP A RIVKEMLERG  +DRITYN LILGFCK GKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDRITYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
            CFRLKEEMTK+GIQPDIYTYNFLL GLCNAGKLD AIKLWDE+KASG +SNVHTYGVMM
Sbjct: 541 GCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDEFKASGPISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           DGYC+ANR+EDVE LFNEL++KK+ELN+IVYN+IIRA+CQNGNVAAALQ R++MKSKGIL
Sbjct: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGNVAAALQLRENMKSKGIL 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           P CATYSSLIHGMC IGLVEDAKHLIDEMR+EG +PNVVCYTALIGGYCKLGQMD AE+T
Sbjct: 661 PNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEM SFNI PNKFTYTVMIDGYCKLGNME+A NLL+KMKESGIVPDVVTYN LTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAYNLLTKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 811
             +MD AF+VC+QMAT GLS+DEITYTTL+H
Sbjct: 781 ANDMDNAFKVCDQMATEGLSVDEITYTTLVH 811

BLAST of Clc07G10870 vs. ExPASy TrEMBL
Match: A0A5A7TTX4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G002890 PE=4 SV=1)

HSP 1 Score: 1424.1 bits (3685), Expect = 0.0e+00
Identity = 688/811 (84.83%), Positives = 746/811 (91.99%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSCRLACVFSTQPHEQHHQDPPWQLQDQLLYWVSSILSNSSLD 60
           MHLTRFKINKT+PVLFPFS RLACV STQPH++HHQDPPWQ QDQL  WVSS+LSNSSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKC ALLPHLSPFQFDQLFFS+GLKANP TCLNFFYFASDSFKFRFTI SYCILILLLV
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 HSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFLPPARL+LIRLIDGNLPVLNSD  K HIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVMSQGVR 240
           YSTQF+NLGF CA+DVFYL ARKG FPSLKTC+FLLSSLVKANE EKCCE+F+VMS+GV 
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFEL 300
           PDVF FTNVINALCKGGKMEKA ELFMKMEKLG+SPN VTYN II+GLCQNGR+D AFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNCIINGLCQNGRLDHAFEL 300

Query: 301 KEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCK 360
           KEKMTI+GV+P+L TY  L+NGLIK + FDKVNH+LDEM+ AGF PN  V+NNLIDGYCK
Sbjct: 301 KEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAGFYPNVVVFNNLIDGYCK 360

Query: 361 MGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVT 420
           MGNI EALRIKD+MI KNITPTSVTL+TL+QGFCKS+QIEQAENALEEILS+GLSI+P  
Sbjct: 361 MGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAENALEEILSNGLSIHPDK 420

Query: 421 CYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKK R+HSAFRFTK +LS+NF+P+D LLTILV GLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVE 540
           EKGSPAS VTSNALIHGLC AGNLP A RIVKEMLERG  +DRITYN LILGFCK GKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDRITYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMM 600
            CFRLKEEMTK+GIQPDIYTYNFLL GLCNAGKLD AIKLWDE+KASG +SNVHTYGVMM
Sbjct: 541 GCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDEFKASGPISNVHTYGVMM 600

Query: 601 DGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGIL 660
           DGYC+ANR+EDVE LFNEL++KK+ELN+IVYN+IIRA+CQNGNVAAALQ R++MKSKGIL
Sbjct: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGNVAAALQLRENMKSKGIL 660

Query: 661 PTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDAAEAT 720
           P CATYSSLIHGMC IGLVEDAKHLIDEMR+EG +PNVVCYTALIGGYCKLGQMD AE+T
Sbjct: 661 PNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780
           WLEM SFNI PNKFTYTVMIDGY KLGNME+A NLL+KMKESGIVPDVVTYN LTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYGKLGNMEKAYNLLTKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 GKNMDKAFEVCNQMATGGLSLDEITYTTLLH 812
             +MD AF+VC+QMAT GLS+DEITYTTL+H
Sbjct: 781 ANDMDNAFKVCDQMATEGLSVDEITYTTLVH 811

BLAST of Clc07G10870 vs. TAIR 10
Match: AT4G19440.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 736.9 bits (1901), Expect = 2.3e-212
Identity = 375/740 (50.68%), Positives = 504/740 (68.11%), Query Frame = 0

Query: 50  VSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTI 109
           +SS+LS  SLD  +C+ L+  LSP +FD+LF     K NP T L+FF  ASDSF F F++
Sbjct: 67  LSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDSFSFSFSL 126

Query: 110 RSYCILILLLVHSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFE 169
           RSYC+LI LL+ +  L  AR+VLIRLI+GN+PVL        + IA+A+  L+       
Sbjct: 127 RSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEI 186

Query: 170 WTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCC 229
             +  DLLI VY TQFK  G   A+DVF + A KG+FPS  TC+ LL+SLV+ANE +KCC
Sbjct: 187 RRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCC 246

Query: 230 EIFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLC 289
           E F+V+ +GV PDV+LFT  INA CKGGK+E+A++LF KME+ GV+PN VT+N +I GL 
Sbjct: 247 EAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDGLG 306

Query: 290 QNGRIDTAFELKEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDA 349
             GR D AF  KEKM  +G++P+LITY +L+ GL + +       VL EM   GF PN  
Sbjct: 307 MCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKRIGDAYFVLKEMTKKGFPPNVI 366

Query: 350 VYNNLIDGYCKMGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEI 409
           VYNNLID + + G++N+A+ IKDLM+ K ++ TS T +TL++G+CK+ Q + AE  L+E+
Sbjct: 367 VYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNTLIKGYCKNGQADNAERLLKEM 426

Query: 410 LSHGLSINPVTCYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKH 469
           LS G ++N  +  SV+  LC    F SA RF  E+L +N  P   LLT L+ GLCK GKH
Sbjct: 427 LSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRNMSPGGGLLTTLISGLCKHGKH 486

Query: 470 LEATELWFRLLEKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTL 529
            +A ELWF+ L KG    T TSNAL+HGLC AG L  A RI KE+L RG  MDR++YNTL
Sbjct: 487 SKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAFRIQKEILGRGCVMDRVSYNTL 546

Query: 530 ILGFCKVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGL 589
           I G C   K++E F   +EM K+G++PD YTY+ L+ GL N  K++ AI+ WD+ K +G+
Sbjct: 547 ISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGLFNMNKVEEAIQFWDDCKRNGM 606

Query: 590 VSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQ 649
           + +V+TY VM+DG C+A R E+ ++ F+E+++K ++ NT+VYN +IRA C++G ++ AL+
Sbjct: 607 LPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNTVVYNHLIRAYCRSGRLSMALE 666

Query: 650 HRDDMKSKGILPTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYC 709
            R+DMK KGI P  ATY+SLI GM  I  VE+AK L +EMR EGL PNV  YTALI GY 
Sbjct: 667 LREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEEMRMEGLEPNVFHYTALIDGYG 726

Query: 710 KLGQMDAAEATWLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVV 769
           KLGQM   E    EM S N+ PNK TYTVMI GY + GN+ EA+ LL++M+E GIVPD +
Sbjct: 727 KLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGNVTEASRLLNEMREKGIVPDSI 786

Query: 770 TYNALTNGFCKGKNMDKAFE 790
           TY     G+ K   + +AF+
Sbjct: 787 TYKEFIYGYLKQGGVLEAFK 806

BLAST of Clc07G10870 vs. TAIR 10
Match: AT4G19440.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 736.9 bits (1901), Expect = 2.3e-212
Identity = 375/740 (50.68%), Positives = 504/740 (68.11%), Query Frame = 0

Query: 50  VSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKANPHTCLNFFYFASDSFKFRFTI 109
           +SS+LS  SLD  +C+ L+  LSP +FD+LF     K NP T L+FF  ASDSF F F++
Sbjct: 67  LSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDSFSFSFSL 126

Query: 110 RSYCILILLLVHSKFLPPARLVLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFE 169
           RSYC+LI LL+ +  L  AR+VLIRLI+GN+PVL        + IA+A+  L+       
Sbjct: 127 RSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEI 186

Query: 170 WTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCC 229
             +  DLLI VY TQFK  G   A+DVF + A KG+FPS  TC+ LL+SLV+ANE +KCC
Sbjct: 187 RRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCC 246

Query: 230 EIFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNIIIHGLC 289
           E F+V+ +GV PDV+LFT  INA CKGGK+E+A++LF KME+ GV+PN VT+N +I GL 
Sbjct: 247 EAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDGLG 306

Query: 290 QNGRIDTAFELKEKMTIKGVKPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDA 349
             GR D AF  KEKM  +G++P+LITY +L+ GL + +       VL EM   GF PN  
Sbjct: 307 MCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKRIGDAYFVLKEMTKKGFPPNVI 366

Query: 350 VYNNLIDGYCKMGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEI 409
           VYNNLID + + G++N+A+ IKDLM+ K ++ TS T +TL++G+CK+ Q + AE  L+E+
Sbjct: 367 VYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNTLIKGYCKNGQADNAERLLKEM 426

Query: 410 LSHGLSINPVTCYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKH 469
           LS G ++N  +  SV+  LC    F SA RF  E+L +N  P   LLT L+ GLCK GKH
Sbjct: 427 LSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRNMSPGGGLLTTLISGLCKHGKH 486

Query: 470 LEATELWFRLLEKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTL 529
            +A ELWF+ L KG    T TSNAL+HGLC AG L  A RI KE+L RG  MDR++YNTL
Sbjct: 487 SKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAFRIQKEILGRGCVMDRVSYNTL 546

Query: 530 ILGFCKVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGL 589
           I G C   K++E F   +EM K+G++PD YTY+ L+ GL N  K++ AI+ WD+ K +G+
Sbjct: 547 ISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGLFNMNKVEEAIQFWDDCKRNGM 606

Query: 590 VSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQ 649
           + +V+TY VM+DG C+A R E+ ++ F+E+++K ++ NT+VYN +IRA C++G ++ AL+
Sbjct: 607 LPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNTVVYNHLIRAYCRSGRLSMALE 666

Query: 650 HRDDMKSKGILPTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALIGGYC 709
            R+DMK KGI P  ATY+SLI GM  I  VE+AK L +EMR EGL PNV  YTALI GY 
Sbjct: 667 LREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEEMRMEGLEPNVFHYTALIDGYG 726

Query: 710 KLGQMDAAEATWLEMTSFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVV 769
           KLGQM   E    EM S N+ PNK TYTVMI GY + GN+ EA+ LL++M+E GIVPD +
Sbjct: 727 KLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGNVTEASRLLNEMREKGIVPDSI 786

Query: 770 TYNALTNGFCKGKNMDKAFE 790
           TY     G+ K   + +AF+
Sbjct: 787 TYKEFIYGYLKQGGVLEAFK 806

BLAST of Clc07G10870 vs. TAIR 10
Match: AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 365.5 bits (937), Expect = 1.4e-100
Identity = 238/801 (29.71%), Positives = 378/801 (47.19%), Query Frame = 0

Query: 83  VGLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLPPARLVLIRLIDGNLPV 142
           +G   +P   L FF F      F  +  S+CILI  LV +    PA  +L  L+   L  
Sbjct: 78  IGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLL---LRA 137

Query: 143 LNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFAR 202
           L         ++ N LF       +   + +FDLLI  Y    + L     V VF +   
Sbjct: 138 LKPS------DVFNVLFSCYEKC-KLSSSSSFDLLIQHYVRSRRVLD---GVLVFKMMIT 197

Query: 203 K-GIFPSLKTCSFLLSSLVKANELEKCCEIF-EVMSQGVRPDVFLFTNVINALCKGGKME 262
           K  + P ++T S LL  LVK        E+F +++S G+RPDV+++T VI +LC+   + 
Sbjct: 198 KVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLS 257

Query: 263 KAIELFMKMEKLGVSPNDVTYNIIIHGLCQNGRIDTAFELKEKMTIKGVKPSLITYCVLI 322
           +A E+   ME  G   N V YN++I GLC+  ++  A  +K+ +  K +KP ++TYC L+
Sbjct: 258 RAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLV 317

Query: 323 NGLIKREHFDKVNHVLDEM-----------------------------------VDAGFV 382
            GL K + F+    ++DEM                                   VD G  
Sbjct: 318 YGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVS 377

Query: 383 PNDAVYNNLIDGYCKMGNINEALRIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENA 442
           PN  VYN LID  CK    +EA  + D M    + P  VT   L+  FC+  +++ A + 
Sbjct: 378 PNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSF 437

Query: 443 LEEILSHGLSINPVTCYSVVHWLCKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCK 502
           L E++  GL ++     S+++  CK     +A  F  E+++K  +PT    T L+ G C 
Sbjct: 438 LGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCS 497

Query: 503 DGKHLEATELWFRLLEKGSPASTVTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRIT 562
            GK  +A  L+  +  KG   S  T   L+ GL  AG +  AV++  EM E     +R+T
Sbjct: 498 KGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVT 557

Query: 563 YNTLILGFCKVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYK 622
           YN +I G+C+ G + + F   +EMT++GI PD Y+Y  L+HGLC  G+   A    D   
Sbjct: 558 YNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLH 617

Query: 623 ASGLVSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVA 682
                 N   Y  ++ G+CR  ++E+   +  E+V + ++L+ + Y V+I  + ++ +  
Sbjct: 618 KGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRK 677

Query: 683 AALQHRDDMKSKGILPTCATYSSLIHGMCSIGLVEDAKHLIDEMREEGLLPNVVCYTALI 742
                  +M  +G+ P    Y+S+I      G  ++A  + D M  EG +PN V YTA+I
Sbjct: 678 LFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVI 737

Query: 743 GGYCKLGQMDAAEATWLEMTSFNIAPNKF------------------------------- 802
            G CK G ++ AE    +M   +  PN+                                
Sbjct: 738 NGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLL 797

Query: 803 ----TYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKAFEV 812
               TY ++I G+C+ G +EEA+ L+++M   G+ PD +TY  + N  C+  ++ KA E+
Sbjct: 798 ANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIEL 857

BLAST of Clc07G10870 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 347.4 bits (890), Expect = 3.9e-95
Identity = 218/743 (29.34%), Positives = 349/743 (46.97%), Query Frame = 0

Query: 109 IRSYCILILLLVHSKFLPPARLVL--IRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVG 168
           ++  CI   +LV ++   PAR +L  + L+ G                ++ +FG      
Sbjct: 112 VQLVCITTHILVRARMYDPARHILKELSLMSGK---------------SSFVFGALMTTY 171

Query: 169 RF--EWTQAFDLLIHVYSTQFKNLGFSCAVDVFYLFARKGIFPSLKTCSFLLSSLVKANE 228
           R        +D+LI VY    +      ++++F L    G  PS+ TC+ +L S+VK+ E
Sbjct: 172 RLCNSNPSVYDILIRVY---LREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGE 231

Query: 229 -LEKCCEIFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGVSPNDVTYNI 288
            +     + E++ + + PDV  F  +IN LC  G  EK+  L  KMEK G +P  VTYN 
Sbjct: 232 DVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNT 291

Query: 289 IIHGLCQNGRIDTAFELKEKMTIKGV---------------------------------- 348
           ++H  C+ GR   A EL + M  KGV                                  
Sbjct: 292 VLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRM 351

Query: 349 -KPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCKMGNINEAL 408
             P+ +TY  LING          + +L+EM+  G  PN   +N LIDG+   GN  EAL
Sbjct: 352 IHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEAL 411

Query: 409 RIKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVTCYSVVHWL 468
           ++  +M  K +TP+ V+   L+ G CK+ + + A      +  +G+ +  +T   ++  L
Sbjct: 412 KMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGL 471

Query: 469 CKKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLLEKGSPAST 528
           CK      A     E+      P     + L+ G CK G+   A E+  R+   G   + 
Sbjct: 472 CKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNG 531

Query: 529 VTSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFCKVGKVEECFRLKEE 588
           +  + LI+  C  G L  A+RI + M+  G + D  T+N L+   CK GKV E       
Sbjct: 532 IIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRC 591

Query: 589 MTKQGIQPDIYTYNFLLHGLCNAGKLDYAIKLWDEYKASGLVSNVHTYGVMMDGYCRANR 648
           MT  GI P+  +++ L++G  N+G+   A  ++DE    G      TYG ++ G C+   
Sbjct: 592 MTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGH 651

Query: 649 VEDVEKLFNELVAKKIELNTIVYNVIIRANCQNGNVAAALQHRDDMKSKGILPTCATYSS 708
           + + EK    L A    ++T++YN ++ A C++GN+A A+    +M  + ILP   TY+S
Sbjct: 652 LREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTS 711

Query: 709 LIHGMCSIGLVEDAKHLIDEMREEG-LLPNVVCYTALIGGYCKLGQMDAAEATWLEMTSF 768
           LI G+C  G    A     E    G +LPN V YT  + G  K GQ  A      +M + 
Sbjct: 712 LISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNL 771

Query: 769 NIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKA 811
              P+  T   MIDGY ++G +E+ N+LL +M      P++ TYN L +G+ K K++  +
Sbjct: 772 GHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTS 831

BLAST of Clc07G10870 vs. TAIR 10
Match: AT1G19290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 314.7 bits (805), Expect = 2.8e-85
Identity = 216/780 (27.69%), Positives = 371/780 (47.56%), Query Frame = 0

Query: 77  DQLFFSV--GLKANPHTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLPPARLVLIR 136
           D+L  S+   L+ NP  CL  F  AS   KFR   ++YC ++ +L  ++     +  L  
Sbjct: 70  DELLNSILRRLRLNPEACLEIFNLASKQQKFRPDYKAYCKMVHILSRARNYQQTKSYLCE 129

Query: 137 LIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWT-QAFDLLIHVYSTQ--FKNLGFS 196
           L+      LN     +  E       L  V   F ++   FD+++ VY+ +   KN    
Sbjct: 130 LV-----ALNHSGFVVWGE-------LVRVFKEFSFSPTVFDMILKVYAEKGLVKN---- 189

Query: 197 CAVDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCEIFEVM-SQGVRPDVFLFTNVI 256
            A+ VF      G  PSL +C+ LLS+LV+  E      +++ M S  V PDVF  + V+
Sbjct: 190 -ALHVFDNMGNYGRIPSLLSCNSLLSNLVRKGENFVALHVYDQMISFEVSPDVFTCSIVV 249

Query: 257 NALCKGGKMEKAIELFMKME-KLGVSPNDVTYNIIIHGLCQNGRIDTAFELKEKMTIKGV 316
           NA C+ G ++KA+    + E  LG+  N VTYN +I+G    G ++    +   M+ +GV
Sbjct: 250 NAYCRSGNVDKAMVFAKETESSLGLELNVVTYNSLINGYAMIGDVEGMTRVLRLMSERGV 309

Query: 317 KPSLITYCVLINGLIKREHFDKVNHVLDEMVDAGFVPNDAVYNNLIDGYCKMGNINEALR 376
             +++TY  LI G  K+   ++  HV + + +   V +  +Y  L+DGYC+ G I +A+R
Sbjct: 310 SRNVVTYTSLIKGYCKKGLMEEAEHVFELLKEKKLVADQHMYGVLMDGYCRTGQIRDAVR 369

Query: 377 IKDLMIYKNITPTSVTLHTLMQGFCKSNQIEQAENALEEILSHGLSINPVTCYSVVHWLC 436
           + D MI   +   +   ++L+ G+CKS Q+ +AE     +    L  +  T  ++V   C
Sbjct: 370 VHDNMIEIGVRTNTTICNSLINGYCKSGQLVEAEQIFSRMNDWSLKPDHHTYNTLVDGYC 429

Query: 437 KKSRFHSAFRFTKEILSKNFKPTDQLLTILVRGLCKDGKHLEATELWFRLLEKGSPASTV 496
           +      A +   ++  K   PT     IL++G  + G   +   LW  +L++G  A  +
Sbjct: 430 RAGYVDEALKLCDQMCQKEVVPTVMTYNILLKGYSRIGAFHDVLSLWKMMLKRGVNADEI 489

Query: 497 TSNALIHGLCGAGNLPGAVRIVKEMLERGFSMDRITYNTLILGFC--------------- 556
           + + L+  L   G+   A+++ + +L RG   D IT N +I G C               
Sbjct: 490 SCSTLLEALFKLGDFNEAMKLWENVLARGLLTDTITLNVMISGLCKMEKVNEAKEILDNV 549

Query: 557 --------------------KVGKVEECFRLKEEMTKQGIQPDIYTYNFLLHGLCNAGKL 616
                               KVG ++E F +KE M ++GI P I  YN L+ G      L
Sbjct: 550 NIFRCKPAVQTYQALSHGYYKVGNLKEAFAVKEYMERKGIFPTIEMYNTLISGAFKYRHL 609

Query: 617 DYAIKLWDEYKASGLVSNVHTYGVMMDGYCRANRVEDVEKLFNELVAKKIELNTIVYNVI 676
           +    L  E +A GL   V TYG ++ G+C    ++       E++ K I LN  + + I
Sbjct: 610 NKVADLVIELRARGLTPTVATYGALITGWCNIGMIDKAYATCFEMIEKGITLNVNICSKI 669

Query: 677 IRANCQNGNV-AAALQHRDDMKSKGILPTCATYSSLIHGMCSIGLVED--AKHLIDEMRE 736
             +  +   +  A L  +  +    +LP   +    +    +  L     A+ + +   +
Sbjct: 670 ANSLFRLDKIDEACLLLQKIVDFDLLLPGYQSLKEFLEASATTCLKTQKIAESVENSTPK 729

Query: 737 EGLLPNVVCYTALIGGYCKLGQMDAAEATWLE-MTSFNIAPNKFTYTVMIDGYCKLGNME 796
           + L+PN + Y   I G CK G+++ A   + + ++S    P+++TYT++I G    G++ 
Sbjct: 730 KLLVPNNIVYNVAIAGLCKAGKLEDARKLFSDLLSSDRFIPDEYTYTILIHGCAIAGDIN 789

Query: 797 EANNLLSKMKESGIVPDVVTYNALTNGFCKGKNMDKAFEVCNQMATGGLSLDEITYTTLL 811
           +A  L  +M   GI+P++VTYNAL  G CK  N+D+A  + +++   G++ + ITY TL+
Sbjct: 790 KAFTLRDEMALKGIIPNIVTYNALIKGLCKLGNVDRAQRLLHKLPQKGITPNAITYNTLI 832
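
The percentages in each HSP summary line follow directly from the residue counts (for example, 375 identical positions over an aligned length of 740 gives 50.68%). The sketch below is one way to parse such a line and recompute the values as a sanity check. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern also tolerates the "Postives" spelling that appears in some exports of this page.

```python
import re

# Matches e.g. "Identity = 375/740 (50.68%), Positives = 504/740 (68.11%)".
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Hypothetical helper: extract counts and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positive, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "aligned_length": length,
        # Recomputed from the counts, rounded as in the report.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / pos_length, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 375/740 (50.68%), Positives = 504/740 (68.11%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

Run on the AT4G19440.1 summary line, the recomputed values agree with the reported 50.68% identity and 68.11% positives.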

The following BLAST results are available for this feature:
Match Name        E-value     Identity (%)  Description
XP_038884789.1    0.0e+00     87.79         pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa ...
XP_023552294.1    0.0e+00     86.68         pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...
KAG6577115.1      0.0e+00     86.68         Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
XP_022984601.1    0.0e+00     86.44         pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...
XP_022931380.1    0.0e+00     86.31         pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...

Match Name        E-value     Identity (%)  Description
Q940A6            3.2e-211    50.68         Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidop...
Q9FJE6            1.9e-99     29.71         Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
Q9LVQ5            5.4e-94     29.34         Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
Q9LN69            3.9e-84     27.69         Putative pentatricopeptide repeat-containing protein At1g19290 OS=Arabidopsis th...
Q6NQ83            1.9e-83     29.39         Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...

Match Name        E-value     Identity (%)  Description
A0A6J1J2L6        0.0e+00     86.44         pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cuc...
A0A6J1ETG9        0.0e+00     86.31         pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cuc...
A0A1S4DYY2        0.0e+00     84.96         pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 ...
A0A5D3CYQ1        0.0e+00     84.96         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5A7TTX4        0.0e+00     84.83         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name        E-value     Identity (%)  Description
AT4G19440.1       2.3e-212    50.68         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G19440.2       2.3e-212    50.68         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G59900.1       1.4e-100    29.71         Pentatricopeptide repeat (PPR) superfamily protein
AT5G55840.1       3.9e-95     29.34         Pentatricopeptide repeat (PPR) superfamily protein
AT1G19290.1       2.8e-85     27.69         Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description   Source   Source Term   Source Description
    Alignment (coord, e-value, score)

IPR002885   Pentatricopeptide repeat   TIGRFAM   TIGR00756   TIGR00756
    coord: 559..592   e-value: 2.5E-6    score: 25.3
    coord: 314..347   e-value: 2.7E-7    score: 28.3
    coord: 524..558   e-value: 2.8E-9    score: 34.6
    coord: 629..663   e-value: 9.4E-4    score: 17.2
    coord: 734..768   e-value: 3.1E-11   score: 40.7
    coord: 594..627   e-value: 2.3E-8    score: 31.7
    coord: 245..277   e-value: 3.7E-8    score: 31.0
    coord: 279..312   e-value: 9.8E-9    score: 32.9
    coord: 419..452   e-value: 0.0023    score: 16.0
    coord: 384..417   e-value: 7.1E-4    score: 17.6
    coord: 699..732   e-value: 2.1E-7    score: 28.7
    coord: 665..698   e-value: 2.4E-8    score: 31.7
    coord: 769..802   e-value: 6.8E-7    score: 27.1
    coord: 350..382   e-value: 2.3E-6    score: 25.4
    coord: 492..522   e-value: 1.0E-5    score: 23.4

IPR002885   Pentatricopeptide repeat   PFAM   PF01535   PPR
    coord: 457..483   e-value: 0.0055    score: 16.9
    coord: 489..519   e-value: 5.7E-5    score: 23.1
    coord: 314..344   e-value: 0.006     score: 16.7
    coord: 211..235   e-value: 0.61      score: 10.5

IPR002885   Pentatricopeptide repeat   PFAM   PF13041   PPR_2
    coord: 627..675   e-value: 2.6E-12   score: 46.7
    coord: 346..395   e-value: 2.1E-11   score: 43.8
    coord: 522..570   e-value: 5.3E-18   score: 65.0
    coord: 766..812   e-value: 1.6E-12   score: 47.5
    coord: 696..745   e-value: 2.2E-16   score: 59.8
    coord: 241..290   e-value: 2.5E-18   score: 66.0

IPR002885   Pentatricopeptide repeat   PFAM   PF12854   PPR_1
    coord: 588..618   e-value: 8.5E-7    score: 28.6

IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 557..591   score: 11.73961
    coord: 312..346   score: 10.676364
    coord: 347..381   score: 11.728648
    coord: 732..766   score: 14.282632
    coord: 277..311   score: 13.065928
    coord: 627..661   score: 11.049048
    coord: 592..626   score: 11.684803
    coord: 242..276   score: 12.868624
    coord: 382..416   score: 9.678885
    coord: 767..801   score: 11.312119
    coord: 522..556   score: 13.383805
    coord: 662..696   score: 12.112294
    coord: 487..521   score: 10.785976
    coord: 417..451   score: 9.174665
    coord: 697..731   score: 11.586152

IPR011990   Tetratricopeptide-like helical domain superfamily   GENE3D   1.25.40.10   Tetratricopeptide repeat domain
    coord: 654..727   e-value: 2.5E-20   score: 74.8
    coord: 305..373   e-value: 5.8E-17   score: 63.8
    coord: 586..653   e-value: 2.8E-16   score: 61.6
    coord: 374..444   e-value: 7.2E-11   score: 43.9
    coord: 516..585   e-value: 2.2E-22   score: 81.5
    coord: 445..515   e-value: 3.8E-11   score: 44.9
    coord: 190..304   e-value: 4.0E-29   score: 103.5
    coord: 728..823   e-value: 2.2E-29   score: 104.1

None   No IPR available   PANTHER   PTHR47938:SF7   REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
    coord: 28..807

None   No IPR available   PANTHER   PTHR47938   RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED
    coord: 28..807
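
The PROSITE PS51375 hits above are listed out of order, which hides how the repeats are arranged along the protein. The sketch below sorts the fifteen coordinate pairs and merges repeats that directly abut, as a rough way to see the tandem arrays. The coordinates are copied from the PROSITE rows of this record; the helper name `merge_adjacent` is an illustrative assumption and not part of InterProScan.

```python
# PROSITE PS51375 (PPR) coordinates, copied from the annotation above.
PPR_PROSITE = [
    (557, 591), (312, 346), (347, 381), (732, 766), (277, 311),
    (627, 661), (592, 626), (242, 276), (382, 416), (767, 801),
    (522, 556), (662, 696), (487, 521), (417, 451), (697, 731),
]


def merge_adjacent(intervals):
    """Hypothetical helper: merge 1-based inclusive intervals that
    overlap or directly abut (next start == previous end + 1)."""
    runs = []
    for start, end in sorted(intervals):
        if runs and start <= runs[-1][1] + 1:
            # Extend the current run to cover this repeat.
            runs[-1] = (runs[-1][0], max(runs[-1][1], end))
        else:
            runs.append((start, end))
    return runs


if __name__ == "__main__":
    # Every PROSITE repeat in this record spans 35 residues.
    lengths = {end - start + 1 for start, end in PPR_PROSITE}
    print(len(PPR_PROSITE), "repeats, lengths:", sorted(lengths))
    for start, end in merge_adjacent(PPR_PROSITE):
        print("tandem array %d..%d (%d aa)" % (start, end, end - start + 1))
```

For these coordinates the repeats fall into two uninterrupted arrays, 242..451 and 487..801, separated by a 35-residue stretch (452..486) with no PROSITE hit. The Pfam PF01535 hit at 457..483 lies inside that stretch, which may indicate a more divergent repeat there, though this record alone does not confirm it.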

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Clc07G10870.2    Clc07G10870.2    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
cellular_component    GO:0016021       integral component of membrane
molecular_function    GO:0005515       protein binding