CSPI01G25380 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G25380
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr1: 20852771 .. 20863514 (+)
RNA-Seq Expression: CSPI01G25380
Synteny: CSPI01G25380
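
The Location field packs chromosome, start, end, and strand into one string. As a minimal sketch, it can be split into its parts and the genomic span computed as below. The function name `parse_location` is hypothetical, and the span assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr1: 20852771 .. 20863514 (+)'.

    Hypothetical helper; the pattern only covers the layout seen in this record.
    """
    pattern = r"(\w+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr1: 20852771 .. 20863514 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 10744 bp on the forward strand of Chr1.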
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCAATATTTATAAATATCAGGCTTTAATATTTAAACAAAATGGAGTTCATCAAAGTCTATCATTGATTGATACAAGTGTAGTAGATGCATTTTGTTATTGTTTTACATAAAGTGAAAATGCATATAGTTAATTGTGTTTAGGCAAAATATTAGATTTAATAGTGTTGAGTAGACATAGATAAAGATCAATAGTAGGGAGGATACTATTTATCCAAATCCTCTTTTCTCTTTCCCCTCTCCGTCGGAGTCCCGTCGTTAGCCGCTGAAGGTCTACAAGCTCTCATCTACAATAGCGGCACGGAATTGCAGACGTCAGCTATCAACTCATTTTTTTCACGCATCTTTTGAACTTATAAATTCTCTCTCTACCCACCACCGCCCATCGCCGGAGAAGGTGTGAACGGTTCTATATGTAATCACAGCACGGAATTGAAGAGAGGTTAACCATTATCTTAGCTTTCGGCGGAGAACTTGGTCGGTGCAACTTGCTAAAATGCGATCTGACCCGCAAATAGAACGGGCACAGCCGCGAATAATGGTTTAATCGTATCGCGATATTCAGGGCCAGGTTGCTCCGCCGTCATTTTGAAAAATACTTCAATTGGTGACCTACACAGAACAGTTAGTTTGTTTTTTCTTCTCTCTTTTACTTTTTGTGTGGAGATATTTACTCTGTGTTTCTACTTCGTGTCTCAGACTTCTTTTGGATTATATTGGATTTATTCATGAAATTGGAGTTATTTTGAAATGGTGGGAGTTATAATGGCGAACCTAAATTTGTGCATCCCTAATTGTGAAAGATATGGATTTCCGACACTGCATTGTACCCATAATTCCCACAATTCTTTTTGGGTTTCGTTCTTTCCTAGTTCGGTTCCTGGAACTGACTTAAGTCTTAGTGACGCGAAGAATAGAGTTTTGAGACATAGGGTTCATAAATGTGGATCAATTAAGGCTTTGTCGAATGGAGAATCTGATATTTCATTGCCAAGTGGGAATCTCCTCGAACATGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAAGGTCATGGAGACTGTTAGAACTAGAAGGTATAAGAGGCAGTTGGATGATCCTAATAAACTGACAATGAAGGAAAATGGGAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAGATAGATAATGGAAAAAACAAAGTAACTGATGTTCAACATAACGTGGACGTAAAGAACATGTTTAAACGGGTGGATAAAAAAGATTTGTTCAATAATACAGAGAGAATTGCTCGTGAAAAGGATTTGTCAGGAAATAAATTTGATAGAAGGAAGGTAGTTACAAGATCAAATGATAAGGTTAAAGGCAAGATGACCCCTTTTGGCTCACTGGTTAATGATAAACAGCATGAAGAGAAAAGGAACGAAAACTGGTCAAGTTACATTGAGCCTAGAGTAACACGATCGAACAGCGAGAAACCAATTCATTTTAAAGCTAATATGTTGGAGGTCAAAAAAGAAAGCAGCCGTGTCTCTGATGGAAATTCCATGAAAACATCAGAAAAGATTTGGGCTTGGGGTGATGATGACGCTAAACCACCTAAGGGTGTTCTTAAGGCTGGGAAATATGGCATTCAGCTCGAAAGAAGCTATAATCCTGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGACATCCACAAGTGGTAAGCGTTTTCTTGAATTTAATGAAAAGAATAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTCGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGGTTTGGTGTCCCTTGCTCACTTGTCTGTCTTGCTGAAAGTTTAAATTTGACTTGGTACAAGTCTGAAATCAAAACTATTACCTTATTCGATACTTCTTCTTTGTTTGGGCGGGGGTCTGTGAAATTAATATTCTTAATTTGTTAAATTTGAATTTC
TTAACAACATTTCTACTAGGATTAGTATGGGTGGACAAGGTTGTGTACTAGTTTTTTAATGGTTTTAAATATTTCCTTTTCTTGGCCAGGATATTTTATTGTTAAGTTCTTCAGGGTTGAGTTCAAGTATTTCTGGGGCCCGATTTTTAGTGTTATGTTTATTGAGCTTCAAGTCGTTTCTTTCTAAGGAAATTTTTAATTTAGTTTGAAGAGTATTAACTTCAGCTGCACCACTTATTCAATTTCAGACAACAGAGTCCAATCTAAGAACTTTGAAGTCTACTGAGCTAAATATTTGAGTGGTAAATGTGAAAGTTAGAAATCTACTTGCATTTGGTTATAACTCAACTCCATCGAGATAATATATGTTATTGGGTTCAAATGATAGTCATGAACTGTTTGCTTGCCTGCTGTTTTATGTTTTCTTTTTATCACTCACACATGTATTGCAGGCTCTTGAGTTATTCATTGTATGGAGACTTCTTTCATAGTCATCTCCAACATCTTTTAGGATTTGAAATTTAATTACTAATCTGTACTCAACTGTGCTATGAGGGGATTCCAAGAACACTAATCAAATTTCAATTATTTGAACTTTGAGGATTATTAATTAGTCTCTCCATATTGATTTATTTGAAGGACTTCCTTGTCAACAGATTGAATGGTGCAGACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAATCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGGTTTTTCCCTGTTTCTCACCTTTACTACTTGATTAATGTAGTGAAAGTTCTAATAAAGTCCTTGGTTATGATGCCTTAAAAGAGTAAGAGTGGCCAGTGATGCCTTTCCTCGGTCTTGAAGTGCAATTTAGATTTTCTTTTTTATATTTGTAGTGTCTCAACCAACTTACATGCATTTTGGCTAATTTCACGAGACAATCCACCCGATCCTACAATATTTTGGTGTCAAGAAAACTCATAGAACATTAATTCTTTGGTAGGTGACCACTATATAATAAATCCACGACCTATTAGTTAGTTATTGAGACAATGTCTCCTTTTTACCACTAGGCCAACCTACGAGGGAGGATCAGTGTTTTGAACGGCGCACTTGGGCACACGCCTAAGTGCAAGGCTCAACGGTGGTGCCTCGCCTCAGAAAGTTGAGGTGCATGAAATAAGGCGCACGCCTTTCGGTGAATCACTTAAAATGTAAACTATTTTGCATTTTAGGGTTTCTGTTTGCCCATTTATTGTAAATATGTTTATACATATATAATTTTTAAACCTAATTTGGCATAATTTCTTAAAAAAATAGAACGTCTTTTCCTCTCCTTTCTCTCACCTTCTTCATTCTTCACTGTATCACCTTCTTCCTCTTCGTCTCCAATCTTCACTGAATCATCATTTACAATGGATATCAAAGATTTAAAAGACCATCAAAATTATCTGGCACTTGATTCTTCTCCAAAATACAAAGGAATGCCAACATTCTTCCTGACCTCTTGGTCCTCCCCTCTTCAATTCTTCTAACTAAATCTTTTTTTTTTTTTGTATAATCTTCTTTAAGTCCCATTTCTATCCATCTAAGTTTTGGTTGTGTAAATAAACATGTAAGTTTATAATGCCTAGGCTCCAGAGCCATTGCGCCTTGGGCATTTTAAGACACTGGGGAGGATAAAAAGGAAAGATAAAAGAGGAGAGAGAGGGGGGATATGGCCATCAGATTTTAAATTTAATCTCTTTTAGGAATATAAAGAAAGTGCTAAAGTATTGAATAATAAATTAAATTTAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCAACCCCCACACACCACACGATTAGGTACATTGCTGATTTAACATGGTATTAGAGCAAGAGGTCCCACTGTTCAAACTCCTAAAATATCATTTTTTCTTCAATTAATACTAATTTCTAATTTTCAAGCCCACAAGTGAAGGTGAGTATTAAAGTATTAATATAACTAAATTTATTTAAACCCATCAACTTAAGCTTTTTGAGTTTGTTGGGAATTTGTTAGATACCTAGATTAGTATACATGGTTTATCTTGTATAAGGGTAATTAGATTAGTGGGTGTAAGGGTAATTAGATACTTAGGAAGTTACTAGTAGTTATTGTGTAAGTGTGTTTACTAGTAGTTATTATTTAAGTGTGATTACTGGGGTTGTTACATCTTGTTATAAATGGAGGGAGGGTAAGTGAGAGGGACGTTACGGTGGAGTGATTTGGGGTTTGGGTGAGAGTACTCAAGAGGGAGGTTCTAGGTGCCTTATACTTGGGTTTATCTTGTATCTTCTTATAGTTCATTATAATAAATTAAGATCTTGTTAACAAGTATTCTTACAGAATTTAACAAAAAATCTTCATCCTAAGTTCCTAACATTTTCTAAAGAATCTTGCACCCACTAAAAAAAATTAATTAGAGAGAATTGAACCTTAATCTAATTATTTGTCCTCATCATTAATCAAATGAACTGGAAAGCTGCGATAATGCTGAAAAATAACTCAAATATAATAAAAACATCAGAAATCTTAAAATGGAAAAGATATTCACTTTCTTGCATTGCCCTGTCACCAAGTATGTTCTTTGACTAACAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGTAGGCAGTTAACTAACAACCTTCACTAGGATTTGCTGTCTTGGTCTGCGCACGTTTATGCAAGTGGATGCTGAAATTGACAGAATTTTTACTAACTCTGGTTGTGCCATCTCGGTTTCCCTTCAGGAACACTTTTCCTCATATCCTGACTTGGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCAATCTCCTCCAAAGAAGAAGTTTAAAACAGGGGTTCTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTGAGTCATAATTGGTAGATTATTTTATATTTTACAGGCAATATTTGGTTAATGTAATATACTGAGCAATTTTGTGGAAAAAAATGTTAAAGAGCAAAATAATCTTGTTATGGATATTTTCTGATGTTTTGCTCGTAATATATTTATTCTTAGATCTGAGAATTCAAATATGACAATCATCTAGATTTTTGTGAATGTTTTGTTATCATTGTTCCATGATTGACGAAAGATTTAGATTTTGTACACATGGTTGGATTTCATTATTTTTCTTTTCCTTTTCCTTTTGGATAAAAGGCACCACTTCTAATTTGACAAATGAAAGGAATATAAAAAGAGGCATAGGAAAAAACTAGGCTCTACAACAAAATTGGGCAACCAAAAACGAGAGAAACACACATTAAGAACACTAATTC
ATCAGAAAAAGGAGAAAAATCCGAAAACAAATTGCAAAAACAACCTAATAGGGTCGAGTGAAGCTGAGAGTTAGTGGCAATTTTTTTAAAAAAATTTGATTAATTAATTAATTAATATTATTATTGGTTTCTTCTGGTAGGAAATGTTGACATTTTTTTAGATGATATGAAATTACAGAGTTGATTCTTCAAAGCATTGTTTATTTTCTTTACGTTTTCACTTTCCATCTAACATACTGTTGCTATTTTCCTTTCTATTTTTTTCATAAAATTTTAAATTACTCAGTCATTAAGGCAAGTCTCCAGCATTGCTTCCTCCATTTGTTGCAAGGATATGTTTTTCTTCAAAACTTTTCTCATGTTTATTCATTATTACTATGAGATGCCAATAGAAACAACGTGATTTCCGAATTCTTTATGGAAATATATCTTGCATGGGTTTCTTTCAAGTTGTTCACTTGCAGTAGCCATTTTATTTCTTTCTTTCAGGTTTTAAATGCTTGTGTCAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAAGTCTACAGCCTTCAACCTCAACATATGGATTGGTCATGGAGGTAGTTGGTCCTTTAATTTCTTTCTATTGTTCATGTAATTTGCAAGTCTATTTTGAAATTCAGAATGATTTTTCTAATGCTCGTTGCTTGAGATGCTGTGAATGAGAAATGCTTGATTTTGTGCACCCAAGAAGGAATGAACACTCATTAAGTTGCGTTGAATTCACTCAGTAACGTGTTAAATATTATTCTTACCTTTTGATTAGTAGAAAGTAAATGGTTTTGGGTATTGCATGGACACAGAGTGGACATATTGGGTTCTGAGTCTTTGAAGAGAATTAGTCGTATTTGTAAAAGGTAGATTCTCTACCAAGTCCATGTTCCTAAACTTCACTAAGGAAGTTGCAAAAAATAATTATAGACAACCTTATGTGGTATTGGAAGCTTAAAATTTCCAAGAAAGTAAAGATTTTCCTTTGGTCACTTGCTTATAGAAGTCTAAATATTCACAAGAAGCTTCAGAGAACGTTCCCTAATTGGTCCCTCTCCTCCATTTGCTGTCTTTTTCTTAGGGAGATGGAAACTATAGGTCACTTGTTCTTGCATTCTGAGTTTACTTTTAGAGGTTGGCAAATTCTCTTTAGTACTTTTGTGGTGGCTAGTTGCCTTCCTAAAAAAATCGATGATTGAATGATGGAAGTTTCTGCAGAAAAGGGAAAATCCTTCGGAGAAGTGCTACTCAAGTGCTTTTGTGGTTTCTTTGGAAAAAAAAAGAGAATAATACATTGTTTGACGATAATTTTGTTTCTTTTGATTTTTTTGGGCTTTTGTTCAACGTGCAACCTCTTGGTGATGTTCAAGCTACACTAATTTTTTTTTTCCAAAGCATTTCTATAACTATTTGCTATGAAACATTAATCTTAGTTGGATCCCCTTCCTTTAGTGGGGCTCTTTTTTTGTTGCGCTTGTTATTTTTTTCTATTTCCTTGTATTCTTTCATTTTTCTCAATTAAAGTTGTTTCTATTAAAAAAAAAAAAGAAGTTAGACCAAATTCTTTTTTAATTATAGCCTTCTCATGATTATGAGCAATGGAGGGCTTTTCTCTATAGTTTTGTGGAGAGGTTTTCTCTACCCCTGCCTCTAGGCTGTTCTGGTAGCTCTTTTGATGAATATATATCTGTTTCTTATAAACAAAAAAGTCCTAACAGTGAATGGTAAGCCTAGAAAGAAAGAAGAAGATCTAGACCATCTTTTCAAGACATTGTTCAATAGGTATAACACCAATCTTTCGTGTTGTTTGCCTTATCCTTTCTCAATCAATCGCATATGAGTAAGTGCGTGCTTGGACTTGGTCAAGGATACATTTTATGGTTAGATTATGTAATCACTTATCACAAGCAGCTTTTGAACGATTACTTGCGTCTATCCTTCATAAAATACAAGTCACTTGATTGAC
GGTTATTCATGTGCACTGAAAATTATTGTTCACAAGTTCATGATTTTTTAAAAGAAATTTTAGTTCATGCACTGTTGAGAATGAATTAAGGTAACCCAATTTCTAAAACCTGTTATTTAATTATTTTCCCAAGTTCCTCTTTTCCCTGTGAAGCAAAAAAAGTTCGTTCTTTAAGTCGACTCTTAAACTGCCCCTAGATTTTCAATTTAGGTGAAGAAAAATAATTTGATCGAATGTTCTTCTACTATTAAAATTTGAAAGTCATGCTTCCCTGACTTAATTTATTTGATTGTGCTGAAGTTACATTTTCATATGCAAGTAAGCCTATTGAAATTTTCTATTTACTGAATGAAACAGTTGAAATATTCTATTTACTGAACGAAACATTTTTCAATGAATGATAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAATTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGGTAGACTCGGTAGTCACAGTGTGTTTATTTGTTTCTTGTTATATATTTGCTTAATGGCTTGTCAAATTTCCAGTTCTTGTTAATACACTTTGGAAAGAAGGAAAAACAGATGAGGCTGTGCTGGCCATTGAGAACATGGAAATACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAGGCCCTGATGCAGGTATTTCATAGTAAATTTTTGTTTTTTCTTTCAGCCTTTTATTTGTTTTTCCCTTTCTCAACCAACTTATATTTTATTTTTTCTCTTATGTGGATATTTTATATATTACTATTGGTTATTTCCATGAGTTGGTTCAAAATGTTATCGCCTGAGTTTGAGCTCTATTTCCTAAGATGGGAGAAGTAGAAGTTGTTTATTATTTAAGAAGAAATTAGTCGACCTGACTCAATATGATAAGTCAAACAACATAGGATTTGGTTTCTCTTTATCACTCGCTCTTTTTGTTTAAAAAATACTGAATTTGAGGTTAACTAATCACTATATCTTAAAAATTCTCAAAACCTTTGAGTCTTTCTTAATCATGCCTACAATTTCATGATGAGCTTTATTGCCCCATAGTAATGTTTCTGCCTCAGACATCTCTTTCTGAGTGAATTTCTGGCTCCTTAATATTCTTGGCCTCTCCTTTGGGAGGTCAAATTTACTTCATAGTTCAAACTGTCTGGTTGCACATCCTGATTGCAGTCTGTTTACACGAATGATAATTAATGAATAGACCTCATGACGACCTTGTTATTGTTTACAGATGGAGAAGATATGTAAAGTTGCTAACAAGCCTCTTGTAGTAACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATTTTCAACCACATGAAGGCCTTTTGCTCACCGAATCTTGTTACTTATAATATATTGTTGAAGGGTTACTTGGAACATGGAATGTTTGAAGAGGCTAGAGAGCTGTTTCAGAATTTGTCAGAGCAAAGACGAAATATCAGCACTGTATCTGACTACAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCAGAAAAAAGATGGGATGATTTTAGCTATTTCTATAACCAGATGTTTCTTTATGGTTATCATTTCAATCCAAAACGTCATTTGAGGATGATATTGGAGGCTGCTAGGGGTGGAAAGGTGGACCTTTTAAATTCAACTTCTTTTTCTTTCCTTGGTTCTCCTCCTTCCTTTCTACGTTTCTATATCTTTTTATATTGCACCACTAGCCTATTTTAAACGTGTATAGTTTAAGTTGGCTCTAGGAATGTTGCTTAATGCTTATAATTGACGATAGTTTTAAATATGCACATGCTAAAGAGATGTAATGATAATAATAGATAATATCATGAAACTATTGGAGGCTGAATTCGACCATTTTCCTATGACGTTTGAC
TTCTTAGGTGTATGACATTGATAATTTGTTGAAGTATTGAGAAAACAAAACGCTAAATCAATTTATTTTTTAATTGATCATGAAATTCAAGATTAAATTGTTCCTTGGCTATTAAGGGGTTCATAACTTCACATTCATGCTTAGAATTCGAGTTCTGCTTATACTAAGAAAAGAAACTATATTCCTTTATGATCTGTGGATGATTTGGTCTTAATTTCCTGCTCTACAAAAGGAATAAACTAAGATAAGTTGATCGCTTAGACTCATTATGGTTTTGCTTTCCTTTAACCATCGACTTCTGTTGTACAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTCAGGCTGACCGTACTCCACCACCACCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGATTACTCTGAAGCGCTCTCTTCCATTTGGAGTCACAATAGTGGTGATGCACATCATTTCTCTGAGTCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAAGATACTGTTATTGAGCTAATTCATAAGGTTAGCATGGTTCTTACTAGAAATGAATCACCAAATCCAGTGTTTAAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTTTAGCTGACCATAGACTTGAAGAAACTGTTTATTAAAATGAAATCTAACCTGCTGCTATCACATATCTATCTATCTATCTATATATATTTAGTATAATTTGAGAG

mRNA sequence

CCAATATTTATAAATATCAGGCTTTAATATTTAAACAAAATGGAGTTCATCAAAGTCTATCATTGATTGATACAAGTGTAGTAGATGCATTTTGTTATTGTTTTACATAAAGTGAAAATGCATATAGTTAATTGTGTTTAGGCAAAATATTAGATTTAATAGTGTTGAGTAGACATAGATAAAGATCAATAGTAGGGAGGATACTATTTATCCAAATCCTCTTTTCTCTTTCCCCTCTCCGTCGGAGTCCCGTCGTTAGCCGCTGAAGGTCTACAAGCTCTCATCTACAATAGCGGCACGGAATTGCAGACGTCAGCTATCAACTCATTTTTTTCACGCATCTTTTGAACTTATAAATTCTCTCTCTACCCACCACCGCCCATCGCCGGAGAAGGTGTGAACGGTTCTATATGTAATCACAGCACGGAATTGAAGAGAGGTTAACCATTATCTTAGCTTTCGGCGGAGAACTTGGTCGGTGCAACTTGCTAAAATGCGATCTGACCCGCAAATAGAACGGGCACAGCCGCGAATAATGGTTTAATCGTATCGCGATATTCAGGGCCAGGTTGCTCCGCCGTCATTTTGAAAAATACTTCAATTGGTGACCTACACAGAACAGTTAGTTTGTTTTTTCTTCTCTCTTTTACTTTTTGTGTGGAGATATTTACTCTGTGTTTCTACTTCGTGTCTCAGACTTCTTTTGGATTATATTGGATTTATTCATGAAATTGGAGTTATTTTGAAATGGTGGGAGTTATAATGGCGAACCTAAATTTGTGCATCCCTAATTGTGAAAGATATGGATTTCCGACACTGCATTGTACCCATAATTCCCACAATTCTTTTTGGGTTTCGTTCTTTCCTAGTTCGGTTCCTGGAACTGACTTAAGTCTTAGTGACGCGAAGAATAGAGTTTTGAGACATAGGGTTCATAAATGTGGATCAATTAAGGCTTTGTCGAATGGAGAATCTGATATTTCATTGCCAAGTGGGAATCTCCTCGAACATGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAAGGTCATGGAGACTGTTAGAACTAGAAGGTATAAGAGGCAGTTGGATGATCCTAATAAACTGACAATGAAGGAAAATGGGAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAGATAGATAATGGAAAAAACAAAGTAACTGATGTTCAACATAACGTGGACGTAAAGAACATGTTTAAACGGGTGGATAAAAAAGATTTGTTCAATAATACAGAGAGAATTGCTCGTGAAAAGGATTTGTCAGGAAATAAATTTGATAGAAGGAAGGTAGTTACAAGATCAAATGATAAGGTTAAAGGCAAGATGACCCCTTTTGGCTCACTGGTTAATGATAAACAGCATGAAGAGAAAAGGAACGAAAACTGGTCAAGTTACATTGAGCCTAGAGTAACACGATCGAACAGCGAGAAACCAATTCATTTTAAAGCTAATATGTTGGAGGTCAAAAAAGAAAGCAGCCGTGTCTCTGATGGAAATTCCATGAAAACATCAGAAAAGATTTGGGCTTGGGGTGATGATGACGCTAAACCACCTAAGGGTGTTCTTAAGGCTGGGAAATATGGCATTCAGCTCGAAAGAAGCTATAATCCTGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGACATCCACAAGTGGTAAGCGTTTTCTTGAATTTAATGAAAAGAATAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTCGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTGCAGACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAATCATCGAATGGCT
TCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGAACACTTTTCCTCATATCCTGACTTGGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCAATCTCCTCCAAAGAAGAAGTTTAAAACAGGGGTTCTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTCAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAAGTCTACAGCCTTCAACCTCAACATATGGATTGGTCATGGAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAATTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGTTCTTGTTAATACACTTTGGAAAGAAGGAAAAACAGATGAGGCTGTGCTGGCCATTGAGAACATGGAAATACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAGGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAACAAGCCTCTTGTAGTAACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATTTTCAACCACATGAAGGCCTTTTGCTCACCGAATCTTGTTACTTATAATATATTGTTGAAGGGTTACTTGGAACATGGAATGTTTGAAGAGGCTAGAGAGCTGTTTCAGAATTTGTCAGAGCAAAGACGAAATATCAGCACTGTATCTGACTACAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCAGAAAAAAGATGGGATGATTTTAGCTATTTCTATAACCAGATGTTTCTTTATGGTTATCATTTCAATCCAAAACGTCATTTGAGGATGATATTGGAGGCTGCTAGGGGTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTCAGGCTGACCGTACTCCACCACCACCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGATTACTCTGAAGCGCTCTCTTCCATTTGGAGTCACAATAGTGGTGATGCACATCATTTCTCTGAGTCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAAGATACTGTTATTGAGCTAATTCATAAGGTTAGCATGGTTCTTACTAGAAATGAATCACCAAATCCAGTGTTTAAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTTTAGCTGACCATAGACTTGAAGAAACTGTTTATTAAAATGAAATCTAACCTGCTGCTATCACATATCTATCTATCTATCTATATATATTTAGTATAATTTGAGAG

Coding sequence (CDS)

ATGGTGGGAGTTATAATGGCGAACCTAAATTTGTGCATCCCTAATTGTGAAAGATATGGATTTCCGACACTGCATTGTACCCATAATTCCCACAATTCTTTTTGGGTTTCGTTCTTTCCTAGTTCGGTTCCTGGAACTGACTTAAGTCTTAGTGACGCGAAGAATAGAGTTTTGAGACATAGGGTTCATAAATGTGGATCAATTAAGGCTTTGTCGAATGGAGAATCTGATATTTCATTGCCAAGTGGGAATCTCCTCGAACATGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAAGGTCATGGAGACTGTTAGAACTAGAAGGTATAAGAGGCAGTTGGATGATCCTAATAAACTGACAATGAAGGAAAATGGGAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAGATAGATAATGGAAAAAACAAAGTAACTGATGTTCAACATAACGTGGACGTAAAGAACATGTTTAAACGGGTGGATAAAAAAGATTTGTTCAATAATACAGAGAGAATTGCTCGTGAAAAGGATTTGTCAGGAAATAAATTTGATAGAAGGAAGGTAGTTACAAGATCAAATGATAAGGTTAAAGGCAAGATGACCCCTTTTGGCTCACTGGTTAATGATAAACAGCATGAAGAGAAAAGGAACGAAAACTGGTCAAGTTACATTGAGCCTAGAGTAACACGATCGAACAGCGAGAAACCAATTCATTTTAAAGCTAATATGTTGGAGGTCAAAAAAGAAAGCAGCCGTGTCTCTGATGGAAATTCCATGAAAACATCAGAAAAGATTTGGGCTTGGGGTGATGATGACGCTAAACCACCTAAGGGTGTTCTTAAGGCTGGGAAATATGGCATTCAGCTCGAAAGAAGCTATAATCCTGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGACATCCACAAGTGGTAAGCGTTTTCTTGAATTTAATGAAAAGAATAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTCGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTGCAGACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAATCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGAACACTTTTCCTCATATCCTGACTTGGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCAATCTCCTCCAAAGAAGAAGTTTAAAACAGGGGTTCTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTCAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAAGTCTACAGCCTTCAACCTCAACATATGGATTGGTCATGGAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAATTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGTTCTTGTTAATACACTTTGGAAAGAAGGAAAAACAGATGAGGCTGTGCTGGCCATTGAGAACATGGAAATACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAGGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAACAAGCCTCTTGTAGTAACTTACACCGGTTTGATTCAAGCTTGTTTGGA
CTCAAAAGACTTGCAAAGTGCAGTCTATATTTTCAACCACATGAAGGCCTTTTGCTCACCGAATCTTGTTACTTATAATATATTGTTGAAGGGTTACTTGGAACATGGAATGTTTGAAGAGGCTAGAGAGCTGTTTCAGAATTTGTCAGAGCAAAGACGAAATATCAGCACTGTATCTGACTACAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCAGAAAAAAGATGGGATGATTTTAGCTATTTCTATAACCAGATGTTTCTTTATGGTTATCATTTCAATCCAAAACGTCATTTGAGGATGATATTGGAGGCTGCTAGGGGTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTCAGGCTGACCGTACTCCACCACCACCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGATTACTCTGAAGCGCTCTCTTCCATTTGGAGTCACAATAGTGGTGATGCACATCATTTCTCTGAGTCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAAGATACTGTTATTGAGCTAATTCATAAGGTTAGCATGGTTCTTACTAGAAATGAATCACCAAATCCAGTGTTTAAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTTTAGCTGACCATAGACTTGAAGAAACTGTTTATTAA

Protein sequence

MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRHRVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKLTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKDLSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKPIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHFSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLADHRLEETVY*
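
The protein sequence corresponds to the CDS read codon by codon, with the trailing `*` marking the stop codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code applies to this gene. The function name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def translate(cds):
    """Translate a DNA coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing N or other ambiguity codes are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))

# First nine codons of the CDS above, then its last two codons (TAT TAA).
print(translate("ATGGTGGGAGTTATAATGGCGAACCTA"))
print(translate("TATTAA"))
```

Passing the full CDS string to `translate` should give the protein sequence listed above, provided the two were derived from the same gene model.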
Homology
BLAST of CSPI01G25380 vs. ExPASy Swiss-Prot
Match: Q9SA76 (Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2279 PE=3 SV=1)

HSP 1 Score: 726.5 bits (1874), Expect = 3.8e-208
Identity = 444/999 (44.44%), Positives = 595/999 (59.56%), Query Frame = 0

Query: 30   SHNSFWVSFFPSSVPGTDLSLSDAKNRVLRHRVHKCGS--IKALSNGESDISLPSGNLLE 89
            S NSFW   F                RV+R    K  S  +  L+    ++ L      +
Sbjct: 19   SRNSFWRPLFHQPYYNC--------RRVVRLNSRKLNSKVMFCLNLNTKEVGLQKPG--D 78

Query: 90   HDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKLTMKEN----GSAKSAESTSISKIDNG 149
              F+FKPSFD+Y+++ME+V+T R K++ D   +L ++E+    G+  S       KI +G
Sbjct: 79   KGFEFKPSFDQYLQIMESVKTARKKKKFD---RLKVEEDDGGGGNGDSVYEVKDMKIKSG 138

Query: 150  KNK-------------VTD------VQHNVDVKNMFKRVDKKDLFNNTERIAREKDLSG- 209
            + K             V+D       + N +++N     D K   +    +A +   SG 
Sbjct: 139  ELKDETFRKRYSRQEIVSDKRNERVFKRNGEIENHRVATDLKWSKSGESSVALKLSKSGE 198

Query: 210  -----------NKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNE----------- 269
                        K   ++   RS+D  +G          D   EE+R +           
Sbjct: 199  SSVTVPEDESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLVVEERRVQRIAKDARWSKS 258

Query: 270  -------NWSSYIEPRVTRSNSE-------KPIHF-------------KANMLEVKKESS 329
                    WS+  E  VT    E       K  H              K + LE+  E  
Sbjct: 259  RESSVAVKWSNSGESSVTMPKDESFRRRYSKQEHHRSSDTSRGIARGSKGDELELVVEER 318

Query: 330  RV----SDGNSMKTSEKIWAWGDDD-----------------AKPPKGVLKAGK-YGIQL 389
            RV     D    K+ E +    +D+                 +   +G+ +  K  G+ L
Sbjct: 319  RVQRIAKDVRWSKSDESLVPVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGDGLDL 378

Query: 390  ERSYNPGDKVGRKKTE---QSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFD-AFDIMDK 449
                   +++  ++ E       GT   G +  + ++ +   +E  AF   D + DI+DK
Sbjct: 379  LAEERRIERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDK 438

Query: 450  PRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNW 509
            P  S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNW
Sbjct: 439  PATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNW 498

Query: 510  RRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVA 569
            RRVLQ+IEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VA
Sbjct: 499  RRVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVA 558

Query: 570  YHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVK 629
            Y SIAVTLGQAG+++ELF VID+M+SPPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+
Sbjct: 559  YRSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQ 618

Query: 630  RKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTY 689
            RK  EGAFWVLQ+LK++  +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y
Sbjct: 619  RKQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAY 678

Query: 690  KVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEAL-------- 749
            +VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L        
Sbjct: 679  RVLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNP 738

Query: 750  --------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNH 809
                                 Q++KIC+VANKPLVVTYTGLIQAC+DS ++++A YIF+ 
Sbjct: 739  VVLKLIENLIYKADLVHTIQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQ 798

Query: 810  MKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFN 869
            MK  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE   +I   SD+  RVLPD Y FN
Sbjct: 799  MKKVCSPNLVTCNIMLKAYLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFN 858

Query: 870  TMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQA 895
            TMLD    +++WDDF Y Y +M  +GYHFN KRHLRM+LEA+R GK+E++E TW+H+ ++
Sbjct: 859  TMLDTCAEQEKWDDFGYAYREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRS 918
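
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length, for example 444/999 for the hit above. A minimal sketch that recomputes them from a header line is shown below. The function name `hsp_percentages` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that appears in some exports of this output.

```python
import re

# Matches 'Identity = 444/999' and 'Positives = 595/999' (or 'Postives').
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(header):
    """Return identity and positives percentages recomputed from an HSP header line."""
    result = {}
    for label, matches, length in HSP_FIELD.findall(header):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 444/999 (44.44%), Postives = 595/999 (59.56%), Query Frame = 0"
print(hsp_percentages(line))
```

Comparing the recomputed values with the percentages printed in parentheses is a quick consistency check when parsing many hits.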

BLAST of CSPI01G25380 vs. ExPASy Swiss-Prot
Match: Q9FJW6 (Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DG1 PE=1 SV=2)

HSP 1 Score: 387.5 bits (994), Expect = 4.2e-106
Identity = 207/546 (37.91%), Positives = 327/546 (59.89%), Query Frame = 0

Query: 358 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEW 417
           E +++L  RL+G +I+   W F +MM  + +++++  +L+++  LG+  +W++   ++ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 418 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 477
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 478 QAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 537
           QAG ++EL  VI+ M+  P K  K    + WDP L+PD+V+YNA+LNACV     +   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 538 VLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 597
           V  EL+K  L+P+ +TYGL MEVMLE GK++ VH+FFRK++ S   P A+TYKVLV  LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422

Query: 598 KEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 657
           +EGK +EAV A+ +ME +G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482

Query: 658 VTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNL 717
           +T+TGLI A L+   +   + IF +MK  C PN+ T N++LK Y  + MF EA+ELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542

Query: 718 SEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHL 777
                    VS     ++P+ Y ++ ML+AS    +W+ F + Y  M L GY  +  +H 
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602

Query: 778 RMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSG 837
            M++EA+R GK  LLE  +  + +    P P    E  C   A+GD+  A++ I +  + 
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLEDGEIPHPLFFTELLCHATAKGDFQRAITLI-NTVAL 662

Query: 838 DAHHFSESAWLNLLKEKR--FPKDTVIELIHKVS-MVLTRNESPNPVFKNLLLSCKEFCR 897
            +   SE  W +L +E +    +D     +HK+S  ++  +    P   NL  S K  C 
Sbjct: 663 ASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRCG 714

Query: 898 TRISLA 899
           +  S A
Sbjct: 723 SSSSSA 714

BLAST of CSPI01G25380 vs. ExPASy Swiss-Prot
Match: Q9FMD3 (Pentatricopeptide repeat-containing protein At5g16640, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g16640 PE=2 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 3.7e-22
Identity = 77/314 (24.52%), Positives = 145/314 (46.18%), Query Frame = 0

Query: 432 IYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDS 491
           IY T +D L K+++   AL++ + M++     PD+V Y+S+   L  +G   +   ++  
Sbjct: 188 IYNTIIDGLCKSKQVDNALDLLNRMEKD-GIGPDVVTYNSLISGLCSSGRWSDATRMVSC 247

Query: 492 MQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPST 551
           M                   + PD+  +NA+++ACVK   +  A    +E+ ++SL P  
Sbjct: 248 MTK---------------REIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDI 307

Query: 552 STYGLVMEVMLECGKYNLVHEFFR-KVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIEN 611
            TY L++  +    + +   E F   V K   P+ +TY +L+N   K  K +  +     
Sbjct: 308 VTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCE 367

Query: 612 MEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKD 671
           M  RG+V +   Y    +  C AG+   A     ++      P ++TY  L+    D+  
Sbjct: 368 MSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGK 427

Query: 672 LQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYR 731
           ++ A+ I   M K     ++VTYNI+++G  + G   +A +++ +L+ Q           
Sbjct: 428 IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLNCQ----------- 473

Query: 732 DRVLPDIYMFNTML 744
             ++PDI+ + TM+
Sbjct: 488 -GLMPDIWTYTTMM 473

BLAST of CSPI01G25380 vs. ExPASy Swiss-Prot
Match: Q0WPZ6 (Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX=3702 GN=At2g17140 PE=2 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 8.3e-22
Identity = 57/214 (26.64%), Positives = 109/214 (50.93%), Query Frame = 0

Query: 510 PRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNL 569
           P  +P + +YN +L +C+K + +E   W+ +++    + P T T+ L++  + +    + 
Sbjct: 106 PENKPSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDA 165

Query: 570 VHEFFRKV-QKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFAR 629
             E F ++ +K   PN  T+ +LV    K G TD+ +  +  ME  G++ +  +Y     
Sbjct: 166 ARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVS 225

Query: 630 CLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMK-----A 689
             C  GR  ++   +EK+ +    P +VT+   I A      +  A  IF+ M+      
Sbjct: 226 SFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLG 285

Query: 690 FCSPNLVTYNILLKGYLEHGMFEEARELFQNLSE 718
              PN +TYN++LKG+ + G+ E+A+ LF+++ E
Sbjct: 286 LPRPNSITYNLMLKGFCKVGLLEDAKTLFESIRE 319

BLAST of CSPI01G25380 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 7.0e-21
Identity = 85/342 (24.85%), Positives = 146/342 (42.69%), Query Frame = 0

Query: 491 SMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPS 550
           ++++ PK      +LEK+    QPD+  YNA++N   K   ++ A  VL  ++ +   P 
Sbjct: 136 TLRNIPKAVRVMEILEKFG---QPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPD 195

Query: 551 TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSI-PNALTYKVLVNTLWKEGKTDEAVLAIE 610
           T TY +++  +   GK +L  +   ++   +  P  +TY +L+     EG  DEA+  ++
Sbjct: 196 TVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMD 255

Query: 611 NMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 670
            M  RG+      Y    R +C  G    A   +  +     +P V++Y  L++A L+  
Sbjct: 256 EMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQG 315

Query: 671 DLQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDY 730
             +    +   M    C PN+VTY+IL+      G  EEA  L + + E+          
Sbjct: 316 KWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEK---------- 375

Query: 731 RDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGK-D 790
              + PD Y ++ ++ A   E R D    F   M   G   +   +  ++    + GK D
Sbjct: 376 --GLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKAD 435

Query: 791 ELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSI 830
           + LE   K L +   +P        F    + GD   AL  I
Sbjct: 436 QALEIFGK-LGEVGCSPNSSSYNTMFSALWSSGDKIRALHMI 461

BLAST of CSPI01G25380 vs. ExPASy TrEMBL
Match: A0A0A0LVN7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1)

HSP 1 Score: 1807.0 bits (4679), Expect = 0.0e+00
Identity = 898/907 (99.01%), Positives = 901/907 (99.34%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSV GTD SLSDAKNRVLRH
Sbjct: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK
Sbjct: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
           LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIA EKD
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180

Query: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKPI 240
           LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNS+KPI
Sbjct: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKPI 240

Query: 241 HFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPGD 300
           HFKAN LEVKKESSRVSDGNSMKTSEKIWAWGDDDAKP KGVLKAGKYGIQLERSYNPGD
Sbjct: 241 HFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPGD 300

Query: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360
           KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI
Sbjct: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360

Query: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420
           QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM
Sbjct: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420

Query: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480
           RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG
Sbjct: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480

Query: 481 YMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540
           YMRELFDVIDSM+SPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ
Sbjct: 481 YMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540

Query: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600
           ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK
Sbjct: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600

Query: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660
           TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG
Sbjct: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660

Query: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720
           LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR
Sbjct: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720

Query: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780
           NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE
Sbjct: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780

Query: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840
           AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF
Sbjct: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840

Query: 841 SESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900
           SESAWLNLLKEKRFP+DTVIELIHKV MVLTRNESPNPVFKNLLLSCKEFCRTRISLADH
Sbjct: 841 SESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900

Query: 901 RLEETVY 908
           RLEETVY
Sbjct: 901 RLEETVY 907

BLAST of CSPI01G25380 vs. ExPASy TrEMBL
Match: A0A1S3C8Z0 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498323 PE=4 SV=1)

HSP 1 Score: 1714.1 bits (4438), Expect = 0.0e+00
Identity = 853/909 (93.84%), Positives = 874/909 (96.15%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVP--GTDLSLSDAKNRVL 60
           MVGVIMAN+NL IPNCERYGFPTLHCTHNSH SFWVSFFPSSV   GTDL+ SDAKNRVL
Sbjct: 1   MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60

Query: 61  RHRVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDP 120
           RHR+HKCGSIKALSNGESDISLP+GNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLD P
Sbjct: 61  RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120

Query: 121 NKLTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIARE 180
           NKLTMKEN SAKSAESTSISKIDNGKNKVTDVQHNV+VKNMFKRVDKKDLFNNTERIARE
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180

Query: 181 KDLSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEK 240
           K LSGNKFDR K VTRSNDKVKGKMTPFGSLVNDKQHEEK+N NWSSYIEP+VTRSN EK
Sbjct: 181 KHLSGNKFDRSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCEK 240

Query: 241 PIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNP 300
           PIHFKAN LE KKE SRVS GNSMKTSEKIWAWG+DDAKP K VLKAGKYGIQLERSY+P
Sbjct: 241 PIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYSP 300

Query: 301 GDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEE 360
           GDKVGRKKTEQSYRGTSTSGKRFLEF E+NSLEVEHAAFNNFDA DIMDKPRVSKMEMEE
Sbjct: 301 GDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEMEE 360

Query: 361 RIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWL 420
           RIQMLSKRLNGADIDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWL
Sbjct: 361 RIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWL 420

Query: 421 QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ 480
           QMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ
Sbjct: 421 QMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ 480

Query: 481 AGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV 540
           AGYMRELFDVIDSM+SPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV
Sbjct: 481 AGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV 540

Query: 541 LQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE 600
           LQELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE
Sbjct: 541 LQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE 600

Query: 601 GKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY 660
           GKTDEAVLAIENME+RG+VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY
Sbjct: 601 GKTDEAVLAIENMEMRGVVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY 660

Query: 661 TGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQ 720
           TGLIQACLDSKDLQSAVY+FN MKAFCSPNLVTYNILLKGYLEHGMFEEAREL QNLSEQ
Sbjct: 661 TGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVTYNILLKGYLEHGMFEEARELLQNLSEQ 720

Query: 721 RRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI 780
           R+NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI
Sbjct: 721 RQNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI 780

Query: 781 LEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAH 840
           LEAAR GKDELLETTWKHLAQADRTPPPPLLKERFCMK+ARGDY+EAL  I +HNSGDAH
Sbjct: 781 LEAARVGKDELLETTWKHLAQADRTPPPPLLKERFCMKVARGDYTEALRCISNHNSGDAH 840

Query: 841 HFSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLA 900
           HFSESAWLNLLKEKRFPKDTVIELIHKV MV   NESPNPVFKNLLLSCKEFCRTRIS+A
Sbjct: 841 HFSESAWLNLLKEKRFPKDTVIELIHKVGMVFATNESPNPVFKNLLLSCKEFCRTRISVA 900

Query: 901 DHRLEETVY 908
           DHRLEETV+
Sbjct: 901 DHRLEETVH 909

BLAST of CSPI01G25380 vs. ExPASy TrEMBL
Match: A0A5D3CBM0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold10007G00030 PE=4 SV=1)

HSP 1 Score: 1622.8 bits (4201), Expect = 0.0e+00
Identity = 813/876 (92.81%), Positives = 833/876 (95.09%), Query Frame = 0

Query: 45  GTDLSLSDAKNRVLRHRVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVME 104
           GTDL+ SDAKNRVLRHR+HKCGSIKALSNGESDISLP+GNLLEHDFQFKPSFDEYVKVME
Sbjct: 19  GTDLNFSDAKNRVLRHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVME 78

Query: 105 TVRTRRYKRQLDDPNKLTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVD 164
           TVRTRRYKRQLD PNKLTMKEN SAKSAESTSISKIDNGKNKVTDVQHNV+VKNMFKRVD
Sbjct: 79  TVRTRRYKRQLDYPNKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVD 138

Query: 165 KKDLFNNTERIAREKDLSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWS 224
           KKDLFNNTERIAREK LSGNKFDR K VTRSNDKVKGKMTPFGSLVNDKQHEEK+N NWS
Sbjct: 139 KKDLFNNTERIAREKHLSGNKFDRSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWS 198

Query: 225 SYIEPRVTRSNSEKPIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLK 284
           SYIEP+VTRSN EKPIHFKAN LE KKE SRVS GNSMKTSEKIWAWG+DDAKP K VLK
Sbjct: 199 SYIEPKVTRSNCEKPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLK 258

Query: 285 AGKYGIQLERSYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFD 344
           AGKYGIQLERSY+PGDKVGRKKTEQSYRGTSTSGKRFLEF E+NSLEVEHAAFNNFDA D
Sbjct: 259 AGKYGIQLERSYSPGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALD 318

Query: 345 IMDKPRVSKMEMEERIQMLSK-------------RLNGADIDMPEWMFSQMMRSAKIRYS 404
           IMDKPRVSKMEMEERIQMLSK             RLNGADIDMPEWMFSQMMR AKIRYS
Sbjct: 319 IMDKPRVSKMEMEERIQMLSKRFGVPCSLDFLVNRLNGADIDMPEWMFSQMMRGAKIRYS 378

Query: 405 DHSILRVIQVLGKLGNWRRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN 464
           DHSILRVIQVLGKLGNWRRVLQ+IEWLQMRERFKSHK RFIYTTALDVLGKARRPVEALN
Sbjct: 379 DHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKPRFIYTTALDVLGKARRPVEALN 438

Query: 465 VFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPR 524
           VFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSM+SPPKKKFKTG LEKWDPR
Sbjct: 439 VFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR 498

Query: 525 LQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVH 584
           LQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQ LQPSTSTYGLVMEVMLECGKYNLVH
Sbjct: 499 LQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVH 558

Query: 585 EFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLC 644
           EFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENME+RG+VGSAALYYDFARCLC
Sbjct: 559 EFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEMRGVVGSAALYYDFARCLC 618

Query: 645 SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVT 704
           SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVY+FN MKAFCSPNLVT
Sbjct: 619 SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVT 678

Query: 705 YNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKR 764
           YNILLKGYLEHGMFEEAREL QNLSEQR+NISTVSDYRDRVLPDIYMFNTMLDASFAEKR
Sbjct: 679 YNILLKGYLEHGMFEEARELLQNLSEQRQNISTVSDYRDRVLPDIYMFNTMLDASFAEKR 738

Query: 765 WDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKE 824
           WDDFSYFYNQMFLYGYHFNPKRHLRMILEAAR GKDELLETTWKHLAQADRTPPPPLLKE
Sbjct: 739 WDDFSYFYNQMFLYGYHFNPKRHLRMILEAARVGKDELLETTWKHLAQADRTPPPPLLKE 798

Query: 825 RFCMKLARGDYSEALSSIWSHNSGDAHHFSESAWLNLLKEKRFPKDTVIELIHKVSMVLT 884
           RFCMK+ARGDY+EAL  I +HNSGDAHHFSESAWLNLLKEKRFPKDTVIELIHKV MV  
Sbjct: 799 RFCMKVARGDYTEALRCISNHNSGDAHHFSESAWLNLLKEKRFPKDTVIELIHKVGMVFA 858

Query: 885 RNESPNPVFKNLLLSCKEFCRTRISLADHRLEETVY 908
            NESPNPVFKNLLLSCKEFCRTRIS+ADHRLEETV+
Sbjct: 859 TNESPNPVFKNLLLSCKEFCRTRISVADHRLEETVH 894

BLAST of CSPI01G25380 vs. ExPASy TrEMBL
Match: A0A6J1EH18 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111434226 PE=4 SV=1)

HSP 1 Score: 1483.8 bits (3840), Expect = 0.0e+00
Identity = 752/907 (82.91%), Positives = 810/907 (89.31%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMAN NLCIP CE  GFP L+CT NSH     S FPSSV G+ L+   AK+RVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSVFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           R HKCG+IKA S GESDI L SGNLLE DFQFKPSFDEYV+VME+VR+RRYKRQ DDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
             MKEN SAKSAEST IS I      VTDVQ N+DVKN    VD +DLF+N+E+I R+ D
Sbjct: 121 --MKENASAKSAESTFISNI------VTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTD 180

Query: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKP 240
           LSGNKFD +RK VTRS D++KGK+TPF S VNDKQHEEKRN NWS+YIEP+ TRSN +K 
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFESQVNDKQHEEKRNGNWSNYIEPKATRSNHDKR 240

Query: 241 IHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPG 300
           +HFKAN L+VK ES  V  G+SMK S+KIWA  DDD+KP K VLK GKYG+QLE +Y PG
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISDKIWA--DDDSKPTKDVLKVGKYGVQLEGNYIPG 300

Query: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
           DKVGRKKTEQSYRG S SGKRF EF E++SLEVEHAAFN+ DA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEER 360

Query: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
           IQMLS RLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQ+IEWLQ
Sbjct: 361 IQMLSNRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQ 420

Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
           MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQA 480

Query: 481 GYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
           GYMRELFDVIDSM+SPPKKKFKTG  EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL 540

Query: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
           QELK+Q LQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600

Query: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
           KTDEAVLAI+ ME RGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYT 660

Query: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
           GLIQACLDSK+LQSAVYIFNHMKAFCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE  
Sbjct: 661 GLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENG 720

Query: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
           RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRMI+
Sbjct: 721 RNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIM 780

Query: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
           EAARGGKDELLETTWKHLAQADRT PPPL+KERFC+ LARGDYSEALS I  H+S D HH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHH 840

Query: 841 FSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
           FS+SAWLNLLKEKRFPKD+VIELIHKVSM+L RN+SPNPV +NLLLS KEFCR+RIS+AD
Sbjct: 841 FSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVAD 897

Query: 901 HRLEETV 907
            RLEE V
Sbjct: 901 PRLEEVV 897

BLAST of CSPI01G25380 vs. ExPASy TrEMBL
Match: A0A6J1KEH7 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111495096 PE=4 SV=1)

HSP 1 Score: 1482.2 bits (3836), Expect = 0.0e+00
Identity = 752/907 (82.91%), Positives = 810/907 (89.31%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMAN NLCIP CE  GF  L+CT NSH    +SFFPSSV G+ L+   AK+RVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFSALYCTQNSHYLLGLSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           R HKCG+IKA S GESDI L SGNLLE DFQFKPSFDEYV+VME+VR+RRYKRQ DDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
             MKEN SAKSAESTSIS I      VTDVQ N+DVKN    VD +DLF+N+ERI R+ D
Sbjct: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKNKVVYVDGEDLFDNSERITRKTD 180

Query: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKP 240
           LSGNKFD +RK VTRS D++KGK+TPF S +NDKQHEEKRN NWS+YIEP+VTRSN +K 
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQINDKQHEEKRNGNWSNYIEPKVTRSNHDKR 240

Query: 241 IHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPG 300
           +HFKAN L+VK ES  V  G+SMK SEKIWA  DDD KP K VLK GKYG+QL+ +Y PG
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISEKIWA--DDDIKPTKDVLKVGKYGVQLKGNYIPG 300

Query: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
           DKVGRKKTEQSYRG S SGKRF EF E++SLEVEHAAFN+ DA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAADIMDKPRVSKMEMEER 360

Query: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
           IQMLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQ 420

Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
           MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQA 480

Query: 481 GYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
           GYMRELFDVIDSM+SPPKKKFKTG  EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL 540

Query: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
           QELK+Q LQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600

Query: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
           KTDEAVLAI+ ME RGIVGSAALYYDFARCLCSAGR +EALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRWEEALMQMEKICKVANKPLVVTYT 660

Query: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
           GLIQACLDSK+LQSAVYIFNHMKAFCSPNLVT NILLKGYL+HGMF EA+ELFQN+SE  
Sbjct: 661 GLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFNEAKELFQNMSENG 720

Query: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
           RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRMI+
Sbjct: 721 RNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIM 780

Query: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
           EAARGGKDELLETTWKHLAQADR  PPPL+KERFC+ LARGDYSEALS I  H+S D HH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRILPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHH 840

Query: 841 FSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
           FS+SAWLNLLKEKRFPKD+VI+LIHKVSM+L RN+SPNPV +NLLLS KEFCR+RIS+AD
Sbjct: 841 FSKSAWLNLLKEKRFPKDSVIQLIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVAD 897

Query: 901 HRLEETV 907
            RLEE V
Sbjct: 901 PRLEEVV 897

BLAST of CSPI01G25380 vs. NCBI nr
Match: XP_031741862.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus] >XP_031741863.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus] >KGN65965.1 hypothetical protein Csa_023210 [Cucumis sativus])

HSP 1 Score: 1807.0 bits (4679), Expect = 0.0e+00
Identity = 898/907 (99.01%), Positives = 901/907 (99.34%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSV GTD SLSDAKNRVLRH
Sbjct: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK
Sbjct: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
           LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIA EKD
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180

Query: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKPI 240
           LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNS+KPI
Sbjct: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKPI 240

Query: 241 HFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPGD 300
           HFKAN LEVKKESSRVSDGNSMKTSEKIWAWGDDDAKP KGVLKAGKYGIQLERSYNPGD
Sbjct: 241 HFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPGD 300

Query: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360
           KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI
Sbjct: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360

Query: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420
           QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM
Sbjct: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420

Query: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480
           RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG
Sbjct: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480

Query: 481 YMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540
           YMRELFDVIDSM+SPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ
Sbjct: 481 YMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540

Query: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600
           ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK
Sbjct: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600

Query: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660
           TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG
Sbjct: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660

Query: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720
           LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR
Sbjct: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720

Query: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780
           NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE
Sbjct: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780

Query: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840
           AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF
Sbjct: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840

Query: 841 SESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900
           SESAWLNLLKEKRFP+DTVIELIHKV MVLTRNESPNPVFKNLLLSCKEFCRTRISLADH
Sbjct: 841 SESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900

Query: 901 RLEETVY 908
           RLEETVY
Sbjct: 901 RLEETVY 907

BLAST of CSPI01G25380 vs. NCBI nr
Match: XP_008459122.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis melo])

HSP 1 Score: 1714.1 bits (4438), Expect = 0.0e+00
Identity = 853/909 (93.84%), Positives = 874/909 (96.15%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVP--GTDLSLSDAKNRVL 60
           MVGVIMAN+NL IPNCERYGFPTLHCTHNSH SFWVSFFPSSV   GTDL+ SDAKNRVL
Sbjct: 1   MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60

Query: 61  RHRVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDP 120
           RHR+HKCGSIKALSNGESDISLP+GNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLD P
Sbjct: 61  RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120

Query: 121 NKLTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIARE 180
           NKLTMKEN SAKSAESTSISKIDNGKNKVTDVQHNV+VKNMFKRVDKKDLFNNTERIARE
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180

Query: 181 KDLSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEK 240
           K LSGNKFDR K VTRSNDKVKGKMTPFGSLVNDKQHEEK+N NWSSYIEP+VTRSN EK
Sbjct: 181 KHLSGNKFDRSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCEK 240

Query: 241 PIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNP 300
           PIHFKAN LE KKE SRVS GNSMKTSEKIWAWG+DDAKP K VLKAGKYGIQLERSY+P
Sbjct: 241 PIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYSP 300

Query: 301 GDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEE 360
           GDKVGRKKTEQSYRGTSTSGKRFLEF E+NSLEVEHAAFNNFDA DIMDKPRVSKMEMEE
Sbjct: 301 GDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEMEE 360

Query: 361 RIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWL 420
           RIQMLSKRLNGADIDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWL
Sbjct: 361 RIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWL 420

Query: 421 QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ 480
           QMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ
Sbjct: 421 QMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ 480

Query: 481 AGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV 540
           AGYMRELFDVIDSM+SPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV
Sbjct: 481 AGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV 540

Query: 541 LQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE 600
           LQELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE
Sbjct: 541 LQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE 600

Query: 601 GKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY 660
           GKTDEAVLAIENME+RG+VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY
Sbjct: 601 GKTDEAVLAIENMEMRGVVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY 660

Query: 661 TGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQ 720
           TGLIQACLDSKDLQSAVY+FN MKAFCSPNLVTYNILLKGYLEHGMFEEAREL QNLSEQ
Sbjct: 661 TGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVTYNILLKGYLEHGMFEEARELLQNLSEQ 720

Query: 721 RRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI 780
           R+NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI
Sbjct: 721 RQNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI 780

Query: 781 LEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAH 840
           LEAAR GKDELLETTWKHLAQADRTPPPPLLKERFCMK+ARGDY+EAL  I +HNSGDAH
Sbjct: 781 LEAARVGKDELLETTWKHLAQADRTPPPPLLKERFCMKVARGDYTEALRCISNHNSGDAH 840

Query: 841 HFSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLA 900
           HFSESAWLNLLKEKRFPKDTVIELIHKV MV   NESPNPVFKNLLLSCKEFCRTRIS+A
Sbjct: 841 HFSESAWLNLLKEKRFPKDTVIELIHKVGMVFATNESPNPVFKNLLLSCKEFCRTRISVA 900

Query: 901 DHRLEETVY 908
           DHRLEETV+
Sbjct: 901 DHRLEETVH 909

BLAST of CSPI01G25380 vs. NCBI nr
Match: KAA0047051.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK09387.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1622.8 bits (4201), Expect = 0.0e+00
Identity = 813/876 (92.81%), Positives = 833/876 (95.09%), Query Frame = 0

Query: 45  GTDLSLSDAKNRVLRHRVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVME 104
           GTDL+ SDAKNRVLRHR+HKCGSIKALSNGESDISLP+GNLLEHDFQFKPSFDEYVKVME
Sbjct: 19  GTDLNFSDAKNRVLRHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVME 78

Query: 105 TVRTRRYKRQLDDPNKLTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVD 164
           TVRTRRYKRQLD PNKLTMKEN SAKSAESTSISKIDNGKNKVTDVQHNV+VKNMFKRVD
Sbjct: 79  TVRTRRYKRQLDYPNKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVD 138

Query: 165 KKDLFNNTERIAREKDLSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWS 224
           KKDLFNNTERIAREK LSGNKFDR K VTRSNDKVKGKMTPFGSLVNDKQHEEK+N NWS
Sbjct: 139 KKDLFNNTERIAREKHLSGNKFDRSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWS 198

Query: 225 SYIEPRVTRSNSEKPIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLK 284
           SYIEP+VTRSN EKPIHFKAN LE KKE SRVS GNSMKTSEKIWAWG+DDAKP K VLK
Sbjct: 199 SYIEPKVTRSNCEKPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLK 258

Query: 285 AGKYGIQLERSYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFD 344
           AGKYGIQLERSY+PGDKVGRKKTEQSYRGTSTSGKRFLEF E+NSLEVEHAAFNNFDA D
Sbjct: 259 AGKYGIQLERSYSPGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALD 318

Query: 345 IMDKPRVSKMEMEERIQMLSK-------------RLNGADIDMPEWMFSQMMRSAKIRYS 404
           IMDKPRVSKMEMEERIQMLSK             RLNGADIDMPEWMFSQMMR AKIRYS
Sbjct: 319 IMDKPRVSKMEMEERIQMLSKRFGVPCSLDFLVNRLNGADIDMPEWMFSQMMRGAKIRYS 378

Query: 405 DHSILRVIQVLGKLGNWRRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN 464
           DHSILRVIQVLGKLGNWRRVLQ+IEWLQMRERFKSHK RFIYTTALDVLGKARRPVEALN
Sbjct: 379 DHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKPRFIYTTALDVLGKARRPVEALN 438

Query: 465 VFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPR 524
           VFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSM+SPPKKKFKTG LEKWDPR
Sbjct: 439 VFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR 498

Query: 525 LQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVH 584
           LQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQ LQPSTSTYGLVMEVMLECGKYNLVH
Sbjct: 499 LQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVH 558

Query: 585 EFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLC 644
           EFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENME+RG+VGSAALYYDFARCLC
Sbjct: 559 EFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEMRGVVGSAALYYDFARCLC 618

Query: 645 SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVT 704
           SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVY+FN MKAFCSPNLVT
Sbjct: 619 SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVT 678

Query: 705 YNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKR 764
           YNILLKGYLEHGMFEEAREL QNLSEQR+NISTVSDYRDRVLPDIYMFNTMLDASFAEKR
Sbjct: 679 YNILLKGYLEHGMFEEARELLQNLSEQRQNISTVSDYRDRVLPDIYMFNTMLDASFAEKR 738

Query: 765 WDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKE 824
           WDDFSYFYNQMFLYGYHFNPKRHLRMILEAAR GKDELLETTWKHLAQADRTPPPPLLKE
Sbjct: 739 WDDFSYFYNQMFLYGYHFNPKRHLRMILEAARVGKDELLETTWKHLAQADRTPPPPLLKE 798

Query: 825 RFCMKLARGDYSEALSSIWSHNSGDAHHFSESAWLNLLKEKRFPKDTVIELIHKVSMVLT 884
           RFCMK+ARGDY+EAL  I +HNSGDAHHFSESAWLNLLKEKRFPKDTVIELIHKV MV  
Sbjct: 799 RFCMKVARGDYTEALRCISNHNSGDAHHFSESAWLNLLKEKRFPKDTVIELIHKVGMVFA 858

Query: 885 RNESPNPVFKNLLLSCKEFCRTRISLADHRLEETVY 908
            NESPNPVFKNLLLSCKEFCRTRIS+ADHRLEETV+
Sbjct: 859 TNESPNPVFKNLLLSCKEFCRTRISVADHRLEETVH 894

BLAST of CSPI01G25380 vs. NCBI nr
Match: XP_038894404.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 1584.7 bits (4102), Expect = 0.0e+00
Identity = 793/907 (87.43%), Positives = 840/907 (92.61%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMAN+NLCIP+CER GFP LHCT NSHN F  SFFPSSV G DL+  DAK+RVLRH
Sbjct: 1   MVGVIMANVNLCIPSCERNGFPALHCTQNSHNFFGFSFFPSSVSGPDLNFGDAKHRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           RVHKCGSIKA SNGESDI LPS NLLE+DFQFKPSFDEYV+VMETVRTRRYKRQ DDPNK
Sbjct: 61  RVHKCGSIKASSNGESDIRLPSENLLENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
           LTMKEN S KSAE TSISKIDNGKNKVTDVQ NVDVKNMFKRVD+KDLFNNTERI RE+D
Sbjct: 121 LTMKENASVKSAEITSISKIDNGKNKVTDVQGNVDVKNMFKRVDRKDLFNNTERITRERD 180

Query: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKP 240
           LSGNK D +RK ++RSND+VKGK+TPF S VNDKQHEEKRN N S+Y EP+V R  +EK 
Sbjct: 181 LSGNKIDSKRKGISRSNDEVKGKVTPFDSQVNDKQHEEKRNINRSNYTEPKVPRLYNEKR 240

Query: 241 IHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPG 300
           I+FKAN L++K+ES R S+G+SM+ S KIWA  +DD KP K +L A KY +QLER+Y  G
Sbjct: 241 INFKANTLDIKRESHRASNGSSMRISGKIWA--NDDTKPAKDILNAVKYSVQLERNYISG 300

Query: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
           DKVGRKKTEQSYR +S SGKRFLEF E +SLEVEHAAFNNFDA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRESSKSGKRFLEFTEDSSLEVEHAAFNNFDALDIMDKPRVSKMEMEER 360

Query: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
           IQML KRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWLQ
Sbjct: 361 IQMLCKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ 420

Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
           MRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQ+HF+SYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNLFHAMQQHFTSYPDLVAYHSIAVTLGQA 480

Query: 481 GYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
           GYM+ELFDVIDSM+SPPKKKFKTGVLEKWDPRL+PDIVIYNAVLNACVKRKNLEGAFWVL
Sbjct: 481 GYMKELFDVIDSMRSPPKKKFKTGVLEKWDPRLEPDIVIYNAVLNACVKRKNLEGAFWVL 540

Query: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
           QELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600

Query: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
           KTDEAVLAIENME RGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVA KPLVVTYT
Sbjct: 601 KTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVATKPLVVTYT 660

Query: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
           GLIQACLDSKD++SAVYIFNHMK FCSPNLVTYN+LLKGYLEHGMFEEARELFQNLSE  
Sbjct: 661 GLIQACLDSKDIRSAVYIFNHMKTFCSPNLVTYNMLLKGYLEHGMFEEARELFQNLSEHG 720

Query: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
           RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDF YFY+QM LYGYHFNPKRHLRMIL
Sbjct: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFYDQMLLYGYHFNPKRHLRMIL 780

Query: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
           EAAR GKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALS I +H+S D HH
Sbjct: 781 EAARAGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSCISNHDSSDVHH 840

Query: 841 FSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
           FSES WLNLLKEKRFPKDTVI+LI+KVSM+LTRN+ PNPVFKNLLLSCKEFCRTRIS+AD
Sbjct: 841 FSESGWLNLLKEKRFPKDTVIQLINKVSMLLTRNDLPNPVFKNLLLSCKEFCRTRISVAD 900

Query: 901 HRLEETV 907
           HRLEETV
Sbjct: 901 HRLEETV 905

BLAST of CSPI01G25380 vs. NCBI nr
Match: KAG7019446.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1492.6 bits (3863), Expect = 0.0e+00
Identity = 755/907 (83.24%), Positives = 813/907 (89.64%), Query Frame = 0

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMAN NLCIP CE  GFP L+CT NSH     SFFPSSV G+ L+   AK+RVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           R HKCG+IKA S GESDI L SGNLLE DFQFKPSFDEYV+VME+VR+RRYKRQ DDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
             MKEN SAKSAESTSIS I      VTDVQ N+DVKN    VD +DLF+N+E+I R+ D
Sbjct: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTD 180

Query: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKP 240
           LSGNKFD +RK VTRS D++KGK+TPF S VNDKQHEEKRN NWS+YIEP+ TRSN +K 
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHEEKRNGNWSNYIEPKATRSNHDKR 240

Query: 241 IHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPG 300
           +HFKAN L+VK ES  V  G+SMK S+KIWA  DDD KP K VLK GKYG+QLE +Y PG
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISDKIWA--DDDTKPTKDVLKVGKYGVQLEGNYIPG 300

Query: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
           DKVGRKKTEQSYRG S SGKRF EF E++SLEVEHAAFN+FDA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSFDAEDIMDKPRVSKMEMEER 360

Query: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
           IQMLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQ 420

Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
           MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQA 480

Query: 481 GYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
           GYMRELFDVIDSM+SPPKKKFKTG  EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL 540

Query: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
           QELK+Q LQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600

Query: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
           KTDEAVLAI+ ME RGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYT 660

Query: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
           GLIQACLDSK+LQSAVYIFNHMKAFCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE  
Sbjct: 661 GLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENG 720

Query: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
           RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRMI+
Sbjct: 721 RNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIM 780

Query: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
           EAARGGKDELLETTWKHLAQADRT PPPL+KERFC+ LARGDYSEALS I  H+S D HH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHH 840

Query: 841 FSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
           FS+SAWLNLLKEKRFPKD+VIELIHKVSM+L RN+SPNPV +NLLLS KEFCR+RI++AD
Sbjct: 841 FSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRITVAD 897

Query: 901 HRLEETV 907
            RLEE V
Sbjct: 901 PRLEEVV 897
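
The percentages in an HSP header can be rechecked from the counts printed beside them. The sketch below is a minimal example using the header values of the KAG7019446.1 hit above; `parse_hsp_header` and the dictionary keys are hypothetical names chosen for illustration, not part of any BLAST tool.

```python
import re

# Header text for the KAG7019446.1 hit, copied from the record above.
HSP_HEADER = (
    "HSP 1 Score: 1492.6 bits (3863), Expect = 0.0e+00\n"
    "Identity = 755/907 (83.24%), Positives = 813/907 (89.64%), Query Frame = 0"
)


def parse_hsp_header(text):
    """Hypothetical helper: pull the numeric fields out of one HSP header."""
    bits, raw = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text).groups()
    expect = re.search(r"Expect = (\S+)", text).group(1)
    # The first count/length pair is identity, the second is positives.
    pairs = re.findall(r"(\d+)/(\d+) \(", text)
    (ident, length), (pos, _) = [(int(a), int(b)) for a, b in pairs[:2]]
    return {
        "bit_score": float(bits),
        "raw_score": int(raw),
        "expect": float(expect),
        "alignment_length": length,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
    }


hsp = parse_hsp_header(HSP_HEADER)
print(hsp["identity_pct"], hsp["positives_pct"])
```

For this hit the recomputed values agree with the 83.24% and 89.64% shown in the header.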

BLAST of CSPI01G25380 vs. TAIR 10
Match: AT1G30610.2 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 738.8 bits (1906), Expect = 5.2e-213
Identity = 443/971 (45.62%), Positives = 594/971 (61.17%), Query Frame = 0

Query: 30  SHNSFWVSFFPSSVPGTDLSLSDAKNRVLRHRVHKCGS--IKALSNGESDISLPSGNLLE 89
           S NSFW   F                RV+R    K  S  +  L+    ++ L      +
Sbjct: 19  SRNSFWRPLFHQPYYNC--------RRVVRLNSRKLNSKVMFCLNLNTKEVGLQKPG--D 78

Query: 90  HDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKLTMKEN----GSAKSAESTSISKIDNG 149
             F+FKPSFD+Y+++ME+V+T R K++ D   +L ++E+    G+  S       KI +G
Sbjct: 79  KGFEFKPSFDQYLQIMESVKTARKKKKFD---RLKVEEDDGGGGNGDSVYEVKDMKIKSG 138

Query: 150 KNK-------------VTD------VQHNVDVKNMFKRVDKKDLFNNTERIAREKDLSG- 209
           + K             V+D       + N +++N     D K   +    +A +   SG 
Sbjct: 139 ELKDETFRKRYSRQEIVSDKRNERVFKRNGEIENHRVATDLKWSKSGESSVALKLSKSGE 198

Query: 210 -----------NKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNE----------- 269
                       K   ++   RS+D  +G          D   EE+R +           
Sbjct: 199 SSVTVPEDESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLVVEERRVQRIAKDARWSKS 258

Query: 270 -------NWSSYIEPRVTRSNSE-------KPIHF-------------KANMLEVKKESS 329
                   WS+  E  VT    E       K  H              K + LE+  E  
Sbjct: 259 RESSVAVKWSNSGESSVTMPKDESFRRRYSKQEHHRSSDTSRGIARGSKGDELELVVEER 318

Query: 330 RV----SDGNSMKTSEKIWAWGDDD-----------------AKPPKGVLKAGK-YGIQL 389
           RV     D    K+ E +    +D+                 +   +G+ +  K  G+ L
Sbjct: 319 RVQRIAKDVRWSKSDESLVPVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGDGLDL 378

Query: 390 ERSYNPGDKVGRKKTE---QSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFD-AFDIMDK 449
                  +++  ++ E       GT   G +  + ++ +   +E  AF   D + DI+DK
Sbjct: 379 LAEERRIERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDK 438

Query: 450 PRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNW 509
           P  S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNW
Sbjct: 439 PATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNW 498

Query: 510 RRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVA 569
           RRVLQ+IEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VA
Sbjct: 499 RRVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVA 558

Query: 570 YHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVK 629
           Y SIAVTLGQAG+++ELF VID+M+SPPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+
Sbjct: 559 YRSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQ 618

Query: 630 RKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTY 689
           RK  EGAFWVLQ+LK++  +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y
Sbjct: 619 RKQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAY 678

Query: 690 KVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICK 749
           +VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L  ++KIC+
Sbjct: 679 RVLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMLKKICR 738

Query: 750 VANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEA 809
           VANKPLVVTYTGLIQAC+DS ++++A YIF+ MK  CSPNLVT NI+LK YL+ G+FEEA
Sbjct: 739 VANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKAYLQGGLFEEA 798

Query: 810 RELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYH 869
           RELFQ +SE   +I   SD+  RVLPD Y FNTMLD    +++WDDF Y Y +M  +GYH
Sbjct: 799 RELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYAYREMLRHGYH 858

Query: 870 FNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSS 895
           FN KRHLRM+LEA+R GK+E++E TW+H+ +++R PP PL+KERF  KL +GD+  A+SS
Sbjct: 859 FNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLEKGDHISAISS 918

BLAST of CSPI01G25380 vs. TAIR 10
Match: AT1G30610.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 726.5 bits (1874), Expect = 2.7e-209
Identity = 444/999 (44.44%), Positives = 595/999 (59.56%), Query Frame = 0

Query: 30   SHNSFWVSFFPSSVPGTDLSLSDAKNRVLRHRVHKCGS--IKALSNGESDISLPSGNLLE 89
            S NSFW   F                RV+R    K  S  +  L+    ++ L      +
Sbjct: 19   SRNSFWRPLFHQPYYNC--------RRVVRLNSRKLNSKVMFCLNLNTKEVGLQKPG--D 78

Query: 90   HDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKLTMKEN----GSAKSAESTSISKIDNG 149
              F+FKPSFD+Y+++ME+V+T R K++ D   +L ++E+    G+  S       KI +G
Sbjct: 79   KGFEFKPSFDQYLQIMESVKTARKKKKFD---RLKVEEDDGGGGNGDSVYEVKDMKIKSG 138

Query: 150  KNK-------------VTD------VQHNVDVKNMFKRVDKKDLFNNTERIAREKDLSG- 209
            + K             V+D       + N +++N     D K   +    +A +   SG 
Sbjct: 139  ELKDETFRKRYSRQEIVSDKRNERVFKRNGEIENHRVATDLKWSKSGESSVALKLSKSGE 198

Query: 210  -----------NKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNE----------- 269
                        K   ++   RS+D  +G          D   EE+R +           
Sbjct: 199  SSVTVPEDESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLVVEERRVQRIAKDARWSKS 258

Query: 270  -------NWSSYIEPRVTRSNSE-------KPIHF-------------KANMLEVKKESS 329
                    WS+  E  VT    E       K  H              K + LE+  E  
Sbjct: 259  RESSVAVKWSNSGESSVTMPKDESFRRRYSKQEHHRSSDTSRGIARGSKGDELELVVEER 318

Query: 330  RV----SDGNSMKTSEKIWAWGDDD-----------------AKPPKGVLKAGK-YGIQL 389
            RV     D    K+ E +    +D+                 +   +G+ +  K  G+ L
Sbjct: 319  RVQRIAKDVRWSKSDESLVPVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGDGLDL 378

Query: 390  ERSYNPGDKVGRKKTE---QSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFD-AFDIMDK 449
                   +++  ++ E       GT   G +  + ++ +   +E  AF   D + DI+DK
Sbjct: 379  LAEERRIERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDK 438

Query: 450  PRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNW 509
            P  S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNW
Sbjct: 439  PATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNW 498

Query: 510  RRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVA 569
            RRVLQ+IEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VA
Sbjct: 499  RRVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVA 558

Query: 570  YHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVK 629
            Y SIAVTLGQAG+++ELF VID+M+SPPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+
Sbjct: 559  YRSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQ 618

Query: 630  RKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTY 689
            RK  EGAFWVLQ+LK++  +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y
Sbjct: 619  RKQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAY 678

Query: 690  KVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEAL-------- 749
            +VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L        
Sbjct: 679  RVLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNP 738

Query: 750  --------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNH 809
                                 Q++KIC+VANKPLVVTYTGLIQAC+DS ++++A YIF+ 
Sbjct: 739  VVLKLIENLIYKADLVHTIQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQ 798

Query: 810  MKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFN 869
            MK  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE   +I   SD+  RVLPD Y FN
Sbjct: 799  MKKVCSPNLVTCNIMLKAYLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFN 858

Query: 870  TMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQA 895
            TMLD    +++WDDF Y Y +M  +GYHFN KRHLRM+LEA+R GK+E++E TW+H+ ++
Sbjct: 859  TMLDTCAEQEKWDDFGYAYREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRS 918

BLAST of CSPI01G25380 vs. TAIR 10
Match: AT5G67570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 387.5 bits (994), Expect = 3.0e-107
Identity = 207/546 (37.91%), Positives = 327/546 (59.89%), Query Frame = 0

Query: 358 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEW 417
           E +++L  RL+G +I+   W F +MM  + +++++  +L+++  LG+  +W++   ++ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 418 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 477
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 478 QAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 537
           QAG ++EL  VI+ M+  P K  K    + WDP L+PD+V+YNA+LNACV     +   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 538 VLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 597
           V  EL+K  L+P+ +TYGL MEVMLE GK++ VH+FFRK++ S   P A+TYKVLV  LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422

Query: 598 KEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 657
           +EGK +EAV A+ +ME +G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482

Query: 658 VTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNL 717
           +T+TGLI A L+   +   + IF +MK  C PN+ T N++LK Y  + MF EA+ELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542

Query: 718 SEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHL 777
                    VS     ++P+ Y ++ ML+AS    +W+ F + Y  M L GY  +  +H 
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602

Query: 778 RMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSG 837
            M++EA+R GK  LLE  +  + +    P P    E  C   A+GD+  A++ I +  + 
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLEDGEIPHPLFFTELLCHATAKGDFQRAITLI-NTVAL 662

Query: 838 DAHHFSESAWLNLLKEKR--FPKDTVIELIHKVS-MVLTRNESPNPVFKNLLLSCKEFCR 897
            +   SE  W +L +E +    +D     +HK+S  ++  +    P   NL  S K  C 
Sbjct: 663 ASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRCG 714

Query: 898 TRISLA 899
           +  S A
Sbjct: 723 SSSSSA 714

BLAST of CSPI01G25380 vs. TAIR 10
Match: AT5G16640.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 108.6 bits (270), Expect = 2.7e-23
Identity = 77/314 (24.52%), Positives = 145/314 (46.18%), Query Frame = 0

Query: 432 IYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDS 491
           IY T +D L K+++   AL++ + M++     PD+V Y+S+   L  +G   +   ++  
Sbjct: 188 IYNTIIDGLCKSKQVDNALDLLNRMEKD-GIGPDVVTYNSLISGLCSSGRWSDATRMVSC 247

Query: 492 MQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPST 551
           M                   + PD+  +NA+++ACVK   +  A    +E+ ++SL P  
Sbjct: 248 MTK---------------REIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDI 307

Query: 552 STYGLVMEVMLECGKYNLVHEFFR-KVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIEN 611
            TY L++  +    + +   E F   V K   P+ +TY +L+N   K  K +  +     
Sbjct: 308 VTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCE 367

Query: 612 MEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKD 671
           M  RG+V +   Y    +  C AG+   A     ++      P ++TY  L+    D+  
Sbjct: 368 MSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGK 427

Query: 672 LQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYR 731
           ++ A+ I   M K     ++VTYNI+++G  + G   +A +++ +L+ Q           
Sbjct: 428 IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLNCQ----------- 473

Query: 732 DRVLPDIYMFNTML 744
             ++PDI+ + TM+
Sbjct: 488 -GLMPDIWTYTTMM 473

BLAST of CSPI01G25380 vs. TAIR 10
Match: AT2G17140.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 107.5 bits (267), Expect = 5.9e-23
Identity = 57/214 (26.64%), Positives = 109/214 (50.93%), Query Frame = 0

Query: 510 PRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNL 569
           P  +P + +YN +L +C+K + +E   W+ +++    + P T T+ L++  + +    + 
Sbjct: 106 PENKPSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDA 165

Query: 570 VHEFFRKV-QKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFAR 629
             E F ++ +K   PN  T+ +LV    K G TD+ +  +  ME  G++ +  +Y     
Sbjct: 166 ARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVS 225

Query: 630 CLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMK-----A 689
             C  GR  ++   +EK+ +    P +VT+   I A      +  A  IF+ M+      
Sbjct: 226 SFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLG 285

Query: 690 FCSPNLVTYNILLKGYLEHGMFEEARELFQNLSE 718
              PN +TYN++LKG+ + G+ E+A+ LF+++ E
Sbjct: 286 LPRPNSITYNLMLKGFCKVGLLEDAKTLFESIRE 319

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9SA76 | 3.8e-208 | 44.44 | Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidop...
Q9FJW6 | 4.2e-106 | 37.91 | Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidop...
Q9FMD3 | 3.7e-22 | 24.52 | Pentatricopeptide repeat-containing protein At5g16640, mitochondrial OS=Arabidop...
Q0WPZ6 | 8.3e-22 | 26.64 | Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX...
Q9SR00 | 7.0e-21 | 24.85 | Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
A0A0A0LVN7 | 0.0e+00 | 99.01 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1
A0A1S3C8Z0 | 0.0e+00 | 93.84 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis ...
A0A5D3CBM0 | 0.0e+00 | 92.81 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1EH18 | 0.0e+00 | 82.91 | LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chlo...
A0A6J1KEH7 | 0.0e+00 | 82.91 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbit...

Match Name | E-value | Identity | Description
XP_031741862.1 | 0.0e+00 | 99.01 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sa...
XP_008459122.1 | 0.0e+00 | 93.84 | PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic ...
KAA0047051.1 | 0.0e+00 | 92.81 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK09387...
XP_038894404.1 | 0.0e+00 | 87.43 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 ...
KAG7019446.1 | 0.0e+00 | 83.24 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name | E-value | Identity | Description
AT1G30610.2 | 5.2e-213 | 45.62 | pentatricopeptide (PPR) repeat-containing protein
AT1G30610.1 | 2.7e-209 | 44.44 | pentatricopeptide (PPR) repeat-containing protein
AT5G67570.1 | 3.0e-107 | 37.91 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G16640.1 | 2.7e-23 | 24.52 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G17140.1 | 5.9e-23 | 26.64 | Pentatricopeptide repeat (PPR) superfamily protein
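
As a rough illustration of how these summary rows might be screened, the sketch below ranks the five TAIR 10 hits by E-value and keeps those under a cutoff. The accession, E-value, and identity values are copied from the rows above; the function name `filter_hits` and the `1e-50` cutoff are assumptions made for the example, not thresholds used by this database.

```python
# (accession, E-value as printed, percent identity) from the TAIR 10 rows above.
TAIR_HITS = [
    ("AT1G30610.2", "5.2e-213", 45.62),
    ("AT1G30610.1", "2.7e-209", 44.44),
    ("AT5G67570.1", "3.0e-107", 37.91),
    ("AT5G16640.1", "2.7e-23", 24.52),
    ("AT2G17140.1", "5.9e-23", 26.64),
]


def filter_hits(hits, max_evalue=1e-50):
    """Keep hits at or below the (hypothetical) E-value cutoff, strongest first."""
    parsed = [(acc, float(evalue), ident) for acc, evalue, ident in hits]
    kept = [hit for hit in parsed if hit[1] <= max_evalue]
    # A smaller E-value means a less likely chance match, so sort ascending.
    return sorted(kept, key=lambda hit: hit[1])


strong_hits = filter_hits(TAIR_HITS)
for accession, evalue, identity in strong_hits:
    print(accession, evalue, identity)
```

With this cutoff only the two AT1G30610 isoforms and AT5G67570.1 remain; the two hits near 1e-23 are dropped.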
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 353..373
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 124..144
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 116..144
None | No IPR available | PANTHER | PTHR46935:SF1 | OS01G0674700 PROTEIN | coord: 4..896
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 525..720
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 514..558, e-value: 2.4E-9, score: 37.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 656..683, e-value: 0.0022, score: 16.1; coord: 690..718, e-value: 1.2E-6, score: 26.4; coord: 517..550, e-value: 2.3E-4, score: 19.1; coord: 432..465, e-value: 2.6E-4, score: 19.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 654..699, e-value: 9.4E-10, score: 38.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 432..458, e-value: 0.018, score: 15.3
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 515..549, score: 10.972319
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 584..618, score: 8.681407
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 688..722, score: 10.785976
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 509..650, e-value: 7.7E-22, score: 80.0; coord: 651..797, e-value: 5.5E-21, score: 77.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 358..505, e-value: 2.1E-14, score: 55.3
IPR044645 | Pentatricopeptide repeat-containing protein DG1/EMB2279-like | PANTHER | PTHR46935 | OS01G0674700 PROTEIN | coord: 4..896
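
The `coord` ranges above can be turned into motif lengths with simple arithmetic. The sketch below does this for the three PROSITE PS51375 (PPR) hits, assuming the coordinates are 1-based and inclusive at both ends; that convention and the helper name `span_length` are assumptions of this example, not something stated on the page.

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table above.
PPR_HITS = [(515, 549), (584, 618), (688, 722)]


def span_length(start, end):
    """Length of a range, assuming 1-based coordinates inclusive at both ends."""
    return end - start + 1


lengths = [span_length(start, end) for start, end in PPR_HITS]

# Residues lying between consecutive hits (none of these hits overlap).
gaps = [
    nxt_start - prev_end - 1
    for (_, prev_end), (nxt_start, _) in zip(PPR_HITS, PPR_HITS[1:])
]

print(lengths, gaps)
```

Under that assumption each PROSITE hit spans 35 residues, which is consistent with the 35-residue motif the pentatricopeptide repeat is named for.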

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI01G25380.1 | CSPI01G25380.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009658 | chloroplast organization
molecular_function | GO:0005515 | protein binding