CSPI01G25380.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI01G25380.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr1 : 20852771 .. 20863514 (+)
Sequence length: 2724
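
The location and length fields above can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, the coordinates are assumed to be 1-based and inclusive, and the reported sequence length is assumed to refer to the coding sequence.

```python
import re

# Values copied from the record above.
location = "Chr1 : 20852771 .. 20863514 (+)"
sequence_length = 2724  # reported "Sequence length"


def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its parts.

    Hypothetical helper for illustration; the field layout is inferred
    from this single record.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location(location)

# Genomic span covered by the feature (assumes inclusive coordinates).
genomic_span = end - start + 1

# Number of codons, including the stop codon (assumes the length is the CDS length).
codons, remainder = divmod(sequence_length, 3)

print(chrom, strand, genomic_span, codons, remainder)
```

Under these assumptions the feature spans 10744 bp on the plus strand of Chr1, and a 2724-nt coding sequence corresponds to 908 codons, that is, 907 amino acids plus a stop codon.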
The following sequences are available for this feature:

Gene sequence (with intron)

CCAATATTTATAAATATCAGGCTTTAATATTTAAACAAAATGGAGTTCATCAAAGTCTATCATTGATTGATACAAGTGTAGTAGATGCATTTTGTTATTGTTTTACATAAAGTGAAAATGCATATAGTTAATTGTGTTTAGGCAAAATATTAGATTTAATAGTGTTGAGTAGACATAGATAAAGATCAATAGTAGGGAGGATACTATTTATCCAAATCCTCTTTTCTCTTTCCCCTCTCCGTCGGAGTCCCGTCGTTAGCCGCTGAAGGTCTACAAGCTCTCATCTACAATAGCGGCACGGAATTGCAGACGTCAGCTATCAACTCATTTTTTTCACGCATCTTTTGAACTTATAAATTCTCTCTCTACCCACCACCGCCCATCGCCGGAGAAGGTGTGAACGGTTCTATATGTAATCACAGCACGGAATTGAAGAGAGGTTAACCATTATCTTAGCTTTCGGCGGAGAACTTGGTCGGTGCAACTTGCTAAAATGCGATCTGACCCGCAAATAGAACGGGCACAGCCGCGAATAATGGTTTAATCGTATCGCGATATTCAGGGCCAGGTTGCTCCGCCGTCATTTTGAAAAATACTTCAATTGGTGACCTACACAGAACAGTTAGTTTGTTTTTTCTTCTCTCTTTTACTTTTTGTGTGGAGATATTTACTCTGTGTTTCTACTTCGTGTCTCAGACTTCTTTTGGATTATATTGGATTTATTCATGAAATTGGAGTTATTTTGAAATGGTGGGAGTTATAATGGCGAACCTAAATTTGTGCATCCCTAATTGTGAAAGATATGGATTTCCGACACTGCATTGTACCCATAATTCCCACAATTCTTTTTGGGTTTCGTTCTTTCCTAGTTCGGTTCCTGGAACTGACTTAAGTCTTAGTGACGCGAAGAATAGAGTTTTGAGACATAGGGTTCATAAATGTGGATCAATTAAGGCTTTGTCGAATGGAGAATCTGATATTTCATTGCCAAGTGGGAATCTCCTCGAACATGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAAGGTCATGGAGACTGTTAGAACTAGAAGGTATAAGAGGCAGTTGGATGATCCTAATAAACTGACAATGAAGGAAAATGGGAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAGATAGATAATGGAAAAAACAAAGTAACTGATGTTCAACATAACGTGGACGTAAAGAACATGTTTAAACGGGTGGATAAAAAAGATTTGTTCAATAATACAGAGAGAATTGCTCGTGAAAAGGATTTGTCAGGAAATAAATTTGATAGAAGGAAGGTAGTTACAAGATCAAATGATAAGGTTAAAGGCAAGATGACCCCTTTTGGCTCACTGGTTAATGATAAACAGCATGAAGAGAAAAGGAACGAAAACTGGTCAAGTTACATTGAGCCTAGAGTAACACGATCGAACAGCGAGAAACCAATTCATTTTAAAGCTAATATGTTGGAGGTCAAAAAAGAAAGCAGCCGTGTCTCTGATGGAAATTCCATGAAAACATCAGAAAAGATTTGGGCTTGGGGTGATGATGACGCTAAACCACCTAAGGGTGTTCTTAAGGCTGGGAAATATGGCATTCAGCTCGAAAGAAGCTATAATCCTGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGACATCCACAAGTGGTAAGCGTTTTCTTGAATTTAATGAAAAGAATAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTCGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGGTTTGGTGTCCCTTGCTCACTTGTCTGTCTTGCTGAAAGTTTAAATTTGACTTGGTACAAGTCTGAAATCAAAACTATTACCTTATTCGATACTTCTTCTTTGTTTGGGCGGGGGTCTGTGAAATTAATATTCTTAATTTGTTAAATTTGAATTTC
TTAACAACATTTCTACTAGGATTAGTATGGGTGGACAAGGTTGTGTACTAGTTTTTTAATGGTTTTAAATATTTCCTTTTCTTGGCCAGGATATTTTATTGTTAAGTTCTTCAGGGTTGAGTTCAAGTATTTCTGGGGCCCGATTTTTAGTGTTATGTTTATTGAGCTTCAAGTCGTTTCTTTCTAAGGAAATTTTTAATTTAGTTTGAAGAGTATTAACTTCAGCTGCACCACTTATTCAATTTCAGACAACAGAGTCCAATCTAAGAACTTTGAAGTCTACTGAGCTAAATATTTGAGTGGTAAATGTGAAAGTTAGAAATCTACTTGCATTTGGTTATAACTCAACTCCATCGAGATAATATATGTTATTGGGTTCAAATGATAGTCATGAACTGTTTGCTTGCCTGCTGTTTTATGTTTTCTTTTTATCACTCACACATGTATTGCAGGCTCTTGAGTTATTCATTGTATGGAGACTTCTTTCATAGTCATCTCCAACATCTTTTAGGATTTGAAATTTAATTACTAATCTGTACTCAACTGTGCTATGAGGGGATTCCAAGAACACTAATCAAATTTCAATTATTTGAACTTTGAGGATTATTAATTAGTCTCTCCATATTGATTTATTTGAAGGACTTCCTTGTCAACAGATTGAATGGTGCAGACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAATCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGGTTTTTCCCTGTTTCTCACCTTTACTACTTGATTAATGTAGTGAAAGTTCTAATAAAGTCCTTGGTTATGATGCCTTAAAAGAGTAAGAGTGGCCAGTGATGCCTTTCCTCGGTCTTGAAGTGCAATTTAGATTTTCTTTTTTATATTTGTAGTGTCTCAACCAACTTACATGCATTTTGGCTAATTTCACGAGACAATCCACCCGATCCTACAATATTTTGGTGTCAAGAAAACTCATAGAACATTAATTCTTTGGTAGGTGACCACTATATAATAAATCCACGACCTATTAGTTAGTTATTGAGACAATGTCTCCTTTTTACCACTAGGCCAACCTACGAGGGAGGATCAGTGTTTTGAACGGCGCACTTGGGCACACGCCTAAGTGCAAGGCTCAACGGTGGTGCCTCGCCTCAGAAAGTTGAGGTGCATGAAATAAGGCGCACGCCTTTCGGTGAATCACTTAAAATGTAAACTATTTTGCATTTTAGGGTTTCTGTTTGCCCATTTATTGTAAATATGTTTATACATATATAATTTTTAAACCTAATTTGGCATAATTTCTTAAAAAAATAGAACGTCTTTTCCTCTCCTTTCTCTCACCTTCTTCATTCTTCACTGTATCACCTTCTTCCTCTTCGTCTCCAATCTTCACTGAATCATCATTTACAATGGATATCAAAGATTTAAAAGACCATCAAAATTATCTGGCACTTGATTCTTCTCCAAAATACAAAGGAATGCCAACATTCTTCCTGACCTCTTGGTCCTCCCCTCTTCAATTCTTCTAACTAAATCTTTTTTTTTTTTTGTATAATCTTCTTTAAGTCCCATTTCTATCCATCTAAGTTTTGGTTGTGTAAATAAACATGTAAGTTTATAATGCCTAGGCTCCAGAGCCATTGCGCCTTGGGCATTTTAAGACACTGGGGAGGATAAAAAGGAAAGATAAAAGAGGAGAGAGAGGGGGGATATGGCCATCAGATTTTAAATTTAATCTCTTTTAGGAATATAAAGAAAGTGCTAAAGTATTGAATAATAAATTAAATTTAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCAACCCCCACACACCACACGATTAGGTACATTGCTGATTTAACATGGTATTAGAGCAAGAGGTCCCACTGTTCAAACTCCTAAAATATCATTTTTTCTTCAATTAATACTAATTTCTAATTTTCAAGCCCACAAGTGAAGGTGAGTATTAAAGTATTAATATAACTAAATTTATTTAAACCCATCAACTTAAGCTTTTTGAGTTTGTTGGGAATTTGTTAGATACCTAGATTAGTATACATGGTTTATCTTGTATAAGGGTAATTAGATTAGTGGGTGTAAGGGTAATTAGATACTTAGGAAGTTACTAGTAGTTATTGTGTAAGTGTGTTTACTAGTAGTTATTATTTAAGTGTGATTACTGGGGTTGTTACATCTTGTTATAAATGGAGGGAGGGTAAGTGAGAGGGACGTTACGGTGGAGTGATTTGGGGTTTGGGTGAGAGTACTCAAGAGGGAGGTTCTAGGTGCCTTATACTTGGGTTTATCTTGTATCTTCTTATAGTTCATTATAATAAATTAAGATCTTGTTAACAAGTATTCTTACAGAATTTAACAAAAAATCTTCATCCTAAGTTCCTAACATTTTCTAAAGAATCTTGCACCCACTAAAAAAAATTAATTAGAGAGAATTGAACCTTAATCTAATTATTTGTCCTCATCATTAATCAAATGAACTGGAAAGCTGCGATAATGCTGAAAAATAACTCAAATATAATAAAAACATCAGAAATCTTAAAATGGAAAAGATATTCACTTTCTTGCATTGCCCTGTCACCAAGTATGTTCTTTGACTAACAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGTAGGCAGTTAACTAACAACCTTCACTAGGATTTGCTGTCTTGGTCTGCGCACGTTTATGCAAGTGGATGCTGAAATTGACAGAATTTTTACTAACTCTGGTTGTGCCATCTCGGTTTCCCTTCAGGAACACTTTTCCTCATATCCTGACTTGGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCAATCTCCTCCAAAGAAGAAGTTTAAAACAGGGGTTCTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTGAGTCATAATTGGTAGATTATTTTATATTTTACAGGCAATATTTGGTTAATGTAATATACTGAGCAATTTTGTGGAAAAAAATGTTAAAGAGCAAAATAATCTTGTTATGGATATTTTCTGATGTTTTGCTCGTAATATATTTATTCTTAGATCTGAGAATTCAAATATGACAATCATCTAGATTTTTGTGAATGTTTTGTTATCATTGTTCCATGATTGACGAAAGATTTAGATTTTGTACACATGGTTGGATTTCATTATTTTTCTTTTCCTTTTCCTTTTGGATAAAAGGCACCACTTCTAATTTGACAAATGAAAGGAATATAAAAAGAGGCATAGGAAAAAACTAGGCTCTACAACAAAATTGGGCAACCAAAAACGAGAGAAACACACATTAAGAACACTAATTC
ATCAGAAAAAGGAGAAAAATCCGAAAACAAATTGCAAAAACAACCTAATAGGGTCGAGTGAAGCTGAGAGTTAGTGGCAATTTTTTTAAAAAAATTTGATTAATTAATTAATTAATATTATTATTGGTTTCTTCTGGTAGGAAATGTTGACATTTTTTTAGATGATATGAAATTACAGAGTTGATTCTTCAAAGCATTGTTTATTTTCTTTACGTTTTCACTTTCCATCTAACATACTGTTGCTATTTTCCTTTCTATTTTTTTCATAAAATTTTAAATTACTCAGTCATTAAGGCAAGTCTCCAGCATTGCTTCCTCCATTTGTTGCAAGGATATGTTTTTCTTCAAAACTTTTCTCATGTTTATTCATTATTACTATGAGATGCCAATAGAAACAACGTGATTTCCGAATTCTTTATGGAAATATATCTTGCATGGGTTTCTTTCAAGTTGTTCACTTGCAGTAGCCATTTTATTTCTTTCTTTCAGGTTTTAAATGCTTGTGTCAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAAGTCTACAGCCTTCAACCTCAACATATGGATTGGTCATGGAGGTAGTTGGTCCTTTAATTTCTTTCTATTGTTCATGTAATTTGCAAGTCTATTTTGAAATTCAGAATGATTTTTCTAATGCTCGTTGCTTGAGATGCTGTGAATGAGAAATGCTTGATTTTGTGCACCCAAGAAGGAATGAACACTCATTAAGTTGCGTTGAATTCACTCAGTAACGTGTTAAATATTATTCTTACCTTTTGATTAGTAGAAAGTAAATGGTTTTGGGTATTGCATGGACACAGAGTGGACATATTGGGTTCTGAGTCTTTGAAGAGAATTAGTCGTATTTGTAAAAGGTAGATTCTCTACCAAGTCCATGTTCCTAAACTTCACTAAGGAAGTTGCAAAAAATAATTATAGACAACCTTATGTGGTATTGGAAGCTTAAAATTTCCAAGAAAGTAAAGATTTTCCTTTGGTCACTTGCTTATAGAAGTCTAAATATTCACAAGAAGCTTCAGAGAACGTTCCCTAATTGGTCCCTCTCCTCCATTTGCTGTCTTTTTCTTAGGGAGATGGAAACTATAGGTCACTTGTTCTTGCATTCTGAGTTTACTTTTAGAGGTTGGCAAATTCTCTTTAGTACTTTTGTGGTGGCTAGTTGCCTTCCTAAAAAAATCGATGATTGAATGATGGAAGTTTCTGCAGAAAAGGGAAAATCCTTCGGAGAAGTGCTACTCAAGTGCTTTTGTGGTTTCTTTGGAAAAAAAAAGAGAATAATACATTGTTTGACGATAATTTTGTTTCTTTTGATTTTTTTGGGCTTTTGTTCAACGTGCAACCTCTTGGTGATGTTCAAGCTACACTAATTTTTTTTTTCCAAAGCATTTCTATAACTATTTGCTATGAAACATTAATCTTAGTTGGATCCCCTTCCTTTAGTGGGGCTCTTTTTTTGTTGCGCTTGTTATTTTTTTCTATTTCCTTGTATTCTTTCATTTTTCTCAATTAAAGTTGTTTCTATTAAAAAAAAAAAAGAAGTTAGACCAAATTCTTTTTTAATTATAGCCTTCTCATGATTATGAGCAATGGAGGGCTTTTCTCTATAGTTTTGTGGAGAGGTTTTCTCTACCCCTGCCTCTAGGCTGTTCTGGTAGCTCTTTTGATGAATATATATCTGTTTCTTATAAACAAAAAAGTCCTAACAGTGAATGGTAAGCCTAGAAAGAAAGAAGAAGATCTAGACCATCTTTTCAAGACATTGTTCAATAGGTATAACACCAATCTTTCGTGTTGTTTGCCTTATCCTTTCTCAATCAATCGCATATGAGTAAGTGCGTGCTTGGACTTGGTCAAGGATACATTTTATGGTTAGATTATGTAATCACTTATCACAAGCAGCTTTTGAACGATTACTTGCGTCTATCCTTCATAAAATACAAGTCACTTGATTGAC
GGTTATTCATGTGCACTGAAAATTATTGTTCACAAGTTCATGATTTTTTAAAAGAAATTTTAGTTCATGCACTGTTGAGAATGAATTAAGGTAACCCAATTTCTAAAACCTGTTATTTAATTATTTTCCCAAGTTCCTCTTTTCCCTGTGAAGCAAAAAAAGTTCGTTCTTTAAGTCGACTCTTAAACTGCCCCTAGATTTTCAATTTAGGTGAAGAAAAATAATTTGATCGAATGTTCTTCTACTATTAAAATTTGAAAGTCATGCTTCCCTGACTTAATTTATTTGATTGTGCTGAAGTTACATTTTCATATGCAAGTAAGCCTATTGAAATTTTCTATTTACTGAATGAAACAGTTGAAATATTCTATTTACTGAACGAAACATTTTTCAATGAATGATAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAATTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGGTAGACTCGGTAGTCACAGTGTGTTTATTTGTTTCTTGTTATATATTTGCTTAATGGCTTGTCAAATTTCCAGTTCTTGTTAATACACTTTGGAAAGAAGGAAAAACAGATGAGGCTGTGCTGGCCATTGAGAACATGGAAATACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAGGCCCTGATGCAGGTATTTCATAGTAAATTTTTGTTTTTTCTTTCAGCCTTTTATTTGTTTTTCCCTTTCTCAACCAACTTATATTTTATTTTTTCTCTTATGTGGATATTTTATATATTACTATTGGTTATTTCCATGAGTTGGTTCAAAATGTTATCGCCTGAGTTTGAGCTCTATTTCCTAAGATGGGAGAAGTAGAAGTTGTTTATTATTTAAGAAGAAATTAGTCGACCTGACTCAATATGATAAGTCAAACAACATAGGATTTGGTTTCTCTTTATCACTCGCTCTTTTTGTTTAAAAAATACTGAATTTGAGGTTAACTAATCACTATATCTTAAAAATTCTCAAAACCTTTGAGTCTTTCTTAATCATGCCTACAATTTCATGATGAGCTTTATTGCCCCATAGTAATGTTTCTGCCTCAGACATCTCTTTCTGAGTGAATTTCTGGCTCCTTAATATTCTTGGCCTCTCCTTTGGGAGGTCAAATTTACTTCATAGTTCAAACTGTCTGGTTGCACATCCTGATTGCAGTCTGTTTACACGAATGATAATTAATGAATAGACCTCATGACGACCTTGTTATTGTTTACAGATGGAGAAGATATGTAAAGTTGCTAACAAGCCTCTTGTAGTAACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATTTTCAACCACATGAAGGCCTTTTGCTCACCGAATCTTGTTACTTATAATATATTGTTGAAGGGTTACTTGGAACATGGAATGTTTGAAGAGGCTAGAGAGCTGTTTCAGAATTTGTCAGAGCAAAGACGAAATATCAGCACTGTATCTGACTACAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCAGAAAAAAGATGGGATGATTTTAGCTATTTCTATAACCAGATGTTTCTTTATGGTTATCATTTCAATCCAAAACGTCATTTGAGGATGATATTGGAGGCTGCTAGGGGTGGAAAGGTGGACCTTTTAAATTCAACTTCTTTTTCTTTCCTTGGTTCTCCTCCTTCCTTTCTACGTTTCTATATCTTTTTATATTGCACCACTAGCCTATTTTAAACGTGTATAGTTTAAGTTGGCTCTAGGAATGTTGCTTAATGCTTATAATTGACGATAGTTTTAAATATGCACATGCTAAAGAGATGTAATGATAATAATAGATAATATCATGAAACTATTGGAGGCTGAATTCGACCATTTTCCTATGACGTTTGAC
TTCTTAGGTGTATGACATTGATAATTTGTTGAAGTATTGAGAAAACAAAACGCTAAATCAATTTATTTTTTAATTGATCATGAAATTCAAGATTAAATTGTTCCTTGGCTATTAAGGGGTTCATAACTTCACATTCATGCTTAGAATTCGAGTTCTGCTTATACTAAGAAAAGAAACTATATTCCTTTATGATCTGTGGATGATTTGGTCTTAATTTCCTGCTCTACAAAAGGAATAAACTAAGATAAGTTGATCGCTTAGACTCATTATGGTTTTGCTTTCCTTTAACCATCGACTTCTGTTGTACAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTCAGGCTGACCGTACTCCACCACCACCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGATTACTCTGAAGCGCTCTCTTCCATTTGGAGTCACAATAGTGGTGATGCACATCATTTCTCTGAGTCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAAGATACTGTTATTGAGCTAATTCATAAGGTTAGCATGGTTCTTACTAGAAATGAATCACCAAATCCAGTGTTTAAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTTTAGCTGACCATAGACTTGAAGAAACTGTTTATTAAAATGAAATCTAACCTGCTGCTATCACATATCTATCTATCTATCTATATATATTTAGTATAATTTGAGAG

mRNA sequence

ATGGTGGGAGTTATAATGGCGAACCTAAATTTGTGCATCCCTAATTGTGAAAGATATGGATTTCCGACACTGCATTGTACCCATAATTCCCACAATTCTTTTTGGGTTTCGTTCTTTCCTAGTTCGGTTCCTGGAACTGACTTAAGTCTTAGTGACGCGAAGAATAGAGTTTTGAGACATAGGGTTCATAAATGTGGATCAATTAAGGCTTTGTCGAATGGAGAATCTGATATTTCATTGCCAAGTGGGAATCTCCTCGAACATGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAAGGTCATGGAGACTGTTAGAACTAGAAGGTATAAGAGGCAGTTGGATGATCCTAATAAACTGACAATGAAGGAAAATGGGAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAGATAGATAATGGAAAAAACAAAGTAACTGATGTTCAACATAACGTGGACGTAAAGAACATGTTTAAACGGGTGGATAAAAAAGATTTGTTCAATAATACAGAGAGAATTGCTCGTGAAAAGGATTTGTCAGGAAATAAATTTGATAGAAGGAAGGTAGTTACAAGATCAAATGATAAGGTTAAAGGCAAGATGACCCCTTTTGGCTCACTGGTTAATGATAAACAGCATGAAGAGAAAAGGAACGAAAACTGGTCAAGTTACATTGAGCCTAGAGTAACACGATCGAACAGCGAGAAACCAATTCATTTTAAAGCTAATATGTTGGAGGTCAAAAAAGAAAGCAGCCGTGTCTCTGATGGAAATTCCATGAAAACATCAGAAAAGATTTGGGCTTGGGGTGATGATGACGCTAAACCACCTAAGGGTGTTCTTAAGGCTGGGAAATATGGCATTCAGCTCGAAAGAAGCTATAATCCTGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGACATCCACAAGTGGTAAGCGTTTTCTTGAATTTAATGAAAAGAATAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTCGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTGCAGACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAATCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGAACACTTTTCCTCATATCCTGACTTGGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCAATCTCCTCCAAAGAAGAAGTTTAAAACAGGGGTTCTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTCAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAAGTCTACAGCCTTCAACCTCAACATATGGATTGGTCATGGAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAATTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGTTCTTGTTAATACACTTTGGAAAGAAGGAAAAACAGATGAGGCTGTGCTGGCCATTGAGAACATGGAAATACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAGGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAACAAGCCTCTTGTAGTAACTTACACCGGTTTGATTCAAGCTTGTTTGGA
CTCAAAAGACTTGCAAAGTGCAGTCTATATTTTCAACCACATGAAGGCCTTTTGCTCACCGAATCTTGTTACTTATAATATATTGTTGAAGGGTTACTTGGAACATGGAATGTTTGAAGAGGCTAGAGAGCTGTTTCAGAATTTGTCAGAGCAAAGACGAAATATCAGCACTGTATCTGACTACAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCAGAAAAAAGATGGGATGATTTTAGCTATTTCTATAACCAGATGTTTCTTTATGGTTATCATTTCAATCCAAAACGTCATTTGAGGATGATATTGGAGGCTGCTAGGGGTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTCAGGCTGACCGTACTCCACCACCACCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGATTACTCTGAAGCGCTCTCTTCCATTTGGAGTCACAATAGTGGTGATGCACATCATTTCTCTGAGTCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAAGATACTGTTATTGAGCTAATTCATAAGGTTAGCATGGTTCTTACTAGAAATGAATCACCAAATCCAGTGTTTAAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTTTAGCTGACCATAGACTTGAAGAAACTGTTTATTAA

Coding sequence (CDS)

ATGGTGGGAGTTATAATGGCGAACCTAAATTTGTGCATCCCTAATTGTGAAAGATATGGATTTCCGACACTGCATTGTACCCATAATTCCCACAATTCTTTTTGGGTTTCGTTCTTTCCTAGTTCGGTTCCTGGAACTGACTTAAGTCTTAGTGACGCGAAGAATAGAGTTTTGAGACATAGGGTTCATAAATGTGGATCAATTAAGGCTTTGTCGAATGGAGAATCTGATATTTCATTGCCAAGTGGGAATCTCCTCGAACATGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAAGGTCATGGAGACTGTTAGAACTAGAAGGTATAAGAGGCAGTTGGATGATCCTAATAAACTGACAATGAAGGAAAATGGGAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAGATAGATAATGGAAAAAACAAAGTAACTGATGTTCAACATAACGTGGACGTAAAGAACATGTTTAAACGGGTGGATAAAAAAGATTTGTTCAATAATACAGAGAGAATTGCTCGTGAAAAGGATTTGTCAGGAAATAAATTTGATAGAAGGAAGGTAGTTACAAGATCAAATGATAAGGTTAAAGGCAAGATGACCCCTTTTGGCTCACTGGTTAATGATAAACAGCATGAAGAGAAAAGGAACGAAAACTGGTCAAGTTACATTGAGCCTAGAGTAACACGATCGAACAGCGAGAAACCAATTCATTTTAAAGCTAATATGTTGGAGGTCAAAAAAGAAAGCAGCCGTGTCTCTGATGGAAATTCCATGAAAACATCAGAAAAGATTTGGGCTTGGGGTGATGATGACGCTAAACCACCTAAGGGTGTTCTTAAGGCTGGGAAATATGGCATTCAGCTCGAAAGAAGCTATAATCCTGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGACATCCACAAGTGGTAAGCGTTTTCTTGAATTTAATGAAAAGAATAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTCGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTGCAGACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAATCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGAACACTTTTCCTCATATCCTGACTTGGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCAATCTCCTCCAAAGAAGAAGTTTAAAACAGGGGTTCTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTCAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAAGTCTACAGCCTTCAACCTCAACATATGGATTGGTCATGGAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAATTCTTCAGAAAAGTGCAGAAATCTTCCATTCCTAATGCTTTAACATATAAAGTTCTTGTTAATACACTTTGGAAAGAAGGAAAAACAGATGAGGCTGTGCTGGCCATTGAGAACATGGAAATACGAGGGATAGTAGGGTCTGCAGCTCTTTATTATGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAGGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAACAAGCCTCTTGTAGTAACTTACACCGGTTTGATTCAAGCTTGTTTGGA
CTCAAAAGACTTGCAAAGTGCAGTCTATATTTTCAACCACATGAAGGCCTTTTGCTCACCGAATCTTGTTACTTATAATATATTGTTGAAGGGTTACTTGGAACATGGAATGTTTGAAGAGGCTAGAGAGCTGTTTCAGAATTTGTCAGAGCAAAGACGAAATATCAGCACTGTATCTGACTACAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCAGAAAAAAGATGGGATGATTTTAGCTATTTCTATAACCAGATGTTTCTTTATGGTTATCATTTCAATCCAAAACGTCATTTGAGGATGATATTGGAGGCTGCTAGGGGTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTCAGGCTGACCGTACTCCACCACCACCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGATTACTCTGAAGCGCTCTCTTCCATTTGGAGTCACAATAGTGGTGATGCACATCATTTCTCTGAGTCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAAGATACTGTTATTGAGCTAATTCATAAGGTTAGCATGGTTCTTACTAGAAATGAATCACCAAATCCAGTGTTTAAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTTTAGCTGACCATAGACTTGAAGAAACTGTTTATTAA
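
As a quick consistency check, the first and last codons of the coding sequence above can be translated with the standard genetic code. This is a minimal sketch: the function name `translate` is hypothetical, and the use of the standard (nuclear) code is an assumption, since the record does not state a translation table.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop, 'X' an unknown codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First 33 nt and last 18 nt of the CDS shown above.
cds_start = "ATGGTGGGAGTTATAATGGCGAACCTAAATTTG"
cds_end = "GAAGAAACTGTTTATTAA"

n_terminus = translate(cds_start)
c_terminus = translate(cds_end)
print(n_terminus, c_terminus)
```

The translated N-terminus begins with the same residues as the query protein in the alignments that follow, and the final codon is a stop.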
BLAST of CSPI01G25380.1 vs. Swiss-Prot
Match: PPR64_ARATH (Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidopsis thaliana GN=EMB2279 PE=2 SV=1)

HSP 1 Score: 729.6 bits (1882), Expect = 4.3e-209
Identity = 371/638 (58.15%), Positives = 473/638 (74.14%), Query Frame = 1

Query: 291  QLERSYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFD-AFDIMDKP 350
            ++ER  N   ++   K      GT   G +  + ++ +   +E  AF   D + DI+DKP
Sbjct: 371  RIERLANERHEIRSSKLS----GTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKP 430

Query: 351  RVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWR 410
              S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWR
Sbjct: 431  ATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWR 490

Query: 411  RVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAY 470
            RVLQ+IEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VAY
Sbjct: 491  RVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAY 550

Query: 471  HSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKR 530
             SIAVTLGQAG+++ELF VID+M+SPPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+R
Sbjct: 551  RSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQR 610

Query: 531  KNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYK 590
            K  EGAFWVLQ+LK++  +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+
Sbjct: 611  KQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYR 670

Query: 591  VLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEAL--------- 650
            VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L         
Sbjct: 671  VLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPV 730

Query: 651  -------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHM 710
                                Q++KIC+VANKPLVVTYTGLIQAC+DS ++++A YIF+ M
Sbjct: 731  VLKLIENLIYKADLVHTIQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQM 790

Query: 711  KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNT 770
            K  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE   +I   SD+  RVLPD Y FNT
Sbjct: 791  KKVCSPNLVTCNIMLKAYLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNT 850

Query: 771  MLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQAD 830
            MLD    +++WDDF Y Y +M  +GYHFN KRHLRM+LEA+R GK+E++E TW+H+ +++
Sbjct: 851  MLDTCAEQEKWDDFGYAYREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSN 910

Query: 831  RTPPPPLLKERFCMKLARGDYSEALSSIWSHN----SGDAHHFSESAWLNLLKEKRFPKD 890
            R PP PL+KERF  KL +GD+  A+SS+   N      +   FS SAW  +L   RF +D
Sbjct: 911  RIPPSPLIKERFFRKLEKGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQD 970

Query: 891  TVIELIHKVSMVL-TRNESPNPVFKNLLLSCKEFCRTR 895
            +V+ L+  V+  L +R+ES + V  NLL SCK++ +TR
Sbjct: 971  SVLRLMDDVNRRLGSRSESSDSVLGNLLSSCKDYLKTR 1002
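
The identity and positives percentages reported for this hit follow directly from the match counts and the alignment length. A minimal sketch, assuming the percentages are simple ratios rounded to two decimals (the helper name `percent` is hypothetical):

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts from the PPR64_ARATH hit: 371 identical and 473 positive-scoring
# positions over an alignment of 638 columns.
identity = percent(371, 638)
positives = percent(473, 638)
print(identity, positives)
```

The same calculation applies to the other hits in this record, using each hit's own counts and alignment length.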

BLAST of CSPI01G25380.1 vs. Swiss-Prot
Match: PP451_ARATH (Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidopsis thaliana GN=DG1 PE=1 SV=2)

HSP 1 Score: 387.5 bits (994), Expect = 4.0e-106
Identity = 207/546 (37.91%), Positives = 327/546 (59.89%), Query Frame = 1

Query: 358 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEW 417
           E +++L  RL+G +I+   W F +MM  + +++++  +L+++  LG+  +W++   ++ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 418 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 477
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 478 QAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 537
           QAG ++EL  VI+ M+  P K  K    + WDP L+PD+V+YNA+LNACV     +   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 538 VLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 597
           V  EL+K  L+P+ +TYGL MEVMLE GK++ VH+FFRK++ S   P A+TYKVLV  LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422

Query: 598 KEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 657
           +EGK +EAV A+ +ME +G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482

Query: 658 VTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNL 717
           +T+TGLI A L+   +   + IF +MK  C PN+ T N++LK Y  + MF EA+ELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542

Query: 718 SEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHL 777
                    VS     ++P+ Y ++ ML+AS    +W+ F + Y  M L GY  +  +H 
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602

Query: 778 RMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSG 837
            M++EA+R GK  LLE  +  + +    P P    E  C   A+GD+  A++ I +  + 
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLEDGEIPHPLFFTELLCHATAKGDFQRAITLI-NTVAL 662

Query: 838 DAHHFSESAWLNLLKEKR--FPKDTVIELIHKVS-MVLTRNESPNPVFKNLLLSCKEFCR 897
            +   SE  W +L +E +    +D     +HK+S  ++  +    P   NL  S K  C 
Sbjct: 663 ASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRCG 714

Query: 898 TRISLA 899
           +  S A
Sbjct: 723 SSSSSA 714

BLAST of CSPI01G25380.1 vs. Swiss-Prot
Match: PPR96_ARATH (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 109.8 bits (273), Expect = 1.6e-22
Identity = 86/332 (25.90%), Positives = 153/332 (46.08%), Query Frame = 1

Query: 418 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 477
           L+  E+ K      IYTT +D L   +   +ALN+F  M ++    P++V Y+S+   L 
Sbjct: 243 LKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEM-DNKGIRPNVVTYNSLIRCLC 302

Query: 478 QAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 537
             G   +   ++  M            +E+   ++ P++V ++A+++A VK   L  A  
Sbjct: 303 NYGRWSDASRLLSDM------------IER---KINPNVVTFSALIDAFVKEGKLVEAEK 362

Query: 538 VLQELKKQSLQPSTSTY-----GLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLV 597
           +  E+ K+S+ P   TY     G  M   L+  K    H F   + K   PN +TY  L+
Sbjct: 363 LYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAK----HMFELMISKDCFPNVVTYNTLI 422

Query: 598 NTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANK 657
               K  + +E +     M  RG+VG+   Y    + L  AG C  A    +K+      
Sbjct: 423 KGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVP 482

Query: 658 PLVVTYTGLIQACLDSKDLQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEAREL 717
           P ++TY+ L+        L+ A+ +F ++ K+   P++ TYNI+++G  + G  E+  +L
Sbjct: 483 PDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDL 542

Query: 718 FQNLSEQRRNISTVSDYRDRVLPDIYMFNTML 744
           F +LS +             V P++ ++ TM+
Sbjct: 543 FCSLSLK------------GVKPNVIIYTTMI 542

BLAST of CSPI01G25380.1 vs. Swiss-Prot
Match: PPR91_ARATH (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 109.4 bits (272), Expect = 2.1e-22
Identity = 80/314 (25.48%), Positives = 144/314 (45.86%), Query Frame = 1

Query: 432 IYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDS 491
           IY T +D L K +   +ALN+F  M+      P++V Y S+   L   G   +   ++  
Sbjct: 258 IYNTIIDGLCKYKHMDDALNLFKEMETK-GIRPNVVTYSSLISCLCNYGRWSDASRLLSD 317

Query: 492 MQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPST 551
           M            +E+   ++ PD+  ++A+++A VK   L  A  +  E+ K+S+ PS 
Sbjct: 318 M------------IER---KINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSI 377

Query: 552 STYGLVMEVMLECGKYNLVHEFFR-KVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIEN 611
            TY  ++       + +   + F   V K   P+ +TY  L+    K  + +E +     
Sbjct: 378 VTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFRE 437

Query: 612 MEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKD 671
           M  RG+VG+   Y    + L  AG C  A    +++      P ++TY  L+     +  
Sbjct: 438 MSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGK 497

Query: 672 LQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYR 731
           L+ A+ +F ++ ++   P + TYNI+++G  + G  E+  +LF NLS +           
Sbjct: 498 LEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLK----------- 543

Query: 732 DRVLPDIYMFNTML 744
             V PD+  +NTM+
Sbjct: 558 -GVKPDVVAYNTMI 543

BLAST of CSPI01G25380.1 vs. Swiss-Prot
Match: PP389_ARATH (Pentatricopeptide repeat-containing protein At5g16640, mitochondrial OS=Arabidopsis thaliana GN=At5g16640 PE=2 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 3.6e-22
Identity = 77/314 (24.52%), Positives = 145/314 (46.18%), Query Frame = 1

Query: 432 IYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDS 491
           IY T +D L K+++   AL++ + M++     PD+V Y+S+   L  +G   +   ++  
Sbjct: 188 IYNTIIDGLCKSKQVDNALDLLNRMEKDGIG-PDVVTYNSLISGLCSSGRWSDATRMVSC 247

Query: 492 MQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPST 551
           M                   + PD+  +NA+++ACVK   +  A    +E+ ++SL P  
Sbjct: 248 MTKR---------------EIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDI 307

Query: 552 STYGLVMEVMLECGKYNLVHEFFR-KVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIEN 611
            TY L++  +    + +   E F   V K   P+ +TY +L+N   K  K +  +     
Sbjct: 308 VTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCE 367

Query: 612 MEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKD 671
           M  RG+V +   Y    +  C AG+   A     ++      P ++TY  L+    D+  
Sbjct: 368 MSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGK 427

Query: 672 LQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYR 731
           ++ A+ I   M K     ++VTYNI+++G  + G   +A +++ +L+ Q           
Sbjct: 428 IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLNCQ----------- 473

Query: 732 DRVLPDIYMFNTML 744
             ++PDI+ + TM+
Sbjct: 488 -GLMPDIWTYTTMM 473

BLAST of CSPI01G25380.1 vs. TrEMBL
Match: A0A0A0LVN7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G553530 PE=4 SV=1)

HSP 1 Score: 1807.0 bits (4679), Expect = 0.0e+00
Identity = 898/907 (99.01%), Positives = 901/907 (99.34%), Query Frame = 1

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSV GTD SLSDAKNRVLRH
Sbjct: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK
Sbjct: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
           LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIA EKD
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180

Query: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKPI 240
           LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNS+KPI
Sbjct: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKPI 240

Query: 241 HFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPGD 300
           HFKAN LEVKKESSRVSDGNSMKTSEKIWAWGDDDAKP KGVLKAGKYGIQLERSYNPGD
Sbjct: 241 HFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPGD 300

Query: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360
           KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI
Sbjct: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360

Query: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420
           QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM
Sbjct: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420

Query: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480
           RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG
Sbjct: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480

Query: 481 YMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540
           YMRELFDVIDSM+SPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ
Sbjct: 481 YMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540

Query: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600
           ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK
Sbjct: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600

Query: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660
           TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG
Sbjct: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660

Query: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720
           LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR
Sbjct: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720

Query: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780
           NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE
Sbjct: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780

Query: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840
           AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF
Sbjct: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840

Query: 841 SESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900
           SESAWLNLLKEKRFP+DTVIELIHKV MVLTRNESPNPVFKNLLLSCKEFCRTRISLADH
Sbjct: 841 SESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900

Query: 901 RLEETVY 908
           RLEETVY
Sbjct: 901 RLEETVY 907

BLAST of CSPI01G25380.1 vs. TrEMBL
Match: M5WJN1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001195mg PE=4 SV=1)

HSP 1 Score: 877.9 bits (2267), Expect = 1.1e-251
Identity = 469/841 (55.77%), Positives = 587/841 (69.80%), Query Frame = 1

Query: 68  IKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKLTMKENG 127
           I A+S   SD     G +LE +F+FKPSFD+Y+KVM TVR R  + + D   +   K N 
Sbjct: 65  ISAVSKEGSDNRSVGGEILEKEFEFKPSFDQYLKVMGTVRLRSDRDKQDSSKEQNPKHNL 124

Query: 128 SAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKDLSGNKFD 187
            ++    + +S+ +    K+ + + + + +   K   + +   N   I  +    G    
Sbjct: 125 RSRGVSRSLVSEGNEEHVKLGESEEHSNQEKASKAAKQNEALGNRNGIMGKSKRQG---- 184

Query: 188 RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKR---------NENWSSYIEPRVTRSNSEK 247
                      VKG    + S  +++  +EK+            +S  +EP       E 
Sbjct: 185 -----------VKGFKDEYDSRQSNRDEKEKKKIRGEARDGRSKYSGRLEP-------EL 244

Query: 248 PIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNP 307
               K+ M    K+  RV      K+++K +  G    K   G          LER++  
Sbjct: 245 NFRGKSTMARNVKDDLRV-----YKSTDKSFDRGKVGVKIQGG----------LERNHIN 304

Query: 308 GDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAF-DIMDKPRVSKMEME 367
            +    +   +     + SG+ F + N  NS+EVE AAF NFD F DIMDKPRVS+MEME
Sbjct: 305 AENATDRGFSRRSEKLTKSGRDFPKKNYDNSMEVERAAFKNFDEFGDIMDKPRVSQMEME 364

Query: 368 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEW 427
           ERIQ L+K LNGADIDMPEWMFS+MMRSA+IR++DHSILRVIQ+LGKLGNWRRVLQ+IEW
Sbjct: 365 ERIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLLGKLGNWRRVLQVIEW 424

Query: 428 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 487
           LQMRERFKSHKLR+IYTTALDVLGKARRPVEALNVFHAM +  SSYPDLVAYHSIAVTLG
Sbjct: 425 LQMRERFKSHKLRYIYTTALDVLGKARRPVEALNVFHAMLQEMSSYPDLVAYHSIAVTLG 484

Query: 488 QAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 547
           QAG+MRELFDVID+M+SPPKKKFKTG L KWDPRL+PDIV+++AVLNACV+RK  EGAFW
Sbjct: 485 QAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDIVVFHAVLNACVQRKQWEGAFW 544

Query: 548 VLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 607
           VLQ+L++Q LQP+ +TYGLVMEVML CGKYNLVHEFF+KVQKSSIPNALT++V+VNTLW+
Sbjct: 545 VLQQLQQQGLQPAATTYGLVMEVMLACGKYNLVHEFFKKVQKSSIPNALTFRVIVNTLWR 604

Query: 608 EGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 667
           EGK  EAVL ++NME RGIVGSAALYYDFARCLCSAGRC+EALMQ+EKICKVANKPLVVT
Sbjct: 605 EGKVGEAVLVVQNMERRGIVGSAALYYDFARCLCSAGRCQEALMQIEKICKVANKPLVVT 664

Query: 668 YTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSE 727
           YTGLIQACLD+  +++  Y+F  M+ FCSPNLVT N +LKGYL+HGMFEEA+ELF  + +
Sbjct: 665 YTGLIQACLDAGSIKNGAYVFKQMENFCSPNLVTCNTMLKGYLDHGMFEEAKELFLKMLD 724

Query: 728 QRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRM 787
              NIS+ SD + RV PD Y FNT+LDA   EKRWDDF + Y  M  +GYHFN KRHLRM
Sbjct: 725 NGNNISSKSDCKARVKPDSYTFNTLLDACITEKRWDDFEFVYKMMLHHGYHFNAKRHLRM 784

Query: 788 ILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDA 847
           IL+A   GK ELL+ TW HL +A R+PPPPL+KERFC KL + DY+ AL+ I   N  + 
Sbjct: 785 ILDACEAGKGELLDITWTHLTEAGRSPPPPLIKERFCTKLEKDDYAAALTCITDPNLSEL 844

Query: 848 H-HFSESAWLNLLKE--KRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTR 896
              FS++AWL L KE  ++F KDT + L+H+ S+++ R +  NPVF+NL+ +C E  RT 
Sbjct: 845 QTFFSKNAWLKLFKENAEKFQKDTFVRLVHEGSILINRTDRSNPVFQNLMAACGELDRTS 868
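
The percentages in each HSP summary line follow directly from the counts: identity is the number of identical residues divided by the alignment length, and positives is the number of identical or conservatively substituted residues divided by the same length. The following is a minimal sketch, not part of the original report, that parses one summary line and recomputes both values. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions; the pattern accepts the label spelled either "Positives" or "Postives".

```python
import re

# Pattern for the HSP summary line as it appears in this report.
# "Posi?tives" tolerates the misspelled label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Recompute identity/positives percentages from the raw counts.

    Hypothetical helper for illustration; returns both the recomputed
    and the reported percentages so they can be compared.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


# Summary line of the M5WJN1_PRUPE hit above.
stats = parse_hsp_stats(
    "Identity = 469/841 (55.77%), Positives = 587/841 (69.80%), Query Frame = 1"
)
print(stats)
```

For the M5WJN1_PRUPE hit the recomputed values agree with the reported ones to two decimal places.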

BLAST of CSPI01G25380.1 vs. TrEMBL
Match: W9RFN3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_025948 PE=4 SV=1)

HSP 1 Score: 862.4 bits (2227), Expect = 4.8e-247
Identity = 486/922 (52.71%), Positives = 620/922 (67.25%), Query Frame = 1

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           M G+I  N  L + +    G     C   S +S   S       G  L++        ++
Sbjct: 1   MAGMIATNGKLGVSSFHGNGVFASKCRQTSFSSCGFSLIRRPNFGIGLNV--------KN 60

Query: 61  RVHKCGSI-KALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPN 120
           R   CG++ +A SNG SD  L  G+LLE +F+FKPSFD+Y+KVME+VRT R K+Q    N
Sbjct: 61  RRRNCGTVTRAGSNGGSDSKLVGGSLLEKEFEFKPSFDDYLKVMESVRTVRDKKQKSTHN 120

Query: 121 -KLTMKENGSAKSAE-STSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAR 180
            + T    G+ +S     S  ++D GK                  VDK + F + + + +
Sbjct: 121 LRETFLSEGNEESVRLGKSEERLDRGK--------------ALDFVDKDESFKSRDGVKK 180

Query: 181 EKDLSGNKFDRRKVV--------TRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEP 240
           ++        R+K+         T +N   +GK  P  SL   K         WS     
Sbjct: 181 KES------QRKKITELKGRFEGTENNWTGRGKRKPVRSLTGRK---------WSK---- 240

Query: 241 RVTRSNSEKPIHFKANML---EVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAG 300
           + TR    +  ++  +M    E K  SSRV  GN  ++ + IW   +D +    GV +  
Sbjct: 241 QQTREEDAEANNYNIDMRREHEDKANSSRVL-GNK-RSDDSIW---NDGSMAKAGVRE-- 300

Query: 301 KYGIQLERSYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAF-DI 360
           + G+   +          K  ++          R  E ++K SL  E AAF NFD + DI
Sbjct: 301 ETGVVNNKWRERNRIQDNKVIDKDIVPKHGRINRRTEVDDK-SLREERAAFRNFDDYNDI 360

Query: 361 MDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKL 420
           + KPR+ +MEM+ERIQ L+  LNGAD+DMPEWMFS+MMRSA+I ++DHSI RVIQ+LGK 
Sbjct: 361 LGKPRLPRMEMDERIQKLAMSLNGADVDMPEWMFSKMMRSARIIFTDHSISRVIQILGKF 420

Query: 421 GNWRRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPD 480
           GNWRRV+Q+IEWLQ+RERFKSHKLR+IYTTAL+VLGKARRPVEALNVF+AM +H SSYPD
Sbjct: 421 GNWRRVVQVIEWLQIRERFKSHKLRYIYTTALNVLGKARRPVEALNVFNAMLQHMSSYPD 480

Query: 481 LVAYHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNA 540
           LVAYHSIAVTLGQAGYM+ELFDVID+M+SPPKKKFKTG L KWDPR++PDI++YNAVLNA
Sbjct: 481 LVAYHSIAVTLGQAGYMKELFDVIDTMRSPPKKKFKTGALGKWDPRVEPDIIMYNAVLNA 540

Query: 541 CVKRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNA 600
           CV+RK  EGAFWVLQ+LK+++L PS +TYGLVMEVML CGKYNLVH+FFRKVQKSSIPNA
Sbjct: 541 CVQRKQWEGAFWVLQQLKEKALNPSVTTYGLVMEVMLVCGKYNLVHDFFRKVQKSSIPNA 600

Query: 601 LTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEK 660
           LTY+VL+NTL KEGK DEAVLA++NME RGIVGSAALYYD ARCLCSAGRC+EALMQ++K
Sbjct: 601 LTYRVLLNTLSKEGKLDEAVLAVQNMEKRGIVGSAALYYDLARCLCSAGRCQEALMQIDK 660

Query: 661 ICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMF 720
           ICKVA+KPLVVTYTGLIQACLDS +++   YIFNHMK FCS NLVT NI+LKGYL+HG F
Sbjct: 661 ICKVASKPLVVTYTGLIQACLDSGNIEDGAYIFNHMKDFCSRNLVTCNIMLKGYLKHGKF 720

Query: 721 EEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLY 780
           +EA+ELF+ + +    I + +D++  V PDIY FNTM DA   EK+WDDF Y Y +M  +
Sbjct: 721 KEAKELFEKMLQDASLIKSKADHKALVAPDIYTFNTMFDACITEKKWDDFEYAYKKMLHH 780

Query: 781 GYHFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEA 840
           GYHFN KRHL+MIL A+R GK ELL+ TW HL +ADR PP  L+KE+FCMKL + DY  A
Sbjct: 781 GYHFNAKRHLQMILNASRVGKGELLDITWNHLVEADRIPPSSLIKEKFCMKLEKEDYIAA 840

Query: 841 LSSIWSHNSGDAHHFSESAWLNLLKE--KRFPKDTVIELIHKVSMVLTRNESPNPVFKNL 900
           LS I + N  ++  FS+ AW  LL E  +RF K T++ LI ++  ++ R++ P+ V  NL
Sbjct: 841 LSCICNQNLSESREFSKKAWSKLLDENSERFRKGTLVRLIREIDNIIARSDQPDSVLVNL 872

Query: 901 LLSCKEFCRTRISLADHRLEET 906
           L+SCKE  RT + +AD  L ET
Sbjct: 901 LVSCKELSRTCV-VADVELTET 872

BLAST of CSPI01G25380.1 vs. TrEMBL
Match: B9T6B9_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0237710 PE=4 SV=1)

HSP 1 Score: 857.8 bits (2215), Expect = 1.2e-245
Identity = 463/855 (54.15%), Positives = 596/855 (69.71%), Query Frame = 1

Query: 68  IKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKLTMKENG 127
           IKALS+G+SD  L  G +LE + +FKPSFDEY+K ME+V+T   K+        T K +G
Sbjct: 10  IKALSSGDSDNRLVGGGILEKELEFKPSFDEYLKAMESVKTGITKKH-------TRKLSG 69

Query: 128 SAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKDLSGNKFD 187
           +    +S   S+   GK          + +   K  +  +L  N +     KD + +K  
Sbjct: 70  NKVKDDSKEGSRTSVGKT---------EWRGKLKFKENDELGENEDGEIDRKDETSSKIY 129

Query: 188 RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENW-SSYIEPRVTRSNSEKPIHFKA-- 247
           + + +  SN KV GK +   + V  K     R+  W ++     +T       +  K   
Sbjct: 130 KERGIRESNLKVTGKESRAYANVKRKIRGATRDREWLNNGTSSMITELEDINQVKVKRTQ 189

Query: 248 NMLEVKKESSRVSDGNSMKTSEKIWAWGDD--DAKPPKGVLKAG--------KYGIQLER 307
           N+ E       V    S    ++ +A+G +  +    KG    G        K G +L R
Sbjct: 190 NVQERTLAIDGVRRSQSTTGKKEEFAYGQNFPEMLRRKGKTHIGEEDGVSGNKMGGRLVR 249

Query: 308 SYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFD-IMDKPRVSK 367
           +Y   DK   K+  +       + + FL++  ++  EVE AAF + + ++    +P+ SK
Sbjct: 250 NYVQIDKNTDKEFMEKKGLIRRTNQAFLDYGHEDDSEVERAAFKSLEEYNNFTGRPQNSK 309

Query: 368 MEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ 427
            E+E+R+Q L+K LNGADIDMPEWMFS+MMRSA+I+Y+DHS+LR+IQ+LGKLGNWRRVLQ
Sbjct: 310 REVEDRLQKLAKCLNGADIDMPEWMFSKMMRSARIKYTDHSVLRIIQILGKLGNWRRVLQ 369

Query: 428 IIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIA 487
           +IEWLQMRERFKSH+LR IYTTAL+VLGKA+RPVEALNVFH MQ+  SSYPDLVAYH IA
Sbjct: 370 VIEWLQMRERFKSHRLRNIYTTALNVLGKAQRPVEALNVFHVMQQQMSSYPDLVAYHCIA 429

Query: 488 VTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLE 547
           VTLGQAG+M +LFDVIDSM+SPPKKKFK   + KWDPRL+PDIV+YNAVLNACV+RK  E
Sbjct: 430 VTLGQAGHMEQLFDVIDSMRSPPKKKFKMAAVHKWDPRLEPDIVVYNAVLNACVQRKQWE 489

Query: 548 GAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVN 607
           GAFWVLQ+LK+Q LQPST+TYGL+MEVM  CGKYNLVHEFFRKVQKSSIPNAL YKVLVN
Sbjct: 490 GAFWVLQQLKQQGLQPSTTTYGLIMEVMFACGKYNLVHEFFRKVQKSSIPNALVYKVLVN 549

Query: 608 TLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKP 667
           TLW+EGKTDEAVLA+E ME RGIVG AALYYD ARCLCSAGRC+EAL+Q+EKIC+VANKP
Sbjct: 550 TLWREGKTDEAVLAVEEMERRGIVGFAALYYDLARCLCSAGRCQEALLQIEKICRVANKP 609

Query: 668 LVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQ 727
           LVVTYTGLIQACLDS ++ +AVYIFN MK FCSPNLVT+N++LK Y EHG+FE+A+ELF 
Sbjct: 610 LVVTYTGLIQACLDSGNIHNAVYIFNQMKHFCSPNLVTFNVMLKAYFEHGLFEDAKELFH 669

Query: 728 NLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKR 787
            ++E   +I    DY+ RV+PDIY FNTMLDA  +EK WDDF Y Y +M  +G+HFN KR
Sbjct: 670 KMTEDSNHIRGNHDYKVRVIPDIYTFNTMLDACISEKSWDDFEYVYRRMLHHGFHFNGKR 729

Query: 788 HLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHN 847
           HLRMIL+A+R GK E LE TWKHLA+ADR PPP L+KERF + L + D   AL+ I ++ 
Sbjct: 730 HLRMILDASRAGKVEPLEMTWKHLARADRIPPPNLIKERFRIMLEKDDCKSALACITTNP 789

Query: 848 SGDAHHFSESAWLNLLKE--KRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFC 907
            G++  F + AWLNL KE  ++  +DT+I+L H+VSM++     P+PV +NLL SC +F 
Sbjct: 790 MGESPAFHKVAWLNLFKENAEQIRRDTLIQLKHEVSMLV---NPPDPVLQNLLASCNDFL 845

BLAST of CSPI01G25380.1 vs. TrEMBL
Match: A0A061FSP7_THECC (Pentatricopeptide repeat-containing protein isoform 1 OS=Theobroma cacao GN=TCM_042369 PE=4 SV=1)

HSP 1 Score: 849.4 bits (2193), Expect = 4.2e-243
Identity = 455/828 (54.95%), Positives = 575/828 (69.44%), Query Frame = 1

Query: 83  GNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKLTMKENGSAKSAESTSISKIDN 142
           G +LE +  FKPSFDEY+K ME+VR ++                 S KS    SI K + 
Sbjct: 64  GGILEKELDFKPSFDEYLKTMESVREKKQ----------------SLKSNRGNSIEKSNR 123

Query: 143 GKNKVTDVQHNVDVKNMFKRVDK-KDLFNNTERIAREKDLSGNKFDRRKVVTRSNDKVKG 202
           GK+K        D +  F   +K   +  + E   + K+ +  +  +  +V   +D +K 
Sbjct: 124 GKSKD-------DSRRKFGEEEKVSKVVEHNEVKMKSKEATRTRSRKALLVKGEDDDLKA 183

Query: 203 KMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKPIHFKANMLEVKKESSRVSDGNS 262
           +   + +        +K          P+V+R   E  I   AN+   K +S   SD   
Sbjct: 184 ETDEYKNFEGSNDVVDK----------PQVSRIKMEGRITKLANL--GKYDSKSKSDEGD 243

Query: 263 MKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPGDKVGRKKTEQSYRGTSTSGKRF 322
           ++                  ++K G++  +++ S     K     T       + S K F
Sbjct: 244 VR------------------LMKFGEFSEEVKMSKIV--KWNGVNTMNEGARRTRSRKAF 303

Query: 323 LEFNEKNSLEVEHAAFNNFD-AFDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFS 382
           LE +E + L +E +AF NF+ + D+ DKPR SKMEMEER+Q L+K LNGADIDMPEWMFS
Sbjct: 304 LEEDEDDDLRMERSAFKNFEESNDVFDKPRASKMEMEERVQRLAKSLNGADIDMPEWMFS 363

Query: 383 QMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQMRERFKSHKLRFIYTTALDVL 442
           +MMRSAKI+++D+ ILRVIQ LGKLGNWRRVLQ+IEWLQMRERFKS++LR IYTTALDVL
Sbjct: 364 KMMRSAKIKFTDYCILRVIQALGKLGNWRRVLQVIEWLQMRERFKSYRLRHIYTTALDVL 423

Query: 443 GKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMQSPPKKKF 502
           GKARRPVEALN+FH+MQ+  +SYPD+VAYHSIAVTLGQAG+MRELF VIDSM+SPPKKKF
Sbjct: 424 GKARRPVEALNIFHSMQQQMASYPDIVAYHSIAVTLGQAGHMRELFHVIDSMRSPPKKKF 483

Query: 503 KTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEV 562
           KT ++ KWDPRL+PDIV+YNAVLNAC +RK  EGAFWVLQ+LK+Q LQ S +TYGLVMEV
Sbjct: 484 KTRIIGKWDPRLEPDIVVYNAVLNACAQRKQWEGAFWVLQQLKQQHLQLSATTYGLVMEV 543

Query: 563 MLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSA 622
           M  CGKYNLVHEFFRK++KSS+PNALTY+VLVNTLWKEGK D+AVLA++ ME RGIVGSA
Sbjct: 544 MFACGKYNLVHEFFRKIEKSSMPNALTYRVLVNTLWKEGKIDDAVLAVQGMEKRGIVGSA 603

Query: 623 ALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNH 682
           ALYYD ARCLCS+GRC+EALMQ+EKICKVA+KPLVVTYTGLIQACLDS ++Q+  YIFN 
Sbjct: 604 ALYYDLARCLCSSGRCQEALMQIEKICKVASKPLVVTYTGLIQACLDSGNIQNGAYIFNE 663

Query: 683 MKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFN 742
           M+ FCSPNLVT NI+LK YL+H +F++A++LFQ + E    IS+ SDY  RV+PD Y FN
Sbjct: 664 MQNFCSPNLVTCNIMLKAYLDHRLFDQAKDLFQKMLEDANQISSKSDYLHRVIPDSYTFN 723

Query: 743 TMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQA 802
            MLDA   +KRWD+F   Y +M  + +HFN KRHL MIL+AAR GK EL+ETTW+H+A+A
Sbjct: 724 IMLDACVQQKRWDEFERVYRKMLHHEFHFNAKRHLHMILDAARAGKGELIETTWEHMARA 783

Query: 803 DRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHFSESAWLNLLKE--KRFPKDT 862
           DRTPP PL+KERFCMKL + DY  ALS I  H   +   FS+SAW N  K+   RF KD 
Sbjct: 784 DRTPPLPLIKERFCMKLEKNDYISALSCITIHPLRELQAFSKSAWSNFFKDNASRFRKDI 836

Query: 863 VIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLADHRLEETV 907
           ++ L+ +V  +L R++SPNP+  NLL S KEF RT  + AD  L +TV
Sbjct: 844 IVGLVDEVENILGRSDSPNPILHNLLTSSKEFLRTHWTSADANLTQTV 836

BLAST of CSPI01G25380.1 vs. TAIR10
Match: AT1G30610.1 (AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 729.6 bits (1882), Expect = 2.4e-210
Identity = 371/638 (58.15%), Positives = 473/638 (74.14%), Query Frame = 1

Query: 291  QLERSYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFD-AFDIMDKP 350
            ++ER  N   ++   K      GT   G +  + ++ +   +E  AF   D + DI+DKP
Sbjct: 371  RIERLANERHEIRSSKLS----GTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKP 430

Query: 351  RVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWR 410
              S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWR
Sbjct: 431  ATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWR 490

Query: 411  RVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAY 470
            RVLQ+IEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VAY
Sbjct: 491  RVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAY 550

Query: 471  HSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKR 530
             SIAVTLGQAG+++ELF VID+M+SPPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+R
Sbjct: 551  RSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQR 610

Query: 531  KNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYK 590
            K  EGAFWVLQ+LK++  +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+
Sbjct: 611  KQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYR 670

Query: 591  VLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEAL--------- 650
            VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L         
Sbjct: 671  VLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPV 730

Query: 651  -------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHM 710
                                Q++KIC+VANKPLVVTYTGLIQAC+DS ++++A YIF+ M
Sbjct: 731  VLKLIENLIYKADLVHTIQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQM 790

Query: 711  KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNT 770
            K  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE   +I   SD+  RVLPD Y FNT
Sbjct: 791  KKVCSPNLVTCNIMLKAYLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNT 850

Query: 771  MLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILEAARGGKDELLETTWKHLAQAD 830
            MLD    +++WDDF Y Y +M  +GYHFN KRHLRM+LEA+R GK+E++E TW+H+ +++
Sbjct: 851  MLDTCAEQEKWDDFGYAYREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSN 910

Query: 831  RTPPPPLLKERFCMKLARGDYSEALSSIWSHN----SGDAHHFSESAWLNLLKEKRFPKD 890
            R PP PL+KERF  KL +GD+  A+SS+   N      +   FS SAW  +L   RF +D
Sbjct: 911  RIPPSPLIKERFFRKLEKGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQD 970

Query: 891  TVIELIHKVSMVL-TRNESPNPVFKNLLLSCKEFCRTR 895
            +V+ L+  V+  L +R+ES + V  NLL SCK++ +TR
Sbjct: 971  SVLRLMDDVNRRLGSRSESSDSVLGNLLSSCKDYLKTR 1002

BLAST of CSPI01G25380.1 vs. TAIR10
Match: AT5G67570.1 (AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 387.5 bits (994), Expect = 2.3e-107
Identity = 207/546 (37.91%), Positives = 327/546 (59.89%), Query Frame = 1

Query: 358 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEW 417
           E +++L  RL+G +I+   W F +MM  + +++++  +L+++  LG+  +W++   ++ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 418 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 477
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 478 QAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 537
           QAG ++EL  VI+ M+  P K  K    + WDP L+PD+V+YNA+LNACV     +   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 538 VLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 597
           V  EL+K  L+P+ +TYGL MEVMLE GK++ VH+FFRK++ S   P A+TYKVLV  LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422

Query: 598 KEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 657
           +EGK +EAV A+ +ME +G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482

Query: 658 VTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNL 717
           +T+TGLI A L+   +   + IF +MK  C PN+ T N++LK Y  + MF EA+ELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542

Query: 718 SEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHL 777
                    VS     ++P+ Y ++ ML+AS    +W+ F + Y  M L GY  +  +H 
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602

Query: 778 RMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSG 837
            M++EA+R GK  LLE  +  + +    P P    E  C   A+GD+  A++ I +  + 
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLEDGEIPHPLFFTELLCHATAKGDFQRAITLI-NTVAL 662

Query: 838 DAHHFSESAWLNLLKEKR--FPKDTVIELIHKVS-MVLTRNESPNPVFKNLLLSCKEFCR 897
            +   SE  W +L +E +    +D     +HK+S  ++  +    P   NL  S K  C 
Sbjct: 663 ASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRCG 714

Query: 898 TRISLA 899
           +  S A
Sbjct: 723 SSSSSA 714

BLAST of CSPI01G25380.1 vs. TAIR10
Match: AT1G62930.1 (AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 109.8 bits (273), Expect = 9.1e-24
Identity = 86/332 (25.90%), Positives = 153/332 (46.08%), Query Frame = 1

Query: 418 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 477
           L+  E+ K      IYTT +D L   +   +ALN+F  M ++    P++V Y+S+   L 
Sbjct: 243 LKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEM-DNKGIRPNVVTYNSLIRCLC 302

Query: 478 QAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 537
             G   +   ++  M            +E+   ++ P++V ++A+++A VK   L  A  
Sbjct: 303 NYGRWSDASRLLSDM------------IER---KINPNVVTFSALIDAFVKEGKLVEAEK 362

Query: 538 VLQELKKQSLQPSTSTY-----GLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLV 597
           +  E+ K+S+ P   TY     G  M   L+  K    H F   + K   PN +TY  L+
Sbjct: 363 LYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAK----HMFELMISKDCFPNVVTYNTLI 422

Query: 598 NTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANK 657
               K  + +E +     M  RG+VG+   Y    + L  AG C  A    +K+      
Sbjct: 423 KGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVP 482

Query: 658 PLVVTYTGLIQACLDSKDLQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEAREL 717
           P ++TY+ L+        L+ A+ +F ++ K+   P++ TYNI+++G  + G  E+  +L
Sbjct: 483 PDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDL 542

Query: 718 FQNLSEQRRNISTVSDYRDRVLPDIYMFNTML 744
           F +LS +             V P++ ++ TM+
Sbjct: 543 FCSLSLK------------GVKPNVIIYTTMI 542

BLAST of CSPI01G25380.1 vs. TAIR10
Match: AT1G62670.1 (AT1G62670.1 rna processing factor 2)

HSP 1 Score: 109.4 bits (272), Expect = 1.2e-23
Identity = 80/314 (25.48%), Positives = 144/314 (45.86%), Query Frame = 1

Query: 432 IYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDS 491
           IY T +D L K +   +ALN+F  M+      P++V Y S+   L   G   +   ++  
Sbjct: 258 IYNTIIDGLCKYKHMDDALNLFKEMETK-GIRPNVVTYSSLISCLCNYGRWSDASRLLSD 317

Query: 492 MQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPST 551
           M            +E+   ++ PD+  ++A+++A VK   L  A  +  E+ K+S+ PS 
Sbjct: 318 M------------IER---KINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSI 377

Query: 552 STYGLVMEVMLECGKYNLVHEFFR-KVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIEN 611
            TY  ++       + +   + F   V K   P+ +TY  L+    K  + +E +     
Sbjct: 378 VTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFRE 437

Query: 612 MEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKD 671
           M  RG+VG+   Y    + L  AG C  A    +++      P ++TY  L+     +  
Sbjct: 438 MSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGK 497

Query: 672 LQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYR 731
           L+ A+ +F ++ ++   P + TYNI+++G  + G  E+  +LF NLS +           
Sbjct: 498 LEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLK----------- 543

Query: 732 DRVLPDIYMFNTML 744
             V PD+  +NTM+
Sbjct: 558 -GVKPDVVAYNTMI 543

BLAST of CSPI01G25380.1 vs. TAIR10
Match: AT5G16640.1 (AT5G16640.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 108.6 bits (270), Expect = 2.0e-23
Identity = 77/314 (24.52%), Positives = 145/314 (46.18%), Query Frame = 1

Query: 432 IYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDS 491
           IY T +D L K+++   AL++ + M++     PD+V Y+S+   L  +G   +   ++  
Sbjct: 188 IYNTIIDGLCKSKQVDNALDLLNRMEKDGIG-PDVVTYNSLISGLCSSGRWSDATRMVSC 247

Query: 492 MQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQSLQPST 551
           M                   + PD+  +NA+++ACVK   +  A    +E+ ++SL P  
Sbjct: 248 MTKR---------------EIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDI 307

Query: 552 STYGLVMEVMLECGKYNLVHEFFR-KVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIEN 611
            TY L++  +    + +   E F   V K   P+ +TY +L+N   K  K +  +     
Sbjct: 308 VTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCE 367

Query: 612 MEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKD 671
           M  RG+V +   Y    +  C AG+   A     ++      P ++TY  L+    D+  
Sbjct: 368 MSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGK 427

Query: 672 LQSAVYIFNHM-KAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRRNISTVSDYR 731
           ++ A+ I   M K     ++VTYNI+++G  + G   +A +++ +L+ Q           
Sbjct: 428 IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLNCQ----------- 473

Query: 732 DRVLPDIYMFNTML 744
             ++PDI+ + TM+
Sbjct: 488 -GLMPDIWTYTTMM 473

BLAST of CSPI01G25380.1 vs. NCBI nr
Match: gi|778662053|ref|XP_004135752.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus])

HSP 1 Score: 1807.0 bits (4679), Expect = 0.0e+00
Identity = 898/907 (99.01%), Positives = 901/907 (99.34%), Query Frame = 1

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAKNRVLRH 60
           MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSV GTD SLSDAKNRVLRH
Sbjct: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60

Query: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
           RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK
Sbjct: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120

Query: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKD 180
           LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIA EKD
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180

Query: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEKPI 240
           LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNS+KPI
Sbjct: 181 LSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKPI 240

Query: 241 HFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNPGD 300
           HFKAN LEVKKESSRVSDGNSMKTSEKIWAWGDDDAKP KGVLKAGKYGIQLERSYNPGD
Sbjct: 241 HFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPGD 300

Query: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360
           KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI
Sbjct: 301 KVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEERI 360

Query: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420
           QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM
Sbjct: 361 QMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQM 420

Query: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480
           RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG
Sbjct: 421 RERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAG 480

Query: 481 YMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540
           YMRELFDVIDSM+SPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ
Sbjct: 481 YMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQ 540

Query: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600
           ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK
Sbjct: 541 ELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGK 600

Query: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660
           TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG
Sbjct: 601 TDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTG 660

Query: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720
           LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR
Sbjct: 661 LIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQRR 720

Query: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780
           NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE
Sbjct: 721 NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMILE 780

Query: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840
           AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF
Sbjct: 781 AARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHHF 840

Query: 841 SESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900
           SESAWLNLLKEKRFP+DTVIELIHKV MVLTRNESPNPVFKNLLLSCKEFCRTRISLADH
Sbjct: 841 SESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLADH 900

Query: 901 RLEETVY 908
           RLEETVY
Sbjct: 901 RLEETVY 907

BLAST of CSPI01G25380.1 vs. NCBI nr
Match: gi|659118444|ref|XP_008459122.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis melo])

HSP 1 Score: 1714.1 bits (4438), Expect = 0.0e+00
Identity = 853/909 (93.84%), Positives = 874/909 (96.15%), Query Frame = 1

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPG--TDLSLSDAKNRVL 60
           MVGVIMAN+NL IPNCERYGFPTLHCTHNSH SFWVSFFPSSV G  TDL+ SDAKNRVL
Sbjct: 1   MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60

Query: 61  RHRVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDP 120
           RHR+HKCGSIKALSNGESDISLP+GNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLD P
Sbjct: 61  RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120

Query: 121 NKLTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIARE 180
           NKLTMKEN SAKSAESTSISKIDNGKNKVTDVQHNV+VKNMFKRVDKKDLFNNTERIARE
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180

Query: 181 KDLSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSEK 240
           K LSGNKFDR K VTRSNDKVKGKMTPFGSLVNDKQHEEK+N NWSSYIEP+VTRSN EK
Sbjct: 181 KHLSGNKFDRSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCEK 240

Query: 241 PIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQLERSYNP 300
           PIHFKAN LE KKE SRVS GNSMKTSEKIWAWG+DDAKP K VLKAGKYGIQLERSY+P
Sbjct: 241 PIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYSP 300

Query: 301 GDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEE 360
           GDKVGRKKTEQSYRGTSTSGKRFLEF E+NSLEVEHAAFNNFDA DIMDKPRVSKMEMEE
Sbjct: 301 GDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEMEE 360

Query: 361 RIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWL 420
           RIQMLSKRLNGADIDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWL
Sbjct: 361 RIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWL 420

Query: 421 QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ 480
           QMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ
Sbjct: 421 QMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQ 480

Query: 481 AGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV 540
           AGYMRELFDVIDSM+SPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV
Sbjct: 481 AGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWV 540

Query: 541 LQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE 600
           LQELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE
Sbjct: 541 LQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKE 600

Query: 601 GKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY 660
           GKTDEAVLAIENME+RG+VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY
Sbjct: 601 GKTDEAVLAIENMEMRGVVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTY 660

Query: 661 TGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQ 720
           TGLIQACLDSKDLQSAVY+FN MKAFCSPNLVTYNILLKGYLEHGMFEEAREL QNLSEQ
Sbjct: 661 TGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVTYNILLKGYLEHGMFEEARELLQNLSEQ 720

Query: 721 RRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI 780
           R+NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI
Sbjct: 721 RQNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMI 780

Query: 781 LEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAH 840
           LEAAR GKDELLETTWKHLAQADRTPPPPLLKERFCMK+ARGDY+EAL  I +HNSGDAH
Sbjct: 781 LEAARVGKDELLETTWKHLAQADRTPPPPLLKERFCMKVARGDYTEALRCISNHNSGDAH 840

Query: 841 HFSESAWLNLLKEKRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLLSCKEFCRTRISLA 900
           HFSESAWLNLLKEKRFPKDTVIELIHKV MV   NESPNPVFKNLLLSCKEFCRTRIS+A
Sbjct: 841 HFSESAWLNLLKEKRFPKDTVIELIHKVGMVFATNESPNPVFKNLLLSCKEFCRTRISVA 900

Query: 901 DHRLEETVY 908
           DHRLEETV+
Sbjct: 901 DHRLEETVH 909

BLAST of CSPI01G25380.1 vs. NCBI nr
Match: gi|645238617|ref|XP_008225762.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Prunus mume])

HSP 1 Score: 905.6 bits (2339), Expect = 7.0e-260
Identity = 495/921 (53.75%), Positives = 626/921 (67.97%), Query Frame = 1

Query: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAK-NRVLR 60
           MVG+IM N  L + N +R      +C          S F   +    L   + K NR   
Sbjct: 1   MVGMIMTNAQLGVSNFQRNDIFAANCISKPGPLSGFSLFRRPIFCVGLYEKNVKKNRGFG 60

Query: 61  HRV-HKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDP 120
            ++ ++   I A+S   SD     G +LE +F+FKPSFD+Y+KVM TVR R  + + D  
Sbjct: 61  IKIPNRRTVISAVSKEGSDNRSVGGEILEKEFEFKPSFDQYLKVMGTVRLRSDRDKQDSS 120

Query: 121 NKLTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIARE 180
            +   K N  ++    + +S+ +    K+ + + + + +   K   + +   N   I  +
Sbjct: 121 KEQNPKHNLRSRGVSRSLVSEGNEEHVKLGESEGHSNQEKASKAAKQNEALGNRNGIMGK 180

Query: 181 KDLSGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKR---------NENWSSYIEP 240
               G               VKG    + S  +++  +EK+            +S  +EP
Sbjct: 181 SKRQG---------------VKGFKDEYDSRQSNRDEKEKKKIRGEARDGRSKYSGRLEP 240

Query: 241 RVTRSNSEKPIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYG 300
                  E     K+ M    K+  RV      K+++K +  G    K   G        
Sbjct: 241 -------ELNFRGKSTMARNMKDDLRV-----YKSTDKSFERGKVGVKIQGG-------- 300

Query: 301 IQLERSYNPGDKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAF-DIMDK 360
             LER++   +K   +   +     + SG+ F + N  NS++VE AAF NFD F DIMDK
Sbjct: 301 --LERNHINAEKATDRGFSRRSEKLTKSGRDFPKKNYDNSMKVERAAFKNFDEFGDIMDK 360

Query: 361 PRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNW 420
           PRVS+MEMEERIQ L+K LNGADIDMPEWMFS+MMRSA+IR++DHSILRVIQ+LGKLGNW
Sbjct: 361 PRVSQMEMEERIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLLGKLGNW 420

Query: 421 RRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVA 480
           RRVLQ+IEWLQMRERFKSHKLR+IYTTALDVLGKARRPVEALNVFHAM +  SSYPDLVA
Sbjct: 421 RRVLQVIEWLQMRERFKSHKLRYIYTTALDVLGKARRPVEALNVFHAMLQEMSSYPDLVA 480

Query: 481 YHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVK 540
           YHSIAVTLGQAG+MRELFDVID+M+SPPKKKFKTG L KWDPRL+PDIV+++AVLNACV+
Sbjct: 481 YHSIAVTLGQAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDIVVFHAVLNACVQ 540

Query: 541 RKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTY 600
           RK  EGAFWVLQ+L++Q LQP+T+TYGLVMEVML CGKYNLVH+FF+KVQKSSIPNALTY
Sbjct: 541 RKQWEGAFWVLQQLQQQGLQPATTTYGLVMEVMLACGKYNLVHDFFKKVQKSSIPNALTY 600

Query: 601 KVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICK 660
           +V+VNTLW+EGK DEAVL ++NME RGIVGSAALYYDFARCLCSAGRC+EALMQ+EKICK
Sbjct: 601 RVIVNTLWREGKVDEAVLVVQNMERRGIVGSAALYYDFARCLCSAGRCQEALMQIEKICK 660

Query: 661 VANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEA 720
           VANKPLVVTYTGLIQACLD+  +++  Y+F  M+ FCSPNLVT N +LKGYL+HGMFEEA
Sbjct: 661 VANKPLVVTYTGLIQACLDAGSIKNGAYVFKQMENFCSPNLVTCNTMLKGYLDHGMFEEA 720

Query: 721 RELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYH 780
           +ELF  + +   NIS+ SDY+ RV+PD Y FNT+LDA   EKRWDDF + Y  M  +GYH
Sbjct: 721 KELFLKMLDDGNNISSKSDYKVRVIPDSYTFNTLLDACIIEKRWDDFEFVYKMMLHHGYH 780

Query: 781 FNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSS 840
           FN KRHLRMIL+A   GK ELL+ TW HL +A R+PPPPL+KERFC KL + DY+ ALS 
Sbjct: 781 FNAKRHLRMILDAREAGKGELLDITWTHLTEAGRSPPPPLVKERFCTKLEKDDYAAALSC 840

Query: 841 IWSHNSGDAH-HFSESAWLNLLKE--KRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLL 900
           I + N G+    FS++AWL L KE  +RF KDT + L+H+ S+++ R +  NPVF+NL+ 
Sbjct: 841 ITNPNLGELRTFFSKNAWLKLFKENAERFQKDTFVRLVHEGSILINRTDRSNPVFQNLMA 884

Query: 901 SCKEFCRTRISLADHRLEETV 907
           +C E  RT +  AD +  ETV
Sbjct: 901 ACGELDRTCLVGADFKPSETV 884

BLAST of CSPI01G25380.1 vs. NCBI nr
Match: gi|657999772|ref|XP_008392321.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic-like [Malus domestica])

HSP 1 Score: 896.7 bits (2316), Expect = 3.3e-257
Identity = 501/921 (54.40%), Positives = 628/921 (68.19%), Query Frame = 1

Query: 4   VIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAK-NRVLRHR- 63
           ++MAN    + N +R G    +C   S      S F   + G  L+  + K NRV   + 
Sbjct: 4   MVMANAQPGVSNFQRNGVFATNCCPKSLPLSGFSIFRRPIFGIGLNEKNVKRNRVFGIKF 63

Query: 64  VHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKL 123
           V+    I A+S   S+I       LE +F+FKPSFD+Y+KVM TVR R      D   + 
Sbjct: 64  VNSRTVISAVSKEGSEI-------LEKEFEFKPSFDQYLKVMGTVRLRS-----DRDRQQ 123

Query: 124 TMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKDL 183
             KE     S  S  +S+    +    D +      N+ +  +K   F N     R + L
Sbjct: 124 RSKEENPKHSVRSRGVSRRLLSEGSEEDAKLGEPEGNLNR--EKASKFEN-----RYESL 183

Query: 184 SGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEK-------RNENWSSY---IEPRV 243
            GN    R   T  +++V+G    + S  N+K  ++K       R+  WS Y   +EP +
Sbjct: 184 -GN----RNGSTHESERVEGFKDEYDSRQNNKDEKDKKMIRGETRDGRWSKYTGRVEPGL 243

Query: 244 TRSNSEKPIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQ 303
                   +    +   V     +  D     T  +    G    K     ++ GK+G++
Sbjct: 244 DFKGKSTTVRNAKDGPGVTGRLEQEVDFKGKSTMARNARDGLRVYKSRDKAVERGKFGVR 303

Query: 304 LERSYNPGDKVGRKKTEQSY--RGTSTSGKRFLE-FNEKNSLEVEHAAFNNFDAF-DIMD 363
            E      D    K T++ +  R  + SG+ F + FNEK SLEVE AAF NFD F DIMD
Sbjct: 304 NEDGVERNDSNADKATDRGFVPRSVTKSGRDFPKRFNEK-SLEVERAAFQNFDEFGDIMD 363

Query: 364 KPRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGN 423
           KPRVS+MEME+RIQ L+K LNGADIDMPEWMFS+MMRSA+IR++DHSILRVIQ+LGKLGN
Sbjct: 364 KPRVSQMEMEQRIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLLGKLGN 423

Query: 424 WRRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLV 483
           WRRVLQ+IEWLQMRERFKSHKLR+IYTTALDVLGKARRPVEALNVFHAM E  SSYPDLV
Sbjct: 424 WRRVLQVIEWLQMRERFKSHKLRYIYTTALDVLGKARRPVEALNVFHAMLEQMSSYPDLV 483

Query: 484 AYHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACV 543
           AYHSIAVTLGQAG+MRELFDVID+M+SPPKKKFKTG L KWDPRL+PDIV+++AVLNACV
Sbjct: 484 AYHSIAVTLGQAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDIVVFHAVLNACV 543

Query: 544 KRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALT 603
           +RK  EGAFWVLQ+LK+Q LQP+T+TYGLVMEVML CGKYNLVHEFF+KVQKSSIPNALT
Sbjct: 544 QRKQWEGAFWVLQQLKQQGLQPATTTYGLVMEVMLACGKYNLVHEFFKKVQKSSIPNALT 603

Query: 604 YKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKIC 663
           Y+V+VNTLW+EGK DEAV  + NME RGIVG AALYYDFARCLCSAGRC+EALMQ+EKIC
Sbjct: 604 YRVIVNTLWREGKIDEAVSVVHNMERRGIVGYAALYYDFARCLCSAGRCQEALMQIEKIC 663

Query: 664 KVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEE 723
           KVANKPLVVTYTGLIQACLD+  +++A Y+F  M+ FCSPNLVT NI+LK YL+H MFE+
Sbjct: 664 KVANKPLVVTYTGLIQACLDTGSVENAAYVFKQMENFCSPNLVTCNIMLKAYLDHRMFEK 723

Query: 724 ARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGY 783
           A++LF  + +   NI+  SDY+ R++PD Y FNT+LDA   EKRWDDF Y Y +M  +G+
Sbjct: 724 AKDLFLRMLDDGNNITNGSDYKVRIIPDSYTFNTLLDACVTEKRWDDFEYVYRRMLHHGF 783

Query: 784 HFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALS 843
           HFN KRHLRMIL+A + G+ ELL+ TW HL +ADR PPPPL+KERFC KL + DY+ ALS
Sbjct: 784 HFNAKRHLRMILDACKAGRAELLDMTWMHLTEADRIPPPPLVKERFCTKLEKDDYAAALS 843

Query: 844 SIWSHNSGDAHHFSESAWLNLLKE--KRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLL 903
            I + N G+   FS++AWL L KE  +RF  DT + L+ + S+++ R++  NPVF+NL+ 
Sbjct: 844 CITTQNLGELQAFSKTAWLKLFKENAERFQNDTFVRLVDEGSILVNRSDRSNPVFQNLMA 899

Query: 904 SCKEFCRTRISLADHRLEETV 907
           +C E  R R++ A     ETV
Sbjct: 904 ACGEVDRIRLAGAAGSTRETV 899

BLAST of CSPI01G25380.1 vs. NCBI nr
Match: gi|694367514|ref|XP_009362169.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Pyrus x bretschneideri])

HSP 1 Score: 884.8 bits (2285), Expect = 1.3e-253
Identity = 487/921 (52.88%), Positives = 625/921 (67.86%), Query Frame = 1

Query: 4   VIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVPGTDLSLSDAK-NRVLRHR- 63
           ++MAN    + N +R G     C   S      S F   + G  L+  + K NRV   + 
Sbjct: 4   MVMANAQPGVSNFQRNGVFATDCCPKSLPLSGFSIFRRPIFGIGLNEKNVKRNRVFGIKF 63

Query: 64  VHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNKL 123
           V+    I A+S   S+I       LE +F+FKPSFD+Y+KVM TVR R  + +     + 
Sbjct: 64  VNSRTVISAVSKEGSEI-------LEKEFEFKPSFDQYLKVMGTVRLRSDRDKQQRSKEE 123

Query: 124 TMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAREKDL 183
             K +  ++      +S+    + K+ + + N++ +   K  ++ +L  N          
Sbjct: 124 NPKHSVRSRGVSRRLLSEGSEEEAKLGEPEGNLNREKASKVENRYELLGN---------- 183

Query: 184 SGNKFDRRKVVTRSNDKVKGKMTPFGSLVNDKQHEEK-------RNENWSSY---IEPRV 243
                  R   T    +VKG    + S  N+K  ++K       R+  WS Y   +EP +
Sbjct: 184 -------RNGSTHERQRVKGFKDEYDSRQNNKDEKDKKMIRGETRDGRWSKYTGRVEPGL 243

Query: 244 TRSNSEKPIHFKANMLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPPKGVLKAGKYGIQ 303
                   +    +   V     +  D     +  +    G    +     ++ GK+G++
Sbjct: 244 DFKGKSTTVRNAKDGPGVTGRLEQEVDFKGKSSMARNARDGPRVYQSRDEAVERGKFGVR 303

Query: 304 LERSYNPGDKVGRKKTEQSY--RGTSTSGKRFLE-FNEKNSLEVEHAAFNNFDAF-DIMD 363
            E           K T++ +  R  + SG+ F + FNEK SLEVE AAF NFD F DIMD
Sbjct: 304 NEDGVERNHSNADKATDRGFVPRSVTKSGRDFPKRFNEK-SLEVERAAFRNFDEFGDIMD 363

Query: 364 KPRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGN 423
           KPRVS+MEME+RIQ L+K LNGADIDMPEWMFS+MMRSA+IR++DHSILRVIQ+LGKLGN
Sbjct: 364 KPRVSQMEMEQRIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLLGKLGN 423

Query: 424 WRRVLQIIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLV 483
           WRRVLQ+IEWLQMRERFKSHKLR+I+TTALDVLGKARRPVEALNVFHAM E  SSYPDLV
Sbjct: 424 WRRVLQVIEWLQMRERFKSHKLRYIFTTALDVLGKARRPVEALNVFHAMLEQMSSYPDLV 483

Query: 484 AYHSIAVTLGQAGYMRELFDVIDSMQSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACV 543
           AYHSIAVTLGQAG+MRELFDVID+M+SPPKKKFKTG L KWDPRL+PD+V+++AVLNACV
Sbjct: 484 AYHSIAVTLGQAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDVVVFHAVLNACV 543

Query: 544 KRKNLEGAFWVLQELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALT 603
           +RK  EGAFWVLQ+LK+Q LQP+T+TYGLVMEVML CGKYNLVHEFF+KVQKSSIPNALT
Sbjct: 544 QRKQWEGAFWVLQQLKQQGLQPATTTYGLVMEVMLACGKYNLVHEFFKKVQKSSIPNALT 603

Query: 604 YKVLVNTLWKEGKTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKIC 663
           Y+V+VNTLW+EGK DEAV  I NME RGIVG AALYYDFARCLCSAGRC+EALMQ+EKIC
Sbjct: 604 YRVIVNTLWREGKIDEAVSVIHNMERRGIVGYAALYYDFARCLCSAGRCQEALMQIEKIC 663

Query: 664 KVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEE 723
           KVA+KPLVVTYTGLIQACLD+  +++A Y+F  M+  CSPNLVT NI+LK YL+HGMFE+
Sbjct: 664 KVASKPLVVTYTGLIQACLDAGSVENAAYVFKQMENICSPNLVTCNIMLKAYLDHGMFEK 723

Query: 724 ARELFQNLSEQRRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGY 783
           A++LF  + +   NI++ SDY+ R++PD Y FNT+LDA  AEKRWDDF Y Y +M  +G+
Sbjct: 724 AKDLFLRMLDDGNNITSRSDYKVRIIPDSYTFNTLLDACVAEKRWDDFEYVYKRMLHHGF 783

Query: 784 HFNPKRHLRMILEAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALS 843
           HFN KRHLRMIL+A +  K ELL+ TW HL +ADR PPPPL+KERFC KL + DY+ ALS
Sbjct: 784 HFNAKRHLRMILDACKAEKAELLDITWMHLTEADRIPPPPLVKERFCTKLEKNDYAAALS 843

Query: 844 SIWSHNSGDAHHFSESAWLNLLKE--KRFPKDTVIELIHKVSMVLTRNESPNPVFKNLLL 903
            + + N G+   FS++AWL L  E  +RF KDT + L+ + S+++ R++  NPV++NL+ 
Sbjct: 844 CVTTQNLGEPQAFSKAAWLKLFMENAERFQKDTFVRLVDEGSILVNRSDRSNPVYQNLMA 899

Query: 904 SCKEFCRTRISLADHRLEETV 907
           +  E  R R++ A     ETV
Sbjct: 904 ASGEVDRIRLTGAAVSTRETV 899
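
The Identity and Positives percentages in the HSP headers above are the matching counts divided by the alignment length. A minimal Python sketch of that arithmetic follows; the counts are copied from the two HSP headers above, while the function name `percent` and the dictionary `hsps` are illustrative names and not part of the BLAST output.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format matches / aligned_length as a percentage with two decimals."""
    return f"{100 * matches / aligned_length:.2f}%"

# (identities, positives, alignment length) copied from the HSP headers above
hsps = {
    "XP_008392321.1": (501, 628, 921),
    "XP_009362169.1": (487, 625, 921),
}

for accession, (identities, positives, length) in hsps.items():
    print(accession, percent(identities, length), percent(positives, length))
```

The denominator is the alignment length (921 here, including gap columns), not the query length of 907 residues.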

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
PPR64_ARATH    4.3e-209    58.15    Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidop...
PP451_ARATH    4.0e-106    37.91    Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidop...
PPR96_ARATH    1.6e-22    25.90    Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop...
PPR91_ARATH    2.1e-22    25.48    Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop...
PP389_ARATH    3.6e-22    24.52    Pentatricopeptide repeat-containing protein At5g16640, mitochondrial OS=Arabidop...
Match Name    E-value    Identity    Description
A0A0A0LVN7_CUCSA    0.0e+00    99.01    Uncharacterized protein OS=Cucumis sativus GN=Csa_1G553530 PE=4 SV=1
M5WJN1_PRUPE    1.1e-251    55.77    Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001195mg PE=4 SV=1
W9RFN3_9ROSA    4.8e-247    52.71    Uncharacterized protein OS=Morus notabilis GN=L484_025948 PE=4 SV=1
B9T6B9_RICCO    1.2e-245    54.15    Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
A0A061FSP7_THECC    4.2e-243    54.95    Pentatricopeptide repeat-containing protein isoform 1 OS=Theobroma cacao GN=TCM_...
Match Name    E-value    Identity    Description
AT1G30610.1    2.4e-210    58.15    pentatricopeptide (PPR) repeat-containing protein
AT5G67570.1    2.3e-107    37.91    Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G62930.1    9.1e-24    25.90    Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G62670.1    1.2e-23    25.48    rna processing factor 2
AT5G16640.1    2.0e-23    24.52    Pentatricopeptide repeat (PPR) superfamily protein
Match Name    E-value    Identity    Description
gi|778662053|ref|XP_004135752.2|    0.0e+00    99.01    PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic ...
gi|659118444|ref|XP_008459122.1|    0.0e+00    93.84    PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic ...
gi|645238617|ref|XP_008225762.1|    7.0e-260    53.75    PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic ...
gi|657999772|ref|XP_008392321.1|    3.3e-257    54.40    PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic-...
gi|694367514|ref|XP_009362169.1|    1.3e-253    52.88    PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic ...
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term    Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term    Definition
GO:0005515    protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name    Type
CSPI01G25380    CSPI01G25380    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name    Type
CSPI01G25380.1    CSPI01G25380.1-protein    polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name    Unique Name    Type
CSPI01G25380.1.utr5p1    CSPI01G25380.1.utr5p1    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name    Unique Name    Type
CSPI01G25380.1.cds1    CSPI01G25380.1.cds1    CDS
CSPI01G25380.1.cds2    CSPI01G25380.1.cds2    CDS
CSPI01G25380.1.cds3    CSPI01G25380.1.cds3    CDS
CSPI01G25380.1.cds4    CSPI01G25380.1.cds4    CDS
CSPI01G25380.1.cds5    CSPI01G25380.1.cds5    CDS
CSPI01G25380.1.cds6    CSPI01G25380.1.cds6    CDS
CSPI01G25380.1.cds7    CSPI01G25380.1.cds7    CDS
CSPI01G25380.1.cds8    CSPI01G25380.1.cds8    CDS
CSPI01G25380.1.cds9    CSPI01G25380.1.cds9    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name    Unique Name    Type
CSPI01G25380.1.utr3p1    CSPI01G25380.1.utr3p1    three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR    coord: 432..458, score: 0
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2    coord: 514..558, score: 9.8
IPR002885    Pentatricopeptide repeat    PFAM    PF13812    PPR_3    coord: 654..699, score: 8.5
IPR002885    Pentatricopeptide repeat    TIGRFAMs    TIGR00756    TIGR00756    coord: 517..550, score: 2.3E-4; coord: 690..718, score: 1.2E-6; coord: 432..465, score: 2.6E-4; coord: 656..683, score: 0.
IPR002885    Pentatricopeptide repeat    PROFILE    PS51375    PPR    coord: 735..769, score: 7.278; coord: 429..459, score: 7.015; coord: 654..684, score: 7.103; coord: 515..549, score: 10.972; coord: 688..722, score: 10.786; coord: 465..495, score: 6.182; coord: 619..653, score: 6.171; coord: 550..580, score: 6.643; coord: 584..618, score: 8
IPR011990    Tetratricopeptide-like helical domain    GENE3D    G3DSA:1.25.40.10    coord: 552..634, score: 7.4E-4; coord: 635..715, score: 8.
None    No IPR available    unknown    Coil    Coil    coord: 353..373, scor
None    No IPR available    PANTHER    PTHR24015    FAMILY NOT NAMED    coord: 349..776, score: 2.5E
None    No IPR available    PANTHER    PTHR24015:SF327    SUBFAMILY NOT NAMED    coord: 349..776, score: 2.5E
None    No IPR available    unknown    SSF81901    HCP-like    coord: 525..720, score: 3.
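
The `coord: start..end` pairs in the InterPro table above give each hit's position on the protein. As a hedged illustration, the Python sketch below extracts the PS51375 (PPR profile) coordinates listed above, sorts them, and prints each span's length. The names `ps51375_hits` and `parse_coords` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption rather than something the page states.

```python
import re

# PS51375 (PPR profile) coordinate pairs copied from the InterPro table above.
ps51375_hits = (
    "coord: 735..769 coord: 429..459 coord: 654..684 coord: 515..549 "
    "coord: 688..722 coord: 465..495 coord: 619..653 coord: 550..580 "
    "coord: 584..618"
)

def parse_coords(text):
    """Return sorted (start, end) tuples for every 'coord: a..b' in text."""
    pairs = re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)
    return sorted((int(start), int(end)) for start, end in pairs)

spans = parse_coords(ps51375_hits)
for start, end in spans:
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    print(f"{start}..{end}  length {end - start + 1}")
```

Under that assumption, every PS51375 hit spans either 31 or 35 residues.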