Cp4.1LG15g06440 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG15g06440
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG15 : 7112476 .. 7120947 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACCGAGCAGCCATGCTCTCTAAAGCCGTTTACCTGAATTCCAAAAGCCCCGAATTAGCATGGCTTCTCTTCAAGCGAGTTCTTTCTTCCCCCATCTCTGCCTCCTCTTCCTTCTTCAAAACCTCCCTTCAATCTATACCTCTTATTGCCCGCATCCTCACCACTGCAAAAATGCACCCACAGATCGATCACCTTCACCAACTCCTCTTGTCGCAGCACCGGGATTTTGCTCATCCATCTGGATTTGCACTCGTTCGAGCCTTGGCTGATTTGGGTCTCTTCGAAAATGCCATTTCTCAGTTTCGATCGCTTCGAGCCAGGTTTCCTAATGACCCTCCAGATATTTCTTTCTATAATTTTCTGTTTCGATGCTCTTTGAAGGAGAGCCGGGTTGATTTTGTGATTTGGCTTTATAAGGATATGGTTTTTGCGAGAGTTAACCCACAGACTTATACTTTTAATCTGTTGATACGTGCGCTTTGTGAAATGGGGTACTTGGAGAATGCACGTCAAGTGTTTGATAAAATGTCTGAAAAAGGGTGTAACCCTAATGAGTTTAGTCTTGGACTTATAGTTCGTGGGTATTGTAGAGCTGGGCTCCATGATCGAGGTATTGAACTTTTGGACGAGATGAGGAGCTCTGGTGCTCTTCCCAATAGGGTTGCATACAATACTGTGATATCTTCTCTTTGTGGAGAAGGTCAGACTGGGGAGGCTGAGAAATTGGTGGAAAGGATGAGAGAGGTTGGTCTTTCTCCAGATATTGTAACTTTCAATTGCAGAATTGCTGCCCTCTGTAAATCCGGGCAAATTTTAGAAGCTTCCAGAATTTTTAGAGATATGCAAATAGATGAAGAGTTAGGGTTACCTCAGCCTAATACCGTAACATATAATTTAATGCTAGAAGGATTTTGTAATGAAGGAATGTTCGAGGAATCCAAGGCTCTCTTTGATTCTATGAAAAAATCTGAAACTCATTTGACCATGGAGAGCTATAACATATGGTTGTTAGGTTTGGTTAGAAGTGGAAAGCTCCTTGAAGCTCGTTTAATTCTTAATGAAATGGCAGAAAAGAGTATAAAACCCAATCTTTACTCCTATAACATTTTGATTTATGGCCTTTGTAAATATGGAATGTTTTCTGATGCAAGATCTATAATAGGTCTAATGAGAGAGAGTGGTGTAGCTCCAGATATTGTATCTTATAGTACCTTACTTCATGGATACTGCTGTAGAGGAAAGATACTTGAATCCAATTATGTTCTTCGCGAAATGATACAGGTTGGTTGTTTTCCCAATATGTATACTTGTAATATCCTGCTTCACAGCCTGTGGAAAGAGGGGAAAGTATCAGAGGCAGAAGAGTTGCTACAAAAGATGAATGAAAGAGGTTATGGCTTGAATAATGTAACTTGTAATACAGTGATTAAGGGCCTCTGTAAATCTGGGAATCTGGACAAAGCTATTGAAATAGTGAGTGGCATGTGGAACCATGGAAGCGCTTCTCTTGGTAATCTTGGAAACTCTTTTATTGGTCTTTTTGATATTGGCAATAATGGGATGAAGTGTTTACCTGACTCGATCACATATGCAACCATAATAAGTTGGTTATGTAAGGCGGGGCGGGTTGATGAAGCAAAAAAGAAGCTTCTGGAGATGATTGGGAAAAAGCTATCTCCGGATTCGCTAATATTTGATACTTTCATACATAGTTACTGTAAACAAGGAAAGTTGTCATCTGCTTTTAGAGTACTCAAGGAAATGGAGAAAAAAGGGTGCAACAAGAGCCTTCGAACGTATAATTCATTGATCCAGGGTTTAAGTTCCAAAAATCAAATATTTGAAATATATGGGTTGATGGAGGAGATGAAAGAAAAAGGGATTTTTCCTAATGTTTACACTTACAATAACATTATTAGCTGCCTTTCTGAAGGTGGGAAACTGAAAGATGCCACCAGTCTTTTGGATGAAATGCTGCAGAAGGGGATATCTCCTAATATATATACGTTTAGGATTTTAATTGGAGCTTTCTTTAAGGCTTGCGACTTTGGAGCCGCTCAAGAGTTATTTGAGATAGCTTTAAGCATATGTGGCCACAAGGAATCCTTGTATAGTTTTATGTTCAATGAGCTATTAACTGGAGGTGAAACATCCAAGGCTAAAGAGCTTTTTGAAGCTGCATTAGATAGATCTCTAGCCTTGAAAAATTTTCTTTACAGGGACCTAATTGAAAGGCTTTGCATGGACGGAAAGTTAGATGATGCTAGTTTCATTCTTCATAAGATGATGGATAAGCAGTATAGGTTTGACCCTGCATCATTCATGCCAGTGATTGATGGATTAGGTAAAAGTGGGAACAAGCATGCAGCCGATGAATTTGCAGAAAAAATGATGGAAATGGCTTCAGAAACTGACATCAACCAACATGAGAATAAGATTATCCGAGGAAGATCAAATAATGATGATGAAAGAGATTGGCACAAGATCGTTCACAGGTATAATTCTTTAAAAATATACTTGATGTGGTTCTTATTTATCTCCATGCTGCATAATCGAGTATTTGGTTGAGAACATTGGTTATCTGACATTATTCAAATTCAACTGAAGTGTTTCCTAATTTCCTGTTGTTCTTAATATACTGTAAGAGGAATATAAGTTAATTGGTGGCAATTTTAAGTTTTAAATACGTATCACTTATTTTGGTCCAAATATTGAATGAGTTCATCGATAGGATTTTAGCATCCAATCATTACCCTTAAGATTCTTGCAATTTGTTTATCCTAATAATCCACAAAAAGAATCATAAAGATATACTTTTTCCCTGCATGATGCATATTTTTAGGCAGTTCTTTTTTTCCACGGCAACACGTAAGATGATCCTTTGAGGGTCTTTTAATGTCACAACCTTAACTATAGAGTTAAATCCCTCTGGAGACGACCTCTAGTAGTAGATTGTAATTTCCAGTTATCAAATCAGTTGAACATTAGGCCAAGGATAGTTAAAATAATGAGGCTATTCATTAAATTAATTTATTTTGTATGGTCAATGAATGTTTAATTTTCTGTTCCATGTGTGTCATGTGCTTGGCTTGGACCTGCACAACTTCAACAGAAACGATGGCAGTGGGATTGCACAGAAGACTCTTAAGCGTGTGTTGAAAGGATGGGGTCAAGGAAGTATATCAACTTCGCAGCCACAGAAATTTAGCACACATGATTGCTGGGATGGTGTTGCTTAAAGCTTTTAAAGTTCTTGAGGAACAGGCAAATATGGGAACATGCTCGAACCTGCTGAGGCATCAAATCTTCAAGTCAGGGTGCTTAAGGACATCAGTACATCAGGTAGAGCCATTTTTTCTGGATTTTTTTAACAATGAGAAGATAGGTAGAGAACGATTAACTTTTTTCTGGGTTCTGCATGCTTAATAATTTCCAATTATTCTTCAATATTTAATCAAGGGTTTTACCTATATTAGACTTGAATTGGAGGTGGGCTCAAACTTCTAATTGTAGTGAAAGAACTAGGAATGAATGAGATTCATTTTGCTATAATGGCTGATTGATAGTTTTATTCTGTTATATACATTTTCAAGCGATAATGCAATTTTTATCTAGTGATACCATTTGGTAGAGAAGAAAAGGAATAATAAGTACACAAGCACTGAGCTTTTGTATACATTATGGAAAGTGGAAACTCGAGCACCATTTGGTAGCTTTACTCTGTCAGCTTGGAGAGCTATTAGAGAGTAGTCTGGGAATAATTTCTTCCACGTTCAAATGATTTGGAACGCTTACAAGTTTTAAACTCGAGCACGTTCTGTTATGTTATTTCTAGATCTTACAGCCTAGTACAATGGAATATGGAAGGGAGCGTCGAACTAAGAAACGGTGAGAAAGCNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGTTTTTAGACTCTGGAACTTCACTATAGCATGAAAGAAATATAAATGCACTGGGGCCATCACATTGTTATGTGTTGACATGTAAAATATTACATTAATATCACCGTATGCACTTCGCGAAGTATGTCTATTGTGAAATACGGTATTCATTATGAAATGTTGTATAGACAAGTAATTATATGATTTTTGGAGGTGGGGAATGGTTTAAATTCACATAGAACCACTGGCCTTTCATGTAATCCCTCGACTAGGTAGATCAAAAGTTCGTTCTTATATGACACGTACCTAGCCTGAAAGTTTTACAAGCATCCCAATATGTTATATTTATTTTCTCTACTATGCCATTTATCTTCTAAAAAGGATAAAAGTAACTCTTTTTCTGCTACAAAAAGTGACTTTGATGTTAGTTTAGTCTTCAGCTGAGTATCCCACAATTAGCAACTGCTACGTTATGAAAAATAACTGATAAGTTCTCTAAAATAGTAAGTATGATGGTTTCAGGATGGACTGCTACTGCCAAAGAAAGAAATCCATTAGAGGCAGATTCGACAGTTTCACATGTGAGTAGAACTGATTCCCCATGGTTGTCGACATACCTGATATCGACTCATGTAATATTGTTCTAGCAAACCTCGAAGACCCCAATTTTGGTGCATCAACCTGATTTTTTCTTCACCAAAAGTGGCTTTCACTTTCAGCGCACCCCCATCTCGTGTTCGGGCGCCATGTAAGGGAGGTAACTATCAGGATGTTCGCTAAATGATAGGGGCTGTGAGATCTTAGAGTCTTTGGGTCCCCCCTGACATGGTGCGTACAATTGTGCTTTAATGCACGTTTTATTTAGCATTCCAACAGGCTCTTCTGACTGCCTGGACCAGAAGTCTGACTACATGAAGAGGTTCGGGATGTTAGACAAACACGACTCTCCAGAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTCATGGCTTTGCTTTGGGCTTCCCCAAAAGGCCTCATGCCAATGGAGTTAGTATTCCTCACTTATAAACCCATGATCATTCCCTAAATTAGTCGATGTGGGACTTCCATCATCCAACACCTCCCCTCGAACAAAGTGCGCCTCCCCTTAATCGAGGCTCGACTCCTTTGGAGTCTTAGTCATTTTTGACTGCCTTCGAGGAGGGGCTTGGCTCCTTTTCTTTAGGAGTTCTTTGTTCGATATTTGAGGATTTGAGGATTTACCAATCTAATCTATTGGCACGACTAAGTTTAGGGCATGACTCTGATACCATGTTAGACAAACACGACTCTCCAGAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTCATGGCTTTGCTTTGGGCTTCCCCAAAAGGCCTCATGCCAATGGAGTTAGTATTCCTCACTTATAAACCCATGATCATTCCCTAAATTAGTCGATGTGGGACTTCCATCATCCAACACGGGATATCGCAGGGGATCCATGGCTGAATGAGCCACTTTCAGCTTGATGGTTTATTTTACTCATGATGGTCACTCATCTTCATGGATGAATAATGGCCAATCTCAGGAAAACTGGGAGAGTTCAACTGTGAGAAGCTTTTATAGAAAGACCCATTCTGTGTTGCACCATCAGCACGTTCAACCGTGTCGATCACGAGCTGGAGTTTCCTTAGAGAATGGCCTACGTTCTTGATCTTTTGGGAAGGCCACCTTGTTATCCATGTTGCCTCCATATCATTTTCAGAGTAGTTGGACAAACTACACAAACAGGAAGTTCGAGCAGACAATGGCCAGGTAGCTAGTACATTATTACTCGATTGAAAAAAAACATAGAAAAAAATTCTAGTGGACTCATGCAATAAAAAAGTTGTCTGTGTATATATATATATATATATATATATATGTATTGTTCATATTCAAACTACTTTTTTCTGTTTAACCAGACTATTCTCTTCCAAATCAAGACATCAGTGGTACTGCAATGGATGTCTAGCATGGGACAAAGGCATCTCAATGGTGAAGGTAAACTTATTCAACTGATCATCAGCAAAAGATTGGACAGTTTTATTAATTAAACAGCTCAAAATAATTATGTTTCAAGAAATTTTATAACATTAACAAACATTAAACAAGAAACTAGACTCTTGAATTATAGTTGTTATTTATTTATCATTATTTTTTTAAAAAAGATTCCCTTCTTAGTTGATCTCAAACACTAAAACGGCTCTTTTCTTTTGAGGATCTCAACCTATCTTCACAGTAGTACGATGTCGTCTATTTTAGCCATAAACTCTTATGAATTTGCTTTTATACCATTACTCTTTTGCTTCTCCAGCCATGGTGAGACTTTATTTGCACTCCAAACATCAACATTAAGGGGGAGAAGCTGATGTTGGAGTCGTTTTGTTGCAGCAATTTCTGCCTCAAACCGTTCGTTCTTCTATTCATGTAAGAGTTTTTCAATGGCTCTCATTTTGATTTTAGCATGAACTAACTTATTTAGAGTCCAAAAATTTATGAACATCTTAGTTCATATAGTATTTAGAAGAGTAAACGAAAAATGATTAATTAGTGTCTGAAAATTCTTCCAAATTCACGAGCTGCTGTTTATGAACTAAGTTCAACCTTTTGTATTACCGGCCTCAACGACAACATATTTATATAACAGGGTAAGAAAAAAATAACAAGGTTTAAAAGGTTGATATCGATTTTTTTTTTTTTTTTTTTTAAAGTAAAAATAAAAAATAAAAGTATAAATAAGCATTTTATATTATATTCATATTTAATAATATTTTATTGATTATTTTATCATAGTGTTTTTTTTAATGTTTTTTTTTTTAATGATATATTGAACATGTCAATTTACCATTTATATTGATATCAAACTCATAAGCATAAAAATATGAAATATTGAGAAAATTTAATCATTTCAGTTAAACGAGGTTTAATATATATATATATATATATATAGATAAAAGTAGTTGATTTAATAAAATTTAAAAGAAAAATAATATAACAATAATTTTAAAATAAATAGAAACATTAGAACAGTGCAGTCTATCAGTAGTAGTAGAAGTAGTCAACAAATTCTCTAATCTCGAATACTTTAGCCATCAAATATCTAGAAAGTATAAAAACTTAACATGTCAAGAGAGCTTTTTTTTTAAAAAAGAACAGACACGAGCAAAAAATACATAATACATGAATCAAAATTATGTCTATGCCAAGAATCCTAATAAAAACTTTAGTTTCTTCGAAGTGTTTTTGTTATAATTGTCTCCTAATAAAGGCATTACGCTCGATTGAACTCTTGCTGACCCATAAGTCACAATAGCTCTTTTTTTTTTATAGCGGACTTGCGATCGTACGATACTCACTTTTCAGCAACAAGTGAGTTAGCCAATTCGTTTGCGCGCTGTACGCCTACGGCCAGACAGCGTATGCTCGGGCCTTTGACCCCGATTTAACAGGGCAGAAAATTTTTCAAAGAGTTTGAAAGCAAGATTTAAAGAGCGATTGAAAATGGTGGTATATAGCGTGCAAGCAAAGGCAACAACAACGGCTTAGCAGTATATAACTATTCGGGTAGGGGGATATGACTACTTTTACTTTCCGCTATGGCCATTGTTTTCCTAACTAGTAACTCCCTGTAATCTACTTTTTGTTGCATAATCTCAAAGTAGGTTTGATGCATATGAAGACTCAGTGCTGGCTAAGCTTGATATCTCAAACAACATTTGTCAAAGCATCAAGGATTGCAGCACAGAAGCAATTAGGATCAGTCAGACATGGAGGTTGATGTTTTTCTTTGATAATTTCAACTGTTTCTCCATGGAATCTGTGACCCCATGTTTCCAGTTGCTCAGGAGTAGGACAACCAAATCACCTATCAACAAGGATAAATGGAGGAGTGGACTTGGAGATCCAAACATCTAAACTGAGTAGCTAGAATGTTGGACTGCATTGAAACTGACTTCAGATCTTGATCTTTTATGGAACATTGTCTAATCACTTCATCTTATTTCATATGGAAGGGAGACACAGAGGGACGGTTCATGAGCGAAGCAGCTCGACTGTAAAGGAGAGGAGGCACAACGACGCAATTATCAGGGGCGTGCTTTACCACTGAGCTAATAGCCCGTTGTGCAGACCTCCCGAGGAGAAAGGAGGCTCCGCGCGAAGAGGATACCTCGTCCTCCATTTCAACCACCACGTTTTCATTATGGTCTTACAAAACGACAATTACTTAAATACGTTCGTATCGCCGGAACAAAAACACAGGTCTCTGCAAAGTCGTAAGACCACGTATATCGAATAGTCTCATGAAGCTAAAAAGATGTCTCTAAATGACAAGCTTAGATTAGTTTCCCCTTTCAGGGAAGCAGAGTAAATGCCTTAGTTGACATGTTCGAAACAATTTTCCTAGCACCAGACACTTTGATCATATCAACTATCAAATAGAACGGGCACACGGATATCAGCATCACAGAGGTTCTTGCTTCTAAATTTTGGTAAACTGAAATGTGGA

mRNA sequence

ATGGACCGAGCAGCCATGCTCTCTAAAGCCGTTTACCTGAATTCCAAAAGCCCCGAATTAGCATGGCTTCTCTTCAAGCGAGTTCTTTCTTCCCCCATCTCTGCCTCCTCTTCCTTCTTCAAAACCTCCCTTCAATCTATACCTCTTATTGCCCGCATCCTCACCACTGCAAAAATGCACCCACAGATCGATCACCTTCACCAACTCCTCTTGTCGCAGCACCGGGATTTTGCTCATCCATCTGGATTTGCACTCGTTCGAGCCTTGGCTGATTTGGGTCTCTTCGAAAATGCCATTTCTCAGTTTCGATCGCTTCGAGCCAGGTTTCCTAATGACCCTCCAGATATTTCTTTCTATAATTTTCTGTTTCGATGCTCTTTGAAGGAGAGCCGGGTTGATTTTGTGATTTGGCTTTATAAGGATATGGTTTTTGCGAGAGTTAACCCACAGACTTATACTTTTAATCTGTTGATACGTGCGCTTTGTGAAATGGGGTACTTGGAGAATGCACGTCAAGTGTTTGATAAAATGTCTGAAAAAGGGTGTAACCCTAATGAGTTTAGTCTTGGACTTATAGTTCGTGGGTATTGTAGAGCTGGGCTCCATGATCGAGGTATTGAACTTTTGGACGAGATGAGGAGCTCTGGTGCTCTTCCCAATAGGGTTGCATACAATACTGTGATATCTTCTCTTTGTGGAGAAGGTCAGACTGGGGAGGCTGAGAAATTGGTGGAAAGGATGAGAGAGGTTGGTCTTTCTCCAGATATTGTAACTTTCAATTGCAGAATTGCTGCCCTCTGTAAATCCGGGCAAATTTTAGAAGCTTCCAGAATTTTTAGAGATATGCAAATAGATGAAGAGTTAGGGTTACCTCAGCCTAATACCGTAACATATAATTTAATGCTAGAAGGATTTTGTAATGAAGGAATGTTCGAGGAATCCAAGGCTCTCTTTGATTCTATGAAAAAATCTGAAACTCATTTGACCATGGAGAGCTATAACATATGGTTGTTAGGTTTGGTTAGAAGTGGAAAGCTCCTTGAAGCTCGTTTAATTCTTAATGAAATGGCAGAAAAGAGTATAAAACCCAATCTTTACTCCTATAACATTTTGATTTATGGCCTTTGTAAATATGGAATGTTTTCTGATGCAAGATCTATAATAGGTCTAATGAGAGAGAGTGGTGTAGCTCCAGATATTGTATCTTATAGTACCTTACTTCATGGATACTGCTGTAGAGGAAAGATACTTGAATCCAATTATGTTCTTCGCGAAATGATACAGGTTGGTTGTTTTCCCAATATGTATACTTGTAATATCCTGCTTCACAGCCTGTGGAAAGAGGGGAAAGTATCAGAGGCAGAAGAGTTGCTACAAAAGATGAATGAAAGAGGTTATGGCTTGAATAATGTAACTTGTAATACAGTGATTAAGGGCCTCTGTAAATCTGGGAATCTGGACAAAGCTATTGAAATAGTGAGTGGCATGTGGAACCATGGAAGCGCTTCTCTTGGTAATCTTGGAAACTCTTTTATTGGTCTTTTTGATATTGGCAATAATGGGATGAAGTGTTTACCTGACTCGATCACATATGCAACCATAATAAGTTGGTTATGTAAGGCGGGGCGGGTTGATGAAGCAAAAAAGAAGCTTCTGGAGATGATTGGGAAAAAGCTATCTCCGGATTCGCTAATATTTGATACTTTCATACATAGTTACTGTAAACAAGGAAAGTTGTCATCTGCTTTTAGAGTACTCAAGGAAATGGAGAAAAAAGGGTGCAACAAGAGCCTTCGAACGTATAATTCATTGATCCAGGGTTTAAGTTCCAAAAATCAAATATTTGAAATATATGGGTTGATGGAGGAGATGAAAGAAAAAGGGATTTTTCCTAATGTTTACACTTACAATAACATTATTAGCTGCCTTTCTGAAGGTGGGAAACTGAAAGATGCCACCAGTCTTTTGGATGAAATGCTGCAGAAGGGGATATCTCCTAATATATATACGTTTAGGATTTTAATTGGAGCTTTCTTTAAGGCTTGCGACTTTGGAGCCGCTCAAGAGTTATTTGAGATAGCTTTAAGCATATGTGGCCACAAGGAATCCTTGTATAGTTTTATGTTCAATGAGCTATTAACTGGAGGTGAAACATCCAAGGCTAAAGAGCTTTTTGAAGCTGCATTAGATAGATCTCTAGCCTTGAAAAATTTTCTTTACAGGGACCTAATTGAAAGGCTTTGCATGGACGGAAAGTTAGATGATGCTAGTTTCATTCTTCATAAGATGATGGATAAGCAGTATAGGTTTGACCCTGCATCATTCATGCCAGTGATTGATGGATTAGGTAAAAGTGGGAACAAGCATGCAGCCGATGAATTTGCAGAAAAAATGATGGAAATGGCTTCAGAAACTGACATCAACCAACATGAGAATAAGATTATCCGAGGAAGATCAAATAATGATGATGAAAGAGATTGGCACAAGATCGTTCACAGAAACGATGGCAGTGGGATTGCACAGAAGACTCTTAAGCGTGTGTTGAAAGGATGGGGTCAAGGAAGCAAATATGGGAACATGCTCGAACCTGCTGAGGCATCAAATCTTCAAGTCAGGGTGCTTAAGGACATCAGTACATCAGGATGGACTGCTACTGCCAAAGAAAGAAATCCATTAGAGGCAGATTCGACAGTTTCACATGTGAGTAGAACTGATTCCCCATGGTTGTCGACATACCTGATATCGACTCATAGTAGTTGGACAAACTACACAAACAGGAAGTTCGAGCAGACAATGGCCAGACTATTCTCTTCCAAATCAAGACATCAGTGGTACTGCAATGGATGTCTAGCATGGGACAAAGGCATCTCAATGGTGAAGGGGGAGAAGCTGATGTTGGAGTCGTTTTGTTGCAGCAATTTCTGCCTCAAACCGTTCGTTCTTCTATTCATGTTTGATGCATATGAAGACTCAGTGCTGGCTAAGCTTGATATCTCAAACAACATTTGTCAAAGCATCAAGGATTGCAGCACAGAAGCAATTAGGATCAGTCAGACATGGAGGTTGATGTTTTTCTTTGATAATTTCAACTGTTTCTCCATGGAATCTGTGACCCCATGTTTCCAGTTGCTCAGGAGTAGGACAACCAAATCACCTATCAACAAGGATAAATGGAGGAGTGGACTTGGAGATCCAAACATCTAAACTGAGTAGCTAGAATGTTGGACTGCATTGAAACTGACTTCAGATCTTGATCTTTTATGGAACATTGTCTAATCACTTCATCTTATTTCATATGGAAGGGAGACACAGAGGGACGGTTCATGAGCGAAGCAGCTCGACTGTAAAGGAGAGGAGGCACAACGACGCAATTATCAGGGGCGTGCTTTACCACTGAGCTAATAGCCCGTTGTGCAGACCTCCCGAGGAGAAAGGAGGCTCCGCGCGAAGAGGATACCTCGTCCTCCATTTCAACCACCACGTTTTCATTATGGTCTTACAAAACGACAATTACTTAAATACGTTCGTATCGCCGGAACAAAAACACAGGTCTCTGCAAAGTCGTAAGACCACGTATATCGAATAGTCTCATGAAGCTAAAAAGATGTCTCTAAATGACAAGCTTAGATTAGTTTCCCCTTTCAGGGAAGCAGAGTAAATGCCTTAGTTGACATGTTCGAAACAATTTTCCTAGCACCAGACACTTTGATCATATCAACTATCAAATAGAACGGGCACACGGATATCAGCATCACAGAGGTTCTTGCTTCTAAATTTTGGTAAACTGAAATGTGGA

Coding sequence (CDS)

ATGGACCGAGCAGCCATGCTCTCTAAAGCCGTTTACCTGAATTCCAAAAGCCCCGAATTAGCATGGCTTCTCTTCAAGCGAGTTCTTTCTTCCCCCATCTCTGCCTCCTCTTCCTTCTTCAAAACCTCCCTTCAATCTATACCTCTTATTGCCCGCATCCTCACCACTGCAAAAATGCACCCACAGATCGATCACCTTCACCAACTCCTCTTGTCGCAGCACCGGGATTTTGCTCATCCATCTGGATTTGCACTCGTTCGAGCCTTGGCTGATTTGGGTCTCTTCGAAAATGCCATTTCTCAGTTTCGATCGCTTCGAGCCAGGTTTCCTAATGACCCTCCAGATATTTCTTTCTATAATTTTCTGTTTCGATGCTCTTTGAAGGAGAGCCGGGTTGATTTTGTGATTTGGCTTTATAAGGATATGGTTTTTGCGAGAGTTAACCCACAGACTTATACTTTTAATCTGTTGATACGTGCGCTTTGTGAAATGGGGTACTTGGAGAATGCACGTCAAGTGTTTGATAAAATGTCTGAAAAAGGGTGTAACCCTAATGAGTTTAGTCTTGGACTTATAGTTCGTGGGTATTGTAGAGCTGGGCTCCATGATCGAGGTATTGAACTTTTGGACGAGATGAGGAGCTCTGGTGCTCTTCCCAATAGGGTTGCATACAATACTGTGATATCTTCTCTTTGTGGAGAAGGTCAGACTGGGGAGGCTGAGAAATTGGTGGAAAGGATGAGAGAGGTTGGTCTTTCTCCAGATATTGTAACTTTCAATTGCAGAATTGCTGCCCTCTGTAAATCCGGGCAAATTTTAGAAGCTTCCAGAATTTTTAGAGATATGCAAATAGATGAAGAGTTAGGGTTACCTCAGCCTAATACCGTAACATATAATTTAATGCTAGAAGGATTTTGTAATGAAGGAATGTTCGAGGAATCCAAGGCTCTCTTTGATTCTATGAAAAAATCTGAAACTCATTTGACCATGGAGAGCTATAACATATGGTTGTTAGGTTTGGTTAGAAGTGGAAAGCTCCTTGAAGCTCGTTTAATTCTTAATGAAATGGCAGAAAAGAGTATAAAACCCAATCTTTACTCCTATAACATTTTGATTTATGGCCTTTGTAAATATGGAATGTTTTCTGATGCAAGATCTATAATAGGTCTAATGAGAGAGAGTGGTGTAGCTCCAGATATTGTATCTTATAGTACCTTACTTCATGGATACTGCTGTAGAGGAAAGATACTTGAATCCAATTATGTTCTTCGCGAAATGATACAGGTTGGTTGTTTTCCCAATATGTATACTTGTAATATCCTGCTTCACAGCCTGTGGAAAGAGGGGAAAGTATCAGAGGCAGAAGAGTTGCTACAAAAGATGAATGAAAGAGGTTATGGCTTGAATAATGTAACTTGTAATACAGTGATTAAGGGCCTCTGTAAATCTGGGAATCTGGACAAAGCTATTGAAATAGTGAGTGGCATGTGGAACCATGGAAGCGCTTCTCTTGGTAATCTTGGAAACTCTTTTATTGGTCTTTTTGATATTGGCAATAATGGGATGAAGTGTTTACCTGACTCGATCACATATGCAACCATAATAAGTTGGTTATGTAAGGCGGGGCGGGTTGATGAAGCAAAAAAGAAGCTTCTGGAGATGATTGGGAAAAAGCTATCTCCGGATTCGCTAATATTTGATACTTTCATACATAGTTACTGTAAACAAGGAAAGTTGTCATCTGCTTTTAGAGTACTCAAGGAAATGGAGAAAAAAGGGTGCAACAAGAGCCTTCGAACGTATAATTCATTGATCCAGGGTTTAAGTTCCAAAAATCAAATATTTGAAATATATGGGTTGATGGAGGAGATGAAAGAAAAAGGGATTTTTCCTAATGTTTACACTTACAATAACATTATTAGCTGCCTTTCTGAAGGTGGGAAACTGAAAGATGCCACCAGTCTTTTGGATGAAATGCTGCAGAAGGGGATATCTCCTAATATATATACGTTTAGGATTTTAATTGGAGCTTTCTTTAAGGCTTGCGACTTTGGAGCCGCTCAAGAGTTATTTGAGATAGCTTTAAGCATATGTGGCCACAAGGAATCCTTGTATAGTTTTATGTTCAATGAGCTATTAACTGGAGGTGAAACATCCAAGGCTAAAGAGCTTTTTGAAGCTGCATTAGATAGATCTCTAGCCTTGAAAAATTTTCTTTACAGGGACCTAATTGAAAGGCTTTGCATGGACGGAAAGTTAGATGATGCTAGTTTCATTCTTCATAAGATGATGGATAAGCAGTATAGGTTTGACCCTGCATCATTCATGCCAGTGATTGATGGATTAGGTAAAAGTGGGAACAAGCATGCAGCCGATGAATTTGCAGAAAAAATGATGGAAATGGCTTCAGAAACTGACATCAACCAACATGAGAATAAGATTATCCGAGGAAGATCAAATAATGATGATGAAAGAGATTGGCACAAGATCGTTCACAGAAACGATGGCAGTGGGATTGCACAGAAGACTCTTAAGCGTGTGTTGAAAGGATGGGGTCAAGGAAGCAAATATGGGAACATGCTCGAACCTGCTGAGGCATCAAATCTTCAAGTCAGGGTGCTTAAGGACATCAGTACATCAGGATGGACTGCTACTGCCAAAGAAAGAAATCCATTAGAGGCAGATTCGACAGTTTCACATGTGAGTAGAACTGATTCCCCATGGTTGTCGACATACCTGATATCGACTCATAGTAGTTGGACAAACTACACAAACAGGAAGTTCGAGCAGACAATGGCCAGACTATTCTCTTCCAAATCAAGACATCAGTGGTACTGCAATGGATGTCTAGCATGGGACAAAGGCATCTCAATGGTGAAGGGGGAGAAGCTGATGTTGGAGTCGTTTTGTTGCAGCAATTTCTGCCTCAAACCGTTCGTTCTTCTATTCATGTTTGATGCATATGAAGACTCAGTGCTGGCTAAGCTTGATATCTCAAACAACATTTGTCAAAGCATCAAGGATTGCAGCACAGAAGCAATTAGGATCAGTCAGACATGGAGGTTGATGTTTTTCTTTGATAATTTCAACTGTTTCTCCATGGAATCTGTGACCCCATGTTTCCAGTTGCTCAGGAGTAGGACAACCAAATCACCTATCAACAAGGATAAATGGAGGAGTGGACTTGGAGATCCAAACATCTAA

Protein sequence

MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMHPQIDHLHQLLLSQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFYNFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEMLQKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSKAKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVIDGLGKSGNKHAADEFAEKMMEMASETDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIAQKTLKRVLKGWGQGSKYGNMLEPAEASNLQVRVLKDISTSGWTATAKERNPLEADSTVSHVSRTDSPWLSTYLISTHSSWTNYTNRKFEQTMARLFSSKSRHQWYCNGCLAWDKGISMVKGEKLMLESFCCSNFCLKPFVLLFMFDAYEDSVLAKLDISNNICQSIKDCSTEAIRISQTWRLMFFFDNFNCFSMESVTPCFQLLRSRTTKSPINKDKWRSGLGDPNI
BLAST of Cp4.1LG15g06440 vs. Swiss-Prot
Match: PP158_ARATH (Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana GN=At2g17140 PE=2 SV=1)

HSP 1 Score: 1005.7 bits (2599), Expect = 3.7e-292
Identity = 493/853 (57.80%), Postives = 631/853 (73.97%), Query Frame = 1

Query: 7   LSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMHPQIDHL 66
           L KA+  N+ +P LAW +FKR+ SSP   S      SL + P IARIL  AKMH +I  L
Sbjct: 5   LVKALLKNTNNPRLAWRIFKRIFSSPSEESHGI---SLDATPTIARILVRAKMHEEIQEL 64

Query: 67  HQLLLSQHRDFAHPSGF-ALVRALADLGLFENAISQFRSLRARFPNDPPDISFYNFLFRC 126
           H L+LS        S   ++V   A     + A  QF+ +R+RFP + P +  YN L   
Sbjct: 65  HNLILSSSIQKTKLSSLLSVVSIFAKSNHIDKAFPQFQLVRSRFPENKPSVYLYNLLLES 124

Query: 127 SLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEKGCNPN 186
            +KE RV+FV WLYKDMV   + PQTYTFNLLIRALC+   ++ AR++FD+M EKGC PN
Sbjct: 125 CIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPN 184

Query: 187 EFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEAEKLVE 246
           EF+ G++VRGYC+AGL D+G+ELL+ M S G LPN+V YNT++SS C EG+  ++EK+VE
Sbjct: 185 EFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCREGRNDDSEKMVE 244

Query: 247 RMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNLMLEGF 306
           +MRE GL PDIVTFN RI+ALCK G++L+ASRIF DM++DE LGLP+PN++TYNLML+GF
Sbjct: 245 KMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLGLPRPNSITYNLMLKGF 304

Query: 307 CNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKSIKPNL 366
           C  G+ E++K LF+S+++++   +++SYNIWL GLVR GK +EA  +L +M +K I P++
Sbjct: 305 CKVGLLEDAKTLFESIRENDDLASLQSYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSI 364

Query: 367 YSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESNYVLRE 426
           YSYNIL+ GLCK GM SDA++I+GLM+ +GV PD V+Y  LLHGYC  GK+  +  +L+E
Sbjct: 365 YSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQE 424

Query: 427 MIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGLCKSGN 486
           M++  C PN YTCNILLHSLWK G++SEAEELL+KMNE+GYGL+ VTCN ++ GLC SG 
Sbjct: 425 MMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKMNEKGYGLDTVTCNIIVDGLCGSGE 484

Query: 487 LDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCKAGRVD 546
           LDKAIEIV GM  HGSA+LGNLGNS+IGL D       CLPD ITY+T+++ LCKAGR  
Sbjct: 485 LDKAIEIVKGMRVHGSAALGNLGNSYIGLVDDSLIENNCLPDLITYSTLLNGLCKAGRFA 544

Query: 547 EAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLI 606
           EAK    EM+G+KL PDS+ ++ FIH +CKQGK+SSAFRVLK+MEKKGC+KSL TYNSLI
Sbjct: 545 EAKNLFAEMMGEKLQPDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLI 604

Query: 607 QGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEMLQKGIS 666
            GL  KNQIFEI+GLM+EMKEKGI PN+ TYN  I  L EG K++DAT+LLDEM+QK I+
Sbjct: 605 LGLGIKNQIFEIHGLMDEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIA 664

Query: 667 PNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSKAKELF 726
           PN+++F+ LI AF K  DF  AQE+FE A+SICG KE LYS MFNELL  G+  KA EL 
Sbjct: 665 PNVFSFKYLIEAFCKVPDFDMAQEVFETAVSICGQKEGLYSLMFNELLAAGQLLKATELL 724

Query: 727 EAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVIDGLGKS 786
           EA LDR   L  FLY+DL+E LC   +L+ AS ILHKM+D+ Y FDPA+ MPVIDGLGK 
Sbjct: 725 EAVLDRGFELGTFLYKDLVESLCKKDELEVASGILHKMIDRGYGFDPAALMPVIDGLGKM 784

Query: 787 GNKHAADEFAEKMMEMAS----ETDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIAQ 846
           GNK  A+ FA+KMMEMAS       ++ +   I + + N +   +W  I+HR+DGSGIA 
Sbjct: 785 GNKKEANSFADKMMEMASVGEVANKVDPNARDIHQKKHNKNGGNNWQNILHRDDGSGIAL 844

Query: 847 KTLKRVLKGWGQG 855
           ++L RV KGWGQG
Sbjct: 845 RSLSRVKKGWGQG 854

BLAST of Cp4.1LG15g06440 vs. Swiss-Prot
Match: RF1_ORYSI (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 2.1e-74
Identity = 187/691 (27.06%), Postives = 328/691 (47.47%), Query Frame = 1

Query: 83  FALVRALADLGLFENA--ISQF-RSLRARFPNDPPDISFYNFLFRCSLKESRVDFVIWLY 142
           + L RALAD+     A  +S++ R  RA      PD+  Y  L  C  +  R+D      
Sbjct: 51  YGLNRALADVARDSPAAAVSRYNRMARAGADEVTPDLCTYGILIGCCCRAGRLDLGFAAL 110

Query: 143 KDMVFARVNPQTYTFNLLIRALCEMGYLENARQ-VFDKMSEKGCNPNEFSLGLIVRGYCR 202
            +++          F  L++ LC      +A   V  +M+E GC PN FS  ++++G C 
Sbjct: 111 GNVIKKGFRVDAIAFTPLLKGLCADKRTSDAMDIVLRRMTELGCIPNVFSYNILLKGLCD 170

Query: 203 AGLHDRGIELLDEM---RSSGALPNRVAYNTVISSLCGEGQTGEAEKLVERMREVGLSPD 262
                  +ELL  M   R  G+ P+ V+Y TVI+    EG + +A      M + G+ PD
Sbjct: 171 ENRSQEALELLHMMADDRGGGSPPDVVSYTTVINGFFKEGDSDKAYSTYHEMLDRGILPD 230

Query: 263 IVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNLMLEGFCNEGMFEESK 322
           +VT+N  IAALCK+  + +A  +   M  +  +    P+ +TYN +L G+C+ G  +E+ 
Sbjct: 231 VVTYNSIIAALCKAQAMDKAMEVLNTMVKNGVM----PDCMTYNSILHGYCSSGQPKEAI 290

Query: 323 ALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKSIKPNLYSYNILIYGL 382
                M+       + +Y++ +  L ++G+ +EAR I + M ++ +KP + +Y  L+ G 
Sbjct: 291 GFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEARKIFDSMTKRGLKPEITTYGTLLQGY 350

Query: 383 CKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESNYVLREMIQVGCFPNM 442
              G   +   ++ LM  +G+ PD   +S L+  Y  +GK+ ++  V  +M Q G  PN 
Sbjct: 351 ATKGALVEMHGLLDLMVRNGIHPDHYVFSILICAYAKQGKVDQAMLVFSKMRQQGLNPNA 410

Query: 443 YTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGLCKSGNLDKAIEIVSG 502
            T   ++  L K G+V +A    ++M + G    N+  N++I GLC     ++A E++  
Sbjct: 411 VTYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPGNIVYNSLIHGLCTCNKWERAEELILE 470

Query: 503 MWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCKAGRVDEAKKKLLEMI 562
           M + G                       CL ++I + +II   CK GRV E++K    M+
Sbjct: 471 MLDRGI----------------------CL-NTIFFNSIIDSHCKEGRVIESEKLFELMV 530

Query: 563 GKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLIQGLSSKNQIF 622
              + P+ + ++T I+ YC  GK+  A ++L  M   G   +  TY++LI G    +++ 
Sbjct: 531 RIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRME 590

Query: 623 EIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEMLQKGISPNIYTFRILI 682
           +   L +EM+  G+ P++ TYN I+  L +  +   A  L   + + G    + T+ I++
Sbjct: 591 DALVLFKEMESSGVSPDIITYNIILQGLFQTRRTAAAKELYVRITESGTQIELSTYNIIL 650

Query: 683 GAFFKACDFGAAQELFE-IALSICGHKESLYSFMFNELLTGGETSKAKELFEAALDRSLA 742
               K      A ++F+ + L     +   ++ M + LL  G   +AK+LF A     L 
Sbjct: 651 HGLCKNKLTDDALQMFQNLCLMDLKLEARTFNIMIDALLKVGRNDEAKDLFVAFSSNGLV 710

Query: 743 LKNFLYRDLIERLCMDGKLDDASFILHKMMD 766
              + YR + E +   G L++   +   M D
Sbjct: 711 PNYWTYRLMAENIIGQGLLEELDQLFLSMED 714

BLAST of Cp4.1LG15g06440 vs. Swiss-Prot
Match: PP444_ARATH (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 282.0 bits (720), Expect = 2.8e-74
Identity = 162/563 (28.77%), Postives = 283/563 (50.27%), Query Frame = 1

Query: 114 PDISFYNFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQV 173
           P    YN +    +  +       ++ DM+  ++ P  +TF ++++A C +  +++A  +
Sbjct: 180 PTFKSYNVVLEILVSGNCHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVNEIDSALSL 239

Query: 174 FDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCG 233
              M++ GC PN      ++    +    +  ++LL+EM   G +P+   +N VI  LC 
Sbjct: 240 LRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCK 299

Query: 234 EGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQP 293
             +  EA K+V RM   G +PD +T+   +  LCK G++  A  +F          +P+P
Sbjct: 300 FDRINEAAKMVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLF--------YRIPKP 359

Query: 294 NTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLT-MESYNIWLLGLVRSGKLLEARLI 353
             V +N ++ GF   G  +++KA+   M  S   +  + +YN  + G  + G +  A  +
Sbjct: 360 EIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEV 419

Query: 354 LNEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCC 413
           L++M  K  KPN+YSY IL+ G CK G   +A +++  M   G+ P+ V ++ L+  +C 
Sbjct: 420 LHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCK 479

Query: 414 RGKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVT 473
             +I E+  + REM + GC P++YT N L+  L +  ++  A  LL+ M   G   N VT
Sbjct: 480 EHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVT 539

Query: 474 CNTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYA 533
            NT+I    + G + +A ++V+ M   GS                         D ITY 
Sbjct: 540 YNTLINAFLRRGEIKEARKLVNEMVFQGSPL-----------------------DEITYN 599

Query: 534 TIISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKK 593
           ++I  LC+AG VD+A+    +M+    +P ++  +  I+  C+ G +  A    KEM  +
Sbjct: 600 SLIKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLR 659

Query: 594 GCNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDA 653
           G    + T+NSLI GL    +I +   +  +++ +GI P+  T+N ++S L +GG + DA
Sbjct: 660 GSTPDIVTFNSLINGLCRAGRIEDGLTMFRKLQAEGIPPDTVTFNTLMSWLCKGGFVYDA 711

Query: 654 TSLLDEMLQKGISPNIYTFRILI 676
             LLDE ++ G  PN  T+ IL+
Sbjct: 720 CLLLDEGIEDGFVPNHRTWSILL 711

BLAST of Cp4.1LG15g06440 vs. Swiss-Prot
Match: PP437_ARATH (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 282.0 bits (720), Expect = 2.8e-74
Identity = 186/695 (26.76%), Postives = 318/695 (45.76%), Query Frame = 1

Query: 114 PDISFYNFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQV 173
           P++   + L    +K       + L+ DMV   + P  Y +  +IR+LCE+  L  A+++
Sbjct: 190 PEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEM 249

Query: 174 FDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCG 233
              M   GC+ N     +++ G C+       + +  ++      P+ V Y T++  LC 
Sbjct: 250 IAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCK 309

Query: 234 EGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQP 293
             +     ++++ M  +  SP     +  +  L K G+I EA  + + +    + G+  P
Sbjct: 310 VQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVV---DFGV-SP 369

Query: 294 NTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLIL 353
           N   YN +++  C    F E++ LFD M K        +Y+I +    R GKL  A   L
Sbjct: 370 NLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFL 429

Query: 354 NEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCR 413
            EM +  +K ++Y YN LI G CK+G  S A   +  M    + P +V+Y++L+ GYC +
Sbjct: 430 GEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSK 489

Query: 414 GKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTC 473
           GKI ++  +  EM   G  P++YT   LL  L++ G + +A +L  +M E     N VT 
Sbjct: 490 GKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTY 549

Query: 474 NTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYAT 533
           N +I+G C+ G++ KA E +  M   G                        +PD+ +Y  
Sbjct: 550 NVMIEGYCEEGDMSKAFEFLKEMTEKG-----------------------IVPDTYSYRP 609

Query: 534 IISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKG 593
           +I  LC  G+  EAK  +  +       + + +   +H +C++GKL  A  V +EM ++G
Sbjct: 610 LIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRG 669

Query: 594 CNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDAT 653
            +  L  Y  LI G          +GL++EM ++G+ P+   Y ++I   S+ G  K+A 
Sbjct: 670 VDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAF 729

Query: 654 SLLDEMLQKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELL 713
            + D M+ +G  PN  T+  +I    KA     A+ L      +      +    F ++L
Sbjct: 730 GIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDIL 789

Query: 714 TGGET--SKAKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFD 773
           T GE    KA EL  A L + L      Y  LI   C  G++++AS ++ +M+      D
Sbjct: 790 TKGEVDMQKAVELHNAIL-KGLLANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPD 849

Query: 774 PASFMPVIDGLGKSGNKHAADEFAEKMMEMASETD 807
             ++  +I+ L +  +   A E    M E     D
Sbjct: 850 CITYTTMINELCRRNDVKKAIELWNSMTEKGIRPD 856

BLAST of Cp4.1LG15g06440 vs. Swiss-Prot
Match: PP360_ARATH (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 4.0e-73
Identity = 162/526 (30.80%), Postives = 277/526 (52.66%), Query Frame = 1

Query: 155 NLLIRALCEMGYLENARQVFDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRS 214
           N LI +L  +G++E A  V+ ++S  G   N ++L ++V   C+ G  ++    L +++ 
Sbjct: 204 NALIGSLVRIGWVELAWGVYQEISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQE 263

Query: 215 SGALPNRVAYNTVISSLCGEGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILE 274
            G  P+ V YNT+IS+   +G   EA +L+  M   G SP + T+N  I  LCK G+   
Sbjct: 264 KGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYER 323

Query: 275 ASRIFRDMQIDEELGLPQPNTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLTMESYN 334
           A  +F +M      GL  P++ TY  +L   C +G   E++ +F  M+  +    +  ++
Sbjct: 324 AKEVFAEML---RSGL-SPDSTTYRSLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFS 383

Query: 335 IWLLGLVRSGKLLEARLILNEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRES 394
             +    RSG L +A +  N + E  + P+   Y ILI G C+ GM S A ++   M + 
Sbjct: 384 SMMSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQ 443

Query: 395 GVAPDIVSYSTLLHGYCCRGKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEA 454
           G A D+V+Y+T+LHG C R  + E++ +  EM +   FP+ YT  IL+    K G +  A
Sbjct: 444 GCAMDVVTYNTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNA 503

Query: 455 EELLQKMNERGYGLNNVTCNTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGL 514
            EL QKM E+   L+ VT NT++ G  K G++D A EI + M +                
Sbjct: 504 MELFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVS---------------- 563

Query: 515 FDIGNNGMKCLPDSITYATIISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYC 574
                   + LP  I+Y+ +++ LC  G + EA +   EMI K + P  +I ++ I  YC
Sbjct: 564 -------KEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYC 623

Query: 575 KQGKLSSAFRVLKEMEKKGCNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEK--GIFPN 634
           + G  S     L++M  +G      +YN+LI G   +  + + +GL+++M+E+  G+ P+
Sbjct: 624 RSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEEEQGGLVPD 683

Query: 635 VYTYNNIISCLSEGGKLKDATSLLDEMLQKGISPNIYTFRILIGAF 679
           V+TYN+I+       ++K+A  +L +M+++G++P+  T+  +I  F
Sbjct: 684 VFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGF 702

BLAST of Cp4.1LG15g06440 vs. TrEMBL
Match: A0A0A0K4X0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G024100 PE=4 SV=1)

HSP 1 Score: 1543.5 bits (3995), Expect = 0.0e+00
Identity = 762/855 (89.12%), Postives = 804/855 (94.04%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MDRAA LSKA+YLNS +P LAWLLFKR+LSSPI ASSSFFK SLQS+P IARIL TAKMH
Sbjct: 3   MDRAAKLSKAIYLNSNNPNLAWLLFKRILSSPIPASSSFFKPSLQSVPAIARILITAKMH 62

Query: 61  PQIDHLHQLLLSQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFYN 120
           PQIDHLHQLLLSQHRDFAHPSGF+LVR LADLGL ENAISQFRSLR RFP+DPP ISFYN
Sbjct: 63  PQIDHLHQLLLSQHRDFAHPSGFSLVRTLADLGLLENAISQFRSLRDRFPHDPPPISFYN 122

Query: 121 FLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEK 180
            LFRCSLKESRVD VIWLYKDM  ARV PQTYTFNLLI ALCEMGYLENAR+VFDKMSEK
Sbjct: 123 LLFRCSLKESRVDCVIWLYKDMAVARVKPQTYTFNLLISALCEMGYLENAREVFDKMSEK 182

Query: 181 GCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEA 240
           GC PNEFSLG++VRGYCRAGLH  GI+LLDEMRSSGALPNRVAYNTVISSLCGEGQT EA
Sbjct: 183 GCKPNEFSLGILVRGYCRAGLHSHGIDLLDEMRSSGALPNRVAYNTVISSLCGEGQTVEA 242

Query: 241 EKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNL 300
           EKLVE+MREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEE+GLP+PNTVTYNL
Sbjct: 243 EKLVEKMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEEMGLPKPNTVTYNL 302

Query: 301 MLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKS 360
           MLEGFC+EGMFEE++A+FDSMK SET L++ SYNIW+LGLVRSGKLLEA LILNEMAEK+
Sbjct: 303 MLEGFCSEGMFEEARAIFDSMKNSET-LSLRSYNIWMLGLVRSGKLLEAHLILNEMAEKN 362

Query: 361 IKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESN 420
           IKPNLYSYNIL++GLCKYGMFSDARSI+GLMRESGVAPD V+YSTLLHGYC RGKILE+N
Sbjct: 363 IKPNLYSYNILVHGLCKYGMFSDARSILGLMRESGVAPDTVTYSTLLHGYCRRGKILEAN 422

Query: 421 YVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGL 480
           YVLREMIQVGCFPNMYTCNILLHSLWKEG+ SEAE+LLQ MNERGYGL+NVTCNT+I GL
Sbjct: 423 YVLREMIQVGCFPNMYTCNILLHSLWKEGRASEAEDLLQMMNERGYGLDNVTCNTMINGL 482

Query: 481 CKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCK 540
           CK+GNLDKAIEIVSGMW  GSASLGNLGNSFI LFDI NNG KCLPDSITYATII  LCK
Sbjct: 483 CKAGNLDKAIEIVSGMWTRGSASLGNLGNSFIDLFDIRNNGKKCLPDSITYATIIGGLCK 542

Query: 541 AGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRT 600
            GRVDEAKKKLLEMIGKKLSPDSLIFDTFI++YCKQGKLSSAFRVLKEMEKKGCNKSLRT
Sbjct: 543 VGRVDEAKKKLLEMIGKKLSPDSLIFDTFIYNYCKQGKLSSAFRVLKEMEKKGCNKSLRT 602

Query: 601 YNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEML 660
           YNSLIQGL S+NQIFEIYGLM+EMKE+GIFPNVYTYNNIISCLSEGGKLKDAT LLDEML
Sbjct: 603 YNSLIQGLGSENQIFEIYGLMDEMKERGIFPNVYTYNNIISCLSEGGKLKDATCLLDEML 662

Query: 661 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSK 720
           QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALS+CGHKESLYSFMFNELL GGET K
Sbjct: 663 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSLCGHKESLYSFMFNELLAGGETLK 722

Query: 721 AKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVID 780
           AKELFEAALDRSLALKNFLYRDLIE+LC DGKLDDASFILHKMMDKQY FDPASFMPVID
Sbjct: 723 AKELFEAALDRSLALKNFLYRDLIEKLCKDGKLDDASFILHKMMDKQYSFDPASFMPVID 782

Query: 781 GLGKSGNKHAADEFAEKMMEMASETDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIA 840
            LGK G+KHAADEFAE+MMEMASETD N+HENK IRGR NN+DE DW KIVHRNDGSGIA
Sbjct: 783 ELGKRGSKHAADEFAERMMEMASETDFNEHENKNIRGRLNNNDESDWQKIVHRNDGSGIA 842

Query: 841 QKTLKRVLKGWGQGS 856
           QKTLKRVLKGWGQGS
Sbjct: 843 QKTLKRVLKGWGQGS 856

BLAST of Cp4.1LG15g06440 vs. TrEMBL
Match: D7U4S8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g03720 PE=4 SV=1)

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 568/859 (66.12%), Postives = 682/859 (79.39%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MD+   L+KA+  N+ +P LAW LFKR+LS P S+SS   ++ L+SIP+I  IL  AKM 
Sbjct: 1   MDQRNKLTKALIKNTHNPTLAWHLFKRILSIPTSSSSISSRSILRSIPIITHILIRAKMI 60

Query: 61  PQIDHLHQLLLSQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFYN 120
            QIDHL QLLL Q ++ +H S  AL+R LA  GL + A SQF+S R++ P +PP +  YN
Sbjct: 61  SQIDHLQQLLLQQPQEVSHVSLIALIRILAKSGLSDLAFSQFQSFRSQVPANPPPVYLYN 120

Query: 121 FLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEK 180
            +   SL+E +VD   WLYKDMV A V+P+TYT NLLI  LC+ G  E+AR+VFDKM  K
Sbjct: 121 MVLESSLREDKVDSFSWLYKDMVVAGVSPETYTLNLLIAGLCDSGRFEDAREVFDKMGVK 180

Query: 181 GCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEA 240
           GC PNEFS G++VRGYCRAGL  R +ELLD M S G  PN+V YNT+ISS C EG+  EA
Sbjct: 181 GCRPNEFSFGILVRGYCRAGLSMRALELLDGMGSFGVQPNKVIYNTLISSFCREGRNEEA 240

Query: 241 EKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNL 300
           E+LVERMRE GL PD+VTFN RI+ALC +G+ILEASRIFRDMQIDEELGLP+PN  T+NL
Sbjct: 241 ERLVERMREDGLFPDVVTFNSRISALCSAGKILEASRIFRDMQIDEELGLPRPNITTFNL 300

Query: 301 MLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKS 360
           MLEGFC EGM EE+K L +SMK++   + +ESYNIWLLGLVR+GKLLEA+L L EM +K 
Sbjct: 301 MLEGFCKEGMLEEAKTLVESMKRNGNLMELESYNIWLLGLVRNGKLLEAQLALKEMVDKG 360

Query: 361 IKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESN 420
           I+PN+YS+N ++ GLCK G+ SDAR I+GLM  SG+ PD V+YSTLLHG C  GK+L++N
Sbjct: 361 IEPNIYSFNTVMDGLCKNGLISDARMIMGLMISSGIGPDTVTYSTLLHGCCSTGKVLKAN 420

Query: 421 YVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGL 480
            +L EM++ GC PN YTCNILLHSLWKEG++ EAE+LLQKMNER Y L+NVTCN VI GL
Sbjct: 421 NILHEMMRRGCSPNTYTCNILLHSLWKEGRIFEAEKLLQKMNERSYDLDNVTCNIVIDGL 480

Query: 481 CKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCK 540
           CKSG LD+A+EIV GMW HGSA+LGNLGNSFIGL D  +NG KCLPD ITY+ II+ LCK
Sbjct: 481 CKSGKLDEAVEIVEGMWIHGSAALGNLGNSFIGLVDSSSNGKKCLPDLITYSIIINGLCK 540

Query: 541 AGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRT 600
           AGR+DEA+KK +EM+GK L PDS+I+DTFIHS+CK GK+SSAFRVLK+MEK+GCNKSL+T
Sbjct: 541 AGRLDEARKKFIEMVGKSLHPDSIIYDTFIHSFCKHGKISSAFRVLKDMEKRGCNKSLQT 600

Query: 601 YNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEML 660
           YNSLI GL SKNQIFEIYGL+++MKEKGI PN+ TYNN+ISCL EGG++KDATSLLDEML
Sbjct: 601 YNSLILGLGSKNQIFEIYGLLDDMKEKGITPNICTYNNMISCLCEGGRIKDATSLLDEML 660

Query: 661 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSK 720
           QKGISPNI +FR+LI AF KA DFG  +E+FEIALSICGHKE+LYS MFNELL GGE S+
Sbjct: 661 QKGISPNISSFRLLIKAFCKASDFGVVKEVFEIALSICGHKEALYSLMFNELLIGGEVSE 720

Query: 721 AKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVID 780
           AKELF+AALDR   L NF Y DLIE+LC D  L++AS ILHKM+DK YRFDPASFMPVID
Sbjct: 721 AKELFDAALDRCFDLGNFQYNDLIEKLCKDEMLENASDILHKMIDKGYRFDPASFMPVID 780

Query: 781 GLGKSGNKHAADEFAEKMMEMAS----ETDINQHENKIIRGRSNNDDERDWHKIVHRNDG 840
           GLGK G KH ADE AE+MM+MAS    E  I ++E+   R + N     DW  I+HR+DG
Sbjct: 781 GLGKRGKKHDADELAERMMDMASEGMVENKITRNESAFNRQKRNKFSGSDWQTIIHRDDG 840

Query: 841 SGIAQKTLKRVLKGWGQGS 856
           SG+A K LKRV KGWGQGS
Sbjct: 841 SGLALKALKRVQKGWGQGS 859

BLAST of Cp4.1LG15g06440 vs. TrEMBL
Match: M5VK94_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001249mg PE=4 SV=1)

HSP 1 Score: 1140.9 bits (2950), Expect = 0.0e+00
Identity = 561/860 (65.23%), Postives = 683/860 (79.42%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MD    L+KA++ N+ +P+LAW LFKR+LSSP S+SSS     L+S+P++ RIL  +KMH
Sbjct: 1   MDPTTSLTKALFKNTNNPKLAWHLFKRILSSPTSSSSS--DLCLRSLPIVTRILIDSKMH 60

Query: 61  PQIDHLHQLLL-SQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFY 120
            +ID L QLLL SQ  +   P   +LVR LA   L + A+S F+ LR+RFP++PP +  Y
Sbjct: 61  HEIDSLRQLLLVSQPSETLRPCLVSLVRFLAKSSLSDMAVSCFKDLRSRFPDEPPSVYLY 120

Query: 121 NFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSE 180
           N L   SL+E  VDFV+WLYKDM+ + + P+TYTFNLLI +LCE   L++AR+VFDKM E
Sbjct: 121 NLLVESSLREKHVDFVLWLYKDMIVSGMKPETYTFNLLICSLCESDRLDDAREVFDKMRE 180

Query: 181 KGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGE 240
           KGC PNE+S+G++VRGYCRAGL  RG+E+LD+MRS   LPNRV YNT+ISS C + +T +
Sbjct: 181 KGCQPNEYSVGILVRGYCRAGLAVRGLEVLDQMRSCNLLPNRVVYNTLISSFCKQSKTDD 240

Query: 241 AEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYN 300
           AEKLVERMRE G+ PD VTFN RI+ALC +G+ILEASRIFRDM ID+E+GLPQPN VTYN
Sbjct: 241 AEKLVERMREDGMLPDAVTFNSRISALCSAGKILEASRIFRDMHIDQEMGLPQPNVVTYN 300

Query: 301 LMLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEK 360
           LML+GFC E M EE++ LF SM+K+   + +ESYNIWLLGLV++GKLLEARL+L EM +K
Sbjct: 301 LMLQGFCREDMLEEAENLFKSMEKAGNFINLESYNIWLLGLVKNGKLLEARLVLKEMVDK 360

Query: 361 SIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILES 420
            I+PN+YSYNI+I GLCK GM  DAR ++ LM  + ++PD V+YSTLLHG+C +GK+ E+
Sbjct: 361 GIEPNIYSYNIVINGLCKNGMLRDARMVMTLMVRNNISPDTVTYSTLLHGFCNKGKVFEA 420

Query: 421 NYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKG 480
           + +L EM+   CFPN +TCNILLHSLWKEG+ SEAEELLQKMNERGYGL+ VTCN VI G
Sbjct: 421 SNILHEMMMNNCFPNTHTCNILLHSLWKEGRTSEAEELLQKMNERGYGLDTVTCNIVIDG 480

Query: 481 LCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLC 540
           LC  G LDKAIEIVSGMW HGSA+LGNLGNSFIGL D  NNG KC+PD ITY+TIIS LC
Sbjct: 481 LCNDGKLDKAIEIVSGMWTHGSAALGNLGNSFIGLVDDSNNGKKCIPDLITYSTIISGLC 540

Query: 541 KAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLR 600
           KAGR+DEAKKK +EM+GK L PDS+I+D FI+S+CKQG++SSAFRVLK+MEKKGCNKS++
Sbjct: 541 KAGRLDEAKKKFMEMMGKNLHPDSVIYDMFINSFCKQGRISSAFRVLKDMEKKGCNKSIQ 600

Query: 601 TYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEM 660
           TYNSL+ GL SK QIFEIYGLM+EM+E+G+ P+V TYN +++CL EG ++KDATSLLDEM
Sbjct: 601 TYNSLVLGLGSKKQIFEIYGLMDEMRERGVTPDVCTYNYMMNCLCEGERVKDATSLLDEM 660

Query: 661 LQKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETS 720
           LQKGISPNI TFRILI AF KACDFG   E+F+IALS+CGHKE LYS MFNELL GGE  
Sbjct: 661 LQKGISPNISTFRILIKAFCKACDFGVTHEVFDIALSVCGHKEVLYSLMFNELLAGGEIL 720

Query: 721 KAKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVI 780
           KAK LFE ALDR   L NFLY+DLI+RLC D KL+DAS ILH M +K Y FDPASF+PVI
Sbjct: 721 KAKALFEVALDRYFYLGNFLYKDLIDRLCKDEKLEDASSILHTMKNKGYGFDPASFLPVI 780

Query: 781 DGLGKSGNKHAADEFAEKMMEMASETDINQH----ENKIIRGRSNNDDERDWHKIVHRND 840
           DGL K GNK  ADE AE MM+M SE  +       E +II G+ +N+   DW  IVHR+D
Sbjct: 781 DGLSKRGNKQEADELAEAMMDMESEGRVGDKVYRIEREIIGGKPSNNGGSDWQTIVHRDD 840

Query: 841 GSGIAQKTLKRVLKGWGQGS 856
           GSGIA KTLKRV KGWG+GS
Sbjct: 841 GSGIALKTLKRVQKGWGRGS 858

BLAST of Cp4.1LG15g06440 vs. TrEMBL
Match: A0A0D2TQE1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G159300 PE=4 SV=1)

HSP 1 Score: 1137.1 bits (2940), Expect = 0.0e+00
Identity = 558/854 (65.34%), Postives = 688/854 (80.56%), Query Frame = 1

Query: 7   LSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMHPQIDHL 66
           L++A+  N+K+P+LAW LFKR+ SSP   S+  F   L S+P IARIL  +KM P+IDHL
Sbjct: 6   LTQALLKNTKNPKLAWQLFKRIQSSP---SNPCF---LSSVPTIARILIRSKMLPEIDHL 65

Query: 67  HQLLLS-QHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFYNFLFRC 126
           H LLLS Q ++ + PS  +LV  LA  G F+ A SQF+S+R  FP +PP I  YN LF C
Sbjct: 66  HLLLLSSQPQEKSLPSLISLVNLLAKSGFFDKAFSQFQSIRKMFPQNPPSICLYNVLFGC 125

Query: 127 SLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEKGCNPN 186
            +KE R D V+WLYKDMV A V+P+TYTFNLLI  LC++G+LE+AR++FDKM EKGC PN
Sbjct: 126 CIKERRSDCVLWLYKDMVLAGVSPETYTFNLLICGLCDLGHLEDARELFDKMPEKGCLPN 185

Query: 187 EFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEAEKLVE 246
           EFS G++VRGYCR GL ++G+ELLDEMRSSG LPNRV YNT+ISS C EG+TG+AEKLVE
Sbjct: 186 EFSFGILVRGYCRFGLANKGLELLDEMRSSGILPNRVVYNTLISSFCKEGKTGDAEKLVE 245

Query: 247 RMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNLMLEGF 306
           RMRE GL PD+VTFN RI+ALC +G++LEASRIFRDMQIDE LGLP+PN +TYNLMLEGF
Sbjct: 246 RMREDGLFPDVVTFNARISALCSAGKVLEASRIFRDMQIDEALGLPRPNVITYNLMLEGF 305

Query: 307 CNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKSIKPNL 366
           C +GM  E+KAL +SM+K+   + ++SYNIWLLGL+R+ KL+EA+L+L +M +K ++PN+
Sbjct: 306 CKQGMLVEAKALVESMEKNGDLMNLDSYNIWLLGLLRNAKLVEAQLVLEDMVDKGVEPNI 365

Query: 367 YSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESNYVLRE 426
           YSYNI++ GLCK GM SDAR ++G +  SG++PD V+YSTLLHGYC +GK+ E+N +L E
Sbjct: 366 YSYNIVMDGLCKNGMLSDARMVMGFIVRSGLSPDTVTYSTLLHGYCRKGKLSEANAILNE 425

Query: 427 MIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGLCKSGN 486
           M++ G  PN YTCNILLHSLWKEGK+ EAEELLQKMNE+GYG++ VTCN VI GLCKSG 
Sbjct: 426 MMRSGYVPNTYTCNILLHSLWKEGKILEAEELLQKMNEKGYGVDTVTCNIVIDGLCKSGK 485

Query: 487 LDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCKAGRVD 546
           LDKA+EI   MW HGSA+LGNLGNSFIGL D  +  M+C+PD +TY+ IIS LCKAG++D
Sbjct: 486 LDKAMEIAHEMWTHGSAALGNLGNSFIGLVDDVSRSMRCIPDLVTYSIIISALCKAGKID 545

Query: 547 EAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLI 606
           EAKKK  EM+GK L PD++IFDTFIH +CK+GK+SSAFRVLK+MEKKGCNKS++TYNSLI
Sbjct: 546 EAKKKFREMMGKNLQPDAVIFDTFIHIFCKEGKISSAFRVLKDMEKKGCNKSVQTYNSLI 605

Query: 607 QGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEMLQKGIS 666
            GL SKNQIFEIYGL++EM+E+GI PNV  YNNII  L + GK++D TS+LD+MLQ GI+
Sbjct: 606 LGLGSKNQIFEIYGLVDEMRERGITPNVCIYNNIIQSLCKNGKIQDTTSILDDMLQMGIN 665

Query: 667 PNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSKAKELF 726
           PNI TFR+LI AF KA DFG A+ELFEI LSICGHKE+ YS MFNELL+GG+ S+AK +F
Sbjct: 666 PNISTFRMLIEAFCKASDFGVAKELFEIGLSICGHKEAFYSLMFNELLSGGQLSEAKVIF 725

Query: 727 EAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVIDGLGKS 786
           EAALDRS  L NFLY+DLIE+LC DGKL++AS ILHK++ K Y+FDPASFMPV+D LGK 
Sbjct: 726 EAALDRSFHLGNFLYKDLIEKLCKDGKLEEASGILHKLIIKGYKFDPASFMPVVDDLGKR 785

Query: 787 GNKHAADEFAEKMMEMAS----ETDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIAQ 846
           GNKH ADE AEKM+EMAS    E  I++   ++I  +       DW  IVHR+DGSGIA 
Sbjct: 786 GNKHEADELAEKMLEMASDGRVENKISRKPKELIHRKETKYGGDDWQTIVHRDDGSGIAL 845

Query: 847 KTLKRVLKGWGQGS 856
           KTLKRV KGWGQGS
Sbjct: 846 KTLKRVQKGWGQGS 853

BLAST of Cp4.1LG15g06440 vs. TrEMBL
Match: A0A061DYE2_THECC (Pentatricopeptide repeat (PPR) superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_006452 PE=4 SV=1)

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 554/854 (64.87%), Postives = 679/854 (79.51%), Query Frame = 1

Query: 7   LSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMHPQIDHL 66
           L+ A+  N+K+P+LAW LFKR+ S P    SSF    L S+P I+RIL  + M  +IDHL
Sbjct: 6   LTLALLKNTKNPKLAWQLFKRIQSLP--TDSSF----LPSVPTISRILIRSNMLQEIDHL 65

Query: 67  HQLLLSQHRDFAHPSGF-ALVRALADLGLFENAISQFRSLRARFPNDPPDISFYNFLFRC 126
           H LLLS        S   +LV+ LA  G F+ A SQF+S+R +FP +PP I  YN LF C
Sbjct: 66  HHLLLSSQPQLNPLSSLISLVKLLARSGFFDRAFSQFQSIRTKFPQNPPSICLYNVLFEC 125

Query: 127 SLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEKGCNPN 186
            +KE   D+V+WLYKDMV A V+PQTYTFNLLI  LC++G+L++AR++FDKMSEKGC PN
Sbjct: 126 CIKERCSDYVLWLYKDMVGAGVSPQTYTFNLLICGLCDLGHLDDARELFDKMSEKGCVPN 185

Query: 187 EFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEAEKLVE 246
           EFS G++VRGYCR GL D+G+ELLD+MR     PNRV YNT+ISS C EG+T +AEKLVE
Sbjct: 186 EFSFGILVRGYCRFGLADKGVELLDDMRRFEIRPNRVVYNTLISSFCKEGKTDDAEKLVE 245

Query: 247 RMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNLMLEGF 306
           RMRE GL PD+VTFN RI+ALC++G+ILEASRIFRDMQ+DEELGLP+PN +TYNLMLEGF
Sbjct: 246 RMREDGLFPDVVTFNSRISALCRAGKILEASRIFRDMQMDEELGLPRPNVITYNLMLEGF 305

Query: 307 CNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKSIKPNL 366
           C +GM EE+K L +SM+K    + +ESYNIWLLGL+R+ KL+EA+L+L +M  K ++PN+
Sbjct: 306 CKQGMLEEAKTLVESMEKKGDLMNLESYNIWLLGLLRNAKLVEAQLVLKDMIYKGVEPNI 365

Query: 367 YSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESNYVLRE 426
           YSYN+++ GLCK GM SDAR ++G +  SG++PD V++STLLHGYCC+G++  +N +L E
Sbjct: 366 YSYNVVMDGLCKNGMLSDARMVMGFIISSGLSPDTVTFSTLLHGYCCKGRLYAANSILHE 425

Query: 427 MIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGLCKSGN 486
           M++ GCFPN YTCNILLHSLWKEGK+SEAE+LLQKMNE+GYG++ VTCN VI GLCKSG 
Sbjct: 426 MMRNGCFPNTYTCNILLHSLWKEGKISEAEDLLQKMNEKGYGVDTVTCNIVIDGLCKSGK 485

Query: 487 LDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCKAGRVD 546
           LDKA+EI + MW HGSA+LGNLGNSFIGL D  N+  +C+PD +TY+ IIS LCKAGR+D
Sbjct: 486 LDKAMEIGNEMWTHGSAALGNLGNSFIGLVDDANSSKQCIPDLVTYSIIISALCKAGRLD 545

Query: 547 EAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLI 606
           EAKKK  EM+GK L PDS+IFD FIH +CK+GK+SSAFRVLK+MEKKGCNKSL+TYNSLI
Sbjct: 546 EAKKKFKEMMGKNLQPDSVIFDIFIHIFCKEGKISSAFRVLKDMEKKGCNKSLQTYNSLI 605

Query: 607 QGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEMLQKGIS 666
            GL SKNQIFEIYGL++EM+E+GI PNV TYNNII CL E GK++D TS+LDEMLQKGI+
Sbjct: 606 LGLGSKNQIFEIYGLVDEMRERGITPNVCTYNNIIRCLCENGKMQDTTSILDEMLQKGIN 665

Query: 667 PNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSKAKELF 726
           PNI +FR+LI AF KACDFG AQELFEIALSICGHKE+LY  MFNELL GG+ S+AK +F
Sbjct: 666 PNISSFRMLIEAFCKACDFGVAQELFEIALSICGHKEALYKLMFNELLVGGQLSEAKLVF 725

Query: 727 EAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVIDGLGKS 786
           EAAL RS  L  FLY+DLIE+LC D KL++AS ILHKM++K Y+FDPA+FMPV+D LGK 
Sbjct: 726 EAALYRSFHLGGFLYKDLIEKLCKDKKLEEASRILHKMINKGYKFDPATFMPVVDELGKR 785

Query: 787 GNKHAADEFAEKMMEMASE----TDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIAQ 846
           GNKH ADE AEKMMEMAS+      I  +  + I  +       DW  IVHR+DGSGIA 
Sbjct: 786 GNKHEADELAEKMMEMASDGRVGNKIYLNAREPIHRKEIKFGGDDWQTIVHRDDGSGIAL 845

Query: 847 KTLKRVLKGWGQGS 856
           K LKRV KGWGQGS
Sbjct: 846 KALKRVQKGWGQGS 853

BLAST of Cp4.1LG15g06440 vs. TAIR10
Match: AT2G17140.1 (AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 1005.7 bits (2599), Expect = 2.1e-293
Identity = 493/853 (57.80%), Postives = 631/853 (73.97%), Query Frame = 1

Query: 7   LSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMHPQIDHL 66
           L KA+  N+ +P LAW +FKR+ SSP   S      SL + P IARIL  AKMH +I  L
Sbjct: 5   LVKALLKNTNNPRLAWRIFKRIFSSPSEESHGI---SLDATPTIARILVRAKMHEEIQEL 64

Query: 67  HQLLLSQHRDFAHPSGF-ALVRALADLGLFENAISQFRSLRARFPNDPPDISFYNFLFRC 126
           H L+LS        S   ++V   A     + A  QF+ +R+RFP + P +  YN L   
Sbjct: 65  HNLILSSSIQKTKLSSLLSVVSIFAKSNHIDKAFPQFQLVRSRFPENKPSVYLYNLLLES 124

Query: 127 SLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEKGCNPN 186
            +KE RV+FV WLYKDMV   + PQTYTFNLLIRALC+   ++ AR++FD+M EKGC PN
Sbjct: 125 CIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPN 184

Query: 187 EFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEAEKLVE 246
           EF+ G++VRGYC+AGL D+G+ELL+ M S G LPN+V YNT++SS C EG+  ++EK+VE
Sbjct: 185 EFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCREGRNDDSEKMVE 244

Query: 247 RMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNLMLEGF 306
           +MRE GL PDIVTFN RI+ALCK G++L+ASRIF DM++DE LGLP+PN++TYNLML+GF
Sbjct: 245 KMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLGLPRPNSITYNLMLKGF 304

Query: 307 CNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKSIKPNL 366
           C  G+ E++K LF+S+++++   +++SYNIWL GLVR GK +EA  +L +M +K I P++
Sbjct: 305 CKVGLLEDAKTLFESIRENDDLASLQSYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSI 364

Query: 367 YSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESNYVLRE 426
           YSYNIL+ GLCK GM SDA++I+GLM+ +GV PD V+Y  LLHGYC  GK+  +  +L+E
Sbjct: 365 YSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQE 424

Query: 427 MIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGLCKSGN 486
           M++  C PN YTCNILLHSLWK G++SEAEELL+KMNE+GYGL+ VTCN ++ GLC SG 
Sbjct: 425 MMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKMNEKGYGLDTVTCNIIVDGLCGSGE 484

Query: 487 LDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCKAGRVD 546
           LDKAIEIV GM  HGSA+LGNLGNS+IGL D       CLPD ITY+T+++ LCKAGR  
Sbjct: 485 LDKAIEIVKGMRVHGSAALGNLGNSYIGLVDDSLIENNCLPDLITYSTLLNGLCKAGRFA 544

Query: 547 EAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLI 606
           EAK    EM+G+KL PDS+ ++ FIH +CKQGK+SSAFRVLK+MEKKGC+KSL TYNSLI
Sbjct: 545 EAKNLFAEMMGEKLQPDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLI 604

Query: 607 QGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEMLQKGIS 666
            GL  KNQIFEI+GLM+EMKEKGI PN+ TYN  I  L EG K++DAT+LLDEM+QK I+
Sbjct: 605 LGLGIKNQIFEIHGLMDEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIA 664

Query: 667 PNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSKAKELF 726
           PN+++F+ LI AF K  DF  AQE+FE A+SICG KE LYS MFNELL  G+  KA EL 
Sbjct: 665 PNVFSFKYLIEAFCKVPDFDMAQEVFETAVSICGQKEGLYSLMFNELLAAGQLLKATELL 724

Query: 727 EAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVIDGLGKS 786
           EA LDR   L  FLY+DL+E LC   +L+ AS ILHKM+D+ Y FDPA+ MPVIDGLGK 
Sbjct: 725 EAVLDRGFELGTFLYKDLVESLCKKDELEVASGILHKMIDRGYGFDPAALMPVIDGLGKM 784

Query: 787 GNKHAADEFAEKMMEMAS----ETDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIAQ 846
           GNK  A+ FA+KMMEMAS       ++ +   I + + N +   +W  I+HR+DGSGIA 
Sbjct: 785 GNKKEANSFADKMMEMASVGEVANKVDPNARDIHQKKHNKNGGNNWQNILHRDDGSGIAL 844

Query: 847 KTLKRVLKGWGQG 855
           ++L RV KGWGQG
Sbjct: 845 RSLSRVKKGWGQG 854

BLAST of Cp4.1LG15g06440 vs. TAIR10
Match: AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 282.0 bits (720), Expect = 1.6e-75
Identity = 186/695 (26.76%), Postives = 318/695 (45.76%), Query Frame = 1

Query: 114 PDISFYNFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQV 173
           P++   + L    +K       + L+ DMV   + P  Y +  +IR+LCE+  L  A+++
Sbjct: 190 PEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEM 249

Query: 174 FDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCG 233
              M   GC+ N     +++ G C+       + +  ++      P+ V Y T++  LC 
Sbjct: 250 IAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCK 309

Query: 234 EGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQP 293
             +     ++++ M  +  SP     +  +  L K G+I EA  + + +    + G+  P
Sbjct: 310 VQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVV---DFGV-SP 369

Query: 294 NTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLIL 353
           N   YN +++  C    F E++ LFD M K        +Y+I +    R GKL  A   L
Sbjct: 370 NLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFL 429

Query: 354 NEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCR 413
            EM +  +K ++Y YN LI G CK+G  S A   +  M    + P +V+Y++L+ GYC +
Sbjct: 430 GEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSK 489

Query: 414 GKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTC 473
           GKI ++  +  EM   G  P++YT   LL  L++ G + +A +L  +M E     N VT 
Sbjct: 490 GKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTY 549

Query: 474 NTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYAT 533
           N +I+G C+ G++ KA E +  M   G                        +PD+ +Y  
Sbjct: 550 NVMIEGYCEEGDMSKAFEFLKEMTEKG-----------------------IVPDTYSYRP 609

Query: 534 IISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKG 593
           +I  LC  G+  EAK  +  +       + + +   +H +C++GKL  A  V +EM ++G
Sbjct: 610 LIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRG 669

Query: 594 CNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDAT 653
            +  L  Y  LI G          +GL++EM ++G+ P+   Y ++I   S+ G  K+A 
Sbjct: 670 VDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAF 729

Query: 654 SLLDEMLQKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELL 713
            + D M+ +G  PN  T+  +I    KA     A+ L      +      +    F ++L
Sbjct: 730 GIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDIL 789

Query: 714 TGGET--SKAKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFD 773
           T GE    KA EL  A L + L      Y  LI   C  G++++AS ++ +M+      D
Sbjct: 790 TKGEVDMQKAVELHNAIL-KGLLANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPD 849

Query: 774 PASFMPVIDGLGKSGNKHAADEFAEKMMEMASETD 807
             ++  +I+ L +  +   A E    M E     D
Sbjct: 850 CITYTTMINELCRRNDVKKAIELWNSMTEKGIRPD 856

BLAST of Cp4.1LG15g06440 vs. TAIR10
Match: AT5G64320.1 (AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 282.0 bits (720), Expect = 1.6e-75
Identity = 162/563 (28.77%), Postives = 283/563 (50.27%), Query Frame = 1

Query: 114 PDISFYNFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQV 173
           P    YN +    +  +       ++ DM+  ++ P  +TF ++++A C +  +++A  +
Sbjct: 180 PTFKSYNVVLEILVSGNCHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVNEIDSALSL 239

Query: 174 FDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCG 233
              M++ GC PN      ++    +    +  ++LL+EM   G +P+   +N VI  LC 
Sbjct: 240 LRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCK 299

Query: 234 EGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQP 293
             +  EA K+V RM   G +PD +T+   +  LCK G++  A  +F          +P+P
Sbjct: 300 FDRINEAAKMVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLF--------YRIPKP 359

Query: 294 NTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLT-MESYNIWLLGLVRSGKLLEARLI 353
             V +N ++ GF   G  +++KA+   M  S   +  + +YN  + G  + G +  A  +
Sbjct: 360 EIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEV 419

Query: 354 LNEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCC 413
           L++M  K  KPN+YSY IL+ G CK G   +A +++  M   G+ P+ V ++ L+  +C 
Sbjct: 420 LHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCK 479

Query: 414 RGKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVT 473
             +I E+  + REM + GC P++YT N L+  L +  ++  A  LL+ M   G   N VT
Sbjct: 480 EHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVT 539

Query: 474 CNTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYA 533
            NT+I    + G + +A ++V+ M   GS                         D ITY 
Sbjct: 540 YNTLINAFLRRGEIKEARKLVNEMVFQGSPL-----------------------DEITYN 599

Query: 534 TIISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKK 593
           ++I  LC+AG VD+A+    +M+    +P ++  +  I+  C+ G +  A    KEM  +
Sbjct: 600 SLIKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLR 659

Query: 594 GCNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDA 653
           G    + T+NSLI GL    +I +   +  +++ +GI P+  T+N ++S L +GG + DA
Sbjct: 660 GSTPDIVTFNSLINGLCRAGRIEDGLTMFRKLQAEGIPPDTVTFNTLMSWLCKGGFVYDA 711

Query: 654 TSLLDEMLQKGISPNIYTFRILI 676
             LLDE ++ G  PN  T+ IL+
Sbjct: 720 CLLLDEGIEDGFVPNHRTWSILL 711

BLAST of Cp4.1LG15g06440 vs. TAIR10
Match: AT5G01110.1 (AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 278.1 bits (710), Expect = 2.3e-74
Identity = 162/526 (30.80%), Postives = 277/526 (52.66%), Query Frame = 1

Query: 155 NLLIRALCEMGYLENARQVFDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRS 214
           N LI +L  +G++E A  V+ ++S  G   N ++L ++V   C+ G  ++    L +++ 
Sbjct: 204 NALIGSLVRIGWVELAWGVYQEISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQE 263

Query: 215 SGALPNRVAYNTVISSLCGEGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILE 274
            G  P+ V YNT+IS+   +G   EA +L+  M   G SP + T+N  I  LCK G+   
Sbjct: 264 KGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYER 323

Query: 275 ASRIFRDMQIDEELGLPQPNTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLTMESYN 334
           A  +F +M      GL  P++ TY  +L   C +G   E++ +F  M+  +    +  ++
Sbjct: 324 AKEVFAEML---RSGL-SPDSTTYRSLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFS 383

Query: 335 IWLLGLVRSGKLLEARLILNEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRES 394
             +    RSG L +A +  N + E  + P+   Y ILI G C+ GM S A ++   M + 
Sbjct: 384 SMMSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQ 443

Query: 395 GVAPDIVSYSTLLHGYCCRGKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEA 454
           G A D+V+Y+T+LHG C R  + E++ +  EM +   FP+ YT  IL+    K G +  A
Sbjct: 444 GCAMDVVTYNTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNA 503

Query: 455 EELLQKMNERGYGLNNVTCNTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGL 514
            EL QKM E+   L+ VT NT++ G  K G++D A EI + M +                
Sbjct: 504 MELFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVS---------------- 563

Query: 515 FDIGNNGMKCLPDSITYATIISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYC 574
                   + LP  I+Y+ +++ LC  G + EA +   EMI K + P  +I ++ I  YC
Sbjct: 564 -------KEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYC 623

Query: 575 KQGKLSSAFRVLKEMEKKGCNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEK--GIFPN 634
           + G  S     L++M  +G      +YN+LI G   +  + + +GL+++M+E+  G+ P+
Sbjct: 624 RSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEEEQGGLVPD 683

Query: 635 VYTYNNIISCLSEGGKLKDATSLLDEMLQKGISPNIYTFRILIGAF 679
           V+TYN+I+       ++K+A  +L +M+++G++P+  T+  +I  F
Sbjct: 684 VFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGF 702

BLAST of Cp4.1LG15g06440 vs. TAIR10
Match: AT1G12300.1 (AT1G12300.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 273.5 bits (698), Expect = 5.6e-73
Identity = 159/580 (27.41%), Postives = 284/580 (48.97%), Query Frame = 1

Query: 96  ENAISQFRSLRARFPNDPPDISFYNFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFN 155
           ++AI  FR +    P   P +  ++ LF    K  + D V+ L K M    +    YT +
Sbjct: 70  DDAIDLFRDMIHSRPL--PTVIDFSRLFSAIAKTKQYDLVLALCKQMELKGIAHNLYTLS 129

Query: 156 LLIRALCEMGYLENARQVFDKMSEKGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSS 215
           ++I   C    L  A     K+ + G  PN  +   ++ G C  G     +EL+D M   
Sbjct: 130 IMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSEALELVDRMVEM 189

Query: 216 GALPNRVAYNTVISSLCGEGQTGEAEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEA 275
           G  P+ +  NT+++ LC  G+  EA  L+++M E G  P+ VT+   +  +CKSGQ   A
Sbjct: 190 GHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALA 249

Query: 276 SRIFRDMQIDEELGLPQPNTVTYNLMLEGFCNEGMFEESKALFDSMKKSETHLTMESYNI 335
             + R M+ +  + L   + V Y+++++G C  G  + +  LF+ M+       + +YNI
Sbjct: 250 MELLRKME-ERNIKL---DAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITTNIITYNI 309

Query: 336 WLLGLVRSGKLLEARLILNEMAEKSIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESG 395
            + G   +G+  +   +L +M ++ I PN+ ++++LI    K G   +A  +   M   G
Sbjct: 310 LIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELHKEMIHRG 369

Query: 396 VAPDIVSYSTLLHGYCCRGKILESNYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAE 455
           +APD ++Y++L+ G+C    + ++N ++  M+  GC PN+ T NIL++   K  ++ +  
Sbjct: 370 IAPDTITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKANRIDDGL 429

Query: 456 ELLQKMNERGYGLNNVTCNTVIKGLCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLF 515
           EL +KM+ RG   + VT NT+I+G C+ G L+ A E+   M +                 
Sbjct: 430 ELFRKMSLRGVVADTVTYNTLIQGFCELGKLNVAKELFQEMVSR---------------- 489

Query: 516 DIGNNGMKCLPDSITYATIISWLCKAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCK 575
                  K  P+ +TY  ++  LC  G  ++A +   ++   K+  D  I++  IH  C 
Sbjct: 490 -------KVPPNIVTYKILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYNIIIHGMCN 549

Query: 576 QGKLSSAFRVLKEMEKKGCNKSLRTYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYT 635
             K+  A+ +   +  KG    ++TYN +I GL  K  + E   L  +M+E G  P+ +T
Sbjct: 550 ASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKMEEDGHAPDGWT 609

Query: 636 YNNIISCLSEGGKLKDATSLLDEMLQKGISPNIYTFRILI 676
           YN +I      G    +  L++E+ + G S +  T +++I
Sbjct: 610 YNILIRAHLGDGDATKSVKLIEELKRCGFSVDASTIKMVI 620

BLAST of Cp4.1LG15g06440 vs. NCBI nr
Match: gi|449454285|ref|XP_004144886.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Cucumis sativus])

HSP 1 Score: 1543.5 bits (3995), Expect = 0.0e+00
Identity = 762/855 (89.12%), Postives = 804/855 (94.04%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MDRAA LSKA+YLNS +P LAWLLFKR+LSSPI ASSSFFK SLQS+P IARIL TAKMH
Sbjct: 3   MDRAAKLSKAIYLNSNNPNLAWLLFKRILSSPIPASSSFFKPSLQSVPAIARILITAKMH 62

Query: 61  PQIDHLHQLLLSQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFYN 120
           PQIDHLHQLLLSQHRDFAHPSGF+LVR LADLGL ENAISQFRSLR RFP+DPP ISFYN
Sbjct: 63  PQIDHLHQLLLSQHRDFAHPSGFSLVRTLADLGLLENAISQFRSLRDRFPHDPPPISFYN 122

Query: 121 FLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEK 180
            LFRCSLKESRVD VIWLYKDM  ARV PQTYTFNLLI ALCEMGYLENAR+VFDKMSEK
Sbjct: 123 LLFRCSLKESRVDCVIWLYKDMAVARVKPQTYTFNLLISALCEMGYLENAREVFDKMSEK 182

Query: 181 GCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEA 240
           GC PNEFSLG++VRGYCRAGLH  GI+LLDEMRSSGALPNRVAYNTVISSLCGEGQT EA
Sbjct: 183 GCKPNEFSLGILVRGYCRAGLHSHGIDLLDEMRSSGALPNRVAYNTVISSLCGEGQTVEA 242

Query: 241 EKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNL 300
           EKLVE+MREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEE+GLP+PNTVTYNL
Sbjct: 243 EKLVEKMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEEMGLPKPNTVTYNL 302

Query: 301 MLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKS 360
           MLEGFC+EGMFEE++A+FDSMK SET L++ SYNIW+LGLVRSGKLLEA LILNEMAEK+
Sbjct: 303 MLEGFCSEGMFEEARAIFDSMKNSET-LSLRSYNIWMLGLVRSGKLLEAHLILNEMAEKN 362

Query: 361 IKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESN 420
           IKPNLYSYNIL++GLCKYGMFSDARSI+GLMRESGVAPD V+YSTLLHGYC RGKILE+N
Sbjct: 363 IKPNLYSYNILVHGLCKYGMFSDARSILGLMRESGVAPDTVTYSTLLHGYCRRGKILEAN 422

Query: 421 YVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGL 480
           YVLREMIQVGCFPNMYTCNILLHSLWKEG+ SEAE+LLQ MNERGYGL+NVTCNT+I GL
Sbjct: 423 YVLREMIQVGCFPNMYTCNILLHSLWKEGRASEAEDLLQMMNERGYGLDNVTCNTMINGL 482

Query: 481 CKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCK 540
           CK+GNLDKAIEIVSGMW  GSASLGNLGNSFI LFDI NNG KCLPDSITYATII  LCK
Sbjct: 483 CKAGNLDKAIEIVSGMWTRGSASLGNLGNSFIDLFDIRNNGKKCLPDSITYATIIGGLCK 542

Query: 541 AGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRT 600
            GRVDEAKKKLLEMIGKKLSPDSLIFDTFI++YCKQGKLSSAFRVLKEMEKKGCNKSLRT
Sbjct: 543 VGRVDEAKKKLLEMIGKKLSPDSLIFDTFIYNYCKQGKLSSAFRVLKEMEKKGCNKSLRT 602

Query: 601 YNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEML 660
           YNSLIQGL S+NQIFEIYGLM+EMKE+GIFPNVYTYNNIISCLSEGGKLKDAT LLDEML
Sbjct: 603 YNSLIQGLGSENQIFEIYGLMDEMKERGIFPNVYTYNNIISCLSEGGKLKDATCLLDEML 662

Query: 661 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSK 720
           QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALS+CGHKESLYSFMFNELL GGET K
Sbjct: 663 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSLCGHKESLYSFMFNELLAGGETLK 722

Query: 721 AKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVID 780
           AKELFEAALDRSLALKNFLYRDLIE+LC DGKLDDASFILHKMMDKQY FDPASFMPVID
Sbjct: 723 AKELFEAALDRSLALKNFLYRDLIEKLCKDGKLDDASFILHKMMDKQYSFDPASFMPVID 782

Query: 781 GLGKSGNKHAADEFAEKMMEMASETDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIA 840
            LGK G+KHAADEFAE+MMEMASETD N+HENK IRGR NN+DE DW KIVHRNDGSGIA
Sbjct: 783 ELGKRGSKHAADEFAERMMEMASETDFNEHENKNIRGRLNNNDESDWQKIVHRNDGSGIA 842

Query: 841 QKTLKRVLKGWGQGS 856
           QKTLKRVLKGWGQGS
Sbjct: 843 QKTLKRVLKGWGQGS 856

BLAST of Cp4.1LG15g06440 vs. NCBI nr
Match: gi|659094034|ref|XP_008447846.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Cucumis melo])

HSP 1 Score: 1538.1 bits (3981), Expect = 0.0e+00
Identity = 756/855 (88.42%), Postives = 804/855 (94.04%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MDRA  LSKA+YLNS +P LAWLLFKR+LSSPI ASSSFFK SLQS+P+IARIL T+KMH
Sbjct: 1   MDRATKLSKAIYLNSNNPNLAWLLFKRILSSPIPASSSFFKPSLQSVPIIARILITSKMH 60

Query: 61  PQIDHLHQLLLSQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFYN 120
           PQIDHLHQLLLSQHRDFAHPSGF+LVR LADLGLFENAISQFRSLRARFP+DPP ISFYN
Sbjct: 61  PQIDHLHQLLLSQHRDFAHPSGFSLVRTLADLGLFENAISQFRSLRARFPHDPPPISFYN 120

Query: 121 FLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEK 180
            LFRCSLKE RVD VIWLYKDMV ARVNPQTYTFNLLI ALCEMGYLENAR+VFDKMSEK
Sbjct: 121 LLFRCSLKEGRVDCVIWLYKDMVVARVNPQTYTFNLLISALCEMGYLENAREVFDKMSEK 180

Query: 181 GCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEA 240
           GC PNEFSLG++VRGYCRAGLH  GI+LLDEMRSSGA PNRVAYNTVISSLCGEGQT EA
Sbjct: 181 GCKPNEFSLGILVRGYCRAGLHSNGIDLLDEMRSSGAFPNRVAYNTVISSLCGEGQTEEA 240

Query: 241 EKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNL 300
           EKLVE+MREVGLSPD VTFNCRIAALCKSGQILEASRIFRDMQIDEE+GLP+PNTVTYNL
Sbjct: 241 EKLVEKMREVGLSPDTVTFNCRIAALCKSGQILEASRIFRDMQIDEEMGLPKPNTVTYNL 300

Query: 301 MLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKS 360
           MLEGFC+EGMFEE++A+FDSMK SET L ++SYNIWLLGLVRSGKLLEARLILNEMAEK+
Sbjct: 301 MLEGFCSEGMFEEARAIFDSMKISET-LNLKSYNIWLLGLVRSGKLLEARLILNEMAEKN 360

Query: 361 IKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESN 420
           IKPNLYSYNIL++GLC+YGMFSDARSI+G+MRESGVAPD V+YSTLLHGYC RGK+LE+N
Sbjct: 361 IKPNLYSYNILVHGLCRYGMFSDARSILGVMRESGVAPDTVTYSTLLHGYCRRGKMLEAN 420

Query: 421 YVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGL 480
           YVLREMIQVGCFPNMYTCNILL+SLWKEG+ SEAE+LLQKMNERGYGL+NVTCNT+I GL
Sbjct: 421 YVLREMIQVGCFPNMYTCNILLNSLWKEGRASEAEDLLQKMNERGYGLDNVTCNTMINGL 480

Query: 481 CKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCK 540
           CK+GNLDKAIEIVSGMW HGSASLGNLGNSFIGLFDI N+G KCLPD ITYATII  LCK
Sbjct: 481 CKAGNLDKAIEIVSGMWTHGSASLGNLGNSFIGLFDIRNSGKKCLPDPITYATIIGGLCK 540

Query: 541 AGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRT 600
            GRVDEAKKKLLEMIGK LSPDSLIFDTFI++YCKQGKLSSAFRVLKEMEKKGCNKSLRT
Sbjct: 541 VGRVDEAKKKLLEMIGKNLSPDSLIFDTFIYNYCKQGKLSSAFRVLKEMEKKGCNKSLRT 600

Query: 601 YNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEML 660
           YNSLIQG  S+NQIFEIYGLM+EMKE+GIFPNVYTYNNI+ CLSEGGKLKDAT LLDEML
Sbjct: 601 YNSLIQGFGSENQIFEIYGLMDEMKERGIFPNVYTYNNIMRCLSEGGKLKDATCLLDEML 660

Query: 661 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSK 720
           QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALS+CGHKESLYSFMFNELL GGET K
Sbjct: 661 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSLCGHKESLYSFMFNELLAGGETLK 720

Query: 721 AKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVID 780
           AKELFEAALDRSLALKNFLYRDLIERLC DGKLDDASFILHKMMDKQY FDPASFMPVID
Sbjct: 721 AKELFEAALDRSLALKNFLYRDLIERLCKDGKLDDASFILHKMMDKQYSFDPASFMPVID 780

Query: 781 GLGKSGNKHAADEFAEKMMEMASETDINQHENKIIRGRSNNDDERDWHKIVHRNDGSGIA 840
            LGK G+KHAADEFAE+MMEMASET  NQHENK IRGR NN+DE DW KI+HRNDGSGIA
Sbjct: 781 ELGKRGSKHAADEFAERMMEMASETGFNQHENKNIRGRLNNNDESDWQKIIHRNDGSGIA 840

Query: 841 QKTLKRVLKGWGQGS 856
           QKTLKRVLKGWGQGS
Sbjct: 841 QKTLKRVLKGWGQGS 854

BLAST of Cp4.1LG15g06440 vs. NCBI nr
Match: gi|225428276|ref|XP_002279589.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Vitis vinifera])

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 568/859 (66.12%), Postives = 682/859 (79.39%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MD+   L+KA+  N+ +P LAW LFKR+LS P S+SS   ++ L+SIP+I  IL  AKM 
Sbjct: 1   MDQRNKLTKALIKNTHNPTLAWHLFKRILSIPTSSSSISSRSILRSIPIITHILIRAKMI 60

Query: 61  PQIDHLHQLLLSQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFYN 120
            QIDHL QLLL Q ++ +H S  AL+R LA  GL + A SQF+S R++ P +PP +  YN
Sbjct: 61  SQIDHLQQLLLQQPQEVSHVSLIALIRILAKSGLSDLAFSQFQSFRSQVPANPPPVYLYN 120

Query: 121 FLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSEK 180
            +   SL+E +VD   WLYKDMV A V+P+TYT NLLI  LC+ G  E+AR+VFDKM  K
Sbjct: 121 MVLESSLREDKVDSFSWLYKDMVVAGVSPETYTLNLLIAGLCDSGRFEDAREVFDKMGVK 180

Query: 181 GCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGEA 240
           GC PNEFS G++VRGYCRAGL  R +ELLD M S G  PN+V YNT+ISS C EG+  EA
Sbjct: 181 GCRPNEFSFGILVRGYCRAGLSMRALELLDGMGSFGVQPNKVIYNTLISSFCREGRNEEA 240

Query: 241 EKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYNL 300
           E+LVERMRE GL PD+VTFN RI+ALC +G+ILEASRIFRDMQIDEELGLP+PN  T+NL
Sbjct: 241 ERLVERMREDGLFPDVVTFNSRISALCSAGKILEASRIFRDMQIDEELGLPRPNITTFNL 300

Query: 301 MLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEKS 360
           MLEGFC EGM EE+K L +SMK++   + +ESYNIWLLGLVR+GKLLEA+L L EM +K 
Sbjct: 301 MLEGFCKEGMLEEAKTLVESMKRNGNLMELESYNIWLLGLVRNGKLLEAQLALKEMVDKG 360

Query: 361 IKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILESN 420
           I+PN+YS+N ++ GLCK G+ SDAR I+GLM  SG+ PD V+YSTLLHG C  GK+L++N
Sbjct: 361 IEPNIYSFNTVMDGLCKNGLISDARMIMGLMISSGIGPDTVTYSTLLHGCCSTGKVLKAN 420

Query: 421 YVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKGL 480
            +L EM++ GC PN YTCNILLHSLWKEG++ EAE+LLQKMNER Y L+NVTCN VI GL
Sbjct: 421 NILHEMMRRGCSPNTYTCNILLHSLWKEGRIFEAEKLLQKMNERSYDLDNVTCNIVIDGL 480

Query: 481 CKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLCK 540
           CKSG LD+A+EIV GMW HGSA+LGNLGNSFIGL D  +NG KCLPD ITY+ II+ LCK
Sbjct: 481 CKSGKLDEAVEIVEGMWIHGSAALGNLGNSFIGLVDSSSNGKKCLPDLITYSIIINGLCK 540

Query: 541 AGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLRT 600
           AGR+DEA+KK +EM+GK L PDS+I+DTFIHS+CK GK+SSAFRVLK+MEK+GCNKSL+T
Sbjct: 541 AGRLDEARKKFIEMVGKSLHPDSIIYDTFIHSFCKHGKISSAFRVLKDMEKRGCNKSLQT 600

Query: 601 YNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEML 660
           YNSLI GL SKNQIFEIYGL+++MKEKGI PN+ TYNN+ISCL EGG++KDATSLLDEML
Sbjct: 601 YNSLILGLGSKNQIFEIYGLLDDMKEKGITPNICTYNNMISCLCEGGRIKDATSLLDEML 660

Query: 661 QKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETSK 720
           QKGISPNI +FR+LI AF KA DFG  +E+FEIALSICGHKE+LYS MFNELL GGE S+
Sbjct: 661 QKGISPNISSFRLLIKAFCKASDFGVVKEVFEIALSICGHKEALYSLMFNELLIGGEVSE 720

Query: 721 AKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVID 780
           AKELF+AALDR   L NF Y DLIE+LC D  L++AS ILHKM+DK YRFDPASFMPVID
Sbjct: 721 AKELFDAALDRCFDLGNFQYNDLIEKLCKDEMLENASDILHKMIDKGYRFDPASFMPVID 780

Query: 781 GLGKSGNKHAADEFAEKMMEMAS----ETDINQHENKIIRGRSNNDDERDWHKIVHRNDG 840
           GLGK G KH ADE AE+MM+MAS    E  I ++E+   R + N     DW  I+HR+DG
Sbjct: 781 GLGKRGKKHDADELAERMMDMASEGMVENKITRNESAFNRQKRNKFSGSDWQTIIHRDDG 840

Query: 841 SGIAQKTLKRVLKGWGQGS 856
           SG+A K LKRV KGWGQGS
Sbjct: 841 SGLALKALKRVQKGWGQGS 859

BLAST of Cp4.1LG15g06440 vs. NCBI nr
Match: gi|645260741|ref|XP_008235960.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Prunus mume])

HSP 1 Score: 1142.5 bits (2954), Expect = 0.0e+00
Identity = 563/860 (65.47%), Postives = 684/860 (79.53%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MD    L+KA++ N+ +P+LAW LFKR+LSSP S+SSS     L+S+P++ RIL  +KMH
Sbjct: 1   MDPTTSLTKALFKNTNNPKLAWHLFKRILSSPTSSSSSS-DLCLRSLPIVTRILIDSKMH 60

Query: 61  PQIDHLHQLLL-SQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFY 120
            +ID L QLLL SQ  +   P   +LVR LA   L + A+S F+ LR+RFP++PP +  Y
Sbjct: 61  HEIDSLRQLLLVSQPSETLRPCLVSLVRLLAKSNLSDMAVSYFKDLRSRFPDEPPSVYLY 120

Query: 121 NFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSE 180
           N L   SL+E  VDFV+WLYKDM+ + + P+TYTFNLLI +LCE   L +AR+VFDKM E
Sbjct: 121 NLLLESSLREKHVDFVLWLYKDMIVSGMKPETYTFNLLICSLCESDRLGDAREVFDKMRE 180

Query: 181 KGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGE 240
           KGC PNE+S+G++VRGYCRAGL  RG+E+LD+MRS   LPNRV YNT+ISS C +G+T +
Sbjct: 181 KGCQPNEYSVGILVRGYCRAGLAVRGLEVLDQMRSCNLLPNRVVYNTLISSFCKQGKTDD 240

Query: 241 AEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYN 300
           AEKLVERMRE G+ PD VTFN RI+ALC +G+ILEASRIFRDM ID+E+GLPQPN VTYN
Sbjct: 241 AEKLVERMREDGMLPDAVTFNSRISALCSAGKILEASRIFRDMHIDQEMGLPQPNVVTYN 300

Query: 301 LMLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEK 360
           LML+GFC E M EE++ LF SM+K+   + +ESYNIWLLGLV++GKLLEARL+L EM +K
Sbjct: 301 LMLQGFCREDMLEEAETLFKSMEKAGNFINLESYNIWLLGLVKNGKLLEARLVLKEMVDK 360

Query: 361 SIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILES 420
            I+PN+YSYNI+I GLCK GM  DAR ++ LM  + ++PD V+YSTLLHG+C +GK+ E+
Sbjct: 361 GIEPNIYSYNIVINGLCKNGMLRDARMVMTLMVRNNISPDTVTYSTLLHGFCNKGKVFEA 420

Query: 421 NYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKG 480
           + +L EM+   CFPN +TCNILLHSLWKEG+ SEAEELLQKMNERGYGL+ VTCN VI G
Sbjct: 421 SNILHEMMMNNCFPNTHTCNILLHSLWKEGRTSEAEELLQKMNERGYGLDTVTCNIVIDG 480

Query: 481 LCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLC 540
           LC  G LDKAIEIVSGMW HGSA+LGNLGNSFIGL D  NNG  C+PD ITY+TIIS LC
Sbjct: 481 LCNDGKLDKAIEIVSGMWTHGSAALGNLGNSFIGLVDDSNNGKLCIPDLITYSTIISGLC 540

Query: 541 KAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLR 600
           KAGR+DEAKKK +EM+GK L PDS+I+D FI+S+CKQG++SSAF+VLK+MEKKGCNKS++
Sbjct: 541 KAGRLDEAKKKFMEMMGKNLHPDSVIYDMFINSFCKQGRISSAFQVLKDMEKKGCNKSIQ 600

Query: 601 TYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEM 660
           TYNSLI GL SK QIFEIYGLM+EM+E+G+ P+V TYN +++CL EG ++KDATSLLDEM
Sbjct: 601 TYNSLILGLGSKKQIFEIYGLMDEMRERGVTPDVCTYNYMMNCLCEGERVKDATSLLDEM 660

Query: 661 LQKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETS 720
           LQKGISPNI TFRILI AF KACDFG A E+F+IAL++CGHKE LYS MFNELL GGE  
Sbjct: 661 LQKGISPNISTFRILIKAFCKACDFGVAHEVFDIALTVCGHKEVLYSLMFNELLAGGEIL 720

Query: 721 KAKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVI 780
           KAK LFE ALDR   L NFLY+DLI+RLC D KL+DAS ILH M +K Y FDPASF+PVI
Sbjct: 721 KAKALFEVALDRYFYLGNFLYKDLIDRLCKDEKLEDASSILHTMKNKGYGFDPASFLPVI 780

Query: 781 DGLGKSGNKHAADEFAEKMMEMASE----TDINQHENKIIRGRSNNDDERDWHKIVHRND 840
           DGL K GNK  ADE AE MMEM SE      + Q E +II G+ +N+   DW  IVHR+D
Sbjct: 781 DGLSKRGNKQEADELAEAMMEMESEGRVADKVYQIEREIIGGKPSNNGGSDWQTIVHRDD 840

Query: 841 GSGIAQKTLKRVLKGWGQGS 856
           GSGIA KTLKRV KGWG+GS
Sbjct: 841 GSGIALKTLKRVQKGWGRGS 859

BLAST of Cp4.1LG15g06440 vs. NCBI nr
Match: gi|595793129|ref|XP_007200313.1| (hypothetical protein PRUPE_ppa001249mg [Prunus persica])

HSP 1 Score: 1140.9 bits (2950), Expect = 0.0e+00
Identity = 561/860 (65.23%), Postives = 683/860 (79.42%), Query Frame = 1

Query: 1   MDRAAMLSKAVYLNSKSPELAWLLFKRVLSSPISASSSFFKTSLQSIPLIARILTTAKMH 60
           MD    L+KA++ N+ +P+LAW LFKR+LSSP S+SSS     L+S+P++ RIL  +KMH
Sbjct: 1   MDPTTSLTKALFKNTNNPKLAWHLFKRILSSPTSSSSS--DLCLRSLPIVTRILIDSKMH 60

Query: 61  PQIDHLHQLLL-SQHRDFAHPSGFALVRALADLGLFENAISQFRSLRARFPNDPPDISFY 120
            +ID L QLLL SQ  +   P   +LVR LA   L + A+S F+ LR+RFP++PP +  Y
Sbjct: 61  HEIDSLRQLLLVSQPSETLRPCLVSLVRFLAKSSLSDMAVSCFKDLRSRFPDEPPSVYLY 120

Query: 121 NFLFRCSLKESRVDFVIWLYKDMVFARVNPQTYTFNLLIRALCEMGYLENARQVFDKMSE 180
           N L   SL+E  VDFV+WLYKDM+ + + P+TYTFNLLI +LCE   L++AR+VFDKM E
Sbjct: 121 NLLVESSLREKHVDFVLWLYKDMIVSGMKPETYTFNLLICSLCESDRLDDAREVFDKMRE 180

Query: 181 KGCNPNEFSLGLIVRGYCRAGLHDRGIELLDEMRSSGALPNRVAYNTVISSLCGEGQTGE 240
           KGC PNE+S+G++VRGYCRAGL  RG+E+LD+MRS   LPNRV YNT+ISS C + +T +
Sbjct: 181 KGCQPNEYSVGILVRGYCRAGLAVRGLEVLDQMRSCNLLPNRVVYNTLISSFCKQSKTDD 240

Query: 241 AEKLVERMREVGLSPDIVTFNCRIAALCKSGQILEASRIFRDMQIDEELGLPQPNTVTYN 300
           AEKLVERMRE G+ PD VTFN RI+ALC +G+ILEASRIFRDM ID+E+GLPQPN VTYN
Sbjct: 241 AEKLVERMREDGMLPDAVTFNSRISALCSAGKILEASRIFRDMHIDQEMGLPQPNVVTYN 300

Query: 301 LMLEGFCNEGMFEESKALFDSMKKSETHLTMESYNIWLLGLVRSGKLLEARLILNEMAEK 360
           LML+GFC E M EE++ LF SM+K+   + +ESYNIWLLGLV++GKLLEARL+L EM +K
Sbjct: 301 LMLQGFCREDMLEEAENLFKSMEKAGNFINLESYNIWLLGLVKNGKLLEARLVLKEMVDK 360

Query: 361 SIKPNLYSYNILIYGLCKYGMFSDARSIIGLMRESGVAPDIVSYSTLLHGYCCRGKILES 420
            I+PN+YSYNI+I GLCK GM  DAR ++ LM  + ++PD V+YSTLLHG+C +GK+ E+
Sbjct: 361 GIEPNIYSYNIVINGLCKNGMLRDARMVMTLMVRNNISPDTVTYSTLLHGFCNKGKVFEA 420

Query: 421 NYVLREMIQVGCFPNMYTCNILLHSLWKEGKVSEAEELLQKMNERGYGLNNVTCNTVIKG 480
           + +L EM+   CFPN +TCNILLHSLWKEG+ SEAEELLQKMNERGYGL+ VTCN VI G
Sbjct: 421 SNILHEMMMNNCFPNTHTCNILLHSLWKEGRTSEAEELLQKMNERGYGLDTVTCNIVIDG 480

Query: 481 LCKSGNLDKAIEIVSGMWNHGSASLGNLGNSFIGLFDIGNNGMKCLPDSITYATIISWLC 540
           LC  G LDKAIEIVSGMW HGSA+LGNLGNSFIGL D  NNG KC+PD ITY+TIIS LC
Sbjct: 481 LCNDGKLDKAIEIVSGMWTHGSAALGNLGNSFIGLVDDSNNGKKCIPDLITYSTIISGLC 540

Query: 541 KAGRVDEAKKKLLEMIGKKLSPDSLIFDTFIHSYCKQGKLSSAFRVLKEMEKKGCNKSLR 600
           KAGR+DEAKKK +EM+GK L PDS+I+D FI+S+CKQG++SSAFRVLK+MEKKGCNKS++
Sbjct: 541 KAGRLDEAKKKFMEMMGKNLHPDSVIYDMFINSFCKQGRISSAFRVLKDMEKKGCNKSIQ 600

Query: 601 TYNSLIQGLSSKNQIFEIYGLMEEMKEKGIFPNVYTYNNIISCLSEGGKLKDATSLLDEM 660
           TYNSL+ GL SK QIFEIYGLM+EM+E+G+ P+V TYN +++CL EG ++KDATSLLDEM
Sbjct: 601 TYNSLVLGLGSKKQIFEIYGLMDEMRERGVTPDVCTYNYMMNCLCEGERVKDATSLLDEM 660

Query: 661 LQKGISPNIYTFRILIGAFFKACDFGAAQELFEIALSICGHKESLYSFMFNELLTGGETS 720
           LQKGISPNI TFRILI AF KACDFG   E+F+IALS+CGHKE LYS MFNELL GGE  
Sbjct: 661 LQKGISPNISTFRILIKAFCKACDFGVTHEVFDIALSVCGHKEVLYSLMFNELLAGGEIL 720

Query: 721 KAKELFEAALDRSLALKNFLYRDLIERLCMDGKLDDASFILHKMMDKQYRFDPASFMPVI 780
           KAK LFE ALDR   L NFLY+DLI+RLC D KL+DAS ILH M +K Y FDPASF+PVI
Sbjct: 721 KAKALFEVALDRYFYLGNFLYKDLIDRLCKDEKLEDASSILHTMKNKGYGFDPASFLPVI 780

Query: 781 DGLGKSGNKHAADEFAEKMMEMASETDINQH----ENKIIRGRSNNDDERDWHKIVHRND 840
           DGL K GNK  ADE AE MM+M SE  +       E +II G+ +N+   DW  IVHR+D
Sbjct: 781 DGLSKRGNKQEADELAEAMMDMESEGRVGDKVYRIEREIIGGKPSNNGGSDWQTIVHRDD 840

Query: 841 GSGIAQKTLKRVLKGWGQGS 856
           GSGIA KTLKRV KGWG+GS
Sbjct: 841 GSGIALKTLKRVQKGWGRGS 858

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP158_ARATH3.7e-29257.80Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana GN... [more]
RF1_ORYSI2.1e-7427.06Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1[more]
PP444_ARATH2.8e-7428.77Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
PP437_ARATH2.8e-7426.76Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th... [more]
PP360_ARATH4.0e-7330.80Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0K4X0_CUCSA0.0e+0089.12Uncharacterized protein OS=Cucumis sativus GN=Csa_7G024100 PE=4 SV=1[more]
D7U4S8_VITVI0.0e+0066.12Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g03720 PE=4 SV=... [more]
M5VK94_PRUPE0.0e+0065.23Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001249mg PE=4 SV=1[more]
A0A0D2TQE1_GOSRA0.0e+0065.34Uncharacterized protein OS=Gossypium raimondii GN=B456_012G159300 PE=4 SV=1[more]
A0A061DYE2_THECC0.0e+0064.87Pentatricopeptide repeat (PPR) superfamily protein isoform 1 OS=Theobroma cacao ... [more]
Match NameE-valueIdentityDescription
AT2G17140.12.1e-29357.80 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G59900.11.6e-7526.76 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G64320.11.6e-7528.77 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G01110.12.3e-7430.80 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G12300.15.6e-7327.41 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449454285|ref|XP_004144886.1|0.0e+0089.12PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Cucumis sativu... [more]
gi|659094034|ref|XP_008447846.1|0.0e+0088.42PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Cucumis melo][more]
gi|225428276|ref|XP_002279589.1|0.0e+0066.12PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Vitis vinifera... [more]
gi|645260741|ref|XP_008235960.1|0.0e+0065.47PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Prunus mume][more]
gi|595793129|ref|XP_007200313.1|0.0e+0065.23hypothetical protein PRUPE_ppa001249mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g06440.1Cp4.1LG15g06440.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 523..554
score: 6.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 631..679
score: 1.7E-14coord: 293..335
score: 1.2E-9coord: 149..198
score: 6.1E-12coord: 219..268
score: 3.4E-13coord: 561..609
score: 2.5E-12coord: 367..411
score: 3.5E-11coord: 433..482
score: 1.2
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 600..633
score: 8.7E-6coord: 471..500
score: 5.6E-7coord: 366..400
score: 8.8E-10coord: 257..283
score: 1.0E-4coord: 192..220
score: 2.2E-5coord: 566..594
score: 4.9E-7coord: 152..186
score: 3.1E-9coord: 332..364
score: 5.4E-5coord: 222..256
score: 1.9E-8coord: 436..468
score: 4.1E-5coord: 634..668
score: 4.9E-8coord: 401..434
score: 1.2E-6coord: 296..324
score: 3.5E-7coord: 529..563
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 469..503
score: 10.874coord: 255..285
score: 9.339coord: 667..697
score: 6.292coord: 771..805
score: 7.487coord: 364..398
score: 12.386coord: 220..254
score: 12.485coord: 434..468
score: 11.104coord: 562..596
score: 11.674coord: 597..631
score: 10.928coord: 185..219
score: 11.444coord: 632..666
score: 12.858coord: 150..184
score: 12.934coord: 115..149
score: 7.837coord: 399..433
score: 11.246coord: 736..770
score: 7.618coord: 294..328
score: 11.772coord: 527..561
score: 11.444coord: 701..735
score: 6.336coord: 329..363
score: 10
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 83..177
score: 2.2E-12coord: 557..558
score: 2.2E-12coord: 635..732
score: 2.2
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 571..733
score: 4.9
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 13..500
score: 1.8E-286coord: 524..673
score: 1.8E
NoneNo IPR availablePANTHERPTHR24015:SF272SUBFAMILY NOT NAMEDcoord: 13..500
score: 1.8E-286coord: 524..673
score: 1.8E

The following gene(s) are paralogous to this gene:

None