Cp4.1LG16g08460 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG16g08460
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat superfamily protein
Location: Cp4.1LG16 : 8059764 .. 8071381 (-)
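
The location field packs the sequence identifier, the coordinates, and the strand into one string. A minimal sketch of how it could be split apart follows. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

# Pattern for strings such as "Cp4.1LG16 : 8059764 .. 8071381 (-)".
_LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = _LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG16 : 8059764 .. 8071381 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 11618 bases on the minus strand of Cp4.1LG16.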
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTACAGTTTTGTATTCGAAGCATTGGATTTCACTTCGCTCAAATCGCTCGATTTCAATTCAGAAACTACGTTCGAAGTACAGAGAAAAATTCTGCCGTTCACATCAACCTCCTCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCCTTATTCTTAATTACGCCAAGTTTCAACACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCTCACTCCATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGATTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGAATTGTGGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGGTGAGTGTTATCAGTCTTTTGCCGATTTCTGCTGGCCTTGAAGACGAGGAGATGACAAGACGAATTCATTGTTACATTGTGAAAGTTGGTTTGGATTCTTTGGTAACCTCTTGCAATGCACTTGTCGATGCGTATGGGAAATGTGGGAGTGTGAAAGCTTCATGGCAAGTTTTTGATGAGATAATTGAGAAGAATGAAGTCTCATGGAATTCAATCATCAATGGTCTAACTTTTAAGGGTCATTTCTGGGATGCCTTGGATGTTTTTAGGACGATGATCGATGCAGGAACTAAACCGAACTCGGTCACCATTTCGAGCATTCTTCCTGTGTTTGTTGAGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCACTGCCAATTCCCTGATCGATATGTATGCCAAGTCTGGTCATTCAACTGAGGCATCTAGCATATTCCACAGCATGGATAGAAGGAATATAGTTTCTTGGAACACTATGATCGCTAATTATGCGCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGCCCTAATGCAGTGACCTTTACCAATGTTCTTCCTGCTTGTGCACGTTTGGGTCACCTTGGTCCTGGCAAAGAAATACATGCCATGGGCGTTCGTTTAGGACTAACATCTGATTTGTTTGTAACCAATGCTCTGACCGACATGTATGCGAAATGTGGTTGCTTTCGTTCTGCTCGAAATGTGTTCAACACGTCCCATAAAGATGAAGTTTCTTATAACATATTAATTACAGGATATTCTGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCTTGTGCAAACCTAGCCGCAGTCAAGCAAGGTAAAGAGATTCATGGTGTTGCATTAAGAAATCATCTTAACTCTCATCTATTTGTCTCAAACTCCCTATTGGACTTTTATACAAAATGTGGAAGAATTGATCTTGCTTGTAAGATCTTCAATCAAATTCTATTCAAAGATGTAGCATCTTGGAATACTATGATTTTAGGATATGGAATGATAGGAGAGTTGGAATCTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGA
GATGTTGGATCAACATCTTGAACCCACCGAAATGCACTATACATGTCTGGTTGATCTACTCGGCCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGTAAGGCAGCAGAGCATTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAACATGTATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGGCCAGCTGCACGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATTCGTTTGATCTGATATGTTATTACAATGTCAAATTCATTATAACAAAGAGTTTAGTCTGCGTTTAATCGGTTTGGATACAGCAGGTTTGTAGTGTCCATGAAATAGAATTTTATTTATTTCATCTCATTACAATTAAATATGAAAGGATGAGGTTACAGACATGTTATTGTTAAGGGGAAAAGAAAAGAAAAAAGAAAGAATGTGAATGAGGTAATTAATGATCAACCCAATCGACTGCCACTCTTCCTGCAACTGTGAAAGCCTATAGTTTGGGAGGAGGAAGCAACCACTTTTGGGCCTGCACGTAATCAAGGTTGATGAAGGGTTTCGCTTGTGCTTCACTCAGCTGTTTGCTGAATTTTACACGCTTGCTCGTCTCCGCTCCCGGCCCCGAACACTTAAATTCTCCGAAGAAAACCGTCTTGTCAAAAGCGGGGTTGTTCATGTCGTCCCAGCCTTCAGGATGAAGGATGTCGGCAATGGTGGTGTAGGCAAACACAACTCTGGACCAAGGCCTCCAGGATCTTGCCAAATAAGTGTTCGGCCCGCCAGTGCCTGTAACGCTGCAATGCGCAAACACATACCCACTGGGGTCGTCTTCCTTTTCTCTTGAATGCGCTGCAATCACTCCCAATCCCCCTTCTCCCAGCACATTCAGCTGACTGTTCAAATACAGGGACGTTGCCTTCCCGAAGATGAAGTCCACAGTGCCTTCAATGATGCAATCCTTGTACACATGCATCCCGTCGTCGTCGCACAGGGTGTCTTGGAACCCTAGGAATTTGCAATTGTAAATTGCGGCTTTATTCCCTCTGATTCTCGCCGCCAGTGCCGGACCTCCTTTTCTCGCATCCGGTCTTGGAGAAGTGTTCTGATTTTATTCAAAAATTGATTAGAAAACGATACCCAATTAATCCTAAAATGCATTTAATTAACAAATTACCTCGATCACGAGATTGGCAGCAACAAAATACTCAGCCTCGACGGTCAAAGTTCCACTGTACACGGTGCCGTATTTTTTGGAATCTCCATCGAATGTCAATTTTGGCATATTCTTAGGGGAACCGTAAAGGGTAATGAAGGGCTGATTTCTCTCGATCGTAAGCTTCTCCTTATAAACTCCTTCCCCTATCCAAATCACGACGCGTTTGGTGTTACCGGCTGGAACGCTGGCGATGGCTTCGGTGACAGTCTTGAAGTCTCCGCTTCCGTCGGCCCTCACTTTCACGACGGTGGAGCCTTCTTCAGCGGCCACAAGGGCGGGATCAAGCTCGGCCTTGCGGTCGGCTAAGGGCTTCACATGTCCGGAAAACCAAGCTTCTAACTGCGACTTCTCTGCAGGAACCTCTGTTGTAGCAGAAGAGATGGGTGCAAAAGCTGCCAGGAAGATGAGCAATGTTGTGAGAGCAATATTCATATTTTTTGTTGTTTTCTTGAAGCGATTTATTTTATTTTTAAAATAAGGATGGGTGAATGAGTACGTTGAATTGCATGGAGAAGGTAGGGATTATATAGGGTGATGGGTCCCACTGTGAACCATAAGAACCCACCTAATATTTGTGAATTCGTTTTACCTTTCCGTATCTGGTGGA
AGACTATTTCCTCCAAACAAACAAAAATATAATTTTAAAATAACATTTTATTTTCTTCCCAATAAAAGGCTCTACAAACAAATAAATAATTAACAAGAGAGGCTCTAGAACATCATAGTGCCATTGTCCCTCGGCAACTTGTAGCTGAAAGGACCGGTGCTGTTCTGAAGTTCGTGAGTTTAGATGAACATGATGTCCCAAACTTGAAAGATTTCAAGGAGATGTTTACAACAAAAACAAAGCCTGTGGTTACCCATCATGTCTCAAATGTACTAGTACACCCTTCTATTTGATGTTGTTTTTCGATAAGGCTAGGGAAATCCTTTCTTTTCCATGAGGTTGTTGGAAGAAAAATAAAATGCATAAAATAAGGTGCCGTGCCCTTCGATCTCGACGACGGAGCCGTCGCCGAATTTTACTGTCCCGCGAATTCCCGAGTCAAGCTCGGAGAATGCAGACTTGGCCCCGGTCATATGGTTTGTTGCCCCCGTGTCAAGGACCATCGCCGGTGCTCTTGCTGTTCACCTCTTTCACTGATTTGAGCGAACTCGCTCCTCTTCCAGTTGAATTGGCTCCCCAGTTGGTGCCTTTGTTATGCACAATTGGAGCTCTTCCTCGGGTTCGAGGCTCACCGGAGCAGTTATGCCGTCGTCGATTACCTCTGTGGATTTGGGGAGCTCTTCCCCGGGATTAGGGACATCAGGGTGTCACCATGAAAAGGGCCCGGCTCATCCTTCGGATTGAGCCACATGGGCCTTCTCGGCCATCATTTCTGTCTCGCTGATGCTGCCACCGAGTGTGGTGATGCTGTTGGCGAGCCCCGTGATCCGCATTGAAAAATCATCGACGGATCCGCCGTCCTTAAAGCGAATCTCCGAGAGCTCCTTCCACAGCTGCTCGATGTTGGATTCCCGCACTCGTTGTACCATCGGGCGCAGTGCACTTGGTGGAGAGAGATGCATCTCTGGGGGCACAGATCGACAGCGTGCCATAACCCGTGCGCCTGTAGGTTGACACGCATCAACAAGGTCCAACTCAATATCGGATACTGAACAACGACAGTCGTTATCTCCCTGATCACTCACTCAACGACAACTCGACACCCATCATCACAAGGTCGGCCTTCTCGTGGTGGTGATGTGGATGCCAGCACTTTGCATTTTGTTCGAGTTACCTGAGACCTGTGTGTTATTAGGTCAACACTCTAACCACTTGAGCTATTCAACTTGATTGTTGATAAGTTGGCTCCTCTGCAATGATATACACCGATGGCTTCAACCTGACTCTGATACCACTTGTTGTTTTTCGATAAGACCGGTGTGTTATTAGGTCGACACTCTAACCACTTGAGCTATTCAACTTGATTGTTGATAGGTTGGCTCCTCCGCAATGATATACACCGATGGCTTCAACCTGGCTCTGATACCACTTGTTGTTTTTCGATAAGACCTGTGTTATTAGGTCGACACTCTAACCACTTGAACTATTCAACTTGATTGTTGATAGGTTGGCTCCTCCGCAATGATATACACCAATGGCTTAAACCTGGCTCTGATACCACTTGTTGTTTTTCGATAATCGACCAGGGAAATCTTTCTTTTCGTGAGAGGAAATCATCGACAGGAGGTTGTTGGAAGAAAAATAAAATGCGTAAAATAAGAAAGAGAGAATGGCTGAAATTTTGACTGCGCTTCAAAATCAGGGTATTAAAGCTTCTTAACAGATTTTCTATTTAAATATAACCTATCATAAAAACAGGCCAACTAAATCTGTGACAATTCCTACAAAATTGCTTCTAAAACTTTGTTAATTCTTTGTCAAATGGTCAAACGGAGAACTAATAAAATCGTGTGAATCTAAATAAGGAAAAACTAAGTTTTATCTAACATTTGATATCCACTATTGTTGTGAGATGTCACATTGGCTTTATCTAACATTTGATATCCACTATTGTTGTGAGATGTCACATTGGTGGAGAAGGGAACAGCATGCTTTATAAG
GGTTTGGAAACCTCTCCCCATCAGACGTGTTTTAAAACCTTGAAAGGAAGCCCGAAAGAGAAAGTCCAAAGAGAACAATATCTGCTAGTGGTGGGTTTGGGCCCAGTCCCTAATGCTGAACGTGCTGCTCTGTGCTCGTTCAATATTGAAAACATGCATCCAACGGACATTGCAACTTTTCTTGATCAACAGGTACATCAAGGGAGATTGTATTACTAGAGTGCATCAGTTACTTTCTATGGGTCCTTAAAAATATTTTAATGCAGCATGGTATAGCCATCAGATCAGGACATCACTGTGCGCAGCCCCTCCATCGGGCCCTCGGTGTGAGCTCGAGCGCACGAGCTAGCCTTTACTTCTACAACACAAAAGAGGATGTGGACTACTTTATCCAGGCTCTAAATGACACTGTAAGTTTCTTCGACTCCTTCAAGTAGGGATCTGGTAATACCAAAGCTTGTTGGTTTGGTGGGTCCACTGCATGGATTATGCTCTGTTTCTTGGTTTGGTGAAGATTGTCTTTCTTCTTGCTATCTTGAGCTGAGCCTGTGACATTTACTACCAGATGTTTGCCCATATAATGTTCTTGTAAAGACCACAGATTTCTCCTAGTTTTAGACTTTAGTTTATCTATAAATTATCGAATTAAGCTATGCTTTCACTCATTTTTCTCAAGATAACAAATTATTATATACAGCAATAAACCATCGTGTCATGAGCCCAAAGAATCTGAGACTTCTAATATCTAAACGTATACATAAATTCTAATTCTAAAGCTTAAGCTTATATATGAGATGAGATTCTACCTCAGTTGTTGGAGAGGAGACTGAAACATTCTTATAAATCAAGATATAAAGAAAATTTGTAAAAAGATAAAGAAAGTGGAAAGAAAATCCTTATTAGGTTATTTTCTTCTTTTTAGAAGCTTGTGAATTATTTTTACGAACCTTGTGCTTTAGGTCTAAGTTTTAATCTTAGTTTTATAGTGATAGTTTTATGTTTGGGTTCAAAATGTAACGAAAAACATACCAATTAAGTAAAATGATCCATATATTATTCTCAAACAAAACTGTCTATCATACTACAATACGATGTCGTATAATTTGACGACAATAATTGAGAATAAGGTCATGGGAAACGACATTGTAAAAGGGGAAAAAAGAAAGAATGTGAATGAGGTAATTAATGATCAACCCGATCGATTGCGGTTGCGGAGAGGTATTTGGTGTATGTGAGAATACTGCCACTCGAATACTGCCACTCGATCGAATACTGCCACTCTCCCTGCAACTATGAAAGCCTATAGTTTGGGAGGAGGAAGCAGCCACTTTTGAGCCTGCACGTAATCAAGGTTGATGAAGGGTTTTGCTTGTGCTTCACTCAGCTGTCTGCTGAATTTTATACGCTTGCTCGTCTCCGCTCCCGGCCCCGAACACTTGTATTCTCCAAAGAAAACCGTCTTGTCAAAAGCGGGGTTGTTCATGTCGTCCCAGCCTTCAGGATGAAGGATGTCGGCAATGGTGGTGTAGGCAAACACAACTCTGGACCAGGGCCTCCAAGATCTTCCCAAATAAGTGTTCGGCCCGCCAGTGCCTGTAATGCTGCAATGCGCAAACACATATCCACTGGGGTCGTCTTCCTTTTCTCTTGAATGCGCTGCAATCACTCCCAATCCCCCGACTCCCACCACATTCAGCTGACTGTTCAAATACAGGGCCGTTGCCTTCCCGAAGATGAAGTCCACGGTGCCTTCAATGAAGCAATCCTTGTACACATGCAGCCCGTCGTCGTCGCACAGGGTGTCTTGGAACCCTATGAATTTGCAATTGTAAACTGCGGCTTTATTCCCTCTGATTCTCGCTGCCAATGCCGGACCTCCTTTTCCGGCATGCGGTCTTGGAGAAGTGTTCTGTTTTTATTCATTCAAAAATTGATCAGAAGGAGAAGTGTTCTGTTTTTTATTCAAAAATTGATCAGAAGAAGGAGAAGTGTTCGCATTT
AATTCACAAATTACCTCGAGCACCAGATTGGCAGCAACAAAATAATCAGCCTCCACGGTCAAAGTTCCACTGTAAACGGTACCGTATTTTTTGGCATCTCCATCAAATGTCAATTTCGGCATATTCTTAGGGGAACCGTAAAGGGTAATGAAGGGCTGATTTCTCTCGATCGTAAGCTTCTCCTTATAAACTCCTTCCCCAATCCAAATCACGACGCGTTTGGTGTTACCGGCTGGAACGCTGGCGATGGCTTCGGTGACAGTCTTGAAGTCTCCGCTTCCGTCGGCCCTCACTTTCACGACGGTGGAGCCTTCTTCAGCGGCCACAAGGGCGGGATCAAGCTCGGCCTTGCGGTCGGCTAAGGGCTTCACATGCCCGGAAAACCAAGCTTCTAACTGCGACTTCTCTGCAGGAACCTCTGTTGTAGCAGAAGAGATGGGTGCAAAAGCTGCGAGGAAGATGAGCAATGTTGTGAGAGGAATGTTCATATTTCTGTTGTTTTCTTGAAGGGATTTATTTTATTTTTAAAATAAGGATGGGTGGATGAGTACGTTGAATTGCATGGAGAAGGTAGGGATTATATTGGGTGATGGGGCCCACTGTGAACCATAAGAACCTCCCTAACATTTCTGGATTAGTTTTAGCTTTCCGTCTCTAGTGGACGACTATTTCCTCCAAACAAAAAATATATATATATATATTTTCTAATATACCCTTCATTTCCCCTCTAATTACTCTAAACTCCCCTCCCTTTATCAAGGTCCAGATATTAAGTGGAATGGGACGCTTGATGGGTTGCAGACGTACAACAGGATGGTTCGACTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGCTGATGCTGTATGGGAATTGTGGGTTCTTAAATGATGCTACGAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGGTGAGTGTTATCAGTCTTTTGCCGATTTCTGCTGGCCTTGAAGACGAGGAGATGACAAGACGAATTCATTGTTACATTGTGAAAGTTGGTTTGGATTCTTTGGTAACCTCTTGCAATGCACTTGTCGATGCGTATGGGAAATGTGGGAGTGTGAAAGCTTCATGGCAAGTTTTTGATGAGATAATTGAGAAGAATGAAGTCTCATGGAATTCAATCATCAATGGTCTAACTTTTAAGGGTCATTTCTGGGATGCCTTGGATGTTTTTAGGACGATGATCGATGCAGGAACTAAACCGAACTCGGTCACCATTTCGAGCATTCTTCCTGTGTTTGTTGAGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCACTGCCAATTCCCTGATCGATATGTATGCCAAGTCTGGTCATTCAACTGAGGCATCTAGCATATTCCACAGCATGGATAGAAGGAATATAGTTTCTTGGAACACTATGATCGCTAATTATGCGCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGCCCTAATGCAGTGACCTTTACCAATGTTCTTCCTGCTTGTGCACGTTTGGGTCACCTTGGTCCTGGCAAAGAAATACATGCCATGGGCGTTCGTTTAGGACTAACATCTGATTTGTTTGTAACCAATGCTCTGACCGACATGTATGCGAAATGTGGTTGCTTTCGTTCTGCTCGAAATGTGTTCAACACGTCCCATAAAGATGAAGTTTCTTATAACATATTAATTACAGGATATTCTGAAACAAACGATTGCTT
GGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCTTGTGCAAACCTAGCCGCAGTCAAGCAAGGTAAAGAGATTCATGGTGTTGCATTAAGAAATCATCTTAACTCTCATCTATTTGTCTCAAACTCCCTATTGGACTTTTATACAAAATGTGGAAGAATTGATCTTGCTTGTAAGATCTTCAATCAAATTCTATTCAAAGATGTAGCATCTTGGAATACTATGATTTTAGGATATGGAATGATAGGAGAGTTGGAATCTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGAGATGTTGGATCAACATCTTGAACCCACCGAAATGCACTATACATGTCTGGTTGATCTACTCGGCCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGTAAGGCAGCAGAGCATTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAACATGTATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGGCCAGCTGCACGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATTCGTTTGATCTGATATGTTATTACAATGTCAAATTCATTATAACAAAGAGTTTAGTCTGCGTTTAATCGGTTTGGATACAGCAGGTTTGTAGTGTCCATGAAATAGAATTTTATTTATTTCATCTCATTACAATTCAATATGAAAGGATGAGGTTACAGACATGTTATTGTTAAGGGGAAAAGAAAAGAAAAAAGAAAGAATGTGAATGAGGTAATTAATGATCAACCCAATCGACTGCCACTCTTCCTGCAACTGTGAAAGCCTATAGTTTGGGAGGAGGAAGCAACCACTTTTGGGCCTGCACGTAATCAAGGTTGATGAAGGGTTTCGCTTGTGCTTCACTCAGCTGTTTGCTGAATTTTACACGCTTGCTCGTCTCCGCTCCCGGCCCCGAACACTTAAATTCTCCGAAGAAAACCGTCTTGTCAAAAGCGGGGTTGTTCATGTCGTCCCAGCCTTCAGGATGAAGGATGTCGGCAATGGTGGTGTAGGCAAACACAACTCTGGACCAAGGCCTCCAGGATCTTGCCAAATAAGTGTTCGGCCCGCCAGTGCCTGTAACGCTGCAATGCGCAAACACATACCCACTGGGGTCGTCTTCCTTTTCTCTTGAATGCGCTGCAATCACTCCCAATCCCCCTTCTCCCAGCACATTCAGCTGACTGTTCAAATACAGGGACGTTGCCTTCCCGAAGATGAAGTCCACAGTGCCTTCAATGATGCAATCCTTGTACACATGCATCCCGTCGTCGTCGCACAGGGTGTCTTGGAACCCTAG

mRNA sequence

ATGTTACACATTGGATTTCACTTCGCTCAAATCGCTCGATTTCAATTCAGAAACTACGTTCGAAGTACAGAGAAAAATTCTGCCGTTCACATCAACCTCCTCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCCTTATTCTTAATTACGCCAAGTTTCAACACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCTCACTCCATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGATTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGAATTGTGGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCACTGCCAATTCCCTGATCGATATGTATGCCAAGTCTGGTCATTCAACTGAGGCATCTAGCATATTCCACAGCATGGATAGAAGGAATATAGTTTCTTGGAACACTATGATCGCTAATTATGCGCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGCCCTAATGCAGTGACCTTTACCAATGTTCTTCCTGCTTGTGCACGATATTCTGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCTTGTGCAAACCTAGCCGCAGTCAAGCAAGGTAAAGAGATTCATGGTGTTGCATTAAGAAATCATCTTAACTCTCATCTATTTGTCTCAAACTCCCTATTGGACTTTTATACAAAATGTGGAAGAATTGATCTTGCTTGTAAGATCTTCAATCAAATTCTATTCAAAGATGTAGCATCTTGGAATACTATGATTTTAGGATATGGAATGATAGGAGAGTTGGAATCTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGAGATGTTGGATCAACATCTTGAACCCACCGAAATGCACTATACATGTCTGGTTGATCTACTCGGCCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGTAAGGCAGCAGAGCATTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAACATGTATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGGCCAGCTGCACGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATTCTGGAATGGGACGCTTGATGGGTTGCAGACGTACAACAGGATGGTTCGACTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGCTGATGCT
GTATGGGAATTGTGGGTTCTTAAATGATGCTACGAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCACTGCCAATTCCCTGATCGATATGTATGCCAAGTCTGGTCATTCAACTGAGGCATCTAGCATATTCCACAGCATGGATAGAAGGAATATAGTTTCTTGGAACACTATGATCGCTAATTATGCGCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGCCCTAATGCAGTGACCTTTACCAATGTTCTTCCTGCTTGTGCACGATATTCTGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCTTGTGCAAACCTAGCCGCAGTCAAGCAAGGTAAAGAGATTCATGGTGTTGCATTAAGAAATCATCTTAACTCTCATCTATTTGTCTCAAACTCCCTATTGGACTTTTATACAAAATGTGGAAGAATTGATCTTGCTTGTAAGATCTTCAATCAAATTCTATTCAAAGATGTAGCATCTTGGAATACTATGATTTTAGGATATGGAATGATAGGAGAGTTGGAATCTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGAGATGTTGGATCAACATCTTGAACCCACCGAAATGCACTATACATGTCTGGTTGATCTACTCGGCCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGTAAGGCAGCAGAGCATTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAACATGTATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGGCCAGCTGCACGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATTCTTTGGGAGGAGGAAGCAACCACTTTTGGGCCTGCACGTAATCAAGGTTGATGAAGGGTTTCGCTTGTGCTTCACTCAGCTGTTTGCTGAATTTTACACGCTTGCTCGTCTCCGCTCCCGGCCCCGAACACTTAAATTCTCCGAAGAAAACCGTCTTGTCAAAAGCGGGGTTGTTCATGTCGTCCCAGCCTTCAGGATGAAGGATGTCGGCAATGGTGGTGTAGGCAAACACAACTCTGGACCAAGGCCTCCAGGATCTTGCCAAATAAGTGTTCGGCCCGCCAGTGCCTGTAACGCTGCAATGCGCAAACACATACCCACTGGGGTCGGACGTTGCCTTCCCGAAGATGAAGTCCACAGTGCCTTCAATGATGCAATCCTTGTACACATGCATCCCGTCGTCGTCGCACAGGGTGTCTTGGAACCCTAG

Coding sequence (CDS)

ATGTTACACATTGGATTTCACTTCGCTCAAATCGCTCGATTTCAATTCAGAAACTACGTTCGAAGTACAGAGAAAAATTCTGCCGTTCACATCAACCTCCTCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCCTTATTCTTAATTACGCCAAGTTTCAACACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCTCACTCCATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGATTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGAATTGTGGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCACTGCCAATTCCCTGATCGATATGTATGCCAAGTCTGGTCATTCAACTGAGGCATCTAGCATATTCCACAGCATGGATAGAAGGAATATAGTTTCTTGGAACACTATGATCGCTAATTATGCGCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGCCCTAATGCAGTGACCTTTACCAATGTTCTTCCTGCTTGTGCACGATATTCTGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCTTGTGCAAACCTAGCCGCAGTCAAGCAAGGTAAAGAGATTCATGGTGTTGCATTAAGAAATCATCTTAACTCTCATCTATTTGTCTCAAACTCCCTATTGGACTTTTATACAAAATGTGGAAGAATTGATCTTGCTTGTAAGATCTTCAATCAAATTCTATTCAAAGATGTAGCATCTTGGAATACTATGATTTTAGGATATGGAATGATAGGAGAGTTGGAATCTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGAGATGTTGGATCAACATCTTGAACCCACCGAAATGCACTATACATGTCTGGTTGATCTACTCGGCCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGTAAGGCAGCAGAGCATTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAACATGTATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGGCCAGCTGCACGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATTCTGGAATGGGACGCTTGATGGGTTGCAGACGTACAACAGGATGGTTCGACTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGCTGATGCT
GTATGGGAATTGTGGGTTCTTAAATGATGCTACGAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCACTGCCAATTCCCTGATCGATATGTATGCCAAGTCTGGTCATTCAACTGAGGCATCTAGCATATTCCACAGCATGGATAGAAGGAATATAGTTTCTTGGAACACTATGATCGCTAATTATGCGCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGCCCTAATGCAGTGACCTTTACCAATGTTCTTCCTGCTTGTGCACGATATTCTGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCTTGTGCAAACCTAGCCGCAGTCAAGCAAGGTAAAGAGATTCATGGTGTTGCATTAAGAAATCATCTTAACTCTCATCTATTTGTCTCAAACTCCCTATTGGACTTTTATACAAAATGTGGAAGAATTGATCTTGCTTGTAAGATCTTCAATCAAATTCTATTCAAAGATGTAGCATCTTGGAATACTATGATTTTAGGATATGGAATGATAGGAGAGTTGGAATCTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGAGATGTTGGATCAACATCTTGAACCCACCGAAATGCACTATACATGTCTGGTTGATCTACTCGGCCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGTAAGGCAGCAGAGCATTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAACATGTATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGGCCAGCTGCACGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATTCTTTGGGAGGAGGAAGCAACCACTTTTGGGCCTGCACGTAATCAAGGTTGATGAAGGGTTTCGCTTGTGCTTCACTCAGCTGTTTGCTGAATTTTACACGCTTGCTCGTCTCCGCTCCCGGCCCCGAACACTTAAATTCTCCGAAGAAAACCGTCTTGTCAAAAGCGGGGTTGTTCATGTCGTCCCAGCCTTCAGGATGAAGGATGTCGGCAATGGTGGTGTAGGCAAACACAACTCTGGACCAAGGCCTCCAGGATCTTGCCAAATAAGTGTTCGGCCCGCCAGTGCCTGTAACGCTGCAATGCGCAAACACATACCCACTGGGGTCGGACGTTGCCTTCCCGAAGATGAAGTCCACAGTGCCTTCAATGATGCAATCCTTGTACACATGCATCCCGTCGTCGTCGCACAGGGTGTCTTGGAACCCTAG

Protein sequence

MLHIGFHFAQIARFQFRNYVRSTEKNSAVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFEFWNGTLDGLQTYNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLLECFKAGKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFEFFGRRKQPLLGLHVIKVDEGFRLCFTQLFAEFYTLARLRSRPRTLKFSEENRLVKSGVVHVVPAFRMKDVGNGGVGKHNSGPRPPGSCQISVRPASACNAAMRKHIPTGVGRCLPEDEVHSAFNDAILVHMHPVVVAQGVLEP
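
The protein sequence begins with the residues encoded by the first codons of the CDS (ATG TTA CAC ... gives M L H ...). A minimal sketch of that translation step follows, assuming the standard nuclear genetic code; the record itself does not say which translation table was used, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the CDS listed in this record.
print(translate("ATGTTACACATTGGATTTCAC"))
```

Applied to the full CDS, the same function can be used to check the result against the protein sequence listed here.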
BLAST of Cp4.1LG16g08460 vs. Swiss-Prot
Match: PP151_ARATH (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 2.1e-105
Identity = 204/539 (37.85%), Positives = 311/539 (57.70%), Query Frame = 1
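
The percentages on the statistics line are the ratio of matching columns to alignment length. A minimal sketch that recomputes them from such a line follows. The function name `hsp_percentages` is hypothetical, and the pattern also accepts the misspelling "Postives" that appears in some exports of this report.

```python
import re

# Matches "Identity = a/b ... Positives = c/d" (also the "Postives" spelling).
_HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (percent identity, percent positives) for an HSP statistics line."""
    match = _HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 204/539 (37.85%), Positives = 311/539 (57.70%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit, 204 identical columns out of 539 is about 37.85% and 311 positive columns out of 539 is about 57.70%, which matches the values printed in the report.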

Query: 601  WNGTLDGLQTYNR----------MVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVF 660
            WN  + G   ++R          M + G  L++++F  VL  CS   D+ KG++VH ++ 
Sbjct: 120  WNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIA 179

Query: 661  KLGFDSDVYVGNTLLMLYGNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARN 720
            K  F SDVY+G+ L+ +Y  CG +NDA +VFDEM +R+VVSWN++I     NG   EA +
Sbjct: 180  KSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALD 239

Query: 721  YYFWMTLRSGIQPN------------LLECFKAGKEIHGFSMRMGT-ETDLFTANSLIDM 780
              F M L S ++P+             L   K G+E+HG  ++      D+  +N+ +DM
Sbjct: 240  V-FQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDM 299

Query: 781  YAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTF 840
            YAK     EA  IF SM  RN+++  +MI+ YA+   + +A R  ++  +  ER N V++
Sbjct: 300  YAKCSRIKEARFIFDSMPIRNVIAETSMISGYAM-AASTKAAR--LMFTKMAER-NVVSW 359

Query: 841  TNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIH-- 900
              ++   A Y++  +  E+L+LF  ++     P   SF  ++ ACA+LA +  G + H  
Sbjct: 360  NALI---AGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVH 419

Query: 901  ----GVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMI 960
                G   ++     +FV NSL+D Y KCG ++    +F +++ +D  SWN MI+G+   
Sbjct: 420  VLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQN 479

Query: 961  GELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEML-DQHLEPTEMH 1020
            G    A+ +F  M +   + D ++ I VLSAC H G VE G  YFS M  D  + P   H
Sbjct: 480  GYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDH 539

Query: 1021 YTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELK 1080
            YTC+VDLLGRAGF+EEA  +I  +P+ PDS IWG+LL AC+++ N+ LG   AE L E++
Sbjct: 540  YTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVE 599

Query: 1081 PQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRA 1110
            P + G Y+LLSNMYAE G+W++V  +R+ M+  G  K PGCSW++I G  H F+V D++
Sbjct: 600  PSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKS 650

BLAST of Cp4.1LG16g08460 vs. Swiss-Prot
Match: PPR53_ARATH (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 361.3 bits (926), Expect = 4.3e-98
Identity = 184/547 (33.64%), Positives = 306/547 (55.94%), Query Frame = 1

Query: 611  YNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGN 670
            ++RM   G+  D H  P + K+C++      G ++H V    G D D +V  ++  +Y  
Sbjct: 104  FSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSMFHMYMR 163

Query: 671  CGFLNDATKVFDEMSERDVV-----------------------------------SWNTV 730
            CG + DA KVFD MS++DVV                                   SWN +
Sbjct: 164  CGRMGDARKVFDRMSDKDVVTCSALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGI 223

Query: 731  IGLLSVNGDYREARNYYFWMTLRSGIQPNLL------------ECFKAGKEIHGFSMRMG 790
            +   + +G ++EA    F      G  P+ +            E    G+ IHG+ ++ G
Sbjct: 224  LSGFNRSGYHKEAV-VMFQKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQG 283

Query: 791  TETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVI 850
               D    +++IDMY KSGH     S+F+  +       N  I   + NG+  +A+    
Sbjct: 284  LLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFE 343

Query: 851  LLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACA 910
            L +E     N V++T+++  CA+  +    +E+L LF EM++ G KP+ V+   ++ AC 
Sbjct: 344  LFKEQTMELNVVSWTSIIAGCAQNGKD---IEALELFREMQVAGVKPNHVTIPSMLPACG 403

Query: 911  NLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNT 970
            N+AA+  G+  HG A+R HL  ++ V ++L+D Y KCGRI+L+  +FN +  K++  WN+
Sbjct: 404  NIAALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNS 463

Query: 971  MILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQH 1030
            ++ G+ M G+ +  +++FE++   +++ D +S+ ++LSAC   GL + GW+YF  M +++
Sbjct: 464  LMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEY 523

Query: 1031 -LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKA 1090
             ++P   HY+C+V+LLGRAG ++EA +LI+ +P  PDS +WGALL +CR+  NV+L   A
Sbjct: 524  GIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIA 583

Query: 1091 AEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHA 1110
            AE LF L+P++ G Y+LLSN+YA  G W EV+ IR  M+S G KK+PGCSW+Q+  +++ 
Sbjct: 584  AEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYT 643

BLAST of Cp4.1LG16g08460 vs. Swiss-Prot
Match: PP207_ARATH (Pentatricopeptide repeat-containing protein At3g02330 OS=Arabidopsis thaliana GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 354.4 bits (908), Expect = 5.2e-96
Identity = 237/851 (27.85%), Positives = 402/851 (47.24%), Query Frame = 1

Query: 322  SLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHG---------------VALR 381
            S + F++           +F  V   CA   A++ GK+ H                  L+
Sbjct: 32   SFSYFTDFLNQVNSVSTTNFSFVFKECAKQGALELGKQAHAHMIISGFRPTTFVLNCLLQ 91

Query: 382  NHLNSHLFVS----------------NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTM 441
             + NS  FVS                N +++ Y+K   +  A   FN +  +DV SWN+M
Sbjct: 92   VYTNSRDFVSASMVFDKMPLRDVVSWNKMINGYSKSNDMFKANSFFNMMPVRDVVSWNSM 151

Query: 442  ILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHL 501
            + GY   GE   +I +F  M  + +++D  ++  +L  CS       G Q    ++    
Sbjct: 152  LSGYLQNGESLKSIEVFVDMGREGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGC 211

Query: 502  EPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE 561
            +   +  + L+D+  +     E+  + + +P   +S  W A++  C     + L  K  +
Sbjct: 212  DTDVVAASALLDMYAKGKRFVESLRVFQGIP-EKNSVSWSAIIAGCVQNNLLSLALKFFK 271

Query: 562  HLFELKP--QHCGYYILLSNMYA----ETGRWDEVNRIRELMKSRGAKKSPGCS-WVQIH 621
             + ++        Y  +L +  A      G     + ++    + G  ++     + +  
Sbjct: 272  EMQKVNAGVSQSIYASVLRSCAALSELRLGGQLHAHALKSDFAADGIVRTATLDMYAKCD 331

Query: 622  GQLHAFVVDDRAEGF--EFWNGTLDG----------LQTYNRMVRLGVQLDDHTFPFVLK 681
                A ++ D +E    + +N  + G          L  ++R++  G+  D+ +   V +
Sbjct: 332  NMQDAQILFDNSENLNRQSYNAMITGYSQEEHGFKALLLFHRLMSSGLGFDEISLSGVFR 391

Query: 682  ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDATKVFDEMSERDVVS 741
             C+    + +G++++G+  K     DV V N  + +YG C  L +A +VFDEM  RD VS
Sbjct: 392  ACALVKGLSEGLQIYGLAIKSSLSLDVCVANAAIDMYGKCQALAEAFRVFDEMRRRDAVS 451

Query: 742  WNTVIGLLSVNGDYREARNYYFWMTLRSGIQPN-------LLECFKA----GKEIHGFSM 801
            WN +I     NG   E   + F   LRS I+P+       L  C       G EIH   +
Sbjct: 452  WNAIIAAHEQNGKGYETL-FLFVSMLRSRIEPDEFTFGSILKACTGGSLGYGMEIHSSIV 511

Query: 802  RMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIR 861
            + G  ++     SLIDMY+K G   EA  I     +R  VS  TM     ++   L+ + 
Sbjct: 512  KSGMASNSSVGCSLIDMYSKCGMIEEAEKIHSRFFQRANVS-GTMEELEKMHNKRLQEM- 571

Query: 862  FVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVIS 921
                          V++ +++       ++ D   +  LF+ M  +G  PD  ++  V+ 
Sbjct: 572  -------------CVSWNSIISGYVMKEQSED---AQMLFTRMMEMGITPDKFTYATVLD 631

Query: 922  ACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVAS 981
             CANLA+   GK+IH   ++  L S +++ ++L+D Y+KCG +  +  +F + L +D  +
Sbjct: 632  TCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFEKSLRRDFVT 691

Query: 982  WNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEM- 1041
            WN MI GY   G+ E AI +FE M  + ++ + V++I++L AC+H GL+++G +YF  M 
Sbjct: 692  WNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLIDKGLEYFYMMK 751

Query: 1042 LDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIY-GNVEL 1101
             D  L+P   HY+ +VD+LG++G V+ A ELIR +P   D  IW  LLG C I+  NVE+
Sbjct: 752  RDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGVCTIHRNNVEV 811

Query: 1102 GCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHG 1110
              +A   L  L PQ    Y LLSN+YA+ G W++V+ +R  M+    KK PGCSWV++  
Sbjct: 812  AEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKEPGCSWVELKD 862

BLAST of Cp4.1LG16g08460 vs. Swiss-Prot
Match: PP261_ARATH (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 9.8e-95
Identity = 202/557 (36.27%), Positives = 310/557 (55.66%), Query Frame = 1

Query: 42  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNT 101
           +L Q K++HA  +   L    + +   LI   +  +       +F+Q  +      L N+
Sbjct: 31  NLNQVKQLHAQIIRRNL-HEDLHIAPKLISALSLCRQTNLAVRVFNQVQEP--NVHLCNS 90

Query: 102 LIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKL 161
           LIRAH+   +        ++ M RFG+  D+ T+PF+LK CS    +     +H  + KL
Sbjct: 91  LIRAHA-QNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNHIEKL 150

Query: 162 GFDSDVYVGNTLLMLYGNCGFLN--DAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARK 221
           G  SD+YV N L+  Y  CG L   DA K+F++MSERD VSWN+++G L   G+ R+AR+
Sbjct: 151 GLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELRDARR 210

Query: 222 EIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNG 281
                  R     DL + N+++D YA+    ++A  +F  M  RN VSW+TM+  Y+  G
Sbjct: 211 LFDEMPQR-----DLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAG 270

Query: 282 VALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVV 341
             +E  R V+  +      N VT+T ++   A Y+E     E+  L  +M   G K D  
Sbjct: 271 -DMEMAR-VMFDKMPLPAKNVVTWTIII---AGYAEKGLLKEADRLVDQMVASGLKFDAA 330

Query: 342 SFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQI 401
           + + +++AC     +  G  IH +  R++L S+ +V N+LLD Y KCG +  A  +FN I
Sbjct: 331 AVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCGNLKKAFDVFNDI 390

Query: 402 LFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGW 461
             KD+ SWNTM+ G G+ G  + AI +F  MR + ++ D V++IAVL +C+H GL++ G 
Sbjct: 391 PKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGI 450

Query: 462 QYFSEMLDQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRI 521
            YF  M   + L P   HY CLVDLLGR G ++EA ++++ +P+ P+  IWGALLGACR+
Sbjct: 451 DYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEPNVVIWGALLGACRM 510

Query: 522 YGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCS 581
           +  V++  +  ++L +L P   G Y LLSN+YA    W+ V  IR  MKS G +K  G S
Sbjct: 511 HNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRSKMKSMGVEKPSGAS 570

Query: 582 WVQIHGQLHAFVVDDRA 596
            V++   +H F V D++
Sbjct: 571 SVELEDGIHEFTVFDKS 573

BLAST of Cp4.1LG16g08460 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 8.3e-94
Identity = 213/678 (31.42%), Positives = 359/678 (52.95%), Query Frame = 1

Query: 26  NSAVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSV-----SLCASLILNYAKFQHPE 85
           +S V ++  T    ++S    + VH    L+G +  S      S+  SL+  Y K Q  +
Sbjct: 188 SSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVD 247

Query: 86  SFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVL 145
           S   +F +  +  R    WN++I  +    NG  + GL  + +M+  G+++D  T   V 
Sbjct: 248 SARKVFDEMTE--RDVISWNSIINGY--VSNGLAEKGLSVFVQMLVSGIEIDLATIVSVF 307

Query: 146 KICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVV 205
             C+DS  I  G  VH +  K  F  +    NTLL +Y  CG L+ AK VF EMS+R VV
Sbjct: 308 AGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVV 367

Query: 206 SWNTVIGLLSVNGDYREARKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHS 265
           S+ ++I   +  G   EA K         G   D++T  ++++  A+     E   + H 
Sbjct: 368 SYTSMIAGYAREGLAGEAVKLFEEMEEE-GISPDVYTVTAVLNCCARYRLLDEGKRV-HE 427

Query: 266 MDRRNIVSWNTMIANYALNGVA-LEAIRFVILLQESGERPNAVTFTNVLPACARYSETND 325
             + N + ++  ++N  ++  A   +++   L+       + +++  ++   ++    N+
Sbjct: 428 WIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANE 487

Query: 326 CLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNS 385
            L   NL  E +     PD  +   V+ ACA+L+A  +G+EIHG  +RN   S   V+NS
Sbjct: 488 ALSLFNLLLEEKRFS--PDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANS 547

Query: 386 LLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYD 445
           L+D Y KCG + LA  +F+ I  KD+ SW  MI GYGM G  + AI +F  MR   ++ D
Sbjct: 548 LVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEAD 607

Query: 446 LVSYIAVLSACSHGGLVERGWQYFSEMLDQ-HLEPTEMHYTCLVDLLGRAGFVEEAAELI 505
            +S++++L ACSH GLV+ GW++F+ M  +  +EPT  HY C+VD+L R G + +A   I
Sbjct: 608 EISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFI 667

Query: 506 RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWD 565
             +PI PD+ IWGALL  CRI+ +V+L  K AE +FEL+P++ GYY+L++N+YAE  +W+
Sbjct: 668 ENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWE 727

Query: 566 EVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFEFWNGTLDGLQTYNRMVRL 625
           +V R+R+ +  RG +K+PGCSW++I G+++ FV  D +      N   + ++ + R VR 
Sbjct: 728 QVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSS------NPETENIEAFLRKVRA 787

Query: 626 GVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVG--NTLLMLYGN---CG 685
            + +++   P       D+ ++ K   + G   KL     +       ++ +  N   CG
Sbjct: 788 RM-IEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCG 847

Query: 686 FLNDATKVFDEMSERDVV 691
             ++  K   +++ R++V
Sbjct: 848 DCHEMAKFMSKLTRREIV 850

BLAST of Cp4.1LG16g08460 vs. TrEMBL
Match: A0A0A0KEH6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G428560 PE=4 SV=1)

HSP 1 Score: 631.3 bits (1627), Expect = 2.5e-177
Identity = 337/634 (53.15%), Positives = 421/634 (66.40%), Query Frame = 1

Query: 1   MLHIGFHFAQIARFQFRNYVRSTEKNSAVH-INLLTLCFNAQSLRQTKEVHAICLLNGLL 60
           +L +   + +   + F   +RS  K + V  I+LL +    +    T+ +H   +  GL 
Sbjct: 234 LLSVNGDYTEARNYYFWMILRSVIKPNLVSVISLLPISAALEDEEMTRRIHCYSVKVGL- 293

Query: 61  PHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLET 120
              V+ C +L+  Y K    ++   +F++TV+    +  WN++I   +  G    D L  
Sbjct: 294 DSQVTTCNALVDAYGKCGSVKALWQVFNETVEKNEVS--WNSIINGLACKGR-CWDALNA 353

Query: 121 YNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGN 180
           +  M+  G Q +  T   +L +  +      G E+HG   ++G ++D+++ N+L+ +Y  
Sbjct: 354 FRMMIDAGAQPNSVTISSILPVLVELECFKAGKEIHGFSMRMGTETDIFIANSLIDMYAK 413

Query: 181 CGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREA---------------------- 240
            G   +A  +F  +  R++VSWN +I   ++N    EA                      
Sbjct: 414 SGHSTEASTIFHNLDRRNIVSWNAMIANYALNRLPLEAIRFVIQMQETGECPNAVTFTNV 473

Query: 241 ------------RKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNI 300
                        KEIH   +R+G  +DLF +NSLIDMYAK G         HS   RN+
Sbjct: 474 LPACARLGFLGPGKEIHAMGVRIGLTSDLFVSNSLIDMYAKCG-------CLHSA--RNV 533

Query: 301 VSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNL 360
             +NT                          R + V++ N+L     YSET+DCL+SLNL
Sbjct: 534 --FNT-------------------------SRKDEVSY-NIL--IIGYSETDDCLQSLNL 593

Query: 361 FSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTK 420
           FSEMRLLGKKPDVVSF+GVISACANLAA+KQGKE+HGVALRNHL SHLFVSNSLLDFYTK
Sbjct: 594 FSEMRLLGKKPDVVSFVGVISACANLAALKQGKEVHGVALRNHLYSHLFVSNSLLDFYTK 653

Query: 421 CGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAV 480
           CGRID+AC++FNQILFKDVASWNTMILGYGMIGELE+AI+MFEAMRDD VQYDLVSYIAV
Sbjct: 654 CGRIDIACRLFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRDDTVQYDLVSYIAV 713

Query: 481 LSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPD 540
           LSACSHGGLVERGWQYFSEML Q LEPTEMHYTC+VDLLGRAGFVEEAA+LI++LPIAPD
Sbjct: 714 LSACSHGGLVERGWQYFSEMLAQRLEPTEMHYTCMVDLLGRAGFVEEAAKLIQQLPIAPD 773

Query: 541 SNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIREL 600
           +NIWGALLGACRIYGNVELG +AAEHLFELKPQHCGYYILLSN+YAETGRWDE N+IREL
Sbjct: 774 ANIWGALLGACRIYGNVELGRRAAEHLFELKPQHCGYYILLSNIYAETGRWDEANKIREL 824

BLAST of Cp4.1LG16g08460 vs. TrEMBL
Match: A0A151T0A3_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_022898 PE=4 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 3.2e-169
Identity = 294/470 (62.55%), Positives = 363/470 (77.23%), Query Frame = 1

Query: 656  SDVYVGNTLLMLYGNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWM 715
            S V VGN L+ +YG CG    + KVFDE+ ER+VVSWN++I   S  G Y +A +  F +
Sbjct: 118  SHVKVGNALVDVYGKCGSEKASKKVFDEIDERNVVSWNSIITSFSFRGKYMDALDV-FRL 177

Query: 716  TLRSGIQPNL------------LECFKAGKEIHGFSMRMGTETDLFTANSLIDMYAKSGH 775
             + +G++PN             L  FK G EIHGFS+RM  E+D+F ANSLID+YAKSG 
Sbjct: 178  MIGTGMRPNSVTISSMLPVLGELGLFKWGMEIHGFSLRMAIESDIFIANSLIDIYAKSGS 237

Query: 776  STEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPA 835
            S  AS IF+ M+ R+IVSWN MIAN+A N +  EA+  V  +Q  GE PN VTFTNVLPA
Sbjct: 238  SRIASIIFNKMEGRSIVSWNAMIANFAQNRLEFEAVELVRQMQAKGETPNNVTFTNVLPA 297

Query: 836  CARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHL 895
            C  YS TNDC ES++LFSEM LLG  PD+VSFMGVISACANLA+++QGKE+HG+ +R   
Sbjct: 298  C--YSRTNDCSESISLFSEMILLGMLPDIVSFMGVISACANLASIRQGKEVHGLLMRKLF 357

Query: 896  NSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEA 955
            ++HLFV+NSLLD YT+CGRIDLA K+F++I  KDVASWNTMILGYGM+GELE+AIN+FEA
Sbjct: 358  HTHLFVANSLLDLYTRCGRIDLATKVFDRIQNKDVASWNTMILGYGMLGELETAINLFEA 417

Query: 956  MRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGF 1015
            M++D V+YD VS+IAVLSACSHGGL+E+G +YF  M D ++EP   HY C+VDLLGRAG 
Sbjct: 418  MKEDGVEYDSVSFIAVLSACSHGGLIEKGRKYFKMMHDLNIEPAHTHYACMVDLLGRAGL 477

Query: 1016 VEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNM 1075
            + EAA+LIR L I PD+NIWGALLGACRI+GN+ELG  AAEHLF+LKPQH GYYILLSNM
Sbjct: 478  MREAADLIRGLSIVPDTNIWGALLGACRIHGNIELGHWAAEHLFKLKPQHSGYYILLSNM 537

Query: 1076 YAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFE 1114
            YAE  RWDE N++RELMKSRGAKK+PGCSWVQ   Q+HAF+V ++ +  +
Sbjct: 538  YAEAERWDEANKVRELMKSRGAKKNPGCSWVQTGDQVHAFLVGEKIDSLD 584

BLAST of Cp4.1LG16g08460 vs. TrEMBL
Match: M5W843_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015109mg PE=4 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 3.3e-158
Identity = 287/514 (55.84%), Positives = 356/514 (69.26%), Query Frame = 1

Query: 611  YNRMVRLGVQLDDHTFPF--VLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLY 670
            Y R + LG+    +      VL +C++  D    +++H  V K G D  V  GN L+ +Y
Sbjct: 28   YYREMNLGIGFKPNLVSVISVLPVCAELEDERMAIQIHCYVVKAGLDLLVTTGNALVDVY 87

Query: 671  GNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNL--- 730
            G CG  N + +VF E+ +++ VSWN  I  LS  G   EA   + WM +  G++PN    
Sbjct: 88   GKCGNANASKQVFGEIIQKNEVSWNAAITSLSYMGHNIEALATFRWM-IDEGVKPNSVTI 147

Query: 731  ---------LECFKAGKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDR 790
                     L  F  G+ +HGFS+RMG E+D+F ANSLIDMYAKSG S EAS++F  MD+
Sbjct: 148  SSMIPVLVELAFFGVGRRLHGFSIRMGIESDVFIANSLIDMYAKSGRSNEASNVFQEMDK 207

Query: 791  RNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLES 850
            RNIVSWN MIAN+  N + LEAI  V  +Q                            ES
Sbjct: 208  RNIVSWNAMIANFGQNRLELEAIGLVRQMQGH--------------------------ES 267

Query: 851  LNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDF 910
            LNLFSEM+L+G   D+VSF+GVISACAN+ A+KQGKEIHG  +R   ++HLFV+NSLLDF
Sbjct: 268  LNLFSEMKLVGMIHDIVSFVGVISACANVTAIKQGKEIHGSLVRKLFHTHLFVANSLLDF 327

Query: 911  YTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSY 970
            YTKCGRIDLA K+F++I  KDVASWNTMILGYGM+GEL +AI++FEAMR+D V+YD VSY
Sbjct: 328  YTKCGRIDLAAKVFDRIPSKDVASWNTMILGYGMLGELNTAISLFEAMREDGVEYDSVSY 387

Query: 971  IAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPI 1030
            IAVLS+CSHGGLVE+G +YF  M   ++EPTE HY C+VDLLGRAG +EEA ELI+ +PI
Sbjct: 388  IAVLSSCSHGGLVEKGKKYFEGMQALNIEPTEKHYACMVDLLGRAGLMEEAVELIKGMPI 447

Query: 1031 APDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRI 1090
             PD+NIWGALLGACRI+GNVEL   AA+HLF L P+HCGYYILLSNMYAE GRWDEVNR+
Sbjct: 448  VPDANIWGALLGACRIHGNVELASWAADHLFRLNPEHCGYYILLSNMYAEAGRWDEVNRV 507

Query: 1091 RELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAE 1111
            RELMKSRG KK+  CSWVQ+  Q+HAF V +  E
Sbjct: 508  RELMKSRGVKKNRACSWVQVQDQVHAFAVGESLE 514

BLAST of Cp4.1LG16g08460 vs. TrEMBL
Match: A0A072UUM5_MEDTR (Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_3g045450 PE=4 SV=1)

HSP 1 Score: 542.0 bits (1395), Expect = 2.0e-150
Identity = 318/827 (38.45%), Positives = 450/827 (54.41%), Query Frame = 1

Query: 340  SFMGVISACANLAAVKQGKEIHGVALRNHLNSH-LFVSNSLLDFYTKCGRIDLACKIFNQ 399
            S   ++  C +   + Q  ++H  ++ N    H + +S SL+  Y      + +  +F  
Sbjct: 33   SSSNLLHLCTHSQTLSQTNQLHAFSILNAFLPHSVSISASLILKYASFRHPETSLILFQN 92

Query: 400  IL--FKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSAC------- 459
             L   K    WNT+I  Y + G  +    ++  M    V+ D  +Y  VL AC       
Sbjct: 93   TLPFSKTAFLWNTLIRAYSIAGFFDG-FGVYNTMVRSGVKPDDHTYPFVLKACSDYLKFD 152

Query: 460  ----SHGGLVERGWQ-----------------YFSEML---DQHLEPTEMHYTCLVDLLG 519
                 HG + + G+                  +F + +   D+  E  ++ +  ++ L  
Sbjct: 153  KGREVHGVVFKVGFDKDVFVGNTLLMFYGNCGFFVDAMNVFDEMFERDKVSWNTVIGLCS 212

Query: 520  RAGFVEEAAELIRRLPIA-----PDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQ-H 579
              GF EE+    + + +A     PD     ++L  C    NV +      ++F++    H
Sbjct: 213  DRGFHEESLCFFKEMVVAAPVVRPDLVTVVSVLPVCADSENVVMARIVHGYVFKVGLSGH 272

Query: 580  CGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFE 639
                  L ++Y + G  +   ++ + M  R        SW  +              GF 
Sbjct: 273  VKVGNALVDVYGKCGSEEACKKVFDEMDERNE-----VSWNAV------------ITGFS 332

Query: 640  FWNGTLDGLQTYNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVY 699
            F   ++D L  +  M+  G++ +  T   +L +  +      GMEVHG   ++G +SD++
Sbjct: 333  FRGLSMDALDAFRSMINTGMRPNPVTISSMLPVLGELGLFKLGMEVHGYSLRMGIESDIF 392

Query: 700  VGNTLLMLYGNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRS 759
            +GN+L+ +Y   G    A+ +F++M +R++VSWN+++   + N  +  A      M    
Sbjct: 393  IGNSLIDMYAKSGSSRVASTIFNKMGDRNIVSWNSMVANFAQNRHHFAAVELLRQMQAH- 452

Query: 760  GIQPN-------LLECFK-----AGKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEA 819
            G  PN       L  C +      GKEIH   ++ G  TDLF +N+L DMY+K GH    
Sbjct: 453  GENPNNVTFTNVLPACARLGFLNVGKEIHARIIQTGCATDLFLSNALTDMYSKCGH---- 512

Query: 820  SSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARY 879
                              +A    N    + + + IL+                     Y
Sbjct: 513  ----------------LSLARNVFNVSIKDKVSYNILI-------------------IGY 572

Query: 880  SETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHL 939
            S+T +  ESLNLFSEMRL G  PD+VSF+G+ISACA+L+++KQGKEIHG  +R   ++HL
Sbjct: 573  SQTTNSSESLNLFSEMRLSGMTPDIVSFIGIISACAHLSSIKQGKEIHGHLVRKLFHTHL 632

Query: 940  FVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDD 999
            F +NSLLD YTKCGRIDLA K+F++I  KDVASWNTMILGYGM GE E+AIN+FEAM++D
Sbjct: 633  FAANSLLDLYTKCGRIDLATKVFDRIQHKDVASWNTMILGYGMRGEFETAINLFEAMKED 692

Query: 1000 K-VQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEE 1059
              V+YD VSYIAVLSACSHGGL+E+G +YF +M D ++EPT  HY C+VDLLGRAG +EE
Sbjct: 693  GGVEYDSVSYIAVLSACSHGGLIEKGNKYFKQMQDYNIEPTHTHYACMVDLLGRAGQIEE 752

Query: 1060 AAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAE 1114
            AA LIR L   PD+NIWGALLGACRIYGNVELG  AAEHLF+LKP HCGYYILLSNMYAE
Sbjct: 753  AANLIRGLSFEPDANIWGALLGACRIYGNVELGHWAAEHLFKLKPDHCGYYILLSNMYAE 801

BLAST of Cp4.1LG16g08460 vs. TrEMBL
Match: A0A0D2QBB4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G076800 PE=4 SV=1)

HSP 1 Score: 526.2 bits (1354), Expect = 1.1e-145
Identity = 308/800 (38.50%), Positives = 433/800 (54.12%), Query Frame = 1

Query: 324  NLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFY 383
            ++++ M   G KPDV +F  ++ ACA++   K+G EIHG  ++    + + V N+LL FY
Sbjct: 101  HVYNTMLRTGIKPDVHTFPFLLKACADVFCFKKGVEIHGSVIKTGFGADVSVGNTLLLFY 160

Query: 384  TKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYI 443
              CG                                L     +F+ MR+     D+VS+ 
Sbjct: 161  GNCGG-------------------------------LRETRKVFDEMRER----DVVSWN 220

Query: 444  AVLSACSHGGLVERGWQYFSEM-LDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPI 503
             VL   S  G       +FS M     ++P  + +  L+ + GR G     A+       
Sbjct: 221  TVLGVFSVNGFYLEALNFFSLMNFSSGMKPNMVTFVTLLPVCGRIGDKRLVAQ------- 280

Query: 504  APDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRI 563
                 I G+++   ++  N E+    A                L + Y +    D+  R+
Sbjct: 281  -----IHGSVV---KVGFNFEVSIGNA----------------LVDAYGKCWNSDDSKRV 340

Query: 564  RELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFEFWNGTLDGLQTYNRMVRLGVQLD 623
             + M  +      G SW  I   L A++  +R           D L  +  M+ +G++ D
Sbjct: 341  FDEMVDKN-----GVSWNAIITSL-AYMGLNR-----------DALDMFRLMMDVGLKPD 400

Query: 624  DHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDATKVFD 683
              T   ++ +  +        E+HG   + G + DV++ NTL+ +Y   G  + A+ VF 
Sbjct: 401  SFTISSMIALLVELEFFNLAKEIHGFSLRFGIEHDVFISNTLIDMYAKSGHPSAASNVFH 460

Query: 684  EMS-ERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNL-----------LECFK 743
             M+  R+VVSWN ++   + N     A      M     +  ++           +   +
Sbjct: 461  HMNVRRNVVSWNAMVANFAQNRLELAAIELLREMQAHGEVPDSITLTNVLPACGQVGFLR 520

Query: 744  AGKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYA 803
             GKEIHG ++R+G+  DLF +N+L DMYAK G+                      +A   
Sbjct: 521  NGKEIHGRTIRLGSNHDLFVSNALTDMYAKCGYLN--------------------LAQNV 580

Query: 804  LNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKP 863
             N    + I + IL+                     YS+T++  +S+ LFSEM L+G K 
Sbjct: 581  FNNSVKDEISYNILI-------------------VGYSQTSEWTKSVGLFSEMGLIGLKH 640

Query: 864  DVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIF 923
            DVVSFMGV+SACAN AA KQGKEIHG+A+R H ++HLFV+NSLLDFYT CG ID A K+F
Sbjct: 641  DVVSFMGVVSACANQAAFKQGKEIHGLAVRKHFHTHLFVANSLLDFYTTCGEIDTARKLF 700

Query: 924  NQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVE 983
            +QI  KDVASWNTMILGYGM+GEL  AI+ FEA+++  ++YD VSYIA+LSACSHGGL++
Sbjct: 701  DQIQHKDVASWNTMILGYGMLGELNLAISFFEALKEAGIEYDSVSYIAILSACSHGGLLD 760

Query: 984  RGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGAC 1043
             G +YF  M  Q  +PTEMHY C+VDLLGRAG +EEA +LI+ LPI PD+NIWGALLGAC
Sbjct: 761  EGRKYFEAMKAQKFKPTEMHYACMVDLLGRAGLLEEAEQLIKSLPITPDANIWGALLGAC 778

Query: 1044 RIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPG 1103
            RI+GNV LGC AAE+LF+LKPQH GYY +LSNM+AE G+WDE NR++E+MK RGA+K PG
Sbjct: 821  RIFGNVGLGCWAAENLFKLKPQHAGYYAVLSNMFAEAGKWDEANRVKEMMKLRGARKKPG 778

Query: 1104 CSWVQIHGQLHAFVVDDRAE 1111
            CSWV I  Q+HAFVV +R E
Sbjct: 881  CSWVHIQDQVHAFVVGERME 778

BLAST of Cp4.1LG16g08460 vs. TAIR10
Match: AT2G13600.1 (AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 385.6 bits (989), Expect = 1.2e-106
Identity = 204/539 (37.85%), Positives = 311/539 (57.70%), Query Frame = 1

Query: 601  WNGTLDGLQTYNR----------MVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVF 660
            WN  + G   ++R          M + G  L++++F  VL  CS   D+ KG++VH ++ 
Sbjct: 120  WNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIA 179

Query: 661  KLGFDSDVYVGNTLLMLYGNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARN 720
            K  F SDVY+G+ L+ +Y  CG +NDA +VFDEM +R+VVSWN++I     NG   EA +
Sbjct: 180  KSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALD 239

Query: 721  YYFWMTLRSGIQPN------------LLECFKAGKEIHGFSMRMGT-ETDLFTANSLIDM 780
              F M L S ++P+             L   K G+E+HG  ++      D+  +N+ +DM
Sbjct: 240  V-FQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDM 299

Query: 781  YAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTF 840
            YAK     EA  IF SM  RN+++  +MI+ YA+   + +A R  ++  +  ER N V++
Sbjct: 300  YAKCSRIKEARFIFDSMPIRNVIAETSMISGYAM-AASTKAAR--LMFTKMAER-NVVSW 359

Query: 841  TNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIH-- 900
              ++   A Y++  +  E+L+LF  ++     P   SF  ++ ACA+LA +  G + H  
Sbjct: 360  NALI---AGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVH 419

Query: 901  ----GVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMI 960
                G   ++     +FV NSL+D Y KCG ++    +F +++ +D  SWN MI+G+   
Sbjct: 420  VLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQN 479

Query: 961  GELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEML-DQHLEPTEMH 1020
            G    A+ +F  M +   + D ++ I VLSAC H G VE G  YFS M  D  + P   H
Sbjct: 480  GYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDH 539

Query: 1021 YTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELK 1080
            YTC+VDLLGRAGF+EEA  +I  +P+ PDS IWG+LL AC+++ N+ LG   AE L E++
Sbjct: 540  YTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVE 599

Query: 1081 PQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRA 1110
            P + G Y+LLSNMYAE G+W++V  +R+ M+  G  K PGCSW++I G  H F+V D++
Sbjct: 600  PSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKS 650

BLAST of Cp4.1LG16g08460 vs. TAIR10
Match: AT1G20230.1 (AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 361.3 bits (926), Expect = 2.4e-99
Identity = 184/547 (33.64%), Positives = 306/547 (55.94%), Query Frame = 1

Query: 611  YNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGN 670
            ++RM   G+  D H  P + K+C++      G ++H V    G D D +V  ++  +Y  
Sbjct: 104  FSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSMFHMYMR 163

Query: 671  CGFLNDATKVFDEMSERDVV-----------------------------------SWNTV 730
            CG + DA KVFD MS++DVV                                   SWN +
Sbjct: 164  CGRMGDARKVFDRMSDKDVVTCSALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGI 223

Query: 731  IGLLSVNGDYREARNYYFWMTLRSGIQPNLL------------ECFKAGKEIHGFSMRMG 790
            +   + +G ++EA    F      G  P+ +            E    G+ IHG+ ++ G
Sbjct: 224  LSGFNRSGYHKEAV-VMFQKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQG 283

Query: 791  TETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVI 850
               D    +++IDMY KSGH     S+F+  +       N  I   + NG+  +A+    
Sbjct: 284  LLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFE 343

Query: 851  LLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACA 910
            L +E     N V++T+++  CA+  +    +E+L LF EM++ G KP+ V+   ++ AC 
Sbjct: 344  LFKEQTMELNVVSWTSIIAGCAQNGKD---IEALELFREMQVAGVKPNHVTIPSMLPACG 403

Query: 911  NLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNT 970
            N+AA+  G+  HG A+R HL  ++ V ++L+D Y KCGRI+L+  +FN +  K++  WN+
Sbjct: 404  NIAALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNS 463

Query: 971  MILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQH 1030
            ++ G+ M G+ +  +++FE++   +++ D +S+ ++LSAC   GL + GW+YF  M +++
Sbjct: 464  LMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEY 523

Query: 1031 -LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKA 1090
             ++P   HY+C+V+LLGRAG ++EA +LI+ +P  PDS +WGALL +CR+  NV+L   A
Sbjct: 524  GIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIA 583

Query: 1091 AEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHA 1110
            AE LF L+P++ G Y+LLSN+YA  G W EV+ IR  M+S G KK+PGCSW+Q+  +++ 
Sbjct: 584  AEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYT 643

BLAST of Cp4.1LG16g08460 vs. TAIR10
Match: AT3G02330.1 (AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 354.4 bits (908), Expect = 2.9e-97
Identity = 237/851 (27.85%), Positives = 402/851 (47.24%), Query Frame = 1

Query: 322  SLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHG---------------VALR 381
            S + F++           +F  V   CA   A++ GK+ H                  L+
Sbjct: 32   SFSYFTDFLNQVNSVSTTNFSFVFKECAKQGALELGKQAHAHMIISGFRPTTFVLNCLLQ 91

Query: 382  NHLNSHLFVS----------------NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTM 441
             + NS  FVS                N +++ Y+K   +  A   FN +  +DV SWN+M
Sbjct: 92   VYTNSRDFVSASMVFDKMPLRDVVSWNKMINGYSKSNDMFKANSFFNMMPVRDVVSWNSM 151

Query: 442  ILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHL 501
            + GY   GE   +I +F  M  + +++D  ++  +L  CS       G Q    ++    
Sbjct: 152  LSGYLQNGESLKSIEVFVDMGREGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGC 211

Query: 502  EPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE 561
            +   +  + L+D+  +     E+  + + +P   +S  W A++  C     + L  K  +
Sbjct: 212  DTDVVAASALLDMYAKGKRFVESLRVFQGIP-EKNSVSWSAIIAGCVQNNLLSLALKFFK 271

Query: 562  HLFELKP--QHCGYYILLSNMYA----ETGRWDEVNRIRELMKSRGAKKSPGCS-WVQIH 621
             + ++        Y  +L +  A      G     + ++    + G  ++     + +  
Sbjct: 272  EMQKVNAGVSQSIYASVLRSCAALSELRLGGQLHAHALKSDFAADGIVRTATLDMYAKCD 331

Query: 622  GQLHAFVVDDRAEGF--EFWNGTLDG----------LQTYNRMVRLGVQLDDHTFPFVLK 681
                A ++ D +E    + +N  + G          L  ++R++  G+  D+ +   V +
Sbjct: 332  NMQDAQILFDNSENLNRQSYNAMITGYSQEEHGFKALLLFHRLMSSGLGFDEISLSGVFR 391

Query: 682  ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDATKVFDEMSERDVVS 741
             C+    + +G++++G+  K     DV V N  + +YG C  L +A +VFDEM  RD VS
Sbjct: 392  ACALVKGLSEGLQIYGLAIKSSLSLDVCVANAAIDMYGKCQALAEAFRVFDEMRRRDAVS 451

Query: 742  WNTVIGLLSVNGDYREARNYYFWMTLRSGIQPN-------LLECFKA----GKEIHGFSM 801
            WN +I     NG   E   + F   LRS I+P+       L  C       G EIH   +
Sbjct: 452  WNAIIAAHEQNGKGYETL-FLFVSMLRSRIEPDEFTFGSILKACTGGSLGYGMEIHSSIV 511

Query: 802  RMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIR 861
            + G  ++     SLIDMY+K G   EA  I     +R  VS  TM     ++   L+ + 
Sbjct: 512  KSGMASNSSVGCSLIDMYSKCGMIEEAEKIHSRFFQRANVS-GTMEELEKMHNKRLQEM- 571

Query: 862  FVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVIS 921
                          V++ +++       ++ D   +  LF+ M  +G  PD  ++  V+ 
Sbjct: 572  -------------CVSWNSIISGYVMKEQSED---AQMLFTRMMEMGITPDKFTYATVLD 631

Query: 922  ACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVAS 981
             CANLA+   GK+IH   ++  L S +++ ++L+D Y+KCG +  +  +F + L +D  +
Sbjct: 632  TCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFEKSLRRDFVT 691

Query: 982  WNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEM- 1041
            WN MI GY   G+ E AI +FE M  + ++ + V++I++L AC+H GL+++G +YF  M 
Sbjct: 692  WNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLIDKGLEYFYMMK 751

Query: 1042 LDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIY-GNVEL 1101
             D  L+P   HY+ +VD+LG++G V+ A ELIR +P   D  IW  LLG C I+  NVE+
Sbjct: 752  RDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGVCTIHRNNVEV 811

Query: 1102 GCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHG 1110
              +A   L  L PQ    Y LLSN+YA+ G W++V+ +R  M+    KK PGCSWV++  
Sbjct: 812  AEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKEPGCSWVELKD 862

BLAST of Cp4.1LG16g08460 vs. TAIR10
Match: AT3G29230.1 (AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.1 bits (897), Expect = 5.5e-96
Identity = 202/557 (36.27%), Positives = 310/557 (55.66%), Query Frame = 1

Query: 42  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNT 101
           +L Q K++HA  +   L    + +   LI   +  +       +F+Q  +      L N+
Sbjct: 31  NLNQVKQLHAQIIRRNL-HEDLHIAPKLISALSLCRQTNLAVRVFNQVQEP--NVHLCNS 90

Query: 102 LIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKL 161
           LIRAH+   +        ++ M RFG+  D+ T+PF+LK CS    +     +H  + KL
Sbjct: 91  LIRAHA-QNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNHIEKL 150

Query: 162 GFDSDVYVGNTLLMLYGNCGFLN--DAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARK 221
           G  SD+YV N L+  Y  CG L   DA K+F++MSERD VSWN+++G L   G+ R+AR+
Sbjct: 151 GLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELRDARR 210

Query: 222 EIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNIVSWNTMIANYALNG 281
                  R     DL + N+++D YA+    ++A  +F  M  RN VSW+TM+  Y+  G
Sbjct: 211 LFDEMPQR-----DLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAG 270

Query: 282 VALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNLFSEMRLLGKKPDVV 341
             +E  R V+  +      N VT+T ++   A Y+E     E+  L  +M   G K D  
Sbjct: 271 -DMEMAR-VMFDKMPLPAKNVVTWTIII---AGYAEKGLLKEADRLVDQMVASGLKFDAA 330

Query: 342 SFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFNQI 401
           + + +++AC     +  G  IH +  R++L S+ +V N+LLD Y KCG +  A  +FN I
Sbjct: 331 AVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCGNLKKAFDVFNDI 390

Query: 402 LFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGW 461
             KD+ SWNTM+ G G+ G  + AI +F  MR + ++ D V++IAVL +C+H GL++ G 
Sbjct: 391 PKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVLCSCNHAGLIDEGI 450

Query: 462 QYFSEMLDQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRI 521
            YF  M   + L P   HY CLVDLLGR G ++EA ++++ +P+ P+  IWGALLGACR+
Sbjct: 451 DYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEPNVVIWGALLGACRM 510

Query: 522 YGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCS 581
           +  V++  +  ++L +L P   G Y LLSN+YA    W+ V  IR  MKS G +K  G S
Sbjct: 511 HNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRSKMKSMGVEKPSGAS 570

Query: 582 WVQIHGQLHAFVVDDRA 596
            V++   +H F V D++
Sbjct: 571 SVELEDGIHEFTVFDKS 573

BLAST of Cp4.1LG16g08460 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 347.1 bits (889), Expect = 4.7e-95
Identity = 213/678 (31.42%), Positives = 359/678 (52.95%), Query Frame = 1

Query: 26  NSAVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSV-----SLCASLILNYAKFQHPE 85
           +S V ++  T    ++S    + VH    L+G +  S      S+  SL+  Y K Q  +
Sbjct: 188 SSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVD 247

Query: 86  SFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVL 145
           S   +F +  +  R    WN++I  +    NG  + GL  + +M+  G+++D  T   V 
Sbjct: 248 SARKVFDEMTE--RDVISWNSIINGY--VSNGLAEKGLSVFVQMLVSGIEIDLATIVSVF 307

Query: 146 KICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVV 205
             C+DS  I  G  VH +  K  F  +    NTLL +Y  CG L+ AK VF EMS+R VV
Sbjct: 308 AGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVV 367

Query: 206 SWNTVIGLLSVNGDYREARKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHS 265
           S+ ++I   +  G   EA K         G   D++T  ++++  A+     E   + H 
Sbjct: 368 SYTSMIAGYAREGLAGEAVKLFEEMEEE-GISPDVYTVTAVLNCCARYRLLDEGKRV-HE 427

Query: 266 MDRRNIVSWNTMIANYALNGVA-LEAIRFVILLQESGERPNAVTFTNVLPACARYSETND 325
             + N + ++  ++N  ++  A   +++   L+       + +++  ++   ++    N+
Sbjct: 428 WIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANE 487

Query: 326 CLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNS 385
            L   NL  E +     PD  +   V+ ACA+L+A  +G+EIHG  +RN   S   V+NS
Sbjct: 488 ALSLFNLLLEEKRFS--PDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANS 547

Query: 386 LLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYD 445
           L+D Y KCG + LA  +F+ I  KD+ SW  MI GYGM G  + AI +F  MR   ++ D
Sbjct: 548 LVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEAD 607

Query: 446 LVSYIAVLSACSHGGLVERGWQYFSEMLDQ-HLEPTEMHYTCLVDLLGRAGFVEEAAELI 505
            +S++++L ACSH GLV+ GW++F+ M  +  +EPT  HY C+VD+L R G + +A   I
Sbjct: 608 EISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFI 667

Query: 506 RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWD 565
             +PI PD+ IWGALL  CRI+ +V+L  K AE +FEL+P++ GYY+L++N+YAE  +W+
Sbjct: 668 ENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWE 727

Query: 566 EVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFEFWNGTLDGLQTYNRMVRL 625
           +V R+R+ +  RG +K+PGCSW++I G+++ FV  D +      N   + ++ + R VR 
Sbjct: 728 QVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSS------NPETENIEAFLRKVRA 787

Query: 626 GVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVG--NTLLMLYGN---CG 685
            + +++   P       D+ ++ K   + G   KL     +       ++ +  N   CG
Sbjct: 788 RM-IEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCG 847

Query: 686 FLNDATKVFDEMSERDVV 691
             ++  K   +++ R++V
Sbjct: 848 DCHEMAKFMSKLTRREIV 850

BLAST of Cp4.1LG16g08460 vs. NCBI nr
Match: gi|449445027|ref|XP_004140275.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14170 [Cucumis sativus])

HSP 1 Score: 631.3 bits (1627), Expect = 3.5e-177
Identity = 337/634 (53.15%), Positives = 421/634 (66.40%), Query Frame = 1

Query: 1   MLHIGFHFAQIARFQFRNYVRSTEKNSAVH-INLLTLCFNAQSLRQTKEVHAICLLNGLL 60
           +L +   + +   + F   +RS  K + V  I+LL +    +    T+ +H   +  GL 
Sbjct: 234 LLSVNGDYTEARNYYFWMILRSVIKPNLVSVISLLPISAALEDEEMTRRIHCYSVKVGL- 293

Query: 61  PHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLET 120
              V+ C +L+  Y K    ++   +F++TV+    +  WN++I   +  G    D L  
Sbjct: 294 DSQVTTCNALVDAYGKCGSVKALWQVFNETVEKNEVS--WNSIINGLACKGR-CWDALNA 353

Query: 121 YNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGN 180
           +  M+  G Q +  T   +L +  +      G E+HG   ++G ++D+++ N+L+ +Y  
Sbjct: 354 FRMMIDAGAQPNSVTISSILPVLVELECFKAGKEIHGFSMRMGTETDIFIANSLIDMYAK 413

Query: 181 CGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREA---------------------- 240
            G   +A  +F  +  R++VSWN +I   ++N    EA                      
Sbjct: 414 SGHSTEASTIFHNLDRRNIVSWNAMIANYALNRLPLEAIRFVIQMQETGECPNAVTFTNV 473

Query: 241 ------------RKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNI 300
                        KEIH   +R+G  +DLF +NSLIDMYAK G         HS   RN+
Sbjct: 474 LPACARLGFLGPGKEIHAMGVRIGLTSDLFVSNSLIDMYAKCG-------CLHSA--RNV 533

Query: 301 VSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNL 360
             +NT                          R + V++ N+L     YSET+DCL+SLNL
Sbjct: 534 --FNT-------------------------SRKDEVSY-NIL--IIGYSETDDCLQSLNL 593

Query: 361 FSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTK 420
           FSEMRLLGKKPDVVSF+GVISACANLAA+KQGKE+HGVALRNHL SHLFVSNSLLDFYTK
Sbjct: 594 FSEMRLLGKKPDVVSFVGVISACANLAALKQGKEVHGVALRNHLYSHLFVSNSLLDFYTK 653

Query: 421 CGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAV 480
           CGRID+AC++FNQILFKDVASWNTMILGYGMIGELE+AI+MFEAMRDD VQYDLVSYIAV
Sbjct: 654 CGRIDIACRLFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRDDTVQYDLVSYIAV 713

Query: 481 LSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPD 540
           LSACSHGGLVERGWQYFSEML Q LEPTEMHYTC+VDLLGRAGFVEEAA+LI++LPIAPD
Sbjct: 714 LSACSHGGLVERGWQYFSEMLAQRLEPTEMHYTCMVDLLGRAGFVEEAAKLIQQLPIAPD 773

Query: 541 SNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIREL 600
           +NIWGALLGACRIYGNVELG +AAEHLFELKPQHCGYYILLSN+YAETGRWDE N+IREL
Sbjct: 774 ANIWGALLGACRIYGNVELGRRAAEHLFELKPQHCGYYILLSNIYAETGRWDEANKIREL 824

BLAST of Cp4.1LG16g08460 vs. NCBI nr
Match: gi|659097428|ref|XP_008449620.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial [Cucumis melo])

HSP 1 Score: 620.5 bits (1599), Expect = 6.2e-174
Identity = 333/634 (52.52%), Positives = 412/634 (64.98%), Query Frame = 1

Query: 1   MLHIGFHFAQIARFQFRNYVRSTEKNSAVH-INLLTLCFNAQSLRQTKEVHAICLLNGLL 60
           +L +   + +   + F   +RS  K + V  I+LL +    +    T+ +H   +  GL 
Sbjct: 234 LLSVNGDYKEARNYYFWMILRSGIKPNLVSVISLLPISAALEDEEMTRRIHCFSVKVGL- 293

Query: 61  PHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLET 120
              V+ C +L+  Y K    ++   +F++ V+  R    WN++I   +  G    D L+ 
Sbjct: 294 DSQVTTCNALVDAYGKCGSVKALWQVFNEMVE--RNEVSWNSIINGLACKGR-CWDTLKA 353

Query: 121 YNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGN 180
           +  M+  G + +  T   +L +  +      G E+HG   ++G ++D+++ N+L+ +Y  
Sbjct: 354 FRMMIDAGAKPNSVTISSILPVLVELECFKAGKEIHGFSMRIGTETDIFIANSLIDMYAK 413

Query: 181 CGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREA---------------------- 240
            G   +A  +F  +  R+VV+WN +I   ++N    EA                      
Sbjct: 414 SGRSTEASTIFHNLDRRNVVTWNAMIANYALNRLPLEAIRFVIQMQETGECPNAVTFTNV 473

Query: 241 ------------RKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDRRNI 300
                        KEIH   +R+G  +DLF +NSLID                       
Sbjct: 474 LPACARLGFLGPGKEIHAMVVRIGLTSDLFVSNSLID----------------------- 533

Query: 301 VSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLESLNL 360
                    YA  G    A         +    + V++ N+L     YSETNDC +SLNL
Sbjct: 534 --------MYAKCGSLCSARNLF-----NTSHKDEVSY-NIL--IIGYSETNDCFQSLNL 593

Query: 361 FSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTK 420
           FSEMRLLGKKPDVVSF+GVISACANLAA+KQGKEIHGVALRNHL SHLFVSNSLLDFYTK
Sbjct: 594 FSEMRLLGKKPDVVSFVGVISACANLAALKQGKEIHGVALRNHLYSHLFVSNSLLDFYTK 653

Query: 421 CGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAV 480
           CGRID+AC++FNQILFKDVASWNTMILGYGMIGELE+AI+MFEAMRDD VQYDLVSYIAV
Sbjct: 654 CGRIDIACRVFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRDDTVQYDLVSYIAV 713

Query: 481 LSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPD 540
           LSACSHGGLVERGWQYFSEML QHLEPTEMHYTC+VDLLGRAGFVEEAAELI+RLPIAPD
Sbjct: 714 LSACSHGGLVERGWQYFSEMLAQHLEPTEMHYTCMVDLLGRAGFVEEAAELIQRLPIAPD 773

Query: 541 SNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRIREL 600
           +NIWGALLGACRIYGNVELGC+AAEHLFELKPQHCGYYILLSN+YAETGRWDE NRIREL
Sbjct: 774 ANIWGALLGACRIYGNVELGCRAAEHLFELKPQHCGYYILLSNIYAETGRWDEANRIREL 824

BLAST of Cp4.1LG16g08460 vs. NCBI nr
Match: gi|1012349302|gb|KYP60492.1| (hypothetical protein KK1_022898 [Cajanus cajan])

HSP 1 Score: 604.4 bits (1557), Expect = 4.6e-169
Identity = 294/470 (62.55%), Positives = 363/470 (77.23%), Query Frame = 1

Query: 656  SDVYVGNTLLMLYGNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWM 715
            S V VGN L+ +YG CG    + KVFDE+ ER+VVSWN++I   S  G Y +A +  F +
Sbjct: 118  SHVKVGNALVDVYGKCGSEKASKKVFDEIDERNVVSWNSIITSFSFRGKYMDALDV-FRL 177

Query: 716  TLRSGIQPNL------------LECFKAGKEIHGFSMRMGTETDLFTANSLIDMYAKSGH 775
             + +G++PN             L  FK G EIHGFS+RM  E+D+F ANSLID+YAKSG 
Sbjct: 178  MIGTGMRPNSVTISSMLPVLGELGLFKWGMEIHGFSLRMAIESDIFIANSLIDIYAKSGS 237

Query: 776  STEASSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPA 835
            S  AS IF+ M+ R+IVSWN MIAN+A N +  EA+  V  +Q  GE PN VTFTNVLPA
Sbjct: 238  SRIASIIFNKMEGRSIVSWNAMIANFAQNRLEFEAVELVRQMQAKGETPNNVTFTNVLPA 297

Query: 836  CARYSETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHL 895
            C  YS TNDC ES++LFSEM LLG  PD+VSFMGVISACANLA+++QGKE+HG+ +R   
Sbjct: 298  C--YSRTNDCSESISLFSEMILLGMLPDIVSFMGVISACANLASIRQGKEVHGLLMRKLF 357

Query: 896  NSHLFVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEA 955
            ++HLFV+NSLLD YT+CGRIDLA K+F++I  KDVASWNTMILGYGM+GELE+AIN+FEA
Sbjct: 358  HTHLFVANSLLDLYTRCGRIDLATKVFDRIQNKDVASWNTMILGYGMLGELETAINLFEA 417

Query: 956  MRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGF 1015
            M++D V+YD VS+IAVLSACSHGGL+E+G +YF  M D ++EP   HY C+VDLLGRAG 
Sbjct: 418  MKEDGVEYDSVSFIAVLSACSHGGLIEKGRKYFKMMHDLNIEPAHTHYACMVDLLGRAGL 477

Query: 1016 VEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNM 1075
            + EAA+LIR L I PD+NIWGALLGACRI+GN+ELG  AAEHLF+LKPQH GYYILLSNM
Sbjct: 478  MREAADLIRGLSIVPDTNIWGALLGACRIHGNIELGHWAAEHLFKLKPQHSGYYILLSNM 537

Query: 1076 YAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFE 1114
            YAE  RWDE N++RELMKSRGAKK+PGCSWVQ   Q+HAF+V ++ +  +
Sbjct: 538  YAEAERWDEANKVRELMKSRGAKKNPGCSWVQTGDQVHAFLVGEKIDSLD 584

BLAST of Cp4.1LG16g08460 vs. NCBI nr
Match: gi|595841762|ref|XP_007208277.1| (hypothetical protein PRUPE_ppa015109mg [Prunus persica])

HSP 1 Score: 567.8 bits (1462), Expect = 4.8e-158
Identity = 287/514 (55.84%), Positives = 356/514 (69.26%), Query Frame = 1

Query: 611  YNRMVRLGVQLDDHTFPF--VLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLY 670
            Y R + LG+    +      VL +C++  D    +++H  V K G D  V  GN L+ +Y
Sbjct: 28   YYREMNLGIGFKPNLVSVISVLPVCAELEDERMAIQIHCYVVKAGLDLLVTTGNALVDVY 87

Query: 671  GNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNL--- 730
            G CG  N + +VF E+ +++ VSWN  I  LS  G   EA   + WM +  G++PN    
Sbjct: 88   GKCGNANASKQVFGEIIQKNEVSWNAAITSLSYMGHNIEALATFRWM-IDEGVKPNSVTI 147

Query: 731  ---------LECFKAGKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHSMDR 790
                     L  F  G+ +HGFS+RMG E+D+F ANSLIDMYAKSG S EAS++F  MD+
Sbjct: 148  SSMIPVLVELAFFGVGRRLHGFSIRMGIESDVFIANSLIDMYAKSGRSNEASNVFQEMDK 207

Query: 791  RNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARYSETNDCLES 850
            RNIVSWN MIAN+  N + LEAI  V  +Q                            ES
Sbjct: 208  RNIVSWNAMIANFGQNRLELEAIGLVRQMQGH--------------------------ES 267

Query: 851  LNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDF 910
            LNLFSEM+L+G   D+VSF+GVISACAN+ A+KQGKEIHG  +R   ++HLFV+NSLLDF
Sbjct: 268  LNLFSEMKLVGMIHDIVSFVGVISACANVTAIKQGKEIHGSLVRKLFHTHLFVANSLLDF 327

Query: 911  YTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSY 970
            YTKCGRIDLA K+F++I  KDVASWNTMILGYGM+GEL +AI++FEAMR+D V+YD VSY
Sbjct: 328  YTKCGRIDLAAKVFDRIPSKDVASWNTMILGYGMLGELNTAISLFEAMREDGVEYDSVSY 387

Query: 971  IAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPI 1030
            IAVLS+CSHGGLVE+G +YF  M   ++EPTE HY C+VDLLGRAG +EEA ELI+ +PI
Sbjct: 388  IAVLSSCSHGGLVEKGKKYFEGMQALNIEPTEKHYACMVDLLGRAGLMEEAVELIKGMPI 447

Query: 1031 APDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDEVNRI 1090
             PD+NIWGALLGACRI+GNVEL   AA+HLF L P+HCGYYILLSNMYAE GRWDEVNR+
Sbjct: 448  VPDANIWGALLGACRIHGNVELASWAADHLFRLNPEHCGYYILLSNMYAEAGRWDEVNRV 507

Query: 1091 RELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAE 1111
            RELMKSRG KK+  CSWVQ+  Q+HAF V +  E
Sbjct: 508  RELMKSRGVKKNRACSWVQVQDQVHAFAVGESLE 514

BLAST of Cp4.1LG16g08460 vs. NCBI nr
Match: gi|922378529|ref|XP_013459503.1| (pentatricopeptide (PPR) repeat protein [Medicago truncatula])

HSP 1 Score: 542.0 bits (1395), Expect = 2.8e-150
Identity = 318/827 (38.45%), Positives = 450/827 (54.41%), Query Frame = 1

Query: 340  SFMGVISACANLAAVKQGKEIHGVALRNHLNSH-LFVSNSLLDFYTKCGRIDLACKIFNQ 399
            S   ++  C +   + Q  ++H  ++ N    H + +S SL+  Y      + +  +F  
Sbjct: 33   SSSNLLHLCTHSQTLSQTNQLHAFSILNAFLPHSVSISASLILKYASFRHPETSLILFQN 92

Query: 400  IL--FKDVASWNTMILGYGMIGELESAINMFEAMRDDKVQYDLVSYIAVLSAC------- 459
             L   K    WNT+I  Y + G  +    ++  M    V+ D  +Y  VL AC       
Sbjct: 93   TLPFSKTAFLWNTLIRAYSIAGFFDG-FGVYNTMVRSGVKPDDHTYPFVLKACSDYLKFD 152

Query: 460  ----SHGGLVERGWQ-----------------YFSEML---DQHLEPTEMHYTCLVDLLG 519
                 HG + + G+                  +F + +   D+  E  ++ +  ++ L  
Sbjct: 153  KGREVHGVVFKVGFDKDVFVGNTLLMFYGNCGFFVDAMNVFDEMFERDKVSWNTVIGLCS 212

Query: 520  RAGFVEEAAELIRRLPIA-----PDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQ-H 579
              GF EE+    + + +A     PD     ++L  C    NV +      ++F++    H
Sbjct: 213  DRGFHEESLCFFKEMVVAAPVVRPDLVTVVSVLPVCADSENVVMARIVHGYVFKVGLSGH 272

Query: 580  CGYYILLSNMYAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHGQLHAFVVDDRAEGFE 639
                  L ++Y + G  +   ++ + M  R        SW  +              GF 
Sbjct: 273  VKVGNALVDVYGKCGSEEACKKVFDEMDERNE-----VSWNAV------------ITGFS 332

Query: 640  FWNGTLDGLQTYNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVY 699
            F   ++D L  +  M+  G++ +  T   +L +  +      GMEVHG   ++G +SD++
Sbjct: 333  FRGLSMDALDAFRSMINTGMRPNPVTISSMLPVLGELGLFKLGMEVHGYSLRMGIESDIF 392

Query: 700  VGNTLLMLYGNCGFLNDATKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRS 759
            +GN+L+ +Y   G    A+ +F++M +R++VSWN+++   + N  +  A      M    
Sbjct: 393  IGNSLIDMYAKSGSSRVASTIFNKMGDRNIVSWNSMVANFAQNRHHFAAVELLRQMQAH- 452

Query: 760  GIQPN-------LLECFK-----AGKEIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEA 819
            G  PN       L  C +      GKEIH   ++ G  TDLF +N+L DMY+K GH    
Sbjct: 453  GENPNNVTFTNVLPACARLGFLNVGKEIHARIIQTGCATDLFLSNALTDMYSKCGH---- 512

Query: 820  SSIFHSMDRRNIVSWNTMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARY 879
                              +A    N    + + + IL+                     Y
Sbjct: 513  ----------------LSLARNVFNVSIKDKVSYNILI-------------------IGY 572

Query: 880  SETNDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHL 939
            S+T +  ESLNLFSEMRL G  PD+VSF+G+ISACA+L+++KQGKEIHG  +R   ++HL
Sbjct: 573  SQTTNSSESLNLFSEMRLSGMTPDIVSFIGIISACAHLSSIKQGKEIHGHLVRKLFHTHL 632

Query: 940  FVSNSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELESAINMFEAMRDD 999
            F +NSLLD YTKCGRIDLA K+F++I  KDVASWNTMILGYGM GE E+AIN+FEAM++D
Sbjct: 633  FAANSLLDLYTKCGRIDLATKVFDRIQHKDVASWNTMILGYGMRGEFETAINLFEAMKED 692

Query: 1000 K-VQYDLVSYIAVLSACSHGGLVERGWQYFSEMLDQHLEPTEMHYTCLVDLLGRAGFVEE 1059
              V+YD VSYIAVLSACSHGGL+E+G +YF +M D ++EPT  HY C+VDLLGRAG +EE
Sbjct: 693  GGVEYDSVSYIAVLSACSHGGLIEKGNKYFKQMQDYNIEPTHTHYACMVDLLGRAGQIEE 752

Query: 1060 AAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLSNMYAE 1114
            AA LIR L   PD+NIWGALLGACRIYGNVELG  AAEHLF+LKP HCGYYILLSNMYAE
Sbjct: 753  AANLIRGLSFEPDANIWGALLGACRIYGNVELGHWAAEHLFKLKPDHCGYYILLSNMYAE 801
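
Each HSP header above reports identity and positives as a count over the alignment length plus a percentage. As a minimal sketch (not part of the database's own tooling), the Python below parses one such statistics line and recomputes the percentages from the counts. The names `HSP_STATS`, `parse_hsp_stats` and `percent` are hypothetical, and the line layout is assumed to match the headers shown on this page.

```python
import re

# Matches the statistics line printed under each "HSP" header.
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, ident_pct, positive, _pos_len, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        # The alignment length is taken from the identity fraction.
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 294/470 (62.55%), Positives = 363/470 (77.23%), Query Frame = 1"
    )
    print(percent(stats["identical"], stats["aligned_length"]))
    print(percent(stats["positive"], stats["aligned_length"]))
```

Comparing the recomputed values with `identity_pct` and `positives_pct` is a quick consistency check when these lines are copied between reports.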

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
PP151_ARATH   2.1e-105   37.85         Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN...
PPR53_ARATH   4.3e-98    33.64         Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN...
PP207_ARATH   5.2e-96    27.85         Pentatricopeptide repeat-containing protein At3g02330 OS=Arabidopsis thaliana GN...
PP261_ARATH   9.8e-95    36.27         Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana GN...
PP320_ARATH   8.3e-94    31.42         Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Match Name         E-value    Identity (%)  Description
A0A0A0KEH6_CUCSA   2.5e-177   53.15         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G428560 PE=4 SV=1
A0A151T0A3_CAJCA   3.2e-169   62.55         Uncharacterized protein OS=Cajanus cajan GN=KK1_022898 PE=4 SV=1
M5W843_PRUPE       3.3e-158   55.84         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015109mg PE=4 SV=1
A0A072UUM5_MEDTR   2.0e-150   38.45         Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_3g045450 PE...
A0A0D2QBB4_GOSRA   1.1e-145   38.50         Uncharacterized protein OS=Gossypium raimondii GN=B456_009G076800 PE=4 SV=1
Match Name    E-value    Identity (%)  Description
AT2G13600.1   1.2e-106   37.85         Pentatricopeptide repeat (PPR) superfamily protein
AT1G20230.1   2.4e-99    33.64         Pentatricopeptide repeat (PPR) superfamily protein
AT3G02330.1   2.9e-97    27.85         Pentatricopeptide repeat (PPR) superfamily protein
AT3G29230.1   5.5e-96    36.27         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1   4.7e-95    31.42         Pentatricopeptide repeat (PPR) superfamily protein
Match Name                         E-value    Identity (%)  Description
gi|449445027|ref|XP_004140275.1|   3.5e-177   53.15         PREDICTED: pentatricopeptide repeat-containing protein At4g14170 [Cucumis sativu...
gi|659097428|ref|XP_008449620.1|   6.2e-174   52.52         PREDICTED: putative pentatricopeptide repeat-containing protein At1g69350, mitoc...
gi|1012349302|gb|KYP60492.1|       4.6e-169   62.55         hypothetical protein KK1_022898 [Cajanus cajan]
gi|595841762|ref|XP_007208277.1|   4.8e-158   55.84         hypothetical protein PRUPE_ppa015109mg [Prunus persica]
gi|922378529|ref|XP_013459503.1|   2.8e-150   38.45         pentatricopeptide (PPR) repeat protein [Medicago truncatula]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0005515       protein binding
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG16g08460.1   Cp4.1LG16g08460.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                        Source     Source Term        Source Description   Alignment

IPR002885   Pentatricopeptide repeat               PFAM       PF01535            PPR
    coord: 266..295     score: 0.27
    coord: 662..688     score: 0.0042
    coord: 990..1013    score: 0.085
    coord: 780..809     score: 0.27
    coord: 199..218     score: 0.16
    coord: 690..712     score: 0.083
    coord: 476..499     score: 0.085
    coord: 377..400     score: 0.2
    coord: 891..914     score: 0.2
    coord: 171..197     score: 0.0015
    coord: 236..265     score: 8.9E-5
    coord: 750..779     score: 8.

IPR002885   Pentatricopeptide repeat               PFAM       PF13041            PPR_2
    coord: 402..450     score: 1.2E-7
    coord: 916..964     score: 1.

IPR002885   Pentatricopeptide repeat               TIGRFAMs   TIGR00756          TIGR00756
    coord: 406..438     score: 1.7E-4
    coord: 750..780     score: 1.9E-4
    coord: 954..987     score: 1.4E-4
    coord: 236..266     score: 1.9E-4
    coord: 440..473     score: 1.4E-4
    coord: 171..199     score: 0.0029
    coord: 920..952     score: 1.

IPR002885   Pentatricopeptide repeat               PROFILE    PS51375            PPR
    coord: 268..298     score: 5.59
    coord: 403..437     score: 10.106
    coord: 782..812     score: 5.59
    coord: 987..1017    score: 6.829
    coord: 917..951     score: 10.106
    coord: 657..687     score: 8.32
    coord: 95..130      score: 8.627
    coord: 813..850     score: 7.805
    coord: 299..336     score: 7.805
    coord: 473..503     score: 6.829
    coord: 438..472     score: 10.369
    coord: 1053..1087   score: 7.311
    coord: 747..781     score: 10.348
    coord: 688..723     score: 9.471
    coord: 886..916     score: 6.928
    coord: 337..371     score: 5.514
    coord: 539..573     score: 7.311
    coord: 233..267     score: 10.348
    coord: 372..402     score: 6.928
    coord: 851..885     score: 5.514
    coord: 952..986     score: 10.369
    coord: 166..200     score: 9

IPR011990   Tetratricopeptide-like helical domain  GENE3D     G3DSA:1.25.40.10
    coord: 520..556     score: 1.1E-8
    coord: 1018..1072   score: 1.1E-8
    coord: 889..948     score: 1.

IPR011990   Tetratricopeptide-like helical domain  unknown    SSF48452           TPR-like
    coord: 382..427     score: 1.68E-6
    coord: 664..712     score: 1.68E-6
    coord: 520..556     score: 1.68E-6
    coord: 893..1074    score: 2.81E-5
    coord: 516..566     score: 2.8

None        No IPR available                       PANTHER    PTHR24015          FAMILY NOT NAMED
    coord: 748..792     score: 3.7E-286
    coord: 831..1094    score: 3.7E-286
    coord: 614..724     score: 3.7E-286
    coord: 407..429     score: 3.7E-286
    coord: 54..200      score: 3.7E
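
The Alignment column above lists each domain hit as a `coord: start..end` entry followed by its `score:`. As a rough sketch for working with this text outside the page, the Python below pairs each coordinate range with the score that follows it, whatever whitespace separates them. The names `PAIR` and `parse_alignment_cell` are hypothetical. Scores are kept as strings on purpose, since some values on this page are cut off and should not be read as complete numbers.

```python
import re

# One hit: "coord: <start>..<end>" followed by "score: <value>".
# \s* tolerates spaces, line breaks, or no separator at all between fields.
PAIR = re.compile(r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*([0-9.Ee+-]+)")


def parse_alignment_cell(text):
    """Return (start, end, score_text) tuples sorted by start position."""
    hits = []
    for start, end, score in PAIR.findall(text):
        # Keep the score as text so a truncated value such as "1." stays visible.
        hits.append((int(start), int(end), score))
    return sorted(hits)


if __name__ == "__main__":
    cell = "coord: 916..964\nscore: 1.coord: 402..450\nscore: 1.2E-7"
    for start, end, score in parse_alignment_cell(cell):
        print(start, end, score)
```

Sorting by start position makes the repeat layout along the protein easier to read than the order in which the hits are listed.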

The following gene(s) are orthologous to this gene:
Gene              Orthologue       Organism                    Block
Cp4.1LG16g08460   CmoCh20G001390   Cucurbita moschata (Rifu)   cmocpeB517
The following gene(s) are paralogous to this gene:

None