Cp4.1LG02g01210 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG02g01210
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionpentatricopeptide repeat-containing protein At4g04790, mitochondrial-like
LocationCp4.1LG02: 4775345 .. 4788438 (-)
RNA-Seq ExpressionCp4.1LG02g01210
SyntenyCp4.1LG02g01210
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAGAACCCAGCGGTAATCGGAAGTTGCCGTCGCCGATAAAACCATAAACCCCACAAAACAGCGGCGCAGCTCAGTTCGCCCGAGGCCGGGAAGTTGCAGAGCTTTCGGAATCTGAATCTCAAATGCCTTCCAAATCCAAAAATCTAAGCTCTCTATTCCGCTCCGCCATTATAGCTTCAAAACCTTCACAGAATCCCCAAGATGCAGCTCTCAAAAACTACGTCTCCGCTATCGATCCCTTCTCTCCTAGCACCTCTCTCTCCAAAGCCATCGATAAACGATCCATCAAGTCCGCGAACTCCAAGAAGCTTCCCCAGAAACTAAACTCTGACGTTCAATTTCCTGCTCTCATTTTAGAAGAGCCGTCAGGGTCTGGTATGCATGTTGTTTTATGATGTTTGATTATGTATTCATCTTGTTTTTTGTTATTTCAGTAATCGGGCTGTAGATTATCTCCAATCAAAGCTTATTTTTGCCTTAGTTTTGGTGGTTTAGTGTTTTTTTTGAATCTGAAGATGAATTTTATTGTAGTTATCTCTCATTTGCAGTGAAGGGACTGATGTGTGAATGATATTTAATTTTTCAAGTTTTAATTCTCTCAAAACATCATGTTTGAATGCAGGGGATTCTATGAAGCATTTAACCAAGGCCATATCTTCCATTCTGTGCGGTACGTGTTATTGAATGTTCGTTGTAGGAATTGCTTCTGAATATTTTTAGAATTCATCACAACGTAGTTTTATTCTTAAGTTGTTGATAAAACAATAGGGATATGTTACTTCGAAAATATCAGAGTGGTTTCTTTAACAGCCACCTTTTGGATCTCACGTTTCAAATTTTTATGTTGCCTATTTCATCGTATCGAGAACGAGGGGAATGTGAACAATTGAACTAATTTCCTGTCCAGTTATACAATCTGATTGAGTTATTTTATCTGTACTTTCATGCAAGTGCTATAATTTATATTGACTTGAACTGTAATGATGAAACAGTAAACTAAACACTTGGGTTTCCCAATTGGGCAGGTTTGTCACTCAATACAGGTTTTAGAATTATATTCAATATTTATAACCATTTGGCTCGTAATTTTCATAGACATTATTTTATAAATTGTAGTCTTGCCTTCAATTATGGTTCAATCTGAATGTAGTTTTTTGTTTCTTTGCCAAGTAAACTGCTTACCTACGCGTTATTTCCTTTTGCAGAAGGCTCATCTGTTAGGTCTCCTAATGCACAAGAAAATGGTGATGAGAACTCTTTGAAACAATTATTAGACATACCATGGTTTTCCAACATGTCCAATTATAGCATATCATTGCGTCGCAGAGAAATATCCCGTGAAAGAAAGCAAAAATGGATATTCAAAAATAGCCAAAACTATAGGTTTAGTCAATTGGTTAGAAATTGTGCACAGAAGCTGGGGACTGATGTTACTTTAGAAGTCTTTGGTAAATTGGGACGAGAAACTGGTGTGAAAGAATACAATGCACTGATAGATATATGTTTAGAGAAGGCTAAGACAAGTAAAGACTTGGAGGTTGTATTGGAACAGATTGCAAAGGTTTATCAGCTATTTAAATTAATGAAAGAACATGGTTTTCAGTTAGAAGATGAAACTTACGGTCCAGTTCTTACATATTTAATTGACATGGACATGATGGAAGAATTTAATTTTTTCTGTGAGGCTGTAAAGGATGGAAATCCAGGCTCGAATTCAAGGTTGGGTTACTATGAGATGTTGTTCTATGTTAAAATCAATGATGTAGAAAAAATTCAGGAGCTCTGTGAGCATGCTATAGCCCATGATGGAGTGGACAAGTACAGTTTACAAGGTACGTCCTACAATTTTCAGATGCATCGATTTTTTCTGAACCCTTTTGTCCTCTTGCATTGAAATAAACGATTGAGAAGTTCTGCTTTCCGTGGAAATTATATATTTTTTAAGATAAGAAACAACCTGTTGATAATTAGCATGAATAAGAAGGGAGGTTGGGTCTGTTTGTGTCAGAGATAATCAGAAGGATCTCTAAACAGCATAAGTAAAGAATAATAGCGAATAACAAAACTCCTTGGTATTTTCAGCCTAAAGTGATGTATATAGTCAAACTCTCTCCCAACTCTTTCTATACCCATGGAAAATACCTAAAAGATTAATATTCCTAACTGATTATATATCCCCCGGCTAAAGTAGAGGTGATCTACAGTACCTCAGCATTCTTTTTAACAACAGGGTCGGCTATGACGTAGGACAATTGCTAGAAATTTTATTTAAATTCAGAGTTAAATTTGAGAGTTTTGTGACTTTTAAAGTTTTATGAATGGTGGGGATTTGAACCTCCCCTGGTGGAACTTGAAGTTGGATTTTTGTCTAGTATTTGCAGTGAAAACTAAATAGCTAAGTTAAAGAAATATATAATAGGAAAAAACTGTATTGTAAAACACCTTTGGCGTCCACTGTGGGTTTCATTGTATCACTGTTTCTGGATAAAGGTTATAGCTATTGCTTTAAAGAGATTGACTGAACGTTTCTTTGTTTATCAATCACGGACTAATACGAAGAGGAAGATCAGAAAGTTTCTACTTCCCCAGCTTCTGTTACCCACAATGACAAAAGCATAATCTCTCTAGATTTCTTCTCATAATCTCTCTAGATTTCTTCTCATAATCTCTCTAGATGCTCACTCAGAACTTGACTTCTCTCCAATCACTTGTCAGATCACTACTTAGTTACAGGTCATGATGGATTTGAAATATGCGTTCACAATGTTTTGATTTTTTTGATAAGCTATTGTCATGTTTTTGGTGAAGCATATGCTTACAAATAGATCCATATTTTTGGAGATCAATCTGTTTGGTTAACAATTTGGAACCTGTGGTTACAGAAAATTATTTGTTGGCCCTTTGTGAATGTGAGCAGAAGGAGGAACTTTTGCAGATGCTGGAAATTGTAGACATCACAAAACTTTCGTCAACTGTACTCGCAGCTAACATTTTCAAGTGCTTAGGGAGATTATTACTTCACTCTATTGCAGAGAAGTTGCTTGTGGCGTTGAAAACTTCTGGTATGTGATGATTACTGTAAACTTTTTTTGTCCAAAGAATTTGAAGCTATCATATATTTGTTTAGGAGCATATCTGGATGAAGCAGAGATGTATTAAAATTAGCAACTGAATCGAACTTTCCAAATAAGAAAAAAGTAACTTCTTTTACACCTTTTGAATTTTTGAAATGGAGACGGCTTTATTCCTAGCTGCTTCAGTGATGCAAACATGATGTCCTTGACACTTACTATCGACTTTGATGATTTATGGCTACTTATCGTTCATGGAAGATCGATGTGCATATATTTCCATTCCATGGGTGGCATGGTAGCAGCAGTCAGTTGGTCAAATTGNAGTTCCATCTTATAGTAAAAGATGAGGAATATCTTAGGCTTTTAACATGTTATTCACTAGTCATTAATAACATTTAAATATGTTTGCATGGGTATTATATTTATTTGACTTTCTTCAATTGATATTATCCATCTTTATTTATTGTACTCTGCTCAGATCACATTTCTTCGGCTAGAATTGTGTTTTTGGTTTAATAGTCGTGATATGCTCATCATTATTGCATAAATATCTTATCATCTTCATACCCATATTGTATTGTTGAATTGAAACTGCAGGGAATGGAGCAGAGAATATCCCTTACCTTATATATAATCATGTTGTCAGCATTCCAAATTTAGCGGTGAGGTTCCCATGAATAGTGCTGGTAATTTCCTTGAGTCTACAAATCTCTCAGGCACTTCTGTTTTTCAGGCAGTTTGATTGTTTTGGAAGTGGTTGCTCTTTTTCTTTTCCAAAATTTTATTTTGCAAGTCATTTTTTTATTTTCGTGTTGTCATATTGCTCAACCTCCATGTCTCTAAACACTATTGTTCCTTTCAAGGCACCGACCCTAAAGAGAATCAAATAAGTCGGCCTACCACAAAACCCTACCCTTCCCACGAAAGGGTAAATGCAAGAGGATCTCCGACATCCTAGTATTGCACTCCGCCTTTGTATATAAGAACCCAATCCTTATCGTGAAAAGTCTGATGTAACACCAAACGACTGAAAATAGCTATCCTAGATAGAGTGAGCAAACTGACAATCCCGAGGTCATGATCCTAAGTCTTTAGATGTACTCTTACAAAGAATGCACCACAGAGGACCAAGAAAAAATGTCTTAAAACTTGATTCAAAATTAACTAATTGAGATAGCTACAACTTTATTAATTGTTTTCCTTTGTTTCTCTAGAGCCTATTATGCTAGCGAAGTAGAATGGAAAAACCACAATTACACATAGGCCAACTCCTTGCTACTAATTTCAATCATTCATGACAATTGTGGATCACAAACTTTTGAACAACAATAGAACAATCAATACTAAACTAAGGTGTCAATAAAAAAAGAACAAANGGTGGTGTGGATGAGGGAATGTGAGGAGGACATGGTTTGGGTGGTACTGTGGAAAAGAATTGATTTTTAGAATCAAACAAATGTCAATTTTGTAAAAACTTAAAAAATACCCATCAATACATCCCACTAAATAATATTTGAAGTTCCAATAGATAAGGATTTTTTTAACGAAGTTCCATCTTATAGTAAAAGATGAGGAATATCTTAGGCTTTTAACATGTTATTCACTAGTCATTAATAACATTTAAATATGTTTGCATGGGTATTATATTTATTTGACTTTCTTCAATTGATATTATCCATCTTTATTTATTGTACTCTGCTCAGATCACATTTCTTCGGCTAGAATTGTGTTTTTGGTTTAATAGTCGTGATATGCTCATCATTATTGCATAAATATCTTATCATCTTCATACCCATATTGTATTGTTGAATTGAAACTGCAGGGAATGGAGCAGAGAATATCCCTTACCTTATATATAATCATGTTGTCAGCATTCCAAATTTAGCGGTGAGGTTCCCATGAATAGTGCTGGTAATTTCCTTGAGTCTACAAATCTCTCAGGCACTTCTGTTTTTCAGGCAGTTTGATTGTTTTGGAAGTGGTTGCTCTTTTTCTTTTCCAAAATTTTATTTTGCAAGTCATTTTTTTATTTTCGTGTTGTCATATTGCTCAACCTCCATGTCTCTAAACACTATTGTTCCTTTCAAGGCACCGACCCTAAAGAGAATCAAATAAGTCGGCCTACCACAAAACCCTACCCTTCCCACGAAAGGGTAAATGCAAGAGGATCTCCGACATCCTAGTATTGCACTCCGCCTTTGTATATAAGAACCCAATCCTTATCGTGAAAAGTCTGATGTAACACCAAACGACTGAAAATAGCTATCCTAGATAGAGTGAGCAAACTGACAATCCCGAGGTCATGATCCTAAGTCTTTAGATGTACTCTTACAAAGAATGCACCACAGAGGACCAAGAAAAAATGTCTTAAAACTTGATTCAAAATTAACTAATTGAGATAGCTACAACTTTATTAATTGTTTTCCTTTGTTTCTCTAGAGCCTATTATGCTAGCGAAGTAGAATGGAAAAACCACAATTACACATAGGCCAACTCCTTGCTACTAATTTCAATCATTCATGACAATTGTGGATCACAAACTTTTGAACAACAATAGAACAATCAATACTAAACTAAGGTGTCAATAAAAAAAGAACAAAACTTGAATTAAAGAAGACTCCTATTCAACTCAAAGAGGCACAAAATTATAATTGGAATGTTTCACCCTTGAATAAAAAATGAAAACTTAACCACACATCATAAAAGAGAAACAAAAACTGGAAAGGGAGAAGAAGCCTTGTACCCCTTTGTCTCTCTCTCTCTCTCTCTCTCTCTTGTTGTCTTTTTCTGCTCAAGGGGAAGTAAACTCACTGCTCCCGCACTATTTTCCATAGTACAAGGCTCTCTATTTATACTCGAGTGCAGAGGAATCTCATTGACAATAGGCTTCTCCCTTCCTAAAATACTGCTGAAAAAAAACTGAATGAAGAAGTCTGCCGCCTTCTTCAGCCTGAAACGGAAGTTTGCGCATGACTCACAGACCAAAAGTGTGCTTTACCAAAAGTTCAACTAACCTCTGTTGTTGAAAGCTCTGTTGTTTTGCTTCTGCAACAAAGCCTGGAGGACACCCAGTTATTATGTAGTCTAAGGGTCTTTTTTCCTTCTAAAAAGATGTCTTTATAACACAAGGATAAGAGAAAATTCATATTGCTAGGAAGGATCATCTCCCAATTGAATACAGACTGTATTTGATTTCAATAGTTCAACTCGTAAAAAGATGTTCTTGTCTCTTTGATTCTTTTTTGTGCACAACTCTCCTACTCAAGAGAGAGAAAGAGATGGATTCTCCCTTGGATCTTCTCTTGGGTGTTGATTCTCTCATGACTAAGGTCGTATGGAAATAACTTTACTTTTTTGATAGCCTCCATGTCAAATCACCCTGTACAATGGCTCATAATACTCTGTAAAGAATATTAATGCTCTTCCTAGAAAATGCATGTTAACTATAGACTATAAAAAGAATTTTACTTTCTTGAAGGCTTTATCCTACAACTCAAATTGGTTAATTTGAACGTTGAAGGCTTTATCCTACAGTTAAGTCTTCTATTACCTTCAATAAAGTTGAAGGTTTGATCTCTTACCCNGACTGAAAATAGCTATCCTAGATAGAGTGAGCAAACTGACAATCCCGAGGTCATGATCCTAAGTCTTTAGATGTACTCTTACAAAGAATGCACCACAGAGGACCAAGAAAAAATGTCTTAAAACTTGATTCAAAATTAACTAATTGAGATAGCTACAACTTTATTAATTGTTTTCCTTTGTTTCTCTAGAGCCTATTATGCTAGCGAAGTAGAATGGAAAAACCACAATTACACATAGGCCAACTCCTTGCTACTAATTTCAATCATTCATGACAATTGTGGATCACAAACTTTTGAACAACAATAGAACAATCAATACTAAACTAAGGTGTCAATAAAAAAAGAACAAAACTTGAATTAAAGAAGACTCCTATTCAACTCAAAGAGGCACAAAATTANAAGTACTTATCAAGCTGGTAGCCATATTGATTAGAGAGGTCGAAGTAGATTTAGCAGCAGTGGTTCTTTTAATATCAGCATTCAGCAGAGTTTGCAGTTTGCTTTTATTTATTCTTTTCTGGTCTTTATAAATGTCCAGTGCAATGTTGAAATCTTGAGACCGAGGGCTTCTCTTCTTTCAAGTTATCATGAAGTCTTATTAGGGTTTCTACATACTCATAACGATTGCTTGCAGTACATGCATGCAAATCTTTCTACGGTTGTATCCTTCTGTAGTTGTCTGGGTGTTTGATAATGCCTAAGACTTTTTTATGGATCTTCAAGGTTTTATTAAAGGAATAACTTCTTTTGTATTCTATCATTGATATCTTTCACAATTTTTTTTTTTTTTGTTACGTGGAAGATTGAGGATATGGTTTCTAAGTTCAAGAGCATGCACTTCGATTTGGACATCAAGCCTTCATCTGTTTCATATGAGAAGCTCATCTGTTACTGTTGTGGTTCACTTAAGGTACTGGGGGAAGATTGCTAATCATAAATTTGTTTTGAACTACAGCGAAAACCTTTGCTTATTTAGAGTAAGATAATTTAAATTTGTTTGTCTTATTAATATCTTGGGTGCAGTTTCGATACTGCTTTGAAATTAGCTTTTTTCTTTTTAAATCTGATGTAGGTGCATATGGCTCTTGATATAGCAAATGAAATGTGTGATGCAGATTTCACACCATCCACAGATGTCTTGCACTCCATTTTACATGCTATGGATGAAGGCTGTGAGTTTAATTTGGTAAGTGTTTTGTCTTCTCTATTTTTAACTAATACTTGAAGTTAAATTAGTATTGTATGTCATGTGGTTTCTTCCATTCACATCTTCTTTGGTTTTTGCTTCTCATTATCTTGATATTTGCACTTTATATTTTGAACTTGCGTATCTTGAAGGACCTATGACAGAAGTTCCTTTTTTATTTATCTTTTATTTTTTTTATTTTTTTATATATATATCCTTTAGGTGAAAGAATAAGATTGAATTTTAATAACGTGTGATGATATGGTTTACTGTATTTTTGTATTTATCCTGCATGATAAAAGGCAATGTGGCAGCTACTAGGTTTGGCCAGTGATTAGGGGTGTCTTGATATCTTTTGAGGTTACTAGTTCAAATCATAATGAGCAATGATTTTAGGATATTAAATATTCTATGAGTTACCTAATACCAAATGTTGTAGGGGACTGGTCCTTCTTGTTATATTCGATGCAATTTTGGGATTCTGCATTCATTTTCACCGTGACAGGGGAGTGCACACCTGTCAAAATGGTGAAGGGTAGTGAAGAGAGGGAACATTAGCAAGAAAGAAATAGAGGGAAGACTAAGTAGCAGACTGATTAGCTCAGACTGGAAAGTTTGTCATGCACAAACAATTTAAAATTCCCATGCTGTTTGGCTTCTATGTTTAGTTTAAAATTCCTATGCTGTTTGGTCGTATTCCCTATATATTGTTTATTTTTATTGAAGATCTCAATGAACTTCAGTGAGTTTTTATCTTTTTATCCATTTTTGTGGCAGTGTAACTGATGCTGCACTTATATTTCCATGTGATTTCCTTATTTTCTCATTTCAGTTCAATTATCATCGCTGTGTTATTACCAATAACAATTATTATTATTATCCTGGCTGTGCTTATTGCCAAGGTTCATCAAGTCTATTCACTCATATGTCGCCACAACCTGAAGCCGGATAGTGAAGTGTTCAGGAGAATGATAAGTCTGTGCATCAAGATGAAAGACGTAATTTTTTACTTAAATTTTGTTTTCATGTTCACTTTATTGTTATTTTATGTGCTCAAATGTATTGCATTTCAAAAATTGAACTTTCATGGATACCAAATTTTAATTTTTCTTTATGCAGTTTAATGGTGCCTATGACATGCTTAAGGAATCTGAGAAAATGAACTTTACTCCTACAGCTAGCATGTACAATGCTATCATGGCCGGATACTTCCGAGAGGTACCTTTTTCAGTTCTATGGTCGTCCCTCCTCCCTCTCTATAACATAAATTTGACCTGTGAATGTACTTTGTGCATTTTCTGTCCTCATAATATAAAAGTTTCATTTTACACATTCATTCTCTTATTTAATATATAAACATTGGACGATGTTAACTCTTCCAAGTACTCTGCAGAAGAACACTTCTGATGGATTGATGGTTCTCAAGCAAATGGAACTTGCGGATGTAAAACCAGATTCCAAGACTTTCAGCTATTTGATCAGCTATTGTGAATGTGAGGAGGATATTATCAAGGTTACCACAGAACTCTTCAAACTTTGTCTAATTTTTACGTGTAACAAGAATTAGAAAACACTTTTGATATCATATGCCTGGTTTCTTTCAATTTCATTTGCAAGTGGCTTTGCCCCAAAAAAACAATTATTTCCACAGAATTCTTCAAACTTAATGCCTAATTTTTGTGTTTCAAGAATTAGCAAACACATTTGATACAATATGCCTAATTTCTTTCTATTTTCATTTACTAGAGGCTTTGCCCCAAAAAATAACTATTACCACAGAAGTTTTCGAACTCTTATGTCTAATTTTACGTGTAACAAAAATTCGAAAACACTTCTGATATCATATGCCTAATATCTTTCAATTTTCATTTATAAGTGGCTTTGCCCCAAAAAACAATTATTACCTATTATAGTGAATGTTACTTTTGTTTCTTGAATACAAGGATAGTGAAACCTAGTCTATAGTGAAGTATTTAACATACTAAAATCCTCCTAAAATGATAGGGAGAATAAAAGTCTAGTGGAAAATGCAATGTAACATCTTTTTGTTGATTCTTCACTTGGTTTGAACAAAGGTCAAACTGACATGTTCAGAAAACGTGCCCCAGATTGCTAATGCTATTATTATTTCTTCCAATGCCTGTTTTACATAGGATAAAACACATTGAAGTCAAATTTTATTTGTTGAGGGAAGTGGAGAAAGGTGATCAAGCATGTTCATCATAGTTTTGAATCACAGCTAGCTAGCGTTCTTATGAAGTTCTGCATTATAGATATTTGAATTTTTGTATGGCAAATTTGGTTTGACAAGAAAAAGTTTGAGTATCATGATTTTCAGTTTTATTATTTATTCGATAAATCTAATTTTATTCAACTATTTTCATTTATATTTGACTTACAATTATGCATTGCTAGAGAAGTTTGTTTCTCAAACATTTCAGCATGGGTAACGTTGAGGTTATACTATTGCTCCTACAATATCTTTACCATTTTTTGTTCATTTACTTGCTTCTTCCTTTCAAATTCCTTACTTTTATCTATAGTATTCCAAGTTCCAACCCCGCCTAGCAAACTCCTTTTTCACAGTTAGACTCTGATAAACTGCCAGCATCTATCCCATATAACTAACGTAAATGCCTAACAAACCATGTTTTGTATTAATCACTGAGGATGTTGTTTTTTAAATTCAATTATTATTCACCCTCTTTAAACTATAAAGGCCAAGCATTATATGGATAAAATTTTGAAGCCGCTTATCTTCCTGTAGTATTATGAAGAGCTGAAGAGTTCTGGAGTTCAAGCGACAAAGCATATTTTCATGGCCCTTATAAATGCATATGCTGCTCATGGACAATTTGAGAAGGCAAAACAGGTACTTGATCTTGATCTTCATGTTTTTATTCTTATAAAGATTTTCATAAGAGTTGGCTGAAGTGTACAATGTAAGTAGGTTATATCAGATGAAGAAATACCAGTCAAGAACTTGAATGAAATTAGAAGTGTGCTAGTCTCTGCTCTTGCTTCAAATGCGCAATTAGCTGATGCCCTGAAAATTTATGATGAAATGAAACAAGCTGGATGTAATTTGGAGCCCAAGGCTGTCATCAGTCTTATTGTAAGTTAAACGTCTGTAATGAATGATTATTACTTCTTCTCGGTTTGTCAACTAAAGAGTTTTTCCCTTTTGTCTCTAGGAGCACTATCCATTCGATGGGCCGGTGAACAGAATTTTTCAGTTACTTGGTGAATTGCATCATGATCTCGACGACTGGATAGATTGTTGTCGCAGAATTCTCTTGTTTTCTGTAAAACACAATGATCTGAGGTTCATCTATGGAAGTTCTTTGTATACTTGGATGATTTGCATTAGATTTATTCTCGTCTCCCACGAAAGAAACTCATTGTCTTCTTCTTTTGGTAGTTCGACCGTCGATCTATTGAAGCAGCTCAGTTACAGATGCTGTAATGATGAAGTGATGATGGGAGTTGCTTTTGATGAGGTTTGTTCTTCCCTTCCCATTATATTTGTTATGTACGACTTTCCATTTAAAGCACTTTTTGATATTTTTGTTAGATAATCATTTAGTGATTTTGAAAATACTTAAAATGCTTTAGTTGGTACCTTTTCCAAAAAAATTCTTATAGAAGTCGTGAGATCCCACATTAGTTGGGGAGGAGATCGAAACACCCTTTTTAAGGGTCTGGAAACCTCTCCCTAGCAGACACGTTTTAAAACCTTGAGGGTAAGCCCGAAAGAGAAAGACGAAAGAGGATAATATTTGCTAGCGGTGGGCTTGGGCCATTACAAATGGTACCAGGGCCAGACACCAAACGATGTGCCAGCGAGGAGGTTGTTCCCCGAAAGGGGTAGACACGAGGCGGTGTGCCAGCAAGGACCCTGGGCCTCGAAGTGGGGTGGATTTGGTGGAGTCCCACATCAATTGGAGAAAGGAACGAGTGTCAGCGAGGATGCTGGGCCCTGAAGGGGGGTTGATTGTGAGATCTCACATCGGTTGGGGAGGAGAACGAAACACCCTTTATAAGAGTGGAAACCTCTCCCTAGCATACGCGATTTAAAAATCTTGAGGGTAAGCTCGAAAGAGAAAGCCTAAAGAGGACAATATCTGCTGGCGGTAGGCTTGGGCCGTTACCGAAGTGATCTTAGTTGAAACATTTCTTTCAAAATTCATCTCAAGCTCACCCTAAAATGCCATAAAACTAATGAAGAATTTGGTTTATGGGCTTTGGTGCCTTTTTTTACCTTTTAAGACATAGAGGATTCTATACAAATAGCCAGTTCATCCTTCATAATCTTGTCCAAGTATTAAAAGACTACGAAACAGATGACGAGTTTTGAATCCGAGACAAATCTCCGAGCATTAACTTACAAATCGCTATCGTGTCCAGATTTTTTCCCTCATTGCAGAGTCCGAGCCATCATATTTAGAAACAGGCCTGCAATTGCTTCAATTCATAAAGAATGATCTCGGTCTATCTCCCCCACGTAGATGCCTTGATTTTCTCCTGGGTGCTTGTGCCAACGCCAAAGATGCAGAGAGCTCTCTCCTCATCTGGAAAGAATATGAAAAAGCTGGCCTCCCACACAATACTATTAGTTACTTGCGGTAAACTTATGATGTTATTGTTTATTATCTTTCCTTTGTACGAGTCCTCGCTCGAACCTGTTTTGTTTTCTTCGAGCCTAAACGATCCAGTTTGCCTGACTACTCTGTCCATGGGGCATTTTTGCAGGATGTATCAAGCTCTCTTAGCATCTGGGGACCAAACATCTGCCAAAGTTTTGCTTGAAAAAATTCCGAAAGACGATGCTCACGTTTGCTACGTAATCAAGGAATGCGAATCGGTTTATGTTGCCTCTTCCTCAGTAAACAAGAAAAAAGGAAGACACAAGAAAAAGATGCTGAAAATGAGCAGAAATGGAAGAGAGGTATGAATGGAGAAGTATATTAGTATTAAACTAACTTGCATTTTTGGGTTGTAGGAGTAGATTAGAGTATGATCATAGGTATGCATGTGAAGCCCTTTTATTTATTGAATCTAATATCATTTAAAGACTCAACAATGTGGAAGAAAGATTAGACATTTGTTCAAGTAGATACATCATTGATAGTTGTAATAAAATCATCCACTTTTCCTTGGCAATAAAAATATGGGGAGAAATGATTGAAACTCCCTTTTCCTTTGCTCTTCAGGTTCAAATGTTTTTTTTTTTTTTTTTTTTTATTTGCC

mRNA sequence

CAGAACCCAGCGGTAATCGGAAGTTGCCGTCGCCGATAAAACCATAAACCCCACAAAACAGCGGCGCAGCTCAGTTCGCCCGAGGCCGGGAAGTTGCAGAGCTTTCGGAATCTGAATCTCAAATGCCTTCCAAATCCAAAAATCTAAGCTCTCTATTCCGCTCCGCCATTATAGCTTCAAAACCTTCACAGAATCCCCAAGATGCAGCTCTCAAAAACTACGTCTCCGCTATCGATCCCTTCTCTCCTAGCACCTCTCTCTCCAAAGCCATCGATAAACGATCCATCAAGTCCGCGAACTCCAAGAAGCTTCCCCAGAAACTAAACTCTGACGTTCAATTTCCTGCTCTCATTTTAGAAGAGCCGTCAGGGTCTGGGGATTCTATGAAGCATTTAACCAAGGCCATATCTTCCATTCTGTGCGAAGGCTCATCTGTTAGGTCTCCTAATGCACAAGAAAATGGTGATGAGAACTCTTTGAAACAATTATTAGACATACCATGGTTTTCCAACATGTCCAATTATAGCATATCATTGCGTCGCAGAGAAATATCCCGTGAAAGAAAGCAAAAATGGATATTCAAAAATAGCCAAAACTATAGGTTTAGTCAATTGGTTAGAAATTGTGCACAGAAGCTGGGGACTGATGTTACTTTAGAAGTCTTTGGTAAATTGGGACGAGAAACTGGTGTGAAAGAATACAATGCACTGATAGATATATGTTTAGAGAAGGCTAAGACAAGTAAAGACTTGGAGGTTGTATTGGAACAGATTGCAAAGGTTTATCAGCTATTTAAATTAATGAAAGAACATGGTTTTCAGTTAGAAGATGAAACTTACGGTCCAGTTCTTACATATTTAATTGACATGGACATGATGGAAGAATTTAATTTTTTCTGTGAGGCTGTAAAGGATGGAAATCCAGGCTCGAATTCAAGGTTGGGTTACTATGAGATGTTGTTCTATGTTAAAATCAATGATGTAGAAAAAATTCAGGAGCTCTGTGAGCATGCTATAGCCCATGATGGAGTGGACAAGTACAGTTTACAAGAAAATTATTTGTTGGCCCTTTGTGAATGTGAGCAGAAGGAGGAACTTTTGCAGATGCTGGAAATTGTAGACATCACAAAACTTTCGTCAACTGTACTCGCAGCTAACATTTTCAAGTGCTTAGGGAGATTATTACTTCACTCTATTGCAGAGAAGTTGCTTGTGGCGTTGAAAACTTCTGGGAATGGAGCAGAGAATATCCCTTACCTTATATATAATCATGTTGTCAGCATTCCAAATTTAGCGGTGAGGTTCCCATGAATAGTGCTGATTGAGGATATGGTTTCTAAGTTCAAGAGCATGCACTTCGATTTGGACATCAAGCCTTCATCTGTTTCATATGAGAAGCTCATCTGTTACTGTTGTGGTTCACTTAAGGTGCATATGGCTCTTGATATAGCAAATGAAATGTGTGATGCAGATTTCACACCATCCACAGATGTCTTGCACTCCATTTTACATGCTATGGATGAAGGCTGTGAGTTTAATTTGGTTCATCAAGTCTATTCACTCATATGTCGCCACAACCTGAAGCCGGATAGTGAAGTGTTCAGGAGAATGATAAGTCTGTGCATCAAGATGAAAGACTTTAATGGTGCCTATGACATGCTTAAGGAATCTGAGAAAATGAACTTTACTCCTACAGCTAGCATGTACAATGCTATCATGGCCGGATACTTCCGAGAGAAGAACACTTCTGATGGATTGATGGTTCTCAAGCAAATGGAACTTGCGGATGTAAAACCAGATTCCAAGACTTTCAGCTATTTGATCAGCTATTGTGAATGTGAGGAGGATATTATCAAGTATTATGAAGAGCTGAAGAGTTCTGGAGTTCAAGCGACAAAGCATATTTTCATGGCCCTTATAAATGCATATGCTGCTCATGGACAATTTGAGAAGGCAAAACAGGTTATATCAGATGAAGAAATACCAGTCAAGAACTTGAATGAAATTAGAAGTGTGCTAGTCTCTGCTCTTGCTTCAAATGCGCAATTAGCTGATGCCCTGAAAATTTATGATGAAATGAAACAAGCTGGATGTAATTTGGAGCCCAAGGCTGTCATCAGTCTTATTGAGCACTATCCATTCGATGGGCCGGTGAACAGAATTTTTCAGTTACTTGGTGAATTGCATCATGATCTCGACGACTGGATAGATTGTTGTCGCAGAATTCTCTTGTTTTCTGTAAAACACAATGATCTGAGTTCGACCGTCGATCTATTGAAGCAGCTCAGTTACAGATGCTGTAATGATGAAGTGATGATGGGAGTTGCTTTTGATGAGATTTTTTCCCTCATTGCAGAGTCCGAGCCATCATATTTAGAAACAGGCCTGCAATTGCTTCAATTCATAAAGAATGATCTCGGTCTATCTCCCCCACGTAGATGCCTTGATTTTCTCCTGGGTGCTTGTGCCAACGCCAAAGATGCAGAGAGCTCTCTCCTCATCTGGAAAGAATATGAAAAAGCTGGCCTCCCACACAATACTATTAGTTACTTGCGGATGTATCAAGCTCTCTTAGCATCTGGGGACCAAACATCTGCCAAAGTTTTGCTTGAAAAAATTCCGAAAGACGATGCTCACGTTTGCTACGTAATCAAGGAATGCGAATCGGTTTATGTTGCCTCTTCCTCAGTAAACAAGAAAAAAGGAAGACACAAGAAAAAGATGCTGAAAATGAGCAGAAATGGAAGAGAGGTATGAATGGAGAAGTATATTAGTATTAAACTAACTTGCATTTTTGGGTTGTAGGAGTAGATTAGAGTATGATCATAGGTATGCATGTGAAGCCCTTTTATTTATTGAATCTAATATCATTTAAAGACTCAACAATGTGGAAGAAAGATTAGACATTTGTTCAAGTAGATACATCATTGATAGTTGTAATAAAATCATCCACTTTTCCTTGGCAATAAAAATATGGGGAGAAATGATTGAAACTCCCTTTTCCTTTGCTCTTCAGGTTCAAATGTTTTTTTTTTTTTTTTTTTTTATTTGCC

Coding sequence (CDS)

ATGGTTTCTAAGTTCAAGAGCATGCACTTCGATTTGGACATCAAGCCTTCATCTGTTTCATATGAGAAGCTCATCTGTTACTGTTGTGGTTCACTTAAGGTGCATATGGCTCTTGATATAGCAAATGAAATGTGTGATGCAGATTTCACACCATCCACAGATGTCTTGCACTCCATTTTACATGCTATGGATGAAGGCTGTGAGTTTAATTTGGTTCATCAAGTCTATTCACTCATATGTCGCCACAACCTGAAGCCGGATAGTGAAGTGTTCAGGAGAATGATAAGTCTGTGCATCAAGATGAAAGACTTTAATGGTGCCTATGACATGCTTAAGGAATCTGAGAAAATGAACTTTACTCCTACAGCTAGCATGTACAATGCTATCATGGCCGGATACTTCCGAGAGAAGAACACTTCTGATGGATTGATGGTTCTCAAGCAAATGGAACTTGCGGATGTAAAACCAGATTCCAAGACTTTCAGCTATTTGATCAGCTATTGTGAATGTGAGGAGGATATTATCAAGTATTATGAAGAGCTGAAGAGTTCTGGAGTTCAAGCGACAAAGCATATTTTCATGGCCCTTATAAATGCATATGCTGCTCATGGACAATTTGAGAAGGCAAAACAGGTTATATCAGATGAAGAAATACCAGTCAAGAACTTGAATGAAATTAGAAGTGTGCTAGTCTCTGCTCTTGCTTCAAATGCGCAATTAGCTGATGCCCTGAAAATTTATGATGAAATGAAACAAGCTGGATGTAATTTGGAGCCCAAGGCTGTCATCAGTCTTATTGAGCACTATCCATTCGATGGGCCGGTGAACAGAATTTTTCAGTTACTTGGTGAATTGCATCATGATCTCGACGACTGGATAGATTGTTGTCGCAGAATTCTCTTGTTTTCTGTAAAACACAATGATCTGAGTTCGACCGTCGATCTATTGAAGCAGCTCAGTTACAGATGCTGTAATGATGAAGTGATGATGGGAGTTGCTTTTGATGAGATTTTTTCCCTCATTGCAGAGTCCGAGCCATCATATTTAGAAACAGGCCTGCAATTGCTTCAATTCATAAAGAATGATCTCGGTCTATCTCCCCCACGTAGATGCCTTGATTTTCTCCTGGGTGCTTGTGCCAACGCCAAAGATGCAGAGAGCTCTCTCCTCATCTGGAAAGAATATGAAAAAGCTGGCCTCCCACACAATACTATTAGTTACTTGCGGATGTATCAAGCTCTCTTAGCATCTGGGGACCAAACATCTGCCAAAGTTTTGCTTGAAAAAATTCCGAAAGACGATGCTCACGTTTGCTACGTAATCAAGGAATGCGAATCGGTTTATGTTGCCTCTTCCTCAGTAAACAAGAAAAAAGGAAGACACAAGAAAAAGATGCTGAAAATGAGCAGAAATGGAAGAGAGGTATGA

Protein sequence

MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILHAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTPTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEELKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQLADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRILLFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIKNDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQTSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV
Homology
BLAST of Cp4.1LG02g01210 vs. ExPASy Swiss-Prot
Match: Q6NQ81 (Pentatricopeptide repeat-containing protein At4g04790, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g04790 PE=2 SV=2)

HSP 1 Score: 475.3 bits (1222), Expect = 7.9e-133
Identity = 242/468 (51.71%), Postives = 332/468 (70.94%), Query Frame = 0

Query: 2   VSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILH 61
           + KF  MH +LD+ PSS SYEKL+ Y C S +V  ALD+  +M +A    S D+LHS+LH
Sbjct: 357 ILKFNKMHEELDVMPSSTSYEKLVKYSCDSNEVVTALDVVEKMGEAGLMISADILHSLLH 416

Query: 62  AMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTP 121
           A+DE  EF+LV +++S++C  ++KP++E FR +I LC ++KDF GAY+ML   +  N  P
Sbjct: 417 AIDEVLEFDLVRRIHSIMCTKSVKPNTENFRSIIRLCTRIKDFEGAYNMLGNLKNFNLEP 476

Query: 122 TASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEEL 181
            +SM+N I+AGYFREKN S  LMV+KQM+ A VKPDS TF YLI+ C  E+ I KYYEE+
Sbjct: 477 NSSMFNCILAGYFREKNVSSALMVVKQMKEAGVKPDSITFGYLINNCTQEDAITKYYEEM 536

Query: 182 KSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQLA 241
           K +GVQATK I+M+LI+AYAA G+FEKAKQV+ D ++P  N NE++SVL+SALAS  + A
Sbjct: 537 KQAGVQATKRIYMSLIDAYAASGKFEKAKQVLVDPDVPAINQNELKSVLISALASRGKWA 596

Query: 242 DALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRILL 301
           DAL IY+EM++A C+++PK++ISLIE+    G ++ + QL  +L  D   WID   R++L
Sbjct: 597 DALHIYEEMRKAECHVDPKSIISLIEYSDSKGELSTLVQLADDLQDD-TSWIDGFFRMIL 656

Query: 302 FSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIKN 361
           F+V++   S  VDLLK+   R     + +   FDE+F  IAE+EPS +  G+ LL+F+K+
Sbjct: 657 FAVRNKKSSDIVDLLKRNKVRLLKKGIPVEAHFDEVFWAIAETEPSKVHLGMDLLRFMKD 716

Query: 362 DLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQT 421
           +LG  P R+CLDFLL AC NAKD E  LL+WKEY+ A  P N +S+LRMYQ LLA+GD  
Sbjct: 717 ELGFVPSRKCLDFLLHACVNAKDLEHGLLVWKEYQSAAFPCNVLSFLRMYQVLLAAGDSE 776

Query: 422 SAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMS 470
            AK L+ KIPKDD  V ++I+E +S +  S + NKKK   KKKM+ +S
Sbjct: 777 GAKALVSKIPKDDKDVQHIIEESQSAF--SQAPNKKK--PKKKMIVLS 819

BLAST of Cp4.1LG02g01210 vs. ExPASy Swiss-Prot
Match: O49711 (Pentatricopeptide repeat-containing protein At4g21880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21880 PE=3 SV=2)

HSP 1 Score: 469.2 bits (1206), Expect = 5.7e-131
Identity = 232/461 (50.33%), Postives = 324/461 (70.28%), Query Frame = 0

Query: 2   VSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILH 61
           + KF  +H +LDI PSS SYE L+ Y CGS +V  ALDI   MC+A    S ++LHS+L 
Sbjct: 377 IFKFNKLHEELDIVPSSTSYENLVSYLCGSNEVVTALDIVENMCEAGLVISANILHSLLQ 436

Query: 62  AMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTP 121
           A+++  EFNLV ++YS++   ++KP+SE FR+ I+LCI++KDF GAY+ML   +  N  P
Sbjct: 437 AIEQILEFNLVQRIYSIMSNKSVKPNSETFRKSINLCIRIKDFEGAYNMLGNLKNFNLAP 496

Query: 122 TASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEEL 181
            +SMYN+IMAGYFREK  +  L VLK+M+ ADVKPDS TFSYLI+YC  E  I KYY+E+
Sbjct: 497 NSSMYNSIMAGYFREKKVNSALKVLKEMKEADVKPDSVTFSYLINYCGEEATIAKYYKEM 556

Query: 182 KSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQLA 241
           K +GV+  KH++M+L+ AYA+ GQFEKAKQV+ D E+P K+ NE++SVL+SALASN  + 
Sbjct: 557 KQAGVEVNKHVYMSLVKAYASCGQFEKAKQVLMDLEVPAKDHNELKSVLISALASNGNIT 616

Query: 242 DALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRILL 301
           +AL IY+EMK+  C +EPKA++SLIE+   +  +  + +L  EL  D   WID   +I++
Sbjct: 617 EALSIYEEMKKLRCPVEPKAILSLIENSDSNAELGTLVELTHEL-RDSKFWIDGFFKIIV 676

Query: 302 FSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIKN 361
           F+V++N  SS +DLL+Q       D+V +   F+E+F  IAE+E S ++ GL L+ F+K 
Sbjct: 677 FAVRNNRSSSILDLLEQTKNHLSKDDVGVEYWFEEVFKSIAETESSDVKVGLDLVSFMKE 736

Query: 362 DLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQT 421
           +L L P R+CLDFLL AC NAKD +S+LL+W+EY+ A LP+N I+YLRMYQ L+A+GD  
Sbjct: 737 ELELCPSRKCLDFLLHACVNAKDKQSALLVWEEYQCAELPYNVINYLRMYQVLVAAGDSK 796

Query: 422 SAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHK 463
           SA+ ++ KIP DD  V  +IKE   V+       KKK + K
Sbjct: 797 SAEAIVSKIPNDDKDVKCIIKESRIVFTPKLKKKKKKSKQK 836

BLAST of Cp4.1LG02g01210 vs. ExPASy Swiss-Prot
Match: Q9FMQ1 (Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g12100 PE=2 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 7.2e-17
Identity = 89/425 (20.94%), Postives = 174/425 (40.94%), Query Frame = 0

Query: 13  DIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILHAMDEGCEFNLV 72
           D +PS   Y K I        V   L++ N M      PS  + + ++  + +G   N  
Sbjct: 174 DFRPSKFMYGKAIQAAVKLSDVGKGLELFNRMKHDRIYPSVFIYNVLIDGLCKGKRMNDA 233

Query: 73  HQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTPTASMYNAIMAG 132
            Q++  +    L P    +  +I    K  +   ++ + +  +  +  P+   +N ++ G
Sbjct: 234 EQLFDEMLARRLLPSLITYNTLIDGYCKAGNPEKSFKVRERMKADHIEPSLITFNTLLKG 293

Query: 133 YFREKNTSDGLMVLKQMELADVKPDSKTFSYLI---SYCECEEDIIKYYEELKSSGVQAT 192
            F+     D   VLK+M+     PD+ TFS L    S  E  E  +  YE    SGV+  
Sbjct: 294 LFKAGMVEDAENVLKEMKDLGFVPDAFTFSILFDGYSSNEKAEAALGVYETAVDSGVKMN 353

Query: 193 KHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNE-IRSVLVSALASNAQLADALKIYD 252
            +    L+NA    G+ EKA++++  E       NE I + ++        L  A    +
Sbjct: 354 AYTCSILLNALCKEGKIEKAEEILGREMAKGLVPNEVIYNTMIDGYCRKGDLVGARMKIE 413

Query: 253 EMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELH-HDLDDWIDCCRRILLFSVKHN 312
            M++ G   +  A   LI  +   G +    + + ++    +   ++    ++    +  
Sbjct: 414 AMEKQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVETYNILIGGYGRKY 473

Query: 313 DLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIKNDLGLSP 372
           +     D+LK++     N  +   V++  + + + +     LE   Q+++    D G+SP
Sbjct: 474 EFDKCFDILKEME---DNGTMPNVVSYGTLINCLCKGS-KLLEA--QIVKRDMEDRGVSP 533

Query: 373 PRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQTSAKVLL 432
             R  + L+  C +    E +    KE  K G+  N ++Y  +   L  +G  + A+ LL
Sbjct: 534 KVRIYNMLIDGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLSEAEDLL 592

BLAST of Cp4.1LG02g01210 vs. ExPASy Swiss-Prot
Match: Q9SXD8 (Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX=3702 GN=At1g62590 PE=2 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 1.8e-12
Identity = 77/418 (18.42%), Postives = 171/418 (40.91%), Query Frame = 0

Query: 12  LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILHAMDEGCEFNL 71
           L+I     +Y  LI   C   ++ +AL +  +M    + PS   L S+L+    G   + 
Sbjct: 114 LEIVHGLYTYNILINCFCRRSQISLALALLGKMMKLGYEPSIVTLSSLLNGYCHGKRISD 173

Query: 72  VHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTPTASMYNAIMA 131
              +   +     +PD+  F  +I         + A  ++    +    P    Y  ++ 
Sbjct: 174 AVALVDQMVEMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVN 233

Query: 132 GYFREKNTSDGLMVLKQMELADVKPDSKTFSYLI-SYCECE--EDIIKYYEELKSSGVQA 191
           G  +  +T   L +L +ME A ++ D   F+ +I S C+    +D +  ++E+++ G++ 
Sbjct: 234 GLCKRGDTDLALNLLNKMEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRP 293

Query: 192 TKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIR-SVLVSALASNAQLADALKIY 251
               + +LI+   ++G++  A Q++SD      N N +  + L+ A     +  +A K+Y
Sbjct: 294 NVVTYSSLISCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLY 353

Query: 252 DEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRILLFSV--- 311
           D+M +   + +     SL+  +     +++  Q+   +        DC   ++ ++    
Sbjct: 354 DDMIKRSIDPDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSK-----DCFPDVVTYNTLIK 413

Query: 312 ---KHNDLSSTVDLLKQLSYR-CCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 371
              K   +    +L +++S+R    D V        +F           +   ++ + + 
Sbjct: 414 GFCKSKRVEDGTELFREMSHRGLVGDTVTYTTLIQGLF------HDGDCDNAQKVFKQMV 473

Query: 372 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASG 419
           +D G+ P       LL    N    E +L ++   +K+ +  +   Y  M + +  +G
Sbjct: 474 SD-GVPPDIMTYSILLDGLCNNGKLEKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAG 519

BLAST of Cp4.1LG02g01210 vs. ExPASy Swiss-Prot
Match: Q9SXD1 (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 70.9 bits (172), Expect = 4.5e-11
Identity = 60/248 (24.19%), Postives = 116/248 (46.77%), Query Frame = 0

Query: 12  LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILHAMDEGCEFNL 71
           L  +P+ V+   L+   C S ++  A+ + ++M    + P+T   ++++H +      N 
Sbjct: 145 LGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGL---FLHNK 204

Query: 72  VHQVYSLICR---HNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTPTASMYNA 131
             +  +LI R      +PD   +  +++   K  D + A+++L + E+    P   +YN 
Sbjct: 205 ASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNT 264

Query: 132 IMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECE----EDIIKYYEELKSS 191
           I+ G  + K+  D L + K+ME   ++P+  T+S LIS C C      D  +   ++   
Sbjct: 265 IIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLIS-CLCNYGRWSDASRLLSDMIER 324

Query: 192 GVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQLADAL 251
            +      F ALI+A+   G+  +A+++  +    VK   +   V  S+L +   + D L
Sbjct: 325 KINPDVFTFSALIDAFVKEGKLVEAEKLYDE---MVKRSIDPSIVTYSSLINGFCMHDRL 382

Query: 252 KIYDEMKQ 253
              DE KQ
Sbjct: 385 ---DEAKQ 382

BLAST of Cp4.1LG02g01210 vs. NCBI nr
Match: XP_023524928.1 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 956 bits (2471), Expect = 0.0
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 395 MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 454

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 455 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 514

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE
Sbjct: 515 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 574

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL
Sbjct: 575 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 634

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL
Sbjct: 635 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 694

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 695 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 754

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 755 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 814

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 475
           TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV
Sbjct: 815 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 869

BLAST of Cp4.1LG02g01210 vs. NCBI nr
Match: XP_023524929.1 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 956 bits (2471), Expect = 0.0
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 394 MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 453

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 454 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 513

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE
Sbjct: 514 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 573

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL
Sbjct: 574 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 633

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL
Sbjct: 634 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 693

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 694 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 753

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 754 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 813

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 475
           TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV
Sbjct: 814 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 868

BLAST of Cp4.1LG02g01210 vs. NCBI nr
Match: KAG7015217.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 927 bits (2397), Expect = 0.0
Identity = 462/475 (97.26%), Postives = 467/475 (98.32%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMH +LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 395 MVSKFKSMHLELDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 454

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 455 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 514

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE
Sbjct: 515 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 574

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQ+
Sbjct: 575 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQI 634

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISLIEHYPFD P+NR+FQLLGELHHDLD WID CRRIL
Sbjct: 635 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDWPMNRMFQLLGELHHDLDVWIDSCRRIL 694

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 695 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 754

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 755 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 814

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 475
            SAKVLLE+IPKDDAHVCYVIKECESVYVAS SV KKKGRHKKKMLKMSRN REV
Sbjct: 815 KSAKVLLERIPKDDAHVCYVIKECESVYVASFSVKKKKGRHKKKMLKMSRNEREV 869

BLAST of Cp4.1LG02g01210 vs. NCBI nr
Match: XP_022940187.1 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 924 bits (2387), Expect = 0.0
Identity = 460/475 (96.84%), Postives = 466/475 (98.11%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMH +LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 395 MVSKFKSMHLELDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 454

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 455 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 514

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE
Sbjct: 515 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 574

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQ+
Sbjct: 575 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQI 634

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISL+EHYPFD P+NR+FQLLGELHHDLD WID CRRIL
Sbjct: 635 ADALKIYDEMKQAGCNLEPKAVISLVEHYPFDWPMNRMFQLLGELHHDLDVWIDSCRRIL 694

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 695 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 754

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIW+EYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 755 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWEEYEKAGLPHNTISYLRMYQALLASGDQ 814

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 475
            SAKVLLEKIPKDDAHVCYVIKECESVYVAS SV KKKGRHKKKMLKMS N REV
Sbjct: 815 KSAKVLLEKIPKDDAHVCYVIKECESVYVASFSVKKKKGRHKKKMLKMSINEREV 869

BLAST of Cp4.1LG02g01210 vs. NCBI nr
Match: XP_022940188.1 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 924 bits (2387), Expect = 0.0
Identity = 460/475 (96.84%), Postives = 466/475 (98.11%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMH +LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 394 MVSKFKSMHLELDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 453

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 454 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 513

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE
Sbjct: 514 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 573

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQ+
Sbjct: 574 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQI 633

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISL+EHYPFD P+NR+FQLLGELHHDLD WID CRRIL
Sbjct: 634 ADALKIYDEMKQAGCNLEPKAVISLVEHYPFDWPMNRMFQLLGELHHDLDVWIDSCRRIL 693

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 694 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 753

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIW+EYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 754 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWEEYEKAGLPHNTISYLRMYQALLASGDQ 813

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 475
            SAKVLLEKIPKDDAHVCYVIKECESVYVAS SV KKKGRHKKKMLKMS N REV
Sbjct: 814 KSAKVLLEKIPKDDAHVCYVIKECESVYVASFSVKKKKGRHKKKMLKMSINEREV 868

BLAST of Cp4.1LG02g01210 vs. ExPASy TrEMBL
Match: A0A6J1FNL3 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445890 PE=4 SV=1)

HSP 1 Score: 924 bits (2387), Expect = 0.0
Identity = 460/475 (96.84%), Postives = 466/475 (98.11%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMH +LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 394 MVSKFKSMHLELDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 453

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 454 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 513

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE
Sbjct: 514 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 573

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQ+
Sbjct: 574 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQI 633

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISL+EHYPFD P+NR+FQLLGELHHDLD WID CRRIL
Sbjct: 634 ADALKIYDEMKQAGCNLEPKAVISLVEHYPFDWPMNRMFQLLGELHHDLDVWIDSCRRIL 693

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 694 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 753

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIW+EYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 754 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWEEYEKAGLPHNTISYLRMYQALLASGDQ 813

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 475
            SAKVLLEKIPKDDAHVCYVIKECESVYVAS SV KKKGRHKKKMLKMS N REV
Sbjct: 814 KSAKVLLEKIPKDDAHVCYVIKECESVYVASFSVKKKKGRHKKKMLKMSINEREV 868

BLAST of Cp4.1LG02g01210 vs. ExPASy TrEMBL
Match: A0A6J1FIW6 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445890 PE=4 SV=1)

HSP 1 Score: 924 bits (2387), Expect = 0.0
Identity = 460/475 (96.84%), Postives = 466/475 (98.11%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMH +LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 395 MVSKFKSMHLELDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 454

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 455 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 514

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE
Sbjct: 515 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 574

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQ+
Sbjct: 575 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQI 634

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISL+EHYPFD P+NR+FQLLGELHHDLD WID CRRIL
Sbjct: 635 ADALKIYDEMKQAGCNLEPKAVISLVEHYPFDWPMNRMFQLLGELHHDLDVWIDSCRRIL 694

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 695 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 754

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIW+EYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 755 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWEEYEKAGLPHNTISYLRMYQALLASGDQ 814

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSRNGREV 475
            SAKVLLEKIPKDDAHVCYVIKECESVYVAS SV KKKGRHKKKMLKMS N REV
Sbjct: 815 KSAKVLLEKIPKDDAHVCYVIKECESVYVASFSVKKKKGRHKKKMLKMSINEREV 869

BLAST of Cp4.1LG02g01210 vs. ExPASy TrEMBL
Match: A0A6J1J1Y5 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111480523 PE=4 SV=1)

HSP 1 Score: 916 bits (2368), Expect = 0.0
Identity = 454/464 (97.84%), Postives = 458/464 (98.71%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMHF+LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 394 MVSKFKSMHFELDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 453

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 454 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 513

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
            TASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLIS CECEEDIIKYYEE
Sbjct: 514 STASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISNCECEEDIIKYYEE 573

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQ+
Sbjct: 574 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQI 633

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGP+NRIFQLLGE HHDLD WIDCCRRIL
Sbjct: 634 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPMNRIFQLLGEFHHDLDHWIDCCRRIL 693

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 694 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 753

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 754 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 813

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKK 464
            SAKVLLEKIPKDDAHVCYVIKECESVYVASSSV KKKGRHKK+
Sbjct: 814 KSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVKKKKGRHKKR 857

BLAST of Cp4.1LG02g01210 vs. ExPASy TrEMBL
Match: A0A6J1ITU1 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111480523 PE=4 SV=1)

HSP 1 Score: 916 bits (2368), Expect = 0.0
Identity = 454/464 (97.84%), Postives = 458/464 (98.71%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKSMHF+LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 395 MVSKFKSMHFELDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 454

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT
Sbjct: 455 HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 514

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
            TASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLIS CECEEDIIKYYEE
Sbjct: 515 STASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISNCECEEDIIKYYEE 574

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQ+
Sbjct: 575 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQI 634

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGP+NRIFQLLGE HHDLD WIDCCRRIL
Sbjct: 635 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPMNRIFQLLGEFHHDLDHWIDCCRRIL 694

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK
Sbjct: 695 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 754

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
           NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ
Sbjct: 755 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 814

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKK 464
            SAKVLLEKIPKDDAHVCYVIKECESVYVASSSV KKKGRHKK+
Sbjct: 815 KSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVKKKKGRHKKR 858

BLAST of Cp4.1LG02g01210 vs. ExPASy TrEMBL
Match: A0A6J1CCQ2 (pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like OS=Momordica charantia OX=3673 GN=LOC111010449 PE=4 SV=1)

HSP 1 Score: 817 bits (2111), Expect = 1.30e-290
Identity = 405/470 (86.17%), Postives = 434/470 (92.34%), Query Frame = 0

Query: 1   MVSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSIL 60
           MVSKFKS+H  LDIKPSS SYEKLI YCCGS KVHMALDIANEMCDADFTPSTDVLHSIL
Sbjct: 395 MVSKFKSLHSKLDIKPSSTSYEKLIHYCCGSFKVHMALDIANEMCDADFTPSTDVLHSIL 454

Query: 61  HAMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFT 120
           HAMDEGCEFNLVHQVYSLICRHNLKPD E FR +ISL +KMKDFNGA+DMLKESEKMN T
Sbjct: 455 HAMDEGCEFNLVHQVYSLICRHNLKPDCETFRSIISLRVKMKDFNGAFDMLKESEKMNLT 514

Query: 121 PTASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEE 180
           PT+SMYNAIMAGYFREKNTS  LMVLKQMELADVKPDSKTFSYLIS CECEEDIIKYYEE
Sbjct: 515 PTSSMYNAIMAGYFREKNTSGALMVLKQMELADVKPDSKTFSYLISNCECEEDIIKYYEE 574

Query: 181 LKSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQL 240
           LKSSG+QATK IFMALINAYAAHGQFEKAKQVI DE IP+KNLNEIRSVLVSALASN Q 
Sbjct: 575 LKSSGIQATKQIFMALINAYAAHGQFEKAKQVILDEGIPMKNLNEIRSVLVSALASNGQT 634

Query: 241 ADALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRIL 300
           +DALK+YDEMK+AGC+LEPKA ISLIEHYP DGP++R+ QLL EL+  LDDWIDCCRRI+
Sbjct: 635 SDALKVYDEMKEAGCDLEPKAAISLIEHYPLDGPLSRMLQLLAELN-GLDDWIDCCRRII 694

Query: 301 LFSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 360
           LFS+KHNDLSST+DLLKQLSYRCCNDEV+M VAFDE+FS I ES+P+YLETGL LLQFIK
Sbjct: 695 LFSLKHNDLSSTLDLLKQLSYRCCNDEVIMRVAFDEVFSFITESDPTYLETGLLLLQFIK 754

Query: 361 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQ 420
            DLGLSPPRRCLDFLLGACANAKDAESS L+WKEY+KAGLP+NTIS+LRMYQALLASGDQ
Sbjct: 755 KDLGLSPPRRCLDFLLGACANAKDAESSRLVWKEYKKAGLPYNTISFLRMYQALLASGDQ 814

Query: 421 TSAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMSR 470
           TSAKVLL+KIPKDD HVC +IKECE VY ASSSV K KGR+KKKMLKMSR
Sbjct: 815 TSAKVLLDKIPKDDTHVCSIIKECEMVYGASSSVKKTKGRNKKKMLKMSR 863

BLAST of Cp4.1LG02g01210 vs. TAIR 10
Match: AT4G04790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 475.3 bits (1222), Expect = 5.6e-134
Identity = 242/468 (51.71%), Postives = 332/468 (70.94%), Query Frame = 0

Query: 2   VSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILH 61
           + KF  MH +LD+ PSS SYEKL+ Y C S +V  ALD+  +M +A    S D+LHS+LH
Sbjct: 357 ILKFNKMHEELDVMPSSTSYEKLVKYSCDSNEVVTALDVVEKMGEAGLMISADILHSLLH 416

Query: 62  AMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTP 121
           A+DE  EF+LV +++S++C  ++KP++E FR +I LC ++KDF GAY+ML   +  N  P
Sbjct: 417 AIDEVLEFDLVRRIHSIMCTKSVKPNTENFRSIIRLCTRIKDFEGAYNMLGNLKNFNLEP 476

Query: 122 TASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEEL 181
            +SM+N I+AGYFREKN S  LMV+KQM+ A VKPDS TF YLI+ C  E+ I KYYEE+
Sbjct: 477 NSSMFNCILAGYFREKNVSSALMVVKQMKEAGVKPDSITFGYLINNCTQEDAITKYYEEM 536

Query: 182 KSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQLA 241
           K +GVQATK I+M+LI+AYAA G+FEKAKQV+ D ++P  N NE++SVL+SALAS  + A
Sbjct: 537 KQAGVQATKRIYMSLIDAYAASGKFEKAKQVLVDPDVPAINQNELKSVLISALASRGKWA 596

Query: 242 DALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRILL 301
           DAL IY+EM++A C+++PK++ISLIE+    G ++ + QL  +L  D   WID   R++L
Sbjct: 597 DALHIYEEMRKAECHVDPKSIISLIEYSDSKGELSTLVQLADDLQDD-TSWIDGFFRMIL 656

Query: 302 FSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIKN 361
           F+V++   S  VDLLK+   R     + +   FDE+F  IAE+EPS +  G+ LL+F+K+
Sbjct: 657 FAVRNKKSSDIVDLLKRNKVRLLKKGIPVEAHFDEVFWAIAETEPSKVHLGMDLLRFMKD 716

Query: 362 DLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQT 421
           +LG  P R+CLDFLL AC NAKD E  LL+WKEY+ A  P N +S+LRMYQ LLA+GD  
Sbjct: 717 ELGFVPSRKCLDFLLHACVNAKDLEHGLLVWKEYQSAAFPCNVLSFLRMYQVLLAAGDSE 776

Query: 422 SAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHKKKMLKMS 470
            AK L+ KIPKDD  V ++I+E +S +  S + NKKK   KKKM+ +S
Sbjct: 777 GAKALVSKIPKDDKDVQHIIEESQSAF--SQAPNKKK--PKKKMIVLS 819

BLAST of Cp4.1LG02g01210 vs. TAIR 10
Match: AT4G21880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 469.2 bits (1206), Expect = 4.0e-132
Identity = 232/461 (50.33%), Postives = 324/461 (70.28%), Query Frame = 0

Query: 2   VSKFKSMHFDLDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILH 61
           + KF  +H +LDI PSS SYE L+ Y CGS +V  ALDI   MC+A    S ++LHS+L 
Sbjct: 377 IFKFNKLHEELDIVPSSTSYENLVSYLCGSNEVVTALDIVENMCEAGLVISANILHSLLQ 436

Query: 62  AMDEGCEFNLVHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTP 121
           A+++  EFNLV ++YS++   ++KP+SE FR+ I+LCI++KDF GAY+ML   +  N  P
Sbjct: 437 AIEQILEFNLVQRIYSIMSNKSVKPNSETFRKSINLCIRIKDFEGAYNMLGNLKNFNLAP 496

Query: 122 TASMYNAIMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECEEDIIKYYEEL 181
            +SMYN+IMAGYFREK  +  L VLK+M+ ADVKPDS TFSYLI+YC  E  I KYY+E+
Sbjct: 497 NSSMYNSIMAGYFREKKVNSALKVLKEMKEADVKPDSVTFSYLINYCGEEATIAKYYKEM 556

Query: 182 KSSGVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQLA 241
           K +GV+  KH++M+L+ AYA+ GQFEKAKQV+ D E+P K+ NE++SVL+SALASN  + 
Sbjct: 557 KQAGVEVNKHVYMSLVKAYASCGQFEKAKQVLMDLEVPAKDHNELKSVLISALASNGNIT 616

Query: 242 DALKIYDEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRILL 301
           +AL IY+EMK+  C +EPKA++SLIE+   +  +  + +L  EL  D   WID   +I++
Sbjct: 617 EALSIYEEMKKLRCPVEPKAILSLIENSDSNAELGTLVELTHEL-RDSKFWIDGFFKIIV 676

Query: 302 FSVKHNDLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIKN 361
           F+V++N  SS +DLL+Q       D+V +   F+E+F  IAE+E S ++ GL L+ F+K 
Sbjct: 677 FAVRNNRSSSILDLLEQTKNHLSKDDVGVEYWFEEVFKSIAETESSDVKVGLDLVSFMKE 736

Query: 362 DLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQT 421
           +L L P R+CLDFLL AC NAKD +S+LL+W+EY+ A LP+N I+YLRMYQ L+A+GD  
Sbjct: 737 ELELCPSRKCLDFLLHACVNAKDKQSALLVWEEYQCAELPYNVINYLRMYQVLVAAGDSK 796

Query: 422 SAKVLLEKIPKDDAHVCYVIKECESVYVASSSVNKKKGRHK 463
           SA+ ++ KIP DD  V  +IKE   V+       KKK + K
Sbjct: 797 SAEAIVSKIPNDDKDVKCIIKESRIVFTPKLKKKKKKSKQK 836

BLAST of Cp4.1LG02g01210 vs. TAIR 10
Match: AT5G12100.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 90.1 bits (222), Expect = 5.1e-18
Identity = 89/425 (20.94%), Postives = 174/425 (40.94%), Query Frame = 0

Query: 13  DIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILHAMDEGCEFNLV 72
           D +PS   Y K I        V   L++ N M      PS  + + ++  + +G   N  
Sbjct: 174 DFRPSKFMYGKAIQAAVKLSDVGKGLELFNRMKHDRIYPSVFIYNVLIDGLCKGKRMNDA 233

Query: 73  HQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTPTASMYNAIMAG 132
            Q++  +    L P    +  +I    K  +   ++ + +  +  +  P+   +N ++ G
Sbjct: 234 EQLFDEMLARRLLPSLITYNTLIDGYCKAGNPEKSFKVRERMKADHIEPSLITFNTLLKG 293

Query: 133 YFREKNTSDGLMVLKQMELADVKPDSKTFSYLI---SYCECEEDIIKYYEELKSSGVQAT 192
            F+     D   VLK+M+     PD+ TFS L    S  E  E  +  YE    SGV+  
Sbjct: 294 LFKAGMVEDAENVLKEMKDLGFVPDAFTFSILFDGYSSNEKAEAALGVYETAVDSGVKMN 353

Query: 193 KHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNE-IRSVLVSALASNAQLADALKIYD 252
            +    L+NA    G+ EKA++++  E       NE I + ++        L  A    +
Sbjct: 354 AYTCSILLNALCKEGKIEKAEEILGREMAKGLVPNEVIYNTMIDGYCRKGDLVGARMKIE 413

Query: 253 EMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELH-HDLDDWIDCCRRILLFSVKHN 312
            M++ G   +  A   LI  +   G +    + + ++    +   ++    ++    +  
Sbjct: 414 AMEKQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVETYNILIGGYGRKY 473

Query: 313 DLSSTVDLLKQLSYRCCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIKNDLGLSP 372
           +     D+LK++     N  +   V++  + + + +     LE   Q+++    D G+SP
Sbjct: 474 EFDKCFDILKEME---DNGTMPNVVSYGTLINCLCKGS-KLLEA--QIVKRDMEDRGVSP 533

Query: 373 PRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASGDQTSAKVLL 432
             R  + L+  C +    E +    KE  K G+  N ++Y  +   L  +G  + A+ LL
Sbjct: 534 KVRIYNMLIDGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLSEAEDLL 592

BLAST of Cp4.1LG02g01210 vs. TAIR 10
Match: AT1G62590.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 75.5 bits (184), Expect = 1.3e-13
Identity = 77/418 (18.42%), Postives = 171/418 (40.91%), Query Frame = 0

Query: 12  LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILHAMDEGCEFNL 71
           L+I     +Y  LI   C   ++ +AL +  +M    + PS   L S+L+    G   + 
Sbjct: 114 LEIVHGLYTYNILINCFCRRSQISLALALLGKMMKLGYEPSIVTLSSLLNGYCHGKRISD 173

Query: 72  VHQVYSLICRHNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTPTASMYNAIMA 131
              +   +     +PD+  F  +I         + A  ++    +    P    Y  ++ 
Sbjct: 174 AVALVDQMVEMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVN 233

Query: 132 GYFREKNTSDGLMVLKQMELADVKPDSKTFSYLI-SYCECE--EDIIKYYEELKSSGVQA 191
           G  +  +T   L +L +ME A ++ D   F+ +I S C+    +D +  ++E+++ G++ 
Sbjct: 234 GLCKRGDTDLALNLLNKMEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRP 293

Query: 192 TKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIR-SVLVSALASNAQLADALKIY 251
               + +LI+   ++G++  A Q++SD      N N +  + L+ A     +  +A K+Y
Sbjct: 294 NVVTYSSLISCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLY 353

Query: 252 DEMKQAGCNLEPKAVISLIEHYPFDGPVNRIFQLLGELHHDLDDWIDCCRRILLFSV--- 311
           D+M +   + +     SL+  +     +++  Q+   +        DC   ++ ++    
Sbjct: 354 DDMIKRSIDPDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSK-----DCFPDVVTYNTLIK 413

Query: 312 ---KHNDLSSTVDLLKQLSYR-CCNDEVMMGVAFDEIFSLIAESEPSYLETGLQLLQFIK 371
              K   +    +L +++S+R    D V        +F           +   ++ + + 
Sbjct: 414 GFCKSKRVEDGTELFREMSHRGLVGDTVTYTTLIQGLF------HDGDCDNAQKVFKQMV 473

Query: 372 NDLGLSPPRRCLDFLLGACANAKDAESSLLIWKEYEKAGLPHNTISYLRMYQALLASG 419
           +D G+ P       LL    N    E +L ++   +K+ +  +   Y  M + +  +G
Sbjct: 474 SD-GVPPDIMTYSILLDGLCNNGKLEKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAG 519

BLAST of Cp4.1LG02g01210 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 70.9 bits (172), Expect = 3.2e-12
Identity = 60/248 (24.19%), Postives = 116/248 (46.77%), Query Frame = 0

Query: 12  LDIKPSSVSYEKLICYCCGSLKVHMALDIANEMCDADFTPSTDVLHSILHAMDEGCEFNL 71
           L  +P+ V+   L+   C S ++  A+ + ++M    + P+T   ++++H +      N 
Sbjct: 145 LGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGL---FLHNK 204

Query: 72  VHQVYSLICR---HNLKPDSEVFRRMISLCIKMKDFNGAYDMLKESEKMNFTPTASMYNA 131
             +  +LI R      +PD   +  +++   K  D + A+++L + E+    P   +YN 
Sbjct: 205 ASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNT 264

Query: 132 IMAGYFREKNTSDGLMVLKQMELADVKPDSKTFSYLISYCECE----EDIIKYYEELKSS 191
           I+ G  + K+  D L + K+ME   ++P+  T+S LIS C C      D  +   ++   
Sbjct: 265 IIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLIS-CLCNYGRWSDASRLLSDMIER 324

Query: 192 GVQATKHIFMALINAYAAHGQFEKAKQVISDEEIPVKNLNEIRSVLVSALASNAQLADAL 251
            +      F ALI+A+   G+  +A+++  +    VK   +   V  S+L +   + D L
Sbjct: 325 KINPDVFTFSALIDAFVKEGKLVEAEKLYDE---MVKRSIDPSIVTYSSLINGFCMHDRL 382

Query: 252 KIYDEMKQ 253
              DE KQ
Sbjct: 385 ---DEAKQ 382

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6NQ817.9e-13351.71Pentatricopeptide repeat-containing protein At4g04790, mitochondrial OS=Arabidop... [more]
O497115.7e-13150.33Pentatricopeptide repeat-containing protein At4g21880, mitochondrial OS=Arabidop... [more]
Q9FMQ17.2e-1720.94Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidop... [more]
Q9SXD81.8e-1218.42Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX... [more]
Q9SXD14.5e-1124.19Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_023524928.10.0100.00pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
XP_023524929.10.0100.00pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
KAG7015217.10.097.26Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022940187.10.096.84pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
XP_022940188.10.096.84pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
Match NameE-valueIdentityDescription
A0A6J1FNL30.096.84pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
A0A6J1FIW60.096.84pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
A0A6J1J1Y50.097.84pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
A0A6J1ITU10.097.84pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isofor... [more]
A0A6J1CCQ21.30e-29086.17pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like OS=Mom... [more]
Match NameE-valueIdentityDescription
AT4G04790.15.6e-13451.71Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21880.14.0e-13250.33Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G12100.15.1e-1820.94pentatricopeptide (PPR) repeat-containing protein [more]
AT1G62590.11.3e-1318.42pentatricopeptide (PPR) repeat-containing protein [more]
AT1G62670.13.2e-1224.19rna processing factor 2 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 125..158
e-value: 9.8E-7
score: 26.6
coord: 229..256
e-value: 0.0011
score: 17.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 229..255
e-value: 0.013
score: 15.7
coord: 193..213
e-value: 0.035
score: 14.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 122..168
e-value: 5.6E-9
score: 36.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 122..156
score: 9.514466
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 87..121
score: 8.527949
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 1..65
e-value: 3.4E-5
score: 25.2
coord: 334..458
e-value: 3.7E-6
score: 28.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 173..329
e-value: 1.2E-12
score: 49.9
coord: 66..172
e-value: 3.4E-19
score: 71.4
NoneNo IPR availablePANTHERPTHR47262OS02G0132600 PROTEINcoord: 1..462

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG02g01210.1Cp4.1LG02g01210.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding