Cp4.1LG03g08470 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g08470
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG03: 2744183 .. 2752521 (-)
RNA-Seq Expression: Cp4.1LG03g08470
Synteny: Cp4.1LG03g08470
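The Location field packs the sequence ID, start, end, and strand into one string. The following is a minimal sketch of how it could be parsed and the feature span computed. It assumes 1-based, inclusive coordinates (the GFF convention), which this page does not state explicitly; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG03: 2744183 .. 2752521 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 8339 bp on the minus strand of Cp4.1LG03.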
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCCTCTCACCACCACAAATTTTCATATCCCGCCGATTTGACTGGTCGGAGCGTGGATTCCACTTTTCCTAAAACCCTCATCATTTGCAAGAATTCGAAGAACGAGTCGGCATTCGAAGAGACAAAGCAAGTTTTAGTGGACTACGACAATGGCAAGCATGAAGTTCGGACCCTCGTTAACGGGCTCAGAAAACTCGATATTCCCAGGCGGTACCAGCTTCGAGTTGGGGGCGAACGATTCCAGAAGGACTGGACGGTCACTGAGGTCGTACAGAGGATTCTGAAGCTACAACGCCATGGTGATGTCGAGGCTTTGTTGAATTGTTGGGTTGGACGGTTTGCTCGGAAGAATTATCCTGCTCTTATGAAGGTATCGCCCTTCCACACCGTCTTTTTAACAGCTTATTGTGCAATGATTAAACTCTAATTCTTCGACCGGTATGATTTTACAAGATTGATGTAAGTTAGTGAAACCGTCCTGAATAAGTTCCATTTACGTTATGAAGATTTGAATCAACCGTTCTTTCCTCTTGTGGTAGGAGTTGACTCAAAATGGATCTATTGAACACTGCGTCCAAGTATTTGATTGGATGAAGAACCAAAAGAATTATTGTGCCCGTAACGATATTTACAATATGATGATAAGGTTGCATGCCAGACATAACCGAATAGATCAAGCTCGTGGTTTGTTTTTTGAAATGCAAAAATGGAGGTAGGTCTTAGATTCCCCATGTAATGTTGAAGTATTGAATAGTGTTTCTTATGTGGTTCTGTGAAAGTATTCTTGGTGCTTTACTTGTTGTGGCGCAATTTTGAGCTCTGTTCAAATATTGTTGATTGAACAATATTTCTTATTAGATGCAAACCTGATGCCGAGACCTACAANTCTCTGTTCAGATATTGTTGATTGAACAATATTTCTTATTAGATGCAAACCTGATGCCGAGACCTACAATTCCCTAATCAATGCACATGGTAGAGCAGGCCAATGGCGTTGGGCGATGAATATAATGGAGGACATGCTACGTGCTGCTGTATGTATTTATATACTTCCCTTATGATTTTAGTGTCTCTGACTGACTGTGTGAAACCTTCTTGCCTTTGATTTGTCAAGTGTATATGTATACATATGATGCATCTGTATGCCCATAATTCCTCACCCTTCAATGTTCATCAATATGAAGAACTAGGAATACGAACTTTCAAGATCAACACCTCAGGAACTATAGAGTTTTATACTCGTACCACTATGTTCGGCTCTAGGATTTATACCGAAGTGTGAGTTACTTTTTTGTTTAATTTATTTGGACATCAAGTCCATTCGCTGTCAGCGATGGGCAACCTTTGTAGCTTATTTACATTTGTGCACATTTGGTTCCTTTAGCGGCGCTAATTTTTTTCTTCTTGAATCGATACTTTATATCCTTTGCCTGCACCTATATCCTTTGCCTGCACCTTTAGGGGGTTTTACCTTTTGGGCTTGGTTGCCTAATTTCTTAGTGAAGTTCAGCCCCTGGCTTCTTTTGGCTGATAGTCCTATCCCTCAATCTTTAAGAGGGAGGGGTGTGAGGAGAGGGGAGGACCCTTTTTGGAGAGGATCAAGGGAGTGGAAGGATTTTTTATTCTGATATTTGCAATGGGAAGCATGCTTCTGCAAATGCAAGGGAGTCAACTCTTTACATATGAATTTTTTTATTGGAAACTTAGAGGACTTGAGGGAGGGGTAAGAGAAGGTTGGTTTAGGAGCTTTTTCTCCAAAAAAAAAAATAATAAAAAAACCTTCATGTGGTATTTTGTTAGATTCCAAGTGATCTAGTTTTCTCGTGAGCTTTATTAGGAGTATTTAGGGGGAAATTCCTGGGATGGGTGGAATGACATGCATAGGAGAGGTAGATGCATAGATTGCTTTTTGATTGAATGATATAAGTCCTCTCCTTCCCAGTTTTAGCATTTTTGGGTCATATTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGCACGCCTCTGTCCCTTCTATACAAAAACCCAAACATATATTTAATATTTTACATTAAAAACCAACTCATTTGAAAGATTTTTGAAACTGTTCAGTTTCTACTGATCTTAATTCTTGTTTATTTATACATATAAAATACCTGTCAATGACAAAAATTAACAATGAAGAAAGTTATAAAAAAAATATTGTGAATATCAAAACAATGTTAATGACTTTACGTGAAAATTAAATGAATTTATGTGAAAATTAACTTTTAAATTTTAATTTATCAATATATACATTGTACTAAGAAAATTATTATTTTCCAATTGATGCAGATCCCTCCTAGTCGATCAACATTTAATAATTTGATAAATGCATGCGGATCTTCTGGAAATTGGAGAGAAGCTTTGAGAGTTTGCAAGAAAATGACAGACAATGGGGTTGGCCCTGATCTTGTGACCCACAATATTGTTCTATCTGCATATAAAAGTGGGGCTCAGTATTCAAAAGCTTTGTCATATTTTGAATTGATGAAGGGTACAAACATCCGGCCTGACACAACAACACTTAATATTGTGATTCATTGTTTGATAAAGGTCAAACAATATGGACAAGCCATTGAAATCTTTAATTCTATGCGGGAGAAGAGGGCTGAATGTCGTCCTGATGTTGTAACATTCANTGATAAAGGTCAAACAATATGGACAAGCCATTGAAATCTTTAGTTCTATGCGGGAGAAGAGGGCTGAATGTCGCCCTGACGTTGTAACATTTACAAGTATCATTCATCTTTATTCTGTTTGTGGACAGATTGAAGATTGTAAAGCTGTATTTAGTACAATGCTGGGTGAAGGAATAAAACCTACCATTGTTTCGTATAATGCTCTAATAAGTGCATATGCTTCCCATGGGATGGATAAAGAAGCTTCTACAGTTTTTGATGAGATGAAAAGGAGTGGTTTTCGCCCTGATGTTGTATCATATACATCTTTACTCAGTACATTTGGAAGATCTCAGCAACCTACAAGGGCTAGAGAAGTGTTTGATATGATGAAGAGAAACAAATGCAAGCCAAATCTTGTTAGCTACAATGCACTGATAGATGCATATGGATCTAATGGCTATTTAGCTCAAGCTGTTGACATCTTACGTGAGATGGAGCAAGATGGAATTCATCCAAATGTTGTTTCAATATGCACCCTCTTGGCTGCCTGTGGACGATTTGGTCAAAAGGTGAATGTAGATGCTGTGCTTTCTGCTGCCGAGCTACGGGGAATTCGTTTGAACACTATTGCATATAATTCAGCTATTGGTAGCTACATGAATATTGGTGAATATGAAAAGGCTGTTGATTTGTATAGATCAATGAAGAATAAGAACACTAAACCAGACTCTGTCACATATACTATCTTGATAAGTAGTTGTTGTAGGATGTCAAAATATGACGAGGCAATCCACTTTTTCAAAGAAATGGTAGATTTAAAGATTCCTTTGTCTGAAGAGATCTACTCTTCTATGATCTGTGCCTATAGTAAACAGGTTAGTACTTTAGCAAGACAGACACTTCCCTTGTTTAGACTACACATCAATTGAGATAAATCCCGGTCTTTTATTCATTGTTTAATTTTCATGCATTGCAGGGTCAACTTGTGAAAGCAGAAACTTTGTTCAATTCATTGAAGGGAAGTGGTTGTTGTCCCGATTTAGTTACATATACAGCAATGATAAATGCATATAGTACAACTGGTAGGTTTTTTCAATTCATTGATCTGATTTGCCACTATAAAAAGAGATTGTTTGATTCTGAACTTTT
GTAATTTGATTTTCATATCAGAGACATGGGAAAAAGCCTGCTCCTTATATCATGAAATGGAAACAAATAACATTCAACTAGATTCTATAGCATGCTCCGCTTTGATGAAAGCATATAATAAGGGAAATCAGGCTTCCAACGTTCTTACTCTGGCAGAAATTATGAAGGAGAAGGGAATTCCTTTCAACGATGCCAATTTCTTTGAAATGTTGTCAGCTTGTAGCACGTAAGATTTTACATGTACATATACGTTACATATTTTCTTTATACACATTTTGCTAGCATATTTGTATGGCAAGGATAGTTAATTCGCTGCCCTCGGTAGAAAATTTATTGAGCATTTGGTTTGGTTGAACTTAGACTCACCTTGATCCTTCCGTGTAGTCCTCAGNCCCTCGGTAGAAAATTTATTGAGCATTTGGTTTGGTTGAACTAGTAGACTCACCTTGATCCTTCCGTGTAGTCCTCGGAAAAGAAGAATCTTTGAGCTTTATTAAATGTGATCTGACTGGGTGGTATAGCTATGAATGTTTATTAAGTTGAATGTCTCAAGAATTCAGCATAGATGCTTGTTCTCTTCTTTAACTCTCTTAACATTTTTCTTAGGTTTTCCTACTCTACTTCCTGTGCATGTTTCAACTGTCAGTCCACTCACTTAACTTCCTAACGAATTCTTGACTCATGAACAATCCAATTAACTGGCACCAATAGGTATAACAGTTAATGTGTTGCTAATTACTCTTCCTCTGTTCTGCCTAATCTATGTAAAATGTACGGGTACCCTAGTATGCCCATTAATACTTTGCCCAAACTTTTACAAGAANTTTTTTTTTTTTTTGTGTGGAAGGTTAAATTTCTAAGAAAGTGAAGTTTCTTTTCTTTCCAAGTCTTACATGGAAGAGTGAACACCCAAGATCGCAGCAGCACTCTACCTTGGTTTTACGCCCGCAATGGTGTGTTCTTTGTAGAAAACATGGGGGATCTTGATATCACTTATGGGGTTGCCAGTTTGCTTACTCCCTTTGGAATCGGTGCTTCAGTTTGTTTGGGGTTCCTGTGGTGTGTAATAGGGACTTATGGCCTTTGTTTGAGGAAAGTGTGATGAACTTTTTGTTTCGTGTTAAGGGGAGGTCTTGTGGCAGACTTACTGTTTTGCAATCTTGTAGGGCATGTGGATTGAGAGAAATATAAGAATTCTTAGAGGGATGGAGAGGTCTTTAGACAAGGTTTTGGAGGGATGATGTTTGATAAAGGGCATTTGGTAGGCCTATTCTCTCTAACTCTCATAATTTTGACTATCTTCTTCTCAGCCTCTTGTACACACAACTAACAGCCTCTCAATTTACATCAACCAGCTATAAAAGGCAGTTAGCTACTTAACTAGTCTTCTCTCGCTATGCCTAGCATTATTTATCTATCTAGGTGACATAGTTTTCCCATCATCCTCACACATTTCTAATATGATTACCACTTATGATTACAGTTGTATATAATCATTACAATTTTTTCGAGGCATCCTTCTTTGCTTTGAATCATTTGAGCACAATGGTTTTCAGGTTTACTTATTGATGGTTTATTCTTAAATATATTAACCATATGTTATTTGTTAGGTGTACTTGAAGTGTGATTATGTGTATAAGGTTTTTTCTTTTGTGTATTAGGTCTTTCTTCTACTTAAGATAGACCCTTGTAACTATTGTAAAAGTGAGAGAATTATTATCTGACTTTTACCTTGAAAATTAGTAACTTATTGTTTTCATTCTTTCACTAAGAAACTATTTACTGAAATTTGTTTGCAGACTACGAGATTGGAGGAAAGCAACTGACATAATAAAGCTTATGGAGCCCTCTTTGCATCTTGTCTCAGTTGGAACTATTAATCATCTTCTGCATTTTCTGGGAAAAAGTGGAAAGATGGAGATTATGATGAAGGTAGTTGGAAAATATAAGTGTCTATATATTTCTTATGGGTTAATGTTGGGAATCAAGCAAAATTTTCT
ACACATTCTTTTCCTTCAAAATTTCCCCTGTTTCCTTAGCGTTTAAAGTCTGATATCTATGAACATGATTTACATATGCCAACTGTTCGTATCCTTGTAGTTCGTTGAACTCTAAGCTATTTTATCCCAAAATACAGCTGTTTTACAGATTCATGGCACTGGGATCCAGTGTCAATATTAGTACTTATTCAATCTTGTTGAAGAATCTCTTGTCTTTTGGAAATTGGAGGAAATACATTGAGGTACGCTTTATGTTTCAGTAACTTGGCATTTCAAAGCCCTTGAGCTGTTCTTACTAATATATACCTACGACTGCTGATTATAGGAGTTGTATATTACTTGGACATTAGTTGTCCTATGTTTTTTTTTCCTGATACCTCGGTGCTAATACTAAAGCAACTTTGAGCAAGCCTAGACTTTGGAGCAGTACCATATTAGTCTTTGTCTATACATGATAGTCCAAATTCTTTTGAATCTTTAGTTAAGATGCCAAGAGTCCTGATTTCTAGAGCAGGATACAACATTTGAATGTCTTATGCAAGTTTATGTTGGATGAGTCTGTCATTGATCAATCAAGTTCCTGAATCAAGGAAGAATTAGAGAGCTTTCCGTTGCTACTCATTACCACTCTCTCTCTGACTGACTATAGCAAAAACGCCCCTCCTAGCTCATTACGAATTCCATACCATGTTACTGTCTCTTTTCTTCAGTTCCTTGTTTTTAGTCTGTGGAGTTTGGTACAATTTTGGTTCGTACTTTTACCCTTTCATTAAATTTTTGCTTTCTATTAAATAACATGGCTTTTTAAGTGTTTTCATCATATTCCCAGCCAGGAATATAGAAATTTAAATAAACTTATGTACAGGTCTTGCAGTGGATGGATGATGCTGGAATCCAACCCTCTAATGCAATGTACAATGACATACTTTACTTCGCACAAAATTGTGGTGGTGCAGAGTGCGCTGCTATCATCAAGGAAAGAGTTGGTATGTTGGTTGTGTGTATTTCTGCTATTATAATTGTTATTTTAGCGAGAAACCATACTCATACATGAGGTACTAGTTGCATATCGTTTACCTAATAAAAATATCTCTGAACTTTCAAAAGTTTTAATGATACCCTTGTACTTTAAAAAAAAATCAAAAGTACATTTACCATTATTTTTAGAGAGACCATCAATATTCTATTTCAAAAATACCTTTGAACTTTCAAATCATTAATACTCTTAAAAAGTTCGAAATACCATTGAACTCTCAACAGTTTTATTATTGTTCTTTAAAACTTTTTTAATAATGTTAAAATATACCCTTAACCTTTAAGAGTTTCANAAACAAAGGATAGTTGTTGAACCTAGCCATAGTCCAAAACTCCCCAACACATCTCTACCCCACTAAAAGTCTTCTTGCTCCTCTGAAGCTAAATGCTCCACAAACTAATGAAAATGATTTCCCTCTGCAAGGCACCAACTCCTTATGAGTTAAAAACAACTTTTGGTGAACTTATGATGGAGCAGGAGGATGTTACCCTATTTAGAAATTTAGTAGAGATAAAATAAGACAAATAGATATGTAATTTATTATCTCATGCTGGTGTATAAGGTTCAACTATTGAATTATAAAGCTTGTTACTGTAATTTTCAGTCATTTCAGGTTGAGTTGAGTTGTACTCTTTGAAATGTTTGAAAGTTTCATTATATTGTTATCGTTTTTTGATCTGGTAGTGATCTCTTGCACCATAAAATCCCATGCAGAATCCCTGAAAAGATAGCATGGAGCAAACTCTGCATTCACATCCTTCCATTTTATCCACGCGAGGTGTGAATATGTGTTTGCCCTCTATGCGGATGCAGCAGTCCGCTTGAGAGGCTCTCGTTGAGAGATCCAATAAGGTGATTTAGCCACCAAAGTTTCTTTGTATGTGGCTTATATTTGTCCGCATTTATCCAAAGATGGTACAGTAACACTTCTTGTATAGCTCAAGTGGGAAGGGATTGGTGGCCC
CAGAACTTTGAATCCTCATTGAATCCTCATTGTTTGTAATGTATGTTGAATGAAAGTCCCACATCGGCTAATTTAGGGAATGATCATGAGTTTATAATCAATTGGTATGAGGCCTTTTGGAGAAGCCCAAAGCAAAGCCATGAGAGCGTATGCTCAAAGTAGACATTCATTGGTATGAGGCCTTTTGGAGAAGCCCAAAGCAAAGCCACGAGAGCGTATCCTCAATCATACCATTGTGGAAAGTTGTGTTCATCTAACAATGTAAACTTTTACAATAAAATGTTATAATTGATTGTAATGTAATCTTCTATATTTCAATAAGAAATAAGAAATAATGTT

mRNA sequence

CCCTCTCACCACCACAAATTTTCATATCCCGCCGATTTGACTGGTCGGAGCGTGGATTCCACTTTTCCTAAAACCCTCATCATTTGCAAGAATTCGAAGAACGAGTCGGCATTCGAAGAGACAAAGCAAGTTTTAGTGGACTACGACAATGGCAAGCATGAAGTTCGGACCCTCGTTAACGGGCTCAGAAAACTCGATATTCCCAGGCGGTACCAGCTTCGAGTTGGGGGCGAACGATTCCAGAAGGACTGGACGGTCACTGAGGTCGTACAGAGGATTCTGAAGCTACAACGCCATGGTGATGTCGAGGCTTTGTTGAATTGTTGGGTTGGACGGTTTGCTCGGAAGAATTATCCTGCTCTTATGAAGGAGTTGACTCAAAATGGATCTATTGAACACTGCGTCCAAGTATTTGATTGGATGAAGAACCAAAAGAATTATTGTGCCCGTAACGATATTTACAATATGATGATAAGGTTGCATGCCAGACATAACCGAATAGATCAAGCTCGTGGTTTGTTTTTTGAAATGCAAAAATGGAGATGCAAACCTGATGCCGAGACCTACAATTCCCTAATCAATGCACATGGTAGAGCAGGCCAATGGCGTTGGGCGATGAATATAATGGAGGACATGCTACGTGCTGCTATCCCTCCTAGTCGATCAACATTTAATAATTTGATAAATGCATGCGGATCTTCTGGAAATTGGAGAGAAGCTTTGAGAGTTTGCAAGAAAATGACAGACAATGGGGTTGGCCCTGATCTTGTGACCCACAATATTGTTCTATCTGCATATAAAAGTGGGGCTCAGTATTCAAAAGCTTTGTCATATTTTGAATTGATGAAGGGTACAAACATCCGGCCTGACACAACAACACTTAATATTGTGATTCATTGTTTGATAAAGGTCAAACAATATGGACAAGCCATTGAAATCTTTAGTTCTATGCGGGAGAAGAGGGCTGAATGTCGCCCTGACGTTGTAACATTTACAAGTATCATTCATCTTTATTCTGTTTGTGGACAGATTGAAGATTGTAAAGCTGTATTTAGTACAATGCTGGGTGAAGGAATAAAACCTACCATTGTTTCGTATAATGCTCTAATAAGTGCATATGCTTCCCATGGGATGGATAAAGAAGCTTCTACAGTTTTTGATGAGATGAAAAGGAGTGGTTTTCGCCCTGATGTTGTATCATATACATCTTTACTCAGTACATTTGGAAGATCTCAGCAACCTACAAGGGCTAGAGAAGTGTTTGATATGATGAAGAGAAACAAATGCAAGCCAAATCTTGTTAGCTACAATGCACTGATAGATGCATATGGATCTAATGGCTATTTAGCTCAAGCTGTTGACATCTTACGTGAGATGGAGCAAGATGGAATTCATCCAAATGTTGTTTCAATATGCACCCTCTTGGCTGCCTGTGGACGATTTGGTCAAAAGGTGAATGTAGATGCTGTGCTTTCTGCTGCCGAGCTACGGGGAATTCGTTTGAACACTATTGCATATAATTCAGCTATTGGTAGCTACATGAATATTGGTGAATATGAAAAGGCTGTTGATTTGTATAGATCAATGAAGAATAAGAACACTAAACCAGACTCTGTCACATATACTATCTTGATAAGTAGTTGTTGTAGGATGTCAAAATATGACGAGGCAATCCACTTTTTCAAAGAAATGGTAGATTTAAAGATTCCTTTGTCTGAAGAGATCTACTCTTCTATGATCTGTGCCTATAGTAAACAGGGTCAACTTGTGAAAGCAGAAACTTTGTTCAATTCATTGAAGGGAAGTGGTTGTTGTCCCGATTTAGTTACATATACAGCAATGATAAATGCATATAGTACAACTGAGACATGGGAAAAAGCCTGCTCCTTATATCATGAAATGGAAACAAATAACATTCAACTAGATTCTATAGCATGCTCCGCTTTGATGAAAGCATATAATAAGGGAAATCAGGCTTCCAACGTTCTTACTCTGGCAGA
AATTATGAAGGAGAAGGGAATTCCTTTCAACGATGCCAATTTCTTTGAAATGTTGTCAGCTTGTAGCACACTACGAGATTGGAGGAAAGCAACTGACATAATAAAGCTTATGGAGCCCTCTTTGCATCTTGTCTCAGTTGGAACTATTAATCATCTTCTGCATTTTCTGGGAAAAAGTGGAAAGATGGAGATTATGATGAAGCTGTTTTACAGATTCATGGCACTGGGATCCAGTGTCAATATTAGTACTTATTCAATCTTGTTGAAGAATCTCTTGTCTTTTGGAAATTGGAGGAAATACATTGAGGTCTTGCAGTGGATGGATGATGCTGGAATCCAACCCTCTAATGCAATGTACAATGACATACTTTACTTCGCACAAAATTGTGGTGGTGCAGAGTGCGCTGCTATCATCAAGGAAAGAGTTGAATCCCTGAAAAGATAGCATGGAGCAAACTCTGCATTCACATCCTTCCATTTTATCCACGCGAGGTGTGAATATGTGTTTGCCCTCTATGCGGATGCAGCAGTCCGCTTGAGAGGCTCTCGTTGAGAGATCCAATAAGGTGATTTAGCCACCAAAGTTTCTTTGTATGTGGCTTATATTTGTCCGCATTTATCCAAAGATGGTACAGTAACACTTCTTGTATAGCTCAAGTGGGAAGGGATTGGTGGCCCCAGAACTTTGAATCCTCATTGAATCCTCATTGTTTGTAATGTATGTTGAATGAAAGTCCCACATCGGCTAATTTAGGGAATGATCATGAGTTTATAATCAATTGGTATGAGGCCTTTTGGAGAAGCCCAAAGCAAAGCCATGAGAGCGTATGCTCAAAGTAGACATTCATTGGTATGAGGCCTTTTGGAGAAGCCCAAAGCAAAGCCACGAGAGCGTATCCTCAATCATACCATTGTGGAAAGTTGTGTTCATCTAACAATGTAAACTTTTACAATAAAATGTTATAATTGATTGTAATGTAATCTTCTATATTTCAATAAGAAATAAGAAATAATGTT

Coding sequence (CDS)

CCCTCTCACCACCACAAATTTTCATATCCCGCCGATTTGACTGGTCGGAGCGTGGATTCCACTTTTCCTAAAACCCTCATCATTTGCAAGAATTCGAAGAACGAGTCGGCATTCGAAGAGACAAAGCAAGTTTTAGTGGACTACGACAATGGCAAGCATGAAGTTCGGACCCTCGTTAACGGGCTCAGAAAACTCGATATTCCCAGGCGGTACCAGCTTCGAGTTGGGGGCGAACGATTCCAGAAGGACTGGACGGTCACTGAGGTCGTACAGAGGATTCTGAAGCTACAACGCCATGGTGATGTCGAGGCTTTGTTGAATTGTTGGGTTGGACGGTTTGCTCGGAAGAATTATCCTGCTCTTATGAAGGAGTTGACTCAAAATGGATCTATTGAACACTGCGTCCAAGTATTTGATTGGATGAAGAACCAAAAGAATTATTGTGCCCGTAACGATATTTACAATATGATGATAAGGTTGCATGCCAGACATAACCGAATAGATCAAGCTCGTGGTTTGTTTTTTGAAATGCAAAAATGGAGATGCAAACCTGATGCCGAGACCTACAATTCCCTAATCAATGCACATGGTAGAGCAGGCCAATGGCGTTGGGCGATGAATATAATGGAGGACATGCTACGTGCTGCTATCCCTCCTAGTCGATCAACATTTAATAATTTGATAAATGCATGCGGATCTTCTGGAAATTGGAGAGAAGCTTTGAGAGTTTGCAAGAAAATGACAGACAATGGGGTTGGCCCTGATCTTGTGACCCACAATATTGTTCTATCTGCATATAAAAGTGGGGCTCAGTATTCAAAAGCTTTGTCATATTTTGAATTGATGAAGGGTACAAACATCCGGCCTGACACAACAACACTTAATATTGTGATTCATTGTTTGATAAAGGTCAAACAATATGGACAAGCCATTGAAATCTTTAGTTCTATGCGGGAGAAGAGGGCTGAATGTCGCCCTGACGTTGTAACATTTACAAGTATCATTCATCTTTATTCTGTTTGTGGACAGATTGAAGATTGTAAAGCTGTATTTAGTACAATGCTGGGTGAAGGAATAAAACCTACCATTGTTTCGTATAATGCTCTAATAAGTGCATATGCTTCCCATGGGATGGATAAAGAAGCTTCTACAGTTTTTGATGAGATGAAAAGGAGTGGTTTTCGCCCTGATGTTGTATCATATACATCTTTACTCAGTACATTTGGAAGATCTCAGCAACCTACAAGGGCTAGAGAAGTGTTTGATATGATGAAGAGAAACAAATGCAAGCCAAATCTTGTTAGCTACAATGCACTGATAGATGCATATGGATCTAATGGCTATTTAGCTCAAGCTGTTGACATCTTACGTGAGATGGAGCAAGATGGAATTCATCCAAATGTTGTTTCAATATGCACCCTCTTGGCTGCCTGTGGACGATTTGGTCAAAAGGTGAATGTAGATGCTGTGCTTTCTGCTGCCGAGCTACGGGGAATTCGTTTGAACACTATTGCATATAATTCAGCTATTGGTAGCTACATGAATATTGGTGAATATGAAAAGGCTGTTGATTTGTATAGATCAATGAAGAATAAGAACACTAAACCAGACTCTGTCACATATACTATCTTGATAAGTAGTTGTTGTAGGATGTCAAAATATGACGAGGCAATCCACTTTTTCAAAGAAATGGTAGATTTAAAGATTCCTTTGTCTGAAGAGATCTACTCTTCTATGATCTGTGCCTATAGTAAACAGGGTCAACTTGTGAAAGCAGAAACTTTGTTCAATTCATTGAAGGGAAGTGGTTGTTGTCCCGATTTAGTTACATATACAGCAATGATAAATGCATATAGTACAACTGAGACATGGGAAAAAGCCTGCTCCTTATATCATGAAATGGAAACAAATAACATTCAACTAGATTCTATAGCATGCTCCGCTTTGATGAAAGCATATAATAAGGGAAATCAGGCTTCCAACGTTCTTACTCTGGCAGA
AATTATGAAGGAGAAGGGAATTCCTTTCAACGATGCCAATTTCTTTGAAATGTTGTCAGCTTGTAGCACACTACGAGATTGGAGGAAAGCAACTGACATAATAAAGCTTATGGAGCCCTCTTTGCATCTTGTCTCAGTTGGAACTATTAATCATCTTCTGCATTTTCTGGGAAAAAGTGGAAAGATGGAGATTATGATGAAGCTGTTTTACAGATTCATGGCACTGGGATCCAGTGTCAATATTAGTACTTATTCAATCTTGTTGAAGAATCTCTTGTCTTTTGGAAATTGGAGGAAATACATTGAGGTCTTGCAGTGGATGGATGATGCTGGAATCCAACCCTCTAATGCAATGTACAATGACATACTTTACTTCGCACAAAATTGTGGTGGTGCAGAGTGCGCTGCTATCATCAAGGAAAGAGTTGAATCCCTGAAAAGATAG

Protein sequence

PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVNGLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPALMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQASNVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQPSNAMYNDILYFAQNCGGAECAAIIKERVESLKR
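The protein sequence above should correspond to a frame-0 translation of the CDS. As a hedged sketch, the snippet below translates short excerpts copied from the start and end of the CDS using the standard genetic code. The helper `translate` and the table construction are illustrative, not the pipeline that produced this annotation; codons containing ambiguous bases are rendered as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # unknown/ambiguous codon -> X
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 39 nt and last 18 nt of the CDS shown on this page.
print(translate("CCCTCTCACCACCACAAATTTTCATATCCCGCCGATTTG"))  # PSHHHKFSYPADL
print(translate("GAATCCCTGAAAAGATAG"))                       # ESLKR
```

The first excerpt reproduces the opening residues of the protein (`PSHHHKFSYPADL`) and the second its final residues (`ESLKR`) followed by a `TAG` stop. Note that the CDS as listed does not begin with an `ATG` codon.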
Homology
BLAST of Cp4.1LG03g08470 vs. ExPASy Swiss-Prot
Match: Q8RWS8 (Pentatricopeptide repeat-containing protein At2g41720 OS=Arabidopsis thaliana OX=3702 GN=EMB2654 PE=2 SV=1)

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 520/782 (66.50%), Positives = 646/782 (82.61%), Query Frame = 0

Query: 32  SKNESAFEETKQVLVDYDNGKHEVRTLVNGLRKLDIPRRYQLRVGGERFQKDWTVTEVVQ 91
           +K   AF+E K V V+YD G+HEV   + GLRK DIPRRY++RV  +RFQKDW+V+EVV 
Sbjct: 24  TKASDAFQEKKSVSVNYDRGEHEVSVNIGGLRKADIPRRYRIRVENDRFQKDWSVSEVVD 83

Query: 92  RILKLQRHGDVEALLNCWVGRFARKNYPALMKELTQNGSIEHCVQVFDWMKNQKNYCARN 151
           R++ L R  +V+ +LN WVGRFARKN+P L++EL++ G IE CV VF WMK QKNYCARN
Sbjct: 84  RLMALNRWEEVDGVLNSWVGRFARKNFPVLIRELSRRGCIELCVNVFKWMKIQKNYCARN 143

Query: 152 DIYNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAGQWRWAMNIMED 211
           DIYNMMIRLHARHN +DQARGLFFEMQKW CKPDAETY++LINAHGRAGQWRWAMN+M+D
Sbjct: 144 DIYNMMIRLHARHNWVDQARGLFFEMQKWSCKPDAETYDALINAHGRAGQWRWAMNLMDD 203

Query: 212 MLRAAIPPSRSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQ 271
           MLRAAI PSRST+NNLINACGSSGNWREAL VCKKMTDNGVGPDLVTHNIVLSAYKSG Q
Sbjct: 204 MLRAAIAPSRSTYNNLINACGSSGNWREALEVCKKMTDNGVGPDLVTHNIVLSAYKSGRQ 263

Query: 272 YSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREKRAECRPDVVTF 331
           YSKALSYFELMKG  +RPDTTT NI+I+CL K+ Q  QA+++F+SMREKRAECRPDVVTF
Sbjct: 264 YSKALSYFELMKGAKVRPDTTTFNIIIYCLSKLGQSSQALDLFNSMREKRAECRPDVVTF 323

Query: 332 TSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDKEASTVFDEMKR 391
           TSI+HLYSV G+IE+C+AVF  M+ EG+KP IVSYNAL+ AYA HGM   A +V  ++K+
Sbjct: 324 TSIMHLYSVKGEIENCRAVFEAMVAEGLKPNIVSYNALMGAYAVHGMSGTALSVLGDIKQ 383

Query: 392 SGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALIDAYGSNGYLAQ 451
           +G  PDVVSYT LL+++GRS+QP +A+EVF MM++ + KPN+V+YNALIDAYGSNG+LA+
Sbjct: 384 NGIIPDVVSYTCLLNSYGRSRQPGKAKEVFLMMRKERRKPNVVTYNALIDAYGSNGFLAE 443

Query: 452 AVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIRLNTIAYNSAIG 511
           AV+I R+MEQDGI PNVVS+CTLLAAC R  +KVNVD VLSAA+ RGI LNT AYNSAIG
Sbjct: 444 AVEIFRQMEQDGIKPNVVSVCTLLAACSRSKKKVNVDTVLSAAQSRGINLNTAAYNSAIG 503

Query: 512 SYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHFFKEMVDLKIPL 571
           SY+N  E EKA+ LY+SM+ K  K DSVT+TILIS  CRMSKY EAI + KEM DL IPL
Sbjct: 504 SYINAAELEKAIALYQSMRKKKVKADSVTFTILISGSCRMSKYPEAISYLKEMEDLSIPL 563

Query: 572 SEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYSTTETWEKACSLY 631
           ++E+YSS++CAYSKQGQ+ +AE++FN +K +GC PD++ YT+M++AY+ +E W KAC L+
Sbjct: 564 TKEVYSSVLCAYSKQGQVTEAESIFNQMKMAGCEPDVIAYTSMLHAYNASEKWGKACELF 623

Query: 632 HEMETNNIQLDSIACSALMKAYNKGNQASNVLTLAEIMKEKGIPFNDANFFEMLSACSTL 691
            EME N I+ DSIACSALM+A+NKG Q SNV  L ++M+EK IPF  A FFE+ SAC+TL
Sbjct: 624 LEMEANGIEPDSIACSALMRAFNKGGQPSNVFVLMDLMREKEIPFTGAVFFEIFSACNTL 683

Query: 692 RDWRKATDIIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKLFYRFMALGSSVNISTY 751
           ++W++A D+I++M+P L  +S+G  N +LH  GKSGK+E MMKLFY+ +A G  +N+ TY
Sbjct: 684 QEWKRAIDLIQMMDPYLPSLSIGLTNQMLHLFGKSGKVEAMMKLFYKIIASGVGINLKTY 743

Query: 752 SILLKNLLSFGNWRKYIEVLQWMDDAGIQPSNAMYNDILYFAQNCGGAECAAIIKERVES 811
           +ILL++LL+ GNWRKYIEVL+WM  AGIQPSN MY DI+ F +   G E   +I++++ES
Sbjct: 744 AILLEHLLAVGNWRKYIEVLEWMSGAGIQPSNQMYRDIISFGERSAGIEFEPLIRQKLES 803

Query: 812 LK 814
           L+
Sbjct: 804 LR 805
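The Identity and Positives figures in each HSP header are simple ratios over the alignment length. The sketch below parses such a header line and recomputes the percentages; the regular expression and the function name `parse_hsp_stats` are assumptions made for illustration (the pattern accepts both the "Postives" spelling found in some exports and "Positives").

```python
import re

HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Extract identical/positive counts from an HSP header and recompute percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positives, pos_length = map(int, m.groups())
    if length != pos_length:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "identity_pct": 100.0 * identical / length,
        "positives_pct": 100.0 * positives / length,
    }

stats = parse_hsp_stats(
    "Identity = 520/782 (66.50%), Postives = 646/782 (82.61%), Query Frame = 0"
)
print(f"{stats['identity_pct']:.2f} {stats['positives_pct']:.2f}")  # 66.50 82.61
```

For the Q8RWS8 hit, 520 identical and 646 positive positions over 782 aligned columns give 66.50% and 82.61%, matching the reported values.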

BLAST of Cp4.1LG03g08470 vs. ExPASy Swiss-Prot
Match: Q9LYZ9 (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX=3702 GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 255.8 bits (652), Expect = 1.7e-66
Identity = 160/631 (25.36%), Positives = 308/631 (48.81%), Query Frame = 0

Query: 161 HARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPS 220
           H + +   +A   F + + ++   D      +I+  G+ G+   A N+   +        
Sbjct: 148 HKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLD 207

Query: 221 RSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHNIVLSAY-KSGAQYSKALSYF 280
             ++ +LI+A  +SG +REA+ V KKM ++G  P L+T+N++L+ + K G  ++K  S  
Sbjct: 208 VYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLV 267

Query: 281 ELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYS 340
           E MK   I PD  T N +I C  +   + +A ++F  M  K A    D VT+ +++ +Y 
Sbjct: 268 EKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEM--KAAGFSYDKVTYNALLDVYG 327

Query: 341 VCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVV 400
              + ++   V + M+  G  P+IV+YN+LISAYA  GM  EA  + ++M   G +PDV 
Sbjct: 328 KSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVF 387

Query: 401 SYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREM 460
           +YT+LLS F R+ +   A  +F+ M+   CKPN+ ++NA I  YG+ G   + + I  E+
Sbjct: 388 TYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEI 447

Query: 461 EQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEY 520
              G+ P++V+  TLLA  G+ G    V  V    +  G       +N+ I +Y   G +
Sbjct: 448 NVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSF 507

Query: 521 EKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSM 580
           E+A+ +YR M +    PD  TY  ++++  R   ++++     EM D +   +E  Y S+
Sbjct: 508 EQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSL 567

Query: 581 ICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNI 640
           + AY+   ++    +L   +      P  V    ++   S  +   +A   + E++    
Sbjct: 568 LHAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGF 627

Query: 641 QLDSIACSALMKAYNKGNQASNVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATD 700
             D    ++++  Y +    +    + + MKE+G   + A +  ++   S   D+ K+ +
Sbjct: 628 SPDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLMYMHSRSADFGKSEE 687

Query: 701 IIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLL 760
           I++ +        + + N +++   ++ +M    ++F      G   ++ TY+  + +  
Sbjct: 688 ILREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYA 747

Query: 761 SFGNWRKYIEVLQWMDDAGIQPSNAMYNDIL 791
           +   + + I V+++M   G +P+   YN I+
Sbjct: 748 ADSMFEEAIGVVRYMIKHGCRPNQNTYNSIV 776

BLAST of Cp4.1LG03g08470 vs. ExPASy Swiss-Prot
Match: Q9S7Q2 (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 5.6e-62
Identity = 163/598 (27.26%), Positives = 279/598 (46.66%), Query Frame = 0

Query: 75  VGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPALMKELTQNGSIEHC 134
           V  E+ +  + V  ++ ++  L   G +   L+ +  + +  ++  + KE    G  +  
Sbjct: 65  VSVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRS 124

Query: 135 VQVFDWMKNQKNYCARND-IYNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLI 194
           +++F +M+ Q  +C  N+ IY +MI L  R   +D+   +F EM          +Y +LI
Sbjct: 125 LRLFKYMQRQ-IWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALI 184

Query: 195 NAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSG-NWREALRVCKKMTDNGV 254
           NA+GR G++  ++ +++ M    I PS  T+N +INAC   G +W   L +  +M   G+
Sbjct: 185 NAYGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGI 244

Query: 255 GPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIE 314
            PD+VT+N +LSA        +A   F  M    I PD TT             Y   +E
Sbjct: 245 QPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTT-------------YSHLVE 304

Query: 315 IFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISA 374
            F  +R                  L  VC        +   M   G  P I SYN L+ A
Sbjct: 305 TFGKLR-----------------RLEKVCD-------LLGEMASGGSLPDITSYNVLLEA 364

Query: 375 YASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPN 434
           YA  G  KEA  VF +M+ +G  P+  +Y+ LL+ FG+S +    R++F  MK +   P+
Sbjct: 365 YAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSSNTDPD 424

Query: 435 LVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLS 494
             +YN LI+ +G  GY  + V +  +M ++ I P++ +   ++ ACG+ G   +   +L 
Sbjct: 425 AATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETYEGIIFACGKGGLHEDARKILQ 484

Query: 495 AAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMS 554
                 I  ++ AY   I ++     YE+A+  + +M    + P   T+  L+ S  R  
Sbjct: 485 YMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEVGSNPSIETFHSLLYSFARGG 544

Query: 555 KYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYT 614
              E+      +VD  IP + + +++ I AY + G+  +A   +  ++ S C PD  T  
Sbjct: 545 LVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEAVKTYVDMEKSRCDPDERTLE 604

Query: 615 AMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKA-YNKGNQASNVLTLAEIM 670
           A+++ YS     ++    + EM+ ++I L SI C  +M A Y K  +  +V  L E M
Sbjct: 605 AVLSVYSFARLVDECREQFEEMKASDI-LPSIMCYCMMLAVYGKTERWDDVNELLEEM 623

BLAST of Cp4.1LG03g08470 vs. ExPASy Swiss-Prot
Match: Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 228.0 bits (580), Expect = 3.8e-58
Identity = 174/739 (23.55%), Positives = 327/739 (44.25%), Query Frame = 0

Query: 85  TVTEVVQRILKLQRHGDVEALLNCWVGRFARKN---YPALMKELTQNGSIEHCVQVFDWM 144
           T++ ++  ++K +  G    L N  V    R +   Y  +++ L +   +    ++   M
Sbjct: 194 TLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHM 253

Query: 145 KNQKNYCARNDI-YNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAG 204
             +   C  N + YN++I    +  ++ +A G+  ++     KPD  TY +L+    +  
Sbjct: 254 --EATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQ 313

Query: 205 QWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHN 264
           ++   + +M++ML     PS +  ++L+      G   EAL + K++ D GV P+L  +N
Sbjct: 314 EFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYN 373

Query: 265 IVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREK 324
            ++ +   G ++ +A   F+ M    +RP+  T +I+I    +  +   A+     M + 
Sbjct: 374 ALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVD- 433

Query: 325 RAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDK 384
               +  V  + S+I+ +   G I   +   + M+ + ++PT+V+Y +L+  Y S G   
Sbjct: 434 -TGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKIN 493

Query: 385 EASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALI 444
           +A  ++ EM   G  P + ++T+LLS   R+     A ++F+ M     KPN V+YN +I
Sbjct: 494 KALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMI 553

Query: 445 DAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIR 504
           + Y   G +++A + L+EM + GI P+  S   L+      GQ       +         
Sbjct: 554 EGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCE 613

Query: 505 LNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHF 564
           LN I Y   +  +   G+ E+A+ + + M  +    D V Y +LI    +          
Sbjct: 614 LNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGL 673

Query: 565 FKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYST 624
            KEM D  +   + IY+SMI A SK G   +A  +++ +   GC P+ VTYTA+IN    
Sbjct: 674 LKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCK 733

Query: 625 TETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS-----NVLTLAEIMKEKGIP 684
                +A                + CS +    +  NQ +     ++LT  E+  +K + 
Sbjct: 734 AGFVNEA---------------EVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVE 793

Query: 685 FNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKL 744
            ++A                    I+K +     L +  T N L+    + G++E   +L
Sbjct: 794 LHNA--------------------ILKGL-----LANTATYNMLIRGFCRQGRIEEASEL 853

Query: 745 FYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQPSNAMYNDILYFAQN 804
             R +  G S +  TY+ ++  L    + +K IE+   M + GI+P    YN +++    
Sbjct: 854 ITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDRVAYNTLIHGC-- 886

Query: 805 CGGAECAAIIKERVESLKR 815
           C   E     + R E L++
Sbjct: 914 CVAGEMGKATELRNEMLRQ 886

BLAST of Cp4.1LG03g08470 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 225.7 bits (574), Expect = 1.9e-57
Identity = 148/607 (24.38%), Positives = 281/607 (46.29%), Query Frame = 0

Query: 184 PDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREALR- 243
           PD  TY  LI    RAG+       + ++++         F  L+    +     +A+  
Sbjct: 85  PDLCTYGILIGCCCRAGRLDLGFAALGNVIKKGFRVDAIAFTPLLKGLCADKRTSDAMDI 144

Query: 244 VCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELM---KGTNIRPDTTTLNIVIH 303
           V ++MT+ G  P++ ++NI+L       +  +AL    +M   +G    PD  +   VI+
Sbjct: 145 VLRRMTELGCIPNVFSYNILLKGLCDENRSQEALELLHMMADDRGGGSPPDVVSYTTVIN 204

Query: 304 CLIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGI 363
              K     +A   +  M ++     PDVVT+ SII        ++    V +TM+  G+
Sbjct: 205 GFFKEGDSDKAYSTYHEMLDR--GILPDVVTYNSIIAALCKAQAMDKAMEVLNTMVKNGV 264

Query: 364 KPTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRARE 423
            P  ++YN+++  Y S G  KEA     +M+  G  PDVV+Y+ L+    ++ +   AR+
Sbjct: 265 MPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEARK 324

Query: 424 VFDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACG 483
           +FD M +   KP + +Y  L+  Y + G L +   +L  M ++GIHP+      L+ A  
Sbjct: 325 IFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYVFSILICAYA 384

Query: 484 RFGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSV 543
           + G+      V S    +G+  N + Y + IG     G  E A+  +  M ++   P ++
Sbjct: 385 KQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPGNI 444

Query: 544 TYTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSL 603
            Y  LI   C  +K++ A     EM+D  I L+   ++S+I ++ K+G+++++E LF  +
Sbjct: 445 VYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFELM 504

Query: 604 KGSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQA 663
              G  P+++TY  +IN Y      ++A  L   M +  ++ +++  S L+  Y K ++ 
Sbjct: 505 VRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRM 564

Query: 664 SNVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHL 723
            + L L + M+  G+  +   +  +L      R    A ++   +  S   + + T N +
Sbjct: 565 EDALVLFKEMESSGVSPDIITYNIILQGLFQTRRTAAAKELYVRITESGTQIELSTYNII 624

Query: 724 LHFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGI 783
           LH L K+   +  +++F     +   +   T++I++  LL  G   +  ++       G+
Sbjct: 625 LHGLCKNKLTDDALQMFQNLCLMDLKLEARTFNIMIDALLKVGRNDEAKDLFVAFSSNGL 684

Query: 784 QPSNAMY 787
            P+   Y
Sbjct: 685 VPNYWTY 689

BLAST of Cp4.1LG03g08470 vs. NCBI nr
Match: XP_023526155.1 (pentatricopeptide repeat-containing protein At2g41720 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1651 bits (4276), Expect = 0.0
Identity = 814/814 (100.00%), Positives = 814/814 (100.00%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 814
           PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 825

BLAST of Cp4.1LG03g08470 vs. NCBI nr
Match: XP_023526156.1 (pentatricopeptide repeat-containing protein At2g41720 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1650 bits (4273), Expect = 0.0
Identity = 813/814 (99.88%), Positives = 814/814 (100.00%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 814
           PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 825

BLAST of Cp4.1LG03g08470 vs. NCBI nr
Match: XP_023526157.1 (pentatricopeptide repeat-containing protein At2g41720 isoform X3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1642 bits (4253), Expect = 0.0
Identity = 809/809 (100.00%), Positives = 809/809 (100.00%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERV 809
           PSNAMYNDILYFAQNCGGAECAAIIKERV
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERV 820

BLAST of Cp4.1LG03g08470 vs. NCBI nr
Match: XP_022934373.1 (pentatricopeptide repeat-containing protein At2g41720 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1638 bits (4241), Expect = 0.0
Identity = 807/814 (99.14%), Positives = 810/814 (99.51%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQR+GDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRYGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVF WMKNQKNYCARNDIYNMMIRLH RHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFVWMKNQKNYCARNDIYNMMIRLHTRHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           P IVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PNIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAV SAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVFSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRF+ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFVALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 814
           PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 825

BLAST of Cp4.1LG03g08470 vs. NCBI nr
Match: KAG6580916.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1632 bits (4227), Expect = 0.0
Identity = 804/814 (98.77%), Positives = 808/814 (99.26%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PSH HKFSYPADLTGRS DST P TLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSHQHKFSYPADLTGRSSDSTLPTTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHC+QVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCIQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           P IVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PNIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAV+SAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVISAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLY EMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYREMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRF+ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFVALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 814
           PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 825

BLAST of Cp4.1LG03g08470 vs. ExPASy TrEMBL
Match: A0A6J1F7H2 (pentatricopeptide repeat-containing protein At2g41720 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441562 PE=4 SV=1)

HSP 1 Score: 1638 bits (4241), Expect = 0.0
Identity = 807/814 (99.14%), Positives = 810/814 (99.51%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQR+GDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRYGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVF WMKNQKNYCARNDIYNMMIRLH RHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFVWMKNQKNYCARNDIYNMMIRLHTRHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           P IVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PNIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAV SAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVFSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRF+ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFVALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 814
           PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 825

BLAST of Cp4.1LG03g08470 vs. ExPASy TrEMBL
Match: A0A6J1J042 (pentatricopeptide repeat-containing protein At2g41720 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482245 PE=4 SV=1)

HSP 1 Score: 1629 bits (4218), Expect = 0.0
Identity = 801/814 (98.40%), Positives = 808/814 (99.26%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PS HHKFSY ADLTGRS+DST PKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSRHHKFSYTADLTGRSLDSTLPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNI+EDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNILEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFT+IIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTTIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           P IVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PNIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLA+CGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLASCGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKA+DLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAIDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           Y ILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YNILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVL LAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLILAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRF+ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFVALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 814
           PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERVESLKR 825

BLAST of Cp4.1LG03g08470 vs. ExPASy TrEMBL
Match: A0A6J1F2E1 (pentatricopeptide repeat-containing protein At2g41720 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441562 PE=4 SV=1)

HSP 1 Score: 1629 bits (4218), Expect = 0.0
Identity = 802/809 (99.13%), Positives = 805/809 (99.51%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQR+GDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRYGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVF WMKNQKNYCARNDIYNMMIRLH RHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFVWMKNQKNYCARNDIYNMMIRLHTRHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           P IVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PNIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAV SAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVFSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRF+ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFVALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERV 809
           PSNAMYNDILYFAQNCGGAECAAIIKERV
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERV 820

BLAST of Cp4.1LG03g08470 vs. ExPASy TrEMBL
Match: A0A6J1J354 (pentatricopeptide repeat-containing protein At2g41720 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482245 PE=4 SV=1)

HSP 1 Score: 1620 bits (4195), Expect = 0.0
Identity = 796/809 (98.39%), Positives = 803/809 (99.26%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PS HHKFSY ADLTGRS+DST PKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSRHHKFSYTADLTGRSLDSTLPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNI+EDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNILEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFT+IIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTTIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           P IVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PNIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLA+CGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLASCGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKA+DLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAIDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           Y ILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YNILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVL LAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLILAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRF+ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFVALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERV 809
           PSNAMYNDILYFAQNCGGAECAAIIKERV
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERV 820

BLAST of Cp4.1LG03g08470 vs. ExPASy TrEMBL
Match: A0A6J1J8H8 (pentatricopeptide repeat-containing protein At2g41720 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111482245 PE=4 SV=1)

HSP 1 Score: 1620 bits (4195), Expect = 0.0
Identity = 796/809 (98.39%), Positives = 803/809 (99.26%), Query Frame = 0

Query: 1   PSHHHKFSYPADLTGRSVDSTFPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 60
           PS HHKFSY ADLTGRS+DST PKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN
Sbjct: 12  PSRHHKFSYTADLTGRSLDSTLPKTLIICKNSKNESAFEETKQVLVDYDNGKHEVRTLVN 71

Query: 61  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 120
           GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA
Sbjct: 72  GLRKLDIPRRYQLRVGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPA 131

Query: 121 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 180
           LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW
Sbjct: 132 LMKELTQNGSIEHCVQVFDWMKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKW 191

Query: 181 RCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREA 240
           RCKPDAETYNSLINAHGRAGQWRWAMNI+EDMLRAAIPPSRSTFNNLINACGSSGNWREA
Sbjct: 192 RCKPDAETYNSLINAHGRAGQWRWAMNILEDMLRAAIPPSRSTFNNLINACGSSGNWREA 251

Query: 241 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 300
           LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC
Sbjct: 252 LRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHC 311

Query: 301 LIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIK 360
           LIKVKQYGQAIEIF+SMREKRAECRPDVVTFT+IIHLYSVCGQIEDCKAVFSTMLGEGIK
Sbjct: 312 LIKVKQYGQAIEIFNSMREKRAECRPDVVTFTTIIHLYSVCGQIEDCKAVFSTMLGEGIK 371

Query: 361 PTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 420
           P IVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV
Sbjct: 372 PNIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREV 431

Query: 421 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGR 480
           FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLA+CGR
Sbjct: 432 FDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLASCGR 491

Query: 481 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVT 540
           FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKA+DLYRSMKNKNTKPDSVT
Sbjct: 492 FGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEYEKAIDLYRSMKNKNTKPDSVT 551

Query: 541 YTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 600
           Y ILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK
Sbjct: 552 YNILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLK 611

Query: 601 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 660
           GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS
Sbjct: 612 GSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS 671

Query: 661 NVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 720
           NVL LAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL
Sbjct: 672 NVLILAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLL 731

Query: 721 HFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 780
           HFLGKSGKMEIMMKLFYRF+ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ
Sbjct: 732 HFLGKSGKMEIMMKLFYRFVALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQ 791

Query: 781 PSNAMYNDILYFAQNCGGAECAAIIKERV 809
           PSNAMYNDILYFAQNCGGAECAAIIKERV
Sbjct: 792 PSNAMYNDILYFAQNCGGAECAAIIKERV 820

BLAST of Cp4.1LG03g08470 vs. TAIR 10
Match: AT2G41720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 520/782 (66.50%), Positives = 646/782 (82.61%), Query Frame = 0

Query: 32  SKNESAFEETKQVLVDYDNGKHEVRTLVNGLRKLDIPRRYQLRVGGERFQKDWTVTEVVQ 91
           +K   AF+E K V V+YD G+HEV   + GLRK DIPRRY++RV  +RFQKDW+V+EVV 
Sbjct: 24  TKASDAFQEKKSVSVNYDRGEHEVSVNIGGLRKADIPRRYRIRVENDRFQKDWSVSEVVD 83

Query: 92  RILKLQRHGDVEALLNCWVGRFARKNYPALMKELTQNGSIEHCVQVFDWMKNQKNYCARN 151
           R++ L R  +V+ +LN WVGRFARKN+P L++EL++ G IE CV VF WMK QKNYCARN
Sbjct: 84  RLMALNRWEEVDGVLNSWVGRFARKNFPVLIRELSRRGCIELCVNVFKWMKIQKNYCARN 143

Query: 152 DIYNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAGQWRWAMNIMED 211
           DIYNMMIRLHARHN +DQARGLFFEMQKW CKPDAETY++LINAHGRAGQWRWAMN+M+D
Sbjct: 144 DIYNMMIRLHARHNWVDQARGLFFEMQKWSCKPDAETYDALINAHGRAGQWRWAMNLMDD 203

Query: 212 MLRAAIPPSRSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHNIVLSAYKSGAQ 271
           MLRAAI PSRST+NNLINACGSSGNWREAL VCKKMTDNGVGPDLVTHNIVLSAYKSG Q
Sbjct: 204 MLRAAIAPSRSTYNNLINACGSSGNWREALEVCKKMTDNGVGPDLVTHNIVLSAYKSGRQ 263

Query: 272 YSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREKRAECRPDVVTF 331
           YSKALSYFELMKG  +RPDTTT NI+I+CL K+ Q  QA+++F+SMREKRAECRPDVVTF
Sbjct: 264 YSKALSYFELMKGAKVRPDTTTFNIIIYCLSKLGQSSQALDLFNSMREKRAECRPDVVTF 323

Query: 332 TSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDKEASTVFDEMKR 391
           TSI+HLYSV G+IE+C+AVF  M+ EG+KP IVSYNAL+ AYA HGM   A +V  ++K+
Sbjct: 324 TSIMHLYSVKGEIENCRAVFEAMVAEGLKPNIVSYNALMGAYAVHGMSGTALSVLGDIKQ 383

Query: 392 SGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALIDAYGSNGYLAQ 451
           +G  PDVVSYT LL+++GRS+QP +A+EVF MM++ + KPN+V+YNALIDAYGSNG+LA+
Sbjct: 384 NGIIPDVVSYTCLLNSYGRSRQPGKAKEVFLMMRKERRKPNVVTYNALIDAYGSNGFLAE 443

Query: 452 AVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIRLNTIAYNSAIG 511
           AV+I R+MEQDGI PNVVS+CTLLAAC R  +KVNVD VLSAA+ RGI LNT AYNSAIG
Sbjct: 444 AVEIFRQMEQDGIKPNVVSVCTLLAACSRSKKKVNVDTVLSAAQSRGINLNTAAYNSAIG 503

Query: 512 SYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHFFKEMVDLKIPL 571
           SY+N  E EKA+ LY+SM+ K  K DSVT+TILIS  CRMSKY EAI + KEM DL IPL
Sbjct: 504 SYINAAELEKAIALYQSMRKKKVKADSVTFTILISGSCRMSKYPEAISYLKEMEDLSIPL 563

Query: 572 SEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYSTTETWEKACSLY 631
           ++E+YSS++CAYSKQGQ+ +AE++FN +K +GC PD++ YT+M++AY+ +E W KAC L+
Sbjct: 564 TKEVYSSVLCAYSKQGQVTEAESIFNQMKMAGCEPDVIAYTSMLHAYNASEKWGKACELF 623

Query: 632 HEMETNNIQLDSIACSALMKAYNKGNQASNVLTLAEIMKEKGIPFNDANFFEMLSACSTL 691
            EME N I+ DSIACSALM+A+NKG Q SNV  L ++M+EK IPF  A FFE+ SAC+TL
Sbjct: 624 LEMEANGIEPDSIACSALMRAFNKGGQPSNVFVLMDLMREKEIPFTGAVFFEIFSACNTL 683

Query: 692 RDWRKATDIIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKLFYRFMALGSSVNISTY 751
           ++W++A D+I++M+P L  +S+G  N +LH  GKSGK+E MMKLFY+ +A G  +N+ TY
Sbjct: 684 QEWKRAIDLIQMMDPYLPSLSIGLTNQMLHLFGKSGKVEAMMKLFYKIIASGVGINLKTY 743

Query: 752 SILLKNLLSFGNWRKYIEVLQWMDDAGIQPSNAMYNDILYFAQNCGGAECAAIIKERVES 811
           +ILL++LL+ GNWRKYIEVL+WM  AGIQPSN MY DI+ F +   G E   +I++++ES
Sbjct: 744 AILLEHLLAVGNWRKYIEVLEWMSGAGIQPSNQMYRDIISFGERSAGIEFEPLIRQKLES 803

Query: 812 LK 814
           L+
Sbjct: 804 LR 805
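
The Identity and Positives percentages in an HSP header are simply the match counts divided by the aligned length. The following is a minimal Python sketch, not part of the database output, that re-derives them from the header of the AT2G41720.1 hit above. The names `HEADER`, `parse_hsp_stats`, and `recomputed_percent` are hypothetical and chosen only for illustration.

```python
import re

# Header line copied from the AT2G41720.1 hit above.
HEADER = "Identity = 520/782 (66.50%), Positives = 646/782 (82.61%), Query Frame = 0"

# "Posi?tives" also accepts the "Postives" spelling that some report exports use.
PATTERN = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    stats = {}
    for label, matches, length, percent in PATTERN.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        stats[key] = (int(matches), int(length), float(percent))
    return stats


def recomputed_percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


stats = parse_hsp_stats(HEADER)
for label, (matches, length, reported) in stats.items():
    # Print the recomputed value next to the value reported in the header.
    print(label, recomputed_percent(matches, length), reported)
```

Both recomputed values should agree with the reported ones to two decimal places; a mismatch would suggest the header line was damaged during extraction.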

BLAST of Cp4.1LG03g08470 vs. TAIR 10
Match: AT2G41720.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 963.4 bits (2489), Expect = 1.2e-280
Identity = 455/673 (67.61%), Positives = 562/673 (83.51%), Query Frame = 0

Query: 141 MKNQKNYCARNDIYNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAG 200
           MK QKNYCARNDIYNMMIRLHARHN +DQARGLFFEMQKW CKPDAETY++LINAHGRAG
Sbjct: 1   MKIQKNYCARNDIYNMMIRLHARHNWVDQARGLFFEMQKWSCKPDAETYDALINAHGRAG 60

Query: 201 QWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHN 260
           QWRWAMN+M+DMLRAAI PSRST+NNLINACGSSGNWREAL VCKKMTDNGVGPDLVTHN
Sbjct: 61  QWRWAMNLMDDMLRAAIAPSRSTYNNLINACGSSGNWREALEVCKKMTDNGVGPDLVTHN 120

Query: 261 IVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREK 320
           IVLSAYKSG QYSKALSYFELMKG  +RPDTTT NI+I+CL K+ Q  QA+++F+SMREK
Sbjct: 121 IVLSAYKSGRQYSKALSYFELMKGAKVRPDTTTFNIIIYCLSKLGQSSQALDLFNSMREK 180

Query: 321 RAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDK 380
           RAECRPDVVTFTSI+HLYSV G+IE+C+AVF  M+ EG+KP IVSYNAL+ AYA HGM  
Sbjct: 181 RAECRPDVVTFTSIMHLYSVKGEIENCRAVFEAMVAEGLKPNIVSYNALMGAYAVHGMSG 240

Query: 381 EASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALI 440
            A +V  ++K++G  PDVVSYT LL+++GRS+QP +A+EVF MM++ + KPN+V+YNALI
Sbjct: 241 TALSVLGDIKQNGIIPDVVSYTCLLNSYGRSRQPGKAKEVFLMMRKERRKPNVVTYNALI 300

Query: 441 DAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIR 500
           DAYGSNG+LA+AV+I R+MEQDGI PNVVS+CTLLAAC R  +KVNVD VLSAA+ RGI 
Sbjct: 301 DAYGSNGFLAEAVEIFRQMEQDGIKPNVVSVCTLLAACSRSKKKVNVDTVLSAAQSRGIN 360

Query: 501 LNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHF 560
           LNT AYNSAIGSY+N  E EKA+ LY+SM+ K  K DSVT+TILIS  CRMSKY EAI +
Sbjct: 361 LNTAAYNSAIGSYINAAELEKAIALYQSMRKKKVKADSVTFTILISGSCRMSKYPEAISY 420

Query: 561 FKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYST 620
            KEM DL IPL++E+YSS++CAYSKQGQ+ +AE++FN +K +GC PD++ YT+M++AY+ 
Sbjct: 421 LKEMEDLSIPLTKEVYSSVLCAYSKQGQVTEAESIFNQMKMAGCEPDVIAYTSMLHAYNA 480

Query: 621 TETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQASNVLTLAEIMKEKGIPFNDAN 680
           +E W KAC L+ EME N I+ DSIACSALM+A+NKG Q SNV  L ++M+EK IPF  A 
Sbjct: 481 SEKWGKACELFLEMEANGIEPDSIACSALMRAFNKGGQPSNVFVLMDLMREKEIPFTGAV 540

Query: 681 FFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKLFYRFM 740
           FFE+ SAC+TL++W++A D+I++M+P L  +S+G  N +LH  GKSGK+E MMKLFY+ +
Sbjct: 541 FFEIFSACNTLQEWKRAIDLIQMMDPYLPSLSIGLTNQMLHLFGKSGKVEAMMKLFYKII 600

Query: 741 ALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQPSNAMYNDILYFAQNCGGAE 800
           A G  +N+ TY+ILL++LL+ GNWRKYIEVL+WM  AGIQPSN MY DI+ F +   G E
Sbjct: 601 ASGVGINLKTYAILLEHLLAVGNWRKYIEVLEWMSGAGIQPSNQMYRDIISFGERSAGIE 660

Query: 801 CAAIIKERVESLK 814
              +I++++  ++
Sbjct: 661 FEPLIRQKLGEMR 673

BLAST of Cp4.1LG03g08470 vs. TAIR 10
Match: AT5G02860.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 255.8 bits (652), Expect = 1.2e-67
Identity = 160/631 (25.36%), Positives = 308/631 (48.81%), Query Frame = 0

Query: 161 HARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAGQWRWAMNIMEDMLRAAIPPS 220
           H + +   +A   F + + ++   D      +I+  G+ G+   A N+   +        
Sbjct: 148 HKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLD 207

Query: 221 RSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHNIVLSAY-KSGAQYSKALSYF 280
             ++ +LI+A  +SG +REA+ V KKM ++G  P L+T+N++L+ + K G  ++K  S  
Sbjct: 208 VYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLV 267

Query: 281 ELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREKRAECRPDVVTFTSIIHLYS 340
           E MK   I PD  T N +I C  +   + +A ++F  M  K A    D VT+ +++ +Y 
Sbjct: 268 EKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEM--KAAGFSYDKVTYNALLDVYG 327

Query: 341 VCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDKEASTVFDEMKRSGFRPDVV 400
              + ++   V + M+  G  P+IV+YN+LISAYA  GM  EA  + ++M   G +PDV 
Sbjct: 328 KSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVF 387

Query: 401 SYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALIDAYGSNGYLAQAVDILREM 460
           +YT+LLS F R+ +   A  +F+ M+   CKPN+ ++NA I  YG+ G   + + I  E+
Sbjct: 388 TYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEI 447

Query: 461 EQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIRLNTIAYNSAIGSYMNIGEY 520
              G+ P++V+  TLLA  G+ G    V  V    +  G       +N+ I +Y   G +
Sbjct: 448 NVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSF 507

Query: 521 EKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHFFKEMVDLKIPLSEEIYSSM 580
           E+A+ +YR M +    PD  TY  ++++  R   ++++     EM D +   +E  Y S+
Sbjct: 508 EQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSL 567

Query: 581 ICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYSTTETWEKACSLYHEMETNNI 640
           + AY+   ++    +L   +      P  V    ++   S  +   +A   + E++    
Sbjct: 568 LHAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGF 627

Query: 641 QLDSIACSALMKAYNKGNQASNVLTLAEIMKEKGIPFNDANFFEMLSACSTLRDWRKATD 700
             D    ++++  Y +    +    + + MKE+G   + A +  ++   S   D+ K+ +
Sbjct: 628 SPDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLMYMHSRSADFGKSEE 687

Query: 701 IIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKLFYRFMALGSSVNISTYSILLKNLL 760
           I++ +        + + N +++   ++ +M    ++F      G   ++ TY+  + +  
Sbjct: 688 ILREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYA 747

Query: 761 SFGNWRKYIEVLQWMDDAGIQPSNAMYNDIL 791
           +   + + I V+++M   G +P+   YN I+
Sbjct: 748 ADSMFEEAIGVVRYMIKHGCRPNQNTYNSIV 776

BLAST of Cp4.1LG03g08470 vs. TAIR 10
Match: AT1G74850.1 (plastid transcriptionally active 2)

HSP 1 Score: 240.7 bits (613), Expect = 4.0e-63
Identity = 163/598 (27.26%), Positives = 279/598 (46.66%), Query Frame = 0

Query: 75  VGGERFQKDWTVTEVVQRILKLQRHGDVEALLNCWVGRFARKNYPALMKELTQNGSIEHC 134
           V  E+ +  + V  ++ ++  L   G +   L+ +  + +  ++  + KE    G  +  
Sbjct: 65  VSVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRS 124

Query: 135 VQVFDWMKNQKNYCARND-IYNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLI 194
           +++F +M+ Q  +C  N+ IY +MI L  R   +D+   +F EM          +Y +LI
Sbjct: 125 LRLFKYMQRQ-IWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALI 184

Query: 195 NAHGRAGQWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSG-NWREALRVCKKMTDNGV 254
           NA+GR G++  ++ +++ M    I PS  T+N +INAC   G +W   L +  +M   G+
Sbjct: 185 NAYGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGI 244

Query: 255 GPDLVTHNIVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIE 314
            PD+VT+N +LSA        +A   F  M    I PD TT             Y   +E
Sbjct: 245 QPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTT-------------YSHLVE 304

Query: 315 IFSSMREKRAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISA 374
            F  +R                  L  VC        +   M   G  P I SYN L+ A
Sbjct: 305 TFGKLR-----------------RLEKVCD-------LLGEMASGGSLPDITSYNVLLEA 364

Query: 375 YASHGMDKEASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPN 434
           YA  G  KEA  VF +M+ +G  P+  +Y+ LL+ FG+S +    R++F  MK +   P+
Sbjct: 365 YAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSSNTDPD 424

Query: 435 LVSYNALIDAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLS 494
             +YN LI+ +G  GY  + V +  +M ++ I P++ +   ++ ACG+ G   +   +L 
Sbjct: 425 AATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETYEGIIFACGKGGLHEDARKILQ 484

Query: 495 AAELRGIRLNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMS 554
                 I  ++ AY   I ++     YE+A+  + +M    + P   T+  L+ S  R  
Sbjct: 485 YMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEVGSNPSIETFHSLLYSFARGG 544

Query: 555 KYDEAIHFFKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYT 614
              E+      +VD  IP + + +++ I AY + G+  +A   +  ++ S C PD  T  
Sbjct: 545 LVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEAVKTYVDMEKSRCDPDERTLE 604

Query: 615 AMINAYSTTETWEKACSLYHEMETNNIQLDSIACSALMKA-YNKGNQASNVLTLAEIM 670
           A+++ YS     ++    + EM+ ++I L SI C  +M A Y K  +  +V  L E M
Sbjct: 605 AVLSVYSFARLVDECREQFEEMKASDI-LPSIMCYCMMLAVYGKTERWDDVNELLEEM 623

BLAST of Cp4.1LG03g08470 vs. TAIR 10
Match: AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 228.0 bits (580), Expect = 2.7e-59
Identity = 174/739 (23.55%), Positives = 327/739 (44.25%), Query Frame = 0

Query: 85  TVTEVVQRILKLQRHGDVEALLNCWVGRFARKN---YPALMKELTQNGSIEHCVQVFDWM 144
           T++ ++  ++K +  G    L N  V    R +   Y  +++ L +   +    ++   M
Sbjct: 194 TLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHM 253

Query: 145 KNQKNYCARNDI-YNMMIRLHARHNRIDQARGLFFEMQKWRCKPDAETYNSLINAHGRAG 204
             +   C  N + YN++I    +  ++ +A G+  ++     KPD  TY +L+    +  
Sbjct: 254 --EATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQ 313

Query: 205 QWRWAMNIMEDMLRAAIPPSRSTFNNLINACGSSGNWREALRVCKKMTDNGVGPDLVTHN 264
           ++   + +M++ML     PS +  ++L+      G   EAL + K++ D GV P+L  +N
Sbjct: 314 EFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYN 373

Query: 265 IVLSAYKSGAQYSKALSYFELMKGTNIRPDTTTLNIVIHCLIKVKQYGQAIEIFSSMREK 324
            ++ +   G ++ +A   F+ M    +RP+  T +I+I    +  +   A+     M + 
Sbjct: 374 ALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVD- 433

Query: 325 RAECRPDVVTFTSIIHLYSVCGQIEDCKAVFSTMLGEGIKPTIVSYNALISAYASHGMDK 384
               +  V  + S+I+ +   G I   +   + M+ + ++PT+V+Y +L+  Y S G   
Sbjct: 434 -TGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKIN 493

Query: 385 EASTVFDEMKRSGFRPDVVSYTSLLSTFGRSQQPTRAREVFDMMKRNKCKPNLVSYNALI 444
           +A  ++ EM   G  P + ++T+LLS   R+     A ++F+ M     KPN V+YN +I
Sbjct: 494 KALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMI 553

Query: 445 DAYGSNGYLAQAVDILREMEQDGIHPNVVSICTLLAACGRFGQKVNVDAVLSAAELRGIR 504
           + Y   G +++A + L+EM + GI P+  S   L+      GQ       +         
Sbjct: 554 EGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCE 613

Query: 505 LNTIAYNSAIGSYMNIGEYEKAVDLYRSMKNKNTKPDSVTYTILISSCCRMSKYDEAIHF 564
           LN I Y   +  +   G+ E+A+ + + M  +    D V Y +LI    +          
Sbjct: 614 LNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGL 673

Query: 565 FKEMVDLKIPLSEEIYSSMICAYSKQGQLVKAETLFNSLKGSGCCPDLVTYTAMINAYST 624
            KEM D  +   + IY+SMI A SK G   +A  +++ +   GC P+ VTYTA+IN    
Sbjct: 674 LKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCK 733

Query: 625 TETWEKACSLYHEMETNNIQLDSIACSALMKAYNKGNQAS-----NVLTLAEIMKEKGIP 684
                +A                + CS +    +  NQ +     ++LT  E+  +K + 
Sbjct: 734 AGFVNEA---------------EVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVE 793

Query: 685 FNDANFFEMLSACSTLRDWRKATDIIKLMEPSLHLVSVGTINHLLHFLGKSGKMEIMMKL 744
            ++A                    I+K +     L +  T N L+    + G++E   +L
Sbjct: 794 LHNA--------------------ILKGL-----LANTATYNMLIRGFCRQGRIEEASEL 853

Query: 745 FYRFMALGSSVNISTYSILLKNLLSFGNWRKYIEVLQWMDDAGIQPSNAMYNDILYFAQN 804
             R +  G S +  TY+ ++  L    + +K IE+   M + GI+P    YN +++    
Sbjct: 854 ITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDRVAYNTLIHGC-- 886

Query: 805 CGGAECAAIIKERVESLKR 815
           C   E     + R E L++
Sbjct: 914 CVAGEMGKATELRNEMLRQ 886

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8RWS8 | 0.0e+00 | 66.50 | Pentatricopeptide repeat-containing protein At2g41720 OS=Arabidopsis thaliana OX...
Q9LYZ9 | 1.7e-66 | 25.36 | Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX...
Q9S7Q2 | 5.6e-62 | 27.26 | Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop...
Q9FJE6 | 3.8e-58 | 23.55 | Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
Q76C99 | 1.9e-57 | 24.38 | Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...

Match Name | E-value | Identity | Description
XP_023526155.1 | 0.0 | 100.00 | pentatricopeptide repeat-containing protein At2g41720 isoform X1 [Cucurbita pepo...
XP_023526156.1 | 0.0 | 99.88 | pentatricopeptide repeat-containing protein At2g41720 isoform X2 [Cucurbita pepo...
XP_023526157.1 | 0.0 | 100.00 | pentatricopeptide repeat-containing protein At2g41720 isoform X3 [Cucurbita pepo...
XP_022934373.1 | 0.0 | 99.14 | pentatricopeptide repeat-containing protein At2g41720 isoform X1 [Cucurbita mosc...
KAG6580916.1 | 0.0 | 98.77 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

Match Name | E-value | Identity | Description
A0A6J1F7H2 | 0.0 | 99.14 | pentatricopeptide repeat-containing protein At2g41720 isoform X1 OS=Cucurbita mo...
A0A6J1J042 | 0.0 | 98.40 | pentatricopeptide repeat-containing protein At2g41720 isoform X2 OS=Cucurbita ma...
A0A6J1F2E1 | 0.0 | 99.13 | pentatricopeptide repeat-containing protein At2g41720 isoform X2 OS=Cucurbita mo...
A0A6J1J354 | 0.0 | 98.39 | pentatricopeptide repeat-containing protein At2g41720 isoform X1 OS=Cucurbita ma...
A0A6J1J8H8 | 0.0 | 98.39 | pentatricopeptide repeat-containing protein At2g41720 isoform X3 OS=Cucurbita ma...

Match Name | E-value | Identity | Description
AT2G41720.1 | 0.0e+00 | 66.50 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G41720.2 | 1.2e-280 | 67.61 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G02860.1 | 1.2e-67 | 25.36 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G74850.1 | 4.0e-63 | 27.26 | plastid transcriptionally active 2
AT5G59900.1 | 2.7e-59 | 23.55 | Pentatricopeptide repeat (PPR) superfamily protein
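
The TAIR 10 hit list above mixes two near-full-length matches with several distant PPR-family hits. The following Python sketch shows one way to separate them by E-value and percent identity. The cut-offs (`max_evalue=1e-100`, `min_identity=50.0`) and the names `TAIR_HITS` and `filter_hits` are assumptions made for illustration; they are not thresholds used by the database.

```python
# Rows copied from the TAIR 10 hit list above: (match name, E-value, identity %).
TAIR_HITS = [
    ("AT2G41720.1", "0.0e+00", 66.50),
    ("AT2G41720.2", "1.2e-280", 67.61),
    ("AT5G02860.1", "1.2e-67", 25.36),
    ("AT1G74850.1", "4.0e-63", 27.26),
    ("AT5G59900.1", "2.7e-59", 23.55),
]


def filter_hits(hits, max_evalue=1e-100, min_identity=50.0):
    """Keep hits at or below the E-value cut-off and at or above the identity cut-off.

    Both cut-offs are illustrative assumptions, not database settings.
    """
    kept = []
    for name, evalue_text, identity in hits:
        evalue = float(evalue_text)  # "0.0e+00" parses to 0.0
        if evalue <= max_evalue and identity >= min_identity:
            kept.append(name)
    return kept


close_homologs = filter_hits(TAIR_HITS)
print(close_homologs)
```

With these assumed cut-offs only the two AT2G41720 isoforms remain, which matches the description of the closest Arabidopsis homolog in the tables.
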
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 575..604, e-value: 6.9E-6, score: 26.0
    coord: 293..320, e-value: 0.035, score: 14.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 505..537, e-value: 1.7E-6, score: 25.8
    coord: 609..642, e-value: 3.7E-6, score: 24.8
    coord: 293..320, e-value: 1.7E-4, score: 19.5
    coord: 434..468, e-value: 3.1E-8, score: 31.3
    coord: 364..398, e-value: 1.2E-9, score: 35.8
    coord: 399..432, e-value: 1.1E-7, score: 29.5
    coord: 329..362, e-value: 4.6E-5, score: 21.3
    coord: 539..572, e-value: 1.3E-8, score: 32.5
    coord: 153..186, e-value: 4.1E-8, score: 30.9
    coord: 575..607, e-value: 1.8E-8, score: 32.1
    coord: 223..255, e-value: 1.4E-7, score: 29.2
    coord: 188..220, e-value: 3.0E-6, score: 25.0
    coord: 257..291, e-value: 6.7E-5, score: 20.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 223..266, e-value: 6.8E-9, score: 35.8
    coord: 606..653, e-value: 1.5E-10, score: 41.1
    coord: 502..550, e-value: 7.3E-16, score: 58.1
    coord: 153..195, e-value: 1.2E-11, score: 44.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 744..790, e-value: 0.0033, score: 17.4
    coord: 449..479, e-value: 0.0061, score: 16.6
    coord: 385..446, e-value: 7.8E-19, score: 67.5
    coord: 321..373, e-value: 1.9E-11, score: 43.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 572..606, score: 10.89559
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 397..431, score: 11.772493
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 185..219, score: 11.99172
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 607..641, score: 10.818861
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 362..396, score: 13.164578
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 220..254, score: 11.8273
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 537..571, score: 12.221907
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 255..289, score: 9.755614
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 290..324, score: 9.065053
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 502..536, score: 11.081932
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 327..361, score: 11.158661
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 747..781, score: 9.755614
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 150..184, score: 10.215989
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 432..466, score: 12.660359
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 245..319, e-value: 8.9E-14, score: 53.4
    coord: 462..568, e-value: 9.4E-24, score: 86.0
    coord: 391..461, e-value: 4.7E-21, score: 77.2
    coord: 320..390, e-value: 2.6E-18, score: 68.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 569..660, e-value: 5.5E-23, score: 83.2
    coord: 86..233, e-value: 1.5E-34, score: 121.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 661..812, e-value: 6.2E-18, score: 67.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 180..354
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 494..634
None | No IPR available | PANTHER | PTHR47938:SF5 | OS07G0213300 PROTEIN | coord: 24..790
None | No IPR available | PANTHER | PTHR47938 | RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED | coord: 24..790
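
The PROSITE PS51375 coordinates listed above can be sorted and grouped to show how the PPR motifs are arranged along the protein. The Python sketch below is illustrative only: `PPR_COORDS`, `repeat_lengths`, and `tandem_arrays` are hypothetical names, and treating repeats as one array only when they are directly adjacent (`max_gap=0`) is an assumption, not an InterPro rule.

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro annotations above.
PPR_COORDS = [
    (572, 606), (397, 431), (185, 219), (607, 641), (362, 396),
    (220, 254), (537, 571), (255, 289), (290, 324), (502, 536),
    (327, 361), (747, 781), (150, 184), (432, 466),
]


def repeat_lengths(coords):
    """Length of each repeat; coordinates are 1-based and inclusive."""
    return [end - start + 1 for start, end in coords]


def tandem_arrays(coords, max_gap=0):
    """Group sorted repeats into arrays.

    A repeat joins the current array when at most max_gap residues separate
    its start from the previous repeat's end (max_gap=0 is an assumption).
    """
    arrays = []
    for start, end in sorted(coords):
        if arrays and start - arrays[-1][-1][1] - 1 <= max_gap:
            arrays[-1].append((start, end))
        else:
            arrays.append([(start, end)])
    return arrays


arrays = tandem_arrays(PPR_COORDS)
for array in arrays:
    # Report the span of each array and the number of repeats it contains.
    print(f"{array[0][0]}..{array[-1][1]}: {len(array)} repeat(s)")
```

Raising `max_gap` merges arrays separated by short linkers, so the number of groups depends on that assumed parameter rather than on the annotation itself.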

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG03g08470.1 | Cp4.1LG03g08470.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010239 | chloroplast mRNA processing
biological_process | GO:0009793 | embryo development ending in seed dormancy
biological_process | GO:0008380 | RNA splicing
cellular_component | GO:0009507 | chloroplast
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003735 | structural constituent of ribosome