Cmc04g0104111 (gene) Melon (Charmono) v1.1

Overview
NameCmc04g0104111
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Descriptionpentatricopeptide repeat-containing protein At2g15820, chloroplastic
LocationCMiso1.1chr04: 21510270 .. 21518914 (-)
RNA-Seq ExpressionCmc04g0104111
SyntenyCmc04g0104111
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCGCACCCATAGAAGAACCCTAAGCCCAATTTAAAAATAAAAAGTAAAATTAGTTGGAAAGCGCTAAAACTTCTCCCTTTCCCTCAAGCCTAGTCACGCCTCACGCCTCCTCCCTCTGCAGACACTTCTCCTCCCGACGCCGTCCGCCGCAACAATTCAGGTATTATTTCACTTTTAATCTTCTCATTTTCCAAACCTATTGCAAACTCTTTTCTTTTTAATTCTGAATCTGGCTAATGGGGTTACCTTTGTGCTGTTTGGATTTTTGACACACCAAACGAATTAGAAGATTGTCTTCCTTTTTTGGTTGATTGTGTTTAAGACACTTATTAATTTTGTTTTTGAAATAGTTGCTTGCTGGCTGATGCTTTGAAGTGGCTCCCAGATTCGGGTTAGTCACTCCAAACTCTGCGTTTTCTTTCTAAGCGTAATCCTCCTATGGTTTTCTCCATGTCCATTCCTACCTCTGCATTTTCCACTGTGACCCTTCTCCGTTCTCTCACTCTTTCCCTCTCTCCGTACCATCACTACTTTCATTATCCCAATCATATAATCCCTACTCTCTTTATTTCTTCATATTCTGTTAAAGTGCGACAACTTCCCAGAATTCGTGCCTTTGCTTCCGGTTCTTTTGTTAAACAGCTGGTGTATGACCGGGATTCCCCGTCCGAATCGGAGGAGCACTTATCATCTCCATACAGTAATGGGGGTGATGGTTTTCATTTTGAAAATGGTTTTGCATCAGTGGATTTGAAACATTTGGGAACGCCTGCGCTCGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACAGTGATACGACTGCTTAATGCACAGAGAAAATGGATGGGGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATCCGTGAAAACGAGACAGCATTTAGGGTTAGTGTCTTTTCTCCTTTCTTTTTTATTCTATTATGTTCCAACACCATACTAACGCAGCTTGTAATGCAATCATGGGTTTGTTGAATAATAGGGAATTACTTGGAAGTGTTCAATTTCGGCTATATTTTTGTGATGTTTGTATTATAATACTGTAGAAAAGTTTGGAATGTTATAGTAGGACTCTAAAGATAACTGTAGCACGAACAACATTTAGAATGGAACAAAAATTTCAGAAAAAGTAGAATTTGATAGTTCTCTTTATTTGCTATTTTTTGTAACCTATGTGATAATTTTCTTCATTTCGGAGAGTAGGTGTACAAGTGGATGATGCAACAACATTGGTATCGATTTGATTATGCTTTATCTACTAAGCTTGCCGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGAGAAGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAGTTCCCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCATAATTTGGTAACAAGTGGACTTGAGCTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTATCAGGATACTATTGACAAAGAAAGGATAGTGTCACTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAAAGAAGTCCTTTTGTCCATCTTGAGAGCAAGCTCGAAAATGGGGGATGTAGTGGAAGCAGAAAGATTGTGGCAAAAACTTAAGTATTTAGATGGTAACATGCCATATCAGGCTTTTGTTTATAAAATGGAAGTCTACGCAAAGATGGGTAAACCAATGAAGGCTTTGGAGATATTTAGGGAGATGGAGCAGTTGAACTCTACTAATGCTGCGGCATATCAGACAATTATTGGTATTTTATGTAAATTTCAAGAGATAGAACTTGCAGAATCAATCATGGCAGGCTTCATAGAGAGTAATTTGAAACCCCTCACGCCAGCTTATGTTGATATGATGAATATGTTTTTCAATTTAAGCTTACATGATAAGTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACCATCTATAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTTGACAGGGCTGAAGAAATATTTAGTCAGATGGAAACAAATGGAGAAATTGGTGTAAATGCTCGTTCATGCAACCTCATTTTATGTGGGTATCTTTTATTTGGAAATTATATGAAGGCTGAAAAGATATATGATTTGATGTGTCAGAAAAAGTATGACATTGATCCTCCTTTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGAAAGGAGGTTAAGAAGCCAATGAGCTTGAAGTTGAGTAAAGAACAGAGGGAGATTTTAGTAGGGTTGTTGTTAGGTGGTCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCACAGAATCCAATTCGAATTCCACAAAAACTGTAAAACTCACTCTGTTTTGAGGAGGCACATATATGAGCAATACCACAAGTGGTTGCATTCTGCTTCAAAGTTGACCGATGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCAGATCAGTTTTGGCCACGAGGGCGTCAGACAATCCCTAATCTTATTCACCGGTGGCTTTCACCTCGTGCTCTTGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTATTGAAACTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAGCATGTATTGGATAGGTTTACTTGGAAGCAATGCCACATGGTTCTGGAAACTAATTGAGCCTTTCATTCTGGATGACTTGAAAGAAAGTACACAGGCAGACAGTCTTAACTTGGGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGAGGAGACTTCAAATTAATTTAAGAGTTTTAGTTATTAGGCCCATTATCAGTTGGATTCCTTAATTTGCCAACTGACGGAAGCCTACAATGTTTCATGATTTTGGTAGGGTATGTTATTCACATAGGATCTTCCTTGATCAATTTGGAAGATGTAAATATGTAAATCGATATGTAATGCTTATGTTCTGATTCTGTGTATTGTTGTAGCATTCTTGTTCACTTTTTAATATACGGTATATGTTAGTTTGGAAGTCATACTTTCTTTTCTTTATCTTGTTTCTTAAAAAAATATAGTTTTCTTCTTTTGATAGGTAACTTTTTAGTGGTGGTTCTTACTGGATTCGTCGGAGGAAAGGGAACTGCTCAGTTTTTGTAATAATTTAAATTATTATTATTTTTAATAACTATTTTGTTCCCTCCGCTGGTCCAAAATTAAGTTCCCCTCAAGAACCTCATATGGATAATGGATGTACCAAAAATGATTTAAATTTTCCTGTAGTATCTTTGCGTATAGGAGCTCTAAATGCACAGCCGACTTCAACACTCTCAGTAGAGACACCTCAGTCGGCTTCTCTCTCCCTCGGGTAGCAGTCTTTGTTTTCATGATGGGAAATCCCAGATCATCTCTTCTGCCTACTGCCACCATCCCATTGTGGCCGGAGTGTGGTTCTTTACGGCCATTTTTGGTATGTCTGCTTCCCCAAAGTATTGACGATTGTTTTGCTGAAGTTCTTTTGTTGGTCGGTTGGTGACTGAAAGCAAGCCTGAAAACTTTTAGACATTTGCAGGTCAAGCTGTTCTTTGGCTTATTTGGTTAGAATGAAACGAAAAGATCTTTTTTTCTAATCTTTGTGGCCATAATTTATTTTAATGTCGTATGGTAAATGGCCCTTGTAGACTGTACCTTAGATTTTATTTTTTATGACGATACTACGATGGTGTTATTTTTAATCTTGATCCCTGGAGTAACAAATCTTTTTTGGTGGGGGTTCACTCTACCAAGCTCGTAAGTGTGTGGGCGCTTGTTCGTTTGCGTGTGACCATGATCGTCCTTTCTCTATCTCTGTCGTTGGATCTGCTGCAACCCCGAGGGTATAACATATTTTTCTGTTGTTGGCATGTTCCTCACTGTCTTCTGCAACCCCTCCATCTTTCAATTTCTAGTTGTTGAGCTGTCAGGGATTAATTTTTTTTGTTTCACTTTTTTTCTTCCATTCTACCAATTCGTAAATATGGTCAAATTGCAAATGGGTCATATTTAATCTAAAGTCATGCTTAATTAAAACCTAGTTGTGAAAGGGATTTGTTCGAAGGGTAACCGAGTGTTTAGCTTCACACTTTCTTTCCTTCCTTTTATTTCCTCTCCTTTTTGCCTGATAAAGCAACTGCTAGACCAAAAGAGATGTCATTGGGGAGGTTCTCAAAGATAAAGGGATTTGACCTTCTTACGTACCCCAATCCCAAATTAGGCCGATTTTAATAGGTGTGCCTTTAACTCTTTTATAAGAACCTTTCCTCATTTTTGACAAAAATTACATTGCCAAGCAAAATTCTCTCCACTGCCCAATCTTTCATGTTTTGTGACATATTGCTATAATGGTTTTGTTTCATCTTCAATAGTCATTCTTGAATTCATACTGTCCATTGTTTGGCCACAAAGCATGAAAAATAAATAAATAAATAAATAAAAGGAGAGACACAAAAAGAAACTGATAGTGAGATGGCATCCTTACTTTGGGGAAACTGGACTAAAGCCAACGACAAGAGGGATGGGCGGCGACCGACGAGAGGGTTGGCGGTGACCCGTGAGCAAGTTAGGGCAGAGAGGGTTGGCAATGATAAATTGGAGAAATCGGCATAGGGGAGAAATGCTTGAAAAAAATATTTGAAAAGATAAAATAAAAAAAACCTAATAATTGTTGGGTCGGTCTAATGAGCTTTTCTAACATTTTCTGTGTTCTTTTCCTTGGAGTTATGCCTTGGTTGTGTTGGGGCCATGTCCGAAATAAAAAAATAAAAAAAATAATCTACATGCCAGCTGGCATAGGAATGTATCTGATACTAACACCTAGCCATCTTAGAAGTGTTCGTGCTTCTTAGGTTCATGTTATCTGTATCTTGCGCACCTTCCCAGAAAGAATCACGAATCAACTTATCCAAAACTTTGATAATAACGAAAGGAACGTGATGGAGAGATGACGACTCCCTTTCAAAATATATGCATATTTCCAATTATAAAGCTTATATTGCATTCTCTCTATAATGGAATGCTAGAAAGCATTGGAAGTAGAATTCCACCTAAAAGCAAGCTAAGGTAGGCAGAAGGTCAATTAACGTGTTTGCAACCAAAAGAAGAAGCCAAGGAAGTCATAACAAAAGCATTTATTTTAAGATGCTCACTCTTGAAAAGATTAATAGACAATCTAGAAGCAACTTCAAAGATACGAATCACCTAAAAAAGATGCAAAAGGCAGAAGGAGTCATAGAAGAAAAGATCAATGTATCATCAACGAACTGTAAATGGCTCGAAACAAAATTAGAATCACCAATAAAGTGGACAACCTTGGATAAACTATGGATCCATAAGACAACTACGACAATCAACAACCAAATTAAATAAAAATGGGGATAGGGTTGCGTTGCTTGATACCAACCGAAGAGATATTCTTACCCCTAGCCCGACCATTAATAATGATGGAGATATTTGTACTAGAGATCCAACCACGAAACCAAGATTGCCACAACTGACCTAAGCCTTCTATCTAAAAAATAGCATCGAGAAAATCCTAGTCCACCGTGTCAAAAGCTTTCTTCAAATTGAGTTTCATAATTTTATATGTTGTCATTGTAGCGTAAGTTTTTCATTGTTTCTTTGTTAGCTCTTTGTTGATTATGATATTCTTTTGCTGAAAGAACTAAAAGAAATGGAAAACAGAGCAAATCGTACAGCAATATGCAGAGCAAAGAAAGGGGGTTTCAAGGATACTCCTGCGGAGGATCTTCTTGCATCTGTTTTAAAGGTATGATTAATTCATTCGTTTGATGTTTTTTATATTCTTCTCAAGTATTAAATTTTGTATTGGCTTGTTTAGTTTCACATATTTGTGTTACTTACCACTTTTACTCATTCCCATTAGTTACTTGGAAATCTCCTACTAGCCACCATTAAGATCTTGATGAAGCTAGAAACTTTTGAATGGTGGTGCGTAGGAAGATCTGTACTCGCACTCACGACCGATCATTATCCCTTTTCTTTTCGTCTTTGCGGGTTTTTTCAAATTAAGAAATTGGTGCGAATTTTGAGAATGCACCTTTTCCTTCCTTATTTAGGTTTTTTAGTATAACAATTTTTGGATGGAGATTGAACTGCTGACCTTAAATGAGAGAATGCCTGCAACTTACAAAAACTAAGCTCACTTTGACTGTGGTTAGTTGTTGCCACCATTTGAATTGGCCCCATAAACTTATTCAATTCTAAACCCCCAAAGTAATTTAGTTTGTGGATAGGCATGTGTGTTTCAATGAATTACAACCTCTTTAATTGCAAAATGCATTATGAGGTTACACCATCATTTCCTTAATTGATATTTGATGTTTCATGATGAAATTGGACTCGATTATCGTAATAATAGCAATAACCAATGTTTTGTGATTATTGACGAGAGTAAAGACTCAATTCAATATTCTGCCAACTACTTAACCAAACTTTGTTTCTTTTGATTATGTTCATAGATCTCTCTTGAAAAAAATGTTTAAGATGTTAGGCACCCTATTTTATTCATACTTACATACCCAGGGGTAAAAAGGGGATTGATTAGGTGACAACATAGCCAATGGGATAAATGGGGAGAATATCCGTTGAGGAACCCACCAGGGTGAATTGGAGTACTTAAATAATGTTTTGTGATTGCTGAAAATTAGGTCACATTTTTTGGAAAAAGCTGTAGAATGCTTTTATGGGAGAAGAACTAGCCTTCTTAAAATTGCTGGGGGATCCTTTCTCTTTGATTTCCTTTATTATTTGTTGTTTGTTTGCTGTATTTTTTTCTCTGTTCTTGAACAAGCATTGTGAAGACTATACTTAATTCCATTTCAATATATATTATCACATAGTACTGTCCCTGTTTTCTCTAGTCTTGTTGGTGTTTTTTGAGTGGCAGGTAGAGGAGTCCTGACAAATTGGTATCAGAGCTGCCAAAGATCCTGAGAGGATACGAACAATAGCACATAAATAATTAAAGGCAAGGATGGAGATGAGTGAGAGAGAGATCATGGGTTTGAAACAAATGATACTCGGTCTAACTAAGAGTGGAAAAACTGTCCGACAAAGTGAAAGAAAGCAGTGTGACCAAGTGACCAGAGAGAAGAATTGTGTGCGTCGGATGGGTTTGGGTTGAAACTAAAAGGTAAGGTGGAGGAAGTTGATGCGACTTCTAGCCTCGTTAAGGGTCCTCCTAATAGAAGCAAGTATAAGAAGTTGGAAATGCCCGTATTTGCCGGTGTAAACTCAAAATCATGGATTTATAAGGCAGAACATTATTTTGAGATCAATGAGCTAATTGACACGGAGGTGTAGGTGGCTGTCGTTAGTTTCGCCCAAGACGAAGTGGATTCATTTCGATGGAGCAACAATTGGAAGAAAATCACGTCGTGGGAAGACCTAAAGGGGAGGATGTTTGAACACTTTAAGGTCCCTAGAGAAGGAAGCCTGAGCGCTTGCCTCATATGCTAGCAAGATGGAATGTATACAAACTATGTGAAGAGATTTTTGAACTACTCCACACTTTTGCTGAAGATGGCAAAGAGTGTTTGGATAAATGCTTTCGTAACCGATTTAGAACCAGTGCTTCAAGTAGAGGTGAAGAGCCGCTATCCCATAACTATGAGAGAGGCCCAATTAGTGAAGGATAGAAATTTGGCTCTCAAGATGGCCCTAAATGAGTTGGGTGGCAGTGGACCGAGTATTTCAGAGGCTCAAACCCAAACTATGAAAGATGGAAGAACAAATACGAAAAAGAAAGGGGGAAGACAAACTGAGTACCCTATGAGGCAAATTTCGATTCTAGTCAAGGGAAGTTATACAAGGGGTGAGCTGCCAGTAAGATTTTTGTCAGATAATGAGTTCAAGGAAAGATTGGACAAGGGGTTATGCTTTTGTTGTAATGATAAGTACTCCCATGGGCACATATGCAAGATCAAGGAGAATCGTTACACTACAACAATTAAGAATTATTATTCTTGACGGTTTTAAAATCATCATTGAAGCCAATGTTAAGAAAGTCAAAGTTCATGACGGTTAATAAACGTCAAGAATAATACGTTGACAGTTTATAAACACATTGTTTATTATTTTCTTAGATCCCTCTCCTCCTAAGGAGTCGATTTTTGTGTGGTACGTCTTACATCTTGGTATTTCGACTAAATCAATTCGGCTATAGCCCAACTCTTAGCTTGTTTATGATGTTTTCCTAAGATCTTTTGACATGTACGAGTACTAACATACTTAGAAATTTTAAAC

mRNA sequence

GCGCACCCATAGAAGAACCCTAAGCCCAATTTAAAAATAAAAAGTAAAATTAGTTGGAAAGCGCTAAAACTTCTCCCTTTCCCTCAAGCCTAGTCACGCCTCACGCCTCCTCCCTCTGCAGACACTTCTCCTCCCGACGCCGTCCGCCGCAACAATTCAGTTGCTTGCTGGCTGATGCTTTGAAGTGGCTCCCAGATTCGGGTTAGTCACTCCAAACTCTGCGTTTTCTTTCTAAGCGTAATCCTCCTATGGTTTTCTCCATGTCCATTCCTACCTCTGCATTTTCCACTGTGACCCTTCTCCGTTCTCTCACTCTTTCCCTCTCTCCGTACCATCACTACTTTCATTATCCCAATCATATAATCCCTACTCTCTTTATTTCTTCATATTCTGTTAAAGTGCGACAACTTCCCAGAATTCGTGCCTTTGCTTCCGGTTCTTTTGTTAAACAGCTGGTGTATGACCGGGATTCCCCGTCCGAATCGGAGGAGCACTTATCATCTCCATACAGTAATGGGGGTGATGGTTTTCATTTTGAAAATGGTTTTGCATCAGTGGATTTGAAACATTTGGGAACGCCTGCGCTCGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACAGTGATACGACTGCTTAATGCACAGAGAAAATGGATGGGGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATCCGTGAAAACGAGACAGCATTTAGGGTAACTTTTTAGTGGTGGTTCTTACTGGATTCGTCGGAGGAAAGGGAACTGCTCAGTTTTTGTAATAATTTAAATTATTATTATTTTTAATAACTATTTTGTTCCCTCCGCTGGTCCAAAATTAAGTTCCCCTCAAGAACCTCATATGGATAATGGATGTACCAAAAATGATTTAAATTTTCCTGTAGTATCTTTGCGTATAGGAGCTCTAAATGCACAGCCGACTTCAACACTCTCAGTAGAGACACCTCAGTCGGCTTCTCTCTCCCTCGGGTAGCAGTCTTTGTTTTCATGATGGGAAATCCCAGATCATCTCTTCTGCCTACTGCCACCATCCCATTGTGGCCGGAGTGTGGTTCTTTACGGCCATTTTTGGTATGTCTGCTTCCCCAAAGTATTGACGATTGTTTTGCTGAAGTTCTTTTGTTGGTCGGTTGGTGACTGAAAGCAAGCCTGAAAACTTTTAGACATTTGCAGGTCAAGCTGTTCTTTGGCTTATTTGGTTAGAATGAAACGAAAAGATCTTTTTTTCTAATCTTTGTGGCCATAATTTATTTTAATGTCGTATGGTAAATGGCCCTTGTAGACTGTACCTTAGATTTTATTTTTTATGACGATACTACGATGGTGTTATTTTTAATCTTGATCCCTGGAGTAACAAATCTTTTTTGGTGGGGGTTCACTCTACCAAGCTCGTAAGTGTGTGGGCGCTTGTTCGTTTGCGTGTGACCATGATCGTCCTTTCTCTATCTCTGTCGTTGGATCTGCTGCAACCCCGAGGGTATAACATATTTTTCTGTTGTTGGCATGTTCCTCACTGTCTTCTGCAACCCCTCCATCTTTCAATTTCTAGTTGTTGAGCTGTCAGGGATTAATTTTTTTTGTTTCACTTTTTTTCTTCCATTCTACCAATTCGTAAATATGGTCAAATTGCAAATGGGTCATATTTAATCTAAAGTCATGCTTAATTAAAACCTAGTTGTGAAAGGGATTTGTTCGAAGGGTAACCGAGTGTTTAGCTTCACACTTTCTTTCCTTCCTTTTATTTCCTCTCCTTTTTGCCTGATAAAGCAACTGCTAGACCAAAAGAGATGTCATTGGGGAGGTTCTCAAAGATAAAGGGATTTGACCTTCTTACGTACCCCAATCCCAAATTAGGCCGATTTTAATAGGTGTGCCTTTAACTCTTTTATAAGAACCTTTCCTCATTTTTGACAAAAATTACATTGCCAAGCAAAATTCTCTCCACTGCCCAATCTTTCATGTTTTGTGACATATTGCTATAATGGTTTTGTTTCATCTTCAATAGTCATTCTTGAATTCATACTGTCCATTGTTTGGCCACAAAGCATGAAAAATAAATAAATAAATAAATAAAAGGAGAGACACAAAAAGAAACTGATAGTGAGATGGCATCCTTACTTTGGGGAAACTGGACTAAAGCCAACGACAAGAGGGATGGGCGGCGACCGACGAGAGGGTTGGCGGTGACCCGTGAGCAAGTTAGGGCAGAGAGGGTTGGCAATGATAAATTGGAGAAATCGGCATAGGGGAGAAATGCTTGAAAAAAATATTTGAAAAGATAAAATAAAAAAAACCTAATAATTGTTGGGTCGGTCTAATGAGCTTTTCTAACATTTTCTGTGTTCTTTTCCTTGGAGTTATGCCTTGGTTGTGTTGGGGCCATGTCCGAAATAAAAAAATAAAAAAAATAATCTACATGCCAGCTGGCATAGGAATGTATCTGATACTAACACCTAGCCATCTTAGAAGTGTTCGTGCTTCTTAGGTTCATGTTATCTGTATCTTGCGCACCTTCCCAGAAAGAATCACGAATCAACTTATCCAAAACTTTGATAATAACGAAAGGAACGTGATGGAGAGATGACGACTCCCTTTCAAAATATATGCATATTTCCAATTATAAAGCTTATATTGCATTCTCTCTATAATGGAATGCTAGAAAGCATTGGAAGTAGAATTCCACCTAAAAGCAAGCTAAGGTAGGCAGAAGGTCAATTAACGTGTTTGCAACCAAAAGAAGAAGCCAAGGAAGTCATAACAAAAGCATTTATTTTAAGATGCTCACTCTTGAAAAGATTAATAGACAATCTAGAAGCAACTTCAAAGATACGAATCACCTAAAAAAGATGCAAAAGGCAGAAGGAGTCATAGAAGAAAAGATCAATGTATCATCAACGAACTGTAAATGGCTCGAAACAAAATTAGAATCACCAATAAAGTGGACAACCTTGGATAAACTATGGATCCATAAGACAACTACGACAATCAACAACCAAATTAAATAAAAATGGGGATAGGGTTGCGTTGCTTGATACCAACCGAAGAGATATTCTTACCCCTAGCCCGACCATTAATAATGATGGAGATATTTGTACTAGAGATCCAACCACGAAACCAAGATTGCCACAACTGACCTAAGCCTTCTATCTAAAAAATAGCATCGAGAAAATCCTAGTCCACCGTGTCAAAAGCTTTCTTCAAATTGAGTTTCATAATTTTATATGTTGTCATTGTAGCCAAATCGTACAGCAATATGCAGAGCAAAGAAAGGGGGTTTCAAGGATACTCCTGCGGAGGATCTTCTTGCATCTGTTTTAAAGGTAGAGGAGTCCTGACAAATTGGTATCAGAGCTGCCAAAGATCCTGAGAGGATACGAACAATAGCACATAAATAATTAAAGGCAAGGATGGAGATGAGTGAGAGAGAGATCATGGGTTTGAAACAAATGATACTCGGTCTAACTAAGAGTGGAAAAACTGTCCGACAAAGTGAAAGAAAGCAGTGTGACCAAGTGACCAGAGAGAAGAATTGTGTGCGTCGGATGGGTTTGGGTTGAAACTAAAAGGTAAGGTGGAGGAAGTTGATGCGACTTCTAGCCTCGTTAAGGGTCCTCCTAATAGAAGCAAGTATAAGAAGTTGGAAATGCCCGTATTTGCCGGTGTAAACTCAAAATCATGGATTTATAAGGCAGAACATTATTTTGAGATCAATGAGCTAATTGACACGGAGGTGTAGGTGGCTGTCGTTAGTTTCGCCCAAGACGAAGTGGATTCATTTCGATGGAGCAACAATTGGAAGAAAATCACGTCGTGGGAAGACCTAAAGGGGAGGATGTTTGAACACTTTAAGGTCCCTAGAGAAGGAAGCCTGAGCGCTTGCCTCATATGCTAGCAAGATGGAATGTATACAAACTATGTGAAGAGATTTTTGAACTACTCCACACTTTTGCTGAAGATGGCAAAGAGTGTTTGGATAAATGCTTTCGTAACCGATTTAGAACCAGTGCTTCAAGTAGAGGTGAAGAGCCGCTATCCCATAACTATGAGAGAGGCCCAATTAGTGAAGGATAGAAATTTGGCTCTCAAGATGGCCCTAAATGAGTTGGGTGGCAGTGGACCGAGTATTTCAGAGGCTCAAACCCAAACTATGAAAGATGGAAGAACAAATACGAAAAAGAAAGGGGGAAGACAAACTGAGTACCCTATGAGGCAAATTTCGATTCTAGTCAAGGGAAGTTATACAAGGGGTGAGCTGCCAGTAAGATTTTTGTCAGATAATGAGTTCAAGGAAAGATTGGACAAGGGGTTATGCTTTTGTTGTAATGATAAGTACTCCCATGGGCACATATGCAAGATCAAGGAGAATCGTTACACTACAACAATTAAGAATTATTATTCTTGACGGTTTTAAAATCATCATTGAAGCCAATGTTAAGAAAGTCAAAGTTCATGACGGTTAATAAACGTCAAGAATAATACGTTGACAGTTTATAAACACATTGTTTATTATTTTCTTAGATCCCTCTCCTCCTAAGGAGTCGATTTTTGTGTGGTACGTCTTACATCTTGGTATTTCGACTAAATCAATTCGGCTATAGCCCAACTCTTAGCTTGTTTATGATGTTTTCCTAAGATCTTTTGACATGTACGAGTACTAACATACTTAGAAATTTTAAAC

Coding sequence (CDS)

ATGGTTTTCTCCATGTCCATTCCTACCTCTGCATTTTCCACTGTGACCCTTCTCCGTTCTCTCACTCTTTCCCTCTCTCCGTACCATCACTACTTTCATTATCCCAATCATATAATCCCTACTCTCTTTATTTCTTCATATTCTGTTAAAGTGCGACAACTTCCCAGAATTCGTGCCTTTGCTTCCGGTTCTTTTGTTAAACAGCTGGTGTATGACCGGGATTCCCCGTCCGAATCGGAGGAGCACTTATCATCTCCATACAGTAATGGGGGTGATGGTTTTCATTTTGAAAATGGTTTTGCATCAGTGGATTTGAAACATTTGGGAACGCCTGCGCTCGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACAGTGATACGACTGCTTAATGCACAGAGAAAATGGATGGGGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATCCGTGAAAACGAGACAGCATTTAGGGTAACTTTTTAG

Protein sequence

MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVTF
Homology
BLAST of Cmc04g0104111 vs. NCBI nr
Match: XP_008465080.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis melo])

HSP 1 Score: 364.0 bits (933), Expect = 7.5e-97
Identity = 180/180 (100.00%), Postives = 180/180 (100.00%), Query Frame = 0

Query: 1   MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF 60
           MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF
Sbjct: 1   MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF 60

Query: 61  ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 120
           ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE
Sbjct: 61  ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 120

Query: 121 LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 180
           LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
Sbjct: 121 LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 180

BLAST of Cmc04g0104111 vs. NCBI nr
Match: XP_004152074.2 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sativus] >KGN58344.1 hypothetical protein Csa_017589 [Cucumis sativus])

HSP 1 Score: 337.0 bits (863), Expect = 9.9e-89
Identity = 168/181 (92.82%), Postives = 173/181 (95.58%), Query Frame = 0

Query: 1   MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRA 60
           MVFSMSIPTSAFSTVT LRSLTLSLSPYHHYFH PNHIIPTLF+ +YSVKV RQLPRIRA
Sbjct: 1   MVFSMSIPTSAFSTVTRLRSLTLSLSPYHHYFHCPNHIIPTLFLPAYSVKVRRQLPRIRA 60

Query: 61  FASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELD 120
           FASGSFVKQLVYD DSPSESEEHLSS +SNGGDGFHFENGFASVDLKHLGTP LEVKELD
Sbjct: 61  FASGSFVKQLVYDHDSPSESEEHLSSSFSNGGDGFHFENGFASVDLKHLGTPVLEVKELD 120

Query: 121 ELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFR 180
           ELPEQWRRSK+AWLCKELPAQKPGTVIRLLNAQ+KWMGQDDATYL VHCLRIRENETAFR
Sbjct: 121 ELPEQWRRSKVAWLCKELPAQKPGTVIRLLNAQKKWMGQDDATYLIVHCLRIRENETAFR 180

BLAST of Cmc04g0104111 vs. NCBI nr
Match: XP_038887990.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa hispida])

HSP 1 Score: 272.7 bits (696), Expect = 2.3e-69
Identity = 143/177 (80.79%), Postives = 152/177 (85.88%), Query Frame = 0

Query: 5   MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASG 64
           MSI TSAFS+VTLLRS +LSLSPYHHYF  PNHI+ T+FI  YSVK  +QLPRI +FAS 
Sbjct: 1   MSIHTSAFSSVTLLRSPSLSLSPYHHYFRCPNHIVRTIFIPIYSVKGQQQLPRIPSFASS 60

Query: 65  SFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPE 124
           S V+QLVYDRDS  ESEEHLSSPYSNG D      GFAS DLKHL  PALEVKELDELP+
Sbjct: 61  SSVEQLVYDRDSLFESEEHLSSPYSNGAD------GFASADLKHLEMPALEVKELDELPD 120

Query: 125 QWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181
           QWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDATYLTVHCLRIRENETAFRV
Sbjct: 121 QWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMRQDDATYLTVHCLRIRENETAFRV 171

BLAST of Cmc04g0104111 vs. NCBI nr
Match: XP_022158727.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Momordica charantia])

HSP 1 Score: 257.7 bits (657), Expect = 7.6e-65
Identity = 128/166 (77.11%), Postives = 139/166 (83.73%), Query Frame = 0

Query: 16  TLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVR-QLPRIRAFASGSFVKQLVYDRD 75
           TL RSLT SL  +H +F   N+I+ TLFI ++S K R +LPRI AFAS S V QL+YDRD
Sbjct: 3   TLFRSLTHSLPSHHRHFRCHNYIVRTLFIQTFSAKRRPKLPRITAFASSSLVAQLLYDRD 62

Query: 76  SPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLC 135
           SPS+SEEH  SPYSNG DGFHFEN FAS DLKHLG PALEVKELDELPEQWRRSKLAWLC
Sbjct: 63  SPSDSEEHSCSPYSNGADGFHFENSFASADLKHLGNPALEVKELDELPEQWRRSKLAWLC 122

Query: 136 KELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181
           KELPA KPGT++RLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Sbjct: 123 KELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRV 168

BLAST of Cmc04g0104111 vs. NCBI nr
Match: XP_022949171.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata])

HSP 1 Score: 251.1 bits (640), Expect = 7.1e-63
Identity = 134/177 (75.71%), Postives = 144/177 (81.36%), Query Frame = 0

Query: 5   MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASG 64
           MSI TSAF+TVTLLRSLTL  S  HH+F   N++I +L I +YS K  RQLPRI AFAS 
Sbjct: 1   MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASS 60

Query: 65  SFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPE 124
           S V+ LVYDRDSP+ESEE L SPYS G +      GFAS DLKHLG PALEVKELDELPE
Sbjct: 61  SSVEALVYDRDSPAESEEPLCSPYSTGAE------GFASADLKHLGAPALEVKELDELPE 120

Query: 125 QWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181
           QWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Sbjct: 121 QWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRV 171

BLAST of Cmc04g0104111 vs. ExPASy Swiss-Prot
Match: Q9XIL5 (Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=OTP51 PE=2 SV=3)

HSP 1 Score: 123.6 bits (309), Expect = 2.2e-27
Identity = 83/187 (44.39%), Postives = 110/187 (58.82%), Query Frame = 0

Query: 6   SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPR 65
           S P    S+ TL RSL+ SL  +   +          +  H   T F S  S +   L  
Sbjct: 45  SNPNIINSSSTLFRSLSFSLIRHRSSYSRRSLRRLSIHTVHGNKTQFFSHSSTRTPPLFT 104

Query: 66  IRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDGFHFENGFASVDLKHLGTPAL 125
             + A  SG+FV+ L       +ESEE +S   +NG GD     N   +V  + + T   
Sbjct: 105 ANSTAQRSGTFVEHLT----GITESEEGISE--ANGFGDVESARNDIRNVATRRIET-EF 164

Query: 126 EVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRE 181
           EV+EL+ELPE+WRRSKLAWLCKE+P  K  T++RLLNAQ+KW+ Q+DATY++VHC+RIRE
Sbjct: 165 EVRELEELPEEWRRSKLAWLCKEVPTHKAVTLVRLLNAQKKWVRQEDATYISVHCMRIRE 224

BLAST of Cmc04g0104111 vs. ExPASy Swiss-Prot
Match: Q6ZHJ5 (Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=OTP51 PE=3 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 2.1e-25
Identity = 64/126 (50.79%), Postives = 80/126 (63.49%), Query Frame = 0

Query: 55  PRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALE 114
           P I A AS   ++ L+ D D   E E+            F  E   A+ + + + +P L 
Sbjct: 51  PGIPAVASA--LESLILDLDDDEEDEDE-----ETEFGLFQGEAWAAADEREAVRSPELV 110

Query: 115 VKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIREN 174
           V EL+ELPEQWRRS++AWLCKELPA K  T  R+LNAQRKW+ QDDATY+ VHCLRIR N
Sbjct: 111 VPELEELPEQWRRSRIAWLCKELPAYKHSTFTRILNAQRKWITQDDATYVAVHCLRIRNN 169

Query: 175 ETAFRV 181
           + AFRV
Sbjct: 171 DAAFRV 169

BLAST of Cmc04g0104111 vs. ExPASy TrEMBL
Match: A0A1S3CPK0 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502781 PE=4 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 3.6e-97
Identity = 180/180 (100.00%), Postives = 180/180 (100.00%), Query Frame = 0

Query: 1   MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF 60
           MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF
Sbjct: 1   MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAF 60

Query: 61  ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 120
           ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE
Sbjct: 61  ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 120

Query: 121 LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 180
           LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
Sbjct: 121 LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 180

BLAST of Cmc04g0104111 vs. ExPASy TrEMBL
Match: A0A0A0LBL0 (LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100 PE=4 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 4.8e-89
Identity = 168/181 (92.82%), Postives = 173/181 (95.58%), Query Frame = 0

Query: 1   MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRA 60
           MVFSMSIPTSAFSTVT LRSLTLSLSPYHHYFH PNHIIPTLF+ +YSVKV RQLPRIRA
Sbjct: 1   MVFSMSIPTSAFSTVTRLRSLTLSLSPYHHYFHCPNHIIPTLFLPAYSVKVRRQLPRIRA 60

Query: 61  FASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELD 120
           FASGSFVKQLVYD DSPSESEEHLSS +SNGGDGFHFENGFASVDLKHLGTP LEVKELD
Sbjct: 61  FASGSFVKQLVYDHDSPSESEEHLSSSFSNGGDGFHFENGFASVDLKHLGTPVLEVKELD 120

Query: 121 ELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFR 180
           ELPEQWRRSK+AWLCKELPAQKPGTVIRLLNAQ+KWMGQDDATYL VHCLRIRENETAFR
Sbjct: 121 ELPEQWRRSKVAWLCKELPAQKPGTVIRLLNAQKKWMGQDDATYLIVHCLRIRENETAFR 180

BLAST of Cmc04g0104111 vs. ExPASy TrEMBL
Match: A0A6J1DXY9 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111025188 PE=4 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 3.7e-65
Identity = 128/166 (77.11%), Postives = 139/166 (83.73%), Query Frame = 0

Query: 16  TLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVR-QLPRIRAFASGSFVKQLVYDRD 75
           TL RSLT SL  +H +F   N+I+ TLFI ++S K R +LPRI AFAS S V QL+YDRD
Sbjct: 3   TLFRSLTHSLPSHHRHFRCHNYIVRTLFIQTFSAKRRPKLPRITAFASSSLVAQLLYDRD 62

Query: 76  SPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLC 135
           SPS+SEEH  SPYSNG DGFHFEN FAS DLKHLG PALEVKELDELPEQWRRSKLAWLC
Sbjct: 63  SPSDSEEHSCSPYSNGADGFHFENSFASADLKHLGNPALEVKELDELPEQWRRSKLAWLC 122

Query: 136 KELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181
           KELPA KPGT++RLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Sbjct: 123 KELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRV 168

BLAST of Cmc04g0104111 vs. ExPASy TrEMBL
Match: A0A6J1GB98 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452602 PE=4 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 3.4e-63
Identity = 134/177 (75.71%), Postives = 144/177 (81.36%), Query Frame = 0

Query: 5   MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASG 64
           MSI TSAF+TVTLLRSLTL  S  HH+F   N++I +L I +YS K  RQLPRI AFAS 
Sbjct: 1   MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASS 60

Query: 65  SFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPE 124
           S V+ LVYDRDSP+ESEE L SPYS G +      GFAS DLKHLG PALEVKELDELPE
Sbjct: 61  SSVEALVYDRDSPAESEEPLCSPYSTGAE------GFASADLKHLGAPALEVKELDELPE 120

Query: 125 QWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181
           QWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Sbjct: 121 QWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRV 171

BLAST of Cmc04g0104111 vs. ExPASy TrEMBL
Match: A0A6J1KB64 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493350 PE=4 SV=1)

HSP 1 Score: 245.0 bits (624), Expect = 2.5e-61
Identity = 132/177 (74.58%), Postives = 143/177 (80.79%), Query Frame = 0

Query: 5   MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASG 64
           MSI TSAF+TVTLLRSLTL  S  H++F   N++I +L I +YS K  RQLPRI AFAS 
Sbjct: 1   MSIRTSAFATVTLLRSLTLPFSQCHNHFRCWNYVIRSLSIPTYSAKGRRQLPRIPAFASS 60

Query: 65  SFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPE 124
           S V+ LVYDRDSP+ESEE L SPYSNG +       FAS DLKHLG PALEVKELDELPE
Sbjct: 61  SSVEALVYDRDSPAESEEPLCSPYSNGAE------EFASADLKHLGAPALEVKELDELPE 120

Query: 125 QWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181
           QWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Sbjct: 121 QWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRV 171

BLAST of Cmc04g0104111 vs. TAIR 10
Match: AT2G15820.1 (endonucleases )

HSP 1 Score: 123.6 bits (309), Expect = 1.6e-28
Identity = 83/187 (44.39%), Postives = 110/187 (58.82%), Query Frame = 0

Query: 6   SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPR 65
           S P    S+ TL RSL+ SL  +   +          +  H   T F S  S +   L  
Sbjct: 45  SNPNIINSSSTLFRSLSFSLIRHRSSYSRRSLRRLSIHTVHGNKTQFFSHSSTRTPPLFT 104

Query: 66  IRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDGFHFENGFASVDLKHLGTPAL 125
             + A  SG+FV+ L       +ESEE +S   +NG GD     N   +V  + + T   
Sbjct: 105 ANSTAQRSGTFVEHLT----GITESEEGISE--ANGFGDVESARNDIRNVATRRIET-EF 164

Query: 126 EVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRE 181
           EV+EL+ELPE+WRRSKLAWLCKE+P  K  T++RLLNAQ+KW+ Q+DATY++VHC+RIRE
Sbjct: 165 EVRELEELPEEWRRSKLAWLCKEVPTHKAVTLVRLLNAQKKWVRQEDATYISVHCMRIRE 224

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008465080.17.5e-97100.00PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic ... [more]
XP_004152074.29.9e-8992.82pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sa... [more]
XP_038887990.12.3e-6980.79pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa ... [more]
XP_022158727.17.6e-6577.11pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Momor... [more]
XP_022949171.17.1e-6375.71pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
Q9XIL52.2e-2744.39Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidop... [more]
Q6ZHJ52.1e-2550.79Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa... [more]
Match NameE-valueIdentityDescription
A0A1S3CPK03.6e-97100.00pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis ... [more]
A0A0A0LBL04.8e-8992.82LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100... [more]
A0A6J1DXY93.7e-6577.11pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like OS=Mom... [more]
A0A6J1GB983.4e-6375.71pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit... [more]
A0A6J1KB642.5e-6174.58pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT2G15820.11.6e-2844.39endonucleases [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR47539PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN OTP51, CHLOROPLASTICcoord: 15..180

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc04g0104111.1Cmc04g0104111.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000373 Group II intron splicing
biological_process GO:0045292 mRNA cis splicing, via spliceosome
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0048564 photosystem I assembly
molecular_function GO:0004519 endonuclease activity