Cmc04g0098491.1 (mRNA) Melon (Charmono) v1.1

Overview
NameCmc04g0098491.1
TypemRNA
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCMiso1.1chr04: 14210601 .. 14216588 (+)
Sequence length5124
RNA-Seq ExpressionCmc04g0098491.1
SyntenyCmc04g0098491.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTGGAAATGGGAGAGGGCGGGAGATGAGAGAAATGAAGAACAACCAAAATGGGTGCATTTCGATTCTTCTTAAGTGAAAAACTCGACTTCTCAACTTCTATAATCTCCAGAATGTTTCAATTAAATGCGAAAGAATAGCTGCACCCTGCAGGGATGTTCTCATTTGTGACTACGATCGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTTCAATCTCAATGCCTCTTCAATCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATAAAAAACTGTTCCAACATAAACGAACTGCATGTTGTATATGCTTCCATGATCAAAAGTAATGCAATCCAAGATTGTTTTCTGGTGCATCAGTTTATTAGCGCGTCTTTTGCTTTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAACCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACCGTGGGTACCCATTTCGTGGTTTACAATGTTATGTACATATGTTGGAAGGATCGAACGTTTTGCCAAATAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCGAAGTTGGAGAAACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTACTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATACGGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACTAAGGACATAATCTCCTGGACAACCATGATCACTTGTTATTCACAGAACAAGCAATATCAAGATGCGTTGGCAATTTATAGTGAGACGAGATTGAATGGGATTATTCCTGATCAGGTAACAATGTCAACTGTTGTTTCAGCTTGCGCCCACGTTGGAGCTCTTGAACTAGGAAAAGAGATACATCAATATGTAATGTCTCAGGGGCTTAATCATGACGTCTATATTGGTTCTGCATTAGTTGATATGTATGCTAAGTGTGGGAGTTTAGATTGGTCTCTTTTGATTTTCTTCAAATTGAAGGATAAAAATTTATATTGCTGGAATGCAGTAATTGAAGGGCTTGCAGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCCTGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGAAGAAGGCAGGAGTAGATTTTTGAGCATGACTCGTGATTATGGCATTTCTCCTGAAATCAGACACTACGGTTGCATGGTTGATATGTTAAGTAAAGCAGGATTGCTCAAAGAAGCATTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCACGGGAACTCTGTGATTGCAAAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTTAGCATGTGTGCTGAAGAGAAGGATTGGATGGAGGTTGCGCATATTCGATTAATGATGAAAGAACAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCGTACTGACAGAACTAGATGGACAACTGAAGCTAGCTGGTTACATACTTGAGCCTTCAGTATGCAGTACTGCTTTGGTTTTTCCAGAGGAAATTTGATCAACATTAATTGAGGTCATAGTGAGATCGAATATTATTTGCATATCAATCATTTCAGCTTCATTGAATATGGTACATTGAACTGAAGGGAAAATTCTTGAGGTCAAGTGCTAAATGTCAAAGCAGGGCTACTATAAGAGTTCATAATTATTCAGATCAAGGCTCAAGTTAGCCTCATTAAGAGCCATGGTAATATAAAGTCTAATCTTAAGTTCTGCTAGAATGTCTTCTCATTTTAAGTAAATTCTCAACATTTGACAGAATGAAATGAATGAATACTTCTTAGTCTTTTACATGCACAATTGAGCATCTTGATGATGCATTTCTTTTCTATGTTTGGCTTCTATCATTGAGGCTACCTATTCATGTTAAGATATATTTGATGATTAGATTTAATCGACAGTCAATAAAATGAAGTGAATGGTTACTTCTTAGTTTCTTACGTGCACATTTGAGCATCTTGATGCATTTTCTTTCTCTGTTTAACTTGTCTAATTGAGGCTATCTATTCATATTAAGAATATTTTGACCGACTATATCTAACCAACTCTTTTAGTTCTTAAGTTTTAAGATTAGTGATGGTTGAAACTCTAAAAAGTTGTTACCAATCTCAACTGTTTGAATGATCAATCTCAACATCTAAACTCAAACTCCTGCAATGGTTTGATTAATGAGGCCCAGTAAATATGTTATTGATATTGAAATAAAAAAGAAACAACTCTAAAAATCTTTGAACTTAGGCTTGTAAAGTTAGAATTCCCCACTCTTGTTGATCGAAGTAGATTCAATCATTTTCTTTATGCTTTCAGTTTACTCACATTCATTTTGTGACTTCTGCTCCAGCTATCAAAAGAAGGGGCAGGGACTAAATTTATAACGAGATAACGTACCACTATTGTACAACCGAAGCTGACTTAGTGAAGTGATTTGATCGAAAAACCCAAGCTAACAGGATATCCTGTAACAGGTCGGTCCTTTTTCCTCCTTTGTACAGCACTTCCGTATATTTCAAATACATACATGATCATATTGTCTGCTTATGCACTTCCTTAGAAACAAGCTAGGTTTTGAGGAAAATTGAAACCAAAACTTATCAGAATCTTCACGTATAATTTCATTCATGACGATAAATTGCCAATACCGTTTGAACCGTGTTATAATATTTCTAAAACATCCAACTTGTCCCACAGCTTTTTTTTGAACTATGTTATAATATTTCCAAAACATCCAACTCGTCCGACAGAATTTTTTTAGGTTGGAATGGTGTGGAGAAAATAAAACTCCTAGCTTGATCTTATATTGTTTTAATTTTCTTTTTCTAAACGAAGAACATGCCAAATCTTCTCTTTCCTACTTCAAGAGTGTTCTTAATCCATCAATTTAGCATTGGAGAATGCTAAAACACTGGCTAAAATTGAATGGAAGTGCGACTCTCGTCTTCAAGCACCAGCCATTTTTATTATTATGGGAGTAGCTTGCTTTCTTCTGGCAAGATACATGGTTCAAGGGATAGGGGAAATTCTTCAAACCTTGTAGGCTATGCATCTGCTGTAAGATATGGTTCTTAGGGAACATTATACACGTTTCTATCTTAGTCTCTTTTCATTTTCTTAATGAAAAGTTTCGTATCATTTCAAGGTTCAACGATAGTGATTGCTGTTATTCCACTAGTTTGACTTTACCTTACCAAGTTGGTGGGGGAAAATCCTGCCCAAGTATCGCATAGTCTTTCTCATACTCTGCCAAAGTAGAAGTGTGTGCTTTCTGACCTTGCTATAAGCTTACATCCACAAAGATCATGAAAATTTGTAAAAAGGAGGTGGATTAATCAGGTTGATTATAGAGACAGAAGTGGCCTCTGTTTACTAACAAGTTAGTATTGTATTTTTTTTTTTTTTTTTTTTTTTTTTTGAGCATTTTATCACAATAAGAAAGGGAAAAAAATTGTTTCACTAGACACTTATCGGCTTTAGAGACGCCCAAATTCATAGTTTCTTCTAACAGGATTTCGTGCACTTGGCAATGGACCAGGATTGGGTTAATATTTAATCACATGATATTTTTACGTTCCAAATTTGTTTAAAAAATGATCTGAAGTACTTACTGCTTCCGGTTCAATTCACTAGACCCTTTTATAGTTCTCTCGTTTTTGAGAGTAACAAAAAAGTCTGTTGTTGTTCTTTAGTGGTGTAAACCTTTTCTCCTTCCAACAATTATAGTCTAAATAGTCTTTTCCTTTTAGTTTAGAACTTGAGGCCCTCGTTATTCTCATATCTCATACCATCTATGAACTTGTTTTTTATTTTAAAAGAAGGTCCGTGCTCGAGATGAAAACTTCTTGAAAAATTCAGAACAAAAGGCCATGATTTGGCTAATAATTTAACTGAGATTAAATAAAAGTACAGTAAGTCTAACTTATCTAAGCATATAATCGATTTGGATGAAACTAATGTACGGATATAGAGCTAACGATCTAAACGTTGTTGCACAAACTAACAATTAGAAGACGGTATATAGCTTGAGACTTAAGATGAAAAGTATACACCTCCCCTCCCCAAAGCTTAGAGATAAAAATTGATCCCCACACCAAAAGCCCTCTCGTCCTTTGAATTTACTTTTTAATTGACCATCCGTTTAGGAATCATCTTCAAAACCTACTCTTGGCTTATATAGAATGAGCGGAATCATCGTATATCTAAAGATAAAGAATCAGATTTCACACATTTTTTTATCACTTTTGTATACATCTCTATCATGGTGTAAATTTACTTCTAACTTTTGTGACTATAATCTCACCTCTTAAATCTCAATGGAATAGCATGATGTAATCTTTCTCGCCACTTGATGCCCTTTTGTAATTTCATTTTATCAATGAAATTTGATCGACTATTTCTAAAATCCCCCACACCAAAAGAGGAAAAGCTATTGCCAGTACAAGGGAAACAAAACAGAGGAAAATGAACGAAGAGACTAGAATTATGTAAAGTGACAACTGACCGTGATAGGAAAAAGAAGAAAGTTTCATAAAACTCTTAAATAAATGATGATTAAAAAATAGTCTACAACCATTAGGGAGGGGCTTGACCGATCAAAGCTCGAGTTCAACATCAACTGTATTCATGAAAAAGTCCAAAAAAAAAAAAAAAGAAAAGAAAAGAAAAATCTACGAGCCGAATAAGTTGATAAGGATGTCGCTATGGCAACAATAGGTTTACCGACATAGAGGAGACATTGCTCAAACTGGAAGCCATTATTACTTTTTCTTTAAAACATTAAAAACCCCATCATCTGACCAAAAATTTAGACTTTTTTTTTGGAGGGGGTGAAGAAAGAAACACCAAACAAAAACAAACAAGACTGCAAAACCTGAAGCCAAACTCAGTTGAAAGATGGAACTTTTTTTAAAAGGTTAAATCTTCTCTCTGTGTCTTTTTAAATGGGATACAACCAACTTTTGATGGCCTGCACTGTACAGACTTTCTCGGCCTCATTCATTATGGAACATCTAATTAATTTTCTAAGTTGATTGGGATGAGTAGTTTGTGTGGTTCAAAAGTAAAAGTACTGATAAGAAATTTAGAATAATGCTTCCAATTGTGTCAGTTTTTGTGCTGAAGCCTGAGGATAGAAAGAGTTTTTGTAGTGTTGTGGGTTTGTGGGGCTTTCTTGCATCCTCGAAAACCTAGAAAGGAATTCCATCTTCTTAAAATTATAAAACAATGGTTCAATTATATATTTATTGAACAAATACTTGTACTTTGCCATATTACCCTTTAGCTCTTCGTGAACATTTCAAAACTATTGGAATAGAAATAGTTTAGAATTTTAGATGAAAGTTGAGACATCAAATTTTGTGGCGATTCTATTCTAACACCCTAGTTTTCCAGACTAGGATTCAAAATCCGAACTTAGCACCTTATAATCTTACATTACTCACAAATCAATATGGATCTTTTTAGCATTGTTTGTTTTTACTCAGACGATTCATAGAAAATTTTTAAGAAGGCAGTCAACATAGAATTGCTTCCAAAGCT

mRNA sequence

GTTGGAAATGGGAGAGGGCGGGAGATGAGAGAAATGAAGAACAACCAAAATGGGTGCATTTCGATTCTTCTTAAGTGAAAAACTCGACTTCTCAACTTCTATAATCTCCAGAATGTTTCAATTAAATGCGAAAGAATAGCTGCACCCTGCAGGGATGTTCTCATTTGTGACTACGATCGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTTCAATCTCAATGCCTCTTCAATCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATAAAAAACTGTTCCAACATAAACGAACTGCATGTTGTATATGCTTCCATGATCAAAAGTAATGCAATCCAAGATTGTTTTCTGGTGCATCAGTTTATTAGCGCGTCTTTTGCTTTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAACCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACCGTGGGTACCCATTTCGTGGTTTACAATGTTATGTACATATGTTGGAAGGATCGAACGTTTTGCCAAATAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCGAAGTTGGAGAAACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTACTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATACGGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACTAAGGACATAATCTCCTGGACAACCATGATCACTTGTTATTCACAGAACAAGCAATATCAAGATGCGTTGGCAATTTATAGTGAGACGAGATTGAATGGGATTATTCCTGATCAGATTGTAATTGAAGGGCTTGCAGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCCTGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGAAGAAGGCAGGAGTAGATTTTTGAGCATGACTCGTGATTATGGCATTTCTCCTGAAATCAGACACTACGGTTGCATGGTTGATATGTTAAGTAAAGCAGGATTGCTCAAAGAAGCATTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCACGGGAACTCTGTGATTGCAAAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTTAGCATGTGTGCTGAAGAGAAGGATTGGATGGAGGTTGCGCATATTCGATTAATGATGAAAGAACAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCGTACTGACAGAACTAGATGGACAACTGAAGCTAGCTGGTTACATACTTGAGCCTTCAGTATGCAGTACTGCTTTGGTTTTTCCAGAGGAAATTTGATCAACATTAATTGAGGTCATAGTGAGATCGAATATTATTTGCATATCAATCATTTCAGCTTCATTGAATATGGTACATTGAACTGAAGGGAAAATTCTTGAGGTCAAGTGCTAAATGTCAAAGCAGGGCTACTATAAGAGTTCATAATTATTCAGATCAAGGCTCAAGTTAGCCTCATTAAGAGCCATGCTATCAAAAGAAGGGGCAGGGACTAAATTTATAACGAGATAACGTACCACTATTGTACAACCGAAGCTGACTTAGTGAAGTGATTTGATCGAAAAACCCAAGCTAACAGGATATCCTGTAACAGGTCGGTCCTTTTTCCTCCTTTGTACAGCACTTCCGTATATTTCAAATACATACATGATCATATTGTCTGCTTATGCACTTCCTTAGAAACAAGCTAGGTTTTGAGGAAAATTGAAACCAAAACTTATCAGAATCTTCACGTATAATTTCATTCATGACGATAAATTGCCAATACCGTTTGAACCGTGTTATAATATTTCTAAAACATCCAACTTGTCCCACAGCTTTTTTTTGAACTATGTTATAATATTTCCAAAACATCCAACTCGTCCGACAGAATTTTTTTAGGTTGGAATGGTGTGGAGAAAATAAAACTCCTAGCTTGATCTTATATTGTTTTAATTTTCTTTTTCTAAACGAAGAACATGCCAAATCTTCTCTTTCCTACTTCAAGAGTGTTCTTAATCCATCAATTTAGCATTGGAGAATGCTAAAACACTGGCTAAAATTGAATGGAAGTGCGACTCTCGTCTTCAAGCACCAGCCATTTTTATTATTATGGGAGTAGCTTGCTTTCTTCTGGCAAGATACATGGTTCAAGGGATAGGGGAAATTCTTCAAACCTTGTAGGCTATGCATCTGCTGTAAGATATGGTTCTTAGGGAACATTATACACGTTTCTATCTTAGTCTCTTTTCATTTTCTTAATGAAAAGTTTCGTATCATTTCAAGGTTCAACGATAGTGATTGCTGTTATTCCACTAGTTTGACTTTACCTTACCAAGTTGGTGGGGGAAAATCCTGCCCAAGTATCGCATAGTCTTTCTCATACTCTGCCAAAGTAGAAGTGTGTGCTTTCTGACCTTGCTATAAGCTTACATCCACAAAGATCATGAAAATTTGTAAAAAGGAGGTGGATTAATCAGGTTGATTATAGAGACAGAAGTGGCCTCTGTTTACTAACAAGTTAGTATTGTATTTTTTTTTTTTTTTTTTTTTTTTTTTGAGCATTTTATCACAATAAGAAAGGGAAAAAAATTGTTTCACTAGACACTTATCGGCTTTAGAGACGCCCAAATTCATAGTTTCTTCTAACAGGATTTCGTGCACTTGGCAATGGACCAGGATTGGGTTAATATTTAATCACATGATATTTTTACGTTCCAAATTTGTTTAAAAAATGATCTGAAGTACTTACTGCTTCCGGTTCAATTCACTAGACCCTTTTATAGTTCTCTCGTTTTTGAGAGTAACAAAAAAGTCTGTTGTTGTTCTTTAGTGGTGTAAACCTTTTCTCCTTCCAACAATTATAGTCTAAATAGTCTTTTCCTTTTAGTTTAGAACTTGAGGCCCTCGTTATTCTCATATCTCATACCATCTATGAACTTGTTTTTTATTTTAAAAGAAGGTCCGTGCTCGAGATGAAAACTTCTTGAAAAATTCAGAACAAAAGGCCATGATTTGGCTAATAATTTAACTGAGATTAAATAAAAGTACAGTAAGTCTAACTTATCTAAGCATATAATCGATTTGGATGAAACTAATGTACGGATATAGAGCTAACGATCTAAACGTTGTTGCACAAACTAACAATTAGAAGACGGTATATAGCTTGAGACTTAAGATGAAAAGTATACACCTCCCCTCCCCAAAGCTTAGAGATAAAAATTGATCCCCACACCAAAAGCCCTCTCGTCCTTTGAATTTACTTTTTAATTGACCATCCGTTTAGGAATCATCTTCAAAACCTACTCTTGGCTTATATAGAATGAGCGGAATCATCGTATATCTAAAGATAAAGAATCAGATTTCACACATTTTTTTATCACTTTTGTATACATCTCTATCATGGTGTAAATTTACTTCTAACTTTTGTGACTATAATCTCACCTCTTAAATCTCAATGGAATAGCATGATGTAATCTTTCTCGCCACTTGATGCCCTTTTGTAATTTCATTTTATCAATGAAATTTGATCGACTATTTCTAAAATCCCCCACACCAAAAGAGGAAAAGCTATTGCCAGTACAAGGGAAACAAAACAGAGGAAAATGAACGAAGAGACTAGAATTATGTAAAGTGACAACTGACCGTGATAGGAAAAAGAAGAAAGTTTCATAAAACTCTTAAATAAATGATGATTAAAAAATAGTCTACAACCATTAGGGAGGGGCTTGACCGATCAAAGCTCGAGTTCAACATCAACTGTATTCATGAAAAAGTCCAAAAAAAAAAAAAAAGAAAAGAAAAGAAAAATCTACGAGCCGAATAAGTTGATAAGGATGTCGCTATGGCAACAATAGGTTTACCGACATAGAGGAGACATTGCTCAAACTGGAAGCCATTATTACTTTTTCTTTAAAACATTAAAAACCCCATCATCTGACCAAAAATTTAGACTTTTTTTTTGGAGGGGGTGAAGAAAGAAACACCAAACAAAAACAAACAAGACTGCAAAACCTGAAGCCAAACTCAGTTGAAAGATGGAACTTTTTTTAAAAGGTTAAATCTTCTCTCTGTGTCTTTTTAAATGGGATACAACCAACTTTTGATGGCCTGCACTGTACAGACTTTCTCGGCCTCATTCATTATGGAACATCTAATTAATTTTCTAAGTTGATTGGGATGAGTAGTTTGTGTGGTTCAAAAGTAAAAGTACTGATAAGAAATTTAGAATAATGCTTCCAATTGTGTCAGTTTTTGTGCTGAAGCCTGAGGATAGAAAGAGTTTTTGTAGTGTTGTGGGTTTGTGGGGCTTTCTTGCATCCTCGAAAACCTAGAAAGGAATTCCATCTTCTTAAAATTATAAAACAATGGTTCAATTATATATTTATTGAACAAATACTTGTACTTTGCCATATTACCCTTTAGCTCTTCGTGAACATTTCAAAACTATTGGAATAGAAATAGTTTAGAATTTTAGATGAAAGTTGAGACATCAAATTTTGTGGCGATTCTATTCTAACACCCTAGTTTTCCAGACTAGGATTCAAAATCCGAACTTAGCACCTTATAATCTTACATTACTCACAAATCAATATGGATCTTTTTAGCATTGTTTGTTTTTACTCAGACGATTCATAGAAAATTTTTAAGAAGGCAGTCAACATAGAATTGCTTCCAAAGCT

Coding sequence (CDS)

ATGTTCTCATTTGTGACTACGATCGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTTCAATCTCAATGCCTCTTCAATCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATAAAAAACTGTTCCAACATAAACGAACTGCATGTTGTATATGCTTCCATGATCAAAAGTAATGCAATCCAAGATTGTTTTCTGGTGCATCAGTTTATTAGCGCGTCTTTTGCTTTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAACCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACCGTGGGTACCCATTTCGTGGTTTACAATGTTATGTACATATGTTGGAAGGATCGAACGTTTTGCCAAATAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCGAAGTTGGAGAAACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTACTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATACGGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACTAAGGACATAATCTCCTGGACAACCATGATCACTTGTTATTCACAGAACAAGCAATATCAAGATGCGTTGGCAATTTATAGTGAGACGAGATTGAATGGGATTATTCCTGATCAGATTGTAATTGAAGGGCTTGCAGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCCTGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGAAGAAGGCAGGAGTAGATTTTTGAGCATGACTCGTGATTATGGCATTTCTCCTGAAATCAGACACTACGGTTGCATGGTTGATATGTTAAGTAAAGCAGGATTGCTCAAAGAAGCATTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCACGGGAACTCTGTGATTGCAAAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTTAGCATGTGTGCTGAAGAGAAGGATTGGATGGAGGTTGCGCATATTCGATTAATGATGAAAGAACAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCGTACTGACAGAACTAGATGGACAACTGAAGCTAGCTGGTTACATACTTGAGCCTTCAGTATGCAGTACTGCTTTGGTTTTTCCAGAGGAAATTTGA

Protein sequence

MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI
Homology
BLAST of Cmc04g0098491.1 vs. NCBI nr
Match: XP_008447444.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis melo] >KAA0038095.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK20512.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1038.9 bits (2685), Expect = 1.5e-299
Identity = 528/599 (88.15%), Postives = 529/599 (88.31%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS
Sbjct: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG
Sbjct: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD
Sbjct: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
           FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM
Sbjct: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
Sbjct: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTMSTVVSACAHVGALELGKEIHQYVMSQGLNHDVYIGSALVDMYAKCGSLDWSLLIFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG
Sbjct: 361 LKDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 530
           LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG
Sbjct: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. NCBI nr
Match: XP_011651448.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >XP_011651449.1 pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >XP_011651450.1 pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >KGN57932.1 hypothetical protein Csa_011399 [Cucumis sativus])

HSP 1 Score: 961.8 bits (2485), Expect = 2.4e-276
Identity = 494/600 (82.33%), Postives = 506/600 (84.33%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVS-PSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYA 60
           MFSFVTT ALKQLTRSIGNFVS PSISMPLQ PS PSFKQTLLNRIKNCS INELH + A
Sbjct: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSCPSFKQTLLNRIKNCSTINELHGLCA 60

Query: 61  SMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFR 120
           SMIK+NAIQDCFLVHQFISASFA NSVHYPVFAFTQMENPNVFVYNAMIKGFVY GYPFR
Sbjct: 61  SMIKTNAIQDCFLVHQFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120

Query: 121 GLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
            LQCYVHMLE SNVLP SYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV
Sbjct: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180

Query: 181 DFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNT 240
           DFYSKLE LSEARKVFDEMCERDAFAWT MVSALARVGDMD+ARKLFEEMPERNTATWNT
Sbjct: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD 300
           MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSE RLNGIIPD
Sbjct: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300

Query: 301 QI---------------------------------------------------------- 360
           ++                                                          
Sbjct: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360

Query: 361 ------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEE 420
                       VIEGLAVHGYAEKALRMFAIMEREKI+PNGVTFISILSACTHAGLV+E
Sbjct: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420

Query: 421 GRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYP 530
           KLHGN  IA+DAVEQLMILEPMNSGHYNLLVSM AEEKDWMEVAHIR MMKE+GVEKKYP
Sbjct: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVAHIRSMMKEKGVEKKYP 540

BLAST of Cmc04g0098491.1 vs. NCBI nr
Match: XP_038888390.1 (pentatricopeptide repeat-containing protein At1g06143 [Benincasa hispida])

HSP 1 Score: 895.6 bits (2313), Expect = 2.1e-256
Identity = 458/594 (77.10%), Postives = 479/594 (80.64%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFSFV T ALKQLTRSI NFVS SISMP Q PS PSFKQTLLNRIKNCS INEL  +YAS
Sbjct: 1   MFSFVITNALKQLTRSISNFVSSSISMPPQPPSIPSFKQTLLNRIKNCSTINELDGIYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIK+NA QDCFLV+QFIS S AFNSV YPV AFTQMENPNVFVYNAMI+GFVY GYPF  
Sbjct: 61  MIKTNATQDCFLVNQFISTSLAFNSVDYPVIAFTQMENPNVFVYNAMIRGFVYCGYPFGA 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           LQCYVHMLE + V P SYTFSSLVKACTFMCAVELG+M+HCHIWK GFESHLFVQTAL+D
Sbjct: 121 LQCYVHMLEEAKVFPTSYTFSSLVKACTFMCAVELGRMIHCHIWKSGFESHLFVQTALID 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
           FYS LE+LSEARKVFDEM ERD+FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTM
Sbjct: 181 FYSNLERLSEARKVFDEMRERDSFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQYQ+AL IY + RLNGIIPD+
Sbjct: 241 IDGYARLGNVESAEFLFNQMPVRDIISWTTMITCYSQNKQYQEALMIYIKMRLNGIIPDE 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTLSTVVSACAHVGALELGKTIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDRSLLVFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMF IMEREKI PNGVTFISILSACTHAGLVEEG
Sbjct: 361 LMDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIGPNGVTFISILSACTHAGLVEEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRFLSMTRDYGI PEI HYGCMVDMLSKAG L EALELIKSMEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFLSMTRDYGIRPEIGHYGCMVDMLSKAGFLDEALELIKSMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 525
           LHGNS IAKDAV+QLMILEPM+SGHYNLLVSM AEEKDWMEVAHIR MMKEQGVEKKYPG
Sbjct: 481 LHGNSEIAKDAVQQLMILEPMSSGHYNLLVSMYAEEKDWMEVAHIRAMMKEQGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. NCBI nr
Match: XP_022967388.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima])

HSP 1 Score: 829.7 bits (2142), Expect = 1.4e-236
Identity = 424/588 (72.11%), Postives = 455/588 (77.38%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFS   T ALKQ+TRSI NFVS S S  LQ P  P+FKQTLL+RIKNCS INEL  +YAS
Sbjct: 1   MFSITPTNALKQITRSISNFVSSSTSRTLQGPYVPTFKQTLLDRIKNCSTINELDGIYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPNVFVYNAMI+GFVY GYPFR 
Sbjct: 61  MIKANATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFRA 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HC IW  G E  +FVQT+L+D
Sbjct: 121 IQCYVHMLE-SQVLPSSYTFSSLVKACTCMCALDLGRMIHCQIWTHGLELDVFVQTSLID 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
            YS LE+  +ARKVFDEM ERD FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTM
Sbjct: 181 LYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY + RLNGIIPD+
Sbjct: 241 IDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALMIYGDMRLNGIIPDE 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTMSTVVSACAHVGALELGKEIHHYAMSRGLNLDVYIGSALVDMYAKCGSLDRSLLVFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMF IMEREKI+PNGVTFISILSACTHAGLV EG
Sbjct: 361 LKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVVEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRFLSM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFLSMIRDYGIHPEVEHYGCMVDMLSKAGLLDEALELINGMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 519
           LHGNS IAKDAV +L ILEP NSGHYNLLVSM AEEK W+EVAHIR MMKE GVEKKYPG
Sbjct: 481 LHGNSEIAKDAVRRLNILEPKNSGHYNLLVSMYAEEKHWIEVAHIRAMMKENGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. NCBI nr
Match: XP_023554768.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 828.6 bits (2139), Expect = 3.1e-236
Identity = 424/588 (72.11%), Postives = 452/588 (76.87%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFS   T ALKQ+TRSI NF S S    LQ     +FKQTLL+RIKNCS INEL  +YAS
Sbjct: 1   MFSITPTNALKQITRSISNFASSSTPRTLQGSYVSTFKQTLLDRIKNCSTINELDGIYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPNVFVYNAMI+GFVY GYPFR 
Sbjct: 61  MIKTNATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFRA 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HCHIWK G E  +FVQT+L+D
Sbjct: 121 IQCYVHMLE-SQVLPSSYTFSSLVKACTCMCALDLGRMIHCHIWKNGLELDVFVQTSLID 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
            YS LE+  +ARKVFDEM ERD FAWTTMVSALAR GDMDTARKLFEEMPE NTATWNTM
Sbjct: 181 LYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDTARKLFEEMPESNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY + RLNGIIPD+
Sbjct: 241 IDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALTIYGDMRLNGIIPDE 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTMSTVVSACAHVGALDLGKEIHHYAMSWGLNLDVYIGSALVDMYAKCGSLDRSLLVFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMF IMEREKI+PNGVTFISILSACTHAGLV EG
Sbjct: 361 LKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVIEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRF SM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFSSMIRDYGIRPEVEHYGCMVDMLSKAGLLDEALELINGMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 519
           LHGNS IAKDAV QL ILEP NSGHYNLLVSM AEEK WMEVAHIR MMKE GVEKKYPG
Sbjct: 481 LHGNSEIAKDAVRQLTILEPKNSGHYNLLVSMYAEEKHWMEVAHIRAMMKENGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. ExPASy Swiss-Prot
Match: Q56X05 (Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=2)

HSP 1 Score: 484.2 bits (1245), Expect = 1.9e-135
Identity = 261/541 (48.24%), Postives = 337/541 (62.29%), Query Frame = 0

Query: 45  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVY 104
           IK CS    L    A+MIK++  QDC L++QFI+A  +F  +   V   TQM+ PNVFVY
Sbjct: 35  IKQCSTPKLLESALAAMIKTSLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEPNVFVY 94

Query: 105 NAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIW 164
           NA+ KGFV   +P R L+ YV ML  S V P+SYT+SSLVKA +F  A   G+ +  HIW
Sbjct: 95  NALFKGFVTCSHPIRSLELYVRMLRDS-VSPSSYTYSSLVKASSF--ASRFGESLQAHIW 154

Query: 165 KKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARK 224
           K GF  H+ +QT L+DFYS   ++ EARKVFDEM ERD  AWTTMVSA  RV DMD+A  
Sbjct: 155 KFGFGFHVKIQTTLIDFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANS 214

Query: 225 LFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDA 284
           L  +M E+N AT N +I+GY  LGN+E AE LFNQMP KDIISWTTMI  YSQNK+Y++A
Sbjct: 215 LANQMSEKNEATSNCLINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREA 274

Query: 285 LAIYSETRLNGIIPDQI------------------------------------------- 344
           +A++ +    GIIPD++                                           
Sbjct: 275 IAVFYKMMEEGIIPDEVTMSTVISACAHLGVLEIGKEVHMYTLQNGFVLDVYIGSALVDM 334

Query: 345 ---------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTF 404
                                      +IEGLA HG+A++AL+MFA ME E + PN VTF
Sbjct: 335 YSKCGSLERALLVFFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTF 394

Query: 405 ISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSME 464
           +S+ +ACTHAGLV+EGR  + SM  DY I   + HYG MV + SKAGL+ EALELI +ME
Sbjct: 395 VSVFTACTHAGLVDEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNME 454

Query: 465 FEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAH 516
           FEPN++IWGALL+GC++H N VIA+ A  +LM+LEPMNSG+Y LLVSM AE+  W +VA 
Sbjct: 455 FEPNAVIWGALLDGCRIHKNLVIAEIAFNKLMVLEPMNSGYYFLLVSMYAEQNRWRDVAE 514

BLAST of Cmc04g0098491.1 vs. ExPASy Swiss-Prot
Match: Q9LS72 (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 8.7e-88
Identity = 186/586 (31.74%), Postives = 312/586 (53.24%), Query Frame = 0

Query: 26  SMPLQSPSRPSFKQTLLNRIKN---CSNINELHVVYASMIKSNAIQDCFLVHQFISASFA 85
           S+P+++PS  S ++    R+++   C+N+N++  ++A +I+ N  +D  +  + ISA   
Sbjct: 4   SLPVRAPSWVSSRRIFEERLQDLPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSL 63

Query: 86  FNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSS 145
               +  V  F Q++ PNV + N++I+       P++    +  M +   +  +++T+  
Sbjct: 64  CRQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEM-QRFGLFADNFTYPF 123

Query: 146 LVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSK------------LEKLSE 205
           L+KAC+    + + +M+H HI K G  S ++V  AL+D YS+             EK+SE
Sbjct: 124 LLKACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSE 183

Query: 206 ---------------------ARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEM 265
                                AR++FDEM +RD  +W TM+   AR  +M  A +LFE+M
Sbjct: 184 RDTVSWNSMLGGLVKAGELRDARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKM 243

Query: 266 PERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDA--- 325
           PERNT +W+TM+ GY++ G++E A ++F++M  P K++++WT +I  Y++    ++A   
Sbjct: 244 PERNTVSWSTMVMGYSKAGDMEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRL 303

Query: 326 -------------------LAIYSETRL-------------------------------- 385
                              LA  +E+ L                                
Sbjct: 304 VDQMVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAK 363

Query: 386 ----------------NGIIPDQIVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISI 445
                             ++    ++ GL VHG+ ++A+ +F+ M RE I P+ VTFI++
Sbjct: 364 CGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAV 423

Query: 446 LSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEP 504
           L +C HAGL++EG   F SM + Y + P++ HYGC+VD+L + G LKEA++++++M  EP
Sbjct: 424 LCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEP 483

BLAST of Cmc04g0098491.1 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 307.8 bits (787), Expect = 2.4e-82
Identity = 169/513 (32.94%), Postives = 271/513 (52.83%), Query Frame = 0

Query: 45  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFIS---ASFAFNSVHYPVFAFTQMENPNV 104
           ++ CS   EL  ++A M+K+  +QD + + +F+S   +S + + + Y    F   + P+ 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 105 FVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHC 164
           F++N MI+GF     P R L  Y  ML  S+   N+YTF SL+KAC+ + A E    +H 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRML-CSSAPHNAYTFPSLLKACSNLSAFEETTQIHA 140

Query: 165 HIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDT 224
            I K G+E+ ++   +L++ Y+       A  +FD + E D  +W +++    + G MD 
Sbjct: 141 QITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDI 200

Query: 225 ARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDI---------------- 284
           A  LF +M E+N  +W TMI GY +    + A  LF++M   D+                
Sbjct: 201 ALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQ 260

Query: 285 -----------------------ISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIV 344
                                  +    +I  Y++  + ++AL ++   +   +     +
Sbjct: 261 LGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTAL 320

Query: 345 IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYG 404
           I G A HG+  +A+  F  M++  I PN +TF ++L+AC++ GLVEEG+  F SM RDY 
Sbjct: 321 ISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYN 380

Query: 405 ISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAV 464
           + P I HYGC+VD+L +AGLL EA   I+ M  +PN++IWGALL  C++H N  + ++  
Sbjct: 381 LKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIG 440

Query: 465 EQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQ 516
           E L+ ++P + G Y    ++ A +K W + A  R +MKEQGV  K PG S I LEGT H+
Sbjct: 441 EILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGV-AKVPGCSTISLEGTTHE 500

BLAST of Cmc04g0098491.1 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.7e-80
Identity = 186/647 (28.75%), Postives = 300/647 (46.37%), Query Frame = 0

Query: 23  PSISMP---LQSPSRPSF----KQTLLNRIKNCSNINELHVVYASMIK---SNAIQDCFL 82
           PS S P   L S S P +        L+ + NC  +  L +++A MIK    N       
Sbjct: 11  PSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 70

Query: 83  VHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSN 142
           + +F   S  F  + Y +  F  ++ PN+ ++N M +G      P   L+ YV M+    
Sbjct: 71  LIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI-SLG 130

Query: 143 VLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEAR 202
           +LPNSYTF  ++K+C    A + GQ +H H+ K G +  L+V T+L+  Y +  +L +A 
Sbjct: 131 LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAH 190

Query: 203 KVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGN--- 262
           KVFD+   RD  ++T ++   A  G ++ A+KLF+E+P ++  +WN MI GYA  GN   
Sbjct: 191 KVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE 250

Query: 263 ------------------------------------------------------------ 322
                                                                       
Sbjct: 251 ALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALID 310

Query: 323 -------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI- 382
                  +E+A  LF ++P KD+ISW T+I  Y+    Y++AL ++ E   +G  P+ + 
Sbjct: 311 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVT 370

Query: 383 ------------------------------------------------------------ 442
                                                                       
Sbjct: 371 MLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS 430

Query: 443 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 502
                      +I G A+HG A+ +  +F+ M +  I P+ +TF+ +LSAC+H+G+++ G
Sbjct: 431 ILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLG 490

Query: 503 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 518
           R  F +MT+DY ++P++ HYGCM+D+L  +GL KEA E+I  ME EP+ +IW +LL  CK
Sbjct: 491 RHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACK 550

BLAST of Cmc04g0098491.1 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 295.8 bits (756), Expect = 9.6e-79
Identity = 172/523 (32.89%), Postives = 289/523 (55.26%), Query Frame = 0

Query: 37  FKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISA-------SFAFNSVHYP 96
           FK   L  +++CS+ ++L +++  +++++ I D F+  + ++        +   N + Y 
Sbjct: 11  FKHPKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYA 70

Query: 97  VFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTF 156
              F+Q++NPN+FV+N +I+ F     P +    Y  ML+ S + P++ TF  L+KA + 
Sbjct: 71  YGIFSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLK-SRIWPDNITFPFLIKASSE 130

Query: 157 MCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTM 216
           M  V +G+  H  I + GF++ ++V+ +LV  Y+    ++ A ++F +M  RD  +WT+M
Sbjct: 131 MECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSM 190

Query: 217 VSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWT 276
           V+   + G ++ AR++F+EMP RN  TW+ MI+GYA+    E A  LF  M  + +++  
Sbjct: 191 VAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANE 250

Query: 277 TMITCYSQNKQYQDAL---------AIYSETRLNGIIPDQIV------------------ 336
           T++     +  +  AL          + S   +N I+   +V                  
Sbjct: 251 TVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEG 310

Query: 337 ------------IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 396
                       I+GLAVHG+A KA+  F+ M     +P  VTF ++LSAC+H GLVE+G
Sbjct: 311 LPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKG 370

Query: 397 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 456
              + +M +D+GI P + HYGC+VDML +AG L EA   I  M  +PN+ I GALL  CK
Sbjct: 371 LEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACK 430

Query: 457 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 513
           ++ N+ +A+     L+ ++P +SG+Y LL ++ A    W ++  +R MMKE+ V KK PG
Sbjct: 431 IYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLV-KKPPG 490

BLAST of Cmc04g0098491.1 vs. ExPASy TrEMBL
Match: A0A5A7T9J0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold237G00860 PE=4 SV=1)

HSP 1 Score: 1038.9 bits (2685), Expect = 7.4e-300
Identity = 528/599 (88.15%), Postives = 529/599 (88.31%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS
Sbjct: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG
Sbjct: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD
Sbjct: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
           FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM
Sbjct: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
Sbjct: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTMSTVVSACAHVGALELGKEIHQYVMSQGLNHDVYIGSALVDMYAKCGSLDWSLLIFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG
Sbjct: 361 LKDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 530
           LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG
Sbjct: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. ExPASy TrEMBL
Match: A0A1S3BHH1 (pentatricopeptide repeat-containing protein At1g06145-like OS=Cucumis melo OX=3656 GN=LOC103489889 PE=4 SV=1)

HSP 1 Score: 1038.9 bits (2685), Expect = 7.4e-300
Identity = 528/599 (88.15%), Postives = 529/599 (88.31%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS
Sbjct: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG
Sbjct: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD
Sbjct: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
           FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM
Sbjct: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ
Sbjct: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTMSTVVSACAHVGALELGKEIHQYVMSQGLNHDVYIGSALVDMYAKCGSLDWSLLIFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG
Sbjct: 361 LKDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 530
           LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG
Sbjct: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. ExPASy TrEMBL
Match: A0A0A0LB99 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G395920 PE=4 SV=1)

HSP 1 Score: 961.8 bits (2485), Expect = 1.1e-276
Identity = 494/600 (82.33%), Postives = 506/600 (84.33%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVS-PSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYA 60
           MFSFVTT ALKQLTRSIGNFVS PSISMPLQ PS PSFKQTLLNRIKNCS INELH + A
Sbjct: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSCPSFKQTLLNRIKNCSTINELHGLCA 60

Query: 61  SMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFR 120
           SMIK+NAIQDCFLVHQFISASFA NSVHYPVFAFTQMENPNVFVYNAMIKGFVY GYPFR
Sbjct: 61  SMIKTNAIQDCFLVHQFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120

Query: 121 GLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
            LQCYVHMLE SNVLP SYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV
Sbjct: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180

Query: 181 DFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNT 240
           DFYSKLE LSEARKVFDEMCERDAFAWT MVSALARVGDMD+ARKLFEEMPERNTATWNT
Sbjct: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD 300
           MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSE RLNGIIPD
Sbjct: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300

Query: 301 QI---------------------------------------------------------- 360
           ++                                                          
Sbjct: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360

Query: 361 ------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEE 420
                       VIEGLAVHGYAEKALRMFAIMEREKI+PNGVTFISILSACTHAGLV+E
Sbjct: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420

Query: 421 GRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYP 530
           KLHGN  IA+DAVEQLMILEPMNSGHYNLLVSM AEEKDWMEVAHIR MMKE+GVEKKYP
Sbjct: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVAHIRSMMKEKGVEKKYP 540

BLAST of Cmc04g0098491.1 vs. ExPASy TrEMBL
Match: A0A6J1HUY6 (pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita maxima OX=3661 GN=LOC111466933 PE=4 SV=1)

HSP 1 Score: 829.7 bits (2142), Expect = 6.8e-237
Identity = 424/588 (72.11%), Postives = 455/588 (77.38%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFS   T ALKQ+TRSI NFVS S S  LQ P  P+FKQTLL+RIKNCS INEL  +YAS
Sbjct: 1   MFSITPTNALKQITRSISNFVSSSTSRTLQGPYVPTFKQTLLDRIKNCSTINELDGIYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPNVFVYNAMI+GFVY GYPFR 
Sbjct: 61  MIKANATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFRA 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HC IW  G E  +FVQT+L+D
Sbjct: 121 IQCYVHMLE-SQVLPSSYTFSSLVKACTCMCALDLGRMIHCQIWTHGLELDVFVQTSLID 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
            YS LE+  +ARKVFDEM ERD FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTM
Sbjct: 181 LYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY + RLNGIIPD+
Sbjct: 241 IDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALMIYGDMRLNGIIPDE 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTMSTVVSACAHVGALELGKEIHHYAMSRGLNLDVYIGSALVDMYAKCGSLDRSLLVFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMF IMEREKI+PNGVTFISILSACTHAGLV EG
Sbjct: 361 LKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVVEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRFLSM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFLSMIRDYGIHPEVEHYGCMVDMLSKAGLLDEALELINGMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 519
           LHGNS IAKDAV +L ILEP NSGHYNLLVSM AEEK W+EVAHIR MMKE GVEKKYPG
Sbjct: 481 LHGNSEIAKDAVRRLNILEPKNSGHYNLLVSMYAEEKHWIEVAHIRAMMKENGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. ExPASy TrEMBL
Match: A0A6J1HIB3 (pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita moschata OX=3662 GN=LOC111463844 PE=4 SV=1)

HSP 1 Score: 824.7 bits (2129), Expect = 2.2e-235
Identity = 421/588 (71.60%), Postives = 453/588 (77.04%), Query Frame = 0

Query: 1   MFSFVTTIALKQLTRSIGNFVSPSISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYAS 60
           MFS   T ALKQ+TRSI NFVS S    LQ     +FKQTLL+RIKNCS INEL  +YAS
Sbjct: 1   MFSITPTNALKQITRSISNFVSSSTPRTLQGSYVSTFKQTLLDRIKNCSTINELDGIYAS 60

Query: 61  MIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRG 120
           MIK+NA QDCFLV+QFISAS  FNSV YPV AFTQMENPNVFVYNAMI+GFVY GYPFR 
Sbjct: 61  MIKTNATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFRA 120

Query: 121 LQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVD 180
           +QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HCHIWK G E  +FVQT+L+D
Sbjct: 121 IQCYVHMLE-SKVLPSSYTFSSLVKACTCMCALDLGRMIHCHIWKNGLELDVFVQTSLID 180

Query: 181 FYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTM 240
            YS LE+  +ARKVFDEM ERD FAWTTMVSALAR GDMD+ARKLFEEMPE NTATWNTM
Sbjct: 181 LYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNTM 240

Query: 241 IDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQ 300
           IDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY   RLNGIIPD+
Sbjct: 241 IDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALTIYGNMRLNGIIPDE 300

Query: 301 I----------------------------------------------------------- 360
           +                                                           
Sbjct: 301 VTMSTVVSACAHVGALELGKEIHHYAMSRGLNLDVYIGSALVDMYAKCGSLDRSLLVFFK 360

Query: 361 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 420
                      VIEGLAVHGYAEKALRMF IMEREKI+PNGVTFISILSACTHAGLV EG
Sbjct: 361 LKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVIEG 420

Query: 421 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 480
           RSRF SM RDYGI PE+ HYGCMVDMLSKAGLL EALELI  MEFEPNSIIWGALLNGCK
Sbjct: 421 RSRFSSMIRDYGIRPEVEHYGCMVDMLSKAGLLDEALELINGMEFEPNSIIWGALLNGCK 480

Query: 481 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 519
           LHGNS IAKDAV+QL +LEP NSGHYNLLVSM AEEK WM+VAHIR MMKE GVEKKYPG
Sbjct: 481 LHGNSEIAKDAVQQLTVLEPKNSGHYNLLVSMYAEEKHWMKVAHIRAMMKENGVEKKYPG 540

BLAST of Cmc04g0098491.1 vs. TAIR 10
Match: AT1G06150.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 484.2 bits (1245), Expect = 1.4e-136
Identity = 261/541 (48.24%), Postives = 337/541 (62.29%), Query Frame = 0

Query: 45   IKNCSNINELHVVYASMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVY 104
            IK CS    L    A+MIK++  QDC L++QFI+A  +F  +   V   TQM+ PNVFVY
Sbjct: 780  IKQCSTPKLLESALAAMIKTSLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEPNVFVY 839

Query: 105  NAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIW 164
            NA+ KGFV   +P R L+ YV ML  S V P+SYT+SSLVKA +F  A   G+ +  HIW
Sbjct: 840  NALFKGFVTCSHPIRSLELYVRMLRDS-VSPSSYTYSSLVKASSF--ASRFGESLQAHIW 899

Query: 165  KKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARK 224
            K GF  H+ +QT L+DFYS   ++ EARKVFDEM ERD  AWTTMVSA  RV DMD+A  
Sbjct: 900  KFGFGFHVKIQTTLIDFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANS 959

Query: 225  LFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDA 284
            L  +M E+N AT N +I+GY  LGN+E AE LFNQMP KDIISWTTMI  YSQNK+Y++A
Sbjct: 960  LANQMSEKNEATSNCLINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREA 1019

Query: 285  LAIYSETRLNGIIPDQI------------------------------------------- 344
            +A++ +    GIIPD++                                           
Sbjct: 1020 IAVFYKMMEEGIIPDEVTMSTVISACAHLGVLEIGKEVHMYTLQNGFVLDVYIGSALVDM 1079

Query: 345  ---------------------------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTF 404
                                       +IEGLA HG+A++AL+MFA ME E + PN VTF
Sbjct: 1080 YSKCGSLERALLVFFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTF 1139

Query: 405  ISILSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSME 464
            +S+ +ACTHAGLV+EGR  + SM  DY I   + HYG MV + SKAGL+ EALELI +ME
Sbjct: 1140 VSVFTACTHAGLVDEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNME 1199

Query: 465  FEPNSIIWGALLNGCKLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAH 516
            FEPN++IWGALL+GC++H N VIA+ A  +LM+LEPMNSG+Y LLVSM AE+  W +VA 
Sbjct: 1200 FEPNAVIWGALLDGCRIHKNLVIAEIAFNKLMVLEPMNSGYYFLLVSMYAEQNRWRDVAE 1259

BLAST of Cmc04g0098491.1 vs. TAIR 10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 325.9 bits (834), Expect = 6.2e-89
Identity = 186/586 (31.74%), Postives = 312/586 (53.24%), Query Frame = 0

Query: 26  SMPLQSPSRPSFKQTLLNRIKN---CSNINELHVVYASMIKSNAIQDCFLVHQFISASFA 85
           S+P+++PS  S ++    R+++   C+N+N++  ++A +I+ N  +D  +  + ISA   
Sbjct: 4   SLPVRAPSWVSSRRIFEERLQDLPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSL 63

Query: 86  FNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSS 145
               +  V  F Q++ PNV + N++I+       P++    +  M +   +  +++T+  
Sbjct: 64  CRQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEM-QRFGLFADNFTYPF 123

Query: 146 LVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSK------------LEKLSE 205
           L+KAC+    + + +M+H HI K G  S ++V  AL+D YS+             EK+SE
Sbjct: 124 LLKACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSE 183

Query: 206 ---------------------ARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEM 265
                                AR++FDEM +RD  +W TM+   AR  +M  A +LFE+M
Sbjct: 184 RDTVSWNSMLGGLVKAGELRDARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKM 243

Query: 266 PERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDA--- 325
           PERNT +W+TM+ GY++ G++E A ++F++M  P K++++WT +I  Y++    ++A   
Sbjct: 244 PERNTVSWSTMVMGYSKAGDMEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRL 303

Query: 326 -------------------LAIYSETRL-------------------------------- 385
                              LA  +E+ L                                
Sbjct: 304 VDQMVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAK 363

Query: 386 ----------------NGIIPDQIVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISI 445
                             ++    ++ GL VHG+ ++A+ +F+ M RE I P+ VTFI++
Sbjct: 364 CGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAV 423

Query: 446 LSACTHAGLVEEGRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEP 504
           L +C HAGL++EG   F SM + Y + P++ HYGC+VD+L + G LKEA++++++M  EP
Sbjct: 424 LCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEP 483

BLAST of Cmc04g0098491.1 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 307.8 bits (787), Expect = 1.7e-83
Identity = 169/513 (32.94%), Postives = 271/513 (52.83%), Query Frame = 0

Query: 45  IKNCSNINELHVVYASMIKSNAIQDCFLVHQFIS---ASFAFNSVHYPVFAFTQMENPNV 104
           ++ CS   EL  ++A M+K+  +QD + + +F+S   +S + + + Y    F   + P+ 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 105 FVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHC 164
           F++N MI+GF     P R L  Y  ML  S+   N+YTF SL+KAC+ + A E    +H 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRML-CSSAPHNAYTFPSLLKACSNLSAFEETTQIHA 140

Query: 165 HIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDT 224
            I K G+E+ ++   +L++ Y+       A  +FD + E D  +W +++    + G MD 
Sbjct: 141 QITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDI 200

Query: 225 ARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDI---------------- 284
           A  LF +M E+N  +W TMI GY +    + A  LF++M   D+                
Sbjct: 201 ALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQ 260

Query: 285 -----------------------ISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQIV 344
                                  +    +I  Y++  + ++AL ++   +   +     +
Sbjct: 261 LGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTAL 320

Query: 345 IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYG 404
           I G A HG+  +A+  F  M++  I PN +TF ++L+AC++ GLVEEG+  F SM RDY 
Sbjct: 321 ISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYN 380

Query: 405 ISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCKLHGNSVIAKDAV 464
           + P I HYGC+VD+L +AGLL EA   I+ M  +PN++IWGALL  C++H N  + ++  
Sbjct: 381 LKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIG 440

Query: 465 EQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPGSSWIELEGTIHQ 516
           E L+ ++P + G Y    ++ A +K W + A  R +MKEQGV  K PG S I LEGT H+
Sbjct: 441 EILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGV-AKVPGCSTISLEGTTHE 500

BLAST of Cmc04g0098491.1 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 301.6 bits (771), Expect = 1.2e-81
Identity = 186/647 (28.75%), Postives = 300/647 (46.37%), Query Frame = 0

Query: 23  PSISMP---LQSPSRPSF----KQTLLNRIKNCSNINELHVVYASMIK---SNAIQDCFL 82
           PS S P   L S S P +        L+ + NC  +  L +++A MIK    N       
Sbjct: 11  PSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 70

Query: 83  VHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSN 142
           + +F   S  F  + Y +  F  ++ PN+ ++N M +G      P   L+ YV M+    
Sbjct: 71  LIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI-SLG 130

Query: 143 VLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEAR 202
           +LPNSYTF  ++K+C    A + GQ +H H+ K G +  L+V T+L+  Y +  +L +A 
Sbjct: 131 LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAH 190

Query: 203 KVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGN--- 262
           KVFD+   RD  ++T ++   A  G ++ A+KLF+E+P ++  +WN MI GYA  GN   
Sbjct: 191 KVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE 250

Query: 263 ------------------------------------------------------------ 322
                                                                       
Sbjct: 251 ALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALID 310

Query: 323 -------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPDQI- 382
                  +E+A  LF ++P KD+ISW T+I  Y+    Y++AL ++ E   +G  P+ + 
Sbjct: 311 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVT 370

Query: 383 ------------------------------------------------------------ 442
                                                                       
Sbjct: 371 MLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS 430

Query: 443 -----------VIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 502
                      +I G A+HG A+ +  +F+ M +  I P+ +TF+ +LSAC+H+G+++ G
Sbjct: 431 ILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLG 490

Query: 503 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 518
           R  F +MT+DY ++P++ HYGCM+D+L  +GL KEA E+I  ME EP+ +IW +LL  CK
Sbjct: 491 RHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACK 550

BLAST of Cmc04g0098491.1 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 295.8 bits (756), Expect = 6.8e-80
Identity = 172/523 (32.89%), Postives = 289/523 (55.26%), Query Frame = 0

Query: 37  FKQTLLNRIKNCSNINELHVVYASMIKSNAIQDCFLVHQFISA-------SFAFNSVHYP 96
           FK   L  +++CS+ ++L +++  +++++ I D F+  + ++        +   N + Y 
Sbjct: 11  FKHPKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYA 70

Query: 97  VFAFTQMENPNVFVYNAMIKGFVYRGYPFRGLQCYVHMLEGSNVLPNSYTFSSLVKACTF 156
              F+Q++NPN+FV+N +I+ F     P +    Y  ML+ S + P++ TF  L+KA + 
Sbjct: 71  YGIFSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLK-SRIWPDNITFPFLIKASSE 130

Query: 157 MCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEKLSEARKVFDEMCERDAFAWTTM 216
           M  V +G+  H  I + GF++ ++V+ +LV  Y+    ++ A ++F +M  RD  +WT+M
Sbjct: 131 MECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSM 190

Query: 217 VSALARVGDMDTARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWT 276
           V+   + G ++ AR++F+EMP RN  TW+ MI+GYA+    E A  LF  M  + +++  
Sbjct: 191 VAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANE 250

Query: 277 TMITCYSQNKQYQDAL---------AIYSETRLNGIIPDQIV------------------ 336
           T++     +  +  AL          + S   +N I+   +V                  
Sbjct: 251 TVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEG 310

Query: 337 ------------IEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEEG 396
                       I+GLAVHG+A KA+  F+ M     +P  VTF ++LSAC+H GLVE+G
Sbjct: 311 LPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKG 370

Query: 397 RSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGCK 456
              + +M +D+GI P + HYGC+VDML +AG L EA   I  M  +PN+ I GALL  CK
Sbjct: 371 LEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACK 430

Query: 457 LHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYPG 513
           ++ N+ +A+     L+ ++P +SG+Y LL ++ A    W ++  +R MMKE+ V KK PG
Sbjct: 431 IYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLV-KKPPG 490

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008447444.11.5e-29988.15PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis m... [more]
XP_011651448.12.4e-27682.33pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >XP_0116... [more]
XP_038888390.12.1e-25677.10pentatricopeptide repeat-containing protein At1g06143 [Benincasa hispida][more]
XP_022967388.11.4e-23672.11pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima][more]
XP_023554768.13.1e-23672.11pentatricopeptide repeat-containing protein At1g06143 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
Q56X051.9e-13548.24Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX... [more]
Q9LS728.7e-8831.74Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX... [more]
Q9FJY72.4e-8232.94Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q9LN011.7e-8028.75Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9FG169.6e-7932.89Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A5A7T9J07.4e-30088.15Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BHH17.4e-30088.15pentatricopeptide repeat-containing protein At1g06145-like OS=Cucumis melo OX=36... [more]
A0A0A0LB991.1e-27682.33Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G395920 PE=4 SV=1[more]
A0A6J1HUY66.8e-23772.11pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita maxima OX=366... [more]
A0A6J1HIB32.2e-23571.60pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT1G06150.11.4e-13648.24basic helix-loop-helix (bHLH) DNA-binding superfamily protein [more]
AT3G29230.16.2e-8931.74Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G66520.11.7e-8332.94Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.11.2e-8128.75Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G06540.16.8e-8032.89Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 303..341
e-value: 0.0043
score: 17.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 99..147
e-value: 1.0E-8
score: 35.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 236..265
e-value: 3.5E-8
score: 31.1
coord: 266..299
e-value: 2.8E-6
score: 25.2
coord: 175..202
e-value: 5.0E-4
score: 18.1
coord: 205..233
e-value: 1.5E-6
score: 26.0
coord: 370..394
e-value: 0.0028
score: 15.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 369..394
e-value: 8.3E-4
score: 19.4
coord: 236..264
e-value: 1.5E-8
score: 34.3
coord: 266..296
e-value: 1.2E-4
score: 22.1
coord: 205..233
e-value: 6.4E-7
score: 29.2
coord: 176..201
e-value: 0.0033
score: 17.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 202..232
score: 10.972319
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 233..267
score: 11.783455
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 36..153
e-value: 7.6E-10
score: 40.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 309..484
e-value: 1.9E-25
score: 92.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 154..232
e-value: 4.0E-16
score: 60.9
coord: 233..308
e-value: 1.5E-19
score: 72.1
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 178..285
NoneNo IPR availablePANTHERPTHR47925:SF99BNAC05G04200D PROTEINcoord: 302..517
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 302..517
coord: 35..301
NoneNo IPR availablePANTHERPTHR47925:SF99BNAC05G04200D PROTEINcoord: 35..301

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cmc04g0098491Cmc04g0098491gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cmc04g0098491.1-exonCmc04g0098491.1-exon-CMiso1.1chr04:14210601..14211654exon
Cmc04g0098491.1-exonCmc04g0098491.1-exon-CMiso1.1chr04:14211809..14211812exon
Cmc04g0098491.1-exonCmc04g0098491.1-exon-CMiso1.1chr04:14211869..14212743exon
Cmc04g0098491.1-exonCmc04g0098491.1-exon-CMiso1.1chr04:14213398..14216588exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cmc04g0098491.1-five_prime_utrCmc04g0098491.1-five_prime_utr-CMiso1.1chr04:14210601..14210754five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cmc04g0098491.1-cdsCmc04g0098491.1-cds-CMiso1.1chr04:14210755..14211654CDS
Cmc04g0098491.1-cdsCmc04g0098491.1-cds-CMiso1.1chr04:14211809..14211812CDS
Cmc04g0098491.1-cdsCmc04g0098491.1-cds-CMiso1.1chr04:14211869..14212554CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cmc04g0098491.1-three_prime_utrCmc04g0098491.1-three_prime_utr-CMiso1.1chr04:14212555..14212743three_prime_UTR
Cmc04g0098491.1-three_prime_utrCmc04g0098491.1-three_prime_utr-CMiso1.1chr04:14213398..14216588three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cmc04g0098491.1Cmc04g0098491.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding