ClCG06G002140 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG06G002140
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionMADS-box transcription factor 8-like
LocationCG_Chr06: 2343242 .. 2367304 (+)
RNA-Seq ExpressionClCG06G002140
SyntenyClCG06G002140
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAATAAAAAGGAGAAGCGAAGAGAGGGAGAAAGAGGAATTAGGGTTGACTGTGGAGATACACAAACAAGAGGAGAAATGGGACGGAAGAAGATCGAGGTGAAGCTAATCGAAGATAGGTGTAATCGCCATGTCACTTTCTGCAAGAGAAGATCTGGATTAATCAAGAAAGCGCGAGAACTCTCTGTTCTTTGCGACGTTGAGGTAGGACTCGTCGTCTTCACCAATCGCGGCCGTCTCTATGAGTTTTGCAGCGGAAATAGGTACTTATAACGCTTTTTCATTCTCCGTTTTTCTTTCCGATTCTTTAATCTTCGAATGTGTGCGCGGTTTCCTTCATTTCGATTATTATATTTATGATAGTGCAAGCTGAAGCTGAATGCCTTTAAAATGAACTTGAATTCAACTTCTGCCCTTTCTTTTTCTTTCTATATATCTGCTATTTTAAGTGATTTGATTTCTTCGAAGCTCTGTCATAGCTTCTAATTTCTGTCTTCTAAATATCGACGAACGAAATGCAAACGCTCTTCGTCTTTCTGTTCTTCGATTCTTCCTCAGGAAAAATCGATGAGATGGAGTACTTTTAGTGTTGCCGTGTGCTTTCTTCGATCTGAGTTTATTTTCTTCTGTTTTTTTTTGGATTATTTCTCCTGAGTGTTTCCACTAACATAATGCGGGACTGTTTGATCTTTCTGAGAACATCCCGGAAAATCGTGGGAACTTTGCTTTCCGACGGTTTTCTCGCTCTCTGATTTTTTTTTCTTCCTCAATGTTTCTTTTTCAGCAGTATTTACGTGTCAACAAATATTTTAAAGACATGTAATTACGAAACTGTCACTGCCTAGTTATAACACATGTTTTATGGTGAAATTTAAATGCCACAGTAAATGTTTCACAAAAACTCACTAACTCATGAAAGAAATAGTACCTAAAATTATATATTTTTTGTTTTAAGTTAATTTACCATGAGTGTTTATGGTAAAATACAGTAAATATTTGAATTCAATTTTAAAGTAAGGAAAGAAATAGAAATGAAAAAAAATTGATAGAATAAATTTTGAATACGAGATCTAAGTTATGTACAAAATTTTTTAAGTAGCATAATAAGATAGCTGGTTACCTCATATGACTAGTTTTTTTTTTTTTTTTTTTTTCATGTGTTTTCTTTAAATTGATTGATTTCTGGAGGAGAAAAAAATGTTGAATTAGAATATAAAATTTATAAGTAACTTATAACTAACTTAATGAATTTATAAATTTGACTAAAAATAGAAATACATTAATTTTTACTTGTTTGGAGGGTACAGGGAGCTATTTAATCCATAATATGTGGAGTGGGAGATCAGACTTCATACCTTAAAATTGATAATACACATTTATACTAACATGTTCGCAGTAAAAAAACTATTTGTTATTAATTAATTTTCTATCAAATCTTATTTTCATAAAAGAATAATATGTTATAATTGAAATTATAAAATAGAATTTTCATTTATAAATCATTCAACAATAAATTTGCCAACATCATAGCTCAATTGGTTATAATTTGTATTATTAACATTGTGTTCAGATTCACTCCCTCACATTGTGAATAAAAAAAAGGAAAAAAAGACTCCACTTAAATCCTGTTTTGGAATTTATATTAAAATTTATTGTTAGAAAAGTACGCCTACCCTATTGTAGGTGAGACCTTTTTGTATATAATAGGTATGGGATATAAAAAGCTTATCTTATTCTTGTTATTATTATTATTATTTTCTTGTCCGATAAAAAACTTGAATTTTATTATTATCAAATCATTCTATTTAATTGATTTTAAAAAGTTCGAAAAGAGTTTACTTGTCGAGGACATATCTTAGAAAATAAAAATAGAGGGAGAATGGATATAGTAATTAAATTAATATTGAAATTTTTGAGGAAGTGGAGGAAGATGAATAGGTAGTTGTGCTTTTAAAATTAAGATTCATATAAAAATTAAAAGGATTAAATCTATTTTTAAAATTATGAGAAAGATAAATAGTTCTTACTATTTTAAAATTATGATTGAAATTAAAAATTAACGAGATTAATTGCTCCTTTATTTTTCTTTCCTGAATTGTCGTCGGTTTGTAGTAGAGGTGTTTTGCTAGACAGTTTTGGGTTGACATTGTTGCATAAGAAATTATTCAAAATGATGAGAAATAAAGTAAAGAAAATATCTTATTATTAAAATTTATTTAGAACTATTTGAAGGGTAATTGGGAATGTGAGCTTCTCTTATAAATGATTTTGGCCAGTTTTCATATTATTATATTATTAATTTTTTTAACCACTGAATAACTTGGTCATATTTCTAATTTTGTTTCGATTATATCGTTAGCTGTCAACATCAAACTATTTATGTATTCAGTCAAGGGATGAGGATTACAGATCAGAAAATAATAAGACAAAATCAAATATTTTAGAATAGTGACAAGAAATTTTTATTGAGGGTCTGTTGAATCGATTTTTTTATTATTATTTTTGGTTGAACAGCGCTTCGTCGACTAAGGCATGTTTACTTTAAAGTTAAAAATTGAATATTTGTAGACTAACTTAATATAGAGCATTTTTATTTTTATAAAATTAGTGATCTTTTCTATACATTATATTGAATAATGCATTATCAATTTTAATAGTTACTGATGACGAGAAACAAAATTATAAAATAATAATATATAAAGGAATAAATTAAACAAAAATTCGATCAGAAAATACCGAAAGATTAGAGACTAAAACATCCCTATTTATCTCAGAAAGGAAAAAGGTAATAACATTTTAAGGTCTTTTAGTAAATATTTGAATTCTCAGCCGATTGTAAAAACTCCTTTTCTTTAGTTTGGGAAGCCTAATTTAGACTTTCATACACCTCCCAAAAAAGGAAATTGGATGACAAAAAACGAAGAAATAAAAAAATCGCCAAATCGTTAGAGAAACTAAATATCTTTATCTTCTAAGCATGAGAGTGCATTTAAAAGATTACAAATATAAAAAATATCGATCCAAATATAGAATTATTATTATCTAAAAAGAAAAAAAAAAGCTTGGGAAAAGCAAAATTAAGATGACTGCTGGAGATACCCGATCGACGAATTTCTTCCCCTTTAAGGAACGATCAGGTGTATCCTTTTTATAATTATTACGCACAAACCGACATGGTTATTCAAATGAATCATTACTACCCCTTTTTCTCTCTCTTACCTGCCCACTTCTGAAATCCTTCGTAATCCATTTCCTCTGTTTATACACACTTCTCTTACTTTGCTGAAATGAATACTTTGGTATTTATTTCACGTGTTCAGCATTCATCAACTGCACTTGTTGAATGTGTTGAATGGTGGTGTGAACAATTCCGCACATACTTAAAAACAATGGTTGAACCTTTGAACTATCATGTTCAACTTCAATCTTTTATTTATTTATTCATTCATTCATTTTTTTTCCAAATAAATTAATCAAAGTAAAACGCATGAGAAAATCAATAAACTTTTATAGTGAAAAACAGTACAGTTGTTTTTATTTTTCAAGCTTGTAAATTGTGATTACATGAGCATTATTACCTTAGTTGTTCAAAAGAAAAATGTCTGGCTTCAAAATATTAGGATATTACATAATTACCTTCTTTAACTGCTGAAGTATTGCATATTTATGAGAAACTAAATTTTTACGTAGGGTCTCATTTGCATGGTAATTTACTAATAAGGTTACATCCTATAAGGCTCTATCCAAGTTTCATAATTGTGTGTTCGTGGAGGGCTTTTTTCTGATAGCTAAGAGTAGGCTTTCTTGTGATGGATTAATGATGAGGACTTGGTCATTAGCTTTGGATGTACTTATTATATAGATACACGAAGCTAAAATTATCCAGAATTGGCTCAGTTAATCAAAGAGCTATTAAGATACTAAGATAATGGATTTCTTAATTGTTGCCATGTCATAGGATGTATAATCAAGTTGGCCTTAAGAAGAAGAAGATTCAGTAGCTACTATAATTTTAAAAACATAAAACTATTACCGGTATAGATAAGATAGATTTCTCTAGGTCTTGACAATAAAATATGAAAAGAAAATTCTTAGGCTTGGAAGTGCAGAAGGAATTTGATCATAAGATCATTACTCCTGCTTTTACATAAGAGAGAGAGAGAGAGAGAGGGAAAAAAAAAAAAGCCGTAAAGATTATTTTGATGGCTACCTGACAGATAGTCCGTCTATGCATTACATCATCAAGTTGTATAGAGATTACAAACTGTCTTTCTTTACATAGAAAGAAAAACCACACGGTATCTTCATGATCTCTTGGAAATGTCAAATAACTCTGAATTTAAGTTGAGAGAAAAAATAAAGGCTTTAGAGGGGAAAATTTCTATTACACAATATACTTCTGTCACACCCTATTAGAAATAACGCGAGACCCACAAACATCCAAAGAATATAAGAGTATAGTGCTCATTCTTTCACTTTTAGAGAGAGTGAAACTTTTGTGAAGGTTGTGATCATTATGTTTATTGTCTCGATTGGGCCTTGGTGGCTTAGATCGTTTGTATTTATCAACTAGGCATACTTTTTGGCTTAGAGCCTTGTTTCGTAGTTATTTGGTTCCCTCTATGCCCTTTTAAATATTCTTTCAGATTCCACAGCTTTATTTATTTCTTTCTTTTAAAAGACCACCAGTTTTAGCAGAAAATTCAAAATTCTAAATCACATCCACCTTTAGACACTACAGTACACGAGTTTAGTTAAGTTAGATCGAGTTATGTATACTCACTTGGTTCGAATCAAAATTTAAACTCAATTTAAACTCATGAATCAAACTAAACCGATCAATTTTTGAAAATTTTGACACAACCTAAATCGAGAAAATTTATCTGACCCAACCTCCATCCCTTATGATTTGAGTTGATTAGTTGGGCTTTTTTAGTTATATATATAAGATTTTGTCAACATTTTAATAAGACCATTCATTTTTAATTTCGGACAGAACTATTTTTGTTGTATGAATCTCTTTAACAAATCTAATAATGGATATTAAAATAGATTAGGTATGATTAGCATTTTGAAAAAATTCAAATTTCTTAACATTTAAATGTTTATAATTATTCAAAAATAAAATTCATTTAGTGAGATGAAGTTCATTGATGAATACAGACTTCATCATTTATTTTCTTTTAATGAAAATGACAGGATTTGCATTTACTAGGTAAGTTGGAAAAGGTAAATGAGAGAAAAAAACTAAATTATAGGAAATACTCTAAATTTTATATTGTTTCAAAATACTCCTAAATTATCAAAAGTTTCATCAATATCCTAAAACTCAACATTTATAAAATTTAACTTTCTAAGACCTCTAACCATAGTCAGTACTACATGCAAGGGAATATATATATATATAACGAAAATATGACCTTCATGGAAAATGAAAGAATATACTGAAATAAAATAGAAAAAACCTACAAAAAGAAATCAAAAGGGCTTCAGTCTAGAGAAATAAGACCAAGCGAATAGTTACGAAAGAAATAGAAGCCCAAATAGACATAGAAAATCTAGCTAGTGACCAAACTAACATTCTTTGAAGGTTCTATTATATTTCACACCCCAAAACCACCAAGTAATAGTGCACACCTTCGCCTGCCATTAAATCCTCTTTTTATCGATCTCTAAAGGGGAATGAAGGAAAATCTCTTCAAATGCCTTCCTACAACTTCGAGGGCTAACAATGCCGAAGTTGAGCACCTCAAAATAATTGTGCTAGACGCTCGAGCAAAATCACTACTCCACATAATAGATATAGTTAAAGTATTCTTTCTTTCTATCCTTTTTTTTCCCTCAAAACTCATTTTCATCCTTAAGAACTACTCTCACTTTCTTCTCATTCAATTCTTGCACCAATTTTCTCAATAGCAAAAGAGAATCCAAAGCTTCAAGTTAAATATAAAATCTCATCGTTCCAAATCAAAATCTCATCTCAACACTCAGCAAAACATAATTGATAAACACAAAAAGGACTGAGAAAGAAATATCATCTTTGATCACTGGGGACCCTTTTGAAATGACCACGAAAAATAAACTAATTTCAAATGCAAAAAGGACCATCAAAACCTTCATCTCTACAAATTTGGTATTACCTATTCTTTTTTCTAAGAACCACATGTACGGATACAAGACACAAACACGACAATAGGTCATTTTTTTAAAATTTAAGACAAGGACACGATAAGAACATTTTTAAAAATGCATAGTTTATGAGATTTATATGCTTAAAGAATTAGTTTGGTATATATTTCACTATCTAATTTTATAATTTTTTTCATATATGTGTATTTAGTATTTGTAACAAGTGTTTGATGCATACCTAACAAATGTTGGTTCTATTTTGGCTTTGTACAACTAGTGTCTAACACATCCATTTCACTAACAAGTATCTAATACGTGTCCAACAACTGTCGAGTGTCGAAGTGTCTGACATAGACATGCTAGCCAAACTAAGATGTTTGTGCTTACAACTATGACCCACTGAGTAAAACAATAAATCACCAAACACAAAAAGAGCCAATGAATGAATATCAATATCGATCATTAGGGAACGTTTGAACAAACCATAACAAATAAACTAATTTCAACTATAAAAATGATCATCAAAATCTTCATCTCTACACAAATTTGACGATGTCATGTTTTCTTTATTTTTCCTCCATTAGTTGTCCAAAAATTAAGATCCATATTCTTAATATGACTCCTTTTATGTTTTAAACCCAATTATATTGACTCTCACTCAATATCTCTAGGTCCCTAGAGATTCCATTATTTTCCTTTCTTCATACTATAACCTATATACTTTGAAGGGGATGGTTTTGGAGATTGGAGAATGGAGGAAGGAAAGTAAAAGGAAAAATAGTGTAGATAAATAGTTTCCATCCATATACCAAGGATAAAACATTTTTAACCTTTTCTTTTAGAGCTTAATAGTATTAAATACAACTTTTAAAACTTCAGGGGAATTTTTGAAATATGGTACTAACAATTTCTATCCATTCTAACCATAAAAGTACTTTTTAATTGCTTTTTAAAGTTTTGAGTATTGCTTTTTCTATTATTTAAACATAAAAAGGTAATTGATTACATTCACTTTCCAAGGAAAGTGAATATAGCTCAATTGACATCCGAGATGTGCTAGTGACCATGAAGTTCATGGTTCGAATCCCCCACCTTTAATTGTACTAAAAAAAATCACATCGACTACCTATTTAAGATGTTGAAATCGTATGAATTTCTAACAATTAAATATTGTAGGATTAGGTGTTTCTATAACATTAGTTAAAAATCACATAAGATAGTTCAAATGCTATCATATATTAGAATATCAAATTTCATTAATCTAGGAAAGATATGTCAAGGAAAAAAAAGCAAGATCCTAATTGAATTTAAGAGATACTAATCCTTTGATTTAGTACACAATCTAGAAAGAAGAAAAAAAAAAAAAAAAAAAAAATCTAGGCCTAGGGTCCTTGTGATTTCTTTCAAGCCACAATCTTTTTTAAAAAAATTCTTTCAAACCACAATCTCTTTGACAAAAATTTATAACTAAATTCAAGGAGCATATGTGAAGACTTTAAGAAATGCCTTTCAGACCAAAGAATCTTACCACATATAACCCCCTACACTGAACATAACGATATAACTACAAAAATGTGATTTTGATGCCCTTTCCTACCGCCAAGAACACATTCTAGTTTATGGCCAAGGACAAAAAGAGGATGTACTCCATGGAGAATGTCATAGGGATTCCATAGCTTATCTTGAAAATGATGCCTCCGTTGTTCATAGTTAAACAATCAGAAATAGCTTTGTTAATAATTAAAAGGAAGTAGAAAAAGAAAAAGTCACCTCCTAAGAAGTATAAATATCGTTATGAGACACGAACACGATATAACACATGGACATGGAGATACGTCATTTTTTAGAAATCTAGCACACAACGAGGCAATGACACATTTATTAAATTATACAGTTTTTAAAATGTATATCATTTAAGTAAATGGATTGATGCATTTATATGCTTAAAAAACTTAGTTTAATGTATTTCACACTTGAAATTAATCTCTTTAGTCTACTCAACAAATGTTCTATGCATAACTAACACATTTGTTATACTAATAAGTGTCCAATATAAGTCCAAGTGTCAAAATGTCCAAATATCCGACACGTGTTGGACATAGACACGCTAGCTAAACTAAAGTGTTTGTGCTTCTTAGATCGTATCTCAATGGAAAACCCTGTCTTCTTAATTTTTTCGTCGTGGCTTTTGTTTCTTTGGGTTTCATTTTAATTTTTCATTTGTTTTCATCTTGGTTTTTTTTATCATCAATTCCTTAGCTTGTTGCGGACCTTTCTTCTCCATTGTCTTTCTCCACCGTCGCCTTATACGGTCGTTGGATTTTTCCTTTTTTGGTTGAAGTTTTCTTTGTTCATTTTAGCTTTTGTTGACTCCGGGGCATAATTGGATTTGGTTCTAGATTAGGTTTGTCTTCTATTCATTTGGGTTCACCGTTTAACGAGCAGACTTCATGGATTCCATAAATTTTTGTGTTAGTAATCGATACTTTTGCCTTTGGTATGAAGACAGAATCTTCCATGTCAAAGAAATAAAGACCAAATTGTGCACTCCCCACTTATCACAGTTGAAATGGTCCGAAACAACTTCATTAGAATAGATGTTTTTCCTGTCTTATTGGCATTTCTCAGAAGATAATGATGATCTTGGAGTTACTCGTTTATCAAAATCCGAATCTTCCTCTAGTTGGAAGATTGAATATGTGGTTTGGCCACCCACCGAGGACGTAAGTTGACCCAAGTGCCTAGGGAGATTCCAAAAGAGGACAACCTATCTTTTGAGAAATGTTAAAAGATTATGATGTAGTAGGTATTAGGGTAAATTGGGTTATCTAATCAAATTCTTAATTAAGATTAGGATTAGGATTAGTTTCTTTGGTTTATTAGGTTTAGGATTAATTTCCTTTCCAATTCTCTATAAATAGGATTAGGATTAGTTTCCTTTCCAATTCTTGATTGATTTTTGGTAGATTATTCGCCAATCTTCTCAATTCCAATTTTAGCTTTTCTACAGCTTGCTGCAGCCTGACCATTGAGGTCTGCGTTACTGTCATTTTATCCAGTAAATCACCCATTAAAGTTTACAAATGTCTTCGTTCACAGTTTGAGGAGATGTGGAGAAGGATGCTTCCACATCATGAGTTTGAAGGTTGGAAACCTTGTTGGTGTTCTTGATCACCATCAAGCCCTTTCACAAAGGCACTCTAATACCAAATTGATGTAGATTACGAATCTTGGAGAATTCTTGAAGGAAAGTCTTATTCATTCATTGGGGGAAAACTCCTATAGTGGATGCTTTGTCAATAGGATTGTTGGTGGCTAGAAAAAAGACTATTTTCTTATCTAACTACTCAAATTCGTTAGTGTTATCATGACCTTATGTAAGGTTTGATTATTGTACTGCCATGATCTTCCTAAAAGGATATGGCATATGTCCATATTGATAATATCATGTTGATTATTTTTTTTTATTTTTATTTTTTTATATTCTCCCTTTATATTTATTGTAAAGTTATTTGCTATATTTATTTTTTCCTTATTGTAATAGGTTTCCTTTTATTTAAGAAAACCCTTGTCTAACAAAGAAAATAAGAGAGAAATAAAATATTTTCAGCATGATATTAGAGCATATTGCTTGAAACCCTAATTTTTTTTTAAAAAAGAAACATAAACCCTAATTTGTCGCCACCGCGCTCCAGATCTTCGCCGGACGTCTCACCTCAGTCACCGGACAACGCCCAGCCTCCCAGTTGACGCCAGACCGCTGGTCGTTCGTGGATCCTGTGAGTGCAGAAGAAAGTCGCAAATCGCGTGCCCTAACCAACGCCGATGCCGACGCTGTTGTGACTTGGGTTCACTCCTTCTCCGTCGGCCGACCAAGTCTACGCCATTGTCCGAGCGATTTCTGGCCAGTTCCGACGATTCTCCGATGGTTCCTTGTTTTGCCGCTGAGTGGAAGTTTTTTGGGTTTGTTAAAACCGTTTTTGATTCCTTTTTTTCTTTTCAGATTTATTGTTGTTTGGGTTGTGTGCTTTTATCTCTTCAATATGTCAGAGACTAAGGTATCTGCCACCAAAGTCTTCGACAATCGGATCCATTCCCACACTCTCACTGTCCAAATCACCACCATTCGACTTAATGGGGATAACTTTCTTCGTTGGTCCCAGAGTGTTCGGATGTATATTTGTGGCCAAGGGAAGATAGGGCATCTCACCAGAGAAAAAATCGCTCCAAGTCCAGATGACCCTTTATTTGTTGTGTGGGACGTGAAAAACTCCATGGTTATGATATGGCTCGTCAACTCTATGGTGGAAGACATCAGTAGTAACTACATGTGCTACATTACGACCAAGAAATTATGGGACAGTGTGACTCAAATGTATTCTGATTTGGGTAACTAGTCACAAGTGTTCGAGCTGAATCTTAAGTTGGGTGAAATACGACAAAGAGGCAACTCAGTTACACAATATTTTCACTATTTGGAAAGGATGTGGTAAGAACTTAAGCTGTTTAATACGTATGAGTAGAAGTCCACAGATGACCAAAAACATTATCGGAAAACTGTTGAAAATGGTCGCATTTAGAAATTCCTTGCTGGCCTCAATGTTGAGTTTGATGAGGTTAGAGACAGGATACTTGGGAAAAGTACTCTTCCAAATATTAATGATGTTTTTTCTAAAGTTCGCAGAGAGGAAAGTCGCAGGAATGTTATGATTGGAAAGAAGGCAGTTGACTCAGTTGAAAGTTCCGCATTAGTGATTGAAAATACTGCAATGAAAGCTTTTGATCAATCCAACAAAACTCATGACAAGCCTCGTGTCTGGTGTGATCACTGCAACAAACCCCATCATACGAGAGAAACTTGTTGGAAACTACATGCCAAACCTGCAAAATTGGAAGAGCTCCCATCAGCATGCCTCCAATGTAAATATTGTTGATTCCAGTCCACTCAAAGAGCAAATTGATCAAATCCTGAAGCTACTAAAATCCAATTCATCGGGTAATCCTAGTGTTTACTTGGGACAAACAGTAATTCCCCTCAAGCTCTCTCGTGTCTAAATTCCTCTCCGTGGATCATCGATTCCGGAGCTACTGATCATATGACTAGTTTCTCGTGGTTATTTGAGTCATACTCCCCTATTTATTGTAAAGAAAAAGTGTGTATTGCCGATGGTAGTTGTACATCTATTGCAGGTAAAGGAACTATTCCCCCAAGTACAAAACTCATACTACATTATGTCCTTCATGCTCCTCAACTAGCTTGTAATTTATTATTTGTGAGCAAAATATCTAAGGATGCTAACTGCTGTGTTATCTTTTGTGGAACCCATTGTCTCTTTTAGGATCAAGACTCGGGGGAGACGATGAGATGTGCTAGGATGATTGATGGTCTCTAATACTTTGATGAAGTTTCAACTAGTCATAAAAAGATTCAAGGCTTGAGTAGTGTTAGTTCTTTTTCTGTTCAAGAAACTCTTATGCTTTGGCATCGTAGAGTAGGAACCACTTAAGTTTGGCTTTGGATGAATCCATCGGTGAAGATCTAAATACGCTTTTCCAGACCAACCAAAACAGAGAAGACTGCAAGAGCTCAGCCGACCACCTCTCTTCAATGCTTAACGAAAAAATTCCTCATCACTTGAGATCATTTGTAGAAGATTGTGGAATTTCACTGGTCTAAGTGAAGGATTGACGGTTCAGAACATAGAAAAGAAAGGTAGTATTCTCAATGAAAATCGTTTCCTGGAACATTAAAGGCCTTGGAGACTATTCGAAACCGCTAGCAGTTAAGCACCTTAATATGAAGATAAATCCAGAATTGGTTTTAATTCAAGAAACAAAGAAAGAGGCATTTAAAGTCGAAGCAATCAAGAAACTTTGGAGTTCAAAAGACATCGGTTGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTATTGTTGATCATGTGGGATGAAAGTAAAATATCAGTCATCGAAACACTCAAAGGAGGCTACACTCTTTCCGTTAAATGTAAGACCTTATGCAAAAAAGTTTGTTGGGTAACAAATGTATACGGACCAACCGATTATAAAGAAAGAAAACACATCTGGCCGGAGCTACAAGCTTTGGCAGCTTATTGCACAAATGCCTGGTGCCTGGGTGGGGACTTCAACATCACTAGAGCAATCCATGAAAGAGTTCCAACTGGAAGATTAACTAGAGGAATGAAGAAATTCAACAAATTCATAGAAAAGGCACACTTAATGGAAATCCCTTTGAGCAATGGGCGGTTCACATGGTCAAGAGAAGGAATCAGAATATCAAGAACCTTGTTAGACAGATTTCTAGTGACAAACGAATGGGATGAAGCTTTTGAAGGCACTTGAGTCTTTAGACAGGTTCGCATTGTATCAGACCACTTTCCTCTCTTATTAGAAGTTGGGGCTTTAGAATGGGGTCCTTCCCCCTTTCGTTTTTGCAACAGTTGGTTGCTCAACTCCCAGTGCTGCAGCACTATAATTAGAACTCTCGAAGCTGGACATCATCAAGGTTGGGCTGGTTTTGTCATTTTTGCTAAGTTGAGATGAGTTAAAACCTCCCTACAACAGTGGCACGAGGGAGTAGACTTACTCTATGTAAGACCGTGCTTTCAAACCTCCCGTCTTACTACATGTCTATCTTTCAAATGCCGGAAAAGGTAGTCCTATCAATAGAAAGAGCCATAAGAAGTTTCTTTTGGGAAGGCAATGGAGGAAGCAAACTGAACCACCTTGCCCGATGGGAAACAGTTACAAAAAACCATAAGGATGGAGGTCTCGGGCTGGAAAAATATTGAAACTCCGAAACTTGGCAATGCTGTCCAAATGGGGATGGCGTTTTATGCAAGAATCTGAAGCTCTTTGGCGCAATGTTATCACAAGCATTCATGGGGAGGATCACTTCCAATGGCACACTATCAGAAAGGAGGTTGCAAACTTAAGAAGCCCATGGATAAGCATTTCAAGACAATGGCAGAAAATTGAAGCCCTTGCAATCTTTAAAGTAGGAGATGGAAGAAGAATAACATTTTGGTCAGATCCTTGGATTGAGAATTTGCCCCTCAAGGTCAGATTTCCAAGACTCTTCAAGCTAGCTCTCAAGCCAACTGGAACAGTAGCAGATCATTGGGACCCAAGTTCCTCCTCGTGGGATTTAGCGTTCAAAAGACAGTTGAAAGAGGAAGAATTAGTTGAGTTTCTGTCCCTCTCAAGCAGTGTGACAAATAAGAAAGTTACATTGCAGCCGGATAAAAGGATTTGGGCATTAGAAGGAAATGGTAGTTTGTCCATAAATTCCCTAACCACTCATCTTTCAGTAGCTTCCCCAATTGACGTTAAGCTGGTTAAAGGCCTATGGAAGTCCAAATGCCCTAGGAGAGTCAACATCCTGATTTGGACCATGCTGTTTGGTTCATTAAATTGTCCTACCACGATTCAAAGGAAACTCCCGTACCATTGCTTGTCCCCCCATATGTGTGTCCTCTGCCGCCAAGACCAAGAAGACATTCAGCACCTATTCTTCGGCTGCAGCTATGCTTCAAGTTGTTGGTCGAGGCTGTTTGGTTTCTTCAATTTTAGCTGGACAATGGGGAGTGACTTCAAAAGCAATGTGCTGCAAGTTTTGTTAGGTCCAAGACTAAACAAGGAACCAAAGTTGCTGTGGATCAATGCAGTCAAAGCTTTGTTATCAGAAATTTGGTTCGAAGGGAATCAACGCGTATTTAATGACAAAGCATCCAATTGGGAGGATCAGTTCGAACAGGCAAGGGTAAATGCTTCTTCATGGTGCACTTTGTCAAAGGCATTCAAGGATTATTCCATTCAAGAGTTTGTTCTAAATTGGAGAGCCTTCATTTTCACTCACCCCTGAACTTATGTCATTTCCTCTCACCAATGAACTCATCCTGTTTTTGTAGTAGTAGTTATTTTTTATGCTTATCACATTTTGTAATACTCGATTGGGTCTAGATTTTGTATTGGCTTGTTAAAAGGTTTAGTCTACTAGGACATGATGTTGGGTGCTAAGGGGGTGTCAACCTAGTTGAGATGCCCGGGTGCACCCTCTGATCCATTATTAGTCATTGCTTGTAGTGTTTCATATGTACTATGAGCTTTGTCTCAACACATTAATTCAATGATAGAGACTGTTTCCTTTTAAAAAAAAAAAAAATACTTGTTTCCTGATTTATTTAAAGGAATTGATTGTTCTGTTTTTCAATGCAAAGATTGCATTTTTGCCAAACATCATCGATCTACTTTTTCACCCAAATCTTATAAATCTTCATCACCTTTTTACTTAATTCATACTGATGTTTGGGGTCCGTCTAAGGTTTTGACTAAAAATGGCAAGCATTGGTTTGTTACCTTTATCGATGATCACACCCGTTTAACTTGGCTTTATTTGCTAACAAAAAAGTCGAATGTAAAAGAGGTATTTGTTCGTTTTTATAAAATGATTGAGACTCAATTTCAAACTAAAATTTGTATTCTTCACTCTGATAATGGGACTGAATTTTTTAACAAACCACTAATCACCTTTTTACATGATAAGGGCATCGTTCATCAAGCTACATGTCGCGATACTCCTTAGCAAAATGGTGTTGCTGAACAAAAAAATCGACATTTGCTTGAAGTTGCTCGTGCCCTTATGTTTTCTATGCATGTTCCAAAATATTTGTGGGGGATGATGTTCTAACCACTGCTTACCTAATCAATAGAATGCCTACTAAGGTATTGAATTTTAAAACTCCTCTGCAGTACCTCAAAGAGTTTTTTCCTACAGCTTGACTGTTCTCAGAGTTACCTTTAAAAGTTTTTGGGTGTACTGCTTATGTTCATCGAACCCTCTTTTCCCAAACTAAATTGGACCATCGGGCCATTAAATGTGTTTTTGTAGGCTATGTTCCCCTTAAGAAGGCGTACAAATGTTTTGACCCCCTAACTAACAATTATTTTGAGAGTATAGATGTGTCCTTCGTGGAAAATCAATCTTTTTTTAGCCTAACTTCTCTTCAAGGGGAGTCATCTCTCCTTGAAGAGAATTTTTGGAACACTTCACCTCTCCCAAACATCATTAGTCTTTAAATTATGAGCTCTAGTCCTTCGATCCCAAGTGTGGAAAATTTGCTGAAAGGAGGAGAAACACTACAAACAGATCTGACAGGTCGAAATCCTAAATTTTAGTTTTATACTAGAAGAAACATAACTCAAAGGGATAGAAATCAAACATCTCTACCAATGATGAAATCAAAATTAGGTCCCAGTCGAGCTAACACAGGACCAATCTGATACTCCAATGAATGATCCTGAAAAATCTGGGTATGTCTCTTAGTCCTTCCTCTCATAATATGTTGCCTGATATCTCTAATCTTGATATTCCAATTGCCCAGAGAAAAGGTACCCGCCAATGTACAAAATATCCCATTGCGAACTATCTCTCCTATCATAGATTATCGGATAGTCATAGAGCTTTCACATCCAAAATAACCAACCTATTTGTTCCAAGGAATATACAGGAAGCTCTAAATGATTCGAATTGGAAATTAGCAGTGATGGAAGAGATGAACGTGTCAAAACAAAGTGTACTATGGACATAGTTGATCTATCAGAAGACAAGAAAGCAGTGGGATGTAAGTGAGTTTTCACGATAAAATGTAATGTTGATGGTAGTATCGAAAGGTACAAGGCCAGATTAGTGGCTAAGGGATTCACCCAGACCTATGGAATTGATTATCAAGAGACATTTGCCCCTGTAGCTAAAATTAACTCAATTAGAATTTTGCTCTCTGTTGTAGTTAATTTTGATTGGCCACTTTATCAACTTGATATTAAAAATGCGTTTCTTAATGGGGAACTTAAGAAGTATTTATGGACTTACCACCTGGTTTTGAAGCTGACCTTGGGATTAACAAGGTATGTAAATTAAAAAAATCACTATACGGCCTTCAATAGTCTCCTAGAGCTTGGTTTGAACGTTTTGGAAAGGCAGTCACAAGCTATGGATTCAGCCAAAGTCAAGCCAATCACACTATGTTCTAAAAGCATACGGGAAATGACAAGGTGGTTGTTCTGATAGTGTATGTCGATGATATCATTCTTACAGGTAATGATGAGACAGGAACGTCTATTGTAAAGGAAAAATTGGCGAATGATTTCAAGATCAAAGACCTGGGGTCCTTAAAGTACTTCCTTGGCATGGAGTTTGCTAGGTCTAAAAGTGGTATTCTTGTCAATCAAAGAAAGTATATCCTTGATCTACTCAAAGAGACAGGTTTACTTGGTTGCTGAATTGTAGAAACTCCCATTGAGCATAACTTAAAATTGGAAGCTACAACAGAAAATGATGTAAAAGAAAAGGGAAAGTACCAAAGACTCGTGGAAAGACTAATATACCTCTCACACACGTCTCGACATTGCCTTTGTAGTTAGTATGGTAAGCCAGTTCATGCATGCCCCTGGGCCAACTCATTTTGAAGCTGCCTTTTGAATCCTGAGATATTTGAAAGGTACTTCAGGGAAAGGGATACTCTTAAAAAAACAATAGTCACCCAAAAGTTGAAGTTTATACTAATGTTGATTGGGCAGTTAGCACGACAGATAGGGGATCGACTTCTGGGTATTGCTCCTTTGTTGGAAGAAACTTTAGTTACTTGGTGTAGCAAAAAACAGAGTGTAGTTTCAAGAAGTAGTGCTGAAGCAGAATTTAGGGCATTGGCTCATGGTATTTGTGAAGGCATATAGATAAAAAGGCTGCTGGAAGAATTGAAATTCGCTCAGACAATGCCTATACACATTTACTGTGGTAACAAGGCAGCAATCTTCATTGCCCATAATCCAGTCTTTCATGATAGGACGAAACACATTGAAATCGATAACATTTCATAAAAGAAAAGATTGATGCAAGAGTGATATGCATTCTCTACCTCCCAACAACAGAACAAATTGCAGATGTATTAACTAAAGGTCTTCCTAAGTTGCAATTCAACAAGTTAACAAACAAGCTAGCCATGAGTGATATTTTCAAACCAGCTTGAGGGGGAGTGTTGATTATTTCCTTTTTTTTTGTTTATATTCTCCCTTTGTACTTATTGTAAAGATATTTGCTATATTTATTTTTTCCTTATTTGTAATAGGTTTTCTTTTATTTAAGAAAACTCTTATCTAACAAAGAAAATAAGAGAGAAATAAAATATTTTCAGCACATTACATATAGTTTGGTCCTTGTGGTGATTGCCGATGTATAATGGTACTGTACATAATTCATTGATCACTGTATCACCACCTCTTTTAATCCATCCTACCTTACATGGTTTTGGACGGGGATCCACTTTGAGTTTCAAAGCCTGCACCAACTTTTTGGATACCATATTCTCACTGCTTTCGCTGTTCCACAATGATATTGCAAATTTTACCTCGAATATTACAACGAGTGGGAAAAAGTCTCTAAGGGTGGTTGGTTGTGTTATGGGTTAAAAGCATCCTTTGGATTACACAGGACAATTCTTCCCAATCATCAGGGTTGGTAAACTCCACATCCTCGTTGTCTCTTGTTCATCTTCCTCTTCAACATTAGATTCTTGGATTGTTAAGTTTCTTCTTTGGGACAATTGTTGGATAAATATCCCATTTGTCCACACCTAAAACACTTACCCAAATTAGGTATGTTGTGTGGATTTGGATTTTTATTTTGAAAAGCACTAGAACTTCCCTTTTGATCATTTTCTTGTTTTGGGTTGGTGTCTTTGGGCTTTTTGCTAAAGAAGAGCTTGTCGTGTTTTGATTCTTTGTGTTATCCCAATTGTTTTTCCTTGCTTGTCCCAGTGGTTTCTCCTAGTTGTTCTTTCTTCTTGACCACTTCTTTTGTTTTTTAGTTTTGTCCTTCTCAATTCTTGTTGCCACCGAAACCACTTCTAAAAGATAGGCCATTGGGTGAAGATGTACCCTCTCCTTTATGTCTTGTCGAAGACCCTCAAGAAATCGAACTATCTGTTGGTGAGAAATCGAACTATCTATTGGTTCTCACTCTCAGCTAAGTTGTTTCTAGCACTTAAAATGCTGGAATTTCCTCGTGTATCTTGTCCCTTGTTTGGAGTGTTGGAATTGATTGTACATCAATTTTTCACAGTATCAGCAACAATCTCTTTTCATAAGTTTCAGCATCTTAGGCTAACTTTTAAGGACTATTCCTAATCCGCCATCAATTAAAGTTCAATCCTTCAACAGGACCTATGGTTCGTTCTAGGTTGGAAAGGCCAACGAAGTGGTTTCCTTGGATGTTAACCATTTATTGGTTGCATTCTAATTATTTGCACAAAATGTTGATATCAATACTACATTATTAGAAGAGTATTTTGAAGTCCTTATTTATTTATATCCAATCCATTTTTCTGAAAAAACGTTGATTCAATTCAATAAAAGAGTTTTACCTTGGGTGCAGTGGTCCAACAAGAGGTACGATTGATGGATCTCGATTAAGAACATGCCATTATTGCCATATTGGACATGTCGAACTTCCAAAGAAATCATAGAATATTTCAACACTTTGAACTTAATTGATTGTGATGCTTGTATTATAGAGTTAAGTGATTTCATCCCTTCCATGATTGAAATTAAGGATAAAAAACTAGGGAATACCTTTCTAAGTTATGAGCATAATTCTCCTTTGACGACCTTACTTGAACAAGATATGTTTTATAGTACAATTGTGTCCTTGAGTTCTAAGGAGACAAAGATATTTCTATTTTAGATCCTCCCTGCATTATCCAACATGAATTGTCACTAAGTGATTTCACTAACACTTTGGACCTTTTTAGACTCCATAAGATTATGGACGAGGAGAAAGTCGGTCCCAAATGATTAAAAGTTGGGCAATGGTTTCTTTAAGTTTGCATTTAACGATCTATTGTCTAGTAAGGTCTTTAAAGTATTATCAAAGGAAGTAGAAATGATGCTTCCTTGAGTTCTAAGGAGACAAAGATATTTCTCTTTTAGATCCTCCATACATTATCAACGTGAATTGTCACTAAGTGATTTCACTAACCCTTTGGACCTTTTTAGATTGCACAAGATTATGGATGAGGAGAAAGTCCTTGTCCCAAATGATCAGAAAATGGATGATGGTTTCTTTAAGTTTGCATTTAGTGATCTATTGGTTATTAAGGTCTTTAAAGTATTATCAAAGTAGAAGTGATGCCCAAGGGAAAATACACCTTAAATTGTAGAAAAGCAGAGAAAGTCCCAGATTTATTATGTAGAAAATAGATCGTATAAACCTAAAGAAAACATCAACAAAGCTAATATTATACTTAAATTCATTTGAAGAAACAGTTGCAGCAGGACCTCATGAATCAGCCTCAATTAATTGTAAAGGTAATGTATAAAAGGTAGAAGATAAAGCAAATGACAATCTATATATATCTCTCCAACAGAACATGAATGAAATACTCTTTTGTTTCTTGATCTTTAATACTTGAGAAACAAGTTTATAACGAGGATGTCCTAATCTATAATGCCACAAAATAAAAGGATCCAAAGTAATAGAATTTGTTGAAACAAGTTCTTTTGGTTTATCTATCTGTGAAAGTATAAAGCTGCGATGTTATTATCAAGAGAATGATTTTGGACAAGATAAATCCTCTTTTGTTGTAAGATTGAACTTGTACCGACCATCACATATTTGGCCTTGAGGAAGAGGTTATCCAGTTGCTCAATCCTTCACGTAGCAAAGATTAGGGTGAAACTCAAAAAATACATCATTTTCTTGAGCAAATAGACTGACACTTATTAAGTTCTTTGTTATGTGAGGCACATAGAGTAAGTTTTTTTACTAAGTTTCTTAGATGAAAAGAACAATTAGACATATGATCTTGAGAATAAAGAGTAGAATCTCTAATTTCTAGTGGGTATGTTACAGTCGTTACCACAAGATTGAATCTGGGAAGACAACAGCATGGATTTTATAGAGGATACAATTTTTATTGTGACCATATGAGCAAGTACGCACCCTTTATTCCTCTGACTCGCCCAGTTAACACCAAAAGTGTGGCTAATGCCTTTATTAGAGAGGCCGCCCAACTCCACAAATCTCCCTGCTTGATAGTTTCTAAGTGAGATTACATCTTTCTTAGCCATTTTTGGATCAAGCTTCTCCATTCCCAAGGCACGCATCTCCACCGGAGCACGACCTATCGCCCTTAAACTGATGGCTAGATGTGGTTAACTCGTGTTGAGACCTATTTTACGGTGCTTCCACAATGAAAAACCCAAAGGATGGGTTGAATGGCTATCATAGGTGAAATTTTGGTACAACACTATACCATAACTCGGTACCACGAATTCATGCCACCTTTGGTTACCCCTCCTTCAATGATATTCTATGAGTACCAGAAGACTACCAATGCTTCCCTTGATCAATAGTTAGCCAAACGTGATTTTGCCTAATTAAATCTAAAGTGCCACCTTGCTACAGCCAGAAATCGATGGAAGAAAATTGTGAATTACAAACGTACTAAGACGAAATTTGTTGAGGGTGATTGAGTGTATATGAAGCTGAGGCCATAATGTCAACATTTGGTCGCCAAAAGCCAAAAGGAAAAGCTCTTCCCTAAGTACCATGGCCCTTATCACATTATGTCTCAAGTTGGGCAATTTGCCCACACGTAGGAACTACCACCAACAGGTTTATACACCTAGTTTTCCATGTCTCCCAATTGAAGAAAGTCGTGCACAAGAGCCAGATGGTCCAACCCACGCCCCATGTTGAATGATGAATTCAAATGGATTACAAAGCCTCAAGAGGTATATGAATTCCAAAAGAATGTTGACTCTGATGAGATTGAGGTGTTGGTTGAATGGAAGGATTTACCCGCTCACAAAGCCACCTGAGAATCTTACAACACTATGCAACAACAATTTCCTACTTTTCACCTAAATGTCAAGGTGGTGTTATACCCAAGGGTAATACTAGGCCTTCTGCAGTTCGCACATATGCTAGAAGAGGCAAAAATGTAATTGTGGAGGGTACTTAGAGTGCTGGTCAAGTTATTGAAGCTTTGTTTTTTGTGCTTAGTTTTCTTGGTCTATATTGGTTTCATTTCTGATTTATTTAGGTAGCTAGTCATTTTTGTCGACAAAAACTTGTTTAGGAGAGACCAAATCCTTTCAGAAGGTCGTTTCCTTTGGACTCTTTGTGAAAGAAATACATCGAAAACTCAATTTTTTTTAATAATAATAATAATAATAAATAAACAAATAAACAAATAAATAAAACTCCATTTAAGGCCATCAATCATTAAAAAGACACTAGTACCAGAGGCGAAGGTATATCCTTCCCACTAATACTTAATGAGCATTTTCTTCCGGCCTGGAATGGATAAAAGGTGATCTTTCCTCCTATAGGAGAAGTCTTCTAAATCCATAGGGTGATTAGAACAAAATGGTGTCGTGTTCATGTCAATGTAATATGGCTTTTGTACAACACGTGCTACATAATTTGGAGTTCAAACTCTCAGCTTTAAACTACTTAAAATTATCATTTTATCCTTCCAAATGCTTTAGTGAAAATAGTTACGTCCATTTACTTTCTCAGCACTTACTCTGTTATGCTCTTTCATGATTGTGATGCAGCTTATTGAATATTATTATGCGCTACCAACGCCACCTTCAAGGGAGAAGTGAAAGTCCAATTGAAAATGACCTTCAAGAGAGAAGTGAAAGTCCAATTGACAATGACCTTCAAGAGAGAAGTGGAAATCTAATTGATAATGGTAGTGGTACAAAGGTAAGTGAGGACTTACAATCTCTAGTTAAGGTAACAAAAATATGTTTAACATGTCTACTAAAATATTTATAGATTACTAAAAGTAATTTTCATAAACATACAAAAAAAAGTAAATACATAAAATAAAAAAATAAATCAAATGACGTCCCATTCCAATTTTAAAGAGACTAGTTTTTATCAAATTGAAAGCGCTTTTAAAAATGGTAGTAGTTATAGTATTGTTCTTTAGTAAGTGTTTCATTTCAAAACACAAGCATACCAGAGAAATAATAAGAAGGCTGGTTAGAAGTGCTTGAGCAGTTCACTATTCGATCCTTTTTTTCCTGGGTGGGGAATAATGTGACTATCATTACCAAGGGGGGGAAATCTTGCATCTCCTTTTTCTTTGAAATATAATCTTCCATCTTTCTGCTGATTTTCTGCAGAACCAGGAATCTGATGAGACAATATTGGTTTCACTTGGAAAGCTCCTACAAACTATTCAAAGGCAATTTTCCGTATTTTTACTACAGCCATTTTTATGTATTAGAATTGCTCAAATCTCAACTAGGGTGTCCTCATTTCCAGTCAGGTTGAAGAACCAAATTTCAAGAAGCTCGATGTAACTCAGATGATGCAGTTGGAGAATCAACTGGAAGGCACACTTGACAAAATCAAATCTCAAAGAGTGAGTTTATTTATTTGTACCGCATATTCATCTAAGAGGTTTCAACTTGTTAACCCTTCTGAGACCCTCACTGCAAAAGTTATATGGTGGTCCCCTTGGAAGCTTATGCAATGTTGCATAGGATCATTTTAGGAAAATATAGCCCTTTCCTTAACTAATCTAACTAAAGCGTTGAAGGCACTTTTAGGGCATGTGAAAGCCTATTTTGGTATATATGAAATTTTCTAGAATCGTTAGGCTCTTTGTGCTCATTGGTTTCAACTAGAACTAGCTCAGTTGTTGTCTTACATGTCTTTATCAATTATCCTGTTTTCAAGTTTCTTCCAATGACATCTATTCTCTAGACTAACTGAAGGATTGAAGGCACTTTTAGGGACTGTGAAAGCCTATTTTGGGGTATTTGAAATTTTCTAGGTCCATTAGGTTATTCACGCTGATTGATTTCGGCTTAGGAATAACCCTAGTTGTTGTCTTCCCTGTCTTTATCAATTATCCTCTTTTCAAGTTTATTCCCATTACATCTGTCCGTATTTTGAGGATGCTATATGATTCAGTCATTTTTGTCTCATTTACAAGTATTTAGATGTTACTTTGCTACATACTGTGCTAATCGACTTTTGTCTTCTTGTATGTGGGTCCAACCCCTCACTTCCTCTTTATTTTGCGTACAGTACTTTTGTGTTTACAAGCCAAAGTTTTAACACAAGAAAGGCGAAGTATATATTGGATTCGATACAAATTGCTGGTCATATGGATAGAACTAACTTCGGGATTGGACACAGTATAAAATATCTTTTAGAAGCACATATTCTGCCAGGGGGACGATTGGGGCGTGGACATAAAGGTCTTTATGACACAATCAATAATTCGCTATTACACCTTAATCATCTTCTGTCATGTTGGGCCTACATAATGTAAGGAGGCGTCGATGAATCTGGAGCATGTTTTGTGGGAATAGGTCTAATTCCCCACTTCATGACTGAGGGGTGATTATTTGGCACACAGCTTTATTTAGGTTTTGTAGAACGTTGGCTTAAGCAAAAGAGGATCTTTACAAGTGAGGAAAGTTCTTGAGAAGAGATTAAGCAAAAGAGGATCTTTACAAGTGAGGAAAGTTCTTGAGAAGAGATTAGAACCTAGGCTAGGCTCATTACCTCCTCATGGTGATGTAGGAAGAATGGGTTAATTGGATAATATTCTAGGTTAATTTCCTTTTTACCCACATTGTAATAAAACTCTATAACTAAGAGTATTCCCCTCTTGTATTAGGCACATTATCATCTAAATAAAAGATTCACAAGTTCGGTTCTTGGAGGATTACTCCTTGAGGCTACTCAGGCTATATTACATTGGGTCTCAATTACGGAAAATTTTTGTTTCCCCCGTGGAACATCTGACATGGGCAGAAAAGGAAAATCTTTAAGCATGAAGAAAGTTCTTGGAAGTTGGAAGGAGGTTGGGACCTAA

mRNA sequence

ATGGAAAATAAAAAGGAGAAGCGAAGAGAGGGAGAAAGAGGAATTAGGGTTGACTGTGGAGATACACAAACAAGAGGAGAAATGGGACGGAAGAAGATCGAGGTGAAGCTAATCGAAGATAGGTGTAATCGCCATGTCACTTTCTGCAAGAGAAGATCTGGATTAATCAAGAAAGCGCGAGAACTCTCTGTTCTTTGCGACGTTGAGGTAGGACTCGTCGTCTTCACCAATCGCGGCCGTCTCTATGAGTTTTGCAGCGGAAATAGCTTATTGAATATTATTATGCGCTACCAACGCCACCTTCAAGGGAGAAGTGAAAGTCCAATTGAAAATGACCTTCAAGAGAGAAGTGAAAGTCCAATTGACAATGACCTTCAAGAGAGAAGTGGAAATCTAATTGATAATGGTAGTGGTACAAAGAACCAGGAATCTGATGAGACAATATTGGTTTCACTTGGAAAGCTCCTACAAACTATTCAAAGTCAGGTTGAAGAACCAAATTTCAAGAAGCTCGATGTAACTCAGATGATGCAGTTGGAGAATCAACTGGAAGGCACACTTGACAAAATCAAATCTCAAAGAAACGTTGGCTTAAGCAAAAGAGGATCTTTACAAGTGAGGAAAGTTCTTGAGAAGAGATTAAGCAAAAGAGGATCTTTACAAGTGAGGAAAGTTCTTGAGAAGAGATTAGAACCTAGGCTAGGCTCATTACCTCCTCATGGTGATGCACATTATCATCTAAATAAAAGATTCACAAGTTCGGTTCTTGGAGGATTACTCCTTGAGGCTACTCAGGCTATATTACATTGGGTCTCAATTACGGAAAATTTTTGTTTCCCCCGTGGAACATCTGACATGGGCAGAAAAGGAAAATCTTTAAGCATGAAGAAAGTTCTTGGAAGTTGGAAGGAGGTTGGGACCTAA

Coding sequence (CDS)

ATGGAAAATAAAAAGGAGAAGCGAAGAGAGGGAGAAAGAGGAATTAGGGTTGACTGTGGAGATACACAAACAAGAGGAGAAATGGGACGGAAGAAGATCGAGGTGAAGCTAATCGAAGATAGGTGTAATCGCCATGTCACTTTCTGCAAGAGAAGATCTGGATTAATCAAGAAAGCGCGAGAACTCTCTGTTCTTTGCGACGTTGAGGTAGGACTCGTCGTCTTCACCAATCGCGGCCGTCTCTATGAGTTTTGCAGCGGAAATAGCTTATTGAATATTATTATGCGCTACCAACGCCACCTTCAAGGGAGAAGTGAAAGTCCAATTGAAAATGACCTTCAAGAGAGAAGTGAAAGTCCAATTGACAATGACCTTCAAGAGAGAAGTGGAAATCTAATTGATAATGGTAGTGGTACAAAGAACCAGGAATCTGATGAGACAATATTGGTTTCACTTGGAAAGCTCCTACAAACTATTCAAAGTCAGGTTGAAGAACCAAATTTCAAGAAGCTCGATGTAACTCAGATGATGCAGTTGGAGAATCAACTGGAAGGCACACTTGACAAAATCAAATCTCAAAGAAACGTTGGCTTAAGCAAAAGAGGATCTTTACAAGTGAGGAAAGTTCTTGAGAAGAGATTAAGCAAAAGAGGATCTTTACAAGTGAGGAAAGTTCTTGAGAAGAGATTAGAACCTAGGCTAGGCTCATTACCTCCTCATGGTGATGCACATTATCATCTAAATAAAAGATTCACAAGTTCGGTTCTTGGAGGATTACTCCTTGAGGCTACTCAGGCTATATTACATTGGGTCTCAATTACGGAAAATTTTTGTTTCCCCCGTGGAACATCTGACATGGGCAGAAAAGGAAAATCTTTAAGCATGAAGAAAGTTCTTGGAAGTTGGAAGGAGGTTGGGACCTAA

Protein sequence

MENKKEKRREGERGIRVDCGDTQTRGEMGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQRNVGLSKRGSLQVRKVLEKRLSKRGSLQVRKVLEKRLEPRLGSLPPHGDAHYHLNKRFTSSVLGGLLLEATQAILHWVSITENFCFPRGTSDMGRKGKSLSMKKVLGSWKEVGT
Homology
BLAST of ClCG06G002140 vs. NCBI nr
Match: XP_008459847.1 (PREDICTED: MADS-box transcription factor 8-like [Cucumis melo])

HSP 1 Score: 253.8 bits (647), Expect = 1.9e-63
Identity = 135/167 (80.84%), Postives = 147/167 (88.02%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGRKKIEVKLIEDRCNRHVTFCKRRSGL+KKARELSVLCDVEVG+++FTNRGRLYEFCSG
Sbjct: 1   MGRKKIEVKLIEDRCNRHVTFCKRRSGLLKKARELSVLCDVEVGIILFTNRGRLYEFCSG 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDET 147
           NSLLNIIMRYQ HLQGR+ESPI+NDLQ RSES ID+D ++   +       T    SDET
Sbjct: 61  NSLLNIIMRYQSHLQGRNESPIDNDLQGRSESLIDSDAKDHVSD------ETILDVSDET 120

Query: 148 ILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQR 195
           ILVSL K LQTIQSQVEEPNFKKLD+TQM+QLENQLEGTLDKIKSQR
Sbjct: 121 ILVSLKKQLQTIQSQVEEPNFKKLDITQMVQLENQLEGTLDKIKSQR 161

BLAST of ClCG06G002140 vs. NCBI nr
Match: XP_011656809.1 (MADS-box transcription factor 8 [Cucumis sativus] >KAE8646765.1 hypothetical protein Csa_005710 [Cucumis sativus])

HSP 1 Score: 248.1 bits (632), Expect = 1.0e-61
Identity = 132/167 (79.04%), Postives = 143/167 (85.63%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGRKKIEVKLIEDRCNRHVTFCKRRSGL+KKA+ELSVLCDV+VG+++FTNRGRLYEF SG
Sbjct: 1   MGRKKIEVKLIEDRCNRHVTFCKRRSGLLKKAKELSVLCDVQVGIILFTNRGRLYEFSSG 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDET 147
           NSLLNIIMRYQ HLQGR+ESPI+NDLQ  SES IDND               K+  SDET
Sbjct: 61  NSLLNIIMRYQSHLQGRNESPIDNDLQGTSESLIDND--------------AKDHVSDET 120

Query: 148 ILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQR 195
           ILVSL KLLQTIQSQVEEPNFKKLD+TQM+QLENQLE TLDKIKSQR
Sbjct: 121 ILVSLKKLLQTIQSQVEEPNFKKLDITQMVQLENQLESTLDKIKSQR 153

BLAST of ClCG06G002140 vs. NCBI nr
Match: XP_022155417.1 (MADS-box transcription factor 8-like isoform X3 [Momordica charantia])

HSP 1 Score: 228.8 bits (582), Expect = 6.4e-56
Identity = 128/189 (67.72%), Postives = 147/189 (77.78%), Query Frame = 0

Query: 6   EKRREGERGIRVDCGDTQTRGEMGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVL 65
           +KRR   +GIRVD G      +MGRKKIEVK IED CNRHVTFCKRRSGLIKKARELSVL
Sbjct: 23  DKRRVEGKGIRVDSG-----AKMGRKKIEVKRIEDSCNRHVTFCKRRSGLIKKARELSVL 82

Query: 66  CDVEVGLVVFTNRGRLYEFCSGNSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDL 125
           CDVE+GL++FTNRGRLYEFC GNSLLNII RYQ HL+G+S+          S+SPID   
Sbjct: 83  CDVELGLLIFTNRGRLYEFCRGNSLLNIIERYQSHLKGKSQ----------SQSPID--- 142

Query: 126 QERSGNLIDNGSGTKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEG 185
                  ID  +  ++ +S++TILVSLGKLLQTIQSQVEEP+FKKL+VT+MMQLENQLE 
Sbjct: 143 -------IDTNASDQDHQSNQTILVSLGKLLQTIQSQVEEPDFKKLNVTEMMQLENQLEA 186

Query: 186 TLDKIKSQR 195
           TLDKIK QR
Sbjct: 203 TLDKIKIQR 186

BLAST of ClCG06G002140 vs. NCBI nr
Match: XP_022155416.1 (MADS-box protein FLOWERING LOCUS C-like isoform X2 [Momordica charantia])

HSP 1 Score: 227.6 bits (579), Expect = 1.4e-55
Identity = 134/207 (64.73%), Postives = 155/207 (74.88%), Query Frame = 0

Query: 6   EKRREGERGIRVDCGDTQTRGEMGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVL 65
           +KRR   +GIRVD G      +MGRKKIEVK IED CNRHVTFCKRRSGLIKKARELSVL
Sbjct: 23  DKRRVEGKGIRVDSG-----AKMGRKKIEVKRIEDSCNRHVTFCKRRSGLIKKARELSVL 82

Query: 66  CDVEVGLVVFTNRGRLYEFCSGNSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDL 125
           CDVE+GL++FTNRGRLYEFC GNSLLNII RYQ HL+G+S+          S+SPID   
Sbjct: 83  CDVELGLLIFTNRGRLYEFCRGNSLLNIIERYQSHLKGKSQ----------SQSPID--- 142

Query: 126 QERSGNLIDNGSGTKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEG 185
                  ID  +  ++ +S++TILVSLGKLLQTIQSQVEEP+FKKL+VT+MMQLENQLE 
Sbjct: 143 -------IDTNASDQDHQSNQTILVSLGKLLQTIQSQVEEPDFKKLNVTEMMQLENQLEA 201

Query: 186 TLDKIKSQRNVGLSKRGSLQVRKVLEK 213
           TLDKIK QR   L    S+ +RKV  K
Sbjct: 203 TLDKIKIQR---LYLAMSILLRKVEPK 201

BLAST of ClCG06G002140 vs. NCBI nr
Match: XP_022155415.1 (MADS-box protein FLOWERING LOCUS C-like isoform X1 [Momordica charantia])

HSP 1 Score: 227.6 bits (579), Expect = 1.4e-55
Identity = 134/207 (64.73%), Postives = 155/207 (74.88%), Query Frame = 0

Query: 6   EKRREGERGIRVDCGDTQTRGEMGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVL 65
           +KRR   +GIRVD G      +MGRKKIEVK IED CNRHVTFCKRRSGLIKKARELSVL
Sbjct: 23  DKRRVEGKGIRVDSG-----AKMGRKKIEVKRIEDSCNRHVTFCKRRSGLIKKARELSVL 82

Query: 66  CDVEVGLVVFTNRGRLYEFCSGNSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDL 125
           CDVE+GL++FTNRGRLYEFC GNSLLNII RYQ HL+G+S+          S+SPID   
Sbjct: 83  CDVELGLLIFTNRGRLYEFCRGNSLLNIIERYQSHLKGKSQ----------SQSPID--- 142

Query: 126 QERSGNLIDNGSGTKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEG 185
                  ID  +  ++ +S++TILVSLGKLLQTIQSQVEEP+FKKL+VT+MMQLENQLE 
Sbjct: 143 -------IDTNASDQDHQSNQTILVSLGKLLQTIQSQVEEPDFKKLNVTEMMQLENQLEA 201

Query: 186 TLDKIKSQRNVGLSKRGSLQVRKVLEK 213
           TLDKIK QR   L    S+ +RKV  K
Sbjct: 203 TLDKIKIQR---LYLAMSILLRKVEPK 201

BLAST of ClCG06G002140 vs. ExPASy Swiss-Prot
Match: Q9SAR1 (MADS-box transcription factor 8 OS=Oryza sativa subsp. japonica OX=39947 GN=MADS8 PE=1 SV=1)

HSP 1 Score: 105.1 bits (261), Expect = 1.4e-21
Identity = 77/231 (33.33%), Postives = 118/231 (51.08%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCSG
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSG 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDET 147
            S+   + RYQ+   G  ++ I+N          +N+L + S N             +  
Sbjct: 61  QSMTRTLERYQKFSYGGPDTAIQNK---------ENELVQSSRN-------------EYL 120

Query: 148 ILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQRN-------VGLSK 207
            L +  + LQ  Q  +   +   L + ++ QLE QL+ +L  I+S R          L +
Sbjct: 121 KLKARVENLQRTQRNLLGEDLGTLGIKELEQLEKQLDSSLRHIRSTRTQHMLDQLTDLQR 180

Query: 208 RGSL--QVRKVLEKRLSKRGSLQ--------VRKVLEKRLEPRLGSLPPHG 242
           R  +  +  K L ++L +   L              E++    +  +PPHG
Sbjct: 181 REQMLCEANKCLRRKLEESNQLHGQVWEHGATLLGYERQSPHAVQQVPPHG 209

BLAST of ClCG06G002140 vs. ExPASy Swiss-Prot
Match: Q7Y040 (MADS-box protein EJ2 OS=Solanum lycopersicum OX=4081 GN=EJ2 PE=1 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 1.5e-20
Identity = 75/201 (37.31%), Postives = 110/201 (54.73%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCST 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDET 147
           +S++  I +YQR      E+         ++S  D                T+N   +  
Sbjct: 61  SSMVKTIEKYQRCSYATLEA---------NQSVTD----------------TQNNYHEYL 120

Query: 148 ILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQR-NVGLSKRGSLQV 207
            L +  +LLQ  Q      +   L    + QLENQLE +L +I+S++    L +   LQ 
Sbjct: 121 RLKARVELLQRSQRNFLGEDLGTLSSKDLEQLENQLESSLKQIRSRKTQFMLDQLADLQQ 173

Query: 208 RKVLEKRLSKRGSLQVRKVLE 228
           +   E+ L++   L  RK+ E
Sbjct: 181 K---EQMLAESNRLLRRKLEE 173

BLAST of ClCG06G002140 vs. ExPASy Swiss-Prot
Match: Q03489 (Agamous-like MADS-box protein AGL9 homolog OS=Petunia hybrida OX=4102 GN=FBP2 PE=1 SV=2)

HSP 1 Score: 101.3 bits (251), Expect = 2.0e-20
Identity = 62/147 (42.18%), Postives = 86/147 (58.50%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSS 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIEN-------------DLQERSESPIDNDLQERSGNLID 147
           +S+L  + RYQ+   G  E+ I                L+ R E+     LQ    NL+ 
Sbjct: 61  SSMLKTLERYQKCNYGAPETNISTREALEISSQQEYLKLKARYEA-----LQRSQRNLLG 120

Query: 148 NGSGTKNQESDETILVSLGKLLQTIQS 162
              G  N +  E++   L   L+ I+S
Sbjct: 121 EDLGPLNSKELESLERQLDMSLKQIRS 142

BLAST of ClCG06G002140 vs. ExPASy Swiss-Prot
Match: Q38694 (Agamous-like MADS-box protein AGL9 homolog OS=Aranda deborah OX=29714 PE=2 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 2.6e-20
Identity = 74/232 (31.90%), Postives = 115/232 (49.57%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K+IE++ NR VTF KRR  L+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKMIENKINRQVTFAKRRKRLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSS 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDET 147
            S+L  + +YQ+   G  ES I +                           T++ + +  
Sbjct: 61  TSMLKTLEKYQKCNFGSPESTIIS-------------------------RETQSSQQEYL 120

Query: 148 ILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQRN-------VGLSK 207
            L +  + LQ  Q  +   +   L   ++ QLE QL+ +L +I+S R          L +
Sbjct: 121 KLKNRVEALQRSQRNLLGEDLGPLGSKELEQLERQLDSSLRQIRSTRTQFMLDQLADLQR 180

Query: 208 RGSL--QVRKVLEKRLSKRGSLQVRKVLEKRLEPRLG----SLPPHGDAHYH 247
           R  +  +  K L++R  +      ++V +      +G        HG+A YH
Sbjct: 181 REQMLCEANKTLKRRFEESSQANQQQVWDPSNTHAVGYGRQPAQHHGEAFYH 207

BLAST of ClCG06G002140 vs. ExPASy Swiss-Prot
Match: Q42464 (Agamous-like MADS-box protein AGL9 homolog OS=Solanum lycopersicum OX=4081 GN=TDR5 PE=2 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 7.7e-20
Identity = 62/147 (42.18%), Postives = 84/147 (57.14%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE + NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIEGKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSS 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIEN-------------DLQERSESPIDNDLQERSGNLID 147
           +S+L  + RYQ+   G  E  I                L+ R E+     LQ    NL+ 
Sbjct: 61  SSMLKTLERYQKCNYGAPEPNISTREALEISSQQEYLKLKGRYEA-----LQRSQRNLLG 120

Query: 148 NGSGTKNQESDETILVSLGKLLQTIQS 162
              G  N +  E++   L   L+ I+S
Sbjct: 121 EDLGPLNSKELESLERQLDMSLKQIRS 142

BLAST of ClCG06G002140 vs. ExPASy TrEMBL
Match: A0A1S3CCC5 (MADS-box transcription factor 8-like OS=Cucumis melo OX=3656 GN=LOC103498847 PE=4 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 9.0e-64
Identity = 135/167 (80.84%), Postives = 147/167 (88.02%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGRKKIEVKLIEDRCNRHVTFCKRRSGL+KKARELSVLCDVEVG+++FTNRGRLYEFCSG
Sbjct: 1   MGRKKIEVKLIEDRCNRHVTFCKRRSGLLKKARELSVLCDVEVGIILFTNRGRLYEFCSG 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDET 147
           NSLLNIIMRYQ HLQGR+ESPI+NDLQ RSES ID+D ++   +       T    SDET
Sbjct: 61  NSLLNIIMRYQSHLQGRNESPIDNDLQGRSESLIDSDAKDHVSD------ETILDVSDET 120

Query: 148 ILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQR 195
           ILVSL K LQTIQSQVEEPNFKKLD+TQM+QLENQLEGTLDKIKSQR
Sbjct: 121 ILVSLKKQLQTIQSQVEEPNFKKLDITQMVQLENQLEGTLDKIKSQR 161

BLAST of ClCG06G002140 vs. ExPASy TrEMBL
Match: A0A6J1DPA0 (MADS-box transcription factor 8-like isoform X3 OS=Momordica charantia OX=3673 GN=LOC111022564 PE=4 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 3.1e-56
Identity = 128/189 (67.72%), Postives = 147/189 (77.78%), Query Frame = 0

Query: 6   EKRREGERGIRVDCGDTQTRGEMGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVL 65
           +KRR   +GIRVD G      +MGRKKIEVK IED CNRHVTFCKRRSGLIKKARELSVL
Sbjct: 23  DKRRVEGKGIRVDSG-----AKMGRKKIEVKRIEDSCNRHVTFCKRRSGLIKKARELSVL 82

Query: 66  CDVEVGLVVFTNRGRLYEFCSGNSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDL 125
           CDVE+GL++FTNRGRLYEFC GNSLLNII RYQ HL+G+S+          S+SPID   
Sbjct: 83  CDVELGLLIFTNRGRLYEFCRGNSLLNIIERYQSHLKGKSQ----------SQSPID--- 142

Query: 126 QERSGNLIDNGSGTKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEG 185
                  ID  +  ++ +S++TILVSLGKLLQTIQSQVEEP+FKKL+VT+MMQLENQLE 
Sbjct: 143 -------IDTNASDQDHQSNQTILVSLGKLLQTIQSQVEEPDFKKLNVTEMMQLENQLEA 186

Query: 186 TLDKIKSQR 195
           TLDKIK QR
Sbjct: 203 TLDKIKIQR 186

BLAST of ClCG06G002140 vs. ExPASy TrEMBL
Match: A0A6J1DME3 (MADS-box protein FLOWERING LOCUS C-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022564 PE=4 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 6.9e-56
Identity = 134/207 (64.73%), Postives = 155/207 (74.88%), Query Frame = 0

Query: 6   EKRREGERGIRVDCGDTQTRGEMGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVL 65
           +KRR   +GIRVD G      +MGRKKIEVK IED CNRHVTFCKRRSGLIKKARELSVL
Sbjct: 23  DKRRVEGKGIRVDSG-----AKMGRKKIEVKRIEDSCNRHVTFCKRRSGLIKKARELSVL 82

Query: 66  CDVEVGLVVFTNRGRLYEFCSGNSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDL 125
           CDVE+GL++FTNRGRLYEFC GNSLLNII RYQ HL+G+S+          S+SPID   
Sbjct: 83  CDVELGLLIFTNRGRLYEFCRGNSLLNIIERYQSHLKGKSQ----------SQSPID--- 142

Query: 126 QERSGNLIDNGSGTKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEG 185
                  ID  +  ++ +S++TILVSLGKLLQTIQSQVEEP+FKKL+VT+MMQLENQLE 
Sbjct: 143 -------IDTNASDQDHQSNQTILVSLGKLLQTIQSQVEEPDFKKLNVTEMMQLENQLEA 201

Query: 186 TLDKIKSQRNVGLSKRGSLQVRKVLEK 213
           TLDKIK QR   L    S+ +RKV  K
Sbjct: 203 TLDKIKIQR---LYLAMSILLRKVEPK 201

BLAST of ClCG06G002140 vs. ExPASy TrEMBL
Match: A0A6J1DRL9 (MADS-box protein FLOWERING LOCUS C-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022564 PE=4 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 6.9e-56
Identity = 134/207 (64.73%), Postives = 155/207 (74.88%), Query Frame = 0

Query: 6   EKRREGERGIRVDCGDTQTRGEMGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVL 65
           +KRR   +GIRVD G      +MGRKKIEVK IED CNRHVTFCKRRSGLIKKARELSVL
Sbjct: 23  DKRRVEGKGIRVDSG-----AKMGRKKIEVKRIEDSCNRHVTFCKRRSGLIKKARELSVL 82

Query: 66  CDVEVGLVVFTNRGRLYEFCSGNSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDL 125
           CDVE+GL++FTNRGRLYEFC GNSLLNII RYQ HL+G+S+          S+SPID   
Sbjct: 83  CDVELGLLIFTNRGRLYEFCRGNSLLNIIERYQSHLKGKSQ----------SQSPID--- 142

Query: 126 QERSGNLIDNGSGTKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEG 185
                  ID  +  ++ +S++TILVSLGKLLQTIQSQVEEP+FKKL+VT+MMQLENQLE 
Sbjct: 143 -------IDTNASDQDHQSNQTILVSLGKLLQTIQSQVEEPDFKKLNVTEMMQLENQLEA 201

Query: 186 TLDKIKSQRNVGLSKRGSLQVRKVLEK 213
           TLDKIK QR   L    S+ +RKV  K
Sbjct: 203 TLDKIKIQR---LYLAMSILLRKVEPK 201

BLAST of ClCG06G002140 vs. ExPASy TrEMBL
Match: A0A6J1KPU7 (MADS-box protein 04g005320-like OS=Cucurbita maxima OX=3661 GN=LOC111497621 PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 2.3e-51
Identity = 116/176 (65.91%), Postives = 130/176 (73.86%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGRKKIE+K IED CNRHVTFCKRRSGLIKKARELSVLCDVEVGLV+FTNRGRLYEFCSG
Sbjct: 1   MGRKKIELKRIEDMCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVIFTNRGRLYEFCSG 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIENDLQERSESPIDNDLQERSGNLIDNGSGTKNQESDET 147
           +SLLNII RYQ H +GRSES  E D +E                         N+ESDET
Sbjct: 61  DSLLNIIKRYQSHFEGRSESQNEVDTKE-------------------------NEESDET 120

Query: 148 ILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLENQLEGTLDKIKSQRNVGLSKRGS 204
           ++VS GKLLQTIQS VEEP+FKKL+V  M+ LENQLE +LDKIKSQR   + +  S
Sbjct: 121 LMVSFGKLLQTIQSHVEEPDFKKLNVNDMVHLENQLEASLDKIKSQRVEAMMENSS 151

BLAST of ClCG06G002140 vs. TAIR 10
Match: AT1G24260.3 (K-box region and MADS-box transcription factor family protein )

HSP 1 Score: 99.8 bits (247), Expect = 4.2e-21
Identity = 64/153 (41.83%), Postives = 90/153 (58.82%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSS 60

Query: 88  NSLLNIIMRYQRHLQGRSESPIEN---------------DLQERSESPIDNDLQERSGNL 147
           +S+L  + RYQ+   G  E  + +                L+ER ++     LQ    NL
Sbjct: 61  SSMLRTLERYQKCNYGAPEPNVPSREALAVELSSQQEYLKLKERYDA-----LQRTQRNL 120

Query: 148 IDNGSG---TKNQESDETILVSLGKLLQTIQSQ 163
           +    G   TK  ES E  L S  K ++ +++Q
Sbjct: 121 LGEDLGPLSTKELESLERQLDSSLKQIRALRTQ 148

BLAST of ClCG06G002140 vs. TAIR 10
Match: AT1G24260.1 (K-box region and MADS-box transcription factor family protein )

HSP 1 Score: 99.4 bits (246), Expect = 5.4e-21
Identity = 69/165 (41.82%), Postives = 92/165 (55.76%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSS 60

Query: 88  NSLLNIIMRYQRHLQGRSES--PIENDLQERSESPIDNDLQER-------SGNLIDNGSG 147
           +S+L  + RYQ+   G  E   P    L E S       L+ER         NL+    G
Sbjct: 61  SSMLRTLERYQKCNYGAPEPNVPSREALAELSSQQEYLKLKERYDALQRTQRNLLGEDLG 120

Query: 148 ---TKNQESDETILVSLGKLLQTIQSQVEEPNFKKLDVTQMMQLE 181
              TK  ES E  L S  K ++ +++Q        L   + M  E
Sbjct: 121 PLSTKELESLERQLDSSLKQIRALRTQFMLDQLNDLQSKERMLTE 165

BLAST of ClCG06G002140 vs. TAIR 10
Match: AT3G02310.1 (K-box region and MADS-box transcription factor family protein )

HSP 1 Score: 99.4 bits (246), Expect = 5.4e-21
Identity = 60/141 (42.55%), Postives = 87/141 (61.70%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L+VF+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVSLIVFSNRGKLYEFCST 60

Query: 88  NSLLNIIMRYQRHLQGRSE------SPIENDLQE--RSESPIDNDLQERSGNLIDNGSGT 147
           +++L  + RYQ+   G  E        +EN  +E  + +   +N LQ +  NL+    G 
Sbjct: 61  SNMLKTLERYQKCSYGSIEVNNKPAKELENSYREYLKLKGRYEN-LQRQQRNLLGEDLGP 120

Query: 148 KNQESDETILVSLGKLLQTIQ 161
            N +  E +   L   L+ ++
Sbjct: 121 LNSKELEQLERQLDGSLKQVR 140

BLAST of ClCG06G002140 vs. TAIR 10
Match: AT5G15800.1 (K-box region and MADS-box transcription factor family protein )

HSP 1 Score: 99.4 bits (246), Expect = 5.4e-21
Identity = 65/164 (39.63%), Postives = 96/164 (58.54%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSS 60

Query: 88  NSLLNIIMRYQRHLQGRSE------SPIENDLQE--RSESPIDNDLQERSGNLIDNGSGT 147
           +++L  + RYQ+   G  E        +EN  +E  + +   +N LQ +  NL+    G 
Sbjct: 61  SNMLKTLDRYQKCSYGSIEVNNKPAKELENSYREYLKLKGRYEN-LQRQQRNLLGEDLGP 120

Query: 148 KNQESDETILVSLG---KLLQTIQSQVEEPNFKKLDVTQMMQLE 181
            N +  E +   L    K +++I++Q        L   + M LE
Sbjct: 121 LNSKELEQLERQLDGSLKQVRSIKTQYMLDQLSDLQNKEQMLLE 163

BLAST of ClCG06G002140 vs. TAIR 10
Match: AT5G15800.2 (K-box region and MADS-box transcription factor family protein )

HSP 1 Score: 99.4 bits (246), Expect = 5.4e-21
Identity = 65/164 (39.63%), Postives = 96/164 (58.54%), Query Frame = 0

Query: 28  MGRKKIEVKLIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG 87
           MGR ++E+K IE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS 
Sbjct: 1   MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCSS 60

Query: 88  NSLLNIIMRYQRHLQGRSE------SPIENDLQE--RSESPIDNDLQERSGNLIDNGSGT 147
           +++L  + RYQ+   G  E        +EN  +E  + +   +N LQ +  NL+    G 
Sbjct: 61  SNMLKTLDRYQKCSYGSIEVNNKPAKELENSYREYLKLKGRYEN-LQRQQRNLLGEDLGP 120

Query: 148 KNQESDETILVSLG---KLLQTIQSQVEEPNFKKLDVTQMMQLE 181
            N +  E +   L    K +++I++Q        L   + M LE
Sbjct: 121 LNSKELEQLERQLDGSLKQVRSIKTQYMLDQLSDLQNKEQMLLE 163

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008459847.11.9e-6380.84PREDICTED: MADS-box transcription factor 8-like [Cucumis melo][more]
XP_011656809.11.0e-6179.04MADS-box transcription factor 8 [Cucumis sativus] >KAE8646765.1 hypothetical pro... [more]
XP_022155417.16.4e-5667.72MADS-box transcription factor 8-like isoform X3 [Momordica charantia][more]
XP_022155416.11.4e-5564.73MADS-box protein FLOWERING LOCUS C-like isoform X2 [Momordica charantia][more]
XP_022155415.11.4e-5564.73MADS-box protein FLOWERING LOCUS C-like isoform X1 [Momordica charantia][more]
Match NameE-valueIdentityDescription
Q9SAR11.4e-2133.33MADS-box transcription factor 8 OS=Oryza sativa subsp. japonica OX=39947 GN=MADS... [more]
Q7Y0401.5e-2037.31MADS-box protein EJ2 OS=Solanum lycopersicum OX=4081 GN=EJ2 PE=1 SV=1[more]
Q034892.0e-2042.18Agamous-like MADS-box protein AGL9 homolog OS=Petunia hybrida OX=4102 GN=FBP2 PE... [more]
Q386942.6e-2031.90Agamous-like MADS-box protein AGL9 homolog OS=Aranda deborah OX=29714 PE=2 SV=1[more]
Q424647.7e-2042.18Agamous-like MADS-box protein AGL9 homolog OS=Solanum lycopersicum OX=4081 GN=TD... [more]
Match NameE-valueIdentityDescription
A0A1S3CCC59.0e-6480.84MADS-box transcription factor 8-like OS=Cucumis melo OX=3656 GN=LOC103498847 PE=... [more]
A0A6J1DPA03.1e-5667.72MADS-box transcription factor 8-like isoform X3 OS=Momordica charantia OX=3673 G... [more]
A0A6J1DME36.9e-5664.73MADS-box protein FLOWERING LOCUS C-like isoform X1 OS=Momordica charantia OX=367... [more]
A0A6J1DRL96.9e-5664.73MADS-box protein FLOWERING LOCUS C-like isoform X2 OS=Momordica charantia OX=367... [more]
A0A6J1KPU72.3e-5165.91MADS-box protein 04g005320-like OS=Cucurbita maxima OX=3661 GN=LOC111497621 PE=4... [more]
Match NameE-valueIdentityDescription
AT1G24260.34.2e-2141.83K-box region and MADS-box transcription factor family protein [more]
AT1G24260.15.4e-2141.82K-box region and MADS-box transcription factor family protein [more]
AT3G02310.15.4e-2142.55K-box region and MADS-box transcription factor family protein [more]
AT5G15800.15.4e-2139.63K-box region and MADS-box transcription factor family protein [more]
AT5G15800.25.4e-2139.63K-box region and MADS-box transcription factor family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 176..196
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 105..123
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 105..142
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 124..142
NoneNo IPR availablePANTHERPTHR48019:SF2BNAC02G43470D PROTEINcoord: 28..194
NoneNo IPR availablePANTHERPTHR48019SERUM RESPONSE FACTOR HOMOLOGcoord: 28..194
IPR002100Transcription factor, MADS-boxPRINTSPR00404MADSDOMAINcoord: 30..50
score: 53.57
coord: 65..86
score: 52.14
coord: 50..65
score: 76.85
IPR002100Transcription factor, MADS-boxSMARTSM00432madsneu2coord: 28..87
e-value: 9.0E-32
score: 121.5
IPR002100Transcription factor, MADS-boxPFAMPF00319SRF-TFcoord: 38..84
e-value: 1.5E-22
score: 78.9
IPR002100Transcription factor, MADS-boxPROSITEPS00350MADS_BOX_1coord: 30..84
IPR002100Transcription factor, MADS-boxPROSITEPS50066MADS_BOX_2coord: 28..88
score: 29.422029
IPR036879Transcription factor, MADS-box superfamilyGENE3D3.40.1810.10coord: 40..119
e-value: 4.0E-24
score: 86.5
IPR036879Transcription factor, MADS-box superfamilySUPERFAMILY55455SRF-likecoord: 28..101
IPR033896MADS MEF2-likeCDDcd00265MADS_MEF2_likecoord: 29..100
e-value: 1.1887E-31
score: 111.105

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG06G002140.2ClCG06G002140.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045944 positive regulation of transcription by RNA polymerase II
cellular_component GO:0005634 nucleus
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0000977 RNA polymerase II transcription regulatory region sequence-specific DNA binding
molecular_function GO:0003677 DNA binding