Cp4.1LG04g06860 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG04g06860
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionmediator of RNA polymerase II transcription subunit 15a-like
LocationCp4.1LG04: 4599218 .. 4620775 (-)
RNA-Seq ExpressionCp4.1LG04g06860
SyntenyCp4.1LG04g06860
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTATGATTTCTTGCTTCTTTCATGCATATTTGAGAAATTTGTTTGAGCTTTTGTAACTTCTTTTGTGTTCATAGTTCGCTGCTTAGATTAGTTACTATGGGATGGTTTTCTGAATTAAGCTATGAACAGCAAAATTTTGGAAGTATAGAATCATCTTCGACCTTATTCTACCATTTCTTAGATTTGTGTATAACTGATCGATGCGAACTGATTCCAAAACCTAGTTCTATAATATTATCTCCTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGGTATTAGCCCCTTCTTTGAACATAAGAAGTATAAGTTATTTGAAGTAGTATGGATCCTATTGAATTTTCCTCGACTGATCTAAAGTCCCTTTCTTCAAAATGCCTAATTGTGAGATCCCACATTGGTTGAAGAGGGTGGAACGAAACATTCCTCCTTAAGGTGTGGAAACCTCTCCCTAGTAAACGTGTTTTAAAACTGCGAGACTGACGATGATACGTAATAGGCCAAAGCAGGCAATATCTGTTAGAGGTGGGCTTGGGATGTTACAAATGGTATCAGAATCAAACACCGGGAGGTATGCCAGCGAGGACGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACAACATTCCTTATAAGGGTGTAGAAACGTCTCCCTAGAAGACACACTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCTAAGCGGACAATATCTACTAGCGGTGGGCTTGAGCTGTTACAAATGGTATCAGAGTCAGACACTGGGCGGTGTGCCAGCGAAGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCATGTTGGTTGGAGAGGGGAACGAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGATAGCGATAAGGGGCCAAAGCGAACAATATCTGTTAGTGGNATTTAGATTGTTTCGGAACTTCTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTATGATTTCTTGCTTCTTTCATGCATATTTGAGAAATTTGTTTGAGCTTTTGTAACTTCTTTTGTGTTCATAGTTCGCTGCTTAGATTAGTTACTATGGGATGGTTTTCTGAATTAAGCTATGAACAGCAAAATTTTGGAAGTATAGAATCATCTTCGACCTTATTCTACCATTTCTTAGATTTGTGTATAACTGATCGATGCGAACTGATTCCAAAACCTAGTTCTATAATATTATCTCCTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGGTATTAGCCCCTTCTTTGAACATAAGAAGTATAAGTTATTTGAAGTAGTATGGATCCTATTGAATTTTCCTCGACTGATCTAAAGTCCCTTTCTTCAAAATGCCTAATTGTGAGATCCCACATTGGTTGAAGAGGGTGGAACGAAACATTCCTCCTTAAGGTGTGGAAACCTCTCCCTAGTAAACGTGTTTTAAAACTGCGAGACTGACGATGATACGTAATAGGCCAAAGCAGGCAATATCTGTTAGAGGTGGGCTTGGGATGTTACAAATGGTATCAGAATCAAACACCGGGAGGTATGCCAGCGAGGACGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACAACATTCCTTATAAGGGTGTAGAAACGTCTCCCTAGAAGACACACTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCTAAGCGGACAATATCTACTAGCGGTGGGCTTGAGCTGTTACAAATGGTATCAGAGTCAGACACTGGGCGGTGTGCCAGCGAAGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCATGTTGGTTGGAGAGGGGAACGAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGATAGCGATAAGGGGCCAAAGCGAACAATATCTGTTAGTGGTGAGCTTGGATTGTTACAAATGGTATCAGAGTCAGACATCGGGCGGTGTGCCAGCGAGGACGTTAGGCCCTCAAGGGGGTGGAATGTTAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTAATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAAGNGCCAAAGCAGGCAATATCTGTTAGAGGTGGGCTTGGGATGTTACAAATGGTATCAGAATCAAACACCGGGAGGTATGCCAGCGAGGACGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACAACATTCCTTATAAGGGTGTAGAAACGTCTCCCTAGAAGACACACTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCTAAGCGGACAATATCTACTAGCGGTGGGCTTGAGCTGTTACAAATGGTATCAGAGTCAGACACTGGGCGGTGTGCCAGCGAAGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCATGTTGGTTGGAGAGGGGAACGAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGATAGCGATAAGGGGCCAAAGCGAACAATATCTGTTAGTGGTGAGCTTGGATTGTTACAAATGGTATCAGAGTCAGACATCGGGCGGTGTGCCAGCGAGGACGTTAGGCCCTCAAGGGGGTGGAATGTTAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTAATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAAGAGCGAAACATAACGGACCAAAGCGGACAATATCTGCTAGTGGTGGGCTTAGACTGTTACACAAATATTTTGAATCATGAACGTGGATTACTGGGGTAGTTTTTCCTTCCTGAACAGCTTCCCATGCTTTTGTGTTTAATGAACTTGTCAGCCATCATCTTCTCCATTTTTATGTAGAATGGAGACATTGAAGAGGCTCCTTCCTGTTTCTGGTCCTGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGGTACTTCCTGCTCACTTGTTATAGTTCTCAAGGATTGTAAGAACAAACATTATCTTATTTATGTTTAGCCCATTTGCTATAACAATAAGATTCCAATTTGATTTGGTTCCTAAGATTTAGTTCTGTACTGAAAGATACTTTTGTATAAGAACTTGTTCATAAAGCCCTGATTTATTATGCTATGTAGTGTTGCACTAAATAAATACCAATTATATTTTGAACCTCTGATTTCAGTCAGAGTACCGAAGGAAAATANGTTGGAGAGGGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGAAGACACGCTTTAAAAACGTGAGGCTTACGACAATACGTAACAGGCCAAAGCGGACAATATCTATTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTCAGTCACTGGCCGGTGTGCCAGTGAGGACTCTAGGCCCTCAAGGGGGGGGGGGGANAACTGTGAAGATGCTTACTATGGAACCCAACTCTGAGACGACCACTGCATTACCACCAAAGTCTGCGCCCACGATCAACTGCCGAATTCGTTCGACCCGATTCACATCTCCGCCGCCGAATTCTGGCTTTCGTTCAGCCCTAAACAACGAGTAAACTCCATAACAGCCTTACTTTCCATCCTTGAAGGACTCTATTCAATCTCTCTTTTTGCTTTGTTATTGGGATTTTGACGCAGTTTTTTCTCGTGGGTTGTTTTGGAACTTTTGAATGGATTCGAATAATTGGAGGCCTGCTCAAGGTGGAGAACCCAGAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTATGATTTCTTGCTTCTTTCATGCATATTTGAGAAATTTGTTTGAGCTTTTGTAACTTCTTTTGTGTTCATAGTTCGCTGCTTAGATTAGTTACTATGGGATGGTTTTCTGAATTAAGCTATGAACAGCAAAATTTTGGAAGTATAGAATCATCTTCGACCTTATTCTACCATTTCTTAGATTTGTGTATAACTGATCGATGCGAACTGATTCCAAAACCTAGTTCTATAATATTATCTCCTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGGTATTAGCCCCTTCTTTGAACATAAGAAGTATAAGTTATTTGAAGTAGTATGGATCCTATTGAATTTTCCTCGACTGATCTAAAGTCCATTCCTTCAAAATGCCTAATTGTGAGATCCCACATTGGTGGAAGAGGGAACAAAACATTTCTCATTAAGATGTAGAAACCTCTCCCTAGTAGACGTGTTTTAAAACTGCGAGACTGACGACGATATGTAACGGGCTAAAGCGGACAATATCTGTTAGCGGTGGGCTTGAGATGTTACAAATGGTATCAGAATCAAACACTGGGAGGTATGCCAGCGAGGGTGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGAGGAACGAAACACTCCTATAAGGGTGTAGAAACCTCTCCCTAAAAGACACGGTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCATAGCGGACAATATCCCAGTGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTAGACACTTGGCGGTGTGCCAGCGAGGACTCTAGGCCCTCAATAGGGGTGGATTGTGAGATCCTACATTAGTTGGAGAGGGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGAAGACACGCTTTAAAAACGTGAGGCTTACGACAATACGTAACAGGCCAAAGCGGACAATATCTATTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTCAGTCACTGGCCGGTGTGCCAGTGAGGACTCTAGGCCCTCAAGGGGGGGGGATTGTGAGATCCCACATTGGTTGGAGAGTGGAACGAAACATTCCTTAGAAGGGTGTAGGAACCTCTACCTAGAAGACACGCTTTAAAATCGTGAGGCTGACAACAATATGTAACGGGCCAAAGCGGACAATATCTACTAGTGGTGAGCTTGGGCTGTTACAAACGGTATCAAAGTCAGACACTGGGCGGTGTGTCAGCGAGGGCTCTAGGCCCTCAAGGGGGGTGGATTGTGAAATCCTACATTGGTTGGAGGGGGGAACGAAACATTCCTTAGAAGGGTGTAGAAACCTCTCCCTAGAAGACATACTTTAAAATCGTGAGGCTGATGACAATACATAACGGGCCAAAGCGGACAATATCTACTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTCAGACACTGGCCGGTGTGCTAGCGAGGACTCTAGGCCTTCAAGGTGGGTGAATTGTGAGGTCCCACATTGGTTGGAGAGGGGAACAAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGACAGCGATAGGAGGCCAAAGCGGACAATATATATTAGTGGTGAACTTGGACTATTACAAATGGTATTAGAGTCAGACATCTGGCGGTGTGCCAGCGAGGACGCTAGGCCCTCAAGGGGGTGGAATGTGAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTCATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAACNATAAGGGTGTAGAAACCTCTCCCTAAAAGACACGGTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCATAGCGGACAATATCCCAGTGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTAGACACTTGGCGGTGTGCCAGCGAGGACTCTAGGCCCTCAATAGGGGTGGATTGTGAGATCCTACATTAGTTGGAGAGGGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGAAGACACGCTTTAAAAACGTGAGGCTTACGACAATACGTAACAGGCCAAAGCGGACAATATCTATTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTCAGTCACTGGCCGGTGTGCCAGTGAGGACTCTAGGCCCTCAAGGNAAGACATGCTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCAAAGCGGGCAATATCTACTAGTGGTGTGCTTGGACTGTTACAAATGGTATCAGAGTCGGACAATGGGTGGTGTGCCAGCGAGGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGTGGAACGAAACATTCCTTAGAAGGGTGTAGGAACCTCTACCTAGAAGACACGCTTTAAAATCGTGAGGCTGACAACAATATGTAACGGGCCAAAGCGGACAATATCTACTAGTGGTGAGCTTGGGCTGTTACAAACGGTATCAAAGTCAGACACTGGGCGGTGTGTCAGCGAGGGCTCTAGGCCCTCAAGGGGGGTGGATTGTGAAATCCTACATTGGTTGGAGGGGGGAACGAAACATTCCTTAGAAGGGTGTAGAAACCTCTCCCTAGAAGACATACTTTAAAATCGTGAGGCTGATGACAATACATAACGGGCCAAAGCGGACAATATCTACTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTCAGACACTGGCCGGTGTGCTAGCGAGGACTCTAGGCCTTCAAGGTGGGTGAATTGTGAGGTCCCACATTGGTTGGAGAGGGGAACAAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGACAGCGATAGGAGGCCAAAGCGGACAATATATATTAGTGGTGAACTTGGACTATTACAAATGGTATTAGAGTCAGACATCTGGCGGTGTGCCAGCGAGGACGCTAGGCCCTCAAGGGGGTGGAATGTGAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTCATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAACAGCGAAACGTAACGGGACAAAGCGAACAATATCGGCTAGCGGTGGGCTTAGGCTATTACACTAATATTTTGAATCATGAACGTGGATTACTACGGTAGTTTTTCCTTCCTGAACAGCTTCCCATGCTTTTGTGTTTAATAACCTTGTCAGCCATCATCTTTTCCATTTTTATGTAGAATGGAGACATTGAAGAGGATCCTTCCTGTTTCTGGTCCCGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGGTACTTCCTGCTCACTTGTTATAGTTCTCAAGGATTTTAAGAACAAACATTATCTTATTTATGTTTAGCCCATTTGCTAGAACAATAAGATTCCAATTTGATTTGGTTCCTAAGATTTAGTTCTGTACTGATAGATACTTTTGTATAAGAACTTGTTCATAAAGCCCTGATTTATTATGCTATGTAGTGTTGCACTAAATAAATACCAATTATATTTTGAACCTCTGATTTCAGTCAGAGTACCGAAGGAAAATATATGTGAAGATGCTTCTTATGGAACCCAACTCTGAGACGACCACTGCATTACCATCAATGTCTGGGCCTATGGCTTCCGATCAACCTTAGAAACAGGCTGCAATTACTCTTCCACAACATCCAAAATAACACTGCTTCCTCTAATGTATTTACCAACTCTTAGAGGCCAATATACTTCATTGGCTAATAAGGTGAATGAGTCTACAGTTTGAACGTGTTAGTGTTGGAATGAGATATTGTATGATAATTGAAACTGTTCTGAATCTTGAATGTTCATGAGATTGTTTTGAATGAAATATTGTATGACAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTAAATGAGATATTATATGATCATTGAAATTGTTTTTAATCTTGAATGTTGATGATATTGTTTTGAATGAAATATTATATGATAATTGAAATTGTTCTTAATCTTGAATGTCCATGATATTGTTTTGAATGAGATATTATATGATCATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGTTTTGAATGAGATATTATGTGATAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGANTCCCTAGTAGACGCGTTTTAAAACAGCGAAACGTAACGGGACAAAGCGAACAATATCGGCTAGCGGTGGGCTTAGGCTATTACACTAATATTTTGAATCATGAACGTGGATTACTACGGTAGTTTTTCCTTCCTGAACAGCTTCCCATGCTTTTGTGTTTAATAACCTTGTCAGCCATCATCTTTTCCATTTTTATGTAGAATGGAGACATTGAAGAGGATCCTTCCTGTTTCTGGTCCCGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGGTACTTCCTGCTCACTTGTTATAGTTCTCAAGGATTTTAAGAACAAACATTATCTTATTTATGTTTAGCCCATTTGCTAGAACAATAAGATTCCAATTTGATTTGGTTCCTAAGATTTAGTTCTGTACTGATAGATACTTTTGTATAAGAACTTGTTCATAAAGCCCTGATTTATTATGCTATGTAGTGTTGCACTAAATAAATACCAATTATATTTTGAACCTCTGATTTCAGTCAGAGTACCGAAGGAAAATATATGTGAAGATGCTTCTTATGGAACCCAACTCTGAGACGACCACTGCATTACCATCAATGTCTGGGCCTATGGCTTCCGATCAACCTTAGAAACAGGCTGCAATTACTCTTCCACAACATCCAAAATAACACTGCTTCCTCTAATGTATTTACCAACTCTTAGAGGCCAATATACTTCATTGGCTAATAAGGTGAATGAGTCTACAGTTTGAACGTGTTAGTGTTGGAATGAGATATTGTATGATAATTGAAACTGTTCTGAATCTTGAATGTTCATGAGATTGTTTTGAATGAAATATTGTATGACAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTAAATGAGATATTATATGATCATTGAAATTGTTTTTAATCTTGAATGTTGATGATATTGTTTTGAATGAAATATTATATGATAATTGAAATTGTTCTTAATCTTGAATGTCCATGATATTGTTTTGAATGAGATATTATATGATCATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGTTTTGAATGAGATATTATGTGATAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGAATGAGATATTATCTGATAATTGAAACGGTTTTGAATCTTGTATGTTCATGATATTGTTGTGATAGTTTTTCATGTTCGTATCACTAAATGAAACTGATTCGAAATTTCACGCATATTTGTGACTTATCGGTGACAATAACACGAAAATGGACTTAGAGTGATGTTGTGTGAGTGAAGAGTAGACTCGTGGTGGCCGAAAATCAAGCAGAGACGTCAGAAGGAAAAAGAATCGATCGAAGATTAGGGTTGCGATTAGCGTGACAGTCGGGTGAAGAAGAAACGACCCGTGGACTCGGGTTTGGACTCGGGGCACACCATCTGCAGCCACTGACTTCGTGTAGCCAACGACGACAGGTAAGCTAGGCCCACTTAATGCTACGCATATGGCCTATGAGACGCACCTTGCAACCCACAAAACTCGCGCGATCTGCACTCCCGAGCAACCTGACCCGAAGTCTGTGCCTCAACCCGACTAATGACTCGCGCTTCTCAGCCCCACACGTGTCTTGTCTCCAGTTGCGGCTCGTTCAGCATTGCACTCCTCGACTCGGCTCTGGCGCTGCTCCGTGGCGTAAACGGCTCGGGTTTGTGTATTTCAACATGCAATATAATCTAAACTTAGCCCTACAGATAAAGAATCACTTCAAGGGAAAGTAAAATTCAGATTAACTTGTTGATCAAATATCTCAAGACAATAATACTTGTTTGAGATTCGAATCACTCCACAAACAAGATTGATCTCGAGCTTGAATAATTCTATATGCAACCTAAACTACATAGAATTGCAAAGAAACAACATTGGCTAAAGGAAAGCACAAATGCTCATTTACTGTATTTTAAAAGTCTATTTTACAACTCTAACATACATGGCATTATATAGGCTCAAAATAAAAACTCTTTAACCTTCCACGGGGCTTTCCAAGAGATGTAACTTTCATACTTTATGACCACAATTAGACCATAATTAGTCACCAGGTAAATAAACCTTAAAATACACTAAAAATACAATAACTCTAGATTATCAACATTTAAGAATAAATGAAGCCCCATCTTGAAGTCTTTGTTGCATAAATGAAGCTTGATTATTCTTAACGTGACATTAATGTTGCATGATAAGATGCCTTGGTTCATATCGATTCCTGTTTCCTCAATTTAAGTTGAATTAAGTTAGGAGTTGGATTTACGAAAAGAATTAGGTAAATATTTCTAATAGGTTGAAGCGCCTTCGAATAAGTTGAAACGAACTTCGAGGGACGAAGTCAAACCAATTCAGAAATTAGATAGAGTTTCGGTTAAAGCCAACCAAGTAAGTGGCTCTACTATCAGTATGGTTGGAAGAGTTGCTTTATATATGATGACACGTGCCTAGTGGCCATGTATGTGTCATATGTTTGATGATATAATATGTTTTGCTACATGATATGCCCAGATATGCCAATTAATATTAGAACGTAAATGAATTTCGTATTATGCCTCGATGAATTATGATATGCTATGAAATGCCATTATACGCTGTGATATGATTATACTAGGTCATGATATGTTACGACATGCTATGACATGTTTTGAGACAAGAAGATTTATGATCGACAACCCTTAAGATATGTTTCAAAGATTATGCTATAAATGAAATGATTTTGTAAGGGCTGTCTTGCACGATTTGTTTGCAATGAAAAATGTTGGGACCTCATGCATAAATGTATGTTCACAAACGTAGGGATATTTTCTCTTATGAAGAGTACGAATGCGTACGTTATGAAAAGGAAAATGATCATCATGTTTATCATGATGCTACGACGGTCGCTATTGAATGTTGCAATGTTGCCTTCTAACTTGGGCTGTCTTTAGGATGGTTGATGGCTCGACCGCGCACGACCTTTTGGGCTTGGCTTCGAATGATGATTTTGAACCCCCGCTTGCGGTGGGTGTGAATAGTCGATCGAAGTCCAAGCTCGTTTTAGCGTACGACGCATGAGTGGTGATTTGGTTGCTTGGGTCTATAGGCTTGATGCTCGGGCATCAAACTGTAGGCGACATCTCCTCCTCATTGGTGTGTCCCAAAGTATTTTGAGTGACCGTAGTCCATTTGGGGGTCCGACAATAACACCAGAGAAGCTAGCTCGACCAACACTAGAATTTATGTTTGAATTTTGGGCATTGGATGGGATGAGGTGTGCCTCCTTTAGAACATATTCAAATTTGGTGAAGATCCAACGATTGAAAATGAAGATACGGTAGATTTTCTGATTTGTACTATTTCCGTAATAATCTCAAAGTTCAAGGGTATTTTGGTAATTTCAAAGGTCTAAAATTACTTCTTAGGGTATTGAAAATAACTTGCTATAAGAGTGTCTTAAGATTGGACTCCAATGTGACTACGACTGGACTAGGTCAGTTGCCCAATCCAACATTTGTTGACTGATTCAAAGATTTCAAGAGGTAATTTCTTAGAAATTCATCAGAAAATTTGATGAACATTGAGGAGAAACCGAGTAACTTGGTACCCAAGGTGTGAGGAGACAATCATTTATAAGGCTGTTCCCGATTCACTCTCTCCTTTTGCTTTGTTATTAGGGTTTTGACGCAGTTTAGATTGTTTCGAAACTTTTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATNAACATTGGCTAAAGGAAAGCACAAATGCTCATTTACTGTATTTTAAAAGTCTATTTTACAACTCTAACATACATGGCATTATATAGGCTCAAAATAAAAACTCTTTAACCTTCCACGGGGCTTTCCAAGAGATGTAACTTTCATACTTTATGACCACAATTAGACCATAATTAGTCACCAGGTAAATAAACCTTAAAATACACTAAAAATACAATAACTCTAGATTATCAACATTTAAGAATAAATGAAGCCCCATCTTGAAGTCTTTGTTGCATAAATGAAGCTTGATTATTCTTAACGTGACATTAATGTTGCATGATAAGATGCCTTGGTTCATATCGATTCCTGTTTCCTCAATTTAAGTTGAATTAAGTTAGGAGTTGGATTTACGAAAAGAATTAGGTAAATATTTCTAATAGGTTGAAGCGCCTTCGAATAAGTTGAAACGAACTTCGAGGGACGAAGTCAAACCAATTCAGAAATTAGATAGAGTTTCGGTTAAAGCCAACCAAGTAAGTGGCTCTACTATCAGTATGGTTGGAAGAGTTGCTTTATATATGATGACACGTGCCTAGTGGCCATGTATGTGTCATATGTTTGATGATATAATATGTTTTGCTACATGATATGCCCAGATATGCCAATTAATATTAGAACGTAAATGAATTTCGTATTATGCCTCGATGAATTATGATATGCTATGAAATGCCATTATACGCTGTGATATGATTATACTAGGTCATGATATGTTACGACATGCTATGACATGTTTTGAGACAAGAAGATTTATGATCGACAACCCTTAAGATATGTTTCAAAGATTATGCTATAAATGAAATGATTTTGTAAGGGCTGTCTTGCACGATTTGTTTGCAATGAAAAATGTTGGGACCTCATGCATAAATGTATGTTCACAAACGTAGGGATATTTTCTCTTATGAAGAGTACGAATGCGTACGTTATGAAAAGGAAAATGATCATCATGTTTATCATGATGCTACGACGGTCGCTATTGAATGTTGCAATGTTGCCTTCTAACTTGGGCTGTCTTTAGGATGGTTGATGGCTCGACCGCGCACGACCTTTTGGGCTTGGCTTCGAATGATGATTTTGAACCCCCGCTTGCGGTGGGTGTGAATAGTCGATCGAAGTCCAAGCTCGTTTTAGCGTACGACGCATGAGTGGTGATTTGGTTGCTTGGGTCTATAGGCTTGATGCTCGGGCATCAAACTGTAGGCGACATCTCCTCCTCATTGGTGTGTCCCAAAGTATTTTGAGTGACCGTAGTCCATTTGGGGGTCCGACAATAACACCAGAGAAGCTAGCTCGACCAACACTAGAATTTATGTTTGAATTTTGGGCATTGGATGGGATGAGGTGTGCCTCCTTTAGAACATATTCAAATTTGGTGAAGATCCAACGATTGAAAATGAAGATACGGTAGATTTTCTGATTTGTACTATTTCCGTAATAATCTCAAAGTTCAAGGGTATTTTGGTAATTTCAAAGGTCTAAAATTACTTCTTAGGGTATTGAAAATAACTTGCTATAAGAGTGTCTTAAGATTGGACTCCAATGTGACTACGACTGGACTAGGTCAGTTGCCCAATCCAACATTTGTTGACTGATTCAAAGATTTCAAGAGGTAATTTCTTAGAAATTCATCAGAAAATTTGATGAACATTGAGGAGAAACCGAGTAACTTGGTACCCAAGGTGTGAGGAGACAATCATTTATAAGGCTGTTCCCGATTCACTCTCTCCTTTTGCTTTGTTATTAGGGTTTTGACGCAGTTTAGATTGTTTCGAAACTTTTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATGTATGATTACTTTCTTCTTTCATGCCATTAATACCCTATTTGAGAAATTTGTTTGCGTTTTTGTAACTTTTTGATTCATAGTTCGCTGCTTAGATTAGTTACTAGGGGATGGTTTTCTGAATTAAGCTATGAACGGCAAAACCTTATCCTACTTTTTCTTAGATTTGTGTTGAACTGATGGATGGGAATTGATTCCAAAACCTCGTTGAAGATTGTTGGGAGGGAATTCCATGTTGGTTAATTAAGAGGATGATTGTAAGAAATATTATCTTCATTGGTATGAGGCCTTTTGGAAAACCAAAAGAAAAGCCACGAGAGGTTATGCTCAAAGTGGAAACTATCATATCATTGTAAAGAGTCGTGATTCCTAACATGGTATCAAAGTCATGCACTTAACTTAGCAATGTTAATAGAATCTTCAATTGTCGAACAAATTGTAAGCCTGGAAGGTATAGTCAAAAGTGCTCAAGAGAAAGGAGTCGAGCCTCGTTTAAGGGGCAAATGTTTGACAGCCACATAGACCTCAAAGAAGGCTCTATGGTGTAGTTTGTTCGAGGAAAGGATTGTTGAGAATTGTTGGGAGAGTCGTGGTTCCTAACCAATTTGTTAAAATTATCCCCGTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGATTTTGTGGAGGCAGTTCCTATTTGTTTCTTCAGGTATTATGGTAGGTTAGATTTTGTGGAGGCGGTTCCTATTTGTTGCTTCAGGTATCACCCCCTTCATCGAACATAAGAAGTATAAGTTATTTGAAGCAGTATGGATCCTATTGAATTTTCCTCAACTATTCTAAATTCCATTCCTTTAGAATGCTCTTATATTAGTTGAAAATGCTCCTATATTAGTTGGAGAAGGGAACAAAACATTTCTCGTAAGGGTGTGAAAACGTTTCCTTAGTAGACGTGTTTTAGAACTGTGAGGTTGACGACGATACGAAACAGGCCAAAGTGGACAATATTTGTNAAGTCCAAGCTCGTTTTAGCGTACGACGCATGAGTGGTGATTTGGTTGCTTGGGTCTATAGGCTTGATGCTCGGGCATCAAACTGTAGGCGACATCTCCTCCTCATTGGTGTGTCCCAAAGTATTTTGAGTGACCGTAGTCCATTTGGGGGTCCGACAATAACACCAGAGAAGCTAGCTCGACCAACACTAGAATTTATGTTTGAATTTTGGGCATTGGATGGGATGAGGTGTGCCTCCTTTAGAACATATTCAAATTTGGTGAAGATCCAACGATTGAAAATGAAGATACGGTAGATTTTCTGATTTGTACTATTTCCGTAATAATCTCAAAGTTCAAGGGTATTTTGGTAATTTCAAAGGTCTAAAATTACTTCTTAGGGTATTGAAAATAACTTGCTATAAGAGTGTCTTAAGATTGGACTCCAATGTGACTACGACTGGACTAGGTCAGTTGCCCAATCCAACATTTGTTGACTGATTCAAAGATTTCAAGAGGTAATTTCTTAGAAATTCATCAGAAAATTTGATGAACATTGAGGAGAAACCGAGTAACTTGGTACCCAAGGTGTGAGGAGACAATCATTTATAAGGCTGTTCCCGATTCACTCTCTCCTTTTGCTTTGTTATTAGGGTTTTGACGCAGTTTAGATTGTTTCGAAACTTTTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATGTATGATTACTTTCTTCTTTCATGCCATTAATACCCTATTTGAGAAATTTGTTTGCGTTTTTGTAACTTTTTGATTCATAGTTCGCTGCTTAGATTAGTTACTAGGGGATGGTTTTCTGAATTAAGCTATGAACGGCAAAACCTTATCCTACTTTTTCTTAGATTTGTGTTGAACTGATGGATGGGAATTGATTCCAAAACCTCGTTGAAGATTGTTGGGAGGGAATTCCATGTTGGTTAATTAAGAGGATGATTGTAAGAAATATTATCTTCATTGGTATGAGGCCTTTTGGAAAACCAAAAGAAAAGCCACGAGAGGTTATGCTCAAAGTGGAAACTATCATATCATTGTAAAGAGTCGTGATTCCTAACATGGTATCAAAGTCATGCACTTAACTTAGCAATGTTAATAGAATCTTCAATTGTCGAACAAATTGTAAGCCTGGAAGGTATAGTCAAAAGTGCTCAAGAGAAAGGAGTCGAGCCTCGTTTAAGGGGCAAATGTTTGACAGCCACATAGACCTCAAAGAAGGCTCTATGGTGTAGTTTGTTCGAGGAAAGGATTGTTGAGAATTGTTGGGAGAGTCGTGGTTCCTAACCAATTTGTTAAAATTATCCCCGTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGATTTTGTGGAGGCAGTTCCTATTTGTTTCTTCAGGTATTATGGTAGGTTAGATTTTGTGGAGGCGGTTCCTATTTGTTGCTTCAGGTATCACCCCCTTCATCGAACATAAGAAGTATAAGTTATTTGAAGCAGTATGGATCCTATTGAATTTTCCTCAACTATTCTAAATTCCATTCCTTTAGAATGCTCTTATATTAGTTGAAAATGCTCCTATATTAGTTGGAGAAGGGAACAAAACATTTCTCGTAAGGGTGTGAAAACGTTTCCTTAGTAGACGTGTTTTAGAACTGTGAGGTTGACGACGATACGAAACAGGCCAAAGTGGACAATATTTGTTAGTAGCGAGCTTAGACTGTTACAAATGGTATCAGAGTCAGACATCGAAAGATGTGCCAGCGAAGACGCTAGGCTCTCAAAGGGGTGGATTGTGAGATCCCACGTTGGTTGGAGAGGGGAATGAAACATTCTTCATAAGAGTGTGGAAACCTCTCCCTAGTAGATGCGTTTTAAAACTGTGAGGCTGACAGTAATATGTAACGGACCAAAGTGGACAATATCTGCTAGCGGTAAGCTTAAGCTGTTACACTAATATTTTGAATCATGGACGTGGGTTACAGCTATAGTGTTTCGCTCCCAAACAGCTTCCTGTGCTTTTGCATTTAATAAACTTGTCAGCCACCATCTTTTCCATTTTTATGTAGAATTGAGACATTGAAGAGGCACCTTCCTGTTTCGGGTCCTGAGGGGTTGAGTGAGGTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTCACACTGTCGCTACCAGTCAGGTACTTGCTGCTCTCACTTGTATGAAGCCCTGATCTATTATGCTATCTACTAAATACAACACTGAGTACTCACCATTTCTATTTTGAACCTCTGATTTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGCCTGGGACGACCCCTGCATTACCATCAATGTCTGGGCCTATGGCTTCCAATCAACCTTAGCCACGCCTGCAATGACTCTCCCAGAATATCCAAAATAACACTGCTCCTTCTAATGTATTTACCAACTCTGTGAGGCCAATAATGTAAATATACTTTATTCGCTATTAAGGTAAACACTCTACTATTATTAGAACGTTTCAGTGCTTGGAATGAGATATTTATATGATAATTGAAACTCTTCTGCCTCTTTTAAGTGTTCATGAGATTGGTTTGAATGAGATATCATATCATAATTAAAATTGTTCTGAATCTTGAGTGTTCATGAGACTGGTTTGAATTGAAACTGTTCTGAATCTTGAGTGTTCACGAGATTGGTTTGAATGAGATATTATATGATAATTGAAACTGTTTTGAATCTTGAGTGTTCATATTAGATGATGATTGAAACTGTTCTGAATCTTGAATGTTCATGAGATTGGTTTGAATGAGATATTACATGATAATTGAAGCGGTTCTGAATCTTGAATGTTCATGAGATTGTTTTGAATGAGATATGATATGATAATTGAAATTGTTTTGAATCTTGAATGTTTATGAGATTGTTTTGAATGAGATGTTATATGATAATTGAAACGGATATGAATCTTGAATGTTCATGATATTGTTGTGATAGTTTTTCATGTTGGTATCACTAAATAAAAATGTTTCGGAATCTCACGCATATTTGTGACTTCATTAAACGCTATGTTTACAAAGGTTATGTGTAACTGCAGCAGAAAAAAGAAAAAAAAAGAATGATGAATGATGNCTTTTGTTGGTCCCTTTTTCCTTCTCGAACACCTTGAAATTCTTTCTCGGAGAAATTAAAATTTCGAGAAATTCCTTAAAATTTGAAAAAATGTGCTTACTTTTTCCGTGCCTAAAATTACTTCTTATGAGGATGAAAATAACTCATTTAAGACTCTTGTTCTCGTTTGAATGTCTTAATTTCTTTTCTTGATGTTTTGGGTCGATCCTTATTTTGGATATTACACCAAACCTCAAAGCCCAATCCAACAAGCTTACAGACTCGATTCCAATGTGATTACGACTGGACTATGTCAGTTGTCCATCCCAACATTTGTCGACTGATGCGAAGATGTCAAGAGGTAATTTCTTAGAAATTCATAAGAAAATATTTTATGAACATTGAGGAGAAACCGAGGTGTGTCGATTCTTCACTCTCTCCTTTTGCTTTGTTATTGGGGTTTTGACGCAATTTCTTCTCGTAGGTTATTTCGCAACTTTTGAATGGATTCAAATCATTTAGGGCCTGCTCAAGGTGGAGAACCCGAAGTCGATGCCGGGGTTTGGAGGTCTCAGTTGCCGCCGGATTCTCGGCATCGAATTGTCAACAGCATGTATGATTTCTTGCTTCTTTGAGAAGTTTGTCTGAGATTTTGTAACTTTTTGTGTTCACAGTTCGCTGCTTAGATTAGTTACTATGTGATGGTTTTCTGAATTAAGCTATGAACGGCAAAACCTTTTTCTACCTTTTCTTAAATTTGTGTTTTAACTGATGGATGCGAACTGATTTTTCCAAAACCTTATCAAAGTAAAAACTACGAGAGATTATGCTTAAAGTGGAAAATATCATGCAATTGCATAAAGTCGTGATTCCTAACTTGGTATCAGAGTCATGCTATATTAAGATATAGCAATGTTGATAGAATCTCAAATGTCGAACAAAGAAGTTGTGAGCTTTGAAGGTGTAGTCAAAAGTGACTCAAGTGTTCTCTGTTCAAGGGATCCAGAGAAAGAAGTCGAGGGTGTACTTTGTTCGAGGGCTTCAGAGAAAGGAGTCGAGCCTCGTTTAAGGGGAGATTGTTCTTGAGGGCTACATAGTTCTCAGGAGAGTCTCTATAGTGTAGTTACTTTGTTCGAGGGCTCCAGAGAAAGTAGTCGAGCTTCGTTTAAGGGGAAATTGTTCTCGAGGACTACATAGTTCTCAGGAGAAGTTCTATAGTGTAGTTTGTTCGAGGGAAGGATTGTTGAAGATTGGTGAGAGAGTGGTGATTTAAAATTATCCCGTTCACTTTTCCATTTCTGGTTTGAAGATGTGGTAGGTTAGATTTTGTGGATGCGGTTCCTATTTGTTTCTTCAAGTATTACCCCCTTCATTGAACATAAGACGTATATCTTATTTGAAGCAGTATGGATCCTATTGAATTTTCCTCAACTGTTCTAAAGTCCATTCGTTCCGAATGCCTAATTGTGAGATCCCATAGTAGTTGGAGAGGGGAACGAAACATTCGTCGTAATGGTGTGAAAACCTCTCCCTAGTAGACGCGTTTTAAAACCGTGAGACTGACAATGATACTTAACGGACCAAAGCGGACAATATATTTTAGTGGTGGGCTTAGGCTGTTACAAATGGTACTAGAGTCACCTAGCGAGGACGCTAGACCCTCTAGGGGTGGAATGTGAGATCCCACATCGGTTGGAGAGCGGAACCCTAGACCCTCAAGAGGTGATTGCGAGATCCCATATCGGTTGGAGAGGGGAACCCTAGACCCTCAAGGGGTGAATTGTGAGATCCCACATCAGTTGGAGAGGGGAACCTTAGACCCTCAAGGCATGGATTGTGAGATCCCACATTGGTTGGAGAGGGAACGAAACATTCCTCGTAAGGGTGTGGAAACCTCCATAGTAAACGCGTTTTAAAACGGCGATACATAACAGGTCAAAACAAATAATATCTGCTAGCGGTAAGCTTAAGCTGTTACACTAATATTTTGAATCATGGACGTGGGTTACAGCCATAGTGTTTCGCTCTCGAACAGCTTCCCATGCTTTTGCGTTTAATAAACTTGTCAGCCATCATCTTTTCCATTTTTATGTAGCATTGAGACATTGAAGAGGCACATTCCTGTTTCTGGTCCTGAGGGATTGAATGAGTTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTTACAATGCCGCTACCAGTCAGGTACTTGCTGCTCTCACTTGTATAAAGCCCTGATTTATTATGCTATCTACTAAATACAACGCTGAGTACTCACCATTTCCATTTTGAACCTCTGATTTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGTCTGGGACGTTCACTGCATTACCATCAGTGTCGGGCCCTGTGTCTTCCAATCAACCTTAGTCAGAGAACATCCACAATAACAATGTTTCTTCTAATGTATTTACCAACTCTCAGAGGCCAATATACTTTATTAGTATATTGGCTAATAAGGTAAATGACTCTACTATTAGAACGTCTAGAACGTCTTAGTATGTCTTAGTGTTGGAATGAGATATTATATGATAGTCTTAGTACGTCTTAGTGTTGGAATAGAACTGTTCTGAATCTTGAATGTTTATTATGTTAGTCTTAGTACGTCTTAGTGTTGGAATGAGATATTATATGATAATTAGAACTGTTCTAAATCTTGAATGTTTATTGAGAACGGTTCTGAATCTTGAATGTTCGTTCAGAACGGTTATGAATCTTGAATGTTCATTGAGAACAGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCCGAATCTTGGTTCTGAATCTTGAATGTTCGTTGAGAACCGTTCTGAATCTTGAATGTTCGTTGAGAACGATTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCCGAATCTTGAATGTTCATTGAGATTGTTGTGATAGTTTTTCATGTTCGTATCACTTAATGAAAATGTTACTTAATGAAAATGTTACGGAATCTCATGCATATTTGTGACTTTATAAACGCTATGTTTCTAAGGAGCTGTATAACGG

mRNA sequence

GGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGAATGGAGACATTGAAGAGGCTCCTTCCTGTTTCTGGTCCTGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGTTTTTTCTCGTGGGTTGTTTTGGAACTTTTGAATGGATTCGAATAATTGGAGGCCTGCTCAAGGTGGAGAACCCAGAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATAATTGAGACATTGAAGAGGCACCTTCCTGTTTCGGGTCCTGAGGGGTTGAGTGAGGTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTCACACTGTCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGCCTGGGACGACCCCTGCATTACCATCAATTTGTCCATCCCAACATTTGTCGACTGATGCGAAGATGTCAAGAGGGCCTGCTCAAGGTGGAGAACCCGAAGTCGATGCCGGGGTTTGGAGGTCTCAGTTGCCGCCGGATTCTCGGCATCGAATTGTCAACAGCATCATTGAGACATTGAAGAGGCACATTCCTGTTTCTGGTCCTGAGGGATTGAATGAGTTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTTACAATGCCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGTCTGGGACGTTCACTGCATTACCATCAGTGTCGGGCCCTGTGTCTTCCAATCAACCTTAGTCAGAGAACATCCACAATAACAATGTTTCTTCTAATGTATTTACCAACTCTCAGAGGCCAATATACTTTATTAGTATATTGGCTAATAAGGTAAATGACTCTACTATTAGAACGTCTAGAACGTCTTAGTATGTCTTAGTGTTGGAATGAGATATTATATGATAGTCTTAGTACGTCTTAGTGTTGGAATAGAACTGTTCTGAATCTTGAATGTTTATTATGTTAGTCTTAGTACGTCTTAGTGTTGGAATGAGATATTATATGATAATTAGAACTGTTCTAAATCTTGAATGTTTATTGAGAACGGTTCTGAATCTTGAATGTTCGTTCAGAACGGTTATGAATCTTGAATGTTCATTGAGAACAGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCCGAATCTTGGTTCTGAATCTTGAATGTTCGTTGAGAACCGTTCTGAATCTTGAATGTTCGTTGAGAACGATTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCCGAATCTTGAATGTTCATTGAGATTGTTGTGATAGTTTTTCATGTTCGTATCACTTAATGAAAATGTTACTTAATGAAAATGTTACGGAATCTCATGCATATTTGTGACTTTATAAACGCTATGTTTCTAAGGAGCTGTATAACGG

Coding sequence (CDS)

GGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGAATGGAGACATTGAAGAGGCTCCTTCCTGTTTCTGGTCCTGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGTTTTTTCTCGTGGGTTGTTTTGGAACTTTTGAATGGATTCGAATAATTGGAGGCCTGCTCAAGGTGGAGAACCCAGAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATAATTGAGACATTGAAGAGGCACCTTCCTGTTTCGGGTCCTGAGGGGTTGAGTGAGGTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTCACACTGTCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGCCTGGGACGACCCCTGCATTACCATCAATTTGTCCATCCCAACATTTGTCGACTGATGCGAAGATGTCAAGAGGGCCTGCTCAAGGTGGAGAACCCGAAGTCGATGCCGGGGTTTGGAGGTCTCAGTTGCCGCCGGATTCTCGGCATCGAATTGTCAACAGCATCATTGAGACATTGAAGAGGCACATTCCTGTTTCTGGTCCTGAGGGATTGAATGAGTTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTTACAATGCCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGTCTGGGACGTTCACTGCATTACCATCAGTGTCGGGCCCTGTGTCTTCCAATCAACCTTAG

Protein sequence

GESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEMRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLGIGGLNCSRILGIKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSDYLRKISLRMLTMETKSGTFTALPSVSGPVSSNQP
Homology
BLAST of Cp4.1LG04g06860 vs. ExPASy Swiss-Prot
Match: F4I171 (Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana OX=3702 GN=MED15A PE=1 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 5.8e-29
Identity = 61/99 (61.62%), Postives = 80/99 (80.81%), Query Frame = 0

Query: 237 GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAA 296
           GEP +D G WR+QLPPDSR +IVN I+ETLK+H+P SGPEG+NELR+IA RFEEK+++ A
Sbjct: 13  GEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRIAARFEEKIFSGA 72

Query: 297 TSQSDYLRKISLRMLTMETKS----GTFTALPSVSGPVS 332
            +Q+DYLRKIS++MLTMETKS    G+  A+P+ +   S
Sbjct: 73  LNQTDYLRKISMKMLTMETKSQNAAGSSAAIPAANNGTS 111

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: XP_023531819.1 (mediator of RNA polymerase II transcription subunit 15a-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023531820.1 mediator of RNA polymerase II transcription subunit 15a-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023531821.1 mediator of RNA polymerase II transcription subunit 15a-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 277 bits (709), Expect = 2.40e-89
Identity = 144/223 (64.57%), Postives = 175/223 (78.48%), Query Frame = 0

Query: 128 GESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVA 187
           GESEVDAGDWRSQLQP+SRH+IVD+I+ETLKR LPVSGPEGLSE+RKIAVRFEEK++T A
Sbjct: 12  GESEVDAGDWRSQLQPDSRHQIVDRIMETLKRLLPVSGPEGLSEMRKIAVRFEEKIYTAA 71

Query: 188 TSQSDYLRKISLRMLTMETKPGTTPALPS---------------ICPSQHLSTDAKMSRG 247
           TS+S+Y RKI+++MLTME    TT ALP                  P  +    + ++  
Sbjct: 72  TSESEYRRKITVKMLTMEPNSETTTALPPKSAPTINCRIRSTRFTSPPPNSGFRSALNNE 131

Query: 248 PAQGGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKV 307
           PAQGGEP VDAG WRSQL PDSRH+IV+ I+ETLKR +PVSGPEGL+E+RKIAVRFEEK+
Sbjct: 132 PAQGGEPRVDAGDWRSQLQPDSRHQIVDRIMETLKRILPVSGPEGLSEMRKIAVRFEEKI 191

Query: 308 YNAATSQSDYLRKISLRMLTMETKSGTFTALPSVSGPVSSNQP 335
           Y AATS+S+Y RKI ++ML ME  S T TALPS+SGP++S+QP
Sbjct: 192 YTAATSESEYRRKIYVKMLLMEPNSETTTALPSMSGPMASDQP 234

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: XP_042968216.1 (mediator of RNA polymerase II transcription subunit 15a-like isoform X6 [Carya illinoinensis])

HSP 1 Score: 295 bits (756), Expect = 1.81e-86
Identity = 173/337 (51.34%), Postives = 217/337 (64.39%), Query Frame = 0

Query: 1   GESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSE 60
           GE  +D GDWR+ L PDSR +IV +I                ++ LKR LPVSG EGL E
Sbjct: 26  GEPTMDTGDWRTGLPPDSRQRIVSKI----------------LDALKRHLPVSGQEGLHE 85

Query: 61  MRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENP------ESMLGIGG-- 120
           +RKIAVRFEEKI+TAATS+    G +     +R+     K +NP       +  G G   
Sbjct: 86  LRKIAVRFEEKIFTAATSQ----GDYLRKISLRMPTMETKSQNPLANSLPSNSAGNGNRP 145

Query: 121 --LNCSRILGIKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLS 180
             L    + GI      GE  +DAGDWR++LQP++R RIV KI++TLKRHLP S  EGL 
Sbjct: 146 PDLGQGPVGGIGGGGGVGEPTMDAGDWRTELQPDARQRIVSKIMDTLKRHLPFSTQEGLQ 205

Query: 181 EVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAK-- 240
           E+R++A+ FEEK  T ATSQ DYLRKISL+ML METK    PA  +  PS       +  
Sbjct: 206 ELREMAISFEEKTFTAATSQGDYLRKISLKMLPMETKSQNPPA--NSLPSNSAGNGNRPP 265

Query: 241 -MSRGPAQG-------GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNE 300
            + +GP  G       GEP +DAG WR++L P++R RIV+ I++TLKRH+PVSG EGL E
Sbjct: 266 DLGQGPVGGIGGGGGVGEPTMDAGDWRTELQPEARQRIVSKIMDTLKRHLPVSGQEGLQE 325

Query: 301 LRKIAVRFEEKVYNAATSQSDYLRKISLRMLTMETKS 317
           LRKIA+RFEEK++ AATSQ DYLRKISL+MLTMETKS
Sbjct: 326 LRKIAIRFEEKIFTAATSQGDYLRKISLKMLTMETKS 340

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: XP_042968221.1 (mediator of RNA polymerase II transcription subunit 15a-like isoform X11 [Carya illinoinensis])

HSP 1 Score: 294 bits (753), Expect = 4.56e-86
Identity = 170/337 (50.45%), Postives = 218/337 (64.69%), Query Frame = 0

Query: 1   GESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSE 60
           GE  +DAGDWR++LQPD+R +IV +I                METLKR LPVS  EGL E
Sbjct: 21  GEPTMDAGDWRTELQPDARQRIVSKI----------------METLKRNLPVSSQEGLQE 80

Query: 61  MRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENP------ESMLGIGG-- 120
           +RK+A+RFEEK +TAATS+    G +     ++++    K +NP       +  G G   
Sbjct: 81  LRKMAIRFEEKTFTAATSQ----GDYLRKISLKLLPMETKSQNPPANSLPSNSAGNGNRP 140

Query: 121 --LNCSRILGIKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLS 180
             L    +  I      GE  +DA DWR++LQP++R RIV KI++TLKRHLP S  EGL 
Sbjct: 141 PDLGQGPVGAIGGGGGVGEPTMDARDWRTELQPDARQRIVSKIMDTLKRHLPFSTQEGLQ 200

Query: 181 EVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAK-- 240
           E+R++A+ FEEK  T ATSQ DYLRKISL+ML METK    PA  +  PS       +  
Sbjct: 201 ELREMAISFEEKTFTAATSQGDYLRKISLKMLPMETKSQNPPA--NSLPSNSAGNGNRPP 260

Query: 241 -MSRGPAQG-------GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNE 300
            + +GP  G       GEP +DAG WR++L P++R RIV+ I++TLKRH+PVSG EGL E
Sbjct: 261 DLGQGPVGGIGGGGGVGEPTMDAGDWRTELQPEARQRIVSKIMDTLKRHLPVSGQEGLQE 320

Query: 301 LRKIAVRFEEKVYNAATSQSDYLRKISLRMLTMETKS 317
           LRKIA+RFEEK++ AATSQ DYLRKISL+MLTMETKS
Sbjct: 321 LRKIAIRFEEKIFTAATSQGDYLRKISLKMLTMETKS 335

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: XP_042968220.1 (mediator of RNA polymerase II transcription subunit 15a-like isoform X10 [Carya illinoinensis])

HSP 1 Score: 294 bits (753), Expect = 4.58e-86
Identity = 170/337 (50.45%), Postives = 218/337 (64.69%), Query Frame = 0

Query: 1   GESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSE 60
           GE  +DAGDWR++LQPD+R +IV +I                METLKR LPVS  EGL E
Sbjct: 25  GEPTMDAGDWRTELQPDARQRIVSKI----------------METLKRNLPVSSQEGLQE 84

Query: 61  MRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENP------ESMLGIGG-- 120
           +RK+A+RFEEK +TAATS+    G +     ++++    K +NP       +  G G   
Sbjct: 85  LRKMAIRFEEKTFTAATSQ----GDYLRKISLKLLPMETKSQNPPANSLPSNSAGNGNRP 144

Query: 121 --LNCSRILGIKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLS 180
             L    +  I      GE  +DA DWR++LQP++R RIV KI++TLKRHLP S  EGL 
Sbjct: 145 PDLGQGPVGAIGGGGGVGEPTMDARDWRTELQPDARQRIVSKIMDTLKRHLPFSTQEGLQ 204

Query: 181 EVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAK-- 240
           E+R++A+ FEEK  T ATSQ DYLRKISL+ML METK    PA  +  PS       +  
Sbjct: 205 ELREMAISFEEKTFTAATSQGDYLRKISLKMLPMETKSQNPPA--NSLPSNSAGNGNRPP 264

Query: 241 -MSRGPAQG-------GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNE 300
            + +GP  G       GEP +DAG WR++L P++R RIV+ I++TLKRH+PVSG EGL E
Sbjct: 265 DLGQGPVGGIGGGGGVGEPTMDAGDWRTELQPEARQRIVSKIMDTLKRHLPVSGQEGLQE 324

Query: 301 LRKIAVRFEEKVYNAATSQSDYLRKISLRMLTMETKS 317
           LRKIA+RFEEK++ AATSQ DYLRKISL+MLTMETKS
Sbjct: 325 LRKIAIRFEEKIFTAATSQGDYLRKISLKMLTMETKS 339

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: XP_042968234.1 (mediator of RNA polymerase II transcription subunit 15a-like isoform X20 [Carya illinoinensis])

HSP 1 Score: 294 bits (752), Expect = 4.65e-86
Identity = 174/337 (51.63%), Postives = 216/337 (64.09%), Query Frame = 0

Query: 1   GESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSE 60
           GE  +D GDWR+ L PDSR +IV +I                ++ LKR LPVSG EGL E
Sbjct: 26  GEPTMDTGDWRTGLPPDSRQRIVSKI----------------LDALKRHLPVSGQEGLHE 85

Query: 61  MRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENP------ESMLGIGG-- 120
           +RKIAVRFEEKI+TAATS+    G +     +R+     K +NP       +  G G   
Sbjct: 86  LRKIAVRFEEKIFTAATSQ----GDYLRKISLRMPTMETKSQNPLANSLPSNSAGNGNRP 145

Query: 121 --LNCSRILGIKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLS 180
             L    + GI      GE  +DAGDWR++LQP++R RIV KI+ETLKR+LPVS  EGL 
Sbjct: 146 PDLGQGPVGGIGGGGGVGEPTMDAGDWRTELQPDARQRIVSKIMETLKRNLPVSSQEGLQ 205

Query: 181 EVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAK-- 240
           E+RK+A+RFEEK  T ATSQ DYLRKISL++L METK    PA  +  PS       +  
Sbjct: 206 ELRKMAIRFEEKTFTAATSQGDYLRKISLKLLPMETKSQNPPA--NSLPSNSAGNGNRPP 265

Query: 241 -MSRGPAQG-------GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNE 300
            + +GP          GEP +DA  WR++L PD+R RIV+ I +TLKRH+PVSG EGL E
Sbjct: 266 DLGQGPVGAIGGGGGVGEPTMDARDWRTELQPDARQRIVSKIRDTLKRHLPVSGQEGLQE 325

Query: 301 LRKIAVRFEEKVYNAATSQSDYLRKISLRMLTMETKS 317
           LRKIA+RFEEK++ AATSQ DYLRKISL+MLTMETKS
Sbjct: 326 LRKIAIRFEEKIFTAATSQGDYLRKISLKMLTMETKS 340

BLAST of Cp4.1LG04g06860 vs. ExPASy TrEMBL
Match: A0A6J1JEE1 (mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486139 PE=4 SV=1)

HSP 1 Score: 267 bits (682), Expect = 1.29e-85
Identity = 141/223 (63.23%), Postives = 170/223 (76.23%), Query Frame = 0

Query: 128 GESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVA 187
           GESEVDAGDWRSQLQP+SRH+IV++I+ETLKR LPVS PEGL   RKIAVRFEEK++T A
Sbjct: 12  GESEVDAGDWRSQLQPDSRHQIVNRIMETLKRLLPVSDPEGL---RKIAVRFEEKIYTAA 71

Query: 188 TSQSDYLRKISLRMLTMETKPGTTPALPSIC---------------PSQHLSTDAKMSRG 247
           TS+SDYLRKI+++MLTME     T ALP +                P  +    + ++  
Sbjct: 72  TSESDYLRKITVKMLTMEPNSKKTTALPPMSAPTINCRIRSTRFTSPPPNSGFHSALNHE 131

Query: 248 PAQGGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKV 307
           PAQGGEP VDAG WRSQL PDSRH+ +N I+ETLKR +PVSGPEGL+ELRKIA RFEEK+
Sbjct: 132 PAQGGEPRVDAGDWRSQLQPDSRHQSINKIMETLKRILPVSGPEGLSELRKIAARFEEKI 191

Query: 308 YNAATSQSDYLRKISLRMLTMETKSGTFTALPSVSGPVSSNQP 335
           Y AATS+S+YLRKI ++ML ME    T TALPS+SGP++SNQP
Sbjct: 192 YTAATSESEYLRKIYVKMLIMEPNFETTTALPSMSGPMTSNQP 231

BLAST of Cp4.1LG04g06860 vs. ExPASy TrEMBL
Match: A0A498K991 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_038220 PE=4 SV=1)

HSP 1 Score: 268 bits (686), Expect = 1.18e-83
Identity = 156/325 (48.00%), Postives = 199/325 (61.23%), Query Frame = 0

Query: 1   GESEVDAG-DWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLS 60
           GE  ++AG DW SQLQ +SRH+IV +I                +ET+ R +P  GPEGL 
Sbjct: 129 GEPPMEAGVDWMSQLQSESRHRIVAKI----------------IETMVRHIPFDGPEGLR 188

Query: 61  EMRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLGIGGLNCSRIL 120
           E+ +IAV FEE+IY +A+S+          +++R I   +     +S + +  ++   +L
Sbjct: 189 ELERIAVTFEEEIYVSASSQ---------TDYLRKISIKMFTIETKSQIAVSHISLDPVL 248

Query: 121 ---GIKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIA 180
                +   E GE  ++ GDWRSQLQ +SR RIV KI+ETLKRHLP SG EGL E+ KIA
Sbjct: 249 MDGNYQRPLEGGEPSMETGDWRSQLQLDSRRRIVAKIMETLKRHLPFSGEEGLRELEKIA 308

Query: 181 VRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQG 240
            RFEEK++  A+SQSDYL+KIS++MLTME KP                          QG
Sbjct: 309 ARFEEKIYVAASSQSDYLQKISMKMLTMENKP--------------------------QG 368

Query: 241 GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAA 300
           GE  +    WRSQL PDSRHRI+  I E LKRH+P SG EGL+EL +IAVRFEEK+Y  A
Sbjct: 369 GETSMVTSNWRSQLQPDSRHRIIAKITEVLKRHLPFSGEEGLSELERIAVRFEEKIYTVA 402

Query: 301 TSQSDYLRKISLRML--TMETKSGT 319
            SQSDYLRKISL+ML  TME KS T
Sbjct: 429 VSQSDYLRKISLKMLMLTMENKSQT 402

BLAST of Cp4.1LG04g06860 vs. ExPASy TrEMBL
Match: A0A1U7WSK9 (uncharacterized protein LOC104227178 isoform X10 OS=Nicotiana sylvestris OX=4096 GN=LOC104227178 PE=4 SV=1)

HSP 1 Score: 265 bits (677), Expect = 1.37e-83
Identity = 158/316 (50.00%), Postives = 197/316 (62.34%), Query Frame = 0

Query: 5   VDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEMRKI 64
           +D+ DWR+QL PDSR +IV+ I                 ETLKR L V+  EG+ E++KI
Sbjct: 1   MDSADWRTQLLPDSRQRIVNNI----------------TETLKRQLSVTREEGVQELKKI 60

Query: 65  AVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLG---IGGLNCSRILGI 124
           AV FEEKIYTAATS+   +        ++I+    K  NP + L      G N       
Sbjct: 61  AVGFEEKIYTAATSQPDYLQKIS----LKILTMETKSHNPMTNLSNAASSGQNAHDPGTA 120

Query: 125 KLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEE 184
           +     G   +DA DWR+QL P+ R  IV+KI ETL RHLPV+G EG+ E++KIA+RFEE
Sbjct: 121 RAGAAAGA--MDAADWRTQLLPDFRQSIVNKITETLMRHLPVTGEEGVQELKKIALRFEE 180

Query: 185 KVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEV 244
           K++T A SQ DYLRKISL+MLTMET     P   S   +            PA     ++
Sbjct: 181 KIYTAAISQPDYLRKISLKMLTMETD-SQNPMTNSANAASSGQNAHDPGTAPAGAAAGDM 240

Query: 245 DAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSD 304
           DA  WR+QL PDSR RIVN I ETLKRH+PV+G EG+ EL+KIA+RFEEK+Y AA SQ D
Sbjct: 241 DAVDWRTQLLPDSRQRIVNKITETLKRHLPVTGEEGVQELKKIALRFEEKIYTAAISQPD 293

Query: 305 YLRKISLRMLTMETKS 317
           YLRKISL+MLTMETKS
Sbjct: 301 YLRKISLKMLTMETKS 293

BLAST of Cp4.1LG04g06860 vs. ExPASy TrEMBL
Match: A0A1U7WG84 (uncharacterized protein LOC104227178 isoform X9 OS=Nicotiana sylvestris OX=4096 GN=LOC104227178 PE=4 SV=1)

HSP 1 Score: 265 bits (677), Expect = 1.88e-83
Identity = 158/316 (50.00%), Postives = 197/316 (62.34%), Query Frame = 0

Query: 5   VDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEMRKI 64
           +D+ DWR+QL PDSR +IV+ I                 ETLKR L V+  EG+ E++KI
Sbjct: 1   MDSADWRTQLLPDSRQRIVNNI----------------TETLKRQLSVTREEGVQELKKI 60

Query: 65  AVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLG---IGGLNCSRILGI 124
           AV FEEKIYTAATS+   +        ++I+    K  NP + L      G N       
Sbjct: 61  AVGFEEKIYTAATSQPDYLQKIS----LKILTMETKSHNPMTNLSNAASSGQNAHDPGTA 120

Query: 125 KLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEE 184
           +     G   +DA DWR+QL P+ R  IV+KI ETL RHLPV+G EG+ E++KIA+RFEE
Sbjct: 121 RAGAAAGA--MDAADWRTQLLPDFRQSIVNKITETLMRHLPVTGEEGVQELKKIALRFEE 180

Query: 185 KVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEV 244
           K++T A SQ DYLRKISL+MLTMET     P   S   +            PA     ++
Sbjct: 181 KIYTAAISQPDYLRKISLKMLTMETD-SQNPMTNSANAASSGQNAHDPGTAPAGAAAGDM 240

Query: 245 DAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSD 304
           DA  WR+QL PDSR RIVN I ETLKRH+PV+G EG+ EL+KIA+RFEEK+Y AA SQ D
Sbjct: 241 DAVDWRTQLLPDSRQRIVNKITETLKRHLPVTGEEGVQELKKIALRFEEKIYTAAISQPD 293

Query: 305 YLRKISLRMLTMETKS 317
           YLRKISL+MLTMETKS
Sbjct: 301 YLRKISLKMLTMETKS 293

BLAST of Cp4.1LG04g06860 vs. ExPASy TrEMBL
Match: A0A5N5FPT5 (Uncharacterized protein OS=Pyrus ussuriensis x Pyrus communis OX=2448454 GN=D8674_038952 PE=4 SV=1)

HSP 1 Score: 265 bits (678), Expect = 9.88e-83
Identity = 170/392 (43.37%), Postives = 214/392 (54.59%), Query Frame = 0

Query: 1   GESEVDAGD-WRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLS 60
           GE  ++AGD WR+QLQ +SRH+IV +I                +ET+KR +P  GPEGL 
Sbjct: 13  GEPPMEAGDDWRTQLQSESRHRIVAKI----------------IETMKRHVPFDGPEGLR 72

Query: 61  EMRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLGIGGLNCSRIL 120
           E+ +IAV FEE +Y  A+S+          ++IR I   +     +S   +   +    L
Sbjct: 73  EIERIAVTFEENMYVGASSQ---------SDYIRKISLKMLTMETKSQTAVSHASLDPFL 132

Query: 121 -GIKLST---ECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKI 180
               + T   E GE  ++ GDWRSQLQ +SRHRIV KI+ETLKRHLP +G EGL E+ KI
Sbjct: 133 MDTNIQTRPPEGGEPSMETGDWRSQLQLDSRHRIVAKIMETLKRHLPFNGEEGLRELEKI 192

Query: 181 AVRFEEKVHTVATSQSDYLRKISLRMLTMETKP--GTTPALPSICPSQHLSTDAKM---- 240
           A RFEEK++  A+SQSDYLRKIS++ML ME KP  G T  + S   SQ L  D++     
Sbjct: 193 AARFEEKIYVAASSQSDYLRKISMKMLAMENKPQGGETSMVTSNWRSQ-LQPDSRHRIIA 252

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 253 KITEVLRRHLPFTGDEGLHELERIAVRFEEKIYTVALSQIWVFQFNFVSKSAGVCGWSVL 312

Query: 301 ----SRGPAQGGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIA 317
               ++ PAQGGEP ++AG WRSQL  DSR RIV+ I ETLKRH+P  G EGL EL KIA
Sbjct: 313 MDLNNQRPAQGGEPSMEAGDWRSQLQQDSRRRIVHKITETLKRHLPFEGEEGLRELEKIA 372

BLAST of Cp4.1LG04g06860 vs. TAIR 10
Match: AT1G15790.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 170 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 170; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 160.2 bits (404), Expect = 2.8e-39
Identity = 84/179 (46.93%), Postives = 118/179 (65.92%), Query Frame = 0

Query: 135 GDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYL 194
           GDWR+Q    SR RIV+KI+ET  + LP   PEG +E+RKIAVRFEEK+   A++Q++YL
Sbjct: 5   GDWRTQFPSASRSRIVNKIMETQLKQLPFIRPEGTNELRKIAVRFEEKLFNNASNQTEYL 64

Query: 195 RKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEVDAGVWRSQLPPDS 254
           R+I ++ML METK            S   +T   +        EP V+ G WR+Q P DS
Sbjct: 65  RQICMKMLNMETKSQNAAG----SSSADDNTPPLVPEPSVPNNEPAVNTGDWRTQQPQDS 124

Query: 255 RHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSDYLRKISLRMLTM 314
           R + +N++++TLK+ +P SG EG++EL +IAV  EE ++N+A +Q DYL KISL+M TM
Sbjct: 125 RQKNINALLDTLKKIVPHSGKEGIDELMRIAVSLEELIFNSAINQEDYLGKISLKMRTM 179

BLAST of Cp4.1LG04g06860 vs. TAIR 10
Match: AT1G15790.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 160.2 bits (404), Expect = 2.8e-39
Identity = 84/179 (46.93%), Postives = 118/179 (65.92%), Query Frame = 0

Query: 135 GDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYL 194
           GDWR+Q    SR RIV+KI+ET  + LP   PEG +E+RKIAVRFEEK+   A++Q++YL
Sbjct: 5   GDWRTQFPSASRSRIVNKIMETQLKQLPFIRPEGTNELRKIAVRFEEKLFNNASNQTEYL 64

Query: 195 RKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEVDAGVWRSQLPPDS 254
           R+I ++ML METK            S   +T   +        EP V+ G WR+Q P DS
Sbjct: 65  RQICMKMLNMETKSQNAAG----SSSADDNTPPLVPEPSVPNNEPAVNTGDWRTQQPQDS 124

Query: 255 RHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSDYLRKISLRMLTM 314
           R + +N++++TLK+ +P SG EG++EL +IAV  EE ++N+A +Q DYL KISL+M TM
Sbjct: 125 RQKNINALLDTLKKIVPHSGKEGIDELMRIAVSLEELIFNSAINQEDYLGKISLKMRTM 179

BLAST of Cp4.1LG04g06860 vs. TAIR 10
Match: AT1G15780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G10440.1); Has 103701 Blast hits to 43153 proteins in 1828 species: Archae - 30; Bacteria - 7385; Metazoa - 38639; Fungi - 11531; Plants - 7727; Viruses - 307; Other Eukaryotes - 38082 (source: NCBI BLink). )

HSP 1 Score: 129.8 bits (325), Expect = 4.1e-30
Identity = 61/99 (61.62%), Postives = 80/99 (80.81%), Query Frame = 0

Query: 237 GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAA 296
           GEP +D G WR+QLPPDSR +IVN I+ETLK+H+P SGPEG+NELR+IA RFEEK+++ A
Sbjct: 13  GEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRIAARFEEKIFSGA 72

Query: 297 TSQSDYLRKISLRMLTMETKS----GTFTALPSVSGPVS 332
            +Q+DYLRKIS++MLTMETKS    G+  A+P+ +   S
Sbjct: 73  LNQTDYLRKISMKMLTMETKSQNAAGSSAAIPAANNGTS 111

BLAST of Cp4.1LG04g06860 vs. TAIR 10
Match: AT2G10440.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 1628 Blast hits to 1350 proteins in 149 species: Archae - 0; Bacteria - 39; Metazoa - 480; Fungi - 159; Plants - 187; Viruses - 2; Other Eukaryotes - 761 (source: NCBI BLink). )

HSP 1 Score: 50.8 bits (120), Expect = 2.4e-06
Identity = 31/106 (29.25%), Postives = 53/106 (50.00%), Query Frame = 0

Query: 136 DWRSQLQPESRHRIVDKIIETLKRHLPVS------GPEGLSEVRKIAVRFEEKVHTVATS 195
           DWRSQ +PE R +++ KI+ +L  +  V             ++  IA +FEE  +++AT 
Sbjct: 24  DWRSQHEPELRQKVLSKIVCSLNDYRKVEKFKEKFHAHEEYKINDIASKFEENFYSIATD 83

Query: 196 QSDYLRKIS----------LRMLTMETKPGTTPALPSICPSQHLST 226
           ++DYLRK+S           R+L  +       +LP++ P    +T
Sbjct: 84  KNDYLRKLSETLHYIQRTYTRVLASQVVVNQAQSLPALLPYMQTTT 129

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
F4I1715.8e-2961.62Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana ... [more]
Match NameE-valueIdentityDescription
XP_023531819.12.40e-8964.57mediator of RNA polymerase II transcription subunit 15a-like isoform X1 [Cucurbi... [more]
XP_042968216.11.81e-8651.34mediator of RNA polymerase II transcription subunit 15a-like isoform X6 [Carya i... [more]
XP_042968221.14.56e-8650.45mediator of RNA polymerase II transcription subunit 15a-like isoform X11 [Carya ... [more]
XP_042968220.14.58e-8650.45mediator of RNA polymerase II transcription subunit 15a-like isoform X10 [Carya ... [more]
XP_042968234.14.65e-8651.63mediator of RNA polymerase II transcription subunit 15a-like isoform X20 [Carya ... [more]
Match NameE-valueIdentityDescription
A0A6J1JEE11.29e-8563.23mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucur... [more]
A0A498K9911.18e-8348.00Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_038220 PE=4 SV=1[more]
A0A1U7WSK91.37e-8350.00uncharacterized protein LOC104227178 isoform X10 OS=Nicotiana sylvestris OX=4096... [more]
A0A1U7WG841.88e-8350.00uncharacterized protein LOC104227178 isoform X9 OS=Nicotiana sylvestris OX=4096 ... [more]
A0A5N5FPT59.88e-8343.37Uncharacterized protein OS=Pyrus ussuriensis x Pyrus communis OX=2448454 GN=D867... [more]
Match NameE-valueIdentityDescription
AT1G15790.12.8e-3946.93unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G15790.22.8e-3946.93unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G15780.14.1e-3061.62unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G10440.22.4e-0629.25unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036546Mediator complex subunit 15, KIX domainPFAMPF16987KIX_2coord: 244..317
e-value: 2.4E-34
score: 117.2
coord: 134..208
e-value: 4.7E-32
score: 109.8
coord: 7..78
e-value: 3.4E-16
score: 59.0
IPR036529Coactivator CBP, KIX domain superfamilyGENE3D1.10.246.20Coactivator CBP, KIX domaincoord: 7..81
e-value: 8.0E-6
score: 27.9
IPR036529Coactivator CBP, KIX domain superfamilyGENE3D1.10.246.20Coactivator CBP, KIX domaincoord: 136..211
e-value: 5.1E-19
score: 70.2
coord: 246..321
e-value: 5.9E-22
score: 79.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 223..248
NoneNo IPR availablePANTHERPTHR33137:SF27OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 15A, PUTATIVE-RELATEDcoord: 233..320
NoneNo IPR availablePANTHERPTHR33137:SF27OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 15A, PUTATIVE-RELATEDcoord: 129..211
coord: 3..79
IPR044661Mediator of RNA polymerase II transcription subunit 15a/b/c-likePANTHERPTHR33137MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 15A-RELATEDcoord: 233..320
IPR044661Mediator of RNA polymerase II transcription subunit 15a/b/c-likePANTHERPTHR33137MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 15A-RELATEDcoord: 129..211
IPR044661Mediator of RNA polymerase II transcription subunit 15a/b/c-likePANTHERPTHR33137MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 15A-RELATEDcoord: 3..79

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g06860.1Cp4.1LG04g06860.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
molecular_function GO:0031490 chromatin DNA binding
molecular_function GO:0003713 transcription coactivator activity
molecular_function GO:0003712 transcription coregulator activity