Cp4.1LG04g06860 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG04g06860
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionTranscription cofactor, putative
LocationCp4.1LG04 : 4599218 .. 4620775 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTATGATTTCTTGCTTCTTTCATGCATATTTGAGAAATTTGTTTGAGCTTTTGTAACTTCTTTTGTGTTCATAGTTCGCTGCTTAGATTAGTTACTATGGGATGGTTTTCTGAATTAAGCTATGAACAGCAAAATTTTGGAAGTATAGAATCATCTTCGACCTTATTCTACCATTTCTTAGATTTGTGTATAACTGATCGATGCGAACTGATTCCAAAACCTAGTTCTATAATATTATCTCCTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGGTATTAGCCCCTTCTTTGAACATAAGAAGTATAAGTTATTTGAAGTAGTATGGATCCTATTGAATTTTCCTCGACTGATCTAAAGTCCCTTTCTTCAAAATGCCTAATTGTGAGATCCCACATTGGTTGAAGAGGGTGGAACGAAACATTCCTCCTTAAGGTGTGGAAACCTCTCCCTAGTAAACGTGTTTTAAAACTGCGAGACTGACGATGATACGTAATAGGCCAAAGCAGGCAATATCTGTTAGAGGTGGGCTTGGGATGTTACAAATGGTATCAGAATCAAACACCGGGAGGTATGCCAGCGAGGACGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACAACATTCCTTATAAGGGTGTAGAAACGTCTCCCTAGAAGACACACTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCTAAGCGGACAATATCTACTAGCGGTGGGCTTGAGCTGTTACAAATGGTATCAGAGTCAGACACTGGGCGGTGTGCCAGCGAAGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCATGTTGGTTGGAGAGGGGAACGAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGATAGCGATAAGGGGCCAAAGCGAACAATATCTGTTAGTGGNATTTAGATTGTTTCGGAACTTCTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTATGATTTCTTGCTTCTTTCATGCATATTTGAGAAATTTGTTTGAGCTTTTGTAACTTCTTTTGTGTTCATAGTTCGCTGCTTAGATTAGTTACTATGGGATGGTTTTCTGAATTAAGCTATGAACAGCAAAATTTTGGAAGTATAGAATCATCTTCGACCTTATTCTACCATTTCTTAGATTTGTGTATAACTGATCGATGCGAACTGATTCCAAAACCTAGTTCTATAATATTATCTCCTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGGTATTAGCCCCTTCTTTGAACATAAGAAGTATAAGTTATTTGAAGTAGTATGGATCCTATTGAATTTTCCTCGACTGATCTAAAGTCCCTTTCTTCAAAATGCCTAATTGTGAGATCCCACATTGGTTGAAGAGGGTGGAACGAAACATTCCTCCTTAAGGTGTGGAAACCTCTCCCTAGTAAACGTGTTTTAAAACTGCGAGACTGACGATGATACGTAATAGGCCAAAGCAGGCAATATCTGTTAGAGGTGGGCTTGGGATGTTACAAATGGTATCAGAATCAAACACCGGGAGGTATGCCAGCGAGGACGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACAACATTCCTTATAAGGGTGTAGAAACGTCTCCCTAGAAGACACACTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCTAAGCGGACAATATCTACTAGCGGTGGGCTTGAGCTGTTACAAATGGTATCAGAGTCAGACACTGGGCGGTGTGCCAGCGAAGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCATGTTGGTTGGAGAGGGGAACGAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGATAGCGATAAGGGGCCAAAGCGAACAATATCTGTTAGTGGTGAGCTTGGATTGTTACAAATGGTATCAGAGTCAGACATCGGGCGGTGTGCCAGCGAGGACGTTAGGCCCTCAAGGGGGTGGAATGTTAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTAATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAAGNGCCAAAGCAGGCAATATCTGTTAGAGGTGGGCTTGGGATGTTACAAATGGTATCAGAATCAAACACCGGGAGGTATGCCAGCGAGGACGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACAACATTCCTTATAAGGGTGTAGAAACGTCTCCCTAGAAGACACACTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCTAAGCGGACAATATCTACTAGCGGTGGGCTTGAGCTGTTACAAATGGTATCAGAGTCAGACACTGGGCGGTGTGCCAGCGAAGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCATGTTGGTTGGAGAGGGGAACGAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGATAGCGATAAGGGGCCAAAGCGAACAATATCTGTTAGTGGTGAGCTTGGATTGTTACAAATGGTATCAGAGTCAGACATCGGGCGGTGTGCCAGCGAGGACGTTAGGCCCTCAAGGGGGTGGAATGTTAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTAATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAAGAGCGAAACATAACGGACCAAAGCGGACAATATCTGCTAGTGGTGGGCTTAGACTGTTACACAAATATTTTGAATCATGAACGTGGATTACTGGGGTAGTTTTTCCTTCCTGAACAGCTTCCCATGCTTTTGTGTTTAATGAACTTGTCAGCCATCATCTTCTCCATTTTTATGTAGAATGGAGACATTGAAGAGGCTCCTTCCTGTTTCTGGTCCTGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGGTACTTCCTGCTCACTTGTTATAGTTCTCAAGGATTGTAAGAACAAACATTATCTTATTTATGTTTAGCCCATTTGCTATAACAATAAGATTCCAATTTGATTTGGTTCCTAAGATTTAGTTCTGTACTGAAAGATACTTTTGTATAAGAACTTGTTCATAAAGCCCTGATTTATTATGCTATGTAGTGTTGCACTAAATAAATACCAATTATATTTTGAACCTCTGATTTCAGTCAGAGTACCGAAGGAAAATANGTTGGAGAGGGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGAAGACACGCTTTAAAAACGTGAGGCTTACGACAATACGTAACAGGCCAAAGCGGACAATATCTATTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTCAGTCACTGGCCGGTGTGCCAGTGAGGACTCTAGGCCCTCAAGGGGGGGGGGGGANAACTGTGAAGATGCTTACTATGGAACCCAACTCTGAGACGACCACTGCATTACCACCAAAGTCTGCGCCCACGATCAACTGCCGAATTCGTTCGACCCGATTCACATCTCCGCCGCCGAATTCTGGCTTTCGTTCAGCCCTAAACAACGAGTAAACTCCATAACAGCCTTACTTTCCATCCTTGAAGGACTCTATTCAATCTCTCTTTTTGCTTTGTTATTGGGATTTTGACGCAGTTTTTTCTCGTGGGTTGTTTTGGAACTTTTGAATGGATTCGAATAATTGGAGGCCTGCTCAAGGTGGAGAACCCAGAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTATGATTTCTTGCTTCTTTCATGCATATTTGAGAAATTTGTTTGAGCTTTTGTAACTTCTTTTGTGTTCATAGTTCGCTGCTTAGATTAGTTACTATGGGATGGTTTTCTGAATTAAGCTATGAACAGCAAAATTTTGGAAGTATAGAATCATCTTCGACCTTATTCTACCATTTCTTAGATTTGTGTATAACTGATCGATGCGAACTGATTCCAAAACCTAGTTCTATAATATTATCTCCTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGGTATTAGCCCCTTCTTTGAACATAAGAAGTATAAGTTATTTGAAGTAGTATGGATCCTATTGAATTTTCCTCGACTGATCTAAAGTCCATTCCTTCAAAATGCCTAATTGTGAGATCCCACATTGGTGGAAGAGGGAACAAAACATTTCTCATTAAGATGTAGAAACCTCTCCCTAGTAGACGTGTTTTAAAACTGCGAGACTGACGACGATATGTAACGGGCTAAAGCGGACAATATCTGTTAGCGGTGGGCTTGAGATGTTACAAATGGTATCAGAATCAAACACTGGGAGGTATGCCAGCGAGGGTGCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGAGGAACGAAACACTCCTATAAGGGTGTAGAAACCTCTCCCTAAAAGACACGGTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCATAGCGGACAATATCCCAGTGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTAGACACTTGGCGGTGTGCCAGCGAGGACTCTAGGCCCTCAATAGGGGTGGATTGTGAGATCCTACATTAGTTGGAGAGGGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGAAGACACGCTTTAAAAACGTGAGGCTTACGACAATACGTAACAGGCCAAAGCGGACAATATCTATTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTCAGTCACTGGCCGGTGTGCCAGTGAGGACTCTAGGCCCTCAAGGGGGGGGGATTGTGAGATCCCACATTGGTTGGAGAGTGGAACGAAACATTCCTTAGAAGGGTGTAGGAACCTCTACCTAGAAGACACGCTTTAAAATCGTGAGGCTGACAACAATATGTAACGGGCCAAAGCGGACAATATCTACTAGTGGTGAGCTTGGGCTGTTACAAACGGTATCAAAGTCAGACACTGGGCGGTGTGTCAGCGAGGGCTCTAGGCCCTCAAGGGGGGTGGATTGTGAAATCCTACATTGGTTGGAGGGGGGAACGAAACATTCCTTAGAAGGGTGTAGAAACCTCTCCCTAGAAGACATACTTTAAAATCGTGAGGCTGATGACAATACATAACGGGCCAAAGCGGACAATATCTACTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTCAGACACTGGCCGGTGTGCTAGCGAGGACTCTAGGCCTTCAAGGTGGGTGAATTGTGAGGTCCCACATTGGTTGGAGAGGGGAACAAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGACAGCGATAGGAGGCCAAAGCGGACAATATATATTAGTGGTGAACTTGGACTATTACAAATGGTATTAGAGTCAGACATCTGGCGGTGTGCCAGCGAGGACGCTAGGCCCTCAAGGGGGTGGAATGTGAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTCATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAACNATAAGGGTGTAGAAACCTCTCCCTAAAAGACACGGTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCATAGCGGACAATATCCCAGTGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTAGACACTTGGCGGTGTGCCAGCGAGGACTCTAGGCCCTCAATAGGGGTGGATTGTGAGATCCTACATTAGTTGGAGAGGGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGAAGACACGCTTTAAAAACGTGAGGCTTACGACAATACGTAACAGGCCAAAGCGGACAATATCTATTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTTCAGTCACTGGCCGGTGTGCCAGTGAGGACTCTAGGCCCTCAAGGNAAGACATGCTTTAAAACCGTGAGGCTGACGACAATACGTAACGGGCCAAAGCGGGCAATATCTACTAGTGGTGTGCTTGGACTGTTACAAATGGTATCAGAGTCGGACAATGGGTGGTGTGCCAGCGAGGACTCTAGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGTGGAACGAAACATTCCTTAGAAGGGTGTAGGAACCTCTACCTAGAAGACACGCTTTAAAATCGTGAGGCTGACAACAATATGTAACGGGCCAAAGCGGACAATATCTACTAGTGGTGAGCTTGGGCTGTTACAAACGGTATCAAAGTCAGACACTGGGCGGTGTGTCAGCGAGGGCTCTAGGCCCTCAAGGGGGGTGGATTGTGAAATCCTACATTGGTTGGAGGGGGGAACGAAACATTCCTTAGAAGGGTGTAGAAACCTCTCCCTAGAAGACATACTTTAAAATCGTGAGGCTGATGACAATACATAACGGGCCAAAGCGGACAATATCTACTAGCGGTGAGCTTGGGCTGTTACAAATGGTATCAGAGTCAGACACTGGCCGGTGTGCTAGCGAGGACTCTAGGCCTTCAAGGTGGGTGAATTGTGAGGTCCCACATTGGTTGGAGAGGGGAACAAAACATTCTTCCTAAGGGTGTGGAGACCTCTCCCTAGTAGACGTGTTTTAAAACCGTGAGGCTGACAGCGATAGGAGGCCAAAGCGGACAATATATATTAGTGGTGAACTTGGACTATTACAAATGGTATTAGAGTCAGACATCTGGCGGTGTGCCAGCGAGGACGCTAGGCCCTCAAGGGGGTGGAATGTGAGATCCCACATTAGCTTGAGAGGGGAACGAAACGTTCCTCATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTTAAAACAGCGAAACGTAACGGGACAAAGCGAACAATATCGGCTAGCGGTGGGCTTAGGCTATTACACTAATATTTTGAATCATGAACGTGGATTACTACGGTAGTTTTTCCTTCCTGAACAGCTTCCCATGCTTTTGTGTTTAATAACCTTGTCAGCCATCATCTTTTCCATTTTTATGTAGAATGGAGACATTGAAGAGGATCCTTCCTGTTTCTGGTCCCGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGGTACTTCCTGCTCACTTGTTATAGTTCTCAAGGATTTTAAGAACAAACATTATCTTATTTATGTTTAGCCCATTTGCTAGAACAATAAGATTCCAATTTGATTTGGTTCCTAAGATTTAGTTCTGTACTGATAGATACTTTTGTATAAGAACTTGTTCATAAAGCCCTGATTTATTATGCTATGTAGTGTTGCACTAAATAAATACCAATTATATTTTGAACCTCTGATTTCAGTCAGAGTACCGAAGGAAAATATATGTGAAGATGCTTCTTATGGAACCCAACTCTGAGACGACCACTGCATTACCATCAATGTCTGGGCCTATGGCTTCCGATCAACCTTAGAAACAGGCTGCAATTACTCTTCCACAACATCCAAAATAACACTGCTTCCTCTAATGTATTTACCAACTCTTAGAGGCCAATATACTTCATTGGCTAATAAGGTGAATGAGTCTACAGTTTGAACGTGTTAGTGTTGGAATGAGATATTGTATGATAATTGAAACTGTTCTGAATCTTGAATGTTCATGAGATTGTTTTGAATGAAATATTGTATGACAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTAAATGAGATATTATATGATCATTGAAATTGTTTTTAATCTTGAATGTTGATGATATTGTTTTGAATGAAATATTATATGATAATTGAAATTGTTCTTAATCTTGAATGTCCATGATATTGTTTTGAATGAGATATTATATGATCATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGTTTTGAATGAGATATTATGTGATAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGANTCCCTAGTAGACGCGTTTTAAAACAGCGAAACGTAACGGGACAAAGCGAACAATATCGGCTAGCGGTGGGCTTAGGCTATTACACTAATATTTTGAATCATGAACGTGGATTACTACGGTAGTTTTTCCTTCCTGAACAGCTTCCCATGCTTTTGTGTTTAATAACCTTGTCAGCCATCATCTTTTCCATTTTTATGTAGAATGGAGACATTGAAGAGGATCCTTCCTGTTTCTGGTCCCGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGGTACTTCCTGCTCACTTGTTATAGTTCTCAAGGATTTTAAGAACAAACATTATCTTATTTATGTTTAGCCCATTTGCTAGAACAATAAGATTCCAATTTGATTTGGTTCCTAAGATTTAGTTCTGTACTGATAGATACTTTTGTATAAGAACTTGTTCATAAAGCCCTGATTTATTATGCTATGTAGTGTTGCACTAAATAAATACCAATTATATTTTGAACCTCTGATTTCAGTCAGAGTACCGAAGGAAAATATATGTGAAGATGCTTCTTATGGAACCCAACTCTGAGACGACCACTGCATTACCATCAATGTCTGGGCCTATGGCTTCCGATCAACCTTAGAAACAGGCTGCAATTACTCTTCCACAACATCCAAAATAACACTGCTTCCTCTAATGTATTTACCAACTCTTAGAGGCCAATATACTTCATTGGCTAATAAGGTGAATGAGTCTACAGTTTGAACGTGTTAGTGTTGGAATGAGATATTGTATGATAATTGAAACTGTTCTGAATCTTGAATGTTCATGAGATTGTTTTGAATGAAATATTGTATGACAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTAAATGAGATATTATATGATCATTGAAATTGTTTTTAATCTTGAATGTTGATGATATTGTTTTGAATGAAATATTATATGATAATTGAAATTGTTCTTAATCTTGAATGTCCATGATATTGTTTTGAATGAGATATTATATGATCATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGTTTTGAATGAGATATTATGTGATAATTGAAACTGTTCTTAATCTTGAATGTTCATGATATTGTTTTGAATGAGATATTATCTGATAATTGAAACGGTTTTGAATCTTGTATGTTCATGATATTGTTGTGATAGTTTTTCATGTTCGTATCACTAAATGAAACTGATTCGAAATTTCACGCATATTTGTGACTTATCGGTGACAATAACACGAAAATGGACTTAGAGTGATGTTGTGTGAGTGAAGAGTAGACTCGTGGTGGCCGAAAATCAAGCAGAGACGTCAGAAGGAAAAAGAATCGATCGAAGATTAGGGTTGCGATTAGCGTGACAGTCGGGTGAAGAAGAAACGACCCGTGGACTCGGGTTTGGACTCGGGGCACACCATCTGCAGCCACTGACTTCGTGTAGCCAACGACGACAGGTAAGCTAGGCCCACTTAATGCTACGCATATGGCCTATGAGACGCACCTTGCAACCCACAAAACTCGCGCGATCTGCACTCCCGAGCAACCTGACCCGAAGTCTGTGCCTCAACCCGACTAATGACTCGCGCTTCTCAGCCCCACACGTGTCTTGTCTCCAGTTGCGGCTCGTTCAGCATTGCACTCCTCGACTCGGCTCTGGCGCTGCTCCGTGGCGTAAACGGCTCGGGTTTGTGTATTTCAACATGCAATATAATCTAAACTTAGCCCTACAGATAAAGAATCACTTCAAGGGAAAGTAAAATTCAGATTAACTTGTTGATCAAATATCTCAAGACAATAATACTTGTTTGAGATTCGAATCACTCCACAAACAAGATTGATCTCGAGCTTGAATAATTCTATATGCAACCTAAACTACATAGAATTGCAAAGAAACAACATTGGCTAAAGGAAAGCACAAATGCTCATTTACTGTATTTTAAAAGTCTATTTTACAACTCTAACATACATGGCATTATATAGGCTCAAAATAAAAACTCTTTAACCTTCCACGGGGCTTTCCAAGAGATGTAACTTTCATACTTTATGACCACAATTAGACCATAATTAGTCACCAGGTAAATAAACCTTAAAATACACTAAAAATACAATAACTCTAGATTATCAACATTTAAGAATAAATGAAGCCCCATCTTGAAGTCTTTGTTGCATAAATGAAGCTTGATTATTCTTAACGTGACATTAATGTTGCATGATAAGATGCCTTGGTTCATATCGATTCCTGTTTCCTCAATTTAAGTTGAATTAAGTTAGGAGTTGGATTTACGAAAAGAATTAGGTAAATATTTCTAATAGGTTGAAGCGCCTTCGAATAAGTTGAAACGAACTTCGAGGGACGAAGTCAAACCAATTCAGAAATTAGATAGAGTTTCGGTTAAAGCCAACCAAGTAAGTGGCTCTACTATCAGTATGGTTGGAAGAGTTGCTTTATATATGATGACACGTGCCTAGTGGCCATGTATGTGTCATATGTTTGATGATATAATATGTTTTGCTACATGATATGCCCAGATATGCCAATTAATATTAGAACGTAAATGAATTTCGTATTATGCCTCGATGAATTATGATATGCTATGAAATGCCATTATACGCTGTGATATGATTATACTAGGTCATGATATGTTACGACATGCTATGACATGTTTTGAGACAAGAAGATTTATGATCGACAACCCTTAAGATATGTTTCAAAGATTATGCTATAAATGAAATGATTTTGTAAGGGCTGTCTTGCACGATTTGTTTGCAATGAAAAATGTTGGGACCTCATGCATAAATGTATGTTCACAAACGTAGGGATATTTTCTCTTATGAAGAGTACGAATGCGTACGTTATGAAAAGGAAAATGATCATCATGTTTATCATGATGCTACGACGGTCGCTATTGAATGTTGCAATGTTGCCTTCTAACTTGGGCTGTCTTTAGGATGGTTGATGGCTCGACCGCGCACGACCTTTTGGGCTTGGCTTCGAATGATGATTTTGAACCCCCGCTTGCGGTGGGTGTGAATAGTCGATCGAAGTCCAAGCTCGTTTTAGCGTACGACGCATGAGTGGTGATTTGGTTGCTTGGGTCTATAGGCTTGATGCTCGGGCATCAAACTGTAGGCGACATCTCCTCCTCATTGGTGTGTCCCAAAGTATTTTGAGTGACCGTAGTCCATTTGGGGGTCCGACAATAACACCAGAGAAGCTAGCTCGACCAACACTAGAATTTATGTTTGAATTTTGGGCATTGGATGGGATGAGGTGTGCCTCCTTTAGAACATATTCAAATTTGGTGAAGATCCAACGATTGAAAATGAAGATACGGTAGATTTTCTGATTTGTACTATTTCCGTAATAATCTCAAAGTTCAAGGGTATTTTGGTAATTTCAAAGGTCTAAAATTACTTCTTAGGGTATTGAAAATAACTTGCTATAAGAGTGTCTTAAGATTGGACTCCAATGTGACTACGACTGGACTAGGTCAGTTGCCCAATCCAACATTTGTTGACTGATTCAAAGATTTCAAGAGGTAATTTCTTAGAAATTCATCAGAAAATTTGATGAACATTGAGGAGAAACCGAGTAACTTGGTACCCAAGGTGTGAGGAGACAATCATTTATAAGGCTGTTCCCGATTCACTCTCTCCTTTTGCTTTGTTATTAGGGTTTTGACGCAGTTTAGATTGTTTCGAAACTTTTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATNAACATTGGCTAAAGGAAAGCACAAATGCTCATTTACTGTATTTTAAAAGTCTATTTTACAACTCTAACATACATGGCATTATATAGGCTCAAAATAAAAACTCTTTAACCTTCCACGGGGCTTTCCAAGAGATGTAACTTTCATACTTTATGACCACAATTAGACCATAATTAGTCACCAGGTAAATAAACCTTAAAATACACTAAAAATACAATAACTCTAGATTATCAACATTTAAGAATAAATGAAGCCCCATCTTGAAGTCTTTGTTGCATAAATGAAGCTTGATTATTCTTAACGTGACATTAATGTTGCATGATAAGATGCCTTGGTTCATATCGATTCCTGTTTCCTCAATTTAAGTTGAATTAAGTTAGGAGTTGGATTTACGAAAAGAATTAGGTAAATATTTCTAATAGGTTGAAGCGCCTTCGAATAAGTTGAAACGAACTTCGAGGGACGAAGTCAAACCAATTCAGAAATTAGATAGAGTTTCGGTTAAAGCCAACCAAGTAAGTGGCTCTACTATCAGTATGGTTGGAAGAGTTGCTTTATATATGATGACACGTGCCTAGTGGCCATGTATGTGTCATATGTTTGATGATATAATATGTTTTGCTACATGATATGCCCAGATATGCCAATTAATATTAGAACGTAAATGAATTTCGTATTATGCCTCGATGAATTATGATATGCTATGAAATGCCATTATACGCTGTGATATGATTATACTAGGTCATGATATGTTACGACATGCTATGACATGTTTTGAGACAAGAAGATTTATGATCGACAACCCTTAAGATATGTTTCAAAGATTATGCTATAAATGAAATGATTTTGTAAGGGCTGTCTTGCACGATTTGTTTGCAATGAAAAATGTTGGGACCTCATGCATAAATGTATGTTCACAAACGTAGGGATATTTTCTCTTATGAAGAGTACGAATGCGTACGTTATGAAAAGGAAAATGATCATCATGTTTATCATGATGCTACGACGGTCGCTATTGAATGTTGCAATGTTGCCTTCTAACTTGGGCTGTCTTTAGGATGGTTGATGGCTCGACCGCGCACGACCTTTTGGGCTTGGCTTCGAATGATGATTTTGAACCCCCGCTTGCGGTGGGTGTGAATAGTCGATCGAAGTCCAAGCTCGTTTTAGCGTACGACGCATGAGTGGTGATTTGGTTGCTTGGGTCTATAGGCTTGATGCTCGGGCATCAAACTGTAGGCGACATCTCCTCCTCATTGGTGTGTCCCAAAGTATTTTGAGTGACCGTAGTCCATTTGGGGGTCCGACAATAACACCAGAGAAGCTAGCTCGACCAACACTAGAATTTATGTTTGAATTTTGGGCATTGGATGGGATGAGGTGTGCCTCCTTTAGAACATATTCAAATTTGGTGAAGATCCAACGATTGAAAATGAAGATACGGTAGATTTTCTGATTTGTACTATTTCCGTAATAATCTCAAAGTTCAAGGGTATTTTGGTAATTTCAAAGGTCTAAAATTACTTCTTAGGGTATTGAAAATAACTTGCTATAAGAGTGTCTTAAGATTGGACTCCAATGTGACTACGACTGGACTAGGTCAGTTGCCCAATCCAACATTTGTTGACTGATTCAAAGATTTCAAGAGGTAATTTCTTAGAAATTCATCAGAAAATTTGATGAACATTGAGGAGAAACCGAGTAACTTGGTACCCAAGGTGTGAGGAGACAATCATTTATAAGGCTGTTCCCGATTCACTCTCTCCTTTTGCTTTGTTATTAGGGTTTTGACGCAGTTTAGATTGTTTCGAAACTTTTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATGTATGATTACTTTCTTCTTTCATGCCATTAATACCCTATTTGAGAAATTTGTTTGCGTTTTTGTAACTTTTTGATTCATAGTTCGCTGCTTAGATTAGTTACTAGGGGATGGTTTTCTGAATTAAGCTATGAACGGCAAAACCTTATCCTACTTTTTCTTAGATTTGTGTTGAACTGATGGATGGGAATTGATTCCAAAACCTCGTTGAAGATTGTTGGGAGGGAATTCCATGTTGGTTAATTAAGAGGATGATTGTAAGAAATATTATCTTCATTGGTATGAGGCCTTTTGGAAAACCAAAAGAAAAGCCACGAGAGGTTATGCTCAAAGTGGAAACTATCATATCATTGTAAAGAGTCGTGATTCCTAACATGGTATCAAAGTCATGCACTTAACTTAGCAATGTTAATAGAATCTTCAATTGTCGAACAAATTGTAAGCCTGGAAGGTATAGTCAAAAGTGCTCAAGAGAAAGGAGTCGAGCCTCGTTTAAGGGGCAAATGTTTGACAGCCACATAGACCTCAAAGAAGGCTCTATGGTGTAGTTTGTTCGAGGAAAGGATTGTTGAGAATTGTTGGGAGAGTCGTGGTTCCTAACCAATTTGTTAAAATTATCCCCGTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGATTTTGTGGAGGCAGTTCCTATTTGTTTCTTCAGGTATTATGGTAGGTTAGATTTTGTGGAGGCGGTTCCTATTTGTTGCTTCAGGTATCACCCCCTTCATCGAACATAAGAAGTATAAGTTATTTGAAGCAGTATGGATCCTATTGAATTTTCCTCAACTATTCTAAATTCCATTCCTTTAGAATGCTCTTATATTAGTTGAAAATGCTCCTATATTAGTTGGAGAAGGGAACAAAACATTTCTCGTAAGGGTGTGAAAACGTTTCCTTAGTAGACGTGTTTTAGAACTGTGAGGTTGACGACGATACGAAACAGGCCAAAGTGGACAATATTTGTNAAGTCCAAGCTCGTTTTAGCGTACGACGCATGAGTGGTGATTTGGTTGCTTGGGTCTATAGGCTTGATGCTCGGGCATCAAACTGTAGGCGACATCTCCTCCTCATTGGTGTGTCCCAAAGTATTTTGAGTGACCGTAGTCCATTTGGGGGTCCGACAATAACACCAGAGAAGCTAGCTCGACCAACACTAGAATTTATGTTTGAATTTTGGGCATTGGATGGGATGAGGTGTGCCTCCTTTAGAACATATTCAAATTTGGTGAAGATCCAACGATTGAAAATGAAGATACGGTAGATTTTCTGATTTGTACTATTTCCGTAATAATCTCAAAGTTCAAGGGTATTTTGGTAATTTCAAAGGTCTAAAATTACTTCTTAGGGTATTGAAAATAACTTGCTATAAGAGTGTCTTAAGATTGGACTCCAATGTGACTACGACTGGACTAGGTCAGTTGCCCAATCCAACATTTGTTGACTGATTCAAAGATTTCAAGAGGTAATTTCTTAGAAATTCATCAGAAAATTTGATGAACATTGAGGAGAAACCGAGTAACTTGGTACCCAAGGTGTGAGGAGACAATCATTTATAAGGCTGTTCCCGATTCACTCTCTCCTTTTGCTTTGTTATTAGGGTTTTGACGCAGTTTAGATTGTTTCGAAACTTTTGAATGGATTCAAATCATTTGGGGCCTGCTCAAGGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATGTATGATTACTTTCTTCTTTCATGCCATTAATACCCTATTTGAGAAATTTGTTTGCGTTTTTGTAACTTTTTGATTCATAGTTCGCTGCTTAGATTAGTTACTAGGGGATGGTTTTCTGAATTAAGCTATGAACGGCAAAACCTTATCCTACTTTTTCTTAGATTTGTGTTGAACTGATGGATGGGAATTGATTCCAAAACCTCGTTGAAGATTGTTGGGAGGGAATTCCATGTTGGTTAATTAAGAGGATGATTGTAAGAAATATTATCTTCATTGGTATGAGGCCTTTTGGAAAACCAAAAGAAAAGCCACGAGAGGTTATGCTCAAAGTGGAAACTATCATATCATTGTAAAGAGTCGTGATTCCTAACATGGTATCAAAGTCATGCACTTAACTTAGCAATGTTAATAGAATCTTCAATTGTCGAACAAATTGTAAGCCTGGAAGGTATAGTCAAAAGTGCTCAAGAGAAAGGAGTCGAGCCTCGTTTAAGGGGCAAATGTTTGACAGCCACATAGACCTCAAAGAAGGCTCTATGGTGTAGTTTGTTCGAGGAAAGGATTGTTGAGAATTGTTGGGAGAGTCGTGGTTCCTAACCAATTTGTTAAAATTATCCCCGTTCATTTTTCCATTTCTGGTTTGAAGATATGGTAGGTTAGATTTTGTGGAGGCAGTTCCTATTTGTTTCTTCAGGTATTATGGTAGGTTAGATTTTGTGGAGGCGGTTCCTATTTGTTGCTTCAGGTATCACCCCCTTCATCGAACATAAGAAGTATAAGTTATTTGAAGCAGTATGGATCCTATTGAATTTTCCTCAACTATTCTAAATTCCATTCCTTTAGAATGCTCTTATATTAGTTGAAAATGCTCCTATATTAGTTGGAGAAGGGAACAAAACATTTCTCGTAAGGGTGTGAAAACGTTTCCTTAGTAGACGTGTTTTAGAACTGTGAGGTTGACGACGATACGAAACAGGCCAAAGTGGACAATATTTGTTAGTAGCGAGCTTAGACTGTTACAAATGGTATCAGAGTCAGACATCGAAAGATGTGCCAGCGAAGACGCTAGGCTCTCAAAGGGGTGGATTGTGAGATCCCACGTTGGTTGGAGAGGGGAATGAAACATTCTTCATAAGAGTGTGGAAACCTCTCCCTAGTAGATGCGTTTTAAAACTGTGAGGCTGACAGTAATATGTAACGGACCAAAGTGGACAATATCTGCTAGCGGTAAGCTTAAGCTGTTACACTAATATTTTGAATCATGGACGTGGGTTACAGCTATAGTGTTTCGCTCCCAAACAGCTTCCTGTGCTTTTGCATTTAATAAACTTGTCAGCCACCATCTTTTCCATTTTTATGTAGAATTGAGACATTGAAGAGGCACCTTCCTGTTTCGGGTCCTGAGGGGTTGAGTGAGGTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTCACACTGTCGCTACCAGTCAGGTACTTGCTGCTCTCACTTGTATGAAGCCCTGATCTATTATGCTATCTACTAAATACAACACTGAGTACTCACCATTTCTATTTTGAACCTCTGATTTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGCCTGGGACGACCCCTGCATTACCATCAATGTCTGGGCCTATGGCTTCCAATCAACCTTAGCCACGCCTGCAATGACTCTCCCAGAATATCCAAAATAACACTGCTCCTTCTAATGTATTTACCAACTCTGTGAGGCCAATAATGTAAATATACTTTATTCGCTATTAAGGTAAACACTCTACTATTATTAGAACGTTTCAGTGCTTGGAATGAGATATTTATATGATAATTGAAACTCTTCTGCCTCTTTTAAGTGTTCATGAGATTGGTTTGAATGAGATATCATATCATAATTAAAATTGTTCTGAATCTTGAGTGTTCATGAGACTGGTTTGAATTGAAACTGTTCTGAATCTTGAGTGTTCACGAGATTGGTTTGAATGAGATATTATATGATAATTGAAACTGTTTTGAATCTTGAGTGTTCATATTAGATGATGATTGAAACTGTTCTGAATCTTGAATGTTCATGAGATTGGTTTGAATGAGATATTACATGATAATTGAAGCGGTTCTGAATCTTGAATGTTCATGAGATTGTTTTGAATGAGATATGATATGATAATTGAAATTGTTTTGAATCTTGAATGTTTATGAGATTGTTTTGAATGAGATGTTATATGATAATTGAAACGGATATGAATCTTGAATGTTCATGATATTGTTGTGATAGTTTTTCATGTTGGTATCACTAAATAAAAATGTTTCGGAATCTCACGCATATTTGTGACTTCATTAAACGCTATGTTTACAAAGGTTATGTGTAACTGCAGCAGAAAAAAGAAAAAAAAAGAATGATGAATGATGNCTTTTGTTGGTCCCTTTTTCCTTCTCGAACACCTTGAAATTCTTTCTCGGAGAAATTAAAATTTCGAGAAATTCCTTAAAATTTGAAAAAATGTGCTTACTTTTTCCGTGCCTAAAATTACTTCTTATGAGGATGAAAATAACTCATTTAAGACTCTTGTTCTCGTTTGAATGTCTTAATTTCTTTTCTTGATGTTTTGGGTCGATCCTTATTTTGGATATTACACCAAACCTCAAAGCCCAATCCAACAAGCTTACAGACTCGATTCCAATGTGATTACGACTGGACTATGTCAGTTGTCCATCCCAACATTTGTCGACTGATGCGAAGATGTCAAGAGGTAATTTCTTAGAAATTCATAAGAAAATATTTTATGAACATTGAGGAGAAACCGAGGTGTGTCGATTCTTCACTCTCTCCTTTTGCTTTGTTATTGGGGTTTTGACGCAATTTCTTCTCGTAGGTTATTTCGCAACTTTTGAATGGATTCAAATCATTTAGGGCCTGCTCAAGGTGGAGAACCCGAAGTCGATGCCGGGGTTTGGAGGTCTCAGTTGCCGCCGGATTCTCGGCATCGAATTGTCAACAGCATGTATGATTTCTTGCTTCTTTGAGAAGTTTGTCTGAGATTTTGTAACTTTTTGTGTTCACAGTTCGCTGCTTAGATTAGTTACTATGTGATGGTTTTCTGAATTAAGCTATGAACGGCAAAACCTTTTTCTACCTTTTCTTAAATTTGTGTTTTAACTGATGGATGCGAACTGATTTTTCCAAAACCTTATCAAAGTAAAAACTACGAGAGATTATGCTTAAAGTGGAAAATATCATGCAATTGCATAAAGTCGTGATTCCTAACTTGGTATCAGAGTCATGCTATATTAAGATATAGCAATGTTGATAGAATCTCAAATGTCGAACAAAGAAGTTGTGAGCTTTGAAGGTGTAGTCAAAAGTGACTCAAGTGTTCTCTGTTCAAGGGATCCAGAGAAAGAAGTCGAGGGTGTACTTTGTTCGAGGGCTTCAGAGAAAGGAGTCGAGCCTCGTTTAAGGGGAGATTGTTCTTGAGGGCTACATAGTTCTCAGGAGAGTCTCTATAGTGTAGTTACTTTGTTCGAGGGCTCCAGAGAAAGTAGTCGAGCTTCGTTTAAGGGGAAATTGTTCTCGAGGACTACATAGTTCTCAGGAGAAGTTCTATAGTGTAGTTTGTTCGAGGGAAGGATTGTTGAAGATTGGTGAGAGAGTGGTGATTTAAAATTATCCCGTTCACTTTTCCATTTCTGGTTTGAAGATGTGGTAGGTTAGATTTTGTGGATGCGGTTCCTATTTGTTTCTTCAAGTATTACCCCCTTCATTGAACATAAGACGTATATCTTATTTGAAGCAGTATGGATCCTATTGAATTTTCCTCAACTGTTCTAAAGTCCATTCGTTCCGAATGCCTAATTGTGAGATCCCATAGTAGTTGGAGAGGGGAACGAAACATTCGTCGTAATGGTGTGAAAACCTCTCCCTAGTAGACGCGTTTTAAAACCGTGAGACTGACAATGATACTTAACGGACCAAAGCGGACAATATATTTTAGTGGTGGGCTTAGGCTGTTACAAATGGTACTAGAGTCACCTAGCGAGGACGCTAGACCCTCTAGGGGTGGAATGTGAGATCCCACATCGGTTGGAGAGCGGAACCCTAGACCCTCAAGAGGTGATTGCGAGATCCCATATCGGTTGGAGAGGGGAACCCTAGACCCTCAAGGGGTGAATTGTGAGATCCCACATCAGTTGGAGAGGGGAACCTTAGACCCTCAAGGCATGGATTGTGAGATCCCACATTGGTTGGAGAGGGAACGAAACATTCCTCGTAAGGGTGTGGAAACCTCCATAGTAAACGCGTTTTAAAACGGCGATACATAACAGGTCAAAACAAATAATATCTGCTAGCGGTAAGCTTAAGCTGTTACACTAATATTTTGAATCATGGACGTGGGTTACAGCCATAGTGTTTCGCTCTCGAACAGCTTCCCATGCTTTTGCGTTTAATAAACTTGTCAGCCATCATCTTTTCCATTTTTATGTAGCATTGAGACATTGAAGAGGCACATTCCTGTTTCTGGTCCTGAGGGATTGAATGAGTTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTTACAATGCCGCTACCAGTCAGGTACTTGCTGCTCTCACTTGTATAAAGCCCTGATTTATTATGCTATCTACTAAATACAACGCTGAGTACTCACCATTTCCATTTTGAACCTCTGATTTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGTCTGGGACGTTCACTGCATTACCATCAGTGTCGGGCCCTGTGTCTTCCAATCAACCTTAGTCAGAGAACATCCACAATAACAATGTTTCTTCTAATGTATTTACCAACTCTCAGAGGCCAATATACTTTATTAGTATATTGGCTAATAAGGTAAATGACTCTACTATTAGAACGTCTAGAACGTCTTAGTATGTCTTAGTGTTGGAATGAGATATTATATGATAGTCTTAGTACGTCTTAGTGTTGGAATAGAACTGTTCTGAATCTTGAATGTTTATTATGTTAGTCTTAGTACGTCTTAGTGTTGGAATGAGATATTATATGATAATTAGAACTGTTCTAAATCTTGAATGTTTATTGAGAACGGTTCTGAATCTTGAATGTTCGTTCAGAACGGTTATGAATCTTGAATGTTCATTGAGAACAGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCCGAATCTTGGTTCTGAATCTTGAATGTTCGTTGAGAACCGTTCTGAATCTTGAATGTTCGTTGAGAACGATTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCCGAATCTTGAATGTTCATTGAGATTGTTGTGATAGTTTTTCATGTTCGTATCACTTAATGAAAATGTTACTTAATGAAAATGTTACGGAATCTCATGCATATTTGTGACTTTATAAACGCTATGTTTCTAAGGAGCTGTATAACGG

mRNA sequence

GGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGAATGGAGACATTGAAGAGGCTCCTTCCTGTTTCTGGTCCTGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGTTTTTTCTCGTGGGTTGTTTTGGAACTTTTGAATGGATTCGAATAATTGGAGGCCTGCTCAAGGTGGAGAACCCAGAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATAATTGAGACATTGAAGAGGCACCTTCCTGTTTCGGGTCCTGAGGGGTTGAGTGAGGTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTCACACTGTCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGCCTGGGACGACCCCTGCATTACCATCAATTTGTCCATCCCAACATTTGTCGACTGATGCGAAGATGTCAAGAGGGCCTGCTCAAGGTGGAGAACCCGAAGTCGATGCCGGGGTTTGGAGGTCTCAGTTGCCGCCGGATTCTCGGCATCGAATTGTCAACAGCATCATTGAGACATTGAAGAGGCACATTCCTGTTTCTGGTCCTGAGGGATTGAATGAGTTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTTACAATGCCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGTCTGGGACGTTCACTGCATTACCATCAGTGTCGGGCCCTGTGTCTTCCAATCAACCTTAGTCAGAGAACATCCACAATAACAATGTTTCTTCTAATGTATTTACCAACTCTCAGAGGCCAATATACTTTATTAGTATATTGGCTAATAAGGTAAATGACTCTACTATTAGAACGTCTAGAACGTCTTAGTATGTCTTAGTGTTGGAATGAGATATTATATGATAGTCTTAGTACGTCTTAGTGTTGGAATAGAACTGTTCTGAATCTTGAATGTTTATTATGTTAGTCTTAGTACGTCTTAGTGTTGGAATGAGATATTATATGATAATTAGAACTGTTCTAAATCTTGAATGTTTATTGAGAACGGTTCTGAATCTTGAATGTTCGTTCAGAACGGTTATGAATCTTGAATGTTCATTGAGAACAGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCGTTGAGAACGGTTCCGAATCTTGGTTCTGAATCTTGAATGTTCGTTGAGAACCGTTCTGAATCTTGAATGTTCGTTGAGAACGATTCTGAATCTTGAATGTTCGTTGAGAACGGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCTGAATCTTGAATGTTCATTGAGAACTGTTCCGAATCTTGAATGTTCATTGAGATTGTTGTGATAGTTTTTCATGTTCGTATCACTTAATGAAAATGTTACTTAATGAAAATGTTACGGAATCTCATGCATATTTGTGACTTTATAAACGCTATGTTTCTAAGGAGCTGTATAACGG

Coding sequence (CDS)

GGAGAATCCGAAGTCGATGCCGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATATATGGTAGGTTAGGTTTTGTGGAGGCGGTTCCTATTTGTTACTTCAGAATGGAGACATTGAAGAGGCTCCTTCCTGTTTCTGGTCCTGAGGGATTGAGTGAGATGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGATTTATACTGCCGCTACTAGTGAGTTTTTTCTCGTGGGTTGTTTTGGAACTTTTGAATGGATTCGAATAATTGGAGGCCTGCTCAAGGTGGAGAACCCAGAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGATTCTCGGCATCAAATTGTCGACAGAATGTGGAGAATCCGAAGTCGATGCTGGGGATTGGAGGTCTCAATTGCAGCCGGAATCTCGGCATCGAATTGTCGACAAGATAATTGAGACATTGAAGAGGCACCTTCCTGTTTCGGGTCCTGAGGGGTTGAGTGAGGTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTCACACTGTCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGCCTGGGACGACCCCTGCATTACCATCAATTTGTCCATCCCAACATTTGTCGACTGATGCGAAGATGTCAAGAGGGCCTGCTCAAGGTGGAGAACCCGAAGTCGATGCCGGGGTTTGGAGGTCTCAGTTGCCGCCGGATTCTCGGCATCGAATTGTCAACAGCATCATTGAGACATTGAAGAGGCACATTCCTGTTTCTGGTCCTGAGGGATTGAATGAGTTGAGGAAAATTGCTGTAAGGTTCGAGGAAAAGGTTTACAATGCCGCTACCAGTCAGTCAGATTACCTAAGGAAAATATCTCTGAGGATGCTTACTATGGAAACCAAGTCTGGGACGTTCACTGCATTACCATCAGTGTCGGGCCCTGTGTCTTCCAATCAACCTTAG

Protein sequence

GESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEMRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLGIGGLNCSRILGIKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSDYLRKISLRMLTMETKSGTFTALPSVSGPVSSNQP
BLAST of Cp4.1LG04g06860 vs. Swiss-Prot
Match: MD15A_ARATH (Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana GN=MED15A PE=1 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 5.6e-29
Identity = 61/99 (61.62%), Postives = 80/99 (80.81%), Query Frame = 1

Query: 237 GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAA 296
           GEP +D G WR+QLPPDSR +IVN I+ETLK+H+P SGPEG+NELR+IA RFEEK+++ A
Sbjct: 13  GEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRIAARFEEKIFSGA 72

Query: 297 TSQSDYLRKISLRMLTMETKS----GTFTALPSVSGPVS 332
            +Q+DYLRKIS++MLTMETKS    G+  A+P+ +   S
Sbjct: 73  LNQTDYLRKISMKMLTMETKSQNAAGSSAAIPAANNGTS 111

BLAST of Cp4.1LG04g06860 vs. TrEMBL
Match: A0A078H3Z0_BRANA (BnaC05g12030D protein OS=Brassica napus GN=BnaC05g12030D PE=4 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 9.5e-60
Identity = 138/329 (41.95%), Postives = 192/329 (58.36%), Query Frame = 1

Query: 2   ESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEM 61
           E  V+A DWR+ L PDSR +  ++I G                T+K+ +P  G E   E+
Sbjct: 49  EPAVNASDWRTCLLPDSRKKNANKIKG----------------TVKKHIPNRGKERNKEL 108

Query: 62  RKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLGIGGLNCSRILGI 121
           +KIA  FEE I+  A  + F+      F W                       C +I+ +
Sbjct: 109 KKIAATFEELIFNTAIDQGFVF-----FFWS---------------------GCEKIIKV 168

Query: 122 KL----STECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAV 181
            +    S   GE  +D  DWR+ L  +SR +IV KI+ TL +HLP SGPEG++E+++IAV
Sbjct: 169 LILMAPSVNNGEPAMDTNDWRNHLPFDSRQKIVSKIMATLMKHLPYSGPEGINELKRIAV 228

Query: 182 RFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLST-----DAKMSRG 241
           RFEEKV + A  Q+DYLRKIS++MLTMET+        S  P+   +      D  M  G
Sbjct: 229 RFEEKVFSSAVYQTDYLRKISMKMLTMETRSQNVAGSASYIPADRSNLALDELDNLMING 288

Query: 242 PAQ----GGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRF 301
             +      EP +++G WR+QLPP SR  IVN I++TLKRH P SGPEG+NEL++IA RF
Sbjct: 289 NVEPFLLNEEPAINSGDWRTQLPPGSRQNIVNKIMDTLKRHFPYSGPEGINELKRIAARF 335

Query: 302 EEKVYNAATSQSDYLRKISLRMLTMETKS 318
           EEK++++A +Q+DYLRKIS++MLTMETK+
Sbjct: 349 EEKIFSSAVNQTDYLRKISMKMLTMETKA 335

BLAST of Cp4.1LG04g06860 vs. TrEMBL
Match: F2DL41_HORVD (Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1)

HSP 1 Score: 219.5 bits (558), Expect = 6.0e-54
Identity = 118/213 (55.40%), Postives = 153/213 (71.83%), Query Frame = 1

Query: 135 GDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYL 194
           GDWR++LQPE+R R+V+KI+ETLK+HLPVS PEGL+E++ IAVRFEEK++T AT+QSDYL
Sbjct: 19  GDWRAELQPEARGRVVNKIMETLKKHLPVSVPEGLNELQIIAVRFEEKMYTAATNQSDYL 78

Query: 195 RKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMS---------RGPAQGGEPE----- 254
           R+IS++ML+METK   TP    + P+Q+    A +S           P QG +P      
Sbjct: 79  RRISIKMLSMETKTQQTPGNAQVIPNQNNPGQAPVSCLRMPDGTPWRPTQGSDPAAVTAV 138

Query: 255 ---------VDA-------GVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKI 314
                    VD        G WR+QL P++R R+VN I+ +L++H+PVSGPEG NEL KI
Sbjct: 139 VAAAAATVGVDPNAPAPTRGDWRAQLQPEARSRVVNKIMVSLQKHLPVSGPEGPNELEKI 198

Query: 315 AVRFEEKVYNAATSQSDYLRKISLRMLTMETKS 318
           AVRFEEK+YNAATSQSDYLRKISL+ML+METK+
Sbjct: 199 AVRFEEKIYNAATSQSDYLRKISLKMLSMETKT 231

BLAST of Cp4.1LG04g06860 vs. TrEMBL
Match: M4EBE6_BRARP (Uncharacterized protein OS=Brassica rapa subsp. pekinensis PE=4 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 3.9e-53
Identity = 125/291 (42.96%), Postives = 179/291 (61.51%), Query Frame = 1

Query: 42  RMETLKRLLPVSGPEGLSEMRKIAVRFEEKIYTAATSEFFLVGCFG----TFEWI--RII 101
           RM TL + LP SGPEG++E+++IAVRFEEK+++++  +   +        T E     + 
Sbjct: 208 RMATLMKHLPYSGPEGINELKRIAVRFEEKVFSSSVHQNDYLRKISMKMLTMETKSQNVA 267

Query: 102 GGLLKVENPESMLGIGGLNCSRILGIKLSTECGESE--VDAGDWRSQLQPESRHRIVDKI 161
           G    +    S L    LN   I    +       E  + +GDWR+QL P SR  IV+KI
Sbjct: 268 GSASSIPADSSNLAFDELNNLMINNGNVEPFLLNEEPAIKSGDWRTQLPPGSRQNIVNKI 327

Query: 162 IETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPA 221
           ++TLK+H P SGPEG++E+++IA RFEEK+ + A  Q+DYLRKIS+++LTMETK      
Sbjct: 328 MDTLKKHFPYSGPEGINELKRIAARFEEKIFSSAVHQTDYLRKISMKILTMETKAQNAAG 387

Query: 222 LPS--ICPSQHLSTDAKMSRGPAQGGEPE-------VDAGVWRSQLPPDSRHRIVNSIIE 281
             S  +  S +L+ D  M+       EP        +++G WR QLPPDSR + ++ + E
Sbjct: 388 SDSSILADSNNLTLDDIMNHLIKDNAEPSLLNVEPAINSGDWRIQLPPDSRQKNIDKLTE 447

Query: 282 TLKR-HIPVSGPEGLNELRKIAVRFEEKVYNAATSQSDYLRKISLRMLTME 315
            LK+ H+P SGPEG+NE  KIA RFE+KV+N A + +DYLRKISL +LT+E
Sbjct: 448 ALKKQHLPFSGPEGVNEHSKIASRFEDKVFNTAANLNDYLRKISLEVLTIE 498

BLAST of Cp4.1LG04g06860 vs. TrEMBL
Match: W5AVR2_WHEAT (Uncharacterized protein OS=Triticum aestivum PE=4 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 6.6e-53
Identity = 119/205 (58.05%), Postives = 151/205 (73.66%), Query Frame = 1

Query: 136 DWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYLR 195
           DWR+QL PE+R R+V+KI+E LK+HLPVS PEGL+E++ IAVRFE+K++  ATSQSDYLR
Sbjct: 23  DWRAQLLPEARSRVVNKIMECLKKHLPVSVPEGLNELQIIAVRFEDKIYAAATSQSDYLR 82

Query: 196 KISLRMLTMETK--PGTTPALPS-------ICPSQHLSTDAKMSRGP---------AQGG 255
           KISL+ML+METK  PG    +P+        C      T  + ++GP         A GG
Sbjct: 83  KISLKMLSMETKTQPGNAQVIPNQMNLGQASCLRMPDGTPWRPTQGPDPAAVAAAVAAGG 142

Query: 256 --EPEVDA---GVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKV 315
             +P   A   G WR QL P++R RIVN+I+E+LK+H+PVS PEGLNEL KIAVRFEEK+
Sbjct: 143 GVDPNASAPTGGDWRPQLQPEARGRIVNNIMESLKKHLPVSRPEGLNELEKIAVRFEEKI 202

Query: 316 YNAATSQSDYLRKISLRMLTMETKS 318
           YNAATSQSDYLRK+SL+ML+METK+
Sbjct: 203 YNAATSQSDYLRKVSLKMLSMETKT 227

BLAST of Cp4.1LG04g06860 vs. TrEMBL
Match: A0A0D3CAL8_BRAOL (Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 6.6e-53
Identity = 127/325 (39.08%), Postives = 189/325 (58.15%), Query Frame = 1

Query: 1   GESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSE 60
           GE  +D  DWR+ L  DSR +IV +I                M TL + L  SGPEG++E
Sbjct: 8   GEPAMDTNDWRNHLPFDSRQKIVSKI----------------MATLMKHLSYSGPEGINE 67

Query: 61  MRKIAVRFEEKIYTAATSEFFLVGCFG----TFEWI--RIIGGLLKVENPESMLGIGGLN 120
           +++IAVRFEEK++++A  +   +        T E     + G    +    S L +  L+
Sbjct: 68  LKRIAVRFEEKVFSSAVYQTDYLRKISMKMLTMETRSQNVAGSASYIPADRSNLALDELD 127

Query: 121 CSRILG-IKLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVR 180
              I G ++      E  +++GDWR+QL P SR  IV+KI++TLKRH P SGPEG++E++
Sbjct: 128 NLMINGNVEPFLLNEEPAINSGDWRTQLPPGSRQNIVNKIMDTLKRHFPYSGPEGINELK 187

Query: 181 KIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPS--ICPSQHLSTDAKMSR 240
           +IA RFEEK+ + A +Q+DYLRKIS++MLTMETK        S  +  S +L+ D    +
Sbjct: 188 RIAARFEEKIFSSAVNQTDYLRKISMKMLTMETKAQNAAGSDSSILADSNNLTLDIISKQ 247

Query: 241 GPAQGG-------------EPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGL 300
            P+                EP +++G WR QLP DSR + ++ ++ETLK+H+P SG EG+
Sbjct: 248 RPSWNHLIKDNAETSLLNVEPTINSGDWRIQLPLDSRQKNIDKLMETLKKHVPYSGQEGI 307

Query: 301 NELRKIAVRFEEKVYNAATSQSDYL 304
            ELR+IA+ FEE ++N A +Q  +L
Sbjct: 308 EELRRIALSFEELIFNTAINQDTHL 316

BLAST of Cp4.1LG04g06860 vs. TAIR10
Match: AT1G15790.1 (AT1G15790.1 unknown protein)

HSP 1 Score: 159.5 bits (402), Expect = 3.7e-39
Identity = 84/179 (46.93%), Postives = 118/179 (65.92%), Query Frame = 1

Query: 135 GDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEEKVHTVATSQSDYL 194
           GDWR+Q    SR RIV+KI+ET  + LP   PEG +E+RKIAVRFEEK+   A++Q++YL
Sbjct: 5   GDWRTQFPSASRSRIVNKIMETQLKQLPFIRPEGTNELRKIAVRFEEKLFNNASNQTEYL 64

Query: 195 RKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEVDAGVWRSQLPPDS 254
           R+I ++ML METK        S   +    T   +        EP V+ G WR+Q P DS
Sbjct: 65  RQICMKMLNMETKSQNAAGSSSADDN----TPPLVPEPSVPNNEPAVNTGDWRTQQPQDS 124

Query: 255 RHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSDYLRKISLRMLTM 314
           R + +N++++TLK+ +P SG EG++EL +IAV  EE ++N+A +Q DYL KISL+M TM
Sbjct: 125 RQKNINALLDTLKKIVPHSGKEGIDELMRIAVSLEELIFNSAINQEDYLGKISLKMRTM 179

BLAST of Cp4.1LG04g06860 vs. TAIR10
Match: AT1G15780.1 (AT1G15780.1 unknown protein)

HSP 1 Score: 129.8 bits (325), Expect = 3.1e-30
Identity = 61/99 (61.62%), Postives = 80/99 (80.81%), Query Frame = 1

Query: 237 GEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAA 296
           GEP +D G WR+QLPPDSR +IVN I+ETLK+H+P SGPEG+NELR+IA RFEEK+++ A
Sbjct: 13  GEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRIAARFEEKIFSGA 72

Query: 297 TSQSDYLRKISLRMLTMETKS----GTFTALPSVSGPVS 332
            +Q+DYLRKIS++MLTMETKS    G+  A+P+ +   S
Sbjct: 73  LNQTDYLRKISMKMLTMETKSQNAAGSSAAIPAANNGTS 111

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: gi|828322731|ref|XP_012573082.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 15a-like [Cicer arietinum])

HSP 1 Score: 277.3 bits (708), Expect = 3.5e-71
Identity = 173/363 (47.66%), Postives = 218/363 (60.06%), Query Frame = 1

Query: 2   ESEVDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEM 61
           E  +D  +WR+QLQPDSR +IV++I                M+T K+ LPVSG EGL E+
Sbjct: 13  EPTIDTSEWRAQLQPDSRQRIVNKI----------------MDTSKKHLPVSGSEGLLEL 72

Query: 62  RKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLGIGGLNCSRILGI 121
            KIA RFEEKI+TAATS+          +++R I   +     +S   I     S   G 
Sbjct: 73  WKIAQRFEEKIFTAATSQS---------DYLRKISMKMLTMETKSQSSIANNMPSNEGGP 132

Query: 122 KLS-----------------TECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVS 181
                                   E  +D  +WR+QLQP+SR RIV+KI++TLK+HLPVS
Sbjct: 133 SNKPPDQDNSHLMDSNNWRPNPGTEPTIDTSEWRAQLQPDSRQRIVNKIMDTLKKHLPVS 192

Query: 182 GPEGLSEVRKIAVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPAL--------PS 241
           G EGL E+ KIA RFEEK+ T ATSQSDYLRKIS++MLTMETK  ++ A         PS
Sbjct: 193 GSEGLLELWKIAQRFEEKIFTAATSQSDYLRKISMKMLTMETKSQSSIANNMPSNEGGPS 252

Query: 242 ICPSQHLSTDAKMSRG--PAQGGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSG 301
             P    ++    S    P  G EP +D   WR+QL PDSR RIVN I++TLK+H+PVSG
Sbjct: 253 NKPPDQDNSHLMDSNNWRPNPGTEPTIDTSEWRAQLQPDSRQRIVNKIMDTLKKHLPVSG 312

Query: 302 PEGLNELRKIAVRFEEKVYNAATSQSDYLRKISLRMLTMETKSGTFTA--LPSVSGPVSS 336
            EGL EL KIA RFEEK++ AATSQSDYLRKIS++MLTMETKS +  A  +PS  G  S+
Sbjct: 313 SEGLLELWKIAQRFEEKIFTAATSQSDYLRKISMKMLTMETKSQSSIANNMPSNEGGPSN 350

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: gi|698581864|ref|XP_009777664.1| (PREDICTED: uncharacterized protein LOC104227178 isoform X10 [Nicotiana sylvestris])

HSP 1 Score: 265.0 bits (676), Expect = 1.8e-67
Identity = 158/316 (50.00%), Postives = 197/316 (62.34%), Query Frame = 1

Query: 5   VDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEMRKI 64
           +D+ DWR+QL PDSR +IV+ I                 ETLKR L V+  EG+ E++KI
Sbjct: 1   MDSADWRTQLLPDSRQRIVNNI----------------TETLKRQLSVTREEGVQELKKI 60

Query: 65  AVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLG---IGGLNCSRILGI 124
           AV FEEKIYTAATS+   +        ++I+    K  NP + L      G N       
Sbjct: 61  AVGFEEKIYTAATSQPDYLQKIS----LKILTMETKSHNPMTNLSNAASSGQNAHDPGTA 120

Query: 125 KLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEE 184
           +     G   +DA DWR+QL P+ R  IV+KI ETL RHLPV+G EG+ E++KIA+RFEE
Sbjct: 121 RAGAAAGA--MDAADWRTQLLPDFRQSIVNKITETLMRHLPVTGEEGVQELKKIALRFEE 180

Query: 185 KVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEV 244
           K++T A SQ DYLRKISL+MLTMET     P   S   +            PA     ++
Sbjct: 181 KIYTAAISQPDYLRKISLKMLTMETD-SQNPMTNSANAASSGQNAHDPGTAPAGAAAGDM 240

Query: 245 DAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSD 304
           DA  WR+QL PDSR RIVN I ETLKRH+PV+G EG+ EL+KIA+RFEEK+Y AA SQ D
Sbjct: 241 DAVDWRTQLLPDSRQRIVNKITETLKRHLPVTGEEGVQELKKIALRFEEKIYTAAISQPD 293

Query: 305 YLRKISLRMLTMETKS 318
           YLRKISL+MLTMETKS
Sbjct: 301 YLRKISLKMLTMETKS 293

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: gi|698581860|ref|XP_009777663.1| (PREDICTED: uncharacterized protein LOC104227178 isoform X9 [Nicotiana sylvestris])

HSP 1 Score: 265.0 bits (676), Expect = 1.8e-67
Identity = 158/316 (50.00%), Postives = 197/316 (62.34%), Query Frame = 1

Query: 5   VDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEMRKI 64
           +D+ DWR+QL PDSR +IV+ I                 ETLKR L V+  EG+ E++KI
Sbjct: 1   MDSADWRTQLLPDSRQRIVNNI----------------TETLKRQLSVTREEGVQELKKI 60

Query: 65  AVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLG---IGGLNCSRILGI 124
           AV FEEKIYTAATS+   +        ++I+    K  NP + L      G N       
Sbjct: 61  AVGFEEKIYTAATSQPDYLQKIS----LKILTMETKSHNPMTNLSNAASSGQNAHDPGTA 120

Query: 125 KLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEE 184
           +     G   +DA DWR+QL P+ R  IV+KI ETL RHLPV+G EG+ E++KIA+RFEE
Sbjct: 121 RAGAAAGA--MDAADWRTQLLPDFRQSIVNKITETLMRHLPVTGEEGVQELKKIALRFEE 180

Query: 185 KVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEV 244
           K++T A SQ DYLRKISL+MLTMET     P   S   +            PA     ++
Sbjct: 181 KIYTAAISQPDYLRKISLKMLTMETD-SQNPMTNSANAASSGQNAHDPGTAPAGAAAGDM 240

Query: 245 DAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSD 304
           DA  WR+QL PDSR RIVN I ETLKRH+PV+G EG+ EL+KIA+RFEEK+Y AA SQ D
Sbjct: 241 DAVDWRTQLLPDSRQRIVNKITETLKRHLPVTGEEGVQELKKIALRFEEKIYTAAISQPD 293

Query: 305 YLRKISLRMLTMETKS 318
           YLRKISL+MLTMETKS
Sbjct: 301 YLRKISLKMLTMETKS 293

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: gi|698581831|ref|XP_009777654.1| (PREDICTED: uncharacterized protein LOC104227178 isoform X1 [Nicotiana sylvestris])

HSP 1 Score: 265.0 bits (676), Expect = 1.8e-67
Identity = 158/316 (50.00%), Postives = 197/316 (62.34%), Query Frame = 1

Query: 5   VDAGDWRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLSEMRKI 64
           +D+ DWR+QL PDSR +IV+ I                 ETLKR L V+  EG+ E++KI
Sbjct: 1   MDSADWRTQLLPDSRQRIVNNI----------------TETLKRQLSVTREEGVQELKKI 60

Query: 65  AVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLG---IGGLNCSRILGI 124
           AV FEEKIYTAATS+   +        ++I+    K  NP + L      G N       
Sbjct: 61  AVGFEEKIYTAATSQPDYLQKIS----LKILTMETKSHNPMTNLSNAASSGQNAHDPGTA 120

Query: 125 KLSTECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKIAVRFEE 184
           +     G   +DA DWR+QL P+ R  IV+KI ETL RHLPV+G EG+ E++KIA+RFEE
Sbjct: 121 RAGAAAGA--MDAADWRTQLLPDFRQSIVNKITETLMRHLPVTGEEGVQELKKIALRFEE 180

Query: 185 KVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQGGEPEV 244
           K++T A SQ DYLRKISL+MLTMET     P   S   +            PA     ++
Sbjct: 181 KIYTAAISQPDYLRKISLKMLTMETD-SQNPMTNSANAASSGQNAHDPGTAPAGAAAGDM 240

Query: 245 DAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNAATSQSD 304
           DA  WR+QL PDSR RIVN I ETLKRH+PV+G EG+ EL+KIA+RFEEK+Y AA SQ D
Sbjct: 241 DAVDWRTQLLPDSRQRIVNKITETLKRHLPVTGEEGVQELKKIALRFEEKIYTAAISQPD 293

Query: 305 YLRKISLRMLTMETKS 318
           YLRKISL+MLTMETKS
Sbjct: 301 YLRKISLKMLTMETKS 293

BLAST of Cp4.1LG04g06860 vs. NCBI nr
Match: gi|694444421|ref|XP_009348733.1| (PREDICTED: uncharacterized protein LOC103940355 [Pyrus x bretschneideri])

HSP 1 Score: 264.6 bits (675), Expect = 2.3e-67
Identity = 152/324 (46.91%), Postives = 197/324 (60.80%), Query Frame = 1

Query: 1   GESEVDAGD-WRSQLQPDSRHQIVDRIYGRLGFVEAVPICYFRMETLKRLLPVSGPEGLS 60
           GE  ++AGD WR+QLQ +SRH+IV +I                +ET+KR +P  GPEGL 
Sbjct: 13  GEPPMEAGDDWRTQLQSESRHRIVAKI----------------IETMKRHVPFDGPEGLR 72

Query: 61  EMRKIAVRFEEKIYTAATSEFFLVGCFGTFEWIRIIGGLLKVENPESMLGIGGLNCSR-I 120
           E+ +IAV FEE +Y  A+S+          ++IR I   +     +S   +   +    +
Sbjct: 73  EIERIAVTFEENMYVGASSQ---------SDYIRKISLKMLTMETKSQTAVPHASLDPFL 132

Query: 121 LGIKLST---ECGESEVDAGDWRSQLQPESRHRIVDKIIETLKRHLPVSGPEGLSEVRKI 180
           +   + T   E GE  ++ GDWRSQLQ +SRHRIV KI+ETLKRHLP +G EGL E+ KI
Sbjct: 133 MDTNIQTRPPEGGEPSMETGDWRSQLQLDSRHRIVAKILETLKRHLPFNGEEGLRELEKI 192

Query: 181 AVRFEEKVHTVATSQSDYLRKISLRMLTMETKPGTTPALPSICPSQHLSTDAKMSRGPAQ 240
           A RFEEK++  A+SQSDYLRKISL+ML ME KP                          Q
Sbjct: 193 AARFEEKIYVAASSQSDYLRKISLKMLAMENKP--------------------------Q 252

Query: 241 GGEPEVDAGVWRSQLPPDSRHRIVNSIIETLKRHIPVSGPEGLNELRKIAVRFEEKVYNA 300
           GGE  +    WRSQL PDSRHRI+  I E L+RH+P +G EGL+EL ++AVRFEEK+Y  
Sbjct: 253 GGETSMVTSNWRSQLQPDSRHRIIAKITEVLRRHLPFTGEEGLHELERVAVRFEEKIYTV 285

Query: 301 ATSQSDYLRKISLRMLTMETKSGT 320
           A SQSDYL+KISL++ TME KS T
Sbjct: 313 ALSQSDYLQKISLKLHTMENKSQT 285

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MD15A_ARATH5.6e-2961.62Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana ... [more]
Match NameE-valueIdentityDescription
A0A078H3Z0_BRANA9.5e-6041.95BnaC05g12030D protein OS=Brassica napus GN=BnaC05g12030D PE=4 SV=1[more]
F2DL41_HORVD6.0e-5455.40Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1[more]
M4EBE6_BRARP3.9e-5342.96Uncharacterized protein OS=Brassica rapa subsp. pekinensis PE=4 SV=1[more]
W5AVR2_WHEAT6.6e-5358.05Uncharacterized protein OS=Triticum aestivum PE=4 SV=1[more]
A0A0D3CAL8_BRAOL6.6e-5339.08Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G15790.13.7e-3946.93 unknown protein[more]
AT1G15780.13.1e-3061.62 unknown protein[more]
Match NameE-valueIdentityDescription
gi|828322731|ref|XP_012573082.1|3.5e-7147.66PREDICTED: mediator of RNA polymerase II transcription subunit 15a-like [Cicer a... [more]
gi|698581864|ref|XP_009777664.1|1.8e-6750.00PREDICTED: uncharacterized protein LOC104227178 isoform X10 [Nicotiana sylvestri... [more]
gi|698581860|ref|XP_009777663.1|1.8e-6750.00PREDICTED: uncharacterized protein LOC104227178 isoform X9 [Nicotiana sylvestris... [more]
gi|698581831|ref|XP_009777654.1|1.8e-6750.00PREDICTED: uncharacterized protein LOC104227178 isoform X1 [Nicotiana sylvestris... [more]
gi|694444421|ref|XP_009348733.1|2.3e-6746.91PREDICTED: uncharacterized protein LOC103940355 [Pyrus x bretschneideri][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003712transcription cofactor activity
Vocabulary: Biological Process
TermDefinition
GO:0006355regulation of transcription, DNA-templated
Vocabulary: INTERPRO
TermDefinition
IPR003101Coactivator CBP, KIX domain
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0003712 transcription cofactor activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g06860.1Cp4.1LG04g06860.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003101Coactivator CBP, KIX domainGENE3DG3DSA:1.10.246.20coord: 246..314
score: 2.
IPR003101Coactivator CBP, KIX domainPFAMPF16987KIX_2coord: 245..317
score: 1.4E-35coord: 135..208
score: 1.4E-33coord: 8..78
score: 3.2
NoneNo IPR availablePANTHERPTHR33137FAMILY NOT NAMEDcoord: 213..317
score: 2.1