CmaCh06G017350 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh06G017350
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: CCR4-NOT transcription complex subunit 3-like isoform X1
Location: Cma_Chr06: 10601181 .. 10619922 (-)
RNA-Seq Expression: CmaCh06G017350
Synteny: CmaCh06G017350
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptide, exon, CDS, three_prime_UTR
ATGGGTGCGAGTCGAAAGCTTCAAGGGGAGATTGACCGAGTTCTTAAGAAGGTCCAAGAAGGGGTTGACGTCTTTGACAGCATTTGGAACAAGGTTCGTTTTCACCACACACCCTCTCGCTTCCGGGAATGGAGCTGGGCGTCTAATTTTGGAGCTCCGTCTTCTTACCTTTTCTTGTGTGCCTCACTTCTAGGTTTATGATACCGACAATTCCAATCAGAAGGAGAAGTTCGAGGCGGATTTGAAGAAGGAGATAAAGAAGCTTCAGAGGTACAGGGACCAAATAAAGACCTGGATTCAGTCCAGTGAGATTAAGGACAAGAAGGTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATTATGTCCTTCGTTCTCCAAGTTAGTGTGGAATCTGTGAGCTTTCAGTGGCACATTGACTCAAGAACGGCGATTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTAATACAGTCAGTTCTTAGTAAACCTGACCTGAACCTTTCACTCGCACCAACATTTTATAATTGAGAATCGAAGTTCGTTTGGGCAGGGAGTGTGGCTTTAGAGGAATAGTAGGATTTTTAGAGGGGTGGAGAGATTTTTGGAGGAGGTGGTTCCATATTGAATTTTTTGTTATTATCGATTTGGTTGTATTCTTTTGGATTGGACTCCTTTCTGGTAGGCTACAGACTCCTTTTGTAGGCTTATTATTTTCTTTTTTGTATGTCCTTGTACAATTGTCCATTTCTCTCCATTAAAGCTTGTTTTTTTTATCTTAGAAAAACAATAGTAGTAATGAGAATAGGAGTCTATTTTCGTTTTAGAGAGAAATCCAGAAGTTATCGCTTTGATGACTGACATAGAAAGAATCTGTGGAAGCAATAAATCTCGTGCTAACAAAGACAATATGGAAAAGAAATTATTCTAAAAAGGTGAAGTTTTTCTTTGGGAAATAGTGCATAAAGTCATTAACACGAGTGAAAATCTACAAAAAAGGAAAAAAAGAAAAAAAAAACGCCCTACATGACTATTTTCAAATGGGTCATTATGCAAGAAAAAGAAAAAGAATCACAAAGCCACACTCAGAATTTCTGGACAAAGCTTGGGAGTGTCAGTCTTGGAGCCAGCAGTATAGTGTGTACCTTTTGAACATAGCTTGTCTGAGATGGAACGAATGGAGGATCCTACCTTGGACTCTATCCATCATGTTGAGACTATCCAGAACACACGTCCTAATAGAAATTTTGCTTTCTCTTGGATACTTGGTCTTCCATATGGCTTTGGTCAAGCTTTTATTAATAGTTTTTTTGTCAGGGTTTTATTAATAGCAGTAGACCTAGAGTTTGGTTGGTTGGTTGAAGAAAGAGATCCTAGAATGTTGA
CCTTAGCTGTCCAGAAGTCATTGCATATTATCCTGACCCAACTTTTGGCCAGAATTATCTGAAATGTTAACTATGAATGTGCTCCTTTTCTGCTTTAGCTAATGATCCAACGCAAAATCTGGCCACCCGTCACCTTTTTTCCATAGAAAAGCATAAAGGACTAGTACTACGGCTAACAGTTTCAATTTGAGTGCGCTTTGGAACCCTGCAAGAGAGCTAGTCTTCAAGAGAAATAGTTTATGATCTGAATTATTCCTCACTTTCCTTGAGAGCCTCACGTTCTGAAATGTTAAGAACCAACCTTCCTATTTCTAAGATTTGTGAAAAGTTATACCCAATTTATCTATGTGTCTTATTCTAATACCTTGTCTTGCGTAATATTCTGATCTCTAATTAATAATATCTGTGTTTATATTCCATATGTAAAAAATTAGATTATACATGTTGCAGCAATTTATTTGATTGATTTTTGTGAGCCAAGAGAGATCTACAATATGGTTTATTATGAAATTTTCATTTTTAATATAATTCTTAATGATGGGGACTAATTATTTCTATTTTGATATTTGTTGACACAGGTCAGTGCCTCTTATGAGCAGGCTTTGTTGGATGCTCGTAAACTTATTGAGCGTGAAATGGAAAGATTTAAGATTTGTGAAAAGGAGACGAAAACCAAAGCCTTCTCAAAAGAAGGTTTGGGTCAACAGCCTAAAACTGTATGTCCTTAAACTCTGTTCAATTTCTCTTTGAGCTTATCAATTGACAGCTTGCTAAACATGTTTATAAAAAAATTTCCTCCATGCTTAGATATGTTCCAGGTGCTTGTCCCATTAATTGTTTGCAATTATCTTCTGGAGATTTGGTTTGAAAAAAATAGGATTTTTTTTTTTTGAAAATATGTTCCTGATTGGCTGGATCATTTTGGATTCATCAAGATCTCAGCTTCTCGTTGTTGTTTTTTTATCTTTAGTTTGTGTTGGCTATTTTCTTAATGATATTTGTATTATTTGGGTTTTCATTTGAGTTTTTTTAGGGGCAATCATCAGCTTTCCATGGTTTATATATATATATATATATATATACACACATGTATGTATGTATTTTGTATGTATATATTGCTTTTAATCTTTCTCAATCTATACTTATAGTCTTTGAATTTTTGTATTCCTTACCTTTTAAATCTGTGACACACGAGTAAAGGAGAGATACATGAATGAACTGAGAACACAATTGAACGAGAGTGATTTTGAGGATAGGATGCTTGAGAAAAGCTCAAAAGTATTGAAGTTATTACCTATACCAACAAGGTGCACTTTACTTTCCTTTTCGTAGCTCAATCATAAAACTTCAAGTTAAGCATGCTTTGTTTGGAGCAATCTTATGTTGGGTGACCTCCTGGAAATTGAAGCATGAGAGTGAGGACAAAACATGTTGAAAAGACTCGTATTGGTTTGTGGGGATAGTAGTCTTCACATTTCAAAGTGACAAGTAGGTAGCGTGATCATGTCATAGGGAATGCATGGCAATGTTGAGATAATGAGGTGTCGAATCCAAATTCCAAATCTTGGGCATGAGGCGTTACACAATCATTAATGAAAAGTTTTGTTTATTGTTCAAAACAAATGTCTATCTACTGTTTTTGCTATCCAAAAGATGGTTGTTCTTAGTCAGGTGATCACACTGTAAGGTTCCTGTCCATTATTTTTTCAAGAAAATTCATGGATACACTTGCGTACAACTCAAAACTGAAACAGTCATATGCATTTTCTGAACTATTTTGGGTTTTTGTTTAGGATCAAACCAGTAATTTCTTCTTTACTTGGTATAAAGAGCCAAAATTTCAACGGGAGTGGACGGGAAATGCAGAAATACATTTGCAAATAGAAATAGGCTAATATAAAATGTGTTCTATTTCTAAGGGGCCTTCCAAAGTCTATTGGTGACCCAGTTGCAATATTCTAGAATGAGGTTATGTCTCCCTAACAAAGGTTTCTTTTTAA
CGGGTGTTGTCATTATTTTCATTAAAGGATAAGGGGATGTGTTCATGATGCCAAATTTTCTATTTTCATTAATGAAAAGACCTAGGAAAAGAATCCAAGCTTCAAGGAGGTTTACGTTAAGGTGATTCACTCTTACATTTTATTTTCCTATTGGTTGGTGGTGTGCTGAGTAGCATGTTAGATCATATTTACCGAAAGGATCTTTTAAGGTTTTGTGGGCAATGATTGAGTTCATATTTCTCACCTGCGATTTGCTGATACCTGATGATACTTAGTTGTTTTTTAAATATTATGATGTTTGTTGAATTCTTGATTGATACAATTGGAGCATTTGAATGGCTTTCAAACTGGAAAGTTAATTGGGACAAATCAATGGTTTGTGGTATTAGTATCGATCTTTCTACACTTGATTATATGGCAGCAAGACTTAATGTTAAAGTGGAGTCATTACCAGTTTCTTATTCGAGAATGCCGGGGGGAGGTAATCTGAGATTTTCACAATTCTAGTCTCCTATTGTTGAAAAGGTTTCAAAGAAATTGGTTAAATGGAAAAGATTCCAGCTTTTTTGTGGTGGAAGTTTGACATTATGCAACTCGGTTTCTTCCAACATTCCTATATGTTATTTTTAATTTTTCCAATTTCCTTCCAATGTATGTTCTCAATTAGAAAAGGCTGAGTAAATTCTTTGGGAAAGCAATGCTGATGACAAGTTTAATCGTTTAACTCACTGGAATATAGTTTCTCAATTGATGGATGATGGCGGACTTGACACTGGAGGTCTGAGTTAGAAAAATATGGCAATTCTAGCAAAATGGGAGCAGAGAAGATTCTGTCATGAACATTCTGCTTTATGTTGAAAGGTGGCGGCTATTATTTATGGGACTGATTTCTTTGATTGGCATTGAATCTGCAAGATTAGTGGTTGACTTAGATGCCCTTGGAATAATATTTATAAGCAATTGAGATTGGTGGAAAATTTCTCCCTTTTCAAGGCTGGCAATGGGTAAAGATTTTTCTTTCAGCATGATGTTTGGCTTGGGTATTCTTTAAAATTATCTTATACAAATTTGTTGAAGATGGTTTCTTACCTTCTTAGTTTGGTTAATGGCAATTGGGGTTCTGCTCTTTTTTCGTGGAAGCTGGAAACTAGATGAAGTTTGAAGGAAATTGAATTGTGAATATTGATTACTGCTGTCTTCTTTGAGTTCAGTGAGATTACAGGCTAAAGAAGATCAAATTTGTTGGAAAATCGACCCCTCTGGATTGTTTCCTGTAAGTTCTTTGACAAAGCATCAAATATCTCATCCTCCCTTGCCTAAGGATTTGTTCTCAGCAATATGCAAATCTAAATGTCAAAGGAAAGTTAGTGTCTTGGCATGGATCATGCTCAATGGCCATCTCAATACGATTGGTTTGCTCCAAAAAGAGCTTCCATCTTCAGCCTTGCGGGCCTCGCGGTGTGTTTTATGTTTTGCAGATAATGAAAGGCAGGATCACGTCTTCCTTCATTGTCGAAATACAATGGAGCGTTGGTTATTTATTTATTTAATATTTTCAAATCTTCATTGGGTTTTCTCAAAGGATTAAAGGAGCAACTTCAGTTTCTCTATGGGCCAATCTTATCATCTCAAGGTCCCTTCTACCGATTGATGCCATAAAGTCTATTTTTTAAAAAGTTATGGTTATCTACATATTTTTTAGGATAAGAGTAAGCCTTGGTTAGAGCATTATGATATTGCTAGTTGAAGGCCTCTCATTGGTGCTCACTTTCTAATTTGTAATGTTTGTATTAATTTGAGTGCTTTTGTTCAATCTTCTTAGATTTCTTTGTTTTCTTTTTGTTTTTTTCTTCTCTTTCTAGGAGTTCGTATCATGAATATGTTTTCTTCATAGAATAAATGAAAAATTGTTTCTTGCTAAAAAAAAAAAAAAACCCATCAGTGTTGTAATTATTATATTTTAGCTATTCATTTTTTAGATTTGGTCTGTTAGCTTCTTG
GTTTTGGGTTTAGTTGTTTCCTCTTTTCAGCTTCTATCCAAGTGTCATTTTCCGAATTATTGGCAGTTTTTTATGTTATGGGCTATATATTTGTTATTTTATCTATAACCTTATTGGTTTTGGACTTCCTATCTTTGTCAGTAATGGAAATTGGTCCTCTTGTTTTTCAGCTAACTTTCTTTTAACGGGCTCGGAAAAAATTCTAGTTGTTTTATCAGTTCATTTACAAATAACTTGTAAATAGTAGCTCTTCCTACACGTGAAAGCTATTTATTATTTTTATTGTTAGTTCTAACGAAGCTTCTCTTTGGTTTTCAAATATAGGACCCAAAGGAGAAAGCTAAATCAGAGACACGAGATTGGTTGAACAATTTGGTATGAAAGCCCCCTTTGTACAGAACCTCCGTAGACTCTACTTCTGTGGAGAACAATGCATTTGATTCAAGAATCAGCATGCTGTTCTTTGTTGCAATGGTTGAGTTTCCAGATTTCTGGAAAAAAATATTTTTTTTGATAAGAAACCAAAACTTATGTAATTATGGCTTGATGCAAATTTAGCCTACTAGGAAGTGTTTTTTTTTATCCCTTTGTGTGGTGGGGAAATTTTGTCTCCCTTCCTTTGTACTCCCATTGATAATGAAATGTTTGTTTTGTATTTAAAAAAAAAAATGTTCTCGAAAAAGGTACATTAAACCCACACATCTAATCAAAATCTTAGCCCTAATCTGAGTTCAACTCACTTTGTTTTGGCATCTTCCTTAATATATCTGTTTTACGTGGCTAGTCTTTTGACCACTATAGTCTCTCTTTCTTCAGTTCATTGACATTCCAACTTATCAGCTTCAATTCATTGGACCTTTCCCACCCTAGAAGAGGTGTATGTGGAAGAAATAAGATAGTTTGTTACGGGGGAGAATGTTTTGTATGAATGTGGGAGTTTCCTACCAGTTATTTGGGTAAATTTAGTATACATGGTAGGAAGTTAGAAAGCTTAGGGCAGTTATTTCTTTTATGTTTTTGTGGATAGACCAGACTCTTGAAAGCTTCGAACTTTATGCAAAACTTTTCTTTTGTTGTTTATCCTCGGTAGGAGAAAGAAAAACTCTGTCCAATTGCTTAACAGGCACTCAATGTTTTTATCCTGGTCAGGTTAGTGAGTTGGAATCTCAGATTGATAATTTTGAAGCTGAGATGGAGGGTCTATCTGTGAAGAAGGGAAAATCAAGGCCACCTAGATTGGTAAGTGGTATTATGCTCTGCATTTTCTAAACTGATATGTTGTTCTTTCCTAGAACTTTGGGTTTATAATTATCTGTCCATTTTGTTGTTATAGATTCATCTGGAAACTTCTATTACTCGGCACAAGGCTCATATAATGAAGCTGGAACTAATCTTGAGACTGCTTGATAATGATGAATTGAGTCATGAGGATGTCAACGATGTCAGGGAGTTTTTAGAAGACTATGTGGAAAGGAATCAGGTTATTACTCCTTCCATCTCTTTTTGCAATCGAATTGTGTTGTATGCCTGCATTGTACTCTCTCTCCTCTTGATAAGAAAGCAACAATAATTAAGATGATGAAGGGGTAAAGGGGGCTTGTGTAAAAGAACAACCCAGACAATGAAGCAAACAAAAATGAAAGCCAATTACAAGAGATTTTGCTCTTATTAATGATGATTAAAAATAAAGCATCATTACAAAACTCTAAAGATGAGGAACCACTTGGGAGGCAATGGAATGAGCTGCATTCGAGCCTCTAGACTTCTTGAGCCAAATATTCTGTAGGATAATACTAAATCCTTTCTTGTTGGTTTGCAATTGGGGTGCTATTATTGACTTTTTGTTGTTATTACTATTATTATTATTTTAAGTTCAAACCCACATGAGTCCGTTTCATGGACATTAAGATATGTGCAAAAAAGTAAAAAAGTAAGAGACTTTTTGGGGGATGTATTTAAGCCCTATGGTGGTTACCTACTTGGAAGATAACTTTTTAGAT
GTTTTTGTTCCTCATCCTAGTCATATTTCTGCTGATTTTACTAATCATTTTCGAGCTCTTACTAGTCCATTGCAATGTACTTGTCCTTCGATGCCAAGTGTAGAAATTTATTAATTAGAGGGAGAAATATTGCTGAATAGTTCTACCAATCAAGCTCTTGAGTTGAAGTTTTATACTAGAAGAAAATCCAATCAAAGGAACCAAAACCAAATAGTTTATCCTTCTCCGAGCCTATTTGAGACTAAATAATGAAATTGAAAAGTCTGGTAACCCATCTTTTGATCCTACTCCTCAACTATTCAAAATATTGAACCAATCATGTCTGATGTTGATGATCCCATAACCATTAGAAAAGCGTATCGAAAGATGTACAAAATATCCCATCACAAACTACCATTTATTTCAAAATTTTTCAAATAGTCCTAAAGCTTTATATGATGTCCCACATTGGTTGGGGAGGAGAACAAATCACCATTTATAAGGGTGTGGAAACCTTTCCCTAGCAGACGCGTTTTAAAGCCTTAAGAGGAAGCCCGAAAGGGAAAGCCCAAAGAGGACAATATCTGCTGGCGGTGGATCTAGGTCGTTACACTTTCACATCTAGGATAAAGAATTTATTTGTTTCAAGGAATATGTATGAAGCTCTAAATGATTCACATTGAAAACTAGCCGATATGGAGGATATGATGAATGCTCTAAAACAAAATAATTCTTTAAGAATAGCAGAGTTGCTTGAAGATAAGAAACTTGCGGGATTTACAAGTGTTCACCCTAAATTTTAAAGCTGGTGTTAGTATTGAGAGATAAAAGATGAGACTGCTATCTAAAAGAGGTTTTTATGAGCTTATTAATAGGTTATGATACGGGTGTTCCAAAACCGAGCCTAACCAACAATATATTTTCTTGTTTCCTACCAGAAACAAATGACTATGCCAGTAGTGTTAATCATATCAGTATTATAATTTATTTGGCTTATTAAAAAAAAAAAGGAAGAGAAAAAAAATACTTGCAATTGGTCTTCTGTTATTATCAGTTTGCCTTTGTTTTTTTCAATTGGAGTCTCTCTCTATATATCTTGGACTATCTTTTTTGTATGCTCTTGTATAATCTTTTATATTTCTCAATGGATGCTCGATTGTTTATAATGTTTTTTTATATTCGGCTTGGACATCTAGGATTTTGAATTTTATTACTTCAATAATAAATGCAAATCTTATAGTCTTATGTTGACTACCTATCTAAGATGTTAATAATAATTTTTTTTTTAAATAACCAAATGTTGTAAGGTTAGGTTGTTGCCTTATTAAACTGATTGAAATTTACTTGTGGTGTGTGTAAGCTGGTCTTAACATTTACATATATTAAAAAAAATCTTAATTATAGAAGGGATTATAAATGATACCCATGTAGAAGTATCATGTAATAATTTAGAACTATTCATAAATATAGATTGATATATTTGCAAATACATTTTTATATGTTTTTATCTGGACTGATATTGTTCATTTGACTATCAGCACCATTGAAACTGGATACTTTTATGCCTTCATTCCAATCGATTGAGTTCATATGATTATATTTCAGTTCATGAAAATATCAAACTCACATTAGCCATTGACATCTCAAAAACTGTCTATATAGGAGGATTTTGATGAATTCAGTGATGTGGATGATCTTTACAGCTCATTGCCACTCGATAAGGTGGAATCCCTTGAAGATCTGGGTACAATTTGCCCTCCTAGCCTTGTGAAGGTATGGATTTTTCCTCCCTCTGTTCCATATCAAGTTGACCAAACATAATAATATATGTTTTGCTGTTAACTTGAATGGTAAGTTTATTTATTTATTCTTATTTTGTTTTATTTTATTTGATTTTAAATTGGGGGGTATGGAATTGCAGAGCCTTCATGTCACCTATGAAGACACCTTTAACCCATGGATAGTCTTTCCCTTATATATGTTAGTTTGAATATTTGATGAGAGGTTGTGTATTACGGCTT
GGTCTTTATCCCTTATATATGCTACTTCGTAAAAAATCCCTTGTGGGTCTCAGTGGTCTTCATCCTCCCAATTTTTCTAGTCTTTGTTTTATCTCAGTTTGATTGTTTCGTCTTCTTTGATGAGTTGTATCAACGGATGGGTCTTTTGTGGAAGGAATGAACAACAAACATAAGATTTCTCTTTCTTGCACTCTCGTACATTGGTGTGAATGTTCCTTGGTTGAACTATTGCAATGTCCAGTAAACTTGTTTTTTTTTTTTTTACAAGAAATTTAGGGAGGGTTTTGGGGCCATTTGATTGTCTAAGTTTCGATCATCCCTTGGTTGGCACTTTGAATATTCCAAGTTTCTTGGATTTATATCATTTACACCAAGGCATGGAAGATTTAGGAGTTTTTAAAATGATTGATGTAGAAATTTATCACACACCAGCCTTATTACCAGAGACGGGAGACAGTATCCTTTGACATAGGGTTCGGTGCTTTTAATGGGACTAGTCTGCACGGTTATCAGTAGTCTGCTATAATAGAGCAAGTTCTTAGGGAAATGGTTGTCTTCAATAATAAAACTCCTTGTGAACTGTCTGCCCTAAATTAACGAGCCTATTATTGATGCTTCTGTTGCTCATTGTTGTTCAATCCTCCCCAAGCAGCCCAATCGAGATTACAGAGTCCTCTCTTAAGGCTCCCATTAAATCTCAGTGTTAAAAGAAGTTAAAAATTGTTACATTGAAACTTATCCCTTATCCTATACTTGAAGAAGAGAGCCCCTCATTTCTTCTCATTGACTCATCAAGGTTAATACTATGGATCCTGATTTGATTGAAGAAAATTGTTCACGAGTTTTGGACAAAGGCCAATATGGATCCTGATTTGATTGAAGAAAATTGTTCACGAGTTTTGGACAAAGGCCAATAAGCGTCTATATTCACATATACTATCACATAGGCCACTTCTATTGGAGATGCAAGGTTCTTGTATGATCCAAAGAAGGCATATAGAAAAGAATTTGATCTGAAGACAAAAGCATGAGGAGCCTATTAAATTATTTTGTTTTTGATAAAAATAAGTTGAAAATTTAGTATCTAAATTGATATGTTGTCTTTTGGCAGGGTATAACAGCTCTCAGCTTGAAGACTACTTTGGCAACAACGGGAACTCAAGTGCCTGTACGTCCTTCTATTAATATTTATATAGCAGAGACATTTACTTTTTTTTTTTTCAACTGATATTATAAAGGATTTAGATGAACCTCCAGACTTCATTTCAACGAACTTTTTCTACTTTTAGATTTCTTCATGATGAACATTCTGCTTTATTTGATGCATGATCCTTGAAACAATTTTAGGGCATCTGGACAAGGACGGTATTATGAAATTCTATTATTATCCTGTAATTTGTATTCTTGTTTTTTTTGGCAGTTCAATGACATGCTTTTTTCCTTTGTATGATCAGAAAATATGAGCGTGGTTAACTATAGAATTCTGTAAGGCATGATGGGATTAGTTTTTTGTAACGTCCATTGTATTCCTTTTTATTTGTTGTCATGTCAATGAGGTCTTACTTTTTGATATTGAATTGGGTTCTCAATTCAAATTATTGCTAATATAGTGCGAAGCTAATTAATGTTCTTATTCAATTTGCTTTAGTTTTAGGAGACGTGTTATTGATAAAGTTCTACTTTGGTCGAAGTTTTGGAAAATTTTCATAGATCTCTATCTTGAATTTAGATCTTTGATTGTCTTGTAGAGAGAACCTTTTAGGATTTGCATGTAAATTAACGAGATACAAGCTGTTTGAGTTGGAATCAAATGCAATAACTGCCTTTTCTACTGTACCTTGGGATAAAAAAAGAATACTAACACATTTTAGTCCCTTGTTTGTCCTAATTGTCACTATGTAGATGCTACAGTACGTTAGTAATGTATGCTAATTTTCCAGGTTACTGTTGCTCCTAATCATCAACCAAATACTGTCACTCAGGATCAGGTTGATGATTCA
GCTTTGCTAGATGGTAACACTGATACTCTTTTGAAAACCCCACCTCCTAAGAATAGTGTCCTTGGTTCTTCTGCTGCTACAACACCTACCGGGAACCATGCAGCCTCAACTGCCTTGAATGGTGCAGTGCATGGGTCTGGCTTGTCCGCTACATCAGCCATTCTTCCAGGTTCAAGTTCTGTTCGTGCTGTGGAGGCTACGGGTGCTTCTAATTCATCTCCGGTAAATATGCCCACTTCTGCAAAGGATGAAGAAATTGCTAGCTTCCCAGGCCGTAAATTGTCTCCATCATTTGCGGATACTGGACTTGTAAGGGGTGGCATGGGAAGAGGTGTCACTGCTAATCAACCAGCCTCTAGTTCCACCCATACTTCTGGTATTGTGGTTCCTAGCACTATAACTCTTGGTAACGTTTCTTCTGCCTCTGAAGTCACAAAGAGAAACATTTTGGGATCTGAAGAACGGGCTGGTAACAGTGGCTTGGTGCAGTCTATGGGTTCTCCTTTAAGTAATAGAATGGTTTTGCCTACAGCAGCTAAACCTAGTGATGGAACGAGCACAGTTGATCCTAGCAATGTTAGTGATGCAGCGGCTATAGGAAGTCGAGTTTTCTCTCCAGTGGTTCCTAGCATGCAGTGGAGGCCAGGAAGTTCTTTTCAAAATCCGAATGAAGGAGTATGATACTTGCTACTAGTTCATTCAAAACTGAGCTAGTGGGTTCTTATTAATGAGGATTATTTAATTACTGATGATTAATGGAAAATGTTAAGATATATTCAAATCATTTATTGGGTTGATAATGAATTGATCTATTGAGGATGAAGTATTACTTAGTTATATATTTTAGTAAAAATTTAATAAAAATAATGTAGCTGCTGTTCTTACTTCTTTCCTTGAGATCAAAGGTTCAAATCCTCACTCCCATACTCGTAATAATAAAATAGGAAAAGAAAAGAAAAGAAAAGATAAGATAAGGTACTTTGAGCTTAGGTGATTGTTTCAAGTTTGTTCTGAACCTTGGTTGTTTCCTTTATAAACTATTCGTATGAATTTAGCGGTAAATGGTTTAATTCCTGATGACTTAATCATATCTTATATCCCACTTTCTTCTATGCGAAATACTAGCTCGAGCATCTTCATGATTACTGGCCTATTCCCAAGAAAATCTATAAAATGTTTCCTGATGTGGTTAATTCCAGATGATTTAATCATTTCTTATATCCCATTTTCTTCGAGGCAAAATACTTGCATTAGAATAATCTGCTCAAATACCTTCAGTTTTAATCTGTTTTATTACTGCCCTTTTCCCAAGAAAATCTATAGCTTTTCTTTCTTATGTAGTTAACCATTTTTTCTTCAGGGGCAGTTCCGTGGAAGAGCTGAAATAGCGCCTGATCAGAGAGAGAAGTTCTTGCAGCGTCTCCAGCAAGTTCAGCAACAGGGTCATAGTACACTTCTTAGCATGAATCTTGGTGGAGGGAACAACAAACAATTTTCTTCGCAACAGCAAAGTTCACTTCTACAGCAGGTTTTCTTATTCATAATTTTTAGCCTTCTTTATTCGTAATTTATAGCCTTTTTCTTCACTACAGCTACTTTGTGCCTTTTATTCCTTTTATTTGGGTGGGGGTCATGATATGTTGAATAAGAATAGAGTGCAAACTTCATACATTCTATTGAATATCGACTCGCTCTTTTCACGAGTATAAAAATTCTAAGAACACGAATTTTTCCTTCATAATATCACAATGATTTACCAAACTATCTACCTCACCCTCAAGTGAATTCTTGATTTGAATATACAAAATAGCATCATCACAAACCCAAACATTCTTTGTGTTATCCAATATAATCATCAATCTTAGTGCCTTGTTGATAAAATCGAATCGTGTTACTCCATTTAATAATCATAATAATTATAATAATTGGAGCCATTTAACTTATGCTTCATGATCTTTGACGAAGGGGGAACTATCTTAAATGCTATCACAAGTTTCATT
TCAACCGTTAGTTACACACAAACAATCAGTAGACAAACCTTAAGAGAAACAAAAAAACAGGCCTGAACTGCAAGAATGTGAAGTAGGTATTATTGAACCCAACCACACAGAACGACCGCAAAACTAAACATGAGACAAACCAGGGCGACAAAAGGTGGAACATAGCTCTCATGCGCCGGTGCGTGGGATAGAGGGCAGCGGCGCATGGAGAACACATGGTTGCTTTGGCTGTTGAATTCTGGGGTTTGATAAATCAATAGTGGCAGACACTGGCAGCGGCAGAGGTAGGGGTTGACTACAAGTAAATTCACAAAATCAAAGATGAAAACAATACATCTAACTTCCAAACTCTAAAGGCTCCCAAAAACCTAATGGTTAACAAACCCTTAATTGCACTTTCAAAATTTTATACATTTATTGTCTTGTCCTATAATAGTCTATTTAGCATATTTAACCAGTGTCCGACATATGTCTAACAAATATTGGACCTATATTGACTTTGTACAACTAGTACTTTTTAAAATATATCTATTGTGTTAATAAGTGTTTGTTATGTGTCCAACAAGTGTTAGAGTGTCCGAGTGTTCTACACGTGTCAGAAACAAACATGCCAACCAAACTTAAGTGTCCATACTTTCTAAATTATGCCCAAGGTAGACAACATTATACCAGTGTGGAGATGGGTGGAAGTTCCATTGTCCTCTACATTTGCCAACTTGAGCTTAGCTCAAATAGTTAAGACATTTATCCTCTACCGGGTAAAGATTCAAATCCAGTGTGCCATTGAAAAATTGAAGTTGGCCTGTGCATTACACCATAATACAAAGGGGACCTTTTTTTATGTACAGGTTTTTTTGTGAATATTTCTGCTTGGTGATCACTATGTTCAGTCTATCCATAGTTTGGCATTGTTAAATAGTATCAATGATTTGGTTTTCTGTATTTGCAGTTCAACTCCCAAAATTCATCTGTTAGTTCTCAAGCTGGTCTGGGAATTGGAGTTCAAGCACCAGGAGTTAATGTTGTTACATCTGGTTCATTACAGCAGCAGCCAAGTTCCTTCCAGCAGTCTAATCAGCAGGCATTAATGACAAGTGGGGCGAAAGAATCTGGTACGGTCTGGCTTCTTGTCTTCATATACCAATGTGTTAATGTTTCTCTATCCCTATGGGTTGTCCGTCATATTCAATTTCTTCATGGGTGAGCAGTGGGCACAATTTCATTTATGGTGTCCCAATGTAATTATCTTGGAATAGGACATCACATATAAATGCATATTTTTATTTGACCATCCTACACAGATTAATTACATGAAGGGCAAAAACATGGAGCCTTGAGAAATCTAGAATTTCTCTTGCAAACATATGTCAATGCTCACTAAATCAATAGACACTTTAGCTCATATTCATATTCACTCATTCTTATATTAAACTAGATGACATAAGGTTGTGGATGAATAGGAATAAAAGAAATTTTCAGGAAACTGAAGTACTTAATGATACTTTTCAGGACATGGTAGTTCCTTGAGCTTTAAAATTGAAACTCTTCTTACAATTCATCTTAAATGCATGAAAAGTTGGAATGTTAATTCTGCTAACTTGGATGCATTGTAATCTCGTAAGTTCTTTTCGGAACTTTATGCCCTTCTTTTTCTATTCTTTATTACATTGGATACGCAAACTCATAAACCTAAATATCCTCTTAAATTCTTCTGTTTGTTGCCATTTGGAACCTTAAAAATTTTCCTAGTCCAATAACTTCAATTTTCCTAATGGTCCTTGTTCATGCTAAAATTCCTTTTGAAATCAAATTAAGAATTCAGATTGTCGCTAAGAAGCACTCTTGCAAAAATATTTATCGAGTGCATAGAAAATTTCTTCTGAAGGTGTTTTTTTGGTAAGAAATGCATGGGTCTCCTTTACGATTCAACAAATGAGATACTGGTAACTAAAATGTTTACGTTCTCTCATAGAATTACATAAACAAGACTGACAATTAAT
TCTTGTGTAATGCAGATGTTGCCCCTGTAAAAGTTGAAGAGCAGCAGCAGCCACAGCAGCAACAGAGTTTACCTGAAGATACTACTACTGATTCTGCTGCTGGTTCTGTCCTTGGAAAGAATCTGATGAGCGACGATGATTTAAAAGGCGCATATCCAGTTGATACTCCAGTATGTGTTATTTCTTGTTTATTGTAATCATAGTTGCTTCAGTACACTAATCTTTCCTTGTCAAACAGGTTGTTGAATGTGTTATTGCGCAAACAACATGAGCCTTGTTTGTTAAAGAGCCTACATCTTAGTCCTGACGTTGCTTAGAGCTTCTAGTTGCAAATGAAAATAGTGAAATAGCTTTGTTGGACATTTAATTAACTTGCGCTGAATAAAGTGTTCTAAAACATTTCTCATTGGATGGGAAAGGATGCCGGGAGTACTACTGAGTTATTTAAGAAAAGGAAGAGAATAATTTTTTTAAAAAAAAATTTCTTCGATGATACAATGATATGCCCCCCACAAATTCCATTTTTGTTTAAGAAACATGGCAGTTTATTAGATTATTGGAATATATCAAGGGAAGAAAGGGAACCCCCACAACTATAGGAAGTCAGTATTTTCAAAGCTTGTGGCACATTCTAAGGTGACAAATCTCTTAGTCGCCTTGGGATGAAAACAACAGAAAGGGTGTCAGTGTCACAAGCACACGAGAAGAGTCTTTTTATTTAATTATTTGTTTTAGAAAAACTGTACTTTATTGCTTGATAACCAAAACTGTACTTTATTATTTGTTGTAGGTTGGCGTACCTGTTTCATTGACTGAGACTGCTTCAGTGTCGAGAGAGGATGACCTTTCTCCAGGTCAACCTTTACAGCATGGTCAACCTTCTAAAAGTCTTGGTGTCATTGGCCGAAGAAGTGTTTCTGACTTGGGTGCCATTGGTGATAACCTCGGTGGATCCTCGTTGACTACTAGCGGAATGCATGATCAGTTCCATAATTTGCAAATGCTTGAGGCTGCATTCTACAAGCTACCTCAGCCAAAAGACTCAGAGCGTCCGAGGAGCTACACTCCAGTTCAGATACAAAATTTCTTTGTCCTATGACTGTATTCTATATATATATACTCTTTTTTTTAATTATTTTTTTTGTGTGAAATTATTTCATCTACAGAGGCACCCTGCAGTTACTCCTCCGAGCTATCCTCAAGTGCAGGCACCTATTATAAACAATCCTGCTTTATGGGACCGATTAGGTCTTGAGACCTATGGCACTGACACATTGTTCTTTGCATTTTACTATCAACCGGTATTACTTTTATCTTTTTATGTAATGCTGTATTTCTTTGACATCCAGTCTAATCATGTGCTGAACTGCAGAACACCTATCAACAATATTTGGCTGCTAGAGAATTAAAGAAGCAATCTTGGAGATATCACAGAAAATATCAGACATGGTTCCAAAGACATGAAGAGCCAAAAATTGCTACAGATGAATATGAGCAGGGAACTTATGTGTACTTCGATTTCCATTTTAATAATGATGACCTACAACATGGATGGTATGTTTACTTGTTAATCTTGTTTGCTGCCTCATCTCTTGTTGCCTTTTGCGCATGTAAGGTGTGCTTCTTTGTTTTCTAACATTAACTGGTTAGCGCTACTCTTATGGAGATGGTAATGAGCTGATTTGCTGCTATAAAAATATGATTGTGAAATTTAGAACATTGAGAAGCTTGTGATGCGACTTTGTTCATTTCCGTTGATTTAGAAAAGAAATCACAAGCTCAATCACTTGCTCTAATAGGTAAAGGACTCCAAATTCTCGATTAAAAAAATAGTAATGCAGTTCTATTTTTGGTGAAAAGTTAATTGACTGCATAGAGAGGTGTTTTATTGTGCAGTTCTAAGCCTTGCAAAACTGGATCTTGGATGATCTGAAGAAATAATTTGTTGTTGAGGATCTTGGAAAATCTTTTAGATCGCAAAACACAATATATATTAAACA
ACCTTAACATACTCTATACATCTACACACCTCAAATTATGTACTACTATGTTAACTATATTTGAATTGTTTTTGTACTGTAGGTGCCAAAGGATTAAAACAGAGTTCACTTTTGAGTATAACTACCTTGAAGATGAACTCAACATATAGAGGATGTATGGCAGTGATCAGGTGATTCTAAGCTTTCCCAGGTTTCAAAATTTCCGCCATCTTTTTGAAGTTCTAACTCTTTTCCCGTTTTACAGAAACTAGAGATAAAAATGTATCCGGAGCTTGACAACTTGTTGTATTAAATATACTGGGCAATGGGTGAAAAGAACCGAAGTCTGCACATTATTATAATCTTTGGTGAAGTCTCATTTCTTGTTGAGATGCTTGAGTTCTTTTCACTGATTTGAGATAAACTTTTGATAGCATGAAGGGGTCTGTTCTATCATCTTTATTTTGGGTTTACATTTGTCATTTTCAGGTTCCATATCAGGCAAGGTTGCAGTGAGTTAAAGGTTGATTTAGTCTTTTTACTTCCCATTCCCTTTTTAATTGTAAAGGAGACATGATGCATGTTGAGTTATAAAGATAAGGCGTTCCCTTGTATTATGTATTTTTCTCATATCTGACCTTTACAAAGTCCAGGAGAAAATGAGTTAAAATCTCCACCTTGGCAACTTATCTGAATCATTTTGAATGAATTTGTTTTTGAAGATCCATACTTCCAGTCATGAATGTTCACGAGTAAAAGCAGC

mRNA sequence

ATGGGTGCGAGTCGAAAGCTTCAAGGGGAGATTGACCGAGTTCTTAAGAAGGTCCAAGAAGGGGTTGACGTCTTTGACAGCATTTGGAACAAGGTTTATGATACCGACAATTCCAATCAGAAGGAGAAGTTCGAGGCGGATTTGAAGAAGGAGATAAAGAAGCTTCAGAGGTACAGGGACCAAATAAAGACCTGGATTCAGTCCAGTGAGATTAAGGACAAGAAGGTCAGTGCCTCTTATGAGCAGGCTTTGTTGGATGCTCGTAAACTTATTGAGCGTGAAATGGAAAGATTTAAGATTTGTGAAAAGGAGACGAAAACCAAAGCCTTCTCAAAAGAAGGTTTGGGTCAACAGCCTAAAACTGACCCAAAGGAGAAAGCTAAATCAGAGACACGAGATTGGTTGAACAATTTGGTTAGTGAGTTGGAATCTCAGATTGATAATTTTGAAGCTGAGATGGAGGGTCTATCTGTGAAGAAGGGAAAATCAAGGCCACCTAGATTGATTCATCTGGAAACTTCTATTACTCGGCACAAGGCTCATATAATGAAGCTGGAACTAATCTTGAGACTGCTTGATAATGATGAATTGAGTCATGAGGATGTCAACGATGTCAGGGAGTTTTTAGAAGACTATGTGGAAAGGAATCAGGAGGATTTTGATGAATTCAGTGATGTGGATGATCTTTACAGCTCATTGCCACTCGATAAGGTGGAATCCCTTGAAGATCTGGGTACAATTTGCCCTCCTAGCCTTGTGAAGGGTATAACAGCTCTCAGCTTGAAGACTACTTTGGCAACAACGGGAACTCAAGTGCCTGTTACTGTTGCTCCTAATCATCAACCAAATACTGTCACTCAGGATCAGGTTGATGATTCAGCTTTGCTAGATGGTAACACTGATACTCTTTTGAAAACCCCACCTCCTAAGAATAGTGTCCTTGGTTCTTCTGCTGCTACAACACCTACCGGGAACCATGCAGCCTCAACTGCCTTGAATGGTGCAGTGCATGGGTCTGGCTTGTCCGCTACATCAGCCATTCTTCCAGGTTCAAGTTCTGTTCGTGCTGTGGAGGCTACGGGTGCTTCTAATTCATCTCCGGTAAATATGCCCACTTCTGCAAAGGATGAAGAAATTGCTAGCTTCCCAGGCCGTAAATTGTCTCCATCATTTGCGGATACTGGACTTGTAAGGGGTGGCATGGGAAGAGGTGTCACTGCTAATCAACCAGCCTCTAGTTCCACCCATACTTCTGGTATTGTGGTTCCTAGCACTATAACTCTTGGTAACGTTTCTTCTGCCTCTGAAGTCACAAAGAGAAACATTTTGGGATCTGAAGAACGGGCTGGTAACAGTGGCTTGGTGCAGTCTATGGGTTCTCCTTTAAGTAATAGAATGGTTTTGCCTACAGCAGCTAAACCTAGTGATGGAACGAGCACAGTTGATCCTAGCAATGTTAGTGATGCAGCGGCTATAGGAAGTCGAGTTTTCTCTCCAGTGGTTCCTAGCATGCAGTGGAGGCCAGGAAGTTCTTTTCAAAATCCGAATGAAGGAGGGCAGTTCCGTGGAAGAGCTGAAATAGCGCCTGATCAGAGAGAGAAGTTCTTGCAGCGTCTCCAGCAAGTTCAGCAACAGGGTCATAGTACACTTCTTAGCATGAATCTTGGTGGAGGGAACAACAAACAATTTTCTTCGCAACAGCAAAGTTCACTTCTACAGCAGTTCAACTCCCAAAATTCATCTGTTAGTTCTCAAGCTGGTCTGGGAATTGGAGTTCAAGCACCAGGAGTTAATGTTGTTACATCTGGTTCATTACAGCAGCAGCCAAGTTCCTTCCAGCAGTCTAATCAGCAGGCATTAATGACAAGTGGGGCGAAAGAATCTGATGTTGCCCCTGTAAAAGTTGAAGAGCAGCAGCAGCCACAGCAGCAACAGAGTTTACCTGAAGATACTACTACTGATTCTGCTGCTGGTTCTGTCCTTGGAAAGAATCTGATGAG
CGACGATGATTTAAAAGGCGCATATCCAGTTGATACTCCAGTTGGCGTACCTGTTTCATTGACTGAGACTGCTTCAGTGTCGAGAGAGGATGACCTTTCTCCAGGTCAACCTTTACAGCATGGTCAACCTTCTAAAAGTCTTGGTGTCATTGGCCGAAGAAGTGTTTCTGACTTGGGTGCCATTGGTGATAACCTCGGTGGATCCTCGTTGACTACTAGCGGAATGCATGATCAGTTCCATAATTTGCAAATGCTTGAGGCTGCATTCTACAAGCTACCTCAGCCAAAAGACTCAGAGCGTCCGAGGAGCTACACTCCAAGGCACCCTGCAGTTACTCCTCCGAGCTATCCTCAAGTGCAGGCACCTATTATAAACAATCCTGCTTTATGGGACCGATTAGGTCTTGAGACCTATGGCACTGACACATTGTTCTTTGCATTTTACTATCAACCGAACACCTATCAACAATATTTGGCTGCTAGAGAATTAAAGAAGCAATCTTGGAGATATCACAGAAAATATCAGACATGGTTCCAAAGACATGAAGAGCCAAAAATTGCTACAGATGAATATGAGCAGGGAACTTATGTGTACTTCGATTTCCATTTTAATAATGATGACCTACAACATGGATGGTGCCAAAGGATTAAAACAGAGTTCACTTTTGAGTATAACTACCTTGAAGATGAACTCAACATATAGAGGATGTATGGCAGTGATCAGAAACTAGAGATAAAAATGTATCCGGAGCTTGACAACTTGTTGTATTAAATATACTGGGCAATGGGTGAAAAGAACCGAAGTCTGCACATTATTATAATCTTTGGTTCCATATCAGGCAAGGTTGCAGTGAGTTAAAGGTTGATTTAGTCTTTTTACTTCCCATTCCCTTTTTAATTGTAAAGGAGACATGATGCATGTTGAGTTATAAAGATAAGGCGTTCCCTTGTATTATGTATTTTTCTCATATCTGACCTTTACAAAGTCCAGGAGAAAATGAGTTAAAATCTCCACCTTGGCAACTTATCTGAATCATTTTGAATGAATTTGTTTTTGAAGATCCATACTTCCAGTCATGAATGTTCACGAGTAAAAGCAGC

Coding sequence (CDS)

ATGGGTGCGAGTCGAAAGCTTCAAGGGGAGATTGACCGAGTTCTTAAGAAGGTCCAAGAAGGGGTTGACGTCTTTGACAGCATTTGGAACAAGGTTTATGATACCGACAATTCCAATCAGAAGGAGAAGTTCGAGGCGGATTTGAAGAAGGAGATAAAGAAGCTTCAGAGGTACAGGGACCAAATAAAGACCTGGATTCAGTCCAGTGAGATTAAGGACAAGAAGGTCAGTGCCTCTTATGAGCAGGCTTTGTTGGATGCTCGTAAACTTATTGAGCGTGAAATGGAAAGATTTAAGATTTGTGAAAAGGAGACGAAAACCAAAGCCTTCTCAAAAGAAGGTTTGGGTCAACAGCCTAAAACTGACCCAAAGGAGAAAGCTAAATCAGAGACACGAGATTGGTTGAACAATTTGGTTAGTGAGTTGGAATCTCAGATTGATAATTTTGAAGCTGAGATGGAGGGTCTATCTGTGAAGAAGGGAAAATCAAGGCCACCTAGATTGATTCATCTGGAAACTTCTATTACTCGGCACAAGGCTCATATAATGAAGCTGGAACTAATCTTGAGACTGCTTGATAATGATGAATTGAGTCATGAGGATGTCAACGATGTCAGGGAGTTTTTAGAAGACTATGTGGAAAGGAATCAGGAGGATTTTGATGAATTCAGTGATGTGGATGATCTTTACAGCTCATTGCCACTCGATAAGGTGGAATCCCTTGAAGATCTGGGTACAATTTGCCCTCCTAGCCTTGTGAAGGGTATAACAGCTCTCAGCTTGAAGACTACTTTGGCAACAACGGGAACTCAAGTGCCTGTTACTGTTGCTCCTAATCATCAACCAAATACTGTCACTCAGGATCAGGTTGATGATTCAGCTTTGCTAGATGGTAACACTGATACTCTTTTGAAAACCCCACCTCCTAAGAATAGTGTCCTTGGTTCTTCTGCTGCTACAACACCTACCGGGAACCATGCAGCCTCAACTGCCTTGAATGGTGCAGTGCATGGGTCTGGCTTGTCCGCTACATCAGCCATTCTTCCAGGTTCAAGTTCTGTTCGTGCTGTGGAGGCTACGGGTGCTTCTAATTCATCTCCGGTAAATATGCCCACTTCTGCAAAGGATGAAGAAATTGCTAGCTTCCCAGGCCGTAAATTGTCTCCATCATTTGCGGATACTGGACTTGTAAGGGGTGGCATGGGAAGAGGTGTCACTGCTAATCAACCAGCCTCTAGTTCCACCCATACTTCTGGTATTGTGGTTCCTAGCACTATAACTCTTGGTAACGTTTCTTCTGCCTCTGAAGTCACAAAGAGAAACATTTTGGGATCTGAAGAACGGGCTGGTAACAGTGGCTTGGTGCAGTCTATGGGTTCTCCTTTAAGTAATAGAATGGTTTTGCCTACAGCAGCTAAACCTAGTGATGGAACGAGCACAGTTGATCCTAGCAATGTTAGTGATGCAGCGGCTATAGGAAGTCGAGTTTTCTCTCCAGTGGTTCCTAGCATGCAGTGGAGGCCAGGAAGTTCTTTTCAAAATCCGAATGAAGGAGGGCAGTTCCGTGGAAGAGCTGAAATAGCGCCTGATCAGAGAGAGAAGTTCTTGCAGCGTCTCCAGCAAGTTCAGCAACAGGGTCATAGTACACTTCTTAGCATGAATCTTGGTGGAGGGAACAACAAACAATTTTCTTCGCAACAGCAAAGTTCACTTCTACAGCAGTTCAACTCCCAAAATTCATCTGTTAGTTCTCAAGCTGGTCTGGGAATTGGAGTTCAAGCACCAGGAGTTAATGTTGTTACATCTGGTTCATTACAGCAGCAGCCAAGTTCCTTCCAGCAGTCTAATCAGCAGGCATTAATGACAAGTGGGGCGAAAGAATCTGATGTTGCCCCTGTAAAAGTTGAAGAGCAGCAGCAGCCACAGCAGCAACAGAGTTTACCTGAAGATACTACTACTGATTCTGCTGCTGGTTCTGTCCTTGGAAAGAATCTGATGAG
CGACGATGATTTAAAAGGCGCATATCCAGTTGATACTCCAGTTGGCGTACCTGTTTCATTGACTGAGACTGCTTCAGTGTCGAGAGAGGATGACCTTTCTCCAGGTCAACCTTTACAGCATGGTCAACCTTCTAAAAGTCTTGGTGTCATTGGCCGAAGAAGTGTTTCTGACTTGGGTGCCATTGGTGATAACCTCGGTGGATCCTCGTTGACTACTAGCGGAATGCATGATCAGTTCCATAATTTGCAAATGCTTGAGGCTGCATTCTACAAGCTACCTCAGCCAAAAGACTCAGAGCGTCCGAGGAGCTACACTCCAAGGCACCCTGCAGTTACTCCTCCGAGCTATCCTCAAGTGCAGGCACCTATTATAAACAATCCTGCTTTATGGGACCGATTAGGTCTTGAGACCTATGGCACTGACACATTGTTCTTTGCATTTTACTATCAACCGAACACCTATCAACAATATTTGGCTGCTAGAGAATTAAAGAAGCAATCTTGGAGATATCACAGAAAATATCAGACATGGTTCCAAAGACATGAAGAGCCAAAAATTGCTACAGATGAATATGAGCAGGGAACTTATGTGTACTTCGATTTCCATTTTAATAATGATGACCTACAACATGGATGGTGCCAAAGGATTAAAACAGAGTTCACTTTTGAGTATAACTACCTTGAAGATGAACTCAACATATAG

Protein sequence

MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTDNSNQKEKFEADLKKEIKKLQRYRDQIKTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGLGQQPKTDPKEKAKSETRDWLNNLVSELESQIDNFEAEMEGLSVKKGKSRPPRLIHLETSITRHKAHIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSLPLDKVESLEDLGTICPPSLVKGITALSLKTTLATTGTQVPVTVAPNHQPNTVTQDQVDDSALLDGNTDTLLKTPPPKNSVLGSSAATTPTGNHAASTALNGAVHGSGLSATSAILPGSSSVRAVEATGASNSSPVNMPTSAKDEEIASFPGRKLSPSFADTGLVRGGMGRGVTANQPASSSTHTSGIVVPSTITLGNVSSASEVTKRNILGSEERAGNSGLVQSMGSPLSNRMVLPTAAKPSDGTSTVDPSNVSDAAAIGSRVFSPVVPSMQWRPGSSFQNPNEGGQFRGRAEIAPDQREKFLQRLQQVQQQGHSTLLSMNLGGGNNKQFSSQQQSSLLQQFNSQNSSVSSQAGLGIGVQAPGVNVVTSGSLQQQPSSFQQSNQQALMTSGAKESDVAPVKVEEQQQPQQQQSLPEDTTTDSAAGSVLGKNLMSDDDLKGAYPVDTPVGVPVSLTETASVSREDDLSPGQPLQHGQPSKSLGVIGRRSVSDLGAIGDNLGGSSLTTSGMHDQFHNLQMLEAAFYKLPQPKDSERPRSYTPRHPAVTPPSYPQVQAPIINNPALWDRLGLETYGTDTLFFAFYYQPNTYQQYLAARELKKQSWRYHRKYQTWFQRHEEPKIATDEYEQGTYVYFDFHFNNDDLQHGWCQRIKTEFTFEYNYLEDELNI
Homology
BLAST of CmaCh06G017350 vs. ExPASy Swiss-Prot
Match: O75175 (CCR4-NOT transcription complex subunit 3 OS=Homo sapiens OX=9606 GN=CNOT3 PE=1 SV=1)

HSP 1 Score: 314.3 bits (804), Expect = 4.4e-84
Identity = 300/926 (32.40%), Positives = 415/926 (44.82%), Query Frame = 0

Query: 1   MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTDNSNQKEKFEADLKKEIKKLQRYRD 60
           M   RKLQGEIDR LKKV EGV+ F+ IW K+++  N+NQKEK+EADLKKEIKKLQR RD
Sbjct: 1   MADKRKLQGEIDRCLKKVSEGVEQFEDIWQKLHNAANANQKEKYEADLKKEIKKLQRLRD 60

Query: 61  QIKTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGLGQQPK 120
           QIKTW+ S+EIKDK+        L+D RKLIE +MERFK+ E+ETKTKA+SKEGLG   K
Sbjct: 61  QIKTWVASNEIKDKR-------QLIDNRKLIETQMERFKVVERETKTKAYSKEGLGLAQK 120

Query: 121 TDPKEKAKSETRDWLNNLVSELESQIDNFEAEMEGLSV----KKG-KSRPPRLIHLETSI 180
            DP +K K E   WL N +  L  Q+D FE+E+E LSV    KKG K +  R+  L+  I
Sbjct: 121 VDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDKDKQDRIEGLKRHI 180

Query: 181 TRHKAHIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSLPL 240
            +H+ H+  LE ILR+LDND +  + +  +++ +E YV+ +Q+   +F + + LY  L L
Sbjct: 181 EKHRYHVRMLETILRMLDNDSILVDAIRKIKDDVEYYVDSSQD--PDFEENEFLYDDLDL 240

Query: 241 DKVESLEDLGTICPPSLVKGITALSLKTTLATTGTQVPVTVAPNHQPNTVTQDQVDDSAL 300
           + +          P +LV                     T  P+H         ++D   
Sbjct: 241 EDI----------PQALV--------------------ATSPPSH-------SHMEDE-- 300

Query: 301 LDGNTDTLLKTPPPKNSVLGSSAATTPTGNHAASTALNGAVHGSGLSATSAILPGSSSVR 360
                             + + +++TPT     ST  +  +  S  + T+          
Sbjct: 301 ------------------IFNQSSSTPT-----STTSSSPIPPSPANCTT---------- 360

Query: 361 AVEATGASNSSPVNMPTSAKDEEIASFPGRKLSPSFADTGLVRGGMGRGVTANQPASSST 420
                   NS        + D E++  P +                      ++P  S+ 
Sbjct: 361 -------ENSEDDKKRGRSTDSEVSQSPAK--------------------NGSKPVHSNQ 420

Query: 421 HTSGIVVPSTITLGNVSSASEVTKRNILGSEERAGNSGLVQSMGSPLSNRMVLPTAAKPS 480
           H     VP T   G   +AS ++           GN+G+      P +       A   +
Sbjct: 421 HPQSPAVPPTYPSGPPPAASALS--------TTPGNNGVPAPAAPPSALGPKASPAPSHN 480

Query: 481 DGTSTVDPSNVSDAAAIGSRVFSPVVPSMQWRPGSSFQNPNEGGQFRGRAEIAPDQREKF 540
            GT       V+  A  G     P  PS+Q   G        GG   G            
Sbjct: 481 SGTPAPYAQAVAPPAPSGPSTTQPRPPSVQPSGGGG------GGSGGG------------ 540

Query: 541 LQRLQQVQQQGHSTLLSMNLGGGNNKQFSSQQQSSLLQQFNSQNSSVSSQAGLGIGVQAP 600
                     G S+  + + GGG  KQ  +   SS++   +    ++SS  G     QA 
Sbjct: 541 ----------GSSSSSNSSAGGGAGKQNGATSYSSVVAD-SPAEVALSSSGGNNASSQAL 600

Query: 601 GVNVVTSGSLQQQPSSFQQSNQQALMTSGAKESDVAPVKVEEQQQPQQQQSLPEDTTTD- 660
           G     SG     PS+ ++ +  A   +G     VAP        P     LP +  +  
Sbjct: 601 G---PPSGPHNPPPSTSKEPSAAAPTGAGG----VAPGSGNNSGGPSLLVPLPVNPPSSP 660

Query: 661 -------SAAGSVLG--KNLMSDDDLKGAYPVDTPVGVPVSLTETASVS----------- 720
                   AAG++L       +  ++K   P+ +      S+ E A++S           
Sbjct: 661 TPSFSDAKAAGALLNGPPQFSTAPEIKAPEPLSS----LKSMAERAAISSGIEDPVPTLH 720

Query: 721 -REDDL---SPGQPLQHGQPSKSLGVIGRRSVSDLGAIGDNLGGSSLTTSGMHDQFHNLQ 780
             E D+   S   P    QP   L  +       LG     LG   LT     +Q +   
Sbjct: 721 LTERDIILSSTSAPPASAQPPLQLSEV--NIPLSLGVC--PLGPVPLT----KEQLYQQA 749

Query: 781 MLEAAFYKLPQPKDSERPRSYTPRHPAVTPPSYPQVQAPIINNPALWDRLGLETYGTDTL 840
           M EAA++ +P P DSER R Y PR+P  TPP + Q+  P  +    + RL      T+TL
Sbjct: 781 MEEAAWHHMPHPSDSERIRQYLPRNPCPTPPYHHQMPPPHSDTVEFYQRL-----STETL 749

Query: 841 FFAFYYQPNTYQQYLAARELKKQSWRYHRKYQTWFQRHEEPKIATDEYEQGTYVYFDFHF 897
           FF FYY   T  QYLAA+ LKKQSWR+H KY  WFQRHEEPK  TDE+EQGTY+YFD+  
Sbjct: 841 FFIFYYLEGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEEPKTITDEFEQGTYIYFDY-- 749
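
The percentages in an HSP summary line are the match counts divided by the alignment length (926 columns here, gaps included), not by the length of the query protein. The following is a minimal sketch for re-deriving them from such a line; the function name `recompute` and the regular expression are illustrative assumptions and are not part of the database or of BLAST itself.

```python
import re

# Matches fields of the form "Label = count/length (xx.xx%)".
# Hypothetical helper pattern, written for the summary lines on this page.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def recompute(summary_line):
    """Return (label, reported, recomputed) for each count/length field."""
    rows = []
    for label, count, length, reported in FIELD.findall(summary_line):
        # Percentage over the alignment length, formatted to two decimals.
        recomputed = f"{100 * int(count) / int(length):.2f}"
        rows.append((label, reported, recomputed))
    return rows

line = "Identity = 300/926 (32.40%), Positives = 415/926 (44.82%), Query Frame = 0"
for label, reported, recomputed in recompute(line):
    print(label, reported, recomputed)
```

The "Query Frame" field is skipped because it has no count/length pair; the same function can be applied to the summary lines of the other hits.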

BLAST of CmaCh06G017350 vs. ExPASy Swiss-Prot
Match: Q8K0V4 (CCR4-NOT transcription complex subunit 3 OS=Mus musculus OX=10090 GN=Cnot3 PE=1 SV=1)

HSP 1 Score: 313.2 bits (801), Expect = 9.9e-84
Identity = 307/924 (33.23%), Positives = 422/924 (45.67%), Query Frame = 0

Query: 1   MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTDNSNQKEKFEADLKKEIKKLQRYRD 60
           M   RKLQGEIDR LKKV EGV+ F+ IW K+++  N+NQKEK+EADLKKEIKKLQR RD
Sbjct: 1   MADKRKLQGEIDRCLKKVSEGVEQFEDIWQKLHNAANANQKEKYEADLKKEIKKLQRLRD 60

Query: 61  QIKTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGLGQQPK 120
           QIKTW+ S+EIKDK+        L++ RKLIE +MERFK+ E+ETKTKA+SKEGLG   K
Sbjct: 61  QIKTWVASNEIKDKR-------QLIENRKLIETQMERFKVVERETKTKAYSKEGLGLAQK 120

Query: 121 TDPKEKAKSETRDWLNNLVSELESQIDNFEAEMEGLSV----KKG-KSRPPRLIHLETSI 180
            DP +K K E   WL N +  L  Q+D FE+E+E LSV    KKG K +  R+  L+  I
Sbjct: 121 VDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDKDKQDRIEGLKRHI 180

Query: 181 TRHKAHIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSLPL 240
            +H+ H+  LE ILR+LDND +  + +  +++ +E YV+ +Q+   +F + + LY  L L
Sbjct: 181 EKHRYHVRMLETILRMLDNDSILVDAIRKIKDDVEYYVDSSQD--PDFEENEFLYDDLDL 240

Query: 241 DKVESLEDLGTICPPSLVKGITALSLKTTLATTGTQVPVTVAPNHQPNTVTQDQVDDSAL 300
           + +   + L    PPS       +  +++   T T     + P+  P   T +  +D   
Sbjct: 241 EDIP--QALVATSPPSHSHMEDEIFNQSSSTPTSTTSSSPIPPS--PANCTTENSEDDKK 300

Query: 301 LDGNTDTLLKTPPPKNSVLGSSAATTPTGNHAASTALNGAVHGSGLSATSAILPGSSSVR 360
              +TD+ +   P KN      +    +  H  S A+    + SG   T++ L  +    
Sbjct: 301 RGRSTDSEVSQSPAKN-----GSKPVHSNQHPQSPAV-PPTYPSGPPPTTSALSST---- 360

Query: 361 AVEATGASNSSPVNMPTSAKDEEIASFPGRKLSPSFADTGLVRGGMGRGVTANQPASSST 420
                G + +S    PTSA         G K SP                       + +
Sbjct: 361 ----PGNNGASTPAAPTSAL--------GPKASP-----------------------APS 420

Query: 421 HTSGIVVPSTITLGNVSSASEVTKRNILGSEERAGNSGLVQSMGSPLSNRMVLPTAAKPS 480
           H SG   P          A  V   N  G                  SN    P +A+PS
Sbjct: 421 HNSGTPAP---------YAQAVAPPNASGP-----------------SNAQPRPPSAQPS 480

Query: 481 DGTSTVDPSNVSDAAAIGSRVFSPVVPSMQWRPGSSFQNPNEGGQFRGRAEIAPDQREKF 540
            G+              GS              G S  N N G                 
Sbjct: 481 GGSGG------------GS--------------GGSSSNSNSG----------------- 540

Query: 541 LQRLQQVQQQGHSTLLSMNLGGGNNKQFSSQQQSSLLQQFNSQNSSVSSQAGLGIGVQAP 600
                               GGG  KQ  +   SS++   +    ++SS  G     QA 
Sbjct: 541 -------------------TGGGAGKQNGATSYSSVVAD-SPAEVTLSSSGGSSASSQAL 600

Query: 601 GVNVVTSGSLQQQPSSFQQSNQQA-----LMTSGAKESDVAPVKVEEQQQPQQQQSLPED 660
           G    TSG     PS+ ++S+  A      + SG+  +   P  +     P    S P  
Sbjct: 601 G---PTSGPHNPAPSTSKESSTAAPSGAGNVASGSGNNSGGPSLL--VPLPVNPPSSPTP 660

Query: 661 TTTDS-AAGSVLG--KNLMSDDDLKGAYPVDTPVGVPVSLTETASVSR--EDDL------ 720
           + +++ AAG++L       +  ++K   P+ +      S+ E A++S   ED +      
Sbjct: 661 SFSEAKAAGTLLNGPPQFSTTPEIKAPEPLSS----LKSMAERAAISSGIEDPVPTLHLT 720

Query: 721 -------SPGQPLQHGQPSKSLGVIGRRSVSDLGAIGDNLGGSSLTTSGMHDQFHNLQML 780
                  S   P    QP   L  +       LG     LG  SLT     +Q +   M 
Sbjct: 721 DRDIILSSTSAPPTSSQPPLQLSEV--NIPLSLGVC--PLGPVSLT----KEQLYQQAME 747

Query: 781 EAAFYKLPQPKDSERPRSYTPRHPAVTPPSYPQVQAPIINNPALWDRLGLETYGTDTLFF 840
           EAA++ +P P DSER R Y PR+P  TPP + Q+  P  +    + RL      T+TLFF
Sbjct: 781 EAAWHHMPHPSDSERIRQYLPRNPCPTPPYHHQMPPPHSDTVEFYQRL-----STETLFF 747

Query: 841 AFYYQPNTYQQYLAARELKKQSWRYHRKYQTWFQRHEEPKIATDEYEQGTYVYFDFHFNN 897
            FYY   T  QYLAA+ LKKQSWR+H KY  WFQRHEEPK  TDE+EQGTY+YFD+    
Sbjct: 841 IFYYLEGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEEPKTITDEFEQGTYIYFDY---- 747

BLAST of CmaCh06G017350 vs. ExPASy Swiss-Prot
Match: O13870 (General negative regulator of transcription subunit 3 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=not3 PE=1 SV=2)

HSP 1 Score: 234.6 bits (597), Expect = 4.5e-60
Identity = 251/900 (27.89%), Positives = 362/900 (40.22%), Query Frame = 0

Query: 3   ASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTDNSNQKEKFEADLKKEIKKLQRYRDQI 62
           ++RKLQ EI++  KKV +G+ +FD ++ K+  +++ +QKEK E DLK +IKKLQR RDQI
Sbjct: 2   SARKLQVEIEKTFKKVTDGIAIFDEVYEKLSASNSVSQKEKLEGDLKTQIKKLQRLRDQI 61

Query: 63  KTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGLGQQPKTD 122
           KTW  S++IKDKK       ALL+ R+LIE +ME FK  E+E K KAFSKEGL    K D
Sbjct: 62  KTWASSNDIKDKK-------ALLENRRLIEAKMEEFKAVEREMKIKAFSKEGLSIASKLD 121

Query: 123 PKEKAKSETRDWLNNLVSELESQIDNFEAEMEGL--SVKKGKSRPPRLIH---LETSITR 182
           PKEK K +T  W++N V ELE Q +  EAE E L  + K+GK    +L H   LE+ I R
Sbjct: 122 PKEKEKQDTIQWISNAVEELERQAELIEAEAESLKATFKRGKKDLSKLSHLSELESRIER 181

Query: 183 HKAHIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSLPLDK 242
           HK H  KLELI+R L+N ++S E VND++E +  YVE +Q   ++F++ ++LY  L LD+
Sbjct: 182 HKWHQDKLELIMRRLENSQISPEAVNDIQEDIMYYVECSQS--EDFAEDENLYDELNLDE 241

Query: 243 VESLEDLGTICPPSLVKGITALSLKTTLATTGTQVPVTVAPNHQPNTVTQDQVDDSALLD 302
             +  D                                                  A   
Sbjct: 242 ASASYD--------------------------------------------------AERS 301

Query: 303 GNTDTLLKTPPPKNSVLGSSAATTPTGNHAASTALNGAVHGSGLSATSAILPGSSSVRAV 362
           G + +   +P P  S   SS           +   + A     +SA +++      +   
Sbjct: 302 GRSSSSSHSPSPSASSSSSS----------ENLLQDKAEAEEKVSADASV----QDIAEK 361

Query: 363 EATGASNSSPVNMPTSAKDEEIASFPGRKLSPSFADTGLVRGGMGRGVTANQPASSSTHT 422
           E+  A      N     ++E  A                         T    A S+   
Sbjct: 362 ESLDADKELATNDQEDDEEENQAE------------------------TQKDGAISNNEN 421

Query: 423 SGIVVPSTITLGNVSSASEVTKRNILGSEERAGNSGLVQSMGSPLSNRMVLPTAAKPSDG 482
               V +T    + S+ + +TK              L+Q+  +PLS              
Sbjct: 422 MQSEVQTTNPSASTSAVTNITKPT------------LIQNPSTPLS-------------- 481

Query: 483 TSTVDPSNVSDAAAIGSRVFSPVVPSMQWRPGSSFQNPNEGGQFRGRAEIAPDQREKFLQ 542
                           S+V SP  P+      ++   P    ++   A  A     K   
Sbjct: 482 -------------VSNSKVASPETPN------ATHTAPKVEMRYASAAAAAAAALAK--- 541

Query: 543 RLQQVQQQGHSTLLSMNLGGGNNKQFSSQQQSSLLQQFNSQNSSVSSQAGLGIGVQAPGV 602
                                      S     ++QQ   +  +        I  +   +
Sbjct: 542 --------------------------ESPSHHYIMQQVRPETPNSPRLNSTVIQSKWDSL 601

Query: 603 NVVTSGSLQQQPSSFQQSNQQALMTSGAKESDVAPVKVEEQQQPQQQQSLPEDTTTDSAA 662
               S  +Q QP       +    +S   E++V P K E    P                
Sbjct: 602 GHTASPKMQTQPV------RSVSQSSATTETNVKPTKEENADVP---------------- 635

Query: 663 GSVLGKNLMSDDDLKGAYPVDTPVGVPVSLTETASVSREDDLSPGQPLQHGQPSKSLGVI 722
                  + S D LK              L    + S+E         QH          
Sbjct: 662 -------VSSPDYLK-------------DLVNALNTSKE---------QH---------- 635

Query: 723 GRRSVSDLGAIGDNLGGSSLTTSGMHDQFHNLQMLEAAFYKLPQPKDSERPRSYTPRHPA 782
                                  G  D+    + L  +   +P   D+ +P+ Y P+ P 
Sbjct: 722 ----------------------KGAIDKEKLTEALNISCVYVPDATDAAKPQYYIPKDPY 635

Query: 783 VTPPSYPQVQAPIINNPALWDRLGLETYGTDTLFFAFYYQPNTYQQYLAARELKKQSWRY 842
             P  YPQ   P+ ++  +      E    DTLF+ FYY+P TYQQY+A +ELKKQSWR+
Sbjct: 782 PVPHYYPQQPLPLFDSSEM-----TELVDPDTLFYMFYYRPGTYQQYIAGQELKKQSWRF 635

Query: 843 HRKYQTWFQRHEEPKIATDEYEQGTYVYFDFHFNNDDLQHGWCQRIKTEFTFEYNYLEDE 898
           H+KY TWFQRHEEPK+ TDE+E G+Y YFDF       +  W QR K +F F Y YLED+
Sbjct: 842 HKKYTTWFQRHEEPKMITDEFESGSYRYFDF-------EGDWVQRKKADFRFTYQYLEDD 635
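
The percentages in the HSP header lines are the matched-column counts divided by the alignment length. As a quick sanity check, the sketch below recomputes them from the fractions reported on this page. The dictionary name `HSP_STATS` and the helper `recompute` are illustrative choices, not part of any BLAST tool, and only the numeric fractions are copied from the headers.

```python
import re

# Fractions copied from the HSP header lines on this page
# (identity first, positives second). Keys are the match names.
HSP_STATS = {
    "O13870": "251/900 (27.89%), 362/900 (40.22%)",
    "P06102": "91/246 (36.99%), 150/246 (60.98%)",
    "Q12514": "94/234 (40.17%), 138/234 (58.97%)",
    "AT5G18230.1": "566/902 (62.75%), 668/902 (74.06%)",
    "AT5G18230.2": "566/904 (62.61%), 668/904 (73.89%)",
}

# Matches "count/length (percent%)".
FRACTION = re.compile(r"(\d+)/(\d+) \((\d+\.\d+)%\)")


def recompute(stats: str) -> list[tuple[float, float]]:
    """Return (recomputed, reported) percentage pairs for each fraction."""
    return [
        (round(100 * int(count) / int(length), 2), float(reported))
        for count, length, reported in FRACTION.findall(stats)
    ]


if __name__ == "__main__":
    for name, stats in HSP_STATS.items():
        for recomputed, reported in recompute(stats):
            print(f"{name}: recomputed {recomputed:.2f}%, reported {reported:.2f}%")
```

Because the denominator is the alignment length including gap columns, a long alignment with large gapped regions (such as the O13870 hit) can show a low overall identity even when the terminal domains are well conserved.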

BLAST of CmaCh06G017350 vs. ExPASy Swiss-Prot
Match: P06102 (General negative regulator of transcription subunit 3 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=NOT3 PE=1 SV=2)

HSP 1 Score: 145.6 bits (366), Expect = 2.7e-33
Identity = 91/246 (36.99%), Positives = 150/246 (60.98%), Query Frame = 0

Query: 3   ASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYD-TDNSNQKEKFEADLKKEIKKLQRYRDQ 62
           A RKLQ E+DRV KK+ EG+++F+S + +    T+N +QK+K E+DLK+E+KKLQR R+Q
Sbjct: 2   AHRKLQQEVDRVFKKINEGLEIFNSYYERHESCTNNPSQKDKLESDLKREVKKLQRLREQ 61

Query: 63  IKTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGLGQQPKT 122
           IK+W  S +IKDK        +LLD R+ +E  ME++K  EK +K KA+S   L +    
Sbjct: 62  IKSWQSSPDIKDK-------DSLLDYRRSVEIAMEKYKAVEKASKEKAYSNISLKKSETL 121

Query: 123 DPKEKAKSETRDWLNNLVSELESQIDNFEAEMEGLSVKKGKSRPPRLIHLETS------I 182
           DP+E+ + +  ++L+ ++ ELE Q D+ + E++ L +   K +     + E         
Sbjct: 122 DPQERERRDISEYLSQMIDELERQYDSLQVEIDKLLLLNKKKKTSSTTNDEKKEQYKRFQ 181

Query: 183 TRHKAHIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSLPL 242
            R++ H  ++EL LRLL N+EL  +DV +V++ +  +VE NQ+   +F + + +Y  L L
Sbjct: 182 ARYRWHQQQMELALRLLANEELDPQDVKNVQDDINYFVESNQD--PDFVEDETIYDGLNL 238

BLAST of CmaCh06G017350 vs. ExPASy Swiss-Prot
Match: Q12514 (General negative regulator of transcription subunit 5 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=NOT5 PE=1 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 1.0e-32
Identity = 94/234 (40.17%), Positives = 138/234 (58.97%), Query Frame = 0

Query: 3   ASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTD--NSNQKEKFEADLKKEIKKLQRYRD 62
           + RKLQ +ID++LKKV+EG++ FD I+ K   TD  NS+ +EK E+DLK+EIKKLQ++RD
Sbjct: 2   SQRKLQQDIDKLLKKVKEGIEDFDDIYEKFQSTDPSNSSHREKLESDLKREIKKLQKHRD 61

Query: 63  QIKTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGL-GQQP 122
           QIKTW+   ++KDK      +  L+  R+LIE  MERFK  EK  KTK FSKE L     
Sbjct: 62  QIKTWLSKEDVKDK------QSVLMTNRRLIENGMERFKSVEKLMKTKQFSKEALTNPDI 121

Query: 123 KTDPKEKAKSETRDWLNNLVSELESQIDNFEAEMEGLSVKKGKSRPPRLIHLETSITRHK 182
             DPKE  K +   ++++ + EL+ Q++ +EA+                   E    RH+
Sbjct: 122 IKDPKELKKRDQVLFIHDCLDELQKQLEQYEAQEN-----------------EEQTERHE 181

Query: 183 AHIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSL 234
            HI  LE IL+ L N+E+  E V + ++ ++ YVE N  D  +F + D +Y  +
Sbjct: 182 FHIANLENILKKLQNNEMDPEPVEEFQDDIKYYVENN--DDPDFIEYDTIYEDM 210
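
Each HSP header reports a bit score followed by the raw score in parentheses, for example 234.6 bits (597). The two are related by the Karlin-Altschul normalisation S' = (λS − ln K) / ln 2. The sketch below assumes the commonly used gapped BLOSUM62 parameters (λ = 0.267, K = 0.041, gap open 11, gap extend 1). This page does not state which scoring system was used, so those constants are an assumption; under it, the reported pairs are reproduced to one decimal place.

```python
import math

# ASSUMPTION: Karlin-Altschul parameters for gapped BLOSUM62
# (gap open 11, gap extend 1). The page does not state the scoring system.
LAMBDA = 0.267
K = 0.041

# (bit score, raw score) pairs copied from the HSP header lines on this page.
REPORTED = {
    "O13870": (234.6, 597),
    "P06102": (145.6, 366),
    "Q12514": (143.7, 361),
    "AT5G18230.1": (947.6, 2448),
    "AT5G18230.2": (942.6, 2435),
}


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a bit score: (lambda*S - ln K) / ln 2."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


if __name__ == "__main__":
    for name, (bits, raw) in REPORTED.items():
        print(f"{name}: raw {raw} -> {bit_score(raw):.1f} bits (reported {bits})")
```

Bit scores are independent of the scoring matrix scale, which is why they, rather than raw scores, are the values to compare across searches against different databases.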

BLAST of CmaCh06G017350 vs. TAIR 10
Match: AT5G18230.1 (transcription regulator NOT2/NOT3/NOT5 family protein )

HSP 1 Score: 947.6 bits (2448), Expect = 7.3e-276
Identity = 566/902 (62.75%), Positives = 668/902 (74.06%), Query Frame = 0

Query: 1   MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTDNSNQKEKFEADLKKEIKKLQRYRD 60
           MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTDN NQKEKFEADLKKEIKKLQRYRD
Sbjct: 1   MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKVYDTDNVNQKEKFEADLKKEIKKLQRYRD 60

Query: 61  QIKTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGLGQQPK 120
           QIKTWIQSSEIKDKKVSASYEQ+L+DARKLIE+EMERFKICEKETKTKAFSKEGLGQQPK
Sbjct: 61  QIKTWIQSSEIKDKKVSASYEQSLVDARKLIEKEMERFKICEKETKTKAFSKEGLGQQPK 120

Query: 121 TDPKEKAKSETRDWLNNLVSELESQIDNFEAEMEGLSVKKGKSRPPRLIHLETSITRHKA 180
           TDPKEKAKSETRDWLNN+VSELESQID+FEAE+EGLSVKKGK+RPPRL HLETSITRHK 
Sbjct: 121 TDPKEKAKSETRDWLNNVVSELESQIDSFEAELEGLSVKKGKTRPPRLTHLETSITRHKD 180

Query: 181 HIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSLPLDKVES 240
           HI+KLELILRLLDNDELS E VNDV++FL+DYVERNQ+DFDEFSDVD+LYS+LPLD+VE 
Sbjct: 181 HIIKLELILRLLDNDELSPEQVNDVKDFLDDYVERNQDDFDEFSDVDELYSTLPLDEVEG 240

Query: 241 LEDLGTICPPSLVKGITALSLKTTLATTGTQVPVTVAPNHQPNTVTQDQVDDSALLDGNT 300
           LEDL T  P  LVKG T LS+K++LA + +QV     P H      Q++ +D++L D + 
Sbjct: 241 LEDLVTAGP--LVKG-TPLSMKSSLAASASQVRSISLPTHH-----QEKTEDTSLPDSSA 300

Query: 301 DTLLKTPPPKNSVLGSSAATTPTGNHAASTALNGAVHGSGLSATSAILPGSSSVRAVEAT 360
           + + KTPPPKN     SA +TP G   +     G V  + ++ +++I P  +S   +E+ 
Sbjct: 301 EMVPKTPPPKNGAGLHSAPSTPAGGRPSLNVPAGNVSNTSVTLSTSI-PTQTS---IESM 360

Query: 361 GASNSSPVNMPTSAKDEEIASFPGRKLSPSFADTGLVRGGMGRGVTANQPASSSTHTSGI 420
           G+ +      P +AK+E+  + P RK   S ADT L   G+GR    NQP  S   +   
Sbjct: 361 GSLS------PVAAKEEDATTLPSRKPPSSVADTPL--RGIGRVGIPNQPQPSQPPSP-- 420

Query: 421 VVPSTITLGNVSSASEVTKRNILGSEERAGNSGLVQSMGSPLSNRMVLPTAAKPSDGTST 480
            +P+  +  + +SA+EV KRNI+G E        VQ + SPLS +MVLP  AK +DGT++
Sbjct: 421 -IPANGSRISATSAAEVAKRNIMGVESN------VQPLTSPLS-KMVLPPTAKGNDGTAS 480

Query: 481 VDPSNVSDAAAIGSRVFSP-VVPSMQWRPGSSFQNPNEGGQFRGRAEIAPDQREKFLQRL 540
              SN  D AA   R FSP +V   QWRPGS FQ+ NE    RGR EIAPDQREKFLQRL
Sbjct: 481 --DSNPGDVAASIGRAFSPSIVSGSQWRPGSPFQSQNE--TVRGRTEIAPDQREKFLQRL 540

Query: 541 QQVQQQGHSTLLSM-NLGGGNNKQFSSQQQSSLLQQFNSQNSSVSSQAGLGIGVQAPGVN 600
           QQV QQGH  LL + +L GGN KQFSSQQQ+ LLQ    Q+SS+S    LGIGVQAPG N
Sbjct: 541 QQV-QQGHGNLLGIPSLSGGNEKQFSSQQQNPLLQ----QSSSISPHGSLGIGVQAPGFN 600

Query: 601 VVTSGSLQQQPSSF-QQSNQQALMTSGAKESDVAPVKVEEQQQPQQQQSLPEDTTTDSAA 660
           V++S SLQQQ ++  QQ  QQ  +      +DV  V+ ++    Q QQ+LP+D+ + +A+
Sbjct: 601 VMSSASLQQQSNAMSQQLGQQPSV------ADVDHVRNDD----QSQQNLPDDSASIAAS 660

Query: 661 GSVLGKNLMSDDDLKGAYPVDTPVGVPVSLTETASVSREDDLSPGQPLQHGQPSKSLGVI 720
                K + S+DD K  +  DTP G+P  + +   VS   D SPGQP+Q GQ S SLGVI
Sbjct: 661 -----KAIQSEDDSKVLF--DTPSGMPSYMLDPVQVSSGPDFSPGQPIQPGQSSSSLGVI 720

Query: 721 GRRSVSDLGAIGDNLGGSSLTTSGMHDQFHNLQMLEAAFYKLPQPKDSERPRSYTPRHPA 780
           GRRS S+LGAIGD           MHDQ HNLQMLEAAFYK PQP DSERPR Y+PR+PA
Sbjct: 721 GRRSNSELGAIGD-----PSAVGPMHDQMHNLQMLEAAFYKRPQPSDSERPRPYSPRNPA 780

Query: 781 VTPPSYPQVQAPIINNPALWDRLGLETYGTDTLFFAFYYQPNTYQQYLAARELKKQSWRY 840
           +TP ++PQ QAPIINNP LW+RLG + YGTDTLFFAFYYQ N+YQQYLAA+ELKKQSWRY
Sbjct: 781 ITPQTFPQTQAPIINNPLLWERLGSDAYGTDTLFFAFYYQQNSYQQYLAAKELKKQSWRY 840

Query: 841 HRKYQTWFQRHEEPKIATDEYEQGTYVYFDFHFNNDDLQH-GWCQRIKTEFTFEYNYLED 899
           HRK+ TWFQRH+EPKIATDEYEQG YVYFDF    D+ Q  GWCQRIK EFTFEY+YLED
Sbjct: 841 HRKFNTWFQRHKEPKIATDEYEQGAYVYFDFQTPKDENQEGGWCQRIKNEFTFEYSYLED 841

BLAST of CmaCh06G017350 vs. TAIR 10
Match: AT5G18230.2 (transcription regulator NOT2/NOT3/NOT5 family protein )

HSP 1 Score: 942.6 bits (2435), Expect = 2.4e-274
Identity = 566/904 (62.61%), Positives = 668/904 (73.89%), Query Frame = 0

Query: 1   MGASRKLQGEIDRVLKKVQEGVDVFDSIWNK--VYDTDNSNQKEKFEADLKKEIKKLQRY 60
           MGASRKLQGEIDRVLKKVQEGVDVFDSIWNK  VYDTDN NQKEKFEADLKKEIKKLQRY
Sbjct: 1   MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKWNVYDTDNVNQKEKFEADLKKEIKKLQRY 60

Query: 61  RDQIKTWIQSSEIKDKKVSASYEQALLDARKLIEREMERFKICEKETKTKAFSKEGLGQQ 120
           RDQIKTWIQSSEIKDKKVSASYEQ+L+DARKLIE+EMERFKICEKETKTKAFSKEGLGQQ
Sbjct: 61  RDQIKTWIQSSEIKDKKVSASYEQSLVDARKLIEKEMERFKICEKETKTKAFSKEGLGQQ 120

Query: 121 PKTDPKEKAKSETRDWLNNLVSELESQIDNFEAEMEGLSVKKGKSRPPRLIHLETSITRH 180
           PKTDPKEKAKSETRDWLNN+VSELESQID+FEAE+EGLSVKKGK+RPPRL HLETSITRH
Sbjct: 121 PKTDPKEKAKSETRDWLNNVVSELESQIDSFEAELEGLSVKKGKTRPPRLTHLETSITRH 180

Query: 181 KAHIMKLELILRLLDNDELSHEDVNDVREFLEDYVERNQEDFDEFSDVDDLYSSLPLDKV 240
           K HI+KLELILRLLDNDELS E VNDV++FL+DYVERNQ+DFDEFSDVD+LYS+LPLD+V
Sbjct: 181 KDHIIKLELILRLLDNDELSPEQVNDVKDFLDDYVERNQDDFDEFSDVDELYSTLPLDEV 240

Query: 241 ESLEDLGTICPPSLVKGITALSLKTTLATTGTQVPVTVAPNHQPNTVTQDQVDDSALLDG 300
           E LEDL T  P  LVKG T LS+K++LA + +QV     P H      Q++ +D++L D 
Sbjct: 241 EGLEDLVTAGP--LVKG-TPLSMKSSLAASASQVRSISLPTHH-----QEKTEDTSLPDS 300

Query: 301 NTDTLLKTPPPKNSVLGSSAATTPTGNHAASTALNGAVHGSGLSATSAILPGSSSVRAVE 360
           + + + KTPPPKN     SA +TP G   +     G V  + ++ +++I P  +S   +E
Sbjct: 301 SAEMVPKTPPPKNGAGLHSAPSTPAGGRPSLNVPAGNVSNTSVTLSTSI-PTQTS---IE 360

Query: 361 ATGASNSSPVNMPTSAKDEEIASFPGRKLSPSFADTGLVRGGMGRGVTANQPASSSTHTS 420
           + G+ +      P +AK+E+  + P RK   S ADT L   G+GR    NQP  S   + 
Sbjct: 361 SMGSLS------PVAAKEEDATTLPSRKPPSSVADTPL--RGIGRVGIPNQPQPSQPPSP 420

Query: 421 GIVVPSTITLGNVSSASEVTKRNILGSEERAGNSGLVQSMGSPLSNRMVLPTAAKPSDGT 480
              +P+  +  + +SA+EV KRNI+G E        VQ + SPLS +MVLP  AK +DGT
Sbjct: 421 ---IPANGSRISATSAAEVAKRNIMGVESN------VQPLTSPLS-KMVLPPTAKGNDGT 480

Query: 481 STVDPSNVSDAAAIGSRVFSP-VVPSMQWRPGSSFQNPNEGGQFRGRAEIAPDQREKFLQ 540
           ++   SN  D AA   R FSP +V   QWRPGS FQ+ NE    RGR EIAPDQREKFLQ
Sbjct: 481 AS--DSNPGDVAASIGRAFSPSIVSGSQWRPGSPFQSQNE--TVRGRTEIAPDQREKFLQ 540

Query: 541 RLQQVQQQGHSTLLSM-NLGGGNNKQFSSQQQSSLLQQFNSQNSSVSSQAGLGIGVQAPG 600
           RLQQV QQGH  LL + +L GGN KQFSSQQQ+ LLQ    Q+SS+S    LGIGVQAPG
Sbjct: 541 RLQQV-QQGHGNLLGIPSLSGGNEKQFSSQQQNPLLQ----QSSSISPHGSLGIGVQAPG 600

Query: 601 VNVVTSGSLQQQPSSF-QQSNQQALMTSGAKESDVAPVKVEEQQQPQQQQSLPEDTTTDS 660
            NV++S SLQQQ ++  QQ  QQ  +      +DV  V+ ++    Q QQ+LP+D+ + +
Sbjct: 601 FNVMSSASLQQQSNAMSQQLGQQPSV------ADVDHVRNDD----QSQQNLPDDSASIA 660

Query: 661 AAGSVLGKNLMSDDDLKGAYPVDTPVGVPVSLTETASVSREDDLSPGQPLQHGQPSKSLG 720
           A+     K + S+DD K  +  DTP G+P  + +   VS   D SPGQP+Q GQ S SLG
Sbjct: 661 AS-----KAIQSEDDSKVLF--DTPSGMPSYMLDPVQVSSGPDFSPGQPIQPGQSSSSLG 720

Query: 721 VIGRRSVSDLGAIGDNLGGSSLTTSGMHDQFHNLQMLEAAFYKLPQPKDSERPRSYTPRH 780
           VIGRRS S+LGAIGD           MHDQ HNLQMLEAAFYK PQP DSERPR Y+PR+
Sbjct: 721 VIGRRSNSELGAIGD-----PSAVGPMHDQMHNLQMLEAAFYKRPQPSDSERPRPYSPRN 780

Query: 781 PAVTPPSYPQVQAPIINNPALWDRLGLETYGTDTLFFAFYYQPNTYQQYLAARELKKQSW 840
           PA+TP ++PQ QAPIINNP LW+RLG + YGTDTLFFAFYYQ N+YQQYLAA+ELKKQSW
Sbjct: 781 PAITPQTFPQTQAPIINNPLLWERLGSDAYGTDTLFFAFYYQQNSYQQYLAAKELKKQSW 840

Query: 841 RYHRKYQTWFQRHEEPKIATDEYEQGTYVYFDFHFNNDDLQH-GWCQRIKTEFTFEYNYL 899
           RYHRK+ TWFQRH+EPKIATDEYEQG YVYFDF    D+ Q  GWCQRIK EFTFEY+YL
Sbjct: 841 RYHRKFNTWFQRHKEPKIATDEYEQGAYVYFDFQTPKDENQEGGWCQRIKNEFTFEYSYL 843

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
O75175 | 4.4e-84 | 32.40 | CCR4-NOT transcription complex subunit 3 OS=Homo sapiens OX=9606 GN=CNOT3 PE=1 S...
Q8K0V4 | 9.9e-84 | 33.23 | CCR4-NOT transcription complex subunit 3 OS=Mus musculus OX=10090 GN=Cnot3 PE=1 ...
O13870 | 4.5e-60 | 27.89 | General negative regulator of transcription subunit 3 OS=Schizosaccharomyces pom...
P06102 | 2.7e-33 | 36.99 | General negative regulator of transcription subunit 3 OS=Saccharomyces cerevisia...
Q12514 | 1.0e-32 | 40.17 | General negative regulator of transcription subunit 5 OS=Saccharomyces cerevisia...

Match Name | E-value | Identity | Description
AT5G18230.1 | 7.3e-276 | 62.75 | transcription regulator NOT2/NOT3/NOT5 family protein
AT5G18230.2 | 2.4e-274 | 62.61 | transcription regulator NOT2/NOT3/NOT5 family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 132..159
None | No IPR available | COILS | Coil | Coil | coord: 41..61
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 110..131
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 762..785
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 691..716
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 603..665
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 635..660
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 603..624
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 297..326
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 555..576
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 691..710
None | No IPR available | PANTHER | PTHR23326:SF21 | BNAA10G16600D PROTEIN | coord: 1..899
IPR012270 | CCR4-NOT complex, subunit 3/ 5 | PIRSF | PIRSF005290 | NOT_su_3_5 | coord: 1..900; e-value: 7.7E-258; score: 856.1
IPR007282 | NOT2/NOT3/NOT5, C-terminal | PFAM | PF04153 | NOT2_3_5 | coord: 756..894; e-value: 1.9E-39; score: 134.8
IPR007207 | CCR4-Not complex component, Not N-terminal domain | PFAM | PF04065 | Not3 | coord: 4..236; e-value: 2.9E-81; score: 272.3
IPR038635 | CCR4-NOT complex subunit 2/3/5, N-terminal domain superfamily | GENE3D | 2.30.30.1020 |  | coord: 738..895; e-value: 3.6E-57; score: 194.4
IPR040168 | Not2/Not3/Not5 | PANTHER | PTHR23326 | CCR4 NOT-RELATED | coord: 1..899
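
Several of the mobidb-lite disorder predictions listed above overlap or nest (for example 603..665 contains both 603..624 and 635..660). To see how much of the protein is actually covered, the intervals can be merged before counting residues. The sketch below does this for the coordinates copied from the mobidb-lite rows; the function name `merge_intervals` is an illustrative helper, and coordinates are treated as 1-based and inclusive, which is an assumption about the `coord:` notation.

```python
# Coordinates copied from the mobidb-lite rows above.
# ASSUMPTION: "start..end" is 1-based and inclusive.
MOBIDB_COORDS = [
    "110..131", "762..785", "691..716", "603..665", "635..660",
    "603..624", "297..326", "555..576", "691..710",
]


def parse_coord(text: str) -> tuple[int, int]:
    """Turn 'start..end' into an (start, end) integer pair."""
    start, end = text.split("..")
    return int(start), int(end)


def merge_intervals(intervals: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged: list[list[int]] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous interval: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged]


if __name__ == "__main__":
    regions = merge_intervals([parse_coord(c) for c in MOBIDB_COORDS])
    covered = sum(end - start + 1 for start, end in regions)
    for start, end in regions:
        print(f"{start}..{end}")
    print(f"residues in predicted disordered regions: {covered}")
```

With these inputs the nine rows collapse to six non-overlapping regions, all of which fall between the N-terminal Not3 domain (4..236) and the C-terminal NOT2_3_5 domain (756..894), apart from 110..131 inside the former and the 762..785 stretch at the start of the latter.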

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh06G017350.1 | CmaCh06G017350.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0030015 CCR4-NOT core complex
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005634 nucleus