CmaCh12G008780 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh12G008780
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionSET domain-containing protein
LocationCma_Chr12: 6619369 .. 6638979 (+)
RNA-Seq ExpressionCmaCh12G008780
SyntenyCmaCh12G008780
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAATTACCTGGAAAATGTTGCAGAAGAGTGGGTATGGCGGAAAGAAAGGCATTGGGAAGACGTACGGTTTCCTCCATGGATGGGGAGAATCAGCGCAGTGGAAGCCATCGCCAAAGACCAGCGTTTCTTGACATCAACCCTTATCTATTGGTGAGATAAAGCATTTTTATATGGAATTAGTAACCGATTTCAGTTGCCCACATCCACACTTATCTTGCAGAGAGGGACGGTTTTGTAGTGGCTCTCATCGCTGATGCAGAGCTTGTTTTGATCCATTCGTATCTATCTGATTGGGATTTTGACTTGTCTCAACGCTCTAATATGGAAAGGAAAAAAAAGTTTTAAATGCTGAATTAATAAAATACACTTATATTTTATAATTTATTTAAAAATATATATACTTAATTTTTTAAATATACCAGTAATAAAAAAATATTTTAAAATTTTTAAAATAATTTAAGAATATTTTTAGAGTAATTTATTTTCCGAGATAAATGTAAAAAAAAAACAAAATTAAATATTTAGAAGGTAGATAAATGATTTATGTTTATATTACTAACCATAAAAAAAAATTTAACAATTTTTATATTAATTTTGTTTCAGAAAGTTTGTGGGTATTTTTATAATATGGTAGCAAAAAAAAAAATTGTAATGAAAATTTTGAAGATAAAAATATATTTTTTTTATATATTTTTGAAACAAAAATATTTATTAAACCAAAAAGGCGTCCACATTCTTACAACGAGGGGACAACAATTTTGTGGTAGTTATTCTAATCTAACGAACAAGTTAGCCGTAAAAAATTCAGACTTAGCGAACAACGTCAACATATGTTCTGGTCACGTTTGAGAGAGATGTTTTAGAAAACGTGAGCGAGGAAATACGTTGAAAACAATCTTAAAGAAAGCAACGTTATAAGCTAAAAGCGAGTTGACAATAACACCGATTTTGATAGTATATAACACGGTAAGAAGAATTGACAACTAATCCCTTCGACTGATCAAAAAGCATTTTCCACTGTTTTTTTTAGTATAAAATATCTTCTAGGGGTGAACCTATTGCCAAGAAAATTCAAAAAAATATGTGAACAACAAAATCTTAACCTACACACAAAAAGCTCAATGGATTTGAAACCTAAAAAATTATTTTTAATTTTATAAATGGTCAATTTGACTTATAAATCCGGTCCACAAACATTTTATGTCGTGAGACTATAAATACTCACTTATGTAAAATGCAAATTCGTGGATCAATGTAATTCAAATCTCAAATTAAGACGTTTCCCTTGAAAATATGAGCATCATATTTGCTATTTCTAAGTTTCAAGCTTTGTGGTAGAATTGCACCCTAGCCATTCAATCAAGCCTTGATCAAGTTAGATTGTGGACTGACTTTCAGATCTTCGATCGTAAGAGGTAATCTAATTCTAGCCTTATCTCTTGGTTGATTCAAATTTCGTATAAGGAGGTGTTAATTATTTGATTTAAGGATTCTTGCTAAAACAAAAGCTGGGATCGTTGGGCTGAGGATTAAGGTTGCCTTATCAGTTATTATGAACTATTAATGTAACAACCCGACCTCCTAGACGGAGAAAAAAACATTACTTGAAAAACATGAAAGAAGTGCTTACAACGTCTGAGTAATCGAGTCAAGATAGGAGACAAATTTCACCAAGAAGGAATAGTGGGCGTCAACGAGTAAGTAGCATATCGAAGTTTAGAAACAAGTTTTAACTTGCTATGTATGCGTACAAAGCATGTTGAGCTCACGATAATAGGATGTGAGCATATTGGAAATTCATGATTAGTATGTATGATGAGGTTTATGCTTATGAGCTTATTATGATATGTGATGTATATGATATAGTCTACTAAACCACGCAAAAGCTACATTGTGTGTTAAGTTAGACACAATGAGTATTCTTGACCACGGGCTATGTGTTACTTCCAACTGCCAAAGAGTTTAGTAGATGAGTCATATTTGACCGATAGCGAGTCTTACATATGAGACCGAAATCTAAGAGAATAGTTGCAAGACATGTGCCCCAATTAGTCTACCAAGCCCCGTGAAGCCTGACGTGTTGTGTGCTAAGTTAATGATAATGAGTATTCCTGACCATCAGACTGTGTGCTATTTTAGGCCACAAAGTGTTTTATAGACGGGTTGTATTTGATCGAAAATGAGTCTTACTTACGGTTCAAAATTTAATGTAAGCATAAACAAGTAAACATGTGTGGTCTCTGTAATGAGGTGAGTCAAAATTCTATTTTACTCGTATAATTACATTTTATACATTGAGTACTATTGATCTAATAATAAACAATTTGTTTATGGTTCAATCATAAAGTCTATTTTGGGCGTGAGAGGTTGAGGTCAATTGTTCTTGTCATGACATGAATTTTGAGGTTTTGAAACTCATATAGAGACTAAAAAGATGAAAGCGAAGACAAATAATAATAAAATAAAATAAATCAAATTGAATGACAAATAACACCAAAATTAAATAAAAATAAAATTCAAATATAAAAAGTTGCGGTTCAAATTTCCTCCAAAAAGGGAAATGAGTTTGAGATGATCTCATCTATGACTGCCCAGCTCTCGAACGCTTCTCCACCTCGACTAGTAGCCTGCGAACACTTGAAAAGAAAGGGAAAATGGTGAGTATATAAATATACCCAGTAAGCAACCTACTTGTAGGCTCTCATCGCATCATTACCTAAGCATCTGGAGTGCCTCAAAGTTTCGAGCAATACTTAAGTTGCTCATGGACTCACTCCGGATTCAAACCCTAATTTTTTAAAAGGGAGAAAAAGAAATCAACGACTGAAGCTCATGCTTGTATTGAATCACTCCGGATTCTCTTGGTAAGTTGAGCTCATGCTTGTATTGACGGAAGCTAAAAAGCTTTAGTGATTCAATAGGGATACCAAGAGAAGAAATCAACGAGCATAAGAGTTATCAGCACAACAATTAAATGAGTTAATAATTCACATAAACTCTCTGAAGAATCAACAATATAATTACTATTGTAGGTCTGTTTTTGGGAGAGAGTCTCATATTGGCTAATTTAGGAAATGATAATAGATTTATAGGTAAGGAATACATCTCTATCTACATGAGATCTTTTGGAGAAATCCAAGGCTAGGAGAGCTTACACTCAAAATGTACAATATTATATCATTAGAAGAGTCGTAATTCCTAACATATATTTGAATTTTATATATTCTCTCACATAATTTTTTCTCATCTATATTTATTTATTTATTTATGTTTTGAATTGTTACTGGACTTGAAAATGAAGAGAAAAGGAGAAAAATGCGCCGAGCTTTAAATAAACCCGTTCTACAAATTCTGCCGCTCAATTCCTACTTCCTCGCTCACTTGCTACATCAACTTCTGTGTTGCAATGGAGATTGACAATCTGGTAAACAAACTTTATCATTGATATTCGCGTTCTTTGATTGGCCTTTTCATTTCTGTTGGATTGTGAAGGAGTTGGTGTTTCTGAACTTATTGCTTGAATTCTTGTTCCCTGCAGTTCTTATTTGTTTGGATTCTTGTGCGATTTCTCCTGTTACTGTTTTTCTTTGTTTTGTTGAGCTTTGCTAGTATAATTTAGTGCCGGAATGTTGGTTTTCGGGTCTTGGAATTTTAGGAAATGTGTATTTTCTTTGGTTTGATGGATTATCTTGTTTTCCTAGTATGATTCTTATTTCAGTTCGTCATATTAGTTTTGTTCTTCACTGTTTCTTCTTTTCTGATTTCGTTTCTTACTTTGTCCCCGAAAAGGAACAAGGTTTATTTTCATCTTCTTGTTCTCTTTGTTAATCGCGGCATTTTTTAGTTGTTTCTTTAATGTTTACCATTGAATCAGAGGGGAAAAACCTTGGTTTAAATTTCTATACCATGGATCCTGTGAAAGTATTGTCAACTGAAAGATTTGAAAATTGTAAAAATTAAAAGAAAAAATAAAGAAGAAAATCTATTCAATATGATAAACACCGCATTGAGAGAAACTGTTTGAGATTTGATTGCTCCCTACTGTTCCATATTGGTATATTCTTTAAAAGTCATGTAACTCTATTCTAATATGTCTATAATTTGATAAGATGTTTAGTTCAATATGAGCTTAGCGTAGCAACTAAAGTATCTCTACTTTCTCTCAAAGATTTGTATCCTCCCGCCCCTCATGCGGTGTAGTATATTAAAGGAAGTAGACAATTGGTTCTGAACTGTTTGTCGCATTTAATTGTGTGTTGCAGCCTTCCGCATCTGATGAAGTAAGAGATGTCAACGATTGCTCAATATGTCTTGCATTGAACCAAAATGACCATCTTTTCAGCAAAAAGAAGGTGAAATATGGATACCCACATTTTGCTTCTACGCTGCTCTCTTTATCAGGAAAGTGTAGAAAGTGAGAGTCTCTATTAAATTTTCAGCAAAATTTGAATAGCAAGGATCGCATAATTTGGAATTTACTACCTTTATTGATAGATAATTTTAGCTTATCCTTTTTCTCCATTAATCGGCATTCTAACTTTTTGAGTGATGGCTAGTACTGATACGAATGCTACATTCAAGTGAATAATGAAGTTGATTATGATTTATGCTCACAGATTAAGCTATAATAAAGAACCAGCAGTTACATAAAACAGGCGATTTTGACCAAAGTCCCTTGCTAGCTGCAATACAATTATTTATTTATCAAGAAACAAGGACTTCCATAAGTATGAAGTGTAGGAAAAAAGTTGGACAAGGAATCCAACCCCAATGGATAAACCCGTACATTAAACCATATTAAGAATAAAGGGAAAATGAATAGGAGTTCATATTTGTTGCCTATCCTTCCCAGTTTTAATTATCAAAATTGGCATGCTCCCAAACTGTACTAACTAAGCCCCATGCTAACAGTCAAGTGGACTCCATGATGAATCATTTTCATTATCTTTGTCTTTTAATCATTTACATGTAATCCTTTTGATCTACATTTCCCAAGTAATTGCATAACTTTGATAGGAAAAAGAAACCTTTTCCCTAAATTAGTATTCTAGTATCCTTTCTAAACCTCACAACTTGGGTTATAAGGCTCGGAGATGCTAAACCCATTCGGGAAGTGTTCCCACATAAAGGAAGTGCAAGAAAATGTCCCGCATGGTTTTAGTCTTGTTGATTAAGAGAGAGTTTCCATTGTCATTTATCCCTCCTGACAAAAGTACAATTAACAATCAACCTAGGAAGCTTAATCTATGCAATCAAACTAGGACTCCATTTATCTCATTTATATATGTTTCCTCGAGAATCTATGCATTTTGATCTGTTGGACGTCATTTAGTTTGGTGGACAGCTTATAATATTCTAACTTACTACGATTTACTATCTCTTATCATTTTGTACATTTCTTTTTTCTGTAGAAATTATTAGAAAGACAGGGTTTCAAGTCTGAGAATTGCATTTACTTGAAGTGCTCTCTTTGTCCTGAGGAAGTAGACACTGTTCTGAAAGAGTTGGTACAAATTGCAAGAATTATTCACTTAAATGAGGTAACCTGAAGAATTTCTTTAGGTCAATGTTTTCCTGTCGTTTCTGGAGATATTTAAGTTACTGCTACACCGCTACGCAGGAATGTTGTATATCTTGTAATAAATTGGAAGGAATGCCAAGTAATTTTGGTACTTGGGTGTGCTTAACTGAGATTCATTTGCATTCCAGTTTTTTTTTTCTTTATGTTGGGGAAATAAGACGGGAAACAAATAGCATTAATTTGTATTTTAGTCTGAAATCCATTTAACATAAGTAATAGATATTGATAGCTACTTTAATGATTGAGCTGCAGCCTGAAATGTATTTTGAAGAAGATGACGCATGTACACCAGCAGATTCCTACAGCCCCAGGAATGAAATGGAGGCCCTCAATACAATAATTTCTCTTGTTGACATCTGTCTCTCTAGTTGTAAGCCTGTCCAGCTTAATGTCCTGCAAGAGCTACGGAAAGCAGCTATTCGTATGATCCATAAGTATGGACATGTATACAGTATGGATGCTAAAACTTTGGGAGACAATTGTGTGAAAGAAAATTGTTTGTTACAGTGGGGTGAGAGCAATGGCGTCAGGACAAGGTTGAAGATAGCTTGTAAGTTTTTTCCATCATAGTTTCTTGGCAATATTCTTGTAATCTACATCTATTGATGCTTGATTTTGAATAATAGCATAATGGAAAGCCTGAAATATTGGAGGTGCACCGTAAGTTTTTCAATACCAAGCAATTGTAATTTGTATGCTTTATTCTTCATATAGATGTTGAAGGTGCTGGTAGAGGAACCATAGCCAAAGAAGATCTAAACGTTGGTGACACCGTATTGGAGATCCCTCTGGATATTGTTATTTCTGAGGAACTTGTGCAGAAAACCACCATGGTAAATTCATCGCTAAGAATCTGTTGTAGTTTCCATCTTAGAACTTGAATCTTCTAGAACTTTCCTGACTGAAAATGATATTAAAAGAATTACAATCTTGGCCTTCATACAGATACTGTTGTTCAATTCCCCGTCTCATCTTTAACAATGCTCTGTTGCACATTGATAATTGAGTGCTTATGTCTAGAGTTAAAGTTGGTTGAATTTCTAGCGATTTTATATTTTCAAGAATTTTTTATCATACAATGATGCTAAGAAGATTATTTTTCTCCCCACCAGCCGTGGGATGCATTGACGTTACATTGTTCCCTCTAAGCATGCATGCTTTAACAATTGATTACAACAAGTTTAGTGGTATGGTTCAAGGTGCGCCAGTTCTGTCTAAAATACTTTTCTATCTAGGTATCTGTTGCAATTAGGATCAGTGTTTCCTCGCATCACAGTAAATTCCATTCTTATAACGAGCATGCACAATTGTATTCAAGTTGGTTACTTTTTGTTGATGTAAGTGCTAGTTAATTCCTTTGGAGAGGCCGGTGAGATTTGTGCTTGTGCTATGACCTAGATAAGGTATAGTTCCAGACAGACATTTTCATTTGGAGACATCATTGCACACACCACTTGTCTTTAAATGGTCTTTTGGATATGTAATCACAGTGAGCAATCCGACTGTGGTAGTTAGAAGTGCTATTTGAAGTGAATGCCCATGGTATAGATTTCAATAGGATGATTTATTTATTTATTTATATTATTATTAATATTTAATTTGTAGCATTAATGTGGAGAGAGAAGTTTGTAACGAATGATTACAAATAGAAAGTAGAGCCATCATGACTTTATTCGTAAATTTCAGTATCCCATATTATCAAAGATCGAAGGCATGTCGTCTGAGACAATGCTGTTGATATGGAGCATGAAGGAGAAGCACATTGTCGATTCCAAATTCAAGGTCTACTTTGATACTCTCCCCGAAGCGTTTAACACTGGTATTTACCCTTACCTTTATTAGATTGTTCTGACTGACTTTTGGAGCTATCATTATGTTTAGAATGAGAACTTGAAAGTGTTAGACGAACACGACTCTCCACAATGATATGATGTTGTGCGCCGAAAGCGGAACAAAGGTACTCACAACATAAATGAACGATTTTACACAAAGAGGCACTAAGATATTTACGTGGTTCGGAGAGAAGTTCCCCTACATCCACGAACCAATCACTAATAAACCAAAGCATTACAAGTGAATATCTCACTATCACACTACAAAGACTCAGAACACATGTCACTCAACAAATACAATGTGACTCCGAAAGCTCTCCCAAATTCCGATACCAAATTCTGATATCAAATCCCGATATATAACTAAACCCTGGTGTGACTAAATCTAATTCAAATAACCAAATCAAATCTAATTCAAATAACCAAATCAAATATAATCCAAAAACCAAATCAAATCAAAACTTATGACAATTTACAACAATCTCCACCTTGTCATAAGATCTCCAAATCAAATCAAAACTTAATGACAATTTACAACAATCTCCACCTTGTCATAAGATCTCCAAGTTGATAACTGAGTTGGTTGCTACCACCTCAAGCCTCAATACGTAAGTTGGGAACTCCATCAATTATAGAGAAAGCATATGCTATCGTATCATCAAACCTTGCAGGTTTTCGAATTACCCTTCGTGGTCTATTCTTGGCGATAGACTCAGGTTCCTCCACTCTTTCATTTACAAGGCTTTGCTCTTTCTGTTCTAAGACATCAAAGTTATTATTTACCTCGTCAATAGGTTCTTCAGTAGGAGCAACCACATCAGGAGAGAACACCACCTTCTCCACCTGCTTCAATGCCTCGTCATTATTTTCAATTTTCTGCAAAAACATTCCAGACTCATCGAATGTCACATCTCGACTATTAACAATCTTACTTGTTTCTGGACACCATAACCTATAGCCCTTCACACCTTTGCTAAAGCCCATAAACTTGGCTTTTTTTGCTCTAGGATTCAGCTTTGAGTCTGTCACATGATAATAAGCAGGACACCCAAACACATGTAATTTATCATAATCACTAACAGGACTACCTGACCATACCTCAAGCGGAGTTCTCCCACCGTTTCCAGAAACAGGTAAACGATTCACCAAGTGAACTGCATAACTGAGAGCCTCAGCCCAAAATGCCTTACTCAATCCTGCTTGAGACAATATGCATCGAACCTTCTCTATTAATGTCTGGTTCATTCTCTCAGCAACTCCATTTTGTTGTGGCTTACCAGGAACAGTGAAGTGTCGAACGATCCCTTCATCTCGGCATACTTTAAGGAATGGATCAAAGTATATTCTCCCCCGTTGTCTGATCTCAGCCTTTTGATTTTCTTGTCCGTCTGATTCTCTACCATTTTCTTCCACTCTAGGAAGATTTGGAGAACCTCGTTTTTGTGCCTCATGGGATACATCCAAACTCTCTTCGAGTAGTCATCAATAAAGGTGACGAACCACCTCTTTCCTCCCAATGAGACATTCTTTGTGGGACCCCATACATCAGTGTGAACATAATCCAAAATGCCCTTTGTTTGGTGTATAGCTGTACCAAACTTCACATTTTTTTGTTTGCCAAATATACAATGTTCACAGAAATCAATTCTTCCAGTTGTGGCACCCTTAAGAACTCCTTGGTTCACCAGTGTCTGAAGCGCCTTCTCACCAGCATGCCCGAGTCGCATGTGCCATAATTTCGATATGTCTTGTTCTTTCGAACTTGCCACAGCAGCATGCCCAATTATTGTGGTTCCGTTGAGGTAGTATATGCTACCTCTTCTAGTTCCTGTAATTGCCACCAAAGCTCCACGAGCTACTTTCAAATTCCCTCCTTCCAAAATGATGCGATAACCACTCGCATCCAAAACGCCAAGTGAAATCAAATTTTTCTTCAAGTCAGGGACATACCTCACATCTTTGAGTTTTCGTATAACCCCGTCAAACATCTTGATTTGTACTGAGCCGATTCCCATTATCTTACAAGTACTATCATTTCCCATATAGACAACTCTACCATTGAACTCCTTGAAGTCCAAAAACATCTCTTTGTTGGGACACATATGATAGGAGCACCCAGAATCAAGAATCCATTCACTGGTTTGGCTGACTGATAGGGAGATTGTCAAGGCATCATCTGTATCAGTGTCGTCATCTCTAACCACATTGGACTTCGACCCCTTCTCTTTTAAGATCGAACAATCTTTTTTCCAATGGCCTTTCTGTCGACAATATGCACACTCATTTTTATCAAGTTTTCTCCAATTATCAGATCTACCTCTCGACTTTAGTCGAGGCTCTCCACATTCGTTCTTTTTCCAATTGGAAGTCCTTCCATGAGCAGTGAGAACATTTGAGCTCGAGCTTTGATACGTCTTCTTTTCCTTCTTTCGCATCTCATTGTTTATTAAGGCATTTGAAACATCTTCAAAATCAATATCAGTAATTCCATGTAGTAGGGTTGTTACTAAAAACTCGTAGGATTCCGGCAAGGAGTTTAATAACAACAACGCCTTGTCCTCGTCATCAATCTTAACATCAAGATTGAGCAAATCAGTGATGATTTTGTTAAAATCATCCAAATGCTCAGCCATTGAGATGCCTTCCTTGTAATCAAAACGAAAGAGTTTCTTCTTCAAATAGAGTTTATTTTCCACACTCTTCTTCATATATTTCGCTTCAAGCTTGTCCCATAATTCTTTCGCCATGGTAACCTTCATAAATGGATATTTCTGATCCTTGCCAAGACAGGATCGAATAATTCCACAGGCTTGCCGATTTAATTTCGTCCATTCTGCTTCTGTCATGTCTTCAGGCATTTCCTCTTGGAGAGAGACATCGAGGTCTTGCATGACCAATAAATCGCTAACTTCGCCTCGCCAAACACCAAAATTATTGGTTCCATCAAATTTTTCTACTTCGAGATTGGAACTCCACACTCTTGATCCACGTCGTGATGGTTCTGCCATTTGCAATAATTAATCTGAATGAAACTCAAAAACCCAATTCTTGAAAATTGATTTTCAAGAGAATAAACACGATTGTGAGCACGTCTAATGGTCGATGAGGCCCAAGCTTTCACAATACCCCAAAATACCACAACATAGCTCTGATACCACTTGTTGTGCGCCGAAAGCGGAACAAAGGTACTCACAACATAAATGAACGATTTTACACAAAGAGGCACTAAGATATTTACGTGGTTCGGAGAGAAGTTCCCCTACATCCACGAACCAATCACTAATAAACCAAAGCATTACAAGTGAATATCTCACTATCACACTACAAAGACTCAGAACACATGTCACTCAACAAATACAATGTGACTCCGAAAGCTCTCCCAAATTCCGATACCAAATTCTGATATCAAATCCCGATATATAACTAAACCCTGGTGTGACTAAATCTAATTCAAATAACCAAATCAAATCTAATTCAAATAACCAAATCAAATATAATCCAAAAACCAAATCAAATCAAAACTTATGACAATTTACAACATATGATATTGTCCACTTTGAGCATAAGCTCTCATAGCTTTGCTTTGGGTTTCCCCAAAAGGCCTCATGCCAATATTAGCCGATGTGAGACTTTCATCATCCAACACCTCCCCTCGAACAAAGTATGCCTCCCCTTAATCGAGGCTCGACTCCTTTTTCTTTTGGAGTCTTGGTCATTTTTGACTATGCCTTCGAGGAGTCTCGACTTCTTTTCTTTTGGAGTTCTTTGTTCGACATTTGAGGATTCTATTGGCATGGCTAAATTTAGGGCATGACTCTTATACCATGTTAGACGAACACGACTCTCCACAATGGTATGATATTGTTCACTTTGAGCATAAGCTCTCATGGCTTTGCTTTCCCTTCCCCAAAAGACCTCATGCCAATGGAGATAGTATTCCTGGATTATAAACCCATGATCATCCAACAAAAAGCACATTTATCAAATGGTTTATTTTTCTTTTTGTTGCATGCTTGTAGAATCTTCAGTTAGCCGTTAGCACCGATGATCTTTTTATGTCTATTGCCATGCATACATTATTTTTGCTCTTTATGTACGAGTAAGGAAATAACTGTGTTTTCTTTTTCCTTTGAGAGGGAAATGGAATTTCAATTATGTAGTGAAGTTTACAAAAGATTAGTGGGGTAGCGAAAAGCCCTTTCAGTTGGAGCACAAGTATCACAGAAAGAAAATTCTTTGCTTCCACCAAGTGGCTGCTGTGTGCAAACTAAGATCAAATAAGTTTAGTTTCACTACTTTCCTTCCACTGAAAGTTATTTACTCCTCTCAAGCCAAGTGGTTCAAAGGAAAACTGTGATCAAGTTCAACCATAGAGATTTTGATTGCTTTTTCTAGGGGTGAAGCTACCATAATAGTGTGTTTGTTAATCACAACATTCCTGTGCAATGGGGTGGTCCTTCCAAATGCCTGAAAAAGAGCATCCCAGGAAGCAGAATCGAAATGGTGGCTTACAGAAATATGATCATGGTTCTAGACCATTCAATTCCTTAACAAATCCAATCGTCTATCTCGAGACTTGATCCTGAAATACTAAACATGATGACGATAAAATATAACTCTATAGTGGCCGTCATGGGTAAGCTTTTATATGAAGATTAGACTATTAGATAGTTTTGGGACGTATAGGACATTTTTTAGCGTAAGGGTTTGAGAGATGAATATATCACTGGCTTCTGCAATGGTGGCAATGGTTCCATTTGCTATTATAAGTTTTTGATTACTCGGGCATGGTTTATATGTTTTAAAATATTGTGAATATGGATTCATGTGATCCGTTGCATTGGAGTCTAATCCAAGAATTATCCTTGTTCATCGCTGAAGAATTTATACAAAAGGTGAATGAAGATTTACCTAAGATTTTCGAGTCATTTGAACTTGATGTTTTGTGGAGGGAGTCCAATAATACCCTCAACCTCTTCATTATTGAATCCATTAGTCTTCGCAATTGGTGGAATTTGTTCTTATTTGGACTCCTTTGTGATGTACGCTCGTTCCTTCGACATGTCTCCCTTGTTGATCCAACCACGGTTTTGTGAGCCTCCTCGATTTCTCCATTCATGGTTGGGAGGTTTTCCATTCAACTTCCAACAACTCTCCACCGTGTGTCGCGGTTTCTTACCGTGGGTACACCATAAATTATCTTTATTACTCTTCCATTGGTTGTCGCTTTTGTTTGTATCTTTCTCTGTTTATCAAGCATAAGGCTTCACCTGCTCTAATAATGGAAATTGTTTCTTCTAGGATGATGCTTCAAACTCGAATGAACACTCATATCGGAAAACTTCTTCAAGTGAGAGTGTTATAAGAAAAAAGTTAGAACATAATTTTCAAAATGTTCCTCTAGTGTATGAGACAACCAACTACACCAGCCAGACAAGGTGAGATTCCAAAATAACGATTTGAACTCGAATCAACCGTGAAAATGTTGTTAAGGAAAAAGTAAAAACATAATTTTAATAATGTCTGAGCCCAAGTTCACACTGGGGACCAATAGTGTGTGAGAGTGTGTGAGACTAGCGTGATAACCAACTACACCACCCAGACAAGGTGAGATTCTAAAATAACGATTTGAACTCGAATCAACAGTGAATATGTTGTTAAGGAAAAAGTAAAAACATAAATTTAATAATGTCTGAGCCCAAGTTCAAACTGGGGACCTATAGTGTGTGAGAGTGTGTGAGACTAGCGTGATAACCAACTACACCACCCAGACAAGGTGAGATTCTAAAATAACGATTTGAACTCGAATCAACCGTGAATATGTTGTTAAGGAAAAAGTAAAAACATAAATTTAATAATGTCTGAGCCTAGGTTCGAACTGGGGACCTCTAGTGTGAGACTAGCGTGATAACCAACTACACTAGAGATTCCAAAATAACGATTTGATCTCGAGTCAACAGTGAAAATGTTGTTAAGGAAAAAGTAAAAACCTCTAGTGTGTGAGACTAACGTGATAACCAACTACACTAGAGATCCCAAAATAACGATTTGAACTCGAATTCGACAGTGAAAATGTTGTTAAGGAAAAAGTAAAAACATAATTTTAATAATGTCTGAGCCCAGGTTCTGGGGACCTCTAGTGTGTGAGACTAGCGTGATAACCAACTACACCACCCAAACAAGGTGAAATTTCAAAATAACGATTTGAACTTGAATTAACAGTGAAAATGTTGTTAAGGAAAAAGTTAAAAACATAATTTTACCAGGACCTCTAGTGTGTGAGACTAGCGAGATAACCAACTACACCACCCAGACAAGATAAGAATCTGAACTAATTCTCTGAACTCGAATTTACACTCATATTTCGAAAAACTTCGAACTGGGGATCTCTAGTGTGTGAGACCTGCGTGTTGAGAGTGAAAGTTAAAACATAATTTTAAAAATGTTTAAGATTGAGGACTTCTAATGTATGCACTAGCGTGATAACCAACTACACCACCTAGACAAGATACAAATTTAAATTAACGCTTTGAACTCAAATCAACACTCATATTTTGCAAGACTTCTTAAAGTGAGAATGTTGTAAGTAAAAAGTGTTAAAATATAAATTAAAAAATGCCTGAGCATGGGTTTAAACTTGGGGCTTCTAGTGTGTTTGAGACTGATGTGATAATCAACTATACCACCAAGACAAGGTGAGAATCCAAATTAACGCTTTGAACTCGAATCAACATTCATACTTTGTGTGACTTCTTAAAGTGAGATTGTTGTAAGACGAGTTTAGAAACAAGTTTTTTTACTCTTGAACTTGTTTATCAAGGTAATCGTCCAAAAACTTTTCCTTGGGATGATTCCTTATTCTTGGTTTTCTCGGATAATTTAGATGTAAAGATTTTTTTGTCAACAAGATTAAAATGATAGGAAACTTTTGGAATTTTATTAATAATCTTGTTGAAAAGCAAAATCTTTAGATCTAAATTATCTCAGAACCCCAAGAACGACAACGAATCAACCCAAGGGAAAGTTTTGGAACGGATTACCTTGATAATCAAGATCAAGAGTAAAGAAAATTTGTTTCTAAACTCGTGATTCAAATGACTCCATAAGATAGTTTGATCAAAACTACTTGAAATGCAAAGTAACAAACTAATGGCTTAAGAAACATAAGCCAATTTCATTTTTAATACCCAAAAATCTTTCTAGTAAAACAATATTACATGAATGACTATTTATAGATTTGAAAATAAACTTTATAATAATAACCTATATGGTTAATTGAAGGGCATTAACTTTCATGGCCATCAATACCATAACGACCACTAACTAATTATAACTTATAATTTAAAGATAATTCTCTAAAACTTAAAAAATACAATAAAAGTTTCTTAAAATCTAAATCATCCATGACATGATTAAGACACCTCATTTCTTATCACACTTTCTTGGTCTTTTCCCTACTCAACTCAATTGTTATCTCGCAGCCACGTCGTTTACCATATGAACTTGCATTGTGCACAATTTAGCACTTGGCTTGAACATGTCACATCTAACCTTCGACTTTTTTGCTTACAACTATTGACACTATTCCTCTGTTCTTTCAAAGGCACCCTACTTGGTATACCTTCTAACCATATGTTGCCCATTTGGAACTCCCAAGATCTTGGCAAATGAGCTTGGTTCATGCTTCATTGATGGGTAAGATTCTTCCTCGGGAGTCTAGCTTCAACCCACAAAGCTATTCTCGAATCTGACTCACAATCATCCTGCCCAATCTAGCACTCATGACACACTTGTGAAGTTTTCATTCAACCATTGACTTTATTTCCTAACTGCCTTGATCGATTTAGACACAAAGCGTGACTCTGATACCAACTGTCATAGTCATACTTTTTCAACTGTGTGACGTCATGATCTTAACATGCTTAAGACATTTCTTAAGGGAAATATTCTCCTTAAAACATATTCTCATTTGCTTGACTTTTTAGCTTTATCAGACCTCAGGCTAAGACATTTCTTAAGGGAAATATTCTCCTTAGGATTGTAGCATCGGCAGTGCAATTCATCTTGAATACACGATAATAATCGAGTTCCTGCCACAATGTTGGTAAGACATTGGCATATTCAGCGACAGTTTTTACTTCTTGCTTCATAGCTCTCGTCTTGATTTGAATTTCATAGATCGAAGTAGCATCTCTAGCCTTGGAGTAAATGCAATGAATAAAATCTCATATGGCTTGTGTTAGAATCTCAAAGCCATGACATTATCATGGAGTCTTCCTCGTCCCATGTAGTAAATCTAGGATCCGTCTTTGCTGGTCTTGGTCCAATGAGGTGTTGGAGTTTTCCCTTTTTTTAAGGTACGTTCCCACAATTTGGACCATTTGAGGAAGTTTTTTCTGTTGGGTCGGTATGACAATCAGATGGATTGAAGGTGATAAGGAATCACTACAAGGGAAAGTAAGATTTGAATTACTTTGTTGATCGAATATCTCAAGATAAGAACACTTGTTTGAGATTCGAATCACTCCACAAACAAAATCCATCACATGCAACCTAAACTAGATAGAATTGTAAAAAAACTTAGCCATTGGCTAAAGGAAATGACAAATATTCTTTTTACTACATTTTCCCAAGTCTACTTTACAAATATAACATACATGGCTTTATATAGCCTCAAAATGAAAATTCTTGATCTTCTATGAGAGGCATTTCACGAGTCATACCTTCATACTTTAGGACCATAGTAATTGAGCTGCATACTTTTTATCTCACATGTTGTAACCTTTCCTTTGTAATCAATATTATCATTATGACTATCTGACTTTTGATCTCTAAAGTGAGAAAATGACAAGACTATAGTTATGTTTGAATAATCTAAACACTAACCGTGTGCTGACATGTGCATGAAGTAATTTTTATTTTCCTGGTCTAATATTTTACAAGTTCTCAGGGTTGAGTTTTGGAGTTGGTGCAATGATGACTTTGGACGGAACCCTGCTTTTCGGTGAGATAATGCAAGCAAAAGAGGTTAGTTCTCACATATCTTATCGGTCAATATTTATTTGTTAAGACAACCTCTAGGGGCTAGCATGTGTTGGATGAAAGTCCCATATCGGCTAATTTAGGGAATGATCATGGTTTATAATAAAAAAATATTATCTCCATTGGTATGAGGTCTTTTGGAGACGCCCAAAGCAAAACCACGAGAGCTTATGTTCAAAGTGGACAATATCATACCTTTGTGGAGAGTTGTGCTCATCTAACTTGGTATTAGAGCAATGCCCTAAATTTAGCCATGTTAATAGAATCTTCAAATATCGAACAAAAGTATTGTGAGCCTTGAAGGCATAGTAAAAAATGACTAAGACTCCAAAAGAAAAAGGAGAACAAAAAAGTATGGTGAGCCTCGAAGGCATAGTAAAAAATGACTAAGACTCCAAAAGAAAAAGGAGTCGAACCTTGATTAAGGGGAGGTGTACTTTGTTCGAGAGGAGGTATTGGATGAAAGTCCCACATCGATTAATTTAGGGAATGATCATGGGTTTATAATCATCTATCTTCATTGGTATGAGACCTAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACTATTGTGGAGAGTCCTGTTCATCTAACAAGCTAGCATGAATTCTAGTTTCCTGTAATTAGGTTGTTAAAAAGGTAATCTCACTACATTTTGTAATATCATTATTCGATTTTCAAACTTGTATTTATATGTAACGTTGACTTGTTTGCAGCACTTGCGGGAACAATACAATGAGTTATTTCCTGCGTTATGTAACAACCATCCTGACGTTTTCCCTGAGGAGTACTACTCATGGGAGAAGTTCTTATGGGCTTGTGAACTTTGGTATTCAAATAGCATGAAAATCATGTTTTCTGATGGAAGTCTTAGAACCTGCTTGGTTCCGATTGCAGGTTTTCTCAATCACTCGGTATGTTACTGTAGTGTCCGTCCCCTACCAGAGTCCTGCTTTGTGGATGGGGGATGAGTTTAATTTGAGAAAATTGTACGAAATGTAGTTTAATTCACTTGAACTCGGAATCCGCTCCTTGTTTGGTATCTGCTGTTACCTCTGTTCTTACTTCGAATTGATTTCCTTTAGTTACATGACGGTCAATAGCTTCCAAATTTGTGATTGTGCTGTTGACCATAGTTTCTGAGAGATCCCACGTCGGTTGGAAGGAGAACGAAACACCCTTTATAAGGGTGTGGAAACTTCTCCCTAGCAGACGTGTTTTAAAAACCTTGAGGGGAACCCCGAAAGGGAAAGCCCAAAGAGGACAATATCTACTAGCGGTGGCCTTGGGCCGTTACAAATGATATCAGAGTCAGACACCGGACGATGTGTCAGCGAGGAGGCTCTTCCCCGAAGGGGGTAGATATGAGGCGGTGTGCCAGCCAGAACACTGGGCCCCGAAGAGGGGTGGATTTGGTGGGGGTCTTACATCGATTGGAGAAAGGAACGAGTGCCAGCGAGGACGCTGGATCCTAAAGGGGGTGGATTGTGAAATCCCATATCGATTGGGGAGGAGAACGAAACACTCTTTACAAGGATGTAGAAACCTCTCCCTAGCAGATGCGTTTTAAAAACCTTGAGGGGAACCTCGAAAGGGAAAGCCCAAACAGGACAATATCTACTAGCGGTGGGCTTGGGCCTTTACAGTTTCATATTTCTAAAACATAATTTTTAACATTAAACTCAGATGCATTCTATTCCTTAGTTCTTCGATTGGAAGTATTTACTTTACTATTTAGAAATTGAGTTCCCATTTTCCTTTGCTTCATTGTTATATGGTTCTTTGGTTCAGTTGCATCCACACATACTACACTATAGCAAAGCTAATTCAGATACAAATTCATTGAAATTCCGCCTATCAAGACCATGCCGTGCAGGAGAAGAATGTTATCTTAGTTATGGAAATTACTCTGCTTCTCATCTGGTTGCTTTTTATGGATTTTTACCTGAAGGAGACAACGTAAATGATGTCATTCCATTAGGTACTCTCTTGTTCTGTCTAGTGGACAATTGAAGGATGCTTTGTTTACTTCCATATGTGATCTATGTATGACTTTCTTATATCACTGTTGTTTATTTGTCGCGTGCAGACATCGACTTTGGTGATGATGCTAGCAATAGCAACACATCAGACTGGAGTACTCATATGGTGAGGGGAACATGGTTGTCAAAGAACCAAAGCATATTCCATTATGGTCTGCCCTCACCGTTATTAGAGTGTTTACGTAAAGCTCGATGCCCTGAATTACGCACCAAGCTTAAGGTAATTCTTATCTCGTTTCGCTCATTTCTCAATCGTAACGTTGTAGGTCAAGGATACAGGATTTTGTTTATCAGTTATTATTACTTGTTTATCTTAACTTCGTAGCTGCAGGGAAGCTTGGAAAATGAAATGGAAGTCCTTAATGATCTCCTCTCAATCTTTGATGGAATGATGGAAAATCTTGAGGATGTGAATGAAGACAGGTAATGAAGCATTTTCGAAACAGTAATAGAGTATAAACTATACAATTTCGAAAGCAATGTCGTCTCGAAACATCGCTTTTCCATGGAATAAACTTGAAAGTTTTGGTCCTTGCAGGAGCAGTACAGAATGGGATATAAAGTTAGCATTGAACTATAAAGATCTTCAAAGGAGGATAGTTTCCTCATGTCTGAATTCATGTCATGCTGGTTTGAAGATGGTCGAACTTGCATTATACGAATGCATGGAAGAGGACACTCGAGGCTAAAAACACGAGCGGACAGACGTGCTCGTTCCAGGTAAGCCAGTTTTGAGATTTTATCACAAAGCAGATTGTTAGTGTCTTGTAAGTAGATAAGAGTATTTGTATCAAAATTAGGCCCCAAAAGGTGGGATTGACTGATCCGATCATGTTGTGAGCTTTTACTTGGATATGTATGATGTAAGCTGCATGGAGTCTGCTCTTCATAGACAAACTTGGGATTTGCGATACCAGGTAAGATCGGTTCAGAATTATGGGATCCTATATCTATCTTTATGAAGAAGAAATCATGTAACAAAAGGGGTCCTTTTTTGTACAAATGATACTTCGAACTTAGAACGAGCATAAAGAAATTAACGATACTTTTTTTATCTTTTGAGTTGTTATGTTATAGAAGTAATTCTTGTGTTTAATGGAATG

mRNA sequence

ATGAAATTACCTGGAAAATGTTGCAGAAGAGTGGGTATGGCGGAAAGAAAGGCATTGGGAAGACGTACGGTTTCCTCCATGGATGGGGAGAATCAGCGCAGTGGAAGCCATCGCCAAAGACCAGCGTTTCTTGACATCAACCCTTATCTATTGCCTTCCGCATCTGATGAAGTAAGAGATGTCAACGATTGCTCAATATGTCTTGCATTGAACCAAAATGACCATCTTTTCAGCAAAAAGAAGAAATTATTAGAAAGACAGGGTTTCAAGTCTGAGAATTGCATTTACTTGAAGTGCTCTCTTTGTCCTGAGGAAGTAGACACTGTTCTGAAAGAGTTGGTACAAATTGCAAGAATTATTCACTTAAATGAGCCTGAAATGTATTTTGAAGAAGATGACGCATGTACACCAGCAGATTCCTACAGCCCCAGGAATGAAATGGAGGCCCTCAATACAATAATTTCTCTTGTTGACATCTGTCTCTCTAGTTGTAAGCCTGTCCAGCTTAATGTCCTGCAAGAGCTACGGAAAGCAGCTATTCGTATGATCCATAAGTATGGACATGTATACAGTATGGATGCTAAAACTTTGGGAGACAATTGTGTGAAAGAAAATTGTTTGTTACAGTGGGGTGAGAGCAATGGCGTCAGGACAAGGTTGAAGATAGCTTATGTTGAAGGTGCTGGTAGAGGAACCATAGCCAAAGAAGATCTAAACGTTGGTGACACCGTATTGGAGATCCCTCTGGATATTGTTATTTCTGAGGAACTTGTGCAGAAAACCACCATGTATCCCATATTATCAAAGATCGAAGGCATGTCGTCTGAGACAATGCTGTTGATATGGAGCATGAAGGAGAAGCACATTGTCGATTCCAAATTCAAGTGTATGAGACAACCAACTACACCAGCCAGACAAGGGTTGAGTTTTGGAGTTGGTGCAATGATGACTTTGGACGGAACCCTGCTTTTCGGTGAGATAATGCAAGCAAAAGAGCACTTGCGGGAACAATACAATGAGTTATTTCCTGCGTTATGTAACAACCATCCTGACGTTTTCCCTGAGGAGTACTACTCATGGGAGAAGTTCTTATGGGCTTGTGAACTTTGGTATTCAAATAGCATGAAAATCATGTTTTCTGATGGAAGTCTTAGAACCTGCTTGGTTCCGATTGCAGGTTTTCTCAATCACTCGTTGCATCCACACATACTACACTATAGCAAAGCTAATTCAGATACAAATTCATTGAAATTCCGCCTATCAAGACCATGCCGTGCAGGAGAAGAATGTTATCTTAGTTATGGAAATTACTCTGCTTCTCATCTGGTTGCTTTTTATGGATTTTTACCTGAAGGAGACAACGTAAATGATGTCATTCCATTAGACATCGACTTTGGTGATGATGCTAGCAATAGCAACACATCAGACTGGAGTACTCATATGGTGAGGGGAACATGGTTGTCAAAGAACCAAAGCATATTCCATTATGGTCTGCCCTCACCGTTATTAGAGTGTTTACGTAAAGCTCGATGCCCTGAATTACGCACCAAGCTTAAGCTGCAGGGAAGCTTGGAAAATGAAATGGAAGTCCTTAATGATCTCCTCTCAATCTTTGATGGAATGATGGAAAATCTTGAGGATGTGAATGAAGACAGGAGCAGTACAGAATGGGATATAAAGTTAGCATTGAACTATAAAGATCTTCAAAGGAGGATAGTTTCCTCATGTCTGAATTCATGTCATGCTGGTTTGAAGATGGTCGAACTTGCATTATACGAATGCATGGAAGAGGACACTCGAGGCTAAAAACACGAGCGGACAGACGTGCTCGTTCCAGGCCCCAAAAGGTGGGATTGACTGATCCGATCATGTTGTGAGCTTTTACTTGGATATGTATGATGTAAGCTGCATGGAGTCTGCTCTTCATAGACAAACTTGGGATTTGCGATACCAGGTAAGATCGGTTCAGAATTATGGGATCCTATATCTATCTTTATGAAGAAGAAATCATGTAACAAAAGGGGTCCTTTTTTGTACAAATGATACTTCGAACTTAGAACGAGCATAAAGAAATTAACGATACTTTTTTTATCTTTTGAGTTGTTATGTTATAGAAGTAATTCTTGTGTTTAATGGAATG

Coding sequence (CDS)

ATGAAATTACCTGGAAAATGTTGCAGAAGAGTGGGTATGGCGGAAAGAAAGGCATTGGGAAGACGTACGGTTTCCTCCATGGATGGGGAGAATCAGCGCAGTGGAAGCCATCGCCAAAGACCAGCGTTTCTTGACATCAACCCTTATCTATTGCCTTCCGCATCTGATGAAGTAAGAGATGTCAACGATTGCTCAATATGTCTTGCATTGAACCAAAATGACCATCTTTTCAGCAAAAAGAAGAAATTATTAGAAAGACAGGGTTTCAAGTCTGAGAATTGCATTTACTTGAAGTGCTCTCTTTGTCCTGAGGAAGTAGACACTGTTCTGAAAGAGTTGGTACAAATTGCAAGAATTATTCACTTAAATGAGCCTGAAATGTATTTTGAAGAAGATGACGCATGTACACCAGCAGATTCCTACAGCCCCAGGAATGAAATGGAGGCCCTCAATACAATAATTTCTCTTGTTGACATCTGTCTCTCTAGTTGTAAGCCTGTCCAGCTTAATGTCCTGCAAGAGCTACGGAAAGCAGCTATTCGTATGATCCATAAGTATGGACATGTATACAGTATGGATGCTAAAACTTTGGGAGACAATTGTGTGAAAGAAAATTGTTTGTTACAGTGGGGTGAGAGCAATGGCGTCAGGACAAGGTTGAAGATAGCTTATGTTGAAGGTGCTGGTAGAGGAACCATAGCCAAAGAAGATCTAAACGTTGGTGACACCGTATTGGAGATCCCTCTGGATATTGTTATTTCTGAGGAACTTGTGCAGAAAACCACCATGTATCCCATATTATCAAAGATCGAAGGCATGTCGTCTGAGACAATGCTGTTGATATGGAGCATGAAGGAGAAGCACATTGTCGATTCCAAATTCAAGTGTATGAGACAACCAACTACACCAGCCAGACAAGGGTTGAGTTTTGGAGTTGGTGCAATGATGACTTTGGACGGAACCCTGCTTTTCGGTGAGATAATGCAAGCAAAAGAGCACTTGCGGGAACAATACAATGAGTTATTTCCTGCGTTATGTAACAACCATCCTGACGTTTTCCCTGAGGAGTACTACTCATGGGAGAAGTTCTTATGGGCTTGTGAACTTTGGTATTCAAATAGCATGAAAATCATGTTTTCTGATGGAAGTCTTAGAACCTGCTTGGTTCCGATTGCAGGTTTTCTCAATCACTCGTTGCATCCACACATACTACACTATAGCAAAGCTAATTCAGATACAAATTCATTGAAATTCCGCCTATCAAGACCATGCCGTGCAGGAGAAGAATGTTATCTTAGTTATGGAAATTACTCTGCTTCTCATCTGGTTGCTTTTTATGGATTTTTACCTGAAGGAGACAACGTAAATGATGTCATTCCATTAGACATCGACTTTGGTGATGATGCTAGCAATAGCAACACATCAGACTGGAGTACTCATATGGTGAGGGGAACATGGTTGTCAAAGAACCAAAGCATATTCCATTATGGTCTGCCCTCACCGTTATTAGAGTGTTTACGTAAAGCTCGATGCCCTGAATTACGCACCAAGCTTAAGCTGCAGGGAAGCTTGGAAAATGAAATGGAAGTCCTTAATGATCTCCTCTCAATCTTTGATGGAATGATGGAAAATCTTGAGGATGTGAATGAAGACAGGAGCAGTACAGAATGGGATATAAAGTTAGCATTGAACTATAAAGATCTTCAAAGGAGGATAGTTTCCTCATGTCTGAATTCATGTCATGCTGGTTTGAAGATGGTCGAACTTGCATTATACGAATGCATGGAAGAGGACACTCGAGGCTAA

Protein sequence

MKLPGKCCRRVGMAERKALGRRTVSSMDGENQRSGSHRQRPAFLDINPYLLPSASDEVRDVNDCSICLALNQNDHLFSKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEMYFEEDDACTPADSYSPRNEMEALNTIISLVDICLSSCKPVQLNVLQELRKAAIRMIHKYGHVYSMDAKTLGDNCVKENCLLQWGESNGVRTRLKIAYVEGAGRGTIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYPILSKIEGMSSETMLLIWSMKEKHIVDSKFKCMRQPTTPARQGLSFGVGAMMTLDGTLLFGEIMQAKEHLREQYNELFPALCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSLRTCLVPIAGFLNHSLHPHILHYSKANSDTNSLKFRLSRPCRAGEECYLSYGNYSASHLVAFYGFLPEGDNVNDVIPLDIDFGDDASNSNTSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPELRTKLKLQGSLENEMEVLNDLLSIFDGMMENLEDVNEDRSSTEWDIKLALNYKDLQRRIVSSCLNSCHAGLKMVELALYECMEEDTRG
Homology
BLAST of CmaCh12G008780 vs. ExPASy Swiss-Prot
Match: P94026 (Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic OS=Nicotiana tabacum OX=4097 GN=RBCMT PE=2 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 6.5e-07
Identity = 63/272 (23.16%), Postives = 120/272 (44.12%), Query Frame = 0

Query: 209 QWGESNGV---RTRLKIAYVEGAGRGTIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYP 268
           QW    GV   +T +K   V   G G +AK D+  G+TVL++P    I+ + V ++ +  
Sbjct: 62  QWLCKEGVVTTKTPVKPGIVP-EGLGLVAKRDIAKGETVLQVPKRFWINPDAVAESEIGN 121

Query: 269 ILSKIEGMSSETMLLIWSMKEKHIVDSKFKCMRQPTTPARQG-LSFGVGAMMTLDGTLLF 328
           + S ++   S  + L   ++EK   DSK+K        +    + +    +  + GT L 
Sbjct: 122 VCSGLKPWISVALFL---LREKWRDDSKWKYYMDVLPKSTDSTIYWSEEELSEIQGTQLL 181

Query: 329 GEIMQAKEHLREQYNELFPALCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSL 388
              M  K++++ ++ ++   +   +  +FP    + + F WA  +  S +   + +   +
Sbjct: 182 STTMSVKDYVQNEFQKVEEEVILRNKQLFPFP-ITLDDFFWAFGILRSRAFSRLRNQNLI 241

Query: 389 RTCLVPIAGFLNHSL------HPHILHYSKANSDTNSLKFRLSRP--CRAGEECYLSYG- 448
              LVP A   NH+       H H +    A   +  L F L  P   +AG++ ++ Y  
Sbjct: 242 ---LVPFADLTNHNARVTTEDHAHEVR-GPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDL 301

Query: 449 NYSASHLVAFYGFLPEGDNVNDVIPLDIDFGD 468
           N S + +   YGF+ E  +  D   L ++  +
Sbjct: 302 NKSNADMALDYGFI-EPSSARDAFTLTLEISE 323

BLAST of CmaCh12G008780 vs. ExPASy Swiss-Prot
Match: P58467 (SET domain-containing protein 4 OS=Mus musculus OX=10090 GN=Setd4 PE=1 SV=2)

HSP 1 Score: 56.2 bits (134), Expect = 1.5e-06
Identity = 59/264 (22.35%), Postives = 108/264 (40.91%), Query Frame = 0

Query: 218 TRLKIAYVEGAGRGTIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYPILSKIEGMSSET 277
           T L  A   G GRG ++K  L  G  ++ +P   +++ + V ++++ P + K +   S  
Sbjct: 47  TDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIRSSLGPYIKKWKPPVSPL 106

Query: 278 M-LLIWSMKEKHIVDSKFKCMRQPTTPARQGLSFGVGAMMTLDGTLLFGEIMQAK-EHLR 337
           + L  + + EKH              P     S+     +  +   L    ++AK E  R
Sbjct: 107 LALCTFLVSEKHAGCRSLWKSYLDILPK----SYTCPVCLEPEVVDLLPSPLKAKAEEQR 166

Query: 338 EQYNELFPALCNNHPDVFP------EEYYSWEKFLWACELWYSNSMKIMFSDGSLRTC-- 397
            +  +LF +       + P      +  +S+  FLWA   W + + + ++     + C  
Sbjct: 167 ARVQDLFTSARGFFSTLQPLFAEPVDSVFSYRAFLWA---WCTVNTRAVYLRSRRQECLS 226

Query: 398 -------LVPIAGFLNHSLHPHILHYSKANSDTNSLKFRLSRPCRAGEECYLSYGNYSAS 457
                  L P    LNHS  PH+   +  N  T   + R +  CR  +E ++ YG +   
Sbjct: 227 AEPDTCALAPFLDLLNHS--PHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQ 286

Query: 458 HLVAFYGFLPEGDNVNDVIPLDID 465
            L+  YGF+    N +  +P+  D
Sbjct: 287 RLLLEYGFV-SVRNPHACVPVSAD 300

BLAST of CmaCh12G008780 vs. TAIR 10
Match: AT2G18850.1 (SET domain-containing protein )

HSP 1 Score: 523.9 bits (1348), Expect = 1.7e-148
Identity = 273/531 (51.41%), Postives = 370/531 (69.68%), Query Frame = 0

Query: 66  ICLALNQNDHLFSKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEP 125
           +C+ L ++D  +  KKK L  +G   +  + L  SL  + ++  L++L+   RI++L++ 
Sbjct: 14  LCVELPKDDPFYHHKKKFLSCKGLCVKETLNLSGSLSQQLLNAALEKLLHFGRIVNLDKV 73

Query: 126 EMYFEEDDACTPADSYSPRNEMEALNTIISLVDICLSSCK-PVQLNVLQELRKAAIRMIH 185
           E+YF E DACTPA  YS RNE+ AL+ I+SL+ +   SCK   Q++  + LR A    I+
Sbjct: 74  EVYFGE-DACTPAGIYSVRNEISALSWILSLIPV---SCKMQTQVDTFEALRAALKGRIN 133

Query: 186 KYGHVYSMDAKTLGD-NCVKENCLLQWGESNGVRTRLKIAYVEGAGRGTIAKEDLNVGDT 245
           +        A+ +    C KE+ L++WG+ NGV+T+L+IA ++G GRG IA EDL  GD 
Sbjct: 134 EVVGAEKEKARVVDSYRCEKESKLVEWGQDNGVKTKLQIAQIDGYGRGAIASEDLKFGDV 193

Query: 246 VLEIPLDIVISEELVQKTTMYPILSKIEGMSSETMLLIWSMKEKHIVDSKFKCMRQPTTP 305
            LEIP+  +ISEE V  + MYPIL   +G++SETMLL+W+M+EKH +DSKFK    P   
Sbjct: 194 ALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSKFK----PYFD 253

Query: 306 ARQ-----GLSFGVGAMMTLDGTLLFGEIMQAKEHLREQYNELFPALCNNHPDVFPEEYY 365
           + Q     GLSFGV A+M LDGTLL  EIMQAKE LRE+Y+EL P L +NH +VFP E Y
Sbjct: 254 SLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERYDELIP-LLSNHREVFPPELY 313

Query: 366 SWEKFLWACELWYSNSMKIMFSDGSLRTCLVPIAGFLNHSLHPHILHYSKANSDTNSLKF 425
           +WE +LWACEL+YSNSM+I F DG L+TCL+P+AGFLNHS++PHI+ Y K + +T+SLKF
Sbjct: 314 TWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKF 373

Query: 426 RLSRPCRAGEECYLSYGNYSASHLVAFYGFLPEGDNVNDVIPLDIDFGDDASNSNTSDWS 485
            +SRPC  GE+C+LSYGNYS+SHL+ FYGFLP+GDN  DVIPLD D  DD        W+
Sbjct: 374 PVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYDVIPLDFDVIDDEDIETEFSWT 433

Query: 486 THMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPELRTKLKLQGSLENEMEVLNDLLSIF 545
           THM+RGTWLS N +IFHYGLP+PLL  LRKA      ++  L  +LE E+ VL +L S F
Sbjct: 434 THMLRGTWLSSNHNIFHYGLPTPLLNYLRKAHGLVHHSETDLWKNLEVEIGVLENLQSTF 493

Query: 546 DGMMENLEDVNE-DRSSTEWDIKLALNYKDLQRRIVSSCLNSCHAGLKMVE 589
           D MM+NL D +  DR + +WD+KLA+ +K+ QR+IVSS L+SC AG+K+V+
Sbjct: 494 DDMMQNLGDADSIDRENADWDVKLAMEFKERQRKIVSSILDSCSAGIKLVQ 535

BLAST of CmaCh12G008780 vs. TAIR 10
Match: AT2G18850.2 (SET domain-containing protein )

HSP 1 Score: 519.6 bits (1337), Expect = 3.3e-147
Identity = 273/531 (51.41%), Postives = 368/531 (69.30%), Query Frame = 0

Query: 66  ICLALNQNDHLFSKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEP 125
           +C+ L ++D  +  KKK L  +G   +  + L  SL  + ++  L++L+   RI++L++ 
Sbjct: 14  LCVELPKDDPFYHHKKKFLSCKGLCVKETLNLSGSLSQQLLNAALEKLLHFGRIVNLDKV 73

Query: 126 EMYFEEDDACTPADSYSPRNEMEALNTIISLVDICLSSCK-PVQLNVLQELRKAAIRMIH 185
           E+YF E DACTPA  YS RNE+ AL+ I+SL+ +   SCK   Q++  + LR A    I+
Sbjct: 74  EVYFGE-DACTPAGIYSVRNEISALSWILSLIPV---SCKMQTQVDTFEALRAALKGRIN 133

Query: 186 KYGHVYSMDAKTLGD-NCVKENCLLQWGESNGVRTRLKIAYVEGAGRGTIAKEDLNVGDT 245
           +        A+ +    C KE+ L++WG+ NGV+T+L+IA ++G GRG IA EDL  GD 
Sbjct: 134 EVVGAEKEKARVVDSYRCEKESKLVEWGQDNGVKTKLQIAQIDGYGRGAIASEDLKFGDV 193

Query: 246 VLEIPLDIVISEELVQKTTMYPILSKIEGMSSETMLLIWSMKEKHIVDSKFKCMRQPTTP 305
            LEIP+  +ISEE V  + MYPIL   +G++SETMLL+W+M+EKH +DSKFK    P   
Sbjct: 194 ALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSKFK----PYFD 253

Query: 306 ARQ-----GLSFGVGAMMTLDGTLLFGEIMQAKEHLREQYNELFPALCNNHPDVFPEEYY 365
           + Q     GLSFGV A+M LDGTLL  EIMQAKE LRE+Y+EL P L +NH +VFP E Y
Sbjct: 254 SLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERYDELIP-LLSNHREVFPPELY 313

Query: 366 SWEKFLWACELWYSNSMKIMFSDGSLRTCLVPIAGFLNHSLHPHILHYSKANSDTNSLKF 425
           +WE +LWACEL+YSNSM+I F DG L+TCL+P+AGFLNHS++PHI+ Y K + +T+SLKF
Sbjct: 314 TWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKF 373

Query: 426 RLSRPCRAGEECYLSYGNYSASHLVAFYGFLPEGDNVNDVIPLDIDFGDDASNSNTSDWS 485
            +SRPC  GE+C+LSYGNYS+SHL+ FYGFLP+GDN  DVIPLD D  DD        W+
Sbjct: 374 PVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYDVIPLDFDVIDDEDIETEFSWT 433

Query: 486 THMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPELRTKLKLQGSLENEMEVLNDLLSIF 545
           THM+RGTWLS N +IFHYGLP+PLL  LRKA          L  +LE E+ VL +L S F
Sbjct: 434 THMLRGTWLSSNHNIFHYGLPTPLLNYLRKAHG-------LLWKNLEVEIGVLENLQSTF 493

Query: 546 DGMMENLEDVNE-DRSSTEWDIKLALNYKDLQRRIVSSCLNSCHAGLKMVE 589
           D MM+NL D +  DR + +WD+KLA+ +K+ QR+IVSS L+SC AG+K+V+
Sbjct: 494 DDMMQNLGDADSIDRENADWDVKLAMEFKERQRKIVSSILDSCSAGIKLVQ 528

BLAST of CmaCh12G008780 vs. TAIR 10
Match: AT3G55080.2 (SET domain-containing protein )

HSP 1 Score: 43.9 bits (102), Expect = 5.3e-04
Identity = 45/174 (25.86%), Postives = 77/174 (44.25%), Query Frame = 0

Query: 326 EIMQAKEHLREQYNELFPALCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSLR 385
           E ++ K  + + ++ +  A   + P V   E    E F++A  L  S +      + S R
Sbjct: 53  ETVKQKAQIEKDFSFVAQAFKQHCPIV--TERPDLEDFMYAYALVGSRAW-----ENSKR 112

Query: 386 TCLVPIAGFLNH-SLHPHILHYSKAN--SDTNSLKFRLSRPCRAGEECYLSYGNYSASHL 445
             L+P A F+NH  L   I+   + N  S+ ++L+    R    G+E ++ YG +S + L
Sbjct: 113 ISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATL 172

Query: 446 VAFYGF-LPEGDNVNDVIPLDIDFGDDASNSNTSDWSTHMVRGTWLSKNQSIFH 496
           +  +GF  P   +    I +D+   D   N       TH  R     K+ +IFH
Sbjct: 173 MLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTV---KDINIFH 216

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P940266.5e-0723.16Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferas... [more]
P584671.5e-0622.35SET domain-containing protein 4 OS=Mus musculus OX=10090 GN=Setd4 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
AT2G18850.11.7e-14851.41SET domain-containing protein [more]
AT2G18850.23.3e-14751.41SET domain-containing protein [more]
AT3G55080.25.3e-0425.86SET domain-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.90.1410.10set domain protein methyltransferase, domain 1coord: 203..449
e-value: 7.8E-42
score: 145.4
NoneNo IPR availablePANTHERPTHR13271:SF103BNAA07G01600D PROTEINcoord: 59..596
NoneNo IPR availablePANTHERPTHR13271UNCHARACTERIZED PUTATIVE METHYLTRANSFERASEcoord: 59..596
NoneNo IPR availableCDDcd10527SET_LSMTcoord: 219..449
e-value: 2.7473E-45
score: 158.381
NoneNo IPR availableSUPERFAMILY82199SET domaincoord: 206..454
IPR001214SET domainPFAMPF00856SETcoord: 229..435
e-value: 1.4E-8
score: 35.3
IPR001214SET domainPROSITEPS50280SETcoord: 218..435
score: 11.584598
IPR015353Rubisco LSMT, substrate-binding domainPFAMPF09273Rubis-subs-bindcoord: 72..163
e-value: 7.1E-6
score: 26.7
IPR036464Rubisco LSMT, substrate-binding domain superfamilyGENE3D3.90.1420.10coord: 60..201
e-value: 6.9E-11
score: 44.1

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh12G008780.1CmaCh12G008780.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032259 methylation
molecular_function GO:0008168 methyltransferase activity
molecular_function GO:0005515 protein binding