CmaCh08G000190 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh08G000190
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Protein-serine/threonine phosphatase
Location: Cma_Chr08: 108994 .. 116710 (+)
RNA-Seq Expression: CmaCh08G000190
Synteny: CmaCh08G000190
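The Location field packs the chromosome, start, end, and strand into one string. A minimal sketch of pulling it apart is given here; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this page does not state.

```python
import re

def parse_location(loc: str):
    """Split a string like 'Cma_Chr08: 108994 .. 116710 (+)' into its parts.

    Returns (seqid, start, end, strand). Hypothetical helper, not part of
    any database API.
    """
    m = re.fullmatch(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr08: 108994 .. 116710 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 7717 bp on the forward strand of Cma_Chr08.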
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCCGTTTTTTGATTTTTGATTTTCGATTTTCGATCTTCTCTCCGTATCTTCCCCAAATCATTTTCTTCTCCAAATGCTTTGCGGATTTCCAGATCTCCCCTCCCAATTCTATTCTTCTCCCTACTCGCCTTTCTTTCTTCGAATCAATCATCGCGTTACCCTTTTTCTTCCGCCGAAGGAACAACAAAGAAACTCCCCGATTCCCAACCGAACTTTAATCTAATTTTCGAATGCCCCCATTTATTTCCCTTCAACAACATCCGTCCGCTTCTTCTCCCTTATTGCTTCTTCCACATTTTCTAACAACTTCATGCTATTTATGGTACTTTTCGGATCTCATTCGAACCATTTCTCGAAATTTTCTAAGAAACCCTAGGTGGGGTTAATGGGGAAACACACCAACTGTCTTAAAACTCAAGACGTCGAAGAAGGGGAAATCTCTGATACACCTTCAGTTGAGGAGATCACTGAGGAGGATTTTAATAACCTTGAAACTGTCCCTAAGCTGCTGCCTTCCAAACATTCCAATCGGGAGACTACAGTTTGGACCATGAGTGATTTGTATAACAATTATCCCACCATGTGTCGTGGTTATGCTTCTGGTCTCTACAATTTAGCTTGGGCCAAGGCAGTGCAGAATAAGCCTCTCAATGAAATCTTTCTTACGGAGGCCGACCCCGACGACAAATCCCACCGCTCTTCTTCCTCTCCTTTTCGGAATGCCAAGGAACATGGAAACGGTACAATAGAAGAGGCTGCTAAGCTCATCATTGATATTACCGGCGACGACATGAATACCAACAATGCGGATGTCGAGAAAGAGGAAGGCGAATTGGAGGAGGGCGAAATTGATATGGATACGGAGTTCGTAGAAGAGGTTGTTGACTCCAGACCAATGTTGTCCGACTCTCTCGATACTGACTGTCAGGAGATTGATTTCAAAAATAAGGAATTGGATGATCAGCTCAAATTGGTTCACAAAACATTGGATGGTGTTACAATCGACGCTGCACAGAAGTAAGTGAATCCATCTACTTACGTTGTTACGTTGTAGATCAATTGCTGAATGGTGCTATGTCATGCGTCCCCAATATGAAATTACCGGGTTGCTGGAAAATTTACTGTCTCGTTTCTTTCTTGTAGATCGTTTCAGGAAATTTGCTCCCAACTGCTTAGTTCTATAGAGACGTTTCTGGAATTGGTCCAGGGAAAGGTAGTCCCGAGAAAGGATGCGCTCATTCAACGACTGTATGCTGCTCTTCGAATAATCAATTCTGTAAGGCACCCCAAGAATCTCGGTCCTCTTTCGTTTGCCAAATAAATTTCATTTTGACGGGAAATCTTATTTAACGTATATTATTAACTGACGATAATGACAATGAACACTCATTTTCCATCTCAGGTGTTTTGTTCCATGAACCCCAAGGAAAAAGACGAGTATAAGCAACATTTATCGAGGTTTTTGCTCATCCTTGTCCGTAGTTTCTTTTTCATGTGAAGGGTTAGTCTGTTTGTCTACTATCTAACGACCACTTGCTGTGATATTGTTCATAGGTTGCTTTCTTTTGTTAAAAATTGCAACCCTGCTCTTTTTTCTCCTGAGCAGATAAAATCGGTACGGTATACTTTCCTTAGTCTGCTTGTTGTTACTCCTTATGCTGATTCTGGCAACTTATGGAGGAGGTTGTGATTAGGTGCTAAGATAAAGCCACAATATTATTCTACAGGTAGAGGTCAAAATGCCATCTACAGATTCCCTTGACCATTTCCCCGACATGAGAGACAGTGCTAAAGATGTCGAGATCCATATACCTAATGGGGTGAAAAATAAGGACTTTTATTCTGCATATGCAACTGCTACTCCACATTTAACTTCTTCAACTAAGTTGCCTTCAGACTCCATGCCTGTTGGGGTTACGGTAAAAAATAATCTAAATCTCTCATCAGACAGTTTGCTATCTGGCGTACCCAATGTAAAAGGTAGAGGTCCCCTACTCC
CTCTGTTAGACCTTCACAAGGATCATGATGTGGACAGTCTCCCATCACCTACCAGAGAAGCTCCTACAGTTTTTTCTGTCCAAAAATCAGGGCATATCCCTGTGAAGGTGGCACATGCTATGGATGGATCGAGAGTACATCCTTATGAAACTGATGCCCTAAAAGCTGTTTCAACCTATCAACAGAAGTTTGGTCGAAGTTCCTTTTCAATGGCTGATCGACTTCCTAGCCCAACCCCTTCAGAAGAATGTGATGGGGGTGGTGATATTGGTGGGGAGGTTTCTAGTTCTTCCATTTTCAGAAGCTCAAAAGCTTCAAATTCTTCTAAACTGGCTCAAACGGTGTCAAATTCTGCTTCTAGCATATCTACAGGTCTTTTTCCTAACCTGGAAAGTTCCAGCACTAAGGGACTGATTAGTCCTTTAAACGTTGCTCCTCCTAGTTCTGTGTCTAATCCAATAGCAAAGCCTTTGGCAAAAAGTAGAGACCCTAGGCTACGAATGGTCAATTCTGAGGCAAGTGCTATGGATCTTAATCCACGCACAATGACTTCTGTGCAGAATCCTTCTGTTGTAGAATCTGCTGTAACCGTAAACATGAGAAAGCAGAAGATGGATGTAGAGCCTAATATAGATGCCCCTGAAATGAAAAGGCAGAGGATTGGATCTCAGAATCATGCATTTTCTGCAAGTGATTTGAGAGCTGGCTCTGGAAGCGGTGGCTGGTTGGAAGATAATACCATGTCCGCTGTTCCCAGGCTTTCAAGCAGGAATCAGATGGAGATTGCTGAAGCAAATGCAATAGAAAAAAACAACGTTACAAACAATTCTGGTGCAGGAAATTCGCGTGGGCCTATGATAAGTGCTAGCAAAGAAGCTTCCTTGCCCTCACTATTGAAAGATATTGTTGTGAACCCAACCATGCTCTTAAGTTTACTTAAAATGAACCAACAGAAGCAAGTAGCTGCAGAATTGAAATTAAACTCGAGTGAACCTGAAAAAAATGCAATTTGTCCTACGGCCGTGAATCCTTGTCTAGGATCCAGTCCACTAGTAAATGCTCCTGCTGTGACCTCAGGAATTTTGCAGCAATCAGCAGGAACACCAAGTGTACCTTCACCACCGGTGGTCACTGTGGTAAGTGATTGATTACATCACAACTCTTTGCATTATAAAACTTCTCATTATTTTTGTCTCCTTGATTGATAAGGGATCGTTTACCAATCAAGGGGTTGGTAGTATCCTTTACACAAGTGCTTGGGCTCATCTTTCTTCTAATCTGATTCAAGAATTTATTAAATGTTCTTTTTTCCCTTTTTTTCTAATACAACTCATTCTTTGATTCAAACAAATGGTGAAGGGTGCTTTTTCCACGTTTTCTTGGATATATTCTAATTCTATGCCGTTGAAACATTTGTGAGATTGTTTATTATTTTCAATTTTTCTGTGTAGTGTAGTATCTGTATTTTTATTCTAGATTAATAATGTACTTATTTTAGATTCTGCACGTTTCACTAGGATGATGTGGGAAAAGTTCGTATGAAACCTCGTGACCCTCGCCGTATCCTCCATGGTAATTCTCTTCACAAGGTTGACAGCATGCGAAATGAGCAGTTAAAAAGTGTTGTACCAGCTGTTCCAAACCCAGAAGGAAGTAGGGATATAATACCAAATGGCCATAAGCAAGAAGGCCAGGGAAATTTGAGATTAGCTTCTTCACAACCATTACTACCTGACATTGGTAGACAGTTCACTAACAATCTGAAAAATATTGCTGATATCATGTCTGTTCCCTCGCCACCAACTTCTTCTCACAATTTATCTTCAAAACCAGTTAAATTGGACAGAAAGGATGCTAATGCTGTTGGGTCAAGCTCTATTGACAGTAAAATTGTGGCAACTGCTACCCAAGCGGTAGATATGGTTGGCCCCTCCCGTTCACATGGTGCATGGGGAGATCTTGAGCATCTTTTTGAGGGTTATGATGACAAGCAAAAGGCT
GCTATCCAGAGAGAAAGAGCAAGACGGATAGATGAGCAGAAAAAAATGTTTGCTGCACGCAAACTCTGCCTTGTTTTGGATTTGGATCACACCCTTCTGAATTCAGCTAAGGTTATTTTTCCTTTTACAAATGAAAACTGATTTCACCATCAAACTTGGAATTACAATCCCATGACTTGGTTTGTTCTTGTTCTAGTTTGTGGAAGTAGATCCATTGCATGATGAAATTTTGAGGAAGAAGGAGGAACAGGATCGTGAGAAGGTGCAGAGACATCTTTTCCGATTTCCTCATATGGGAATGTGGACCAAACTACGGCCTGGGGTCTGGAACTTTTTGGAAAAGGTAGGTTGAATATTGAAGTTGTTTTCAACACAACTAATCCAAATAATAGAAGACTCTTCTTTCATCTCTCTTTATAATTTTAAAAAAAATATTTTTTCTTTTAAAGTGTACTGGATGTTGAATGCACATTTTGGTGCTTGTGGTTTCTTTTGCATTTCAGGCCAGCGAGCTCTATGAACTTCATTTGTACACAATGGGAAACAAGTTATATGCAACGGAGATGGCAAAGGTTCTTGATCCAAAAGGGGTTCTGTTTGCCGGGCGAGTTCTTTCTCGAGGTGATGATGGAGACCCCTTGGACGGGGAAGAAAGGGTACCCAAGAGTAAGGATCTGGAGGGTGTTTTGGGCATGGAGTCTGCTGTTGTTATAATTGATGATTCCATCAGGGTGTGGCCACATAACAAAATGAACTTGATTGTCGTAGAAAGGCATGCATATCATCCATCCCTCTTAACTAAAAGATGCTTTTTACGTTTCATAACCCCACGAATTTGAGAGCTCATGACATTGCTTTTGCTGTCTTCCAATGCAGGTATACTTACTTTCCATGTAGTAGGCGCCAATTTGGGCTTCTGGGTCCTTCTCTTCTAGAGATTGACCATGATGAGAGACCTGAAGATGGTACTTTGGCTTCTTCACTGGCGGTAATTACGTGGATTCACTTATTCACCATATTGCATTGAATTAATCTATTTTACACCTAATCATCTTAAATGTTGGTCCAATAGGTTATCCAGAGAATTCATCAAACCTTTTTCTCCCATCCTGTATTAGATGAAGTAGATGTTAGAAATATCTTGGCCTCCGAGCAACAAAGGATTTTGGCTGGTTGCCGTATAGTGTTTAGCAGGGTTTTTCCAGTTGGTGAGGCAAATCCTCACCTGCATCCGCTGTGGCAGACAGCTGAACAGTTTGGTGCGGTGTGCACCAACCAGATTGATGAACAGGTTACCCATGTCGTTGCAAACTCGCTTGGGACTGATAAGGTAGTGTACCGCTCATTTAACTGTGGACTCGGGTCTGATGTGTATTTAGGAATTCATTGTTTCAATTTTTATGTGGAAATTTTGTTGAAATCTCCTTGCAACTGACCTGTTAATGAGATGAGAGTTTGCAGGCAGGCAGCTTGAGGCTAAAATATCATTGGACCGGGGGGATGTATAGATATAACACACACACACACACACACACACACACACACACATATATATCTGATTTAACTACTGAACCTATGCGATGTGTTGATGTTTGTAGTTGCCATTGTTTTGAACAGGTGAATTGGGCTCTCTCCACTGGCAGATTCGTGGTCCATCCAGGGTGGTGAGATTCTTGTTCCAAACTTCAAATTCTCGTAGATGTTTAAACTTGTAATTATATATTTGTTGGGTTTAAACTGTCTGAGTAATGCGCAGGGTGGAAGCATCGGCTCTACTTTACCGGAGGGCAAACGAGCAGGACTTTGCCATTAAACCATAAACAACCCACCCACCAGCAACCTTAACACACCCTTTGCTAAATTAAAAAACAAAAGATTATAATACAAGATGATATAACACGGAGAAGAAATTCAACACGTTGCCTTATATTTGGGTGGTAGTCTTAGGCAAAGTCCAAGATTACTAAGAGACACATCAACCCCCAATTTAAAACGCCCTA
CAACTACAAGAATGTGTGGAAGAGCTGTGAAAGAGAGATCATCAAGAAATTATGTGCCTCTCACCACATTGACTTTCGAGATCGAGTTTGATTAGATTTGAAATAATCAACGGTGAGGCCACGGGAAGGAGGAGAGAGGGGGGTTGCAAAGAAAGTGAGACGAATGGGTAGAAGGAATTTGGGTAAATTGTGAGTGTGGGGCGAATTTATACTCAACTCGAGGTTGGAAGCTTCTCATTTCTTTTTCTTCTGCTTTTGCTTTGAATTTTCGGTTGGAATTACGAGAGAGAAGAACACTAGGGATTTTATTCTTGATTTCGTCCTGTCCCGTTCGTTCTTGACCAAAATTCGTTGAGAATGACAAGGGGGAGGGGAAGGGGAAGGGCAAATGCGAATGTGAATGATATTGGGAGAATGACAGCATCTGAAAAAGGCTTCGGGTCACATGATAAATGACATCAGCAATCCCTGCAAATGAACCTTTTTTTTTTTGTTTGTTTCATTTTCAAACACTCTTTTCTTTTCCTTAAACAAGGAAAAAAAAAATGAAATTAGCATATACCAGATACGAGGTTTTTTTTTTTTTCCTTTCATTTTTTGTGTAAAGGATAAGCAAGCTTTGATAACAATTTCATCTTGTTCTTAACAAGCTTTTGTCTTTTTAGAATTTGTTTAGAAGACGAAAGGAAAGCCGAGAATGAATATCAATACAAAATTTGCAACAATAAAGCATTAGCGAGGTTGATTGAGAATTGTGAATGCTTTAAACTATGTTGCGTGCTTCCCAAGTCGTCACCCCCATTTTCTTCAAGCTCTTGGTCTGCATGTAAGCTAAAGTTATTGATTTCTTTTCATACAATTAATGACTCATTCTACACCAATTAATGAAAATACTAATTTTCTATTAAATTTAGGTGGTAAAAGTTGTAGTTGTTGAACTTATGGTTTCAATTTCCATTTTCAACTATTTTATTTGCTAAAACTTCACCATTTTAGGTAAAGATGTCTATAGGGCATGTTAAGTTGTGGGGGCTTGCATGACCTGACATAACCTATTTACAGCAAAACTCCCAATATCTGGCCATATAAGGTATGCACCAGGAGGCTTGAATTGCCGGATGGATATTCTAAAAGCTGAGTGTCAGGTAAGGAAAGTGGAGTAGCAAGCATCATTGGTGTATGAATAGGCTTACAATATCGATCATCTTCGCACCACAAAGGAGATCGGAAATGTAGTTCCATTGGGGAAGGTGCATGGATTGAGTATCCCGTATCAGCTGAATTCCAAGAAAGTAGGAATTAGTAAAAACAAGAATTTATTTGCGAACCAAGCATGTGGCTTTGTTTGAGAGCATGCAGAAAGGGGATTTATGCTGTTATCATCAAAGGCATCAGGTTGACGTATGAACCAAGCAAGCTTGTTATTTGATGGGTCAACCGAAGGAGAGTGGGTTGAATATTCCAGGCCTGGGATGGGTGATCGTGGTGTGGACGTTCAAGGGAGCTGTCGGTCGAGATGGTTTGAGGGTGGAAGATGACATTACTTTCCAAGTCTGGAATTCAATTAGTGAATGAGGTAGGGTTGTATTAATTGTAAGATTATATATGACACATCCTTCACTTTTACGTACTTTCAAGCTTAACCCACCAATGCACTTAAAAGGGAAACTACATTACTTCTACTTGGCATACCTACTCTTCAAATTTCTCGAAGGGA

mRNA sequence

TCTCCGTTTTTTGATTTTTGATTTTCGATTTTCGATCTTCTCTCCGTATCTTCCCCAAATCATTTTCTTCTCCAAATGCTTTGCGGATTTCCAGATCTCCCCTCCCAATTCTATTCTTCTCCCTACTCGCCTTTCTTTCTTCGAATCAATCATCGCGTTACCCTTTTTCTTCCGCCGAAGGAACAACAAAGAAACTCCCCGATTCCCAACCGAACTTTAATCTAATTTTCGAATGCCCCCATTTATTTCCCTTCAACAACATCCGTCCGCTTCTTCTCCCTTATTGCTTCTTCCACATTTTCTAACAACTTCATGCTATTTATGGTACTTTTCGGATCTCATTCGAACCATTTCTCGAAATTTTCTAAGAAACCCTAGGTGGGGTTAATGGGGAAACACACCAACTGTCTTAAAACTCAAGACGTCGAAGAAGGGGAAATCTCTGATACACCTTCAGTTGAGGAGATCACTGAGGAGGATTTTAATAACCTTGAAACTGTCCCTAAGCTGCTGCCTTCCAAACATTCCAATCGGGAGACTACAGTTTGGACCATGAGTGATTTGTATAACAATTATCCCACCATGTGTCGTGGTTATGCTTCTGGTCTCTACAATTTAGCTTGGGCCAAGGCAGTGCAGAATAAGCCTCTCAATGAAATCTTTCTTACGGAGGCCGACCCCGACGACAAATCCCACCGCTCTTCTTCCTCTCCTTTTCGGAATGCCAAGGAACATGGAAACGGTACAATAGAAGAGGCTGCTAAGCTCATCATTGATATTACCGGCGACGACATGAATACCAACAATGCGGATGTCGAGAAAGAGGAAGGCGAATTGGAGGAGGGCGAAATTGATATGGATACGGAGTTCGTAGAAGAGGTTGTTGACTCCAGACCAATGTTGTCCGACTCTCTCGATACTGACTGTCAGGAGATTGATTTCAAAAATAAGGAATTGGATGATCAGCTCAAATTGGTTCACAAAACATTGGATGGTGTTACAATCGACGCTGCACAGAAATCGTTTCAGGAAATTTGCTCCCAACTGCTTAGTTCTATAGAGACGTTTCTGGAATTGGTCCAGGGAAAGGTAGTCCCGAGAAAGGATGCGCTCATTCAACGACTGTATGCTGCTCTTCGAATAATCAATTCTGTGTTTTGTTCCATGAACCCCAAGGAAAAAGACGAGTATAAGCAACATTTATCGAGGTTGCTTTCTTTTGTTAAAAATTGCAACCCTGCTCTTTTTTCTCCTGAGCAGATAAAATCGGTAGAGGTCAAAATGCCATCTACAGATTCCCTTGACCATTTCCCCGACATGAGAGACAGTGCTAAAGATGTCGAGATCCATATACCTAATGGGGTGAAAAATAAGGACTTTTATTCTGCATATGCAACTGCTACTCCACATTTAACTTCTTCAACTAAGTTGCCTTCAGACTCCATGCCTGTTGGGGTTACGGTAAAAAATAATCTAAATCTCTCATCAGACAGTTTGCTATCTGGCGTACCCAATGTAAAAGGTAGAGGTCCCCTACTCCCTCTGTTAGACCTTCACAAGGATCATGATGTGGACAGTCTCCCATCACCTACCAGAGAAGCTCCTACAGTTTTTTCTGTCCAAAAATCAGGGCATATCCCTGTGAAGGTGGCACATGCTATGGATGGATCGAGAGTACATCCTTATGAAACTGATGCCCTAAAAGCTGTTTCAACCTATCAACAGAAGTTTGGTCGAAGTTCCTTTTCAATGGCTGATCGACTTCCTAGCCCAACCCCTTCAGAAGAATGTGATGGGGGTGGTGATATTGGTGGGGAGGTTTCTAGTTCTTCCATTTTCAGAAGCTCAAAAGCTTCAAATTCTTCTAAACTGGCTCAAACGGTGTCAAATTCTGCTTCTAGCATATCTACAGGTCTTTTTCCTAACCTGGAAAGTTCCAGCACTAAGGGACTGATTAGTCCTTTAAACGTTGCTCCTCCTAGTTCTGTGTCTAATCCAAT
AGCAAAGCCTTTGGCAAAAAGTAGAGACCCTAGGCTACGAATGGTCAATTCTGAGGCAAGTGCTATGGATCTTAATCCACGCACAATGACTTCTGTGCAGAATCCTTCTGTTGTAGAATCTGCTGTAACCGTAAACATGAGAAAGCAGAAGATGGATGTAGAGCCTAATATAGATGCCCCTGAAATGAAAAGGCAGAGGATTGGATCTCAGAATCATGCATTTTCTGCAAGTGATTTGAGAGCTGGCTCTGGAAGCGGTGGCTGGTTGGAAGATAATACCATGTCCGCTGTTCCCAGGCTTTCAAGCAGGAATCAGATGGAGATTGCTGAAGCAAATGCAATAGAAAAAAACAACGTTACAAACAATTCTGGTGCAGGAAATTCGCGTGGGCCTATGATAAGTGCTAGCAAAGAAGCTTCCTTGCCCTCACTATTGAAAGATATTGTTGTGAACCCAACCATGCTCTTAAGTTTACTTAAAATGAACCAACAGAAGCAAGTAGCTGCAGAATTGAAATTAAACTCGAGTGAACCTGAAAAAAATGCAATTTGTCCTACGGCCGTGAATCCTTGTCTAGGATCCAGTCCACTAGTAAATGCTCCTGCTGTGACCTCAGGAATTTTGCAGCAATCAGCAGGAACACCAAGTGTACCTTCACCACCGGTGGTCACTGTGGATGATGTGGGAAAAGTTCGTATGAAACCTCGTGACCCTCGCCGTATCCTCCATGGTAATTCTCTTCACAAGGTTGACAGCATGCGAAATGAGCAGTTAAAAAGTGTTGTACCAGCTGTTCCAAACCCAGAAGGAAGTAGGGATATAATACCAAATGGCCATAAGCAAGAAGGCCAGGGAAATTTGAGATTAGCTTCTTCACAACCATTACTACCTGACATTGGTAGACAGTTCACTAACAATCTGAAAAATATTGCTGATATCATGTCTGTTCCCTCGCCACCAACTTCTTCTCACAATTTATCTTCAAAACCAGTTAAATTGGACAGAAAGGATGCTAATGCTGTTGGGTCAAGCTCTATTGACAGTAAAATTGTGGCAACTGCTACCCAAGCGGTAGATATGGTTGGCCCCTCCCGTTCACATGGTGCATGGGGAGATCTTGAGCATCTTTTTGAGGGTTATGATGACAAGCAAAAGGCTGCTATCCAGAGAGAAAGAGCAAGACGGATAGATGAGCAGAAAAAAATGTTTGCTGCACGCAAACTCTGCCTTGTTTTGGATTTGGATCACACCCTTCTGAATTCAGCTAAGTTTGTGGAAGTAGATCCATTGCATGATGAAATTTTGAGGAAGAAGGAGGAACAGGATCGTGAGAAGGTGCAGAGACATCTTTTCCGATTTCCTCATATGGGAATGTGGACCAAACTACGGCCTGGGGTCTGGAACTTTTTGGAAAAGGCCAGCGAGCTCTATGAACTTCATTTGTACACAATGGGAAACAAGTTATATGCAACGGAGATGGCAAAGGTTCTTGATCCAAAAGGGGTTCTGTTTGCCGGGCGAGTTCTTTCTCGAGGTGATGATGGAGACCCCTTGGACGGGGAAGAAAGGGTACCCAAGAGTAAGGATCTGGAGGGTGTTTTGGGCATGGAGTCTGCTGTTGTTATAATTGATGATTCCATCAGGGTGTGGCCACATAACAAAATGAACTTGATTGTCGTAGAAAGGTATACTTACTTTCCATGTAGTAGGCGCCAATTTGGGCTTCTGGGTCCTTCTCTTCTAGAGATTGACCATGATGAGAGACCTGAAGATGGTACTTTGGCTTCTTCACTGGCGGTTATCCAGAGAATTCATCAAACCTTTTTCTCCCATCCTGTATTAGATGAAGTAGATGTTAGAAATATCTTGGCCTCCGAGCAACAAAGGATTTTGGCTGGTTGCCGTATAGTGTTTAGCAGGGTTTTTCCAGTTGGTGAGGCAAATCCTCACCTGCATCCGCTGTGGCAGACAGCTGAACAGTTTGGTGCGGTGTGCACCA
ACCAGATTGATGAACAGGTTACCCATGTCGTTGCAAACTCGCTTGGGACTGATAAGGTGAATTGGGCTCTCTCCACTGGCAGATTCGTGGTCCATCCAGGGTGGGTGGAAGCATCGGCTCTACTTTACCGGAGGGCAAACGAGCAGGACTTTGCCATTAAACCATAAACAACCCACCCACCAGCAACCTTAACACACCCTTTGCTAAATTAAAAAACAAAAGATTATAATACAAGATGATATAACACGGAGAAGAAATTCAACACGTTGCCTTATATTTGGGTGGTAGTCTTAGGCAAAGTCCAAGATTACTAAGAGACACATCAACCCCCAATTTAAAACGCCCTACAACTACAAGAATGTGTGGAAGAGCTGTGAAAGAGAGATCATCAAGAAATTATGTGCCTCTCACCACATTGACTTTCGAGATCGAGTTTGATTAGATTTGAAATAATCAACGGTGAGGCCACGGGAAGGAGGAGAGAGGGGGGTTGCAAAGAAAGTGAGACGAATGGGTAGAAGGAATTTGGGTAAATTGTGAGTGTGGGGCGAATTTATACTCAACTCGAGGTTGGAAGCTTCTCATTTCTTTTTCTTCTGCTTTTGCTTTGAATTTTCGGTTGGAATTACGAGAGAGAAGAACACTAGGGATTTTATTCTTGATTTCGTCCTGTCCCGTTCGTTCTTGACCAAAATTCGTTGAGAATGACAAGGGGGAGGGGAAGGGGAAGGGCAAATGCGAATGTGAATGATATTGGGAGAATGACAGCATCTGAAAAAGGCTTCGGGTCACATGATAAATGACATCAGCAATCCCTGCAAATGAACCTTTTTTTTTTTGTTTGTTTCATTTTCAAACACTCTTTTCTTTTCCTTAAACAAGGAAAAAAAAAATGAAATTAGCATATACCAGATACGAGGTTTTTTTTTTTTTCCTTTCATTTTTTGTGTAAAGGATAAGCAAGCTTTGATAACAATTTCATCTTGTTCTTAACAAGCTTTTGTCTTTTTAGAATTTGTTTAGAAGACGAAAGGAAAGCCGAGAATGAATATCAATACAAAATTTGCAACAATAAAGCATTAGCGAGGTTGATTGAGAATTGTGAATGCTTTAAACTATGTTGCGTGCTTCCCAAGTCGTCACCCCCATTTTCTTCAAGCTCTTGGTCTGCATGTAAAGATGTCTATAGGGCATGTTAAGTTGTGGGGGCTTGCATGACCTGACATAACCTATTTACAGCAAAACTCCCAATATCTGGCCATATAAGGTATGCACCAGGAGGCTTGAATTGCCGGATGGATATTCTAAAAGCTGAGTGTCAGGTAAGGAAAGTGGAGTAGCAAGCATCATTGGTGTATGAATAGGCTTACAATATCGATCATCTTCGCACCACAAAGGAGATCGGAAATGTAGTTCCATTGGGGAAGGTGCATGGATTGAGTATCCCGTATCAGCTGAATTCCAAGAAAGTAGGAATTAGTAAAAACAAGAATTTATTTGCGAACCAAGCATGTGGCTTTGTTTGAGAGCATGCAGAAAGGGGATTTATGCTGTTATCATCAAAGGCATCAGGTTGACGTATGAACCAAGCAAGCTTGTTATTTGATGGGTCAACCGAAGGAGAGTGGGTTGAATATTCCAGGCCTGGGATGGGTGATCGTGGTGTGGACGTTCAAGGGAGCTGTCGGTCGAGATGGTTTGAGGGTGGAAGATGACATTACTTTCCAAGTCTGGAATTCAATTAGTGAATGAGGTAGGGTTGTATTAATTGTAAGATTATATATGACACATCCTTCACTTTTACGTACTTTCAAGCTTAACCCACCAATGCACTTAAAAGGGAAACTACATTACTTCTACTTGGCATACCTACTCTTCAAATTTCTCGAAGGGA

Coding sequence (CDS)

ATGGGGAAACACACCAACTGTCTTAAAACTCAAGACGTCGAAGAAGGGGAAATCTCTGATACACCTTCAGTTGAGGAGATCACTGAGGAGGATTTTAATAACCTTGAAACTGTCCCTAAGCTGCTGCCTTCCAAACATTCCAATCGGGAGACTACAGTTTGGACCATGAGTGATTTGTATAACAATTATCCCACCATGTGTCGTGGTTATGCTTCTGGTCTCTACAATTTAGCTTGGGCCAAGGCAGTGCAGAATAAGCCTCTCAATGAAATCTTTCTTACGGAGGCCGACCCCGACGACAAATCCCACCGCTCTTCTTCCTCTCCTTTTCGGAATGCCAAGGAACATGGAAACGGTACAATAGAAGAGGCTGCTAAGCTCATCATTGATATTACCGGCGACGACATGAATACCAACAATGCGGATGTCGAGAAAGAGGAAGGCGAATTGGAGGAGGGCGAAATTGATATGGATACGGAGTTCGTAGAAGAGGTTGTTGACTCCAGACCAATGTTGTCCGACTCTCTCGATACTGACTGTCAGGAGATTGATTTCAAAAATAAGGAATTGGATGATCAGCTCAAATTGGTTCACAAAACATTGGATGGTGTTACAATCGACGCTGCACAGAAATCGTTTCAGGAAATTTGCTCCCAACTGCTTAGTTCTATAGAGACGTTTCTGGAATTGGTCCAGGGAAAGGTAGTCCCGAGAAAGGATGCGCTCATTCAACGACTGTATGCTGCTCTTCGAATAATCAATTCTGTGTTTTGTTCCATGAACCCCAAGGAAAAAGACGAGTATAAGCAACATTTATCGAGGTTGCTTTCTTTTGTTAAAAATTGCAACCCTGCTCTTTTTTCTCCTGAGCAGATAAAATCGGTAGAGGTCAAAATGCCATCTACAGATTCCCTTGACCATTTCCCCGACATGAGAGACAGTGCTAAAGATGTCGAGATCCATATACCTAATGGGGTGAAAAATAAGGACTTTTATTCTGCATATGCAACTGCTACTCCACATTTAACTTCTTCAACTAAGTTGCCTTCAGACTCCATGCCTGTTGGGGTTACGGTAAAAAATAATCTAAATCTCTCATCAGACAGTTTGCTATCTGGCGTACCCAATGTAAAAGGTAGAGGTCCCCTACTCCCTCTGTTAGACCTTCACAAGGATCATGATGTGGACAGTCTCCCATCACCTACCAGAGAAGCTCCTACAGTTTTTTCTGTCCAAAAATCAGGGCATATCCCTGTGAAGGTGGCACATGCTATGGATGGATCGAGAGTACATCCTTATGAAACTGATGCCCTAAAAGCTGTTTCAACCTATCAACAGAAGTTTGGTCGAAGTTCCTTTTCAATGGCTGATCGACTTCCTAGCCCAACCCCTTCAGAAGAATGTGATGGGGGTGGTGATATTGGTGGGGAGGTTTCTAGTTCTTCCATTTTCAGAAGCTCAAAAGCTTCAAATTCTTCTAAACTGGCTCAAACGGTGTCAAATTCTGCTTCTAGCATATCTACAGGTCTTTTTCCTAACCTGGAAAGTTCCAGCACTAAGGGACTGATTAGTCCTTTAAACGTTGCTCCTCCTAGTTCTGTGTCTAATCCAATAGCAAAGCCTTTGGCAAAAAGTAGAGACCCTAGGCTACGAATGGTCAATTCTGAGGCAAGTGCTATGGATCTTAATCCACGCACAATGACTTCTGTGCAGAATCCTTCTGTTGTAGAATCTGCTGTAACCGTAAACATGAGAAAGCAGAAGATGGATGTAGAGCCTAATATAGATGCCCCTGAAATGAAAAGGCAGAGGATTGGATCTCAGAATCATGCATTTTCTGCAAGTGATTTGAGAGCTGGCTCTGGAAGCGGTGGCTGGTTGGAAGATAATACCATGTCCGCTGTTCCCAGGCTTTCAAGCAGGAATCAGATGGAGATTGCTGAAGCAAATGCAATAGAAAAAAACAACGTTACAAACAATTCTGGTGCAGGAAATTCGCG
TGGGCCTATGATAAGTGCTAGCAAAGAAGCTTCCTTGCCCTCACTATTGAAAGATATTGTTGTGAACCCAACCATGCTCTTAAGTTTACTTAAAATGAACCAACAGAAGCAAGTAGCTGCAGAATTGAAATTAAACTCGAGTGAACCTGAAAAAAATGCAATTTGTCCTACGGCCGTGAATCCTTGTCTAGGATCCAGTCCACTAGTAAATGCTCCTGCTGTGACCTCAGGAATTTTGCAGCAATCAGCAGGAACACCAAGTGTACCTTCACCACCGGTGGTCACTGTGGATGATGTGGGAAAAGTTCGTATGAAACCTCGTGACCCTCGCCGTATCCTCCATGGTAATTCTCTTCACAAGGTTGACAGCATGCGAAATGAGCAGTTAAAAAGTGTTGTACCAGCTGTTCCAAACCCAGAAGGAAGTAGGGATATAATACCAAATGGCCATAAGCAAGAAGGCCAGGGAAATTTGAGATTAGCTTCTTCACAACCATTACTACCTGACATTGGTAGACAGTTCACTAACAATCTGAAAAATATTGCTGATATCATGTCTGTTCCCTCGCCACCAACTTCTTCTCACAATTTATCTTCAAAACCAGTTAAATTGGACAGAAAGGATGCTAATGCTGTTGGGTCAAGCTCTATTGACAGTAAAATTGTGGCAACTGCTACCCAAGCGGTAGATATGGTTGGCCCCTCCCGTTCACATGGTGCATGGGGAGATCTTGAGCATCTTTTTGAGGGTTATGATGACAAGCAAAAGGCTGCTATCCAGAGAGAAAGAGCAAGACGGATAGATGAGCAGAAAAAAATGTTTGCTGCACGCAAACTCTGCCTTGTTTTGGATTTGGATCACACCCTTCTGAATTCAGCTAAGTTTGTGGAAGTAGATCCATTGCATGATGAAATTTTGAGGAAGAAGGAGGAACAGGATCGTGAGAAGGTGCAGAGACATCTTTTCCGATTTCCTCATATGGGAATGTGGACCAAACTACGGCCTGGGGTCTGGAACTTTTTGGAAAAGGCCAGCGAGCTCTATGAACTTCATTTGTACACAATGGGAAACAAGTTATATGCAACGGAGATGGCAAAGGTTCTTGATCCAAAAGGGGTTCTGTTTGCCGGGCGAGTTCTTTCTCGAGGTGATGATGGAGACCCCTTGGACGGGGAAGAAAGGGTACCCAAGAGTAAGGATCTGGAGGGTGTTTTGGGCATGGAGTCTGCTGTTGTTATAATTGATGATTCCATCAGGGTGTGGCCACATAACAAAATGAACTTGATTGTCGTAGAAAGGTATACTTACTTTCCATGTAGTAGGCGCCAATTTGGGCTTCTGGGTCCTTCTCTTCTAGAGATTGACCATGATGAGAGACCTGAAGATGGTACTTTGGCTTCTTCACTGGCGGTTATCCAGAGAATTCATCAAACCTTTTTCTCCCATCCTGTATTAGATGAAGTAGATGTTAGAAATATCTTGGCCTCCGAGCAACAAAGGATTTTGGCTGGTTGCCGTATAGTGTTTAGCAGGGTTTTTCCAGTTGGTGAGGCAAATCCTCACCTGCATCCGCTGTGGCAGACAGCTGAACAGTTTGGTGCGGTGTGCACCAACCAGATTGATGAACAGGTTACCCATGTCGTTGCAAACTCGCTTGGGACTGATAAGGTGAATTGGGCTCTCTCCACTGGCAGATTCGTGGTCCATCCAGGGTGGGTGGAAGCATCGGCTCTACTTTACCGGAGGGCAAACGAGCAGGACTTTGCCATTAAACCATAA

Protein sequence

MGKHTNCLKTQDVEEGEISDTPSVEEITEEDFNNLETVPKLLPSKHSNRETTVWTMSDLYNNYPTMCRGYASGLYNLAWAKAVQNKPLNEIFLTEADPDDKSHRSSSSPFRNAKEHGNGTIEEAAKLIIDITGDDMNTNNADVEKEEGELEEGEIDMDTEFVEEVVDSRPMLSDSLDTDCQEIDFKNKELDDQLKLVHKTLDGVTIDAAQKSFQEICSQLLSSIETFLELVQGKVVPRKDALIQRLYAALRIINSVFCSMNPKEKDEYKQHLSRLLSFVKNCNPALFSPEQIKSVEVKMPSTDSLDHFPDMRDSAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTVKNNLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPVKVAHAMDGSRVHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSSSIFRSSKASNSSKLAQTVSNSASSISTGLFPNLESSSTKGLISPLNVAPPSSVSNPIAKPLAKSRDPRLRMVNSEASAMDLNPRTMTSVQNPSVVESAVTVNMRKQKMDVEPNIDAPEMKRQRIGSQNHAFSASDLRAGSGSGGWLEDNTMSAVPRLSSRNQMEIAEANAIEKNNVTNNSGAGNSRGPMISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLNSSEPEKNAICPTAVNPCLGSSPLVNAPAVTSGILQQSAGTPSVPSPPVVTVDDVGKVRMKPRDPRRILHGNSLHKVDSMRNEQLKSVVPAVPNPEGSRDIIPNGHKQEGQGNLRLASSQPLLPDIGRQFTNNLKNIADIMSVPSPPTSSHNLSSKPVKLDRKDANAVGSSSIDSKIVATATQAVDMVGPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTFFSHPVLDEVDVRNILASEQQRILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP
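The protein sequence above should be the conceptual translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and the inputs are only the first and last few codons of the CDS, not the full sequence.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Only A/C/G/T are handled; an ambiguous base such as N raises KeyError.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons of the CDS, then its last five codons (including the stop).
print(translate("ATGGGGAAACACACCAACTGTCTTAAAACT"))
print(translate("GCCATTAAACCATAA"))
```

The two fragments translate to the N-terminal residues MGKHTNCLKT and the C-terminal residues AIKP, which agree with the start and end of the protein sequence listed above.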
Homology
BLAST of CmaCh08G000190 vs. ExPASy Swiss-Prot
Match: Q8LL04 (RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana OX=3702 GN=CPL3 PE=1 SV=2)

HSP 1 Score: 966.8 bits (2498), Expect = 2.3e-280
Identity = 619/1318 (46.97%), Positives = 796/1318 (60.39%), Query Frame = 0
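The percentages on the HSP statistics line are the matched-column counts divided by the alignment length. A small sketch that re-derives them from the counts is shown here; the helper name `check_hsp_fractions` is hypothetical, and it deliberately ignores the field labels so it works on any line in this "count/length (percent%)" layout.

```python
import re

def check_hsp_fractions(stats_line: str):
    """Return [(count, length, reported_pct, recomputed_pct), ...] for each
    'count/length (pct%)' group found on a BLAST HSP statistics line."""
    results = []
    for count, length, pct in re.findall(r"(\d+)/(\d+)\s*\((\d+(?:\.\d+)?)%\)", stats_line):
        recomputed = round(100 * int(count) / int(length), 2)
        results.append((int(count), int(length), float(pct), recomputed))
    return results

line = "Identity = 619/1318 (46.97%), Positives = 796/1318 (60.39%), Query Frame = 0"
for count, length, reported, recomputed in check_hsp_fractions(line):
    print(f"{count}/{length}: reported {reported}%, recomputed {recomputed}%")
```

For this hit, 619/1318 and 796/1318 round to 46.97% and 60.39%, matching the reported values.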

Query: 1    MGKHTNCLKTQDVEEGEISDTPSVE-------EITEEDFNNLETVPKLLPSK-----HSN 60
            MG   N +   DVEEGEI D+ + E         T  D      V  +   +      SN
Sbjct: 15   MGNDENLMVMVDVEEGEIPDSVNTEIEVKHKSTTTTADVGGDVDVGVVAGGRGGGGGGSN 74

Query: 61   RETTVWTMSDLYNNYPTMCRGYA-SGLYNLAWAKAVQNKPLNEIFLTEADPDDKSHRSSS 120
              + VWTM +L + YP   R YA SGL NLAWA+AVQNKP NE  + + +P         
Sbjct: 75   GNSRVWTMEELISQYPAY-RPYANSGLSNLAWARAVQNKPFNEGLVMDYEP--------- 134

Query: 121  SPFRNAKEHGNGTIEEAAKLIIDITGDDMNTNNADVEKEEGELEEGEIDM-----DTEFV 180
                           E+ K++I+         ++D EKEEGELEEGEID+     D   V
Sbjct: 135  --------------RESDKIVIE---------DSDDEKEEGELEEGEIDLVDNASDDNLV 194

Query: 181  EEVVDSRPMLS-DSLDTDCQEIDFKNKELDDQLKLVHKTLDGVTIDAAQKSFQEICSQLL 240
            E+  +S  ++S D ++ D      K ++L+ ++KL+   L+  ++  AQ  F+ +CS++L
Sbjct: 195  EKDTESVVLISADKVEDD---RILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRIL 254

Query: 241  SSIETFLELV-QGKVVPRKDALIQRLYAALRIINSVFCSMNPKEKDEYKQHLSRLLSFVK 300
             ++E+  ELV      P++D L+Q  +A+L+ IN VFCSMN   K+  K+ +SRLL+ V 
Sbjct: 255  GALESLRELVSDNDDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVN 314

Query: 301  NCNPALFSPEQIKSVEVKMPSTDSLDHFPDMRDSAKDVEIHIPNGVKNKDFYSAYA-TAT 360
                                     DHF       +  EI   N   ++   + +A T++
Sbjct: 315  -------------------------DHFSQFLSFNQKNEIETMNQDLSRSAIAVFAGTSS 374

Query: 361  PHLTSSTKLPSDSMPVGVTVKNNLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLP 420
                +    PS+             L+S+S   G   ++ R P+LPLLDLHKDHD DSLP
Sbjct: 375  EENVNQMTQPSNGDSFLAK-----KLTSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLP 434

Query: 421  SPTREAPTVFSVQ------KSGHIPVKVAHAMDGSRVHPYETDALKAVSTYQQKFGRSSF 480
            SPTRE      V       + G    + +   +G++V+ YE+DA KAVSTYQQKFG +S 
Sbjct: 435  SPTRETTPSLPVNGRHTMVRPGFPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSV 494

Query: 481  SMADRLPSPTPS-EECDGGGDIGGEVSSSSIFRSSKASNSSKLAQTVSNSASSISTGL-F 540
               D LPSPTPS E  DG GD+GGEV SSS+ +SS   +     Q V   ++  S  +  
Sbjct: 495  FKTDDLPSPTPSGEPNDGNGDVGGEV-SSSVVKSSNPGSHLIYGQDVPLPSNFNSRSMPV 554

Query: 541  PNLESSSTKGLISPLNVAPPSSVSNPIAKPLAKSRDPRLRMVNSEASAMDLNPRTMTSVQ 600
             N  SS+       ++     + S+   KP AKSRDPRLR+   +A+ + +   +    +
Sbjct: 555  ANSVSSTVPPHHLSIHAISAPTASDQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDAR 614

Query: 601  NPSVVE-SAVTVNMRKQKMDVEPNIDAPEMKRQRIGSQNHAFSASDLRAGSGSGGWLEDN 660
            N S VE SA  VN RKQK   E  ID P  KRQ+        S +D    +G+GGWLED 
Sbjct: 615  NLSKVELSADLVNPRKQKAADEFLIDGPAWKRQK--------SDTDAPKAAGTGGWLEDT 674

Query: 661  TMSAVPRLSSRNQMEIAEANAIEKNNVTNNSGAGNSRGPMISASKEASLPSLLKDIVVNP 720
              S + +L S+ ++ I        ++V   S    S+    +++  ASL SLLKDI VNP
Sbjct: 675  ESSGLLKLESKPRL-IENGVTSMTSSVMPTSAVSVSQKVRTASTDTASLQSLLKDIAVNP 734

Query: 721  TMLLSLLKMNQQKQVAAELKLNSSEPEKNAICP-TAVNPCLG---SSPLVNAPA---VTS 780
            TMLL+LLKM ++++V  +      +P + A  P ++V P +    S P  NA A   + S
Sbjct: 735  TMLLNLLKMGERQKVPEKAIQKPMDPRRAAQLPGSSVQPGVSTPLSIPASNALAANSLNS 794

Query: 781  GILQQSA-GTPSVPSPPVVTVDDVGKVRMKPRDPRRILHGNSLHKVDSMRNEQLKSVVPA 840
            G+LQ S+   P+  S         G +RMKPRDPRRILHG++L + DS   +Q K   P+
Sbjct: 795  GVLQDSSQNAPAAES---------GSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPS 854

Query: 841  V----------------PNPEGSRDIIPNG-HKQEGQGNLRLASSQPLLPDIGRQFTNNL 900
                             P  +  ++I  NG  K +  G L    +    PD   QFT NL
Sbjct: 855  TLGTLTMKGKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKT----PDFSTQFTKNL 914

Query: 901  KNIADIMSVP----SPPTSSHNLSSKPVKLDRKDANAVGSSSIDSKIVATATQAVDMVGP 960
            K+IAD++ V     +PP S H++  K  + D K  N    ++ D  +  +A       GP
Sbjct: 915  KSIADMVVVSQQLGNPPASMHSVQLK-TERDVKH-NPSNPNAQDEDVSVSAASVTAAAGP 974

Query: 961  SRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAK 1020
            +RS  +WGD+EHLFEGYDD Q+ AIQRER RR++EQ KMFA++KL LVLD+DHTLLNSAK
Sbjct: 975  TRSMNSWGDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAK 1034

Query: 1021 FVEVDPLHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYT 1080
            F EV+  H+EILRKKEEQDREK  RHLFRF HMGMWTKLRPG+WNFLEKAS+LYELHLYT
Sbjct: 1035 FNEVESRHEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYT 1094

Query: 1081 MGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVVII 1140
            MGNKLYATEMAK+LDPKGVLF GRV+S+GDDGDPLDG+ERVPKSKDLEGV+GMES+VVII
Sbjct: 1095 MGNKLYATEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVII 1154

Query: 1141 DDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQR 1200
            DDS+RVWP +KMNLI VERY YFPCSRRQFGLLGPSLLE+D DE PE+GTLASSLAVI++
Sbjct: 1155 DDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEK 1214

Query: 1201 IHQTFFSHPVLDEVDVRNILASEQQRILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA 1260
            IHQ FFSH  LDEVDVRNILASEQ++ILAGCRIVFSR+ PVGEA PHLHPLWQTAEQFGA
Sbjct: 1215 IHQNFFSHTSLDEVDVRNILASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGA 1241

BLAST of CmaCh08G000190 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 3.7e-68
Identity = 152/347 (43.80%), Positives = 211/347 (60.81%), Query Frame = 0

Query: 918  YDDKQKAAIQRERARRIDEQKKMF-AARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKK 977
            Y  K+    + E +R  D   +     RKL LVLDLDHTLLN+    ++ P  +E L+  
Sbjct: 94   YIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKP-EEEYLKSH 153

Query: 978  EE--QDREKVQ-RHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAK 1037
                QD   V    LF    M M TKLRP V +FL++ASE++ +++YTMG++ YA +MAK
Sbjct: 154  THSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAK 213

Query: 1038 VLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKM 1097
            +LDPKG  F  RV+SR DDG        V   K L+ VLG ESAV+I+DD+   WP +K 
Sbjct: 214  LLDPKGEYFGDRVISR-DDG-------TVRHEKSLDVVLGQESAVLILDDTENAWPKHKD 273

Query: 1098 NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTFFSH--PV 1157
            NLIV+ERY +F  S RQF     SL E+  DE   DG LA+ L V+++ H  FF +    
Sbjct: 274  NLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG 333

Query: 1158 LDEVDVRNILASEQQRILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQV 1217
            +   DVR +L   ++ IL GC+IVFSRVFP  +A P  HPLW+ AE+ GA C  ++D  V
Sbjct: 334  ISNRDVRLMLKQVRKEILKGCKIVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASV 393

Query: 1218 THVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1259
            THVVA  +GT+K  WA+   ++VVH GW++A+  L+ +  E++F ++
Sbjct: 394  THVVAMDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEENFGLE 430

BLAST of CmaCh08G000190 vs. ExPASy Swiss-Prot
Match: Q95QG8 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis elegans OX=6239 GN=fcp-1 PE=1 SV=2)

HSP 1 Score: 137.1 bits (344), Expect = 1.4e-30
Identity = 105/338 (31.07%), Positives = 169/338 (50.00%), Query Frame = 0

Query: 936  EQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKVQRHLFRFPHMG 995
            ++  +   RKL L++DLD T+++++         D+ +    E  ++  + +L    H  
Sbjct: 134  DENNLITNRKLVLLVDLDQTIIHTS---------DKPMTVDTENHKDITKYNL----HSR 193

Query: 996  MW-TKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGD 1055
            ++ TKLRP    FL K S +YE+H+ T G + YA  +A++LDP   LF  R+LSR    D
Sbjct: 194  VYTTKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSR----D 253

Query: 1056 PLDGEERVPKSKDLEGVLGM-ESAVVIIDDSIRVWPHNKMNLIVVERYTYF--------- 1115
             L   +   K+ +L+ +    ++ VVIIDD   VW +++  LI ++ Y +F         
Sbjct: 254  ELFSAQH--KTNNLKALFPCGDNLVVIIDDRSDVWMYSEA-LIQIKPYRFFKEVGDINAP 313

Query: 1116 PCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTFFSHPVL---DEV--DVRN 1175
              S+ Q     P  +E   D+  ED  L     V+  IH  ++    L   +EV  DV+ 
Sbjct: 314  KNSKEQM----PVQIE---DDAHEDKVLEEIERVLTNIHDKYYEKHDLRGSEEVLLDVKE 373

Query: 1176 ILASEQQRILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSL 1235
            ++  E+ ++L GC IVFS + P+GE       +++   QFGAV    + + VTHVV    
Sbjct: 374  VIKEERHKVLDGCVIVFSGIVPMGEKLERT-DIYRLCTQFGAVIVPDVTDDVTHVVGARY 433

Query: 1236 GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1258
            GT KV  A    +FVV   WV A    + +A+E  F +
Sbjct: 434  GTQKVYQANRLNKFVVTVQWVYACVEKWLKADENLFQL 443

BLAST of CmaCh08G000190 vs. ExPASy Swiss-Prot
Match: F4JCB2 (RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana OX=3702 GN=CPL5 PE=1 SV=2)

HSP 1 Score: 118.6 bits (296), Expect = 5.0e-25
Identity = 103/376 (27.39%), Positives = 175/376 (46.54%), Query Frame = 0

Query: 809  SRDIIPNGHKQ--EGQGNLRLASSQPLLPDIGRQFTNNLKNIADIMSVPSPPTSSHNLSS 868
            SR + P+  ++  E + N  LA+   LL  I  +F   ++   +   V    +   N SS
Sbjct: 245  SRVLKPHSEEKTDESENNGGLANVLKLLKGIHHKFF-KVEEEVESQDVRLTMSVVENFSS 304

Query: 869  KPVKLDRKDANAVGSSSIDSKIVATATQAVDMVGPSRSHGAWGDLEHLFEGYDDKQKAAI 928
            +P K  R+            KI  T  ++   +  S S G W    ++  G     K+ +
Sbjct: 305  EP-KAKRR------------KIEPTINESSSSLSSSSSCGHW----YICHGICIGCKSTV 364

Query: 929  QRERARRID-------------EQKKMFAA-------RKLCLVLDLDHTLLNSAKFVEVD 988
            ++ + R  D                K F         +KL LVLDLDHTLL++     + 
Sbjct: 365  KKSQGRAFDYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHLVLDLDHTLLHTVMVPSLS 424

Query: 989  PLHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKL 1048
                 ++ +     R+ + +       M   TKLRP + +FL++A+E + +++YT G+++
Sbjct: 425  QAEKYLIEEAGSATRDDLWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 484

Query: 1049 YATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVVIIDDSIR 1108
            YA ++ +++DPK + F  RV+++ +           P  K L+ VL  E  VVI+DD+  
Sbjct: 485  YAKQVLELIDPKKLYFGDRVITKTES----------PHMKTLDFVLAEERGVVIVDDTRN 544

Query: 1109 VWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTF 1162
            VWP +K NL+ + +Y+YF    R  G       E   DE   +G LA+ L +++ +HQ F
Sbjct: 545  VWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRF 588

BLAST of CmaCh08G000190 vs. ExPASy Swiss-Prot
Match: Q9P376 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=fcp1 PE=1 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 6.8e-22
Identity = 120/477 (25.16%), Positives = 182/477 (38.16%), Query Frame = 0

Query: 915  FEGYDDKQKA-----------AIQRERARRIDEQ--KKMFAARKLCLVLDLDHTLLNSAK 974
            + GY D  +A            +  E A R++ +  K++   ++L L++DLD T++++  
Sbjct: 121  YMGYSDMARANISMTHNTGDLTVSLEEASRLESENVKRLRQEKRLSLIVDLDQTIIHAT- 180

Query: 975  FVEVDPLHDEILRKKEEQD----REKVQRHLFRFPH---MGMWTKLRPGVWNFLEKASEL 1034
               VDP   E +      +    R+    +L   P       + K RPG+  FL+K SEL
Sbjct: 181  ---VDPTVGEWMSDPGNVNYDVLRDVRSFNLQEGPSGYTSCYYIKFRPGLAQFLQKISEL 240

Query: 1035 YELHLYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGM 1094
            YELH+YTMG K YA E+AK++DP G LF  RVLSR D G            K L  +   
Sbjct: 241  YELHIYTMGTKAYAKEVAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLRRLFPC 300

Query: 1095 E-SAVVIIDDSIRVWPHNKMNLIVVERYTYF-----------------PCSRRQFGLLGP 1154
            + S VV+IDD   VW  N  NLI V  Y +F                 P   +   L  P
Sbjct: 301  DTSMVVVIDDRGDVWDWNP-NLIKVVPYEFFVGIGDINSNFLAKSTPLPEQEQLIPLEIP 360

Query: 1155 -----------------------------------------------------------S 1214
                                                                       +
Sbjct: 361  KDEPDSVDEINEENEETPEYDSSNSSYAQDSSTIPEKTLLKDTFLQNREALEEQNKERVT 420

Query: 1215 LLEIDHDERP------------------------EDGTLASSLAVIQRIHQTFFSHPVLD 1260
             LE+   ERP                         D  L     V++ IH  ++     +
Sbjct: 421  ALELQKSERPLAKQQNALLEDEGKPTPSHTLLHNRDHELERLEKVLKDIHAVYYEEE--N 480

BLAST of CmaCh08G000190 vs. TAIR 10
Match: AT2G33540.1 (C-terminal domain phosphatase-like 3 )

HSP 1 Score: 966.8 bits (2498), Expect = 1.6e-281
Identity = 619/1318 (46.97%), Positives = 796/1318 (60.39%), Query Frame = 0

Query: 1    MGKHTNCLKTQDVEEGEISDTPSVE-------EITEEDFNNLETVPKLLPSK-----HSN 60
            MG   N +   DVEEGEI D+ + E         T  D      V  +   +      SN
Sbjct: 15   MGNDENLMVMVDVEEGEIPDSVNTEIEVKHKSTTTTADVGGDVDVGVVAGGRGGGGGGSN 74

Query: 61   RETTVWTMSDLYNNYPTMCRGYA-SGLYNLAWAKAVQNKPLNEIFLTEADPDDKSHRSSS 120
              + VWTM +L + YP   R YA SGL NLAWA+AVQNKP NE  + + +P         
Sbjct: 75   GNSRVWTMEELISQYPAY-RPYANSGLSNLAWARAVQNKPFNEGLVMDYEP--------- 134

Query: 121  SPFRNAKEHGNGTIEEAAKLIIDITGDDMNTNNADVEKEEGELEEGEIDM-----DTEFV 180
                           E+ K++I+         ++D EKEEGELEEGEID+     D   V
Sbjct: 135  --------------RESDKIVIE---------DSDDEKEEGELEEGEIDLVDNASDDNLV 194

Query: 181  EEVVDSRPMLS-DSLDTDCQEIDFKNKELDDQLKLVHKTLDGVTIDAAQKSFQEICSQLL 240
            E+  +S  ++S D ++ D      K ++L+ ++KL+   L+  ++  AQ  F+ +CS++L
Sbjct: 195  EKDTESVVLISADKVEDD---RILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRIL 254

Query: 241  SSIETFLELV-QGKVVPRKDALIQRLYAALRIINSVFCSMNPKEKDEYKQHLSRLLSFVK 300
             ++E+  ELV      P++D L+Q  +A+L+ IN VFCSMN   K+  K+ +SRLL+ V 
Sbjct: 255  GALESLRELVSDNDDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVN 314

Query: 301  NCNPALFSPEQIKSVEVKMPSTDSLDHFPDMRDSAKDVEIHIPNGVKNKDFYSAYA-TAT 360
                                     DHF       +  EI   N   ++   + +A T++
Sbjct: 315  -------------------------DHFSQFLSFNQKNEIETMNQDLSRSAIAVFAGTSS 374

Query: 361  PHLTSSTKLPSDSMPVGVTVKNNLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLP 420
                +    PS+             L+S+S   G   ++ R P+LPLLDLHKDHD DSLP
Sbjct: 375  EENVNQMTQPSNGDSFLAK-----KLTSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLP 434

Query: 421  SPTREAPTVFSVQ------KSGHIPVKVAHAMDGSRVHPYETDALKAVSTYQQKFGRSSF 480
            SPTRE      V       + G    + +   +G++V+ YE+DA KAVSTYQQKFG +S 
Sbjct: 435  SPTRETTPSLPVNGRHTMVRPGFPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSV 494

Query: 481  SMADRLPSPTPS-EECDGGGDIGGEVSSSSIFRSSKASNSSKLAQTVSNSASSISTGL-F 540
               D LPSPTPS E  DG GD+GGEV SSS+ +SS   +     Q V   ++  S  +  
Sbjct: 495  FKTDDLPSPTPSGEPNDGNGDVGGEV-SSSVVKSSNPGSHLIYGQDVPLPSNFNSRSMPV 554

Query: 541  PNLESSSTKGLISPLNVAPPSSVSNPIAKPLAKSRDPRLRMVNSEASAMDLNPRTMTSVQ 600
             N  SS+       ++     + S+   KP AKSRDPRLR+   +A+ + +   +    +
Sbjct: 555  ANSVSSTVPPHHLSIHAISAPTASDQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDAR 614

Query: 601  NPSVVE-SAVTVNMRKQKMDVEPNIDAPEMKRQRIGSQNHAFSASDLRAGSGSGGWLEDN 660
            N S VE SA  VN RKQK   E  ID P  KRQ+        S +D    +G+GGWLED 
Sbjct: 615  NLSKVELSADLVNPRKQKAADEFLIDGPAWKRQK--------SDTDAPKAAGTGGWLEDT 674

Query: 661  TMSAVPRLSSRNQMEIAEANAIEKNNVTNNSGAGNSRGPMISASKEASLPSLLKDIVVNP 720
              S + +L S+ ++ I        ++V   S    S+    +++  ASL SLLKDI VNP
Sbjct: 675  ESSGLLKLESKPRL-IENGVTSMTSSVMPTSAVSVSQKVRTASTDTASLQSLLKDIAVNP 734

Query: 721  TMLLSLLKMNQQKQVAAELKLNSSEPEKNAICP-TAVNPCLG---SSPLVNAPA---VTS 780
            TMLL+LLKM ++++V  +      +P + A  P ++V P +    S P  NA A   + S
Sbjct: 735  TMLLNLLKMGERQKVPEKAIQKPMDPRRAAQLPGSSVQPGVSTPLSIPASNALAANSLNS 794

Query: 781  GILQQSA-GTPSVPSPPVVTVDDVGKVRMKPRDPRRILHGNSLHKVDSMRNEQLKSVVPA 840
            G+LQ S+   P+  S         G +RMKPRDPRRILHG++L + DS   +Q K   P+
Sbjct: 795  GVLQDSSQNAPAAES---------GSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPS 854

Query: 841  V----------------PNPEGSRDIIPNG-HKQEGQGNLRLASSQPLLPDIGRQFTNNL 900
                             P  +  ++I  NG  K +  G L    +    PD   QFT NL
Sbjct: 855  TLGTLTMKGKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKT----PDFSTQFTKNL 914

Query: 901  KNIADIMSVP----SPPTSSHNLSSKPVKLDRKDANAVGSSSIDSKIVATATQAVDMVGP 960
            K+IAD++ V     +PP S H++  K  + D K  N    ++ D  +  +A       GP
Sbjct: 915  KSIADMVVVSQQLGNPPASMHSVQLK-TERDVKH-NPSNPNAQDEDVSVSAASVTAAAGP 974

Query: 961  SRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAK 1020
            +RS  +WGD+EHLFEGYDD Q+ AIQRER RR++EQ KMFA++KL LVLD+DHTLLNSAK
Sbjct: 975  TRSMNSWGDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAK 1034

Query: 1021 FVEVDPLHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYT 1080
            F EV+  H+EILRKKEEQDREK  RHLFRF HMGMWTKLRPG+WNFLEKAS+LYELHLYT
Sbjct: 1035 FNEVESRHEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYT 1094

Query: 1081 MGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVVII 1140
            MGNKLYATEMAK+LDPKGVLF GRV+S+GDDGDPLDG+ERVPKSKDLEGV+GMES+VVII
Sbjct: 1095 MGNKLYATEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVII 1154

Query: 1141 DDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQR 1200
            DDS+RVWP +KMNLI VERY YFPCSRRQFGLLGPSLLE+D DE PE+GTLASSLAVI++
Sbjct: 1155 DDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEK 1214

Query: 1201 IHQTFFSHPVLDEVDVRNILASEQQRILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA 1260
            IHQ FFSH  LDEVDVRNILASEQ++ILAGCRIVFSR+ PVGEA PHLHPLWQTAEQFGA
Sbjct: 1215 IHQNFFSHTSLDEVDVRNILASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGA 1241

BLAST of CmaCh08G000190 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4 )

HSP 1 Score: 261.9 bits (668), Expect = 2.6e-69
Identity = 152/347 (43.80%), Positives = 211/347 (60.81%), Query Frame = 0

Query: 918  YDDKQKAAIQRERARRIDEQKKMF-AARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKK 977
            Y  K+    + E +R  D   +     RKL LVLDLDHTLLN+    ++ P  +E L+  
Sbjct: 94   YIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKP-EEEYLKSH 153

Query: 978  EE--QDREKVQ-RHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAK 1037
                QD   V    LF    M M TKLRP V +FL++ASE++ +++YTMG++ YA +MAK
Sbjct: 154  THSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAK 213

Query: 1038 VLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKM 1097
            +LDPKG  F  RV+SR DDG        V   K L+ VLG ESAV+I+DD+   WP +K 
Sbjct: 214  LLDPKGEYFGDRVISR-DDG-------TVRHEKSLDVVLGQESAVLILDDTENAWPKHKD 273

Query: 1098 NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTFFSH--PV 1157
            NLIV+ERY +F  S RQF     SL E+  DE   DG LA+ L V+++ H  FF +    
Sbjct: 274  NLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG 333

Query: 1158 LDEVDVRNILASEQQRILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQV 1217
            +   DVR +L   ++ IL GC+IVFSRVFP  +A P  HPLW+ AE+ GA C  ++D  V
Sbjct: 334  ISNRDVRLMLKQVRKEILKGCKIVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASV 393

Query: 1218 THVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1259
            THVVA  +GT+K  WA+   ++VVH GW++A+  L+ +  E++F ++
Sbjct: 394  THVVAMDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEENFGLE 430

BLAST of CmaCh08G000190 vs. TAIR 10
Match: AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 134.0 bits (336), Expect = 8.2e-31
Identity = 81/225 (36.00%), Positives = 130/225 (57.78%), Query Frame = 0

Query: 944  RKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKVQRHLFRFPHMG----MWTK 1003
            +KL LVLDLDHTLL+S     +      ++++   + RE     L++F  +G       K
Sbjct: 65   KKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASSRTRE----DLWKFRPIGHPIDRLIK 124

Query: 1004 LRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGE 1063
            LRP V +FL++A+E++ + +YTMG+++YA  + +++DPK + F  RV+++ +        
Sbjct: 125  LRPFVRDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDES------- 184

Query: 1064 ERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLL 1123
               P+ K L  VL  E  VVI+DD+  +WPH+K NLI + +Y YF    R+ GL   S  
Sbjct: 185  ---PRMKTLNLVLAEERGVVIVDDTRDIWPHHKNNLIQIRKYKYF----RRSGLDSNSYS 244

Query: 1124 EIDHDERPEDGTLASSLAVIQRIHQTFF---SHPVLDEVDVRNIL 1162
            E   DE   DG LA+ L +++ +H+ FF      VL+ +DVR++L
Sbjct: 245  EKKTDEGENDGGLANVLKLLREVHRRFFIVEVEEVLESMDVRSLL 271

BLAST of CmaCh08G000190 vs. TAIR 10
Match: AT5G54210.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 129.0 bits (323), Expect = 2.6e-29
Identity = 95/292 (32.53%), Positives = 147/292 (50.34%), Query Frame = 0

Query: 866  SKPVKLDRKDANAVGSSSIDSKIVATATQAVDMVGPSRSHGAWGDLEHLFEGYDDKQKAA 925
            SK  K+D +  N+  S++ D   V             R  G     ++L +G    Q + 
Sbjct: 14   SKKRKIDSEINNSSSSTNCDHFFVRYGICCNCRSNVERHRGR--SFDYLVDGL---QLSD 73

Query: 926  IQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKVQ 985
            I     +R+  Q   F  +KL LVLDLDHTLL++   V +  L  E     EE+D  +  
Sbjct: 74   IAVTVTKRVTTQITCFNDKKLHLVLDLDHTLLHT---VMISNLTKEETYLIEEEDSREDL 133

Query: 986  RHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGR 1045
            R L          KLRP V  FL++A++++ +++YTMG++ YA  +  ++DP+ V F  R
Sbjct: 134  RRLNGGYSSEFLIKLRPFVHEFLKEANKMFSMYVYTMGDRDYAMNVLNLIDPEKVYFGDR 193

Query: 1046 VLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFP 1105
            V++R +           P  K L+ VL  E  VVI+DD+  VWP +K NL+ + +Y YF 
Sbjct: 194  VITRNES----------PYIKTLDLVLADECGVVIVDDTPHVWPDHKRNLLEITKYNYFS 253

Query: 1106 CSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTFFSHPVLDEVDV 1158
               R       S  E   DE   DG+LA+ L VI+++++ FFS  V  ++D+
Sbjct: 254  DKTRHDVKYTKSYAEEKRDESRNDGSLANVLKVIKQVYEGFFSGGVEKDLDI 287

BLAST of CmaCh08G000190 vs. TAIR 10
Match: AT1G20320.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 124.8 bits (312), Expect = 5.0e-28
Identity = 85/234 (36.32%), Positives = 125/234 (53.42%), Query Frame = 0

Query: 944  RKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKE-EQDREKVQRHLFRFPHMGMWTKLRP 1003
            RKL LVLDLDHTLL+S     +      +L + +  +D   + R         M  KLRP
Sbjct: 75   RKLHLVLDLDHTLLHSIMISRLSEGEKYLLGESDFREDLWTLDRE--------MLIKLRP 134

Query: 1004 GVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERV 1063
             V  FL++A+E++ +++YTMGN+ YA  + K +DPK V F  RV++R + G         
Sbjct: 135  FVHEFLKEANEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGDRVITRDESG--------- 194

Query: 1064 PKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEID 1123
              SK L+ VL  E  VVI+DD+  VWP ++ NL+ + +Y+YF            S  E  
Sbjct: 195  -FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF--RDYSHDKESKSYAEEK 254

Query: 1124 HDERPEDGTLASSLAVIQRIHQTFFSHPV--LDEVDVRNILASEQQRILAGCRI 1175
             DE    G+LA+ L V++ +HQ FF   +  LD  DVR +L  ++Q I    +I
Sbjct: 255  RDESRNQGSLANVLKVLKDVHQEFFRGGIEELDSKDVRLLL--QEQHIAVSIKI 286
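
The percentages in the HSP statistics lines above are simple ratios of the reported counts (for example, 619/1318 identical positions for the AT2G33540.1 hit). The following is a minimal Python sketch for parsing such a line and recomputing the percentages, assuming the line layout shown in this record. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Matches lines such as:
#   "Identity = 619/1318 (46.97%), Positives = 796/1318 (60.39%), Query Frame = 0"
# The optional "i" also tolerates the "Postives" spelling seen in some reports.
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, aligned, _, positives, aligned_pos, _ = m.groups()
    identical, aligned = int(identical), int(aligned)
    positives, aligned_pos = int(positives), int(aligned_pos)
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": 100.0 * identical / aligned,
        "positives_pct": 100.0 * positives / aligned_pos,
    }

stats = parse_hsp_stats(
    "Identity = 619/1318 (46.97%), Positives = 796/1318 (60.39%), Query Frame = 0"
)
print(round(stats["identity_pct"], 2), round(stats["positives_pct"], 2))
```

Recomputing the percentages from the counts gives a quick consistency check against the values printed in the report.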

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q8LL04	2.3e-280	46.97	RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana O...
Q00IB6	3.7e-68	43.80	RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O...
Q95QG8	1.4e-30	31.07	RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis eleg...
F4JCB2	5.0e-25	27.39	RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana O...
Q9P376	6.8e-22	25.16	RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces...

Match Name	E-value	Identity (%)	Description
AT2G33540.1	1.6e-281	46.97	C-terminal domain phosphatase-like 3
AT5G58003.1	2.6e-69	43.80	C-terminal domain phosphatase-like 4
AT2G04930.1	8.2e-31	36.00	Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT5G54210.1	2.6e-29	32.53	Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT1G20320.1	5.0e-28	36.32	Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004274	FCP1 homology domain	SMART	SM00577	forpap2	coord: 944..1108; e-value: 6.9E-54; score: 195.0
IPR004274	FCP1 homology domain	PFAM	PF03031	NIF	coord: 946..1102; e-value: 5.8E-22; score: 78.1
IPR004274	FCP1 homology domain	PROSITE	PS50969	FCP1	coord: 941..1121; score: 29.868853
IPR001357	BRCT domain	SMART	SM00292	BRCT_7	coord: 1166..1247; e-value: 5.1E-6; score: 36.0
IPR001357	BRCT domain	PFAM	PF12738	PTCB-BRCT	coord: 1189..1239; e-value: 3.2E-5; score: 23.8
IPR001357	BRCT domain	PROSITE	PS50172	BRCT	coord: 1164..1257; score: 12.53282
IPR023214	HAD superfamily	GENE3D	3.40.50.1000	-	coord: 930..1167; e-value: 1.3E-48; score: 167.4
IPR011947	FCP1-like phosphatase, phosphatase domain	TIGRFAM	TIGR02250	TIGR02250	coord: 941..1104; e-value: 6.1E-49; score: 164.1
IPR036420	BRCT domain superfamily	GENE3D	3.40.50.10190	BRCT domain	coord: 1168..1257; e-value: 1.2E-19; score: 72.4
IPR036420	BRCT domain superfamily	SUPERFAMILY	52113	BRCT domain	coord: 1168..1258
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 527..547
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 97..117
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 479..493
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..23
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 452..493
None	No IPR available	PANTHER	PTHR23081:SF2	RNA POLYMERASE II C-TERMINAL DOMAIN PHOSPHATASE-LIKE 3	coord: 173..1259
None	No IPR available	CDD	cd07521	HAD_FCP1-like	coord: 945..1098; e-value: 1.54237E-38; score: 138.497
None	No IPR available	CDD	cd17729	BRCT_CTDP1	coord: 1156..1252; e-value: 2.75704E-39; score: 139.205
IPR039189	CTD phosphatase Fcp1	PANTHER	PTHR23081	RNA POLYMERASE II CTD PHOSPHATASE	coord: 173..1259
IPR036412	HAD-like superfamily	SUPERFAMILY	56784	HAD-like	coord: 936..1113
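
The alignment entries above give each domain hit as a `coord: start..end` range on the protein. The following is a minimal Python sketch for reading such ranges, computing their lengths, and checking whether two hits overlap. It assumes the coordinates are 1-based, inclusive residue positions, which this record does not state explicitly. The helper names `parse_coord`, `span_length`, and `overlaps` are hypothetical.

```python
import re

# Matches entries such as "coord: 941..1121".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' entry."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1

def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]

# PROSITE hits taken from the table above.
fcp1 = parse_coord("coord: 941..1121")   # PS50969, FCP1 homology domain
brct = parse_coord("coord: 1164..1257")  # PS50172, BRCT domain

print(span_length(*fcp1), span_length(*brct), overlaps(fcp1, brct))
# prints: 181 94 False
```

Under that assumption, the PROSITE FCP1 homology and BRCT hits occupy separate, non-overlapping stretches near the C-terminus of the protein.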

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh08G000190.1	CmaCh08G000190.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0070940	dephosphorylation of RNA polymerase II C-terminal domain
cellular_component	GO:0005634	nucleus
molecular_function	GO:0008420	RNA polymerase II CTD heptapeptide repeat phosphatase activity
molecular_function	GO:0004721	phosphoprotein phosphatase activity