CmaCh09G012530.1 (mRNA) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh09G012530.1
TypemRNA
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
LocationCma_Chr09: 8381799 .. 8395215 (-)
Sequence length2118
RNA-Seq ExpressionCmaCh09G012530.1
SyntenyCmaCh09G012530.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTGGGTTGGATTAGCATTTCAGGTAGGGTTGAGTTTATTTTTGTTATATACAACATCTTTAAACAATTCATTTTTATTATTTATTTGATTTCATGATCTATCCTTATATTTTTGTTTCTGCTATATGTAAATTTTTTTTTTTGTTTCTGATACAAATTTGAAGGTCATTTTTGTCTCTGCTACATCTACCCATTTTTGTTTCTACTACATCTATTGTTAAATTTGAAGGCCATTTTTGTTTCTGGTAAATCTGAAGGCATTTTTGTTTATGTTATATCGGTGAATCTGAAGGTCATTTTTGTTTTTGTTATGGTAAATCTAAGGACATTTTTGTTTTTTCTTCTCTATAAATAAATCGAAGTTATTATTTTTTATTTATTTATTTGTATGGCAGTTTCAGCCGACGTTGTTGCAGGAGGGCATTGTTGATCGGTGACAAACATGGAGAGATTTGGGAGTAAGTAGTGTAAGAGTATGTTGTTCGAGAGAATTTTCAATTAAATACTTAATAACGTTTTCTAAAGTTGATTTTTCTTAGGGTGTACATTCACTCGACAACTAGAACTATGAGGGTTGGGTTGAGTTGGATTAGTATGTAGGGTTGGATTCATTTTTCTAAACTCGAACTGAATCGATTCGGTTTCGAGTTCATGGGACAAAATTTTCAGGTTGACATGAGCCAAATAAAAATATTTTTAATTTCTTTACTTAATTTCTCTTATATATATATTTTAAAAATATCTTAATATTTCATACTATAAAATAAGTTAAATATATTATTTATTATATAAATGCAAATTATTATTTATTTAATTTAAGAATATGCAAGAAGGTTCATAAATTATATATATTATAAATTTTAATTTATTTATTATCTATTTGTTGGAATATTATTATATAATAATTTATATATATTTTTATTTTTTTCTAAATAATCCAATATCAATTAAAATATTTTATATACCATATTTTTAAACTAATCATTTTTATAATTTATTTAATTTTATGATCTATTTTATAATTTTGTTTGTTACATGAACGTATTCAAGAAATATAAATATTTAGTGGTCTTTAATTTTATGACGATGTTAAATATTAATATATTTTCATTTTTTATTTAATTTGTATTTATTTTTTTAAATAAATATAGATTAATTTGTATTTGTTTGTGAAATTTTATTATATTTTATTTAATTGTATCCAAATAATTATAAGCGTGGATAAATAAAACTTAAAAAAAAATTTGACAATCTAAACTGGGTTGGGTTGTGAAAAGTTTTGAGTTGGATTGAGTTGGGATAAGTTTTGGATTTGGACGAGCAACGAAACTATTGGTTGGGTAAAAAAAAAATAGCTCAACACAGCCCGTACCATATACACCCCTAATTTTTATCGTCTCATTTAAATATTTATTTTCAAATTTTTAAAAAAGAGGCATTTAAATTATTTCAAACACTACAAATTTTATGTTTTGAGTCTTAAATTAATATAATTGAAAAATAAAAGGGAATTATGAGGAAGGGAAATCCTGATTTTTTTAATTTTTTTTAATTTTATTTATTTATTTTCTTTTAAATAAATTAAAAAAAAGAAGGAAAAGAAAATAAGAAAACCCTACCTTTTTGTTCATCTTACAGTCCTTCCGCCGGTACCGTTTCAGCCGAACAGCTCCCTTTCGTTTTCCCTCTCTCTCCGGCGGCTATACCTCGACGATTCCAGTAATATCGTAGGGCGGCGAACAGCAGCACCTCGCCGACGTCGTTCTTTGGTAGCTTGCGACCTACGGCGGCAATTTCTTCTCCCTCTCGTCTTCTCCGGCGAGCGACGGACGAAAACACATAATTTCCAGCGTCTTCATCTTCTCTCTGGCGTGCGCAGAGACACACGACGCGGGAACCACGCAACGCGACGCTTCAACACTTCCGCAACGACGGATGGTTTGTTGGACGGCTGTGCAACCAGCCGACGTGACCTCACGGATTCGCAGCCATTTCTGCATTGGTTAGAGTTTTCAACAGCGTTTTCTTCAGGTTTGGTAAAGACCCATATCTTTTCGGCTTTAGTTTAGTGTTACCCATTCGGTAAAGGATTTAAATACGCCGGATTAAGTTTATTCAAACACTAACAAGCAAGCCTTCGAGGTTGTTGGTGGTGTTCAATAGTTCAGGACTGGTCCGGTGGGTTCAAATTAAAGTTTAATATTGTTTTGGGAGTCTTCAGAATAGCTTAGGGGCGCTTTAGAACGTTTAGAAGTAGATTAGTAGTCTGATTGTTGTGTGCTAGCTTGCTGTGTCATGTCTTGAATTTGTGATATGGATGCATGCTGTAATCTGTGATGTCTTCTCTTTCGGCAAGGTTTGGTAAGGACCCACTTCTTTTCGGCTTTAGTTTAGTGTTACCCATCTGGTAACAGTCTCAAATAAGCAGGATTAAGTATATTCAAACACCAACCAGCAAGCCTTTGAGGTTGTTGGTGGCGTTCAATAGTTCAGGACTGCTCCAGTGGGTTCAAATTAATGTTTAATATTGTTTTGAGAGTCTTTAGAATAGCTTACGAGCGCTTTCGAGCGTTAAGAAGTAGATTAGTAGTCTGATTGTGTGTTCTTGCTTGCTGTGTCATGTCCTGAATTTGTGATATGGATGTTAAGCATGCTGTCATCTGTGATGTCTTCTCTTTCAGCAAGGTTCGGTAAAGACCCACTTCATTTCGGCTTTAGTTTAGTGTTACCCATCTGGTAACAGCCTCAGATTCGCAGGATTAAGTTTATTCAATCACCAACCAGCAAGCCTTTGAGGTTGTTGGTGGCGTTCAATAGTTCAGGACTGGTCCAGTGGGTTCAAATTAATGTTTATTATTGTTTTGGGAGTCTTTAGAATAGCTTAGGAGTTCTTTCGAACGTTAAGTAATTAGATTATTAGTCTGTTTGTTGTGTGCCAGCATGCGGCGTGATGTCTTGAATCTGTGATATTTATGCTACAATGTACTAAGGTATATTTTTCCCCTTCTTTTTGTGGATATTTCAGATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCTAGTAGTGACGATTTTGCTGCATTTCTTGATGTAGCCCTAGAATCCCATTCTTCTGACTCGTCACCAAACAAAAATGCCGAGGTTGGCAATAATGTTGAAAGTTCGAGGTACAGCTAATTTCTAAACTTAATAAATTTCCGATGATCTTTGAAATTAATAATGTCTAAAGACTGTGGTTTGGGGTTCTTTTTTTGTTTTGTTAATCTAATGTGTTTATTACTTCAATCTACAATTAAGTGGGACGTATTTTGTAGTTGCCTGGCTCTCTTTTGAATCTCATACAGAGAAAAAGATGAAATGTGAATTATAATGCTTAAGAGTGTTGAGTTCCTTTTTTTCAGTTTCTGGATGGTATTTTTAAAACAATTGATCATGTTTGGTAGGGTACGATGTTCGATAATTCTTGTTGAAATGGGATTTCATGATTGGTTTAACATTCCAGATTTCCCCATGTGACTCTCTTGTGCTAGTTTTATAGCAGTTTGTCGATTCTAAGCTTTAAAATCTCAAGCCCTCAGTTTTTTAAGACTTTTTTTAAAAAAGAAACCTTATTTCCAAGAGGAAGAATGAAAGAATAAGGTCAAGCTGCAACGAAAAAAAAGAAGATTATAAAATCTTAGACTAGATATTTACAGCAGTCTAAATTATTTAAACTCTCAGGTAATGAGTTTATCTGTCATTTGGTTTCTCTCTCATCTAATTACTTTCTTTTATAGATATCCCAAGTGGTCTGGTTTGTGAAAGTTTCTATCATTTACCGTACGTGTAATTATTATTAGAAGTCTAAAACTATGGTAGATTGGATTCTGCTAATAACTTCTCATTGCCCAATAAGTGTCGTAAGTAATTTCTGTGTTTGTAGTTTGTATTCTTTTTCCTTTGGCATGATTTACAGCATTGAGTTGTGGCAAATGGCGAGTATCATTTCTGCACTTAATAATGGGGTTTGTAGGATTGGTTATAAGTTATAGCCGTATAACTAAACTGTGGTATTTTATCTTCATTTTATCAATTTAATCTTTCAAGCAGTTTTTATTCAATTCATTTAATTTTGTAGGATGAAACGTCGTAAGCTGGTATGCTCAGAGGAGGACATTCTGTGTGGAGTTGAAGAGCAAAGTTTAGGTAAATTAAACTGTTCCTACTAACCCCTTTTGTTGTCCATCTCATACTGTCTCTACTCTTTACTTTCACTATAGCCAACTTGTGGTTGCCACCTGTACCGTACTTAACTTAGTCCTTACTTATTAGACGGCTGACTAACTATATGACGTACTTGCTTATTGTGTTTTTGATGGTTGCCTTCTCTTCCCTTCATGTATGCAAATCTGATTGGAGGCTTGTCACCCTTTTTTATCCATCTTTTTTTTGAATTGCATGAGCTTCTCTAAGAATCATCTAATGTTGGCTCCAAAATGATCTAGATATAAATACCTATCGGGGTTTTCACGTAAGAAACTATGAAATTGGTCCATAAGGGTTAGGGAGAACTGGTTCAAGCTTTGGAGGCCACCTACCTAGGGTTTATGCTAACATAGGATGAGAGTTAACCACCTTGAGTTGGCTTAGTAGTCAAATAAGGGTTATGAAAATAATAAAAAGCTTAGAGAGAATGAGTTCAAGCCTTGATAGCCACCTACCTAGGATTTAATATCCTATGAGTTTTCTTGGAAACCGAATGTGTATGGTTTGACAGTTGTCTCGTGAATTGTTGTAATGTGTACAAACTGGCTCAAACACTCATTGGACGAGAGTTGTTCTTATATATGGTAGTTGTGATCGATCCTTGGTCTAAATTAATAGTCATTTTTAGTTCCTTGGTCTAAATTAATAGTCATTTTTAGTTCCTTGGTCTAACTTTGTGGTATCTGAATTCATCAACTATTTCAGTCCAGCAGTCCTCTGCTCGCCATATATCCCTTACTTTGGTTCCTTTATTCCTTTACTAATGTATCCTGTTCAGTTCCGTGTAATTGTAATTTATTAACATAAAGATACAAAAGCCTTCATTTCGAAGCAAGTATTTGACTCCTTGTTTTTCTGCATTGGGTTTTCTCTAACTGTTGGATTCTGCTATTTTGAAAATAATAATCATTGCAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATCTGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACGTTTGGGTATATTCATAAGGTTTGTCTTCTTCATATGGATATCTAATTATCTACTGTTCTTTGTTACCATGTATAGAATGTTTGGGTGGGGTGGGCTCAAAAGGCATGGTAGTTTGCAAAATGGGCACCATTTAGAAGAGGAATGAGAAATGATGTCAATCATAAGGCCTTGTAATGATAGTTTACAAAATGGGCGCCATTTAGAAGAACGAGAAATTGAAACGACCCTATTTTTCTTTTGAAATAGAGCTGCTACTACATGCATGATATGATTGTTAAGTGTGGAAATATTCATTTAGAATATTCCAAAAACGATGTCATAGACTTAAACGTTCATAATATAAGCTTTAAATCACATGACAATTTTAATAAAGTAAACAAAATAATGAACAATGTCTTGGTCTAACCTAAAGGCTACAAAATAGACAAAATACAACTACCCTAACTTAGGTTCACGTCTTTGATCGCGACGTCACTATCATCCCGTCATCCATAAAGGGATGCCTTACCTTTACCTGAAATGTAAAAGTAACACGACTTGAGTACTTAAAAAAAAAAAAACTAAAAAAGTAACCCTATTGTTGGGGCTAGGAAATGGGAACACATGCAACATGGTGGGATCTATTTTAACATATCATAGCATGACCTTAGGCTGTTTGACAGTCAAGGCATAGTTGGATGTGTAGTACTTCTCTACACATGACCCACGTGTGAGCATGTGTCTACTAGGCGGGCTTGTACACCCCTTTGGCCTGGCTCGTCGGGCTAGCACTTGCTAACGGGCTCTTGTCGAACACCACGGCAGGAAAGGGCCTGCATAATATTATGACAAACATAAATGTCGTAAATTATAAGTCGTAATGCACGTGTCTGTATTCATCATATTGTAGCAGTATCCTGATGCACATAACACACAAATATGCACGAGGTTTCCATAGCATATAATCATGGAATAATGTATGATGCAAGACGAACCTCAATCTATCTAAACATAATAAGTTAGTGAAGTATGCTTACATAATTTTTCTGTATATTTTGTAATACGTAGTATTAACTTTTATGACATGGTCTAAGGAGGTTTATGCATCATAGATTAACGATTTCTTAACAACTTTATACATCGTCACATGATCACAAAAATAACACAATTTTATTACATGATATAGCTTATCATAGCATATGTGACATTACATGGCATCACATGATAAAACATAGCCTATCATGTCAACCAGGGGCCACAAGGCATGTATCATCATACATTATACAATTCATCCAATCAAGTCGATAGTAGAGTCACTTACTTGGTTGGCCTAGGTCGAAGTAACTACACTTTCTTGATACTAGCTTAATGCTGAACCTTGAAATTTAATTCTACCTAAGGAGTCCTACACTCATAACTTCCAATTCAATAGGTATAAAATAATTTTAACTTAAGTTTAAATAATTTTAAATTAGCTAAAATTTTTTTCCCTACTAACTTTAAATGCCTCCCAAGTGTTGAAACTAGTAGGGAAGGGACCTAGCATATCAGACTTAAGCTGGATCAGACGAAAAGTTGACTTGGAGCCGTCAAAAAATGTAGTCTTTGATTGGCTTCGAACTACTCAAGTGAGATGAAATGGAACAATTGTTGTAACTTAGTAAGCCAACCCAACGTTCTCTTCTTTATAAATGGAGTCCAAGTCTCCACCTTCTCAACCTAACAAATATGAGATAACTTTTATGATATTTTATTTTTATTTTTTAATTTTGGGTTATTACATCATACACATCTAAAAGGAGCTTTTATCTTCGGAAGTGTTTTAATGTTGAAACTACTTGAGGAATTTCTCTCTCATCTCTTCTTCCTGATCTCATAGAGCTTCTCCTGCTTGATGGTTTTGCTATAATGACTTTGGCCAAGTAAGTGTGATGTCAGTATTAGTGGTACCAGTGCTCCAGTCTCATAAACAGTTATGCTGTGTGAGGTGTAGAAACTTGGAGTGGTAATAGACGGATAAGTTATGCTTGAAGGTGGCACTGATGAAGTGCATTATGTGATTCAAACGACAGGGCCTAAGTTAGCCGTCTATTTCACTCCAAGTTTCTACTTCAACTGTGACATAGATAAGCTCTATTGCCCTCCTTTGCGATGCATAGCGTATTCCACGTATCCATGCTTCACAAATACGTGATTGACCTGACTCACGTGGTAGAGTACGAGTTCATACCGCTGCAACAAATCCTAAGTTACGTGGAATGTTAGACCAAGAAGTGAAGGTTCTATTTATTAGGAAAATGGTTCTTGTTAAGGTCTATAGCAAAACCATCAAGCGTGGAGAAGTCTTGTGGGATCTAGAAGAAATGAGAGAGAAATTCCCTGAGTAGTTTCAACATTAAGATACTTTCGAGGAAGAAAGTTCCTTTTATATATGTATGATGTTATAACCCAAAATTAAAAAGGAAAAAATAAAATAAAGTAAAAGTTGTCCCGCATCTATTAGGTTGAGAAGGTGGACATTTGGATTTCATTATAAAAGGGAGGTCTCTTGGCTAATTAAGTTACAACAATTGCTTTCTCTCATCCCACTTGAGCAGTTCGGAGCTAGTGGGAGAATACGTTTTTTATACTCGGGTGCTTCCCTATTGCAAATTTAGGTATCGGAGCCAGTGGGAGAATACGTTTTTTATACTCGGGCGCTTCCCTATTGCAAAGTTAGATAAGCCAATTTCCATAGTAGAATTATAAGTTATGAGCATAAGATTCCTCGAATAGAATTAAATTTCAAGGGGCATCTTTAAGTTAGCATCAAGGGAGTGTAGGCTTGGTTGGATGAATTGTATTAGGTATTTTGATACGTGCCTCGTGGCCTTTGCATATGTTATTTGTATGATTGATATGTTGTGCTATGATATGCCATATTATGCTATGATAAAATATGCCAGGTAATAAAACTATTATATTTTTGTGATCATGTTATGTTGTATAAAGTTGTTAAAAAATCGTAATTTATGTTGTATAAACTTTTTTAGACTATGCCATGAAAGGTACGTTATGTATTACATAATGTACATGAATTATGAACTTATTATGTTATAGATAAATGGATGTTTGTCTTCCATCATATATTATGCTATGATTACAATCTATTGGGACCTCATGCATTTTTGTGTCTCACGTGCATTGGGATACTTTCACAATATGATGAGTATGGACGCATACACTACGACTTATGATTTATGACATCTATGTTTCCCATGATGTTATTCGGGCTCTTTGTCTCGTGGTGTTCGACAAGAGTCCATTAGCGAGGACTAGCCTAACGAGTCAGGCCTATAGGGTGTGTGAGCCCGCTTGTTGGACCCATCCTCGCACGCGCAAGCCATGTGTAGAGAAAGTACTACACATTCAACCTTGCTTGGATTGGAAAACTACTCTAGGTCATGCTATTATACGTTCCACCATGTTGCATATGTTTGCATTTTCTGTCCCCAATAGTGAGGTTGTTTAAAATACTCAAGCCATGGGCTACTCTTACATTTTAGGTGAAAGTAAGACACCCTTATATGGATTATAGCGGCTTGCAATGAAGGCCATGGAACTAGTGTTAGTAGTTGCATTTTGTCTATTTTGTAGCCCTTAGGTTAGACAATGACATTTTTTCTTTGTTTTATTTAAATTTTCACGTTATTTATTTTTCATTGTTTTGTTTAAAGCTTTTATTACAAATGTTTAAGTCCATGACATCGTTTTTTAAATATTTTAAATGAATATTTTCTCCCTTAACAATTATGTCAGGTACGTAGTAGCAGCTCTATTTTTATAGCACTTGAGAGTCTGTCTGGTTATACAGCTTTGTTATTCTGTTTTTCTCGGACATCGGTTCTGGCATGCATAGTTGTCTGAGGCATTTTTCTCTCTTTGGGACATAAATCTATGAAAGAATTGGAGTTTACATTTCATCTTGTCAACCTCTTAACCATACATATCTGACTCCTCCTTCTCGCCTCTTTGACATTCAAATACTTGTGCATTCCCCTGTTTATGATTTCCAGTTAATGGAGCCATGAATCCAGTTTTCTCAACTTTCTGCTGGAAGATAGTGAATAATTCCATGATCCTTTGGCTTTGCAGAATATCTCATTCCCCTTCATTTATATTTATCTGTTTCTTATATAAAAAAGGAAAATAGTGAAATAGTCCCATGCAGTTGACATGTTTATGTTGGCCCTTATTTTCATATTTGTCATTCCTTACCTTAATTCCGAATCCCCTTATTGAAAGAAAATAAGTTTCTTGAAATTTTCAATTTTGAATCTTAATCCCTACAAAAAAAAAGAATATTGCAAAGTCAAAAGACTACTTCACCTAATCATTCGAATCTTTGTTTCGTAGTCTATAAATTATACGCCCTCTGGTTCTGGTTGAATCCCGTTATATGCATGCTAATGGGGCTAGTATGTTTCTCATTGTGGTTCACCTTCATATTTTTCGTGGTCTATATCATGCGAGTTATAACAGTCCTAGGGAATTTGTTCAGTGTCTCGGAGTTGTAATCTTCCTATTAATGATTGTGACAGCTTTTATAGGATGCGTACTACCTTGGGGTCAGACAAAATAGATTCTAGCTATTGATATTTACTTGCTATAACTGGAGAAGTATCTTCATTTATATATGCCTCGTTGATCAATAAATTAGCAATTTCTTATTAAAAAGAAGAAAATAGTGGAATAATCCGATGTAGTTTGAGATGTTTTGTTAATATATATTCTCCCTTTAGTCATATTCAACTACCGTTGACAAAATAGATGGTGAGTACCAATAACTTGTAACGTGCTAGTAGATCGAACTATGTGCTAACGATATGGCCCATCTGATAATCATGTAGGGACTCCGACTTAATAATGATGAAATTAACCGGCTACGCAACATCGACATGAAGAGTTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAATTCAACCCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTATGGAGTCAAATAGATTCTCTAGAAGGTACATTGTCCTTCTCATCTATACACAACTATATCTTCCAAGTCCATACTAACAATCCATCCTTCTCCTCCAAAAGGAGATTTTAGTAAATAACCAAATACATAGTTCTTCACTTGCTGTCTTTTCACTATATGACTGAAGATCTGTTAAGAATTTAGGCTTCAATGGCCTGTTTTCGGATGAAGGGTTTGACATAATTTCCTCTTTCAATTGAGCATTTTTTCTCTTGTTCCCTTGGACCATGAAGCGAATTGCACCTTTTTCTGGTCTTGACTGTAAAATTAGTGTTTAGTTTAATTTTATGCTTGGTTTTTAATTAATCTTTAAAATTCGACTTAATTGTTTAACCTTCCAACGAATTCCTGATGTTGGTTGCTCTGTTTGTTTTTGATGCTGTGATGAAAATTTTGAATAGATGTCACGAAAGGCAGCCTTTTCCTATTGAATTCCGTGCATACGATGACAAAGTTGAGACCATTTGTCCATACGTTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAGCGAGCATACGCGTCCGAAATGGCAAAGCTGTTGGACCCCAAAAGGGAGTATTTTAATTCCAAAGTGATTTCTAGGGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATATCGTGCTGGGTCAGGAAAGTGCTGTTCTGATCCTCGATGATACCGAAAATGTAAGTGCATCTTCAATTAGCTAATATACGGATCTCTTGATTCTGCCTAGTTATCACCATTGAATGAGTGTGTAACCGTAGGATTCCCGTCATACATGGAAAGAGGCTGATAACTCGAGTAATGCACTCGGTTGAAAAATGTTCTCTTCCATGTCTGAGAAGAAGGTCTAAATCGTTCAATAGATACGAATCTTGTCTCTTGCTATGAAAGTATCTTAATTCATAATAAAATGCAGCATATTGGCTCCCGAACTCCGAAAGTTTATTCTTTAGGCTTTCTTTTGTGTTTTTTCTTGAAGAATCTCCCTTCATTACTCCTGTGGATTATCTTGTAATGGGTAGTTTTCTGAACTGCCTATTGTTCATACATATGGAGCTAAACATTGTCAATTGATATGATAGGCATGGACAAAGCATAAAGAAAACTTGATATTGATGGAGAGATATCACTTTTTTGCTTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGATGAGAGTGAAACTGATGGGGCACTGGCGACCATCCTGAAAGTTCTTAAGCAAGTTCATAATATATTCTTTAATGTATTTCTCTCTCCCTCTTGTTTCCAAACTTTTGTTCCTTTTTTTGCCCCAATCTCACTGTTGGGAATTCTTCTGCAGGAACTCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTACATGCTTCTTGTAATGATTCCTAAATTTTTGTCTGTAAACTTCAGTTCCCATTTTGATTATGTAGCTGTTGACGGATAACACATTGCTTAAGTTACAAGTTTAAGCTTTGAACTTTCGCGATTCATATCTAATAAGTCCTTGAACTTTTTAAAAAAAAATGTCTAATAGGATTTATATTTCCAATTTTTGGACTGAGAGTGTAATAACGAAGGCCCGCCGCTAGCAGATATTGTCTTTTTTAGGCTTTTCCTTTCGGGTTTCCGCTCAAGGTTTTTATATGCATGTTCTTGGGAGAGATTTCTACACTCTTATAAAAAAATGTTTCGTTCCCCTCTCTAGTCGATGTGGGATCTCACAATCCACTCCTTTCTAGGTCCAGCGTCCTCGTTGGCACTCGTTCCCCTCCTCAATCGATGTGGGAACTCCAAATCCCCACCTCCTTTGGGGCACAGTGTCCTCGCTTGCACTCGTTCCTCTTTTTGGGATTTCACAATCCACCACTCTTCGAGGGCTTAGTGCCCTCGCTGGCACACTGTTGGTGTCCACCCCTCTTCAGAGCTCAGTGTCCTTGCTAGCACACTGCCCAGTGTCTCGGATGCCATTTGTAACAATTACATCTATAAAGAATGTTACGGCCCAAGCTGACCACTAGCAGATATTGTCCTCTTTAGGCTTTTCCTTTTAGGCTTTCCCTCAAGGTTTTTAAACACGTTTGTTAGGGAGAGGTTTCTACAACCATTCATTCCGTCATTCCCGTCAGTACGGATTGATTTTAGTTCCCCTCTCCAACTGTGTTGGATCTCACGAAGAGGTTCATAATCTTTTAGTTCTTTTGTACGAACTTTAATTAGTCTCTAATTTTCCAATTTTATGTAGACAATTTCTTGATAGTTTTGATGTTTCAAAATTCATGGGAAAATTGAAAGTTTTAGCTCTTATTGGATAAAATTCACTGTCATGAAAGTTTTGATATTTATATTTTCATTACCTTAAATTAAACAGGTTTTGAAGACAGTTCGAAGCAAAGTTCTTGAGGGAAGCAAGGTCGTCTTCAGCCGAGTCTTTCCGACCATATTTCAGGCCGAAAACCATCATCTTTGGAAGATGGTAGAGCAGCTGGGAGGCACTTGCTCAACCGAACTCGACTCGTCCGTGACACACGTTGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCATTGAAAGAGGAAAAGTTTCTGGTCCATCCACGGTGGATAGAAGCATCAAACTACTTCTGGAAACGGCAAGCCGAAGACAACTTCCCGGTCGAGCAAAGCAAGAAACAATAACACAGGTTCTCTCTTTACAGCAAGTAGTCTCTCTTTAAATATAGATCTTTGCATTGGTTGGGGGCTTACTTGCTCACCTGTCTGTAGGTATTCATCTCTTCAAGGGTTGATTCAAAGTCCAGCTCTTTGGTTCATGTAGATGTGTAGATGCGTTGTGTTGTAATTTTGGGTAACCATTTTGAGTTGTGGGTTATATAATTGCAGGGCTGTTACTTTCACCAGTTTGGCTTGTGTTCCTTTTAAACAAATTCAAGCCCTATGGTTCTGTATTTGTAATTCTTCTCATGTATTTATTGGAGTTTAGATAAGAACACGTAACTCGAAAGTTTCGCGAAGGAATCTAGCAGAATTGAGGTCGACTTCGAGTTGGTTTATAGGTTC

mRNA sequence

GTTGGGTTGGATTAGCATTTCAGTTTCAGCCGACGTTGTTGCAGGAGGGCATTGTTGATCGGTGACAAACATGGAGAGATTTGGGATCCTTCCGCCGGTACCGTTTCAGCCGAACAGCTCCCTTTCGTTTTCCCTCTCTCTCCGGCGGCTATACCTCGACGATTCCAGTAATATCGTAGGGCGGCGAACAGCAGCACCTCGCCGACGTCGTTCTTTGGTAGCTTGCGACCTACGGCGGCAATTTCTTCTCCCTCTCGTCTTCTCCGGCGAGCGACGGACGAAAACACATAATTTCCAGCGTCTTCATCTTCTCTCTGGCGTGCGCAGAGACACACGACGCGGGAACCACGCAACGCGACGCTTCAACACTTCCGCAACGACGGATGGTTTGTTGGACGGCTGTGCAACCAGCCGACGTGACCTCACGGATTCGCAGCCATTTCTGCATTGGTTAGAGTTTTCAACAGCGTTTTCTTCAGGTTTGATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCTAGTAGTGACGATTTTGCTGCATTTCTTGATGTAGCCCTAGAATCCCATTCTTCTGACTCGTCACCAAACAAAAATGCCGAGGTTGGCAATAATGTTGAAAGTTCGAGGATGAAACGTCGTAAGCTGGTATGCTCAGAGGAGGACATTCTGTGTGGAGTTGAAGAGCAAAGTTTAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATCTGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACGTTTGGGTATATTCATAAGGGACTCCGACTTAATAATGATGAAATTAACCGGCTACGCAACATCGACATGAAGAGTTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAATTCAACCCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTATGGAGTCAAATAGATTCTCTAGAAGATGTCACGAAAGGCAGCCTTTTCCTATTGAATTCCGTGCATACGATGACAAAGTTGAGACCATTTGTCCATACGTTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAGCGAGCATACGCGTCCGAAATGGCAAAGCTGTTGGACCCCAAAAGGGAGTATTTTAATTCCAAAGTGATTTCTAGGGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATATCGTGCTGGGTCAGGAAAGTGCTGTTCTGATCCTCGATGATACCGAAAATGCATGGACAAAGCATAAAGAAAACTTGATATTGATGGAGAGATATCACTTTTTTGCTTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGATGAGAGTGAAACTGATGGGGCACTGGCGACCATCCTGAAAGTTCTTAAGCAAGTTCATAATATATTCTTTAATGAACTCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTTTTGAAGACAGTTCGAAGCAAAGTTCTTGAGGGAAGCAAGGTCGTCTTCAGCCGAGTCTTTCCGACCATATTTCAGGCCGAAAACCATCATCTTTGGAAGATGGTAGAGCAGCTGGGAGGCACTTGCTCAACCGAACTCGACTCGTCCGTGACACACGTTGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCATTGAAAGAGGAAAAGTTTCTGGTCCATCCACGGTGGATAGAAGCATCAAACTACTTCTGGAAACGGCAAGCCGAAGACAACTTCCCGGTCGAGCAAAGCAAGAAACAATAACACAGGTATTCATCTCTTCAAGGGTTGATTCAAAGTCCAGCTCTTTGGTTCATGTAGATGTGTAGATGCGTTGTGTTGTAATTTTGGGTAACCATTTTGAGTTGTGGGTTATATAATTGCAGGGCTGTTACTTTCACCAGTTTGGCTTGTGTTCCTTTTAAACAAATTCAAGCCCTATGGTTCTGTATTTGTAATTCTTCTCATGTATTTATTGGAGTTTAGATAAGAACACGTAACTCGAAAGTTTCGCGAAGGAATCTAGCAGAATTGAGGTCGACTTCGAGTTGGTTTATAGGTTC

Coding sequence (CDS)

ATGGAGAGATTTGGGATCCTTCCGCCGGTACCGTTTCAGCCGAACAGCTCCCTTTCGTTTTCCCTCTCTCTCCGGCGGCTATACCTCGACGATTCCAGTAATATCGTAGGGCGGCGAACAGCAGCACCTCGCCGACGTCGTTCTTTGGTAGCTTGCGACCTACGGCGGCAATTTCTTCTCCCTCTCGTCTTCTCCGGCGAGCGACGGACGAAAACACATAATTTCCAGCGTCTTCATCTTCTCTCTGGCGTGCGCAGAGACACACGACGCGGGAACCACGCAACGCGACGCTTCAACACTTCCGCAACGACGGATGGTTTGTTGGACGGCTGTGCAACCAGCCGACGTGACCTCACGGATTCGCAGCCATTTCTGCATTGGTTAGAGTTTTCAACAGCGTTTTCTTCAGGTTTGATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCTAGTAGTGACGATTTTGCTGCATTTCTTGATGTAGCCCTAGAATCCCATTCTTCTGACTCGTCACCAAACAAAAATGCCGAGGTTGGCAATAATGTTGAAAGTTCGAGGATGAAACGTCGTAAGCTGGTATGCTCAGAGGAGGACATTCTGTGTGGAGTTGAAGAGCAAAGTTTAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATCTGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACGTTTGGGTATATTCATAAGGGACTCCGACTTAATAATGATGAAATTAACCGGCTACGCAACATCGACATGAAGAGTTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAATTCAACCCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTATGGAGTCAAATAGATTCTCTAGAAGATGTCACGAAAGGCAGCCTTTTCCTATTGAATTCCGTGCATACGATGACAAAGTTGAGACCATTTGTCCATACGTTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAGCGAGCATACGCGTCCGAAATGGCAAAGCTGTTGGACCCCAAAAGGGAGTATTTTAATTCCAAAGTGATTTCTAGGGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATATCGTGCTGGGTCAGGAAAGTGCTGTTCTGATCCTCGATGATACCGAAAATGCATGGACAAAGCATAAAGAAAACTTGATATTGATGGAGAGATATCACTTTTTTGCTTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGATGAGAGTGAAACTGATGGGGCACTGGCGACCATCCTGAAAGTTCTTAAGCAAGTTCATAATATATTCTTTAATGAACTCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTTTTGAAGACAGTTCGAAGCAAAGTTCTTGAGGGAAGCAAGGTCGTCTTCAGCCGAGTCTTTCCGACCATATTTCAGGCCGAAAACCATCATCTTTGGAAGATGGTAGAGCAGCTGGGAGGCACTTGCTCAACCGAACTCGACTCGTCCGTGACACACGTTGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCATTGAAAGAGGAAAAGTTTCTGGTCCATCCACGGTGGATAGAAGCATCAAACTACTTCTGGAAACGGCAAGCCGAAGACAACTTCCCGGTCGAGCAAAGCAAGAAACAATAA

Protein sequence

MERFGILPPVPFQPNSSLSFSLSLRRLYLDDSSNIVGRRTAAPRRRRSLVACDLRRQFLLPLVFSGERRTKTHNFQRLHLLSGVRRDTRRGNHATRRFNTSATTDGLLDGCATSRRDLTDSQPFLHWLEFSTAFSSGLMSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSEEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEDNFPVEQSKKQ
Homology
BLAST of CmaCh09G012530.1 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 7.4e-149
Identity = 278/449 (61.92%), Postives = 345/449 (76.84%), Query Frame = 0

Query: 139 MSLATNSPAH-SSSSDDFAAFLDVALESHSSDSS-PNKNAEVGNNVESSRMKRRKLVCSE 198
           MS+A++SP H SSSSDD AAFLD  L+S S  SS P++  E  ++VES  +KR+KL    
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVESG-LKRQKL---- 60

Query: 199 EDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEI 258
                   E   E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIHK +RLN DEI
Sbjct: 61  --------EHLEEASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNEDEI 120

Query: 259 NRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLED---VTKGS 318
           +RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL S   SL+D   V+ GS
Sbjct: 121 SRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVSGGS 180

Query: 319 LFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVI 378
           LFLL  +  MTKLRPFVH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +VI
Sbjct: 181 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 240

Query: 379 SRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNC 438
           SRDDGT +H+K LD+VLGQESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF    
Sbjct: 241 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 300

Query: 439 KSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSK 498
           KSLSELKSDESE DGALAT+LKVLKQ H +FF  + + + +RDVR +LK VR ++L+G K
Sbjct: 301 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEILKGCK 360

Query: 499 VVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFL 558
           +VFSRVFPT  + E+H LWKM E+LG TC+TE+D+SVTHVV+ D GTEK+RWA++E+K++
Sbjct: 361 IVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYV 420

Query: 559 VHPRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           VH  WI+A+NY W +Q E+NF +EQ KKQ
Sbjct: 421 VHRGWIDAANYLWMKQPEENFGLEQLKKQ 435

BLAST of CmaCh09G012530.1 vs. ExPASy Swiss-Prot
Match: Q8LL04 (RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana OX=3702 GN=CPL3 PE=1 SV=2)

HSP 1 Score: 242.7 bits (618), Expect = 1.1e-62
Identity = 136/336 (40.48%), Postives = 203/336 (60.42%), Query Frame = 0

Query: 251  LNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLWSQIDSLEDV 310
            +  + + RL   +   +   +KL LVLD+DHTLLNS +   + +  EE L  + +   + 
Sbjct: 908  IQRERVRRLE--EQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREK 967

Query: 311  TKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFN 370
                LF    +   TKLRP +  FL++AS+L+E+++YTMG + YA+EMAKLLDPK   FN
Sbjct: 968  PYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFN 1027

Query: 371  SKVISRD------DGTQKHQKGLDI--VLGQESAVLILDDTENAWTKHKENLILMERYHF 430
             +VIS+       DG ++  K  D+  V+G ES+V+I+DD+   W +HK NLI +ERY +
Sbjct: 1028 GRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLY 1087

Query: 431  FASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVL 490
            F  S  QFG    SL EL  DE   +G LA+ L V++++H  FF+  S D V  DVR +L
Sbjct: 1088 FPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEV--DVRNIL 1147

Query: 491  KTVRSKVLEGSKVVFSRVFPT-IFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGT 550
             + + K+L G ++VFSR+ P    +   H LW+  EQ G  C+T++D  VTHVV+   GT
Sbjct: 1148 ASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGT 1207

Query: 551  EKSRWALKEEKFLVHPRWIEASNYFWKRQAEDNFPV 577
            +K  WAL   +F+VHP W+EAS + ++R  E+ + +
Sbjct: 1208 DKVNWALTRGRFVVHPGWVEASAFLYQRANENLYAI 1239

BLAST of CmaCh09G012530.1 vs. ExPASy Swiss-Prot
Match: F4JCB2 (RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana OX=3702 GN=CPL5 PE=1 SV=2)

HSP 1 Score: 192.2 bits (487), Expect = 1.6e-47
Identity = 119/312 (38.14%), Postives = 176/312 (56.41%), Query Frame = 0

Query: 178 VGNNVESSRMKRRKLVCSEEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEE 237
           V N     + KRRK+   E  I      +S   LS    C H      +CI C   + + 
Sbjct: 298 VENFSSEPKAKRRKI---EPTI-----NESSSSLSSSSSCGHWYICHGICIGCKSTVKKS 357

Query: 238 SGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLGHLTPEE 297
            G  F YI  GL+L+++ +   +    K S L  KKL LVLDLDHTLL++  +  L+  E
Sbjct: 358 QGRAFDYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHLVLDLDHTLLHTVMVPSLSQAE 417

Query: 298 EYLWSQIDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERA 357
           +YL   I+     T+  L+ + +V      +TKLRPF+  FLKEA++ F MY+YT G R 
Sbjct: 418 KYL---IEEAGSATRDDLWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 477

Query: 358 YASEMAKLLDPKREYFNSKVISRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKEN 417
           YA ++ +L+DPK+ YF  +VI++ +    H K LD VL +E  V+I+DDT N W  HK N
Sbjct: 478 YAKQVLELIDPKKLYFGDRVITKTE--SPHMKTLDFVLAEERGVVIVDDTRNVWPDHKSN 537

Query: 418 LILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDL 477
           L+ + +Y +F       G +    SE K+DESE++G LA +LK+LK+VH  FF  + ++L
Sbjct: 538 LVDISKYSYFRLK----GQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRFF-RVEEEL 591

Query: 478 VDRDVRQVLKTV 485
             +DVR +L+ +
Sbjct: 598 ESKDVRSLLQEI 591

BLAST of CmaCh09G012530.1 vs. ExPASy Swiss-Prot
Match: Q95QG8 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis elegans OX=6239 GN=fcp-1 PE=1 SV=2)

HSP 1 Score: 146.7 bits (369), Expect = 7.9e-34
Identity = 107/386 (27.72%), Postives = 191/386 (49.48%), Query Frame = 0

Query: 209 EVLSKQQLCSHPGSFGNMCIICGQRLDEESG----------VTFGYIH--KGLRLNNDEI 268
           +V++    C+H     +MC  CG+ L E+ G               IH    L +++   
Sbjct: 68  QVIATVSECTHAIVIKDMCATCGKDLREKGGRAGQRKEQSTANVSMIHHVPELIVSDTLA 127

Query: 269 NRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFL 328
             + + D  +L+ ++KL+L++DLD T+++++        E +        +D+TK +L  
Sbjct: 128 KEIGSADENNLITNRKLVLLVDLDQTIIHTSDKPMTVDTENH--------KDITKYNLH- 187

Query: 329 LNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRD 388
            + V+T TKLRP    FL + S ++EM+I T G+R YA  +A++LDP    F  +++SRD
Sbjct: 188 -SRVYT-TKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSRD 247

Query: 389 D--GTQKHQKGLDIVLG-QESAVLILDDTENAWTKHKENLILMERYHFF--ASSCHQFGF 448
           +    Q     L  +    ++ V+I+DD  + W  + E LI ++ Y FF      +    
Sbjct: 248 ELFSAQHKTNNLKALFPCGDNLVVIIDDRSDVW-MYSEALIQIKPYRFFKEVGDINAPKN 307

Query: 449 NCKSLSELKSDESETDGALATILKVLKQVHNIFFNE---LSDDLVDRDVRQVLKTVRSKV 508
           + + +     D++  D  L  I +VL  +H+ ++ +      + V  DV++V+K  R KV
Sbjct: 308 SKEQMPVQIEDDAHEDKVLEEIERVLTNIHDKYYEKHDLRGSEEVLLDVKEVIKEERHKV 367

Query: 509 LEGSKVVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALK 568
           L+G  +VFS + P   + E   ++++  Q G     ++   VTHVV    GT+K   A +
Sbjct: 368 LDGCVIVFSGIVPMGEKLERTDIYRLCTQFGAVIVPDVTDDVTHVVGARYGTQKVYQANR 427

Query: 569 EEKFLVHPRWIEASNYFWKRQAEDNF 575
             KF+V  +W+ A    W +  E+ F
Sbjct: 428 LNKFVVTVQWVYACVEKWLKADENLF 441

BLAST of CmaCh09G012530.1 vs. ExPASy Swiss-Prot
Match: Q8SV03 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Encephalitozoon cuniculi (strain GB-M1) OX=284813 GN=FCP1 PE=1 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 7.9e-26
Identity = 108/412 (26.21%), Postives = 174/412 (42.23%), Query Frame = 0

Query: 217 CSHPGSFGNMCIICGQRLDEESGVTFG-YIHKGLRLNNDEINRLRNIDMKSLLQHKKLIL 276
           C+HP   G +C +CG  + EES +    Y    +++ ++E   +    M++L    KLIL
Sbjct: 4   CNHPIRLGTLCGVCGMEIQEESHLFCALYNTDNVKITHEEAVAIHKEKMEALEMQMKLIL 63

Query: 277 VLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK 336
           VLDLD T+L++T                 SLE   K   F+++      KLRP +   L+
Sbjct: 64  VLDLDQTVLHTTY-------------GTSSLEGTVK---FVIDRCRYCVKLRPNLDYMLR 123

Query: 337 EASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDGTQKHQKGLDIVLGQESA 396
             S+L+E+++YTMG RAYA  + +++DP  +YF+ ++I+RD+      K L  +   +  
Sbjct: 124 RISKLYEIHVYTMGTRAYAERIVEIIDPSGKYFDDRIITRDENQGVLVKRLSRLFPHDHR 183

Query: 397 -VLILDDTENAWTKHKENLILMERYHFF-------------------------------- 456
            ++ILDD  + W  + ENL+L+  + +F                                
Sbjct: 184 NIVILDDRPDVW-DYCENLVLIRPFWYFNRVDINDPLRLKRKIEKEAGENKALEEFVSKR 243

Query: 457 --------------------ASSCHQFGFNCKSLSELKSDESET------DGALATILKV 516
                                SSC   G    S S  + + SE       D  L  +   
Sbjct: 244 KKIEDIRNPEIASRLDDMVLESSCGSEGVEDDSRSTEEKEVSEVQSVASGDSELLKVAGF 303

Query: 517 LKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSRVFPTIFQAENHHLWKMVE 569
           L++VH  +F         R+V+++L+ +R +V  G +   + +      A    L K +E
Sbjct: 304 LRKVHRKYFAS-----KQRNVKRILRKIRRRVFGGDRFFVAEI------ANRAWLVKTIE 363

BLAST of CmaCh09G012530.1 vs. ExPASy TrEMBL
Match: A0A6J1IA13 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661 GN=LOC111472572 PE=4 SV=1)

HSP 1 Score: 881.3 bits (2276), Expect = 2.2e-252
Identity = 444/444 (100.00%), Postives = 444/444 (100.00%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSEED 198
           MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSEED
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSEED 60

Query: 199 ILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINR 258
           ILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINR
Sbjct: 61  ILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINR 120

Query: 259 LRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLN 318
           LRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLN
Sbjct: 121 LRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLN 180

Query: 319 SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDG 378
           SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDG
Sbjct: 181 SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDG 240

Query: 379 TQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSE 438
           TQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSE
Sbjct: 241 TQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSE 300

Query: 439 LKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSR 498
           LKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSR
Sbjct: 301 LKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSR 360

Query: 499 VFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVHPRW 558
           VFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVHPRW
Sbjct: 361 VFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVHPRW 420

Query: 559 IEASNYFWKRQAEDNFPVEQSKKQ 583
           IEASNYFWKRQAEDNFPVEQSKKQ
Sbjct: 421 IEASNYFWKRQAEDNFPVEQSKKQ 444

BLAST of CmaCh09G012530.1 vs. ExPASy TrEMBL
Match: A0A6J1EFC1 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111432775 PE=4 SV=1)

HSP 1 Score: 856.7 bits (2212), Expect = 5.7e-245
Identity = 436/447 (97.54%), Postives = 437/447 (97.76%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRR---KLVCS 198
           MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVES R+KRR   KLVCS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 199 EEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 258
           EED LCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 259 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLF 318
           INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYL SQIDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 319 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISR 378
           LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKREYFNSKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 379 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 438
           DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 439 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVV 498
           LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEG KVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 499 FSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVH 558
           FSRVFPT FQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTD GTEKSRWALKEEKFLVH
Sbjct: 361 FSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDPGTEKSRWALKEEKFLVH 420

Query: 559 PRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           PRWIEASNYFWKRQAEDNFPVEQSKKQ
Sbjct: 421 PRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of CmaCh09G012530.1 vs. ExPASy TrEMBL
Match: A0A6J1ID30 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661 GN=LOC111471991 PE=4 SV=1)

HSP 1 Score: 796.6 bits (2056), Expect = 7.0e-227
Identity = 403/447 (90.16%), Postives = 422/447 (94.41%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKR---RKLVCS 198
           MSL TNSPAHSSSSDDFAAFLDVAL+SHSSDS PN+ AE  NNVE+ R+KR    KL  S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSLPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 199 EEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 258
            EDIL GVEE S EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 259 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLF 318
           INRLRNIDMK LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYL +Q+DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKKLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRNQMDSLEDVTKGSLF 180

Query: 319 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISR 378
           LL+SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKREYF+SKVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 379 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 438
           DDGTQKHQKGLD+VLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 439 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVV 498
           LSELKSDESE+DGALATILKVLKQVHNIFFNELS+DLVDRDVRQVLKTVRSKVLEG KVV
Sbjct: 301 LSELKSDESESDGALATILKVLKQVHNIFFNELSEDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 499 FSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVH 558
           FSRVFPT FQA+NHHLWKMVEQLGGTCSTELD SVTH+VSTDAGTEKSRWA+KE+KFLVH
Sbjct: 361 FSRVFPTKFQADNHHLWKMVEQLGGTCSTELDPSVTHIVSTDAGTEKSRWAIKEQKFLVH 420

Query: 559 PRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           P+WIEASNYFWKR+AE+ FPVE +KKQ
Sbjct: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447

BLAST of CmaCh09G012530.1 vs. ExPASy TrEMBL
Match: A0A6J1IEP6 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661 GN=LOC111472572 PE=4 SV=1)

HSP 1 Score: 796.2 bits (2055), Expect = 9.2e-227
Identity = 396/396 (100.00%), Postives = 396/396 (100.00%), Query Frame = 0

Query: 187 MKRRKLVCSEEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH 246
           MKRRKLVCSEEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH
Sbjct: 3   MKRRKLVCSEEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH 62

Query: 247 KGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSL 306
           KGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSL
Sbjct: 63  KGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSL 122

Query: 307 EDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKRE 366
           EDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKRE
Sbjct: 123 EDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKRE 182

Query: 367 YFNSKVISRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC 426
           YFNSKVISRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC
Sbjct: 183 YFNSKVISRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC 242

Query: 427 HQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRS 486
           HQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRS
Sbjct: 243 HQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRS 302

Query: 487 KVLEGSKVVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWA 546
           KVLEGSKVVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWA
Sbjct: 303 KVLEGSKVVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWA 362

Query: 547 LKEEKFLVHPRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           LKEEKFLVHPRWIEASNYFWKRQAEDNFPVEQSKKQ
Sbjct: 363 LKEEKFLVHPRWIEASNYFWKRQAEDNFPVEQSKKQ 398

BLAST of CmaCh09G012530.1 vs. ExPASy TrEMBL
Match: A0A6J1BV42 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 3.5e-226
Identity = 403/450 (89.56%), Postives = 422/450 (93.78%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSE-- 198
           MSL TNSPAHSSSSDDFAAFLDVAL+SHSSDSSP + AE  NNVES RMKRRK+   E  
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 199 ----EDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLN 258
               EDI  GVEEQS EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIHKGLRLN
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHKGLRLN 120

Query: 259 NDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKG 318
           NDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYL SQ DSLEDVTKG
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKG 180

Query: 319 SLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKV 378
           SLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKREYF++KV
Sbjct: 181 SLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV 240

Query: 379 ISRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFN 438
           ISRDDGTQKH+KGLD+VLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+N
Sbjct: 241 ISRDDGTQKHKKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGYN 300

Query: 439 CKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGS 498
           CKSLSELKSDESETDGALATILKVLKQVH IFFNEL DDLVDRDVRQVLKTVRSKVLEG 
Sbjct: 301 CKSLSELKSDESETDGALATILKVLKQVHTIFFNELLDDLVDRDVRQVLKTVRSKVLEGC 360

Query: 499 KVVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKF 558
           KVVF+RVFPT F A+NHHLWKMVEQLGG+CST+LDSSVTHVVSTDAGTEKSRWA+KE+KF
Sbjct: 361 KVVFTRVFPTKFPADNHHLWKMVEQLGGSCSTDLDSSVTHVVSTDAGTEKSRWAVKEQKF 420

Query: 559 LVHPRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           LVHPRWIEASNYFWKRQ E+NFPVEQ+KKQ
Sbjct: 421 LVHPRWIEASNYFWKRQVEENFPVEQTKKQ 450

BLAST of CmaCh09G012530.1 vs. NCBI nr
Match: XP_022973946.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita maxima] >XP_022973947.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita maxima])

HSP 1 Score: 881.3 bits (2276), Expect = 4.5e-252
Identity = 444/444 (100.00%), Postives = 444/444 (100.00%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSEED 198
           MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSEED
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRRKLVCSEED 60

Query: 199 ILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINR 258
           ILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINR
Sbjct: 61  ILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINR 120

Query: 259 LRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLN 318
           LRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLN
Sbjct: 121 LRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLN 180

Query: 319 SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDG 378
           SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDG
Sbjct: 181 SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDG 240

Query: 379 TQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSE 438
           TQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSE
Sbjct: 241 TQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSE 300

Query: 439 LKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSR 498
           LKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSR
Sbjct: 301 LKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVVFSR 360

Query: 499 VFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVHPRW 558
           VFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVHPRW
Sbjct: 361 VFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVHPRW 420

Query: 559 IEASNYFWKRQAEDNFPVEQSKKQ 583
           IEASNYFWKRQAEDNFPVEQSKKQ
Sbjct: 421 IEASNYFWKRQAEDNFPVEQSKKQ 444

BLAST of CmaCh09G012530.1 vs. NCBI nr
Match: XP_022925487.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita moschata] >XP_022925488.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita moschata])

HSP 1 Score: 856.7 bits (2212), Expect = 1.2e-244
Identity = 436/447 (97.54%), Postives = 437/447 (97.76%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRR---KLVCS 198
           MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVES R+KRR   KLVCS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 199 EEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 258
           EED LCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 259 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLF 318
           INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYL SQIDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 319 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISR 378
           LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKREYFNSKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 379 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 438
           DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 439 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVV 498
           LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEG KVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 499 FSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVH 558
           FSRVFPT FQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTD GTEKSRWALKEEKFLVH
Sbjct: 361 FSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDPGTEKSRWALKEEKFLVH 420

Query: 559 PRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           PRWIEASNYFWKRQAEDNFPVEQSKKQ
Sbjct: 421 PRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of CmaCh09G012530.1 vs. NCBI nr
Match: KAG7025178.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 854.0 bits (2205), Expect = 7.7e-244
Identity = 435/447 (97.32%), Postives = 436/447 (97.54%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRR---KLVCS 198
           MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVES R+KRR   KLVCS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 199 EEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 258
           EED LCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 259 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLF 318
           INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYL SQIDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 319 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISR 378
           LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKREYFNSKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 379 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 438
           DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 439 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVV 498
           LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEG KVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 499 FSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVH 558
           FSRVFPT FQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTD GTEKSRWALKE KFLVH
Sbjct: 361 FSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDPGTEKSRWALKEGKFLVH 420

Query: 559 PRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           PRWIEASNYFWKRQAEDNFPVEQSKKQ
Sbjct: 421 PRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of CmaCh09G012530.1 vs. NCBI nr
Match: XP_023535497.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023535498.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 849.7 bits (2194), Expect = 1.4e-242
Identity = 432/447 (96.64%), Postives = 436/447 (97.54%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRR---KLVCS 198
           MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVG+NVES R+KRR   KLVCS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGSNVESERIKRRKVEKLVCS 60

Query: 199 EEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 258
           EED LCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 259 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLF 318
           INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYL SQIDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 319 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISR 378
           LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKREYFNSKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 379 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 438
           DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLIL+ERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILIERYHFFASSCHQFGFNCKS 300

Query: 439 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVV 498
           LSELK+DESETDGALATILKVLKQVHNIFFNEL DDLVDRDVRQVLKTVRSKVLEG KVV
Sbjct: 301 LSELKTDESETDGALATILKVLKQVHNIFFNELLDDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 499 FSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVH 558
           FSRVFPT FQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTD GTEKSRWALKEEKFLVH
Sbjct: 361 FSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDPGTEKSRWALKEEKFLVH 420

Query: 559 PRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           PRWIEASNYFWKRQAEDNFPVEQSKKQ
Sbjct: 421 PRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of CmaCh09G012530.1 vs. NCBI nr
Match: XP_038890381.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida] >XP_038890382.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida])

HSP 1 Score: 802.7 bits (2072), Expect = 2.0e-228
Identity = 406/447 (90.83%), Postives = 423/447 (94.63%), Query Frame = 0

Query: 139 MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESSRMKRR---KLVCS 198
           MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP + AE  NN ES R+KRR   KL  S
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEGDNNAESERIKRRKVEKLENS 60

Query: 199 EEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 258
           EEDIL GVEEQS E +SKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  EEDILYGVEEQSSEAISKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 259 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLF 318
           INRLRNIDMKSLL HKKLILVLDLDHTLLNSTQLGHLTPEEEYL SQ DSL+DVTKGSLF
Sbjct: 121 INRLRNIDMKSLLLHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLDDVTKGSLF 180

Query: 319 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISR 378
           LLNSVHTMTKLRPFVH+FLKEA+QLFEMYIYTMGERAYA EMAKLLDPK+EYFN KVISR
Sbjct: 181 LLNSVHTMTKLRPFVHSFLKEANQLFEMYIYTMGERAYAFEMAKLLDPKKEYFNGKVISR 240

Query: 379 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 438
           DDGTQKHQKGLD+VLGQESAVLILDDTENAW KHK+NLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGQESAVLILDDTENAWPKHKKNLILMERYHFFASSCHQFGFNCKS 300

Query: 439 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSKVV 498
           LSELKSDESETDGALATILKVLKQVH++FFNELSDDLVDRDVRQ+LKTVRSKVLEG KVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHSVFFNELSDDLVDRDVRQILKTVRSKVLEGCKVV 360

Query: 499 FSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFLVH 558
           FSRVFPT FQA+NHHLWKMVEQLGGTCSTELD SVTHVVS DAGTEKSRWALKEEKFLVH
Sbjct: 361 FSRVFPTKFQADNHHLWKMVEQLGGTCSTELDQSVTHVVSMDAGTEKSRWALKEEKFLVH 420

Query: 559 PRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           PRWIEASNYFWKRQ+E+NFPVEQ+KKQ
Sbjct: 421 PRWIEASNYFWKRQSEENFPVEQTKKQ 447

BLAST of CmaCh09G012530.1 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4 )

HSP 1 Score: 528.9 bits (1361), Expect = 5.3e-150
Identity = 278/449 (61.92%), Postives = 345/449 (76.84%), Query Frame = 0

Query: 139 MSLATNSPAH-SSSSDDFAAFLDVALESHSSDSS-PNKNAEVGNNVESSRMKRRKLVCSE 198
           MS+A++SP H SSSSDD AAFLD  L+S S  SS P++  E  ++VES  +KR+KL    
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVESG-LKRQKL---- 60

Query: 199 EDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEI 258
                   E   E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIHK +RLN DEI
Sbjct: 61  --------EHLEEASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNEDEI 120

Query: 259 NRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLED---VTKGS 318
           +RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL S   SL+D   V+ GS
Sbjct: 121 SRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVSGGS 180

Query: 319 LFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVI 378
           LFLL  +  MTKLRPFVH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +VI
Sbjct: 181 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 240

Query: 379 SRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNC 438
           SRDDGT +H+K LD+VLGQESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF    
Sbjct: 241 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 300

Query: 439 KSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGSK 498
           KSLSELKSDESE DGALAT+LKVLKQ H +FF  + + + +RDVR +LK VR ++L+G K
Sbjct: 301 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEILKGCK 360

Query: 499 VVFSRVFPTIFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGTEKSRWALKEEKFL 558
           +VFSRVFPT  + E+H LWKM E+LG TC+TE+D+SVTHVV+ D GTEK+RWA++E+K++
Sbjct: 361 IVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYV 420

Query: 559 VHPRWIEASNYFWKRQAEDNFPVEQSKKQ 583
           VH  WI+A+NY W +Q E+NF +EQ KKQ
Sbjct: 421 VHRGWIDAANYLWMKQPEENFGLEQLKKQ 435

BLAST of CmaCh09G012530.1 vs. TAIR 10
Match: AT2G33540.1 (C-terminal domain phosphatase-like 3 )

HSP 1 Score: 242.7 bits (618), Expect = 7.5e-64
Identity = 136/336 (40.48%), Postives = 203/336 (60.42%), Query Frame = 0

Query: 251  LNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLWSQIDSLEDV 310
            +  + + RL   +   +   +KL LVLD+DHTLLNS +   + +  EE L  + +   + 
Sbjct: 908  IQRERVRRLE--EQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREK 967

Query: 311  TKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFN 370
                LF    +   TKLRP +  FL++AS+L+E+++YTMG + YA+EMAKLLDPK   FN
Sbjct: 968  PYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFN 1027

Query: 371  SKVISRD------DGTQKHQKGLDI--VLGQESAVLILDDTENAWTKHKENLILMERYHF 430
             +VIS+       DG ++  K  D+  V+G ES+V+I+DD+   W +HK NLI +ERY +
Sbjct: 1028 GRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLY 1087

Query: 431  FASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVL 490
            F  S  QFG    SL EL  DE   +G LA+ L V++++H  FF+  S D V  DVR +L
Sbjct: 1088 FPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEV--DVRNIL 1147

Query: 491  KTVRSKVLEGSKVVFSRVFPT-IFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDAGT 550
             + + K+L G ++VFSR+ P    +   H LW+  EQ G  C+T++D  VTHVV+   GT
Sbjct: 1148 ASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGT 1207

Query: 551  EKSRWALKEEKFLVHPRWIEASNYFWKRQAEDNFPV 577
            +K  WAL   +F+VHP W+EAS + ++R  E+ + +
Sbjct: 1208 DKVNWALTRGRFVVHPGWVEASAFLYQRANENLYAI 1239

BLAST of CmaCh09G012530.1 vs. TAIR 10
Match: AT3G17550.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 200.3 bits (508), Expect = 4.3e-51
Identity = 118/288 (40.97%), Postives = 176/288 (61.11%), Query Frame = 0

Query: 203 VEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINRLRNI 262
           + E S  + S +  C H      +CI C   +++  G  F Y+ +GL+L+++     +  
Sbjct: 15  INESSSSLSSSRSSCGHWYVRYGVCIACKSTVNKRHGRAFDYLVQGLQLSHEAAAFTKRF 74

Query: 263 DMK-SLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDSLEDVTKGSLFLLNSVH 322
             +   L  KKL LVLDLDHTLL+S ++  L+  E+ L   I+     T+  L+ L+S +
Sbjct: 75  TTQFYCLNEKKLNLVLDLDHTLLHSIRVSLLSETEKCL---IEEACSTTREDLWKLDSDY 134

Query: 323 TMTKLRPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDGTQK 382
            +TKLRPFVH FLKEA++LF MY+YTMG R YA  + KL+DPKR YF  +VI+RD+    
Sbjct: 135 -LTKLRPFVHEFLKEANELFTMYVYTMGTRVYAESLLKLIDPKRIYFGDRVITRDE--SP 194

Query: 383 HQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKS 442
           + K LD+VL +E  V+I+DDT + WT HK NL+ +  YHFF  +  +      S +E K 
Sbjct: 195 YVKTLDLVLAEERGVVIVDDTSDVWTHHKSNLVEINEYHFFRVNGPE---ESNSYTEEKR 254

Query: 443 DESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVL 490
           DES+ +G LA +LK+LK+VH  FF  + ++L  +DVR +L+ +  K+L
Sbjct: 255 DESKNNGGLANVLKLLKEVHYGFF-RVKEELESQDVRFLLQEIDFKLL 292

BLAST of CmaCh09G012530.1 vs. TAIR 10
Match: AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 196.4 bits (498), Expect = 6.2e-50
Identity = 113/277 (40.79%), Postives = 169/277 (61.01%), Query Frame = 0

Query: 210 VLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINRLRNIDMK-SLL 269
           V +    C H   F  +CI C  ++ +     F YI KGL+L+N+ +   +++  K S L
Sbjct: 3   VTTSSSCCGHWYVFQGICIGCKSKVHKSQFRKFDYIFKGLQLSNEAVALTKSLTTKHSCL 62

Query: 270 QHKKLILVLDLDHTLLNSTQLGHLTPEEEYLWSQIDS--LEDVTKGSLFLLNSVHTMTKL 329
             KKL LVLDLDHTLL+S  + +L+  E YL  +  S   ED+ K    + + +  + KL
Sbjct: 63  NEKKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASSRTREDLWKFRP-IGHPIDRLIKL 122

Query: 330 RPFVHTFLKEASQLFEMYIYTMGERAYASEMAKLLDPKREYFNSKVISRDDGTQKHQKGL 389
           RPFV  FLKEA+++F M++YTMG R YA  + +++DPK+ YF ++VI++D+  +   K L
Sbjct: 123 RPFVRDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDESPR--MKTL 182

Query: 390 DIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESET 449
           ++VL +E  V+I+DDT + W  HK NLI + +Y +F  S    G +  S SE K+DE E 
Sbjct: 183 NLVLAEERGVVIVDDTRDIWPHHKNNLIQIRKYKYFRRS----GLDSNSYSEKKTDEGEN 242

Query: 450 DGALATILKVLKQVHNIFF-NELSDDLVDRDVRQVLK 483
           DG LA +LK+L++VH  FF  E+ + L   DVR +LK
Sbjct: 243 DGGLANVLKLLREVHRRFFIVEVEEVLESMDVRSLLK 272

BLAST of CmaCh09G012530.1 vs. TAIR 10
Match: AT3G19595.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 192.2 bits (487), Expect = 1.2e-48
Identity = 119/312 (38.14%), Postives = 176/312 (56.41%), Query Frame = 0

Query: 178 VGNNVESSRMKRRKLVCSEEDILCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEE 237
           V N     + KRRK+   E  I      +S   LS    C H      +CI C   + + 
Sbjct: 4   VENFSSEPKAKRRKI---EPTI-----NESSSSLSSSSSCGHWYICHGICIGCKSTVKKS 63

Query: 238 SGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLGHLTPEE 297
            G  F YI  GL+L+++ +   +    K S L  KKL LVLDLDHTLL++  +  L+  E
Sbjct: 64  QGRAFDYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHLVLDLDHTLLHTVMVPSLSQAE 123

Query: 298 EYLWSQIDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERA 357
           +YL   I+     T+  L+ + +V      +TKLRPF+  FLKEA++ F MY+YT G R 
Sbjct: 124 KYL---IEEAGSATRDDLWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 183

Query: 358 YASEMAKLLDPKREYFNSKVISRDDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKEN 417
           YA ++ +L+DPK+ YF  +VI++ +    H K LD VL +E  V+I+DDT N W  HK N
Sbjct: 184 YAKQVLELIDPKKLYFGDRVITKTE--SPHMKTLDFVLAEERGVVIVDDTRNVWPDHKSN 243

Query: 418 LILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNELSDDL 477
           L+ + +Y +F       G +    SE K+DESE++G LA +LK+LK+VH  FF  + ++L
Sbjct: 244 LVDISKYSYFRLK----GQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRFF-RVEEEL 297

Query: 478 VDRDVRQVLKTV 485
             +DVR +L+ +
Sbjct: 304 ESKDVRSLLQEI 297

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q00IB67.4e-14961.92RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O... [more]
Q8LL041.1e-6240.48RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana O... [more]
F4JCB21.6e-4738.14RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana O... [more]
Q95QG87.9e-3427.72RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis eleg... [more]
Q8SV037.9e-2626.21RNA polymerase II subunit A C-terminal domain phosphatase OS=Encephalitozoon cun... [more]
Match NameE-valueIdentityDescription
A0A6J1IA132.2e-252100.00RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661... [more]
A0A6J1EFC15.7e-24597.54RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36... [more]
A0A6J1ID307.0e-22790.16RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661... [more]
A0A6J1IEP69.2e-227100.00RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661... [more]
A0A6J1BV423.5e-22689.56RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3... [more]
Match NameE-valueIdentityDescription
XP_022973946.14.5e-252100.00RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita max... [more]
XP_022925487.11.2e-24497.54RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita mos... [more]
KAG7025178.17.7e-24497.32RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma s... [more]
XP_023535497.11.4e-24296.64RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita pep... [more]
XP_038890381.12.0e-22890.83RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa his... [more]
Match NameE-valueIdentityDescription
AT5G58003.15.3e-15061.92C-terminal domain phosphatase-like 4 [more]
AT2G33540.17.5e-6440.48C-terminal domain phosphatase-like 3 [more]
AT3G17550.14.3e-5140.97Haloacid dehalogenase-like hydrolase (HAD) superfamily protein [more]
AT2G04930.16.2e-5040.79Haloacid dehalogenase-like hydrolase (HAD) superfamily protein [more]
AT3G19595.11.2e-4838.14Haloacid dehalogenase-like hydrolase (HAD) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004274FCP1 homology domainSMARTSM00577forpap2coord: 271..426
e-value: 9.7E-53
score: 191.2
IPR004274FCP1 homology domainPFAMPF03031NIFcoord: 274..421
e-value: 5.1E-27
score: 94.5
IPR004274FCP1 homology domainPROSITEPS50969FCP1coord: 268..439
score: 30.460117
IPR001357BRCT domainSMARTSM00292BRCT_7coord: 486..566
e-value: 4.7E-6
score: 36.1
IPR001357BRCT domainPFAMPF12738PTCB-BRCTcoord: 507..558
e-value: 1.2E-7
score: 31.6
IPR001357BRCT domainPROSITEPS50172BRCTcoord: 484..576
score: 13.14486
IPR011947FCP1-like phosphatase, phosphatase domainTIGRFAMTIGR02250TIGR02250coord: 267..422
e-value: 1.7E-53
score: 178.9
IPR023214HAD superfamilyGENE3D3.40.50.1000coord: 255..487
e-value: 1.4E-52
score: 180.4
IPR036420BRCT domain superfamilyGENE3D3.40.50.10190BRCT domaincoord: 488..576
e-value: 9.3E-22
score: 79.1
IPR036420BRCT domain superfamilySUPERFAMILY52113BRCT domaincoord: 488..576
IPR039189CTD phosphatase Fcp1PANTHERPTHR23081RNA POLYMERASE II CTD PHOSPHATASEcoord: 140..577
NoneNo IPR availablePANTHERPTHR23081:SF28BNAC03G12630D PROTEINcoord: 140..577
NoneNo IPR availableCDDcd17729BRCT_CTDP1coord: 476..571
e-value: 9.90642E-36
score: 127.264
NoneNo IPR availableCDDcd07521HAD_FCP1-likecoord: 272..416
e-value: 9.46213E-36
score: 128.482
IPR036412HAD-like superfamilySUPERFAMILY56784HAD-likecoord: 263..426

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmaCh09G012530CmaCh09G012530gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh09G012530.1:exon:1369CmaCh09G012530.1:exon:1369exon
CmaCh09G012530.1:exon:1368CmaCh09G012530.1:exon:1368exon
CmaCh09G012530.1:exon:1367CmaCh09G012530.1:exon:1367exon
CmaCh09G012530.1:exon:1366CmaCh09G012530.1:exon:1366exon
CmaCh09G012530.1:exon:1365CmaCh09G012530.1:exon:1365exon
CmaCh09G012530.1:exon:1364CmaCh09G012530.1:exon:1364exon
CmaCh09G012530.1:exon:1363CmaCh09G012530.1:exon:1363exon
CmaCh09G012530.1:exon:1362CmaCh09G012530.1:exon:1362exon
CmaCh09G012530.1:exon:1361CmaCh09G012530.1:exon:1361exon
CmaCh09G012530.1:exon:1360CmaCh09G012530.1:exon:1360exon
CmaCh09G012530.1:exon:1359CmaCh09G012530.1:exon:1359exon
CmaCh09G012530.1:exon:1358CmaCh09G012530.1:exon:1358exon


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh09G012530.1:three_prime_utrCmaCh09G012530.1:three_prime_utr_2three_prime_UTR
CmaCh09G012530.1:three_prime_utrCmaCh09G012530.1:three_prime_utrthree_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_10CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_9CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_8CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_7CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_6CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_5CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_4CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_3CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cds_2CDS
CmaCh09G012530.1:cdsCmaCh09G012530.1:cdsCDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh09G012530.1:five_prime_utrCmaCh09G012530.1:five_prime_utr_2five_prime_UTR
CmaCh09G012530.1:five_prime_utrCmaCh09G012530.1:five_prime_utrfive_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmaCh09G012530.1CmaCh09G012530.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0070940 dephosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0005634 nucleus
molecular_function GO:0004721 phosphoprotein phosphatase activity
molecular_function GO:0008420 RNA polymerase II CTD heptapeptide repeat phosphatase activity