Tan0002421 (gene) Snake gourd v1

Overview
Name: Tan0002421
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein-serine/threonine phosphatase
Location: LG06: 19371154 .. 19377994 (-)
RNA-Seq Expression: Tan0002421
Synteny: Tan0002421
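
The Location field combines a linkage group, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that other records use the same layout, the field can be parsed and the genomic span computed as follows. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'LG06: 19371154 .. 19377994 (-)'.

    Assumption: coordinates are 1-based and inclusive.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span, so add 1 to the coordinate difference.
        "length": end - start + 1,
    }

loc = parse_location("LG06: 19371154 .. 19377994 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 6841 bp on the minus strand of LG06.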
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCTCACTCACCGCCACTCTTTTCTTTATTTTTCTTTTTCTCGTTTTTATTTTATTTGATTTGATTTTCTCTTTTTTCCCTTTTTCTTTTCCTATTACTATACTATCTTCCACTTCCCTTCCGCCTCGATTCCCCGTCTCCTTATAATTCATTCATACCTCAATTCAATCTCCTTTTCAAATTCTTTGTGGATTTCCAGATCTCACATCAATTTTTCTTCCCCAACTCGCTTTTCTTCCTTTAATTCAATCAACACTGTTTGCTTTTTTTTTTAATCCATCCCAATACGTTTCTCTTTGTTCTACATCTACTTGGATTGGATTCTCTTCCACCGAAGTAAGAAGAAAGAAAAACTCCACAATTCCAAACCAAAGTTGTTTAATTTAATTTTTTAATGTTCCAGCGTAATTCGCATCAAGATCGATTGGCTTTTTCTCCCTTCTTGCTTCTTCCAGATTTTCTAACAACTCCTAATACTATTTATGGTAGTTTTCGGATCTGATTCCAACAATTTGTCGAAATTTTCCAAGAAACCCTAGGGTGGGGTTAATGGGGAAAGACGAAAGTGTTAAAATTCAAGACGTTGAAGAAGGGGAAATCTCTGATACAGCTTCAGTTGAGGAGATCACTGAGGAAGATTTTAATAAGCTTGAAACTGGTCCTAAGGTGTTGCCATCCAAAGATTCCAATCGGGACACTAGAGTTTGGACCATGAGTGATTTGTATAAGAATTATCCCACCATGTGTCGTGGTTATGCTTCTGGTTTGTACAATTTAGCTTGGGCGCAGGCAGTCCAGAATAAGCCTCTCAATGATATTTTTGTTATGGAGGCCGACCCCGATGAGAAATCGAAGCGGTCCTCTTCCTCCCCTTTTGCGAATGGCAAGGAAGATGGAAACGCTACAAAAGAAGAGGGTAAAGTCGTGATTGATGTTAGCGGCGATGACATGAATTGTGATAATGCGAATGTCGAGAAAGAGGAAGGTGAATTGGAGGAGGGCGAAATTGATATGGATACGGAGTTCGTCGAAGAAGTTGTGGACTCTAAAGCAATGTTGTCCGACTGTATGGATACTGAATGTCAAGATATTGATTTGAAAGTTAAGGATTTGGATGACCAAGTGAAATTGATTCAGAAAACATTGGATGGTGTGACAATCGACGCTGCACAGAAGTAAGTGAACCGATCTACTTGAGTTTTGGATCAATGGGTGAATGGTGCCATGTGATGCGTCCCCGATATGAAATTAATGGGTTGCTGTCAAATTTACTATCTGGTTTCTTTCTTGTAGATCGTTTCAGGAAGTTTGCTCCCAACTGCATAGTTCTATTGAGACGTTTTTGGAATTGGTCCAGGGAAAGGTAGTCCCGAGGAAGGATGCGCTCATTCAACGATTATATGCTGCTCTTCGAATAATCAATTCTGTAAGGCACCCCCAAGAATCTCGGCCCTCTTTCGTTTGGCCAATAAATTTCATTTTGCAGGGAAATCTTATTTAACGTATATTATTAATTGATGATAATGACAATGAACACTGAAGACGTTTAATTTTCATCTCAGGTATTTTGTTCCATGAACCCCAATGAGAAGGAGGAGCATAAGCAACATTTATCGAGGTTTTTACTCATCCTAGTCTATAGTTTCTTTTTCGATGTAAAGGGTTTAGTCAATTTGTTTAGTATCTGACGACCACTTGCTGTGATATTGTTCATAGGTTGCTTTCTTATGCTAAAAATTGCAACCCTCCTCTCTTTTCTCCCGAGCAGATAAAATCGGTACAGTATACTATCCTTACTCTGCTTGTTATTACTCCTTGTGCTAATGTTGGTATCTTGTGGAGGAGATTGTGATATGGTGCTGAGCTAAACCAGGATATTTTCTACAGATAGAGGTCAAAATGCCATCTACTGATTCCCTCGACAATTTGCCCAGCACGAGAGCCAGTGCTAAAGAGGTCGAGATCCATATACCTAATGGGGTGAAAAATAAGGACTTTTAT
TCTGCATACACAAATGCTAGTCCACATTTAACTTCTTCAACTAAGTTGTCTTCCATGCCCGTTGAGGTTACGGCAAAAAATAATATAAATATCTCATCCGATGGTTTGCAATCTGGAGTATCTAATGTAAAGGGTAGAGGCCCCCTACTCCCTCTGTTAGACCTTCACAAGGATCATGATGCAGACAGTCTTCCGTCACCTACCAGAGAAGCTCCTACAATTTTTTCTGTCCAAAAATCAGGGCATATCCCTGCAAAGGTGGCCCATGCTATGGATGGATCAAGATCACATCCTTATGAAACTGATGCCCTAAAAGCTGTTTCAACCTACCAACAGAAGTTTGGTCGAAGTTCCTTTTCAATGGCTGATAGACTTCCTAGCCCAACCCCTTCAGAAGAATGTGACGGGGGTGGGGATATTGGTGGGGAGGTTTCTAGTTCTTCTATTATCAGAAGTTCAAAGGCTTCAAATTCTTCTAAACTGGGTCAAAAGGTGTCAAATTCATCTTCGAACATATCTATAGGCCTTTTTCCTCACCTGGAAAGTTCCAACACTAAAGGGCTGAGTAATCCTTTAAATGTTGCTCCTCCAAGTTGTGTGTCTAATCCAACGGTAAAGCCTTTAGCAAAAAGTAGAGACCCTAGGCTACGAATGGTCAATTCGGATGCAAGTGCTATGGATCTTAATCCACGCACAGTGACTTCAGTGCAGAATCCTTCTGTCGTAGAATCTGCTACAACAATAAACTTGAGAAAGCAAAAGATGGATGTGGAGTCTAATCTAGATGGCCCTGAAATGAAAAGGCAGAGGATTGGATCTCAGAATCATGCAGTAGCTGCAAACGATGTGAGAGCCCCCTTTGGAAGCGGGGGCTGGTTGGAAGATACCATGTCCGCTGGACCCAGACTTCCAAGTAGGAATCAGATGGAGATTGCTGAAGCAAATGCAACAGAAAAAATAAATGTTTCAAACAATTCTGGTGCTGGAAATGAGTGTGCACCTACCATAAGTGCTACCAACGATGCTTCCTTACCCTCGCTGTTGAAAGATATTGTTGTGAACCCAACCATGCTCCTTAGTTTACTTAAAATGAGCCAACAGCAGCAATTAGCTGCAGAATTGAAATTAAAGTCAAGTGAACCTGAAAAAAATGTAATTTGTCCAACGGCCGTGAATCCTTGTCTAGGATCTAGTCCACTAGTAAATGCTCCTCCTGTGACCTCAGGAATTTTGCAGCAATCAGCAGGAACACTAAGTGTACCTTCACCACCGGTGGTTACTGTGGTAAGTGATTGATTACATCACAACTCTGAAAGTTATGCATTATATTATATGCTCATTTATGCATATCTATTTGTCAAGTGGTGCCGCTTTCCCTCTCCCCATTTGCATTATAAAATTTCCCATTGCTTTTTCTCCTTGATTGATAAGGGATCCCTTCCCAATCAAGTGGTTGGTAATATCCTGGACACAAGTGCTTGGGTTCATCTTTCTTCTAATTTGATTCAAGAATTTCTTCCCTTTATTCTAATATGGCTCAAGAATTTATCAAATAACAATTTCCTTGAAACCTATGGTGGAAGGTGCTTTTTCCACATTTTCTGGCTTATATTCTAATTCTATGCTGTTGAAACATGGCTGAGATTGTTTGTTATTTTTAATTTTTCTGTGTTGTATAGTATCTATATTTTTATTCCAGATTATGTACTTATTCTAGATTCTGCACATGTTTCATCATTCTCAGACTCGACAGGATGATGTGGGAAAAGTTCGTATGAAACCTCGTGACCCTCGTCGTATCCTCCATGGTAATTCTCTTCAGAAGGTTGGCAGCTTGGGAAATGAGCAGTTAAAAGGTGTTGTACCAACTGCTCCAAACACAGAAGGAAGTGGGGATATACCAAATGGCCATAAGCAAGAAGGCCAGGGAGATTTAAGATTAGCTTCTTCACAACCATTACTACCTGACATTGGTAGGCAGTTCACTAACAATCTAAAAA
ATATTGCTGATATCATGTCTGTTCCCTCGCCACCAACTTCTTCACCGAATTCATCTGCAAAGCCAGTTAAATTGGATGGGATGGATACTAATGCTGTTGGGTCCAGCTCTGTTGACAGTAAAATTGTGACAACTGCTACCCAACCGGTAGATATGGTTGGCCCCTCTCGTTCACAGGGTGCATGGGGAGATCTTGAGCATCTATTCGAGGGTTACGATGACAAGCAAAAGGCTGCCATTCAGCGTGAGAGGGCAAGACGGATAGACGAACAGAAAAAAATGTTTGCTGCACGCAAACTCTGCCTTGTTTTGGATTTAGATCACACACTTCTGAATTCAGCTAAGGTTATTTTTGTTTGTATAAGTAAAACTGATTTCACCATCAACTTGGATTTACAATCCCATAACTTGGTTTGATCTTGTTTTAGTTCGTAGAAGTAGATCCAGTGCATGATGAAATTTTGAGAAAGAAGGAGGAACAGGATCGTGAGAAGGCGCAAAGACATCTTTTCCGATTTCCTCATATGGGAATGTGGACCAAACTACGGCCTGGAGTCTGGAACTTTTTGGAAAAGGTAGGTTGAAATATTTGTTTTCAATACAACCAATCCAAATAGTAGAAGACTGCTTTTATTTTCTTTTCTTTTGAAGTTGTATTGGATGTTGAATGCATATTTTGGTGCTTGTGTTCTCTTTTGCATTTTAGGCCAGCGAGCTCTATGAACTTCATTTGTACACAATGGGGAACAAGTTATACGCAACAGAGATGGCAAAGGTTCTTGATCCAAAAGGGGCTCTGTTTGCCGGGCGAGTTATTTCTCGCGGTGATGATGGAGACCCTTTGGACGGTGATGAGAGGGTACCCAAGAGTAAGGATCTGGAGGGTGTTTTGGGCATGGAGTCTGCTGTTGTTATAATTGATGATTCCATCAGGGTGTGGCCACATAATAAAATGAACTTGATTGTTGTCGAAAGGCAAGCATGCTCTTAACTAAAAGATGTTTTTTAGCGTTTCATAACCTCACAAATTTGAGAGCCCGTGACACTGCTTGTGCTTGACTTCCAATGCAGGTATACTTACTTTCCTTGTAGCCGGCGCCAGTTTGGGCTTCTGGGTCCTTCCCTTCTAGAGATTGACCATGACGAGAGACCTGAAGATGGTACTTTGGCATCTTCACTGGGGGTAATTCTGTGGCTTCACTTGTTCACCTTATTTCATTGAATTAATTTATTTTACTCTTAATCATCTTAAATTTTTGTCCATTAGGTTATCCAAAGAATCCATCAAACCTTTTTCTCTCATCCTTTATTAGATGAAATAGATGTTAGAAATATCTTGGCCTCCGAGCAACAAAAGATTTTGGCTGGTTGCCGTATAGTGTTTAGCAGGGTTTTTCCGGTTGGTGAGGCAAATCCTCACCTGCATCCGTTGTGGCAGACAGCTGAACAGTTTGGTGCGGTGTGCACCAACCAGATTGATGAGCAGGTTACCCACGTCGTTGCAAACTCTCTTGGGACTGATAAGGTAATGTACCGCGGTCTGATGTTTGTATTTTAGGAATTCATTGTTCCAGTTTTTTATGAGCAATTTTTTGTTGAAATCTCCTTGCAACTGACCTGTTAAATATAAAAGTTTGCAGGCAGTTTGAGGCAAAAGTATCATTGGACTCGGGGGGGTGTATGAATATAACATATATATCTGATTTAAATGTGAACCTATGCTTTGTGTTGATGTTTCTCGTTGCCATTGTTTTGAACAGGTGAATTGGGCACTCTCCACTGGCAGATTCGTGGTCCATCCAGGGTGGTGAGATTCTTGTTCCAAGTTTCACATTGTCGTATATGTTTGAACTTGCAGTCATATATATTTATTGGGTTTAACCTTTCTGAGTATGCATGCACAGGGTGGAAGCATCGGCTCTGCTTTATCGGAGGGCAAACGAGCAGGACTTTGCCATTAAACCATAACCAACCCACCCACCAGCAACCTTAACACACCCT
TTGCTAAATTAAAAGTGAAAGATTATAACACAAGATGATATAACACGGAGAAGAAATTCAACACGGTGCCTTATATTTGGGTGGTAGTGTTAGGCAAAGCCTAAGACTACTAAGAGAGAGACACATATAACTCCCTATTTAACACGCCCTACAACTACAAGAATGTGTCGAAAGAGCCCAGAAAGAGAAAGATCACACCGAAATTATGTTGCCCCCCCTCCCCCCATTGACTGTGGAGAGGGAGTTTGGTTAGATTTGAAATAATCAACGGTCAGGTCAATGGAGGGAGGAGGAGGAGAGAGTTACAAAAGAGGAGTGAGAGACGAACGGGTAAATTGTGACTGGTGGGGCGGAGCTTAAACCCAATTCAACCCAATTCAACGTCTGAAACTTTTCATTTATTTTCTTTTTCATTTTCTTCTGCTTTTGCTTATTAAATTTTTGGTTGGAGTTGCATGAGAGAGAACGAACACTTTGGTTTTTACTCTCGAATCAGTCCTGCCCCATTTGTTGTTGCCTAAAAATGATAGAGAATGAAATGAGGGAAGGGCAAATGCAAAGGATGAGCCTGGAGAATGTCCATGATGTGGCAGAATTGCGTCTGTTTTGAGCTAAAATACAAGTCAATGTCAAAGGAGGAATAAGGTGCTCAGAGGATGTGCTATGATATGATTTGGAGACTGTGGGGTTGAGTGAGGCGTGAGGACAACATTGAAAAGGCTTCGGGTCACTTGATAAATGACATCAACAATCCTTGCAAATGGACCAATTTTCTGATTCATTTTCAAACCTCTTTTTTTTCCTTAAACAAGAAATAAATAAATAAATAAATAAAATTGGC

mRNA sequence

CCTCACTCACCGCCACTCTTTTCTTTATTTTTCTTTTTCTCGTTTTTATTTTATTTGATTTGATTTTCTCTTTTTTCCCTTTTTCTTTTCCTATTACTATACTATCTTCCACTTCCCTTCCGCCTCGATTCCCCGTCTCCTTATAATTCATTCATACCTCAATTCAATCTCCTTTTCAAATTCTTTGTGGATTTCCAGATCTCACATCAATTTTTCTTCCCCAACTCGCTTTTCTTCCTTTAATTCAATCAACACTGTTTGCTTTTTTTTTTAATCCATCCCAATACGTTTCTCTTTGTTCTACATCTACTTGGATTGGATTCTCTTCCACCGAAGTAAGAAGAAAGAAAAACTCCACAATTCCAAACCAAAGTTGTTTAATTTAATTTTTTAATGTTCCAGCGTAATTCGCATCAAGATCGATTGGCTTTTTCTCCCTTCTTGCTTCTTCCAGATTTTCTAACAACTCCTAATACTATTTATGGTAGTTTTCGGATCTGATTCCAACAATTTGTCGAAATTTTCCAAGAAACCCTAGGGTGGGGTTAATGGGGAAAGACGAAAGTGTTAAAATTCAAGACGTTGAAGAAGGGGAAATCTCTGATACAGCTTCAGTTGAGGAGATCACTGAGGAAGATTTTAATAAGCTTGAAACTGGTCCTAAGGTGTTGCCATCCAAAGATTCCAATCGGGACACTAGAGTTTGGACCATGAGTGATTTGTATAAGAATTATCCCACCATGTGTCGTGGTTATGCTTCTGGTTTGTACAATTTAGCTTGGGCGCAGGCAGTCCAGAATAAGCCTCTCAATGATATTTTTGTTATGGAGGCCGACCCCGATGAGAAATCGAAGCGGTCCTCTTCCTCCCCTTTTGCGAATGGCAAGGAAGATGGAAACGCTACAAAAGAAGAGGGTAAAGTCGTGATTGATGTTAGCGGCGATGACATGAATTGTGATAATGCGAATGTCGAGAAAGAGGAAGGTGAATTGGAGGAGGGCGAAATTGATATGGATACGGAGTTCGTCGAAGAAGTTGTGGACTCTAAAGCAATGTTGTCCGACTGTATGGATACTGAATGTCAAGATATTGATTTGAAAGTTAAGGATTTGGATGACCAAGTGAAATTGATTCAGAAAACATTGGATGGTGTGACAATCGACGCTGCACAGAAATCGTTTCAGGAAGTTTGCTCCCAACTGCATAGTTCTATTGAGACGTTTTTGGAATTGGTCCAGGGAAAGGTAGTCCCGAGGAAGGATGCGCTCATTCAACGATTATATGCTGCTCTTCGAATAATCAATTCTGTATTTTGTTCCATGAACCCCAATGAGAAGGAGGAGCATAAGCAACATTTATCGAGGTTGCTTTCTTATGCTAAAAATTGCAACCCTCCTCTCTTTTCTCCCGAGCAGATAAAATCGATAGAGGTCAAAATGCCATCTACTGATTCCCTCGACAATTTGCCCAGCACGAGAGCCAGTGCTAAAGAGGTCGAGATCCATATACCTAATGGGGTGAAAAATAAGGACTTTTATTCTGCATACACAAATGCTAGTCCACATTTAACTTCTTCAACTAAGTTGTCTTCCATGCCCGTTGAGGTTACGGCAAAAAATAATATAAATATCTCATCCGATGGTTTGCAATCTGGAGTATCTAATGTAAAGGGTAGAGGCCCCCTACTCCCTCTGTTAGACCTTCACAAGGATCATGATGCAGACAGTCTTCCGTCACCTACCAGAGAAGCTCCTACAATTTTTTCTGTCCAAAAATCAGGGCATATCCCTGCAAAGGTGGCCCATGCTATGGATGGATCAAGATCACATCCTTATGAAACTGATGCCCTAAAAGCTGTTTCAACCTACCAACAGAAGTTTGGTCGAAGTTCCTTTTCAATGGCTGATAGACTTCCTAGCCCAACCCCTTCAGAAGAATGTGACGGGGGTGGGGATATTGGTGGGGAGGTTTCTAGTTCTTCTATTATCAGAAGTTCAAAG
GCTTCAAATTCTTCTAAACTGGGTCAAAAGGTGTCAAATTCATCTTCGAACATATCTATAGGCCTTTTTCCTCACCTGGAAAGTTCCAACACTAAAGGGCTGAGTAATCCTTTAAATGTTGCTCCTCCAAGTTGTGTGTCTAATCCAACGGTAAAGCCTTTAGCAAAAAGTAGAGACCCTAGGCTACGAATGGTCAATTCGGATGCAAGTGCTATGGATCTTAATCCACGCACAGTGACTTCAGTGCAGAATCCTTCTGTCGTAGAATCTGCTACAACAATAAACTTGAGAAAGCAAAAGATGGATGTGGAGTCTAATCTAGATGGCCCTGAAATGAAAAGGCAGAGGATTGGATCTCAGAATCATGCAGTAGCTGCAAACGATGTGAGAGCCCCCTTTGGAAGCGGGGGCTGGTTGGAAGATACCATGTCCGCTGGACCCAGACTTCCAAGTAGGAATCAGATGGAGATTGCTGAAGCAAATGCAACAGAAAAAATAAATGTTTCAAACAATTCTGGTGCTGGAAATGAGTGTGCACCTACCATAAGTGCTACCAACGATGCTTCCTTACCCTCGCTGTTGAAAGATATTGTTGTGAACCCAACCATGCTCCTTAGTTTACTTAAAATGAGCCAACAGCAGCAATTAGCTGCAGAATTGAAATTAAAGTCAAGTGAACCTGAAAAAAATGTAATTTGTCCAACGGCCGTGAATCCTTGTCTAGGATCTAGTCCACTAGTAAATGCTCCTCCTGTGACCTCAGGAATTTTGCAGCAATCAGCAGGAACACTAAGTGTACCTTCACCACCGGTGGTTACTGTGACTCGACAGGATGATGTGGGAAAAGTTCGTATGAAACCTCGTGACCCTCGTCGTATCCTCCATGGTAATTCTCTTCAGAAGGTTGGCAGCTTGGGAAATGAGCAGTTAAAAGGTGTTGTACCAACTGCTCCAAACACAGAAGGAAGTGGGGATATACCAAATGGCCATAAGCAAGAAGGCCAGGGAGATTTAAGATTAGCTTCTTCACAACCATTACTACCTGACATTGGTAGGCAGTTCACTAACAATCTAAAAAATATTGCTGATATCATGTCTGTTCCCTCGCCACCAACTTCTTCACCGAATTCATCTGCAAAGCCAGTTAAATTGGATGGGATGGATACTAATGCTGTTGGGTCCAGCTCTGTTGACAGTAAAATTGTGACAACTGCTACCCAACCGGTAGATATGGTTGGCCCCTCTCGTTCACAGGGTGCATGGGGAGATCTTGAGCATCTATTCGAGGGTTACGATGACAAGCAAAAGGCTGCCATTCAGCGTGAGAGGGCAAGACGGATAGACGAACAGAAAAAAATGTTTGCTGCACGCAAACTCTGCCTTGTTTTGGATTTAGATCACACACTTCTGAATTCAGCTAAGTTCGTAGAAGTAGATCCAGTGCATGATGAAATTTTGAGAAAGAAGGAGGAACAGGATCGTGAGAAGGCGCAAAGACATCTTTTCCGATTTCCTCATATGGGAATGTGGACCAAACTACGGCCTGGAGTCTGGAACTTTTTGGAAAAGGCCAGCGAGCTCTATGAACTTCATTTGTACACAATGGGGAACAAGTTATACGCAACAGAGATGGCAAAGGTTCTTGATCCAAAAGGGGCTCTGTTTGCCGGGCGAGTTATTTCTCGCGGTGATGATGGAGACCCTTTGGACGGTGATGAGAGGGTACCCAAGAGTAAGGATCTGGAGGGTGTTTTGGGCATGGAGTCTGCTGTTGTTATAATTGATGATTCCATCAGGGTGTGGCCACATAATAAAATGAACTTGATTGTTGTCGAAAGGTATACTTACTTTCCTTGTAGCCGGCGCCAGTTTGGGCTTCTGGGTCCTTCCCTTCTAGAGATTGACCATGACGAGAGACCTGAAGATGGTACTTTGGCATCTTCACTGGGGGTAATTCTGTGGCTTCACTTGTTCACCTTATTTCATTGAATTAATTTATT
TTACTCTTAATCATCTTAAATTTTTGTCCATTAGGTTATCCAAAGAATCCATCAAACCTTTTTCTCTCATCCTTTATTAGATGAAATAGATGTTAGAAATATCTTGGCCTCCGAGCAACAAAAGATTTTGGCTGGTTGCCGTATAGTGTTTAGCAGGGTTTTTCCGGTTGGTGAGGCAAATCCTCACCTGCATCCGTTGTGGCAGACAGCTGAACAGTTTGGTGCGGTGTGCACCAACCAGATTGATGAGCAGGTTACCCACGTCGTTGCAAACTCTCTTGGGACTGATAAGGTGAATTGGGCACTCTCCACTGGCAGATTCGTGGTCCATCCAGGGTGGGTGGAAGCATCGGCTCTGCTTTATCGGAGGGCAAACGAGCAGGACTTTGCCATTAAACCATAACCAACCCACCCACCAGCAACCTTAACACACCCTTTGCTAAATTAAAAGTGAAAGATTATAACACAAGATGATATAACACGGAGAAGAAATTCAACACGGTGCCTTATATTTGGGTGGTAGTGTTAGGCAAAGCCTAAGACTACTAAGAGAGAGACACATATAACTCCCTATTTAACACGCCCTACAACTACAAGAATGTGTCGAAAGAGCCCAGAAAGAGAAAGATCACACCGAAATTATGTTGCCCCCCCTCCCCCCATTGACTGTGGAGAGGGAGTTTGGTTAGATTTGAAATAATCAACGGTCAGGTCAATGGAGGGAGGAGGAGGAGAGAGTTACAAAAGAGGAGTGAGAGACGAACGGGTAAATTGTGACTGGTGGGGCGGAGCTTAAACCCAATTCAACCCAATTCAACGTCTGAAACTTTTCATTTATTTTCTTTTTCATTTTCTTCTGCTTTTGCTTATTAAATTTTTGGTTGGAGTTGCATGAGAGAGAACGAACACTTTGGTTTTTACTCTCGAATCAGTCCTGCCCCATTTGTTGTTGCCTAAAAATGATAGAGAATGAAATGAGGGAAGGGCAAATGCAAAGGATGAGCCTGGAGAATGTCCATGATGTGGCAGAATTGCGTCTGTTTTGAGCTAAAATACAAGTCAATGTCAAAGGAGGAATAAGGTGCTCAGAGGATGTGCTATGATATGATTTGGAGACTGTGGGGTTGAGTGAGGCGTGAGGACAACATTGAAAAGGCTTCGGGTCACTTGATAAATGACATCAACAATCCTTGCAAATGGACCAATTTTCTGATTCATTTTCAAACCTCTTTTTTTTCCTTAAACAAGAAATAAATAAATAAATAAATAAAATTGGC

Coding sequence (CDS)

ATGGGGAAAGACGAAAGTGTTAAAATTCAAGACGTTGAAGAAGGGGAAATCTCTGATACAGCTTCAGTTGAGGAGATCACTGAGGAAGATTTTAATAAGCTTGAAACTGGTCCTAAGGTGTTGCCATCCAAAGATTCCAATCGGGACACTAGAGTTTGGACCATGAGTGATTTGTATAAGAATTATCCCACCATGTGTCGTGGTTATGCTTCTGGTTTGTACAATTTAGCTTGGGCGCAGGCAGTCCAGAATAAGCCTCTCAATGATATTTTTGTTATGGAGGCCGACCCCGATGAGAAATCGAAGCGGTCCTCTTCCTCCCCTTTTGCGAATGGCAAGGAAGATGGAAACGCTACAAAAGAAGAGGGTAAAGTCGTGATTGATGTTAGCGGCGATGACATGAATTGTGATAATGCGAATGTCGAGAAAGAGGAAGGTGAATTGGAGGAGGGCGAAATTGATATGGATACGGAGTTCGTCGAAGAAGTTGTGGACTCTAAAGCAATGTTGTCCGACTGTATGGATACTGAATGTCAAGATATTGATTTGAAAGTTAAGGATTTGGATGACCAAGTGAAATTGATTCAGAAAACATTGGATGGTGTGACAATCGACGCTGCACAGAAATCGTTTCAGGAAGTTTGCTCCCAACTGCATAGTTCTATTGAGACGTTTTTGGAATTGGTCCAGGGAAAGGTAGTCCCGAGGAAGGATGCGCTCATTCAACGATTATATGCTGCTCTTCGAATAATCAATTCTGTATTTTGTTCCATGAACCCCAATGAGAAGGAGGAGCATAAGCAACATTTATCGAGGTTGCTTTCTTATGCTAAAAATTGCAACCCTCCTCTCTTTTCTCCCGAGCAGATAAAATCGATAGAGGTCAAAATGCCATCTACTGATTCCCTCGACAATTTGCCCAGCACGAGAGCCAGTGCTAAAGAGGTCGAGATCCATATACCTAATGGGGTGAAAAATAAGGACTTTTATTCTGCATACACAAATGCTAGTCCACATTTAACTTCTTCAACTAAGTTGTCTTCCATGCCCGTTGAGGTTACGGCAAAAAATAATATAAATATCTCATCCGATGGTTTGCAATCTGGAGTATCTAATGTAAAGGGTAGAGGCCCCCTACTCCCTCTGTTAGACCTTCACAAGGATCATGATGCAGACAGTCTTCCGTCACCTACCAGAGAAGCTCCTACAATTTTTTCTGTCCAAAAATCAGGGCATATCCCTGCAAAGGTGGCCCATGCTATGGATGGATCAAGATCACATCCTTATGAAACTGATGCCCTAAAAGCTGTTTCAACCTACCAACAGAAGTTTGGTCGAAGTTCCTTTTCAATGGCTGATAGACTTCCTAGCCCAACCCCTTCAGAAGAATGTGACGGGGGTGGGGATATTGGTGGGGAGGTTTCTAGTTCTTCTATTATCAGAAGTTCAAAGGCTTCAAATTCTTCTAAACTGGGTCAAAAGGTGTCAAATTCATCTTCGAACATATCTATAGGCCTTTTTCCTCACCTGGAAAGTTCCAACACTAAAGGGCTGAGTAATCCTTTAAATGTTGCTCCTCCAAGTTGTGTGTCTAATCCAACGGTAAAGCCTTTAGCAAAAAGTAGAGACCCTAGGCTACGAATGGTCAATTCGGATGCAAGTGCTATGGATCTTAATCCACGCACAGTGACTTCAGTGCAGAATCCTTCTGTCGTAGAATCTGCTACAACAATAAACTTGAGAAAGCAAAAGATGGATGTGGAGTCTAATCTAGATGGCCCTGAAATGAAAAGGCAGAGGATTGGATCTCAGAATCATGCAGTAGCTGCAAACGATGTGAGAGCCCCCTTTGGAAGCGGGGGCTGGTTGGAAGATACCATGTCCGCTGGACCCAGACTTCCAAGTAGGAATCAGATGGAGATTGCTGAAGCAAATGCAACAGAAAAAATAAATGTTTCAAACAATTCTGGTGCTGGAAATGAGTGTGCACCTACCATAAG
TGCTACCAACGATGCTTCCTTACCCTCGCTGTTGAAAGATATTGTTGTGAACCCAACCATGCTCCTTAGTTTACTTAAAATGAGCCAACAGCAGCAATTAGCTGCAGAATTGAAATTAAAGTCAAGTGAACCTGAAAAAAATGTAATTTGTCCAACGGCCGTGAATCCTTGTCTAGGATCTAGTCCACTAGTAAATGCTCCTCCTGTGACCTCAGGAATTTTGCAGCAATCAGCAGGAACACTAAGTGTACCTTCACCACCGGTGGTTACTGTGACTCGACAGGATGATGTGGGAAAAGTTCGTATGAAACCTCGTGACCCTCGTCGTATCCTCCATGGTAATTCTCTTCAGAAGGTTGGCAGCTTGGGAAATGAGCAGTTAAAAGGTGTTGTACCAACTGCTCCAAACACAGAAGGAAGTGGGGATATACCAAATGGCCATAAGCAAGAAGGCCAGGGAGATTTAAGATTAGCTTCTTCACAACCATTACTACCTGACATTGGTAGGCAGTTCACTAACAATCTAAAAAATATTGCTGATATCATGTCTGTTCCCTCGCCACCAACTTCTTCACCGAATTCATCTGCAAAGCCAGTTAAATTGGATGGGATGGATACTAATGCTGTTGGGTCCAGCTCTGTTGACAGTAAAATTGTGACAACTGCTACCCAACCGGTAGATATGGTTGGCCCCTCTCGTTCACAGGGTGCATGGGGAGATCTTGAGCATCTATTCGAGGGTTACGATGACAAGCAAAAGGCTGCCATTCAGCGTGAGAGGGCAAGACGGATAGACGAACAGAAAAAAATGTTTGCTGCACGCAAACTCTGCCTTGTTTTGGATTTAGATCACACACTTCTGAATTCAGCTAAGTTCGTAGAAGTAGATCCAGTGCATGATGAAATTTTGAGAAAGAAGGAGGAACAGGATCGTGAGAAGGCGCAAAGACATCTTTTCCGATTTCCTCATATGGGAATGTGGACCAAACTACGGCCTGGAGTCTGGAACTTTTTGGAAAAGGCCAGCGAGCTCTATGAACTTCATTTGTACACAATGGGGAACAAGTTATACGCAACAGAGATGGCAAAGGTTCTTGATCCAAAAGGGGCTCTGTTTGCCGGGCGAGTTATTTCTCGCGGTGATGATGGAGACCCTTTGGACGGTGATGAGAGGGTACCCAAGAGTAAGGATCTGGAGGGTGTTTTGGGCATGGAGTCTGCTGTTGTTATAATTGATGATTCCATCAGGGTGTGGCCACATAATAAAATGAACTTGATTGTTGTCGAAAGGTATACTTACTTTCCTTGTAGCCGGCGCCAGTTTGGGCTTCTGGGTCCTTCCCTTCTAGAGATTGACCATGACGAGAGACCTGAAGATGGTACTTTGGCATCTTCACTGGGGGTAATTCTGTGGCTTCACTTGTTCACCTTATTTCATTGA

Protein sequence

MGKDESVKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLYKNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDGNATKEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTECQDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRKDALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKMPSTDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKLSSMPVEVTAKNNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPAKVAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSSSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTVKPLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPEMKRQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINVSNNSGAGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNVICPTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPRRILHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGDIPNGHKQEGQGDLRLASSQPLLPDIGRQFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVDMVGPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVILWLHLFTLFH
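
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below illustrates this on the first 21 nucleotides and the last 12 nucleotides of the CDS, assuming the standard genetic code. The `translate` helper is written here for illustration only and is not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGGAAAGACGAAAGTGTT"  # first seven codons of the CDS above
cds_end = "TTATTTCATTGA"             # last three codons plus the TGA stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MGKDESV and LFH, matching the first and last residues of the protein sequence listed above.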
Homology
BLAST of Tan0002421 vs. ExPASy Swiss-Prot
Match: Q8LL04 (RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana OX=3702 GN=CPL3 PE=1 SV=2)

HSP 1 Score: 768.5 bits (1983), Expect = 1.1e-220
Identity = 528/1195 (44.18%), Positives = 688/1195 (57.57%), Query Frame = 0

Query: 1    MGKDESVKIQ-DVEEGEISDTASVE-EITEEDFNK-----------LETGPKVLPSKDSN 60
            MG DE++ +  DVEEGEI D+ + E E+  +               +  G +      SN
Sbjct: 15   MGNDENLMVMVDVEEGEIPDSVNTEIEVKHKSTTTTADVGGDVDVGVVAGGRGGGGGGSN 74

Query: 61   RDTRVWTMSDLYKNYPTMCRGYA-SGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSS 120
             ++RVWTM +L   YP   R YA SGL NLAWA+AVQNKP N+  VM+ +P         
Sbjct: 75   GNSRVWTMEELISQYPAY-RPYANSGLSNLAWARAVQNKPFNEGLVMDYEP--------- 134

Query: 121  SPFANGKEDGNATKEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDM-----DTEFVE 180
                         +E  K+VI+ S D         EKEEGELEEGEID+     D   VE
Sbjct: 135  -------------RESDKIVIEDSDD---------EKEEGELEEGEIDLVDNASDDNLVE 194

Query: 181  EVVDSKAMLSDCMDTECQDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSS 240
            +  +S  ++S   D    D  LK +DL+ +VKLI+  L+  ++  AQ  F+ VCS++  +
Sbjct: 195  KDTESVVLIS--ADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGA 254

Query: 241  IETFLELV-QGKVVPRKDALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNC 300
            +E+  ELV      P++D L+Q  +A+L+ IN VFCSMN   KE +K+ +SRLL+   + 
Sbjct: 255  LESLRELVSDNDDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDH 314

Query: 301  NPPLFSPEQIKSIEVKMPSTDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHL 360
                 S  Q   IE  M    S   +     ++ E  ++      N D + A        
Sbjct: 315  FSQFLSFNQKNEIET-MNQDLSRSAIAVFAGTSSEENVNQMTQPSNGDSFLAK------- 374

Query: 361  TSSTKLSSMPVEVTAKNNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTRE 420
                                ++S+    G + ++ R P+LPLLDLHKDHDADSLPSPTRE
Sbjct: 375  -------------------KLTSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRE 434

Query: 421  APTIFSVQ------KSGHIPAKVAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADR 480
                  V       + G    + +   +G++ + YE+DA KAVSTYQQKFG +S    D 
Sbjct: 435  TTPSLPVNGRHTMVRPGFPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDD 494

Query: 481  LPSPTPS-EECDGGGDIGGEVSSSSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESS 540
            LPSPTPS E  DG GD+GGEV SSS+++SS   +    GQ V    SN +    P   S 
Sbjct: 495  LPSPTPSGEPNDGNGDVGGEV-SSSVVKSSNPGSHLIYGQDVP-LPSNFNSRSMPVANSV 554

Query: 541  NTKGLSNPLNVAPPSC--VSNPTVKPLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSV 600
            ++    + L++   S    S+ TVKP AKSRDPRLR+   DA+ + +   +    +N S 
Sbjct: 555  SSTVPPHHLSIHAISAPTASDQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSK 614

Query: 601  VE-SATTINLRKQKMDVESNLDGPEMKRQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAG 660
            VE SA  +N RKQK   E  +DGP  KRQ+        +  D     G+GGWLEDT S+G
Sbjct: 615  VELSADLVNPRKQKAADEFLIDGPAWKRQK--------SDTDAPKAAGTGGWLEDTESSG 674

Query: 661  PRLPSRNQMEIAEANATEKI-NVSNNSGAGNECAPTISATNDASLPSLLKDIVVNPTMLL 720
              L   ++  + E   T    +V   S          ++T+ ASL SLLKDI VNPTMLL
Sbjct: 675  -LLKLESKPRLIENGVTSMTSSVMPTSAVSVSQKVRTASTDTASLQSLLKDIAVNPTMLL 734

Query: 721  SLLKMSQQQQLAAELKLKSSEPEKNVICP-TAVNP------CLGSSPLVNAPPVTSGILQ 780
            +LLKM ++Q++  +   K  +P +    P ++V P       + +S  + A  + SG+LQ
Sbjct: 735  NLLKMGERQKVPEKAIQKPMDPRRAAQLPGSSVQPGVSTPLSIPASNALAANSLNSGVLQ 794

Query: 781  QSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPRRILHGNSLQKVGSLGNEQLKGVVPT-- 840
             S+      + P        + G +RMKPRDPRRILHG++LQ+  S   +Q K   P+  
Sbjct: 795  DSS-----QNAPAA------ESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTL 854

Query: 841  --------APNTEGSGDIPNGHKQEGQGDLRLASSQPLL----PDIGRQFTNNLKNIADI 900
                    A + E    +         G  ++  S  LL    PD   QFT NLK+IAD 
Sbjct: 855  GTLTMKGKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIAD- 914

Query: 901  MSVPSPPTSSPNSSAKPVKLD---GMDTNAVGSSSVDSKIVTTATQPVDMVGPSRSQGAW 960
            M V S    +P +S   V+L     +  N    ++ D  +  +A       GP+RS  +W
Sbjct: 915  MVVVSQQLGNPPASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRSMNSW 974

Query: 961  GDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPV 1020
            GD+EHLFEGYDD Q+ AIQRER RR++EQ KMFA++KL LVLD+DHTLLNSAKF EV+  
Sbjct: 975  GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 1034

Query: 1021 HDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYA 1080
            H+EILRKKEEQDREK  RHLFRF HMGMWTKLRPG+WNFLEKAS+LYELHLYTMGNKLYA
Sbjct: 1035 HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1094

Query: 1081 TEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVVIIDDSIRVW 1140
            TEMAK+LDPKG LF GRVIS+GDDGDPLDGDERVPKSKDLEGV+GMES+VVIIDDS+RVW
Sbjct: 1095 TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1125
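
The identity and positives percentages in each HSP header are ratios over the alignment length: identities count exactly matching residues, while positives also include substitutions that score above zero in the substitution matrix (shown as "+" in the alignment midline). A minimal sketch of the arithmetic, using the counts reported for the Q8LL04 hit; the `percent` helper is hypothetical.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

identity = percent(528, 1195)   # identical residues / alignment length
positives = percent(688, 1195)  # identical + positively scoring substitutions
print(identity, positives)
```

This reproduces the 44.18% and 57.57% figures given for that hit.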

BLAST of Tan0002421 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 4.4e-36
Identity = 100/230 (43.48%), Positives = 133/230 (57.83%), Query Frame = 0

Query: 915  YDDKQKAAIQRERARRIDEQKKMF-AARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKK 974
            Y  K+    + E +R  D   +     RKL LVLDLDHTLLN+    ++ P  +E L+  
Sbjct: 94   YIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKP-EEEYLKSH 153

Query: 975  EEQDREKAQ---RHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAK 1034
                ++        LF    M M TKLRP V +FL++ASE++ +++YTMG++ YA +MAK
Sbjct: 154  THSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAK 213

Query: 1035 VLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKM 1094
            +LDPKG  F  RVISR DDG        V   K L+ VLG ESAV+I+DD+   WP +K 
Sbjct: 214  LLDPKGEYFGDRVISR-DDG-------TVRHEKSLDVVLGQESAVLILDDTENAWPKHKD 273

Query: 1095 NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVILWLH 1141
            NLIV+ERY +F  S RQF     SL E+  DE   DG LA+ L V+   H
Sbjct: 274  NLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEPDGALATVLKVLKQAH 314

BLAST of Tan0002421 vs. ExPASy Swiss-Prot
Match: F4JCB2 (RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana OX=3702 GN=CPL5 PE=1 SV=2)

HSP 1 Score: 107.8 bits (268), Expect = 8.0e-22
Identity = 65/200 (32.50%), Positives = 107/200 (53.50%), Query Frame = 0

Query: 941  RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPG 1000
            +KL LVLDLDHTLL++     +      ++ +     R+   +       M   TKLRP 
Sbjct: 384  KKLHLVLDLDHTLLHTVMVPSLSQAEKYLIEEAGSATRDDLWKIKAVGDPMEFLTKLRPF 443

Query: 1001 VWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVP 1060
            + +FL++A+E + +++YT G+++YA ++ +++DPK   F  RVI++ +           P
Sbjct: 444  LRDFLKEANEFFTMYVYTKGSRVYAKQVLELIDPKKLYFGDRVITKTES----------P 503

Query: 1061 KSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDH 1120
              K L+ VL  E  VVI+DD+  VWP +K NL+ + +Y+YF    R  G       E   
Sbjct: 504  HMKTLDFVLAEERGVVIVDDTRNVWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKT 563

Query: 1121 DERPEDGTLASSLGVILWLH 1141
            DE   +G LA+ L ++  +H
Sbjct: 564  DESESEGGLANVLKLLKEVH 569

BLAST of Tan0002421 vs. ExPASy Swiss-Prot
Match: Q9P376 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=fcp1 PE=1 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 7.0e-18
Identity = 73/211 (34.60%), Positives = 105/211 (49.76%), Query Frame = 0

Query: 912  FEGYDDKQKA-----------AIQRERARRIDEQ--KKMFAARKLCLVLDLDHTLLNSAK 971
            + GY D  +A            +  E A R++ +  K++   ++L L++DLD T++++  
Sbjct: 121  YMGYSDMARANISMTHNTGDLTVSLEEASRLESENVKRLRQEKRLSLIVDLDQTIIHAT- 180

Query: 972  FVEVDPVHDEILRKKEEQD----REKAQRHLFRFPH---MGMWTKLRPGVWNFLEKASEL 1031
               VDP   E +      +    R+    +L   P       + K RPG+  FL+K SEL
Sbjct: 181  ---VDPTVGEWMSDPGNVNYDVLRDVRSFNLQEGPSGYTSCYYIKFRPGLAQFLQKISEL 240

Query: 1032 YELHLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGM 1091
            YELH+YTMG K YA E+AK++DP G LF  RV+SR D G            K L  +   
Sbjct: 241  YELHIYTMGTKAYAKEVAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLRRLFPC 300

Query: 1092 E-SAVVIIDDSIRVWPHNKMNLIVVERYTYF 1102
            + S VV+IDD   VW  N  NLI V  Y +F
Sbjct: 301  DTSMVVVIDDRGDVWDWNP-NLIKVVPYEFF 318

BLAST of Tan0002421 vs. ExPASy Swiss-Prot
Match: Q7TSG2 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Mus musculus OX=10090 GN=Ctdp1 PE=1 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 1.0e-16
Identity = 63/188 (33.51%), Positives = 104/188 (55.32%), Query Frame = 0

Query: 923  IQRERARRI--DEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREK 982
            +  E+A ++  ++Q+++   RKL L++DLD TL+++                 E+   + 
Sbjct: 161  VSSEQAEKLGREDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQHCPQM 220

Query: 983  AQRHLFRFPHMG-----MWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPK 1042
            + + +F F  +G     + T+LRP   +FLEK ++LYELH++T G++LYA  +A  LDP+
Sbjct: 221  SNKGIFHF-QLGRGEPMLHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPE 280

Query: 1043 GALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGM-ESAVVIIDDSIRVWPHNKMNLIV 1102
              LF+ R++SR +  DP        K+ +L  +    +S V IIDD   VW     NLI 
Sbjct: 281  KKLFSHRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKF-APNLIT 324

BLAST of Tan0002421 vs. NCBI nr
Match: XP_022960085.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita moschata])

HSP 1 Score: 1930.6 bits (5000), Expect = 0.0e+00
Identity = 1002/1144 (87.59%), Positives = 1057/1144 (92.40%), Query Frame = 0

Query: 1    MGKDES-VKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLY 60
            MGK  + VK  DVEEGEISDT SVEEITEEDFNKLET PK+LPSK SNR+T VWTMSDLY
Sbjct: 1    MGKHTNCVKTPDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60

Query: 61   KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDGNAT 120
             NYPTMCRGYA GLYNLAWA+AVQNKPLN+IF+M+ADPD+KS RSSSSPF N KE GN T
Sbjct: 61   NNYPTMCRGYAPGLYNLAWAKAVQNKPLNEIFLMDADPDDKSNRSSSSPFRNAKEHGNGT 120

Query: 121  KEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTECQ 180
            KEE K++ID++GDDMN DNA+VEKEEGELEEGEIDMDTEFVEEVVDSK MLSD +DT+CQ
Sbjct: 121  KEEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSDSLDTDCQ 180

Query: 181  DIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRKDA 240
            +IDLK K+LDDQ+KLI KTLDGVTIDAAQKSFQEVCSQL SSIETFLELVQGKVVPRKD 
Sbjct: 181  EIDLKNKELDDQLKLIHKTLDGVTIDAAQKSFQEVCSQLLSSIETFLELVQGKVVPRKDV 240

Query: 241  LIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKMPS 300
            LIQRLYAALRIINSVFCSMNP EKEE+KQHLSRLLS+ KNCNPPLFSPEQIKS+EVKMPS
Sbjct: 241  LIQRLYAALRIINSVFCSMNPKEKEEYKQHLSRLLSFVKNCNPPLFSPEQIKSVEVKMPS 300

Query: 301  TDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTAKN 360
            TDSLD  P  RASAK+VEIHIPNGVKNKDFYSAY  A+PHLTSSTKL   SMPV VT KN
Sbjct: 301  TDSLDQFPDMRASAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTVKN 360

Query: 361  NINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPAKV 420
            ++N+SSD L SGV NVKGRGPLLPLLDLHKDHD DSLPSPTREAPT+FSVQKSGHIP KV
Sbjct: 361  SLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPVKV 420

Query: 421  AHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480
            AHAMDGSR HPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS
Sbjct: 421  AHAMDGSRVHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480

Query: 481  SIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTVKP 540
            SI+RSSKASNSSKL Q VS S+S+IS GLFP+LESS+TKGL +P NVAPPSCVSNP  KP
Sbjct: 481  SILRSSKASNSSKLAQTVSQSASSISTGLFPNLESSSTKGLISPSNVAPPSCVSNPIAKP 540

Query: 541  LAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPEMK 600
            LAKSRDPRLRMVNS+ASAMDLNPRT+TSVQ+PSVVESA T+NLRKQKMDVE N+D PEMK
Sbjct: 541  LAKSRDPRLRMVNSEASAMDLNPRTMTSVQSPSVVESAVTVNLRKQKMDVEPNIDAPEMK 600

Query: 601  RQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINVSNNSG 660
            RQRIGSQNHA +A+D+RA  GSGGWLEDTMSA PRL SRNQMEIAEANATEK NV+NNSG
Sbjct: 601  RQRIGSQNHAFSASDLRAGSGSGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVTNNSG 660

Query: 661  AGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNVIC 720
            AGN C PTISA+ +ASLPSLLKDIVVNPTMLLSLLKM+QQ+Q+AAELKLKSSEPEKN IC
Sbjct: 661  AGNSCGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLKSSEPEKNAIC 720

Query: 721  PTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPRRI 780
            PTAVNPCLGSSPLVNAP +TSGILQQSAGT SVPSPPVVTV   DDVGKVRMKPRDPRRI
Sbjct: 721  PTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTV---DDVGKVRMKPRDPRRI 780

Query: 781  LHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGDI-PNGHKQEGQGDLRLASSQPLLPDIGR 840
            LHGNSL KVGS+GNEQLK VVP  PN EGS DI PNGHKQEGQG+LRLASSQPLLPDIGR
Sbjct: 781  LHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIVPNGHKQEGQGNLRLASSQPLLPDIGR 840

Query: 841  QFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVDMV 900
            QFTNNLKNIADIMSVPSPPTSS NSS+KPVKLD  DTNAVGSSS+DSKIV TATQ VDMV
Sbjct: 841  QFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDIKDTNAVGSSSIDSKIVATATQVVDMV 900

Query: 901  GPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960
            GPSRS GAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS
Sbjct: 901  GPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960

Query: 961  AKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020
            AKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL
Sbjct: 961  AKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020

Query: 1021 YTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVV 1080
            YTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAVV
Sbjct: 1021 YTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVV 1080

Query: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVI 1140
            IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL VI
Sbjct: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVI 1140

BLAST of Tan0002421 vs. NCBI nr
Match: KAG6592819.1 (RNA polymerase II C-terminal domain phosphatase-like 3, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1915.2 bits (4960), Expect = 0.0e+00
Identity = 996/1144 (87.06%), Positives = 1053/1144 (92.05%), Query Frame = 0

Query: 1    MGKDES-VKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLY 60
            MGK  + VK QDVEEGEISDT SVEEITEEDFNKLET PK+LPSK SNR+T VWTMSDLY
Sbjct: 1    MGKHTNCVKTQDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60

Query: 61   KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDGNAT 120
             NYPTMCRGYASGLYNLAWA+AVQNKPLN+IF+M+ADPD KS RSSSSPF N KE GN T
Sbjct: 61   NNYPTMCRGYASGLYNLAWAKAVQNKPLNEIFLMDADPDHKSNRSSSSPFRNAKEHGNGT 120

Query: 121  KEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTECQ 180
            K+E K++ID++GDDMN DNA+VEKEEGELEEGEIDMDTEFVEEVVDSK MLSD +DT+C+
Sbjct: 121  KQEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSDSLDTDCR 180

Query: 181  DIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRKDA 240
            +IDLK K+LDDQ+KLI KTLDGVTIDAAQKSFQ+VCSQL SSIETFLELVQGKVVPRKDA
Sbjct: 181  EIDLKNKELDDQLKLIHKTLDGVTIDAAQKSFQQVCSQLLSSIETFLELVQGKVVPRKDA 240

Query: 241  LIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKMPS 300
            LIQR YAALRIINSVFCSMNP EKEE+KQHLSRLLS+ KNCNPPLFSPEQIKS+EVKMPS
Sbjct: 241  LIQRCYAALRIINSVFCSMNPKEKEEYKQHLSRLLSFVKNCNPPLFSPEQIKSVEVKMPS 300

Query: 301  TDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTAKN 360
            TDSLD+ P TR SAK+VEIHIPNGVKNKDFYSAY  A+PHLTSSTKL   SMPV VT KN
Sbjct: 301  TDSLDHFPDTRDSAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTIKN 360

Query: 361  NINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPAKV 420
            N+N+SSD L SGV NVKGRGPL PLLDLHKDHD DSLPSPTREAPT+FSVQKSGHIP KV
Sbjct: 361  NLNLSSDSLLSGVPNVKGRGPLHPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPMKV 420

Query: 421  AHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480
            AH MDGSR HPYETDA+KAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS
Sbjct: 421  AHDMDGSRVHPYETDAVKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480

Query: 481  SIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTVKP 540
            SI RSSKAS+SSKL Q VSNS+S+IS GLFP+LESS TKGL +PLNVAPPS VSNP  KP
Sbjct: 481  SIFRSSKASSSSKLAQTVSNSASSISTGLFPNLESSTTKGLISPLNVAPPSSVSNPIAKP 540

Query: 541  LAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPEMK 600
            LAKSRDPRLRMV S+ASAMDLNPRT+TSVQNPSVVESA T+N+RKQKMDVE N+D PEMK
Sbjct: 541  LAKSRDPRLRMVTSEASAMDLNPRTMTSVQNPSVVESAVTVNMRKQKMDVEPNIDAPEMK 600

Query: 601  RQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINVSNNSG 660
            RQRIGSQNHA +A+D+RA  GSGGWLEDTMSA PRL SRNQMEIAEANATEK NV+NNSG
Sbjct: 601  RQRIGSQNHAFSASDLRAGSGSGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVTNNSG 660

Query: 661  AGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNVIC 720
            AGN   PTISA+ +ASLPSLLKDIVVNPTMLLSLLKM+QQ+Q+AAELKL SSEPEKN IC
Sbjct: 661  AGNLRGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLNSSEPEKNAIC 720

Query: 721  PTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPRRI 780
            PTAVNPCLGSSPLVNAP +TSGILQQSAGT SVPSPPVVTV   DDVGKVRMKPRDPRRI
Sbjct: 721  PTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTV---DDVGKVRMKPRDPRRI 780

Query: 781  LHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGD-IPNGHKQEGQGDLRLASSQPLLPDIGR 840
            LHGNSL KVGS+GNEQLK VVP  PN EGS D IPNGHKQEGQG+LRLASSQPLLPDIGR
Sbjct: 781  LHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIIPNGHKQEGQGNLRLASSQPLLPDIGR 840

Query: 841  QFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVDMV 900
            QFTNNLKNIADIMSVPSPPTSS NSS+KPVKLD  DTNAVGSSS+DSKIV TATQ VDMV
Sbjct: 841  QFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDRKDTNAVGSSSIDSKIVATATQAVDMV 900

Query: 901  GPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960
            GPSRS GAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS
Sbjct: 901  GPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960

Query: 961  AKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020
            AKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL
Sbjct: 961  AKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020

Query: 1021 YTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVV 1080
            YTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAVV
Sbjct: 1021 YTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVV 1080

Query: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVI 1140
            IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL VI
Sbjct: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVI 1140

BLAST of Tan0002421 vs. NCBI nr
Match: XP_023514332.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1907.5 bits (4940), Expect = 0.0e+00
Identity = 995/1144 (86.98%), Positives = 1050/1144 (91.78%), Query Frame = 0

Query: 1    MGKDES-VKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLY 60
            MGK  + VK QDVEEGEISDT SVEEITEEDFNKLET PK+LPSK SNR+T VWTMSDLY
Sbjct: 1    MGKHTNCVKTQDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60

Query: 61   KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDGNAT 120
             NYPTMCRGYASGLYNLAWA+AVQNKPLN+IF+M+ADPD+KS RSSSSPF N KE GN T
Sbjct: 61   NNYPTMCRGYASGLYNLAWAKAVQNKPLNEIFLMDADPDDKSNRSSSSPFRNAKEHGNGT 120

Query: 121  KEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTECQ 180
            K+E K++ID++GDDMN DNA+VEKEEGELEEGEIDMDTEFVEEVVDSK MLSD +DT+ Q
Sbjct: 121  KQEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSDSLDTDYQ 180

Query: 181  DIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRKDA 240
            +IDLK K+LDDQ+KLI KTLD VTIDAAQKSF EVCSQL SSIETFLELVQGKVVPRKDA
Sbjct: 181  EIDLKNKELDDQLKLIHKTLDAVTIDAAQKSFHEVCSQLLSSIETFLELVQGKVVPRKDA 240

Query: 241  LIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKMPS 300
            LIQRLYAALRIINSVFCSMNP EKEE K HLSRLLS+ KNCN PLFSPEQIKS+EVKMPS
Sbjct: 241  LIQRLYAALRIINSVFCSMNPKEKEECKPHLSRLLSFVKNCNTPLFSPEQIKSVEVKMPS 300

Query: 301  TDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTAKN 360
            TDSLD+ P  R SAK+VEIHIPNGVKNKDFYSAY  A+PHLTSSTKL   SMPV VT KN
Sbjct: 301  TDSLDHFPHMRDSAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTVKN 360

Query: 361  NINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPAKV 420
            N+N+SSD L SGV NVKGRGPLLPLLDLHKDHD DSLPSPTREAPT+FSVQKSGHIP KV
Sbjct: 361  NLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPVKV 420

Query: 421  AHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480
            A AMDGSR HPYETDA+KAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS
Sbjct: 421  ARAMDGSRVHPYETDAVKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480

Query: 481  SIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTVKP 540
            SI RSSKASNS KL Q VSNS+S+IS GLFP+LESS+TKGL +PLNVAPPS VSNP  KP
Sbjct: 481  SIFRSSKASNSYKLAQTVSNSASSISTGLFPNLESSSTKGLISPLNVAPPSSVSNPIAKP 540

Query: 541  LAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPEMK 600
            LAKSRDPRLRMV S+ASAMDLNPRT+TSVQNPSVVESA T+N+RKQKMDVE N+D PEMK
Sbjct: 541  LAKSRDPRLRMVTSEASAMDLNPRTMTSVQNPSVVESAVTVNMRKQKMDVEPNIDAPEMK 600

Query: 601  RQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINVSNNSG 660
            RQRIGSQNHA +A+D+RA  GSGGWLEDTMSA PRL SRNQMEIAEANATEK NV+NNSG
Sbjct: 601  RQRIGSQNHAFSASDLRAGSGSGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVTNNSG 660

Query: 661  AGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNVIC 720
            AGN   PTISA+ +ASLPSLLKDIVVNPTMLLSLLKM+QQ+Q+AAELKL SSEPEKN IC
Sbjct: 661  AGNSRGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLNSSEPEKNAIC 720

Query: 721  PTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPRRI 780
            PTAVNPCLGSSPLVNAP +TSGILQQSAGT SVPSPPVVTV   DDVGKVRMKPRDPRRI
Sbjct: 721  PTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTV---DDVGKVRMKPRDPRRI 780

Query: 781  LHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGD-IPNGHKQEGQGDLRLASSQPLLPDIGR 840
            LHGNSL KVGS+GNEQLK VVP  PN EGS D IPNGHKQEGQG+LRLASSQPLLPDIGR
Sbjct: 781  LHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIIPNGHKQEGQGNLRLASSQPLLPDIGR 840

Query: 841  QFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVDMV 900
            QFTNNLKNIADIMSVPSPPTSS NSS+KPVKLD  DTNAVGSSS+DSKIV TATQ VDMV
Sbjct: 841  QFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDRKDTNAVGSSSIDSKIVATATQAVDMV 900

Query: 901  GPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960
            GPSRS GAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS
Sbjct: 901  GPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960

Query: 961  AKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020
            AKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL
Sbjct: 961  AKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020

Query: 1021 YTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVV 1080
            YTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAVV
Sbjct: 1021 YTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVV 1080

Query: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVI 1140
            IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL VI
Sbjct: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVI 1140

BLAST of Tan0002421 vs. NCBI nr
Match: XP_023005106.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita maxima])

HSP 1 Score: 1889.8 bits (4894), Expect = 0.0e+00
Identity = 988/1146 (86.21%), Positives = 1047/1146 (91.36%), Query Frame = 0

Query: 1    MGKDES-VKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLY 60
            MGK  + +K QDVEEGEISDT SVEEITEEDFN LET PK+LPSK SNR+T VWTMSDLY
Sbjct: 1    MGKHTNCLKTQDVEEGEISDTPSVEEITEEDFNNLETVPKLLPSKHSNRETTVWTMSDLY 60

Query: 61   KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDGNAT 120
             NYPTMCRGYASGLYNLAWA+AVQNKPLN+IF+ EADPD+KS RSSSSPF N KE GN T
Sbjct: 61   NNYPTMCRGYASGLYNLAWAKAVQNKPLNEIFLTEADPDDKSHRSSSSPFRNAKEHGNGT 120

Query: 121  KEE-GKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTEC 180
             EE  K++ID++GDDMN +NA+VEKEEGELEEGEIDMDTEFVEEVVDS+ MLSD +DT+C
Sbjct: 121  IEEAAKLIIDITGDDMNTNNADVEKEEGELEEGEIDMDTEFVEEVVDSRPMLSDSLDTDC 180

Query: 181  QDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRKD 240
            Q+ID K K+LDDQ+KL+ KTLDGVTIDAAQKSFQE+CSQL SSIETFLELVQGKVVPRKD
Sbjct: 181  QEIDFKNKELDDQLKLVHKTLDGVTIDAAQKSFQEICSQLLSSIETFLELVQGKVVPRKD 240

Query: 241  ALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKMP 300
            ALIQRLYAALRIINSVFCSMNP EK+E+KQHLSRLLS+ KNCNP LFSPEQIKS+EVKMP
Sbjct: 241  ALIQRLYAALRIINSVFCSMNPKEKDEYKQHLSRLLSFVKNCNPALFSPEQIKSVEVKMP 300

Query: 301  STDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTAK 360
            STDSLD+ P  R SAK+VEIHIPNGVKNKDFYSAY  A+PHLTSSTKL   SMPV VT K
Sbjct: 301  STDSLDHFPDMRDSAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTVK 360

Query: 361  NNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPAK 420
            NN+N+SSD L SGV NVKGRGPLLPLLDLHKDHD DSLPSPTREAPT+FSVQKSGHIP K
Sbjct: 361  NNLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPVK 420

Query: 421  VAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSS 480
            VAHAMDGSR HPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSS
Sbjct: 421  VAHAMDGSRVHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSS 480

Query: 481  SSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTVK 540
            SSI RSSKASNSSKL Q VSNS+S+IS GLFP+LESS+TKGL +PLNVAPPS VSNP  K
Sbjct: 481  SSIFRSSKASNSSKLAQTVSNSASSISTGLFPNLESSSTKGLISPLNVAPPSSVSNPIAK 540

Query: 541  PLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPEM 600
            PLAKSRDPRLRMVNS+ASAMDLNPRT+TSVQNPSVVESA T+N+RKQKMDVE N+D PEM
Sbjct: 541  PLAKSRDPRLRMVNSEASAMDLNPRTMTSVQNPSVVESAVTVNMRKQKMDVEPNIDAPEM 600

Query: 601  KRQRIGSQNHAVAANDVRAPFGSGGWLED-TMSAGPRLPSRNQMEIAEANATEKINVSNN 660
            KRQRIGSQNHA +A+D+RA  GSGGWLED TMSA PRL SRNQMEIAEANA EK NV+NN
Sbjct: 601  KRQRIGSQNHAFSASDLRAGSGSGGWLEDNTMSAVPRLSSRNQMEIAEANAIEKNNVTNN 660

Query: 661  SGAGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNV 720
            SGAGN   P ISA+ +ASLPSLLKDIVVNPTMLLSLLKM+QQ+Q+AAELKL SSEPEKN 
Sbjct: 661  SGAGNSRGPMISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLNSSEPEKNA 720

Query: 721  ICPTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPR 780
            ICPTAVNPCLGSSPLVNAP VTSGILQQSAGT SVPSPPVVTV   DDVGKVRMKPRDPR
Sbjct: 721  ICPTAVNPCLGSSPLVNAPAVTSGILQQSAGTPSVPSPPVVTV---DDVGKVRMKPRDPR 780

Query: 781  RILHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGD-IPNGHKQEGQGDLRLASSQPLLPDI 840
            RILHGNSL KV S+ NEQLK VVP  PN EGS D IPNGHKQEGQG+LRLASSQPLLPDI
Sbjct: 781  RILHGNSLHKVDSMRNEQLKSVVPAVPNPEGSRDIIPNGHKQEGQGNLRLASSQPLLPDI 840

Query: 841  GRQFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVD 900
            GRQFTNNLKNIADIMSVPSPPTSS N S+KPVKLD  D NAVGSSS+DSKIV TATQ VD
Sbjct: 841  GRQFTNNLKNIADIMSVPSPPTSSHNLSSKPVKLDRKDANAVGSSSIDSKIVATATQAVD 900

Query: 901  MVGPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLL 960
            MVGPSRS GAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLL
Sbjct: 901  MVGPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLL 960

Query: 961  NSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYEL 1020
            NSAKFVEVDP+HDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYEL
Sbjct: 961  NSAKFVEVDPLHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYEL 1020

Query: 1021 HLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESA 1080
            HLYTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESA
Sbjct: 1021 HLYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESA 1080

Query: 1081 VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLG 1140
            VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL 
Sbjct: 1081 VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1140

BLAST of Tan0002421 vs. NCBI nr
Match: XP_022148889.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Momordica charantia])

HSP 1 Score: 1867.8 bits (4837), Expect = 0.0e+00
Identity = 977/1145 (85.33%), Positives = 1040/1145 (90.83%), Query Frame = 0

Query: 1    MGKDESVKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLYK 60
            MGKDESVKI+DVEEGEISDTASVEEI+EEDFNKLETG K++PSKDSNR+ RVWTMSDLYK
Sbjct: 1    MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLETGAKLVPSKDSNREPRVWTMSDLYK 60

Query: 61   NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSS-SPFANGKEDGNAT 120
            NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADP+EKSKRSSS SP AN    GN+T
Sbjct: 61   NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPEEKSKRSSSPSPLAN----GNST 120

Query: 121  KEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTEC- 180
            KEEGKV ID S D+M+  NANVE+EEGELEEGEIDMDTEFVEEVV+SKAMLSD  DT+C 
Sbjct: 121  KEEGKVTIDDSSDEMDYGNANVEREEGELEEGEIDMDTEFVEEVVESKAMLSDSGDTDCD 180

Query: 181  -QDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRK 240
             Q+ DL  K+LDDQVKLIQKTLDGVTIDAAQKSF+EVC+QLHSSIE FL+L+Q KV P K
Sbjct: 181  GQESDLVKKELDDQVKLIQKTLDGVTIDAAQKSFEEVCTQLHSSIEIFLKLLQEKVFPXK 240

Query: 241  DALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKM 300
            DALIQRLYAALRIINSVFCSMN NEKEE+KQHLSRLLSY KNCNPPLFSPEQIKS+EVKM
Sbjct: 241  DALIQRLYAALRIINSVFCSMNLNEKEEYKQHLSRLLSYVKNCNPPLFSPEQIKSVEVKM 300

Query: 301  PSTDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTA 360
            PSTDSLD L   RA+AKE EIHIPNGVKNKDFYS  TNA PHLTSSTKL   SMPV V A
Sbjct: 301  PSTDSLDYLSIIRANAKEAEIHIPNGVKNKDFYSGSTNAGPHLTSSTKLPSDSMPVGVMA 360

Query: 361  KNNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPA 420
            KNN NI SDG QSGVSN++GRGPLLPLLDLHKDHD DSLPSPTREAP+IF VQK G+ P 
Sbjct: 361  KNNPNILSDGSQSGVSNLRGRGPLLPLLDLHKDHDVDSLPSPTREAPSIFPVQKLGNTPP 420

Query: 421  KVAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVS 480
            KVA AMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDG GDIGGEVS
Sbjct: 421  KVALAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGAGDIGGEVS 480

Query: 481  SSSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTV 540
            SSSIIRS KASNS KLGQ VSNS+SNIS G FP++ESS+ KGL +P+NVAPPSCVSNPTV
Sbjct: 481  SSSIIRSLKASNSPKLGQNVSNSASNISAGYFPNMESSSIKGLISPINVAPPSCVSNPTV 540

Query: 541  KPLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPE 600
            KPL KSRDPR R++NSDASA+DLNPRT+ SVQN S+ ES  TINLRKQKM  E N+DGPE
Sbjct: 541  KPLPKSRDPRRRIINSDASALDLNPRTIASVQNSSIAESDATINLRKQKMGEEPNVDGPE 600

Query: 601  MKRQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINVSNN 660
            MKRQR GSQNHAVAA+DVR   GSGGWLEDTM  GPRL SRNQMEI+EA+ATEK+NV+NN
Sbjct: 601  MKRQRTGSQNHAVAASDVRT--GSGGWLEDTMPVGPRLSSRNQMEISEADATEKLNVTNN 660

Query: 661  SGAGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNV 720
            S AGNEC P+ISA+NDASLPSLLKDI VNPTM LSLLKMSQQQ LAAELKLKSSE EKN 
Sbjct: 661  SVAGNECTPSISASNDASLPSLLKDIAVNPTMFLSLLKMSQQQHLAAELKLKSSELEKNA 720

Query: 721  ICPTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPR 780
            ICPT++NPC GSSPLVN P VTSGILQQS GT SVPSPPV TV+RQDD+GKVRMKPRDPR
Sbjct: 721  ICPTSLNPCQGSSPLVNTPSVTSGILQQSTGTSSVPSPPVATVSRQDDLGKVRMKPRDPR 780

Query: 781  RILHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGDIPNGHKQEGQGDLRLASSQPLLPDIG 840
            RILHGNSLQKVG+LGNEQ KG+VPTAPNTEGS D+PNGHKQEG GDLRLASSQ + PDI 
Sbjct: 781  RILHGNSLQKVGNLGNEQSKGIVPTAPNTEGSKDVPNGHKQEGLGDLRLASSQSVPPDIT 840

Query: 841  RQFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVDM 900
            R FT NLKNIADI+S  SPPTSS +SS+KPVKLD MDTN+VGSSS+DSK+VTTATQ VDM
Sbjct: 841  RPFTKNLKNIADILSGSSPPTSSLSSSSKPVKLDRMDTNSVGSSSIDSKVVTTATQAVDM 900

Query: 901  VGPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLN 960
            VG SRSQG WGDLEHLFEGYDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHTLLN
Sbjct: 901  VGLSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLN 960

Query: 961  SAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
            SAKFVEV+PVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH
Sbjct: 961  SAKFVEVEPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020

Query: 1021 LYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080
            LYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV
Sbjct: 1021 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080

Query: 1081 VIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGV 1140
            VIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGV
Sbjct: 1081 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGV 1139

BLAST of Tan0002421 vs. ExPASy TrEMBL
Match: A0A6J1H839 (Protein-serine/threonine phosphatase OS=Cucurbita moschata OX=3662 GN=LOC111460939 PE=4 SV=1)

HSP 1 Score: 1930.6 bits (5000), Expect = 0.0e+00
Identity = 1002/1144 (87.59%), Positives = 1057/1144 (92.40%), Query Frame = 0

Query: 1    MGKDES-VKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLY 60
            MGK  + VK  DVEEGEISDT SVEEITEEDFNKLET PK+LPSK SNR+T VWTMSDLY
Sbjct: 1    MGKHTNCVKTPDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60

Query: 61   KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDGNAT 120
             NYPTMCRGYA GLYNLAWA+AVQNKPLN+IF+M+ADPD+KS RSSSSPF N KE GN T
Sbjct: 61   NNYPTMCRGYAPGLYNLAWAKAVQNKPLNEIFLMDADPDDKSNRSSSSPFRNAKEHGNGT 120

Query: 121  KEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTECQ 180
            KEE K++ID++GDDMN DNA+VEKEEGELEEGEIDMDTEFVEEVVDSK MLSD +DT+CQ
Sbjct: 121  KEEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSDSLDTDCQ 180

Query: 181  DIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRKDA 240
            +IDLK K+LDDQ+KLI KTLDGVTIDAAQKSFQEVCSQL SSIETFLELVQGKVVPRKD 
Sbjct: 181  EIDLKNKELDDQLKLIHKTLDGVTIDAAQKSFQEVCSQLLSSIETFLELVQGKVVPRKDV 240

Query: 241  LIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKMPS 300
            LIQRLYAALRIINSVFCSMNP EKEE+KQHLSRLLS+ KNCNPPLFSPEQIKS+EVKMPS
Sbjct: 241  LIQRLYAALRIINSVFCSMNPKEKEEYKQHLSRLLSFVKNCNPPLFSPEQIKSVEVKMPS 300

Query: 301  TDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTAKN 360
            TDSLD  P  RASAK+VEIHIPNGVKNKDFYSAY  A+PHLTSSTKL   SMPV VT KN
Sbjct: 301  TDSLDQFPDMRASAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTVKN 360

Query: 361  NINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPAKV 420
            ++N+SSD L SGV NVKGRGPLLPLLDLHKDHD DSLPSPTREAPT+FSVQKSGHIP KV
Sbjct: 361  SLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPVKV 420

Query: 421  AHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480
            AHAMDGSR HPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS
Sbjct: 421  AHAMDGSRVHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSSS 480

Query: 481  SIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTVKP 540
            SI+RSSKASNSSKL Q VS S+S+IS GLFP+LESS+TKGL +P NVAPPSCVSNP  KP
Sbjct: 481  SILRSSKASNSSKLAQTVSQSASSISTGLFPNLESSSTKGLISPSNVAPPSCVSNPIAKP 540

Query: 541  LAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPEMK 600
            LAKSRDPRLRMVNS+ASAMDLNPRT+TSVQ+PSVVESA T+NLRKQKMDVE N+D PEMK
Sbjct: 541  LAKSRDPRLRMVNSEASAMDLNPRTMTSVQSPSVVESAVTVNLRKQKMDVEPNIDAPEMK 600

Query: 601  RQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINVSNNSG 660
            RQRIGSQNHA +A+D+RA  GSGGWLEDTMSA PRL SRNQMEIAEANATEK NV+NNSG
Sbjct: 601  RQRIGSQNHAFSASDLRAGSGSGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVTNNSG 660

Query: 661  AGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNVIC 720
            AGN C PTISA+ +ASLPSLLKDIVVNPTMLLSLLKM+QQ+Q+AAELKLKSSEPEKN IC
Sbjct: 661  AGNSCGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLKSSEPEKNAIC 720

Query: 721  PTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPRRI 780
            PTAVNPCLGSSPLVNAP +TSGILQQSAGT SVPSPPVVTV   DDVGKVRMKPRDPRRI
Sbjct: 721  PTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTV---DDVGKVRMKPRDPRRI 780

Query: 781  LHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGDI-PNGHKQEGQGDLRLASSQPLLPDIGR 840
            LHGNSL KVGS+GNEQLK VVP  PN EGS DI PNGHKQEGQG+LRLASSQPLLPDIGR
Sbjct: 781  LHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIVPNGHKQEGQGNLRLASSQPLLPDIGR 840

Query: 841  QFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVDMV 900
            QFTNNLKNIADIMSVPSPPTSS NSS+KPVKLD  DTNAVGSSS+DSKIV TATQ VDMV
Sbjct: 841  QFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDIKDTNAVGSSSIDSKIVATATQVVDMV 900

Query: 901  GPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960
            GPSRS GAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS
Sbjct: 901  GPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNS 960

Query: 961  AKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020
            AKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL
Sbjct: 961  AKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHL 1020

Query: 1021 YTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVV 1080
            YTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAVV
Sbjct: 1021 YTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAVV 1080

Query: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVI 1140
            IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL VI
Sbjct: 1081 IIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVI 1140

BLAST of Tan0002421 vs. ExPASy TrEMBL
Match: A0A6J1KU06 (Protein-serine/threonine phosphatase OS=Cucurbita maxima OX=3661 GN=LOC111498198 PE=4 SV=1)

HSP 1 Score: 1889.8 bits (4894), Expect = 0.0e+00
Identity = 988/1146 (86.21%), Positives = 1047/1146 (91.36%), Query Frame = 0

Query: 1    MGKDES-VKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLY 60
            MGK  + +K QDVEEGEISDT SVEEITEEDFN LET PK+LPSK SNR+T VWTMSDLY
Sbjct: 1    MGKHTNCLKTQDVEEGEISDTPSVEEITEEDFNNLETVPKLLPSKHSNRETTVWTMSDLY 60

Query: 61   KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDGNAT 120
             NYPTMCRGYASGLYNLAWA+AVQNKPLN+IF+ EADPD+KS RSSSSPF N KE GN T
Sbjct: 61   NNYPTMCRGYASGLYNLAWAKAVQNKPLNEIFLTEADPDDKSHRSSSSPFRNAKEHGNGT 120

Query: 121  KEE-GKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTEC 180
             EE  K++ID++GDDMN +NA+VEKEEGELEEGEIDMDTEFVEEVVDS+ MLSD +DT+C
Sbjct: 121  IEEAAKLIIDITGDDMNTNNADVEKEEGELEEGEIDMDTEFVEEVVDSRPMLSDSLDTDC 180

Query: 181  QDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRKD 240
            Q+ID K K+LDDQ+KL+ KTLDGVTIDAAQKSFQE+CSQL SSIETFLELVQGKVVPRKD
Sbjct: 181  QEIDFKNKELDDQLKLVHKTLDGVTIDAAQKSFQEICSQLLSSIETFLELVQGKVVPRKD 240

Query: 241  ALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKMP 300
            ALIQRLYAALRIINSVFCSMNP EK+E+KQHLSRLLS+ KNCNP LFSPEQIKS+EVKMP
Sbjct: 241  ALIQRLYAALRIINSVFCSMNPKEKDEYKQHLSRLLSFVKNCNPALFSPEQIKSVEVKMP 300

Query: 301  STDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTAK 360
            STDSLD+ P  R SAK+VEIHIPNGVKNKDFYSAY  A+PHLTSSTKL   SMPV VT K
Sbjct: 301  STDSLDHFPDMRDSAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTVK 360

Query: 361  NNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPAK 420
            NN+N+SSD L SGV NVKGRGPLLPLLDLHKDHD DSLPSPTREAPT+FSVQKSGHIP K
Sbjct: 361  NNLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPVK 420

Query: 421  VAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSS 480
            VAHAMDGSR HPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSS
Sbjct: 421  VAHAMDGSRVHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVSS 480

Query: 481  SSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTVK 540
            SSI RSSKASNSSKL Q VSNS+S+IS GLFP+LESS+TKGL +PLNVAPPS VSNP  K
Sbjct: 481  SSIFRSSKASNSSKLAQTVSNSASSISTGLFPNLESSSTKGLISPLNVAPPSSVSNPIAK 540

Query: 541  PLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPEM 600
            PLAKSRDPRLRMVNS+ASAMDLNPRT+TSVQNPSVVESA T+N+RKQKMDVE N+D PEM
Sbjct: 541  PLAKSRDPRLRMVNSEASAMDLNPRTMTSVQNPSVVESAVTVNMRKQKMDVEPNIDAPEM 600

Query: 601  KRQRIGSQNHAVAANDVRAPFGSGGWLED-TMSAGPRLPSRNQMEIAEANATEKINVSNN 660
            KRQRIGSQNHA +A+D+RA  GSGGWLED TMSA PRL SRNQMEIAEANA EK NV+NN
Sbjct: 601  KRQRIGSQNHAFSASDLRAGSGSGGWLEDNTMSAVPRLSSRNQMEIAEANAIEKNNVTNN 660

Query: 661  SGAGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNV 720
            SGAGN   P ISA+ +ASLPSLLKDIVVNPTMLLSLLKM+QQ+Q+AAELKL SSEPEKN 
Sbjct: 661  SGAGNSRGPMISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLNSSEPEKNA 720

Query: 721  ICPTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPR 780
            ICPTAVNPCLGSSPLVNAP VTSGILQQSAGT SVPSPPVVTV   DDVGKVRMKPRDPR
Sbjct: 721  ICPTAVNPCLGSSPLVNAPAVTSGILQQSAGTPSVPSPPVVTV---DDVGKVRMKPRDPR 780

Query: 781  RILHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGD-IPNGHKQEGQGDLRLASSQPLLPDI 840
            RILHGNSL KV S+ NEQLK VVP  PN EGS D IPNGHKQEGQG+LRLASSQPLLPDI
Sbjct: 781  RILHGNSLHKVDSMRNEQLKSVVPAVPNPEGSRDIIPNGHKQEGQGNLRLASSQPLLPDI 840

Query: 841  GRQFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVD 900
            GRQFTNNLKNIADIMSVPSPPTSS N S+KPVKLD  D NAVGSSS+DSKIV TATQ VD
Sbjct: 841  GRQFTNNLKNIADIMSVPSPPTSSHNLSSKPVKLDRKDANAVGSSSIDSKIVATATQAVD 900

Query: 901  MVGPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLL 960
            MVGPSRS GAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLL
Sbjct: 901  MVGPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLL 960

Query: 961  NSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYEL 1020
            NSAKFVEVDP+HDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYEL
Sbjct: 961  NSAKFVEVDPLHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYEL 1020

Query: 1021 HLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESA 1080
            HLYTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESA
Sbjct: 1021 HLYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESA 1080

Query: 1081 VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLG 1140
            VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL 
Sbjct: 1081 VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1140

BLAST of Tan0002421 vs. ExPASy TrEMBL
Match: A0A6J1D5D6 (Protein-serine/threonine phosphatase OS=Momordica charantia OX=3673 GN=LOC111017451 PE=4 SV=1)

HSP 1 Score: 1867.8 bits (4837), Expect = 0.0e+00
Identity = 977/1145 (85.33%), Positives = 1040/1145 (90.83%), Query Frame = 0

Query: 1    MGKDESVKIQDVEEGEISDTASVEEITEEDFNKLETGPKVLPSKDSNRDTRVWTMSDLYK 60
            MGKDESVKI+DVEEGEISDTASVEEI+EEDFNKLETG K++PSKDSNR+ RVWTMSDLYK
Sbjct: 1    MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLETGAKLVPSKDSNREPRVWTMSDLYK 60

Query: 61   NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSS-SPFANGKEDGNAT 120
            NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADP+EKSKRSSS SP AN    GN+T
Sbjct: 61   NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPEEKSKRSSSPSPLAN----GNST 120

Query: 121  KEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDCMDTEC- 180
            KEEGKV ID S D+M+  NANVE+EEGELEEGEIDMDTEFVEEVV+SKAMLSD  DT+C 
Sbjct: 121  KEEGKVTIDDSSDEMDYGNANVEREEGELEEGEIDMDTEFVEEVVESKAMLSDSGDTDCD 180

Query: 181  -QDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVVPRK 240
             Q+ DL  K+LDDQVKLIQKTLDGVTIDAAQKSF+EVC+QLHSSIE FL+L+Q KV P K
Sbjct: 181  GQESDLVKKELDDQVKLIQKTLDGVTIDAAQKSFEEVCTQLHSSIEIFLKLLQEKVFPXK 240

Query: 241  DALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIEVKM 300
            DALIQRLYAALRIINSVFCSMN NEKEE+KQHLSRLLSY KNCNPPLFSPEQIKS+EVKM
Sbjct: 241  DALIQRLYAALRIINSVFCSMNLNEKEEYKQHLSRLLSYVKNCNPPLFSPEQIKSVEVKM 300

Query: 301  PSTDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKL--SSMPVEVTA 360
            PSTDSLD L   RA+AKE EIHIPNGVKNKDFYS  TNA PHLTSSTKL   SMPV V A
Sbjct: 301  PSTDSLDYLSIIRANAKEAEIHIPNGVKNKDFYSGSTNAGPHLTSSTKLPSDSMPVGVMA 360

Query: 361  KNNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGHIPA 420
            KNN NI SDG QSGVSN++GRGPLLPLLDLHKDHD DSLPSPTREAP+IF VQK G+ P 
Sbjct: 361  KNNPNILSDGSQSGVSNLRGRGPLLPLLDLHKDHDVDSLPSPTREAPSIFPVQKLGNTPP 420

Query: 421  KVAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGGEVS 480
            KVA AMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDG GDIGGEVS
Sbjct: 421  KVALAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGAGDIGGEVS 480

Query: 481  SSSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSNPTV 540
            SSSIIRS KASNS KLGQ VSNS+SNIS G FP++ESS+ KGL +P+NVAPPSCVSNPTV
Sbjct: 481  SSSIIRSLKASNSPKLGQNVSNSASNISAGYFPNMESSSIKGLISPINVAPPSCVSNPTV 540

Query: 541  KPLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLDGPE 600
            KPL KSRDPR R++NSDASA+DLNPRT+ SVQN S+ ES  TINLRKQKM  E N+DGPE
Sbjct: 541  KPLPKSRDPRRRIINSDASALDLNPRTIASVQNSSIAESDATINLRKQKMGEEPNVDGPE 600

Query: 601  MKRQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINVSNN 660
            MKRQR GSQNHAVAA+DVR   GSGGWLEDTM  GPRL SRNQMEI+EA+ATEK+NV+NN
Sbjct: 601  MKRQRTGSQNHAVAASDVRT--GSGGWLEDTMPVGPRLSSRNQMEISEADATEKLNVTNN 660

Query: 661  SGAGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPEKNV 720
            S AGNEC P+ISA+NDASLPSLLKDI VNPTM LSLLKMSQQQ LAAELKLKSSE EKN 
Sbjct: 661  SVAGNECTPSISASNDASLPSLLKDIAVNPTMFLSLLKMSQQQHLAAELKLKSSELEKNA 720

Query: 721  ICPTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPR 780
            ICPT++NPC GSSPLVN P VTSGILQQS GT SVPSPPV TV+RQDD+GKVRMKPRDPR
Sbjct: 721  ICPTSLNPCQGSSPLVNTPSVTSGILQQSTGTSSVPSPPVATVSRQDDLGKVRMKPRDPR 780

Query: 781  RILHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGDIPNGHKQEGQGDLRLASSQPLLPDIG 840
            RILHGNSLQKVG+LGNEQ KG+VPTAPNTEGS D+PNGHKQEG GDLRLASSQ + PDI 
Sbjct: 781  RILHGNSLQKVGNLGNEQSKGIVPTAPNTEGSKDVPNGHKQEGLGDLRLASSQSVPPDIT 840

Query: 841  RQFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQPVDM 900
            R FT NLKNIADI+S  SPPTSS +SS+KPVKLD MDTN+VGSSS+DSK+VTTATQ VDM
Sbjct: 841  RPFTKNLKNIADILSGSSPPTSSLSSSSKPVKLDRMDTNSVGSSSIDSKVVTTATQAVDM 900

Query: 901  VGPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLN 960
            VG SRSQG WGDLEHLFEGYDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHTLLN
Sbjct: 901  VGLSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLN 960

Query: 961  SAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
            SAKFVEV+PVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH
Sbjct: 961  SAKFVEVEPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020

Query: 1021 LYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080
            LYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV
Sbjct: 1021 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080

Query: 1081 VIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGV 1140
            VIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGV
Sbjct: 1081 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGV 1139

BLAST of Tan0002421 vs. ExPASy TrEMBL
Match: A0A0A0KAB9 (Protein-serine/threonine phosphatase OS=Cucumis sativus OX=3659 GN=Csa_6G091910 PE=4 SV=1)

HSP 1 Score: 1827.8 bits (4733), Expect = 0.0e+00
Identity = 963/1148 (83.89%), Positives = 1039/1148 (90.51%), Query Frame = 0

Query: 1    MGKDESVKIQDVEEGEISDTASVEEITEEDFNKLET--GPK-VLPSKDSNRDTRVWTMSD 60
            MGKDE +KI+DVEEGEISDTASVEEI+EEDFNKL++   PK V+PSKDSNR+TRVWTMSD
Sbjct: 1    MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSD 60

Query: 61   LYKNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDG- 120
            LYKNYP M  GYASGLYNLAWAQAVQNKPLNDIFVMEAD DEKSK SSS+PF N K+DG 
Sbjct: 61   LYKNYPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGS 120

Query: 121  NATKEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDC--M 180
            N TKEE +VVID SGD+MNCDNAN EKEEGELEEGEIDMDTEFVEEV DSKAMLSD   M
Sbjct: 121  NTTKEEDRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDM 180

Query: 181  DTECQDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVV 240
            D   Q+ DL+ K+LD+ +K IQKTLDGVTIDAAQKSFQEVCSQ+HSSIETF+EL+QGKVV
Sbjct: 181  DINGQEFDLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVV 240

Query: 241  PRKDALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIE 300
            PRKDALIQRLYAALR+INSVFCSMN +EKEEHK+HLSRLLSY KNC+PPLFSPEQIKS+E
Sbjct: 241  PRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVE 300

Query: 301  VKMPSTDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKLS--SMPVE 360
            VKMPSTDSLD+LPS R SAKEVEIHIPNGVK+ DFYSAYT+ S  LT S KL+  S+P  
Sbjct: 301  VKMPSTDSLDHLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPFG 360

Query: 361  VTAKNNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGH 420
            V  KNN+NI S+GLQSGVS++KGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSG+
Sbjct: 361  VKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGN 420

Query: 421  IPAKVAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGG 480
             P K+A  +DGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE DGGGDIGG
Sbjct: 421  APTKMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEHDGGGDIGG 480

Query: 481  EVSSSSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSN 540
            EVSSSSIIRS K+SN SK GQK SNS+SN+S GLFP+++SS+T+ L +PLNVAPPS VSN
Sbjct: 481  EVSSSSIIRSLKSSNVSKPGQK-SNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSN 540

Query: 541  PTVKPLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLD 600
            PTVKPLAKSRDPRLR+VNSDAS MDLNPRT+ SVQ+ S++ESA T++LRKQKMD E N D
Sbjct: 541  PTVKPLAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNTD 600

Query: 601  GPEMKRQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINV 660
            GPE+KR RIGSQN AVAA+DVRA  GSGGWLEDTM AGPRL +RNQMEIAEANATEK NV
Sbjct: 601  GPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLFNRNQMEIAEANATEKSNV 660

Query: 661  SNNSGAGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPE 720
            +NNSG+GNEC PT++ +NDASLPSLLKDIVVNPTMLL+LLKMSQQQQLAAELKLKSSEPE
Sbjct: 661  TNNSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPE 720

Query: 721  KNVICPTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPR 780
            KN ICPT++NPC GSSPL+NAP  TSGILQQSAGT S  + PVV V RQDD+GKVRMKPR
Sbjct: 721  KNAICPTSLNPCQGSSPLINAPVATSGILQQSAGTPS--ASPVVAVGRQDDLGKVRMKPR 780

Query: 781  DPRRILHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGDIPNGHKQEGQGDLRLASSQPLLP 840
            DPRR+LHGNSLQKVGSLGN+QLKGVVPTA NTEGS DIPNGHKQEGQGD +LASSQ +LP
Sbjct: 781  DPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQEGQGDSKLASSQTILP 840

Query: 841  DIGRQFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQP 900
            DIGRQFTNNLKNIADIMSVPSPPTSSPNSS+KP          VGSSS+DSK VTTA Q 
Sbjct: 841  DIGRQFTNNLKNIADIMSVPSPPTSSPNSSSKP----------VGSSSMDSKPVTTAFQA 900

Query: 901  VDMVGPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHT 960
            VDM   SRSQGAWGDLEHLF+ YDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHT
Sbjct: 901  VDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHT 960

Query: 961  LLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELY 1020
            LLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELY
Sbjct: 961  LLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELY 1020

Query: 1021 ELHLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGME 1080
            ELHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGD+RVPKSKDLEGVLGME
Sbjct: 1021 ELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGME 1080

Query: 1081 SAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASS 1140
            S VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASS
Sbjct: 1081 SGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASS 1135
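
The percentages in an HSP header follow directly from the residue counts printed on the same line. As a minimal sketch (the regular expression and the function name `recompute_percentages` are illustrative and not part of any BLAST tool), the figures reported for the A0A0A0KAB9 hit can be re-derived as follows. The pattern accepts both the "Postives" and "Positives" spellings of the label.

```python
import re

# Matches the statistics line of an HSP header as shown on this page.
# "Posi?tives" tolerates the misspelt label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


# Statistics line copied from the A0A0A0KAB9 (Cucumis sativus) hit.
line = "Identity = 963/1148 (83.89%), Postives = 1039/1148 (90.51%), Query Frame = 0"
print(recompute_percentages(line))  # (83.89, 90.51)
```

The denominator (1148) is the alignment length including gap columns, which is why it can exceed the last residue number shown on the Query or Sbjct rows.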

BLAST of Tan0002421 vs. ExPASy TrEMBL
Match: A0A5A7TDW7 (Protein-serine/threonine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold122G001260 PE=4 SV=1)

HSP 1 Score: 1802.3 bits (4667), Expect = 0.0e+00
Identity = 957/1148 (83.36%), Positives = 1034/1148 (90.07%), Query Frame = 0

Query: 1    MGKDESVKIQDVEEGEISDTASVEEITEEDFNKLETG--PK-VLPSKDSNRDTRVWTMSD 60
            MGKDE +KI+DVEEGEISDTASVEEI+EEDFNKL++   PK V+PSKDSNR+ RVWTMS+
Sbjct: 1    MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSAPPKVVVPSKDSNRE-RVWTMSE 60

Query: 61   LYKNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSSSPFANGKEDG- 120
            LYKNYP+M  GYASGLYNLAWAQAVQNKPLNDIFVMEAD DEKSKRSSS+   N K+DG 
Sbjct: 61   LYKNYPSMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKRSSSTTVGNAKDDGS 120

Query: 121  NATKEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDMDTEFVEEVVDSKAMLSDC--M 180
            N TKEE +V+ID SGD+MNCDNAN EKEEGELEEGEIDMDTEFVEEV DSKAMLSD   M
Sbjct: 121  NTTKEEDRVLIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSREM 180

Query: 181  DTECQDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFLELVQGKVV 240
            D   Q+ DL+ K+LD+ +KLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETF+ELVQGKVV
Sbjct: 181  DIHGQEFDLENKELDELLKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFVELVQGKVV 240

Query: 241  PRKDALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNCNPPLFSPEQIKSIE 300
            PRKDAL+QRLYAA R+INSVFCSMN NEKEEHK+ LSRLLSY KNC+PPLFSPEQIKS+E
Sbjct: 241  PRKDALVQRLYAAFRLINSVFCSMNLNEKEEHKEQLSRLLSYVKNCDPPLFSPEQIKSVE 300

Query: 301  VKMPSTDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHLTSSTKLSSMPVE-- 360
            VKMPSTD LD L S + SAKEVEIHIPNGVK KDFYSAYT+AS  LT S KL+S  +   
Sbjct: 301  VKMPSTDYLDQLLSMKGSAKEVEIHIPNGVKVKDFYSAYTDASSQLTPSNKLASDSITFG 360

Query: 361  VTAKNNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGH 420
            V  KNN NI S+GLQSGVS++KGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSG+
Sbjct: 361  VKGKNNPNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGN 420

Query: 421  IPAKVAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDGGGDIGG 480
             P K+A A+DG RSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE DGGGDIGG
Sbjct: 421  APTKMAFAVDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEHDGGGDIGG 480

Query: 481  EVSSSSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESSNTKGLSNPLNVAPPSCVSN 540
            EVSSSSIIRS K+SN+SK GQK SNS+SN+S GLFP+++SS+T+ L +PLNVAPPS VSN
Sbjct: 481  EVSSSSIIRSLKSSNASKPGQK-SNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSN 540

Query: 541  PTVKPLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSVVESATTINLRKQKMDVESNLD 600
            PTVKPLAKSRDPRLR+VNSDASAMDLNPRT+TSVQ+ S++ESA T++LRKQKMD E N D
Sbjct: 541  PTVKPLAKSRDPRLRIVNSDASAMDLNPRTITSVQSSSILESAATLHLRKQKMDGEPNTD 600

Query: 601  GPEMKRQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAGPRLPSRNQMEIAEANATEKINV 660
            GPEMKR RIGSQN AVAA+DVRA  GSGGWLEDT+ AGPRL +RNQMEIAEANATEK NV
Sbjct: 601  GPEMKRPRIGSQNLAVAASDVRAVSGSGGWLEDTIPAGPRLFNRNQMEIAEANATEKTNV 660

Query: 661  SNNSGAGNECAPTISATNDASLPSLLKDIVVNPTMLLSLLKMSQQQQLAAELKLKSSEPE 720
            +NNSG+ NEC PTI+ + DASLPSLLKDIVVNPTMLL+LLKMSQQQQLAAELKLKSSEPE
Sbjct: 661  TNNSGSENECTPTINNSKDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPE 720

Query: 721  KNVICPTAVNPCLGSSPLVNAPPVTSGILQQSAGTLSVPSPPVVTVTRQDDVGKVRMKPR 780
            KN ICPT++NPC GSSPL+NAP VTSGILQQSAGT S  + PVV V RQDD+GKVRMKPR
Sbjct: 721  KNAICPTSLNPCQGSSPLINAPAVTSGILQQSAGTPS--ASPVVAVGRQDDLGKVRMKPR 780

Query: 781  DPRRILHGNSLQKVGSLGNEQLKGVVPTAPNTEGSGDIPNGHKQEGQGDLRLASSQPLLP 840
            DPRR+LHGNSLQKVGSLGN+QLKG+VPT  NTEGS DI NGHKQ+GQGD +LASSQ LLP
Sbjct: 781  DPRRVLHGNSLQKVGSLGNDQLKGIVPTTSNTEGSRDILNGHKQDGQGDSKLASSQTLLP 840

Query: 841  DIGRQFTNNLKNIADIMSVPSPPTSSPNSSAKPVKLDGMDTNAVGSSSVDSKIVTTATQP 900
            DIGRQFTNNLKNIADIMSVPSPPTSS NSS+KP          VGSSS+DSK VTTA+Q 
Sbjct: 841  DIGRQFTNNLKNIADIMSVPSPPTSSQNSSSKP----------VGSSSMDSKPVTTASQA 900

Query: 901  VDMVGPSRSQGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHT 960
            VDM  PSRSQGAWGDLEHLF+ YDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHT
Sbjct: 901  VDMAAPSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHT 960

Query: 961  LLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELY 1020
            LLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELY
Sbjct: 961  LLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELY 1020

Query: 1021 ELHLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGME 1080
            ELHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGD+RVPKSKDLEGVLGME
Sbjct: 1021 ELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGME 1080

Query: 1081 SAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASS 1140
            S VVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASS
Sbjct: 1081 SGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASS 1134

BLAST of Tan0002421 vs. TAIR 10
Match: AT2G33540.1 (C-terminal domain phosphatase-like 3 )

HSP 1 Score: 768.5 bits (1983), Expect = 7.8e-222
Identity = 528/1195 (44.18%), Positives = 688/1195 (57.57%), Query Frame = 0

Query: 1    MGKDESVKIQ-DVEEGEISDTASVE-EITEEDFNK-----------LETGPKVLPSKDSN 60
            MG DE++ +  DVEEGEI D+ + E E+  +               +  G +      SN
Sbjct: 15   MGNDENLMVMVDVEEGEIPDSVNTEIEVKHKSTTTTADVGGDVDVGVVAGGRGGGGGGSN 74

Query: 61   RDTRVWTMSDLYKNYPTMCRGYA-SGLYNLAWAQAVQNKPLNDIFVMEADPDEKSKRSSS 120
             ++RVWTM +L   YP   R YA SGL NLAWA+AVQNKP N+  VM+ +P         
Sbjct: 75   GNSRVWTMEELISQYPAY-RPYANSGLSNLAWARAVQNKPFNEGLVMDYEP--------- 134

Query: 121  SPFANGKEDGNATKEEGKVVIDVSGDDMNCDNANVEKEEGELEEGEIDM-----DTEFVE 180
                         +E  K+VI+ S D         EKEEGELEEGEID+     D   VE
Sbjct: 135  -------------RESDKIVIEDSDD---------EKEEGELEEGEIDLVDNASDDNLVE 194

Query: 181  EVVDSKAMLSDCMDTECQDIDLKVKDLDDQVKLIQKTLDGVTIDAAQKSFQEVCSQLHSS 240
            +  +S  ++S   D    D  LK +DL+ +VKLI+  L+  ++  AQ  F+ VCS++  +
Sbjct: 195  KDTESVVLIS--ADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGA 254

Query: 241  IETFLELV-QGKVVPRKDALIQRLYAALRIINSVFCSMNPNEKEEHKQHLSRLLSYAKNC 300
            +E+  ELV      P++D L+Q  +A+L+ IN VFCSMN   KE +K+ +SRLL+   + 
Sbjct: 255  LESLRELVSDNDDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDH 314

Query: 301  NPPLFSPEQIKSIEVKMPSTDSLDNLPSTRASAKEVEIHIPNGVKNKDFYSAYTNASPHL 360
                 S  Q   IE  M    S   +     ++ E  ++      N D + A        
Sbjct: 315  FSQFLSFNQKNEIET-MNQDLSRSAIAVFAGTSSEENVNQMTQPSNGDSFLAK------- 374

Query: 361  TSSTKLSSMPVEVTAKNNINISSDGLQSGVSNVKGRGPLLPLLDLHKDHDADSLPSPTRE 420
                                ++S+    G + ++ R P+LPLLDLHKDHDADSLPSPTRE
Sbjct: 375  -------------------KLTSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRE 434

Query: 421  APTIFSVQ------KSGHIPAKVAHAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADR 480
                  V       + G    + +   +G++ + YE+DA KAVSTYQQKFG +S    D 
Sbjct: 435  TTPSLPVNGRHTMVRPGFPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDD 494

Query: 481  LPSPTPS-EECDGGGDIGGEVSSSSIIRSSKASNSSKLGQKVSNSSSNISIGLFPHLESS 540
            LPSPTPS E  DG GD+GGEV SSS+++SS   +    GQ V    SN +    P   S 
Sbjct: 495  LPSPTPSGEPNDGNGDVGGEV-SSSVVKSSNPGSHLIYGQDVP-LPSNFNSRSMPVANSV 554

Query: 541  NTKGLSNPLNVAPPSC--VSNPTVKPLAKSRDPRLRMVNSDASAMDLNPRTVTSVQNPSV 600
            ++    + L++   S    S+ TVKP AKSRDPRLR+   DA+ + +   +    +N S 
Sbjct: 555  SSTVPPHHLSIHAISAPTASDQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSK 614

Query: 601  VE-SATTINLRKQKMDVESNLDGPEMKRQRIGSQNHAVAANDVRAPFGSGGWLEDTMSAG 660
            VE SA  +N RKQK   E  +DGP  KRQ+        +  D     G+GGWLEDT S+G
Sbjct: 615  VELSADLVNPRKQKAADEFLIDGPAWKRQK--------SDTDAPKAAGTGGWLEDTESSG 674

Query: 661  PRLPSRNQMEIAEANATEKI-NVSNNSGAGNECAPTISATNDASLPSLLKDIVVNPTMLL 720
              L   ++  + E   T    +V   S          ++T+ ASL SLLKDI VNPTMLL
Sbjct: 675  -LLKLESKPRLIENGVTSMTSSVMPTSAVSVSQKVRTASTDTASLQSLLKDIAVNPTMLL 734

Query: 721  SLLKMSQQQQLAAELKLKSSEPEKNVICP-TAVNP------CLGSSPLVNAPPVTSGILQ 780
            +LLKM ++Q++  +   K  +P +    P ++V P       + +S  + A  + SG+LQ
Sbjct: 735  NLLKMGERQKVPEKAIQKPMDPRRAAQLPGSSVQPGVSTPLSIPASNALAANSLNSGVLQ 794

Query: 781  QSAGTLSVPSPPVVTVTRQDDVGKVRMKPRDPRRILHGNSLQKVGSLGNEQLKGVVPT-- 840
             S+      + P        + G +RMKPRDPRRILHG++LQ+  S   +Q K   P+  
Sbjct: 795  DSS-----QNAPAA------ESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTL 854

Query: 841  --------APNTEGSGDIPNGHKQEGQGDLRLASSQPLL----PDIGRQFTNNLKNIADI 900
                    A + E    +         G  ++  S  LL    PD   QFT NLK+IAD 
Sbjct: 855  GTLTMKGKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIAD- 914

Query: 901  MSVPSPPTSSPNSSAKPVKLD---GMDTNAVGSSSVDSKIVTTATQPVDMVGPSRSQGAW 960
            M V S    +P +S   V+L     +  N    ++ D  +  +A       GP+RS  +W
Sbjct: 915  MVVVSQQLGNPPASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRSMNSW 974

Query: 961  GDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPV 1020
            GD+EHLFEGYDD Q+ AIQRER RR++EQ KMFA++KL LVLD+DHTLLNSAKF EV+  
Sbjct: 975  GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 1034

Query: 1021 HDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYA 1080
            H+EILRKKEEQDREK  RHLFRF HMGMWTKLRPG+WNFLEKAS+LYELHLYTMGNKLYA
Sbjct: 1035 HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1094

Query: 1081 TEMAKVLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVVIIDDSIRVW 1140
            TEMAK+LDPKG LF GRVIS+GDDGDPLDGDERVPKSKDLEGV+GMES+VVIIDDS+RVW
Sbjct: 1095 TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1125

BLAST of Tan0002421 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4 )

HSP 1 Score: 155.2 bits (391), Expect = 3.1e-37
Identity = 100/230 (43.48%), Positives = 133/230 (57.83%), Query Frame = 0

Query: 915  YDDKQKAAIQRERARRIDEQKKMF-AARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKK 974
            Y  K+    + E +R  D   +     RKL LVLDLDHTLLN+    ++ P  +E L+  
Sbjct: 94   YIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKP-EEEYLKSH 153

Query: 975  EEQDREKAQ---RHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAK 1034
                ++        LF    M M TKLRP V +FL++ASE++ +++YTMG++ YA +MAK
Sbjct: 154  THSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAK 213

Query: 1035 VLDPKGALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKM 1094
            +LDPKG  F  RVISR DDG        V   K L+ VLG ESAV+I+DD+   WP +K 
Sbjct: 214  LLDPKGEYFGDRVISR-DDG-------TVRHEKSLDVVLGQESAVLILDDTENAWPKHKD 273

Query: 1095 NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVILWLH 1141
            NLIV+ERY +F  S RQF     SL E+  DE   DG LA+ L V+   H
Sbjct: 274  NLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEPDGALATVLKVLKQAH 314

BLAST of Tan0002421 vs. TAIR 10
Match: AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 120.6 bits (301), Expect = 8.5e-27
Identity = 74/204 (36.27%), Positives = 115/204 (56.37%), Query Frame = 0

Query: 941  RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMG----MWTK 1000
            +KL LVLDLDHTLL+S     +      ++++   + RE     L++F  +G       K
Sbjct: 65   KKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASSRTRE----DLWKFRPIGHPIDRLIK 124

Query: 1001 LRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGD 1060
            LRP V +FL++A+E++ + +YTMG+++YA  + +++DPK   F  RVI++ +        
Sbjct: 125  LRPFVRDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDES------- 184

Query: 1061 ERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLL 1120
               P+ K L  VL  E  VVI+DD+  +WPH+K NLI + +Y YF    R+ GL   S  
Sbjct: 185  ---PRMKTLNLVLAEERGVVIVDDTRDIWPHHKNNLIQIRKYKYF----RRSGLDSNSYS 244

Query: 1121 EIDHDERPEDGTLASSLGVILWLH 1141
            E   DE   DG LA+ L ++  +H
Sbjct: 245  EKKTDEGENDGGLANVLKLLREVH 250

BLAST of Tan0002421 vs. TAIR 10
Match: AT5G54210.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 115.5 bits (288), Expect = 2.7e-25
Identity = 78/221 (35.29%), Positives = 114/221 (51.58%), Query Frame = 0

Query: 916  DDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEE 975
            D  Q + I     +R+  Q   F  +KL LVLDLDHTLL++   V +  +  E     EE
Sbjct: 62   DGLQLSDIAVTVTKRVTTQITCFNDKKLHLVLDLDHTLLHT---VMISNLTKEETYLIEE 121

Query: 976  QDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPK 1035
            +D  +  R L          KLRP V  FL++A++++ +++YTMG++ YA  +  ++DP+
Sbjct: 122  EDSREDLRRLNGGYSSEFLIKLRPFVHEFLKEANKMFSMYVYTMGDRDYAMNVLNLIDPE 181

Query: 1036 GALFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVV 1095
               F  RVI+R +           P  K L+ VL  E  VVI+DD+  VWP +K NL+ +
Sbjct: 182  KVYFGDRVITRNES----------PYIKTLDLVLADECGVVIVDDTPHVWPDHKRNLLEI 241

Query: 1096 ERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVI 1137
             +Y YF    R       S  E   DE   DG+LA+ L VI
Sbjct: 242  TKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLANVLKVI 269

BLAST of Tan0002421 vs. TAIR 10
Match: AT1G20320.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 108.6 bits (270), Expect = 3.3e-23
Identity = 73/201 (36.32%), Positives = 105/201 (52.24%), Query Frame = 0

Query: 941  RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKE-EQDREKAQRHLFRFPHMGMWTKLRP 1000
            RKL LVLDLDHTLL+S     +      +L + +  +D     R         M  KLRP
Sbjct: 75   RKLHLVLDLDHTLLHSIMISRLSEGEKYLLGESDFREDLWTLDRE--------MLIKLRP 134

Query: 1001 GVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGALFAGRVISRGDDGDPLDGDERV 1060
             V  FL++A+E++ +++YTMGN+ YA  + K +DPK   F  RVI+R + G         
Sbjct: 135  FVHEFLKEANEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGDRVITRDESG--------- 194

Query: 1061 PKSKDLEGVLGMESAVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEID 1120
              SK L+ VL  E  VVI+DD+  VWP ++ NL+ + +Y+YF            S  E  
Sbjct: 195  -FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF--RDYSHDKESKSYAEEK 254

Query: 1121 HDERPEDGTLASSLGVILWLH 1141
             DE    G+LA+ L V+  +H
Sbjct: 255  RDESRNQGSLANVLKVLKDVH 255
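
In each alignment block above, the unlabeled middle row annotates the columns: a letter marks an identical residue, `+` marks a substitution scored as positive, and a blank marks a mismatch or gap. The following is a minimal sketch of tallying one such row; the function name `midline_counts` is hypothetical, and the input is the middle row of the final block of the AT1G20320.1 alignment with its leading padding removed.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Count identities and positives in one BLAST midline row.

    Letters are identical residues; '+' marks a positive-scoring
    substitution. Positives include identities, as in the HSP header.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# Middle row for query residues 1121..1141 versus AT1G20320.1 (21 columns).
midline = " DE    G+LA+ L V+  +H"
print(midline_counts(midline))  # (8, 12)
```

Summing these per-block counts over a whole HSP should reproduce the numerators in its "Identity" and "Positives" fields, provided the whitespace of the middle rows has survived copying intact.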

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
Q8LL04      1.1e-220  44.18         RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana O...
Q00IB6      4.4e-36   43.48         RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O...
F4JCB2      8.0e-22   32.50         RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana O...
Q9P376      7.0e-18   34.60         RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces...
Q7TSG2      1.0e-16   33.51         RNA polymerase II subunit A C-terminal domain phosphatase OS=Mus musculus OX=100...

Match Name      E-value   Identity (%)  Description
XP_022960085.1  0.0e+00   87.59         RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita moschata]
KAG6592819.1    0.0e+00   87.06         RNA polymerase II C-terminal domain phosphatase-like 3, partial [Cucurbita argyr...
XP_023514332.1  0.0e+00   86.98         RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita pepo subsp. pe...
XP_023005106.1  0.0e+00   86.21         RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita maxima]
XP_022148889.1  0.0e+00   85.33         RNA polymerase II C-terminal domain phosphatase-like 3 [Momordica charantia]

Match Name  E-value   Identity (%)  Description
A0A6J1H839  0.0e+00   87.59         Protein-serine/threonine phosphatase OS=Cucurbita moschata OX=3662 GN=LOC1114609...
A0A6J1KU06  0.0e+00   86.21         Protein-serine/threonine phosphatase OS=Cucurbita maxima OX=3661 GN=LOC111498198...
A0A6J1D5D6  0.0e+00   85.33         Protein-serine/threonine phosphatase OS=Momordica charantia OX=3673 GN=LOC111017...
A0A0A0KAB9  0.0e+00   83.89         Protein-serine/threonine phosphatase OS=Cucumis sativus OX=3659 GN=Csa_6G091910 ...
A0A5A7TDW7  0.0e+00   83.36         Protein-serine/threonine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E...

Match Name   E-value   Identity (%)  Description
AT2G33540.1  7.8e-222  44.18         C-terminal domain phosphatase-like 3
AT5G58003.1  3.1e-37   43.48         C-terminal domain phosphatase-like 4
AT2G04930.1  8.5e-27   36.27         Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT5G54210.1  2.7e-25   35.29         Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT1G20320.1  3.3e-23   36.32         Haloacid dehalogenase-like hydrolase (HAD) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                            Source       Source Term    Source Description                                      Alignment
IPR004274  FCP1 homology domain                       SMART        SM00577        forpap2                                                 coord: 941..1105; e-value: 1.3E-53; score: 194.1
IPR004274  FCP1 homology domain                       PFAM         PF03031        NIF                                                     coord: 943..1099; e-value: 5.1E-22; score: 78.3
IPR004274  FCP1 homology domain                       PROSITE      PS50969        FCP1                                                    coord: 938..1118; score: 29.803158
IPR023214  HAD superfamily                            GENE3D       3.40.50.1000   -                                                       coord: 927..1136; e-value: 8.6E-46; score: 158.2
IPR011947  FCP1-like phosphatase, phosphatase domain  TIGRFAM      TIGR02250      TIGR02250                                               coord: 938..1101; e-value: 3.0E-49; score: 165.1
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 514..543
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 449..498
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 96..121
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 475..498
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 1..21
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 786..820
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 1..15
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 514..530
None       No IPR available                           PANTHER      PTHR23081:SF2  RNA POLYMERASE II C-TERMINAL DOMAIN PHOSPHATASE-LIKE 3  coord: 170..1140
None       No IPR available                           CDD          cd07521        HAD_FCP1-like                                           coord: 942..1095; e-value: 1.32255E-37; score: 135.416
IPR039189  CTD phosphatase Fcp1                       PANTHER      PTHR23081      RNA POLYMERASE II CTD PHOSPHATASE                       coord: 170..1140
IPR036412  HAD-like superfamily                       SUPERFAMILY  56784          HAD-like                                                coord: 933..1110
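
The `coord` values in the InterPro table give the start and end residue of each match on the protein. As a rough sketch, assuming these coordinates are 1-based and inclusive (the usual convention for InterProScan output, but not stated on this page), the lengths of the FCP1/HAD matches and the region they jointly cover can be computed as below. The helper `span_length` and the dictionary layout are illustrative only.

```python
def span_length(coord: str) -> int:
    """Length of a 'start..end' range, assuming 1-based inclusive coordinates."""
    start, end = (int(part) for part in coord.split(".."))
    return end - start + 1


# Coordinates copied from the InterPro table for the FCP1/HAD-related matches.
hits = {
    "SMART SM00577": "941..1105",
    "PFAM PF03031": "943..1099",
    "PROSITE PS50969": "938..1118",
    "GENE3D 3.40.50.1000": "927..1136",
    "TIGRFAM TIGR02250": "938..1101",
    "CDD cd07521": "942..1095",
    "SUPERFAMILY 56784": "933..1110",
}

lengths = {name: span_length(coord) for name, coord in hits.items()}
starts = [int(coord.split("..")[0]) for coord in hits.values()]
ends = [int(coord.split("..")[1]) for coord in hits.values()]
envelope = (min(starts), max(ends))

for name, length in lengths.items():
    print(f"{name}: {length} residues")
print("combined region:", envelope)  # (927, 1136)
```

All seven matches fall within residues 927..1136, the same C-terminal stretch where the TAIR 10 alignments to the CPL and HAD-superfamily proteins are found.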

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0002421.1  Tan0002421.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0070940      dephosphorylation of RNA polymerase II C-terminal domain
cellular_component  GO:0005634      nucleus
molecular_function  GO:0008420      RNA polymerase II CTD heptapeptide repeat phosphatase activity
molecular_function  GO:0004721      phosphoprotein phosphatase activity