IVF0022293 (gene) Melon (IVF77) v1

Overview
Name: IVF0022293
Type: gene
Organism: Cucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: RNA polymerase II C-terminal domain phosphatase-like
Location: chr09: 381570 .. 390540 (-)
RNA-Seq Expression: IVF0022293
Synteny: IVF0022293
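
The Location field above packs chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed and turned into a span length follows. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'chr09: 381570 .. 390540 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("chr09: 381570 .. 390540 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 8971 bp on the minus strand of chr09.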
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTTCCAGCAGTCCTCGCCGACATAGTCGAGCATTTCCGCCGATTCTTCGTCGAACACTGCCGCTACCATCGGTCTTTTACGGGATTCTTGGTTTTGGGTAAGATTTGGTGTTTTATGGTTTTTTTTTAGTATTCTTTGAGTACCCACTAAGATTTTGGCCTAAATTTGATTTCCCATAATGTTCGGATTTTCGATTCAAGATTTGGAAGACGTTAGCTTGCTGTCCAGCAAGTGTTAAGGACCGTATTGTTTGATTTTTTGGTTAGTATCAGATTTTGGAACCATTTTGGATTAGGAATAATCGTTGTAAACTGGGTATATAATTCTTATTTGTGGTTTATGCATAGGTAGGAAGTTTTTGCTGATGTTGTGAGGTGGTTTTGGATTAAGCCACGTATTGGGTAAGTTTATCTTTCCTTTTTTTTTTTTTTTTTTTTTTTTGAGTTTTTTAGTCAAATTTGTGTTTTGGACTTGGCCCCTAAAGTTTAGGTTTGATTATGGTTTGTATTAGGGGCTTTGATCCAAGATTACTCAAAAACAAGAAGAAGATAATTGCTTGTTATTCAGCTCAATTTTGAGGTAAGTATCATTCTATTGGAACACTTTTGGGCCAGGCGAACTAAGTTTAAAACGTTGTTTTTCGCTTTGCCAAATAGTTTTTAGAAAAGCTTACATGATATGTTTTCATGCAAGCGTCTATTGTGCTTAGCCTAGCTTATTTACCTTTTATAGTTTAGTTCGTGTTGTAAGCTTGCTGTTAATTTATCTGGTTGTTATTGTTAAGTTTGGGGTTGGAAATTTAGTGTTGGAGTTGTTAATGTTATTTTGAAGTTAAGTTGTCCTTAGTGGTTTATCAAAGTGATTGTTTCATAAATTAAGTTTGGATCACATTGGTTCTTTCAGGTGACCTATTTAATGTTATAAGTGTTGAATTTCTAGTGTTGTTTAGTTGCTCCGAGCAAATATTAGAGTGTTAGTTTCTTTTAATTGTTTTGATGTTGAACCTGAGATTAATTAGGTATGAGAGTTTGGTACTAAAATTTTGGATTGTTTTGAGGTTATAAGTTCATGTAACAATTATAGTATATTATATATTCTATGAATTTTTGAACCCCAAATGTTGTAGCCGTAGGTGTTTTGTATGTGAAATTAGTTGAAGTCACGCCTGAGTTTATTTGGAAATGATTAGAAAAATAAAGAAGCGTGGCCTACTTATTGAAGTCGGCTTATTTAAGAGTTGAAATTAATATTATCCTAGAATTCTTTGTCCATCTGCTATACACAGCATTTATTTTATCTTTTTCTTTCCACTTCTAAATGTTTCATAAGATTACATGCATAATATTTTTATAAGATCTTTTCCCTTCTGAATTTGGCTCTATTGTTTTTAATCTTGGTATTGCATATTAATGCTAAAGTCTACTAAGGTCATTTACTTTCCCCTTTCTATTTTGTGGATTTTTCAGATGAGCCTTGCAACCAATTCTCCGGCTCACTCATCAAGTAGTGACGATTTTGCTGCATTTCTTGCTGTAGATCTTGATTCCCATTCGTCTGACTCATCACCTGATGAAGAGACCGAGGGTGACAATAATGCTGAAAGCGAGAGGTACGGTTAATTTTTAAGATCAATAAACTTCCGTTGTTCTTTGAAGCTAGTAATGCATAAACACAGTGGTTTAGGGTTCTAATTTTTGTTAATTTAATGTCGTTATGCTATTTTATTTTTCATCCCTCAAAAATCATTTTCTTGCCAACTTTTTGGTGTAGTCTGAGAGTTCTTGTTCCCTTTCCTTTGGGTTCGTCATTTTCTGTCCGATAGAGAAACAATGGCAGTGGTAGCTCTTTCTTTCTTTACCTGAGTGTCAACCCTTTAGAATCAGGAGAAAGGAAATTAGAGTTTGGAGCCGTAATCCTTTGGAAGGGTTCTGGTGCAAATCTTTCTTTGAGTGTTTTGTTGATCCTTCTCCTTTAGGTGTTTCAGTCTTTTTGGTGC
TTTGGAGGATTAAGGTCCCTAGAAAGGTGAGATTCTTTACTTGGCAGGTTCTTCACGACTGAGCTAGCATATTGAACAGGCTTGTTAGGAAGTTGCCTCCATTGGTTGGGCATTTTTGTTGTATTTTTGTCACAGGGCAGAGGAAGAGTTGGACTGTATTCTCCGGCATTGTGACTTTGTGAGTGGAGTCTGGATTCTTTTGTTTAGACATTTGGTTGATGTGCGCTCGTGGTAGAGATGTCAGGGTATTATCGAGGAGTTCCTCCTCAATTTTCCGTTTAGGGTGAAAGGCTGTTTGATTGTCTTGCTAGTTTGTGTGCGGTTTTATGGGTATTGTGGGGCAAACCAAATACTAGGGTGTTTTGGGGTGTTGAGAGAGATCCTATGGAGTTATGGTCCCTTACTTGCTTTCATGTTTTCCAGTGGCTTGGATGCCGAAGTTTTTTTTATCATGTTATATAGGTATTAGATGGGAGTCCCTTGTAAAGGGAATCTCCTTTTTTTGTGGGCTTGACTTTTATATGTTCGTGTATTCTTTATTTTTTCTCCTTGAAAGATGTTCTCATAAAAAAATTAATGTTTTTATTACGTAATTTAGAATAAAATGGAAAGTATTTTGTAGTTTCTGGCTCTCTTTTGAATTTTCTTATGGATAAAAATATGAAATAAGTTTATCATGATTAAGATTATGGGTTCCTTTTTCAATTTCTGGACGGTATTTTCTAAAAAATAGATTGTGTTTGTATTGTACGACATTTGAAAATGCTTGTTGAAACAGGATTTCATGATTGGTTTAACAGTTCAGATTTCCCCATCTCACTTTCTTGTGATAGTTTATAGCAATTTATAGATTCTAAGCTTCTAATCTCAAACTATTGGTTATTCTTTTAAATGATTTTTTTAAGAAAAGCCAAATTTCCATGCTGAATGAAAGAATAAGAGCAAACTGCAAAGTAGCACAAAAAGACTAATATAAAATAAAAAAATTAGACATTTACAGCAGTCATAATTATTTAAGTTCACAGGTAATGAGTTTTTCTGTCATTTAGTTTCACTTTCATCTAATCATTTTCTTTATAGATATTTCAAATGGTCTAGTTCATGTAAGTTTTTAGGCTACCATAAGTGTAATTATTATTTTGTATTAAAAGTATAAGACTACCGTAAACAGGATGCTGCTATTCAATAAGAGTCTTTGTGATTTCTATGTTTGTAATTTGTATTATTTTTCCTTTTGCATGAATAACAGCATTGAGTTGTGGCAAATGGCGACTATGATTTCTGCGCATAATAACAGGGGTTTGTAGGATTGGTTATAAGTTATAACCATATAACTAAATTGTGGTAGTTCATCCTTGTTTTCTCAATTTAACATATCAAATACTTTTTATTCAATTCGTTCAATTTTTTTAGGATAAAGCGTCGTAAGGTAGAGAAACTGGAAAACTCAGAGGATATTGTGCATGAAGTTGAAGAGCAAAGTTTAGGTGAGTTAAACTTTTGGTACTAATCCCTTTTGTTGTCCATATTTTAAACTATTGTTCAACCCTTTACTTTCAATGTAGCCTATATGTGGCTGTGAGCTGTACTATACAGTCTATACTGAACGACTTTCTTACTTCTTAGACAGCTATACAACGTGCTTGCTGTGTTTGTCCTTTTTCCCGCTCACTCTGCATATTCATAGAGCTTTTCTCTTGGCTAATCTTCTGTCACATATTGTTTTTGGTGGTTGCCTCCTCTATCCTTCTTATATGGGAATCTGATTGGAGGTTTTCCATCCTTTTTATCCATCTTTTATTTGAATCATATGAACTTTAAGAACCATATCAAATTGGCCCCAAAATGGTCTATATATAAACACCTCCCTATTAGTGTTATTGAGTAAGAAAATATTCCCTAGAAACTATGAAATCAAAAGGGCAGGGAACGAAGCATACTACATCCTTGTACAACCAAAAGTGAAATAACATTTCTCTGCTGTATGTTTAATTGGTCCA
AATGCCAGTGGATAAAAATAAAGTACATAGTATGGGACTTGCTCTTATATATGATTCATAATCTTTGGTCTAAATGAAAATAGTCATTCTTAGTTTCCTTTAAAACTAATAGTTATAGGGAAGCCTGATGGATATGTACCTGAAATTTTGTGGCATCTGATTTTGTGAACTATTTCAGTCTAGCGGTCTTCTGCTCCTCATATATTCTTTATTTCGGTTCCTTTCTTCCCCTACCAACGTAACCTATTCAGGTTCGTGTAATGTAACTTATTACCATTGAAATAACTTATTACCATCTGACTCCTTGTATCTTTGCGTTGGTTTTTAATTCTCTAATTTCTGAATTGTTATGTTTTGAAAATATTGATCTAATGCAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCAGGCGTGACATTTGGGTATATACATAAGGTATGTTTTCTTTATATGAATATTTGTTTGTCTTCTATTCTTTGTTGTCATTTATAGAATGTTTGTTGCAGTGGGGGTGGGTTAGAAGGAGTGGGAGTTTACAAAATGGGCACCAGTGGTTAGAATGACATACCAGTAATTAGAAGGAATGAGATCTTGAGTTGACCTAGTGGTAAAAAGGAGACATAATCTCAATCTATGGAGGCCCTCCATTTCTTAATAAGCTATCAGTTTCTTAAAAAATAAAGAAAAAAGAAAAAGTGAAAAAATCTCCATGAAGCTTGAGATGTTGCTAATATATATATTTTCCTTCAGTTATATTTGTTCACTGTTGGCAAAATAGACACTACATTAGCTTTGGATTTTAACTTAGCATGCCTGGAGAAGTACCAATAACTAGTAGTTTGCTGGTATATCAAATGATGGGATAAAAATATGGACCATCTGATAATCTTGTAGGAGCTGAGGCTTAATAATGATGAAATTAACCGATGCGTAACAAAGAAATGAAGGAGTTGTTGCAGCGTAAAAAGCTTATCTTGGTTCTTGATCTGGATCACACACTATTGAACTCAACTGAGCTACGGTATTTGACAGTTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTTGAAGGTAGATTGTCCTCTCATCTGTACATAGTTCTACTTTCTTAAGTAACTTCGGTATGTTATTGACTGTTATTTCATTCTCATTGTGTTTTTGCTTTCACCATTGCTGTTTTGTGTCTGCTGTGTTCTTTTTTCTTTTATATATAGGACACAAATAAGTACATTCCCCTAATCCTAAGCTTTTGTATTTGCTGCTTTAGATTAGTTGCTTTTTGAGCAGCTATGACAGTCTTCTCTTATTTAAAACTAAATACTTTAAATTCAGATTGGATAAGTTGCAAACATGGAGTTCGATATTTTTGTGAACTTAATTCAAATTTGGATTTTTTTTTTTTGTGTATTTATGTTGAGGGGCTCTTTGTTGGTAATTTAGTCTTTTGGAACGTTGGAATTACTTTGAAACTTTCCGAATATCAAGAAACGTAGAAAAAAAGGTTGGGATATTTCTTTTGCTCGACTAGTAAAGTAAAAGTGAAAAACGTAGGTTTTGCCGATCTTTCACTATATGAATGAATGTTATTTTATAATTTATTGGTGTCGAAACTATTTACAAATTATAGGAAAAACTATCAAATTCTCCATAGTTCACTTAAAATTTTTGTTATATTTTGTAAATAGTTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCTATCCATAAGAATTTCTCAAATTTATTTTATATTCTTTAGAAAAATAAAGATAAGTACTTTATTATGTATAAGCACTTCATAAGCATGATGCATGAAACTATCTCCACAACTTGACTTACAAAGGTCTCTGAGGGGTGGTTTGAGGCAAGGGGTGGAGTTCTCAAGCCTGGGGGTAGTTTAGTTTTAAATTGATGTTTTTTAGTTGTGGGGCTTACAAA
TTCATTGGACCAAAACAAAGAGTTGAGTTTTTTAGCTTCACTATCCTCAACTCCTACTCCCTTGCCAAACACCCCTTGAGAGATCTGCAAGAGTTTGAGTTTGAATGGGCCGTTATGGGATGAGGGTTCTGACATATATTTCTTTTTATTAATTCTGGAATTTGCATCTGACGATCTTTTTCCTCGCTCTGTTGAACCATGAAGCTGAATTGCACCCTTTCTTGGGTCTAAACAGTTCGGTAGTGTTTAATTTAAAATTTGCTTGCCTTCTTATATAGTGTCAAGTAATTAACCTTGAAGATTTGGCTTAATTATCTAAGCTTCCTAAAAATTTGTGACGTTTTGTTTTTGATGCTGTGGTAAAATTTTTGAATAGATGTCACGAAAGGAGGAAGCCTTTTCCTATTGAACTCCGTTCATACAATGACAAAGTTGAGGCCATTTGTCCATTCATTTTTGAAAGAAGCTAGTAAGTTGTTTGAGATGTATATATACACTATGGGGGAGCGAAGATATGCCTTTGAAATGGCAAAGTTGTTGGACCCCAAGAAGGAGTACTTCAGTTCCAAAGTTATTTCTCGGGATGATGGCACTCAAAAGCATCAGAAAGGTCTTGATGTGGTGCTGGGTAAGGAAAGTGCTGTTCTGATCCTCGATGATACTGAAAATGTAAGTGCATCTAGAATTAGATAATACTTCCGTATACTTGATCTATTGTTGCTACCTAGTTATCATAATTAAATGGTAACGGTGATTCCCATCATGCATTAAGAAAAGGCTGATAAATTGCGTAGTTCACTCGGTGGAAATTTTCTTCCACATTTGAGAGGAAGGTTAAATACATTTGATGGATAAGAAGATTAAGACGACTATGTAATGCAAAAGAAGGATTGAAATTCTAAAATTTAATTCAACAGTTTGGTTCCCTAATTCCAAAACTTATTTTATTTTACTTTTAGTTCATTGCTGATGATTGTTACGGATGTTTTCTATAATCTTTAGTTTCTCTTTCTTTTCTTTTGTGTGTGTGTGTGTGTTTTAAACTTTTCAAGGAACTCCCGCCGTTACTCCTGTGGATTATTTAATGCAACATGGTCCTCTTATTAAACAATAGAATCTTGCAAAGGGATTGCTTGTGAATACTTAGTGAGTGTCTTTATTCCTCAAGAAAAACATCCCATAAGCCTTGTTTTATATTCTTTGTGATTCAGTTTTAGTTATTTTTTTCTGCGTCTTGGAACATTGATTTTCTCATTTTTGGGTTGCTGAACTACCTATTCTTTATACACATGTGGAGCTAAACATGATCAATTTTTACGATAGGCATGGACAAAACATAAAGAAAACTTAATATTGATGGAGAGATATCACTTTTTTGCTTCAAGTTGTCGTCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAATGATGAGAGCGAAACTGATGGGGCACTGACAACCATCCTGAAAGTTCTGAAACAAGTCCATCATATATTCTTTAATGTATTTTCCTCTCCCTCTCTATTGTTACACTCTTGTTTTTCCATCTTTTTAACCTCAATCTCATCGTTAGACTTTCTTTTGTAGGAAGTCTCGGGTGATTTGGTTGATAGAGATGTGAGACAGGTAAAAGTTTGTTCCCAACTTGAAGTTCGTTATCTAATGCTTCTTGTGATGACTATCATTTCTAGTTGATTCCTACAGTTCTATCTTGTCGATAATCTATAATACTTCTTGTGATGACTATCATTTCTAATTGATTCCTACATTTGTTCATGAACTAAACATTAATTAATTTATACGGTTTATTTTTAAATGTAGCAAAATAAACCAAAATATTTACAAAATATAGTAATATGATAGTCTATCTATAATGGATTGCAATAGACCAAGCTAGATGCATCTTATGGTAGTTTATATTGGTCTATCTATTATATCGTAAATAGTTTGTGATATTTTGTTATATTTCTAAATATTTTTAGAAGTTTTGTCAT
TTAAAATCATTTTACTAAATTATTAGAAGTTCATGGATTTATTAGAAAAGTTTATTTATGTCTAATGGATTAATTAATTTTTATTTTTATTTTAATATATCAAAGTACTAAGAGATAAAAATTGACAATTTGCATGAAATAGAAATTCAATGGCATCGTGGTATTATAGCATGGACTTAATATTCAAATTCCCTTCAACTAAACAGGTACTAAAGACTGTTCGTGCTAAAGTTCTCGAGGGATGCAAAGTCGTCTTCAGCCGTGTTTTTCCTACCAAATTTCAGGCTGAGAACCATCATCTCTGGAAGATGGTAGAGCAGTTAGGAGGCACTTGCTCAACCGAACTCGACCAATCCGTCACACACGTGGTCTCGACGGATGCTGGAACTGAAAAGTCACGTTGGGCTTTGAAGGAGAAGAAGTTTCTGGTCCATCCACGGTGGATAGAAGCATCAAACTACTTCTGGAAACGACAAGTGGAAGAGAACTTTACCGTTGAGCAAACCAAAGTCGAGCAAACCAAGAAACAATGACACCGTTTCTGTTTATTACTGTAGTAGTTCCACATATCACTTAAATGGATCATGGACATGTTGGGGTGGGGTTTGCATTCTGTGCTCTTAGTGTTTCCCCCATAAATATTACTTAGGACTCTCTCACCTTATTTGTAGGTCAGCGTCTCATTTTTGAAGTAACCCTTAAAATCAACTTCAAGGGTTGATTGAAATTCAAGCTCTTTTGGTCTCTGTAGATGTGTAGATGCGTTGCTTTGTAACCATTTTGGGTTGGGTTATAATCATAGGGTGTGTTATTTTTGGCTTGTAATTCATTTTAAAAAAAATTTCAAGCCCTAACAACTATGATTCTGTATTTGTAATTCTGCTAATGTATTTGTCTCATATCAAAGACAGCATTAGTTGATAAGTTTCACGACGGAAAGATCAGTGTTAGAATTTTTATCACTATCGTGC

mRNA sequence

ATGAGTTCCAGCAGTCCTCGCCGACATAGTCGAGCATTTCCGCCGATTCTTCGTCGAACACTGCCGCTACCATCGGTCTTTTACGGGATTCTTGATTTGGAAGACGTTAGCTTGCTGTCCAGCAAGTTTTTTGCTGATGTTGTGAGGTGGTTTTGGATTAAGCCACGTATTGGGGGCTTTGATCCAAGATTACTCAAAAACAAGAAGAAGATAATTGCTTGTTATTCACCTAGCTTATTTACCTTTTATAGTTTAGTTCGTGTTATGAGCCTTGCAACCAATTCTCCGGCTCACTCATCAAGTAGTGACGATTTTGCTGCATTTCTTGCTGTAGATCTTGATTCCCATTCGTCTGACTCATCACCTGATGAAGAGACCGAGGGTGACAATAATGCTGAAAGCGAGAGGATAAAGCGTCGTAAGGTAGAGAAACTGGAAAACTCAGAGGATATTGTGCATGAAGTTGAAGAGCAAAGTTTAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCAGGCGTGACATTTGGGTATATACATAAGGAGTTGTTGCAGCGTAAAAAGCTTATCTTGGTTCTTGATCTGGATCACACACTATTGAACTCAACTGAGCTACGGTATTTGACAGTTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTTGAAGATGTCACGAAAGGAGGAAGCCTTTTCCTATTGAACTCCGTTCATACAATGACAAAGTTGAGGCCATTTGTCCATTCATTTTTGAAAGAAGCTAGTAAGTTGTTTGAGATGTATATATACACTATGGGGGAGCGAAGATATGCCTTTGAAATGGCAAAGTTGTTGGACCCCAAGAAGGAGTACTTCAGTTCCAAAGTTATTTCTCGGGATGATGGCACTCAAAAGCATCAGAAAGGTCTTGATGTGGTGCTGGGTAAGGAAAGTGCTGTTCTGATCCTCGATGATACTGAAAATGCATGGACAAAACATAAAGAAAACTTAATATTGATGGAGAGATATCACTTTTTTGCTTCAAGTTGTCGTCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAATGATGAGAGCGAAACTGATGGGGCACTGACAACCATCCTGAAAGTTCTGAAACAAGTCCATCATATATTCTTTAATTCTCGGGTGATTTGGTTGATAGAGATGTGAGACAGGTACTAAAGACTGTTCGTGCTAAAGTTCTCGAGGGATGCAAAGTCGTCTTCAGCCGTGTTTTTCCTACCAAATTTCAGGCTGAGAACCATCATCTCTGGAAGATGGTAGAGCAGTTAGGAGGCACTTGCTCAACCGAACTCGACCAATCCGTCACACACGTGGTCTCGACGGATGCTGGAACTGAAAAGTCACGTTGGGCTTTGAAGGAGAAGAAGTTTCTGGTCCATCCACGGTGGATAGAAGCATCAAACTACTTCTGGAAACGACAAGTGGAAGAGAACTTTACCGTTGAGCAAACCAAAGTCGAGCAAACCAAGAAACAATGACACCGTTTCTGTTTATTACTGTAGTAGTTCCACATATCACTTAAATGGATCATGGACATGTTGGGGTGGGGTTTGCATTCTGTGCTCTTAGTGTTTCCCCCATAAATATTACTTAGGACTCTCTCACCTTATTTGTAGGTCAGCGTCTCATTTTTGAAGTAACCCTTAAAATCAACTTCAAGGGTTGATTGAAATTCAAGCTCTTTTGGTCTCTGTAGATGTGTAGATGCGTTGCTTTGTAACCATTTTGGGTTGGGTTATAATCATAGGGTGTGTTATTTTTGGCTTGTAATTCATTTTAAAAAAAATTTCAAGCCCTAACAACTATGATTCTGTATTTGTAATTCTGCTAATGTATTTGTCTCATATCAAAGACAGCATTAGTTGATAAGTTTCACGACGGAAAGATCAGTGTTAGAATTTTTATCACTATCG
TGC

Coding sequence (CDS)

ATGAGTTCCAGCAGTCCTCGCCGACATAGTCGAGCATTTCCGCCGATTCTTCGTCGAACACTGCCGCTACCATCGGTCTTTTACGGGATTCTTGATTTGGAAGACGTTAGCTTGCTGTCCAGCAAGTTTTTTGCTGATGTTGTGAGGTGGTTTTGGATTAAGCCACGTATTGGGGGCTTTGATCCAAGATTACTCAAAAACAAGAAGAAGATAATTGCTTGTTATTCACCTAGCTTATTTACCTTTTATAGTTTAGTTCGTGTTATGAGCCTTGCAACCAATTCTCCGGCTCACTCATCAAGTAGTGACGATTTTGCTGCATTTCTTGCTGTAGATCTTGATTCCCATTCGTCTGACTCATCACCTGATGAAGAGACCGAGGGTGACAATAATGCTGAAAGCGAGAGGATAAAGCGTCGTAAGGTAGAGAAACTGGAAAACTCAGAGGATATTGTGCATGAAGTTGAAGAGCAAAGTTTAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCAGGCGTGACATTTGGGTATATACATAAGGAGTTGTTGCAGCGTAAAAAGCTTATCTTGGTTCTTGATCTGGATCACACACTATTGAACTCAACTGAGCTACGGTATTTGACAGTTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTTGAAGATGTCACGAAAGGAGGAAGCCTTTTCCTATTGAACTCCGTTCATACAATGACAAAGTTGAGGCCATTTGTCCATTCATTTTTGAAAGAAGCTAGTAAGTTGTTTGAGATGTATATATACACTATGGGGGAGCGAAGATATGCCTTTGAAATGGCAAAGTTGTTGGACCCCAAGAAGGAGTACTTCAGTTCCAAAGTTATTTCTCGGGATGATGGCACTCAAAAGCATCAGAAAGGTCTTGATGTGGTGCTGGGTAAGGAAAGTGCTGTTCTGATCCTCGATGATACTGAAAATGCATGGACAAAACATAAAGAAAACTTAATATTGATGGAGAGATATCACTTTTTTGCTTCAAGTTGTCGTCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAATGATGAGAGCGAAACTGATGGGGCACTGACAACCATCCTGAAAGTTCTGAAACAAGTCCATCATATATTCTTTAATTCTCGGGTGATTTGGTTGATAGAGATGTGA

Protein sequence

MSSSSPRRHSRAFPPILRRTLPLPSVFYGILDLEDVSLLSSKFFADVVRWFWIKPRIGGFDPRLLKNKKKIIACYSPSLFTFYSLVRVMSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQVHHIFFNSRVIWLIEM
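
The protein sequence above should be the translation of the CDS under the standard genetic code. A minimal sketch of that check follows, using only the first 36 and last 12 nucleotides of the CDS listed above. The `translate` helper is hypothetical, and the sketch assumes the CDS is in frame from its first base.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt and last 12 nt of the CDS shown above.
print(translate("ATGAGTTCCAGCAGTCCTCGCCGACATAGTCGAGCA"))  # start of the protein
print(translate("ATAGAGATGTGA"))                          # end of the protein, up to the TGA stop
```

The two fragments translate to MSSSSPRRHSRA and IEM, which agree with the start and end of the protein sequence listed above.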
Homology
BLAST of IVF0022293 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 3.2e-98
Identity = 208/336 (61.90%), Positives = 245/336 (72.92%), Query Frame = 0

Query: 89  MSLATNSPAH-SSSSDDFAAFLAVDLDSHSSDSS-PDEETEGDNNAESERIKRRKVEKLE 148
           MS+A++SP H SSSSDD AAFL  +LDS S  SS P EE E +++ ES  +KR+K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 149 NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL----- 208
                         E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIHKE+     
Sbjct: 61  --------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNED 120

Query: 209 ------------LQR-KKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLED--VTKG 268
                       LQR +KL LVLDLDHTLLN+T LR L  EEEYL+S T SL+D     G
Sbjct: 121 EISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVSG 180

Query: 269 GSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSK 328
           GSLFLL  +  MTKLRPFVHSFLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +
Sbjct: 181 GSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDR 240

Query: 329 VISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGF 388
           VISRDDGT +H+K LDVVLG+ESAVLILDDTENAW KHK+NLI++ERYHFF+SSCRQF  
Sbjct: 241 VISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDH 300

Query: 389 NCKSLSELKNDESETDGALTTILKVLKQVHHIFFNS 403
             KSLSELK+DESE DGAL T+LKVLKQ H +FF +
Sbjct: 301 RYKSLSELKSDESEPDGALATVLKVLKQAHALFFEN 320
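
The percentages in each HSP header are the match count divided by the alignment length, which includes gap columns. A minimal sketch reproducing the figures reported for the Q00IB6 hit above follows; the helper name `percent` is hypothetical.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage string with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"

# Counts taken from the HSP header of the Q00IB6 (CPL4) hit.
print(percent(208, 336))  # identical residues over alignment length
print(percent(245, 336))  # conservative (positive-scoring) substitutions included
```

The same calculation applies to every HSP header on this page.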

BLAST of IVF0022293 vs. ExPASy Swiss-Prot
Match: F4JCB2 (RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana OX=3702 GN=CPL5 PE=1 SV=2)

HSP 1 Score: 163.7 bits (413), Expect = 4.4e-39
Identity = 109/293 (37.20%), Positives = 155/293 (52.90%), Query Frame = 0

Query: 129 DNNAESERIKRRKVEKLENSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDE 188
           +N +   + KRRK+E   N          +S   LS    C H      +CI C   + +
Sbjct: 299 ENFSSEPKAKRRKIEPTIN----------ESSSSLSSSSSCGHWYICHGICIGCKSTVKK 358

Query: 189 ESGVTFGYIHKEL-------------------LQRKKLILVLDLDHTLLNSTELRYLTVE 248
             G  F YI   L                   L  KKL LVLDLDHTLL++  +  L+  
Sbjct: 359 SQGRAFDYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHLVLDLDHTLLHTVMVPSLSQA 418

Query: 249 EEYLRSQTDSL--EDVTKGGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERR 308
           E+YL  +  S   +D+ K  ++   + +  +TKLRPF+  FLKEA++ F MY+YT G R 
Sbjct: 419 EKYLIEEAGSATRDDLWKIKAVG--DPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 478

Query: 309 YAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKEN 368
           YA ++ +L+DPKK YF  +VI++ +    H K LD VL +E  V+I+DDT N W  HK N
Sbjct: 479 YAKQVLELIDPKKLYFGDRVITKTE--SPHMKTLDFVLAEERGVVIVDDTRNVWPDHKSN 538

Query: 369 LILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQVHHIFF 401
           L+ + +Y +F    R  G +    SE K DESE++G L  +LK+LK+VH  FF
Sbjct: 539 LVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRFF 573

BLAST of IVF0022293 vs. ExPASy Swiss-Prot
Match: Q8LL04 (RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana OX=3702 GN=CPL3 PE=1 SV=2)

HSP 1 Score: 151.8 bits (382), Expect = 1.7e-35
Identity = 88/210 (41.90%), Positives = 127/210 (60.48%), Query Frame = 0

Query: 200  ELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLFLLNSVHTMT 259
            ++   +KL LVLD+DHTLLNS +   +    E +  + +  +       LF    +   T
Sbjct: 921  KMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHMGMWT 980

Query: 260  KLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR-DDGTQ--- 319
            KLRP + +FL++ASKL+E+++YTMG + YA EMAKLLDPK   F+ +VIS+ DDG     
Sbjct: 981  KLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISKGDDGDPLDG 1040

Query: 320  ----KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSL 379
                   K L+ V+G ES+V+I+DD+   W +HK NLI +ERY +F  S RQFG    SL
Sbjct: 1041 DERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLLGPSL 1100

Query: 380  SELKNDESETDGALTTILKVLKQVHHIFFN 402
             EL  DE   +G L + L V++++H  FF+
Sbjct: 1101 LELDRDEVPEEGTLASSLAVIEKIHQNFFS 1130

BLAST of IVF0022293 vs. ExPASy Swiss-Prot
Match: Q8SV03 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Encephalitozoon cuniculi (strain GB-M1) OX=284813 GN=FCP1 PE=1 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 2.7e-20
Identity = 67/209 (32.06%), Positives = 106/209 (50.72%), Query Frame = 0

Query: 169 CSHPGSFGNMCIICGQRLDEESGVTFGY---------------IHKELLQ----RKKLIL 228
           C+HP   G +C +CG  + EES +                   IHKE ++    + KLIL
Sbjct: 4   CNHPIRLGTLCGVCGMEIQEESHLFCALYNTDNVKITHEEAVAIHKEKMEALEMQMKLIL 63

Query: 229 VLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLFLLNSVHTMTKLRPFVHSFL 288
           VLDLD T+L++T               T SLE   K    F+++      KLRP +   L
Sbjct: 64  VLDLDQTVLHTT-------------YGTSSLEGTVK----FVIDRCRYCVKLRPNLDYML 123

Query: 289 KEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKES 348
           +  SKL+E+++YTMG R YA  + +++DP  +YF  ++I+RD+      K L  +   + 
Sbjct: 124 RRISKLYEIHVYTMGTRAYAERIVEIIDPSGKYFDDRIITRDENQGVLVKRLSRLFPHDH 183

Query: 349 A-VLILDDTENAWTKHKENLILMERYHFF 358
             ++ILDD  + W  + ENL+L+  + +F
Sbjct: 184 RNIVILDDRPDVW-DYCENLVLIRPFWYF 194

BLAST of IVF0022293 vs. ExPASy Swiss-Prot
Match: Q95QG8 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis elegans OX=6239 GN=fcp-1 PE=1 SV=2)

HSP 1 Score: 92.4 bits (228), Expect = 1.2e-17
Identity = 75/279 (26.88%), Positives = 134/279 (48.03%), Query Frame = 0

Query: 161 EVLSKQQLCSHPGSFGNMCIICGQRLDEESG----------VTFGYIH------------ 220
           +V++    C+H     +MC  CG+ L E+ G               IH            
Sbjct: 68  QVIATVSECTHAIVIKDMCATCGKDLREKGGRAGQRKEQSTANVSMIHHVPELIVSDTLA 127

Query: 221 --------KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF 280
                     L+  +KL+L++DLD T++++++ + +TV+       T++ +D+TK    +
Sbjct: 128 KEIGSADENNLITNRKLVLLVDLDQTIIHTSD-KPMTVD-------TENHKDITK----Y 187

Query: 281 LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR 340
            L+S    TKLRP    FL + S ++EM+I T G+R+YA  +A++LDP    F  +++SR
Sbjct: 188 NLHSRVYTTKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSR 247

Query: 341 DD--GTQKHQKGLDVVLG-KESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG-- 400
           D+    Q     L  +    ++ V+I+DD  + W  + E LI ++ Y FF    ++ G  
Sbjct: 248 DELFSAQHKTNNLKALFPCGDNLVVIIDDRSDVW-MYSEALIQIKPYRFF----KEVGDI 307

BLAST of IVF0022293 vs. ExPASy TrEMBL
Match: A0A0A0KW61 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis sativus OX=3659 GN=Csa_5G650420 PE=4 SV=1)

HSP 1 Score: 696.4 bits (1796), Expect = 7.0e-197
Identity = 376/427 (88.06%), Positives = 384/427 (89.93%), Query Frame = 0

Query: 1   MSSSSPRRHSRAFPPILRRTLPLPSVF-----YGILDLEDVSLLSSK--FFADVVRWFWI 60
           MSS+ PRRHSRAFPPIL RTLPLPS F     + I DLEDVSLLSS+  FFADVVRWFWI
Sbjct: 1   MSSTCPRRHSRAFPPILHRTLPLPSFFLRDSWFWIQDLEDVSLLSSRLEFFADVVRWFWI 60

Query: 61  KPRIGGFDPRLLKNKKKIIACYSPSLFTFYSLVRVMSLATNSPAHSSSSDDFAAFLAVDL 120
           KPRIGGFDPRLLKNKK     +S +L    SLVRVMSLATNSPAHSSSSDDFAAFLAVDL
Sbjct: 61  KPRIGGFDPRLLKNKKINCLLFSSTL----SLVRVMSLATNSPAHSSSSDDFAAFLAVDL 120

Query: 121 DSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS-EDIVHEVEEQSLEVLSKQQLCSHP 180
           DSHSSDSSPDEETEGDNNAES RIKRRKVEKLENS EDI+HEVEEQSLEVLSKQQLCSHP
Sbjct: 121 DSHSSDSSPDEETEGDNNAESVRIKRRKVEKLENSEEDIMHEVEEQSLEVLSKQQLCSHP 180

Query: 181 GSFGNMCIICGQRLDEESGVTFGYIH------------------KELLQRKKLILVLDLD 240
           GSFGNMCIICGQRLDEESGVTFGYIH                  KELLQRKKLILVLDLD
Sbjct: 181 GSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLD 240

Query: 241 HTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLFLLNSVHTMTKLRPFVHSFLKEASK 300
           HTLLNSTELRYLTVEEEYLRSQTDSL+DVTK GSLFLLNSVHTMTKLRPFVHSFLKEASK
Sbjct: 241 HTLLNSTELRYLTVEEEYLRSQTDSLDDVTK-GSLFLLNSVHTMTKLRPFVHSFLKEASK 300

Query: 301 LFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLIL 360
           LFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLIL
Sbjct: 301 LFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLIL 360

Query: 361 DDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQ 402
           DDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQ
Sbjct: 361 DDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQ 420

BLAST of IVF0022293 vs. ExPASy TrEMBL
Match: A0A5A7T8Q9 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G00590 PE=4 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 1.2e-167
Identity = 314/333 (94.29%), Positives = 315/333 (94.59%), Query Frame = 0

Query: 87  RVMSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLE 146
           R+MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLE
Sbjct: 22  RMMSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLE 81

Query: 147 NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH-------- 206
           NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH        
Sbjct: 82  NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNND 141

Query: 207 ----------KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGS 266
                     KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGS
Sbjct: 142 EINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGS 201

Query: 267 LFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVI 326
           LFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVI
Sbjct: 202 LFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVI 261

Query: 327 SRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNC 386
           SRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNC
Sbjct: 262 SRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNC 321

Query: 387 KSLSELKNDESETDGALTTILKVLKQVHHIFFN 402
           KSLSELKNDESETDGALTTILKVLKQVHHIFFN
Sbjct: 322 KSLSELKNDESETDGALTTILKVLKQVHHIFFN 354

BLAST of IVF0022293 vs. ExPASy TrEMBL
Match: A0A1S3CAQ0 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis melo OX=3656 GN=LOC103498688 PE=4 SV=1)

HSP 1 Score: 597.0 bits (1538), Expect = 5.8e-167
Identity = 313/331 (94.56%), Positives = 313/331 (94.56%), Query Frame = 0

Query: 89  MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 148
           MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 60

Query: 149 EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH---------- 208
           EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH          
Sbjct: 61  EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEI 120

Query: 209 --------KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF 268
                   KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF
Sbjct: 121 NRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF 180

Query: 269 LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR 328
           LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR 240

Query: 329 DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS 388
           DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS 300

Query: 389 LSELKNDESETDGALTTILKVLKQVHHIFFN 402
           LSELKNDESETDGALTTILKVLKQVHHIFFN
Sbjct: 301 LSELKNDESETDGALTTILKVLKQVHHIFFN 331

BLAST of IVF0022293 vs. ExPASy TrEMBL
Match: A0A5D3BMG0 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G00580 PE=4 SV=1)

HSP 1 Score: 597.0 bits (1538), Expect = 5.8e-167
Identity = 313/331 (94.56%), Positives = 313/331 (94.56%), Query Frame = 0

Query: 89  MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 148
           MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 60

Query: 149 EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH---------- 208
           EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH          
Sbjct: 61  EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEI 120

Query: 209 --------KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF 268
                   KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF
Sbjct: 121 NRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF 180

Query: 269 LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR 328
           LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR 240

Query: 329 DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS 388
           DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS 300

Query: 389 LSELKNDESETDGALTTILKVLKQVHHIFFN 402
           LSELKNDESETDGALTTILKVLKQVHHIFFN
Sbjct: 301 LSELKNDESETDGALTTILKVLKQVHHIFFN 331

BLAST of IVF0022293 vs. ExPASy TrEMBL
Match: A0A6J1BUF9 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 522.7 bits (1345), Expect = 1.4e-144
Identity = 280/343 (81.63%), Positives = 300/343 (87.46%), Query Frame = 0

Query: 89  MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 148
           MSL TNSPAHSSSSDDFAAFL V LDSHSSDSSP+E+ EGDNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 149 ----EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH------ 208
               EDI + VEEQS EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIH      
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHKGLRLN 120

Query: 209 ------------KELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKG 268
                       K LLQ KKLILVLDLDHTLLNST+L ++T EEEYLRSQTDSLEDVTK 
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTK- 180

Query: 269 GSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSK 328
           GSLFLLNSVHTMTKLRPFVH+FLKEAS+LFEMYIYTMGER YAFEMAKLLDPK+EYFS+K
Sbjct: 181 GSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAK 240

Query: 329 VISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGF 388
           VISRDDGTQKH+KGLDVVLG+ESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG+
Sbjct: 241 VISRDDGTQKHKKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGY 300

Query: 389 NCKSLSELKNDESETDGALTTILKVLKQVHHIFFNSRVIWLIE 410
           NCKSLSELK+DESETDGAL TILKVLKQVH IFFN  +  L++
Sbjct: 301 NCKSLSELKSDESETDGALATILKVLKQVHTIFFNELLDDLVD 342

BLAST of IVF0022293 vs. NCBI nr
Match: KAA0039268.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 598 bits (1541), Expect = 8.48e-211
Identity = 314/334 (94.01%), Positives = 315/334 (94.31%), Query Frame = 0

Query: 87  RVMSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLE 146
           R+MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLE
Sbjct: 22  RMMSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLE 81

Query: 147 NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL----- 206
           NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL     
Sbjct: 82  NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNND 141

Query: 207 -------------LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGS 266
                        LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGS
Sbjct: 142 EINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGS 201

Query: 267 LFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVI 326
           LFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVI
Sbjct: 202 LFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVI 261

Query: 327 SRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNC 386
           SRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNC
Sbjct: 262 SRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNC 321

Query: 387 KSLSELKNDESETDGALTTILKVLKQVHHIFFNS 402
           KSLSELKNDESETDGALTTILKVLKQVHHIFFN 
Sbjct: 322 KSLSELKNDESETDGALTTILKVLKQVHHIFFNE 355

BLAST of IVF0022293 vs. NCBI nr
Match: XP_008459611.1 (PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis melo] >XP_008459612.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis melo] >XP_008459613.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis melo] >XP_016902439.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis melo] >TYK00454.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 595 bits (1535), Expect = 3.04e-210
Identity = 313/332 (94.28%), Positives = 313/332 (94.28%), Query Frame = 0

Query: 89  MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 148
           MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 60

Query: 149 EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL------- 208
           EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL       
Sbjct: 61  EDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEI 120

Query: 209 -----------LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF 268
                      LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF
Sbjct: 121 NRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLF 180

Query: 269 LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR 328
           LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR 240

Query: 329 DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS 388
           DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKS 300

Query: 389 LSELKNDESETDGALTTILKVLKQVHHIFFNS 402
           LSELKNDESETDGALTTILKVLKQVHHIFFN 
Sbjct: 301 LSELKNDESETDGALTTILKVLKQVHHIFFNE 332

BLAST of IVF0022293 vs. NCBI nr
Match: XP_011656096.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis sativus] >KAE8649005.1 hypothetical protein Csa_007911 [Cucumis sativus])

HSP 1 Score: 578 bits (1489), Expect = 3.00e-203
Identity = 308/333 (92.49%), Positives = 311/333 (93.39%), Query Frame = 0

Query: 89  MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 148
           MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAES RIKRRKVEKLENS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESVRIKRRKVEKLENS 60

Query: 149 E-DIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL------ 208
           E DI+HEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL      
Sbjct: 61  EEDIMHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDE 120

Query: 209 ------------LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSL 268
                       LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSL+DVTKG SL
Sbjct: 121 INRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLDDVTKG-SL 180

Query: 269 FLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVIS 328
           FLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVIS
Sbjct: 181 FLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVIS 240

Query: 329 RDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCK 388
           RDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCK
Sbjct: 241 RDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCK 300

Query: 389 SLSELKNDESETDGALTTILKVLKQVHHIFFNS 402
           SLSELKNDESETDGALTTILKVLKQVHH+FFN 
Sbjct: 301 SLSELKNDESETDGALTTILKVLKQVHHMFFNE 332
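
The percentages in an HSP header such as the one above are the match counts divided by the alignment length. The following is a minimal Python sketch that recomputes them from a header line in this format. The helper names `parse_hsp_stats` and `recompute_percent` are hypothetical and not part of any BLAST tool, and the regular expression accepts any label before the `=` sign.

```python
import re

# Header line copied from the XP_011656096.1 hit; the label spelling does not
# matter because the pattern captures whatever word precedes " = ".
HSP_STATS = "Identity = 308/333 (92.49%), Positives = 311/333 (93.39%), Query Frame = 0"


def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in pattern.findall(line)
    }


def recompute_percent(matches, length):
    """Percentage of matching columns over the alignment length, 2 decimals."""
    return round(100.0 * matches / length, 2)


stats = parse_hsp_stats(HSP_STATS)
for label, (matches, length, reported) in stats.items():
    # Print the label, the recomputed value, and the value reported by BLAST.
    print(label, recompute_percent(matches, length), reported)
```

"Query Frame = 0" is skipped because it has no `matches/length` fraction, so only the two ratio fields are returned.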

BLAST of IVF0022293 vs. NCBI nr
Match: XP_022133135.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X2 [Momordica charantia])

HSP 1 Score: 521 bits (1343), Expect = 1.08e-182
Identity = 279/336 (83.04%), Positives = 296/336 (88.10%), Query Frame = 0

Query: 89  MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 148
           MSL TNSPAHSSSSDDFAAFL V LDSHSSDSSP+E+ EGDNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 149 E----DIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK----- 208
           E    DI + VEEQS EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIHK     
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHKGLRLN 120

Query: 209 -------------ELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKG 268
                         LLQ KKLILVLDLDHTLLNST+L ++T EEEYLRSQTDSLEDVTKG
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKG 180

Query: 269 GSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSK 328
            SLFLLNSVHTMTKLRPFVH+FLKEAS+LFEMYIYTMGER YAFEMAKLLDPK+EYFS+K
Sbjct: 181 -SLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAK 240

Query: 329 VISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGF 388
           VISRDDGTQKH+KGLDVVLG+ESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG+
Sbjct: 241 VISRDDGTQKHKKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGY 300

Query: 389 NCKSLSELKNDESETDGALTTILKVLKQVHHIFFNS 402
           NCKSLSELK+DESETDGAL TILKVLKQVH IFFN 
Sbjct: 301 NCKSLSELKSDESETDGALATILKVLKQVHTIFFNE 335

BLAST of IVF0022293 vs. NCBI nr
Match: XP_038890381.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida] >XP_038890382.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida])

HSP 1 Score: 522 bits (1345), Expect = 1.91e-181
Identity = 280/333 (84.08%), Positives = 294/333 (88.29%), Query Frame = 0

Query: 89  MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDSSPDEETEGDNNAESERIKRRKVEKLENS 148
           MSLATNSPAHSSSSDDFAAFL V LDSHSSDSSP E+ EGDNNAESERIKRRKVEKLENS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEGDNNAESERIKRRKVEKLENS 60

Query: 149 E-DIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK-------- 208
           E DI++ VEEQS E +SKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK        
Sbjct: 61  EEDILYGVEEQSSEAISKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 209 ----------ELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSL 268
                      LL  KKLILVLDLDHTLLNST+L +LT EEEYLRSQTDSL+DVTKG SL
Sbjct: 121 INRLRNIDMKSLLLHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLDDVTKG-SL 180

Query: 269 FLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVIS 328
           FLLNSVHTMTKLRPFVHSFLKEA++LFEMYIYTMGER YAFEMAKLLDPKKEYF+ KVIS
Sbjct: 181 FLLNSVHTMTKLRPFVHSFLKEANQLFEMYIYTMGERAYAFEMAKLLDPKKEYFNGKVIS 240

Query: 329 RDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCK 388
           RDDGTQKHQKGLDVVLG+ESAVLILDDTENAW KHK+NLILMERYHFFASSC QFGFNCK
Sbjct: 241 RDDGTQKHQKGLDVVLGQESAVLILDDTENAWPKHKKNLILMERYHFFASSCHQFGFNCK 300

Query: 389 SLSELKNDESETDGALTTILKVLKQVHHIFFNS 402
           SLSELK+DESETDGAL TILKVLKQVH +FFN 
Sbjct: 301 SLSELKSDESETDGALATILKVLKQVHSVFFNE 332

BLAST of IVF0022293 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4)

HSP 1 Score: 360.1 bits (923), Expect = 2.3e-99
Identity = 208/336 (61.90%), Positives = 245/336 (72.92%), Query Frame = 0

Query: 89  MSLATNSPAH-SSSSDDFAAFLAVDLDSHSSDSS-PDEETEGDNNAESERIKRRKVEKLE 148
           MS+A++SP H SSSSDD AAFL  +LDS S  SS P EE E +++ ES  +KR+K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 149 NSEDIVHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL----- 208
                         E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIHKE+     
Sbjct: 61  --------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNED 120

Query: 209 ------------LQR-KKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLED--VTKG 268
                       LQR +KL LVLDLDHTLLN+T LR L  EEEYL+S T SL+D     G
Sbjct: 121 EISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVSG 180

Query: 269 GSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSK 328
           GSLFLL  +  MTKLRPFVHSFLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +
Sbjct: 181 GSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDR 240

Query: 329 VISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGF 388
           VISRDDGT +H+K LDVVLG+ESAVLILDDTENAW KHK+NLI++ERYHFF+SSCRQF  
Sbjct: 241 VISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDH 300

Query: 389 NCKSLSELKNDESETDGALTTILKVLKQVHHIFFNS 403
             KSLSELK+DESE DGAL T+LKVLKQ H +FF +
Sbjct: 301 RYKSLSELKSDESEPDGALATVLKVLKQAHALFFEN 320

BLAST of IVF0022293 vs. TAIR 10
Match: AT5G54210.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 175.6 bits (444), Expect = 8.0e-44
Identity = 104/257 (40.47%), Positives = 144/257 (56.03%), Query Frame = 0

Query: 169 CSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL-------------------LQRKKLIL 228
           C H      +C  C   ++   G +F Y+   L                      KKL L
Sbjct: 32  CDHFFVRYGICCNCRSNVERHRGRSFDYLVDGLQLSDIAVTVTKRVTTQITCFNDKKLHL 91

Query: 229 VLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTK--GGSLFLLNSVHTMTKLRPFVHS 288
           VLDLDHTLL++  +  LT EE YL  + DS ED+ +  GG      S   + KLRPFVH 
Sbjct: 92  VLDLDHTLLHTVMISNLTKEETYLIEEEDSREDLRRLNGG-----YSSEFLIKLRPFVHE 151

Query: 289 FLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGK 348
           FLKEA+K+F MY+YTMG+R YA  +  L+DP+K YF  +VI+R++    + K LD+VL  
Sbjct: 152 FLKEANKMFSMYVYTMGDRDYAMNVLNLIDPEKVYFGDRVITRNE--SPYIKTLDLVLAD 211

Query: 349 ESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTT 405
           E  V+I+DDT + W  HK NL+ + +Y++F+   R      KS +E K DES  DG+L  
Sbjct: 212 ECGVVIVDDTPHVWPDHKRNLLEITKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLAN 271

BLAST of IVF0022293 vs. TAIR 10
Match: AT3G17550.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 174.5 bits (441), Expect = 1.8e-43
Identity = 107/265 (40.38%), Positives = 151/265 (56.98%), Query Frame = 0

Query: 155 VEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL------------- 214
           + E S  + S +  C H      +CI C   +++  G  F Y+ + L             
Sbjct: 15  INESSSSLSSSRSSCGHWYVRYGVCIACKSTVNKRHGRAFDYLVQGLQLSHEAAAFTKRF 74

Query: 215 ------LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLFLLNSV 274
                 L  KKL LVLDLDHTLL+S  +  L+  E+ L  +  S    T    L+ L+S 
Sbjct: 75  TTQFYCLNEKKLNLVLDLDHTLLHSIRVSLLSETEKCLIEEACS----TTREDLWKLDSD 134

Query: 275 HTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQ 334
           + +TKLRPFVH FLKEA++LF MY+YTMG R YA  + KL+DPK+ YF  +VI+RD+   
Sbjct: 135 Y-LTKLRPFVHEFLKEANELFTMYVYTMGTRVYAESLLKLIDPKRIYFGDRVITRDE--S 194

Query: 335 KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELK 394
            + K LD+VL +E  V+I+DDT + WT HK NL+ +  YHFF  +  +      S +E K
Sbjct: 195 PYVKTLDLVLAEERGVVIVDDTSDVWTHHKSNLVEINEYHFFRVNGPE---ESNSYTEEK 254

Query: 395 NDESETDGALTTILKVLKQVHHIFF 401
            DES+ +G L  +LK+LK+VH+ FF
Sbjct: 255 RDESKNNGGLANVLKLLKEVHYGFF 269

BLAST of IVF0022293 vs. TAIR 10
Match: AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 166.8 bits (421), Expect = 3.7e-41
Identity = 103/269 (38.29%), Positives = 149/269 (55.39%), Query Frame = 0

Query: 162 VLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKEL-------------------L 221
           V +    C H   F  +CI C  ++ +     F YI K L                   L
Sbjct: 3   VTTSSSCCGHWYVFQGICIGCKSKVHKSQFRKFDYIFKGLQLSNEAVALTKSLTTKHSCL 62

Query: 222 QRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDS--LEDVTKGGSLFLLNSVHTMTK 281
             KKL LVLDLDHTLL+S  +  L+  E YL  +  S   ED+ K   +   + +  + K
Sbjct: 63  NEKKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASSRTREDLWKFRPIG--HPIDRLIK 122

Query: 282 LRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKG 341
           LRPFV  FLKEA+++F M++YTMG R YA  + +++DPKK YF ++VI++D+  +   K 
Sbjct: 123 LRPFVRDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDESPR--MKT 182

Query: 342 LDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESE 401
           L++VL +E  V+I+DDT + W  HK NLI + +Y +F    R+ G +  S SE K DE E
Sbjct: 183 LNLVLAEERGVVIVDDTRDIWPHHKNNLIQIRKYKYF----RRSGLDSNSYSEKKTDEGE 242

Query: 402 TDGALTTILKVLKQVHHIFFNSRVIWLIE 410
            DG L  +LK+L++VH  FF   V  ++E
Sbjct: 243 NDGGLANVLKLLREVHRRFFIVEVEEVLE 263

BLAST of IVF0022293 vs. TAIR 10
Match: AT1G20320.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 165.2 bits (417), Expect = 1.1e-40
Identity = 106/271 (39.11%), Positives = 145/271 (53.51%), Query Frame = 0

Query: 155 VEEQSLEVLSKQQ--LCSHPGSFGNMCIICGQRLDEESGVTFGYI-------HKEL---- 214
           +E   LE  SK    +C H      +C  C   +D + G  F Y+       HK +    
Sbjct: 4   IENSCLEPESKTATLICGHFFVRYGICCNCRSTVDRDYGRAFDYLVHGLQLSHKAVAVTK 63

Query: 215 --------LQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLEDVTKGGSLFLLN 274
                   L  +KL LVLDLDHTLL+S  +  L+  E+YL  ++D  ED      L+ L+
Sbjct: 64  SLTTQLACLNERKLHLVLDLDHTLLHSIMISRLSEGEKYLLGESDFRED------LWTLD 123

Query: 275 SVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDG 334
               + KLRPFVH FLKEA+++F MY+YTMG R YA  + K +DPKK YF  +VI+RD+ 
Sbjct: 124 R-EMLIKLRPFVHEFLKEANEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGDRVITRDE- 183

Query: 335 TQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSE 394
                K LD+VL  E  V+I+DDT + W  H+ NL+ + +Y +F           KS +E
Sbjct: 184 -SGFSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYFRDYSHD--KESKSYAE 243

Query: 395 LKNDESETDGALTTILKVLKQVHHIFFNSRV 405
            K DES   G+L  +LKVLK VH  FF   +
Sbjct: 244 EKRDESRNQGSLANVLKVLKDVHQEFFRGGI 263

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
Q00IB6          3.2e-98    61.90         RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O...
F4JCB2          4.4e-39    37.20         RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana O...
Q8LL04          1.7e-35    41.90         RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana O...
Q8SV03          2.7e-20    32.06         RNA polymerase II subunit A C-terminal domain phosphatase OS=Encephalitozoon cun...
Q95QG8          1.2e-17    26.88         RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis eleg...

Match Name      E-value    Identity (%)  Description
A0A0A0KW61      7.0e-197   88.06         RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis sativus OX=3659 ...
A0A5A7T8Q9      1.2e-167   94.29         RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis melo var. makuwa...
A0A1S3CAQ0      5.8e-167   94.56         RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis melo OX=3656 GN=...
A0A5D3BMG0      5.8e-167   94.56         RNA polymerase II C-terminal domain phosphatase-like OS=Cucumis melo var. makuwa...
A0A6J1BUF9      1.4e-144   81.63         RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3...

Match Name      E-value    Identity (%)  Description
KAA0039268.1    8.48e-211  94.01         RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis melo ...
XP_008459611.1  3.04e-210  94.28         PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cu...
XP_011656096.1  3.00e-203  92.49         RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucumis sativ...
XP_022133135.1  1.08e-182  83.04         RNA polymerase II C-terminal domain phosphatase-like 4 isoform X2 [Momordica cha...
XP_038890381.1  1.91e-181  84.08         RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa his...

Match Name      E-value    Identity (%)  Description
AT5G58003.1     2.3e-99    61.90         C-terminal domain phosphatase-like 4
AT5G54210.1     8.0e-44    40.47         Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT3G17550.1     1.8e-43    40.38         Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT2G04930.1     3.7e-41    38.29         Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT1G20320.1     1.1e-40    39.11         Haloacid dehalogenase-like hydrolase (HAD) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                            Source       Source Term     Source Description                 Alignment
None       No IPR available                           COILS        Coil            Coil                               coord: 138..158
None       No IPR available                           MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 116..140
None       No IPR available                           PANTHER      PTHR23081:SF28  BNAC03G12630D PROTEIN              coord: 90..401
None       No IPR available                           CDD          cd07521         HAD_FCP1-like                      coord: 206..351; e-value: 3.59691E-34; score: 121.934
IPR004274  FCP1 homology domain                       SMART        SM00577         forpap2                            coord: 205..361; e-value: 3.9E-54; score: 195.8
IPR004274  FCP1 homology domain                       PFAM         PF03031         NIF                                coord: 208..356; e-value: 5.9E-27; score: 94.3
IPR004274  FCP1 homology domain                       PROSITE      PS50969         FCP1                               coord: 202..374; score: 30.657206
IPR011947  FCP1-like phosphatase, phosphatase domain  TIGRFAM      TIGR02250       TIGR02250                          coord: 201..357; e-value: 8.4E-54; score: 179.9
IPR023214  HAD superfamily                            GENE3D       3.40.50.1000    -                                  coord: 197..407; e-value: 1.3E-49; score: 170.8
IPR039189  CTD phosphatase Fcp1                       PANTHER      PTHR23081       RNA POLYMERASE II CTD PHOSPHATASE  coord: 90..401
IPR036412  HAD-like superfamily                       SUPERFAMILY  56784           HAD-like                           coord: 199..361
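
The three hits assigned to IPR004274 (SMART, PFAM, PROSITE) report slightly different coordinates for the same FCP1 homology domain. The sketch below, in Python, shows one way to summarise them as a merged span and a shared core. The coordinates are copied from the InterPro table above; the function names `merge_intervals` and `shared_core` are hypothetical helpers, and treating the coordinates as inclusive residue positions is an assumption.

```python
# (source, start, end) for the hits under IPR004274, copied from the table.
# Coordinates are assumed to be inclusive residue positions.
FCP1_HITS = [
    ("SMART", 205, 361),
    ("PFAM", 208, 356),
    ("PROSITE", 202, 374),
]


def merge_intervals(hits):
    """Merge overlapping or adjacent inclusive intervals into spans."""
    merged = []
    for start, end in sorted((s, e) for _, s, e in hits):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous span: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


def shared_core(hits):
    """Region covered by every hit, or None if the hits do not all overlap."""
    start = max(s for _, s, _ in hits)
    end = min(e for _, _, e in hits)
    return (start, end) if start <= end else None


print(merge_intervals(FCP1_HITS))  # [(202, 374)]
print(shared_core(FCP1_HITS))      # (208, 356)
```

The merged span is the widest region any of the three methods calls, while the shared core is the stretch that all three agree on.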

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
IVF0022293.1  IVF0022293.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0070940      dephosphorylation of RNA polymerase II C-terminal domain
cellular_component  GO:0005634      nucleus
molecular_function  GO:0008420      RNA polymerase II CTD heptapeptide repeat phosphatase activity
molecular_function  GO:0004721      phosphoprotein phosphatase activity