Sgr029339 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029339
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Protein-serine/threonine phosphatase
Location: tig00153293: 1354562 .. 1361097 (-)
RNA-Seq Expression: Sgr029339
Synteny: Sgr029339
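
The Location field above packs a contig name, a coordinate range, and a strand into one string. As a minimal sketch, it could be split into those parts as follows. The function name `parse_location` and the `Location` type are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153293: 1354562 .. 1361097 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00153293: 1354562 .. 1361097 (-)")
print(loc.contig, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 6536 bases on the minus strand of contig tig00153293.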
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATAAATCAGTGGTTTATCAAGGGGATGAGTTACTGGGAGAGGTAGAGATTTACCCAGAAGAAAAGAACGGCAACAAGAACATCGAAGTGAAGGAAATCAGAATAACTCACTTCTCGCAACCGAGTGAGAGGTGCCCACCACTTGCTGTGCTTCATACCATTGCAGCCTCAGGAATTTGCTTCAAAATGGAGTCAAAGACCTCGCAGTCACAAGACACGCCGCTCTATCTTTTGCACTCCTCGTGTATCATGGAGAATAAGGTATAAAAACACAACTCAAAAATAAGAAAGGAGACCACAAGACAAAGTAAGGAAAAAGATTAAAACTAACTAGGAAAAGAAAATCTGTATTCAAAAGCAAACTTTGTTCATCTTACAAGGACTAGAAACATTCCAATCATGTTGAGAAAATTTTGTCACTTTAATTGATAATAATGCTGCAATTTTGCCTCACGTTTTGTGAAATTTGTATGATATCTGCAACTTATCCCCGATTGTCTTTCTACATATCTTGGTTAATCCTATGGAAGAAAATGGCCGTGCTTATGACGGCAAAAGCTAATTCCTAGACTGTAGACTCATAAGAAGTTCATTGCTTCCTTCTTTCTTCCTATTTTCTTGCTTTATATGTCTGTTTGGTGATACTATTTTGAACATTCTGCTGCTGATAGTTTCAATAAAGCAATAATTTCAATAACCTGAAAATTGGTGTAGATGTTAACATGTTGCATGCGAGTACTTGGTTGGCCCCTGGAGTTGATTGTTCAAACTTCTAATCTACTTATGTTTTTGTTGTTTTATCTAGTTGGAAAATTTTCTGCTTTCTTATTGCATATCATTTTAGTTTTCCACATGTGGTTACGTATTCTTCCTCAGCCTGTAGTCCTTGATATGGATGTGTGCAAAAACTGATGACTGGGATTTGCTTCAAACAGACTGCTGTAATGATGTTAGGAGCGGAGGAGCTCCATTTGGTAGCCATGTATTCTAGAGATCATGACAAGCAGTATCCATGTTTCTGGGGTTTCAATGTTACAATGGGACTTTACAATTCTTGTCTTGTCATGTTGAATCTTAGATGTCTTGGCATTGTATTTGATCTTGATGAGACACTTGTGGTTGCAAATACAATGCGCTCATTTGAGGATAGAATTGAAGCCCTACAGCGGAAAATAAGCAGTGAGGTGGATCCACAGCGTGCTACTGGCATGTTGGCAGAGGTTAAGCGTTATCAAGACGACAAGTTCATTCTGAAGCAATATGCTGAAAATGACCAGATTATTGAGAATGGAAAAGTGATTAAAAGTCAATCTGAGGTTGTTCCTGCACTGTCTGACAATCATCAACCTGTTGTTAGACCACTTATACGATTGCATGAAAAGATTCTGACTCGTATCAACCCCCAGGTAACTAGGTTTAACCCAACCTTTGCTTGCAATTATCTATGGGGATTGTGTAGGGACCACCATACAAGACTGTTTGCTTGTTTTTCACTTTGACTTTTGATATATACATACATATTGCTCTTTTTTCTTGTATCCTTTTGTGGAATTTTTTTCTCTTTATTTTATCAGTCTCCTAATTTGTCCACGGATCTTTGACTGTAATGCTTGAAGGTATTTTCATACTAATGTCATTTGCTCGGGACAAGATTAGGTTGGTTTAAGTTGTGCATGAAATGATATTTCAGCTTTTATATTTGTTTAAGAATGCTATCACTTAGTCTCATCTGTCGCACAATGAAAGTTACAAACTATTTTATGAAATCATGGAATGATATTCTATTAACTCATCTACTACTGTAATTCCTTAGATTCGTGATACAAGTGTTCTCGTGAAGTTGAGACCTGCATGGGAAGATCTCCGGAGTTACTTGACTGCAAGAGGTCGCAAGCGCTTTGAGGTCTATGTGTGTACATGGCTGAAAGGGATTATGCTTTGGAGATGTGGAGGCTTCTTGATCCAGATGCAAATTTGATAAATCCTAAGGAATTGTT
GGATCGCATTGTTTGTGTCAAGTCCGGTTAGTATTCCAAAGAAGAAAGAGATCCCTTCCCAAACCAGTAGCACTTTTCATACACATCTTGACAAGGGACGTGTATTGCATGGCAGTTTGTTGATGAGAGAAATAATTCCAAAACTGCTCATGATTGGTCATACCTCTTTTAGTTTTTCAGCCATTTTGAAGATTTCAAGTTCATTTTTTGTTTTTTTGGTTTTGGTCCTGTCCTTAGGTTCTAGGAAGTCTTTGTTCAACGTCTTCCAAGATGGGTTTTGCCACCCCAAGATGGCTCTGGTAATTGATGATCGTTTGAAAGTGTGGGATGAAAAGGATCAACCTCGAGTGCATGTTGTTCCTGCATTTGCTCCTTACTATGCCCCTCTTGCTGAAGTATATTCTTATTTGCACTTTGGCATTTCAGTTTGAAAGATTTCTTAAAAAATCAGGCAATACCTGTTTTATCCCGTTTATCTCATGTTGCTGTTTTTTTCTTTTTTGTAGGTAAGTAATACTGTCCCTGTTTTATGTGTAGCAAGAAACGTTGCCTGCAATGTCAGAGGTGGTTTTTTCAAGTAAGATTCAGAAGACTGCTAAAGTTCCTTACAAGAGGATATGCATTCGCCCATGCTTATTCTTAATACTTGCAAATTAAGGTTTTAGCTCTGGAAGATGGTCCGAGTTGATGCATGTTCTCAGTATTCTTTTGTTAATCACGACCTTCCACTGATTTCAGGGACTTTGATGAGGTCTTCTGCAAAAAATTTCTGATATTTCATATGAGGATGATGCCAATGATATTCCCTCTCCTGATGTGAGCAACTATCTCGCTTCAGAGGTTAGTCGGGATTTCTTCTCAAATTCTTTTCCCCTAAAATAATCAAATTTAGTTTGAAAATGAGGTCATGTCATTACTTTTTCTTCCTTTTCTTCGTTTTGGGGGGGTGTTCTTCTCTTGCTTCTTGACTTGTTTCCCTTCTTTCTGTTTCAGGACGATTATTCTGTTTCTAATGGAAACAAAGACATGCTTACTTTTGATGGCATGTCAGACATGGAAGTTGAAAGAAGAATGAAGGTACTGACAAAGATTGGTCATTGGTTGAGCTCATAAGTAAACTTTCATTAGCGTATACCAGAGACTAATTAGGTCAAGGAAAGTGAATGGAAATATCTTGCATAACATTTTTTAGGAGGAATGAGTTGGTAATTAAATTTTCAAGTTGCCCTGATACATCTGCATCCAACAATAACTAGATCCAATTTAAGATCTCTCTCTCTCTGTCTCTCCATATATATATATATATATATAGATCGATAGATAGATATAGATACACAAACTGACACACATACATATATGAAAAGGATTGCTCCTCCATCTTCCCCCAGGATGCATTTCTGGCTTCTTCTACTATCACGAGTGCAGATCACGAGTGTCTTCTCTTCAATATACAATGGCTTCTGCTCTGGCTCAGTTCCACTTCCACCAAAGCAGGTGTCACTGCCATATTTTCCAAATATGCAGCTTCCCCATGTCAACTCAGTGGTTCATGTAGCCCCCACTGAACCAAGTTTGCAAAGTTCTCCTGCTAGAGAGGAGGGTGAGGTACCAGAGTCAGAATTAGATCCGGATACAAGGCGGAGGCTCCTTATATTACAACATGGGCAAGATACAAGAGAGCGTCCATCAAGTGAACCTGCATTTTCAGTGAGGCCTCCTCCATTACAGCAGGTTACTGGTCCTCGTGCACAATCGCGTGGAAGTTGGTCTCCAATGGAAGAAGAAATGAGCCCACGGCAACTAAGTCGGACTGCACGCAAAGAGTTCCCTGTTGATGCAGAACCAATGCGGGAGAAGTATAGGTCTCATCATCCTTCATTTTACCCCAAGATTGAGAGTTCCATTCCATCTGATAGAACTCCTCATGAAAACCAGAGATTGTCAAAAGAGGTAATTAGATTATGATTTGGCTTATTTTCTTTCCATGAAGTTCTCCCTCT
TTTGCTCCCCCCTTTTTCAGGGTGGCAGGATTCAGGAAAGCTACTGTCTGATTTTTTGGGGCTTCTTTTCTTTTCTTCTTTTTTTAATATGTGCAGGCTTTTTATAGAGATGATCGTGCGAGAGTAAGTCGTAGGCCATCTAGTTATCCTGCTTTCTCAGGTGAGCCATGTTTGACATGCTTGTGTGATTTTTCGAATCTTATGGTTTCAAAATTTTTAGTTTTTTTTTCTCTTCCTCTTTTTTTTTTCCTCATGCTAAATGTTCAGGGATCATCATTAGTTTGCCTTTCTGTAACTACTTCTTGGGGCCCTCTCTTTTATTTTCTCTTTTATAATCTTTCTTTTCGTTTTTAGCTAGCATCTTGTTAATAATAATCATGTAGTTATTTATTATATTTCATTGGATTATGCTTCATTCTGTAATGCTTTTTTAAAGGTGATGAGGTTCCGTGAATCAATCATCTTCAAGAAACGGGATATTGATGTTGAATCTGGACGCTCCATCTGGAGTGAAACTCCTGTTGGAGCACTACAGGAAATTGCAATGAAGTTTGGCACCAAGGTAATCAGATATAATTTTTACCATGACAGATTATTGAGGTTTTTAAGTTATGTATGCATTGCACCCTTCTCTCAAGTCCTTAAGAGTTGGAAAAACTTTTAGGTGGAATTTAAGCCGGCGTTAGTTTCCAGCGCAGAACTAGAGTTTTCCGTGGAGGTAACTCATGACTTCCACCTTCATGTTTTCTTTGGTTTTTAAATCTTTGATCAGAATGTAGAATTTTTTTTTCTTGGAATTTGTTTTGGACAGTTCAATGCCATAAGTTATCTACATTACTTGAATAAGAGTTTTCTGCTTTTCTCTACAAATGTTATGCGTATTTTAAACGTTGCCTTCTAGAGGAGCAACTGGATTTTAACAGAATAATGTTTGCAAAACTTGTTACTAGATGAGTGAACTAAAGGATTGTGCGGTTAGTGGAGTTCGTGAACCTATTATTCTGTTTGTGCTTAGATAGGGAAAACGAGCAATTAGTTTCCACAAATTAGTTTATTGCTGCATAAAATAAGTCTAAGGCGTTTCAAAGTGTGGAAATATATAGAGAATTAGAAGAGAATAGAATAGGAAAGAGAAATAGATTGGGGAGAAGATAGGCCAAAGCTGGAAAAGTAAATAGAGGGAGTGAATGTTGTCTGGTTTGGCAACAAGTGGGAAGATACGGAAATTTTAGGATTTTTATTAACTCAAGTCTCATTATCGATGGCCTTTTCTAGTACAATAGCGCTTATTTATACCAGAGTAAAATCAGGAAACCAATTATTAAATAAAATTATATCTGACAAAATACAGGATGCGATATCAAATTCCTATACTACTCTTGATGAATTTTTAAGTTCTTGTATTTTTGTTCTCAGGTTTTTTATATCACATTCTTTTAATTTTAAGATTTGCTTGTCTCCACTTGTTTTTGATTATAGAGTATGTGAACCTGTTCCTTGCTTTTGTAGGCATGGTTTGTAGGGGAGAAAATTGGAGAAGGAATTGGTAAGACAAGAAGGGAAGCTCAGCGACTTGCTGCTGAAGGTTCTATAAAGAATTTGGCTAGTATGGAGTCTTCTCTCATTTTTACCATTATATGTTTAGAACATTTGTGTTTTCCTATATTGTTTATGTTATTATACCTTCTTTTTTGTTGTTTGCTGAAATCTAGCATGAAGATATTGTTTCTTCAAGATAATCTGCTGTCGTTTGGAAAATTAAAATTATTATTATTATCTAGAATATCCTATTTATAATGATTTAAAAACATGATAACTCGCTGCAATATGTCCTGAGTTATGAACTATTTGGCTCAATGTAGACAACTTAGCTTTATCTTTGATCAATTGAGTAAATTCATTGCTATATTAATTTCTAGAAAAGGTTTTTCTTAGAATTGTCTTCTGTATTAGGGTTAGGCATGTGATGTATAGTTTAAAACTGCAAGCGTGACCTTAGA
TCTAAATTGCCAAGTATGAAATGGCTGGTTATATGATGCAATGATTAATTGCTTGAACTTGCAAATTTTTCTCTGTTTGGAGATAAATTTAAGGAGCATGAATATGAAGATGACCTCCTAAAGTGAACCTGGCTGACATATAATATGTTTTGCAGACATTTACGTATCGCGTTGTAAGGCTGACTCTACATCTGCAAATGATATAAATAAGTTTCCTAGCGATAATGGATCGGGAAAACGACTGAAGCTAGACTTTCCACGGACACCTTCCTCTGCTAAATAAATATTCTACAAATGCTTCTGTTCCATGAATGACCTCCATATTTGTTGACCCTCTTGTAAAGCATGATTCAGCATTTGAGGTTAAGGCCACCAAGACTCAGCATTTTGGACTAGTCCTTGGCCTTCTGCTATTTTCCTTGATCGACAAGAGGCAAATTGCAAGATTATCTTATGCCTTCATTCAATTTCTTCCGATATCAGCAGCGCATTCTCCACGTGACATGACCGTGTATCCCTGTGGAAGTTCATACTGA

mRNA sequence

ATGTATAAATCAGTGGTTTATCAAGGGGATGAGTTACTGGGAGAGGTAGAGATTTACCCAGAAGAAAAGAACGGCAACAAGAACATCGAAGTGAAGGAAATCAGAATAACTCACTTCTCGCAACCGAGTGAGAGGTGCCCACCACTTGCTGTGCTTCATACCATTGCAGCCTCAGGAATTTGCTTCAAAATGGAGTCAAAGACCTCGCAGTCACAAGACACGCCGCTCTATCTTTTGCACTCCTCGTGTATCATGGAGAATAAGACTGCTGTAATGATGTTAGGAGCGGAGGAGCTCCATTTGGTAGCCATGTATTCTAGAGATCATGACAAGCAGTATCCATGTTTCTGGGGTTTCAATGTTACAATGGGACTTTACAATTCTTGTCTTGTCATGTTGAATCTTAGATGTCTTGGCATTGTATTTGATCTTGATGAGACACTTGTGGTTGCAAATACAATGCGCTCATTTGAGGATAGAATTGAAGCCCTACAGCGGAAAATAAGCAGTGAGGTGGATCCACAGCGTGCTACTGGCATGTTGGCAGAGGTTAAGCGTTATCAAGACGACAAGTTCATTCTGAAGCAATATGCTGAAAATGACCAGATTATTGAGAATGGAAAAGTGATTAAAAGTCAATCTGAGGTTGTTCCTGCACTGTCTGACAATCATCAACCTGTTGTTAGACCACTTATACGATTGCATGAAAAGATTCTGACTCGTATCAACCCCCAGATTCGTGATACAAGTGTTCTCGTGAAGTTGAGACCTGCATGGGAAGATCTCCGGAGTTACTTGACTGCAAGAGGTCGCAAGCGCTTTGAGATGTGGAGGCTTCTTGATCCAGATGCAAATTTGATAAATCCTAAGGAATTGTTGGATCGCATTGTTTGTGTCAAGTCCGGTTCTAGGAAGTCTTTGTTCAACGTCTTCCAAGATGGGTTTTGCCACCCCAAGATGGCTCTGGTAATTGATGATCGTTTGAAAGTGTGGGATGAAAAGGATCAACCTCGAGTGCATGTTGTTCCTGCATTTGCTCCTTACTATGCCCCTCTTGCTGAAGTAAGTAATACTGTCCCTGTTTTATGTGTAGCAAGAAACGTTGCCTGCAATGTCAGAGGTCTTCTGCAAAAAATTTCTGATATTTCATATGAGGATGATGCCAATGATATTCCCTCTCCTGATGTGAGCAACTATCTCGCTTCAGAGGACGATTATTCTGTTTCTAATGGAAACAAAGACATGCTTACTTTTGATGGCATGTCAGACATGGAAGTTGAAAGAAGAATGAAGATCACGAGTGTCTTCTCTTCAATATACAATGGCTTCTGCTCTGGCTCAGTTCCACTTCCACCAAAGCAGGTGTCACTGCCATATTTTCCAAATATGCAGCTTCCCCATGTCAACTCAGTGGTTCATGTAGCCCCCACTGAACCAAGTTTGCAAAGTTCTCCTGCTAGAGAGGAGGGTGAGGTACCAGAGTCAGAATTAGATCCGGATACAAGGCGGAGGCTCCTTATATTACAACATGGGCAAGATACAAGAGAGCGTCCATCAAGTGAACCTGCATTTTCAGTGAGGCCTCCTCCATTACAGCAGGTTACTGGTCCTCGTGCACAATCGCGTGGAAGTTGGTCTCCAATGGAAGAAGAAATGAGCCCACGGCAACTAAGTCGGACTGCACGCAAAGAGTTCCCTGTTGATGCAGAACCAATGCGGGAGAAGTATAGGTCTCATCATCCTTCATTTTACCCCAAGATTGAGAGTTCCATTCCATCTGATAGAACTCCTCATGAAAACCAGAGATTGTCAAAAGAGGTGATGAGGTTCCGTGAATCAATCATCTTCAAGAAACGGGATATTGATGTTGAATCTGGACGCTCCATCTGGAGTGAAACTCCTGTTGGAGCACTACAGGAAATTGCAATGAAGTTTGGCACCAAGGTGGAATTTAAGCCGGCGTTAGTTTCCAGCGCAGAACTAGAGTTTTCCGTGGAGGC
ATGGTTTGTAGGGGAGAAAATTGGAGAAGGAATTGGTAAGACAAGAAGGGAAGCTCAGCGACTTGCTGCTGAAGCATTTGAGGTTAAGGCCACCAAGACTCAGCATTTTGGACTAGTCCTTGGCCTTCTGCTATTTTCCTTGATCGACAAGAGGCAAATTGCAAGATTATCTTATGCCTTCATTCAATTTCTTCCGATATCAGCAGCGCATTCTCCACGTGACATGACCGTGTATCCCTGTGGAAGTTCATACTGA

Coding sequence (CDS)

ATGTATAAATCAGTGGTTTATCAAGGGGATGAGTTACTGGGAGAGGTAGAGATTTACCCAGAAGAAAAGAACGGCAACAAGAACATCGAAGTGAAGGAAATCAGAATAACTCACTTCTCGCAACCGAGTGAGAGGTGCCCACCACTTGCTGTGCTTCATACCATTGCAGCCTCAGGAATTTGCTTCAAAATGGAGTCAAAGACCTCGCAGTCACAAGACACGCCGCTCTATCTTTTGCACTCCTCGTGTATCATGGAGAATAAGACTGCTGTAATGATGTTAGGAGCGGAGGAGCTCCATTTGGTAGCCATGTATTCTAGAGATCATGACAAGCAGTATCCATGTTTCTGGGGTTTCAATGTTACAATGGGACTTTACAATTCTTGTCTTGTCATGTTGAATCTTAGATGTCTTGGCATTGTATTTGATCTTGATGAGACACTTGTGGTTGCAAATACAATGCGCTCATTTGAGGATAGAATTGAAGCCCTACAGCGGAAAATAAGCAGTGAGGTGGATCCACAGCGTGCTACTGGCATGTTGGCAGAGGTTAAGCGTTATCAAGACGACAAGTTCATTCTGAAGCAATATGCTGAAAATGACCAGATTATTGAGAATGGAAAAGTGATTAAAAGTCAATCTGAGGTTGTTCCTGCACTGTCTGACAATCATCAACCTGTTGTTAGACCACTTATACGATTGCATGAAAAGATTCTGACTCGTATCAACCCCCAGATTCGTGATACAAGTGTTCTCGTGAAGTTGAGACCTGCATGGGAAGATCTCCGGAGTTACTTGACTGCAAGAGGTCGCAAGCGCTTTGAGATGTGGAGGCTTCTTGATCCAGATGCAAATTTGATAAATCCTAAGGAATTGTTGGATCGCATTGTTTGTGTCAAGTCCGGTTCTAGGAAGTCTTTGTTCAACGTCTTCCAAGATGGGTTTTGCCACCCCAAGATGGCTCTGGTAATTGATGATCGTTTGAAAGTGTGGGATGAAAAGGATCAACCTCGAGTGCATGTTGTTCCTGCATTTGCTCCTTACTATGCCCCTCTTGCTGAAGTAAGTAATACTGTCCCTGTTTTATGTGTAGCAAGAAACGTTGCCTGCAATGTCAGAGGTCTTCTGCAAAAAATTTCTGATATTTCATATGAGGATGATGCCAATGATATTCCCTCTCCTGATGTGAGCAACTATCTCGCTTCAGAGGACGATTATTCTGTTTCTAATGGAAACAAAGACATGCTTACTTTTGATGGCATGTCAGACATGGAAGTTGAAAGAAGAATGAAGATCACGAGTGTCTTCTCTTCAATATACAATGGCTTCTGCTCTGGCTCAGTTCCACTTCCACCAAAGCAGGTGTCACTGCCATATTTTCCAAATATGCAGCTTCCCCATGTCAACTCAGTGGTTCATGTAGCCCCCACTGAACCAAGTTTGCAAAGTTCTCCTGCTAGAGAGGAGGGTGAGGTACCAGAGTCAGAATTAGATCCGGATACAAGGCGGAGGCTCCTTATATTACAACATGGGCAAGATACAAGAGAGCGTCCATCAAGTGAACCTGCATTTTCAGTGAGGCCTCCTCCATTACAGCAGGTTACTGGTCCTCGTGCACAATCGCGTGGAAGTTGGTCTCCAATGGAAGAAGAAATGAGCCCACGGCAACTAAGTCGGACTGCACGCAAAGAGTTCCCTGTTGATGCAGAACCAATGCGGGAGAAGTATAGGTCTCATCATCCTTCATTTTACCCCAAGATTGAGAGTTCCATTCCATCTGATAGAACTCCTCATGAAAACCAGAGATTGTCAAAAGAGGTGATGAGGTTCCGTGAATCAATCATCTTCAAGAAACGGGATATTGATGTTGAATCTGGACGCTCCATCTGGAGTGAAACTCCTGTTGGAGCACTACAGGAAATTGCAATGAAGTTTGGCACCAAGGTGGAATTTAAGCCGGCGTTAGTTTCCAGCGCAGAACTAGAGTTTTCCGTGGAGGC
ATGGTTTGTAGGGGAGAAAATTGGAGAAGGAATTGGTAAGACAAGAAGGGAAGCTCAGCGACTTGCTGCTGAAGCATTTGAGGTTAAGGCCACCAAGACTCAGCATTTTGGACTAGTCCTTGGCCTTCTGCTATTTTCCTTGATCGACAAGAGGCAAATTGCAAGATTATCTTATGCCTTCATTCAATTTCTTCCGATATCAGCAGCGCATTCTCCACGTGACATGACCGTGTATCCCTGTGGAAGTTCATACTGA

Protein sequence

MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGICFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFNVTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGMLAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEKILTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEMWRLLDPDANLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPLAEVSNTVPVLCVARNVACNVRGLLQKISDISYEDDANDIPSPDVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYNGFCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTARKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVMRFRESIIFKKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSAELEFSVEAWFVGEKIGEGIGKTRREAQRLAAEAFEVKATKTQHFGLVLGLLLFSLIDKRQIARLSYAFIQFLPISAAHSPRDMTVYPCGSSY
Homology
BLAST of Sgr029339 vs. NCBI nr
Match: XP_022142219.1 (LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 1 [Momordica charantia])

HSP 1 Score: 1194.1 bits (3088), Expect = 0.0e+00
Identity = 622/752 (82.71%), Positives = 651/752 (86.57%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVY+GDE+LGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI
Sbjct: 5   MYKSVVYRGDEVLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 64

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQSQDTPLYLLHSSC+MENKTAVM LG EELHLVAMYSRDH+KQYPCFWGFN
Sbjct: 65  CFKMESKTSQSQDTPLYLLHSSCVMENKTAVMALGVEELHLVAMYSRDHEKQYPCFWGFN 124

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           VT GLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVD QRA GM
Sbjct: 125 VTRGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDSQRAAGM 184

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
           LAEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVP LSDNHQP+VRP++RLHEK  I
Sbjct: 185 LAEVKRYQDDKIILKQYAENDQIIENGKVIKSQSEVVPPLSDNHQPIVRPILRLHEKNII 244

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 245 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 304

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 305 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 364

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYED ANDIPS P
Sbjct: 365 FAPYYAPYAEGNNAVPVLCVARNVACNVRGGFFKEFDEVLLQKISDISYEDGANDIPSPP 424

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYN------------- 480
           DVSNYL SED+YS+SNGN+DMLTFD MSDMEVERR+K   + SS                
Sbjct: 425 DVSNYLVSEDEYSISNGNRDMLTFDSMSDMEVERRLKDAFLASSTITTTDPRVSSLQYTM 484

Query: 481 GFCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              S SVPLPPKQ S+PYFPNMQ PHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT
Sbjct: 485 ASASSSVPLPPKQGSMPYFPNMQPPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 544

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQDTRER  SEP+F VRPPPLQQV GPRAQSRGSWSPMEEEMSPRQLSRTA
Sbjct: 545 RRRLLILQHGQDTRERLPSEPSFPVRPPPLQQVAGPRAQSRGSWSPMEEEMSPRQLSRTA 604

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKE-------VMRFRESI 660
           RKEFPVDAEPMREK+RS+HP+++PKIE+SIP DR PHENQRLSKE       V   R S 
Sbjct: 605 RKEFPVDAEPMREKHRSNHPTYFPKIETSIPPDRIPHENQRLSKEAFYRDDRVRASRRSS 664

Query: 661 IF---------------KKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
            +               + RDI++ESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSS 
Sbjct: 665 SYPAFSGEEIPMNQSSSRSRDIEIESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSST 724

BLAST of Sgr029339 vs. NCBI nr
Match: KAG7026584.1 (RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1166.4 bits (3016), Expect = 0.0e+00
Identity = 614/752 (81.65%), Positives = 644/752 (85.64%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQ QDTPLY LHSSCIMENKTA+MM GAEELHLVAMYSRDHDKQYPCFWGF 
Sbjct: 61  CFKMESKTSQLQDTPLYQLHSSCIMENKTAIMMFGAEELHLVAMYSRDHDKQYPCFWGFI 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V +GLYNSCL+MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSE+D QRA+GM
Sbjct: 121 VAIGLYNSCLIMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSELDQQRASGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
            AEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVPALS NHQP+VRP++RL+EK  I
Sbjct: 181 QAEVKRYQDDKLILKQYAENDQIIENGKVIKSQSEVVPALSGNHQPIVRPILRLYEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYEDDANDIPS P
Sbjct: 361 FAPYYAPNAEGNNAVPVLCVARNVACNVRGGFFKEFDEILLQKISDISYEDDANDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYNG------------ 480
           DVSNYL SED+YSVSNGNKDMLTFDGM DM+VERR+K   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSVSNGNKDMLTFDGMPDMDVERRLKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 -FCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              SG+VP PPKQVSLPYFP+MQLPHVNS   VAPTEPS+Q SPAREEGEVPESELDPDT
Sbjct: 481 PSASGAVPPPPKQVSLPYFPDMQLPHVNS---VAPTEPSIQCSPAREEGEVPESELDPDT 540

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQD RERP SEPAFSVRPPPLQQV GPR QSRGSWSPMEEEMSPRQL+R A
Sbjct: 541 RRRLLILQHGQDIRERPPSEPAFSVRPPPLQQVAGPRGQSRGSWSPMEEEMSPRQLNRPA 600

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFR------ 660
           RKEFPVDAEPMREK+RS+H SF+PKI+ SIP DR PHENQRLSKE      R R      
Sbjct: 601 RKEFPVDAEPMREKHRSNHSSFFPKIDGSIPPDRIPHENQRLSKEAFYRDDRARLSRRPS 660

Query: 661 -------ESIIF-----KKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
                  E ++      + RD D+ESG SIWSETPVGALQEIAMKFGTKVEFK ALVSSA
Sbjct: 661 NYPAFSGEEVLMNQSSSRSRDNDIESGCSIWSETPVGALQEIAMKFGTKVEFKSALVSSA 720

BLAST of Sgr029339 vs. NCBI nr
Match: KAG6594614.1 (RNA polymerase II C-terminal domain phosphatase-like 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1164.1 bits (3010), Expect = 0.0e+00
Identity = 613/752 (81.52%), Positives = 643/752 (85.51%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRI+HFS PSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRISHFSPPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQ QDTPLY LHSSCIMENKTA+MM GAEELHLVAMYSRDHDKQYPCFWGF 
Sbjct: 61  CFKMESKTSQLQDTPLYQLHSSCIMENKTAIMMFGAEELHLVAMYSRDHDKQYPCFWGFI 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V +GLYNSCL+MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSE+D QRA+GM
Sbjct: 121 VAIGLYNSCLIMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSELDQQRASGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
            AEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVPALS NHQP+VRP++RL+EK  I
Sbjct: 181 QAEVKRYQDDKLILKQYAENDQIIENGKVIKSQSEVVPALSGNHQPIVRPILRLYEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYEDDANDIPS P
Sbjct: 361 FAPYYAPNAEGNNAVPVLCVARNVACNVRGGFFKEFDEILLQKISDISYEDDANDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYNG------------ 480
           DVSNYL SED+YSVSNGNKDMLTFDGM DM+VERR+K   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSVSNGNKDMLTFDGMPDMDVERRLKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 -FCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              SG+VP PPKQVSLPYFP+MQLPHVNS   VAPTEPS+Q SPAREEGEVPESELDPDT
Sbjct: 481 PSASGAVPPPPKQVSLPYFPDMQLPHVNS---VAPTEPSIQCSPAREEGEVPESELDPDT 540

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQD RERP SEPAFSVRPPPLQQV GPR QSRGSWSPMEEEMSPRQL+R A
Sbjct: 541 RRRLLILQHGQDIRERPPSEPAFSVRPPPLQQVAGPRGQSRGSWSPMEEEMSPRQLNRPA 600

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFR------ 660
           RKEFPVDAEPMREK+RS+H SF+PKI+ SIP DR PHENQRLSKE      R R      
Sbjct: 601 RKEFPVDAEPMREKHRSNHSSFFPKIDGSIPPDRIPHENQRLSKEAFYRDDRARLSRRPS 660

Query: 661 -------ESIIF-----KKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
                  E ++      + RD D+ESG SIWSETPVGALQEIAMKFGTKVEFK ALVSSA
Sbjct: 661 NYPAFSGEEVLMNQSSSRSRDNDIESGCSIWSETPVGALQEIAMKFGTKVEFKSALVSSA 720

BLAST of Sgr029339 vs. NCBI nr
Match: XP_022926826.1 (RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita moschata] >XP_022926827.1 RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita moschata])

HSP 1 Score: 1163.7 bits (3009), Expect = 0.0e+00
Identity = 613/752 (81.52%), Positives = 643/752 (85.51%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVYQGDELLGEVEIYPEEKNGNKNI VKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIGVKEIRISHFSQPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQ QDTPLY LHSSCIMENKTA+MM GAEELHLVAMYSRDHDKQYPCFWGF 
Sbjct: 61  CFKMESKTSQLQDTPLYQLHSSCIMENKTAIMMFGAEELHLVAMYSRDHDKQYPCFWGFI 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V +GLYNSCL+MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSE+D QRA+GM
Sbjct: 121 VAIGLYNSCLIMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSELDQQRASGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
            AEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVPALS NHQP+VRP++RL+EK  I
Sbjct: 181 QAEVKRYQDDKLILKQYAENDQIIENGKVIKSQSEVVPALSGNHQPIVRPILRLYEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYEDDANDIPS P
Sbjct: 361 FAPYYAPNAEGNNAVPVLCVARNVACNVRGGFFKEFDEILLQKISDISYEDDANDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYNG------------ 480
           DVSNYL SED+YSVSNGNKDMLTFDGM DM+VERR+K   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSVSNGNKDMLTFDGMPDMDVERRLKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 -FCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              SG+VP PPKQVSLPYFP+MQLPHVNS   VAPTEPS+Q SPAREEGEVPESELDPDT
Sbjct: 481 PSASGAVPPPPKQVSLPYFPDMQLPHVNS---VAPTEPSIQCSPAREEGEVPESELDPDT 540

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQD RERP SEPAFSVRPPPLQQV GPR QSRGSWSPMEEEMSPRQL+R A
Sbjct: 541 RRRLLILQHGQDIRERPPSEPAFSVRPPPLQQVAGPRGQSRGSWSPMEEEMSPRQLNRPA 600

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFR------ 660
           RKEFPVDAEPMREK+RS+H SF+PKI+ SIP DR PHENQRLSKE      R R      
Sbjct: 601 RKEFPVDAEPMREKHRSNHSSFFPKIDGSIPPDRIPHENQRLSKEAFYRDDRARLSRRPS 660

Query: 661 -------ESIIF-----KKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
                  E ++      + RD D+ESG SIWSETPVGALQEIAMKFGTKVEFK ALVSSA
Sbjct: 661 NYPAFSGEEVLMNQSSSRSRDNDIESGCSIWSETPVGALQEIAMKFGTKVEFKSALVSSA 720

BLAST of Sgr029339 vs. NCBI nr
Match: XP_023003826.1 (RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita maxima] >XP_023003827.1 RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita maxima])

HSP 1 Score: 1162.1 bits (3005), Expect = 0.0e+00
Identity = 613/752 (81.52%), Positives = 641/752 (85.24%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVYQGDELLGEVEIYPEEKNG+KNIEVKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYQGDELLGEVEIYPEEKNGDKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQSQDTPLY LHSSCIMENKTA+MM GAEELHLVAMYSRDHDKQYPCFWGF 
Sbjct: 61  CFKMESKTSQSQDTPLYQLHSSCIMENKTAIMMFGAEELHLVAMYSRDHDKQYPCFWGFI 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V +GLYNSCL+MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVD QRA+GM
Sbjct: 121 VAIGLYNSCLIMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDQQRASGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
            AEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVPALS NHQP+VRP++RL+EK  I
Sbjct: 181 QAEVKRYQDDKLILKQYAENDQIIENGKVIKSQSEVVPALSGNHQPIVRPILRLYEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYEDDANDIPS P
Sbjct: 361 FAPYYAPNAEGNNAVPVLCVARNVACNVRGGFFKEFDEILLQKISDISYEDDANDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYN------------- 480
           DVSNYL SED+YSVSNGNKDMLTFDGM DM+VERR+K   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSVSNGNKDMLTFDGMPDMDVERRLKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 GFCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              SG+VP PPKQVSLPYFP+MQLPHVNS   V PTEPS+Q SPAREEGEVPESELDPDT
Sbjct: 481 ASASGAVPPPPKQVSLPYFPDMQLPHVNS---VTPTEPSIQCSPAREEGEVPESELDPDT 540

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQD RERP SEPAFSVRPPPLQQV GPR QSRGSWSPMEEEMSPRQL+R A
Sbjct: 541 RRRLLILQHGQDIRERPPSEPAFSVRPPPLQQVAGPRVQSRGSWSPMEEEMSPRQLNRPA 600

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFRES---- 660
           RKEFPVDAEPMREK+RS+H SF+PKI+ SIP DR PHENQRLSKE      R R S    
Sbjct: 601 RKEFPVDAEPMREKHRSNHSSFFPKIDGSIPPDRIPHENQRLSKEAFYRDDRARLSRRPS 660

Query: 661 --------------IIFKKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
                            + RD D+ESG SIWS TPVGALQEIAMKFGTKVEFK ALVSSA
Sbjct: 661 NYPAFSGEEVPMNQSSSRSRDNDIESGCSIWSGTPVGALQEIAMKFGTKVEFKSALVSSA 720

BLAST of Sgr029339 vs. ExPASy Swiss-Prot
Match: Q5YDB6 (RNA polymerase II C-terminal domain phosphatase-like 1 OS=Arabidopsis thaliana OX=3702 GN=CPL1 PE=1 SV=1)

HSP 1 Score: 795.0 bits (2052), Expect = 7.2e-229
Identity = 450/781 (57.62%), Positives = 537/781 (68.76%), Query Frame = 0

Query: 6   VYQGDELLGEVEIYPE-----------EKNGNKNIEVKE-----IRITHFSQPSERCPPL 65
           V+ GD  LGE+EIYP            ++   K  EV E     IRI+HFSQ  ERCPPL
Sbjct: 9   VFHGDGRLGELEIYPSRELNQQQDDVMKQRKKKQREVMELAKMGIRISHFSQSGERCPPL 68

Query: 66  AVLHTIAASGICFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDH 125
           A+L TI++ G+CFK+E+  S +Q++ L L +SSC+ +NKTAVM+LG EELHLVAMYS + 
Sbjct: 69  AILTTISSCGLCFKLEASPSPAQES-LSLFYSSCLRDNKTAVMLLGGEELHLVAMYSENI 128

Query: 126 DKQYPCFWGFNVTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKIS 185
               PCFW F+V  G+Y+SCLVMLNLRCLGIVFDLDETLVVANTMRSFED+I+  QR+I+
Sbjct: 129 KNDRPCFWAFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIDGFQRRIN 188

Query: 186 SEVDPQRATGMLAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVR 245
           +E+DPQR   ++AE+KRYQDDK +LKQY E+DQ++ENG+VIK QSE+VPALSDNHQP+VR
Sbjct: 189 NEMDPQRLAVIVAEMKRYQDDKNLLKQYIESDQVVENGEVIKVQSEIVPALSDNHQPLVR 248

Query: 246 PLIRLHEK--ILTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF------------- 305
           PLIRL EK  ILTRINP IRDTSVLV++RP+WE+LRSYLTA+GRKRF             
Sbjct: 249 PLIRLQEKNIILTRINPMIRDTSVLVRMRPSWEELRSYLTAKGRKRFEVYVCTMAERDYA 308

Query: 306 -EMWRLLDPDANLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDE 365
            EMWRLLDP+ NLIN  +LL RIVCVKSG +KSLFNVF DG CHPKMALVIDDRLKVWDE
Sbjct: 309 LEMWRLLDPEGNLINTNDLLARIVCVKSGFKKSLFNVFLDGTCHPKMALVIDDRLKVWDE 368

Query: 366 KDQPRVHVVPAFAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISY 425
           KDQPRVHVVPAFAPYY+P AE + T PVLCVARNVAC VRG         LL +I++ISY
Sbjct: 369 KDQPRVHVVPAFAPYYSPQAEAAAT-PVLCVARNVACGVRGGFFRDFDDSLLPRIAEISY 428

Query: 426 EDDANDIPS-PDVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSI---- 485
           E+DA DIPS PDVS+YL SEDD S  NGNKD L+FDGM+D EVERR+K     SS     
Sbjct: 429 ENDAEDIPSPPDVSHYLVSEDDTSGLNGNKDPLSFDGMADTEVERRLKEAISASSAVLPA 488

Query: 486 --------------YNGFCSGSVPLP---------PKQVSLPYFPNMQLPHVNSVV-HVA 545
                              S SVP+P         P  ++ P  P  Q     S+  H+ 
Sbjct: 489 ANIDPRIAAPVQFPMASASSVSVPVPVQVVQQAIQPSAMAFPSIPFQQPQQPTSIAKHLV 548

Query: 546 PTEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVT 605
           P+EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+   SEP+F  RPP   Q  
Sbjct: 549 PSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDPAPSEPSFPQRPP--VQAP 608

Query: 606 GPRAQSRGSWSPMEEEMSPRQLSRTARKEFPVDAEPMR-EKYRSHHPSFYPKIESSIPSD 665
               QSR  W P+EEEM P Q+ R   KE+P+D+E +  EK+R  HPSF+ KI++S  SD
Sbjct: 609 PSHVQSRNGWFPVEEEMDPAQIRRAVSKEYPLDSEMIHMEKHRPRHPSFFSKIDNSTQSD 668

Query: 666 RTPHENQRLSKEVMRFRESI----------------------IFKKRDIDVESGRSI-WS 693
           R  HEN+R  KE +R  E +                        +  D+D    RS+  +
Sbjct: 669 RMLHENRRPPKESLRRDEQLRSNNNLPDSHPFYGEDASWNQSSSRNSDLDFLPERSVSAT 728

BLAST of Sgr029339 vs. ExPASy Swiss-Prot
Match: Q5YDB5 (RNA polymerase II C-terminal domain phosphatase-like 2 OS=Arabidopsis thaliana OX=3702 GN=CPL2 PE=1 SV=3)

HSP 1 Score: 526.2 bits (1354), Expect = 6.2e-148
Identity = 327/728 (44.92%), Positives = 429/728 (58.93%), Query Frame = 0

Query: 2   YKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGIC 61
           +KSVVY GD  LGE+++     +        EIRI H S   ERCPPLA+L TIA+  + 
Sbjct: 6   HKSVVYHGDLRLGELDVNHVSSSHEFRFPNDEIRIHHLSPAGERCPPLAILQTIASFAVR 65

Query: 62  FKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFNV 121
            K+ES ++  +   L  LH+ C  E KTAV+MLG EE+HLVAM S+  +K++PCFW F+V
Sbjct: 66  CKLES-SAPVKSQELMHLHAVCFHELKTAVVMLGDEEIHLVAMPSK--EKKFPCFWCFSV 125

Query: 122 TMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGML 181
             GLY+SCL MLN RCL IVFDLDETL+VANTM+SFEDRIEAL+  IS E+DP R  GM 
Sbjct: 126 PSGLYDSCLRMLNTRCLSIVFDLDETLIVANTMKSFEDRIEALKSWISREMDPVRINGMS 185

Query: 182 AEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--IL 241
           AE+KRY DD+ +LKQY +ND   +NG ++K+Q E V   SD  + V RP+IRL EK  +L
Sbjct: 186 AELKRYMDDRMLLKQYIDNDYAFDNGVLLKAQPEEVRPTSDGQEKVCRPVIRLPEKNTVL 245

Query: 242 TRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDAN 301
           TRI P+IRDTSVLVKLRPAWE+LRSYLTA+ RKRF              EMWRLLDP+A+
Sbjct: 246 TRIKPEIRDTSVLVKLRPAWEELRSYLTAKTRKRFEVYVCTMAERDYALEMWRLLDPEAH 305

Query: 302 LINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 361
           LI+ KEL DRIVCVK  ++KSL +VF  G CHPKMA+VIDDR+KVW++KDQPRVHVV A+
Sbjct: 306 LISLKELRDRIVCVKPDAKKSLLSVFNGGICHPKMAMVIDDRMKVWEDKDQPRVHVVSAY 365

Query: 362 APYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDI-PSPD 421
            PYYAP AE +  VP LCVARNVACNVRG         L+  IS + YEDD  ++ PSPD
Sbjct: 366 LPYYAPQAETALVVPHLCVARNVACNVRGYFFKEFDESLMSSISLVYYEDDVENLPPSPD 425

Query: 422 VSNYLASEDDYSVSNGNKDMLTF-DGMSDMEVERRMKITSVFSSIYNGFCSGSVPLPPKQ 481
           VSNY+  ED    SNGN +     +GM   EVERR+   +            ++P     
Sbjct: 426 VSNYVVIEDPGFASNGNINAPPINEGMCGGEVERRLNQAAAAD-------HSTLPATSNA 485

Query: 482 VSLPYFPNMQLPHVNSVVHVAPT-------EPSLQSSPAREEGEVPESELDPDTRRRLLI 541
              P  P  Q+  + +    A         +PSL  +P R+     +         R L+
Sbjct: 486 EQKPETPKPQIAVIPNNASTATAAALLPSHKPSLLGAPRRDGFTFSDG-------GRPLM 545

Query: 542 LQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMS--PRQLSRTARKEF 601
           ++ G D R +  ++P    + P   Q       S G W   +E     P + S     +F
Sbjct: 546 MRPGVDIRNQNFNQPPILAKIP--MQPPSSSMHSPGGWLVDDENRPSFPGRPSGLYPSQF 605

Query: 602 PVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVMRFRESIIFKKRDIDVES 661
           P             HPS     E ++  D       R + E    +  ++   R+   + 
Sbjct: 606 PHGTPGSAPVGPFAHPSHLRSEEVAMDDDLKRQNPSRQTTEGGISQNHLVSNGREHHTDG 665

Query: 662 GRSIWSETP--VGALQEIAMKFGTKVEFKPALVSSAELEFSVEAWFVGEKIGEGIGKTRR 692
           G+S   ++   V ALQEI  + G+KVEF+  + ++ EL+FSVE  F GEKIG G+ KT++
Sbjct: 666 GKSNGGQSHLFVSALQEIGRRCGSKVEFRTVISTNKELQFSVEVLFTGEKIGIGMAKTKK 714

BLAST of Sgr029339 vs. ExPASy TrEMBL
Match: A0A6J1CKY6 (Protein-serine/threonine phosphatase OS=Momordica charantia OX=3673 GN=LOC111012387 PE=4 SV=1)

HSP 1 Score: 1194.1 bits (3088), Expect = 0.0e+00
Identity = 622/752 (82.71%), Positives = 651/752 (86.57%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVY+GDE+LGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI
Sbjct: 5   MYKSVVYRGDEVLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 64

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQSQDTPLYLLHSSC+MENKTAVM LG EELHLVAMYSRDH+KQYPCFWGFN
Sbjct: 65  CFKMESKTSQSQDTPLYLLHSSCVMENKTAVMALGVEELHLVAMYSRDHEKQYPCFWGFN 124

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           VT GLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVD QRA GM
Sbjct: 125 VTRGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDSQRAAGM 184

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
           LAEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVP LSDNHQP+VRP++RLHEK  I
Sbjct: 185 LAEVKRYQDDKIILKQYAENDQIIENGKVIKSQSEVVPPLSDNHQPIVRPILRLHEKNII 244

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 245 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 304

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 305 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 364

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYED ANDIPS P
Sbjct: 365 FAPYYAPYAEGNNAVPVLCVARNVACNVRGGFFKEFDEVLLQKISDISYEDGANDIPSPP 424

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYN------------- 480
           DVSNYL SED+YS+SNGN+DMLTFD MSDMEVERR+K   + SS                
Sbjct: 425 DVSNYLVSEDEYSISNGNRDMLTFDSMSDMEVERRLKDAFLASSTITTTDPRVSSLQYTM 484

Query: 481 GFCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              S SVPLPPKQ S+PYFPNMQ PHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT
Sbjct: 485 ASASSSVPLPPKQGSMPYFPNMQPPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 544

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQDTRER  SEP+F VRPPPLQQV GPRAQSRGSWSPMEEEMSPRQLSRTA
Sbjct: 545 RRRLLILQHGQDTRERLPSEPSFPVRPPPLQQVAGPRAQSRGSWSPMEEEMSPRQLSRTA 604

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKE-------VMRFRESI 660
           RKEFPVDAEPMREK+RS+HP+++PKIE+SIP DR PHENQRLSKE       V   R S 
Sbjct: 605 RKEFPVDAEPMREKHRSNHPTYFPKIETSIPPDRIPHENQRLSKEAFYRDDRVRASRRSS 664

Query: 661 IF---------------KKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
            +               + RDI++ESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSS 
Sbjct: 665 SYPAFSGEEIPMNQSSSRSRDIEIESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSST 724

BLAST of Sgr029339 vs. ExPASy TrEMBL
Match: A0A6J1EFY6 (Protein-serine/threonine phosphatase OS=Cucurbita moschata OX=3662 GN=LOC111433824 PE=4 SV=1)

HSP 1 Score: 1163.7 bits (3009), Expect = 0.0e+00
Identity = 613/752 (81.52%), Positives = 643/752 (85.51%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVYQGDELLGEVEIYPEEKNGNKNI VKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIGVKEIRISHFSQPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQ QDTPLY LHSSCIMENKTA+MM GAEELHLVAMYSRDHDKQYPCFWGF 
Sbjct: 61  CFKMESKTSQLQDTPLYQLHSSCIMENKTAIMMFGAEELHLVAMYSRDHDKQYPCFWGFI 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V +GLYNSCL+MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSE+D QRA+GM
Sbjct: 121 VAIGLYNSCLIMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSELDQQRASGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
            AEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVPALS NHQP+VRP++RL+EK  I
Sbjct: 181 QAEVKRYQDDKLILKQYAENDQIIENGKVIKSQSEVVPALSGNHQPIVRPILRLYEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYEDDANDIPS P
Sbjct: 361 FAPYYAPNAEGNNAVPVLCVARNVACNVRGGFFKEFDEILLQKISDISYEDDANDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYNG------------ 480
           DVSNYL SED+YSVSNGNKDMLTFDGM DM+VERR+K   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSVSNGNKDMLTFDGMPDMDVERRLKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 -FCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              SG+VP PPKQVSLPYFP+MQLPHVNS   VAPTEPS+Q SPAREEGEVPESELDPDT
Sbjct: 481 PSASGAVPPPPKQVSLPYFPDMQLPHVNS---VAPTEPSIQCSPAREEGEVPESELDPDT 540

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQD RERP SEPAFSVRPPPLQQV GPR QSRGSWSPMEEEMSPRQL+R A
Sbjct: 541 RRRLLILQHGQDIRERPPSEPAFSVRPPPLQQVAGPRGQSRGSWSPMEEEMSPRQLNRPA 600

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFR------ 660
           RKEFPVDAEPMREK+RS+H SF+PKI+ SIP DR PHENQRLSKE      R R      
Sbjct: 601 RKEFPVDAEPMREKHRSNHSSFFPKIDGSIPPDRIPHENQRLSKEAFYRDDRARLSRRPS 660

Query: 661 -------ESIIF-----KKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
                  E ++      + RD D+ESG SIWSETPVGALQEIAMKFGTKVEFK ALVSSA
Sbjct: 661 NYPAFSGEEVLMNQSSSRSRDNDIESGCSIWSETPVGALQEIAMKFGTKVEFKSALVSSA 720

BLAST of Sgr029339 vs. ExPASy TrEMBL
Match: A0A6J1KNP9 (Protein-serine/threonine phosphatase OS=Cucurbita maxima OX=3661 GN=LOC111497294 PE=4 SV=1)

HSP 1 Score: 1162.1 bits (3005), Expect = 0.0e+00
Identity = 613/752 (81.52%), Positives = 641/752 (85.24%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVYQGDELLGEVEIYPEEKNG+KNIEVKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYQGDELLGEVEIYPEEKNGDKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQSQDTPLY LHSSCIMENKTA+MM GAEELHLVAMYSRDHDKQYPCFWGF 
Sbjct: 61  CFKMESKTSQSQDTPLYQLHSSCIMENKTAIMMFGAEELHLVAMYSRDHDKQYPCFWGFI 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V +GLYNSCL+MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVD QRA+GM
Sbjct: 121 VAIGLYNSCLIMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDQQRASGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
            AEVKRYQDDK ILKQYAENDQIIENGKVIKSQSEVVPALS NHQP+VRP++RL+EK  I
Sbjct: 181 QAEVKRYQDDKLILKQYAENDQIIENGKVIKSQSEVVPALSGNHQPIVRPILRLYEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N VPVLCVARNVACNVRG         LLQKISDISYEDDANDIPS P
Sbjct: 361 FAPYYAPNAEGNNAVPVLCVARNVACNVRGGFFKEFDEILLQKISDISYEDDANDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYN------------- 480
           DVSNYL SED+YSVSNGNKDMLTFDGM DM+VERR+K   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSVSNGNKDMLTFDGMPDMDVERRLKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 GFCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              SG+VP PPKQVSLPYFP+MQLPHVNS   V PTEPS+Q SPAREEGEVPESELDPDT
Sbjct: 481 ASASGAVPPPPKQVSLPYFPDMQLPHVNS---VTPTEPSIQCSPAREEGEVPESELDPDT 540

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQD RERP SEPAFSVRPPPLQQV GPR QSRGSWSPMEEEMSPRQL+R A
Sbjct: 541 RRRLLILQHGQDIRERPPSEPAFSVRPPPLQQVAGPRVQSRGSWSPMEEEMSPRQLNRPA 600

Query: 601 RKEFPVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFRES---- 660
           RKEFPVDAEPMREK+RS+H SF+PKI+ SIP DR PHENQRLSKE      R R S    
Sbjct: 601 RKEFPVDAEPMREKHRSNHSSFFPKIDGSIPPDRIPHENQRLSKEAFYRDDRARLSRRPS 660

Query: 661 --------------IIFKKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVSSA 692
                            + RD D+ESG SIWS TPVGALQEIAMKFGTKVEFK ALVSSA
Sbjct: 661 NYPAFSGEEVPMNQSSSRSRDNDIESGCSIWSGTPVGALQEIAMKFGTKVEFKSALVSSA 720

BLAST of Sgr029339 vs. ExPASy TrEMBL
Match: A0A5A7UGR2 (Protein-serine/threonine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G003140 PE=4 SV=1)

HSP 1 Score: 1156.0 bits (2989), Expect = 0.0e+00
Identity = 616/787 (78.27%), Positives = 651/787 (82.72%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVY GDELLG+VEIYPEEKNG KNI+VKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYHGDELLGDVEIYPEEKNGYKNIDVKEIRISHFSQPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQSQDTPL LLHSSCIMENKTA+MM G EELHLVAM+SRD D+QYPCFWGFN
Sbjct: 61  CFKMESKTSQSQDTPLNLLHSSCIMENKTAIMMFGVEELHLVAMFSRDLDRQYPCFWGFN 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V MGLYNSCL MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRA GM
Sbjct: 121 VAMGLYNSCLDMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRANGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
           LAEVKRYQDDK ILKQYAENDQ+IENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK  I
Sbjct: 181 LAEVKRYQDDKIILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLV+LRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           F+PYYAP AE +N +PVLCVARNVACNVRG         LLQKISDISYED  NDIPS P
Sbjct: 361 FSPYYAPNAEGNNAIPVLCVARNVACNVRGGFFKEFDDILLQKISDISYEDGVNDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYN------------- 480
           DVSNYL SED+YS++NGNKD+ TFDGM DMEV+RRMK   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSIANGNKDIPTFDGMPDMEVDRRMKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 GFCSGSVPLPPKQVSL-PYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPD 540
              SG+VPLPPKQVS+ PYFPNM +PHVNSV HVAP EPSLQSSPAREEGEVPESELDPD
Sbjct: 481 ASASGAVPLPPKQVSMPPYFPNMPIPHVNSVAHVAPNEPSLQSSPAREEGEVPESELDPD 540

Query: 541 TRRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRT 600
           TRRRLLILQHGQDTRER SSEPAF  RPPPLQQV  PRAQSRGSWSPMEEEMSPRQLSRT
Sbjct: 541 TRRRLLILQHGQDTRERLSSEPAFPGRPPPLQQVAAPRAQSRGSWSPMEEEMSPRQLSRT 600

Query: 601 ARKEFPVDAE--PMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFRES- 660
           ARKEFPVDAE  PMREK+RS+HPSF+PK+++ I  DR PHENQRL K       R R S 
Sbjct: 601 ARKEFPVDAEPMPMREKHRSNHPSFFPKVDNPILPDRIPHENQRLPKGAFYRDDRMRVSR 660

Query: 661 -----------------IIFKKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALV 715
                               + RD D+ESGRSIWSETPVGALQEIAMKFGTKVEFKP LV
Sbjct: 661 RPSSYPAFPGEEIPMNQSSSRSRDDDIESGRSIWSETPVGALQEIAMKFGTKVEFKPGLV 720

BLAST of Sgr029339 vs. ExPASy TrEMBL
Match: A0A0A0KLF7 (Protein-serine/threonine phosphatase OS=Cucumis sativus OX=3659 GN=Csa_6G517200 PE=4 SV=1)

HSP 1 Score: 1154.0 bits (2984), Expect = 0.0e+00
Identity = 606/754 (80.37%), Positives = 637/754 (84.48%), Query Frame = 0

Query: 1   MYKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
           MYKSVVY GDELLG+VEIYPEEKNG KNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI
Sbjct: 1   MYKSVVYHGDELLGDVEIYPEEKNGYKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60

Query: 61  CFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFN 120
           CFKMESKTSQSQDTPL LLHSSCIMENKTA+MM G EELHLVAM+SRD DKQYPCFWGFN
Sbjct: 61  CFKMESKTSQSQDTPLNLLHSSCIMENKTAIMMFGVEELHLVAMFSRDLDKQYPCFWGFN 120

Query: 121 VTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGM 180
           V MGLYNSCL MLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRA GM
Sbjct: 121 VAMGLYNSCLDMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRANGM 180

Query: 181 LAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--I 240
           LAEVKRYQDDK ILKQYAENDQ+IENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK  I
Sbjct: 181 LAEVKRYQDDKIILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEKNII 240

Query: 241 LTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDA 300
           LTRINPQIRDTSVLV+LRPAWEDLRSYLTARGRKRF              EMWRLLDPD+
Sbjct: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300

Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
           NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360

Query: 361 FAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDIPS-P 420
           FAPYYAP AE +N +PVLCVARNVACNVRG         LLQKISDISYEDD NDIPS P
Sbjct: 361 FAPYYAPNAEGNNAIPVLCVARNVACNVRGGFFKEFDDILLQKISDISYEDDVNDIPSPP 420

Query: 421 DVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSIYN------------- 480
           DVSNYL SED+YS++NGNKDM TFDGM DMEV+RRMK   + SS  N             
Sbjct: 421 DVSNYLVSEDEYSIANGNKDMPTFDGMPDMEVDRRMKDAFLASSTINSADPRVSSLQYTM 480

Query: 481 GFCSGSVPLPPKQVSLPYFPNMQLPHVNSVVHVAPTEPSLQSSPAREEGEVPESELDPDT 540
              S SVPLPPKQV++PYFPNM LPHVNSV HVAP EPSLQSSPAREEGEVPESELDPDT
Sbjct: 481 ASASCSVPLPPKQVTMPYFPNMPLPHVNSVAHVAPNEPSLQSSPAREEGEVPESELDPDT 540

Query: 541 RRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMSPRQLSRTA 600
           RRRLLILQHGQDTRER SSEPAF  RPPPLQQV  PRAQSRG+WSPMEEEMSPRQL+R+A
Sbjct: 541 RRRLLILQHGQDTRERLSSEPAFPARPPPLQQVAAPRAQSRGNWSPMEEEMSPRQLNRSA 600

Query: 601 RKEFPVDAE--PMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVM----RFRES-- 660
           RK+FPVDAE  PMREK+RS+HPSF+ K+++SI  DR PH+NQRL KE      R R S  
Sbjct: 601 RKDFPVDAEPMPMREKHRSNHPSFFAKVDNSILPDRIPHDNQRLPKEAFYRDDRMRVSRR 660

Query: 661 ----------------IIFKKRDIDVESGRSIWSETPVGALQEIAMKFGTKVEFKPALVS 692
                              + RD D+ESGRSIWSETPVGALQEIAMKFGTKVEFKP LV 
Sbjct: 661 PSSYPAFSGEEIPMNQSSSRSRDDDIESGRSIWSETPVGALQEIAMKFGTKVEFKPGLVP 720

BLAST of Sgr029339 vs. TAIR 10
Match: AT4G21670.1 (C-terminal domain phosphatase-like 1 )

HSP 1 Score: 795.0 bits (2052), Expect = 5.1e-230
Identity = 450/781 (57.62%), Positives = 537/781 (68.76%), Query Frame = 0

Query: 6   VYQGDELLGEVEIYPE-----------EKNGNKNIEVKE-----IRITHFSQPSERCPPL 65
           V+ GD  LGE+EIYP            ++   K  EV E     IRI+HFSQ  ERCPPL
Sbjct: 9   VFHGDGRLGELEIYPSRELNQQQDDVMKQRKKKQREVMELAKMGIRISHFSQSGERCPPL 68

Query: 66  AVLHTIAASGICFKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDH 125
           A+L TI++ G+CFK+E+  S +Q++ L L +SSC+ +NKTAVM+LG EELHLVAMYS + 
Sbjct: 69  AILTTISSCGLCFKLEASPSPAQES-LSLFYSSCLRDNKTAVMLLGGEELHLVAMYSENI 128

Query: 126 DKQYPCFWGFNVTMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKIS 185
               PCFW F+V  G+Y+SCLVMLNLRCLGIVFDLDETLVVANTMRSFED+I+  QR+I+
Sbjct: 129 KNDRPCFWAFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIDGFQRRIN 188

Query: 186 SEVDPQRATGMLAEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVR 245
           +E+DPQR   ++AE+KRYQDDK +LKQY E+DQ++ENG+VIK QSE+VPALSDNHQP+VR
Sbjct: 189 NEMDPQRLAVIVAEMKRYQDDKNLLKQYIESDQVVENGEVIKVQSEIVPALSDNHQPLVR 248

Query: 246 PLIRLHEK--ILTRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF------------- 305
           PLIRL EK  ILTRINP IRDTSVLV++RP+WE+LRSYLTA+GRKRF             
Sbjct: 249 PLIRLQEKNIILTRINPMIRDTSVLVRMRPSWEELRSYLTAKGRKRFEVYVCTMAERDYA 308

Query: 306 -EMWRLLDPDANLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDE 365
            EMWRLLDP+ NLIN  +LL RIVCVKSG +KSLFNVF DG CHPKMALVIDDRLKVWDE
Sbjct: 309 LEMWRLLDPEGNLINTNDLLARIVCVKSGFKKSLFNVFLDGTCHPKMALVIDDRLKVWDE 368

Query: 366 KDQPRVHVVPAFAPYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISY 425
           KDQPRVHVVPAFAPYY+P AE + T PVLCVARNVAC VRG         LL +I++ISY
Sbjct: 369 KDQPRVHVVPAFAPYYSPQAEAAAT-PVLCVARNVACGVRGGFFRDFDDSLLPRIAEISY 428

Query: 426 EDDANDIPS-PDVSNYLASEDDYSVSNGNKDMLTFDGMSDMEVERRMKITSVFSSI---- 485
           E+DA DIPS PDVS+YL SEDD S  NGNKD L+FDGM+D EVERR+K     SS     
Sbjct: 429 ENDAEDIPSPPDVSHYLVSEDDTSGLNGNKDPLSFDGMADTEVERRLKEAISASSAVLPA 488

Query: 486 --------------YNGFCSGSVPLP---------PKQVSLPYFPNMQLPHVNSVV-HVA 545
                              S SVP+P         P  ++ P  P  Q     S+  H+ 
Sbjct: 489 ANIDPRIAAPVQFPMASASSVSVPVPVQVVQQAIQPSAMAFPSIPFQQPQQPTSIAKHLV 548

Query: 546 PTEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERPSSEPAFSVRPPPLQQVT 605
           P+EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+   SEP+F  RPP   Q  
Sbjct: 549 PSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDPAPSEPSFPQRPP--VQAP 608

Query: 606 GPRAQSRGSWSPMEEEMSPRQLSRTARKEFPVDAEPMR-EKYRSHHPSFYPKIESSIPSD 665
               QSR  W P+EEEM P Q+ R   KE+P+D+E +  EK+R  HPSF+ KI++S  SD
Sbjct: 609 PSHVQSRNGWFPVEEEMDPAQIRRAVSKEYPLDSEMIHMEKHRPRHPSFFSKIDNSTQSD 668

Query: 666 RTPHENQRLSKEVMRFRESI----------------------IFKKRDIDVESGRSI-WS 693
           R  HEN+R  KE +R  E +                        +  D+D    RS+  +
Sbjct: 669 RMLHENRRPPKESLRRDEQLRSNNNLPDSHPFYGEDASWNQSSSRNSDLDFLPERSVSAT 728

BLAST of Sgr029339 vs. TAIR 10
Match: AT5G01270.1 (carboxyl-terminal domain (ctd) phosphatase-like 2 )

HSP 1 Score: 526.2 bits (1354), Expect = 4.4e-149
Identity = 327/728 (44.92%), Positives = 429/728 (58.93%), Query Frame = 0

Query: 2   YKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGIC 61
           +KSVVY GD  LGE+++     +        EIRI H S   ERCPPLA+L TIA+  + 
Sbjct: 6   HKSVVYHGDLRLGELDVNHVSSSHEFRFPNDEIRIHHLSPAGERCPPLAILQTIASFAVR 65

Query: 62  FKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFNV 121
            K+ES ++  +   L  LH+ C  E KTAV+MLG EE+HLVAM S+  +K++PCFW F+V
Sbjct: 66  CKLES-SAPVKSQELMHLHAVCFHELKTAVVMLGDEEIHLVAMPSK--EKKFPCFWCFSV 125

Query: 122 TMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGML 181
             GLY+SCL MLN RCL IVFDLDETL+VANTM+SFEDRIEAL+  IS E+DP R  GM 
Sbjct: 126 PSGLYDSCLRMLNTRCLSIVFDLDETLIVANTMKSFEDRIEALKSWISREMDPVRINGMS 185

Query: 182 AEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--IL 241
           AE+KRY DD+ +LKQY +ND   +NG ++K+Q E V   SD  + V RP+IRL EK  +L
Sbjct: 186 AELKRYMDDRMLLKQYIDNDYAFDNGVLLKAQPEEVRPTSDGQEKVCRPVIRLPEKNTVL 245

Query: 242 TRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDAN 301
           TRI P+IRDTSVLVKLRPAWE+LRSYLTA+ RKRF              EMWRLLDP+A+
Sbjct: 246 TRIKPEIRDTSVLVKLRPAWEELRSYLTAKTRKRFEVYVCTMAERDYALEMWRLLDPEAH 305

Query: 302 LINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 361
           LI+ KEL DRIVCVK  ++KSL +VF  G CHPKMA+VIDDR+KVW++KDQPRVHVV A+
Sbjct: 306 LISLKELRDRIVCVKPDAKKSLLSVFNGGICHPKMAMVIDDRMKVWEDKDQPRVHVVSAY 365

Query: 362 APYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDI-PSPD 421
            PYYAP AE +  VP LCVARNVACNVRG         L+  IS + YEDD  ++ PSPD
Sbjct: 366 LPYYAPQAETALVVPHLCVARNVACNVRGYFFKEFDESLMSSISLVYYEDDVENLPPSPD 425

Query: 422 VSNYLASEDDYSVSNGNKDMLTF-DGMSDMEVERRMKITSVFSSIYNGFCSGSVPLPPKQ 481
           VSNY+  ED    SNGN +     +GM   EVERR+   +            ++P     
Sbjct: 426 VSNYVVIEDPGFASNGNINAPPINEGMCGGEVERRLNQAAAAD-------HSTLPATSNA 485

Query: 482 VSLPYFPNMQLPHVNSVVHVAPT-------EPSLQSSPAREEGEVPESELDPDTRRRLLI 541
              P  P  Q+  + +    A         +PSL  +P R+     +         R L+
Sbjct: 486 EQKPETPKPQIAVIPNNASTATAAALLPSHKPSLLGAPRRDGFTFSDG-------GRPLM 545

Query: 542 LQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMS--PRQLSRTARKEF 601
           ++ G D R +  ++P    + P   Q       S G W   +E     P + S     +F
Sbjct: 546 MRPGVDIRNQNFNQPPILAKIP--MQPPSSSMHSPGGWLVDDENRPSFPGRPSGLYPSQF 605

Query: 602 PVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVMRFRESIIFKKRDIDVES 661
           P             HPS     E ++  D       R + E    +  ++   R+   + 
Sbjct: 606 PHGTPGSAPVGPFAHPSHLRSEEVAMDDDLKRQNPSRQTTEGGISQNHLVSNGREHHTDG 665

Query: 662 GRSIWSETP--VGALQEIAMKFGTKVEFKPALVSSAELEFSVEAWFVGEKIGEGIGKTRR 692
           G+S   ++   V ALQEI  + G+KVEF+  + ++ EL+FSVE  F GEKIG G+ KT++
Sbjct: 666 GKSNGGQSHLFVSALQEIGRRCGSKVEFRTVISTNKELQFSVEVLFTGEKIGIGMAKTKK 714

BLAST of Sgr029339 vs. TAIR 10
Match: AT5G01270.2 (carboxyl-terminal domain (ctd) phosphatase-like 2 )

HSP 1 Score: 526.2 bits (1354), Expect = 4.4e-149
Identity = 327/728 (44.92%), Positives = 429/728 (58.93%), Query Frame = 0

Query: 2   YKSVVYQGDELLGEVEIYPEEKNGNKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGIC 61
           +KSVVY GD  LGE+++     +        EIRI H S   ERCPPLA+L TIA+  + 
Sbjct: 6   HKSVVYHGDLRLGELDVNHVSSSHEFRFPNDEIRIHHLSPAGERCPPLAILQTIASFAVR 65

Query: 62  FKMESKTSQSQDTPLYLLHSSCIMENKTAVMMLGAEELHLVAMYSRDHDKQYPCFWGFNV 121
            K+ES ++  +   L  LH+ C  E KTAV+MLG EE+HLVAM S+  +K++PCFW F+V
Sbjct: 66  CKLES-SAPVKSQELMHLHAVCFHELKTAVVMLGDEEIHLVAMPSK--EKKFPCFWCFSV 125

Query: 122 TMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRATGML 181
             GLY+SCL MLN RCL IVFDLDETL+VANTM+SFEDRIEAL+  IS E+DP R  GM 
Sbjct: 126 PSGLYDSCLRMLNTRCLSIVFDLDETLIVANTMKSFEDRIEALKSWISREMDPVRINGMS 185

Query: 182 AEVKRYQDDKFILKQYAENDQIIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEK--IL 241
           AE+KRY DD+ +LKQY +ND   +NG ++K+Q E V   SD  + V RP+IRL EK  +L
Sbjct: 186 AELKRYMDDRMLLKQYIDNDYAFDNGVLLKAQPEEVRPTSDGQEKVCRPVIRLPEKNTVL 245

Query: 242 TRINPQIRDTSVLVKLRPAWEDLRSYLTARGRKRF--------------EMWRLLDPDAN 301
           TRI P+IRDTSVLVKLRPAWE+LRSYLTA+ RKRF              EMWRLLDP+A+
Sbjct: 246 TRIKPEIRDTSVLVKLRPAWEELRSYLTAKTRKRFEVYVCTMAERDYALEMWRLLDPEAH 305

Query: 302 LINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 361
           LI+ KEL DRIVCVK  ++KSL +VF  G CHPKMA+VIDDR+KVW++KDQPRVHVV A+
Sbjct: 306 LISLKELRDRIVCVKPDAKKSLLSVFNGGICHPKMAMVIDDRMKVWEDKDQPRVHVVSAY 365

Query: 362 APYYAPLAEVSNTVPVLCVARNVACNVRG---------LLQKISDISYEDDANDI-PSPD 421
            PYYAP AE +  VP LCVARNVACNVRG         L+  IS + YEDD  ++ PSPD
Sbjct: 366 LPYYAPQAETALVVPHLCVARNVACNVRGYFFKEFDESLMSSISLVYYEDDVENLPPSPD 425

Query: 422 VSNYLASEDDYSVSNGNKDMLTF-DGMSDMEVERRMKITSVFSSIYNGFCSGSVPLPPKQ 481
           VSNY+  ED    SNGN +     +GM   EVERR+   +            ++P     
Sbjct: 426 VSNYVVIEDPGFASNGNINAPPINEGMCGGEVERRLNQAAAAD-------HSTLPATSNA 485

Query: 482 VSLPYFPNMQLPHVNSVVHVAPT-------EPSLQSSPAREEGEVPESELDPDTRRRLLI 541
              P  P  Q+  + +    A         +PSL  +P R+     +         R L+
Sbjct: 486 EQKPETPKPQIAVIPNNASTATAAALLPSHKPSLLGAPRRDGFTFSDG-------GRPLM 545

Query: 542 LQHGQDTRERPSSEPAFSVRPPPLQQVTGPRAQSRGSWSPMEEEMS--PRQLSRTARKEF 601
           ++ G D R +  ++P    + P   Q       S G W   +E     P + S     +F
Sbjct: 546 MRPGVDIRNQNFNQPPILAKIP--MQPPSSSMHSPGGWLVDDENRPSFPGRPSGLYPSQF 605

Query: 602 PVDAEPMREKYRSHHPSFYPKIESSIPSDRTPHENQRLSKEVMRFRESIIFKKRDIDVES 661
           P             HPS     E ++  D       R + E    +  ++   R+   + 
Sbjct: 606 PHGTPGSAPVGPFAHPSHLRSEEVAMDDDLKRQNPSRQTTEGGISQNHLVSNGREHHTDG 665

Query: 662 GRSIWSETP--VGALQEIAMKFGTKVEFKPALVSSAELEFSVEAWFVGEKIGEGIGKTRR 692
           G+S   ++   V ALQEI  + G+KVEF+  + ++ EL+FSVE  F GEKIG G+ KT++
Sbjct: 666 GKSNGGQSHLFVSALQEIGRRCGSKVEFRTVISTNKELQFSVEVLFTGEKIGIGMAKTKK 714
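
The percentages on each HSP summary line follow from the counts printed beside them: for example, 327 identical positions over an alignment length of 728 gives 44.92%. The following is a minimal Python sketch for reading those lines, assuming the line layout used in the alignments above. The function name parse_hsp_stats is hypothetical and is not part of any BLAST tool; the pattern accepts both the "Positives" spelling and the "Postives" spelling that some exports produce.

```python
import re

# Matches lines such as:
#   Identity = 327/728 (44.92%), Positives = 429/728 (58.93%), Query Frame = 0
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 327/728 (44.92%), Positives = 429/728 (58.93%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check on each record.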

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_022142219.1	0.0e+00	82.71	LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 1 [Mom...
KAG7026584.1	0.0e+00	81.65	RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita argyrosperma s...
KAG6594614.1	0.0e+00	81.52	RNA polymerase II C-terminal domain phosphatase-like 1, partial [Cucurbita argyr...
XP_022926826.1	0.0e+00	81.52	RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita moschata] >XP_...
XP_023003826.1	0.0e+00	81.52	RNA polymerase II C-terminal domain phosphatase-like 1 [Cucurbita maxima] >XP_02...

Match Name	E-value	Identity	Description
Q5YDB6	7.2e-229	57.62	RNA polymerase II C-terminal domain phosphatase-like 1 OS=Arabidopsis thaliana O...
Q5YDB5	6.2e-148	44.92	RNA polymerase II C-terminal domain phosphatase-like 2 OS=Arabidopsis thaliana O...

Match Name	E-value	Identity	Description
A0A6J1CKY6	0.0e+00	82.71	Protein-serine/threonine phosphatase OS=Momordica charantia OX=3673 GN=LOC111012...
A0A6J1EFY6	0.0e+00	81.52	Protein-serine/threonine phosphatase OS=Cucurbita moschata OX=3662 GN=LOC1114338...
A0A6J1KNP9	0.0e+00	81.52	Protein-serine/threonine phosphatase OS=Cucurbita maxima OX=3661 GN=LOC111497294...
A0A5A7UGR2	0.0e+00	78.27	Protein-serine/threonine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A0A0KLF7	0.0e+00	80.37	Protein-serine/threonine phosphatase OS=Cucumis sativus OX=3659 GN=Csa_6G517200 ...

Match Name	E-value	Identity	Description
AT4G21670.1	5.1e-230	57.62	C-terminal domain phosphatase-like 1
AT5G01270.1	4.4e-149	44.92	carboxyl-terminal domain (ctd) phosphatase-like 2
AT5G01270.2	4.4e-149	44.92	carboxyl-terminal domain (ctd) phosphatase-like 2
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004274	FCP1 homology domain	SMART	SM00577	forpap2	coord: 194..352; e-value: 1.3E-5; score: 10.2
IPR014720	Double-stranded RNA-binding domain	SMART	SM00358	DRBM_3	coord: 634..699; e-value: 2.5E-6; score: 37.0
IPR014720	Double-stranded RNA-binding domain	PROSITE	PS50137	DS_RBD	coord: 633..691; score: 9.367816
IPR023214	HAD superfamily	GENE3D	3.40.50.1000	-	coord: 129..390; e-value: 6.0E-11; score: 44.3
None	No IPR available	GENE3D	3.30.160.20	-	coord: 628..702; e-value: 2.9E-6; score: 29.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 477..562
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 489..517
None	No IPR available	PANTHER	PTHR23081:SF17	RNA POLYMERASE II C-TERMINAL DOMAIN PHOSPHATASE-LIKE 1	coord: 1..692
None	No IPR available	SUPERFAMILY	54768	dsRNA-binding domain-like	coord: 633..692
IPR039189	CTD phosphatase Fcp1	PANTHER	PTHR23081	RNA POLYMERASE II CTD PHOSPHATASE	coord: 1..692
IPR036412	HAD-like superfamily	SUPERFAMILY	56784	HAD-like	coord: 132..368

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sgr029339.1	Sgr029339.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0070940	dephosphorylation of RNA polymerase II C-terminal domain
cellular_component	GO:0016021	integral component of membrane
cellular_component	GO:0005634	nucleus
molecular_function	GO:0003723	RNA binding
molecular_function	GO:0008420	RNA polymerase II CTD heptapeptide repeat phosphatase activity