HG10003444 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10003444
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: RNA polymerase II C-terminal domain phosphatase-like
Location: Chr08: 1416131 .. 1420921 (-)
RNA-Seq Expression: HG10003444
Synteny: HG10003444
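
The location string above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this). The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome or scaffold name, e.g. "Chr08"
    start: int   # leftmost coordinate (assumed 1-based, inclusive)
    end: int     # rightmost coordinate (assumed inclusive)
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr08: 1416131 .. 1420921 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    """Length of the genomic span under the inclusive-coordinate assumption."""
    return loc.end - loc.start + 1


loc = parse_location("Chr08: 1416131 .. 1420921 (-)")
print(loc.seqid, loc.strand, span_length(loc))  # Chr08 - 4791
```

Because the feature lies on the minus strand, a sequence extracted from the reference at these coordinates would need to be reverse-complemented to match the gene sequence as displayed.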
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCACCCTGTGAAAATACCGAGGGTGACAATAATGCTGAAAGTGAGAGGTACGATTAATTTTTAAAATTAATGAACTTCCATTGTTCTTTGAAGTTAATAATGCCTAAAGACAATGGTTTAGGGTTTTAGTTTTAGGTTAATCTAATGTTTTTATTACTTAAATCTAGAATTAAGTGGAAAGTATCTTGTAGTGGCCTGGCTCTCTTTTGAATCCCCATATAGATAAAAATATGAAATAAGTGCTATCATGATTAAGATTTAAGAGTGGCGGGTTCCTTTTTCCAATTTCTGGACGGTATTCTTTAAAAAAATAGATTGTGTTTGGTATCATACGACATTTGAAAATTCTTGTTGAAATAGGATTTCATGATTGGTTTAACATTCCAGATTTCCCCATCTGACTTTCTTGTGATTTATAGCAATTTGTAGATTCTAAGCTTCTAATCTCAAACTCCGAGTTACCCTTTTTAAACGATTTTTTTAAGAAGCCAAATTTCCAAGAAGAAGAATGAAAGAATAATAGCAAACTGCAAAGCAGCACAAAAAAAGAAGAATATAAAACCGAAGACTAGACATTTACAACAGTCATAGTTATTTAAACTCTCAGTTAATGAGTTTATGTGCCATTTAGTTTCACTTTCATCTAATTATTTTCTTTTATAGATATCCCAAGTGGTCTAGTTCATGAAAGTTTTTATCAGTTACAATGTGTAATTATTATTTTGTATTAAAAGTCTAAAACCATGGTAAACAGGATGATGCTGCTATCCAATAAGAGTAATTTGTGATTTCTGTGTTCGTAATTTGTATTATTTTTCCTTTTGCATGAATAACAGCATTGAGTTGTGGCAAGTGGCGACTATCATTTCTGCACATAATAACGAGGGTTTGTAGGATTGGTTATAAGTTATAACCATATAACTAAATTGTGGCAGTTTATCGTCGTTTTATCAATTTAACATGTCAAATATTTTTTATTCAATCCATTCAATTTATAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAAAACTCAGAGGAGGATATTCTGTATGGAGTTGAAGAGCAAAATTTAGGTAAGTTAAACTCTTCGTACCAACCCCTTTTGTTCATCTTTTACTCTTGCTCTACCCCTTACTTTCAATGTAGCCAATTTGTGACTGCCACATGTATTATACAATGTATACTTACCTACTTCCTTACTTCTTAGAGAGCTATACGACGTGCTTGCTGTGTTAGTTCTTTTTTTCCACCCACTCTGCATATTCATAGAGCTTTTCCCTTGCTAATCTTCTGGTACATATTTTGTTTTTTGTGGTTGCCTCCTCTTCCCTTCTTATAGAATCTGATTGGAGGTTTGTCACCCTTTTTTATCCACCTTTTATTTAAATTCTATGTTCTTTAAGAACCATATCGAGTTGGCCCCCAAATGATCTATATTTAATCACCTCCCTATTAGTGTTGTTGAGTAAGAAAGTATTCCCTAGAAACTATGAAATCACAAAGGGAGGGGAAAGATAGCTATATTCCTGTATATCCAAAAGTGAAATAACATATTCTCGGGCTCAATGTTCAAAGTGGTCTGAACACTCGTGGATAAAAAATAAAGAACATAGGATGGGAGTTGTTCTTATATATGATACTTGTGAACGATCTTTGGTCTGAACGAAAATAGTCATTTTTAATTTCTTTGAAAACTAAGAATTACGGGATGTTAGATGGATATACACCTGAAATTTGGTGGTATCTGATTTCAGGAACTATTTCAGTCCAGCTGTCCTCTGCTCACCATATATTCTTTATTTCGGTTCCTTTCTTCCTCTACCAATGTAACCTATTCAGTTCTGTGTAATTGTAACTTAATAACCGAGAAATATAGAAGCCTTCATATTGAAGCAAGTATTT
GACTCCTTGTATCTCTTTGTTGGTTTTTCTCTACTCTCCGAATTCTTCTGTTTTGAAAATAATAATCAATTGCAGAAGTATTATCAAAGCAACAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATACATCGGGTATGTTTTCTTTGTATGAATATCCAATTATCTTCTATTCTTTGTTACCATGTATAGAATGTTTGGGTGGGGTGGGGGGTGGGTTAGAAGGAACTCATGAGGTTTGGGGGTTTACAGAATGGGCACCATTTAGAAGAGGAATGAGAAATGATGTCAAGCCTTTTATTTTCTTCTTGGTTTTACAAAATGGGCACCATCTACTTCTACTGCCTCATTTGCTATAGCCTATGAGGATGCAATAAGTTTTTGTCTATCTTTTATGAAGGGCTCGTTGATTTTCTGAACTCGTGCTTTTTTGGGTTTTGTTTTGAAACCTTGGCAAAGATTTCATGTATTCTAGTTCCAGTCACTAAGCCGATGGGTCATAGATGTGACGTTGTTTAGTGCTGGACATTTTTGGAATCAAGCAAATAGAGTCTTCCTTTCATGCATTAATAAGGGTCACATTCCCAAAATTCAACACTGCTTCTTCTCTCTCTCTCTCTAAAAATGTAAGAGGATAGAGGGAATCGGGAAGATCTTTGTCAATAGGAAGTCTTTTACAGTTGGAAGAAGTGGAAATGGAAGAAGATCAGTTTTAGAGGAGCGTAGGGGACTGAAAGTCAGGAAAGTCGAGTTGGAGATAGGTATTGTGGTATGGGTCAGAGCCTGATTGCCTTTGGTGAAGGATTCAAGCAATCCTTTGGGTTTTTAGCCGAGAAGTAGATTAGAGGCAGCCATTATTTTCTTCCATGTCCTGTCAAACAAAAGAAGCGCTTTGCATAGTATCCTTAGAAACCTTAAAGGTAGAAATTCATAGATTCTTATCTCAGAAGGTGAAAGAAAGAAAGGTTAGAATGCTTTAGTTGATGAAATTTCTGGGTTTTTGCATCTTCTATTTAATGTAGCTTTTTTTTGGCTTTTCTTTAAGTTTTAGTTAGGTTCTGCTTGGTTCTTTAAAAGATCTTTTATGTTCAGTTCTTTTGTAAGTTACCGTGTGGTAACTGGTAACTTCTGTTGGTTTAGTTGTTCTGTGGTTTAGTATTTAAGCTAAACCTTCAGCTCTTTAATAAAGCATTGAGGATTCAGAAGTTAAAAACTTCCCCCAACTTGTTACACCACTTTGGCAGTGGGTATGGTGGGAAGGGTGAGTCCTGGATATGGAGGCTGGTCCCTAATCCTCTTTATTTTTTAATTCACCCCCCTCCCTCTTTTTTTTCGGTTGTGCCTTACCTTACACACCGGTTTATTCCCTAGGAATTCATTGCTGAATGCTGTTGGTAATGGGAGAGGAGGGGGCTTTTTTTGTGTAGGTTGGTGGATTGGAGAGCCATATTCCTTTTCTAACCCCTTGAGTTTTCCTCCTGCTAATAAGGGAGAATTGATGGGAAATTGGGCTTCCAGTCAGTCTGCTCTTGCCTCGGTAGGTATTCAGGCGATTTTAGCTCTGCAATGGTCTCGGGATCCTAAAGAAAGGGTGTTTTTGGGAAAAAGAAAGATGGGGAGGGAGGAAAGGAACTTGAAAGAGAGAGATCGTGGACTAATCCCATTTAGTTTGACATGTTTATGTTTTTTTTTAATATATATTCATTCACTGTTGTCAAAATAGATACTAGCTATGGATTTTAACTTTCCATGACTGGAGAAGTATCGTCGTTTATATGCCTCTTTGATCAATATACTATAGTTTCTTAAAAACAAAAAGGAAAATAGTGAAATAGTCCCATGTAGTTTGAGATGTTTTTGGAGAGCCATATTCCTTTTCTAACCCCTTGAGTTTTCCTCCTGCTAATAAGGGAGAATTGATGGGAAATTGGGCTTCCAGTCAGT
CTGCTCTTGCCTCGGTAGGTATTCAGGCGATTTTAGCTCTGCAATGGTCTCGGGATCCTAAAGAAAGGGTGCTTTTGGGAAAAAGAAAGATGGGGAGGGAGGAAAGGAACTTGAAAGAGAGAGATCGTGGACTAATCCCATTTAGTTTGACATGTTTATGTTTTTTTTTAATATATATTCATTCACTGTTGTCAAAATAGATACTAGCTATGGATTTTAACTTTCCATGACTGGAGAAGTATCGTCGTTTATATGCCTCTTTGATCAATATACTATAGTTTCTTAAAAACAAAAAGGAAAATAGTGAAATAGTCCCATGTAGTTTGAGATGTTTTTGTTAATATAATATATATTTTTCCTTCAGTTATATTTGTTCACTGATGACAAAATAGACACTGGCTATGAATTTTAATTTAGCATGGCTGGTGAAGTACAGATTACTTGTAATTTGCTAGTATATCAAACTATGTGCTAAATGTATGACCCATCCGATAATCTTGCAGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTAGAAGGTACATTGTCCTCTCATCTGTACATAGTTCTACTTTCTAAAGTAACTTTGATGTGGTGTTTACTGTTCTTTGATAAGGTGTTTACTGTTATTTCATTCCCATCGTGA

mRNA sequence

ATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCACCCTGTGAAAATACCGAGGGTGACAATAATGCTGAAAGTGAGAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAAAACTCAGAGGAGGATATTCTGTATGGAGTTGAAGAGCAAAATTTAGAAGTATTATCAAAGCAACAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATACATCGGAGGATAGAGGGAATCGGGAAGATCTTTGTCAATAGGAAGTCTTTTACAGTTGGAAGAAGTGGAAATGGAAGAAGATCAGTTTTAGAGGAGCGTAGGGGACTGAAAGTCAGGAAAGTCGAGTTGGAGATAGGTATTGTGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTAGAAGGTACATTGTCCTCTCATCTGTACATAGTTCTACTTTCTAAAGTAACTTTGATGTGGTGTTTACTGTTCTTTGATAAGGTGTTTACTGTTATTTCATTCCCATCGTGA

Coding sequence (CDS)

ATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCACCCTGTGAAAATACCGAGGGTGACAATAATGCTGAAAGTGAGAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAAAACTCAGAGGAGGATATTCTGTATGGAGTTGAAGAGCAAAATTTAGAAGTATTATCAAAGCAACAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATACATCGGAGGATAGAGGGAATCGGGAAGATCTTTGTCAATAGGAAGTCTTTTACAGTTGGAAGAAGTGGAAATGGAAGAAGATCAGTTTTAGAGGAGCGTAGGGGACTGAAAGTCAGGAAAGTCGAGTTGGAGATAGGTATTGTGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTAGAAGGTACATTGTCCTCTCATCTGTACATAGTTCTACTTTCTAAAGTAACTTTGATGTGGTGTTTACTGTTCTTTGATAAGGTGTTTACTGTTATTTCATTCCCATCGTGA

Protein sequence

MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIGKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIVLLSKVTLMWCLLFFDKVFTVISFPS
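
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a sketch only, assuming the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon terminates translation
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS shown above.
print(translate("ATGAGCCTTGCAACTAATTCTCCAGCT"))  # MSLATNSPA
# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("TTCCCATCGTGA"))  # FPS
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above if the record is internally consistent.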
Homology
BLAST of HG10003444 vs. NCBI nr
Match: XP_038890381.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida] >XP_038890382.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida])

HSP 1 Score: 291.6 bits (745), Expect = 6.6e-75
Identity = 162/228 (71.05%), Positives = 170/228 (74.56%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNNAESERIKRRKVEKLENS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEGDNNAESERIKRRKVEKLENS 60

Query: 61  EEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIGKI 120
           EEDILYGVEEQ+ E +SKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+        
Sbjct: 61  EEDILYGVEEQSSEAISKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK-------- 120

Query: 121 FVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQ 180
                                                 GLRLNNDEINRLRNIDMK+LL 
Sbjct: 121 --------------------------------------GLRLNNDEINRLRNIDMKSLLL 180

Query: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSL+      L+++
Sbjct: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLDDVTKGSLFLL 182

BLAST of HG10003444 vs. NCBI nr
Match: XP_023525838.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 283.5 bits (724), Expect = 1.8e-72
Identity = 158/228 (69.30%), Positives = 166/228 (72.81%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  EEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIGKI 120
            EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+        
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK-------- 120

Query: 121 FVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQ 180
                                                 GLRLNNDEINRLRNIDMKNLLQ
Sbjct: 121 --------------------------------------GLRLNNDEINRLRNIDMKNLLQ 180

Query: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           HKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLE      L+++
Sbjct: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLFLL 182

BLAST of HG10003444 vs. NCBI nr
Match: XP_022133134.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia])

HSP 1 Score: 281.2 bits (718), Expect = 8.9e-72
Identity = 159/231 (68.83%), Positives = 168/231 (72.73%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 61  E---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGI 120
           E   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIH+     
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHK----- 120

Query: 121 GKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKN 180
                                                    GLRLNNDEINRLRNIDMKN
Sbjct: 121 -----------------------------------------GLRLNNDEINRLRNIDMKN 180

Query: 181 LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLE      L+++
Sbjct: 181 LLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKGSLFLL 185

BLAST of HG10003444 vs. NCBI nr
Match: XP_022133135.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X2 [Momordica charantia])

HSP 1 Score: 281.2 bits (718), Expect = 8.9e-72
Identity = 159/231 (68.83%), Positives = 168/231 (72.73%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 61  E---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGI 120
           E   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIH+     
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHK----- 120

Query: 121 GKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKN 180
                                                    GLRLNNDEINRLRNIDMKN
Sbjct: 121 -----------------------------------------GLRLNNDEINRLRNIDMKN 180

Query: 181 LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLE      L+++
Sbjct: 181 LLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKGSLFLL 185

BLAST of HG10003444 vs. NCBI nr
Match: KAG7037160.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 280.4 bits (716), Expect = 1.5e-71
Identity = 157/228 (68.86%), Positives = 164/228 (71.93%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS
Sbjct: 20  MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 79

Query: 61  EEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIGKI 120
            EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+        
Sbjct: 80  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK-------- 139

Query: 121 FVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQ 180
                                                 GLRLNNDEINRLRNIDMKNLLQ
Sbjct: 140 --------------------------------------GLRLNNDEINRLRNIDMKNLLQ 199

Query: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           HKKLILVLDLDHTLLNSTQLGHL PEEEYLR+Q DSLE      L+++
Sbjct: 200 HKKLILVLDLDHTLLNSTQLGHLAPEEEYLRNQMDSLEDVTKGSLFLL 201

BLAST of HG10003444 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 2.0e-29
Identity = 95/220 (43.18%), Positives = 119/220 (54.09%), Query Frame = 0

Query: 1   MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLE 60
           MS+A++SP H SSSSDD AAFLD  LDS S  SS P E  E +++ ES  +KR+K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 61  NSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIG 120
                          E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIH+      
Sbjct: 61  ---------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKE----- 120

Query: 121 KIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNL 180
                                                    +RLN DEI+RLR+ D + L
Sbjct: 121 -----------------------------------------MRLNEDEISRLRDSDSRFL 157

Query: 181 LQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLE 219
            + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+
Sbjct: 181 QRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQ 157

BLAST of HG10003444 vs. ExPASy TrEMBL
Match: A0A6J1BUF9 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 4.3e-72
Identity = 159/231 (68.83%), Positives = 168/231 (72.73%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 61  E---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGI 120
           E   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIH+     
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHK----- 120

Query: 121 GKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKN 180
                                                    GLRLNNDEINRLRNIDMKN
Sbjct: 121 -----------------------------------------GLRLNNDEINRLRNIDMKN 180

Query: 181 LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLE      L+++
Sbjct: 181 LLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKGSLFLL 185

BLAST of HG10003444 vs. ExPASy TrEMBL
Match: A0A6J1BV42 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 4.3e-72
Identity = 159/231 (68.83%), Positives = 168/231 (72.73%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 61  E---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGI 120
           E   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIH+     
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHK----- 120

Query: 121 GKIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKN 180
                                                    GLRLNNDEINRLRNIDMKN
Sbjct: 121 -----------------------------------------GLRLNNDEINRLRNIDMKN 180

Query: 181 LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLE      L+++
Sbjct: 181 LLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKGSLFLL 185

BLAST of HG10003444 vs. ExPASy TrEMBL
Match: A0A6J1GC38 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111452801 PE=4 SV=1)

HSP 1 Score: 279.6 bits (714), Expect = 1.3e-71
Identity = 157/228 (68.86%), Positives = 165/228 (72.37%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNS AHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS
Sbjct: 1   MSLVTNSLAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  EEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIGKI 120
            EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+        
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK-------- 120

Query: 121 FVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQ 180
                                                 GLRLNNDEINRLRNIDMKNLLQ
Sbjct: 121 --------------------------------------GLRLNNDEINRLRNIDMKNLLQ 180

Query: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           HKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLE      L+++
Sbjct: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLFLL 182

BLAST of HG10003444 vs. ExPASy TrEMBL
Match: A0A6J1ID30 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661 GN=LOC111471991 PE=4 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 4.8e-71
Identity = 156/228 (68.42%), Positives = 163/228 (71.49%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDS P E  EG NN E+ERIKR KVEKLENS
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSLPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  EEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIGKI 120
            EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+        
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK-------- 120

Query: 121 FVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQ 180
                                                 GLRLNNDEINRLRNIDMK LLQ
Sbjct: 121 --------------------------------------GLRLNNDEINRLRNIDMKKLLQ 180

Query: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           HKKLILVLDLDHTLLNSTQLGHLTPEEEYLR+Q DSLE      L+++
Sbjct: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRNQMDSLEDVTKGSLFLL 182

BLAST of HG10003444 vs. ExPASy TrEMBL
Match: A0A6J1EFC1 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111432775 PE=4 SV=1)

HSP 1 Score: 277.3 bits (708), Expect = 6.2e-71
Identity = 158/228 (69.30%), Positives = 166/228 (72.81%), Query Frame = 0

Query: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENS 60
           MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP +N E  NN ESERIKRRKVEKL  S
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 61  EEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIGKI 120
           EED L GVEEQ+LEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+        
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK-------- 120

Query: 121 FVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNLLQ 180
                                                 GLRLNNDEINRLRNIDMK+LLQ
Sbjct: 121 --------------------------------------GLRLNNDEINRLRNIDMKSLLQ 180

Query: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEGTLSSHLYIV 229
           HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQ DSLE      L+++
Sbjct: 181 HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLFLL 182

BLAST of HG10003444 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4 )

HSP 1 Score: 131.0 bits (328), Expect = 1.4e-30
Identity = 95/220 (43.18%), Positives = 119/220 (54.09%), Query Frame = 0

Query: 1   MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLE 60
           MS+A++SP H SSSSDD AAFLD  LDS S  SS P E  E +++ ES  +KR+K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 61  NSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRRIEGIG 120
                          E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIH+      
Sbjct: 61  ---------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKE----- 120

Query: 121 KIFVNRKSFTVGRSGNGRRSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMKNL 180
                                                    +RLN DEI+RLR+ D + L
Sbjct: 121 -----------------------------------------MRLNEDEISRLRDSDSRFL 157

Query: 181 LQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLE 219
            + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+
Sbjct: 181 QRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQ 157

BLAST of HG10003444 vs. TAIR 10
Match: AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 42.4 bits (98), Expect = 6.5e-04
Identity = 25/59 (42.37%), Positives = 38/59 (64.41%), Query Frame = 0

Query: 159 GLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDS 217
           GL+L+N+ +   +++  K + L  KKL LVLDLDHTLL+S  + +L+  E YL  +  S
Sbjct: 41  GLQLSNEAVALTKSLTTKHSCLNEKKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASS 99

BLAST of HG10003444 vs. TAIR 10
Match: AT5G54210.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 42.0 bits (97), Expect = 8.5e-04
Identity = 30/83 (36.14%), Positives = 45/83 (54.22%), Query Frame = 0

Query: 137 RSVLEERRGLKVRKVELEIGIVGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLL 196
           RS +E  RG        +  + GL+L++  +   + +  +      KKL LVLDLDHTLL
Sbjct: 46  RSNVERHRGR-----SFDYLVDGLQLSDIAVTVTKRVTTQITCFNDKKLHLVLDLDHTLL 105

Query: 197 NSTQLGHLTPEEEYLRSQTDSLE 219
           ++  + +LT EE YL  + DS E
Sbjct: 106 HTVMISNLTKEETYLIEEEDSRE 123
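
The percentages reported on each HSP line are simple ratios of matched positions to alignment length. The sketch below, with the hypothetical helper `parse_hsp`, pulls the counts out of a statistics line and recomputes both percentages. The regular expression is an assumption about this page's layout rather than a general BLAST parser, and it tolerates either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 162/228 (71.05%), Positives = 170/228 (74.56%)".
# "Posi?tives" also accepts the misspelling "Postives".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp(line: str) -> dict:
    """Extract match counts from an HSP statistics line and recompute percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, _, positives, aligned_pos, _ = m.groups()
    identical, aligned = int(identical), int(aligned)
    positives, aligned_pos = int(positives), int(aligned_pos)
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positive_pct": round(100 * positives / aligned_pos, 2),
    }


stats = parse_hsp("Identity = 162/228 (71.05%), Positives = 170/228 (74.56%), Query Frame = 0")
print(stats["identity_pct"], stats["positive_pct"])  # 71.05 74.56
```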

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038890381.1 | 6.6e-75 | 71.05 | RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa his...
XP_023525838.1 | 1.8e-72 | 69.30 | RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pe...
XP_022133134.1 | 8.9e-72 | 68.83 | RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica cha...
XP_022133135.1 | 8.9e-72 | 68.83 | RNA polymerase II C-terminal domain phosphatase-like 4 isoform X2 [Momordica cha...
KAG7037160.1 | 1.5e-71 | 68.86 | RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma s...

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q00IB6 | 2.0e-29 | 43.18 | RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1BUF9 | 4.3e-72 | 68.83 | RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3...
A0A6J1BV42 | 4.3e-72 | 68.83 | RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3...
A0A6J1GC38 | 1.3e-71 | 68.86 | RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36...
A0A6J1ID30 | 4.8e-71 | 68.42 | RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661...
A0A6J1EFC1 | 6.2e-71 | 69.30 | RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G58003.1 | 1.4e-30 | 43.18 | C-terminal domain phosphatase-like 4
AT2G04930.1 | 6.5e-04 | 42.37 | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT5G54210.1 | 8.5e-04 | 36.14 | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 43..63
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 27..47
None | No IPR available | PANTHER | PTHR23081:SF28 | BNAC03G12630D PROTEIN | coord: 159..228
None | No IPR available | PANTHER | PTHR23081:SF28 | BNAC03G12630D PROTEIN | coord: 2..114
IPR023214 | HAD superfamily | GENE3D | 3.40.50.1000 | | coord: 166..234; e-value: 2.7E-8; score: 35.6
IPR039189 | CTD phosphatase Fcp1 | PANTHER | PTHR23081 | RNA POLYMERASE II CTD PHOSPHATASE | coord: 2..114
IPR039189 | CTD phosphatase Fcp1 | PANTHER | PTHR23081 | RNA POLYMERASE II CTD PHOSPHATASE | coord: 159..228
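
Several of the annotation hits above overlap on the protein. As a rough illustration of how the annotated span could be summarised, the sketch below merges the listed coordinate ranges and counts the residues covered by at least one hit. It assumes the coordinates are 1-based, inclusive protein positions, which the page does not state explicitly; `merge_intervals` and `covered_length` are hypothetical helper names.

```python
from typing import List, Tuple

Interval = Tuple[int, int]

# Coordinate ranges copied from the InterPro table above.
HITS: List[Interval] = [
    (43, 63),    # COILS
    (27, 47),    # MOBIDB_LITE
    (159, 228),  # PANTHER
    (2, 114),    # PANTHER
    (166, 234),  # GENE3D
]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous interval if this one overlaps or touches it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_length(intervals: List[Interval]) -> int:
    """Number of residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


print(merge_intervals(HITS))  # [(2, 114), (159, 234)]
print(covered_length(HITS))   # 189
```

Under these assumptions, positions 115 to 158 carry no annotation, which lines up with the stretch of the query that is gapped in most of the BLAST alignments listed earlier.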

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10003444.1 | HG10003444.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0070940 | dephosphorylation of RNA polymerase II C-terminal domain
molecular_function | GO:0008420 | RNA polymerase II CTD heptapeptide repeat phosphatase activity