Cp4.1LG01g05410.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g05410.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: CTD small phosphatase-like protein
Location: Cp4.1LG01 : 760660 .. 763466 (+)
Sequence length: 1290
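
The location and the sequence length above measure different things: the coordinates describe the genomic locus (introns included), while 1290 is presumably the length of the spliced mRNA. The following minimal sketch parses the location string and computes the genomic span. It assumes coordinates are 1-based and inclusive, which is common for this kind of record but is not stated on the page; `parse_location` and `genomic_span` are hypothetical helper names.

```python
import re


def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def genomic_span(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG01 : 760660 .. 763466 (+)")
span = genomic_span(start, end)
print(seqid, strand, span)  # Cp4.1LG01 + 2807
```

Under that assumption the locus spans 2807 bp on the plus strand, against 1290 nt for the transcript; the difference would be accounted for by intronic sequence.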
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGCTGAGTTGACTCAGCCTGATGTCTACTCGCTGAGGACTCTGCAGGTGTGGAAGACGTTGTTGAACTGGTTGGCTTTCTTCTTCCAGATCTTCGTTCAGATCCTCAGAGCTTTTGGACACCTTCGGCTTCTTTCTTCTTCTGATTCTGATTCTTCGTCTCCGTCTTTTAAGCCTTTGCCGGTCGTTGAGCTTCCTGATCATGAATTTCTTGCTGCCTCTGCCGTTGATATTGCCTCTGTTGAAGATGAGGAGGAGTCCGACGGGCTTATGGAGAAACTGACGGTATTGTGATTTTCTTGTTGATTTCTCGTTGTGAACATCGTGTTTTCCCCCCAAATTTGTTTGCTGGAAAGAATTTGTGTTTGGATTCTATTTGTGCTGGTTTTTTTTCATGAAACTGTGCAATTGTTGCGCTGATTAGGTTTGCTTGATTTTGCAATTATTCGTCTATGGCAATGATTTCAACAGATATGCTTAAGTTCGTTTAATGTTTGTGAGAGTTTGGGAGTAATGTTTAACCCGGCTTGGCCCTGGTGTACGTGAGAAAATTATTGATATCATTTTGGGGAAGAGGAGACGAGCCTTTTCTTGATGAAAATAGGGACAGGAAAGCTTCAACTTCATTCCCTATGGTTCTGCAGTATTTCATTTTGATGATTCATTAGTAGTAGATGAAAGAATCTAGCTCTGGGACTCTGTAAATGATTATTCCAATCTGGACGAACTATATTCGTCCATTTTAGAACTGAATTTTATGATTTGAAAATGTTTTCCTGTGCTATGTAAAGCTTCTGATGTTAGAAGTTTGTGGAGATGCCGTGTGAAGCTGTGAAGTTAAAGTCGTTTCAAGAAATTACATATGGCTCATTCCTCTTTTATCTTCCTTTTGTTTCCCTTGTGTGCTTCCTTTATTAATTCCACATCTGGCATTGGACAATTACAGATAGTTCTCGATTTGGATGAAACTCTCATATGTGCATATGAGACATCCACTCTGCCTGCTGTTGTTCGTAATCAAGCAATAGAAGCTGGACTGAAGTGTTTCGAACTCGAGTGTTTTTCTTCAGATAAGGTAGTTTACTTTCCAGTAACCTTGTATTGATTTGGTTTTGTGAGTTTTAGTTAAACCATCTTACCTGGGTATGCTGCTATTTGAAGGACTGTGAAGGGAAGCCTAAGGTCAACTATGTTACTGTCTTTGAGCGTCCGGGTTTGCATGATTTCTTAAATCAACTAGGTAGCTTTGCAGATCTTGTACTATTCACGGCTGGTCTTGAAGGTTTGTACACTTTACTATCCATTATATTATCAGGTAATATCCTTGCCCATAATAGAAAAATAAAAGAGAATCTGAAGGATGTCATCTATGTGTATGCCTCAGGATATGCAAGACCTCTTGTTGACAGGATAGATGAAGAAAATCGATTTAGTCTTCGACTGTATAGGCCTTCAACAATTAGCACGTAAGTACTTCTCTGCTATTTTCTTCTTTTCTATTCGATCAGCATAGCAACTATTTGCTATGTTTACTTATCTAGATTGGCAATATTCAAGTCTATTGACGCTTCAACTGTGGAAAGACATTATACTATTATGATTATTGAACACACCCTTTCCAAAAACAAGTTTGAATTATTCAGGAGTTCGAATAAGAATTTGTTCGTGTACATGTATTTATATCCTTAAGTCCATTCTTTATTGATATTGAATGCAGTGATTAAGTGCATATGTTTGGCAGTGATTTTGGATAGGACATAAACATGCCCCAATTGTTGATAAGTTTGTTTAGTTAGTAGTTAAATTTATCAAGATTAAATGGAAACTGAGTTATTATTATTATTTTTTTGGTAAGAAACTGAGCTTTCACTTGGTACATTATTTAGATCCTCACAGAAGTTAAAAGCACCCACTTTCAAAGTGTTTTTTTTTTTTTTCTTTTACCCATTTCAAGTTAAGCTAAAAGCACTTACTTTAAAGCTAAGTATCCCAATTGTGTC
TATTCTCTGTAAAACCCATGGTTCTGGTATGTAATCTACAGGGAATATCGAGATCACGTGAAGGATCTCAGCTGCCTATCCAAGGATCTAAGAAGAACAGTCATCGTTGATAACAATCCTTTCAGTTTTCTATTGCAACCTGTGAATGGAATTCCTTGCATTCCATTTTCTGCAGGGCAACCACACGATTCACAGGTGGGAAAACAATGAAAATTCTGAATCTTTCAACAACAGTACATCTTCACAATAATTCATTTTGTGACCGTCTTCCTTGTTCTCCATATACAGCTTCTAGATGTCATTCTTCCACTCCTAAAGCACCTCTCGCTGCAGAACGACGTCAGATCAGTGCTGTACGAAAGATTCCATATGCCTGAATGGTTTCAGAAGCATGGAATTCCAACCTAAATTCAACGAAAATCTGTGCCAAGGAGAAAGGTACATTCTTCTCTGCTCATGCAACAGTTAAACACAGTTAGCTTGCTTGTTCAAAAGGGCAGAGCACATTTAGGAGTCAGATTCTGATTCTTGAACTTCTGTATAGCTTCATGTTGTACATCGTGGATATGGTCCTGAAAAGTTGCATCCAATTGTTTGAAAATCATAGTAGTATTAAGATTGTAAATTTGTAGGAATTCTGATTTGTAATTTATTAATGCAGTGTTGTTGGATATGTTTGTAGGAATTTTCTTGAACTTTTTACCAATAAATATCAGTTGTTTCCTCCCTTTGGTTGCTTTAACTGTTTTGTGGCTTCTCCAGAATATTAAAAATTAAGGTATTTAGAAGAGTTAGAGTAAAAAAGAA

mRNA sequence

ATGGCTGAGTTGACTCAGCCTGATGTCTACTCGCTGAGGACTCTGCAGGTGTGGAAGACGTTGTTGAACTGGTTGGCTTTCTTCTTCCAGATCTTCGTTCAGATCCTCAGAGCTTTTGGACACCTTCGGCTTCTTTCTTCTTCTGATTCTGATTCTTCGTCTCCGTCTTTTAAGCCTTTGCCGGTCGTTGAGCTTCCTGATCATGAATTTCTTGCTGCCTCTGCCGTTGATATTGCCTCTGTTGAAGATGAGGAGGAGTCCGACGGGCTTATGGAGAAACTGACGATAGTTCTCGATTTGGATGAAACTCTCATATGTGCATATGAGACATCCACTCTGCCTGCTGTTGTTCGTAATCAAGCAATAGAAGCTGGACTGAAGTGTTTCGAACTCGAGTGTTTTTCTTCAGATAAGGACTGTGAAGGGAAGCCTAAGGTCAACTATGTTACTGTCTTTGAGCGTCCGGGTTTGCATGATTTCTTAAATCAACTAGGTAGCTTTGCAGATCTTGTACTATTCACGGCTGGTCTTGAAGGATATGCAAGACCTCTTGTTGACAGGATAGATGAAGAAAATCGATTTAGTCTTCGACTGTATAGGCCTTCAACAATTAGCACGGAATATCGAGATCACGTGAAGGATCTCAGCTGCCTATCCAAGGATCTAAGAAGAACAGTCATCGTTGATAACAATCCTTTCAGTTTTCTATTGCAACCTGTGAATGGAATTCCTTGCATTCCATTTTCTGCAGGGCAACCACACGATTCACAGCTTCTAGATGTCATTCTTCCACTCCTAAAGCACCTCTCGCTGCAGAACGACGTCAGATCAGTGCTGTACGAAAGATTCCATATGCCTGAATGGTTTCAGAAGCATGGAATTCCAACCTAAATTCAACGAAAATCTGTGCCAAGGAGAAAGGTACATTCTTCTCTGCTCATGCAACAGTTAAACACAGTTAGCTTGCTTGTTCAAAAGGGCAGAGCACATTTAGGAGTCAGATTCTGATTCTTGAACTTCTGTATAGCTTCATGTTGTACATCGTGGATATGGTCCTGAAAAGTTGCATCCAATTGTTTGAAAATCATAGTAGTATTAAGATTGTAAATTTGTAGGAATTCTGATTTGTAATTTATTAATGCAGTGTTGTTGGATATGTTTGTAGGAATTTTCTTGAACTTTTTACCAATAAATATCAGTTGTTTCCTCCCTTTGGTTGCTTTAACTGTTTTGTGGCTTCTCCAGAATATTAAAAATTAAGGTATTTAGAAGAGTTAGAGTAAAAAAGAA

Coding sequence (CDS)

ATGGCTGAGTTGACTCAGCCTGATGTCTACTCGCTGAGGACTCTGCAGGTGTGGAAGACGTTGTTGAACTGGTTGGCTTTCTTCTTCCAGATCTTCGTTCAGATCCTCAGAGCTTTTGGACACCTTCGGCTTCTTTCTTCTTCTGATTCTGATTCTTCGTCTCCGTCTTTTAAGCCTTTGCCGGTCGTTGAGCTTCCTGATCATGAATTTCTTGCTGCCTCTGCCGTTGATATTGCCTCTGTTGAAGATGAGGAGGAGTCCGACGGGCTTATGGAGAAACTGACGATAGTTCTCGATTTGGATGAAACTCTCATATGTGCATATGAGACATCCACTCTGCCTGCTGTTGTTCGTAATCAAGCAATAGAAGCTGGACTGAAGTGTTTCGAACTCGAGTGTTTTTCTTCAGATAAGGACTGTGAAGGGAAGCCTAAGGTCAACTATGTTACTGTCTTTGAGCGTCCGGGTTTGCATGATTTCTTAAATCAACTAGGTAGCTTTGCAGATCTTGTACTATTCACGGCTGGTCTTGAAGGATATGCAAGACCTCTTGTTGACAGGATAGATGAAGAAAATCGATTTAGTCTTCGACTGTATAGGCCTTCAACAATTAGCACGGAATATCGAGATCACGTGAAGGATCTCAGCTGCCTATCCAAGGATCTAAGAAGAACAGTCATCGTTGATAACAATCCTTTCAGTTTTCTATTGCAACCTGTGAATGGAATTCCTTGCATTCCATTTTCTGCAGGGCAACCACACGATTCACAGCTTCTAGATGTCATTCTTCCACTCCTAAAGCACCTCTCGCTGCAGAACGACGTCAGATCAGTGCTGTACGAAAGATTCCATATGCCTGAATGGTTTCAGAAGCATGGAATTCCAACCTAA

Protein sequence

MAELTQPDVYSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKPLPVVELPDHEFLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT
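
The protein sequence is the conceptual translation of the CDS. The sketch below shows that relationship using the standard genetic code; the assumption that the database used the standard code (rather than an organelle-specific table) is mine, and `translate` is a hypothetical helper. Only the first and last few codons of the CDS are used so that the example stays short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGCTGAGTTGACTCAGCCTGAT"))  # MAELTQPD
# Last five codons, ending in the TAA stop codon.
print(translate("GGAATTCCAACCTAA"))  # GIPT
```

Both fragments agree with the start (MAELTQPD) and end (GIPT) of the protein sequence shown above.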
BLAST of Cp4.1LG01g05410.1 vs. Swiss-Prot
Match: CTDS1_MOUSE (Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 OS=Mus musculus GN=Ctdsp1 PE=1 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 1.2e-19
Identity = 65/188 (34.57%), Positives = 100/188 (53.19%), Query Frame = 1

Query: 92  EKLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTV 151
           +K+ +V+DLDETL+                  +  K      F    + +G   V+ V V
Sbjct: 89  DKICVVIDLDETLV-----------------HSSFKPVNNADFIIPVEIDGV--VHQVYV 148

Query: 152 FERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDH 211
            +RP + +FL ++G   + VLFTA L  YA P+ D +D+   F  RL+R S +     ++
Sbjct: 149 LKRPHVDEFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRARLFRESCV-FHRGNY 208

Query: 212 VKDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSL 271
           VKDLS L +DLRR +I+DN+P S++  P N +P   +      D++L D +LP  + LS 
Sbjct: 209 VKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASWFDNM-SDTELHD-LLPFFEQLSR 254

Query: 272 QNDVRSVL 280
            +DV SVL
Sbjct: 269 VDDVYSVL 254
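
The percentages in an HSP summary line are the match counts divided by the alignment length. As a minimal sketch, the following parses a summary line of the form used in these reports and recomputes both percentages; the regular expression and the `recompute` helper are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)".
# "Posi?tives" also accepts the spelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


line = "Identity = 65/188 (34.57%), Positives = 100/188 (53.19%), Query Frame = 1"
print(recompute(line))  # (34.57, 53.19)
```

Here 65 identical and 100 similar positions over a 188-column alignment reproduce the 34.57% and 53.19% reported for this HSP.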

BLAST of Cp4.1LG01g05410.1 vs. Swiss-Prot
Match: CTDS1_HUMAN (Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 OS=Homo sapiens GN=CTDSP1 PE=1 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 1.2e-19
Identity = 65/188 (34.57%), Positives = 100/188 (53.19%), Query Frame = 1

Query: 92  EKLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTV 151
           +K+ +V+DLDETL+                  +  K      F    + +G   V+ V V
Sbjct: 89  DKICVVIDLDETLV-----------------HSSFKPVNNADFIIPVEIDGV--VHQVYV 148

Query: 152 FERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDH 211
            +RP + +FL ++G   + VLFTA L  YA P+ D +D+   F  RL+R S +     ++
Sbjct: 149 LKRPHVDEFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRARLFRESCV-FHRGNY 208

Query: 212 VKDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSL 271
           VKDLS L +DLRR +I+DN+P S++  P N +P   +      D++L D +LP  + LS 
Sbjct: 209 VKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASWFDNM-SDTELHD-LLPFFEQLSR 254

Query: 272 QNDVRSVL 280
            +DV SVL
Sbjct: 269 VDDVYSVL 254

BLAST of Cp4.1LG01g05410.1 vs. Swiss-Prot
Match: YA22_SCHPO (Uncharacterized protein C2F7.02c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPAC2F7.02c PE=1 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 4.6e-19
Identity = 62/187 (33.16%), Positives = 101/187 (54.01%), Query Frame = 1

Query: 93  KLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVF 152
           K  ++LDLDETL+                  +  K  E   F    + +G    + V V 
Sbjct: 157 KKCLILDLDETLV-----------------HSSFKYIEPADFVVSIEIDGLQ--HDVRVV 216

Query: 153 ERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDHV 212
           +RPG+ +FL ++G   ++V+FTA L  YA P++D +D  +    RL+R +  + E  + V
Sbjct: 217 KRPGVDEFLKKMGDMFEIVVFTASLAKYADPVLDMLDHSHVIRHRLFREACCNYE-GNFV 276

Query: 213 KDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQ 272
           KDLS L ++L  ++I+DN+P S++  P + +P I       HD +L+D+I P L+HL+  
Sbjct: 277 KDLSQLGRNLEDSIIIDNSPSSYIFHPSHAVP-ISSWFNDMHDMELIDLI-PFLEHLARV 321

Query: 273 NDVRSVL 280
            DV +VL
Sbjct: 337 PDVSTVL 321

BLAST of Cp4.1LG01g05410.1 vs. Swiss-Prot
Match: PSR2_YEAST (Probable phosphatase PSR2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PSR2 PE=1 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 6.0e-19
Identity = 63/190 (33.16%), Positives = 103/190 (54.21%), Query Frame = 1

Query: 92  EKLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTV 151
           +K  ++LDLDETL+                  +  K      F    + + +  V+ V V
Sbjct: 226 QKKCLILDLDETLV-----------------HSSFKYMHSADFVLPVEIDDQ--VHNVYV 285

Query: 152 FERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDH 211
            +RPG+ +FLN++    ++V+FTA +  YA PL+D +D       RL+R +  + E  ++
Sbjct: 286 IKRPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTIHHRLFREACYNYE-GNY 345

Query: 212 VKDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSL 271
           +K+LS + + L  T+I+DN+P S++  P + +P I       HD++LLD+I PLL+ LS 
Sbjct: 346 IKNLSQIGRPLSETIILDNSPASYIFHPQHAVP-ISSWFSDTHDNELLDII-PLLEDLSS 393

Query: 272 QN--DVRSVL 280
            N  DV SVL
Sbjct: 406 GNVLDVGSVL 393

BLAST of Cp4.1LG01g05410.1 vs. Swiss-Prot
Match: CNEP1_DROPS (CTD nuclear envelope phosphatase 1 homolog OS=Drosophila pseudoobscura pseudoobscura GN=l(1)G0269 PE=3 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 7.9e-19
Identity = 67/196 (34.18%), Positives = 101/196 (51.53%), Query Frame = 1

Query: 90  LMEKLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYV 149
           L+++ T+VLDLDETLI ++      A+ RN  ++ G        F+     +  P   +V
Sbjct: 57  LVQRKTLVLDLDETLIHSHHN----AMPRN-TVKPGTP----HDFTVKVTIDRNPVRFFV 116

Query: 150 TVFERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYR 209
              +RP +  FL+ +  + DLV+FTA +E Y   + D++D       R Y     + +Y 
Sbjct: 117 --HKRPHVDYFLDVVSQWYDLVVFTASMEIYGAAVADKLDNGRNILRRRYYRQHCTPDYG 176

Query: 210 DHVKDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHL 269
            + KDLS +  DL R  I+DN+P ++   P N IP I      P D+ LL  +LP+L  L
Sbjct: 177 SYTKDLSAICSDLNRIFIIDNSPGAYRCFPNNAIP-IKSWFSDPMDTALLS-LLPMLDAL 236

Query: 270 SLQNDVRSVLYERFHM 286
              NDVRSVL    H+
Sbjct: 237 RFTNDVRSVLSRNLHL 239

BLAST of Cp4.1LG01g05410.1 vs. TrEMBL
Match: A0A0A0KM39_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G203410 PE=4 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 1.4e-152
Identity = 273/296 (92.23%), Positives = 278/296 (93.92%), Query Frame = 1

Query: 1   MAELTQPDVYSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKPL 60
           MAELTQP+VYS  TL VWKTLLNWLAFFFQIFV IL AFGHLRLLSS  S  SSPSFKPL
Sbjct: 1   MAELTQPEVYSPGTLHVWKTLLNWLAFFFQIFVSILTAFGHLRLLSSHSS--SSPSFKPL 60

Query: 61  PVVELPDHEFLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAVVRNQ 120
           PVVELPDHE LAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPA++R+Q
Sbjct: 61  PVVELPDHELLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAIIRSQ 120

Query: 121 AIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLEGY 180
           AIEAGLK FELECFSSDKDCEGKPKVNYVTVFERPGLHDFL  LG FADLVLFTAGLEGY
Sbjct: 121 AIEAGLKSFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLKHLGGFADLVLFTAGLEGY 180

Query: 181 ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQPV 240
           ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRR VIVDNNPFSFLLQP 
Sbjct: 181 ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRVVIVDNNPFSFLLQPQ 240

Query: 241 NGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT 297
           NGIPCIPFSAGQPHDSQLLDVILPLL+HLS QNDVR VLYERFHMPEWFQKHGIPT
Sbjct: 241 NGIPCIPFSAGQPHDSQLLDVILPLLEHLSQQNDVRPVLYERFHMPEWFQKHGIPT 294

BLAST of Cp4.1LG01g05410.1 vs. TrEMBL
Match: W9S4E3_9ROSA (CTD small phosphatase-like protein OS=Morus notabilis GN=L484_007675 PE=4 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 1.5e-125
Identity = 232/297 (78.11%), Positives = 257/297 (86.53%), Query Frame = 1

Query: 2   AELTQPD--VYSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKP 61
           AELTQ    VYS RTLQVW+ LLNWL FFFQIF+QILRA GHL LLSSS S SSSPSFKP
Sbjct: 3   AELTQGGDVVYSPRTLQVWRALLNWLGFFFQIFLQILRALGHLPLLSSSSS-SSSPSFKP 62

Query: 62  LPVVELPDHEFLAASAVDIASVEDEEESDGLMEK-LTIVLDLDETLICAYETSTLPAVVR 121
           LP +ELP+H+  A SAV IA++ D +  D   EK LT+VLDLDETL+CAYETS+LP  VR
Sbjct: 63  LPNIELPEHDLPADSAVHIAALHDSDSDDVSEEKKLTVVLDLDETLVCAYETSSLPPGVR 122

Query: 122 NQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLE 181
           NQA EAGLK FELEC SSDK+ EG+PK+N+VTVFERPGLH+FL QL  FADLVLFTAGLE
Sbjct: 123 NQATEAGLKWFELECVSSDKEFEGRPKINHVTVFERPGLHEFLKQLSEFADLVLFTAGLE 182

Query: 182 GYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQ 241
           GYARPLVDRIDE N FSLRLYRPSTISTE+R+HVKDLSC+SKD+ R+VIVDNNPFSFLLQ
Sbjct: 183 GYARPLVDRIDEGNLFSLRLYRPSTISTEFREHVKDLSCISKDMCRSVIVDNNPFSFLLQ 242

Query: 242 PVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIP 296
           PVNGIPCIPFSAGQPHD+QLLDV+LPLLK LSLQ DVR +L+ERFHMPEWFQK GIP
Sbjct: 243 PVNGIPCIPFSAGQPHDTQLLDVLLPLLKDLSLQKDVRPMLHERFHMPEWFQKQGIP 298

BLAST of Cp4.1LG01g05410.1 vs. TrEMBL
Match: V4SMH1_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031856mg PE=4 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 4.5e-122
Identity = 225/300 (75.00%), Positives = 258/300 (86.00%), Query Frame = 1

Query: 1   MAELTQPDV-YSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKP 60
           MAELTQ +V YS R++QVW+TLLNWLAFFFQIF +ILRA GH  LLSSS S +S+ +FKP
Sbjct: 76  MAELTQAEVAYSHRSIQVWRTLLNWLAFFFQIFAKILRALGHHPLLSSSAS-ASTHAFKP 135

Query: 61  LPVVELPDHEFLAASAVDIASVEDEEE---SDGLMEKLTIVLDLDETLICAYETSTLPAV 120
           L VVELP+ +  +++ VDI +V D  +   S+  ++KLT+VLDLDETL+CAYETS+LP  
Sbjct: 136 LQVVELPETD--SSATVDIGAVRDSGDDVVSEERLQKLTVVLDLDETLVCAYETSSLPVT 195

Query: 121 VRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAG 180
           +RNQA  AGLK FE+EC SSDK+CEGKPK+N+VTVFERPGL +FL QL  FADL+LFTAG
Sbjct: 196 LRNQATGAGLKWFEMECLSSDKECEGKPKINHVTVFERPGLREFLKQLSEFADLILFTAG 255

Query: 181 LEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFL 240
           LEGYARPLVDRID EN FSLRLYRPST STEYR+HVKDLSCLSKDL RTVIVDNNPFSFL
Sbjct: 256 LEGYARPLVDRIDGENLFSLRLYRPSTTSTEYREHVKDLSCLSKDLCRTVIVDNNPFSFL 315

Query: 241 LQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT 297
           LQP+NGIPCIPFSAGQPHD+QLL+V+LPLLKHLSLQ DVR  LYERFHMPEWFQK GIPT
Sbjct: 316 LQPLNGIPCIPFSAGQPHDNQLLNVLLPLLKHLSLQKDVRPELYERFHMPEWFQKQGIPT 372

BLAST of Cp4.1LG01g05410.1 vs. TrEMBL
Match: A0A061FM78_THECC (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_042494 PE=4 SV=1)

HSP 1 Score: 444.5 bits (1142), Expect = 1.0e-121
Identity = 228/306 (74.51%), Positives = 254/306 (83.01%), Query Frame = 1

Query: 1   MAELTQPDVYSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPS---- 60
           MAELTQ +VYS R++QVW+ LLNWLAFFFQIF QI+RA G   LLSSS S SSS S    
Sbjct: 1   MAELTQAEVYSPRSMQVWRALLNWLAFFFQIFAQIIRAVGQYPLLSSSSSSSSSSSSTTS 60

Query: 61  -----FKPLPVVELPDHEFLAASAVDIASVEDEE--ESDGLMEKLTIVLDLDETLICAYE 120
                FKPLPV +  + E  + + V+IA+V D      +  +EKLT+VLDLDETL+CAYE
Sbjct: 61  SSPHRFKPLPVDDSTEIE--SPATVEIAAVLDSSVLADEDSVEKLTVVLDLDETLVCAYE 120

Query: 121 TSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFAD 180
           TS+LP  +RNQA +AGLK FELEC SSDK+ EGKPK+NYVTVFERPGL +FLNQL  FA+
Sbjct: 121 TSSLPPALRNQATDAGLKWFELECVSSDKEFEGKPKINYVTVFERPGLQEFLNQLSEFAE 180

Query: 181 LVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVD 240
           LVLFTAGLEGYARPLVDRID ENRFSLRLYRPSTISTEYR+HVKDLSCLSKDL RTVIVD
Sbjct: 181 LVLFTAGLEGYARPLVDRIDAENRFSLRLYRPSTISTEYREHVKDLSCLSKDLCRTVIVD 240

Query: 241 NNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWF 296
           NNPFSFLLQP+NGIPCIPFSAGQPHD+QLLDV+LPLLKHLS Q DVRSVLYERF MPEWF
Sbjct: 241 NNPFSFLLQPLNGIPCIPFSAGQPHDTQLLDVLLPLLKHLSQQKDVRSVLYERFRMPEWF 300

BLAST of Cp4.1LG01g05410.1 vs. TrEMBL
Match: A0A067FZR4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022258mg PE=4 SV=1)

HSP 1 Score: 444.5 bits (1142), Expect = 1.0e-121
Identity = 224/300 (74.67%), Positives = 258/300 (86.00%), Query Frame = 1

Query: 1   MAELTQPDV-YSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKP 60
           MAELTQ +V YS R++QVW+TLLNWLAFFFQIF +ILRA GH  LLSSS S +S+ +FKP
Sbjct: 1   MAELTQAEVAYSHRSIQVWRTLLNWLAFFFQIFAKILRALGHHPLLSSSAS-ASTHAFKP 60

Query: 61  LPVVELPDHEFLAASAVDIASVEDEEE---SDGLMEKLTIVLDLDETLICAYETSTLPAV 120
           L VVELP+ +  +++ VDI +V D  +   S+  ++KLT+VLDLDETL+CAYETS+LP  
Sbjct: 61  LQVVELPETD--SSATVDIGAVRDSGDDVVSEERLQKLTVVLDLDETLVCAYETSSLPVT 120

Query: 121 VRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAG 180
           +RNQA  AGLK FE+EC SSDK+CEGKPK+N+VTVFERPGL +FL QL  FADL+LFTAG
Sbjct: 121 LRNQATGAGLKWFEMECLSSDKECEGKPKINHVTVFERPGLREFLKQLSEFADLILFTAG 180

Query: 181 LEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFL 240
           LEGYARPLVDRID EN FSLRLYRPST STEYR+HVKDLSCLSKDL RT+IVDNNPFSFL
Sbjct: 181 LEGYARPLVDRIDGENLFSLRLYRPSTTSTEYREHVKDLSCLSKDLCRTLIVDNNPFSFL 240

Query: 241 LQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT 297
           LQP+NGIPCIPFSAGQPHD+QLL+V+LPLLKHLSLQ DVR  LYERFHMPEWFQK GIPT
Sbjct: 241 LQPLNGIPCIPFSAGQPHDNQLLNVLLPLLKHLSLQKDVRPELYERFHMPEWFQKQGIPT 297

BLAST of Cp4.1LG01g05410.1 vs. TAIR10
Match: AT3G55960.1 (AT3G55960.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 405.6 bits (1041), Expect = 2.6e-113
Identity = 207/300 (69.00%), Positives = 244/300 (81.33%), Query Frame = 1

Query: 1   MAELTQPDV-YSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKP 60
           MAELTQ DV YS R+ QVWKTL+NWLAFF+QIF+QILRA G+  LLSSS + +S+  FKP
Sbjct: 1   MAELTQADVVYSPRSFQVWKTLVNWLAFFYQIFLQILRAVGYHPLLSSS-AKASADGFKP 60

Query: 61  LPVVELPDHEFLAASAVDIASVEDEEE-SDGL---MEKLTIVLDLDETLICAYETSTLPA 120
           LP +EL D    + + V+IA+    +  SDG     ++L +VLDLDETL+CAYETS+LPA
Sbjct: 61  LPAIELLDRASESPTTVEIAATTTSDSCSDGARSRFQRLKVVLDLDETLVCAYETSSLPA 120

Query: 121 VVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTA 180
            +RNQAIEAGLK FELEC S+DK+ +GKPK+NYVTVFERPGLH+FL QL  FADL+LFTA
Sbjct: 121 ALRNQAIEAGLKWFELECLSTDKEYDGKPKINYVTVFERPGLHEFLEQLSEFADLILFTA 180

Query: 181 GLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSF 240
           GLEGYARPLVDRID     + RLYRPST+ST+YRDHVKDL   SK++ RTVIVDNNPFSF
Sbjct: 181 GLEGYARPLVDRIDTRKVLTNRLYRPSTVSTQYRDHVKDLLSTSKNMCRTVIVDNNPFSF 240

Query: 241 LLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIP 296
           LLQP NGIPCI FSAGQP+D+QLLDVILPLLK LS ++DVR  LY+RF MPEWF+K GIP
Sbjct: 241 LLQPSNGIPCIAFSAGQPNDTQLLDVILPLLKQLSEEDDVRPTLYDRFRMPEWFEKQGIP 299

BLAST of Cp4.1LG01g05410.1 vs. TAIR10
Match: AT1G29780.1 (AT1G29780.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 100.9 bits (250), Expect = 1.4e-21
Identity = 65/171 (38.01%), Positives = 97/171 (56.73%), Query Frame = 1

Query: 93  KLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVF 152
           K TI+LDLDETL+ A  T+ LP V  +  +   ++                 ++  + V 
Sbjct: 49  KRTIILDLDETLVHA--TTHLPGVKHDFMVMVKME----------------REIMPIFVV 108

Query: 153 ERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDHV 212
           +RPG+ +FL +LG    +V+FTAGLE YA  ++D++D+    S RLYR S      + +V
Sbjct: 109 KRPGVTEFLERLGENYKVVVFTAGLEEYASQVLDKLDKNGVISQRLYRDSCTEVNGK-YV 168

Query: 213 KDLS-CLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVI 263
           KDLS  + KDLR  +IVD+NP S+ LQP NG+P   F      D +LL+++
Sbjct: 169 KDLSLVVGKDLRSALIVDDNPSSYSLQPENGVPIKAF-VDDLKDQELLNLV 199

BLAST of Cp4.1LG01g05410.1 vs. TAIR10
Match: AT5G45700.1 (AT5G45700.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 91.7 bits (226), Expect = 8.4e-19
Identity = 61/159 (38.36%), Positives = 82/159 (51.57%), Query Frame = 1

Query: 93  KLTIVLDLDETLI-CAYETSTLPA-VVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVT 152
           K TIVLDLDETL+  + E   +P   V N  I+  +  F                     
Sbjct: 97  KKTIVLDLDETLVHSSMEKPEVPYDFVVNPKIDGQILTF--------------------F 156

Query: 153 VFERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRD 212
           V +RPG+ +FL ++G    +V+FTAGL  YA  ++D++D E R   R +     S     
Sbjct: 157 VIKRPGVDEFLKKIGEKYQIVVFTAGLREYASLVLDKLDPERRVISRSFYRDACSEIDGR 216

Query: 213 HVKDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFS 250
            VKDL  + +DLRR VIVD+NP S+ LQP N  P  PFS
Sbjct: 217 LVKDLGFVMRDLRRVVIVDDNPNSYALQPENAFPIKPFS 235

BLAST of Cp4.1LG01g05410.1 vs. TAIR10
Match: AT5G46410.2 (AT5G46410.2 SCP1-like small phosphatase 4)

HSP 1 Score: 88.6 bits (218), Expect = 7.1e-18
Identity = 69/193 (35.75%), Positives = 101/193 (52.33%), Query Frame = 1

Query: 92  EKLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTV 151
           + +T+VLDLDETL+     STL +          +  F    F + ++       N V V
Sbjct: 282 KSVTLVLDLDETLV----HSTLES--------CNVADFSFRVFFNMQE-------NTVYV 341

Query: 152 FERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRF-SLRLYRPSTISTEYRD 211
            +RP L+ FL ++G    +V+FTA    YA  L+D +D + +F S R YR S I  +   
Sbjct: 342 RQRPHLYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYRDSCILLD-GI 401

Query: 212 HVKDLSCLSKDLRRTVIVDNNPFSFLLQPVNGIPCIPFSAGQPHDSQLLDVILPLLKHLS 271
           + KDL+ L  DL +  I+DN P  + LQ  NGIP I      P D  L+  ILP L+ L+
Sbjct: 402 YTKDLTVLGLDLAKVAIIDNCPQVYRLQINNGIP-IKSWYDDPTDDGLI-TILPFLETLA 452

Query: 272 LQNDVRSVLYERF 284
           + +DVR ++  RF
Sbjct: 462 VADDVRPIIGRRF 452

BLAST of Cp4.1LG01g05410.1 vs. TAIR10
Match: AT1G29770.1 (AT1G29770.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 87.4 bits (215), Expect = 1.6e-17
Identity = 59/158 (37.34%), Positives = 86/158 (54.43%), Query Frame = 1

Query: 92  EKLTIVLDLDETLICAYETSTLPAVVRNQAIEAGLKCFELECFSSDKDCEGKPKVNYVTV 151
           +K TI LDLDETL+    ++  P +  N      +K             EG   V  + V
Sbjct: 101 KKRTIFLDLDETLV---HSTMEPPIRVNVDFMVRIKI------------EGA--VIPMFV 160

Query: 152 FERPGLHDFLNQLGSFADLVLFTAGLEGYARPLVDRIDEENRFSLRLYRPSTISTEYRDH 211
            +RPG+ +FL ++     + +FTAGL  YA  ++D++D+    S RLYR S      R +
Sbjct: 161 VKRPGVTEFLERISKNYRVAIFTAGLPEYASQVLDKLDKNRVISQRLYRDSCTEVNGR-Y 220

Query: 212 VKDLSCLSK-DLRRTVIVDNNPFSFLLQPVNGIPCIPF 249
            KDLS ++K DL   ++VD+NPFS+ LQP NG+P  PF
Sbjct: 221 AKDLSLVAKNDLGSVLLVDDNPFSYSLQPDNGVPIKPF 240

BLAST of Cp4.1LG01g05410.1 vs. NCBI nr
Match: gi|449463020|ref|XP_004149232.1| (PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 [Cucumis sativus])

HSP 1 Score: 547.0 bits (1408), Expect = 2.1e-152
Identity = 273/296 (92.23%), Positives = 278/296 (93.92%), Query Frame = 1

Query: 1   MAELTQPDVYSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKPL 60
           MAELTQP+VYS  TL VWKTLLNWLAFFFQIFV IL AFGHLRLLSS  S  SSPSFKPL
Sbjct: 1   MAELTQPEVYSPGTLHVWKTLLNWLAFFFQIFVSILTAFGHLRLLSSHSS--SSPSFKPL 60

Query: 61  PVVELPDHEFLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAVVRNQ 120
           PVVELPDHE LAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPA++R+Q
Sbjct: 61  PVVELPDHELLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAIIRSQ 120

Query: 121 AIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLEGY 180
           AIEAGLK FELECFSSDKDCEGKPKVNYVTVFERPGLHDFL  LG FADLVLFTAGLEGY
Sbjct: 121 AIEAGLKSFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLKHLGGFADLVLFTAGLEGY 180

Query: 181 ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQPV 240
           ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRR VIVDNNPFSFLLQP 
Sbjct: 181 ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRVVIVDNNPFSFLLQPQ 240

Query: 241 NGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT 297
           NGIPCIPFSAGQPHDSQLLDVILPLL+HLS QNDVR VLYERFHMPEWFQKHGIPT
Sbjct: 241 NGIPCIPFSAGQPHDSQLLDVILPLLEHLSQQNDVRPVLYERFHMPEWFQKHGIPT 294

BLAST of Cp4.1LG01g05410.1 vs. NCBI nr
Match: gi|659126551|ref|XP_008463241.1| (PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 [Cucumis melo])

HSP 1 Score: 545.8 bits (1405), Expect = 4.6e-152
Identity = 272/296 (91.89%), Positives = 279/296 (94.26%), Query Frame = 1

Query: 1   MAELTQPDVYSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKPL 60
           MAELTQP+VYS  TLQVWKTLLNWLAFFFQIFV IL AFGHLRLLSS  S  SSPSFKPL
Sbjct: 1   MAELTQPEVYSPGTLQVWKTLLNWLAFFFQIFVSILTAFGHLRLLSSHSS--SSPSFKPL 60

Query: 61  PVVELPDHEFLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAVVRNQ 120
           PVVELPDHE LAASAVDIASVEDE ESDGLMEKLTIVLDLDETLICAYETSTLPA++R+Q
Sbjct: 61  PVVELPDHELLAASAVDIASVEDEGESDGLMEKLTIVLDLDETLICAYETSTLPAIIRSQ 120

Query: 121 AIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLEGY 180
           AIEAGLK FELEC+SSDKDCEGKPKVNYVTVFERPGLHDFL  LG FADLVLFTAGLEGY
Sbjct: 121 AIEAGLKSFELECYSSDKDCEGKPKVNYVTVFERPGLHDFLKHLGGFADLVLFTAGLEGY 180

Query: 181 ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQPV 240
           ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRR VIVDNNPFSFLLQP+
Sbjct: 181 ARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRIVIVDNNPFSFLLQPL 240

Query: 241 NGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT 297
           NGIPCIPFSAGQPHDSQLLDVILPLL+HLS QNDVR VLYERFHMPEWFQKHGIPT
Sbjct: 241 NGIPCIPFSAGQPHDSQLLDVILPLLEHLSQQNDVRPVLYERFHMPEWFQKHGIPT 294

BLAST of Cp4.1LG01g05410.1 vs. NCBI nr
Match: gi|703117626|ref|XP_010101420.1| (CTD small phosphatase-like protein [Morus notabilis])

HSP 1 Score: 457.2 bits (1175), Expect = 2.2e-125
Identity = 232/297 (78.11%), Positives = 257/297 (86.53%), Query Frame = 1

Query: 2   AELTQPD--VYSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDSSSPSFKP 61
           AELTQ    VYS RTLQVW+ LLNWL FFFQIF+QILRA GHL LLSSS S SSSPSFKP
Sbjct: 3   AELTQGGDVVYSPRTLQVWRALLNWLGFFFQIFLQILRALGHLPLLSSSSS-SSSPSFKP 62

Query: 62  LPVVELPDHEFLAASAVDIASVEDEEESDGLMEK-LTIVLDLDETLICAYETSTLPAVVR 121
           LP +ELP+H+  A SAV IA++ D +  D   EK LT+VLDLDETL+CAYETS+LP  VR
Sbjct: 63  LPNIELPEHDLPADSAVHIAALHDSDSDDVSEEKKLTVVLDLDETLVCAYETSSLPPGVR 122

Query: 122 NQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLE 181
           NQA EAGLK FELEC SSDK+ EG+PK+N+VTVFERPGLH+FL QL  FADLVLFTAGLE
Sbjct: 123 NQATEAGLKWFELECVSSDKEFEGRPKINHVTVFERPGLHEFLKQLSEFADLVLFTAGLE 182

Query: 182 GYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQ 241
           GYARPLVDRIDE N FSLRLYRPSTISTE+R+HVKDLSC+SKD+ R+VIVDNNPFSFLLQ
Sbjct: 183 GYARPLVDRIDEGNLFSLRLYRPSTISTEFREHVKDLSCISKDMCRSVIVDNNPFSFLLQ 242

Query: 242 PVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIP 296
           PVNGIPCIPFSAGQPHD+QLLDV+LPLLK LSLQ DVR +L+ERFHMPEWFQK GIP
Sbjct: 243 PVNGIPCIPFSAGQPHDTQLLDVLLPLLKDLSLQKDVRPMLHERFHMPEWFQKQGIP 298

BLAST of Cp4.1LG01g05410.1 vs. NCBI nr
Match: gi|657944345|ref|XP_008374036.1| (PREDICTED: CTD small phosphatase-like protein [Malus domestica])

HSP 1 Score: 448.0 bits (1151), Expect = 1.3e-122
Identity = 228/298 (76.51%), Positives = 253/298 (84.90%), Query Frame = 1

Query: 1   MAELTQPDV-YSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDS-SSPSFK 60
           MAELTQ +V YS +TLQVW+ LLNWL FFFQIF++ILRA GH  LLSSS S S SS +FK
Sbjct: 1   MAELTQGEVVYSPKTLQVWRALLNWLGFFFQIFLRILRALGHHPLLSSSSSSSASSAAFK 60

Query: 61  PLPVVELPDHEFLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAVVR 120
            LPV+ELP+++  AASAV IA   D + SD   EKLT+VLDLDETL+CAYETS+LPAVVR
Sbjct: 61  SLPVIELPENDSPAASAVVIADTPDSD-SDDPFEKLTVVLDLDETLVCAYETSSLPAVVR 120

Query: 121 NQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLE 180
            QA E G+K FELEC SSDK+C+GKPKVNYVTVFERPGLHDFL Q+  FA+LVLFTAGLE
Sbjct: 121 TQATEGGMKWFELECVSSDKECDGKPKVNYVTVFERPGLHDFLKQVSQFAELVLFTAGLE 180

Query: 181 GYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQ 240
           GYARPLVDRID +N FS RLYRPST STEYR+HVKDLS LSKD+RR VIVDNNPFSFLLQ
Sbjct: 181 GYARPLVDRIDVDNLFSCRLYRPSTTSTEYREHVKDLSGLSKDMRRIVIVDNNPFSFLLQ 240

Query: 241 PVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT 297
           P NGIPCIPFSAGQ HD+QLLDV+LPLLK LSLQ DVR  LYERFHMPEWFQK GIP+
Sbjct: 241 PSNGIPCIPFSAGQTHDTQLLDVLLPLLKQLSLQKDVRPALYERFHMPEWFQKQGIPS 297

BLAST of Cp4.1LG01g05410.1 vs. NCBI nr
Match: gi|694318956|ref|XP_009344958.1| (PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1-like [Pyrus x bretschneideri])

HSP 1 Score: 446.8 bits (1148), Expect = 2.9e-122
Identity = 227/298 (76.17%), Positives = 252/298 (84.56%), Query Frame = 1

Query: 1   MAELTQPDV-YSLRTLQVWKTLLNWLAFFFQIFVQILRAFGHLRLLSSSDSDS-SSPSFK 60
           MAELTQ +V YS +TLQVW+ LLNW  FFFQIF++ILRA GH  LLSSS S S SS +FK
Sbjct: 1   MAELTQGEVVYSPKTLQVWRALLNWFGFFFQIFLRILRALGHRPLLSSSSSSSASSAAFK 60

Query: 61  PLPVVELPDHEFLAASAVDIASVEDEEESDGLMEKLTIVLDLDETLICAYETSTLPAVVR 120
            LPV+ELP+++  AASAV IA   D + SD   EKLT+VLDLDETL+CAYETS+LPAVVR
Sbjct: 61  SLPVIELPENDSPAASAVVIADTPDSD-SDDPFEKLTVVLDLDETLVCAYETSSLPAVVR 120

Query: 121 NQAIEAGLKCFELECFSSDKDCEGKPKVNYVTVFERPGLHDFLNQLGSFADLVLFTAGLE 180
            QA E G+K FELEC SSDK+C+GKPKVNYVTVFERPGLHDFL Q+  FA+LVLFTAGLE
Sbjct: 121 TQATEGGMKWFELECVSSDKECDGKPKVNYVTVFERPGLHDFLKQVSQFAELVLFTAGLE 180

Query: 181 GYARPLVDRIDEENRFSLRLYRPSTISTEYRDHVKDLSCLSKDLRRTVIVDNNPFSFLLQ 240
           GYARPLVDRID +N FS RLYRPST STEYR+HVKDLS LSKD+RR VIVDNNPFSFLLQ
Sbjct: 181 GYARPLVDRIDVDNLFSCRLYRPSTTSTEYREHVKDLSGLSKDMRRIVIVDNNPFSFLLQ 240

Query: 241 PVNGIPCIPFSAGQPHDSQLLDVILPLLKHLSLQNDVRSVLYERFHMPEWFQKHGIPT 297
           P NGIPCIPFSAGQ HD+QLLDV+LPLLK LSLQ DVR  LYERFHMPEWFQK GIP+
Sbjct: 241 PSNGIPCIPFSAGQTHDTQLLDVLLPLLKQLSLQKDVRPALYERFHMPEWFQKQGIPS 297

The following BLAST results are available for this feature:
Match Name     E-value    Identity    Description
CTDS1_MOUSE    1.2e-19    34.57       Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 OS=M...
CTDS1_HUMAN    1.2e-19    34.57       Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 OS=H...
YA22_SCHPO     4.6e-19    33.16       Uncharacterized protein C2F7.02c OS=Schizosaccharomyces pombe (strain 972 / ATCC...
PSR2_YEAST     6.0e-19    33.16       Probable phosphatase PSR2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288...
CNEP1_DROPS    7.9e-19    34.18       CTD nuclear envelope phosphatase 1 homolog OS=Drosophila pseudoobscura pseudoobs...
Match Name          E-value     Identity    Description
A0A0A0KM39_CUCSA    1.4e-152    92.23       Uncharacterized protein OS=Cucumis sativus GN=Csa_5G203410 PE=4 SV=1
W9S4E3_9ROSA        1.5e-125    78.11       CTD small phosphatase-like protein OS=Morus notabilis GN=L484_007675 PE=4 SV=1
V4SMH1_9ROSI        4.5e-122    75.00       Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031856mg PE=4 SV=1
A0A061FM78_THECC    1.0e-121    74.51       Haloacid dehalogenase-like hydrolase (HAD) superfamily protein isoform 1 OS=Theo...
A0A067FZR4_CITSI    1.0e-121    74.67       Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022258mg PE=4 SV=1
Match Name     E-value     Identity    Description
AT3G55960.1    2.6e-113    69.00       Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT1G29780.1    1.4e-21     38.01       Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT5G45700.1    8.4e-19     38.36       Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT5G46410.2    7.1e-18     35.75       SCP1-like small phosphatase 4
AT1G29770.1    1.6e-17     37.34       Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
Match Name                          E-value     Identity    Description
gi|449463020|ref|XP_004149232.1|    2.1e-152    92.23       PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A small phospha...
gi|659126551|ref|XP_008463241.1|    4.6e-152    91.89       PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A small phospha...
gi|703117626|ref|XP_010101420.1|    2.2e-125    78.11       CTD small phosphatase-like protein [Morus notabilis]
gi|657944345|ref|XP_008374036.1|    1.3e-122    76.51       PREDICTED: CTD small phosphatase-like protein [Malus domestica]
gi|694318956|ref|XP_009344958.1|    2.9e-122    76.17       PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A small phospha...
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term          Definition
GO:0016791    phosphatase activity
Vocabulary: INTERPRO
Term          Definition
IPR023214     HAD_sf
IPR011948     Dullard_phosphatase
IPR004274     FCP1_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0006470        protein dephosphorylation
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0016791        phosphatase activity
molecular_function    GO:0004721        phosphoprotein phosphatase activity

This mRNA is a part of the following gene feature(s):

Feature Name       Unique Name        Type
Cp4.1LG01g05410    Cp4.1LG01g05410    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name         Unique Name                  Type
Cp4.1LG01g05410.1    Cp4.1LG01g05410.1-protein    polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name                 Unique Name                  Type
Cp4.1LG01g05410.1:cds:006    Cp4.1LG01g05410.1:cds:006    CDS
Cp4.1LG01g05410.1:cds:005    Cp4.1LG01g05410.1:cds:005    CDS
Cp4.1LG01g05410.1:cds:004    Cp4.1LG01g05410.1:cds:004    CDS
Cp4.1LG01g05410.1:cds:003    Cp4.1LG01g05410.1:cds:003    CDS
Cp4.1LG01g05410.1:cds:002    Cp4.1LG01g05410.1:cds:002    CDS
Cp4.1LG01g05410.1:cds:001    Cp4.1LG01g05410.1:cds:001    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                             Unique Name                              Type
Cp4.1LG01g05410.1:three_prime_utr:001    Cp4.1LG01g05410.1:three_prime_utr:001    three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                           Source      Source Term           Source Description                                                Alignment
IPR004274    FCP1 homology domain                      PFAM        PF03031               NIF                                                               coord: 94..276, score: 8.8
IPR004274    FCP1 homology domain                      SMART       SM00577               forpap2                                                           coord: 92..255, score: 6.0
IPR004274    FCP1 homology domain                      PROFILE     PS50969               FCP1                                                              coord: 89..269, score: 34
IPR011948    Dullard phosphatase domain, eukaryotic    TIGRFAMs    TIGR02251             TIGR02251                                                         coord: 93..279, score: 1.8
IPR023214    HAD-like domain                           GENE3D      G3DSA:3.40.50.1000    -                                                                 coord: 91..279, score: 2.2
IPR023214    HAD-like domain                           unknown     SSF56784              HAD-like                                                          coord: 92..280, score: 2.57
None         No IPR available                          PANTHER     PTHR12210             NUCLEAR LIM INTERACTOR-INTERACTING FACTOR-RELATED                 coord: 140..296, score: 1.3E-164; coord: 1..120, score: 1.3E
None         No IPR available                          PANTHER     PTHR12210:SF57        HALOACID DEHALOGENASE-LIKE HYDROLASE (HAD) SUPERFAMILY PROTEIN    coord: 140..296, score: 1.3E-164; coord: 1..120, score: 1.3E
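
The domain-level hits in the InterPro table (all rows except the two PANTHER family assignments) fall on nearly the same stretch of the protein. As a rough sketch, the following parses the `coord: start..end` entries and reports the outer envelope and the region shared by every hit. The choice of which rows to include and the `spans` helper are illustrative assumptions; the coordinates are taken as residue positions in the 297-residue query.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

# Domain hits copied from the table above as (source, alignment text).
hits = [
    ("PFAM", "coord: 94..276"),
    ("SMART", "coord: 92..255"),
    ("PROFILE", "coord: 89..269"),
    ("TIGRFAMs", "coord: 93..279"),
    ("GENE3D", "coord: 91..279"),
    ("SSF56784", "coord: 92..280"),
]


def spans(rows):
    """Extract (source, start, end) for every coord entry in each row."""
    out = []
    for source, text in rows:
        for start, end in COORD.findall(text):
            out.append((source, int(start), int(end)))
    return out


parsed = spans(hits)
# Smallest interval that contains every hit.
envelope = (min(s for _, s, _ in parsed), max(e for _, _, e in parsed))
# Interval covered by all hits at once.
core = (max(s for _, s, _ in parsed), min(e for _, _, e in parsed))
print(envelope, core)  # (89, 280) (94, 255)
```

The envelope, residues 89 to 280, overlaps the query range covered by the more distant Swiss-Prot alignments (roughly residues 90 to 286), which is consistent with the phosphatase domain being the conserved part of the protein.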