Cp4.1LG20g08550 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g08550
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionmethyl-CpG-binding domain-containing protein 5-like
LocationCp4.1LG20: 7417591 .. 7420362 (+)
RNA-Seq ExpressionCp4.1LG20g08550
SyntenyCp4.1LG20g08550
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CACTTTCCCTGTTTGCTTCTTCGTCTTCGTCTTGCAAAGCCCCGACCCACATGTCTGCTTTCGGAACTTCGATATCGGACCCCGACTGGCCCCGCCGAGATCCGACCCGATTCAACTTGCCTTTCTCCAACCACCTTCCGCCGGACCCTCTCCTCGACGCCGGCTCCTTCATCGACTCCTCCGCCGCCCGATGTGGAACCCTTCCTTCTTCCCCTAACCACTCCCGAGATCTTGACCTGAATCATCGCAAGAAGGCGGAGACCTCTTCCACTCTCCGATCTCCATTCCGCGATGACTCTCTCAACGCTGGAAATGGAGATTCTCCCGCAACTAAGAAGCTCTCGGGCTCCAGAGGGACTGACGCGTCTCCGAAATCCGCTTCGTCCGAGAAGCCTAATTGGTTGCCGCCGGGCTGGATTATGGAAGATAGAGTTCGCACCTCTGGTGCTACAGCTGGAACAGTCGATAAGGTGAATTGATTCCTTCTCTTTTGACTAGTATCATTTCGATGTTGAATTTTGAGTTCCTTTTCTTGACTGGCGTTCTTGAGTTTTGGAAGATCGTGTTCAAAGTGGAGATGAGTCAGTTTTGGTAATTGTTCGACCATGCAAGCAATATTCTGTTCCATGGCTCTGGTTACTGAACTTCTCTATGCAATTTAATGTCCGATGATAGAACAGGGCTTTACCGTCGCTGGAATTTTTTTTCTTCTACTCTAGCCGTTGGACTCTTGGTTTAGAATTTCTACCAGTCAGGAATGATTTTTCGGATGAAAACATGAACCAATTAGAGAAATTATCATTGTTTACTGAAAATGATCTTTTGTTTTTGCTTCGAGGAGTTGGTTGTAGTAGCATTCCCTAGTCGATTGTTCTCTCATCAGTTATGGGAAAGAAGAACTGTATCAAGTAGTAAAGGGTTTAGTAATTACTTCTGGAATGTGATCATCCGAAAAATGAATCACTACCTAAATTCTCTGTCGTATTGATGGGGATTGCAGTTTTAGTTACCTGATTGGGAATTTTGAAGTTCCGTAGGAACAAGTAAACTACAAGAATTCATTTCCTTCCTCTCTAAATTTGTTTAGGGAATTCATTTTTCATTTATTTGGTATAGTATTTAAGTATGTCCATGATTTCGAGTGAGTTCATCTGACAATCCGAGTGATTCCGTCGTGCAGTTACGATATCGAACTTTCTTAGTCCTTTTCTCTGGATGGTGAACAAGCCTCGACGAACTTTGATATAGACTTAGGTGCTTCTTTGGATTTTGGAACTCTAGCTACTTACCATTTATATGAAATCAATTAGCATAACAGAGCTTGGTTGATTAGGATGGAGACTATCATTGGAAAAAGAATAACAAAAACCTCCACTGTTATGTTAGCCTGTTCATAAATGATGAGGTTCAAATTGTAGGATAGTATTTGGAACTGCCCAAGCCCACCGCTAGCAAATACTGTCTCTTTAGCCCGTTACGTATCATTGTCAGCCTCACGATTTAAAACGCGTCTACTAGGTAGAGGTTTCCATACCCTTATAAGAAATGTTTTGTTCCCTTCTCTAACCAATGTGGGATCTCACAATCCACCCCCTTAGGGCCCAACGTCTTGCTGGAACACCGCTCGGTGTTTGCTTTGATACCATTTGTAACAACCCAAGCTCCTGCTAGCAAATATTGTCCGCTTTGGCCCGTTACTTATCACCTTTATAAGGAATGTTTCGTTTCCTTCTCCAACCAATATGGGATCTCACCTCTGTATGCATGAGGGTGTACAGAGGTGAATAGCTCTTGACTCCATACTTGGACTGTTGCTTGGTGCTTGATATATGATGCTTCACTAAGAATCATGTTTGAGATGGATTATACCTTAACTTTGTATTCATATATAGCTCTGTTTAAAAGCACATTGACATTGTCAATAGACAAAACAAATGCAAGCAATTAAGTACGGTCACAAGTACCATTTAGCATTTAATGATTGACTACGAGTTCGAGTGTCGTTTGAATTGAGTGTTCTTTTTTTTTTTTCATGTCATGTTGAATTAGTTGGGAGCATGCTTGGGCCCATTTATTTCTTGAGATTCCCCTTTCTTATTATCCTGCTTTACATATTTATTAAACAAACAAACAGTAGAGAATGTCAAAGGAATCTAATCCATCCTCATGTTTCTTGCAGTACTACTTTGACCCAGTTTCAGGTCGTCGATTCCGGTCTAAGATTGAGGTTCTTTACTTTCTGGAAACAGGAACATTAAGGAAACGTAAGAAATCGTTAGATGGTAATGGGACGGTGAGTCGAAAGCTAACATTCCTTCTTTTGCACCCAATGGGATATTCAGACCTTCTCATACGTTTTATCGCTTCGATTTACAATTGCAGTCTGCTGACGGCCCCGAAGAACCCAAGAACAAAAAATCTTCCAACAATGCTAAATCTGCTCCTTTAAACTTTGATTTCTTCAATGTGCCTGAAAAGGTTGAGTGGGTTCTCACAGACCCTTCTCAGGAGGCTTGGACGCCGTTCATCAACAACGAGAAAGTACCCCAATCTACTAAACGCGAATGGGTAGCAGCATTTAAACTCCTCAGCCGATCAAAAGCTTGAGGTTTGAAGGCAATTTCATCCCTTTTGGACACTGTTTGGGATACCCATTCAGGTCAGAACCGTCCTCCCTCCCTCTAATGGCTTCCTATCTCCAATTCTTTTGGTCTTTTTTTTGGATGGAGCAGGTGAGGTTGAGGGGGAAGTGTTGTCAAAACCAGATTTGGT

mRNA sequence

CACTTTCCCTGTTTGCTTCTTCGTCTTCGTCTTGCAAAGCCCCGACCCACATGTCTGCTTTCGGAACTTCGATATCGGACCCCGACTGGCCCCGCCGAGATCCGACCCGATTCAACTTGCCTTTCTCCAACCACCTTCCGCCGGACCCTCTCCTCGACGCCGGCTCCTTCATCGACTCCTCCGCCGCCCGATGTGGAACCCTTCCTTCTTCCCCTAACCACTCCCGAGATCTTGACCTGAATCATCGCAAGAAGGCGGAGACCTCTTCCACTCTCCGATCTCCATTCCGCGATGACTCTCTCAACGCTGGAAATGGAGATTCTCCCGCAACTAAGAAGCTCTCGGGCTCCAGAGGGACTGACGCGTCTCCGAAATCCGCTTCGTCCGAGAAGCCTAATTGGTTGCCGCCGGGCTGGATTATGGAAGATAGAGTTCGCACCTCTGGTGCTACAGCTGGAACAGTCGATAAGTACTACTTTGACCCAGTTTCAGGTCGTCGATTCCGGTCTAAGATTGAGGTTCTTTACTTTCTGGAAACAGGAACATTAAGGAAACGTAAGAAATCGTTAGATGGTAATGGGACGTCTGCTGACGGCCCCGAAGAACCCAAGAACAAAAAATCTTCCAACAATGCTAAATCTGCTCCTTTAAACTTTGATTTCTTCAATGTGCCTGAAAAGGTTGAGTGGGTTCTCACAGACCCTTCTCAGGAGGCTTGGACGCCGTTCATCAACAACGAGAAAGTACCCCAATCTACTAAACGCGAATGGGTAGCAGCATTTAAACTCCTCAGCCGATCAAAAGCTTGAGGTTTGAAGGCAATTTCATCCCTTTTGGACACTGTTTGGGATACCCATTCAGGTCAGAACCGTCCTCCCTCCCTCTAATGGCTTCCTATCTCCAATTCTTTTGGTCTTTTTTTTGGATGGAGCAGGTGAGGTTGAGGGGGAAGTGTTGTCAAAACCAGATTTGGT

Coding sequence (CDS)

ATGTCTGCTTTCGGAACTTCGATATCGGACCCCGACTGGCCCCGCCGAGATCCGACCCGATTCAACTTGCCTTTCTCCAACCACCTTCCGCCGGACCCTCTCCTCGACGCCGGCTCCTTCATCGACTCCTCCGCCGCCCGATGTGGAACCCTTCCTTCTTCCCCTAACCACTCCCGAGATCTTGACCTGAATCATCGCAAGAAGGCGGAGACCTCTTCCACTCTCCGATCTCCATTCCGCGATGACTCTCTCAACGCTGGAAATGGAGATTCTCCCGCAACTAAGAAGCTCTCGGGCTCCAGAGGGACTGACGCGTCTCCGAAATCCGCTTCGTCCGAGAAGCCTAATTGGTTGCCGCCGGGCTGGATTATGGAAGATAGAGTTCGCACCTCTGGTGCTACAGCTGGAACAGTCGATAAGTACTACTTTGACCCAGTTTCAGGTCGTCGATTCCGGTCTAAGATTGAGGTTCTTTACTTTCTGGAAACAGGAACATTAAGGAAACGTAAGAAATCGTTAGATGGTAATGGGACGTCTGCTGACGGCCCCGAAGAACCCAAGAACAAAAAATCTTCCAACAATGCTAAATCTGCTCCTTTAAACTTTGATTTCTTCAATGTGCCTGAAAAGGTTGAGTGGGTTCTCACAGACCCTTCTCAGGAGGCTTGGACGCCGTTCATCAACAACGAGAAAGTACCCCAATCTACTAAACGCGAATGGGTAGCAGCATTTAAACTCCTCAGCCGATCAAAAGCTTGA

Protein sequence

MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRDLDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSADGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREWVAAFKLLSRSKA
Homology
BLAST of Cp4.1LG20g08550 vs. ExPASy Swiss-Prot
Match: Q9LTJ1 (Methyl-CpG-binding domain-containing protein 6 OS=Arabidopsis thaliana OX=3702 GN=MBD6 PE=1 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 9.3e-32
Identity = 84/228 (36.84%), Postives = 125/228 (54.82%), Query Frame = 0

Query: 30  PPDPLLDAGSFIDSSAARCGTLPSSPNHSRDLDLNHRKKAETSSTLRSPFRDDSLNAGNG 89
           PPDPLL +G+FI  S+A  GTL SS                     R P +     +G+G
Sbjct: 10  PPDPLLASGAFI--SSAGDGTLDSSAK-------------------RRPIQGGIGISGSG 69

Query: 90  DSPATKKLSG----SRGTDASPKSASSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDP 149
           +S      +G    +  T++  +  ++   NWLPPGW +ED++RTSGATAG+VDKYY++P
Sbjct: 70  ESVRIGMANGTDQVNHQTESKSRKRAAPGDNWLPPGWRVEDKIRTSGATAGSVDKYYYEP 129

Query: 150 VSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSADGPEEPKNKKSSNNA-----KSAPL 209
            +GR+FRS+ EVLY+LE GT ++  K  +    + D  E   + + +  A        PL
Sbjct: 130 NTGRKFRSRTEVLYYLEHGTSKRGTKKAENTYFNPDHFEGQGSNRVTRTATVPPPPPPPL 189

Query: 210 NFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREWVAAFKLLS 249
           +FDF N P+KV W + +  +E W P I + KV  S +R+W  AF  ++
Sbjct: 190 DFDFKNPPDKVSWSMANAGEEGWIPNIGDVKVQDSVRRDWSTAFTFIT 216

BLAST of Cp4.1LG20g08550 vs. ExPASy Swiss-Prot
Match: Q9SNC0 (Methyl-CpG-binding domain-containing protein 5 OS=Arabidopsis thaliana OX=3702 GN=MBD5 PE=1 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 1.0e-30
Identity = 75/171 (43.86%), Postives = 98/171 (57.31%), Query Frame = 0

Query: 90  DSPATKKLSGSRGTDASPKSASSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGR 149
           ++PAT   S SR      K A+    NWLPP W  E RVRTSG  AGTVDK+Y++P++GR
Sbjct: 13  ENPATPVDSKSR------KRATPGDDNWLPPDWRTEIRVRTSGTKAGTVDKFYYEPITGR 72

Query: 150 RFRSKIEVLYFLETGTLRKRKKSLDGNGTS--------ADGPEEPKNKKSSNNAKSAPLN 209
           +FRSK EVLY+LE GT +K+      NG S             + K+ K        PLN
Sbjct: 73  KFRSKNEVLYYLEHGTPKKKSVKTAENGDSHSEHSEGRGSARRQTKSNKKVTEPPPKPLN 132

Query: 210 FDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREWVAAFKLLSRSKA 253
           FDF NVPEKV W   + S+EAW PFI + K+ +S  ++W   F L++   A
Sbjct: 133 FDFLNVPEKVTWTGINGSEEAWLPFIGDYKIQESVSQDWDRVFTLVTSQNA 177

BLAST of Cp4.1LG20g08550 vs. ExPASy Swiss-Prot
Match: Q9UBB5 (Methyl-CpG-binding domain protein 2 OS=Homo sapiens OX=9606 GN=MBD2 PE=1 SV=1)

HSP 1 Score: 50.4 bits (119), Expect = 3.3e-05
Identity = 28/63 (44.44%), Postives = 38/63 (60.32%), Query Frame = 0

Query: 99  GSRGTDASPKSASSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVL 158
           G RG  A+      + P  LPPGW  E+ +R SG +AG  D YYF P SG++FRSK ++ 
Sbjct: 137 GPRGPRATESGKRMDCPA-LPPGWKKEEVIRKSGLSAGKSDVYYFSP-SGKKFRSKPQLA 196

Query: 159 YFL 162
            +L
Sbjct: 197 RYL 197

BLAST of Cp4.1LG20g08550 vs. ExPASy Swiss-Prot
Match: Q9Z2E1 (Methyl-CpG-binding domain protein 2 OS=Mus musculus OX=10090 GN=Mbd2 PE=2 SV=2)

HSP 1 Score: 50.4 bits (119), Expect = 3.3e-05
Identity = 29/70 (41.43%), Postives = 40/70 (57.14%), Query Frame = 0

Query: 92  PATKKLSGSRGTDASPKSASSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGRRF 151
           P+     G RG  A+      + P  LPPGW  E+ +R SG +AG  D YYF P SG++F
Sbjct: 133 PSGSSGPGPRGPRATESGKRMDCPA-LPPGWKKEEVIRKSGLSAGKSDVYYFSP-SGKKF 192

Query: 152 RSKIEVLYFL 162
           RSK ++  +L
Sbjct: 193 RSKPQLARYL 200

BLAST of Cp4.1LG20g08550 vs. NCBI nr
Match: XP_023519565.1 (methyl-CpG-binding domain-containing protein 5-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 509 bits (1310), Expect = 7.10e-182
Identity = 252/252 (100.00%), Postives = 252/252 (100.00%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD
Sbjct: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120
           LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP
Sbjct: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120

Query: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180
           GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA
Sbjct: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180

Query: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240
           DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW
Sbjct: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240

Query: 241 VAAFKLLSRSKA 252
           VAAFKLLSRSKA
Sbjct: 241 VAAFKLLSRSKA 252

BLAST of Cp4.1LG20g08550 vs. NCBI nr
Match: KAG6583427.1 (Methyl-CpG-binding domain-containing protein 6, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 503 bits (1295), Expect = 1.38e-179
Identity = 249/252 (98.81%), Postives = 250/252 (99.21%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSAFGTS+SDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD
Sbjct: 1   MSAFGTSLSDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120
           LDLNHRKKAETSSTLRSPFR D LNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP
Sbjct: 61  LDLNHRKKAETSSTLRSPFRADPLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120

Query: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180
           GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA
Sbjct: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180

Query: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240
           DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW
Sbjct: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240

Query: 241 VAAFKLLSRSKA 252
           VAAFKLLSRSKA
Sbjct: 241 VAAFKLLSRSKA 252

BLAST of Cp4.1LG20g08550 vs. NCBI nr
Match: KAG7019190.1 (Methyl-CpG-binding domain-containing protein 6 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 500 bits (1287), Expect = 2.28e-178
Identity = 248/252 (98.41%), Postives = 249/252 (98.81%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSAFGTS+SDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD
Sbjct: 1   MSAFGTSLSDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120
           LDLNHRKKAETSSTLRSPFR D LNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP
Sbjct: 61  LDLNHRKKAETSSTLRSPFRADPLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120

Query: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180
           GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA
Sbjct: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180

Query: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240
           DGPEE KNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW
Sbjct: 181 DGPEESKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240

Query: 241 VAAFKLLSRSKA 252
           VAAFKLLSRSKA
Sbjct: 241 VAAFKLLSRSKA 252

BLAST of Cp4.1LG20g08550 vs. NCBI nr
Match: XP_022964800.1 (methyl-CpG-binding domain-containing protein 5-like [Cucurbita moschata])

HSP 1 Score: 499 bits (1285), Expect = 4.61e-178
Identity = 247/251 (98.41%), Postives = 248/251 (98.80%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSAFGTS+SDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD
Sbjct: 1   MSAFGTSLSDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120
           LDLNHRKKAETSSTLRSPFR D LNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP
Sbjct: 61  LDLNHRKKAETSSTLRSPFRADPLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120

Query: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180
           GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNG SA
Sbjct: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGMSA 180

Query: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240
           DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW
Sbjct: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240

Query: 241 VAAFKLLSRSK 251
           VAAFKLLSRSK
Sbjct: 241 VAAFKLLSRSK 251

BLAST of Cp4.1LG20g08550 vs. NCBI nr
Match: XP_022970567.1 (methyl-CpG-binding domain-containing protein 5-like [Cucurbita maxima])

HSP 1 Score: 496 bits (1277), Expect = 7.64e-177
Identity = 246/251 (98.01%), Postives = 247/251 (98.41%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSAFGTS+SDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD
Sbjct: 1   MSAFGTSLSDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120
           LDLNHRKKAETSSTLRSPFR D LNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP
Sbjct: 61  LDLNHRKKAETSSTLRSPFRADPLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120

Query: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180
           GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA
Sbjct: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180

Query: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240
           DGPEEPKNKKSSNNAKSAPLNFDF NVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW
Sbjct: 181 DGPEEPKNKKSSNNAKSAPLNFDFLNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240

Query: 241 VAAFKLLSRSK 251
           VAAF LLSRSK
Sbjct: 241 VAAFILLSRSK 251

BLAST of Cp4.1LG20g08550 vs. ExPASy TrEMBL
Match: A0A6J1HLU8 (methyl-CpG-binding domain-containing protein 5-like OS=Cucurbita moschata OX=3662 GN=LOC111464796 PE=4 SV=1)

HSP 1 Score: 499 bits (1285), Expect = 2.23e-178
Identity = 247/251 (98.41%), Postives = 248/251 (98.80%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSAFGTS+SDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD
Sbjct: 1   MSAFGTSLSDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120
           LDLNHRKKAETSSTLRSPFR D LNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP
Sbjct: 61  LDLNHRKKAETSSTLRSPFRADPLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120

Query: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180
           GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNG SA
Sbjct: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGMSA 180

Query: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240
           DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW
Sbjct: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240

Query: 241 VAAFKLLSRSK 251
           VAAFKLLSRSK
Sbjct: 241 VAAFKLLSRSK 251

BLAST of Cp4.1LG20g08550 vs. ExPASy TrEMBL
Match: A0A6J1HZG9 (methyl-CpG-binding domain-containing protein 5-like OS=Cucurbita maxima OX=3661 GN=LOC111469504 PE=4 SV=1)

HSP 1 Score: 496 bits (1277), Expect = 3.70e-177
Identity = 246/251 (98.01%), Postives = 247/251 (98.41%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSAFGTS+SDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD
Sbjct: 1   MSAFGTSLSDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120
           LDLNHRKKAETSSTLRSPFR D LNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP
Sbjct: 61  LDLNHRKKAETSSTLRSPFRADPLNAGNGDSPATKKLSGSRGTDASPKSASSEKPNWLPP 120

Query: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180
           GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA
Sbjct: 121 GWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSA 180

Query: 181 DGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240
           DGPEEPKNKKSSNNAKSAPLNFDF NVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW
Sbjct: 181 DGPEEPKNKKSSNNAKSAPLNFDFLNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREW 240

Query: 241 VAAFKLLSRSK 251
           VAAF LLSRSK
Sbjct: 241 VAAFILLSRSK 251

BLAST of Cp4.1LG20g08550 vs. ExPASy TrEMBL
Match: A0A6J1KC76 (methyl-CpG-binding domain-containing protein 5-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111493004 PE=4 SV=1)

HSP 1 Score: 405 bits (1042), Expect = 3.43e-141
Identity = 205/261 (78.54%), Postives = 223/261 (85.44%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSA GTS  DPDWPR+  TR +LP  +H PPDPLL +GSFIDSS ARCGTLPSSP+++RD
Sbjct: 1   MSASGTSFPDPDWPRQGSTRPDLPSPHHFPPDPLLSSGSFIDSSTARCGTLPSSPDYARD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNA----------GNGDSPATKKLSGSRGTDASPKSA 120
           LD NHRKK +TS+TLRSP   D LN           GNGDS A KK+S SRGTDASPKSA
Sbjct: 61  LDGNHRKKPDTSATLRSPLPADPLNPKKLAPDSPTPGNGDSSAPKKVSASRGTDASPKSA 120

Query: 121 SSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRK 180
            S++PNWLPPGW++EDRVR SGATAGTVDKYYFDP SGRRFRSKIEVLYFLETGTLRKRK
Sbjct: 121 LSDRPNWLPPGWVVEDRVRASGATAGTVDKYYFDPNSGRRFRSKIEVLYFLETGTLRKRK 180

Query: 181 KSLDGNGTSADGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE 240
           KSLDGN  SADG EEPK+KKSS+NAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE
Sbjct: 181 KSLDGNAMSADGSEEPKSKKSSSNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE 240

Query: 241 KVPQSTKREWVAAFKLLSRSK 251
           KVP+STKREWVAAF+LL RSK
Sbjct: 241 KVPESTKREWVAAFQLLGRSK 261

BLAST of Cp4.1LG20g08550 vs. ExPASy TrEMBL
Match: A0A6J1GAC7 (methyl-CpG-binding domain-containing protein 5-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452236 PE=4 SV=1)

HSP 1 Score: 404 bits (1038), Expect = 1.40e-140
Identity = 205/261 (78.54%), Postives = 222/261 (85.06%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSA GTS  DPDWPR+  TR +LP  +H PPDPLL +GSFIDSS ARCGTLPSSP+++RD
Sbjct: 1   MSASGTSFPDPDWPRQGSTRPDLPSPHHFPPDPLLSSGSFIDSSTARCGTLPSSPDYARD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNA----------GNGDSPATKKLSGSRGTDASPKSA 120
           LD NHR K +TS+TLRSP   D LN           GNGDS A KK+S SRGTDASPKSA
Sbjct: 61  LDGNHRNKPDTSATLRSPLPADPLNPKKLPPDSPTPGNGDSSAPKKVSASRGTDASPKSA 120

Query: 121 SSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRK 180
            SE+PNWLPPGW++EDRVR SGATAGTVDKYYFDP SGRRFRSKIEVLYFLETGTLRKRK
Sbjct: 121 LSERPNWLPPGWVVEDRVRASGATAGTVDKYYFDPKSGRRFRSKIEVLYFLETGTLRKRK 180

Query: 181 KSLDGNGTSADGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE 240
           KSLDGN  SADG EEPK+KKSS+NAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE
Sbjct: 181 KSLDGNVKSADGSEEPKSKKSSSNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE 240

Query: 241 KVPQSTKREWVAAFKLLSRSK 251
           KVP+STKREWVAAF+LL RSK
Sbjct: 241 KVPESTKREWVAAFQLLGRSK 261

BLAST of Cp4.1LG20g08550 vs. ExPASy TrEMBL
Match: A0A6J1K9Z4 (methyl-CpG-binding domain-containing protein 5-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111493004 PE=4 SV=1)

HSP 1 Score: 401 bits (1030), Expect = 2.15e-139
Identity = 205/261 (78.54%), Postives = 223/261 (85.44%), Query Frame = 0

Query: 1   MSAFGTSISDPDWPRRDPTRFNLPFSNHLPPDPLLDAGSFIDSSAARCGTLPSSPNHSRD 60
           MSA GTS  DPDWPR+  TR +LP  +H PPDPLL +GSFIDSS ARCGTLPSSP+++RD
Sbjct: 1   MSASGTSFPDPDWPRQGSTRPDLPSPHHFPPDPLLSSGSFIDSSTARCGTLPSSPDYARD 60

Query: 61  LDLNHRKKAETSSTLRSPFRDDSLNA----------GNGDSPATKKLSGSRGTDASPKSA 120
           LD NHRKK +TS+TLRSP   D LN           GNGDS A KK+S SRGTDASPKSA
Sbjct: 61  LDGNHRKKPDTSATLRSPLPADPLNPKKLAPDSPTPGNGDSSAPKKVSASRGTDASPKSA 120

Query: 121 SSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGRRFRSKIEVLYFLETGTLRKRK 180
            S++PNWLPPGW++EDRVR SGATAGTVDKYYFDP SGRRFRSKIEVLYFLETGTLRKRK
Sbjct: 121 LSDRPNWLPPGWVVEDRVRASGATAGTVDKYYFDPNSGRRFRSKIEVLYFLETGTLRKRK 180

Query: 181 KSLDGNGTSADGPEEPKNKKSSNNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE 240
           KSLDGN  SADG EEPK+KKSS+NAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE
Sbjct: 181 KSLDGN--SADGSEEPKSKKSSSNAKSAPLNFDFFNVPEKVEWVLTDPSQEAWTPFINNE 240

Query: 241 KVPQSTKREWVAAFKLLSRSK 251
           KVP+STKREWVAAF+LL RSK
Sbjct: 241 KVPESTKREWVAAFQLLGRSK 259

BLAST of Cp4.1LG20g08550 vs. TAIR 10
Match: AT5G59380.1 (methyl-CPG-binding domain 6 )

HSP 1 Score: 138.7 bits (348), Expect = 6.6e-33
Identity = 84/228 (36.84%), Postives = 125/228 (54.82%), Query Frame = 0

Query: 30  PPDPLLDAGSFIDSSAARCGTLPSSPNHSRDLDLNHRKKAETSSTLRSPFRDDSLNAGNG 89
           PPDPLL +G+FI  S+A  GTL SS                     R P +     +G+G
Sbjct: 10  PPDPLLASGAFI--SSAGDGTLDSSAK-------------------RRPIQGGIGISGSG 69

Query: 90  DSPATKKLSG----SRGTDASPKSASSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDP 149
           +S      +G    +  T++  +  ++   NWLPPGW +ED++RTSGATAG+VDKYY++P
Sbjct: 70  ESVRIGMANGTDQVNHQTESKSRKRAAPGDNWLPPGWRVEDKIRTSGATAGSVDKYYYEP 129

Query: 150 VSGRRFRSKIEVLYFLETGTLRKRKKSLDGNGTSADGPEEPKNKKSSNNA-----KSAPL 209
            +GR+FRS+ EVLY+LE GT ++  K  +    + D  E   + + +  A        PL
Sbjct: 130 NTGRKFRSRTEVLYYLEHGTSKRGTKKAENTYFNPDHFEGQGSNRVTRTATVPPPPPPPL 189

Query: 210 NFDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREWVAAFKLLS 249
           +FDF N P+KV W + +  +E W P I + KV  S +R+W  AF  ++
Sbjct: 190 DFDFKNPPDKVSWSMANAGEEGWIPNIGDVKVQDSVRRDWSTAFTFIT 216

BLAST of Cp4.1LG20g08550 vs. TAIR 10
Match: AT3G46580.1 (methyl-CPG-binding domain protein 5 )

HSP 1 Score: 135.2 bits (339), Expect = 7.3e-32
Identity = 75/171 (43.86%), Postives = 98/171 (57.31%), Query Frame = 0

Query: 90  DSPATKKLSGSRGTDASPKSASSEKPNWLPPGWIMEDRVRTSGATAGTVDKYYFDPVSGR 149
           ++PAT   S SR      K A+    NWLPP W  E RVRTSG  AGTVDK+Y++P++GR
Sbjct: 13  ENPATPVDSKSR------KRATPGDDNWLPPDWRTEIRVRTSGTKAGTVDKFYYEPITGR 72

Query: 150 RFRSKIEVLYFLETGTLRKRKKSLDGNGTS--------ADGPEEPKNKKSSNNAKSAPLN 209
           +FRSK EVLY+LE GT +K+      NG S             + K+ K        PLN
Sbjct: 73  KFRSKNEVLYYLEHGTPKKKSVKTAENGDSHSEHSEGRGSARRQTKSNKKVTEPPPKPLN 132

Query: 210 FDFFNVPEKVEWVLTDPSQEAWTPFINNEKVPQSTKREWVAAFKLLSRSKA 253
           FDF NVPEKV W   + S+EAW PFI + K+ +S  ++W   F L++   A
Sbjct: 133 FDFLNVPEKVTWTGINGSEEAWLPFIGDYKIQESVSQDWDRVFTLVTSQNA 177

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LTJ19.3e-3236.84Methyl-CpG-binding domain-containing protein 6 OS=Arabidopsis thaliana OX=3702 G... [more]
Q9SNC01.0e-3043.86Methyl-CpG-binding domain-containing protein 5 OS=Arabidopsis thaliana OX=3702 G... [more]
Q9UBB53.3e-0544.44Methyl-CpG-binding domain protein 2 OS=Homo sapiens OX=9606 GN=MBD2 PE=1 SV=1[more]
Q9Z2E13.3e-0541.43Methyl-CpG-binding domain protein 2 OS=Mus musculus OX=10090 GN=Mbd2 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
XP_023519565.17.10e-182100.00methyl-CpG-binding domain-containing protein 5-like [Cucurbita pepo subsp. pepo][more]
KAG6583427.11.38e-17998.81Methyl-CpG-binding domain-containing protein 6, partial [Cucurbita argyrosperma ... [more]
KAG7019190.12.28e-17898.41Methyl-CpG-binding domain-containing protein 6 [Cucurbita argyrosperma subsp. ar... [more]
XP_022964800.14.61e-17898.41methyl-CpG-binding domain-containing protein 5-like [Cucurbita moschata][more]
XP_022970567.17.64e-17798.01methyl-CpG-binding domain-containing protein 5-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1HLU82.23e-17898.41methyl-CpG-binding domain-containing protein 5-like OS=Cucurbita moschata OX=366... [more]
A0A6J1HZG93.70e-17798.01methyl-CpG-binding domain-containing protein 5-like OS=Cucurbita maxima OX=3661 ... [more]
A0A6J1KC763.43e-14178.54methyl-CpG-binding domain-containing protein 5-like isoform X1 OS=Cucurbita maxi... [more]
A0A6J1GAC71.40e-14078.54methyl-CpG-binding domain-containing protein 5-like isoform X1 OS=Cucurbita mosc... [more]
A0A6J1K9Z42.15e-13978.54methyl-CpG-binding domain-containing protein 5-like isoform X2 OS=Cucurbita maxi... [more]
Match NameE-valueIdentityDescription
AT5G59380.16.6e-3336.84methyl-CPG-binding domain 6 [more]
AT3G46580.17.3e-3243.86methyl-CPG-binding domain protein 5 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001739Methyl-CpG DNA bindingPFAMPF01429MBDcoord: 110..210
e-value: 9.0E-15
score: 54.1
IPR001739Methyl-CpG DNA bindingPROSITEPS50982MBDcoord: 108..180
score: 14.911046
NoneNo IPR availableGENE3D3.30.890.10coord: 87..201
e-value: 2.5E-22
score: 81.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 87..113
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 59..75
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 170..196
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..123
NoneNo IPR availablePANTHERPTHR12396METHYL-CPG BINDING PROTEIN, MBDcoord: 86..246
NoneNo IPR availablePANTHERPTHR12396:SF46METHYL-CPG BINDING DOMAIN PROTEIN-LIKE, ISOFORM Ccoord: 86..246
IPR016177DNA-binding domain superfamilySUPERFAMILY54171DNA-binding domaincoord: 109..164

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g08550.1Cp4.1LG20g08550.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding