Cp4.1LG09g00820.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g00820.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: Methyl-CpG-binding domain-containing 13-like protein
Location: Cp4.1LG09 : 544636 .. 548309 (+)
Sequence length: 2958
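
The Location field follows a "scaffold : start .. end (strand)" layout. As a minimal sketch, the string can be parsed and the genomic span computed as shown here. The helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a 'scaffold : start .. end (strand)' string (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    """Genomic span in bases, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG09 : 544636 .. 548309 (+)")
print(loc.scaffold, loc.strand, span_length(loc))
```

Under that assumption the span covers exons and introns together, so it would be expected to exceed the sequence length reported above if that figure refers to the spliced mRNA.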
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TGAAGTTTCTTTCACTTTTGAAACCAAAGAAAGGGTTCCACTTTCCACTTTGCAGATATACCCATTACCCAGAACAGCCCGCTCTTTTTTCAAATCTAGAACTTATTCCTCCGCTCTTAAAGCTTATTAAAAGCTTCGATATTTCCCCTTTACTCTTCCGTTACAGCAGACGTTATTGGAACCTCTAGCGGTTTCAATCCTCTGCTTTACTCCCAGATACGTTGCCATAGTCCTCCACTTCAATCGGCGGTACATTTGAATCGAGTTACGTTTTGAACTTACAGATGGGTGTAAAGAACAATCGTAAGATCCAGAAAAGAGGGTCGTAGTCAAATGTGAAAGGGGAGAGTTGTAGAGGATAATGAATGGGGTCCGACACGTTTATATGCCTCGCAGGGTGCTTCTCAGAACACGTGGTTAGAAAGAATATTGTGTCGGAATCCTTCATTTTTATAGCCTCCTGAAGGGATTTGACACCAAAATCAGACTCTTGGCGAGCCCAAAAGAGAGAAAAACATAGAGGGGAAAGAGGGTTATTGAGATTGGCCGGTGGGTTTCGCCGGAAAGTCTGAATGATGAACCAGAAGTCGGAGGAATGGCTACCCCCTGGCTGGACGGTGAAGGTCAAAGTGAGGAAGAGTGGCAAAAAGGATAAGGTACTGGTTTTCATTTCCTTTTTGCTTGGATTTTTACTTGCATTTTATGAATCAATAGATCTTTGGCTCTTGATCTACTTTCTTTTTCATGCTGCACTTGCATAGATTCTTGTGCTGTTTCTGAGCTAACAACATATCGTTGTTCTTCCTGTTCTTCCTGTTCTTGACTATGAATTCTACATGAGATTTCATTATGATTTGCAATGTTAATATGAAAAGATCTGATTCTAATGTTGCAGTATTATTATGAACCCTCAAGTCAAATGAGATTCAATTCTAGAGCAGAGGTGTTTAGATATCTCAAAACTGCTGCAATCTGTTATCCTGAATCTGAAGAGAGCAGAACCTTCAAGCAATCACAGAACAATGTCTGTCAAAAGAGTCGACCCTCTACTCTATTTCTTTTGTTTTCAATTTTATTGTTATGAACTTCATTGATTTTTCTTCAAAAACTTGTATTATTCAAGCAATCTAGGTGGAAGTTAAGAAGACTTTAGCAAAAGGGTTACCTCCTGGATGGATCAGAGAAATCAGAGAAACCAAGACTGCTAATAGGATAAGAAGAGATTCAGTAATTCATCATCTATTTCCTTTATGTCGTTTTCATTTGTCGTGTTGTTTCGGCTTCGGATATGCTCTCGAACCGTGAACTTAGTCTAATCCGCATATTTAGTGACCATAAGATTGTAAAATGCTGAAGAACAAGCAAGTTTACAGATGAACTTTTGTTTGCAGTTTTACATTGATCCTGTAAATGGAAATGTATTTCGCTCGATAAGGGAAGTACATCGGTATCTAACAAGTGGAACAGTGAGCCGATTAGCGTATAAATCAAGGGATCAGAGAGGCATCGACGTTGAGTTTCAACATGATGATATCTCTGTGAGTTCTCTAATGATGCTCCAAGTTCGAGTTTCAACAAACTTTGGTTTTATCATCTTTGCCCTTTGATTAAACTTTTGTCTTTCCCTCTCGTTCGTAGTCGCCCGCTGTCCCCAAGAAAGAGATGCAAGCTATTGGCAAAGCAAGGAGACAGATAATATGGAACGAGAACTTGTTGGAACCCCGTGAAATAACGAACGATGAAGCTATTTTTCCGAATGCTACGAGTGTCGGAGAAAGCATGCCTCATTCTGAACCTGATTCGAGTCATGGTATAGTTTTGAACATAGCTACTTTAGATAGCAAATTCTTTCATTCCTTATGGATTTGACCCCTTCTGCCAGTTGAAGGTTAAGGTGAATTCAGTGACTGCTGCAGGAGAAATAGGCATTGACGTGCATTGTTCAACTCTACCAGAGCAATCAGATGGAAAGAATGATTTTTTTGAACTTGTGACGACTC
CTAGTGATGGACCAACAAAGAATGAGAGTAAAAAGAGACAAAGGAAACCTAAAGACATGAACTTGACTCGCCGTGCTTCAAAACGACTTGCAGGGCTCCAAGCTGAGCCAGTGCTTGAAGTGAAAACAGGTCGCCGAGCACGTCCAGGTGCATGTGAAGAGTCTGATAAGCAAGCAGTCAGTAGTACAACTAAGTCAGTGTCTCAATGTCTTGAGAATCCTGATGTCAAGCATGGGACAAAGGGCATCATTGATCCTTCTAAAAGTATTAACACAAACCCAGATTCAAGTAGAAAAGATGATGTAAAACAAGACCCTATTCTTAAATTGCCAGTGGAGGACTTATTGGCTGATCCTTGCATTGCATTTGCAGTCAAAACTCTAACTGGAGGAGTTTTTGACGCATCCATAAGCTCAGAACTTTCACTGATGCCAAACGATATTGATCGTCCTTCAAATGAGAGTAGAAGCCTAGGCCCTAATGAGAAGTTGCCATCTTCGGAACTTCCTGTTAATCGAATCGGGGTAGTAGAGAAGCTAGAATCAACTTTGGAGTTGCCAGTAGGGGAAATATTGGCAGACCCTTGCATAGAATTTGCTATTAAAACTCTAACAGGGGAAATCCCTCTTGATGACAGTCCAGATATTGAGGATTATTTTCACCAACTTAGCACCTCGAAAACGCAAGGATCTAGTTCAAATGCCTTGAATAGTTTCGGTTCGGATCACTTCTACAAGATGAATGTTCAAAGCCAGAAGCGGCAGGTTATTCAAGCTCTGGAGGCATCTCCAAATATTAACTTCCAAAGCTGTGGAACTGGTTTACACCAACAAAAATGTAACAACTTTATAAGAATAAAGGATGACAAAGCTTAGATCATTTGTATGAAACATGATAGGCCAACATTGCCTATGAAAACATTTTCAGGATCAAACGCATGTAATTCTGTACTTGTTTGGTTGGGAAATCTAATGCTACCAATTAGCCCTTGAATCAAGATTTTCATTACATGTTTGTAGTAAAGCTAGATAACAGGTGACAGGAAGCCAGATTGTGTGGGTGCCCTCAGGCCGGAGGATGCATGAATTAGGGGCCTCTAACGCCTGTCAATGAGAGAAACATTCCCAATAAGTACCAAAGAATTATCTCAAGTGGCTAATACAGATGTTTGAACTCGATTTTCCAAGTTCTATGAAGTTCATAAGGAAGCTCAGCTAAAGAAATCAGAAACATAAATCAACTAAATTTGTTTCATATTTGGATATAAGCAGCAGAATCAGAAGACGCCGCACCTGCTCGAGCTCTCAATGTTTCGGTTCTCTTACTTCTTTCCATTATATCGACTAACCATTCAACACTGTCCTTTATCCCCATCCTGGAATCAAAACAATGAAGAATTTCAGCTTTCTAGAATGTTTTCTGCTGGTGAAGGATAATTTAATGGATAAAAAGCAGGCATAAAGCATATATTAGATGTCTACCGATGTCGGATCTCATCCCGTAATCCCATTACGAAAGAACTTTAGTGTTCTTGAGATTATGTAAATAGGATATATATACGTACCCATCATAGCCAGAAACAGCTTCAAACATGTAAACTCTTTCATCCAATTTTTTAAGATCCAGATAACGAGAAAGTTCTTCAGCTGATACTGCTTCAGAAAGATCCTGCAT

mRNA sequence

TGAAGTTTCTTTCACTTTTGAAACCAAAGAAAGGGTTCCACTTTCCACTTTGCAGATATACCCATTACCCAGAACAGCCCGCTCTTTTTTCAAATCTAGAACTTATTCCTCCGCTCTTAAAGCTTATTAAAAGCTTCGATATTTCCCCTTTACTCTTCCGTTACAGCAGACGTTATTGGAACCTCTAGCGGTTTCAATCCTCTGCTTTACTCCCAGATACGTTGCCATAGTCCTCCACTTCAATCGGCGGTACATTTGAATCGAGTTACGTTTTGAACTTACAGATGGGTGTAAAGAACAATCGTAAGATCCAGAAAAGAGGGTCGTAGTCAAATGTGAAAGGGGAGAGTTGTAGAGGATAATGAATGGGGTCCGACACGTTTATATGCCTCGCAGGGTGCTTCTCAGAACACGTGGTTAGAAAGAATATTGTGTCGGAATCCTTCATTTTTATAGCCTCCTGAAGGGATTTGACACCAAAATCAGACTCTTGGCGAGCCCAAAAGAGAGAAAAACATAGAGGGGAAAGAGGGTTATTGAGATTGGCCGGTGGGTTTCGCCGGAAAGTCTGAATGATGAACCAGAAGTCGGAGGAATGGCTACCCCCTGGCTGGACGGTGAAGGTCAAAGTGAGGAAGAGTGGCAAAAAGGATAAGTATTATTATGAACCCTCAAGTCAAATGAGATTCAATTCTAGAGCAGAGGTGTTTAGATATCTCAAAACTGCTGCAATCTGTTATCCTGAATCTGAAGAGAGCAGAACCTTCAAGCAATCACAGAACAATGTGGAAGTTAAGAAGACTTTAGCAAAAGGGTTACCTCCTGGATGGATCAGAGAAATCAGAGAAACCAAGACTGCTAATAGGATAAGAAGAGATTCATTTTACATTGATCCTGTAAATGGAAATGTATTTCGCTCGATAAGGGAAGTACATCGGTATCTAACAAGTGGAACAGTGAGCCGATTAGCGTATAAATCAAGGGATCAGAGAGGCATCGACGTTGAGTTTCAACATGATGATATCTCTTCGCCCGCTGTCCCCAAGAAAGAGATGCAAGCTATTGGCAAAGCAAGGAGACAGATAATATGGAACGAGAACTTGTTGGAACCCCGTGAAATAACGAACGATGAAGCTATTTTTCCGAATGCTACGAGTGTCGGAGAAAGCATGCCTCATTCTGAACCTGATTCGAGTCATGGAGAAATAGGCATTGACGTGCATTGTTCAACTCTACCAGAGCAATCAGATGGAAAGAATGATTTTTTTGAACTTGTGACGACTCCTAGTGATGGACCAACAAAGAATGAGAGTAAAAAGAGACAAAGGAAACCTAAAGACATGAACTTGACTCGCCGTGCTTCAAAACGACTTGCAGGGCTCCAAGCTGAGCCAGTGCTTGAAGTGAAAACAGGTCGCCGAGCACGTCCAGGTGCATGTGAAGAGTCTGATAAGCAAGCAGTCAGTAGTACAACTAAGTCAGTGTCTCAATGTCTTGAGAATCCTGATGTCAAGCATGGGACAAAGGGCATCATTGATCCTTCTAAAAGTATTAACACAAACCCAGATTCAAGTAGAAAAGATGATGTAAAACAAGACCCTATTCTTAAATTGCCAGTGGAGGACTTATTGGCTGATCCTTGCATTGCATTTGCAGTCAAAACTCTAACTGGAGGAGTTTTTGACGCATCCATAAGCTCAGAACTTTCACTGATGCCAAACGATATTGATCGTCCTTCAAATGAGAGTAGAAGCCTAGGCCCTAATGAGAAGTTGCCATCTTCGGAACTTCCTGTTAATCGAATCGGGGTAGTAGAGAAGCTAGAATCAACTTTGGAGTTGCCAGTAGGGGAAATATTGGCAGACCCTTGCATAGAATTTGCTATTAAAACTCTAACAGGGGAAATCCCTCTTGATGACAGTCCAGATATTGAGGATTATTTTCACCAACTTAGCACCTCGAAAACGCAAGGATCTAGTTCAAATGCCTTGAATAGTTTC
GGTTCGGATCACTTCTACAAGATGAATGTTCAAAGCCAGAAGCGGCAGGTTATTCAAGCTCTGGAGGCATCTCCAAATATTAACTTCCAAAGCTGTGGAACTGGTTTACACCAACAAAAATGTAACAACTTTATAAGAATAAAGGATGACAAAGCTTAGATCATTTGTATGAAACATGATAGGCCAACATTGCCTATGAAAACATTTTCAGGATCAAACGCATGTAATTCTGTACTTGTTTGGTTGGGAAATCTAATGCTACCAATTAGCCCTTGAATCAAGATTTTCATTACATGTTTGTAGTAAAGCTAGATAACAGGTGACAGGAAGCCAGATTGTGTGGGTGCCCTCAGGCCGGAGGATGCATGAATTAGGGGCCTCTAACGCCTGTCAATGAGAGAAACATTCCCAATAAGTACCAAAGAATTATCTCAAGTGGCTAATACAGATGTTTGAACTCGATTTTCCAAGTTCTATGAAGTTCATAAGGAAGCTCAGCTAAAGAAATCAGAAACATAAATCAACTAAATTTGTTTCATATTTGGATATAAGCAGCAGAATCAGAAGACGCCGCACCTGCTCGAGCTCTCAATGTTTCGGTTCTCTTACTTCTTTCCATTATATCGACTAACCATTCAACACTGTCCTTTATCCCCATCCTGGAATCAAAACAATGAAGAATTTCAGCTTTCTAGAATGTTTTCTGCTGGTGAAGGATAATTTAATGGATAAAAAGCAGGCATAAAGCATATATTAGATGTCTACCGATGTCGGATCTCATCCCGTAATCCCATTACGAAAGAACTTTAGTGTTCTTGAGATTATGTAAATAGGATATATATACGTACCCATCATAGCCAGAAACAGCTTCAAACATGTAAACTCTTTCATCCAATTTTTTAAGATCCAGATAACGAGAAAGTTCTTCAGCTGATACTGCTTCAGAAAGATCCTGCAT

Coding sequence (CDS)

ATGATGAACCAGAAGTCGGAGGAATGGCTACCCCCTGGCTGGACGGTGAAGGTCAAAGTGAGGAAGAGTGGCAAAAAGGATAAGTATTATTATGAACCCTCAAGTCAAATGAGATTCAATTCTAGAGCAGAGGTGTTTAGATATCTCAAAACTGCTGCAATCTGTTATCCTGAATCTGAAGAGAGCAGAACCTTCAAGCAATCACAGAACAATGTGGAAGTTAAGAAGACTTTAGCAAAAGGGTTACCTCCTGGATGGATCAGAGAAATCAGAGAAACCAAGACTGCTAATAGGATAAGAAGAGATTCATTTTACATTGATCCTGTAAATGGAAATGTATTTCGCTCGATAAGGGAAGTACATCGGTATCTAACAAGTGGAACAGTGAGCCGATTAGCGTATAAATCAAGGGATCAGAGAGGCATCGACGTTGAGTTTCAACATGATGATATCTCTTCGCCCGCTGTCCCCAAGAAAGAGATGCAAGCTATTGGCAAAGCAAGGAGACAGATAATATGGAACGAGAACTTGTTGGAACCCCGTGAAATAACGAACGATGAAGCTATTTTTCCGAATGCTACGAGTGTCGGAGAAAGCATGCCTCATTCTGAACCTGATTCGAGTCATGGAGAAATAGGCATTGACGTGCATTGTTCAACTCTACCAGAGCAATCAGATGGAAAGAATGATTTTTTTGAACTTGTGACGACTCCTAGTGATGGACCAACAAAGAATGAGAGTAAAAAGAGACAAAGGAAACCTAAAGACATGAACTTGACTCGCCGTGCTTCAAAACGACTTGCAGGGCTCCAAGCTGAGCCAGTGCTTGAAGTGAAAACAGGTCGCCGAGCACGTCCAGGTGCATGTGAAGAGTCTGATAAGCAAGCAGTCAGTAGTACAACTAAGTCAGTGTCTCAATGTCTTGAGAATCCTGATGTCAAGCATGGGACAAAGGGCATCATTGATCCTTCTAAAAGTATTAACACAAACCCAGATTCAAGTAGAAAAGATGATGTAAAACAAGACCCTATTCTTAAATTGCCAGTGGAGGACTTATTGGCTGATCCTTGCATTGCATTTGCAGTCAAAACTCTAACTGGAGGAGTTTTTGACGCATCCATAAGCTCAGAACTTTCACTGATGCCAAACGATATTGATCGTCCTTCAAATGAGAGTAGAAGCCTAGGCCCTAATGAGAAGTTGCCATCTTCGGAACTTCCTGTTAATCGAATCGGGGTAGTAGAGAAGCTAGAATCAACTTTGGAGTTGCCAGTAGGGGAAATATTGGCAGACCCTTGCATAGAATTTGCTATTAAAACTCTAACAGGGGAAATCCCTCTTGATGACAGTCCAGATATTGAGGATTATTTTCACCAACTTAGCACCTCGAAAACGCAAGGATCTAGTTCAAATGCCTTGAATAGTTTCGGTTCGGATCACTTCTACAAGATGAATGTTCAAAGCCAGAAGCGGCAGGTTATTCAAGCTCTGGAGGCATCTCCAAATATTAACTTCCAAAGCTGTGGAACTGGTTTACACCAACAAAAATGTAACAACTTTATAAGAATAAAGGATGACAAAGCTTAG

Protein sequence

MMNQKSEEWLPPGWTVKVKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESEESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREVHRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEPREITNDEAIFPNATSVGESMPHSEPDSSHGEIGIDVHCSTLPEQSDGKNDFFELVTTPSDGPTKNESKKRQRKPKDMNLTRRASKRLAGLQAEPVLEVKTGRRARPGACEESDKQAVSSTTKSVSQCLENPDVKHGTKGIIDPSKSINTNPDSSRKDDVKQDPILKLPVEDLLADPCIAFAVKTLTGGVFDASISSELSLMPNDIDRPSNESRSLGPNEKLPSSELPVNRIGVVEKLESTLELPVGEILADPCIEFAIKTLTGEIPLDDSPDIEDYFHQLSTSKTQGSSSNALNSFGSDHFYKMNVQSQKRQVIQALEASPNINFQSCGTGLHQQKCNNFIRIKDDKA
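
The protein sequence corresponds to the CDS read codon by codon. As a rough sketch, assuming the standard nuclear genetic code (the page does not name the translation table used), a translation check could look like this. `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten and last six codons of the CDS listed above.
print(translate("ATGATGAACCAGAAGTCGGAGGAATGGCTA"))  # MMNQKSEEWL
print(translate("AAGGATGACAAAGCTTAG"))              # KDDKA
```

The two fragments reproduce the start (MMNQKSEEWL) and the end (KDDKA) of the protein sequence shown above.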
BLAST of Cp4.1LG09g00820.1 vs. Swiss-Prot
Match: MBD7_ARATH (Methyl-CpG-binding domain-containing protein 7 OS=Arabidopsis thaliana GN=MBD7 PE=1 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 8.6e-08
Identity = 43/120 (35.83%), Positives = 62/120 (51.67%), Query Frame = 1

Query: 10  LPPGWTVK-VKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESEESRTFKQS 69
           LP GW+V+ V  + S   DKYY E  +  RF S   V RYL+         E   + +Q 
Sbjct: 116 LPRGWSVEEVPRKNSHYIDKYYVERKTGKRFRSLVSVERYLR---------ESRNSIEQQ 175

Query: 70  QNNVEVKKTLAKG--LPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREVHRYLTS 127
              ++ ++  +K   LP GWI E +  ++++ I  D  YI+P  GN FRS+  V RYL S
Sbjct: 176 LRVLQNRRGHSKDFRLPDGWIVEEKPRRSSSHI--DRSYIEPGTGNKFRSMAAVERYLIS 224
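
Each HSP summary line reports identities and positives as a count over the alignment length, followed by a percentage. The sketch below recomputes both percentages from the counts as a consistency check. The helper name `hsp_percentages` is hypothetical, and the pattern is written to accept either spelling of the positives label.

```python
import re

# Matches the identity/positives part of an HSP summary line.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


line = "Identity = 43/120 (35.83%), Positives = 62/120 (51.67%), Query Frame = 1"
print(hsp_percentages(line))  # (35.83, 51.67)
```

For the MBD7_ARATH hit above, 43/120 and 62/120 give 35.83% and 51.67%, matching the reported values.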

BLAST of Cp4.1LG09g00820.1 vs. Swiss-Prot
Match: MBD6_ARATH (Methyl-CpG-binding domain-containing protein 6 OS=Arabidopsis thaliana GN=MBD6 PE=1 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 3.3e-07
Identity = 40/130 (30.77%), Positives = 53/130 (40.77%), Query Frame = 1

Query: 7   EEWLPPGWTVKVKVRKSGKK----DKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESEES 66
           + WLPPGW V+ K+R SG      DKYYYEP++  +F SR EV  YL+         +  
Sbjct: 78  DNWLPPGWRVEDKIRTSGATAGSVDKYYYEPNTGRKFRSRTEVLYYLEHGTSKRGTKKAE 137

Query: 67  RTF-------KQSQNNVEVKKTLAKGLPP-----------------------GWIREIRE 103
            T+        Q  N V    T+    PP                       GWI  I +
Sbjct: 138 NTYFNPDHFEGQGSNRVTRTATVPPPPPPPLDFDFKNPPDKVSWSMANAGEEGWIPNIGD 197

BLAST of Cp4.1LG09g00820.1 vs. Swiss-Prot
Match: MBD13_ARATH (Methyl-CpG-binding domain-containing protein 13 OS=Arabidopsis thaliana GN=MBD13 PE=2 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 3.3e-07
Identity = 40/137 (29.20%), Positives = 72/137 (52.55%), Query Frame = 1

Query: 69  QNNVEVKKTLAKGLPPGWIREIRETKTANR-IRRDSFYIDPVNGNVFRSIREVHRYLTSG 128
           ++ V V+K+ A+GLP GWI+++  T  + R  RRD F+IDP +  +F+S ++  RY+ +G
Sbjct: 26  KDKVIVEKSAAQGLPEGWIKKLEITNRSGRKTRRDPFFIDPKSEYIFQSFKDASRYVETG 85

Query: 129 TVSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEPREITNDE 188
            +   A K ++    D+E   DD S        ++ + K        +++LE +E T D+
Sbjct: 86  NIGHYARKLKES---DIE---DDDSGNGKTVLRLEYVDKRSA-----DDVLE-KEKTIDD 145

Query: 189 AIFPNATSVGESMPHSE 205
                  ++  S  HS+
Sbjct: 146 VRRSKRRNLSSSDEHSK 150

BLAST of Cp4.1LG09g00820.1 vs. TrEMBL
Match: A0A0A0L6H4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G181990 PE=4 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 1.4e-110
Identity = 240/424 (56.60%), Positives = 282/424 (66.51%), Query Frame = 1

Query: 37  MRFNSRAEVFRYLKTAAICYPESEESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTA 96
           MRFNSRAEVFRYLKTAAIC+PESEESRT K+  NNVEVKKT+AK LP GWI EIRETKTA
Sbjct: 1   MRFNSRAEVFRYLKTAAICHPESEESRTVKEPGNNVEVKKTIAKDLPTGWIGEIRETKTA 60

Query: 97  NRIRRDSFYIDPVNGNVFRSIREVHRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPAV 156
           NRIRRDS YIDPVNGN  RSIR+VHRYLTSG VSRL +KSR+QR  ++EFQHD+ISSP V
Sbjct: 61  NRIRRDSSYIDPVNGNALRSIRDVHRYLTSGKVSRLTHKSRNQRDNNIEFQHDEISSPVV 120

Query: 157 PKKEMQAIGKARRQIIWNENLLEPREITNDEAIFPNATSVGESMPHSEPDSSHGEIGIDV 216
            KKE+  IGKARRQIIW+EN  EP E+ + EA+FPNA SVGE++    PD S G I    
Sbjct: 121 SKKEVLTIGKARRQIIWSENTSEPSEMVDGEAMFPNA-SVGETVLFPIPDPSLGGISAKP 180

Query: 217 HCSTLPE-----QSDGKNDFFELVTTPSD----------GPTKNESKKRQRKPKDMNLTR 276
           HCST  +     QSDGKND  E+V TP +          G TK ES+KRQRK  D+NL R
Sbjct: 181 HCSTPTDAKGLKQSDGKNDISEIVLTPKEFIQHKCPIENGATKGESRKRQRKTNDINLPR 240

Query: 277 RASKRLAGLQAEPVLEVKTGRRARPGACEESDKQAVSSTTKSVSQCLENPDVKHGTKGII 336
           RASKRLAGLQAEPVL+VKTGRRAR  ACEESDKQ  S+T     QC ENPDVKH T    
Sbjct: 241 RASKRLAGLQAEPVLQVKTGRRARSVACEESDKQVASTTKLVAFQCPENPDVKHKTNSTA 300

Query: 337 DPSKSINTNPDSSRKDDVKQDPILKLPVEDLLADPCIAFAVKTLTGGVFDASISSELSLM 396
           DPSK INT+PDS  K  +  D               ++  +K  +   ++     E SL 
Sbjct: 301 DPSK-INTSPDSGGKAHICVD---------------LSIVMKMKSADAYEQQPKPESSLP 360

Query: 397 PNDIDRPSNESRSLGPNEKLPSSELPVNRIGVVEKLESTLELPVGEILADPCIEFAIKTL 446
           P D+         L  +  +   E   N    V+K    L+LP+ ++L DPCI FA+KTL
Sbjct: 361 PEDV---------LEKHVGMVEIEDKAN----VKKQGPLLKLPMEDLLTDPCIAFAVKTL 394

BLAST of Cp4.1LG09g00820.1 vs. TrEMBL
Match: A0A061G045_THECC (Methyl-CPG-binding domain protein 13, putative isoform 2 OS=Theobroma cacao GN=TCM_015148 PE=4 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 1.1e-59
Identity = 204/603 (33.83%), Positives = 299/603 (49.59%), Query Frame = 1

Query: 1   MMNQKSEEWLPPGWTVKVKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESE 60
           M +Q +++WLPPGW V+V+ R++GKKDK YY P  ++RF SRAEV RYL     C  E +
Sbjct: 1   MEDQTTDDWLPPGWKVEVRQRRNGKKDKCYYAPCGELRFISRAEVSRYLDKCG-CKTEEK 60

Query: 61  ESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREV 120
           E+ + KQS  NV V+K  A+GLPPGWI+EIR TK A+R+R+D FY DPV+G VFRS+++ 
Sbjct: 61  ENGSGKQSSKNVTVEKAAAEGLPPGWIKEIRITKRAHRVRKDPFYTDPVSGYVFRSMKDA 120

Query: 121 HRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPA-VPKKEMQAIG---KARRQIIWNEN 180
            RY+ +G + +LA+K +D+   D + + D+I  PA V ++++   G   +  RQ    E 
Sbjct: 121 LRYVETGELGKLAFKPKDKGSNDEDLEEDNICEPAHVERQKIDVNGITDETERQSA--EQ 180

Query: 181 LLEPREITNDEAIFPNATSVGESMPHSEPDSSHGEIGIDVHCSTL-------PEQSDGK- 240
           +     IT +E +  +A S GE    S+  ++  + G+    S+L        EQ  GK 
Sbjct: 181 VSNLSGITKEEEMLASA-STGEQTSLSKHATNQHKAGVGAELSSLKLSEAKGSEQIGGKD 240

Query: 241 ---------NDFFELVTTPS--DGPTKNESKKRQ------RKPKDMNLTRRASKRLAGLQ 300
                    N    L+   S  +G  K+E++K Q      +  K  N+ RRASKRLAG+ 
Sbjct: 241 SEEGVHASGNVVGVLLDKQSSENGMIKDETEKTQQGRGKTKLKKAFNIPRRASKRLAGVA 300

Query: 301 AEPVLEVKTGRRAR------------------PGAC-EESDKQAVSSTTKSVSQC-LENP 360
            +P  E+KT R  R                  PG C   + KQ     +   + C L++P
Sbjct: 301 LDPTPELKTARARRSSFKQLSEVIPDAAESSSPGRCIHGASKQPDQPESALETSCDLDSP 360

Query: 361 DVKHGTKGIIDPSK---------------SINTNPDSSR------------------KDD 420
             K   + I+ P+                ++ T  D+                    + D
Sbjct: 361 KSK---ELILAPNNMLSSGEMLTMNGHVGNLETEADADNGVLPLGNAAIPGVHSGKVESD 420

Query: 421 VKQDPI----LKLPVEDLLADPCIAFAVKTLTGGVFDASISSEL--SLMPNDIDRP---- 480
           VK   +    + +P+ DL  DPCIAFA++TLTG   D    SEL  S  P  +  P    
Sbjct: 421 VKASEVPGSLVDMPLADLWTDPCIAFAIQTLTGIPCDNPKISELNSSKGPGILATPEVHA 480

Query: 481 --------SNESRSLGPNEKLPSSELPVNRIGVVE-------KLESTLELPVGEILADPC 496
                   S E +  G +  L    +P    G VE       K  S+L+ P+ +I ADPC
Sbjct: 481 ERKVNGNGSVERQGCGMDLPLADPAIPKEHAGKVEMGHKTDDKPGSSLDTPLADIWADPC 540

BLAST of Cp4.1LG09g00820.1 vs. TrEMBL
Match: A0A061G828_THECC (Methyl-CPG-binding domain protein 13, putative isoform 1 OS=Theobroma cacao GN=TCM_015148 PE=4 SV=1)

HSP 1 Score: 233.8 bits (595), Expect = 4.8e-58
Identity = 204/607 (33.61%), Positives = 299/607 (49.26%), Query Frame = 1

Query: 1   MMNQKSEEWLPPGWTVKVKVRKSGKKDKY----YYEPSSQMRFNSRAEVFRYLKTAAICY 60
           M +Q +++WLPPGW V+V+ R++GKKDK     YY P  ++RF SRAEV RYL     C 
Sbjct: 1   MEDQTTDDWLPPGWKVEVRQRRNGKKDKLRVMCYYAPCGELRFISRAEVSRYLDKCG-CK 60

Query: 61  PESEESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRS 120
            E +E+ + KQS  NV V+K  A+GLPPGWI+EIR TK A+R+R+D FY DPV+G VFRS
Sbjct: 61  TEEKENGSGKQSSKNVTVEKAAAEGLPPGWIKEIRITKRAHRVRKDPFYTDPVSGYVFRS 120

Query: 121 IREVHRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPA-VPKKEMQAIG---KARRQII 180
           +++  RY+ +G + +LA+K +D+   D + + D+I  PA V ++++   G   +  RQ  
Sbjct: 121 MKDALRYVETGELGKLAFKPKDKGSNDEDLEEDNICEPAHVERQKIDVNGITDETERQSA 180

Query: 181 WNENLLEPREITNDEAIFPNATSVGESMPHSEPDSSHGEIGIDVHCSTL-------PEQS 240
             E +     IT +E +  +A S GE    S+  ++  + G+    S+L        EQ 
Sbjct: 181 --EQVSNLSGITKEEEMLASA-STGEQTSLSKHATNQHKAGVGAELSSLKLSEAKGSEQI 240

Query: 241 DGK----------NDFFELVTTPS--DGPTKNESKKRQ------RKPKDMNLTRRASKRL 300
            GK          N    L+   S  +G  K+E++K Q      +  K  N+ RRASKRL
Sbjct: 241 GGKDSEEGVHASGNVVGVLLDKQSSENGMIKDETEKTQQGRGKTKLKKAFNIPRRASKRL 300

Query: 301 AGLQAEPVLEVKTGRRAR------------------PGAC-EESDKQAVSSTTKSVSQC- 360
           AG+  +P  E+KT R  R                  PG C   + KQ     +   + C 
Sbjct: 301 AGVALDPTPELKTARARRSSFKQLSEVIPDAAESSSPGRCIHGASKQPDQPESALETSCD 360

Query: 361 LENPDVKHGTKGIIDPSK---------------SINTNPDSSR----------------- 420
           L++P  K   + I+ P+                ++ T  D+                   
Sbjct: 361 LDSPKSK---ELILAPNNMLSSGEMLTMNGHVGNLETEADADNGVLPLGNAAIPGVHSGK 420

Query: 421 -KDDVKQDPI----LKLPVEDLLADPCIAFAVKTLTGGVFDASISSEL--SLMPNDIDRP 480
            + DVK   +    + +P+ DL  DPCIAFA++TLTG   D    SEL  S  P  +  P
Sbjct: 421 VESDVKASEVPGSLVDMPLADLWTDPCIAFAIQTLTGIPCDNPKISELNSSKGPGILATP 480

Query: 481 ------------SNESRSLGPNEKLPSSELPVNRIGVVE-------KLESTLELPVGEIL 496
                       S E +  G +  L    +P    G VE       K  S+L+ P+ +I 
Sbjct: 481 EVHAERKVNGNGSVERQGCGMDLPLADPAIPKEHAGKVEMGHKTDDKPGSSLDTPLADIW 540

BLAST of Cp4.1LG09g00820.1 vs. TrEMBL
Match: A0A0L9VIJ9_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan10g078700 PE=4 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 3.9e-52
Identity = 173/570 (30.35%), Positives = 276/570 (48.42%), Query Frame = 1

Query: 1   MMNQKSEEWLPPGWTVKVKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESE 60
           M    S++ LPPGWTV+V+VRK+G++DKYY  PSS ++F S+ EVFR++  A+     + 
Sbjct: 1   MEKTDSDDRLPPGWTVEVRVRKNGRRDKYYILPSSGLKFKSKVEVFRHIDNASN-KDNAS 60

Query: 61  ESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREV 120
              + ++   NV V+K +A+GLPPGW+++ R     +++RRD++YIDPV+G  F SI + 
Sbjct: 61  NKVSIQRISPNVVVEKAIAEGLPPGWVKKTRIATKGDKVRRDTYYIDPVSGYTFHSIEDA 120

Query: 121 HRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEP 180
           + YL SG + R  +K +D+   D   + D   S  V  K   ++  A+   +        
Sbjct: 121 YHYLESGEMGRNTFKPKDEDNNDTNLKDDKSPSACVTMKPTLSVSMAQSSTL-------- 180

Query: 181 REITNDEAIFPNATSVGESMPHSEPDS--SHGEIGIDVHCSTLPEQSDGKNDFFELVTTP 240
            +++N + I P + S GE M  S+ +   +HG    D     L E ++ K          
Sbjct: 181 DKVSNYQQI-PRSASSGEHMHMSDSNCIFNHGCTNKDTQEKKLQENTETKQG-------- 240

Query: 241 SDGPTKNESKKRQRKPKDMNLTRRASKRLAGLQAEPVLEVKT---GRRARPGACEE-SDK 300
           ++       K R +  K +NL RR+SKRLAG++ +PV E+KT    RRAR  A ++ S++
Sbjct: 241 TEKVQAQHHKCRNKHKKQINLPRRSSKRLAGIKLDPVPELKTRNRARRARQAAVKQSSEE 300

Query: 301 QAVSSTTKSVS--------QCLENPDVKHGTKGIIDPSKSIN---TNPDSSRKDDVKQDP 360
           + ++   KS S        Q L   D +  T   ++ + ++       ++  K D K D 
Sbjct: 301 ETITYVDKSPSSLHDDLAKQQLSGKDKECFTFSPLENNATVEECMRVTENGDKVDTKLDY 360

Query: 361 ILKLPVEDLLADPCIAFAVKTLTGGVFDASISSELSLMPNDIDRPSNES----------- 420
            L  P+++LL DPCIAFA++TLTG  F+ S +S+ S    DI    N +           
Sbjct: 361 NLGFPLKELLTDPCIAFAIQTLTGLTFETSKNSQTSCELKDIQHSENSAVSGCEGQGKKC 420

Query: 421 ------------RSLGPNEKLPSSELPVNRIGVVEKLESTLELPVGEILADPCIEFAIKT 480
                        SL  +++       ++     E    + E  +     DPCIEFAIKT
Sbjct: 421 NDGLGDSVFSSPGSLATSQEHAGDAAKIDMKAKNENTSPSSEKTLDMSWMDPCIEFAIKT 480

Query: 481 LTGEIPLDDSPDIEDYFHQLSTSKTQGSSSNALNSFGSDHFYKMNVQSQKRQVIQ----- 518
           LT  IPLD   + ++      T KT  S+ +  N + +D++      SQK    Q     
Sbjct: 481 LTDSIPLDSDQNPKNC--NQHTEKTM-SNVSLNNPYQTDYYCSQYFGSQKPMFTQSFVDP 540

BLAST of Cp4.1LG09g00820.1 vs. TrEMBL
Match: V7C2B4_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_004G117900g PE=4 SV=1)

HSP 1 Score: 209.1 bits (531), Expect = 1.3e-50
Identity = 178/557 (31.96%), Positives = 261/557 (46.86%), Query Frame = 1

Query: 1   MMNQKSEEWLPPGWTVKVKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESE 60
           M    S++ LPPGWTV+VKVRK+GK+DKYY+ PSS ++FNS+ EV+RYL  A        
Sbjct: 1   MEKLNSDDQLPPGWTVEVKVRKNGKRDKYYFLPSSGLKFNSKVEVYRYLDNA-------N 60

Query: 61  ESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREV 120
              + ++   NV V+K +A+GLPPGW+++ R     + +RRD++YIDPV+G  F SI +V
Sbjct: 61  NKVSIQKISPNVVVEKAIAEGLPPGWVKKTRIATKGDTVRRDTYYIDPVSGYAFHSIEDV 120

Query: 121 HRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEP 180
             YL SG V R   K +D+   D + + D   S  V  K   +I   +   +        
Sbjct: 121 DHYLESGEVGRNTLKPKDEDISDTKLKDDKSPSACVTMKPTSSISMGQSSDL-------- 180

Query: 181 REITNDEAIFPNATSVGESMPHSEPDS----SHGEIGIDVHCSTL--PEQSDGKN---DF 240
            ++  +    P + S GE M    PDS    +HG +G ++  S L   E SD K     F
Sbjct: 181 -DMVANYQQIPRSASSGEYMHVPVPDSKFIFNHGVVGTELSSSVLSRDENSDQKQVKVGF 240

Query: 241 FELVTT-------------PSDGPTKNESKK--------RQRKPKDMNLTRRASKRLAGL 300
            E  +                   TK  ++K        + +  K++NL RR SKRLAG+
Sbjct: 241 AESASVSGCTIKHTQEKQLQESSETKQGTEKVQAQHHQCKNKHKKEINLPRRCSKRLAGI 300

Query: 301 QAEPVLEVKTGRRARPGACEESDKQAVSSTTKSVSQCLENPDVKHGTKGIIDPSKSINTN 360
           + +PV E+KT  R R  A ++S +  V  ++ S+   L    +    K     S   N  
Sbjct: 301 KLDPVPELKTRNRTRRVAVKKSAE--VDKSSDSLHDGLAKQKLSGNYKEGFTFSHVQNNA 360

Query: 361 P--------DSSRKDDVKQDPILKLPVEDLLADPCIAFAVKTLTGGVFDASISSELSLMP 420
           P        ++      K D  L  P+ +LL DPCIAFA++TLTG  F+ S +S+ S   
Sbjct: 361 PVEECMRVTETGDNVVAKLDYNLDFPLRELLTDPCIAFAIQTLTGLTFETSKNSQTSSEL 420

Query: 421 NDI-----------------------DRPSNESRSLGPNEKLPSSELPVNRIGVVEKLES 480
            DI                       D   +   SL  +++  S     +     E    
Sbjct: 421 KDIQHSETSATAGCEGKGKKSNDGLSDNVFSSPGSLATSQEHASDAAKSDMKTKNENASP 480

Query: 481 TLELPVGEILADPCIEFAIKTLTGEIPLDDSPDIEDYF-HQLSTSKTQGSS---SNAL-- 491
           + E  +     DPCIEFAIKTLT  IPLD   + ++    QLS+S  Q S    SN    
Sbjct: 481 SSEKTLDMSWMDPCIEFAIKTLTDSIPLDSDQNPKNCLQQQLSSSSNQHSEMTMSNVSLN 539

BLAST of Cp4.1LG09g00820.1 vs. TAIR10
Match: AT5G59800.1 (AT5G59800.1 methyl-CPG-binding domain 7)

HSP 1 Score: 60.1 bits (144), Expect = 4.8e-09
Identity = 43/120 (35.83%), Positives = 62/120 (51.67%), Query Frame = 1

Query: 10  LPPGWTVK-VKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESEESRTFKQS 69
           LP GW+V+ V  + S   DKYY E  +  RF S   V RYL+         E   + +Q 
Sbjct: 116 LPRGWSVEEVPRKNSHYIDKYYVERKTGKRFRSLVSVERYLR---------ESRNSIEQQ 175

Query: 70  QNNVEVKKTLAKG--LPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREVHRYLTS 127
              ++ ++  +K   LP GWI E +  ++++ I  D  YI+P  GN FRS+  V RYL S
Sbjct: 176 LRVLQNRRGHSKDFRLPDGWIVEEKPRRSSSHI--DRSYIEPGTGNKFRSMAAVERYLIS 224

BLAST of Cp4.1LG09g00820.1 vs. TAIR10
Match: AT5G59380.1 (AT5G59380.1 methyl-CPG-binding domain 6)

HSP 1 Score: 58.2 bits (139), Expect = 1.8e-08
Identity = 40/130 (30.77%), Positives = 53/130 (40.77%), Query Frame = 1

Query: 7   EEWLPPGWTVKVKVRKSGKK----DKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESEES 66
           + WLPPGW V+ K+R SG      DKYYYEP++  +F SR EV  YL+         +  
Sbjct: 78  DNWLPPGWRVEDKIRTSGATAGSVDKYYYEPNTGRKFRSRTEVLYYLEHGTSKRGTKKAE 137

Query: 67  RTF-------KQSQNNVEVKKTLAKGLPP-----------------------GWIREIRE 103
            T+        Q  N V    T+    PP                       GWI  I +
Sbjct: 138 NTYFNPDHFEGQGSNRVTRTATVPPPPPPPLDFDFKNPPDKVSWSMANAGEEGWIPNIGD 197

BLAST of Cp4.1LG09g00820.1 vs. TAIR10
Match: AT5G52230.1 (AT5G52230.1 methyl-CPG-binding domain protein 13)

HSP 1 Score: 58.2 bits (139), Expect = 1.8e-08
Identity = 40/137 (29.20%), Positives = 72/137 (52.55%), Query Frame = 1

Query: 69  QNNVEVKKTLAKGLPPGWIREIRETKTANR-IRRDSFYIDPVNGNVFRSIREVHRYLTSG 128
           ++ V V+K+ A+GLP GWI+++  T  + R  RRD F+IDP +  +F+S ++  RY+ +G
Sbjct: 26  KDKVIVEKSAAQGLPEGWIKKLEITNRSGRKTRRDPFFIDPKSEYIFQSFKDASRYVETG 85

Query: 129 TVSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEPREITNDE 188
            +   A K ++    D+E   DD S        ++ + K        +++LE +E T D+
Sbjct: 86  NIGHYARKLKES---DIE---DDDSGNGKTVLRLEYVDKRSA-----DDVLE-KEKTIDD 145

Query: 189 AIFPNATSVGESMPHSE 205
                  ++  S  HS+
Sbjct: 146 VRRSKRRNLSSSDEHSK 150

BLAST of Cp4.1LG09g00820.1 vs. TAIR10
Match: AT3G46580.1 (AT3G46580.1 methyl-CPG-binding domain protein 5)

HSP 1 Score: 49.3 bits (116), Expect = 8.5e-06
Identity = 25/71 (35.21%), Positives = 40/71 (56.34%), Query Frame = 1

Query: 7  EEWLPPGWTVKVKVRKSGKK----DKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESEES 66
          + WLPP W  +++VR SG K    DK+YYEP +  +F S+ EV  YL+      P+ +  
Sbjct: 32 DNWLPPDWRTEIRVRTSGTKAGTVDKFYYEPITGRKFRSKNEVLYYLEHGT---PKKKSV 91

Query: 67 RTFKQSQNNVE 74
          +T +   ++ E
Sbjct: 92 KTAENGDSHSE 99

BLAST of Cp4.1LG09g00820.1 vs. NCBI nr
Match: gi|778679537|ref|XP_011651143.1| (PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X1 [Cucumis sativus])

HSP 1 Score: 473.0 bits (1216), Expect = 6.8e-130
Identity = 271/460 (58.91%), Positives = 316/460 (68.70%), Query Frame = 1

Query: 1   MMNQKSEEWLPPGWTVKVKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESE 60
           MMNQ S++ LPPGWTVKVKVRKSGKKDKYY+EPSSQMRFNSRAEVFRYLKTAAIC+PESE
Sbjct: 1   MMNQNSKDLLPPGWTVKVKVRKSGKKDKYYFEPSSQMRFNSRAEVFRYLKTAAICHPESE 60

Query: 61  ESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREV 120
           ESRT K+  NNVEVKKT+AK LP GWI EIRETKTANRIRRDS YIDPVNGN  RSIR+V
Sbjct: 61  ESRTVKEPGNNVEVKKTIAKDLPTGWIGEIRETKTANRIRRDSSYIDPVNGNALRSIRDV 120

Query: 121 HRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEP 180
           HRYLTSG VSRL +KSR+QR  ++EFQHD+ISSP V KKE+  IGKARRQIIW+EN  EP
Sbjct: 121 HRYLTSGKVSRLTHKSRNQRDNNIEFQHDEISSPVVSKKEVLTIGKARRQIIWSENTSEP 180

Query: 181 REITNDEAIFPNATSVGESMPHSEPDSSHGEIGIDVHCSTLPE-----QSDGKNDFFELV 240
            E+ + EA+FPNA SVGE++    PD S G I    HCST  +     QSDGKND  E+V
Sbjct: 181 SEMVDGEAMFPNA-SVGETVLFPIPDPSLGGISAKPHCSTPTDAKGLKQSDGKNDISEIV 240

Query: 241 TTPSD----------GPTKNESKKRQRKPKDMNLTRRASKRLAGLQAEPVLEVKTGRRAR 300
            TP +          G TK ES+KRQRK  D+NL RRASKRLAGLQAEPVL+VKTGRRAR
Sbjct: 241 LTPKEFIQHKCPIENGATKGESRKRQRKTNDINLPRRASKRLAGLQAEPVLQVKTGRRAR 300

Query: 301 PGACEESDKQAVSSTTKSVSQCLENPDVKHGTKGIIDPSKSINTNPDSSRKDDVKQDPIL 360
             ACEESDKQ  S+T     QC ENPDVKH T    DPSK INT+PDS  K  +  D   
Sbjct: 301 SVACEESDKQVASTTKLVAFQCPENPDVKHKTNSTADPSK-INTSPDSGGKAHICVD--- 360

Query: 361 KLPVEDLLADPCIAFAVKTLTGGVFDASISSELSLMPNDIDRPSNESRSLGPNEKLPSSE 420
                       ++  +K  +   ++     E SL P D+         L  +  +   E
Sbjct: 361 ------------LSIVMKMKSADAYEQQPKPESSLPPEDV---------LEKHVGMVEIE 420

Query: 421 LPVNRIGVVEKLESTLELPVGEILADPCIEFAIKTLTGEI 446
              N    V+K    L+LP+ ++L DPCI FA+KTLTG++
Sbjct: 421 DKAN----VKKQGPLLKLPMEDLLTDPCIAFAVKTLTGDV 430

BLAST of Cp4.1LG09g00820.1 vs. NCBI nr
Match: gi|778679540|ref|XP_011651144.1| (PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X2 [Cucumis sativus])

HSP 1 Score: 408.3 bits (1048), Expect = 2.0e-110
Identity = 240/424 (56.60%), Positives = 282/424 (66.51%), Query Frame = 1

Query: 37  MRFNSRAEVFRYLKTAAICYPESEESRTFKQSQNNVEVKKTLAKGLPPGWIREIRETKTA 96
           MRFNSRAEVFRYLKTAAIC+PESEESRT K+  NNVEVKKT+AK LP GWI EIRETKTA
Sbjct: 1   MRFNSRAEVFRYLKTAAICHPESEESRTVKEPGNNVEVKKTIAKDLPTGWIGEIRETKTA 60

Query: 97  NRIRRDSFYIDPVNGNVFRSIREVHRYLTSGTVSRLAYKSRDQRGIDVEFQHDDISSPAV 156
           NRIRRDS YIDPVNGN  RSIR+VHRYLTSG VSRL +KSR+QR  ++EFQHD+ISSP V
Sbjct: 61  NRIRRDSSYIDPVNGNALRSIRDVHRYLTSGKVSRLTHKSRNQRDNNIEFQHDEISSPVV 120

Query: 157 PKKEMQAIGKARRQIIWNENLLEPREITNDEAIFPNATSVGESMPHSEPDSSHGEIGIDV 216
            KKE+  IGKARRQIIW+EN  EP E+ + EA+FPNA SVGE++    PD S G I    
Sbjct: 121 SKKEVLTIGKARRQIIWSENTSEPSEMVDGEAMFPNA-SVGETVLFPIPDPSLGGISAKP 180

Query: 217 HCSTLPE-----QSDGKNDFFELVTTPSD----------GPTKNESKKRQRKPKDMNLTR 276
           HCST  +     QSDGKND  E+V TP +          G TK ES+KRQRK  D+NL R
Sbjct: 181 HCSTPTDAKGLKQSDGKNDISEIVLTPKEFIQHKCPIENGATKGESRKRQRKTNDINLPR 240

Query: 277 RASKRLAGLQAEPVLEVKTGRRARPGACEESDKQAVSSTTKSVSQCLENPDVKHGTKGII 336
           RASKRLAGLQAEPVL+VKTGRRAR  ACEESDKQ  S+T     QC ENPDVKH T    
Sbjct: 241 RASKRLAGLQAEPVLQVKTGRRARSVACEESDKQVASTTKLVAFQCPENPDVKHKTNSTA 300

Query: 337 DPSKSINTNPDSSRKDDVKQDPILKLPVEDLLADPCIAFAVKTLTGGVFDASISSELSLM 396
           DPSK INT+PDS  K  +  D               ++  +K  +   ++     E SL 
Sbjct: 301 DPSK-INTSPDSGGKAHICVD---------------LSIVMKMKSADAYEQQPKPESSLP 360

Query: 397 PNDIDRPSNESRSLGPNEKLPSSELPVNRIGVVEKLESTLELPVGEILADPCIEFAIKTL 446
           P D+         L  +  +   E   N    V+K    L+LP+ ++L DPCI FA+KTL
Sbjct: 361 PEDV---------LEKHVGMVEIEDKAN----VKKQGPLLKLPMEDLLTDPCIAFAVKTL 394

BLAST of Cp4.1LG09g00820.1 vs. NCBI nr
Match: gi|778679543|ref|XP_011651145.1| (PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X3 [Cucumis sativus])

HSP 1 Score: 350.5 bits (898), Expect = 5.1e-93
Identity = 210/392 (53.57%), Positives = 252/392 (64.29%), Query Frame = 1

Query: 69  QNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREVHRYLTSGT 128
           ++ VEVKKT+AK LP GWI EIRETKTANRIRRDS YIDPVNGN  RSIR+VHRYLTSG 
Sbjct: 26  KDKVEVKKTIAKDLPTGWIGEIRETKTANRIRRDSSYIDPVNGNALRSIRDVHRYLTSGK 85

Query: 129 VSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEPREITNDEA 188
           VSRL +KSR+QR  ++EFQHD+ISSP V KKE+  IGKARRQIIW+EN  EP E+ + EA
Sbjct: 86  VSRLTHKSRNQRDNNIEFQHDEISSPVVSKKEVLTIGKARRQIIWSENTSEPSEMVDGEA 145

Query: 189 IFPNATSVGESMPHSEPDSSHGEIGIDVHCSTLPE-----QSDGKNDFFELVTTPSD--- 248
           +FPNA SVGE++    PD S G I    HCST  +     QSDGKND  E+V TP +   
Sbjct: 146 MFPNA-SVGETVLFPIPDPSLGGISAKPHCSTPTDAKGLKQSDGKNDISEIVLTPKEFIQ 205

Query: 249 -------GPTKNESKKRQRKPKDMNLTRRASKRLAGLQAEPVLEVKTGRRARPGACEESD 308
                  G TK ES+KRQRK  D+NL RRASKRLAGLQAEPVL+VKTGRRAR  ACEESD
Sbjct: 206 HKCPIENGATKGESRKRQRKTNDINLPRRASKRLAGLQAEPVLQVKTGRRARSVACEESD 265

Query: 309 KQAVSSTTKSVSQCLENPDVKHGTKGIIDPSKSINTNPDSSRKDDVKQDPILKLPVEDLL 368
           KQ  S+T     QC ENPDVKH T    DPSK INT+PDS  K  +  D           
Sbjct: 266 KQVASTTKLVAFQCPENPDVKHKTNSTADPSK-INTSPDSGGKAHICVD----------- 325

Query: 369 ADPCIAFAVKTLTGGVFDASISSELSLMPNDIDRPSNESRSLGPNEKLPSSELPVNRIGV 428
               ++  +K  +   ++     E SL P D+         L  +  +   E   N    
Sbjct: 326 ----LSIVMKMKSADAYEQQPKPESSLPPEDV---------LEKHVGMVEIEDKAN---- 385

Query: 429 VEKLESTLELPVGEILADPCIEFAIKTLTGEI 446
           V+K    L+LP+ ++L DPCI FA+KTLTG++
Sbjct: 386 VKKQGPLLKLPMEDLLTDPCIAFAVKTLTGDV 387

BLAST of Cp4.1LG09g00820.1 vs. NCBI nr
Match: gi|449446179|ref|XP_004140849.1| (PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X4 [Cucumis sativus])

HSP 1 Score: 310.1 bits (793), Expect = 7.6e-81
Identity = 189/379 (49.87%), Positives = 234/379 (61.74%), Query Frame = 1

Query: 82  LPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREVHRYLTSGTVSRLAYKSRDQRG 141
           LPPGW  +++  K+    ++D  YIDPVNGN  RSIR+VHRYLTSG VSRL +KSR+QR 
Sbjct: 10  LPPGWTVKVKVRKSG---KKDKSYIDPVNGNALRSIRDVHRYLTSGKVSRLTHKSRNQRD 69

Query: 142 IDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEPREITNDEAIFPNATSVGESMP 201
            ++EFQHD+ISSP V KKE+  IGKARRQIIW+EN  EP E+ + EA+FPNA SVGE++ 
Sbjct: 70  NNIEFQHDEISSPVVSKKEVLTIGKARRQIIWSENTSEPSEMVDGEAMFPNA-SVGETVL 129

Query: 202 HSEPDSSHGEIGIDVHCSTLPE-----QSDGKNDFFELVTTPSD----------GPTKNE 261
              PD S G I    HCST  +     QSDGKND  E+V TP +          G TK E
Sbjct: 130 FPIPDPSLGGISAKPHCSTPTDAKGLKQSDGKNDISEIVLTPKEFIQHKCPIENGATKGE 189

Query: 262 SKKRQRKPKDMNLTRRASKRLAGLQAEPVLEVKTGRRARPGACEESDKQAVSSTTKSVSQ 321
           S+KRQRK  D+NL RRASKRLAGLQAEPVL+VKTGRRAR  ACEESDKQ  S+T     Q
Sbjct: 190 SRKRQRKTNDINLPRRASKRLAGLQAEPVLQVKTGRRARSVACEESDKQVASTTKLVAFQ 249

Query: 322 CLENPDVKHGTKGIIDPSKSINTNPDSSRKDDVKQDPILKLPVEDLLADPCIAFAVKTLT 381
           C ENPDVKH T    DPSK INT+PDS  K  +  D               ++  +K  +
Sbjct: 250 CPENPDVKHKTNSTADPSK-INTSPDSGGKAHICVD---------------LSIVMKMKS 309

Query: 382 GGVFDASISSELSLMPNDIDRPSNESRSLGPNEKLPSSELPVNRIGVVEKLESTLELPVG 441
              ++     E SL P D+         L  +  +   E   N    V+K    L+LP+ 
Sbjct: 310 ADAYEQQPKPESSLPPEDV---------LEKHVGMVEIEDKAN----VKKQGPLLKLPME 355

Query: 442 EILADPCIEFAIKTLTGEI 446
           ++L DPCI FA+KTLTG++
Sbjct: 370 DLLTDPCIAFAVKTLTGDV 355
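
The percentages on an HSP statistics line are the matched-column counts divided by the alignment length. The sketch below recomputes them from the raw counts. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)".
# The optional "i" also accepts the "Postives" spelling seen in some report renderings.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, positives, pos_length = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percent = matched columns / alignment length, rounded to two decimals.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats("Identity = 189/379 (49.87%), Positives = 234/379 (61.74%)")
print(stats["identity_pct"], stats["positives_pct"])  # 49.87 61.74
```

For the Cucumis sativus isoform X4 hit above, 189 identical columns out of 379 aligned columns reproduces the reported 49.87% identity.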

BLAST of Cp4.1LG09g00820.1 vs. NCBI nr
Match: gi|1009122532|ref|XP_015878051.1| (PREDICTED: methyl-CpG-binding domain-containing protein 13 [Ziziphus jujuba])

HSP 1 Score: 266.2 bits (679), Expect = 1.3e-67
Identity = 204/567 (35.98%), Positives = 293/567 (51.68%), Query Frame = 1

Query: 5   KSEEWLPPGWTVKVKVRKSGKKDKYYYEPSSQMRFNSRAEVFRYLKTAAICYPESEESRT 64
           ++E+WLPPGWTV+VK+R +G+KDKYY+ P    +FNS+AEV RYL +  I     +   T
Sbjct: 3   ETEDWLPPGWTVEVKIRNNGRKDKYYHAPLDGPKFNSKAEVSRYLSSKQII----DGKGT 62

Query: 65  FKQSQNNVEVKKTLAKGLPPGWIREIRETKTANRIRRDSFYIDPVNGNVFRSIREVHRYL 124
           FK+ + NV V+K + KGLPPGWI+EIR TK A +IRRD +YIDP+NG +FRS+++V+RYL
Sbjct: 63  FKRFKRNVVVEKVIPKGLPPGWIKEIRMTKKAGKIRRDPYYIDPINGKIFRSMKDVYRYL 122

Query: 125 TSGTVSRLAYKSRDQRGIDVEFQHDDISSPAVPKKEMQAIGKARRQIIWNENLLEPREIT 184
            +  +  L  K +     D+E+  D+ SSP V K +  A+GK RRQI ++++     E+ 
Sbjct: 123 ETEELGSLGKKLKGSSDEDLEY--DETSSPVVSKGQKLAVGKTRRQIDFSQSS-NSNEML 182

Query: 185 NDEAIFPNATSVGESMPHSEPDSSHGE-----IGIDVHCSTLPEQSDGKN---------- 244
            DE I PN+T  G+     E  S  G         D+  + + EQ    N          
Sbjct: 183 KDEQI-PNSTFTGQCQFPLEHTSDQGRMSNELRNSDIQEAKVSEQELQSNSPKSTSASFP 242

Query: 245 --DFFELVTTPSDGPTKNESKK------RQRKPKDMNLTRRASKRLAGLQAEPVLEVKTG 304
             D  +   +P     K+E  +      + +  K+ NL RRASKRLAGL+ +P+ E+K  
Sbjct: 243 AGDVLQGEQSPECVLAKHERGRTRLGLSKSKAKKETNLPRRASKRLAGLEVDPIPELKPK 302

Query: 305 RRARPGACEESDKQAV----SSTTKSVSQCLENPD---VKHGTKGIIDPSKSINTNPDSS 364
            RAR  A ++S    V    SS+T       E PD   V+  T  I+D S+S      S+
Sbjct: 303 TRARRSAVKQSGDDGVNQSGSSSTPGSDCAFEQPDQLEVEPETYCIVDTSESTELPLQSN 362

Query: 365 RKDDVKQDPI----------------------LKLPVE----DLLADPCIAFAVKTLTGG 424
           ++  +  D +                      L+LP+E    +LL DPCIAFA+KTLTG 
Sbjct: 363 KRRRLPVDLVTPEKQVSEAETGINCDDRANEKLELPIELPLGELLTDPCIAFAIKTLTGV 422

Query: 425 VFDASISSELSL------------------MPNDIDRPSNESRSLGPNEKLPSSEL--PV 484
            FD   SSE++                   +  +++      R LGP+  LP   L  P 
Sbjct: 423 AFDTYKSSEVASAGSNSRDHSSGNLVTPIELAGNVETGKEAERELGPSVVLPMGVLSFPE 482

Query: 485 NRIGVV-------EKLESTLELPVGEILADPCIEFAIKTLTGEIPLDDSPDIEDYF-HQL 487
            + G +       EK    +E P      DPCIEFAIKTLT     D  PDI++YF  QL
Sbjct: 483 RQAGKIDTDKNADEKSGYPIEFPSSCSWLDPCIEFAIKTLTA----DAVPDIQNYFQQQL 542

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
MBD7_ARATH    8.6e-08    35.83       Methyl-CpG-binding domain-containing protein 7 OS=Arabidopsis thaliana GN=MBD7 P...
MBD6_ARATH    3.3e-07    30.77       Methyl-CpG-binding domain-containing protein 6 OS=Arabidopsis thaliana GN=MBD6 P...
MBD13_ARATH   3.3e-07    29.20       Methyl-CpG-binding domain-containing protein 13 OS=Arabidopsis thaliana GN=MBD13...

Match Name         E-value     Identity    Description
A0A0A0L6H4_CUCSA   1.4e-110    56.60       Uncharacterized protein OS=Cucumis sativus GN=Csa_3G181990 PE=4 SV=1
A0A061G045_THECC   1.1e-59     33.83       Methyl-CPG-binding domain protein 13, putative isoform 2 OS=Theobroma cacao GN=T...
A0A061G828_THECC   4.8e-58     33.61       Methyl-CPG-binding domain protein 13, putative isoform 1 OS=Theobroma cacao GN=T...
A0A0L9VIJ9_PHAAN   3.9e-52     30.35       Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan10g078700 PE=4 SV=1
V7C2B4_PHAVU       1.3e-50     31.96       Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_004G117900g PE=4 SV=1

Match Name    E-value    Identity    Description
AT5G59800.1   4.8e-09    35.83       methyl-CPG-binding domain 7
AT5G59380.1   1.8e-08    30.77       methyl-CPG-binding domain 6
AT5G52230.1   1.8e-08    29.20       methyl-CPG-binding domain protein 13
AT3G46580.1   8.5e-06    35.21       methyl-CPG-binding domain protein 5

Match Name                          E-value     Identity    Description
gi|778679537|ref|XP_011651143.1|    6.8e-130    58.91       PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X1 [Cucumis s...
gi|778679540|ref|XP_011651144.1|    2.0e-110    56.60       PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X2 [Cucumis s...
gi|778679543|ref|XP_011651145.1|    5.1e-93     53.57       PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X3 [Cucumis s...
gi|449446179|ref|XP_004140849.1|    7.6e-81     49.87       PREDICTED: methyl-CpG-binding domain-containing protein 13 isoform X4 [Cucumis s...
gi|1009122532|ref|XP_015878051.1|   1.3e-67     35.98       PREDICTED: methyl-CpG-binding domain-containing protein 13 [Ziziphus jujuba]

The following terms have been associated with this mRNA:
Vocabulary: Cellular Component
Term          Definition
GO:0005634    nucleus
Vocabulary: Molecular Function
Term          Definition
GO:0003677    DNA binding
Vocabulary: INTERPRO
Term          Definition
IPR016177     DNA-bd_dom_sf
IPR001739     Methyl_CpG_DNA-bd
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding

This mRNA is a part of the following gene feature(s):

Feature Name       Unique Name        Type
Cp4.1LG09g00820    Cp4.1LG09g00820    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name         Unique Name                  Type
Cp4.1LG09g00820.1    Cp4.1LG09g00820.1-protein    polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                            Unique Name                             Type
Cp4.1LG09g00820.1:five_prime_utr:001    Cp4.1LG09g00820.1:five_prime_utr:001    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name                 Unique Name                  Type
Cp4.1LG09g00820.1:cds:001    Cp4.1LG09g00820.1:cds:001    CDS
Cp4.1LG09g00820.1:cds:002    Cp4.1LG09g00820.1:cds:002    CDS
Cp4.1LG09g00820.1:cds:003    Cp4.1LG09g00820.1:cds:003    CDS
Cp4.1LG09g00820.1:cds:004    Cp4.1LG09g00820.1:cds:004    CDS
Cp4.1LG09g00820.1:cds:005    Cp4.1LG09g00820.1:cds:005    CDS
Cp4.1LG09g00820.1:cds:006    Cp4.1LG09g00820.1:cds:006    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                             Unique Name                              Type
Cp4.1LG09g00820.1:three_prime_utr:001    Cp4.1LG09g00820.1:three_prime_utr:001    three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description         Source   Source Term        Source Description                               Alignment
IPR001739  Methyl-CpG DNA binding  GENE3D   G3DSA:3.30.890.10                                                   coord: 59..126, score: 3.5E-11; coord: 8..52, score: 3.1
IPR001739  Methyl-CpG DNA binding  PFAM     PF01429            MBD                                              coord: 77..128, score: 4.4E-8; coord: 6..53, score: 4.
IPR001739  Methyl-CpG DNA binding  PROFILE  PS50982            MBD                                              coord: 1..70, score: 14.432; coord: 72..151, score: 15
IPR016177  DNA-binding domain      unknown  SSF54171           DNA-binding domain                               coord: 7..54, score: 1.31E-12; coord: 71..128, score: 2.62
None       No IPR available        PANTHER  PTHR34067          FAMILY NOT NAMED                                 coord: 1..288, score: 2.5E-55; coord: 333..454, score: 2.5
None       No IPR available        PANTHER  PTHR34067:SF1      METHYL-CPG-BINDING DOMAIN-CONTAINING PROTEIN 13  coord: 333..454, score: 2.5E-55; coord: 1..288, score: 2.5
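
Each `coord` entry in the Alignment column gives the start and end of a match on the protein. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for InterPro match locations, but not stated on this page), the length of a matched region can be computed as follows. The helper name `span_length` is hypothetical.

```python
def span_length(coord):
    """Length in residues of a 'start..end' range, assuming 1-based inclusive ends."""
    start_text, end_text = coord.split("..")
    start, end = int(start_text), int(end_text)
    if end < start:
        raise ValueError("end before start in %r" % coord)
    # Inclusive range: both the start and the end residue are counted.
    return end - start + 1


# The two GENE3D matches listed for IPR001739.
for coord in ("59..126", "8..52"):
    print(coord, span_length(coord))
```

Under that assumption, the GENE3D match at 59..126 covers 68 residues and the match at 8..52 covers 45.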