Cp4.1LG04g00110 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g00110
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Methyl-CpG-binding domain protein 4
Location: Cp4.1LG04 : 2023822 .. 2027147 (-)
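
The Location value above packs scaffold, start, end, and strand into one string. A minimal sketch of pulling those fields apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Assumed layout of the Location value: "<scaffold> : <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into scaffold, start, end, strand and span length."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG04 : 2023822 .. 2027147 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3326 bases on the minus strand of scaffold Cp4.1LG04.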
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAGCGCGCCAAGTTCGTGAAGCCATCAGTCGGTTCAATGCACTTCCTTTTCGGCAGCCGCCATGAGTGCAACTACAATCATGAACCCTAATCTCTCCCCTCCCTCCTCATCTTCATTTCCCGATTTCTCGGTTCAATGCTCTTCCAGATTTCGCTTTCCTCCTTCCAAATGCCCCTCCGACTCGAATCCTCAAAACCCCACGCCGGAGGATTTTACCCAAAAGAGGACCACTCTCATGGCGCAAAACTCTCCGATTTCGACTCTTGAGGTTCTCCAAACCTCTGAATCGAATCATCAGAAGACAGCCGTAGGGCACGAGATTCCGATTTTGTGTATTGAGGATCTTCAGGATAACCCGAAGCGTGGGACTTCCACATTAACCGTAGAGGATGTCCAACAAGTTTCACCCAAAACCCCAACTTCTGAAAGGGAAAGGGTTTTAGCGCATGAGCCTCCTATATTGACTCTAGAGGATCTTCAAAATGCAAAATCAGACCATCAACCGGCGATAAAGCCTCCATTGGCTCGAAGGGTTTTACGGTTTTACCGGCAGTTTGGGTTTGATGAACAAATAGTGCAAAAAACCCCACCTTCTGTCCGAAATTCCATGCCAGTTCAACGAGATGAACGTGTAGTTTCGCGTCATTTCCAGGAATCAAAATCAACCCAGCAAGGAGAACGAATTGTATCACGCTACTTTCAACACTCGGAGATAGAACAAGCAGCCCATAATGAGGATGAGGATGAGGATGTCAATGTCACAGATCAACCAATTAAAAGATCAAGGGTCGGAGAATACAGAAAAAGGAGGAGGAAAGACGTAGCTTCTAGCTCCGATAATTCAAAAGCATATCAACGTTCAATCAGAAAATCCTCACGTTCTGTTAAAAAATCGGGAAAGGATAAACGAGTGCGGATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAACCCTGAAGTGGAGATTGAAGTTTCACCTTCATTACAAAATTCAAAAACAAAACAACAAGGAGAGCGCATAGTCTCACGTTTCTTTCAAAAATCAGAAGAACAAGAAGTAGTGAACAATCAACAAGAGGTTATACAGCTTCCAAGTCAGTGTGCAAAATCTGTTAAAAGAATCCGCAAACCAGCCAAGGAAAGAAAAGTGAGGGATAAAGTTTCTGCTAGGCCTAGAACCACTCTTTCGGCTGACGAGTTGTTTCTGGAAGCTTATAGAAGAAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCTTACGACCCTTGGAGGGTTCTTGTCATATGTATGCTCCTTAACCGGACGACTGGGCAGCAGGTATTTACTTGATCTTGAATGCAATTTCCAATTTCATACTGTACTATATCTTCCCCATGGCACATAAACTTGATTTGCTTTCATTACAGTTTTGCCACTCCACGACTTTGCCGCAGGACTGGGTTTTCATAAATGAACTTCAACAATATCCTTGAATCAAGTAAACAAGAAAAGTGGATAAATATCATCTCATTCAACCTTGTCAATTACCATATTCACCATGCCCTTCGAAGATATTCTGTTAGGTATCTAGAGTATTCTTTACGTATTGAATCATACTATAGTATTCTCCAATACTCAATTCCTTTCCCAAATCAGCACTTACCATCAGCTGCCTTACTAACTCACATTCCTTGTGCCCCACGGTAATACGATATCCACAATACCAAGTTGTAGATCCACCCCTTACTTTGTTAAGTGAATACACCCTGAGAATTGGTGTCTATTAGTTTACAAAGCATTGTTTTGGGGTATTTGGAAAGATAGGAACCAAATTAATTGAAGTGGATGATAGGCTGAAACTATTTAGGCATCAAAGAGCTACTTTGTGGTGCCATTTCTAGTGAATTTTGGTATCATTCTTCGAGTATGATTTACTGAACTTTCGTTCTCTAATTTTCTTGTGAACTCTTAAAAGGATTTCTTTATGT
ATAACTGTTCAAATATCAACGAAAATCTGTCTTCTGTTTGAAGGAAAATCTAAAAATTTGTCGTCCCGTTTCTTTGAGAGCTAGAAAGTATTAGATGAATGATACACATGCTCATGTATATATGAAGTTCTTATACCTAACCTCTGGTTCTTAGTTGTGCCAGTCTATTGGAGTGATCATTTAGTTGGTCGTGCACGTATTTATGAATGGTTGGGGAAATAAACTCCATGATACTATATGTGCACGTATTTATGAATGGTTGGGGAAATAAACTCCATGATACTATATATTATCGCAACCAAAACCTTCTATCATTTTGTGTTTCTGTGTCAACTTGTTACTCATAATTGACTCTCTGTTTTCTGATTATGCACCCTTAGTCCTGTAGGTTTCATTTTTGCTGAAAATTTTCAGTTTGATTCAGTCTGCGACTTCATTTTGAAAATTGCAGGCAAAAGATGTGATACCTAAACTCTTCACGTTGTGTCCCGATCCAAAGTCTGCTTTGGAGGTATCACAAGAGCAGATAGAAGATATTATTCGACCTCTTGGTTTACAAAGAAAAAGATCACTTACAATTCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGTGTTGGCAAGTAATTTAAGACTATCCTTTTAAACTCTTTGGTTTATAGTTCTATCTCTTCCATGCTTATTGAGCTTGGGACGTGTATTAAAATGAATTTGATTTGAGACAGGTATGGAGCTGATGCACATGCAATATTTTGCACTGGATATTGGACCGAAGTATTACCTAAAGATCACATGCTTAATTATTACTGGGAGTTCCTCCACAGCATAAAACACCTGCTCTGATCTTATCTGAGACGACTGTAGATGGTTCGGCACGAGAGAAGCTGTAAATTTCCCGGTCTACTTAACATATATTCTTTGGTACGTTACTCTTTTGACATAATTTTGTTTTGTTTTGTTAATGTTCATGTTGTTGTTGGGAGTCTGTGAGACACTTATATCAACACATACTAGAAAGGGCAGAAATGGAAGCTCTTCTCCAGTAGCTAATTAGGCTGTAGTTGTGGGGGCTGAATGAAGTAGCGTGAGGGTCATGGTATGTTTTGAAAGGTCTGGGTAGCTTTACTAGATGACTTGAGACTGTTTGTAGGGTAATTATGTTACTCTATGGGCTGTGCCTTCCCTTTTGAGGAACTAGTGCCAGCGAAGACGCTGAATTCCGATGGATTCCGACCTATCTTGCTTAATGTAAATATAAATATATTTTTAAAGAGAAATTTTCTGAATTTCACTTTCAAGTT

mRNA sequence

AGAAGCGCGCCAAGTTCGTGAAGCCATCAGTCGGTTCAATGCACTTCCTTTTCGGCAGCCGCCATGAGTGCAACTACAATCATGAACCCTAATCTCTCCCCTCCCTCCTCATCTTCATTTCCCGATTTCTCGGTTCAATGCTCTTCCAGATTTCGCTTTCCTCCTTCCAAATGCCCCTCCGACTCGAATCCTCAAAACCCCACGCCGGAGGATTTTACCCAAAAGAGGACCACTCTCATGGCGCAAAACTCTCCGATTTCGACTCTTGAGGTTCTCCAAACCTCTGAATCGAATCATCAGAAGACAGCCGTAGGGCACGAGATTCCGATTTTGTGTATTGAGGATCTTCAGGATAACCCGAAGCGTGGGACTTCCACATTAACCGTAGAGGATGTCCAACAAGTTTCACCCAAAACCCCAACTTCTGAAAGGGAAAGGGTTTTAGCGCATGAGCCTCCTATATTGACTCTAGAGGATCTTCAAAATGCAAAATCAGACCATCAACCGGCGATAAAGCCTCCATTGGCTCGAAGGGTTTTACGGTTTTACCGGCAGTTTGGGTTTGATGAACAAATAGTGCAAAAAACCCCACCTTCTGTCCGAAATTCCATGCCAGTTCAACGAGATGAACGTGTAGTTTCGCGTCATTTCCAGGAATCAAAATCAACCCAGCAAGGAGAACGAATTGTATCACGCTACTTTCAACACTCGGAGATAGAACAAGCAGCCCATAATGAGGATGAGGATGAGGATGTCAATGTCACAGATCAACCAATTAAAAGATCAAGGGTCGGAGAATACAGAAAAAGGAGGAGGAAAGACGTAGCTTCTAGCTCCGATAATTCAAAAGCATATCAACGTTCAATCAGAAAATCCTCACGTTCTGTTAAAAAATCGGGAAAGGATAAACGAGTGCGGATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAACCCTGAAGTGGAGATTGAAGTTTCACCTTCATTACAAAATTCAAAAACAAAACAACAAGGAGAGCGCATAGTCTCACGTTTCTTTCAAAAATCAGAAGAACAAGAAGTAGTGAACAATCAACAAGAGGTTATACAGCTTCCAAGTCAGTGTGCAAAATCTGTTAAAAGAATCCGCAAACCAGCCAAGGAAAGAAAAGTGAGGGATAAAGTTTCTGCTAGGCCTAGAACCACTCTTTCGGCTGACGAGTTGTTTCTGGAAGCTTATAGAAGAAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCTTACGACCCTTGGAGGGTTCTTGTCATATGTATGCTCCTTAACCGGACGACTGGGCAGCAGGCAAAAGATGTGATACCTAAACTCTTCACGTTGTGTCCCGATCCAAAGTCTGCTTTGGAGGTATCACAAGAGCAGATAGAAGATATTATTCGACCTCTTGGTTTACAAAGAAAAAGATCACTTACAATTCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCAATATTTTGCACTGGATATTGGACCGAAGTATTACCTAAAGATCACATGCTTAATTATTACTGGGAGTTCCTCCACAGCATAAAACACCTGCTCTGATCTTATCTGAGACGACTGTAGATGGTTCGGCACGAGAGAAGCTGTAAATTTCCCGGTCTACTTAACATATATTCTTTGGTACGTTACTCTTTTGACATAATTTTGTTTTGTTTTGTTAATGTTCATGTTGTTGTTGGGAGTCTGTGAGACACTTATATCAACACATACTAGAAAGGGCAGAAATGGAAGCTCTTCTCCAGTAGCTAATTAGGCTGTAGTTGTGGGGGCTGAATGAAGTAGCGTGAGGGTCATGGTATGTTTTGAAAGGTCTGGGTAGCTTTACTAGATGACTTGAGACTGTTTGTAGGGTAATTATGTTACTCTATGGGCTGTGC
CTTCCCTTTTGAGGAACTAGTGCCAGCGAAGACGCTGAATTCCGATGGATTCCGACCTATCTTGCTTAATGTAAATATAAATATATTTTTAAAGAGAAATTTTCTGAATTTCACTTTCAAGTT

Coding sequence (CDS)

ATGAGTGCAACTACAATCATGAACCCTAATCTCTCCCCTCCCTCCTCATCTTCATTTCCCGATTTCTCGGTTCAATGCTCTTCCAGATTTCGCTTTCCTCCTTCCAAATGCCCCTCCGACTCGAATCCTCAAAACCCCACGCCGGAGGATTTTACCCAAAAGAGGACCACTCTCATGGCGCAAAACTCTCCGATTTCGACTCTTGAGGTTCTCCAAACCTCTGAATCGAATCATCAGAAGACAGCCGTAGGGCACGAGATTCCGATTTTGTGTATTGAGGATCTTCAGGATAACCCGAAGCGTGGGACTTCCACATTAACCGTAGAGGATGTCCAACAAGTTTCACCCAAAACCCCAACTTCTGAAAGGGAAAGGGTTTTAGCGCATGAGCCTCCTATATTGACTCTAGAGGATCTTCAAAATGCAAAATCAGACCATCAACCGGCGATAAAGCCTCCATTGGCTCGAAGGGTTTTACGGTTTTACCGGCAGTTTGGGTTTGATGAACAAATAGTGCAAAAAACCCCACCTTCTGTCCGAAATTCCATGCCAGTTCAACGAGATGAACGTGTAGTTTCGCGTCATTTCCAGGAATCAAAATCAACCCAGCAAGGAGAACGAATTGTATCACGCTACTTTCAACACTCGGAGATAGAACAAGCAGCCCATAATGAGGATGAGGATGAGGATGTCAATGTCACAGATCAACCAATTAAAAGATCAAGGGTCGGAGAATACAGAAAAAGGAGGAGGAAAGACGTAGCTTCTAGCTCCGATAATTCAAAAGCATATCAACGTTCAATCAGAAAATCCTCACGTTCTGTTAAAAAATCGGGAAAGGATAAACGAGTGCGGATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAACCCTGAAGTGGAGATTGAAGTTTCACCTTCATTACAAAATTCAAAAACAAAACAACAAGGAGAGCGCATAGTCTCACGTTTCTTTCAAAAATCAGAAGAACAAGAAGTAGTGAACAATCAACAAGAGGTTATACAGCTTCCAAGTCAGTGTGCAAAATCTGTTAAAAGAATCCGCAAACCAGCCAAGGAAAGAAAAGTGAGGGATAAAGTTTCTGCTAGGCCTAGAACCACTCTTTCGGCTGACGAGTTGTTTCTGGAAGCTTATAGAAGAAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCTTACGACCCTTGGAGGGTTCTTGTCATATGTATGCTCCTTAACCGGACGACTGGGCAGCAGGCAAAAGATGTGATACCTAAACTCTTCACGTTGTGTCCCGATCCAAAGTCTGCTTTGGAGGTATCACAAGAGCAGATAGAAGATATTATTCGACCTCTTGGTTTACAAAGAAAAAGATCACTTACAATTCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCAATATTTTGCACTGGATATTGGACCGAAGTATTACCTAAAGATCACATGCTTAATTATTACTGGGAGTTCCTCCACAGCATAAAACACCTGCTCTGA

Protein sequence

MSATTIMNPNLSPPSSSSFPDFSVQCSSRFRFPPSKCPSDSNPQNPTPEDFTQKRTTLMAQNSPISTLEVLQTSESNHQKTAVGHEIPILCIEDLQDNPKRGTSTLTVEDVQQVSPKTPTSERERVLAHEPPILTLEDLQNAKSDHQPAIKPPLARRVLRFYRQFGFDEQIVQKTPPSVRNSMPVQRDERVVSRHFQESKSTQQGERIVSRYFQHSEIEQAAHNEDEDEDVNVTDQPIKRSRVGEYRKRRRKDVASSSDNSKAYQRSIRKSSRSVKKSGKDKRVRIVSRYFQNSEKNPEVEIEVSPSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPAKERKVRDKVSARPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKHLL
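
The protein sequence is the translation of the coding sequence listed before it. As a rough illustration, the sketch below translates the first and last few codons of the CDS with the standard genetic code; the helper `translate` is hypothetical, and using the standard code (rather than an organelle-specific table) is an assumption for this nuclear gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases and last 12 bases of the CDS shown in this record.
print(translate("ATGAGTGCAACTACAATCATGAACCCTAATCTC"))  # MSATTIMNPNL
print(translate("CACCTGCTCTGA"))                       # HLL
```

Both fragments agree with the start (MSATTIMNPNL) and end (HLL) of the protein sequence in this record.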
BLAST of Cp4.1LG04g00110 vs. Swiss-Prot
Match: MBD4L_ARATH (Methyl-CpG-binding domain protein 4-like protein OS=Arabidopsis thaliana GN=MBD4L PE=1 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 1.8e-50
Identity = 131/318 (41.19%), Positives = 185/318 (58.18%), Query Frame = 1

Query: 227 EDEDVNVTDQPIKRSRVGEYRKRRRKDVASSSDNSKAYQRSIRKSSRSV-KKSGKDK--- 286
           +D+D +V+D  I+R    E+    R+       ++ + Q      S SV  K G  K   
Sbjct: 123 DDDDDSVSDSHIERQECSEFHVEVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQA 182

Query: 287 RVRIVSRYFQNSEKNPEVEIEVSPSLQNSKTKQQGE-------RIVSRFFQKSEEQEVVN 346
           +V  VS YFQ S  + + + ++  S Q+ +  ++G        R VS +FQ+S   E  N
Sbjct: 183 KVPRVSPYFQASTIS-QCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPN 242

Query: 347 -------NQQEVIQLPSQCAKSVKRIRKPAKERKVRDKVSARPRTTLSADELFLEAYRRK 406
                  N  +V+++         ++ +  KE+    + +      LS  +   + Y RK
Sbjct: 243 QAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRK 302

Query: 407 SSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKDVIPKLFTLCPDPKSAL 466
           + D+TW PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q + VI  LF LC D K+A 
Sbjct: 303 TPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTAT 362

Query: 467 EVSQEQIEDIIRPLGLQRKRSLTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGY 526
           EV +E+IE++I+PLGLQ+KR+  IQRLS  YL+ESW+HVTQL GVGKY ADA+AIFC G 
Sbjct: 363 EVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGN 422
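
Each HSP summary line reports identical and similar residues as counts over the alignment length, with a percentage in parentheses. The sketch below recomputes those percentages from the counts; the function name `hsp_percentages` is hypothetical, and the pattern assumes the summary-line layout used in this record.

```python
import re

# Assumed summary-line layout: "Identity = a/n (x%), Positives = b/n (y%), ..."
# The pattern also tolerates the "Postives" spelling seen in some exports.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(x) for x in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 131/318 (41.19%), Positives = 185/318 (58.18%), Query Frame = 1"
print(hsp_percentages(line))  # (41.19, 58.18)
```

For the MBD4L_ARATH hit this reproduces the reported values: 131 of 318 aligned positions are identical and 185 of 318 are scored as positive.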

BLAST of Cp4.1LG04g00110 vs. Swiss-Prot
Match: MBD4_MOUSE (Methyl-CpG-binding domain protein 4 OS=Mus musculus GN=Mbd4 PE=1 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 1.6e-22
Identity = 68/228 (29.82%), Positives = 116/228 (50.88%), Query Frame = 1

Query: 303 EVSPSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPA--KE 362
           E +      +T  + E I S+  +K E        Q+  ++PS C+++ K        ++
Sbjct: 318 EAAGEANREQTFLESEEIRSKGDRKGEAHLHTGVLQDGSEMPS-CSQAKKHFTSETFQED 377

Query: 363 RKVRDKVSARPRTTLSADELFLEAYR--RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVI 422
              R +V  R  +   + +   EA    R+ S   W PP S   L+Q+   +DPW++L+ 
Sbjct: 378 SIPRTQVEKRKTSLYFSSKYNKEALSPPRRKSFKKWTPPRSPFNLVQEILFHDPWKLLIA 437

Query: 423 CMLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEM 482
            + LNRT+G+ A  V+ +     P  + A       + ++++PLGL   R+ TI + S+ 
Sbjct: 438 TIFLNRTSGKMAIPVLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDE 497

Query: 483 YLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFL 527
           YL + W +  +L G+GKYG D++ IFC   W +V P+DH LN Y ++L
Sbjct: 498 YLTKQWRYPIELHGIGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWL 544

BLAST of Cp4.1LG04g00110 vs. Swiss-Prot
Match: MBD4_HUMAN (Methyl-CpG-binding domain protein 4 OS=Homo sapiens GN=MBD4 PE=1 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 2.1e-22
Identity = 66/233 (28.33%), Positives = 117/233 (50.21%), Query Frame = 1

Query: 310 NSKTKQQGERIVSRFFQKSE---EQEVVNNQQ----EVIQLPSQCAKSVKRIRKPAKERK 369
           ++K  +  E+    F +  E   + EVV  ++    ++++  S+   +    RK     K
Sbjct: 338 SAKDSEHNEKYEDTFLESEEIGTKVEVVERKEHLHTDILKRGSEMDNNCSPTRKDFTGEK 397

Query: 370 V-------RDKVSARPRTTLSADELFLEAYR--RKSSDDTWKPPPSGIRLLQQDHAYDPW 429
           +       R ++  R  +   + +   EA    R+ +   W PP S   L+Q+   +DPW
Sbjct: 398 IFQEDTIPRTQIERRKTSLYFSSKYNKEALSPPRRKAFKKWTPPRSPFNLVQETLFHDPW 457

Query: 430 RVLVICMLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQ 489
           ++L+  + LNRT+G+ A  V+ K     P  + A       + ++++PLGL   R+ TI 
Sbjct: 458 KLLIATIFLNRTSGKMAIPVLWKFLEKYPSAEVARTADWRDVSELLKPLGLYDLRAKTIV 517

Query: 490 RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFL 527
           + S+ YL + W +  +L G+GKYG D++ IFC   W +V P+DH LN Y ++L
Sbjct: 518 KFSDEYLTKQWKYPIELHGIGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWL 570

BLAST of Cp4.1LG04g00110 vs. TrEMBL
Match: A0A0A0KRW9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G630730 PE=4 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 4.8e-130
Identity = 283/506 (55.93%), Positives = 346/506 (68.38%), Query Frame = 1

Query: 1   MSATTIMNPNLSPPSSSSFPDFSVQCSSRFRFPPSKCPSDSNPQNPTPEDFTQKRTTLMA 60
           M++TT ++PNL+PPSSSS+P                        +    +F  + T+   
Sbjct: 1   MASTTSIHPNLTPPSSSSYP------------------------HDLFSEFVFRGTSRSR 60

Query: 61  QNSPISTLEVLQTSESNHQKTAVGHEIPILCIEDLQDNPKRGTSTLTVEDVQQVSPKTPT 120
              P S        + N  + +  H  P+  + DLQ        T    +    S  +P+
Sbjct: 61  FRFPPSKSA---QQDPNPYQDSTQHS-PLSTLHDLQ--------TPEPSNHHNESLASPS 120

Query: 121 SERERVLAHEPPILTLEDLQNAKSDHQPAIKPPLARRVLRFYRQFGFDEQIVQKTPPSVR 180
           SE      HEPPILTLEDLQN K   Q   +P LARRVL FYR+FGFD++++Q T  SV 
Sbjct: 121 SE-----VHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVL 180

Query: 181 NSMPVQRDERVVSRHFQESKSTQQGERIVSRYFQHSEIEQAAHNEDEDEDVNVTDQPIKR 240
           NS+P Q   RVVSR+FQ S+STQQ +RIVSRYFQ S  E+ AH EDE++  N+T+QP KR
Sbjct: 181 NSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKR 240

Query: 241 SRVGEYRKRRRKDVASSSDNSKAYQRSIRKSSRSVKKSGKDKRVRIVSRYFQNSEKNPEV 300
           S      KRRRKDV   SDNSK    S+ K++RSV+KSG D +VRIVS YFQ+ EK+ E+
Sbjct: 241 SS-----KRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEM 300

Query: 301 EIEVSPSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPAKE 360
           + EVSPSLQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  E
Sbjct: 301 DREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRLRKPVNE 360

Query: 361 RKVRDKVSA-RPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVIC 420
           RK +DK S+ +PRTTL+A ELFLEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVIC
Sbjct: 361 RKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVIC 420

Query: 421 MLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEMY 480
           MLLNRT+GQQAK+VIPKLF+LCP+PK+ LEVS+EQIEDIIRPLG  RKRS T+ RLSEMY
Sbjct: 421 MLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMY 460

Query: 481 LKESWSHVTQLPGVGKYGADAHAIFC 506
           LKESWSHVTQLPGVGKY A    + C
Sbjct: 481 LKESWSHVTQLPGVGKYLAYPCTLSC 460

BLAST of Cp4.1LG04g00110 vs. TrEMBL
Match: B9RKX6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1564050 PE=4 SV=1)

HSP 1 Score: 227.3 bits (578), Expect = 4.5e-56
Identity = 134/260 (51.54%), Positives = 163/260 (62.69%), Query Frame = 1

Query: 288 SRYFQNSEKNPEVEIEV---SPSLQNSKTKQQGERI---------------VSRFFQKSE 347
           +R  +N + N  V I+V   SP+   S  +Q+  +I               VS +FQK  
Sbjct: 355 TRNIENEKPNSRVHIQVRKVSPNFNLSIGQQECMKIKPLKPCERVGLTVRNVSPYFQK-- 414

Query: 348 EQEVVNNQQEVIQLPSQCAKSVKRIRKPAKERKVRDKVSARPRTTLSADELFLEAYRRKS 407
               V  Q+E     S    + K  +K   E+K R    AR   TLSA E   EAYRRK+
Sbjct: 415 ----VPKQEEEEAADSNMIDN-KHGQKKLPEKKKRP---ARKSITLSAAEKRSEAYRRKT 474

Query: 408 SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKDVIPKLFTLCPDPKSALE 467
            D+TWKPP S   LLQ+DHA DPWRVLVICMLLN TTG+Q + VI   FTLCPD K+A E
Sbjct: 475 PDNTWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATE 534

Query: 468 VSQEQIEDIIRPLGLQRKRSLTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW 527
              E+IE II PLGLQ+KR++ IQRLS+ YL + W+HVTQL GVGKY ADA+AIFCTG W
Sbjct: 535 AKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKW 594

Query: 528 TEVLPKDHMLNYYWEFLHSI 530
            +V PKDHMLNYYW+FLH I
Sbjct: 595 DQVRPKDHMLNYYWDFLHKI 604

BLAST of Cp4.1LG04g00110 vs. TrEMBL
Match: A0A151SQL7_CAJCA (Methyl-CpG-binding domain protein 4 OS=Cajanus cajan GN=KK1_003378 PE=4 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 2.1e-53
Identity = 126/286 (44.06%), Positives = 170/286 (59.44%), Query Frame = 1

Query: 246 YRKRRRKDVASSSDNSKAYQRSIRKSSRSVKKSGKDKRVRIVSRYFQNSEKNPEVEIEVS 305
           ++KR   +    + N     +   K ++ + +      +R VS YF N       +I V 
Sbjct: 91  FKKRAISNKLRENGNEATTSKIKSKKTKPIVQKNVAHGIRYVSPYFHNDNGK---KINVK 150

Query: 306 PSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPAKERKVRD 365
           P +++SK+    E I    F+ S E ++  N        S C++    I++         
Sbjct: 151 PLVKHSKS----ESIDLHAFENSAEDQLEVNT-------SSCSEESIEIKRK-------- 210

Query: 366 KVSARPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT 425
                    LSA E++ EAY+R++ D+TWKPP S   L+Q+DH +DPWRVLVICMLLNRT
Sbjct: 211 ---------LSALEIWDEAYKRRTPDNTWKPPRSATGLIQEDHIHDPWRVLVICMLLNRT 270

Query: 426 TGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEMYLKESWS 485
           TG+QAK ++  LF LCPD KS  +V++E+IE  I+ LGLQ KR+  +QR SE YL ESW+
Sbjct: 271 TGRQAKKIVSDLFKLCPDAKSCTQVAREEIEKTIQSLGLQHKRAAMLQRFSEEYLDESWT 330

Query: 486 HVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKH 532
           HVTQL GVGKY ADA+AIF TG W  V P DHMLNYYWEFLH IK+
Sbjct: 331 HVTQLHGVGKYAADAYAIFITGMWDRVKPTDHMLNYYWEFLHRIKY 345

BLAST of Cp4.1LG04g00110 vs. TrEMBL
Match: A0A067LF05_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10788 PE=4 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 2.0e-51
Identity = 133/292 (45.55%), Positives = 173/292 (59.25%), Query Frame = 1

Query: 245 EYRKRRRK-DVASSSDNSKAYQRSIRKSSRSVKKS---GKDKRVRIVSRYFQNSEKNPEV 304
           E RK  RK    ++  N   Y + +     +   S   GK KR +       +S+KN E 
Sbjct: 317 EQRKTSRKRKTGATIQNVSPYFKKVSNEQEAEASSLIDGKRKRKK-------SSKKNKEE 376

Query: 305 EIEVS-PSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPAK 364
             E++ P+++N          VS +F K E  +  N Q+       Q +K  KR      
Sbjct: 377 PCEIAGPTVRN----------VSPYFHKEEAADSNNGQK-------QSSKGRKR------ 436

Query: 365 ERKVRDKVSARPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVIC 424
                   SAR    L+A E   EAY RK+ D+TWKPP S   LLQ++HA+DPWRVLVIC
Sbjct: 437 --------SARTSIVLTASEKRSEAYLRKTPDNTWKPPQSEHGLLQENHAHDPWRVLVIC 496

Query: 425 MLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEMY 484
           MLLN TTG Q + VI  LFTLCP  ++A+ V +E+IE II PLGLQ+KR++ IQR+S+ Y
Sbjct: 497 MLLNCTTGTQVRRVIEDLFTLCPSAEAAINVMKEEIERIIEPLGLQKKRAVMIQRMSQEY 556

Query: 485 LKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKH 532
           L++ W+HVTQL GVGKY ADA+AIFCTG W +V P DHMLNYYWEFL  I +
Sbjct: 557 LEDHWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPADHMLNYYWEFLGRINN 570

BLAST of Cp4.1LG04g00110 vs. TrEMBL
Match: V7D2N5_PHAVU (Uncharacterized protein (Fragment) OS=Phaseolus vulgaris GN=PHAVU_L0004001g PE=4 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 5.8e-51
Identity = 101/174 (58.05%), Positives = 126/174 (72.41%), Query Frame = 1

Query: 356 KPAKERKVRDKVSARPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRV 415
           KP + +    + S   +  LSA + + EAY+RK+ D TWKPP S   L+Q+DHA+DPWRV
Sbjct: 539 KPEENKSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRV 598

Query: 416 LVICMLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRL 475
           LVICMLLNRT+G+Q K+++   F LCPD KS  EVS+E+IE+ I+ LG Q KR+  ++RL
Sbjct: 599 LVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRL 658

Query: 476 SEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSI 530
           SE YL ESW+HVTQL GVGKY ADA+AIF TG    V P DHMLNYYWEFL  I
Sbjct: 659 SEEYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFLRRI 712

BLAST of Cp4.1LG04g00110 vs. TAIR10
Match: AT3G07930.3 (AT3G07930.3 DNA glycosylase superfamily protein)

HSP 1 Score: 201.8 bits (512), Expect = 1.0e-51
Identity = 131/318 (41.19%), Positives = 185/318 (58.18%), Query Frame = 1

Query: 227 EDEDVNVTDQPIKRSRVGEYRKRRRKDVASSSDNSKAYQRSIRKSSRSV-KKSGKDK--- 286
           +D+D +V+D  I+R    E+    R+       ++ + Q      S SV  K G  K   
Sbjct: 123 DDDDDSVSDSHIERQECSEFHVEVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQA 182

Query: 287 RVRIVSRYFQNSEKNPEVEIEVSPSLQNSKTKQQGE-------RIVSRFFQKSEEQEVVN 346
           +V  VS YFQ S  + + + ++  S Q+ +  ++G        R VS +FQ+S   E  N
Sbjct: 183 KVPRVSPYFQASTIS-QCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPN 242

Query: 347 -------NQQEVIQLPSQCAKSVKRIRKPAKERKVRDKVSARPRTTLSADELFLEAYRRK 406
                  N  +V+++         ++ +  KE+    + +      LS  +   + Y RK
Sbjct: 243 QAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRK 302

Query: 407 SSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKDVIPKLFTLCPDPKSAL 466
           + D+TW PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q + VI  LF LC D K+A 
Sbjct: 303 TPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTAT 362

Query: 467 EVSQEQIEDIIRPLGLQRKRSLTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGY 526
           EV +E+IE++I+PLGLQ+KR+  IQRLS  YL+ESW+HVTQL GVGKY ADA+AIFC G 
Sbjct: 363 EVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGN 422

BLAST of Cp4.1LG04g00110 vs. NCBI nr
Match: gi|659121238|ref|XP_008460559.1| (PREDICTED: uncharacterized protein LOC103499353 [Cucumis melo])

HSP 1 Score: 554.3 bits (1427), Expect = 2.3e-154
Identity = 326/539 (60.48%), Positives = 384/539 (71.24%), Query Frame = 1

Query: 1   MSATTIMNPNLSPPSSSSFPDFSVQCSSRFRFPPSKCPSDSNPQNPTPEDFTQKRTTLMA 60
           M+ATT +NPNL+PPSSSS+P                        +    +F  + T+   
Sbjct: 1   MAATTSINPNLTPPSSSSYP------------------------HDLFSEFVFRGTSRSR 60

Query: 61  QNSPISTLEVLQTSESNHQ-----KTAVGHEIPILCIEDLQDNPKRGTSTLTVEDVQQVS 120
              P         S+S HQ     + +  H  PI  + DLQ        T    +    S
Sbjct: 61  FRFP--------PSKSAHQNPNPYQDSTQHS-PISTLYDLQ--------TSEPNNHHNKS 120

Query: 121 PKTPTSERERVLAHEPPILTLEDLQNAKSDHQPAIKPPLARRVLRFYRQFGFDEQIVQKT 180
             +P+SE     A EPPILTLEDLQN K   Q   KP LARRVL FYR+FGFD++++Q T
Sbjct: 121 LASPSSE-----ADEPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQAT 180

Query: 181 PPSVRNSMPVQRDERVVSRHFQESKSTQQGERIVSRYFQHSEIEQAAHNEDEDEDVNVTD 240
             SV NS PVQ   RVVSR+FQ S+STQQ ERIVSRYF+ S  E+AAH EDE++D N+T+
Sbjct: 181 SHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTE 240

Query: 241 QPIKRSRVGEYRKRRRKDVASSSDNSKAYQRSIRKSSRSVKKSGKDKRVRIVSRYFQNSE 300
           QP KRS      KRRRKDV  SS NSK    S+ K+SRSV+KS  D R RIVS YFQ SE
Sbjct: 241 QPSKRSS-----KRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSE 300

Query: 301 KNPEVEIEVSPSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIR 360
           K+ E++ EVSPSLQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+R
Sbjct: 301 KSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVR 360

Query: 361 KPAKERKVRDKVSA-RPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWR 420
           KP  ERK ++K S+ +PRTTL+A ELFLEAYRRKS DDTWKPPPSG RLLQ DHAYDPWR
Sbjct: 361 KPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWR 420

Query: 421 VLVICMLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQR 480
           VLVICMLLNRT+G+QAK+VIPKLF+LCP+PK+ LEVS+EQIEDIIRPLGL RKRS T+ R
Sbjct: 421 VLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHR 480

Query: 481 LSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKHLL 534
           LSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Sbjct: 481 LSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL 488

BLAST of Cp4.1LG04g00110 vs. NCBI nr
Match: gi|449449218|ref|XP_004142362.1| (PREDICTED: methyl-CpG-binding domain protein 4 [Cucumis sativus])

HSP 1 Score: 549.3 bits (1414), Expect = 7.5e-153
Identity = 315/534 (58.99%), Positives = 379/534 (70.97%), Query Frame = 1

Query: 1   MSATTIMNPNLSPPSSSSFPDFSVQCSSRFRFPPSKCPSDSNPQNPTPEDFTQKRTTLMA 60
           M++TT ++PNL+PPSSSS+P                        +    +F  + T+   
Sbjct: 1   MASTTSIHPNLTPPSSSSYP------------------------HDLFSEFVFRGTSRSR 60

Query: 61  QNSPISTLEVLQTSESNHQKTAVGHEIPILCIEDLQDNPKRGTSTLTVEDVQQVSPKTPT 120
              P S        + N  + +  H  P+  + DLQ        T    +    S  +P+
Sbjct: 61  FRFPPSKSA---QQDPNPYQDSTQHS-PLSTLHDLQ--------TPEPSNHHNESLASPS 120

Query: 121 SERERVLAHEPPILTLEDLQNAKSDHQPAIKPPLARRVLRFYRQFGFDEQIVQKTPPSVR 180
           SE      HEPPILTLEDLQN K   Q   +P LARRVL FYR+FGFD++++Q T  SV 
Sbjct: 121 SE-----VHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVL 180

Query: 181 NSMPVQRDERVVSRHFQESKSTQQGERIVSRYFQHSEIEQAAHNEDEDEDVNVTDQPIKR 240
           NS+P Q   RVVSR+FQ S+STQQ +RIVSRYFQ S  E+ AH EDE++  N+T+QP KR
Sbjct: 181 NSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKR 240

Query: 241 SRVGEYRKRRRKDVASSSDNSKAYQRSIRKSSRSVKKSGKDKRVRIVSRYFQNSEKNPEV 300
           S      KRRRKDV   SDNSK    S+ K++RSV+KSG D +VRIVS YFQ+ EK+ E+
Sbjct: 241 SS-----KRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEM 300

Query: 301 EIEVSPSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPAKE 360
           + EVSPSLQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  E
Sbjct: 301 DREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRLRKPVNE 360

Query: 361 RKVRDKVSA-RPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVIC 420
           RK +DK S+ +PRTTL+A ELFLEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVIC
Sbjct: 361 RKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVIC 420

Query: 421 MLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEMY 480
           MLLNRT+GQQAK+VIPKLF+LCP+PK+ LEVS+EQIEDIIRPLG  RKRS T+ RLSEMY
Sbjct: 421 MLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMY 480

Query: 481 LKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKHLL 534
           LKESWSHVTQLPGVGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Sbjct: 481 LKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL 488

BLAST of Cp4.1LG04g00110 vs. NCBI nr
Match: gi|700197193|gb|KGN52370.1| (hypothetical protein Csa_5G630730 [Cucumis sativus])

HSP 1 Score: 473.0 bits (1216), Expect = 6.8e-130
Identity = 283/506 (55.93%), Positives = 346/506 (68.38%), Query Frame = 1

Query: 1   MSATTIMNPNLSPPSSSSFPDFSVQCSSRFRFPPSKCPSDSNPQNPTPEDFTQKRTTLMA 60
           M++TT ++PNL+PPSSSS+P                        +    +F  + T+   
Sbjct: 1   MASTTSIHPNLTPPSSSSYP------------------------HDLFSEFVFRGTSRSR 60

Query: 61  QNSPISTLEVLQTSESNHQKTAVGHEIPILCIEDLQDNPKRGTSTLTVEDVQQVSPKTPT 120
              P S        + N  + +  H  P+  + DLQ        T    +    S  +P+
Sbjct: 61  FRFPPSKSA---QQDPNPYQDSTQHS-PLSTLHDLQ--------TPEPSNHHNESLASPS 120

Query: 121 SERERVLAHEPPILTLEDLQNAKSDHQPAIKPPLARRVLRFYRQFGFDEQIVQKTPPSVR 180
           SE      HEPPILTLEDLQN K   Q   +P LARRVL FYR+FGFD++++Q T  SV 
Sbjct: 121 SE-----VHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVL 180

Query: 181 NSMPVQRDERVVSRHFQESKSTQQGERIVSRYFQHSEIEQAAHNEDEDEDVNVTDQPIKR 240
           NS+P Q   RVVSR+FQ S+STQQ +RIVSRYFQ S  E+ AH EDE++  N+T+QP KR
Sbjct: 181 NSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKR 240

Query: 241 SRVGEYRKRRRKDVASSSDNSKAYQRSIRKSSRSVKKSGKDKRVRIVSRYFQNSEKNPEV 300
           S      KRRRKDV   SDNSK    S+ K++RSV+KSG D +VRIVS YFQ+ EK+ E+
Sbjct: 241 SS-----KRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEM 300

Query: 301 EIEVSPSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPAKE 360
           + EVSPSLQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  E
Sbjct: 301 DREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRLRKPVNE 360

Query: 361 RKVRDKVSA-RPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVIC 420
           RK +DK S+ +PRTTL+A ELFLEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVIC
Sbjct: 361 RKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVIC 420

Query: 421 MLLNRTTGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEMY 480
           MLLNRT+GQQAK+VIPKLF+LCP+PK+ LEVS+EQIEDIIRPLG  RKRS T+ RLSEMY
Sbjct: 421 MLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMY 460

Query: 481 LKESWSHVTQLPGVGKYGADAHAIFC 506
           LKESWSHVTQLPGVGKY A    + C
Sbjct: 481 LKESWSHVTQLPGVGKYLAYPCTLSC 460

BLAST of Cp4.1LG04g00110 vs. NCBI nr
Match: gi|255546672|ref|XP_002514395.1| (PREDICTED: uncharacterized protein LOC8285365 [Ricinus communis])

HSP 1 Score: 227.3 bits (578), Expect = 6.5e-56
Identity = 134/260 (51.54%), Positives = 163/260 (62.69%), Query Frame = 1

Query: 288 SRYFQNSEKNPEVEIEV---SPSLQNSKTKQQGERI---------------VSRFFQKSE 347
           +R  +N + N  V I+V   SP+   S  +Q+  +I               VS +FQK  
Sbjct: 355 TRNIENEKPNSRVHIQVRKVSPNFNLSIGQQECMKIKPLKPCERVGLTVRNVSPYFQK-- 414

Query: 348 EQEVVNNQQEVIQLPSQCAKSVKRIRKPAKERKVRDKVSARPRTTLSADELFLEAYRRKS 407
               V  Q+E     S    + K  +K   E+K R    AR   TLSA E   EAYRRK+
Sbjct: 415 ----VPKQEEEEAADSNMIDN-KHGQKKLPEKKKRP---ARKSITLSAAEKRSEAYRRKT 474

Query: 408 SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKDVIPKLFTLCPDPKSALE 467
            D+TWKPP S   LLQ+DHA DPWRVLVICMLLN TTG+Q + VI   FTLCPD K+A E
Sbjct: 475 PDNTWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATE 534

Query: 468 VSQEQIEDIIRPLGLQRKRSLTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW 527
              E+IE II PLGLQ+KR++ IQRLS+ YL + W+HVTQL GVGKY ADA+AIFCTG W
Sbjct: 535 AKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKW 594

Query: 528 TEVLPKDHMLNYYWEFLHSI 530
            +V PKDHMLNYYW+FLH I
Sbjct: 595 DQVRPKDHMLNYYWDFLHKI 604

BLAST of Cp4.1LG04g00110 vs. NCBI nr
Match: gi|1012345928|gb|KYP57120.1| (Methyl-CpG-binding domain protein 4 [Cajanus cajan])

HSP 1 Score: 218.4 bits (555), Expect = 3.0e-53
Identity = 126/286 (44.06%), Positives = 170/286 (59.44%), Query Frame = 1

Query: 246 YRKRRRKDVASSSDNSKAYQRSIRKSSRSVKKSGKDKRVRIVSRYFQNSEKNPEVEIEVS 305
           ++KR   +    + N     +   K ++ + +      +R VS YF N       +I V 
Sbjct: 91  FKKRAISNKLRENGNEATTSKIKSKKTKPIVQKNVAHGIRYVSPYFHNDNGK---KINVK 150

Query: 306 PSLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKSVKRIRKPAKERKVRD 365
           P +++SK+    E I    F+ S E ++  N        S C++    I++         
Sbjct: 151 PLVKHSKS----ESIDLHAFENSAEDQLEVNT-------SSCSEESIEIKRK-------- 210

Query: 366 KVSARPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT 425
                    LSA E++ EAY+R++ D+TWKPP S   L+Q+DH +DPWRVLVICMLLNRT
Sbjct: 211 ---------LSALEIWDEAYKRRTPDNTWKPPRSATGLIQEDHIHDPWRVLVICMLLNRT 270

Query: 426 TGQQAKDVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLSEMYLKESWS 485
           TG+QAK ++  LF LCPD KS  +V++E+IE  I+ LGLQ KR+  +QR SE YL ESW+
Sbjct: 271 TGRQAKKIVSDLFKLCPDAKSCTQVAREEIEKTIQSLGLQHKRAAMLQRFSEEYLDESWT 330

Query: 486 HVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKH 532
           HVTQL GVGKY ADA+AIF TG W  V P DHMLNYYWEFLH IK+
Sbjct: 331 HVTQLHGVGKYAADAYAIFITGMWDRVKPTDHMLNYYWEFLHRIKY 345

The following BLAST results are available for this feature:

Swiss-Prot
Match Name | E-value | Identity (%) | Description
MBD4L_ARATH | 1.8e-50 | 41.19 | Methyl-CpG-binding domain protein 4-like protein OS=Arabidopsis thaliana GN=MBD4...
MBD4_MOUSE | 1.6e-22 | 29.82 | Methyl-CpG-binding domain protein 4 OS=Mus musculus GN=Mbd4 PE=1 SV=1
MBD4_HUMAN | 2.1e-22 | 28.33 | Methyl-CpG-binding domain protein 4 OS=Homo sapiens GN=MBD4 PE=1 SV=1

TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KRW9_CUCSA | 4.8e-130 | 55.93 | Uncharacterized protein OS=Cucumis sativus GN=Csa_5G630730 PE=4 SV=1
B9RKX6_RICCO | 4.5e-56 | 51.54 | Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1564050 PE=4 SV=1
A0A151SQL7_CAJCA | 2.1e-53 | 44.06 | Methyl-CpG-binding domain protein 4 OS=Cajanus cajan GN=KK1_003378 PE=4 SV=1
A0A067LF05_JATCU | 2.0e-51 | 45.55 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10788 PE=4 SV=1
V7D2N5_PHAVU | 5.8e-51 | 58.05 | Uncharacterized protein (Fragment) OS=Phaseolus vulgaris GN=PHAVU_L0004001g PE=4...

TAIR10
Match Name | E-value | Identity (%) | Description
AT3G07930.3 | 1.0e-51 | 41.19 | DNA glycosylase superfamily protein

NCBI nr
Match Name | E-value | Identity (%) | Description
gi|659121238|ref|XP_008460559.1| | 2.3e-154 | 60.48 | PREDICTED: uncharacterized protein LOC103499353 [Cucumis melo]
gi|449449218|ref|XP_004142362.1| | 7.5e-153 | 58.99 | PREDICTED: methyl-CpG-binding domain protein 4 [Cucumis sativus]
gi|700197193|gb|KGN52370.1| | 6.8e-130 | 55.93 | hypothetical protein Csa_5G630730 [Cucumis sativus]
gi|255546672|ref|XP_002514395.1| | 6.5e-56 | 51.54 | PREDICTED: uncharacterized protein LOC8285365 [Ricinus communis]
gi|1012345928|gb|KYP57120.1| | 3.0e-53 | 44.06 | Methyl-CpG-binding domain protein 4 [Cajanus cajan]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term | Definition
GO:0006281 | DNA repair
GO:0006284 | base-excision repair
Vocabulary: Molecular Function
Term | Definition
GO:0003824 | catalytic activity
Vocabulary: INTERPRO
Term | Definition
IPR011257 | DNA_glycosylase
IPR003265 | HhH-GPD_domain
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006284 | base-excision repair
biological_process | GO:0006281 | DNA repair
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0016787 | hydrolase activity
molecular_function | GO:0003824 | catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG04g00110.1 | Cp4.1LG04g00110.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003265 | HhH-GPD domain | PFAM | PF00730 | HhH-GPD | coord: 419..494, score: 6.
IPR011257 | DNA glycosylase | GENE3D | G3DSA:1.10.340.30 |  | coord: 387..526, score: 1.2
IPR011257 | DNA glycosylase | unknown | SSF48150 | DNA-glycosylase | coord: 399..527, score: 6.28
None | No IPR available | PANTHER | PTHR15074 | 5-METHYLCYTOSINE G/T MISMATCH-SPECIFIC DNA GLYCOSYLASE | coord: 33..55, score: 1.8E-74; coord: 368..526, score: 1.8
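
The alignment coordinates in the InterPro table locate each predicted domain on the protein. As a small sketch, the snippet below computes the length of three of the reported intervals and checks whether the Pfam HhH-GPD hit lies inside the two DNA glycosylase hits. The names `DOMAINS`, `span_length`, and `contains` are hypothetical, and the coordinates are assumed to be 1-based inclusive residue positions.

```python
# Intervals copied from the InterPro table; keys are the source terms.
DOMAINS = {
    "PF00730": (419, 494),            # Pfam HhH-GPD
    "G3DSA:1.10.340.30": (387, 526),  # Gene3D DNA glycosylase
    "SSF48150": (399, 527),           # DNA-glycosylase
}


def span_length(interval):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    start, end = interval
    return end - start + 1


def contains(outer, inner):
    """True if the inner interval lies entirely within the outer interval."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, interval in DOMAINS.items():
    print(name, span_length(interval))

hhh = DOMAINS["PF00730"]
print(contains(DOMAINS["G3DSA:1.10.340.30"], hhh))  # True
print(contains(DOMAINS["SSF48150"], hhh))           # True
```

Under these assumptions the HhH-GPD hit covers 76 residues and sits entirely inside both DNA glycosylase hits, near the C-terminal end of the protein.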

The following gene(s) are orthologous to this gene:
Gene | Orthologue | Organism | Block
Cp4.1LG04g00110 | CmaCh11G011780 | Cucurbita maxima (Rimu) | cmacpeB147
Cp4.1LG04g00110 | CmoCh11G012620 | Cucurbita moschata (Rifu) | cmocpeB130
Cp4.1LG04g00110 | Carg17008 | Silver-seed gourd | carcpeB0657
The following gene(s) are paralogous to this gene:

None