ClCG08G005050 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG08G005050
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: methyl-CpG-binding domain protein 4-like protein
Location: CG_Chr08: 16026709 .. 16032126 (-)
RNA-Seq Expression: ClCG08G005050
Synteny: ClCG08G005050
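The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names introduced here for illustration.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr08: 16026709 .. 16032126 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 5418 bp on the minus strand of CG_Chr08.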
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAATTTATAACCCAGAGGAGAAAGGCGCGCGAAGGTAATGAAGCGATCATTTCGTGGTGTTCTGTGGCAAATCCCGCCATGACTGCAACAGCAAGCATCAATCCTAACCTCACCCCTCCGTCCTCTTCTTCCTTTCCCGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCGCGCTCCAGATTTTGCTTTCCTCCTTCAGAATCCACTCAACAAAACCCTACGTCCCAGGATTTTACCCAAAACACTACGATTCTCATGACCCAACACTCTCCAATTTCCACTCTTGAGGATTTCCAAATTTCAGAATCCAAGAATCATCAGAACAAACCCTTAGCCCGCAAGATTTCCATTTGCCCTTCTGATGATCTTCAAAACTGTCCAAACTGTGAGATTCCGGTAACATCCCTCTCTTCTGAAGCGCACGAGCCTCCTATTTTAACACTAGACGATCTTCAAAATGCCAAACCAGACCATTACCCGCCAAGAAAGCCTTCACTGGCGCGTAGAGTGTTACGTTTTTACCGAGAATTCGGATTTGATCAAAAAATGGTGCAAACAACTTCGCATTCTGACCTAAATTTAGAACCAGTTCAACAAGGGGCCCGTGTGGTTTCGCGATATTTCCAAAACTCAAAATCAACCCAACAAGGTGAACGAATTGTCGCACGATACTTTCAAAACTCGGAGAAGGAACGAGCAGCCCGTAATGAGGATGATGATGCCGATTTCACAGAGCAGACAAGTAAAAGATCAATGGTGGGAGGCTACAGCAAAAGGAGGAGGAAATACGTGGCCCCCAGCTCCGATAAGTCAAAAACAAATCAACATTCAATGGGAAAAGCTTCACGCTCTGTTCAGAAGTCAGGAACAGATAGACGAGTTCGAATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAATCTTGAAGTGGATCGAGAAGTTTCACCTTGCTTACGAAGTTCAAAATCAAATCAACAAACGGAGCAAATGGTCTCACGTTTCTTTCAAAAATCAGCAAAGCAACAAGCCGTGAACAGTCAGCAAGAGGCTACAGAGCAGCTAAATCAGCGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTTAATGAAAGGAAAGATAGGGATAAAACAAGTTCTGCTAAACCTCGGACCACTCTTTCTGCTGCAGAGTTGTTTTTGGAAGCTTATAGAAGGAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCCTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGGACAACTGGGCAGCAGGTATTTATCATCTGCTATACATTTACTTGATCTTGTATGCAATTTCCAATTCTGCACGGTATCATATTTTCCCCATAGCACATGAATTCGATTTGCTTTCATTACAGTTTTGCTACCTCACGGCTATGCAGTAGGATTTTCAAAAGGGGCAAGCACAACATACTTATGGCGTAAAACCTACTCTCTCCACCACTCATTTCCCTTTCCAAATCACAACATTACTGTCAGCTACCTAATTAACTCGCATTTCCTGGGTCCAACGATAATACACACGCACAACGCCAAGTAGTAGACCCACCCCTCACTTTTACTTGAATACAATTTGAGAACTGGGTGTCTATCAGTTTATAAAGCTTTGTTTTACCTCAGAATTTGGAGGGACAGAAACCAAGGATTAATTGAAGTGGATTATAGGCTGAAACTAAATTAGGTCAAACAGCAACTATGTGTGCTACTTCTAGTCAATCTTTGTTATTATTCTTCAAGTCTGATTAACTGAACTCTTGTTCTCTTGCTTTCTTGTAACTCTCAAAGGGGGTTCTTTATATTAGCGTTGACTGTTCAAATATCAATGTAAACTTTGTCTTGTGTTTAAAAGGAAATCTGAAAAATTTTGTTCCTACTTCTATGAGAGTAGAAGGTAGAAGTTTCTGATTTATATATATATATATGT
GTGTAAAAATCTGATGTATCATTGTTGATGGACATATTTGTTCCCTAAAGAGTGAAGTTCCTATACCTGGTCTTTGGTCCATGGTCGTACAAGCATGTTGGAACTATCATCTTCTTGATCCTACATGTATTTATGAATGGTGGGGGAGCTACGACCTTACTTGAACAAGATCTATTCTATAGATTGTACAAATGTGTCCGAGTTCTAAGGAATCGTGGGGGAACGAAACGCAAACGAACAATGTATTATCACAACCAGAATTTGGAGCAGTTTGATTACTACATTAACCTTTTGTCATTGCTTGTTTCTTTGCCAACTTGTTAACTCATAATTGATAATTTCATCTCCTTGTTTCCTGATTATGCACCCTTATTCCTATAGGTTTTTTTTTTTTTTTTTTTTTTTTAACTTTCAATTTGACTCTATGTGTGACCTGTCATTTTGAAAATTGCAGGCAAAAGAAGTAATACCAAAACTCTTCAGTTTGTGTCCCAATGCAGAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTACAAAGAAAAAGATCACGAACAATGCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCTTCCTGGTGTTGGCAAGTAATTTAGCCTATCCTTATGCACTTTTTTTTTTTATCTTTATCTATTTCTTTGAAAGGAACCATTACTTTTCATTGATAAAATGAAAAGAGACTATCGCTCAAAATATAAGTATACAAAATAACTGGAAAACTTAAAAAGTACAACGCCAAGAGAACCAAAACATAAGTTCATCAAGGAGGCTGCTTTGGCAAGTTATGGAAGAAAGTTATGTTTTTGCACAACTTCCAGCTTTTATTTTGAAAAAATATATACAAAAAAAAGGTCTTCGAGACATCGCTCAACTCAGGTTCTCAATGCCTATGTTATTTCACTTCCTAATTCAAAACATAAGTTTATCAAGGAGGTGCTCAGCCACTTGAATTCAGAACACCTTTTGATTGAAGACAAATTGGACGATGAAGCTTATGTAAGTGTAAGCAGTGAAAAACCTGATATCTTGGCTTTGAACTCTGTTCCAGAAGTGGTGGATGAATACATTGAAGAAGATATTGCAAGTTTATATCAAAACTCTTCTGGAGAAAAGTCTAATCAGTCTCAATCATTTAAGTTCTCTGATTGAATCATCTTTAATTCCTTCTGTAATCTCATCTTTAATTGAGAATCGTGGTTTGAAGTTTGGAAAAGCCCCTCATCATATTTAGCAGCAGCCTTGACTGTTTTGCTTTGAAAAGTGATGGAAATTATTACTTGGAATGAGATCAAAACAGATTGTCCTGAAAAGTTCTCTCCAAAATCAGCATCCAGATGTTGTCTTAATCCAAGAAACAAAGAAAGAGGAAGTGGTTGGTATTCTTATTAAATTTATCTGGAGTTCAAAGGAAGATGGCTGGTCTTTTTTATGGAAGGAATAAAGGAAGAAAGGTGATTCATCTAGTGAAATGGGTTTTGGTCTCCATTTCTCAGAAGGAAGGTCATCTTGGTTTGGGAGGCATAAAAATGAAACTTCAGCATGGGCTTTTAGTTTTGAAGAATGATAAAAGATGCTGGAATTTCTGAGTTTCAACAGTTATTACATGCTTTATGGGGCAAATTAGTGGAGTCTCATCTTGATTTTTGTATCTGGAGTTTGGTTTCATCTGAAAGATTCAAAGTTATATCAGTCCTTGACAGTACAAAAGTCATAGATAAAGAAGTTTATAAAGCTGTTTTTTGTAAGCATGTTGGAAGAAATTGTTATCAATCTTTCATATTCACTGGGACTTTGGAGACTCTTTCAAAGAATTTGTTTTTAAGTTCTAGTTGGTCCTCAAACTTAGCCTCATTCCCAATTGCTATGGTGTAATGTTGTCAAAGCCTTGTTGAATTTTGGTTGAAGGGGAATCAAAGGGAACTCCATGTTAAATCCC
TACATTGACTTGATCGTTTTGAAGCTGCAAGGATCATTGTTTCTTCTTGGTGTTCTCTTTCTAAGTTTTGTGTAAATTTTTCTATTCATTTTCAAACTTATTTCTATCCTAATAGCATTCCTATCAGATTGATCTCAAGAAAGGGAGTTCATGTTCTTCAAATTACTTGGTTTATCTTCTATATTTCAATATAATTTGATTCGGTTTTGATTTCTATCAAAGAATTGAAGAAGCTTCAAAGATCACTTGAATTGGAACCCGTACTCAGATGGAACACCTCACATGAATGACAATGAGGATGCTTGTAGTAGTAGACAGTACTTATAATTTTGTGGCAAAGATCATCTTGTTTTGGTTCATGTGAAGGTACCCTTTCAGCTTAAGTTAGTAGCTTGAGGTAGAAAGGAAGCTCTTGAAAGTAACTCTTGAATCTTCCCCTAAATTTGCACTTGTTTCCTCAGACTGTTAGTTTAACTCTATGAAGCCTAAGCCCTGTTCATAGGATAGTGAGGGCATGTTTGGGAGTAATTCTGAAATGGTCAAAATCACTTCAAAATATGCTTTTTAATCATTCAAAATCAATTTTGATGACATGAAAAATACATTTAAAAGTGAAAAGTTTAAGTATTAAATTGATTTTTGCGTGAGTAAAGACATGTTCGGAGTGATTTTGGATATGACAAAAGTGAGTTGAATTTGCTTGAGGGAGGCATCTAGTTACTTGAATTACTTCAGATTCTATATTTCTCCTTGTATTGCTCATTATGCCTTATATATTATGATATTGGTATTGGTATTTTTCTGAATGATGTTGCTTGACATGAATAAATTCCACATACATGCCCTCACCGTACCAAAATTTATCAGCTTTCCTAGAGCAACTTGTCTTAGGATGTGTGTTAAGCTAAATTTGATTGGAGACAGGTATGGAGCTGATGCACATGCAATATTCTGCACTGGATATTGGAATGAAGTAGTACCTGAAGATCACATGCTTAATTATTACTGGGATTTTCTCCACAGCATCAAACACCTGCTCTGATCTTATGTGACTATAGATAGTTTTGTACGGGATCGGAAGTTGGCTTCCTACTTCCTACATATATTTTTGAACGTTCTGACAGAAAGAAATTCTTTTGTGCTATCGGTTTTGTAAATGTTGTTGGGAGTTTGGGGGTTGATTGGGAACTACTATTATTAAAATATATACTTAGGAAGGGGAGAAATGGAAATTCTCCTCCAACAGCTAGTCAGGATAGACTGTAGCTGAAGCTGTGTAGTGTTTGTGGGGGAGCACGAGCAGCAGGGGATATTATGTTATGTTTTGTCTAGTCATCAAGGATAGCTTACCCTTGACTAAATATTTTGTAACATGACTATATGGCTGTAACCTTGCTTTTTAATCAACTAGTTTGTGC

mRNA sequence

AAAATTTATAACCCAGAGGAGAAAGGCGCGCGAAGGTAATGAAGCGATCATTTCGTGGTGTTCTGTGGCAAATCCCGCCATGACTGCAACAGCAAGCATCAATCCTAACCTCACCCCTCCGTCCTCTTCTTCCTTTCCCGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCGCGCTCCAGATTTTGCTTTCCTCCTTCAGAATCCACTCAACAAAACCCTACGTCCCAGGATTTTACCCAAAACACTACGATTCTCATGACCCAACACTCTCCAATTTCCACTCTTGAGGATTTCCAAATTTCAGAATCCAAGAATCATCAGAACAAACCCTTAGCCCGCAAGATTTCCATTTGCCCTTCTGATGATCTTCAAAACTGTCCAAACTGTGAGATTCCGGTAACATCCCTCTCTTCTGAAGCGCACGAGCCTCCTATTTTAACACTAGACGATCTTCAAAATGCCAAACCAGACCATTACCCGCCAAGAAAGCCTTCACTGGCGCGTAGAGTGTTACGTTTTTACCGAGAATTCGGATTTGATCAAAAAATGGTGCAAACAACTTCGCATTCTGACCTAAATTTAGAACCAGTTCAACAAGGGGCCCGTGTGGTTTCGCGATATTTCCAAAACTCAAAATCAACCCAACAAGGTGAACGAATTGTCGCACGATACTTTCAAAACTCGGAGAAGGAACGAGCAGCCCGTAATGAGGATGATGATGCCGATTTCACAGAGCAGACAAGTAAAAGATCAATGGTGGGAGGCTACAGCAAAAGGAGGAGGAAATACGTGGCCCCCAGCTCCGATAAGTCAAAAACAAATCAACATTCAATGGGAAAAGCTTCACGCTCTGTTCAGAAGTCAGGAACAGATAGACGAGTTCGAATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAATCTTGAAGTGGATCGAGAAGTTTCACCTTGCTTACGAAGTTCAAAATCAAATCAACAAACGGAGCAAATGGTCTCACGTTTCTTTCAAAAATCAGCAAAGCAACAAGCCGTGAACAGTCAGCAAGAGGCTACAGAGCAGCTAAATCAGCGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTTAATGAAAGGAAAGATAGGGATAAAACAAGTTCTGCTAAACCTCGGACCACTCTTTCTGCTGCAGAGTTGTTTTTGGAAGCTTATAGAAGGAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCCTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGGACAACTGGGCAGCAGGCAAAAGAAGTAATACCAAAACTCTTCAGTTTGTGTCCCAATGCAGAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTACAAAGAAAAAGATCACGAACAATGCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCTTCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCAATATTCTGCACTGGATATTGGAATGAAGTAGTACCTGAAGATCACATGCTTAATTATTACTGGGATTTTCTCCACAGCATCAAACACCTGCTCTGATCTTATGTGACTATAGATAGTTTTGTACGGGATCGGAAGTTGGCTTCCTACTTCCTACATATATTTTTGAACGTTCTGACAGAAAGAAATTCTTTTGTGCTATCGGTTTTGTAAATGTTGTTGGGAGTTTGGGGGTTGATTGGGAACTACTATTATTAAAATATATACTTAGGAAGGGGAGAAATGGAAATTCTCCTCCAACAGCTAGTCAGGATAGACTGTAGCTGAAGCTGTGTAGTGTTTGTGGGGGAGCACGAGCAGCAGGGGATATTATGTTATGTTTTGTCTAGTCATCAAGGATAGCTTACCCTTGACTAAATATTTTGTAACATGACTATATGGCTGTAACCTTGCTTTT
TAATCAACTAGTTTGTGC

Coding sequence (CDS)

ATGACTGCAACAGCAAGCATCAATCCTAACCTCACCCCTCCGTCCTCTTCTTCCTTTCCCGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCGCGCTCCAGATTTTGCTTTCCTCCTTCAGAATCCACTCAACAAAACCCTACGTCCCAGGATTTTACCCAAAACACTACGATTCTCATGACCCAACACTCTCCAATTTCCACTCTTGAGGATTTCCAAATTTCAGAATCCAAGAATCATCAGAACAAACCCTTAGCCCGCAAGATTTCCATTTGCCCTTCTGATGATCTTCAAAACTGTCCAAACTGTGAGATTCCGGTAACATCCCTCTCTTCTGAAGCGCACGAGCCTCCTATTTTAACACTAGACGATCTTCAAAATGCCAAACCAGACCATTACCCGCCAAGAAAGCCTTCACTGGCGCGTAGAGTGTTACGTTTTTACCGAGAATTCGGATTTGATCAAAAAATGGTGCAAACAACTTCGCATTCTGACCTAAATTTAGAACCAGTTCAACAAGGGGCCCGTGTGGTTTCGCGATATTTCCAAAACTCAAAATCAACCCAACAAGGTGAACGAATTGTCGCACGATACTTTCAAAACTCGGAGAAGGAACGAGCAGCCCGTAATGAGGATGATGATGCCGATTTCACAGAGCAGACAAGTAAAAGATCAATGGTGGGAGGCTACAGCAAAAGGAGGAGGAAATACGTGGCCCCCAGCTCCGATAAGTCAAAAACAAATCAACATTCAATGGGAAAAGCTTCACGCTCTGTTCAGAAGTCAGGAACAGATAGACGAGTTCGAATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAATCTTGAAGTGGATCGAGAAGTTTCACCTTGCTTACGAAGTTCAAAATCAAATCAACAAACGGAGCAAATGGTCTCACGTTTCTTTCAAAAATCAGCAAAGCAACAAGCCGTGAACAGTCAGCAAGAGGCTACAGAGCAGCTAAATCAGCGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTTAATGAAAGGAAAGATAGGGATAAAACAAGTTCTGCTAAACCTCGGACCACTCTTTCTGCTGCAGAGTTGTTTTTGGAAGCTTATAGAAGGAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCCTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGGACAACTGGGCAGCAGGCAAAAGAAGTAATACCAAAACTCTTCAGTTTGTGTCCCAATGCAGAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTACAAAGAAAAAGATCACGAACAATGCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCTTCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCAATATTCTGCACTGGATATTGGAATGAAGTAGTACCTGAAGATCACATGCTTAATTATTACTGGGATTTTCTCCACAGCATCAAACACCTGCTCTGA

Protein sequence

MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL
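The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, not something the database provides.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGACTGCAACAGCAAGCATCAATCCTAAC"))
```

Applied to the first ten codons of the CDS, the routine yields MTATASINPN, which matches the start of the listed protein sequence.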
Homology
BLAST of ClCG08G005050 vs. NCBI nr
Match: XP_038892490.1 (methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida])

HSP 1 Score: 722.2 bits (1863), Expect = 3.1e-204
Identity = 390/520 (75.00%), Positives = 424/520 (81.54%), Query Frame = 0

Query: 3   ATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTSQDFTQNTTILM 62
           ATASIN NLTPPSSSS+PDDLFSQFAFRGSSRSR C  PS+S+QQNPTSQDFTQNTTIL+
Sbjct: 4   ATASINSNLTPPSSSSYPDDLFSQFAFRGSSRSR-C--PSKSSQQNPTSQDFTQNTTILI 63

Query: 63  TQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVTSLSSEAHEPPIL 122
            QHSPI+T ED Q SE KNHQNK L+R+I ICP          EIP++S SS+ +EPPIL
Sbjct: 64  PQHSPIATREDLQASEPKNHQNKSLSREIPICPFQ--------EIPISSPSSDVYEPPIL 123

Query: 123 TLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSR 182
           TL+DLQNAKP   PP+KP LARR+L FYREFGFDQK+ Q TSHS LN EPVQ+GAR+ SR
Sbjct: 124 TLEDLQNAKPALQPPKKPPLARRILNFYREFGFDQKIAQPTSHSVLNSEPVQEGARMASR 183

Query: 183 YFQNSKSTQQGERIVARYFQNSEKERAARNEDDDAD--FTEQTSKRSMVGGYSKRRRKYV 242
           YFQNSKSTQQGER V+RYFQ S K+R A NED+D D   TEQ SKRS     SKRRRK V
Sbjct: 184 YFQNSKSTQQGERFVSRYFQKSVKKRVAHNEDEDEDVNLTEQPSKRS-----SKRRRKDV 243

Query: 243 APSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSSKSN 302
            PSSD SKTNQHSMGKASRS+QKSGTD+RVRIVSRYFQNSEKN+EVDR            
Sbjct: 244 DPSSDNSKTNQHSMGKASRSIQKSGTDKRVRIVSRYFQNSEKNIEVDR------------ 303

Query: 303 QQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRT 362
                                   EAT+Q+NQRAKS KRVRKPVNERK RDKTSS+KPRT
Sbjct: 304 ------------------------EATKQINQRAKSGKRVRKPVNERKQRDKTSSSKPRT 363

Query: 363 TLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEV 422
           TL+AAEL LEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAKEV
Sbjct: 364 TLTAAELSLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKEV 423

Query: 423 IPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGV 482
           IPKLF LCPN +A L+VS EQIEDIIRPLGLQRKRSRTMQ LSEMYLKE+WSHVTQLPGV
Sbjct: 424 IPKLFKLCPNPKATLDVSQEQIEDIIRPLGLQRKRSRTMQLLSEMYLKETWSHVTQLPGV 471

Query: 483 GKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL 521
           GKYGADAHAIFCTGYWNEV P+DHMLNYYW+FLHSI+HLL
Sbjct: 484 GKYGADAHAIFCTGYWNEVDPKDHMLNYYWEFLHSIRHLL 471
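The percentages in each HSP header are simple ratios over the alignment length: identical (or positively scoring) positions divided by the total number of aligned columns, gaps included. A minimal sketch, using the counts reported for the Benincasa hispida hit above; `percent` is a hypothetical helper name.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage string with two decimals."""
    return f"{100.0 * count / alignment_length:.2f}"

# Counts taken from the HSP header of the XP_038892490.1 hit.
print("Identity:", percent(390, 520))
print("Positives:", percent(424, 520))
```

These reproduce the 75.00% identity and 81.54% positives values given in the header.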

BLAST of ClCG08G005050 vs. NCBI nr
Match: XP_008460559.1 (PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo])

HSP 1 Score: 707.2 bits (1824), Expect = 1.0e-199
Identity = 391/523 (74.76%), Positives = 420/523 (80.31%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTT 60
           M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S  QNP   QD      
Sbjct: 1   MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQD------ 60

Query: 61  ILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVTSLSSEAHEP 120
              TQHSPISTL D Q SE  NH NK LA                      S SSEA EP
Sbjct: 61  --STQHSPISTLYDLQTSEPNNHHNKSLA----------------------SPSSEADEP 120

Query: 121 PILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARV 180
           PILTL+DLQN K     P+KPSLARRVL FYREFGFD+K++Q TSHS LN EPVQ+G RV
Sbjct: 121 PILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRV 180

Query: 181 VSRYFQNSKSTQQGERIVARYFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRR 240
           VSRYFQNS+STQQ ERIV+RYF+ S KERAA  ED  DD + TEQ SKRS     SKRRR
Sbjct: 181 VSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS-----SKRRR 240

Query: 241 KYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS 300
           K V PSS  SKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSP L++S
Sbjct: 241 KDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNS 300

Query: 301 KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAK 360
           KSNQQ E+MVSRFF KS KQQAVN+Q+EATEQLNQ AKSVKRVRKPVNERK ++KTSS K
Sbjct: 301 KSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTK 360

Query: 361 PRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQA 420
           PRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+G+QA
Sbjct: 361 PRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQA 420

Query: 421 KEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQL 480
           KEVIPKLFSLCPN +A LEVS EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQL
Sbjct: 421 KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQL 480

Query: 481 PGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL 521
           PGVGKYGADAHAIFCTGYW+EV P+DHMLNYYWDFLHSIKHLL
Sbjct: 481 PGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL 488

BLAST of ClCG08G005050 vs. NCBI nr
Match: XP_004142362.1 (methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus] >KAE8648887.1 hypothetical protein Csa_007768 [Cucumis sativus])

HSP 1 Score: 689.9 bits (1779), Expect = 1.7e-194
Identity = 376/523 (71.89%), Positives = 416/523 (79.54%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTT 60
           M +T SI+PNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S QQ+P   QD      
Sbjct: 1   MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQD------ 60

Query: 61  ILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVTSLSSEAHEP 120
              TQHSP+STL D Q  E  NH N+ LA                      S SSE HEP
Sbjct: 61  --STQHSPLSTLHDLQTPEPSNHHNESLA----------------------SPSSEVHEP 120

Query: 121 PILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARV 180
           PILTL+DLQN K     P++PSLARRVL FYREFGFD+K++Q TSHS LN  P Q+G RV
Sbjct: 121 PILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRV 180

Query: 181 VSRYFQNSKSTQQGERIVARYFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRR 240
           VSRYFQNS+STQQ +RIV+RYFQ S KER A  ED  D  + TEQ SKRS     SKRRR
Sbjct: 181 VSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS-----SKRRR 240

Query: 241 KYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS 300
           K V P SD SKTN HS+GK +RSVQKSGTD +VRIVS YFQ+ EK+LE+DREVSP L++S
Sbjct: 241 KDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNS 300

Query: 301 KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAK 360
           KSNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQ AKSVKR+RKPVNERK++DKTSS K
Sbjct: 301 KSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRLRKPVNERKEKDKTSSTK 360

Query: 361 PRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQA 420
           PRTTL+AAELFLEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNRT+GQQA
Sbjct: 361 PRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQA 420

Query: 421 KEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQL 480
           KEVIPKLFSLCPN +A LEVS EQIEDIIRPLG  RKRSRTM RLSEMYLKESWSHVTQL
Sbjct: 421 KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQL 480

Query: 481 PGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL 521
           PGVGKYGADAHAIFCTGYW+EV P+DHMLNYYWDFLHSIKHLL
Sbjct: 481 PGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL 488

BLAST of ClCG08G005050 vs. NCBI nr
Match: KAG7022375.1 (Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 688.3 bits (1775), Expect = 5.0e-194
Identity = 383/540 (70.93%), Positives = 432/540 (80.00%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQ 60
           MTAT  +NPN +PP SSSFPD LFSQFAF+G S SRF FP    PSES +QNPT +DFTQ
Sbjct: 1   MTATTIMNPNPSPP-SSSFPDFLFSQFAFQGCSSSRFRFPPSKCPSESNRQNPTPEDFTQ 60

Query: 61  NTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEI--------- 120
             + LM Q+SPISTLE  Q SE+ NHQ      +I I   +DLQ+ P  EI         
Sbjct: 61  KRSSLMAQNSPISTLEVLQTSEA-NHQKTAAWHEIPILCIEDLQDDPKREISTLTIEDVQ 120

Query: 121 ---PVTSLSS----EAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMV 180
              P T  S      AHEPPILTL+DLQNAK DH P  KP LARRVLRFYR+FGFD+++V
Sbjct: 121 EVSPKTPTSERERVSAHEPPILTLEDLQNAKSDHQPAMKPPLARRVLRFYRQFGFDEQIV 180

Query: 181 QTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDADFT 240
           Q T     N  PVQ   RVVSR+FQ SKSTQQGERIV+RYFQ+SE E+A+ NED+D + T
Sbjct: 181 QKTPPPVRNSMPVQLDERVVSRHFQESKSTQQGERIVSRYFQHSEIEQASHNEDEDVNAT 240

Query: 241 EQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNS 300
           +Q  KRS VG Y KRRRK VAPSSD SK  Q S+ K+SRSV+KSGTD+RVRIVSRYFQNS
Sbjct: 241 DQPIKRSGVGEYRKRRRKDVAPSSDNSKAYQRSIRKSSRSVKKSGTDKRVRIVSRYFQNS 300

Query: 301 EKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRV 360
           EKN EV+ EVSP L++SK+NQQ E++VSRFFQKS +Q+ VN+QQE T+Q +Q AKSVKR+
Sbjct: 301 EKNPEVEIEVSPSLQNSKTNQQGERVVSRFFQKSEEQEVVNNQQEVTQQPSQCAKSVKRI 360

Query: 361 RKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW 420
           RKP  ERK RDK  SA+PRTTLSA ELFLEAYRRKS DDTWKPPPSGIRLLQQDHAYDPW
Sbjct: 361 RKPAKERKVRDKV-SARPRTTLSADELFLEAYRRKSPDDTWKPPPSGIRLLQQDHAYDPW 420

Query: 421 RVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQ 480
           RVLVICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIEDIIRPLGLQRKRS T+Q
Sbjct: 421 RVLVICMLLNRTTGQQAKEVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQ 480

Query: 481 RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL 521
           RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSIKHLL
Sbjct: 481 RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKHLL 537

BLAST of ClCG08G005050 vs. NCBI nr
Match: XP_022931728.1 (methyl-CpG-binding domain protein 4-like protein [Cucurbita moschata])

HSP 1 Score: 679.9 bits (1753), Expect = 1.8e-191
Identity = 379/544 (69.67%), Positives = 431/544 (79.23%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQ 60
           MTAT  +NPNL+PPSSSSFPD LFSQFAF+G S SRF FP    PSES +QNPT +DFTQ
Sbjct: 1   MTATTIMNPNLSPPSSSSFPDFLFSQFAFQGCSSSRFRFPPSKCPSESNRQNPTPEDFTQ 60

Query: 61  NTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPN-----------C 120
             T LM Q+SPISTLE  Q SES NHQ     ++I I   +DLQ+ P             
Sbjct: 61  KRTTLMAQNSPISTLEVLQTSES-NHQKTAAGQEIPILCIEDLQDNPKRGSSTLTVEDVQ 120

Query: 121 EIPVTSLSSE-----AHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMV 180
           E+   + +SE      HEPPILTL+D+QNAK DH P  +P LARRVLRFYR+FGFD+++V
Sbjct: 121 EVSPKTPTSERERVLVHEPPILTLEDIQNAKSDHQPAIEPPLARRVLRFYRQFGFDEQIV 180

Query: 181 QTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDAD-- 240
           Q T  S  N  PVQ+  RVVSR+FQ SKS QQGERIV+RYFQ+SE ERAA NED+D D  
Sbjct: 181 QKTPPSVRNSMPVQRDERVVSRHFQESKSNQQGERIVSRYFQHSEIERAAHNEDEDEDED 240

Query: 241 --FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRY 300
              T+Q  KRS VG Y KRRRK VA SSD SK  Q S+ K+SR V++SGTD+RVR VSRY
Sbjct: 241 VNVTDQPIKRSRVGQYRKRRRKDVASSSDNSKAYQRSIRKSSRFVKESGTDKRVRFVSRY 300

Query: 301 FQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKS 360
           FQNSEKN EV+ EVSP L++SK+ QQ E++VSRFFQKS +Q+ VN+QQE  +  +Q AKS
Sbjct: 301 FQNSEKNPEVEIEVSPPLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKS 360

Query: 361 VKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHA 420
           VKR+RKP  ERK RDK  SA+PRTTLSA ELFLEAYRRKSSDDTWKPPPSGIRLLQQDHA
Sbjct: 361 VKRIRKPAKERKVRDKV-SARPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHA 420

Query: 421 YDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRS 480
           YDPWRVLVICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIEDIIRPLGLQRKRS
Sbjct: 421 YDPWRVLVICMLLNRTTGQQAKEVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRS 480

Query: 481 RTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSI 521
            T+QRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSI
Sbjct: 481 LTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSI 540

BLAST of ClCG08G005050 vs. ExPASy Swiss-Prot
Match: Q0IGK1 (Methyl-CpG-binding domain protein 4-like protein OS=Arabidopsis thaliana OX=3702 GN=MBD4L PE=1 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 1.7e-48
Identity = 132/320 (41.25%), Positives = 181/320 (56.56%), Query Frame = 0

Query: 213 EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR--- 272
           +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   
Sbjct: 123 DDDDDSVSDSHIERQECSEFHVEVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQA 182

Query: 273 RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQTE-QMVSRFFQKSAKQQAVNS 332
           +V  VS YFQ S     + ++    +     R   S +Q + + VS +FQ+S   +  N 
Sbjct: 183 KVPRVSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPN- 242

Query: 333 QQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYR 392
             +A + L    K VK  R        VNE    K R+   +      LS ++   + Y 
Sbjct: 243 --QAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 302

Query: 393 RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEA 452
           RK+ D+TW PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q + VI  LF LC +A+ 
Sbjct: 303 RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKT 362

Query: 453 ALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT 512
           A EV  E+IE++I+PLGLQ+KR++ +QRLS  YL+ESW+HVTQL GVGKY ADA+AIFC 
Sbjct: 363 ATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCN 422

Query: 513 GYWNEVVPEDHMLNYYWDFL 514
           G W+ V P DHMLNYYWD+L
Sbjct: 423 GNWDRVKPNDHMLNYYWDYL 439

BLAST of ClCG08G005050 vs. ExPASy Swiss-Prot
Match: O95243 (Methyl-CpG-binding domain protein 4 OS=Homo sapiens OX=9606 GN=MBD4 PE=1 SV=1)

HSP 1 Score: 122.9 bits (307), Expect = 1.1e-26
Identity = 54/140 (38.57%), Positives = 85/140 (60.71%), Query Frame = 0

Query: 374 RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEA 433
           R+ +   W PP S   L+Q+   +DPW++L+  + LNRT+G+ A  V+ K     P+AE 
Sbjct: 431 RRKAFKKWTPPRSPFNLVQETLFHDPWKLLIATIFLNRTSGKMAIPVLWKFLEKYPSAEV 490

Query: 434 ALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT 493
           A       + ++++PLGL   R++T+ + S+ YL + W +  +L G+GKYG D++ IFC 
Sbjct: 491 ARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSYRIFCV 550

Query: 494 GYWNEVVPEDHMLNYYWDFL 514
             W +V PEDH LN Y D+L
Sbjct: 551 NEWKQVHPEDHKLNKYHDWL 570

BLAST of ClCG08G005050 vs. ExPASy Swiss-Prot
Match: Q9Z2D7 (Methyl-CpG-binding domain protein 4 OS=Mus musculus OX=10090 GN=Mbd4 PE=1 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 4.2e-26
Identity = 54/140 (38.57%), Positives = 85/140 (60.71%), Query Frame = 0

Query: 374 RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEA 433
           R+ S   W PP S   L+Q+   +DPW++L+  + LNRT+G+ A  V+ +     P+AE 
Sbjct: 405 RRKSFKKWTPPRSPFNLVQEILFHDPWKLLIATIFLNRTSGKMAIPVLWEFLEKYPSAEV 464

Query: 434 ALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT 493
           A       + ++++PLGL   R++T+ + S+ YL + W +  +L G+GKYG D++ IFC 
Sbjct: 465 ARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSYRIFCV 524

Query: 494 GYWNEVVPEDHMLNYYWDFL 514
             W +V PEDH LN Y D+L
Sbjct: 525 NEWKQVHPEDHKLNKYHDWL 544

BLAST of ClCG08G005050 vs. ExPASy Swiss-Prot
Match: Q7LX22 (Thymine/uracil-DNA glycosylase OS=Pyrobaculum aerophilum (strain ATCC 51768 / IM2 / DSM 7523 / JCM 9630 / NBRC 100827) OX=178306 GN=PAE3199 PE=1 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 3.3e-07
Identity = 32/97 (32.99%), Positives = 52/97 (53.61%), Query Frame = 0

Query: 396 AYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKR 455
           A DPW VLV  +LL +TT +Q  ++  +     P+     + S E+I+ II+PLG++  R
Sbjct: 36  AGDPWAVLVAALLLRKTTVKQVVDIYREFLRRYPSPARLADASVEEIKAIIQPLGMEHVR 95

Query: 456 SRTMQRLSEMYLKESWSHV-------TQLPGVGKYGA 486
           +  +++LSE  ++     +         LPGVG Y A
Sbjct: 96  ATLLKKLSEELVRRFNGQIPCDRDALKSLPGVGDYAA 132

BLAST of ClCG08G005050 vs. ExPASy Swiss-Prot
Match: Q9YDP0 (Thymine-DNA glycosylase OS=Aeropyrum pernix (strain ATCC 700893 / DSM 11879 / JCM 9820 / NBRC 100138 / K1) OX=272557 GN=APE_0875.1 PE=1 SV=2)

HSP 1 Score: 56.2 bits (134), Expect = 1.3e-06
Identity = 28/93 (30.11%), Positives = 49/93 (52.69%), Query Frame = 0

Query: 398 DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSR 457
           DPW +LV   LL +TT +Q   V  +     PN +A      +++ ++IRPLG++ +R++
Sbjct: 35  DPWAILVAAFLLRKTTARQVVRVYEEFLRRYPNPKALASAREDEVRELIRPLGIEHQRAK 94

Query: 458 TMQRLSEMY-------LKESWSHVTQLPGVGKY 484
            +  L++         +  S   + +LPGVG Y
Sbjct: 95  HLIELAKHIEARYGGRIPCSKEKLKELPGVGDY 127

BLAST of ClCG08G005050 vs. ExPASy TrEMBL
Match: A0A1S3CCU6 (methyl-CpG-binding domain protein 4-like protein OS=Cucumis melo OX=3656 GN=LOC103499353 PE=4 SV=1)

HSP 1 Score: 707.2 bits (1824), Expect = 5.0e-200
Identity = 391/523 (74.76%), Positives = 420/523 (80.31%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTT 60
           M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S  QNP   QD      
Sbjct: 1   MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQD------ 60

Query: 61  ILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVTSLSSEAHEP 120
              TQHSPISTL D Q SE  NH NK LA                      S SSEA EP
Sbjct: 61  --STQHSPISTLYDLQTSEPNNHHNKSLA----------------------SPSSEADEP 120

Query: 121 PILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARV 180
           PILTL+DLQN K     P+KPSLARRVL FYREFGFD+K++Q TSHS LN EPVQ+G RV
Sbjct: 121 PILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRV 180

Query: 181 VSRYFQNSKSTQQGERIVARYFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRR 240
           VSRYFQNS+STQQ ERIV+RYF+ S KERAA  ED  DD + TEQ SKRS     SKRRR
Sbjct: 181 VSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS-----SKRRR 240

Query: 241 KYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS 300
           K V PSS  SKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSP L++S
Sbjct: 241 KDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNS 300

Query: 301 KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAK 360
           KSNQQ E+MVSRFF KS KQQAVN+Q+EATEQLNQ AKSVKRVRKPVNERK ++KTSS K
Sbjct: 301 KSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTK 360

Query: 361 PRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQA 420
           PRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+G+QA
Sbjct: 361 PRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQA 420

Query: 421 KEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQL 480
           KEVIPKLFSLCPN +A LEVS EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQL
Sbjct: 421 KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQL 480

Query: 481 PGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL 521
           PGVGKYGADAHAIFCTGYW+EV P+DHMLNYYWDFLHSIKHLL
Sbjct: 481 PGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL 488

BLAST of ClCG08G005050 vs. ExPASy TrEMBL
Match: A0A6J1EZJ4 (methyl-CpG-binding domain protein 4-like protein OS=Cucurbita moschata OX=3662 GN=LOC111437878 PE=4 SV=1)

HSP 1 Score: 679.9 bits (1753), Expect = 8.6e-192
Identity = 379/544 (69.67%), Positives = 431/544 (79.23%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQ 60
           MTAT  +NPNL+PPSSSSFPD LFSQFAF+G S SRF FP    PSES +QNPT +DFTQ
Sbjct: 1   MTATTIMNPNLSPPSSSSFPDFLFSQFAFQGCSSSRFRFPPSKCPSESNRQNPTPEDFTQ 60

Query: 61  NTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPN-----------C 120
             T LM Q+SPISTLE  Q SES NHQ     ++I I   +DLQ+ P             
Sbjct: 61  KRTTLMAQNSPISTLEVLQTSES-NHQKTAAGQEIPILCIEDLQDNPKRGSSTLTVEDVQ 120

Query: 121 EIPVTSLSSE-----AHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMV 180
           E+   + +SE      HEPPILTL+D+QNAK DH P  +P LARRVLRFYR+FGFD+++V
Sbjct: 121 EVSPKTPTSERERVLVHEPPILTLEDIQNAKSDHQPAIEPPLARRVLRFYRQFGFDEQIV 180

Query: 181 QTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDAD-- 240
           Q T  S  N  PVQ+  RVVSR+FQ SKS QQGERIV+RYFQ+SE ERAA NED+D D  
Sbjct: 181 QKTPPSVRNSMPVQRDERVVSRHFQESKSNQQGERIVSRYFQHSEIERAAHNEDEDEDED 240

Query: 241 --FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRY 300
              T+Q  KRS VG Y KRRRK VA SSD SK  Q S+ K+SR V++SGTD+RVR VSRY
Sbjct: 241 VNVTDQPIKRSRVGQYRKRRRKDVASSSDNSKAYQRSIRKSSRFVKESGTDKRVRFVSRY 300

Query: 301 FQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKS 360
           FQNSEKN EV+ EVSP L++SK+ QQ E++VSRFFQKS +Q+ VN+QQE  +  +Q AKS
Sbjct: 301 FQNSEKNPEVEIEVSPPLQNSKTKQQGERIVSRFFQKSEEQEVVNNQQEVIQLPSQCAKS 360

Query: 361 VKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHA 420
           VKR+RKP  ERK RDK  SA+PRTTLSA ELFLEAYRRKSSDDTWKPPPSGIRLLQQDHA
Sbjct: 361 VKRIRKPAKERKVRDKV-SARPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHA 420

Query: 421 YDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRS 480
           YDPWRVLVICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIEDIIRPLGLQRKRS
Sbjct: 421 YDPWRVLVICMLLNRTTGQQAKEVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRS 480

Query: 481 RTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSI 521
            T+QRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSI
Sbjct: 481 LTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSI 540

BLAST of ClCG08G005050 vs. ExPASy TrEMBL
Match: A0A5D3CU57 (Methyl-CpG-binding domain protein 4-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45G00130 PE=4 SV=1)

HSP 1 Score: 666.0 bits (1717), Expect = 1.3e-187
Identity = 373/511 (72.99%), Positives = 404/511 (79.06%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTT 60
           M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S  QNP   QD      
Sbjct: 1   MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQD------ 60

Query: 61  ILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVTSLSSEAHEP 120
              TQHSPISTL D Q SE  NH NK LA                      S SSEA EP
Sbjct: 61  --STQHSPISTLYDLQTSEPNNHHNKSLA----------------------SPSSEADEP 120

Query: 121 PILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARV 180
           PILTL+DLQN K     P+KPSLARRVL FYREFGFD+K++Q TSHS LN EPVQ+G RV
Sbjct: 121 PILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRV 180

Query: 181 VSRYFQNSKSTQQGERIVARYFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRR 240
           VSRYFQNS+STQQ ERIV+RYF+ S KERAA  ED  DD + TEQ SKRS     SKRRR
Sbjct: 181 VSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS-----SKRRR 240

Query: 241 KYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS 300
           K V PSS  SKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSP L++S
Sbjct: 241 KDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNS 300

Query: 301 KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAK 360
           KSNQQ E+MVSRFF KS KQQAVN+Q+EATEQLNQ AKSVKRVRKPVNERK ++KTSS K
Sbjct: 301 KSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTK 360

Query: 361 PRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQA 420
           PRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+G+QA
Sbjct: 361 PRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQA 420

Query: 421 KEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQL 480
           KEVIPKLFSLCPN +A LEVS EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQL
Sbjct: 421 KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQL 476

Query: 481 PGVGKYGADAHAIFCTGYWNEVVPEDHMLNY 509
           PGVGKYGADAHAIFCTGYWN  V E  ++++
Sbjct: 481 PGVGKYGADAHAIFCTGYWNGFVREAEVVDF 476
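
Note: the percentages on each HSP statistics line are the match counts divided by the aligned length. The following is a minimal Python sketch for re-deriving them from the text of this page. The function name `hsp_percentages` and the regular expression are illustrative assumptions and are not part of the BLAST output or of any database API.

```python
import re

# Matches the "Identity" and "Positives" fields of an HSP statistics line.
# The pattern also accepts the "Postives" spelling found in some exports.
HSP_FIELD = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def hsp_percentages(line: str) -> dict:
    """Return {field: (matches, aligned_length, recomputed_percent)}."""
    stats = {}
    for name, hits, length, _reported in HSP_FIELD.findall(line):
        key = "Identity" if name == "Identity" else "Positives"
        percent = round(100 * int(hits) / int(length), 2)
        stats[key] = (int(hits), int(length), percent)
    return stats


line = "Identity = 373/511 (72.99%), Positives = 404/511 (79.06%), Query Frame = 0"
print(hsp_percentages(line))
```

For the A0A5D3CU57 hit above, 373 of 511 aligned positions are identical, which rounds to the reported 72.99%.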

BLAST of ClCG08G005050 vs. ExPASy TrEMBL
Match: A0A0A0KRW9 (ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G630730 PE=4 SV=1)

HSP 1 Score: 612.8 bits (1579), Expect = 1.3e-171
Identity = 344/495 (69.49%), Positives = 383/495 (77.37%), Query Frame = 0

Query: 1   MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTT 60
           M +T SI+PNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S QQ+P   QD      
Sbjct: 1   MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQD------ 60

Query: 61  ILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVTSLSSEAHEP 120
              TQHSP+STL D Q  E  NH N+ LA                      S SSE HEP
Sbjct: 61  --STQHSPLSTLHDLQTPEPSNHHNESLA----------------------SPSSEVHEP 120

Query: 121 PILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARV 180
           PILTL+DLQN K     P++PSLARRVL FYREFGFD+K++Q TSHS LN  P Q+G RV
Sbjct: 121 PILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRV 180

Query: 181 VSRYFQNSKSTQQGERIVARYFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRR 240
           VSRYFQNS+STQQ +RIV+RYFQ S KER A  ED  D  + TEQ SKRS     SKRRR
Sbjct: 181 VSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS-----SKRRR 240

Query: 241 KYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS 300
           K V P SD SKTN HS+GK +RSVQKSGTD +VRIVS YFQ+ EK+LE+DREVSP L++S
Sbjct: 241 KDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNS 300

Query: 301 KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAK 360
           KSNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQ AKSVKR+RKPVNERK++DKTSS K
Sbjct: 301 KSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRLRKPVNERKEKDKTSSTK 360

Query: 361 PRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQA 420
           PRTTL+AAELFLEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNRT+GQQA
Sbjct: 361 PRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQA 420

Query: 421 KEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQL 480
           KEVIPKLFSLCPN +A LEVS EQIEDIIRPLG  RKRSRTM RLSEMYLKESWSHVTQL
Sbjct: 421 KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQL 460

Query: 481 PGVGKYGADAHAIFC 493
           PGVGKY A    + C
Sbjct: 481 PGVGKYLAYPCTLSC 460

BLAST of ClCG08G005050 vs. ExPASy TrEMBL
Match: A0A6J1HWM5 (methyl-CpG-binding domain protein 4-like protein isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111468538 PE=4 SV=1)

HSP 1 Score: 607.1 bits (1564), Expect = 7.1e-170
Identity = 338/477 (70.86%), Positives = 379/477 (79.45%), Query Frame = 0

Query: 62  MTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEI------------PV 121
           M  +SPISTLE  Q SE+ NHQ      +I I   + LQ+ P  EI            P 
Sbjct: 1   MALNSPISTLEVLQTSEA-NHQKTAAGHEIPILCIEYLQDDPKREISTLTVEDVQEVSPK 60

Query: 122 TSLSSE----AHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSH 181
           T  S      AHEPPILTL+DLQNAK DH P  KP LARRVLRF R+FGFD+++VQ T  
Sbjct: 61  TPTSERERVLAHEPPILTLEDLQNAKSDHQPAIKPPLARRVLRFCRQFGFDEQIVQKTPP 120

Query: 182 SDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARN--EDDDADFTEQT 241
           S  N  PVQ+  RVVSR+FQ SKS QQGERIV+RYFQ+SE ERAA N  EDDD + T+Q 
Sbjct: 121 SVRNSMPVQRDERVVSRHFQESKSNQQGERIVSRYFQHSEIERAAHNEDEDDDVNVTDQP 180

Query: 242 SKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKN 301
            KRS VG Y KRRRK VA SSD SK  Q S+ K+SRS++KSGTD+RVRIVSRYFQNSEKN
Sbjct: 181 FKRSRVGQYRKRRRKDVASSSDNSKAYQRSIRKSSRSIKKSGTDKRVRIVSRYFQNSEKN 240

Query: 302 LEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKP 361
            EV+ EVSP L++SK+NQQ E++VSRFFQKS + + VN+QQE  +  +Q AKSVKR+RKP
Sbjct: 241 PEVEIEVSPSLQNSKTNQQEERVVSRFFQKSEEHEVVNNQQEVIQLPSQCAKSVKRIRKP 300

Query: 362 VNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVL 421
             ERK RDK  SAKPRTTLSA ELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVL
Sbjct: 301 AKERKVRDKV-SAKPRTTLSADELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVL 360

Query: 422 VICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLS 481
           VICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIEDIIRPLGLQRKRS T+QRLS
Sbjct: 361 VICMLLNRTTGQQAKEVIPKLFTLCPDPKSALEVSQEQIEDIIRPLGLQRKRSLTIQRLS 420

Query: 482 EMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL 521
           EMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSIKHLL
Sbjct: 421 EMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTEVLPKDHMLNYYWEFLHSIKHLL 475

BLAST of ClCG08G005050 vs. TAIR 10
Match: AT3G07930.3 (DNA glycosylase superfamily protein )

HSP 1 Score: 195.3 bits (495), Expect = 1.2e-49
Identity = 132/320 (41.25%), Positives = 181/320 (56.56%), Query Frame = 0

Query: 213 EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR--- 272
           +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   
Sbjct: 123 DDDDDSVSDSHIERQECSEFHVEVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQA 182

Query: 273 RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQTE-QMVSRFFQKSAKQQAVNS 332
           +V  VS YFQ S     + ++    +     R   S +Q + + VS +FQ+S   +  N 
Sbjct: 183 KVPRVSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPN- 242

Query: 333 QQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYR 392
             +A + L    K VK  R        VNE    K R+   +      LS ++   + Y 
Sbjct: 243 --QAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 302

Query: 393 RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEA 452
           RK+ D+TW PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q + VI  LF LC +A+ 
Sbjct: 303 RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKT 362

Query: 453 ALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT 512
           A EV  E+IE++I+PLGLQ+KR++ +QRLS  YL+ESW+HVTQL GVGKY ADA+AIFC 
Sbjct: 363 ATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCN 422

Query: 513 GYWNEVVPEDHMLNYYWDFL 514
           G W+ V P DHMLNYYWD+L
Sbjct: 423 GNWDRVKPNDHMLNYYWDYL 439

BLAST of ClCG08G005050 vs. TAIR 10
Match: AT3G07930.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 63.5 bits (153), Expect = 5.6e-10
Identity = 71/223 (31.84%), Positives = 104/223 (46.64%), Query Frame = 0

Query: 213 EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR--- 272
           +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   
Sbjct: 123 DDDDDSVSDSHIERQECSEFHVEVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQA 182

Query: 273 RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQTE-QMVSRFFQKSAKQQAVNS 332
           +V  VS YFQ S     + ++    +     R   S +Q + + VS +FQ+S   +  N 
Sbjct: 183 KVPRVSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPN- 242

Query: 333 QQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYR 392
             +A + L    K VK  R        VNE    K R+   +      LS ++   + Y 
Sbjct: 243 --QAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 302

Query: 393 RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ 417
           RK+ D+TW PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q
Sbjct: 303 RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQ 342

BLAST of ClCG08G005050 vs. TAIR 10
Match: AT3G07930.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 59.3 bits (142), Expect = 1.1e-08
Identity = 69/220 (31.36%), Positives = 102/220 (46.36%), Query Frame = 0

Query: 213 EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR--- 272
           +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   
Sbjct: 123 DDDDDSVSDSHIERQECSEFHVEVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQA 182

Query: 273 RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQTE-QMVSRFFQKSAKQQAVNS 332
           +V  VS YFQ S     + ++    +     R   S +Q + + VS +FQ+S   +  N 
Sbjct: 183 KVPRVSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPN- 242

Query: 333 QQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYR 392
             +A + L    K VK  R        VNE    K R+   +      LS ++   + Y 
Sbjct: 243 --QAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 302

Query: 393 RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTT 414
           RK+ D+TW PP S   LLQ+DH +DPWRVLVICMLLN+T+
Sbjct: 303 RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTS 339

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038892490.1	3.1e-204	75.00	methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida]
XP_008460559.1	1.0e-199	74.76	PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]
XP_004142362.1	1.7e-194	71.89	methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus] >K...
KAG7022375.1	5.0e-194	70.93	Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperm...
XP_022931728.1	1.8e-191	69.67	methyl-CpG-binding domain protein 4-like protein [Cucurbita moschata]

Match Name	E-value	Identity	Description
Q0IGK1	1.7e-48	41.25	Methyl-CpG-binding domain protein 4-like protein OS=Arabidopsis thaliana OX=3702...
O95243	1.1e-26	38.57	Methyl-CpG-binding domain protein 4 OS=Homo sapiens OX=9606 GN=MBD4 PE=1 SV=1
Q9Z2D7	4.2e-26	38.57	Methyl-CpG-binding domain protein 4 OS=Mus musculus OX=10090 GN=Mbd4 PE=1 SV=1
Q7LX22	3.3e-07	32.99	Thymine/uracil-DNA glycosylase OS=Pyrobaculum aerophilum (strain ATCC 51768 / IM...
Q9YDP0	1.3e-06	30.11	Thymine-DNA glycosylase OS=Aeropyrum pernix (strain ATCC 700893 / DSM 11879 / JC...

Match Name	E-value	Identity	Description
A0A1S3CCU6	5.0e-200	74.76	methyl-CpG-binding domain protein 4-like protein OS=Cucumis melo OX=3656 GN=LOC1...
A0A6J1EZJ4	8.6e-192	69.67	methyl-CpG-binding domain protein 4-like protein OS=Cucurbita moschata OX=3662 G...
A0A5D3CU57	1.3e-187	72.99	Methyl-CpG-binding domain protein 4-like protein OS=Cucumis melo var. makuwa OX=...
A0A0A0KRW9	1.3e-171	69.49	ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G630730 PE=4...
A0A6J1HWM5	7.1e-170	70.86	methyl-CpG-binding domain protein 4-like protein isoform X1 OS=Cucurbita maxima ...

Match Name	E-value	Identity	Description
AT3G07930.3	1.2e-49	41.25	DNA glycosylase superfamily protein
AT3G07930.2	5.6e-10	31.84	DNA glycosylase superfamily protein
AT3G07930.1	1.1e-08	31.36	DNA glycosylase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	GENE3D	1.10.340.30	Hypothetical protein; domain 2	coord: 370..520; e-value: 2.4E-46; score: 159.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 206..267
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 342..356
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 318..333
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 206..222
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 244..261
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 318..360
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 284..306
None	No IPR available	PANTHER	PTHR15074:SF0	METHYL-CPG-BINDING DOMAIN PROTEIN 4-RELATED	coord: 188..518
IPR045138	Methyl-CpG binding protein MeCP2/MBD4	PANTHER	PTHR15074	METHYL-CPG-BINDING PROTEIN	coord: 188..518
IPR011257	DNA glycosylase	SUPERFAMILY	48150	DNA-glycosylase	coord: 386..515
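
Note: the `coord:` values above give the start and end residue of each match on the protein. The following is a minimal Python sketch for turning them into span lengths, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The helper names `parse_coord` and `span_length` are hypothetical.

```python
def parse_coord(text: str) -> tuple:
    """Parse a string such as 'coord: 370..520' into (start, end) integers."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)


def span_length(text: str) -> int:
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    start, end = parse_coord(text)
    return end - start + 1


# Coordinates copied from the InterPro rows above.
for label, coord in [
    ("GENE3D 1.10.340.30", "coord: 370..520"),
    ("PANTHER PTHR15074", "coord: 188..518"),
    ("SUPERFAMILY 48150", "coord: 386..515"),
]:
    print(label, span_length(coord))
```

Under that assumption, the SUPERFAMILY DNA-glycosylase match (386..515) lies entirely within the GENE3D match (370..520).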

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
ClCG08G005050.1	ClCG08G005050.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006284	base-excision repair
biological_process	GO:0006281	DNA repair
molecular_function	GO:0003824	catalytic activity
molecular_function	GO:0003677	DNA binding