Clc01G19750 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G19750
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: DNA-3-methyladenine glycosylase 1-like
Location: ClcChr01: 31901903 .. 31902866 (+)
RNA-Seq Expression: Clc01G19750
Synteny: Clc01G19750
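The location string in the overview can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SeqID: start .. end (strand)' into a dictionary."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends are counted.
        "length": end - start + 1,
    }

loc = parse_location("ClcChr01: 31901903 .. 31902866 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 964 bp on the plus strand of ClcChr01.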
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCAAAAGAATCTCCCGGAAGTTCCTCTTACAGTCCGAGTCACAAATCGACGCCGATCCTCTTCCTCCGTCGTCTTCCTCCAAGATTCCGTTCCGATCTACAAAAGTACGGAAGATTTCCTCCAAACAAGAACCGGCCAAACCACAAATTACAACTTCCGGCGGAAATGACCCGACACGAGCATTTCCGAACCTGGCCGGTACCGTCAAATCATTATCGTCTTCGGATGAAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCTCTCCTGATAAGCATATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCGTTTCTAGCACTAGCGAAGAGCATCCTCTACCAACAGCTCGCCACGAAGGCCGCCGAATCGATCTACAATCGTTTTGCCTCGCTATGCGGCGGCGAGGCGGCGGTGGTGCCGGACACCGTGCTCGGACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTTGCGACCAAATTCATAGATGGGATTTTATCAAATTCATCAATTCTAGAGATGGACGACGAGAGTCTGTTGGAGGCCTTGACGGCAGTGAAGGGAATCGGCGTCTGGTCGGTGCATATGTTCATGATATTTACTCTGCACCGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTGAGAAAAGGGGTGCAGAGATTGTACGGATTGAAGGAACTGCCGAAGCCGGTGGAGATGGGGAAACTGTGTGAGAAATGGAAGCCTTACAGGTCGATGGGGGCTTGGTGTATGTGGAGGTTAATGGAAATGAAGGGGTAAGTGTTGGGTTGTGCACTGTAATTTTCAATGAATCGTTTAACTGTTTGGGTTAACTTGTATTTGTGTAATTTTAAGAACTTGTTCTTGGACCATGGCTGGATCTCACCTTGCAATTGCAATTGCTGACTAA

mRNA sequence

ATGGCCAAAAGAATCTCCCGGAAGTTCCTCTTACAGTCCGAGTCACAAATCGACGCCGATCCTCTTCCTCCGTCGTCTTCCTCCAAGATTCCGTTCCGATCTACAAAAGTACGGAAGATTTCCTCCAAACAAGAACCGGCCAAACCACAAATTACAACTTCCGGCGGAAATGACCCGACACGAGCATTTCCGAACCTGGCCGGTACCGTCAAATCATTATCGTCTTCGGATGAAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCTCTCCTGATAAGCATATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCGTTTCTAGCACTAGCGAAGAGCATCCTCTACCAACAGCTCGCCACGAAGGCCGCCGAATCGATCTACAATCGTTTTGCCTCGCTATGCGGCGGCGAGGCGGCGGTGGTGCCGGACACCGTGCTCGGACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTTGCGACCAAATTCATAGATGGGATTTTATCAAATTCATCAATTCTAGAGATGGACGACGAGAGTCTGTTGGAGGCCTTGACGGCAGTGAAGGGAATCGGCGTCTGGTCGGTGCATATGTTCATGATATTTACTCTGCACCGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTGAGAAAAGGGGTGCAGAGATTGTACGGATTGAAGGAACTGCCGAAGCCGGTGGAGATGGGGAAACTGTGTGAGAAATGGAAGCCTTACAGGTCGATGGGGGCTTGGTGTATGTGGAGGTTAATGGAAATGAAGGGAACTTGTTCTTGGACCATGGCTGGATCTCACCTTGCAATTGCAATTGCTGACTAA
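The gene sequence and the mRNA sequence above appear to differ by one contiguous segment near the 3' end, which would be the intron. The sketch below shows one way to locate such a segment, assuming exactly one intron and preferring a canonical GT...AG boundary when the split point is ambiguous. The function name and the short excerpts (copied from around the junction of the two sequences) are for illustration only.

```python
def find_single_intron(genomic, spliced):
    """Return 0-based, half-open (start, end) of the single segment that is
    present in `genomic` but absent from `spliced`."""
    gap = len(genomic) - len(spliced)
    if gap <= 0:
        raise ValueError("genomic sequence must be longer than spliced sequence")
    # Every split point k where removing genomic[k:k+gap] yields `spliced`.
    # Quadratic in sequence length, which is fine for gene-sized inputs.
    candidates = [
        k for k in range(len(spliced) + 1)
        if genomic[:k] == spliced[:k] and genomic[k + gap:] == spliced[k:]
    ]
    if not candidates:
        raise ValueError("sequences differ by more than one contiguous segment")
    # Shared bases at the junction make the boundary ambiguous; prefer GT...AG.
    for k in candidates:
        intron = genomic[k:k + gap]
        if intron.startswith("GT") and intron.endswith("AG"):
            return k, k + gap
    return candidates[0], candidates[0] + gap

# Excerpts around the junction, taken from the gene and mRNA sequences above.
gene_excerpt = (
    "GAAATGAAGGGGTAAGTGTTGGGTTGTGCACTGTAATTTTCAATGAATCGTTTAACTGTTTGGG"
    "TTAACTTGTATTTGTGTAATTTTAAGAACTTGTTCTTGG"
)
mrna_excerpt = "GAAATGAAGGGAACTTGTTCTTGG"

start, end = find_single_intron(gene_excerpt, mrna_excerpt)
print(start, end, gene_excerpt[start:end])
```

Running the same function on the full gene and mRNA sequences would give the intron position relative to the start of the gene.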

Coding sequence (CDS)

ATGGCCAAAAGAATCTCCCGGAAGTTCCTCTTACAGTCCGAGTCACAAATCGACGCCGATCCTCTTCCTCCGTCGTCTTCCTCCAAGATTCCGTTCCGATCTACAAAAGTACGGAAGATTTCCTCCAAACAAGAACCGGCCAAACCACAAATTACAACTTCCGGCGGAAATGACCCGACACGAGCATTTCCGAACCTGGCCGGTACCGTCAAATCATTATCGTCTTCGGATGAAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCTCTCCTGATAAGCATATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCGTTTCTAGCACTAGCGAAGAGCATCCTCTACCAACAGCTCGCCACGAAGGCCGCCGAATCGATCTACAATCGTTTTGCCTCGCTATGCGGCGGCGAGGCGGCGGTGGTGCCGGACACCGTGCTCGGACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTTGCGACCAAATTCATAGATGGGATTTTATCAAATTCATCAATTCTAGAGATGGACGACGAGAGTCTGTTGGAGGCCTTGACGGCAGTGAAGGGAATCGGCGTCTGGTCGGTGCATATGTTCATGATATTTACTCTGCACCGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTGAGAAAAGGGGTGCAGAGATTGTACGGATTGAAGGAACTGCCGAAGCCGGTGGAGATGGGGAAACTGTGTGAGAAATGGAAGCCTTACAGGTCGATGGGGGCTTGGTGTATGTGGAGGTTAATGGAAATGAAGGGAACTTGTTCTTGGACCATGGCTGGATCTCACCTTGCAATTGCAATTGCTGACTAA

Protein sequence

MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGTCSWTMAGSHLAIAIAD
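The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the first codons of the CDS can be translated as follows; `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt of the CDS above.
cds_start = "ATGGCCAAAAGAATCTCCCGGAAGTTCCTCTTACAG"
print(translate(cds_start))  # MAKRISRKFLLQ
```

The result matches the first twelve residues of the protein sequence listed above.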
Homology
BLAST of Clc01G19750 vs. NCBI nr
Match: XP_038881017.1 (DNA-3-methyladenine glycosylase 1-like [Benincasa hispida])

HSP 1 Score: 497.3 bits (1279), Expect = 9.2e-137
Identity = 251/277 (90.61%), Positives = 263/277 (94.95%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MAKRI RKFL+QS+S+IDADPLPPSSSSKIPF STKVRKISSKQEPAKPQI+TSGGNDPT
Sbjct: 1   MAKRIRRKFLVQSDSRIDADPLPPSSSSKIPFPSTKVRKISSKQEPAKPQISTSGGNDPT 60

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           RAF NLA  +KSLSSSDEI TAIDHLRRSDPLLISIL+SCESPNFKSNPPFLAL KSILY
Sbjct: 61  RAFQNLASPLKSLSSSDEIFTAIDHLRRSDPLLISILESCESPNFKSNPPFLALTKSILY 120

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGGE +V+PD VLGLSPQQLRVIGVSGRKASYLHDLATKFI+
Sbjct: 121 QQLATKAAESIYNRFASLCGGEVSVLPDIVLGLSPQQLRVIGVSGRKASYLHDLATKFIE 180

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNS ILEMDDE+LLEALTAVKGIG+WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GNLSNSMILEMDDETLLEALTAVKGIGMWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG 278
           GLKELPKP EM KLCEKWKPYRSMGAW MWRLME+KG
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSMGAWYMWRLMEVKG 277
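The identity figure reported for an HSP is the number of identical aligned positions divided by the alignment length. The sketch below recomputes it for the first 60-column block of the Benincasa hispida alignment above and for the reported totals; the helper names are illustrative, and gap characters are assumed to be written as `-`.

```python
def count_identities(query, subject):
    """Return (identical_positions, alignment_length) for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = sum(q == s and q != "-" for q, s in zip(query, subject))
    return identical, len(query)

def percent(numerator, denominator):
    """Format a ratio as a percentage with two decimals, as in the report."""
    return f"{100 * numerator / denominator:.2f}"

# First alignment block (columns 1-60) of the XP_038881017.1 hit.
query = "MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT"
sbjct = "MAKRIRRKFLVQSDSRIDADPLPPSSSSKIPFPSTKVRKISSKQEPAKPQISTSGGNDPT"

identical, length = count_identities(query, sbjct)
print(identical, length)      # 54 60
print(percent(251, 277))      # 90.61, the identity reported for the whole HSP
```

Summing the per-block counts over all blocks of an HSP should reproduce the numerator on its Identity line.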

BLAST of Clc01G19750 vs. NCBI nr
Match: XP_008440714.1 (PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Cucumis melo] >KAA0036227.1 putative DNA-3-methyladenine glycosylase 2 [Cucumis melo var. makuwa] >TYK12621.1 putative DNA-3-methyladenine glycosylase 2 [Cucumis melo var. makuwa])

HSP 1 Score: 468.0 bits (1203), Expect = 6.0e-128
Identity = 239/277 (86.28%), Positives = 250/277 (90.25%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MAKRI RKFL QSES   A PL PSSSSKIPFRSTKVRKISS QEPAKPQ +   G +PT
Sbjct: 1   MAKRIRRKFLFQSESPTGAVPLSPSSSSKIPFRSTKVRKISSNQEPAKPQFSAPDGYNPT 60

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           R FPNLA  VKSLSS DEI TAI+HLRRSDPLLIS+LDSCESP+FKSNPPFLAL KSILY
Sbjct: 61  RTFPNLADPVKSLSSLDEISTAINHLRRSDPLLISLLDSCESPHFKSNPPFLALTKSILY 120

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGGEA+V+PDTVLGLSPQQLRV+GVSGRKASYLHDLATKFI+
Sbjct: 121 QQLATKAAESIYNRFASLCGGEASVLPDTVLGLSPQQLRVVGVSGRKASYLHDLATKFIE 180

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNS ILEMDDE+LL  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GNLSNSLILEMDDETLLGELTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG 278
           GLKELPKP EM KLCEKWKPYRS+GAW MWRLME KG
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLMEEKG 277

BLAST of Clc01G19750 vs. NCBI nr
Match: XP_004143510.1 (DNA-3-methyladenine glycosylase 1 [Cucumis sativus] >KGN48831.1 hypothetical protein Csa_003752 [Cucumis sativus])

HSP 1 Score: 463.8 bits (1192), Expect = 1.1e-126
Identity = 236/276 (85.51%), Positives = 250/276 (90.58%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MAKRI RK L Q ES  DA PL PS+SSKIPF STKVRKISS QEP KPQI+  GG +PT
Sbjct: 1   MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPT 60

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           R FPNLA  VKSLSSSD+I TAI+HLRRSDPLLIS+LDSCE+PNFKSNPPFLAL KSILY
Sbjct: 61  RIFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKSNPPFLALTKSILY 120

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAE+IYNRFASLCGGEAAV+PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFI+
Sbjct: 121 QQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIE 180

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNS ILEMDDE+LL ALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GSLSNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK 277
           GLKELPKP EM KLCEKWKPYRS+GAW MWRL++ K
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK 276

BLAST of Clc01G19750 vs. NCBI nr
Match: XP_022978525.1 (DNA-3-methyladenine glycosylase 1-like [Cucurbita maxima])

HSP 1 Score: 463.8 bits (1192), Expect = 1.1e-126
Identity = 234/277 (84.48%), Positives = 249/277 (89.89%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MA+R  RK LLQSESQ +ADP      SKI FR+T++RKISS ++P KPQI+T GG D T
Sbjct: 35  MARRTRRKLLLQSESQTEADP-----PSKISFRTTEIRKISSTRKPDKPQISTDGGGDRT 94

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           RAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSCESPNFKSNPPFLA+ KSILY
Sbjct: 95  RAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLAITKSILY 154

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGGEAAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKF++
Sbjct: 155 QQLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVE 214

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNSSILEMDDE+LL ALT VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 215 GTLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 274

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG 278
           GLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMKG
Sbjct: 275 GLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMKG 306

BLAST of Clc01G19750 vs. NCBI nr
Match: XP_022949777.1 (DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata])

HSP 1 Score: 461.8 bits (1187), Expect = 4.3e-126
Identity = 236/276 (85.51%), Positives = 247/276 (89.49%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MA+R  RK LLQSESQ DADP      S I FR+TK+RKISS Q+  KPQI+T GG D T
Sbjct: 35  MARRTRRKLLLQSESQTDADP-----PSNISFRTTKIRKISSTQKSDKPQISTPGGGDRT 94

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           RAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSCESPNFKSNPPFLAL KSILY
Sbjct: 95  RAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLALTKSILY 154

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGG+AAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKFI+
Sbjct: 155 QQLATKAAESIYNRFASLCGGDAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFIE 214

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNSSILEMDDE+LL ALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 215 GSLSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 274

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK 277
           GLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMK
Sbjct: 275 GLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMK 305

BLAST of Clc01G19750 vs. ExPASy Swiss-Prot
Match: Q92383 (DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag1 PE=1 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 4.9e-24
Identity = 57/173 (32.95%), Positives = 97/173 (56.07%), Query Frame = 0

Query: 104 NFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGV 163
           + +   P+  L +++  QQL +KAA +I+NRF S+        P+ +  +  + +R  G 
Sbjct: 44  SMEKKEPYEELIRAVASQQLHSKAANAIFNRFKSISNNGQFPTPEEIRDMDFEIMRACGF 103

Query: 164 SGRKASYLHDLATKFIDGILSNSSILE-MDDESLLEALTAVKGIGVWSVHMFMIFTLHRP 223
           S RK   L  +A   I G++      E + +E L+E LT +KGIG W+V M +IF+L+R 
Sbjct: 104 SARKIDSLKSIAEATISGLIPTKEEAERLSNEELIERLTQIKGIGRWTVEMLLIFSLNRD 163

Query: 224 DVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEM 276
           DV+P  DL +R G + L+ L ++P  + + K  E   P+R+  AW +W+  ++
Sbjct: 164 DVMPADDLSIRNGYRYLHRLPKIPTKMYVLKHSEICAPFRTAAAWYLWKTSKL 216

BLAST of Clc01G19750 vs. ExPASy Swiss-Prot
Match: O94468 (Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag2 PE=1 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.1e-20
Identity = 53/185 (28.65%), Positives = 99/185 (53.51%), Query Frame = 0

Query: 93  LISILDSCESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCG-GEAAVVPDTVL 152
           L+  +  C       + P+  + ++I  Q+L+  A  SI N+F + C   +    P  ++
Sbjct: 24  LVKKVGPCTLTPHPEHAPYEGIIRAITSQKLSDAATNSIINKFCTQCSDNDEFPTPKQIM 83

Query: 153 GLSPQQLRVIGVSGRKASYLHDLATKFID-GILSNSSILEMDDESLLEALTAVKGIGVWS 212
               + L   G S  K+  +H +A   ++  I S S I +M +E L+E+L+ +KG+  W+
Sbjct: 84  ETDVETLHECGFSKLKSQEIHIVAEAALNKQIPSKSEIEKMSEEELMESLSKIKGVKRWT 143

Query: 213 VHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMW 272
           + M+ IFTL R D++P  D  ++   +  +GL   P+  E+ KL +  KPYR++ AW +W
Sbjct: 144 IEMYSIFTLGRLDIMPADDSTLKNEAKEFFGLSSKPQTEEVEKLTKPCKPYRTIAAWYLW 203

Query: 273 RLMEM 276
           ++ ++
Sbjct: 204 QIPKL 208

BLAST of Clc01G19750 vs. ExPASy Swiss-Prot
Match: O31544 (Putative DNA-3-methyladenine glycosylase YfjP OS=Bacillus subtilis (strain 168) OX=224308 GN=yfjP PE=3 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 8.1e-19
Identity = 64/255 (25.10%), Positives = 119/255 (46.67%), Query Frame = 0

Query: 29  KIPFRSTK----VRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAID 88
           ++P R+      + K+ +     +P+   SG  D       +    +     + +   +D
Sbjct: 37  RVPIRNQAGDVCIVKVQALGHAGEPEFLVSGETDQGEMMKEIK---RIFQWENHLQHVLD 96

Query: 89  HLRRSDPLLISILDSCESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAA 148
           H  ++  L     +   +P       +  + K I++QQL    A ++  RF    G +  
Sbjct: 97  HFSKTS-LSAIFEEHAGTPLVLDYSVYNCMMKCIIHQQLNLSFAYTLTERFVHAFGEQKD 156

Query: 149 VV-----PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEA 208
            V     P+T+  L  Q LR +  S RKA Y  D +    +G LS S +  M DE +++ 
Sbjct: 157 GVWCYPKPETIAELDYQDLRDLQFSMRKAEYTIDTSRMIAEGTLSLSELPHMADEDIMKK 216

Query: 209 LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWK 268
           L  ++GIG W+V   ++F L RP++ P+ D+G++  ++R + L + P    M  + ++W+
Sbjct: 217 LIKIRGIGPWTVQNVLMFGLGRPNLFPLADIGLQNAIKRHFQLDDKPAKDVMLAMSKEWE 276

Query: 269 PYRSMGAWCMWRLME 275
           PY S  +  +WR +E
Sbjct: 277 PYLSYASLYLWRSIE 287

BLAST of Clc01G19750 vs. ExPASy Swiss-Prot
Match: P22134 (DNA-3-methyladenine glycosylase OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=MAG1 PE=1 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 4.6e-14
Identity = 73/232 (31.47%), Positives = 104/232 (44.83%), Query Frame = 0

Query: 82  AIDHLRRSDPLLISILDSCESPNF--KSNPP------FLALAKSILYQQLATKAAESIYN 141
           A +H+   DP L  IL + E   +  ++  P      F+ LA +IL QQ++ +AAESI  
Sbjct: 47  ACEHILEKDPSLFPILKNNEFTLYLKETQVPNTLEDYFIRLASTILSQQISGQAAESIKA 106

Query: 142 RFASLCGGE----AAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSIL 201
           R  SL GG       +  D        ++   G+S RK  YL  LA  F +       + 
Sbjct: 107 RVVSLYGGAFPDYKILFEDFKDPAKCAEIAKCGLSKRKMIYLESLAVYFTEKYKDIEKLF 166

Query: 202 --EMDDESLLEAL-TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-EL 261
             + +DE ++E+L T VKGIG WS  MF+I  L R DV    DLG+ +G  +    K EL
Sbjct: 167 GQKDNDEEVIESLVTNVKGIGPWSAKMFLISGLKRMDVFAPEDLGIARGFSKYLSDKPEL 226

Query: 262 PKPVE-------------------------MGKLCEKWKPYRSMGAWCMWRL 273
            K +                          M K  E + PYRS+  + +WRL
Sbjct: 227 EKELMRERKVVKKSKIKHKKYNWKIYDDDIMEKCSETFSPYRSVFMFILWRL 278

BLAST of Clc01G19750 vs. ExPASy TrEMBL
Match: A0A5A7T3R0 (Putative DNA-3-methyladenine glycosylase 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002080 PE=4 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 2.9e-128
Identity = 239/277 (86.28%), Positives = 250/277 (90.25%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MAKRI RKFL QSES   A PL PSSSSKIPFRSTKVRKISS QEPAKPQ +   G +PT
Sbjct: 1   MAKRIRRKFLFQSESPTGAVPLSPSSSSKIPFRSTKVRKISSNQEPAKPQFSAPDGYNPT 60

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           R FPNLA  VKSLSS DEI TAI+HLRRSDPLLIS+LDSCESP+FKSNPPFLAL KSILY
Sbjct: 61  RTFPNLADPVKSLSSLDEISTAINHLRRSDPLLISLLDSCESPHFKSNPPFLALTKSILY 120

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGGEA+V+PDTVLGLSPQQLRV+GVSGRKASYLHDLATKFI+
Sbjct: 121 QQLATKAAESIYNRFASLCGGEASVLPDTVLGLSPQQLRVVGVSGRKASYLHDLATKFIE 180

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNS ILEMDDE+LL  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GNLSNSLILEMDDETLLGELTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG 278
           GLKELPKP EM KLCEKWKPYRS+GAW MWRLME KG
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLMEEKG 277

BLAST of Clc01G19750 vs. ExPASy TrEMBL
Match: A0A1S3B2D5 (probable DNA-3-methyladenine glycosylase 2 OS=Cucumis melo OX=3656 GN=LOC103485049 PE=4 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 2.9e-128
Identity = 239/277 (86.28%), Positives = 250/277 (90.25%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MAKRI RKFL QSES   A PL PSSSSKIPFRSTKVRKISS QEPAKPQ +   G +PT
Sbjct: 1   MAKRIRRKFLFQSESPTGAVPLSPSSSSKIPFRSTKVRKISSNQEPAKPQFSAPDGYNPT 60

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           R FPNLA  VKSLSS DEI TAI+HLRRSDPLLIS+LDSCESP+FKSNPPFLAL KSILY
Sbjct: 61  RTFPNLADPVKSLSSLDEISTAINHLRRSDPLLISLLDSCESPHFKSNPPFLALTKSILY 120

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGGEA+V+PDTVLGLSPQQLRV+GVSGRKASYLHDLATKFI+
Sbjct: 121 QQLATKAAESIYNRFASLCGGEASVLPDTVLGLSPQQLRVVGVSGRKASYLHDLATKFIE 180

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNS ILEMDDE+LL  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GNLSNSLILEMDDETLLGELTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG 278
           GLKELPKP EM KLCEKWKPYRS+GAW MWRLME KG
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLMEEKG 277

BLAST of Clc01G19750 vs. ExPASy TrEMBL
Match: A0A6J1IQD1 (DNA-3-methyladenine glycosylase 1-like OS=Cucurbita maxima OX=3661 GN=LOC111478481 PE=4 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 5.5e-127
Identity = 234/277 (84.48%), Positives = 249/277 (89.89%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MA+R  RK LLQSESQ +ADP      SKI FR+T++RKISS ++P KPQI+T GG D T
Sbjct: 35  MARRTRRKLLLQSESQTEADP-----PSKISFRTTEIRKISSTRKPDKPQISTDGGGDRT 94

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           RAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSCESPNFKSNPPFLA+ KSILY
Sbjct: 95  RAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLAITKSILY 154

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGGEAAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKF++
Sbjct: 155 QQLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVE 214

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNSSILEMDDE+LL ALT VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 215 GTLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 274

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG 278
           GLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMKG
Sbjct: 275 GLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMKG 306

BLAST of Clc01G19750 vs. ExPASy TrEMBL
Match: A0A0A0KM62 (ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G502690 PE=4 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 5.5e-127
Identity = 236/276 (85.51%), Positives = 250/276 (90.58%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MAKRI RK L Q ES  DA PL PS+SSKIPF STKVRKISS QEP KPQI+  GG +PT
Sbjct: 1   MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPT 60

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           R FPNLA  VKSLSSSD+I TAI+HLRRSDPLLIS+LDSCE+PNFKSNPPFLAL KSILY
Sbjct: 61  RIFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKSNPPFLALTKSILY 120

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAE+IYNRFASLCGGEAAV+PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFI+
Sbjct: 121 QQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIE 180

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNS ILEMDDE+LL ALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GSLSNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK 277
           GLKELPKP EM KLCEKWKPYRS+GAW MWRL++ K
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK 276

BLAST of Clc01G19750 vs. ExPASy TrEMBL
Match: A0A6J1GD23 (DNA-3-methyladenine glycosylase 1-like OS=Cucurbita moschata OX=3662 GN=LOC111453068 PE=4 SV=1)

HSP 1 Score: 461.8 bits (1187), Expect = 2.1e-126
Identity = 236/276 (85.51%), Positives = 247/276 (89.49%), Query Frame = 0

Query: 1   MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPT 60
           MA+R  RK LLQSESQ DADP      S I FR+TK+RKISS Q+  KPQI+T GG D T
Sbjct: 35  MARRTRRKLLLQSESQTDADP-----PSNISFRTTKIRKISSTQKSDKPQISTPGGGDRT 94

Query: 61  RAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILY 120
           RAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSCESPNFKSNPPFLAL KSILY
Sbjct: 95  RAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLALTKSILY 154

Query: 121 QQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID 180
           QQLATKAAESIYNRFASLCGG+AAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKFI+
Sbjct: 155 QQLATKAAESIYNRFASLCGGDAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFIE 214

Query: 181 GILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240
           G LSNSSILEMDDE+LL ALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 215 GSLSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 274

Query: 241 GLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK 277
           GLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMK
Sbjct: 275 GLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMK 305

BLAST of Clc01G19750 vs. TAIR 10
Match: AT1G19480.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 273.5 bits (698), Expect = 2.0e-73
Identity = 151/275 (54.91%), Positives = 186/275 (67.64%), Query Frame = 0

Query: 25  SSSSKIPFRSTKVRKIS-----SKQEPAKPQITTSGGNDP---------------TRAFP 84
           S  SKIP R  K+RK++     S ++     I++S  N P                RA  
Sbjct: 65  SPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSPLATDGKSPGKGKLSHLRAIT 124

Query: 85  NLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-NPPFLALAKSILYQQL 144
                 + L+   E+ TAI +LR +DPLL +++D    P F+S   PFLAL ++ILYQQL
Sbjct: 125 VPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNILYQQL 184

Query: 145 ATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGIL 204
           A KA  SIY RF SLCGGE  VVP+TVL L+PQQLR IGVSGRKASYLHDLA K+ +GIL
Sbjct: 185 AMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQNGIL 244

Query: 205 SNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK 264
           S+S+IL MD++SL   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL 
Sbjct: 245 SDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLLYGLD 304

Query: 265 ELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT 279
           +LP+P +M + C KW+PYRS+G+W MWRL+E K T
Sbjct: 305 DLPRPSQMEQHCAKWRPYRSVGSWYMWRLIEAKST 339

BLAST of Clc01G19750 vs. TAIR 10
Match: AT1G19480.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 273.5 bits (698), Expect = 2.0e-73
Identity = 151/275 (54.91%), Positives = 186/275 (67.64%), Query Frame = 0

Query: 25  SSSSKIPFRSTKVRKIS-----SKQEPAKPQITTSGGNDP---------------TRAFP 84
           S  SKIP R  K+RK++     S ++     I++S  N P                RA  
Sbjct: 65  SPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSPLATDGKSPGKGKLSHLRAIT 124

Query: 85  NLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-NPPFLALAKSILYQQL 144
                 + L+   E+ TAI +LR +DPLL +++D    P F+S   PFLAL ++ILYQQL
Sbjct: 125 VPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNILYQQL 184

Query: 145 ATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGIL 204
           A KA  SIY RF SLCGGE  VVP+TVL L+PQQLR IGVSGRKASYLHDLA K+ +GIL
Sbjct: 185 AMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQNGIL 244

Query: 205 SNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK 264
           S+S+IL MD++SL   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL 
Sbjct: 245 SDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLLYGLD 304

Query: 265 ELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT 279
           +LP+P +M + C KW+PYRS+G+W MWRL+E K T
Sbjct: 305 DLPRPSQMEQHCAKWRPYRSVGSWYMWRLIEAKST 339

BLAST of Clc01G19750 vs. TAIR 10
Match: AT1G75230.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 270.4 bits (690), Expect = 1.7e-72
Identity = 145/270 (53.70%), Positives = 181/270 (67.04%), Query Frame = 0

Query: 25  SSSSKIPFRSTKVRKIS---------------SKQEPAKPQITTSGGNDPTRAFPNLAGT 84
           S  +KIP R  K+RK+S               S+    KP   +      T   P +   
Sbjct: 68  SPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMTTTKPATKSKLSQSRTVTVPRI--Q 127

Query: 85  VKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-NPPFLALAKSILYQQLATKAA 144
            +SL+   E+  A+ HLR  DPLL S++D    P F++   PFLAL +SILYQQLA KA 
Sbjct: 128 ARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFLALIRSILYQQLAAKAG 187

Query: 145 ESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSI 204
            SIY RF +LCGGE  VVP+ VL L+PQQLR IGVSGRKASYLHDLA K+ +GILS+S I
Sbjct: 188 NSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSGI 247

Query: 205 LEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP 264
           + MD++SL   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P
Sbjct: 248 VNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQMLNGMEDLPRP 307

Query: 265 VEMGKLCEKWKPYRSMGAWCMWRLMEMKGT 279
            +M +LCEKW+PYRS+ +W +WRL+E K T
Sbjct: 308 SKMEQLCEKWRPYRSVASWYLWRLIESKNT 335

BLAST of Clc01G19750 vs. TAIR 10
Match: AT1G75230.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 270.4 bits (690), Expect = 1.7e-72
Identity = 145/270 (53.70%), Positives = 181/270 (67.04%), Query Frame = 0

Query: 25  SSSSKIPFRSTKVRKIS---------------SKQEPAKPQITTSGGNDPTRAFPNLAGT 84
           S  +KIP R  K+RK+S               S+    KP   +      T   P +   
Sbjct: 68  SPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMTTTKPATKSKLSQSRTVTVPRI--Q 127

Query: 85  VKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-NPPFLALAKSILYQQLATKAA 144
            +SL+   E+  A+ HLR  DPLL S++D    P F++   PFLAL +SILYQQLA KA 
Sbjct: 128 ARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFLALIRSILYQQLAAKAG 187

Query: 145 ESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSI 204
            SIY RF +LCGGE  VVP+ VL L+PQQLR IGVSGRKASYLHDLA K+ +GILS+S I
Sbjct: 188 NSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSGI 247

Query: 205 LEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP 264
           + MD++SL   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P
Sbjct: 248 VNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQMLNGMEDLPRP 307

Query: 265 VEMGKLCEKWKPYRSMGAWCMWRLMEMKGT 279
            +M +LCEKW+PYRS+ +W +WRL+E K T
Sbjct: 308 SKMEQLCEKWRPYRSVASWYLWRLIESKNT 335

BLAST of Clc01G19750 vs. TAIR 10
Match: AT3G50880.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 268.9 bits (686), Expect = 5.0e-72
Identity = 153/275 (55.64%), Positives = 191/275 (69.45%), Query Frame = 0

Query: 11  LQSESQIDADPLPPS----SSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNL 70
           L  +S I A  L  S    SSS+I FR  K+RK+SS   P +  IT S            
Sbjct: 15  LPPDSLISAGNLTVSEVSGSSSRIRFRPRKIRKVSSDPSP-RIIITAS------------ 74

Query: 71  AGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNF--KSNPPFLALAKSILYQQLA 130
                 LS+   +  A+ HL+ SD LL +++ +   P     SN PFL+LA+SILYQQLA
Sbjct: 75  ----PPLSTKSTVDIALRHLQSSDELLGALITTHNDPPLFDSSNTPFLSLARSILYQQLA 134

Query: 131 TKAAESIYNRFASLC-GGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGIL 190
           TKAA+ IY+RF SL  GGEA VVP++V+ LS   LR IGVSGRKASYLHDLA K+ +G+L
Sbjct: 135 TKAAKCIYDRFISLFNGGEAGVVPESVISLSAVDLRKIGVSGRKASYLHDLADKYNNGVL 194

Query: 191 SNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK 250
           S+  IL+M DE L++ LT VKGIGVW+VHMFMIF+LHRPDVLPVGDLGVRKGV+ LYGLK
Sbjct: 195 SDELILKMSDEELIDRLTLVKGIGVWTVHMFMIFSLHRPDVLPVGDLGVRKGVKDLYGLK 254

Query: 251 ELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT 279
            LP P++M +LCEKW+PYRS+G+W MWRL+E + T
Sbjct: 255 NLPGPLQMEQLCEKWRPYRSVGSWYMWRLIESRKT 272

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038881017.1 | 9.2e-137 | 90.61 | DNA-3-methyladenine glycosylase 1-like [Benincasa hispida]
XP_008440714.1 | 6.0e-128 | 86.28 | PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Cucumis melo] >KAA0036227...
XP_004143510.1 | 1.1e-126 | 85.51 | DNA-3-methyladenine glycosylase 1 [Cucumis sativus] >KGN48831.1 hypothetical pro...
XP_022978525.1 | 1.1e-126 | 84.48 | DNA-3-methyladenine glycosylase 1-like [Cucurbita maxima]
XP_022949777.1 | 4.3e-126 | 85.51 | DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q92383 | 4.9e-24 | 32.95 | DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATC...
O94468 | 1.1e-20 | 28.65 | Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain...
O31544 | 8.1e-19 | 25.10 | Putative DNA-3-methyladenine glycosylase YfjP OS=Bacillus subtilis (strain 168) ...
P22134 | 4.6e-14 | 31.47 | DNA-3-methyladenine glycosylase OS=Saccharomyces cerevisiae (strain ATCC 204508 ...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7T3R0 | 2.9e-128 | 86.28 | Putative DNA-3-methyladenine glycosylase 2 OS=Cucumis melo var. makuwa OX=119469...
A0A1S3B2D5 | 2.9e-128 | 86.28 | probable DNA-3-methyladenine glycosylase 2 OS=Cucumis melo OX=3656 GN=LOC1034850...
A0A6J1IQD1 | 5.5e-127 | 84.48 | DNA-3-methyladenine glycosylase 1-like OS=Cucurbita maxima OX=3661 GN=LOC1114784...
A0A0A0KM62 | 5.5e-127 | 85.51 | ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G502690 PE=4...
A0A6J1GD23 | 2.1e-126 | 85.51 | DNA-3-methyladenine glycosylase 1-like OS=Cucurbita moschata OX=3662 GN=LOC11145...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G19480.1 | 2.0e-73 | 54.91 | DNA glycosylase superfamily protein
AT1G19480.2 | 2.0e-73 | 54.91 | DNA glycosylase superfamily protein
AT1G75230.2 | 1.7e-72 | 53.70 | DNA glycosylase superfamily protein
AT1G75230.1 | 1.7e-72 | 53.70 | DNA glycosylase superfamily protein
AT3G50880.1 | 5.0e-72 | 55.64 | DNA glycosylase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003265 | HhH-GPD domain | SMART | SM00478 | endo3end | coord: 119..274; e-value: 3.4E-16; score: 69.8
IPR003265 | HhH-GPD domain | PFAM | PF00730 | HhH-GPD | coord: 116..259; e-value: 1.4E-20; score: 73.7
IPR003265 | HhH-GPD domain | CDD | cd00056 | ENDO3c | coord: 111..272; e-value: 1.16602E-33; score: 118.883
None | No IPR available | GENE3D | 1.10.340.30 | Hypothetical protein; domain 2 | coord: 110..221; e-value: 3.1E-66; score: 224.5
None | No IPR available | GENE3D | 1.10.1670.40 |  | coord: 85..271; e-value: 3.1E-66; score: 224.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 9..37
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 45..60
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..62
None | No IPR available | PANTHER | PTHR43003:SF8 | HHH-GPD BASE EXCISION DNA REPAIR FAMILY PROTEIN | coord: 4..280
None | No IPR available | PANTHER | PTHR43003 | DNA-3-METHYLADENINE GLYCOSYLASE | coord: 4..280
IPR011257 | DNA glycosylase | SUPERFAMILY | 48150 | DNA-glycosylase | coord: 108..274
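The three IPR003265 (HhH-GPD domain) hits above come from different member databases and report slightly different boundaries. As an illustrative sketch, the coordinates can be parsed and intersected to find the region all three agree on. Coordinates are assumed to be 1-based, inclusive protein positions, and the helper names are hypothetical.

```python
import re

def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 119..274'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(m.group(1)), int(m.group(2))

def intersect(spans):
    """Region covered by every span, or None if they do not all overlap."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None

# IPR003265 hits as listed in the InterPro table.
hhh_gpd_hits = {
    "SMART SM00478": "coord: 119..274",
    "PFAM PF00730": "coord: 116..259",
    "CDD cd00056": "coord: 111..272",
}

for name, coord in hhh_gpd_hits.items():
    start, end = parse_coord(coord)
    # Assumption: inclusive coordinates, so length counts both ends.
    print(f"{name}: {start}..{end} ({end - start + 1} residues)")

core = intersect([parse_coord(c) for c in hhh_gpd_hits.values()])
print("region shared by all three hits:", core)  # (119, 259)
```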

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Clc01G19750.2 | Clc01G19750.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006285 base-excision repair, AP site formation
biological_process GO:0006307 DNA dealkylation involved in DNA repair
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus
cellular_component GO:0032993 protein-DNA complex
molecular_function GO:0032131 alkylated DNA binding
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0043916 DNA-7-methylguanine glycosylase activity
molecular_function GO:0003824 catalytic activity
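For downstream use, the GO assignments above can be grouped by category. This is a small sketch that assumes each row has the form "category accession term name" separated by single spaces, as in the table; `group_go_terms` is a hypothetical helper.

```python
from collections import defaultdict

GO_ROWS = """\
biological_process GO:0006285 base-excision repair, AP site formation
biological_process GO:0006307 DNA dealkylation involved in DNA repair
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus
cellular_component GO:0032993 protein-DNA complex
molecular_function GO:0032131 alkylated DNA binding
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0043916 DNA-7-methylguanine glycosylase activity
molecular_function GO:0003824 catalytic activity
"""

def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Term names contain spaces, so split only on the first two.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)

go_terms = group_go_terms(GO_ROWS)
for category, terms in go_terms.items():
    print(category, len(terms))
```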