CmaCh03G007470 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh03G007470
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: DNA-3-methyladenine glycosylase 1-like
Location: Cma_Chr03: 5902812 .. 5905501 (-)
RNA-Seq Expression: CmaCh03G007470
Synteny: CmaCh03G007470
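
The Location field packs the chromosome, start, end and strand into one string. A minimal sketch of how it might be parsed is given here; the function names and the regular expression are illustrative, and the coordinates are assumed to be 1-based and inclusive (the usual convention for this kind of record, but not stated on the page).

```python
import re

# Assumed layout: "<seqid>: <start> .. <end> (<strand>)".
# This pattern is an illustration, not a format defined by the database.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr03: 5902812 .. 5905501 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2690 bp on the minus strand of Cma_Chr03.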
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCCCCAACACAACCAAAAACCCCTCTTTCAATTTTTTTTAACTTTGCTCTGTGTTTTACGCTATGATGCAACGCTGTGGCAGCTACCAATGCTACTCGGCGGGAGAGTGTTCATGTGGGGCGTTCTATGGGCAGCAGGGCAGCTACTTCTCCACGCCCGCCTACAACAATTACTATGAATCTGAACATTATTCTTTTGACTCGTCCTCTCCGGTGGATTGTACGCTCTCTCTCGGAACACCCTCGACTCGTATGACGGAGTACGACGAGAAGCGCCGTGAGGAGCAGCACTCTGCTTCTAATTTTGCCTGGGATTTGTCTCGTACCAAACATGGTCACTCCTCCAAGACCAGTCGCCGTAGTGGCAATACTGGCGGTGATAAATCCAGAGCCAATGGAGACCAAATGTTCTCCCGCCACTGCGCTAATTGCGACACCACCACCACCCCCCTCTGGCGCAATGGCCCTAGCGGTCCTAAGGTAAACAACCCAATCCACGATCTGGGTGTGGATCAAAATTGAACCTTTTCAAACTTTTGGTCTGTGAGCTAATGGATTAATGTAAATTGATTTGCAGTCGTTGTGCAATGCGTGTGGGATTAGATACAAGAAGGAAGAGAGGAAAGCGGCGAGTTCAGGGCAGCAGGCTAATTCGATGTACAAGAATGAGGCCAGCTCATGGCTTCAGCATCATTCCCACAGCCAGAAAACCCCGAGATTCCCACATGGAATTACCAATGATCTGAATCCCGGCGTCGCCTTCCTCTCATGGAGCCTCAATGACACAGAGCAGCCTCAGCTGTACTACGATTTCACAAGTTGAAATTATTATCTCTCCCAAGATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTATTAATATTTACTTTCTTTTTTATACCCTGGAGGAAAAAAAAAAAAGAAAAAAAAAAAAAAGAAAAAAGAAAGCCGCCACTATTATAGTGTTGTTGAAGGAATTTTAAAAGATGGTGATTTCATGGAAATACTGTCTCCTCTAACGAATCCACAACATCCTTTTTTGTTTTGTTTTTTTTTTTCTTTTCCTTTCAGAATTTTGTTGGCTGCTTCTTATTTTTCCATTATACGTCAATGTTTGTGAAAATCGAAATATTAAAATTATATATATATTATGTATGTAATCTCATTTCTCTCTCTTACTTTCTCTTTCTTAGTTCCCCAGTGGAATAGTACGGTGCATTGCTTATCCTCCTCAATTAGATTAATAAAAAGTTATAAAAAATTAAGTTTTAATAACCCTATTATGTTAATCAAATCTTTAAGAATTTTGTTAATCAAAATCAATAAATAATCTGAGTTAATTCAATGGACCAAACTATAAGCTTGACTATCAACCAACTCAACCAAACCTATTAATCTAAAATTTTAATAAATTTAAAATATTTTTAGGCCATAAAAATGTTGGGTTAAAATTAGGGAGTAAATTAAAGAAGGTAAAATTTACATTATTATTTAAGCTCAACGTTGTGTCTATTGAAATTTAAAGTTGAATCAAAATGATTAAAAATTGTTATTTTTCGTGAACAAAATTGAAGTAAATTAAATCATTAAAAAGTAAATTGTATAATTACAGTCAAAGCTGACGAGTTTGACCGAGTTATATCATATAATAACCTGGAAATCAAATCGTGCACCTCAAGCTTCGAGTTGCTAACGGAGGAGATGATATCATCTCAGCCAATCACAACGCGACGTCTGTACTTCACTCCACCTGAATCTCTCTTCAAAACCCAGCTTCTACACACAAATCATTTACCACCGCCGATGGCCAGAAGAACCCGCCGGAAGTTACTCTTACAGTCCGAGTCTCAAACCGAGGCCGATCCACCGTCTAAGATTTCGTTCCGAACTACAGAAATACGGAAGATTTCTTCCACTCGAAAACCGGACAAACCACAGATATCAACTGATGGCGGAGGCGACCGGACTCGAGCATTCCCGAACCAGGATGGTCCTGTC
AAATCTTTATCGTCTTCGGATGTAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCCCTTCTGATAAGGCTATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCGTTTCTAGCAATAACAAAGAGCATCCTCTACCAGCAGCTCGCTACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCGTCGCTATGCGGCGGAGAGGCGGCAGTACTACCGGACGCCGTACTTGGACTCTCGCCTCAACAGCTGCGAGTAGTCGGAGTTTCGGGTAGAAAAGCAAGTTACCTTCATGACCTAGCGACCAAATTCGTAGAGGGCACTTTGTCGAATTCGTCGATTCTAGAGATGGACGACGAGACTCTACTGAGTGCGTTGACGGGGGTGAAGGGTATCGGCGTTTGGTCAGTGCACATGTTCATGATTTTTACTCTGCACCGGCCGGATGTGCTGCCGGTGGGGGATTTGGGCGTGAGAAAAGGGGTGCAGAGGTTGTACGGACTGAAAGAATTGCCAAAGCCAGTGGAGATGGAGAAACTTTGTGAAAAATGGAAGCCGTACAGGTCGATGGGGGCTTGGTATATGTGGAGGCTGATGGAAATGAAGGGAATCGCGAAGAATGATGGCGATTTGAAGAAGAACACGGCAAACGGCGGCGGCGGCGACGTTGTAATGTGA

mRNA sequence

CCCCCAACACAACCAAAAACCCCTCTTTCAATTTTTTTTAACTTTGCTCTGTGTTTTACGCTATGATGCAACGCTGTGGCAGCTACCAATGCTACTCGGCGGGAGAGTGTTCATGTGGGGCGTTCTATGGGCAGCAGGGCAGCTACTTCTCCACGCCCGCCTACAACAATTACTATGAATCTGAACATTATTCTTTTGACTCGTCCTCTCCGGTGGATTGTACGCTCTCTCTCGGAACACCCTCGACTCGTATGACGGAGTACGACGAGAAGCGCCGTGAGGAGCAGCACTCTGCTTCTAATTTTGCCTGGGATTTGTCTCGTACCAAACATGGTCACTCCTCCAAGACCAGTCGCCGTAGTGGCAATACTGGCGGTGATAAATCCAGAGCCAATGGAGACCAAATGTTCTCCCGCCACTGCGCTAATTGCGACACCACCACCACCCCCCTCTGGCGCAATGGCCCTAGCGGTCCTAAGTCGTTGTGCAATGCGTGTGGGATTAGATACAAGAAGGAAGAGAGGAAAGCGGCGAGTTCAGGGCAGCAGGCTAATTCGATGTACAAGAATGAGGCCAGCTCATGGCTTCAGCATCATTCCCACAGCCAGAAAACCCCGAGATTCCCACATGGAATTACCAATGATCTGAATCCCGGCGTCGCCTTCCTCTCATGGAGCCTCAATGACACAGAGCAGCCTCAGCTAATTTTGTTGGCTGCTTCTTATTTTTCCATTATACGTCAATTTCCCCAGTGGAATATCAAAGCTGACGAGTTTGACCGAGTTATATCATATAATAACCTGGAAATCAAATCGTGCACCTCAAGCTTCGAGTTGCTAACGGAGGAGATGATATCATCTCAGCCAATCACAACGCGACGTCTGTACTTCACTCCACCTGAATCTCTCTTCAAAACCCAGCTTCTACACACAAATCATTTACCACCGCCGATGGCCAGAAGAACCCGCCGGAAGTTACTCTTACAGTCCGAGTCTCAAACCGAGGCCGATCCACCGTCTAAGATTTCGTTCCGAACTACAGAAATACGGAAGATTTCTTCCACTCGAAAACCGGACAAACCACAGATATCAACTGATGGCGGAGGCGACCGGACTCGAGCATTCCCGAACCAGGATGGTCCTGTCAAATCTTTATCGTCTTCGGATGTAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCCCTTCTGATAAGGCTATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCGTTTCTAGCAATAACAAAGAGCATCCTCTACCAGCAGCTCGCTACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCGTCGCTATGCGGCGGAGAGGCGGCAGTACTACCGGACGCCGTACTTGGACTCTCGCCTCAACAGCTGCGAGTAGTCGGAGTTTCGGGTAGAAAAGCAAGTTACCTTCATGACCTAGCGACCAAATTCGTAGAGGGCACTTTGTCGAATTCGTCGATTCTAGAGATGGACGACGAGACTCTACTGAGTGCGTTGACGGGGGTGAAGGGTATCGGCGTTTGGTCAGTGCACATGTTCATGATTTTTACTCTGCACCGGCCGGATGTGCTGCCGGTGGGGGATTTGGGCGTGAGAAAAGGGGTGCAGAGGTTGTACGGACTGAAAGAATTGCCAAAGCCAGTGGAGATGGAGAAACTTTGTGAAAAATGGAAGCCGTACAGGTCGATGGGGGCTTGGTATATGTGGAGGCTGATGGAAATGAAGGGAATCGCGAAGAATGATGGCGATTTGAAGAAGAACACGGCAAACGGCGGCGGCGGCGACGTTGTAATGTGA

Coding sequence (CDS)

ATGATGCAACGCTGTGGCAGCTACCAATGCTACTCGGCGGGAGAGTGTTCATGTGGGGCGTTCTATGGGCAGCAGGGCAGCTACTTCTCCACGCCCGCCTACAACAATTACTATGAATCTGAACATTATTCTTTTGACTCGTCCTCTCCGGTGGATTGTACGCTCTCTCTCGGAACACCCTCGACTCGTATGACGGAGTACGACGAGAAGCGCCGTGAGGAGCAGCACTCTGCTTCTAATTTTGCCTGGGATTTGTCTCGTACCAAACATGGTCACTCCTCCAAGACCAGTCGCCGTAGTGGCAATACTGGCGGTGATAAATCCAGAGCCAATGGAGACCAAATGTTCTCCCGCCACTGCGCTAATTGCGACACCACCACCACCCCCCTCTGGCGCAATGGCCCTAGCGGTCCTAAGTCGTTGTGCAATGCGTGTGGGATTAGATACAAGAAGGAAGAGAGGAAAGCGGCGAGTTCAGGGCAGCAGGCTAATTCGATGTACAAGAATGAGGCCAGCTCATGGCTTCAGCATCATTCCCACAGCCAGAAAACCCCGAGATTCCCACATGGAATTACCAATGATCTGAATCCCGGCGTCGCCTTCCTCTCATGGAGCCTCAATGACACAGAGCAGCCTCAGCTAATTTTGTTGGCTGCTTCTTATTTTTCCATTATACGTCAATTTCCCCAGTGGAATATCAAAGCTGACGAGTTTGACCGAGTTATATCATATAATAACCTGGAAATCAAATCGTGCACCTCAAGCTTCGAGTTGCTAACGGAGGAGATGATATCATCTCAGCCAATCACAACGCGACGTCTGTACTTCACTCCACCTGAATCTCTCTTCAAAACCCAGCTTCTACACACAAATCATTTACCACCGCCGATGGCCAGAAGAACCCGCCGGAAGTTACTCTTACAGTCCGAGTCTCAAACCGAGGCCGATCCACCGTCTAAGATTTCGTTCCGAACTACAGAAATACGGAAGATTTCTTCCACTCGAAAACCGGACAAACCACAGATATCAACTGATGGCGGAGGCGACCGGACTCGAGCATTCCCGAACCAGGATGGTCCTGTCAAATCTTTATCGTCTTCGGATGTAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCCCTTCTGATAAGGCTATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCGTTTCTAGCAATAACAAAGAGCATCCTCTACCAGCAGCTCGCTACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCGTCGCTATGCGGCGGAGAGGCGGCAGTACTACCGGACGCCGTACTTGGACTCTCGCCTCAACAGCTGCGAGTAGTCGGAGTTTCGGGTAGAAAAGCAAGTTACCTTCATGACCTAGCGACCAAATTCGTAGAGGGCACTTTGTCGAATTCGTCGATTCTAGAGATGGACGACGAGACTCTACTGAGTGCGTTGACGGGGGTGAAGGGTATCGGCGTTTGGTCAGTGCACATGTTCATGATTTTTACTCTGCACCGGCCGGATGTGCTGCCGGTGGGGGATTTGGGCGTGAGAAAAGGGGTGCAGAGGTTGTACGGACTGAAAGAATTGCCAAAGCCAGTGGAGATGGAGAAACTTTGTGAAAAATGGAAGCCGTACAGGTCGATGGGGGCTTGGTATATGTGGAGGCTGATGGAAATGAAGGGAATCGCGAAGAATGATGGCGATTTGAAGAAGAACACGGCAAACGGCGGCGGCGGCGACGTTGTAATGTGA

Protein sequence

MMQRCGSYQCYSAGECSCGAFYGQQGSYFSTPAYNNYYESEHYSFDSSSPVDCTLSLGTPSTRMTEYDEKRREEQHSASNFAWDLSRTKHGHSSKTSRRSGNTGGDKSRANGDQMFSRHCANCDTTTTPLWRNGPSGPKSLCNACGIRYKKEERKAASSGQQANSMYKNEASSWLQHHSHSQKTPRFPHGITNDLNPGVAFLSWSLNDTEQPQLILLAASYFSIIRQFPQWNIKADEFDRVISYNNLEIKSCTSSFELLTEEMISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKISFRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMKGIAKNDGDLKKNTANGGGGDVVM
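
The protein sequence corresponds to the CDS read codon by codon. The sketch below shows that relationship on short fragments taken from the start and end of the CDS above; it assumes the standard nuclear genetic code (NCBI translation table 1), and the helper name `translate` is illustrative rather than part of any tool used by the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGATGCAACGCTGTGGCAGCTAC"))
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("GACGTTGTAATGTGA"))
```

The two fragments translate to MMQRCGSY and DVVM, matching the first and last residues of the protein sequence.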
Homology
BLAST of CmaCh03G007470 vs. ExPASy Swiss-Prot
Match: Q8LC79 (GATA transcription factor 18 OS=Arabidopsis thaliana OX=3702 GN=GATA18 PE=1 SV=2)

HSP 1 Score: 132.9 bits (333), Expect = 1.2e-29
Identity = 105/274 (38.32%), Positives = 131/274 (47.81%), Query Frame = 0

Query: 5   CGSYQCYSAGECSCGAFYGQQGSY---FSTPAYNNYYESEHYSFDSSSPVDCTLSLGTPS 64
           CG +  +S   C         GSY   FS      + ++      SSS VDCTLSLGTPS
Sbjct: 18  CGMFHHHSQSCCYNNNNNSNAGSYSMVFSMQNGGVFEQNGEDYHHSSSLVDCTLSLGTPS 77

Query: 65  TRMTEYDEKRREEQHSA-----SNFAWDLSRTKHGHS-------------SKTSRR-SGN 124
           TR+ E DEKRR    S      SNF WDL  TK+ +S             +K SR  SG 
Sbjct: 78  TRLCEEDEKRRRSTSSGASSCISNF-WDLIHTKNNNSKTAPYNNVPSFSANKPSRGCSGG 137

Query: 125 TGGDKSRANGDQMFSRHCANCDTTTTPLWRNGPSGPKSLCNACGIRYKKEERK-AASSGQ 184
            GG      GD + +R CANCDTT+TPLWRNGP GPKSLCNACGIR+KKEER+  A++G 
Sbjct: 138 GGGGGGGGGGDSLLARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFKKEERRTTAATGN 197

Query: 185 QA--------------NSMY----------KNEASSWLQHHSHSQKTPRFP--------- 216
                           NS Y           N  + W  HHS  +    +P         
Sbjct: 198 TVVGAAPVQTDQYGHHNSGYNNYHAATNNNNNNGTPWAHHHSTQRVPCNYPANEIRFMDD 257
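
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. The sketch below parses such a line and recomputes them, which can be handy for filtering hits; the function names are illustrative, and the pattern assumes the exact line layout shown in this report (it also tolerates the "Postives" spelling that appears in some exports).

```python
import re

# Pattern for the HSP summary line as laid out in this report (assumed layout).
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return identity count, positive count and alignment length."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, positives, _ = (int(g) for g in match.groups())
    return {"identities": identities, "positives": positives, "length": length}


def percent(count, length):
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100.0 * count / length, 2)


hsp = parse_hsp_summary(
    "Identity = 105/274 (38.32%), Positives = 131/274 (47.81%), Query Frame = 0"
)
print(percent(hsp["identities"], hsp["length"]))
print(percent(hsp["positives"], hsp["length"]))
```

For the GATA18 hit this reproduces the reported 38.32% identity and 47.81% positives over 274 aligned columns.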

BLAST of CmaCh03G007470 vs. ExPASy Swiss-Prot
Match: Q6QPM2 (GATA transcription factor 19 OS=Arabidopsis thaliana OX=3702 GN=GATA19 PE=1 SV=2)

HSP 1 Score: 120.9 bits (302), Expect = 4.7e-26
Identity = 83/224 (37.05%), Positives = 109/224 (48.66%), Query Frame = 0

Query: 34  YNNYYESEHYSFDSSSP---VDCTLSLGTPSTRMTEYDEKRREEQHSASNFAWDLSRTKH 93
           ++ ++  E+     SSP   VDCTLSLGTPSTR+   D++RR   H++    WD      
Sbjct: 3   FSMFFSPENDVSHHSSPYASVDCTLSLGTPSTRLCNEDDERRFSSHTSDTIGWDF----- 62

Query: 94  GHSSKTSRRSGNTGGDKSRANGDQMFSRHCANCDTTTTPLWRNGPSGPKSLCNACGIRYK 153
                   + G  GG      G  + +R CANCDTT+TPLWRNGP GPKSLCNACGIR+K
Sbjct: 63  ----LNGSKKGGGGG------GHNLLARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFK 122

Query: 154 KEERKAASSGQ-------------------QANSMYKNE---ASSWLQHHSH-SQKTPRF 213
           KEER+A+++                      AN  Y N    ASS   HH H +Q+ P +
Sbjct: 123 KEERRASTARNSTSGGGSTAAGVPTLDHQASANYYYNNNNQYASSSPWHHQHNTQRVPYY 182

Query: 214 ----------------PHGITNDLNPGVAFLSWSLNDTEQPQLI 216
                            H +T D      FLSW LN  ++  L+
Sbjct: 183 SPANNEYSYVDDVRVVDHDVTTD-----PFLSWRLNVADRTGLV 206

BLAST of CmaCh03G007470 vs. ExPASy Swiss-Prot
Match: Q92383 (DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag1 PE=1 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 3.4e-24
Identity = 62/213 (29.11%), Positives = 115/213 (53.99%), Query Frame = 0

Query: 361 VKSLSSSDVICTAIDHLRRSDPLLIRLLDSCE-SPNFKSNPPFLAITKSILYQQLATKAA 420
           V SL+ +++  + +D   +    L++L+ +   + + +   P+  + +++  QQL +KAA
Sbjct: 12  VTSLTKAEIHLSGLDENWKR---LVKLVGNYRPNRSMEKKEPYEELIRAVASQQLHSKAA 71

Query: 421 ESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSI 480
            +I+NRF S+        P+ +  +  + +R  G S RK   L  +A   + G +     
Sbjct: 72  NAIFNRFKSISNNGQFPTPEEIRDMDFEIMRACGFSARKIDSLKSIAEATISGLIPTKEE 131

Query: 481 LE-MDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPK 540
            E + +E L+  LT +KGIG W+V M +IF+L+R DV+P  DL +R G + L+ L ++P 
Sbjct: 132 AERLSNEELIERLTQIKGIGRWTVEMLLIFSLNRDDVMPADDLSIRNGYRYLHRLPKIPT 191

Query: 541 PVEMEKLCEKWKPYRSMGAWYMWRLMEMKGIAK 572
            + + K  E   P+R+  AWY+W+  ++    K
Sbjct: 192 KMYVLKHSEICAPFRTAAAWYLWKTSKLADYTK 221

BLAST of CmaCh03G007470 vs. ExPASy Swiss-Prot
Match: O94468 (Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag2 PE=1 SV=1)

HSP 1 Score: 106.7 bits (265), Expect = 9.2e-22
Identity = 54/185 (29.19%), Positives = 100/185 (54.05%), Query Frame = 0

Query: 384 LIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCG-GEAAVLPDAVL 443
           L++ +  C       + P+  I ++I  Q+L+  A  SI N+F + C   +    P  ++
Sbjct: 24  LVKKVGPCTLTPHPEHAPYEGIIRAITSQKLSDAATNSIINKFCTQCSDNDEFPTPKQIM 83

Query: 444 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTL-SNSSILEMDDETLLSALTGVKGIGVWS 503
               + L   G S  K+  +H +A   +   + S S I +M +E L+ +L+ +KG+  W+
Sbjct: 84  ETDVETLHECGFSKLKSQEIHIVAEAALNKQIPSKSEIEKMSEEELMESLSKIKGVKRWT 143

Query: 504 VHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMW 563
           + M+ IFTL R D++P  D  ++   +  +GL   P+  E+EKL +  KPYR++ AWY+W
Sbjct: 144 IEMYSIFTLGRLDIMPADDSTLKNEAKEFFGLSSKPQTEEVEKLTKPCKPYRTIAAWYLW 203

Query: 564 RLMEM 567
           ++ ++
Sbjct: 204 QIPKL 208

BLAST of CmaCh03G007470 vs. ExPASy Swiss-Prot
Match: Q9ZPX0 (GATA transcription factor 20 OS=Arabidopsis thaliana OX=3702 GN=GATA20 PE=2 SV=2)

HSP 1 Score: 104.4 bits (259), Expect = 4.6e-21
Identity = 70/168 (41.67%), Positives = 94/168 (55.95%), Query Frame = 0

Query: 35  NNYYESEHYSFDSSSPVDCTLSLGTPSTRMTEYDEKRREEQHSASNFAWDLSRTKHGHSS 94
           N++    + +F SS+ VDCTLSLGTPSTR+   D+  R    +++N + D     HG ++
Sbjct: 22  NHHNYDPYNNFSSSTSVDCTLSLGTPSTRL---DDHHRFSSANSNNISGDF--YIHGGNA 81

Query: 95  KTSRRSGNTGGDKSRANGDQMFSRHCANCDTTTTPLWRNGPSGPKSLCNACGIRYKKEER 154
           KTS  S   GG            R CA+CDTT+TPLWRNGP GPKSLCNACGIR+KKEER
Sbjct: 82  KTS--SYKKGGVA------HSLPRRCASCDTTSTPLWRNGPKGPKSLCNACGIRFKKEER 141

Query: 155 KA-------ASSGQQA-----NSMYKNEASSWLQHHSH-SQKTPRFPH 190
           +A       +  G  A      + Y    + +  HH H +  +P + H
Sbjct: 142 RATARNLTISGGGSSAAEVPVENSYNGGGNYYSHHHHHYASSSPSWAH 176

BLAST of CmaCh03G007470 vs. ExPASy TrEMBL
Match: A0A6J1IQD1 (DNA-3-methyladenine glycosylase 1-like OS=Cucurbita maxima OX=3661 GN=LOC111478481 PE=4 SV=1)

HSP 1 Score: 653.3 bits (1684), Expect = 9.8e-184
Identity = 328/328 (100.00%), Positives = 328/328 (100.00%), Query Frame = 0

Query: 263 MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 322
           MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS
Sbjct: 1   MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 60

Query: 323 FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 382
           FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP
Sbjct: 61  FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 120

Query: 383 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 442
           LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL
Sbjct: 121 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 180

Query: 443 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 502
           GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV
Sbjct: 181 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 240

Query: 503 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 562
           HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR
Sbjct: 241 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 300

Query: 563 LMEMKGIAKNDGDLKKNTANGGGGDVVM 591
           LMEMKGIAKNDGDLKKNTANGGGGDVVM
Sbjct: 301 LMEMKGIAKNDGDLKKNTANGGGGDVVM 328

BLAST of CmaCh03G007470 vs. ExPASy TrEMBL
Match: A0A6J1GD23 (DNA-3-methyladenine glycosylase 1-like OS=Cucurbita moschata OX=3662 GN=LOC111453068 PE=4 SV=1)

HSP 1 Score: 605.1 bits (1559), Expect = 3.0e-169
Identity = 303/323 (93.81%), Positives = 312/323 (96.59%), Query Frame = 0

Query: 263 MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 322
           MISSQPI TRRLY TPP+ LFKTQLL TNHLPPPMARRTRRKLLLQSESQT+ADPPS IS
Sbjct: 1   MISSQPIVTRRLYITPPDFLFKTQLLPTNHLPPPMARRTRRKLLLQSESQTDADPPSNIS 60

Query: 323 FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 382
           FRTT+IRKISST+K DKPQIST GGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP
Sbjct: 61  FRTTKIRKISSTQKSDKPQISTPGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 120

Query: 383 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 442
           LLIRLLDSCESPNFKSNPPFLA+TKSILYQQLATKAAESIYNRFASLCGG+AAVLPDAVL
Sbjct: 121 LLIRLLDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGDAAVLPDAVL 180

Query: 443 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 502
           GLSPQQLRVVGVSGRKASYLHDLATKF+EG+LSNSSILEMDDETLLSALT VKGIGVWSV
Sbjct: 181 GLSPQQLRVVGVSGRKASYLHDLATKFIEGSLSNSSILEMDDETLLSALTAVKGIGVWSV 240

Query: 503 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 562
           HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR
Sbjct: 241 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 300

Query: 563 LMEMKGIAKNDGDLKKNTANGGG 586
           LMEMK I K+DGDLK NTANGGG
Sbjct: 301 LMEMKEIVKDDGDLKMNTANGGG 323

BLAST of CmaCh03G007470 vs. ExPASy TrEMBL
Match: A0A5A7T3R0 (Putative DNA-3-methyladenine glycosylase 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002080 PE=4 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 1.5e-123
Identity = 233/293 (79.52%), Positives = 247/293 (84.30%), Query Frame = 0

Query: 297 MARRTRRKLLLQSESQTEADP-----PSKISFRTTEIRKISSTRKPDKPQISTDGGGDRT 356
           MA+R RRK L QSES T A P      SKI FR+T++RKISS ++P KPQ S   G + T
Sbjct: 1   MAKRIRRKFLFQSESPTGAVPLSPSSSSKIPFRSTKVRKISSNQEPAKPQFSAPDGYNPT 60

Query: 357 RAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLAITKSILY 416
           R FPN   PVKSLSS D I TAI+HLRRSDPLLI LLDSCESP+FKSNPPFLA+TKSILY
Sbjct: 61  RTFPNLADPVKSLSSLDEISTAINHLRRSDPLLISLLDSCESPHFKSNPPFLALTKSILY 120

Query: 417 QQLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVE 476
           QQLATKAAESIYNRFASLCGGEA+VLPD VLGLSPQQLRVVGVSGRKASYLHDLATKF+E
Sbjct: 121 QQLATKAAESIYNRFASLCGGEASVLPDTVLGLSPQQLRVVGVSGRKASYLHDLATKFIE 180

Query: 477 GTLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 536
           G LSNS ILEMDDETLL  LT VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GNLSNSLILEMDDETLLGELTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 537 GLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMKGIAKNDGDLKKNTANGG 585
           GLKELPKP EMEKLCEKWKPYRS+GAWYMWRLME KG+ K   DL  N  N G
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLMEEKGVVKKGSDLPDNMENRG 293

BLAST of CmaCh03G007470 vs. ExPASy TrEMBL
Match: A0A1S3B2D5 (probable DNA-3-methyladenine glycosylase 2 OS=Cucumis melo OX=3656 GN=LOC103485049 PE=4 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 1.5e-123
Identity = 233/293 (79.52%), Positives = 247/293 (84.30%), Query Frame = 0

Query: 297 MARRTRRKLLLQSESQTEADP-----PSKISFRTTEIRKISSTRKPDKPQISTDGGGDRT 356
           MA+R RRK L QSES T A P      SKI FR+T++RKISS ++P KPQ S   G + T
Sbjct: 1   MAKRIRRKFLFQSESPTGAVPLSPSSSSKIPFRSTKVRKISSNQEPAKPQFSAPDGYNPT 60

Query: 357 RAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLAITKSILY 416
           R FPN   PVKSLSS D I TAI+HLRRSDPLLI LLDSCESP+FKSNPPFLA+TKSILY
Sbjct: 61  RTFPNLADPVKSLSSLDEISTAINHLRRSDPLLISLLDSCESPHFKSNPPFLALTKSILY 120

Query: 417 QQLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVE 476
           QQLATKAAESIYNRFASLCGGEA+VLPD VLGLSPQQLRVVGVSGRKASYLHDLATKF+E
Sbjct: 121 QQLATKAAESIYNRFASLCGGEASVLPDTVLGLSPQQLRVVGVSGRKASYLHDLATKFIE 180

Query: 477 GTLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 536
           G LSNS ILEMDDETLL  LT VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GNLSNSLILEMDDETLLGELTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 537 GLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMKGIAKNDGDLKKNTANGG 585
           GLKELPKP EMEKLCEKWKPYRS+GAWYMWRLME KG+ K   DL  N  N G
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLMEEKGVVKKGSDLPDNMENRG 293

BLAST of CmaCh03G007470 vs. ExPASy TrEMBL
Match: A0A0A0KM62 (ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G502690 PE=4 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 8.1e-122
Identity = 228/284 (80.28%), Positives = 247/284 (86.97%), Query Frame = 0

Query: 297 MARRTRRKLLLQSESQTEADP-----PSKISFRTTEIRKISSTRKPDKPQISTDGGGDRT 356
           MA+R RRK L Q ES ++A P      SKI F +T++RKISS ++P KPQIS  GG + T
Sbjct: 1   MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPT 60

Query: 357 RAFPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLAITKSILY 416
           R FPN   PVKSLSSSD I TAI+HLRRSDPLLI LLDSCE+PNFKSNPPFLA+TKSILY
Sbjct: 61  RIFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKSNPPFLALTKSILY 120

Query: 417 QQLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVE 476
           QQLATKAAE+IYNRFASLCGGEAAVLPD VLGLSPQQLRV+GVSGRKASYLHDLATKF+E
Sbjct: 121 QQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIE 180

Query: 477 GTLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 536
           G+LSNS ILEMDDETLL ALT VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY
Sbjct: 181 GSLSNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLY 240

Query: 537 GLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMKGIAKNDGD 576
           GLKELPKP EMEKLCEKWKPYRS+GAWYMWRL++ K I KN  D
Sbjct: 241 GLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD 284

BLAST of CmaCh03G007470 vs. NCBI nr
Match: KAG6603971.1 (Zinc-finger homeodomain protein 6, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 543/585 (92.82%), Positives = 554/585 (94.70%), Query Frame = 0

Query: 1   MMQRCGSYQCYSAGECSCGAFYGQQGSYFSTPAYNNYYESEHYSFDSSSPVDCTLSLGTP 60
           MMQRCGSYQCYSAGECSCGAFY QQGSYFSTPAYNNYYESEHYSFDSSSPVDCTLSLGTP
Sbjct: 1   MMQRCGSYQCYSAGECSCGAFYAQQGSYFSTPAYNNYYESEHYSFDSSSPVDCTLSLGTP 60

Query: 61  STRMTEYDEKRREEQHSASNFAWDLSRTKHGHSSKTSRRSGNTGGDKSRANGDQMFSRHC 120
           STRMTEYDEKRREEQHSASNFAWDLSRTKHGHSSKTSRRSGNTG DKSRANGDQMFSRHC
Sbjct: 61  STRMTEYDEKRREEQHSASNFAWDLSRTKHGHSSKTSRRSGNTGSDKSRANGDQMFSRHC 120

Query: 121 ANCDTTTTPLWRNGPSGPKSLCNACGIRYKKEERKAASSGQQANSMYKNEASSWLQHHSH 180
           ANCDTTTTPLWRNGPSGPKSLCNACGIRYKKEERKAASSGQQANSMYKNEASSWLQHHSH
Sbjct: 121 ANCDTTTTPLWRNGPSGPKSLCNACGIRYKKEERKAASSGQQANSMYKNEASSWLQHHSH 180

Query: 181 SQKTPRFPHGITNDLNPGVAFLSWSLNDTEQPQLILLAASYFSIIRQFPQWNIKADEFDR 240
           SQKTPRFPHGITNDLNPGVAFLSWSLNDTEQPQL             +  + IKADEFDR
Sbjct: 181 SQKTPRFPHGITNDLNPGVAFLSWSLNDTEQPQL-------------YYDFTIKADEFDR 240

Query: 241 VISYNNLEIKSCTSSFELLTEEMISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARR 300
           VISY NLEIKSCTSSFELL EEMISSQPI TRRLY TPP+ LFKTQLL TNHLPPPMARR
Sbjct: 241 VISYYNLEIKSCTSSFELLKEEMISSQPIVTRRLYITPPDFLFKTQLLPTNHLPPPMARR 300

Query: 301 TRRKLLLQSESQTEADPPSKISFRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGP 360
           TRRKLLLQSESQT+ADPPS ISFRTT+IRKISST+K DKPQIST GGGDRTRAFPNQDGP
Sbjct: 301 TRRKLLLQSESQTDADPPSNISFRTTKIRKISSTQKSDKPQISTPGGGDRTRAFPNQDGP 360

Query: 361 VKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAE 420
           VKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLA+TKSILYQQLATKAAE
Sbjct: 361 VKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKSNPPFLALTKSILYQQLATKAAE 420

Query: 421 SIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSIL 480
           SIYNRFASLCGG+AAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKF+EG+LSNSSIL
Sbjct: 421 SIYNRFASLCGGDAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFIEGSLSNSSIL 480

Query: 481 EMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPV 540
           EMDDETLLSALT VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPV
Sbjct: 481 EMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPV 540

Query: 541 EMEKLCEKWKPYRSMGAWYMWRLMEMKGIAKNDGDLKKNTANGGG 586
           EMEKLCEKWKPYRSMGAWYMWRLMEMK I K+DGDLK NTANGGG
Sbjct: 541 EMEKLCEKWKPYRSMGAWYMWRLMEMKEIVKDDGDLKMNTANGGG 572

BLAST of CmaCh03G007470 vs. NCBI nr
Match: XP_022978525.1 (DNA-3-methyladenine glycosylase 1-like [Cucurbita maxima])

HSP 1 Score: 653.3 bits (1684), Expect = 2.0e-183
Identity = 328/328 (100.00%), Positives = 328/328 (100.00%), Query Frame = 0

Query: 263 MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 322
           MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS
Sbjct: 1   MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 60

Query: 323 FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 382
           FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP
Sbjct: 61  FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 120

Query: 383 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 442
           LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL
Sbjct: 121 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 180

Query: 443 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 502
           GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV
Sbjct: 181 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 240

Query: 503 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 562
           HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR
Sbjct: 241 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 300

Query: 563 LMEMKGIAKNDGDLKKNTANGGGGDVVM 591
           LMEMKGIAKNDGDLKKNTANGGGGDVVM
Sbjct: 301 LMEMKGIAKNDGDLKKNTANGGGGDVVM 328

BLAST of CmaCh03G007470 vs. NCBI nr
Match: XP_022949777.1 (DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata])

HSP 1 Score: 605.1 bits (1559), Expect = 6.3e-169
Identity = 303/323 (93.81%), Positives = 312/323 (96.59%), Query Frame = 0

Query: 263 MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 322
           MISSQPI TRRLY TPP+ LFKTQLL TNHLPPPMARRTRRKLLLQSESQT+ADPPS IS
Sbjct: 1   MISSQPIVTRRLYITPPDFLFKTQLLPTNHLPPPMARRTRRKLLLQSESQTDADPPSNIS 60

Query: 323 FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 382
           FRTT+IRKISST+K DKPQIST GGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP
Sbjct: 61  FRTTKIRKISSTQKSDKPQISTPGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 120

Query: 383 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 442
           LLIRLLDSCESPNFKSNPPFLA+TKSILYQQLATKAAESIYNRFASLCGG+AAVLPDAVL
Sbjct: 121 LLIRLLDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGDAAVLPDAVL 180

Query: 443 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 502
           GLSPQQLRVVGVSGRKASYLHDLATKF+EG+LSNSSILEMDDETLLSALT VKGIGVWSV
Sbjct: 181 GLSPQQLRVVGVSGRKASYLHDLATKFIEGSLSNSSILEMDDETLLSALTAVKGIGVWSV 240

Query: 503 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 562
           HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR
Sbjct: 241 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 300

Query: 563 LMEMKGIAKNDGDLKKNTANGGG 586
           LMEMK I K+DGDLK NTANGGG
Sbjct: 301 LMEMKEIVKDDGDLKMNTANGGG 323

BLAST of CmaCh03G007470 vs. NCBI nr
Match: KAG7034142.1 (mag1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 602.8 bits (1553), Expect = 3.1e-168
Identity = 302/323 (93.50%), Positives = 311/323 (96.28%), Query Frame = 0

Query: 263 MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 322
           MISSQPI TRRLY TPP+ LFKTQLL TNHLPPPMARRTRRKLLLQSESQT+ DPPS IS
Sbjct: 1   MISSQPIVTRRLYITPPDFLFKTQLLPTNHLPPPMARRTRRKLLLQSESQTDDDPPSNIS 60

Query: 323 FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 382
           FRTT+IRKISST+K DKPQIST GGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP
Sbjct: 61  FRTTKIRKISSTQKSDKPQISTPGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 120

Query: 383 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 442
           LLIRLLDSCESPNFKSNPPFLA+TKSILYQQLATKAAESIYNRFASLCGG+AAVLPDAVL
Sbjct: 121 LLIRLLDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGDAAVLPDAVL 180

Query: 443 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 502
           GLSPQQLRVVGVSGRKASYLHDLATKF+EG+LSNSSILEMDDETLLSALT VKGIGVWSV
Sbjct: 181 GLSPQQLRVVGVSGRKASYLHDLATKFIEGSLSNSSILEMDDETLLSALTAVKGIGVWSV 240

Query: 503 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 562
           HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR
Sbjct: 241 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 300

Query: 563 LMEMKGIAKNDGDLKKNTANGGG 586
           LMEMK I K+DGDLK NTANGGG
Sbjct: 301 LMEMKEIVKDDGDLKMNTANGGG 323

BLAST of CmaCh03G007470 vs. NCBI nr
Match: XP_023543059.1 (DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 600.9 bits (1548), Expect = 1.2e-167
Identity = 304/323 (94.12%), Positives = 311/323 (96.28%), Query Frame = 0

Query: 263 MISSQPITTRRLYFTPPESLFKTQLLHTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 322
           MISSQPI TRRLY TP ESL KTQ+L TNHLPPPMARRTRRKLLLQSESQTEADPPSKIS
Sbjct: 1   MISSQPIITRRLYCTPLESLLKTQVLPTNHLPPPMARRTRRKLLLQSESQTEADPPSKIS 60

Query: 323 FRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLRRSDP 382
           FRTT+IRKISST+KPDKPQIST GGGDRTRAFPNQDGPVKSLSSSDVI TAIDHLRRSDP
Sbjct: 61  FRTTKIRKISSTQKPDKPQISTPGGGDRTRAFPNQDGPVKSLSSSDVIRTAIDHLRRSDP 120

Query: 383 LLIRLLDSCESPNFKSNPPFLAITKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 442
           LLIRLLDSCESPNFKSNPPFLA+TKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL
Sbjct: 121 LLIRLLDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVLPDAVL 180

Query: 443 GLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVKGIGVWSV 502
           GLSPQQLRVVGVSGRKASYLHDLATKF+EG+LSNSSILEMDDETLLSALT VKGIGVWSV
Sbjct: 181 GLSPQQLRVVGVSGRKASYLHDLATKFIEGSLSNSSILEMDDETLLSALTAVKGIGVWSV 240

Query: 503 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWR 562
           HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCE WKPYRSMGAWYMWR
Sbjct: 241 HMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCENWKPYRSMGAWYMWR 300

Query: 563 LMEMKGIAKNDGDLKKNTANGGG 586
           LMEMK I K+DGDLK NTANGGG
Sbjct: 301 LMEMKEIVKDDGDLKMNTANGGG 323

BLAST of CmaCh03G007470 vs. TAIR 10
Match: AT1G19480.1 (DNA glycosylase superfamily protein)

HSP 1 Score: 269.6 bits (688), Expect = 5.8e-72
Identity = 146/275 (53.09%), Positives = 185/275 (67.27%), Query Frame = 0

Query: 315 ADPPSKISFRTTEIRKIS---------------STRKPDKPQISTDGGG------DRTRA 374
           + PPSKI  R  +IRK++               S+ + + P ++TDG           RA
Sbjct: 64  SSPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSP-LATDGKSPGKGKLSHLRA 123

Query: 375 FPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKS-NPPFLAITKSILYQ 434
                   + L+    + TAI +LR +DPLL  L+D    P F+S   PFLA+ ++ILYQ
Sbjct: 124 ITVPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNILYQ 183

Query: 435 QLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEG 494
           QLA KA  SIY RF SLCGGE  V+P+ VL L+PQQLR +GVSGRKASYLHDLA K+  G
Sbjct: 184 QLAMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQNG 243

Query: 495 TLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYG 554
            LS+S+IL MD+++L + LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYG
Sbjct: 244 ILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLLYG 303

Query: 555 LKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMK 568
           L +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Sbjct: 304 LDDLPRPSQMEQHCAKWRPYRSVGSWYMWRLIEAK 337

BLAST of CmaCh03G007470 vs. TAIR 10
Match: AT1G19480.2 (DNA glycosylase superfamily protein)

HSP 1 Score: 269.6 bits (688), Expect = 5.8e-72
Identity = 146/275 (53.09%), Positives = 185/275 (67.27%), Query Frame = 0

Query: 315 ADPPSKISFRTTEIRKIS---------------STRKPDKPQISTDGGG------DRTRA 374
           + PPSKI  R  +IRK++               S+ + + P ++TDG           RA
Sbjct: 64  SSPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSP-LATDGKSPGKGKLSHLRA 123

Query: 375 FPNQDGPVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKS-NPPFLAITKSILYQ 434
                   + L+    + TAI +LR +DPLL  L+D    P F+S   PFLA+ ++ILYQ
Sbjct: 124 ITVPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNILYQ 183

Query: 435 QLATKAAESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEG 494
           QLA KA  SIY RF SLCGGE  V+P+ VL L+PQQLR +GVSGRKASYLHDLA K+  G
Sbjct: 184 QLAMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQNG 243

Query: 495 TLSNSSILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYG 554
            LS+S+IL MD+++L + LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYG
Sbjct: 244 ILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLLYG 303

Query: 555 LKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEMK 568
           L +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Sbjct: 304 LDDLPRPSQMEQHCAKWRPYRSVGSWYMWRLIEAK 337

BLAST of CmaCh03G007470 vs. TAIR 10
Match: AT1G75230.2 (DNA glycosylase superfamily protein)

HSP 1 Score: 267.3 bits (682), Expect = 2.9e-71
Identity = 142/274 (51.82%), Positives = 180/274 (65.69%), Query Frame = 0

Query: 315 ADPPSKISFRTTEIRKIS---------------STRKPDKPQISTDGGGDRTRAFPNQDG 374
           + PP+KI  R  +IRK+S               S     KP   +     RT   P    
Sbjct: 67  SSPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMTTTKPATKSKLSQSRTVTVPRIQ- 126

Query: 375 PVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKS-NPPFLAITKSILYQQLATKA 434
             +SL+    +  A+ HLR  DPLL  L+D    P F++   PFLA+ +SILYQQLA KA
Sbjct: 127 -ARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFLALIRSILYQQLAAKA 186

Query: 435 AESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSS 494
             SIY RF +LCGGE  V+P+ VL L+PQQLR +GVSGRKASYLHDLA K+  G LS+S 
Sbjct: 187 GNSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSG 246

Query: 495 ILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPK 554
           I+ MD+++L + LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+
Sbjct: 247 IVNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQMLNGMEDLPR 306

Query: 555 PVEMEKLCEKWKPYRSMGAWYMWRLMEMKGIAKN 573
           P +ME+LCEKW+PYRS+ +WY+WRL+E K    N
Sbjct: 307 PSKMEQLCEKWRPYRSVASWYLWRLIESKNTPPN 338

BLAST of CmaCh03G007470 vs. TAIR 10
Match: AT1G75230.1 (DNA glycosylase superfamily protein)

HSP 1 Score: 267.3 bits (682), Expect = 2.9e-71
Identity = 142/274 (51.82%), Positives = 180/274 (65.69%), Query Frame = 0

Query: 315 ADPPSKISFRTTEIRKIS---------------STRKPDKPQISTDGGGDRTRAFPNQDG 374
           + PP+KI  R  +IRK+S               S     KP   +     RT   P    
Sbjct: 67  SSPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMTTTKPATKSKLSQSRTVTVPRIQ- 126

Query: 375 PVKSLSSSDVICTAIDHLRRSDPLLIRLLDSCESPNFKS-NPPFLAITKSILYQQLATKA 434
             +SL+    +  A+ HLR  DPLL  L+D    P F++   PFLA+ +SILYQQLA KA
Sbjct: 127 -ARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFLALIRSILYQQLAAKA 186

Query: 435 AESIYNRFASLCGGEAAVLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSS 494
             SIY RF +LCGGE  V+P+ VL L+PQQLR +GVSGRKASYLHDLA K+  G LS+S 
Sbjct: 187 GNSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSG 246

Query: 495 ILEMDDETLLSALTGVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPK 554
           I+ MD+++L + LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+
Sbjct: 247 IVNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQMLNGMEDLPR 306

Query: 555 PVEMEKLCEKWKPYRSMGAWYMWRLMEMKGIAKN 573
           P +ME+LCEKW+PYRS+ +WY+WRL+E K    N
Sbjct: 307 PSKMEQLCEKWRPYRSVASWYLWRLIESKNTPPN 338

BLAST of CmaCh03G007470 vs. TAIR 10
Match: AT3G50880.1 (DNA glycosylase superfamily protein)

HSP 1 Score: 265.4 bits (677), Expect = 1.1e-70
Identity = 142/252 (56.35%), Positives = 178/252 (70.63%), Query Frame = 0

Query: 319 SKISFRTTEIRKISSTRKPDKPQISTDGGGDRTRAFPNQDGPVKSLSSSDVICTAIDHLR 378
           S+I FR  +IRK+SS   P              R       P+ + S+ D+   A+ HL+
Sbjct: 36  SRIRFRPRKIRKVSSDPSP--------------RIIITASPPLSTKSTVDI---ALRHLQ 95

Query: 379 RSDPLLIRLLDSCESPNF--KSNPPFLAITKSILYQQLATKAAESIYNRFASLC-GGEAA 438
            SD LL  L+ +   P     SN PFL++ +SILYQQLATKAA+ IY+RF SL  GGEA 
Sbjct: 96  SSDELLGALITTHNDPPLFDSSNTPFLSLARSILYQQLATKAAKCIYDRFISLFNGGEAG 155

Query: 439 VLPDAVLGLSPQQLRVVGVSGRKASYLHDLATKFVEGTLSNSSILEMDDETLLSALTGVK 498
           V+P++V+ LS   LR +GVSGRKASYLHDLA K+  G LS+  IL+M DE L+  LT VK
Sbjct: 156 VVPESVISLSAVDLRKIGVSGRKASYLHDLADKYNNGVLSDELILKMSDEELIDRLTLVK 215

Query: 499 GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSM 558
           GIGVW+VHMFMIF+LHRPDVLPVGDLGVRKGV+ LYGLK LP P++ME+LCEKW+PYRS+
Sbjct: 216 GIGVWTVHMFMIFSLHRPDVLPVGDLGVRKGVKDLYGLKNLPGPLQMEQLCEKWRPYRSV 270

Query: 559 GAWYMWRLMEMK 568
           G+WYMWRL+E +
Sbjct: 276 GSWYMWRLIESR 270
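
The percentages in each HSP summary line are the identical (or positive-scoring) positions divided by the alignment length, for example 142/274 = 51.82%. The following is a minimal Python sketch for re-deriving and checking those values from a summary line. The function name `check_hsp_percentages` and the returned keys are hypothetical and chosen for illustration; they are not part of any BLAST tool.

```python
import re

# Pattern for an HSP summary line as printed on this page.
# "Posi?tives" accepts both "Positives" and the misspelt "Postives" label.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_percentages(line):
    """Recompute identity and positives percentages from the raw counts.

    Hypothetical helper: returns the recomputed values and whether they
    agree with the percentages printed in the line.
    """
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("no HSP summary found in line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    return {
        "identity_pct": identity,
        "positives_pct": positives,
        "consistent": (
            abs(identity - float(ident_pct)) < 0.01
            and abs(positives - float(pos_pct)) < 0.01
        ),
    }


# Example with the AT1G75230 summary line from this page.
result = check_hsp_percentages(
    "Identity = 142/274 (51.82%), Positives = 180/274 (65.69%), Query Frame = 0"
)
print(result)
```

The check only covers the arithmetic of the summary line; it does not re-count matches from the alignment rows themselves.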

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
Q8LC79      1.2e-29  38.32     GATA transcription factor 18 OS=Arabidopsis thaliana OX=3702 GN=GATA18 PE=1 SV=2
Q6QPM2      4.7e-26  37.05     GATA transcription factor 19 OS=Arabidopsis thaliana OX=3702 GN=GATA19 PE=1 SV=2
Q92383      3.4e-24  29.11     DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATC...
O94468      9.2e-22  29.19     Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain...
Q9ZPX0      4.6e-21  41.67     GATA transcription factor 20 OS=Arabidopsis thaliana OX=3702 GN=GATA20 PE=2 SV=2

Match Name  E-value   Identity  Description
A0A6J1IQD1  9.8e-184  100.00    DNA-3-methyladenine glycosylase 1-like OS=Cucurbita maxima OX=3661 GN=LOC1114784...
A0A6J1GD23  3.0e-169  93.81     DNA-3-methyladenine glycosylase 1-like OS=Cucurbita moschata OX=3662 GN=LOC11145...
A0A5A7T3R0  1.5e-123  79.52     Putative DNA-3-methyladenine glycosylase 2 OS=Cucumis melo var. makuwa OX=119469...
A0A1S3B2D5  1.5e-123  79.52     probable DNA-3-methyladenine glycosylase 2 OS=Cucumis melo OX=3656 GN=LOC1034850...
A0A0A0KM62  8.1e-122  80.28     ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G502690 PE=4...

Match Name      E-value   Identity  Description
KAG6603971.1    0.0e+00   92.82     Zinc-finger homeodomain protein 6, partial [Cucurbita argyrosperma subsp. sorori...
XP_022978525.1  2.0e-183  100.00    DNA-3-methyladenine glycosylase 1-like [Cucurbita maxima]
XP_022949777.1  6.3e-169  93.81     DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata]
KAG7034142.1    3.1e-168  93.50     mag1, partial [Cucurbita argyrosperma subsp. argyrosperma]
XP_023543059.1  1.2e-167  94.12     DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo]

Match Name   E-value  Identity  Description
AT1G19480.1  5.8e-72  53.09     DNA glycosylase superfamily protein
AT1G19480.2  5.8e-72  53.09     DNA glycosylase superfamily protein
AT1G75230.2  2.9e-71  51.82     DNA glycosylase superfamily protein
AT1G75230.1  2.9e-71  51.82     DNA glycosylase superfamily protein
AT3G50880.1  1.1e-70  56.35     DNA glycosylase superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description             Source       Source Term    Source Description                                 Alignment
IPR003265  HhH-GPD domain              SMART        SM00478        endo3end                                           coord: 410..565; e-value: 1.9E-19; score: 80.6
IPR003265  HhH-GPD domain              PFAM         PF00730        HhH-GPD                                            coord: 407..550; e-value: 4.9E-21; score: 75.2
IPR003265  HhH-GPD domain              CDD          cd00056        ENDO3c                                             coord: 402..563; e-value: 4.35882E-32; score: 119.269
IPR000679  Zinc finger, GATA-type      SMART        SM00401        GATA_3                                             coord: 114..171; e-value: 1.7E-18; score: 77.4
IPR000679  Zinc finger, GATA-type      PFAM         PF00320        GATA                                               coord: 120..154; e-value: 1.1E-17; score: 63.3
IPR000679  Zinc finger, GATA-type      PROSITE      PS00344        GATA_ZN_FINGER_1                                   coord: 120..145
IPR000679  Zinc finger, GATA-type      PROSITE      PS50114        GATA_ZN_FINGER_2                                   coord: 114..150; score: 14.232708
IPR000679  Zinc finger, GATA-type      CDD          cd00202        ZnF_GATA                                           coord: 120..151; e-value: 4.21931E-15; score: 67.783
None       No IPR available            GENE3D       1.10.1670.40   -                                                  coord: 376..562; e-value: 7.8E-67; score: 226.5
None       No IPR available            GENE3D       1.10.340.30    Hypothetical protein; domain 2                     coord: 401..512; e-value: 7.8E-67; score: 226.5
None       No IPR available            MOBIDB_LITE  mobidb-lite    disorder_prediction                                coord: 310..328
None       No IPR available            MOBIDB_LITE  mobidb-lite    disorder_prediction                                coord: 308..364
None       No IPR available            MOBIDB_LITE  mobidb-lite    disorder_prediction                                coord: 64..78
None       No IPR available            MOBIDB_LITE  mobidb-lite    disorder_prediction                                coord: 64..114
None       No IPR available            PANTHER      PTHR43003      DNA-3-METHYLADENINE GLYCOSYLASE                    coord: 308..573
None       No IPR available            PANTHER      PTHR43003:SF8  HHH-GPD BASE EXCISION DNA REPAIR FAMILY PROTEIN    coord: 308..573
None       No IPR available            SUPERFAMILY  57716          Glucocorticoid receptor-like (DNA-binding domain)  coord: 116..154
IPR013088  Zinc finger, NHR/GATA-type  GENE3D       3.30.50.10     -                                                  coord: 116..183; e-value: 9.2E-17; score: 62.9
IPR011257  DNA glycosylase             SUPERFAMILY  48150          DNA-glycosylase                                    coord: 399..563

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh03G007470.1  CmaCh03G007470.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006285      base-excision repair, AP site formation
biological_process  GO:0006307      DNA dealkylation involved in DNA repair
biological_process  GO:0009908      flower development
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006281      DNA repair
cellular_component  GO:0005634      nucleus
cellular_component  GO:0032993      protein-DNA complex
molecular_function  GO:0032131      alkylated DNA binding
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0043916      DNA-7-methylguanine glycosylase activity
molecular_function  GO:0003700      DNA-binding transcription factor activity
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0003824      catalytic activity