Cp4.1LG06g01130 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG06g01130
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DNA-3-methyladenine glycosylase 1
Location: Cp4.1LG06: 536354 .. 540637 (-)
RNA-Seq Expression: Cp4.1LG06g01130
Synteny: Cp4.1LG06g01130
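
The Location field above can be turned into a structured record. The following is a minimal sketch, assuming the `seqid: start .. end (strand)` layout shown on this page and assuming the coordinates are 1-based and inclusive (the page does not state the convention). `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into a dict.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span covered on the reference sequence.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG06: 536354 .. 540637 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG06 - 4284
```

Under the inclusive-coordinate assumption the gene spans 4284 positions on the minus strand of Cp4.1LG06.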
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAGGCTCAGAACACGTTTCATGAATCCTCCAACTCCACAACCACTATCGCCCAAGCTACTGTGGCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGGAAGCTCTCGCCTGATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCATTCGCGTCTGCCCCGGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGGCATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCTTCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTACAAAGCTGGCACCTCAATCTACACTCGTTTCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGTGTTCAACTTCTCTACAATCTTGAAGAGTTGCCTCGACCATCGCAGATGGATCACTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCGTGGTATATGTGGAGGTTTGCTGAGGCGAAGGGGGCTTCTTCAAGCGCAGCAGCAATAGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCGGAGCACCAGCATCAACAGCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCTCAATCTTGGGTAAAACCTTTATGGCTTTGTTATCTTCTCTTGTTTATATTTTTCCACGTATTGCATGTAGTTTAAGTTGAAAAATCTCATCTTCCTCTATCGATGTAGAAACTAAAATAGTTTTTCTTTTTTGTGTTGGAGTTTCCCTTATTTGCAGTCCCAAATTGTGTTTCTTGCATCCATTATTTATCAAATGCGGCATACCATACTTATACATGGTGTTTAGAGAAGTTTTTTTTTTCTTCTTTTAATTGCATCTAGCTTCTAACGAAAGATGTCTGAATATCTAAATTGACGATACTTATTTGCATGTTTCTTGTGATATATGAAAGTAAGCTTTGCGTGGTATATTTCGACCTTTTTTCTGGGATCTGTATGCTTCTTTTTCTTGTCTGTACTCTTGTAACTCTATAGCGTGCTTCCAATTATCTGCATTGTTTTCTCAGATATTAATTTTTCTTGCAATCCTTTTACTTTTGAAATAAGCAGAAAGTTTCATCGATATAATGAAATATACAAAGCCACAACCTATGGGCTATATTAAATACTTGTTCCCTTTATTGTTTAGATGCTTGGTGAAAATGAGCACCATCCATGCTTGTTAGCCACATTATTTTTGTCTTCGGTTTTTGCAGGCTTCTTATGGTAGGGGAACATAGCATAGGCCAATATCATTGAAACAAAGCCACATGCAAAATTTGATAATATAGGAGCAGATTTCTGCATTTGTGGAATATTTATTCAAGTCAGTCATCTGTAGTGAAAAGTAGTTCTTCAGAATATTAGAAAGGTATATCATGTTCGAAGTTTAAGATGGTAACTTGAGTATACCTTGGGTGTACTTGGATCATTGATCTTCTTGGAGTGGTCAAGGAATGTGGAAGATGAATATGGAAGTGA
GGATCACTCTTAAGGGGCATCGATTATTCATATCCTCCTCCTCCTCTTCTTCTCCTCTTCTTTGGTGCTTTCACGTGGTGAATGCATACTGTTTTTCTCCCCCTTATGCCGTTTTTGTTTCTTTTTCTTGGAAGTCTTGAAAGTTTGTAGTCTGAACTTGATAGATTGACAGACACGTGTTTGGAATTTGTCCTTGTTTTGATTTGCTAATACTAGAAGTCTAGAACTCTCAGATTTCGATTTTATATTCAATTTTTTATGAAGCCATAGCTTACCACTATCAGTGATTTGAAAAGCGCAAAAAAGCTCTCCTAGGTGCGCTTTTTAGAAAAGACAAAGGTGAGGTGCTCCTATGAAGCACACTTGAGGCCCCGTCTCTGTGCTTGAAGAACCTTTTTTTATTTTATTTTTTTATTTTTTTATTTTTTATTTTATTTCTAATTTTTTAGGAAGAGATTCAGAAGTGTTCTTAAAAAGTTCGTTTTTCACCAGATTCTTTTATAAAAGTTCTAATTTCTTAAACTTCTCATTATCCTATTAGTTGTAAACCTATATTTTCCATTTATCAAAAAGTTATTTGCATTTTTGTATTTGAGTGCTCCCAAAAAAGCCATCGTTTTTTTTTATCTGCACGCTTTGCAGAAGTCCTAGAAGAGAGTACTGTTTTTTAGTGTGCCGCTCTTTGAAAAACACTTACCACTTATGTTGATTCTATAGTTGAGGTTTGGAGAAGCCATGCAATTTCTCTGTTGGATCCAGAAAATTTAATTTGTATCGCTAAGGTTTATGGTCATGAACACAAATCTTGCTTGTTAATGCTCTTTTTTTTTTTAATAATTGAAAACGATGAGGCAGCAAGATAGGCCCCTTTTCTGGAATAAAGAGAGAACAAACTGTGGAACACTTGACAAAAACTTATTGTTGCTGCCCCAGGATCATCTTAGGCATCTGTTCTGTCTTGGCCGTTGGTTCTTGAATGATTGCTCTCTCGTCTCTTTTCCTATGTTTTGGATCATTGGGAAACAGTAACAAACTTTTCTCCGTTCTTTAATTGGTGTATCGTTCAATTCCTTTATTTGCTACTCTTCATGTGATTCGTGTTGCTCATGTGGATAATGTTATTTATGTTTAGGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTACATCTTTGCAGATAGCCCAATCAGTTCATCCACTGAATGATAAGTATGCCAATTATGTGGGAGTAGAAGACTCAGCAATGAATATTTGGCTCCTCGAGGTTGGATACTCTAGACTTCAATATTAAATAATGAAATGACATGTGCTAGTTTCCTTCTCTAGGTAGGTTGATGTCACTTTTAAGGCCGTCAACATGTATCATGTGGCAAGGGATATTGATAGAATGTTCCTCTTGAATGTCTGTGCTTTCAATGTGATTTAGCACGAGTTCGACTTCGAGAAACCCGCTTGTTCACTTGCGCATATTCAACGACACAACAATCTGTAATGACTTTATGATCCATCTTTTACCCTTTTCTCTACTTCAATTTTTCTGAAACATTTATCTTAGTTTGTACATAGAAAATAATCATTTATGCGGATAGTCGATGGCTGGCTGTGACGTTTGCTATTCGGACTTGTAATCCTATCTCATCTTGCCTCATCTATTCAACATGGGAGAAAATGCATTATTGCTCAGTAGCCTCATTCATCTTTTTTACCTTTGAAAATACGGCTGGTGTTTTTAGTAAGGGGTAGCATCGTAGAATTTACCTTTGAGAACTTGATTTATAGGGACTATGAAGCATGGATTCTGATGATTTTAGCTGGAGAAACGCTGCAGTTGGGTTTGTAAACTTGTCACTTCGAATCAAATGGCTTTGAATATCTGAAAGTGACAAGATTTGGGCCGACTACTTGTTGAAATGAACTTTGGAACCAATCAAAGTTGAATGAAAGAAATTACATTGGTTTGAAGTTTCTCAATGTAACCAAGCGAATTGAATGCA
CTCTTGCTTTTAATCTATTTTTTCTGAATTTTATTTGCACCATTTATTAATTGGTTACATATTTAATAACCTACATATGATTAATTTTTCTTTCTTTTTCTTTCCTTTTCAATTCAATTCAATGTGATTGATGGAATATGTTAGAAGTTATTGAAGTATGTTCAGATTTAGTCAATACACATAATTATAAACAATTTTTATGAGATGCATATTACAATGGGTACTATATTTAAAAAACATATGAAAATTTAAAAGTAACATTAATACCATTTATCTTTATAATA

mRNA sequence

ATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAGGCTCAGAACACGTTTCATGAATCCTCCAACTCCACAACCACTATCGCCCAAGCTACTGTGGCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGGAAGCTCTCGCCTGATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCATTCGCGTCTGCCCCGGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGGCATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCTTCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTACAAAGCTGGCACCTCAATCTACACTCGTTTCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTACATCTTTGCAGATAGCCCAATCAGTTCATCCACTGAATGATAAGTATGCCAATTATGTGGGAGTAGAAGACTCAGCAATGAATATTTGGCTCCTCGAGGTTGGATACTCTAGACTTCAATATTAAATAATGAAATGACATGTGCTAGTTTCCTTCTCTAGGTAGGTTGATGTCACTTTTAAGGCCGTCAACATGTATCATGTGGCAAGGGATATTGATAGAATGTTCCTCTTGAATGTCTGTGCTTTCAATGTGATTTAGCACGAGTTCGACTTCGAGAAACCCGCTTGTTCACTTGCGCATATTCAACGACACAACAATCTGGACTATGAAGCATGGATTCTGATGATTTTAGCTGGAGAAACGCTGCAGTTGGGTTTGTAAACTTGTCACTTCGAATCAAATGGCTTTGAATATCTGAAAGTGACAAGATTTGGGCCGACTACTTGTTGAAATGAACTTTGGAACCAATCAAAGTTGAATGAAAGAAATTACATTGGTTTGAAGTTTCTCAATGTAACCAAGCGAATTGAATGCACTCTTGCTTTTAATCTATTTTTTCTGAATTTTATTTGCACCATTTATTAATTGGTTACATATTTAATAACCTACATATGATTAATTTTTCTTTCTTTTTCTTTCCTTTTCAATTCAATTCAATGTGATTGATGGAATATGTTAGAAGTTATTGAAGTATGTTCAGATTTAGTCAATACACATAATTATAAACAATTTTTATGAGATGCATATTACAATGGGTACTATATTTAAAAAACATATGAAAATTTAAAAGTAACATTAATACCATTTATCTTTATAATA

Coding sequence (CDS)

ATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAGGCTCAGAACACGTTTCATGAATCCTCCAACTCCACAACCACTATCGCCCAAGCTACTGTGGCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGGAAGCTCTCGCCTGATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCATTCGCGTCTGCCCCGGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGGCATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCTTCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTACAAAGCTGGCACCTCAATCTACACTCGTTTCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTACATCTTTGCAGATAGCCCAATCAGTTCATCCACTGAATGA

Protein sequence

MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGPVLGGSDSDRKEYIFADSPISSSTE
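
The protein sequence above can be reproduced from the CDS by codon-wise translation. The sketch below assumes the standard nuclear genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, and the two inputs are the first 18 and last 9 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

print(translate("ATGGGAGAGCACACGCAA"))  # MGEHTQ (start of the protein)
print(translate("ACTGAATGA"))           # TE (end of the protein, then stop)
```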
Homology
BLAST of Cp4.1LG06g01130 vs. ExPASy Swiss-Prot
Match: Q92383 (DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag1 PE=1 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 1.1e-13
Identity = 42/127 (33.07%), Positives = 69/127 (54.33%), Query Frame = 0

Query: 159 PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSS 218
           P+  L R++  QQL  KA  +I+ RF ++        PE +  +  + +R  G S RK  
Sbjct: 50  PYEELIRAVASQQLHSKAANAIFNRFKSISNNGQFPTPEEIRDMDFEIMRACGFSARKID 109

Query: 219 YLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPIN 278
            L  +A    +G++ +      + ++ L   LT + GIG W+V M +IFSL+R DV+P +
Sbjct: 110 SLKSIAEATISGLIPTKEEAERLSNEELIERLTQIKGIGRWTVEMLLIFSLNRDDVMPAD 169

Query: 279 DLNVRKG 285
           DL++R G
Sbjct: 170 DLSIRNG 176
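
In BLAST pairwise output, the middle line of each block marks identical residues with the residue letter and positive-scoring substitutions with `+`. The Identity and Positives figures in the HSP header are counts over the full alignment length. A minimal sketch of that bookkeeping follows; `midline_stats` is a hypothetical helper, and the example uses only the last block of the alignment above.

```python
def midline_stats(midline, aligned_length):
    """Count identities and positives from a BLAST midline string.

    Letters are identical residues; '+' marks a positive-scoring substitution.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")

    def pct(n):
        return round(100.0 * n / aligned_length, 2)

    return identities, positives, pct(identities), pct(positives)

# Last block of the Q92383 alignment: Query DLNVRKG / Sbjct DLSIRNG.
print(midline_stats("DL++R G", 7))  # (4, 6, 57.14, 85.71)

# The header percentages follow the same arithmetic over the whole HSP.
print(round(100.0 * 42 / 127, 2), round(100.0 * 69 / 127, 2))  # 33.07 54.33
```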

BLAST of Cp4.1LG06g01130 vs. ExPASy Swiss-Prot
Match: O31544 (Putative DNA-3-methyladenine glycosylase YfjP OS=Bacillus subtilis (strain 168) OX=224308 GN=yfjP PE=3 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 2.5e-10
Identity = 45/171 (26.32%), Positives = 79/171 (46.20%), Query Frame = 0

Query: 117 LARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKA 176
           + R    E  ++  L H       L+ + + H         + +  + + I++QQL    
Sbjct: 78  IKRIFQWENHLQHVLDHFSKTS--LSAIFEEHAGTPLVLDYSVYNCMMKCIIHQQLNLSF 137

Query: 177 GTSIYTRFIALCGGEA-GV----LPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGI 236
             ++  RF+   G +  GV     PET+  L  Q LR +  S RK+ Y  D +R    G 
Sbjct: 138 AYTLTERFVHAFGEQKDGVWCYPKPETIAELDYQDLRDLQFSMRKAEYTIDTSRMIAEGT 197

Query: 237 LSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVR 283
           LS   + +M D+ +   L  + GIG W+V   ++F L RP++ P+ D+ ++
Sbjct: 198 LSLSELPHMADEDIMKKLIKIRGIGPWTVQNVLMFGLGRPNLFPLADIGLQ 246

BLAST of Cp4.1LG06g01130 vs. ExPASy Swiss-Prot
Match: O94468 (Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag2 PE=1 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 2.3e-08
Identity = 42/166 (25.30%), Positives = 78/166 (46.99%), Query Frame = 0

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGT 180
           +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  
Sbjct: 1   MSKDSDYKRAEKHLSSIDNKWSSLVKKVGPCTLTPHPEHAPYEGIIRAITSQKLSDAATN 60

Query: 181 SIYTRFIALCG-GEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQN-GILSDPA 240
           SI  +F   C   +    P+ ++    + L + G S  KS  +H +A    N  I S   
Sbjct: 61  SIINKFCTQCSDNDEFPTPKQIMETDVETLHECGFSKLKSQEIHIVAEAALNKQIPSKSE 120

Query: 241 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVR 283
           I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++
Sbjct: 121 IEKMSEEELMESLSKIKGVKRWTIEMYSIFTLGRLDIMPADDSTLK 166

BLAST of Cp4.1LG06g01130 vs. NCBI nr
Match: XP_023536439.1 (probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo] >KAG6591312.1 hypothetical protein SDJN03_13658, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 538 bits (1387), Expect = 1.21e-190
Identity = 284/284 (100.00%), Positives = 284/284 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. NCBI nr
Match: XP_022936456.1 (probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata])

HSP 1 Score: 538 bits (1387), Expect = 1.21e-190
Identity = 284/284 (100.00%), Positives = 284/284 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. NCBI nr
Match: XP_022976000.1 (probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima])

HSP 1 Score: 537 bits (1384), Expect = 3.47e-190
Identity = 283/284 (99.65%), Positives = 284/284 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. NCBI nr
Match: KAG7024195.1 (mag1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 538 bits (1387), Expect = 2.19e-189
Identity = 284/284 (100.00%), Positives = 284/284 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. NCBI nr
Match: XP_008466558.1 (PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo])

HSP 1 Score: 508 bits (1307), Expect = 2.03e-178
Identity = 269/284 (94.72%), Positives = 273/284 (96.13%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSK
Sbjct: 1   MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNKSKTAQQRAAFASA V LARS
Sbjct: 61  MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQRAAFASATVPLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. ExPASy TrEMBL
Match: A0A6J1F7I4 (probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita moschata OX=3662 GN=LOC111443071 PE=4 SV=1)

HSP 1 Score: 538 bits (1387), Expect = 5.87e-191
Identity = 284/284 (100.00%), Positives = 284/284 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. ExPASy TrEMBL
Match: A0A6J1IIA5 (probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita maxima OX=3661 GN=LOC111476532 PE=4 SV=1)

HSP 1 Score: 537 bits (1384), Expect = 1.68e-190
Identity = 283/284 (99.65%), Positives = 284/284 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. ExPASy TrEMBL
Match: A0A1S3CRJ5 (DNA-3-methyladenine glycosylase 1 OS=Cucumis melo OX=3656 GN=LOC103503942 PE=4 SV=1)

HSP 1 Score: 508 bits (1307), Expect = 9.84e-179
Identity = 269/284 (94.72%), Positives = 273/284 (96.13%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSK
Sbjct: 1   MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNKSKTAQQRAAFASA V LARS
Sbjct: 61  MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQRAAFASATVPLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. ExPASy TrEMBL
Match: A0A5A7TDX5 (DNA-3-methyladenine glycosylase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold242G00470 PE=4 SV=1)

HSP 1 Score: 508 bits (1307), Expect = 1.66e-177
Identity = 269/284 (94.72%), Positives = 273/284 (96.13%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSK
Sbjct: 87  MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 146

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNKSKTAQQRAAFASA V LARS
Sbjct: 147 MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQRAAFASATVPLARS 206

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 207 LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 266

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 267 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 326

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 327 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 370

BLAST of Cp4.1LG06g01130 vs. ExPASy TrEMBL
Match: A0A0A0LIY1 (ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G857070 PE=4 SV=1)

HSP 1 Score: 504 bits (1298), Expect = 2.56e-177
Identity = 268/284 (94.37%), Positives = 271/284 (95.42%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSK
Sbjct: 1   MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNKSKTA QRAAFASA V  ARS
Sbjct: 61  MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVPPARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLAL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 284

BLAST of Cp4.1LG06g01130 vs. TAIR 10
Match: AT1G75230.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 312.0 bits (798), Expect = 5.4e-85
Identity = 179/303 (59.08%), Positives = 214/303 (70.63%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNT-------FHESSNSTTTIAQATVALSEVMNAPSQ 60
           MGEH+  Q  + T   +Q +S    T        ++  ++++     ++  S  + AP  
Sbjct: 1   MGEHSPSQPSSHTLPPNQPESPNHETPNPIPPETNDDDSASSAGVSGSIVSSTTIEAPQV 60

Query: 61  T-----SSPPSKMPLRPRKIRKLSPDES-------DPNSSQVVAIPDGPKPIATSKSNKS 120
           T     SSPP+K+PLRPRKIRKLSPD+        + N SQ+           T  + KS
Sbjct: 61  TELGNVSSPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMT---------TTKPATKS 120

Query: 121 KTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFL 180
           K +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFL
Sbjct: 121 KLSQSRT--VTVPRIQARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFL 180

Query: 181 ALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLH 240
           AL RSILYQQLA KAG SIYTRF+ALCGGE GV+PE VL L+PQQLRQIG+SGRK+SYLH
Sbjct: 181 ALIRSILYQQLAAKAGNSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLH 240

Query: 241 DLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV 285
           DLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Sbjct: 241 DLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGV 292

BLAST of Cp4.1LG06g01130 vs. TAIR 10
Match: AT1G75230.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 312.0 bits (798), Expect = 5.4e-85
Identity = 179/303 (59.08%), Positives = 214/303 (70.63%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNT-------FHESSNSTTTIAQATVALSEVMNAPSQ 60
           MGEH+  Q  + T   +Q +S    T        ++  ++++     ++  S  + AP  
Sbjct: 1   MGEHSPSQPSSHTLPPNQPESPNHETPNPIPPETNDDDSASSAGVSGSIVSSTTIEAPQV 60

Query: 61  T-----SSPPSKMPLRPRKIRKLSPDES-------DPNSSQVVAIPDGPKPIATSKSNKS 120
           T     SSPP+K+PLRPRKIRKLSPD+        + N SQ+           T  + KS
Sbjct: 61  TELGNVSSPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMT---------TTKPATKS 120

Query: 121 KTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFL 180
           K +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFL
Sbjct: 121 KLSQSRT--VTVPRIQARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFL 180

Query: 181 ALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLH 240
           AL RSILYQQLA KAG SIYTRF+ALCGGE GV+PE VL L+PQQLRQIG+SGRK+SYLH
Sbjct: 181 ALIRSILYQQLAAKAGNSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLH 240

Query: 241 DLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV 285
           DLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Sbjct: 241 DLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGV 292

BLAST of Cp4.1LG06g01130 vs. TAIR 10
Match: AT1G19480.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 299.3 bits (765), Expect = 3.6e-81
Identity = 169/296 (57.09%), Positives = 210/296 (70.95%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVALSEVMNAP 60
           MGE +  Q  TQ QS  Q+           ++ +   +S+  + +I  +T   +  +   
Sbjct: 1   MGEQSPSQPSTQCQSHPQSPKPDTHNLIPPESTDECLDSAGVSGSIVSSTTIDARRITEL 60

Query: 61  SQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRA 120
              SSPPSK+PLRPRKIRKL+ D     +   ++ ++      P+AT   +  K      
Sbjct: 61  GNVSSPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSPLATDGKSPGKGKLSHL 120

Query: 121 AFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSIL 180
              + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+IL
Sbjct: 121 RAITVPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNIL 180

Query: 181 YQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQ 240
           YQQLA KAG SIYTRF++LCGGE  V+PETVL+L+PQQLRQIG+SGRK+SYLHDLARKYQ
Sbjct: 181 YQQLAMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQ 240

Query: 241 NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 285
           NGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKG
Sbjct: 241 NGILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKG 296

BLAST of Cp4.1LG06g01130 vs. TAIR 10
Match: AT1G19480.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 299.3 bits (765), Expect = 3.6e-81
Identity = 169/296 (57.09%), Positives = 210/296 (70.95%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVALSEVMNAP 60
           MGE +  Q  TQ QS  Q+           ++ +   +S+  + +I  +T   +  +   
Sbjct: 1   MGEQSPSQPSTQCQSHPQSPKPDTHNLIPPESTDECLDSAGVSGSIVSSTTIDARRITEL 60

Query: 61  SQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRA 120
              SSPPSK+PLRPRKIRKL+ D     +   ++ ++      P+AT   +  K      
Sbjct: 61  GNVSSPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSPLATDGKSPGKGKLSHL 120

Query: 121 AFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSIL 180
              + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+IL
Sbjct: 121 RAITVPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNIL 180

Query: 181 YQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQ 240
           YQQLA KAG SIYTRF++LCGGE  V+PETVL+L+PQQLRQIG+SGRK+SYLHDLARKYQ
Sbjct: 181 YQQLAMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQ 240

Query: 241 NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 285
           NGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKG
Sbjct: 241 NGILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKG 296

BLAST of Cp4.1LG06g01130 vs. TAIR 10
Match: AT3G50880.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 210.7 bits (535), Expect = 1.7e-54
Identity = 125/235 (53.19%), Positives = 151/235 (64.26%), Query Frame = 0

Query: 52  SQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFA 111
           S+ S   S++  RPRKIRK+S D S             P+ I T               A
Sbjct: 29  SEVSGSSSRIRFRPRKIRKVSSDPS-------------PRIIIT---------------A 88

Query: 112 SAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQ 171
           S P      LS +  V++ALRHL+++D LL  LI  H   P FDS  TPFL+L RSILYQ
Sbjct: 89  SPP------LSTKSTVDIALRHLQSSDELLGALITTHNDPPLFDSSNTPFLSLARSILYQ 148

Query: 172 QLAYKAGTSIYTRFIALC-GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQN 231
           QLA KA   IY RFI+L  GGEAGV+PE+V++LS   LR+IG+SGRK+SYLHDLA KY N
Sbjct: 149 QLATKAAKCIYDRFISLFNGGEAGVVPESVISLSAVDLRKIGVSGRKASYLHDLADKYNN 208

Query: 232 GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG 285
           G+LSD  I+ M D+ L   LT+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKG
Sbjct: 209 GVLSDELILKMSDEELIDRLTLVKGIGVWTVHMFMIFSLHRPDVLPVGDLGVRKG 229

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value    Identity  Description
Q92383          1.1e-13    33.07     DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATC...
O31544          2.5e-10    26.32     Putative DNA-3-methyladenine glycosylase YfjP OS=Bacillus subtilis (strain 168) ...
O94468          2.3e-08    25.30     Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain...

NCBI nr
Match Name      E-value    Identity  Description
XP_023536439.1  1.21e-190  100.00    probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo] >KAG6591...
XP_022936456.1  1.21e-190  100.00    probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata]
XP_022976000.1  3.47e-190  99.65     probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima]
KAG7024195.1    2.19e-189  100.00    mag1 [Cucurbita argyrosperma subsp. argyrosperma]
XP_008466558.1  2.03e-178  94.72     PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]

ExPASy TrEMBL
Match Name      E-value    Identity  Description
A0A6J1F7I4      5.87e-191  100.00    probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita moschata OX=3662 GN=LOC1...
A0A6J1IIA5      1.68e-190  99.65     probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita maxima OX=3661 GN=LOC111...
A0A1S3CRJ5      9.84e-179  94.72     DNA-3-methyladenine glycosylase 1 OS=Cucumis melo OX=3656 GN=LOC103503942 PE=4 S...
A0A5A7TDX5      1.66e-177  94.72     DNA-3-methyladenine glycosylase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A0A0LIY1      2.56e-177  94.37     ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G857070 PE=4...

TAIR 10
Match Name      E-value    Identity  Description
AT1G75230.2     5.4e-85    59.08     DNA glycosylase superfamily protein
AT1G75230.1     5.4e-85    59.08     DNA glycosylase superfamily protein
AT1G19480.1     3.6e-81    57.09     DNA glycosylase superfamily protein
AT1G19480.2     3.6e-81    57.09     DNA glycosylase superfamily protein
AT3G50880.1     1.7e-54    53.19     DNA glycosylase superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source       Source Term    Source Description               Alignment
IPR003265  HhH-GPD domain    SMART        SM00478        endo3end                         coord: 168..308; e-value: 0.0078; score: -0.1
IPR003265  HhH-GPD domain    PFAM         PF00730        HhH-GPD                          coord: 165..283; e-value: 5.7E-14; score: 52.5
IPR003265  HhH-GPD domain    CDD          cd00056        ENDO3c                           coord: 160..284; e-value: 1.16202E-19; score: 81.9042
None       No IPR available  GENE3D       1.10.340.30    Hypothetical protein; domain 2   coord: 157..270; e-value: 6.4E-46; score: 158.2
None       No IPR available  GENE3D       1.10.1670.40   -                                coord: 132..284; e-value: 6.4E-46; score: 158.2
None       No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 1..60
None       No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 1..103
None       No IPR available  PANTHER      PTHR43003:SF7  BNAC05G15000D PROTEIN            coord: 1..284
None       No IPR available  PANTHER      PTHR43003      DNA-3-METHYLADENINE GLYCOSYLASE  coord: 1..284
IPR011257  DNA glycosylase   SUPERFAMILY  48150          DNA-glycosylase                  coord: 155..293
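
The `coord` values in the InterPro alignments can be used to cut the matched region out of the protein sequence. This is a minimal sketch, assuming the coordinates are 1-based and inclusive on the protein (an assumption; the page does not state the convention). `slice_domain` is a hypothetical helper, and the demo string is only the first ten residues of the protein listed earlier.

```python
def slice_domain(protein, start, end):
    """Return residues start..end (1-based, inclusive) of a protein string.

    An end coordinate beyond the sequence is clipped by Python slicing.
    """
    if start < 1 or end < start:
        raise ValueError(f"invalid coordinates: {start}..{end}")
    return protein[start - 1:end]

prefix = "MGEHTQVQVQ"  # first ten residues of the protein sequence
print(slice_domain(prefix, 2, 5))   # GEHT
print(slice_domain(prefix, 8, 20))  # QVQ (end clipped to the sequence length)
```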

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG06g01130.1  Cp4.1LG06g01130.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006285      base-excision repair, AP site formation
biological_process  GO:0006307      DNA dealkylation involved in DNA repair
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006281      DNA repair
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005634      nucleus
cellular_component  GO:0032993      protein-DNA complex
molecular_function  GO:0032131      alkylated DNA binding
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0043916      DNA-7-methylguanine glycosylase activity
molecular_function  GO:0003824      catalytic activity