Cp4.1LG13g04420 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g04420
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: N-glycosylase/DNA lyase
Location: Cp4.1LG13 : 6068218 .. 6070953 (-)
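
The location above can be turned into a feature length with a small parser. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); the helper name `parse_location` is hypothetical and not part of any database tool.

```python
import re

# Location string copied from the record above.
LOCATION = "Cp4.1LG13 : 6068218 .. 6070953 (-)"

def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location(LOCATION)
# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 2736 bp on the minus strand of Cp4.1LG13.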
The following sequences are available for this feature:

Gene sequence (with intron)

AAGGATGCAAATGCGAGATAGACTCTCGTTACTCCGGCCTCAAATATAAATTATTTAATTAACTTCAGCATAACTGAGTTGATTATTTTTACTTTTATTTGTGTTGATTGTATTAATATTTTTTACTTCTATTTACATAGGTATTGAACTCAATAACATATTTAACATAATTTAGACCAACAAAAGTACGGATAAGATCTTGATGCTTACTCGAGAATTCAAACTGTTGACCGTTCAGTCTGGTCGTTGGTGGTCAATATTTTAACGGAAGCGCCACTTACCAACCCCCCAAATGCCTGCATTTTCATTGAGACACCGTCTAATGGCGAAGAGGCTCAGACCCACCCCTCCCTCCACTCCCTCCGCCAAGCCATCGCCATCGCCACCGTCATTGCCGCCGTCTCCTCCGACCCCTCAACTCTTCCATTCAAAGCCCACCACCGCGTCCCTCCGCCACTCATCCAACGATCGAAGCAAAACCCTAACCCACCTCGTATCCCCCGCATCCGCAGCATCCTCCAACTGGGTCTCTCTAAATCTCACCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCAGCCCCCTTCACTTCACCGGCGTTGTTGGGCCTCATCTTATCTCTCTCACCCATCTCCCAAATGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACCTCCTCCTCCGCCGCCGCCGCCAGATTGGCCTTGCTTGATTTCCTTAACGCCAGTATCTCCCTAAGTGCCATTTGGGAGGTCTTCTCGGCGGCTGATCCAAGATTCGATGTCTTGTCGCGCCATTTGGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTCGAGTGTTTGATTCAGTTTTTGTGTTCTTCTAATAACAACATTGGGAGAATCACCAAAATGGTGGATTACATCTCATCACTTGGGAACTACTTGGGTCAAATTGGAGGCTTTGATTTCCATGAATTTCCCTCTTTGGAGAGGCTATCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGTAAATCTCTCCATTCCAATACTAATTTGATCTAAGAAATCAATCGCTAGCTCTATCTGTGGGGATGGGAACACTTTTATTGATAGCATAATGTGAATAAGCAGGGCTAAATACATAATTGGCACTGTGAAAGAACTAAAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGCTCTCGAAGAAGTGATTGAAGCACTTACAGCTTTACCGGGCGTGGGTCCAAAGGTAGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCACCATGCCATTCCTGTTGACACACACGTCTGGCAGGTACTATATCCCTTTTCATATGAACTAAATTGATTCAAGTTCTTTCATATGAACTAAATTGATTTAAGTTTATTCTTGTATGTGATGAAAAGATTGCTACTAGGTACCTTGTTCCTGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTTTGCAATCGTGTGGCTGAGGCATTTGTCAACAAGTATGGAAAATATGCTGGATGGGCTCAAACTCTGCTCTTCGTCGCTGATTTGCCTCAACAGAAGGCCCTCTTACCTGCAAGTCTTGAGAATACCAAAAGGAAAAAAACTACAAAGGAGCAGATAGAAAAGGCACATACTGGTAAATATAGATCAATATGAATAGCTATATTAAGTTGGTCTCAGGCTTTTGGCATTCATGTAAATTCCGATTTGTTGTAGAGCAAGATCCATAGGTGTTGGCATAGCTCTTGAAATTCGTTCGGGTTTGCTTTGGCTCGTTGACGTATTCTCCTGGAGGAAGAATTCAAAATGTAATCCCCATGTTTGGGGAAGTCCTACAATCCAGCCTTCTACTAAATGTGATCTAGTTAAAAGGAGATTAATTTTCTGTGACTAGAGAACTGTGGTTTCATTTCATTGTTGATCCTTGACATGAATCAATTAATA
ATTGAAGTATTGCATGATTGTTTCTAGGACTCCTAAAAGAATGAAATAAGCTGTGATCATGGAGATGATTATTGCTAAAGCTATGTATCTATTGAAAGCTACTGTGAATTATCTTGCCATGTGCTTCATTCATGCTAGACATCTTCCATCATTTATGTGCATTGTGCATCAACCAACAGTGTTTTGGTGACAAACGGACATGGATCGCCTCTAAAGTTCCGATTCGAGATTGGAATCACGCATTTAGTCCTCCCTAGACACGCCTGAAACATCACAAAGCTTCCATGTGAGTCGTCGTCTGGAATACCTTTTATTTAATGCAAGATACAAATCTCTTTATAGAATGCTAATGCGTTTCAGGAAATTACTCACATGTTCTACAATGGCTCTGGAGTTTGGCGAGTGACACATTCCAACAGCGTAGCTTTGACAGTCACCAGACGGTGTCCCAAAGCTTGCAAATAAGATTTTGGAGATGTTCTTGTTAGTAGGGCAGCTTAATCGGACCTTAGGCCTTCTACTTTTGTTCTTTGTTCTACTTGCCCTCTGTTTCTTTGCACTCATCCATGAAGCTACCAGAGGATAATGTGATTCAGACACTTGCCCACATGTTTTGCTAATTGAAACAGAATCCAAAGATATCCCAACTGGGCTTCCTGTTTCTTCTTCCAAAATGACCAACAGGTTCCCAGTCGGCTTGAGGAAGGAACGTGGAACATTATACCTGGATTCAAAG

mRNA sequence

AAGGATGCAAATGCGAGATAGACTCTCGTTACTCCGGCCTCAAATATAAATTATTTAATTAACTTCAGCATAACTGAGTTGATTATTTTTACTTTTATTTGTGTTGATTGTATTAATATTTTTTACTTCTATTTACATAGGTATTGAACTCAATAACATATTTAACATAATTTAGACCAACAAAAGTACGGATAAGATCTTGATGCTTACTCGAGAATTCAAACTGTTGACCGTTCAGTCTGGTCGTTGGTGGTCAATATTTTAACGGAAGCGCCACTTACCAACCCCCCAAATGCCTGCATTTTCATTGAGACACCGTCTAATGGCGAAGAGGCTCAGACCCACCCCTCCCTCCACTCCCTCCGCCAAGCCATCGCCATCGCCACCGTCATTGCCGCCGTCTCCTCCGACCCCTCAACTCTTCCATTCAAAGCCCACCACCGCGTCCCTCCGCCACTCATCCAACGATCGAAGCAAAACCCTAACCCACCTCGTATCCCCCGCATCCGCAGCATCCTCCAACTGGGTCTCTCTAAATCTCACCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCAGCCCCCTTCACTTCACCGGCGTTGTTGGGCCTCATCTTATCTCTCTCACCCATCTCCCAAATGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACCTCCTCCTCCGCCGCCGCCGCCAGATTGGCCTTGCTTGATTTCCTTAACGCCAGTATCTCCCTAAGTGCCATTTGGGAGGTCTTCTCGGCGGCTGATCCAAGATTCGATGTCTTGTCGCGCCATTTGGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTCGAGTGTTTGATTCAGTTTTTGTGTTCTTCTAATAACAACATTGGGAGAATCACCAAAATGGTGGATTACATCTCATCACTTGGGAACTACTTGGGTCAAATTGGAGGCTTTGATTTCCATGAATTTCCCTCTTTGGAGAGGCTATCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGGCTAAATACATAATTGGCACTGTGAAAGAACTAAAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGCTCTCGAAGAAGTGATTGAAGCACTTACAGCTTTACCGGGCGTGGGTCCAAAGGTAGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCACCATGCCATTCCTGTTGACACACACGTCTGGCAGATTGCTACTAGGTACCTTGTTCCTGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTTTGCAATCGTGTGGCTGAGGCATTTGTCAACAAGTATGGAAAATATGCTGGATGGGCTCAAACTCTGCTCTTCGTCGCTGATTTGCCTCAACAGAAGGCCCTCTTACCTGCAAGTCTTGAGAATACCAAAAGGAAAAAAACTACAAAGGAGCAGATAGAAAAGGCACATACTGAGCAAGATCCATAGGTGTTGGCATAGCTCTTGAAATTCGTTCGGGTTTGCTTTGGCTCGTTGACGTATTCTCCTGGAGGAAGAATTCAAAATGACTCCTAAAAGAATGAAATAAGCTGTGATCATGGAGATGATTATTGCTAAAGCTATGTATCTATTGAAAGCTACTGTGAATTATCTTGCCATGTGCTTCATTCATGCTAGACATCTTCCATCATTTATGTGCATTGTGCATCAACCAACAGTGTTTTGGTGACAAACGGACATGGATCGCCTCTAAAGTTCCGATTCGAGATTGGAATCACGCATTTAGTCCTCCCTAGACACGCCTGAAACATCACAAAGCTTCCATGAAATTACTCACATGTTCTACAATGGCTCTGGAGTTTGGCGAGTGACACATTCCAACAGCGTAGCTTTGACAGTCACCAGACGGTGTCCCAAAGCTTGCAAATAAGATTTTGGAGATGTTCTTGTTAGTAGGGCAGCTTAATCGGACCTTAGG
CCTTCTACTTTTGTTCTTTGTTCTACTTGCCCTCTGTTTCTTTGCACTCATCCATGAAGCTACCAGAGGATAATGTGATTCAGACACTTGCCCACATGTTTTGCTAATTGAAACAGAATCCAAAGATATCCCAACTGGGCTTCCTGTTTCTTCTTCCAAAATGACCAACAGGTTCCCAGTCGGCTTGAGGAAGGAACGTGGAACATTATACCTGGATTCAAAG

Coding sequence (CDS)

ATGCCTGCATTTTCATTGAGACACCGTCTAATGGCGAAGAGGCTCAGACCCACCCCTCCCTCCACTCCCTCCGCCAAGCCATCGCCATCGCCACCGTCATTGCCGCCGTCTCCTCCGACCCCTCAACTCTTCCATTCAAAGCCCACCACCGCGTCCCTCCGCCACTCATCCAACGATCGAAGCAAAACCCTAACCCACCTCGTATCCCCCGCATCCGCAGCATCCTCCAACTGGGTCTCTCTAAATCTCACCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCAGCCCCCTTCACTTCACCGGCGTTGTTGGGCCTCATCTTATCTCTCTCACCCATCTCCCAAATGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACCTCCTCCTCCGCCGCCGCCGCCAGATTGGCCTTGCTTGATTTCCTTAACGCCAGTATCTCCCTAAGTGCCATTTGGGAGGTCTTCTCGGCGGCTGATCCAAGATTCGATGTCTTGTCGCGCCATTTGGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTCGAGTGTTTGATTCAGTTTTTGTGTTCTTCTAATAACAACATTGGGAGAATCACCAAAATGGTGGATTACATCTCATCACTTGGGAACTACTTGGGTCAAATTGGAGGCTTTGATTTCCATGAATTTCCCTCTTTGGAGAGGCTATCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGGCTAAATACATAATTGGCACTGTGAAAGAACTAAAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGCTCTCGAAGAAGTGATTGAAGCACTTACAGCTTTACCGGGCGTGGGTCCAAAGGTAGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCACCATGCCATTCCTGTTGACACACACGTCTGGCAGATTGCTACTAGGTACCTTGTTCCTGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTTTGCAATCGTGTGGCTGAGGCATTTGTCAACAAGTATGGAAAATATGCTGGATGGGCTCAAACTCTGCTCTTCGTCGCTGATTTGCCTCAACAGAAGGCCCTCTTACCTGCAAGTCTTGAGAATACCAAAAGGAAAAAAACTACAAAGGAGCAGATAGAAAAGGCACATACTGAGCAAGATCCATAG

Protein sequence

MPAFSLRHRLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSKTLTHLVSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKTTKEQIEKAHTEQDP
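
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. This is a minimal sketch, assuming the standard genetic code; the function name `translate` is illustrative, and only the first 39 and last 15 nucleotides of the CDS are copied here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# Fragments copied from the CDS in this record (start and end, both in frame).
CDS_START = "ATGCCTGCATTTTCATTGAGACACCGTCTAATGGCGAAG"
CDS_END = "GAGCAAGATCCATAG"

print(translate(CDS_START))
print(translate(CDS_END))
```

The two fragments translate to the first 13 and last 4 residues of the protein sequence listed above.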
BLAST of Cp4.1LG13g04420 vs. Swiss-Prot
Match: OGG1_ARATH (N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 2.2e-120
Identity = 228/358 (63.69%), Positives = 267/358 (74.58%), Query Frame = 1

Query: 23  PSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSKTLTHLVSPASAASSNWVSLN 82
           P+++PS S    PP  P        P T  L+   +                +  W  L 
Sbjct: 6   PTSQPSISSTVKPPLSP--------PVTPILKQKLH-------------RTGTPKWFPLK 65

Query: 83  LTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGD-VSYCLHSCSTSSS 142
           LT ++L+LPLTFPTGQTFRWK+T  + ++G +GPHL+SL   P  D VSYC+H CSTS  
Sbjct: 66  LTHTELTLPLTFPTGQTFRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVH-CSTSPK 125

Query: 143 AAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQDPLECLIQFLCSS 202
             +A LALLDFLNA ISL+ +W  FS  DPRF  L+RHL GARVLRQDPLECLIQFLCSS
Sbjct: 126 --SAELALLDFLNAEISLAELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSS 185

Query: 203 NNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIG 262
           NNNI RITKMVD++SSLG +LG I GF+FH+FPSL+RLS VSE E R+AGFGYRAKYI G
Sbjct: 186 NNNIARITKMVDFVSSLGLHLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYRAKYITG 245

Query: 263 TVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALFSLDQHHAIPVDT 322
           TV  L+AKPGGG EWLLSLR + L+E + AL  LPGVGPKVAAC+ALFSLDQH AIPVDT
Sbjct: 246 TVNALQAKPGGGNEWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDT 305

Query: 323 HVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALL 380
           HVWQIAT YL+P+LAGA+LTPKL  RVAEAFV+KYG+YAGWAQTLLF+A+LP QK LL
Sbjct: 306 HVWQIATNYLLPDLAGAKLTPKLHGRVAEAFVSKYGEYAGWAQTLLFIAELPAQKTLL 339
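
The identity and positives percentages in each HSP summary are the listed fractions expressed over the alignment length. The sketch below recomputes them for the OGG1_ARATH hit; the values are copied from that hit's summary line, and the helper name `recompute_percentages` is hypothetical.

```python
import re

# Summary values copied from the OGG1_ARATH hit above.
STATS = "Identity = 228/358 (63.69%), Positives = 267/358 (74.58%)"

def recompute_percentages(line):
    """Return {label: (recomputed percent, reported percent)} for each 'x/y (p%)' field."""
    results = {}
    for label, num, den, pct in re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line):
        computed = round(100 * int(num) / int(den), 2)
        results[label] = (computed, float(pct))
    return results

for label, (computed, reported) in recompute_percentages(STATS).items():
    print(label, computed, reported)
```

The same check can be applied to any other summary line in this record by swapping in its text.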

BLAST of Cp4.1LG13g04420 vs. Swiss-Prot
Match: OGG1_RAT (N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1)

HSP 1 Score: 211.1 bits (536), Expect = 2.3e-53
Identity = 135/348 (38.79%), Positives = 193/348 (55.46%), Query Frame = 1

Query: 56  SSNDRSKTLTHLVSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVG 115
           SS+ R +TLT   SPA      W S+   +S+L L L   +GQ+FRW++ SP H++GV+ 
Sbjct: 8   SSSMRHRTLTS--SPAL-----WASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLA 67

Query: 116 PHLISLTHLPNGDVSYCLHSCSTSSSAAAARLALLD----FLNASISLSAIWEVFSAADP 175
             + +LT     D  YC              L  L+    +    +SL+ ++  +++ D 
Sbjct: 68  DQVWTLTQ--TEDQLYCTVYRGDKGQVGRPTLEELETLHKYFQLDVSLTQLYSHWASVDS 127

Query: 176 RFDVLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGQIGGFDF 235
            F  +++  +G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L Q+    +
Sbjct: 128 HFQSVAQKFQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLVQLDDVTY 187

Query: 236 HEFPSLERLSLVS-EAELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVI 295
           H FP+L  L+    E  LR+ G GYRA+Y+  + K +  + GG A WL  LR  + EE  
Sbjct: 188 HGFPNLHALAGPEVETHLRKLGLGYRARYVCASAKAILEEQGGPA-WLQQLRVASYEEAH 247

Query: 296 EALTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR 355
           +AL  LPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ +  +    L N+
Sbjct: 248 KALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIAHRDYGWQPKTSQTKGPSPLANK 307

Query: 356 -VAEAFVNKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKTTKE 395
            +   F N +G YAGWAQ +LF ADL QQ     +     KRKK +K+
Sbjct: 308 ELGNFFRNLWGPYAGWAQAVLFSADLRQQNL---SREPPAKRKKGSKK 342

BLAST of Cp4.1LG13g04420 vs. Swiss-Prot
Match: OGG1_MOUSE (N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2)

HSP 1 Score: 210.3 bits (534), Expect = 3.9e-53
Identity = 130/331 (39.27%), Positives = 188/331 (56.80%), Query Frame = 1

Query: 72  SAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSY 131
           S++ + W S+   +S+L L L   +GQ+FRWK+ SP H++GV+   + +LT     D  Y
Sbjct: 17  SSSPALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTLTQTE--DQLY 76

Query: 132 CLHSCSTSSSAAAARLALLDFLNA----SISLSAIWEVFSAADPRFDVLSRHLEGARVLR 191
           C       S  +   L  L+ L+      +SL+ ++  +++ D  F  +++  +G R+LR
Sbjct: 77  CTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQKFQGVRLLR 136

Query: 192 QDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGQIGGFDFHEFPSLERLS-LVSEA 251
           QDP ECL  F+CSSNNNI RIT MV+ +  + G  L Q+    +H FP+L  L+   +E 
Sbjct: 137 QDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHALAGPEAET 196

Query: 252 ELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAAC 311
            LR+ G GYRA+Y+  + K +  + GG A WL  LR    EE  +AL  LPGVG KVA C
Sbjct: 197 HLRKLGLGYRARYVRASAKAILEEQGGPA-WLQQLRVAPYEEAHKALCTLPGVGAKVADC 256

Query: 312 VALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVNKYGKYAGW 371
           + L +LD+  A+PVD HVWQIA R     P+ + A+    L N+ +   F N +G YAGW
Sbjct: 257 ICLMALDKPQAVPVDVHVWQIAHRDYGWHPKTSQAKGPSPLANKELGNFFRNLWGPYAGW 316

Query: 372 AQTLLFVADLPQQKALLPASLENTKRKKTTK 394
           AQ +LF ADL Q      +     KRKK +K
Sbjct: 317 AQAVLFSADLRQPSL---SREPPAKRKKGSK 341

BLAST of Cp4.1LG13g04420 vs. Swiss-Prot
Match: OGG1_HUMAN (N-glycosylase/DNA lyase OS=Homo sapiens GN=OGG1 PE=1 SV=2)

HSP 1 Score: 206.5 bits (524), Expect = 5.7e-52
Identity = 125/329 (37.99%), Positives = 185/329 (56.23%), Query Frame = 1

Query: 78  WVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPN--------GDV 137
           W S+   +S+L L L  P+GQ+FRW++ SP H++GV+   + +LT            GD 
Sbjct: 23  WASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTVYRGDK 82

Query: 138 SYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQD 197
           S    S  T     A R     +    ++L+ ++  + + D  F  +++  +G R+LRQD
Sbjct: 83  SQA--SRPTPDELEAVR----KYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 142

Query: 198 PLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGQIGGFDFHEFPSLERLSLVS-EAEL 257
           P+ECL  F+CSSNNNI RIT MV+ +  + G  L Q+    +H FPSL+ L+    EA L
Sbjct: 143 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 202

Query: 258 REAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVA 317
           R+ G GYRA+Y+  + + +  + GG A WL  LR+ + EE  +AL  LPGVG KVA C+ 
Sbjct: 203 RKLGLGYRARYVSASARAILEEQGGLA-WLQQLRESSYEEAHKALCILPGVGTKVADCIC 262

Query: 318 LFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVNKYGKYAGWAQ 377
           L +LD+  A+PVD H+W IA R     P  + A+  +P+    +   F + +G YAGWAQ
Sbjct: 263 LMALDKPQAVPVDVHMWHIAQRDYSWHPTTSQAKGPSPQTNKELGNFFRSLWGPYAGWAQ 322

Query: 378 TLLFVADLPQQKALLPASLENTKRKKTTK 394
            +LF ADL Q +    A     KR+K +K
Sbjct: 323 AVLFSADLRQSR---HAQEPPAKRRKGSK 341

BLAST of Cp4.1LG13g04420 vs. Swiss-Prot
Match: OGG1_DROME (N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2 SV=2)

HSP 1 Score: 174.1 bits (440), Expect = 3.1e-42
Identity = 110/322 (34.16%), Positives = 169/322 (52.48%), Query Frame = 1

Query: 81  LNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSS 140
           + L+  +  L  T   GQ+FRW+     + T   G    +   L   +      +  TSS
Sbjct: 27  IGLSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQEESFITYEAYGTSS 86

Query: 141 SAAAARLALL--DFLNASISLSAIWEVFSAADPRF-DVLSRHLEGARVLRQDPLECLIQF 200
             A    + L  D+L     L    + + + D  F   LS+ +   R+L Q+P E +  F
Sbjct: 87  PLATKDYSSLISDYLRVDFDLKVNQKDWLSKDDNFVKFLSKPV---RLLSQEPFENIFSF 146

Query: 201 LCSSNNNIGRITKMVDYI-SSLGNYLGQIGGFDFHEFPSLERLSLVS----EAELREAGF 260
           LCS NNNI RI+ M+++  ++ G  +G   G D + FP++ R   +      A+LR A F
Sbjct: 147 LCSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKF 206

Query: 261 GYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALFSLD 320
           GYRAK+I  T++E++ K  GG  W +SL+ +  E+  E LT LPG+G KVA C+ L S+ 
Sbjct: 207 GYRAKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLLPGIGYKVADCICLMSMG 266

Query: 321 QHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVNKYGKYAGWAQTLLFVAD 380
              ++PVD H+++IA  Y +P L G + +T K+   V++ F   +GKYAGWAQ +LF AD
Sbjct: 267 HLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSKHFQKLHGKYAGWAQAILFSAD 326

Query: 381 LPQQKALLPASLENTKRKKTTK 394
           L Q +     + +    KK  K
Sbjct: 327 LSQFQNTSTVACKKKSNKKPKK 343

BLAST of Cp4.1LG13g04420 vs. TrEMBL
Match: A0A0A0KIU8_CUCSA (8-oxoguanine DNA glycosylase OS=Cucumis sativus GN=Csa_6G382890 PE=4 SV=1)

HSP 1 Score: 653.3 bits (1684), Expect = 2.0e-184
Identity = 339/401 (84.54%), Positives = 360/401 (89.78%), Query Frame = 1

Query: 1   MPAFSLRHRL-MAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSND 60
           MP+ S +  L M KRL+PTPPSTPS KPSP PPS    PPTPQL HSKPTT SL HSS +
Sbjct: 1   MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPS----PPTPQLSHSKPTTVSLHHSSKN 60

Query: 61  RSKTLTHLVSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLI 120
            +KTL  L SP S +SSNWVSLNLT+SDLSLPLTFPTGQTFRWKQT+P  FTGVVG HLI
Sbjct: 61  PNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPFEFTGVVGSHLI 120

Query: 121 SLTHLPNGDVSYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRH 180
           SL HLPNGDVSYCLH  STSSS AAARLALLDFLNASISLS+IWEVFSAADPRFD L+RH
Sbjct: 121 SLNHLPNGDVSYCLHFSSTSSS-AAARLALLDFLNASISLSSIWEVFSAADPRFDALARH 180

Query: 181 LEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERL 240
            EGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLG +GGFDF+EFPSLERL
Sbjct: 181 FEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERL 240

Query: 241 SLVSEAELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVG 300
           SLVSEAELREAGFGYRAKYIIG V  LKAKP GGAEWLLSLRD  LEEVIEAL+ LPGVG
Sbjct: 241 SLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIEALSTLPGVG 300

Query: 301 PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKY 360
           PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKY
Sbjct: 301 PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKY 360

Query: 361 AGWAQTLLFVADLPQQKALLPASLENTKRKKTTKEQIEKAH 401
           AGWAQTLLF+A+LPQQKALLPA+LENTKRK++TK+Q + AH
Sbjct: 361 AGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAH 396

BLAST of Cp4.1LG13g04420 vs. TrEMBL
Match: M5XQI4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006280mg PE=4 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 4.1e-142
Identity = 269/395 (68.10%), Positives = 307/395 (77.72%), Query Frame = 1

Query: 10  LMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSKTLTHLVS 69
           +++  LRP    +   +P  SPPS   +PPTPQ  + K            R KT+     
Sbjct: 45  MLSLNLRPLSIMSKRQRPIQSPPS---TPPTPQTHNPK------------RPKTILK--- 104

Query: 70  PASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDV 129
                 + WV LNLT+S+LSLPLTFPTGQTFRW+QT PL +TGVVG HL+SL HL NGDV
Sbjct: 105 -----PTKWVPLNLTQSELSLPLTFPTGQTFRWRQTGPLQYTGVVGSHLVSLRHLENGDV 164

Query: 130 SYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQD 189
           S CLH  +TS +   A+LALLDFLN  ISL+ IWEVFSA+D RF  L+ +L GARVLRQD
Sbjct: 165 SCCLHHTTTSETN--AKLALLDFLNVGISLAGIWEVFSASDSRFAELASYLGGARVLRQD 224

Query: 190 PLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAELRE 249
           P+ECLIQFLCSSNNNI RITKMVD++SSLGN+LG +GGF+FHEFPSLERLS+VSE E RE
Sbjct: 225 PVECLIQFLCSSNNNIQRITKMVDFVSSLGNHLGSVGGFEFHEFPSLERLSMVSEEEFRE 284

Query: 250 AGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALF 309
           AGFGYRAKYI GTVK L+ KPGGGAEWLLSLR   LEEVIEAL+ LPGVGPKVAAC+ALF
Sbjct: 285 AGFGYRAKYITGTVKALQLKPGGGAEWLLSLRKTELEEVIEALSTLPGVGPKVAACIALF 344

Query: 310 SLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFV 369
           SLDQHHAIPVDTHVWQIATRYL+PELAGARLTPKLC RVAEAFV+KYGKYAGWAQTLLF+
Sbjct: 345 SLDQHHAIPVDTHVWQIATRYLIPELAGARLTPKLCGRVAEAFVSKYGKYAGWAQTLLFI 404

Query: 370 ADLPQQKALLPASLENTKRKKTTKEQIEKAHTEQD 405
           A+LP QKALLPA   N K  K  K++  K+HT  D
Sbjct: 405 AELPSQKALLPAHFSNAKESKAAKKKDRKSHTAVD 414

BLAST of Cp4.1LG13g04420 vs. TrEMBL
Match: A0A0D2TR00_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G248000 PE=4 SV=1)

HSP 1 Score: 493.4 bits (1269), Expect = 2.6e-136
Identity = 252/378 (66.67%), Positives = 299/378 (79.10%), Query Frame = 1

Query: 16  RPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSKTLTHLVSPASAAS 75
           RP PPS P   P+ +  + PP  P   L  + P  +S +  S+ +               
Sbjct: 3   RPRPPSPPPLSPASTKQTSPPPRPLKSLHPNTPPISSKKPKSHPK--------------- 62

Query: 76  SNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHS 135
             WV LNL++++LSLPLTFPTGQTFRWKQT PL +TG +GPHL+SL HL NGDVSY +H 
Sbjct: 63  --WVPLNLSQTELSLPLTFPTGQTFRWKQTGPLQYTGTIGPHLLSLKHLQNGDVSYFIHF 122

Query: 136 CSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQDPLECLI 195
              + S +AA+LALLDFLN SISL+ +WEVFS  D RF  L+++L+GARVLRQDP+ECL+
Sbjct: 123 ---TPSESAAKLALLDFLNVSISLANLWEVFSENDSRFAELAKYLKGARVLRQDPVECLV 182

Query: 196 QFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAELREAGFGYR 255
           QFLCSSNNNIGRITKMVD+ISSLG +LG +GGFDFHEFPSLERLS VSE ELR+AGFGYR
Sbjct: 183 QFLCSSNNNIGRITKMVDFISSLGTHLGSVGGFDFHEFPSLERLSAVSEEELRQAGFGYR 242

Query: 256 AKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALFSLDQHH 315
           AKYI GTV  L++KP GGA+WLLSLR L L+E I+AL +LPGVGPKVAAC+ALFSLDQHH
Sbjct: 243 AKYITGTVDVLQSKPDGGAQWLLSLRKLDLQEAIDALCSLPGVGPKVAACIALFSLDQHH 302

Query: 316 AIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQ 375
           AIPVDTHVWQIAT+YL+PELAGARLTPKLC+RVAEAFV+KYG+YAGWAQTLLF+ADLP Q
Sbjct: 303 AIPVDTHVWQIATKYLLPELAGARLTPKLCSRVAEAFVSKYGEYAGWAQTLLFIADLPSQ 360

Query: 376 KALLPASLENTKRKKTTK 394
           KALLP+   + K KK+ K
Sbjct: 363 KALLPSHFWDIKEKKSAK 360

BLAST of Cp4.1LG13g04420 vs. TrEMBL
Match: W9R755_9ROSA (N-glycosylase/DNA lyase OS=Morus notabilis GN=L484_013377 PE=4 SV=1)

HSP 1 Score: 492.7 bits (1267), Expect = 4.4e-136
Identity = 262/395 (66.33%), Positives = 305/395 (77.22%), Query Frame = 1

Query: 11  MAKRLR--PTPPSTPSAKPSPSPPSLPPSPP-TPQLFHSKPTTASLRHSSNDRSKTLTHL 70
           M+KRL+  P PP  P   P+P P ++PP PP TPQ    K    + RH S+         
Sbjct: 1   MSKRLKSKPIPPPPP---PTPQPKTIPPPPPRTPQ---PKTIPKTHRHHSSK-------- 60

Query: 71  VSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTH-LPN 130
             P       WV LNL +S+LSLPLTFPTGQTFRW++T PL +TG VGPHL+SL H   N
Sbjct: 61  -IPVDPTKWAWVPLNLPQSELSLPLTFPTGQTFRWRKTGPLQYTGAVGPHLVSLKHDASN 120

Query: 131 GDVSYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVL 190
           GDVS+CLH    + S A A  AL DFLNASISL+ +WEVFSA+D RF  L+RHL GARVL
Sbjct: 121 GDVSFCLHR---TPSEAEAESALRDFLNASISLAEMWEVFSASDSRFAELARHLGGARVL 180

Query: 191 RQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAE 250
           RQDP ECL+QFLCSSNNNIGRITKMVD++SSLGNYLG + GFDFHEFPS+ERLS +SE E
Sbjct: 181 RQDPFECLVQFLCSSNNNIGRITKMVDFVSSLGNYLGTVEGFDFHEFPSMERLSTLSEQE 240

Query: 251 LREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACV 310
            R+AGFGYRAKYI GTVK+L++K GGG EWLLSLRD  LE+VI AL+ LPGVGPKVAAC+
Sbjct: 241 FRDAGFGYRAKYITGTVKKLQSKDGGGEEWLLSLRDSELEDVIYALSTLPGVGPKVAACI 300

Query: 311 ALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTL 370
           ALFSLDQHHAIPVDTHVWQIA RYL+PELAGA LTPKLC+RVAEAFVNK+GKYAGWAQT+
Sbjct: 301 ALFSLDQHHAIPVDTHVWQIAIRYLLPELAGAHLTPKLCSRVAEAFVNKFGKYAGWAQTM 360

Query: 371 LFVADLPQQKALLPASLENTKRKKTTKEQIEKAHT 402
           LF+A+LP QKA+LP+   N  RKK+ K +  +A T
Sbjct: 361 LFIAELPSQKAMLPSHFSNANRKKSIKRKNVEADT 377

BLAST of Cp4.1LG13g04420 vs. TrEMBL
Match: A0A0B0N3V4_GOSAR (N-glycosylase/DNA lyase OS=Gossypium arboreum GN=F383_08417 PE=4 SV=1)

HSP 1 Score: 491.5 bits (1264), Expect = 9.8e-136
Identity = 257/391 (65.73%), Positives = 304/391 (77.75%), Query Frame = 1

Query: 5   SLRHRLMAKRLRPT--PPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSK 64
           SL+  L  KR RP   PP +P++    +PP  PP PP   L  + P  +S +  S+ +  
Sbjct: 3   SLKPPLAMKRPRPPSPPPLSPASTKHTTPP--PPPPPVKSLHPNTPPISSKKPKSHPK-- 62

Query: 65  TLTHLVSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLT 124
                          WV LNL++++LSLPLTFPTGQTFRWKQT PL +TG +GPHL+SL 
Sbjct: 63  ---------------WVPLNLSQTELSLPLTFPTGQTFRWKQTGPLQYTGTIGPHLLSLK 122

Query: 125 HLPNGDVSYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEG 184
           HL NGDVSY +H    + S +AA+LALLDFLN  ISL+ +WEVFS  D RF  L++ L+G
Sbjct: 123 HLQNGDVSYFIHF---TPSESAAKLALLDFLNVGISLAKLWEVFSENDSRFAKLAKCLKG 182

Query: 185 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLV 244
           ARVLRQDP+ECL+QFLCSSNNNIGRITKMVD+ISSLG +LG +GGFDFHEFPSLERLS V
Sbjct: 183 ARVLRQDPVECLVQFLCSSNNNIGRITKMVDFISSLGTHLGSVGGFDFHEFPSLERLSAV 242

Query: 245 SEAELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKV 304
           SE ELR+AGFGYRAKYI GTV  L++KP GGA+WLLSLR L L+E I+AL  LPGVGPKV
Sbjct: 243 SEEELRQAGFGYRAKYITGTVDVLQSKPDGGAQWLLSLRKLDLQEAIDALCTLPGVGPKV 302

Query: 305 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGW 364
           AAC+ALFSLDQHHAIPVDTHVWQIAT+YL+PELAGARLTPKLC+RVAEAFV+KYG+YAGW
Sbjct: 303 AACIALFSLDQHHAIPVDTHVWQIATKYLLPELAGARLTPKLCSRVAEAFVSKYGEYAGW 362

Query: 365 AQTLLFVADLPQQKALLPASLENTKRKKTTK 394
           AQTLLF+ADLP QKALLP+   + K KK+ K
Sbjct: 363 AQTLLFIADLPSQKALLPSHFWDIKEKKSAK 371

BLAST of Cp4.1LG13g04420 vs. TAIR10
Match: AT1G21710.1 (AT1G21710.1 8-oxoguanine-DNA glycosylase 1)

HSP 1 Score: 433.7 bits (1114), Expect = 1.2e-121
Identity = 228/358 (63.69%), Positives = 267/358 (74.58%), Query Frame = 1

Query: 23  PSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSKTLTHLVSPASAASSNWVSLN 82
           P+++PS S    PP  P        P T  L+   +                +  W  L 
Sbjct: 6   PTSQPSISSTVKPPLSP--------PVTPILKQKLH-------------RTGTPKWFPLK 65

Query: 83  LTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGD-VSYCLHSCSTSSS 142
           LT ++L+LPLTFPTGQTFRWK+T  + ++G +GPHL+SL   P  D VSYC+H CSTS  
Sbjct: 66  LTHTELTLPLTFPTGQTFRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVH-CSTSPK 125

Query: 143 AAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQDPLECLIQFLCSS 202
             +A LALLDFLNA ISL+ +W  FS  DPRF  L+RHL GARVLRQDPLECLIQFLCSS
Sbjct: 126 --SAELALLDFLNAEISLAELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSS 185

Query: 203 NNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIG 262
           NNNI RITKMVD++SSLG +LG I GF+FH+FPSL+RLS VSE E R+AGFGYRAKYI G
Sbjct: 186 NNNIARITKMVDFVSSLGLHLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYRAKYITG 245

Query: 263 TVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALFSLDQHHAIPVDT 322
           TV  L+AKPGGG EWLLSLR + L+E + AL  LPGVGPKVAAC+ALFSLDQH AIPVDT
Sbjct: 246 TVNALQAKPGGGNEWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDT 305

Query: 323 HVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALL 380
           HVWQIAT YL+P+LAGA+LTPKL  RVAEAFV+KYG+YAGWAQTLLF+A+LP QK LL
Sbjct: 306 HVWQIATNYLLPDLAGAKLTPKLHGRVAEAFVSKYGEYAGWAQTLLFIAELPAQKTLL 339

BLAST of Cp4.1LG13g04420 vs. TAIR10
Match: AT3G47830.1 (AT3G47830.1 DNA glycosylase superfamily protein)

HSP 1 Score: 53.5 bits (127), Expect = 3.5e-07
Identity = 32/69 (46.38%), Positives = 42/69 (60.87%), Query Frame = 1

Query: 280 LRDLALEEVIEALTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGA 339
           LR L++EEV   L+   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A  
Sbjct: 175 LRGLSVEEVKTELSHFKGVGPKTVSCVLMFNL-QHNDFPVDTHVFEIAKALGWVPKTADR 234

Query: 340 RLTPKLCNR 348
             T    NR
Sbjct: 235 NKTYVHLNR 242

BLAST of Cp4.1LG13g04420 vs. NCBI nr
Match: gi|659133458|ref|XP_008466739.1| (PREDICTED: N-glycosylase/DNA lyase OGG1 [Cucumis melo])

HSP 1 Score: 656.0 bits (1691), Expect = 4.3e-185
Identity = 340/402 (84.58%), Positives = 362/402 (90.05%), Query Frame = 1

Query: 1   MPAFSLRHRL-MAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSND 60
           MP+ S +  L M KR +PT PSTPS KPSP PPS    PPTPQL HSKPTT S+ HSS +
Sbjct: 1   MPSLSFKPLLLMTKRFKPTTPSTPSTKPSPPPPS----PPTPQLSHSKPTTVSIHHSSKN 60

Query: 61  RSKTLTHLVSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLI 120
            +KTLT L SP S +SSNWVSLNLT+SDLSLPLTFPTGQTFRWKQT+PL FTGVVG HLI
Sbjct: 61  PNKTLTLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLI 120

Query: 121 SLTHLPNGDVSYCLHSCSTS-SSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSR 180
           SL HLPNG+VSYCLH  STS SS+AAARLALLDFLNA ISLS+IWEVFSAADPRFD L+R
Sbjct: 121 SLNHLPNGEVSYCLHFSSTSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALAR 180

Query: 181 HLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLER 240
           HLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLG +GGFDFHEFPSLER
Sbjct: 181 HLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLER 240

Query: 241 LSLVSEAELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGV 300
           LSLVSEAELREAGFGYRAKYIIGTV  LKAKPGGGAEWLLSLRD  LEEVI AL+ LPGV
Sbjct: 241 LSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIGALSTLPGV 300

Query: 301 GPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGK 360
           GPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGK
Sbjct: 301 GPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGK 360

Query: 361 YAGWAQTLLFVADLPQQKALLPASLENTKRKKTTKEQIEKAH 401
           YAGWAQTLLFVA+LPQQKALLPA+LENTKRK++TK+Q + AH
Sbjct: 361 YAGWAQTLLFVAELPQQKALLPATLENTKRKRSTKQQRDMAH 398

BLAST of Cp4.1LG13g04420 vs. NCBI nr
Match: gi|778715625|ref|XP_004149809.2| (PREDICTED: N-glycosylase/DNA lyase OGG1 [Cucumis sativus])

HSP 1 Score: 653.3 bits (1684), Expect = 2.8e-184
Identity = 339/401 (84.54%), Positives = 360/401 (89.78%), Query Frame = 1

Query: 1   MPAFSLRHRL-MAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSND 60
           MP+ S +  L M KRL+PTPPSTPS KPSP PPS    PPTPQL HSKPTT SL HSS +
Sbjct: 1   MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPS----PPTPQLSHSKPTTVSLHHSSKN 60

Query: 61  RSKTLTHLVSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLI 120
            +KTL  L SP S +SSNWVSLNLT+SDLSLPLTFPTGQTFRWKQT+P  FTGVVG HLI
Sbjct: 61  PNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPFEFTGVVGSHLI 120

Query: 121 SLTHLPNGDVSYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRH 180
           SL HLPNGDVSYCLH  STSSS AAARLALLDFLNASISLS+IWEVFSAADPRFD L+RH
Sbjct: 121 SLNHLPNGDVSYCLHFSSTSSS-AAARLALLDFLNASISLSSIWEVFSAADPRFDALARH 180

Query: 181 LEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERL 240
            EGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLG +GGFDF+EFPSLERL
Sbjct: 181 FEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERL 240

Query: 241 SLVSEAELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVG 300
           SLVSEAELREAGFGYRAKYIIG V  LKAKP GGAEWLLSLRD  LEEVIEAL+ LPGVG
Sbjct: 241 SLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIEALSTLPGVG 300

Query: 301 PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKY 360
           PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKY
Sbjct: 301 PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKY 360

Query: 361 AGWAQTLLFVADLPQQKALLPASLENTKRKKTTKEQIEKAH 401
           AGWAQTLLF+A+LPQQKALLPA+LENTKRK++TK+Q + AH
Sbjct: 361 AGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAH 396

BLAST of Cp4.1LG13g04420 vs. NCBI nr
Match: gi|645225491|ref|XP_008219604.1| (PREDICTED: N-glycosylase/DNA lyase OGG1 [Prunus mume])

HSP 1 Score: 515.8 bits (1327), Expect = 7.0e-143
Identity = 274/405 (67.65%), Positives = 312/405 (77.04%), Query Frame = 1

Query: 1   MPAFSLRH-RLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSND 60
           M + +LRH  +M+KR RP            SPPS   +PPTPQ  + K            
Sbjct: 1   MLSLNLRHLSIMSKRQRPIQ----------SPPS---TPPTPQTHNPK------------ 60

Query: 61  RSKTLTHLVSPASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLI 120
           R KT+           + WV LNLT+S+LSLPLTFPTGQTFRW+QT PL +TGVVG HL+
Sbjct: 61  RPKTILK--------PTKWVPLNLTQSELSLPLTFPTGQTFRWRQTGPLQYTGVVGSHLV 120

Query: 121 SLTHLPNGDVSYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRH 180
           SL HL NG+VSYCLH  +TS +   A+LALLDFLN  ISL+ IWEVFSA+D RF  L+ +
Sbjct: 121 SLRHLENGNVSYCLHHTTTSETN--AKLALLDFLNVGISLAGIWEVFSASDSRFAELASY 180

Query: 181 LEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERL 240
           L GARVLRQDP+ECLIQFLCSSNNNI RITKMVD++SSLGN+LG +GGF+FHEFPSLERL
Sbjct: 181 LGGARVLRQDPIECLIQFLCSSNNNIQRITKMVDFVSSLGNHLGSVGGFEFHEFPSLERL 240

Query: 241 SLVSEAELREAGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVG 300
           S+VSE E REAGFGYRAKYI GTVK L+ KPGGGAEWLLSLR   LEEVIEAL+ LPGVG
Sbjct: 241 SMVSEKEFREAGFGYRAKYITGTVKALQLKPGGGAEWLLSLRKTELEEVIEALSTLPGVG 300

Query: 301 PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKY 360
           PKVAAC+ALFSLDQHHAIPVDTHVWQIATRYL+PELAGARLTPKLC RVAEAFV+KYGKY
Sbjct: 301 PKVAACIALFSLDQHHAIPVDTHVWQIATRYLIPELAGARLTPKLCGRVAEAFVSKYGKY 360

Query: 361 AGWAQTLLFVADLPQQKALLPASLENTKRKKTTKEQIEKAHTEQD 405
           AGWAQTLLF+A+LP QKALLPA   N K  K  K++  K+HT  D
Sbjct: 361 AGWAQTLLFIAELPSQKALLPAHFWNAKESKAAKKKDRKSHTAVD 370

BLAST of Cp4.1LG13g04420 vs. NCBI nr
Match: gi|596143735|ref|XP_007222599.1| (hypothetical protein PRUPE_ppa006280mg [Prunus persica])

HSP 1 Score: 512.7 bits (1319), Expect = 5.9e-142
Identity = 269/395 (68.10%), Positives = 307/395 (77.72%), Query Frame = 1

Query: 10  LMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSKTLTHLVS 69
           +++  LRP    +   +P  SPPS   +PPTPQ  + K            R KT+     
Sbjct: 45  MLSLNLRPLSIMSKRQRPIQSPPS---TPPTPQTHNPK------------RPKTILK--- 104

Query: 70  PASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDV 129
                 + WV LNLT+S+LSLPLTFPTGQTFRW+QT PL +TGVVG HL+SL HL NGDV
Sbjct: 105 -----PTKWVPLNLTQSELSLPLTFPTGQTFRWRQTGPLQYTGVVGSHLVSLRHLENGDV 164

Query: 130 SYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQD 189
           S CLH  +TS +   A+LALLDFLN  ISL+ IWEVFSA+D RF  L+ +L GARVLRQD
Sbjct: 165 SCCLHHTTTSETN--AKLALLDFLNVGISLAGIWEVFSASDSRFAELASYLGGARVLRQD 224

Query: 190 PLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAELRE 249
           P+ECLIQFLCSSNNNI RITKMVD++SSLGN+LG +GGF+FHEFPSLERLS+VSE E RE
Sbjct: 225 PVECLIQFLCSSNNNIQRITKMVDFVSSLGNHLGSVGGFEFHEFPSLERLSMVSEEEFRE 284

Query: 250 AGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALF 309
           AGFGYRAKYI GTVK L+ KPGGGAEWLLSLR   LEEVIEAL+ LPGVGPKVAAC+ALF
Sbjct: 285 AGFGYRAKYITGTVKALQLKPGGGAEWLLSLRKTELEEVIEALSTLPGVGPKVAACIALF 344

Query: 310 SLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFV 369
           SLDQHHAIPVDTHVWQIATRYL+PELAGARLTPKLC RVAEAFV+KYGKYAGWAQTLLF+
Sbjct: 345 SLDQHHAIPVDTHVWQIATRYLIPELAGARLTPKLCGRVAEAFVSKYGKYAGWAQTLLFI 404

Query: 370 ADLPQQKALLPASLENTKRKKTTKEQIEKAHTEQD 405
           A+LP QKALLPA   N K  K  K++  K+HT  D
Sbjct: 405 AELPSQKALLPAHFSNAKESKAAKKKDRKSHTAVD 414

BLAST of Cp4.1LG13g04420 vs. NCBI nr
Match: gi|658006978|ref|XP_008338666.1| (PREDICTED: N-glycosylase/DNA lyase OGG1 [Malus domestica])

HSP 1 Score: 508.4 bits (1308), Expect = 1.1e-140
Identity = 264/392 (67.35%), Positives = 306/392 (78.06%), Query Frame = 1

Query: 10  LMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTASLRHSSNDRSKTLTHLVS 69
           +++  LR     +   +P  SPPS   +P TPQ  +SK     L                
Sbjct: 1   MLSLHLRSLSTMSKRQRPINSPPS---TPQTPQTHNSKRPKLPL---------------- 60

Query: 70  PASAASSNWVSLNLTKSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDV 129
                ++ WV LNLT+S+LSLPLTFPTGQTFRW+QT PL +TGVVG HL+SL HLPNGDV
Sbjct: 61  ---VPTTKWVPLNLTQSELSLPLTFPTGQTFRWRQTGPLQYTGVVGCHLVSLEHLPNGDV 120

Query: 130 SYCLHSCSTSSSAAAARLALLDFLNASISLSAIWEVFSAADPRFDVLSRHLEGARVLRQD 189
           SYCLHS +TSS    A  ALLDFLN  ISL+ +WEVFSA+D RF  L+ +L GARVLRQD
Sbjct: 121 SYCLHS-TTSSERGLAEAALLDFLNMGISLAGMWEVFSASDSRFAELAGYLGGARVLRQD 180

Query: 190 PLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGQIGGFDFHEFPSLERLSLVSEAELRE 249
           P+ECL+QFLCSSNNNI RITKMVD++SSLGN+LG +GGF+FHEFPSLERLS+VSE E RE
Sbjct: 181 PVECLVQFLCSSNNNIQRITKMVDFVSSLGNHLGSVGGFEFHEFPSLERLSMVSEKEFRE 240

Query: 250 AGFGYRAKYIIGTVKELKAKPGGGAEWLLSLRDLALEEVIEALTALPGVGPKVAACVALF 309
           AGFGYRAKYI GTVK L+ KPGGGAEWLLSLR + LEEVIEAL+ LPGVGPKVAAC+ALF
Sbjct: 241 AGFGYRAKYITGTVKALQLKPGGGAEWLLSLRKMELEEVIEALSTLPGVGPKVAACIALF 300

Query: 310 SLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFV 369
           SLDQHHAIPVDTHVWQIATRYL+PELAGARLTPKLC RVAEAFV+KYGKYAGWAQT+LF+
Sbjct: 301 SLDQHHAIPVDTHVWQIATRYLIPELAGARLTPKLCVRVAEAFVSKYGKYAGWAQTVLFI 360

Query: 370 ADLPQQKALLPASLENTKRKKTTKEQIEKAHT 402
           A+LP QKALLPA   + K  K TK++  ++HT
Sbjct: 361 AELPSQKALLPAHFTSAKESKATKKKDRESHT 369
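
The percentages on the HSP summary lines above follow directly from the reported counts, and the counts themselves can be read off the alignment midline, where a letter marks an identical residue and a `+` marks a conservative substitution. The following is a minimal sketch of that arithmetic in Python; the function names `midline_counts` and `percent` are hypothetical helpers introduced for illustration and are not part of any BLAST tool.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST midline string.

    Letters in the midline are identical residues; '+' marks a
    conservative substitution. Positives = identities + '+' positions.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


def percent(count: int, aligned_length: int) -> float:
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / aligned_length, 2)


# Counts taken from the two HSP summary lines in this report.
print(percent(269, 395), percent(307, 395))  # Prunus persica hit
print(percent(264, 392), percent(306, 392))  # Malus domestica hit

# Midline of the last alignment block of the Malus domestica hit.
last_block = "A+LP QKALLPA   + K  K TK++  ++HT"
print(midline_counts(last_block))
```

Summing `midline_counts` over every block of an HSP should reproduce the Identity and Positives numerators, provided the midline whitespace has survived extraction intact.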

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
OGG1_ARATH	2.2e-120	63.69	N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1 SV=1
OGG1_RAT	2.3e-53	38.79	N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
OGG1_MOUSE	3.9e-53	39.27	N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
OGG1_HUMAN	5.7e-52	37.99	N-glycosylase/DNA lyase OS=Homo sapiens GN=OGG1 PE=1 SV=2
OGG1_DROME	3.1e-42	34.16	N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2 SV=2

Match Name	E-value	Identity	Description
A0A0A0KIU8_CUCSA	2.0e-184	84.54	8-oxoguanine DNA glycosylase OS=Cucumis sativus GN=Csa_6G382890 PE=4 SV=1
M5XQI4_PRUPE	4.1e-142	68.10	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006280mg PE=4 SV=1
A0A0D2TR00_GOSRA	2.6e-136	66.67	Uncharacterized protein OS=Gossypium raimondii GN=B456_009G248000 PE=4 SV=1
W9R755_9ROSA	4.4e-136	66.33	N-glycosylase/DNA lyase OS=Morus notabilis GN=L484_013377 PE=4 SV=1
A0A0B0N3V4_GOSAR	9.8e-136	65.73	N-glycosylase/DNA lyase OS=Gossypium arboreum GN=F383_08417 PE=4 SV=1

Match Name	E-value	Identity	Description
AT1G21710.1	1.2e-121	63.69	8-oxoguanine-DNA glycosylase 1
AT3G47830.1	3.5e-07	46.38	DNA glycosylase superfamily protein

Match Name	E-value	Identity	Description
gi|659133458|ref|XP_008466739.1|	4.3e-185	84.58	PREDICTED: N-glycosylase/DNA lyase OGG1 [Cucumis melo]
gi|778715625|ref|XP_004149809.2|	2.8e-184	84.54	PREDICTED: N-glycosylase/DNA lyase OGG1 [Cucumis sativus]
gi|645225491|ref|XP_008219604.1|	7.0e-143	67.65	PREDICTED: N-glycosylase/DNA lyase OGG1 [Prunus mume]
gi|596143735|ref|XP_007222599.1|	5.9e-142	68.10	hypothetical protein PRUPE_ppa006280mg [Prunus persica]
gi|658006978|ref|XP_008338666.1|	1.1e-140	67.35	PREDICTED: N-glycosylase/DNA lyase OGG1 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008534	oxidized purine nucleobase lesion DNA N-glycosylase activity
GO:0003684	damaged DNA binding
GO:0003824	catalytic activity

Vocabulary: Biological Process
Term	Definition
GO:0006289	nucleotide-excision repair
GO:0006281	DNA repair
GO:0006284	base-excision repair

Vocabulary: INTERPRO
Term	Definition
IPR023170	HTH_base_excis_C
IPR012904	OGG_N
IPR011257	DNA_glycosylase
IPR003265	HhH-GPD_domain

GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006285	base-excision repair, AP site formation
biological_process	GO:0006308	DNA catabolic process
biological_process	GO:0006289	nucleotide-excision repair
biological_process	GO:0006306	DNA methylation
biological_process	GO:0006284	base-excision repair
biological_process	GO:0006281	DNA repair
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0003684	damaged DNA binding
molecular_function	GO:0008534	oxidized purine nucleobase lesion DNA N-glycosylase activity
molecular_function	GO:0003824	catalytic activity
molecular_function	GO:0016829	lyase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG13g04420.1	Cp4.1LG13g04420.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003265	HhH-GPD domain	PFAM	PF00730	HhH-GPD	coord: 195..331; score: 1.6
IPR003265	HhH-GPD domain	SMART	SM00478	endo3end	coord: 199..372; score: 7.0
IPR011257	DNA glycosylase	GENE3D	G3DSA:1.10.340.30		coord: 189..304; score: 2.6
IPR011257	DNA glycosylase	unknown	SSF48150	DNA-glycosylase	coord: 189..371; score: 1.04
IPR012904	8-oxoguanine DNA glycosylase, N-terminal	PFAM	PF07934	OGG_N	coord: 80..194; score: 1.1
IPR023170	Helix-turn-helix, base-excision DNA repair, C-terminal	GENE3D	G3DSA:1.10.1670.10		coord: 305..372; score: 1.3
None	No IPR available	GENE3D	G3DSA:3.30.310.40		coord: 71..142; score: 1.1
None	No IPR available	PANTHER	PTHR10242	N-GLYCOSYLASE/DNA LYASE	coord: 6..404; score: 1.3E
None	No IPR available	PANTHER	PTHR10242:SF2	N-GLYCOSYLASE/DNA LYASE	coord: 6..404; score: 1.3E
None	No IPR available	unknown	SSF55945	TATA-box binding protein-like	coord: 74..188; score: 6.43

The following gene(s) are paralogous to this gene:

None