Tan0005606 (gene) Snake gourd v1

Overview
NameTan0005606
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-(apurinic or apyrimidinic site) lyase
LocationLG11: 10677925 .. 10679847 (+)
RNA-Seq ExpressionTan0005606
SyntenyTan0005606
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCACTCATTCTCATCGAGACCCCTTCTAATGTCGAAGAGGCTCAGACCCACTCCACCCTCCACTCCCTCCGCCAAGCCGCCGTCATCGCCGCCGCCTCCTCCGACTCCTCAACCCTCCCATTCAAAGCCCACCACCGTCTCCATTCACTATTCATCCAACAACCCCCCAAAAACGCTAACCCTCCTCAATTCCCCATCATCCTCCAACTGGGTCCCCCTCAATCTCTCCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCGCCCCTCTTCACTTCACCGGCGTCGTTGCCTCTCATCTCATCTCTCTCAAGCACCTTCCAAACGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACATCCTCCGTCGCCGCCGCCGCCAGATTGACCTTGCTTGACTTCCTTAACGCCGGCATCTCCCTCAGTGCCATTTGGGAGGTGTTCTCGGCGGCTGATCCGAGATTCGATGGCTTGGCTCGCCATTTGGAGGGTGCTCGAGTTCTCAGGCAAGACCCACTTGAGTGTTTGATTCAGTTTTTGTGTTCTTCGAATAACAATATTGGGAGAATCACGAAAATGGTGGATTATATCTCATCGCTAGGGAATTATTTGGGCAATGTTGGGGGCTTCGATTTCCATGAGTTCCCCTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGTTTTGGTTACAGGTTAATCTCTCTATTATCTAGCTAACTTTCTCCTCTTAAACTAGCACTAAATCCAAAAGTCATTGGTTTGAATCTTTATTATGATTCATGAAATCTAAAACAAGCCCTATATGTTGGGATGGGAACACTTCCCACATGTAAGGAAGGAAAATGTCAATGTTGTTGTATAATTTATGAGACCTTTTGGGTGAAACCAAAGTAAAGTTATGGCAGCTTAGGCCAAACGTGGAAAATGTATGGAAAATATGAGAGCTCCATTCTGCCAACACTATCAATTAATCTAAAAACTTAAGTGGATAGTTTAATCTTTTATATTAGCAAGATTCCCTCTTACTTATTTCAATGTGGTTGGTTTTATTGATAGTTTCTTGTGAATAAGCAGGGCTAAATACATAATTGGCACTGTAAATGCTTTAGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTATCTCTTCGTGATTTGGATCTTGAAGAGGTGATTGATGCCCTTTCCACTTTACCCGGTGTCGGTCCGAAGGTGGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCATCATGCCATTCCTGTTGACACACACGTGTGGCAGGTATTATTATCCCTTTCAAATGAATTGAAATGATTTAAGTTTCATTCTTATGATCCAAATGAGCGTTGGCTGGTCAAAATTCTCATCCTTGTCTTCTTTGGAACCTGAAAAGAGGTTACTGCTTATAGAGAAAAGTTTGTATTAAAATGGATAGCAGTGGACATATCCTACCTTTTGATTGTGATTCTGTTTTGGTGTTAAGAAATTTTCCATGCATTTGAGTCAATTCTTCATTCAGCTCAATAACAGAGAGTCATTAAATGGTGGTTGTCTTGTTTTAGTGGTACTGATTCTGATTTCATTGAAAGTTGTTTGAAAGATTTCTTCTTGAAAAGATTGCTACTAGATACCTTGTCCCTGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTTTGCAACCGTGTGGCTGAGGCATTTGTCAGCAAGTATGGCAAATATGCTGGTTGGGCTCAAACGCTGCTTTTCGTTGCTGATTTACCTCAACAGAAGGCCCTCTTACCATCGAAGCTCGAGAATAGGAAAAGGAAAAAATCTACAAAGCAGCAGAAAGAAAAGGAACAGACTGGTAATATAGATCGATGTGAATAG

mRNA sequence

ATGCACTCATTCTCATCGAGACCCCTTCTAATGTCGAAGAGGCTCAGACCCACTCCACCCTCCACTCCCTCCGCCAAGCCGCCGTCATCGCCGCCGCCTCCTCCGACTCCTCAACCCTCCCATTCAAAGCCCACCACCGTCTCCATTCACTATTCATCCAACAACCCCCCAAAAACGCTAACCCTCCTCAATTCCCCATCATCCTCCAACTGGGTCCCCCTCAATCTCTCCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCGCCCCTCTTCACTTCACCGGCGTCGTTGCCTCTCATCTCATCTCTCTCAAGCACCTTCCAAACGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACATCCTCCGTCGCCGCCGCCGCCAGATTGACCTTGCTTGACTTCCTTAACGCCGGCATCTCCCTCAGTGCCATTTGGGAGGTGTTCTCGGCGGCTGATCCGAGATTCGATGGCTTGGCTCGCCATTTGGAGGGTGCTCGAGTTCTCAGGCAAGACCCACTTGAGTGTTTGATTCAGTTTTTGTGTTCTTCGAATAACAATATTGGGAGAATCACGAAAATGGTGGATTATATCTCATCGCTAGGGAATTATTTGGGCAATGTTGGGGGCTTCGATTTCCATGAGTTCCCCTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGTTTTGGTTACAGGGCTAAATACATAATTGGCACTGTAAATGCTTTAGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTATCTCTTCGTGATTTGGATCTTGAAGAGGTGATTGATGCCCTTTCCACTTTACCCGGTGTCGGTCCGAAGGTGGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCATCATGCCATTCCTGTTGACACACACGTGTGGCAGATTGCTACTAGATACCTTGTCCCTGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTTTGCAACCGTGTGGCTGAGGCATTTGTCAGCAAGTATGGCAAATATGCTGGTTGGGCTCAAACGCTGCTTTTCGTTGCTGATTTACCTCAACAGAAGGCCCTCTTACCATCGAAGCTCGAGAATAGGAAAAGGAAAAAATCTACAAAGCAGCAGAAAGAAAAGGAACAGACTGGTAATATAGATCGATGTGAATAG

Coding sequence (CDS)

ATGCACTCATTCTCATCGAGACCCCTTCTAATGTCGAAGAGGCTCAGACCCACTCCACCCTCCACTCCCTCCGCCAAGCCGCCGTCATCGCCGCCGCCTCCTCCGACTCCTCAACCCTCCCATTCAAAGCCCACCACCGTCTCCATTCACTATTCATCCAACAACCCCCCAAAAACGCTAACCCTCCTCAATTCCCCATCATCCTCCAACTGGGTCCCCCTCAATCTCTCCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCGCCCCTCTTCACTTCACCGGCGTCGTTGCCTCTCATCTCATCTCTCTCAAGCACCTTCCAAACGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACATCCTCCGTCGCCGCCGCCGCCAGATTGACCTTGCTTGACTTCCTTAACGCCGGCATCTCCCTCAGTGCCATTTGGGAGGTGTTCTCGGCGGCTGATCCGAGATTCGATGGCTTGGCTCGCCATTTGGAGGGTGCTCGAGTTCTCAGGCAAGACCCACTTGAGTGTTTGATTCAGTTTTTGTGTTCTTCGAATAACAATATTGGGAGAATCACGAAAATGGTGGATTATATCTCATCGCTAGGGAATTATTTGGGCAATGTTGGGGGCTTCGATTTCCATGAGTTCCCCTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGTTTTGGTTACAGGGCTAAATACATAATTGGCACTGTAAATGCTTTAGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTATCTCTTCGTGATTTGGATCTTGAAGAGGTGATTGATGCCCTTTCCACTTTACCCGGTGTCGGTCCGAAGGTGGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCATCATGCCATTCCTGTTGACACACACGTGTGGCAGATTGCTACTAGATACCTTGTCCCTGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTTTGCAACCGTGTGGCTGAGGCATTTGTCAGCAAGTATGGCAAATATGCTGGTTGGGCTCAAACGCTGCTTTTCGTTGCTGATTTACCTCAACAGAAGGCCCTCTTACCATCGAAGCTCGAGAATAGGAAAAGGAAAAAATCTACAAAGCAGCAGAAAGAAAAGGAACAGACTGGTAATATAGATCGATGTGAATAG

Protein sequence

MHSFSSRPLLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDRCE
Homology
BLAST of Tan0005606 vs. ExPASy Swiss-Prot
Match: Q9FNY7 (N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana OX=3702 GN=OGG1 PE=1 SV=1)

HSP 1 Score: 451.8 bits (1161), Expect = 7.9e-126
Identity = 232/343 (67.64%), Postives = 271/343 (79.01%), Query Frame = 0

Query: 33  PPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQT 92
           P PT QPS S  +TV    S    P     L+   +  W PL L+ ++L+LPLTFPTGQT
Sbjct: 4   PRPTSQPSIS--STVKPPLSPPVTPILKQKLHRTGTPKWFPLKLTHTELTLPLTFPTGQT 63

Query: 93  FRWKQTAPLHFTGVVASHLISLKHLPNGD-VSYCLHSCSTSSVAAAARLTLLDFLNAGIS 152
           FRWK+T  + ++G +  HL+SL+  P  D VSYC+H CSTS    +A L LLDFLNA IS
Sbjct: 64  FRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVH-CSTS--PKSAELALLDFLNAEIS 123

Query: 153 LSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSL 212
           L+ +W  FS  DPRF  LARHL GARVLRQDPLECLIQFLCSSNNNI RITKMVD++SSL
Sbjct: 124 LAELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSSNNNIARITKMVDFVSSL 183

Query: 213 GNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLL 272
           G +LG++ GF+FH+FPSL+RLS VSE E R+AGFGYRAKYI GTVNAL+AKPGGG EWLL
Sbjct: 184 GLHLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYRAKYITGTVNALQAKPGGGNEWLL 243

Query: 273 SLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGA 332
           SLR ++L+E + AL TLPGVGPKVAAC+ALFSLDQH AIPVDTHVWQIAT YL+P+LAGA
Sbjct: 244 SLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIATNYLLPDLAGA 303

Query: 333 RLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPS 375
           +LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL S
Sbjct: 304 KLTPKLHGRVAEAFVSKYGEYAGWAQTLLFIAELPAQKTLLQS 341

BLAST of Tan0005606 vs. ExPASy Swiss-Prot
Match: O08760 (N-glycosylase/DNA lyase OS=Mus musculus OX=10090 GN=Ogg1 PE=1 SV=2)

HSP 1 Score: 209.9 bits (533), Expect = 5.3e-53
Identity = 132/336 (39.29%), Postives = 189/336 (56.25%), Query Frame = 0

Query: 61  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNG 120
           TL +SP+   W  +   +S+L L L   +GQ+FRWK+ +P H++GV+A  + +L      
Sbjct: 15  TLSSSPAL--WASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTLTQ--TE 74

Query: 121 DVSYCLHSCSTSSVAAAARL----TLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGA 180
           D  YC       S  +   L    TL  +    +SL+ ++  +++ D  F  +A+  +G 
Sbjct: 75  DQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQKFQGV 134

Query: 181 RVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLS-L 240
           R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FP+L  L+  
Sbjct: 135 RLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHALAGP 194

Query: 241 VSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPK 300
            +E  LR+ G GYRA+Y+  +  A+  + GG A WL  LR    EE   AL TLPGVG K
Sbjct: 195 EAETHLRKLGLGYRARYVRASAKAILEEQGGPA-WLQQLRVAPYEEAHKALCTLPGVGAK 254

Query: 301 VAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGK 360
           VA C+ L +LD+  A+PVD HVWQIA R     P+ + A+    L N+ +   F + +G 
Sbjct: 255 VADCICLMALDKPQAVPVDVHVWQIAHRDYGWHPKTSQAKGPSPLANKELGNFFRNLWGP 314

Query: 361 YAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQ 388
           YAGWAQ +LF ADL Q      S+    KRKK +K+
Sbjct: 315 YAGWAQAVLFSADLRQPSL---SREPPAKRKKGSKR 342

BLAST of Tan0005606 vs. ExPASy Swiss-Prot
Match: O70249 (N-glycosylase/DNA lyase OS=Rattus norvegicus OX=10116 GN=Ogg1 PE=2 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 6.9e-53
Identity = 130/336 (38.69%), Postives = 186/336 (55.36%), Query Frame = 0

Query: 61  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNG 120
           TL +SP+   W  +   +S+L L L   +GQ+FRW++ +P H++GV+A  + +L      
Sbjct: 15  TLTSSPAL--WASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQ--TE 74

Query: 121 DVSYCLHSCSTSSVAAAARL----TLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGA 180
           D  YC              L    TL  +    +SL+ ++  +++ D  F  +A+  +G 
Sbjct: 75  DQLYCTVYRGDKGQVGRPTLEELETLHKYFQLDVSLTQLYSHWASVDSHFQSVAQKFQGV 134

Query: 181 RVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLSLV 240
           R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FP+L  L+  
Sbjct: 135 RLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLVQLDDVTYHGFPNLHALAGP 194

Query: 241 S-EAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPK 300
             E  LR+ G GYRA+Y+  +  A+  + GG A WL  LR    EE   AL TLPGVG K
Sbjct: 195 EVETHLRKLGLGYRARYVCASAKAILEEQGGPA-WLQQLRVASYEEAHKALCTLPGVGTK 254

Query: 301 VAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGK 360
           VA C+ L +LD+  A+PVD HVWQIA R     P+ +  +    L N+ +   F + +G 
Sbjct: 255 VADCICLMALDKPQAVPVDIHVWQIAHRDYGWQPKTSQTKGPSPLANKELGNFFRNLWGP 314

Query: 361 YAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQ 388
           YAGWAQ +LF ADL QQ     S+    KRKK +K+
Sbjct: 315 YAGWAQAVLFSADLRQQNL---SREPPAKRKKGSKK 342

BLAST of Tan0005606 vs. ExPASy Swiss-Prot
Match: O15527 (N-glycosylase/DNA lyase OS=Homo sapiens OX=9606 GN=OGG1 PE=1 SV=2)

HSP 1 Score: 204.1 bits (518), Expect = 2.9e-51
Identity = 126/336 (37.50%), Postives = 185/336 (55.06%), Query Frame = 0

Query: 61  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH---- 120
           TL ++P+   W  +   +S+L L L  P+GQ+FRW++ +P H++GV+A  + +L      
Sbjct: 15  TLASTPAL--WASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQ 74

Query: 121 ----LPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARH 180
               +  GD S    S  T     A R     +    ++L+ ++  + + D  F  +A+ 
Sbjct: 75  LHCTVYRGDKSQA--SRPTPDELEAVR----KYFQLDVTLAQLYHHWGSVDSHFQEVAQK 134

Query: 181 LEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLER 240
            +G R+LRQDP+ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FPSL+ 
Sbjct: 135 FQGVRLLRQDPIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQA 194

Query: 241 LSLVS-EAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPG 300
           L+    EA LR+ G GYRA+Y+  +  A+  + GG A WL  LR+   EE   AL  LPG
Sbjct: 195 LAGPEVEAHLRKLGLGYRARYVSASARAILEEQGGLA-WLQQLRESSYEEAHKALCILPG 254

Query: 301 VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVS 360
           VG KVA C+ L +LD+  A+PVD H+W IA R     P  + A+  +P+    +   F S
Sbjct: 255 VGTKVADCICLMALDKPQAVPVDVHMWHIAQRDYSWHPTTSQAKGPSPQTNKELGNFFRS 314

Query: 361 KYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKK 384
            +G YAGWAQ +LF ADL Q +       + RK  K
Sbjct: 315 LWGPYAGWAQAVLFSADLRQSRHAQEPPAKRRKGSK 341

BLAST of Tan0005606 vs. ExPASy Swiss-Prot
Match: Q9V3I8 (N-glycosylase/DNA lyase OS=Drosophila melanogaster OX=7227 GN=Ogg1 PE=2 SV=2)

HSP 1 Score: 171.0 bits (432), Expect = 2.7e-41
Identity = 106/322 (32.92%), Postives = 168/322 (52.17%), Query Frame = 0

Query: 74  LNLSKSDLSLPLTFPTGQTFRWKQTAPLHFT--GVVASHLISLKHLPNGDVSYCLHSCST 133
           + LS  +  L  T   GQ+FRW+     + T  G V  +   +       ++Y  +  S+
Sbjct: 27  IGLSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQEESFITYEAYGTSS 86

Query: 134 SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHL-EGARVLRQDPLECLIQF 193
                     + D+L     L    + + + D   D   + L +  R+L Q+P E +  F
Sbjct: 87  PLATKDYSSLISDYLRVDFDLKVNQKDWLSKD---DNFVKFLSKPVRLLSQEPFENIFSF 146

Query: 194 LCSSNNNIGRITKMVD-YISSLGNYLGNVGGFDFHEFPSLERLSLVS----EAELREAGF 253
           LCS NNNI RI+ M++ + ++ G  +G+  G D + FP++ R   +      A+LR A F
Sbjct: 147 LCSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKF 206

Query: 254 GYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLD 313
           GYRAK+I  T+  ++ K  GG  W +SL+ +  E+  + L+ LPG+G KVA C+ L S+ 
Sbjct: 207 GYRAKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLLPGIGYKVADCICLMSMG 266

Query: 314 QHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLFVAD 373
              ++PVD H+++IA  Y +P L G + +T K+   V++ F   +GKYAGWAQ +LF AD
Sbjct: 267 HLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSKHFQKLHGKYAGWAQAILFSAD 326

Query: 374 LPQ---QKALLPSKLENRKRKK 384
           L Q      +   K  N+K KK
Sbjct: 327 LSQFQNTSTVACKKKSNKKPKK 343

BLAST of Tan0005606 vs. NCBI nr
Match: XP_038885236.1 (N-glycosylase/DNA lyase OGG1 isoform X2 [Benincasa hispida])

HSP 1 Score: 679.5 bits (1752), Expect = 1.8e-191
Identity = 351/399 (87.97%), Postives = 370/399 (92.73%), Query Frame = 0

Query: 3   SFSSRPLLMSKRLRPTPPSTPSAKPP--SSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTL 62
           SF+ +PLLM+KRLRPTPPSTPSAKPP   SPP PPTPQ SHSKPTTVS+HYSS N  KTL
Sbjct: 5   SFNFKPLLMTKRLRPTPPSTPSAKPPPLPSPPSPPTPQLSHSKPTTVSVHYSSKNRNKTL 64

Query: 63  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNG 122
           T   S SS NWV LNL+KS+L+LPLTFPTGQTFRWKQT+PL FTGVV SHLISL HLPN 
Sbjct: 65  T-PQSSSSFNWVSLNLTKSELALPLTFPTGQTFRWKQTSPLQFTGVVGSHLISLNHLPNS 124

Query: 123 DVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLR 182
           DVSYCLHSCSTSS +AAARL LLDFLNAGISLS+IWEVF AADPRFD LARHLEGARVLR
Sbjct: 125 DVSYCLHSCSTSSSSAAARLALLDFLNAGISLSSIWEVFLAADPRFDVLARHLEGARVLR 184

Query: 183 QDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAEL 242
           QDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGN+GGFDF+EFPSLERLSLVSEAEL
Sbjct: 185 QDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNIGGFDFYEFPSLERLSLVSEAEL 244

Query: 243 REAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA 302
           REAGFGYRAKYIIG VNAL+AKPGGGAEWLLSLRDLDLEEVI+ALSTLPGVGPKVAACVA
Sbjct: 245 REAGFGYRAKYIIGAVNALKAKPGGGAEWLLSLRDLDLEEVIEALSTLPGVGPKVAACVA 304

Query: 303 LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLL 362
           LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLL
Sbjct: 305 LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLL 364

Query: 363 FVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDR 400
           FVADLPQQKALLP+ LEN KRK+STK QK+K  TGN+D+
Sbjct: 365 FVADLPQQKALLPANLENAKRKRSTKHQKDKAHTGNVDQ 402

BLAST of Tan0005606 vs. NCBI nr
Match: XP_008466739.1 (PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo])

HSP 1 Score: 673.7 bits (1737), Expect = 9.8e-190
Identity = 353/406 (86.95%), Postives = 369/406 (90.89%), Query Frame = 0

Query: 1   MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKT 60
           M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KT
Sbjct: 1   MPSLSFKPLLLMTKRFKPTTPSTPSTKPSPPPPSPPTPQLSHSKPTTVSIHHSSKNPNKT 60

Query: 61  LTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH 120
           LTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWKQT PL FTGVV SHLISL H
Sbjct: 61  LTLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNH 120

Query: 121 LPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEG 180
           LPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEG
Sbjct: 121 LPNGEVSYCLHFSSTSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHLEG 180

Query: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240
           ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV
Sbjct: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240

Query: 241 SEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV 300
           SEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Sbjct: 241 SEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIGALSTLPGVGPKV 300

Query: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360
           AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW
Sbjct: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360

Query: 361 AQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDRCE 402
           AQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++    GNID+CE
Sbjct: 361 AQTLLFVAELPQQKALLPATLENTKRKRSTKQQRDMAHAGNIDQCE 406

BLAST of Tan0005606 vs. NCBI nr
Match: XP_038885235.1 (N-glycosylase/DNA lyase OGG1 isoform X1 [Benincasa hispida])

HSP 1 Score: 670.6 bits (1729), Expect = 8.3e-189
Identity = 351/411 (85.40%), Postives = 370/411 (90.02%), Query Frame = 0

Query: 3   SFSSRPLLMSKRLRPTPPSTPSAKPP--SSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTL 62
           SF+ +PLLM+KRLRPTPPSTPSAKPP   SPP PPTPQ SHSKPTTVS+HYSS N  KTL
Sbjct: 5   SFNFKPLLMTKRLRPTPPSTPSAKPPPLPSPPSPPTPQLSHSKPTTVSVHYSSKNRNKTL 64

Query: 63  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNG 122
           T   S SS NWV LNL+KS+L+LPLTFPTGQTFRWKQT+PL FTGVV SHLISL HLPN 
Sbjct: 65  T-PQSSSSFNWVSLNLTKSELALPLTFPTGQTFRWKQTSPLQFTGVVGSHLISLNHLPNS 124

Query: 123 DVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLR 182
           DVSYCLHSCSTSS +AAARL LLDFLNAGISLS+IWEVF AADPRFD LARHLEGARVLR
Sbjct: 125 DVSYCLHSCSTSSSSAAARLALLDFLNAGISLSSIWEVFLAADPRFDVLARHLEGARVLR 184

Query: 183 QDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAEL 242
           QDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGN+GGFDF+EFPSLERLSLVSEAEL
Sbjct: 185 QDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNIGGFDFYEFPSLERLSLVSEAEL 244

Query: 243 REAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA 302
           REAGFGYRAKYIIG VNAL+AKPGGGAEWLLSLRDLDLEEVI+ALSTLPGVGPKVAACVA
Sbjct: 245 REAGFGYRAKYIIGAVNALKAKPGGGAEWLLSLRDLDLEEVIEALSTLPGVGPKVAACVA 304

Query: 303 LFSLDQHHAIPVDTHVWQ------------IATRYLVPELAGARLTPKLCNRVAEAFVSK 362
           LFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSK
Sbjct: 305 LFSLDQHHAIPVDTHVWQLIEKFLLAWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSK 364

Query: 363 YGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDR 400
           YGKYAGWAQTLLFVADLPQQKALLP+ LEN KRK+STK QK+K  TGN+D+
Sbjct: 365 YGKYAGWAQTLLFVADLPQQKALLPANLENAKRKRSTKHQKDKAHTGNVDQ 414

BLAST of Tan0005606 vs. NCBI nr
Match: XP_004149809.2 (N-glycosylase/DNA lyase OGG1 [Cucumis sativus] >KGN47701.1 hypothetical protein Csa_018253 [Cucumis sativus])

HSP 1 Score: 666.8 bits (1719), Expect = 1.2e-187
Identity = 348/405 (85.93%), Postives = 366/405 (90.37%), Query Frame = 0

Query: 1   MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKT 60
           M S S +P LLM+KRL+PTPPSTPS KP   PP PPTPQ SHSKPTTVS+H+SS NP KT
Sbjct: 1   MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKT 60

Query: 61  LTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH 120
           L LL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWKQT P  FTGVV SHLISL H
Sbjct: 61  LPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPFEFTGVVGSHLISLNH 120

Query: 121 LPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGA 180
           LPNGDVSYCLH  STSS +AAARL LLDFLNA ISLS+IWEVFSAADPRFD LARH EGA
Sbjct: 121 LPNGDVSYCLHFSSTSS-SAAARLALLDFLNASISLSSIWEVFSAADPRFDALARHFEGA 180

Query: 181 RVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVS 240
           RVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVS
Sbjct: 181 RVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVS 240

Query: 241 EAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA 300
           EAELREAGFGYRAKYIIG VNAL+AKP GGAEWLLSLRD DLEEVI+ALSTLPGVGPKVA
Sbjct: 241 EAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIEALSTLPGVGPKVA 300

Query: 301 ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWA 360
           ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWA
Sbjct: 301 ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWA 360

Query: 361 QTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDRCE 402
           QTLLF+A+LPQQKALLP+ LEN KRK+STKQQK+    GNID+CE
Sbjct: 361 QTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNIDQCE 404

BLAST of Tan0005606 vs. NCBI nr
Match: XP_016903621.1 (PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo] >TYK08955.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 661.8 bits (1706), Expect = 3.9e-186
Identity = 347/395 (87.85%), Postives = 362/395 (91.65%), Query Frame = 0

Query: 1   MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKT 60
           M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KT
Sbjct: 1   MPSLSFKPLLLMTKRFKPTTPSTPSTKPSPPPPSPPTPQLSHSKPTTVSIHHSSKNPNKT 60

Query: 61  LTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH 120
           LTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWKQT PL FTGVV SHLISL H
Sbjct: 61  LTLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNH 120

Query: 121 LPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEG 180
           LPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEG
Sbjct: 121 LPNGEVSYCLHFSSTSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHLEG 180

Query: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240
           ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV
Sbjct: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240

Query: 241 SEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV 300
           SEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Sbjct: 241 SEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIGALSTLPGVGPKV 300

Query: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360
           AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW
Sbjct: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360

Query: 361 AQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE 391
           AQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++
Sbjct: 361 AQTLLFVAELPQQKALLPATLENTKRKRSTKQQRD 395

BLAST of Tan0005606 vs. ExPASy TrEMBL
Match: A0A1S3CS00 (DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis melo OX=3656 GN=LOC103504082 PE=3 SV=1)

HSP 1 Score: 673.7 bits (1737), Expect = 4.7e-190
Identity = 353/406 (86.95%), Postives = 369/406 (90.89%), Query Frame = 0

Query: 1   MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKT 60
           M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KT
Sbjct: 1   MPSLSFKPLLLMTKRFKPTTPSTPSTKPSPPPPSPPTPQLSHSKPTTVSIHHSSKNPNKT 60

Query: 61  LTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH 120
           LTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWKQT PL FTGVV SHLISL H
Sbjct: 61  LTLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNH 120

Query: 121 LPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEG 180
           LPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEG
Sbjct: 121 LPNGEVSYCLHFSSTSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHLEG 180

Query: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240
           ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV
Sbjct: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240

Query: 241 SEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV 300
           SEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Sbjct: 241 SEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIGALSTLPGVGPKV 300

Query: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360
           AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW
Sbjct: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360

Query: 361 AQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDRCE 402
           AQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++    GNID+CE
Sbjct: 361 AQTLLFVAELPQQKALLPATLENTKRKRSTKQQRDMAHAGNIDQCE 406

BLAST of Tan0005606 vs. ExPASy TrEMBL
Match: A0A0A0KIU8 (DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis sativus OX=3659 GN=Csa_6G382890 PE=3 SV=1)

HSP 1 Score: 666.8 bits (1719), Expect = 5.8e-188
Identity = 348/405 (85.93%), Postives = 366/405 (90.37%), Query Frame = 0

Query: 1   MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKT 60
           M S S +P LLM+KRL+PTPPSTPS KP   PP PPTPQ SHSKPTTVS+H+SS NP KT
Sbjct: 1   MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKT 60

Query: 61  LTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH 120
           L LL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWKQT P  FTGVV SHLISL H
Sbjct: 61  LPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPFEFTGVVGSHLISLNH 120

Query: 121 LPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGA 180
           LPNGDVSYCLH  STSS +AAARL LLDFLNA ISLS+IWEVFSAADPRFD LARH EGA
Sbjct: 121 LPNGDVSYCLHFSSTSS-SAAARLALLDFLNASISLSSIWEVFSAADPRFDALARHFEGA 180

Query: 181 RVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVS 240
           RVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVS
Sbjct: 181 RVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVS 240

Query: 241 EAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA 300
           EAELREAGFGYRAKYIIG VNAL+AKP GGAEWLLSLRD DLEEVI+ALSTLPGVGPKVA
Sbjct: 241 EAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIEALSTLPGVGPKVA 300

Query: 301 ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWA 360
           ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWA
Sbjct: 301 ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWA 360

Query: 361 QTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDRCE 402
           QTLLF+A+LPQQKALLP+ LEN KRK+STKQQK+    GNID+CE
Sbjct: 361 QTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNIDQCE 404

BLAST of Tan0005606 vs. ExPASy TrEMBL
Match: A0A5D3CBS3 (DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold314G00460 PE=3 SV=1)

HSP 1 Score: 661.8 bits (1706), Expect = 1.9e-186
Identity = 347/395 (87.85%), Postives = 362/395 (91.65%), Query Frame = 0

Query: 1   MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKT 60
           M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KT
Sbjct: 1   MPSLSFKPLLLMTKRFKPTTPSTPSTKPSPPPPSPPTPQLSHSKPTTVSIHHSSKNPNKT 60

Query: 61  LTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH 120
           LTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWKQT PL FTGVV SHLISL H
Sbjct: 61  LTLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNH 120

Query: 121 LPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEG 180
           LPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEG
Sbjct: 121 LPNGEVSYCLHFSSTSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHLEG 180

Query: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240
           ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV
Sbjct: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240

Query: 241 SEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV 300
           SEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Sbjct: 241 SEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIGALSTLPGVGPKV 300

Query: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360
           AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW
Sbjct: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360

Query: 361 AQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE 391
           AQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++
Sbjct: 361 AQTLLFVAELPQQKALLPATLENTKRKRSTKQQRD 395

BLAST of Tan0005606 vs. ExPASy TrEMBL
Match: A0A1S4E5V3 (DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis melo OX=3656 GN=LOC103504082 PE=3 SV=1)

HSP 1 Score: 661.8 bits (1706), Expect = 1.9e-186
Identity = 347/395 (87.85%), Postives = 362/395 (91.65%), Query Frame = 0

Query: 1   MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKT 60
           M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KT
Sbjct: 1   MPSLSFKPLLLMTKRFKPTTPSTPSTKPSPPPPSPPTPQLSHSKPTTVSIHHSSKNPNKT 60

Query: 61  LTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH 120
           LTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWKQT PL FTGVV SHLISL H
Sbjct: 61  LTLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNH 120

Query: 121 LPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEG 180
           LPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEG
Sbjct: 121 LPNGEVSYCLHFSSTSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHLEG 180

Query: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240
           ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV
Sbjct: 181 ARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLV 240

Query: 241 SEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV 300
           SEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Sbjct: 241 SEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIGALSTLPGVGPKV 300

Query: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360
           AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW
Sbjct: 301 AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGW 360

Query: 361 AQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE 391
           AQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++
Sbjct: 361 AQTLLFVAELPQQKALLPATLENTKRKRSTKQQRD 395

BLAST of Tan0005606 vs. ExPASy TrEMBL
Match: A0A6J1FL67 (DNA-(apurinic or apyrimidinic site) lyase OS=Cucurbita moschata OX=3662 GN=LOC111445237 PE=3 SV=1)

HSP 1 Score: 658.3 bits (1697), Expect = 2.1e-185
Identity = 347/405 (85.68%), Postives = 364/405 (89.88%), Query Frame = 0

Query: 1   MHSFSSRPLLMSKRLRPTPPSTPSAK----PPSSPPPPPTPQPSHSKPTTVSIHYSSNNP 60
           M S S R  LM+KRLRPTPPSTPSAK    PPS PP PPTPQ  HSKPTTVS+ +SSN+ 
Sbjct: 1   MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDR 60

Query: 61  PKTLTLLNSP---SSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLIS 120
            KTLT L SP   +SSNWV LNL++SDLSLPLTFPTGQTFRWKQT+PLHFTGVV  HLIS
Sbjct: 61  NKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLIS 120

Query: 121 LKHLPNGDVSYCLHSCST---SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLA 180
           L HLPNGDVSYCLHSCST   SS AAAARL LLDFLNAGISLSAIWEVFSAADPRFD L+
Sbjct: 121 LTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLS 180

Query: 181 RHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLE 240
           RHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGN+LGN+GGFDFHEFPSLE
Sbjct: 181 RHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLE 240

Query: 241 RLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPG 300
           RLSLVSEAELREAGFGYRAKYIIGTV  L+ KPGGGAEWLLSLRDL LEEVI+ L+ LPG
Sbjct: 241 RLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG 300

Query: 301 VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYG 360
           VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYG
Sbjct: 301 VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYG 360

Query: 361 KYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTG 396
           KYAGWAQTLLFVADLPQQKALLP+ LEN KRKKSTK+Q+EK  TG
Sbjct: 361 KYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQREKAHTG 405

BLAST of Tan0005606 vs. TAIR 10
Match: AT1G21710.1 (8-oxoguanine-DNA glycosylase 1 )

HSP 1 Score: 451.8 bits (1161), Expect = 5.6e-127
Identity = 232/343 (67.64%), Postives = 271/343 (79.01%), Query Frame = 0

Query: 33  PPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQT 92
           P PT QPS S  +TV    S    P     L+   +  W PL L+ ++L+LPLTFPTGQT
Sbjct: 4   PRPTSQPSIS--STVKPPLSPPVTPILKQKLHRTGTPKWFPLKLTHTELTLPLTFPTGQT 63

Query: 93  FRWKQTAPLHFTGVVASHLISLKHLPNGD-VSYCLHSCSTSSVAAAARLTLLDFLNAGIS 152
           FRWK+T  + ++G +  HL+SL+  P  D VSYC+H CSTS    +A L LLDFLNA IS
Sbjct: 64  FRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVH-CSTS--PKSAELALLDFLNAEIS 123

Query: 153 LSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSL 212
           L+ +W  FS  DPRF  LARHL GARVLRQDPLECLIQFLCSSNNNI RITKMVD++SSL
Sbjct: 124 LAELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSSNNNIARITKMVDFVSSL 183

Query: 213 GNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLL 272
           G +LG++ GF+FH+FPSL+RLS VSE E R+AGFGYRAKYI GTVNAL+AKPGGG EWLL
Sbjct: 184 GLHLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYRAKYITGTVNALQAKPGGGNEWLL 243

Query: 273 SLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGA 332
           SLR ++L+E + AL TLPGVGPKVAAC+ALFSLDQH AIPVDTHVWQIAT YL+P+LAGA
Sbjct: 244 SLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIATNYLLPDLAGA 303

Query: 333 RLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPS 375
           +LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL S
Sbjct: 304 KLTPKLHGRVAEAFVSKYGEYAGWAQTLLFIAELPAQKTLLQS 341

BLAST of Tan0005606 vs. TAIR 10
Match: AT3G47830.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 55.1 bits (131), Expect = 1.5e-07
Identity = 33/69 (47.83%), Postives = 41/69 (59.42%), Query Frame = 0

Query: 273 LRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGA 332
           LR L +EEV   LS   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A  
Sbjct: 175 LRGLSVEEVKTELSHFKGVGPKTVSCVLMFNL-QHNDFPVDTHVFEIAKALGWVPKTADR 234

Query: 333 RLTPKLCNR 341
             T    NR
Sbjct: 235 NKTYVHLNR 242

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FNY77.9e-12667.64N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana OX=3702 GN=OGG1 PE=1 SV=1[more]
O087605.3e-5339.29N-glycosylase/DNA lyase OS=Mus musculus OX=10090 GN=Ogg1 PE=1 SV=2[more]
O702496.9e-5338.69N-glycosylase/DNA lyase OS=Rattus norvegicus OX=10116 GN=Ogg1 PE=2 SV=1[more]
O155272.9e-5137.50N-glycosylase/DNA lyase OS=Homo sapiens OX=9606 GN=OGG1 PE=1 SV=2[more]
Q9V3I82.7e-4132.92N-glycosylase/DNA lyase OS=Drosophila melanogaster OX=7227 GN=Ogg1 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
XP_038885236.11.8e-19187.97N-glycosylase/DNA lyase OGG1 isoform X2 [Benincasa hispida][more]
XP_008466739.19.8e-19086.95PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo][more]
XP_038885235.18.3e-18985.40N-glycosylase/DNA lyase OGG1 isoform X1 [Benincasa hispida][more]
XP_004149809.21.2e-18785.93N-glycosylase/DNA lyase OGG1 [Cucumis sativus] >KGN47701.1 hypothetical protein ... [more]
XP_016903621.13.9e-18687.85PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo] >TYK08955.1 N-... [more]
Match NameE-valueIdentityDescription
A0A1S3CS004.7e-19086.95DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis melo OX=3656 GN=LOC10350408... [more]
A0A0A0KIU85.8e-18885.93DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis sativus OX=3659 GN=Csa_6G38... [more]
A0A5D3CBS31.9e-18687.85DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis melo var. makuwa OX=1194695... [more]
A0A1S4E5V31.9e-18687.85DNA-(apurinic or apyrimidinic site) lyase OS=Cucumis melo OX=3656 GN=LOC10350408... [more]
A0A6J1FL672.1e-18585.68DNA-(apurinic or apyrimidinic site) lyase OS=Cucurbita moschata OX=3662 GN=LOC11... [more]
Match NameE-valueIdentityDescription
AT1G21710.15.6e-12767.648-oxoguanine-DNA glycosylase 1 [more]
AT3G47830.11.5e-0747.83DNA glycosylase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003265HhH-GPD domainSMARTSM00478endo3endcoord: 192..365
e-value: 3.9E-20
score: 82.9
IPR003265HhH-GPD domainPFAMPF00730HhH-GPDcoord: 188..323
e-value: 5.0E-11
score: 43.0
IPR003265HhH-GPD domainCDDcd00056ENDO3ccoord: 185..361
e-value: 1.11224E-27
score: 105.401
IPR023170Helix-hairpin-helix, base-excision DNA repair, C-terminalGENE3D1.10.1670.10coord: 157..365
e-value: 6.2E-62
score: 210.6
NoneNo IPR availableGENE3D3.30.310.40coord: 61..143
e-value: 3.7E-16
score: 60.5
NoneNo IPR availableGENE3D1.10.340.30Hypothetical protein; domain 2coord: 182..306
e-value: 6.2E-62
score: 210.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..53
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 20..41
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 373..401
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 387..401
NoneNo IPR availablePANTHERPTHR10242:SF2N-GLYCOSYLASE/DNA LYASEcoord: 69..391
NoneNo IPR availablePANTHERPTHR102428-OXOGUANINE DNA GLYCOSYLASEcoord: 69..391
NoneNo IPR availableSUPERFAMILY55945TATA-box binding protein-likecoord: 66..181
IPR0129048-oxoguanine DNA glycosylase, N-terminalPFAMPF07934OGG_Ncoord: 74..187
e-value: 1.3E-19
score: 70.5
IPR011257DNA glycosylaseSUPERFAMILY48150DNA-glycosylasecoord: 182..364

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0005606.1Tan0005606.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006285 base-excision repair, AP site formation
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0006289 nucleotide-excision repair
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus
molecular_function GO:0034039 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity
molecular_function GO:0140078 class I DNA-(apurinic or apyrimidinic site) endonuclease activity
molecular_function GO:0003684 damaged DNA binding
molecular_function GO:0003824 catalytic activity
molecular_function GO:0008534 oxidized purine nucleobase lesion DNA N-glycosylase activity