CmaCh17G013890 (gene) Cucurbita maxima (Rimu)

Name: CmaCh17G013890
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: A/G-specific adenine DNA glycosylase
Location: Cma_Chr17 : 9254363 .. 9260445 (-)
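The location string for this feature can be parsed to recover the genomic span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

# Pattern for strings such as "Cma_Chr17 : 9254363 .. 9260445 (-)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


chrom, start, end, strand = parse_location("Cma_Chr17 : 9254363 .. 9260445 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 6083 bp; the `(-)` suffix indicates that the gene lies on the reverse strand.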
The following sequences are available for this feature:

Gene sequence (with intron)

GAAATTAGACAGAATTCACATTTGGCTCAATTCATTATTCATTTTATTAGTGAAAACTAATTGAGTATTAATCTCGTCAAACTGGGATTTTCCCGCCCCAATCAGGTTAGGGGGTCGCAGTGTTTACCTTTTCTAACCCAGCTACACTAATAATGGACTGGGTCCTCGTACATCGCTACACATACCTTCCCTACTCGTTTCTTTTCTCTTCTTCTCTCCTTGGCCGGCAACTTTCACCTCGTGGGTAATCATTGAAGAGTCATCAGGGTAGTGGGACGGGTTGTTACAGTATGAGCGGCGGAGAAAAGAACGAGAACCATGAGGATGTGAAAAAGAAACCCACGAAGGGGGAAAAGCGCCGGGGCCGAAGTCCGTCCAAAAGGGAACCAATCGTTGACATTGAAGATATTATGTTCAGCATAGATAAAGTTCAGACAATGAGGTCATCGCTATTGGATTGGTACGACCTTAGCCATAGGGACCTTCCTTGGAGGAGGTTGGACAAAGGGCAGCCTGAAACACGGGGTTATGGTGTGTGGGTATCAGAAATAATGCTGCAGCAGACGAGAGTTCAGACCGTCGTTGAATATTACAAGCGTTGGATGCATAGATGGCCCACCGTTCAACATCTCTCTCGTGCTTCTCTCGAGGTTGGTTTTATGATTTAGTTTGAGCTACAATATTGTCTTCTTCACTCTCAGTAAGCAGTTTAGATATCTGGCAGGAGGTGAATGAAATGTGGGCAGGCTTGGGATACTATAGACGAGCTCGTTTTCTCCTAGAGGTAATCTTTACTCGCTGCACTAATGGAATTTTTGGTCTCTTAATATCTGCCACCACGATTACTCCTTCCAATTTCCTTGGACATGGTTTAGGTTGGCATTTACTTTTAAATATTATCTTCGTTTGTTAAGAGGGCACCTTTTTGTTTTTAATCTTTCTCACGATAGAGGTAGGAGAGGCGGGAAGCTTTTGAAACTCCCTTAGCTTGTGTACGCCATGTTAACAAATAAGATTTTGATTGTATCACATCCCAACCAACCAATCAATTTTCTAATCGATTTAACAACCTTAAGAAAGAAAATTTGAAGTTAAAGTTTCTTTTGCTTTTAGATGCTGATTGTATCAAAATTATGTATATGTTGATAGTCTTTTGCAAAAAGTATCTCAGGCCTTGGCCTACTAACTACTGCCTTGCGACAAGGATTAGACTTGGAACCATACTGTGTAGGTCTCGTTCATGTGATTCTGGTATCCATTTGGAGTTTGTTTTCTACCACCACTGATCGATTGCTGGGAAAACACACACTGATTTGCTTTACTATAGCTTATATCACTGTCTCAATCTCTATCTACTCTTTCCTAAATAAAACAAAAATTCTGCAACAGGATAAGAAAAGTTAAGAGAACATTCGTTCGAGTTTACGCCCATCCTTCTGAGTTCAAAGGGAGATTCGAGGGAGGGAGGGGGGGTTTACATACAACTGATGCAAGATGGTGATGAATTGCAATTGGTCCAAACCTTCGACCAATGACTTATTACGGCCCCGTCAGCCTATGGGAAGATGAGGATAAGTAATAAAGAGTTGGATGGTTCAGCCAAAACTGTTGGACTCATATGGGTCTGCTATCCTACCATTTTAACATTAGAATAGTATTTAAGAAAGGTGAAGTACAAGATAACCCTAGTATTCAGGATATTGAAATAACAACCTAACTTGTGGTCTCCCTCCCAACAAGCTTCTCAAACTCCCTTACACCCTATAAGCTCCTTCCTCAAAGACATACTCAGAAAACCCCATGCTCGCTCTTGTTTCCTAAGGCTTCATGATTCTAGGCTTTATAAGACTCCCTTGATGGAGGGAGATTTACTCTTTTATCAAGCGTACTGATTTATTATATGTCTCCTTTTGCTTTGTCTGTGTCGATAAGGAATTTGAAAAGACCATGAGGCACTTTTTTCTCTTGGGTAGGCCATGGTGATACGGGAATCTCACATT
TAGTGAGATATGTGAGAAGGATGGAATTTTGTAGAATCCCTTGGGTCACATTCTGGAAGTGAAAAATACTTTTCCTAATTTTATTTCCGATCTCTTTTTGGAAGGTTCCTTTGGATTGCCAATCAGTTATTGGCCAATTGATTTCTCATATTATATGAACTTTGATATCAGACAGGTATGTTAGCAGTGAGTCAGTGACATGTCTGGGACAGTTGCTAAGATCATCGAAACTTGAAAGTTTGTATTAAAAGATGAGGAGTTAACTGGATATTGAAGCTTTCTTGAGTTATCACGTGGAAGAGTTCTATGTAGAGAGGAAGATTGTAGAAGCTGGAATTTGGAGTCTTTAGGATGGGACTCTGTCAAATCTTTGCCAAAACATTTTGAGGGAGCTGCTGAAATCTTGAAATTTGAAGAAGGTGAATATGTTGACCTGGCGTTTATGGAGAGGGTAGACACATGCAAAGCAGAGCGATCCTGAGGATAGAATGAAAGCAAGCGATCCTGGTACATGAGTGAAACTGGACTACATTCGAATGAGAGTGATCCTAAGGATAAGATGGTTAAAATAGGTTTAAAAGAATTGGAAGTTACTACCTTTACCAATAAGGTGCATCTTCTTTTTGTTTTCGATGGCTCAATCATAGGAACTCTGAAGTTAAGCATGCTTTTCTTAAAGCAGTTCTATGTCGGGTGACCTCCTGAGAATTTGCCTAGGATGCATGTGAGTGATAACAAAACATGCTGAAAGGTCCTGTGTTGGTCTGTAGGGATAGTCTTCACTCTTAGAAGCATTAAGTAACATACCCATGTTGTAGAGGTGCAAGGTAATATCGAGGCACATAGGTGTCGAATCCAAATTTCAAATAAAGTACAAAAAGACTGTAATGCACACCATTTGCATACATCCTAGTGGGTTTTCTTTTCTACTTGTATATTCTTTAGTCAATTAGAAAATTGACTGCATGTTACTTATCAAAATATTACAATGAAGTACAAAAAGAACTGCAATGAAATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATTGACATCTCTGGATATTAATCATTGCTATGAAATTTTGAATAAACCCTTTCTAATTTTGTTGTTAGGGTGCAAAGTTGATAGTCAAAGAAGGTGGCGAATTTCCTAAAACGGTTCCTGACCTTCGAAAAATTCCTGGAATTGGAGAGTATACAGCAGGCGCTATTGCCTCCATAGCATTTGATGAAGTGAGTGTTTTTCTCGCCTATTGTTTCTCTGACTCTCCCTAGCGGACGCGTTTTAAAAACCTTGAGAGATAGTCCAAAGAGGACAATGTCTGCTAGCGGTGGGCTTGGGCCGTTACAAAGCACTTCTAAATATGTTTCCTGATCAGGTGGTGCCTGTGGTTGATGGTAATGTGATTCGGGTAATCGCTCGATTAAAGGCTATTTCAGGAAATCCAAAAGACTCGAAGTTGGTTAAGCAAGTTTGGTGAGCATATTTTCTAGTTGTTCTAAAGGCTTTTATAGAATCTTTGTATCCTAATTGTTCTCATTATTTTTTTTTGGCGTGGGGGGTGGATGAGTTAGGAAGGCAGCTGCTCAATTAGTTGATCCTTCCAGGCCTGGGGACTTCAATCAGGCACTCATGGAGCTTGGTGCAACTTTATGCACACCAACAAGCCCAAGCTGCTCAACATGCCCTGTGTTTGACCACTGTGAGGCCCTTTCAATCTCAAAGGATGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAAACCAAACAAAGACATGATTACTCTGCTGTTTGTGTGGTTGAGATGTTGGAAAATCGGGGCACATCTGAGTTGAAGCAATGTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGCTTGCTTGCTGGCCTATGGGAGTTCCCATCTGTCTTGTTGAATGGAGAAGCTGATTCAAGTACAAGGAGAGAATCCATTA
ACAGCCTCTTGAGTAAATCCTTTGGACTTGAACCAAAAAAGAATTTTGAAATCGTTATTAGAGAAGATGTTGGAGATTTTGTCCATGTTTTCTCGCACATCCGTCTCAAGATATATGTTGAACACTTGGTGTTACGTTTAAAAGGTTCGAGTTACTTCTCCTTTCTTTCCTATCTGTGCATTCAGTATGCCATGAACTCTGTGACTAGAAACTTGCTACAAAACTTAACCTATATGAGATTTACTTATCTTTTACGCTGTCAAACATTTTGATGCAGCTACAATTAGTCATTATATGTTCTTTTGTGTGAGATCTCATATTAGTTGGAGAGGGAAACGAAACATTCCTTATAAGGGTGTGGAAACCTCTTTCTAGTAGACGTGTTAAAACCTTGAGGGAAAGCCTAAAGAGGACAAAATCTAGTAGCGGTGGACTTGGACTGTTACAAATGGTATCAGAGGCAGACACTGGGCAGTGTGCCAATGGGGACGTTGGGCCCTAAGGGGGGTGGATTGTGAGATCCCACATCGGCTGGAGAAGGAAACGAAACATTCCTTATAAGGGTGTGAAAACTTCTCCCTAGTAGACGTGTGTTATTTCTTTTCATTTCAATTAATTTGCTCAATCTGTTTGTATGTGTTCTTGAAGGTGAAGGTAGCAAGCTGTTTCGGAAACAGGAGAAGAAATCTATATCATGGAAATGTGTGGACAACAAGGTTATGTCGAGCATGGGGTTGACGTCCAGTGTGAGGAAGGTAAGCAAAGATGGTGCTTTAGATGACTTCCCTTGTGAAATATTTAACTTGAAGGTTCTATTTTCATCGTTAAACTTGAAAGAAAACTCTTTTAGATGGCATGCTTACAGTATCCCAAACCTACATCTAGCAGATATTGTTTTCTTTGGGCTTTCCTTTAAGGACTTTCCTTCAAGGTTTTAAAATGCGTCTTATAAGAAATGATTTGTTCTCCCCTCCAACCGATATGAGATGATCTCACAATCCACCCCTTAGGGGCCAGCGTCCCCACTCGCACACCACTCGGATCTGGCTCTAATACCATCATTTGTAACAACCCAAGCCCACCACTAGCAGATATTATCCACTTTGGCCTGATACGTATCGCCGTCAGCCTCACGGTTTTAAAACTTGTCTACTAGGGAAAGGGTCTAGCCCTACTCCGGCTAGTGCCTCGCACACTTTGGTGACCAGCCCTAATACTATTTGTAATAGCTCAAGGCCCACCCTTAGCAGATATTGTCCTGTTTAGGCTTTCCCTTCAAAGGTTTCTCCTCAAAGTTGTAAAATGCATCTACTAAGGAGGGATTTCCACACCCTTATAAGGAATGCTTCGTTCTCCTCTTCAACCGTTGTGAGATCTCACATTTAGTATTCCTTCGAACAAACAAATCAATACTCATATAGTCTCCGATGTTGAATGGTATGAACATTAACAACACTTGAGATTCAAAATCATATATACGTTGGATTAAGTGACATTGAGAATCGGTAAACAGGGACTTCTCTGATCACATGATCACTCTCTTTGAATGCCCTGGTGCATGCACATTTATTTATGGATTGCATAGAGATTTGATATCTGTAATCCAAGTTTGATGGACGTCAAGATTCACAAACACAGTATCCTTTTACAGTCCATGATATCATAATGAATACATGATATTATATACAATCAGGAGATCGAAGAAGGTCAACATACGTGTCGTTTTCACATTGTATTTGTAAAGAATGAGTGTTACTAAACATGATTATACTCGAGAGGTCAAGTTGATTAATAATCTGACATCTTTTTGATAGGTGTATGACATGGTGGAGAAATTTGAGGCAGAGATGATATCTCCTAGCCGTGCAGTAGCCACAAAAAAACAGAGGGCTACTTCAACAAACTTGAGCTGCAGGAGCTGTTGACCTTAATCAAAGTAAGCTAATATATCCACCCACCCGGGCTTGGGATTAGAAAACTAGTAAACTATCGGGGTCGAA
CCGACTAAACCATCAGTTTTTTGTTGGTTTAGTGAACCAATGATAGTATGACTAAAAAAAATTAAAAAAAATTTCTTTTCGGT

mRNA sequence

GAAATTAGACAGAATTCACATTTGGCTCAATTCATTATTCATTTTATTAGTGAAAACTAATTGAGTATTAATCTCGTCAAACTGGGATTTTCCCGCCCCAATCAGGTTAGGGGGTCGCAGTGTTTACCTTTTCTAACCCAGCTACACTAATAATGGACTGGGTCCTCGTACATCGCTACACATACCTTCCCTACTCGTTTCTTTTCTCTTCTTCTCTCCTTGGCCGGCAACTTTCACCTCGTGGGTAATCATTGAAGAGTCATCAGGGTAGTGGGACGGGTTGTTACAGTATGAGCGGCGGAGAAAAGAACGAGAACCATGAGGATGTGAAAAAGAAACCCACGAAGGGGGAAAAGCGCCGGGGCCGAAGTCCGTCCAAAAGGGAACCAATCGTTGACATTGAAGATATTATGTTCAGCATAGATAAAGTTCAGACAATGAGGTCATCGCTATTGGATTGGTACGACCTTAGCCATAGGGACCTTCCTTGGAGGAGGTTGGACAAAGGGCAGCCTGAAACACGGGGTTATGGTGTGTGGGTATCAGAAATAATGCTGCAGCAGACGAGAGTTCAGACCGTCGTTGAATATTACAAGCGTTGGATGCATAGATGGCCCACCGTTCAACATCTCTCTCGTGCTTCTCTCGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGATACTATAGACGAGCTCGTTTTCTCCTAGAGGGTGCAAAGTTGATAGTCAAAGAAGGTGGCGAATTTCCTAAAACGGTTCCTGACCTTCGAAAAATTCCTGGAATTGGAGAGTATACAGCAGGCGCTATTGCCTCCATAGCATTTGATGAAGTGGTGCCTGTGGTTGATGGTAATGTGATTCGGGTAATCGCTCGATTAAAGGCTATTTCAGGAAATCCAAAAGACTCGAAGTTGGTTAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCCTTCCAGGCCTGGGGACTTCAATCAGGCACTCATGGAGCTTGGTGCAACTTTATGCACACCAACAAGCCCAAGCTGCTCAACATGCCCTGTGTTTGACCACTGTGAGGCCCTTTCAATCTCAAAGGATGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAAACCAAACAAAGACATGATTACTCTGCTGTTTGTGTGGTTGAGATGTTGGAAAATCGGGGCACATCTGAGTTGAAGCAATGTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGCTTGCTTGCTGGCCTATGGGAGTTCCCATCTGTCTTGTTGAATGGAGAAGCTGATTCAAGTACAAGGAGAGAATCCATTAACAGCCTCTTGAGTAAATCCTTTGGACTTGAACCAAAAAAGAATTTTGAAATCGTTATTAGAGAAGATGTTGGAGATTTTGTCCATGTTTTCTCGCACATCCGTCTCAAGATATATGTTGAACACTTGGTGTTACGTTTAAAAGGTGAAGGTAGCAAGCTGTTTCGGAAACAGGAGAAGAAATCTATATCATGGAAATGTGTGGACAACAAGGTTATGTCGAGCATGGGGTTGACGTCCAGTGTGAGGAAGGTGTATGACATGGTGGAGAAATTTGAGGCAGAGATGATATCTCCTAGCCGTGCAGTAGCCACAAAAAAACAGAGGGCTACTTCAACAAACTTGAGCTGCAGGAGCTGTTGACCTTAATCAAAGTAAGCTAATATATCCACCCACCCGGGCTTGGGATTAGAAAACTAGTAAACTATCGGGGTCGAACCGACTAAACCATCAGTTTTTTGTTGGTTTAGTGAACCAATGATAGTATGACTAAAAAAAATTAAAAAAAATTTCTTTTCGGT

Coding sequence (CDS)

ATGAGCGGCGGAGAAAAGAACGAGAACCATGAGGATGTGAAAAAGAAACCCACGAAGGGGGAAAAGCGCCGGGGCCGAAGTCCGTCCAAAAGGGAACCAATCGTTGACATTGAAGATATTATGTTCAGCATAGATAAAGTTCAGACAATGAGGTCATCGCTATTGGATTGGTACGACCTTAGCCATAGGGACCTTCCTTGGAGGAGGTTGGACAAAGGGCAGCCTGAAACACGGGGTTATGGTGTGTGGGTATCAGAAATAATGCTGCAGCAGACGAGAGTTCAGACCGTCGTTGAATATTACAAGCGTTGGATGCATAGATGGCCCACCGTTCAACATCTCTCTCGTGCTTCTCTCGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGATACTATAGACGAGCTCGTTTTCTCCTAGAGGGTGCAAAGTTGATAGTCAAAGAAGGTGGCGAATTTCCTAAAACGGTTCCTGACCTTCGAAAAATTCCTGGAATTGGAGAGTATACAGCAGGCGCTATTGCCTCCATAGCATTTGATGAAGTGGTGCCTGTGGTTGATGGTAATGTGATTCGGGTAATCGCTCGATTAAAGGCTATTTCAGGAAATCCAAAAGACTCGAAGTTGGTTAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCCTTCCAGGCCTGGGGACTTCAATCAGGCACTCATGGAGCTTGGTGCAACTTTATGCACACCAACAAGCCCAAGCTGCTCAACATGCCCTGTGTTTGACCACTGTGAGGCCCTTTCAATCTCAAAGGATGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAAACCAAACAAAGACATGATTACTCTGCTGTTTGTGTGGTTGAGATGTTGGAAAATCGGGGCACATCTGAGTTGAAGCAATGTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGCTTGCTTGCTGGCCTATGGGAGTTCCCATCTGTCTTGTTGAATGGAGAAGCTGATTCAAGTACAAGGAGAGAATCCATTAACAGCCTCTTGAGTAAATCCTTTGGACTTGAACCAAAAAAGAATTTTGAAATCGTTATTAGAGAAGATGTTGGAGATTTTGTCCATGTTTTCTCGCACATCCGTCTCAAGATATATGTTGAACACTTGGTGTTACGTTTAAAAGGTGAAGGTAGCAAGCTGTTTCGGAAACAGGAGAAGAAATCTATATCATGGAAATGTGTGGACAACAAGGTTATGTCGAGCATGGGGTTGACGTCCAGTGTGAGGAAGGTGTATGACATGGTGGAGAAATTTGAGGCAGAGATGATATCTCCTAGCCGTGCAGTAGCCACAAAAAAACAGAGGGCTACTTCAACAAACTTGAGCTGCAGGAGCTGTTGA

Protein sequence

MSGGEKNENHEDVKKKPTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRSSLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVVEMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSFGLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCVDNKVMSSMGLTSSVRKVYDMVEKFEAEMISPSRAVATKKQRATSTNLSCRSC
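As a quick consistency check between the coding sequence and the protein sequence, the ends of the CDS can be translated by hand. The sketch below assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 30 and last 15 bases of the CDS listed above are used.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGAGCGGCGGAGAAAAGAACGAGAACCAT"  # first 30 bases of the CDS
cds_end = "TGCAGGAGCTGTTGA"                  # last 15 bases of the CDS

print(translate(cds_start))  # MSGGEKNENH
print(translate(cds_end))    # CRSC*
```

Both fragments agree with the protein sequence, which begins with MSGGEKNENH and ends with CRSC, followed by the TGA stop codon.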
BLAST of CmaCh17G013890 vs. Swiss-Prot
Match: MUTYH_ARATH (Adenine DNA glycosylase OS=Arabidopsis thaliana GN=MYH PE=3 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 1.7e-132
Identity = 250/449 (55.68%), Positives = 317/449 (70.60%), Query Frame = 1

Query: 5   EKNENHEDVKKKPTKGEKRRGRSPSKREPIV--DIEDIMFSIDKVQTMRSSLLDWYDLSH 64
           E   + E+ +++  + E+         E  +  DIED+ FS ++ Q +R  LLDWYD++ 
Sbjct: 85  EAEADKEEAEEESEEEEEEEEEEAEAEEEALGGDIEDL-FSENETQKIRMGLLDWYDVNK 144

Query: 65  RDLPWR-RLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLE- 124
           RDLPWR R  + + E R Y VWVSEIMLQQTRVQTV++YYKRWM +WPT+  L +ASLE 
Sbjct: 145 RDLPWRNRRSESEKERRAYEVWVSEIMLQQTRVQTVMKYYKRWMQKWPTIYDLGQASLEN 204

Query: 125 ------------------EVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKI 184
                             EVNEMWAGLGYYRRARFLLEGAK++V     FP     L K+
Sbjct: 205 LIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRARFLLEGAKMVVAGTEGFPNQASSLMKV 264

Query: 185 PGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDP 244
            GIG+YTAGAIASIAF+E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVDP
Sbjct: 265 KGIGQYTAGAIASIAFNEAVPVVDGNVIRVLARLKAISANPKDRLTARNFWKLAAQLVDP 324

Query: 245 SRPGDFNQALMELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQ 304
           SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+++ ++ VTDYP K IK K 
Sbjct: 325 SRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQCRAFSLSEENRTISVTDYPTKVIKAKP 384

Query: 305 RHDYSAVCVVEMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRE 364
           RHD+  VCV+E+         +   RF+LVKRP++GLLAGLWEFPSV+LN EADS+TRR 
Sbjct: 385 RHDFCCVCVLEI---HNLERNQSGGRFVLVKRPEQGLLAGLWEFPSVILNEEADSATRRN 444

Query: 365 SINSLLSKS--FGLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFR 424
           +IN  L ++  F +E KK   IV RE++G+FVH+F+HIR K+YVE LV++L G    LF+
Sbjct: 445 AINVYLKEAFRFHVELKKACTIVSREELGEFVHIFTHIRRKVYVELLVVQLTGGTEDLFK 504

Query: 425 KQEKKSISWKCVDNKVMSSMGLTSSVRKV 430
            Q K +++WKCV + V+S++GLTS+VRKV
Sbjct: 505 GQAKDTLTWKCVSSDVLSTLGLTSAVRKV 529
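The identity and positive-match percentages in an HSP summary line are ratios over the alignment length. A minimal sketch that reproduces the figures for the hit above (250 identical and 317 positive positions out of 449 aligned); `percent` is a hypothetical helper name.

```python
def percent(matches, aligned):
    """Share of aligned positions that match, as a percentage with two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned, 2)


identity = percent(250, 449)
positives = percent(317, 449)
print(f"{identity:.2f}% identity, {positives:.2f}% positives")
```

The same calculation applies to the other hits in this report, since each summary line gives both counts over a common alignment length.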

BLAST of CmaCh17G013890 vs. Swiss-Prot
Match: MUTYH_MOUSE (Adenine DNA glycosylase OS=Mus musculus GN=Mutyh PE=2 SV=2)

HSP 1 Score: 310.8 bits (795), Expect = 2.4e-83
Identity = 185/431 (42.92%), Positives = 244/431 (56.61%), Query Frame = 1

Query: 14  KKKPTKGEKRRGRSPS---------------KREPIVDIE----DIMFSIDKVQTMRSSL 73
           KK+P   ++RR R+ S               KRE ++        +   +  V   RS+L
Sbjct: 12  KKQPANHKRRRTRALSSSQAKPSSLDGLAKQKREELLQASVSPYHLFSDVADVTAFRSNL 71

Query: 74  LDWYDLSHRDLPWRRLDKGQPET--RGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 133
           L WYD   RDLPWR L K +  +  R Y VWVSE+MLQQT+V TV++YY RWM +WP +Q
Sbjct: 72  LSWYDQEKRDLPWRNLAKEEANSDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQ 131

Query: 134 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKE-GGEFPKTVPDLRKI-PGIGEYTA 193
            L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG  P+T   L+++ PG+G YTA
Sbjct: 132 DLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTA 191

Query: 194 GAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQ 253
           GAIASIAFD+V  VVDGNV+RV+ R++AI  +P  + +   +W  A QLVDP+RPGDFNQ
Sbjct: 192 GAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPTSTLVSHHLWNLAQQLVDPARPGDFNQ 251

Query: 254 ALMELGATLCTPTSPSCSTCPVFDHCEAL------------------------------- 313
           A MELGAT+CTP  P CS CPV   C A                                
Sbjct: 252 AAMELGATVCTPQRPLCSHCPVQSLCRAYQRVQRGQLSALPGRPDIEECALNTRQCQLCL 311

Query: 314 -SISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVVEMLENRGTSELKQCSRFLLVKRPDEG 373
            S S  D S+ V ++P K  +   R +YSA CVVE     G   +      LLV+RPD G
Sbjct: 312 TSSSPWDPSMGVANFPRKASRRPPREEYSATCVVEQPGAIGGPLV------LLVQRPDSG 371

Query: 374 LLAGLWEFPSVLLNGEADSSTRRESINSLLSKSFGLEPKKNFEIVIREDVGDFVHVFSHI 390
           LLAGLWEFPSV L  E     + +++   L +  G  P      +  + +G+ +H+FSHI
Sbjct: 372 LLAGLWEFPSVTL--EPSEQHQHKALLQELQRWCGPLP-----AIRLQHLGEVIHIFSHI 429

BLAST of CmaCh17G013890 vs. Swiss-Prot
Match: MUTYH_RAT (Adenine DNA glycosylase OS=Rattus norvegicus GN=Mutyh PE=2 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 3.9e-81
Identity = 187/451 (41.46%), Positives = 254/451 (56.32%), Query Frame = 1

Query: 25  GRSPSKREPI----VDIEDIMFSIDKVQTMRSSLLDWYDLSHRDLPWRRLDKGQP--ETR 84
           G +  KRE +    V    +   I  V   R +LL WYD   RDLPWR+  K +   + R
Sbjct: 38  GLAKQKREELLKTPVSPYHLFSDIADVTAFRRNLLSWYDQEKRDLPWRKRVKEETNLDRR 97

Query: 85  GYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLEEVNEMWAGLGYYRRARFL 144
            Y VWVSE+MLQQT+V TV++YY RWM +WPT+Q L+ ASLEEVN++W+GLGYY R R L
Sbjct: 98  AYAVWVSEVMLQQTQVATVIDYYTRWMQKWPTLQDLASASLEEVNQLWSGLGYYSRGRRL 157

Query: 145 LEGAKLIVKE-GGEFPKTVPDLRKI-PGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARL 204
            EGA+ +V+E GG  P+T   L+++ PG+G YTAGAIASIAFD+V  VVDGNVIRV+ R+
Sbjct: 158 QEGARKVVEELGGHVPRTAETLQQLLPGVGRYTAGAIASIAFDQVTGVVDGNVIRVLCRV 217

Query: 205 KAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTSPSCSTCPVFDHC 264
           +AI  +P  S +   +W  A QLVDP+RPGDFNQA MELGAT+CTP  P C+ CPV   C
Sbjct: 218 RAIGADPTSSFVSHHLWDLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCNHCPVQSLC 277

Query: 265 EA-----------LSISKD---------------------DSSVLVTDYPAKGIKTKQRH 324
            A           L  S D                     D ++ V ++P K  +   R 
Sbjct: 278 RAHQRVGQGRLSALPGSPDIEECALNTRQCQLCLPSTNPWDPNMGVVNFPRKASRRPPRE 337

Query: 325 DYSAVCVVEMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESI 384
           +YSA CVVE     G   +      LLV+RP+ GLLAGLWEFPSV L  E     + +++
Sbjct: 338 EYSATCVVEQPGATGGPLI------LLVQRPNSGLLAGLWEFPSVTL--EPSGQHQHKAL 397

Query: 385 NSLLSKSFGLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEK 436
              L       P         + +G+ +HVFSHI+L   V  L   L+G+          
Sbjct: 398 LQELQHWSAPLPTTPL-----QHLGEVIHVFSHIKLTYQVYSLA--LEGQTPASTTLPGA 457

BLAST of CmaCh17G013890 vs. Swiss-Prot
Match: MUTYH_HUMAN (Adenine DNA glycosylase OS=Homo sapiens GN=MUTYH PE=1 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.4e-67
Identity = 126/219 (57.53%), Positives = 157/219 (71.69%), Query Frame = 1

Query: 44  IDKVQTMRSSLLDWYDLSHRDLPWRRL--DKGQPETRGYGVWVSEIMLQQTRVQTVVEYY 103
           + +V   R SLL WYD   RDLPWRR   D+   + R Y VWVSE+MLQQT+V TV+ YY
Sbjct: 87  VAEVTAFRGSLLSWYDQEKRDLPWRRRAEDEMDLDRRAYAVWVSEVMLQQTQVATVINYY 146

Query: 104 KRWMHRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKE-GGEFPKTVPDLR 163
             WM +WPT+Q L+ ASLEEVN++WAGLGYY R R L EGA+ +V+E GG  P+T   L+
Sbjct: 147 TGWMQKWPTLQDLASASLEEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQ 206

Query: 164 KI-PGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQL 223
           ++ PG+G YTAGAIASIAF +   VVDGNV RV+ R++AI  +P  + + +Q+W  A QL
Sbjct: 207 QLLPGVGRYTAGAIASIAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQL 266

Query: 224 VDPSRPGDFNQALMELGATLCTPTSPSCSTCPVFDHCEA 259
           VDP+RPGDFNQA MELGAT+CTP  P CS CPV   C A
Sbjct: 267 VDPARPGDFNQAAMELGATVCTPQRPLCSQCPVESLCRA 305

BLAST of CmaCh17G013890 vs. Swiss-Prot
Match: MYH1_SCHPO (Adenine DNA glycosylase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=myh1 PE=1 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 9.1e-54
Identity = 150/461 (32.54%), Positives = 230/461 (49.89%), Query Frame = 1

Query: 46  KVQTMRSSLLDWYD--------------LSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQ 105
           +V+  R SL+ +YD                  D P    D  QP  R Y V VSEIMLQQ
Sbjct: 17  EVERFRESLIQFYDKTKRILPWRKKECIPPSEDSPLE--DWEQPVQRLYEVLVSEIMLQQ 76

Query: 106 TRVQTVVEYYKRWMHRWPTVQHLSRASLE-EVNEMWAGLGYYRRARFLLEGAKLIVK-EG 165
           TRV+TV  YY +WM   PT++  + A    +V  +W+G+G+Y R + L +  + + K   
Sbjct: 77  TRVETVKRYYTKWMETLPTLKSCAEAEYNTQVMPLWSGMGFYTRCKRLHQACQHLAKLHP 136

Query: 166 GEFPKTVPDLRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKL 225
            E P+T  +  K IPG+G YTAGA+ SIA+ +   +VDGNVIRV++R  AI  +    K 
Sbjct: 137 SEIPRTGDEWAKGIPGVGPYTAGAVLSIAWKQPTGIVDGNVIRVLSRALAIHSDCSKGKA 196

Query: 226 VKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTSPSCSTCPVFDHCEAL---SISKDD 285
              +WK A +LVDP RPGDFNQALMELGA  CTP SP CS CP+ + C+A    ++ +D 
Sbjct: 197 NALIWKLANELVDPVRPGDFNQALMELGAITCTPQSPRCSVCPISEICKAYQEQNVIRDG 256

Query: 286 SSV-------------------------LVTDYPAKGIKTKQRHDYSAVCVVEMLENRGT 345
           +++                         +V  YP    KTKQR + + V + +  +    
Sbjct: 257 NTIKYDIEDVPCNICITDIPSKEDLQNWVVARYPVHPAKTKQREERALVVIFQKTDPSTK 316

Query: 346 SELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSFGLEPKKNF 405
            +      FL+ KRP  GLLAGLW+FP++    E    +  + +++   KS       + 
Sbjct: 317 EKF-----FLIRKRPSAGLLAGLWDFPTI----EFGQESWPKDMDAEFQKSIAQWISNDS 376

Query: 406 EIVIR--EDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCVDNKVMS 453
             +I+  +  G ++H+FSHIR   +V + +         +   ++   IS   +++  M 
Sbjct: 377 RSLIKKYQSRGRYLHIFSHIRKTSHVFYAI-----ASPDIVTNEDFFWISQSDLEHVGMC 436

BLAST of CmaCh17G013890 vs. TrEMBL
Match: A0A0A0KC27_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G088720 PE=4 SV=1)

HSP 1 Score: 751.9 bits (1940), Expect = 4.6e-214
Identity = 386/464 (83.19%), Positives = 412/464 (88.79%), Query Frame = 1

Query: 1   MSGGEKNENHEDVKK--------KPTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRS 60
           MS GEKNEN E +KK        KPT   KRRGRSPSK E +VDIEDIMFSID VQT+R+
Sbjct: 55  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRA 114

Query: 61  SLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
           SLLDWYD S RDLPWR LDKG+PETR YGVWVSEIMLQQTRVQTVV++Y RWM +WPTVQ
Sbjct: 115 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 174

Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGA 180
           HLSRASLEEVNEMWAGLGYYRRARFL EGAK+IVKEGG FP+TV  LRKIPGIGEYTAGA
Sbjct: 175 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGA 234

Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
           IASIAF EVVPVVDGNVIRVIARLKAISGNPKD KL+KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 235 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 294

Query: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
           MELGATLCTPT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Sbjct: 295 MELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV 354

Query: 301 EMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
           E+LE++GT EL Q SRFLLVKRPDEGLLAGLWEFPSV L+GEAD STRRESINSLLSK+F
Sbjct: 355 EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNF 414

Query: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCV 420
           GLE KKNFEIV REDVGDF+H+F+HIRLKIYVEHLVL LKGEGSKLFRKQEKKSI WKCV
Sbjct: 415 GLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 474

Query: 421 DNKVMSSMGLTSSVRKVYDMVEKFEAEMISPSR--AVATKKQRA 455
           +NKVMS+MGLTSSVRK Y MVEKF+A   S S   A+  KKQ++
Sbjct: 475 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS 518

BLAST of CmaCh17G013890 vs. TrEMBL
Match: E5GB45_CUCME (A/G-specific adenine DNA glycosylase OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 1.2e-193
Identity = 346/401 (86.28%), Positives = 365/401 (91.02%), Query Frame = 1

Query: 1   MSGGEKNENHEDVKKK--------PTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRS 60
           MS GEKNEN E+VKKK        PT   KRR RSPSK E +VDIEDIMFSID VQT+R+
Sbjct: 1   MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRA 60

Query: 61  SLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
           SLLDWYD S RDLPWR LDKG+PETR YGVWVSEIMLQQTRVQTVV++Y RWM +WPTVQ
Sbjct: 61  SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120

Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGA 180
           HLSRASLEEVNEMWAGLGYYRRARFL EGAK+IVKEGG FPKTV  LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGA 180

Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
           IASIAF EVVPVVDGNVIRVIARLKAISGNPKD KL+KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240

Query: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
           MELGATLCTPT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300

Query: 301 EMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
           E+LE++GTSEL Q SRFLLVKRPDEGLLAGLWEFPSV L+GEADSSTRRESI+SLLSK+F
Sbjct: 301 EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNF 360

Query: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKG 394
           GLEPKKNFEIV REDVGDF+HVF+HIRLKIYVEHLVL LKG
Sbjct: 361 GLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKG 401

BLAST of CmaCh17G013890 vs. TrEMBL
Match: W9QVM6_9ROSA (A/G-specific adenine DNA glycosylase OS=Morus notabilis GN=L484_005561 PE=4 SV=1)

HSP 1 Score: 550.1 bits (1416), Expect = 2.7e-153
Identity = 286/448 (63.84%), Positives = 346/448 (77.23%), Query Frame = 1

Query: 21  EKRRGRSPSKREPIVDIEDI--MFSIDKVQTMRSSLLDWYDLSHRDLPWR-----RLDKG 80
           E+R   S S     V  ED+  +FS  ++Q MR SLL WY L+ RDLPWR       D+ 
Sbjct: 9   ERRSSSSSSNAAAQVTEEDMEDLFSDVEIQKMRVSLLAWYGLNRRDLPWRVSLPEANDED 68

Query: 81  QPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLEEVNEMWAGLGYYR 140
             E R Y VWVSE+MLQQTRVQTVV+Y+ RWM +WPT+ HLS ASLEEVNEMWAGLGYYR
Sbjct: 69  DVEKRAYRVWVSEVMLQQTRVQTVVDYFNRWMLKWPTLLHLSTASLEEVNEMWAGLGYYR 128

Query: 141 RARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVI 200
           RAR+LLEGAK+IV EGG+FP+TV  LRK+PG+GEYTAGAIASIAF E VPVVDGNV+RVI
Sbjct: 129 RARYLLEGAKMIVSEGGQFPRTVSSLRKVPGVGEYTAGAIASIAFKEAVPVVDGNVVRVI 188

Query: 201 ARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTSPSCSTCPVF 260
           ARLKAIS NPKDS  +K+ W+ AAQLVDPS PGDFNQ LMELGAT+CTP SP+CS+CPV 
Sbjct: 189 ARLKAISANPKDSATIKKFWELAAQLVDPSNPGDFNQGLMELGATICTPLSPTCSSCPVS 248

Query: 261 DHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVVEMLENRGTSELKQCSRFLLVK 320
           D C A+SIS+ D SVLVTDYP+KG+K KQRHD+SAVCV+E+L  +G  ++   S FLLVK
Sbjct: 249 DQCRAVSISRRDRSVLVTDYPSKGMKMKQRHDFSAVCVLEVL--KGEEDMSD-SEFLLVK 308

Query: 321 RPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSFGLEPKKNFEIVIREDVGDFVH 380
           RPDEGLLAGLWEFPSVLL+GEAD   RRE++N  L   F +E +K  ++++RE VG+FVH
Sbjct: 309 RPDEGLLAGLWEFPSVLLDGEADVDNRREAMNRYLKAHFQIETRKAGKVMLREYVGEFVH 368

Query: 381 VFSHIRLKIYVEHLVLRLKGEGSKL---FRKQEKKSISWKCVDNKVMSSMGLTSSVRKVY 440
           VFSHIRL+IYVE++VL LKG G K+   FRK++ ++  WK V N V+SSMGLTSSVRKVY
Sbjct: 369 VFSHIRLRIYVEYMVLHLKG-GMKMKGAFRKRDTETPPWKYVGNDVISSMGLTSSVRKVY 428

Query: 441 DMVEKFEAEMISPSRAVATKKQRATSTN 459
            MVEKF+ + I+ S      ++R   +N
Sbjct: 429 TMVEKFKQQKIASSNPPVPSRKRNPRSN 452

BLAST of CmaCh17G013890 vs. TrEMBL
Match: K7KZX7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_07G059700 PE=4 SV=1)

HSP 1 Score: 545.4 bits (1404), Expect = 6.6e-152
Identity = 281/455 (61.76%), Positives = 347/455 (76.26%), Query Frame = 1

Query: 14  KKKPTKGEKRRG----RSPSKREPIVDIEDI----MFSIDKVQTMRSSLLDWYDLSHRDL 73
           +KK  K   RR         K +P+V++EDI     FS D+   +R +LLDWYDL+ RDL
Sbjct: 17  EKKKKKNSTRRSVVVVGESKKPQPLVEVEDIEDSLSFSKDETHKLRVALLDWYDLNRRDL 76

Query: 74  PWRRLDKGQPET---RGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLEEV 133
           PWR   K + E    R YGVWVSE+MLQQTRVQTV+ YY RWM +WPT+ HL++ASLEEV
Sbjct: 77  PWRTTFKQEDEEVERRAYGVWVSEVMLQQTRVQTVIAYYNRWMQKWPTIHHLAQASLEEV 136

Query: 134 NEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGAIASIAFDEVV 193
           NEMWAGLGYYRRARFLLEGAK IV EGG+ PK    LR IPGIGEYT+GAIASIAF EVV
Sbjct: 137 NEMWAGLGYYRRARFLLEGAKKIVAEGGQIPKVASMLRNIPGIGEYTSGAIASIAFKEVV 196

Query: 194 PVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQALMELGATLCTP 253
           PVVDGNV+RVIARL+AIS NPKDS  +K+ WK AAQLVDP RPGDFNQALMELGAT+CTP
Sbjct: 197 PVVDGNVVRVIARLRAISANPKDSATIKKFWKLAAQLVDPVRPGDFNQALMELGATVCTP 256

Query: 254 TSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVVEMLENRGTSE 313
            +PSCS+CP  + C ALS +K DS+V VTDYP KG+K KQR D+SAVCVVE++     ++
Sbjct: 257 LNPSCSSCPASEFCHALSNAKHDSTVAVTDYPVKGVKVKQRCDFSAVCVVELVGAETLNK 316

Query: 314 LKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSFGLEPKKNFEI 373
            +  S+F+LVKRP+EGLLAGLWEFPSVLL+GEA    RRE+++  L K+  ++ +K   I
Sbjct: 317 NQSSSKFILVKRPEEGLLAGLWEFPSVLLDGEAVPLARREAMDRFLEKNLKIDIRKTCNI 376

Query: 374 VIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCVDNKVMSSMGL 433
           V+RED+G+FVH+FSHIRLK+YVE LVL+LKG    LF+  + K+ +WKCV +  +SSMGL
Sbjct: 377 VLREDIGEFVHIFSHIRLKLYVELLVLQLKGVDD-LFKSPDNKT-TWKCVYSNALSSMGL 436

Query: 434 TSSVRKVYDMVEKFEAEMISPSRAVATKKQRATST 458
           T+SVRKVY+MV+ F+ + + PS  V TKK+  T+T
Sbjct: 437 TTSVRKVYNMVQNFKQKTL-PSSHVPTKKRTRTTT 468

BLAST of CmaCh17G013890 vs. TrEMBL
Match: K7KZX6_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_07G059700 PE=4 SV=1)

HSP 1 Score: 545.0 bits (1403), Expect = 8.6e-152
Identity = 280/455 (61.54%), Positives = 346/455 (76.04%), Query Frame = 1

Query: 14  KKKPTKGEKRRG----RSPSKREPIVDIEDI----MFSIDKVQTMRSSLLDWYDLSHRDL 73
           +KK  K   RR         K +P+V++EDI     FS D+   +R +LLDWYDL+ RDL
Sbjct: 17  EKKKKKNSTRRSVVVVGESKKPQPLVEVEDIEDSLSFSKDETHKLRVALLDWYDLNRRDL 76

Query: 74  PWRRLDKGQPET---RGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLEEV 133
           PWR   K + E    R YGVWVSE+MLQQTRVQTV+ YY RWM +WPT+ HL++ASLEEV
Sbjct: 77  PWRTTFKQEDEEVERRAYGVWVSEVMLQQTRVQTVIAYYNRWMQKWPTIHHLAQASLEEV 136

Query: 134 NEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGAIASIAFDEVV 193
           NEMWAGLGYYRRARFLLEGAK IV EGG+ PK    LR IPGIGEYT+GAIASIAF EVV
Sbjct: 137 NEMWAGLGYYRRARFLLEGAKKIVAEGGQIPKVASMLRNIPGIGEYTSGAIASIAFKEVV 196

Query: 194 PVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQALMELGATLCTP 253
           PVVDGNV+RVIARL+AIS NPKDS  +K+ WK AAQLVDP RPGDFNQALMELGAT+CTP
Sbjct: 197 PVVDGNVVRVIARLRAISANPKDSATIKKFWKLAAQLVDPVRPGDFNQALMELGATVCTP 256

Query: 254 TSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVVEMLENRGTSE 313
            +PSCS+CP  + C ALS +K DS+V VTDYP KG+K KQR D+SAVCVVE++     ++
Sbjct: 257 LNPSCSSCPASEFCHALSNAKHDSTVAVTDYPVKGVKVKQRCDFSAVCVVELVGAETLNK 316

Query: 314 LKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSFGLEPKKNFEI 373
            +  S+F+LVKRP+EGLLAGLWEFPSVLL+GEA    RRE+++  L K+  ++ +K   I
Sbjct: 317 NQSSSKFILVKRPEEGLLAGLWEFPSVLLDGEAVPLARREAMDRFLEKNLKIDIRKTCNI 376

Query: 374 VIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCVDNKVMSSMGL 433
           V+RED+G+FVH+FSHIRLK+YVE LVL+LK     LF+  + K+ +WKCV +  +SSMGL
Sbjct: 377 VLREDIGEFVHIFSHIRLKLYVELLVLQLKVGVDDLFKSPDNKT-TWKCVYSNALSSMGL 436

Query: 434 TSSVRKVYDMVEKFEAEMISPSRAVATKKQRATST 458
           T+SVRKVY+MV+ F+ + + PS  V TKK+  T+T
Sbjct: 437 TTSVRKVYNMVQNFKQKTL-PSSHVPTKKRTRTTT 469

BLAST of CmaCh17G013890 vs. TAIR10
Match: AT4G12740.1 (AT4G12740.1 HhH-GPD base excision DNA repair family protein)

HSP 1 Score: 474.2 bits (1219), Expect = 9.4e-134
Identity = 250/449 (55.68%), Positives = 317/449 (70.60%), Query Frame = 1

Query: 5   EKNENHEDVKKKPTKGEKRRGRSPSKREPIV--DIEDIMFSIDKVQTMRSSLLDWYDLSH 64
           E   + E+ +++  + E+         E  +  DIED+ FS ++ Q +R  LLDWYD++ 
Sbjct: 85  EAEADKEEAEEESEEEEEEEEEEAEAEEEALGGDIEDL-FSENETQKIRMGLLDWYDVNK 144

Query: 65  RDLPWR-RLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLE- 124
           RDLPWR R  + + E R Y VWVSEIMLQQTRVQTV++YYKRWM +WPT+  L +ASLE 
Sbjct: 145 RDLPWRNRRSESEKERRAYEVWVSEIMLQQTRVQTVMKYYKRWMQKWPTIYDLGQASLEN 204

Query: 125 ------------------EVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKI 184
                             EVNEMWAGLGYYRRARFLLEGAK++V     FP     L K+
Sbjct: 205 LIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRARFLLEGAKMVVAGTEGFPNQASSLMKV 264

Query: 185 PGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDP 244
            GIG+YTAGAIASIAF+E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVDP
Sbjct: 265 KGIGQYTAGAIASIAFNEAVPVVDGNVIRVLARLKAISANPKDRLTARNFWKLAAQLVDP 324

Query: 245 SRPGDFNQALMELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQ 304
           SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+++ ++ VTDYP K IK K 
Sbjct: 325 SRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQCRAFSLSEENRTISVTDYPTKVIKAKP 384

Query: 305 RHDYSAVCVVEMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRE 364
           RHD+  VCV+E+         +   RF+LVKRP++GLLAGLWEFPSV+LN EADS+TRR 
Sbjct: 385 RHDFCCVCVLEI---HNLERNQSGGRFVLVKRPEQGLLAGLWEFPSVILNEEADSATRRN 444

Query: 365 SINSLLSKS--FGLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFR 424
           +IN  L ++  F +E KK   IV RE++G+FVH+F+HIR K+YVE LV++L G    LF+
Sbjct: 445 AINVYLKEAFRFHVELKKACTIVSREELGEFVHIFTHIRRKVYVELLVVQLTGGTEDLFK 504

Query: 425 KQEKKSISWKCVDNKVMSSMGLTSSVRKV 430
            Q K +++WKCV + V+S++GLTS+VRKV
Sbjct: 505 GQAKDTLTWKCVSSDVLSTLGLTSAVRKV 529

BLAST of CmaCh17G013890 vs. NCBI nr
Match: gi|659119956|ref|XP_008459934.1| (PREDICTED: A/G-specific adenine DNA glycosylase [Cucumis melo])

HSP 1 Score: 762.3 bits (1967), Expect = 4.9e-217
Identity = 391/464 (84.27%), Positives = 416/464 (89.66%), Query Frame = 1

Query: 1   MSGGEKNENHEDVKKK--------PTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRS 60
           MS GEKNEN E+VKKK        PT   KRR RSPSK E +VDIEDIMFSID VQT+R+
Sbjct: 1   MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRA 60

Query: 61  SLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
           SLLDWYD S RDLPWR LDKG+PETR YGVWVSEIMLQQTRVQTVV++Y RWM +WPTVQ
Sbjct: 61  SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120

Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGA 180
           HLSRASLEEVNEMWAGLGYYRRARFL EGAK+IVKEGG FPKTV  LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGA 180

Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
           IASIAF EVVPVVDGNVIRVIARLKAISGNPKD KL+KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240

Query: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
           MELGATLCTPT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300

Query: 301 EMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
           E+LE++GTSEL Q SRFLLVKRPDEGLLAGLWEFPSV L+GEADSSTRRESI+SLLSK+F
Sbjct: 301 EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNF 360

Query: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCV 420
           GLEPKKNFEIV REDVGDF+HVF+HIRLKIYVEHLVL LKGEGSKLFRKQEKKSI WKCV
Sbjct: 361 GLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 420

Query: 421 DNKVMSSMGLTSSVRKVYDMVEKFEA--EMISPSRAVATKKQRA 455
           +NKVMS+MGLTSSVRK Y MVEKF+A     S SR +  KKQ++
Sbjct: 421 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS 464

BLAST of CmaCh17G013890 vs. NCBI nr
Match: gi|778711687|ref|XP_004140565.2| (PREDICTED: A/G-specific adenine DNA glycosylase [Cucumis sativus])

HSP 1 Score: 751.9 bits (1940), Expect = 6.6e-214
Identity = 386/464 (83.19%), Positives = 412/464 (88.79%), Query Frame = 1

Query: 1   MSGGEKNENHEDVKK--------KPTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRS 60
           MS GEKNEN E +KK        KPT   KRRGRSPSK E +VDIEDIMFSID VQT+R+
Sbjct: 1   MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRA 60

Query: 61  SLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
           SLLDWYD S RDLPWR LDKG+PETR YGVWVSEIMLQQTRVQTVV++Y RWM +WPTVQ
Sbjct: 61  SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120

Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGA 180
           HLSRASLEEVNEMWAGLGYYRRARFL EGAK+IVKEGG FP+TV  LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGA 180

Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
           IASIAF EVVPVVDGNVIRVIARLKAISGNPKD KL+KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240

Query: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
           MELGATLCTPT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV 300

Query: 301 EMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
           E+LE++GT EL Q SRFLLVKRPDEGLLAGLWEFPSV L+GEAD STRRESINSLLSK+F
Sbjct: 301 EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNF 360

Query: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCV 420
           GLE KKNFEIV REDVGDF+H+F+HIRLKIYVEHLVL LKGEGSKLFRKQEKKSI WKCV
Sbjct: 361 GLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 420

Query: 421 DNKVMSSMGLTSSVRKVYDMVEKFEAEMISPSR--AVATKKQRA 455
           +NKVMS+MGLTSSVRK Y MVEKF+A   S S   A+  KKQ++
Sbjct: 421 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS 464

BLAST of CmaCh17G013890 vs. NCBI nr
Match: gi|700191190|gb|KGN46394.1| (hypothetical protein Csa_6G088720 [Cucumis sativus])

HSP 1 Score: 751.9 bits (1940), Expect = 6.6e-214
Identity = 386/464 (83.19%), Positives = 412/464 (88.79%), Query Frame = 1

Query: 1   MSGGEKNENHEDVKK--------KPTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRS 60
           MS GEKNEN E +KK        KPT   KRRGRSPSK E +VDIEDIMFSID VQT+R+
Sbjct: 55  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRA 114

Query: 61  SLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
           SLLDWYD S RDLPWR LDKG+PETR YGVWVSEIMLQQTRVQTVV++Y RWM +WPTVQ
Sbjct: 115 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 174

Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGA 180
           HLSRASLEEVNEMWAGLGYYRRARFL EGAK+IVKEGG FP+TV  LRKIPGIGEYTAGA
Sbjct: 175 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGA 234

Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
           IASIAF EVVPVVDGNVIRVIARLKAISGNPKD KL+KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 235 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 294

Query: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
           MELGATLCTPT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Sbjct: 295 MELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV 354

Query: 301 EMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
           E+LE++GT EL Q SRFLLVKRPDEGLLAGLWEFPSV L+GEAD STRRESINSLLSK+F
Sbjct: 355 EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNF 414

Query: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCV 420
           GLE KKNFEIV REDVGDF+H+F+HIRLKIYVEHLVL LKGEGSKLFRKQEKKSI WKCV
Sbjct: 415 GLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 474

Query: 421 DNKVMSSMGLTSSVRKVYDMVEKFEAEMISPSR--AVATKKQRA 455
           +NKVMS+MGLTSSVRK Y MVEKF+A   S S   A+  KKQ++
Sbjct: 475 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS 518

BLAST of CmaCh17G013890 vs. NCBI nr
Match: gi|307135815|gb|ADN33687.1| (A/G-specific adenine DNA glycosylase [Cucumis melo subsp. melo])

HSP 1 Score: 684.1 bits (1764), Expect = 1.7e-193
Identity = 346/401 (86.28%), Positives = 365/401 (91.02%), Query Frame = 1

Query: 1   MSGGEKNENHEDVKKK--------PTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRS 60
           MS GEKNEN E+VKKK        PT   KRR RSPSK E +VDIEDIMFSID VQT+R+
Sbjct: 1   MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRA 60

Query: 61  SLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
           SLLDWYD S RDLPWR LDKG+PETR YGVWVSEIMLQQTRVQTVV++Y RWM +WPTVQ
Sbjct: 61  SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120

Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGA 180
           HLSRASLEEVNEMWAGLGYYRRARFL EGAK+IVKEGG FPKTV  LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGA 180

Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
           IASIAF EVVPVVDGNVIRVIARLKAISGNPKD KL+KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240

Query: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
           MELGATLCTPT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300

Query: 301 EMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
           E+LE++GTSEL Q SRFLLVKRPDEGLLAGLWEFPSV L+GEADSSTRRESI+SLLSK+F
Sbjct: 301 EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNF 360

Query: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKG 394
           GLEPKKNFEIV REDVGDF+HVF+HIRLKIYVEHLVL LKG
Sbjct: 361 GLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKG 401

BLAST of CmaCh17G013890 vs. NCBI nr
Match: gi|743928001|ref|XP_011008193.1| (PREDICTED: A/G-specific adenine DNA glycosylase [Populus euphratica])

HSP 1 Score: 558.9 bits (1439), Expect = 8.2e-156
Identity = 280/451 (62.08%), Positives = 346/451 (76.72%), Query Frame = 1

Query: 22  KRRGRSPSKREPIVDIEDIMFSIDKVQTMRSSLLDWYDLSHRDLPWRRL----------- 81
           K + +  +K++ + DIED+ FS  + Q +R+SLLDWYD + RDLPWRR+           
Sbjct: 68  KEQRQHSAKKQVVADIEDL-FSDKETQKIRASLLDWYDHNQRDLPWRRITQTKETPFKEE 127

Query: 82  DKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQHLSRASLEEVNEMWAGLG 141
           ++ + E R YGVWVSE+MLQQTRVQTV++YY RWM +WPT+ HL++ASLEEVNEMWAGLG
Sbjct: 128 EEEEEEERAYGVWVSEVMLQQTRVQTVIDYYNRWMLKWPTLHHLAQASLEEVNEMWAGLG 187

Query: 142 YYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVI 201
           YYRRARFLLEGAK+IV  G  FPK V  LRK+PGIG+YTAGAIASIAF EVVPVVDGNVI
Sbjct: 188 YYRRARFLLEGAKMIVAGGDGFPKIVSSLRKVPGIGDYTAGAIASIAFKEVVPVVDGNVI 247

Query: 202 RVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTSPSCSTC 261
           RV+ARLKAIS NPKD   VK+ WK AAQLVDP RPGDFNQ+LMELGAT+CTP +PSCS+C
Sbjct: 248 RVLARLKAISANPKDKVTVKKFWKLAAQLVDPHRPGDFNQSLMELGATVCTPVNPSCSSC 307

Query: 262 PVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVVEMLENRGTSELKQCSR-F 321
           PV   C AL+ISK D  VL+TDYPAK IK KQRH++SAVC VE+  +R   E  Q S  F
Sbjct: 308 PVSGQCRALTISKLDKLVLITDYPAKSIKLKQRHEFSAVCAVEISGSRDLIEGDQSSSVF 367

Query: 322 LLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSFGLEPKKNFEIVIREDVG 381
           LLVKRPDEGLLAGLWEFPSV+L  EAD + RR  +N  L KSF L+P+K   +++RED+G
Sbjct: 368 LLVKRPDEGLLAGLWEFPSVMLGKEADLTRRRNEMNRFLKKSFRLDPQKTCSVLLREDIG 427

Query: 382 DFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCVDNKVMSSMGLTSSVRKV 441
           +F+H+F+HIRLK+YVE L++ LKG+ S LF KQ  ++++WKCVD K +SS+GLTS VRKV
Sbjct: 428 EFIHIFTHIRLKVYVELLIVHLKGDMSDLFSKQSGENMTWKCVDRKALSSLGLTSGVRKV 487

Query: 442 YDMVEKFEAEMISPSRAVATKKQRATSTNLS 461
             MV+KF+ + +S   A A K+  +  T  S
Sbjct: 488 CTMVQKFKQKSLSTVSAAARKRTNSKKTGSS 517

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
MUTYH_ARATH  1.7e-132  55.68     Adenine DNA glycosylase OS=Arabidopsis thaliana GN=MYH PE=3 SV=1
MUTYH_MOUSE  2.4e-83   42.92     Adenine DNA glycosylase OS=Mus musculus GN=Mutyh PE=2 SV=2
MUTYH_RAT    3.9e-81   41.46     Adenine DNA glycosylase OS=Rattus norvegicus GN=Mutyh PE=2 SV=1
MUTYH_HUMAN  1.4e-67   57.53     Adenine DNA glycosylase OS=Homo sapiens GN=MUTYH PE=1 SV=1
MYH1_SCHPO   9.1e-54   32.54     Adenine DNA glycosylase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) G...

Match Name        E-value   Identity  Description
A0A0A0KC27_CUCSA  4.6e-214  83.19     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G088720 PE=4 SV=1
E5GB45_CUCME      1.2e-193  86.28     A/G-specific adenine DNA glycosylase OS=Cucumis melo subsp. melo PE=4 SV=1
W9QVM6_9ROSA      2.7e-153  63.84     A/G-specific adenine DNA glycosylase OS=Morus notabilis GN=L484_005561 PE=4 SV=1
K7KZX7_SOYBN      6.6e-152  61.76     Uncharacterized protein OS=Glycine max GN=GLYMA_07G059700 PE=4 SV=1
K7KZX6_SOYBN      8.6e-152  61.54     Uncharacterized protein OS=Glycine max GN=GLYMA_07G059700 PE=4 SV=1

Match Name   E-value   Identity  Description
AT4G12740.1  9.4e-134  55.68     HhH-GPD base excision DNA repair family protein

Match Name                        E-value   Identity  Description
gi|659119956|ref|XP_008459934.1|  4.9e-217  84.27     PREDICTED: A/G-specific adenine DNA glycosylase [Cucumis melo]
gi|778711687|ref|XP_004140565.2|  6.6e-214  83.19     PREDICTED: A/G-specific adenine DNA glycosylase [Cucumis sativus]
gi|700191190|gb|KGN46394.1|       6.6e-214  83.19     hypothetical protein Csa_6G088720 [Cucumis sativus]
gi|307135815|gb|ADN33687.1|       1.7e-193  86.28     A/G-specific adenine DNA glycosylase [Cucumis melo subsp. melo]
gi|743928001|ref|XP_011008193.1|  8.2e-156  62.08     PREDICTED: A/G-specific adenine DNA glycosylase [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000445  HhH_motif
IPR003265  HhH-GPD_domain
IPR004036  Endonuclease-III-like_CS2
IPR005760  A/G_AdeGlyc_MutY
IPR011257  DNA_glycosylase
IPR015797  NUDIX_hydrolase-like_dom_sf
IPR023170  HTH_base_excis_C
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0019104  DNA N-glycosylase activity
GO:0003824  catalytic activity
GO:0016787  hydrolase activity
Vocabulary: Biological Process
Term        Definition
GO:0006284  base-excision repair
GO:0006281  DNA repair
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0006306 DNA methylation
biological_process GO:0006281 DNA repair
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0051539 4 iron, 4 sulfur cluster binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0019104 DNA N-glycosylase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0020037 heme binding
molecular_function GO:0005506 iron ion binding
molecular_function GO:0004497 monooxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh17G013890.1  CmaCh17G013890.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                                         Source    Source Term         Source Description                                 Alignment
IPR000445  Helix-hairpin-helix motif                               PFAM      PF00633             HHH                                                coord: 149..176; score: 4.
IPR003265  HhH-GPD domain                                          PFAM      PF00730             HhH-GPD                                            coord: 84..215; score: 4.1
IPR003265  HhH-GPD domain                                          SMART     SM00478             endo3end                                           coord: 88..238; score: 1.3
IPR004036  Endonuclease III-like, conserved site-2                 PROSITE   PS01155             ENDONUCLEASE_III_2                                 coord: 150..179; scor
IPR005760  A/G-specific adenine glycosylase MutY                   TIGRFAMs  TIGR01084           TIGR01084                                          coord: 50..329; score: 2.7
IPR011257  DNA glycosylase                                         GENE3D    G3DSA:1.10.340.30                                                      coord: 75..170; score: 1.4
IPR011257  DNA glycosylase                                         unknown   SSF48150            DNA-glycosylase                                    coord: 47..259; score: 1.73
IPR015797  NUDIX hydrolase domain-like                             GENE3D    G3DSA:3.90.79.10                                                       coord: 306..435; score: 9.7
IPR015797  NUDIX hydrolase domain-like                             unknown   SSF55811            Nudix                                              coord: 274..430; score: 1.97
IPR023170  Helix-turn-helix, base-excision DNA repair, C-terminal  GENE3D    G3DSA:1.10.1670.10                                                     coord: 171..260; score: 1.2
None       No IPR available                                        PANTHER   PTHR10359           A/G-SPECIFIC ADENINE GLYCOSYLASE/ENDONUCLEASE III  coord: 35..440; score: 1.3E
None       No IPR available                                        PANTHER   PTHR10359:SF1       A/G-SPECIFIC ADENINE DNA GLYCOSYLASE               coord: 35..440; score: 1.3E
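
Each Alignment entry gives the matched region on the protein as `coord: start..end`. The sketch below shows one way to extract those coordinates and derive the length of the matched region. It assumes the coordinates are 1-based and inclusive (typical for InterPro-style reports, but not stated on this page), and `match_span` is a hypothetical helper name.

```python
import re

# Assumption: "coord: A..B" is a 1-based, inclusive residue range.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def match_span(text):
    """Return (start, end, length) for the first 'coord: A..B' in text, or None."""
    m = COORD.search(text)
    if m is None:
        return None
    start, end = int(m.group(1)), int(m.group(2))
    # Inclusive range, so the length is end - start + 1.
    return start, end, end - start + 1

# Entries copied from the table above.
for entry in ("PF00633 HHH coord: 149..176", "TIGR01084 coord: 50..329"):
    print(entry, "->", match_span(entry))
```

Under that assumption, the helix-hairpin-helix match at 149..176 covers 28 residues and the TIGR01084 (MutY) match at 50..329 covers 280.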