BLAST of Cla97C06G110910 vs. NCBI nr
Match:
XP_004140565.2 (PREDICTED: A/G-specific adenine DNA glycosylase [Cucumis sativus])
HSP 1 Score: 814.3 bits (2102), Expect = 2.1e-232
Identity = 417/464 (89.87%), Positives = 439/464 (94.61%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MS GEKNEN+E++K+NTDFRR+KKPT ERKRRGRSPSK EA VDIEDIMFSIDNVQ IRA
Sbjct: 1 MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRA 60
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYDRS RDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQ
Sbjct: 61 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEGGRFP+TVS+LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGA 180
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAF EVVPVVDGNVIRVIARLKAI GNPKDPKL KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYSAV VV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV 300
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILE+QGT +L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEADLSTRRESINSLLSK F
Sbjct: 301 EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNF 360
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLE KKNFEIV RE+VGDFIH+FTHIRLKIYVEHLVLCLKGEGSKLF+KQEKKSILWKCV
Sbjct: 361 GLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 420
Query: 421 DSEVMSSMGLTSSVRKAYAMVEKFQAEKTSSSR--AVPRKKQKA 463
+++VMS+MGLTSSVRKAYAMVEKFQA KTSSS A+PRKKQK+
Sbjct: 421 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS 464
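The percentages in each HSP summary are plain ratios over the alignment length, for example 417 identical positions out of 464 aligned columns. As a minimal sketch, assuming the two summary lines have exactly the layout shown in this report, they could be parsed and the percentages recomputed as follows. The function name `parse_hsp` and the dictionary keys are hypothetical, chosen for illustration only, and the pattern tolerates both the "Positives" and "Postives" spellings.

```python
import re

# Layout assumed from this report: "HSP 1 Score: 814.3 bits (2102), Expect = 2.1e-232"
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# Layout assumed from this report: "Identity = 417/464 (89.87%), Positives = 439/464 (94.61%), ..."
STATS_RE = re.compile(r"Identity = (\d+)/(\d+) .*?Posi?tives = (\d+)/(\d+)")


def parse_hsp(score_line: str, stats_line: str) -> dict:
    """Extract HSP statistics and recompute the percentages (hypothetical helper)."""
    score = SCORE_RE.search(score_line)
    stats = STATS_RE.search(stats_line)
    if score is None or stats is None:
        raise ValueError("line does not match the assumed HSP summary layout")
    identities, align_len, positives, _ = (int(g) for g in stats.groups())
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        # Percent identity = identical columns / aligned columns (gaps included).
        "pct_identity": 100.0 * identities / align_len,
        "pct_positives": 100.0 * positives / align_len,
    }


if __name__ == "__main__":
    hsp = parse_hsp(
        "HSP 1 Score: 814.3 bits (2102), Expect = 2.1e-232",
        "Identity = 417/464 (89.87%), Positives = 439/464 (94.61%), Query Frame = 0",
    )
    print(f"{hsp['pct_identity']:.2f}% identity over {hsp['align_len']} columns")
```

Recomputing the ratios rather than trusting the printed percentages gives a cheap consistency check when many hits are processed in bulk.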
BLAST of Cla97C06G110910 vs. NCBI nr
Match:
KGN46394.1 (hypothetical protein Csa_6G088720 [Cucumis sativus])
HSP 1 Score: 784.6 bits (2025), Expect = 1.8e-223
Identity = 405/464 (87.28%), Positives = 426/464 (91.81%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MS GEKNEN+E++K+NTD GRSPSK EA VDIEDIMFSIDNVQ IRA
Sbjct: 55 MSDGEKNENDEYMKKNTDXXXXXXXXXXXXXXGRSPSKSEAVVDIEDIMFSIDNVQTIRA 114
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYDRS RDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQ
Sbjct: 115 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 174
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEGGRFP+TVS+LRKIPGIGEYTAGA
Sbjct: 175 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGA 234
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAF EVVPVVDGNVIRVIARLKAI GNPKDPKL KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 235 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 294
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYSAV VV
Sbjct: 295 MELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV 354
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILE+QGT +L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEADLSTRRESINSLLSK F
Sbjct: 355 EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNF 414
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLE KKNFEIV RE+VGDFIH+FTHIRLKIYVEHLVLCLKGEGSKLF+KQEKKSILWKCV
Sbjct: 415 GLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 474
Query: 421 DSEVMSSMGLTSSVRKAYAMVEKFQAEKTSSSR--AVPRKKQKA 463
+++VMS+MGLTSSVRKAYAMVEKFQA KTSSS A+PRKKQK+
Sbjct: 475 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS 518
BLAST of Cla97C06G110910 vs. NCBI nr
Match:
XP_023547668.1 (adenine DNA glycosylase [Cucurbita pepo subsp. pepo])
HSP 1 Score: 775.8 bits (2002), Expect = 8.3e-221
Identity = 400/462 (86.58%), Positives = 423/462 (91.56%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MSGGEKNENEE VK KKPTK KRRGRSPSKRE DIEDIMFSID VQ +R+
Sbjct: 1 MSGGEKNENEEDVK--------KKPTKGEKRRGRSPSKREPIADIEDIMFSIDKVQTMRS 60
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYD S RDLPWRRLDKGQP+TR YGVWVSEIMLQQTRVQTVV +Y RWM +WPTVQ
Sbjct: 61 SLLDWYDLSHRDLPWRRLDKGQPQTRGYGVWVSEIMLQQTRVQTVVEYYKRWMHKWPTVQ 120
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFLLEGAK+IVKEGG FPKTVSALRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVSALRKIPGIGEYTAGA 180
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAFDEVVPVVDGNVIRVIARLKAI GNPKD KL KQVWKAAAQLVDPSRPGDFNQAL
Sbjct: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VV
Sbjct: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILENQG+S+L+QSSRFLLVKRPDEGLLAGLWEFPSVLL GEAD STRRES+NSLLSK F
Sbjct: 301 EILENQGSSELKQSSRFLLVKRPDEGLLAGLWEFPSVLLKGEADSSTRRESMNSLLSKSF 360
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLEPKKNF+IVIRE+VGDFIHVF+HIRLKIYVEHLVL LKGEGSKLF+KQEKKSI WKCV
Sbjct: 361 GLEPKKNFDIVIREDVGDFIHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCV 420
Query: 421 DSEVMSSMGLTSSVRKAYAMVEKFQAEKTSSSRAVPRKKQKA 463
D++VMSS+GLTSSVRK YAMVEKF+A+K S RAV KKQ+A
Sbjct: 421 DNKVMSSVGLTSSVRKVYAMVEKFEADKISPIRAVATKKQRA 454
BLAST of Cla97C06G110910 vs. NCBI nr
Match:
XP_022953220.1 (adenine DNA glycosylase [Cucurbita moschata])
HSP 1 Score: 773.9 bits (1997), Expect = 3.2e-220
Identity = 400/461 (86.77%), Positives = 420/461 (91.11%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MSGGEK+ENEE VK KKPTK KRRGRSPSKRE DIEDIMFSID VQ +R+
Sbjct: 1 MSGGEKSENEEDVK--------KKPTKGEKRRGRSPSKREPITDIEDIMFSIDKVQTMRS 60
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
LL+WYD S RDLPWRRLDKGQPETR YGVWVSEIMLQQTRVQTVV +Y RWM RWPTVQ
Sbjct: 61 PLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFLLEGAK+IVKEGG FPKTVSALRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVSALRKIPGIGEYTAGA 180
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAFDEVVPVVDGNVIRVIARLKAI NPKD KL KQVWKAAAQLVDPSRPGDFNQAL
Sbjct: 181 IASIAFDEVVPVVDGNVIRVIARLKAISRNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PT+PSCSTCPVFDHCEALS SK DSSVLVTDYPAKG+KTKQRHDYSAV VV
Sbjct: 241 MELGATLCTPTSPSCSTCPVFDHCEALSSSKDDSSVLVTDYPAKGVKTKQRHDYSAVCVV 300
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILENQGTS+L+QSSRFLLVKRPDEGLLAGLWEFPSVLL+GEAD STRRESINSLLSK F
Sbjct: 301 EILENQGTSELKQSSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLEPKKNFEIVIRE+VGDF+HVF+HIRLKIYVEHLVL LKGEGSKLF+KQEKKSI WKCV
Sbjct: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCV 420
Query: 421 DSEVMSSMGLTSSVRKAYAMVEKFQAEKTSSSRAVPRKKQK 462
D++VMSSMGLTSSVRK YAMVEKF+AEK S S AV KKQ+
Sbjct: 421 DNKVMSSMGLTSSVRKVYAMVEKFEAEKISPSPAVATKKQR 453
BLAST of Cla97C06G110910 vs. NCBI nr
Match:
XP_022991840.1 (adenine DNA glycosylase [Cucurbita maxima])
HSP 1 Score: 773.5 bits (1996), Expect = 4.1e-220
Identity = 400/462 (86.58%), Positives = 420/462 (90.91%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MSGGEKNEN E VK KKPTK KRRGRSPSKRE VDIEDIMFSID VQ +R+
Sbjct: 1 MSGGEKNENHEDVK--------KKPTKGEKRRGRSPSKREPIVDIEDIMFSIDKVQTMRS 60
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYD S RDLPWRRLDKGQPETR YGVWVSEIMLQQTRVQTVV +Y RWM RWPTVQ
Sbjct: 61 SLLDWYDLSHRDLPWRRLDKGQPETRGYGVWVSEIMLQQTRVQTVVEYYKRWMHRWPTVQ 120
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFLLEGAK+IVKEGG FPKTV LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKLIVKEGGEFPKTVPDLRKIPGIGEYTAGA 180
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAFDEVVPVVDGNVIRVIARLKAI GNPKD KL KQVWKAAAQLVDPSRPGDFNQAL
Sbjct: 181 IASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLVKQVWKAAAQLVDPSRPGDFNQAL 240
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VV
Sbjct: 241 MELGATLCTPTSPSCSTCPVFDHCEALSISKDDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
E+LEN+GTS+L+Q SRFLLVKRPDEGLLAGLWEFPSVLL+GEAD STRRESINSLLSK F
Sbjct: 301 EMLENRGTSELKQCSRFLLVKRPDEGLLAGLWEFPSVLLNGEADSSTRRESINSLLSKSF 360
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLEPKKNFEIVIRE+VGDF+HVF+HIRLKIYVEHLVL LKGEGSKLF+KQEKKSI WKCV
Sbjct: 361 GLEPKKNFEIVIREDVGDFVHVFSHIRLKIYVEHLVLRLKGEGSKLFRKQEKKSISWKCV 420
Query: 421 DSEVMSSMGLTSSVRKAYAMVEKFQAEKTSSSRAVPRKKQKA 463
D++VMSSMGLTSSVRK Y MVEKF+AE S SRAV KKQ+A
Sbjct: 421 DNKVMSSMGLTSSVRKVYDMVEKFEAEMISPSRAVATKKQRA 454
BLAST of Cla97C06G110910 vs. TrEMBL
Match:
tr|A0A0A0KC27|A0A0A0KC27_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G088720 PE=4 SV=1)
HSP 1 Score: 784.6 bits (2025), Expect = 1.2e-223
Identity = 405/464 (87.28%), Positives = 426/464 (91.81%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MS GEKNEN+E++K+NTD GRSPSK EA VDIEDIMFSIDNVQ IRA
Sbjct: 55 MSDGEKNENDEYMKKNTDXXXXXXXXXXXXXXGRSPSKSEAVVDIEDIMFSIDNVQTIRA 114
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYDRS RDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQ
Sbjct: 115 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 174
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEGGRFP+TVS+LRKIPGIGEYTAGA
Sbjct: 175 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGA 234
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAF EVVPVVDGNVIRVIARLKAI GNPKDPKL KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 235 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 294
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYSAV VV
Sbjct: 295 MELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV 354
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILE+QGT +L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEADLSTRRESINSLLSK F
Sbjct: 355 EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNF 414
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLE KKNFEIV RE+VGDFIH+FTHIRLKIYVEHLVLCLKGEGSKLF+KQEKKSILWKCV
Sbjct: 415 GLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 474
Query: 421 DSEVMSSMGLTSSVRKAYAMVEKFQAEKTSSSR--AVPRKKQKA 463
+++VMS+MGLTSSVRKAYAMVEKFQA KTSSS A+PRKKQK+
Sbjct: 475 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS 518
BLAST of Cla97C06G110910 vs. TrEMBL
Match:
tr|A0A1S3CBT2|A0A1S3CBT2_CUCME (adenine DNA glycosylase isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498904 PE=4 SV=1)
HSP 1 Score: 757.7 bits (1955), Expect = 1.5e-215
Identity = 397/464 (85.56%), Positives = 413/464 (89.01%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MS GEKNENEE EA VDIEDIMFSIDNVQ IRA
Sbjct: 1 MSDGEKNENEEXXXXXXXXXXXXXXXXXXXXXXXXXXXSEAVVDIEDIMFSIDNVQTIRA 60
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYDRS RDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQ
Sbjct: 61 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEGGRFPKTVS+LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGA 180
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAF EVVPVVDGNVIRVIARLKAI GNPKDPKL KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILE+QGTS+L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEAD STRRESI+SLLSK F
Sbjct: 301 EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNF 360
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLEPKKNFEIV RE+VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLF+KQEKKSILWKCV
Sbjct: 361 GLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 420
Query: 421 DSEVMSSMGLTSSVRKAYAMVEKFQAEKT--SSSRAVPRKKQKA 463
+++VMS+MGLTSSVRKAYAMVEKFQA KT SSSR +P KKQK+
Sbjct: 421 ENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS 464
BLAST of Cla97C06G110910 vs. TrEMBL
Match:
tr|A0A1S4E2J7|A0A1S4E2J7_CUCME (adenine DNA glycosylase isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498904 PE=4 SV=1)
HSP 1 Score: 728.8 bits (1880), Expect = 7.7e-207
Identity = 376/436 (86.24%), Positives = 390/436 (89.45%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MS GEKNENEE EA VDIEDIMFSIDNVQ IRA
Sbjct: 1 MSDGEKNENEEXXXXXXXXXXXXXXXXXXXXXXXXXXXSEAVVDIEDIMFSIDNVQTIRA 60
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYDRS RDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQ
Sbjct: 61 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEGGRFPKTVS+LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGA 180
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAF EVVPVVDGNVIRVIARLKAI GNPKDPKL KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILE+QGTS+L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEAD STRRESI+SLLSK F
Sbjct: 301 EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNF 360
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCV 420
GLEPKKNFEIV RE+VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLF+KQEKKSILWKCV
Sbjct: 361 GLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCV 420
Query: 421 DSEVMSSMGLTSSVRK 437
+++VMS+MGLTSSVRK
Sbjct: 421 ENKVMSTMGLTSSVRK 436
BLAST of Cla97C06G110910 vs. TrEMBL
Match:
tr|E5GB45|E5GB45_CUCME (A/G-specific adenine DNA glycosylase OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)
HSP 1 Score: 669.5 bits (1726), Expect = 5.5e-189
Identity = 346/401 (86.28%), Positives = 355/401 (88.53%), Query Frame = 0
Query: 1 MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREADVDIEDIMFSIDNVQIIRA 60
MS GEKNENEE EA VDIEDIMFSIDNVQ IRA
Sbjct: 1 MSDGEKNENEEXXXXXXXXXXXXXXXXXXXXXXXXXXXSEAVVDIEDIMFSIDNVQTIRA 60
Query: 61 SLLEWYDRSCRDLPWRRLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 120
SLL+WYDRS RDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQ
Sbjct: 61 SLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQ 120
Query: 121 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGA 180
HLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEGGRFPKTVS+LRKIPGIGEYTAGA
Sbjct: 121 HLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGA 180
Query: 181 IASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQAL 240
IASIAF EVVPVVDGNVIRVIARLKAI GNPKDPKL KQVWKAAAQLVD SRPGDFNQAL
Sbjct: 181 IASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQAL 240
Query: 241 MELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVV 300
MELGATLC+PTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VV
Sbjct: 241 MELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV 300
Query: 301 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 360
EILE+QGTS+L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEAD STRRESI+SLLSK F
Sbjct: 301 EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNF 360
Query: 361 GLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKG 402
GLEPKKNFEIV RE+VGDFIHVFTHIRLKIYVEHLVLCLKG
Sbjct: 361 GLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKG 401
BLAST of Cla97C06G110910 vs. TrEMBL
Match:
tr|A0A2I4EXQ2|A0A2I4EXQ2_9ROSI (adenine DNA glycosylase OS=Juglans regia OX=51240 GN=LOC108993645 PE=4 SV=1)
HSP 1 Score: 573.5 bits (1477), Expect = 4.1e-160
Identity = 296/448 (66.07%), Positives = 356/448 (79.46%), Query Frame = 0
Query: 24 KPTKERKRRGRSPSKREADVDIEDIM---FSIDNVQIIRASLLEWYDRSCRDLPWR--RL 83
+P + + R SP++++ + DIEDI+ FS D Q IR LLEWYD + RDLPWR +
Sbjct: 67 RPRRVARERPHSPTEKQEEDDIEDIVKWSFSGDETQKIRECLLEWYDLNKRDLPWRSKKS 126
Query: 84 DKGQP----ETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMW 143
D P E RAYGVWVSE+MLQQTRVQTV+ +YNRWM +WPT+ LS+ASLEEVNEMW
Sbjct: 127 DSSSPSQPQEERAYGVWVSEVMLQQTRVQTVIQYYNRWMHKWPTLHLLSQASLEEVNEMW 186
Query: 144 AGLGYYRRARFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVD 203
AGLGYYRRARFLLEGAKM+V GGRFPKTVS LRKI GIG+YTAGAIASIAF EVVPVVD
Sbjct: 187 AGLGYYRRARFLLEGAKMVVAGGGRFPKTVSELRKIRGIGDYTAGAIASIAFGEVVPVVD 246
Query: 204 GNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQALMELGATLCSPTNPS 263
GNVIRVI RL+AI NPKD K++WK AAQLVDP RPGD NQALMELGAT+C+P NPS
Sbjct: 247 GNVIRVITRLRAISANPKDLVTVKKIWKLAAQLVDPIRPGDLNQALMELGATVCTPLNPS 306
Query: 264 CSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQS 323
CS+CP HC ALS+S HDS V VTD+PAKG+K KQRHD+SAV VVE+L Q T + QS
Sbjct: 307 CSSCPASGHCHALSVSGHDSLVQVTDFPAKGVKVKQRHDFSAVCVVELLGGQKTLEGNQS 366
Query: 324 -SRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIR 383
SRFLLVKRPDEGLLAGLWEFPSVLLDG+A L+TRRE+I+ L K FG++ KK IV+R
Sbjct: 367 DSRFLLVKRPDEGLLAGLWEFPSVLLDGDAGLATRREAIDHFLEKKFGIDSKKTGNIVVR 426
Query: 384 EEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSS 443
++VG+F+H+FTHIRLKI+VE LV+ LKG + LF KQ+K+++ WKCVD + +SS+GLTS+
Sbjct: 427 KDVGEFVHMFTHIRLKIFVELLVVRLKGRKNDLFGKQDKEAMHWKCVDGDALSSLGLTSA 486
Query: 444 VRKAYAMVEKFQAEKTSSSRAVPRKKQK 462
VRKAY MV+KF+ EK S++ RK+ +
Sbjct: 487 VRKAYIMVQKFKQEKLSNNFTPSRKRNR 514
BLAST of Cla97C06G110910 vs. Swiss-Prot
Match:
sp|F4JRF4|MUTYH_ARATH (Adenine DNA glycosylase OS=Arabidopsis thaliana OX=3702 GN=MYH PE=3 SV=1)
HSP 1 Score: 464.2 bits (1193), Expect = 1.7e-129
Identity = 244/415 (58.80%), Positives = 298/415 (71.81%), Query Frame = 0
Query: 44 DIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWR-RLDKGQPETRAYGVWVSEIMLQQTRV 103
DIED +FS + Q IR LL+WYD + RDLPWR R + + E RAY VWVSEIMLQQTRV
Sbjct: 118 DIED-LFSENETQKIRMGLLDWYDVNKRDLPWRNRRSESEKERRAYEVWVSEIMLQQTRV 177
Query: 104 QTVVHFYNRWMLRWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRA 163
QTV+ +Y RWM +WPT+ L +ASLE EVNEMWAGLGYYRRA
Sbjct: 178 QTVMKYYKRWMQKWPTIYDLGQASLENLIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRA 237
Query: 164 RFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIAR 223
RFLLEGAKM+V FP S+L K+ GIG+YTAGAIASIAF+E VPVVDGNVIRV+AR
Sbjct: 238 RFLLEGAKMVVAGTEGFPNQASSLMKVKGIGQYTAGAIASIAFNEAVPVVDGNVIRVLAR 297
Query: 224 LKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDH 283
LKAI NPKD + WK AAQLVDPSRPGDFNQ+LMELGATLC+ + PSCS+CPV
Sbjct: 298 LKAISANPKDRLTARNFWKLAAQLVDPSRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQ 357
Query: 284 CEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRP 343
C A S+S+ + ++ VTDYP K IK K RHD+ V V+EI + + RF+LVKRP
Sbjct: 358 CRAFSLSEENRTISVTDYPTKVIKAKPRHDFCCVCVLEI---HNLERNQSGGRFVLVKRP 417
Query: 344 DEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSK--YFGLEPKKNFEIVIREEVGDFIH 403
++GLLAGLWEFPSV+L+ EAD +TRR +IN L + F +E KK IV REE+G+F+H
Sbjct: 418 EQGLLAGLWEFPSVILNEEADSATRRNAINVYLKEAFRFHVELKKACTIVSREELGEFVH 477
Query: 404 VFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK 437
+FTHIR K+YVE LV+ L G LF+ Q K ++ WKCV S+V+S++GLTS+VRK
Sbjct: 478 IFTHIRRKVYVELLVVQLTGGTEDLFKGQAKDTLTWKCVSSDVLSTLGLTSAVRK 528
BLAST of Cla97C06G110910 vs. Swiss-Prot
Match:
sp|Q99P21|MUTYH_MOUSE (Adenine DNA glycosylase OS=Mus musculus OX=10090 GN=Mutyh PE=1 SV=2)
HSP 1 Score: 303.1 bits (775), Expect = 5.2e-81
Identity = 182/431 (42.23%), Positives = 245/431 (56.84%), Query Frame = 0
Query: 22 EKKPTKERKRRGRSPS---------------KRE----ADVDIEDIMFSIDNVQIIRASL 81
+K+P ++RR R+ S KRE A V + + +V R++L
Sbjct: 12 KKQPANHKRRRTRALSSSQAKPSSLDGLAKQKREELLQASVSPYHLFSDVADVTAFRSNL 71
Query: 82 LEWYDRSCRDLPWRRL--DKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQ 141
L WYD+ RDLPWR L ++ + RAY VWVSE+MLQQT+V TV+ +Y RWM +WP +Q
Sbjct: 72 LSWYDQEKRDLPWRNLAKEEANSDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQ 131
Query: 142 HLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKE-GGRFPKTVSALRK-IPGIGEYTA 201
L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG P+T L++ +PG+G YTA
Sbjct: 132 DLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTA 191
Query: 202 GAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQ 261
GAIASIAFD+V VVDGNV+RV+ R++AI +P ++ +W A QLVDP+RPGDFNQ
Sbjct: 192 GAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPTSTLVSHHLWNLAQQLVDPARPGDFNQ 251
Query: 262 ALMELGATLCSPTNPSCSTCPVFDHCEA-------------------------------- 321
A MELGAT+C+P P CS CPV C A
Sbjct: 252 AAMELGATVCTPQRPLCSHCPVQSLCRAYQRVQRGQLSALPGRPDIEECALNTRQCQLCL 311
Query: 322 LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEG 381
S S D S+ V ++P K + R +YSA VVE G LLV+RPD G
Sbjct: 312 TSSSPWDPSMGVANFPRKASRRPPREEYSATCVVEQPGAIG------GPLVLLVQRPDSG 371
Query: 382 LLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHI 398
LLAGLWEFPSV L E + +++ L ++ G P + + +G+ IH+F+HI
Sbjct: 372 LLAGLWEFPSVTL--EPSEQHQHKALLQELQRWCGPLP-----AIRLQHLGEVIHIFSHI 429
BLAST of Cla97C06G110910 vs. Swiss-Prot
Match:
sp|Q9UIF7|MUTYH_HUMAN (Adenine DNA glycosylase OS=Homo sapiens OX=9606 GN=MUTYH PE=1 SV=1)
HSP 1 Score: 295.4 bits (755), Expect = 1.1e-78
Identity = 172/397 (43.32%), Positives = 231/397 (58.19%), Query Frame = 0
Query: 40 EADVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRRL--DKGQPETRAYGVWVSEIML 99
+A V + + V R SLL WYD+ RDLPWRR D+ + RAY VWVSE+ML
Sbjct: 75 QASVSSYHLFRDVAEVTAFRGSLLSWYDQEKRDLPWRRRAEDEMDLDRRAYAVWVSEVML 134
Query: 100 QQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKE- 159
QQT+V TV+++Y WM +WPT+Q L+ ASLEEVN++WAGLGYY R R L EGA+ +V+E
Sbjct: 135 QQTQVATVINYYTGWMQKWPTLQDLASASLEEVNQLWAGLGYYSRGRRLQEGARKVVEEL 194
Query: 160 GGRFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPK 219
GG P+T L++ +PG+G YTAGAIASIAF + VVDGNV RV+ R++AI +P
Sbjct: 195 GGHMPRTAETLQQLLPGVGRYTAGAIASIAFGQATGVVDGNVARVLCRVRAIGADPSSTL 254
Query: 220 LNKQVWKAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEA--------- 279
+++Q+W A QLVDP+RPGDFNQA MELGAT+C+P P CS CPV C A
Sbjct: 255 VSQQLWGLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSQCPVESLCRARQRVEQEQL 314
Query: 280 -----LSISKH---------------------DSSVLVTDYPAKGIKTKQRHDYSAVSVV 339
LS S D ++ V ++P K + R + SA V
Sbjct: 315 LASGSLSGSPDVEECAPNTGQCHLCLPPSEPWDQTLGVVNFPRKASRKPPREESSATCV- 374
Query: 340 EILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYF 398
LE G ++ LLV+RP+ GLLAGLWEFPSV + L +R+++ L ++
Sbjct: 375 --LEQPGA----LGAQILLVQRPNSGLLAGLWEFPSVTWEPSEQL--QRKALLQELQRWA 434
BLAST of Cla97C06G110910 vs. Swiss-Prot
Match:
sp|Q8R5G2|MUTYH_RAT (Adenine DNA glycosylase OS=Rattus norvegicus OX=10116 GN=Mutyh PE=2 SV=1)
HSP 1 Score: 291.2 bits (744), Expect = 2.0e-77
Identity = 182/440 (41.36%), Positives = 240/440 (54.55%), Query Frame = 0
Query: 14 KQNTDFRREKKPTKERKRRGR----------------SPSKRE----ADVDIEDIMFSID 73
K R KK KRRG+ + KRE V + I
Sbjct: 3 KLRASVRSHKKQPANHKRRGKCALSSSQAKPSGLDGLAKQKREELLKTPVSPYHLFSDIA 62
Query: 74 NVQIIRASLLEWYDRSCRDLPWRRLDKGQP--ETRAYGVWVSEIMLQQTRVQTVVHFYNR 133
+V R +LL WYD+ RDLPWR+ K + + RAY VWVSE+MLQQT+V TV+ +Y R
Sbjct: 63 DVTAFRRNLLSWYDQEKRDLPWRKRVKEETNLDRRAYAVWVSEVMLQQTQVATVIDYYTR 122
Query: 134 WMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKE-GGRFPKTVSALRK- 193
WM +WPT+Q L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG P+T L++
Sbjct: 123 WMQKWPTLQDLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVEELGGHVPRTAETLQQL 182
Query: 194 IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQVWKAAAQLVD 253
+PG+G YTAGAIASIAFD+V VVDGNVIRV+ R++AI +P ++ +W A QLVD
Sbjct: 183 LPGVGRYTAGAIASIAFDQVTGVVDGNVIRVLCRVRAIGADPTSSFVSHHLWDLAQQLVD 242
Query: 254 PSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEA----------------------- 313
P+RPGDFNQA MELGAT+C+P P C+ CPV C A
Sbjct: 243 PARPGDFNQAAMELGATVCTPQRPLCNHCPVQSLCRAHQRVGQGRLSALPGSPDIEECAL 302
Query: 314 ---------LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRF 373
S + D ++ V ++P K + R +YSA VVE G
Sbjct: 303 NTRQCQLCLPSTNPWDPNMGVVNFPRKASRRPPREEYSATCVVEQPGATG------GPLI 362
Query: 374 LLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVG 398
LLV+RP+ GLLAGLWEFPSV L E + +++ L + P + +G
Sbjct: 363 LLVQRPNSGLLAGLWEFPSVTL--EPSGQHQHKALLQELQHWSAPLPTTPL-----QHLG 422
BLAST of Cla97C06G110910 vs. Swiss-Prot
Match:
sp|Q10159|MYH1_SCHPO (Adenine DNA glycosylase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=myh1 PE=1 SV=1)
HSP 1 Score: 224.2 bits (570), Expect = 3.0e-57
Identity = 139/385 (36.10%), Positives = 205/385 (53.25%), Query Frame = 0
Query: 55 VQIIRASLLEWYDRSCRDLPWRRL------------DKGQPETRAYGVWVSEIMLQQTRV 114
V+ R SL+++YD++ R LPWR+ D QP R Y V VSEIMLQQTRV
Sbjct: 18 VERFRESLIQFYDKTKRILPWRKKECIPPSEDSPLEDWEQPVQRLYEVLVSEIMLQQTRV 77
Query: 115 QTVVHFYNRWMLRWPTVQHLSRASLE-EVNEMWAGLGYYRRARFLLEGAKMIVK-EGGRF 174
+TV +Y +WM PT++ + A +V +W+G+G+Y R + L + + + K
Sbjct: 78 ETVKRYYTKWMETLPTLKSCAEAEYNTQVMPLWSGMGFYTRCKRLHQACQHLAKLHPSEI 137
Query: 175 PKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQ 234
P+T K IPG+G YTAGA+ SIA+ + +VDGNVIRV++R AI + K N
Sbjct: 138 PRTGDEWAKGIPGVGPYTAGAVLSIAWKQPTGIVDGNVIRVLSRALAIHSDCSKGKANAL 197
Query: 235 VWKAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEAL---------SIS 294
+WK A +LVDP RPGDFNQALMELGA C+P +P CS CP+ + C+A +
Sbjct: 198 IWKLANELVDPVRPGDFNQALMELGAITCTPQSPRCSVCPISEICKAYQEQNVIRDGNTI 257
Query: 295 KHD-----SSVLVTD--------------YPAKGIKTKQRHDYSAVSVVEILENQGTSKL 354
K+D ++ +TD YP KTKQR + + V + Q T
Sbjct: 258 KYDIEDVPCNICITDIPSKEDLQNWVVARYPVHPAKTKQREERALVVIF-----QKTDPS 317
Query: 355 EQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIV 397
+ FL+ KRP GLLAGLW+FP++ E+ ++++ + + I
Sbjct: 318 TKEKFFLIRKRPSAGLLAGLWDFPTIEFGQESWPKDMDAEFQKSIAQWISNDSRS--LIK 377
BLAST of Cla97C06G110910 vs. TAIR10
Match:
AT4G12740.1 (HhH-GPD base excision DNA repair family protein)
HSP 1 Score: 464.2 bits (1193), Expect = 9.7e-131
Identity = 244/415 (58.80%), Positives = 298/415 (71.81%), Query Frame = 0
Query: 44 DIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWR-RLDKGQPETRAYGVWVSEIMLQQTRV 103
DIED +FS + Q IR LL+WYD + RDLPWR R + + E RAY VWVSEIMLQQTRV
Sbjct: 118 DIED-LFSENETQKIRMGLLDWYDVNKRDLPWRNRRSESEKERRAYEVWVSEIMLQQTRV 177
Query: 104 QTVVHFYNRWMLRWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRA 163
QTV+ +Y RWM +WPT+ L +ASLE EVNEMWAGLGYYRRA
Sbjct: 178 QTVMKYYKRWMQKWPTIYDLGQASLENLIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRA 237
Query: 164 RFLLEGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIAR 223
RFLLEGAKM+V FP S+L K+ GIG+YTAGAIASIAF+E VPVVDGNVIRV+AR
Sbjct: 238 RFLLEGAKMVVAGTEGFPNQASSLMKVKGIGQYTAGAIASIAFNEAVPVVDGNVIRVLAR 297
Query: 224 LKAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDH 283
LKAI NPKD + WK AAQLVDPSRPGDFNQ+LMELGATLC+ + PSCS+CPV
Sbjct: 298 LKAISANPKDRLTARNFWKLAAQLVDPSRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQ 357
Query: 284 CEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRP 343
C A S+S+ + ++ VTDYP K IK K RHD+ V V+EI + + RF+LVKRP
Sbjct: 358 CRAFSLSEENRTISVTDYPTKVIKAKPRHDFCCVCVLEI---HNLERNQSGGRFVLVKRP 417
Query: 344 DEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSK--YFGLEPKKNFEIVIREEVGDFIH 403
++GLLAGLWEFPSV+L+ EAD +TRR +IN L + F +E KK IV REE+G+F+H
Sbjct: 418 EQGLLAGLWEFPSVILNEEADSATRRNAINVYLKEAFRFHVELKKACTIVSREELGEFVH 477
Query: 404 VFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK 437
+FTHIR K+YVE LV+ L G LF+ Q K ++ WKCV S+V+S++GLTS+VRK
Sbjct: 478 IFTHIRRKVYVELLVVQLTGGTEDLFKGQAKDTLTWKCVSSDVLSTLGLTSAVRK 528
BLAST of Cla97C06G110910 vs. TAIR10
Match:
AT1G05900.2 (endonuclease III 2)
HSP 1 Score: 61.2 bits (147), Expect = 1.9e-09
Identity = 57/224 (25.45%), Positives = 103/224 (45.98%), Query Frame = 0
Query: 83 PETRAYGVWVSEIMLQQTRVQ----TVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLG 142
P+ R + V + ++ QT+ V + +L T + + +A + E+ +G
Sbjct: 176 PKERRFYVLIGTLLSSQTKEHITGAAVERLHQNGLL---TPEAIDKADESTIKELIYPVG 235
Query: 143 YY-RRARFLLEGAKMIVKE-GGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPV-VDG 202
+Y R+A + + AK+ + E G P+T+ L +PG+G A + +A+++V + VD
Sbjct: 236 FYTRKATNVKKVAKICLMEYDGDIPRTLEELLSLPGVGPKIAHLVLHVAWNDVQGICVDT 295
Query: 203 NVIRVIARL--------KAILGNPKDPKLNKQVWKAAAQLVDPSRPGDFNQALMELGATL 262
+V R+ RL K +P++ ++ Q W + V N L+ G T+
Sbjct: 296 HVHRICNRLGWVSKPGTKQKTSSPEETRVALQQWLPKGEWV------AINFLLVGFGQTI 355
Query: 263 CSPTNPSCSTCPVFDHC-EALSISKHDSSVLVTDYPAKGIKTKQ 291
C+P P C TC + + C A + SS L K IK+K+
Sbjct: 356 CTPLRPHCGTCSITEICPSAFKETPSTSSKL-----KKSIKSKK 385