Homology
BLAST of HG10006921 vs. NCBI nr
Match:
XP_008464733.1 (PREDICTED: DNA (cytosine-5)-methyltransferase 1B-like [Cucumis melo] >XP_008464734.1 PREDICTED: DNA (cytosine-5)-methyltransferase 1B-like [Cucumis melo] >XP_008464735.1 PREDICTED: DNA (cytosine-5)-methyltransferase 1B-like [Cucumis melo])
HSP 1 Score: 2999.5 bits (7775), Expect = 0.0e+00
Identity = 1479/1550 (95.42%), Postives = 1507/1550 (97.23%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDLK+SD+KNVK VKPKQKRNR E GEQ VGVRKMPKRAAACSDFKV
Sbjct: 1 MAKKTRSQLMATSNDLKESDTKNVKAVKPKQKRNRLENGEQVVGVRKMPKRAAACSDFKV 60
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTIQLPEKSS+IECKREV VEDEIAAVGLTT GQDDSRPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 61 KTIQLPEKSSIIECKREVTVEDEIAAVGLTTLGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVNDLFISG+ILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD
Sbjct: 121 LEVNDLFISGVILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CVRPAGGYKKLYNIFYEKANACVEVYKKLA+TSGGNPDLTLEELL GVVRSLNSSRNFPA
Sbjct: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLARTSGGNPDLTLEELLGGVVRSLNSSRNFPA 240
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAGTFD
Sbjct: 241 GMSVKDFIFLQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGTFD 300
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
GFTNLGLKIKDGEQLNPPNI G G EEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK
Sbjct: 301 GFTNLGLKIKDGEQLNPPNITGSGVEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMD+CDPDDLPRSMLHNWSLYNSDS
Sbjct: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDVCDPDDLPRSMLHNWSLYNSDS 420
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSA
Sbjct: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSA 480
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAISIITLLKEQSRA
Sbjct: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAISIITLLKEQSRA 540
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSFA IIKKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT
Sbjct: 541 SKLSFAVIIKKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTT+LINRIWGEFYS
Sbjct: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTKLINRIWGEFYS 660
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NY PEDLKEADNNETKEDE EEEEEVEDEE EEVEEED QVDLKTKESKPVVKPAKAKLS
Sbjct: 661 NYSPEDLKEADNNETKEDEPEEEEEVEDEESEEVEEEDVQVDLKTKESKPVVKPAKAKLS 720
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
EG NKWDG+IVG TSQGYPLYKQAIVHGDLVAVGGFV V+TDNVHDLPA+YLVEYMYEKS
Sbjct: 721 EGNNKWDGKIVGKTSQGYPLYKQAIVHGDLVAVGGFVYVETDNVHDLPALYLVEYMYEKS 780
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
N +KMVHGRL+VRGLETVLGNAAKEREVFLTNDCL+FELNEIRETVVVESCMRPWGYQHR
Sbjct: 781 NGKKMVHGRLIVRGLETVLGNAAKEREVFLTNDCLEFELNEIRETVVVESCMRPWGYQHR 840
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
KAN K DKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK
Sbjct: 841 KANAKMDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
ETQ ED MKLHSSL SFTYRG DYSVND VYL+PHHFGTDERGIETFKGG+NVGLNAYVV
Sbjct: 901 ETQREDTMKLHSSLTSFTYRGTDYSVNDCVYLSPHHFGTDERGIETFKGGKNVGLNAYVV 960
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIESPKGSKQP P+STMVKVRRFFRPEDISVEKAYCSDIRELYYSDETT PVSAI
Sbjct: 961 CQLLGIESPKGSKQPCPISTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTMMPVSAI 1020
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIP+ANYPAIFDH FFCEHLYDPEKGAIKKLP +VKLSS SERQISDAAQ
Sbjct: 1021 EGKCEVRKKQDIPVANYPAIFDHIFFCEHLYDPEKGAIKKLPGSVKLSSPSERQISDAAQ 1080
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGE +PDE E+KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1081 RKKKGKCKEGEIIPDENENKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLP
Sbjct: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLP 1200
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQV+FGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1261 SFNKGQTFRLTLASLLEMGYQVKFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKISLS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEP
Sbjct: 1321 VFGSPELKISLSDNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPA 1380
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGDV+VLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLS GQMHDLI
Sbjct: 1381 SWFQKKIRGDVIVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSNGQMHDLI 1440
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIE KSSKSVLS
Sbjct: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIETKSSKSVLS 1550
BLAST of HG10006921 vs. NCBI nr
Match:
KAA0065499.1 (DNA (cytosine-5)-methyltransferase 1B-like [Cucumis melo var. makuwa] >TYK08739.1 DNA (cytosine-5)-methyltransferase 1B-like [Cucumis melo var. makuwa])
HSP 1 Score: 2984.9 bits (7737), Expect = 0.0e+00
Identity = 1470/1541 (95.39%), Postives = 1498/1541 (97.21%), Query Frame = 0
Query: 10 MVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKS 69
M TSNDLK+SD+KNVK VKPKQKRNR E GEQ VGVRKMPKRAAACSDFKVKTIQLPEKS
Sbjct: 1 MATSNDLKESDTKNVKAVKPKQKRNRLENGEQVVGVRKMPKRAAACSDFKVKTIQLPEKS 60
Query: 70 SVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFIS 129
S+IECKREV VEDEIAAVGLTT GQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFIS
Sbjct: 61 SIIECKREVTVEDEIAAVGLTTLGQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFIS 120
Query: 130 GIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYK 189
G+ILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYK
Sbjct: 121 GVILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYK 180
Query: 190 KLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFIL 249
KLYNIFYEKANACVEVYKKLA+TSGGNPDLTLEELL GVVRSLNSSRNFPAGMSVKDFI
Sbjct: 181 KLYNIFYEKANACVEVYKKLARTSGGNPDLTLEELLGGVVRSLNSSRNFPAGMSVKDFIF 240
Query: 250 LQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKI 309
LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKI
Sbjct: 241 LQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKI 300
Query: 310 KDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNKFYIKINEDE 369
KDGEQLNPPNI G G EEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNKFYIKINEDE
Sbjct: 301 KDGEQLNPPNITGSGVEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNKFYIKINEDE 360
Query: 370 IANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLP 429
IANDYPLPAFYKTTKDEMDEYVIFDGDMD+CDPDDLPRSMLHNWSLYNSDSRLISLELLP
Sbjct: 361 IANDYPLPAFYKTTKDEMDEYVIFDGDMDVCDPDDLPRSMLHNWSLYNSDSRLISLELLP 420
Query: 430 MKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFG 489
MKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSAIKEWMIEFG
Sbjct: 421 MKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSAIKEWMIEFG 480
Query: 490 SSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHII 549
SSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAISIITLLKEQSRASKLSFA II
Sbjct: 481 SSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAISIITLLKEQSRASKLSFAVII 540
Query: 550 KKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEER 609
KKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEER
Sbjct: 541 KKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEER 600
Query: 610 HHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYSNYLPEDLKE 669
HHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTT+LINRIWGEFYSNY PEDLKE
Sbjct: 601 HHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTKLINRIWGEFYSNYSPEDLKE 660
Query: 670 ADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGE 729
ADNNETKEDE EEEEEVEDEE EEVEEED QVDLKTKESKPVVKPAKAKLSEG NKWDG+
Sbjct: 661 ADNNETKEDEPEEEEEVEDEESEEVEEEDVQVDLKTKESKPVVKPAKAKLSEGNNKWDGK 720
Query: 730 IVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGR 789
IVG TSQGYPLYKQAIVHGDLVAVGGFV V+TDNVHDLPA+YLVEYMYEKSN +KMVHGR
Sbjct: 721 IVGKTSQGYPLYKQAIVHGDLVAVGGFVYVETDNVHDLPALYLVEYMYEKSNGKKMVHGR 780
Query: 790 LVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKA 849
L+VRGLETVLGNAAKEREVFLTNDCL+FELNEIRETVVVESCMRPWGYQHRKAN K DKA
Sbjct: 781 LIVRGLETVLGNAAKEREVFLTNDCLEFELNEIRETVVVESCMRPWGYQHRKANAKMDKA 840
Query: 850 DEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGEDIMK 909
DEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQ ED MK
Sbjct: 841 DEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQREDTMK 900
Query: 910 LHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVVCQLLGIESP 969
LHSSL SFTYRG DYSVND VYL+PHHFGTDERGIETFKGG+NVGLNAYVVCQLLGIESP
Sbjct: 901 LHSSLTSFTYRGTDYSVNDCVYLSPHHFGTDERGIETFKGGKNVGLNAYVVCQLLGIESP 960
Query: 970 KGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKK 1029
KGSKQP P+STMVKVRRFFRPEDISVEKAYCSDIRELYYSDETT PVSAIEGKCEVRKK
Sbjct: 961 KGSKQPCPISTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTMMPVSAIEGKCEVRKK 1020
Query: 1030 QDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKE 1089
QDIP+ANYPAIFDH FFCEHLYDPEKGAIKKLP +VKLSS SERQISDAAQRKKKGKCKE
Sbjct: 1021 QDIPVANYPAIFDHIFFCEHLYDPEKGAIKKLPGSVKLSSPSERQISDAAQRKKKGKCKE 1080
Query: 1090 GESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSL 1149
GE +PDE E+KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSL
Sbjct: 1081 GEIIPDENENKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSL 1140
Query: 1150 NHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQVEFIN 1209
NHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLPRPGQVEFIN
Sbjct: 1141 NHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLPRPGQVEFIN 1200
Query: 1210 GGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFR 1269
GGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFR
Sbjct: 1201 GGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFR 1260
Query: 1270 LTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSPELKI 1329
LTLASLLEMGYQV+FGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVF SPELKI
Sbjct: 1261 LTLASLLEMGYQVKFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFGSPELKI 1320
Query: 1330 SLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQKKIRG 1389
SLS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEP SWFQKKIRG
Sbjct: 1321 SLSDNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPASWFQKKIRG 1380
Query: 1390 DVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLPNTAK 1449
DV+VLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLS GQMHDLIPWCLPNTAK
Sbjct: 1381 DVIVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSNGQMHDLIPWCLPNTAK 1440
Query: 1450 RHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKF 1509
RHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKF
Sbjct: 1441 RHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKF 1500
Query: 1510 AGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
AGNIQHKHRQIGNAVPPPLAYALGRKLREAIE KSSKSVLS
Sbjct: 1501 AGNIQHKHRQIGNAVPPPLAYALGRKLREAIETKSSKSVLS 1541
BLAST of HG10006921 vs. NCBI nr
Match:
XP_038892294.1 (DNA (cytosine-5)-methyltransferase 1B-like [Benincasa hispida] >XP_038892295.1 DNA (cytosine-5)-methyltransferase 1B-like [Benincasa hispida] >XP_038892296.1 DNA (cytosine-5)-methyltransferase 1B-like [Benincasa hispida])
HSP 1 Score: 2984.1 bits (7735), Expect = 0.0e+00
Identity = 1477/1550 (95.29%), Postives = 1506/1550 (97.16%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDLK+SD+KNVK VKPKQKRNRSE GEQ VGVRKMPKRAAACSDFKV
Sbjct: 1 MAKKTRSQLMATSNDLKESDTKNVKAVKPKQKRNRSENGEQVVGVRKMPKRAAACSDFKV 60
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTI LPEKSSVIECKREV VEDEIAAVGLTTSG DDSRPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 61 KTIPLPEKSSVIECKREVTVEDEIAAVGLTTSGLDDSRPNRRLTEFTFHDEDGKPQAVEM 120
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVNDLFISG+ILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD
Sbjct: 121 LEVNDLFISGVILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CVRPAGGYKKLYNIFYEKANACVEVYKKLA+TSGGNPDLTLEELLAGVVRSLNSSRNFPA
Sbjct: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLARTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAG+FD
Sbjct: 241 GMSVKDFIFLQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGSFD 300
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
G TNLGLKIKDGEQ NPPNIPG GAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK
Sbjct: 301 GSTNLGLKIKDGEQRNPPNIPGSGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPA+YKTTKDEMDE++IFDGDMDICDPDDLPRSMLH+WSLYNSDS
Sbjct: 361 FYIKINEDEIANDYPLPAYYKTTKDEMDEFLIFDGDMDICDPDDLPRSMLHDWSLYNSDS 420
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGI TADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSA
Sbjct: 421 RLISLELLPMKPCDDIDVTIYGSGITTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSA 480
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAISIITLLKEQSRA
Sbjct: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAISIITLLKEQSRA 540
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSFA IIKKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT
Sbjct: 541 SKLSFAVIIKKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS
Sbjct: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NY PEDLKEADNNET+EDE EEEEVEDEE EEVEE+DGQVDLK +ESKPVVKPAK K S
Sbjct: 661 NYSPEDLKEADNNETREDE-PEEEEVEDEESEEVEEDDGQVDLKAEESKPVVKPAKRKQS 720
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
+GKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSV+TDNVHDLPAIYLVEYMYEKS
Sbjct: 721 DGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVETDNVHDLPAIYLVEYMYEKS 780
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
NARKMVHGRLVVRGLETVLGNAAKE+EVFLTNDCLDFELNEI+E VVVESCMRPWG+QHR
Sbjct: 781 NARKMVHGRLVVRGLETVLGNAAKEQEVFLTNDCLDFELNEIKEKVVVESCMRPWGHQHR 840
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAF CLPKETMGLGTGDCHSCKLK
Sbjct: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFVCLPKETMGLGTGDCHSCKLK 900
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
E Q +D MKLHSSL SFTY G DYSVNDY YLAPHHFGTDERGIETFKGGRNVGLNAYVV
Sbjct: 901 EAQRDDTMKLHSSLTSFTYGGADYSVNDYAYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIE+ KGSKQP PVSTM+KVRRFFRPEDIS EKAYCSDIRELYYSDE T KPVSAI
Sbjct: 961 CQLLGIETSKGSKQPCPVSTMIKVRRFFRPEDISAEKAYCSDIRELYYSDEITMKPVSAI 1020
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIP AN PAIFDH FFCEHLYDPEKGAIKKLPANVKLSSSSERQI+DAAQ
Sbjct: 1021 EGKCEVRKKQDIPAANCPAIFDHIFFCEHLYDPEKGAIKKLPANVKLSSSSERQINDAAQ 1080
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGE +PDETESKKD+PLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1081 RKKKGKCKEGEIIPDETESKKDMPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLP
Sbjct: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLP 1200
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKI LS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT
Sbjct: 1321 VFGSPELKILLSENTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGDVLVLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLSTGQMHDLI
Sbjct: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSTGQMHDLI 1440
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIE KSSKSVLS
Sbjct: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEKKSSKSVLS 1549
BLAST of HG10006921 vs. NCBI nr
Match:
XP_011654447.1 (DNA (cytosine-5)-methyltransferase 1B isoform X2 [Cucumis sativus])
HSP 1 Score: 2973.3 bits (7707), Expect = 0.0e+00
Identity = 1471/1550 (94.90%), Postives = 1500/1550 (96.77%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDL+KSD+KNVK VKPKQKRNR E GEQ VGVRKMPKRAA+CSDFKV
Sbjct: 1 MAKKTRSQLMATSNDLEKSDTKNVKAVKPKQKRNRLENGEQVVGVRKMPKRAASCSDFKV 60
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTIQLPEKSSVIECKREV VEDEIAAVGLTT GQDDSRPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 61 KTIQLPEKSSVIECKREVTVEDEIAAVGLTTLGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVNDLFISG+ILPFED DKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD
Sbjct: 121 LEVNDLFISGVILPFEDISDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CVRPA GYKKLYNIFYEKANACVEVYKKLA++SGG PDLTLEELL GVVRSLNSSRNFPA
Sbjct: 181 CVRPAAGYKKLYNIFYEKANACVEVYKKLARSSGGYPDLTLEELLGGVVRSLNSSRNFPA 240
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI+LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAG FD
Sbjct: 241 GMSVKDFIILQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGIFD 300
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
GFTNLGLKIKDGEQLNPPNI G G EEDEDLKLAKLLQEEEYWRSAKQRK QRSTTSSNK
Sbjct: 301 GFTNLGLKIKDGEQLNPPNILGSGDEEDEDLKLAKLLQEEEYWRSAKQRKTQRSTTSSNK 360
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMD+CDPDDLPRSMLHNWSLYNSDS
Sbjct: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDVCDPDDLPRSMLHNWSLYNSDS 420
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSA
Sbjct: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSA 480
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAI II LLKEQSRA
Sbjct: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAICIIMLLKEQSRA 540
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSFA IIKKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT
Sbjct: 541 SKLSFAVIIKKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLV+KKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS
Sbjct: 601 GLSDKMEERHHTKWLVRKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NY PEDLK ADNNETKEDE EEEEEVEDEE EEVEEEDGQVDLKTKESKPVVKPAKAKLS
Sbjct: 661 NYSPEDLKAADNNETKEDEPEEEEEVEDEESEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
EG NKWDG++VG TS+GYPLYKQAIVHGDLVAVGGFVSV+TDNVHDLPAIYLVEYMYEKS
Sbjct: 721 EGNNKWDGKMVGKTSEGYPLYKQAIVHGDLVAVGGFVSVETDNVHDLPAIYLVEYMYEKS 780
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
N RKMVHGRL+VRGLETVLGNAAKEREVFLTNDCL+FELNEIRE VVVESCMRPWGYQHR
Sbjct: 781 NGRKMVHGRLLVRGLETVLGNAAKEREVFLTNDCLEFELNEIREAVVVESCMRPWGYQHR 840
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
KAN KKDKA+EERAEERK RGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK
Sbjct: 841 KANAKKDKAEEERAEERKHRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
ETQ ED MKLHSSL SFTYRG DYSVND VYLAPHHFGTDERGIETFKGGRNV LNAYVV
Sbjct: 901 ETQREDTMKLHSSLTSFTYRGTDYSVNDCVYLAPHHFGTDERGIETFKGGRNVVLNAYVV 960
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIESPKGSKQP PVSTMV+VRRFFRPEDISVEKAYCSDIRELYYSDETT KPVSAI
Sbjct: 961 CQLLGIESPKGSKQPCPVSTMVQVRRFFRPEDISVEKAYCSDIRELYYSDETTMKPVSAI 1020
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIP+AN PAIFDH FFCEHLYDPEKGAIKKLPA+VKLSS SERQISDAAQ
Sbjct: 1021 EGKCEVRKKQDIPVANCPAIFDHIFFCEHLYDPEKGAIKKLPASVKLSSPSERQISDAAQ 1080
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGE +PDE E+KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1081 RKKKGKCKEGEIIPDENENKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLP
Sbjct: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLP 1200
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQV+FGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1261 SFNKGQTFRLTLASLLEMGYQVKFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKISLS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASV TMEYKSEP
Sbjct: 1321 VFGSPELKISLSDNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVITMEYKSEPA 1380
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGDVLVLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLS GQMHDLI
Sbjct: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSNGQMHDLI 1440
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRL+WEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1441 PWCLPNTAKRHNQWKGLFGRLEWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS
Sbjct: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1550
BLAST of HG10006921 vs. NCBI nr
Match:
XP_011654446.1 (DNA (cytosine-5)-methyltransferase 1B isoform X1 [Cucumis sativus] >KGN50781.2 hypothetical protein Csa_018393 [Cucumis sativus])
HSP 1 Score: 2973.3 bits (7707), Expect = 0.0e+00
Identity = 1471/1550 (94.90%), Postives = 1500/1550 (96.77%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDL+KSD+KNVK VKPKQKRNR E GEQ VGVRKMPKRAA+CSDFKV
Sbjct: 53 MAKKTRSQLMATSNDLEKSDTKNVKAVKPKQKRNRLENGEQVVGVRKMPKRAASCSDFKV 112
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTIQLPEKSSVIECKREV VEDEIAAVGLTT GQDDSRPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 113 KTIQLPEKSSVIECKREVTVEDEIAAVGLTTLGQDDSRPNRRLTEFTFHDEDGKPQAVEM 172
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVNDLFISG+ILPFED DKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD
Sbjct: 173 LEVNDLFISGVILPFEDISDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 232
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CVRPA GYKKLYNIFYEKANACVEVYKKLA++SGG PDLTLEELL GVVRSLNSSRNFPA
Sbjct: 233 CVRPAAGYKKLYNIFYEKANACVEVYKKLARSSGGYPDLTLEELLGGVVRSLNSSRNFPA 292
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI+LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAG FD
Sbjct: 293 GMSVKDFIILQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGIFD 352
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
GFTNLGLKIKDGEQLNPPNI G G EEDEDLKLAKLLQEEEYWRSAKQRK QRSTTSSNK
Sbjct: 353 GFTNLGLKIKDGEQLNPPNILGSGDEEDEDLKLAKLLQEEEYWRSAKQRKTQRSTTSSNK 412
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMD+CDPDDLPRSMLHNWSLYNSDS
Sbjct: 413 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDVCDPDDLPRSMLHNWSLYNSDS 472
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSA
Sbjct: 473 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSA 532
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAI II LLKEQSRA
Sbjct: 533 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAICIIMLLKEQSRA 592
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSFA IIKKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT
Sbjct: 593 SKLSFAVIIKKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 652
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLV+KKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS
Sbjct: 653 GLSDKMEERHHTKWLVRKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 712
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NY PEDLK ADNNETKEDE EEEEEVEDEE EEVEEEDGQVDLKTKESKPVVKPAKAKLS
Sbjct: 713 NYSPEDLKAADNNETKEDEPEEEEEVEDEESEEVEEEDGQVDLKTKESKPVVKPAKAKLS 772
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
EG NKWDG++VG TS+GYPLYKQAIVHGDLVAVGGFVSV+TDNVHDLPAIYLVEYMYEKS
Sbjct: 773 EGNNKWDGKMVGKTSEGYPLYKQAIVHGDLVAVGGFVSVETDNVHDLPAIYLVEYMYEKS 832
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
N RKMVHGRL+VRGLETVLGNAAKEREVFLTNDCL+FELNEIRE VVVESCMRPWGYQHR
Sbjct: 833 NGRKMVHGRLLVRGLETVLGNAAKEREVFLTNDCLEFELNEIREAVVVESCMRPWGYQHR 892
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
KAN KKDKA+EERAEERK RGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK
Sbjct: 893 KANAKKDKAEEERAEERKHRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 952
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
ETQ ED MKLHSSL SFTYRG DYSVND VYLAPHHFGTDERGIETFKGGRNV LNAYVV
Sbjct: 953 ETQREDTMKLHSSLTSFTYRGTDYSVNDCVYLAPHHFGTDERGIETFKGGRNVVLNAYVV 1012
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIESPKGSKQP PVSTMV+VRRFFRPEDISVEKAYCSDIRELYYSDETT KPVSAI
Sbjct: 1013 CQLLGIESPKGSKQPCPVSTMVQVRRFFRPEDISVEKAYCSDIRELYYSDETTMKPVSAI 1072
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIP+AN PAIFDH FFCEHLYDPEKGAIKKLPA+VKLSS SERQISDAAQ
Sbjct: 1073 EGKCEVRKKQDIPVANCPAIFDHIFFCEHLYDPEKGAIKKLPASVKLSSPSERQISDAAQ 1132
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGE +PDE E+KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1133 RKKKGKCKEGEIIPDENENKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1192
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLP
Sbjct: 1193 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLP 1252
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1253 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1312
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQV+FGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1313 SFNKGQTFRLTLASLLEMGYQVKFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1372
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKISLS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASV TMEYKSEP
Sbjct: 1373 VFGSPELKISLSDNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVITMEYKSEPA 1432
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGDVLVLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLS GQMHDLI
Sbjct: 1433 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSNGQMHDLI 1492
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRL+WEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1493 PWCLPNTAKRHNQWKGLFGRLEWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1552
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS
Sbjct: 1553 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1602
BLAST of HG10006921 vs. ExPASy Swiss-Prot
Match:
B1Q3J6 (DNA (cytosine-5)-methyltransferase 1B OS=Oryza sativa subsp. japonica OX=39947 GN=MET1B PE=2 SV=1)
HSP 1 Score: 1806.6 bits (4678), Expect = 0.0e+00
Identity = 914/1504 (60.77%), Postives = 1130/1504 (75.13%), Query Frame = 0
Query: 47 KMPKRAAACSDFKVKTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEF 106
K P+RAAACSDFK K+++L +KSSV+ E+E+ AV LT G + RP R+L +F
Sbjct: 57 KRPRRAAACSDFKEKSVRLSDKSSVVATNGNKMEEEEMDAVKLTKLGPEVQRPCRKLIDF 116
Query: 107 TFHDEDGKPQAVEMLEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDG 166
HD DGK Q EM E++D FI+ +I+P +D +K++ KGVRCEGFGRIE W ISGY++G
Sbjct: 117 ILHDADGKLQPFEMSEIDDFFITALIMPMDDDLEKDRQKGVRCEGFGRIEDWAISGYDEG 176
Query: 167 SPTIWISTDVADYDCVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLA 226
+ +W+ST+VADY+CV+PAG YK Y+ FYEKA CVEVY+KLA++ GGNP+L LEELLA
Sbjct: 177 TAVVWVSTEVADYECVKPAGNYKSYYDHFYEKAQVCVEVYRKLARSVGGNPNLGLEELLA 236
Query: 227 GVVRSLNSSRNFPAGMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDES 286
VVRS+N+ + + +S KDF++ GEF+YNQLIGLD+T+ +D+ F LPVL ALRD
Sbjct: 237 SVVRSINAIKGYSGTLS-KDFVISNGEFVYNQLIGLDETANTDDEKFATLPVLLALRDGC 296
Query: 287 RKQ---GNLLPNAGTFDGFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYW 346
+ + L PN +N LKI D E ED+D KLA+LLQ+EE W
Sbjct: 297 KSRVEVSKLQPN------ISNGSLKINDAECKE--------VSEDDDEKLARLLQQEEEW 356
Query: 347 RSAKQRKNQRSTTSSNKFYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPD 406
+ KQR +R TTS YIKI+E EIANDYPLPA+YK + EMDEY IFD +
Sbjct: 357 KMMKQR-GKRGTTSQKNVYIKISEAEIANDYPLPAYYKPSSQEMDEY-IFDSEDSFY--S 416
Query: 407 DLPRSMLHNWSLYNSDSRLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQS 466
D+P +L+NW+LYN+DSRLI LEL+PMK + D+ ++GSG M DDGS C A++++
Sbjct: 417 DVPVRILNNWALYNADSRLIPLELIPMKAGAENDIVVFGSGFMREDDGS-CCSTAESAKL 476
Query: 467 CSAQMQNTD--GIPIYLSAIKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVL 526
S+ N G+ IYLS IKEW+IEFG SM+ I+IRTD+AWY+L +P+KQYAPW E VL
Sbjct: 477 SSSSSSNHQDAGVSIYLSPIKEWVIEFGGSMICITIRTDVAWYKLRQPTKQYAPWCEPVL 536
Query: 527 KTARLAISIITLLKEQSRASKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQII 586
KTARL++SIITLLKEQSRASKLSFA +IKK++EF K +PA++SS ++VERY+VVHGQII
Sbjct: 537 KTARLSVSIITLLKEQSRASKLSFADVIKKVAEFDKGSPAFVSSNVALVERYIVVHGQII 596
Query: 587 LQTFSEYPDDMIRKCAFITGLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRK 646
LQ FS++PD+ IR+ AF TGL KME+R HTK ++KKK + + N+NP A+M P + RK
Sbjct: 597 LQQFSDFPDETIRRSAFATGLLMKMEQRRHTKLVMKKKVQVMRGENLNPSATMGPASRRK 656
Query: 647 AMPATTTRLINRIWGEFYSNYLPEDLKEADNNETKE--DELEEEEEVEDEELEEVEEEDG 706
M ATTTRLINRIW ++Y+++ PED K+AD NE KE DELEE E+ + EE ++EEE+
Sbjct: 657 VMRATTTRLINRIWSDYYTHHFPEDSKDADVNEAKEIDDELEENEDEDAEEEAQIEEENV 716
Query: 707 QVDLKTKESKPVVKPAKAKLSEGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSV 766
+ S+ +V ++ +W+GE +G T G LYK A V + +G V++
Sbjct: 717 SKTPPSTRSRKLVSQTCKEI-----RWEGEAIGKTPSGEALYKCAYVRELRINLGRTVAL 776
Query: 767 DTDNVHDLPAIYLVEYMYEKSNARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFEL 826
+ D+ + VEYM++K N KMVHGRL+ +G ETVLGNAA ER++FLTN+CL+FEL
Sbjct: 777 EDDSGE--LVMCFVEYMFQKLNGAKMVHGRLLQKGSETVLGNAANERDLFLTNECLEFEL 836
Query: 827 NEIRETVVVESCMRPWGYQHRKANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAF 886
+I+E + V PWG+++RK N + D+ + +AE+RK++GLPME+ CKSLYWPEKGAF
Sbjct: 837 EDIKELMSVNLQSLPWGHKYRKENAEADRIERAKAEDRKKKGLPMEYLCKSLYWPEKGAF 896
Query: 887 FCLPKETMGLGTGDCHSCKLKETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGT 946
F LP + +GLG G C SC+ KE +++ L S SF YR I Y+VNDY+Y+ P F +
Sbjct: 897 FSLPHDKLGLGNGFCSSCQQKEPDCDELQIL--SKNSFIYRNITYNVNDYLYIRPDFF-S 956
Query: 947 DERGIETFKGGRNVGLNAYVVCQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAY 1006
E TFKGGRNVGL YVVC LL + P GS++ P ST + VRRF+RP+DIS KAY
Sbjct: 957 QEEDRATFKGGRNVGLKPYVVCHLLDVHEPAGSRKIHPASTKISVRRFYRPDDISSAKAY 1016
Query: 1007 CSDIRELYYSDETTTKPVSAIEGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIK 1066
SDIRE+YYS+ PV IEGKCEV+KK DI ++ P + +H FFCEH YDP GA+K
Sbjct: 1017 VSDIREVYYSENIVKVPVDMIEGKCEVKKKIDISNSDVPVMVEHEFFCEHFYDPATGALK 1076
Query: 1067 KLPANVKLSSSSERQISDAAQRKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCG 1126
+LP NVKL S ++ + A +K KGK + ES +++ + ENRLATLDIFAGCG
Sbjct: 1077 QLPPNVKLMSVQQK--ATGALKKNKGK-QICESDQVDSDKCTKVSKENRLATLDIFAGCG 1136
Query: 1127 GLSEGLQQAGVSVTKWAIEYEEPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCI 1186
GLSEGLQQAGVS TKWAIEYEEPAGEAF+ NHPEA FV+NCNVIL+A+M CGDADDCI
Sbjct: 1137 GLSEGLQQAGVSFTKWAIEYEEPAGEAFTKNHPEAAVFVDNCNVILKAIMDKCGDADDCI 1196
Query: 1187 STSEAIEVAEKLDDKEINNLPRPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAF 1246
STSEA E A K I NLP PG+VEFINGGPPCQGFSGMNRFNQS WSKVQCEMILAF
Sbjct: 1197 STSEAAEQAAKFSQDNIMNLPVPGEVEFINGGPPCQGFSGMNRFNQSPWSKVQCEMILAF 1256
Query: 1247 LSFAEYFRPKYFLLENVRNFVSFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRK 1306
LSFAEYFRP++FLLENVRNFVSFNKGQTFRLT+ASLLEMGYQVRFGILEAG +G++QSRK
Sbjct: 1257 LSFAEYFRPRFFLLENVRNFVSFNKGQTFRLTVASLLEMGYQVRFGILEAGTFGVAQSRK 1316
Query: 1307 RAFIWAASPEEILPEWPEPMHVFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIG 1366
RAFIWAA+P E LP+WPEPMHVF SPELKI+L YAA STAGGAPFR ITVRDTIG
Sbjct: 1317 RAFIWAAAPGETLPDWPEPMHVFASPELKINLPDGKYYAAAKSTAGGAPFRAITVRDTIG 1376
Query: 1367 DLPAVGNGASVTTMEYKSEPTSWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGS 1426
DLP V NGAS +EY EP SWFQKKIRG+ + LNDHISKEMNELNLIRCQ+IPKRPG
Sbjct: 1377 DLPKVENGASKLLLEYGGEPISWFQKKIRGNTIALNDHISKEMNELNLIRCQRIPKRPGC 1436
Query: 1427 DWRDLPDEKVRLSTGQMHDLIPWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGK 1486
DW DLPDEKV+LS+GQ+ DLIPWCLPNTAKRHNQWKGL+GRLDWEGNFPTS+TDPQPMGK
Sbjct: 1437 DWHDLPDEKVKLSSGQLVDLIPWCLPNTAKRHNQWKGLYGRLDWEGNFPTSVTDPQPMGK 1496
Query: 1487 VGMCFHPEQDRILTVRECARSQGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREA 1544
VGMCFHP+QDRI+TVRECARSQGFPD+Y+FAGNIQ KHRQIGNAVPPPLA+ALGRKL+EA
Sbjct: 1497 VGMCFHPDQDRIITVRECARSQGFPDNYQFAGNIQSKHRQIGNAVPPPLAFALGRKLKEA 1527
BLAST of HG10006921 vs. ExPASy Swiss-Prot
Match:
P34881 (DNA (cytosine-5)-methyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=DMT1 PE=1 SV=1)
HSP 1 Score: 1780.8 bits (4611), Expect = 0.0e+00
Identity = 920/1540 (59.74%), Postives = 1150/1540 (74.68%), Query Frame = 0
Query: 22 KNVKEVKPKQKRNRSETGE-QAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAV 81
+N + ++KR E E + V + P+RAAAC+ FK K+I++ EKS+ IE K++ V
Sbjct: 3 ENGAKAAKRKKRPLPEIQEVEDVPRTRRPRRAAACTSFKEKSIRVCEKSATIEVKKQQIV 62
Query: 82 EDEIAAVGLTTSGQD-DSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTP 141
E+E A+ LT D + RP RRL +F D DG PQ +EMLE++D+F+SG ILP +
Sbjct: 63 EEEFLALRLTALETDVEDRPTRRLNDFVLFDSDGVPQPLEMLEIHDIFVSGAILPSDVCT 122
Query: 142 DKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKA 201
DKEK KGVRC FGR+E W+ISGYEDGSP IWIST++ADYDC +PA Y+K+Y+ FYEKA
Sbjct: 123 DKEKEKGVRCTSFGRVEHWSISGYEDGSPVIWISTELADYDCRKPAASYRKVYDYFYEKA 182
Query: 202 NACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNS-SRNFPAGMSVKDFILLQGEFIYNQ 261
A V VYKKL+K+SGG+PD+ LEELLA VVRS++S S+ F +G ++ DF++ QG+FIYNQ
Sbjct: 183 RASVAVYKKLSKSSGGDPDIGLEELLAAVVRSMSSGSKYFSSGAAIIDFVISQGDFIYNQ 242
Query: 262 LIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPP 321
L GLD+T+KK++ + ++PVL ALR++S K L + + G++IK+ Q+
Sbjct: 243 LAGLDETAKKHESSYVEIPVLVALREKSSKIDKPLQR----ERNPSNGVRIKEVSQVAES 302
Query: 322 NI---PGFGAEEDEDLKLAKLLQEEEYWRSAKQ-RKNQRSTTSSNKFYIKINEDEIANDY 381
D+D + A LLQ+EE +S +Q RKN S ++SN FYIKINEDEIANDY
Sbjct: 303 EALTSDQLVDGTDDDRRYAILLQDEENRKSMQQPRKNSSSGSASNMFYIKINEDEIANDY 362
Query: 382 PLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCD 441
PLP++YKT+++E DE +++D ++ + LP MLHNW+LYNSD R ISLELLPMK CD
Sbjct: 363 PLPSYYKTSEEETDELILYDASYEV-QSEHLPHRMLHNWALYNSDLRFISLELLPMKQCD 422
Query: 442 DIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGS-SMV 501
DIDV I+GSG++T D+GS L+ S S Q + DG+ I+LS IKEWMIEFGS ++
Sbjct: 423 DIDVNIFGSGVVTDDNGSWISLNDPDSGS---QSHDPDGMCIFLSQIKEWMIEFGSDDII 482
Query: 502 FISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKIS 561
ISIRTD+AWYRLGKPSK YAPW++ VLKTAR+ ISI+T L+ +SR ++LSFA + K++S
Sbjct: 483 SISIRTDVAWYRLGKPSKLYAPWWKPVLKTARVGISILTFLRVESRVARLSFADVTKRLS 542
Query: 562 EFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTK 621
N+ AYISS P VERY+VVHGQIILQ F+ YPDD +++C F+ GL+ K+E+RHHTK
Sbjct: 543 GLQANDKAYISSDPLAVERYLVVHGQIILQLFAVYPDDNVKRCPFVVGLASKLEDRHHTK 602
Query: 622 WLVKKKAVLKQEANMNPRASMKPVTS-RKAMPATTTRLINRIWGEFYSNYLPEDLKEADN 681
W++KKK + +E N+NPRA M PV S RKAM ATTTRL+NRIWGEFYSNY PED +A
Sbjct: 603 WIIKKKKISLKELNLNPRAGMAPVASKRKAMQATTTRLVNRIWGEFYSNYSPEDPLQATA 662
Query: 682 NETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVK---PAKAKLSEGKN--KWD 741
E EDE+EEE +EE+EE E E+G + E V K P K + S GK KWD
Sbjct: 663 AENGEDEVEEEGGNGEEEVEE-EGENGLTEDTVPEPVEVQKPHTPKKIRGSSGKREIKWD 722
Query: 742 GEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVH 801
GE +G TS G PLY+QA+V G++VAVGG V+++ D+ ++PAIY VEYM+E ++ KM+H
Sbjct: 723 GESLGKTSAGEPLYQQALVGGEMVAVGGAVTLEVDDPDEMPAIYFVEYMFESTDHCKMLH 782
Query: 802 GRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKD 861
GR + RG TVLGNAA ERE+FLTN+C+ +L +I+ E RPWG+Q+RK N+ D
Sbjct: 783 GRFLQRGSMTVLGNAANERELFLTNECMTTQLKDIKGVASFEIRSRPWGHQYRKKNITAD 842
Query: 862 KADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-D 921
K D RA ERK + LP E+YCKSLY PE+G FF LP +G +G C SCK++E + +
Sbjct: 843 KLDWARALERKVKDLPTEYYCKSLYSPERGGFFSLPLSDIGRSSGFCTSCKIREDEEKRS 902
Query: 922 IMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIET-FKGGRNVGLNAYVVCQLLG 981
+KL+ S F GI+YSV D+VY+ P G + G +T FK GRN+GL AYVVCQLL
Sbjct: 903 TIKLNVSKTGFFINGIEYSVEDFVYVNPDSIGGLKEGSKTSFKSGRNIGLRAYVVCQLLE 962
Query: 982 IESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCE 1041
I PK S++ S VKVRRF+RPED+S EKAY SDI+ELY+S +T P A+EGKCE
Sbjct: 963 I-VPKESRKADLGSFDVKVRRFYRPEDVSAEKAYASDIQELYFSQDTVVLPPGALEGKCE 1022
Query: 1042 VRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKG 1101
VRKK D+P++ I DH FFC+ +D KG++K+LPAN+K S+ + D RKKKG
Sbjct: 1023 VRKKSDMPLSREYPISDHIFFCDLFFDTSKGSLKQLPANMKPKFSTIK--DDTLLRKKKG 1082
Query: 1102 KCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGE 1161
K E E + E + P E RLATLDIFAGCGGLS GL++AGVS KWAIEYEEPAG+
Sbjct: 1083 KGVESE-IESEIVKPVEPPKEIRLATLDIFAGCGGLSHGLKKAGVSDAKWAIEYEEPAGQ 1142
Query: 1162 AFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQV 1221
AF NHPE+ FV+NCNVILRA+M+ GD DDC+ST+EA E+A KL +++ + LP PGQV
Sbjct: 1143 AFKQNHPESTVFVDNCNVILRAIMEKGGDQDDCVSTTEANELAAKLTEEQKSTLPLPGQV 1202
Query: 1222 EFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKG 1281
+FINGGPPCQGFSGMNRFNQS+WSKVQCEMILAFLSFA+YFRP+YFLLENVR FVSFNKG
Sbjct: 1203 DFINGGPPCQGFSGMNRFNQSSWSKVQCEMILAFLSFADYFRPRYFLLENVRTFVSFNKG 1262
Query: 1282 QTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSP 1341
QTF+LTLASLLEMGYQVRFGILEAGAYG+SQSRKRAFIWAA+PEE+LPEWPEPMHVF P
Sbjct: 1263 QTFQLTLASLLEMGYQVRFGILEAGAYGVSQSRKRAFIWAAAPEEVLPEWPEPMHVFGVP 1322
Query: 1342 ELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQK 1401
+LKISLS YAAV STA GAPFRPITVRDTIGDLP+V NG S T EYK SWFQK
Sbjct: 1323 KLKISLSQGLHYAAVRSTALGAPFRPITVRDTIGDLPSVENGDSRTNKEYKEVAVSWFQK 1382
Query: 1402 KIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLP 1461
+IRG+ + L DHI K MNELNLIRC+ IP RPG+DW DLP KV LS G++ ++IP+CLP
Sbjct: 1383 EIRGNTIALTDHICKAMNELNLIRCKLIPTRPGADWHDLPKRKVTLSDGRVEEMIPFCLP 1442
Query: 1462 NTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPD 1521
NTA+RHN WKGL+GRLDW+GNFPTS+TDPQPMGKVGMCFHPEQ RILTVRECARSQGFPD
Sbjct: 1443 NTAERHNGWKGLYGRLDWQGNFPTSVTDPQPMGKVGMCFHPEQHRILTVRECARSQGFPD 1502
Query: 1522 SYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSS 1546
SY+FAGNI HKHRQIGNAVPPPLA+ALGRKL+EA+ +K S
Sbjct: 1503 SYEFAGNINHKHRQIGNAVPPPLAFALGRKLKEALHLKKS 1529
BLAST of HG10006921 vs. ExPASy Swiss-Prot
Match:
Q7Y1I7 (DNA (cytosine-5)-methyltransferase 1A OS=Oryza sativa subsp. japonica OX=39947 GN=MET1A PE=2 SV=1)
HSP 1 Score: 1780.0 bits (4609), Expect = 0.0e+00
Identity = 914/1550 (58.97%), Postives = 1136/1550 (73.29%), Query Frame = 0
Query: 3 KKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKT 62
K+ R+++ ++ + ++ +V K+ +E G + V K PKRAAACS+FK K+
Sbjct: 14 KRRRAKVHKEDEPVENENLESEFDVSKKESNGATEPGNEPV-ASKRPKRAAACSNFKEKS 73
Query: 63 IQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEMLE 122
+ L EK S+I K E EI AV LT +G +D +P R++ +F HD DG Q EM E
Sbjct: 74 LDLSEKDSIITIKESRVEEKEIEAVNLTRTGPEDGQPCRKIIDFILHDGDGNLQPFEMSE 133
Query: 123 VNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCV 182
V+D+FI+ +I+P +D +K++ KG+ C GFGRIE+W ISGY++G+ IW+ST+ +DY CV
Sbjct: 134 VDDIFITALIMPLDDDLEKDRGKGICCSGFGRIENWAISGYDEGAAVIWVSTETSDYKCV 193
Query: 183 RPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGM 242
+PA Y+ + F EKA CVEVYKKLA++ GGNP + LEEL+AGVVRS+NS+R+F G
Sbjct: 194 KPASSYRSYFEHFSEKARVCVEVYKKLARSVGGNPQVDLEELIAGVVRSINSNRSF-NGT 253
Query: 243 SVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNL-----LPNAG 302
KDF++ GEFIY QLIGLD T+ +D++ LPVL AL+DE + + +P+ G
Sbjct: 254 VTKDFVISSGEFIYKQLIGLDHTAGNDDEMLATLPVLVALKDECKSRAGFTHLPAMPSNG 313
Query: 303 TFDGFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTS 362
T L+IKDG+ G EDED KLA+LLQEEE W+ KQR +R T+
Sbjct: 314 T--------LRIKDGQDK--------GLTEDEDAKLARLLQEEEEWKMMKQR-GKRGTSQ 373
Query: 363 SNKFYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYN 422
N YIKI E EIANDYPLPA+YK EMDEY IFD D+ + DD+P +L NW+LYN
Sbjct: 374 KN-IYIKICETEIANDYPLPAYYKPYNQEMDEY-IFDSDIGMYS-DDVPVRILDNWALYN 433
Query: 423 SDSRLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQ---SCSAQMQNTDGI 482
SDSRLISLEL+PMK + D+ ++GSG M DDGS C A+ +Q S S + G+
Sbjct: 434 SDSRLISLELIPMKAGAENDIVVFGSGFMREDDGS-CCSTAELAQLHSSSSKSGREDPGV 493
Query: 483 PIYLSAIKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLL 542
PIYLS IKEW++EFG SM+ I+IRTD+AWY+L +P+KQYAPW E VLKTARLA+SIITLL
Sbjct: 494 PIYLSPIKEWVVEFGGSMICITIRTDVAWYKLRQPTKQYAPWCEPVLKTARLAVSIITLL 553
Query: 543 KEQSRASKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIR 602
KEQSRASKLSFA +IKK++EF +PA+ISS VERYVVVHGQIILQ F+++PD+ ++
Sbjct: 554 KEQSRASKLSFAEVIKKVAEFDSRHPAFISSKAPTVERYVVVHGQIILQQFADFPDESVK 613
Query: 603 KCAFITGLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRI 662
+CAFITGL KMEE HTK +KKK+ + N+NP A M P+ +K M ATTT LI++I
Sbjct: 614 RCAFITGLLAKMEESRHTKLAIKKKSQQMRGENLNPSAKMGPILRKKLMRATTTMLISKI 673
Query: 663 WGEFYSNYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKP 722
WGE+Y+ Y P D KE D NE KE + ++EE +++ EEV +D K + P +
Sbjct: 674 WGEYYATYFPGDTKEEDQNEPKEIDDDQEENEDNDAEEEVNVQDE----KATRTPPSTRS 733
Query: 723 AKAKLSEGKN-KWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLV 782
K+ K KW+G+ G T G LYK IV ++VG V+ + D+ + + V
Sbjct: 734 RKSSADTRKEIKWEGQTAGKTVSGEVLYKCVIVQDLSISVGATVTTEDDSGETI--MCFV 793
Query: 783 EYMYEKSNARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMR 842
EYMYEK + + M+HG ++ G +TVLGNAA +REVFLTNDCL+FE ++I+E V V
Sbjct: 794 EYMYEKLDGKNMIHGIILQEGSQTVLGNAANDREVFLTNDCLEFEASDIKELVTVNIQSL 853
Query: 843 PWGYQHRKANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGD 902
PWG+++RK N + + ++ +AEERKR+GLP+E+ CKSLYWPEKG FF LP + +G GTG
Sbjct: 854 PWGHKYRKENSEAKRIEKAKAEERKRKGLPVEYICKSLYWPEKGGFFSLPYDKIGNGTGI 913
Query: 903 CHSCKLKETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNV 962
C SC+ K E KL S SF + I Y+++D++Y+ P F E G ET+K GRNV
Sbjct: 914 CSSCERKPVGNE--FKLLSE-SSFVFENITYNIHDFLYIRPEFFSQGE-GHETYKAGRNV 973
Query: 963 GLNAYVVCQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETT 1022
GL Y VC LL + P GS++ +P ST VKVRRF+RP+DIS KAY SDIRE+YYS++
Sbjct: 974 GLKPYAVCHLLSVHGPAGSRKANPESTKVKVRRFYRPDDISSTKAYSSDIREVYYSEDII 1033
Query: 1023 TKPVSAIEGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSER 1082
+ PV IEGKCEVR K D+P ++ PA+ +H F CE+LYDP GA+K+LP NV+L + + +
Sbjct: 1034 SVPVVMIEGKCEVRLKDDLPNSDLPAVVEHVFCCEYLYDPANGALKQLPPNVRLVTLTRK 1093
Query: 1083 QISDAAQRKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVT 1142
A +K KGK + + ++ KD EN LATLDIFAGCGGLSEGLQ++G+S+T
Sbjct: 1094 V---PASKKNKGK-QICDIELGGSDKPKDGQSENCLATLDIFAGCGGLSEGLQRSGLSLT 1153
Query: 1143 KWAIEYEEPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDD 1202
KWAIEYEEPAG+AF NHPEA FV NCNVIL+A+M CGD+DDCISTSEA E A KL +
Sbjct: 1154 KWAIEYEEPAGDAFGENHPEAAVFVENCNVILKAIMDKCGDSDDCISTSEAAERAAKLSE 1213
Query: 1203 KEINNLPRPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLL 1262
+I NLP PG+VEFINGGPPCQGFSGMNRFNQS WSKVQCEMILAFLSFAEYFRP++FLL
Sbjct: 1214 DKIKNLPVPGEVEFINGGPPCQGFSGMNRFNQSPWSKVQCEMILAFLSFAEYFRPRFFLL 1273
Query: 1263 ENVRNFVSFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILP 1322
ENVRNFVSFNKGQTFRLTLASLLEMGYQVRFGILEAGAYG++QSRKRAFIWAA+P E LP
Sbjct: 1274 ENVRNFVSFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGVAQSRKRAFIWAAAPGETLP 1333
Query: 1323 EWPEPMHVFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTM 1382
EWPEPMHVF SPELKI+L YAAV STA GAPFR ITVRDTIGDLPAV NGA T+
Sbjct: 1334 EWPEPMHVFASPELKITLPDGKFYAAVKSTAAGAPFRSITVRDTIGDLPAVENGAGKPTI 1393
Query: 1383 EYKSEPTSWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLST 1442
+Y S P SWFQKKIR D+ LNDHISKEMNELNLIRC+ IPKRPG DW DLPDEKV+LST
Sbjct: 1394 QYGSGPVSWFQKKIRSDMASLNDHISKEMNELNLIRCKHIPKRPGCDWHDLPDEKVKLST 1453
Query: 1443 GQMHDLIPWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILT 1502
GQM DLIPWCLPNTAKRHNQWKGL+GRLDWEGNFPTS+TDPQPMGKVGMCFHPEQDRI+T
Sbjct: 1454 GQMVDLIPWCLPNTAKRHNQWKGLYGRLDWEGNFPTSVTDPQPMGKVGMCFHPEQDRIIT 1513
Query: 1503 VRECARSQGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMK 1544
VRECARSQGFPDSY+FAGNIQ+KHRQIGNAVPPPLAYALGRKL++AI+ K
Sbjct: 1514 VRECARSQGFPDSYRFAGNIQNKHRQIGNAVPPPLAYALGRKLKQAIDAK 1526
BLAST of HG10006921 vs. ExPASy Swiss-Prot
Match:
O23273 (DNA (cytosine-5)-methyltransferase 4 OS=Arabidopsis thaliana OX=3702 GN=MET4 PE=1 SV=1)
HSP 1 Score: 1749.2 bits (4529), Expect = 0.0e+00
Identity = 889/1532 (58.03%), Postives = 1128/1532 (73.63%), Query Frame = 0
Query: 26 EVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAVEDEIA 85
E K +++ RS + V + PKRAAAC++FK K++++ +KS +E K+E + +EI
Sbjct: 4 ETKAGKQKKRSVDSDDDVSKERRPKRAAACTNFKEKSLRISDKSETVEAKKEQILAEEIV 63
Query: 86 AVGLTTS--GQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTPDKEK 145
A+ LT+S DD RPNRRLT+F HD +G PQ VEMLE+ D+FI G++LP D +K++
Sbjct: 64 AIQLTSSLESNDDPRPNRRLTDFVLHDSEGVPQPVEMLELGDIFIEGVVLPLGD--EKKE 123
Query: 146 NKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKANACV 205
KGVR + FGR+E+W ISGYEDGSP IWIST +ADYDC +P+ YKKLY+ F+EKA ACV
Sbjct: 124 EKGVRFQSFGRVENWNISGYEDGSPVIWISTALADYDCRKPSKKYKKLYDYFFEKACACV 183
Query: 206 EVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFILLQGEFIYNQLIGLD 265
EV+K L+K NPD +L+ELLA V RS++ S+ F +G ++++F++ QGEFIYNQL GLD
Sbjct: 184 EVFKSLSK----NPDTSLDELLAAVSRSMSGSKIFSSGGAIQEFVISQGEFIYNQLAGLD 243
Query: 266 DTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPPNIPGF 325
+T+K ++ F + VL +LRD + N + A +N+ L+I + + + ++
Sbjct: 244 ETAKNHETCFVENRVLVSLRDH---ESNKIHKA-----LSNVALRIDESKVVTSDHL--V 303
Query: 326 GAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTS---SNKFYIKINEDEIANDYPLPAFY 385
EDED+K AKL+QEEEY +S ++ +N+RS+T+ S++FYIKI+EDEIA+DYPLP++Y
Sbjct: 304 DGAEDEDVKYAKLIQEEEYRKSMERSRNKRSSTTSGGSSRFYIKISEDEIADDYPLPSYY 363
Query: 386 KTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCDDIDVTI 445
K TK+E DE V+F+ ++ D DLP LHNW+LYNSDSR+ISLE+LPM+PC +IDVT+
Sbjct: 364 KNTKEETDELVLFEAGYEV-DTRDLPCRTLHNWTLYNSDSRMISLEVLPMRPCAEIDVTV 423
Query: 446 YGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGSSMVFISIRTD 505
+GSG++ DDGSGFCLD D+ S S Q + DG+ I+LS IKEWMIEFG+ M+F+++RTD
Sbjct: 424 FGSGVVAEDDGSGFCLD-DSESSTSTQSNDHDGMNIFLSQIKEWMIEFGAEMIFVTLRTD 483
Query: 506 MAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKISEFHKNNP 565
MAWYRLGKPSKQYAPW+ TV+KT R+ ISI +L +SR +KLS+A++IK++ +N+
Sbjct: 484 MAWYRLGKPSKQYAPWFGTVMKTVRVGISIFNMLMRESRVAKLSYANVIKRLCGLEENDK 543
Query: 566 AYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTKWLV-KKK 625
AYISS VERYVVVHGQIILQ F EYPD I++C F+T L+ KM++ HHTKW++ KKK
Sbjct: 544 AYISSKLLDVERYVVVHGQIILQLFEEYPDKDIKRCPFVTSLASKMQDIHHTKWIIKKKK 603
Query: 626 AVLKQEANMNPRASMKPVTSR-KAMPATTTRLINRIWGEFYSNYLPEDLKEADNNETKED 685
+L++ N+NPRA + PV SR KAM ATTTRL+NRIWGEFYS Y PE EA N E E+
Sbjct: 604 KILQKGKNLNPRAGIAPVVSRMKAMQATTTRLVNRIWGEFYSIYSPEVPSEAINAENVEE 663
Query: 686 ELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGEIVGTTSQGY 745
E EE E EDE EE + E+ +++ ++ P K K + + KWDGEI+G TS G
Sbjct: 664 EELEEVEEEDEN-EEDDPEENELEAVEIQNSPTPKKIKGISEDMEIKWDGEILGKTSAGE 723
Query: 746 PLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGRLVVRGLETV 805
PLY +A V GD+V VG V ++ D+ D I VE+M+E SN KM+HG+L+ RG ETV
Sbjct: 724 PLYGRAFVGGDVVVVGSAVILEVDDQDDTQLICFVEFMFESSNHSKMLHGKLLQRGSETV 783
Query: 806 LGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKADEERAEERK 865
LG AA ERE+FLTN+CL +L +I+ TV +E R WG+Q+RK N+ DK D RAEERK
Sbjct: 784 LGMAANERELFLTNECLTVQLKDIKGTVSLEIRSRLWGHQYRKENIDVDKLDRARAEERK 843
Query: 866 RRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-DIMKLHSSLMSF 925
GLP ++YCKSLY PE+G FF LP+ MGLG+G C SCK++E + E KL+ S F
Sbjct: 844 TNGLPTDYYCKSLYSPERGGFFSLPRNDMGLGSGFCSSCKIRENEEERSKTKLNDSKTGF 903
Query: 926 TYRGIDYSVNDYVYLAPHHFGTD--ERGIE--TFKGGRNVGLNAYVVCQLLGIESPKGSK 985
GI+Y D+VY+ P++ D ++G T K GRNVGL A+VVCQLL + + S+
Sbjct: 904 LSNGIEYHNGDFVYVLPNYITKDGLKKGSRRTTLKCGRNVGLKAFVVCQLLDVIVLEESR 963
Query: 986 QPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKKQDIP 1045
+ S S VK+ RF+RPEDIS EKAY SDI+ELYYS +T P AI+GKCEVRKK D+P
Sbjct: 964 KASKASFQVKLTRFYRPEDISEEKAYASDIQELYYSQDTYILPPEAIQGKCEVRKKSDMP 1023
Query: 1046 MANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKEGESV 1105
+ I DH FFCE YD G +K+ PAN+KL S+ + + R+KKGK E +
Sbjct: 1024 LCREYPILDHIFFCEVFYDSSTGYLKQFPANMKLKFSTIK--DETLLREKKGKGVETGTS 1083
Query: 1106 PDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSLNHPE 1165
++P E LATLDIFAGCGGLS GL+ AGVS TKWAIEYEEPAG AF NHPE
Sbjct: 1084 SGMLMKPDEVPKEKPLATLDIFAGCGGLSHGLENAGVSTTKWAIEYEEPAGHAFKQNHPE 1143
Query: 1166 ALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQVEFINGGPP 1225
A FV+NCNVILRA+M+ CGD DDC+ST EA E+A KLD+ + + LP PGQV+FINGGPP
Sbjct: 1144 ATVFVDNCNVILRAIMEKCGDVDDCVSTVEAAELAAKLDENQKSTLPLPGQVDFINGGPP 1203
Query: 1226 CQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFRLTLA 1285
CQGFSGMNRF+ +WSKVQCEMILAFLSFA+YFRPKYFLLENV+ FV++NKG+TF+LT+A
Sbjct: 1204 CQGFSGMNRFSHGSWSKVQCEMILAFLSFADYFRPKYFLLENVKKFVTYNKGRTFQLTMA 1263
Query: 1286 SLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSPELKISLSG 1345
SLLEMGYQVRFGILEAG YG+SQ RKR IWAASPEE+LPEWPEPMHVF +P KISL
Sbjct: 1264 SLLEMGYQVRFGILEAGTYGVSQPRKRVIIWAASPEEVLPEWPEPMHVFDNPGSKISLPR 1323
Query: 1346 NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQKKIRGDVLV 1405
RY A +T GAPFR ITVRDTIGDLP V NG S EY + P SWFQKKIRG++ V
Sbjct: 1324 GLRYDAGCNTKFGAPFRSITVRDTIGDLPPVENGESKINKEYGTTPASWFQKKIRGNMSV 1383
Query: 1406 LNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLPNTAKRHNQ 1465
L DHI K +NELNLIRC+KIPKRPG+DWRDLPDE V LS G + L P L TAK HN+
Sbjct: 1384 LTDHICKGLNELNLIRCKKIPKRPGADWRDLPDENVTLSNGLVEKLRPLALSKTAKNHNE 1443
Query: 1466 WKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKFAGNI 1525
WKGL+GRLDW+GN P SITDPQPMGKVGMCFHPEQDRI+TVRECARSQGFPDSY+F+G
Sbjct: 1444 WKGLYGRLDWQGNLPISITDPQPMGKVGMCFHPEQDRIITVRECARSQGFPDSYEFSGTT 1503
Query: 1526 QHKHRQIGNAVPPPLAYALGRKLREAIEMKSS 1546
+HKHRQIGNAVPPPLA+ALGRKL+EA+ +KSS
Sbjct: 1504 KHKHRQIGNAVPPPLAFALGRKLKEALYLKSS 1514
BLAST of HG10006921 vs. ExPASy Swiss-Prot
Match:
Q9M0S8 (DNA (cytosine-5)-methyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=MET2 PE=2 SV=1)
HSP 1 Score: 1735.7 bits (4494), Expect = 0.0e+00
Identity = 886/1535 (57.72%), Postives = 1127/1535 (73.42%), Query Frame = 0
Query: 26 EVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAVEDEIA 85
E K +++ RS V + PKRAAAC +FK K +++ +KS +E K+E V +EI
Sbjct: 2 ETKVGKQKKRSVDSNDDVSKERRPKRAAACRNFKEKPLRISDKSETVEAKKEQNVVEEIV 61
Query: 86 AVGLTTS--GQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTPDKEK 145
A+ LT+S DD RPNRRLT+F H+ DG PQ VEMLE+ D+F+ G++LP D DK +
Sbjct: 62 AIQLTSSLESNDDPRPNRRLTDFVLHNSDGVPQPVEMLELGDIFLEGVVLPLGD--DKNE 121
Query: 146 NKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKANACV 205
KGVR + FGR+E+W ISGYEDGSP IWIST +ADYDC +PA YKK+Y+ F+EKA ACV
Sbjct: 122 EKGVRFQSFGRVENWNISGYEDGSPGIWISTALADYDCRKPASKYKKIYDYFFEKACACV 181
Query: 206 EVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFILLQGEFIYNQLIGLD 265
EV+K L+K NPD +L+ELLA V RS++ S+ F +G ++++F++ QGEFIYNQL GLD
Sbjct: 182 EVFKSLSK----NPDTSLDELLAAVARSMSGSKIFSSGGAIQEFVISQGEFIYNQLAGLD 241
Query: 266 DTSKKNDQIFTDLPVLCALRD-ESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPPNIPG 325
+T+K ++ F + VL +LRD ES K L +N+ L+I + + + ++
Sbjct: 242 ETAKNHETCFVENSVLVSLRDHESSKIHKAL---------SNVALRIDESQLVKSDHL-- 301
Query: 326 FGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRS--TTSSNKFYIKINEDEIANDYPLPAFY 385
E ED++ AKL+QEEEY S ++ +N+RS T++SNKFYIKINE EIANDYPLP++Y
Sbjct: 302 VDGAEAEDVRYAKLIQEEEYRISMERSRNKRSSTTSASNKFYIKINEHEIANDYPLPSYY 361
Query: 386 KTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCDDIDVTI 445
K TK+E DE ++F+ ++ D DLP LHNW+LYNSDSR+ISLE+LPM+PC +IDVT+
Sbjct: 362 KNTKEETDELLLFEPGYEV-DTRDLPCRTLHNWALYNSDSRMISLEVLPMRPCAEIDVTV 421
Query: 446 YGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGSSMVFISIRTD 505
+GSG++ DDGSGFCLD D+ S S Q DG+ I+LS IKEWMIEFG+ M+F+++RTD
Sbjct: 422 FGSGVVAEDDGSGFCLD-DSESSTSTQSNVHDGMNIFLSQIKEWMIEFGAEMIFVTLRTD 481
Query: 506 MAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKISEFHKNNP 565
MAWYRLGKPSKQYAPW+ETV+KT R+AISI +L +SR +KLS+A++IK++ +N+
Sbjct: 482 MAWYRLGKPSKQYAPWFETVMKTVRVAISIFNMLMRESRVAKLSYANVIKRLCGLEENDK 541
Query: 566 AYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTKWLVK-KK 625
AYISS VERYVVVHGQIILQ F EYPD I++C F+TGL+ KM++ HHTKW++K KK
Sbjct: 542 AYISSKLLDVERYVVVHGQIILQLFEEYPDKDIKRCPFVTGLASKMQDIHHTKWIIKRKK 601
Query: 626 AVLKQEANMNPRASMKPVTSR-KAMPATTTRLINRIWGEFYSNYLPEDLKEADNNETKED 685
+L++ N+NPRA + V +R K M ATTTRL+NRIWGEFYS Y PE EA +E +E+
Sbjct: 602 KILQKGKNLNPRAGLAHVVTRMKPMQATTTRLVNRIWGEFYSIYSPEVPSEA-IHEVEEE 661
Query: 686 ELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGEIVGTTSQGY 745
E+EE+EE ++ E +++EEE +V K P K ++ + + KW+GEI+G TS G
Sbjct: 662 EIEEDEEEDENEEDDIEEEAVEVQ---KSHTP--KKSRGNSEDMEIKWNGEILGETSDGE 721
Query: 746 PLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGRLVVRGLETV 805
PLY +A+V G+ VAVG V ++ D+ + PAIY VE+M+E S+ KM+HG+L+ RG ETV
Sbjct: 722 PLYGRALVGGETVAVGSAVILEVDDPDETPAIYFVEFMFESSDQCKMLHGKLLQRGSETV 781
Query: 806 LGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKADEERAEERK 865
+G AA ERE+FLTN+CL L +I+ TV ++ RPWG+Q+RK N+ DK D RAEERK
Sbjct: 782 IGTAANERELFLTNECLTVHLKDIKGTVSLDIRSRPWGHQYRKENLVVDKLDRARAEERK 841
Query: 866 RRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-DIMKLHSSLMSF 925
GLP E+YCKSLY PE+G FF LP+ +GLG+G C SCK+KE + E KL+ S
Sbjct: 842 ANGLPTEYYCKSLYSPERGGFFSLPRNDIGLGSGFCSSCKIKEEEEERSKTKLNISKTGV 901
Query: 926 TYRGIDYSVNDYVYLAPHHF-------GTDERGIETFKGGRNVGLNAYVVCQLLGIESPK 985
GI+Y D+VY+ P++ GT R T K GRNVGL A+VVCQLL + +
Sbjct: 902 FSNGIEYYNGDFVYVLPNYITKDGLKKGTSRR--TTLKCGRNVGLKAFVVCQLLDVIVLE 961
Query: 986 GSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKKQ 1045
S++ S S VK+ RF+RPEDIS EKAY SDI+ELYYS +T P A++GKCEVRKK
Sbjct: 962 ESRKASNASFQVKLTRFYRPEDISEEKAYASDIQELYYSHDTYILPPEALQGKCEVRKKN 1021
Query: 1046 DIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKEG 1105
D+P+ I DH FFCE YD G +K+ PAN+KL S+ + + R+KKGK E
Sbjct: 1022 DMPLCREYPILDHIFFCEVFYDSSTGYLKQFPANMKLKFSTIK--DETLLREKKGKGVET 1081
Query: 1106 ESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSLN 1165
+ ++P E RLATLDIFAGCGGLS GL++AGVS TKWAIEYEEPAG AF N
Sbjct: 1082 GTSSGILMKPDEVPKEMRLATLDIFAGCGGLSHGLEKAGVSNTKWAIEYEEPAGHAFKQN 1141
Query: 1166 HPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQVEFING 1225
HPEA FV+NCNVILRA+M+ CGD DDC+ST EA E+ KLD+ + + LP PGQ +FI+G
Sbjct: 1142 HPEATVFVDNCNVILRAIMEKCGDVDDCVSTVEAAELVAKLDENQKSTLPLPGQADFISG 1201
Query: 1226 GPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFRL 1285
GPPCQGFSGMNRF+ +WSKVQCEMILAFLSFA+YFRPKYFLLENV+ FV++NKG+TF+L
Sbjct: 1202 GPPCQGFSGMNRFSDGSWSKVQCEMILAFLSFADYFRPKYFLLENVKKFVTYNKGRTFQL 1261
Query: 1286 TLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSPELKIS 1345
T+ASLLE+GYQVRFGILEAG YG+SQ RKR IWAASPEE+LPEWPEPMHVF +P KIS
Sbjct: 1262 TMASLLEIGYQVRFGILEAGTYGVSQPRKRVIIWAASPEEVLPEWPEPMHVFDNPGSKIS 1321
Query: 1346 LSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQKKIRGD 1405
L Y V +T GAPFR ITVRDTIGDLP V NG S EY++ P SWFQKKIRG+
Sbjct: 1322 LPRGLHYDTVRNTKFGAPFRSITVRDTIGDLPLVENGESKINKEYRTTPVSWFQKKIRGN 1381
Query: 1406 VLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLPNTAKR 1465
+ VL DHI K +NELNLIRC+KIPKRPG+DWRDLPDE V LS G + L P L TAK
Sbjct: 1382 MSVLTDHICKGLNELNLIRCKKIPKRPGADWRDLPDENVTLSNGLVEKLRPLALSKTAKN 1441
Query: 1466 HNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKFA 1525
HN+WKGL+GRLDW+GN P SITDPQPMGKVGMCFHPEQDRI+TVRECARSQGFPDSY+F+
Sbjct: 1442 HNEWKGLYGRLDWQGNLPISITDPQPMGKVGMCFHPEQDRIITVRECARSQGFPDSYEFS 1501
Query: 1526 GNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSS 1546
G +HKHRQIGNAVPPPLA+ALGRKL+EA+ +KSS
Sbjct: 1502 GTTKHKHRQIGNAVPPPLAFALGRKLKEALYLKSS 1507
BLAST of HG10006921 vs. ExPASy TrEMBL
Match:
A0A1S3CM89 (DNA (cytosine-5)-methyltransferase OS=Cucumis melo OX=3656 GN=LOC103502549 PE=3 SV=1)
HSP 1 Score: 2999.5 bits (7775), Expect = 0.0e+00
Identity = 1479/1550 (95.42%), Postives = 1507/1550 (97.23%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDLK+SD+KNVK VKPKQKRNR E GEQ VGVRKMPKRAAACSDFKV
Sbjct: 1 MAKKTRSQLMATSNDLKESDTKNVKAVKPKQKRNRLENGEQVVGVRKMPKRAAACSDFKV 60
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTIQLPEKSS+IECKREV VEDEIAAVGLTT GQDDSRPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 61 KTIQLPEKSSIIECKREVTVEDEIAAVGLTTLGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVNDLFISG+ILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD
Sbjct: 121 LEVNDLFISGVILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CVRPAGGYKKLYNIFYEKANACVEVYKKLA+TSGGNPDLTLEELL GVVRSLNSSRNFPA
Sbjct: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLARTSGGNPDLTLEELLGGVVRSLNSSRNFPA 240
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAGTFD
Sbjct: 241 GMSVKDFIFLQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGTFD 300
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
GFTNLGLKIKDGEQLNPPNI G G EEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK
Sbjct: 301 GFTNLGLKIKDGEQLNPPNITGSGVEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMD+CDPDDLPRSMLHNWSLYNSDS
Sbjct: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDVCDPDDLPRSMLHNWSLYNSDS 420
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSA
Sbjct: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSA 480
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAISIITLLKEQSRA
Sbjct: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAISIITLLKEQSRA 540
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSFA IIKKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT
Sbjct: 541 SKLSFAVIIKKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTT+LINRIWGEFYS
Sbjct: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTKLINRIWGEFYS 660
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NY PEDLKEADNNETKEDE EEEEEVEDEE EEVEEED QVDLKTKESKPVVKPAKAKLS
Sbjct: 661 NYSPEDLKEADNNETKEDEPEEEEEVEDEESEEVEEEDVQVDLKTKESKPVVKPAKAKLS 720
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
EG NKWDG+IVG TSQGYPLYKQAIVHGDLVAVGGFV V+TDNVHDLPA+YLVEYMYEKS
Sbjct: 721 EGNNKWDGKIVGKTSQGYPLYKQAIVHGDLVAVGGFVYVETDNVHDLPALYLVEYMYEKS 780
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
N +KMVHGRL+VRGLETVLGNAAKEREVFLTNDCL+FELNEIRETVVVESCMRPWGYQHR
Sbjct: 781 NGKKMVHGRLIVRGLETVLGNAAKEREVFLTNDCLEFELNEIRETVVVESCMRPWGYQHR 840
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
KAN K DKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK
Sbjct: 841 KANAKMDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
ETQ ED MKLHSSL SFTYRG DYSVND VYL+PHHFGTDERGIETFKGG+NVGLNAYVV
Sbjct: 901 ETQREDTMKLHSSLTSFTYRGTDYSVNDCVYLSPHHFGTDERGIETFKGGKNVGLNAYVV 960
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIESPKGSKQP P+STMVKVRRFFRPEDISVEKAYCSDIRELYYSDETT PVSAI
Sbjct: 961 CQLLGIESPKGSKQPCPISTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTMMPVSAI 1020
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIP+ANYPAIFDH FFCEHLYDPEKGAIKKLP +VKLSS SERQISDAAQ
Sbjct: 1021 EGKCEVRKKQDIPVANYPAIFDHIFFCEHLYDPEKGAIKKLPGSVKLSSPSERQISDAAQ 1080
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGE +PDE E+KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1081 RKKKGKCKEGEIIPDENENKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLP
Sbjct: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLP 1200
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQV+FGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1261 SFNKGQTFRLTLASLLEMGYQVKFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKISLS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEP
Sbjct: 1321 VFGSPELKISLSDNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPA 1380
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGDV+VLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLS GQMHDLI
Sbjct: 1381 SWFQKKIRGDVIVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSNGQMHDLI 1440
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIE KSSKSVLS
Sbjct: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIETKSSKSVLS 1550
BLAST of HG10006921 vs. ExPASy TrEMBL
Match:
A0A5A7VJ57 (DNA (cytosine-5)-methyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold76G00480 PE=3 SV=1)
HSP 1 Score: 2984.9 bits (7737), Expect = 0.0e+00
Identity = 1470/1541 (95.39%), Postives = 1498/1541 (97.21%), Query Frame = 0
Query: 10 MVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKS 69
M TSNDLK+SD+KNVK VKPKQKRNR E GEQ VGVRKMPKRAAACSDFKVKTIQLPEKS
Sbjct: 1 MATSNDLKESDTKNVKAVKPKQKRNRLENGEQVVGVRKMPKRAAACSDFKVKTIQLPEKS 60
Query: 70 SVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFIS 129
S+IECKREV VEDEIAAVGLTT GQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFIS
Sbjct: 61 SIIECKREVTVEDEIAAVGLTTLGQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFIS 120
Query: 130 GIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYK 189
G+ILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYK
Sbjct: 121 GVILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYK 180
Query: 190 KLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFIL 249
KLYNIFYEKANACVEVYKKLA+TSGGNPDLTLEELL GVVRSLNSSRNFPAGMSVKDFI
Sbjct: 181 KLYNIFYEKANACVEVYKKLARTSGGNPDLTLEELLGGVVRSLNSSRNFPAGMSVKDFIF 240
Query: 250 LQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKI 309
LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKI
Sbjct: 241 LQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKI 300
Query: 310 KDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNKFYIKINEDE 369
KDGEQLNPPNI G G EEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNKFYIKINEDE
Sbjct: 301 KDGEQLNPPNITGSGVEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNKFYIKINEDE 360
Query: 370 IANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLP 429
IANDYPLPAFYKTTKDEMDEYVIFDGDMD+CDPDDLPRSMLHNWSLYNSDSRLISLELLP
Sbjct: 361 IANDYPLPAFYKTTKDEMDEYVIFDGDMDVCDPDDLPRSMLHNWSLYNSDSRLISLELLP 420
Query: 430 MKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFG 489
MKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSAIKEWMIEFG
Sbjct: 421 MKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSAIKEWMIEFG 480
Query: 490 SSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHII 549
SSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAISIITLLKEQSRASKLSFA II
Sbjct: 481 SSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAISIITLLKEQSRASKLSFAVII 540
Query: 550 KKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEER 609
KKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEER
Sbjct: 541 KKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEER 600
Query: 610 HHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYSNYLPEDLKE 669
HHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTT+LINRIWGEFYSNY PEDLKE
Sbjct: 601 HHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTKLINRIWGEFYSNYSPEDLKE 660
Query: 670 ADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGE 729
ADNNETKEDE EEEEEVEDEE EEVEEED QVDLKTKESKPVVKPAKAKLSEG NKWDG+
Sbjct: 661 ADNNETKEDEPEEEEEVEDEESEEVEEEDVQVDLKTKESKPVVKPAKAKLSEGNNKWDGK 720
Query: 730 IVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGR 789
IVG TSQGYPLYKQAIVHGDLVAVGGFV V+TDNVHDLPA+YLVEYMYEKSN +KMVHGR
Sbjct: 721 IVGKTSQGYPLYKQAIVHGDLVAVGGFVYVETDNVHDLPALYLVEYMYEKSNGKKMVHGR 780
Query: 790 LVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKA 849
L+VRGLETVLGNAAKEREVFLTNDCL+FELNEIRETVVVESCMRPWGYQHRKAN K DKA
Sbjct: 781 LIVRGLETVLGNAAKEREVFLTNDCLEFELNEIRETVVVESCMRPWGYQHRKANAKMDKA 840
Query: 850 DEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGEDIMK 909
DEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQ ED MK
Sbjct: 841 DEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQREDTMK 900
Query: 910 LHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVVCQLLGIESP 969
LHSSL SFTYRG DYSVND VYL+PHHFGTDERGIETFKGG+NVGLNAYVVCQLLGIESP
Sbjct: 901 LHSSLTSFTYRGTDYSVNDCVYLSPHHFGTDERGIETFKGGKNVGLNAYVVCQLLGIESP 960
Query: 970 KGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKK 1029
KGSKQP P+STMVKVRRFFRPEDISVEKAYCSDIRELYYSDETT PVSAIEGKCEVRKK
Sbjct: 961 KGSKQPCPISTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTMMPVSAIEGKCEVRKK 1020
Query: 1030 QDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKE 1089
QDIP+ANYPAIFDH FFCEHLYDPEKGAIKKLP +VKLSS SERQISDAAQRKKKGKCKE
Sbjct: 1021 QDIPVANYPAIFDHIFFCEHLYDPEKGAIKKLPGSVKLSSPSERQISDAAQRKKKGKCKE 1080
Query: 1090 GESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSL 1149
GE +PDE E+KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSL
Sbjct: 1081 GEIIPDENENKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSL 1140
Query: 1150 NHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQVEFIN 1209
NHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLPRPGQVEFIN
Sbjct: 1141 NHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLPRPGQVEFIN 1200
Query: 1210 GGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFR 1269
GGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFR
Sbjct: 1201 GGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFR 1260
Query: 1270 LTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSPELKI 1329
LTLASLLEMGYQV+FGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVF SPELKI
Sbjct: 1261 LTLASLLEMGYQVKFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFGSPELKI 1320
Query: 1330 SLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQKKIRG 1389
SLS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEP SWFQKKIRG
Sbjct: 1321 SLSDNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPASWFQKKIRG 1380
Query: 1390 DVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLPNTAK 1449
DV+VLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLS GQMHDLIPWCLPNTAK
Sbjct: 1381 DVIVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSNGQMHDLIPWCLPNTAK 1440
Query: 1450 RHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKF 1509
RHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKF
Sbjct: 1441 RHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKF 1500
Query: 1510 AGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
AGNIQHKHRQIGNAVPPPLAYALGRKLREAIE KSSKSVLS
Sbjct: 1501 AGNIQHKHRQIGNAVPPPLAYALGRKLREAIETKSSKSVLS 1541
BLAST of HG10006921 vs. ExPASy TrEMBL
Match:
A0A0A0KPF1 (DNA (cytosine-5)-methyltransferase OS=Cucumis sativus OX=3659 GN=Csa_5G002610 PE=3 SV=1)
HSP 1 Score: 2973.3 bits (7707), Expect = 0.0e+00
Identity = 1471/1550 (94.90%), Postives = 1500/1550 (96.77%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDL+KSD+KNVK VKPKQKRNR E GEQ VGVRKMPKRAA+CSDFKV
Sbjct: 1 MAKKTRSQLMATSNDLEKSDTKNVKAVKPKQKRNRLENGEQVVGVRKMPKRAASCSDFKV 60
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTIQLPEKSSVIECKREV VEDEIAAVGLTT GQDDSRPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 61 KTIQLPEKSSVIECKREVTVEDEIAAVGLTTLGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVNDLFISG+ILPFED DKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD
Sbjct: 121 LEVNDLFISGVILPFEDISDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CVRPA GYKKLYNIFYEKANACVEVYKKLA++SGG PDLTLEELL GVVRSLNSSRNFPA
Sbjct: 181 CVRPAAGYKKLYNIFYEKANACVEVYKKLARSSGGYPDLTLEELLGGVVRSLNSSRNFPA 240
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI+LQGEFIYNQLIGLDDTSKKNDQ+FTDLPVLCALRDESRKQGNLLPNAG FD
Sbjct: 241 GMSVKDFIILQGEFIYNQLIGLDDTSKKNDQVFTDLPVLCALRDESRKQGNLLPNAGIFD 300
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
GFTNLGLKIKDGEQLNPPNI G G EEDEDLKLAKLLQEEEYWRSAKQRK QRSTTSSNK
Sbjct: 301 GFTNLGLKIKDGEQLNPPNILGSGDEEDEDLKLAKLLQEEEYWRSAKQRKTQRSTTSSNK 360
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMD+CDPDDLPRSMLHNWSLYNSDS
Sbjct: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDVCDPDDLPRSMLHNWSLYNSDS 420
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCS+QMQNTDGIPIYLSA
Sbjct: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSSQMQNTDGIPIYLSA 480
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVLKTARLAI II LLKEQSRA
Sbjct: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLKTARLAICIIMLLKEQSRA 540
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSFA IIKKISEF KNNPAYISS PSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT
Sbjct: 541 SKLSFAVIIKKISEFDKNNPAYISSIPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLV+KKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS
Sbjct: 601 GLSDKMEERHHTKWLVRKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NY PEDLK ADNNETKEDE EEEEEVEDEE EEVEEEDGQVDLKTKESKPVVKPAKAKLS
Sbjct: 661 NYSPEDLKAADNNETKEDEPEEEEEVEDEESEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
EG NKWDG++VG TS+GYPLYKQAIVHGDLVAVGGFVSV+TDNVHDLPAIYLVEYMYEKS
Sbjct: 721 EGNNKWDGKMVGKTSEGYPLYKQAIVHGDLVAVGGFVSVETDNVHDLPAIYLVEYMYEKS 780
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
N RKMVHGRL+VRGLETVLGNAAKEREVFLTNDCL+FELNEIRE VVVESCMRPWGYQHR
Sbjct: 781 NGRKMVHGRLLVRGLETVLGNAAKEREVFLTNDCLEFELNEIREAVVVESCMRPWGYQHR 840
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
KAN KKDKA+EERAEERK RGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK
Sbjct: 841 KANAKKDKAEEERAEERKHRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
ETQ ED MKLHSSL SFTYRG DYSVND VYLAPHHFGTDERGIETFKGGRNV LNAYVV
Sbjct: 901 ETQREDTMKLHSSLTSFTYRGTDYSVNDCVYLAPHHFGTDERGIETFKGGRNVVLNAYVV 960
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIESPKGSKQP PVSTMV+VRRFFRPEDISVEKAYCSDIRELYYSDETT KPVSAI
Sbjct: 961 CQLLGIESPKGSKQPCPVSTMVQVRRFFRPEDISVEKAYCSDIRELYYSDETTMKPVSAI 1020
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIP+AN PAIFDH FFCEHLYDPEKGAIKKLPA+VKLSS SERQISDAAQ
Sbjct: 1021 EGKCEVRKKQDIPVANCPAIFDHIFFCEHLYDPEKGAIKKLPASVKLSSPSERQISDAAQ 1080
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGE +PDE E+KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1081 RKKKGKCKEGEIIPDENENKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIE+AEKLDDKEINNLP
Sbjct: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIELAEKLDDKEINNLP 1200
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQV+FGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1261 SFNKGQTFRLTLASLLEMGYQVKFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKISLS NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASV TMEYKSEP
Sbjct: 1321 VFGSPELKISLSDNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVITMEYKSEPA 1380
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGDVLVLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLS GQMHDLI
Sbjct: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSNGQMHDLI 1440
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRL+WEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1441 PWCLPNTAKRHNQWKGLFGRLEWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1551
QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS
Sbjct: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKSVLS 1550
BLAST of HG10006921 vs. ExPASy TrEMBL
Match:
A0A6J1EP47 (DNA (cytosine-5)-methyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436438 PE=3 SV=1)
HSP 1 Score: 2906.3 bits (7533), Expect = 0.0e+00
Identity = 1429/1547 (92.37%), Postives = 1489/1547 (96.25%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDLKKSD+++VK VKPKQKR+RSET EQ VGVRKMPKRAAACSDFKV
Sbjct: 1 MAKKTRSQLMATSNDLKKSDTQDVKPVKPKQKRSRSETKEQVVGVRKMPKRAAACSDFKV 60
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTI LPEKSSVIECKREVAVEDEIAAVGLT SG DD RPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 61 KTIPLPEKSSVIECKREVAVEDEIAAVGLTASGMDDFRPNRRLTEFTFHDEDGKPQAVEM 120
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVN+LFISG ILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDG PTIWISTD+ADYD
Sbjct: 121 LEVNNLFISGTILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGLPTIWISTDIADYD 180
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CV+PAG Y+KLYNIFYEKANACVEVYKKLA+TSGGNPDLTL+EL+AGVVRSLNSSRNFPA
Sbjct: 181 CVKPAGSYRKLYNIFYEKANACVEVYKKLARTSGGNPDLTLDELVAGVVRSLNSSRNFPA 240
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI+LQGEFIYNQL+GLD+TSK+NDQIF DLPVLCALRDESRKQ NL P+ G FD
Sbjct: 241 GMSVKDFIILQGEFIYNQLVGLDETSKRNDQIFIDLPVLCALRDESRKQRNLPPSTGIFD 300
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
GFTN+GLKIKDGEQ NPPN+PGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTT SNK
Sbjct: 301 GFTNVGLKIKDGEQFNPPNMPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTT-SNK 360
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYN+DS
Sbjct: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNADS 420
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTS S SAQ+Q+TDGIPIYLSA
Sbjct: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSVSSSAQIQDTDGIPIYLSA 480
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVL+TARLAISIIT+LKEQSRA
Sbjct: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLRTARLAISIITILKEQSRA 540
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSF+ IIK++SEF KNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDD+IRKCAFIT
Sbjct: 541 SKLSFSDIIKRVSEFDKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDVIRKCAFIT 600
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLVKKKAVLK+E NMNPRASM PVTS++AMPATTTRLINRIWGEFYS
Sbjct: 601 GLSDKMEERHHTKWLVKKKAVLKKEVNMNPRASMNPVTSKRAMPATTTRLINRIWGEFYS 660
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NYLP+DLKEADNNETKEDELEE+EEVEDEELEEVEEEDGQVDLKT+ESKPVVKPAKAKLS
Sbjct: 661 NYLPDDLKEADNNETKEDELEEDEEVEDEELEEVEEEDGQVDLKTEESKPVVKPAKAKLS 720
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
EGK W+ EIVGTTSQGYPLYKQAIVHG+LV VGGFVSV+ DNV DLPAI LVEYMYE S
Sbjct: 721 EGKINWESEIVGTTSQGYPLYKQAIVHGNLVTVGGFVSVEPDNVDDLPAICLVEYMYENS 780
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCL+ EL+EI+ETV +E CMRPWGYQHR
Sbjct: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLELELSEIKETVDIEICMRPWGYQHR 840
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
K N KK KADEE AEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK
Sbjct: 841 KDNAKKVKADEEIAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
ETQ ED ++LHSSL SF Y+G DYSVNDYVYLAPHHFGTD+RGIETFKGGRNVGLNAYVV
Sbjct: 901 ETQKEDTIELHSSLTSFKYKGTDYSVNDYVYLAPHHFGTDDRGIETFKGGRNVGLNAYVV 960
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIESPKGSKQP P+S MVKVRRFFRPEDISVEKAYCSDIRELYYSDE + KPVSAI
Sbjct: 961 CQLLGIESPKGSKQPCPLSAMVKVRRFFRPEDISVEKAYCSDIRELYYSDEISVKPVSAI 1020
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIPM+N P IFDH FFCEHLYDPEKGAIKKLPANVKLSSSSER +SDAAQ
Sbjct: 1021 EGKCEVRKKQDIPMSNCPVIFDHIFFCEHLYDPEKGAIKKLPANVKLSSSSERPMSDAAQ 1080
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGES+PDET++KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1081 RKKKGKCKEGESIPDETDNKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVM ACGDADDC+STSEAIE+AEKLD+KEINNLP
Sbjct: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMTACGDADDCVSTSEAIELAEKLDEKEINNLP 1200
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQ TWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1201 RPGQVEFINGGPPCQGFSGMNRFNQRTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKISLSGNTRYAAV STAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT
Sbjct: 1321 VFGSPELKISLSGNTRYAAVSSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGD+LVLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLSTGQMHD+I
Sbjct: 1381 SWFQKKIRGDILVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSTGQMHDMI 1440
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKS 1548
QGFPD+Y FAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMK SKS
Sbjct: 1501 QGFPDNYIFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKMSKS 1546
BLAST of HG10006921 vs. ExPASy TrEMBL
Match:
A0A6J1EPN0 (DNA (cytosine-5)-methyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436438 PE=3 SV=1)
HSP 1 Score: 2906.3 bits (7533), Expect = 0.0e+00
Identity = 1429/1547 (92.37%), Postives = 1489/1547 (96.25%), Query Frame = 0
Query: 1 MAKKTRSQLMVTSNDLKKSDSKNVKEVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKV 60
MAKKTRSQLM TSNDLKKSD+++VK VKPKQKR+RSET EQ VGVRKMPKRAAACSDFKV
Sbjct: 3 MAKKTRSQLMATSNDLKKSDTQDVKPVKPKQKRSRSETKEQVVGVRKMPKRAAACSDFKV 62
Query: 61 KTIQLPEKSSVIECKREVAVEDEIAAVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEM 120
KTI LPEKSSVIECKREVAVEDEIAAVGLT SG DD RPNRRLTEFTFHDEDGKPQAVEM
Sbjct: 63 KTIPLPEKSSVIECKREVAVEDEIAAVGLTASGMDDFRPNRRLTEFTFHDEDGKPQAVEM 122
Query: 121 LEVNDLFISGIILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYD 180
LEVN+LFISG ILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDG PTIWISTD+ADYD
Sbjct: 123 LEVNNLFISGTILPFEDTPDKEKNKGVRCEGFGRIESWTISGYEDGLPTIWISTDIADYD 182
Query: 181 CVRPAGGYKKLYNIFYEKANACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPA 240
CV+PAG Y+KLYNIFYEKANACVEVYKKLA+TSGGNPDLTL+EL+AGVVRSLNSSRNFPA
Sbjct: 183 CVKPAGSYRKLYNIFYEKANACVEVYKKLARTSGGNPDLTLDELVAGVVRSLNSSRNFPA 242
Query: 241 GMSVKDFILLQGEFIYNQLIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFD 300
GMSVKDFI+LQGEFIYNQL+GLD+TSK+NDQIF DLPVLCALRDESRKQ NL P+ G FD
Sbjct: 243 GMSVKDFIILQGEFIYNQLVGLDETSKRNDQIFIDLPVLCALRDESRKQRNLPPSTGIFD 302
Query: 301 GFTNLGLKIKDGEQLNPPNIPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSSNK 360
GFTN+GLKIKDGEQ NPPN+PGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTT SNK
Sbjct: 303 GFTNVGLKIKDGEQFNPPNMPGFGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTT-SNK 362
Query: 361 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDS 420
FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYN+DS
Sbjct: 363 FYIKINEDEIANDYPLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNADS 422
Query: 421 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSA 480
RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTS S SAQ+Q+TDGIPIYLSA
Sbjct: 423 RLISLELLPMKPCDDIDVTIYGSGIMTADDGSGFCLDADTSVSSSAQIQDTDGIPIYLSA 482
Query: 481 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRA 540
IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWY+TVL+TARLAISIIT+LKEQSRA
Sbjct: 483 IKEWMIEFGSSMVFISIRTDMAWYRLGKPSKQYAPWYQTVLRTARLAISIITILKEQSRA 542
Query: 541 SKLSFAHIIKKISEFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFIT 600
SKLSF+ IIK++SEF KNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDD+IRKCAFIT
Sbjct: 543 SKLSFSDIIKRVSEFDKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDVIRKCAFIT 602
Query: 601 GLSDKMEERHHTKWLVKKKAVLKQEANMNPRASMKPVTSRKAMPATTTRLINRIWGEFYS 660
GLSDKMEERHHTKWLVKKKAVLK+E NMNPRASM PVTS++AMPATTTRLINRIWGEFYS
Sbjct: 603 GLSDKMEERHHTKWLVKKKAVLKKEVNMNPRASMNPVTSKRAMPATTTRLINRIWGEFYS 662
Query: 661 NYLPEDLKEADNNETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLS 720
NYLP+DLKEADNNETKEDELEE+EEVEDEELEEVEEEDGQVDLKT+ESKPVVKPAKAKLS
Sbjct: 663 NYLPDDLKEADNNETKEDELEEDEEVEDEELEEVEEEDGQVDLKTEESKPVVKPAKAKLS 722
Query: 721 EGKNKWDGEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKS 780
EGK W+ EIVGTTSQGYPLYKQAIVHG+LV VGGFVSV+ DNV DLPAI LVEYMYE S
Sbjct: 723 EGKINWESEIVGTTSQGYPLYKQAIVHGNLVTVGGFVSVEPDNVDDLPAICLVEYMYENS 782
Query: 781 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHR 840
NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCL+ EL+EI+ETV +E CMRPWGYQHR
Sbjct: 783 NARKMVHGRLVVRGLETVLGNAAKEREVFLTNDCLELELSEIKETVDIEICMRPWGYQHR 842
Query: 841 KANVKKDKADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 900
K N KK KADEE AEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK
Sbjct: 843 KDNAKKVKADEEIAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLK 902
Query: 901 ETQGEDIMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIETFKGGRNVGLNAYVV 960
ETQ ED ++LHSSL SF Y+G DYSVNDYVYLAPHHFGTD+RGIETFKGGRNVGLNAYVV
Sbjct: 903 ETQKEDTIELHSSLTSFKYKGTDYSVNDYVYLAPHHFGTDDRGIETFKGGRNVGLNAYVV 962
Query: 961 CQLLGIESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAI 1020
CQLLGIESPKGSKQP P+S MVKVRRFFRPEDISVEKAYCSDIRELYYSDE + KPVSAI
Sbjct: 963 CQLLGIESPKGSKQPCPLSAMVKVRRFFRPEDISVEKAYCSDIRELYYSDEISVKPVSAI 1022
Query: 1021 EGKCEVRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQ 1080
EGKCEVRKKQDIPM+N P IFDH FFCEHLYDPEKGAIKKLPANVKLSSSSER +SDAAQ
Sbjct: 1023 EGKCEVRKKQDIPMSNCPVIFDHIFFCEHLYDPEKGAIKKLPANVKLSSSSERPMSDAAQ 1082
Query: 1081 RKKKGKCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1140
RKKKGKCKEGES+PDET++KKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE
Sbjct: 1083 RKKKGKCKEGESIPDETDNKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYE 1142
Query: 1141 EPAGEAFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLP 1200
EPAGEAFSLNHPEALTFVNNCNVILRAVM ACGDADDC+STSEAIE+AEKLD+KEINNLP
Sbjct: 1143 EPAGEAFSLNHPEALTFVNNCNVILRAVMTACGDADDCVSTSEAIELAEKLDEKEINNLP 1202
Query: 1201 RPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1260
RPGQVEFINGGPPCQGFSGMNRFNQ TWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV
Sbjct: 1203 RPGQVEFINGGPPCQGFSGMNRFNQRTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFV 1262
Query: 1261 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1320
SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH
Sbjct: 1263 SFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMH 1322
Query: 1321 VFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1380
VF SPELKISLSGNTRYAAV STAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT
Sbjct: 1323 VFGSPELKISLSGNTRYAAVSSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPT 1382
Query: 1381 SWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLI 1440
SWFQKKIRGD+LVLNDHISKEMNELNLIRCQ+IPKRPG+DWRDLPDEKVRLSTGQMHD+I
Sbjct: 1383 SWFQKKIRGDILVLNDHISKEMNELNLIRCQRIPKRPGADWRDLPDEKVRLSTGQMHDMI 1442
Query: 1441 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1500
PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS
Sbjct: 1443 PWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARS 1502
Query: 1501 QGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSSKS 1548
QGFPD+Y FAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMK SKS
Sbjct: 1503 QGFPDNYIFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKMSKS 1548
BLAST of HG10006921 vs. TAIR 10
Match:
AT5G49160.1 (methyltransferase 1 )
HSP 1 Score: 1780.8 bits (4611), Expect = 0.0e+00
Identity = 920/1540 (59.74%), Postives = 1150/1540 (74.68%), Query Frame = 0
Query: 22 KNVKEVKPKQKRNRSETGE-QAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAV 81
+N + ++KR E E + V + P+RAAAC+ FK K+I++ EKS+ IE K++ V
Sbjct: 3 ENGAKAAKRKKRPLPEIQEVEDVPRTRRPRRAAACTSFKEKSIRVCEKSATIEVKKQQIV 62
Query: 82 EDEIAAVGLTTSGQD-DSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTP 141
E+E A+ LT D + RP RRL +F D DG PQ +EMLE++D+F+SG ILP +
Sbjct: 63 EEEFLALRLTALETDVEDRPTRRLNDFVLFDSDGVPQPLEMLEIHDIFVSGAILPSDVCT 122
Query: 142 DKEKNKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKA 201
DKEK KGVRC FGR+E W+ISGYEDGSP IWIST++ADYDC +PA Y+K+Y+ FYEKA
Sbjct: 123 DKEKEKGVRCTSFGRVEHWSISGYEDGSPVIWISTELADYDCRKPAASYRKVYDYFYEKA 182
Query: 202 NACVEVYKKLAKTSGGNPDLTLEELLAGVVRSLNS-SRNFPAGMSVKDFILLQGEFIYNQ 261
A V VYKKL+K+SGG+PD+ LEELLA VVRS++S S+ F +G ++ DF++ QG+FIYNQ
Sbjct: 183 RASVAVYKKLSKSSGGDPDIGLEELLAAVVRSMSSGSKYFSSGAAIIDFVISQGDFIYNQ 242
Query: 262 LIGLDDTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPP 321
L GLD+T+KK++ + ++PVL ALR++S K L + + G++IK+ Q+
Sbjct: 243 LAGLDETAKKHESSYVEIPVLVALREKSSKIDKPLQR----ERNPSNGVRIKEVSQVAES 302
Query: 322 NI---PGFGAEEDEDLKLAKLLQEEEYWRSAKQ-RKNQRSTTSSNKFYIKINEDEIANDY 381
D+D + A LLQ+EE +S +Q RKN S ++SN FYIKINEDEIANDY
Sbjct: 303 EALTSDQLVDGTDDDRRYAILLQDEENRKSMQQPRKNSSSGSASNMFYIKINEDEIANDY 362
Query: 382 PLPAFYKTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCD 441
PLP++YKT+++E DE +++D ++ + LP MLHNW+LYNSD R ISLELLPMK CD
Sbjct: 363 PLPSYYKTSEEETDELILYDASYEV-QSEHLPHRMLHNWALYNSDLRFISLELLPMKQCD 422
Query: 442 DIDVTIYGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGS-SMV 501
DIDV I+GSG++T D+GS L+ S S Q + DG+ I+LS IKEWMIEFGS ++
Sbjct: 423 DIDVNIFGSGVVTDDNGSWISLNDPDSGS---QSHDPDGMCIFLSQIKEWMIEFGSDDII 482
Query: 502 FISIRTDMAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKIS 561
ISIRTD+AWYRLGKPSK YAPW++ VLKTAR+ ISI+T L+ +SR ++LSFA + K++S
Sbjct: 483 SISIRTDVAWYRLGKPSKLYAPWWKPVLKTARVGISILTFLRVESRVARLSFADVTKRLS 542
Query: 562 EFHKNNPAYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTK 621
N+ AYISS P VERY+VVHGQIILQ F+ YPDD +++C F+ GL+ K+E+RHHTK
Sbjct: 543 GLQANDKAYISSDPLAVERYLVVHGQIILQLFAVYPDDNVKRCPFVVGLASKLEDRHHTK 602
Query: 622 WLVKKKAVLKQEANMNPRASMKPVTS-RKAMPATTTRLINRIWGEFYSNYLPEDLKEADN 681
W++KKK + +E N+NPRA M PV S RKAM ATTTRL+NRIWGEFYSNY PED +A
Sbjct: 603 WIIKKKKISLKELNLNPRAGMAPVASKRKAMQATTTRLVNRIWGEFYSNYSPEDPLQATA 662
Query: 682 NETKEDELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVK---PAKAKLSEGKN--KWD 741
E EDE+EEE +EE+EE E E+G + E V K P K + S GK KWD
Sbjct: 663 AENGEDEVEEEGGNGEEEVEE-EGENGLTEDTVPEPVEVQKPHTPKKIRGSSGKREIKWD 722
Query: 742 GEIVGTTSQGYPLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVH 801
GE +G TS G PLY+QA+V G++VAVGG V+++ D+ ++PAIY VEYM+E ++ KM+H
Sbjct: 723 GESLGKTSAGEPLYQQALVGGEMVAVGGAVTLEVDDPDEMPAIYFVEYMFESTDHCKMLH 782
Query: 802 GRLVVRGLETVLGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKD 861
GR + RG TVLGNAA ERE+FLTN+C+ +L +I+ E RPWG+Q+RK N+ D
Sbjct: 783 GRFLQRGSMTVLGNAANERELFLTNECMTTQLKDIKGVASFEIRSRPWGHQYRKKNITAD 842
Query: 862 KADEERAEERKRRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-D 921
K D RA ERK + LP E+YCKSLY PE+G FF LP +G +G C SCK++E + +
Sbjct: 843 KLDWARALERKVKDLPTEYYCKSLYSPERGGFFSLPLSDIGRSSGFCTSCKIREDEEKRS 902
Query: 922 IMKLHSSLMSFTYRGIDYSVNDYVYLAPHHFGTDERGIET-FKGGRNVGLNAYVVCQLLG 981
+KL+ S F GI+YSV D+VY+ P G + G +T FK GRN+GL AYVVCQLL
Sbjct: 903 TIKLNVSKTGFFINGIEYSVEDFVYVNPDSIGGLKEGSKTSFKSGRNIGLRAYVVCQLLE 962
Query: 982 IESPKGSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCE 1041
I PK S++ S VKVRRF+RPED+S EKAY SDI+ELY+S +T P A+EGKCE
Sbjct: 963 I-VPKESRKADLGSFDVKVRRFYRPEDVSAEKAYASDIQELYFSQDTVVLPPGALEGKCE 1022
Query: 1042 VRKKQDIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKG 1101
VRKK D+P++ I DH FFC+ +D KG++K+LPAN+K S+ + D RKKKG
Sbjct: 1023 VRKKSDMPLSREYPISDHIFFCDLFFDTSKGSLKQLPANMKPKFSTIK--DDTLLRKKKG 1082
Query: 1102 KCKEGESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGE 1161
K E E + E + P E RLATLDIFAGCGGLS GL++AGVS KWAIEYEEPAG+
Sbjct: 1083 KGVESE-IESEIVKPVEPPKEIRLATLDIFAGCGGLSHGLKKAGVSDAKWAIEYEEPAGQ 1142
Query: 1162 AFSLNHPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQV 1221
AF NHPE+ FV+NCNVILRA+M+ GD DDC+ST+EA E+A KL +++ + LP PGQV
Sbjct: 1143 AFKQNHPESTVFVDNCNVILRAIMEKGGDQDDCVSTTEANELAAKLTEEQKSTLPLPGQV 1202
Query: 1222 EFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKG 1281
+FINGGPPCQGFSGMNRFNQS+WSKVQCEMILAFLSFA+YFRP+YFLLENVR FVSFNKG
Sbjct: 1203 DFINGGPPCQGFSGMNRFNQSSWSKVQCEMILAFLSFADYFRPRYFLLENVRTFVSFNKG 1262
Query: 1282 QTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSP 1341
QTF+LTLASLLEMGYQVRFGILEAGAYG+SQSRKRAFIWAA+PEE+LPEWPEPMHVF P
Sbjct: 1263 QTFQLTLASLLEMGYQVRFGILEAGAYGVSQSRKRAFIWAAAPEEVLPEWPEPMHVFGVP 1322
Query: 1342 ELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQK 1401
+LKISLS YAAV STA GAPFRPITVRDTIGDLP+V NG S T EYK SWFQK
Sbjct: 1323 KLKISLSQGLHYAAVRSTALGAPFRPITVRDTIGDLPSVENGDSRTNKEYKEVAVSWFQK 1382
Query: 1402 KIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLP 1461
+IRG+ + L DHI K MNELNLIRC+ IP RPG+DW DLP KV LS G++ ++IP+CLP
Sbjct: 1383 EIRGNTIALTDHICKAMNELNLIRCKLIPTRPGADWHDLPKRKVTLSDGRVEEMIPFCLP 1442
Query: 1462 NTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPD 1521
NTA+RHN WKGL+GRLDW+GNFPTS+TDPQPMGKVGMCFHPEQ RILTVRECARSQGFPD
Sbjct: 1443 NTAERHNGWKGLYGRLDWQGNFPTSVTDPQPMGKVGMCFHPEQHRILTVRECARSQGFPD 1502
Query: 1522 SYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSS 1546
SY+FAGNI HKHRQIGNAVPPPLA+ALGRKL+EA+ +K S
Sbjct: 1503 SYEFAGNINHKHRQIGNAVPPPLAFALGRKLKEALHLKKS 1529
BLAST of HG10006921 vs. TAIR 10
Match:
AT4G14140.1 (DNA methyltransferase 2 )
HSP 1 Score: 1749.2 bits (4529), Expect = 0.0e+00
Identity = 889/1532 (58.03%), Postives = 1128/1532 (73.63%), Query Frame = 0
Query: 26 EVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAVEDEIA 85
E K +++ RS + V + PKRAAAC++FK K++++ +KS +E K+E + +EI
Sbjct: 4 ETKAGKQKKRSVDSDDDVSKERRPKRAAACTNFKEKSLRISDKSETVEAKKEQILAEEIV 63
Query: 86 AVGLTTS--GQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTPDKEK 145
A+ LT+S DD RPNRRLT+F HD +G PQ VEMLE+ D+FI G++LP D +K++
Sbjct: 64 AIQLTSSLESNDDPRPNRRLTDFVLHDSEGVPQPVEMLELGDIFIEGVVLPLGD--EKKE 123
Query: 146 NKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKANACV 205
KGVR + FGR+E+W ISGYEDGSP IWIST +ADYDC +P+ YKKLY+ F+EKA ACV
Sbjct: 124 EKGVRFQSFGRVENWNISGYEDGSPVIWISTALADYDCRKPSKKYKKLYDYFFEKACACV 183
Query: 206 EVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFILLQGEFIYNQLIGLD 265
EV+K L+K NPD +L+ELLA V RS++ S+ F +G ++++F++ QGEFIYNQL GLD
Sbjct: 184 EVFKSLSK----NPDTSLDELLAAVSRSMSGSKIFSSGGAIQEFVISQGEFIYNQLAGLD 243
Query: 266 DTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPPNIPGF 325
+T+K ++ F + VL +LRD + N + A +N+ L+I + + + ++
Sbjct: 244 ETAKNHETCFVENRVLVSLRDH---ESNKIHKA-----LSNVALRIDESKVVTSDHL--V 303
Query: 326 GAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTS---SNKFYIKINEDEIANDYPLPAFY 385
EDED+K AKL+QEEEY +S ++ +N+RS+T+ S++FYIKI+EDEIA+DYPLP++Y
Sbjct: 304 DGAEDEDVKYAKLIQEEEYRKSMERSRNKRSSTTSGGSSRFYIKISEDEIADDYPLPSYY 363
Query: 386 KTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCDDIDVTI 445
K TK+E DE V+F+ ++ D DLP LHNW+LYNSDSR+ISLE+LPM+PC +IDVT+
Sbjct: 364 KNTKEETDELVLFEAGYEV-DTRDLPCRTLHNWTLYNSDSRMISLEVLPMRPCAEIDVTV 423
Query: 446 YGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGSSMVFISIRTD 505
+GSG++ DDGSGFCLD D+ S S Q + DG+ I+LS IKEWMIEFG+ M+F+++RTD
Sbjct: 424 FGSGVVAEDDGSGFCLD-DSESSTSTQSNDHDGMNIFLSQIKEWMIEFGAEMIFVTLRTD 483
Query: 506 MAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKISEFHKNNP 565
MAWYRLGKPSKQYAPW+ TV+KT R+ ISI +L +SR +KLS+A++IK++ +N+
Sbjct: 484 MAWYRLGKPSKQYAPWFGTVMKTVRVGISIFNMLMRESRVAKLSYANVIKRLCGLEENDK 543
Query: 566 AYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTKWLV-KKK 625
AYISS VERYVVVHGQIILQ F EYPD I++C F+T L+ KM++ HHTKW++ KKK
Sbjct: 544 AYISSKLLDVERYVVVHGQIILQLFEEYPDKDIKRCPFVTSLASKMQDIHHTKWIIKKKK 603
Query: 626 AVLKQEANMNPRASMKPVTSR-KAMPATTTRLINRIWGEFYSNYLPEDLKEADNNETKED 685
+L++ N+NPRA + PV SR KAM ATTTRL+NRIWGEFYS Y PE EA N E E+
Sbjct: 604 KILQKGKNLNPRAGIAPVVSRMKAMQATTTRLVNRIWGEFYSIYSPEVPSEAINAENVEE 663
Query: 686 ELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGEIVGTTSQGY 745
E EE E EDE EE + E+ +++ ++ P K K + + KWDGEI+G TS G
Sbjct: 664 EELEEVEEEDEN-EEDDPEENELEAVEIQNSPTPKKIKGISEDMEIKWDGEILGKTSAGE 723
Query: 746 PLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGRLVVRGLETV 805
PLY +A V GD+V VG V ++ D+ D I VE+M+E SN KM+HG+L+ RG ETV
Sbjct: 724 PLYGRAFVGGDVVVVGSAVILEVDDQDDTQLICFVEFMFESSNHSKMLHGKLLQRGSETV 783
Query: 806 LGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKADEERAEERK 865
LG AA ERE+FLTN+CL +L +I+ TV +E R WG+Q+RK N+ DK D RAEERK
Sbjct: 784 LGMAANERELFLTNECLTVQLKDIKGTVSLEIRSRLWGHQYRKENIDVDKLDRARAEERK 843
Query: 866 RRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-DIMKLHSSLMSF 925
GLP ++YCKSLY PE+G FF LP+ MGLG+G C SCK++E + E KL+ S F
Sbjct: 844 TNGLPTDYYCKSLYSPERGGFFSLPRNDMGLGSGFCSSCKIRENEEERSKTKLNDSKTGF 903
Query: 926 TYRGIDYSVNDYVYLAPHHFGTD--ERGIE--TFKGGRNVGLNAYVVCQLLGIESPKGSK 985
GI+Y D+VY+ P++ D ++G T K GRNVGL A+VVCQLL + + S+
Sbjct: 904 LSNGIEYHNGDFVYVLPNYITKDGLKKGSRRTTLKCGRNVGLKAFVVCQLLDVIVLEESR 963
Query: 986 QPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKKQDIP 1045
+ S S VK+ RF+RPEDIS EKAY SDI+ELYYS +T P AI+GKCEVRKK D+P
Sbjct: 964 KASKASFQVKLTRFYRPEDISEEKAYASDIQELYYSQDTYILPPEAIQGKCEVRKKSDMP 1023
Query: 1046 MANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKEGESV 1105
+ I DH FFCE YD G +K+ PAN+KL S+ + + R+KKGK E +
Sbjct: 1024 LCREYPILDHIFFCEVFYDSSTGYLKQFPANMKLKFSTIK--DETLLREKKGKGVETGTS 1083
Query: 1106 PDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSLNHPE 1165
++P E LATLDIFAGCGGLS GL+ AGVS TKWAIEYEEPAG AF NHPE
Sbjct: 1084 SGMLMKPDEVPKEKPLATLDIFAGCGGLSHGLENAGVSTTKWAIEYEEPAGHAFKQNHPE 1143
Query: 1166 ALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQVEFINGGPP 1225
A FV+NCNVILRA+M+ CGD DDC+ST EA E+A KLD+ + + LP PGQV+FINGGPP
Sbjct: 1144 ATVFVDNCNVILRAIMEKCGDVDDCVSTVEAAELAAKLDENQKSTLPLPGQVDFINGGPP 1203
Query: 1226 CQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFRLTLA 1285
CQGFSGMNRF+ +WSKVQCEMILAFLSFA+YFRPKYFLLENV+ FV++NKG+TF+LT+A
Sbjct: 1204 CQGFSGMNRFSHGSWSKVQCEMILAFLSFADYFRPKYFLLENVKKFVTYNKGRTFQLTMA 1263
Query: 1286 SLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSPELKISLSG 1345
SLLEMGYQVRFGILEAG YG+SQ RKR IWAASPEE+LPEWPEPMHVF +P KISL
Sbjct: 1264 SLLEMGYQVRFGILEAGTYGVSQPRKRVIIWAASPEEVLPEWPEPMHVFDNPGSKISLPR 1323
Query: 1346 NTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQKKIRGDVLV 1405
RY A +T GAPFR ITVRDTIGDLP V NG S EY + P SWFQKKIRG++ V
Sbjct: 1324 GLRYDAGCNTKFGAPFRSITVRDTIGDLPPVENGESKINKEYGTTPASWFQKKIRGNMSV 1383
Query: 1406 LNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLPNTAKRHNQ 1465
L DHI K +NELNLIRC+KIPKRPG+DWRDLPDE V LS G + L P L TAK HN+
Sbjct: 1384 LTDHICKGLNELNLIRCKKIPKRPGADWRDLPDENVTLSNGLVEKLRPLALSKTAKNHNE 1443
Query: 1466 WKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKFAGNI 1525
WKGL+GRLDW+GN P SITDPQPMGKVGMCFHPEQDRI+TVRECARSQGFPDSY+F+G
Sbjct: 1444 WKGLYGRLDWQGNLPISITDPQPMGKVGMCFHPEQDRIITVRECARSQGFPDSYEFSGTT 1503
Query: 1526 QHKHRQIGNAVPPPLAYALGRKLREAIEMKSS 1546
+HKHRQIGNAVPPPLA+ALGRKL+EA+ +KSS
Sbjct: 1504 KHKHRQIGNAVPPPLAFALGRKLKEALYLKSS 1514
BLAST of HG10006921 vs. TAIR 10
Match:
AT4G08990.1 (DNA (cytosine-5-)-methyltransferase family protein )
HSP 1 Score: 1735.7 bits (4494), Expect = 0.0e+00
Identity = 886/1535 (57.72%), Postives = 1127/1535 (73.42%), Query Frame = 0
Query: 26 EVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAVEDEIA 85
E K +++ RS V + PKRAAAC +FK K +++ +KS +E K+E V +EI
Sbjct: 2 ETKVGKQKKRSVDSNDDVSKERRPKRAAACRNFKEKPLRISDKSETVEAKKEQNVVEEIV 61
Query: 86 AVGLTTS--GQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTPDKEK 145
A+ LT+S DD RPNRRLT+F H+ DG PQ VEMLE+ D+F+ G++LP D DK +
Sbjct: 62 AIQLTSSLESNDDPRPNRRLTDFVLHNSDGVPQPVEMLELGDIFLEGVVLPLGD--DKNE 121
Query: 146 NKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKANACV 205
KGVR + FGR+E+W ISGYEDGSP IWIST +ADYDC +PA YKK+Y+ F+EKA ACV
Sbjct: 122 EKGVRFQSFGRVENWNISGYEDGSPGIWISTALADYDCRKPASKYKKIYDYFFEKACACV 181
Query: 206 EVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFILLQGEFIYNQLIGLD 265
EV+K L+K NPD +L+ELLA V RS++ S+ F +G ++++F++ QGEFIYNQL GLD
Sbjct: 182 EVFKSLSK----NPDTSLDELLAAVARSMSGSKIFSSGGAIQEFVISQGEFIYNQLAGLD 241
Query: 266 DTSKKNDQIFTDLPVLCALRD-ESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPPNIPG 325
+T+K ++ F + VL +LRD ES K L +N+ L+I + + + ++
Sbjct: 242 ETAKNHETCFVENSVLVSLRDHESSKIHKAL---------SNVALRIDESQLVKSDHL-- 301
Query: 326 FGAEEDEDLKLAKLLQEEEYWRSAKQRKNQRS--TTSSNKFYIKINEDEIANDYPLPAFY 385
E ED++ AKL+QEEEY S ++ +N+RS T++SNKFYIKINE EIANDYPLP++Y
Sbjct: 302 VDGAEAEDVRYAKLIQEEEYRISMERSRNKRSSTTSASNKFYIKINEHEIANDYPLPSYY 361
Query: 386 KTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCDDIDVTI 445
K TK+E DE ++F+ ++ D DLP LHNW+LYNSDSR+ISLE+LPM+PC +IDVT+
Sbjct: 362 KNTKEETDELLLFEPGYEV-DTRDLPCRTLHNWALYNSDSRMISLEVLPMRPCAEIDVTV 421
Query: 446 YGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGSSMVFISIRTD 505
+GSG++ DDGSGFCLD D+ S S Q DG+ I+LS IKEWMIEFG+ M+F+++RTD
Sbjct: 422 FGSGVVAEDDGSGFCLD-DSESSTSTQSNVHDGMNIFLSQIKEWMIEFGAEMIFVTLRTD 481
Query: 506 MAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKISEFHKNNP 565
MAWYRLGKPSKQYAPW+ETV+KT R+AISI +L +SR +KLS+A++IK++ +N+
Sbjct: 482 MAWYRLGKPSKQYAPWFETVMKTVRVAISIFNMLMRESRVAKLSYANVIKRLCGLEENDK 541
Query: 566 AYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTKWLVK-KK 625
AYISS VERYVVVHGQIILQ F EYPD I++C F+TGL+ KM++ HHTKW++K KK
Sbjct: 542 AYISSKLLDVERYVVVHGQIILQLFEEYPDKDIKRCPFVTGLASKMQDIHHTKWIIKRKK 601
Query: 626 AVLKQEANMNPRASMKPVTSR-KAMPATTTRLINRIWGEFYSNYLPEDLKEADNNETKED 685
+L++ N+NPRA + V +R K M ATTTRL+NRIWGEFYS Y PE EA +E +E+
Sbjct: 602 KILQKGKNLNPRAGLAHVVTRMKPMQATTTRLVNRIWGEFYSIYSPEVPSEA-IHEVEEE 661
Query: 686 ELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGEIVGTTSQGY 745
E+EE+EE ++ E +++EEE +V K P K ++ + + KW+GEI+G TS G
Sbjct: 662 EIEEDEEEDENEEDDIEEEAVEVQ---KSHTP--KKSRGNSEDMEIKWNGEILGETSDGE 721
Query: 746 PLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGRLVVRGLETV 805
PLY +A+V G+ VAVG V ++ D+ + PAIY VE+M+E S+ KM+HG+L+ RG ETV
Sbjct: 722 PLYGRALVGGETVAVGSAVILEVDDPDETPAIYFVEFMFESSDQCKMLHGKLLQRGSETV 781
Query: 806 LGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKADEERAEERK 865
+G AA ERE+FLTN+CL L +I+ TV ++ RPWG+Q+RK N+ DK D RAEERK
Sbjct: 782 IGTAANERELFLTNECLTVHLKDIKGTVSLDIRSRPWGHQYRKENLVVDKLDRARAEERK 841
Query: 866 RRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-DIMKLHSSLMSF 925
GLP E+YCKSLY PE+G FF LP+ +GLG+G C SCK+KE + E KL+ S
Sbjct: 842 ANGLPTEYYCKSLYSPERGGFFSLPRNDIGLGSGFCSSCKIKEEEEERSKTKLNISKTGV 901
Query: 926 TYRGIDYSVNDYVYLAPHHF-------GTDERGIETFKGGRNVGLNAYVVCQLLGIESPK 985
GI+Y D+VY+ P++ GT R T K GRNVGL A+VVCQLL + +
Sbjct: 902 FSNGIEYYNGDFVYVLPNYITKDGLKKGTSRR--TTLKCGRNVGLKAFVVCQLLDVIVLE 961
Query: 986 GSKQPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKKQ 1045
S++ S S VK+ RF+RPEDIS EKAY SDI+ELYYS +T P A++GKCEVRKK
Sbjct: 962 ESRKASNASFQVKLTRFYRPEDISEEKAYASDIQELYYSHDTYILPPEALQGKCEVRKKN 1021
Query: 1046 DIPMANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKEG 1105
D+P+ I DH FFCE YD G +K+ PAN+KL S+ + + R+KKGK E
Sbjct: 1022 DMPLCREYPILDHIFFCEVFYDSSTGYLKQFPANMKLKFSTIK--DETLLREKKGKGVET 1081
Query: 1106 ESVPDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSLN 1165
+ ++P E RLATLDIFAGCGGLS GL++AGVS TKWAIEYEEPAG AF N
Sbjct: 1082 GTSSGILMKPDEVPKEMRLATLDIFAGCGGLSHGLEKAGVSNTKWAIEYEEPAGHAFKQN 1141
Query: 1166 HPEALTFVNNCNVILRAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQVEFING 1225
HPEA FV+NCNVILRA+M+ CGD DDC+ST EA E+ KLD+ + + LP PGQ +FI+G
Sbjct: 1142 HPEATVFVDNCNVILRAIMEKCGDVDDCVSTVEAAELVAKLDENQKSTLPLPGQADFISG 1201
Query: 1226 GPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQTFRL 1285
GPPCQGFSGMNRF+ +WSKVQCEMILAFLSFA+YFRPKYFLLENV+ FV++NKG+TF+L
Sbjct: 1202 GPPCQGFSGMNRFSDGSWSKVQCEMILAFLSFADYFRPKYFLLENVKKFVTYNKGRTFQL 1261
Query: 1286 TLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSPELKIS 1345
T+ASLLE+GYQVRFGILEAG YG+SQ RKR IWAASPEE+LPEWPEPMHVF +P KIS
Sbjct: 1262 TMASLLEIGYQVRFGILEAGTYGVSQPRKRVIIWAASPEEVLPEWPEPMHVFDNPGSKIS 1321
Query: 1346 LSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQKKIRGD 1405
L Y V +T GAPFR ITVRDTIGDLP V NG S EY++ P SWFQKKIRG+
Sbjct: 1322 LPRGLHYDTVRNTKFGAPFRSITVRDTIGDLPLVENGESKINKEYRTTPVSWFQKKIRGN 1381
Query: 1406 VLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLPNTAKR 1465
+ VL DHI K +NELNLIRC+KIPKRPG+DWRDLPDE V LS G + L P L TAK
Sbjct: 1382 MSVLTDHICKGLNELNLIRCKKIPKRPGADWRDLPDENVTLSNGLVEKLRPLALSKTAKN 1441
Query: 1466 HNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSYKFA 1525
HN+WKGL+GRLDW+GN P SITDPQPMGKVGMCFHPEQDRI+TVRECARSQGFPDSY+F+
Sbjct: 1442 HNEWKGLYGRLDWQGNLPISITDPQPMGKVGMCFHPEQDRIITVRECARSQGFPDSYEFS 1501
Query: 1526 GNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSS 1546
G +HKHRQIGNAVPPPLA+ALGRKL+EA+ +KSS
Sbjct: 1502 GTTKHKHRQIGNAVPPPLAFALGRKLKEALYLKSS 1507
BLAST of HG10006921 vs. TAIR 10
Match:
AT4G14140.2 (DNA methyltransferase 2 )
HSP 1 Score: 1667.1 bits (4316), Expect = 0.0e+00
Identity = 864/1558 (55.46%), Postives = 1111/1558 (71.31%), Query Frame = 0
Query: 26 EVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAVEDEIA 85
E K +++ RS + V + PKRAAAC++FK K++++ +KS +E K+E + +EI
Sbjct: 4 ETKAGKQKKRSVDSDDDVSKERRPKRAAACTNFKEKSLRISDKSETVEAKKEQILAEEIV 63
Query: 86 AVGLTTS--GQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTPDKEK 145
A+ LT+S DD RPNRRLT+F HD +G PQ VEMLE+ D+FI G++LP D +K++
Sbjct: 64 AIQLTSSLESNDDPRPNRRLTDFVLHDSEGVPQPVEMLELGDIFIEGVVLPLGD--EKKE 123
Query: 146 NKGVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKANACV 205
KGVR + FGR+E+W ISGYEDGSP IWIST +ADYDC +P+ YKKLY+ F+EKA ACV
Sbjct: 124 EKGVRFQSFGRVENWNISGYEDGSPVIWISTALADYDCRKPSKKYKKLYDYFFEKACACV 183
Query: 206 EVYKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFILLQGEFIYNQLIGLD 265
EV+K L+K NPD +L+ELLA V RS++ S+ F +G ++++F++ QGEFIYNQL GLD
Sbjct: 184 EVFKSLSK----NPDTSLDELLAAVSRSMSGSKIFSSGGAIQEFVISQGEFIYNQLAGLD 243
Query: 266 DTSKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPPNIPGF 325
+T+K ++ F + VL +LRD + N + A +N+ L+I + + + ++
Sbjct: 244 ETAKNHETCFVENRVLVSLRDH---ESNKIHKA-----LSNVALRIDESKVVTSDHL--V 303
Query: 326 GAEEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTS---SNKFYIKINEDEIANDYPLPAFY 385
EDED+K AKL+QEEEY +S ++ +N+RS+T+ S++FYIKI+EDEIA+DYPLP++Y
Sbjct: 304 DGAEDEDVKYAKLIQEEEYRKSMERSRNKRSSTTSGGSSRFYIKISEDEIADDYPLPSYY 363
Query: 386 KTTKDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCDDIDVTI 445
K TK+E DE V+F+ ++ D DLP LHNW+LYNSDSR+ISLE+LPM+PC +IDVT+
Sbjct: 364 KNTKEETDELVLFEAGYEV-DTRDLPCRTLHNWTLYNSDSRMISLEVLPMRPCAEIDVTV 423
Query: 446 YGSGIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGSSMVFISIRTD 505
+GSG++ DDGSGFCLD D+ S S Q + DG+ I+LS IKEWMIEFG+ M+F+++RTD
Sbjct: 424 FGSGVVAEDDGSGFCLD-DSESSTSTQSNDHDGMNIFLSQIKEWMIEFGAEMIFVTLRTD 483
Query: 506 MAWYRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKISEFHKNNP 565
MAWYRLGKPSKQYAPW+ TV+KT R+ ISI +L +SR +KLS+A++IK++ +N+
Sbjct: 484 MAWYRLGKPSKQYAPWFGTVMKTVRVGISIFNMLMRESRVAKLSYANVIKRLCGLEENDK 543
Query: 566 AYISSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTKWLV-KKK 625
AYISS VERYVVVHGQIILQ F EYPD I++C F+T L+ KM++ HHTKW++ KKK
Sbjct: 544 AYISSKLLDVERYVVVHGQIILQLFEEYPDKDIKRCPFVTSLASKMQDIHHTKWIIKKKK 603
Query: 626 AVLKQEANMNPRASMKPVTSR-KAMPATTTRLINRIWGEFYSNYLPEDLKEADNNETKED 685
+L++ N+NPRA + PV SR KAM ATTTRL+NRIWGEFYS Y PE EA N E E+
Sbjct: 604 KILQKGKNLNPRAGIAPVVSRMKAMQATTTRLVNRIWGEFYSIYSPEVPSEAINAENVEE 663
Query: 686 ELEEEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGEIVGTTSQGY 745
E EE E EDE EE + E+ +++ ++ P K K + + KWDGEI+G TS G
Sbjct: 664 EELEEVEEEDEN-EEDDPEENELEAVEIQNSPTPKKIKGISEDMEIKWDGEILGKTSAGE 723
Query: 746 PLYKQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGRLVVRGLETV 805
PLY +A V GD+V VG V ++ D+ D I VE+M+E SN KM+HG+L+ RG ETV
Sbjct: 724 PLYGRAFVGGDVVVVGSAVILEVDDQDDTQLICFVEFMFESSNHSKMLHGKLLQRGSETV 783
Query: 806 LGNAAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKADEERAEERK 865
LG AA ERE+FLTN+CL +L +I+ TV +E R WG+Q+RK N+ DK D RAEERK
Sbjct: 784 LGMAANERELFLTNECLTVQLKDIKGTVSLEIRSRLWGHQYRKENIDVDKLDRARAEERK 843
Query: 866 RRGLPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-DIMKLHSSLMSF 925
GLP ++YCKSLY PE+G FF LP+ MGLG+G C SCK++E + E KL+ S F
Sbjct: 844 TNGLPTDYYCKSLYSPERGGFFSLPRNDMGLGSGFCSSCKIRENEEERSKTKLNDSKTGF 903
Query: 926 TYRGIDYSVNDYVYLAPHHFGTD--ERGIE--TFKGGRNVGLNAYVVCQLLGIESPKGSK 985
GI+Y D+VY+ P++ D ++G T K GRNVGL A+VVCQLL + + S+
Sbjct: 904 LSNGIEYHNGDFVYVLPNYITKDGLKKGSRRTTLKCGRNVGLKAFVVCQLLDVIVLEESR 963
Query: 986 QPSPVSTMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKKQDIP 1045
+ S S VK+ RF+RPEDIS EKAY SDI+ELYYS +T P AI+GKCEVRKK D+P
Sbjct: 964 KASKASFQVKLTRFYRPEDISEEKAYASDIQELYYSQDTYILPPEAIQGKCEVRKKSDMP 1023
Query: 1046 MANYPAIFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKEGESV 1105
+ I DH FFCE YD G +K+ PAN+KL S+ + + R+KKGK E +
Sbjct: 1024 LCREYPILDHIFFCEVFYDSSTGYLKQFPANMKLKFSTIK--DETLLREKKGKGVETGTS 1083
Query: 1106 PDETESKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSV-------------TKWAIEYE 1165
++P E LATLDIFAGCGGLS GL+ AG+ + K I+
Sbjct: 1084 SGMLMKPDEVPKEKPLATLDIFAGCGGLSHGLENAGMYLYSHVMHILLSSKHLKTFIKMH 1143
Query: 1166 EPAGEAFSLNHPEA-----LTFVNNCNVILR--------AVMKACGDADDCISTSEAIEV 1225
+ + L + L N ++ + A+M+ CGD DDC+ST EA E+
Sbjct: 1144 VLCNKVYLLQSGRSSMKSQLVMRLNKTILKQRFLLTTAMAIMEKCGDVDDCVSTVEAAEL 1203
Query: 1226 AEKLDDKEINNLPRPGQVEFINGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFR 1285
A KLD+ + + LP PGQV+FINGGPPCQGFSGMNRF+ +WSKVQCEMILAFLSFA+YFR
Sbjct: 1204 AAKLDENQKSTLPLPGQVDFINGGPPCQGFSGMNRFSHGSWSKVQCEMILAFLSFADYFR 1263
Query: 1286 PKYFLLENVRNFVSFNKGQTFRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAAS 1345
PKYFLLENV+ FV++NKG+TF+LT+ASLLEMGYQVRFGILEAG YG+SQ RKR IWAAS
Sbjct: 1264 PKYFLLENVKKFVTYNKGRTFQLTMASLLEMGYQVRFGILEAGTYGVSQPRKRVIIWAAS 1323
Query: 1346 PEEILPEWPEPMHVFVSPELKISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNG 1405
PEE+LPEWPEPMHVF +P KISL RY A +T GAPFR ITVRDTIGDLP V NG
Sbjct: 1324 PEEVLPEWPEPMHVFDNPGSKISLPRGLRYDAGCNTKFGAPFRSITVRDTIGDLPPVENG 1383
Query: 1406 ASVTTMEYKSEPTSWFQKKIRGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDE 1465
S EY + P SWFQKKIRG++ VL DHI K +NELNLIRC+KIPKRPG+DWRDLPDE
Sbjct: 1384 ESKINKEYGTTPASWFQKKIRGNMSVLTDHICKGLNELNLIRCKKIPKRPGADWRDLPDE 1443
Query: 1466 KVRLSTGQMHDLIPWCLPNTAKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPE 1525
V LS G + L P L TAK HN+WKGL+GRLDW+GN P SITDPQPMGKVGMCFHPE
Sbjct: 1444 NVTLSNGLVEKLRPLALSKTAKNHNEWKGLYGRLDWQGNLPISITDPQPMGKVGMCFHPE 1503
Query: 1526 QDRILTVRECARSQGFPDSYKFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKSS 1546
QDRI+TVRECARSQGFPDSY+F+G +HKHRQIGNAVPPPLA+ALGRKL+EA+ +KSS
Sbjct: 1504 QDRIITVRECARSQGFPDSYEFSGTTKHKHRQIGNAVPPPLAFALGRKLKEALYLKSS 1540
BLAST of HG10006921 vs. TAIR 10
Match:
AT4G13610.1 (DNA (cytosine-5-)-methyltransferase family protein )
HSP 1 Score: 1512.3 bits (3914), Expect = 0.0e+00
Identity = 810/1537 (52.70%), Postives = 1028/1537 (66.88%), Query Frame = 0
Query: 26 EVKPKQKRNRSETGEQAVGVRKMPKRAAACSDFKVKTIQLPEKSSVIECKREVAVEDEIA 85
+ K +++ RS + V + PKRA + ++FK K+++ EK +E K+E V D
Sbjct: 2 KTKAGKQKKRSVDSDDDVSRERRPKRATSGTNFKEKSLRFSEKYETVEAKKEQIVGD--- 61
Query: 86 AVGLTTSGQDDSRPNRRLTEFTFHDEDGKPQAVEMLEVNDLFISGIILPFEDTPDKEKNK 145
D+++ K
Sbjct: 62 ------------------------------------------------------DEKEEK 121
Query: 146 GVRCEGFGRIESWTISGYEDGSPTIWISTDVADYDCVRPAGGYKKLYNIFYEKANACVEV 205
GVR + FGR+E+WTISGYEDGSP IWIST +ADYDC +P+ YKKLY+ F+EKA ACVEV
Sbjct: 122 GVRFQSFGRVENWTISGYEDGSPVIWISTVIADYDCRKPSKKYKKLYDYFFEKACACVEV 181
Query: 206 YKKLAKTSGGNPDLTLEELLAGVVRSLNSSRNFPAGMSVKDFILLQGEFIYNQLIGLDDT 265
K L+ NPD +L+ELLA VVRS+N + F +G +++F++ QGEFIYNQL GLD+T
Sbjct: 182 CKNLST----NPDTSLKELLAAVVRSMNGRKIFSSGGVIQEFVISQGEFIYNQLAGLDET 241
Query: 266 SKKNDQIFTDLPVLCALRDESRKQGNLLPNAGTFDGFTNLGLKIKDGEQLNPPNIPGFGA 325
SK ++ F D VL +LRDESRK F+N+ L+I + + L + G
Sbjct: 242 SKNHETKFVDNRVLVSLRDESRK---------IHKAFSNVALRIDESKVLTSDQLMDGG- 301
Query: 326 EEDEDLKLAKLLQEEEYWRSAKQRKNQRSTTSS--NKFYIKINEDEIANDYPLPAFYKTT 385
EDEDLK AKLLQEEE+ +S + +N+RS+T+S NKFYIKINEDEIA+DYPLP++YK T
Sbjct: 302 -EDEDLKYAKLLQEEEHMKSMDRSRNKRSSTTSAPNKFYIKINEDEIAHDYPLPSYYKNT 361
Query: 386 KDEMDEYVIFDGDMDICDPDDLPRSMLHNWSLYNSDSRLISLELLPMKPCDDIDVTIYGS 445
KDE DE V+F+ + D +LP LHNW+LYNSD LISLE LPMKPC DIDVT
Sbjct: 362 KDETDELVLFNAGYAV-DARNLPCRTLHNWALYNSDLMLISLEFLPMKPCADIDVT---- 421
Query: 446 GIMTADDGSGFCLDADTSQSCSAQMQNTDGIPIYLSAIKEWMIEFGSSMVFISIRTDMAW 505
YL IKEW I+FG M+F+ +RTDMAW
Sbjct: 422 ---------------------------------YLGQIKEWKIDFGEDMIFVLLRTDMAW 481
Query: 506 YRLGKPSKQYAPWYETVLKTARLAISIITLLKEQSRASKLSFAHIIKKISEFHKNNPAYI 565
YRLGKPS+QYAPW+E +LKT R+ SI+ LLK ++R +KLS+ +IK++ +N+ AYI
Sbjct: 482 YRLGKPSEQYAPWFEPILKTVRIGTSILALLKNETRMAKLSYTDVIKRLCGLEENDQAYI 541
Query: 566 SSTPSVVERYVVVHGQIILQTFSEYPDDMIRKCAFITGLSDKMEERHHTKWLVKKK-AVL 625
SST VERYV+VHGQIILQ +E PD+ I++C F+TGL+ KM++RHHTKW++KKK +L
Sbjct: 542 SSTFFDVERYVIVHGQIILQFLTECPDEYIKRCPFVTGLASKMQDRHHTKWIIKKKRKML 601
Query: 626 KQEANMNPRASMKP-VTSRKAMPATTTRLINRIWGEFYSNYLPEDLKEADNNETKEDELE 685
++ N+N R P V+ KAM ATTTRLINRIWGEFYS Y PED E E+E E
Sbjct: 602 QKGENLNLRRGKAPKVSKMKAMQATTTRLINRIWGEFYSIYSPEDPLEEIG---AEEEFE 661
Query: 686 EEEEVEDEELEEVEEEDGQVDLKTKESKPVVKPAKAKLSEGKNKWDGEIVGTTSQGYPLY 745
E E+VE+E +E EEED + +K + E + +W+GEI+G T G PLY
Sbjct: 662 EVEDVEEE--DENEEEDTIQKAIEVQKADTLKKIRGSCKEMEIRWEGEILGETCAGEPLY 721
Query: 746 KQAIVHGDLVAVGGFVSVDTDNVHDLPAIYLVEYMYEKSNARKMVHGRLVVRGLETVLGN 805
QA+V G + VGG V ++ D+ + P IY VEYM+E S+ K +HG+L+ RG ETVLG
Sbjct: 722 GQALVGGRKMDVGGAVILEVDDQGETPLIYFVEYMFESSDNSKKLHGKLLQRGSETVLGT 781
Query: 806 AAKEREVFLTNDCLDFELNEIRETVVVESCMRPWGYQHRKANVKKDKADEERAEERKRRG 865
AA ERE+FLTN+CL +L +I+ TV E RPWG+Q++K ++ DK D RAEERK +
Sbjct: 782 AANERELFLTNECLTVQLKDIKGTVSFEIRSRPWGHQYKKEHMAADKLDRARAEERKAKD 841
Query: 866 LPMEFYCKSLYWPEKGAFFCLPKETMGLGTGDCHSCKLKETQGE-DIMKLHSSLMSFTYR 925
LP+E+YCKSLY PEKG FF LP+ MGLG+G C SCK++E + E KL+ S F
Sbjct: 842 LPIEYYCKSLYSPEKGGFFSLPRSDMGLGSGFCSSCKIRENEEERSKTKLNDSKTRFLSN 901
Query: 926 GIDYSVNDYVYLAPHHFGTDE-RGIETFKGGRNVGLNAYVVCQLLGIESPKGSKQPSPVS 985
GI YSV D+VY P++ D + FK GRNVGL A+VVCQ+L I K K+ + S
Sbjct: 902 GIKYSVGDFVYQIPNYLSKDRGKRRPVFKYGRNVGLRAFVVCQILDIVDLKEPKKGNTTS 961
Query: 986 TMVKVRRFFRPEDISVEKAYCSDIRELYYSDETTTKPVSAIEGKCEVRKKQDIPMANYPA 1045
VKVRRF+RP+D+S E+AY SDI+E+YYS++T P AI+GKCEV KK D+P+
Sbjct: 962 FEVKVRRFYRPDDVSAEEAYASDIQEVYYSEDTYILPPEAIKGKCEVMKKTDMPLCREYP 1021
Query: 1046 IFDHFFFCEHLYDPEKGAIKKLPANVKLSSSSERQISDAAQRKKKGKCKEGESV-PDETE 1105
I DH +FC+ YD G +KKLP N+ L S+ + D R+KK + + PDE
Sbjct: 1022 ILDHVYFCDRFYDSSNGCLKKLPYNMMLKFSTIK--DDTLLREKKTETGSAMLLKPDE-- 1081
Query: 1106 SKKDLPLENRLATLDIFAGCGGLSEGLQQAGVSVTKWAIEYEEPAGEAFSLNHPEALTFV 1165
+P RLATLDIFAGCGGLS GL++AGVS TKWAIEYEEPA +AF NHP+ FV
Sbjct: 1082 ----VPKGKRLATLDIFAGCGGLSYGLEKAGVSDTKWAIEYEEPAAQAFKQNHPKTTVFV 1141
Query: 1166 NNCNVIL-----------RAVMKACGDADDCISTSEAIEVAEKLDDKEINNLPRPGQVEF 1225
+NCNVIL RA+M+ CGD DDCIST+EA E+A KLD+ + + LP PGQV+F
Sbjct: 1142 DNCNVILRISWLRLLINDRAIMEKCGDVDDCISTTEAAELATKLDENQKSTLPLPGQVDF 1201
Query: 1226 INGGPPCQGFSGMNRFNQSTWSKVQCEMILAFLSFAEYFRPKYFLLENVRNFVSFNKGQT 1285
I+GGPPCQGFS +NRF+ +WSK QC+MILAFLSFA+YFRPKYFLLENV+ FVSFN+G T
Sbjct: 1202 ISGGPPCQGFSRLNRFSDGSWSKNQCQMILAFLSFADYFRPKYFLLENVKTFVSFNEGHT 1261
Query: 1286 FRLTLASLLEMGYQVRFGILEAGAYGISQSRKRAFIWAASPEEILPEWPEPMHVFVSPEL 1345
F LT+ASLLEMGYQVRFG+LEAGAYGISQ RKRAFIWAA+P E+LPEWPEPMHVF +P
Sbjct: 1262 FHLTVASLLEMGYQVRFGLLEAGAYGISQPRKRAFIWAAAPNEVLPEWPEPMHVFNNPGF 1321
Query: 1346 KISLSGNTRYAAVPSTAGGAPFRPITVRDTIGDLPAVGNGASVTTMEYKSEPTSWFQKKI 1405
KI LS YAAV ST GAPFR ITVRD IGDLP + +G S E ++
Sbjct: 1322 KIPLSQGLHYAAVQSTKFGAPFRSITVRDAIGDLPPIESGESKINKE-----------EM 1381
Query: 1406 RGDVLVLNDHISKEMNELNLIRCQKIPKRPGSDWRDLPDEKVRLSTGQMHDLIPWCLPNT 1465
RG + VL DHI K+MNELNLIRC+KIPK PG+DWRDLPDE V LS G + +++P L N
Sbjct: 1382 RGSMTVLTDHICKKMNELNLIRCKKIPKTPGADWRDLPDEHVNLSNGIVKNIVPNLL-NK 1403
Query: 1466 AKRHNQWKGLFGRLDWEGNFPTSITDPQPMGKVGMCFHPEQDRILTVRECARSQGFPDSY 1525
AK HN +KGL+GRLDW GN PT IT+ QPMG VGMCFHP+QDRI++VRECARSQGFPDSY
Sbjct: 1442 AKDHNGYKGLYGRLDWHGNLPTCITNLQPMGLVGMCFHPDQDRIISVRECARSQGFPDSY 1403
Query: 1526 KFAGNIQHKHRQIGNAVPPPLAYALGRKLREAIEMKS 1545
KF+GNI+ KHRQ+GNAVPPPLA+ALGRKL+EA+ +++
Sbjct: 1502 KFSGNIKDKHRQVGNAVPPPLAFALGRKLKEALHLRN 1403
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_008464733.1 | 0.0e+00 | 95.42 | PREDICTED: DNA (cytosine-5)-methyltransferase 1B-like [Cucumis melo] >XP_0084647... | [more] |
KAA0065499.1 | 0.0e+00 | 95.39 | DNA (cytosine-5)-methyltransferase 1B-like [Cucumis melo var. makuwa] >TYK08739.... | [more] |
XP_038892294.1 | 0.0e+00 | 95.29 | DNA (cytosine-5)-methyltransferase 1B-like [Benincasa hispida] >XP_038892295.1 D... | [more] |
XP_011654447.1 | 0.0e+00 | 94.90 | DNA (cytosine-5)-methyltransferase 1B isoform X2 [Cucumis sativus] | [more] |
XP_011654446.1 | 0.0e+00 | 94.90 | DNA (cytosine-5)-methyltransferase 1B isoform X1 [Cucumis sativus] >KGN50781.2 h... | [more] |
Match Name | E-value | Identity | Description | |
B1Q3J6 | 0.0e+00 | 60.77 | DNA (cytosine-5)-methyltransferase 1B OS=Oryza sativa subsp. japonica OX=39947 G... | [more] |
P34881 | 0.0e+00 | 59.74 | DNA (cytosine-5)-methyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=DMT1 PE=... | [more] |
Q7Y1I7 | 0.0e+00 | 58.97 | DNA (cytosine-5)-methyltransferase 1A OS=Oryza sativa subsp. japonica OX=39947 G... | [more] |
O23273 | 0.0e+00 | 58.03 | DNA (cytosine-5)-methyltransferase 4 OS=Arabidopsis thaliana OX=3702 GN=MET4 PE=... | [more] |
Q9M0S8 | 0.0e+00 | 57.72 | DNA (cytosine-5)-methyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=MET2 PE=... | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3CM89 | 0.0e+00 | 95.42 | DNA (cytosine-5)-methyltransferase OS=Cucumis melo OX=3656 GN=LOC103502549 PE=3 ... | [more] |
A0A5A7VJ57 | 0.0e+00 | 95.39 | DNA (cytosine-5)-methyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E56... | [more] |
A0A0A0KPF1 | 0.0e+00 | 94.90 | DNA (cytosine-5)-methyltransferase OS=Cucumis sativus OX=3659 GN=Csa_5G002610 PE... | [more] |
A0A6J1EP47 | 0.0e+00 | 92.37 | DNA (cytosine-5)-methyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436438... | [more] |
A0A6J1EPN0 | 0.0e+00 | 92.37 | DNA (cytosine-5)-methyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436438... | [more] |