HG10011262 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10011262
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein SET DOMAIN GROUP 40-like
LocationChr01: 4111404 .. 4117232 (-)
RNA-Seq ExpressionHG10011262
SyntenyHG10011262
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGTACGCTCTGATTTCTCCTTTTCTAGTTACATCGCTCTTTTCTTTCTTTCGCCGCTTGTAAGTTGTTCATTTTGAGGGAAATTTTGCTTCAGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTCTTCCTTTGCTAACATAATTAGAACGAACTTTTGTGAGAAAAAGGATGCTTCCTTTTGTAGGTTTGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTATGCCTCGTAAGTTGGTCTTTGAGGGAGGGGACTATCAGATACATGAGGCACCTCCCTGTGCCTAATGAGGTAGGTTTTGAAAGTGATTACAGCCCAAAAAGTGATTACATTTCAAAGTAGGAGAGATATGGAATGGAAAATAGAAGGGATCTTTATACACGAGCGACCATAGGTATGCTTTTTTTTTTTTTTTCTTCCTGATAGAAACAACTGTCTTCATTTAGAAAAAAAAAATGAAAGAATACAAGGGCATACAAAAAACCAAGCCCACCAAAGACCAAAGACCAAACCTTGCTAGGGTTTCTCTCTACACCTTGAAACACTCTATTGTTCCTCTCCCCCCAAAGATTCCACAATAACGCACACACCCCTGCTAACCAAAGAAAACGACCTTTCTCTTTGAAGTGCGGATGGAGGAGGAACTTCCTGATGATCAACTGAACATCCCTCTGACGAGCAAGCAAAAAATCAAACGCTTGAAAGAAACCATTCCATACAGATCTCGCAAACTGACACTCCCATAGGAGGTGATCCATGTTTTCCTCCACCTTTCAACACAGGATACAACAAAAAGGACTCATTAAAGAAGATATCTTTCTCAAAATCCTATCCATCATGTTCACACAACCAAGCAAAACTTGCTTCGGAAAGAACTTAAACTTTCTTTGGAATTTTAATCCTCCATAAAACATCAAAGACTGACTCAACAAAAGGAGAGAGATCCAATAAAATCCAAAGAATGACTTACAAAAAAATCCCTCCAAAGGGTTAGGACTCCAAACACGAACATCCCTTCTCCCAAGCCTAAAGTTAAAAACCCTCAACAAGGAAAGAAGAGAAGCCACTTTCGTCGTTTCCCTATTGGACAACAGTCGACAGAAACCAAAGGAGAAAGAAACACAAGTTCCCCGACCAAACCAAAAAGTCCGACACGAAACAATTATTCAAGAAGGAAAAATGATGTAACCTAGAAAACACAGAAGAGAGAGGTCTATCCCATACCCAATGATCTTCCCAAAAATACGCTTCCTTACCATCTCCCACAACACAAAAAACAAGATGGAAAAAAGAAGGGAGCTGAATAGAACCACAAATTTCGGTGCGTGCCATTTAACCCCTCGCCACCCACTCACAGCGATGGGGACCATGTTTGCTCACAATAGTTCTCTGCCGTAGGTATGCTTAAAGATGTTTTATATTCCATTCTTTGTTAATGGTTAATCGTCCTTAATGCAACTGCGACAAAGATCTTAACTGATTTTAATTCTCTTTTATATTTAAGGTTTATCTAAAACATTTCTTGTGTGTTTATCTAAGACATTACCCATTTTCCCCTGTTAGTTCTAAAGGTGTACTAATATTAATTTCACAATCCCTCCCCCCCACTCAAATGCAGAAGTTGACTTTTTGTTTACTCTATGAGATTGGTAAAGGAACTAGTTCTTTGTGGCTCCCTTACTTAAAGCATTTGCCCCAGAGTTACGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTCAATTTTCCGTTTGTGGCATTTTTATTTATTTCTTAGGATTATATTCGTATTCGTGAACCATATTTAATTAGGATTTTTTACATGTGCTTTTGACATTTGTTTTATAATAAAATAGTCAAAGTCATTAACAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAATCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGTAAGCACACTGGATTCATGCTAGAAGAATGTATTCCATAGCTAGATTGGACCTGGATTTGTTTTTTTTTTTGGGTAAGAAACCAAACTTTCATTGAGAAAAAAATGAGAGAAAGAAGGCATATGAAAAACAGCCAACAAAAAAAGGAGGATCCCCAACTAACTACAAAGAGGGCTCCAATCTAACAAATGCCAAATTCATAATTACAAAAGAGTATTAACAGACATATACTTATTACAAAAAACCTGCGTTTAGGTTTCTATTTTTTTTTAATAATTAATTATTTATTTGTTTGGTGAAAATTTTATGCATCATTTGAAGGTGGTTTAGCTGGATGTCCAAGGTCACTGCAAATATGCTTTTCTTATAGAAGAAAACACCACCAAGTGACACCCATTTGCTGATTATCTATCTTCCATTACCCATTACCTAAGAAGCACGGATACGGATACAAGACATGGATACGACACGGACACGGTGACACGCCATATTTTAAATATCTAGGACACGATACGACAATGACACGTTTAATAAAATATACATTTTTAAAAATATATATCATTTTCATACTAGAATAAAATTAAAATAAATGGGTTGATGCATTTATATGCTTAAAAGACTTAACTTGATGTATTTCACACTCAAAAGTTATTATTATTCTCATATATGTGTCTTCTTGGTCTACTCAACAAGTGTTCAATGCATGTCTAACATATTTGTTGTACTAACAAGTGTCCGATACGTGTCCAACAAGTGTCAGAGTGTCCAAGTGTCTGACACGTGTCGGACATGGACATGCTAGCCAAATTAAAGTGTCCGTGCTTCTTAGCCCATTACCTTGTAGTCCTGTACACATTACTCATTACCAGCCAGTAAGCATCATGACTTCTATTCGATCTAATATGCTCCCTCTCCCATCCCACCTTGCCCTCCCTAAAGTCCTGACCCCCTCACTTAAGGGCCCCCTTAACTTAGGTATTCTTTGTCTCCTTGAAATCTATGGACTGTAACTACTGTCATTGAACTTTGGTGGTATATAAAGGGACCTATATTACTGAAGTTGATGGTGGATGTCTATAAATGAAGCAAACCAAAGCTCTCTAATATTGGAAAGGATAAGTTTTTCTTCATCTCTCGTCAATGGAGCACTCTCGTGGATTCAAAATTGCTATGTAGGCCTGCTCCTCTTCCCTCTTAATGGTTTTTTCCCCCAAGAATTCTATAGTGAGGATTATGTGTTTGCATTGAATAAATCTCCTATAGGAAACACTAAAGAGACTACTGTTGATATCATTAAGCTGAACTCAAGAGACACATTTTATTCCATGTAGGACAGAATTGCTCAGGTCAGTTCTCCTCCTCATGAAAGATTACCCTTGTGCCACCCTGCCTCTAATAGTGGACTGTTTTTCTTCTGACTTGTGAAGTGGAAGTTGCTTGCACAAAGGCTTTTGGATCAGCCCAAACAAGGATACTAGGATGTTCGTTTAAGTACGATTGAATTTAGTACTCACATCCCTTCAATTTTTCCCATAGCTTTTTGTTGCCATAGAATATAGAATTCATCCTCTTTTTTATGGATGAGAATTGTCGAGACTTGAGAGTTTGTTTTGTAATTTTCTCTTAGCCCGTCTTTTGGAGGATATATTATCGTCCTCACTTGTATCTATTTTTAATGTTAATGCAATTATTTGGTTCTTATAGAAAAACGAGTGTTTCTTTATTTGTCTTCATGTGTTGTATGATGACTCTAAATGGCATGGTTTGGTCCGATGTACATTTTTTCAATCCTAATCTCAGTTTGTTGAACTTCCTTCTACGTGTAGTAGTTAAAAACTAAAAATTTATGTTGCCCTTTGAGAGATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAAGAAGGGAGAGCAGGTATTTTATATTTTCTTGGTGTCATTTAATATTTTGTACTCGTGAAAAGTGGAGTTGATGCAAAATACATTAATTGGAAGTTTAACATTTTAGTTATTGAGTTTGATTTCCTATTATGAAGAGGGTTGAAATTTACATTAACCTTTTTTATTTGCTCTCTGATGCATTCTTGTTAATCTGCACCTTGAGATTTCCATAGTAAAACGAAACCAGAGATTGTTTCTTTGTGTCTTAGACTGTTATTGCACTGATGTTGAACTTTTTCTTTTTTCATCTTTTTATTTTCTGTTATTACTGTTACTTTTTCTGCCCCAAAGGTATTTTATTCTCATTATAGTTCGTAGGTTAGTTGCTTTTTTGGGTCTTATCCATCTCTCCAATTCTTTGTGTTACTAACACATCCATCAAACGTGACTCTATACCTCACCATTTTTTTTTAAAAAAAAAGCAACAGCTGAAAACGAATTAATTTGATGTTTTGGTGGGTAGCAATATTTTCAAAATGCAAGGCGCCCCCAAAGTGTCCCGTCTTTTGATTGTTTGTATATGAGTGACCTAGAAAATTAACACCTCAGCGAAGCAAAACCATACTTTATAAATTAATAAACAAATACAAAATATTTAAGACATGAATATCAATAAGTCCATTTAAAATTTCAAAAGAAAAAAAATGGTCTGTCTCTCAAACATAATTGCTTTGCATGAATCTTTCGTTCTCAAAAAAGAATGTGTGGATGAGTAACGATTGAAACCTTTCTATTTTTTACTGCAGGTTCTTTTAAGCTATGGTACATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTCCTATGGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCAACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCTTGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTCTAAACAATCTGCCAACATCAGTTGAAGAAGACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGACTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACCAATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAGCTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCTTTGGTTGATTGCATAAGTTACTGCACAAGAACTATTTGTTCTCTATCTTCTTAA

mRNA sequence

ATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAATCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTCCTATGGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCAACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCTTGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTCTAAACAATCTGCCAACATCAGTTGAAGAAGACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGACTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACCAATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAGCTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCTTTGGTTGATTGCATAAGTTACTGCACAAGAACTATTTGTTCTCTATCTTCTTAA

Coding sequence (CDS)

ATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAATCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTCCTATGGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCAACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCTTGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTCTAAACAATCTGCCAACATCAGTTGAAGAAGACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGACTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACCAATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAGCTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCTTTGGTTGATTGCATAAGTTACTGCACAAGAACTATTTGTTCTCTATCTTCTTAA

Protein sequence

METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS
Homology
BLAST of HG10011262 vs. NCBI nr
Match: XP_038896047.1 (protein SET DOMAIN GROUP 40 [Benincasa hispida])

HSP 1 Score: 821.6 bits (2121), Expect = 3.2e-234
Identity = 412/486 (84.77%), Postives = 426/486 (87.65%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           M TE SFGSLLRWAADHGISD VDQQTSHSCLG SLCVCFFPDAGGRGLGAVRQLNKGEL
Sbjct: 1   MGTEESFGSLLRWAADHGISDSVDQQTSHSCLGDSLCVCFFPDAGGRGLGAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           VLRVPKSVL TTQSLSLEDEKLA ALKRYPSLSST                         
Sbjct: 61  VLRVPKSVLFTTQSLSLEDEKLARALKRYPSLSSTQKLTFCLLYEIGKGTSSWWLPYLKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDY IWATEKAALKS  EWRGVKGLM+E NIKNQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYGIWATEKAALKSLMEWRGVKGLMEEFNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGES+D  DVS FSPHASLNGD+TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESIDGTDVSFFSPHASLNGDITT 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
           DELHEE+RDTQWALTDGGFEE+VSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL
Sbjct: 241 DELHEEQRDTQWALTDGGFEEDVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPNDKVFIP+EHDIY+SSSWPKESLY+HQNGNPSF+LLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPLEHDIYNSSSWPKESLYIHQNGNPSFSLLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           Y+GSQLS+KNEILVMQ LSKNC TVLNNLPTSVEEDNQLLCNICKIQDLQVPREL+KMLL
Sbjct: 361 YSGSQLSVKNEILVMQLLSKNCLTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELRKMLL 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           TYGGEF AFLETNG+VNR+EAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT
Sbjct: 421 TYGGEFSAFLETNGVVNRDEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 480

BLAST of HG10011262 vs. NCBI nr
Match: XP_022983189.1 (protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima])

HSP 1 Score: 799.7 bits (2064), Expect = 1.3e-227
Identity = 396/486 (81.48%), Postives = 420/486 (86.42%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           M TEGSF SLLRWAADHGISD VD+Q+SHSCLGRSLCVCFFPDAGGRGLGAVR L KGEL
Sbjct: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           VL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST                         
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDYA+W  EKAA KS TEWRGVKGLM+ESNIKNQLQTFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLD+MDVSSFS HASLNG++TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTYSNLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EH+IYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           YAGSQLS+KNE+LVMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQ P EL KMLL
Sbjct: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           T GGEFCAFLET GLVNR E ELHL+GKIKRSLERWKLAVQWR+LYKKALVDC SYCTRT
Sbjct: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRT 480

BLAST of HG10011262 vs. NCBI nr
Match: XP_008457031.1 (PREDICTED: protein SET DOMAIN GROUP 40 isoform X3 [Cucumis melo])

HSP 1 Score: 795.4 bits (2053), Expect = 2.5e-226
Identity = 390/447 (87.25%), Postives = 412/447 (92.17%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GG     RGL AVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  NKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEW 120
           NKGEL+LR PKSVLLTTQSLSLEDEKLAMALK +PSLSSTQVDYAIWATEKAALKSR +W
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQVDYAIWATEKAALKSRMDW 120

Query: 121 RGVKGLMQESNIKNQLQTFKAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGES 180
           RGVKGLMQESNIKNQLQTFKAWLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES
Sbjct: 121 RGVKGLMQESNIKNQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGES 180

Query: 181 LDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQ 240
            + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCFYARESYKKGEQ
Sbjct: 181 FNAMDVLSFPSHASLNDEL---ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQ 240

Query: 241 VLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFAL 300
           VLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY+HQNGNPSFAL
Sbjct: 241 VLLSYGTYTNIELLEYYGFLLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFAL 300

Query: 301 LSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQL 360
           LSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLNNLPTS+EED+QL
Sbjct: 301 LSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQL 360

Query: 361 LCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLA 420
           LCNI K+QDLQV REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLA
Sbjct: 361 LCNIAKVQDLQVQRELRKMLLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLA 420

Query: 421 VQWRLLYKKALVDCISYCTRTICSLSS 443
           VQWRLLYKKALVDCI YCTRTICSLSS
Sbjct: 421 VQWRLLYKKALVDCIGYCTRTICSLSS 444

BLAST of HG10011262 vs. NCBI nr
Match: KAG7017936.1 (Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 790.8 bits (2041), Expect = 6.1e-225
Identity = 393/486 (80.86%), Postives = 416/486 (85.60%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           M TE SF SLLRWAADHGISD VD+Q SHSCLGRSLCVCFFPDAGGRGLGAVR L KGEL
Sbjct: 1   MGTEESFESLLRWAADHGISDSVDKQCSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           VL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST                         
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDYA+W  EKAA KSR EWRGVKGLM+ES IKNQLQTFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSRAEWRGVKGLMEESIIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPE ES D+MDVSSFS HASLNG++TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEAESFDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTY+NLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYTNLELLQYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EHDIYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDRVFIPLEHDIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           YAGSQLS+KNE+LVMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQVPREL KM  
Sbjct: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQVPRELGKMPS 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           T GGEFCAFLETNGLVNR E EL L+GKIKRSLERWKLAVQWR+LYKKALVDCISYCTRT
Sbjct: 421 TVGGEFCAFLETNGLVNREETELQLTGKIKRSLERWKLAVQWRILYKKALVDCISYCTRT 480

BLAST of HG10011262 vs. NCBI nr
Match: XP_023528315.1 (protein SET DOMAIN GROUP 40 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 789.3 bits (2037), Expect = 1.8e-224
Identity = 391/486 (80.45%), Postives = 416/486 (85.60%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           M TE SF SLLRWAADHGISD  D+Q+SHSCLGRSLCVCFFPDAGGRGLGAVR L KGEL
Sbjct: 1   MGTEESFESLLRWAADHGISDSGDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           VL+VPKSVLLT QSLSL+DEKL+ ALKRYPSLSST                         
Sbjct: 61  VLKVPKSVLLTAQSLSLQDEKLSTALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDYA+W  EKAA KSR EWRGVKGLM+ESNIKNQLQTFKA
Sbjct: 121 LPTTYETLETFGEFEKQALQVDYALWEAEKAASKSRAEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPE ES D++DVSSFS HASLNG++TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEAESFDIIDVSSFSQHASLNGNITT 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
           D LH++++DTQ ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTYSNLELL+YYGFLL
Sbjct: 241 DGLHKDEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EHDIYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDRVFIPLEHDIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           YAGSQLS+KNE+LVMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQVPREL KML 
Sbjct: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQVPRELGKMLS 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           T GGEFCAFLETNGLVNR E EL L+GKIKRSLERWKLAVQWR+LYKKALVDCISYCTRT
Sbjct: 421 TVGGEFCAFLETNGLVNREETELQLTGKIKRSLERWKLAVQWRILYKKALVDCISYCTRT 480

BLAST of HG10011262 vs. ExPASy Swiss-Prot
Match: Q6NQJ8 (Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.1e-117
Identity = 238/482 (49.38%), Postives = 308/482 (63.90%), Query Frame = 0

Query: 9   SLLRWAADHGISDPVD-QQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKS 68
           + LRWAA+ GISD +D  +   SCLG SL V  FPDAGGRGLGA R+L KGELVL+VP+ 
Sbjct: 10  TFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVPRK 69

Query: 69  VLLTTQSLSLEDEKLAMALKRYPSLSSTQV--------------------------DY-- 128
            L+TT+S+  +D KL+ A+  + SLSSTQ+                          DY  
Sbjct: 70  ALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDL 129

Query: 129 ----------------AIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASAT 188
                           A+WATEKA  K ++EW+    LM+E  +K + ++F+AWLWASAT
Sbjct: 130 LATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWASAT 189

Query: 189 ISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEK 248
           ISSR L+VPWD AGCLCPVGDLFNY AP   S       S +   ++       E H E+
Sbjct: 190 ISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESAN---NVEEAGLVVETHSER 249

Query: 249 RDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDK 308
                 LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YGF+L+EN NDK
Sbjct: 250 ------LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 309

Query: 309 VFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 368
           VFIP+E  ++S +SSWPK+SLY+HQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 310 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 369

Query: 369 LSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGE 428
           +S+KNEILVM+W+S+ C +VL +LPTSV ED  LL NI K+QD ++  E QK    +G E
Sbjct: 370 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 429

Query: 429 FCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTIC 441
             AFL+ N L +    +   +  S K  R L +W+ +VQWRL YK+ L DCISYC   + 
Sbjct: 430 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of HG10011262 vs. ExPASy Swiss-Prot
Match: B7ZUF3 (Actin-histidine N-methyltransferase OS=Xenopus tropicalis OX=8364 GN=setd3 PE=2 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 1.1e-06
Identity = 95/429 (22.14%), Postives = 164/429 (38.23%), Query Frame = 0

Query: 4   EGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLR 63
           E  F  L+ W  ++G S    +               FP+  G GL A R++   EL L 
Sbjct: 76  EDYFPELMEWCKENGASTDGFELVE------------FPEE-GFGLKATREIKAEELFLW 135

Query: 64  VPKSVLLTTQS--------LSLEDEKL-AMA----------------------LKRYPS- 123
           VP+ +L+T +S        L  +D  L AM                       +K  P+ 
Sbjct: 136 VPRKLLMTVESAKGSVLGPLYSQDRILQAMGNITLAFHLLCERADPNSFWLPYIKTLPNE 195

Query: 124 --------------LSSTQVDYAIWATEKAALKSRTEWRGVKGLMQESN---IKNQLQTF 183
                         L STQ    +++  K   +    +  V      +N   +K+   TF
Sbjct: 196 YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYFYKVIQTHPNANKLPLKDSF-TF 255

Query: 184 KAWLWASATISSRALYVPWDEAG----CLCPVGDLFNYAAPEGESLDVMDVSSFSPHASL 243
             + WA +++ +R   +P ++       L P+ D+ N+                      
Sbjct: 256 DDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT--------------------- 315

Query: 244 NGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLE 303
           NG +TT    E+ R    AL D                 +K GEQ+ + YGT SN E + 
Sbjct: 316 NGLITTGYNLEDDRCECVALQD-----------------FKSGEQIYIFYGTRSNAEFVI 375

Query: 304 YYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLY-VHQNGNP-SFALLS 357
           + GF  + N +D+V I            M+ ++ + +  P  S++ +H    P S  LL+
Sbjct: 376 HNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHVTEPPISAQLLA 435

BLAST of HG10011262 vs. ExPASy Swiss-Prot
Match: Q7SXS7 (Actin-histidine N-methyltransferase OS=Danio rerio OX=7955 GN=setd3 PE=2 SV=1)

HSP 1 Score: 51.6 bits (122), Expect = 2.6e-05
Identity = 58/251 (23.11%), Postives = 98/251 (39.04%), Query Frame = 0

Query: 133 TFKAWLWASATISSRALYVPWDEAG----CLCPVGDLFNYAAPEGESLDVMDVSSFSPHA 192
           TF  + WA +++ +R   +P  +       L P+ D+ N+                    
Sbjct: 240 TFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCNHT------------------- 299

Query: 193 SLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLEL 252
             NG +TT    E+ R    AL D                 YK+GEQ+ + YGT SN E 
Sbjct: 300 --NGLITTGYNLEDDRCECVALKD-----------------YKEGEQIYIFYGTRSNAEF 359

Query: 253 LEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--SFAL 312
           + + GF  ++N +D+V I            M+ ++ + +  P  S++      P  S  L
Sbjct: 360 VIHNGFFFEDNAHDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCSEPPISAQL 419

Query: 313 LSALRLWATHPNKRRG----------VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNL 357
           L+ LR++     + R           +  L      +S +NEI +  +L      +L   
Sbjct: 420 LAFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTY 452

BLAST of HG10011262 vs. ExPASy Swiss-Prot
Match: B0VX69 (Actin-histidine N-methyltransferase OS=Callithrix jacchus OX=9483 GN=SETD3 PE=3 SV=2)

HSP 1 Score: 50.8 bits (120), Expect = 4.5e-05
Identity = 93/440 (21.14%), Postives = 163/440 (37.05%), Query Frame = 0

Query: 4   EGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLR 63
           E  F  L++WA+++G S    +  +                 G GL A R +   EL L 
Sbjct: 76  EDYFPDLMKWASENGASVEGFEMVNFK-------------EEGFGLRATRDIKAEELFLW 135

Query: 64  VPKSVLLTTQS--------LSLEDE------KLAMA-----------------LKRYPS- 123
           VP+ +L+T +S        L  +D        +A+A                 ++  PS 
Sbjct: 136 VPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLPSE 195

Query: 124 --------------LSSTQVDYAIWATEKAALKSRTEWRGVKGLMQESN---IKNQLQTF 183
                         L STQ  + +++  K   +    +  V      +N   +K+   T+
Sbjct: 196 YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSF-TY 255

Query: 184 KAWLWASATISSRALYVPWDEAG----CLCPVGDLFNYAAPEGESLDVMDVSSFSPHASL 243
           + + WA +++ +R   +P ++       L P+ D+ N+                      
Sbjct: 256 EDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT--------------------- 315

Query: 244 NGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLE 303
           NG +TT    E+ R    AL D                 ++ GEQ+ + YGT SN E + 
Sbjct: 316 NGLITTGYNLEDDRCECVALQD-----------------FRAGEQIYIFYGTRSNAEFVI 375

Query: 304 YYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--SFALLS 363
           + GF    N +D+V I            M+ ++ + +  P  S++      P  S  LL+
Sbjct: 376 HSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQLLA 435

Query: 364 ALRLWA-------THPNKRRGVGHLAYAGSQ---LSIKNEILVMQWLSKNCHTVLNNLPT 368
            LR++         H      +  +   G+    +S  NE+ +  +L      +L    T
Sbjct: 436 FLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKT 459

BLAST of HG10011262 vs. ExPASy Swiss-Prot
Match: A9X1D0 (Actin-histidine N-methyltransferase OS=Papio anubis OX=9555 GN=SETD3 PE=3 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 4.5e-05
Identity = 92/440 (20.91%), Postives = 164/440 (37.27%), Query Frame = 0

Query: 4   EGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLR 63
           E  F  L++WA+++G S    +  +                 G GL A R +   EL L 
Sbjct: 76  EDYFPDLMKWASENGASVEGFEMVNFK-------------EEGFGLRATRDIKAEELFLW 135

Query: 64  VPKSVLLTTQS--------LSLEDE------KLAMA-----------------LKRYPS- 123
           VP+ +L+T +S        L  +D        +A+A                 ++  PS 
Sbjct: 136 VPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERANPNSFWQPYIQTLPSE 195

Query: 124 --------------LSSTQVDYAIWATEKAALKSRTEWRGVKGLMQESN---IKNQLQTF 183
                         L STQ  + +++  K   +    +  V      +N   +K+   T+
Sbjct: 196 YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSF-TY 255

Query: 184 KAWLWASATISSRALYVPWDEAG----CLCPVGDLFNYAAPEGESLDVMDVSSFSPHASL 243
           + + WA +++ +R   +P ++       L P+ D+ N+                      
Sbjct: 256 EDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT--------------------- 315

Query: 244 NGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLE 303
           NG +TT    E+ R    AL D                 ++ GEQ+ + YGT SN E + 
Sbjct: 316 NGLITTGYNLEDDRCECVALQD-----------------FRAGEQIYIFYGTRSNAEFVI 375

Query: 304 YYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--SFALLS 363
           + GF    N +D+V I            M+ ++ + +  P  S++      P  S  LL+
Sbjct: 376 HSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQLLA 435

Query: 364 ALRLWATHPNKRR-------GVGHLAYAGSQ---LSIKNEILVMQWLSKNCHTVLNNLPT 368
            LR++     + +        +  +   G+    +S  NE+ +  +L      +L    T
Sbjct: 436 FLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKT 459

BLAST of HG10011262 vs. ExPASy TrEMBL
Match: A0A6J1J6L6 (protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481847 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 6.3e-228
Identity = 396/486 (81.48%), Postives = 420/486 (86.42%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           M TEGSF SLLRWAADHGISD VD+Q+SHSCLGRSLCVCFFPDAGGRGLGAVR L KGEL
Sbjct: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           VL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST                         
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDYA+W  EKAA KS TEWRGVKGLM+ESNIKNQLQTFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLD+MDVSSFS HASLNG++TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTYSNLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EH+IYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           YAGSQLS+KNE+LVMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQ P EL KMLL
Sbjct: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           T GGEFCAFLET GLVNR E ELHL+GKIKRSLERWKLAVQWR+LYKKALVDC SYCTRT
Sbjct: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRT 480

BLAST of HG10011262 vs. ExPASy TrEMBL
Match: A0A1S3C590 (protein SET DOMAIN GROUP 40 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 795.4 bits (2053), Expect = 1.2e-226
Identity = 390/447 (87.25%), Postives = 412/447 (92.17%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GG     RGL AVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  NKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEW 120
           NKGEL+LR PKSVLLTTQSLSLEDEKLAMALK +PSLSSTQVDYAIWATEKAALKSR +W
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQVDYAIWATEKAALKSRMDW 120

Query: 121 RGVKGLMQESNIKNQLQTFKAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGES 180
           RGVKGLMQESNIKNQLQTFKAWLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES
Sbjct: 121 RGVKGLMQESNIKNQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGES 180

Query: 181 LDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQ 240
            + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCFYARESYKKGEQ
Sbjct: 181 FNAMDVLSFPSHASLNDEL---ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQ 240

Query: 241 VLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFAL 300
           VLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY+HQNGNPSFAL
Sbjct: 241 VLLSYGTYTNIELLEYYGFLLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFAL 300

Query: 301 LSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQL 360
           LSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLNNLPTS+EED+QL
Sbjct: 301 LSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQL 360

Query: 361 LCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLA 420
           LCNI K+QDLQV REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLA
Sbjct: 361 LCNIAKVQDLQVQRELRKMLLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLA 420

Query: 421 VQWRLLYKKALVDCISYCTRTICSLSS 443
           VQWRLLYKKALVDCI YCTRTICSLSS
Sbjct: 421 VQWRLLYKKALVDCIGYCTRTICSLSS 444

BLAST of HG10011262 vs. ExPASy TrEMBL
Match: A0A6J1F4A7 (protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442034 PE=4 SV=1)

HSP 1 Score: 789.3 bits (2037), Expect = 8.5e-225
Identity = 392/486 (80.66%), Postives = 415/486 (85.39%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           M  E SF SLLRWAADHGISD VD+Q SHSCLGRSLCVCFFPDAGGRGLGAVR L KGEL
Sbjct: 1   MGNEESFESLLRWAADHGISDSVDKQCSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           VL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST                         
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDYA+W  EKAA KSR EWRGVKGLM+ESNIKNQLQTFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSRAEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPE ES D+MDVSSFS HASLNG++TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEAESFDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTY+NLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYTNLELLQYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EHDIYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDRVFIPLEHDIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           YAGSQLS+KNE+LVMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQVPREL KM  
Sbjct: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQVPRELGKMPS 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           T  GEFCAFLETNGLVNR E EL L+GKIKRSLERWKLAVQWR+LYKKALVDCISYCTRT
Sbjct: 421 TVRGEFCAFLETNGLVNREETELQLTGKIKRSLERWKLAVQWRILYKKALVDCISYCTRT 480

BLAST of HG10011262 vs. ExPASy TrEMBL
Match: A0A5D3BQD3 (Protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G001720 PE=4 SV=1)

HSP 1 Score: 780.4 bits (2014), Expect = 4.0e-222
Identity = 390/486 (80.25%), Postives = 412/486 (84.77%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GGRGL AVRQLNKGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           +LR PKSVLLTTQSLSLEDEKLAMALK +PSLSST                         
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN ++  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDEL-- 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
            E  EE+RD+QW LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLL
Sbjct: 241 -ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPNDKVFIP+EHDIY SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           YAGSQLS+KNE LVMQWLSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV REL+KMLL
Sbjct: 361 YAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLL 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           TYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDCI YCTRT
Sbjct: 421 TYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRT 480

BLAST of HG10011262 vs. ExPASy TrEMBL
Match: A0A1S3C4J5 (protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 780.4 bits (2014), Expect = 4.0e-222
Identity = 390/486 (80.25%), Postives = 412/486 (84.77%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GGRGL AVRQLNKGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST------------------------- 120
           +LR PKSVLLTTQSLSLEDEKLAMALK +PSLSST                         
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 -------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                              QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 240
           WLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN ++  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDEL-- 240

Query: 241 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
            E  EE+RD+QW LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLL
Sbjct: 241 -ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLL 300

Query: 301 QENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPNDKVFIP+EHDIY SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLL 420
           YAGSQLS+KNE LVMQWLSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV REL+KMLL
Sbjct: 361 YAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLL 420

Query: 421 TYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 443
           TYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDCI YCTRT
Sbjct: 421 TYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRT 480

BLAST of HG10011262 vs. TAIR 10
Match: AT5G17240.1 (SET domain group 40 )

HSP 1 Score: 424.9 bits (1091), Expect = 8.1e-119
Identity = 238/482 (49.38%), Postives = 308/482 (63.90%), Query Frame = 0

Query: 9   SLLRWAADHGISDPVD-QQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKS 68
           + LRWAA+ GISD +D  +   SCLG SL V  FPDAGGRGLGA R+L KGELVL+VP+ 
Sbjct: 10  TFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVPRK 69

Query: 69  VLLTTQSLSLEDEKLAMALKRYPSLSSTQV--------------------------DY-- 128
            L+TT+S+  +D KL+ A+  + SLSSTQ+                          DY  
Sbjct: 70  ALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDL 129

Query: 129 ----------------AIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASAT 188
                           A+WATEKA  K ++EW+    LM+E  +K + ++F+AWLWASAT
Sbjct: 130 LATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWASAT 189

Query: 189 ISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEK 248
           ISSR L+VPWD AGCLCPVGDLFNY AP   S       S +   ++       E H E+
Sbjct: 190 ISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESAN---NVEEAGLVVETHSER 249

Query: 249 RDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDK 308
                 LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YGF+L+EN NDK
Sbjct: 250 ------LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 309

Query: 309 VFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 368
           VFIP+E  ++S +SSWPK+SLY+HQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 310 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 369

Query: 369 LSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGE 428
           +S+KNEILVM+W+S+ C +VL +LPTSV ED  LL NI K+QD ++  E QK    +G E
Sbjct: 370 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 429

Query: 429 FCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTIC 441
             AFL+ N L +    +   +  S K  R L +W+ +VQWRL YK+ L DCISYC   + 
Sbjct: 430 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of HG10011262 vs. TAIR 10
Match: AT2G18850.1 (SET domain-containing protein )

HSP 1 Score: 46.6 bits (109), Expect = 6.0e-05
Identity = 57/245 (23.27%), Postives = 94/245 (38.37%), Query Frame = 0

Query: 130 QLQTFKAWLWASATISSRALYVPWDEA---GCLCPVGDLFNYAAPEGESLDVMDVSSFSP 189
           +L T++ +LWA     S ++ + + +     CL PV    N+              S  P
Sbjct: 302 ELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNH--------------SIYP 361

Query: 190 HASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL 249
           H    G +  +                      S+  F       KGEQ  LSYG YS+ 
Sbjct: 362 HIVKYGKVDIE---------------------TSSLKFPVSRPCNKGEQCFLSYGNYSSS 421

Query: 250 ELLEYYGFLLQ-ENPNDKVFIPMEHDIYSS---------------SSWPKESLYVHQNGN 309
            LL +YGFL + +NP D   IP++ D+                   +W   +  +   G 
Sbjct: 422 HLLTFYGFLPKGDNPYD--VIPLDFDVIDDEDIETEFSWTTHMLRGTWLSSNHNIFHYGL 481

Query: 310 PSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNL--PTS 354
           P+  LL+ LR       K  G+ H +      +++ EI V++ L      ++ NL    S
Sbjct: 482 PT-PLLNYLR-------KAHGLVHHSETDLWKNLEVEIGVLENLQSTFDDMMQNLGDADS 501

BLAST of HG10011262 vs. TAIR 10
Match: AT3G07670.1 (Rubisco methyltransferase family protein )

HSP 1 Score: 45.4 bits (106), Expect = 1.3e-04
Identity = 83/371 (22.37%), Postives = 151/371 (40.70%), Query Frame = 0

Query: 43  DAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIW 102
           D G RGL A + L KGE +L VP S++++  S    + +    +KRY        D+ + 
Sbjct: 97  DIGERGLVASQNLRKGEKLLFVPPSLVISADS-EWTNAEAGEVMKRY-----DVPDWPLL 156

Query: 103 AT---EKAALKSRTEWRGVKGLM--QESNIKNQLQTFKAWLWASATISSRALYVPWDEAG 162
           AT    +A+L+  + W      +  Q  ++    +T       ++ I  RA+    +  G
Sbjct: 157 ATYLISEASLQKSSRWFNYISALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVG 216

Query: 163 CLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWAL------- 222
                 DL +    +   L   +V +        G + +  +     D ++AL       
Sbjct: 217 ---TYEDLRSRIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADML 276

Query: 223 -------TDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQE--NPN 282
                  T   ++++     F     Y+ GEQV +SYG  SN ELL  YGF+ +E  NP+
Sbjct: 277 NHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPS 336

Query: 283 DKVFIPM-----------------EHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWAT 342
           D V + +                 +H + +   +P     V   G P   L++   L  +
Sbjct: 337 DSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFP-----VRITGWP-MELMAYAYLVVS 396

Query: 343 HPNKRRGVGHLAYAGS-QLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQ 374
            P+ R     +A A S + S KN++   +        +L++  TS+ + ++ L     + 
Sbjct: 397 PPDMRNNFEEMAKAASNKTSTKNDLKYPEIEEDALQFILDSCETSISKYSRFLKESGSMD 452

BLAST of HG10011262 vs. TAIR 10
Match: AT2G18850.2 (SET domain-containing protein )

HSP 1 Score: 44.7 bits (104), Expect = 2.3e-04
Identity = 39/147 (26.53%), Postives = 59/147 (40.14%), Query Frame = 0

Query: 130 QLQTFKAWLWASATISSRALYVPWDEA---GCLCPVGDLFNYAAPEGESLDVMDVSSFSP 189
           +L T++ +LWA     S ++ + + +     CL PV    N+              S  P
Sbjct: 302 ELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNH--------------SIYP 361

Query: 190 HASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL 249
           H    G +  +                      S+  F       KGEQ  LSYG YS+ 
Sbjct: 362 HIVKYGKVDIE---------------------TSSLKFPVSRPCNKGEQCFLSYGNYSSS 411

Query: 250 ELLEYYGFLLQ-ENPNDKVFIPMEHDI 273
            LL +YGFL + +NP D   IP++ D+
Sbjct: 422 HLLTFYGFLPKGDNPYD--VIPLDFDV 411

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038896047.13.2e-23484.77protein SET DOMAIN GROUP 40 [Benincasa hispida][more]
XP_022983189.11.3e-22781.48protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima][more]
XP_008457031.12.5e-22687.25PREDICTED: protein SET DOMAIN GROUP 40 isoform X3 [Cucumis melo][more]
KAG7017936.16.1e-22580.86Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. argyrosperma... [more]
XP_023528315.11.8e-22480.45protein SET DOMAIN GROUP 40 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q6NQJ81.1e-11749.38Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1[more]
B7ZUF31.1e-0622.14Actin-histidine N-methyltransferase OS=Xenopus tropicalis OX=8364 GN=setd3 PE=2 ... [more]
Q7SXS72.6e-0523.11Actin-histidine N-methyltransferase OS=Danio rerio OX=7955 GN=setd3 PE=2 SV=1[more]
B0VX694.5e-0521.14Actin-histidine N-methyltransferase OS=Callithrix jacchus OX=9483 GN=SETD3 PE=3 ... [more]
A9X1D04.5e-0520.91Actin-histidine N-methyltransferase OS=Papio anubis OX=9555 GN=SETD3 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1J6L66.3e-22881.48protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114818... [more]
A0A1S3C5901.2e-22687.25protein SET DOMAIN GROUP 40 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496809 P... [more]
A0A6J1F4A78.5e-22580.66protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11144... [more]
A0A5D3BQD34.0e-22280.25Protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A1S3C4J54.0e-22280.25protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496809 P... [more]
Match NameE-valueIdentityDescription
AT5G17240.18.1e-11949.38SET domain group 40 [more]
AT2G18850.16.0e-0523.27SET domain-containing protein [more]
AT3G07670.11.3e-0422.37Rubisco methyltransferase family protein [more]
AT2G18850.22.3e-0426.53SET domain-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001214SET domainPFAMPF00856SETcoord: 46..241
e-value: 3.7E-8
score: 33.9
IPR001214SET domainPROSITEPS50280SETcoord: 34..241
score: 11.631448
NoneNo IPR availableGENE3D3.90.1410.10set domain protein methyltransferase, domain 1coord: 92..252
e-value: 9.0E-23
score: 82.9
NoneNo IPR availablePANTHERPTHR13271UNCHARACTERIZED PUTATIVE METHYLTRANSFERASEcoord: 6..93
NoneNo IPR availablePANTHERPTHR13271:SF91PROTEIN SET DOMAIN GROUP 40coord: 96..427
coord: 6..93
NoneNo IPR availablePANTHERPTHR13271UNCHARACTERIZED PUTATIVE METHYLTRANSFERASEcoord: 96..427
NoneNo IPR availableCDDcd10527SET_LSMTcoord: 43..255
e-value: 5.93563E-26
score: 103.298
NoneNo IPR availableSUPERFAMILY82199SET domaincoord: 8..259
IPR036464Rubisco LSMT, substrate-binding domain superfamilyGENE3D3.90.1420.10coord: 253..441
e-value: 3.5E-7
score: 32.1
IPR036464Rubisco LSMT, substrate-binding domain superfamilySUPERFAMILY81822RuBisCo LSMT C-terminal, substrate-binding domaincoord: 251..365
IPR015353Rubisco LSMT, substrate-binding domainPFAMPF09273Rubis-subs-bindcoord: 291..357
e-value: 2.5E-6
score: 28.2

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10011262.1HG10011262.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005509 calcium ion binding
molecular_function GO:0005515 protein binding