HG10006220 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10006220
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSET domain-containing protein
LocationChr07: 15968627 .. 15974717 (-)
RNA-Seq ExpressionHG10006220
SyntenyHG10006220
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGTCCTGAGGAAGTAGATAATGTGCTGAAAGATTTGGTACAAATTGCAAGAATTATTCACTTAAATGAGGTAACCTGAGAAGTTTCTCTAGTTCAATGTTTTCTTGTCTGTGTCTGGAAATATTTAAGTTACTGCTATTCGGGAGTGCCAGATATCTTGGATTACATTGGATAGAGTGCTTGGTAATTTTTGCACTTAGGTGTGCCTAGCTAAAATTCATTTGCATTCTAGTTTCTTCTTTATGTTGGCGAAAGAAGAGGGGAAGCAAGTAGTATTTATTTGTATTTTGGTCTGAAATCATCTAATATATGTATGGATAAATAGTTATTGACAGCTATTGTAACACTTGAGGTGCAGCCTGAAATGTATTTTGGAGAAAATGATGAATGTACACCAGTAGATTTCTACAGCCCCAGGAATGAAGTGGAGGCCTTCAATACAATAATTTCTCTTGTTGGCATCTCTCTCTCTAGTTGTAAGCCTGTTCAATTTAATGTCCTGCAAGAATTACGGAAGGCAGTTATTCGCATGATCCATGAGTATGGAGATGTATACAGTATGGATGCTAAAACTTTGGATGACAGCTGCGCTAAAGAAAACTGTTTGTTACAGTGGGGTGAGAGCAATGGTGTTAGAACAAGCTTGGAGATAGCTTGTAAGTCTTCACCATCACAATTTCTTGGCAATATTCGTGTAATCCAAATCTATTAATGCTTGATTTTGAATTACTGTAGAATTATAAGTTCGAAATATTGGAGGTGCACCAAAGTTTCGCAATACCAAACAATTGTAATTTGTATGCTTTATTCTTCATGTAGATATTGAAGGTGCTGGTAGAGGAACCAAAGCCAAAGAAGATTTAGACGTTGGTGACACTGTATTGGAGATCCCTCTGGCTATTATCATTTCTGAGGAACTTGTGCAGAAAAGCACCATGGTAAATTCATCACTAAGAAGTTGTTGTAGTTGCCAATCTTGAAATTTGAATTGTCTAGAACTTTCCTGACAGAAAAATAGGATGAAAAGATTTATAATATGAATTCCTTATTAATTCTCACCATACAGATGCCATATTTCAGTTCCCCATCTCATTGTAAGTAAATGCTTATATCTAGAGTTAAAGTTGGTTTAAAGAATTTTTAGGGATTTTACATTTTCAAGAATCTTTGATCATTTAATGAAGCCATGGTGAAATGTTTTTTCTCCACCAGTATCCCATATTATCTAAGGTTGAAGGCATGCTATCTGAGACAATGATGTTGTTATGGAGTATGAAGGAGAAGCACATTGCCGATTCCAAATTCAAGGTCTACTTTGATACACTACCAGAAGCGTTTAATACTGGTATTACCATTACCTTTAGACCATTCCCACTGATTTCTTGTATTTCCATTGTGTTCAGCGCGAGATTTGGGTGCACACTTATCAAATGGTTGATTTTCCTTTTTGAAGAATTTTCAGTTAGCAGTTAGCACCGATGCTCTTTTTATGTCTATTTCCATACTGCTGCATGCAAGTGTATAAATTCTTTTTATTCTTTATGTATGAATAAGGATATTAGTTAACTTGCCTTCTTCTTTTCTTTGAGAGGGTAACAGAATTTCATTAATATAATGAAGTTTAAAAAGGCTAGTGGGGTTGCAAAAATCCCTTCCAATTTGGAGCCTAAATATCACTAAGAGAGTGATTACAAAAGAAAAAAAAATCTTAGCTTGCACCAAGTGGCTGCTGTAGGGCTGCTGTGTGCAAAATAAGGTCAAATAAGTGTAGTTCCTCTCCCCATCCACTATGCTTCTCAAGCGAAGTGATTCAGAGGGAGTTCTAGTCAAGTTCAACCACAGATATTTTGATTGCTTTTTCTAGGGGTGAGCTACCATAATAGTGCGTAGGTTGATCACAACATTTTTGTGCTAAAGGGTGGTTCATCCATAAGCCTCAAAAAGAGCATCCCAGGAAACAGAAGCGAAGTGGCAGCTTGTAGAAATATAACAACCTTACGACTCACGGTCCAACTTGAAAAGGGAGCATGTACTGGGGACAGAGCAAGAAATGGATATCTTCTTTGCATATTATCCATAGCATCAGTATAGCCCATGACCACCTCCTGTAGGAAGTTTACTTTTTTGGGGGATTTCCCTTTCCAGAAAACTTTGATAAGATCATGGCGGATGGAAGGATTTTTGGTGGCTAAAATGTGGTGGAGAGATCTACATGAAAAGGAACTGGTTGGGTCAAGAGGCCACTTCCGAGAGTCTAGTTAGGGCTTAAGTCGCCCTCAGTGTTGCTGATCAGAAAGAGCCAGCCTTCATACTCATCGTTAAACATCTCCTCAATGGTAGGTTTATCGTAGGTTGTAGGAGCTTGTGTTGCTGTCCCAGAGAATGGGCCCAATTCATCTTTGTGATAGGAATGGTATTACGGGTATAAATGATAGGAATGGTATTAGGGTATACTAGTATACCCCAATACTGTAATAATACTCTAATACCATTCCTATCACTTTGGAAGATGAATCTTAGTTGACAAATCTTCCCAAAATCAGTTTCCAATCTTGAAGGAGCCAATGGAAAAGATGAGAGCCTTTGGTTTCTAAATGTCTCTCCAGATACCTTGGAGTGACTGATGGAATTGTCATTGGGCCACCAATCAAGAAGCATGGAACCCCATTTGTCATAATAAATTTCCTCCAAATGGGACCCCATTTGGCCACGAGAGACTGATTTTTTTCCCTGCAGATTTCCAACGCCAAGATCACCCTTAGAGATGGGACATGAAACTTGTCCGAAGCAGCCAAATGAGGGCCATCTTTCCGATGAAAGTTGATATATTTTCTTAAAGGGACTTGGGAATCCTTTAAAGCCTTTTGAATAAAGTTAAAGACATATAACAGATGGGGACGCTAGAGAGAGGACTAGATGAGAGTTTGGTAGCCTCCTTTGGAGAAGAAAGGCATTTCCGCCCTTTTAGCTTCTTTTAAACCTTGATTAAAATATGGTCTCGAAAAGACTGCAGTTTGGCAGCAGATTGGGTGATGGAATCACCAACATTGATGCAAAGTAGCTCAGGTTTGCTCCAATTAACATTGAGAGTAGAGGAAGCTTAAAACATTCTGATGACTTGGAGGAGATTTGTAAATTTATCGATAGAAGTGGAAGAGAATAGGATGATGCCGTCCACCGATGGCCTGTTTCTGGATCTCTTTGGACCAAAGTGAAAGTAGTGAATTCGTAAGAAGACATTTTTCCTTCCCTCTCAAAATAGGCGATATGATTATCAGATTATAATGGGATATGTAAATACAGAATTTATGGGAAGGGAAAAAGGGCTTCCTCTGTCTCACACCCTCAAACACTCATGGTTAGTTTTCAATGGAGGAAAAAGAGCTTTAAGTTTGAATTTAAAAAAAAAAAAAATTCCATTGACTTTGTCAGGAAATAGGAGATTGAAAAGATATTTCAAGTGAGACATTAGAACTGGTGAAGCGGAGGGGGGAGTATCATATGGAAACTAATAGATATAGAGATCAACCCCAAGAAAGATAATAATATTGGAAAGAACAATAGCTCAGATACGAAGGAGGAGATTCATGGGAGATTTAACCTAAAAAAGAGTTCCATCTTGAATAGTTGATAGCGAGGTTGTTTATCATAGTAACTGAGGTGCATACTTTTTATCTCACATGTTCCATCCCTTCCTTTGTATATAGTTTTATCATTTTTACAATCTTGACTTCTCAGCACTAAAGCGGGAAAATGACAAAGCTACAGTTATGTTGAATCACCTAAACCTTAAATGTGTTCTGAAATGTGCATGAAGTCATTTTTATTTTCATAGTCTAATATCTTACAAAAAGTTCACAGGGTTAAGTTTTGGAGTTGGCGCAATGATGACTTTGGACGGGACCCTACTTTTAGATGAGCTAATGCAAGCAAAAGAGGTTCATTTCTCACAAATTTTCTCGGTCATTATTTATTTGTTAAAACAACTTCTAGTGGCAAATTACAGGTTCCTTTAATTATGTAGTTGAAAAGTAATCCCACTACATTTTCCTCATTATATTTTCGGGGTTCATGGAAGTTTGGAACAGATGCATTAAACATTTACTGAAATAAAGTGAAAAGATAAGACCATTTCTCGTCTGGTTCTCTTTTTAAGGATGGCCATTTAGCTCATTTGTCCAATGTTTCCTACTTCATTTTGATGGTCTGACTTTTGGATATCATAATTCGATATTCTACTTCTATTTATTTGTAATGTTAACTTCTTTGCAGCACTTGCGGGAACAATACAATGAGTTGTTTCCTGCGTTATGTAACAACCATCCTGATGTCTTCCCAGAGGAATTCTACTCATGGGAGCAGTTCTTGTGGGCTTGTGAACTTTGGTATTCAAATAGCTTGAAAATCATGTTCTCTGATGGAAAACTCAGAACCTGCTTGGTTCCAATTGCAGGTTTTCTCAACCACTCGGTATGTGTGATTCCTGCTTCTCATACCCCAGACGCCATTTTCCCCTCATGGATGGGGGTTGAGTTTAATTGTAAAAATTGTACGAAGTGTAGGTTAATTTACTTGAACTAGTTTATTTTAAAAAATGTCCTTCCTTGCTGTTACTTCTTACCTCAAATTGATTTCTTCTCGTCTTTATACTTGTTTGCTCAATCACGTGGCTGTTAATAGCTTGCAAATTTGTGACTGTCCTGCTTACCATAGATTCATGTTTTTTCTTCATCATTAAACTTGGATGCCATTCTGATTTCTGAGTTGTTTCATATTTCTTGTTTATTCCTTGTTTGCTTCATGTTTCTAATATATTATTTTGAATTTGAGGTCTCATTTTCCTTTGCTTCATTGTTATTTGGTTCTTTGGGACAGTTGTATCCGCACATACTACACTATGGCAAAGTTGATTCAGATACAAATTCCTTGAAATTCCGTCTATCAAGACCATGCCGTGCAGGGGAAGAGTGTTACCTTAGTTATGGCAATTACTCTGGTTCTCATCTAGTTACTTTCTATGGGTTTTTACCTGAAGGAGACAACGTAAATGATGTCATTCCATTAGGTATTTTCTAGTTCTGTCTACTAGACATTTAAATAGTGCTTCCTTTTACTTTAATAACAAATTAATGCTCATCTTTCTTATATTAGTATTACCATTGTTTATTTGTGAGTGCAGACATTGACTTCGGTGATGATGATAGCAATAGCATCACGTCAGACTGGAGTACTCATATGGTGAGGGGAACATGGTTGTCAAAGAACCAAAGTATATTCCATTATGGTCTTCCCTCACCATTATTAGAGTGTTTCCGGAAAGCTCGGTGCCCTGGATTACACACCAACCATAAGGTAATTTTTCATTTTTTAAAATCTTGTTTTGCTTATTTCTCAATTATTACTTCGTAGGAAAAATTTCCCCAATGTTTGTAATAGTCATCAAACTTGAAAACAGTAGACCCAACTGGATAACTATTTGGTTATTGAAAATTAAGCTCATAAGTTTCACCTATACGTTTCTTTGTTTTGTTATTGACTTTCTACCGATGTGTTAAAACACCAAGCCAAGTTTTGAAAACTAAAAAAAAAAAGTAGTTTTTAAGAACTTGTTTTTGTTTTTAGAATTTAACTAAAAGTGAAATCGTGTTATCATACAGGCTTTTAAATTTTACAAATACCTATATGCTTCTTATAGGATTTAGAATTTTTTCTACGTTGGTTATTATTACTTGTTTATCTTAAATTCGTAGCGACAGGGAAGCTTGGAAAATGAAATGGAAGTCCTCAATGAACTCCTGTCAATCTTTTCTGGAATGATGGAAAATCTAGAGGATGAGAATGAAGACAGGTAATGAAGTGTAAATTGTACATGCAGTATGCAACTTCATTGAAGCAACACTTTTCCCTGGAATAAACTTGAATGTTTTGGTCTTTGCAGGAGAAGTACAGAATGGGACATAAAGTTAGCACTGAACTACAAAGATCTACAAAGGAAGATAGTTTCCTCATGTCTGACTTCGTGTCATGCTGGTCGCAAGATAGTGGAATTTGCATTATGTGATTGCATGGAAGAGGACACTCGAGGCTAA

mRNA sequence

ATGTGTCCTGAGGAAGTAGATAATGTGCTGAAAGATTTGGTACAAATTGCAAGAATTATTCACTTAAATGAGTTATTGACAGCTATTGTAACACTTGAGGTGCAGCCTGAAATGTATTTTGGAGAAAATGATGAATGTACACCAGTAGATTTCTACAGCCCCAGGAATGAAGTGGAGGCCTTCAATACAATAATTTCTCTTGTTGGCATCTCTCTCTCTAGTTGTAAGCCTGTTCAATTTAATGTCCTGCAAGAATTACGGAAGGCAGTTATTCGCATGATCCATGAGTATGGAGATGTATACAGTATGGATGCTAAAACTTTGGATGACAGCTGCGCTAAAGAAAACTGTTTGTTACAGTGGGGTGAGAGCAATGGTGTTAGAACAAGCTTGGAGATAGCTTATATTGAAGGTGCTGGTAGAGGAACCAAAGCCAAAGAAGATTTAGACGTTGGTGACACTGTATTGGAGATCCCTCTGGCTATTATCATTTCTGAGGAACTTGTGCAGAAAAGCACCATGTATCCCATATTATCTAAGGTTGAAGGCATGCTATCTGAGACAATGATGTTGTTATGGAGTATGAAGGAGAAGCACATTGCCGATTCCAAATTCAAGGTCTACTTTGATACACTACCAGAAGCGTTTAATACTGGGTTAAGTTTTGGAGTTGGCGCAATGATGACTTTGGACGGGACCCTACTTTTAGATGAGCTAATGCAAGCAAAAGAGCACTTGCGGGAACAATACAATGAGTTGTTTCCTGCGTTATGTAACAACCATCCTGATGTCTTCCCAGAGGAATTCTACTCATGGGAGCAGTTCTTGTGGGCTTGTGAACTTTGGTATTCAAATAGCTTGAAAATCATGTTCTCTGATGGAAAACTCAGAACCTGCTTGGTTCCAATTGCAGGTTTTCTCAACCACTCGTTGTATCCGCACATACTACACTATGGCAAAGTTGATTCAGATACAAATTCCTTGAAATTCCGTCTATCAAGACCATGCCGTGCAGGGGAAGAGTGTTACCTTAGTTATGGCAATTACTCTGGTTCTCATCTAGTTACTTTCTATGGGTTTTTACCTGAAGGAGACAACGTAAATGATGTCATTCCATTAGACATTGACTTCGGTGATGATGATAGCAATAGCATCACGTCAGACTGGAGTACTCATATGGTGAGGGGAACATGGTTGTCAAAGAACCAAAGTATATTCCATTATGGTCTTCCCTCACCATTATTAGAGTGTTTCCGGAAAGCTCGGTGCCCTGGATTACACACCAACCATAAGCGACAGGGAAGCTTGGAAAATGAAATGGAAGTCCTCAATGAACTCCTGTCAATCTTTTCTGGAATGATGGAAAATCTAGAGGATGAGAATGAAGACAGGAGAAGTACAGAATGGGACATAAAGTTAGCACTGAACTACAAAGATCTACAAAGGAAGATAGTTTCCTCATGTCTGACTTCGTGTCATGCTGGTCGCAAGATAGTGGAATTTGCATTATGTGATTGCATGGAAGAGGACACTCGAGGCTAA

Coding sequence (CDS)

ATGTGTCCTGAGGAAGTAGATAATGTGCTGAAAGATTTGGTACAAATTGCAAGAATTATTCACTTAAATGAGTTATTGACAGCTATTGTAACACTTGAGGTGCAGCCTGAAATGTATTTTGGAGAAAATGATGAATGTACACCAGTAGATTTCTACAGCCCCAGGAATGAAGTGGAGGCCTTCAATACAATAATTTCTCTTGTTGGCATCTCTCTCTCTAGTTGTAAGCCTGTTCAATTTAATGTCCTGCAAGAATTACGGAAGGCAGTTATTCGCATGATCCATGAGTATGGAGATGTATACAGTATGGATGCTAAAACTTTGGATGACAGCTGCGCTAAAGAAAACTGTTTGTTACAGTGGGGTGAGAGCAATGGTGTTAGAACAAGCTTGGAGATAGCTTATATTGAAGGTGCTGGTAGAGGAACCAAAGCCAAAGAAGATTTAGACGTTGGTGACACTGTATTGGAGATCCCTCTGGCTATTATCATTTCTGAGGAACTTGTGCAGAAAAGCACCATGTATCCCATATTATCTAAGGTTGAAGGCATGCTATCTGAGACAATGATGTTGTTATGGAGTATGAAGGAGAAGCACATTGCCGATTCCAAATTCAAGGTCTACTTTGATACACTACCAGAAGCGTTTAATACTGGGTTAAGTTTTGGAGTTGGCGCAATGATGACTTTGGACGGGACCCTACTTTTAGATGAGCTAATGCAAGCAAAAGAGCACTTGCGGGAACAATACAATGAGTTGTTTCCTGCGTTATGTAACAACCATCCTGATGTCTTCCCAGAGGAATTCTACTCATGGGAGCAGTTCTTGTGGGCTTGTGAACTTTGGTATTCAAATAGCTTGAAAATCATGTTCTCTGATGGAAAACTCAGAACCTGCTTGGTTCCAATTGCAGGTTTTCTCAACCACTCGTTGTATCCGCACATACTACACTATGGCAAAGTTGATTCAGATACAAATTCCTTGAAATTCCGTCTATCAAGACCATGCCGTGCAGGGGAAGAGTGTTACCTTAGTTATGGCAATTACTCTGGTTCTCATCTAGTTACTTTCTATGGGTTTTTACCTGAAGGAGACAACGTAAATGATGTCATTCCATTAGACATTGACTTCGGTGATGATGATAGCAATAGCATCACGTCAGACTGGAGTACTCATATGGTGAGGGGAACATGGTTGTCAAAGAACCAAAGTATATTCCATTATGGTCTTCCCTCACCATTATTAGAGTGTTTCCGGAAAGCTCGGTGCCCTGGATTACACACCAACCATAAGCGACAGGGAAGCTTGGAAAATGAAATGGAAGTCCTCAATGAACTCCTGTCAATCTTTTCTGGAATGATGGAAAATCTAGAGGATGAGAATGAAGACAGGAGAAGTACAGAATGGGACATAAAGTTAGCACTGAACTACAAAGATCTACAAAGGAAGATAGTTTCCTCATGTCTGACTTCGTGTCATGCTGGTCGCAAGATAGTGGAATTTGCATTATGTGATTGCATGGAAGAGGACACTCGAGGCTAA

Protein sequence

MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEAFNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQWGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSKVEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCLVPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRKARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDLQRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG
Homology
BLAST of HG10006220 vs. NCBI nr
Match: XP_038889411.1 (uncharacterized protein LOC120079325 isoform X1 [Benincasa hispida])

HSP 1 Score: 934.5 bits (2414), Expect = 3.9e-268
Identity = 461/513 (89.86%), Postives = 475/513 (92.59%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           MCPEEVD VLK+LVQIARIIHLNE           PEMYFGENDE TPVDFYSPRNEVEA
Sbjct: 56  MCPEEVDTVLKELVQIARIIHLNE-----------PEMYFGENDERTPVDFYSPRNEVEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
           FNTIISLV ISLSSCKPVQF+VLQ LRKAVIRMIHEYG+VYSMDAK  +DSC KENCLLQ
Sbjct: 116 FNTIISLVDISLSSCKPVQFHVLQNLRKAVIRMIHEYGNVYSMDAKPSEDSCTKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRTSL+IAY+EGAGRGT A EDL+VGDTVLEIPLAIIISEELVQKSTMYPILSK
Sbjct: 176 WGESNGVRTSLKIAYVEGAGRGTIATEDLEVGDTVLEIPLAIIISEELVQKSTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           VEGML ETMMLLWSMKEKHIADSKFKVYFDTLPEAF TGLSFGVGAM TLDGTLL DELM
Sbjct: 236 VEGMLPETMMLLWSMKEKHIADSKFKVYFDTLPEAFKTGLSFGVGAMKTLDGTLLFDELM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAK+HLREQY+ELFP LCNNHPDVF EEFYSWEQFLWACELWYSNSLKIMFSDG LRTCL
Sbjct: 296 QAKQHLREQYDELFPVLCNNHPDVFSEEFYSWEQFLWACELWYSNSLKIMFSDGILRTCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHYGKVDS TNSLKF LSRPCRAGEECYLSYGNYSGSHLVTFYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYGKVDSGTNSLKFHLSRPCRAGEECYLSYGNYSGSHLVTFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDDDSN+ TS+WSTHMVRGTWLSKNQ+IFHYGLPSPLLECFRK
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDDSNNTTSNWSTHMVRGTWLSKNQNIFHYGLPSPLLECFRK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDL 480
           A  PGL TN K QGSLENEMEVLNELLSIFSGMMENLEDE+EDR STEWDIKLALNYKDL
Sbjct: 476 ALFPGLLTNRKLQGSLENEMEVLNELLSIFSGMMENLEDEDEDRTSTEWDIKLALNYKDL 535

Query: 481 QRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           QRKIVSSCLTSCHAGRK VEFAL DCMEEDTRG
Sbjct: 536 QRKIVSSCLTSCHAGRKTVEFALYDCMEEDTRG 557

BLAST of HG10006220 vs. NCBI nr
Match: XP_038889412.1 (uncharacterized protein LOC120079325 isoform X2 [Benincasa hispida])

HSP 1 Score: 928.7 bits (2399), Expect = 2.2e-266
Identity = 460/513 (89.67%), Postives = 475/513 (92.59%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           MCPEEVD VLK+LVQIARIIHLNE           PEMYFGENDE TPVDFYSPRNEVEA
Sbjct: 56  MCPEEVDTVLKELVQIARIIHLNE-----------PEMYFGENDERTPVDFYSPRNEVEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
           FNTIISLV ISLSSCKPVQF+VLQ LRKAVIRMIHEYG+VYSMDAK  +DSC KENCLLQ
Sbjct: 116 FNTIISLVDISLSSCKPVQFHVLQNLRKAVIRMIHEYGNVYSMDAKPSEDSCTKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRTSL+IAY+EGAGRGT A EDL+VGDTVLEIPLAIIISEELVQKSTMYPILSK
Sbjct: 176 WGESNGVRTSLKIAYVEGAGRGTIATEDLEVGDTVLEIPLAIIISEELVQKSTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           VEGML ETMMLLWSMKEKHIADSKFKVYFDTLPEAF TGLSFGVGAM TLDGTLL DELM
Sbjct: 236 VEGMLPETMMLLWSMKEKHIADSKFKVYFDTLPEAFKTGLSFGVGAMKTLDGTLLFDELM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAK+HLREQY+ELFP LCNNHPDVF EEFYSWEQFLWACELWYSNSLKIMFSDG LRTCL
Sbjct: 296 QAKQHLREQYDELFPVLCNNHPDVFSEEFYSWEQFLWACELWYSNSLKIMFSDGILRTCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHYGKVDS TNSLKF LSRPCRAGEECYLSYGNYSGSHLVTFYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYGKVDSGTNSLKFHLSRPCRAGEECYLSYGNYSGSHLVTFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDDDSN+ TS+WSTHMVRGTWLSKNQ+IFHYGLPSPLLECFRK
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDDSNNTTSNWSTHMVRGTWLSKNQNIFHYGLPSPLLECFRK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDL 480
           A  PGL TN  R+GSLENEMEVLNELLSIFSGMMENLEDE+EDR STEWDIKLALNYKDL
Sbjct: 476 ALFPGLLTN--RKGSLENEMEVLNELLSIFSGMMENLEDEDEDRTSTEWDIKLALNYKDL 535

Query: 481 QRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           QRKIVSSCLTSCHAGRK VEFAL DCMEEDTRG
Sbjct: 536 QRKIVSSCLTSCHAGRKTVEFALYDCMEEDTRG 555

BLAST of HG10006220 vs. NCBI nr
Match: XP_038889413.1 (protein SET DOMAIN GROUP 40 isoform X3 [Benincasa hispida])

HSP 1 Score: 912.5 bits (2357), Expect = 1.6e-261
Identity = 446/489 (91.21%), Postives = 461/489 (94.27%), Query Frame = 0

Query: 25  LLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEAFNTIISLVGISLSSCKPVQFNVLQ 84
           LL A +T+EVQPEMYFGENDE TPVDFYSPRNEVEAFNTIISLV ISLSSCKPVQF+VLQ
Sbjct: 3   LLVATLTIEVQPEMYFGENDERTPVDFYSPRNEVEAFNTIISLVDISLSSCKPVQFHVLQ 62

Query: 85  ELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQWGESNGVRTSLEIAYIEGAGRGTK 144
            LRKAVIRMIHEYG+VYSMDAK  +DSC KENCLLQWGESNGVRTSL+IAY+EGAGRGT 
Sbjct: 63  NLRKAVIRMIHEYGNVYSMDAKPSEDSCTKENCLLQWGESNGVRTSLKIAYVEGAGRGTI 122

Query: 145 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSKVEGMLSETMMLLWSMKEKHIADSK 204
           A EDL+VGDTVLEIPLAIIISEELVQKSTMYPILSKVEGML ETMMLLWSMKEKHIADSK
Sbjct: 123 ATEDLEVGDTVLEIPLAIIISEELVQKSTMYPILSKVEGMLPETMMLLWSMKEKHIADSK 182

Query: 205 FKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELMQAKEHLREQYNELFPALCNNHPDV 264
           FKVYFDTLPEAF TGLSFGVGAM TLDGTLL DELMQAK+HLREQY+ELFP LCNNHPDV
Sbjct: 183 FKVYFDTLPEAFKTGLSFGVGAMKTLDGTLLFDELMQAKQHLREQYDELFPVLCNNHPDV 242

Query: 265 FPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCLVPIAGFLNHSLYPHILHYGKVDSD 324
           F EEFYSWEQFLWACELWYSNSLKIMFSDG LRTCLVPIAGFLNHSL+PHILHYGKVDS 
Sbjct: 243 FSEEFYSWEQFLWACELWYSNSLKIMFSDGILRTCLVPIAGFLNHSLHPHILHYGKVDSG 302

Query: 325 TNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDDSNS 384
           TNSLKF LSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDDSN+
Sbjct: 303 TNSLKFHLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDDSNN 362

Query: 385 ITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRKARCPGLHTNHKRQGSLENEMEVLN 444
            TS+WSTHMVRGTWLSKNQ+IFHYGLPSPLLECFRKA  PGL TN K QGSLENEMEVLN
Sbjct: 363 TTSNWSTHMVRGTWLSKNQNIFHYGLPSPLLECFRKALFPGLLTNRKLQGSLENEMEVLN 422

Query: 445 ELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDLQRKIVSSCLTSCHAGRKIVEFALC 504
           ELLSIFSGMMENLEDE+EDR STEWDIKLALNYKDLQRKIVSSCLTSCHAGRK VEFAL 
Sbjct: 423 ELLSIFSGMMENLEDEDEDRTSTEWDIKLALNYKDLQRKIVSSCLTSCHAGRKTVEFALY 482

Query: 505 DCMEEDTRG 514
           DCMEEDTRG
Sbjct: 483 DCMEEDTRG 491

BLAST of HG10006220 vs. NCBI nr
Match: XP_022969685.1 (uncharacterized protein LOC111468639 isoform X1 [Cucurbita maxima] >XP_022969686.1 uncharacterized protein LOC111468639 isoform X1 [Cucurbita maxima])

HSP 1 Score: 901.0 bits (2327), Expect = 4.8e-258
Identity = 438/513 (85.38%), Postives = 462/513 (90.06%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           +CPEEVD VLK+LVQIARIIHLNE           PEMYF E+D CTP D YSPRNE+EA
Sbjct: 56  LCPEEVDTVLKELVQIARIIHLNE-----------PEMYFEEDDACTPADSYSPRNEMEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
            NTIISLV I LSSCKPVQ NVLQELRKA IRMIH+YG VYSMDAKTL D+C KENCLLQ
Sbjct: 116 LNTIISLVDICLSSCKPVQLNVLQELRKAAIRMIHKYGHVYSMDAKTLGDNCVKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRT L+IAY+EGAGRGT AKEDL+VGDTVLEIPL I+ISEELVQK+TMYPILSK
Sbjct: 176 WGESNGVRTRLKIAYVEGAGRGTIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           +EGM SETM+L+WSMKEKHI DSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLL  E+M
Sbjct: 236 IEGMSSETMLLIWSMKEKHIVDSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFGEIM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAKEHLREQYNELFPALCNNHPDVFPEE+YSWE+FLWACELWYSNS+KIMFSDG LRTCL
Sbjct: 296 QAKEHLREQYNELFPALCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSLRTCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHY K +SDTNSLKFRLSRPCRAGEECYLSYGNYS SHLV FYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYSKANSDTNSLKFRLSRPCRAGEECYLSYGNYSASHLVAFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDD SNS TSDWSTHMVRGTWLSKNQSIFHYGLPSPLLEC RK
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDASNSNTSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECLRK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDL 480
           ARCP L T  K QGSLENEMEVLN+LLSIF GMMENLED NEDR STEWDIKLALNYKDL
Sbjct: 476 ARCPELRTKLKLQGSLENEMEVLNDLLSIFDGMMENLEDVNEDRSSTEWDIKLALNYKDL 535

Query: 481 QRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           QR+IVSSCL SCHAG K+VE AL +CMEEDTRG
Sbjct: 536 QRRIVSSCLNSCHAGLKMVELALYECMEEDTRG 557

BLAST of HG10006220 vs. NCBI nr
Match: XP_004150779.1 (uncharacterized protein LOC101212907 isoform X1 [Cucumis sativus] >KAE8651665.1 hypothetical protein Csa_021251 [Cucumis sativus])

HSP 1 Score: 898.7 bits (2321), Expect = 2.4e-257
Identity = 439/515 (85.24%), Postives = 470/515 (91.26%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           MCPEEVD VLK+LVQI+RIIHLNE           PEMYFGENDE TPVDFYSPRNEVE 
Sbjct: 56  MCPEEVDTVLKELVQISRIIHLNE-----------PEMYFGENDEGTPVDFYSPRNEVET 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
           F++IISL+ +SLSSC P QF+VLQELRKAVI MIHEYG+V+SM AKTL++SC K NCLL+
Sbjct: 116 FDSIISLLDLSLSSCTPAQFSVLQELRKAVIHMIHEYGNVHSMVAKTLENSCEKGNCLLE 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRTSL+IAY+EGAGRGT AKEDLDVGDTVLEIPLAIIISEELVQKSTMYP+LSK
Sbjct: 176 WGESNGVRTSLKIAYVEGAGRGTIAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           VEGML ETM LLWSMKEKHI DS+F+VYFDTLPEAFNTGLSFGVGAM TL GTLL DELM
Sbjct: 236 VEGMLPETMTLLWSMKEKHIVDSEFRVYFDTLPEAFNTGLSFGVGAMTTLVGTLLFDELM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAKEHLR+QYNELFPALCNNHPD+FPEEFYSWE+FLWACELWYSNSLKIMF DG +RTCL
Sbjct: 296 QAKEHLRKQYNELFPALCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHYGKVDSDT+SLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYGKVDSDTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDDD+N+ITSDWSTHMVRGTWLSK QSIFHYGLPSP LECFRK
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDDNNNITSDWSTHMVRGTWLSKIQSIFHYGLPSPFLECFRK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENED--RRSTEWDIKLALNYK 480
           A  PGLHTN K QGS+E E+EVLNELLSIFS MME LEDE+ED  R STEWDIKLAL YK
Sbjct: 476 ALFPGLHTNCKLQGSMEGEIEVLNELLSIFSEMMEKLEDEDEDESRTSTEWDIKLALEYK 535

Query: 481 DLQRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           DLQRKIVSSCLTSCH+G K+VE ALCDCM+EDTRG
Sbjct: 536 DLQRKIVSSCLTSCHSGLKMVEIALCDCMKEDTRG 559

BLAST of HG10006220 vs. ExPASy Swiss-Prot
Match: P94026 (Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic OS=Nicotiana tabacum OX=4097 GN=RBCMT PE=2 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 1.4e-13
Identity = 63/251 (25.10%), Postives = 120/251 (47.81%), Query Frame = 0

Query: 140 GRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSKVEGMLSETMMLLWSMKEKH 199
           G G  AK D+  G+TVL++P    I+ + V +S +  + S ++  +S  + LL   +EK 
Sbjct: 84  GLGLVAKRDIAKGETVLQVPKRFWINPDAVAESEIGNVCSGLKPWISVALFLL---REKW 143

Query: 200 IADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELMQAKEHLREQYNELFPALCN 259
             DSK+K Y D LP++ ++ + +    +  + GT LL   M  K++++ ++ ++   +  
Sbjct: 144 RDDSKWKYYMDVLPKSTDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVEEEVIL 203

Query: 260 NHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCLVPIAGFLNHSL------YP 319
            +  +FP    + + F WA  +  S +   + +   +   LVP A   NH+       + 
Sbjct: 204 RNKQLFPFPI-TLDDFFWAFGILRSRAFSRLRNQNLI---LVPFADLTNHNARVTTEDHA 263

Query: 320 HILHYGKVDSDTNSLKFRLSRP--CRAGEECYLSYG-NYSGSHLVTFYGFLPEGDNVNDV 379
           H +  G     +  L F L  P   +AG++ ++ Y  N S + +   YGF+ E  +  D 
Sbjct: 264 HEVR-GPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLNKSNADMALDYGFI-EPSSARDA 323

Query: 380 IPLDIDFGDDD 382
             L ++  + D
Sbjct: 324 FTLTLEISESD 325

BLAST of HG10006220 vs. ExPASy Swiss-Prot
Match: Q9XI84 ([Fructose-bisphosphate aldolase]-lysine N-methyltransferase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LSMT-L PE=1 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 2.3e-13
Identity = 66/271 (24.35%), Postives = 125/271 (46.13%), Query Frame = 0

Query: 120 QWGESNGVRTSLEIA--YIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPI 179
           +W    GV +   +A   +   G G  A+ D+   + VLEIP  + I+ E V  S + P+
Sbjct: 54  KWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRLWINPETVTASKIGPL 113

Query: 180 LSKVEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLD 239
              ++  +S  + L+   +EK+  +S ++VY D LP++ ++ + +    +  L GT LL 
Sbjct: 114 CGGLKPWVSVALFLI---REKYEEESSWRVYLDMLPQSTDSTVFWSEEELAELKGTQLLS 173

Query: 240 ELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLR 299
             +  KE++  ++ +L   +   + D+F     + + F+WA  +  S +   +     + 
Sbjct: 174 TTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGILKSRAFSRLRGQNLV- 233

Query: 300 TCLVPIAGFLNHS----LYPHILHYGKVDSDTNSLKFRLSRP--CRAGEECYLSYG-NYS 359
             L+P+A  +NH+       +          +  L F L  P   +AGE+ Y+ Y  N S
Sbjct: 234 --LIPLADLINHNPAIKTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKS 293

Query: 360 GSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 382
            + L   YGF+ E +   +   L I+  + D
Sbjct: 294 NAELALDYGFV-ESNPKRNSYTLTIEIPESD 316

BLAST of HG10006220 vs. ExPASy Swiss-Prot
Match: Q08961 (Ribosomal lysine N-methyltransferase 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=RKM1 PE=1 SV=1)

HSP 1 Score: 48.1 bits (113), Expect = 3.4e-04
Identity = 68/302 (22.52%), Postives = 114/302 (37.75%), Query Frame = 0

Query: 118 LLQWGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQK-STMYP 177
           LLQWG S GV    E+ ++    +G     + D+ +  ++IP  I+IS  L  K   +  
Sbjct: 9   LLQWGASFGVIVPEELKFLYTDLKGIICVCEKDIDNPSIKIPPEIVISRNLPMKFFGLSE 68

Query: 178 ILSKVEGMLSETM-MLLWSMKEKHIADS-----KFKVYFDTLPEAFNTGLSFGVGAMMTL 237
               + G L      + +      I D+     KFK Y D LP   N+ L +    +  L
Sbjct: 69  STKNINGWLKLFFAKIKFDRDNDTIVDNVRVNDKFKPYLDALPSRLNSPLVWNPSELKRL 128

Query: 238 DGTLLLDELMQAKEHLREQYNEL--------------------------FPALCNNHPDV 297
             T + + + +  E + +++ EL                          + AL      +
Sbjct: 129 SSTNIGNSIHEKFEGIFKEWFELVSSSDMFDLERVADDVQTFHNLDELTYEALYEKILKI 188

Query: 298 F----PEEFYSWEQFLWACELWYSNSLKIMFSDGKL-RTC------LVPIAGFLNHSLYP 357
                P  +YS+  FLW+  ++ S +    F +  L R C      L+PI   LNH    
Sbjct: 189 TELQRPTIWYSFPAFLWSHLIFISRA----FPEYVLNRNCPDNSIVLLPIVDLLNHDYRS 248

Query: 358 HILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPL 376
            +  Y     +     +          E   +YG      L++ YGF+ E DN+ D + L
Sbjct: 249 KVKWY----PENGWFCYEKIGTASQSRELSNNYGGKGNEELLSGYGFVLE-DNIFDSVAL 301

BLAST of HG10006220 vs. ExPASy TrEMBL
Match: A0A6J1I0M2 (uncharacterized protein LOC111468639 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111468639 PE=4 SV=1)

HSP 1 Score: 901.0 bits (2327), Expect = 2.3e-258
Identity = 438/513 (85.38%), Postives = 462/513 (90.06%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           +CPEEVD VLK+LVQIARIIHLNE           PEMYF E+D CTP D YSPRNE+EA
Sbjct: 56  LCPEEVDTVLKELVQIARIIHLNE-----------PEMYFEEDDACTPADSYSPRNEMEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
            NTIISLV I LSSCKPVQ NVLQELRKA IRMIH+YG VYSMDAKTL D+C KENCLLQ
Sbjct: 116 LNTIISLVDICLSSCKPVQLNVLQELRKAAIRMIHKYGHVYSMDAKTLGDNCVKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRT L+IAY+EGAGRGT AKEDL+VGDTVLEIPL I+ISEELVQK+TMYPILSK
Sbjct: 176 WGESNGVRTRLKIAYVEGAGRGTIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           +EGM SETM+L+WSMKEKHI DSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLL  E+M
Sbjct: 236 IEGMSSETMLLIWSMKEKHIVDSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFGEIM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAKEHLREQYNELFPALCNNHPDVFPEE+YSWE+FLWACELWYSNS+KIMFSDG LRTCL
Sbjct: 296 QAKEHLREQYNELFPALCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSLRTCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHY K +SDTNSLKFRLSRPCRAGEECYLSYGNYS SHLV FYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYSKANSDTNSLKFRLSRPCRAGEECYLSYGNYSASHLVAFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDD SNS TSDWSTHMVRGTWLSKNQSIFHYGLPSPLLEC RK
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDASNSNTSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECLRK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDL 480
           ARCP L T  K QGSLENEMEVLN+LLSIF GMMENLED NEDR STEWDIKLALNYKDL
Sbjct: 476 ARCPELRTKLKLQGSLENEMEVLNDLLSIFDGMMENLEDVNEDRSSTEWDIKLALNYKDL 535

Query: 481 QRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           QR+IVSSCL SCHAG K+VE AL +CMEEDTRG
Sbjct: 536 QRRIVSSCLNSCHAGLKMVELALYECMEEDTRG 557

BLAST of HG10006220 vs. ExPASy TrEMBL
Match: A0A6J1FHS4 (uncharacterized protein LOC111444075 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444075 PE=4 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 2.6e-257
Identity = 437/513 (85.19%), Postives = 460/513 (89.67%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           +C EEVD VLK+LVQIARIIHLNE           PEMYFGE+D CTP D YSPRNE+EA
Sbjct: 56  LCSEEVDTVLKELVQIARIIHLNE-----------PEMYFGEDDACTPADSYSPRNEMEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
            NTIISLV I LSSCKPVQ NVLQELRKA IRMIH+YGDVYSMDAKTL DSC KENCLLQ
Sbjct: 116 LNTIISLVDICLSSCKPVQLNVLQELRKAAIRMIHKYGDVYSMDAKTLGDSCVKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRTSL+IAY+EGAGRG  AKEDL+VGDTVLEIPL I+ISEELVQK+TMYPILSK
Sbjct: 176 WGESNGVRTSLKIAYVEGAGRGAIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           +EGM SETM+L+WSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLL  E+M
Sbjct: 236 IEGMSSETMLLIWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFGEIM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAKEHLREQYNELFP LCNNHPDVFPEE+YSWE+FLWACELWYSNS+KIMFSDG L +CL
Sbjct: 296 QAKEHLREQYNELFPTLCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSLTSCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHY K DSDTNSLKFRLSRPCRAGEECYLSYGNYS SHLV FYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYSKADSDTNSLKFRLSRPCRAGEECYLSYGNYSASHLVAFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDD S+ ITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLEC  K
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDASDIITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECLCK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDL 480
           ARCP L T  K QGSLENEMEVLN+LLSIF GMMENLED NEDR STEWDIKLALNYKDL
Sbjct: 476 ARCPELRTKLKLQGSLENEMEVLNDLLSIFDGMMENLEDVNEDRSSTEWDIKLALNYKDL 535

Query: 481 QRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           QR+IVSSCL SCHAG K VE AL +CMEEDTRG
Sbjct: 536 QRRIVSSCLNSCHAGLKTVELALYECMEEDTRG 557

BLAST of HG10006220 vs. ExPASy TrEMBL
Match: A0A6J1HX18 (uncharacterized protein LOC111468639 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111468639 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 9.8e-257
Identity = 437/513 (85.19%), Postives = 462/513 (90.06%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           +CPEEVD VLK+LVQIARIIHLNE           PEMYF E+D CTP D YSPRNE+EA
Sbjct: 56  LCPEEVDTVLKELVQIARIIHLNE-----------PEMYFEEDDACTPADSYSPRNEMEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
            NTIISLV I LSSCKPVQ NVLQELRKA IRMIH+YG VYSMDAKTL D+C KENCLLQ
Sbjct: 116 LNTIISLVDICLSSCKPVQLNVLQELRKAAIRMIHKYGHVYSMDAKTLGDNCVKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRT L+IAY+EGAGRGT AKEDL+VGDTVLEIPL I+ISEELVQK+TMYPILSK
Sbjct: 176 WGESNGVRTRLKIAYVEGAGRGTIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           +EGM SETM+L+WSMKEKHI DSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLL  E+M
Sbjct: 236 IEGMSSETMLLIWSMKEKHIVDSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFGEIM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAKEHLREQYNELFPALCNNHPDVFPEE+YSWE+FLWACELWYSNS+KIMFSDG LRTCL
Sbjct: 296 QAKEHLREQYNELFPALCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSLRTCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHY K +SDTNSLKFRLSRPCRAGEECYLSYGNYS SHLV FYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYSKANSDTNSLKFRLSRPCRAGEECYLSYGNYSASHLVAFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDD SNS TSDWSTHMVRGTWLSKNQSIFHYGLPSPLLEC RK
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDASNSNTSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECLRK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDL 480
           ARCP L T  K +GSLENEMEVLN+LLSIF GMMENLED NEDR STEWDIKLALNYKDL
Sbjct: 476 ARCPELRT--KLKGSLENEMEVLNDLLSIFDGMMENLEDVNEDRSSTEWDIKLALNYKDL 535

Query: 481 QRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           QR+IVSSCL SCHAG K+VE AL +CMEEDTRG
Sbjct: 536 QRRIVSSCLNSCHAGLKMVELALYECMEEDTRG 555

BLAST of HG10006220 vs. ExPASy TrEMBL
Match: A0A6J1FC73 (N-lysine methyltransferase setd6 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444075 PE=4 SV=1)

HSP 1 Score: 892.1 bits (2304), Expect = 1.1e-255
Identity = 436/513 (84.99%), Postives = 460/513 (89.67%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           +C EEVD VLK+LVQIARIIHLNE           PEMYFGE+D CTP D YSPRNE+EA
Sbjct: 56  LCSEEVDTVLKELVQIARIIHLNE-----------PEMYFGEDDACTPADSYSPRNEMEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
            NTIISLV I LSSCKPVQ NVLQELRKA IRMIH+YGDVYSMDAKTL DSC KENCLLQ
Sbjct: 116 LNTIISLVDICLSSCKPVQLNVLQELRKAAIRMIHKYGDVYSMDAKTLGDSCVKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGESNGVRTSL+IAY+EGAGRG  AKEDL+VGDTVLEIPL I+ISEELVQK+TMYPILSK
Sbjct: 176 WGESNGVRTSLKIAYVEGAGRGAIAKEDLNVGDTVLEIPLDIVISEELVQKTTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           +EGM SETM+L+WSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLL  E+M
Sbjct: 236 IEGMSSETMLLIWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFGEIM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAKEHLREQYNELFP LCNNHPDVFPEE+YSWE+FLWACELWYSNS+KIMFSDG L +CL
Sbjct: 296 QAKEHLREQYNELFPTLCNNHPDVFPEEYYSWEKFLWACELWYSNSMKIMFSDGSLTSCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHY K DSDTNSLKFRLSRPCRAGEECYLSYGNYS SHLV FYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYSKADSDTNSLKFRLSRPCRAGEECYLSYGNYSASHLVAFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNVNDVIPLDIDFGDD S+ ITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLEC  K
Sbjct: 416 LPEGDNVNDVIPLDIDFGDDASDIITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECLCK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLALNYKDL 480
           ARCP L T  K +GSLENEMEVLN+LLSIF GMMENLED NEDR STEWDIKLALNYKDL
Sbjct: 476 ARCPELRT--KLKGSLENEMEVLNDLLSIFDGMMENLEDVNEDRSSTEWDIKLALNYKDL 535

Query: 481 QRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           QR+IVSSCL SCHAG K VE AL +CMEEDTRG
Sbjct: 536 QRRIVSSCLNSCHAGLKTVELALYECMEEDTRG 555

BLAST of HG10006220 vs. ExPASy TrEMBL
Match: A0A1S3BSA6 (uncharacterized protein LOC103492948 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492948 PE=4 SV=1)

HSP 1 Score: 883.2 bits (2281), Expect = 5.0e-253
Identity = 431/517 (83.37%), Postives = 467/517 (90.33%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELLTAIVTLEVQPEMYFGENDECTPVDFYSPRNEVEA 60
           M PEEVD VLK+LVQIARI+ LNEL           EMYFGEN+E TPVDFYSPRNEVEA
Sbjct: 56  MYPEEVDTVLKELVQIARILQLNEL-----------EMYFGENNEGTPVDFYSPRNEVEA 115

Query: 61  FNTIISLVGISLSSCKPVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDDSCAKENCLLQ 120
           F+TIISL+  SLSSC P QF+VLQELRKAVI MIHEYG+V+SMDAKTL++SC KENCLLQ
Sbjct: 116 FDTIISLLDSSLSSCTPAQFSVLQELRKAVIHMIHEYGNVHSMDAKTLENSCEKENCLLQ 175

Query: 121 WGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 180
           WGES+GVRTSL++AY+EGAGRGT AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK
Sbjct: 176 WGESSGVRTSLKVAYVEGAGRGTIAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPILSK 235

Query: 181 VEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLDELM 240
           VEGML ETMMLLWSMKEKHI DSKF+VYFDTLPEAFNTGLSFG+GAM TL GTLL DELM
Sbjct: 236 VEGMLPETMMLLWSMKEKHIVDSKFRVYFDTLPEAFNTGLSFGIGAMATLVGTLLFDELM 295

Query: 241 QAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCL 300
           QAKEHLR+QYNELFPALCN+HP +FP+EFYSWE+FLWACELWYSNSLKIMF DG +RTCL
Sbjct: 296 QAKEHLRKQYNELFPALCNDHPAIFPKEFYSWEEFLWACELWYSNSLKIMFPDGHVRTCL 355

Query: 301 VPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 360
           VPIAGFLNHSL+PHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF
Sbjct: 356 VPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF 415

Query: 361 LPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRK 420
           LPEGDNV+DVIPLDIDF ++D+N++TSDW+THMVRGTWLSKNQ IFHYGLPSP LECFRK
Sbjct: 416 LPEGDNVDDVIPLDIDFSEEDNNNVTSDWNTHMVRGTWLSKNQGIFHYGLPSPFLECFRK 475

Query: 421 ARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENED----RRSTEWDIKLALN 480
           A  PGLHTN K QGSLE E++VL+ELL IFSGMME  EDE+ED    R STEWD+KLAL 
Sbjct: 476 ALFPGLHTNCKLQGSLEGEIDVLSELLLIFSGMMEKFEDEDEDEDESRTSTEWDVKLALE 535

Query: 481 YKDLQRKIVSSCLTSCHAGRKIVEFALCDCMEEDTRG 514
           YKDLQRKIVSSCLTSCHAG K VEFALCDCM+ED RG
Sbjct: 536 YKDLQRKIVSSCLTSCHAGLKTVEFALCDCMKEDIRG 561

BLAST of HG10006220 vs. TAIR 10
Match: AT2G18850.1 (SET domain-containing protein )

HSP 1 Score: 531.9 bits (1369), Expect = 5.5e-151
Identity = 280/505 (55.45%), Postives = 355/505 (70.30%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELL--TAIVTLEVQPEMYFGENDECTPVDFYSPRNEV 60
           +C +E  N+   L Q      L +LL    IV L+ + E+YFGE D CTP   YS RNE+
Sbjct: 37  LCVKETLNLSGSLSQQLLNAALEKLLHFGRIVNLD-KVEVYFGE-DACTPAGIYSVRNEI 96

Query: 61  EAFNTIISLVGISLSSCK-PVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDD-SCAKEN 120
            A + I+SL+ +   SCK   Q +  + LR A+   I+E        A+ +D   C KE+
Sbjct: 97  SALSWILSLIPV---SCKMQTQVDTFEALRAALKGRINEVVGAEKEKARVVDSYRCEKES 156

Query: 121 CLLQWGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYP 180
            L++WG+ NGV+T L+IA I+G GRG  A EDL  GD  LEIP++ IISEE V  S MYP
Sbjct: 157 KLVEWGQDNGVKTKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEEYVYNSDMYP 216

Query: 181 ILSKVEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLL 240
           IL   +G+ SETM+LLW+M+EKH  DSKFK YFD+L E F TGLSFGV A+M LDGTLLL
Sbjct: 217 ILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLL 276

Query: 241 DELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKL 300
           DE+MQAKE LRE+Y+EL P L +NH +VFP E Y+WE +LWACEL+YSNS++I F DGKL
Sbjct: 277 DEIMQAKELLRERYDELIP-LLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKL 336

Query: 301 RTCLVPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVT 360
           +TCL+P+AGFLNHS+YPHI+ YGKVD +T+SLKF +SRPC  GE+C+LSYGNYS SHL+T
Sbjct: 337 KTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLT 396

Query: 361 FYGFLPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLE 420
           FYGFLP+GDN  DVIPLD D  DD+       W+THM+RGTWLS N +IFHYGLP+PLL 
Sbjct: 397 FYGFLPKGDNPYDVIPLDFDVIDDEDIETEFSWTTHMLRGTWLSSNHNIFHYGLPTPLLN 456

Query: 421 CFRKARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLED-ENEDRRSTEWDIKLAL 480
             RKA     H+      +LE E+ VL  L S F  MM+NL D ++ DR + +WD+KLA+
Sbjct: 457 YLRKAHGLVHHSETDLWKNLEVEIGVLENLQSTFDDMMQNLGDADSIDRENADWDVKLAM 516

Query: 481 NYKDLQRKIVSSCLTSCHAGRKIVE 501
            +K+ QRKIVSS L SC AG K+V+
Sbjct: 517 EFKERQRKIVSSILDSCSAGIKLVQ 535

BLAST of HG10006220 vs. TAIR 10
Match: AT2G18850.2 (SET domain-containing protein )

HSP 1 Score: 527.3 bits (1357), Expect = 1.4e-149
Identity = 281/505 (55.64%), Postives = 355/505 (70.30%), Query Frame = 0

Query: 1   MCPEEVDNVLKDLVQIARIIHLNELL--TAIVTLEVQPEMYFGENDECTPVDFYSPRNEV 60
           +C +E  N+   L Q      L +LL    IV L+ + E+YFGE D CTP   YS RNE+
Sbjct: 37  LCVKETLNLSGSLSQQLLNAALEKLLHFGRIVNLD-KVEVYFGE-DACTPAGIYSVRNEI 96

Query: 61  EAFNTIISLVGISLSSCK-PVQFNVLQELRKAVIRMIHEYGDVYSMDAKTLDD-SCAKEN 120
            A + I+SL+ +   SCK   Q +  + LR A+   I+E        A+ +D   C KE+
Sbjct: 97  SALSWILSLIPV---SCKMQTQVDTFEALRAALKGRINEVVGAEKEKARVVDSYRCEKES 156

Query: 121 CLLQWGESNGVRTSLEIAYIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYP 180
            L++WG+ NGV+T L+IA I+G GRG  A EDL  GD  LEIP++ IISEE V  S MYP
Sbjct: 157 KLVEWGQDNGVKTKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEEYVYNSDMYP 216

Query: 181 ILSKVEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLL 240
           IL   +G+ SETM+LLW+M+EKH  DSKFK YFD+L E F TGLSFGV A+M LDGTLLL
Sbjct: 217 ILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLL 276

Query: 241 DELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKL 300
           DE+MQAKE LRE+Y+EL P L +NH +VFP E Y+WE +LWACEL+YSNS++I F DGKL
Sbjct: 277 DEIMQAKELLRERYDELIP-LLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKL 336

Query: 301 RTCLVPIAGFLNHSLYPHILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVT 360
           +TCL+P+AGFLNHS+YPHI+ YGKVD +T+SLKF +SRPC  GE+C+LSYGNYS SHL+T
Sbjct: 337 KTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLT 396

Query: 361 FYGFLPEGDNVNDVIPLDIDFGDDDSNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLE 420
           FYGFLP+GDN  DVIPLD D  DD+       W+THM+RGTWLS N +IFHYGLP+PLL 
Sbjct: 397 FYGFLPKGDNPYDVIPLDFDVIDDEDIETEFSWTTHMLRGTWLSSNHNIFHYGLPTPLLN 456

Query: 421 CFRKARCPGLHTNHKRQGSLENEMEVLNELLSIFSGMMENLED-ENEDRRSTEWDIKLAL 480
             RKA   GL        +LE E+ VL  L S F  MM+NL D ++ DR + +WD+KLA+
Sbjct: 457 YLRKAH--GLLWK-----NLEVEIGVLENLQSTFDDMMQNLGDADSIDRENADWDVKLAM 516

Query: 481 NYKDLQRKIVSSCLTSCHAGRKIVE 501
            +K+ QRKIVSS L SC AG K+V+
Sbjct: 517 EFKERQRKIVSSILDSCSAGIKLVQ 528

BLAST of HG10006220 vs. TAIR 10
Match: AT1G14030.1 (Rubisco methyltransferase family protein )

HSP 1 Score: 78.6 bits (192), Expect = 1.7e-14
Identity = 66/271 (24.35%), Postives = 125/271 (46.13%), Query Frame = 0

Query: 120 QWGESNGVRTSLEIA--YIEGAGRGTKAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPI 179
           +W    GV +   +A   +   G G  A+ D+   + VLEIP  + I+ E V  S + P+
Sbjct: 54  KWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRLWINPETVTASKIGPL 113

Query: 180 LSKVEGMLSETMMLLWSMKEKHIADSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLLD 239
              ++  +S  + L+   +EK+  +S ++VY D LP++ ++ + +    +  L GT LL 
Sbjct: 114 CGGLKPWVSVALFLI---REKYEEESSWRVYLDMLPQSTDSTVFWSEEELAELKGTQLLS 173

Query: 240 ELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLR 299
             +  KE++  ++ +L   +   + D+F     + + F+WA  +  S +   +     + 
Sbjct: 174 TTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGILKSRAFSRLRGQNLV- 233

Query: 300 TCLVPIAGFLNHS----LYPHILHYGKVDSDTNSLKFRLSRP--CRAGEECYLSYG-NYS 359
             L+P+A  +NH+       +          +  L F L  P   +AGE+ Y+ Y  N S
Sbjct: 234 --LIPLADLINHNPAIKTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKS 293

Query: 360 GSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 382
            + L   YGF+ E +   +   L I+  + D
Sbjct: 294 NAELALDYGFV-ESNPKRNSYTLTIEIPESD 316

BLAST of HG10006220 vs. TAIR 10
Match: AT3G55080.1 (SET domain-containing protein )

HSP 1 Score: 54.3 bits (129), Expect = 3.4e-07
Identity = 86/374 (22.99%), Postives = 161/374 (43.05%), Query Frame = 0

Query: 140 GRGTKAKEDLDVGDTVLEIPL-AIIISEELVQKSTMYPILSKVEGMLSETMMLLWSMKEK 199
           GR   A + +  GD +L++P  A I  +EL   S +  +LS   G +   M+    ++EK
Sbjct: 70  GRSLFASKVIYAGDCMLKVPFNAQITPDEL--PSDIRVLLSNEVGNIG--MLAAVLIREK 129

Query: 200 HIAD-SKFKVYFDTLPE--AFNTGLSFGVGAMMTLDGTLLLDELMQAKEHLREQYNELFP 259
            +   S++  Y   LP+    ++ + +G   +  +  + +  E ++ K  + + ++ +  
Sbjct: 130 KMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFSFVAQ 189

Query: 260 ALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCLVPIAGFLNH-SLYPH 319
           A   + P V   E    E F++A  L  S +      +   R  L+P A F+NH  L   
Sbjct: 190 AFKQHCPIV--TERPDLEDFMYAYALVGSRAW-----ENSKRISLIPFADFMNHDGLSAS 249

Query: 320 ILHYGKVDSDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF-LPEGDNVNDVIPL 379
           I+     D D    +    R    G+E ++ YG +S + L+  +GF  P   N++D + +
Sbjct: 250 IV---LRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPY--NIHDEVQI 309

Query: 380 DIDFGDDD--SNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRKARCPGLHTNHK 439
            +D  +DD   N       TH  R     K+ +IFH    +  ++  + A   G      
Sbjct: 310 QMDVPNDDPLRNMKLGLLQTHHTRTV---KDINIFHSSCDTFTIKEVKSAIGKG------ 369

Query: 440 RQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLA-LNYKDLQRKIVSSCLT 499
            +G  ++       L  I    + +L  E     + + D +LA L +KD  R++ +  + 
Sbjct: 370 -KGIPQSLRAFARVLCCIIPQELNDLSKE-----AAQNDGRLARLPFKDGNRELEAHKIL 412

Query: 500 SCHAGRKIVEFALC 505
             H  R I + ++C
Sbjct: 430 LSHINRLIEDHSVC 412

BLAST of HG10006220 vs. TAIR 10
Match: AT3G55080.2 (SET domain-containing protein )

HSP 1 Score: 49.7 bits (117), Expect = 8.3e-06
Identity = 69/320 (21.56%), Postives = 137/320 (42.81%), Query Frame = 0

Query: 195 MKEKHIAD-SKFKVYFDTLPE--AFNTGLSFGVGAMMTLDGTLLLDELMQAKEHLREQYN 254
           ++EK +   S++  Y   LP+    ++ + +G   +  +  + +  E ++ K  + + ++
Sbjct: 7   IREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS 66

Query: 255 ELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFSDGKLRTCLVPIAGFLNHSL 314
            +  A   + P V   E    E F++A  L  S +      +   R  L+P A F+NH  
Sbjct: 67  FVAQAFKQHCPIV--TERPDLEDFMYAYALVGSRAW-----ENSKRISLIPFADFMNHDG 126

Query: 315 YPHILHYGKVD---SDTNSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGF-LPEGDNV 374
               +     D   S+ ++L+    R    G+E ++ YG +S + L+  +GF  P   N+
Sbjct: 127 LSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPY--NI 186

Query: 375 NDVIPLDIDFGDDD--SNSITSDWSTHMVRGTWLSKNQSIFHYGLPSPLLECFRKARCPG 434
           +D + + +D  +DD   N       TH  R     K+ +IFH    +  ++  + A   G
Sbjct: 187 HDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTV---KDINIFHSSCDTFTIKEVKSAIGKG 246

Query: 435 LHTNHKRQGSLENEMEVLNELLSIFSGMMENLEDENEDRRSTEWDIKLA-LNYKDLQRKI 494
                  +G  ++       L  I    + +L  E     + + D +LA L +KD  R++
Sbjct: 247 -------KGIPQSLRAFARVLCCIIPQELNDLSKE-----AAQNDGRLARLPFKDGNREL 302

Query: 495 VSSCLTSCHAGRKIVEFALC 505
            +  +   H  R I + ++C
Sbjct: 307 EAHKILLSHINRLIEDHSVC 302

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038889411.13.9e-26889.86uncharacterized protein LOC120079325 isoform X1 [Benincasa hispida][more]
XP_038889412.12.2e-26689.67uncharacterized protein LOC120079325 isoform X2 [Benincasa hispida][more]
XP_038889413.11.6e-26191.21protein SET DOMAIN GROUP 40 isoform X3 [Benincasa hispida][more]
XP_022969685.14.8e-25885.38uncharacterized protein LOC111468639 isoform X1 [Cucurbita maxima] >XP_022969686... [more]
XP_004150779.12.4e-25785.24uncharacterized protein LOC101212907 isoform X1 [Cucumis sativus] >KAE8651665.1 ... [more]
Match NameE-valueIdentityDescription
P940261.4e-1325.10Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferas... [more]
Q9XI842.3e-1324.35[Fructose-bisphosphate aldolase]-lysine N-methyltransferase, chloroplastic OS=Ar... [more]
Q089613.4e-0422.52Ribosomal lysine N-methyltransferase 1 OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
A0A6J1I0M22.3e-25885.38uncharacterized protein LOC111468639 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1FHS42.6e-25785.19uncharacterized protein LOC111444075 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1HX189.8e-25785.19uncharacterized protein LOC111468639 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1FC731.1e-25584.99N-lysine methyltransferase setd6 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC... [more]
A0A1S3BSA65.0e-25383.37uncharacterized protein LOC103492948 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
AT2G18850.15.5e-15155.45SET domain-containing protein [more]
AT2G18850.21.4e-14955.64SET domain-containing protein [more]
AT1G14030.11.7e-1424.35Rubisco methyltransferase family protein [more]
AT3G55080.13.4e-0722.99SET domain-containing protein [more]
AT3G55080.28.3e-0621.56SET domain-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 440..460
NoneNo IPR availableGENE3D3.90.1410.10set domain protein methyltransferase, domain 1coord: 65..360
e-value: 3.4E-51
score: 176.3
NoneNo IPR availablePANTHERPTHR13271:SF103BNAA07G01600D PROTEINcoord: 1..509
NoneNo IPR availablePANTHERPTHR13271UNCHARACTERIZED PUTATIVE METHYLTRANSFERASEcoord: 1..509
NoneNo IPR availableCDDcd10527SET_LSMTcoord: 130..361
e-value: 1.87876E-52
score: 176.101
NoneNo IPR availableSUPERFAMILY82199SET domaincoord: 117..366
IPR001214SET domainPFAMPF00856SETcoord: 140..347
e-value: 3.7E-10
score: 40.5
IPR001214SET domainPROSITEPS50280SETcoord: 129..347
score: 10.777729

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10006220.1HG10006220.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032259 methylation
molecular_function GO:0008168 methyltransferase activity
molecular_function GO:0005515 protein binding