Cla97C05G093460 (gene) Watermelon (97103) v2

Name: Cla97C05G093460
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Protein SET DOMAIN GROUP, putative
Location: Cla97Chr05 : 12676138 .. 12680789 (-)
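
As a quick sanity check on the Location field, the genomic span covered by the gene can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for annotation coordinates, but not stated on this page); the variable names are illustrative only.

```python
# Coordinates copied from the Location field of this record.
start, end = 12676138, 12680789

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span_bp = end - start + 1
print(span_bp)
```

Under that assumption, the result should match the length of the gene sequence (with intron) listed for this feature.
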
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAACTGAAGGTAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGACCAACAGACTTCACATTCTTGTTTGGGCCATTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGTATGCTCTGATTTCTCCTTTTCTAGTTACACCACTCTTTTCTTTCTTTCGCCGCTTGTAAGTTGTTCATTTTGAGGGAACTTTTGCTTCAGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGTTGAGAGTTCCAAAATCTGTCTTGTTGACGACCCAAGGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTTTGAACGAATACCCATCTCTTTCTTCTACTCAGGTTCTTCCCTTGCTAACATAATTAGCGCGAACTTTTGTGAGAAAAAGATTCTTCCTTTTGTTGCTTTGGTTTTTTTTTTCTTTCTTTTTTTTGTATGTCTTGTAAGTTGAACTTTAAGGGGGAGGGAAAAAAAACTCTGCATTGGATTGAATTCGAGTTATTTAACACCAACTTCATAAGTTGGTGGGCGAAACAAATTTGGATACACGAGGCATCTTCATGTGCCTAATGAGGTAGGTTTTGAATAGTGATTACAGTCCAAAGAGTGATTACATCTCAAAGTAGGAGAGACCTAGAAAGAAACTTTAAGAATTACGAGCATGTCTGTCACTTGAATGAAAAAGAGAAGGGATCTTTATGCATGAATGACCATAGGCATGCTTAAAGATGTTTTTCTATTCCATTCTATGTTAATGTTTAATTGTCCTTGATGCAACTGCGACAAAGATCTTTGAGACTTAACTGATTTAAACTCTCCTTGATATTGAAGGTTGATTATCCAAAACATTTCTTGTGTGTTTATCTAAAACATTACCAACTTTCCCTTGTTAGTTCTAAAGATGTACTAATATTAATTTTACAATCCCTCCCTCCCACTCAAATGCAGAAGTTGACCTTTTGTTTACTCTATGAGATTGGTAAAGGAACTAGTTCTTGGTGGTTCCCTTACTTAAAGCATTTGCCCCAGAGTTACGACATACTAGCAACTTTTGGAGATTTTGAAAAGCAAGCCCTGCAGGTTCAATTTTCTGTTTGTGGCTTTTTTATTTATTTCTTAGGATTATATTCGTATTCGTGAACCATATTCAATGAGGACTTTTTACATGTGTTTTTGACATTTGTTTTCTAATAAAATAGTTGAAGTCGTTAACAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCATTGAAATCTCGTATGGAGTGGGGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGTAAGCACACTGGATTTATGCTAGAAGAATGTATTGCAACTGTAAGCACACTGGATTTATGGTTAGACCTGGATTGATATTATTTATTTATTATTCTTTTTGTGAGAAACCAAACTTGGATTGAGAATTGAGAAAAAAATGAGAGAAAGAGGGCATATCTAAAACAACCTACAAAAAAAAGGAGGATCCCCAACTAACTACGAAAAGGGCTCCAATCTAACAAATGCCAAATTCATAATTACAAAATAGTATTAACTGACATTTTCTTATTACAAAAACCTGCTTTTAAGTTTCTACTTTTTTTAATAATTAATTAATTTATTTGTTTGGTGAAAAATTTATGCATCATTTGAAGGTGGTTTAGCTGGATGTCCAAGGTCACTGAAAATATGCTTTTCTTATAGAAGAAAAAACCACCAAGTGACACCCATTTGATGATTATCCTTCAATTACCCATTACCTTGTAGTCCTGTACACATTACCCATTACCAGCCAGTATGCATCCTTAAGACTCCTATTCTATCTAATATACTCTCTCTCCCATCGCCACCTTGCTCTGTGCCTAAAGTCCTGACCCCCTCGCTTAAGGTCACCCTTAACTTAGG
TATTCTTTGTCTCCTTGAAATCTGTGGACTGTAACTACCGTCATAGAACTTTGGTGGTATATAAAGGGACCTATATTATTGAAGTTCATGGTGGATGTCTATATATGAAGCAAACCAAAGCTCTCTAAGATTGGAAAGGATAACTTTTTCTTCATCTCTCGTCAATGAAACGCTCTCGTGGATTCAAAATTGCTTTGTAGGCCTGCTCCTCTTCCCTCTTAATGGTTTTTCCCCCAAGAAGTCTATAGTGAAGATTATGTGTTTCCATTGAACAAATCTCCTATAGGAAAGAAACTACTGTTGAGATCATTAAGCTGAACACACATTTTATTCCATGTAGGTGAGAACTGCTCAGGTCAGTCCTCCTCCTCATGAAAGATTACCCTTGTGCCACCCTGCCTCCAATAGTTCACTGTTTTTCTTCTGACTTGTGGAGTGGAAGTTGCTTGCACAAAGGCTCTTGGATCAGCCCAAACGTGGATAGCAGTCCTCCTCATGAAAGATTACCCTTGTGCCACCCGACTCAAATAGTTCACTGTTTTTCTTCTGACTTGTGAAGTTGAAGTTGCTTGCCACAAAGGCTCTTGGATCAGCCCAAACGAGGATACTAGGATGTTGGTTCAAGTACGATTGAATTTAGTACTTACGTCCCTTCAAATTTCTCATAGCTTTTTGTTGCCATAGAATAGAATTCATCCTCTTTTTTATGAATGCAAATTGTCGAGACTCAAGAGCTTGTTTTGTAATTTTCTCTTAGCCCTTTTTTTAGAGGATATATTATTGTCCTCACTTGTATCTATTTTAACGTTAATACAATTATTTGTTTCTTATGGAAAGACACGTGTTTCTTTATTTGTCTAATGCATTGTATGATGACTCTAAATGTGAGCATGGTTTGGTCCGATGTCAATCTTAATATCAGTTCGTTAACTTCTTTCTACGTTTAGTAGTTAAAAACTAAAAATATATATTCCCCTTTGAGAGATATCATCTAGGACATTGCATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGCCCCTTGACGTTATGGATGTTTCGTCTTTTTCACCACATGTTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCCATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCTCGGGAGAGTTATAAGAAGGGAGAGCAGGTATTTTGGTGTCATTTAATATTTTGTACTCGTATGAAAAGTGGAGTTGATGTAAAATACATTAATTGGAAGTTTAACATTTCAATTATTGAGTTTGATTTCCTATTAGGAAGAAGGTTGAAATTTACATTAGCCTTTTTTATTTGCTCTCTGATGCATTCTTGTCAATCTGCACCTTGAGACTTCCATAATAAAACTAGAGATTGTTTCTTTATGACTTAGACTGTTATTACACTGATGTTTAACATTTTCTTTCTTCATCTTTTTATTTACTATTACCACTGTTATTTTTTGCCCCAAATGTATTTTCTCATTATAAGTTCCTAGGTTAGGTTCTGTTTGTGTCTTATACATCTTTCCAATTCTTTGTGTTACTAACAGTTCCATCAAACGTCACTCTATACCTCACCTTTATTTTGTGTGTATATATATATATATATATTAAAGCAACACCTGAAAAGGATCTAATTTGATGTTTTGGTGGGTAGCAATATTTTCAAAATGCAAGGCGCCCCCACAGTGACTCGTCTTTTGATTGCTTGTATGTGAGTGACCAAGAAAATTAACCTCAGTGAAGCAAAACCATACTTTATAAATTAATAAATAAATGCAAAATATTTAAGACGCGAATATCAATAAGTCCATTTTTAATTTCAAAAGAAAAGAATGGTCTGTCTGTCTCTCAAACATAATTGCTTTGCACAAGTCTCTCTCTCAAAAAAGAATATGAATGAGTAACA
ATTGAAATCTTTCTATTGTTTGTTGCAGGTTCTTTTAAGCTATGGTACATACTCAAACTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGAAAAAGTTTTCATTCCTATAGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAATCTCTTTACATTCATCAAAATGGAAACCCATCTTTTGCTCTCCTTTCTGCCCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCTGGGTCACAACTCTCGATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACGATCTGCCTACATCAGTTGAAGAAGACAATCAGCTTCTATGCAACATCTCCAAAGTCCAAGATCTGCAGGTAACAAGGGAGCTCCGGAAGGTGCTGTTGACTTACGGAGGTGAGTTTTGTGCCTTCTTGGAGACCAATGGTCTGGTGAATAGTGATGACACCGAGGTACATATATCCCAGAAAATAAAACGCTCTCTGGAGAGATGGAAACTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCGTTGGTTGATTGCATAAGTTACTGCACCAGAACTATTAGTTCTCTATCTTCTTAA

mRNA sequence

ATGGAAACTGAAGGTAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGACCAACAGACTTCACATTCTTGTTTGGGCCATTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGTTGAGAGTTCCAAAATCTGTCTTGTTGACGACCCAAGGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTTTGAACGAATACCCATCTCTTTCTTCTACTCAGAAGTTGACCTTTTGTTTACTCTATGAGATTGGTAAAGGAACTAGTTCTTGGTGGTTCCCTTACTTAAAGCATTTGCCCCAGAGTTACGACATACTAGCAACTTTTGGAGATTTTGAAAAGCAAGCCCTGCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCATTGAAATCTCGTATGGAGTGGGGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGACATTGCATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGCCCCTTGACGTTATGGATGTTTCGTCTTTTTCACCACATGTTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCCATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCTCGGGAGAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAACTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGAAAAAGTTTTCATTCCTATAGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAATCTCTTTACATTCATCAAAATGGAAACCCATCTTTTGCTCTCCTTTCTGCCCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCTGGGTCACAACTCTCGATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACGATCTGCCTACATCAGTTGAAGAAGACAATCAGCTTCTATGCAACATCTCCAAAGTCCAAGATCTGCAGGTAACAAGGGAGCTCCGGAAGGTGCTGTTGACTTACGGAGGTGAGTTTTGTGCCTTCTTGGAGACCAATGGTCTGGTGAATAGTGATGACACCGAGGTACATATATCCCAGAAAATAAAACGCTCTCTGGAGAGATGGAAACTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCGTTGGTTGATTGCATAAGTTACTGCACCAGAACTATTAGTTCTCTATCTTCTTAA

Coding sequence (CDS)

ATGGAAACTGAAGGTAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGACCAACAGACTTCACATTCTTGTTTGGGCCATTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGTTGAGAGTTCCAAAATCTGTCTTGTTGACGACCCAAGGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTTTGAACGAATACCCATCTCTTTCTTCTACTCAGAAGTTGACCTTTTGTTTACTCTATGAGATTGGTAAAGGAACTAGTTCTTGGTGGTTCCCTTACTTAAAGCATTTGCCCCAGAGTTACGACATACTAGCAACTTTTGGAGATTTTGAAAAGCAAGCCCTGCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCATTGAAATCTCGTATGGAGTGGGGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGACATTGCATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGCCCCTTGACGTTATGGATGTTTCGTCTTTTTCACCACATGTTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCCATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCTCGGGAGAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAACTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGAAAAAGTTTTCATTCCTATAGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAATCTCTTTACATTCATCAAAATGGAAACCCATCTTTTGCTCTCCTTTCTGCCCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCTGGGTCACAACTCTCGATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACGATCTGCCTACATCAGTTGAAGAAGACAATCAGCTTCTATGCAACATCTCCAAAGTCCAAGATCTGCAGGTAACAAGGGAGCTCCGGAAGGTGCTGTTGACTTACGGAGGTGAGTTTTGTGCCTTCTTGGAGACCAATGGTCTGGTGAATAGTGATGACACCGAGGTACATATATCCCAGAAAATAAAACGCTCTCTGGAGAGATGGAAACTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCGTTGGTTGATTGCATAAGTTACTGCACCAGAACTATTAGTTCTCTATCTTCTTAA

Protein sequence

METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
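
The protein sequence corresponds to a frame-0 translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard genetic code; the `translate` helper and the table layout are illustrative and not part of the database. It is applied here only to the first 18 nucleotides of the CDS.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 nucleotides of the CDS listed in this record.
cds_prefix = "ATGGAAACTGAAGGTAGT"
print(translate(cds_prefix))  # METEGS
```

Passing the full CDS string to the same function should reproduce the protein sequence, since the CDS ends in the stop codon TAA.
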
BLAST of Cla97C05G093460 vs. NCBI nr
Match: XP_004145844.1 (PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus])

HSP 1 Score: 875.5 bits (2261), Expect = 8.1e-251
Identity = 423/486 (87.04%), Positives = 447/486 (91.98%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGS GSLLRWAADHGISDSVDQ TSHSCLGHSLCV FFPD GGRGL AVRQL KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKH 120
           VLR PKS+LLTTQ LSLEDEKL MAL  YPSLSSTQKLTFCLLYEI KG SSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKA 180
           LPQSYDILATFG+FEKQALQVDYAIWATEKAALKSR +W GV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTT 240
           WLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + +DV SF  H SLN ++  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDEL-- 240

Query: 241 DELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
            EL EE+RD+ WALTDGGFEEN SAYCFYARESY+KGEQVLLSYGTY+NLELLEYYGFLL
Sbjct: 241 -ELLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLL 300

Query: 301 QENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLL 420
           YAGSQLS+KNEILVMQWLSKNCHTVLN+LPTS+EEDNQLLCNI+KVQDLQV REL+K LL
Sbjct: 361 YAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 420

Query: 421 TYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 480
           TYGGEFCAFLETNG+VN D+ E H SQK+KRSL+RWKLAVQWRLLYKKALVDCI YCT T
Sbjct: 421 TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTT 480

Query: 481 ISSLSS 487
           I SLSS
Sbjct: 481 ICSLSS 483
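
The percentages in each HSP summary line are the match counts divided by the alignment length. A small sketch of that arithmetic for the XP_004145844.1 hit above, using the counts reported in its summary line (the `percent` helper is illustrative):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts from the XP_004145844.1 HSP: 423 identical and 447 positive
# (identical or conservatively substituted) positions over 486 aligned columns.
identity = percent(423, 486)
positives = percent(447, 486)
print(identity, positives)
```

The alignment length (486) counts aligned columns including gaps, which is why it can differ from the query or subject length.
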

BLAST of Cla97C05G093460 vs. NCBI nr
Match: XP_008457030.1 (PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo])

HSP 1 Score: 866.7 bits (2238), Expect = 3.8e-248
Identity = 421/486 (86.63%), Positives = 447/486 (91.98%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GGRGL AVRQLNKGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKH 120
           +LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSSTQKLTFCLL EI KG SS WFPYLKH
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 LPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKA 180
           LPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTT 240
           WLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN ++  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDEL-- 240

Query: 241 DELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
            E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLL
Sbjct: 241 -ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLL 300

Query: 301 QENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLL 420
           YAGSQLS+KNE LVMQWLSKNCHTVLN+LPTS+EED+QLLCNI+KVQDLQV RELRK+LL
Sbjct: 361 YAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLL 420

Query: 421 TYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 480
           TYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI YCTRT
Sbjct: 421 TYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRT 480

Query: 481 ISSLSS 487
           I SLSS
Sbjct: 481 ICSLSS 483

BLAST of Cla97C05G093460 vs. NCBI nr
Match: XP_022983189.1 (protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima])

HSP 1 Score: 864.8 bits (2233), Expect = 1.4e-247
Identity = 413/486 (84.98%), Positives = 449/486 (92.39%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           M TEGSF SLLRWAADHGISDSVD+Q+SHSCLG SLCVCFFPDAGGRGLGAVR L KGEL
Sbjct: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKH 120
           VL+VPKSVLLTTQ LSL+DEKL+MAL  YPSLSSTQKLTFCLLYEIGKG+SSWWFPY KH
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 LPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKA 180
           LP +Y+ LATFG+FEKQALQVDYA+W  EKAA KS  EW GVKGLM+ESNIKNQLQTFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTT 240
           WLWASATISSR L+VPWDEAGCLCPVGDLFNYAAPEGE LD+MDVSSFS H SLNG++TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
           D LH+E++DT  ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTYSNLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300

Query: 301 QENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPN++VFIP+EH+IYSSSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLL 420
           YAGSQLS+KNE+LVMQWLSKNCH VLN+LPTSVEEDNQLLCNI K+QDLQ   EL K+LL
Sbjct: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420

Query: 421 TYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 480
           T GGEFCAFLET GLVN ++TE+H++ KIKRSLERWKLAVQWR+LYKKALVDC SYCTRT
Sbjct: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRT 480

Query: 481 ISSLSS 487
             SLSS
Sbjct: 481 TCSLSS 486

BLAST of Cla97C05G093460 vs. NCBI nr
Match: XP_008457029.1 (PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo])

HSP 1 Score: 860.5 bits (2222), Expect = 2.7e-246
Identity = 421/491 (85.74%), Positives = 447/491 (91.04%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGG-----RGLGAVRQL 60
           METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GG     RGL AVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  NKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWF 120
           NKGEL+LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSSTQKLTFCLL EI KG SS WF
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWF 120

Query: 121 PYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQL 180
           PYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQL
Sbjct: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLN 240
           QTFKAWLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 GDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEY 300
            ++   E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEY
Sbjct: 241 DEL---ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEY 300

Query: 301 YGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG 360
           YGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG
Sbjct: 301 YGFLLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG 360

Query: 361 VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTREL 420
           VGHLAYAGSQLS+KNE LVMQWLSKNCHTVLN+LPTS+EED+QLLCNI+KVQDLQV REL
Sbjct: 361 VGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQREL 420

Query: 421 RKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCIS 480
           RK+LLTYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI 
Sbjct: 421 RKMLLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIG 480

Query: 481 YCTRTISSLSS 487
           YCTRTI SLSS
Sbjct: 481 YCTRTICSLSS 488

BLAST of Cla97C05G093460 vs. NCBI nr
Match: KGN57798.1 (hypothetical protein Csa_3G307670 [Cucumis sativus])

HSP 1 Score: 859.8 bits (2220), Expect = 4.6e-246
Identity = 415/481 (86.28%), Positives = 440/481 (91.48%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGS GSLLRWAADHGISDSVDQ TSHSCLGHSLCV FFPD GGRGL AVRQL KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKH 120
           VLR PKS+LLTTQ LSLEDEKL MAL  YPSLSSTQKLTFCLLYEI KG SSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKA 180
           LPQSYDILATFG+FEKQALQVDYAIWATEKAALKSR +W GV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTT 240
           WLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + +DV SF  H SLN ++  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDEL-- 240

Query: 241 DELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
            EL EE+RD+ WALTDGGFEEN SAYCFYARESY+KGEQVLLSYGTY+NLELLEYYGFLL
Sbjct: 241 -ELLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLL 300

Query: 301 QENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLL 420
           YAGSQLS+KNEILVMQWLSKNCHTVLN+LPTS+EEDNQLLCNI+KVQDLQV REL+K LL
Sbjct: 361 YAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 420

Query: 421 TYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 480
           TYGGEFCAFLETNG+VN D+ E H SQK+KRSL+RWKLAVQWRLLYKKALVDCI    R 
Sbjct: 421 TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGKKIRM 478

Query: 481 I 482
           +
Sbjct: 481 V 478

BLAST of Cla97C05G093460 vs. TrEMBL
Match: tr|A0A1S3C4J5|A0A1S3C4J5_CUCME (protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 866.7 bits (2238), Expect = 2.5e-248
Identity = 421/486 (86.63%), Positives = 447/486 (91.98%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GGRGL AVRQLNKGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKH 120
           +LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSSTQKLTFCLL EI KG SS WFPYLKH
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 LPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKA 180
           LPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTT 240
           WLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN ++  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDEL-- 240

Query: 241 DELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
            E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLL
Sbjct: 241 -ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLL 300

Query: 301 QENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLL 420
           YAGSQLS+KNE LVMQWLSKNCHTVLN+LPTS+EED+QLLCNI+KVQDLQV RELRK+LL
Sbjct: 361 YAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLL 420

Query: 421 TYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 480
           TYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI YCTRT
Sbjct: 421 TYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRT 480

Query: 481 ISSLSS 487
           I SLSS
Sbjct: 481 ICSLSS 483

BLAST of Cla97C05G093460 vs. TrEMBL
Match: tr|A0A1S3C4N2|A0A1S3C4N2_CUCME (protein SET DOMAIN GROUP 40 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 860.5 bits (2222), Expect = 1.8e-246
Identity = 421/491 (85.74%), Positives = 447/491 (91.04%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGG-----RGLGAVRQL 60
           METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GG     RGL AVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  NKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWF 120
           NKGEL+LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSSTQKLTFCLL EI KG SS WF
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWF 120

Query: 121 PYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQL 180
           PYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQL
Sbjct: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLN 240
           QTFKAWLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 GDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEY 300
            ++   E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEY
Sbjct: 241 DEL---ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEY 300

Query: 301 YGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG 360
           YGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG
Sbjct: 301 YGFLLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG 360

Query: 361 VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTREL 420
           VGHLAYAGSQLS+KNE LVMQWLSKNCHTVLN+LPTS+EED+QLLCNI+KVQDLQV REL
Sbjct: 361 VGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQREL 420

Query: 421 RKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCIS 480
           RK+LLTYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI 
Sbjct: 421 RKMLLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIG 480

Query: 481 YCTRTISSLSS 487
           YCTRTI SLSS
Sbjct: 481 YCTRTICSLSS 488

BLAST of Cla97C05G093460 vs. TrEMBL
Match: tr|A0A0A0L7L4|A0A0A0L7L4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G307670 PE=4 SV=1)

HSP 1 Score: 859.8 bits (2220), Expect = 3.0e-246
Identity = 415/481 (86.28%), Positives = 440/481 (91.48%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGS GSLLRWAADHGISDSVDQ TSHSCLGHSLCV FFPD GGRGL AVRQL KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKH 120
           VLR PKS+LLTTQ LSLEDEKL MAL  YPSLSSTQKLTFCLLYEI KG SSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKA 180
           LPQSYDILATFG+FEKQALQVDYAIWATEKAALKSR +W GV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTT 240
           WLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + +DV SF  H SLN ++  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDEL-- 240

Query: 241 DELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300
            EL EE+RD+ WALTDGGFEEN SAYCFYARESY+KGEQVLLSYGTY+NLELLEYYGFLL
Sbjct: 241 -ELLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLL 300

Query: 301 QENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLL 420
           YAGSQLS+KNEILVMQWLSKNCHTVLN+LPTS+EEDNQLLCNI+KVQDLQV REL+K LL
Sbjct: 361 YAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 420

Query: 421 TYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRT 480
           TYGGEFCAFLETNG+VN D+ E H SQK+KRSL+RWKLAVQWRLLYKKALVDCI    R 
Sbjct: 421 TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGKKIRM 478

Query: 481 I 482
           +
Sbjct: 481 V 478

BLAST of Cla97C05G093460 vs. TrEMBL
Match: tr|A0A1S3C590|A0A1S3C590_CUCME (protein SET DOMAIN GROUP 40 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 759.6 bits (1960), Expect = 4.3e-216
Identity = 382/491 (77.80%), Positives = 407/491 (82.89%), Query Frame = 0

Query: 1   METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGG-----RGLGAVRQL 60
           METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GG     RGL AVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  NKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWF 120
           NKGEL+LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSST                    
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSST-------------------- 120

Query: 121 PYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQL 180
                                   QVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQL
Sbjct: 121 ------------------------QVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLN 240
           QTFKAWLWASATISSRTL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 GDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEY 300
            ++   E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEY
Sbjct: 241 DEL---ESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEY 300

Query: 301 YGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG 360
           YGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG
Sbjct: 301 YGFLLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG 360

Query: 361 VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTREL 420
           VGHLAYAGSQLS+KNE LVMQWLSKNCHTVLN+LPTS+EED+QLLCNI+KVQDLQV REL
Sbjct: 361 VGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQREL 420

Query: 421 RKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCIS 480
           RK+LLTYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI 
Sbjct: 421 RKMLLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIG 444

Query: 481 YCTRTISSLSS 487
           YCTRTI SLSS
Sbjct: 481 YCTRTICSLSS 444

BLAST of Cla97C05G093460 vs. TrEMBL
Match: tr|A0A1Q3B175|A0A1Q3B175_CEPFO (SET domain-containing protein/Rubis-subs-bind domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_05277 PE=4 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 3.2e-171
Identity = 301/482 (62.45%), Positives = 368/482 (76.35%), Query Frame = 0

Query: 2   ETEGSFGSLLRWAADHGISDSVDQQ-----TSHSCLGHSLCVCFFPDAGGRGLGAVRQLN 61
           E E    S L+WAA+ GI+DS   Q     TSHSCLGHSL V  FPDAGGRGLGAVR L 
Sbjct: 4   EEERRLESFLKWAAELGITDSTKNQQSQNATSHSCLGHSLKVSNFPDAGGRGLGAVRDLR 63

Query: 62  KGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFP 121
           KGE++LRVPKS L+T++ LS  D KL +ALN +PSLSSTQ+LT CLLYE+GKG SSWW+P
Sbjct: 64  KGEMILRVPKSALITSKTLSFNDHKLYLALNRHPSLSSTQRLTVCLLYEMGKGASSWWYP 123

Query: 122 YLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQ 181
           YL H P+SY ILATFG+FEKQALQVD AIW TEKA  K+ +EW     LM+E  +K QL 
Sbjct: 124 YLMHFPRSYHILATFGEFEKQALQVDDAIWTTEKAIAKAELEWKEANMLMKELKLKRQLL 183

Query: 182 TFKAWLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNG 241
           +F AWLWASA ISSRTLH+ WDEAGCLCPVGDLFNY AP+ E    + VSS     S++ 
Sbjct: 184 SFTAWLWASAAISSRTLHIHWDEAGCLCPVGDLFNYDAPD-EATPSLQVSSLRNGESMDA 243

Query: 242 DMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYY 301
             + D+L + +R     LTDGGFEE+V+AYCFYAR+SY++GEQVLLSYGTY+NLELLE+Y
Sbjct: 244 LDSEDQLAQSQR-----LTDGGFEEDVAAYCFYARKSYQEGEQVLLSYGTYTNLELLEHY 303

Query: 302 GFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGV 361
           GF L +NPN+KVFIP+E  +Y SSSWPKESLYIHQ+G PSFALLS LRLWAT  ++RR V
Sbjct: 304 GFFLNKNPNDKVFIPLEPKMYCSSSWPKESLYIHQDGKPSFALLSTLRLWATPQSQRRSV 363

Query: 362 GHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELR 421
           GHLAY+GSQLS+ NEI VM+W+SKNCH +L + P+S++ED+ LL  I ++ +     ELR
Sbjct: 364 GHLAYSGSQLSMDNEISVMRWISKNCHLILKNFPSSIKEDSFLLSAIDEIPNSCTALELR 423

Query: 422 KVLLTYGGEFCAFLETNGLVNSDD-TEVHISQKIKRSLERWKLAVQWRLLYKKALVDCIS 478
            ++ T GGE C FL   G++N +    +H+S+K + S+ERWKLAVQWRL YKK+LVDCI 
Sbjct: 424 NMMSTLGGEGCNFLRAIGMLNRESAANLHLSKKARSSIERWKLAVQWRLWYKKSLVDCID 479

BLAST of Cla97C05G093460 vs. Swiss-Prot
Match: sp|Q6NQJ8|SDG40_ARATH (Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1)

HSP 1 Score: 521.9 bits (1343), Expect = 7.4e-147
Identity = 270/482 (56.02%), Positives = 348/482 (72.20%), Query Frame = 0

Query: 9   SLLRWAADHGISDSVD-QQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKS 68
           + LRWAA+ GISDS+D  +   SCLGHSL V  FPDAGGRGLGA R+L KGELVL+VP+ 
Sbjct: 10  TFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVPRK 69

Query: 69  VLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKHLPQSYDI 128
            L+TT+ +  +D KL+ A+N + SLSSTQ L+ CLLYE+ K   S+W+PYL H+P+ YD+
Sbjct: 70  ALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDL 129

Query: 129 LATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASAT 188
           LATFG+FEKQALQV+ A+WATEKA  K + EW     LM+E  +K + ++F+AWLWASAT
Sbjct: 130 LATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWASAT 189

Query: 189 ISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEK 248
           ISSRTLHVPWD AGCLCPVGDLFNY AP G+  +       + +V   G +   E H E+
Sbjct: 190 ISSRTLHVPWDSAGCLCPVGDLFNYDAP-GDYSNTPQGPESANNVEEAGLVV--ETHSER 249

Query: 249 RDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNEK 308
                 LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YGF+L+EN N+K
Sbjct: 250 ------LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 309

Query: 309 VFIPIEHDIYS-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 368
           VFIP+E  ++S +SSWPK+SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 310 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 369

Query: 369 LSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGE 428
           +S+KNEILVM+W+S+ C +VL DLPTSV ED  LL NI K+QD ++  E +K    +G E
Sbjct: 370 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 429

Query: 429 FCAFLETNGLVN---SDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTIS 485
             AFL+ N L +        +  S+K  R L +W+ +VQWRL YK+ L DCISYC   ++
Sbjct: 430 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of Cla97C05G093460 vs. Swiss-Prot
Match: sp|B7ZUF3|SETD3_XENTR (Histone-lysine N-methyltransferase setd3 OS=Xenopus tropicalis OX=8364 GN=setd3 PE=2 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 3.2e-17
Identity = 106/431 (24.59%), Positives = 180/431 (41.76%), Query Frame = 0

Query: 4   EGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLR 63
           E  F  L+ W  ++G S            G  L    FP+  G GL A R++   EL L 
Sbjct: 76  EDYFPELMEWCKENGASTD----------GFELVE--FPEE-GFGLKATREIKAEELFLW 135

Query: 64  VPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FCLLYEIGKGTSSWWFPYLKHL 123
           VP+ +L+T +  S +   L    ++   L +   +T  F LL E     +S+W PY+K L
Sbjct: 136 VPRKLLMTVE--SAKGSVLGPLYSQDRILQAMGNITLAFHLLCE-RADPNSFWLPYIKTL 195

Query: 124 PQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQ----- 183
           P  YD    F + E Q LQ   AI         +  ++     ++Q     N+L      
Sbjct: 196 PNEYDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYFYKVIQTHPNANKLPLKDSF 255

Query: 184 TFKAWLWASATISSRTLHVPWDEAG----CLCPVGDLFNYAAPEGEPLDVMDVSSFSPHV 243
           TF  + WA +++ +R   +P ++       L P+ D+ N+                    
Sbjct: 256 TFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT------------------- 315

Query: 244 SLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLEL 303
             NG +TT    E+ R    AL D                 +K GEQ+ + YGT SN E 
Sbjct: 316 --NGLITTGYNLEDDRCECVALQD-----------------FKSGEQIYIFYGTRSNAEF 375

Query: 304 LEYYGFLLQENPNEKVFIPI-----------EHDIYSSSSWPKESLY-IHQNGNP-SFAL 363
           + + GF  + N +++V I +           + ++ + +  P  S++ +H    P S  L
Sbjct: 376 VIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHVTEPPISAQL 435

Query: 364 LSALRLWATHPNKRRG----------VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDL 401
           L+ LR++  + ++ +G          +  L  +   +S +NEI +  +L      +L   
Sbjct: 436 LAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVSWENEIKLWTFLEARASLLLKTY 452

BLAST of Cla97C05G093460 vs. Swiss-Prot
Match: sp|B2KI88|SETD3_RHIFE (Histone-lysine N-methyltransferase setd3 OS=Rhinolophus ferrumequinum OX=59479 GN=SETD3 PE=3 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 2.6e-14
Identity = 100/433 (23.09%), Positives = 175/433 (40.42%), Query Frame = 0

Query: 4   EGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLR 63
           E  F  L++WA+++G S    +  S                 G GL A R +   EL L 
Sbjct: 76  EDYFPDLMKWASENGASVEGFEMVSFK-------------EEGFGLRATRDIKAEELFLW 135

Query: 64  VPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FCLLYEIGKGTSSWWFPYLKHL 123
           VP+ +L+T +  S ++  L    ++   L +   +T  F LL E     +S+W PY++ L
Sbjct: 136 VPRKLLMTVE--SAKNSVLGPLYSQDRILQAMGNITLAFHLLCE-RADPNSFWQPYIQTL 195

Query: 124 PQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQ----- 183
           P  YD    FG+ E + LQ   AI         +  ++     ++Q     N+L      
Sbjct: 196 PSEYDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSF 255

Query: 184 TFKAWLWASATISSRTLHVPWDEAG----CLCPVGDLFNYAAPEGEPLDVMDVSSFSPHV 243
           T++ + WA +++ +R   +P ++       L P+ D+ N+                    
Sbjct: 256 TYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT------------------- 315

Query: 244 SLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLEL 303
             NG +TT    E+ R    AL D                 ++ GEQ+ + YGT SN E 
Sbjct: 316 --NGLITTGYNLEDDRCECVALQD-----------------FQAGEQIYIFYGTRSNAEF 375

Query: 304 LEYYGFLLQENPNEKVFIPI-----------EHDIYSSSSWPKESLY-IHQNGNP-SFAL 363
           + + GF    N +++V I +           + ++ + +  P  S++ +H    P S  L
Sbjct: 376 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQL 435

Query: 364 LSALRLWA-------THPNKRRGVGHLAYAGSQ---LSIKNEILVMQWLSKNCHTVLNDL 403
           L+ LR++         H      +  +   G+    +S  NE+ +  +L      +L   
Sbjct: 436 LAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTY 454

BLAST of Cla97C05G093460 vs. Swiss-Prot
Match: sp|Q5ZML9|SETD3_CHICK (Histone-lysine N-methyltransferase setd3 OS=Gallus gallus OX=9031 GN=SETD3 PE=2 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 1.7e-13
Identity = 96/428 (22.43%), Positives = 172/428 (40.19%), Query Frame = 0

Query: 7   FGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPK 66
           F  L++WA ++G S    +  +              +  G GL A R++   EL L VP+
Sbjct: 79  FPELIKWATENGASTEGFEIANF-------------EEEGFGLKATREIKAEELFLWVPR 138

Query: 67  SVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FCLLYEIGKGTSSWWFPYLKHLPQS 126
            +L+T +  S ++  L    ++   L +   +T  F LL E     +S+W PY++ LP  
Sbjct: 139 KLLMTVE--SAKNSVLGSLYSQDRILQAMGNITLAFHLLCE-RANPNSFWLPYIQTLPSE 198

Query: 127 YDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQ-----TFK 186
           YD    F + E Q L+   AI         +  ++     ++Q     ++L      T+ 
Sbjct: 199 YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYAYFYKVIQTHPNASKLPLKDSFTYD 258

Query: 187 AWLWASATISSRTLHVPWDEAG----CLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLN 246
            + WA +++ +R   +P ++       L P+ D+ N+                      N
Sbjct: 259 DYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT---------------------N 318

Query: 247 GDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEY 306
           G +TT    E+ R    AL D                 +K GEQ+ + YGT SN E + +
Sbjct: 319 GLITTGYNLEDDRCECVALQD-----------------FKAGEQIYIFYGTRSNAEFVIH 378

Query: 307 YGFLLQENPNEKVFIPI-----------EHDIYSSSSWPKESLYIHQNGNP--SFALLSA 366
            GF    N +++V I +           + ++ + +  P  S++   +  P  S  LL+ 
Sbjct: 379 SGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHSIEPPISAQLLAF 438

Query: 367 LRLWATHPN--KRRGVGH--------LAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTS 401
           LR++  +    K   +G         L  +   +S  NE+ +  +L      +L    T+
Sbjct: 439 LRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEVKLWTFLEARASLLLKTYKTT 452

BLAST of Cla97C05G093460 vs. Swiss-Prot
Match: sp|A9X1D0|SETD3_PAPAN (Histone-lysine N-methyltransferase setd3 OS=Papio anubis OX=9555 GN=SETD3 PE=3 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 1.7e-13
Identity = 101/446 (22.65%), Positives = 181/446 (40.58%), Query Frame = 0

Query: 4   EGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLR 63
           E  F  L++WA+++G S    +  +                 G GL A R +   EL L 
Sbjct: 76  EDYFPDLMKWASENGASVEGFEMVNFK-------------EEGFGLRATRDIKAEELFLW 135

Query: 64  VPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQK--LTFCLLYEIGKGTSSWWFPYLKHL 123
           VP+ +L+T +  S ++  L    ++   L +     L F LL E     +S+W PY++ L
Sbjct: 136 VPRKLLMTVE--SAKNSVLGPLYSQDRILQAMGNIALAFHLLCE-RANPNSFWQPYIQTL 195

Query: 124 PQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQ----- 183
           P  YD    F + E + LQ   AI         +  ++     ++Q     N+L      
Sbjct: 196 PSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSF 255

Query: 184 TFKAWLWASATISSRTLHVPWDEAG----CLCPVGDLFNYAAPEGEPLDVMDVSSFSPHV 243
           T++ + WA +++ +R   +P ++       L P+ D+ N+                    
Sbjct: 256 TYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT------------------- 315

Query: 244 SLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLEL 303
             NG +TT    E+ R    AL D                 ++ GEQ+ + YGT SN E 
Sbjct: 316 --NGLITTGYNLEDDRCECVALQD-----------------FRAGEQIYIFYGTRSNAEF 375

Query: 304 LEYYGFLLQENPNEKVFIPI-----------EHDIYSSSSWPKESLY-IHQNGNP-SFAL 363
           + + GF    N +++V I +           + ++ + +  P  S++ +H    P S  L
Sbjct: 376 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQL 435

Query: 364 LSALRLWATHPNKRR-------GVGHLAYAGSQ---LSIKNEILVMQWLSKNCHTVLNDL 416
           L+ LR++     + +        +  +   G+    +S  NE+ +  +L      +L   
Sbjct: 436 LAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTY 463

BLAST of Cla97C05G093460 vs. TAIR10
Match: AT5G17240.1 (SET domain group 40)

HSP 1 Score: 521.9 bits (1343), Expect = 4.1e-148
Identity = 270/482 (56.02%), Positives = 348/482 (72.20%), Query Frame = 0

Query: 9   SLLRWAADHGISDSVD-QQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKS 68
           + LRWAA+ GISDS+D  +   SCLGHSL V  FPDAGGRGLGA R+L KGELVL+VP+ 
Sbjct: 10  TFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVPRK 69

Query: 69  VLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGTSSWWFPYLKHLPQSYDI 128
            L+TT+ +  +D KL+ A+N + SLSSTQ L+ CLLYE+ K   S+W+PYL H+P+ YD+
Sbjct: 70  ALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDL 129

Query: 129 LATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASAT 188
           LATFG+FEKQALQV+ A+WATEKA  K + EW     LM+E  +K + ++F+AWLWASAT
Sbjct: 130 LATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWASAT 189

Query: 189 ISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEK 248
           ISSRTLHVPWD AGCLCPVGDLFNY AP G+  +       + +V   G +   E H E+
Sbjct: 190 ISSRTLHVPWDSAGCLCPVGDLFNYDAP-GDYSNTPQGPESANNVEEAGLVV--ETHSER 249

Query: 249 RDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNEK 308
                 LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YGF+L+EN N+K
Sbjct: 250 ------LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 309

Query: 309 VFIPIEHDIYS-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 368
           VFIP+E  ++S +SSWPK+SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 310 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 369

Query: 369 LSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGE 428
           +S+KNEILVM+W+S+ C +VL DLPTSV ED  LL NI K+QD ++  E +K    +G E
Sbjct: 370 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 429

Query: 429 FCAFLETNGLVN---SDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTIS 485
             AFL+ N L +        +  S+K  R L +W+ +VQWRL YK+ L DCISYC   ++
Sbjct: 430 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of Cla97C05G093460 vs. TAIR10
Match: AT3G07670.1 (Rubisco methyltransferase family protein)

HSP 1 Score: 57.0 bits (136), Expect = 3.8e-08
Identity = 94/398 (23.62%), Positives = 150/398 (37.69%), Query Frame = 0

Query: 43  DAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCL 102
           D G RGL A + L KGE +L VP S++++       + +    +  Y  +     L   L
Sbjct: 97  DIGERGLVASQNLRKGEKLLFVPPSLVISADS-EWTNAEAGEVMKRY-DVPDWPLLATYL 156

Query: 103 LYEIGKGTSSWWFPYLKHLP-QSYDIL---ATFGDFEKQALQVDYAIWATEKAALKSRME 162
           + E     SS WF Y+  LP Q Y +L    T  D   +A Q+       E+A  +    
Sbjct: 157 ISEASLQKSSRWFNYISALPRQPYSLLYWTRTELDMYLEASQI------RERAIERITNV 216

Query: 163 WGGVKGLMQESNIKN-QL--------QTFKAWLWASATISSRTLHVP-WDEAGCLCPVGD 222
            G  + L      K+ QL        +TFK   W+   + SR + +P  D    L P  D
Sbjct: 217 VGTYEDLRSRIFSKHPQLFPKEVFNDETFK---WSFGILFSRLVRLPSMDGRFALVPWAD 276

Query: 223 LFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCF 282
           + N+       LD                                     ++++     F
Sbjct: 277 MLNHNCEVETFLD-------------------------------------YDKSSKGVVF 336

Query: 283 YARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKE--- 342
                Y+ GEQV +SYG  SN ELL  YGF+ +E  N    + +   +  +    +E   
Sbjct: 337 TTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLD 396

Query: 343 SLYIHQNGNPS----------FALLSALRLWATHPNKRRGVGHLAYAGS-QLSIKNEILV 402
           +L  H    P             L++   L  + P+ R     +A A S + S KN++  
Sbjct: 397 ALKKHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKY 445

Query: 403 MQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVT 413
            +        +L+   TS+ + ++ L   S   DL +T
Sbjct: 457 PEIEEDALQFILDSCETSISKYSRFL-KESGSMDLDIT 445

BLAST of Cla97C05G093460 vs. TAIR10
Match: AT2G18850.1 (SET domain-containing protein)

HSP 1 Score: 53.1 bits (126), Expect = 5.4e-07
Identity = 64/282 (22.70%), Positives = 112/282 (39.72%), Query Frame = 0

Query: 43  DAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCL 102
           D  GRG  A   L  G++ L +P S +++ + +   D  +   L  +  ++S   L    
Sbjct: 172 DGYGRGAIASEDLKFGDVALEIPVSSIISEEYVYNSD--MYPILETFDGITSETMLLLWT 231

Query: 103 LYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGV 162
           + E      S + PY   L +++    +FG      +++D  +   E    K  +     
Sbjct: 232 MRE-KHNLDSKFKPYFDSLQENFCTGLSFG--VDAIMELDGTLLLDEIMQAKELLRERYD 291

Query: 163 KGLMQESNIKN----QLQTFKAWLWASATISSRTLHVPWDEA---GCLCPVGDLFNYAAP 222
           + +   SN +     +L T++ +LWA     S ++ + + +     CL PV    N+   
Sbjct: 292 ELIPLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNH--- 351

Query: 223 EGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYK 282
                      S  PH+   G +  +                      S+  F       
Sbjct: 352 -----------SIYPHIVKYGKVDIE---------------------TSSLKFPVSRPCN 411

Query: 283 KGEQVLLSYGTYSNLELLEYYGFLLQ-ENPNEKVFIPIEHDI 317
           KGEQ  LSYG YS+  LL +YGFL + +NP +   IP++ D+
Sbjct: 412 KGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD--VIPLDFDV 411

BLAST of Cla97C05G093460 vs. TAIR10
Match: AT1G24610.1 (Rubisco methyltransferase family protein)

HSP 1 Score: 50.1 bits (118), Expect = 4.6e-06
Identity = 100/457 (21.88%), Positives = 170/457 (37.20%), Query Frame = 0

Query: 46  GRGLGAVRQLNKGELVLRVPKSVLLTTQG----LSLEDEKLAMALNEYPSLSSTQKLTFC 105
           G GL +  Q++ G  ++ +P  V L  +             A+A    P      KL   
Sbjct: 60  GIGLISTEQISPGTDLISLPPHVPLRFESDXXXXXXXXXXXALA-RRVPEELWAMKLGLR 119

Query: 106 LLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGG 165
           LL E     S WW PY+ +LP++Y +   F   + + LQ    +    K   +  +E+  
Sbjct: 120 LLQERANADSFWW-PYISNLPETYTVPIFFPGEDIKNLQYAPLLHQVNKRC-RFLLEF-- 179

Query: 166 VKGLMQESNIKNQLQTFKAWLWASATISSRTLHVPWDEAGCLCPVGDLFNYAAPEGEPLD 225
                 E  I+  L+  KA     +        + W  +        L      +G   D
Sbjct: 180 ------EQEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAFRLHGNKKLQGGSSD 239

Query: 226 VMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVL 285
             DV    P +    DM     H  K +        G + N +     A    K+ + +L
Sbjct: 240 --DVPMMLPLI----DMCN---HSFKPNARIIQEQNGADSN-TLVKVVAETEVKENDPLL 299

Query: 286 LSYGTYSNLELLEYYGFLLQENPNEKVFIPIEHDIYSSSSW------PKESL---YIHQN 345
           L+YG  SN   L  YGF+++ NP + + +  +  +  ++S       PK S    + HQ 
Sbjct: 300 LNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDAASMAAGVSSPKFSSPAPWQHQ- 359

Query: 346 GNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTS 405
                 LLS L L    PN +  +G       +L     IL           +  +L   
Sbjct: 360 ------LLSQLNLAGEMPNLKVTIGGPEPVEGRLLAALRIL-----------LCGELVEV 419

Query: 406 VEEDNQLLCNISKVQDLQVTREL---RKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKI 465
            + D+  L ++S V    +  E+   R V+       C    ++      + E  I Q +
Sbjct: 420 EKHDSDTLKSLSAVAPFGIANEIAVFRTVI-----ALCVIALSHFPTKIMEDEAIIKQGV 469

Query: 466 KRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS 487
             + E   L++++R+  K  ++D +   TR +  LSS
Sbjct: 480 SATAE---LSIKYRIQKKSVIIDVMKDLTRRVKLLSS 469

BLAST of Cla97C05G093460 vs. TAIR10
Match: AT5G14260.3 (Rubisco methyltransferase family protein)

HSP 1 Score: 46.6 bits (109), Expect = 5.1e-05
Identity = 69/283 (24.38%), Positives = 111/283 (39.22%), Query Frame = 0

Query: 49  LGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGK 108
           + A   L KG++   VP S+++T + + L +E +A  L     LS    L   L+YE  +
Sbjct: 115 VAASEDLQKGDVAFSVPDSLVVTLERV-LGNETIAELLTT-NKLSELACLALYLMYEKKQ 174

Query: 109 GTSSWWFPYLKHLPQSYDILATFGDFEKQA------LQVDYAIWATEKAALKSRME---- 168
           G  S W+PY++ L    D     G  + ++       ++DY   +  KA +  R E    
Sbjct: 175 GKKSVWYPYIREL----DRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKR 234

Query: 169 --------WGGVKGLMQE--SNIKNQLQTFKAWLWASATISSRTLHVPWDEAGCLCPVGD 228
                   W     L Q+   +I  +  +F+ +  A   I S  +H        L  VG 
Sbjct: 235 EYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVH--------LQNVGL 294

Query: 229 LFNYA-APEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYC 288
              +A  P G PL          + S    M T               DG  E  V    
Sbjct: 295 ARRFALVPLGPPL--------LAYCSNCKAMLT-------------AVDGAVELVVD--- 354

Query: 289 FYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNEKVFI 311
                 YK G+ +++  G   N +LL  YGF+ ++NP ++V +
Sbjct: 355 ----RPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRVIV 355

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
XP_004145844.1  8.1e-251  87.04  PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus]
XP_008457030.1  3.8e-248  86.63  PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo]
XP_022983189.1  1.4e-247  84.98  protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima]
XP_008457029.1  2.7e-246  85.74  PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo]
KGN57798.1  4.6e-246  86.28  hypothetical protein Csa_3G307670 [Cucumis sativus]

Match Name  E-value  Identity  Description
tr|A0A1S3C4J5|A0A1S3C4J5_CUCME  2.5e-248  86.63  protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496809 P...
tr|A0A1S3C4N2|A0A1S3C4N2_CUCME  1.8e-246  85.74  protein SET DOMAIN GROUP 40 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496809 P...
tr|A0A0A0L7L4|A0A0A0L7L4_CUCSA  3.0e-246  86.28  Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G307670 PE=4 SV=1
tr|A0A1S3C590|A0A1S3C590_CUCME  4.3e-216  77.80  protein SET DOMAIN GROUP 40 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496809 P...
tr|A0A1Q3B175|A0A1Q3B175_CEPFO  3.2e-171  62.45  SET domain-containing protein/Rubis-subs-bind domain-containing protein OS=Cepha...

Match Name  E-value  Identity  Description
sp|Q6NQJ8|SDG40_ARATH  7.4e-147  56.02  Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1
sp|B7ZUF3|SETD3_XENTR  3.2e-17  24.59  Histone-lysine N-methyltransferase setd3 OS=Xenopus tropicalis OX=8364 GN=setd3 ...
sp|B2KI88|SETD3_RHIFE  2.6e-14  23.09  Histone-lysine N-methyltransferase setd3 OS=Rhinolophus ferrumequinum OX=59479 G...
sp|Q5ZML9|SETD3_CHICK  1.7e-13  22.43  Histone-lysine N-methyltransferase setd3 OS=Gallus gallus OX=9031 GN=SETD3 PE=2 ...
sp|A9X1D0|SETD3_PAPAN  1.7e-13  22.65  Histone-lysine N-methyltransferase setd3 OS=Papio anubis OX=9555 GN=SETD3 PE=3 S...

Match Name  E-value  Identity  Description
AT5G17240.1  4.1e-148  56.02  SET domain group 40
AT3G07670.1  3.8e-08  23.62  Rubisco methyltransferase family protein
AT2G18850.1  5.4e-07  22.70  SET domain-containing protein
AT1G24610.1  4.6e-06  21.88  Rubisco methyltransferase family protein
AT5G14260.3  5.1e-05  24.38  Rubisco methyltransferase family protein
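
The E-values in these tables span many orders of magnitude, so it can help to separate the strong match from the weak family-level hits. A minimal Python sketch, using the TAIR10 values listed above; the function name `filter_hits` and the cutoff of 1e-10 are illustrative assumptions, not thresholds used by this database:

```python
# TAIR10 hits from the summary table: (match name, E-value, percent identity).
tair10_hits = [
    ("AT5G17240.1", 4.1e-148, 56.02),
    ("AT3G07670.1", 3.8e-08, 23.62),
    ("AT2G18850.1", 5.4e-07, 22.70),
    ("AT1G24610.1", 4.6e-06, 21.88),
    ("AT5G14260.3", 5.1e-05, 24.38),
]

def filter_hits(hits, max_evalue=1e-10):
    """Return names of hits whose E-value is at or below max_evalue (assumed cutoff)."""
    return [name for name, evalue, _identity in hits if evalue <= max_evalue]

print(filter_hits(tair10_hits))
```

With this assumed cutoff only AT5G17240.1 (SET domain group 40) is retained; a looser cutoff would admit the Rubisco methyltransferase family hits as well.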
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term  Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term  Definition
IPR036464  Rubisco_LSMT_subst-bd_sf
IPR001214  SET_dom
IPR015353  Rubisco_LSMT_subst-bd
GO Assignments
This gene is annotated with the following GO terms.
Category  Term Accession  Term Name
biological_process  GO:0008150  biological_process
biological_process  GO:0032259  methylation
biological_process  GO:0018026  peptidyl-lysine monomethylation
cellular_component  GO:0005575  cellular_component
molecular_function  GO:0005515  protein binding
molecular_function  GO:0003674  molecular_function
molecular_function  GO:0008168  methyltransferase activity
molecular_function  GO:0016279  protein-lysine N-methyltransferase activity
molecular_function  GO:0016740  transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla97C05G093460.1  Cla97C05G093460.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR015353  Rubisco LSMT, substrate-binding domain  PFAM  PF09273  Rubis-subs-bind  coord: 336..437; e-value: 7.8E-6; score: 26.6
IPR001214  SET domain  PFAM  PF00856  SET  coord: 46..285; e-value: 5.0E-8; score: 33.4
IPR001214  SET domain  PROSITE  PS50280  SET  coord: 34..285; score: 11.179
None  No IPR available  PANTHER  PTHR13271:SF8  SET DOMAIN-CONTAINING PROTEIN 4  coord: 5..474
None  No IPR available  PANTHER  PTHR13271  UNCHARACTERIZED PUTATIVE METHYLTRANSFERASE  coord: 5..474
None  No IPR available  SUPERFAMILY  SSF82199  SET domain  coord: 8..303
IPR036464  Rubisco LSMT, substrate-binding domain superfamily  SUPERFAMILY  SSF81822  RuBisCo LSMT C-terminal, substrate-binding domain  coord: 296..410
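
The `coord` values give the start and end of each domain match on the protein. A minimal Python sketch for turning them into domain lengths, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention) and using a hypothetical helper named `span_length`:

```python
def span_length(coord):
    """Length in residues of a 'start..end' span, assuming 1-based inclusive coordinates."""
    start, end = (int(x) for x in coord.split(".."))
    return end - start + 1

# Coordinates copied from the InterPro annotation table of this record.
domains = {
    "PF00856 (SET)": "46..285",
    "PF09273 (Rubis-subs-bind)": "336..437",
    "PS50280 (SET)": "34..285",
}

for name, coord in domains.items():
    print(name, span_length(coord))
```

Under that assumption the PFAM SET match covers 240 residues and the Rubisco LSMT substrate-binding match covers 102 residues.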