Cla97C10G194290 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C10G194290
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Histone-lysine N-methyltransferase EZA1 isoform X3
Location: Cla97Chr10: 23552585 .. 23553798 (-)
RNA-Seq Expression: Cla97C10G194290
Synteny: Cla97C10G194290
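
The Location field can be converted into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF convention; this page does not state it), and `parse_location` and `feature_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def feature_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cla97Chr10: 23552585 .. 23553798 (-)")
length = feature_length(start, end)
print(chrom, strand, length)
```

Under that assumption the gene spans 1214 bp on the minus strand of Cla97Chr10.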
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAATTCTAAAGCCTTCCTTTTTGTTTCATCTCTTCTCTATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCGCTTCTCCTCCTCGCTCGCCTCTCTTCTGCTCTATATCTTTTTGGCTTGTTACCATTGTCATCATCGTTGTTGCTCTCTCTTATTCTCTACGTAAACACTCCTCTTCTTTTTCTTCTTGTTTCATTCGTCACTGTCTCCACCCTTCTCCACTCTCTCACGGGAAAATCCGCTCTCCCGACCAAACTCCCTAGTCCCATCTCCCGACCACGTCTTTACACTGCATGGATTTTTCTATGCACTCTCCAGGTATATTAAAACAACAATTAACTTAAAAGCTTAAGTTGATAAGTTATTACAATCACTTTTTTTTTTTTGGGTTTAAACGGTAATGTGGGGTGGAGATATTTAGAAGAGATATAAATCGATTGAAATTTACTCAAATTAATTAACTTTACAATAAATTTAAACCAAAAAACTTAAATAGATATAAGATGGAACTTAACTTGCATCCATTCAAGTTTTTAAATCATGTCATTGATGTTAATAAAACATATTTGTGATTAGTTAAGAATGTCATTATCATCAATGACGTGATTTTGAGTGTAGATAAGGTTAGAAATTTATTATATCATCGGAAATTGAAAATTTAGAATTAGAAAATTGACTTTCCAGAAAGCATAATCTATCGAGTTACGATCATTTATTCATATTCTTAACAACAACAACAGGTATGTGTCGGTGTTGGGATCGAGGGGAGCTTATCAAGCGGCCTGAACGACGCGCCGGCCGGCCACATCGAGAGTGGCCTGTGGAGGAGGTTGTTGTTCTTTTTTGGACTTCATGAGGCGGTGGTGCACTGGACGAGGGTGGTGGTAAAGCCGGTGGTGGACGACACCGTATTCGGGGAATCTCGGAAAGAGAAGTGGTTTGAGACGGCAGCTACGGCGGTGAGCTTGGGGGGACTGTGGTGGTGGCGGTTGAGGGATGAGGCGGAGGCGCTAGTGGTTGTGGCGGAAAGCAAGTGGTTGACGTCGGCGGAATTGGGTCCGGCAGACATTTCCGGCTGGTGCCTGTATTACATTACGGTGGCAATTGGAATTGCTAAGATTGTTAATTCTGTTGCTTGGTTTGGTGGGATTTTTGTCTTTAAAAAACATTCTAAAAGGCCCCATGAGGTTGGTGTTGAGGACAATGTTTGA

mRNA sequence

ATGGAAATTCTAAAGCCTTCCTTTTTGTTTCATCTCTTCTCTATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCGCTTCTCCTCCTCGCTCGCCTCTCTTCTGCTCTATATCTTTTTGGCTTGTTACCATTGTCATCATCGTTGTTGCTCTCTCTTATTCTCTACGTAAACACTCCTCTTCTTTTTCTTCTTGTTTCATTCGTCACTGTCTCCACCCTTCTCCACTCTCTCACGGGAAAATCCGCTCTCCCGACCAAACTCCCTAGTCCCATCTCCCGACCACGTCTTTACACTGCATGGATTTTTCTATGCACTCTCCAGGTATGTGTCGGTGTTGGGATCGAGGGGAGCTTATCAAGCGGCCTGAACGACGCGCCGGCCGGCCACATCGAGAGTGGCCTGTGGAGGAGGTTGTTGTTCTTTTTTGGACTTCATGAGGCGGTGGTGCACTGGACGAGGGTGGTGGTAAAGCCGGTGGTGGACGACACCGTATTCGGGGAATCTCGGAAAGAGAAGTGGTTTGAGACGGCAGCTACGGCGGTGAGCTTGGGGGGACTGTGGTGGTGGCGGTTGAGGGATGAGGCGGAGGCGCTAGTGGTTGTGGCGGAAAGCAAGTGGTTGACGTCGGCGGAATTGGGTCCGGCAGACATTTCCGGCTGGTGCCTGTATTACATTACGGTGGCAATTGGAATTGCTAAGATTGTTAATTCTGTTGCTTGGTTTGGTGGGATTTTTGTCTTTAAAAAACATTCTAAAAGGCCCCATGAGGTTGGTGTTGAGGACAATGTTTGA

Coding sequence (CDS)

ATGGAAATTCTAAAGCCTTCCTTTTTGTTTCATCTCTTCTCTATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCGCTTCTCCTCCTCGCTCGCCTCTCTTCTGCTCTATATCTTTTTGGCTTGTTACCATTGTCATCATCGTTGTTGCTCTCTCTTATTCTCTACGTAAACACTCCTCTTCTTTTTCTTCTTGTTTCATTCGTCACTGTCTCCACCCTTCTCCACTCTCTCACGGGAAAATCCGCTCTCCCGACCAAACTCCCTAGTCCCATCTCCCGACCACGTCTTTACACTGCATGGATTTTTCTATGCACTCTCCAGGTATGTGTCGGTGTTGGGATCGAGGGGAGCTTATCAAGCGGCCTGAACGACGCGCCGGCCGGCCACATCGAGAGTGGCCTGTGGAGGAGGTTGTTGTTCTTTTTTGGACTTCATGAGGCGGTGGTGCACTGGACGAGGGTGGTGGTAAAGCCGGTGGTGGACGACACCGTATTCGGGGAATCTCGGAAAGAGAAGTGGTTTGAGACGGCAGCTACGGCGGTGAGCTTGGGGGGACTGTGGTGGTGGCGGTTGAGGGATGAGGCGGAGGCGCTAGTGGTTGTGGCGGAAAGCAAGTGGTTGACGTCGGCGGAATTGGGTCCGGCAGACATTTCCGGCTGGTGCCTGTATTACATTACGGTGGCAATTGGAATTGCTAAGATTGTTAATTCTGTTGCTTGGTTTGGTGGGATTTTTGTCTTTAAAAAACATTCTAAAAGGCCCCATGAGGTTGGTGTTGAGGACAATGTTTGA

Protein sequence

MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPLLFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSSGLNDAPAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAATAVSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNSVAWFGGIFVFKKHSKRPHEVGVEDNV
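
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, and only the first 18 and last 15 nucleotides of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAAATTCTAAAGCCT"   # first 18 nt of the CDS above
cds_end = "GAGGACAATGTTTGA"        # last 15 nt of the CDS above
print(translate(cds_start))        # MEILKP
print(translate(cds_end))          # EDNV
```

Both fragments agree with the start (MEILKP...) and end (...EDNV) of the protein sequence listed above.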
Homology
BLAST of Cla97C10G194290 vs. NCBI nr
Match: XP_038904328.1 (uncharacterized protein LOC120090682 [Benincasa hispida])

HSP 1 Score: 465.7 bits (1197), Expect = 2.7e-127
Identity = 240/263 (91.25%), Positives = 247/263 (93.92%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL PSFLFHLF+INLLGLLLPLS LLLARLSS LYL GLLPLSS LLLSLILYVN+PL
Sbjct: 1   MEILSPSFLFHLFAINLLGLLLPLSFLLLARLSSVLYLIGLLPLSSPLLLSLILYVNSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           LFLLVSFV VSTL HSLTGKSALPTKLP P+S+PRLYTAWIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LFLLVSFVIVSTLFHSLTGKSALPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDAPAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAATA 180
           GLN A AGHIE GLWRRLLFFFGLHEAVVHWTR VVKPVVDDTVFGESRKEKWFETAATA
Sbjct: 121 GLNHAAAGHIEGGLWRRLLFFFGLHEAVVHWTRAVVKPVVDDTVFGESRKEKWFETAATA 180

Query: 181 VSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNSVAW 240
           VSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIV  VAW
Sbjct: 181 VSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVRFVAW 240

Query: 241 FGGIFVFKKHSKRPHEVGVEDNV 264
           FGGIFV  KHSK+PHEVGVE++V
Sbjct: 241 FGGIFVSTKHSKKPHEVGVENHV 263
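
The percentages in each HSP header are the match counts divided by the alignment length. As a small sketch (the function name `percent` is an illustrative assumption, not part of BLAST), the figures for the Benincasa hispida hit above can be reproduced as follows:

```python
def percent(matches, aligned_length):
    """Share of aligned columns, as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header above: 240 identical and
# 247 positive (identical or conservative) columns out of 263.
identity = percent(240, 263)
positives = percent(247, 263)
print(identity, positives)  # 91.25 93.92
```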

BLAST of Cla97C10G194290 vs. NCBI nr
Match: XP_008454283.1 (PREDICTED: uncharacterized protein LOC103494729 [Cucumis melo])

HSP 1 Score: 430.3 bits (1105), Expect = 1.2e-116
Identity = 225/265 (84.91%), Positives = 237/265 (89.43%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL  SFLFHLF+INLLGLLLPLSLLLLARLSSALYL  LLP   S LLSLILYVN+PL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           LFLLVSFV +STLLHSLTGKS LPTKLP P+S+PRLYTAWIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLND-APAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAAT 180
           GLND    GH+E G+WRRLLFFFGLHEAVVHWTRVVVKPVVDDT++GESRKEKWFETAAT
Sbjct: 121 GLNDLTSTGHVEGGMWRRLLFFFGLHEAVVHWTRVVVKPVVDDTIYGESRKEKWFETAAT 180

Query: 181 AVSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVN-SV 240
           AVSLGGLWWWRLRDEAE LVVVAESKWLTS ELG ADISGWCLYYITV IGIAKIV   +
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSTELGWADISGWCLYYITVVIGIAKIVKYCI 240

Query: 241 AWFGGIFVFKKHSKRPHEVGVEDNV 264
            WFGGIFV +KHSK  + VGVEDNV
Sbjct: 241 GWFGGIFVSRKHSKTSNLVGVEDNV 265

BLAST of Cla97C10G194290 vs. NCBI nr
Match: XP_011652956.2 (uncharacterized protein LOC105435160 [Cucumis sativus])

HSP 1 Score: 423.7 bits (1088), Expect = 1.2e-114
Identity = 223/265 (84.15%), Positives = 232/265 (87.55%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL  SFLFHLF+INLLGLLLPLSLLLLARLSSALYL  LLP   S LLSLILYVN+PL
Sbjct: 22  MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 81

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           LFLLVSFV +STLLHSLTGKS LPTKLP P+S+PRLYTAWIFLCTLQVCVGVGIEGSLSS
Sbjct: 82  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 141

Query: 121 GLND-APAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAAT 180
           GLND    GH+E GLWRRLLFF GLHEAVVHWTR VVKPVVDDT++GE R EKWFETAAT
Sbjct: 142 GLNDLTSTGHVEGGLWRRLLFFLGLHEAVVHWTRAVVKPVVDDTIYGEPRTEKWFETAAT 201

Query: 181 AVSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVN-SV 240
           AVSLGGLWWWRLRDEAE LVVVAESKWLTSAELG ADISGWCLYYITV IGIAKIV   +
Sbjct: 202 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSAELGWADISGWCLYYITVVIGIAKIVKYCI 261

Query: 241 AWFGGIFVFKKHSKRPHEVGVEDNV 264
            WFGG FV K HSK  H VGVEDNV
Sbjct: 262 GWFGGSFVSKTHSKTSHLVGVEDNV 286

BLAST of Cla97C10G194290 vs. NCBI nr
Match: XP_022983396.1 (uncharacterized protein LOC111482002 [Cucurbita maxima])

HSP 1 Score: 412.5 bits (1059), Expect = 2.7e-111
Identity = 217/263 (82.51%), Positives = 232/263 (88.21%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL+P FLFHL ++NLL LLLPLSLLLLARLSSALYL G LPL   LLLSLILYV +PL
Sbjct: 1   MEILRPWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAG-LPLLPPLLLSLILYVTSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           L LLVSFV VS LLHSLTGKSALPTKLP+P+S+PRLYT WIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LILLVSFVVVSALLHSLTGKSALPTKLPAPLSQPRLYTTWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDAPAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAATA 180
           GLN A AGH+E GLWRRLLFFFGLHEAVVHWTR VVKPVVDDTVFGESRKE+WFETAATA
Sbjct: 121 GLNSAAAGHVEGGLWRRLLFFFGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATA 180

Query: 181 VSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNSVAW 240
           VSLGG+WWWRLRDEA+ALVVVAE KWLTS ELGPA+++ WCLYYI VAIGIAKIVNSVAW
Sbjct: 181 VSLGGVWWWRLRDEADALVVVAEIKWLTSTELGPAEVANWCLYYIIVAIGIAKIVNSVAW 240

Query: 241 FGGIFVFKKHSKRPHEVGVEDNV 264
              I V KKHSK   EV V +NV
Sbjct: 241 LVRILVPKKHSKCSDEVVVVNNV 262

BLAST of Cla97C10G194290 vs. NCBI nr
Match: XP_023529111.1 (uncharacterized protein LOC111791848 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 402.1 bits (1032), Expect = 3.6e-108
Identity = 211/263 (80.23%), Positives = 229/263 (87.07%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL+  FLFHL ++NLL LLLPLSLLLLARLSSALYL G LPL   LLLSL+LY+ +PL
Sbjct: 1   MEILRAWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAG-LPLLPPLLLSLVLYLTSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           L LLVSFV +S LLHSLTGKSALPTKLP+P+S+PRLYT WIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LILLVSFVVLSALLHSLTGKSALPTKLPAPVSQPRLYTTWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDAPAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAATA 180
           GLN+A AGHIE  LWRRLLFFFGLHEAVVHWTR VVKPVVDDTVFGESRKE+WFETAATA
Sbjct: 121 GLNNAAAGHIEGCLWRRLLFFFGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATA 180

Query: 181 VSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNSVAW 240
           VSLGG+WWWRLRDEA+ALVVVAE KWLTSAELGPA+++ WCLYYI V IGI KI NSVAW
Sbjct: 181 VSLGGVWWWRLRDEADALVVVAEIKWLTSAELGPAEVANWCLYYIIVGIGIGKIANSVAW 240

Query: 241 FGGIFVFKKHSKRPHEVGVEDNV 264
              I V KKHSK   EV V +NV
Sbjct: 241 LVRILVSKKHSKCSDEVVVVNNV 262

BLAST of Cla97C10G194290 vs. ExPASy TrEMBL
Match: A0A1S3BZH0 (uncharacterized protein LOC103494729 OS=Cucumis melo OX=3656 GN=LOC103494729 PE=4 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 6.0e-117
Identity = 225/265 (84.91%), Positives = 237/265 (89.43%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL  SFLFHLF+INLLGLLLPLSLLLLARLSSALYL  LLP   S LLSLILYVN+PL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           LFLLVSFV +STLLHSLTGKS LPTKLP P+S+PRLYTAWIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLND-APAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAAT 180
           GLND    GH+E G+WRRLLFFFGLHEAVVHWTRVVVKPVVDDT++GESRKEKWFETAAT
Sbjct: 121 GLNDLTSTGHVEGGMWRRLLFFFGLHEAVVHWTRVVVKPVVDDTIYGESRKEKWFETAAT 180

Query: 181 AVSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVN-SV 240
           AVSLGGLWWWRLRDEAE LVVVAESKWLTS ELG ADISGWCLYYITV IGIAKIV   +
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSTELGWADISGWCLYYITVVIGIAKIVKYCI 240

Query: 241 AWFGGIFVFKKHSKRPHEVGVEDNV 264
            WFGGIFV +KHSK  + VGVEDNV
Sbjct: 241 GWFGGIFVSRKHSKTSNLVGVEDNV 265

BLAST of Cla97C10G194290 vs. ExPASy TrEMBL
Match: A0A0A0KWP5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G003705 PE=4 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 5.6e-115
Identity = 223/265 (84.15%), Positives = 232/265 (87.55%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL  SFLFHLF+INLLGLLLPLSLLLLARLSSALYL  LLP   S LLSLILYVN+PL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           LFLLVSFV +STLLHSLTGKS LPTKLP P+S+PRLYTAWIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLND-APAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAAT 180
           GLND    GH+E GLWRRLLFF GLHEAVVHWTR VVKPVVDDT++GE R EKWFETAAT
Sbjct: 121 GLNDLTSTGHVEGGLWRRLLFFLGLHEAVVHWTRAVVKPVVDDTIYGEPRTEKWFETAAT 180

Query: 181 AVSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVN-SV 240
           AVSLGGLWWWRLRDEAE LVVVAESKWLTSAELG ADISGWCLYYITV IGIAKIV   +
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSAELGWADISGWCLYYITVVIGIAKIVKYCI 240

Query: 241 AWFGGIFVFKKHSKRPHEVGVEDNV 264
            WFGG FV K HSK  H VGVEDNV
Sbjct: 241 GWFGGSFVSKTHSKTSHLVGVEDNV 265

BLAST of Cla97C10G194290 vs. ExPASy TrEMBL
Match: A0A6J1J235 (uncharacterized protein LOC111482002 OS=Cucurbita maxima OX=3661 GN=LOC111482002 PE=4 SV=1)

HSP 1 Score: 412.5 bits (1059), Expect = 1.3e-111
Identity = 217/263 (82.51%), Positives = 232/263 (88.21%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL+P FLFHL ++NLL LLLPLSLLLLARLSSALYL G LPL   LLLSLILYV +PL
Sbjct: 1   MEILRPWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAG-LPLLPPLLLSLILYVTSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           L LLVSFV VS LLHSLTGKSALPTKLP+P+S+PRLYT WIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LILLVSFVVVSALLHSLTGKSALPTKLPAPLSQPRLYTTWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDAPAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAATA 180
           GLN A AGH+E GLWRRLLFFFGLHEAVVHWTR VVKPVVDDTVFGESRKE+WFETAATA
Sbjct: 121 GLNSAAAGHVEGGLWRRLLFFFGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATA 180

Query: 181 VSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNSVAW 240
           VSLGG+WWWRLRDEA+ALVVVAE KWLTS ELGPA+++ WCLYYI VAIGIAKIVNSVAW
Sbjct: 181 VSLGGVWWWRLRDEADALVVVAEIKWLTSTELGPAEVANWCLYYIIVAIGIAKIVNSVAW 240

Query: 241 FGGIFVFKKHSKRPHEVGVEDNV 264
              I V KKHSK   EV V +NV
Sbjct: 241 LVRILVPKKHSKCSDEVVVVNNV 262

BLAST of Cla97C10G194290 vs. ExPASy TrEMBL
Match: A0A5A7TRD0 (Histone-lysine N-methyltransferase EZA1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G001470 PE=4 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 1.5e-107
Identity = 208/238 (87.39%), Positives = 218/238 (91.60%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL  SFLFHLF+INLLGLLLPLSLLLLARLSSALYL  LLP   S LLSLILYVN+PL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           LFLLVSFV +STLLHSLTGKS LPTKLP P+S+PRLYTAWIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLND-APAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAAT 180
           GLND    GH+E G+WRRLLFFFGLHEAVVHWTRVVVKPVVDDT++GESRKEKWFETAAT
Sbjct: 121 GLNDLTSTGHVEGGMWRRLLFFFGLHEAVVHWTRVVVKPVVDDTIYGESRKEKWFETAAT 180

Query: 181 AVSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNS 238
           AVSLGGLWWWRLRDEAE LVVVAESKWLTS ELG ADISGWCLYYITV IGIAKIVN+
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSTELGWADISGWCLYYITVVIGIAKIVNA 238

BLAST of Cla97C10G194290 vs. ExPASy TrEMBL
Match: A0A5D3E141 (Histone-lysine N-methyltransferase EZA1 isoform X3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G001480 PE=4 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 1.5e-107
Identity = 208/238 (87.39%), Positives = 218/238 (91.60%), Query Frame = 0

Query: 1   MEILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPL 60
           MEIL  SFLFHLF+INLLGLLLPLSLLLLARLSSALYL  LLP   S LLSLILYVN+PL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSS 120
           LFLLVSFV +STLLHSLTGKS LPTKLP P+S+PRLYTAWIFLCTLQVCVGVGIEGSLSS
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLND-APAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAAT 180
           GLND    GH+E G+WRRLLFFFGLHEAVVHWTRVVVKPVVDDT++GESRKEKWFETAAT
Sbjct: 121 GLNDLTSTGHVEGGMWRRLLFFFGLHEAVVHWTRVVVKPVVDDTIYGESRKEKWFETAAT 180

Query: 181 AVSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNS 238
           AVSLGGLWWWRLRDEAE LVVVAESKWLTS ELG ADISGWCLYYITV IGIAKIVN+
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSTELGWADISGWCLYYITVVIGIAKIVNA 238

BLAST of Cla97C10G194290 vs. TAIR 10
Match: AT2G47360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02570.1); Has 58 Blast hits to 55 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 58; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 167.5 bits (423), Expect = 1.4e-41
Identity = 113/278 (40.65%), Positives = 158/278 (56.83%), Query Frame = 0

Query: 3   ILKPSFLFHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPL-----SSSLLLSLILYVN 62
           ++KP   F L +  LL LLLPLS LLL+RLSSA +LF L        SS  + SL L  N
Sbjct: 16  VVKP---FRLVTTTLLSLLLPLSFLLLSRLSSASFLFSLTKSQPQTESSFFVFSLFLRAN 75

Query: 63  TPLLFLLVSFVTVSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGS 122
             +++ +VS ++V TL+  LT K        S    P +  AW+ L  +Q+ VG+G+E +
Sbjct: 76  PAIVYAVVSSISVYTLVLGLTTKITATDPKHSIAFYPHVSIAWLTLFLVQISVGIGLETT 135

Query: 123 LSSGLNDAPAGHIESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVF----GESRKEKW 182
           +S+GL        E     RL+FFFGLHE ++ W RV+V+PVVD+T+     G+ R+E  
Sbjct: 136 ISNGLIIGS----ERNFLSRLVFFFGLHEVMLLWYRVIVRPVVDNTLLGGEDGQRREETV 195

Query: 183 FETAATAVSLGGLWWWRLRDEAEALVVVAESKWL------------TSAELGPADISGWC 242
            E  A AVS G LWWW+LRDE EALV VAE+K               S ++G  D   W 
Sbjct: 196 VERVALAVSCGTLWWWKLRDEVEALVGVAEAKRALLLLLPIDGNVNVSFDVGTVDFVNWW 255

Query: 243 LYYITVAIGIAKIVNSVAWFGGIFVFKKHSKRPHEVGV 260
           LYY+ V IG+ +I+    WFG I +F++ S+R +  G+
Sbjct: 256 LYYMVVTIGMVRIIKGSLWFGMILLFEQGSRRRNPRGI 286

BLAST of Cla97C10G194290 vs. TAIR 10
Match: AT1G02570.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02575.1); Has 108 Blast hits to 55 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 157.9 bits (398), Expect = 1.1e-38
Identity = 101/250 (40.40%), Positives = 141/250 (56.40%), Query Frame = 0

Query: 10  FHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPLLFLLVSFVT 69
           F + SI+LL LL+PLS L L+RLS +       P++ S + SL+   +  +L+ ++S + 
Sbjct: 20  FQMISISLLSLLVPLSFLFLSRLSVS---SSSAPVTVSGVFSLLHQADVGILYTILSLII 79

Query: 70  VSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSSGLN---DAP 129
           VSTL+H L+GK          +    LY  WI L  +Q CV  GIEG++S+ ++   D  
Sbjct: 80  VSTLIHILSGKP------ECSVLHSHLYICWIVLFIVQACVAFGIEGTMSTTISIDTDKS 139

Query: 130 AGHIESGLW--RRLLFFFGLHEAVVHWTRVVVKPVVDDTVFG-ESRKEKWFETAATAVSL 189
                   W   R++FF GLHE ++ W RVVVKPV+DDTVFG    +E+W E A  AV+ 
Sbjct: 140 FSLAAQERWVLVRVMFFLGLHEVMLMWFRVVVKPVIDDTVFGVYVEEERWSERAVVAVTF 199

Query: 190 GGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVNSVAWFGG 249
           G +WWWRLRDE E+LVVVAE K      L   D   W +YYI V IG+ KI     +F  
Sbjct: 200 GLMWWWRLRDEVESLVVVAEVKRNLQIRLEGLDFVNWWMYYICVGIGLVKIFKGFLYFVN 259

Query: 250 IFVFKKHSKR 254
           + +   +  R
Sbjct: 260 MLILTINRSR 260

BLAST of Cla97C10G194290 vs. TAIR 10
Match: AT1G02575.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02570.1). )

HSP 1 Score: 138.3 bits (347), Expect = 9.0e-33
Identity = 94/231 (40.69%), Positives = 134/231 (58.01%), Query Frame = 0

Query: 10  FHLFSINLLGLLLPLSLLLLARLSSALYLFGLLPLSSSLLLSLILYVNTPLLFLLVSFVT 69
           F + SI+ L LLLPLS L L+RLS  LY     P++ S + S+I   +  +L+ ++  + 
Sbjct: 20  FQMISISFLSLLLPLSFLFLSRLS--LYT-SSTPVTVSGVSSVIHQADVGVLYTILFLII 79

Query: 70  VSTLLHSLTGKSALPTKLPSPISRPRLYTAWIFLCTLQVCVGVGIEGSLSSGLNDAPAGH 129
           V TL+HSL+GK          +    LY  WI L   Q C   GI+ ++S+ ++  P  +
Sbjct: 80  VFTLIHSLSGKP------ECSVLHSHLYICWIVLFIAQAC-AFGIKRTMSTTMSINPDKN 139

Query: 130 I-----ESGLWRRLLFFFGLHEAVVHWTRVVVKPVVDDTVFGESRKEKWFETAATAVSLG 189
           +     E  +  R+LFF GLHE ++ W RVVVKPVVD+T++G   +E+W E A  AV+ G
Sbjct: 140 LFLATHERWMLVRVLFFLGLHEVMLMWFRVVVKPVVDNTIYGVYVEERWSERAVVAVTFG 199

Query: 190 GLWWWRLRDEAEALVVVAESKWLT-SAELGPADISGWCLYYITVAIGIAKI 235
            +WWWRLRDE E+LVVV  +  L     L   +   WC+YYI V IG+ KI
Sbjct: 200 IMWWWRLRDEVESLVVVVTADRLNLPIRLEGLNFVNWCMYYICVGIGLMKI 240

The following BLAST results are available for this feature:

BLAST of Cla97C10G194290 vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038904328.1  2.7e-127  91.25         uncharacterized protein LOC120090682 [Benincasa hispida]
XP_008454283.1  1.2e-116  84.91         PREDICTED: uncharacterized protein LOC103494729 [Cucumis melo]
XP_011652956.2  1.2e-114  84.15         uncharacterized protein LOC105435160 [Cucumis sativus]
XP_022983396.1  2.7e-111  82.51         uncharacterized protein LOC111482002 [Cucurbita maxima]
XP_023529111.1  3.6e-108  80.23         uncharacterized protein LOC111791848 [Cucurbita pepo subsp. pepo]

BLAST of Cla97C10G194290 vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A1S3BZH0      6.0e-117  84.91         uncharacterized protein LOC103494729 OS=Cucumis melo OX=3656 GN=LOC103494729 PE=...
A0A0A0KWP5      5.6e-115  84.15         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G003705 PE=4 SV=1
A0A6J1J235      1.3e-111  82.51         uncharacterized protein LOC111482002 OS=Cucurbita maxima OX=3661 GN=LOC111482002...
A0A5A7TRD0      1.5e-107  87.39         Histone-lysine N-methyltransferase EZA1 isoform X1 OS=Cucumis melo var. makuwa O...
A0A5D3E141      1.5e-107  87.39         Histone-lysine N-methyltransferase EZA1 isoform X3 OS=Cucumis melo var. makuwa O...

BLAST of Cla97C10G194290 vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G47360.1     1.4e-41   40.65         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G02570.1     1.1e-38   40.40         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G02575.1     9.0e-33   40.69         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source   Source Term    Source Description     Alignment
None      No IPR available  PANTHER  PTHR37172:SF3  TRANSMEMBRANE PROTEIN  coord: 7..253
None      No IPR available  PANTHER  PTHR37172      TRANSMEMBRANE PROTEIN  coord: 7..253

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C10G194290.1  Cla97C10G194290.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0032259      methylation
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0031519      PcG protein complex
molecular_function  GO:0008168      methyltransferase activity