CaUC09G166780 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC09G166780
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: KAT8 regulatory NSL complex subunit 2
Location: Ciama_Chr09: 9256122 .. 9257816 (-)
RNA-Seq Expression: CaUC09G166780
Synteny: CaUC09G166780
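The Location field above can be split into its parts with a short script. This is a minimal sketch: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("Ciama_Chr09: 9256122 .. 9257816 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 1695 bp on the minus strand of Ciama_Chr09.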
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGAATCAAACTCGCCTGGTTCGTTTCAACCTCCTCCTGTTACCCCACTTCCTATTCTTATTGATGGGGCAGATCGCGATCTAGCACTCGCCTCTTCTGAGTTCTGTGCTCGTCGAGAAGTACTCGAGCGGCGGTCACGGAGAGTGAAACAACTTGTTCGAATCTTAAAGGAAGTGTACTGGTTTTTGTTGGAGGAAGTGAGGCGCAAATACAGGGAGTATTATTGGACATATGGCAAGAGTCCATTTAAGGAGGACGAGAAGGAGGCTGAGGGCATTGGTGATTATCCAGAGGGTATTGGGGAGAATGGAAAGCTAGGATTAGGTTCTACGACCGGGAGTGATGAGATTAGAAGGTGTGATGTCACAGGTTGCAAGGCAAAGGCAATGGCTTTGACAAAATACTGTCATGCTCATATCCTCTCGGATAAAAAGCAGAGGCTCTACAAGGGTTGCACCTTTGTAATCAAGAGGTTTGCATTCTGTATAACTGATAATTTCTTTTCGCTGCTTAGAAGTACTATCAAATATTGAATTTCTACTTTATTAGCAATAGCAATAAATAGAGAATGATTGTCTATTATTGATTGGAGTATGATGATTTATGATGTTCTTGCTTGTTATGCACTTTACTAACAGTTTATTCATAAATTAATTATCAATTTTCTTTAATTTTTCTTGTTTGGAGGACTGTCTAAGATACAATTTTCTTAGAGTTGGTCAGAGTAGACCCTAAGGATGAAGGCTGCATAAAGTTTTTTCTAATGTTCAAGTCCTACCATCCTCTTAGTTAAAGAACACGAACAATGGCTATCATCTATCTACTGATTTAATTTACCAAGTGAAACCATGTGATAACTGGTAAGAAACCAAGCAAACTTATCATTGCGATATTTACAGGAAATTTTGTTGTCCCATAGGCCTCTAACGTTTTGTGTGTCTCAATCCTACTTCACCAGTGCAAGGTTAGTGAGCTGAATCTAATTAATGAGAAAAGGCAGCCTCCTCTAGGGTCTGCTCCCACCCAAAGAACTCTCCAATATTTATGGTGGATCCTACAAAGAAAAGAAGCAAGTTTTGGGGTTTTATTTGTGTATTCTTTTAAGAATATTTTTACAATTGAAAAGTAGATAACATGGTTGAGCTGTGAATTGGATGTCACGCCATTTTTTTTAATCGATCTAAAGTTGATGTTCACACCCTTTGATTCAGCTTTTATCTTTCCTTTTTGGGGATTTTGTGTTTGAGACTCTGGAGTTAGGGGCAATGCAAATTGACATTTGGAATTTTGGGTAGGCATAGATGTCTTGTGTTGAGGAAACTATAATTTGGTGTCTGACTTGTGGATTTATTATCAATAATTATGATCAGCTCAACCTTCTATTGCAGCAATATGGTGCTTTTTGGTTTTCCATTAACAAGAAACTCTCTTCCACTTACAGTATGCAGTCAGGGCCGCTTCTTTGTTCAAAGCCTGTTTTAAGATCAACTGTTCCCTGCTATTGCCCTGGTCATCTACAAAAAGGCGAAAAGTGTTTAGCTCGAGATTTAAGAAAAGCAGGTCTTAACGTCTCCTCGACTAGTAAGCTTCGTCCCGATTTCCATGTATTGGTAGCTGAATACATTCGCCAAATACAAAGCAAAAGGAGAGCGACGAGAAAGGCTACTGCTGTTAAAATTGAGAGTAACTGA

mRNA sequence

ATGGCAGAATCAAACTCGCCTGGTTCGTTTCAACCTCCTCCTGTTACCCCACTTCCTATTCTTATTGATGGGGCAGATCGCGATCTAGCACTCGCCTCTTCTGAGTTCTGTGCTCGTCGAGAAGTACTCGAGCGGCGGTCACGGAGAGTGAAACAACTTGTTCGAATCTTAAAGGAAGTGTACTGGTTTTTGTTGGAGGAAGTGAGGCGCAAATACAGGGAGTATTATTGGACATATGGCAAGAGTCCATTTAAGGAGGACGAGAAGGAGGCTGAGGGCATTGGTGATTATCCAGAGGGTATTGGGGAGAATGGAAAGCTAGGATTAGGTTCTACGACCGGGAGTGATGAGATTAGAAGGTGTGATGTCACAGGTTGCAAGGCAAAGGCAATGGCTTTGACAAAATACTGTCATGCTCATATCCTCTCGGATAAAAAGCAGAGGCTCTACAAGGGTTGCACCTTTCTCAACCTTCTATTGCAGCAATATGGTGCTTTTTGGTTTTCCATTAACAAGAAACTCTCTTCCACTTACAGTATGCAGTCAGGGCCGCTTCTTTGTTCAAAGCCTGTTTTAAGATCAACTGTTCCCTGCTATTGCCCTGGTCATCTACAAAAAGGCGAAAAGTGTTTAGCTCGAGATTTAAGAAAAGCAGGTCTTAACGTCTCCTCGACTAGTAAGCTTCGTCCCGATTTCCATGTATTGGTAGCTGAATACATTCGCCAAATACAAAGCAAAAGGAGAGCGACGAGAAAGGCTACTGCTGTTAAAATTGAGAGTAACTGA

Coding sequence (CDS)

ATGGCAGAATCAAACTCGCCTGGTTCGTTTCAACCTCCTCCTGTTACCCCACTTCCTATTCTTATTGATGGGGCAGATCGCGATCTAGCACTCGCCTCTTCTGAGTTCTGTGCTCGTCGAGAAGTACTCGAGCGGCGGTCACGGAGAGTGAAACAACTTGTTCGAATCTTAAAGGAAGTGTACTGGTTTTTGTTGGAGGAAGTGAGGCGCAAATACAGGGAGTATTATTGGACATATGGCAAGAGTCCATTTAAGGAGGACGAGAAGGAGGCTGAGGGCATTGGTGATTATCCAGAGGGTATTGGGGAGAATGGAAAGCTAGGATTAGGTTCTACGACCGGGAGTGATGAGATTAGAAGGTGTGATGTCACAGGTTGCAAGGCAAAGGCAATGGCTTTGACAAAATACTGTCATGCTCATATCCTCTCGGATAAAAAGCAGAGGCTCTACAAGGGTTGCACCTTTCTCAACCTTCTATTGCAGCAATATGGTGCTTTTTGGTTTTCCATTAACAAGAAACTCTCTTCCACTTACAGTATGCAGTCAGGGCCGCTTCTTTGTTCAAAGCCTGTTTTAAGATCAACTGTTCCCTGCTATTGCCCTGGTCATCTACAAAAAGGCGAAAAGTGTTTAGCTCGAGATTTAAGAAAAGCAGGTCTTAACGTCTCCTCGACTAGTAAGCTTCGTCCCGATTTCCATGTATTGGTAGCTGAATACATTCGCCAAATACAAAGCAAAAGGAGAGCGACGAGAAAGGCTACTGCTGTTAAAATTGAGAGTAACTGA

Protein sequence

MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEVYWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYIRQIQSKRRATRKATAVKIESN
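
The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code and uses only the first 24 and the last 15 nucleotides of the CDS listed above; the function name `translate` and the variable names are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCAGAATCAAACTCGCCTGGT"  # first 24 nt of the CDS above
cds_end = "ATTGAGAGTAACTGA"             # last 15 nt of the CDS above

print(translate(cds_start))  # MAESNSPG
print(translate(cds_end))    # IESN
```

Both fragments agree with the start (MAESNSPG) and the end (IESN) of the protein sequence shown above.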
Homology
BLAST of CaUC09G166780 vs. NCBI nr
Match: XP_038896100.1 (INO80 complex subunit D-like isoform X2 [Benincasa hispida] >XP_038896101.1 INO80 complex subunit D-like isoform X2 [Benincasa hispida])

HSP 1 Score: 449.1 bits (1154), Expect = 2.6e-122
Identity = 227/261 (86.97%), Positives = 235/261 (90.04%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MAES+SPGSFQPPPVTPLPILIDGADRD ALA+SE CARREVLERRSRRVKQL RILK+V
Sbjct: 1   MAESSSPGSFQPPPVTPLPILIDGADRDRALAASEVCARREVLERRSRRVKQLCRILKQV 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YWFLLEE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS TGSDEIRR
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSVTGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAE++
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEFV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQSKRRATRKATAVKIESN
Sbjct: 241 RQIQSKRRATRKATAVKIESN 241
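
The percentages in the HSP header above are the match counts divided by the alignment length. A minimal sketch of that arithmetic, with the hypothetical helper name `percent`:

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(227, 261)   # identical residues over alignment length
positives = percent(235, 261)  # identical plus conservative substitutions

print(identity, positives)  # 86.97 90.04
```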

BLAST of CaUC09G166780 vs. NCBI nr
Match: XP_038896099.1 (uncharacterized protein LOC120084404 isoform X1 [Benincasa hispida])

HSP 1 Score: 449.1 bits (1154), Expect = 2.6e-122
Identity = 227/261 (86.97%), Positives = 235/261 (90.04%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MAES+SPGSFQPPPVTPLPILIDGADRD ALA+SE CARREVLERRSRRVKQL RILK+V
Sbjct: 107 MAESSSPGSFQPPPVTPLPILIDGADRDRALAASEVCARREVLERRSRRVKQLCRILKQV 166

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YWFLLEE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS TGSDEIRR
Sbjct: 167 YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSVTGSDEIRR 226

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 227 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 286

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAE++
Sbjct: 287 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEFV 346

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQSKRRATRKATAVKIESN
Sbjct: 347 RQIQSKRRATRKATAVKIESN 347

BLAST of CaUC09G166780 vs. NCBI nr
Match: XP_008447279.1 (PREDICTED: INO80 complex subunit D-like isoform X1 [Cucumis melo])

HSP 1 Score: 438.3 bits (1126), Expect = 4.5e-119
Identity = 219/261 (83.91%), Positives = 231/261 (88.51%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MA+SNSPGSFQPPPVTP PILIDGADRD ALASS  C+RREVLERRSRR KQL RI KE+
Sbjct: 1   MADSNSPGSFQPPPVTPFPILIDGADRDRALASSMVCSRREVLERRSRRAKQLCRIFKEL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YWFLLEE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS+TGSDEIRR
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSSTGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVL+AEY+
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQSKRRAT++ATA+KIESN
Sbjct: 241 RQIQSKRRATKRATAIKIESN 241

BLAST of CaUC09G166780 vs. NCBI nr
Match: XP_004139660.1 (INO80 complex subunit D [Cucumis sativus] >KGN44580.1 hypothetical protein Csa_015804 [Cucumis sativus])

HSP 1 Score: 437.6 bits (1124), Expect = 7.7e-119
Identity = 219/261 (83.91%), Positives = 230/261 (88.12%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MAESNSPGSFQPPPVTPLPILIDGADRD ALA+S  C+RREVLERRSRR KQL RI KE+
Sbjct: 1   MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKEL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YWFLLEE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGL S TGSDEIRR
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLASATGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVL+AEY+
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQSKRRAT++ATA+KIESN
Sbjct: 241 RQIQSKRRATKRATAIKIESN 241

BLAST of CaUC09G166780 vs. NCBI nr
Match: XP_022953677.1 (INO80 complex subunit D-like [Cucurbita moschata] >XP_022991632.1 INO80 complex subunit D-like [Cucurbita maxima] >XP_023548640.1 INO80 complex subunit D-like [Cucurbita pepo subsp. pepo] >KAG7014493.1 INO80 complex subunit D, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 420.2 bits (1079), Expect = 1.3e-113
Identity = 213/261 (81.61%), Positives = 224/261 (85.82%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MAESNSPGSFQ PP  P P++IDGA+ DLALAS EF  RREVLERRSRRVKQL R+ +E+
Sbjct: 1   MAESNSPGSFQSPPAPPHPMVIDGANHDLALASCEFFTRREVLERRSRRVKQLCRVYREL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YW L+EE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS TGSDEIRR
Sbjct: 61  YWALMEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSVTGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAE +
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAECV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQ KRRA RKATAVKIESN
Sbjct: 241 RQIQVKRRAARKATAVKIESN 241

BLAST of CaUC09G166780 vs. ExPASy Swiss-Prot
Match: Q54J07 (INO80 complex subunit D OS=Dictyostelium discoideum OX=44689 GN=DDB_G0288447 PE=3 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 8.3e-07
Identity = 51/208 (24.52%), Positives = 76/208 (36.54%), Query Frame = 0

Query: 21  LIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEVYWFLLEEVRRKYREYYWTYG 80
           L +  D D   ASS      E+++RR   + +L+ + K+ Y    E +R   R Y  T  
Sbjct: 407 LCEDFDSDFYFASSSVLTDEELIQRRKIYISKLILLYKKQYNRFKERLRIIRRHYISTSL 466

Query: 81  KSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGS------------------------D 140
               + D  + E   +    I  N      +   +                        +
Sbjct: 467 SLNQQNDSNKMEIDNNNDNNINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNKLNKRKE 526

Query: 141 EIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSS 200
           E   C    CK K M L+KYC++HIL DK Q+L+  CT           +  S NKK   
Sbjct: 527 EGNLCLSVNCKVKPMLLSKYCYSHILQDKDQKLFHECT-----------YQLSANKK--- 586

Query: 201 TYSMQSGPLLCSKPVLRSTVPCYCPGHL 205
                     C  P+L+  +P  C  HL
Sbjct: 587 ----------CGYPILKVQIPTLCREHL 590

BLAST of CaUC09G166780 vs. ExPASy TrEMBL
Match: A0A1S3BH19 (KAT8 regulatory NSL complex subunit 2 OS=Cucumis melo OX=3656 GN=LOC103489752 PE=4 SV=1)

HSP 1 Score: 438.3 bits (1126), Expect = 2.2e-119
Identity = 219/261 (83.91%), Positives = 231/261 (88.51%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MA+SNSPGSFQPPPVTP PILIDGADRD ALASS  C+RREVLERRSRR KQL RI KE+
Sbjct: 1   MADSNSPGSFQPPPVTPFPILIDGADRDRALASSMVCSRREVLERRSRRAKQLCRIFKEL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YWFLLEE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS+TGSDEIRR
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSSTGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVL+AEY+
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQSKRRAT++ATA+KIESN
Sbjct: 241 RQIQSKRRATKRATAIKIESN 241

BLAST of CaUC09G166780 vs. ExPASy TrEMBL
Match: A0A0A0K4I2 (KAT8 regulatory NSL complex subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_7G337070 PE=4 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 3.7e-119
Identity = 219/261 (83.91%), Positives = 230/261 (88.12%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MAESNSPGSFQPPPVTPLPILIDGADRD ALA+S  C+RREVLERRSRR KQL RI KE+
Sbjct: 1   MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKEL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YWFLLEE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGL S TGSDEIRR
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLASATGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVL+AEY+
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQSKRRAT++ATA+KIESN
Sbjct: 241 RQIQSKRRATKRATAIKIESN 241

BLAST of CaUC09G166780 vs. ExPASy TrEMBL
Match: A0A6J1JMD6 (KAT8 regulatory NSL complex subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC111488191 PE=4 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 6.1e-114
Identity = 213/261 (81.61%), Positives = 224/261 (85.82%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MAESNSPGSFQ PP  P P++IDGA+ DLALAS EF  RREVLERRSRRVKQL R+ +E+
Sbjct: 1   MAESNSPGSFQSPPAPPHPMVIDGANHDLALASCEFFTRREVLERRSRRVKQLCRVYREL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YW L+EE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS TGSDEIRR
Sbjct: 61  YWALMEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSVTGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAE +
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAECV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQ KRRA RKATAVKIESN
Sbjct: 241 RQIQVKRRAARKATAVKIESN 241

BLAST of CaUC09G166780 vs. ExPASy TrEMBL
Match: A0A6J1GNX0 (KAT8 regulatory NSL complex subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111456134 PE=4 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 6.1e-114
Identity = 213/261 (81.61%), Positives = 224/261 (85.82%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MAESNSPGSFQ PP  P P++IDGA+ DLALAS EF  RREVLERRSRRVKQL R+ +E+
Sbjct: 1   MAESNSPGSFQSPPAPPHPMVIDGANHDLALASCEFFTRREVLERRSRRVKQLCRVYREL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YW L+EE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS TGSDEIRR
Sbjct: 61  YWALMEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSVTGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYI 240
           QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAE +
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAECV 240

Query: 241 RQIQSKRRATRKATAVKIESN 262
           RQIQ KRRA RKATAVKIESN
Sbjct: 241 RQIQVKRRAARKATAVKIESN 241

BLAST of CaUC09G166780 vs. ExPASy TrEMBL
Match: A0A5D3DT82 (KAT8 regulatory NSL complex subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold124G00240 PE=4 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 4.6e-109
Identity = 201/239 (84.10%), Positives = 209/239 (87.45%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEV 60
           MA+SNSPGSFQPPPVTP PILIDGADRD ALASS  C+RREVLERRSRR KQL RI KE+
Sbjct: 1   MADSNSPGSFQPPPVTPFPILIDGADRDRALASSMVCSRREVLERRSRRAKQLCRIFKEL 60

Query: 61  YWFLLEEVRRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSTTGSDEIRR 120
           YWFLLEE++RKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGS+TGSDEIRR
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLGSSTGSDEIRR 120

Query: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSM 180
           CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTF+                      SM
Sbjct: 121 CDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFV--------------------IKSM 180

Query: 181 QSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEY 240
           QSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVL+AEY
Sbjct: 181 QSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEY 219

BLAST of CaUC09G166780 vs. TAIR 10
Match: AT2G31600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G53860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 162.9 bits (411), Expect = 3.4e-40
Identity = 104/266 (39.10%), Positives = 145/266 (54.51%), Query Frame = 0

Query: 6   SPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEVYWFLL 65
           +P +   P  +  PI +  +  D  LA S    R E+L+RRS  +KQL +  ++ YW L+
Sbjct: 49  NPSTSGLPSTSNSPITM--SQEDEILARSSHITRSELLKRRSHNLKQLAKCYRDNYWALM 108

Query: 66  EEVRRKYREYYWTYGKSPFKEDEKEA--------EGI----GDYPEGIGENGKLGLGSTT 125
           E+V+ ++R+Y+W YG S FK++  ++        EG     GD  EG G+N     G  +
Sbjct: 109 EDVKAQHRDYWWKYGISQFKDENNQSNKRRRLGQEGDIGDGGDAVEGSGDNVTNNDGVKS 168

Query: 126 GSDEIRRCD--VTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSIN 185
                  C   + GCKAKAMALTKYC  HIL D KQ+LY GCT              ++ 
Sbjct: 169 DQYANSNCGSCMYGCKAKAMALTKYCQLHILKDSKQKLYTGCT--------------NVI 228

Query: 186 KKLSSTYSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPD 245
           K+        +GPLLC KP L STVP  C  H QK +K +A+ L+ AG NVSSTSK  P 
Sbjct: 229 KR------APAGPLLCGKPTLASTVPALCNIHFQKAQKHVAKALKDAGHNVSSTSKPPPK 288

Query: 246 FHVLVAEYIRQIQSKRRATRKATAVK 258
            HV+VA ++  IQ+KR+  +K   +K
Sbjct: 289 LHVIVAAFVHHIQAKRKNPQKECKLK 292

BLAST of CaUC09G166780 vs. TAIR 10
Match: AT3G53860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G31600.1); Has 70 Blast hits to 70 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 66; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 155.2 bits (391), Expect = 7.1e-38
Identity = 96/231 (41.56%), Positives = 131/231 (56.71%), Query Frame = 0

Query: 28  DLALASSEFCARREVLERRSRRVKQLVRILKEVYWFLLEEVRRKYREYYWTYGKSPFKED 87
           D  LASS    R E+L RR+  +KQL +  K  YW L+E+++ ++R+Y+  YG S FK++
Sbjct: 65  DEILASSSHLTRPELLRRRADNLKQLAKCYKNHYWALMEDLKAQHRDYWCKYGVSQFKDE 124

Query: 88  EKEAEGIGDY-PEGIGENGKLGLGSTTGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKK 147
           + ++       PEG G+ G    G    +     C + GCKAKAMALTKYC  HIL D K
Sbjct: 125 QNQSNKRRRLDPEGSGDKG--NDGDQYANSNSGFC-MYGCKAKAMALTKYCQLHILKDSK 184

Query: 148 QRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSMQSGPLLCSKPVLRSTVPCYCPGHLQK 207
           Q+LY GCT +             IN+         +GPLLC KP L STVP  C  H QK
Sbjct: 185 QKLYTGCTNV-------------INRS-------PAGPLLCGKPTLASTVPVLCNVHYQK 244

Query: 208 GEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYIRQIQSKRRATRKATAVK 258
            +K +A+ L+ AG NVSSTSK  P  HV+VA ++  IQ++R+   K   +K
Sbjct: 245 AQKNVAKALKDAGHNVSSTSKPPPKLHVIVAAFVHHIQAQRKNPHKEGKLK 272

BLAST of CaUC09G166780 vs. TAIR 10
Match: AT1G05860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G31600.1); Has 101 Blast hits to 100 proteins in 32 species: Archae - 0; Bacteria - 0; Metazoa - 28; Fungi - 2; Plants - 66; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink). )

HSP 1 Score: 154.8 bits (390), Expect = 9.3e-38
Identity = 97/243 (39.92%), Positives = 132/243 (54.32%), Query Frame = 0

Query: 22  IDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEVYWFLLEEVRRKYREYYWTYGK 81
           I  A  D  L +S    R E+L RRS  +KQL R  ++ YW L+E+++ ++R Y W YG 
Sbjct: 51  ISMAVEDQILGNSNHLTRPELLRRRSHNLKQLSRCYRDHYWALMEDLKAQHRYYSWNYGV 110

Query: 82  SPFKED------EKEAEG-IGDYPEGIGENGKLGLGSTTGSDEIRRCDVTGCKAKAMALT 141
           SPFK++       ++ EG  GD  EG G+N           + +  C  +GCK+KAMALT
Sbjct: 111 SPFKDENYHQNKRRKVEGQTGDEIEGSGDNDNNNNDGVKAGNCV-ACG-SGCKSKAMALT 170

Query: 142 KYCHAHILSDKKQRLYKGCTFLNLLLQQYGAFWFSINKKLSSTYSMQSGPLLCSKPVLRS 201
            YC  HIL DKKQ+LY  CT+              +NK+       QS  + C KP L S
Sbjct: 171 NYCQLHILMDKKQKLYTSCTY--------------VNKR------AQSKAITCPKPTLAS 230

Query: 202 TVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEYIRQIQSKRRATRKAT 258
           TVP  C  H QK +K +AR L+ AG NVSS S+  P  H +VA ++  IQ+KR+  RK  
Sbjct: 231 TVPALCNVHFQKAQKDVARALKDAGHNVSSASRPPPKLHDIVAAFVHHIQAKRKDPRKEG 271

BLAST of CaUC09G166780 vs. TAIR 10
Match: AT2G31600.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G53860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 102.4 bits (254), Expect = 5.5e-22
Identity = 64/163 (39.26%), Positives = 90/163 (55.21%), Query Frame = 0

Query: 6   SPGSFQPPPVTPLPILIDGADRDLALASSEFCARREVLERRSRRVKQLVRILKEVYWFLL 65
           +P +   P  +  PI +  +  D  LA S    R E+L+RRS  +KQL +  ++ YW L+
Sbjct: 49  NPSTSGLPSTSNSPITM--SQEDEILARSSHITRSELLKRRSHNLKQLAKCYRDNYWALM 108

Query: 66  EEVRRKYREYYWTYGKSPFKEDEKEA--------EGI----GDYPEGIGENGKLGLGSTT 125
           E+V+ ++R+Y+W YG S FK++  ++        EG     GD  EG G+N     G  +
Sbjct: 109 EDVKAQHRDYWWKYGISQFKDENNQSNKRRRLGQEGDIGDGGDAVEGSGDNVTNNDGVKS 168

Query: 126 GSDEIRRCD--VTGCKAKAMALTKYCHAHILSDKKQRLYKGCT 155
                  C   + GCKAKAMALTKYC  HIL D KQ+LY GCT
Sbjct: 169 DQYANSNCGSCMYGCKAKAMALTKYCQLHILKDSKQKLYTGCT 209

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038896100.1  2.6e-122  86.97         INO80 complex subunit D-like isoform X2 [Benincasa hispida] >XP_038896101.1 INO8...
XP_038896099.1  2.6e-122  86.97         uncharacterized protein LOC120084404 isoform X1 [Benincasa hispida]
XP_008447279.1  4.5e-119  83.91         PREDICTED: INO80 complex subunit D-like isoform X1 [Cucumis melo]
XP_004139660.1  7.7e-119  83.91         INO80 complex subunit D [Cucumis sativus] >KGN44580.1 hypothetical protein Csa_0...
XP_022953677.1  1.3e-113  81.61         INO80 complex subunit D-like [Cucurbita moschata] >XP_022991632.1 INO80 complex ...

Match Name      E-value   Identity (%)  Description
Q54J07          8.3e-07   24.52         INO80 complex subunit D OS=Dictyostelium discoideum OX=44689 GN=DDB_G0288447 PE=...

Match Name      E-value   Identity (%)  Description
A0A1S3BH19      2.2e-119  83.91         KAT8 regulatory NSL complex subunit 2 OS=Cucumis melo OX=3656 GN=LOC103489752 PE...
A0A0A0K4I2      3.7e-119  83.91         KAT8 regulatory NSL complex subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_7G337070...
A0A6J1JMD6      6.1e-114  81.61         KAT8 regulatory NSL complex subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC11148819...
A0A6J1GNX0      6.1e-114  81.61         KAT8 regulatory NSL complex subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111456...
A0A5D3DT82      4.6e-109  84.10         KAT8 regulatory NSL complex subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=...

Match Name      E-value   Identity (%)  Description
AT2G31600.1     3.4e-40   39.10         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G53860.1     7.1e-38   41.56         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G05860.1     9.3e-38   39.92         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G31600.2     5.5e-22   39.26         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                        Source   Source Term    Source Description                Alignment
IPR025927  Potential DNA-binding domain           PFAM     PF13891        zf-C3Hc3H                         coord: 122..203; e-value: 6.1E-14; score: 52.1
IPR026316  KAT8 regulatory NSL complex subunit 2  PANTHER  PTHR13453      UNCHARACTERIZED                   coord: 178..259; coord: 9..156
None       No IPR available                       PANTHER  PTHR13453:SF7  DOMAIN PROTEIN, PUTATIVE-RELATED  coord: 178..259; coord: 9..156

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC09G166780.1  CaUC09G166780.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0043984      histone H4-K16 acetylation
biological_process  GO:0043981      histone H4-K5 acetylation
biological_process  GO:0043982      histone H4-K8 acetylation
cellular_component  GO:0044545      NSL complex
cellular_component  GO:0000123      histone acetyltransferase complex
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0046983      protein dimerization activity