Sed0006305 (gene) Chayote v1

Overview
Name: Sed0006305
Type: gene
Organism: Sechium edule (Chayote v1)
Description: KAT8 regulatory NSL complex subunit 2
Location: LG09: 34886032 .. 34896999 (-)
RNA-Seq Expression: Sed0006305
Synteny: Sed0006305
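
The location string in the overview can be split into its parts when the record is processed programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Split a string such as 'LG09: 34886032 .. 34896999 (-)' into its parts.

    Returns (sequence id, start, end, strand). Hypothetical helper.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("LG09: 34886032 .. 34896999 (-)")
print(seqid, strand, span_length(start, end))
```

The "(-)" marks the gene as lying on the reverse strand of linkage group LG09.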
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTGGAGTTGGACCCTTTTTATCTCTCCTCATCAATTTCTCTGCACATAGACTCACGCTCACAGTCCCTCACTCTCTCTTCTCATCCATACATCTCCCCAACTCGCCGGAAACCGCCATTAAAACCGCCGTCTGAAGCTTTCTCAGTCACCCCCTTCACCGCCGCAAAGCCTCCGTTCTCAACCCTTAACCACGAGCCGTCCGATTCAACCCTGCTGCACCGGAGAGACGTCCTTGGAACGCGAATTCGCCGAACCGGAATCTGCTCCACCAGCCACCTGTGTGACCCGGTTCCAACCCGAACGGGCTTCCCAACCTGTAGATCTAGACCCTTGAGTTTCGTGCTACTTGGGTAAATTTCCTTTGCTCAACTAATTCCTTGACAATTTTGTAAATTAACGCATATTAACTTATGGTTAGGTCTGGCTAGGTTCGTGATCATTTTTTTTGCTGATCGGAGTGGTGGAATCTCGATCAATTCATCCGGTTAAGGTAAGGAATGTTTTACCCTTTTGCGGCGGTAATACGATCATTAATGACGCTTATGTAGTTGCTTGGTTTGAATTGATGATTCTGGTATGAATTTCTGTGGAATTGGATTAAGTCAAGAATAATATGCTTGGTTTGAATTGATAGTATACTGTATCCCCAAGGAGTGGCACAGTGGTTGTAGACTTGGGCTTTGAGGGTATGCTCTCCTCAAGGTCCCAGGTTCGAGACTCAGTTGTGATATTACTCCTTTAATGTCTCCCGGCACTTGGCCTAGGGATAGGCGTGGTTATCCTTGTTTCAAAAAAAATATGTTGTGTGAGACTAAAATCATATTAAGATGGATGATGAGCTTCAAACGACATGTTAGACTTGAACTAGTCTAAGTAGGTACAATTTGCTTGAGTTGTTTATTCGGTTTAGTATTGTGGACTTGTTGGTCCCTACTCTTGCTAACACGGACTTGATGATCCTTGCTATTGCTATTGCGGACTTGTTGGTCCTCTTCATTCATATTGCGGGCTAGGAGGCTCCGATGTGAGGGGTTATATCCCATGTGTATTAATCTTAGAAGTTATATTAACCTTCACAATGATTTGGTTGATTAGATTATCCCATCAAACTGATCTCTATTACTTCATGAGAGGAGTATTCTACCTATCTTTTGCCTAAATCGACGAAGTTAAGTTCAGTTTTCTAGTCACCATTGGGTCGTCTTCTCAGTCTTCCTCTTTATCATCTTCATTTTCTTATTCCCTTTCTCTACCATTATCCAGCATCAAAAGGCTACCCAATTTTGCCTCAATCCTCCTCAACTGAATCTCGAAGTTGCATCAACTCTATTGATATATGATAGGAACCTCCCATTCTCAACTGAAATCTTGAACTTGGAATCTCCCATTCTCAAGGGAACTCTCTCTTTGAGCCCACAAGATTTCACACAATATTTAGATGCCTTTCACTACCTAATTCCCCCTTATTTATAACCATATTCTCTAATAAACTCTCTAACTAATTTCCTATATACCCCGACATTGTTCTTAATTCCTCTAAATTCCCAAACCTATCAATATCCTGTTTCACATTATTTTTTATCACCAATTAGGAGTAAATAGACAACTTCTGTTGTAATCACATCTCAATTAATTAAGTGTAAGAAAAATTTTTGTGTTCATTTGATCTTTAAGTATGTGTTGTGCGCCAAAAAGAGCATAGCTCTAGTGGCATTGTGTATGACCTCATAACTAAGTCGCTATGGGTTCGAATCGCCATCTTCAATTGTTATTGAATTAAAAAAAGCAGTGTTGTTGAGGCGACCCCAGGCGCTCGCCTATGGCGAGAGGCGTGGCGACCTGGGGCCCATGCGCCTCTTAAGCGGCCAGGGCGAGCGACTTCACTCAAGCGCTCGCCTACCTTCGCCTCTTTCCGCCCAACAACGCCTTCGCCTCTGTGGAAGTCGTGCGCTTGAGTTTTTTTTTTTCAGATTTATAAAACAGAGTCATCTTCAAGGCA
AAAGGTTGAAGATGATGAACAGTGTATTTATTTATTTTCTTCGTGAAGTGTTTAGAAATGGATGATAATATAAAGCAAATTTCTTTTGGGTAAAATGCTTTATTAAAAGTGAAAAGGCAACCTATTAATTTTTAAGTTTTCTTTTAATTTTTTGTTTGATTAACTTTGAATAGCGTGCAAACTTGATAAAATCCTTTGAATAAAAGAACTTTTATTTAAAATGTGAAATGACAAGCCATCATTCAAAGTTAAGTTTTTTGTTTTTTTTTGTTCATTATTAACTATTAACTCTCAATATTGATAATTGGGAATTTTAGTTTTCATTTATTTTTTCTATTTTAAATTAGTTAAAATATAAACTATAATTTGTTTAGTTTTATTAATTCTGCAGATAAAGCTTACTATTTTAGTTACATTATTTTTTTACTTTATATTATGCTTAAAATATAAAGTAAAAATTGTTTATACCTTTAATAACCTGGAGCCCAAAGGTTCCAATTGACTTCAAAATATATAAATGGTGTATTTTAAATTTGTTCAACATTTTTTATTTTTTATTTTTTGCCATTTTCCATTTGATTGATTTTTTTGTTTGTTTATTTTGATGTTTTTGCTATGTAGTATTATTTGGTAAGAATTTGATATTTTATTTGGAAACATCTTATCATATTTTGTTTTATATTTTTATTCTTTTTTTCTTCTTTTATTTATTAATAGTATATATAAGGGTTGTTTAAAATTTAAAACAATTTATAATACAAGGGCATCAAAAGTTGATGACTTAATATTTTCACAATTTTAATGATTGTTTATTATTTATTGGCTTTTGATATATAAATTGTTTTTAACTTTAATATTTTTCACATTTAATATTTGGATGTTTTTACCATTTAGGATTTCATATTTTGTGTAAATAAGAATCTATTTTCAATATCAATTTATCTTTATATATTTATATATGTTATTATATTTTGTTATGTATTTATATATTGCGCCTCGCTTCACTCGGGCCTCGCCTTTTTGTCGCTTCTCTCCTTGAGGCAATCAAGGAGAGTGTCGCCTTGCGTTGCGCTTTGTGCTTTGAAAACACTGAAAAAAAGTATGTTATGTGCAAATGGGTGTTGAAGTTGGTGTTTGTTGAGCTATATGAAATTGAGGAGTCTTTGCTGGGTTTCGACATTCTTTCAAACTACATTTTCTCGTCTTCACAATGCTTAACAGACTCTTTCTCCTTCACATTTCTTGTTTTGTTTCACAGTGCCCAAATCACTAAATTTTTTCTTAACTTAATTTTCTATTGTGCCTGTTGCTCCAACTGTGAAACATAAAAAGGGACCTTCTGCGTCCGTTCAGAAAAGCAAGTCCAAGAGGGAATTAAAAAACCTGCACTCATCAATCTGCTACGATAAAATTGCCTCATTGGACTCAAGGGTGGTATTTGAAACTGATTAATGACATTCCTTTCGTGGAATGTTCGTGGAATCGACTCTTGGAAGAAAAGGGCTATGGTTAAGAATTGTATTTTACAGCAAAATCTTGATTTGTTCTCTTATAGCAGACTAAACTTTCTTATAGCAGACTAAATCTATTTGGAGTTCTTCACACATTGGATGGATGACACTTGACACTATAGATCATGCAGGGGGTATTTTAGTTCTCTGGAGTGAATTAGATTGTACTGTCTTAGAGGTCATTCGTGGGACTTACACTCTCTCCATCAATGTTTCATTAACGGACGGGTTCTCTTTTTGGGTCTCAGCCATCTATGGCCCTTCTGATAATAGCTATTGCCCAGACTTCTAGCTTGAATTGTACGACGTTGCGGGTCTAGTTGGTGTCTGTTGGATTATTGGAGGAGATTTTAATGTTACTCGTTGGTCTTGGGAAAAATTGCACAATCATTTTGTTAGACTAGAAGCATCATTTCTTTCAATCAGTGGATCCAAGATTTTGACTTAATGGATGCCCCCCTCTGTCGAATGGTACTTTTACTTGGTCCA
GTTTTGGATTCACGCAATATCTTTCAATGCTAGACAAGTTTTTACTTACTGATGGCTTCTCCACTCATTTTGGGTCATCACTCTTCGTAGATAGGATAAAATCACATCGGATCACTACCCATTAATATTGAATTTTGGTAACATTGATGGGGGACCATCTCCCTTTCGTTTTGAGAATAAATGGCTGACAATACCATCCCTCAAGGATCTTTTGGTTTCGTGGTGGTCCCAGAATCCCTTGCAAGGATGGCCTGGTCATGCTTTAATGATAATGTTTAAAGGCCTCAAGATAGTTCTCAGAGAATGGAATAGATTGCAAAAAAGTGATTAGAGTCAACTTCCATCCTTGGTATCCAAATTGAATTTATTGGACGCATTGGAAGATGATAATGGGCTTAGTTTCGCGCTGGTCGAAAATCATCCTACACTTCACGAACATATAGAAGCTCTAAATGCACAAGAACATATTTTTTGGAAAAAGAGATGCAAAATTAAATGACTACAAGAGAGTGATGAAAATACGAAGTTTTTTCATAGATTGCTAGCGGAAAAGAAGAGGAAACATAGAATCAGTGAAATTCTGGTTACAGCTTCGAATATTGAATCTGAATTTCTAGACTTTTTAACCTTGTACTCCAAGGATGGTAATTCTCGCTTCTTGCCTGACAATATCGACTGGTCGCCTATTAGCCTGGTTCAAGCAAGTTCTCTTGAAGCCCCATTCACAGAGGAAAAGGTTTCGCCTGCTGTTAATGCTTTAGGCACATAAAAAACTCTTGACCCAGATGGATTCACCTCTGAATTTATTAAATATTTTTAGGACACGTTGAAATCAAATTTTATGATTATACTGCAGGATTTTTACTCCTCGAGAATTATTAATGTCTCAATGAATGAAACCTATATATGCTTAATTCAAAAAAAGCTTGCAGCGAAATGTGTTTTTTGACTATATGCCTATGAGCCTCACCTCTATGGCTTACAAGGTTTTGGCGTGTGTGTTATCAAATCGTTTGAAATTTGTTCTCCCTTCTACTATCTCTGAAAATCAATTGGCTTTCGTTGCTAATAGATAGATCATTGATGCGTCATTAATGGAAAATGAACTTATTGATGATTGGTTTTATAAAGGGTGAAAAAGGATTGTTTTGAAGCTAGATGTTGAAAAGGCTTTTGATACTGTAGATTGGGATTTCTTGGATGCTATTTTTCGGGCTAAACGTTTTTGTTTGTTATGGCGGAAATGGATTCAAGGTTGTATATCGATTGCCAATTTTTCCAATTTTCATCTCAATGGGAAACCTTGTGGTAAAATTATTCCTTCTAAGGGTATTCGACAAGGTGATCCCATTTCACCGTTCTTGTATATATTGGTCTCGGATTGATGTTGAATTATAGTAGCAGTATTAACTCACTTGCTTCTCACCCAATTGGGTCCAACAATTTCTATTTGAATCACTTACAGTTTGCAGACGATACCCTTCTTTTTTCAACATATGACAAGGTTTCTTTGCATAACCTTTTTTATTTAATTCATCTGTTTGATGGGCTTTAGATTTGAAAATTAACTTGTCTAAAAGTGAGTTCTTGGGTATTAACCGAAGTCGAGTTGAATTGGATGATTGCATCCTTCGGTTGCCAAAAGGGATCCTGGTCTGCCACCTATCTCGGTTTACCACTTGGGGGGAACTCGGGATCTATCTCTATGTGGCATCCTGCCATTGAAAAGATCTGCCATAAACTTCATAATTGGAAGTATGCATTTATTTCAAAAGGCTTCATATGATGTGTATCTCCTATTGGATAAAATTGTTCATAACTTTTTTTTGGGAAGGATCGCATGGTGATAGTGGTGTTCACAACCAGTGTTTTAAAAAGTGCAAGGCGCATCACGGTGCAGTGGTCCTCTGGAGCCTGAGGCGCTAGGCGCACCAAAAGGCGAGAGTTTTTTTTACTTAAGGGACACTATATAAAAAAAAAAAACTTATATATATGTGTAC
ACAACAAAAACACTTTTCATGGACATAAATGAAGTTTTAACCAAGAATATCATATCAAACATACATAATATTCAACTATCAACCTTCACGCGTTTCGAATGACCTCGTACGGGGATCTAAAACACAAAAAGCGCAAAAGAAATAAAATATCCCACGAGGCGCGCGCCTTTTGGTGCTTGGGGCTTCACGTAAGGCGGCATAAGGCGCGCCTTTTAAAACACTGTTCACAATGTTAAGTGGAATACGATACAACTTCCTAAATTAATGGGGGGTATTTTTATCGGGAATTTCGAGCTTCACAACATTTCGTTATTGATCCCGACAAAGTATTATGATTCCTCTTTGGGGTTTTCTTGGCCTCCCCTTATTCAGTCGCGCTCCCATAAGTCTCCTTGGAAAGTCATTTGTTGCTTTAGTCATATAGTGGTAGTGCGTAGTGGCTAGTCGGGCTCGTTGGAAAATAGGTAATGGTGCCAATACTTTTTTCTGGTGTGACTTATGGAGTGTGGAGTTCTCAAGGAGCACTATATGCGACTTCATCGCCTTACTACCCATCCTCGGGCTTCTGTGTCTGAAGCATGAGATCTCTCTACTGCTGTTTGGAACCTTTGACCTCGTCATAATCTGACTCTCGCGAAATTGTAGAATGGGCATCTTTATCTAATCCTTTGTCCTCTTGTAGACTTTTGGGAAGCTTAGATTCATGGATTTGGCCTTTTGATCCTTTAGGTCTTTTCTCGGTTAAATCTCTAAATGAGATCTTGGTTGGGATTGTCGGAGCATCAGAAGTGTTTTTATTCTTTTATTTGGAAAGATGCCTATCCAAGAAAGATAATTTTTTTTGTGGGAACTTAGCTAGGGGGCAGTTAATACAGTTGATCGATTACAAAGGCGTATGCCTTATATGGCTCTTTCTTCTTCGTGGTGTTCCCTATGTCACTCGAGCGTAGAATCGGCAGGCCATTTGTTTGTTTATTGTTCCTTTTCTTGGAGATTTTGGAGTACTGTGTTTGACACATTTGGATGGTTGTTGACTCTACTCTCAACATTAGGGATTTCTTGTCATCAGTTGGTGGGTGCCCCATTTACTGGGCAAAGGAAAATTATATGGTTATCTCTTGTTCGATCCATCCTTTGGCATATTTGGTTGGAAAGGAATAGGCGGGTTTTCAGGGATGAATCTAATAACTTTGATTGTTTTTTTGAACCTGTTCTTACTGACGTTTATTTCTGGTGCAAATCTGTTTATCCTATTGGTATTTACAACCCTTTTTTTTTGTTACTCACTATTGATCGTTTTTGTAATCTTTAGATGATATTCTTCTAAGGCTCTGTATTGGATATTTTCCATCTCTTCATTATATCAATGAAATTGTTTCTTTGTAAAAAAAAAAACTAATTTTTTTCTTAACTTGTTATAGATATTAGTTGTTTTGATATTTTTGTTTTGAAGTCAAGTCTATGTTTTATTTCGTGGTTTGTGATTGTTAGCTTTTATTCAATAAACAAGGTTTTTTTATGGTATCAGACTATCAGTGATAGCTAGGTTTGTTTAGGTTCATATTTGTGTATCCATTTTAGTAATAACTGATTCCTTGATGGTTGGTGTTGTTTCTTTTGGGTTATGAAGTTCTGGTTGCAGATGTAGATTGAGAAATGAACTAAAATTTGGTTAGGTAGGAGGGAGATAGAAAATCTGGAATTCAGTACTCATGAGTTCTACCTGATTTAGTTTCTGAACATTTGGCCTGTTTGATAATCCTTTTTCTTAGGCCCCGTTTGATAACCATTTTGTTTTTTGTTTTTTGTTTTTTGTTTATGAAATTTAAGTCTATTTCTATTCAAATTTCCTACCATGTGCTCCATCTTTCCTACAATGTATCCATCTTTCCTTAAGAAAGTAGGAGAATACGAACCAAATTTTAAAAACATAAACTAGTTTTTGGAAGTTACTTTTTTTTGTTTACAAATTTTGGTTTTGTTTATCAAAATATAGGCAA
GATGTAGATATCCAAATAAGGAAAAAGGTATAGTAAGGTAGTTGTTGCAGGCTTAAATTTCATAAACAAAAAACGAAAAACAAAATGGTTATCAAACGGGGCCTTAGTTTTCTATTTCTGAAAACCATGTTAGTTTCCTTATAGTTTGTTACTAAGTTTTTTGTCTTCCTTCAAACCTGTTTGAATTCTAAGCCAAAATCCAAAAAAATGAAGCATTTAGAAATTATTTTTAGTTTTCTTTTTTTTTTTGCTTGGACTTGGATAAACGTTTTAGAAAAGTAGGTGATAAAGTAAGAATACATGTGGAAGAAGTGTTTATATGATTAATTTTAATAAAAAACAGCTAATACAAAACAAAATTGTTATCAATTAAGGCCTGTGGAGTTTGTAGTAATCCATTTCTATAGAATTATTTTCACCCTTGCAACTTGGTGTTAGGAATAATCATTTTGAACTCGACAATTTTTTTAGGGCATGGCAGAATCGAACTCACCTGGTTCGTTTCAACCTCCTCCGGTTCCCCCACTTCCTATGGTTATTGATGGGGCAGATCATGATGTTGCGCTTGCCTCTTGTGAGTTCTTAACTCGTCGAGAAGTACTTCTGCGTCGGTCTCGGAGAGTGAAGCAACTTTGTAAACTCTATAGGGCACTATACTGGGCTTTAATGGAGGAAGTGAAGCACAAGTACCGTGAGTATTATTGGACGTACGGCAAGAGTCCGTTTAAGGAGGATGAGAAGGAGGCCGAGGGCGGCATTGGTGGTGATTATCCAGAGGGTATTGGGGAGAATGGGAAGCTAGGATCAAGTTCTGCAACGGTCGATGAGATTAGAAGGTGTGAGGTCACTGGTTGCAAGGCAAAGGCAATGGCATTGACAGACTACTGTCATGCTCATATCCTCTCAGATAAAAGGCAGAAGCTCTACAAGGGTTGCACCTTTGTAATCAAGAGGTTTGCATTCTAGCTGAATGGTCAATTCTTTTTGGTGCATTATAGTATTATTAATTTTCTAGTCAATAAGCAATAGCTATAAATTGAGAGTAATTGATCTATTATTGATATCAATATGATGATTTATTATGTTCTTGTGATAGCATTTACAGTAAGGGTATTTTAGTAATTAGTTAAGAGTTTGTTTCAAAGTATTTGGAAGAACAAAATAACTACCTTTTTTCTTCTCTTTGGCTTCTTCGATCTCTGTTTTGGCTCTCCTGTCATCAACATCCTCTCCCATTTTATTTGTTCCATAGCAGCTTGATAGGCACGTTCTTCATATTGTTTTCTTAATTCTTCTGCTCTTTTTTTGCTCTCTTTGGCACTTTGTCTCCACTCCATTTGAAATGATCCGAGAAGCGAATCGATCCTTCACCATGCTTTATATAGATCTATCCCCATGGCCAAAACAGAACAAACTTCTCCGATACCAAATTGATGAGGCCCGGATTAGATTTCAGATAAAAGATAACTAAAATACAAGATAACCCAAGTTTTAATCAGACTTGGAACTTCCTTTTACTCTTGAGAACCTCTCTCAAGTCCTAATACCAAGTCCAAAATGCTTGAGAGATATCTTTCTCCTCCTTCGCTACCTCTGTTTATAATCTAAAATACCAAACAAACTCTTAACAAATTACTAAATACCCTTACTATAAATCCTATCAGCTTGCTCTGCTTTGCACTTCACACTCAGCTATGAATAAATTAATTTTCAATATGTTTTCCCTTTGGACTGTCTTGAATACAATTTTATTAGAGTTGGTCAAAGTAGATCCTAAGGATGAAGGCATTTCTAACACTCAAGTCCTTAAAAATTCCTATTATAGAATGGAGCACCATTTCCCCCACCCTTTTAGTTAAAGAAAAGTGGCAATCATCTATATACTGCTTGAATTTACCCAATGAAAGCCAAGCTAACTTATCATTACAATACCTTTGAGAAATCATCAGTAAGGTCTTGTTGTGTTTGATTCAGTCATTTTTTTGGTTATTTTGGTCTTC
TTTTTGTGTGTTTCATTTTTCAATTTTTTATATCAGAAAATAATTTAGTCTCCATGTTCTGTGTCCCAATCTCAAATTATGAGGGCAAGGTAAGGGTAGTTGTTGTCTACACTCTTTGATTAAGCTTTCTTTTTTCATCTTTGGGCTTTTGTATTTGCAAGTTCTTCTGGAATTAAGAGCAATGTACATTGACATTTAGCATTATTATGTAGCCATAGATTTCTTGTGTTGAGGAAATTATAATTTGGTGCTTGACTTGTGTTTTTATTATGTCATTTGTTTTGATTGAATCAACAACATATTGTTCATTTCGATTTCGACTAAGAAATTATCTTTCATTTACAGTATGCAGTCAGGACCCCTTCTATGTTCAAAGCCTGTTTTAAGATCTACTGTTCCCTGCTATTGTCCTGGTCATCTACAAAAAGGCGAAAAATGCTTAGCTAGAGATTTAAGAAAAGCAGGTCTTAACGTCTCCTCTACGAGTAAGCTCTGCCCCGATTTCCACGTATTGTTAGCGGAATGCGTTCGCCACATACAATCCAAAAGGAGGGCTGCGAGGAAGGCAACTGCTGTTAAAATTGAGACTAACTGAGAAGGTGAAGATTTGTTTAGCCCTCAGATGGAATAAGAATACCTACTTTGCAGTCTGATAACTTTCGAAAGAAAGTGTTTGTTCATAAGTTTTGGTATTCAATCCTACCTCCATGGAAGAGTTTGTTGATATGAGATCGATTTTCATCGCGTAATATTTGAACAAAAGGTCGATGCGCAACATACATTTGATTGCATAATGTAGGGATGTACATTCCTTTTTTTTTTCTTTAAGGATGTTTTAGTTTTAGCCACCCTCTTTCTAACTTACAATTCAGTAATTTTGTTGAGGTTTGCTATTAGAATGATAATGGTAAAAAAATCATCAAATTGTGCTATTGATGGATGATTGTGATCAATTCATAGACTAAAAT

mRNA sequence

CTTGGAGTTGGACCCTTTTTATCTCTCCTCATCAATTTCTCTGCACATAGACTCACGCTCACAGTCCCTCACTCTCTCTTCTCATCCATACATCTCCCCAACTCGCCGGAAACCGCCATTAAAACCGCCGTCTGAAGCTTTCTCAGTCACCCCCTTCACCGCCGCAAAGCCTCCGTTCTCAACCCTTAACCACGAGCCGTCCGATTCAACCCTGCTGCACCGGAGAGACGTCCTTGGAACGCGAATTCGCCGAACCGGAATCTGCTCCACCAGCCACCTGTGTGACCCGGTTCCAACCCGAACGGGCTTCCCAACCTGTAGATCTAGACCCTTGAGTTTCGTGCTACTTGGGTCTGGCTAGGTTCGTGATCATTTTTTTTGCTGATCGGAGTGGTGGAATCTCGATCAATTCATCCGGTTAAGGGCATGGCAGAATCGAACTCACCTGGTTCGTTTCAACCTCCTCCGGTTCCCCCACTTCCTATGGTTATTGATGGGGCAGATCATGATGTTGCGCTTGCCTCTTGTGAGTTCTTAACTCGTCGAGAAGTACTTCTGCGTCGGTCTCGGAGAGTGAAGCAACTTTGTAAACTCTATAGGGCACTATACTGGGCTTTAATGGAGGAAGTGAAGCACAAGTACCGTGAGTATTATTGGACGTACGGCAAGAGTCCGTTTAAGGAGGATGAGAAGGAGGCCGAGGGCGGCATTGGTGGTGATTATCCAGAGGGTATTGGGGAGAATGGGAAGCTAGGATCAAGTTCTGCAACGGTCGATGAGATTAGAAGGTGTGAGGTCACTGGTTGCAAGGCAAAGGCAATGGCATTGACAGACTACTGTCATGCTCATATCCTCTCAGATAAAAGGCAGAAGCTCTACAAGGGTTGCACCTTTGTAATCAAGAGTATGCAGTCAGGACCCCTTCTATGTTCAAAGCCTGTTTTAAGATCTACTGTTCCCTGCTATTGTCCTGGTCATCTACAAAAAGGCGAAAAATGCTTAGCTAGAGATTTAAGAAAAGCAGGTCTTAACGTCTCCTCTACGAGTAAGCTCTGCCCCGATTTCCACGTATTGTTAGCGGAATGCGTTCGCCACATACAATCCAAAAGGAGGGCTGCGAGGAAGGCAACTGCTGTTAAAATTGAGACTAACTGAGAAGGTGAAGATTTGTTTAGCCCTCAGATGGAATAAGAATACCTACTTTGCAGTCTGATAACTTTCGAAAGAAAGTGTTTGTTCATAAGTTTTGGTATTCAATCCTACCTCCATGGAAGAGTTTGTTGATATGAGATCGATTTTCATCGCGTAATATTTGAACAAAAGGTCGATGCGCAACATACATTTGATTGCATAATGTAGGGATGTACATTCCTTTTTTTTTTCTTTAAGGATGTTTTAGTTTTAGCCACCCTCTTTCTAACTTACAATTCAGTAATTTTGTTGAGGTTTGCTATTAGAATGATAATGGTAAAAAAATCATCAAATTGTGCTATTGATGGATGATTGTGATCAATTCATAGACTAAAAT

Coding sequence (CDS)

ATGGCAGAATCGAACTCACCTGGTTCGTTTCAACCTCCTCCGGTTCCCCCACTTCCTATGGTTATTGATGGGGCAGATCATGATGTTGCGCTTGCCTCTTGTGAGTTCTTAACTCGTCGAGAAGTACTTCTGCGTCGGTCTCGGAGAGTGAAGCAACTTTGTAAACTCTATAGGGCACTATACTGGGCTTTAATGGAGGAAGTGAAGCACAAGTACCGTGAGTATTATTGGACGTACGGCAAGAGTCCGTTTAAGGAGGATGAGAAGGAGGCCGAGGGCGGCATTGGTGGTGATTATCCAGAGGGTATTGGGGAGAATGGGAAGCTAGGATCAAGTTCTGCAACGGTCGATGAGATTAGAAGGTGTGAGGTCACTGGTTGCAAGGCAAAGGCAATGGCATTGACAGACTACTGTCATGCTCATATCCTCTCAGATAAAAGGCAGAAGCTCTACAAGGGTTGCACCTTTGTAATCAAGAGTATGCAGTCAGGACCCCTTCTATGTTCAAAGCCTGTTTTAAGATCTACTGTTCCCTGCTATTGTCCTGGTCATCTACAAAAAGGCGAAAAATGCTTAGCTAGAGATTTAAGAAAAGCAGGTCTTAACGTCTCCTCTACGAGTAAGCTCTGCCCCGATTTCCACGTATTGTTAGCGGAATGCGTTCGCCACATACAATCCAAAAGGAGGGCTGCGAGGAAGGCAACTGCTGTTAAAATTGAGACTAACTGA

Protein sequence

MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRALYWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSATVDEIRRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKIETN
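
The protein sequence corresponds to the CDS read codon by codon. As a rough sketch of that relationship, the Python below translates a nucleotide string with the standard genetic code. It assumes the standard code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative only. For brevity the example uses only the first eight and last five codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGGCAGAATCGAACTCACCTGGT"))
# Last five codons of the CDS above, including the TGA stop.
print(translate("ATTGAGACTAACTGA"))
```

The two fragments translate to the first eight and last four residues of the protein sequence listed above.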
Homology
BLAST of Sed0006305 vs. NCBI nr
Match: XP_022953677.1 (INO80 complex subunit D-like [Cucurbita moschata] >XP_022991632.1 INO80 complex subunit D-like [Cucurbita maxima] >XP_023548640.1 INO80 complex subunit D-like [Cucurbita pepo subsp. pepo] >KAG7014493.1 INO80 complex subunit D, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 426.4 bits (1095), Expect = 1.6e-115
Identity = 214/243 (88.07%), Positives = 224/243 (92.18%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEF TRREVL RRSRRVKQLC++YR L
Sbjct: 1   MAESNSPGSFQSPPAPPHPMVIDGANHDLALASCEFFTRREVLERRSRRVKQLCRVYREL 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YWALMEE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG  S T  DEI
Sbjct: 61  YWALMEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLGSVTGSDEI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AECVR IQ KRRAARKATAVKI
Sbjct: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAECVRQIQVKRRAARKATAVKI 240

Query: 241 ETN 243
           E+N
Sbjct: 241 ESN 241

BLAST of Sed0006305 vs. NCBI nr
Match: KAG6575969.1 (INO80 complex subunit D, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 426.4 bits (1095), Expect = 1.6e-115
Identity = 214/243 (88.07%), Positives = 224/243 (92.18%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEF TRREVL RRSRRVKQLC++YR L
Sbjct: 49  MAESNSPGSFQSPPAPPHPMVIDGANHDLALASCEFFTRREVLERRSRRVKQLCRVYREL 108

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YWALMEE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG  S T  DEI
Sbjct: 109 YWALMEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLGSVTGSDEI 168

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 169 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 228

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AECVR IQ KRRAARKATAVKI
Sbjct: 229 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAECVRQIQVKRRAARKATAVKI 288

Query: 241 ETN 243
           E+N
Sbjct: 289 ESN 289

BLAST of Sed0006305 vs. NCBI nr
Match: XP_038896100.1 (INO80 complex subunit D-like isoform X2 [Benincasa hispida] >XP_038896101.1 INO80 complex subunit D-like isoform X2 [Benincasa hispida])

HSP 1 Score: 401.4 bits (1030), Expect = 5.7e-108
Identity = 203/243 (83.54%), Positives = 218/243 (89.71%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAES+SPGSFQPPPV PLP++IDGAD D ALA+ E   RREVL RRSRRVKQLC++ + +
Sbjct: 1   MAESSSPGSFQPPPVTPLPILIDGADRDRALAASEVCARREVLERRSRRVKQLCRILKQV 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YW L+EE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG  S T  DEI
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLGSVTGSDEI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AE VR IQSKRRA RKATAVKI
Sbjct: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEFVRQIQSKRRATRKATAVKI 240

Query: 241 ETN 243
           E+N
Sbjct: 241 ESN 241

BLAST of Sed0006305 vs. NCBI nr
Match: XP_038896099.1 (uncharacterized protein LOC120084404 isoform X1 [Benincasa hispida])

HSP 1 Score: 401.4 bits (1030), Expect = 5.7e-108
Identity = 203/243 (83.54%), Positives = 218/243 (89.71%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAES+SPGSFQPPPV PLP++IDGAD D ALA+ E   RREVL RRSRRVKQLC++ + +
Sbjct: 107 MAESSSPGSFQPPPVTPLPILIDGADRDRALAASEVCARREVLERRSRRVKQLCRILKQV 166

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YW L+EE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG  S T  DEI
Sbjct: 167 YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLGSVTGSDEI 226

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 227 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 286

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AE VR IQSKRRA RKATAVKI
Sbjct: 287 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAEFVRQIQSKRRATRKATAVKI 346

Query: 241 ETN 243
           E+N
Sbjct: 347 ESN 347

BLAST of Sed0006305 vs. NCBI nr
Match: XP_022150569.1 (INO80 complex subunit D-like isoform X1 [Momordica charantia] >XP_022150579.1 INO80 complex subunit D-like isoform X1 [Momordica charantia] >XP_022150584.1 INO80 complex subunit D-like isoform X1 [Momordica charantia])

HSP 1 Score: 399.4 bits (1025), Expect = 2.2e-107
Identity = 198/243 (81.48%), Positives = 217/243 (89.30%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAESNSPGSFQPPPVPP PMVIDG DHDVALAS +F+TR+E+L+RRSRRVKQL ++Y+A 
Sbjct: 1   MAESNSPGSFQPPPVPPPPMVIDGTDHDVALASSKFITRKELLVRRSRRVKQLIRIYKAF 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLG-SSSATVDEI 120
           YWALME+ K K+REYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGK G  S+A  D+I
Sbjct: 61  YWALMEDFKRKFREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKFGIGSAAGSDDI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCK KAMA+T YCHAHILSD +Q+LYKGCTFVIKSM SGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKVKAMAMTKYCHAHILSDSKQRLYKGCTFVIKSMPSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLN+SSTSKL P+ HVLL+E VR IQ KRRA RKATAVK 
Sbjct: 181 YCPGHLQKGEKCLARDLRKAGLNISSTSKLRPELHVLLSEYVRQIQLKRRAMRKATAVKT 240

Query: 241 ETN 243
           ETN
Sbjct: 241 ETN 241

BLAST of Sed0006305 vs. ExPASy Swiss-Prot
Match: Q54J07 (INO80 complex subunit D OS=Dictyostelium discoideum OX=44689 GN=DDB_G0288447 PE=3 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 2.0e-07
Identity = 48/185 (25.95%), Positives = 73/185 (39.46%), Query Frame = 0

Query: 26  DHDVALASCEFLTRREVLLRRSRRVKQLCKLYRALYWALMEEVKHKYREYYWTYGKSPFK 85
           D D   AS   LT  E++ RR   + +L  LY+  Y    E ++   R Y  T      +
Sbjct: 412 DSDFYFASSSVLTDEELIQRRKIYISKLILLYKKQYNRFKERLRIIRRHYISTSLSLNQQ 471

Query: 86  EDEKEAEGGIGGDYPEGIGENGKLGSS-------------------------SATVDEIR 145
            D  + E  I  +    I  N    ++                         +   +E  
Sbjct: 472 NDSNKME--IDNNNDNNINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNKLNKRKEEGN 531

Query: 146 RCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCY 186
            C    CK K M L+ YC++HIL DK QKL+  CT+ + + +     C  P+L+  +P  
Sbjct: 532 LCLSVNCKVKPMLLSKYCYSHILQDKDQKLFHECTYQLSANKK----CGYPILKVQIPTL 590

BLAST of Sed0006305 vs. ExPASy TrEMBL
Match: A0A6J1JMD6 (KAT8 regulatory NSL complex subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC111488191 PE=4 SV=1)

HSP 1 Score: 426.4 bits (1095), Expect = 8.0e-116
Identity = 214/243 (88.07%), Positives = 224/243 (92.18%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEF TRREVL RRSRRVKQLC++YR L
Sbjct: 1   MAESNSPGSFQSPPAPPHPMVIDGANHDLALASCEFFTRREVLERRSRRVKQLCRVYREL 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YWALMEE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG  S T  DEI
Sbjct: 61  YWALMEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLGSVTGSDEI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AECVR IQ KRRAARKATAVKI
Sbjct: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAECVRQIQVKRRAARKATAVKI 240

Query: 241 ETN 243
           E+N
Sbjct: 241 ESN 241

BLAST of Sed0006305 vs. ExPASy TrEMBL
Match: A0A6J1GNX0 (KAT8 regulatory NSL complex subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111456134 PE=4 SV=1)

HSP 1 Score: 426.4 bits (1095), Expect = 8.0e-116
Identity = 214/243 (88.07%), Positives = 224/243 (92.18%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEF TRREVL RRSRRVKQLC++YR L
Sbjct: 1   MAESNSPGSFQSPPAPPHPMVIDGANHDLALASCEFFTRREVLERRSRRVKQLCRVYREL 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YWALMEE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG  S T  DEI
Sbjct: 61  YWALMEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLGSVTGSDEI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AECVR IQ KRRAARKATAVKI
Sbjct: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLVAECVRQIQVKRRAARKATAVKI 240

Query: 241 ETN 243
           E+N
Sbjct: 241 ESN 241

BLAST of Sed0006305 vs. ExPASy TrEMBL
Match: A0A6J1DAF9 (KAT8 regulatory NSL complex subunit 2 OS=Momordica charantia OX=3673 GN=LOC111018675 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 1.0e-107
Identity = 198/243 (81.48%), Positives = 217/243 (89.30%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAESNSPGSFQPPPVPP PMVIDG DHDVALAS +F+TR+E+L+RRSRRVKQL ++Y+A 
Sbjct: 1   MAESNSPGSFQPPPVPPPPMVIDGTDHDVALASSKFITRKELLVRRSRRVKQLIRIYKAF 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLG-SSSATVDEI 120
           YWALME+ K K+REYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGK G  S+A  D+I
Sbjct: 61  YWALMEDFKRKFREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKFGIGSAAGSDDI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCK KAMA+T YCHAHILSD +Q+LYKGCTFVIKSM SGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKVKAMAMTKYCHAHILSDSKQRLYKGCTFVIKSMPSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YCPGHLQKGEKCLARDLRKAGLN+SSTSKL P+ HVLL+E VR IQ KRRA RKATAVK 
Sbjct: 181 YCPGHLQKGEKCLARDLRKAGLNISSTSKLRPELHVLLSEYVRQIQLKRRAMRKATAVKT 240

Query: 241 ETN 243
           ETN
Sbjct: 241 ETN 241

BLAST of Sed0006305 vs. ExPASy TrEMBL
Match: A0A0A0K4I2 (KAT8 regulatory NSL complex subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_7G337070 PE=4 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 1.4e-107
Identity = 200/243 (82.30%), Positives = 219/243 (90.12%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MAESNSPGSFQPPPV PLP++IDGAD D ALA+    +RREVL RRSRR KQLC++++ L
Sbjct: 1   MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKEL 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YW L+EE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG +SAT  DEI
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLASATGSDEI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YC GHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AE VR IQSKRRA ++ATA+KI
Sbjct: 181 YCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKI 240

Query: 241 ETN 243
           E+N
Sbjct: 241 ESN 241

BLAST of Sed0006305 vs. ExPASy TrEMBL
Match: A0A1S3BH19 (KAT8 regulatory NSL complex subunit 2 OS=Cucumis melo OX=3656 GN=LOC103489752 PE=4 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 1.5e-106
Identity = 198/243 (81.48%), Positives = 217/243 (89.30%), Query Frame = 0

Query: 1   MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRAL 60
           MA+SNSPGSFQPPPV P P++IDGAD D ALAS    +RREVL RRSRR KQLC++++ L
Sbjct: 1   MADSNSPGSFQPPPVTPFPILIDGADRDRALASSMVCSRREVLERRSRRAKQLCRIFKEL 60

Query: 61  YWALMEEVKHKYREYYWTYGKSPFKEDEKEAEGGIGGDYPEGIGENGKLGSSSAT-VDEI 120
           YW L+EE+K KYREYYWTYGKSPFKEDEKEAEG   GDYPEGIGENGKLG  S+T  DEI
Sbjct: 61  YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEG--IGDYPEGIGENGKLGLGSSTGSDEI 120

Query: 121 RRCEVTGCKAKAMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180
           RRC+VTGCKAKAMALT YCHAHILSDK+Q+LYKGCTFVIKSMQSGPLLCSKPVLRSTVPC
Sbjct: 121 RRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPC 180

Query: 181 YCPGHLQKGEKCLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVKI 240
           YC GHLQKGEKCLARDLRKAGLNVSSTSKL PDFHVL+AE VR IQSKRRA ++ATA+KI
Sbjct: 181 YCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKI 240

Query: 241 ETN 243
           E+N
Sbjct: 241 ESN 241

BLAST of Sed0006305 vs. TAIR 10
Match: AT2G31600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G53860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 188.3 bits (477), Expect = 7.0e-48
Identity = 109/228 (47.81%), Positives = 137/228 (60.09%), Query Frame = 0

Query: 25  ADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRALYWALMEEVKHKYREYYWTYGKSPF 84
           +  D  LA    +TR E+L RRS  +KQL K YR  YWALME+VK ++R+Y+W YG S F
Sbjct: 66  SQEDEILARSSHITRSELLKRRSHNLKQLAKCYRDNYWALMEDVKAQHRDYWWKYGISQF 125

Query: 85  KEDE----------KEAEGGIGGDYPEGIGE----NGKLGSSSATVDEIRRCEVTGCKAK 144
           K++           +E + G GGD  EG G+    N  + S          C + GCKAK
Sbjct: 126 KDENNQSNKRRRLGQEGDIGDGGDAVEGSGDNVTNNDGVKSDQYANSNCGSC-MYGCKAK 185

Query: 145 AMALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEK 204
           AMALT YC  HIL D +QKLY GCT VIK   +GPLLC KP L STVP  C  H QK +K
Sbjct: 186 AMALTKYCQLHILKDSKQKLYTGCTNVIKRAPAGPLLCGKPTLASTVPALCNIHFQKAQK 245

Query: 205 CLARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVK 239
            +A+ L+ AG NVSSTSK  P  HV++A  V HIQ+KR+  +K   +K
Sbjct: 246 HVAKALKDAGHNVSSTSKPPPKLHVIVAAFVHHIQAKRKNPQKECKLK 292

BLAST of Sed0006305 vs. TAIR 10
Match: AT1G05860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G31600.1); Has 101 Blast hits to 100 proteins in 32 species: Archae - 0; Bacteria - 0; Metazoa - 28; Fungi - 2; Plants - 66; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink). )

HSP 1 Score: 177.2 bits (448), Expect = 1.6e-44
Identity = 102/227 (44.93%), Positives = 134/227 (59.03%), Query Frame = 0

Query: 17  PLPMVIDGADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRALYWALMEEVKHKYREYY 76
           P+ M ++    D  L +   LTR E+L RRS  +KQL + YR  YWALME++K ++R Y 
Sbjct: 50  PISMAVE----DQILGNSNHLTRPELLRRRSHNLKQLSRCYRDHYWALMEDLKAQHRYYS 109

Query: 77  WTYGKSPFKED-----EKEAEGGIGGDYPEGIGENGKLGSSSATVDEIRRCEVTGCKAKA 136
           W YG SPFK++     ++    G  GD  EG G+N    +          C  +GCK+KA
Sbjct: 110 WNYGVSPFKDENYHQNKRRKVEGQTGDEIEGSGDNDNNNNDGVKAGNCVACG-SGCKSKA 169

Query: 137 MALTDYCHAHILSDKRQKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKC 196
           MALT+YC  HIL DK+QKLY  CT+V K  QS  + C KP L STVP  C  H QK +K 
Sbjct: 170 MALTNYCQLHILMDKKQKLYTSCTYVNKRAQSKAITCPKPTLASTVPALCNVHFQKAQKD 229

Query: 197 LARDLRKAGLNVSSTSKLCPDFHVLLAECVRHIQSKRRAARKATAVK 239
           +AR L+ AG NVSS S+  P  H ++A  V HIQ+KR+  RK   +K
Sbjct: 230 VARALKDAGHNVSSASRPPPKLHDIVAAFVHHIQAKRKDPRKEGKLK 271

BLAST of Sed0006305 vs. TAIR 10
Match: AT3G53860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G31600.1); Has 70 Blast hits to 70 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 66; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 177.2 bits (448), Expect = 1.6e-44
Identity = 103/211 (48.82%), Positives = 133/211 (63.03%), Query Frame = 0

Query: 28  DVALASCEFLTRREVLLRRSRRVKQLCKLYRALYWALMEEVKHKYREYYWTYGKSPFKED 87
           D  LAS   LTR E+L RR+  +KQL K Y+  YWALME++K ++R+Y+  YG S FK++
Sbjct: 65  DEILASSSHLTRPELLRRRADNLKQLAKCYKNHYWALMEDLKAQHRDYWCKYGVSQFKDE 124

Query: 88  EKEAEGGIGGDYPEGIGENGKLGSSSATVDEIRRCEVTGCKAKAMALTDYCHAHILSDKR 147
           + ++      D PEG G+ G  G   A  +    C + GCKAKAMALT YC  HIL D +
Sbjct: 125 QNQSNKRRRLD-PEGSGDKGNDGDQYANSNS-GFC-MYGCKAKAMALTKYCQLHILKDSK 184

Query: 148 QKLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTS 207
           QKLY GCT VI    +GPLLC KP L STVP  C  H QK +K +A+ L+ AG NVSSTS
Sbjct: 185 QKLYTGCTNVINRSPAGPLLCGKPTLASTVPVLCNVHYQKAQKNVAKALKDAGHNVSSTS 244

Query: 208 KLCPDFHVLLAECVRHIQSKRRAARKATAVK 239
           K  P  HV++A  V HIQ++R+   K   +K
Sbjct: 245 KPPPKLHVIVAAFVHHIQAQRKNPHKEGKLK 272

BLAST of Sed0006305 vs. TAIR 10
Match: AT2G31600.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G53860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 114.4 bits (285), Expect = 1.3e-25
Identity = 69/149 (46.31%), Positives = 86/149 (57.72%), Query Frame = 0

Query: 25  ADHDVALASCEFLTRREVLLRRSRRVKQLCKLYRALYWALMEEVKHKYREYYWTYGKSPF 84
           +  D  LA    +TR E+L RRS  +KQL K YR  YWALME+VK ++R+Y+W YG S F
Sbjct: 66  SQEDEILARSSHITRSELLKRRSHNLKQLAKCYRDNYWALMEDVKAQHRDYWWKYGISQF 125

Query: 85  KEDE----------KEAEGGIGGDYPEGIGE----NGKLGSSSATVDEIRRCEVTGCKAK 144
           K++           +E + G GGD  EG G+    N  + S          C + GCKAK
Sbjct: 126 KDENNQSNKRRRLGQEGDIGDGGDAVEGSGDNVTNNDGVKSDQYANSNCGSC-MYGCKAK 185

Query: 145 AMALTDYCHAHILSDKRQKLYKGCTFVIK 160
           AMALT YC  HIL D +QKLY GCT VIK
Sbjct: 186 AMALTKYCQLHILKDSKQKLYTGCTNVIK 213
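
Each HSP header above reports identity as a fraction (matching positions over alignment length) followed by a percentage. A small sketch of how those percentages follow from the fractions is given below. The function name `recompute_percentages` is hypothetical, and the regular expression assumes the header layout shown on this page rather than any general BLAST output format.

```python
import re


def recompute_percentages(header):
    """Recompute 'label = a/b (p%)' entries found in an HSP header line.

    Returns {label: (recomputed percentage, percentage as printed)},
    both as strings with two decimal places. Hypothetical helper.
    """
    results = {}
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    for label, matches, length, shown in re.findall(pattern, header):
        recomputed = f"{100 * int(matches) / int(length):.2f}"
        results[label] = (recomputed, shown)
    return results


# Identity values copied from the first NCBI nr hit and the Swiss-Prot hit.
print(recompute_percentages("Identity = 214/243 (88.07%)"))
print(recompute_percentages("Identity = 48/185 (25.95%)"))
```

Comparing the recomputed value with the printed one is a quick consistency check when these headers are parsed from text.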

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022953677.1  1.6e-115  88.07         INO80 complex subunit D-like [Cucurbita moschata] >XP_022991632.1 INO80 complex ...
KAG6575969.1    1.6e-115  88.07         INO80 complex subunit D, partial [Cucurbita argyrosperma subsp. sororia]
XP_038896100.1  5.7e-108  83.54         INO80 complex subunit D-like isoform X2 [Benincasa hispida] >XP_038896101.1 INO8...
XP_038896099.1  5.7e-108  83.54         uncharacterized protein LOC120084404 isoform X1 [Benincasa hispida]
XP_022150569.1  2.2e-107  81.48         INO80 complex subunit D-like isoform X1 [Momordica charantia] >XP_022150579.1 IN...

Match Name      E-value   Identity (%)  Description
Q54J07          2.0e-07   25.95         INO80 complex subunit D OS=Dictyostelium discoideum OX=44689 GN=DDB_G0288447 PE=...

Match Name      E-value   Identity (%)  Description
A0A6J1JMD6      8.0e-116  88.07         KAT8 regulatory NSL complex subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC11148819...
A0A6J1GNX0      8.0e-116  88.07         KAT8 regulatory NSL complex subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111456...
A0A6J1DAF9      1.0e-107  81.48         KAT8 regulatory NSL complex subunit 2 OS=Momordica charantia OX=3673 GN=LOC11101...
A0A0A0K4I2      1.4e-107  82.30         KAT8 regulatory NSL complex subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_7G337070...
A0A1S3BH19      1.5e-106  81.48         KAT8 regulatory NSL complex subunit 2 OS=Cucumis melo OX=3656 GN=LOC103489752 PE...

Match Name      E-value   Identity (%)  Description
AT2G31600.1     7.0e-48   47.81         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G05860.1     1.6e-44   44.93         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G53860.1     1.6e-44   48.82         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G31600.2     1.3e-25   46.31         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source   Source Term    Source Description                Alignment
IPR025927  Potential DNA-binding domain           PFAM     PF13891        zf-C3Hc3H                         coord: 122..184; e-value: 4.0E-16; score: 59.1
None       No IPR available                       PANTHER  PTHR13453:SF7  DOMAIN PROTEIN, PUTATIVE-RELATED  coord: 9..241
IPR026316  KAT8 regulatory NSL complex subunit 2  PANTHER  PTHR13453      UNCHARACTERIZED                   coord: 9..241

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Sed0006305.1  Sed0006305.1  mRNA
Sed0006305.2  Sed0006305.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0043984      histone H4-K16 acetylation
biological_process  GO:0043981      histone H4-K5 acetylation
biological_process  GO:0043982      histone H4-K8 acetylation
cellular_component  GO:0044545      NSL complex
cellular_component  GO:0000123      histone acetyltransferase complex
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0046983      protein dimerization activity