Cla97C01G002010.1 (mRNA) Watermelon (97103) v2

Name: Cla97C01G002010.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: KAT8 regulatory NSL complex subunit 3
Location: Cla97Chr01 : 1816536 .. 1818901 (+)
Sequence length: 717
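
As a rough cross-check of the location and length fields, the genomic span can be derived from the Location value and compared with the sequence length. This is only a minimal sketch: it assumes the coordinates are 1-based and inclusive (the page does not state this) and that 717 is the spliced sequence length. `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chrom : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cla97Chr01 : 1816536 .. 1818901 (+)")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1
spliced_length = 717  # value of the "Sequence length" field

print(chrom, strand, genomic_span)     # genomic span in bp
print(genomic_span - spliced_length)   # bp absent from the spliced sequence
```

Under these assumptions the feature spans 2366 bp on the chromosome, leaving 1649 bp that do not appear in the 717-bp spliced sequence.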
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCCTCACCTCCTTCTAAACGCCGTCGTAAAAACCTCACTTTTGATGATGCCTTTGAGACCTTACTTTCAACTGCAACCTCTTCCTCCGTTAATCGATTGTCCCCTGTCGTCGTTTTCGCTCACGGTGCTGGTGCCCCTTCTTCTTCTGAATGGATGATCAGGTCTTTCTTCTTCTTTTATCTCTCTCTCCTCTTTAATTTACTTCATCAGCTACCACAACTCTGACCCAAGGCTGATACGAATAATTTAGGGTTCAATTTGATCTTTGCTGCTACTGTTTACTCGCATGATGAGTGTTGCTCTGTATGGTTTTTTATTCAGTATTTATTTGTTTTTTTTTTTATGAACCCCCTCAATCATTTGAAGGGTGGAATTCTTTTTCAGTTGAGGAAATTGGGAAATCTTGCCTGGAATTTCTAGTTGAGTTTGACCTTGTTTTCTAAGCCATTTGTGATCTCTTGTTTGAGATTCAAATTGTTGCACTATTGTTTGCTTTTATCCTGTAAAACATTTTTGACCAATTAGTTGTCTTATGAAGAGTAAAATTTATTGGTTACATCCAGATGGAAGGATATGTTAGGCAAGGCACTCCACGCTGCTGAAGTTGTTACTTTTGATTATCCATGTGAGTGTTTCACATTTAACAATACATTGATATAGGATCGTATACATTTTGCTATGTTACTAGGTATGACAACACAATGAACTTTAAATTTTGTTATTTCTTTCAACTAATCTTCAAAATTTTGAGTTTTGATTCGACAGACATTTCTGGAGGGAGGAAATCCCCTCCAAAGGCAGAAAAATTGGTCCCATATCATGTAGAAATTGTCAAAAGGACGATTGCAAAGTATCCCGGGCACCCCTTGATTTTGGCGGGTAAATCAATGGGTTCAAGGTAATAGTTTTGTTTCTTTGATTGCAATCATTGTTTTGCCAATTTTATGGAGTTCTCTTTAGCATCTTAATAAGTATGAAAATTTGAAATATTTCCAAATTTTATTGTCTTTGGTCAATACATCAAATATAACTTCTCAACTGGTTTTTCTTAAGGAGAAATTGGATTTCTTCAGATAGCTTATTGAAATGCATGAAAATATCCCAGGTGATTTTTGGGACTAATTCATTTTTTACTTTAGATTTTTTAAATCATATGTAACTTTGCAACATTCAACTTCAAATAAAATAAGTTTGAGATTTTCTGTCATGTAAATGTGGTTTTTACTTATTTTATTTATTGGCAGAGTAAGTTGCATGGTAGCTTGTGAGGAAGACATTCATGCTTCTGCAATTATTTGCTTGGGGTATCCCTTGAAGGTTTGAGATTATTTATGCATATCAGATTGAAAAAGCCTTCAGGCTTTAGGGTAAATAAACTGATTTTGAGTTGTGGTTTTTGTTGCATCTTGTAGATGGTTATCTATAAGTATATATATATATAAACAGTTATGGTCATAACAATCCTGGGACAGCTTTAACTTTTTAATAAGGAATTTTAATGGGTGAATAAACTTGTCCTTCTAGTTCAGATGGTAATAAGTTTGTGACAAAAGACTAATATATGCTGGATGTGGATTCATACTCAAATTAAGGTGTTTGGGGTTAATAGCTGTCAGTCTTAATGGGTTTGGAAGATCGTTGCAATTAGGAATGGCGATGAAGGTGACATTTTACGCTGTACCAATGAATCTTTAAGCTTGTGCCGACTAATCATGTAAAGTTTCATATTTTAGGGCTTGAAAGGGGATGTGCGAGACCTGACATTATTGCAAGTTACCGTTCCTATCATGTTTGTACAGGTACCTCATCTTTACACAACTTACAATGTTTGCTGCACTATTCTCTCTGGAAGATATAAAGCATAACCCACATCTTCTATAGTAAAGTAATAAATAGCCCTGGACCTACCTTGTTTTGTTCTGAATGGTCCAACATTTTTGTTGCTCACCTTTGTCGATTGGACCATGGTTTAAATGTTTATTATCCTGCACCCA
GTATTTGAGTGCTTTGCGTAGTGAACAGTGGTATAGCTGGTCGATGATAGAACTTTAAAGAAACATAAACATTCATAGACAGTTGAGAAATTTGATTGATTTTCGATGATATCATTGTGCAGTCTATAATCTCATCATTGTGCCATTCAGGGTAGCAGGGATGCTCTGTGTCCTCTAGAGAAATTGGAGGATATCCGGAAGAGGATGAAATCAATTAGTGGGTTACATGTTATTGATGGTGGTGACCATTCTTTCCAGATTTCAAAGAAATACCTTCAAGGCAAAGGTTCAAGTAAAGACGAAGCCGAAAATCTTGCTGCTCAGGCCCTTGCAACTTTTGTTTCTGGGTGTCTTGGATGGCTGTAG

mRNA sequence

ATGGCTTCCTCACCTCCTTCTAAACGCCGTCGTAAAAACCTCACTTTTGATGATGCCTTTGAGACCTTACTTTCAACTGCAACCTCTTCCTCCGTTAATCGATTGTCCCCTGTCGTCGTTTTCGCTCACGGTGCTGGTGCCCCTTCTTCTTCTGAATGGATGATCAGATGGAAGGATATGTTAGGCAAGGCACTCCACGCTGCTGAAGTTGTTACTTTTGATTATCCATACATTTCTGGAGGGAGGAAATCCCCTCCAAAGGCAGAAAAATTGGTCCCATATCATGTAGAAATTGTCAAAAGGACGATTGCAAAGTATCCCGGGCACCCCTTGATTTTGGCGGGTAAATCAATGGGTTCAAGAGTAAGTTGCATGGTAGCTTGTGAGGAAGACATTCATGCTTCTGCAATTATTTGCTTGGGGTATCCCTTGAAGGGCTTGAAAGGGGATGTGCGAGACCTGACATTATTGCAAGTTACCGTTCCTATCATGTTTGTACAGGGTAGCAGGGATGCTCTGTGTCCTCTAGAGAAATTGGAGGATATCCGGAAGAGGATGAAATCAATTAGTGGGTTACATGTTATTGATGGTGGTGACCATTCTTTCCAGATTTCAAAGAAATACCTTCAAGGCAAAGGTTCAAGTAAAGACGAAGCCGAAAATCTTGCTGCTCAGGCCCTTGCAACTTTTGTTTCTGGGTGTCTTGGATGGCTGTAG

Coding sequence (CDS)

ATGGCTTCCTCACCTCCTTCTAAACGCCGTCGTAAAAACCTCACTTTTGATGATGCCTTTGAGACCTTACTTTCAACTGCAACCTCTTCCTCCGTTAATCGATTGTCCCCTGTCGTCGTTTTCGCTCACGGTGCTGGTGCCCCTTCTTCTTCTGAATGGATGATCAGATGGAAGGATATGTTAGGCAAGGCACTCCACGCTGCTGAAGTTGTTACTTTTGATTATCCATACATTTCTGGAGGGAGGAAATCCCCTCCAAAGGCAGAAAAATTGGTCCCATATCATGTAGAAATTGTCAAAAGGACGATTGCAAAGTATCCCGGGCACCCCTTGATTTTGGCGGGTAAATCAATGGGTTCAAGAGTAAGTTGCATGGTAGCTTGTGAGGAAGACATTCATGCTTCTGCAATTATTTGCTTGGGGTATCCCTTGAAGGGCTTGAAAGGGGATGTGCGAGACCTGACATTATTGCAAGTTACCGTTCCTATCATGTTTGTACAGGGTAGCAGGGATGCTCTGTGTCCTCTAGAGAAATTGGAGGATATCCGGAAGAGGATGAAATCAATTAGTGGGTTACATGTTATTGATGGTGGTGACCATTCTTTCCAGATTTCAAAGAAATACCTTCAAGGCAAAGGTTCAAGTAAAGACGAAGCCGAAAATCTTGCTGCTCAGGCCCTTGCAACTTTTGTTTCTGGGTGTCTTGGATGGCTGTAG

Protein sequence

MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDMLGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGSRVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKLEDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLGWL
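
The relationship between the coding sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration that assumes the standard genetic code; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used as input.

```python
BASES = "TCAG"
# Standard genetic code, ordered by first, second, then third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino-acid map.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCTTCCTCACCTCCTTCTAAACGCCGT"  # first 30 nt of the CDS
cds_end = "GGATGGCTGTAG"                      # last 12 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MASSPPSKRR and GWL, which match the start and end of the protein sequence. A 717-nt CDS holds 239 codons, that is 238 residues plus the stop codon.
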
BLAST of Cla97C01G002010.1 vs. NCBI nr
Match: XP_008437385.1 (PREDICTED: KAT8 regulatory NSL complex subunit 3 [Cucumis melo])

HSP 1 Score: 451.4 bits (1160), Expect = 1.8e-123
Identity = 227/238 (95.38%), Positives = 229/238 (96.22%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSKRRRK+L  DDAFETLL  ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM
Sbjct: 1   MASSPPSKRRRKSLAIDDAFETLL--ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS 120
           LGKALHA EVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRT AKYPGHPLILAGKSMGS
Sbjct: 61  LGKALHAVEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTTAKYPGHPLILAGKSMGS 120

Query: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKLE 180
           RVSCMVACEEDIHASAIICLGYPLKGLKGDVRD TLLQV VPIMFVQGSRDALCPLEKLE
Sbjct: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDQTLLQVMVPIMFVQGSRDALCPLEKLE 180

Query: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLGWL 239
           DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFV+G LGWL
Sbjct: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVAGFLGWL 236
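
The Identity and Positives percentages in the HSP header are the match counts divided by the aligned length. A minimal sketch of that arithmetic, using a hypothetical `percent` helper and the counts reported for this hit:

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100 * matches / aligned_length:.2f}"

print(percent(227, 238))  # identical positions
print(percent(229, 238))  # identical or conservatively substituted positions
```

These calls reproduce the 95.38 and 96.22 values shown for XP_008437385.1.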

BLAST of Cla97C01G002010.1 vs. NCBI nr
Match: XP_004143872.1 (PREDICTED: KAT8 regulatory NSL complex subunit 3 [Cucumis sativus] >KGN50026.1 hypothetical protein Csa_5G150440 [Cucumis sativus])

HSP 1 Score: 448.7 bits (1153), Expect = 1.2e-122
Identity = 225/238 (94.54%), Positives = 229/238 (96.22%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSKRRRK+LT DDAFETLL  ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM
Sbjct: 1   MASSPPSKRRRKSLTIDDAFETLL--ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS 120
           LGKALHA EVVTFDYPYISGGRKSPPKAEKLVP+HVEIVKR  AKYPGHPL+LAGKSMGS
Sbjct: 61  LGKALHAVEVVTFDYPYISGGRKSPPKAEKLVPHHVEIVKRATAKYPGHPLVLAGKSMGS 120

Query: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKLE 180
           RVSCMVACEEDIH SAIICLGYPLKGLKGDVRD TLLQVTVPIMFVQGSRDALCPLEKLE
Sbjct: 121 RVSCMVACEEDIHPSAIICLGYPLKGLKGDVRDQTLLQVTVPIMFVQGSRDALCPLEKLE 180

Query: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLGWL 239
           DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAE+LAAQALATFVSG LGWL
Sbjct: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAESLAAQALATFVSGFLGWL 236

BLAST of Cla97C01G002010.1 vs. NCBI nr
Match: XP_022146002.1 (KAT8 regulatory NSL complex subunit 3 [Momordica charantia])

HSP 1 Score: 446.0 bits (1146), Expect = 7.8e-122
Identity = 224/239 (93.72%), Positives = 230/239 (96.23%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLT-FDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKD 60
           MASSPPSKRRRKNLT FDDAFETLL  ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWK+
Sbjct: 1   MASSPPSKRRRKNLTAFDDAFETLL--ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKE 60

Query: 61  MLGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMG 120
           MLG+ALHA EVVTFDYPYISGGRKSPPKAEKLVPYH EIVKRTIAKYPGHPLILAGKSMG
Sbjct: 61  MLGRALHAVEVVTFDYPYISGGRKSPPKAEKLVPYHAEIVKRTIAKYPGHPLILAGKSMG 120

Query: 121 SRVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKL 180
           SRVSCMVACEE IHASAIICLGYPLKGLKGDVRDLTL QVTVPIMFVQGS+DALCPLEKL
Sbjct: 121 SRVSCMVACEEGIHASAIICLGYPLKGLKGDVRDLTLFQVTVPIMFVQGSKDALCPLEKL 180

Query: 181 EDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLGWL 239
           EDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAE LA+QA+ATFVSG LGW+
Sbjct: 181 EDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAEGLASQAIATFVSGFLGWI 237

BLAST of Cla97C01G002010.1 vs. NCBI nr
Match: XP_022958507.1 (KAT8 regulatory NSL complex subunit 3-like [Cucurbita moschata])

HSP 1 Score: 441.8 bits (1135), Expect = 1.5e-120
Identity = 220/235 (93.62%), Positives = 226/235 (96.17%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSKRRR+NL  DDAFETLL  ATSSS NRLSPVVVFAHGAGAPSSSEWMIRWKD+
Sbjct: 1   MASSPPSKRRRQNLAIDDAFETLL--ATSSSDNRLSPVVVFAHGAGAPSSSEWMIRWKDV 60

Query: 61  LGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS 120
           LGKALHA +VVTFDYPYISGGRK PPKAEKLVPYHVEIVKRTIA+YPGHPLILAGKSMGS
Sbjct: 61  LGKALHAVDVVTFDYPYISGGRKPPPKAEKLVPYHVEIVKRTIARYPGHPLILAGKSMGS 120

Query: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKLE 180
           RVSCMVACEEDIHASAIICLGYPLKGLKGDVRD TLLQVT+PIMFVQGSRDALCPLEKLE
Sbjct: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDQTLLQVTIPIMFVQGSRDALCPLEKLE 180

Query: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCL 236
           DIRKRMKSI GLHVIDGGDHSFQISKKYLQGKGSSKDEAEN+AAQALATFVSGCL
Sbjct: 181 DIRKRMKSIGGLHVIDGGDHSFQISKKYLQGKGSSKDEAENVAAQALATFVSGCL 233

BLAST of Cla97C01G002010.1 vs. NCBI nr
Match: XP_022996066.1 (KAT8 regulatory NSL complex subunit 3-like [Cucurbita maxima])

HSP 1 Score: 441.0 bits (1133), Expect = 2.5e-120
Identity = 222/235 (94.47%), Positives = 225/235 (95.74%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSKRRRKNL  DDAFETLL  ATSSS NRLSPVVVFAHGAGAPSSSEWMIRWKDM
Sbjct: 1   MASSPPSKRRRKNLAIDDAFETLL--ATSSSGNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS 120
           LGKALHA +VVTFDYPYISGGRK PPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS
Sbjct: 61  LGKALHAVDVVTFDYPYISGGRKPPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS 120

Query: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKLE 180
           RVSCMVACEEDIHASAIICLGYPLKGLKGDVRD TLLQVT+PIMFVQGSRDALCPLEKLE
Sbjct: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDQTLLQVTIPIMFVQGSRDALCPLEKLE 180

Query: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCL 236
           DIRKRMKSISGLHVIDGGDHSF ISKKYLQGKGSSKDEAEN+AAQALATFVSG L
Sbjct: 181 DIRKRMKSISGLHVIDGGDHSFHISKKYLQGKGSSKDEAENVAAQALATFVSGGL 233

BLAST of Cla97C01G002010.1 vs. TrEMBL
Match: tr|A0A1S3AUG6|A0A1S3AUG6_CUCME (KAT8 regulatory NSL complex subunit 3 OS=Cucumis melo OX=3656 GN=LOC103482816 PE=4 SV=1)

HSP 1 Score: 451.4 bits (1160), Expect = 1.2e-123
Identity = 227/238 (95.38%), Positives = 229/238 (96.22%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSKRRRK+L  DDAFETLL  ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM
Sbjct: 1   MASSPPSKRRRKSLAIDDAFETLL--ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS 120
           LGKALHA EVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRT AKYPGHPLILAGKSMGS
Sbjct: 61  LGKALHAVEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTTAKYPGHPLILAGKSMGS 120

Query: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKLE 180
           RVSCMVACEEDIHASAIICLGYPLKGLKGDVRD TLLQV VPIMFVQGSRDALCPLEKLE
Sbjct: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDQTLLQVMVPIMFVQGSRDALCPLEKLE 180

Query: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLGWL 239
           DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFV+G LGWL
Sbjct: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVAGFLGWL 236

BLAST of Cla97C01G002010.1 vs. TrEMBL
Match: tr|A0A0A0KQS0|A0A0A0KQS0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G150440 PE=4 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 7.9e-123
Identity = 225/238 (94.54%), Positives = 229/238 (96.22%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSKRRRK+LT DDAFETLL  ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM
Sbjct: 1   MASSPPSKRRRKSLTIDDAFETLL--ATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGGRKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMGS 120
           LGKALHA EVVTFDYPYISGGRKSPPKAEKLVP+HVEIVKR  AKYPGHPL+LAGKSMGS
Sbjct: 61  LGKALHAVEVVTFDYPYISGGRKSPPKAEKLVPHHVEIVKRATAKYPGHPLVLAGKSMGS 120

Query: 121 RVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKLE 180
           RVSCMVACEEDIH SAIICLGYPLKGLKGDVRD TLLQVTVPIMFVQGSRDALCPLEKLE
Sbjct: 121 RVSCMVACEEDIHPSAIICLGYPLKGLKGDVRDQTLLQVTVPIMFVQGSRDALCPLEKLE 180

Query: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLGWL 239
           DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAE+LAAQALATFVSG LGWL
Sbjct: 181 DIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAESLAAQALATFVSGFLGWL 236

BLAST of Cla97C01G002010.1 vs. TrEMBL
Match: tr|M5WAG3|M5WAG3_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G117200 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 8.8e-90
Identity = 170/237 (71.73%), Positives = 198/237 (83.54%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSK          A +  +S  +SS++ +LSPVVVFAHGAGAPSSS+WMIRWKDM
Sbjct: 1   MASSPPSKXXXXXXXXXXAND--MSATSSSTLEKLSPVVVFAHGAGAPSSSDWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGG-RKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMG 120
           LGKALHA EVVTFDYPYISGG R++PPKAEKLV +H ++V + +AKYPGHPLILAGKSMG
Sbjct: 61  LGKALHAVEVVTFDYPYISGGKRRAPPKAEKLVDFHADVVGKAVAKYPGHPLILAGKSMG 120

Query: 121 SRVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKL 180
           SRVSCMVAC+E I ASAI+CLGYPLKG+ G VRD  LLQ++VPIM VQGS+DALCPLEKL
Sbjct: 121 SRVSCMVACKEGIRASAILCLGYPLKGINGAVRDEILLQLSVPIMLVQGSKDALCPLEKL 180

Query: 181 EDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLG 237
           E  RK+MK  SGLHVIDGGDHSF+I KK+LQ  G ++DEAE+LA QALA+F+SG LG
Sbjct: 181 EVTRKKMKCPSGLHVIDGGDHSFKIGKKHLQTTGLTQDEAEDLALQALASFLSGSLG 235

BLAST of Cla97C01G002010.1 vs. TrEMBL
Match: tr|A0A251P732|A0A251P732_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G117200 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 8.8e-90
Identity = 170/237 (71.73%), Positives = 198/237 (83.54%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSK          A +  +S  +SS++ +LSPVVVFAHGAGAPSSS+WMIRWKDM
Sbjct: 1   MASSPPSKXXXXXXXXXXAND--MSATSSSTLEKLSPVVVFAHGAGAPSSSDWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGG-RKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMG 120
           LGKALHA EVVTFDYPYISGG R++PPKAEKLV +H ++V + +AKYPGHPLILAGKSMG
Sbjct: 61  LGKALHAVEVVTFDYPYISGGKRRAPPKAEKLVDFHADVVGKAVAKYPGHPLILAGKSMG 120

Query: 121 SRVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKL 180
           SRVSCMVAC+E I ASAI+CLGYPLKG+ G VRD  LLQ++VPIM VQGS+DALCPLEKL
Sbjct: 121 SRVSCMVACKEGIRASAILCLGYPLKGINGAVRDEILLQLSVPIMLVQGSKDALCPLEKL 180

Query: 181 EDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLG 237
           E  RK+MK  SGLHVIDGGDHSF+I KK+LQ  G ++DEAE+LA QALA+F+SG LG
Sbjct: 181 EVTRKKMKCPSGLHVIDGGDHSFKIGKKHLQTTGLTQDEAEDLALQALASFLSGSLG 235

BLAST of Cla97C01G002010.1 vs. TrEMBL
Match: tr|A0A251PAK8|A0A251PAK8_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G117200 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 8.8e-90
Identity = 170/237 (71.73%), Positives = 198/237 (83.54%), Query Frame = 0

Query: 1   MASSPPSKRRRKNLTFDDAFETLLSTATSSSVNRLSPVVVFAHGAGAPSSSEWMIRWKDM 60
           MASSPPSK          A +  +S  +SS++ +LSPVVVFAHGAGAPSSS+WMIRWKDM
Sbjct: 1   MASSPPSKXXXXXXXXXXAND--MSATSSSTLEKLSPVVVFAHGAGAPSSSDWMIRWKDM 60

Query: 61  LGKALHAAEVVTFDYPYISGG-RKSPPKAEKLVPYHVEIVKRTIAKYPGHPLILAGKSMG 120
           LGKALHA EVVTFDYPYISGG R++PPKAEKLV +H ++V + +AKYPGHPLILAGKSMG
Sbjct: 61  LGKALHAVEVVTFDYPYISGGKRRAPPKAEKLVDFHADVVGKAVAKYPGHPLILAGKSMG 120

Query: 121 SRVSCMVACEEDIHASAIICLGYPLKGLKGDVRDLTLLQVTVPIMFVQGSRDALCPLEKL 180
           SRVSCMVAC+E I ASAI+CLGYPLKG+ G VRD  LLQ++VPIM VQGS+DALCPLEKL
Sbjct: 121 SRVSCMVACKEGIRASAILCLGYPLKGINGAVRDEILLQLSVPIMLVQGSKDALCPLEKL 180

Query: 181 EDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENLAAQALATFVSGCLG 237
           E  RK+MK  SGLHVIDGGDHSF+I KK+LQ  G ++DEAE+LA QALA+F+SG LG
Sbjct: 181 EVTRKKMKCPSGLHVIDGGDHSFKIGKKHLQTTGLTQDEAEDLALQALASFLSGSLG 235

BLAST of Cla97C01G002010.1 vs. Swiss-Prot
Match: sp|Q499B3|KANL3_DANRE (KAT8 regulatory NSL complex subunit 3 OS=Danio rerio OX=7955 GN=kansl3 PE=2 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 1.1e-13
Identity = 41/133 (30.83%), Positives = 78/133 (58.65%), Query Frame = 0

Query: 106 YPGHPLILAGKSMGSRVSCMVACEEDIHASAIICLGYPLK---GLKGDVRDLTLLQVTVP 165
           +P  P+IL G ++GS ++C V+  E  + +A++CLG+PL+   G +GDV D  LL +  P
Sbjct: 342 FPHKPIILVGWNVGSLMACHVSLME--YMTAVVCLGFPLQTISGPRGDVDD-PLLDMKTP 401

Query: 166 IMFVQGSRDALCPLEKLEDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKGSSKDEAENL 225
           ++FV G     C  E +E+ R+++++ + L V+ G D S +I+   ++ +G ++   +  
Sbjct: 402 VLFVVGQNALQCSPENMEEFREKIRADNSLVVVGGADDSLRINSTKMKTEGLTQTMVDRC 461

Query: 226 AAQALATFVSGCL 236
               +  F++G L
Sbjct: 462 IQDEIVDFLTGVL 471

BLAST of Cla97C01G002010.1 vs. Swiss-Prot
Match: sp|Q3TUU5|TEX30_MOUSE (Testis-expressed protein 30 OS=Mus musculus OX=10090 GN=Tex30 PE=1 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 2.5e-10
Identity = 36/111 (32.43%), Positives = 59/111 (53.15%), Query Frame = 0

Query: 101 RTIAKYPGHPLILAGKSMGSRVSCMVAC-----EEDIHASAIICLGYPLKGLKGD--VRD 160
           +T  +Y    + L G+SMGSR +  V C     + D     +IC+ YPL   K    +RD
Sbjct: 86  KTSGEYKLAGVFLGGRSMGSRAAASVMCHTEPDDADDFVRGLICISYPLHHPKQQHKLRD 145

Query: 161 LTLLQVTVPIMFVQGSRDALCPLEKLEDIRKRMKSISGLHVIDGGDHSFQI 205
             L ++  P++FV GS D +C    LE + ++M++ S +H I+  +HS  +
Sbjct: 146 EDLFRIKDPVLFVSGSADEMCEKNLLEKVAQKMQAPSKIHWIEKANHSMAV 196

BLAST of Cla97C01G002010.1 vs. Swiss-Prot
Match: sp|Q5JUR7|TEX30_HUMAN (Testis-expressed protein 30 OS=Homo sapiens OX=9606 GN=TEX30 PE=2 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 1.6e-09
Identity = 35/111 (31.53%), Positives = 58/111 (52.25%), Query Frame = 0

Query: 101 RTIAKYPGHPLILAGKSMGSRVSCMVAC-----EEDIHASAIICLGYPLKGLKGD--VRD 160
           +T  +Y    + L G+SMGSR +  V C     + D     +IC+ YPL   K    +RD
Sbjct: 86  KTSGEYKLAGVFLGGRSMGSRAAASVMCHIEPDDGDDFVRGLICISYPLHHPKQQHKLRD 145

Query: 161 LTLLQVTVPIMFVQGSRDALCPLEKLEDIRKRMKSISGLHVIDGGDHSFQI 205
             L ++  P++FV GS D +C    LE + ++M++   +H I+  +HS  +
Sbjct: 146 EDLFRLKEPVLFVSGSADEMCEKNLLEKVAQKMQAPHKIHWIEKANHSMAV 196

BLAST of Cla97C01G002010.1 vs. Swiss-Prot
Match: sp|Q3ZC52|TEX30_BOVIN (Testis-expressed protein 30 OS=Bos taurus OX=9913 GN=TEX30 PE=2 SV=2)

HSP 1 Score: 63.5 bits (153), Expect = 3.5e-09
Identity = 33/101 (32.67%), Positives = 54/101 (53.47%), Query Frame = 0

Query: 111 LILAGKSMGSRVSCMVAC-----EEDIHASAIICLGYPLKGLKGD--VRDLTLLQVTVPI 170
           + L G+SMGSR +  V C     + D     +IC+ YPL   K    +RD  L ++  P+
Sbjct: 95  VFLGGRSMGSRAAASVLCHIEPDDADDFVRGLICISYPLHHPKQQHKLRDEDLFRIKDPV 154

Query: 171 MFVQGSRDALCPLEKLEDIRKRMKSISGLHVIDGGDHSFQI 205
           +FV GS D +C    LE + ++M++   +H I+  +HS  +
Sbjct: 155 LFVSGSADEMCEKNLLEKVAQKMQAPHKIHWIEKANHSMAV 195

BLAST of Cla97C01G002010.1 vs. TAIR10
Match: AT5G41850.1 (alpha/beta-Hydrolases superfamily protein)

HSP 1 Score: 274.6 bits (701), Expect = 5.6e-74
Identity = 130/202 (64.36%), Positives = 165/202 (81.68%), Query Frame = 0

Query: 36  SPVVVFAHGAGAPSSSEWMIRWKDMLGKALHAAEVVTFDYPYISGGRKS-PPKAEKLVPY 95
           SPVV+FAHGAGAPSSS+WMIRWK+ML K L A EVVTFDYPY++ G+K   PKAEKL+ +
Sbjct: 19  SPVVIFAHGAGAPSSSDWMIRWKEMLKKTLEAVEVVTFDYPYLADGKKRVAPKAEKLIEF 78

Query: 96  HVEIVKRTIAKYPGHPLILAGKSMGSRVSCMV-ACEEDIHASAIICLGYPLKGLKGDVRD 155
           H+ +VK T AK+PGHPLIL GKSMGSRVSCMV A  ED+  SA+ICLGYPLKG KG +RD
Sbjct: 79  HLNVVKETAAKFPGHPLILVGKSMGSRVSCMVSAVNEDVTVSAVICLGYPLKGAKGAIRD 138

Query: 156 LTLLQVTVPIMFVQGSRDALCPLEKLEDIRKRMKSISGLHVIDGGDHSFQISKKYLQGKG 215
            TLL++ VP+MFVQGS+D +CPL KLE +  +MK+++ +HVIDGGDHSF+I KK+L+ K 
Sbjct: 139 ETLLEMGVPVMFVQGSKDPMCPLNKLEAVCNKMKAVTEVHVIDGGDHSFKIGKKHLETKE 198

Query: 216 SSKDEAENLAAQALATFVSGCL 236
            +++E E++A +A+A FVS  L
Sbjct: 199 LTQEEVEDVAMKAIAAFVSKSL 220

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008437385.1  1.8e-123  95.38     PREDICTED: KAT8 regulatory NSL complex subunit 3 [Cucumis melo]
XP_004143872.1  1.2e-122  94.54     PREDICTED: KAT8 regulatory NSL complex subunit 3 [Cucumis sativus] >KGN50026.1 h...
XP_022146002.1  7.8e-122  93.72     KAT8 regulatory NSL complex subunit 3 [Momordica charantia]
XP_022958507.1  1.5e-120  93.62     KAT8 regulatory NSL complex subunit 3-like [Cucurbita moschata]
XP_022996066.1  2.5e-120  94.47     KAT8 regulatory NSL complex subunit 3-like [Cucurbita maxima]

Match Name                      E-value   Identity  Description
tr|A0A1S3AUG6|A0A1S3AUG6_CUCME  1.2e-123  95.38     KAT8 regulatory NSL complex subunit 3 OS=Cucumis melo OX=3656 GN=LOC103482816 PE...
tr|A0A0A0KQS0|A0A0A0KQS0_CUCSA  7.9e-123  94.54     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G150440 PE=4 SV=1
tr|M5WAG3|M5WAG3_PRUPE          8.8e-90   71.73     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G117200 PE=4 SV=1
tr|A0A251P732|A0A251P732_PRUPE  8.8e-90   71.73     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G117200 PE=4 SV=1
tr|A0A251PAK8|A0A251PAK8_PRUPE  8.8e-90   71.73     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G117200 PE=4 SV=1

Match Name             E-value  Identity  Description
sp|Q499B3|KANL3_DANRE  1.1e-13  30.83     KAT8 regulatory NSL complex subunit 3 OS=Danio rerio OX=7955 GN=kansl3 PE=2 SV=1
sp|Q3TUU5|TEX30_MOUSE  2.5e-10  32.43     Testis-expressed protein 30 OS=Mus musculus OX=10090 GN=Tex30 PE=1 SV=1
sp|Q5JUR7|TEX30_HUMAN  1.6e-09  31.53     Testis-expressed protein 30 OS=Homo sapiens OX=9606 GN=TEX30 PE=2 SV=1
sp|Q3ZC52|TEX30_BOVIN  3.5e-09  32.67     Testis-expressed protein 30 OS=Bos taurus OX=9913 GN=TEX30 PE=2 SV=2

Match Name   E-value  Identity  Description
AT5G41850.1  5.6e-74  64.36     alpha/beta-Hydrolases superfamily protein

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR026555  NSL3/Tex30
IPR029059  AB_hydrolase_5
IPR029058  AB_hydrolase

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cla97C01G002010  Cla97C01G002010  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name       Unique Name                Type
Cla97C01G002010.1  Cla97C01G002010.1-protein  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name              Unique Name               Type
Cla97C01G002010.1.exon.1  Cla97C01G002010.1.exon.1  exon
Cla97C01G002010.1.exon.2  Cla97C01G002010.1.exon.2  exon
Cla97C01G002010.1.exon.3  Cla97C01G002010.1.exon.3  exon
Cla97C01G002010.1.exon.4  Cla97C01G002010.1.exon.4  exon
Cla97C01G002010.1.exon.5  Cla97C01G002010.1.exon.5  exon
Cla97C01G002010.1.exon.6  Cla97C01G002010.1.exon.6  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
Cla97C01G002010.1.CDS.1  Cla97C01G002010.1.CDS.1  CDS
Cla97C01G002010.1.CDS.2  Cla97C01G002010.1.CDS.2  CDS
Cla97C01G002010.1.CDS.3  Cla97C01G002010.1.CDS.3  CDS
Cla97C01G002010.1.CDS.4  Cla97C01G002010.1.CDS.4  CDS
Cla97C01G002010.1.CDS.5  Cla97C01G002010.1.CDS.5  CDS
Cla97C01G002010.1.CDS.6  Cla97C01G002010.1.CDS.6  CDS


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                                             Source       Source Term         Source Description               Alignment
IPR029058  Alpha/Beta hydrolase fold                                                   GENE3D       G3DSA:3.40.50.1820  -                                coord: 36..212; e-value: 9.9E-23; score: 82.6
IPR029058  Alpha/Beta hydrolase fold                                                   SUPERFAMILY  SSF53474            alpha/beta-Hydrolases            coord: 32..205
IPR029059  Alpha/beta hydrolase fold-5                                                 PFAM         PF12695             Abhydrolase_5                    coord: 99..221; e-value: 1.2E-6; score: 28.2
None       No IPR available                                                            PANTHER      PTHR13136:SF9       SUBFAMILY NOT NAMED              coord: 1..235
IPR026555  KAT8 regulatory NSL complex subunit 3/Testis-expressed sequence 30 protein  PANTHER      PTHR13136           TESTIS DEVELOPMENT PROTEIN PRTD  coord: 1..235