CmoCh12G004390 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh12G004390
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: SAP30_Sin3_bdg domain-containing protein
Location: Cmo_Chr12: 2692465 .. 2695452 (+)
RNA-Seq Expression: CmoCh12G004390
Synteny: CmoCh12G004390
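
The Location field packs the chromosome, start, end, and strand into one string. The following is a minimal Python sketch for unpacking it and computing the gene span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

# Pattern for strings such as "Cmo_Chr12: 2692465 .. 2695452 (+)".
_LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = _LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cmo_Chr12: 2692465 .. 2695452 (+)")
gene_span = span_length(start, end)  # 2988 under the inclusive assumption
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, so the convention is worth confirming against the source annotation file.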
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTGCTCTTTGCTTTCACTCGCAAAGCGCAAACACTCCCTTTCGCCGTGTTTTTTCTCAGCGAAAAAATCCCAAAAAATTTCCCACTGATCTCTCAAATTCCCACTGTTCTGTTCCCCATTGCATGGCCTTTCTTCTACGTCGATTCTTCCATTTTCATATGTGAATTTCTTCTCTGTTTTTCCCCTTTTTGAGAATCTCCATTTCAGTTTCAGAGATTTGGAAATGATTGAAGCTGTGCAGAGTTCCATCAATGGCGCTTTCTCGCAGTTTCAGAGCTGTGGGGATAGCAGCGAGGAGGAGCTTTCGGTTCTTCCTCGCCATACTAAGGTCGTCGTTACCGGAAATAATCGGACCAAATCGGTTCTCGTTGGACTTCATGGCGTCGTCAAGAAAGCCGTCGGTCTTGGCGGCTGGCATTGGCTGGTATGGATTCTATTAGAAACATTGAATTTCTTTGGATTCTCATTGAATTTCTAGATTCTGTGGTTTTCTAGGGTTAATTTCATGTTTTCTTATCAATGTCTGGTCAATTTCTTGCGTTTTTCAGTTCAATTTTGTGTTTCAGATCTTGATTTGTTGATTTTATTTGGGGTTTAGATCGTGTCTCATTGAATTCTAGATTCGAATTCTCCAATTTCTATCTTCTTCTCTTTAGTTGCATGTTTATTTAATCATTGGATCTCGTAATTGTGATAGCTTTTTCATGATTGCTTTGCTTATTAACTTTTGTTTATGCTGATTTAATTTGTTAGATGGAATCATGAGGGGTTTTGATTGTTAGGACTGTAAGTTTTGATAATTTTTCATGAATGTGGGTTTGAATTGAGGGGAAATGCAGGTTTTAACAAATGGTATAGAGGTGAAACTACAGCGGAATGCGCTTAGTGTGATCGAGGCTCCAACGGGCAATGAAGAAGACGATGAACTCGAATTCGAGAACTCGACATGGAATGGATTGGATATGGGTGAGTTTTGCATCTGCTGATGGTTCAATGTCTTTGCTCAATCATTCTTCCTTTGCTATTTTTGTTAGCCGTCTTAGAATTGAGTTTATCCTATGATTGTTGATTGTTTGTGTCTTGTGTCTGGCTGTTTTTGGATTTATGTCTTGTTTCTTTCTCTTATTTACTTGTTAGGAGTTAATGATTTGATCATTTGAAGTGAGGATCATTTTTTGCCCCACCCACCGTACCCCTTTTTGTTGTTGTGGTTTTTATTTCGGTTGGAGAGGAGAACGAAACATTCTTTATAAGGGTGTGAAAACCTCTCTCTAGTAGACACATTTTAAAAACCGTCAAGTTGACGGCGATACATAACCAGCCAAAGAGTCACTAGTGGTAGGTTTGGGGGGTTACAAATAGTATTAGAGCTAGACACTGGGCAGTGTGCCTGTGAGGATGCTGGGCCCCCAAGGGGGTGGATTGTGAGATCTCATATCGGTTAGAGAGAACGAAACATTCCTTATAAGGATGTGGAAACCTCTCTCTAGTAGACATGTTTTAAAAACCGTGAGGCTGATGACAATACGTAACGAGCCAAAATGGAGCATATTTGCTAGCGGTGGGTTTGAGAGGTTACAAATGATATCAGAGCCAGACACCGTGCGGTGTGCCAGCGAAGATGCTGGACCCCAAGCGGGGTGGATTGTGAGATCCCACATCGGTTAGAGAGAGGAACGAAACATTCCTTATAAGGCTGGTCAAAGCAAACAATATCGGCTAGCGGTGGGTTTGTCATGTTTTAGTTTCGAGTCTACCGTGATTAAAACATAAGAACTTTAACCTTTACAAGTATCTTCACTCCAATTCTTGAAAAGGGTTTCGAAAACGAAAATAAGTTGTAAATTCTTCTCCGACCAGAAGTTGTGGCATTAGGTTGCTATTGATGACAACATAGGATCCTTTGTGAGCTCACTTTATATTTTGTCTTGAATTGTTGTTCTCTTCATGTTCTTTATCCTTTTCTGACCCTTTATCACTCATTCACGATAATGCTTTCA
ACTATTCATTATTCTCAGCATCTGATGACGCCCAAAAATCCCACAAATCAAGGCATAAATTTCACAAATCATGTGGGTCATCTAACAAGACTATAAGCAGATCGCTTTCCTGCGACTCACAGTCAAAGAGCTCGGTTTCTGCACCGCAACGATCCATGGTATATTCCTAAAATGGTTTGCACGTCAAAACGTGTTTAGCCTCCTCGTTTGAATCAATTCCCTGATGTATTATCTCTTCCTATGTAGAGGGTTGACCTTAGTAAACTAGAGATGACTGCATTGTGGAGATATTGGCGACACTTCAATCTCGTAAGTTTTGAAGGTCGATTCGGTTCAATAGTCGTTACTAAGAAATCTCATTCTTGACACCGAGATTGATGGGTTTGTGCAGGTTGATGCCTTTCCCAACCCGTCGAAAGAGCAATTAGTAGATCTAGTTCAAAGGCATTTCATGTCACTGGTACCCGACCACGCCTTACTTCCTAAAAACACTATCAAACGATCCTATTTTCACATGGATCGAAACCGTTTTTCCTTCTATCTATTGACGATTTCTTTCTTTTTCGGTTGCGAAGCAACTGGATGAGTTGCAGGTTATAATGGGTTTTGTGAAGGCTGCAAAGAGACTGAAAACAGTGTGCAAATGAGAGGAGAAGAACTGGGGAATCCTTTGAGTTGATTCTCATGCTAACAGATAATTCAGGTGAATGAGAAGTGTTTAGGTTCTCTCTGTAACATATGGATCGATCGATCGGAAATGGGTTTTGTGGGTCGAGATCGGAACGGGTTAGTATGATTCTGTTGTAGTTAATAGAGTAATATCTTGTTTTAGTGACGATGTTACTTGCAGTGTAGATATGTGTAAACTGCAAATATTAGAGACCAAAAAACATGTTTCTAAGCTTTTGTTTAATGTAATGCTGACTGACTTTAGGGGTGTTTTGATTGCCCAAAAAATGTACAAATTTTGCTGTTCCATATCCTAATG

mRNA sequence

CTGCTCTTTGCTTTCACTCGCAAAGCGCAAACACTCCCTTTCGCCGTGTTTTTTCTCAGCGAAAAAATCCCAAAAAATTTCCCACTGATCTCTCAAATTCCCACTGTTCTGTTCCCCATTGCATGGCCTTTCTTCTACGTCGATTCTTCCATTTTCATATGTGAATTTCTTCTCTGTTTTTCCCCTTTTTGAGAATCTCCATTTCAGTTTCAGAGATTTGGAAATGATTGAAGCTGTGCAGAGTTCCATCAATGGCGCTTTCTCGCAGTTTCAGAGCTGTGGGGATAGCAGCGAGGAGGAGCTTTCGGTTCTTCCTCGCCATACTAAGGTCGTCGTTACCGGAAATAATCGGACCAAATCGGTTCTCGTTGGACTTCATGGCGTCGTCAAGAAAGCCGTCGGTCTTGGCGGCTGGCATTGGCTGATTCTGTGGTTTTCTAGGGTTAATTTCATGTTTTCTTATCAATGTCTGGTCAATTTCTTGCGTTTTTCAGTTCAATTTTGTGTTTCAGATCTTGATTTGTTGATTTTATTTGGGGTTTTAACAAATGGTATAGAGGTGAAACTACAGCGGAATGCGCTTAGTGTGATCGAGGCTCCAACGGGCAATGAAGAAGACGATGAACTCGAATTCGAGAACTCGACATGGAATGGATTGGATATGGCATCTGATGACGCCCAAAAATCCCACAAATCAAGGCATAAATTTCACAAATCATGTGGGTCATCTAACAAGACTATAAGCAGATCGCTTTCCTGCGACTCACAGTCAAAGAGCTCGGTTTCTGCACCGCAACGATCCATGAGGGTTGACCTTAGTAAACTAGAGATGACTGCATTGTGGAGATATTGGCGACACTTCAATCTCGTTGATGCCTTTCCCAACCCGTCGAAAGAGCAATTAGTAGATCTAGTTCAAAGGCATTTCATGTCACTGCAACTGGATGAGTTGCAGGTTATAATGGGTTTTGTGAAGGCTGCAAAGAGACTGAAAACAGTGTGCAAATGAGAGGAGAAGAACTGGGGAATCCTTTGAGTTGATTCTCATGCTAACAGATAATTCAGGTGAATGAGAAGTGTTTAGGTTCTCTCTGTAACATATGGATCGATCGATCGGAAATGGGTTTTGTGGGTCGAGATCGGAACGGGTTAGTATGATTCTGTTGTAGTTAATAGAGTAATATCTTGTTTTAGTGACGATGTTACTTGCAGTGTAGATATGTGTAAACTGCAAATATTAGAGACCAAAAAACATGTTTCTAAGCTTTTGTTTAATGTAATGCTGACTGACTTTAGGGGTGTTTTGATTGCCCAAAAAATGTACAAATTTTGCTGTTCCATATCCTAATG

Coding sequence (CDS)

ATGATTGAAGCTGTGCAGAGTTCCATCAATGGCGCTTTCTCGCAGTTTCAGAGCTGTGGGGATAGCAGCGAGGAGGAGCTTTCGGTTCTTCCTCGCCATACTAAGGTCGTCGTTACCGGAAATAATCGGACCAAATCGGTTCTCGTTGGACTTCATGGCGTCGTCAAGAAAGCCGTCGGTCTTGGCGGCTGGCATTGGCTGATTCTGTGGTTTTCTAGGGTTAATTTCATGTTTTCTTATCAATGTCTGGTCAATTTCTTGCGTTTTTCAGTTCAATTTTGTGTTTCAGATCTTGATTTGTTGATTTTATTTGGGGTTTTAACAAATGGTATAGAGGTGAAACTACAGCGGAATGCGCTTAGTGTGATCGAGGCTCCAACGGGCAATGAAGAAGACGATGAACTCGAATTCGAGAACTCGACATGGAATGGATTGGATATGGCATCTGATGACGCCCAAAAATCCCACAAATCAAGGCATAAATTTCACAAATCATGTGGGTCATCTAACAAGACTATAAGCAGATCGCTTTCCTGCGACTCACAGTCAAAGAGCTCGGTTTCTGCACCGCAACGATCCATGAGGGTTGACCTTAGTAAACTAGAGATGACTGCATTGTGGAGATATTGGCGACACTTCAATCTCGTTGATGCCTTTCCCAACCCGTCGAAAGAGCAATTAGTAGATCTAGTTCAAAGGCATTTCATGTCACTGCAACTGGATGAGTTGCAGGTTATAATGGGTTTTGTGAAGGCTGCAAAGAGACTGAAAACAGTGTGCAAATGA

Protein sequence

MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVGLGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNALSVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCDSQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQLDELQVIMGFVKAAKRLKTVCK
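
The protein sequence is the conceptual translation of the coding sequence. The sketch below shows how such a translation can be reproduced in Python. It assumes the standard genetic code (the page does not state which table was used), and the function name `translate` is hypothetical rather than part of any pipeline used for this annotation.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS above, followed by its TGA stop codon.
prefix = translate("ATGATTGAAGCTGTGCAGTGA")  # "MIEAVQ"
```

Passing the full CDS string to `translate` and comparing the result with the listed protein is a quick consistency check on the two records.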
Homology
BLAST of CmoCh12G004390 vs. ExPASy TrEMBL
Match: A0A6J1GIW5 (uncharacterized protein LOC111454286 OS=Cucurbita moschata OX=3662 GN=LOC111454286 PE=3 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 4.0e-113
Identity = 223/261 (85.44%), Positives = 223/261 (85.44%), Query Frame = 0

Query: 1   MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG 60
           MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG
Sbjct: 1   MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG 60

Query: 61  LGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNAL 120
           LGGWHWL                                      VLTNGIEVKLQRNAL
Sbjct: 61  LGGWHWL--------------------------------------VLTNGIEVKLQRNAL 120

Query: 121 SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCD 180
           SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCD
Sbjct: 121 SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCD 180

Query: 181 SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL 240
           SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL
Sbjct: 181 SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL 223

Query: 241 DELQVIMGFVKAAKRLKTVCK 262
           DELQVIMGFVKAAKRLKTVCK
Sbjct: 241 DELQVIMGFVKAAKRLKTVCK 223

BLAST of CmoCh12G004390 vs. ExPASy TrEMBL
Match: A0A6J1KN45 (uncharacterized protein LOC111496089 OS=Cucurbita maxima OX=3661 GN=LOC111496089 PE=3 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 1.7e-111
Identity = 220/261 (84.29%), Positives = 221/261 (84.67%), Query Frame = 0

Query: 1   MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG 60
           MIEAVQ SINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG
Sbjct: 1   MIEAVQRSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG 60

Query: 61  LGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNAL 120
           LGGWHWL                                      VLTNGIEVKLQRNAL
Sbjct: 61  LGGWHWL--------------------------------------VLTNGIEVKLQRNAL 120

Query: 121 SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCD 180
           SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSS+KTISRS SCD
Sbjct: 121 SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSHKTISRSFSCD 180

Query: 181 SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL 240
           SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL
Sbjct: 181 SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL 223

Query: 241 DELQVIMGFVKAAKRLKTVCK 262
           DELQVIMGFVKAAKRLKTVCK
Sbjct: 241 DELQVIMGFVKAAKRLKTVCK 223

BLAST of CmoCh12G004390 vs. ExPASy TrEMBL
Match: A0A5A7VFZ9 (Histone deacetylase complex subunit SAP30/SAP30-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005970 PE=3 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 1.3e-100
Identity = 203/261 (77.78%), Positives = 209/261 (80.08%), Query Frame = 0

Query: 1   MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG 60
           MIEAV+SSING FS  QSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVKKAVG
Sbjct: 1   MIEAVESSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNAL 120
           LGGWHWL                                      VLTNGIEVKLQRNAL
Sbjct: 61  LGGWHWL--------------------------------------VLTNGIEVKLQRNAL 120

Query: 121 SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCD 180
           SVIEAPTGNEEDD+LEFEN  WN +DMASDDAQKSHKSRHK HKS GSS+KT+SRSLSCD
Sbjct: 121 SVIEAPTGNEEDDDLEFENLQWNRIDMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCD 180

Query: 181 SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL 240
           SQSKSSVSAPQ S +VDLSKLEM ALWRYWRHFNLVDA PNPSKEQLVDLVQRHFMS QL
Sbjct: 181 SQSKSSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQQL 223

Query: 241 DELQVIMGFVKAAKRLKTVCK 262
           DELQVIMGFVKAAKRLKTVCK
Sbjct: 241 DELQVIMGFVKAAKRLKTVCK 223

BLAST of CmoCh12G004390 vs. ExPASy TrEMBL
Match: A0A0A0LNK2 (SAP30_Sin3_bdg domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G361400 PE=3 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 1.3e-100
Identity = 203/261 (77.78%), Positives = 209/261 (80.08%), Query Frame = 0

Query: 1   MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG 60
           MIEAV+SSING FS  QSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVKKAVG
Sbjct: 1   MIEAVESSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNAL 120
           LGGWHWL                                      VLTNGIEVKLQRNAL
Sbjct: 61  LGGWHWL--------------------------------------VLTNGIEVKLQRNAL 120

Query: 121 SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCD 180
           SVIEAPTGNEEDD+LEFEN  WN +DMASDDAQKSHKSRHK HKS GSS+KT+SRSLSCD
Sbjct: 121 SVIEAPTGNEEDDDLEFENLQWNRIDMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCD 180

Query: 181 SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL 240
           SQSKSSVSAPQ S +VDLSKLEM ALWRYWRHFNLVDA PNPSKEQLVDLVQRHFMS QL
Sbjct: 181 SQSKSSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQQL 223

Query: 241 DELQVIMGFVKAAKRLKTVCK 262
           DELQVIMGFVKAAKRLKTVCK
Sbjct: 241 DELQVIMGFVKAAKRLKTVCK 223

BLAST of CmoCh12G004390 vs. ExPASy TrEMBL
Match: A0A1S3BAV7 (uncharacterized protein LOC103487947 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103487947 PE=3 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 1.3e-100
Identity = 203/261 (77.78%), Positives = 209/261 (80.08%), Query Frame = 0

Query: 1   MIEAVQSSINGAFSQFQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVG 60
           MIEAV+SSING FS  QSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVKKAVG
Sbjct: 1   MIEAVESSINGGFSHLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNAL 120
           LGGWHWL                                      VLTNGIEVKLQRNAL
Sbjct: 61  LGGWHWL--------------------------------------VLTNGIEVKLQRNAL 120

Query: 121 SVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCD 180
           SVIEAPTGNEEDD+LEFEN  WN +DMASDDAQKSHKSRHK HKS GSS+KT+SRSLSCD
Sbjct: 121 SVIEAPTGNEEDDDLEFENLQWNRIDMASDDAQKSHKSRHKLHKSSGSSHKTMSRSLSCD 180

Query: 181 SQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQL 240
           SQSKSSVSAPQ S +VDLSKLEM ALWRYWRHFNLVDA PNPSKEQLVDLVQRHFMS QL
Sbjct: 181 SQSKSSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQQL 223

Query: 241 DELQVIMGFVKAAKRLKTVCK 262
           DELQVIMGFVKAAKRLKTVCK
Sbjct: 241 DELQVIMGFVKAAKRLKTVCK 223

BLAST of CmoCh12G004390 vs. TAIR 10
Match: AT1G19330.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 297.0 bits (759), Expect = 1.5e-80
Identity = 171/265 (64.53%), Positives = 191/265 (72.08%), Query Frame = 0

Query: 1   MIEAVQSS--INGAFSQFQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVK 60
           M+EAV SS  +NG F Q QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVK
Sbjct: 1   MLEAVDSSGVVNGGFPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60

Query: 61  KAVGLGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQ 120
           KAVGLGGWHWL                                      VLTNGIEVKLQ
Sbjct: 61  KAVGLGGWHWL--------------------------------------VLTNGIEVKLQ 120

Query: 121 RNALSVIEAPTGNEEDDELEFENSTWNGLDMASDDAQKSHKSRHKFHKSCGSSNKTISRS 180
           RNALSV+E PTGNEEDD+L+FEN+  NG DM S+D  K HKS+ +  +S  SS+KT+SRS
Sbjct: 121 RNALSVLEPPTGNEEDDDLDFENTQRNGSDMTSEDTLKPHKSKLRGQRSSRSSHKTMSRS 180

Query: 181 LSCDSQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFM 240
           LS DSQSKSS   P  +M+VDLSKLEM AL  YWRHFNLVDA PNPSKEQL+D+VQRHFM
Sbjct: 181 LSSDSQSKSSGFTPPENMKVDLSKLEMPALLNYWRHFNLVDAIPNPSKEQLIDIVQRHFM 227

Query: 241 SLQLDELQVIMGFVKAAKRLKTVCK 262
           S Q+DELQVI+GFV+AAKR+K  CK
Sbjct: 241 SQQMDELQVIVGFVQAAKRMKKACK 227

BLAST of CmoCh12G004390 vs. TAIR 10
Match: AT1G19330.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 292.4 bits (747), Expect = 3.7e-79
Identity = 172/270 (63.70%), Positives = 192/270 (71.11%), Query Frame = 0

Query: 1   MIEAVQSS--INGAFSQFQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVK 60
           M+EAV SS  +NG F Q QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVK
Sbjct: 1   MLEAVDSSGVVNGGFPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60

Query: 61  KAVGLGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQ 120
           KAVGLGGWHWL                                      VLTNGIEVKLQ
Sbjct: 61  KAVGLGGWHWL--------------------------------------VLTNGIEVKLQ 120

Query: 121 RNALSVIEAPTGNEEDDELEFENSTWNGLDM-----ASDDAQKSHKSRHKFHKSCGSSNK 180
           RNALSV+E PTGNEEDD+L+FEN+  NG DM     AS+D  K HKS+ +  +S  SS+K
Sbjct: 121 RNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPASEDTLKPHKSKLRGQRSSRSSHK 180

Query: 181 TISRSLSCDSQSKSSVSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLV 240
           T+SRSLS DSQSKSS   P  +M+VDLSKLEM AL  YWRHFNLVDA PNPSKEQL+D+V
Sbjct: 181 TMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLNYWRHFNLVDAIPNPSKEQLIDIV 232

Query: 241 QRHFMSLQLDELQVIMGFVKAAKRLKTVCK 262
           QRHFMS Q+DELQVI+GFV+AAKR+K  CK
Sbjct: 241 QRHFMSQQMDELQVIVGFVQAAKRMKKACK 232

BLAST of CmoCh12G004390 vs. TAIR 10
Match: AT1G19330.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75060.1). )

HSP 1 Score: 287.7 bits (735), Expect = 9.2e-78
Identity = 171/271 (63.10%), Positives = 192/271 (70.85%), Query Frame = 0

Query: 1   MIEAVQSS--INGAFSQFQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVK 60
           M+EAV SS  +NG F Q QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVK
Sbjct: 1   MLEAVDSSGVVNGGFPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60

Query: 61  KAVGLGGWHWLILWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQ 120
           KAVGLGGWHWL                                      VLTNGIEVKLQ
Sbjct: 61  KAVGLGGWHWL--------------------------------------VLTNGIEVKLQ 120

Query: 121 RNALSVIEAPTGNEEDDELEFENSTWNGLDM-----ASDDAQKSHKSRHKFHKSCGSSNK 180
           RNALSV+E PTGNEEDD+L+FEN+  NG DM     AS+D  K HKS+ +  +S  SS+K
Sbjct: 121 RNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPASEDTLKPHKSKLRGQRSSRSSHK 180

Query: 181 TISRSLSCDSQSKSS-VSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDL 240
           T+SRSLS DSQSKSS  + P+   +VDLSKLEM AL  YWRHFNLVDA PNPSKEQL+D+
Sbjct: 181 TMSRSLSSDSQSKSSGFTPPENMQKVDLSKLEMPALLNYWRHFNLVDAIPNPSKEQLIDI 233

Query: 241 VQRHFMSLQLDELQVIMGFVKAAKRLKTVCK 262
           VQRHFMS Q+DELQVI+GFV+AAKR+K  CK
Sbjct: 241 VQRHFMSQQMDELQVIVGFVQAAKRMKKACK 233

BLAST of CmoCh12G004390 vs. TAIR 10
Match: AT1G75060.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G19330.2); Has 104 Blast hits to 104 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 268.9 bits (686), Expect = 4.4e-72
Identity = 153/255 (60.00%), Positives = 179/255 (70.20%), Query Frame = 0

Query: 11  GAFSQFQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVGLGGWHWLI 70
           G FSQ QSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVKKAVGLGGWHWL 
Sbjct: 17  GGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWL- 76

Query: 71  LWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNALSVIEAPTG 130
                                                VLTNGIEVKLQRNALSV+E PTG
Sbjct: 77  -------------------------------------VLTNGIEVKLQRNALSVLEHPTG 136

Query: 131 NEEDDELEFENST-WN-GLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCDSQSKSS 190
           NEED++LE ++ST WN   DM ++D  K HKS+ + H+S   S K + R +SCDS SK S
Sbjct: 137 NEEDNDLEVDHSTQWNHPSDMTTEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKIS 196

Query: 191 VSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQLDELQVI 250
              P+ +M+VDL+KL+M AL RYWRHFNLVDA PNP+KEQL+D++QRHFMS Q+DELQVI
Sbjct: 197 SITPRLNMKVDLTKLDMAALLRYWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVI 233

Query: 251 MGFVKAAKRLKTVCK 262
           +GFV+AA  +K  C+
Sbjct: 257 VGFVQAATGMKKACQ 233

BLAST of CmoCh12G004390 vs. TAIR 10
Match: AT1G75060.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G19330.2); Has 104 Blast hits to 104 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 263.8 bits (673), Expect = 1.4e-70
Identity = 153/255 (60.00%), Positives = 178/255 (69.80%), Query Frame = 0

Query: 11  GAFSQFQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLHGVVKKAVGLGGWHWLI 70
           G FSQ QSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGL GVVKKAVGLGGWHWL 
Sbjct: 17  GGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWL- 76

Query: 71  LWFSRVNFMFSYQCLVNFLRFSVQFCVSDLDLLILFGVLTNGIEVKLQRNALSVIEAPTG 130
                                                VLTNGIEVKLQRNALSV+E PTG
Sbjct: 77  -------------------------------------VLTNGIEVKLQRNALSVLEHPTG 136

Query: 131 NEEDDELEFENST-WN-GLDMASDDAQKSHKSRHKFHKSCGSSNKTISRSLSCDSQSKSS 190
           NEED++LE ++ST WN   DM ++D  K HKS+ + H+S   S K + R +SCDS SK S
Sbjct: 137 NEEDNDLEVDHSTQWNHPSDMTTEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKIS 196

Query: 191 VSAPQRSMRVDLSKLEMTALWRYWRHFNLVDAFPNPSKEQLVDLVQRHFMSLQLDELQVI 250
              P+ +M VDL+KL+M AL RYWRHFNLVDA PNP+KEQL+D++QRHFMS Q+DELQVI
Sbjct: 197 SITPRLNM-VDLTKLDMAALLRYWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVI 232

Query: 251 MGFVKAAKRLKTVCK 262
           +GFV+AA  +K  C+
Sbjct: 257 VGFVQAATGMKKACQ 232
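
The Identity value on each HSP line above is the count of identical aligned positions divided by the alignment length, shown as a percentage (for example 223/261 for the first TrEMBL hit). A small Python sketch of that arithmetic follows. The helper names are hypothetical, and rounding to two decimals is an assumption chosen to match the figures displayed on this page.

```python
import re

# Captures the "identical/length" pair from an HSP summary line.
_IDENTITY = re.compile(r"Identity = (\d+)/(\d+)")


def percent_identity(identical, alignment_length):
    """Percent identity, rounded to two decimals as displayed on the page."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * identical / alignment_length, 2)


def identity_from_hsp_line(line):
    """Recompute percent identity from the counts on an HSP summary line."""
    match = _IDENTITY.search(line)
    if match is None:
        raise ValueError("no 'Identity = n/m' field found")
    return percent_identity(int(match.group(1)), int(match.group(2)))


first_hit = identity_from_hsp_line("Identity = 223/261 (85.44%), Query Frame = 0")
```

Note that the denominator is the alignment length including gap columns, which is why the TAIR hits are scored over 265 to 271 positions rather than the query length.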

The following BLAST results are available for this feature:

Database: ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GIW5 | 4.0e-113 | 85.44 | uncharacterized protein LOC111454286 OS=Cucurbita moschata OX=3662 GN=LOC1114542...
A0A6J1KN45 | 1.7e-111 | 84.29 | uncharacterized protein LOC111496089 OS=Cucurbita maxima OX=3661 GN=LOC111496089...
A0A5A7VFZ9 | 1.3e-100 | 77.78 | Histone deacetylase complex subunit SAP30/SAP30-like protein OS=Cucumis melo var...
A0A0A0LNK2 | 1.3e-100 | 77.78 | SAP30_Sin3_bdg domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G361...
A0A1S3BAV7 | 1.3e-100 | 77.78 | uncharacterized protein LOC103487947 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...

Database: TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G19330.2 | 1.5e-80 | 64.53 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G19330.1 | 3.7e-79 | 63.70 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G19330.3 | 9.2e-78 | 63.10 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G75060.1 | 4.4e-72 | 60.00 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G75060.2 | 1.4e-70 | 60.00 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025718 | Histone deacetylase complex subunit SAP30, Sin3 binding domain | PFAM | PF13867 | SAP30_Sin3_bdg | coord: 201..254; e-value: 1.1E-20; score: 73.6
IPR038291 | SAP30, C-terminal domain superfamily | GENE3D | 6.10.160.20 | - | coord: 176..261; e-value: 1.6E-26; score: 94.0
IPR024145 | Histone deacetylase complex subunit SAP30/SAP30-like | PANTHER | PTHR13286 | SAP30 | coord: 1..69
IPR024145 | Histone deacetylase complex subunit SAP30/SAP30-like | PANTHER | PTHR13286 | SAP30 | coord: 106..261
None | No IPR available | PANTHER | PTHR13286:SF10 | HISTONE DEACETYLASE COMPLEX SUBUNIT SAP30 SIN3-BINDING PROTEIN | coord: 1..69
None | No IPR available | PANTHER | PTHR13286:SF10 | HISTONE DEACETYLASE COMPLEX SUBUNIT SAP30 SIN3-BINDING PROTEIN | coord: 106..261
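
The coord values in the InterPro annotations are residue ranges on the protein sequence (for example 201..254 for the PF13867 match). The Python sketch below shows one way to pull the matching residues out of a protein string. It assumes the ranges are 1-based and inclusive, which is not stated on this page, and the names `parse_coord` and `domain_slice` are hypothetical.

```python
import re

# Matches fields such as "coord: 201..254".
_COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(field):
    """Return (start, end) from an InterPro 'coord: a..b' field."""
    match = _COORD.search(field)
    if match is None:
        raise ValueError(f"no coord field in {field!r}")
    return int(match.group(1)), int(match.group(2))


def domain_slice(protein, start, end):
    """Residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]


# Toy example on the first ten residues of the protein listed above.
start, end = parse_coord("coord: 2..4")
fragment = domain_slice("MIEAVQSSIN", start, end)  # "IEA"
```

Applying `domain_slice` to the full protein with the PF13867 range would give the residues assigned to the Sin3 binding domain, provided the coordinate convention assumed here holds.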

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh12G004390.1 | CmoCh12G004390.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0000118 | histone deacetylase complex
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003712 | transcription coregulator activity