Cp4.1LG02g07350 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g07350
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: SAP30_Sin3_bdg domain-containing protein
Location: Cp4.1LG02: 713714 .. 716199 (+)
RNA-Seq Expression: Cp4.1LG02g07350
Synteny: Cp4.1LG02g07350
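
The Location field can be split into its parts and turned into a gene span length. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG02: 713714 .. 716199 (+)")
# Assumption: both endpoints are part of the feature (inclusive coordinates).
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 2486 bp on the plus strand of Cp4.1LG02.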
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACTCTCGCTTCTCTCTCTCACATAAAACACAGCGTCTCTCTCTCTCTTCTTTCTCTCTCTACTCTATCTCTCTCCTGCTCTGCTCTGTGTTTGTTGTCAACGAAAATTCCCAAATTTCTCAGCTATTTCGCTGTCTCTTGCCGCTGAAATTCCCGAAAACATGCTCCAAACTGCATGATTTTTCTGCATCGAGTAATATCTAGTCTCTTCTGCTCTTGAATTTCGCTCTATTTGCCCGAATTCATCGCCAACGGTTTTCGGAATGCTTGAAGCAGTGGAAAGCTCCGTTAATGGCGGCTTCTCGCAGCTGCAAAGCAGTGGCGATAGTAGTGAAGAGGAGCTTTCTGTTCTTCCTCGTCATACGAAAGTGGTCGTCACCGGAAACAACCGCACCAAATCGGTGCTCGTTGGACTTCAAGGCGTTGTTAAGAAGGCTGTTGGTTTGGGCGGATGGCATTGGCTGGTACGTCGCTTTGTTAAATTGGGCTGTTTCTGGTAATGAGCTTAGGGTACAATGAGTCGTGATTTGATCAATTTTGAGTATCAATGTGGTTTTCCCCAATTCCATGTTCTGTTTCTTCGCAGTTTTTTAATCCATTTTATCGCGATTTCTGGTTGTTTATTGCATCCATTTCTGAGTTTTGAATATGTTTTAGAATAGGTTTCGAATCTGAGGTTTGAGCTAAGAATTGCATGGTCAGTGTGATCTATGGATCTGGTGATTAGTGATGCGGATTCATTTCACAATGTCAAATCGATTTTATGCAATTAGGGATAATCTGGTGATTAGTCTGGCTGTTGCATTGATTTAAGAACAGTGGGATTGTTCTTTTGATATCTGCAGGTCACCATGATTATCTTGATATTCTAGGTTTAGCATTTAATCTTGAAATTAAAAACCCTCAGGTTCTCACAAATGGCATTGAAGTTAAGCTACAGAGAAATGCCCTCAGTGTAATCGAAGCCCCAACCGGCAACGAGGACGACGATGATCTCGAATTCGAAAACTTGCAATGGAATGGATCGGATATGGGTGAGTTTAGAATCTTTTCAATTCTTATTCATTGGACTGAAGAACAACATTGAACTGTCGATTCTTTAAATGGATTAGATCGGTTAATTCGATAAGCTGGCCTCGATTAAACATAGAAAAAAGGCACAAAGAGAGCTATTTGTCTGCACTTTCCTTTCAATTCTTATTAGGGAATTCATGATTAAAACACTTGCTTGTTTCGAGTGATTGTTTCATTGGCCCACTGTAGTTGTTGTGGAATGATATTCTTTTTAGGAACAGTGGTTGAGTTTGGTACTCTTTGGATTCTTTGCTCATTCAACTTTTACTTTCTTCCAAACCTTGTTGTGGCATTTATTCTCTCTTCTGGACCCTCCTCATCCTTTTTTCATAATGCTTTTGGTATTTTCATTTCCCAGCATCTGATGATACTTTGAAGTCCCATAGACTGAGATATAGGACACACAAATCGTCGGGTTCATCGCACAAAACTATCAGCCGATCCTTCTCGTATGAATCGCAGTCCAAGGGGTCTATTTCGACCCCTCGTGGGTCCATGGTACTTGAGAAACAGACAGACTCTTTTGTTTCATTTGTTGAAGTGAGATATAATTAGGCTCATAATCTAATAAGTTGGATCTTTCTTAACGTTCTTGTACAGAAGGTCGACCTCGGTAAACTGGAAATGTCTGCTCTATGGAGATATTGGCGACACTTCAATCTTGTAAGTTACCGTTTATTCGATAAATGTGAGCATATTTTGGGCACATGTCCATTAAACTGCAAGATTTTACCACCATGATTGAGAGAACTCTATATCGTCTAGGTCGATGCTTTTCCTAACCCATCGAAAGAGCAGTTGGTGGATGTCGTTCAAAGGCATTTCATGTCACAGGTAATCAATTTTATCAGTTTAACCGTTCGAAAAATACATCAGCTATGTTGTAAACTACATGACATGACTTGTTTGTTTGTTCACTTCACTCG
TGCAGCAACTAGACGAGTTGCAGGTGATTGTCGGGTTCGTCCATGCTGCAAAGAGACTCAAAACCGTTTGCAAATGACCGAGTAATGGCTGTAACACCAGGCTCATTCATCCATTTCTAACAAAAACACCGGTACCAAAATTCCCACTTCCCTGCTACATTGATGGTTCTCTGTAATATAAGAATCCTGGCTTGTTTTTGAGGTATTCATTGCCCGTTTGTTCGTGTAATTATGGTAGATAAATATAGGATATATAATATATATAAAATGTACGTATAGGCTAGCAGACCTTCTTAAACTGACAAGTCCTTGGTGACTTTGTTTTCCTAAACCTTGATTGAATGCTGACTTGGGAGAGTGGGTATCTTGAAATGAATGTGTACATTCCTTTAATTCTCTATTTATTTTGTCAATGAATCAAACAGTGTTAAGCTGTATTATAGACTAAGGGCTTAGAGGAAAAGAATCGGGACGGGGGCGGGTAGT

mRNA sequence

CACTCTCGCTTCTCTCTCTCACATAAAACACAGCGTCTCTCTCTCTCTTCTTTCTCTCTCTACTCTATCTCTCTCCTGCTCTGCTCTGTGTTTGTTGTCAACGAAAATTCCCAAATTTCTCAGCTATTTCGCTGTCTCTTGCCGCTGAAATTCCCGAAAACATGCTCCAAACTGCATGATTTTTCTGCATCGAGTAATATCTAGTCTCTTCTGCTCTTGAATTTCGCTCTATTTGCCCGAATTCATCGCCAACGGTTTTCGGAATGCTTGAAGCAGTGGAAAGCTCCGTTAATGGCGGCTTCTCGCAGCTGCAAAGCAGTGGCGATAGTAGTGAAGAGGAGCTTTCTGTTCTTCCTCGTCATACGAAAGTGGTCGTCACCGGAAACAACCGCACCAAATCGGTGCTCGTTGGACTTCAAGGCGTTGTTAAGAAGGCTGTTGGTTTGGGCGGATGGCATTGGCTGGTTCTCACAAATGGCATTGAAGTTAAGCTACAGAGAAATGCCCTCAGTGTAATCGAAGCCCCAACCGGCAACGAGGACGACGATGATCTCGAATTCGAAAACTTGCAATGGAATGGATCGGATATGGCATCTGATGATACTTTGAAGTCCCATAGACTGAGATATAGGACACACAAATCGTCGGGTTCATCGCACAAAACTATCAGCCGATCCTTCTCGTATGAATCGCAGTCCAAGGGGTCTATTTCGACCCCTCGTGGGTCCATGAAGGTCGACCTCGGTAAACTGGAAATGTCTGCTCTATGGAGATATTGGCGACACTTCAATCTTGTCGATGCTTTTCCTAACCCATCGAAAGAGCAGTTGGTGGATGTCGTTCAAAGGCATTTCATGTCACAGCAACTAGACGAGTTGCAGGTGATTGTCGGGTTCGTCCATGCTGCAAAGAGACTCAAAACCGTTTGCAAATGACCGAGTAATGGCTGTAACACCAGGCTCATTCATCCATTTCTAACAAAAACACCGGTACCAAAATTCCCACTTCCCTGCTACATTGATGGTTCTCTGTAATATAAGAATCCTGGCTTGTTTTTGAGGTATTCATTGCCCGTTTGTTCGTGTAATTATGGTAGATAAATATAGGATATATAATATATATAAAATGTACGTATAGGCTAGCAGACCTTCTTAAACTGACAAGTCCTTGGTGACTTTGTTTTCCTAAACCTTGATTGAATGCTGACTTGGGAGAGTGGGTATCTTGAAATGAATGTGTACATTCCTTTAATTCTCTATTTATTTTGTCAATGAATCAAACAGTGTTAAGCTGTATTATAGACTAAGGGCTTAGAGGAAAAGAATCGGGACGGGGGCGGGTAGT

Coding sequence (CDS)

ATGCTTGAAGCAGTGGAAAGCTCCGTTAATGGCGGCTTCTCGCAGCTGCAAAGCAGTGGCGATAGTAGTGAAGAGGAGCTTTCTGTTCTTCCTCGTCATACGAAAGTGGTCGTCACCGGAAACAACCGCACCAAATCGGTGCTCGTTGGACTTCAAGGCGTTGTTAAGAAGGCTGTTGGTTTGGGCGGATGGCATTGGCTGGTTCTCACAAATGGCATTGAAGTTAAGCTACAGAGAAATGCCCTCAGTGTAATCGAAGCCCCAACCGGCAACGAGGACGACGATGATCTCGAATTCGAAAACTTGCAATGGAATGGATCGGATATGGCATCTGATGATACTTTGAAGTCCCATAGACTGAGATATAGGACACACAAATCGTCGGGTTCATCGCACAAAACTATCAGCCGATCCTTCTCGTATGAATCGCAGTCCAAGGGGTCTATTTCGACCCCTCGTGGGTCCATGAAGGTCGACCTCGGTAAACTGGAAATGTCTGCTCTATGGAGATATTGGCGACACTTCAATCTTGTCGATGCTTTTCCTAACCCATCGAAAGAGCAGTTGGTGGATGTCGTTCAAAGGCATTTCATGTCACAGCAACTAGACGAGTTGCAGGTGATTGTCGGGTTCGTCCATGCTGCAAAGAGACTCAAAACCGTTTGCAAATGA

Protein sequence

MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRLRYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDAFPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
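
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch using the standard genetic code; the function name `translate` is hypothetical, and only the first and last few codons of the CDS listed above are used as input.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons of the CDS above.
print(translate("ATGCTTGAAGCAGTGGAAAGC"))
# Last seven codons of the CDS above, including the TGA stop.
print(translate("CTCAAAACCGTTTGCAAATGA"))
```

The two fragments translate to the start (MLEAVES) and end (LKTVCK) of the protein sequence shown above.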
Homology
BLAST of Cp4.1LG02g07350 vs. NCBI nr
Match: XP_022939981.1 (uncharacterized protein LOC111445752 isoform X1 [Cucurbita moschata] >XP_023523278.1 uncharacterized protein LOC111787523 isoform X2 [Cucurbita pepo subsp. pepo] >KAG6608519.1 hypothetical protein SDJN03_01861, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 436 bits (1121), Expect = 4.98e-154
Identity = 223/223 (100.00%), Positives = 223/223 (100.00%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223

BLAST of Cp4.1LG02g07350 vs. NCBI nr
Match: XP_022982128.1 (uncharacterized protein LOC111481057 isoform X2 [Cucurbita maxima])

HSP 1 Score: 435 bits (1119), Expect = 1.01e-153
Identity = 222/223 (99.55%), Positives = 223/223 (100.00%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFEN+QWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENMQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223

BLAST of Cp4.1LG02g07350 vs. NCBI nr
Match: XP_022939982.1 (uncharacterized protein LOC111445752 isoform X2 [Cucurbita moschata] >XP_023523277.1 uncharacterized protein LOC111787523 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 429 bits (1104), Expect = 1.88e-151
Identity = 222/223 (99.55%), Positives = 222/223 (99.55%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM VDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM-VDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 222

BLAST of Cp4.1LG02g07350 vs. NCBI nr
Match: XP_022982127.1 (uncharacterized protein LOC111481057 isoform X1 [Cucurbita maxima])

HSP 1 Score: 429 bits (1102), Expect = 3.79e-151
Identity = 221/223 (99.10%), Positives = 222/223 (99.55%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFEN+QWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENMQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM VDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM-VDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 222

BLAST of Cp4.1LG02g07350 vs. NCBI nr
Match: XP_022144205.1 (uncharacterized protein LOC111013658 isoform X1 [Momordica charantia])

HSP 1 Score: 428 bits (1101), Expect = 5.58e-151
Identity = 219/223 (98.21%), Positives = 222/223 (99.55%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTG+EDDDDLEFENLQWNGSDMASDDTLKSHR 
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGHEDDDDLEFENLQWNGSDMASDDTLKSHRP 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           R+RTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RHRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVD+VQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDLVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223

BLAST of Cp4.1LG02g07350 vs. ExPASy TrEMBL
Match: A0A6J1FHA1 (uncharacterized protein LOC111445752 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445752 PE=3 SV=1)

HSP 1 Score: 436 bits (1121), Expect = 2.41e-154
Identity = 223/223 (100.00%), Positives = 223/223 (100.00%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223

BLAST of Cp4.1LG02g07350 vs. ExPASy TrEMBL
Match: A0A6J1J423 (uncharacterized protein LOC111481057 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111481057 PE=3 SV=1)

HSP 1 Score: 435 bits (1119), Expect = 4.87e-154
Identity = 222/223 (99.55%), Positives = 223/223 (100.00%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFEN+QWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENMQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223

BLAST of Cp4.1LG02g07350 vs. ExPASy TrEMBL
Match: A0A6J1FID1 (uncharacterized protein LOC111445752 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445752 PE=3 SV=1)

HSP 1 Score: 429 bits (1104), Expect = 9.09e-152
Identity = 222/223 (99.55%), Positives = 222/223 (99.55%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM VDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM-VDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 222

BLAST of Cp4.1LG02g07350 vs. ExPASy TrEMBL
Match: A0A6J1IVT7 (uncharacterized protein LOC111481057 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481057 PE=3 SV=1)

HSP 1 Score: 429 bits (1102), Expect = 1.83e-151
Identity = 221/223 (99.10%), Positives = 222/223 (99.55%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFEN+QWNGSDMASDDTLKSHRL
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENMQWNGSDMASDDTLKSHRL 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM VDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM-VDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 222

BLAST of Cp4.1LG02g07350 vs. ExPASy TrEMBL
Match: A0A6J1CR00 (uncharacterized protein LOC111013658 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111013658 PE=3 SV=1)

HSP 1 Score: 428 bits (1101), Expect = 2.70e-151
Identity = 219/223 (98.21%), Positives = 222/223 (99.55%), Query Frame = 0

Query: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
           MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1   MLEAVESSVNGGFSQLQSSGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60

Query: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLKSHRL 120
           LGGWHWLVLTNGIEVKLQRNALSVIEAPTG+EDDDDLEFENLQWNGSDMASDDTLKSHR 
Sbjct: 61  LGGWHWLVLTNGIEVKLQRNALSVIEAPTGHEDDDDLEFENLQWNGSDMASDDTLKSHRP 120

Query: 121 RYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180
           R+RTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA
Sbjct: 121 RHRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDA 180

Query: 181 FPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223
           FPNPSKEQLVD+VQRHFMSQQLDELQVIVGFVHAAKRLKTVCK
Sbjct: 181 FPNPSKEQLVDLVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 223

BLAST of Cp4.1LG02g07350 vs. TAIR 10
Match: AT1G19330.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 335.9 bits (860), Expect = 2.5e-92
Identity = 180/227 (79.30%), Positives = 196/227 (86.34%), Query Frame = 0

Query: 1   MLEAVESS--VNGGFSQLQS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60
           MLEAV+SS  VNGGF Q+QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK
Sbjct: 1   MLEAVDSSGVVNGGFPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60

Query: 61  KAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDMASDDTLK 120
           KAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+DDDL+FEN Q NGSDM S+DTLK
Sbjct: 61  KAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMTSEDTLK 120

Query: 121 SHRLRYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFN 180
            H+ + R  +SS SSHKT+SRS S +SQSK S  TP  +MKVDL KLEM AL  YWRHFN
Sbjct: 121 PHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLNYWRHFN 180

Query: 181 LVDAFPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 224
           LVDA PNPSKEQL+D+VQRHFMSQQ+DELQVIVGFV AAKR+K  CK
Sbjct: 181 LVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKACK 227

BLAST of Cp4.1LG02g07350 vs. TAIR 10
Match: AT1G19330.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 331.3 bits (848), Expect = 6.2e-91
Identity = 181/232 (78.02%), Positives = 197/232 (84.91%), Query Frame = 0

Query: 1   MLEAVESS--VNGGFSQLQS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60
           MLEAV+SS  VNGGF Q+QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK
Sbjct: 1   MLEAVDSSGVVNGGFPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60

Query: 61  KAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDM-----AS 120
           KAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+DDDL+FEN Q NGSDM     AS
Sbjct: 61  KAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPAS 120

Query: 121 DDTLKSHRLRYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRY 180
           +DTLK H+ + R  +SS SSHKT+SRS S +SQSK S  TP  +MKVDL KLEM AL  Y
Sbjct: 121 EDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLNY 180

Query: 181 WRHFNLVDAFPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 224
           WRHFNLVDA PNPSKEQL+D+VQRHFMSQQ+DELQVIVGFV AAKR+K  CK
Sbjct: 181 WRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKACK 232

BLAST of Cp4.1LG02g07350 vs. TAIR 10
Match: AT1G19330.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75060.1). )

HSP 1 Score: 326.6 bits (836), Expect = 1.5e-89
Identity = 181/233 (77.68%), Positives = 197/233 (84.55%), Query Frame = 0

Query: 1   MLEAVESS--VNGGFSQLQS-SGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60
           MLEAV+SS  VNGGF Q+QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK
Sbjct: 1   MLEAVDSSGVVNGGFPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVK 60

Query: 61  KAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEDDDDLEFENLQWNGSDM-----AS 120
           KAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNE+DDDL+FEN Q NGSDM     AS
Sbjct: 61  KAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPAS 120

Query: 121 DDTLKSHRLRYRTHKSSGSSHKTISRSFSYESQSKGSISTPRGSM-KVDLGKLEMSALWR 180
           +DTLK H+ + R  +SS SSHKT+SRS S +SQSK S  TP  +M KVDL KLEM AL  
Sbjct: 121 EDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMQKVDLSKLEMPALLN 180

Query: 181 YWRHFNLVDAFPNPSKEQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 224
           YWRHFNLVDA PNPSKEQL+D+VQRHFMSQQ+DELQVIVGFV AAKR+K  CK
Sbjct: 181 YWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKACK 233

BLAST of Cp4.1LG02g07350 vs. TAIR 10
Match: AT1G75060.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G19330.2); Has 104 Blast hits to 104 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 296.6 bits (758), Expect = 1.7e-80
Identity = 160/217 (73.73%), Positives = 182/217 (83.87%), Query Frame = 0

Query: 11  GGFSQLQSS-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV 70
           GGFSQLQS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV
Sbjct: 17  GGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV 76

Query: 71  LTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE-NLQWN-GSDMASDDTLKSHRLRYRTHK 130
           LTNGIEVKLQRNALSV+E PTGNE+D+DLE + + QWN  SDM ++DTLK H+ + R H+
Sbjct: 77  LTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMTTEDTLKPHKSKKRGHR 136

Query: 131 SSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDAFPNPSK 190
           SS  S K + R  S +S SK S  TPR +MKVDL KL+M+AL RYWRHFNLVDA PNP+K
Sbjct: 137 SSRLSQKALYREVSCDSHSKISSITPRLNMKVDLTKLDMAALLRYWRHFNLVDALPNPTK 196

Query: 191 EQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 224
           EQL+D++QRHFMSQQ+DELQVIVGFV AA  +K  C+
Sbjct: 197 EQLIDIIQRHFMSQQMDELQVIVGFVQAATGMKKACQ 233

BLAST of Cp4.1LG02g07350 vs. TAIR 10
Match: AT1G75060.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G19330.2); Has 104 Blast hits to 104 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 290.0 bits (741), Expect = 1.6e-78
Identity = 159/217 (73.27%), Positives = 181/217 (83.41%), Query Frame = 0

Query: 11  GGFSQLQSS-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV 70
           GGFSQLQS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV
Sbjct: 17  GGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV 76

Query: 71  LTNGIEVKLQRNALSVIEAPTGNEDDDDLEFE-NLQWN-GSDMASDDTLKSHRLRYRTHK 130
           LTNGIEVKLQRNALSV+E PTGNE+D+DLE + + QWN  SDM ++DTLK H+ + R H+
Sbjct: 77  LTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMTTEDTLKPHKSKKRGHR 136

Query: 131 SSGSSHKTISRSFSYESQSKGSISTPRGSMKVDLGKLEMSALWRYWRHFNLVDAFPNPSK 190
           SS  S K + R  S +S SK S  TPR +M VDL KL+M+AL RYWRHFNLVDA PNP+K
Sbjct: 137 SSRLSQKALYREVSCDSHSKISSITPRLNM-VDLTKLDMAALLRYWRHFNLVDALPNPTK 196

Query: 191 EQLVDVVQRHFMSQQLDELQVIVGFVHAAKRLKTVCK 224
           EQL+D++QRHFMSQQ+DELQVIVGFV AA  +K  C+
Sbjct: 197 EQLIDIIQRHFMSQQMDELQVIVGFVQAATGMKKACQ 232
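
The identity percentage reported for each HSP is the number of identical positions divided by the alignment length. The sketch below recomputes it from an HSP statistics line; the regular expression and the function name `check_identity` are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches the identity part of an HSP statistics line, e.g. "Identity = 222/223 (99.55%)".
HSP_IDENTITY = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")

def check_identity(line):
    """Return (identical, aligned_length, reported_percent, recomputed_percent)."""
    m = HSP_IDENTITY.search(line)
    if m is None:
        raise ValueError(f"no identity field in: {line!r}")
    identical, length = int(m.group(1)), int(m.group(2))
    reported = float(m.group(3))
    recomputed = round(100.0 * identical / length, 2)
    return identical, length, reported, recomputed

for stats in ("Identity = 222/223 (99.55%)", "Identity = 180/227 (79.30%)"):
    identical, length, reported, recomputed = check_identity(stats)
    print(identical, length, reported, recomputed)
```

Note that the alignment length (227 for the AT1G19330.2 hit) can exceed the 223-residue query because gaps are counted as alignment columns.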

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022939981.1 | 4.98e-154 | 100.00 | uncharacterized protein LOC111445752 isoform X1 [Cucurbita moschata] >XP_0235232...
XP_022982128.1 | 1.01e-153 | 99.55 | uncharacterized protein LOC111481057 isoform X2 [Cucurbita maxima]
XP_022939982.1 | 1.88e-151 | 99.55 | uncharacterized protein LOC111445752 isoform X2 [Cucurbita moschata] >XP_0235232...
XP_022982127.1 | 3.79e-151 | 99.10 | uncharacterized protein LOC111481057 isoform X1 [Cucurbita maxima]
XP_022144205.1 | 5.58e-151 | 98.21 | uncharacterized protein LOC111013658 isoform X1 [Momordica charantia]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1FHA1 | 2.41e-154 | 100.00 | uncharacterized protein LOC111445752 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J423 | 4.87e-154 | 99.55 | uncharacterized protein LOC111481057 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FID1 | 9.09e-152 | 99.55 | uncharacterized protein LOC111445752 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1IVT7 | 1.83e-151 | 99.10 | uncharacterized protein LOC111481057 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1CR00 | 2.70e-151 | 98.21 | uncharacterized protein LOC111013658 isoform X1 OS=Momordica charantia OX=3673 G...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G19330.2 | 2.5e-92 | 79.30 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G19330.1 | 6.2e-91 | 78.02 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G19330.3 | 1.5e-89 | 77.68 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G75060.1 | 1.7e-80 | 73.73 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G75060.2 | 1.6e-78 | 73.27 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR038291 | SAP30, C-terminal domain superfamily | GENE3D | 6.10.160.20 | - | coord: 139..223; e-value: 5.9E-27; score: 95.4
IPR025718 | Histone deacetylase complex subunit SAP30, Sin3 binding domain | PFAM | PF13867 | SAP30_Sin3_bdg | coord: 163..216; e-value: 2.2E-22; score: 79.1
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..28
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 8..25
None | No IPR available | PANTHER | PTHR13286:SF10 | HISTONE DEACETYLASE COMPLEX SUBUNIT SAP30 SIN3-BINDING PROTEIN | coord: 1..223
IPR024145 | Histone deacetylase complex subunit SAP30/SAP30-like | PANTHER | PTHR13286 | SAP30 | coord: 1..223

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG02g07350.1 | Cp4.1LG02g07350.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0000118 | histone deacetylase complex
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003712 | transcription coregulator activity