Cp4.1LG16g08330 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG16g08330
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Unknown protein
Location: Cp4.1LG16: 7913900 .. 7916203 (-)
RNA-Seq Expression: Cp4.1LG16g08330
Synteny: Cp4.1LG16g08330
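The Location field packs the linkage group, the coordinates, and the strand into one string. The sketch below shows one way to split it and compute the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into a dict.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    on the page), so length = end - start + 1.
    """
    m = re.match(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG16: 7913900 .. 7916203 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2304 bp on the minus strand of Cp4.1LG16.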
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATAGGTGAGATACAACTATTTTTTTAATTTAGAAAAAGAAAAAGGAATAAAATGATATAAAGAAGAAGGGCGAAATTGTCTTTTCACTTGCGCTTGTCTTCCAATTCGACAGATAAAAATGCGCACCGTAGAATTGGCACCATGCGACGTCCATTCTCGTCACTTTGCACACCGCCAATCGTGCTACGGCTAAGGTGAGTACTGAATCTTCTTCCTTCCTCCTCGTGGTGGTTTTTCGTTGTTTTCTTTTAGGTGTACCACAGTTTCCAGATTCTTTGTTCTCTTTTCTTGCTGTCGAGAATATTCAAATGGCGAAGAAAAGAATCCATACGCCCATTTACAACTAACCACCGTCTGTGTTATCTGCAAGAAATATTCAAGTATCGAGTATCATTTACTTTATGCCTTCTCTTTTCTTTCTATTTCCAGTCAGAATCTAGCGCTTTCTTGGAGAATTTTACTCTAATTCAACTGTTTTCTCGTGCTTTGTTTCTTGTAGTTGGAAATCCAGTTTTTGGTTTGATTAATTAGATTTGAAAGGAATTTGATATGCATGGATTCATAATTTTGTGCCAATCATAGTTTATATGTAGCGATGAGATTGTATGGCATTGGATTAGGGCTTTAGGAAGTTCGATTGGTTGCAACATTGGGGAATTCGATTCTTCCTGGTTGGATTTGGGAGATGTTGTCATTTCGTTGGGTTTTCTATGCGAGGCTGTCTGAATTTAAGCTATTTGCATATTTGAAATCAGGGTTAGCCAGAGGGTGTTTGGTTACTCAATGCTCTGAATCTGCTTCTCTTGTGGTTCATATCTGTGAAGTCAGCCCAAGCTTATCGCTAGCAATATTGTCTGCTTTAGCCCTTATGTATCTCCCTTAGCCTCGCGGTTTTAAAACGCATATGTTAGGAAGAGGTTTCTACACCCTTCTAAGGAATGCTTCATTCCCCTCTCTAACCAATATGTGGTCTCACAATCCACCTCCTATGGGGGCCAACGTCCTCGCTATTACACCGCTTGATATCTGGCTCTGATACCATTTGTAAAGATCAAGCCCACCACTAGCAGATATTGTTCGCTTTGGCCCGTTACGTATAGCTGTCAGCCTCACGATTTTAAAATGCGTCTGTTAGGAAGAGGTTTTTACACCCTTATAAAGAATGTTTTGTTCCCCTCTCCGACCGATGTGAGACCTCACCTCTTGAGATCTTCAATGGCTAGTTACATGTTTGTTTGACAATTGTTGAAATGTAGTGTAGGCATCGTCCTTTTCTTGATAATTGGCTGTTTTTGATCGGACCATCTCTTGAATTGAGATGTTCTCGACTCTGCACCGTTCATGAGTTCGTTTCGTTTCTTTACTCCCTTGTTTATGATTGTCTTTGTATGTTATGTTAGCTTGATAGGTAAAATCTTTCTGATGCGGTTGGTTTTTCTGTATAATACTTTTCTCTTACAAAAATCACCGACAACTTCGAATATCAAGTGTTTTAGATCTGAGTTGCCCTGAATCCTTCTTTTTCTTTACACACAACTGTACAAGCCTTTCATGTTTGTTGTTAATGTTCATCTGTGCCAGCGTATTGGTATGTATCTGGAACTTCACACTTAATTGCTTGTTCAGATGCTTACCGTCAAATTTGAAGTTCTTTTGATCTAACCGTCATTAGTGTGCACAGCTTAATAAATTATGAAGAGCCTTTATCATATGAACATGTAAAAACCTTTGTTTTGAGAATTTAATTAGTCTCTTAGCTTCTTGGTTATTTCCTAACTGTCTTTGGTTTATGCCTTGAGATAAGTTTTTGGTTTTCAGCTTTTCTGGTAAGCCTACATGGGAAACAAACCCGCGAAGCAATCAGGAGGAAGGGATGAAATGTTGAAGATAGTACCTCCTTTGGACCAAGCATATATTAGGTGGCTCGCACGGGATCTCGAGAGGATTCATGGCTTCACCCCAAGAAACCCTCGTGCCGTGAAGCCTCCCGATCACT
ACATAGAGTTCATGCGCTTGAACGGATGGTTGGACGTGAGTTTAGACGATCCTGATCTCGCACGGTTACTCAAATAGCCTTCTCTTGCAACTTGAAAGTGTACCAATGTCAAGTTTTGTGTATCTTCTGCCATTTGCCATTTGAGGTAATTCTGTTTATATCAAACAGTTGCCGGTTAGCAATGTCTGATAACGAGGAATTATGGAATAAAAACTGTGTGGTAGCCCAACTTTCACATCAGACTGATGTTTGTAATGCCATACTTATCTCCATGGATGAACTTTCAAGATTTTACTGCACTTGA

mRNA sequence

ATGTATAGGGCGAAATTGTCTTTTCACTTGCGCTTGTCTTCCAATTCGACAGATAAAAATGCGCACCGTAGAATTGGCACCATGCGACGTCCATTCTCGTCACTTTGCACACCGCCAATCGTGCTACGGCTAAGGTGTACCACAGTTTCCAGATTCTTTGTTCTCTTTTCTTGCTGTCGAGAATATTCAAATGGCGAAGAAAAGAATCCATACGCCCATTTACAACTAACCACCGTCTGTGTTATCTGCAAGAAATATTCAACCTACATGGGAAACAAACCCGCGAAGCAATCAGGAGGAAGGGATGAAATGTTGAAGATAGTACCTCCTTTGGACCAAGCATATATTAGGTGGCTCGCACGGGATCTCGAGAGGATTCATGGCTTCACCCCAAGAAACCCTCGTGCCGTGAAGCCTCCCGATCACTACATAGAGTTCATGCGCTTGAACGGATGGTTGGACGTGAGTTTAGACGATCCTGATCTCGCACGTTGCCGGTTAGCAATGTCTGATAACGAGGAATTATGGAATAAAAACTGTGTGGTAGCCCAACTTTCACATCAGACTGATGTTTGTAATGCCATACTTATCTCCATGGATGAACTTTCAAGATTTTACTGCACTTGA

Coding sequence (CDS)

ATGTATAGGGCGAAATTGTCTTTTCACTTGCGCTTGTCTTCCAATTCGACAGATAAAAATGCGCACCGTAGAATTGGCACCATGCGACGTCCATTCTCGTCACTTTGCACACCGCCAATCGTGCTACGGCTAAGGTGTACCACAGTTTCCAGATTCTTTGTTCTCTTTTCTTGCTGTCGAGAATATTCAAATGGCGAAGAAAAGAATCCATACGCCCATTTACAACTAACCACCGTCTGTGTTATCTGCAAGAAATATTCAACCTACATGGGAAACAAACCCGCGAAGCAATCAGGAGGAAGGGATGAAATGTTGAAGATAGTACCTCCTTTGGACCAAGCATATATTAGGTGGCTCGCACGGGATCTCGAGAGGATTCATGGCTTCACCCCAAGAAACCCTCGTGCCGTGAAGCCTCCCGATCACTACATAGAGTTCATGCGCTTGAACGGATGGTTGGACGTGAGTTTAGACGATCCTGATCTCGCACGTTGCCGGTTAGCAATGTCTGATAACGAGGAATTATGGAATAAAAACTGTGTGGTAGCCCAACTTTCACATCAGACTGATGTTTGTAATGCCATACTTATCTCCATGGATGAACTTTCAAGATTTTACTGCACTTGA

Protein sequence

MYRAKLSFHLRLSSNSTDKNAHRRIGTMRRPFSSLCTPPIVLRLRCTTVSRFFVLFSCCREYSNGEEKNPYAHLQLTTVCVICKKYSTYMGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRLNGWLDVSLDDPDLARCRLAMSDNEELWNKNCVVAQLSHQTDVCNAILISMDELSRFYCT
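
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the relationship can be checked on the first and last codons of the CDS. `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last four coding codons (plus stop) of the CDS above.
print(translate("ATGTATAGGGCGAAATTG"))
print(translate("TTTTACTGCACTTGA"))
```

The two fragments translate to the start (`MYRAKL`) and end (`FYCT`) of the listed protein sequence.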
Homology
BLAST of Cp4.1LG16g08330 vs. NCBI nr
Match: KAG6570469.1 (hypothetical protein SDJN03_29384, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 239 bits (610), Expect = 5.26e-78
Identity = 114/119 (95.80%), Positives = 116/119 (97.48%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL
Sbjct: 1   MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 60

Query: 150 NGWLDVSLDDPDLARCRLAMSDNEELWNKNCVVAQLSHQTDVCNAILISMDELSRFYCT 208
           NGWLDVSLDDPDLAR RLAMSDNEELWNK  V+AQLSHQTDVCNA+LISMDELSRFYCT
Sbjct: 61  NGWLDVSLDDPDLARFRLAMSDNEELWNKT-VLAQLSHQTDVCNALLISMDELSRFYCT 118

BLAST of Cp4.1LG16g08330 vs. NCBI nr
Match: XP_022943341.1 (uncharacterized protein LOC111448136 [Cucurbita moschata] >XP_022943342.1 uncharacterized protein LOC111448136 [Cucurbita moschata] >XP_023511908.1 uncharacterized protein LOC111776780 [Cucurbita pepo subsp. pepo] >XP_023511909.1 uncharacterized protein LOC111776780 [Cucurbita pepo subsp. pepo] >KAG7010334.1 hypothetical protein SDJN02_27127, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 163 bits (413), Expect = 1.36e-48
Identity = 75/75 (100.00%), Positives = 75/75 (100.00%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL
Sbjct: 1   MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 60

Query: 150 NGWLDVSLDDPDLAR 164
           NGWLDVSLDDPDLAR
Sbjct: 61  NGWLDVSLDDPDLAR 75

BLAST of Cp4.1LG16g08330 vs. NCBI nr
Match: XP_022986677.1 (uncharacterized protein LOC111484357 [Cucurbita maxima] >XP_022986678.1 uncharacterized protein LOC111484357 [Cucurbita maxima])

HSP 1 Score: 162 bits (410), Expect = 3.87e-48
Identity = 74/75 (98.67%), Positives = 75/75 (100.00%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARD+ERIHGFTPRNPRAVKPPDHYIEFMRL
Sbjct: 1   MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDVERIHGFTPRNPRAVKPPDHYIEFMRL 60

Query: 150 NGWLDVSLDDPDLAR 164
           NGWLDVSLDDPDLAR
Sbjct: 61  NGWLDVSLDDPDLAR 75

BLAST of Cp4.1LG16g08330 vs. NCBI nr
Match: XP_017982294.1 (PREDICTED: uncharacterized protein LOC18589038 [Theobroma cacao])

HSP 1 Score: 140 bits (354), Expect = 1.20e-39
Identity = 62/74 (83.78%), Positives = 67/74 (90.54%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKP KQ    + +LKIVPPLDQAY+RWLARD+ERIHGFTPRNPRAVKPPDHYIE+MRL
Sbjct: 1   MGNKPVKQEQREEILLKIVPPLDQAYVRWLARDIERIHGFTPRNPRAVKPPDHYIEYMRL 60

Query: 150 NGWLDVSLDDPDLA 163
           NGWLDV LDDPDLA
Sbjct: 61  NGWLDVKLDDPDLA 74

BLAST of Cp4.1LG16g08330 vs. NCBI nr
Match: XP_021278844.1 (uncharacterized protein LOC110412594 [Herrania umbratica] >EOY33333.1 Uncharacterized protein TCM_041290 isoform 1 [Theobroma cacao] >EOY33334.1 Uncharacterized protein TCM_041290 isoform 1 [Theobroma cacao])

HSP 1 Score: 140 bits (354), Expect = 1.20e-39
Identity = 62/74 (83.78%), Positives = 67/74 (90.54%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKP KQ    + +LKIVPPLDQAY+RWLARD+ERIHGFTPRNPRAVKPPDHYIE+MRL
Sbjct: 1   MGNKPVKQEQREEILLKIVPPLDQAYVRWLARDIERIHGFTPRNPRAVKPPDHYIEYMRL 60

Query: 150 NGWLDVSLDDPDLA 163
           NGWLDV LDDPDLA
Sbjct: 61  NGWLDVDLDDPDLA 74

BLAST of Cp4.1LG16g08330 vs. ExPASy TrEMBL
Match: A0A6J1FRG2 (uncharacterized protein LOC111448136 OS=Cucurbita moschata OX=3662 GN=LOC111448136 PE=4 SV=1)

HSP 1 Score: 163 bits (413), Expect = 6.57e-49
Identity = 75/75 (100.00%), Positives = 75/75 (100.00%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL
Sbjct: 1   MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 60

Query: 150 NGWLDVSLDDPDLAR 164
           NGWLDVSLDDPDLAR
Sbjct: 61  NGWLDVSLDDPDLAR 75

BLAST of Cp4.1LG16g08330 vs. ExPASy TrEMBL
Match: A0A6J1JH88 (uncharacterized protein LOC111484357 OS=Cucurbita maxima OX=3661 GN=LOC111484357 PE=4 SV=1)

HSP 1 Score: 162 bits (410), Expect = 1.87e-48
Identity = 74/75 (98.67%), Positives = 75/75 (100.00%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARD+ERIHGFTPRNPRAVKPPDHYIEFMRL
Sbjct: 1   MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDVERIHGFTPRNPRAVKPPDHYIEFMRL 60

Query: 150 NGWLDVSLDDPDLAR 164
           NGWLDVSLDDPDLAR
Sbjct: 61  NGWLDVSLDDPDLAR 75

BLAST of Cp4.1LG16g08330 vs. ExPASy TrEMBL
Match: A0A061GZN3 (Uncharacterized protein isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_041290 PE=4 SV=1)

HSP 1 Score: 140 bits (354), Expect = 5.80e-40
Identity = 62/74 (83.78%), Positives = 67/74 (90.54%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKP KQ    + +LKIVPPLDQAY+RWLARD+ERIHGFTPRNPRAVKPPDHYIE+MRL
Sbjct: 1   MGNKPVKQEQREEILLKIVPPLDQAYVRWLARDIERIHGFTPRNPRAVKPPDHYIEYMRL 60

Query: 150 NGWLDVSLDDPDLA 163
           NGWLDV LDDPDLA
Sbjct: 61  NGWLDVDLDDPDLA 74

BLAST of Cp4.1LG16g08330 vs. ExPASy TrEMBL
Match: A0A6J0ZWT3 (uncharacterized protein LOC110412594 OS=Herrania umbratica OX=108875 GN=LOC110412594 PE=4 SV=1)

HSP 1 Score: 140 bits (354), Expect = 5.80e-40
Identity = 62/74 (83.78%), Positives = 67/74 (90.54%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEMLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFMRL 149
           MGNKP KQ    + +LKIVPPLDQAY+RWLARD+ERIHGFTPRNPRAVKPPDHYIE+MRL
Sbjct: 1   MGNKPVKQEQREEILLKIVPPLDQAYVRWLARDIERIHGFTPRNPRAVKPPDHYIEYMRL 60

Query: 150 NGWLDVSLDDPDLA 163
           NGWLDV LDDPDLA
Sbjct: 61  NGWLDVDLDDPDLA 74

BLAST of Cp4.1LG16g08330 vs. ExPASy TrEMBL
Match: A0A6J1DJ99 (uncharacterized protein LOC111020571 OS=Momordica charantia OX=3673 GN=LOC111020571 PE=4 SV=1)

HSP 1 Score: 140 bits (352), Expect = 1.23e-39
Identity = 66/76 (86.84%), Positives = 68/76 (89.47%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDE--MLKIVPPLDQAYIRWLARDLERIHGFTPRNPRAVKPPDHYIEFM 149
           MGNKP KQ   + E   LKIVPPLDQAY+RWLARDLERIHGFTPRNPRAVKPPDHYIEFM
Sbjct: 1   MGNKPVKQQEVQREEIFLKIVPPLDQAYVRWLARDLERIHGFTPRNPRAVKPPDHYIEFM 60

Query: 150 RLNGWLDVSLDDPDLA 163
           RLNGWLDVSLDDPDLA
Sbjct: 61  RLNGWLDVSLDDPDLA 76

BLAST of Cp4.1LG16g08330 vs. TAIR 10
Match: AT5G22210.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 108.6 bits (270), Expect = 6.1e-24
Identity = 51/76 (67.11%), Positives = 60/76 (78.95%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEM-LKIVPPLDQAYIRWLARDLERIHGFTPR-NPRAVKPPDHYIEFM 149
           MGNK       R+E+ LKIVPPLD+ ++RWLARDL+R+HGF P+ N RA+ PPD YIEFM
Sbjct: 1   MGNKATTVKEEREEIHLKIVPPLDKVFLRWLARDLQRVHGFKPKNNTRAITPPDSYIEFM 60

Query: 150 RLNGWLDVSLDDPDLA 164
           RLNG LDV LDDPDLA
Sbjct: 61  RLNGSLDVDLDDPDLA 76

BLAST of Cp4.1LG16g08330 vs. TAIR 10
Match: AT5G22210.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 108.6 bits (270), Expect = 6.1e-24
Identity = 51/76 (67.11%), Positives = 60/76 (78.95%), Query Frame = 0

Query: 90  MGNKPAKQSGGRDEM-LKIVPPLDQAYIRWLARDLERIHGFTPR-NPRAVKPPDHYIEFM 149
           MGNK       R+E+ LKIVPPLD+ ++RWLARDL+R+HGF P+ N RA+ PPD YIEFM
Sbjct: 1   MGNKATTVKEEREEIHLKIVPPLDKVFLRWLARDLQRVHGFKPKNNTRAITPPDSYIEFM 60

Query: 150 RLNGWLDVSLDDPDLA 164
           RLNG LDV LDDPDLA
Sbjct: 61  RLNGSLDVDLDDPDLA 76
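
The percentages in the HSP headers above are matched columns divided by alignment length. The sketch below recomputes them from the raw counts in one header line; `hsp_percentages` is a hypothetical helper name, and the regular expression is an assumption about the header layout as it appears on this page.

```python
import re

def hsp_percentages(header):
    """Return {label: percent} for every 'Label = matched/total' pair in an HSP header."""
    result = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[label] = round(100 * int(matched) / int(total), 2)
    return result

header = "Identity = 114/119 (95.80%), Positives = 116/119 (97.48%), Query Frame = 0"
stats = hsp_percentages(header)
print(stats)
```

For the first NCBI nr hit this reproduces the reported 95.80% identity and 97.48% positives. The `Query Frame = 0` field is skipped because it has no `matched/total` pair.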

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity (%)  Description
KAG6570469.1    5.26e-78   95.80         hypothetical protein SDJN03_29384, partial [Cucurbita argyrosperma subsp. sorori...
XP_022943341.1  1.36e-48   100.00        uncharacterized protein LOC111448136 [Cucurbita moschata] >XP_022943342.1 unchar...
XP_022986677.1  3.87e-48   98.67         uncharacterized protein LOC111484357 [Cucurbita maxima] >XP_022986678.1 uncharac...
XP_017982294.1  1.20e-39   83.78         PREDICTED: uncharacterized protein LOC18589038 [Theobroma cacao]
XP_021278844.1  1.20e-39   83.78         uncharacterized protein LOC110412594 [Herrania umbratica] >EOY33333.1 Uncharacte...

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1FRG2      6.57e-49   100.00        uncharacterized protein LOC111448136 OS=Cucurbita moschata OX=3662 GN=LOC1114481...
A0A6J1JH88      1.87e-48   98.67         uncharacterized protein LOC111484357 OS=Cucurbita maxima OX=3661 GN=LOC111484357...
A0A061GZN3      5.80e-40   83.78         Uncharacterized protein isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_041290 PE=4 ...
A0A6J0ZWT3      5.80e-40   83.78         uncharacterized protein LOC110412594 OS=Herrania umbratica OX=108875 GN=LOC11041...
A0A6J1DJ99      1.23e-39   86.84         uncharacterized protein LOC111020571 OS=Momordica charantia OX=3673 GN=LOC111020...

TAIR 10
Match Name      E-value    Identity (%)  Description
AT5G22210.1     6.1e-24    67.11         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT5G22210.2     6.1e-24    67.11         unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG16g08330.1    Cp4.1LG16g08330.1    mRNA