Cp4.1LG20g07310.1 (mRNA) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g07310.1
Type: mRNA
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Unknown protein
Location: Cp4.1LG20: 6241704 .. 6247714 (+)
Sequence length: 588
RNA-Seq Expression: Cp4.1LG20g07310.1
Synteny: Cp4.1LG20g07310.1
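
The Location field packs the linkage group, start, end, and strand into one string. A minimal Python sketch for splitting it and computing the genomic span follows. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Cp4.1LG20: 6241704 .. 6247714 (+)' into parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG20: 6241704 .. 6247714 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 6011 positions on the plus strand, far more than the 588 listed as the sequence length, which is consistent with a transcript whose introns have been removed.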
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTAAGGTATGTTCATGAAATGATACGACTATCGACTGTGAAATGCATAAGGACAATAAAGATATGACAAGGTTAGGAAACGATATGACTAAAATATGCATATGGAAATGATACAACTATGAAGTGTTTAAGAAAGAATATAATTATGATATAACATATGAGCGATGAATATATATATGTTGTAATATAATATGAGAATGATATGATATATGTTATGGTACGATATATGTGATAAATTTTTATAATATGATATGATATAATATATTGTGATACGATATGCTATGGTGTGATATGTGTTTGAAAATATGGAAACGTTATGATATGACATATGTATGTAATATGAATGATATGATTGACATGCAAAATGACGAACAAACATGAAATACAATCCCGATGTATGCATGAAGGATATGATAAATGATATGACATGAAGGAAGACGCTAACCGTGTTGTATTAAATGATAATGAAAATGGAAAAATGTTGTGACCTCATGCATTATTGTGTGCTCATGCATCGGGGTACCCCTTATCCTTCACGATAGTACGGACACATACGCTACGAGGCAGGGGAACGATCATTATGTTTATAATAATGCTATCGTGATGGTCGCTATTCCTAACGGGTGTCCAGCATCAATAGCTACCAAAGAATGCTCGTCCGACCAGAAAGGGGTCCAAAAGGTGTGCAAACTGATTGATGGGTCCACACTCACACATGTGAACCTTGTGTAGAGAAATTAAGTACGCATCCAATTGTCTGTTACGATAGTCACCGTAGTAAAATAGACTGATATGATAGGGTTTCATTCATTAAATTATATGTGTTTGCATTTAACCACAAGAGTGGGTCACTTACTGACTATTTCTTAAAATACTCAAGTCACGTGCTACCTCTATTTCAGGAAAAGGCAAGACATCTTTGTACGGCTGACGACGGTATCGCAACTCGAGACCATGATACATGTTTTATAAAGCATGAACGAGTGAATGCATTGAAAACAATTAGAAAGAAAAATAATATTATAGGCTAAAAACGAGCAAGCAACAACCACAACTTTGATAGCATATGACTCGGGTATGAAGAATTGACGACTAGTCACGTCCCCACAATAATACATTTTCGAGCATTTTAGCTTTGACGGTGTAAACATGATTTTGGTAAGAGCTATACACCCCGTATTGATATGACATGGATAGTAAAAACCTAACTAAACATGAAAGCAGGTAAAAGAGGATGGATCGAGCAAACGACCATGAATAACACGTCCTAGACAAGTATGATCGAGACGTTCGACACGTGGACCACAATGCCTTAAACAAACATGACTTTAAGGATGAACACCCTCTCACCTTTTGAACTTTAGATGTAATATATATAAACGATTCTCAAATTTTAATCATCAATTCGAGATGGAATGGTTGTTTTGAGTGGTTGTTCGATCGGATTCCAACTCCAGTCCTTGAAATTGAATGCTGTTCTTATTAAGGCGATCAGATTATATCAACCAAATTATCAACGAAGAGATGAAGTTCAGCTAACCCATTACAGAAATGAATCGTTGAAAAGAAGCAAAGAATCATGGAGAACGAATCAGAATTTTGCACAGTGCGAAGGCAGACCAGTCTCATGGTTCACCGCTACAATCTCAGAAACTTCACATTTCAGCACATCTTGCCCTAGCTTGGCTCCGGCGGACTTGAGTCCCTTCAGCGACACCGGTTGATTGAAGAGGTTCAGCGATGGAAGTTTCGAAGCCGCCGGTAATTTCCCATTCGGTAAGAAATTTGCGAAGTTTTCTTTCAATAATGGAACCTCGAAATCTTCCTTCTTGTGCTTCACGATATTGTCCTTTGCACTTATGTGAGCTCCTGGTTCCATGTGGTTCGTTGAAAAACTGGCTTGATTTGGGAAATTAGGGTATAGACTTACGTAGCCTCTTAAGTACATCATGTCGATAAGGAACTTCTTCCATGATGCTTGCCAACCGTTTGTTCTA
GATTTAGGGATCTGAACTGGATTCTCTTTCGCATTTTCCGTGAATCTTGTGTTCATGTAAACATAAAACTCCCTCCATTGTTTGGGGAAGAAAACTGCTCCCCAACTGCAGGGGAGCTGGTGGAGGTAAGGTGTGTTTGGATGAATCCGCTTGAAGAACTCTGTTGCATTCCATTTAGGTCTTTCTTTAACCACTTCGACTAGCCGAGGCGTGTAGAGGGAGATCGATGATAGCTCGGGTAGAGATATTTGTGGATCGTAGTGGTATGCTAGGAGGGCGTATTTGATCCAAAGGTAGTAGTATGGAGAGACTTCAATATCGTCTTCGAGTAGGAGTCCATAATCGTCGTCTGAAGCAGGATACCAACTCTCGCTTACAGCTCGTATCAGCCCTCCTTGGATGATTCTTCTTCTGAGGCTTTTGGGGCCATGAGGCCACTCAAAGGAGCTTACTAATTTTATAGTTTCCTCGTCGACTTTACTGTCCATGTTGAAGCTGATAGGTATCTCATCCCCTAGGTAATATGCATCTTTTAGCGACTTGAGAAGCCTTGTTAACGAGCCTGCACGGTTTTGTGTGATAATGTTGATTGAAATCCGCATCTTGTTCCAATCTGAAAGGAAATAAAAAAGGATGAGAGCTCTGCTATTGCATGGCAAGAGTTTATATCAGGAGAAACTGAGAGAAAAGAGGAAGTATCTTAGCGTTCTTACTTGGAAGTGCTGTTGATCGAAGATCGGCCATCCAAAGAACTTTAGAAATCGAAGGCCTGGGTAAAAGAATCACTGTTGTACTACCATTCAAGTTAGCCTCTGAAGCCATTTTTAAAGCCTTCTTCACATTAGGATCAACATCGGCCACAGTGATGACGACGCTGGGATTGTGTATTTTGATCAATCCCTTCATACTGGCATACACTGCTTGCACCACAGGGACCTCAGAATTTGATATTCCAGAGAGAGCCCCAATGGCCAAGTCGAATATCTTGAACCTCCGTTCTTTACAGACCGACTTCGGCCATTTAAGAGCAGCTGCAGCATCTTCACACGGGCAAAAGTTACCGCCAGACACTGCAATATAAGCCTTCTTTCCAACAGTGGATCTGAACTTTTCTAGAAGTGGTGCTAGTGCTTTAGCTTCATCAACAGAATGAGCATAAAATAGAGCATCTATCTTTTGAGGATACATAGCTGCCCATTGTGTGATATAACCAGTAGACATCGCCTTCCACCATTGATCATCCCGAACTTGAACAATGTCCTTAAATATGACAGTGGTCTCGGAAACGTAAGCCAGCCTGTGTTCACTGTCACCCCAAGTTTCCTTGTCTTTTGGGGCTACAGGAAGAACAAATGAGGCAGCATTTCTATACTTTTGAAGCTGATAGCTGCATATTTGGTATTAAAACAATTAGCCACCCAAAACTTAGGAGATGTTGAAATATGAGTAGTGGAAACAGCTAGTAATAAGTTACCTTAGATGGAGATCTTCTCCAGTTGCAAAGGTGAAGGGGGTTTCAATGAAAAGTGTCTTGACAAGCTCAGCGGATAAGAACCAAGAGCTCGAGAGAAAGTCGACCTGCACAATTTTATTGACGGTGATGTCATAAGCAGGATCAGGCAAGTAAAGCCCTGCTTCCTTGGATCGAAACTTTCGATAACTCGGGAATGTGAAATCCTTCTGTCGAAATGGCAAAATCCTACCTATGCTGCCCAAAACTGCGTTCTTGTATTTGTCGGTCCCTGCTACATGAGACAAAATCTGTAGCATTTTACGGCCAGGAATCATGTCATCATCAAGAATATATACTAAATCAGCTTCGGTTTGTAAGGCCATTTGGAACCGTCCGTAGTACTTGAAGTCATAGCTTGAGCTAATAAAGCTAATTCTGGAGTTGTTATAGCTATCTACAATTCTTTTCAAAGAGAGCTCATTTGGGCTCCCAAATGCAAGCACCCAAACATGGTGGAAAGGAAGGCTTTGCTGAAGCAAAGAGTTAA
GTTGTGCACACAGAGTTTTCCTCTTAAAATGGTTCAAAATCACTGTAACTTTTGGCTTGTTTGGTCCTCTCAAATCCCATTTGGACTTCATTGCCATTAGCTGAGAAAGAGTCTCAGTTCCAAAGCTTTTGCTTTGAAAATCTAGAATCTCATTGTAAAGCTCTGTTTTCAACTTTATCATCTGCCCATCCGTGGACTTCTTTTGCTCAAAATCAATCTTCTCATTCTCACAAGCTTCCTCCGGAGTCGGTCTGAATTCTGCTTGCATCATTAGAGAAGTTGGTTCTTCGATTCGACCCACTACGTGGGGCGGTATTACGAAGTGTCTCCATTGTTGAGCGATTCTTGTAGCCCAAGAGAAGTCCGGTTTGGTTCTCAAGTCTATTGTAGGGCTGACATAATACAGTAGAAATGTTGCATATAATGCGAAGGCAAACTGGAGACATGTAAGACCAGCAACAAGCTTTGTTGAAGAATTTCTTTGAGGTCTTAACTTTCCTTTTCCTCCAACATAATCATTGATCATCCCTTCTAAACAATCCCCGTTTCGTGTTGCAGGATTCCGAAAGATACCCATCTTCTTCTCAACGCAACTTCGGAACAATCGCAACCCTGTAGCAAAACCGCATATCAGATACGTCCTTTTCATATGATACTGAAAAGACTGATCATCCAGACCGAGCTGAAAGTAATTTTCTTACCTTAAAACTCACAAATATTGATATGTCGAATTCTGGACAAAAACTATCTTCCACCTGAGAAACCTGTGAACGGATACTGGAATGAGACAGGCTTGTCAAGAAAGTTGAGATATGAATAAGAACAGGAACGGGGAGACTTAAAAAAGGAAGAAGCTTTTGGGTTAGTGTATAAAATAGGAAAGAAAGTGTGTTTGTTCATGAGAAACGCACTCAATGTCACATGGGGTTGACGCTTATAATGATAGGAGTTTTTATTACAAACTGTGTCAAGAAACGTCTTACCCTCGCTGCTTTTTGTCTTGCAATGTTAAACCCCTCAAGGAACCACTTACAAAGCCCCGGATTTGAAGCCTTGTTTTGTGCACTTTCTAGACTTTCTCTCTCCTTCTAGAGTCTACTCTCTCTCTCTTGAAAGTAATTTATAGAAGATTTTTATAAGAGTATAACATTACCACCAAAGGTGTTTCAAACATTTTCCCCTGTTTTTTTAGAGTTTAATTGTAAGTGATGCGAGGGGATTTAAACTTTTGATCTCTTGATCCAGTATACGGGCTTTATGCCAGTTGAACAAAGATTCAAGGACATGTGCCTTATACCACTTGTCACAATCGCATTTTTTTTGGCAACAGACTTTGCGATTGTGCGGTACTCACTCCCAACACCAAGTGAGTCAAGCCAATTTGTATTCGCGTCGCCTACGACCGAGCAACATATGCTCAAGCCTCTAGCCTCATTTGAGAAGAAGGTTTTAGAAAACAGGACAAAGAGATTTTTGAAAGCGAGTTTGAAAGTAAGAATGAGGCAAGATTTAAAAGTAACGTTATATAAGCAAAAGGCCACAAAAGGCTTAAATAATATATAACTTTTTGGGCAGATAAAATTTGACAACTAATTGCATTCCCTTATATAGTACATCTTGAGTCGATTCAGGTTTAGCAGGCCTAAAAGTGATTTCGCCAAGCGGTCATACAAAATCAGGGGGTGTGGCAAGATATTTCGAACTTACAACCTTATAAGTGAGGTAAAAGTTTATTAAATTATACTCGAGTTGGATCTAGAAAGTTCGTAAGAGGTAAATGTACCTTTTAAAGTTGTTAACTATTGATTTATGCTTAAGTGTAGCTCTGAGCTCTGGAATATAATCTTATAGACTTGTAATTATTGAGAATTTTATCTCTATCTCTCTAGAAAAGAATCTTATAGATATCAACCCTCTATTTGTAGGGCTCTATTTTTAGAGGTTTAAGCCTACTAACTTAGCCATTCAAAGCCTCACTACAATATCCTTTGGTTAGAATT
TGAAGGTTTAA

mRNA sequence

ATGACTAAGGCGATCAGATTATATCAACCAAATTATCAACGAAGAGATGAAGTTCAGCTAACCCATTACAGAAATGAATCGTTGAAAAGAAGCAAAGAATCATGGAGAACGAATCAGAATTTTGCACAGTGCGAAGGCAGACCAGTCTCATGGTTCACCGCTACAATCTCAGAAACTTCACATTTCAGCACATCTTGCCCTAGCTTGGCTCCGGCGGACTTGAGTCCCTTCAGCGACACCGGATCAACATCGGCCACAGTGATGACGACGCTGGGATTGTGTATTTTGATCAATCCCTTCATACTGGCATACACTGCTTGCACCACAGGGACCTCAGAATTTGATATTCCAGAGAGAGCCCCAATGGCCAAGTCGAATATCTTGAACCTCCGTTCTTTACAGACCGACTTCGGCCATTTAAGAGCAGCTGCAGCATCTTCACACGGGCAAAAGATTCCGAAAGATACCCATCTTCTTCTCAACGCAACTTCGGAACAATCGCAACCCTGTAGCAAAACCGCATATCAGATACGTCCTTTTCATATGATACTGAAAAGACTGATCATCCAGACCGAGCTGAAAGTTTAA

Coding sequence (CDS)

ATGACTAAGGCGATCAGATTATATCAACCAAATTATCAACGAAGAGATGAAGTTCAGCTAACCCATTACAGAAATGAATCGTTGAAAAGAAGCAAAGAATCATGGAGAACGAATCAGAATTTTGCACAGTGCGAAGGCAGACCAGTCTCATGGTTCACCGCTACAATCTCAGAAACTTCACATTTCAGCACATCTTGCCCTAGCTTGGCTCCGGCGGACTTGAGTCCCTTCAGCGACACCGGATCAACATCGGCCACAGTGATGACGACGCTGGGATTGTGTATTTTGATCAATCCCTTCATACTGGCATACACTGCTTGCACCACAGGGACCTCAGAATTTGATATTCCAGAGAGAGCCCCAATGGCCAAGTCGAATATCTTGAACCTCCGTTCTTTACAGACCGACTTCGGCCATTTAAGAGCAGCTGCAGCATCTTCACACGGGCAAAAGATTCCGAAAGATACCCATCTTCTTCTCAACGCAACTTCGGAACAATCGCAACCCTGTAGCAAAACCGCATATCAGATACGTCCTTTTCATATGATACTGAAAAGACTGATCATCCAGACCGAGCTGAAAGTTTAA

Protein sequence

MTKAIRLYQPNYQRRDEVQLTHYRNESLKRSKESWRTNQNFAQCEGRPVSWFTATISETSHFSTSCPSLAPADLSPFSDTGSTSATVMTTLGLCILINPFILAYTACTTGTSEFDIPERAPMAKSNILNLRSLQTDFGHLRAAAASSHGQKIPKDTHLLLNATSEQSQPCSKTAYQIRPFHMILKRLIIQTELKV
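
The relationship between the CDS and the protein sequence above can be checked with a short translation routine. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), the name `translate` is hypothetical, and it is not the pipeline that produced this annotation.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last five codons of the CDS shown above.
print(translate("ATGACTAAGGCGATCAGA"))
print(translate("GAGCTGAAAGTTTAA"))
```

The two fragments translate to MTKAIR and ELKV, matching the start and end of the protein sequence. The BLAST query coordinates place K at position 194, so the protein has 195 residues, and 195 codons plus one stop codon account for the 588 nucleotides of the CDS.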
Homology
BLAST of Cp4.1LG20g07310.1 vs. NCBI nr
Match: KAG6583576.1 (hypothetical protein SDJN03_19508, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 91.7 bits (226), Expect = 1.50e-20
Identity = 44/46 (95.65%), Positives = 44/46 (95.65%), Query Frame = 0

Query: 149 GQKIPKDTHLLLNATSEQSQPCSKTAYQIRPFHMILKRLIIQTELK 194
           G  IPKDTHLLLNATSEQSQPCSKTAYQIRPFHMILKRLIIQTELK
Sbjct: 10  GDMIPKDTHLLLNATSEQSQPCSKTAYQIRPFHMILKRLIIQTELK 55
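
The identity figure for this HSP can be reproduced directly from the two aligned strings. The following is a minimal sketch, with `percent_identity` as a hypothetical helper name. It counts identical columns only. The positives count reported by BLAST also depends on a substitution matrix and is not computed here.

```python
def percent_identity(query, subject):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    # A gap aligned to a gap is not counted as an identity.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query), round(100 * matches / len(query), 2)

query = "GQKIPKDTHLLLNATSEQSQPCSKTAYQIRPFHMILKRLIIQTELK"
sbjct = "GDMIPKDTHLLLNATSEQSQPCSKTAYQIRPFHMILKRLIIQTELK"
print(percent_identity(query, sbjct))
```

The two strings differ only at the second and third columns (QK versus DM), which gives 44 identities over 46 aligned positions.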

BLAST of Cp4.1LG20g07310.1 vs. NCBI nr
Match: EYU36173.1 (hypothetical protein MIMGU_mgv1a011636mg [Erythranthe guttata])

HSP 1 Score: 90.1 bits (222), Expect = 6.30e-18
Identity = 47/88 (53.41%), Positives = 57/88 (64.77%), Query Frame = 0

Query: 69  LAPADLSPFSDTGSTSATVMTTLGLCILINPFILAYTACTTGTSEFDIPERAPMAKSNIL 128
           ++ A L  F    S S T  T LGL ILINP +LAYTAC TGTS+FD  +  P+ +S IL
Sbjct: 160 VSEAILKAFFTCPSISETATTMLGLWILINPLMLAYTACKTGTSDFDFSDSKPIPRSKIL 219

Query: 129 NLRSLQTDFGHLRAAAASSHGQKIPKDT 156
           NL SL T  GH +A  ASSHGQ +P +T
Sbjct: 220 NLLSLHTALGHFKALTASSHGQNLPPET 247

BLAST of Cp4.1LG20g07310.1 vs. ExPASy TrEMBL
Match: A0A022R8W5 (Uncharacterized protein OS=Erythranthe guttata OX=4155 GN=MIMGU_mgv1a011636mg PE=4 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 3.05e-18
Identity = 47/88 (53.41%), Positives = 57/88 (64.77%), Query Frame = 0

Query: 69  LAPADLSPFSDTGSTSATVMTTLGLCILINPFILAYTACTTGTSEFDIPERAPMAKSNIL 128
           ++ A L  F    S S T  T LGL ILINP +LAYTAC TGTS+FD  +  P+ +S IL
Sbjct: 160 VSEAILKAFFTCPSISETATTMLGLWILINPLMLAYTACKTGTSDFDFSDSKPIPRSKIL 219

Query: 129 NLRSLQTDFGHLRAAAASSHGQKIPKDT 156
           NL SL T  GH +A  ASSHGQ +P +T
Sbjct: 220 NLLSLHTALGHFKALTASSHGQNLPPET 247

The following BLAST results are available for this feature:

Database        Match Name     E-value    Identity (%)   Description
NCBI nr         KAG6583576.1   1.50e-20   95.65          hypothetical protein SDJN03_19508, partial [Cucurbita argyrosperma subsp. sorori...
NCBI nr         EYU36173.1     6.30e-18   53.41          hypothetical protein MIMGU_mgv1a011636mg [Erythranthe guttata]
ExPASy TrEMBL   A0A022R8W5     3.05e-18   53.41          Uncharacterized protein OS=Erythranthe guttata OX=4155 GN=MIMGU_mgv1a011636mg PE...

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25

IPR Term   IPR Description    Source   Source Term   Source Description   Alignment
None       No IPR available   COILS    Coil          Coil                 coord: 190..195

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name      Unique Name       Type
Cp4.1LG20g07310   Cp4.1LG20g07310   gene


The following exon feature(s) are a part of this mRNA:

Feature Name                 Unique Name                  Type
Cp4.1LG20g07310.1:exon:005   Cp4.1LG20g07310.1:exon:005   exon
Cp4.1LG20g07310.1:exon:004   Cp4.1LG20g07310.1:exon:004   exon
Cp4.1LG20g07310.1:exon:003   Cp4.1LG20g07310.1:exon:003   exon
Cp4.1LG20g07310.1:exon:002   Cp4.1LG20g07310.1:exon:002   exon
Cp4.1LG20g07310.1:exon:001   Cp4.1LG20g07310.1:exon:001   exon


The following CDS feature(s) are a part of this mRNA:

Feature Name                Unique Name                 Type
Cp4.1LG20g07310.1:cds:005   Cp4.1LG20g07310.1:cds:005   CDS
Cp4.1LG20g07310.1:cds:004   Cp4.1LG20g07310.1:cds:004   CDS
Cp4.1LG20g07310.1:cds:003   Cp4.1LG20g07310.1:cds:003   CDS
Cp4.1LG20g07310.1:cds:002   Cp4.1LG20g07310.1:cds:002   CDS
Cp4.1LG20g07310.1:cds:001   Cp4.1LG20g07310.1:cds:001   CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name        Unique Name                 Type
Cp4.1LG20g07310.1   Cp4.1LG20g07310.1-protein   polypeptide