CmoCh14G004740 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh14G004740
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: protein CHUP1, chloroplastic-like
Location: Cmo_Chr14: 2333159 .. 2337093 (+)
RNA-Seq Expression: CmoCh14G004740
Synteny: CmoCh14G004740
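The Location entry above gives the chromosome, the start and end coordinates, and the strand. As a minimal sketch, the span of the feature can be derived from it as follows. The helper name `parse_location` is hypothetical, and the code assumes that both coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'Chrom: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr14: 2333159 .. 2337093 (+)")
# Assumption: both coordinates are 1-based and inclusive.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 3,935 bp on the forward strand of Cmo_Chr14.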
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACCGTATTCCAATATTGGAAGCCATAACCCGGCGCCGCCCCTTTGACTTTTACAGATCCTACAATTTTGGCCGTTTCAACACCTGTAAACGCTTGTAACCTCCCTACATTGATGGTTATGGGGCAATTGATGATCTGCAGGTTTCTCAAAGTTTCGATCGATCCGTCAAGCCATTCAGGTAATTCATCCACTCCTGGAAGCTCTCTGATTGTGAATCTCTGAAGCCTAAATTCGGCGTTCTCTAAGGTGAGTGTTAGTTGCTTGCAGCCCCAGATAACCAAATTCTGTAGTGATTTTAGATGTATCATCTCATTTGGCAACGATTTCAAAGTATTGCAATTGTAAATCATTAGGGTTTCAAGGTCTGCAAGGCACTCTGGCCGTTCGAATAGAATTTTTAGGTTTTTGCACCCTCCAATGGCGAGAAATTGCAGAGAAGTCATGGTTCCAACTCCGTTTTTGTGCAAATAAAGCTCTTTTGTTGTTATCCACAAGTATTTCAGGCTGATCAAATTCCTTATATCAGTTGGCAATTCTTTAAGTTCGGAACAAGATTCAAGAATCAAGGTTTGCAAACTTTGTAGATTGCAGATTGAATTCGGCAGCCGCTTGATGTTCTTATTTCCATGCAAGTCGAGGTATCTCAGGCTTTTAAGTGTTCCTATAGCATTTGGAATTTCCTCTATGTTTGCATTGCCTAAATGTAACACTCTTAAGCTGTTGTACTTCAGAATGCACATTCCAGGTAGGGATGCATTAGGTGGCACCATATCTCTTACATGGAAGGCTAGGGATTTTATTTCAGTGACTGGTTTGGTACCTACTAAGCCCTCAGACTGTTCTTGTGCGAGTACGTGTGCAAGTTTTTGGATAAGAGGGTGCAGTTTAAACCAGTAGCCGAGGCCATGTTCTTCAATTTCTTGAACGAAACATCTTGACCATAGCTCCATGAAATACTTCTCACCTATATTTTTCATAGATGCGTGATTCTCTTCAGGTGAATGGAGGAGCCCATGTGCCATCCATAACTGAATCACGTCATTTGAAGATAATATGCGGTCTGGTAATTGAGCACAATGAAGAAAACAGGGCTTCAGGTGTGAGGGCATTTGATTATAGCATAATCTGAGTATGTGTAAAATACCGTCATTCTCTTCCTTTTGTGCCCACAACTTGTCCTTGATATTCTTCCACTCAGCTATTCTAGTTTTTGAAGATAGCAAGCTTCCCATACACTTTATTGCCAATGGAACTCCCTTGCATTTCTGCAGAATTTGGTCTTCTATTCCAGTGAATTCTTCATTGTTTACTTCTTGTCTATCTCTGAAGGCATATCTTTTGAATAACTTAGATGATTTATCCTCTGACAGCCTCTCCAGTTTGAAGACTGTTGAAACTTTCTGATTCTTATCGTCCGAATTCTGCGTCGATTGTTTCTCCGACTGTTTCTCAGACGCATAGACTATTCCTAGTCCTAGAGCACCTGCAGTCTCCTCACTTTGAGTGGTCACTAAGATTTTGCTGTTCTTGTTTCCCATGCCCAATAGTTCCATCAATGTGGATGAATCGTCTAAGTTTGTGGTCGAAAGGTCTTGAATGACGAGCAAAAAAGATGTACCCTTCAGAAGATGTTGAACCTTTAAATGCAATTGTTTCTTAGTCAAGTCATTACAAATTTCTTTACAGGCTACAACACTATATACCTCTTCGATCAATCTCTGTGTATCAAAACCTTCTTTCACACAAACCCAGAATCTTGAAAGATACTGTTCAACTACTTCTTTATCATTGTAAAGGAATTTGGCAAGTGTGCTCTTACCTATACCTGCTTCCCCAACTATGAGGAAGAACCCATGAATAGATTTGAAAGTTGTCAGAAGTTCAGCATACCGATCGCCATCATAAGTCATTGGAGCATCAGTTAAGGTGTACTGCAATTGCCAACTTGGTTTCATATGTCTTACTGAAACTTCAGTAGTGCTAAGGAAGGAGTGTGCTG
TCTTCCTTCTTCCTATGGAATCTGGGTGCTTTTCAGTGAGATGAAATTCATACATTCTGGCTGCTAGTGTATCCAAATGATTGCGTAGTCTCATCATTTTACGAAACCGTATAAATTTATGTGCAGAAATTGGGTGAGAGCACGAGAAAGGGGCTTGTACCTTTTTTCCAGTGCCCTCTTGTTCAAAATCCCATTTGAATTCATCCATGGTGTCCTCAATTTGGTAAAAGGTGTTTTGAAGCTCCTTAAGCCACTTACGTAGATCATGATTCTGTTCTTGTTCTTGCTTCTCTTCAGCATCAAAAAGAATTGATTTAAGGCTCAACATGGTATACTCCATTGTTTGGAAGGTTGAGCAATTTTTTAGCCTACTCAAGATAGTTTCAGCGTGGTTGTAGATGAAATTAGCCATATTCTCATGGAAAATGTTTGTTCCTCCTTGCGATTTATTGGGTTGCCCTCTTACTCTTTGAGTCTGCCTGCCTGCCTTGTCTCAGGTGTCTACAGCAGATCTGATTAGTCTCGTGACGAAGATTGGTCTTTTCCGATCTTCCACTAAGAAAATCAGGGTATCGTCACGATAACTCAATTTGCCCGATAAACTACTCAGGACTCATGGTTTAGTAAGAGGGGAGATTTACTTTTGAATCCGGGATGAATTTCTATAAATGGAGATCGTTGATATATGCACCTAATCCGTGGATTCGTTGCATTTTCATGAAGCAAAATAGCTTCAAAACTTAACTCTTAGGGAAATGTATATTCTTCCCGTGCATCCTGAAGAATAGAATAGGAGCACTCAACTCTCGCCAGGTACTTCTCTTTAGGATTATGTATTTAAGGAGTTTTGCAACTGTTTGCTTTTTATTCTAAACAAGTGGAGCCCCCATTTCTTCAAATGAACAGAGAATTCTATTCTTGCTGTGTTCTAACCTATCTGGAATGATCCCCTTCTTTCTTAGATGTAATTCCAAACTCAACTCTTGGGGAAATGTATATTCTTCCCCTTTATATTAAAGAACTGTATAGGAGGAGGGGTCTTGGTATTCTTTAAGCTATTCCTTTACAGGTTCTTACAATTTCCCCTCTTACTTCAGAGTTATTTGGTAGAACTAGACCTTCTTTATTGAAAGGAATTAATATGAGCACCCAAAATCCTATGCTTAGAGTTTGATTATCATGATTTCCAATCATTTTAAGGCTAACCAAATCCTGTTTTTGATTGTTTTGTTTATCTCAATTAGATTTACTCCAAATTAATTGACAAGATTCAAAATTTAGATGAATTTTCTGACTATTTTGTGATATTTGAACGATTGGTTGTGTTTATGTTGTAGCCACAAAGCCATTAATAAATTTCGTGGATAACTTATGAATTGCGAGCTGATATGAACTTATGAGTGATAGAAGCATTATTATATATACTACAGGGTCGTAATTACACTGCATGCACTCGACTCTCATAAGAGTCTTTATTGTGGACACATATACTACACGCCATATACAATAATAATACTATGAACCACACCATCAAAGGATTGATGATGATTATTTATATATGTATTCTGATTCAGGGAAAGGCCTTCCTGTGGTTCAAGCCAATTGCCTGGGGGGACACTCTGCTGGAACCAGACCCGCTACCACTCCCAGACCCCGACCCCGACCCCGACCCAGAAGCTGAGCCCGAACCACTGCTCGATCCACTTCCTGAGCCTCCCCCACCGCCGCCGCCGCCACCACCATTGGGCTGCTGCACAGCCGCAAGGCTGATGCTGCTGCTTATTAAGAAGATGAGAAAGCAGAAGCGGAGGTAAACCATGGATGGAGAGGCCATGAGGGAATAAGGGAATAAGGCTGAAGGGGGAGTTTGTTGTTGATATTGGTATTTATAGAGTATTGGGAGTTTGTGGGTTTGGGGCGTCTGGCATTTGTTGA

mRNA sequence

CACCGTATTCCAATATTGGAAGCCATAACCCGGCGCCGCCCCTTTGACTTTTACAGATCCTACAATTTTGGCCGTTTCAACACCTGTAAACGCTTGTAACCTCCCTACATTGATGGTTATGGGGCAATTGATGATCTGCAGGTTTCTCAAAGTTTCGATCGATCCGTCAAGCCATTCAGGTAGGGATGCATTAGGTGGCACCATATCTCTTACATGGAAGGCTAGGGATTTTATTTCAGTGACTGGTTTGATGCGTGATTCTCTTCAGGTGAATGGAGGAGCCCATGTGCCATCCATAACTGAATCACGTCATTTGAAGATAATATGCGGTCTGGAATTTGGCAAGTGTGCTCTTACCTATACCTGCTTCCCCAACTATGAGGAAGAACCCATGAATAGATTTGAAAGTTGTCAGAAGTTCAGCATACCGATCGCCATCATAACATCAAAAAGAATTGATTTAAGGCTCAACATGGTGTCTACAGCAGATCTGATTAGTCTCGTGACGAAGATTGGTCTTTTCCGATCTTCCACTAAGAAAATCAGGGGAAAGGCCTTCCTGTGGTTCAAGCCAATTGCCTGGGGGGACACTCTGCTGGAACCAGACCCGCTACCACTCCCAGACCCCGACCCCGACCCCGACCCAGAAGCTGAGCCCGAACCACTGCTCGATCCACTTCCTGAGCCTCCCCCACCGCCGCCGCCGCCACCACCATTGGGCTGCTGCACAGCCGCAAGGCTGATGCTGCTGCTTATTAAGAAGATGAGAAAGCAGAAGCGGAGAGTATTGGGAGTTTGTGGGTTTGGGGCGTCTGGCATTTGTTGA

Coding sequence (CDS)

ATGGTTATGGGGCAATTGATGATCTGCAGGTTTCTCAAAGTTTCGATCGATCCGTCAAGCCATTCAGGTAGGGATGCATTAGGTGGCACCATATCTCTTACATGGAAGGCTAGGGATTTTATTTCAGTGACTGGTTTGATGCGTGATTCTCTTCAGGTGAATGGAGGAGCCCATGTGCCATCCATAACTGAATCACGTCATTTGAAGATAATATGCGGTCTGGAATTTGGCAAGTGTGCTCTTACCTATACCTGCTTCCCCAACTATGAGGAAGAACCCATGAATAGATTTGAAAGTTGTCAGAAGTTCAGCATACCGATCGCCATCATAACATCAAAAAGAATTGATTTAAGGCTCAACATGGTGTCTACAGCAGATCTGATTAGTCTCGTGACGAAGATTGGTCTTTTCCGATCTTCCACTAAGAAAATCAGGGGAAAGGCCTTCCTGTGGTTCAAGCCAATTGCCTGGGGGGACACTCTGCTGGAACCAGACCCGCTACCACTCCCAGACCCCGACCCCGACCCCGACCCAGAAGCTGAGCCCGAACCACTGCTCGATCCACTTCCTGAGCCTCCCCCACCGCCGCCGCCGCCACCACCATTGGGCTGCTGCACAGCCGCAAGGCTGATGCTGCTGCTTATTAAGAAGATGAGAAAGCAGAAGCGGAGAGTATTGGGAGTTTGTGGGTTTGGGGCGTCTGGCATTTGTTGA

Protein sequence

MVMGQLMICRFLKVSIDPSSHSGRDALGGTISLTWKARDFISVTGLMRDSLQVNGGAHVPSITESRHLKIICGLEFGKCALTYTCFPNYEEEPMNRFESCQKFSIPIAIITSKRIDLRLNMVSTADLISLVTKIGLFRSSTKKIRGKAFLWFKPIAWGDTLLEPDPLPLPDPDPDPDPEAEPEPLLDPLPEPPPPPPPPPPLGCCTAARLMLLLIKKMRKQKRRVLGVCGFGASGIC
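The protein sequence above corresponds to the CDS read in frame 0. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and not part of the database.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the CDS listed above.
print(translate("ATGGTTATGGGGCAATTG"))
```

The sketch handles only unambiguous A/C/G/T codons; a codon containing N or another ambiguity code raises a KeyError.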
Homology
BLAST of CmoCh14G004740 vs. ExPASy TrEMBL
Match: E5GBX5 (Uncharacterized protein (Fragment) OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 4.7e-12
Identity = 46/54 (85.19%), Positives = 50/54 (92.59%), Query Frame = 0

Query: 171 DPDPDPDPEAEPEPLLDPLPEPPPPPPPPPPLGCCTAARLMLLLIKKMRKQKRR 225
           DPDPDPDPEAEP PLLD LP+PPPPPPPPPPLG CTAARL LL+IKKM+KQK+R
Sbjct: 1   DPDPDPDPEAEPPPLLDALPDPPPPPPPPPPLGRCTAARL-LLIIKKMKKQKQR 53
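The identity and positive counts reported for this HSP can be recomputed from the three alignment rows. This is a sketch under two assumptions: the helper `alignment_stats` is hypothetical, and conservative substitutions are read from the `+` marks on the midline instead of being rescored with a substitution matrix.

```python
def alignment_stats(query, midline, subject):
    """Count identities and positives in one HSP (hypothetical helper).

    Identities come from comparing the aligned rows column by column;
    conservative substitutions are taken from the '+' marks on the midline.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    length = len(query)
    identities = sum(q == s and q != "-" for q, s in zip(query, subject))
    positives = identities + midline.count("+")
    return identities, positives, length

query   = "DPDPDPDPEAEPEPLLDPLPEPPPPPPPPPPLGCCTAARLMLLLIKKMRKQKRR"
midline = "DPDPDPDPEAEP PLLD LP+PPPPPPPPPPLG CTAARL LL+IKKM+KQK+R"
subject = "DPDPDPDPEAEPPPLLDALPDPPPPPPPPPPLGRCTAARL-LLIIKKMKKQKQR"

identities, positives, length = alignment_stats(query, midline, subject)
print(f"Identity = {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"Positives = {positives}/{length} ({100 * positives / length:.2f}%)")
```

The alignment length counts every column, including the single gap in the subject row, which is why the denominator is 54 and not 53.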

BLAST of CmoCh14G004740 vs. ExPASy TrEMBL
Match: A0A6J1D0H8 (protein CHUP1, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111016154 PE=4 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 8.6e-06
Identity = 36/47 (76.60%), Positives = 39/47 (82.98%), Query Frame = 0

Query: 155 IAWGDTLLEPDPLPLPDPDPDPDPEAEPEPLLDPLPEPPPPPPPPPP 202
           +AW D+ LEP P PL  PDPDP+PEAEPEPLLDP P PPPPPPPPPP
Sbjct: 1   MAWRDSPLEPHPPPL--PDPDPEPEAEPEPLLDPPPPPPPPPPPPPP 45

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
E5GBX5        4.7e-12    85.19           Uncharacterized protein (Fragment) OS=Cucumis melo subsp. melo OX=412675 PE=4 SV...
A0A6J1D0H8    8.6e-06    76.60           protein CHUP1, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111016154...
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term    IPR Description     Source         Source Term    Source Description     Alignment
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 164..202
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 185..202
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 170..184
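The predicted disordered regions can be pulled out of the protein sequence by coordinate. A minimal sketch follows, assuming the `coord` values are 1-based and inclusive (the page does not state the convention); the helper `region` is hypothetical.

```python
# Protein sequence of CmoCh14G004740 as listed on this page.
PROTEIN = (
    "MVMGQLMICRFLKVSIDPSSHSGRDALGGTISLTWKARDFISVTGLMRDSLQVNGGAHVP"
    "SITESRHLKIICGLEFGKCALTYTCFPNYEEEPMNRFESCQKFSIPIAIITSKRIDLRLN"
    "MVSTADLISLVTKIGLFRSSTKKIRGKAFLWFKPIAWGDTLLEPDPLPLPDPDPDPDPEA"
    "EPEPLLDPLPEPPPPPPPPPPLGCCTAARLMLLLIKKMRKQKRRVLGVCGFGASGIC"
)

def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]

# Coordinates taken from the mobidb-lite rows above.
for start, end in [(164, 202), (185, 202), (170, 184)]:
    print(f"{start}..{end}\t{region(PROTEIN, start, end)}")
```

Under that assumption, 170..184 covers the aspartate- and glutamate-rich stretch and 185..202 covers the proline run, both nested inside the 164..202 region.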

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh14G004740.1    CmoCh14G004740.1    mRNA