CmaCh01G008420.1 (mRNA) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh01G008420.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: LysM domain-containing protein
Location: Cma_Chr01: 4630200 .. 4632227 (+)
Sequence length: 1130
RNA-Seq Expression: CmaCh01G008420.1
Synteny: CmaCh01G008420.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGATTCAATTTTCTCCACTTTATGTAATTACACTAAAAGGGAAAAGGAAAACAAAAATGTCATCAGAAAATAGGCAATGGGCAATAGGCAATAGGCATGGCGAAATTTGAAGGGTTTGTGCGATGGAAGTGAAGGTCGGGCAGAGAAATGGAGTAGACCGTTCTCCTCTTCTCCCACAACCAATTCTCCCTTCTCTTCTAACTCCTCACAGGTATTTCGCCATTCCCTTATCTCACAACCCATCATCGACACCCTTTTCTTTCTTTTTCTGAGAAATCTTACACAACTTGCTTCGCTTTTCTCCTTTTCCCTCCAATAGAAATTGGGCCAACGCCAAAACATCCTTGAAGAATCACTTCAGAGCCATTGCGCTGGTATGTTTCTCTCCATTCAATATCGTGCTTTTGGTTCTAATTACTCCATCCCCATGTTAGAATTGTTTCTATTACTTTTGGAATTGAATGCTTTTTCCAACCTTAACACTAACATTGGTTGCTATAGGCTAAATTGTACTGCAAACTAATTACTGAATGTTCAGATCCTCAAAAATTTCAAACCCCATATGAATTTCATCTCAGTTTCCATGAGTTGGGTTCATAATTTCTAAATTCGGTGATGAAAGTATTGGTATTACTTGAAGAATTGACAAACTCGCTTAACTTTTTGTTGATTCATTGCGTTCGTGAGCTACATGTAAGAGGAATTTTTGTATATATGGTTATGTCTTACATAATCACTGCAGTTTAGAGATAGATATCAATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTTGTAAATATGAAATGCATTTCACGCTTGCCTGCCCCCGACTACCTCCTGTAAGTTTCTTTTTCTCACCAGATGCAGAGCTATATCCAATCTACACATATTTATACCATCTTATCTGTTCTCTCTTCGTTTTTCTTTGTCAGGCTAGAACCACTAGTTTGATAGTTTTGGTTCCCCTTATAGTATTTTGTGCCAATTGCATAATTGGTGCCTCTTATGCTAGAGTTTTCGGAACGTCGAGGCTTAAACCCGTTAAGGAACCAGAGGGAGAACGTCACAACTTCAGAAACGGCCATTGGAGATCTGCTCTTCGAGAGATAAGGGAATTGGATGGTTTGGATTCTGAGTCCTCCATATATTCTACTGTAAGTAATGTAACTTTTTGCTCCTACTCTTACTGTAAGGAATGGTTTTACTTGCAATGTGAAACGGGAGGAAAATTGATAACTGATAGAGTTGTTGAATTGTTGCTTTCATTTCTTTGAGTACCCAACTCCCCAAGGCTTTTCCAGCTGATCATTCATTGTTTTCTATTCCTAATGGAGTTCACTGGCCATCCACTTCGCTTTCTTCTTGTTTGCCCATCTTTTTGTTCCAATCAGTGTATAATGCTGCCAAAATGGCAAACTATCTTCAGATAACAACATTGGATCTGCACATGGGTGGGGTGAACAAGAAAAGGAACAATGCAGAAATATGTTACTAAGAGTGCTTAAATGAACTTCATAAATTATAAAAATACATCATATGTCGAAATCATTATAGAAAATATGCTTGATGATTTCAAAACAATACCCTTAAAAATCATTCTATACTCATTGTAGTTTTTCTTTTGCAGAGGCCTTCAGAAGAACAGATCTCAGTTGAAGATTTGTCACATGCTTACAAAAAACTGGATCAGGAGTACGACAAATTTCTATCAGAATGTGGACTGAGTAAATCGGGCTACTGGCGAGGAGGTGCGCAGAGACCTGGACAGGAATAGGCAATCTTTGTAGCACTTTTCCGTGCTCCCTTGCCATCCAAAAGGGGCTAGGCTAGGGAGTTCTCTGTAGCAACCATTTTTCAGGTGTTGTATTGTAGTGGAGTAAAAATTTCAACCCAGTAATGTAATTAAACTTTAAATTGGACAAAATTTATGCATTCTGCATCCAATTTGCATTAACTCTTAACAAAGAACGTGCATGACCGGAGTCTCGCCGGGAT
TCTTAGTCACTTTGAGCTTTGCATCGGC

mRNA sequence

GGATTCAATTTTCTCCACTTTATGTAATTACACTAAAAGGGAAAAGGAAAACAAAAATGTCATCAGAAAATAGGCAATGGGCAATAGGCAATAGGCATGGCGAAATTTGAAGGGTTTGTGCGATGGAAGTGAAGGTCGGGCAGAGAAATGGAGTAGACCGTTCTCCTCTTCTCCCACAACCAATTCTCCCTTCTCTTCTAACTCCTCACAGAAATTGGGCCAACGCCAAAACATCCTTGAAGAATCACTTCAGAGCCATTGCGCTGGCTAAATTGTACTGCAAACTAATTACTGAATGTTCAGATCCTCAAAAATTTCAAACCCCATATGAATTTCATCTCAGTTTCCATGAGTTGGAATTGACAAACTCGCTTAACTTTTTGTTGATTCATTGCGTTCGTGAGCTACATTTTAGAGATAGATATCAATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTTGTAAATATGAAATGCATTTCACGCTTGCCTGCCCCCGACTACCTCCTGCTAGAACCACTAGTTTGATAGTTTTGGTTCCCCTTATAGTATTTTGTGCCAATTGCATAATTGGTGCCTCTTATGCTAGAGTTTTCGGAACGTCGAGGCTTAAACCCGTTAAGGAACCAGAGGGAGAACGTCACAACTTCAGAAACGGCCATTGGAGATCTGCTCTTCGAGAGATAAGGGAATTGGATGGTTTGGATTCTGAGTCCTCCATATATTCTACTAGGCCTTCAGAAGAACAGATCTCAGTTGAAGATTTGTCACATGCTTACAAAAAACTGGATCAGGAGTACGACAAATTTCTATCAGAATGTGGACTGAGTAAATCGGGCTACTGGCGAGGAGGTGCGCAGAGACCTGGACAGGAATAGGCAATCTTTGTAGCACTTTTCCGTGCTCCCTTGCCATCCAAAAGGGGCTAGGCTAGGGAGTTCTCTGTAGCAACCATTTTTCAGGTGTTGTATTGTAGTGGAGTAAAAATTTCAACCCAGTAATGTAATTAAACTTTAAATTGGACAAAATTTATGCATTCTGCATCCAATTTGCATTAACTCTTAACAAAGAACGTGCATGACCGGAGTCTCGCCGGGATTCTTAGTCACTTTGAGCTTTGCATCGGC

Coding sequence (CDS)

ATGGAAGTGAAGGTCGGGCAGAGAAATGGAGTAGACCGTTCTCCTCTTCTCCCACAACCAATTCTCCCTTCTCTTCTAACTCCTCACAGAAATTGGGCCAACGCCAAAACATCCTTGAAGAATCACTTCAGAGCCATTGCGCTGGCTAAATTGTACTGCAAACTAATTACTGAATGTTCAGATCCTCAAAAATTTCAAACCCCATATGAATTTCATCTCAGTTTCCATGAGTTGGAATTGACAAACTCGCTTAACTTTTTGTTGATTCATTGCGTTCGTGAGCTACATTTTAGAGATAGATATCAATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTTGTAAATATGAAATGCATTTCACGCTTGCCTGCCCCCGACTACCTCCTGCTAGAACCACTAGTTTGATAGTTTTGGTTCCCCTTATAGTATTTTGTGCCAATTGCATAATTGGTGCCTCTTATGCTAGAGTTTTCGGAACGTCGAGGCTTAAACCCGTTAAGGAACCAGAGGGAGAACGTCACAACTTCAGAAACGGCCATTGGAGATCTGCTCTTCGAGAGATAAGGGAATTGGATGGTTTGGATTCTGAGTCCTCCATATATTCTACTAGGCCTTCAGAAGAACAGATCTCAGTTGAAGATTTGTCACATGCTTACAAAAAACTGGATCAGGAGTACGACAAATTTCTATCAGAATGTGGACTGAGTAAATCGGGCTACTGGCGAGGAGGTGCGCAGAGACCTGGACAGGAATAG

Protein sequence

MEVKVGQRNGVDRSPLLPQPILPSLLTPHRNWANAKTSLKNHFRAIALAKLYCKLITECSDPQKFQTPYEFHLSFHELELTNSLNFLLIHCVRELHFRDRYQLQEKLQSSLNGLCKYEMHFTLACPRLPPARTTSLIVLVPLIVFCANCIIGASYARVFGTSRLKPVKEPEGERHNFRNGHWRSALREIRELDGLDSESSIYSTRPSEEQISVEDLSHAYKKLDQEYDKFLSECGLSKSGYWRGGAQRPGQE
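
The protein sequence above should correspond to a direct translation of the CDS. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this transcript. The `translate` helper and the two short test fragments are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS listed above.
cds_start = "ATGGAAGTGAAGGTCGGGCAGAGAAATGGA"
cds_end = "GGACAGGAATAG"
print(translate(cds_start))  # MEVKVGQRNG
print(translate(cds_end))    # GQE
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would extend the same check to the whole record.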
Homology
BLAST of CmaCh01G008420.1 vs. TAIR 10
Match: AT4G09970.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 13 growth stages; Has 15 Blast hits to 15 proteins in 6 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 13; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 50.8 bits (120), Expect = 1.8e-06
Identity = 39/117 (33.33%), Positives = 62/117 (52.99%), Query Frame = 0

Query: 128 LPPARTTSLIV-LVPLIVFCANCIIGASYARVFGTSRLKPVKEPEGERHNFRNGHWRSAL 187
           LP   T  L+  L+P++ FC  CIIG  +  +   SR    K  +G  H   +  WR+AL
Sbjct: 143 LPHLNTGVLLTSLLPVLGFCIICIIGTLHTII---SR----KTSQGHHHG--SERWRTAL 202

Query: 188 REIRE---LDGLDSESSIYS-TRPSEEQISVEDLSHAYKKLDQEYDKFLSECGLSKS 240
            +  E    DG DS S  Y     ++E  + ++++ AY +++ EY +FL ECG+ +S
Sbjct: 203 MDWNEPLASDGHDSMSPEYRVASTNQEATATDEMNEAYSRVELEYKRFLLECGVGES 250

BLAST of CmaCh01G008420.1 vs. TAIR 10
Match: AT4G09970.2 (unknown protein; Has 13 Blast hits to 13 proteins in 5 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 11; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 49.3 bits (116), Expect = 5.3e-06
Identity = 39/116 (33.62%), Positives = 60/116 (51.72%), Query Frame = 0

Query: 128 LPPARTTSLIV-LVPLIVFCANCIIGASYARVFGTSRLKPVKEPEGERHNFRNGHWRSAL 187
           LP   T  L+  L+P++ FC  CIIG  +  +   SR    K  +G  H   +  WR+AL
Sbjct: 111 LPHLNTGVLLTSLLPVLGFCIICIIGTLHTII---SR----KTSQGHHHG--SERWRTAL 170

Query: 188 REIRE---LDGLDSESSIYSTRPSEEQISVEDLSHAYKKLDQEYDKFLSECGLSKS 240
            +  E    DG DS S  Y      E  + ++++ AY +++ EY +FL ECG+ +S
Sbjct: 171 MDWNEPLASDGHDSMSPEY-----REATATDEMNEAYSRVELEYKRFLLECGVGES 212
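
The percentages in the HSP summaries follow from the match counts divided by the alignment length. The following short Python sketch recomputes them from a summary line. The regular expression and the function name `hsp_percentages` are illustrative assumptions about this text layout and do not belong to any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 39/117 (33.33%)".
COUNT_RE = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(summary: str) -> dict:
    """Return {label: percent} recomputed from the match/length counts."""
    return {
        label: round(100 * int(matches) / int(length), 2)
        for label, matches, length in COUNT_RE.findall(summary)
    }

line = "Identity = 39/117 (33.33%), Positives = 62/117 (52.99%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 33.33, 'Positives': 52.99}
```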

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
AT4G09970.1   1.8e-06    33.33       unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 18 plant structures; EXP...
AT4G09970.2   5.3e-06    33.62       unknown protein; Has 13 Blast hits to 13 proteins in 5 species: Archae - 0; Bact...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term    IPR Description     Source    Source Term    Source Description    Alignment
None        No IPR available    COILS     Coil           Coil                  coord: 213..233

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name      Unique Name       Type
CmaCh01G008420    CmaCh01G008420    gene


The following exon feature(s) are a part of this mRNA:

Feature Name                  Unique Name                   Type
CmaCh01G008420.1:exon:1119    CmaCh01G008420.1:exon:1119    exon
CmaCh01G008420.1:exon:1120    CmaCh01G008420.1:exon:1120    exon
CmaCh01G008420.1:exon:1121    CmaCh01G008420.1:exon:1121    exon
CmaCh01G008420.1:exon:1122    CmaCh01G008420.1:exon:1122    exon
CmaCh01G008420.1:exon:1123    CmaCh01G008420.1:exon:1123    exon
CmaCh01G008420.1:exon:1124    CmaCh01G008420.1:exon:1124    exon
CmaCh01G008420.1:exon:1125    CmaCh01G008420.1:exon:1125    exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                       Unique Name                        Type
CmaCh01G008420.1:five_prime_utr    CmaCh01G008420.1:five_prime_utr    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name               Type
CmaCh01G008420.1:cds    CmaCh01G008420.1:cds      CDS
CmaCh01G008420.1:cds    CmaCh01G008420.1:cds_2    CDS
CmaCh01G008420.1:cds    CmaCh01G008420.1:cds_3    CDS
CmaCh01G008420.1:cds    CmaCh01G008420.1:cds_4    CDS
CmaCh01G008420.1:cds    CmaCh01G008420.1:cds_5    CDS
CmaCh01G008420.1:cds    CmaCh01G008420.1:cds_6    CDS
CmaCh01G008420.1:cds    CmaCh01G008420.1:cds_7    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                        Unique Name                         Type
CmaCh01G008420.1:three_prime_utr    CmaCh01G008420.1:three_prime_utr    three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature Name        Unique Name                 Type
CmaCh01G008420.1    CmaCh01G008420.1-protein    polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category              Term Accession    Term Name
cellular_component    GO:0016021        integral component of membrane