CmaCh11G007790 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh11G007790
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionUnknown protein
LocationCma_Chr11: 3786365 .. 3788811 (-)
RNA-Seq ExpressionCmaCh11G007790
SyntenyCmaCh11G007790
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTGGAGAGTTGATGATAACGACTGTGAATTCCAAATTGTGCGTTCGATCTCAGTTGCTACAATTCCAATCCAAATCCGCTTCCCATTTTTCTGCTACTTCAGAATCTCAGTTCCCCATCCAGTCACTTTCACTTTCTTTCCTCGCTCCTCGCCCTGTTCTTCAATTCGGGCAATGGTGGATGAAATTCCGATCAAGAGAAATAGGGTATGGCCGCACGTTTTTGATTTTCTCTGCATTACTATTCCATAATCATAAAGGTATCTTTCATGGGAATCAAAACAAGATCAATTATGGAACAACAGCCTTCCGCCCAACTTTAAAGGTTGCGTGGGACGATTCCCATTTCGTTTTCCCCTCTCATCCAAGGAATTTTCTGTTCTATCTTCTGTTTACTTGCCTTCTCCGATTGATCTTTCGTTGATCCTTATTTCGCATGATTTTCTTTGAAGCCATCTCTCGTCTGTGATTCCTCCCTTGCGGCAGGGGAGTTTGATTCTGTAACATTTGTTCACGTTTTGGACTCGGATTTCTTCATTTCCAGTTTTAGAGCTATGGATTATTATTAGTGTGGCTGGTGATTTCATTCAGATTCATGTTTCTGCTGGTTGTTGGTGAAATAATGGGCTAAGTTTACAAGTGATTAAGTAGTTAAGTAGTCGATTTTTGTTGTTTAAATTGATATTAATGAGGTTTTGGATGTGTAAGATTCACTGTCCTTCATCCCTTTGCCTTTGCCAACCTTCCCCTCATATACATTCCACTGGATCAGTATCATTAGATTTGCAAAACTCACCACATTTGCCTTCACGAGATGTGCCAGTCATCGGAACTTCTGGTTCCAGTGTTGAGACCCTTGAACCTAAGCGCGACAAAGTGCCGGAGAGTGAGTTTGGAAACAAGAGCAGTTTGAGGAAGCTGAATTCGCGGTCAGGTGTATCAAAGGAGAAGCATGTAAAAACTGTACAATGGATGGATTTTTCAGGCAAAGAACTTGCTGAGATCAGGGAATTTGAAGCTAGGTAATATCTCTTACCATTTAATCTCTCTCTTCTCATTTGAGACTTGAAGAGCCTGGAAGGAGTGGAGAAGGTGTGATCTCTGTTTTTAGCATCTCGTAAATCATGATCGAGGAGATCGAATATCATCCAAATAATTGAAATTCAGGGTCTTCCTTCTATTTAATTATTTGATTTTAGAATCTTTACATCTCATGAATTGAGTTTGAGAAGATTTAATGTCATCAAAATAATTGAAATGCTGTTTCTTTCATTGATTTATTATTTGATTAGATCTTGTATTAGTATGGAATTACCAAGTTACTTAAATATCAAGTTATACAATATAATCTGTTCATATTAACACAACATACTGAAACTTGATCGAATATTGGAGGTATGATGAAATAATATGTACTTAGAAGGATGGAACTGCAATTACATTTGATGCTTTAGATGTTCATACCAGTTTAAAGATATTTGAGATGATGCTTCATGTCCCTATTCCAAAATCTTTGAAGAGTTCATAAACTTACAGATGGGATTGGAATGCCCAAATTTGAGTTTTCTTCATGATGGCTGCTGTTTTCTTGTGAATGAACTATACATACTGACTACTGATCGTGTACGCGTGTGCGTGTACATATTTTTCTTATGCAACAGTGAAGATGAAGATTCGGATTATGAAGACGAGGATAACGGAAGCTGCATCTGTACTATTTTATGAAATTTCTCAGTAGCTTAGATTTTCAAATTTAAAAAAGAAGAGGAAGTAGCTAACGAAGCCCGCTTAATGTCGGTTACAATCATATTCGTCCTGAAAGGTCTTTGATGCTTCTTTGTTTTCATTTCTCAGATTAGAATATGTTGTGAGAATGAAAGGCAAAATGGAGCTTTATTTACCAATAATGTACCTAATTTATTTGTTTTATGAAAATGTTGTTAATTCTTCAACATGAACAAATTTCACTTTTCACTTGATTGGCAAAAGAATTCTCTGCTTTGTAAATTATATACTGAATGTTATCTCATGAGGCCACTACTTATACTCTAAGCTTCCATGATTGTCTAAATTATTGATGCATGCTGCCTGAGATATGAAATTTTGTTATATTGTTTTGCTGCATTTTAATGTTTCTAAAGTGCAGACTATTTGAACCAGAGCAGATTCCTACATATGGGATCATTGAAACTCATTGCCTTCGCCTTGTAGACCTTCGAAAGATGCGTCTACGTGGTGTTCGTGGACTAAAGTACCCAGAGATGTAAACGATTAGCACTGCAAACACAAAGCCAGCTGGGAACCCAATTCCTGCACCTATCCAGGAAAACACTTCTTGCTTTTCTTCCTCCTCAGCTTGTTTTGTCGGTTGCTCGTCTTCTGGACATGGCTGTTGGATTTGAATTCCACATAGCCCACTGTTGTTGGCATAATAGCTTGGAGTGTTCAT

mRNA sequence

TTGGAGAGTTGATGATAACGACTGTGAATTCCAAATTGTGCGTTCGATCTCAGTTGCTACAATTCCAATCCAAATCCGCTTCCCATTTTTCTGCTACTTCAGAATCTCAGTTCCCCATCCAGTCACTTTCACTTTCTTTCCTCGCTCCTCGCCCTGTTCTTCAATTCGGGCAATGGTGGATGAAATTCCGATCAAGAGAAATAGGGTATGGCCGCACGTTTTTGATTTTCTCTGCATTACTATTCCATAATCATAAAGGTATCTTTCATGGGAATCAAAACAAGATCAATTATGGAACAACAGCCTTCCGCCCAACTTTAAAGGTTGCGTGGGACGATTCCCATTTCGTTTTCCCCTCTCATCCAAGGAATTTTCTGTTCTATCTTCTGTTTACTTGCCTTCTCCGATTGATCTTTCGTTGATCCTTATTTCGCATGATTTTCTTTGAAGCCATCTCTCGTCTGTGATTCCTCCCTTGCGGCAGGGGAGTTTGATTCTGTAACATTTGTTCACGTTTTGGACTCGGATTTCTTCATTTCCAGTTTTAGAGCTATGGATTATTATTAGTGTGGCTGGTGATTTCATTCAGATTCATGTTTCTGCTGGTTGTTGGTGAAATAATGGGCTAAGTTTACAAGTGATTAAGTAGTTAAGTAGTCGATTTTTGTTGTTTAAATTGATATTAATGAGGTTTTGGATGTGTAAGATTCACTGTCCTTCATCCCTTTGCCTTTGCCAACCTTCCCCTCATATACATTCCACTGGATCAGTATCATTAGATTTGCAAAACTCACCACATTTGCCTTCACGAGATGTGCCAGTCATCGGAACTTCTGGTTCCAGTGTTGAGACCCTTGAACCTAAGCGCGACAAAGTGCCGGAGAGTGAGTTTGGAAACAAGAGCAGTTTGAGGAAGCTGAATTCGCGGTCAGGTGTATCAAAGGAGAAGCATGTAAAAACTGTACAATGGATGGATTTTTCAGGCAAAGAACTTGCTGAGATCAGGGAATTTGAAGCTAGTGAAGATGAAGATTCGGATTATGAAGACGAGGATAACGGAAGCTGCATCTGTACTATTTTATGAAATTTCTCAGTAGCTTAGATTTTCAAATTTAAAAAAGAAGAGGAAGTAGCTAACGAAGCCCGCTTAATGTCGGTTACAATCATATTCGTCCTGAAAGACTATTTGAACCAGAGCAGATTCCTACATATGGGATCATTGAAACTCATTGCCTTCGCCTTGTAGACCTTCGAAAGATGCGTCTACGTGGTGTTCGTGGACTAAAGTACCCAGAGATGTAAACGATTAGCACTGCAAACACAAAGCCAGCTGGGAACCCAATTCCTGCACCTATCCAGGAAAACACTTCTTGCTTTTCTTCCTCCTCAGCTTGTTTTGTCGGTTGCTCGTCTTCTGGACATGGCTGTTGGATTTGAATTCCACATAGCCCACTGTTGTTGGCATAATAGCTTGGAGTGTTCAT

Coding sequence (CDS)

ATGAGGTTTTGGATGTGTAAGATTCACTGTCCTTCATCCCTTTGCCTTTGCCAACCTTCCCCTCATATACATTCCACTGGATCAGTATCATTAGATTTGCAAAACTCACCACATTTGCCTTCACGAGATGTGCCAGTCATCGGAACTTCTGGTTCCAGTGTTGAGACCCTTGAACCTAAGCGCGACAAAGTGCCGGAGAGTGAGTTTGGAAACAAGAGCAGTTTGAGGAAGCTGAATTCGCGGTCAGGTGTATCAAAGGAGAAGCATGTAAAAACTGTACAATGGATGGATTTTTCAGGCAAAGAACTTGCTGAGATCAGGGAATTTGAAGCTAGTGAAGATGAAGATTCGGATTATGAAGACGAGGATAACGGAAGCTGCATCTGTACTATTTTATGA

Protein sequence

MRFWMCKIHCPSSLCLCQPSPHIHSTGSVSLDLQNSPHLPSRDVPVIGTSGSSVETLEPKRDKVPESEFGNKSSLRKLNSRSGVSKEKHVKTVQWMDFSGKELAEIREFEASEDEDSDYEDEDNGSCICTIL
Homology
BLAST of CmaCh11G007790 vs. TAIR 10
Match: AT3G13480.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G55475.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 89.4 bits (220), Expect = 2.4e-18
Identity = 64/164 (39.02%), Postives = 86/164 (52.44%), Query Frame = 0

Query: 1   MRFWMCKIHCPSSLCLCQPSPHIHSTGSVSLDL---------------------QNSPHL 60
           MR  +CKI CPS +C C+PSPHI+++GS+ L+                       +  H+
Sbjct: 13  MRVLLCKIQCPSFICFCKPSPHIYASGSLKLENTFPQVSSSTTVVDDRDHDDNDDDDAHV 72

Query: 61  PSRDVPVIGTSGSSVE-------TLEPKRDKVPESEFGN--KSSLRK--LNSRSGVSKEK 120
              +V V    G   E        LE K+++      G   KSSL+K  L+S  G  KEK
Sbjct: 73  EEEEVVVDHVDGLLTEVVREEDCALEGKKEEEESLSNGEILKSSLKKEVLDSADGGRKEK 132

Query: 121 HVKTVQWMDFSGKELAEIREFEASEDEDSDYEDEDNGSCICTIL 133
             K VQW+D  GKELAEIREFE+SE+ED  Y+ +   SC+C IL
Sbjct: 133 --KKVQWVDLMGKELAEIREFESSEEEDVRYDGDQ--SCVCVIL 172

BLAST of CmaCh11G007790 vs. TAIR 10
Match: AT1G55475.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G13480.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 58.9 bits (141), Expect = 3.5e-09
Identity = 33/62 (53.23%), Postives = 45/62 (72.58%), Query Frame = 0

Query: 72  KSSLRKLNSRSGVSKEKHVKTVQWMDFSGKELAEIREFEASEDEDSDYEDEDNG-SCICT 131
           KSSLRK++S S  ++++  K VQW+D  GKELAEIREFE S+++D    D D G +C+C 
Sbjct: 62  KSSLRKVDSNSTEAEKREKKKVQWVDVIGKELAEIREFEPSDEDDI---DSDRGKTCVCI 120

Query: 132 IL 133
           IL
Sbjct: 122 IL 120

BLAST of CmaCh11G007790 vs. TAIR 10
Match: AT2G33390.1 (unknown protein; Has 34 Blast hits to 34 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 46.6 bits (109), Expect = 1.8e-05
Identity = 25/48 (52.08%), Postives = 32/48 (66.67%), Query Frame = 0

Query: 85  SKEKHVKTVQWMDFSGKELAEIREFEASEDEDSDYEDEDNGSCICTIL 133
           S +K  +TVQW D  G  LAE+  +E S  E SD ED+D+ SCICTI+
Sbjct: 53  STDKIKRTVQWNDIKGDNLAEVLVYEPS--EVSDTEDDDSDSCICTIM 98

BLAST of CmaCh11G007790 vs. TAIR 10
Match: AT1G34010.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22790.2); Has 74 Blast hits to 74 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 44.3 bits (103), Expect = 8.9e-05
Identity = 38/98 (38.78%), Postives = 48/98 (48.98%), Query Frame = 0

Query: 58  EPKRDKVPESE----------FGNKSSLRKLN-----------SRSGVSKEKHVKTVQWM 117
           E K+D+ P  E          F  KSSL+K +           SR GV      + VQW 
Sbjct: 85  EGKKDEAPSVEDYNNCEVTNRFALKSSLKKRSFSDVVIGDDDVSRDGVVDHIDRRKVQWP 144

Query: 118 DFSGKELAEIREFEASE-DEDSDYEDEDNG-SCICTIL 133
           D  G E+AE+REFE SE DE  D     +G SC+CTI+
Sbjct: 145 DTCGIEIAEVREFEPSEVDESEDEFHHGSGKSCMCTIM 182

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT3G13480.12.4e-1839.02unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G55475.13.5e-0953.23unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G33390.11.8e-0552.08unknown protein; Has 34 Blast hits to 34 proteins in 10 species: Archae - 0; Bac... [more]
AT1G34010.18.9e-0538.78unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 32..83
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 56..70
NoneNo IPR availablePANTHERPTHR33401LIGHT-HARVESTING COMPLEX-LIKE PROTEIN OHP2, CHLOROPLASTICcoord: 1..132
NoneNo IPR availablePANTHERPTHR33401:SF19EXPRESSED PROTEINcoord: 1..132

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh11G007790.1CmaCh11G007790.1mRNA