CmaCh02G018000 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh02G018000
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Unknown protein
Location: Cma_Chr02: 10026624 .. 10028416 (+)
RNA-Seq Expression: CmaCh02G018000
Synteny: CmaCh02G018000
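
The Location field can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention; the page itself does not state this). `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into a dict with a derived length."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr02: 10026624 .. 10028416 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 1793 bp on the forward strand of Cma_Chr02.
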
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AATCTCTAGTTTTTTCAAGACTAGCCGTTGCAATCTTCTTCTTCTTCTTCTTCTTTATTCATATTCTCCTTCCTCTAGCTCATCCTCATAGTCATGGCAGCCTCGGTGGATTCTCCGTCATCTTCACATCCCAACCAGGTTCTTCTTCTTCTCTCTCCCTCTCGCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTAATACATTTTCTGCAATGATTTCAGGGATCGACCAGCTTCGTGGGTTCTTCCCCATTGTTCAGTCCCGCTTCCGACAAGCGTTTCTGGAGCTCCCTTCGAGGTAGAATAGACTCTCTTCTTGAGGAACGGAATGTAAAGTCTTCAAATCTGGATCCTATCATGCCCCACCAATTAGTGAGTGCCTTTTCTCTTCCATTTTCTTGCTCTTTTCGGGCCAAGTCAAGCTCTAAAATTTTGATGGGTTTGTGGGGATAGAACACCAACAAATCGGAAAGGGCAAAGAGATTGAAGGAAGATTCTTTGCTTTTGCTGAGGGGTTTCGACTCGGTTGGCTACACCCTATCTCAGCTGTCCAACAATTTGGATAATGCCTTACAGGTAACGTCAAACTCATTCCTTCCATTTTCTTTGTTCTAGAGTTTCCATAAAGGACTTGCAATCTTCTTTTGTTCAAATTCAGTAAACTATGCTCTTGAACCCCACAAATGGATATTTGATTTTGAGGTCTGGTTTCAGGGCGCTAGGGATCTCGTCAAAGCGCCAACCTTGACGGAGATCTTCCAGAGCAACCTCAAGAACTCGGAGGTTGAGGTAGGTGATTCGAAAAGGAAAGAAAATGAGTGGGAGGTGCCCAAGCAAGCAACAAAGAGAAAATTTGATGACAGTCATTGCTCAGAAGAATCAGAGGTCGATTTAGAAAAAGATAAGCAGCAAAACCCAAAAGACAAGCTTCAAAAGGCCAAAACTGTTGGTTGCCTTATCCTTTCATCTATATTTGAACACTCAACCCAATCTCATTCTTGATGAATATTAACATTTTTGGGCCTGCGTTTCTTCAGCTTGCAGTTACAATGGCAACAAAATCAGCCTCTCTGGCAAGAGAATTGAAATCATTGAAATCCAATCTATGTTTTATGCAAGAGCGATGTGGTATACTTGAGGAAGAGAATAGAAGACTTCGGGATGGGTTTTCCAGAGGGATCAGACCAGAAGAAGATGATCTGGTCCGTATCAAATTTATGTTCATGATTGTTTGTTCTTCATTATTCCCAGGTGTGTAAATTAGATGATGATTTATATGGAGTCTGTTTGTTTGATGGAAAATCAGGTTAGGCTTCAAATGGAGGCACTACTTGCTGAGAAATCCAGATTAGCAAACGAAAATGCAAACTTAACAAGAGAAAACCAATGCCTTCACCAGCTTGTGGAGTATCACCAACTCACATCCGAAGACCTCTCTGTATCTTACGAGGAAGTCATCCAAGGCATGTGCTTGGACTTCTCCTCACCACCACCAGCCATTGCTGAAGAAGATGAAGAAGAAGAAGAAGAAGAAACCAGTGGAACACCTAGAGTTGATCTTTTTAGCTTTTCTAACTCACTTGATGAGCTCCACCAAGAAGAAGAGTAGAGGTGATTTTGGAGCAGAAGATTTCAGAGCTTTGGTCTTCAGTATCTTTCCATTTTACATTCCTGTAATCAAGTAACCTGATGATCGTCACTGCATTTATTTAGCTCAGAAATAAACTAATTTATTTAGATCATCTTTTTTTTCCCTTTACACGCCAAAGCAAGTTTTTGGTACGTCGTTTTATGC

mRNA sequence

AATCTCTAGTTTTTTCAAGACTAGCCGTTGCAATCTTCTTCTTCTTCTTCTTCTTTATTCATATTCTCCTTCCTCTAGCTCATCCTCATAGTCATGGCAGCCTCGGTGGATTCTCCGTCATCTTCACATCCCAACCAGGGATCGACCAGCTTCGTGGGTTCTTCCCCATTGTTCAGTCCCGCTTCCGACAAGCGTTTCTGGAGCTCCCTTCGAGGTAGAATAGACTCTCTTCTTGAGGAACGGAATGTAAAGTCTTCAAATCTGGATCCTATCATGCCCCACCAATTAAACACCAACAAATCGGAAAGGGCAAAGAGATTGAAGGAAGATTCTTTGCTTTTGCTGAGGGGTTTCGACTCGGTTGGCTACACCCTATCTCAGCTGTCCAACAATTTGGATAATGCCTTACAGGGCGCTAGGGATCTCGTCAAAGCGCCAACCTTGACGGAGATCTTCCAGAGCAACCTCAAGAACTCGGAGGTTGAGGTAGGTGATTCGAAAAGGAAAGAAAATGAGTGGGAGGTGCCCAAGCAAGCAACAAAGAGAAAATTTGATGACAGTCATTGCTCAGAAGAATCAGAGGTCGATTTAGAAAAAGATAAGCAGCAAAACCCAAAAGACAAGCTTCAAAAGGCCAAAACTCTTGCAGTTACAATGGCAACAAAATCAGCCTCTCTGGCAAGAGAATTGAAATCATTGAAATCCAATCTATGTTTTATGCAAGAGCGATGTGGTATACTTGAGGAAGAGAATAGAAGACTTCGGGATGGGTTTTCCAGAGGGATCAGACCAGAAGAAGATGATCTGGTTAGGCTTCAAATGGAGGCACTACTTGCTGAGAAATCCAGATTAGCAAACGAAAATGCAAACTTAACAAGAGAAAACCAATGCCTTCACCAGCTTGTGGAGTATCACCAACTCACATCCGAAGACCTCTCTGTATCTTACGAGGAAGTCATCCAAGGCATGTGCTTGGACTTCTCCTCACCACCACCAGCCATTGCTGAAGAAGATGAAGAAGAAGAAGAAGAAGAAACCAGTGGAACACCTAGAGTTGATCTTTTTAGCTTTTCTAACTCACTTGATGAGCTCCACCAAGAAGAAGAGTAGAGGTGATTTTGGAGCAGAAGATTTCAGAGCTTTGGTCTTCAGTATCTTTCCATTTTACATTCCTGTAATCAAGTAACCTGATGATCGTCACTGCATTTATTTAGCTCAGAAATAAACTAATTTATTTAGATCATCTTTTTTTTCCCTTTACACGCCAAAGCAAGTTTTTGGTACGTCGTTTTATGC

Coding sequence (CDS)

ATGGCAGCCTCGGTGGATTCTCCGTCATCTTCACATCCCAACCAGGGATCGACCAGCTTCGTGGGTTCTTCCCCATTGTTCAGTCCCGCTTCCGACAAGCGTTTCTGGAGCTCCCTTCGAGGTAGAATAGACTCTCTTCTTGAGGAACGGAATGTAAAGTCTTCAAATCTGGATCCTATCATGCCCCACCAATTAAACACCAACAAATCGGAAAGGGCAAAGAGATTGAAGGAAGATTCTTTGCTTTTGCTGAGGGGTTTCGACTCGGTTGGCTACACCCTATCTCAGCTGTCCAACAATTTGGATAATGCCTTACAGGGCGCTAGGGATCTCGTCAAAGCGCCAACCTTGACGGAGATCTTCCAGAGCAACCTCAAGAACTCGGAGGTTGAGGTAGGTGATTCGAAAAGGAAAGAAAATGAGTGGGAGGTGCCCAAGCAAGCAACAAAGAGAAAATTTGATGACAGTCATTGCTCAGAAGAATCAGAGGTCGATTTAGAAAAAGATAAGCAGCAAAACCCAAAAGACAAGCTTCAAAAGGCCAAAACTCTTGCAGTTACAATGGCAACAAAATCAGCCTCTCTGGCAAGAGAATTGAAATCATTGAAATCCAATCTATGTTTTATGCAAGAGCGATGTGGTATACTTGAGGAAGAGAATAGAAGACTTCGGGATGGGTTTTCCAGAGGGATCAGACCAGAAGAAGATGATCTGGTTAGGCTTCAAATGGAGGCACTACTTGCTGAGAAATCCAGATTAGCAAACGAAAATGCAAACTTAACAAGAGAAAACCAATGCCTTCACCAGCTTGTGGAGTATCACCAACTCACATCCGAAGACCTCTCTGTATCTTACGAGGAAGTCATCCAAGGCATGTGCTTGGACTTCTCCTCACCACCACCAGCCATTGCTGAAGAAGATGAAGAAGAAGAAGAAGAAGAAACCAGTGGAACACCTAGAGTTGATCTTTTTAGCTTTTCTAACTCACTTGATGAGCTCCACCAAGAAGAAGAGTAG

Protein sequence

MAASVDSPSSSHPNQGSTSFVGSSPLFSPASDKRFWSSLRGRIDSLLEERNVKSSNLDPIMPHQLNTNKSERAKRLKEDSLLLLRGFDSVGYTLSQLSNNLDNALQGARDLVKAPTLTEIFQSNLKNSEVEVGDSKRKENEWEVPKQATKRKFDDSHCSEESEVDLEKDKQQNPKDKLQKAKTLAVTMATKSASLARELKSLKSNLCFMQERCGILEEENRRLRDGFSRGIRPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLHQLVEYHQLTSEDLSVSYEEVIQGMCLDFSSPPPAIAEEDEEEEEEETSGTPRVDLFSFSNSLDELHQEEE
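
The protein sequence above can be checked against the CDS by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and only short fragments of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

first = translate("ATGGCAGCCTCGGTGGATTCTCCG")  # first 8 codons of the CDS
last = translate("CAAGAAGAAGAGTAG")            # last 4 codons plus the stop codon
print(first, last)  # MAASVDSP QEEE
```

Both fragments agree with the start (MAASVDSP) and end (QEEE) of the listed protein sequence.
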
Homology
BLAST of CmaCh02G018000 vs. TAIR 10
Match: AT4G02800.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 3209 Blast hits to 2720 proteins in 308 species: Archae - 13; Bacteria - 213; Metazoa - 1207; Fungi - 247; Plants - 183; Viruses - 21; Other Eukaryotes - 1325 (source: NCBI BLink). )

HSP 1 Score: 313.5 bits (802), Expect = 2.0e-85
Identity = 187/348 (53.74%), Positives = 246/348 (70.69%), Query Frame = 0

Query: 1   MAASVDSPSSSHPNQ--------GSTSFVGSSPLFSPASDKRFWSSLRGRIDSLLEERNV 60
           MAASV++PS +H N          +TSF  SSP  SP+SDKR WS++R R+D LLEE   
Sbjct: 1   MAASVETPSPNHTNNEGTRLNMVSATSFDSSSPSVSPSSDKRLWSNVRNRVDVLLEE--- 60

Query: 61  KSSNLDPIMPHQLNTNKSERAKRLKEDSLLLLRGFDSVGYTLSQLSNNLDNALQGARDLV 120
            S N  P+        +SER+KR K DS+LLL+GFDSV +TLS LS+NLDNALQG R+L 
Sbjct: 61  NSKNHKPVT--NTIAIESERSKRFKNDSMLLLKGFDSVSHTLSLLSSNLDNALQGVRELA 120

Query: 121 KAPTLTEIFQSNLKNSEVEVGDSKRKENEWEVPKQATKRKFDDSHCSEESEVDLEKDKQQ 180
           K P+ +EI  SNLK  +++    +++E+E E   +  KRK +      E   D   ++++
Sbjct: 121 KPPSYSEILHSNLKADQIQ--RQQKEEDEEEEESKGKKRKHES---DVEQTEDSSNEEEK 180

Query: 181 NPKDK--LQKAKTLAVTMATKSASLARELKSLKSNLCFMQERCGILEEENRRLRDGFSRG 240
            PK++  ++KAK +A++MA K+ SLARELK++KS+L F+QERCG+LEEEN+RLRDGF +G
Sbjct: 181 RPKERKIMKKAKNIAISMAAKANSLARELKTIKSDLSFIQERCGLLEEENKRLRDGFVKG 240

Query: 241 IRPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLHQLVEYHQLTSEDLSVSYEEVIQ 300
           +RPEEDDLVRLQ+E LLAEK+RLANENANL RENQCLHQ+VEYHQ+TS+DLS SYE+V+Q
Sbjct: 241 VRPEEDDLVRLQLEVLLAEKARLANENANLVRENQCLHQMVEYHQITSQDLSPSYEQVVQ 300

Query: 301 GMCLDFSSPPPAIAEEDEEEEEEETSGTPRVDLFSFSNSLDELHQEEE 339
           G CLDFSSP P   + D+EEEE ET      D+    N   E  +EE+
Sbjct: 301 GFCLDFSSPLP---QYDDEEEEHETRAR---DVSKALNESFEKAEEEQ 332
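
The percentages in an HSP summary line follow directly from its counts (matches divided by alignment length). The following is a minimal sketch that parses such a line and recomputes both values; the regular expression and the helper name `hsp_stats` are assumptions made for illustration, not part of any BLAST tooling.

```python
import re

# The "i" in "Positives" is optional so a misspelled "Postives" label still parses.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_stats(line):
    """Recompute identity/positives percentages and compare with the reported ones."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found")
    ident, ident_len, pos, pos_len = (int(m.group(i)) for i in (1, 2, 4, 5))
    ident_pct = 100 * ident / ident_len
    pos_pct = 100 * pos / pos_len
    return {
        "identity_pct": round(ident_pct, 2),
        "positives_pct": round(pos_pct, 2),
        # Reported values carry two decimals, so allow half a unit in the last place.
        "consistent": abs(ident_pct - float(m.group(3))) <= 0.005
        and abs(pos_pct - float(m.group(6))) <= 0.005,
    }

stats = hsp_stats("Identity = 187/348 (53.74%), Positives = 246/348 (70.69%), Query Frame = 0")
print(stats)
```

For the AT4G02800.1 hit this gives 53.74% identity and 70.69% positives over 348 aligned columns, matching the reported figures.
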

BLAST of CmaCh02G018000 vs. TAIR 10
Match: AT5G01970.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G30050.1); Has 240 Blast hits to 236 proteins in 72 species: Archae - 0; Bacteria - 15; Metazoa - 51; Fungi - 19; Plants - 119; Viruses - 0; Other Eukaryotes - 36 (source: NCBI BLink). )

HSP 1 Score: 110.9 bits (276), Expect = 2.0e-24
Identity = 90/279 (32.26%), Positives = 145/279 (51.97%), Query Frame = 0

Query: 23  SSPLFSPASDKRF-------WSSLRGRIDSLLEERNVKSSNLDPIMPHQLNTNKSERAKR 82
           SSP F     K F       W  +  +  S++E+   KSS+          +  S+   +
Sbjct: 39  SSPAFDQPRSKNFTTEPKGLWGVIAQKAKSVIEDD--KSSDRSTTASQSRFSYLSDEGFK 98

Query: 83  LKEDSLLLLRGFDSVGYTLSQLSNNLDNALQGARDLVKAPTLTEIFQSNLKNSEVEVGDS 142
            K D+  L RG D +  +L+Q+ +  + A +  R LV+  T  +I Q   K      G  
Sbjct: 99  -KMDNPKLRRGLDKLTSSLNQIGDTFEKAFEDGRTLVENKT-ADIIQETRKLQTRRRGTG 158

Query: 143 KRKENEWEVPKQATKRKFDDSHCSEESEVDLEKDKQQNPKDKLQKAKTLAVTMATKSASL 202
              EN+ +    ++  K       + + ++ E         +L+ ++ +A+  A K+  L
Sbjct: 159 GEDENQNQSYGVSSSWKKSPEQPMQLNHIEHE--------TQLKASRDVAMATAAKAKLL 218

Query: 203 ARELKSLKSNLCFMQERCGILEEENRRLRDGF-SRGIRPEEDDLVRLQMEALLAEKSRLA 262
            RELK++K++L F +ERC  LEEEN+ LR+    +G  P ++DL+RLQ+E+LLAEK+RLA
Sbjct: 219 LRELKTVKADLAFAKERCAQLEEENKHLRESHREKGSNPADEDLIRLQLESLLAEKARLA 278

Query: 263 NENANLTRENQCLHQLVEYHQLTSED---LSVSYEEVIQ 291
           +EN+   REN+ L ++VEYHQLT +D   +    EEV Q
Sbjct: 279 HENSVYARENRFLREIVEYHQLTMQDVVYIDEGSEEVTQ 305

BLAST of CmaCh02G018000 vs. TAIR 10
Match: AT1G30050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 246 Blast hits to 244 proteins in 61 species: Archae - 0; Bacteria - 8; Metazoa - 78; Fungi - 10; Plants - 117; Viruses - 0; Other Eukaryotes - 33 (source: NCBI BLink). )

HSP 1 Score: 102.1 bits (253), Expect = 9.2e-22
Identity = 75/190 (39.47%), Positives = 112/190 (58.95%), Query Frame = 0

Query: 106 QGARDLVKAPTLTEIFQS--NLKNSEVEVGDSKRK---ENEWEVPKQATKRKFD------ 165
           Q   D++  P+   I +S   +  S   +GDS  K   E    V  Q  ++  D      
Sbjct: 101 QQQNDVIFEPSNPTIRKSIDKITTSLNHIGDSFEKAFEEGRTIVASQIRRKGSDLIDSDN 160

Query: 166 -DSHCSEESEVDLEKDKQQNPKD-KLQKAKTLAVTMATKSASLARELKSLKSNLCFMQER 225
            + H S  S    +   Q NP++ +L+ ++ +A+  A K+  L RELK++K++L F +ER
Sbjct: 161 NNYHQSSGSSSPWQPLTQPNPRESQLKASRDVAMATAAKAKLLLRELKTVKADLAFAKER 220

Query: 226 CGILEEENRRLRDGFSRG-IRPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLHQLV 282
           C  LEEEN+RLRD   +G   P +DDL+RLQ+E LLAEK+RLA+EN+   REN+ L ++V
Sbjct: 221 CSQLEEENKRLRDNRDKGNNNPADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIV 280

BLAST of CmaCh02G018000 vs. TAIR 10
Match: AT2G30530.1 (unknown protein; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 5513 Blast hits to 872 proteins in 154 species: Archae - 0; Bacteria - 30; Metazoa - 615; Fungi - 144; Plants - 149; Viruses - 12; Other Eukaryotes - 4563 (source: NCBI BLink). )

HSP 1 Score: 101.7 bits (252), Expect = 1.2e-21
Identity = 59/117 (50.43%), Positives = 83/117 (70.94%), Query Frame = 0

Query: 171 QQNP------KDKLQKAKTLAVTMATKSASLARELKSLKSNLCFMQERCGILEEENRRLR 230
           QQNP      + +L+ ++ +A+ MA K+  L RELK +KS+L F ++RC  LEEEN+ LR
Sbjct: 216 QQNPEIQADLEIQLKASRDVAMAMAAKAKLLLRELKMVKSDLAFAKQRCAQLEEENKVLR 275

Query: 231 DGFSRGIRPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLHQLVEYHQLTSEDL 282
           +  S   + ++DDLVRLQ+E LLAEK+RLA+EN+  TREN  L  +VEYHQLT +D+
Sbjct: 276 ENRSGDSQTDDDDLVRLQLETLLAEKARLAHENSIYTRENLYLRGVVEYHQLTMQDV 332

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
AT4G02800.1   2.0e-85   53.74         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G01970.1   2.0e-24   32.26         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G30050.1   9.2e-22   39.47         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G30530.1   1.2e-21   50.43         unknown protein; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant ...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description       Alignment
None      No IPR available  COILS        Coil           Coil                     coord: 243..270
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 302..316
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 1..26
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 1..31
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 130..177
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 128..178
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 296..338
None      No IPR available  PANTHER      PTHR31016:SF2  OSJNBA0065B15.1 PROTEIN  coord: 1..328
None      No IPR available  PANTHER      PTHR31016      UNCHARACTERIZED          coord: 1..328
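
Several of the mobidb-lite disorder predictions overlap, so the distinct disordered regions are easier to see after merging. The following is a minimal sketch, assuming the `coord` values are 1-based inclusive residue positions; `merge_intervals` is a hypothetical helper, and treating directly adjacent intervals as one region is a design choice made here, not something the page specifies.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]

# mobidb-lite coordinates as listed in the InterPro annotations.
disorder = [(302, 316), (1, 26), (1, 31), (130, 177), (128, 178), (296, 338)]
regions = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in regions)
print(regions, covered)  # [(1, 31), (128, 178), (296, 338)] 125
```

Under these assumptions the six predictions collapse to three regions (1..31, 128..178, 296..338) covering 125 residues.
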

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh02G018000.1    CmaCh02G018000.1    mRNA