CmaCh02G009970 (gene) Cucurbita maxima (Rimu)

Name: CmaCh02G009970
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Seed specific protein Bn15D1B
Location: Cma_Chr02 : 5941177 .. 5942839 (-)
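
As a rough illustration of how the location field can be read programmatically, the sketch below splits it into sequence ID, start, end and strand, and derives the span length. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts.

    Hypothetical helper; the layout is taken from the record above.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cma_Chr02 : 5941177 .. 5942839 (-)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the feature spans 1663 bp on the minus strand of Cma_Chr02.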
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTAAACTTGACAGCCCTTCCTTCACCACTCACTCCACGCGCACGTCCTTCAGCTTCGTATTCTTCTTCTTCTTCTTCTTCTTCTTCAATTGCATGGCCGCCGCTTCTAACCTTATAATTGCTGCTCGACCACGACTCGCCCATTTGCGGAACCCTGTCCGCCGCCTCTCGCCCTCTCTCTTCACTCTCCGGTACTCTTGCCTCTTATTTCTCTTCCTTAAATTAATATAAATATCTTTTAATTTTATATTTACTTAAACATCTCCCACTAAATTAATAATGTCAGACTCATACCCAATAATCCTTAACAATTTTAATTTTTTTTAAAAAAATTTAAAAATAATTTTGTGTTACCGTATTTCAAAAATATTAACGTTAGAATGTTTGTAAATAATTTGTAAAGTGTTTCAGGACTGTTCTTGTCTATATCTTAAGAGTAAAAATTTTTAATTTTTTTTTAAAATAAAAGTAAAGGGTGTCGATGTTGATTTTTAAATGTTATGGGTATTTTTTAAACTTAATGGTTAAATTTAATTTTATATTTAGAAGGTTTGTGAATTAAAAAAATGAATTTATGTTGAGAGCTATTAGACAGAAATTTAACTATTTATTTAACCTACTTAGCTAATTTAAAGACCAAAATATTTAATAATTTAATGTCCAAAGGAAACAAAAAAAAAAAATTATATTTATCTTATATGCTATCACACGTGTCATCAAGCTTAAGCTTAAATTTGAATAAATATGAAGATTAATTAAAAATTATAAGAATAAATTTATAATTTAATTTAATTTATTTTAAAGAAAAGGCATGGAAGCACCTACCTTTAACATTTGAGAGACCATGCCTGGACAACGTCTTAATTTCCACAATTTTAAGAATATTTTATTCTCTTCGATCGACGTAAGATCTCACTGTTCATACCATCAACATATTAATAATATATAATTTATTTATTATTATTATTATTTATTTTTTTGCTTTCAGGGCAGGTCATTGGTCAGCGTATGAGAAGAACCCAGAAGAGGAAGTGTGCCCGACGGTGGTGCCAGATCATATGATCCACTCCATATCCGAAGACTACTGGGCGCCACATCCCCAAACCGGAGTTTTCGGCCCACCAATAAGTCAACACAATCCTGTTCCTAGCGACACGTCGGCTGGCGGCGCCAAAGAGGGCTCCGTTCTGGAGCTGAAGGTTTGGTTCCGACACATCGAGCTGGAGGACCCCGAGAAGCCACACGACCTGTAATCTCCATGGCTATGATAAGATATAATAATAATAAGCTTACATGCATGCAAGGGTGTAAGAAGAAGTTTATGTGTATTATTTGGATTGAGTGCCGTGTTTCTATTTTCAAGTTGTGTTATGAACGTGTGCGCTACACGAACTTTGTACTCGTTTCATGATTATATCTCTTGTTGTAAATTTAAGCTATGTGATTAGAAAGGGAGTAATTACTCGGATGAGATTGATGGTCGATGCCATTCTCAGTTGGAGGGAGGAACAAAACATTTATTAAGGTGTGGAAACTTATTTCTAGACTTGTAATATTGTCTAATGTTGTTTGGGCGGTAGAGTTTTTAGATTTTTTAATAATGTCGAGTTTAATCATCTTCTTATTTTGAATAGAATGTGTTGATCTAAATCTTCTTCCGGA

mRNA sequence

TTTTAAACTTGACAGCCCTTCCTTCACCACTCACTCCACGCGCACGTCCTTCAGCTTCGTATTCTTCTTCTTCTTCTTCTTCTTCTTCAATTGCATGGCCGCCGCTTCTAACCTTATAATTGCTGCTCGACCACGACTCGCCCATTTGCGGAACCCTGTCCGCCGCCTCTCGCCCTCTCTCTTCACTCTCCGGGCAGGTCATTGGTCAGCGTATGAGAAGAACCCAGAAGAGGAAGTGTGCCCGACGGTGGTGCCAGATCATATGATCCACTCCATATCCGAAGACTACTGGGCGCCACATCCCCAAACCGGAGTTTTCGGCCCACCAATAAGTCAACACAATCCTGTTCCTAGCGACACGTCGGCTGGCGGCGCCAAAGAGGGCTCCGTTCTGGAGCTGAAGGTTTGGTTCCGACACATCGAGCTGGAGGACCCCGAGAAGCCACACGACCTGTAATCTCCATGGCTATGATAAGATATAATAATAATAAGCTTACATGCATGCAAGGGTGTAAGAAGAAGTTTATGTGTATTATTTGGATTGAGTGCCGTGTTTCTATTTTCAAGTTGTGTTATGAACGTGTGCGCTACACGAACTTTGTACTCGTTTCATGATTATATCTCTTGTTGTAAATTTAAGCTATGTGATTAGAAAGGGAGTAATTACTCGGATGAGATTGATGGTCGATGCCATTCTCAGTTGGAGGGAGGAACAAAACATTTATTAAGGTGTGGAAACTTATTTCTAGACTTGTAATATTGTCTAATGTTGTTTGGGCGGTAGAGTTTTTAGATTTTTTAATAATGTCGAGTTTAATCATCTTCTTATTTTGAATAGAATGTGTTGATCTAAATCTTCTTCCGGA

Coding sequence (CDS)

ATGGCCGCCGCTTCTAACCTTATAATTGCTGCTCGACCACGACTCGCCCATTTGCGGAACCCTGTCCGCCGCCTCTCGCCCTCTCTCTTCACTCTCCGGGCAGGTCATTGGTCAGCGTATGAGAAGAACCCAGAAGAGGAAGTGTGCCCGACGGTGGTGCCAGATCATATGATCCACTCCATATCCGAAGACTACTGGGCGCCACATCCCCAAACCGGAGTTTTCGGCCCACCAATAAGTCAACACAATCCTGTTCCTAGCGACACGTCGGCTGGCGGCGCCAAAGAGGGCTCCGTTCTGGAGCTGAAGGTTTGGTTCCGACACATCGAGCTGGAGGACCCCGAGAAGCCACACGACCTGTAA

Protein sequence

MAAASNLIIAARPRLAHLRNPVRRLSPSLFTLRAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDYWAPHPQTGVFGPPISQHNPVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPHDL
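
The protein sequence can be cross-checked against the CDS by translating it. The following is a minimal sketch that assumes the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not say which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# CDS and protein copied from the record above.
CDS = "ATGGCCGCCGCTTCTAACCTTATAATTGCTGCTCGACCACGACTCGCCCATTTGCGGAACCCTGTCCGCCGCCTCTCGCCCTCTCTCTTCACTCTCCGGGCAGGTCATTGGTCAGCGTATGAGAAGAACCCAGAAGAGGAAGTGTGCCCGACGGTGGTGCCAGATCATATGATCCACTCCATATCCGAAGACTACTGGGCGCCACATCCCCAAACCGGAGTTTTCGGCCCACCAATAAGTCAACACAATCCTGTTCCTAGCGACACGTCGGCTGGCGGCGCCAAAGAGGGCTCCGTTCTGGAGCTGAAGGTTTGGTTCCGACACATCGAGCTGGAGGACCCCGAGAAGCCACACGACCTGTAA"
PROTEIN = "MAAASNLIIAARPRLAHLRNPVRRLSPSLFTLRAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDYWAPHPQTGVFGPPISQHNPVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPHDL"

translated = translate(CDS)
print(translated == PROTEIN, len(translated))
```

The CDS starts with ATG and ends with the stop codon TAA, so the translation covers every codon before the stop.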
BLAST of CmaCh02G009970 vs. TrEMBL
Match: A0A0A0KRJ5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G168830 PE=4 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 9.0e-29
Identity = 71/126 (56.35%), Positives = 86/126 (68.25%), Query Frame = 1

Query: 1   MAAASNLIIAAR---PRLAHLRNPVRRLSPSLFTLRAGHWSAYEKNPEEEVCPTVVPDHM 60
           MA +S +I+AA    PRL+ L           F  R+ HWSAYEKN E+++ PT+VP+H+
Sbjct: 1   MACSSKIIVAAAAASPRLSLLLPHTYLFPFPNFGFRSSHWSAYEKNQEDQIWPTMVPNHL 60

Query: 61  I-HSISEDYWAPHPQTGVFGPPISQHN--PVPSDTSAGGAKEGSVLELKVWFRHIELEDP 120
           I   +SEDYW PHP+TGVFGPP + HN   VP+DTSAGG  EGSVL LK WFRH  LED 
Sbjct: 61  ISRDLSEDYWVPHPETGVFGPPKAHHNSSTVPNDTSAGG-NEGSVLNLKAWFRHNGLEDL 120

BLAST of CmaCh02G009970 vs. TrEMBL
Match: F4KFM8_ARATH (Uncharacterized protein OS=Arabidopsis thaliana GN=At5g17165 PE=4 SV=1)

HSP 1 Score: 97.1 bits (240), Expect = 1.6e-17
Identity = 57/119 (47.90%), Positives = 71/119 (59.66%), Query Frame = 1

Query: 2   AAASNLIIAARPRLAHLRNPVRR--LSPSLFTLRAGHWSAYEKNPEEEVCPTVVPDHMIH 61
           A + N+ +  R    H+ N VR   ++  LFT R  H SAY+KN EEE+ P+ VPD MI 
Sbjct: 3   AKSKNIQVVGR----HIVNGVRSRAVAYGLFTSRNDHTSAYDKNVEEELQPSQVPDEMIK 62

Query: 62  SISEDYWAPHPQTGVFGPPISQHNPVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPH 119
             S+ YW+PHPQTGVFGP  S  N    D   GG +E SV+E K WFR   LED +K H
Sbjct: 63  PDSDKYWSPHPQTGVFGPSSSSTN--AKDEFRGG-QEDSVMEEKAWFRPTSLEDLDKTH 114

BLAST of CmaCh02G009970 vs. TrEMBL
Match: A0A078BX60_BRANA (BnaA10g17500D protein OS=Brassica napus GN=BnaA10g17500D PE=4 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 2.1e-17
Identity = 46/96 (47.92%), Positives = 61/96 (63.54%), Query Frame = 1

Query: 23  RRLSPSLFTLRAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDYWAPHPQTGVFGPPISQH 82
           R ++  L   R GH SAY+KN EEE+ P+ VPD +I S S+ YW+PHPQTGVFGP  S  
Sbjct: 26  RAVASDLSVSRNGHTSAYDKNVEEELQPSKVPDELIKSESDKYWSPHPQTGVFGPSSSST 85

Query: 83  NPVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPH 119
             +  +  + G++E S +E K WFR   LED +K H
Sbjct: 86  TDMADEKLSRGSQEDSGMEEKAWFRPTSLEDFDKTH 121

BLAST of CmaCh02G009970 vs. TrEMBL
Match: A9P9U6_POPTR (Putative uncharacterized protein OS=Populus trichocarpa PE=2 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 2.1e-17
Identity = 51/104 (49.04%), Positives = 65/104 (62.50%), Query Frame = 1

Query: 20  NPVRRLSPSLF-TLRAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDYWAPHPQTGVFGPP 79
           N +R  S S   +LR  H S Y+KN ++E+ P VVPD +I + S+ YWAPHP+TGVFGP 
Sbjct: 23  NSIRSFSVSNTPSLRGAHTSVYDKNLDDELQPNVVPDDVIKTQSDKYWAPHPRTGVFGPA 82

Query: 80  ISQH-NPVPSDT-SAGGAKEGSVLELKVWFRHIELEDPEKPHDL 121
             QH + +  D+ S G   +  VLE K WFR   LED EKPH L
Sbjct: 83  TEQHLSEISGDSASVGDGGQDPVLEEKAWFRPTSLEDLEKPHRL 126

BLAST of CmaCh02G009970 vs. TrEMBL
Match: A0A059DKJ6_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02956 PE=4 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 3.6e-17
Identity = 45/85 (52.94%), Positives = 53/85 (62.35%), Query Frame = 1

Query: 33  RAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDYWAPHPQTGVFGPPISQHNPVPSDTSAG 92
           RA H S Y+KNPE++V  TVVPD +I + S+ YWAPHP+TGVFGP    H     +    
Sbjct: 13  RAAHTSVYDKNPEDQVASTVVPDEVIDAPSDKYWAPHPKTGVFGPATDHHLRAHGELPKP 72

Query: 93  GAKEGSVLELKVWFRHIELEDPEKP 118
           G  E SVLE   WFR   LED EKP
Sbjct: 73  GDAENSVLEDTAWFRPTSLEDLEKP 97

BLAST of CmaCh02G009970 vs. TAIR10
Match: AT5G17165.1 (AT5G17165.1 unknown protein)

HSP 1 Score: 97.1 bits (240), Expect = 8.1e-21
Identity = 57/119 (47.90%), Positives = 71/119 (59.66%), Query Frame = 1

Query: 2   AAASNLIIAARPRLAHLRNPVRR--LSPSLFTLRAGHWSAYEKNPEEEVCPTVVPDHMIH 61
           A + N+ +  R    H+ N VR   ++  LFT R  H SAY+KN EEE+ P+ VPD MI 
Sbjct: 3   AKSKNIQVVGR----HIVNGVRSRAVAYGLFTSRNDHTSAYDKNVEEELQPSQVPDEMIK 62

Query: 62  SISEDYWAPHPQTGVFGPPISQHNPVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPH 119
             S+ YW+PHPQTGVFGP  S  N    D   GG +E SV+E K WFR   LED +K H
Sbjct: 63  PDSDKYWSPHPQTGVFGPSSSSTN--AKDEFRGG-QEDSVMEEKAWFRPTSLEDLDKTH 114

BLAST of CmaCh02G009970 vs. TAIR10
Match: AT3G03150.1 (AT3G03150.1 unknown protein)

HSP 1 Score: 94.7 bits (234), Expect = 4.0e-20
Identity = 49/113 (43.36%), Positives = 70/113 (61.95%), Query Frame = 1

Query: 7   LIIAARPRLAHLRNPVRRLSPSLF-TLRAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDY 66
           LI + R  L + R   R  + +LF + R+GH SAY+KN E+E+  + VPD +I   S+ Y
Sbjct: 11  LITSLRKHLVNTRASTRATASALFPSRRSGHSSAYDKNVEDELHASAVPDEVIKPDSDKY 70

Query: 67  WAPHPQTGVFGPPISQHNPVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPH 119
           W+PHP+TGVFGP  ++H    S T+ G  ++ +VLE   WFR   LED +K H
Sbjct: 71  WSPHPKTGVFGPSTTEH----SATAEGAHQDTAVLEETAWFRPTSLEDSDKTH 119

BLAST of CmaCh02G009970 vs. NCBI nr
Match: gi|449451834|ref|XP_004143665.1| (PREDICTED: uncharacterized protein LOC101218227 [Cucumis sativus])

HSP 1 Score: 134.4 bits (337), Expect = 1.3e-28
Identity = 71/126 (56.35%), Positives = 86/126 (68.25%), Query Frame = 1

Query: 1   MAAASNLIIAAR---PRLAHLRNPVRRLSPSLFTLRAGHWSAYEKNPEEEVCPTVVPDHM 60
           MA +S +I+AA    PRL+ L           F  R+ HWSAYEKN E+++ PT+VP+H+
Sbjct: 1   MACSSKIIVAAAAASPRLSLLLPHTYLFPFPNFGFRSSHWSAYEKNQEDQIWPTMVPNHL 60

Query: 61  I-HSISEDYWAPHPQTGVFGPPISQHN--PVPSDTSAGGAKEGSVLELKVWFRHIELEDP 120
           I   +SEDYW PHP+TGVFGPP + HN   VP+DTSAGG  EGSVL LK WFRH  LED 
Sbjct: 61  ISRDLSEDYWVPHPETGVFGPPKAHHNSSTVPNDTSAGG-NEGSVLNLKAWFRHNGLEDL 120

BLAST of CmaCh02G009970 vs. NCBI nr
Match: gi|659073212|ref|XP_008467312.1| (PREDICTED: uncharacterized protein LOC103504694 isoform X2 [Cucumis melo])

HSP 1 Score: 127.9 bits (320), Expect = 1.2e-26
Identity = 67/115 (58.26%), Positives = 82/115 (71.30%), Query Frame = 1

Query: 10  AARPRLAHLRNPVRRLSPSLFT-LRAGHWSAYEKNPEEEVCPTVVPDHMIH-SISEDYWA 69
           AA PRL+ L +    L P   +  RA HWSAYEKN E+ + PT+VP+ +IH ++S++YW 
Sbjct: 27  AAAPRLSLLVSHSPHLFPFANSGFRASHWSAYEKNQEDPIWPTMVPNDVIHHNLSDNYWV 86

Query: 70  PHPQTGVFGPPISQHN--PVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPHDL 121
           PHPQTGVFGPP + HN   +P+DTSAGG  EGSVL LK WFRH  LED EKPH L
Sbjct: 87  PHPQTGVFGPPKAHHNSSTLPNDTSAGG-NEGSVLNLKAWFRHNGLEDLEKPHSL 140

BLAST of CmaCh02G009970 vs. NCBI nr
Match: gi|659073210|ref|XP_008467311.1| (PREDICTED: uncharacterized protein LOC103504694 isoform X1 [Cucumis melo])

HSP 1 Score: 127.9 bits (320), Expect = 1.2e-26
Identity = 67/115 (58.26%), Positives = 82/115 (71.30%), Query Frame = 1

Query: 10  AARPRLAHLRNPVRRLSPSLFT-LRAGHWSAYEKNPEEEVCPTVVPDHMIH-SISEDYWA 69
           AA PRL+ L +    L P   +  RA HWSAYEKN E+ + PT+VP+ +IH ++S++YW 
Sbjct: 37  AAAPRLSLLVSHSPHLFPFANSGFRASHWSAYEKNQEDPIWPTMVPNDVIHHNLSDNYWV 96

Query: 70  PHPQTGVFGPPISQHN--PVPSDTSAGGAKEGSVLELKVWFRHIELEDPEKPHDL 121
           PHPQTGVFGPP + HN   +P+DTSAGG  EGSVL LK WFRH  LED EKPH L
Sbjct: 97  PHPQTGVFGPPKAHHNSSTLPNDTSAGG-NEGSVLNLKAWFRHNGLEDLEKPHSL 150

BLAST of CmaCh02G009970 vs. NCBI nr
Match: gi|1009123783|ref|XP_015878722.1| (PREDICTED: uncharacterized protein LOC107414996 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 102.1 bits (253), Expect = 7.1e-19
Identity = 51/108 (47.22%), Positives = 67/108 (62.04%), Query Frame = 1

Query: 18  LRNPVRRLS---PSLFTLRAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDYWAPHPQTGV 77
           +R+P  + S   P+L   RA H SAY+KNP+E++ P+VVPD +I   S+ YWAPHP+TGV
Sbjct: 23  IRDPTHQASVSSPALSISRAVHNSAYDKNPDEQIRPSVVPDELIQPQSDKYWAPHPKTGV 82

Query: 78  FGPPISQHNPVPSD----TSAGGAKEGSVLELKVWFRHIELEDPEKPH 119
           FGP     +    +     S+    EGSVLE K WFR   +ED EKPH
Sbjct: 83  FGPATESTSAAGGERGLHASSSDGGEGSVLEQKAWFRPTSIEDLEKPH 130

BLAST of CmaCh02G009970 vs. NCBI nr
Match: gi|1009123785|ref|XP_015878723.1| (PREDICTED: uncharacterized protein LOC107414996 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 101.3 bits (251), Expect = 1.2e-18
Identity = 50/107 (46.73%), Positives = 67/107 (62.62%), Query Frame = 1

Query: 18  LRNPVRRLSPS--LFTLRAGHWSAYEKNPEEEVCPTVVPDHMIHSISEDYWAPHPQTGVF 77
           +R+P  + S S    ++RA H SAY+KNP+E++ P+VVPD +I   S+ YWAPHP+TGVF
Sbjct: 23  IRDPTHQASVSSPALSIRAVHNSAYDKNPDEQIRPSVVPDELIQPQSDKYWAPHPKTGVF 82

Query: 78  GPPISQHNPVPSD----TSAGGAKEGSVLELKVWFRHIELEDPEKPH 119
           GP     +    +     S+    EGSVLE K WFR   +ED EKPH
Sbjct: 83  GPATESTSAAGGERGLHASSSDGGEGSVLEQKAWFRPTSIEDLEKPH 129
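
The percentages on each HSP statistics line are the match counts divided by the alignment length. As a hedged sketch of that arithmetic, the snippet below parses one such line and recomputes both values; `hsp_percentages` is a hypothetical helper, and the pattern deliberately accepts both the "Positives" and "Postives" spellings that appear in BLAST report exports.

```python
import re

# Matches 'Identity = a/b ... Positives = c/d'; 'Posi?tives' also accepts
# the 'Postives' spelling found in some exported reports.
HSP_STATS = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

# Statistics line of the Cucumis sativus hit listed above.
line = "Identity = 71/126 (56.35%), Positives = 86/126 (68.25%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For the Cucumis sativus hit this reproduces the reported values: 71 identical and 86 similar residues over 126 alignment columns.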

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name           E-value    Identity (%)   Description
A0A0A0KRJ5_CUCSA     9.0e-29    56.35          Uncharacterized protein OS=Cucumis sativus GN=Csa_5G168830 PE=4 SV=1
F4KFM8_ARATH         1.6e-17    47.90          Uncharacterized protein OS=Arabidopsis thaliana GN=At5g17165 PE=4 SV=1
A0A078BX60_BRANA     2.1e-17    47.92          BnaA10g17500D protein OS=Brassica napus GN=BnaA10g17500D PE=4 SV=1
A9P9U6_POPTR         2.1e-17    49.04          Putative uncharacterized protein OS=Populus trichocarpa PE=2 SV=1
A0A059DKJ6_EUCGR     3.6e-17    52.94          Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02956 PE=4 SV=1

vs. TAIR10
Match Name           E-value    Identity (%)   Description
AT5G17165.1          8.1e-21    47.90          unknown protein
AT3G03150.1          4.0e-20    43.36          unknown protein

vs. NCBI nr
Match Name                            E-value    Identity (%)   Description
gi|449451834|ref|XP_004143665.1|      1.3e-28    56.35          PREDICTED: uncharacterized protein LOC101218227 [Cucumis sativus]
gi|659073212|ref|XP_008467312.1|      1.2e-26    58.26          PREDICTED: uncharacterized protein LOC103504694 isoform X2 [Cucumis melo]
gi|659073210|ref|XP_008467311.1|      1.2e-26    58.26          PREDICTED: uncharacterized protein LOC103504694 isoform X1 [Cucumis melo]
gi|1009123783|ref|XP_015878722.1|     7.1e-19    47.22          PREDICTED: uncharacterized protein LOC107414996 isoform X1 [Ziziphus jujuba]
gi|1009123785|ref|XP_015878723.1|     1.2e-18    46.73          PREDICTED: uncharacterized protein LOC107414996 isoform X2 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh02G009970.1    CmaCh02G009970.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description    Source    Source Term      Source Description     Alignment
None       No IPR available   PANTHER   PTHR35122        FAMILY NOT NAMED       coord: 5..120, score: 5.6
None       No IPR available   PANTHER   PTHR35122:SF2    SUBFAMILY NOT NAMED    coord: 5..120, score: 5.6

The following gene(s) are paralogous to this gene:

None