CmaCh20G008640 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G008640
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: DUF4228 domain protein
Location: Cma_Chr20: 4152444..4152848 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGGTTTGTGCCTCATCACAGCACTCAAGCCCTAGCCTCACGACCTGGCATTTCACAGCGAAAATCATTCATATAGATGGCAGGCTACAAGAATTGAGGCACCCAGTCAAGGCCAGCCACATTCTCAATCAAAACCCTAATTGTTTCCTCTGTAGTTCAGAATCTATGAAGATCAATTCAATCGTTCCACAAATTTCCAGCGACAGAGAGCTCGAATTGGGGGAAATTTACTTTCTAATCCCGCTTGCTAAATCCGATCTGCCGATTTCTCTCACAATCTTGTGTGCTTTGGCTGCCAAAGCAAATGTAGCCCTCACCAGCTCCAAGAAGGCGTATCCGTCCATGAAAGCGGCGCCGGCCGCCGTAGGGTACCGCATCAGAGCACCAGGGACGTCGTACTGA

mRNA sequence

ATGGGGGTTTGTGCCTCATCACAGCACTCAAGCCCTAGCCTCACGACCTGGCATTTCACAGCGAAAATCATTCATATAGATGGCAGGCTACAAGAATTGAGGCACCCAGTCAAGGCCAGCCACATTCTCAATCAAAACCCTAATTGTTTCCTCTGTAGTTCAGAATCTATGAAGATCAATTCAATCGTTCCACAAATTTCCAGCGACAGAGAGCTCGAATTGGGGGAAATTTACTTTCTAATCCCGCTTGCTAAATCCGATCTGCCGATTTCTCTCACAATCTTGTGTGCTTTGGCTGCCAAAGCAAATGTAGCCCTCACCAGCTCCAAGAAGGCGTATCCGTCCATGAAAGCGGCGCCGGCCGCCGTAGGGTACCGCATCAGAGCACCAGGGACGTCGTACTGA

Coding sequence (CDS)

ATGGGGGTTTGTGCCTCATCACAGCACTCAAGCCCTAGCCTCACGACCTGGCATTTCACAGCGAAAATCATTCATATAGATGGCAGGCTACAAGAATTGAGGCACCCAGTCAAGGCCAGCCACATTCTCAATCAAAACCCTAATTGTTTCCTCTGTAGTTCAGAATCTATGAAGATCAATTCAATCGTTCCACAAATTTCCAGCGACAGAGAGCTCGAATTGGGGGAAATTTACTTTCTAATCCCGCTTGCTAAATCCGATCTGCCGATTTCTCTCACAATCTTGTGTGCTTTGGCTGCCAAAGCAAATGTAGCCCTCACCAGCTCCAAGAAGGCGTATCCGTCCATGAAAGCGGCGCCGGCCGCCGTAGGGTACCGCATCAGAGCACCAGGGACGTCGTACTGA

Protein sequence

MGVCASSQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKKAYPSMKAAPAAVGYRIRAPGTSY
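
As a quick consistency check on the location and sequences listed above, here is a minimal sketch in Python. The helper names `span_length` and `translate` are hypothetical (they are not part of the database), and the sketch assumes the standard nuclear genetic code and a CDS given 5' to 3' on the + strand.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def span_length(start, end):
    """Length in bp of a 1-based, inclusive interval such as 4152444..4152848."""
    return end - start + 1


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


length = span_length(4152444, 4152848)
print(length, length // 3 - 1)          # span in bp, residues excluding the stop codon
print(translate("ATGGGGGTTTGTGCCTCA"))  # first six codons of the CDS above
```

Passing the full coding sequence to `translate` and comparing the result with the listed protein sequence is a simple way to confirm that the gene, mRNA, and CDS records agree for this single-exon feature.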
BLAST of CmaCh20G008640 vs. TrEMBL
Match: A0A0A0LIX5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G239380 PE=4 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 3.6e-50
Identity = 110/133 (82.71%), Positives = 114/133 (85.71%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMKIN 60
           MGVCASSQHS+ SLT W FTAKIIH DGRLQELRHPVKASHILNQNPNCFLCSSESMKI 
Sbjct: 47  MGVCASSQHSNASLTNWPFTAKIIHTDGRLQELRHPVKASHILNQNPNCFLCSSESMKIG 106

Query: 61  SIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKKAYPSMKAAP 120
           SIVPQISSDRELELGEIYFLIPL KS LPISLT LC+LAAKANVAL SSKKA+PS+KA  
Sbjct: 107 SIVPQISSDRELELGEIYFLIPLKKSHLPISLTDLCSLAAKANVALASSKKAHPSLKAVG 166

Query: 121 A---AVGYRIRAP 131
           A    VG R   P
Sbjct: 167 AESERVGRRTEIP 179
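
The percentages in each HSP statistics line are the matched counts divided by the alignment length. A minimal sketch in Python that recomputes them from such a line is shown here; the function name `hsp_percentages` is hypothetical, and the pattern deliberately tolerates spelling variants of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 110/133 (82.71%), Positives = 114/133 (85.71%)".
# "Posi?tives" also accepts the label spelled without the "i".
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


line = "Identity = 110/133 (82.71%), Positives = 114/133 (85.71%), Query Frame = 1"
print(hsp_percentages(line))
```

The denominator is the alignment length including gap columns, which is why it can differ from the query or subject length.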

BLAST of CmaCh20G008640 vs. TrEMBL
Match: F6HQJ2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g01950 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 1.2e-26
Identity = 67/124 (54.03%), Positives = 87/124 (70.16%), Query Frame = 1

Query: 1   MGVCASSQHSSPS--LTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMK 60
           MG+C+SS   S +    +W  TAKIIH+DG+LQE  HP++A  IL+QNPNCFLCSSESM 
Sbjct: 1   MGICSSSHLMSKNGRCLSWPSTAKIIHLDGKLQEFLHPIQAGLILSQNPNCFLCSSESMF 60

Query: 61  INSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKKAYPSMKA 120
           INS  PQ+    EL+LG+IYFL+PL+KS  P+SL  LC LA KA+ A+     A+ ++K 
Sbjct: 61  INSHAPQVPDKEELQLGQIYFLMPLSKSRSPLSLQDLCILAVKASAAIAHPNTAHLAIKT 120

Query: 121 APAA 123
             AA
Sbjct: 121 GRAA 124

BLAST of CmaCh20G008640 vs. TrEMBL
Match: B9HRT2_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s11380g PE=4 SV=1)

HSP 1 Score: 126.3 bits (316), Expect = 2.7e-26
Identity = 64/114 (56.14%), Positives = 83/114 (72.81%), Query Frame = 1

Query: 1   MGVCASSQHSSP-----SLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSE 60
           MG CAS Q++           W  TAKIIH+DGRLQE R P+KASHIL+ NP  FLCSSE
Sbjct: 1   MGNCASPQYTKKVGGGGGGLNWPSTAKIIHVDGRLQEFRQPIKASHILSLNPKSFLCSSE 60

Query: 61  SMKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSS 110
           SM I+  +PQ+  D EL+LG++YFL+PL+KS++P+SL  LCALA+KA+ +L  S
Sbjct: 61  SMYIDCHLPQVPDDEELQLGQLYFLVPLSKSNVPLSLQELCALASKASASLAQS 114

BLAST of CmaCh20G008640 vs. TrEMBL
Match: W9RBP1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_026971 PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 5.2e-25
Identity = 60/115 (52.17%), Positives = 85/115 (73.91%), Query Frame = 1

Query: 1   MGVCASSQHSSP----SLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSES 60
           MG+CAS Q+++     SL  W    +IIH +G+LQE  HPVK+  +L+QNPNCFLCSSES
Sbjct: 1   MGICASLQYTTNKARMSLINWQNATQIIHFEGKLQEYIHPVKSGQVLSQNPNCFLCSSES 60

Query: 61  MKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKK 112
           M + + VPQ+ ++ EL+ G+IYFL+PL+++  P+SL  LCALA KA+ AL++S K
Sbjct: 61  MFVGAHVPQLPTNEELQPGQIYFLLPLSQAKTPLSLQDLCALAIKASSALSASNK 115

BLAST of CmaCh20G008640 vs. TrEMBL
Match: B9RZE0_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0938490 PE=4 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 8.8e-25
Identity = 63/121 (52.07%), Positives = 84/121 (69.42%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHF----TAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSES 60
           MG CAS Q++       ++    TAKI+  DGRLQE + P+KA+++L+QNPNCFLCSSES
Sbjct: 1   MGNCASPQYTKKGALALNYYRQSTAKIVDRDGRLQEFKQPIKANYVLSQNPNCFLCSSES 60

Query: 61  MKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKKAYPSM 118
           M +NS V  +  D EL++G+IYFL+PL+KS + +SL  LCALA KA+ AL  S   Y S 
Sbjct: 61  MYVNSPVSPVPDDEELQVGQIYFLMPLSKSHVLLSLQELCALAIKASAALAQSDPEYSSA 120

BLAST of CmaCh20G008640 vs. TAIR10
Match: AT2G23690.1 (AT2G23690.1 unknown protein)

HSP 1 Score: 83.2 bits (204), Expect = 1.3e-16
Identity = 42/109 (38.53%), Positives = 67/109 (61.47%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMKIN 60
           MG+C+S + +  +      TAK+I  DGR+ E   PVK  ++L +NP CF+C+S+ M  +
Sbjct: 1   MGICSSYESTQVA------TAKLILHDGRMMEFTSPVKVGYVLQKNPMCFICNSDDMDFD 60

Query: 61  SIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSS 110
           ++V  IS+D E +LG++YF +PL+     +    + ALA KA+ AL  S
Sbjct: 61  NVVSAISADEEFQLGQLYFALPLSSLHHSLKAEEMAALAVKASSALMRS 103

BLAST of CmaCh20G008640 vs. TAIR10
Match: AT4G37240.1 (AT4G37240.1 unknown protein)

HSP 1 Score: 83.2 bits (204), Expect = 1.3e-16
Identity = 44/106 (41.51%), Positives = 66/106 (62.26%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMKIN 60
           MG+C+SS+ +  +      TAK+I  DGR+ E  +PVK  ++L + P CF+C+S+ M  +
Sbjct: 1   MGICSSSESTQVA------TAKLILQDGRMMEFANPVKVGYVLLKYPMCFICNSDDMDFD 60

Query: 61  SIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVAL 107
             V  IS+D EL+LG+IYF +PL     P+    + ALA KA+ AL
Sbjct: 61  DAVAAISADEELQLGQIYFALPLCWLRQPLKAEEMAALAVKASSAL 100

BLAST of CmaCh20G008640 vs. TAIR10
Match: AT5G66580.1 (AT5G66580.1 unknown protein)

HSP 1 Score: 75.9 bits (185), Expect = 2.2e-14
Identity = 42/109 (38.53%), Positives = 62/109 (56.88%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMKIN 60
           MG CAS +           +AK+I +DG LQE   PVK   IL +NP  F+C+S+ M  +
Sbjct: 1   MGACASRESLRSD------SAKLILLDGTLQEFSSPVKVWQILQKNPTSFVCNSDEMDFD 60

Query: 61  SIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSS 110
             V  ++ + EL  G++YF++PL   + P+    + ALA KA+ ALT S
Sbjct: 61  DAVSAVAGNEELRSGQLYFVLPLTWLNHPLRAEEMAALAVKASSALTKS 103

BLAST of CmaCh20G008640 vs. TAIR10
Match: AT3G50800.1 (AT3G50800.1 unknown protein)

HSP 1 Score: 71.2 bits (173), Expect = 5.3e-13
Identity = 42/109 (38.53%), Positives = 58/109 (53.21%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMKIN 60
           MG CAS +           TAK+I  DG LQE   PVK   IL +NP  F+C+S+ M  +
Sbjct: 1   MGACASRESRRTE------TAKLILPDGTLQEFSTPVKVWQILQKNPTSFVCNSDDMDFD 60

Query: 61  SIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSS 110
             V  +    +L  GE+YF++PL   + P+    + ALA KA+ AL  S
Sbjct: 61  DAVLAVPGSEDLRPGELYFVLPLTWLNHPLRADEMAALAVKASSALAKS 103

BLAST of CmaCh20G008640 vs. TAIR10
Match: AT1G76600.1 (AT1G76600.1 unknown protein)

HSP 1 Score: 65.9 bits (159), Expect = 2.2e-11
Identity = 42/117 (35.90%), Positives = 70/117 (59.83%), Query Frame = 1

Query: 1   MGVCAS-SQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHIL----------NQNPNC 60
           MG+C S +++   S +T   TAKI+ I+G L+E   PV AS +L          + + + 
Sbjct: 1   MGLCVSVNRNEYVSSST---TAKIVTINGDLREYDVPVLASQVLESESTSSSSSSSSSSY 60

Query: 61  FLCSSESMKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVAL 107
           FLC+S+S+  +  +P I SD  L+  +IYF++P++K    +S + + ALA KA+VA+
Sbjct: 61  FLCNSDSLYYDDFIPAIESDEILQANQIYFVLPISKRQYRLSASDMAALAVKASVAI 114

BLAST of CmaCh20G008640 vs. NCBI nr
Match: gi|778669274|ref|XP_011649227.1| (PREDICTED: uncharacterized protein LOC101205353 [Cucumis sativus])

HSP 1 Score: 205.7 bits (522), Expect = 5.1e-50
Identity = 110/133 (82.71%), Positives = 114/133 (85.71%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMKIN 60
           MGVCASSQHS+ SLT W FTAKIIH DGRLQELRHPVKASHILNQNPNCFLCSSESMKI 
Sbjct: 47  MGVCASSQHSNASLTNWPFTAKIIHTDGRLQELRHPVKASHILNQNPNCFLCSSESMKIG 106

Query: 61  SIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKKAYPSMKAAP 120
           SIVPQISSDRELELGEIYFLIPL KS LPISLT LC+LAAKANVAL SSKKA+PS+KA  
Sbjct: 107 SIVPQISSDRELELGEIYFLIPLKKSHLPISLTDLCSLAAKANVALASSKKAHPSLKAVG 166

Query: 121 A---AVGYRIRAP 131
           A    VG R   P
Sbjct: 167 AESERVGRRTEIP 179

BLAST of CmaCh20G008640 vs. NCBI nr
Match: gi|359475255|ref|XP_003631624.1| (PREDICTED: uncharacterized protein LOC100855234 [Vitis vinifera])

HSP 1 Score: 127.5 bits (319), Expect = 1.8e-26
Identity = 67/124 (54.03%), Positives = 87/124 (70.16%), Query Frame = 1

Query: 1   MGVCASSQHSSPS--LTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSESMK 60
           MG+C+SS   S +    +W  TAKIIH+DG+LQE  HP++A  IL+QNPNCFLCSSESM 
Sbjct: 1   MGICSSSHLMSKNGRCLSWPSTAKIIHLDGKLQEFLHPIQAGLILSQNPNCFLCSSESMF 60

Query: 61  INSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKKAYPSMKA 120
           INS  PQ+    EL+LG+IYFL+PL+KS  P+SL  LC LA KA+ A+     A+ ++K 
Sbjct: 61  INSHAPQVPDKEELQLGQIYFLMPLSKSRSPLSLQDLCILAVKASAAIAHPNTAHLAIKT 120

Query: 121 APAA 123
             AA
Sbjct: 121 GRAA 124

BLAST of CmaCh20G008640 vs. NCBI nr
Match: gi|224105459|ref|XP_002313818.1| (hypothetical protein POPTR_0009s11380g [Populus trichocarpa])

HSP 1 Score: 126.3 bits (316), Expect = 3.9e-26
Identity = 64/114 (56.14%), Positives = 83/114 (72.81%), Query Frame = 1

Query: 1   MGVCASSQHSSP-----SLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSE 60
           MG CAS Q++           W  TAKIIH+DGRLQE R P+KASHIL+ NP  FLCSSE
Sbjct: 1   MGNCASPQYTKKVGGGGGGLNWPSTAKIIHVDGRLQEFRQPIKASHILSLNPKSFLCSSE 60

Query: 61  SMKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSS 110
           SM I+  +PQ+  D EL+LG++YFL+PL+KS++P+SL  LCALA+KA+ +L  S
Sbjct: 61  SMYIDCHLPQVPDDEELQLGQLYFLVPLSKSNVPLSLQELCALASKASASLAQS 114

BLAST of CmaCh20G008640 vs. NCBI nr
Match: gi|703097883|ref|XP_010096234.1| (hypothetical protein L484_026971 [Morus notabilis])

HSP 1 Score: 122.1 bits (305), Expect = 7.4e-25
Identity = 60/115 (52.17%), Positives = 85/115 (73.91%), Query Frame = 1

Query: 1   MGVCASSQHSSP----SLTTWHFTAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSES 60
           MG+CAS Q+++     SL  W    +IIH +G+LQE  HPVK+  +L+QNPNCFLCSSES
Sbjct: 1   MGICASLQYTTNKARMSLINWQNATQIIHFEGKLQEYIHPVKSGQVLSQNPNCFLCSSES 60

Query: 61  MKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKK 112
           M + + VPQ+ ++ EL+ G+IYFL+PL+++  P+SL  LCALA KA+ AL++S K
Sbjct: 61  MFVGAHVPQLPTNEELQPGQIYFLLPLSQAKTPLSLQDLCALAIKASSALSASNK 115

BLAST of CmaCh20G008640 vs. NCBI nr
Match: gi|223541772|gb|EEF43320.1| (conserved hypothetical protein [Ricinus communis])

HSP 1 Score: 121.3 bits (303), Expect = 1.3e-24
Identity = 63/121 (52.07%), Positives = 84/121 (69.42%), Query Frame = 1

Query: 1   MGVCASSQHSSPSLTTWHF----TAKIIHIDGRLQELRHPVKASHILNQNPNCFLCSSES 60
           MG CAS Q++       ++    TAKI+  DGRLQE + P+KA+++L+QNPNCFLCSSES
Sbjct: 1   MGNCASPQYTKKGALALNYYRQSTAKIVDRDGRLQEFKQPIKANYVLSQNPNCFLCSSES 60

Query: 61  MKINSIVPQISSDRELELGEIYFLIPLAKSDLPISLTILCALAAKANVALTSSKKAYPSM 118
           M +NS V  +  D EL++G+IYFL+PL+KS + +SL  LCALA KA+ AL  S   Y S 
Sbjct: 61  MYVNSPVSPVPDDEELQVGQIYFLMPLSKSHVLLSLQELCALAIKASAALAQSDPEYSSA 120

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0LIX5_CUCSA  3.6e-50  82.71         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G239380 PE=4 SV=1
F6HQJ2_VITVI      1.2e-26  54.03         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g01950 PE=4 SV=...
B9HRT2_POPTR      2.7e-26  56.14         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s11380g PE=4 SV=1
W9RBP1_9ROSA      5.2e-25  52.17         Uncharacterized protein OS=Morus notabilis GN=L484_026971 PE=4 SV=1
B9RZE0_RICCO      8.8e-25  52.07         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0938490 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT2G23690.1  1.3e-16  38.53         unknown protein
AT4G37240.1  1.3e-16  41.51         unknown protein
AT5G66580.1  2.2e-14  38.53         unknown protein
AT3G50800.1  5.3e-13  38.53         unknown protein
AT1G76600.1  2.2e-11  35.90         unknown protein

vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|778669274|ref|XP_011649227.1|  5.1e-50  82.71         PREDICTED: uncharacterized protein LOC101205353 [Cucumis sativus]
gi|359475255|ref|XP_003631624.1|  1.8e-26  54.03         PREDICTED: uncharacterized protein LOC100855234 [Vitis vinifera]
gi|224105459|ref|XP_002313818.1|  3.9e-26  56.14         hypothetical protein POPTR_0009s11380g [Populus trichocarpa]
gi|703097883|ref|XP_010096234.1|  7.4e-25  52.17         hypothetical protein L484_026971 [Morus notabilis]
gi|223541772|gb|EEF43320.1|       1.3e-24  52.07         conserved hypothetical protein [Ricinus communis]

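
Hit summaries like these are often filtered by E-value and percent identity before further analysis. The following Python sketch illustrates one way to do that; the function name `filter_hits` and both thresholds are arbitrary assumptions chosen for illustration, and the tuples copy a few rows from the results above.

```python
# (match name, E-value, identity %) for a few of the hits listed above.
hits = [
    ("A0A0A0LIX5_CUCSA", 3.6e-50, 82.71),
    ("F6HQJ2_VITVI", 1.2e-26, 54.03),
    ("AT2G23690.1", 1.3e-16, 38.53),
    ("AT1G76600.1", 2.2e-11, 35.90),
]


def filter_hits(hits, max_evalue=1e-20, min_identity=50.0):
    """Keep hits passing both cutoffs, best (smallest) E-value first.

    The default cutoffs are illustrative assumptions, not recommended values.
    """
    kept = [h for h in hits if h[1] <= max_evalue and h[2] >= min_identity]
    return sorted(kept, key=lambda h: h[1])


for name, evalue, identity in filter_hits(hits):
    print(name, evalue, identity)
```

With these example cutoffs only the Cucumis sativus and Vitis vinifera hits remain; loosening `max_evalue` and `min_identity` would bring the Arabidopsis matches back in.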
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR025322  DUF4228_plant
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G008640.1  CmaCh20G008640.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                             Source   Source Term     Source Description   Alignment
IPR025322  Protein of unknown function DUF4228, plant  PFAM     PF14009         DUF4228              coord: 1..110; score: 8.2
None       No IPR available                            PANTHER  PTHR33052       FAMILY NOT NAMED     coord: 4..117; score: 1.4
None       No IPR available                            PANTHER  PTHR33052:SF21  SUBFAMILY NOT NAMED  coord: 4..117; score: 1.4

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh20G008640  CmaCh02G004170  Cucurbita maxima (Rimu)  cmacmaB469