CmaCh05G006240 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh05G006240
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Descriptionprotein ENL-like
LocationCma_Chr05: 3135933 .. 3137248 (+)
RNA-Seq ExpressionCmaCh05G006240
SyntenyCmaCh05G006240
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCTGATGGTATTAATTTCCATAATTTGGATAAAACTGGTCCGAAAATTTGCCTTTTTTAAAATACTACTAATAATAATAATAATAATAATCAGCAGCTTCCTTTTTTTGCAGTTCTTCCTTTAATGGAACGCATTTTCCCTTCCAGTTTCATACCTTTCTCTCTCTGTAAATGGCAGCACTATCTCTTCAACCAACGTCTCACGACAACGAAGAAGGAATCTCCGCTACTTCCACTTCCGTTTGATTCTCAATCTTCACGATCATGGAGGTTCTCTTCGGTCCTCCCTCTTTCACCATTCCGGACGCCTCTTCTTCCTCTTTCGTCAGAGACCGAACCGCCGTATCTCCGCCGCCGCAGCACCGTGTTGATTCGCCTTGGAACTCTATCAAATCGGATTTCGGACGCTTCGCGTCCGGTAAAGGTCCGGAGGATGAAATCGATTACCGTTCCGATTCCTCGTCGTCGATTGGAGTTCCTGATGATGATAGCGATGAGGAGAGCGTTTCGTCCACTGGCGGAGATCGCGAGGAAGTTCAGAGTAAATTAAATGCAGGATTGAACTCTGTTGGATCGTTGGAAAGGTCTCTTCCAATTAAGTAAGTTAGTTTTCCTTTTCCTTCTCTTCCTCTGTTGTTTGATCGATTGATTGAAGGAAATAAACTCAATCACTCGATTCGATTTAAAGAAATTTATCCTGCTCAATTTGCTGGCCTCAATTTGAGATTTGTTGAAATTTTAAACCAGGAGAGGCTTATCGAGTCATTTTTCTGGAAAATCGAAATCGTTTGCGAATCTAGCAGAGGCGAAATCAGTAAAAGACATTGAGAAACCTGAAAACTCATTCAACAAGAGGAGGCGATTTTTTATTGCGTCGAAATTGGCCAGGAAAACTTCCTTCTACACCTGGCCAAACCCTAAGTCGATGCCTCTGTTAGCGCTCAGAGAAGAAGAACACCACGACGACGGCGACGGCGACAAAGATGAATCACTTGCTCCATATTCTTCAGAAGATACTGAAGATGAAAAAGAAGAACCGAAGGAAAAAAGATTTTCAGATTTTAATCACAGAAGGTTGATGAGCTTTAAGTCTAGAAGCTTCTCTCTGGCGGATCTGCAACAGGAGCACCATGATATCGATCGACAAGAAGAACAATAATCCATAGCCTTACCCTCTCTCTTTCTCTCTCTCTATTTTTCCTTGTTTTGTTTATTTGTTTCTTCCCTTTCTTTTCCCCTTGATTTTATTTTCTCGTCGTATATGTATTGTATCCATTATACATAAAATAGTATTTTTTTTTTTTTAGTTAAATTAAG

mRNA sequence

GCTGATGGTATTAATTTCCATAATTTGGATAAAACTGGTCCGAAAATTTGCCTTTTTTAAAATACTACTAATAATAATAATAATAATAATCAGCAGCTTCCTTTTTTTGCAGTTCTTCCTTTAATGGAACGCATTTTCCCTTCCAGTTTCATACCTTTCTCTCTCTGTAAATGGCAGCACTATCTCTTCAACCAACGTCTCACGACAACGAAGAAGGAATCTCCGCTACTTCCACTTCCGTTTGATTCTCAATCTTCACGATCATGGAGGTTCTCTTCGGTCCTCCCTCTTTCACCATTCCGGACGCCTCTTCTTCCTCTTTCGTCAGAGACCGAACCGCCGTATCTCCGCCGCCGCAGCACCGTGTTGATTCGCCTTGGAACTCTATCAAATCGGATTTCGGACGCTTCGCGTCCGGTAAAGGTCCGGAGGATGAAATCGATTACCGTTCCGATTCCTCGTCGTCGATTGGAGTTCCTGATGATGATAGCGATGAGGAGAGCGTTTCGTCCACTGGCGGAGATCGCGAGGAAGTTCAGAGTAAATTAAATGCAGGATTGAACTCTGTTGGATCGTTGGAAAGGTCTCTTCCAATTAAGAGAGGCTTATCGAGTCATTTTTCTGGAAAATCGAAATCGTTTGCGAATCTAGCAGAGGCGAAATCAGTAAAAGACATTGAGAAACCTGAAAACTCATTCAACAAGAGGAGGCGATTTTTTATTGCGTCGAAATTGGCCAGGAAAACTTCCTTCTACACCTGGCCAAACCCTAAGTCGATGCCTCTGTTAGCGCTCAGAGAAGAAGAACACCACGACGACGGCGACGGCGACAAAGATGAATCACTTGCTCCATATTCTTCAGAAGATACTGAAGATGAAAAAGAAGAACCGAAGGAAAAAAGATTTTCAGATTTTAATCACAGAAGGTTGATGAGCTTTAAGTCTAGAAGCTTCTCTCTGGCGGATCTGCAACAGGAGCACCATGATATCGATCGACAAGAAGAACAATAATCCATAGCCTTACCCTCTCTCTTTCTCTCTCTCTATTTTTCCTTGTTTTGTTTATTTGTTTCTTCCCTTTCTTTTCCCCTTGATTTTATTTTCTCGTCGTATATGTATTGTATCCATTATACATAAAATAGTATTTTTTTTTTTTTAGTTAAATTAAG

Coding sequence (CDS)

ATGGAGGTTCTCTTCGGTCCTCCCTCTTTCACCATTCCGGACGCCTCTTCTTCCTCTTTCGTCAGAGACCGAACCGCCGTATCTCCGCCGCCGCAGCACCGTGTTGATTCGCCTTGGAACTCTATCAAATCGGATTTCGGACGCTTCGCGTCCGGTAAAGGTCCGGAGGATGAAATCGATTACCGTTCCGATTCCTCGTCGTCGATTGGAGTTCCTGATGATGATAGCGATGAGGAGAGCGTTTCGTCCACTGGCGGAGATCGCGAGGAAGTTCAGAGTAAATTAAATGCAGGATTGAACTCTGTTGGATCGTTGGAAAGGTCTCTTCCAATTAAGAGAGGCTTATCGAGTCATTTTTCTGGAAAATCGAAATCGTTTGCGAATCTAGCAGAGGCGAAATCAGTAAAAGACATTGAGAAACCTGAAAACTCATTCAACAAGAGGAGGCGATTTTTTATTGCGTCGAAATTGGCCAGGAAAACTTCCTTCTACACCTGGCCAAACCCTAAGTCGATGCCTCTGTTAGCGCTCAGAGAAGAAGAACACCACGACGACGGCGACGGCGACAAAGATGAATCACTTGCTCCATATTCTTCAGAAGATACTGAAGATGAAAAAGAAGAACCGAAGGAAAAAAGATTTTCAGATTTTAATCACAGAAGGTTGATGAGCTTTAAGTCTAGAAGCTTCTCTCTGGCGGATCTGCAACAGGAGCACCATGATATCGATCGACAAGAAGAACAATAA

Protein sequence

MEVLFGPPSFTIPDASSSSFVRDRTAVSPPPQHRVDSPWNSIKSDFGRFASGKGPEDEIDYRSDSSSSIGVPDDDSDEESVSSTGGDREEVQSKLNAGLNSVGSLERSLPIKRGLSSHFSGKSKSFANLAEAKSVKDIEKPENSFNKRRRFFIASKLARKTSFYTWPNPKSMPLLALREEEHHDDGDGDKDESLAPYSSEDTEDEKEEPKEKRFSDFNHRRLMSFKSRSFSLADLQQEHHDIDRQEEQ
Homology
BLAST of CmaCh05G006240 vs. TAIR 10
Match: AT5G24890.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24550.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 125.9 bits (315), Expect = 4.4e-29
Identity = 90/189 (47.62%), Postives = 117/189 (61.90%), Query Frame = 0

Query: 60  DYRSDSSSSIGVPDDDSDEESVSSTGGDREEVQSKLNAGLNSVGSLERSLPIKRGLSSHF 119
           DY SD SSSIG P D  ++E  S    D    +     GL S+ SLE SLP KRGLS+H+
Sbjct: 57  DYSSD-SSSIGTPGDSEEDEEESENENDDVSSKELGLRGLASMSSLEDSLPSKRGLSNHY 116

Query: 120 SGKSKSFANLAEAKSVKDIEKPENSFNKRRRFFIASKLARKTSFYTWPNPKSMPLLALRE 179
            GKSKSF NL E  SVK++ K EN  NKRRR  I +KLARK SFY+W NPKSMPLL + E
Sbjct: 117 KGKSKSFGNLGEIGSVKEVAKQENPLNKRRRLQICNKLARK-SFYSWQNPKSMPLLPVNE 176

Query: 180 EEHHDDGDGDKDESLAPYSSEDTEDEKEEPKE--KRFSDFNHRRLMSFKSRS-FSLADLQ 239
           +E  DD D D+++  + +    +  ++E  K+   R   F +R   ++KSRS F+L+DL 
Sbjct: 177 DEDDDDEDDDEEDLKSGFDENKSSSDEEGVKKVVVRKGSFKNR---AYKSRSCFALSDLI 236

Query: 240 QEHHDIDRQ 246
           +E  D D Q
Sbjct: 237 EEEDDDDDQ 240

BLAST of CmaCh05G006240 vs. TAIR 10
Match: AT4G31510.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24550.1); Has 205 Blast hits to 205 proteins in 31 species: Archae - 0; Bacteria - 0; Metazoa - 5; Fungi - 3; Plants - 187; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 104.0 bits (258), Expect = 1.8e-22
Identity = 90/227 (39.65%), Postives = 121/227 (53.30%), Query Frame = 0

Query: 19  SFVRDRTAVSPPPQHRVDSPWNSIKSDFGRFASGKGPEDEIDYRSDSSSSIGVPDDDSDE 78
           S  RDR++V+   Q    +   S+ S  G    G+ P  E      SSSS+G   ++ ++
Sbjct: 7   STFRDRSSVTTHDQ----AVPASLSSRIGLRRCGRSPPPE------SSSSVGETSENEED 66

Query: 79  ESVSSTGGDREEVQSKLNAGLNSV-GSLERSLPIKRGLSSHFSGKSKSFANLAEAKSVKD 138
           E         + V S     LNS   SLE SLPIKRGLS+H+ GKSKSF NL EA +  D
Sbjct: 67  ED--------DAVSSSQGRWLNSFSSSLEDSLPIKRGLSNHYIGKSKSFGNLMEASNTND 126

Query: 139 IEKPENSFNKRRRFFIASKLARKT-----SFYTWPNPKSMPLLALREEEHHDDGDGDKDE 198
           + K E+  NKRRR  IA+KL R++     S YT  NP SMPLLAL+E ++ D    D D+
Sbjct: 127 LVKVESPLNKRRRLLIANKLRRRSSLSSFSIYTKINPNSMPLLALQESDNEDHKLNDDDD 186

Query: 199 SLAPYSSEDTEDEKEEPKEKRFSDFNHRRLMSFKSRS-FSLADLQQE 239
                    ++DE  + KEKR    NHR  M  +++S FSL   Q +
Sbjct: 187 D----DDSSSDDETSKLKEKRMKMTNHRDFMVPQTKSCFSLTSFQDD 211

BLAST of CmaCh05G006240 vs. TAIR 10
Match: AT2G24550.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31510.1); Has 219 Blast hits to 219 proteins in 33 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 2; Plants - 184; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 102.8 bits (255), Expect = 4.0e-22
Identity = 103/272 (37.87%), Postives = 140/272 (51.47%), Query Frame = 0

Query: 1   MEVLFGPPSFTIPDASSSSFVRDRTAVSPPPQHRVDSPWNSIKSDFGRFAS--------- 60
           MEV+ G  SF I     +++VRD   VS   Q +       +  + GR  S         
Sbjct: 1   MEVMVG-SSFGI---GMAAYVRDHRGVS--AQDKAVQTALFLADESGRGGSQIGIGLRMS 60

Query: 61  ---GKGPEDEIDYRSDSSSSIGVPDDDSDEESVSSTGGDREEVQSKLNAGLNSV-GSLER 120
               K PE+     SDSSSSIG   ++ +EE       + ++  S     L+S   SLE 
Sbjct: 61  NNNNKSPEES----SDSSSSIGESSENEEEE-------EEDDAVSCQRGTLDSFSSSLED 120

Query: 121 SLPIKRGLSSHFSGKSKSFANLAEAKS-VKDIEKPENSFNKRRRFFIASKLARK------ 180
           SLPIKRGLS+H+ GKSKSF NL EA S  KD+EK EN FNKRRR  IA+KL R+      
Sbjct: 121 SLPIKRGLSNHYVGKSKSFGNLMEAASKAKDLEKVENPFNKRRRLVIANKLRRRGRSMSA 180

Query: 181 TSFYTWPNPKSMPLLALR---EEEHHDDGDGDKDESLAPYSSEDTEDEKEEPKEKRFSDF 240
           ++FY+W NP SMPLLAL+   EE+HH   D            ED + + ++ ++      
Sbjct: 181 SNFYSWQNPNSMPLLALQEPNEEDHHIHND----------DYEDDDGDGDDHRKIMMMMK 240

Query: 241 NHRRLMSFKSRSFSLADLQQEHH-DIDRQEEQ 249
           N + LM+     F L+ LQ+E   D D  E++
Sbjct: 241 NKKELMAQTRSCFCLSSLQEEDDGDGDDDEDE 245

BLAST of CmaCh05G006240 vs. TAIR 10
Match: AT3G43850.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: vacuole; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G21940.1); Has 215 Blast hits to 215 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 67.4 bits (163), Expect = 1.8e-11
Identity = 48/127 (37.80%), Postives = 71/127 (55.91%), Query Frame = 0

Query: 63  SDSSSSIGVPDDDSDEESVSSTGGDREEVQSKLNAGLNSVGSLERSLPIKRGLSSHFSGK 122
           S SS SIG   DD +        G   E++S  N  L+ + SLE +LPIKR +S  + GK
Sbjct: 24  STSSDSIGENSDDDE--------GGENEIESSYNGPLDMMESLEEALPIKRAISKFYKGK 83

Query: 123 SKSFANLAEAKS--VKDIEKPENSFNKRRRFFIASKLARKTSFYTWPNPKSMPLLALREE 182
           SKSF +L+E  S  VKD+ KPEN +++RRR  ++ ++  +      P  KS+  ++ RE 
Sbjct: 84  SKSFMSLSETSSLPVKDLTKPENLYSRRRRNLLSHRICSRGGISKKPF-KSVLAMSQREG 141

Query: 183 EHHDDGD 188
           +    GD
Sbjct: 144 DSSSSGD 141

BLAST of CmaCh05G006240 vs. TAIR 10
Match: AT5G21940.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G43850.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 4.1e-11
Identity = 54/137 (39.42%), Postives = 73/137 (53.28%), Query Frame = 0

Query: 49  FASGKGPEDEIDYRSDS-SSSIGVPDDDSDEESVSSTGGD---REEVQSKLNAGLNSVGS 108
           F+    P D     S S SSSIG   DD   E  S  GGD     EV+S     L  + S
Sbjct: 23  FSPSPSPSDSSSSPSSSASSSIGRNSDDG--EKSSEDGGDDAGENEVESPYKGPLEMMES 82

Query: 109 LERSLPIKRGLSSHFSGKSKSFANL-AEA-------KSVKDIEKPENSFNKRRRFFIASK 168
           LE+ LP+++G+S ++SGKSKSF NL AEA        S+KD+ KPEN +++RRR  +  +
Sbjct: 83  LEQVLPVRKGISKYYSGKSKSFTNLTAEAASALTSSSSMKDLAKPENPYSRRRRNLLCHQ 142

Query: 169 LARKTSFYTWPNPKSMP 174
           +        W N K+ P
Sbjct: 143 I--------WENNKTTP 149

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT5G24890.14.4e-2947.62unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G31510.11.8e-2239.65unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G24550.14.0e-2237.87unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G43850.11.8e-1137.80unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G21940.14.1e-1139.42unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..103
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 175..216
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 11..25
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 86..103
NoneNo IPR availablePANTHERPTHR33172OS08G0516900 PROTEINcoord: 50..241
NoneNo IPR availablePANTHERPTHR33172:SF46OS08G0516900 PROTEINcoord: 50..241

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh05G006240.1CmaCh05G006240.1mRNA