ClCG03G004400 (gene) Watermelon (Charleston Gray)

Name: ClCG03G004400
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: early nodulin-like protein 1 LENGTH=370
Location: CG_Chr03 : 4782587 .. 4786645 (+)
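The location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` and the returned dictionary keys are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its fields.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$"
    match = re.match(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr03 : 4782587 .. 4786645 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the feature spans 4059 positions on the plus strand of CG_Chr03.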
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCGTACTAATTTCCTTCTTTGGCAAAACCTAGCTTGGCCTATTTTGAGAAGCTATCGACTGGAAGGTTATCTAACTGGAGAAAAAGTTTTCCCTAGTAAGTTCGTGAATCAGAGTGTTGAAGCCTATGAAGAAATGGCGATGGACGGCGAAGGTAGAAGCACTGGTTCAATTAGGCCGAAAAGAGAAAGCGAGTTTGCATGCAAAAATATGAAACATGGATAGCTGTAGGTCAATTGCTCCTGGCTTGGTTTGCAACACAATGTTGCCTGAAATAGCAACTCAGGTAATGGGTTATCAATCCTCCAAATACTTGTCGGACACAATTGATAGCACCTTAAATGTGCTATCTTACTCAGTTTAACTAGGAATCATTATACTACTATTTAATTTATTAAAAATATTATTTTGTAGGTATCGAGCTCCTGAAAGGTAGAATAGAGCGTTAGGAAGAAAACGAACCTAGCTAAAACAAACCAATTTTGAGTCTAAAGAAGAAATTAGGTCAAATAGAAGCTAAAAAAACCAAAATGCCCTTAAAAGACCTCGTTGTGGTGTTGCCGCGCAGATATCAAAACTACACTTCGGTAATTATGTCATATTTTTTTATCTAGTTGGCTGTTTTGAGCAAATAATCACTTGTTTTCTTTGTTTTTCCATCCTCTATACATCTACATCATTAGTTTTAGATTTTGAGTTGAATATTTCCATGTAATTGGGCTGAAAGGTGGCGGGTTCAAGCTTTCTCATCTCAATGGGAAGGTAATAATCTCCAATTTCTAGTTTTTTAAGTTTTCTATAATTGTAACCTTATGAGACTTGTGCTAAACTCTTAATCTTGATTATTTCTATTTTGTCTTTGGATTCTTTTACTGTCTTTGCATGTTTATGGTTTTACTCCACTTTAAAGAAATACTTTGATGGACTCGCTTTCGTAGGTAACCCTGTGTCTAATGATGATCTCATCTCTCAAGTACTCTCTGGTCTTGATGAAGAGTACAATCCGATGGTCGCTGTTGTACAAGGAAAACATGGAGTCGGCTAGACTGATTTACATTCTGATCTTCTCTCCTATGAAAAACGTCTTGAGTTCAAACAACGTTGAAATCAGGGGAAGTACCTTGTCCTTGTCTTCTACCAATACTAGTCAGTCTCCCTCTGTCAATATGACTCAGAACAAGGGGCAACCAAATCAAAACCCTCCCTCAAGTCCCAATCGAAATCAATAGAGAGGATTCTCTCCCAATCAGCGAGGTTGTGGAAGAGGCCGTGGAAGATGGGGATACCAAAATTGACCAATGTGTCAAAGTCTATACCAAGGTAGGTTATACAACTGAACTCTTTCATATCGTTACCTAAAAGAGTACAAAAATCCTAATCAAACTCATCTTTTTGAGTCAATTGGTCCTGCACCTAGCTTTAACAACAATCAAGGTTTTCGAGCACCTCAGGATCCATTTATGGCTAACTCTGTGATGGCCACTGAAACTCTAATTGATCCCTCGTGGTATGTTGATATGTTGATAGTGGAGCATACCATGAGGGATCCCTTATTTGTAAAACCAATTGGTTCGGTTTGGTTATTGGTTAACCTTAAAATCAATAAAACCGGACCGACTATCACCTCTAGTCTCCTGTTACAAACAAATGTTGTTGTGACCAAATCCATTTTGCACAAAAGACTTTATCATCCCTCCTCTCAAACACTAGACAAAATTATCAAGAGCTGTAAATTGAAGTTATCATCTAATATACAGCAATCTATTTTCTAAAGCTTGTCAATTTGGAAAGATCCACACTCTTCCTTTCTCAAATTATGTCTCTCACACTTCAAAACCTTTTGAACTGCTTCATACGAATCTCTAGGGTCCATCCCCAATTTTATTTGCAGATGGTTTCAGATATTATGTACTTTTGTTGATGACAACAATTGTTTTACTTGGTTATACCCCCTACGGCAAAAGAGTGATACTCTTCATGTCTTTAATGAATTTTGT
GCCTTAGTTATGAATCAATTTAACAGAAGTATCAAAATGTTGCAGCTAGATGGGAGAGGAGAATATAAACCAATCGTCGGTGTTTGTAACCAGCTCGGGATCTAAACACGAGTATCATGTCCATATATGTCAGCCCAAAATGGTCGTGCTGAATGAAAGCACTGCCATATTGTTGAAACCGACCTCACTGGTAACACCCGAAATGTGTTATCTTCCTCAACTTAACTTGGGATCGTCGTACTATTTAATGTGTTAAAAAATATCATTTTGTAGGTTTTGAGCATTTGACAGGTAGAGGGGGATGTTAGCAATGAATTGGAGAAACGGAGTTTTATTGTAACAATTGGTAAAGAGAACTTCACCTAGGCGCTGGCGTGACACTTTTTCTCGAGAGCAAGTCTACTGGCAAAGAGCAACAGCGTCGTGACGTTGCATAGGAGCGTCGTGATGCTCCGAAGTCTTGTCGTGGGTGTGCGCTCTGACTGTTTTATAAAGGAAAATCTCGATTTTTAGGTTAGAAGGCAGCCACCCGCCGAATCTTTCTAAAATTCTCTCTATTCTCTTCCTCCTTTTGATCATGTATCAAACCTTATTCCTAGTTGCTCATTATGATGTATCAAGGCTAAGGTAACAATTTTCTAGCTTGGATTTAGGGTTCTTTCTCGATTCTAATATGTTTGTAAGATTTTGAACTGTTATTTGTGATTGTTGGATTTAATTCAGTTCCAATTTTATTTACGATCATATTTGTGGGCATTGGTGAACACTATGACTACTGCTTTGAATGAACATGCATCAGGTGATACTTGATTATATGTTGATTAGTAGAAAGCAATAGGTTAGGTTTGGTTTGCAATTCGAATGAAAATTAGTATTTATCCAACCAACCTAGAATTGATCCGATAGAATATTTAATTTAGGCATTTCCAGTTAAATATTCCCCCTAATGTTTCCATAAAACTTAATGCAGTTAGGGCAAACTTGGATTTCGCATGTCTTTCATCGATTAAATTTATACTAATTTGTGATTTAGGACAATCTAATTTTGGACTTTCCAATTAAATCGGTTAGGTGTAGGCAATTCTGTAATTTCAGTAATGAAGAATTGATGATCAAATATACGGAATGAGAAAAATCCATTACAACCCAAGCTAAACTATTCTCATATAGATTTTATCTCTTTTAATTTTTGTGTTTACTTTTATGCAATTTAATTTTCTTGCAAATAATCATCTCAAACAATCCTCCTTCAGTTACTAGCTGCTTGGCTGAGACATTCGAATGCTCCTTGTGTTCAACCCAGTTTATCACTACTACGGTTTGGTAAGTACAGTAAATTATTTGGCACGGAATTGGGAGCCGAATTCCATCCGCCACTCATCCTCCTTGCCCAAGCGTCCATGCTCCTAAAATTATGGTGGTAAGCTTTTCAAACTGCTACTTTCCTGATTAATGGCTTACCTTACTCTTCTTTCCAGGGTTGAACACTAATTCAACTTTTATACAAGCAACTTATGGACATATCCGCTCTTCGGTCTTTGGGTGTGCATGCTTCCCAAATCTAAGGCCATATCAGAAGGACAAGTTTGATTTTCACACAGAGAAGTGTGTGTATCTTGGACCATCTCTAATACACAAAGGGCATCGGTGTCTAAATGCAGTTGGTGATTGAATCTTTATCTTAAGACATGTTGTGTTCAATGAAGTTGAATTTCCTTTTCAGTATGGTTTTGACTATCCCAATCCTCAGGCCTAAACAATACCAGCCCAAAACCCAATTTTGTCTTGGTTTAGTTTACTGCCCGTCCCTTAACCTGCATAAATTATAAACCCAATTAGACCCACCTCTAGCCCAAATACTACCCAAACAAGCCTAACCTCAAACACTCACCAGAGTCCGAACATAAGCCTAAATCCATCTTCAAATGAAAACCCAAGCCCAATCCAGCATAGCCCTTCACCACTTCCAACCCAAGCAAACTCAACTGAACCTACCTCTC
CAGTCAACCAATTATCTCTTCACAGTCCCTCTAGACCGTCTCACCTTCATCGTCCTTAA

mRNA sequence

ATGGATCGTACTAATTTCCTTCTTTGGCAAAACCTAGCTTGGCCTATTTTGAGAAGCTATCGACTGGAAGGTTATCTAACTGGAGAAAAAGTTTTCCCTAGTAAGTTCGTGAATCAGAGTGTTGAAGCCTATGAAGAAATGGCGATGGACGGCGAAGTCTCCCTCTGTCAATATGACTCAGAACAAGGGGCAACCAAATCAAAACCCTCCCTCAAGTCCCAATCGAAATCAATAGAGAGGATTCTCTCCCAATCAGCGAGGTTGTGGAAGAGGCCGTGGAAGATGGGGATACCAAAATTGACCAATGTGTCAAAGTCTATACCAAGACCCACCTCTAGCCCAAATACTACCCAAACAAGCCTAACCTCAAACACTCACCAGAGTCCGAACATAAGCCTAAATCCATCTTCAAATGAAAACCCAAGCCCAATCCAGCATAGCCCTTCACCACTTCCAACCCAAGCAAACTCAACTGAACCTACCTCTCCAGTCAACCAATTATCTCTTCACAGTCCCTCTAGACCGTCTCACCTTCATCGTCCTTAA

Coding sequence (CDS)

ATGGATCGTACTAATTTCCTTCTTTGGCAAAACCTAGCTTGGCCTATTTTGAGAAGCTATCGACTGGAAGGTTATCTAACTGGAGAAAAAGTTTTCCCTAGTAAGTTCGTGAATCAGAGTGTTGAAGCCTATGAAGAAATGGCGATGGACGGCGAAGTCTCCCTCTGTCAATATGACTCAGAACAAGGGGCAACCAAATCAAAACCCTCCCTCAAGTCCCAATCGAAATCAATAGAGAGGATTCTCTCCCAATCAGCGAGGTTGTGGAAGAGGCCGTGGAAGATGGGGATACCAAAATTGACCAATGTGTCAAAGTCTATACCAAGACCCACCTCTAGCCCAAATACTACCCAAACAAGCCTAACCTCAAACACTCACCAGAGTCCGAACATAAGCCTAAATCCATCTTCAAATGAAAACCCAAGCCCAATCCAGCATAGCCCTTCACCACTTCCAACCCAAGCAAACTCAACTGAACCTACCTCTCCAGTCAACCAATTATCTCTTCACAGTCCCTCTAGACCGTCTCACCTTCATCGTCCTTAA

Protein sequence

MDRTNFLLWQNLAWPILRSYRLEGYLTGEKVFPSKFVNQSVEAYEEMAMDGEVSLCQYDSEQGATKSKPSLKSQSKSIERILSQSARLWKRPWKMGIPKLTNVSKSIPRPTSSPNTTQTSLTSNTHQSPNISLNPSSNENPSPIQHSPSPLPTQANSTEPTSPVNQLSLHSPSRPSHLHRP
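
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
# Build the standard genetic code (NCBI table 1) with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

# First eight codons and last four codons of the CDS listed in this record.
cds_start = "ATGGATCGTACTAATTTCCTTCTT"
cds_end = "CATCGTCCTTAA"
print(translate(cds_start))  # MDRTNFLL
print(translate(cds_end))    # HRP
```

The two fragments translate to MDRTNFLL and HRP, matching the first eight and last three residues of the protein sequence shown above.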
BLAST of ClCG03G004400 vs. Swiss-Prot
Match: EGR1B_XENLA (Early growth response protein 1-B OS=Xenopus laevis GN=egr1-b PE=2 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 1.2e-06
Identity = 47/168 (27.98%), Positives = 78/168 (46.43%), Query Frame = 1

Query: 35  KFVNQSVEAYEEMAMDGEVSLCQYDSEQGATKSKPSLKSQSK-SIERILSQSARLWKRPW 94
           +F + + +A+ EM++  E ++ +  S    T   PSL    + S+E   + S  LW  P 
Sbjct: 39  QFDHHAADAFSEMSLSNEKAVLE-SSYANHTTRLPSLTYTGRFSLEPAPNSSNTLWPEPL 98

Query: 95  ---KMGIPKLTNVSKSIPRPTSSPNTTQTSLTSNTHQSPNISLNPSSNENPSPI------ 154
                G+  + NVS S   P+SSP+++ +S +S++ QSP +S +  SNE+ SPI      
Sbjct: 99  FSLVSGLVGMANVSSS-SAPSSSPSSSSSSSSSSSSQSPPLSCSVQSNES-SPIYSAAPT 158

Query: 155 ----------QHSPSPLPTQANST----EPTSPVNQLSLHSPSRPSHL 179
                      HSP P    + ++     P  PV++ +   P  P +L
Sbjct: 159 FPNSSPEMFPDHSPQPFQNASTASIPYPPPAYPVSKTTFQVPMIPDYL 203
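
The identity and positives percentages in the HSP header follow directly from the reported counts over the alignment length. A short sketch of that arithmetic, where the helper name `percent` is hypothetical:

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the HSP header: 47 identical and 78 positive
# positions over an alignment of 168 columns.
identity = percent(47, 168)
positives = percent(78, 168)
print(identity, positives)  # 27.98 46.43
```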

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
EGR1B_XENLA   1.2e-06    27.98           Early growth response protein 1-B OS=Xenopus laevis GN=egr1-b PE=2 SV=1
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG03G004400.1    ClCG03G004400.1    mRNA


The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None