CmaCh14G004660 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G004660
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Unknown protein
Location: Cma_Chr14 : 2273043 .. 2277097 (+)
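
The location string can be split into chromosome, start, end and strand programmatically. The sketch below is illustrative only: `parse_location` and `span_length` are hypothetical helper names, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Parse 'Chrom : start .. end (strand)' into (chrom, start, end, strand)."""
    pattern = r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1


chrom, start, end, strand = parse_location("Cma_Chr14 : 2273043 .. 2277097 (+)")
print(chrom, start, end, strand, span_length(start, end))
```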
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGCAATTGATGATCTGCAGGTTTCTCAAAGTTCCGATCGATCCGTCAAGCCATTCAGGTAATTCCTCCACTTTTGGAAGCTCTCTGATTGTGAATCTCTGAAGCCTAAATTCGGCGTTCTCTAAGGTGAGTGTTAGTTGCTTGCAGCCCCAGATAACCAAATTCTGTAGTGATTTTAGATATATCATCTCATTTGGCAACGATTTCAAAGTATCGCAATTGTAAATCATTAAGGTTTCAAGGTCTGCAAGGCACTCTGGCCGTTCGAATAGAATTTGTAGGTTTTTGCACCCTCCAATGGCGAGAAATTGCAGAGAAGTCATGGTTCCAACTCCGTTTTTGTGCAAACGAAGCTCTTTTGTTGTTATCCACAAGTATTTCAGGCTGATCAAATTCCTTATATCATTTGGCAATTCTTGAAGTTCGGAACAAGATTCAAGAATCAAGGTTTGCAGACTTTGTAGATTGCAGATTGAATTCGGCAGCCGCTTGATGTTCTTATTTCCATGCAAGTCGAGGTATCTCAGGCTTGTAAGTGTTCCTATAGCATTTGGAATTTCCTCTATGTTTGCATTGCCTAAATGTAACAATCTTAAGCTGTTGTACTTCGGAATGCACATTCCAGGTAGGGATGCATTAGGTGGCACCATATCTCTTACATGGAAGGCTAGGGATTTTATTTGAGTGACTGGTTTGGTACCTACTAAGCCCTCAGATTGTTCTTGTGCGAGTACGTGTGCAAGTTTTTGGATAAGAGGGTGCAGTTTAAACCAGTAGCCGAGGCCATGTTCTTCAATTTCTTGAACGAAACATCTTGACCATAGCTCCATGAAATACCTCTCACCTATATTTTTCATAGATGCGTGATTCTCTTCAGGTGAATGGAGGAGCCCATGTGCCATCCATAACTGAATCACGTCATTTGAAGATAATATGCGGTCTGGTAATTGAGCACAATGAAGAAAACAGGGCTTCAGGTGTGAGGGCATTTGATTATAGCATAATCTGAGTATATGTAAAATACCGTCATTTTCTTCCTCTTGTGCCCACAACTTGTCCTTGATATTCTTCCACTCAGCTATTCTAATTTTTGAAGATAGCAAGCTTCCCATACACTTTATTGCCAATGGAACTCCCTTGCATTTCTGCAGAATTTGGTTTTCTATTTCAGTGAATACTTCTCTGTTTACTTCTTGTCTATCTCTGAAAGCATATCTTTTGAATAACTTAGCTGATTTATCCTCTGACAGCCTCTCCAGTTTGAAGCCTGTTGGAACTTCCTGATTCTTATCGTTCGAATTCTGCATCGATTGTTTCTCAAACGCATAGACTATTCTTACTCCTAGAGCATCTGCGGTCTCCTCACTTTGAGTGGTCACTAAGATTTTGCTGTCCTCGTTTCCCATGTCCAATAGTTCCATCAATGCGGATGAATCGTCTAAGTTTGTGGTCGAAAGGTCTTGAATGACGAGCAAAAAAGATGTACCCTTCAGAAGTTGTCGAACCTTTAAATGCAATTGTTTCTTAGACAAGTCATTACAAATTTCTTTACAGGCTACAACACTATATACCTCTTCGATCAATCTCTGTGTATCAAAACCTTCTTTCACACAAACCCAGAATCTTGAAAGATACTGTTCAACTACTACTTTATGATGGTAAAGGAATTTGGCAAGTGTGCTCTTACCTATACCTGCTTCCCCAACTATGAGGAAGAATCCATGACTAGATTTGAAAGTTTGCAGAAGGTCAGCGTACCGATTGTCATGATAATCCGTTGGAGCATCAACTAATGTGTACTGCAATTGCCAACTTGGTTTCATGTGTCTTACTGAAACTTCAGTAGTGCTAAGGAAGGAGTGTGCTGTCTTCATTCTTCCTATGGAATCTGGGTGCTTTTCAGTGAGATGAAATTCATACATTCTGGCTGCAAGTGTATTCAAATGATTGCATAGTCTCTTCATTTTACGAATCAGTTTAGATTTATGTGCAGAAAT
TGGATCAGAGCACAAGAAAGGGGCTTGTACCTTTTTTCCAGTGCCCCCTTGTTCAAAGAAATCCCATTTGAATTCATCCATGGAGTCCTCAATTTGGTAAAAGATGTTTTGAAGCTCCTTAAGCCAGTTACGTAGATCATGATTCTGTTCTTGTTCTTGCTTCTCTTCAGCATCAAAAAGAATTGATTTAAGGCTCAACATGGTATACTCGATTGTTCGGAAGTTTGAGCAATTTTTTAGCCTACTCAAGATAGTTTCAGCGTGGTTGTAGATGAAATTAGCCATATTCTTATGGAAAATGTTGTTCCTCCTTGCGATTTATTGGGTTGTCCTCAATTGTCTTCTTACTCTTTGAGTCTGCCTGCCTTGTCTCTCTCGGGTGTCTACAGCAGATCTGATTAGTCTCGTGATGAAGAATAGTCTTTTCCGATCTTCCACTAAGAAAATCAGAGCATTGTCACGACAACTCAATTTACCCGATAAACTACTCAAGGCTCATGGTTTAGTAAGAGGGGGAGATTTGCTTTGAATCCGGGATGAATTTCTATAAATGGAGATTGTTGATATATGCACCTAATCCGTGGATTCGTTGCATTTTCATGAAGCAAAATAGCTTCAAACTTAACTCTTAGGGAAATGTATATTCTTCCCGTGCATCCTTGAAGAATAGAATAGGAGCACTCAACTCTCGCCAGGTATAATACTTCTCTTTAGGATTTATGTATTTAAGGAGTTTTGCAACTGTTTGCTTTTTATTCTAAACAAGTGGAGCCCTCATTTCTTCAAATGAACAGAGAATTCTATTCTTGCTGTGTTCGAAACTATCTGGAATAATCCCCTTCTTTCTTAGATGTAATTCCAAACTCAACACTTGGGGAAATGTATAGTCTTTCCCTTTATATTAAAGAACTGTATAGGAGGGGTCTTGGTATTCTTTAAGCTATTCCTTTACAGGTTCTTACAATTTCCCCTCTTACTTCAGAGTTATTTGGTAGAACTAGACCTTCTTTATTGAGAAGAACTAATATGAGCACCCAAAATCCTCTGCTTAGAGTTTGATTATCATGATTTCCATTCATTTTAAGGTTAACCAAATCCTGTTTTTGATTGTTTTGTTTATCTCAATTAGATTTACTCCAAATTAATTGACAAGATTCAAAATTTAGATGAATTTTCGGACTATTTTGTGATATTTGAACGATTGGTTGTGTTTAGGTTGTAGCCACAAAGCCATTAATAAATTTCGTGGATAACTTATGAATTGCGAGCTGATATGAACTTATGAGTGATAGAAGTATTATCATATACTACAGGGTTGTAATTACGCTGCATGCACTCGAACTCTCATAAGAGTCTTTATTGTGGACACATATACTACACGCCATATAATACTATGAACCGCACCATCAAAGGATTGATGATGATTATTTATATATATATAGATTCTGATTCAGGGAAAGGCCTTCCTGTGGTTCAAGCCAATTGCCTGGGGAGACACTCCGCTGGAACCAGACCCGCTACCACTCCCAGACCCCGACCCAGATCCAGAAGCCGAGCCCGAACCACTGCTTGATCCGCTTCCTGAGCCTCCCCCGCCACCGCCACCGCCACCGCCACCACCATTGGGCTGCTGCACTGCCGCAAGGCTGATGCTGCTGCTGCTGCTTATTAAGAAGATGAGAAAGCAGAAGCGGAGGTAAAGCATGGATGGAGAGGCCATGAGGGAATAAGGCTGAAGGGGACGTTTGTTGTTGATATTGGTATTTATAGAGAGTTTTGGGAGTTTGTGGGTTTGGGGCGTCTGGCATTTGTTGAGCGCGCAGATTGCCAAGGAGCCGACAATGTGTCGCCCTTTTGGGCATCGAACACAACACGCAAATGCGTCTTAATAGTTCCCATCAGTGGGGCGCCGGGGGGGCTTAGTGTGATCATCTCTTGTGTCCACTATTTTTTTTTGTTTTTTCTTCCCTTTCAGGCCCTTTTTTTTTCTTTTCTTTTCT
TTTCTTTTCTTTTCTTTTCTGAGTTGTTGTGGAGAAGATGAGGAAAGTGAACTAA

mRNA sequence

ATGGGGCAATTGATGATCTGCAGGTTTCTCAAAGTTCCGATCGATCCGTCAAGCCATTCAGATATATCATCTCATTTGGCAACGATTTCAAAGTATCGCAATTGTAAATCATTAAGGTTTCAAGGTCTGCAAGGCACTCTGGCCGTTCGAATAGAATTTATTGAATTCGGCAGCCGCTTGATGTTCTTATTTCCATGCAAGTCGAGGTATCTCAGGCTTGTGAATGGAGGAGCCCATGTGCCATCCATAACTGAATCACGTCATTTGAAGATAATATGCGGTCTGGGAAAGGCCTTCCTGTGGTTCAAGCCAATTGCCTGGGGAGACACTCCGCTGGAACCAGACCCGCTACCACTCCCAGACCCCGACCCAGATCCAGAAGCCGAGCCCGAACCACTGCTTGATCCGCTTCCTGAGCCTCCCCCGCCACCGCCACCGCCACCGCCACCACCATTGGGCTGCTGCACTGCCGCAAGGCTGATGCTGCTGCTGCTGCTTATTAAGAAGATGAGAAAGCAGAAGCGGAGAGAGTTTTGGGAGTTTGTGGGTTTGGGGCGTCTGGCATTTGTTGAGCGCGCAGATTGCCAAGGAGCCGACAATGTGTCGCCCTTTTGGGCATCGAACACAACACGCAAATGCGTCTTAATAGTTCCCATCAGTGGGGCGCCGGGGGGGCTTATTGTTGTGGAGAAGATGAGGAAAGTGAACTAA

Coding sequence (CDS)

ATGGGGCAATTGATGATCTGCAGGTTTCTCAAAGTTCCGATCGATCCGTCAAGCCATTCAGATATATCATCTCATTTGGCAACGATTTCAAAGTATCGCAATTGTAAATCATTAAGGTTTCAAGGTCTGCAAGGCACTCTGGCCGTTCGAATAGAATTTATTGAATTCGGCAGCCGCTTGATGTTCTTATTTCCATGCAAGTCGAGGTATCTCAGGCTTGTGAATGGAGGAGCCCATGTGCCATCCATAACTGAATCACGTCATTTGAAGATAATATGCGGTCTGGGAAAGGCCTTCCTGTGGTTCAAGCCAATTGCCTGGGGAGACACTCCGCTGGAACCAGACCCGCTACCACTCCCAGACCCCGACCCAGATCCAGAAGCCGAGCCCGAACCACTGCTTGATCCGCTTCCTGAGCCTCCCCCGCCACCGCCACCGCCACCGCCACCACCATTGGGCTGCTGCACTGCCGCAAGGCTGATGCTGCTGCTGCTGCTTATTAAGAAGATGAGAAAGCAGAAGCGGAGAGAGTTTTGGGAGTTTGTGGGTTTGGGGCGTCTGGCATTTGTTGAGCGCGCAGATTGCCAAGGAGCCGACAATGTGTCGCCCTTTTGGGCATCGAACACAACACGCAAATGCGTCTTAATAGTTCCCATCAGTGGGGCGCCGGGGGGGCTTATTGTTGTGGAGAAGATGAGGAAAGTGAACTAA

Protein sequence

MGQLMICRFLKVPIDPSSHSDISSHLATISKYRNCKSLRFQGLQGTLAVRIEFIEFGSRLMFLFPCKSRYLRLVNGGAHVPSITESRHLKIICGLGKAFLWFKPIAWGDTPLEPDPLPLPDPDPDPEAEPEPLLDPLPEPPPPPPPPPPPPLGCCTAARLMLLLLLIKKMRKQKRREFWEFVGLGRLAFVERADCQGADNVSPFWASNTTRKCVLIVPISGAPGGLIVVEKMRKVN
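
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code applies to this gene (the page does not say which table was used), the translation can be reproduced as follows. `translate` and `CODON_TABLE` are names chosen for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGGGCAATTGATGATCTGCAGG"))  # MGQLMICR
```

Passing the full CDS string to `translate` should give the full protein for comparison with the sequence above. Codons containing ambiguous bases such as N are not handled in this sketch.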
BLAST of CmaCh14G004660 vs. TrEMBL
Match: E5GBX5_CUCME (Putative uncharacterized protein (Fragment) OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 9.8e-19
Identity = 66/100 (66.00%), Positives = 73/100 (73.00%), Query Frame = 1

Query: 120 PDPDPDPEAEPEPLLDPLPEPPPPPPPPPPPPLGCCTAARLMLLLLLIKKMRKQKRREFW 179
           PDPDPDPEAEP PLLD LP+  PPPPPPPPPPLG CTAAR   LLL+IKKM+KQK+R+  
Sbjct: 2   PDPDPDPEAEPPPLLDALPD--PPPPPPPPPPLGRCTAAR---LLLIIKKMKKQKQRKSL 61

Query: 180 E---FVG-LGRLAFVERADCQGADNVSPFWASNTTRKCVL 216
           E    VG    LAFVE+ DCQGA NV+ FW    TRK  L
Sbjct: 62  EGEAIVGSFEYLAFVEQYDCQGATNVAGFWTLKATRKFAL 96
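
The identity figure reported for an HSP is the number of alignment columns with identical residues divided by the total number of columns, gaps included. A rough sketch of that calculation is shown here on the first 20 columns of the alignment above; `percent_identity` is a hypothetical helper, not part of any BLAST tool.

```python
def percent_identity(query, subject, gap="-"):
    """Return (identical columns, total columns, percent) for two aligned strings."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    columns = len(query)
    # A column counts as identical only if both residues match and are not gaps.
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != gap)
    return identical, columns, 100.0 * identical / columns


# First 20 alignment columns of HSP 1 (query vs. subject).
identical, columns, pct = percent_identity(
    "PDPDPDPEAEPEPLLDPLPE",
    "PDPDPDPEAEPPPLLDALPD",
)
print(f"Identity = {identical}/{columns} ({pct:.2f}%)")
```

The "Positives" count additionally includes conservative substitutions scored by a substitution matrix, which this sketch does not attempt to reproduce.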

BLAST of CmaCh14G004660 vs. NCBI nr
Match: gi|307136130|gb|ADN33975.1| (hypothetical protein [Cucumis melo subsp. melo])

HSP 1 Score: 102.1 bits (253), Expect = 1.4e-18
Identity = 66/100 (66.00%), Positives = 73/100 (73.00%), Query Frame = 1

Query: 120 PDPDPDPEAEPEPLLDPLPEPPPPPPPPPPPPLGCCTAARLMLLLLLIKKMRKQKRREFW 179
           PDPDPDPEAEP PLLD LP+  PPPPPPPPPPLG CTAAR   LLL+IKKM+KQK+R+  
Sbjct: 2   PDPDPDPEAEPPPLLDALPD--PPPPPPPPPPLGRCTAAR---LLLIIKKMKKQKQRKSL 61

Query: 180 E---FVG-LGRLAFVERADCQGADNVSPFWASNTTRKCVL 216
           E    VG    LAFVE+ DCQGA NV+ FW    TRK  L
Sbjct: 62  EGEAIVGSFEYLAFVEQYDCQGATNVAGFWTLKATRKFAL 96

The following BLAST results are available for this feature:
Match Name                    E-value   Identity   Description
E5GBX5_CUCME                  9.8e-19   66.00      Putative uncharacterized protein (Fragment) OS=Cucumis melo subsp. melo PE=4 SV=...
gi|307136130|gb|ADN33975.1|   1.4e-18   66.00      hypothetical protein [Cucumis melo subsp. melo]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term   Definition
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh14G004660.1   CmaCh14G004660.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description    Source    Source Term   Source Description                      Alignment
None       No IPR available   unknown   SSF101447     Formin homology 2 domain (FH2 domain)   coord: 141..182; score: 8.1

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                       Block
CmaCh14G004660   ClCG10G016810   Watermelon (Charleston Gray)   cmawcgB222
The following gene(s) are paralogous to this gene:

None