CmaCh05G007800 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh05G007800
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionCCHC-type domain-containing protein
LocationCma_Chr05: 4668841 .. 4670387 (+)
RNA-Seq ExpressionCmaCh05G007800
SyntenyCmaCh05G007800
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTACGAGAAACCGTCGACTATGAACAAGGTGTATTTGATGCAGAGACTGTTTAATCTACAAATGTCTGAAAGTGGATCTGTTGCTCGTCATATAAATGAATTCAATATAATTGTAAGTCAACTGAGTTCGGTAGATATTAATTTCGAAGATGAAATTAAGGCAATGATTTTGATGTCATCTTTACTCGAGTCGTGGAATACTGTTGTTGCCGTGATCAACAGTTCACGAGGATCTGATAAACTGAAGTTCGATGAAATTCGAGATGTAGTTCTTAGCGAAAGTATTCGCAAACAGGAAATCGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGTAAATCAAAGAACCCAAACAATGGGCGATCAAAATCAAGGAACCAAGGAAAATCTCCAAACAAACCAAATGTAAAGTGTGGGAGTTGTGGAGAAAAAGGTCATTTTCAAATAGACTGTAAAAGACCAGAGAGGAAGTAGTATCACAAATTAGAGGATGACGATGATTTGGTAAATTCAGCAGAAGACATTGAGGATGCTCTAATCCTCAGTGTGCACAGTTCGATTGAATCTTGGATTTTAGATTTAGGTGCATCTTTTTTCATTCGTCTCCAAATAAAGAGCTGTTTCAAAATTTTAAGTTTGGAAATTTCAAGAATGTGTATCTTGCCGACAACAAAGATTTGAAGATTGAAGGAAAATGAGATGTATGCATAAAAAACTGCGACAGGAAATCAGTGGACATTAAAAGATGTCAGATATATTCTTAGTTCAAGTAGAACCTGATCTCTATTGGTCAGTTGGATAGCACAGGTTATGCAACAAAGTTTGGTAAGAGTTCGTGGAAGATTGTGAAGGGTGCTATGGTGGTAGCACGTGGCACAAAATCTGAAACTTTATACACCACTGCAGGGTGTATGAACAGAGTTGCTGTTGCTGAGAGTGCTTCAAATTCAAGTATATGGTACAATAGACCTGGACATATGAGCGTTAAAGGAATGAAGATGTCGGGTGCGAAAGAAGTTTTGGAAGGTCTAAAATCTATTGATATGAGTCCTTGTGAGAACTGCGTTATGAGCAAACAAAAGCGAGTTAGCTTCACAAAGACTGCCAGAGAATTGAAGAAAGATATCGGGACAACAAAGCAAGTGGGAGTTGAGGTAGAGTTGCAGAACAGTTCATAGAGTGATGTTGTAGCGGATACTCAAGAAACTCCTGAGACTGTTGCTGAGGAACCAGAGGTGAAACAAGTGGGAGTTGAGGTTGAGTTGCTGAAAGATTCATCTAGTGATGTTGTAGCAGATACTCAAGAAACTCTTAAGATTTTTGCTGAGGAACCAGAAGCAGAGCAAGTGACACCTGAGCAGGTGTTGAAAAAATCATCCAGAGCCATCAGAGTACCAGATAGGTATGTACCTTCTTTACACTATTGGTTGAGGACTGATGAAGGGGAACGAAAGCCATTTGATGAGGACCGACAGTTTGAGGATACAACTAAGTGGGAGCAAGCCATGGATGATGGGATGTCTAAGCTTTAA

mRNA sequence

ATGTACGAGAAACCGTCGACTATGAACAAGGTGTATTTGATGCAGAGACTGTTTAATCTACAAATGTCTGAAAGTGGATCTGTTGCTCGTCATATAAATGAATTCAATATAATTGTAAGTCAACTGAGTTCGGTAGATATTAATTTCGAAGATGAAATTAAGGCAATGATTTTGATGTCATCTTTACTCGAGTCGTGGAATACTGTTGTTGCCGTGATCAACAGTTCACGAGGATCTGATAAACTGAAGTTCGATGAAATTCGAGATGTAGTTCTTAGCGAAAGTATTCGCAAACAGGAAATCGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGTAAATCAAAGAACCCAAACAATGGGCGATCAAAATCAAGGAACCAAGGAAAATCTCCAAACAAACCAAATGTAAAGTGTGGGAGTTGTGGAGAAAAAGGTTATGCAACAAAGTTTGGTAAGAGTTCGTGGAAGATTGTGAAGGGTGCTATGGTGGTAGCACGTGGCACAAAATCTGAAACTTTATACACCACTGCAGGGTGTATGAACAGAGTTGCTGTTGCTGAGAGTGCTTCAAATTCAAGTATATGGTACAATAGACCTGGACATATGAGCGTTAAAGGAATGAAGATGTCGGGTGCGAAAGAAGTTTTGGAAGCGGATACTCAAGAAACTCCTGAGACTGTTGCTGAGGAACCAGAGGTGAAACAAGTGGGAGTTGAGGTTGAGTTGCTGAAAGATTCATCTAGTGATGTTGTAGCAGATACTCAAGAAACTCTTAAGATTTTTGCTGAGGAACCAGAAGCAGAGCAAGTGACACCTGAGCAGGTGTTGAAAAAATCATCCAGAGCCATCAGAGTACCAGATAGGTATGTACCTTCTTTACACTATTGGTTGAGGACTGATGAAGGGGAACGAAAGCCATTTGATGAGGACCGACAGTTTGAGGATACAACTAAGTGGGAGCAAGCCATGGATGATGGGATGTCTAAGCTTTAA

Coding sequence (CDS)

ATGTACGAGAAACCGTCGACTATGAACAAGGTGTATTTGATGCAGAGACTGTTTAATCTACAAATGTCTGAAAGTGGATCTGTTGCTCGTCATATAAATGAATTCAATATAATTGTAAGTCAACTGAGTTCGGTAGATATTAATTTCGAAGATGAAATTAAGGCAATGATTTTGATGTCATCTTTACTCGAGTCGTGGAATACTGTTGTTGCCGTGATCAACAGTTCACGAGGATCTGATAAACTGAAGTTCGATGAAATTCGAGATGTAGTTCTTAGCGAAAGTATTCGCAAACAGGAAATCGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGTAAATCAAAGAACCCAAACAATGGGCGATCAAAATCAAGGAACCAAGGAAAATCTCCAAACAAACCAAATGTAAAGTGTGGGAGTTGTGGAGAAAAAGGTTATGCAACAAAGTTTGGTAAGAGTTCGTGGAAGATTGTGAAGGGTGCTATGGTGGTAGCACGTGGCACAAAATCTGAAACTTTATACACCACTGCAGGGTGTATGAACAGAGTTGCTGTTGCTGAGAGTGCTTCAAATTCAAGTATATGGTACAATAGACCTGGACATATGAGCGTTAAAGGAATGAAGATGTCGGGTGCGAAAGAAGTTTTGGAAGCGGATACTCAAGAAACTCCTGAGACTGTTGCTGAGGAACCAGAGGTGAAACAAGTGGGAGTTGAGGTTGAGTTGCTGAAAGATTCATCTAGTGATGTTGTAGCAGATACTCAAGAAACTCTTAAGATTTTTGCTGAGGAACCAGAAGCAGAGCAAGTGACACCTGAGCAGGTGTTGAAAAAATCATCCAGAGCCATCAGAGTACCAGATAGGTATGTACCTTCTTTACACTATTGGTTGAGGACTGATGAAGGGGAACGAAAGCCATTTGATGAGGACCGACAGTTTGAGGATACAACTAAGTGGGAGCAAGCCATGGATGATGGGATGTCTAAGCTTTAA

Protein sequence

MYEKPSTMNKVYLMQRLFNLQMSESGSVARHINEFNIIVSQLSSVDINFEDEIKAMILMSSLLESWNTVVAVINSSRGSDKLKFDEIRDVVLSESIRKQEIGDSSGNALSVDRRGRSKSKNPNNGRSKSRNQGKSPNKPNVKCGSCGEKGYATKFGKSSWKIVKGAMVVARGTKSETLYTTAGCMNRVAVAESASNSSIWYNRPGHMSVKGMKMSGAKEVLEADTQETPETVAEEPEVKQVGVEVELLKDSSSDVVADTQETLKIFAEEPEAEQVTPEQVLKKSSRAIRVPDRYVPSLHYWLRTDEGERKPFDEDRQFEDTTKWEQAMDDGMSKL
Homology
BLAST of CmaCh05G007800 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 6.4e-12
Identity = 51/153 (33.33%), Postives = 95/153 (62.09%), Query Frame = 0

Query: 1   MYEKPSTMNKVYLMQRLFNLQMSESGSVARHINEFNIIVSQLSSVDINFEDEIKAMILMS 60
           +Y   +  NK+YL ++L+ L MSE  +   H+N FN +++QL+++ +  E+E KA++L++
Sbjct: 91  LYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLN 150

Query: 61  SLLESWNTVVAVINSSRGSDKLKFDEIRDVVLSESIRKQEIGDSSGNALSVDRRGRSKSK 120
           SL  S++ +   I   + + +LK D    ++L+E +RK+   ++ G AL  + RGRS  +
Sbjct: 151 SLPSSYDNLATTILHGKTTIELK-DVTSALLLNEKMRKKP--ENQGQALITEGRGRSYQR 210

Query: 121 NPNN-GRSKSRNQGKSPNKPNVK-CGSCGEKGY 152
           + NN GRS +R + K+ +K  V+ C +C + G+
Sbjct: 211 SSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGH 240

BLAST of CmaCh05G007800 vs. TAIR 10
Match: AT4G35820.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 43.1 bits (100), Expect = 5.0e-04
Identity = 24/62 (38.71%), Postives = 38/62 (61.29%), Query Frame = 0

Query: 1  MYEKPSTMNKVYLMQRLFNLQMSESGSVARHINEFNIIVSQLSSVDINFEDEIKAMILMS 60
          M +  S    +YL QRL  L++ E+  + +HIN F+ +V +  SVD+  E++ K MIL+ 
Sbjct: 1  MSKSTSVSTILYLRQRLQGLKIYETSDLIQHINTFDELVGEQVSVDVKIEEKTKDMILLC 60

Query: 61 SL 63
          SL
Sbjct: 61 SL 62

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109786.4e-1233.33Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
AT4G35820.15.0e-0438.712-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 1..99
e-value: 3.7E-20
score: 72.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 305..335
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 103..143
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 122..139
NoneNo IPR availablePANTHERPTHR34676FAMILY NOT NAMEDcoord: 1..151
NoneNo IPR availablePANTHERPTHR34676:SF1ZINC FINGER, CCHC-TYPE, TUBBY C-TERMINAL-LIKE DOMAIN PROTEIN-RELATEDcoord: 1..151

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh05G007800.1CmaCh05G007800.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0110165 cellular anatomical entity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding