CmaCh00G002230 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh00G002230
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Gag/pol protein
Location: Cma_Chr00: 15752788 .. 15754709 (-)
RNA-Seq Expression: CmaCh00G002230
Synteny: CmaCh00G002230
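
The Location field above can be split into its parts and turned into a span length. The following is a minimal sketch, not part of the database record: the helper name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive, as is common for GFF-style annotation.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

# Location string copied from the Overview section of this record.
seqid, start, end, strand = parse_location("Cma_Chr00: 15752788 .. 15754709 (-)")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(seqid, strand, span_length)  # Cma_Chr00 - 1922
```

Under that assumption the gene spans 1922 bp on the minus strand of Cma_Chr00.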
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTATAAAGTTCTTAACTGATTTGGAAGTCGTTCCAAACATGCATCTTCCCATCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTTGAAAGGACCAAGAAGCCATAACCGAAGAAAACATATTGAGCGCAAATATCATCTCATAAGGAAGATTGTGCAACGAGGAGATGTGATCATCACACAGATAGCTTCGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTTAGTCTAGGACTACGAGTTATGTAATCTAGGGCAAGTGGGAGAAAAGGCAAGAGTATGTGATGATTATATAATTGTTATATAAAGCATGTTAGATGCATGTTATAGGTTATTTTCAACGTCATAAAACGAGATAGACGTTAAAATCTATAATGAAAATAAGTGCATGCTCGCTTAAGGTTAAGAATTAAGCACAATCAATTTTAATAGTTCAAATAGGTCGATGTCTTGTAAAATTTTGCTTACAAGAAAGCTCACCTGGCACGATCTTGACTAAGGCTGGAGGTACTTAAGTTGACGGTTTACGGAACACCTACTACTTAGTGGGAGATCGGACCAATACTTGAGCTTAGTTAGCCTAGTTTTATGTGCATGCTTGAGTGATGTAAGTTCTGGAAAAATTACCTAGACTTAGGTTATATTTAATTAGCCGAAATATACCTAAGTAATGAGTATACCCAACCGTTTAAAATACTTAGTGGGAGGAAAAAAGTATATGAGATACATCGTTTTCCTTTTCACGCTCTCTGAAAAATTCACACTGTGAGATTCATGCTCGGCCTCGTGTCGCCCTGGGAGCGTCTTCCATTCGGAAGGTATTTGCATGAGTCAATAACAAGGTGAATAGAGAAAGTACTTATAGTAAGTGGAAGAAGGAAGTATATAACCACGTCCTATGGTCTCCACTACTAGGTTGCACCGTGAGATTCTCATGTTCCGCCTGCATGTTGCCCTAGAGCAACTACCCATTCGGAGGGTCGTAACATGGGAGTCGAAACAACGCAAACTCCAGAAATGGATACTATTTCTTAGGGCCATTCCCAACTTTGGCTTTTTCATTCAATAGCATTGTTGGGACCGATCTATGAGGCCTGAAATTGGTGGGTCACACTTGCGAAGAATTACTAAGAGTTAGTGTTTAAGTCAAGAATGTCTTCGTTTGGTGAATGAGTGATTGACTGCCCCTCGGTAGCAGTTGCTCTAACTCACTAAAGTATCGTTGCAAATTAATAATTTTTTTTTGGTACATTATTACATTTGCTAAAACTGATTGGATTAAATAGCATTAAGTCAAATCACTAATAGAATTTTCTTTATATCTCAGCATGACAAACTCAATAGTACAATTACTCGCTTTTGAGAAATCAAACCTGAACACAATACTGGTAATTGATGATTTAAGGTTTGTTTTAACTGAGGAATGTCCTCCAAAACCCAACTCAAATGCAAACCGAACAGTTCGGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGTATATCTGATGTTTTGTCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCACTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAATGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGCTTTCCGCACAAATGTGGTGATGAACAAAATAGAATATAA

mRNA sequence

ATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTATAAAGTTCTTAACTGATTTGGAAGTCGTTCCAAACATGCATCTTCCCATCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTTGAAAGGACCAAGAAGCCATAACCGAAGAAAACATATTGAGCGCAAATATCATCTCATAAGGAAGATTGTGCAACGAGGAGATGTGATCATCACACAGATAGCTTCGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTTAGTCTAGGACTACGAGTTATGTTTGTTTTAACTGAGGAATGTCCTCCAAAACCCAACTCAAATGCAAACCGAACAGTTCGGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGTATATCTGATGTTTTGTCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCACTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAATGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGCTTTCCGCACAAATGTGGTGATGAACAAAATAGAATATAA

Coding sequence (CDS)

ATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTATAAAGTTCTTAACTGATTTGGAAGTCGTTCCAAACATGCATCTTCCCATCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTTGAAAGGACCAAGAAGCCATAACCGAAGAAAACATATTGAGCGCAAATATCATCTCATAAGGAAGATTGTGCAACGAGGAGATGTGATCATCACACAGATAGCTTCGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTTAGTCTAGGACTACGAGTTATGTTTGTTTTAACTGAGGAATGTCCTCCAAAACCCAACTCAAATGCAAACCGAACAGTTCGGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGTATATCTGATGTTTTGTCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCACTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAATGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGCTTTCCGCACAAATGTGGTGATGAACAAAATAGAATATAA

Protein sequence

MEAEYVAACEAAKESVWLIKFLTDLEVVPNMHLPITLYCDNSGAVANLKGPRSHNRRKHIERKYHLIRKIVQRGDVIITQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVMFVLTEECPPKPNSNANRTVRDAYDRWIKANDKARVYILASISDVLSKKHDVMGTAKEIMESLKGMFGQPSFSTRHEAIKYIYNCRMKEWTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMESLPKSFPHKCGDEQNRI
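
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch that assumes the standard code applies to this nuclear gene; the function name translate and the two short excerpts (the first 30 and last 24 nucleotides of the CDS) are chosen here for illustration only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# Excerpts copied from the start and end of the CDS listed above.
cds_start = "ATGGAGGCTGAGTATGTTGCTGCTTGTGAA"
cds_end = "GGTGATGAACAAAATAGAATATAA"

print(translate(cds_start))  # MEAEYVAACE
print(translate(cds_end))    # GDEQNRI
```

Both results match the first ten and last seven residues of the protein sequence; passing the full CDS to the same function would give the complete translation.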
Homology
BLAST of CmaCh00G002230 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 81.6 bits (200), Expect = 1.4e-14
Identity = 44/108 (40.74%), Positives = 61/108 (56.48%), Query Frame = 0

Query: 2    EAEYVAACEAAKESVWLIKFLTDLEVVPNMHLPITLYCDNSGAVANLKGPRSHNRRKHIE 61
            EAEY+A  EA +E++WL   LT + +   +  PI +Y DN G ++    P  H R KHI+
Sbjct: 1295 EAEYMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPSCHKRAKHID 1354

Query: 62   RKYHLIRKIVQRGDVIITQIASEHNIADPFTKPLTAKVFEGHLVSLGL 110
             KYH  R+ VQ   + +  I +E+ +AD FTKPL A  F      LGL
Sbjct: 1355 IKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGL 1400

BLAST of CmaCh00G002230 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 4.1e-11
Identity = 37/100 (37.00%), Positives = 57/100 (57.00%), Query Frame = 0

Query: 2    EAEYVAACEAAKESVWLIKFLTDLEVVPNMHLPITLYCDNSGAVANLKGPRSHNRRKHIE 61
            EAEY+AA E  KE +WL +FL +L +    ++   +YCD+  A+   K    H R KHI+
Sbjct: 1221 EAEYIAATETGKEMIWLKRFLQELGLHQKEYV---VYCDSQSAIDLSKNSMYHARTKHID 1280

Query: 62   RKYHLIRKIVQRGDVIITQIASEHNIADPFTKPLTAKVFE 102
             +YH IR++V    + + +I++  N AD  TK +    FE
Sbjct: 1281 VRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFE 1317
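
The identity and positive percentages reported for the two hits above are the match counts divided by the alignment length. The following is a small sketch that reproduces them from the counts shown in each HSP header; the dictionary layout and the helper name percent are illustrative, not part of any BLAST tool.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / aligned_length, 2)

# Counts copied from the HSP headers of the two Swiss-Prot hits above.
hits = {
    "P04146": {"identities": 44, "positives": 61, "length": 108},
    "P10978": {"identities": 37, "positives": 57, "length": 100},
}

for accession, hsp in hits.items():
    identity = percent(hsp["identities"], hsp["length"])
    positives = percent(hsp["positives"], hsp["length"])
    print(f"{accession}: identity {identity:.2f}%, positives {positives:.2f}%")
```

For P04146 this gives 40.74% identity and 56.48% positives; for P10978 it gives 37.00% and 57.00%, in agreement with the reported values.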

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
P04146      1.4e-14  40.74     Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P10978      4.1e-11  37.00     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term    Source Description        Alignment
None      No IPR available  PFAM     PF14223        Retrotran_gag_2           coord: 138..244; e-value: 5.9E-14; score: 52.0
None      No IPR available  PANTHER  PTHR35317      OS04G0629600 PROTEIN      coord: 126..243
None      No IPR available  PANTHER  PTHR35317:SF8  POLYPROTEIN-LIKE PROTEIN  coord: 126..243
None      No IPR available  CDD      cd09272        RNase_HI_RT_Ty1           coord: 1..98; e-value: 2.84081E-40; score: 133.748

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh00G002230.1  CmaCh00G002230.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding