CmaCh00G002180 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh00G002180
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionGag/pol protein
LocationCma_Chr00: 15580074 .. 15581994 (-)
RNA-Seq ExpressionCmaCh00G002180
SyntenyCmaCh00G002180
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTATAAAGTTCTTAACTGATTTGGAAGTCGTTCCAAACATGCATCTTCCCATCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTTGAAAGAACCAAGAAGCCATAACCGAAGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCATCACACAGATAGCTTTGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCATGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAATCTAGGGCAAGTGGGAGAAAAGGCAAGAGTATGCGATGATTATATAATTGTTATATAAAGCATGTTAGATGCCTTTTATAGGTTATTTTCAACGTCATAAAACGAGATAGACGTTAAAATCTATAATGAAAATAAGTGCATGCTCGCTTAAGGTTAAGAACCAAGCACAACCAATTTTAATAGTTAAAATAGGTCGATGTCTTGTAAAATTTTGCTTACAAGAAAGCTCACCTGGCACGATCTTGACTAAGGCTGGAGGTACTTAAGTTGACGGTTTACGGAACACCTACTACTTAGTGGGAGATCGGACCAATACTTGAGCTTAGTTAGCCCAGTTTTATGAGCATGCTTGAGTGATGTAAGTTCTGGAAAAATTACCTAGACTTAGGTTATATTTAATTAGCCGAAATATACCTAAGTAATGAGTATACCCAACCGTTTAAAATACTTAGTGGGAGGAAAAAAGTATATGAGATACATCGTTTTCCTTTTCACGCTCTCTGAAAAATTCACACCGTGAGATTCATGCTCGGCCTCGTGTCGCCCTGGGAGCGTCCTCCATTCGGAAGGTATTTGCATGAGCCAATAACAAGGTGAATAGAGAAAGTACTTATAGTAAGTGGAAGAAGGAAGTATATCAACACGTCCTATGGTCTCCACCACTAGGTTGCGCCGTGAGATTCTCATGTTCCGCCTGCATGTTGCCCTAGAGAAACTACCCATTCGGAGGGTCGTAACATGGGAGTCGAAACAACGCAAACTCCAGAAATGGATACTATTTCTTAGGGCCATTCCCAACTTTGGCTTTTTCATTCAATAGCATTGTTGGGACCGATCTATGAGGCCTGACATTGGTGGGTCACACTTGCGAAGAATTACTAAGAGTCAGTGTTTAAGTCAAGAATGTCTTCGTTTGGTGAATGAGTGATTGACTGCCCCTCGGTAGCAGTTGCTCTAACTCACTAAAGTATCGTTGCAAATTAATAATTTTTTTTGGTACATTATTACATTTGCTAAAACTGATTGGATTAAACAGCATTAAGTCAAATCACTGATAGAATTTTCTTTATATCTCAGCATGACAAACTCAATAGTACAATTACTCGCTTTTGAGAAATCAAACCTGAACACAATACTGGTAATTGATGATTTAAGGTTTGTTTTAACTGAGGAATGTCCTCCAAAACCCAGCTCAAATGCAAACCGAACAGTTCGGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGTATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCACTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAATGGACCTCAGTTAGAGAACATGTACTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCATTTTTATCATGGAGTCTCTTTCGAAGAGCTTTCCGCACAAATGTGGTGATGAACAAAATAGAATATAA

mRNA sequence

ATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTATAAAGTTCTTAACTGATTTGGAAGTCGTTCCAAACATGCATCTTCCCATCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTTGAAAGAACCAAGAAGCCATAACCGAAGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCATCACACAGATAGCTTTGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCATGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTTTGTTTTAACTGAGGAATGTCCTCCAAAACCCAGCTCAAATGCAAACCGAACAGTTCGGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGTATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCACTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAATGGACCTCAGTTAGAGAACATGTACTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCATTTTTATCATGGAGTCTCTTTCGAAGAGCTTTCCGCACAAATGTGGTGATGAACAAAATAGAATATAA

Coding sequence (CDS)

ATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTATAAAGTTCTTAACTGATTTGGAAGTCGTTCCAAACATGCATCTTCCCATCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTTGAAAGAACCAAGAAGCCATAACCGAAGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCATCACACAGATAGCTTTGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCATGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTTTGTTTTAACTGAGGAATGTCCTCCAAAACCCAGCTCAAATGCAAACCGAACAGTTCGGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGTATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCACTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAATGGACCTCAGTTAGAGAACATGTACTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCATTTTTATCATGGAGTCTCTTTCGAAGAGCTTTCCGCACAAATGTGGTGATGAACAAAATAGAATATAA

Protein sequence

MEAEYVAACEAAKESVWLIKFLTDLEVVPNMHLPITLYCDNSGAVANLKEPRSHNRRKHIERKYHLIREIVQRGDVIITQIALEHNIADPFTKPLMAKVFEGHLVSLGLRVMFVLTEECPPKPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSTRHEAIKYIYNCRMKEWTSVREHVLDMMVHFNVAEENEAVIDEKSQVIFIMESLSKSFPHKCGDEQNRI
Homology
BLAST of CmaCh00G002180 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 81.6 bits (200), Expect = 1.4e-14
Identity = 45/108 (41.67%), Postives = 60/108 (55.56%), Query Frame = 0

Query: 2    EAEYVAACEAAKESVWLIKFLTDLEVVPNMHLPITLYCDNSGAVANLKEPRSHNRRKHIE 61
            EAEY+A  EA +E++WL   LT + +   +  PI +Y DN G ++    P  H R KHI+
Sbjct: 1295 EAEYMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPSCHKRAKHID 1354

Query: 62   RKYHLIREIVQRGDVIITQIALEHNIADPFTKPLMAKVFEGHLVSLGL 110
             KYH  RE VQ   + +  I  E+ +AD FTKPL A  F      LGL
Sbjct: 1355 IKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGL 1400

BLAST of CmaCh00G002180 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 4.1e-11
Identity = 38/100 (38.00%), Postives = 56/100 (56.00%), Query Frame = 0

Query: 2    EAEYVAACEAAKESVWLIKFLTDLEVVPNMHLPITLYCDNSGAVANLKEPRSHNRRKHIE 61
            EAEY+AA E  KE +WL +FL +L +    ++   +YCD+  A+   K    H R KHI+
Sbjct: 1221 EAEYIAATETGKEMIWLKRFLQELGLHQKEYV---VYCDSQSAIDLSKNSMYHARTKHID 1280

Query: 62   RKYHLIREIVQRGDVIITQIALEHNIADPFTKPLMAKVFE 102
             +YH IRE+V    + + +I+   N AD  TK +    FE
Sbjct: 1281 VRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFE 1317

BLAST of CmaCh00G002180 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 47.8 bits (112), Expect = 1.5e-05
Identity = 26/68 (38.24%), Postives = 36/68 (52.94%), Query Frame = 0

Query: 2   EAEYVAACEAAKESVWLIKFLTDLEVVPNMHLPITLYCDNSGAVANLKEPRSHNRRKHIE 61
           EAEY A   A  E +WL +F  +L++   +  P  L+CDN+ A+        H R KHIE
Sbjct: 488 EAEYRALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERTKHIE 547

Query: 62  RKYHLIRE 70
              H +RE
Sbjct: 548 SDCHSVRE 553

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P041461.4e-1441.67Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P109784.1e-1138.00Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
AT4G23160.11.5e-0538.24cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 138..244
e-value: 1.3E-13
score: 50.9
NoneNo IPR availablePANTHERPTHR35317:SF8POLYPROTEIN-LIKE PROTEINcoord: 126..243
NoneNo IPR availablePANTHERPTHR35317OS04G0629600 PROTEINcoord: 126..243
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 1..95
e-value: 5.03538E-40
score: 133.362

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh00G002180.1CmaCh00G002180.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding