ClCG08G004860 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG08G004860
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Gag-pol polyprotein
Location: CG_Chr08: 15210452 .. 15212279 (+)
RNA-Seq Expression: ClCG08G004860
Synteny: ClCG08G004860
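The Location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates (a common convention for GFF-style annotations that this page does not state explicitly).

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.match(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive,
    # so the feature spans end - start + 1 bases.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("CG_Chr08: 15210452 .. 15212279 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 1828 bases on the plus strand of CG_Chr08.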
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCAGTCGCCCACAAAGGAACTTCCAAAGTCAAAATGTCGAGGCTCCAATTGTTGACTTCCAAATTTGAAAATTTGAGAATGAATGATGATGAATCAATTGCCGAGTTTAATGTGTGTCTACTGGATATTGCCAATGAATCTTTTGTCCTTGGAGAAAAGATCTCAGAGGAAAGGTTGGTTCAGAAGGTGCTCAGATCTTTGCAAAGAAGTTTGATATGAAAGTAAACTGTTGTAAAAGAGGATTAGGATATTGCATCGTTGAAAGTGGATGAGTTGTTTGGGTCTCTACGAACCTTTGAAATGTCCCTTGACAGGAAACCAGAGAAGAAGACTGAAAGCATTGCTCTCCAATCTGTTGGTAAGCCTCTCAAAGAAGCCACAGACAAAGAATCTAATGAATCTCTAGCTAAATCTATAGTCGTTCTCTCTCTAAACAATTCGATAAAGTACTGAAAAGTTTTGGAAAGAAATTTGGTGGAAAGTCGACTATTTAGCAAACCATTCAACGCAGTTAATAATAAGAAGGATGGCGACAAACCTGTGAGTTCCTATACTCAAAAAGAAAAGAATCTTAGGTGAAGAGAATGCGAAGGTTTTGGACATTTTCAAGCAGAATGTCCAACGTTCCTCAAGAGACAAAATAGAAGCTATGTTGTGACGTTATTAGATGATGAGACAGACTCTACTGAGTTTGAGGCAGATGTTAAAGCTCTTTTCAGTAACATCTCCTCTGACACATCGTCAGGTGATATCTGAATGCATATGTTGAAAATGAGATTGAGGATAATATTCATGTGTCTGATGATCCATTGTATTAGGCGTTGTTCACCAAATGGCATGAAGATATGAAAGTGCTGGATATTCAGAAAGAAGGAATTGAGTCCTTGTTGGCTGACAACCATCGCTTAATGTCAACCATTGCAGAACTAAAACGTGCACTAGTCCATTCTAAGGTTGAAAAGGAATCCATGGTGAAGAACATCAGGATGCTCAATTTTGGCACTAAATTGCTTGAAAACATTCTTTCTAAAGGAAAAACCGCAGGTGATCATCATGGCCTGGGGTTGAGCAATTCTAAGGAGCAATCCACATCCCAATCCTAGAGTAAGGGAAAATAAGCTGTTGAGTTTGTTAAGGCTCAGAACCCAGAAACCTTTGGTGTTGCTAAGGCCTTAACTAGTTACCGACATAGAACATCTAATAGGCACCACAAATCTCGATGGATCTGTCACTTTTGTGGCAAACTTGGTCACATAAGACCCTTCTATCATTAGCTGCTAAATATAACTAGAAGAACTACCAACTATGGTAGAAATCCCCAGTACTTTCATTCTAAAAGGCTAGAAGGAAGAAAGGTAAAAGTTGTTTGGAGAGTAAAGGAGAATCAGTCGAAATGTAACTTAGTTTTGACTTCTCTTAGATCATCTGCTCGGGATGACTGGAATTTTGACAGTGGCAGCTCTTGCCACATGATTTGTGATTGAAATTATCTAACTAATCTCATGCTAGTAAGTGCAGGTAAAGTCACTTTTGGAGATGGAGCCATCAGAAGAATCATTGGGAAGGGAAAGCTCAACGTGCAAGGACTACCTTCTCTTGAGGATGATATGTTGGTCAAAGGATTGAATCCAATCTAATTAGCATCAGTCAGCTTTGTGATAAAAAACTACAAGTAAGTTTCATGAAAGAACAATGTATTGTTACTGATGATGTTAAAACACTTGTTATGGCAGGCACTAGATCATCTGACAACTGCTATTTGTGGAATCTGGAATCTACATCCCCAGTTTGCCATTTAGCTAGGCAAGACGAAGCAGATCACTGA

mRNA sequence

ATGTCAGTCGCCCACAAAGGAACTTCCAAAGTCAAAATGTCGAGGCTCCAATTGTTGACTTCCAAATTTGAAAATTTGAGAATGAATGATGATGAATCAATTGCCGAGTTTAATGTGTGTCTACTGGATATTGCCAATGAATCTTTTGTCCTTGGAGAAAAGATCTCAGAGGAAAGGAAACCAGAGAAGAAGACTGAAAGCATTGCTCTCCAATCTGTTGATGATGAGACAGACTCTACTGAGTTTGAGGCAGATGTTAAAGCTCTTTTCAGTAACATCTCCTCTGACACATCGTCAGATATGAAAGTGCTGGATATTCAGAAAGAAGGAATTGAGTCCTTGTTGGCTGACAACCATCGCTTAATGTCAACCATTGCAGAACTAAAACGTGCACTAGTCCATTCTAAGGTTGAAAAGGAATCCATGGTGAAGAACATCAGGATGCTCAATTTTGGCACTAAATTGCTTGAAAACATTCTTTCTAAAGGAAAAACCGCAGGCACTAGATCATCTGACAACTGCTATTTGTGGAATCTGGAATCTACATCCCCAGTTTGCCATTTAGCTAGGCAAGACGAAGCAGATCACTGA

Coding sequence (CDS)

ATGTCAGTCGCCCACAAAGGAACTTCCAAAGTCAAAATGTCGAGGCTCCAATTGTTGACTTCCAAATTTGAAAATTTGAGAATGAATGATGATGAATCAATTGCCGAGTTTAATGTGTGTCTACTGGATATTGCCAATGAATCTTTTGTCCTTGGAGAAAAGATCTCAGAGGAAAGGAAACCAGAGAAGAAGACTGAAAGCATTGCTCTCCAATCTGTTGATGATGAGACAGACTCTACTGAGTTTGAGGCAGATGTTAAAGCTCTTTTCAGTAACATCTCCTCTGACACATCGTCAGATATGAAAGTGCTGGATATTCAGAAAGAAGGAATTGAGTCCTTGTTGGCTGACAACCATCGCTTAATGTCAACCATTGCAGAACTAAAACGTGCACTAGTCCATTCTAAGGTTGAAAAGGAATCCATGGTGAAGAACATCAGGATGCTCAATTTTGGCACTAAATTGCTTGAAAACATTCTTTCTAAAGGAAAAACCGCAGGCACTAGATCATCTGACAACTGCTATTTGTGGAATCTGGAATCTACATCCCCAGTTTGCCATTTAGCTAGGCAAGACGAAGCAGATCACTGA

Protein sequence

MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEERKPEKKTESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKGKTAGTRSSDNCYLWNLESTSPVCHLARQDEADH
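The protein sequence is the conceptual translation of the CDS. The sketch below illustrates this with the standard genetic code (NCBI translation table 1), which is an assumption here since the page does not name the table used. The helper `translate` and the constant names are hypothetical; only the first 27 and last 12 bases of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X = unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCAGTCGCCCACAAAGGAACTTCC"))  # start of the CDS -> MSVAHKGTS
print(translate("GCAGATCACTGA"))                 # end of the CDS   -> ADH
```

Both fragments agree with the corresponding ends of the protein sequence listed above (MSVAHKGTS... and ...ADH).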
Homology
BLAST of ClCG08G004860 vs. NCBI nr
Match: XP_038896219.1 (uncharacterized protein LOC120084497 [Benincasa hispida])

HSP 1 Score: 102.8 bits (255), Expect = 3.4e-18
Identity = 78/171 (45.61%), Positives = 96/171 (56.14%), Query Frame = 0

Query: 13  MSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE-------------- 72
           MSRLQLLTSKFENLRM++DE I +FNV +LDI+NES  LGEKIS+E              
Sbjct: 1   MSRLQLLTSKFENLRMHNDEFIGDFNV-MLDISNESSALGEKISKEKPVRKMFRSLSKGF 60

Query: 73  --------------RKPEKK------------TESIALQSVDDET-DSTEFEADVKALFS 132
                         RK +K+            +E   L   D ET +S++ E DVKAL+ 
Sbjct: 61  DMKVTTIEEAQDITRKNDKEIKVECPNFLKKHSEGYVLTWFDTETEESSDSEEDVKALW- 120

Query: 133 NISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESM 143
                   D K LDI KE I++L+ DNHRLMS IAELKR L   KVEK++M
Sbjct: 121 ------QEDQKALDILKEKIDTLITDNHRLMSMIAELKRELSQVKVEKDTM 163
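The percentages in each HSP summary line are the ratio of matching positions to alignment length. As a minimal sketch (the function name `hsp_percentages` and the regular expression are illustrative, not part of any BLAST tool), the counts can be extracted and the percentages recomputed like this:

```python
import re

# Captures the "matches/length" pairs for identities and positives.
# "Pos\w*" accepts any spelling of the second label that starts with "Pos".
HSP_STATS = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

stats = hsp_percentages(
    "Identity = 78/171 (45.61%), Positives = 96/171 (56.14%), Query Frame = 0"
)
print(stats)  # (45.61, 56.14)
```

For the hit above, 78 identical and 96 similar positions over a 171-column alignment give 45.61% and 56.14%, matching the reported values.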

BLAST of ClCG08G004860 vs. NCBI nr
Match: KAA0059847.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK24801.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 101.3 bits (251), Expect = 9.8e-18
Identity = 76/237 (32.07%), Positives = 122/237 (51.48%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER- 60
           + VA++GTSKVK+SRLQL+TSKFE L+M +DE+++E+N  +L+IAN+S +L EKI E + 
Sbjct: 85  LEVAYEGTSKVKISRLQLITSKFEALKMTEDETVSEYNERVLEIANDSLLLAEKIPESKI 144

Query: 61  ----------------------------------------------KPEKKTESIALQSV 120
                                                         +  KK + IA +SV
Sbjct: 145 VCKVLRSLPRKFDMKVTAIEEAQDITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKSV 204

Query: 121 DDE---TDSTEFEADVKALFSNISSD---TSSDMKVL--------DIQKEGIESLLADNH 177
            D+    + +E  ++  +  SNI+ D   T  ++K+L         IQKE I+ L+ +N 
Sbjct: 205 YDQENTVNQSEINSEADSESSNINEDEELTLEELKILRKEDSEARAIQKERIQDLMDENE 264

BLAST of ClCG08G004860 vs. NCBI nr
Match: KAA0035264.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK07573.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 90.5 bits (223), Expect = 1.7e-14
Identity = 67/215 (31.16%), Positives = 108/215 (50.23%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER- 60
           + VA++ T++VK+SRLQL+TSKFE L+M +DESI+E+N  +L+IANES + G++I E + 
Sbjct: 83  LEVAYESTTRVKISRLQLITSKFEALKMFEDESISEYNERVLEIANESLLFGKRIPESKI 142

Query: 61  -------KPEK--------------------------------------KTESIALQSVD 120
                   PEK                                       T  I   + +
Sbjct: 143 VRKVLRSLPEKFDMKIECPTFLRRQKKNLRAILSDEDPYDSEEDNDMNAFTIRITKTNFE 202

Query: 121 DETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVH 170
           DE++S++   D +  F  +      D +   +QK  I+ L+ +N  LMS I+ LK  L  
Sbjct: 203 DESESSKENCDNELTFEKLKVLWKEDSEARTVQKVKIQELMKENEHLMSVISSLKLKLRE 262

BLAST of ClCG08G004860 vs. NCBI nr
Match: KAA0056457.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK29070.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 89.0 bits (219), Expect = 5.0e-14
Identity = 65/198 (32.83%), Positives = 101/198 (51.01%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE-- 60
           ++    GTSKVK+SRLQL+TSKFE LRM  DES++++N  +L+I NES +LGEKI +   
Sbjct: 31  LNAIFNGTSKVKISRLQLITSKFEALRMTKDESMSDYNKRVLEITNESLLLGEKIPDSKI 90

Query: 61  ----------RKPEKK-----------------------TESIALQSVDDETDSTEFEAD 120
                     RK EK                        T  I  ++ DD+++  E   +
Sbjct: 91  VRKAKCPTHLRKQEKNFRVTLSDEESGDSRDDDRNINAFTIRITDENSDDDSECFEESKN 150

Query: 121 VKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKN 164
                  + +    D +   IQKE ++ L+ +N RL+S I+ LK  L   + E + ++K+
Sbjct: 151 DDVTIEKLEALWKEDCEARAIQKERMQDLIEENERLLSVISSLKLKLREVQNENDQILKS 210

BLAST of ClCG08G004860 vs. NCBI nr
Match: XP_024019486.1 (uncharacterized protein LOC112091030 [Morus notabilis])

HSP 1 Score: 87.8 bits (216), Expect = 1.1e-13
Identity = 63/175 (36.00%), Positives = 98/175 (56.00%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER- 60
           + VAH+GT+ V+ S+L +LT++FE LRM + E+I+EFN  L DIANESF LGEKISE + 
Sbjct: 110 LEVAHEGTATVRQSKLNMLTTRFETLRMLEIETISEFNSRLCDIANESFALGEKISEAKL 169

Query: 61  --KPEKKTESIALQSVDD------ETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGI 120
             KPE+        S  +      ET    +E+        +  + S + K+ D+ +E  
Sbjct: 170 SVKPEENGNLFDSDSFSNDGELTHETIQEAYESMFNKWIQVVKLNKSLEKKLADVVQEKE 229

Query: 121 ESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKGKTA 167
           +  +    R  S IAE+++ L  +  E E   K ++M+N GT  L++I S  K++
Sbjct: 230 DPKM----RHESKIAEMRKRLQEANAELERTQKTLKMMNTGTAKLDHIFSMRKSS 280

BLAST of ClCG08G004860 vs. ExPASy TrEMBL
Match: A0A5D3DMG6 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold184G00490 PE=4 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 4.7e-18
Identity = 76/237 (32.07%), Positives = 122/237 (51.48%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER- 60
           + VA++GTSKVK+SRLQL+TSKFE L+M +DE+++E+N  +L+IAN+S +L EKI E + 
Sbjct: 85  LEVAYEGTSKVKISRLQLITSKFEALKMTEDETVSEYNERVLEIANDSLLLAEKIPESKI 144

Query: 61  ----------------------------------------------KPEKKTESIALQSV 120
                                                         +  KK + IA +SV
Sbjct: 145 VCKVLRSLPRKFDMKVTAIEEAQDITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKSV 204

Query: 121 DDE---TDSTEFEADVKALFSNISSD---TSSDMKVL--------DIQKEGIESLLADNH 177
            D+    + +E  ++  +  SNI+ D   T  ++K+L         IQKE I+ L+ +N 
Sbjct: 205 YDQENTVNQSEINSEADSESSNINEDEELTLEELKILRKEDSEARAIQKERIQDLMDENE 264

BLAST of ClCG08G004860 vs. ExPASy TrEMBL
Match: A0A5A7SVG0 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold852G00050 PE=4 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 8.4e-15
Identity = 67/215 (31.16%), Positives = 108/215 (50.23%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER- 60
           + VA++ T++VK+SRLQL+TSKFE L+M +DESI+E+N  +L+IANES + G++I E + 
Sbjct: 83  LEVAYESTTRVKISRLQLITSKFEALKMFEDESISEYNERVLEIANESLLFGKRIPESKI 142

Query: 61  -------KPEK--------------------------------------KTESIALQSVD 120
                   PEK                                       T  I   + +
Sbjct: 143 VRKVLRSLPEKFDMKIECPTFLRRQKKNLRAILSDEDPYDSEEDNDMNAFTIRITKTNFE 202

Query: 121 DETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVH 170
           DE++S++   D +  F  +      D +   +QK  I+ L+ +N  LMS I+ LK  L  
Sbjct: 203 DESESSKENCDNELTFEKLKVLWKEDSEARTVQKVKIQELMKENEHLMSVISSLKLKLRE 262

BLAST of ClCG08G004860 vs. ExPASy TrEMBL
Match: A0A5D3E0N2 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold120G001900 PE=4 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 2.4e-14
Identity = 65/198 (32.83%), Positives = 101/198 (51.01%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE-- 60
           ++    GTSKVK+SRLQL+TSKFE LRM  DES++++N  +L+I NES +LGEKI +   
Sbjct: 31  LNAIFNGTSKVKISRLQLITSKFEALRMTKDESMSDYNKRVLEITNESLLLGEKIPDSKI 90

Query: 61  ----------RKPEKK-----------------------TESIALQSVDDETDSTEFEAD 120
                     RK EK                        T  I  ++ DD+++  E   +
Sbjct: 91  VRKAKCPTHLRKQEKNFRVTLSDEESGDSRDDDRNINAFTIRITDENSDDDSECFEESKN 150

Query: 121 VKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKN 164
                  + +    D +   IQKE ++ L+ +N RL+S I+ LK  L   + E + ++K+
Sbjct: 151 DDVTIEKLEALWKEDCEARAIQKERMQDLIEENERLLSVISSLKLKLREVQNENDQILKS 210

BLAST of ClCG08G004860 vs. ExPASy TrEMBL
Match: A0A5D3BVD8 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold121G00400 PE=4 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 2.7e-13
Identity = 73/248 (29.44%), Positives = 115/248 (46.37%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISE--- 60
           + VA++GTSKVK+SRLQL+TSKFE + M +DES++++N  +L+IANES +L EKI +   
Sbjct: 70  LEVAYEGTSKVKISRLQLITSKFEAVSMTEDESVSDYNKRVLEIANESLLLREKIPDSKI 129

Query: 61  ---------------------------------------------ERKPE-----KKTES 120
                                                        +RK E     +K ++
Sbjct: 130 VWKVLRSLPKKFDMKVTAIDEAHNITTLRLDELFGLLLTFETATADRKAECPTFLRKQKN 189

Query: 121 IALQSVDDET-DSTEFEADVKA----LFSNISSDTS---------------------SDM 170
             +   D+E+ DS + ++++ A    +   I+ D S                      D 
Sbjct: 190 FCVTLSDEESGDSRDDDSNINAFTIRITYKITDDESECSEESKSDELTIEKLEALWKEDC 249

BLAST of ClCG08G004860 vs. ExPASy TrEMBL
Match: A0A5A7U4G8 (Gag-proteinase polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold163G001030 PE=4 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 7.8e-13
Identity = 45/91 (49.45%), Positives = 67/91 (73.63%), Query Frame = 0

Query: 1   MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEERK 60
           + VA++GTSKVK+SRLQ+LTS+FE L+MNDDE+IA+FNV +LD+ANESF LGEKI++ + 
Sbjct: 126 LEVAYEGTSKVKVSRLQILTSRFEALKMNDDETIAKFNVRVLDLANESFTLGEKIAKSKM 185

Query: 61  PEKKTESI------ALQSVDDETDSTEFEAD 86
            +K   S+       + ++++  D T  + D
Sbjct: 186 VQKVLRSLPSRFSMKMTAIEEANDITTMKLD 216

The following BLAST results are available for this feature:

BLAST of ClCG08G004860 vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038896219.1  3.4e-18  45.61         uncharacterized protein LOC120084497 [Benincasa hispida]
KAA0059847.1    9.8e-18  32.07         gag-pol polyprotein [Cucumis melo var. makuwa] >TYK24801.1 gag-pol polyprotein [...
KAA0035264.1    1.7e-14  31.16         gag-pol polyprotein [Cucumis melo var. makuwa] >TYK07573.1 gag-pol polyprotein [...
KAA0056457.1    5.0e-14  32.83         gag-pol polyprotein [Cucumis melo var. makuwa] >TYK29070.1 gag-pol polyprotein [...
XP_024019486.1  1.1e-13  36.00         uncharacterized protein LOC112091030 [Morus notabilis]

BLAST of ClCG08G004860 vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5D3DMG6      4.7e-18  32.07         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold184G...
A0A5A7SVG0      8.4e-15  31.16         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold852G...
A0A5D3E0N2      2.4e-14  32.83         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold120G...
A0A5D3BVD8      2.7e-13  29.44         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold121G...
A0A5A7U4G8      7.8e-13  49.45         Gag-proteinase polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source  Source Term  Source Description  Alignment
None      No IPR available  COILS   Coil         Coil                coord: 111..131

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG08G004860.2  ClCG08G004860.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008233 peptidase activity
molecular_function GO:0008270 zinc ion binding