ClCG03G008860 (gene) Watermelon (Charleston Gray)

Name: ClCG03G008860
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Centromere protein V
Location: CG_Chr03 : 11077799 .. 11078191 (+)
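
As a quick consistency check, the span implied by the Location field can be computed directly. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for genome annotations; the page itself does not state it). The function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: CG_Chr03 : 11077799 .. 11078191 (+)
length_bp = feature_length(11077799, 11078191)
print(length_bp)           # 393
print(length_bp % 3 == 0)  # True: the span is a whole number of codons
```

Under that assumption the span is 393 bp, which matches the length of the gene, mRNA and CDS sequences listed for this feature.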
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTGTGCACAGTGGTGGATGCCACTGCAAGAGAATAAGATGGGAAGTGAAAGCAGGAAGCAGTGTCACAGCTTGGGATTGCAACTGTTCAAACTGCTCCATGAGAGGGAATACACATTTCACTGTGCCTTCTCAACATTTTAAGCTTTTGGGAGAGTCAGACAAGTTTATTACAACCTACACTTTTGGGACTCATACTGCAAACCACACATTTTGCAAAGTTTGTGGGATCACTTCCTTTTATCATTCACGCTCAACCCCAGATGGGGTTTCTGTTAGTTTCAGATGTGTTGATCCAGGCACCTTGGATCATGTTCAGATTAACAAGTTTGATGGTGCAAATTGGGAGCAAGCTCATCATCTTCATCATTCTAATTTGTCAAACAACTAG

mRNA sequence

ATGGTTGTGCACAGTGGTGGATGCCACTGCAAGAGAATAAGATGGGAAGTGAAAGCAGGAAGCAGTGTCACAGCTTGGGATTGCAACTGTTCAAACTGCTCCATGAGAGGGAATACACATTTCACTGTGCCTTCTCAACATTTTAAGCTTTTGGGAGAGTCAGACAAGTTTATTACAACCTACACTTTTGGGACTCATACTGCAAACCACACATTTTGCAAAGTTTGTGGGATCACTTCCTTTTATCATTCACGCTCAACCCCAGATGGGGTTTCTGTTAGTTTCAGATGTGTTGATCCAGGCACCTTGGATCATGTTCAGATTAACAAGTTTGATGGTGCAAATTGGGAGCAAGCTCATCATCTTCATCATTCTAATTTGTCAAACAACTAG

Coding sequence (CDS)

ATGGTTGTGCACAGTGGTGGATGCCACTGCAAGAGAATAAGATGGGAAGTGAAAGCAGGAAGCAGTGTCACAGCTTGGGATTGCAACTGTTCAAACTGCTCCATGAGAGGGAATACACATTTCACTGTGCCTTCTCAACATTTTAAGCTTTTGGGAGAGTCAGACAAGTTTATTACAACCTACACTTTTGGGACTCATACTGCAAACCACACATTTTGCAAAGTTTGTGGGATCACTTCCTTTTATCATTCACGCTCAACCCCAGATGGGGTTTCTGTTAGTTTCAGATGTGTTGATCCAGGCACCTTGGATCATGTTCAGATTAACAAGTTTGATGGTGCAAATTGGGAGCAAGCTCATCATCTTCATCATTCTAATTTGTCAAACAACTAG

Protein sequence

MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITTYTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAHHLHHSNLSNN
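
The protein sequence above can be reproduced from the CDS by a straightforward translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; the helper name `translate` is illustrative and not part of any tool used to build this record.

```python
# Standard genetic code (assumed), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# CDS and protein copied from this record.
CDS = "ATGGTTGTGCACAGTGGTGGATGCCACTGCAAGAGAATAAGATGGGAAGTGAAAGCAGGAAGCAGTGTCACAGCTTGGGATTGCAACTGTTCAAACTGCTCCATGAGAGGGAATACACATTTCACTGTGCCTTCTCAACATTTTAAGCTTTTGGGAGAGTCAGACAAGTTTATTACAACCTACACTTTTGGGACTCATACTGCAAACCACACATTTTGCAAAGTTTGTGGGATCACTTCCTTTTATCATTCACGCTCAACCCCAGATGGGGTTTCTGTTAGTTTCAGATGTGTTGATCCAGGCACCTTGGATCATGTTCAGATTAACAAGTTTGATGGTGCAAATTGGGAGCAAGCTCATCATCTTCATCATTCTAATTTGTCAAACAACTAG"
PROTEIN = "MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITTYTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAHHLHHSNLSNN"


def translate(cds: str) -> str:
    """Translate a CDS codon by codon and drop a single terminal stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


print(len(CDS))                   # 393 nucleotides
print(translate(CDS) == PROTEIN)  # True
```

The 393-nucleotide CDS encodes 130 residues plus the terminal TAG stop codon.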
BLAST of ClCG03G008860 vs. Swiss-Prot
Match: CENPV_MOUSE (Centromere protein V OS=Mus musculus GN=Cenpv PE=1 SV=2)

HSP 1 Score: 133.7 bits (335), Expect = 1.5e-30
Identity = 60/123 (48.78%), Positives = 79/123 (64.23%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           +V H+GGCHC  +R+EV A + +  +DCNCS C  + N HF VP+  FKLL  ++  ITT
Sbjct: 122 LVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNRHFIVPASRFKLLKGAES-ITT 181

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTF TH A HTFCK CG+ SFY  RS P G  ++  C+D GT+  V   +F+G++WE+A 
Sbjct: 182 YTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDEGTVRSVVTEEFNGSDWERAM 241

Query: 121 HLH 124
             H
Sbjct: 242 KEH 243
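
The Identity and Positives percentages in an HSP header are the match counts divided by the alignment length. A minimal sketch using the counts reported for the CENPV_MOUSE hit follows; the helper name `percent` is hypothetical.

```python
def percent(count: int, alignment_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts from the CENPV_MOUSE HSP header: Identity = 60/123, Positives = 79/123
print(percent(60, 123))  # 48.78
print(percent(79, 123))  # 64.23
```

Identical residues appear as letters on the midline of the alignment, and conservative substitutions appear as `+`; the Positives count includes both.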

BLAST of ClCG03G008860 vs. Swiss-Prot
Match: CENPV_HUMAN (Centromere protein V OS=Homo sapiens GN=CENPV PE=1 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 3.3e-30
Identity = 59/123 (47.97%), Positives = 79/123 (64.23%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           +V H+GGCHC  +R+EV A + +  +DCNCS C  + N HF VP+  FKLL +  + ITT
Sbjct: 145 LVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNRHFIVPASRFKLL-KGAEHITT 204

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTF TH A HTFCK CG+ SFY  RS P G  ++  C+D GT+  +   +F+G++WE+A 
Sbjct: 205 YTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDEGTVRSMVTEEFNGSDWEKAM 264

Query: 121 HLH 124
             H
Sbjct: 265 KEH 266

BLAST of ClCG03G008860 vs. TrEMBL
Match: A0A0A0KN47_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G410710 PE=4 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 9.4e-64
Identity = 111/121 (91.74%), Positives = 115/121 (95.04%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           MVVHSGGCHCKRIRWEV+A SSV AWDCNCSNCSMRGNTHFTVPS+HFKLLG+SD FI+T
Sbjct: 7   MVVHSGGCHCKRIRWEVEAASSVIAWDCNCSNCSMRGNTHFTVPSKHFKLLGDSDDFIST 66

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTFGTHTA HTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQI KFDG NWEQAH
Sbjct: 67  YTFGTHTAKHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQIIKFDGTNWEQAH 126

Query: 121 H 122
           H
Sbjct: 127 H 127

BLAST of ClCG03G008860 vs. TrEMBL
Match: I3SET8_LOTJA (Uncharacterized protein OS=Lotus japonicus PE=2 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 5.3e-51
Identity = 87/120 (72.50%), Positives = 104/120 (86.67%), Query Frame = 1

Query: 2   VVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITTY 61
           VVH+GGCHCK +RW+V A SSV AWDCNCSNC MR N HF VP+++F+LLG+S KFITTY
Sbjct: 8   VVHNGGCHCKSVRWKVLAPSSVVAWDCNCSNCYMRANNHFVVPAENFELLGDSGKFITTY 67

Query: 62  TFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAHH 121
           TFGTHTA HTFCK+CGITSFY+ RS PDGV+VSFRCVDPGTL H++I  FDG NWE++++
Sbjct: 68  TFGTHTAKHTFCKICGITSFYYPRSNPDGVAVSFRCVDPGTLTHIEIRHFDGKNWERSYN 127

BLAST of ClCG03G008860 vs. TrEMBL
Match: A0A059BUT9_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02876 PE=4 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 1.2e-50
Identity = 85/120 (70.83%), Positives = 105/120 (87.50%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           +++HSGGCHC+R+RWEV+A +SV AW CNCS+CSMRGN HF VPS+ FKLLG SD+++TT
Sbjct: 5   LLLHSGGCHCRRVRWEVEAPTSVVAWKCNCSDCSMRGNIHFIVPSERFKLLGNSDQYLTT 64

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTFGTHTA HTFCKVCGITSFY  RS PDG++V++RCVDPGTL HV+I +FDG NWE ++
Sbjct: 65  YTFGTHTAKHTFCKVCGITSFYKPRSNPDGIAVTYRCVDPGTLAHVEIKQFDGQNWESSY 124

BLAST of ClCG03G008860 vs. TrEMBL
Match: U5FK86_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s02700g PE=4 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 2.0e-50
Identity = 86/121 (71.07%), Positives = 105/121 (86.78%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           MV+H+GGCHC+R+RW V+A SSV AW+CNCS+CSMRGNTHF VPS+ F+LLG+S +F+TT
Sbjct: 5   MVIHNGGCHCRRVRWRVQAPSSVVAWNCNCSDCSMRGNTHFIVPSEKFELLGDSKEFLTT 64

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTFGTHTA HTFCK CGITSFY  RS PDGV+V+FRCVDPGTL HV+I  +DG NWE ++
Sbjct: 65  YTFGTHTAKHTFCKFCGITSFYIPRSNPDGVAVTFRCVDPGTLTHVEIKHYDGRNWESSY 124

Query: 121 H 122
           +
Sbjct: 125 N 125

BLAST of ClCG03G008860 vs. TrEMBL
Match: I1MZF8_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_18G041500 PE=4 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 2.6e-50
Identity = 86/120 (71.67%), Positives = 102/120 (85.00%), Query Frame = 1

Query: 2   VVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITTY 61
           VVH+GGCHCK +RW+V A SSV AWDCNCS C MR NTHF VP+ +F+LLG+S+KF+TTY
Sbjct: 6   VVHTGGCHCKSVRWKVVAPSSVVAWDCNCSTCYMRANTHFIVPADNFELLGDSEKFLTTY 65

Query: 62  TFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAHH 121
           TF THTA HTFCK+CGITSFYH RS PDGV+V+FRCVDPGTL HV+I  FDG NW+ A++
Sbjct: 66  TFATHTAKHTFCKICGITSFYHPRSNPDGVAVTFRCVDPGTLTHVEIRHFDGKNWDSAYN 125

BLAST of ClCG03G008860 vs. TAIR10
Match: AT5G16940.1 (AT5G16940.1 carbon-sulfur lyases)

HSP 1 Score: 188.0 bits (476), Expect = 3.8e-48
Identity = 78/120 (65.00%), Positives = 92/120 (76.67%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           ++ H GGCHC +I+W VKA  SV AW CNCS+CSMRGN HF VPS +F+LL +S  FITT
Sbjct: 5   LIFHEGGCHCGKIKWRVKAARSVIAWSCNCSDCSMRGNVHFIVPSSNFELLDDSKDFITT 64

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTFGTHTA HTFCKVCGITSFY  RS PDGV+V+ +CV  GTL H+++  +DG NWE +H
Sbjct: 65  YTFGTHTAKHTFCKVCGITSFYIPRSNPDGVAVTVKCVKSGTLAHIEVKSYDGQNWEMSH 124

BLAST of ClCG03G008860 vs. NCBI nr
Match: gi|700195853|gb|KGN51030.1| (hypothetical protein Csa_5G410710 [Cucumis sativus])

HSP 1 Score: 250.8 bits (639), Expect = 1.3e-63
Identity = 111/121 (91.74%), Positives = 115/121 (95.04%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           MVVHSGGCHCKRIRWEV+A SSV AWDCNCSNCSMRGNTHFTVPS+HFKLLG+SD FI+T
Sbjct: 7   MVVHSGGCHCKRIRWEVEAASSVIAWDCNCSNCSMRGNTHFTVPSKHFKLLGDSDDFIST 66

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTFGTHTA HTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQI KFDG NWEQAH
Sbjct: 67  YTFGTHTAKHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQIIKFDGTNWEQAH 126

Query: 121 H 122
           H
Sbjct: 127 H 127

BLAST of ClCG03G008860 vs. NCBI nr
Match: gi|1021494247|ref|XP_016190113.1| (PREDICTED: centromere protein V [Arachis ipaensis])

HSP 1 Score: 209.5 bits (532), Expect = 3.4e-51
Identity = 88/116 (75.86%), Positives = 102/116 (87.93%), Query Frame = 1

Query: 2   VVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITTY 61
           VVH+GGCHCK +RW+V A SS+ AWDCNCS+CSMRGNTHF VP+ +F+LLGES KFITTY
Sbjct: 6   VVHTGGCHCKSVRWKVVAPSSIVAWDCNCSDCSMRGNTHFVVPAVNFQLLGESSKFITTY 65

Query: 62  TFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWE 118
           TFGTHTA HTFCK+CGI+SFYH RS PDGV+V+FRCVDPGTL HV++ K DG NWE
Sbjct: 66  TFGTHTAKHTFCKICGISSFYHPRSNPDGVAVTFRCVDPGTLTHVEVRKADGKNWE 121

BLAST of ClCG03G008860 vs. NCBI nr
Match: gi|388501428|gb|AFK38780.1| (unknown [Lotus japonicus])

HSP 1 Score: 208.4 bits (529), Expect = 7.7e-51
Identity = 87/120 (72.50%), Positives = 104/120 (86.67%), Query Frame = 1

Query: 2   VVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITTY 61
           VVH+GGCHCK +RW+V A SSV AWDCNCSNC MR N HF VP+++F+LLG+S KFITTY
Sbjct: 8   VVHNGGCHCKSVRWKVLAPSSVVAWDCNCSNCYMRANNHFVVPAENFELLGDSGKFITTY 67

Query: 62  TFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAHH 121
           TFGTHTA HTFCK+CGITSFY+ RS PDGV+VSFRCVDPGTL H++I  FDG NWE++++
Sbjct: 68  TFGTHTAKHTFCKICGITSFYYPRSNPDGVAVSFRCVDPGTLTHIEIRHFDGKNWERSYN 127

BLAST of ClCG03G008860 vs. NCBI nr
Match: gi|702374682|ref|XP_010062288.1| (PREDICTED: centromere protein V-like isoform X3 [Eucalyptus grandis])

HSP 1 Score: 207.2 bits (526), Expect = 1.7e-50
Identity = 85/120 (70.83%), Positives = 105/120 (87.50%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           +++HSGGCHC+R+RWEV+A +SV AW CNCS+CSMRGN HF VPS+ FKLLG SD+++TT
Sbjct: 5   LLLHSGGCHCRRVRWEVEAPTSVVAWKCNCSDCSMRGNIHFIVPSERFKLLGNSDQYLTT 64

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTFGTHTA HTFCKVCGITSFY  RS PDG++V++RCVDPGTL HV+I +FDG NWE ++
Sbjct: 65  YTFGTHTAKHTFCKVCGITSFYKPRSNPDGIAVTYRCVDPGTLAHVEIKQFDGQNWESSY 124

BLAST of ClCG03G008860 vs. NCBI nr
Match: gi|702374676|ref|XP_010062287.1| (PREDICTED: centromere protein V-like isoform X2 [Eucalyptus grandis])

HSP 1 Score: 207.2 bits (526), Expect = 1.7e-50
Identity = 85/120 (70.83%), Positives = 105/120 (87.50%), Query Frame = 1

Query: 1   MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITT 60
           +++HSGGCHC+R+RWEV+A +SV AW CNCS+CSMRGN HF VPS+ FKLLG SD+++TT
Sbjct: 40  LLLHSGGCHCRRVRWEVEAPTSVVAWKCNCSDCSMRGNIHFIVPSERFKLLGNSDQYLTT 99

Query: 61  YTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAH 120
           YTFGTHTA HTFCKVCGITSFY  RS PDG++V++RCVDPGTL HV+I +FDG NWE ++
Sbjct: 100 YTFGTHTAKHTFCKVCGITSFYKPRSNPDGIAVTYRCVDPGTLAHVEIKQFDGQNWESSY 159

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity (%)  Description
CENPV_MOUSE  1.5e-30  48.78         Centromere protein V OS=Mus musculus GN=Cenpv PE=1 SV=2
CENPV_HUMAN  3.3e-30  47.97         Centromere protein V OS=Homo sapiens GN=CENPV PE=1 SV=1

TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KN47_CUCSA  9.4e-64  91.74         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G410710 PE=4 SV=1
I3SET8_LOTJA      5.3e-51  72.50         Uncharacterized protein OS=Lotus japonicus PE=2 SV=1
A0A059BUT9_EUCGR  1.2e-50  70.83         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02876 PE=4 SV=1
U5FK86_POPTR      2.0e-50  71.07         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s02700g PE=4 SV=1
I1MZF8_SOYBN      2.6e-50  71.67         Uncharacterized protein OS=Glycine max GN=GLYMA_18G041500 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT5G16940.1  3.8e-48  65.00         carbon-sulfur lyases

NCBI nr
Match Name                         E-value  Identity (%)  Description
gi|700195853|gb|KGN51030.1|        1.3e-63  91.74         hypothetical protein Csa_5G410710 [Cucumis sativus]
gi|1021494247|ref|XP_016190113.1|  3.4e-51  75.86         PREDICTED: centromere protein V [Arachis ipaensis]
gi|388501428|gb|AFK38780.1|        7.7e-51  72.50         unknown [Lotus japonicus]
gi|702374682|ref|XP_010062288.1|   1.7e-50  70.83         PREDICTED: centromere protein V-like isoform X3 [Eucalyptus grandis]
gi|702374676|ref|XP_010062287.1|   1.7e-50  70.83         PREDICTED: centromere protein V-like isoform X2 [Eucalyptus grandis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR006913  GFA/CENP-V
IPR011057  Mss4-like_sf
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
Vocabulary: Molecular Function
Term        Definition
GO:0016846  carbon-sulfur lyase activity
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016846      carbon-sulfur lyase activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG03G008860.1  ClCG03G008860.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                                                            Source   Source Term         Source Description          Alignment
IPR006913  Glutathione-dependent formaldehyde-activating enzyme/centromere protein V  GENE3D   G3DSA:3.90.1590.10                              coord: 4..99; score: 2.0
IPR006913  Glutathione-dependent formaldehyde-activating enzyme/centromere protein V  PFAM     PF04828             GFA                         coord: 27..110; score: 3.
IPR011057  Mss4-like                                                                  unknown  SSF51316            Mss4-like                   coord: 2..102; score: 1.46
None       No IPR available                                                           PANTHER  PTHR28620           FAMILY NOT NAMED            coord: 1..119; score: 3.6
None       No IPR available                                                           PANTHER  PTHR28620:SF1       PROTEIN F25B4.8, ISOFORM A  coord: 1..119; score: 3.6
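
The `coord` values in the InterPro table can be used to cut the matched region out of the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein listed in this record; the function name `domain_region` is hypothetical, and the Pfam GFA match (27..110) is used as the example.

```python
# Protein sequence copied from this record.
PROTEIN = "MVVHSGGCHCKRIRWEVKAGSSVTAWDCNCSNCSMRGNTHFTVPSQHFKLLGESDKFITTYTFGTHTANHTFCKVCGITSFYHSRSTPDGVSVSFRCVDPGTLDHVQINKFDGANWEQAHHLHHSNLSNN"


def domain_region(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive (assumed)."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]


# Pfam PF04828 (GFA) match from the table: coord 27..110
gfa = domain_region(PROTEIN, 27, 110)
print(len(gfa))  # 84
print(gfa)
```

Under that assumption the GFA match covers 84 of the 130 residues.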