ClCG01G024800 (gene) Watermelon (Charleston Gray)

Name: ClCG01G024800
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Unknown protein
Location: CG_Chr01 : 37843708 .. 37846582 (-)
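The location string can be split into its parts to get the genomic span of the gene. The sketch below is illustrative only: `parse_location` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Split a location such as 'CG_Chr01 : 37843708 .. 37846582 (-)' into parts.

    Assumption: start and end are 1-based and inclusive, so the span is
    end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("CG_Chr01 : 37843708 .. 37846582 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2875 bp on the minus strand of CG_Chr01.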
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAGAATCAAAGATGGCAGCTTTCTGCGATCTCTTATGTAGAAGTGGAAGGAGTATTTTTGGGGACACTGCTCACGGTGGTGAAGCAAATAAATAGGAGTAGCGATGCCATTGCCATTGACAACCCGATCGATTCCTCTCTTCCGAGGCACTGTAACGAAAAACCCTAATTCAAATTCCGGGAGACGACGACGACGACCCTGATTCTGTCATTTGGGGAATACCAAATATGGGTCCCAATACGAAGACCCACTTCAGAGATTGAGCTTTTTTTCATGCCCTCTCCCATCTCTCTGTCGATTCCCTTTCCTTTCTTTCGAGACGCATCCATGGCTGACGAATCGCAACAACCCCAATCTGAATCTCAACCTCCGCTCGAAGTCGATTCTGCACCTCCGCCACCGCCTCCGCCTTCTTTTGACCCCAGTCGAAGTAATATTTCTTTCTCCATCTTCGTTTTCTGACTCTGAGGGGTTTCTTTCTACTCGTTGTTGCTCGATGAAATTTTTCGTTTGTTGGGTTTTGTTAGGTATTGAATGTCGTTTTGGGGATTTATTAGGGTTGGGTGGTTCTGATCTGAACTCTCGAGTAATGGGGTTGTTGTTTTTTAAGCACTTGAGATTGATGGATTTAAACATTCTAGCTATGGGAATGGGCGAATTGGCTCCCCTCTTCTTTATGATATTGTAGCAACAAACTGTAATTTTTTATCTTGTCCGTGAGTTTTTTACTTTGAACTTGGACATTCAGGTTTTGAAACTGTAGTTTTTTACTTTGAACTTGGACATTCAGGTTTTGAAACTGTTTTTCAATCTTTGGGTTTGCCGCTTTGTTGTATGAGATGGTAAAGTAGAGTTGATCCATTCCCAATTTGAATGGTAGAGCAGAGATGTTTTAAGAGACGTTTCTTCTTAAAATTTTCCAGTTCCATGGATTGTAATTGTACTGATCAGTTTTTTCTCTTTCTCTTTTGAAGTGATTGGTATCATTAAACGGAAGGCCTTGATAAAAGAACTAGCTGCTGCGTATCACGCTGAGTGTCTTGTATACTGCCAAGAGCTTCTTGAGCTTCAAAGAAAGTGGGACGAGGTATGTAATTCAGCCTTATATTCCTTCTGCTTCAATGTTTTTTTTTCTTGGGACGTTCTATCAAAATGATTCAATACTTGAGCTGTCAAAATTTTCTTTGATCGAGCAACTATACTAGCGAATCACTAATCTCTGGTCATTGTCTATGAATATTCCTTCTGTTTCAATGTTTTCTTCTTGGGACGTTTTATCAAAACGATTCACTACTTAAATTTTCTTTGATCGAGCAACTATACTAGCGAACCATTAATCCCTGGTCGTTTTCTATGAACCAGATGTTTGAACTGCAGATTTTTGCCACAGACAAGTTTTACTGATCATAATAATTATAGTTTCTTTCACTGTCGGAAAATCATTCAATGTGGCTGCATTATAATAACGGTTGGATTAAAAAATTTGCATGCCCCCTAATAAACCACTAGATATAAATTTGAAGTAGTTCTCTCATAAATCCAATGGCAAAACACTGTTAGCCACATTGGATAATTCCTCTTCATTGTGCAATGGATATCTGTGTGATTCCAGATTTGTAACCTTCGATTATTCACCTGTGCTTACCACAGGGTTATGAGGATTATAATATACTAGGTCTGTGGCTGGCAATTTATTTCTGGTAGTTATACTTATATCTTTGATTATAAGCTATTAAGGAACGTGGCTTTAGGTAGCAATTAGCCATAGAGAAAATATTTGTGGAGAATTCATATTGAGGTAGAGAATTTGTATGGATGGCTGAGATATTTTAACTAAACATAGATTCCGTTTTCATGATGGTGATGGTATTCCGGATCTGAGAGATTCTTCTACATGTTGAAAAATCCAATGATTTAATTTAAAGTTTCTTTACTTACTCATTGTTTCTTTATCTGCTTGCAGCCATATATTGAGTTAAAAGCACCTGATGATCCAAG
AAAGGAGACAATGAAGCCTAGCAAACGTCAAAGGAAATCCCGCTAGGTATTTAATAAGCATTTTAGGAATTGGTTAGTTAATCCTCTACCAAGCTTTTTGTAACTTTTAGTCTCATATGGGGAATTTTTTTTTTTCTTCTTTTCGTTTTATAAAATAATAACAGTAGGCGTTGCGGCCGAGGTTATGCCGTATAATATTTGATTTAAAAAAAAAAAAGCCATGAGAATAAACCAGTTTGGTAACTTTCTGGTTTTTGGTCTTTTGAATATTAAGCCTATAAATGCTGAGTCTCTTTGTTTTGTTTCTACTTTTTACAAATATTTTCAAAATCCACATCAAGTTTTGAAACTTGAAAGAAAATACTGGTTTTTGGAATTTAGAATGTGAGCCATGAATTTAAAAATAGTTTACAATTGATTATTGAATGGGACCTTAATTTTCTAAATGGAATGGTGATCGTTCTTAAAATTTATCAAAGTGGAATATGAGGACTTGTGATTACTTAACATACTTGATGCATTCAACATAGAACCTTTCTGCTTTTCCAGTGCAAACATATTTCTTGAGTTAGCAATTCTTGTTACCTTTCTACCATCTCTTAATCTTTTTTTTCTCCCTCTCTTGCAGGCAATAAATATATTTGATGCTCCTAACTTGACTTGCGAATCTGATGGTTCTTCCGTTCTGCAAGACTCAAGAGAGACTTATGCGCAAGATTGATAGTCTGCTATGAATTGGCTGATTACTCTATGGATCCAACAATTTGTACACTCAGTTATGAAAACAGTGAAGTATATAAAGTATCTGTGTGCTGTAGCCTGTATGTATTCAATTTTTGTGCGATAATATCGCTACACAGCGGTAGGATTTAGTT

mRNA sequence

GAAAGAATCAAAGATGGCAGCTTTCTGCGATCTCTTATGTAGAAGTGGAAGGAGTATTTTTGGGGACACTGCTCACGGTGGTGAAGCAAATAAATAGGAGTAGCGATGCCATTGCCATTGACAACCCGATCGATTCCTCTCTTCCGAGGCACTGTAACGAAAAACCCTAATTCAAATTCCGGGAGACGACGACGACGACCCTGATTCTGTCATTTGGGGAATACCAAATATGGGTCCCAATACGAAGACCCACTTCAGAGATTGAGCTTTTTTTCATGCCCTCTCCCATCTCTCTGTCGATTCCCTTTCCTTTCTTTCGAGACGCATCCATGGCTGACGAATCGCAACAACCCCAATCTGAATCTCAACCTCCGCTCGAAGTCGATTCTGCACCTCCGCCACCGCCTCCGCCTTCTTTTGACCCCAGTCGAATGATTGGTATCATTAAACGGAAGGCCTTGATAAAAGAACTAGCTGCTGCGTATCACGCTGAGTGTCTTGTATACTGCCAAGAGCTTCTTGAGCTTCAAAGAAAGTGGGACGAGCCATATATTGAGTTAAAAGCACCTGATGATCCAAGAAAGGAGACAATGAAGCCTAGCAAACGTCAAAGGAAATCCCGCTAGGTATTTAATAAGCATTTTAGGAATTGGCAATAAATATATTTGATGCTCCTAACTTGACTTGCGAATCTGATGGTTCTTCCGTTCTGCAAGACTCAAGAGAGACTTATGCGCAAGATTGATAGTCTGCTATGAATTGGCTGATTACTCTATGGATCCAACAATTTGTACACTCAGTTATGAAAACAGTGAAGTATATAAAGTATCTGTGTGCTGTAGCCTGTATGTATTCAATTTTTGTGCGATAATATCGCTACACAGCGGTAGGATTTAGTT

Coding sequence (CDS)

ATGCCCTCTCCCATCTCTCTGTCGATTCCCTTTCCTTTCTTTCGAGACGCATCCATGGCTGACGAATCGCAACAACCCCAATCTGAATCTCAACCTCCGCTCGAAGTCGATTCTGCACCTCCGCCACCGCCTCCGCCTTCTTTTGACCCCAGTCGAATGATTGGTATCATTAAACGGAAGGCCTTGATAAAAGAACTAGCTGCTGCGTATCACGCTGAGTGTCTTGTATACTGCCAAGAGCTTCTTGAGCTTCAAAGAAAGTGGGACGAGCCATATATTGAGTTAAAAGCACCTGATGATCCAAGAAAGGAGACAATGAAGCCTAGCAAACGTCAAAGGAAATCCCGCTAG

Protein sequence

MPSPISLSIPFPFFRDASMADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR
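
As a consistency check, the coding sequence above can be translated and compared with the listed protein. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein copied from this record.
CDS = "ATGCCCTCTCCCATCTCTCTGTCGATTCCCTTTCCTTTCTTTCGAGACGCATCCATGGCTGACGAATCGCAACAACCCCAATCTGAATCTCAACCTCCGCTCGAAGTCGATTCTGCACCTCCGCCACCGCCTCCGCCTTCTTTTGACCCCAGTCGAATGATTGGTATCATTAAACGGAAGGCCTTGATAAAAGAACTAGCTGCTGCGTATCACGCTGAGTGTCTTGTATACTGCCAAGAGCTTCTTGAGCTTCAAAGAAAGTGGGACGAGCCATATATTGAGTTAAAAGCACCTGATGATCCAAGAAAGGAGACAATGAAGCCTAGCAAACGTCAAAGGAAATCCCGCTAG"
PROTEIN = "MPSPISLSIPFPFFRDASMADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR"


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: do not include it in the protein
            break
        residues.append(aa)
    return "".join(residues)


translated = translate(CDS)
print(translated == PROTEIN)
```
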
BLAST of ClCG01G024800 vs. TrEMBL
Match: A0A0A0KM14_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G524060 PE=4 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 9.3e-47
Identity = 90/98 (91.84%), Positives = 95/98 (96.94%), Query Frame = 1

Query: 19  MADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYC 78
           MADESQQPQSE+QPPL++DS PPPPPPP FDPSRMIGII+RKALIKELAAAYHAECLVYC
Sbjct: 1   MADESQQPQSETQPPLDIDSVPPPPPPPRFDPSRMIGIIRRKALIKELAAAYHAECLVYC 60

Query: 79  QELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           QELLELQRKWDEPYIELKAPDDPRKET KPSKRQ+KSR
Sbjct: 61  QELLELQRKWDEPYIELKAPDDPRKETTKPSKRQKKSR 98
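
The Identity and Positives figures in an HSP header can be recomputed from the alignment rows. The sketch below uses the three rows of the hit above (query, midline, subject) with the coordinate columns stripped. It assumes the usual BLAST midline convention, where a letter marks an identical residue and `+` marks a conservative substitution; `hsp_stats` is a hypothetical helper.

```python
def hsp_stats(query, midline, subject):
    """Return (identities, positives, alignment_length) for one HSP."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("alignment rows must have equal length")
    # Identities: columns where query and subject carry the same residue.
    identities = sum(q == s and q != "-" for q, s in zip(query, subject))
    # Positives: identities plus columns the midline marks with '+'.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Rows copied from the A0A0A0KM14_CUCSA alignment, both blocks concatenated.
query = (
    "MADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYC"
    "QELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR"
)
midline = (
    "MADESQQPQSE+QPPL++DS PPPPPPP FDPSRMIGII+RKALIKELAAAYHAECLVYC"
    "QELLELQRKWDEPYIELKAPDDPRKET KPSKRQ+KSR"
)
subject = (
    "MADESQQPQSETQPPLDIDSVPPPPPPPRFDPSRMIGIIRRKALIKELAAAYHAECLVYC"
    "QELLELQRKWDEPYIELKAPDDPRKETTKPSKRQKKSR"
)

identities, positives, length = hsp_stats(query, midline, subject)
identity_pct = round(100 * identities / length, 2)
positive_pct = round(100 * positives / length, 2)
print(f"Identity = {identities}/{length} ({identity_pct}%)")
print(f"Positives = {positives}/{length} ({positive_pct}%)")
```
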

BLAST of ClCG01G024800 vs. TrEMBL
Match: A0A0B0N2G7_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_00739 PE=4 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 1.0e-32
Identity = 72/104 (69.23%), Positives = 80/104 (76.92%), Query Frame = 1

Query: 13  FFRDASMADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHA 72
           FF+   M+ E QQP     PPL     PPP PPP FDPSRMIGIIKRKALIKELAA YHA
Sbjct: 12  FFKTDPMSQEQQQPPPS--PPLASQPDPPPVPPPPFDPSRMIGIIKRKALIKELAAVYHA 71

Query: 73  ECLVYCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           ECL  CQELLELQ+KWDEP+I+LK PDD RKE ++PSKR +KSR
Sbjct: 72  ECLAKCQELLELQKKWDEPFIDLKIPDDLRKEKIRPSKRVKKSR 113

BLAST of ClCG01G024800 vs. TrEMBL
Match: A0A0D2VEH3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G238000 PE=4 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 7.1e-31
Identity = 69/98 (70.41%), Positives = 77/98 (78.57%), Query Frame = 1

Query: 19  MADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYC 78
           M+ E QQP     PPL+    PPP PP  FDPSRMIGIIKRKALIKELAA YHAECL  C
Sbjct: 1   MSQEQQQPPPS--PPLDSQPDPPPAPPLPFDPSRMIGIIKRKALIKELAAVYHAECLAKC 60

Query: 79  QELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           QELLELQ+KWDEP+I+LK PDD RKE ++PSKR +KSR
Sbjct: 61  QELLELQKKWDEPFIDLKIPDDLRKEKIRPSKRVKKSR 96

BLAST of ClCG01G024800 vs. TrEMBL
Match: W9QXC0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005901 PE=4 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 7.1e-31
Identity = 68/94 (72.34%), Positives = 76/94 (80.85%), Query Frame = 1

Query: 27  QSESQPPLEVDSAPP----PPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYCQELL 86
           Q +   P ++DSA P    PPPPP FDPSRMIGIIKRKALIKELAA YHAECL YCQELL
Sbjct: 4   QQQQTLPSQLDSAQPQSLPPPPPPPFDPSRMIGIIKRKALIKELAAVYHAECLKYCQELL 63

Query: 87  ELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           ELQRKWDEP+I+LK P+D RKETM+P KR +K R
Sbjct: 64  ELQRKWDEPFIDLKTPEDARKETMRPPKRLKKLR 97

BLAST of ClCG01G024800 vs. TrEMBL
Match: A0A061GE22_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_029436 PE=4 SV=1)

HSP 1 Score: 137.1 bits (344), Expect = 1.3e-29
Identity = 65/90 (72.22%), Positives = 71/90 (78.89%), Query Frame = 1

Query: 27  QSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYCQELLELQR 86
           Q + QP     S P  PPPP FDPSRMIGIIKRKALIKELAA YHAECL YCQELLELQR
Sbjct: 3   QEQQQPSPAPASQPDHPPPPPFDPSRMIGIIKRKALIKELAAVYHAECLAYCQELLELQR 62

Query: 87  KWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           KWDEP+I++K PDD RKE  +P KR +KSR
Sbjct: 63  KWDEPFIDVKTPDDLRKEKTRPPKRLKKSR 92

BLAST of ClCG01G024800 vs. TAIR10
Match: AT4G18400.1 (AT4G18400.1 unknown protein)

HSP 1 Score: 110.9 bits (276), Expect = 5.2e-25
Identity = 58/106 (54.72%), Positives = 74/106 (69.81%), Query Frame = 1

Query: 19  MADESQQ--PQSESQPPLEVDSAP--PPP----PPPSFDPSRMIGIIKRKALIKELAAAY 78
           MA+E ++  P+     P E  S P  PPP    PP +FDPSRMIGIIKRKALIK+LAAAY
Sbjct: 1   MAEEEKRTSPEPPLPSPPESSSQPEHPPPETSTPPATFDPSRMIGIIKRKALIKDLAAAY 60

Query: 79  HAECLVYCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           H ECL  C+ELLELQ++ DEP+++ KA +D RKET++ S ++ K +
Sbjct: 61  HVECLALCRELLELQKRKDEPFLDTKATEDLRKETLRSSSKRAKKK 106

BLAST of ClCG01G024800 vs. NCBI nr
Match: gi|449433722|ref|XP_004134646.1| (PREDICTED: uncharacterized protein LOC101211064 [Cucumis sativus])

HSP 1 Score: 194.1 bits (492), Expect = 1.3e-46
Identity = 90/98 (91.84%), Positives = 95/98 (96.94%), Query Frame = 1

Query: 19  MADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYC 78
           MADESQQPQSE+QPPL++DS PPPPPPP FDPSRMIGII+RKALIKELAAAYHAECLVYC
Sbjct: 1   MADESQQPQSETQPPLDIDSVPPPPPPPRFDPSRMIGIIRRKALIKELAAAYHAECLVYC 60

Query: 79  QELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           QELLELQRKWDEPYIELKAPDDPRKET KPSKRQ+KSR
Sbjct: 61  QELLELQRKWDEPYIELKAPDDPRKETTKPSKRQKKSR 98

BLAST of ClCG01G024800 vs. NCBI nr
Match: gi|659078216|ref|XP_008439608.1| (PREDICTED: uncharacterized protein LOC103484351 [Cucumis melo])

HSP 1 Score: 194.1 bits (492), Expect = 1.3e-46
Identity = 94/100 (94.00%), Positives = 96/100 (96.00%), Query Frame = 1

Query: 19  MADESQQPQSESQPPLEVDSAPPP--PPPPSFDPSRMIGIIKRKALIKELAAAYHAECLV 78
           MADESQQPQSESQPPLEVDS PPP  PPPP FDPSRMIGII+RKALIKELAAAYHAECLV
Sbjct: 1   MADESQQPQSESQPPLEVDSTPPPPPPPPPRFDPSRMIGIIRRKALIKELAAAYHAECLV 60

Query: 79  YCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           YCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQ+KSR
Sbjct: 61  YCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQKKSR 100

BLAST of ClCG01G024800 vs. NCBI nr
Match: gi|1009107039|ref|XP_015877564.1| (PREDICTED: uncharacterized protein LOC107413996 [Ziziphus jujuba])

HSP 1 Score: 152.1 bits (383), Expect = 5.8e-34
Identity = 72/95 (75.79%), Positives = 79/95 (83.16%), Query Frame = 1

Query: 22  ESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYCQEL 81
           ESQQPQ E+ PP        PPPPP FDPSRMIGIIKRKALIKELAA YHAECL YCQEL
Sbjct: 2   ESQQPQEEAFPPPPSLPDSTPPPPPPFDPSRMIGIIKRKALIKELAAVYHAECLAYCQEL 61

Query: 82  LELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           LELQRKWDEPYI+LK P+D RKET++PSKR +K+R
Sbjct: 62  LELQRKWDEPYIDLKTPEDTRKETLRPSKRLKKTR 96

BLAST of ClCG01G024800 vs. NCBI nr
Match: gi|728826549|gb|KHG06777.1| (hypothetical protein F383_00739 [Gossypium arboreum])

HSP 1 Score: 147.5 bits (371), Expect = 1.4e-32
Identity = 72/104 (69.23%), Positives = 80/104 (76.92%), Query Frame = 1

Query: 13  FFRDASMADESQQPQSESQPPLEVDSAPPPPPPPSFDPSRMIGIIKRKALIKELAAAYHA 72
           FF+   M+ E QQP     PPL     PPP PPP FDPSRMIGIIKRKALIKELAA YHA
Sbjct: 12  FFKTDPMSQEQQQPPPS--PPLASQPDPPPVPPPPFDPSRMIGIIKRKALIKELAAVYHA 71

Query: 73  ECLVYCQELLELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           ECL  CQELLELQ+KWDEP+I+LK PDD RKE ++PSKR +KSR
Sbjct: 72  ECLAKCQELLELQKKWDEPFIDLKIPDDLRKEKIRPSKRVKKSR 113

BLAST of ClCG01G024800 vs. NCBI nr
Match: gi|703063633|ref|XP_010087005.1| (hypothetical protein L484_005901 [Morus notabilis])

HSP 1 Score: 141.4 bits (355), Expect = 1.0e-30
Identity = 68/94 (72.34%), Positives = 76/94 (80.85%), Query Frame = 1

Query: 27  QSESQPPLEVDSAPP----PPPPPSFDPSRMIGIIKRKALIKELAAAYHAECLVYCQELL 86
           Q +   P ++DSA P    PPPPP FDPSRMIGIIKRKALIKELAA YHAECL YCQELL
Sbjct: 4   QQQQTLPSQLDSAQPQSLPPPPPPPFDPSRMIGIIKRKALIKELAAVYHAECLKYCQELL 63

Query: 87  ELQRKWDEPYIELKAPDDPRKETMKPSKRQRKSR 117
           ELQRKWDEP+I+LK P+D RKETM+P KR +K R
Sbjct: 64  ELQRKWDEPFIDLKTPEDARKETMRPPKRLKKLR 97

The following BLAST results are available for this feature:
Match Name          E-value    Identity   Description
A0A0A0KM14_CUCSA    9.3e-47    91.84      Uncharacterized protein OS=Cucumis sativus GN=Csa_6G524060 PE=4 SV=1
A0A0B0N2G7_GOSAR    1.0e-32    69.23      Uncharacterized protein OS=Gossypium arboreum GN=F383_00739 PE=4 SV=1
A0A0D2VEH3_GOSRA    7.1e-31    70.41      Uncharacterized protein OS=Gossypium raimondii GN=B456_010G238000 PE=4 SV=1
W9QXC0_9ROSA        7.1e-31    72.34      Uncharacterized protein OS=Morus notabilis GN=L484_005901 PE=4 SV=1
A0A061GE22_THECC    1.3e-29    72.22      Uncharacterized protein OS=Theobroma cacao GN=TCM_029436 PE=4 SV=1

Match Name          E-value    Identity   Description
AT4G18400.1         5.2e-25    54.72      unknown protein

Match Name                             E-value    Identity   Description
gi|449433722|ref|XP_004134646.1|       1.3e-46    91.84      PREDICTED: uncharacterized protein LOC101211064 [Cucumis sativus]
gi|659078216|ref|XP_008439608.1|       1.3e-46    94.00      PREDICTED: uncharacterized protein LOC103484351 [Cucumis melo]
gi|1009107039|ref|XP_015877564.1|      5.8e-34    75.79      PREDICTED: uncharacterized protein LOC107413996 [Ziziphus jujuba]
gi|728826549|gb|KHG06777.1|            1.4e-32    69.23      hypothetical protein F383_00739 [Gossypium arboreum]
gi|703063633|ref|XP_010087005.1|       1.0e-30    72.34      hypothetical protein L484_005901 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG01G024800.1    ClCG01G024800.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description    Source    Source Term      Source Description     Alignment
None       No IPR available   PANTHER   PTHR37242        FAMILY NOT NAMED       coord: 15..116, score: 7.0
None       No IPR available   PANTHER   PTHR37242:SF1    SUBFAMILY NOT NAMED    coord: 15..116, score: 7.0

The following gene(s) are paralogous to this gene:

None