ClCG01G021310 (gene) Watermelon (Charleston Gray)

Name: ClCG01G021310
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Gb|AAF04428.1
Location: CG_Chr01 : 35186033 .. 35186326 (-)
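The Location field can be checked against the sequence lengths shown further down. The sketch below parses the location string and computes the feature span. The regular expression is a hypothetical pattern written for this page's layout, and the coordinates are assumed to be 1-based and inclusive; neither is stated by the record itself.

```python
import re

# Location string as shown in the record above.
location = "CG_Chr01 : 35186033 .. 35186326 (-)"

# Hypothetical pattern for the "seqid : start .. end (strand)" layout.
pattern = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")
match = pattern.fullmatch(location)
if match is None:
    raise ValueError(f"unrecognised location string: {location!r}")

seqid = match.group(1)
start = int(match.group(2))
end = int(match.group(3))
strand = match.group(4)

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
length = end - start + 1
codons, leftover = divmod(length, 3)

print(seqid, strand, length)  # chromosome, strand, span in base pairs
print(codons, leftover)       # whole codons and any leftover bases
```

Under that assumption the span is a whole number of codons, which is consistent with a single-exon gene whose gene, mRNA and CDS sequences are identical.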
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTTAAGATGGCTGCTTCATTCAACAAGTTATCTTCTTGGGAACCCAACTGAGGTCCATGGCTATGGTGAAGAGAGAAGTTCAAAAGTTAAAAAGGGTTATGAAGAAATATGCACTTCTGGGTTTCAAATGCCTCTTCATTACCCTCGCTACAACAAGGCAGATTACCAGAAGATGGAGGAGTGGAAGCTTGATCTCCTTCTCAAGGAATATGGCTTGAGTTTTCAAGGCAGTTTGGAGGAGAAGAGGGCTTTTGCAATGGGTGCTTTTCTATGGCCTGATCAGTATTGA

mRNA sequence

ATGGCTTTAAGATGGCTGCTTCATTCAACAAGTTATCTTCTTGGGAACCCAACTGAGGTCCATGGCTATGGTGAAGAGAGAAGTTCAAAAGTTAAAAAGGGTTATGAAGAAATATGCACTTCTGGGTTTCAAATGCCTCTTCATTACCCTCGCTACAACAAGGCAGATTACCAGAAGATGGAGGAGTGGAAGCTTGATCTCCTTCTCAAGGAATATGGCTTGAGTTTTCAAGGCAGTTTGGAGGAGAAGAGGGCTTTTGCAATGGGTGCTTTTCTATGGCCTGATCAGTATTGA

Coding sequence (CDS)

ATGGCTTTAAGATGGCTGCTTCATTCAACAAGTTATCTTCTTGGGAACCCAACTGAGGTCCATGGCTATGGTGAAGAGAGAAGTTCAAAAGTTAAAAAGGGTTATGAAGAAATATGCACTTCTGGGTTTCAAATGCCTCTTCATTACCCTCGCTACAACAAGGCAGATTACCAGAAGATGGAGGAGTGGAAGCTTGATCTCCTTCTCAAGGAATATGGCTTGAGTTTTCAAGGCAGTTTGGAGGAGAAGAGGGCTTTTGCAATGGGTGCTTTTCTATGGCCTGATCAGTATTGA

Protein sequence

MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY
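
The protein sequence above can be reproduced from the coding sequence. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and translation in frame 1; the names `translate`, `CODON_TABLE`, `CDS` and `PROTEIN` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Coding sequence and protein sequence copied from the record above.
CDS = "ATGGCTTTAAGATGGCTGCTTCATTCAACAAGTTATCTTCTTGGGAACCCAACTGAGGTCCATGGCTATGGTGAAGAGAGAAGTTCAAAAGTTAAAAAGGGTTATGAAGAAATATGCACTTCTGGGTTTCAAATGCCTCTTCATTACCCTCGCTACAACAAGGCAGATTACCAGAAGATGGAGGAGTGGAAGCTTGATCTCCTTCTCAAGGAATATGGCTTGAGTTTTCAAGGCAGTTTGGAGGAGAAGAGGGCTTTTGCAATGGGTGCTTTTCTATGGCCTGATCAGTATTGA"
PROTEIN = "MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY"

print(len(CDS), len(PROTEIN))
print(translate(CDS) == PROTEIN)
```

The final codon is a stop codon, so the translated protein is one residue shorter than the number of codons in the CDS.
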
BLAST of ClCG01G021310 vs. TrEMBL
Match: A0A0A0KMW8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G511080 PE=4 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 8.9e-43
Identity = 84/96 (87.50%), Positives = 87/96 (90.62%), Query Frame = 1

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          M LR LLHS SYLLGNP E H YGEERSSK KKGYEE+C SGFQMPLHYPRY K+DYQKM
Sbjct: 1  MDLRGLLHSVSYLLGNPNEAHAYGEERSSKGKKGYEELCNSGFQMPLHYPRYKKSDYQKM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
          EEWKLDLLLKEYGLSF+GSLEEKRAFAMGAFLWPDQ
Sbjct: 61 EEWKLDLLLKEYGLSFEGSLEEKRAFAMGAFLWPDQ 96
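
The Identity figure reported for this HSP can be recomputed from the two aligned rows. The sketch below counts identical columns in the query and subject strings of the alignment above; the `identity` helper is illustrative, not BLAST's own code. The Positives count is not reproduced here because it depends on the substitution matrix used for the search, which the record does not state.

```python
# Aligned rows of the HSP above, with the two 60-column blocks joined.
query = (
    "MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM"
    "EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ"
)
sbjct = (
    "MDLRGLLHSVSYLLGNPNEAHAYGEERSSKGKKGYEELCNSGFQMPLHYPRYKKSDYQKM"
    "EEWKLDLLLKEYGLSFEGSLEEKRAFAMGAFLWPDQ"
)


def identity(a: str, b: str) -> tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity)."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    columns = len(a)
    # A gap character never counts as an identity.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return matches, columns, 100 * matches / columns


matches, columns, percent = identity(query, sbjct)
print(f"Identity = {matches}/{columns} ({percent:.2f}%)")
```

The same function applies to the gapped alignments further down, provided the rows are copied with their `-` characters intact.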

BLAST of ClCG01G021310 vs. TrEMBL
Match: A0A061GHA6_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_030413 PE=4 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 2.1e-28
Identity = 68/121 (56.20%), Positives = 81/121 (66.94%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPT------------------EVHGYGEERSSKVKKGYE------ 60
           MALRW +HS  ++LG P                   E H  G  RSSKV  G +      
Sbjct: 1   MALRWFVHSACHVLGYPKDDHPNHLQHCNNMESYQKEGHSGGVIRSSKVSNGEQVSTQTA 60

Query: 61  EICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 98
           E+  SGFQMPLHYPRY KADY+KMEEWK+D+LL+EYGLSF+G+L+EKRA+AMGAFLWPDQ
Sbjct: 61  EMHLSGFQMPLHYPRYTKADYEKMEEWKVDMLLREYGLSFRGNLDEKRAYAMGAFLWPDQ 120

BLAST of ClCG01G021310 vs. TrEMBL
Match: A0A067GFM8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g045350mg PE=4 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 1.4e-27
Identity = 66/118 (55.93%), Positives = 80/118 (67.80%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPTE-------VHGY-----------GEERSSKVKKGYE---EIC 60
           MALRW+LH+  ++LG+  +       V GY           G    SKV  G     E C
Sbjct: 1   MALRWVLHTACHVLGHQNDNKIECNGVVGYQNDHHHQVQVNGVSSDSKVSDGLSQSVEAC 60

Query: 61  TSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
            SGFQ+PLHYPRY+KADY+KME+WKLD+LL+EYGL FQG+ +EKRAFAMGAFLWPDQY
Sbjct: 61  ASGFQLPLHYPRYSKADYEKMEDWKLDMLLREYGLCFQGTPDEKRAFAMGAFLWPDQY 118

BLAST of ClCG01G021310 vs. TrEMBL
Match: I1MFQ9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_15G117600 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 5.2e-27
Identity = 67/115 (58.26%), Positives = 82/115 (71.30%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPT---------EVHGY---GEERSSK---VKKGYEEI--C--TS 60
           MALRW+LHS  ++LG PT         ++ GY   GE +S K   +   + E+  C   S
Sbjct: 1   MALRWVLHSACHVLGYPTRNIEEEECKKIEGYSNIGEAKSVKGLSLSNNFNEVDQCYPCS 60

Query: 61  GFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
           GFQMPLHYPRY K DY+ MEEWK+DLLLK+YGLSF+G+L+EKRAFAMGAFLWPDQ
Sbjct: 61  GFQMPLHYPRYTKQDYESMEEWKVDLLLKQYGLSFKGTLDEKRAFAMGAFLWPDQ 115

BLAST of ClCG01G021310 vs. TrEMBL
Match: A0A0B2RLN7_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_009934 PE=4 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 6.8e-27
Identity = 66/115 (57.39%), Positives = 80/115 (69.57%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPT---------EVHGYGEERSSKVKKG------YEEI--C--TS 60
           MALRW+LHS  ++LG PT         ++ GY     +K  KG      + E+  C   S
Sbjct: 1   MALRWVLHSACHVLGYPTRNIEEEECKKIEGYSNIGEAKSVKGLSLANNFNEVDQCYPCS 60

Query: 61  GFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
           GFQMPLHYPRY K DY+ MEEWK+DLLLK+YGLSF+G+L+EKRAFAMGAFLWPDQ
Sbjct: 61  GFQMPLHYPRYTKQDYESMEEWKVDLLLKQYGLSFKGTLDEKRAFAMGAFLWPDQ 115

BLAST of ClCG01G021310 vs. TAIR10
Match: AT5G55620.1 (AT5G55620.1 unknown protein)

HSP 1 Score: 94.0 bits (232), Expect = 5.5e-20
Identity = 41/70 (58.57%), Positives = 55/70 (78.57%), Query Frame = 1

Query: 27  RSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAF 86
           R+  +K   +E   SGFQ+PLHYP+Y+K+DY+ M++ +LDLLLK+YG SF+GSLE+KR F
Sbjct: 31  RNKIIKMMKKEEFPSGFQVPLHYPKYSKSDYEVMDDLRLDLLLKQYGFSFEGSLEDKRVF 90

Query: 87  AMGAFLWPDQ 97
           A+ +FLWPDQ
Sbjct: 91  AIESFLWPDQ 100

BLAST of ClCG01G021310 vs. TAIR10
Match: AT3G09950.1 (AT3G09950.1 unknown protein)

HSP 1 Score: 84.7 bits (208), Expect = 3.4e-17
Identity = 39/65 (60.00%), Positives = 46/65 (70.77%), Query Frame = 1

Query: 32 KKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGL--SFQGSLEEKRAFAMG 91
          K G  +  +SGF+MPLHYPRY K DY++MEEW+LDLLL EYGL      +L EKRAFA+ 
Sbjct: 25 KNGAVKAPSSGFKMPLHYPRYTKEDYEEMEEWRLDLLLSEYGLLAFHDNTLHEKRAFAID 84

Query: 92 AFLWP 95
           F+WP
Sbjct: 85 TFIWP 89

BLAST of ClCG01G021310 vs. TAIR10
Match: AT5G41761.1 (AT5G41761.1 unknown protein)

HSP 1 Score: 83.6 bits (205), Expect = 7.5e-17
Identity = 37/71 (52.11%), Positives = 48/71 (67.61%), Query Frame = 1

Query: 26 ERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRA 85
          E ++K+     +  +S FQ+PLHYP+Y K+DY+KM EW+LD LL+EYGL   G   EKR 
Sbjct: 28 ETATKINHDKPQNQSSSFQIPLHYPKYTKSDYEKMPEWQLDRLLREYGLPVIGDSYEKRK 87

Query: 86 FAMGAFLWPDQ 97
          FA+GAFLW  +
Sbjct: 88 FAIGAFLWSSE 98

BLAST of ClCG01G021310 vs. TAIR10
Match: AT3G55570.1 (AT3G55570.1 unknown protein)

HSP 1 Score: 80.1 bits (196), Expect = 8.3e-16
Identity = 36/53 (67.92%), Positives = 39/53 (73.58%), Query Frame = 1

Query: 41 SGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLW 94
          S F+MPLHYPRY+K DYQ M EWKLD +L +YGLS  G L  KR FA+GAFLW
Sbjct: 30 SVFRMPLHYPRYSKEDYQDMPEWKLDRVLADYGLSTYGDLAHKRDFAIGAFLW 82

BLAST of ClCG01G021310 vs. TAIR10
Match: AT3G11405.1 (AT3G11405.1 unknown protein)

HSP 1 Score: 58.9 bits (141), Expect = 2.0e-09
Identity = 31/54 (57.41%), Positives = 33/54 (61.11%), Query Frame = 1

Query: 41  SGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQ-GSLEEKRAFAMGAFLW 94
           S FQMPL YP Y K  Y  M E +LD LLK YGL    G+L  K+ FA+GAFLW
Sbjct: 55  SSFQMPLQYPNYAKEQYDIMSEEELDRLLKLYGLPTDIGNLSCKKEFAVGAFLW 108

BLAST of ClCG01G021310 vs. NCBI nr
Match: gi|778720128|ref|XP_011658113.1| (PREDICTED: uncharacterized protein LOC105435946 [Cucumis sativus])

HSP 1 Score: 180.6 bits (457), Expect = 1.3e-42
Identity = 84/96 (87.50%), Positives = 87/96 (90.62%), Query Frame = 1

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          M LR LLHS SYLLGNP E H YGEERSSK KKGYEE+C SGFQMPLHYPRY K+DYQKM
Sbjct: 1  MDLRGLLHSVSYLLGNPNEAHAYGEERSSKGKKGYEELCNSGFQMPLHYPRYKKSDYQKM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
          EEWKLDLLLKEYGLSF+GSLEEKRAFAMGAFLWPDQ
Sbjct: 61 EEWKLDLLLKEYGLSFEGSLEEKRAFAMGAFLWPDQ 96

BLAST of ClCG01G021310 vs. NCBI nr
Match: gi|590627017|ref|XP_007026334.1| (Uncharacterized protein TCM_030413 [Theobroma cacao])

HSP 1 Score: 132.9 bits (333), Expect = 3.0e-28
Identity = 68/121 (56.20%), Positives = 81/121 (66.94%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPT------------------EVHGYGEERSSKVKKGYE------ 60
           MALRW +HS  ++LG P                   E H  G  RSSKV  G +      
Sbjct: 1   MALRWFVHSACHVLGYPKDDHPNHLQHCNNMESYQKEGHSGGVIRSSKVSNGEQVSTQTA 60

Query: 61  EICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 98
           E+  SGFQMPLHYPRY KADY+KMEEWK+D+LL+EYGLSF+G+L+EKRA+AMGAFLWPDQ
Sbjct: 61  EMHLSGFQMPLHYPRYTKADYEKMEEWKVDMLLREYGLSFRGNLDEKRAYAMGAFLWPDQ 120

BLAST of ClCG01G021310 vs. NCBI nr
Match: gi|641859833|gb|KDO78523.1| (hypothetical protein CISIN_1g045350mg [Citrus sinensis])

HSP 1 Score: 130.2 bits (326), Expect = 2.0e-27
Identity = 66/118 (55.93%), Positives = 80/118 (67.80%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPTE-------VHGY-----------GEERSSKVKKGYE---EIC 60
           MALRW+LH+  ++LG+  +       V GY           G    SKV  G     E C
Sbjct: 1   MALRWVLHTACHVLGHQNDNKIECNGVVGYQNDHHHQVQVNGVSSDSKVSDGLSQSVEAC 60

Query: 61  TSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
            SGFQ+PLHYPRY+KADY+KME+WKLD+LL+EYGL FQG+ +EKRAFAMGAFLWPDQY
Sbjct: 61  ASGFQLPLHYPRYSKADYEKMEDWKLDMLLREYGLCFQGTPDEKRAFAMGAFLWPDQY 118

BLAST of ClCG01G021310 vs. NCBI nr
Match: gi|571517919|ref|XP_006597610.1| (PREDICTED: uncharacterized protein LOC102663856 [Glycine max])

HSP 1 Score: 128.3 bits (321), Expect = 7.5e-27
Identity = 67/115 (58.26%), Positives = 82/115 (71.30%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPT---------EVHGY---GEERSSK---VKKGYEEI--C--TS 60
           MALRW+LHS  ++LG PT         ++ GY   GE +S K   +   + E+  C   S
Sbjct: 1   MALRWVLHSACHVLGYPTRNIEEEECKKIEGYSNIGEAKSVKGLSLSNNFNEVDQCYPCS 60

Query: 61  GFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
           GFQMPLHYPRY K DY+ MEEWK+DLLLK+YGLSF+G+L+EKRAFAMGAFLWPDQ
Sbjct: 61  GFQMPLHYPRYTKQDYESMEEWKVDLLLKQYGLSFKGTLDEKRAFAMGAFLWPDQ 115

BLAST of ClCG01G021310 vs. NCBI nr
Match: gi|734404432|gb|KHN32943.1| (hypothetical protein glysoja_009934 [Glycine soja])

HSP 1 Score: 127.9 bits (320), Expect = 9.8e-27
Identity = 66/115 (57.39%), Positives = 80/115 (69.57%), Query Frame = 1

Query: 1   MALRWLLHSTSYLLGNPT---------EVHGYGEERSSKVKKG------YEEI--C--TS 60
           MALRW+LHS  ++LG PT         ++ GY     +K  KG      + E+  C   S
Sbjct: 1   MALRWVLHSACHVLGYPTRNIEEEECKKIEGYSNIGEAKSVKGLSLANNFNEVDQCYPCS 60

Query: 61  GFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
           GFQMPLHYPRY K DY+ MEEWK+DLLLK+YGLSF+G+L+EKRAFAMGAFLWPDQ
Sbjct: 61  GFQMPLHYPRYTKQDYESMEEWKVDLLLKQYGLSFKGTLDEKRAFAMGAFLWPDQ 115

The following BLAST results are available for this feature:

Match Name        E-value  Identity  Description
A0A0A0KMW8_CUCSA  8.9e-43  87.50     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G511080 PE=4 SV=1
A0A061GHA6_THECC  2.1e-28  56.20     Uncharacterized protein OS=Theobroma cacao GN=TCM_030413 PE=4 SV=1
A0A067GFM8_CITSI  1.4e-27  55.93     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g045350mg PE=4 SV=1
I1MFQ9_SOYBN      5.2e-27  58.26     Uncharacterized protein OS=Glycine max GN=GLYMA_15G117600 PE=4 SV=1
A0A0B2RLN7_GLYSO  6.8e-27  57.39     Uncharacterized protein OS=Glycine soja GN=glysoja_009934 PE=4 SV=1

Match Name   E-value  Identity  Description
AT5G55620.1  5.5e-20  58.57     unknown protein
AT3G09950.1  3.4e-17  60.00     unknown protein
AT5G41761.1  7.5e-17  52.11     unknown protein
AT3G55570.1  8.3e-16  67.92     unknown protein
AT3G11405.1  2.0e-09  57.41     unknown protein

Match Name                        E-value  Identity  Description
gi|778720128|ref|XP_011658113.1|  1.3e-42  87.50     PREDICTED: uncharacterized protein LOC105435946 [Cucumis sativus]
gi|590627017|ref|XP_007026334.1|  3.0e-28  56.20     Uncharacterized protein TCM_030413 [Theobroma cacao]
gi|641859833|gb|KDO78523.1|       2.0e-27  55.93     hypothetical protein CISIN_1g045350mg [Citrus sinensis]
gi|571517919|ref|XP_006597610.1|  7.5e-27  58.26     PREDICTED: uncharacterized protein LOC102663856 [Glycine max]
gi|734404432|gb|KHN32943.1|       9.8e-27  57.39     hypothetical protein glysoja_009934 [Glycine soja]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G021310.1  ClCG01G021310.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR33513      FAMILY NOT NAMED     coord: 36..97, score: 4.1
None      No IPR available  PANTHER  PTHR33513:SF2  SUBFAMILY NOT NAMED  coord: 36..97, score: 4.1

The following gene(s) are paralogous to this gene:

None