Clc01G23080 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G23080
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: CHCH domain-containing protein
Location: ClcChr01: 33957407 .. 33957930 (+)
RNA-Seq Expression: Clc01G23080
Synteny: Clc01G23080
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATCTCACTCAAACCCTCCACTCACCACAATGGATCCTCATTCCCTTACAATTATTCTTTAAATCTTCAACAGAATCCCATTCCAATCATTAGAATTAAGCCCAATTTCACACCTGAATTCCACAGCAGAACAACCTTCAGAACCCGCGCCGAACAACCCGGATCAGGCAACGGCGCTGAAAGACGGACTTTCTTGACCATCGAAGAAGCCGGTCTGGTTGAAGTATCCGGCCTCGGCACGCACGAACGATTTCTCTGTCGTTTAACCGTATAATCTCTGTTTCTGGTCTTTTTGAATACACATGTTGCCTTAATCCTCAATTATTATTGAAGCTGTTGGTTGTGTTTGTGTAGATATCGTCCCTGAATTTACTGAGAGTGATAGCGGAGGAAGAGGAATGTTCCATTGAAGAGTTGAATGCGGGAAGATTATGCGATTGGTTTCTCAAGGATAAGTTGAAAAGAGAGCAGAATTTGGATTCTGCTGTTCTTCAATGGGACGATTCTGACCCTCTCTTCTGA

mRNA sequence

ATGATCTCACTCAAACCCTCCACTCACCACAATGGATCCTCATTCCCTTACAATTATTCTTTAAATCTTCAACAGAATCCCATTCCAATCATTAGAATTAAGCCCAATTTCACACCTGAATTCCACAGCAGAACAACCTTCAGAACCCGCGCCGAACAACCCGGATCAGGCAACGGCGCTGAAAGACGGACTTTCTTGACCATCGAAGAAGCCGGTCTGGTTGAAGTATCCGGCCTCGGCACGCACGAACGATTTCTCTGTCGTTTAACCATATCGTCCCTGAATTTACTGAGAGTGATAGCGGAGGAAGAGGAATGTTCCATTGAAGAGTTGAATGCGGGAAGATTATGCGATTGGTTTCTCAAGGATAAGTTGAAAAGAGAGCAGAATTTGGATTCTGCTGTTCTTCAATGGGACGATTCTGACCCTCTCTTCTGA
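
The gene sequence differs from the mRNA sequence only by one internal segment (the intron), so its position can be recovered by comparing the two strings. The following is a minimal sketch under that single-intron assumption; the function name `locate_single_intron` and the short toy sequences are hypothetical and are not taken from this record.

```python
def locate_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return the 0-based, half-open (start, end) span of the single
    segment that is present in `gene` but absent from `mrna`."""
    if len(gene) < len(mrna):
        raise ValueError("gene sequence is shorter than the mRNA")

    # Walk the longest common prefix: this is (at least) the first exon.
    start = 0
    while start < len(mrna) and gene[start] == mrna[start]:
        start += 1

    # Whatever length is left over must be the intron.
    intron_len = len(gene) - len(mrna)
    end = start + intron_len

    # The rest of the mRNA has to match the gene after the intron,
    # otherwise the two sequences differ by more than one segment.
    if gene[end:] != mrna[start:]:
        raise ValueError("sequences do not differ by a single internal segment")
    return start, end


# Toy example (hypothetical sequences): exon + GT...AG intron + exon.
toy_gene = "ATGGCC" + "GTAAGTTTTCAG" + "ACTTGA"
toy_mrna = "ATGGCC" + "ACTTGA"
start, end = locate_single_intron(toy_gene, toy_mrna)
intron = toy_gene[start:end]
```

If the intron happens to begin with the same bases as the downstream exon, the prefix walk overshoots and the reported span shifts right by that many bases; the spliced product is unchanged, but the boundaries should then be checked against the GT...AG splice signals.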

Coding sequence (CDS)

ATGATCTCACTCAAACCCTCCACTCACCACAATGGATCCTCATTCCCTTACAATTATTCTTTAAATCTTCAACAGAATCCCATTCCAATCATTAGAATTAAGCCCAATTTCACACCTGAATTCCACAGCAGAACAACCTTCAGAACCCGCGCCGAACAACCCGGATCAGGCAACGGCGCTGAAAGACGGACTTTCTTGACCATCGAAGAAGCCGGTCTGGTTGAAGTATCCGGCCTCGGCACGCACGAACGATTTCTCTGTCGTTTAACCATATCGTCCCTGAATTTACTGAGAGTGATAGCGGAGGAAGAGGAATGTTCCATTGAAGAGTTGAATGCGGGAAGATTATGCGATTGGTTTCTCAAGGATAAGTTGAAAAGAGAGCAGAATTTGGATTCTGCTGTTCTTCAATGGGACGATTCTGACCCTCTCTTCTGA

Protein sequence

MISLKPSTHHNGSSFPYNYSLNLQQNPIPIIRIKPNFTPEFHSRTTFRTRAEQPGSGNGAERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF
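
The protein sequence follows from the CDS by reading it codon by codon up to the stop codon. The sketch below assumes the standard genetic code (reasonable for a nuclear gene, but an assumption here); `CODON_TABLE` and `translate` are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last seven codons of the CDS listed above.
n_terminus = translate("ATGATCTCACTCAAACCCTCCACTCACCAC")
c_terminus = translate("GATTCTGACCCTCTCTTCTGA")
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above; the two fragments give its first ten and last six residues.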
Homology
BLAST of Clc01G23080 vs. NCBI nr
Match: XP_038881387.1 (uncharacterized protein LOC120072922 [Benincasa hispida])

HSP 1 Score: 248.8 bits (634), Expect = 2.8e-62
Identity = 131/148 (88.51%), Positives = 136/148 (91.89%), Query Frame = 0

Query: 1   MISLKPSTHHNGSSFPYNYSLNLQQNPIP-IIRIKPNFTPEFHSR-TTFRTRAEQPG-SG 60
           MISL PSTHHNGSSF YN S NLQQN IP IIRIKPNF P FH++ TTFRT+AEQ G SG
Sbjct: 1   MISLNPSTHHNGSSFHYN-SSNLQQNFIPTIIRIKPNFAPPFHTKTTTFRTQAEQAGSSG 60

Query: 61  NGAERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLC 120
           NGA+RRTFLTIEEAGLVEVSGL THERFLCRLTISSLNLLRVIAEEE+CSIEELNAGRLC
Sbjct: 61  NGAQRRTFLTIEEAGLVEVSGLSTHERFLCRLTISSLNLLRVIAEEEKCSIEELNAGRLC 120

Query: 121 DWFLKDKLKREQNLDSAVLQWDDSDPLF 146
           DWFLKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 DWFLKDKLKREQNLDSAVLQWDDSDPLF 147
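
The percentages in each HSP statistics line are the match counts divided by the alignment length (148 columns for this hit, gaps included). A minimal sketch for re-deriving them from the text of a report follows; the regular expression and the function name `check_hsp_line` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches fields of the form "Label = matches/length (pct%)".
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line: str) -> dict[str, tuple[int, int, float, float]]:
    """Map each label to (matches, length, reported_pct, recomputed_pct)."""
    stats = {}
    for label, matches, length, pct in FRACTION.findall(line):
        recomputed = round(100 * int(matches) / int(length), 2)
        stats[label] = (int(matches), int(length), float(pct), recomputed)
    return stats


stats = check_hsp_line(
    "Identity = 131/148 (88.51%), Positives = 136/148 (91.89%), Query Frame = 0"
)
```

Fields without a fraction, such as `Query Frame = 0`, are ignored by the pattern, so the same function can be run over every statistics line on this page.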

BLAST of Clc01G23080 vs. NCBI nr
Match: XP_004134733.1 (uncharacterized protein LOC101211873 [Cucumis sativus] >KGN49180.1 hypothetical protein Csa_002941 [Cucumis sativus])

HSP 1 Score: 237.7 bits (605), Expect = 6.5e-59
Identity = 123/148 (83.11%), Positives = 132/148 (89.19%), Query Frame = 0

Query: 1   MISLKPSTHHNG--SSFPYNYSLNLQQNPIPIIRI-KPNFTPEFHSRTTFRTRAEQPGSG 60
           MISLKPSTHHNG  +SFPYN S   QQNPIPIIRI KPNF+   H++TTFRT A+  GSG
Sbjct: 1   MISLKPSTHHNGFDTSFPYN-SSKFQQNPIPIIRITKPNFS---HTKTTFRTHAQPSGSG 60

Query: 61  NGAERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLC 120
            G ++RTFLTIEEAGLVEVSGL THERFLCRLTISSLNLL+VIAEEE+CSIEELNAGRLC
Sbjct: 61  KGPQKRTFLTIEEAGLVEVSGLSTHERFLCRLTISSLNLLKVIAEEEKCSIEELNAGRLC 120

Query: 121 DWFLKDKLKREQNLDSAVLQWDDSDPLF 146
           DWFLKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 DWFLKDKLKREQNLDSAVLQWDDSDPLF 144

BLAST of Clc01G23080 vs. NCBI nr
Match: XP_022977984.1 (uncharacterized protein LOC111478111 [Cucurbita maxima])

HSP 1 Score: 235.7 bits (600), Expect = 2.5e-58
Identity = 119/145 (82.07%), Positives = 127/145 (87.59%), Query Frame = 0

Query: 1   MISLKPSTHHNGSSFPYNYSLNLQQNPIPIIRIKPNFTPEFHSRTTFRTRAEQPGSGNGA 60
           MISLKPS HHNGSSF  N S NL Q   PI R++ NF P F ++TTFR RAEQ  SGNGA
Sbjct: 1   MISLKPSPHHNGSSFLSN-SSNLHQKLNPITRLRSNFPPPFRTKTTFRIRAEQAESGNGA 60

Query: 61  ERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLCDWF 120
           +RRTFLT+EEAGLVEVSGL THERFLCRLTISSLNLL+VIAEEE+CSIEELNAGRLCDWF
Sbjct: 61  QRRTFLTLEEAGLVEVSGLSTHERFLCRLTISSLNLLKVIAEEEKCSIEELNAGRLCDWF 120

Query: 121 LKDKLKREQNLDSAVLQWDDSDPLF 146
           LKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 LKDKLKREQNLDSAVLQWDDSDPLF 144

BLAST of Clc01G23080 vs. NCBI nr
Match: XP_023544671.1 (uncharacterized protein LOC111804184 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 235.0 bits (598), Expect = 4.2e-58
Identity = 118/145 (81.38%), Positives = 127/145 (87.59%), Query Frame = 0

Query: 1   MISLKPSTHHNGSSFPYNYSLNLQQNPIPIIRIKPNFTPEFHSRTTFRTRAEQPGSGNGA 60
           MISLKPS HHNGSSF +N S NL Q   PI R + NF P F ++TTF+ RAEQ  SGNGA
Sbjct: 1   MISLKPSPHHNGSSFLFN-SSNLHQKLSPITRHRSNFPPPFRTKTTFKIRAEQAESGNGA 60

Query: 61  ERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLCDWF 120
           +RRTFLT+EEAGLVEVSGL THERFLCRLTISSLNLL+VIAEEE+CSIEELNAGRLCDWF
Sbjct: 61  QRRTFLTLEEAGLVEVSGLSTHERFLCRLTISSLNLLKVIAEEEKCSIEELNAGRLCDWF 120

Query: 121 LKDKLKREQNLDSAVLQWDDSDPLF 146
           LKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 LKDKLKREQNLDSAVLQWDDSDPLF 144

BLAST of Clc01G23080 vs. NCBI nr
Match: XP_022949732.1 (uncharacterized protein LOC111453037 [Cucurbita moschata] >KAG6603726.1 hypothetical protein SDJN03_04335, partial [Cucurbita argyrosperma subsp. sororia] >KAG7033905.1 hypothetical protein SDJN02_03630 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 231.9 bits (590), Expect = 3.6e-57
Identity = 118/145 (81.38%), Positives = 125/145 (86.21%), Query Frame = 0

Query: 1   MISLKPSTHHNGSSFPYNYSLNLQQNPIPIIRIKPNFTPEFHSRTTFRTRAEQPGSGNGA 60
           MISLKPS HHNGSSF  N S NL Q   PI R + NF P   ++TTFR RAEQ  SGNGA
Sbjct: 1   MISLKPSPHHNGSSFLSN-SSNLHQKLSPITRHRSNFPPPLRTKTTFRIRAEQAESGNGA 60

Query: 61  ERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLCDWF 120
           +RRTFLT+EEAGLVEVSGL THERFLCRLTISSLNLL+VIAEEE+CSIEELNAGRLCDWF
Sbjct: 61  QRRTFLTLEEAGLVEVSGLSTHERFLCRLTISSLNLLKVIAEEEKCSIEELNAGRLCDWF 120

Query: 121 LKDKLKREQNLDSAVLQWDDSDPLF 146
           LKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 LKDKLKREQNLDSAVLQWDDSDPLF 144

BLAST of Clc01G23080 vs. ExPASy TrEMBL
Match: A0A0A0KHI4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G516880 PE=4 SV=1)

HSP 1 Score: 237.7 bits (605), Expect = 3.1e-59
Identity = 123/148 (83.11%), Positives = 132/148 (89.19%), Query Frame = 0

Query: 1   MISLKPSTHHNG--SSFPYNYSLNLQQNPIPIIRI-KPNFTPEFHSRTTFRTRAEQPGSG 60
           MISLKPSTHHNG  +SFPYN S   QQNPIPIIRI KPNF+   H++TTFRT A+  GSG
Sbjct: 1   MISLKPSTHHNGFDTSFPYN-SSKFQQNPIPIIRITKPNFS---HTKTTFRTHAQPSGSG 60

Query: 61  NGAERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLC 120
            G ++RTFLTIEEAGLVEVSGL THERFLCRLTISSLNLL+VIAEEE+CSIEELNAGRLC
Sbjct: 61  KGPQKRTFLTIEEAGLVEVSGLSTHERFLCRLTISSLNLLKVIAEEEKCSIEELNAGRLC 120

Query: 121 DWFLKDKLKREQNLDSAVLQWDDSDPLF 146
           DWFLKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 DWFLKDKLKREQNLDSAVLQWDDSDPLF 144

BLAST of Clc01G23080 vs. ExPASy TrEMBL
Match: A0A6J1ISV2 (uncharacterized protein LOC111478111 OS=Cucurbita maxima OX=3661 GN=LOC111478111 PE=4 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 1.2e-58
Identity = 119/145 (82.07%), Positives = 127/145 (87.59%), Query Frame = 0

Query: 1   MISLKPSTHHNGSSFPYNYSLNLQQNPIPIIRIKPNFTPEFHSRTTFRTRAEQPGSGNGA 60
           MISLKPS HHNGSSF  N S NL Q   PI R++ NF P F ++TTFR RAEQ  SGNGA
Sbjct: 1   MISLKPSPHHNGSSFLSN-SSNLHQKLNPITRLRSNFPPPFRTKTTFRIRAEQAESGNGA 60

Query: 61  ERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLCDWF 120
           +RRTFLT+EEAGLVEVSGL THERFLCRLTISSLNLL+VIAEEE+CSIEELNAGRLCDWF
Sbjct: 61  QRRTFLTLEEAGLVEVSGLSTHERFLCRLTISSLNLLKVIAEEEKCSIEELNAGRLCDWF 120

Query: 121 LKDKLKREQNLDSAVLQWDDSDPLF 146
           LKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 LKDKLKREQNLDSAVLQWDDSDPLF 144

BLAST of Clc01G23080 vs. ExPASy TrEMBL
Match: A0A6J1GCY7 (uncharacterized protein LOC111453037 OS=Cucurbita moschata OX=3662 GN=LOC111453037 PE=4 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 1.7e-57
Identity = 118/145 (81.38%), Positives = 125/145 (86.21%), Query Frame = 0

Query: 1   MISLKPSTHHNGSSFPYNYSLNLQQNPIPIIRIKPNFTPEFHSRTTFRTRAEQPGSGNGA 60
           MISLKPS HHNGSSF  N S NL Q   PI R + NF P   ++TTFR RAEQ  SGNGA
Sbjct: 1   MISLKPSPHHNGSSFLSN-SSNLHQKLSPITRHRSNFPPPLRTKTTFRIRAEQAESGNGA 60

Query: 61  ERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRLCDWF 120
           +RRTFLT+EEAGLVEVSGL THERFLCRLTISSLNLL+VIAEEE+CSIEELNAGRLCDWF
Sbjct: 61  QRRTFLTLEEAGLVEVSGLSTHERFLCRLTISSLNLLKVIAEEEKCSIEELNAGRLCDWF 120

Query: 121 LKDKLKREQNLDSAVLQWDDSDPLF 146
           LKDKLKREQNLDSAVLQWDDSDPLF
Sbjct: 121 LKDKLKREQNLDSAVLQWDDSDPLF 144

BLAST of Clc01G23080 vs. ExPASy TrEMBL
Match: A0A6J1CKG5 (uncharacterized protein LOC111012432 OS=Momordica charantia OX=3673 GN=LOC111012432 PE=4 SV=1)

HSP 1 Score: 191.4 bits (485), Expect = 2.6e-45
Identity = 107/149 (71.81%), Positives = 113/149 (75.84%), Query Frame = 0

Query: 1   MISLKPSTHHNGSSFPYNYSLNLQQNPIPIIRIKPNFTPE-FHSRTTFR---TRAEQPGS 60
           MISL P  + NGSSFP+  S  L   PIP     P  T      +  FR    RAEQ G 
Sbjct: 1   MISLGPCAYQNGSSFPFYSSNLLNLRPIPSNFALPLPTNSIILGKRNFRFRVVRAEQAG- 60

Query: 61  GNGAERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNAGRL 120
           GN A+RRTFLT+EEAGLVEVSGL THERFLCRLTISSLNLLRVIAEEE+CSIEELNAGRL
Sbjct: 61  GNSAQRRTFLTVEEAGLVEVSGLSTHERFLCRLTISSLNLLRVIAEEEKCSIEELNAGRL 120

Query: 121 CDWFLKDKLKREQNLDSAVLQWDDSDPLF 146
           CDWFLKDKLKREQNLDSAVLQWDDSD  F
Sbjct: 121 CDWFLKDKLKREQNLDSAVLQWDDSDLTF 148

BLAST of Clc01G23080 vs. ExPASy TrEMBL
Match: A0A6P4A1U2 (uncharacterized protein LOC107422001 OS=Ziziphus jujuba OX=326968 GN=LOC107422001 PE=4 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 1.2e-37
Identity = 93/152 (61.18%), Positives = 110/152 (72.37%), Query Frame = 0

Query: 1   MISLKP-STHHNGSSF--PYNYSLNLQQNPIPIIRIKP-NFTPEFHSRTTFRTRAEQPGS 60
           M+SLKP  THH  SS   P+N    ++Q      +I+P    P      + R  + Q   
Sbjct: 1   MLSLKPCKTHHRLSSLSSPFNLQNIIKQTQ---TKIRPTKLAPRRSLSLSIRGCSGQEEK 60

Query: 61  G---NGAERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEEEECSIEELNA 120
           G     ++RRTFLT+EEAGLVE+SGLGTHERFLCRLTISSLNLLRVIAE+E CSIEELNA
Sbjct: 61  GRENKSSQRRTFLTLEEAGLVEISGLGTHERFLCRLTISSLNLLRVIAEQEGCSIEELNA 120

Query: 121 GRLCDWFLKDKLKREQNLDSAVLQWDDSDPLF 146
           GR+CDWF+KDKLKREQN+DSAVLQWDDS+  F
Sbjct: 121 GRVCDWFVKDKLKREQNIDSAVLQWDDSESPF 149

BLAST of Clc01G23080 vs. TAIR 10
Match: AT4G21445.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, chloroplast stroma; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 148.3 bits (373), Expect = 4.8e-36
Identity = 70/99 (70.71%), Positives = 83/99 (83.84%), Query Frame = 0

Query: 44  RTTFRTRAEQPGSGNGAERRTFLTIEEAGLVEVSGLGTHERFLCRLTISSLNLLRVIAEE 103
           R +  T  E+    N  ERR+FL++ EAGLVE+SGLG HE+FLCRLTISSLNLLRVI+E+
Sbjct: 53  RASAETGEEESDDQNKPERRSFLSLAEAGLVEISGLGAHEKFLCRLTISSLNLLRVISEQ 112

Query: 104 EECSIEELNAGRLCDWFLKDKLKREQNLDSAVLQWDDSD 143
           E CSIEELNAG++CDWFLKDKLKRE N++SAVLQWDD D
Sbjct: 113 EGCSIEELNAGKICDWFLKDKLKREHNIESAVLQWDDPD 151

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038881387.1  2.8e-62  88.51         uncharacterized protein LOC120072922 [Benincasa hispida]
XP_004134733.1  6.5e-59  83.11         uncharacterized protein LOC101211873 [Cucumis sativus] >KGN49180.1 hypothetical ...
XP_022977984.1  2.5e-58  82.07         uncharacterized protein LOC111478111 [Cucurbita maxima]
XP_023544671.1  4.2e-58  81.38         uncharacterized protein LOC111804184 [Cucurbita pepo subsp. pepo]
XP_022949732.1  3.6e-57  81.38         uncharacterized protein LOC111453037 [Cucurbita moschata] >KAG6603726.1 hypothet...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A0A0KHI4      3.1e-59  83.11         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G516880 PE=4 SV=1
A0A6J1ISV2      1.2e-58  82.07         uncharacterized protein LOC111478111 OS=Cucurbita maxima OX=3661 GN=LOC111478111...
A0A6J1GCY7      1.7e-57  81.38         uncharacterized protein LOC111453037 OS=Cucurbita moschata OX=3662 GN=LOC1114530...
A0A6J1CKG5      2.6e-45  71.81         uncharacterized protein LOC111012432 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A6P4A1U2      1.2e-37  61.18         uncharacterized protein LOC107422001 OS=Ziziphus jujuba OX=326968 GN=LOC10742200...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G21445.1     4.8e-36  70.71         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term  IPR Description   Source   Source Term  Source Description        Alignment
None      No IPR available  PANTHER  PTHR36382    OSJNBA0043L09.26 PROTEIN  coord: 1..143

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc01G23080.1  Clc01G23080.1  mRNA