Clc01G21760 (gene) Watermelon (cordophanus) v2

Overview
NameClc01G21760
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionUnknown protein
LocationClcChr01: 33193715 .. 33194008 (-)
RNA-Seq ExpressionClc01G21760
SyntenyClc01G21760
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTTAAGATGGCTGCTTCATTCAACAAGTTATCTTCTTGGGAACCCAACTGAGGTCCATGGCTATGGTGAAGAGAGAAGTTCAAAAGTTAAAAAGGGTTATGAAGAAATATGCACTTCTGGGTTTCAAATGCCTCTTCATTACCCTCGCTACAACAAGGCAGATTACCAGAAGATGGAGGAGTGGAAGCTTGATCTCCTTCTCAAGGAATATGGCTTGAGTTTTCAAGGCAGTTTGGAGGAGAAGAGGGCTTTTGCAATGGGTGCTTTTCTATGGCCTGATCAGTATTGA

mRNA sequence

ATGGCTTTAAGATGGCTGCTTCATTCAACAAGTTATCTTCTTGGGAACCCAACTGAGGTCCATGGCTATGGTGAAGAGAGAAGTTCAAAAGTTAAAAAGGGTTATGAAGAAATATGCACTTCTGGGTTTCAAATGCCTCTTCATTACCCTCGCTACAACAAGGCAGATTACCAGAAGATGGAGGAGTGGAAGCTTGATCTCCTTCTCAAGGAATATGGCTTGAGTTTTCAAGGCAGTTTGGAGGAGAAGAGGGCTTTTGCAATGGGTGCTTTTCTATGGCCTGATCAGTATTGA

Coding sequence (CDS)

ATGGCTTTAAGATGGCTGCTTCATTCAACAAGTTATCTTCTTGGGAACCCAACTGAGGTCCATGGCTATGGTGAAGAGAGAAGTTCAAAAGTTAAAAAGGGTTATGAAGAAATATGCACTTCTGGGTTTCAAATGCCTCTTCATTACCCTCGCTACAACAAGGCAGATTACCAGAAGATGGAGGAGTGGAAGCTTGATCTCCTTCTCAAGGAATATGGCTTGAGTTTTCAAGGCAGTTTGGAGGAGAAGAGGGCTTTTGCAATGGGTGCTTTTCTATGGCCTGATCAGTATTGA

Protein sequence

MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY
Homology
BLAST of Clc01G21760 vs. NCBI nr
Match: TYK12988.1 (uncharacterized protein E5676_scaffold255G005850 [Cucumis melo var. makuwa])

HSP 1 Score: 185.3 bits (469), Expect = 2.6e-43
Identity = 85/97 (87.63%), Postives = 87/97 (89.69%), Query Frame = 0

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          M LRWLLHSTSYLLGNP E H YGEERSS  KKGYEEIC SGFQMPLHYPRY K+DYQ M
Sbjct: 1  MDLRWLLHSTSYLLGNPNEAHAYGEERSSTGKKGYEEICNSGFQMPLHYPRYKKSDYQNM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
          E WKLDLLLKEYGLSFQGSLEEKRAFAMGAF+WPDQY
Sbjct: 61 EGWKLDLLLKEYGLSFQGSLEEKRAFAMGAFIWPDQY 97

BLAST of Clc01G21760 vs. NCBI nr
Match: XP_038882557.1 (uncharacterized protein LOC120073788 [Benincasa hispida])

HSP 1 Score: 183.7 bits (465), Expect = 7.4e-43
Identity = 88/97 (90.72%), Postives = 89/97 (91.75%), Query Frame = 0

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          MALRWLLHSTSYLLGNP E    GEE SSK K GYEEICTSGFQMPLHYPRYNKADYQKM
Sbjct: 1  MALRWLLHSTSYLLGNPIEAR--GEESSSKGKNGYEEICTSGFQMPLHYPRYNKADYQKM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
          EEWKLDLLLKEYGLSFQGSLEEKRAFA+GAFLWPDQY
Sbjct: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAIGAFLWPDQY 95

BLAST of Clc01G21760 vs. NCBI nr
Match: XP_011658113.1 (uncharacterized protein LOC105435946 [Cucumis sativus] >KGN49041.1 hypothetical protein Csa_003579 [Cucumis sativus])

HSP 1 Score: 181.4 bits (459), Expect = 3.7e-42
Identity = 84/96 (87.50%), Postives = 87/96 (90.62%), Query Frame = 0

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          M LR LLHS SYLLGNP E H YGEERSSK KKGYEE+C SGFQMPLHYPRY K+DYQKM
Sbjct: 1  MDLRGLLHSVSYLLGNPNEAHAYGEERSSKGKKGYEELCNSGFQMPLHYPRYKKSDYQKM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
          EEWKLDLLLKEYGLSF+GSLEEKRAFAMGAFLWPDQ
Sbjct: 61 EEWKLDLLLKEYGLSFEGSLEEKRAFAMGAFLWPDQ 96

BLAST of Clc01G21760 vs. NCBI nr
Match: KAG6603827.1 (hypothetical protein SDJN03_04436, partial [Cucurbita argyrosperma subsp. sororia] >KAG7034009.1 hypothetical protein SDJN02_03735, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 171.0 bits (432), Expect = 5.0e-39
Identity = 84/99 (84.85%), Postives = 88/99 (88.89%), Query Frame = 0

Query: 1  MALRWLLHSTSYLLGNPT-EVHGYGEERSSKVKKGYEEICT-SGFQMPLHYPRYNKADYQ 60
          MAL WLL+S + LLGNP  EVHGYGEERSSK +KG EEICT SGFQMPLHYP Y KADYQ
Sbjct: 1  MALGWLLYSAARLLGNPNHEVHGYGEERSSKGEKGCEEICTSSGFQMPLHYPHYTKADYQ 60

Query: 61 KMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
          KMEEWK+D LLKEYGLSFQGSLEEKRAFAMGAFLWPDQY
Sbjct: 61 KMEEWKVDQLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 99

BLAST of Clc01G21760 vs. NCBI nr
Match: KAG6594704.1 (hypothetical protein SDJN03_11257, partial [Cucurbita argyrosperma subsp. sororia] >KAG7026671.1 hypothetical protein SDJN02_10674, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 157.1 bits (396), Expect = 7.5e-35
Identity = 76/96 (79.17%), Postives = 81/96 (84.38%), Query Frame = 0

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          M L+WLLHSTS LLGNP EV       SS+ K+G EEIC SGFQMPLHYPRYNKADYQKM
Sbjct: 1  MPLKWLLHSTSCLLGNPIEV-----PISSQGKRGCEEICNSGFQMPLHYPRYNKADYQKM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
          E+WK+DLLLKEYGLSF GSLEEKRAFAMGAF WPDQ
Sbjct: 61 EDWKVDLLLKEYGLSFHGSLEEKRAFAMGAFTWPDQ 91

BLAST of Clc01G21760 vs. ExPASy TrEMBL
Match: A0A5D3CP92 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G005850 PE=4 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.2e-43
Identity = 85/97 (87.63%), Postives = 87/97 (89.69%), Query Frame = 0

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          M LRWLLHSTSYLLGNP E H YGEERSS  KKGYEEIC SGFQMPLHYPRY K+DYQ M
Sbjct: 1  MDLRWLLHSTSYLLGNPNEAHAYGEERSSTGKKGYEEICNSGFQMPLHYPRYKKSDYQNM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
          E WKLDLLLKEYGLSFQGSLEEKRAFAMGAF+WPDQY
Sbjct: 61 EGWKLDLLLKEYGLSFQGSLEEKRAFAMGAFIWPDQY 97

BLAST of Clc01G21760 vs. ExPASy TrEMBL
Match: A0A0A0KMW8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511080 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 1.8e-42
Identity = 84/96 (87.50%), Postives = 87/96 (90.62%), Query Frame = 0

Query: 1  MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKM 60
          M LR LLHS SYLLGNP E H YGEERSSK KKGYEE+C SGFQMPLHYPRY K+DYQKM
Sbjct: 1  MDLRGLLHSVSYLLGNPNEAHAYGEERSSKGKKGYEELCNSGFQMPLHYPRYKKSDYQKM 60

Query: 61 EEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 97
          EEWKLDLLLKEYGLSF+GSLEEKRAFAMGAFLWPDQ
Sbjct: 61 EEWKLDLLLKEYGLSFEGSLEEKRAFAMGAFLWPDQ 96

BLAST of Clc01G21760 vs. ExPASy TrEMBL
Match: A0A6J1BUU9 (uncharacterized protein LOC111005569 OS=Momordica charantia OX=3673 GN=LOC111005569 PE=4 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 8.3e-32
Identity = 74/107 (69.16%), Postives = 81/107 (75.70%), Query Frame = 0

Query: 1   MALRWLLHSTSYLLGNPTEVHGYG--------EERSSKVKKGYEEIC--TSGFQMPLHYP 60
           MAL+WLLHS  YLL    EVH           EER  K KKG EEIC   SGFQMPLHYP
Sbjct: 3   MALKWLLHSACYLL---NEVHACANGGVKTSDEERIPKGKKGCEEICNSASGFQMPLHYP 62

Query: 61  RYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
           RYNKADYQKMEEW++DLLL +YG+ F+GSLEEKRAFAMGAFLWPDQ+
Sbjct: 63  RYNKADYQKMEEWEVDLLLNQYGMGFEGSLEEKRAFAMGAFLWPDQF 106

BLAST of Clc01G21760 vs. ExPASy TrEMBL
Match: A0A2N9EZP3 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8195 PE=4 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 1.1e-28
Identity = 63/100 (63.00%), Postives = 79/100 (79.00%), Query Frame = 0

Query: 1   MALRWLLHSTSYLLGNPTEVHGYGEERSSKVKKG---YEEICTSGFQMPLHYPRYNKADY 60
           MAL WL+HS  ++LG P + +     +S KV  G    +E+  SGFQMPLHYPRYNKADY
Sbjct: 1   MALSWLIHSACHVLGTPKDTNIQCHVKSLKVPNGGLPSKEMNPSGFQMPLHYPRYNKADY 60

Query: 61  QKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQY 98
           +KMEEWK+DLLLK+YGL+F+G+L+EKRA+AMGAFLWP QY
Sbjct: 61  EKMEEWKVDLLLKQYGLNFKGNLDEKRAYAMGAFLWPGQY 100

BLAST of Clc01G21760 vs. ExPASy TrEMBL
Match: A0A6J1B6U9 (uncharacterized protein LOC110424651 OS=Herrania umbratica OX=108875 GN=LOC110424651 PE=4 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 2.5e-28
Identity = 69/121 (57.02%), Postives = 81/121 (66.94%), Query Frame = 0

Query: 1   MALRWLLHSTSYLLGNP------------------TEVHGYGEERSSKVKKGYE------ 60
           MALRW +HS  ++LG P                   E H  G  RSSKV  G +      
Sbjct: 1   MALRWFVHSACHVLGYPKDDHPSHLQHCNNMGSYQKEGHSGGVIRSSKVSNGEQLSTQTA 60

Query: 61  EICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLWPDQ 98
           E+  SGFQMPLHYPRY KADYQKMEEWK+D+LL+EYGLSF+G+L+EKRA+AMGAFLWPDQ
Sbjct: 61  EMHLSGFQMPLHYPRYTKADYQKMEEWKVDVLLREYGLSFKGTLDEKRAYAMGAFLWPDQ 120

BLAST of Clc01G21760 vs. TAIR 10
Match: AT5G55620.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G09950.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 94.4 bits (233), Expect = 5.5e-20
Identity = 41/70 (58.57%), Postives = 57/70 (81.43%), Query Frame = 0

Query: 27  RSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAF 86
           R+  +K   +E   SGFQ+PLHYP+Y+K+DY+ M++ +LDLLLK+YG SF+GSLE+KR F
Sbjct: 31  RNKIIKMMKKEEFPSGFQVPLHYPKYSKSDYEVMDDLRLDLLLKQYGFSFEGSLEDKRVF 90

Query: 87  AMGAFLWPDQ 97
           A+ +FLWPDQ
Sbjct: 91  AIESFLWPDQ 100

BLAST of Clc01G21760 vs. TAIR 10
Match: AT3G09950.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41761.1); Has 128 Blast hits to 128 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 128; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 84.7 bits (208), Expect = 4.4e-17
Identity = 39/65 (60.00%), Postives = 48/65 (73.85%), Query Frame = 0

Query: 32 KKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGL--SFQGSLEEKRAFAMG 91
          K G  +  +SGF+MPLHYPRY K DY++MEEW+LDLLL EYGL      +L EKRAFA+ 
Sbjct: 25 KNGAVKAPSSGFKMPLHYPRYTKEDYEEMEEWRLDLLLSEYGLLAFHDNTLHEKRAFAID 84

Query: 92 AFLWP 95
           F+WP
Sbjct: 85 TFIWP 89

BLAST of Clc01G21760 vs. TAIR 10
Match: AT5G41761.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55570.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 84.0 bits (206), Expect = 7.5e-17
Identity = 37/71 (52.11%), Postives = 50/71 (70.42%), Query Frame = 0

Query: 26 ERSSKVKKGYEEICTSGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRA 85
          E ++K+     +  +S FQ+PLHYP+Y K+DY+KM EW+LD LL+EYGL   G   EKR 
Sbjct: 28 ETATKINHDKPQNQSSSFQIPLHYPKYTKSDYEKMPEWQLDRLLREYGLPVIGDSYEKRK 87

Query: 86 FAMGAFLWPDQ 97
          FA+GAFLW  +
Sbjct: 88 FAIGAFLWSSE 98

BLAST of Clc01G21760 vs. TAIR 10
Match: AT3G55570.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41761.1); Has 128 Blast hits to 128 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 128; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 80.1 bits (196), Expect = 1.1e-15
Identity = 36/53 (67.92%), Postives = 41/53 (77.36%), Query Frame = 0

Query: 41 SGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQGSLEEKRAFAMGAFLW 94
          S F+MPLHYPRY+K DYQ M EWKLD +L +YGLS  G L  KR FA+GAFLW
Sbjct: 30 SVFRMPLHYPRYSKEDYQDMPEWKLDRVLADYGLSTYGDLAHKRDFAIGAFLW 82

BLAST of Clc01G21760 vs. TAIR 10
Match: AT3G11405.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55570.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 58.9 bits (141), Expect = 2.6e-09
Identity = 31/54 (57.41%), Postives = 35/54 (64.81%), Query Frame = 0

Query: 41  SGFQMPLHYPRYNKADYQKMEEWKLDLLLKEYGLSFQ-GSLEEKRAFAMGAFLW 94
           S FQMPL YP Y K  Y  M E +LD LLK YGL    G+L  K+ FA+GAFLW
Sbjct: 55  SSFQMPLQYPNYAKEQYDIMSEEELDRLLKLYGLPTDIGNLSCKKEFAVGAFLW 108

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TYK12988.12.6e-4387.63uncharacterized protein E5676_scaffold255G005850 [Cucumis melo var. makuwa][more]
XP_038882557.17.4e-4390.72uncharacterized protein LOC120073788 [Benincasa hispida][more]
XP_011658113.13.7e-4287.50uncharacterized protein LOC105435946 [Cucumis sativus] >KGN49041.1 hypothetical ... [more]
KAG6603827.15.0e-3984.85hypothetical protein SDJN03_04436, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG6594704.17.5e-3579.17hypothetical protein SDJN03_11257, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3CP921.2e-4387.63Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A0A0KMW81.8e-4287.50Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511080 PE=4 SV=1[more]
A0A6J1BUU98.3e-3269.16uncharacterized protein LOC111005569 OS=Momordica charantia OX=3673 GN=LOC111005... [more]
A0A2N9EZP31.1e-2863.00Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8195 PE=4 SV=1[more]
A0A6J1B6U92.5e-2857.02uncharacterized protein LOC110424651 OS=Herrania umbratica OX=108875 GN=LOC11042... [more]
Match NameE-valueIdentityDescription
AT5G55620.15.5e-2058.57unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G09950.14.4e-1760.00unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G41761.17.5e-1752.11unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT3G55570.11.1e-1567.92unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G11405.12.6e-0957.41unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33513:SF2SUBFAMILY NOT NAMEDcoord: 1..97
NoneNo IPR availablePANTHERPTHR33513OS06G0523300 PROTEINcoord: 1..97

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc01G21760.1Clc01G21760.1mRNA