Clc01G22540 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G22540
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Unknown protein
Location: ClcChr01: 33650766 .. 33651558 (-)
RNA-Seq Expression: Clc01G22540
Synteny: Clc01G22540
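
The Location field packs the chromosome, start, end, and strand into a single string. As a minimal sketch, it could be split apart as below. The helper names `parse_location` and `span_length` are hypothetical, and the record does not state its coordinate convention, so 1-based inclusive coordinates are an assumption.

```python
import re

# Hypothetical helpers for illustration; not part of the database or any library.
LOCATION_RE = re.compile(r"([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (an assumption, not confirmed by the record).
    return end - start + 1

chrom, start, end, strand = parse_location("ClcChr01: 33650766 .. 33651558 (-)")
print(chrom, start, end, strand, span_length(start, end))
```

Under that assumption the feature spans 793 bases, and the `(-)` marks it as annotated on the reverse strand.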
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TATAAGTTAAGATATATACATATATATATATTATTTTCCTGGTTTAAGATATAATGGGAATCATATACACATGATCTGAAGGCCAGCTCAGGAGCTGCTTGGTCTCTCTTCCATAATTTAAAGCACATGCTGTGCTAGCTTGCTTCTTCATTGCCTTATGGATGACCTTCTTCATTGACCCCACAGCCAAAGTTCACACTTACCCATTTCCACACAACTTCACTCATAAAATGCCATTTCTCCACTCAAACTCTCTCCACTGCCTACATATTGAGCTTAGTTTAGAATCTCTCTCTAATTTTCTAAGTTGAGAAGAGAGATGAGTTTTGTTCACATTAGGAGGCCAAATGGTTACTCCAAGGTTGATAAAGAAGATCCAGAAGAGATAATTCATCGAAGAGCACAGTTTCTAATCAACAAAGTGTTGGAACGAGCAGATTCCATGGGGAAACCTTCGTATTTGAGAATTAGAATTCGGAGGCTGAAGATTAGATTTGGGAGAAGATTGAAGAAGCTAAAGAAGAGTGCAATGGCAAGTATTTCAACTCTCAAGATTGGTGTTTACAAGCAAGTCGTTTCTCAGATCAGAAATTGCAAGTCTTTGTTTGGCCGAAAACAAACTAACTTTCCCGATTTTCCTCTTTTGTTATCATCTTGAAGTGAACTCACCATAGTTTCTTCAGCCAAATCATTGGGATAATGACTTCTGTTTTCCCTTGGAGCTTTAATTTTGTTCATGAGCTGTTATATGAAAGAAAATTTTATTAAGCATTTATTTGGGTGCTTTTTTTTT

mRNA sequence

TATAAGTTAAGATATATACATATATATATATTATTTTCCTGGTTTAAGATATAATGGGAATCATATACACATGATCTGAAGGCCAGCTCAGGAGCTGCTTGGTCTCTCTTCCATAATTTAAAGCACATGCTGTGCTAGCTTGCTTCTTCATTGCCTTATGGATGACCTTCTTCATTGACCCCACAGCCAAAGTTCACACTTACCCATTTCCACACAACTTCACTCATAAAATGCCATTTCTCCACTCAAACTCTCTCCACTGCCTACATATTGAGCTTAGTTTAGAATCTCTCTCTAATTTTCTAAGTTGAGAAGAGAGATGAGTTTTGTTCACATTAGGAGGCCAAATGGTTACTCCAAGGTTGATAAAGAAGATCCAGAAGAGATAATTCATCGAAGAGCACAGTTTCTAATCAACAAAGTGTTGGAACGAGCAGATTCCATGGGGAAACCTTCGTATTTGAGAATTAGAATTCGGAGGCTGAAGATTAGATTTGGGAGAAGATTGAAGAAGCTAAAGAAGAGTGCAATGGCAAGTATTTCAACTCTCAAGATTGGTGTTTACAAGCAAGTCGTTTCTCAGATCAGAAATTGCAAGTCTTTGTTTGGCCGAAAACAAACTAACTTTCCCGATTTTCCTCTTTTGTTATCATCTTGAAGTGAACTCACCATAGTTTCTTCAGCCAAATCATTGGGATAATGACTTCTGTTTTCCCTTGGAGCTTTAATTTTGTTCATGAGCTGTTATATGAAAGAAAATTTTATTAAGCATTTATTTGGGTGCTTTTTTTTT

Coding sequence (CDS)

ATGAGTTTTGTTCACATTAGGAGGCCAAATGGTTACTCCAAGGTTGATAAAGAAGATCCAGAAGAGATAATTCATCGAAGAGCACAGTTTCTAATCAACAAAGTGTTGGAACGAGCAGATTCCATGGGGAAACCTTCGTATTTGAGAATTAGAATTCGGAGGCTGAAGATTAGATTTGGGAGAAGATTGAAGAAGCTAAAGAAGAGTGCAATGGCAAGTATTTCAACTCTCAAGATTGGTGTTTACAAGCAAGTCGTTTCTCAGATCAGAAATTGCAAGTCTTTGTTTGGCCGAAAACAAACTAACTTTCCCGATTTTCCTCTTTTGTTATCATCTTGA

Protein sequence

MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFGRRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS
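
The protein sequence above corresponds to the CDS read codon by codon. The sketch below shows that relationship on the first and last codons of the CDS, assuming the standard genetic code. The function name `translate` is hypothetical, and the record does not say which tool produced its translation.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons and last seven codons of the CDS listed above.
print(translate("ATGAGTTTTGTTCACATTAGG"))
print(translate("CCTCTTTTGTTATCATCTTGA"))
```

Passing the full CDS string to the same function should reproduce the listed protein, provided the standard code applies.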
Homology
BLAST of Clc01G22540 vs. NCBI nr
Match: XP_038883675.1 (uncharacterized protein LOC120074584 [Benincasa hispida])

HSP 1 Score: 208.0 bits (528), Expect = 4.3e-50
Identity = 106/112 (94.64%), Positives = 110/112 (98.21%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSFV IRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSFVLIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           RRLKKLKKSAMASISTLKIGVYKQV+SQIRNCKSLFG KQT+FP+FPLLLSS
Sbjct: 61  RRLKKLKKSAMASISTLKIGVYKQVISQIRNCKSLFGPKQTHFPNFPLLLSS 112
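
The identity figure in an HSP header is the number of aligned positions where query and subject carry the same residue, divided by the alignment length. A minimal sketch of that calculation for the ungapped alignment above follows. The function name `identity` is hypothetical. The positives count is not reproduced here because it depends on a substitution matrix.

```python
def identity(query, sbjct):
    """Return (identical positions, alignment length, percent identity)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    same = sum(1 for q, s in zip(query, sbjct) if q == s)
    return same, len(query), 100.0 * same / len(query)

# Query and subject rows of the Benincasa hispida hit, joined across both blocks.
QUERY = ("MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG"
         "RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS")
SBJCT = ("MSFVLIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKVRFG"
         "RRLKKLKKSAMASISTLKIGVYKQVISQIRNCKSLFGPKQTHFPNFPLLLSS")

same, length, percent = identity(QUERY, SBJCT)
print(f"Identity = {same}/{length} ({percent:.2f}%)")
```

For alignments that contain gaps, such as the TAIR hits further down, the gap columns count toward the alignment length but never toward the identical positions.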

BLAST of Clc01G22540 vs. NCBI nr
Match: KGN49119.1 (hypothetical protein Csa_003853 [Cucumis sativus])

HSP 1 Score: 194.1 bits (492), Expect = 6.4e-46
Identity = 98/112 (87.50%), Positives = 105/112 (93.75%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MS V  +R NGY KVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSLVLTKRSNGYCKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           RRLK+LKKSAM SISTLKIGVYKQV++QIRNCKSLFGRKQTNF +FP+LLSS
Sbjct: 61  RRLKRLKKSAMGSISTLKIGVYKQVITQIRNCKSLFGRKQTNFANFPVLLSS 112

BLAST of Clc01G22540 vs. NCBI nr
Match: XP_008439979.1 (PREDICTED: uncharacterized protein LOC103484604 [Cucumis melo] >TYK13053.1 uncharacterized protein E5676_scaffold255G006550 [Cucumis melo var. makuwa])

HSP 1 Score: 191.0 bits (484), Expect = 5.4e-45
Identity = 97/112 (86.61%), Positives = 105/112 (93.75%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSFV  +RPNGY KVDKEDPEEII RRAQFLINKVLERADSMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSFVLTKRPNGYCKVDKEDPEEIIRRRAQFLINKVLERADSMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           +RLKKLKKSAMASISTLK+GVYKQV++QIRN KSLFGRKQT F +FP+LLSS
Sbjct: 61  KRLKKLKKSAMASISTLKVGVYKQVITQIRNYKSLFGRKQTKFANFPVLLSS 112

BLAST of Clc01G22540 vs. NCBI nr
Match: XP_022142336.1 (uncharacterized protein LOC111012478 [Momordica charantia])

HSP 1 Score: 182.2 bits (461), Expect = 2.5e-42
Identity = 92/112 (82.14%), Positives = 100/112 (89.29%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSF+ IRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERA+SMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSFLLIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERAESMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           RRLKKLKKSA+ASIS  KIGVYKQV+ Q+RNCKSLFGRK+    +    LSS
Sbjct: 61  RRLKKLKKSALASISAAKIGVYKQVIGQLRNCKSLFGRKEATIANLSPFLSS 112

BLAST of Clc01G22540 vs. NCBI nr
Match: XP_023003962.1 (uncharacterized protein LOC111497408 [Cucurbita maxima])

HSP 1 Score: 181.8 bits (460), Expect = 3.3e-42
Identity = 91/112 (81.25%), Positives = 101/112 (90.18%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSFV ++  NGYSKVDKEDPEEIIHRRAQF+INKVLERADSMGK SYLRIRIRRL++RFG
Sbjct: 1   MSFVLVKSSNGYSKVDKEDPEEIIHRRAQFIINKVLERADSMGKASYLRIRIRRLRVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           RRLKKLKKSAMASI  +KI VYKQ++ QIRNCKSLFGRK+TNF + P LLSS
Sbjct: 61  RRLKKLKKSAMASICNVKISVYKQIICQIRNCKSLFGRKETNFANLPPLLSS 112

BLAST of Clc01G22540 vs. ExPASy TrEMBL
Match: A0A0A0KJX3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G514800 PE=4 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 3.1e-46
Identity = 98/112 (87.50%), Positives = 105/112 (93.75%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MS V  +R NGY KVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSLVLTKRSNGYCKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           RRLK+LKKSAM SISTLKIGVYKQV++QIRNCKSLFGRKQTNF +FP+LLSS
Sbjct: 61  RRLKRLKKSAMGSISTLKIGVYKQVITQIRNCKSLFGRKQTNFANFPVLLSS 112

BLAST of Clc01G22540 vs. ExPASy TrEMBL
Match: A0A5D3CMA4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G006550 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 2.6e-45
Identity = 97/112 (86.61%), Positives = 105/112 (93.75%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSFV  +RPNGY KVDKEDPEEII RRAQFLINKVLERADSMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSFVLTKRPNGYCKVDKEDPEEIIRRRAQFLINKVLERADSMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           +RLKKLKKSAMASISTLK+GVYKQV++QIRN KSLFGRKQT F +FP+LLSS
Sbjct: 61  KRLKKLKKSAMASISTLKVGVYKQVITQIRNYKSLFGRKQTKFANFPVLLSS 112

BLAST of Clc01G22540 vs. ExPASy TrEMBL
Match: A0A1S3B0N9 (uncharacterized protein LOC103484604 OS=Cucumis melo OX=3656 GN=LOC103484604 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 2.6e-45
Identity = 97/112 (86.61%), Positives = 105/112 (93.75%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSFV  +RPNGY KVDKEDPEEII RRAQFLINKVLERADSMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSFVLTKRPNGYCKVDKEDPEEIIRRRAQFLINKVLERADSMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           +RLKKLKKSAMASISTLK+GVYKQV++QIRN KSLFGRKQT F +FP+LLSS
Sbjct: 61  KRLKKLKKSAMASISTLKVGVYKQVITQIRNYKSLFGRKQTKFANFPVLLSS 112

BLAST of Clc01G22540 vs. ExPASy TrEMBL
Match: A0A6J1CLW2 (uncharacterized protein LOC111012478 OS=Momordica charantia OX=3673 GN=LOC111012478 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.2e-42
Identity = 92/112 (82.14%), Positives = 100/112 (89.29%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSF+ IRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERA+SMGKPSYLRIRIRRLK+RFG
Sbjct: 1   MSFLLIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERAESMGKPSYLRIRIRRLKVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           RRLKKLKKSA+ASIS  KIGVYKQV+ Q+RNCKSLFGRK+    +    LSS
Sbjct: 61  RRLKKLKKSALASISAAKIGVYKQVIGQLRNCKSLFGRKEATIANLSPFLSS 112

BLAST of Clc01G22540 vs. ExPASy TrEMBL
Match: A0A6J1KP23 (uncharacterized protein LOC111497408 OS=Cucurbita maxima OX=3661 GN=LOC111497408 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.6e-42
Identity = 91/112 (81.25%), Positives = 101/112 (90.18%), Query Frame = 0

Query: 1   MSFVHIRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLRIRIRRLKIRFG 60
           MSFV ++  NGYSKVDKEDPEEIIHRRAQF+INKVLERADSMGK SYLRIRIRRL++RFG
Sbjct: 1   MSFVLVKSSNGYSKVDKEDPEEIIHRRAQFIINKVLERADSMGKASYLRIRIRRLRVRFG 60

Query: 61  RRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLSS 113
           RRLKKLKKSAMASI  +KI VYKQ++ QIRNCKSLFGRK+TNF + P LLSS
Sbjct: 61  RRLKKLKKSAMASICNVKISVYKQIICQIRNCKSLFGRKETNFANLPPLLSS 112

BLAST of Clc01G22540 vs. TAIR 10
Match: AT1G11655.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G21902.1); Has 22 Blast hits to 22 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 22; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 69.3 bits (168), Expect = 2.2e-12
Identity = 40/103 (38.83%), Positives = 66/103 (64.08%), Query Frame = 0

Query: 7   RRPNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGK----PSYLRIRIRRLKIRFGRR 66
           +RP  YSK+DKEDPEE++ RRA+FLI K L+ AD + +     S++R+++  LK++ G+R
Sbjct: 5   KRPCLYSKMDKEDPEEVLSRRAKFLIYKTLQEADLISRRDPHSSFIRLKLYLLKVKIGKR 64

Query: 67  LKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLF-GRKQTNFP 105
           L KL++S ++++     G+ K   + +R  K +F G   T  P
Sbjct: 65  LAKLRRSVVSAVRF--GGIRKHSHNGVRALKKMFQGGATTGLP 105

BLAST of Clc01G22540 vs. TAIR 10
Match: AT4G04745.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G21902.1); Has 32 Blast hits to 32 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 32; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 65.1 bits (157), Expect = 4.1e-11
Identity = 42/112 (37.50%), Positives = 59/112 (52.68%), Query Frame = 0

Query: 9   PNGYSKVDKEDPEEIIHRRAQFLINKVLERADSMGKPSYLR---------IRIRRLKIRF 68
           P  Y+K++KEDP+E+IHRRAQFLI KVLERADS  +    R         IR+  +++R 
Sbjct: 34  PYEYTKMEKEDPQELIHRRAQFLIQKVLERADSKTRQHQQRRRSSGPLIMIRVVGIRMRI 93

Query: 69  GRRLKKLKKSAMASISTLKIGVYKQVVSQIRNCKSLFGRKQTNFPDFPLLLS 112
           G++L+KL+K+     + L     K     +  C S          D P L S
Sbjct: 94  GKKLRKLRKTNTCICNNLITRFLKSFKRFL--CSSSSSSSSRTISDLPPLFS 143

BLAST of Clc01G22540 vs. TAIR 10
Match: AT4G21902.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G04745.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 58.2 bits (139), Expect = 5.1e-09
Identity = 30/75 (40.00%), Positives = 53/75 (70.67%), Query Frame = 0

Query: 6  IRRPNGYSKVDKEDPEEIIHRRAQFLINKVLERAD------SMGKPSYLRI---RIRRLK 65
          ++ PN Y+K++KED  EIIHRRAQFLI+K+L+RAD         + + +++   R+  ++
Sbjct: 4  MKNPNQYTKIEKEDLNEIIHRRAQFLIHKILQRADIETLRQQQKRNTTIKLFSFRVVGIR 63

Query: 66 IRFGRRLKKLKKSAM 72
          ++ G++L+KL+KS +
Sbjct: 64 MKIGKKLRKLRKSCV 78

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value   Identity (%)   Description
XP_038883675.1   4.3e-50   94.64          uncharacterized protein LOC120074584 [Benincasa hispida]
KGN49119.1       6.4e-46   87.50          hypothetical protein Csa_003853 [Cucumis sativus]
XP_008439979.1   5.4e-45   86.61          PREDICTED: uncharacterized protein LOC103484604 [Cucumis melo] >TYK13053.1 uncha...
XP_022142336.1   2.5e-42   82.14          uncharacterized protein LOC111012478 [Momordica charantia]
XP_023003962.1   3.3e-42   81.25          uncharacterized protein LOC111497408 [Cucurbita maxima]

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A0A0KJX3       3.1e-46   87.50          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G514800 PE=4 SV=1
A0A5D3CMA4       2.6e-45   86.61          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B0N9       2.6e-45   86.61          uncharacterized protein LOC103484604 OS=Cucumis melo OX=3656 GN=LOC103484604 PE=...
A0A6J1CLW2       1.2e-42   82.14          uncharacterized protein LOC111012478 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A6J1KP23       1.6e-42   81.25          uncharacterized protein LOC111497408 OS=Cucurbita maxima OX=3661 GN=LOC111497408...

TAIR 10
Match Name       E-value   Identity (%)   Description
AT1G11655.1      2.2e-12   38.83          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G04745.1      4.1e-11   37.50          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G21902.1      5.1e-09   40.00          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description    Source    Source Term     Source Description     Alignment
None       No IPR available   PANTHER   PTHR35687:SF1   OS07G0516700 PROTEIN   coord: 5..101
None       No IPR available   PANTHER   PTHR35687       OS07G0516700 PROTEIN   coord: 5..101

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Clc01G22540.1   Clc01G22540.1   mRNA