Cla002026 (gene) Watermelon (97103) v1

NameCla002026
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionUnknown Protein (AHRD V1)
LocationChr8 : 10015974 .. 10016558 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACGAGGGAAAGGGTCGCAAGCAGTTCCACAATTTGCTTTTGGAGATGAGCTTCTTCCCTGAGGAGGCGCCCCTATTAGCCTACATCATGGACATCATAGACCACCATGGCTGGCAAAACTTCTATGTCGATCCAACTATCATGCAGCTACCAGTCGTCCGTGCATTTTATGTGGGTCAGATTAATGACGAAGAAGACATGGTAACATTCCATGGATGTGAAGTCCCATTTAACGCAAGGGAGATCAATGCCATATTCCAGTTGAGGGATAACTCGAATATAGAGGGCAATCAGTTAGTTGCCTCAACGTTTCATGAATACATGCAAGATGCCATCTGGGTTATGGCCAAGTCAGGCTCAAAATGGGACGTTTCACCCGCAGGCATCAAGACACTGCCTTCCAACTACCTCTTGCCTGAGGCCAAATTGTGGGTGTATTTCGCAAAGAAGCGACTCATTCCCACCACCCACAACAAAACCGTTTCAAAAGATAAAGTGTTGACAGCTGATTACTTGTGGAAATACAAGTTATTGTATTCTTTATTTGGAGTAATTAGAGTAATATTGATGATAAACACATGA

mRNA sequence

ATGGACGAGGGAAAGGGTCGCAAGCAGTTCCACAATTTGCTTTTGGAGATGAGCTTCTTCCCTGAGGAGGCGCCCCTATTAGCCTACATCATGGACATCATAGACCACCATGGCTGGCAAAACTTCTATGTCGATCCAACTATCATGCAGCTACCAGTCGTCCGTGCATTTTATGTGGGTCAGATTAATGACGAAGAAGACATGGTAACATTCCATGGATGTGAAGTCCCATTTAACGCAAGGGAGATCAATGCCATATTCCAGTTGAGGGATAACTCGAATATAGAGGGCAATCAGTTAGTTGCCTCAACGTTTCATGAATACATGCAAGATGCCATCTGGGTTATGGCCAAGTCAGGCTCAAAATGGGACGTTTCACCCGCAGGCATCAAGACACTGCCTTCCAACTACCTCTTGCCTGAGGCCAAATTGTGGGTGTATTTCGCAAAGAAGCGACTCATTCCCACCACCCACAACAAAACCGTTTCAAAAGATAAAGTGTTGACAGCTGATTACTTGTGGAAATACAAGTTATTGTATTCTTTATTTGGAGTAATTAGAGTAATATTGATGATAAACACATGA

Coding sequence (CDS)

ATGGACGAGGGAAAGGGTCGCAAGCAGTTCCACAATTTGCTTTTGGAGATGAGCTTCTTCCCTGAGGAGGCGCCCCTATTAGCCTACATCATGGACATCATAGACCACCATGGCTGGCAAAACTTCTATGTCGATCCAACTATCATGCAGCTACCAGTCGTCCGTGCATTTTATGTGGGTCAGATTAATGACGAAGAAGACATGGTAACATTCCATGGATGTGAAGTCCCATTTAACGCAAGGGAGATCAATGCCATATTCCAGTTGAGGGATAACTCGAATATAGAGGGCAATCAGTTAGTTGCCTCAACGTTTCATGAATACATGCAAGATGCCATCTGGGTTATGGCCAAGTCAGGCTCAAAATGGGACGTTTCACCCGCAGGCATCAAGACACTGCCTTCCAACTACCTCTTGCCTGAGGCCAAATTGTGGGTGTATTTCGCAAAGAAGCGACTCATTCCCACCACCCACAACAAAACCGTTTCAAAAGATAAAGTGTTGACAGCTGATTACTTGTGGAAATACAAGTTATTGTATTCTTTATTTGGAGTAATTAGAGTAATATTGATGATAAACACATGA

Protein sequence

MDEGKGRKQFHNLLLEMSFFPEEAPLLAYIMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQINDEEDMVTFHGCEVPFNAREINAIFQLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKWDVSPAGIKTLPSNYLLPEAKLWVYFAKKRLIPTTHNKTVSKDKVLTADYLWKYKLLYSLFGVIRVILMINT
BLAST of Cla002026 vs. TrEMBL
Match: W9QTD9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 8.3e-16
Identity = 54/161 (33.54%), Postives = 83/161 (51.55%), Query Frame = 1

Query: 28  AYIMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQINDEEDMVTFHGCEVPFNAREINAIF 87
           A+I  +I  HGW+ F   P+   +P+VR FY   ++  ++ V     +VPF AR IN+IF
Sbjct: 5   AFITRVIHQHGWRQFCQHPSNPIVPLVREFYANLLDFNQETVFVQNVKVPFTARAINSIF 64

Query: 88  QLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKWDVSPAGIKTLPSNYLLPEAKLWVY 147
            L +  + E     +    E ++  +  +A  G+ W +SP G  T     L   A++W +
Sbjct: 65  GLEEVVD-EYVDFASEVTDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWYH 124

Query: 148 FAKKRLIPTTHNKTVSKDKVLTADYLWKYKLLYSLFGVIRV 189
           F   R +P+TH KTV+KD+VL         LLYS+   I V
Sbjct: 125 FLTARFMPSTHGKTVAKDRVL---------LLYSILTGISV 155

BLAST of Cla002026 vs. TrEMBL
Match: W9RBS1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000844 PE=4 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 1.1e-12
Identity = 46/160 (28.75%), Postives = 82/160 (51.25%), Query Frame = 1

Query: 12  NLLLEMSFFPEEAPLLA---YIMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQINDEEDM 71
           NL+ E  F   ++P L    +I D+I   GWQ F   P    +P+V+ FY    N  ++ 
Sbjct: 58  NLVKEKGFLLHDSPTLGQPGFISDVIISRGWQIFCRHPIDPIVPLVKEFYANLQNQGQNT 117

Query: 72  VTFHGCEVPFNAREINAIFQLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKWDVSPA 131
           V     ++ F +  IN +  +  N + E  +L+     E +++ +  +A  G++W +S  
Sbjct: 118 VFVWEIDITFTSNYINGVLGI-PNQDDEFVELITDAIEEQLKEVLKTIAILGAQWLLSAK 177

Query: 132 GIKTLPSNYLLPEAKLWVYFAKKRLIPTTHNKTVSKDKVL 169
           G  T   + L P AK+W +F   RL+ +TH KT+S+++ +
Sbjct: 178 GSYTCNRHELQPAAKVWYHFLASRLLLSTHGKTISRNRAI 216

BLAST of Cla002026 vs. TrEMBL
Match: A0A061FAJ6_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_032752 PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 5.2e-10
Identity = 44/165 (26.67%), Postives = 79/165 (47.88%), Query Frame = 1

Query: 9   QFHNLLLEMSFFPE---EAPLLAY--IMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQIN 68
           +++  L+     PE   E P+L Y  I D+I    W  F   P ++ + VVR FY   + 
Sbjct: 27  RYYTSLINKVPIPERGIEIPILPYKEINDLIRDRYWHQFCHQPNVVVVLVVREFYATVVE 86

Query: 69  DEEDMVTFHGCEVPFNAREINAIFQLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKW 128
             + +    G  VPF+++ IN + +  +  N E  Q +    H+   + I  +   G++W
Sbjct: 87  HVDGVAFVRGKHVPFHSQAINELLRTPNIENDEYGQYLGD--HQDCNEIISTLCIEGAQW 146

Query: 129 DVSPAGIKTLPSNYLLPEAKLWVYFAKKRLIPTTHNKTVSKDKVL 169
             S     +   + +  E K+W++F   RL+P+TH   V+KD+ +
Sbjct: 147 KTSHGEPVSFKRSVMKKELKVWLHFVAARLLPSTHISDVTKDRAV 189

BLAST of Cla002026 vs. TrEMBL
Match: W9RK06_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000483 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 4.9e-08
Identity = 36/141 (25.53%), Postives = 69/141 (48.94%), Query Frame = 1

Query: 28  AYIMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQINDEEDMVTFHGCEVPFNAREINAIF 87
           ++I  +I  H W +F  +P    + +VR +       E D +   G  +PF +  IN+++
Sbjct: 72  SFIHSVIAAHKWHSFCHNPHAATVQLVREYAA-----EPDTIFVRGQLIPFTSEAINSLY 131

Query: 88  QLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKWDVSPAGIKTLPSNYLLPEAKLWVY 147
            L D  +   N    S   + + + I  +   G++W  +  G  T P   L P +K+W +
Sbjct: 132 DLPDVED-HFNNFADSLNEDQLDEVINELCVEGTEWRRATRGSMTFPRECLQPGSKIWYH 191

Query: 148 FAKKRLIPTTHNKTVSKDKVL 169
           F + RL+P++H + V K++ +
Sbjct: 192 FLRFRLMPSSHYRLVHKERAI 206

BLAST of Cla002026 vs. NCBI nr
Match: gi|703087370|ref|XP_010093253.1| (hypothetical protein L484_022412 [Morus notabilis])

HSP 1 Score: 92.0 bits (227), Expect = 1.2e-15
Identity = 54/161 (33.54%), Postives = 83/161 (51.55%), Query Frame = 1

Query: 28  AYIMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQINDEEDMVTFHGCEVPFNAREINAIF 87
           A+I  +I  HGW+ F   P+   +P+VR FY   ++  ++ V     +VPF AR IN+IF
Sbjct: 5   AFITRVIHQHGWRQFCQHPSNPIVPLVREFYANLLDFNQETVFVQNVKVPFTARAINSIF 64

Query: 88  QLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKWDVSPAGIKTLPSNYLLPEAKLWVY 147
            L +  + E     +    E ++  +  +A  G+ W +SP G  T     L   A++W +
Sbjct: 65  GLEEVVD-EYVDFASEVTDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWYH 124

Query: 148 FAKKRLIPTTHNKTVSKDKVLTADYLWKYKLLYSLFGVIRV 189
           F   R +P+TH KTV+KD+VL         LLYS+   I V
Sbjct: 125 FLTARFMPSTHGKTVAKDRVL---------LLYSILTGISV 155

BLAST of Cla002026 vs. NCBI nr
Match: gi|703082781|ref|XP_010092041.1| (hypothetical protein L484_000844 [Morus notabilis])

HSP 1 Score: 81.6 bits (200), Expect = 1.6e-12
Identity = 46/160 (28.75%), Postives = 82/160 (51.25%), Query Frame = 1

Query: 12  NLLLEMSFFPEEAPLLA---YIMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQINDEEDM 71
           NL+ E  F   ++P L    +I D+I   GWQ F   P    +P+V+ FY    N  ++ 
Sbjct: 58  NLVKEKGFLLHDSPTLGQPGFISDVIISRGWQIFCRHPIDPIVPLVKEFYANLQNQGQNT 117

Query: 72  VTFHGCEVPFNAREINAIFQLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKWDVSPA 131
           V     ++ F +  IN +  +  N + E  +L+     E +++ +  +A  G++W +S  
Sbjct: 118 VFVWEIDITFTSNYINGVLGI-PNQDDEFVELITDAIEEQLKEVLKTIAILGAQWLLSAK 177

Query: 132 GIKTLPSNYLLPEAKLWVYFAKKRLIPTTHNKTVSKDKVL 169
           G  T   + L P AK+W +F   RL+ +TH KT+S+++ +
Sbjct: 178 GSYTCNRHELQPAAKVWYHFLASRLLLSTHGKTISRNRAI 216

BLAST of Cla002026 vs. NCBI nr
Match: gi|590612524|ref|XP_007022408.1| (Uncharacterized protein TCM_032752 [Theobroma cacao])

HSP 1 Score: 72.8 bits (177), Expect = 7.5e-10
Identity = 44/165 (26.67%), Postives = 79/165 (47.88%), Query Frame = 1

Query: 9   QFHNLLLEMSFFPE---EAPLLAY--IMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQIN 68
           +++  L+     PE   E P+L Y  I D+I    W  F   P ++ + VVR FY   + 
Sbjct: 27  RYYTSLINKVPIPERGIEIPILPYKEINDLIRDRYWHQFCHQPNVVVVLVVREFYATVVE 86

Query: 69  DEEDMVTFHGCEVPFNAREINAIFQLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKW 128
             + +    G  VPF+++ IN + +  +  N E  Q +    H+   + I  +   G++W
Sbjct: 87  HVDGVAFVRGKHVPFHSQAINELLRTPNIENDEYGQYLGD--HQDCNEIISTLCIEGAQW 146

Query: 129 DVSPAGIKTLPSNYLLPEAKLWVYFAKKRLIPTTHNKTVSKDKVL 169
             S     +   + +  E K+W++F   RL+P+TH   V+KD+ +
Sbjct: 147 KTSHGEPVSFKRSVMKKELKVWLHFVAARLLPSTHISDVTKDRAV 189

BLAST of Cla002026 vs. NCBI nr
Match: gi|703109223|ref|XP_010099236.1| (hypothetical protein L484_000483 [Morus notabilis])

HSP 1 Score: 66.2 bits (160), Expect = 7.0e-08
Identity = 36/141 (25.53%), Postives = 69/141 (48.94%), Query Frame = 1

Query: 28  AYIMDIIDHHGWQNFYVDPTIMQLPVVRAFYVGQINDEEDMVTFHGCEVPFNAREINAIF 87
           ++I  +I  H W +F  +P    + +VR +       E D +   G  +PF +  IN+++
Sbjct: 72  SFIHSVIAAHKWHSFCHNPHAATVQLVREYAA-----EPDTIFVRGQLIPFTSEAINSLY 131

Query: 88  QLRDNSNIEGNQLVASTFHEYMQDAIWVMAKSGSKWDVSPAGIKTLPSNYLLPEAKLWVY 147
            L D  +   N    S   + + + I  +   G++W  +  G  T P   L P +K+W +
Sbjct: 132 DLPDVED-HFNNFADSLNEDQLDEVINELCVEGTEWRRATRGSMTFPRECLQPGSKIWYH 191

Query: 148 FAKKRLIPTTHNKTVSKDKVL 169
           F + RL+P++H + V K++ +
Sbjct: 192 FLRFRLMPSSHYRLVHKERAI 206

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
W9QTD9_9ROSA8.3e-1633.54Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1[more]
W9RBS1_9ROSA1.1e-1228.75Uncharacterized protein OS=Morus notabilis GN=L484_000844 PE=4 SV=1[more]
A0A061FAJ6_THECC5.2e-1026.67Uncharacterized protein OS=Theobroma cacao GN=TCM_032752 PE=4 SV=1[more]
W9RK06_9ROSA4.9e-0825.53Uncharacterized protein OS=Morus notabilis GN=L484_000483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|703087370|ref|XP_010093253.1|1.2e-1533.54hypothetical protein L484_022412 [Morus notabilis][more]
gi|703082781|ref|XP_010092041.1|1.6e-1228.75hypothetical protein L484_000844 [Morus notabilis][more]
gi|590612524|ref|XP_007022408.1|7.5e-1026.67Uncharacterized protein TCM_032752 [Theobroma cacao][more]
gi|703109223|ref|XP_010099236.1|7.0e-0825.53hypothetical protein L484_000483 [Morus notabilis][more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla002026Cla002026.1mRNA


The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None