Cla014404 (gene) Watermelon (97103) v1

Name: Cla014404
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1)
Location: Chr1 : 30524982 .. 30528021 (+)
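
The length of the gene span follows from the start and end coordinates in the location field. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (the page does not say so explicitly); the helper name `parse_location` and its regular expression are illustrative and not part of any database API:

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr1 : 30524982 .. 30528021 (+)' into fields.

    Hypothetical helper written for this record's format only.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 30524982 .. 30528021 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 3040 bp on the plus strand of Chr1.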
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGTTTATTCATTAGCAAATTCCATCCATTTCCATATCCCCAACACATTCTCAAACTCAACCTCCAACAACCTCAATGCCACCACCTCTCTAAGCAGTTGTGTTCCTCCGGCGCTCTCGGTCAACCGTCGAGGCTCTTTGTGCGTTAAATGCCGTGGTAGTGCCAAGCCGGAAAATAAAAACCACGATGAAAATGACCCTCTTGAAACAATCGACAAACTTTACAAAACCATCAAGAAGAAAGACATCGTCGAATTGTCTAATGTAATAGCCGATCAACATCCACATATTTTCGATTCCATCCCTTTTCTTCAAACCAATTTGGTAATTTTTTAAATTATTGAAAGAAGTTGCTACACAATTTGAAATTGTATCTAGTAATTTTGATGATGGTGGTGTGTGTGTCACACACACATATATATACTTTGGGTTATAACTATATATATATTTTTTGAACAGAAAATGTGGAATATGGTGTCCAATATCATAAGGGGATTACAAGATAGCCTAGTATTTTCAGTGCAGCCAACAACAAAAGATGGCTCGATGGTGGGCATTAAATGGAGAGTAGGTACATATTCCAAAACTAAAACCACCTTTAATTTGCTTGAATTAATGCTAATTCGTGGATTGATTACCCTTTCATTTTTAGTTTTCTTTTTTTTAAAAAATAATGCTCATATTCTTAATCAAGTTATTAAATAAAAATAAATTTTAGTTTTTAAAATTTAGCTTTAAAAAATAAAATGGATAAAGTGAGATCTAATTAATAAATGTGGCACTGGCAGGGTGGCATAAACCGCTCATAGGGTCCGAAAAAGGAGTCAATATCCATTCTCATCATATCTATACCGGAAAATTGCTTATTGGGTACTTCAAAATATTCAAACATCTCTTTTTCTTATTTTTCTTCAACTTCATACTCTCAATAATTTTATTTTTAATTTGAATTGGATTCAGAAATTTTGAAATGTTATTGGATCCTCTTCTTCAACTCGGGCCAACAAAGATGGTGAGTGAAATTTTATAACTCAATTAAAGTCTATAATCAAAACTTCTATTCTTGCTATTGTTGAACTTTATTAAAAGAAAAAAAAAAATCATAACTTTGGAGGTTTATGCAAATAAGAATGGAAAGTTAAAATAACATATATGTAAATAGATGAGATGAAATTTGGTTGACTGAGAATATTATGATTGTAATAATGTGTGGAATTAAAAATGTTTATTATATTGGATTTGGAGCAGATGAAATGGAATGAAGAGTCAAAGTCAAAAGAGAAGAGAATTGCATCAATGTGTTTGGGTGTTTTCCTCCTCCTTGTATCACTCTTTTGTCTCCAGTTTTCTGTTCGTTAAGTTAAAACTACCTTTAATTGGATCATCTAAATCTTATATTCATATTCTACTTTATTCCTTTTAAAATGTACCTCATTAAACAATAACATTATCTTCCAATATTTTTTTATGCATTATTTGCACCATTTTGAGCGAAAAATTCAACCTCCAACTTCTAAAATGGTAACTAAGTATTTTATGTAAGGAAATTGTCATAAATAACCTACTAATATTGTTTTCATCTTTCAAAAAATCAACTCGCATCCTTTAGAGAATTCGTTGAAATAGTTTTTCTCAGTTCATCCAAGTTCGAGATTGACTATCCAAATTCAAATAAAAAATTATGTATGTATGAAGATCAAGAAAAATTATTTCAATGAATTTTCTAAAAGTGTGATAGTTTTAAAAAATGAAAACCTTTTTATGGTCACTTTTGAAAATTTTCCTTTTATCTAATGAAACATACTTAAATTTTCAAATATGATTATTACCTCATTGTTTCCTAGCTACCTTTTGAGTTAACTATAGTGTTTCAAATTGGAACTTTTAAATTTTTGTTAAGATTGTTTCAATTATTTCATATTTTCATATATTTGGTGTCAAATTTATAAATTAAATTTTAATTAAATGATTGCTCAAAATATGAGTAATAATATATT
TATAAAGCATAACCTTAATTATAATTAGTCACTAAATATTAAGTTAAATAATATTTACTATTAATTGATTTATGAACTTAATTTATGTTTGAAAAAATCCTATATTTATTATTTTGTAATAACATATAATTTATAATCTATTATACAAACATATTTTATTTTGACAAAAATAATTGAGTTTAAAGCCTAAAATGCTATTTTTGTTATTATTTTTACCTTTTTAAATATAATTCTACATTATACACTAAAAATATAAAAATGTTATTTTCAGAACATGCAATGTTTGGATCATATAATCCAAAAAGAATTTTTTGTATACCAAACAGGTCCTAAAAATTTTTAAATTTGTTTTATTTTGATCCTAGAATTTTTAAAATGTCTATTTTAGTCTTTAAATTTTATTATATTTTTTTTTCTTACCTTTATTTTGTTTTTTGATTAATGAATGACACCATTTTTAGTACGTGCGAGCATGATTTCAAGTCATATATGATATAGTTGGAGACAAACATTTGACAAGTAGATATTAGATGCAAAAATATTACAAGAATGACTAAAATGATGTATTCTTTTTCAGTTAAGAGATTAAAATAGAACTTTTGAAAGTTTAGAGATCAAAATTTATTTTTTAATTAATAAACTAATTTTAAAAGGTTAAAAAAAAAGAGAGAGAGAAAGATTGAAGTAATATTTATAAGCTTAATTCTCCTGAGTCAAAAACTATAAAATCAAATTTGGAGCTTAAAAGATTAAAAATGAAATTGAAAAGAGAATATTGAATGTGCGCCAAGGAGAGTCTATGCAATTAGTGTGTTATAAGATAGGTTTAAAATATAATCGTTGATAGAAAGATCGAGACTTTGATTTTATGTAAATATCAACAAAATATTAACGTGGATAGATATTTTTAGAAAATTTATAAAAATAAAAATTCAAGAGTATATTTAATAATTAATAAACATTTTACTGTTTTTAAATAGGTTAAGATGACTATGATAATTTATATTAATGATATTTGGTTGTTTATTCATATAGATTAA

mRNA sequence

ATGGCTGTTTATTCATTAGCAAATTCCATCCATTTCCATATCCCCAACACATTCTCAAACTCAACCTCCAACAACCTCAATGCCACCACCTCTCTAAGCAGTTGTGTTCCTCCGGCGCTCTCGGTCAACCGTCGAGGCTCTTTGTGCGTTAAATGCCGTGGTAGTGCCAAGCCGGAAAATAAAAACCACGATGAAAATGACCCTCTTGAAACAATCGACAAACTTTACAAAACCATCAAGAAGAAAGACATCGTCGAATTGTCTAATGTAATAGCCGATCAACATCCACATATTTTCGATTCCATCCCTTTTCTTCAAACCAATTTGAAAATGTGGAATATGGTGTCCAATATCATAAGGGGATTACAAGATAGCCTAGTATTTTCAGTGCAGCCAACAACAAAAGATGGCTCGATGGTGGGCATTAAATGGAGAGTAGATTAA

Coding sequence (CDS)

ATGGCTGTTTATTCATTAGCAAATTCCATCCATTTCCATATCCCCAACACATTCTCAAACTCAACCTCCAACAACCTCAATGCCACCACCTCTCTAAGCAGTTGTGTTCCTCCGGCGCTCTCGGTCAACCGTCGAGGCTCTTTGTGCGTTAAATGCCGTGGTAGTGCCAAGCCGGAAAATAAAAACCACGATGAAAATGACCCTCTTGAAACAATCGACAAACTTTACAAAACCATCAAGAAGAAAGACATCGTCGAATTGTCTAATGTAATAGCCGATCAACATCCACATATTTTCGATTCCATCCCTTTTCTTCAAACCAATTTGAAAATGTGGAATATGGTGTCCAATATCATAAGGGGATTACAAGATAGCCTAGTATTTTCAGTGCAGCCAACAACAAAAGATGGCTCGATGGTGGGCATTAAATGGAGAGTAGATTAA

Protein sequence

MAVYSLANSIHFHIPNTFSNSTSNNLNATTSLSSCVPPALSVNRRGSLCVKCRGSAKPENKNHDENDPLETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIPFLQTNLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWRVD
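
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code and uses only the first 30 and last 24 nucleotides of the CDS listed above, to keep the example short; `translate` and `CODON_TABLE` are illustrative names, not functions of any particular library:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (the third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCTGTTTATTCATTAGCAAATTCCATC"  # first 30 nt of the CDS
cds_end = "GGCATTAAATGGAGAGTAGATTAA"          # last 24 nt, ending in the TAA stop

print(translate(cds_start))  # MAVYSLANSI
print(translate(cds_end))    # GIKWRVD
```

Both fragments agree with the corresponding ends of the protein sequence (MAVYSLANSI... and ...GIKWRVD).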
BLAST of Cla014404 vs. TrEMBL
Match: A0A0A0KL37_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G511720 PE=4 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 9.1e-31
Identity = 82/159 (51.57%), Positives = 105/159 (66.04%), Query Frame = 1

Query: 1   MAVYSLANSIHFHIPNTF-SNSTSNNLNATTSLSSCV------PPALSVNRR---GSLCV 60
           MA  SL N I   IP +  S   S NLNA  S  + +      P  LS  RR    SLC+
Sbjct: 1   MAFSSLQNFIR--IPTSLISIPNSKNLNAAASFHTTINNSYLPPQLLSTKRRRPAASLCI 60

Query: 61  KCRGS---AKPENKNHDENDPLETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIPFLQT 120
           +C G    +  ++ + D   PLETI+K+YK+IKKKD+ +L+NVIADQ P I DSIPFL+T
Sbjct: 61  QCHGGGFFSMDDDDSSDNEGPLETINKVYKSIKKKDVAKLANVIADQRPDIVDSIPFLRT 120

Query: 121 NLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWRV 147
            LKM  + S+II+GLQ++LVFS+QPTTKDGSMVGI W+V
Sbjct: 121 RLKMRKLASHIIKGLQENLVFSIQPTTKDGSMVGIIWKV 157
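
The percentages in an HSP statistics line are the ratio of matching (or positively scoring) positions to the alignment length. A minimal sketch that recomputes them from the fractions in such a line; the function name `hsp_percentages` is hypothetical, and the regular expression assumes the "count/length" layout shown above:

```python
import re

def hsp_percentages(stats_line):
    """Return the percentage for each 'n/d' fraction in an HSP statistics line.

    Fractions are returned in the order they appear (identity first, then
    positives in the lines shown in this record).
    """
    fractions = re.findall(r"=\s*(\d+)/(\d+)", stats_line)
    return [round(100 * int(n) / int(d), 2) for n, d in fractions]

line = "Identity = 82/159 (51.57%), Positives = 105/159 (66.04%), Query Frame = 1"
print(hsp_percentages(line))  # [51.57, 66.04]
```

For the first TrEMBL hit, 82 identical positions over an alignment of 159 columns gives 51.57%, and 105 positive positions gives 66.04%, matching the reported values.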

BLAST of Cla014404 vs. TrEMBL
Match: A0A061GP35_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_030364 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 6.3e-08
Identity = 40/118 (33.90%), Positives = 63/118 (53.39%), Query Frame = 1

Query: 34  SCVPPALSVNRRGSLCVKCRGSAKPENKNHDEND-PLETIDKLYKTIKKKDIVELSNVIA 93
           S +P +    RRG   V      K  +   +E+   LET+ KLY  IK +++ ELS++I 
Sbjct: 57  SLIPASYPTRRRGLSVVPLDAKTKSSDPGGEEDSGALETVLKLYSAIKNQNVRELSDIID 116

Query: 94  DQHP---HIFDSIPFLQTNLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWRVD 148
           D+     + F S   LQ   ++    +++I+ L D + F VQPT  DG +VGI WR++
Sbjct: 117 DECRCICNFFSSFQPLQGKKQVLEFFASLIKFLGDHIEFVVQPTLHDGMVVGIHWRLE 174

BLAST of Cla014404 vs. TrEMBL
Match: K4D635_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 2.7e-06
Identity = 37/104 (35.58%), Positives = 62/104 (59.62%), Query Frame = 1

Query: 50  VKCRGSAKPENKNHDENDP--LETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIPFLQT 109
           +KC  S   EN N +E DP  +ET+ KLYK +K K+++ELS++I ++   I +    LQT
Sbjct: 65  LKCNKSDNSENNNPEE-DPKAIETVQKLYKALKNKNLIELSDIIGEECRCISNVASSLQT 124

Query: 110 ---NLKMWNMVSNIIRGL-QDSLVFSVQPTTKDGSMVGIKWRVD 148
                ++ +   +II+ L  ++  F  +PTT DG+ VG+ W ++
Sbjct: 125 FYGKEQVIDFFKSIIKLLGNNNFEFVFKPTTHDGTHVGVAWELE 167

BLAST of Cla014404 vs. TrEMBL
Match: A0A067K560_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22251 PE=4 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 4.5e-06
Identity = 34/106 (32.08%), Positives = 57/106 (53.77%), Query Frame = 1

Query: 44  RRGSLCVKCRGSAKPENKNHDENDPLETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIP 103
           RR    V    + +  + N +E   LET+ KLY  IK ++I ++SN+IAD+   + +   
Sbjct: 20  RRCLSLVPLSNAKRSVSGNDEEKQALETVLKLYSAIKNQNIHQVSNIIADECRCVCNFFS 79

Query: 104 FLQT---NLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWRV 147
           F Q+     ++ +    +IR   D++ F V+PT  DG  VG+ WR+
Sbjct: 80  FFQSFHGKQQVLDFFKYVIRIFGDNIEFVVKPTVHDGMNVGVSWRL 125

BLAST of Cla014404 vs. NCBI nr
Match: gi|659079093|ref|XP_008440070.1| (PREDICTED: uncharacterized protein LOC103484658 [Cucumis melo])

HSP 1 Score: 152.1 bits (383), Expect = 7.4e-34
Identity = 86/160 (53.75%), Positives = 110/160 (68.75%), Query Frame = 1

Query: 2   AVYSLANSIHFHIPNTFS-NSTSNNLNATTSLSSCV-------PPALSVNRRG--SLCVK 61
           A+ SL N IHF  P +FS +S S NLNA  S  + +       P  LS  RR   SLCV+
Sbjct: 4   ALSSLQNFIHF--PTSFSISSNSRNLNAAASFHTTINNNSYLPPQPLSTKRRRRPSLCVQ 63

Query: 62  CRGSAK-----PENKNHDENDPLETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIPFLQ 121
           C G         ++ + ++ DPLETI+K+YK+IKKKD++EL+NVIADQ P I DSIPFL 
Sbjct: 64  CHGGGSFSMKNQDDNDDNDEDPLETINKVYKSIKKKDVIELANVIADQRPDIMDSIPFLG 123

Query: 122 TNLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWRV 147
           T+LKM  +VSN+++GLQ+SLVF V PT KDGSMVGIKW+V
Sbjct: 124 TSLKMRKLVSNMVKGLQESLVFYVLPTKKDGSMVGIKWKV 161

BLAST of Cla014404 vs. NCBI nr
Match: gi|778720180|ref|XP_011658120.1| (PREDICTED: uncharacterized protein LOC105435947 isoform X1 [Cucumis sativus])

HSP 1 Score: 141.4 bits (355), Expect = 1.3e-30
Identity = 82/159 (51.57%), Positives = 105/159 (66.04%), Query Frame = 1

Query: 1   MAVYSLANSIHFHIPNTF-SNSTSNNLNATTSLSSCV------PPALSVNRR---GSLCV 60
           MA  SL N I   IP +  S   S NLNA  S  + +      P  LS  RR    SLC+
Sbjct: 1   MAFSSLQNFIR--IPTSLISIPNSKNLNAAASFHTTINNSYLPPQLLSTKRRRPAASLCI 60

Query: 61  KCRGS---AKPENKNHDENDPLETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIPFLQT 120
           +C G    +  ++ + D   PLETI+K+YK+IKKKD+ +L+NVIADQ P I DSIPFL+T
Sbjct: 61  QCHGGGFFSMDDDDSSDNEGPLETINKVYKSIKKKDVAKLANVIADQRPDIVDSIPFLRT 120

Query: 121 NLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWRV 147
            LKM  + S+II+GLQ++LVFS+QPTTKDGSMVGI W+V
Sbjct: 121 RLKMRKLASHIIKGLQENLVFSIQPTTKDGSMVGIIWKV 157

BLAST of Cla014404 vs. NCBI nr
Match: gi|778720183|ref|XP_011658121.1| (PREDICTED: uncharacterized protein LOC105435947 isoform X2 [Cucumis sativus])

HSP 1 Score: 139.8 bits (351), Expect = 3.8e-30
Identity = 81/158 (51.27%), Positives = 104/158 (65.82%), Query Frame = 1

Query: 1   MAVYSLANSIHFHIPNTF-SNSTSNNLNATTSLSSCV------PPALSVNRR---GSLCV 60
           MA  SL N I   IP +  S   S NLNA  S  + +      P  LS  RR    SLC+
Sbjct: 1   MAFSSLQNFIR--IPTSLISIPNSKNLNAAASFHTTINNSYLPPQLLSTKRRRPAASLCI 60

Query: 61  KCRGS---AKPENKNHDENDPLETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIPFLQT 120
           +C G    +  ++ + D   PLETI+K+YK+IKKKD+ +L+NVIADQ P I DSIPFL+T
Sbjct: 61  QCHGGGFFSMDDDDSSDNEGPLETINKVYKSIKKKDVAKLANVIADQRPDIVDSIPFLRT 120

Query: 121 NLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWR 146
            LKM  + S+II+GLQ++LVFS+QPTTKDGSMVGI W+
Sbjct: 121 RLKMRKLASHIIKGLQENLVFSIQPTTKDGSMVGIIWK 156

BLAST of Cla014404 vs. NCBI nr
Match: gi|590626789|ref|XP_007026267.1| (Uncharacterized protein TCM_030364 [Theobroma cacao])

HSP 1 Score: 65.5 bits (158), Expect = 9.1e-08
Identity = 40/118 (33.90%), Positives = 63/118 (53.39%), Query Frame = 1

Query: 34  SCVPPALSVNRRGSLCVKCRGSAKPENKNHDEND-PLETIDKLYKTIKKKDIVELSNVIA 93
           S +P +    RRG   V      K  +   +E+   LET+ KLY  IK +++ ELS++I 
Sbjct: 57  SLIPASYPTRRRGLSVVPLDAKTKSSDPGGEEDSGALETVLKLYSAIKNQNVRELSDIID 116

Query: 94  DQHP---HIFDSIPFLQTNLKMWNMVSNIIRGLQDSLVFSVQPTTKDGSMVGIKWRVD 148
           D+     + F S   LQ   ++    +++I+ L D + F VQPT  DG +VGI WR++
Sbjct: 117 DECRCICNFFSSFQPLQGKKQVLEFFASLIKFLGDHIEFVVQPTLHDGMVVGIHWRLE 174

BLAST of Cla014404 vs. NCBI nr
Match: gi|970062098|ref|XP_015057934.1| (PREDICTED: uncharacterized protein LOC107004207 [Solanum pennellii])

HSP 1 Score: 62.8 bits (151), Expect = 5.9e-07
Identity = 38/104 (36.54%), Positives = 63/104 (60.58%), Query Frame = 1

Query: 50  VKCRGSAKPENKNHDENDP--LETIDKLYKTIKKKDIVELSNVIADQHPHIFDSIPFLQT 109
           +KC  S   EN N +E DP  +ET+ KLYK +K K+++ELS++I ++   I +    LQT
Sbjct: 65  LKCNKSDNSENNNPEE-DPKAMETVQKLYKALKNKNLIELSDIIGEECRCISNVASSLQT 124

Query: 110 ---NLKMWNMVSNIIRGL-QDSLVFSVQPTTKDGSMVGIKWRVD 148
                ++ +   +II+ L  D+  F  +PTT DG+ VG+ W+++
Sbjct: 125 FYGKEQVIDFFKSIIKLLGNDNFEFVFKPTTHDGTHVGVAWKLE 167

The following BLAST results are available for this feature:
Match Name        E-value  Identity (%)  Description
A0A0A0KL37_CUCSA  9.1e-31  51.57         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G511720 PE=4 SV=1
A0A061GP35_THECC  6.3e-08  33.90         Uncharacterized protein OS=Theobroma cacao GN=TCM_030364 PE=4 SV=1
K4D635_SOLLC      2.7e-06  35.58         Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
A0A067K560_JATCU  4.5e-06  32.08         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22251 PE=4 SV=1

Match Name                        E-value  Identity (%)  Description
gi|659079093|ref|XP_008440070.1|  7.4e-34  53.75         PREDICTED: uncharacterized protein LOC103484658 [Cucumis melo]
gi|778720180|ref|XP_011658120.1|  1.3e-30  51.57         PREDICTED: uncharacterized protein LOC105435947 isoform X1 [Cucumis sativus]
gi|778720183|ref|XP_011658121.1|  3.8e-30  51.27         PREDICTED: uncharacterized protein LOC105435947 isoform X2 [Cucumis sativus]
gi|590626789|ref|XP_007026267.1|  9.1e-08  33.90         Uncharacterized protein TCM_030364 [Theobroma cacao]
gi|970062098|ref|XP_015057934.1|  5.9e-07  36.54         PREDICTED: uncharacterized protein LOC107004207 [Solanum pennellii]

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU24853      watermelon EST collection version 2.0  transcribed_cluster
WMU51686      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla014404     Cla014404.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU24853      WMU24853     transcribed_cluster
WMU51686      WMU51686     transcribed_cluster