Cla012139 (gene) Watermelon (97103) v1

NameCla012139
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionB3 domain-containing protein At1g05920 (AHRD V1 ***- Y1592_ARATH)
LocationChr4 : 15734543 .. 15735046 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTCGAGAACCTTCCTTCGAATTTCCCCTACCGATTCGCGAAACTAATAAAAGATCCGAGCTCGATTCATCCTCGAATTTTGATGGAGAAGCGACTGCGGAGAGCGGATGTGAGGAAGAGTAAAGATCGACTGTCGATTACGAGGAAGTGGATAAAGGAATCGTTCTTAACGGAGGAGGAAAAGAAGAAGTTGGAATCCGGCGAGGTTATGGGAGTGGCGGTGATGGAGCCGGATGGAGAAACGGTGAGTTATCTGAGATTGAGAAAGCTGCGGAACAAAGGTAAGGAGATGGGGTCGTACGCGCTAACCGGAGGGTGGTATAACACGGTGGCGCGTAACAGTGACGCGCTGATGAGAGGAGAATTTGTTAGGGTTTGGTATTTTCGCGTGAACGGTAGGCTTTGGTTTGGGGTTGAAGTTGTTGGAGAAGTGAAGAGACAATTTGAAATTTTGGAGCAAATGTCTCAGAAATGGTGTCACGAAAATATTTTGGTTCCTTAA

mRNA sequence

ATGTTCGAGAACCTTCCTTCGAATTTCCCCTACCGATTCGCGAAACTAATAAAAGATCCGAGCTCGATTCATCCTCGAATTTTGATGGAGAAGCGACTGCGGAGAGCGGATGTGAGGAAGAGTAAAGATCGACTGTCGATTACGAGGAAGTGGATAAAGGAATCGTTCTTAACGGAGGAGGAAAAGAAGAAGTTGGAATCCGGCGAGGTTATGGGAGTGGCGGTGATGGAGCCGGATGGAGAAACGGTGAGTTATCTGAGATTGAGAAAGCTGCGGAACAAAGGTAAGGAGATGGGGTCGTACGCGCTAACCGGAGGGTGGTATAACACGGTGGCGCGTAACAGTGACGCGCTGATGAGAGGAGAATTTGTTAGGGTTTGGTATTTTCGCGTGAACGGTAGGCTTTGGTTTGGGGTTGAAGTTGTTGGAGAAGTGAAGAGACAATTTGAAATTTTGGAGCAAATGTCTCAGAAATGGTGTCACGAAAATATTTTGGTTCCTTAA

Coding sequence (CDS)

ATGTTCGAGAACCTTCCTTCGAATTTCCCCTACCGATTCGCGAAACTAATAAAAGATCCGAGCTCGATTCATCCTCGAATTTTGATGGAGAAGCGACTGCGGAGAGCGGATGTGAGGAAGAGTAAAGATCGACTGTCGATTACGAGGAAGTGGATAAAGGAATCGTTCTTAACGGAGGAGGAAAAGAAGAAGTTGGAATCCGGCGAGGTTATGGGAGTGGCGGTGATGGAGCCGGATGGAGAAACGGTGAGTTATCTGAGATTGAGAAAGCTGCGGAACAAAGGTAAGGAGATGGGGTCGTACGCGCTAACCGGAGGGTGGTATAACACGGTGGCGCGTAACAGTGACGCGCTGATGAGAGGAGAATTTGTTAGGGTTTGGTATTTTCGCGTGAACGGTAGGCTTTGGTTTGGGGTTGAAGTTGTTGGAGAAGTGAAGAGACAATTTGAAATTTTGGAGCAAATGTCTCAGAAATGGTGTCACGAAAATATTTTGGTTCCTTAA

Protein sequence

MFENLPSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEEKKKLESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVARNSDALMRGEFVRVWYFRVNGRLWFGVEVVGEVKRQFEILEQMSQKWCHENILVP
BLAST of Cla012139 vs. Swiss-Prot
Match: Y1592_ARATH (B3 domain-containing protein At1g05920 OS=Arabidopsis thaliana GN=At1g05920 PE=2 SV=1)

HSP 1 Score: 51.6 bits (122), Expect = 9.6e-06
Identity = 43/152 (28.29%), Postives = 68/152 (44.74%), Query Frame = 1

Query: 2   FENLPSNFPYRFAKLIKDPSSIH----PRILMEKRLRRADVRKSKDRLSI-TRKWIKESF 61
           +E +P   P     L++  S ++    P++++EK L   DV   ++RLSI     I+  F
Sbjct: 153 YEFVPPKHPQEPQWLLQVMSRMNGAGDPKLIIEKNLDSNDVDPRQNRLSIPINTVIQNDF 212

Query: 62  LTEEEKKKLESGEV-----MGVAVMEPDGET----VSYLRLRKLRNKGKEMGSYALTGGW 121
           LT +E + ++  E+     MGVA    D  T    + + +     + G    S+ L G W
Sbjct: 213 LTLDESRLIDEDEITNEGNMGVAAFLVDQRTKKWNMGFKQWFMTTDSGSSYWSFVLRGEW 272

Query: 122 YNTVARNSDALMRGEFVRVWYFRVNGRLWFGV 140
            N V  N   L  G+ + +W FR N  L F +
Sbjct: 273 SNVVETN--GLKEGDKISLWSFRSNDILCFAL 302

BLAST of Cla012139 vs. TrEMBL
Match: B9S7N8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0610210 PE=4 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 3.7e-12
Identity = 45/137 (32.85%), Postives = 75/137 (54.74%), Query Frame = 1

Query: 7   SNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKE-SFLTEEEKKKL 66
           +N PY   + I+     + +++M K+L + D+    +RLS+    IK+ +FL EEE++KL
Sbjct: 128 TNIPYELRRKIETKGGTNVKLVMTKKLFKTDLSSGHNRLSMPLNQIKDHTFLQEEERQKL 187

Query: 67  ESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVARNSDA------LM 126
           E  + + V  MEP G   S ++++K   + ++  SYAL  GW + VARN           
Sbjct: 188 EKYDELSVQFMEPCG-VDSVVKVKKWAYRNQKGSSYALRTGWNDVVARNQTTEEEIKEFK 247

Query: 127 RGEFVRVWYFRVNGRLW 137
             + +++W FRV G LW
Sbjct: 248 ENDMIQIWSFRVEGVLW 263

BLAST of Cla012139 vs. TrEMBL
Match: A0A059AN92_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00811 PE=4 SV=1)

HSP 1 Score: 77.8 bits (190), Expect = 1.4e-11
Identity = 50/147 (34.01%), Postives = 80/147 (54.42%), Query Frame = 1

Query: 6   PSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEEKKKL 65
           PS  P ++   I++       +++EK L   D+ + + RLSI  K +++ FL+EEE + L
Sbjct: 162 PSELPTKYMDKIREMHGQDVTLVVEKTLTATDMSRGQSRLSIPNKQMRQGFLSEEEIRIL 221

Query: 66  ESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVAR--NSDALMRGEF 125
              E + V+++EP  E    LRL+K     K+  SY LT  W N VA     + LM+   
Sbjct: 222 NRKEGIKVSLIEPCLEVSHGLRLKKWNYTSKKF-SYMLTERW-NAVAHPYTRNELMKDVV 281

Query: 126 VRVWYFRVNGRLWFGVEVVGEVKRQFE 151
           +++W FRV+G LWF +  V    RQ++
Sbjct: 282 IQLWSFRVDGSLWFCLRKVEAPMRQWK 306

BLAST of Cla012139 vs. TrEMBL
Match: A0A059AMI0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00861 PE=4 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 5.3e-11
Identity = 49/138 (35.51%), Postives = 79/138 (57.25%), Query Frame = 1

Query: 2   FENLPSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEE 61
           F   P   P R+ + I++       +++EK L   D+ + + RLSI  K I++ FL+EEE
Sbjct: 94  FPRNPRELPRRYMEKIQEMQGRDMTLVVEKTLTATDMSRGQSRLSIPNKQIRQGFLSEEE 153

Query: 62  KKKLESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVAR--NSDALM 121
            + L+  E + V+++EP  E VS L+L++   + K + SY LT  W N VA     + LM
Sbjct: 154 IRMLDRKEGIKVSLIEPCLE-VSQLQLKRWNYQSK-IFSYVLTERW-NGVAHPYERNELM 213

Query: 122 RGEFVRVWYFRVNGRLWF 138
           +   +++W FRV+G LWF
Sbjct: 214 KDVVIQLWSFRVDGSLWF 228

BLAST of Cla012139 vs. TrEMBL
Match: A0A059ALZ8_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00818 PE=4 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 1.7e-09
Identity = 47/144 (32.64%), Postives = 75/144 (52.08%), Query Frame = 1

Query: 2   FENLPSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEE 61
           F   P   P  +   I++       +++EK L   D+ + + RLS   K I++SFL EEE
Sbjct: 114 FPRNPRKLPRWYMDKIQEMQGRDMTLVVEKTLTATDMSRGQSRLSTPNKQIRQSFLHEEE 173

Query: 62  KKKLESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTV---ARNSDAL 121
            + L+  E + V+++EP  E    L+L++   K +   SY LT  W       ARN   L
Sbjct: 174 IRILDRKEGIKVSLIEPCLEVSHGLQLKRWNYKSRNF-SYVLTERWNGVAHPYARNE--L 233

Query: 122 MRGEFVRVWYFRVNGRLWFGVEVV 143
           M+   +++W FRV+G LWF ++ V
Sbjct: 234 MKDVVIQLWSFRVDGSLWFCLKKV 254

BLAST of Cla012139 vs. TrEMBL
Match: A0A0D2TW38_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G062600 PE=4 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 8.5e-09
Identity = 47/145 (32.41%), Postives = 76/145 (52.41%), Query Frame = 1

Query: 7   SNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEEKKKLE 66
           S+ P  F   I+D      +I+++K L+  D+   ++R SI+ K IK +FL E+E++ L 
Sbjct: 7   SDMPQSFNDCIQDLGGSEIKIIIQKFLQVTDLMSQQNRFSISLKQIKSTFLNEDEERMLN 66

Query: 67  SGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVARNSDALMRGEFVRV 126
           +   M VA +EP   +VS + L K  N G  + SY + G +   +  N D+L     V++
Sbjct: 67  AKRQMAVAFVEP-CLSVSTVNLAKW-NIGSSL-SYIINGDYSKVLKNNRDSLKPNAVVQI 126

Query: 127 WYFRVNGRLWFG---VEVVGEVKRQ 149
           W FRV   L  G   ++V+ E   Q
Sbjct: 127 WLFRVQPNLQLGFALIKVIDEENSQ 148

BLAST of Cla012139 vs. NCBI nr
Match: gi|223538811|gb|EEF40411.1| (hypothetical protein RCOM_0610210 [Ricinus communis])

HSP 1 Score: 79.7 bits (195), Expect = 5.3e-12
Identity = 45/137 (32.85%), Postives = 75/137 (54.74%), Query Frame = 1

Query: 7   SNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKE-SFLTEEEKKKL 66
           +N PY   + I+     + +++M K+L + D+    +RLS+    IK+ +FL EEE++KL
Sbjct: 128 TNIPYELRRKIETKGGTNVKLVMTKKLFKTDLSSGHNRLSMPLNQIKDHTFLQEEERQKL 187

Query: 67  ESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVARNSDA------LM 126
           E  + + V  MEP G   S ++++K   + ++  SYAL  GW + VARN           
Sbjct: 188 EKYDELSVQFMEPCG-VDSVVKVKKWAYRNQKGSSYALRTGWNDVVARNQTTEEEIKEFK 247

Query: 127 RGEFVRVWYFRVNGRLW 137
             + +++W FRV G LW
Sbjct: 248 ENDMIQIWSFRVEGVLW 263

BLAST of Cla012139 vs. NCBI nr
Match: gi|629088612|gb|KCW54865.1| (hypothetical protein EUGRSUZ_I00811 [Eucalyptus grandis])

HSP 1 Score: 77.8 bits (190), Expect = 2.0e-11
Identity = 50/147 (34.01%), Postives = 80/147 (54.42%), Query Frame = 1

Query: 6   PSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEEKKKL 65
           PS  P ++   I++       +++EK L   D+ + + RLSI  K +++ FL+EEE + L
Sbjct: 162 PSELPTKYMDKIREMHGQDVTLVVEKTLTATDMSRGQSRLSIPNKQMRQGFLSEEEIRIL 221

Query: 66  ESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVAR--NSDALMRGEF 125
              E + V+++EP  E    LRL+K     K+  SY LT  W N VA     + LM+   
Sbjct: 222 NRKEGIKVSLIEPCLEVSHGLRLKKWNYTSKKF-SYMLTERW-NAVAHPYTRNELMKDVV 281

Query: 126 VRVWYFRVNGRLWFGVEVVGEVKRQFE 151
           +++W FRV+G LWF +  V    RQ++
Sbjct: 282 IQLWSFRVDGSLWFCLRKVEAPMRQWK 306

BLAST of Cla012139 vs. NCBI nr
Match: gi|629088638|gb|KCW54891.1| (hypothetical protein EUGRSUZ_I00861 [Eucalyptus grandis])

HSP 1 Score: 75.9 bits (185), Expect = 7.6e-11
Identity = 49/138 (35.51%), Postives = 79/138 (57.25%), Query Frame = 1

Query: 2   FENLPSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEE 61
           F   P   P R+ + I++       +++EK L   D+ + + RLSI  K I++ FL+EEE
Sbjct: 94  FPRNPRELPRRYMEKIQEMQGRDMTLVVEKTLTATDMSRGQSRLSIPNKQIRQGFLSEEE 153

Query: 62  KKKLESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVAR--NSDALM 121
            + L+  E + V+++EP  E VS L+L++   + K + SY LT  W N VA     + LM
Sbjct: 154 IRMLDRKEGIKVSLIEPCLE-VSQLQLKRWNYQSK-IFSYVLTERW-NGVAHPYERNELM 213

Query: 122 RGEFVRVWYFRVNGRLWF 138
           +   +++W FRV+G LWF
Sbjct: 214 KDVVIQLWSFRVDGSLWF 228

BLAST of Cla012139 vs. NCBI nr
Match: gi|702513559|ref|XP_010041305.1| (PREDICTED: uncharacterized protein LOC104430250 [Eucalyptus grandis])

HSP 1 Score: 75.5 bits (184), Expect = 9.9e-11
Identity = 51/147 (34.69%), Postives = 77/147 (52.38%), Query Frame = 1

Query: 6   PSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEEKKKL 65
           PS  P ++   I++       +++EK L   D+   + RLSI  K I++ FL+EEE + L
Sbjct: 121 PSELPTKYMDKIREMHGQDVTLVVEKTLTATDMSCGQSRLSIPNKQIRQGFLSEEEIRIL 180

Query: 66  ESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTVAR--NSDALMRGEF 125
              E + V ++EP  E    LRL+K     K   SY LT  W N VA     + LM+   
Sbjct: 181 NRKEGIKVFLIEPCLEVSHGLRLKKWNYTSKNF-SYMLTERW-NAVAHPYTRNELMKDVV 240

Query: 126 VRVWYFRVNGRLWFGVEVVGEVKRQFE 151
           +++W FRV+G LWF +  V    RQ++
Sbjct: 241 IQLWSFRVDGSLWFCLRKVEAPMRQWK 265

BLAST of Cla012139 vs. NCBI nr
Match: gi|702470532|ref|XP_010030556.1| (PREDICTED: uncharacterized protein LOC104420403 [Eucalyptus grandis])

HSP 1 Score: 75.1 bits (183), Expect = 1.3e-10
Identity = 47/140 (33.57%), Postives = 76/140 (54.29%), Query Frame = 1

Query: 6   PSNFPYRFAKLIKDPSSIHPRILMEKRLRRADVRKSKDRLSITRKWIKESFLTEEEKKKL 65
           PS  P ++   I++       +++EK L   D+ + + RLSI    I++ FL+EEE + L
Sbjct: 157 PSKLPTKYMDKIQEMQGRDMTLVVEKTLTATDMSRGQSRLSIPNNQIRQGFLSEEEIRIL 216

Query: 66  ESGEVMGVAVMEPDGETVSYLRLRKLRNKGKEMGSYALTGGWYNTV---ARNSDALMRGE 125
           +  E + V+++EP  E    L+L++   K K   SY LT  W       ARN   LM+  
Sbjct: 217 DRKEGIKVSLVEPCLEVSHRLQLKRWNYKSKNF-SYVLTERWNGVAHPYARNE--LMKDV 276

Query: 126 FVRVWYFRVNGRLWFGVEVV 143
            +++W FRV+G LWF ++ V
Sbjct: 277 VIQLWSFRVDGSLWFCLKKV 293

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y1592_ARATH9.6e-0628.29B3 domain-containing protein At1g05920 OS=Arabidopsis thaliana GN=At1g05920 PE=2... [more]
Match NameE-valueIdentityDescription
B9S7N8_RICCO3.7e-1232.85Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0610210 PE=4 SV=1[more]
A0A059AN92_EUCGR1.4e-1134.01Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00811 PE=4 SV=1[more]
A0A059AMI0_EUCGR5.3e-1135.51Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00861 PE=4 SV=1[more]
A0A059ALZ8_EUCGR1.7e-0932.64Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00818 PE=4 SV=1[more]
A0A0D2TW38_GOSRA8.5e-0932.41Uncharacterized protein OS=Gossypium raimondii GN=B456_013G062600 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|223538811|gb|EEF40411.1|5.3e-1232.85hypothetical protein RCOM_0610210 [Ricinus communis][more]
gi|629088612|gb|KCW54865.1|2.0e-1134.01hypothetical protein EUGRSUZ_I00811 [Eucalyptus grandis][more]
gi|629088638|gb|KCW54891.1|7.6e-1135.51hypothetical protein EUGRSUZ_I00861 [Eucalyptus grandis][more]
gi|702513559|ref|XP_010041305.1|9.9e-1134.69PREDICTED: uncharacterized protein LOC104430250 [Eucalyptus grandis][more]
gi|702470532|ref|XP_010030556.1|1.3e-1033.57PREDICTED: uncharacterized protein LOC104420403 [Eucalyptus grandis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005508DUF313
IPR015300DNA-bd_pseudobarrel_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003674 molecular_function
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla012139Cla012139.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005508B3 domain-containing proteinPFAMPF03754DUF313coord: 22..111
score: 4.
IPR015300DNA-binding pseudobarrel domainGENE3DG3DSA:2.40.330.10coord: 21..137
score: 3.
NoneNo IPR availablePANTHERPTHR31541FAMILY NOT NAMEDcoord: 1..159
score: 5.0