CmUC11G209680 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC11G209680
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Transcription factor bHLH61-like protein
Location: CmU531Chr11: 6132421 .. 6133534 (-)
RNA-Seq Expression: CmUC11G209680
Synteny: CmUC11G209680
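
The Location field above packs the chromosome, the start and end coordinates, and the strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page), the span can be parsed and measured as follows. The function name `parse_location` is hypothetical and is not part of any database API.

```python
import re

# Hypothetical helper: splits a location string of the form
# "SEQID: START .. END (STRAND)" into its parts.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    """Return (seqid, start, end, strand) for a location string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("CmU531Chr11: 6132421 .. 6133534 (-)")

# Assumption: 1-based inclusive coordinates, so the span covers end - start + 1 bases.
span_length = end - start + 1
print(seqid, start, end, strand, span_length)
```

The `(-)` strand marker indicates that the gene is annotated on the reverse strand of the chromosome.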
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCAGTAAATTCTCTTCTCCTAATTTCTTAATTTTCTTTTATATCTTTTTTTTAAAAAATTATTTTTAATTTTTAATAATCCCCTCTACCAAAATGTATTTTAGAAGAGAAATTATGGGTTTTATTTTAGGAAGAACCAATGATTTTTCTAATTTGTGTGTGTGTGTGTAGCAGCTAAACAAGGCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATCCTCCCATGGTATAATTTTAATTTCAATCAATAATAACAACAACAACAATAATAATAATATTCTTCCTTCCCAATTATTTCTAAATTAATCAACAAATTACTGACTCATTTATTTGTGTGTGTTATAGCAGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAGTAAGATATTCCAAATACCCTTCTTTCTATATATATATATTATCATCATCATTATTCATCAATTTAAATATCTTTTTATTAATTAATTCTCTCTCTCACAACCTTATATTACTTATCATCATTTAATTCTTTCATGCATAATTGTGTGTGTGTGTGTTTTTTTTTTTTTTTTTGTGGCTTCTTAAAGTCAAATCAATCATTAATTTTAACAAGGATATTTATCGAAAGTAATTGAATTTTCCATCTAATATAATCATAAAACTTCAAATTAAATGTTGGGTTTTATTTATATTATTAAAACATTAAATTTAATTAAATGGGGTTTTTTTTTTTCGGTAGATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGCAGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAA

mRNA sequence

ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATCCTCCCATGCAGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGCAGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAA

Coding sequence (CDS)

ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATCCTCCCATGCAGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGCAGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAA

Protein sequence

MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
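
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation using the standard genetic code, written without external libraries; the function name `translate` and the two short test fragments (the first 24 and the last 12 nucleotides of the CDS) are chosen here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First eight codons of the CDS listed above.
print(translate("ATGGTTTCTAGAGAGCACAAGAAG"))
# Last four codons of the CDS, ending in the stop codon TAA.
print(translate("GAACAAGATTAA"))
```

Applied to the full CDS, the same function should yield the listed protein followed by a single `*` for the terminal stop codon.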
Homology
BLAST of CmUC11G209680 vs. NCBI nr
Match: XP_038884496.1 (uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida])

HSP 1 Score: 284.3 bits (726), Expect = 6.8e-73
Identity = 152/163 (93.25%), Positives = 158/163 (96.93%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+N
Sbjct: 1   MVSREHKKAVLHEKLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQN 60

Query: 61  SIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC 120
           SIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC
Sbjct: 61  SIHPNPLSHQYSPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC 120

Query: 121 TDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           TD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Sbjct: 121 TDTFQLQAIAEIEEEGEEAIDAQAVKEAVVQAIKSWGQSGEQD 163
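
The Identity figure reported for each HSP is the number of alignment columns where query and subject carry the same residue, divided by the alignment length (gap columns included). A minimal sketch of that calculation follows, using the last block of the alignment above as input; the function name `identity_counts` is hypothetical and the block is only one part of the full 163-column HSP.

```python
def identity_counts(query, sbjct):
    """Return (identical columns, alignment length) for two aligned strings.

    Both strings must have the same length; '-' marks a gap and never
    counts as a match.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)

def percent(numerator, denominator):
    """Percentage rounded to two decimals, as in the BLAST report."""
    return round(100.0 * numerator / denominator, 2)

# Last alignment block (query positions 121..163) of the first HSP.
query_block = "TDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD"
sbjct_block = "TDTFQLQAIAEIEEEGEEAIDAQAVKEAVVQAIKSWGQSGEQD"

identical, length = identity_counts(query_block, sbjct_block)
print(identical, length, percent(identical, length))

# The whole-HSP figure quoted in the report: 152 identical of 163 columns.
print(percent(152, 163))
```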

BLAST of CmUC11G209680 vs. NCBI nr
Match: XP_038884498.1 (uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida])

HSP 1 Score: 279.6 bits (714), Expect = 1.7e-71
Identity = 149/162 (91.98%), Positives = 154/162 (95.06%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+N
Sbjct: 1   MVSREHKKAVLHEKLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQN 60

Query: 61  SIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120
           SIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT
Sbjct: 61  SIHPNPLSHQYSPMVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120

Query: 121 DSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           D+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Sbjct: 121 DTFQLQAIAEIEEEGEEAIDAQAVKEAVVQAIKSWGQSGEQD 162

BLAST of CmUC11G209680 vs. NCBI nr
Match: XP_038884497.1 (uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida])

HSP 1 Score: 278.1 bits (710), Expect = 4.8e-71
Identity = 151/163 (92.64%), Positives = 157/163 (96.32%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEELKQKVERLNQDISTV+N
Sbjct: 1   MVSREHKKAVLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDISTVQN 60

Query: 61  SIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC 120
           SIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC
Sbjct: 61  SIHPNPLSHQYSPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC 120

Query: 121 TDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           TD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Sbjct: 121 TDTFQLQAIAEIEEEGEEAIDAQAVKEAVVQAIKSWGQSGEQD 162

BLAST of CmUC11G209680 vs. NCBI nr
Match: XP_038884499.1 (uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida])

HSP 1 Score: 273.5 bits (698), Expect = 1.2e-69
Identity = 148/162 (91.36%), Positives = 153/162 (94.44%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEELKQKVERLNQDISTV+N
Sbjct: 1   MVSREHKKAVLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDISTVQN 60

Query: 61  SIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120
           SIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT
Sbjct: 61  SIHPNPLSHQYSPMVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120

Query: 121 DSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           D+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Sbjct: 121 DTFQLQAIAEIEEEGEEAIDAQAVKEAVVQAIKSWGQSGEQD 161

BLAST of CmUC11G209680 vs. NCBI nr
Match: XP_011656425.1 (uncharacterized protein LOC101218918 isoform X1 [Cucumis sativus] >KGN45818.1 hypothetical protein Csa_005035 [Cucumis sativus])

HSP 1 Score: 257.7 bits (657), Expect = 6.8e-65
Identity = 146/164 (89.02%), Positives = 154/164 (93.90%), Query Frame = 0

Query: 1   MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVE 60
           MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEELKQKVERLNQDISTV+
Sbjct: 1   MVSREHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQ 60

Query: 61  NSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVS 120
           NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVS
Sbjct: 61  NS---NPLSHQYSPMQVTVERVVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVS 120

Query: 121 CTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           CT +FQLQAIGEIEEEGEE IDAQ+VKEAVVQAIKSWSQ+GEQD
Sbjct: 121 CTHTFQLQAIGEIEEEGEEGIDAQTVKEAVVQAIKSWSQNGEQD 160

BLAST of CmUC11G209680 vs. ExPASy Swiss-Prot
Match: Q9LXA9 (Transcription factor bHLH61 OS=Arabidopsis thaliana OX=3702 GN=BHLH61 PE=2 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 3.9e-07
Identity = 39/154 (25.32%), Positives = 84/154 (54.55%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           +++   ++  L+++L LLRSI    +++++ SI+ DA  Y++EL  K+ +L +D   + +
Sbjct: 153 LMAERRRRKRLNDRLSLLRSIVPKITKMDRTSILGDAIDYMKELLDKINKLQEDEQELGS 212

Query: 61  SIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QGLLVSILEVFEELGLNVIEAR 120
           + H +  I++   ++ +++  V    +N   +  C    GL+VS +   E LGL + +  
Sbjct: 213 NSHLSTLITNESMVRNSLKFEVDQREVNTHIDICCPTKPGLVVSTVSTLETLGLEIEQCV 272

Query: 121 VSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ 151
           +SC   F LQA      E    + +++ K+A+++
Sbjct: 273 ISCFSDFSLQASCFEVGEQRYMVTSEATKQALIR 306

BLAST of CmUC11G209680 vs. ExPASy Swiss-Prot
Match: Q9LSE2 (Transcription factor ICE1 OS=Arabidopsis thaliana OX=3702 GN=SCRM PE=1 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 6.7e-07
Identity = 48/172 (27.91%), Positives = 88/172 (51.16%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVE 60
           +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ ST  
Sbjct: 309 LMAERRRRKKLNDRLYMLRSVVPKISKMDRASILGDAIDYLKELLQRINDLHNELESTPP 368

Query: 61  NSIHPNPISHHP-------------------------PMQVTVE-RLVKGFSINV--FSE 120
            S+ P   S HP                           Q  VE RL +G ++N+  F  
Sbjct: 369 GSLPPTSSSFHPLTPTPQTLSCRVKEELCPSSLPSPKGQQARVEVRLREGRAVNIHMFCG 428

Query: 121 KSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ 143
           +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Sbjct: 429 RR-PGLLLATMKALDNLGLDVQQAVISCFNGFALDVFRAEQCQEGQEILPDQ 479

BLAST of CmUC11G209680 vs. ExPASy Swiss-Prot
Match: Q9LSL1 (Transcription factor bHLH93 OS=Arabidopsis thaliana OX=3702 GN=BHLH93 PE=1 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 1.9e-06
Identity = 41/163 (25.15%), Positives = 81/163 (49.69%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERL---NQDIST 60
           +++   ++  L+++L +LRSI    S++++ SI+ DA  Y++EL  K+ +L    Q++  
Sbjct: 180 LMAERRRRKRLNDRLSMLRSIVPKISKMDRTSILGDAIDYMKELLDKINKLQDEEQELGN 239

Query: 61  VENSIHP------------NPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVF 120
             NS H              P+  + P +  ++R  +   +++       GLL+S +   
Sbjct: 240 SNNSHHSKLFGDLKDLNANEPLVRNSP-KFEIDRRDEDTRVDICCSPK-PGLLLSTVNTL 299

Query: 121 EELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAV 149
           E LGL + +  +SC   F LQA      E  + I ++ +K+A+
Sbjct: 300 ETLGLEIEQCVISCFSDFSLQASCSEGAEQRDFITSEDIKQAL 340

BLAST of CmUC11G209680 vs. ExPASy Swiss-Prot
Match: Q9LPW3 (Transcription factor SCREAM2 OS=Arabidopsis thaliana OX=3702 GN=SCRM2 PE=1 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 1.3e-05
Identity = 40/175 (22.86%), Positives = 88/175 (50.29%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTV-- 60
           +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ +   
Sbjct: 269 LMAERRRRKKLNDRLYMLRSVVPKISKMDRASILGDAIDYLKELLQRINDLHTELESTPP 328

Query: 61  -ENSIH-----------------------PNPISHHPPMQVTVERLVKGFSINVFSEKSC 120
             +S+H                       P+P    P ++V + R  K  +I++F  +  
Sbjct: 329 SSSSLHPLTPTPQTLSYRVKEELCPSSSLPSPKGQQPRVEVRL-REGKAVNIHMFCGRR- 388

Query: 121 QGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVV 150
            GLL+S +   + LGL+V +A +SC + F L      + + +  +  + +K  ++
Sbjct: 389 PGLLLSTMRALDNLGLDVQQAVISCFNGFALDVFRAEQCQEDHDVLPEQIKAVLL 441

BLAST of CmUC11G209680 vs. ExPASy Swiss-Prot
Match: Q10S44 (Transcription factor BHLH3 OS=Oryza sativa subsp. japonica OX=39947 GN=BHLH3 PE=1 SV=1)

HSP 1 Score: 46.6 bits (109), Expect = 3.1e-04
Identity = 38/163 (23.31%), Positives = 80/163 (49.08%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVE- 60
           +++   ++  L+++L +LRSI    S++++ SI+ D   Y++EL ++++ L ++I     
Sbjct: 184 LMAERRRRKRLNDRLSMLRSIVPKISKMDRTSILGDTIDYVKELTERIKTLEEEIGVTPE 243

Query: 61  -----NSIHPNPISHHPPMQV---TVERLVKGFSINVFSEKSC---QGLLVSILEVFEEL 120
                N++  +   ++  M V   T   +    S N   E  C    G+L+S +   E L
Sbjct: 244 ELDLLNTMKDSSSGNNNEMLVRNSTKFDVENRGSGNTRIEICCPANPGVLLSTVSALEVL 303

Query: 121 GLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQA 152
           GL + +  VSC   F +QA    E+   + +    +K+ + ++
Sbjct: 304 GLEIEQCVVSCFSDFGMQASCLQEDGKRQVVSTDEIKQTLFRS 346

BLAST of CmUC11G209680 vs. ExPASy TrEMBL
Match: A0A0A0K8N7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G013900 PE=4 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 3.3e-65
Identity = 146/164 (89.02%), Positives = 154/164 (93.90%), Query Frame = 0

Query: 1   MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVE 60
           MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEELKQKVERLNQDISTV+
Sbjct: 1   MVSREHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQ 60

Query: 61  NSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVS 120
           NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVS
Sbjct: 61  NS---NPLSHQYSPMQVTVERVVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVS 120

Query: 121 CTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           CT +FQLQAIGEIEEEGEE IDAQ+VKEAVVQAIKSWSQ+GEQD
Sbjct: 121 CTHTFQLQAIGEIEEEGEEGIDAQTVKEAVVQAIKSWSQNGEQD 160

BLAST of CmUC11G209680 vs. ExPASy TrEMBL
Match: A0A6J1JV46 (uncharacterized protein LOC111487778 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487778 PE=4 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 4.3e-65
Identity = 140/162 (86.42%), Positives = 150/162 (92.59%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEELKQKVERLNQDI+TV+N
Sbjct: 1   MVSREHKKAALHEKLQLLRSITNSHA-LNKGSIIVDASKYIEELKQKVERLNQDIATVQN 60

Query: 61  SIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120
           SIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVSILE FEELGLNV+EARVSCT
Sbjct: 61  SIHPN-----HPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCT 120

Query: 121 DSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           D+FQLQA  EIEE+GEEA+DAQ+VKEAVV+AIKSWSQ+GEQD
Sbjct: 121 DTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD 156

BLAST of CmUC11G209680 vs. ExPASy TrEMBL
Match: A0A6J1HIH6 (uncharacterized protein LOC111464709 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464709 PE=4 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 4.3e-65
Identity = 140/162 (86.42%), Positives = 150/162 (92.59%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEELKQKVERLNQDI+TV+N
Sbjct: 1   MVSREHKKAALHEKLQLLRSITNSHA-LNKGSIIVDASKYIEELKQKVERLNQDIATVQN 60

Query: 61  SIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120
           SIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVSILE FEELGLNV+EARVSCT
Sbjct: 61  SIHPN-----HPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCT 120

Query: 121 DSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           D+FQLQA  EIEE+GEEA+DAQ+VKEAVV+AIKSWSQ+GEQD
Sbjct: 121 DTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD 156

BLAST of CmUC11G209680 vs. ExPASy TrEMBL
Match: A0A6J1K1N3 (uncharacterized protein LOC111489817 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489817 PE=4 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 2.8e-64
Identity = 140/162 (86.42%), Positives = 144/162 (88.89%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREH  AALH  LQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+ 
Sbjct: 1   MVSREHNNAALHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQT 60

Query: 61  SIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120
           SIH        PMQVTVE L KGFSINVFSEKSCQGLLVSILE FEELGLNV+EARVSCT
Sbjct: 61  SIH--------PMQVTVESLAKGFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCT 120

Query: 121 DSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           DSFQLQAI EIEEEGEEAIDAQ+VKEAVVQAIK WSQSGEQD
Sbjct: 121 DSFQLQAIAEIEEEGEEAIDAQAVKEAVVQAIKIWSQSGEQD 154

BLAST of CmUC11G209680 vs. ExPASy TrEMBL
Match: A0A6J1FG89 (uncharacterized protein LOC111445214 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445214 PE=4 SV=1)

HSP 1 Score: 253.4 bits (646), Expect = 6.2e-64
Identity = 139/162 (85.80%), Positives = 144/162 (88.89%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSREH  A+LH  LQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+ 
Sbjct: 1   MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQT 60

Query: 61  SIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120
           SIH        PMQVTVE L KGFSINVFSEKSCQGLLVSILE FEELGLNV+EARVSCT
Sbjct: 61  SIH--------PMQVTVESLAKGFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCT 120

Query: 121 DSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD 163
           DSFQLQAI EIEE+GEEAIDAQSVKEAVVQAIK WSQSGEQD
Sbjct: 121 DSFQLQAIAEIEEQGEEAIDAQSVKEAVVQAIKIWSQSGEQD 154

BLAST of CmUC11G209680 vs. TAIR 10
Match: AT2G40435.1 (BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1); Has 289 Blast hits to 289 proteins in 30 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 289; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 162.2 bits (409), Expect = 3.6e-40
Identity = 94/157 (59.87%), Positives = 118/157 (75.16%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           MVSRE K+ +L EK QLLRSITNSH++ N  SII+DASKYI++LKQKVER NQD +  ++
Sbjct: 1   MVSREQKRGSLQEKFQLLRSITNSHAE-NDTSIIMDASKYIQKLKQKVERFNQDPTAEQS 60

Query: 61  SIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCT 120
           S    P     PM VTVE L KGF INVFS K+  G+LVS+LE FE++GLNV+EAR SCT
Sbjct: 61  S--SEPTDPKTPM-VTVETLDKGFMINVFSGKNQPGMLVSVLEAFEDIGLNVLEARASCT 120

Query: 121 DSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQ 158
           DSF L A+G +E E  E +DA++VK+AV  AI+SW +
Sbjct: 121 DSFSLHAMG-LENEDGENMDAEAVKQAVTDAIRSWGE 152

BLAST of CmUC11G209680 vs. TAIR 10
Match: AT3G56220.1 (transcription regulators )

HSP 1 Score: 149.4 bits (376), Expect = 2.4e-36
Identity = 88/159 (55.35%), Positives = 117/159 (73.58%), Query Frame = 0

Query: 1   MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVE 60
           MVSREHK+ ++L EK  LLRSIT+SH++ ++ SIIVDASKYI++LKQKVE++N + +T E
Sbjct: 1   MVSREHKRGSSLREKFHLLRSITDSHAE-SETSIIVDASKYIKKLKQKVEKIN-NATTSE 60

Query: 61  NSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC 120
            S      S  P   VTVE L KGF I V S K+  G+LV +LE FE+LGL+V+EARVSC
Sbjct: 61  QSFRE---SSDPNPMVTVETLEKGFMIKVMSRKNEAGMLVCVLETFEDLGLDVVEARVSC 120

Query: 121 TDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQS 159
           TD+F L AIG    +  + IDA++VK+AV +AI++WS S
Sbjct: 121 TDTFSLHAIGSSNNDDGDCIDAEAVKQAVAEAIRTWSDS 154

BLAST of CmUC11G209680 vs. TAIR 10
Match: AT1G29270.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G40435.1); Has 98 Blast hits to 98 proteins in 15 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 68.6 bits (166), Expect = 5.4e-12
Identity = 45/129 (34.88%), Positives = 77/129 (59.69%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIV-DASKYIEELKQKVERLNQDISTVE 60
           MV+ E KK A   K   L+++T+    +++ S+++ +A  YI  LK ++E L ++   ++
Sbjct: 1   MVASEQKKRASQGKPHFLKNLTHFKFSIHEQSMVIREALLYIAMLKLEIEALQREYEDLK 60

Query: 61  NSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSC 120
            +      S H   +V VE++ + F + + S +  +  LV+ILE FEE+GLNV +AR SC
Sbjct: 61  IT---KKESLHQFQEVKVEKIGEMFQVKIKSPRG-ENNLVNILEAFEEMGLNVAQARASC 120

Query: 121 TDSFQLQAI 129
            DSF ++AI
Sbjct: 121 LDSFAMEAI 125

BLAST of CmUC11G209680 vs. TAIR 10
Match: AT5G10570.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 56.2 bits (134), Expect = 2.8e-08
Identity = 39/154 (25.32%), Positives = 84/154 (54.55%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN 60
           +++   ++  L+++L LLRSI    +++++ SI+ DA  Y++EL  K+ +L +D   + +
Sbjct: 153 LMAERRRRKRLNDRLSLLRSIVPKITKMDRTSILGDAIDYMKELLDKINKLQEDEQELGS 212

Query: 61  SIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QGLLVSILEVFEELGLNVIEAR 120
           + H +  I++   ++ +++  V    +N   +  C    GL+VS +   E LGL + +  
Sbjct: 213 NSHLSTLITNESMVRNSLKFEVDQREVNTHIDICCPTKPGLVVSTVSTLETLGLEIEQCV 272

Query: 121 VSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ 151
           +SC   F LQA      E    + +++ K+A+++
Sbjct: 273 ISCFSDFSLQASCFEVGEQRYMVTSEATKQALIR 306

BLAST of CmUC11G209680 vs. TAIR 10
Match: AT3G26744.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 55.5 bits (132), Expect = 4.8e-08
Identity = 48/172 (27.91%), Positives = 88/172 (51.16%), Query Frame = 0

Query: 1   MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVE 60
           +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ ST  
Sbjct: 309 LMAERRRRKKLNDRLYMLRSVVPKISKMDRASILGDAIDYLKELLQRINDLHNELESTPP 368

Query: 61  NSIHPNPISHHP-------------------------PMQVTVE-RLVKGFSINV--FSE 120
            S+ P   S HP                           Q  VE RL +G ++N+  F  
Sbjct: 369 GSLPPTSSSFHPLTPTPQTLSCRVKEELCPSSLPSPKGQQARVEVRLREGRAVNIHMFCG 428

Query: 121 KSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ 143
           +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Sbjct: 429 RR-PGLLLATMKALDNLGLDVQQAVISCFNGFALDVFRAEQCQEGQEILPDQ 479

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity  Description
XP_038884496.1  6.8e-73  93.25     uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida]
XP_038884498.1  1.7e-71  91.98     uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]
XP_038884497.1  4.8e-71  92.64     uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]
XP_038884499.1  1.2e-69  91.36     uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]
XP_011656425.1  6.8e-65  89.02     uncharacterized protein LOC101218918 isoform X1 [Cucumis sativus] >KGN45818.1 hy...

ExPASy Swiss-Prot
Match Name  E-value  Identity  Description
Q9LXA9      3.9e-07  25.32     Transcription factor bHLH61 OS=Arabidopsis thaliana OX=3702 GN=BHLH61 PE=2 SV=1
Q9LSE2      6.7e-07  27.91     Transcription factor ICE1 OS=Arabidopsis thaliana OX=3702 GN=SCRM PE=1 SV=1
Q9LSL1      1.9e-06  25.15     Transcription factor bHLH93 OS=Arabidopsis thaliana OX=3702 GN=BHLH93 PE=1 SV=1
Q9LPW3      1.3e-05  22.86     Transcription factor SCREAM2 OS=Arabidopsis thaliana OX=3702 GN=SCRM2 PE=1 SV=1
Q10S44      3.1e-04  23.31     Transcription factor BHLH3 OS=Oryza sativa subsp. japonica OX=39947 GN=BHLH3 PE=...

ExPASy TrEMBL
Match Name  E-value  Identity  Description
A0A0A0K8N7  3.3e-65  89.02     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G013900 PE=4 SV=1
A0A6J1JV46  4.3e-65  86.42     uncharacterized protein LOC111487778 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HIH6  4.3e-65  86.42     uncharacterized protein LOC111464709 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1K1N3  2.8e-64  86.42     uncharacterized protein LOC111489817 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FG89  6.2e-64  85.80     uncharacterized protein LOC111445214 isoform X1 OS=Cucurbita moschata OX=3662 GN...

TAIR 10
Match Name   E-value  Identity  Description
AT2G40435.1  3.6e-40  59.87     BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G5...
AT3G56220.1  2.4e-36  55.35     transcription regulators
AT1G29270.1  5.4e-12  34.88     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G10570.1  2.8e-08  25.32     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G26744.1  4.8e-08  27.91     basic helix-loop-helix (bHLH) DNA-binding superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                  Source       Source Term    Source Description                        Alignment
None       No IPR available                                 COILS        Coil           Coil                                      coord: 34..61
None       No IPR available                                 PANTHER      PTHR31945      TRANSCRIPTION FACTOR SCREAM2-RELATED      coord: 1..160
None       No IPR available                                 PANTHER      PTHR31945:SF5  TRANSCRIPTION FACTOR SCREAM-LIKE PROTEIN  coord: 1..160
IPR036638  Helix-loop-helix DNA-binding domain superfamily  GENE3D       4.10.280.10    -                                         coord: 4..66; e-value: 6.0E-6; score: 28.3
IPR036638  Helix-loop-helix DNA-binding domain superfamily  SUPERFAMILY  47459          HLH, helix-loop-helix DNA-binding domain  coord: 10..69

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CmUC11G209680.1  CmUC11G209680.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003700      DNA-binding transcription factor activity
molecular_function  GO:0046983      protein dimerization activity
molecular_function  GO:0043565      sequence-specific DNA binding