CmoCh02G005850 (gene) Cucurbita moschata (Rifu)

Name: CmoCh02G005850
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: (BHLH transcription factor) (DNA binding protein)
Location: Cmo_Chr02 : 3473200 .. 3473835 (-)
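
As a quick sanity check on the Location field, the sketch below splits the location string into its parts and computes the feature length. It assumes the coordinates are 1-based and inclusive (as in GFF3); the record itself does not state this. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr02 : 3473200 .. 3473835 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cmo_Chr02 - 636
```

Under that assumption the feature spans 636 bp on the minus strand.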
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACGACATCGACATCCTCAAATCGACACTATCACCTTCCAACTCCATCGACATGTCGTCCACAATCCTCCTCAACAACAACAACAGCTCCCACTACCCTCCACTCCCCTTCTTCTCCGACTTCTCATCTCCCTCCTTCCACACCGCACAGCCAGTGCTGCCGAACCAACAGAACCGGCAGAAGCGCGGCGGCGGCGGGGGCGGGGGCGGGGGCGGGGGCGGGGGAATGGCAGCAATGAGAGAGATGATATTCCGAATAGCAGCAATGCAGCCAATACAAATAGACCCAGAGGAGATAAAGGCGCCAAAGCGGCGAAACGTGAGAATCTCGAAGGACCCACAGAGCGTGGCGGCGCGGCAGCGGAGGGAGAGAATCAGCCAGAAGATTAGGATTCTGCAGCGGCTGGTGCCTGGGGGAACTAAGATGGACACGGCGTCGATGCTGGATGAGGCGGTTCATTATGTGAAGTTCTTGAAGAGGCAAGTGCAAACGCTGGAGCAGGCAGGGTTTCACAATGCTAATACTACTAATAATAATAACAACTGTCCTAACCTTAGCTACTCTTCTGCGCTTTTCAAAGCTTGCCAAATGCCCCACGCTCCATCTATGCCTGGCTCTTTGCAAATGCATTGA

mRNA sequence

ATGGACGACATCGACATCCTCAAATCGACACTATCACCTTCCAACTCCATCGACATGTCGTCCACAATCCTCCTCAACAACAACAACAGCTCCCACTACCCTCCACTCCCCTTCTTCTCCGACTTCTCATCTCCCTCCTTCCACACCGCACAGCCAGTGCTGCCGAACCAACAGAACCGGCAGAAGCGCGGCGGCGGCGGGGGCGGGGGCGGGGGCGGGGGCGGGGGAATGGCAGCAATGAGAGAGATGATATTCCGAATAGCAGCAATGCAGCCAATACAAATAGACCCAGAGGAGATAAAGGCGCCAAAGCGGCGAAACGTGAGAATCTCGAAGGACCCACAGAGCGTGGCGGCGCGGCAGCGGAGGGAGAGAATCAGCCAGAAGATTAGGATTCTGCAGCGGCTGGTGCCTGGGGGAACTAAGATGGACACGGCGTCGATGCTGGATGAGGCGGTTCATTATGTGAAGTTCTTGAAGAGGCAAGTGCAAACGCTGGAGCAGGCAGGGTTTCACAATGCTAATACTACTAATAATAATAACAACTGTCCTAACCTTAGCTACTCTTCTGCGCTTTTCAAAGCTTGCCAAATGCCCCACGCTCCATCTATGCCTGGCTCTTTGCAAATGCATTGA

Coding sequence (CDS)

ATGGACGACATCGACATCCTCAAATCGACACTATCACCTTCCAACTCCATCGACATGTCGTCCACAATCCTCCTCAACAACAACAACAGCTCCCACTACCCTCCACTCCCCTTCTTCTCCGACTTCTCATCTCCCTCCTTCCACACCGCACAGCCAGTGCTGCCGAACCAACAGAACCGGCAGAAGCGCGGCGGCGGCGGGGGCGGGGGCGGGGGCGGGGGCGGGGGAATGGCAGCAATGAGAGAGATGATATTCCGAATAGCAGCAATGCAGCCAATACAAATAGACCCAGAGGAGATAAAGGCGCCAAAGCGGCGAAACGTGAGAATCTCGAAGGACCCACAGAGCGTGGCGGCGCGGCAGCGGAGGGAGAGAATCAGCCAGAAGATTAGGATTCTGCAGCGGCTGGTGCCTGGGGGAACTAAGATGGACACGGCGTCGATGCTGGATGAGGCGGTTCATTATGTGAAGTTCTTGAAGAGGCAAGTGCAAACGCTGGAGCAGGCAGGGTTTCACAATGCTAATACTACTAATAATAATAACAACTGTCCTAACCTTAGCTACTCTTCTGCGCTTTTCAAAGCTTGCCAAATGCCCCACGCTCCATCTATGCCTGGCTCTTTGCAAATGCATTGA
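
The query protein in the BLAST alignments is the translation of this CDS. The sketch below translates only the first 30 nucleotides with the standard genetic code, so the result can be compared with the start of the query sequence (`MDDIDILKST`). The helper name `translate` is hypothetical, and the use of the standard (nuclear) code is an assumption.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Any trailing partial codon is ignored; non-ACGT characters raise KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nucleotides of the CDS shown above.
cds_start = "ATGGACGACATCGACATCCTCAAATCGACA"
print(translate(cds_start))  # MDDIDILKST
```

Passing the full CDS string to the same function would give the complete predicted protein.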
BLAST of CmoCh02G005850 vs. Swiss-Prot
Match: HEC2_ARATH (Transcription factor HEC2 OS=Arabidopsis thaliana GN=HEC2 PE=1 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 6.6e-36
Identity = 95/177 (53.67%), Positives = 115/177 (64.97%), Query Frame = 1

Query: 1   MDDIDILKSTLSPSNSIDMSSTILLNNNNSSH-------YPPLPFFSDFS--SPSFHTAQ 60
           M  ++ L    S SN       I++ + +++H       +  LPF        P  +   
Sbjct: 12  MQQMEKLPEHFSNSNPNPNPHNIMMLSESNTHPFFFNPTHSHLPFDQTMPHHQPGLNFRY 71

Query: 61  PVLPNQQNRQKRGGGGGGGGGGGGGMAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRIS 120
              P+    +KRGG           MAAMREMIFRIA MQPI IDPE +K PKR+NVRIS
Sbjct: 72  APSPSSSLPEKRGGCSDNAN-----MAAMREMIFRIAVMQPIHIDPESVKPPKRKNVRIS 131

Query: 121 KDPQSVAARQRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQ 169
           KDPQSVAAR RRERIS++IRILQRLVPGGTKMDTASMLDEA+HYVKFLK+QVQ+LE+
Sbjct: 132 KDPQSVAARHRRERISERIRILQRLVPGGTKMDTASMLDEAIHYVKFLKKQVQSLEE 183
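
The percentages in each HSP header follow directly from the match counts and the alignment length. A minimal sketch of that arithmetic, using the counts reported for the HEC2_ARATH hit (the function name `percent` is hypothetical):

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(95, 177)    # identical residues over alignment length
positives = percent(115, 177)  # identical plus conservatively substituted residues
print(identity, positives)     # 53.67 64.97
```

The alignment length (177 here) includes gap columns, so it can exceed the number of query residues covered.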

BLAST of CmoCh02G005850 vs. Swiss-Prot
Match: HEC1_ARATH (Transcription factor HEC1 OS=Arabidopsis thaliana GN=HEC1 PE=1 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 1.9e-35
Identity = 92/152 (60.53%), Positives = 102/152 (67.11%), Query Frame = 1

Query: 29  NSSHYPPLPFFSDFS---SPSFHTAQPVLPNQQNRQKRGG---------GGGGGGGGGGG 88
           NS+HY      SD S    P F     +L N  +                       G  
Sbjct: 40  NSTHYQ-----SDHSMTNEPGFRYGSGLLTNPSSISPNTAYSSVFLDKRNNSNNNNNGTN 99

Query: 89  MAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRL 148
           MAAMREMIFRIA MQPI IDPE +K PKRRNVRISKDPQSVAAR RRERIS++IRILQRL
Sbjct: 100 MAAMREMIFRIAVMQPIHIDPEAVKPPKRRNVRISKDPQSVAARHRRERISERIRILQRL 159

Query: 149 VPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQ 169
           VPGGTKMDTASMLDEA+HYVKFLK+QVQ+LE+
Sbjct: 160 VPGGTKMDTASMLDEAIHYVKFLKKQVQSLEE 186

BLAST of CmoCh02G005850 vs. Swiss-Prot
Match: HEC3_ARATH (Transcription factor HEC3 OS=Arabidopsis thaliana GN=HEC3 PE=1 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 1.2e-29
Identity = 80/143 (55.94%), Positives = 99/143 (69.23%), Query Frame = 1

Query: 24  LLNNNNSSHYPPLPFFSDFSSPSFHTAQPVLPNQQNRQKRGGGGGGGGGGGGGMAAMREM 83
           + N  +SSH+PPL       S S  T   +  +Q++ +         G       AM+EM
Sbjct: 53  IFNPFSSSHFPPL-------SSSLTTTTLLSGDQEDDEDEEEPLEELG-------AMKEM 112

Query: 84  IFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLVPGGTKM 143
           +++IAAMQ + IDP  +K PKRRNVRIS DPQSVAAR RRERIS++IRILQRLVPGGTKM
Sbjct: 113 MYKIAAMQSVDIDPATVKKPKRRNVRISDDPQSVAARHRRERISERIRILQRLVPGGTKM 172

Query: 144 DTASMLDEAVHYVKFLKRQVQTL 167
           DTASMLDEA+ YVKFLKRQ++ L
Sbjct: 173 DTASMLDEAIRYVKFLKRQIRLL 181

BLAST of CmoCh02G005850 vs. Swiss-Prot
Match: IND_ARATH (Transcription factor IND OS=Arabidopsis thaliana GN=IND PE=1 SV=3)

HSP 1 Score: 125.2 bits (313), Expect = 8.7e-28
Identity = 66/107 (61.68%), Positives = 79/107 (73.83%), Query Frame = 1

Query: 77  MAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRL 136
           M AM+EM + IA MQP+ IDP  +  P RRNVRIS DPQ+V AR+RRERIS+KIRIL+R+
Sbjct: 85  MDAMKEMQYMIAVMQPVDIDPATVPKPNRRNVRISDDPQTVVARRRRERISEKIRILKRI 144

Query: 137 VPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFHNANTTNNNNNC 184
           VPGG KMDTASMLDEA+ Y KFLKRQV+ L+      A   N +  C
Sbjct: 145 VPGGAKMDTASMLDEAIRYTKFLKRQVRILQPHSQIGAPMANPSYLC 191

BLAST of CmoCh02G005850 vs. Swiss-Prot
Match: BH087_ARATH (Transcription factor bHLH87 OS=Arabidopsis thaliana GN=BHLH87 PE=2 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 3.3e-27
Identity = 68/115 (59.13%), Positives = 85/115 (73.91%), Query Frame = 1

Query: 77  MAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRL 136
           +A M+EMI+R AA +P+    E ++ PKR+NV+IS DPQ+VAARQRRERIS+KIR+LQ L
Sbjct: 242 IAQMKEMIYRAAAFRPVNFGLEIVEKPKRKNVKISTDPQTVAARQRRERISEKIRVLQTL 301

Query: 137 VPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFHNANTTNNNNNCPNLSYSSA 192
           VPGGTKMDTASMLDEA +Y+KFL+ QV+ LE        T        NLS+SSA
Sbjct: 302 VPGGTKMDTASMLDEAANYLKFLRAQVKALENLRPKLDQT--------NLSFSSA 348

BLAST of CmoCh02G005850 vs. TrEMBL
Match: A0A0A0LJI7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G285890 PE=4 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 4.9e-62
Identity = 153/227 (67.40%), Positives = 165/227 (72.69%), Query Frame = 1

Query: 1   MDDIDILKSTLSPSNSIDMSSTILLNN---NNSSHY----PPLPFFSDFSSPS----FHT 60
           MDDIDILKSTLS S   DMSST   NN   N + H     PP  +FSD+S P     F T
Sbjct: 1   MDDIDILKSTLSQS---DMSSTFFPNNTAPNCTPHVLPIIPPPAYFSDYSPPPGTSLFQT 60

Query: 61  AQPVLPNQQNRQKRGGGGGGGGGGGGGMAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVR 120
              ++P    RQ+R G  GG      GMAAMREMIFRIAAMQP++IDPE IKAPKRRNVR
Sbjct: 61  TPTIIPETPARQRRSGVSGG------GMAAMREMIFRIAAMQPVEIDPEAIKAPKRRNVR 120

Query: 121 ISKDPQSVAARQRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQA 180
           ISKDPQSVAAR RRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQA
Sbjct: 121 ISKDPQSVAARHRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQA 180

Query: 181 GFHNANTTNNNN------NCPNLSYSSALFKACQMPHAPSMPGSLQM 211
           GF+  N  NNNN      N  NL+Y+SALFKACQ+     MP SLQM
Sbjct: 181 GFNYNNNNNNNNNFNNFVNSANLNYASALFKACQI-----MPASLQM 213

BLAST of CmoCh02G005850 vs. TrEMBL
Match: A0A0B2SA94_GLYSO (Transcription factor HEC1 OS=Glycine soja GN=glysoja_003825 PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 4.8e-41
Identity = 122/240 (50.83%), Positives = 145/240 (60.42%), Query Frame = 1

Query: 3   DIDILKSTLSPSNSIDMSSTILL-------------NNNNSSHYP-------------PL 62
           D+DILK++ S + S+DM  T++              NNNN+S  P              L
Sbjct: 2   DVDILKTSTSDNISMDMMMTMMQMEKFPEFCEPFYNNNNNTSTAPLYPENELLINSTTTL 61

Query: 63  PFFSDF-----------SSPSFHTAQPVLPN-QQNRQKRGGGGGGGGGGGGGMAAMREMI 122
           P FS+            SS +F   QP+ P+ + N +KR             +AAMREMI
Sbjct: 62  PVFSNVINNPNVITPPPSSSNFIQQQPMTPHLEPNLEKRNS-----------VAAMREMI 121

Query: 123 FRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLVPGGTKMD 182
           FR+A MQPI IDPE IK PKRRNV+ISKDPQSVAAR RRERIS++IRILQRLVPGGTKMD
Sbjct: 122 FRVAVMQPIHIDPESIKPPKRRNVKISKDPQSVAARHRRERISERIRILQRLVPGGTKMD 181

Query: 183 TASMLDEAVHYVKFLKRQVQTLEQAGFHN-------ANTTNNNNNCPNLSYSSALFKACQ 198
           TASMLDEA+HYVKFLK+QVQTLEQAG            T +N NN  N SY S   K+CQ
Sbjct: 182 TASMLDEAIHYVKFLKKQVQTLEQAGASRPLNVVGFPTTASNANN--NNSY-SGFVKSCQ 227

BLAST of CmoCh02G005850 vs. TrEMBL
Match: I1LHE1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_11G055300 PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 4.8e-41
Identity = 122/240 (50.83%), Positives = 145/240 (60.42%), Query Frame = 1

Query: 3   DIDILKSTLSPSNSIDMSSTILL-------------NNNNSSHYP-------------PL 62
           D+DILK++ S + S+DM  T++              NNNN+S  P              L
Sbjct: 2   DVDILKTSTSDNISMDMMMTMMQMEKFPEFCEPFYNNNNNTSTAPLYPENELLINSTTTL 61

Query: 63  PFFSDF-----------SSPSFHTAQPVLPN-QQNRQKRGGGGGGGGGGGGGMAAMREMI 122
           P FS+            SS +F   QP+ P+ + N +KR             +AAMREMI
Sbjct: 62  PVFSNVINNPNVITPPPSSSNFIQQQPMTPHLEPNLEKRNS-----------VAAMREMI 121

Query: 123 FRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLVPGGTKMD 182
           FR+A MQPI IDPE IK PKRRNV+ISKDPQSVAAR RRERIS++IRILQRLVPGGTKMD
Sbjct: 122 FRVAVMQPIHIDPESIKPPKRRNVKISKDPQSVAARHRRERISERIRILQRLVPGGTKMD 181

Query: 183 TASMLDEAVHYVKFLKRQVQTLEQAGFHN-------ANTTNNNNNCPNLSYSSALFKACQ 198
           TASMLDEA+HYVKFLK+QVQTLEQAG            T +N NN  N SY S   K+CQ
Sbjct: 182 TASMLDEAIHYVKFLKKQVQTLEQAGASRPLNVVGFPTTASNANN--NNSY-SGFVKSCQ 227

BLAST of CmoCh02G005850 vs. TrEMBL
Match: B9SC79_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_1408840 PE=4 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 3.1e-40
Identity = 98/136 (72.06%), Positives = 110/136 (80.88%), Query Frame = 1

Query: 78  AAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLV 137
           AA+REMIFRIAAMQPI IDPE IK PKRRNV+ISKDPQSVAAR RRERIS++IRILQRLV
Sbjct: 143 AAIREMIFRIAAMQPIHIDPESIKPPKRRNVKISKDPQSVAARHRRERISERIRILQRLV 202

Query: 138 PGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFH---NANTTNNNNNCPNLSYSSALFK 197
           PGGTKMDTASMLDEA+HYVKFLK+QVQ+LEQAG +    A    +    PN+ YSS L K
Sbjct: 203 PGGTKMDTASMLDEAIHYVKFLKKQVQSLEQAGANRSMGAGFPFSGLTMPNMGYSS-LLK 262

Query: 198 ACQMPHAPSMPGSLQM 211
            CQ  H P+M  S+QM
Sbjct: 263 NCQPAH-PNMVSSMQM 276

BLAST of CmoCh02G005850 vs. TrEMBL
Match: C6TJ68_SOYBN (Putative uncharacterized protein OS=Glycine max PE=2 SV=1)

HSP 1 Score: 172.9 bits (437), Expect = 4.0e-40
Identity = 124/250 (49.60%), Positives = 148/250 (59.20%), Query Frame = 1

Query: 3   DIDILKSTLSPSNSIDMSSTILL-------------NNNNSSHYP-------------PL 62
           D+DILK++ S + S+DM  T++              NNNN+S  P              L
Sbjct: 2   DVDILKTSTSDNISMDMMMTMMQMEKFPEFCEPFYNNNNNTSTAPLYPENELLINSTTTL 61

Query: 63  PFFSDF-----------SSPSFHTAQPVLPN-QQNRQKRGGGGGGGGGGGGGMAAMREMI 122
           P FS+            SS +F   QP+ P+ + N +KR             +AAMREMI
Sbjct: 62  PVFSNVINNPNVITPPPSSSNFIQQQPMTPHLEPNLEKRNS-----------VAAMREMI 121

Query: 123 FRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLVPGGTKMD 182
           FR+A MQPI IDPE IK PKRRNV+ISKDPQSVAAR RRERIS++I+ILQRLVPGGTKMD
Sbjct: 122 FRVAVMQPIHIDPESIKPPKRRNVKISKDPQSVAARHRRERISERIKILQRLVPGGTKMD 181

Query: 183 TASMLDEAVHYVKFLKRQVQTLEQAGFHN-------ANTTNNNNNCPNLSYSSALFKACQ 208
           TASMLDEA+HYVKFLK+QVQTLEQAG            T +N NN  N SY S   K+C 
Sbjct: 182 TASMLDEAIHYVKFLKKQVQTLEQAGASRPLNVVGFPTTASNANN--NNSY-SGFVKSC- 234

BLAST of CmoCh02G005850 vs. TAIR10
Match: AT3G50330.1 (AT3G50330.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 152.1 bits (383), Expect = 3.7e-37
Identity = 95/177 (53.67%), Positives = 115/177 (64.97%), Query Frame = 1

Query: 1   MDDIDILKSTLSPSNSIDMSSTILLNNNNSSH-------YPPLPFFSDFS--SPSFHTAQ 60
           M  ++ L    S SN       I++ + +++H       +  LPF        P  +   
Sbjct: 12  MQQMEKLPEHFSNSNPNPNPHNIMMLSESNTHPFFFNPTHSHLPFDQTMPHHQPGLNFRY 71

Query: 61  PVLPNQQNRQKRGGGGGGGGGGGGGMAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRIS 120
              P+    +KRGG           MAAMREMIFRIA MQPI IDPE +K PKR+NVRIS
Sbjct: 72  APSPSSSLPEKRGGCSDNAN-----MAAMREMIFRIAVMQPIHIDPESVKPPKRKNVRIS 131

Query: 121 KDPQSVAARQRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQ 169
           KDPQSVAAR RRERIS++IRILQRLVPGGTKMDTASMLDEA+HYVKFLK+QVQ+LE+
Sbjct: 132 KDPQSVAARHRRERISERIRILQRLVPGGTKMDTASMLDEAIHYVKFLKKQVQSLEE 183

BLAST of CmoCh02G005850 vs. TAIR10
Match: AT5G67060.1 (AT5G67060.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 150.6 bits (379), Expect = 1.1e-36
Identity = 92/152 (60.53%), Positives = 102/152 (67.11%), Query Frame = 1

Query: 29  NSSHYPPLPFFSDFS---SPSFHTAQPVLPNQQNRQKRGG---------GGGGGGGGGGG 88
           NS+HY      SD S    P F     +L N  +                       G  
Sbjct: 40  NSTHYQ-----SDHSMTNEPGFRYGSGLLTNPSSISPNTAYSSVFLDKRNNSNNNNNGTN 99

Query: 89  MAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRL 148
           MAAMREMIFRIA MQPI IDPE +K PKRRNVRISKDPQSVAAR RRERIS++IRILQRL
Sbjct: 100 MAAMREMIFRIAVMQPIHIDPEAVKPPKRRNVRISKDPQSVAARHRRERISERIRILQRL 159

Query: 149 VPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQ 169
           VPGGTKMDTASMLDEA+HYVKFLK+QVQ+LE+
Sbjct: 160 VPGGTKMDTASMLDEAIHYVKFLKKQVQSLEE 186

BLAST of CmoCh02G005850 vs. TAIR10
Match: AT5G09750.1 (AT5G09750.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 131.3 bits (329), Expect = 6.8e-31
Identity = 80/143 (55.94%), Positives = 99/143 (69.23%), Query Frame = 1

Query: 24  LLNNNNSSHYPPLPFFSDFSSPSFHTAQPVLPNQQNRQKRGGGGGGGGGGGGGMAAMREM 83
           + N  +SSH+PPL       S S  T   +  +Q++ +         G       AM+EM
Sbjct: 53  IFNPFSSSHFPPL-------SSSLTTTTLLSGDQEDDEDEEEPLEELG-------AMKEM 112

Query: 84  IFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLVPGGTKM 143
           +++IAAMQ + IDP  +K PKRRNVRIS DPQSVAAR RRERIS++IRILQRLVPGGTKM
Sbjct: 113 MYKIAAMQSVDIDPATVKKPKRRNVRISDDPQSVAARHRRERISERIRILQRLVPGGTKM 172

Query: 144 DTASMLDEAVHYVKFLKRQVQTL 167
           DTASMLDEA+ YVKFLKRQ++ L
Sbjct: 173 DTASMLDEAIRYVKFLKRQIRLL 181

BLAST of CmoCh02G005850 vs. TAIR10
Match: AT4G00120.1 (AT4G00120.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 125.2 bits (313), Expect = 4.9e-29
Identity = 66/107 (61.68%), Positives = 79/107 (73.83%), Query Frame = 1

Query: 77  MAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRL 136
           M AM+EM + IA MQP+ IDP  +  P RRNVRIS DPQ+V AR+RRERIS+KIRIL+R+
Sbjct: 85  MDAMKEMQYMIAVMQPVDIDPATVPKPNRRNVRISDDPQTVVARRRRERISEKIRILKRI 144

Query: 137 VPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFHNANTTNNNNNC 184
           VPGG KMDTASMLDEA+ Y KFLKRQV+ L+      A   N +  C
Sbjct: 145 VPGGAKMDTASMLDEAIRYTKFLKRQVRILQPHSQIGAPMANPSYLC 191

BLAST of CmoCh02G005850 vs. TAIR10
Match: AT3G21330.1 (AT3G21330.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 123.2 bits (308), Expect = 1.9e-28
Identity = 68/115 (59.13%), Positives = 85/115 (73.91%), Query Frame = 1

Query: 77  MAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRL 136
           +A M+EMI+R AA +P+    E ++ PKR+NV+IS DPQ+VAARQRRERIS+KIR+LQ L
Sbjct: 242 IAQMKEMIYRAAAFRPVNFGLEIVEKPKRKNVKISTDPQTVAARQRRERISEKIRVLQTL 301

Query: 137 VPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFHNANTTNNNNNCPNLSYSSA 192
           VPGGTKMDTASMLDEA +Y+KFL+ QV+ LE        T        NLS+SSA
Sbjct: 302 VPGGTKMDTASMLDEAANYLKFLRAQVKALENLRPKLDQT--------NLSFSSA 348

BLAST of CmoCh02G005850 vs. NCBI nr
Match: gi|659115951|ref|XP_008457823.1| (PREDICTED: transcription factor HEC2-like [Cucumis melo])

HSP 1 Score: 246.1 bits (627), Expect = 5.4e-62
Identity = 152/224 (67.86%), Positives = 163/224 (72.77%), Query Frame = 1

Query: 1   MDDIDILKSTLSPSNSIDMSSTILLNN-NNSSHYPPLPFFSDFSSPS----FHTAQPVLP 60
           MDDIDILKSTLSPS   DMSST   NN        P P+FSD+S P     F T   ++P
Sbjct: 1   MDDIDILKSTLSPS---DMSSTFFPNNCPQLLPVIPPPYFSDYSPPPGTSLFQTTPTIIP 60

Query: 61  NQQNRQKRGGGGGGGGGGGGGMAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQ 120
               RQ+R G  GG      GMAAMREMIFRIAAMQP++IDPE IKAPKRRNVRISKDPQ
Sbjct: 61  EAPARQRRSGVSGG------GMAAMREMIFRIAAMQPVEIDPEAIKAPKRRNVRISKDPQ 120

Query: 121 SVAARQRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFHNAN 180
           SVAAR RRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGF+  N
Sbjct: 121 SVAARHRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFNYNN 180

Query: 181 TTNNNN---------NCPNLSYSSALFKACQMPHAPSMPGSLQM 211
             NNNN         N  NL+Y+SALFKACQ+     MP SLQM
Sbjct: 181 NNNNNNNNNNFNNFVNSANLNYASALFKACQI-----MPASLQM 210

BLAST of CmoCh02G005850 vs. NCBI nr
Match: gi|700206886|gb|KGN62005.1| (hypothetical protein Csa_2G285890 [Cucumis sativus])

HSP 1 Score: 245.7 bits (626), Expect = 7.0e-62
Identity = 153/227 (67.40%), Positives = 165/227 (72.69%), Query Frame = 1

Query: 1   MDDIDILKSTLSPSNSIDMSSTILLNN---NNSSHY----PPLPFFSDFSSPS----FHT 60
           MDDIDILKSTLS S   DMSST   NN   N + H     PP  +FSD+S P     F T
Sbjct: 1   MDDIDILKSTLSQS---DMSSTFFPNNTAPNCTPHVLPIIPPPAYFSDYSPPPGTSLFQT 60

Query: 61  AQPVLPNQQNRQKRGGGGGGGGGGGGGMAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVR 120
              ++P    RQ+R G  GG      GMAAMREMIFRIAAMQP++IDPE IKAPKRRNVR
Sbjct: 61  TPTIIPETPARQRRSGVSGG------GMAAMREMIFRIAAMQPVEIDPEAIKAPKRRNVR 120

Query: 121 ISKDPQSVAARQRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQA 180
           ISKDPQSVAAR RRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQA
Sbjct: 121 ISKDPQSVAARHRRERISQKIRILQRLVPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQA 180

Query: 181 GFHNANTTNNNN------NCPNLSYSSALFKACQMPHAPSMPGSLQM 211
           GF+  N  NNNN      N  NL+Y+SALFKACQ+     MP SLQM
Sbjct: 181 GFNYNNNNNNNNNFNNFVNSANLNYASALFKACQI-----MPASLQM 213

BLAST of CmoCh02G005850 vs. NCBI nr
Match: gi|734423164|gb|KHN42085.1| (Transcription factor HEC1 [Glycine soja])

HSP 1 Score: 176.0 bits (445), Expect = 6.8e-41
Identity = 122/240 (50.83%), Positives = 145/240 (60.42%), Query Frame = 1

Query: 3   DIDILKSTLSPSNSIDMSSTILL-------------NNNNSSHYP-------------PL 62
           D+DILK++ S + S+DM  T++              NNNN+S  P              L
Sbjct: 2   DVDILKTSTSDNISMDMMMTMMQMEKFPEFCEPFYNNNNNTSTAPLYPENELLINSTTTL 61

Query: 63  PFFSDF-----------SSPSFHTAQPVLPN-QQNRQKRGGGGGGGGGGGGGMAAMREMI 122
           P FS+            SS +F   QP+ P+ + N +KR             +AAMREMI
Sbjct: 62  PVFSNVINNPNVITPPPSSSNFIQQQPMTPHLEPNLEKRNS-----------VAAMREMI 121

Query: 123 FRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLVPGGTKMD 182
           FR+A MQPI IDPE IK PKRRNV+ISKDPQSVAAR RRERIS++IRILQRLVPGGTKMD
Sbjct: 122 FRVAVMQPIHIDPESIKPPKRRNVKISKDPQSVAARHRRERISERIRILQRLVPGGTKMD 181

Query: 183 TASMLDEAVHYVKFLKRQVQTLEQAGFHN-------ANTTNNNNNCPNLSYSSALFKACQ 198
           TASMLDEA+HYVKFLK+QVQTLEQAG            T +N NN  N SY S   K+CQ
Sbjct: 182 TASMLDEAIHYVKFLKKQVQTLEQAGASRPLNVVGFPTTASNANN--NNSY-SGFVKSCQ 227

BLAST of CmoCh02G005850 vs. NCBI nr
Match: gi|1021514246|ref|XP_016200633.1| (PREDICTED: transcription factor HEC2 [Arachis ipaensis])

HSP 1 Score: 174.9 bits (442), Expect = 1.5e-40
Identity = 95/139 (68.35%), Positives = 107/139 (76.98%), Query Frame = 1

Query: 77  MAAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRL 136
           MAAMREMIFR+A MQP+ IDPE IK PKRRNV+ISKDPQSVAAR RRERIS++IRILQRL
Sbjct: 114 MAAMREMIFRVAVMQPVHIDPESIKPPKRRNVKISKDPQSVAARHRRERISERIRILQRL 173

Query: 137 VPGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFHNANTTNNN--------NNCPNLSY 196
           VPGGTKMDTASMLDEA+HYVKFLK+QVQTLEQAG     T NNN        ++  N S 
Sbjct: 174 VPGGTKMDTASMLDEAIHYVKFLKKQVQTLEQAG--GGRTCNNNGFTGFSSSSSSINASN 233

Query: 197 SSALFKACQMPHAPSMPGS 208
             A+ K C  P+ P + GS
Sbjct: 234 YPAMVKGCHQPYPPMLMGS 250

BLAST of CmoCh02G005850 vs. NCBI nr
Match: gi|255565212|ref|XP_002523598.1| (PREDICTED: transcription factor HEC1 [Ricinus communis])

HSP 1 Score: 173.3 bits (438), Expect = 4.4e-40
Identity = 98/136 (72.06%), Positives = 110/136 (80.88%), Query Frame = 1

Query: 78  AAMREMIFRIAAMQPIQIDPEEIKAPKRRNVRISKDPQSVAARQRRERISQKIRILQRLV 137
           AA+REMIFRIAAMQPI IDPE IK PKRRNV+ISKDPQSVAAR RRERIS++IRILQRLV
Sbjct: 143 AAIREMIFRIAAMQPIHIDPESIKPPKRRNVKISKDPQSVAARHRRERISERIRILQRLV 202

Query: 138 PGGTKMDTASMLDEAVHYVKFLKRQVQTLEQAGFH---NANTTNNNNNCPNLSYSSALFK 197
           PGGTKMDTASMLDEA+HYVKFLK+QVQ+LEQAG +    A    +    PN+ YSS L K
Sbjct: 203 PGGTKMDTASMLDEAIHYVKFLKKQVQSLEQAGANRSMGAGFPFSGLTMPNMGYSS-LLK 262

Query: 198 ACQMPHAPSMPGSLQM 211
            CQ  H P+M  S+QM
Sbjct: 263 NCQPAH-PNMVSSMQM 276

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
HEC2_ARATH   6.6e-36  53.67         Transcription factor HEC2 OS=Arabidopsis thaliana GN=HEC2 PE=1 SV=1
HEC1_ARATH   1.9e-35  60.53         Transcription factor HEC1 OS=Arabidopsis thaliana GN=HEC1 PE=1 SV=1
HEC3_ARATH   1.2e-29  55.94         Transcription factor HEC3 OS=Arabidopsis thaliana GN=HEC3 PE=1 SV=1
IND_ARATH    8.7e-28  61.68         Transcription factor IND OS=Arabidopsis thaliana GN=IND PE=1 SV=3
BH087_ARATH  3.3e-27  59.13         Transcription factor bHLH87 OS=Arabidopsis thaliana GN=BHLH87 PE=2 SV=1

Match Name        E-value  Identity (%)  Description
A0A0A0LJI7_CUCSA  4.9e-62  67.40         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G285890 PE=4 SV=1
A0A0B2SA94_GLYSO  4.8e-41  50.83         Transcription factor HEC1 OS=Glycine soja GN=glysoja_003825 PE=4 SV=1
I1LHE1_SOYBN      4.8e-41  50.83         Uncharacterized protein OS=Glycine max GN=GLYMA_11G055300 PE=4 SV=1
B9SC79_RICCO      3.1e-40  72.06         DNA binding protein, putative OS=Ricinus communis GN=RCOM_1408840 PE=4 SV=1
C6TJ68_SOYBN      4.0e-40  49.60         Putative uncharacterized protein OS=Glycine max PE=2 SV=1

Match Name   E-value  Identity (%)  Description
AT3G50330.1  3.7e-37  53.67         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G67060.1  1.1e-36  60.53         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G09750.1  6.8e-31  55.94         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G00120.1  4.9e-29  61.68         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G21330.1  1.9e-28  59.13         basic helix-loop-helix (bHLH) DNA-binding superfamily protein

Match Name                         E-value  Identity (%)  Description
gi|659115951|ref|XP_008457823.1|   5.4e-62  67.86         PREDICTED: transcription factor HEC2-like [Cucumis melo]
gi|700206886|gb|KGN62005.1|        7.0e-62  67.40         hypothetical protein Csa_2G285890 [Cucumis sativus]
gi|734423164|gb|KHN42085.1|        6.8e-41  50.83         Transcription factor HEC1 [Glycine soja]
gi|1021514246|ref|XP_016200633.1|  1.5e-40  68.35         PREDICTED: transcription factor HEC2 [Arachis ipaensis]
gi|255565212|ref|XP_002523598.1|   4.4e-40  72.06         PREDICTED: transcription factor HEC1 [Ricinus communis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR011598   bHLH_dom
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh02G005850.1  CmoCh02G005850.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                 Source   Source Term        Source Description                         Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10                                             coord: 111..169; score: 6.8
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PFAM     PF00010            HLH                                        coord: 119..159; score: 1.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  SMART    SM00353            finulus                                    coord: 116..165; score: 9.8
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888            BHLH                                       coord: 110..159; score: 15
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain   coord: 114..169; score: 2.36
None       No IPR available                                unknown  Coil               Coil                                       coord: 149..169; scor
None       No IPR available                                PANTHER  PTHR12565          STEROL REGULATORY ELEMENT-BINDING PROTEIN  coord: 77..168; score: 8.7
None       No IPR available                                PANTHER  PTHR12565:SF74     TRANSCRIPTION FACTOR HEC1                  coord: 77..168; score: 8.7

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                      Block
CmoCh02G005850  Cucsa.101020     Cucumber (Gy14) v1            cgycmoB0225
CmoCh02G005850  Cucsa.366890     Cucumber (Gy14) v1            cgycmoB1018
CmoCh02G005850  CmaCh02G005770   Cucurbita maxima (Rimu)       cmacmoB619
CmoCh02G005850  CmaCh03G010890   Cucurbita maxima (Rimu)       cmacmoB652
CmoCh02G005850  Cla020193        Watermelon (97103) v1         cmowmB596
CmoCh02G005850  Cla008442        Watermelon (97103) v1         cmowmB574
CmoCh02G005850  Csa2G285890      Cucumber (Chinese Long) v2    cmocuB581
CmoCh02G005850  Csa6G483450      Cucumber (Chinese Long) v2    cmocuB619
CmoCh02G005850  MELO3C020892     Melon (DHL92) v3.5.1          cmomeB535
CmoCh02G005850  ClCG01G015500    Watermelon (Charleston Gray)  cmowcgB519
CmoCh02G005850  ClCG02G012720    Watermelon (Charleston Gray)  cmowcgB547
CmoCh02G005850  CSPI06G26550     Wild cucumber (PI 183967)     cmocpiB624
CmoCh02G005850  CSPI02G14270     Wild cucumber (PI 183967)     cmocpiB586
CmoCh02G005850  Lsi10G005600     Bottle gourd (USVL1VR-Ls)     cmolsiB538
CmoCh02G005850  Cp4.1LG05g11010  Cucurbita pepo (Zucchini)     cmocpeB595
CmoCh02G005850  Cp4.1LG10g03410  Cucurbita pepo (Zucchini)     cmocpeB547
CmoCh02G005850  CsaV3_6G041730   Cucumber (Chinese Long) v3    cmocucB0738
CmoCh02G005850  CsaV3_2G016810   Cucumber (Chinese Long) v3    cmocucB0692
CmoCh02G005850  Cla97C01G014450  Watermelon (97103) v2         cmowmbB579
CmoCh02G005850  Cla97C02G038410  Watermelon (97103) v2         cmowmbB610
CmoCh02G005850  Bhi10G001761     Wax gourd                     cmowgoB0753
CmoCh02G005850  CsGy2G014150     Cucumber (Gy14) v2            cgybcmoB225
CmoCh02G005850  CsGy6G026000     Cucumber (Gy14) v2            cgybcmoB796
CmoCh02G005850  Carg11647        Silver-seed gourd             carcmoB1422
The following gene(s) are paralogous to this gene:
Gene            Paralogue        Organism                      Block
CmoCh02G005850  CmoCh03G010750   Cucurbita moschata (Rifu)     cmocmoB431