CmaCh04G020040 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G020040
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Basic helix-loop-helix DNA-binding superfamily protein
Location: Cma_Chr04 : 12357179 .. 12359087 (+)
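
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; this record does not state it explicitly). The function name `parse_location` and the returned keys are illustrative, not part of any database API.

```python
import re

# Pattern for a location such as "Cma_Chr04 : 12357179 .. 12359087 (+)".
LOCATION_RE = re.compile(r"\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into sequence id, start, end, strand and span length.

    Assumption: coordinates are 1-based and inclusive, so the span length
    is end - start + 1.
    """
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("Cma_Chr04 : 12357179 .. 12359087 (+)")
print(location["seqid"], location["strand"], location["length"])
```

Under the inclusive-coordinate assumption, the gene spans 1909 bases on the plus strand of Cma_Chr04.
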
The following sequences are available for this feature:

Gene sequence (with intron)

AGGAAAGGGGGCGCCTATGGATGCTGCTGCGTCTATATATTTAGCGTAGTGGAAGGGTTGAAAAACCAAAGCTCCTACCTTTGATCTCAACCAACTCAAACCCATCATCCTTTTCAAACTCCCACTGCCCACGCACCAAGCAGATCTTAATCACAAGGGTAGGGTTTTTTTTTGGGATATTATGAAGGAGGAAGGAGAGTGCTTTCAGGGGTTTCAAGAGCACCTCCTAATGCAACAATTCCAGACTATGCAACAAAACAGCGACGTTTTTGGAGTAGGAGGTGGAGGAGGCTTGATTTTTCCTGAAGTTTCTCCTATGATGTTTACGCCTCCGTGGACCGCGACCACAACCATCCCTCAAGTCCACCACGACCCCTTCATCCCGCCGCCGCCCCCACCTTCCTACGCCAGTTTCTTCAATAGGAGAAATGCTTCTCTGCAGTTCCCTTACGATATTGGGAACAATAGCCTTGGAATTGGGCAGATGGGTCAGCCTGGGTCTGGCCCCTTTGGGCTGCAGGCCGAGCTGAGTAAGATGAGCCCTCAAGAAATAATGGACGCCAAAGCTCTTGCTGCCTCTAAAAGCCATAGTGAAGCTGAGAGACGAAGAAGAGAGAGAATCAATAATCATCTTGCTAAGTTACGAAGCTTACTCCCAAGTACCACCAAAGTAAGTTCTTTTTGTTCCTCAAATTAAACCCAAAAACAAATGGAATCAATTTCTTTACTTCAAAATCACTCCCAAATTGGGCTATTTTTTGTTTTTTATGCTATGGAATTTTCTAAAATCCAAGATTTTTATACATTTTTTTACTTCTTTTTCCACATGCGATATAAAATGGAAGCTCTTATTCAATCAATTTCATTAAAATTGAATGAATTCATGTAATATATTTTTTACTATTCTCAAAGAAATCCTTTTTACAATGTTTTTGCAGACGGACAAGGCATCCTTGCTTGCAGAGGTAATTCAACATGTAAAAGAGCTGAAACGCCAGACGACATTGATAGCTGATGTAAGTCCAGTTCCGACCGAGCTCGATGAGCTATCAGTTGACGTCGATGCATCGGACGAGGACGGTAAGTTCGTGATAAAAGCCTCACTTTGTTGTGAGGACAGGTCTGATCTTCTCCCACAGATCATTAAAACCCTGAAATCCCTCCGCTTGAGAACCCTAAAAGCGGAGATCACGACGCTGGGCGGCCGAGTAAAGAACGTGTTGTTTATCACCGGGGACGACGACTCCTCCAGCGACCAAGAGGAGCCTGAACAGCAGCACCACCAATACAGCATCAGTTCCATTCAGGAAGCCCTAAAGGGCGTGATGGAAAAGGTTTGTAGCGAAGACCAATCATCCTCAGGCAGCATCAAAAGGCAAAGGACCAATAATCACAACAACGTCAATATTTAGGAGGAGAGGTGGGTTTAAGTGTGACATAAAATTATGATATTTGTTAAAATAGTATGTTAAGTGTGTTAGCATTAAGAATGTGACATAAAATTATGATCTTTGTTTTGGTGTGGCTTTCGTATGTACAATTATTATTATTATGTGTACTAGAGAGCAGTGGATGAGAAGGGAATGTAATGATGTTTGTGATGGAAAAGGGTAAGGATCTTTTTCGTGGGGAGGAAACCAAAATATGAGGATCCAAGATGGAAATTTGTGATTTTGAGTATTAGAAGTGAAATGAATGTGTATTGGGGTTACAGAAGGATCTGAGTTTGGTCCCAAATTTGCAGAGAGAGTTTGGGTTTGTTTTGTAGTCAAGTATGTGGCTGGCAGAGAAGCTGCCGTCTCTTTACAGTTTTTATTGGTGGGAATTATGTCATGGATGAGACAATGTTAACGACATCTAGTTTTGTCATATGTTCACAGAATTCTAGTGGACCAACAACCCAACTTTA

mRNA sequence

AGGAAAGGGGGCGCCTATGGATGCTGCTGCGTCTATATATTTAGCGTAGTGGAAGGGTTGAAAAACCAAAGCTCCTACCTTTGATCTCAACCAACTCAAACCCATCATCCTTTTCAAACTCCCACTGCCCACGCACCAAGCAGATCTTAATCACAAGGGTAGGGTTTTTTTTTGGGATATTATGAAGGAGGAAGGAGAGTGCTTTCAGGGGTTTCAAGAGCACCTCCTAATGCAACAATTCCAGACTATGCAACAAAACAGCGACGTTTTTGGAGTAGGAGGTGGAGGAGGCTTGATTTTTCCTGAAGTTTCTCCTATGATGTTTACGCCTCCGTGGACCGCGACCACAACCATCCCTCAAGTCCACCACGACCCCTTCATCCCGCCGCCGCCCCCACCTTCCTACGCCAGTTTCTTCAATAGGAGAAATGCTTCTCTGCAGTTCCCTTACGATATTGGGAACAATAGCCTTGGAATTGGGCAGATGGGTCAGCCTGGGTCTGGCCCCTTTGGGCTGCAGGCCGAGCTGAGTAAGATGAGCCCTCAAGAAATAATGGACGCCAAAGCTCTTGCTGCCTCTAAAAGCCATAGTGAAGCTGAGAGACGAAGAAGAGAGAGAATCAATAATCATCTTGCTAAGTTACGAAGCTTACTCCCAAGTACCACCAAAACGGACAAGGCATCCTTGCTTGCAGAGGTAATTCAACATGTAAAAGAGCTGAAACGCCAGACGACATTGATAGCTGATGTAAGTCCAGTTCCGACCGAGCTCGATGAGCTATCAGTTGACGTCGATGCATCGGACGAGGACGGTAAGTTCGTGATAAAAGCCTCACTTTGTTGTGAGGACAGGTCTGATCTTCTCCCACAGATCATTAAAACCCTGAAATCCCTCCGCTTGAGAACCCTAAAAGCGGAGATCACGACGCTGGGCGGCCGAGTAAAGAACGTGTTGTTTATCACCGGGGACGACGACTCCTCCAGCGACCAAGAGGAGCCTGAACAGCAGCACCACCAATACAGCATCAGTTCCATTCAGGAAGCCCTAAAGGGCGTGATGGAAAAGGTTTGTAGCGAAGACCAATCATCCTCAGGCAGCATCAAAAGGCAAAGGACCAATAATCACAACAACGTCAATATTTAGGAGGAGAGGTGGGTTTAAGTGTGACATAAAATTATGATATTTGTTAAAATAGTATGTTAAGTGTGTTAGCATTAAGAATGTGACATAAAATTATGATCTTTGTTTTGGTGTGGCTTTCGTATGTACAATTATTATTATTATGTGTACTAGAGAGCAGTGGATGAGAAGGGAATGTAATGATGTTTGTGATGGAAAAGGGTAAGGATCTTTTTCGTGGGGAGGAAACCAAAATATGAGGATCCAAGATGGAAATTTGTGATTTTGAGTATTAGAAGTGAAATGAATGTGTATTGGGGTTACAGAAGGATCTGAGTTTGGTCCCAAATTTGCAGAGAGAGTTTGGGTTTGTTTTGTAGTCAAGTATGTGGCTGGCAGAGAAGCTGCCGTCTCTTTACAGTTTTTATTGGTGGGAATTATGTCATGGATGAGACAATGTTAACGACATCTAGTTTTGTCATATGTTCACAGAATTCTAGTGGACCAACAACCCAACTTTA

Coding sequence (CDS)

ATGAAGGAGGAAGGAGAGTGCTTTCAGGGGTTTCAAGAGCACCTCCTAATGCAACAATTCCAGACTATGCAACAAAACAGCGACGTTTTTGGAGTAGGAGGTGGAGGAGGCTTGATTTTTCCTGAAGTTTCTCCTATGATGTTTACGCCTCCGTGGACCGCGACCACAACCATCCCTCAAGTCCACCACGACCCCTTCATCCCGCCGCCGCCCCCACCTTCCTACGCCAGTTTCTTCAATAGGAGAAATGCTTCTCTGCAGTTCCCTTACGATATTGGGAACAATAGCCTTGGAATTGGGCAGATGGGTCAGCCTGGGTCTGGCCCCTTTGGGCTGCAGGCCGAGCTGAGTAAGATGAGCCCTCAAGAAATAATGGACGCCAAAGCTCTTGCTGCCTCTAAAAGCCATAGTGAAGCTGAGAGACGAAGAAGAGAGAGAATCAATAATCATCTTGCTAAGTTACGAAGCTTACTCCCAAGTACCACCAAAACGGACAAGGCATCCTTGCTTGCAGAGGTAATTCAACATGTAAAAGAGCTGAAACGCCAGACGACATTGATAGCTGATGTAAGTCCAGTTCCGACCGAGCTCGATGAGCTATCAGTTGACGTCGATGCATCGGACGAGGACGGTAAGTTCGTGATAAAAGCCTCACTTTGTTGTGAGGACAGGTCTGATCTTCTCCCACAGATCATTAAAACCCTGAAATCCCTCCGCTTGAGAACCCTAAAAGCGGAGATCACGACGCTGGGCGGCCGAGTAAAGAACGTGTTGTTTATCACCGGGGACGACGACTCCTCCAGCGACCAAGAGGAGCCTGAACAGCAGCACCACCAATACAGCATCAGTTCCATTCAGGAAGCCCTAAAGGGCGTGATGGAAAAGGTTTGTAGCGAAGACCAATCATCCTCAGGCAGCATCAAAAGGCAAAGGACCAATAATCACAACAACGTCAATATTTAG
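
Because the CDS is a contiguous substring of the spliced mRNA, the untranslated regions can be recovered by locating the CDS inside the mRNA. The sketch below illustrates this with abbreviated fragments only (the strings are shortened for readability and are not the full sequences of this record); `split_utrs` is a hypothetical helper name.

```python
def split_utrs(mrna, cds):
    """Return (five_prime_utr, cds, three_prime_utr) for a CDS found inside an mRNA.

    Uses the first exact occurrence of the CDS in the mRNA and raises
    ValueError when the CDS is not a substring.
    """
    mrna, cds = mrna.upper(), cds.upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]

# Abbreviated, illustrative fragments (not the full sequences).
toy_cds = "ATGAAGGAGAATATTTAG"
toy_mrna = "GGGATATT" + toy_cds + "GAGGAGAGG"

five_utr, coding, three_utr = split_utrs(toy_mrna, toy_cds)
print(len(five_utr), len(coding), len(three_utr))
```

Applied to the full mRNA and CDS strings, the same call gives the 5' and 3' UTR sequences directly.
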

Protein sequence

MKEEGECFQGFQEHLLMQQFQTMQQNSDVFGVGGGGGLIFPEVSPMMFTPPWTATTTIPQVHHDPFIPPPPPPSYASFFNRRNASLQFPYDIGNNSLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTNNHNNVNI
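
The protein sequence follows from the CDS by translating it codon by codon. The following is a minimal sketch using the standard genetic code (an assumption; the record does not name a translation table, though the standard code is the usual choice for plant nuclear genes). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters outside A, C, G, T are reported as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 and last 27 nucleotides of the CDS listed above.
print(translate("ATGAAGGAGGAAGGAGAGTGCTTTCAGGGG"))
print(translate("AATAATCACAACAACGTCAATATTTAG"))
```

The two fragments translate to MKEEGECFQG and NNHNNVNI, which match the start and end of the protein sequence listed above.
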
BLAST of CmaCh04G020040 vs. Swiss-Prot
Match: BH030_ARATH (Transcription factor bHLH30 OS=Arabidopsis thaliana GN=BHLH30 PE=1 SV=1)

HSP 1 Score: 295.4 bits (755), Expect = 7.3e-79
Identity = 184/311 (59.16%), Positives = 221/311 (71.06%), Query Frame = 1

Query: 34  GGGGLIFPE--VSPMMFTPPWTATTTIPQVHHDPFIPPPPPPS--YASFFNRRNA---SL 93
           G G  IF +  VSP+   PP T+     Q   D F PP   P+  Y SFFNR  A    L
Sbjct: 58  GSGFTIFSQDSVSPIWSLPPPTSI----QPPFDQFPPPSSSPASFYGSFFNRSRAHHQGL 117

Query: 94  QFPYD---------------IGNNSLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALA 153
           QF Y+               +   S  +G + Q GSGPFGLQAEL KM+ QEIMDAKALA
Sbjct: 118 QFGYEGFGGATSAAHHHHEQLRILSEALGPVVQAGSGPFGLQAELGKMTAQEIMDAKALA 177

Query: 154 ASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVS 213
           ASKSHSEAERRRRERINNHLAKLRS+LP+TTKTDKASLLAEVIQHVKELKR+T++I++ +
Sbjct: 178 ASKSHSEAERRRRERINNHLAKLRSILPNTTKTDKASLLAEVIQHVKELKRETSVISETN 237

Query: 214 PVPTELDELSVDVDASDE--DGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITT 273
            VPTE DEL+V     +E  DG+FVIKASLCCEDRSDLLP +IKTLK++RL+TLKAEITT
Sbjct: 238 LVPTESDELTVAFTEEEETGDGRFVIKASLCCEDRSDLLPDMIKTLKAMRLKTLKAEITT 297

Query: 274 LGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKR 321
           +GGRVKNVLF+TG++ S  + EE      +Y I +I+EALK VMEK   E+ SSSG+ KR
Sbjct: 298 VGGRVKNVLFVTGEESSGEEVEE------EYCIGTIEEALKAVMEKSNVEESSSSGNAKR 357
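
The percentages in each HSP header are the match counts divided by the alignment length, for example 184/311 for identity. The sketch below parses such a header line and recomputes the percentages; the regular expression and the function names are illustrative assumptions, written to accept the label spelled either "Positives" or "Postives".

```python
import re

# Matches fragments such as "Identity = 184/311 (59.16%)".
HSP_STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {'identity': (matches, length, percent), 'positives': (...)}."""
    stats = {}
    for label, matches, length, percent in HSP_STAT_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matches), int(length), float(percent))
    return stats

def as_percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100 * matches / length, 2)

header = "Identity = 184/311 (59.16%), Positives = 221/311 (71.06%), Query Frame = 1"
stats = parse_hsp_stats(header)
for name, (matches, length, reported) in stats.items():
    print(name, reported, as_percent(matches, length))
```

Recomputing from the counts is a quick consistency check when the percentages are copied into a summary table.
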

BLAST of CmaCh04G020040 vs. Swiss-Prot
Match: BH032_ARATH (Transcription factor AIG1 OS=Arabidopsis thaliana GN=BHLH32 PE=1 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 3.5e-57
Identity = 144/281 (51.25%), Positives = 184/281 (65.48%), Query Frame = 1

Query: 62  HHDPFIPPPPPPSYASFFNRRNASLQFPYD-IGNNSLGIGQMGQPGSGPFGLQAEL-SKM 121
           ++D F+ PPP             S+  P + +   S  +G + + GS  FG   E+  K+
Sbjct: 74  YNDGFVSPPP-------------SMDHPQNHLRILSEALGPIMRRGSS-FGFDGEIMGKL 133

Query: 122 SPQEIMDAKALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKE 181
           S QE+MDAKALAASKSHSEAERRRRERIN HLAKLRS+LP+TTKTDKASLLAEVIQH+KE
Sbjct: 134 SAQEVMDAKALAASKSHSEAERRRRERINTHLAKLRSILPNTTKTDKASLLAEVIQHMKE 193

Query: 182 LKRQTTLIADVSPVPTELDELSVDVDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLR 241
           LKRQT+ I D   VPTE D+L+VD   +DE+G  VI+AS CC+DR+DL+  +I  LKSLR
Sbjct: 194 LKRQTSQITDTYQVPTECDDLTVDSSYNDEEGNLVIRASFCCQDRTDLMHDVINALKSLR 253

Query: 242 LRTLKAEITTLGGRVKNVLFITGDDDSSSD-------------QEEPEQQHHQYSISSIQ 301
           LRTLKAEI T+GGRVKN+LF++ + D   D             ++  E++     +SSI+
Sbjct: 254 LRTLKAEIATVGGRVKNILFLSREYDDEEDHDSYRRNFDGDDVEDYDEERMMNNRVSSIE 313

Query: 302 EALKGVMEKVCSE----------DQSSSGSIKRQRTNNHNN 318
           EALK V+EK              ++SSSG IKRQRT+   N
Sbjct: 314 EALKAVIEKCVHNNDESNDNNNLEKSSSGGIKRQRTSKMVN 340

BLAST of CmaCh04G020040 vs. Swiss-Prot
Match: BH106_ARATH (Transcription factor bHLH106 OS=Arabidopsis thaliana GN=BHLH106 PE=2 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 1.6e-33
Identity = 91/197 (46.19%), Positives = 131/197 (66.50%), Query Frame = 1

Query: 128 KALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT--T 187
           +ALAA ++H EAERRRRERIN+HL KLR++L   +KTDKA+LLA+V+Q V+ELK+QT  T
Sbjct: 63  RALAALRNHKEAERRRRERINSHLNKLRNVLSCNSKTDKATLLAKVVQRVRELKQQTLET 122

Query: 188 LIADVSPVPTELDELSV-DVDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLK 247
             +D + +P+E DE+SV        DG  + KASLCCEDRSDLLP +++ LKSL ++TL+
Sbjct: 123 SDSDQTLLPSETDEISVLHFGDYSNDGHIIFKASLCCEDRSDLLPDLMEILKSLNMKTLR 182

Query: 248 AEITTLGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSE--DQS 307
           AE+ T+GGR ++VL +  D          ++ H   S+  +Q ALK ++E+      ++S
Sbjct: 183 AEMVTIGGRTRSVLVVAAD----------KEMHGVESVHFLQNALKSLLERSSKSLMERS 242

Query: 308 SSGS----IKRQRTNNH 316
           S G      KR+R  +H
Sbjct: 243 SGGGGGERSKRRRALDH 249

BLAST of CmaCh04G020040 vs. Swiss-Prot
Match: BH107_ARATH (Putative transcription factor bHLH107 OS=Arabidopsis thaliana GN=BHLH107 PE=2 SV=1)

HSP 1 Score: 137.1 bits (344), Expect = 3.3e-31
Identity = 84/201 (41.79%), Positives = 127/201 (63.18%), Query Frame = 1

Query: 124 IMDAKALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQ 183
           + + KALA+ ++H EAER+RR RIN+HL KLR LL   +KTDK++LLA+V+Q VKELK+Q
Sbjct: 37  VYEDKALASLRNHKEAERKRRARINSHLNKLRKLLSCNSKTDKSTLLAKVVQRVKELKQQ 96

Query: 184 TTLIADVSPVPTELDELSV---DVDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRL 243
           T  I D   +P+E DE+SV   +  +  +D + + K S CCEDR +LL  +++TLKSL++
Sbjct: 97  TLEITD-ETIPSETDEISVLNIEDCSRGDDRRIIFKVSFCCEDRPELLKDLMETLKSLQM 156

Query: 244 RTLKAEITTLGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSED 303
            TL A++TT+GGR +NVL +  D          ++ H   S++ +Q ALK ++E+     
Sbjct: 157 ETLFADMTTVGGRTRNVLVVAAD----------KEHHGVQSVNFLQNALKSLLERSSKSV 216

Query: 304 QSSSGS------IKRQRTNNH 316
               G       +KR+R  +H
Sbjct: 217 MVGHGGGGGEERLKRRRALDH 226

BLAST of CmaCh04G020040 vs. Swiss-Prot
Match: BH051_ARATH (Transcription factor bHLH51 OS=Arabidopsis thaliana GN=BHLH51 PE=2 SV=1)

HSP 1 Score: 105.1 bits (261), Expect = 1.4e-21
Identity = 71/192 (36.98%), Positives = 113/192 (58.85%), Query Frame = 1

Query: 128 KALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLI 187
           KA + S+SH  AE+RRR+RIN+HL  LR L+P++ K DKA+LLA VI+ VKELK++    
Sbjct: 59  KAESLSRSHRLAEKRRRDRINSHLTALRKLVPNSDKLDKAALLATVIEQVKELKQKAAES 118

Query: 188 ADVSPVPTELDELSVD----VDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTL 247
                +PTE DE++V      D        + KAS CCED+ + + +II+ L  L+L T+
Sbjct: 119 PIFQDLPTEADEVTVQPETISDFESNTNTIIFKASFCCEDQPEAISEIIRVLTKLQLETI 178

Query: 248 KAEITTLGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSS 307
           +AEI ++GGR++ + FI  D + +      E  +   S  +++++L   + ++ S   ++
Sbjct: 179 QAEIISVGGRMR-INFILKDSNCN------ETTNIAASAKALKQSLCSALNRITSSSTTT 238

Query: 308 SG----SIKRQR 312
           S       KRQR
Sbjct: 239 SSVCRIRSKRQR 243

BLAST of CmaCh04G020040 vs. TrEMBL
Match: F6HF16_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02940 PE=4 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 3.2e-105
Identity = 232/343 (67.64%), Positives = 266/343 (77.55%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTMQQ-NSDVFGVGGG-GGLIFPEVSPMMFTPPWT 61
           +++GEC       QG+QE LL+QQ   MQQ N+D +G GGG  GLIFPEVSP++   PW+
Sbjct: 7   EDQGECSQTIHNIQGYQEQLLLQQHHQMQQQNNDAYGGGGGRSGLIFPEVSPIL--QPWS 66

Query: 62  ATTTIPQVH--------------HDPFIPPPPPPSYASFFNRRNASLQFPYDIGNN---- 121
                P VH              HDPF+ PPPP +Y S FNRR  +LQF Y+  ++    
Sbjct: 67  ----FPPVHAFNPAHFAANPVRDHDPFLVPPPPSAYGSVFNRRAPALQFAYEGPSSEHLR 126

Query: 122 --SLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINNHLAK 181
             S  +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRRRERINNHLAK
Sbjct: 127 IISDTLGPVVQPGSSPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRRRERINNHLAK 186

Query: 182 LRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDEDGKF 241
           LRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  VD SDEDGKF
Sbjct: 187 LRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAESSPVPTEMDELT--VDTSDEDGKF 246

Query: 242 VIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSS---DQ 301
           VIKASLCCEDR+DLLP +IKTLK+LRLRTLKAEITTLGGRVKNVLFITG++DSSS   +Q
Sbjct: 247 VIKASLCCEDRTDLLPDLIKTLKALRLRTLKAEITTLGGRVKNVLFITGEEDSSSSGENQ 306

Query: 302 EEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           ++ +QQ  QYSISSIQEALK VMEK    D+SSSGS+KRQRTN
Sbjct: 307 QQQQQQQQQYSISSIQEALKAVMEKT-GGDESSSGSVKRQRTN 340

BLAST of CmaCh04G020040 vs. TrEMBL
Match: A0A061EB14_THECC (Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao GN=TCM_011873 PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 7.1e-105
Identity = 231/344 (67.15%), Positives = 266/344 (77.33%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTMQQ------NSDVFGVGGGGGLIFPEVSPMMFT 61
           +++GEC       QG+QE LL+QQ Q MQQ      N+D+FG G  GGLIFPEVSP++  
Sbjct: 7   EDQGECSQAIHNIQGYQEQLLIQQHQQMQQHHHQQQNNDLFG-GTRGGLIFPEVSPIL-- 66

Query: 62  PPWTATTTIPQVH--------------HDPFIPPPPPPSYASFFNRRNASLQFPYDIGNN 121
            PW+    +P VH              HDPF+ PPPP SY + FNRR  +LQF YD  + 
Sbjct: 67  -PWS----LPPVHSFNPAHFNGNQVRDHDPFLVPPPPSSYGALFNRRAPALQFAYDGPST 126

Query: 122 ------SLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINN 181
                 S  +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRRRERINN
Sbjct: 127 DHLRILSDTLGPVVQPGSAPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRRRERINN 186

Query: 182 HLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDE 241
           HLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  VD SDE
Sbjct: 187 HLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAETSPVPTEIDELT--VDTSDE 246

Query: 242 DGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSSD 301
           DGKF+IKASLCCEDRSDLLP +IKTLK+LRL+TLKAEITTLGGRVKNVLFITG++DSSS 
Sbjct: 247 DGKFLIKASLCCEDRSDLLPDLIKTLKALRLKTLKAEITTLGGRVKNVLFITGEEDSSSS 306

Query: 302 QEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
            ++ +QQ  QYS+SSIQEALK VMEK  S D+SS+GS+KRQRTN
Sbjct: 307 GDQ-QQQQQQYSVSSIQEALKAVMEKT-SGDESSAGSVKRQRTN 338

BLAST of CmaCh04G020040 vs. TrEMBL
Match: B9SEI3_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_0705340 PE=4 SV=1)

HSP 1 Score: 387.1 bits (993), Expect = 2.1e-104
Identity = 239/351 (68.09%), Positives = 268/351 (76.35%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTM--QQNSDVFGV--GGGGGLIFPEVSPMMFTPP 61
           +++GEC       QG+QE LL+QQ Q    QQ SD++G   GG  GLIFPEVSP++   P
Sbjct: 7   EDQGECSQTIHNLQGYQEQLLLQQMQHQHQQQTSDMYGGARGGSSGLIFPEVSPIL---P 66

Query: 62  WT---------ATTTIP--QVHH--------DPF-IPPPPPPSYASFFNRRNASLQFPYD 121
           W          A T  P  QVHH        DPF IPPP P SY + FNRR  +LQF YD
Sbjct: 67  WPLPPVHSFNPAMTQFPSNQVHHHHHHRDHHDPFLIPPPVPSSYGNLFNRRAPALQFAYD 126

Query: 122 IGNNSLG--------IGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRR 181
            G++S          +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRR
Sbjct: 127 -GSSSHDHLRIITDTLGPVVQPGSAPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRR 186

Query: 182 RERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVD 241
           RERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  
Sbjct: 187 RERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAETSPVPTEIDELT-- 246

Query: 242 VDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGD 301
           VDASDEDGKF+IKASLCCEDRSDLLP +IKTLK+LRLRTLKAEITTLGGRVKNVLFITG+
Sbjct: 247 VDASDEDGKFIIKASLCCEDRSDLLPDLIKTLKALRLRTLKAEITTLGGRVKNVLFITGE 306

Query: 302 DDSSSD-QEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           +DSSS+  EE +QQ  QYSISSIQEALK VMEK    D+SSSGS+KRQRTN
Sbjct: 307 EDSSSNSNEEDQQQQPQYSISSIQEALKAVMEK-SGGDESSSGSVKRQRTN 350

BLAST of CmaCh04G020040 vs. TrEMBL
Match: B9HWM2_POPTR (Basic helix-loop-helix family protein OS=Populus trichocarpa GN=POPTR_0010s14010g PE=4 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 1.1e-102
Identity = 230/340 (67.65%), Positives = 256/340 (75.29%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTMQQN-----SDVFGVGGGGGLIFPEVSPMMFTP 61
           +++GEC       Q +QE LL+Q  Q MQQ+     SD++G   G G IFPEVSP++  P
Sbjct: 7   EDQGECSQTIHNLQNYQEQLLLQYHQQMQQHQQQQSSDIYGGARGSGFIFPEVSPILPWP 66

Query: 62  --------PWTATTTIPQVHHDPF-IPPPPPPSYASFFNRRNASLQFPYD------IGNN 121
                   P   T   P   HDPF IPPP P SY   FNRR  SLQF YD      +   
Sbjct: 67  LPPVHSFNPAHFTPNHPVRDHDPFLIPPPVPSSYGGLFNRRAPSLQFAYDGTPSDHLRII 126

Query: 122 SLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINNHLAKLR 181
           S  +G + QPGS PFGLQAELSKM+ QEIMDAKALAASKSHSEAERRRRERINNHLAKLR
Sbjct: 127 SDTLGPVVQPGSAPFGLQAELSKMTAQEIMDAKALAASKSHSEAERRRRERINNHLAKLR 186

Query: 182 SLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDEDGKFVI 241
           SLLPSTTKTDKASLLAEVIQHVKELKRQTTLIA+ SPVPTE+DEL+  VD +DEDGKFVI
Sbjct: 187 SLLPSTTKTDKASLLAEVIQHVKELKRQTTLIAETSPVPTEMDELT--VDTADEDGKFVI 246

Query: 242 KASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSSDQEEPEQ 301
           KASLCCEDR DLLP +IKTLK+LRLRTLKAEITTLGGRVKNVLFI+G++DSSSD  +  Q
Sbjct: 247 KASLCCEDRPDLLPDLIKTLKALRLRTLKAEITTLGGRVKNVLFISGEEDSSSDSNDQHQ 306

Query: 302 QHH--QYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           Q    QYSISSIQEALK VMEK    D+SSSGS+KRQRTN
Sbjct: 307 QQEPLQYSISSIQEALKAVMEKT-GGDESSSGSVKRQRTN 343

BLAST of CmaCh04G020040 vs. TrEMBL
Match: A0A0B0N1U5_GOSAR (Transcription factor bHLH30-like protein OS=Gossypium arboreum GN=F383_03211 PE=4 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 2.5e-102
Identity = 229/342 (66.96%), Positives = 260/342 (76.02%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQ----FQTMQQNSDVFGVGGGGGLIFPEVSPMMFTPP 61
           +++GEC       QG+QE L MQQ     Q  Q N D+FG G  GGLIFPEVSP++   P
Sbjct: 7   EDQGECSQTIHNIQGYQEQLYMQQQHQQMQQQQHNIDLFG-GTRGGLIFPEVSPIL---P 66

Query: 62  WTATTTIPQVHH--------------DPFIPPPPPPSYASFFNRRNASLQFPYD------ 121
           W+    +P +H+              DPF+ PPPP SY   FNRR  SLQF YD      
Sbjct: 67  WS----LPPIHNFNPALFTGNPVRDDDPFLVPPPPTSYGGLFNRRAPSLQFAYDGTSADH 126

Query: 122 IGNNSLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINNHL 181
           +   S  +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRRRERINNHL
Sbjct: 127 LRILSDTLGPVVQPGSAPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRRRERINNHL 186

Query: 182 AKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDEDG 241
           AKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  VD S+EDG
Sbjct: 187 AKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAETSPVPTEIDELT--VDTSEEDG 246

Query: 242 KFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSSDQE 301
           KFVIKASLCCEDRSDLLP +IKTLK+LRL+TLKAEITTLGGRVKNVLFITG++DSSS  E
Sbjct: 247 KFVIKASLCCEDRSDLLPDLIKTLKALRLKTLKAEITTLGGRVKNVLFITGEEDSSSSAE 306

Query: 302 EPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           + +QQ  QY ISSIQEALK VMEK  S D+SSSG++KRQRTN
Sbjct: 307 Q-QQQQLQYCISSIQEALKAVMEKT-SVDESSSGNVKRQRTN 336

BLAST of CmaCh04G020040 vs. TAIR10
Match: AT1G68810.1 (AT1G68810.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 295.4 bits (755), Expect = 4.1e-80
Identity = 184/311 (59.16%), Positives = 221/311 (71.06%), Query Frame = 1

Query: 34  GGGGLIFPE--VSPMMFTPPWTATTTIPQVHHDPFIPPPPPPS--YASFFNRRNA---SL 93
           G G  IF +  VSP+   PP T+     Q   D F PP   P+  Y SFFNR  A    L
Sbjct: 58  GSGFTIFSQDSVSPIWSLPPPTSI----QPPFDQFPPPSSSPASFYGSFFNRSRAHHQGL 117

Query: 94  QFPYD---------------IGNNSLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALA 153
           QF Y+               +   S  +G + Q GSGPFGLQAEL KM+ QEIMDAKALA
Sbjct: 118 QFGYEGFGGATSAAHHHHEQLRILSEALGPVVQAGSGPFGLQAELGKMTAQEIMDAKALA 177

Query: 154 ASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVS 213
           ASKSHSEAERRRRERINNHLAKLRS+LP+TTKTDKASLLAEVIQHVKELKR+T++I++ +
Sbjct: 178 ASKSHSEAERRRRERINNHLAKLRSILPNTTKTDKASLLAEVIQHVKELKRETSVISETN 237

Query: 214 PVPTELDELSVDVDASDE--DGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITT 273
            VPTE DEL+V     +E  DG+FVIKASLCCEDRSDLLP +IKTLK++RL+TLKAEITT
Sbjct: 238 LVPTESDELTVAFTEEEETGDGRFVIKASLCCEDRSDLLPDMIKTLKAMRLKTLKAEITT 297

Query: 274 LGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKR 321
           +GGRVKNVLF+TG++ S  + EE      +Y I +I+EALK VMEK   E+ SSSG+ KR
Sbjct: 298 VGGRVKNVLFVTGEESSGEEVEE------EYCIGTIEEALKAVMEKSNVEESSSSGNAKR 357

BLAST of CmaCh04G020040 vs. TAIR10
Match: AT3G25710.1 (AT3G25710.1 basic helix-loop-helix 32)

HSP 1 Score: 223.4 bits (568), Expect = 2.0e-58
Identity = 144/281 (51.25%), Positives = 184/281 (65.48%), Query Frame = 1

Query: 62  HHDPFIPPPPPPSYASFFNRRNASLQFPYD-IGNNSLGIGQMGQPGSGPFGLQAEL-SKM 121
           ++D F+ PPP             S+  P + +   S  +G + + GS  FG   E+  K+
Sbjct: 74  YNDGFVSPPP-------------SMDHPQNHLRILSEALGPIMRRGSS-FGFDGEIMGKL 133

Query: 122 SPQEIMDAKALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKE 181
           S QE+MDAKALAASKSHSEAERRRRERIN HLAKLRS+LP+TTKTDKASLLAEVIQH+KE
Sbjct: 134 SAQEVMDAKALAASKSHSEAERRRRERINTHLAKLRSILPNTTKTDKASLLAEVIQHMKE 193

Query: 182 LKRQTTLIADVSPVPTELDELSVDVDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLR 241
           LKRQT+ I D   VPTE D+L+VD   +DE+G  VI+AS CC+DR+DL+  +I  LKSLR
Sbjct: 194 LKRQTSQITDTYQVPTECDDLTVDSSYNDEEGNLVIRASFCCQDRTDLMHDVINALKSLR 253

Query: 242 LRTLKAEITTLGGRVKNVLFITGDDDSSSD-------------QEEPEQQHHQYSISSIQ 301
           LRTLKAEI T+GGRVKN+LF++ + D   D             ++  E++     +SSI+
Sbjct: 254 LRTLKAEIATVGGRVKNILFLSREYDDEEDHDSYRRNFDGDDVEDYDEERMMNNRVSSIE 313

Query: 302 EALKGVMEKVCSE----------DQSSSGSIKRQRTNNHNN 318
           EALK V+EK              ++SSSG IKRQRT+   N
Sbjct: 314 EALKAVIEKCVHNNDESNDNNNLEKSSSGGIKRQRTSKMVN 340

BLAST of CmaCh04G020040 vs. TAIR10
Match: AT2G41130.1 (AT2G41130.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 144.8 bits (364), Expect = 9.0e-35
Identity = 91/197 (46.19%), Positives = 131/197 (66.50%), Query Frame = 1

Query: 128 KALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT--T 187
           +ALAA ++H EAERRRRERIN+HL KLR++L   +KTDKA+LLA+V+Q V+ELK+QT  T
Sbjct: 63  RALAALRNHKEAERRRRERINSHLNKLRNVLSCNSKTDKATLLAKVVQRVRELKQQTLET 122

Query: 188 LIADVSPVPTELDELSV-DVDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLK 247
             +D + +P+E DE+SV        DG  + KASLCCEDRSDLLP +++ LKSL ++TL+
Sbjct: 123 SDSDQTLLPSETDEISVLHFGDYSNDGHIIFKASLCCEDRSDLLPDLMEILKSLNMKTLR 182

Query: 248 AEITTLGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSE--DQS 307
           AE+ T+GGR ++VL +  D          ++ H   S+  +Q ALK ++E+      ++S
Sbjct: 183 AEMVTIGGRTRSVLVVAAD----------KEMHGVESVHFLQNALKSLLERSSKSLMERS 242

Query: 308 SSGS----IKRQRTNNH 316
           S G      KR+R  +H
Sbjct: 243 SGGGGGERSKRRRALDH 249

BLAST of CmaCh04G020040 vs. TAIR10
Match: AT3G56770.1 (AT3G56770.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 137.1 bits (344), Expect = 1.9e-32
Identity = 84/201 (41.79%), Positives = 127/201 (63.18%), Query Frame = 1

Query: 124 IMDAKALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQ 183
           + + KALA+ ++H EAER+RR RIN+HL KLR LL   +KTDK++LLA+V+Q VKELK+Q
Sbjct: 37  VYEDKALASLRNHKEAERKRRARINSHLNKLRKLLSCNSKTDKSTLLAKVVQRVKELKQQ 96

Query: 184 TTLIADVSPVPTELDELSV---DVDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRL 243
           T  I D   +P+E DE+SV   +  +  +D + + K S CCEDR +LL  +++TLKSL++
Sbjct: 97  TLEITD-ETIPSETDEISVLNIEDCSRGDDRRIIFKVSFCCEDRPELLKDLMETLKSLQM 156

Query: 244 RTLKAEITTLGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSED 303
            TL A++TT+GGR +NVL +  D          ++ H   S++ +Q ALK ++E+     
Sbjct: 157 ETLFADMTTVGGRTRNVLVVAAD----------KEHHGVQSVNFLQNALKSLLERSSKSV 216

Query: 304 QSSSGS------IKRQRTNNH 316
               G       +KR+R  +H
Sbjct: 217 MVGHGGGGGEERLKRRRALDH 226

BLAST of CmaCh04G020040 vs. TAIR10
Match: AT2G40200.1 (AT2G40200.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 105.1 bits (261), Expect = 7.9e-23
Identity = 71/192 (36.98%), Positives = 113/192 (58.85%), Query Frame = 1

Query: 128 KALAASKSHSEAERRRRERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLI 187
           KA + S+SH  AE+RRR+RIN+HL  LR L+P++ K DKA+LLA VI+ VKELK++    
Sbjct: 59  KAESLSRSHRLAEKRRRDRINSHLTALRKLVPNSDKLDKAALLATVIEQVKELKQKAAES 118

Query: 188 ADVSPVPTELDELSVD----VDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTL 247
                +PTE DE++V      D        + KAS CCED+ + + +II+ L  L+L T+
Sbjct: 119 PIFQDLPTEADEVTVQPETISDFESNTNTIIFKASFCCEDQPEAISEIIRVLTKLQLETI 178

Query: 248 KAEITTLGGRVKNVLFITGDDDSSSDQEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSS 307
           +AEI ++GGR++ + FI  D + +      E  +   S  +++++L   + ++ S   ++
Sbjct: 179 QAEIISVGGRMR-INFILKDSNCN------ETTNIAASAKALKQSLCSALNRITSSSTTT 238

Query: 308 SG----SIKRQR 312
           S       KRQR
Sbjct: 239 SSVCRIRSKRQR 243

BLAST of CmaCh04G020040 vs. NCBI nr
Match: gi|225423869|ref|XP_002278697.1| (PREDICTED: transcription factor bHLH30 [Vitis vinifera])

HSP 1 Score: 389.8 bits (1000), Expect = 4.6e-105
Identity = 232/343 (67.64%), Positives = 266/343 (77.55%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTMQQ-NSDVFGVGGG-GGLIFPEVSPMMFTPPWT 61
           +++GEC       QG+QE LL+QQ   MQQ N+D +G GGG  GLIFPEVSP++   PW+
Sbjct: 7   EDQGECSQTIHNIQGYQEQLLLQQHHQMQQQNNDAYGGGGGRSGLIFPEVSPIL--QPWS 66

Query: 62  ATTTIPQVH--------------HDPFIPPPPPPSYASFFNRRNASLQFPYDIGNN---- 121
                P VH              HDPF+ PPPP +Y S FNRR  +LQF Y+  ++    
Sbjct: 67  ----FPPVHAFNPAHFAANPVRDHDPFLVPPPPSAYGSVFNRRAPALQFAYEGPSSEHLR 126

Query: 122 --SLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINNHLAK 181
             S  +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRRRERINNHLAK
Sbjct: 127 IISDTLGPVVQPGSSPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRRRERINNHLAK 186

Query: 182 LRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDEDGKF 241
           LRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  VD SDEDGKF
Sbjct: 187 LRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAESSPVPTEMDELT--VDTSDEDGKF 246

Query: 242 VIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSS---DQ 301
           VIKASLCCEDR+DLLP +IKTLK+LRLRTLKAEITTLGGRVKNVLFITG++DSSS   +Q
Sbjct: 247 VIKASLCCEDRTDLLPDLIKTLKALRLRTLKAEITTLGGRVKNVLFITGEEDSSSSGENQ 306

Query: 302 EEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           ++ +QQ  QYSISSIQEALK VMEK    D+SSSGS+KRQRTN
Sbjct: 307 QQQQQQQQQYSISSIQEALKAVMEKT-GGDESSSGSVKRQRTN 340

BLAST of CmaCh04G020040 vs. NCBI nr
Match: gi|590701106|ref|XP_007046318.1| (Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao])

HSP 1 Score: 388.7 bits (997), Expect = 1.0e-104
Identity = 231/344 (67.15%), Positives = 266/344 (77.33%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTMQQ------NSDVFGVGGGGGLIFPEVSPMMFT 61
           +++GEC       QG+QE LL+QQ Q MQQ      N+D+FG G  GGLIFPEVSP++  
Sbjct: 7   EDQGECSQAIHNIQGYQEQLLIQQHQQMQQHHHQQQNNDLFG-GTRGGLIFPEVSPIL-- 66

Query: 62  PPWTATTTIPQVH--------------HDPFIPPPPPPSYASFFNRRNASLQFPYDIGNN 121
            PW+    +P VH              HDPF+ PPPP SY + FNRR  +LQF YD  + 
Sbjct: 67  -PWS----LPPVHSFNPAHFNGNQVRDHDPFLVPPPPSSYGALFNRRAPALQFAYDGPST 126

Query: 122 ------SLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINN 181
                 S  +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRRRERINN
Sbjct: 127 DHLRILSDTLGPVVQPGSAPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRRRERINN 186

Query: 182 HLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDE 241
           HLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  VD SDE
Sbjct: 187 HLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAETSPVPTEIDELT--VDTSDE 246

Query: 242 DGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSSD 301
           DGKF+IKASLCCEDRSDLLP +IKTLK+LRL+TLKAEITTLGGRVKNVLFITG++DSSS 
Sbjct: 247 DGKFLIKASLCCEDRSDLLPDLIKTLKALRLKTLKAEITTLGGRVKNVLFITGEEDSSSS 306

Query: 302 QEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
            ++ +QQ  QYS+SSIQEALK VMEK  S D+SS+GS+KRQRTN
Sbjct: 307 GDQ-QQQQQQYSVSSIQEALKAVMEKT-SGDESSAGSVKRQRTN 338

BLAST of CmaCh04G020040 vs. NCBI nr
Match: gi|255566837|ref|XP_002524402.1| (PREDICTED: transcription factor bHLH30 [Ricinus communis])

HSP 1 Score: 387.1 bits (993), Expect = 3.0e-104
Identity = 239/351 (68.09%), Positives = 268/351 (76.35%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTM--QQNSDVFGV--GGGGGLIFPEVSPMMFTPP 61
           +++GEC       QG+QE LL+QQ Q    QQ SD++G   GG  GLIFPEVSP++   P
Sbjct: 7   EDQGECSQTIHNLQGYQEQLLLQQMQHQHQQQTSDMYGGARGGSSGLIFPEVSPIL---P 66

Query: 62  WT---------ATTTIP--QVHH--------DPF-IPPPPPPSYASFFNRRNASLQFPYD 121
           W          A T  P  QVHH        DPF IPPP P SY + FNRR  +LQF YD
Sbjct: 67  WPLPPVHSFNPAMTQFPSNQVHHHHHHRDHHDPFLIPPPVPSSYGNLFNRRAPALQFAYD 126

Query: 122 IGNNSLG--------IGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRR 181
            G++S          +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRR
Sbjct: 127 -GSSSHDHLRIITDTLGPVVQPGSAPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRR 186

Query: 182 RERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVD 241
           RERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  
Sbjct: 187 RERINNHLAKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAETSPVPTEIDELT-- 246

Query: 242 VDASDEDGKFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGD 301
           VDASDEDGKF+IKASLCCEDRSDLLP +IKTLK+LRLRTLKAEITTLGGRVKNVLFITG+
Sbjct: 247 VDASDEDGKFIIKASLCCEDRSDLLPDLIKTLKALRLRTLKAEITTLGGRVKNVLFITGE 306

Query: 302 DDSSSD-QEEPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           +DSSS+  EE +QQ  QYSISSIQEALK VMEK    D+SSSGS+KRQRTN
Sbjct: 307 EDSSSNSNEEDQQQQPQYSISSIQEALKAVMEK-SGGDESSSGSVKRQRTN 350

BLAST of CmaCh04G020040 vs. NCBI nr
Match: gi|224111740|ref|XP_002315961.1| (basic helix-loop-helix family protein [Populus trichocarpa])

HSP 1 Score: 381.3 bits (978), Expect = 1.6e-102
Identity = 230/340 (67.65%), Positives = 256/340 (75.29%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQFQTMQQN-----SDVFGVGGGGGLIFPEVSPMMFTP 61
           +++GEC       Q +QE LL+Q  Q MQQ+     SD++G   G G IFPEVSP++  P
Sbjct: 7   EDQGECSQTIHNLQNYQEQLLLQYHQQMQQHQQQQSSDIYGGARGSGFIFPEVSPILPWP 66

Query: 62  --------PWTATTTIPQVHHDPF-IPPPPPPSYASFFNRRNASLQFPYD------IGNN 121
                   P   T   P   HDPF IPPP P SY   FNRR  SLQF YD      +   
Sbjct: 67  LPPVHSFNPAHFTPNHPVRDHDPFLIPPPVPSSYGGLFNRRAPSLQFAYDGTPSDHLRII 126

Query: 122 SLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINNHLAKLR 181
           S  +G + QPGS PFGLQAELSKM+ QEIMDAKALAASKSHSEAERRRRERINNHLAKLR
Sbjct: 127 SDTLGPVVQPGSAPFGLQAELSKMTAQEIMDAKALAASKSHSEAERRRRERINNHLAKLR 186

Query: 182 SLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDEDGKFVI 241
           SLLPSTTKTDKASLLAEVIQHVKELKRQTTLIA+ SPVPTE+DEL+  VD +DEDGKFVI
Sbjct: 187 SLLPSTTKTDKASLLAEVIQHVKELKRQTTLIAETSPVPTEMDELT--VDTADEDGKFVI 246

Query: 242 KASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSSDQEEPEQ 301
           KASLCCEDR DLLP +IKTLK+LRLRTLKAEITTLGGRVKNVLFI+G++DSSSD  +  Q
Sbjct: 247 KASLCCEDRPDLLPDLIKTLKALRLRTLKAEITTLGGRVKNVLFISGEEDSSSDSNDQHQ 306

Query: 302 QHH--QYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           Q    QYSISSIQEALK VMEK    D+SSSGS+KRQRTN
Sbjct: 307 QQEPLQYSISSIQEALKAVMEKT-GGDESSSGSVKRQRTN 343

BLAST of CmaCh04G020040 vs. NCBI nr
Match: gi|728826119|gb|KHG06567.1| (Transcription factor bHLH30-like protein [Gossypium arboreum])

HSP 1 Score: 380.2 bits (975), Expect = 3.6e-102
Identity = 229/342 (66.96%), Positives = 260/342 (76.02%), Query Frame = 1

Query: 2   KEEGEC------FQGFQEHLLMQQ----FQTMQQNSDVFGVGGGGGLIFPEVSPMMFTPP 61
           +++GEC       QG+QE L MQQ     Q  Q N D+FG G  GGLIFPEVSP++   P
Sbjct: 7   EDQGECSQTIHNIQGYQEQLYMQQQHQQMQQQQHNIDLFG-GTRGGLIFPEVSPIL---P 66

Query: 62  WTATTTIPQVHH--------------DPFIPPPPPPSYASFFNRRNASLQFPYD------ 121
           W+    +P +H+              DPF+ PPPP SY   FNRR  SLQF YD      
Sbjct: 67  WS----LPPIHNFNPALFTGNPVRDDDPFLVPPPPTSYGGLFNRRAPSLQFAYDGTSADH 126

Query: 122 IGNNSLGIGQMGQPGSGPFGLQAELSKMSPQEIMDAKALAASKSHSEAERRRRERINNHL 181
           +   S  +G + QPGS PFGLQAEL KM+ QEIMDAKALAASKSHSEAERRRRERINNHL
Sbjct: 127 LRILSDTLGPVVQPGSAPFGLQAELGKMTAQEIMDAKALAASKSHSEAERRRRERINNHL 186

Query: 182 AKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTTLIADVSPVPTELDELSVDVDASDEDG 241
           AKLRSLLPSTTKTDKASLLAEVIQHVKELKRQT+LIA+ SPVPTE+DEL+  VD S+EDG
Sbjct: 187 AKLRSLLPSTTKTDKASLLAEVIQHVKELKRQTSLIAETSPVPTEIDELT--VDTSEEDG 246

Query: 242 KFVIKASLCCEDRSDLLPQIIKTLKSLRLRTLKAEITTLGGRVKNVLFITGDDDSSSDQE 301
           KFVIKASLCCEDRSDLLP +IKTLK+LRL+TLKAEITTLGGRVKNVLFITG++DSSS  E
Sbjct: 247 KFVIKASLCCEDRSDLLPDLIKTLKALRLKTLKAEITTLGGRVKNVLFITGEEDSSSSAE 306

Query: 302 EPEQQHHQYSISSIQEALKGVMEKVCSEDQSSSGSIKRQRTN 314
           + +QQ  QY ISSIQEALK VMEK  S D+SSSG++KRQRTN
Sbjct: 307 Q-QQQQLQYCISSIQEALKAVMEKT-SVDESSSGNVKRQRTN 336
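
The identity and positives percentages reported for the HSP above are the matched-column counts divided by the alignment length. A minimal Python sketch of that arithmetic follows; the helper name `percent` is hypothetical and is not part of any BLAST tool, and the counts (229, 260, 342) are copied from the HSP statistics line.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP statistics line (229/342 identical, 260/342 positive).
identity = percent(229, 342)
positives = percent(260, 342)

print(f"Identity = {identity}%, Positives = {positives}%")
```

The alignment length (342) counts alignment columns, including gap positions, so it can exceed the length of either sequence.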

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
BH030_ARATH  7.3e-79   59.16         Transcription factor bHLH30 OS=Arabidopsis thaliana GN=BHLH30 PE=1 SV=1
BH032_ARATH  3.5e-57   51.25         Transcription factor AIG1 OS=Arabidopsis thaliana GN=BHLH32 PE=1 SV=1
BH106_ARATH  1.6e-33   46.19         Transcription factor bHLH106 OS=Arabidopsis thaliana GN=BHLH106 PE=2 SV=1
BH107_ARATH  3.3e-31   41.79         Putative transcription factor bHLH107 OS=Arabidopsis thaliana GN=BHLH107 PE=2 SV...
BH051_ARATH  1.4e-21   36.98         Transcription factor bHLH51 OS=Arabidopsis thaliana GN=BHLH51 PE=2 SV=1
Match Name        E-value    Identity (%)  Description
F6HF16_VITVI      3.2e-105   67.64         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02940 PE=4 SV=...
A0A061EB14_THECC  7.1e-105   67.15         Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao GN=TCM...
B9SEI3_RICCO      2.1e-104   68.09         DNA binding protein, putative OS=Ricinus communis GN=RCOM_0705340 PE=4 SV=1
B9HWM2_POPTR      1.1e-102   67.65         Basic helix-loop-helix family protein OS=Populus trichocarpa GN=POPTR_0010s14010...
A0A0B0N1U5_GOSAR  2.5e-102   66.96         Transcription factor bHLH30-like protein OS=Gossypium arboreum GN=F383_03211 PE=...
Match Name   E-value   Identity (%)  Description
AT1G68810.1  4.1e-80   59.16         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G25710.1  2.0e-58   51.25         basic helix-loop-helix 32
AT2G41130.1  9.0e-35   46.19         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G56770.1  1.9e-32   41.79         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT2G40200.1  7.9e-23   36.98         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
Match Name                        E-value    Identity (%)  Description
gi|225423869|ref|XP_002278697.1|  4.6e-105   67.64         PREDICTED: transcription factor bHLH30 [Vitis vinifera]
gi|590701106|ref|XP_007046318.1|  1.0e-104   67.15         Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao]
gi|255566837|ref|XP_002524402.1|  3.0e-104   68.09         PREDICTED: transcription factor bHLH30 [Ricinus communis]
gi|224111740|ref|XP_002315961.1|  1.6e-102   67.65         basic helix-loop-helix family protein [Populus trichocarpa]
gi|728826119|gb|KHG06567.1|       3.6e-102   66.96         Transcription factor bHLH30 -like protein [Gossypium arboreum]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR011598   bHLH_dom
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0046983 protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G020040.1  CmaCh04G020040.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                                 Source   Source Term        Source Description                         Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10  -                                          coord: 135..188; score: 6.5
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PFAM     PF00010            HLH                                        coord: 134..180; score: 7.4
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  SMART    SM00353            finulus                                    coord: 137..186; score: 7.6
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888            BHLH                                       coord: 131..180; score: 16
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain   coord: 135..185; score: 1.57
None       No IPR available                                GENE3D   G3DSA:3.30.70.260  -                                          coord: 218..270; score: 1.
None       No IPR available                                PANTHER  PTHR12565          STEROL REGULATORY ELEMENT-BINDING PROTEIN  coord: 9..294; score: 1.4E
None       No IPR available                                PANTHER  PTHR12565:SF139    SUBFAMILY NOT NAMED                        coord: 9..294; score: 1.4E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                    Block
CmaCh04G020040  Cucurbita moschata (Rifu)   cmacmoB679
CmaCh04G020040  Cucurbita moschata (Rifu)   cmacmoB729
CmaCh04G020040  Cucurbita maxima (Rimu)     cmacmaB018