Cp4.1LG12g04380 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG12g04380
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Basic helix-loop-helix (bHLH) family transcription factor
Location: Cp4.1LG12 : 3393961 .. 3396917 (-)
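
As a quick check on the Location coordinates above, the span of the feature can be computed directly. This is a minimal Python sketch that assumes the coordinates are 1-based and inclusive (the usual convention in GFF-style annotation; the page itself does not state this). The helper name `feature_span` is hypothetical.

```python
def feature_span(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of Cp4.1LG12g04380.
span = feature_span(3393961, 3396917)
print(span)  # 2957 under the 1-based inclusive assumption
```

The strand, shown as (-), does not change the span; it only indicates that the gene is read from the reverse strand of Cp4.1LG12.
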
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTATGTATCATGAAGATTTCCTATAACTAACTAAAATATAAACAAAGCACCCAAATAAGGATAAAGGGAAAGAAGATCTAACTAACCTACTATGATAAACTCTTCCCTAAACATCCTCCATGTAGATCTCGAGACCATTTTTGTTTTTTTTCTCTCTCCAAATAAATCTTCTTTATGGTAGCAGCTTGAAGGAGGTGGAAATCAATTTTTTTTAGGGTCGTTACATCATATACCTCTAATAAATGTTTAATCTCGGAATAACTTAGATTCTCGTTTTAAGAGTTGCTTGTTAAGTCTTGAAGCACAATTAATGAAAAATTCTTAACCAATATACCCTAATCCATACGAGTCATAAACATGTCTTTGCATACATAAATCTTTCATAAGCATCTTGTCACTTGCTATTCAATAAAAACCTTAATTTCTTATGAAGTGTAGTGAGAGGTAGTTACCTACTTTTCCTCTTGATTTATATTCCAAAGTTGTCCTAATTTGTTATGATCTTTCATACTTATTCTTAGTATTGTAAATGAGTCTCATATTAATTTTCTCACTATTTTGAGTTTTTTAAGGTCTTGAAAACATCTTAGAATTCTCATATGGGTATTTTGATCATTTATTACCCCAAATTTTGATTTAAATCCTTAACATATTCCAATGGTCTTGTAACTATATATATAATCCCTTTAACCATGACATAAGACTCCTTGTCAATTTTCATGTCATTTAGAGGTCACTTTGGCCATAAGCTAGTGGAACATAATTATAGGGGCATTTTACCTACCCATGGTTTAATAGTATAATGATCATCAAATTCCATACATAACCTTGAGACTTATCCTAAAATTTAATGAGCAACAAAATTCATTTAATAGGTTTGTGGCTCACGCTCTCCATTGAAATCAATATTCATGTCGATATACTTTAACTTAAACCTTAATTTCAACTTTCTAACTTTTTTTCAATATGACTTGTTTCCTTCCGTCAATTAGATTAACAGTTTTACGAGCTTCATGCATTACATTGTCTTCTGAGTCAAATTCTCTAAATTTTGCTAAAGGTTTACTAATTGACCTCTTTGTCCATACATCTTGAGGATCTCCTTTTAACAACTTTCCTATAACCATACCAAAAAGGCACTACAATTTTTTTAATACCCATACTTAAATTTCATATTTACATATTTCATAGTGACAAAATTTCCTTTCAACACTACTACTTATCTATAGCAGACTATAAGATATTGTCGCTATCAAGCTTCCTTTGTCGTTTTCATTTGAAACTACTTACTCCTTAGTATACGTTGGTACAGTCCTTTCTATCTTTATGGTCATAATATTGTTCATTTTAAATATGTTCGAGATGTCTAACTAATACATCTGGCCTTCCTTAGCAAAATCATTTTTATGCATTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
CCCCACGGCCTTCCTCGAAGATCCTTCTGCTCCGGCGCGGCTGTCTGGCTCACCGGCCCGGACGAGCTCGAGCGGTACGACTGCCAAAGAGCCAAGGAGGCCAAATCCCAAGGCATCCAAACCCTCCTCTGCGTCCCAGCCCCTTTCGGCGTCCTCGAATTCGCATCCCGACAGATAATCCCCGAGGATTGGGGCTTAGTCCAACAGGTGAAATCCGTGTTGGAGTCGGACATACCCAATTTCAGAAACAGCAGCAGCCCACTTCCGTTGTTGGAGCAAGACGTCAATTTGGAGGAGATTGGGTTCTTGAGCGAAGCCCCAGAGGAGGAAGTGGGAAGGCCCGGCAGAGGGAAATCGAGAAGAACAGAGTCCGCAGGGGAATTGGAGCTGTCGGATTCCGACAGCCCAGTGGGGAAAGCAGCGGGGAAAAGAAGAGGGAGGAAACGAAACAACGTGGAAGCAGAACGGCAGAGGAGGGAGAAACTAAACAAGCGATTCTACGCGCTCCTGTCGGTAGTGCCGAACGTGTCGAGAATGGACAAGGCGTCGTTGCTGTGGGACGCGGTGTCTTACATAAACGCACTAAAGGGGAAGGTGGAGGAGATGGAGTTGGAGTTGGAGGTGAGGAAGTTGAAGAGAGGGGGAGGGGAGGGGGTGGAGAAACAGAGCACCACGACGTCGGAGGAGGAGGAGGAACAGGGGAAGGGCAGTACTCTGTTCGACGTGGAGGTAAAAAGGATGGGAGGAGGGGACGCCATGGTGCGAGTTGAGTCCCATAACCAAAACTGCCCTTCCGCCACATTGATGGGAGCATTACGAGATCTGGAAGTTCACATTCACCACGCCAACATCACCAATGTAAACGATTTGATGCTCCAACATGTTTTGATTAAGCTTCCCCATGCCTTCTCCACCGATGAAGCCTTCAAAGCATCTCTTTTATCTAAATTACACTAA

mRNA sequence

ATGACTATATCCTTCTGCTCCGGCGCGGCTGTCTGGCTCACCGGCCCGGACGAGCTCGAGCGGTACGACTGCCAAAGAGCCAAGGAGGCCAAATCCCAAGGCATCCAAACCCTCCTCTGCGTCCCAGCCCCTTTCGGCGTCCTCGAATTCGCATCCCGACAGATAATCCCCGAGGATTGGGGCTTAGTCCAACAGGTGAAATCCGTGTTGGAGTCGGACATACCCAATTTCAGAAACAGCAGCAGCCCACTTCCGTTGTTGGAGCAAGACGTCAATTTGGAGGAGATTGGGTTCTTGAGCGAAGCCCCAGAGGAGGAAGTGGGAAGGCCCGGCAGAGGGAAATCGAGAAGAACAGAGTCCGCAGGGGAATTGGAGCTGTCGGATTCCGACAGCCCAGTGGGGAAAGCAGCGGGGAAAAGAAGAGGGAGGAAACGAAACAACGTGGAAGCAGAACGGCAGAGGAGGGAGAAACTAAACAAGCGATTCTACGCGCTCCTGTCGGTAGTGCCGAACGTGTCGAGAATGGACAAGGCGTCGTTGCTGTGGGACGCGGTGTCTTACATAAACGCACTAAAGGGGAAGGTGGAGGAGATGGAGTTGGAGTTGGAGGTGAGGAAGTTGAAGAGAGGGGGAGGGGAGGGGGTGGAGAAACAGAGCACCACGACGTCGGAGGAGGAGGAGGAACAGGGGAAGGGCAGTACTCTGTTCGACGTGGAGGTAAAAAGGATGGGAGGAGGGGACGCCATGGTGCGAGTTGAGTCCCATAACCAAAACTGCCCTTCCGCCACATTGATGGGAGCATTACGAGATCTGGAAGTTCACATTCACCACGCCAACATCACCAATGTAAACGATTTGATGCTCCAACATGTTTTGATTAAGCTTCCCCATGCCTTCTCCACCGATGAAGCCTTCAAAGCATCTCTTTTATCTAAATTACACTAA

Coding sequence (CDS)

ATGACTATATCCTTCTGCTCCGGCGCGGCTGTCTGGCTCACCGGCCCGGACGAGCTCGAGCGGTACGACTGCCAAAGAGCCAAGGAGGCCAAATCCCAAGGCATCCAAACCCTCCTCTGCGTCCCAGCCCCTTTCGGCGTCCTCGAATTCGCATCCCGACAGATAATCCCCGAGGATTGGGGCTTAGTCCAACAGGTGAAATCCGTGTTGGAGTCGGACATACCCAATTTCAGAAACAGCAGCAGCCCACTTCCGTTGTTGGAGCAAGACGTCAATTTGGAGGAGATTGGGTTCTTGAGCGAAGCCCCAGAGGAGGAAGTGGGAAGGCCCGGCAGAGGGAAATCGAGAAGAACAGAGTCCGCAGGGGAATTGGAGCTGTCGGATTCCGACAGCCCAGTGGGGAAAGCAGCGGGGAAAAGAAGAGGGAGGAAACGAAACAACGTGGAAGCAGAACGGCAGAGGAGGGAGAAACTAAACAAGCGATTCTACGCGCTCCTGTCGGTAGTGCCGAACGTGTCGAGAATGGACAAGGCGTCGTTGCTGTGGGACGCGGTGTCTTACATAAACGCACTAAAGGGGAAGGTGGAGGAGATGGAGTTGGAGTTGGAGGTGAGGAAGTTGAAGAGAGGGGGAGGGGAGGGGGTGGAGAAACAGAGCACCACGACGTCGGAGGAGGAGGAGGAACAGGGGAAGGGCAGTACTCTGTTCGACGTGGAGGTAAAAAGGATGGGAGGAGGGGACGCCATGGTGCGAGTTGAGTCCCATAACCAAAACTGCCCTTCCGCCACATTGATGGGAGCATTACGAGATCTGGAAGTTCACATTCACCACGCCAACATCACCAATGTAAACGATTTGATGCTCCAACATGTTTTGATTAAGCTTCCCCATGCCTTCTCCACCGATGAAGCCTTCAAAGCATCTCTTTTATCTAAATTACACTAA

Protein sequence

MTISFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLVQQVKSVLESDIPNFRNSSSPLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSRRTESAGELELSDSDSPVGKAAGKRRGRKRNNVEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEMELELEVRKLKRGGGEGVEKQSTTTSEEEEEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKLH
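
The relationship between the coding sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly, and the helper name `translate` is hypothetical. Only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACTATATCCTTCTGCTCCGGCGCGGCT"))  # MTISFCSGAA (protein start)
print(translate("TCTAAATTACACTAA"))                 # SKLH (protein end, then stop)
```

Both fragments agree with the start (MTISFCSGAA) and end (SKLH) of the protein sequence listed above.
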
BLAST of Cp4.1LG12g04380 vs. Swiss-Prot
Match: BH028_ARATH (Transcription factor bHLH28 OS=Arabidopsis thaliana GN=BHLH28 PE=2 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 8.6e-40
Identity = 133/357 (37.25%), Positives = 176/357 (49.30%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           +F S   V +TG D +    C RAK+    G+QT+LC+P+  GVLE AS + I  +  L 
Sbjct: 159 AFASYNPVLVTGSDLIYGSGCDRAKQGGDVGLQTILCIPSHNGVLELASTEEIRPNSDLF 218

Query: 64  QQVK-----SVLESDIPNFRNSSSPLPL------------------LEQDVNLE---EIG 123
            +++     S   S  PN  +   P  L                  L+   NL       
Sbjct: 219 NRIRFLFGGSKYFSGAPNSNSELFPFQLESSCSSTVTGNPNPSPVYLQNRYNLNFSTSSS 278

Query: 124 FLSEAPEEEVGRPGRGKSRRTESAGELELSDSDSPV--------GKAAGKRRGRKR---- 183
            L+ AP  +V   G    +  E+      SD    V         K  GK+RGRK     
Sbjct: 279 TLARAPCGDVLSFGENVKQSFENRNPNTYSDQIQNVVPHATVMLEKKKGKKRGRKPAHGR 338

Query: 184 ----NNVEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEMEL- 243
               N+VEAER RREKLN RFYAL +VVPNVS+MDK SLL DAV YIN LK K E +EL 
Sbjct: 339 DKPLNHVEAERMRREKLNHRFYALRAVVPNVSKMDKTSLLEDAVCYINELKSKAENVELE 398

Query: 244 ----ELEVRKLKRGGGEGVEKQSTTTSEEEEEQGKGSTLFDVEVKRMGGGDAMVRVESHN 303
               E++  +LK   G+     S    EE     K S +  +EVK M   DAMVRVES  
Sbjct: 399 KHAIEIQFNELKEIAGQRNAIPSVCKYEE-----KASEMMKIEVKIMESDDAMVRVESRK 458

Query: 304 QNCPSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
            + P A LM AL DLE+ ++HA+I+ +NDLM+Q   +K+       E  +  L+SK+
Sbjct: 459 DHHPGARLMNALMDLELEVNHASISVMNDLMIQQANVKMGLRIYKQEELRDLLMSKI 510
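
The percentages in the HSP header for this hit follow from the reported counts: the number of identical (or positively scoring) positions divided by the alignment length. A minimal Python sketch, using a hypothetical helper named `percent`, reproduces the two figures for the bHLH28 alignment:

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}%"


# Counts taken from the HSP header of the BH028_ARATH hit.
print(percent(133, 357))  # identity  -> 37.25%
print(percent(176, 357))  # positives -> 49.30%
```

The same calculation applies to every HSP header on this page, since each one reports its counts as a fraction of the alignment length.
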

BLAST of Cp4.1LG12g04380 vs. Swiss-Prot
Match: BH014_ARATH (Transcription factor bHLH14 OS=Arabidopsis thaliana GN=BHLH14 PE=2 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 6.8e-37
Identity = 107/313 (34.19%), Positives = 172/313 (54.95%), Query Frame = 1

Query: 11  VWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLVQQVKSVL 70
           VWLTGPDEL   + +RAKEA   G+ TL+ +P   G++E  S + I ++   + +VKS+ 
Sbjct: 130 VWLTGPDELRFSNYERAKEAGFHGVHTLVSIPINNGIIELGSSESIIQNRNFINRVKSIF 189

Query: 71  ESDIP---NFRNSSSPLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSRRTESAGELELS 130
            S        +  S P P +          F SE       R  R K   T  A   +  
Sbjct: 190 GSGKTTKHTNQTGSYPKPAVSDHSKSGNQQFGSE-------RKRRRKLETTRVAAATK-- 249

Query: 131 DSDSPVGKAAGKRRGRKRNNVEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSY 190
                      K      ++VEAE+QRREKLN RFYAL ++VP VSRMDKASLL DAVSY
Sbjct: 250 ----------EKHHPAVLSHVEAEKQRREKLNHRFYALRAIVPKVSRMDKASLLSDAVSY 309

Query: 191 INALKGKVEEMELELEVRKLKRGGGEGVEKQSTTTS------EEEEEQGKGSTLFDVEVK 250
           I +LK K++  +LE E++K+K    + ++  S+ TS      +  ++  K +   D+EV+
Sbjct: 310 IESLKSKID--DLETEIKKMKMTETDKLDNSSSNTSPSSVEYQVNQKPSKSNRGSDLEVQ 369

Query: 251 -RMGGGDAMVRVESHNQNCPSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFS 310
            ++ G +A++RV++ N N P++ LM AL +++  + HAN + ++ +M+Q V++ +P    
Sbjct: 370 VKIVGEEAIIRVQTENVNHPTSALMSALMEMDCRVQHANASRLSQVMVQDVVVLVPEGLR 421

Query: 311 TDEAFKASLLSKL 314
           +++  + +L+  L
Sbjct: 430 SEDRLRTTLVRTL 421

BLAST of Cp4.1LG12g04380 vs. Swiss-Prot
Match: MYC2_ARATH (Transcription factor MYC2 OS=Arabidopsis thaliana GN=MYC2 PE=1 SV=2)

HSP 1 Score: 142.5 bits (358), Expect = 7.8e-33
Identity = 100/222 (45.05%), Positives = 136/222 (61.26%), Query Frame = 1

Query: 119 ESAGELELSDSDSPVGKAAG-----KRRGRKR--------NNVEAERQRREKLNKRFYAL 178
           ++AGE + SD ++ V K        K+RGRK         N+VEAERQRREKLN+RFYAL
Sbjct: 412 KTAGESDHSDLEASVVKEVAVEKRPKKRGRKPANGREEPLNHVEAERQRREKLNQRFYAL 471

Query: 179 LSVVPNVSRMDKASLLWDAVSYINALKGKV--------------EEMELELEVRKLKRGG 238
            +VVPNVS+MDKASLL DA++YIN LK KV              EE++LEL  RK    G
Sbjct: 472 RAVVPNVSKMDKASLLGDAIAYINELKSKVVKTESEKLQIKNQLEEVKLELAGRKASASG 531

Query: 239 GEGVEKQSTTTSEEEEEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDL 298
           G+     S++ S  +          ++EVK + G DAM+RVES  +N P+A LM AL DL
Sbjct: 532 GD----MSSSCSSIK------PVGMEIEVKII-GWDAMIRVESSKRNHPAARLMSALMDL 591

Query: 299 EVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           E+ ++HA+++ VNDLM+Q   +K+     T E  +ASL+SK+
Sbjct: 592 ELEVNHASMSVVNDLMIQQATVKMGFRIYTQEQLRASLISKI 622

BLAST of Cp4.1LG12g04380 vs. Swiss-Prot
Match: MYC4_ARATH (Transcription factor MYC4 OS=Arabidopsis thaliana GN=MYC4 PE=1 SV=1)

HSP 1 Score: 127.1 bits (318), Expect = 3.4e-28
Identity = 76/188 (40.43%), Positives = 118/188 (62.77%), Query Frame = 1

Query: 134 GKAAGKRRGRKRNNVEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKG 193
           G+     R    N+VEAERQRREKLN+RFY+L +VVPNVS+MDKASLL DA+SYI+ LK 
Sbjct: 404 GRKPANGREEPLNHVEAERQRREKLNQRFYSLRAVVPNVSKMDKASLLGDAISYISELKS 463

Query: 194 KV-------EEMELELEVRKLKRGGGEGVEKQSTTTSEEEEEQGKGSTLFDVEVK-RMGG 253
           K+       EE++ +++V   + G  +   K     ++E       S L ++EV  ++ G
Sbjct: 464 KLQKAESDKEELQKQIDVMNKEAGNAKSSVKDRKCLNQE------SSVLIEMEVDVKIIG 523

Query: 254 GDAMVRVESHNQNCPSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAF 313
            DAM+R++   +N P A  M AL++L++ ++HA+++ VNDLM+Q   +K+ + F T +  
Sbjct: 524 WDAMIRIQCSKRNHPGAKFMEALKELDLEVNHASLSVVNDLMIQQATVKMGNQFFTQDQL 583

BLAST of Cp4.1LG12g04380 vs. Swiss-Prot
Match: MYC2_ORYSJ (Transcription factor MYC2 OS=Oryza sativa subsp. japonica GN=MYC2 PE=1 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 1.7e-27
Identity = 88/227 (38.77%), Positives = 138/227 (60.79%), Query Frame = 1

Query: 103 PEEEVGRPGRGKSRRTE---SAGELELSDSDSPVGKAAGK--RRGRKR--------NNVE 162
           P    G P + +S  ++   S  E+E S   +P  +A  +  +RGRK         N+VE
Sbjct: 468 PSTGTGAPAKSESDHSDLEASVREVESSRVVAPPPEAEKRPRKRGRKPANGREEPLNHVE 527

Query: 163 AERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEMELELEVRKLKR 222
           AERQRREKLN+RFYAL +VVPNVS+MDKASLL DA+SYIN L+GK+  +E + E  + + 
Sbjct: 528 AERQRREKLNQRFYALRAVVPNVSKMDKASLLGDAISYINELRGKLTALETDKETLQSQM 587

Query: 223 GGGEGVEKQSTTTSEEEEEQG--KGSTLFDVEVK-RMGGGDAMVRVESHNQNCPSATLMG 282
              E ++K+           G   G+    VE++ ++ G +AM+RV+ H +N P+A LM 
Sbjct: 588 ---ESLKKERDARPPAPSGGGGDGGARCHAVEIEAKILGLEAMIRVQCHKRNHPAARLMT 647

Query: 283 ALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           ALR+L++ ++HA+++ V DLM+Q V +K+     + +   A+L +++
Sbjct: 648 ALRELDLDVYHASVSVVKDLMIQQVAVKMASRVYSQDQLNAALYTRI 691

BLAST of Cp4.1LG12g04380 vs. TrEMBL
Match: A0A0A0KFN7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G107910 PE=4 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 4.2e-94
Identity = 205/340 (60.29%), Positives = 241/340 (70.88%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           SF S + VWLTG +EL  +DC R KEAKS GIQT LCVP  +GVLE AS+QIIPEDWGL+
Sbjct: 99  SFTSSSVVWLTGSEELHLHDCHRVKEAKSHGIQTFLCVPTSYGVLELASQQIIPEDWGLI 158

Query: 64  QQVKSVLESDIPNFRNSS-SPLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSRRTESAG 123
           QQ+KS+ +SD  NF  ++ +PLP L+QD N E+IGF+SE  EEE+  P R K++     G
Sbjct: 159 QQIKSLFDSDFVNFSTTTDTPLPFLDQDFNFEDIGFISEVAEEEMETPLRKKTK----TG 218

Query: 124 ELELSDSDSP-----VGKAAGKRRGRK--------RNNVEAERQRREKLNKRFYALLSVV 183
           E ELSDSDSP     V K  G++RGRK         N+VEAERQRREKLN RFYAL SVV
Sbjct: 219 EWELSDSDSPVLKTGVMKKTGQKRGRKPNMSKENAMNHVEAERQRREKLNNRFYALRSVV 278

Query: 184 PNVSRMDKASLLWDAVSYINALKGKVEEMELELEVRKLKRGGGEGVEKQSTTTSEEEEEQ 243
           PNVSRMDKASLL DAVSYINALK KVEEMEL+L  R+ K+   EG + QSTTT+ EE  +
Sbjct: 279 PNVSRMDKASLLSDAVSYINALKAKVEEMELQL--RESKKSRDEGGDNQSTTTTSEELMK 338

Query: 244 GKGS---------------TLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDLEVH 303
           G                  T FDVEVK + G DAMVRV+SHN N PSA +MG  RD+E  
Sbjct: 339 GNSGGGVTTPTITTTTTTMTRFDVEVKII-GRDAMVRVQSHNLNFPSAIVMGVFRDMEFE 398

Query: 304 IHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKLH 315
           I HA+ITNVND+MLQ VLIKLPH FSTDEA KA++LS+LH
Sbjct: 399 IQHASITNVNDIMLQDVLIKLPHGFSTDEALKAAVLSRLH 431

BLAST of Cp4.1LG12g04380 vs. TrEMBL
Match: A0A061DS37_THECC (Basic helix-loop-helix DNA-binding family protein OS=Theobroma cacao GN=TCM_005064 PE=4 SV=1)

HSP 1 Score: 252.7 bits (644), Expect = 6.0e-64
Identity = 156/354 (44.07%), Positives = 216/354 (61.02%), Query Frame = 1

Query: 7   SGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLVQQV 66
           +G+ VWLTG  EL+ Y+C+RA+EA+   I+TL+C+P   GVLE  S ++I E+WGLVQQV
Sbjct: 145 TGSLVWLTGAHELQFYNCERAREAQMHAIETLVCIPTSCGVLELGSSEMIRENWGLVQQV 204

Query: 67  KSVLESDI---------PNFRNSSSPLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSRR 126
           KSV  SD+         PN   +  P+  L+++++  +IG ++   EE+     R K   
Sbjct: 205 KSVFGSDLIGLVPKQSNPNPNLTPGPIQFLDRNISFADIGIIAGVQEEDASPDNRTKQEN 264

Query: 127 -------------TESAGELELSDSDSP------VGKAAGKRRGRKR--------NNVEA 186
                          S  + E SDSD P      + K   K+RGRK         N+VEA
Sbjct: 265 HNNQTKKDSTKPGQSSYVDSEHSDSDCPLLAMNNIEKRTPKKRGRKPGLGRETPLNHVEA 324

Query: 187 ERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVRKLK 246
           ERQRREKLN RFYAL +VVPNVSRMDKASLL DAVSYIN LK K+EE+E  L+ E +K+K
Sbjct: 325 ERQRREKLNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKAKIEELESQLQRECKKVK 384

Query: 247 RGGGEGVEKQSTTTSEEEEEQ---------GKGSTLFDVEVKRMGGGDAMVRVESHNQNC 306
               + ++ QSTTTS ++  +         G G   FD+++    G DAM+RV+S N N 
Sbjct: 385 VEMVDAMDNQSTTTSVDQAARPSNSSSGTAGSGGLEFDIKIM---GNDAMIRVQSENVNY 444

Query: 307 PSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           PSA LM ALRDLE  +HHA+++ VN+LMLQ +++++P    T+E  K++LL +L
Sbjct: 445 PSARLMIALRDLEFQVHHASMSCVNELMLQDIVVRVPDGLRTEEGLKSALLRRL 495

BLAST of Cp4.1LG12g04380 vs. TrEMBL
Match: M5X2D4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004680mg PE=4 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 2.3e-63
Identity = 160/345 (46.38%), Positives = 217/345 (62.90%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           +F SG+ VWLTG  EL+ Y+C RAKEA+  G QTL+C+P P GVLE  S   I E+W LV
Sbjct: 151 AFSSGSVVWLTGSHELQFYNCDRAKEAQMHGFQTLVCIPTPTGVLEMGSSDSIRENWSLV 210

Query: 64  QQVKSVLESDI-------PNFRNSSSPLPLLEQDVNLEEIGFLSEAPEEEV--------- 123
           QQ KS+  SD+       P+   + SP+  + ++ +  +IG ++   EEE          
Sbjct: 211 QQAKSLFGSDLICSVADQPD-PETRSPIDFINRNFSFADIGIIAGVEEEEDDKKEVALDL 270

Query: 124 -------GRPGRGKSRRTESAGELELSDSDSPVGKAAGKRRGRKR--------NNVEAER 183
                  G PG G    + +  + + SDSD P  K   K+RGRK         N+VEAER
Sbjct: 271 TMMKRKGGNPGTGLYPDSNANPKPDYSDSDGP--KRTPKKRGRKPGLGRDTPLNHVEAER 330

Query: 184 QRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVRKLKRG 243
           QRREKLN RFYAL +VVPNVSRMDKASLL DAVSYIN LK KV+E+E  ++ E +K+K  
Sbjct: 331 QRREKLNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKTKVDELESQVQRESKKVKVE 390

Query: 244 GGEGVEKQSTTTSEEE-----EEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLM 303
            G+ ++ QSTTTS E+          GS L +VEVK + G DAM+RV+S N N PSA LM
Sbjct: 391 TGDNLDIQSTTTSVEQIAKPPSSSANGSGL-EVEVK-IVGTDAMIRVQSENVNYPSARLM 450

Query: 304 GALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLL 311
            ALRDLE+ IHHA+++ +N+LMLQ +++K+P    ++++ K++LL
Sbjct: 451 AALRDLELQIHHASLSCINELMLQDIVLKVPENMRSEDSLKSALL 490

BLAST of Cp4.1LG12g04380 vs. TrEMBL
Match: A0A059BZD6_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00019 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 6.6e-63
Identity = 159/349 (45.56%), Positives = 213/349 (61.03%), Query Frame = 1

Query: 7   SGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLVQQV 66
           +G+ VWLTG  ELE Y C RAKEA+  GI+T++C+P   GVLE  S  +IPE+WGLVQ+ 
Sbjct: 146 TGSLVWLTGARELESYKCDRAKEAELHGIRTMVCIPTGDGVLELGSCDVIPENWGLVQRA 205

Query: 67  KSVLESDIPNFRNSSSPLPLLE-----QDVNLEEIGFLSE------APEEEVGRPGRGKS 126
           KS+  SD+   ++   P P  +      D++  +IG ++       AP ++  +  + K 
Sbjct: 206 KSLFGSDLLLPKHPPPPPPPFQLHHDHSDISFADIGIIAGVQENDFAPHDDHEKKVKKKQ 265

Query: 127 RRTESAG--------------ELELSDSDSP------VGKAAGKRRGRKR--------NN 186
              E AG              E E SDSDSP        K   K+RGRK         N+
Sbjct: 266 PLVEGAGGKPEAPFGCSSYLVESEHSDSDSPFMAAVMTEKRTPKKRGRKPGLGRDTPLNH 325

Query: 187 VEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVR 246
           VEAERQRREKLN RFYAL +VVPNVSRMDKASLL DAVSYIN LK K+ ++E  L+ E +
Sbjct: 326 VEAERQRREKLNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKSKIGDLESQLQRESK 385

Query: 247 KLKRGGGEGVEKQSTTTS-EEEEEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATL 306
           ++K+   +  +  STTTS +     G G +L +VEVK + G DAM+RV+S N N PSA L
Sbjct: 386 RVKQEVTDATDNLSTTTSVDHSSPSGCGGSLLEVEVK-IVGCDAMIRVQSENANYPSARL 445

Query: 307 MGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           M A+RDLE+HIHHA+++ VNDLMLQ V++ +P     +E  +A+LL  L
Sbjct: 446 MAAMRDLELHIHHASLSTVNDLMLQDVVVSVPEGLKGEEDLRAALLRAL 493

BLAST of Cp4.1LG12g04380 vs. TrEMBL
Match: A9PF26_POPTR (Putative uncharacterized protein OS=Populus trichocarpa PE=2 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 1.1e-62
Identity = 155/349 (44.41%), Positives = 218/349 (62.46%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           ++ +G+ +WLTG  EL+ Y+C+R KEA+  GI+TL+C+P   GVLE  S  +I E+WGLV
Sbjct: 143 AYTTGSLIWLTGGHELQFYNCERVKEAQMHGIETLVCIPTSCGVLELGSSSVIRENWGLV 202

Query: 64  QQVKSVLESD-----IPNFRNSSS--PLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSR 123
           QQ KS+  SD     +P   N+SS  P   L++ ++  ++G ++   E+      +  +R
Sbjct: 203 QQAKSLFGSDLSAYLVPKGPNNSSEEPTQFLDRSISFADMGIIAGLQEDCAVDREQKNAR 262

Query: 124 RTESAGEL------------ELSDSDSPV-----GKAAGKRRGRKR--------NNVEAE 183
            TE A +             E SDSD P+      K   K+RGRK         N+VEAE
Sbjct: 263 ETEEANKRNANKPGLSYLNSEHSDSDFPLLAMHMEKRIPKKRGRKPGLGRDAPLNHVEAE 322

Query: 184 RQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVRKLKR 243
           RQRREKLN RFYAL +VVPNVSRMDKASLL DAVSYIN LK KV+E+E  LE E +K+K 
Sbjct: 323 RQRREKLNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKAKVDELESQLERESKKVKL 382

Query: 244 GGGEGVEKQSTTTSEEE-----EEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATL 303
              + ++ QSTTTS ++        G      +VE+K + G DAM+RV+S N N P++ L
Sbjct: 383 EVADNLDNQSTTTSVDQSACRPNSAGGAGLALEVEIKFV-GNDAMIRVQSENVNYPASRL 442

Query: 304 MGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           M ALR+LE  +HHA+++ VN+LMLQ V++++P    T+EA K++LL +L
Sbjct: 443 MCALRELEFQVHHASMSCVNELMLQDVVVRVPDGLRTEEALKSALLGRL 490

BLAST of Cp4.1LG12g04380 vs. TAIR10
Match: AT5G46830.1 (AT5G46830.1 NACL-inducible gene 1)

HSP 1 Score: 165.6 bits (418), Expect = 4.8e-41
Identity = 133/357 (37.25%), Positives = 176/357 (49.30%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           +F S   V +TG D +    C RAK+    G+QT+LC+P+  GVLE AS + I  +  L 
Sbjct: 159 AFASYNPVLVTGSDLIYGSGCDRAKQGGDVGLQTILCIPSHNGVLELASTEEIRPNSDLF 218

Query: 64  QQVK-----SVLESDIPNFRNSSSPLPL------------------LEQDVNLE---EIG 123
            +++     S   S  PN  +   P  L                  L+   NL       
Sbjct: 219 NRIRFLFGGSKYFSGAPNSNSELFPFQLESSCSSTVTGNPNPSPVYLQNRYNLNFSTSSS 278

Query: 124 FLSEAPEEEVGRPGRGKSRRTESAGELELSDSDSPV--------GKAAGKRRGRKR---- 183
            L+ AP  +V   G    +  E+      SD    V         K  GK+RGRK     
Sbjct: 279 TLARAPCGDVLSFGENVKQSFENRNPNTYSDQIQNVVPHATVMLEKKKGKKRGRKPAHGR 338

Query: 184 ----NNVEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEMEL- 243
               N+VEAER RREKLN RFYAL +VVPNVS+MDK SLL DAV YIN LK K E +EL 
Sbjct: 339 DKPLNHVEAERMRREKLNHRFYALRAVVPNVSKMDKTSLLEDAVCYINELKSKAENVELE 398

Query: 244 ----ELEVRKLKRGGGEGVEKQSTTTSEEEEEQGKGSTLFDVEVKRMGGGDAMVRVESHN 303
               E++  +LK   G+     S    EE     K S +  +EVK M   DAMVRVES  
Sbjct: 399 KHAIEIQFNELKEIAGQRNAIPSVCKYEE-----KASEMMKIEVKIMESDDAMVRVESRK 458

Query: 304 QNCPSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
            + P A LM AL DLE+ ++HA+I+ +NDLM+Q   +K+       E  +  L+SK+
Sbjct: 459 DHHPGARLMNALMDLELEVNHASISVMNDLMIQQANVKMGLRIYKQEELRDLLMSKI 510

BLAST of Cp4.1LG12g04380 vs. TAIR10
Match: AT4G00870.1 (AT4G00870.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 156.0 bits (393), Expect = 3.8e-38
Identity = 107/313 (34.19%), Positives = 172/313 (54.95%), Query Frame = 1

Query: 11  VWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLVQQVKSVL 70
           VWLTGPDEL   + +RAKEA   G+ TL+ +P   G++E  S + I ++   + +VKS+ 
Sbjct: 130 VWLTGPDELRFSNYERAKEAGFHGVHTLVSIPINNGIIELGSSESIIQNRNFINRVKSIF 189

Query: 71  ESDIP---NFRNSSSPLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSRRTESAGELELS 130
            S        +  S P P +          F SE       R  R K   T  A   +  
Sbjct: 190 GSGKTTKHTNQTGSYPKPAVSDHSKSGNQQFGSE-------RKRRRKLETTRVAAATK-- 249

Query: 131 DSDSPVGKAAGKRRGRKRNNVEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSY 190
                      K      ++VEAE+QRREKLN RFYAL ++VP VSRMDKASLL DAVSY
Sbjct: 250 ----------EKHHPAVLSHVEAEKQRREKLNHRFYALRAIVPKVSRMDKASLLSDAVSY 309

Query: 191 INALKGKVEEMELELEVRKLKRGGGEGVEKQSTTTS------EEEEEQGKGSTLFDVEVK 250
           I +LK K++  +LE E++K+K    + ++  S+ TS      +  ++  K +   D+EV+
Sbjct: 310 IESLKSKID--DLETEIKKMKMTETDKLDNSSSNTSPSSVEYQVNQKPSKSNRGSDLEVQ 369

Query: 251 -RMGGGDAMVRVESHNQNCPSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFS 310
            ++ G +A++RV++ N N P++ LM AL +++  + HAN + ++ +M+Q V++ +P    
Sbjct: 370 VKIVGEEAIIRVQTENVNHPTSALMSALMEMDCRVQHANASRLSQVMVQDVVVLVPEGLR 421

Query: 311 TDEAFKASLLSKL 314
           +++  + +L+  L
Sbjct: 430 SEDRLRTTLVRTL 421

BLAST of Cp4.1LG12g04380 vs. TAIR10
Match: AT1G32640.1 (AT1G32640.1 Basic helix-loop-helix (bHLH) DNA-binding family protein)

HSP 1 Score: 142.5 bits (358), Expect = 4.4e-34
Identity = 100/222 (45.05%), Positives = 136/222 (61.26%), Query Frame = 1

Query: 119 ESAGELELSDSDSPVGKAAG-----KRRGRKR--------NNVEAERQRREKLNKRFYAL 178
           ++AGE + SD ++ V K        K+RGRK         N+VEAERQRREKLN+RFYAL
Sbjct: 412 KTAGESDHSDLEASVVKEVAVEKRPKKRGRKPANGREEPLNHVEAERQRREKLNQRFYAL 471

Query: 179 LSVVPNVSRMDKASLLWDAVSYINALKGKV--------------EEMELELEVRKLKRGG 238
            +VVPNVS+MDKASLL DA++YIN LK KV              EE++LEL  RK    G
Sbjct: 472 RAVVPNVSKMDKASLLGDAIAYINELKSKVVKTESEKLQIKNQLEEVKLELAGRKASASG 531

Query: 239 GEGVEKQSTTTSEEEEEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDL 298
           G+     S++ S  +          ++EVK + G DAM+RVES  +N P+A LM AL DL
Sbjct: 532 GD----MSSSCSSIK------PVGMEIEVKII-GWDAMIRVESSKRNHPAARLMSALMDL 591

Query: 299 EVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           E+ ++HA+++ VNDLM+Q   +K+     T E  +ASL+SK+
Sbjct: 592 ELEVNHASMSVVNDLMIQQATVKMGFRIYTQEQLRASLISKI 622

BLAST of Cp4.1LG12g04380 vs. TAIR10
Match: AT4G17880.1 (AT4G17880.1 Basic helix-loop-helix (bHLH) DNA-binding family protein)

HSP 1 Score: 127.1 bits (318), Expect = 1.9e-29
Identity = 76/188 (40.43%), Positives = 118/188 (62.77%), Query Frame = 1

Query: 134 GKAAGKRRGRKRNNVEAERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKG 193
           G+     R    N+VEAERQRREKLN+RFY+L +VVPNVS+MDKASLL DA+SYI+ LK 
Sbjct: 404 GRKPANGREEPLNHVEAERQRREKLNQRFYSLRAVVPNVSKMDKASLLGDAISYISELKS 463

Query: 194 KV-------EEMELELEVRKLKRGGGEGVEKQSTTTSEEEEEQGKGSTLFDVEVK-RMGG 253
           K+       EE++ +++V   + G  +   K     ++E       S L ++EV  ++ G
Sbjct: 464 KLQKAESDKEELQKQIDVMNKEAGNAKSSVKDRKCLNQE------SSVLIEMEVDVKIIG 523

Query: 254 GDAMVRVESHNQNCPSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAF 313
            DAM+R++   +N P A  M AL++L++ ++HA+++ VNDLM+Q   +K+ + F T +  
Sbjct: 524 WDAMIRIQCSKRNHPGAKFMEALKELDLEVNHASLSVVNDLMIQQATVKMGNQFFTQDQL 583

BLAST of Cp4.1LG12g04380 vs. TAIR10
Match: AT5G46760.1 (AT5G46760.1 Basic helix-loop-helix (bHLH) DNA-binding family protein)

HSP 1 Score: 124.0 bits (310), Expect = 1.6e-28
Identity = 82/218 (37.61%), Positives = 130/218 (59.63%), Query Frame = 1

Query: 120 SAGELELSDSDSPVGKAA---------GKRRGRKR--------NNVEAERQRREKLNKRF 179
           +A + + SD ++ V K A          ++RGRK         N+VEAERQRREKLN+RF
Sbjct: 372 AANDSDHSDLEASVVKEAIVVEPPEKKPRKRGRKPANGREEPLNHVEAERQRREKLNQRF 431

Query: 180 YALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEMELELEVRKLKRGG-------GEGV 239
           Y+L +VVPNVS+MDKASLL DA+SYIN LK K+++ E + E  + K  G       G+G 
Sbjct: 432 YSLRAVVPNVSKMDKASLLGDAISYINELKSKLQQAESDKEEIQKKLDGMSKEGNNGKGC 491

Query: 240 EKQSTTTSEEEEEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDLEVHI 299
             ++       ++    S   +++VK + G D M+RV+   ++ P A  M AL++L++ +
Sbjct: 492 GSRAKERKSSNQDSTASSIEMEIDVKII-GWDVMIRVQCGKKDHPGARFMEALKELDLEV 551

Query: 300 HHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           +HA+++ VNDLM+Q   +K+   F   +  K +L++K+
Sbjct: 552 NHASLSVVNDLMIQQATVKMGSQFFNHDQLKVALMTKV 588

BLAST of Cp4.1LG12g04380 vs. NCBI nr
Match: gi|449445714|ref|XP_004140617.1| (PREDICTED: transcription factor MYC2-like [Cucumis sativus])

HSP 1 Score: 352.8 bits (904), Expect = 6.1e-94
Identity = 205/340 (60.29%), Positives = 241/340 (70.88%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           SF S + VWLTG +EL  +DC R KEAKS GIQT LCVP  +GVLE AS+QIIPEDWGL+
Sbjct: 99  SFTSSSVVWLTGSEELHLHDCHRVKEAKSHGIQTFLCVPTSYGVLELASQQIIPEDWGLI 158

Query: 64  QQVKSVLESDIPNFRNSS-SPLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSRRTESAG 123
           QQ+KS+ +SD  NF  ++ +PLP L+QD N E+IGF+SE  EEE+  P R K++     G
Sbjct: 159 QQIKSLFDSDFVNFSTTTDTPLPFLDQDFNFEDIGFISEVAEEEMETPLRKKTK----TG 218

Query: 124 ELELSDSDSP-----VGKAAGKRRGRK--------RNNVEAERQRREKLNKRFYALLSVV 183
           E ELSDSDSP     V K  G++RGRK         N+VEAERQRREKLN RFYAL SVV
Sbjct: 219 EWELSDSDSPVLKTGVMKKTGQKRGRKPNMSKENAMNHVEAERQRREKLNNRFYALRSVV 278

Query: 184 PNVSRMDKASLLWDAVSYINALKGKVEEMELELEVRKLKRGGGEGVEKQSTTTSEEEEEQ 243
           PNVSRMDKASLL DAVSYINALK KVEEMEL+L  R+ K+   EG + QSTTT+ EE  +
Sbjct: 279 PNVSRMDKASLLSDAVSYINALKAKVEEMELQL--RESKKSRDEGGDNQSTTTTSEELMK 338

Query: 244 GKGS---------------TLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDLEVH 303
           G                  T FDVEVK + G DAMVRV+SHN N PSA +MG  RD+E  
Sbjct: 339 GNSGGGVTTPTITTTTTTMTRFDVEVKII-GRDAMVRVQSHNLNFPSAIVMGVFRDMEFE 398

Query: 304 IHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKLH 315
           I HA+ITNVND+MLQ VLIKLPH FSTDEA KA++LS+LH
Sbjct: 399 IQHASITNVNDIMLQDVLIKLPHGFSTDEALKAAVLSRLH 431

BLAST of Cp4.1LG12g04380 vs. NCBI nr
Match: gi|590720902|ref|XP_007051457.1| (Basic helix-loop-helix DNA-binding family protein [Theobroma cacao])

HSP 1 Score: 252.7 bits (644), Expect = 8.5e-64
Identity = 156/354 (44.07%), Positives = 216/354 (61.02%), Query Frame = 1

Query: 7   SGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLVQQV 66
           +G+ VWLTG  EL+ Y+C+RA+EA+   I+TL+C+P   GVLE  S ++I E+WGLVQQV
Sbjct: 145 TGSLVWLTGAHELQFYNCERAREAQMHAIETLVCIPTSCGVLELGSSEMIRENWGLVQQV 204

Query: 67  KSVLESDI---------PNFRNSSSPLPLLEQDVNLEEIGFLSEAPEEEVGRPGRGKSRR 126
           KSV  SD+         PN   +  P+  L+++++  +IG ++   EE+     R K   
Sbjct: 205 KSVFGSDLIGLVPKQSNPNPNLTPGPIQFLDRNISFADIGIIAGVQEEDASPDNRTKQEN 264

Query: 127 -------------TESAGELELSDSDSP------VGKAAGKRRGRKR--------NNVEA 186
                          S  + E SDSD P      + K   K+RGRK         N+VEA
Sbjct: 265 HNNQTKKDSTKPGQSSYVDSEHSDSDCPLLAMNNIEKRTPKKRGRKPGLGRETPLNHVEA 324

Query: 187 ERQRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVRKLK 246
           ERQRREKLN RFYAL +VVPNVSRMDKASLL DAVSYIN LK K+EE+E  L+ E +K+K
Sbjct: 325 ERQRREKLNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKAKIEELESQLQRECKKVK 384

Query: 247 RGGGEGVEKQSTTTSEEEEEQ---------GKGSTLFDVEVKRMGGGDAMVRVESHNQNC 306
               + ++ QSTTTS ++  +         G G   FD+++    G DAM+RV+S N N 
Sbjct: 385 VEMVDAMDNQSTTTSVDQAARPSNSSSGTAGSGGLEFDIKIM---GNDAMIRVQSENVNY 444

Query: 307 PSATLMGALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           PSA LM ALRDLE  +HHA+++ VN+LMLQ +++++P    T+E  K++LL +L
Sbjct: 445 PSARLMIALRDLEFQVHHASMSCVNELMLQDIVVRVPDGLRTEEGLKSALLRRL 495

BLAST of Cp4.1LG12g04380 vs. NCBI nr
Match: gi|596021864|ref|XP_007219048.1| (hypothetical protein PRUPE_ppa004680mg [Prunus persica])

HSP 1 Score: 250.8 bits (639), Expect = 3.2e-63
Identity = 160/345 (46.38%), Positives = 217/345 (62.90%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           +F SG+ VWLTG  EL+ Y+C RAKEA+  G QTL+C+P P GVLE  S   I E+W LV
Sbjct: 151 AFSSGSVVWLTGSHELQFYNCDRAKEAQMHGFQTLVCIPTPTGVLEMGSSDSIRENWSLV 210

Query: 64  QQVKSVLESDI-------PNFRNSSSPLPLLEQDVNLEEIGFLSEAPEEEV--------- 123
           QQ KS+  SD+       P+   + SP+  + ++ +  +IG ++   EEE          
Sbjct: 211 QQAKSLFGSDLICSVADQPD-PETRSPIDFINRNFSFADIGIIAGVEEEEDDKKEVALDL 270

Query: 124 -------GRPGRGKSRRTESAGELELSDSDSPVGKAAGKRRGRKR--------NNVEAER 183
                  G PG G    + +  + + SDSD P  K   K+RGRK         N+VEAER
Sbjct: 271 TMMKRKGGNPGTGLYPDSNANPKPDYSDSDGP--KRTPKKRGRKPGLGRDTPLNHVEAER 330

Query: 184 QRREKLNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVRKLKRG 243
           QRREKLN RFYAL +VVPNVSRMDKASLL DAVSYIN LK KV+E+E  ++ E +K+K  
Sbjct: 331 QRREKLNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKTKVDELESQVQRESKKVKVE 390

Query: 244 GGEGVEKQSTTTSEEE-----EEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLM 303
            G+ ++ QSTTTS E+          GS L +VEVK + G DAM+RV+S N N PSA LM
Sbjct: 391 TGDNLDIQSTTTSVEQIAKPPSSSANGSGL-EVEVK-IVGTDAMIRVQSENVNYPSARLM 450

Query: 304 GALRDLEVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLL 311
            ALRDLE+ IHHA+++ +N+LMLQ +++K+P    ++++ K++LL
Sbjct: 451 AALRDLELQIHHASLSCINELMLQDIVLKVPENMRSEDSLKSALL 490

BLAST of Cp4.1LG12g04380 vs. NCBI nr
Match: gi|694400749|ref|XP_009375455.1| (PREDICTED: transcription factor MYC2-like [Pyrus x bretschneideri])

HSP 1 Score: 250.4 bits (638), Expect = 4.2e-63
Identity = 162/342 (47.37%), Positives = 220/342 (64.33%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           +F SG+ VWLTG  EL+ Y+C+RAKEA+  GIQTL+C+P P GVLE  S  +I E+  LV
Sbjct: 154 AFSSGSLVWLTGSHELQFYNCERAKEAQMHGIQTLVCIPTPTGVLELGSSDLIRENGNLV 213

Query: 64  QQVKSVLESDI----PNFRNSSSPLPLLEQDVNLEEIGFLSEAPEE----EVGRPGRGKS 123
           QQ +S+  +D+    P+   + SP+ L+ ++ +  +IG ++   EE    EV        
Sbjct: 214 QQTESLFGADVVWGQPD-PGTRSPIDLINRNFSFADIGIIAGVEEEDDKKEVALDITAMK 273

Query: 124 RRTESA--GELELSD--------SDSPVGKAAGKRRGRKR--------NNVEAERQRREK 183
           ++   A  G L+LS+        SDS   K   K+RGRK         N+VEAERQRREK
Sbjct: 274 KKCGRACPGLLQLSNLNSLNPEHSDSEFPKRTPKKRGRKPGLGRDTPLNHVEAERQRREK 333

Query: 184 LNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVRKLKRGGGEGV 243
           LN RFYAL +VVPNVSRMDKASLL DAVSYIN LK KV+E+E  ++ E +K+K   G+ +
Sbjct: 334 LNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKSKVDELESQVQRESKKVKVETGDNL 393

Query: 244 EKQSTTTSEEE----EEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDL 303
           + QSTTTS E+         GST  ++EVK + G DAM+RV+S N N PSA LM ALRDL
Sbjct: 394 DNQSTTTSVEQTRPPNSSASGSTGLEMEVK-IVGSDAMIRVQSANVNYPSARLMAALRDL 453

Query: 304 EVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           E  IHHA+++ +N+LMLQ V++K+P    ++E+ KA+LL  L
Sbjct: 454 EFEIHHASLSCMNELMLQDVVVKVPDNMRSEESIKAALLKIL 493

BLAST of Cp4.1LG12g04380 vs. NCBI nr
Match: gi|657957697|ref|XP_008370350.1| (PREDICTED: transcription factor MYC2 [Malus domestica])

HSP 1 Score: 250.4 bits (638), Expect = 4.2e-63
Identity = 160/342 (46.78%), Positives = 215/342 (62.87%), Query Frame = 1

Query: 4   SFCSGAAVWLTGPDELERYDCQRAKEAKSQGIQTLLCVPAPFGVLEFASRQIIPEDWGLV 63
           +F SG+ VWLTG  EL+ Y+C+RAKEA+  GIQTL+C+P P GVLE  S  +I E+  LV
Sbjct: 154 AFSSGSLVWLTGSHELQFYNCERAKEAQMHGIQTLVCIPTPTGVLELGSSDLIRENXNLV 213

Query: 64  QQVKSVLESDI----PNFRNSSSPLPLLEQDVNLEEIGFLSEAPEE----EVGRPGRGKS 123
           QQ +S+  SD+    P+   + SP+ ++ ++ +  +IG ++   EE    EV        
Sbjct: 214 QQTESLFGSDVVWGQPD-PGTRSPIDJINRNFSFADIGIIAGVGEEDDKKEVALDITAMK 273

Query: 124 RRTESA----------GELELSDSDSPVGKAAGKRRGRKR--------NNVEAERQRREK 183
           ++   A            L    SDS   K   K+RGRK         N+VEAERQRREK
Sbjct: 274 KKCGXACPDLVQLSNLNSLNPEHSDSEFPKRTPKKRGRKPGLGRDTPLNHVEAERQRREK 333

Query: 184 LNKRFYALLSVVPNVSRMDKASLLWDAVSYINALKGKVEEME--LELEVRKLKRGGGEGV 243
           LN RFYAL +VVPNVSRMDKASLL DAVSYIN LK KV+E+E  ++ E +K+K   G+ +
Sbjct: 334 LNHRFYALRAVVPNVSRMDKASLLSDAVSYINELKXKVDELESQVQRESKKVKVETGDNL 393

Query: 244 EKQSTTTSEEE----EEQGKGSTLFDVEVKRMGGGDAMVRVESHNQNCPSATLMGALRDL 303
           + QSTTTS E+         GST F+ EVK + G DAM+RV+S N N PSA LM ALRDL
Sbjct: 394 DNQSTTTSVEQTRPPNSSASGSTGFETEVK-IVGSDAMIRVQSANVNYPSARLMAALRDL 453

Query: 304 EVHIHHANITNVNDLMLQHVLIKLPHAFSTDEAFKASLLSKL 314
           E  IHHA+++ +N+LMLQ V++K+P    ++E+ KA+LL  L
Sbjct: 454 EFEIHHASLSCMNELMLQDVVVKVPBNMRSEESIKAALLKIL 493

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
BH028_ARATH  8.6e-40  37.25         Transcription factor bHLH28 OS=Arabidopsis thaliana GN=BHLH28 PE=2 SV=1
BH014_ARATH  6.8e-37  34.19         Transcription factor bHLH14 OS=Arabidopsis thaliana GN=BHLH14 PE=2 SV=1
MYC2_ARATH   7.8e-33  45.05         Transcription factor MYC2 OS=Arabidopsis thaliana GN=MYC2 PE=1 SV=2
MYC4_ARATH   3.4e-28  40.43         Transcription factor MYC4 OS=Arabidopsis thaliana GN=MYC4 PE=1 SV=1
MYC2_ORYSJ   1.7e-27  38.77         Transcription factor MYC2 OS=Oryza sativa subsp. japonica GN=MYC2 PE=1 SV=1

Match Name        E-value  Identity (%)  Description
A0A0A0KFN7_CUCSA  4.2e-94  60.29         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G107910 PE=4 SV=1
A0A061DS37_THECC  6.0e-64  44.07         Basic helix-loop-helix DNA-binding family protein OS=Theobroma cacao GN=TCM_0050...
M5X2D4_PRUPE      2.3e-63  46.38         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004680mg PE=4 SV=1
A0A059BZD6_EUCGR  6.6e-63  45.56         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00019 PE=4 SV=1
A9PF26_POPTR      1.1e-62  44.41         Putative uncharacterized protein OS=Populus trichocarpa PE=2 SV=1

Match Name   E-value  Identity (%)  Description
AT5G46830.1  4.8e-41  37.25         NACL-inducible gene 1
AT4G00870.1  3.8e-38  34.19         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G32640.1  4.4e-34  45.05         Basic helix-loop-helix (bHLH) DNA-binding family protein
AT4G17880.1  1.9e-29  40.43         Basic helix-loop-helix (bHLH) DNA-binding family protein
AT5G46760.1  1.6e-28  37.61         Basic helix-loop-helix (bHLH) DNA-binding family protein

Match Name                        E-value  Identity (%)  Description
gi|449445714|ref|XP_004140617.1|  6.1e-94  60.29         PREDICTED: transcription factor MYC2-like [Cucumis sativus]
gi|590720902|ref|XP_007051457.1|  8.5e-64  44.07         Basic helix-loop-helix DNA-binding family protein [Theobroma cacao]
gi|596021864|ref|XP_007219048.1|  3.2e-63  46.38         hypothetical protein PRUPE_ppa004680mg [Prunus persica]
gi|694400749|ref|XP_009375455.1|  4.2e-63  47.37         PREDICTED: transcription factor MYC2-like [Pyrus x bretschneideri]
gi|657957697|ref|XP_008370350.1|  4.2e-63  46.78         PREDICTED: transcription factor MYC2 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity
Vocabulary: INTERPRO
Term       Definition
IPR025610  MYC/MYB_N
IPR011598  bHLH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0044424 intracellular part
molecular_function GO:0046983 protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g04380.1  Cp4.1LG12g04380.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                 Source   Source Term        Source Description                        Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10  -                                         coord: 143..203, score: 3.6
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PFAM     PF00010            HLH                                       coord: 143..191, score: 1.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  SMART    SM00353            finulus                                   coord: 148..197, score: 1.2
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888            BHLH                                      coord: 142..191, score: 15
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain  coord: 139..204, score: 7.98
IPR025610  Transcription factor MYC/MYB N-terminal         PFAM     PF14215            bHLH-MYC_N                                coord: 5..69, score: 6.1
None       No IPR available                                unknown  Coil               Coil                                      coord: 141..161, score: -; coord: 188..208, scor
None       No IPR available                                PANTHER  PTHR11514          MYC                                       coord: 4..313, score: 2.0
None       No IPR available                                PANTHER  PTHR11514:SF40     TRANSCRIPTION FACTOR BHLH14               coord: 4..313, score: 2.0

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG12g04380  Cucurbita moschata (Rifu)  cmocpeB335
Cp4.1LG12g04380  Bottle gourd (USVL1VR-Ls)  cpelsiB101
Cp4.1LG12g04380  Silver-seed gourd          carcpeB1250