Cp4.1LG10g01760 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g01760
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Basic helix-loop-helix (bHLH) DNA-binding family protein
Location: Cp4.1LG10 : 3034203 .. 3036034 (-)
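
The Location field can be sanity-checked with a few lines of code. The sketch below is illustrative only: the helper names `span_length` and `reverse_complement` are made up here, and it assumes the coordinates are 1-based and inclusive and that a feature on the (-) strand is displayed as the reverse complement of the reference sequence.

```python
# Hypothetical helpers; coordinate convention (1-based, inclusive) is an assumption.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def span_length(start: int, end: int) -> int:
    """Length of an interval given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (A, C, G, T, N only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


# Span of Cp4.1LG10 : 3034203 .. 3036034
print(span_length(3034203, 3036034))
```

Under those assumptions the feature spans 1832 bp, and `reverse_complement` would convert between the reference-strand orientation and the transcript orientation.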
The following sequences are available for this feature:

Gene sequence (with intron)

AAGCTACCTAACAATCTTCTTCCTCGCTTCATTCGATTGTGAACTGATAAACACACACACCCAAACCTTAGTTTTGTTGACCCTTTTCGCAATGGAAATCTCATCTGCAAAATGGTTATCTGATATGGTAAACACTTTTATGCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNCTTCGTTTTCGCTTCTTTTTTTGCTTGATCTTATGTTTTTTGGGTTTTGGTTATAATTAGGATTTGGAATCTGCATTTATGGATGATTTTGAAATGAACCCATTTGAGTGCACGCTAGACGAGCTCAGTTTCCAAACTTTCTCTGACGAAAGCCACACATCCCACCTAGATCTTGAGAACTCCGTACAAACTCCGCCGTCGCCGCCGCCGCCGGCCAAGCAGCCGAGGACCAGCGGCAGCTGGAACAACTCTTCCACGACCCGTCAAATTGCTTCCATGGCTGCTTCGTCTTCTTCATCACACATCATTTCATTTGGGAATTCCCATTCTTCTTCTCCTCCTGCTTCCAACAAATTAGTTGGGAACAATGGCAATTACAGCAACGTGAAGCCCAAATTCGAAATTGGGTGCGAAGGGAACATTGATTTGTCATCGGCGATCCCTCAAGGTTCCTATGAGAACAATCCAAATGGTTCCCCAAAATACGATGGTGTGGGAATGAAGAGGAGCGCTTCGGCTATGAATTATCGCACCGCTTTGGTTGCTCAAGATCATGTCATAGCTGAGCGGAAGCGTAGAGAAAAGCTCAGCCAGCGATTCGTTGCTCTTTCAGCTCTTATTCCACACCTCAAGAAGGTGAGTACATGATCTAATTTTGACCTTATAATTCAAGATTTTATTGACATTATTTAACGGATCAAATGGGTTATTGGTAATAGATGGACAAAGCGTCTATTCTTGGGGATGCAATAACATACATCAAGGATCTTCAAGATCGTTTGAAGGTTGCGGATGAAGAAGCAGCCAAATCAAGAGTGGAATCAGTGGTGTTTGTGAACAGATCCGAGGATGTCTCTGCTGTGGTGGAAGACGATTCCTCAGAGGAAAACAGCTCATCCGACAGAGCCATTCCAGAGATAGAAGCCAGAGTGTCTGGGAAGGATGTTCTGTTGAAGATTCATGGCAAGAAATGCAAAGGCTGCCTTTCAAACATACTAAACCACATAGAGGAGCTTAATCTAACAGTTCTCAACAGCAGTGCCTTGCCATTTGGCAATTTCAGGATTGATATAACTATTATAGCTCAGGTACTGTGGATATGTTTCCATTTCAGAACTGTCATTTCTGGAGAATGAGCTTATTACGGGACTAAATTCATTTGCTTTATTATGTAATTGCAGATGGGTGATGGCTTTTCCATGACAGTGAAGGAGCTAGTGCAGAAACTACGACAGACTTGCCTACAATTCGTGTAAAATTACAGAGCTTTGTCACTCTCAGTCACAAAGGTTGGCTTTTGCCTACAATGCTCCAATTAACTGTTGATGATCTGCTTCCATTTTACTCCAGAATATGTTCAGCACGAGGGCGTTTGTTTTTTTTTAGAAAAATTTCTTTGCATTTTTTTGCGTGGGGCAGATCTTTGACCACGTTAAGGAAATTTAGGATAACAAAACACCCACATGTGCTGTAAATCCAAGACTTTTTTCCCCCAAGTTCCCGTTTTTTTGGGGAGTCTCCTTTGCATACGGATCTATAATTAATTATTGTAAGATGTTGATTATTATGGATTAATTAAGACAACTACAAGGATATTACAACTAATTATCTGGATCTGACAGCTCCATTGCAACTTAATTAA

mRNA sequence

AAGCTACCTAACAATCTTCTTCCTCGCTTCATTCGATTGTGAACTGATAAACACACACACCCAAACCTTAGTTTTGTTGACCCTTTTCGCAATGGAAATCTCATCTGCAAAATGGTTATCTGATATGGATTTGGAATCTGCATTTATGGATGATTTTGAAATGAACCCATTTGAGTGCACGCTAGACGAGCTCAGTTTCCAAACTTTCTCTGACGAAAGCCACACATCCCACCTAGATCTTGAGAACTCCGTACAAACTCCGCCGTCGCCGCCGCCGCCGGCCAAGCAGCCGAGGACCAGCGGCAGCTGGAACAACTCTTCCACGACCCGTCAAATTGCTTCCATGGCTGCTTCGTCTTCTTCATCACACATCATTTCATTTGGGAATTCCCATTCTTCTTCTCCTCCTGCTTCCAACAAATTAGTTGGGAACAATGGCAATTACAGCAACGTGAAGCCCAAATTCGAAATTGGGTGCGAAGGGAACATTGATTTGTCATCGGCGATCCCTCAAGGTTCCTATGAGAACAATCCAAATGGTTCCCCAAAATACGATGGTGTGGGAATGAAGAGGAGCGCTTCGGCTATGAATTATCGCACCGCTTTGGTTGCTCAAGATCATGTCATAGCTGAGCGGAAGCGTAGAGAAAAGCTCAGCCAGCGATTCGTTGCTCTTTCAGCTCTTATTCCACACCTCAAGAAGATGGACAAAGCGTCTATTCTTGGGGATGCAATAACATACATCAAGGATCTTCAAGATCGTTTGAAGGTTGCGGATGAAGAAGCAGCCAAATCAAGAGTGGAATCAGTGGTGTTTGTGAACAGATCCGAGGATGTCTCTGCTGTGGTGGAAGACGATTCCTCAGAGGAAAACAGCTCATCCGACAGAGCCATTCCAGAGATAGAAGCCAGAGTGTCTGGGAAGGATGTTCTGTTGAAGATTCATGGCAAGAAATGCAAAGGCTGCCTTTCAAACATACTAAACCACATAGAGGAGCTTAATCTAACAGTTCTCAACAGCAGTGCCTTGCCATTTGGCAATTTCAGGATTGATATAACTATTATAGCTCAGATGGGTGATGGCTTTTCCATGACAGTGAAGGAGCTAGTGCAGAAACTACGACAGACTTGCCTACAATTCGTGTAAAATTACAGAGCTTTGTCACTCTCAGTCACAAAGGTTGGCTTTTGCCTACAATGCTCCAATTAACTGTTGATGATCTGCTTCCATTTTACTCCAGAATATGTTCAGCACGAGGGCGTTTGTTTTTTTTTAGAAAAATTTCTTTGCATTTTTTTGCGTGGGGCAGATCTTTGACCACGTTAAGGAAATTTAGGATAACAAAACACCCACATGTGCTGTAAATCCAAGACTTTTTTCCCCCAAGTTCCCGTTTTTTTGGGGAGTCTCCTTTGCATACGGATCTATAATTAATTATTGTAAGATGTTGATTATTATGGATTAATTAAGACAACTACAAGGATATTACAACTAATTATCTGGATCTGACAGCTCCATTGCAACTTAATTAA

Coding sequence (CDS)

ATGGAAATCTCATCTGCAAAATGGTTATCTGATATGGATTTGGAATCTGCATTTATGGATGATTTTGAAATGAACCCATTTGAGTGCACGCTAGACGAGCTCAGTTTCCAAACTTTCTCTGACGAAAGCCACACATCCCACCTAGATCTTGAGAACTCCGTACAAACTCCGCCGTCGCCGCCGCCGCCGGCCAAGCAGCCGAGGACCAGCGGCAGCTGGAACAACTCTTCCACGACCCGTCAAATTGCTTCCATGGCTGCTTCGTCTTCTTCATCACACATCATTTCATTTGGGAATTCCCATTCTTCTTCTCCTCCTGCTTCCAACAAATTAGTTGGGAACAATGGCAATTACAGCAACGTGAAGCCCAAATTCGAAATTGGGTGCGAAGGGAACATTGATTTGTCATCGGCGATCCCTCAAGGTTCCTATGAGAACAATCCAAATGGTTCCCCAAAATACGATGGTGTGGGAATGAAGAGGAGCGCTTCGGCTATGAATTATCGCACCGCTTTGGTTGCTCAAGATCATGTCATAGCTGAGCGGAAGCGTAGAGAAAAGCTCAGCCAGCGATTCGTTGCTCTTTCAGCTCTTATTCCACACCTCAAGAAGATGGACAAAGCGTCTATTCTTGGGGATGCAATAACATACATCAAGGATCTTCAAGATCGTTTGAAGGTTGCGGATGAAGAAGCAGCCAAATCAAGAGTGGAATCAGTGGTGTTTGTGAACAGATCCGAGGATGTCTCTGCTGTGGTGGAAGACGATTCCTCAGAGGAAAACAGCTCATCCGACAGAGCCATTCCAGAGATAGAAGCCAGAGTGTCTGGGAAGGATGTTCTGTTGAAGATTCATGGCAAGAAATGCAAAGGCTGCCTTTCAAACATACTAAACCACATAGAGGAGCTTAATCTAACAGTTCTCAACAGCAGTGCCTTGCCATTTGGCAATTTCAGGATTGATATAACTATTATAGCTCAGATGGGTGATGGCTTTTCCATGACAGTGAAGGAGCTAGTGCAGAAACTACGACAGACTTGCCTACAATTCGTGTAA

Protein sequence

MEISSAKWLSDMDLESAFMDDFEMNPFECTLDELSFQTFSDESHTSHLDLENSVQTPPSPPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPASNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAAKSRVESVVFVNRSEDVSAVVEDDSSEENSSSDRAIPEIEARVSGKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKLRQTCLQFV
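
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard code applies to this nuclear gene; the function name `translate` and the compact codon-table construction are illustrative choices, not part of the database page.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, stop_at_stop: bool = True) -> str:
    """Translate a DNA coding sequence codon by codon.

    Codons containing unknown bases are rendered as 'X'; a trailing
    partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS shown above.
print(translate("ATGGAAATCTCATCTGCAAAATGGTTATCTGATATG"))
```

Applied to the first 36 bases of the CDS, this yields `MEISSAKWLSDM`, which matches the start of the protein sequence; the final codons `CAATTCGTGTAA` give `QFV` followed by a stop.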
BLAST of Cp4.1LG10g01760 vs. Swiss-Prot
Match: BH018_ARATH (Transcription factor bHLH18 OS=Arabidopsis thaliana GN=BHLH18 PE=2 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 7.6e-37
Identity = 98/191 (51.31%), Positives = 129/191 (67.54%), Query Frame = 1

Query: 158 GMKRSASAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITY 217
           G KR+ S    R+   AQDH++AERKRREKL+QRFVALSALIP LKKMDKAS+LGDAI +
Sbjct: 110 GTKRAQSLT--RSQSNAQDHILAERKRREKLTQRFVALSALIPGLKKMDKASVLGDAIKH 169

Query: 218 IKDLQDRLKVADEEAAKSRVESVVFVNRSEDVSAVVEDDSSEENS-----SSDRAIPEIE 277
           IK LQ+ +K  +E+  +  +ESVV V +S  V       SS  +S     SS   +PEIE
Sbjct: 170 IKYLQESVKEYEEQKKEKTMESVVLVKKSSLVLDENHQPSSSSSSDGNRNSSSSNLPEIE 229

Query: 278 ARVSGKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGF 337
            RVSGKDVL+KI  +K KG +  I+  IE+L L++ NS+ LPFG    DI+IIAQ  + F
Sbjct: 230 VRVSGKDVLIKILCEKQKGNVIKIMGEIEKLGLSITNSNVLPFGP-TFDISIIAQKNNNF 289

Query: 338 SMTVKELVQKL 344
            M ++++V+ L
Sbjct: 290 DMKIEDVVKNL 297
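
The percentages in an HSP header are the match counts divided by the alignment length. As a small worked check for the bHLH18 hit above (98 identical and 129 positive positions over 191 aligned columns), the sketch below recomputes them; the function name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Matches as a percentage of the alignment length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


print(percent(98, 191))   # identical positions
print(percent(129, 191))  # identical plus conservative substitutions
```

These evaluate to 51.31 and 67.54, in agreement with the values reported for this HSP.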

BLAST of Cp4.1LG10g01760 vs. Swiss-Prot
Match: BH019_ARATH (Transcription factor bHLH19 OS=Arabidopsis thaliana GN=BHLH19 PE=2 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 2.9e-36
Identity = 98/238 (41.18%), Positives = 148/238 (62.18%), Query Frame = 1

Query: 122 KPKFEIGCEGNIDLSSAIPQGSYENNPNGSP--------KYDGVGMKRSASAMNYRTALV 181
           KPK  +     I+    +    + +N   SP        K  G G KR   +   R+ ++
Sbjct: 57  KPKAAVKPMMKINNKQQLISFDFSSNVISSPAAEEIIMDKLVGRGTKRKTCSHGTRSPVL 116

Query: 182 AQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAA 241
           A++HV+AERKRREKLS++F+ALSAL+P LKK DK +IL DAI+ +K LQ++L+   EE  
Sbjct: 117 AKEHVLAERKRREKLSEKFIALSALLPGLKKADKVTILDDAISRMKQLQEQLRTLKEEKE 176

Query: 242 KSR-VESVVFVNRSEDVSAVVEDDSSEENSSS-----DRAIPEIEARVSGKDVLLKIHGK 301
            +R +ES++ V +S+      +++ +   S S     D+A+PEIEA++S  D+L++I  +
Sbjct: 177 ATRQMESMILVKKSK---VFFDEEPNLSCSPSVHIEFDQALPEIEAKISQNDILIRILCE 236

Query: 302 KCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTV-KELVQKLR 345
           K KGC+ NILN IE   L + NS  LPFG+  +DIT++AQM   FSM++ K+LV+ LR
Sbjct: 237 KSKGCMINILNTIENFQLRIENSIVLPFGDSTLDITVLAQMDKDFSMSILKDLVRNLR 291

BLAST of Cp4.1LG10g01760 vs. Swiss-Prot
Match: BH025_ARATH (Transcription factor bHLH25 OS=Arabidopsis thaliana GN=BHLH25 PE=2 SV=2)

HSP 1 Score: 150.2 bits (378), Expect = 4.2e-35
Identity = 133/375 (35.47%), Positives = 193/375 (51.47%), Query Frame = 1

Query: 1   MEISSAKWLSDMDLE-SAFMDDFEMNPFECTLDEL------SFQTFSDESHTSHLD---- 60
           M I S +W S+ ++E ++ +  F MN     + E       SF T +D S+   ++    
Sbjct: 1   MSILSTRWFSEQEIEENSIIQQFHMNSIVGEVQEAQYIFPHSFTTNNDPSYDDLIEMKPP 60

Query: 61  --LENSVQTPPSPPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGN------SH 120
             LE +  +P S  PP  +P                      SSS I+SF +       H
Sbjct: 61  KILETTYISPSSHLPPNSKPHH----------------IHRHSSSRILSFEDYGSNDMEH 120

Query: 121 SSSPPASNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKR 180
             SP   N +           PK E   + +        Q S E N  G+ +       +
Sbjct: 121 EYSPTYLNSIFS---------PKLEAQVQPH--------QKSDEFNRKGTKRAQPFSRNQ 180

Query: 181 SASAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDL 240
           S           AQDH+IAERKRREKL+QRFVALSAL+P LKKMDKAS+LGDA+ +IK L
Sbjct: 181 SN----------AQDHIIAERKRREKLTQRFVALSALVPGLKKMDKASVLGDALKHIKYL 240

Query: 241 QDRLKVADEEAAKSRVESVVFVNRSEDVSAVVEDD-----SSEENSSSDRAIPEIEARVS 300
           Q+R+   +E+  + R+ES+V V +S+    +++D+     SS E+  SD  +PEIE R S
Sbjct: 241 QERVGELEEQKKERRLESMVLVKKSK---LILDDNNQSFSSSCEDGFSDLDLPEIEVRFS 300

Query: 301 GKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTV 352
            +DVL+KI  +K KG L+ I+  IE+L++ + NSS L FG   +DITIIA+    F MT+
Sbjct: 301 DEDVLIKILCEKQKGHLAKIMAEIEKLHILITNSSVLNFGP-TLDITIIAKKESDFDMTL 328

BLAST of Cp4.1LG10g01760 vs. Swiss-Prot
Match: BH020_ARATH (Transcription factor NAI1 OS=Arabidopsis thaliana GN=NAI1 PE=2 SV=1)

HSP 1 Score: 136.7 bits (343), Expect = 4.8e-31
Identity = 103/266 (38.72%), Positives = 157/266 (59.02%), Query Frame = 1

Query: 99  NSHSSSPPASNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVG 158
           NS SSSP +S+    ++G+ ++    F     G+ D  +  P  +  N  N       VG
Sbjct: 63  NSTSSSPSSSS----SSGSRTSQVISF-----GSPDTKTN-PVETSLNFSNQVSMDQKVG 122

Query: 159 MKRSASAMN--YRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAIT 218
            KR     N   R   + ++HV+AERKRR+KL++R +ALSAL+P LKK DKA++L DAI 
Sbjct: 123 SKRKDCVNNGGRREPHLLKEHVLAERKRRQKLNERLIALSALLPGLKKTDKATVLEDAIK 182

Query: 219 YIKDLQDRLKVADEE--AAKSRVESVVFVNRSEDVSAVVEDDSSEENS----------SS 278
           ++K LQ+R+K  +EE    K   +S++ V RS+     ++DDSS  +S          SS
Sbjct: 183 HLKQLQERVKKLEEERVVTKKMDQSIILVKRSQ---VYLDDDSSSYSSTCSAASPLSSSS 242

Query: 279 D------RAIPEIEARVSGKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNF 338
           D      + +P IEARVS +D+L+++H +K KGC+  IL+ +E+  L V+NS  LPFGN 
Sbjct: 243 DEVSIFKQTMPMIEARVSDRDLLIRVHCEKNKGCMIKILSSLEKFRLEVVNSFTLPFGNS 302

Query: 339 RIDITIIAQMGDGFSMTVKELVQKLR 345
            + ITI+ +M + FS  V+E+V+ +R
Sbjct: 303 TLVITILTKMDNKFSRPVEEVVKNIR 315

BLAST of Cp4.1LG10g01760 vs. Swiss-Prot
Match: MYC3_ARATH (Transcription factor MYC3 OS=Arabidopsis thaliana GN=MYC3 PE=1 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 2.0e-13
Identity = 56/161 (34.78%), Positives = 96/161 (59.63%), Query Frame = 1

Query: 176 DHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVA--DEEAA 235
           +HV AER+RREKL+QRF +L A++P++ KMDKAS+LGDAI+YI +L+ +L+ A  D+E  
Sbjct: 415 NHVEAERQRREKLNQRFYSLRAVVPNVSKMDKASLLGDAISYINELKSKLQQAESDKEEI 474

Query: 236 KSRVE--SVVFVNRSEDVSAVVEDDSSEENSSSDRAIPEIEARVSGKDVLLKIHGKKCKG 295
           + +++  S    N     S   E  SS ++S++     EI+ ++ G DV++++   K   
Sbjct: 475 QKKLDGMSKEGNNGKGCGSRAKERKSSNQDSTASSIEMEIDVKIIGWDVMIRVQCGKKDH 534

Query: 296 CLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGF 333
             +  +  ++EL+L V ++S     +  I    + +MG  F
Sbjct: 535 PGARFMEALKELDLEVNHASLSVVNDLMIQQATV-KMGSQF 574

BLAST of Cp4.1LG10g01760 vs. TrEMBL
Match: A0A0A0KG27_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G497110 PE=4 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 2.3e-109
Identity = 243/358 (67.88%), Positives = 277/358 (77.37%), Query Frame = 1

Query: 1   MEISSAKWLSDMDLESAFMDDFEMNPFECTLDELS-FQTFSDESHTSHLDLENSVQTPPS 60
           MEISSAKWLS+M+LES+FM+D EMNPFECTL+ELS FQTFSDES+TSH+DL+NS    P+
Sbjct: 1   MEISSAKWLSEMELESSFMNDLEMNPFECTLEELSSFQTFSDESYTSHVDLDNSSVQTPA 60

Query: 61  PPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPASNKLVGNNGNYS 120
            PPPAKQ RTS     S ++R+I+SMA SSSSS IISFGN   S   A      NN N  
Sbjct: 61  APPPAKQARTS-----SGSSRRISSMATSSSSSQIISFGNIEMSPMVAQPSYDNNNNNNK 120

Query: 121 NVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASA---MNYRTALVAQD 180
                                  +Y  +PN   K  GVG+KRSA+A    N R+ LVAQD
Sbjct: 121 T---------------------SNYYCSPN---KNHGVGIKRSAAAAMNSNNRSPLVAQD 180

Query: 181 HVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAAKSR 240
           HV+AERKRREKLSQRFVALSALIP LKKMDKASILGDAITYIKDLQ+RLKVA+E+AAK+ 
Sbjct: 181 HVLAERKRREKLSQRFVALSALIPDLKKMDKASILGDAITYIKDLQERLKVANEQAAKAT 240

Query: 241 VESVVFVNRSEDVSAVV-EDDSSEEN--SSSDRAIPEIEARVSGKDVLLKIHGKKCKGCL 300
           VESVVFVN+S+D S ++  DDSSEEN  SSSD AIP++EARVSGKDVLL+IHGKKCKGCL
Sbjct: 241 VESVVFVNKSDDASTIIASDDSSEENSSSSSDGAIPDVEARVSGKDVLLRIHGKKCKGCL 300

Query: 301 SNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKLRQTCLQFV 352
           SNILN IE+LNLTVLNSSALPFGNFR+DITIIAQM D FSMTVKELVQKLRQ  L+F+
Sbjct: 301 SNILNQIEKLNLTVLNSSALPFGNFRLDITIIAQMDDDFSMTVKELVQKLRQASLEFM 329

BLAST of Cp4.1LG10g01760 vs. TrEMBL
Match: M5WL55_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017640mg PE=4 SV=1)

HSP 1 Score: 283.9 bits (725), Expect = 2.7e-73
Identity = 179/349 (51.29%), Positives = 244/349 (69.91%), Query Frame = 1

Query: 15  ESAFMDDFEMNPFECTLDELSFQTFSDESHTSHLDL---------ENSVQTPPSPPP--- 74
           +  F+  +EMN  + +LD+L+FQ+FS ES++S+ +            S++TP        
Sbjct: 3   DPTFIHQYEMNSLDYSLDDLNFQSFSSESYSSYPNFTPKATHNFSNASIETPQQAGTHER 62

Query: 75  PAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPASNKLVGNNGNYSNVK 134
           PAKQP+   +WN  +T   I + AASSSSSH+ISF NS+SS P +S +  G   N   +K
Sbjct: 63  PAKQPKNHTTWNPCTTDHTIMAKAASSSSSHLISFDNSNSSPPTSSQQFYGTLDN--TMK 122

Query: 135 PKFEIG-CEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASAMNYRTALVAQDHVIAE 194
           PK E+    G ++L++ I QGSY+     SPK+ G G+KR+A+    R+ L AQDHV+AE
Sbjct: 123 PKNEVEYSNGKLNLTTLISQGSYDPQ-TCSPKH-GQGIKRAATVT--RSPLHAQDHVLAE 182

Query: 195 RKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAAKSRVESVV 254
           RKRREKLSQRF+ALSAL+P LKKMDKAS+LGDAI Y+K LQ+R K+ +E+A K  VE+VV
Sbjct: 183 RKRREKLSQRFIALSALVPGLKKMDKASVLGDAIKYVKHLQERTKMLEEKAVKKTVEAVV 242

Query: 255 FVNRSEDVSAVVEDDSSEEN--SSSDRAIPEIEARVSGKDVLLKIHGKKCKGCLSNILNH 314
           FV R++  SA  +  SS+EN  SSSD+ +PEIEARVS K+VL+++H +K KGCL+ IL+ 
Sbjct: 243 FVKRTQ-YSADDDISSSDENFESSSDQPLPEIEARVSDKEVLIRVHCEKTKGCLAKILSE 302

Query: 315 IEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKLRQTCL 349
           IE L+LT++NSS LPFGN  +DIT+IAQM   FSMTVK+LV+ LRQ  +
Sbjct: 303 IESLDLTIVNSSVLPFGNSTLDITVIAQMDAEFSMTVKDLVKNLRQNLI 344

BLAST of Cp4.1LG10g01760 vs. TrEMBL
Match: W9RQB0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_016091 PE=4 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 6.0e-65
Identity = 176/380 (46.32%), Positives = 240/380 (63.16%), Query Frame = 1

Query: 1   MEISSAKWLSDMDLESAFMDDFE-MNPFE--CTLDELSFQTFSDESHTSH------LDLE 60
           MEI SAKWLS+++++  F   ++ +N  +   + D+++FQ+FS ES++SH       + +
Sbjct: 1   MEIPSAKWLSELEMDYNFFHQYDQVNSLDHHYSFDDINFQSFSSESYSSHNTNFAPANSQ 60

Query: 61  N--SVQTPPSPPP----------PAKQPRTSGSWNNSSTT--------RQIASMAASSSS 120
           N     TP    P          PAKQ + + SWNN+++          + ++ AASSSS
Sbjct: 61  NLGGAATPVDQAPHHQITDFESRPAKQLKINNSWNNNNSCTTNDNYHYHKTSAKAASSSS 120

Query: 121 SHIISFGNSHSSSPPASNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGS 180
           S IISF    SS+     K   N       +PK E    G+ D         ++ N N S
Sbjct: 121 SQIISFEKYASSAATTPEKYYDNLDQSPVKQPKDEPA--GSTDKYMIFQSSYHDRNENFS 180

Query: 181 PKYDGVGMKRSASAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASIL 240
           PK   V  ++  +A   R+ L AQDHVIAER+RREKL+QR++ALSA++P LKKMDKAS+L
Sbjct: 181 PKLGQVIREKRPAAAMSRSPLHAQDHVIAERRRREKLNQRYIALSAVVPGLKKMDKASVL 240

Query: 241 GDAITYIKDLQDRLKVADEEAAKSRVESVVFVNRSEDVSAVVEDDSSEEN--SSSDRAIP 300
           GDAITYIK LQ+R+ + +E+AAK  VESVVFV RS  +SA  E  SS+EN  SSSD+ +P
Sbjct: 241 GDAITYIKTLQERVSILEEQAAKKTVESVVFVKRSH-LSADDEISSSDENFDSSSDQPLP 300

Query: 301 EIEARVSGKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMG 350
           EIEARVSGKDVL++IH +K KGCLSNIL  IE+L+LT++NSS LPFG     ITI+AQM 
Sbjct: 301 EIEARVSGKDVLIRIHCEKQKGCLSNILCEIEKLHLTIVNSSVLPFGGSTTHITIVAQMD 360

BLAST of Cp4.1LG10g01760 vs. TrEMBL
Match: A0A061DHY3_THECC (Basic helix-loop-helix DNA-binding superfamily protein, putative OS=Theobroma cacao GN=TCM_000895 PE=4 SV=1)

HSP 1 Score: 242.7 bits (618), Expect = 6.9e-61
Identity = 178/371 (47.98%), Positives = 237/371 (63.88%), Query Frame = 1

Query: 1   MEISSAKWLSDMDL-ESAFMDDFEMNPFE--CTLDELSF--------QTFSDESHTSHLD 60
           M+ SSAKWLS++ + E   +    MN      T ++++         Q+FS ES++S+ +
Sbjct: 1   MDSSSAKWLSELGMDEYNIIHQCHMNSLAELTTAEDIATALTAGNFKQSFSSESYSSYPN 60

Query: 61  LENSVQTPPSPPP------PAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSS 120
                 T  S         P KQ +TS SWN+S+TT  I     SS +S I+SF    S+
Sbjct: 61  FNTKNATTFSGSSIETCERPTKQIKTSTSWNSSTTTEHIPQKP-SSPTSQILSF--EKST 120

Query: 121 SPPASNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSA 180
           S PA+++   N  +++ +KPK E    GN++ S  I  G Y  N N +PK    G+KR+ 
Sbjct: 121 SLPANSQQFYNIDHHA-MKPKDETVSSGNMNFSPVITNGPY-GNTNYAPK-PNPGIKRTY 180

Query: 181 SAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQD 240
           S    R+   AQDH++AERKRREKLSQRF+ALSA++P LKKMDKAS+LGDAI Y+K LQ+
Sbjct: 181 SMT--RSPSHAQDHIMAERKRREKLSQRFIALSAIVPGLKKMDKASVLGDAIKYVKQLQE 240

Query: 241 RLKVADEEAAKSRVESVVFVNRSEDVSAVVEDDSSEENS---SSDRAIPEIEARVSGKDV 300
           RLKV +E+  K  VESVVFV +S+ +SA  E  S EENS   SSD A+PEIEARVS  DV
Sbjct: 241 RLKVLEEQTKKRTVESVVFVKKSQ-LSADDETSSCEENSDSQSSDAALPEIEARVSDNDV 300

Query: 301 LLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELV 352
           L++IH +K KG +  IL+ IE L+LTV+NSS LPFGN  +DITIIAQ    FSMTVK+LV
Sbjct: 301 LIRIHCEKQKGFVVKILSEIENLHLTVVNSSVLPFGNSTLDITIIAQKDAEFSMTVKDLV 360

BLAST of Cp4.1LG10g01760 vs. TrEMBL
Match: A0A0D2QXV7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G228900 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 4.6e-57
Identity = 166/368 (45.11%), Positives = 229/368 (62.23%), Query Frame = 1

Query: 1   MEISSAKWLSDMDL-ESAFMDDFEMNPFE--CTLDELSF---------QTFSDESHTSHL 60
           M+ SSAKWLS++ + E   +    MN      T D+L+          Q+FS ES++S+ 
Sbjct: 1   MDSSSAKWLSELGMDEYNIIHQCHMNTLAELTTTDDLATALVGGGNLKQSFSSESYSSYP 60

Query: 61  DL-ENSVQTPPSPPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPA 120
           +L   +  T  S     + P    + +  +T         SS +S I+SFGNS+S  P  
Sbjct: 61  NLYTKNTTTTISGSSSIETPDYRPAKHLKTTHHHHVPPKPSSPTSQILSFGNSNSL-PAT 120

Query: 121 SNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASAMN 180
           S+    N  N   V PK E    GN++    +  G YE+  N +PK +  G+KR+ S   
Sbjct: 121 SHHHYYNVDN--TVNPKDETLSSGNMNFLPPVTNGPYEST-NYAPKINNHGVKRTYSMT- 180

Query: 181 YRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKV 240
            RT  VAQDH+IAERKRREKLSQRF+ALSA++P LKKMDKAS+LGDAI Y+K LQ+R+KV
Sbjct: 181 -RTPSVAQDHIIAERKRREKLSQRFIALSAIVPGLKKMDKASVLGDAIKYVKQLQERVKV 240

Query: 241 ADEEAAKSRVESVVFVNRS----EDVSAVVEDDSSEENSSSDRAIPEIEARVSGKDVLLK 300
            +E+  K  VESVVFV +S    +D S+  ED++SE   SSD A+PEIEARVS  DVL++
Sbjct: 241 LEEQTKKRTVESVVFVRKSQLSADDESSSCEDNNSELGPSSDAALPEIEARVSDHDVLVR 300

Query: 301 IHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKL 352
           IH +  KG +  IL+ IE L+L+V+NS+ALPFGN  +DITIIA+    F++TVK+LV+ +
Sbjct: 301 IHCENHKGFVPKILSEIENLHLSVVNSTALPFGNSTLDITIIAKKDSEFNITVKDLVKDI 360

BLAST of Cp4.1LG10g01760 vs. TAIR10
Match: AT2G22750.2 (AT2G22750.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 156.0 bits (393), Expect = 4.3e-38
Identity = 98/191 (51.31%), Positives = 129/191 (67.54%), Query Frame = 1

Query: 158 GMKRSASAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITY 217
           G KR+ S    R+   AQDH++AERKRREKL+QRFVALSALIP LKKMDKAS+LGDAI +
Sbjct: 110 GTKRAQSLT--RSQSNAQDHILAERKRREKLTQRFVALSALIPGLKKMDKASVLGDAIKH 169

Query: 218 IKDLQDRLKVADEEAAKSRVESVVFVNRSEDVSAVVEDDSSEENS-----SSDRAIPEIE 277
           IK LQ+ +K  +E+  +  +ESVV V +S  V       SS  +S     SS   +PEIE
Sbjct: 170 IKYLQESVKEYEEQKKEKTMESVVLVKKSSLVLDENHQPSSSSSSDGNRNSSSSNLPEIE 229

Query: 278 ARVSGKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGF 337
            RVSGKDVL+KI  +K KG +  I+  IE+L L++ NS+ LPFG    DI+IIAQ  + F
Sbjct: 230 VRVSGKDVLIKILCEKQKGNVIKIMGEIEKLGLSITNSNVLPFGP-TFDISIIAQKNNNF 289

Query: 338 SMTVKELVQKL 344
            M ++++V+ L
Sbjct: 290 DMKIEDVVKNL 297

BLAST of Cp4.1LG10g01760 vs. TAIR10
Match: AT2G22760.1 (AT2G22760.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 154.1 bits (388), Expect = 1.6e-37
Identity = 98/238 (41.18%), Positives = 148/238 (62.18%), Query Frame = 1

Query: 122 KPKFEIGCEGNIDLSSAIPQGSYENNPNGSP--------KYDGVGMKRSASAMNYRTALV 181
           KPK  +     I+    +    + +N   SP        K  G G KR   +   R+ ++
Sbjct: 57  KPKAAVKPMMKINNKQQLISFDFSSNVISSPAAEEIIMDKLVGRGTKRKTCSHGTRSPVL 116

Query: 182 AQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAA 241
           A++HV+AERKRREKLS++F+ALSAL+P LKK DK +IL DAI+ +K LQ++L+   EE  
Sbjct: 117 AKEHVLAERKRREKLSEKFIALSALLPGLKKADKVTILDDAISRMKQLQEQLRTLKEEKE 176

Query: 242 KSR-VESVVFVNRSEDVSAVVEDDSSEENSSS-----DRAIPEIEARVSGKDVLLKIHGK 301
            +R +ES++ V +S+      +++ +   S S     D+A+PEIEA++S  D+L++I  +
Sbjct: 177 ATRQMESMILVKKSK---VFFDEEPNLSCSPSVHIEFDQALPEIEAKISQNDILIRILCE 236

Query: 302 KCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTV-KELVQKLR 345
           K KGC+ NILN IE   L + NS  LPFG+  +DIT++AQM   FSM++ K+LV+ LR
Sbjct: 237 KSKGCMINILNTIENFQLRIENSIVLPFGDSTLDITVLAQMDKDFSMSILKDLVRNLR 291

BLAST of Cp4.1LG10g01760 vs. TAIR10
Match: AT4G37850.1 (AT4G37850.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 150.2 bits (378), Expect = 2.4e-36
Identity = 133/375 (35.47%), Positives = 193/375 (51.47%), Query Frame = 1

Query: 1   MEISSAKWLSDMDLE-SAFMDDFEMNPFECTLDEL------SFQTFSDESHTSHLD---- 60
           M I S +W S+ ++E ++ +  F MN     + E       SF T +D S+   ++    
Sbjct: 1   MSILSTRWFSEQEIEENSIIQQFHMNSIVGEVQEAQYIFPHSFTTNNDPSYDDLIEMKPP 60

Query: 61  --LENSVQTPPSPPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGN------SH 120
             LE +  +P S  PP  +P                      SSS I+SF +       H
Sbjct: 61  KILETTYISPSSHLPPNSKPHH----------------IHRHSSSRILSFEDYGSNDMEH 120

Query: 121 SSSPPASNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKR 180
             SP   N +           PK E   + +        Q S E N  G+ +       +
Sbjct: 121 EYSPTYLNSIFS---------PKLEAQVQPH--------QKSDEFNRKGTKRAQPFSRNQ 180

Query: 181 SASAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDL 240
           S           AQDH+IAERKRREKL+QRFVALSAL+P LKKMDKAS+LGDA+ +IK L
Sbjct: 181 SN----------AQDHIIAERKRREKLTQRFVALSALVPGLKKMDKASVLGDALKHIKYL 240

Query: 241 QDRLKVADEEAAKSRVESVVFVNRSEDVSAVVEDD-----SSEENSSSDRAIPEIEARVS 300
           Q+R+   +E+  + R+ES+V V +S+    +++D+     SS E+  SD  +PEIE R S
Sbjct: 241 QERVGELEEQKKERRLESMVLVKKSK---LILDDNNQSFSSSCEDGFSDLDLPEIEVRFS 300

Query: 301 GKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTV 352
            +DVL+KI  +K KG L+ I+  IE+L++ + NSS L FG   +DITIIA+    F MT+
Sbjct: 301 DEDVLIKILCEKQKGHLAKIMAEIEKLHILITNSSVLNFGP-TLDITIIAKKESDFDMTL 328

BLAST of Cp4.1LG10g01760 vs. TAIR10
Match: AT2G22770.1 (AT2G22770.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 136.7 bits (343), Expect = 2.7e-32
Identity = 103/266 (38.72%), Positives = 157/266 (59.02%), Query Frame = 1

Query: 99  NSHSSSPPASNKLVGNNGNYSNVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVG 158
           NS SSSP +S+    ++G+ ++    F     G+ D  +  P  +  N  N       VG
Sbjct: 63  NSTSSSPSSSS----SSGSRTSQVISF-----GSPDTKTN-PVETSLNFSNQVSMDQKVG 122

Query: 159 MKRSASAMN--YRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAIT 218
            KR     N   R   + ++HV+AERKRR+KL++R +ALSAL+P LKK DKA++L DAI 
Sbjct: 123 SKRKDCVNNGGRREPHLLKEHVLAERKRRQKLNERLIALSALLPGLKKTDKATVLEDAIK 182

Query: 219 YIKDLQDRLKVADEE--AAKSRVESVVFVNRSEDVSAVVEDDSSEENS----------SS 278
           ++K LQ+R+K  +EE    K   +S++ V RS+     ++DDSS  +S          SS
Sbjct: 183 HLKQLQERVKKLEEERVVTKKMDQSIILVKRSQ---VYLDDDSSSYSSTCSAASPLSSSS 242

Query: 279 D------RAIPEIEARVSGKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNF 338
           D      + +P IEARVS +D+L+++H +K KGC+  IL+ +E+  L V+NS  LPFGN 
Sbjct: 243 DEVSIFKQTMPMIEARVSDRDLLIRVHCEKNKGCMIKILSSLEKFRLEVVNSFTLPFGNS 302

Query: 339 RIDITIIAQMGDGFSMTVKELVQKLR 345
            + ITI+ +M + FS  V+E+V+ +R
Sbjct: 303 TLVITILTKMDNKFSRPVEEVVKNIR 315

BLAST of Cp4.1LG10g01760 vs. TAIR10
Match: AT5G46760.1 (AT5G46760.1 Basic helix-loop-helix (bHLH) DNA-binding family protein)

HSP 1 Score: 78.2 bits (191), Expect = 1.1e-14
Identity = 56/161 (34.78%), Positives = 96/161 (59.63%), Query Frame = 1

Query: 176 DHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVA--DEEAA 235
           +HV AER+RREKL+QRF +L A++P++ KMDKAS+LGDAI+YI +L+ +L+ A  D+E  
Sbjct: 415 NHVEAERQRREKLNQRFYSLRAVVPNVSKMDKASLLGDAISYINELKSKLQQAESDKEEI 474

Query: 236 KSRVE--SVVFVNRSEDVSAVVEDDSSEENSSSDRAIPEIEARVSGKDVLLKIHGKKCKG 295
           + +++  S    N     S   E  SS ++S++     EI+ ++ G DV++++   K   
Sbjct: 475 QKKLDGMSKEGNNGKGCGSRAKERKSSNQDSTASSIEMEIDVKIIGWDVMIRVQCGKKDH 534

Query: 296 CLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGF 333
             +  +  ++EL+L V ++S     +  I    + +MG  F
Sbjct: 535 PGARFMEALKELDLEVNHASLSVVNDLMIQQATV-KMGSQF 574

BLAST of Cp4.1LG10g01760 vs. NCBI nr
Match: gi|449451351|ref|XP_004143425.1| (PREDICTED: transcription factor bHLH18-like [Cucumis sativus])

HSP 1 Score: 403.7 bits (1036), Expect = 3.4e-109
Identity = 243/358 (67.88%), Positives = 277/358 (77.37%), Query Frame = 1

Query: 1   MEISSAKWLSDMDLESAFMDDFEMNPFECTLDELS-FQTFSDESHTSHLDLENSVQTPPS 60
           MEISSAKWLS+M+LES+FM+D EMNPFECTL+ELS FQTFSDES+TSH+DL+NS    P+
Sbjct: 1   MEISSAKWLSEMELESSFMNDLEMNPFECTLEELSSFQTFSDESYTSHVDLDNSSVQTPA 60

Query: 61  PPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPASNKLVGNNGNYS 120
            PPPAKQ RTS     S ++R+I+SMA SSSSS IISFGN   S   A      NN N  
Sbjct: 61  APPPAKQARTS-----SGSSRRISSMATSSSSSQIISFGNIEMSPMVAQPSYDNNNNNNK 120

Query: 121 NVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASA---MNYRTALVAQD 180
                                  +Y  +PN   K  GVG+KRSA+A    N R+ LVAQD
Sbjct: 121 T---------------------SNYYCSPN---KNHGVGIKRSAAAAMNSNNRSPLVAQD 180

Query: 181 HVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAAKSR 240
           HV+AERKRREKLSQRFVALSALIP LKKMDKASILGDAITYIKDLQ+RLKVA+E+AAK+ 
Sbjct: 181 HVLAERKRREKLSQRFVALSALIPDLKKMDKASILGDAITYIKDLQERLKVANEQAAKAT 240

Query: 241 VESVVFVNRSEDVSAVV-EDDSSEEN--SSSDRAIPEIEARVSGKDVLLKIHGKKCKGCL 300
           VESVVFVN+S+D S ++  DDSSEEN  SSSD AIP++EARVSGKDVLL+IHGKKCKGCL
Sbjct: 241 VESVVFVNKSDDASTIIASDDSSEENSSSSSDGAIPDVEARVSGKDVLLRIHGKKCKGCL 300

Query: 301 SNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKLRQTCLQFV 352
           SNILN IE+LNLTVLNSSALPFGNFR+DITIIAQM D FSMTVKELVQKLRQ  L+F+
Sbjct: 301 SNILNQIEKLNLTVLNSSALPFGNFRLDITIIAQMDDDFSMTVKELVQKLRQASLEFM 329

BLAST of Cp4.1LG10g01760 vs. NCBI nr
Match: gi|659079831|ref|XP_008440467.1| (PREDICTED: transcription factor bHLH18-like [Cucumis melo])

HSP 1 Score: 397.9 bits (1021), Expect = 1.8e-107
Identity = 243/357 (68.07%), Positives = 275/357 (77.03%), Query Frame = 1

Query: 1   MEISSAKWLSDMDLESAFMDDFEMNPFECTLDELS-FQTFSDESHTSHLDLENSVQTPPS 60
           MEISSA WLS+M+LES+FM+D EMNPFECTL+ELS FQTFSDES+TSH+DL+N+    P+
Sbjct: 1   MEISSANWLSEMELESSFMNDLEMNPFECTLEELSSFQTFSDESYTSHVDLDNNSVQTPT 60

Query: 61  PPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPASNKLVGNNGNYS 120
            PPPAKQ RTSGS      +R+IASMA SSSSS IISFGN   SS  A       N N +
Sbjct: 61  APPPAKQARTSGS------SRRIASMATSSSSSQIISFGNVELSSMVAQPSYDNKNNNKT 120

Query: 121 NVKPKFEIGCEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASAM--NYRTALVAQDH 180
                                  +Y  +PN   K  GVG+KRS +AM  N R+ LVAQ+H
Sbjct: 121 ----------------------PNYYCSPN---KNHGVGIKRSVAAMNSNNRSPLVAQEH 180

Query: 181 VIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAAKSRV 240
           V+AERKRREKLSQRFVALSALIP LKKMDKASILGDAITYIKDLQ+RLKVA+E+AAK+ V
Sbjct: 181 VLAERKRREKLSQRFVALSALIPDLKKMDKASILGDAITYIKDLQERLKVANEQAAKATV 240

Query: 241 ESVVFVNRSEDVSA-VVEDDSSEEN--SSSDRAIPEIEARVSGKDVLLKIHGKKCKGCLS 300
           ESVVFVN+SED S  VV DDSSEEN  SSSD AIP++EARVSGKDVLLKIH KKC GCLS
Sbjct: 241 ESVVFVNKSEDASTIVVSDDSSEENSSSSSDGAIPDVEARVSGKDVLLKIHCKKCTGCLS 300

Query: 301 NILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKLRQTCLQFV 352
           NILN IE+LNLTVLNSSALPFGNFR+DITIIAQM D FS+TVKELVQKLRQ  L+F+
Sbjct: 301 NILNQIEKLNLTVLNSSALPFGNFRVDITIIAQMDDDFSITVKELVQKLRQASLKFM 326

BLAST of Cp4.1LG10g01760 vs. NCBI nr
Match: gi|645219682|ref|XP_008236887.1| (PREDICTED: transcription factor bHLH25-like [Prunus mume])

HSP 1 Score: 296.2 bits (757), Expect = 7.5e-77
Identity = 186/366 (50.82%), Positives = 258/366 (70.49%), Query Frame = 1

Query: 3   ISSAKWLSDMDLES-AFMDDFEMNPFECTLDELSFQTFSDESHTSH-------------L 62
           ISSAKW+SD+++E   F+  +EMN  + +LD+L+FQ+FS ES++S+              
Sbjct: 4   ISSAKWVSDLEMEDPTFIHQYEMNSLDYSLDDLNFQSFSSESYSSYPNFTPKATHNFNNA 63

Query: 63  DLENSVQTPPSPPPPAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPAS 122
            +E S Q   +   PAKQP+   +WN  ++     + AASSSSSH+ISF NS+SS P +S
Sbjct: 64  SIETSHQAG-THERPAKQPKNHTTWNPCTSDHTFMAKAASSSSSHLISFDNSNSSPPTSS 123

Query: 123 NKLVGNNGNYSNVKPKFEIG-CEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASAMN 182
            +  GN  N   +KPK E+    G ++L++ I QGSY+     SP+++  G+KR+A+   
Sbjct: 124 QQFYGNLDN--TMKPKNEVEYSNGKLNLTTLISQGSYDPQ-TCSPRHE-QGIKRAATVT- 183

Query: 183 YRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKV 242
            R+ L AQDHV+AERKRREKLSQRF+ALSAL+P LKKMDKAS+LGDAI Y+K LQ+R ++
Sbjct: 184 -RSPLHAQDHVLAERKRREKLSQRFIALSALVPGLKKMDKASVLGDAIKYVKHLQERTRM 243

Query: 243 ADEEAAKSRVESVVFVNRSEDVSAVVEDDSSEEN--SSSDRAIPEIEARVSGKDVLLKIH 302
            +E+A K  VE+VVFV R++  SA  +  SS+EN  S SD+ +PEIEARVS K+VL+++H
Sbjct: 244 LEEQAVKKTVEAVVFVKRTQ-YSADDDISSSDENFESCSDQPLPEIEARVSDKEVLIRVH 303

Query: 303 GKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKLRQ 352
            +K KGCL+ IL+ IE L+LT++NSS LPFGN  +DIT+IAQM   FSMTVK+LV+ LRQ
Sbjct: 304 CEKTKGCLAKILSEIESLDLTIVNSSVLPFGNSTLDITVIAQMDAEFSMTVKDLVKNLRQ 361

BLAST of Cp4.1LG10g01760 vs. NCBI nr
Match: gi|595831834|ref|XP_007206351.1| (hypothetical protein PRUPE_ppa017640mg [Prunus persica])

HSP 1 Score: 283.9 bits (725), Expect = 3.9e-73
Identity = 179/349 (51.29%), Positives = 244/349 (69.91%), Query Frame = 1

Query: 15  ESAFMDDFEMNPFECTLDELSFQTFSDESHTSHLDL---------ENSVQTPPSPPP--- 74
           +  F+  +EMN  + +LD+L+FQ+FS ES++S+ +            S++TP        
Sbjct: 3   DPTFIHQYEMNSLDYSLDDLNFQSFSSESYSSYPNFTPKATHNFSNASIETPQQAGTHER 62

Query: 75  PAKQPRTSGSWNNSSTTRQIASMAASSSSSHIISFGNSHSSSPPASNKLVGNNGNYSNVK 134
           PAKQP+   +WN  +T   I + AASSSSSH+ISF NS+SS P +S +  G   N   +K
Sbjct: 63  PAKQPKNHTTWNPCTTDHTIMAKAASSSSSHLISFDNSNSSPPTSSQQFYGTLDN--TMK 122

Query: 135 PKFEIG-CEGNIDLSSAIPQGSYENNPNGSPKYDGVGMKRSASAMNYRTALVAQDHVIAE 194
           PK E+    G ++L++ I QGSY+     SPK+ G G+KR+A+    R+ L AQDHV+AE
Sbjct: 123 PKNEVEYSNGKLNLTTLISQGSYDPQ-TCSPKH-GQGIKRAATVT--RSPLHAQDHVLAE 182

Query: 195 RKRREKLSQRFVALSALIPHLKKMDKASILGDAITYIKDLQDRLKVADEEAAKSRVESVV 254
           RKRREKLSQRF+ALSAL+P LKKMDKAS+LGDAI Y+K LQ+R K+ +E+A K  VE+VV
Sbjct: 183 RKRREKLSQRFIALSALVPGLKKMDKASVLGDAIKYVKHLQERTKMLEEKAVKKTVEAVV 242

Query: 255 FVNRSEDVSAVVEDDSSEEN--SSSDRAIPEIEARVSGKDVLLKIHGKKCKGCLSNILNH 314
           FV R++  SA  +  SS+EN  SSSD+ +PEIEARVS K+VL+++H +K KGCL+ IL+ 
Sbjct: 243 FVKRTQ-YSADDDISSSDENFESSSDQPLPEIEARVSDKEVLIRVHCEKTKGCLAKILSE 302

Query: 315 IEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSMTVKELVQKLRQTCL 349
           IE L+LT++NSS LPFGN  +DIT+IAQM   FSMTVK+LV+ LRQ  +
Sbjct: 303 IESLDLTIVNSSVLPFGNSTLDITVIAQMDAEFSMTVKDLVKNLRQNLI 344

BLAST of Cp4.1LG10g01760 vs. NCBI nr
Match: gi|1009145653|ref|XP_015890448.1| (PREDICTED: transcription factor bHLH25-like [Ziziphus jujuba])

HSP 1 Score: 281.6 bits (719), Expect = 1.9e-72
Identity = 186/377 (49.34%), Positives = 252/377 (66.84%), Query Frame = 1

Query: 1   MEISSAKWLSDMDLES-AFMDDFEMNPFECTLDELSFQTFSDESHTSHLDLENS------ 60
           MEI+SAKWLS+ ++E   F++ +  N  + +LD L+FQ+FS ES++S+ +          
Sbjct: 1   MEIASAKWLSEFEMEDPTFINQY--NSLDYSLDGLNFQSFSSESYSSYPNFNTPKCAPHQ 60

Query: 61  -VQTPPSPPP---------PAKQPRTSGSWNNSSTTR--QIASMAASSSSSHIISFGNSH 120
            + T P   P         P+KQ + S SWN+ +TT   QI++ A+SSSSSH+ISF N+ 
Sbjct: 61  LLSTTPMETPHHQTTSIERPSKQLK-SNSWNSCTTTHDHQISTKASSSSSSHLISFENNS 120

Query: 121 SSS---PPASNKLVGNNGNYSNVKPKFEIGCEGNIDL-SSAIPQGSYENNPNGSPKYDGV 180
           +     PP S +   ++G    +KPK E+G +GN+ +  S I + SYE   +     +GV
Sbjct: 121 AEKAVMPPTSEQYYASHGP---IKPKNEVGSDGNMSVFPSLISRTSYETQNHSMKHTEGV 180

Query: 181 -GMKRSASAMNYRTALVAQDHVIAERKRREKLSQRFVALSALIPHLKKMDKASILGDAIT 240
                SA  M+ RTA+ AQDHV+AERKRREKLSQRF+ALSA++P LKKMDKAS+LGDAI 
Sbjct: 181 KSSSTSAGTMSRRTAIHAQDHVLAERKRREKLSQRFIALSAVVPGLKKMDKASVLGDAIK 240

Query: 241 YIKDLQDRLKVADEEAAKSRVESVVFVNRSEDVSAVVEDDSSEEN--SSSDRAIPEIEAR 300
           Y+K LQ+R+   +E+AAK  VES VFV RS  VS   E  SS+EN  S SD+ +PEIEAR
Sbjct: 241 YVKQLQERVNTLEEQAAKKTVESAVFVKRSL-VSGDDELSSSDENFDSCSDQPLPEIEAR 300

Query: 301 VSGKDVLLKIHGKKCKGCLSNILNHIEELNLTVLNSSALPFGNFRIDITIIAQMGDGFSM 352
           VS KDVL++IH +K KGCLSNIL+ +E+L LT++NSS LPFG   +DITI+AQM   FSM
Sbjct: 301 VSDKDVLIRIHCEKHKGCLSNILSEVEKLPLTIVNSSVLPFGGSTLDITIVAQMDVEFSM 360

The following BLAST results are available for this feature:

Match Name   E-value   Identity  Description
BH018_ARATH  7.6e-37   51.31     Transcription factor bHLH18 OS=Arabidopsis thaliana GN=BHLH18 PE=2 SV=1
BH019_ARATH  2.9e-36   41.18     Transcription factor bHLH19 OS=Arabidopsis thaliana GN=BHLH19 PE=2 SV=1
BH025_ARATH  4.2e-35   35.47     Transcription factor bHLH25 OS=Arabidopsis thaliana GN=BHLH25 PE=2 SV=2
BH020_ARATH  4.8e-31   38.72     Transcription factor NAI1 OS=Arabidopsis thaliana GN=NAI1 PE=2 SV=1
MYC3_ARATH   2.0e-13   34.78     Transcription factor MYC3 OS=Arabidopsis thaliana GN=MYC3 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0KG27_CUCSA  2.3e-109  67.88     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G497110 PE=4 SV=1
M5WL55_PRUPE      2.7e-73   51.29     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017640mg PE=4 SV=1
W9RQB0_9ROSA      6.0e-65   46.32     Uncharacterized protein OS=Morus notabilis GN=L484_016091 PE=4 SV=1
A0A061DHY3_THECC  6.9e-61   47.98     Basic helix-loop-helix DNA-binding superfamily protein, putative OS=Theobroma ca...
A0A0D2QXV7_GOSRA  4.6e-57   45.11     Uncharacterized protein OS=Gossypium raimondii GN=B456_007G228900 PE=4 SV=1

Match Name   E-value   Identity  Description
AT2G22750.2  4.3e-38   51.31     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT2G22760.1  1.6e-37   41.18     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G37850.1  2.4e-36   35.47     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT2G22770.1  2.7e-32   38.72     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G46760.1  1.1e-14   34.78     Basic helix-loop-helix (bHLH) DNA-binding family protein

Match Name                         E-value   Identity  Description
gi|449451351|ref|XP_004143425.1|   3.4e-109  67.88     PREDICTED: transcription factor bHLH18-like [Cucumis sativus]
gi|659079831|ref|XP_008440467.1|   1.8e-107  68.07     PREDICTED: transcription factor bHLH18-like [Cucumis melo]
gi|645219682|ref|XP_008236887.1|   7.5e-77   50.82     PREDICTED: transcription factor bHLH25-like [Prunus mume]
gi|595831834|ref|XP_007206351.1|   3.9e-73   51.29     hypothetical protein PRUPE_ppa017640mg [Prunus persica]
gi|1009145653|ref|XP_015890448.1|  1.9e-72   49.34     PREDICTED: transcription factor bHLH25-like [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity
Vocabulary: INTERPRO
Term        Definition
IPR011598   bHLH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0030001 metal ion transport
cellular_component GO:0005575 cellular_component
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0046872 metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g01760.1  Cp4.1LG10g01760.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                 Source   Source Term        Source Description                        Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10                                            coord: 176..234, score: 1.7
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PFAM     PF00010            HLH                                       coord: 176..222, score: 6.2
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  SMART    SM00353            finulus                                   coord: 178..227, score: 4.7
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888            BHLH                                      coord: 172..221, score: 15
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain  coord: 177..237, score: 8.9
None       No IPR available                                unknown  Coil               Coil                                      coord: 218..238, scor
None       No IPR available                                PANTHER  PTHR23042          CIRCADIAN PROTEIN CLOCK/ARNT/BMAL/PAS     coord: 169..344, score: 1.6E-88; coord: 1..53, score: 1.6E-88; coord: 99..131, score: 1.6
None       No IPR available                                PANTHER  PTHR23042:SF61     TRANSCRIPTION FACTOR BHLH18-RELATED       coord: 99..131, score: 1.6E-88; coord: 1..53, score: 1.6E-88; coord: 169..344, score: 1.6
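
The five signatures listed under IPR011598 place the bHLH domain at slightly different coordinates. As a rough sketch (the function name `core_and_envelope` is hypothetical, and the coordinates are assumed to be 1-based and inclusive), the region supported by every signature and the widest region covered by any of them can be derived as follows:

```python
# Coordinates copied from the IPR011598 rows above (assumed 1-based, inclusive).
bhlh_hits = {
    "GENE3D G3DSA:4.10.280.10": (176, 234),
    "PFAM PF00010": (176, 222),
    "SMART SM00353": (178, 227),
    "PROFILE PS50888": (172, 221),
    "SSF47459": (177, 237),
}

def core_and_envelope(hits):
    """Return (core, envelope): the intersection and the union span of all hits."""
    starts = [start for start, _ in hits.values()]
    ends = [end for _, end in hits.values()]
    core = (max(starts), min(ends))      # residues covered by every signature
    envelope = (min(starts), max(ends))  # residues covered by at least one signature
    return core, envelope

core, envelope = core_and_envelope(bhlh_hits)
print("core:", core, "envelope:", envelope)
```

The core span is a conservative estimate of the domain boundaries; the envelope is the more permissive one.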

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG10g01760  Cp4.1LG19g10190  Cucurbita pepo (Zucchini)  cpecpeB076
The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG10g01760  Cucurbita pepo (Zucchini)   cpecpeB063
Cp4.1LG10g01760  Cucurbita pepo (Zucchini)   cpecpeB104
Cp4.1LG10g01760  Melon (DHL92) v3.5.1        cpemeB051
Cp4.1LG10g01760  Cucumber (Gy14) v2          cgybcpeB157
Cp4.1LG10g01760  Melon (DHL92) v3.6.1        cpemedB065
Cp4.1LG10g01760  Silver-seed gourd           carcpeB0078
Cp4.1LG10g01760  Cucumber (Chinese Long) v3  cpecucB0065