Cp4.1LG09g08850 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g08850
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: BHLH transcription factor
Location: Cp4.1LG09 : 8079280 .. 8082266 (+)
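
The Location field packs the sequence ID, start, end, and strand into one string. A minimal sketch of how it could be parsed follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG09 : 8079280 .. 8082266 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 2987 bp on the plus strand of Cp4.1LG09.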
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
ACCTAAGATCTCACATGCACTCATGAATGAGTATCGAACATGCCACATGGGGGTTAACAAACTTCAAATTCGTTCCGACGGTTCAAAACCCAAATTGGGAAAGCGAACAGGAGACTGTGGGCCTTAACATTGCAAATCGCATTCGCAGTTTCCACTTTTGCTTGTGGTGTTAGCATCGGTGGAACAGTTGCGGTATGCGATTCTTCTCCGATTTTCCCTTTCGTTTTATCTCTCATATTCCCTTTTCCATTTCTTCAAATTTCCCATCGTTTCCTTCCCAATGTATATATTCTACCATTCGCCATGGAATTCGGCTGTATCTCGTGTTCTGTACTCAGAAATTCAGGGATTTTGAGTTCCGGCAATTGATCGCTGATGAAGTCGCCTGAAATTACCGACTGGGTATTTGATTACGCTCTGATCGAGGACTTTCCGGTCCCCGGCGGTGACCTACCTTCACTGGATCTTCCTGTTTTCACTTTGTCCTCTCGCGATTTCACTGCCTCCTTCAGGTTCTAATAGCTCATGAATCTCTTTTGAGTTTCGTTTTTTGATATTGTTCCTGGGATTTGTATGTTGTTTGAAATGAATTGGTTGAGTTGTGGATTATCATCTGGGTTTGCTTGTGGATGTAGATTACATGATTCAACATAGGACTATGAAGTTATTTCATTAAGCAAAATTTGATTCACTAACGTCTTACATTGGCAAGGAGTTGTACCAGATTATGATGCCTACTCCAATTTAGCCTTCGTTCTTATAGATTTTTAACATGGTATCAAAGTAAAAAGCCTTGAATTCAAACTCTTTTTTTAATGTTTTTTCCTTCTCTGTTTTAGATTAGTAGTTCATGATATTAGAGGTAGGTTTCAAGTTCTAAATTTTACCGTGATCCATGTGTTAGTGAAATTGTCCAAGCCCACTGCCAGTAGATATTGTCTTCTTTGGCTTTTTCTTTTCGGGCTTCCCCTCAAGATTCTCCTTAAGATTCTGCTAGGGAGAGGTTTTGTCCTCTTTGGCTTTTTCCTTTCGGGCTTCCCCTCAAAGTTTTAAAATGCCTCTGATAGGGAGAGGTTTCCACACCCGTAATGTTTCGTTCCCCTCTCCAACCGACATGAGATCTCACAATCCACCTCTCTTGGGGCCAGCGTCCTCGTTGGAACACCACCCGGTGTTTGATTCTGATACCATTTGTAACCGTCTAAGACCACTGCTAGCAGATATGGTCCTCTTTGAGCTTTCTCTTTTGGACTTCCCCTCAAGATTTTAAAACGCGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAGGAATGTTTCGTTCCCCTCTCCAACCGACGTGGGATCTCACAATCCACACCTCTTGGGGGCCTAGTGTCCTTGCTGGCACACCGCCCCGTGTATGGCTCTGATACCATTTCTAAGAATCTAAGCCCACCGCTAGCATATATTGTCTTCTTTAGGCTTTCCCTTAAGGGCTTCTTCTCAAGGTTTTTAAAACGTGTTTGCTAGGGAGAGGTTTCCACCCCCTTATAAGGAATGTTTCGTTCTCATCTCCAACTGATGTGGGATCTCACAATTAGACTCAATTCAAACATTAGAGCCTTTTCATGAGAGCGTACTCCAATGCTTATATGTTTTATTTTCTTTATATGACAGCTGAGATGCCATTTTTTTATTAAAGATCTCATCGTACCGTTGTTAGTGTTTTAAGGGAAGACTTTGATGAGGCACATGGAAAGCTAAACGATATAAAAGAATCTGGGTCTAGGAAAAGGTGAGCAATTATTAGTTCCCAAGTTGAGTATCTGCACTGTGAGGAGGATTAGCATATCATTGTTTATGTTCTAATGCTGTGGCTATCAATTTGAGCATATTGGATATGCCCTTACAATATCATTGGTTATGTTCTAATGCTGTGGCTATCAACTTTAGCATATTGGATATGCCCTTACAATAGCATGGTAGTGATCTTACTGTTGACAGGATGAGCTCTGGATCA
GGTGCATCTTTGTCCAAAGCACATAAAGAGAAAGTGCGGAGAAATAGACTAAATGACAGGCATCTCGATTCGTTACATTCATCTTTTTCGCCATTCCCGTCTCTGTCCGCTCCTCCCCCGCCTCATATTTTGTGTTTGCTTGCTGATTAAAGGTTTCTGGAGTTGAATTCCATCCTCAATCATGGAACGTCTCCCAAAATTGACAAGTCTGTTATTTTGGGTGATGCAGTTCGAATGATCTTGCAGCTAAGAGATGAAGCTCAGAAGCTGAAGGAGTCCAATGAGAATTTTCTGGAGAAGATCAATGAAATGAAGGTAGATTATGTGGTTTTGTACTCGTCGATGTTTGGTTTGATCTTCCATGGATGTTTGATTCTATTTCTTCTTTGTTCAGGCTGAAAAGAATGAACTACGAGACGAGAAACAGAGGCTAAAAGAAGCAAAAGACAACCTTAAACAGAAAATGAAAACCTTCATTACTCAACCAAGTTTCCTGCCTCACCCTTCTGCATTTTCTGCTCCAAATCATGTTGTTGGGGCGAAGTTCGTACCTGTGATTGGATATCCAGGAGTATCCATGTGGAAGCTTATGCCTCAGGGCGCTATCGATACGTCGCAAGACCACGTTCTCCGACCACCAGTCGCCTGACTTGGTAGAGTTCTTCCATTTTCACTGGCTTTGCTTTTGCATGCGAGGATTCCTATGTTGGTTTTTCTGTAGGGCTGACATGGTTCTGTTGTCTGTTAGATTTTCAAGATTGAATTGGCTTGAATATCTGTTTTGTTAAAGAAATTCCGCTGATCAAAATCAATATAGTCTACTCCATAAGTTCTTAGCCTTACTCTTCTTCAGCAGAAGACGTGTTTGCTCCGAGAGAAACAATCAGTCTTTAGAAGATTATAGAGTTTAAACTAATAGAACTATGCTCGGGTTGACTGATTTTTCTTTCGATATCCATTGTTCGGCATAGGACTTCAACAATCCTC

mRNA sequence

ACCTAAGATCTCACATGCACTCATGAATGAGTATCGAACATGCCACATGGGGGTTAACAAACTTCAAATTCGTTCCGACGGTTCAAAACCCAAATTGGGAAAGCGAACAGGAGACTGTGGGCCTTAACATTGCAAATCGCATTCGCAGTTTCCACTTTTGCTTGTGGTGTTAGCATCGGTGGAACAGTTGCGGTATGCGATTCTTCTCCGATTTTCCCTTTCGTTTTATCTCTCATATTCCCTTTTCCATTTCTTCAAATTTCCCATCGTTTCCTTCCCAATGTATATATTCTACCATTCGCCATGGAATTCGGCTGTATCTCGTGTTCTGTACTCAGAAATTCAGGGATTTTGAGTTCCGGCAATTGATCGCTGATGAAGTCGCCTGAAATTACCGACTGGGTATTTGATTACGCTCTGATCGAGGACTTTCCGGTCCCCGGCGGTGACCTACCTTCACTGGATCTTCCTGTTTTCACTTTGTCCTCTCGCGATTTCACTGCCTCCTTCAGATCTCATCGTACCGTTGTTAGTGTTTTAAGGGAAGACTTTGATGAGGCACATGGAAAGCTAAACGATATAAAAGAATCTGGGTCTAGGAAAAGGTTTCTGGAGTTGAATTCCATCCTCAATCATGGAACGTCTCCCAAAATTGACAAGTCTGTTATTTTGGGTGATGCAGTTCGAATGATCTTGCAGCTAAGAGATGAAGCTCAGAAGCTGAAGGAGTCCAATGAGAATTTTCTGGAGAAGATCAATGAAATGAAGGCTGAAAAGAATGAACTACGAGACGAGAAACAGAGGCTAAAAGAAGCAAAAGACAACCTTAAACAGAAAATGAAAACCTTCATTACTCAACCAAGTTTCCTGCCTCACCCTTCTGCATTTTCTGCTCCAAATCATGTTGTTGGGGCGAAGTTCGTACCTGTGATTGGATATCCAGGAGTATCCATGTGGAAGCTTATGCCTCAGGGCGCTATCGATACGTCGCAAGACCACGTTCTCCGACCACCAGTCGCCTGACTTGGTAGAGTTCTTCCATTTTCACTGGCTTTGCTTTTGCATGCGAGGATTCCTATGTTGGTTTTTCTGTAGGGCTGACATGGTTCTGTTGTCTGTTAGATTTTCAAGATTGAATTGGCTTGAATATCTGTTTTGTTAAAGAAATTCCGCTGATCAAAATCAATATAGTCTACTCCATAAGTTCTTAGCCTTACTCTTCTTCAGCAGAAGACGTGTTTGCTCCGAGAGAAACAATCAGTCTTTAGAAGATTATAGAGTTTAAACTAATAGAACTATGCTCGGGTTGACTGATTTTTCTTTCGATATCCATTGTTCGGCATAGGACTTCAACAATCCTC

Coding sequence (CDS)

ATGAAGTCGCCTGAAATTACCGACTGGGTATTTGATTACGCTCTGATCGAGGACTTTCCGGTCCCCGGCGGTGACCTACCTTCACTGGATCTTCCTGTTTTCACTTTGTCCTCTCGCGATTTCACTGCCTCCTTCAGATCTCATCGTACCGTTGTTAGTGTTTTAAGGGAAGACTTTGATGAGGCACATGGAAAGCTAAACGATATAAAAGAATCTGGGTCTAGGAAAAGGTTTCTGGAGTTGAATTCCATCCTCAATCATGGAACGTCTCCCAAAATTGACAAGTCTGTTATTTTGGGTGATGCAGTTCGAATGATCTTGCAGCTAAGAGATGAAGCTCAGAAGCTGAAGGAGTCCAATGAGAATTTTCTGGAGAAGATCAATGAAATGAAGGCTGAAAAGAATGAACTACGAGACGAGAAACAGAGGCTAAAAGAAGCAAAAGACAACCTTAAACAGAAAATGAAAACCTTCATTACTCAACCAAGTTTCCTGCCTCACCCTTCTGCATTTTCTGCTCCAAATCATGTTGTTGGGGCGAAGTTCGTACCTGTGATTGGATATCCAGGAGTATCCATGTGGAAGCTTATGCCTCAGGGCGCTATCGATACGTCGCAAGACCACGTTCTCCGACCACCAGTCGCCTGA

Protein sequence

MKSPEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDFDEAHGKLNDIKESGSRKRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFLPHPSAFSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA
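
The protein sequence is the conceptual translation of the CDS. The sketch below shows one way to reproduce that step with the standard genetic code, stopping at the first stop codon. The function name `translate` and the table construction are illustrative assumptions, not the pipeline this database used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGAAGTCGCCTGAAATTACCGACTGG"))
```

Applied to the first nine codons of the CDS, this yields `MKSPEITDW`, matching the start of the listed protein.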
BLAST of Cp4.1LG09g08850 vs. Swiss-Prot
Match: BH115_ARATH (Transcription factor bHLH115 OS=Arabidopsis thaliana GN=BHLH115 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 2.7e-37
Identity = 101/231 (43.72%), Positives = 133/231 (57.58%), Query Frame = 1

Query: 1   MKSPEITDWVFDYALIED--------FP--VPGGDLPSLDLPVFTLSSRDFTASFRSHRT 60
           M SPE T+W+ DY LIE         FP  + G    S+++  F L   D      S + 
Sbjct: 1   MVSPENTNWLSDYPLIEGAFSDQNPTFPWQIDGSATVSVEVDGF-LCDADVIKEPSSRKR 60

Query: 61  VVSVLREDFDEAHGKLNDIKESGSR--KRFLELNSILNHGTSPKIDKSVILGDAVRMILQ 120
           + +   E    ++ K    K+   R   +F EL+S+L  G +PK DK  I+ DA+RM+ Q
Sbjct: 61  IKT---ESCTGSNSKACREKQRRDRLNDKFTELSSVLEPGRTPKTDKVAIINDAIRMVNQ 120

Query: 121 LRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPS----F 180
            RDEAQKLK+ N +  EKI E+K EKNELRDEKQ+LK  K+ + Q++K   TQP     F
Sbjct: 121 ARDEAQKLKDLNSSLQEKIKELKDEKNELRDEKQKLKVEKERIDQQLKAIKTQPQPQPCF 180

Query: 181 LPHPSAFSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           LP+P   S      G+K VP   YPG +MW+ MP  A+DTSQDHVLRPPVA
Sbjct: 181 LPNPQTLSQA-QAPGSKLVPFTTYPGFAMWQFMPPAAVDTSQDHVLRPPVA 226
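
The percentages in each HSP header are simply the matched counts divided by the alignment length, gaps included. As a rough sketch, the header line can be parsed and the percentages recomputed as below; `hsp_percentages` is a hypothetical helper, and the sample line is the first Swiss-Prot HSP header with the label spelled "Positives".

```python
import re

def hsp_percentages(line):
    """Recompute percentages from 'Label = matched/length' fields in an HSP header."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", line)
    return {label: round(100 * int(n) / int(d), 2) for label, n, d in pairs}

header = "Identity = 101/231 (43.72%), Positives = 133/231 (57.58%), Query Frame = 1"
print(hsp_percentages(header))
```

Fields without a `matched/length` pair, such as the query frame, are ignored by the pattern.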

BLAST of Cp4.1LG09g08850 vs. Swiss-Prot
Match: ILR3_ARATH (Transcription factor ILR3 OS=Arabidopsis thaliana GN=ILR3 PE=1 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 4.7e-37
Identity = 85/145 (58.62%), Positives = 104/145 (71.72%), Query Frame = 1

Query: 77  RFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFLEKINEMKAEKNE 136
           +F+EL +IL  G  PK DK+ IL DAVRM+ QLR EAQKLK+SN +  +KI E+K EKNE
Sbjct: 90  KFMELGAILEPGNPPKTDKAAILVDAVRMVTQLRGEAQKLKDSNSSLQDKIKELKTEKNE 149

Query: 137 LRDEKQRLKEAKDNLKQKMKTF-ITQPSFLP----HPSAF-SAPNHVVGAKFVPVIGYPG 196
           LRDEKQRLK  K+ L+Q++K     QPSF P     P+AF SA     G K VP+I YPG
Sbjct: 150 LRDEKQRLKTEKEKLEQQLKAMNAPQPSFFPAPPMMPTAFASAQGQAPGNKMVPIISYPG 209

Query: 197 VSMWKLMPQGAIDTSQDHVLRPPVA 216
           V+MW+ MP  ++DTSQDHVLRPPVA
Sbjct: 210 VAMWQFMPPASVDTSQDHVLRPPVA 234

BLAST of Cp4.1LG09g08850 vs. Swiss-Prot
Match: BH104_ARATH (Transcription factor bHLH104 OS=Arabidopsis thaliana GN=BHLH104 PE=2 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 5.4e-25
Identity = 72/143 (50.35%), Positives = 98/143 (68.53%), Query Frame = 1

Query: 76  KRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFLEKINEMKAEKN 135
           +RF++L+S+L  G +PK DK  IL DA+R++ QLRDEA KL+E+N+  LE+I  +KAEKN
Sbjct: 148 ERFMDLSSVLEPGRTPKTDKPAILDDAIRILNQLRDEALKLEETNQKLLEEIKSLKAEKN 207

Query: 136 ELRDEKQRLKEAKDNLKQKMKTFITQPS--FLPH-PSAFSAPNHVVGAKFVPVIGYPGVS 195
           ELR+EK  LK  K+  +Q++K+ +T PS  F+PH P+AF   NH   A + P  GY  + 
Sbjct: 208 ELREEKLVLKADKEKTEQQLKS-MTAPSSGFIPHIPAAF---NHNKMAVY-PSYGY--MP 267

Query: 196 MWKLMPQGAIDTSQDHVLRPPVA 216
           MW  MPQ   DTS+D  LRPP A
Sbjct: 268 MWHYMPQSVRDTSRDQELRPPAA 283

BLAST of Cp4.1LG09g08850 vs. Swiss-Prot
Match: BH034_ARATH (Transcription factor bHLH34 OS=Arabidopsis thaliana GN=BHLH34 PE=2 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 1.1e-22
Identity = 66/156 (42.31%), Positives = 97/156 (62.18%), Query Frame = 1

Query: 65  KLNDIKESGSRKRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFL 124
           KLND        +F++L+S+L  G +PK DKS IL DA+R++ QLR EA +L+E+N+  L
Sbjct: 177 KLND--------KFMDLSSVLEPGRTPKTDKSAILDDAIRVVNQLRGEAHELQETNQKLL 236

Query: 125 EKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFIT-QPSFLP--HPSAFSAPNHVVGAK 184
           E+I  +KA+KNELR+EK  LK  K+ ++Q++K+ +   P F+P  HP+AF +    V   
Sbjct: 237 EEIKSLKADKNELREEKLVLKAEKEKMEQQLKSMVVPSPGFMPSQHPAAFHSHKMAVAYP 296

Query: 185 FVPVIGY--PGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           +    GY  P + MW  +P    DTS+D    PPVA
Sbjct: 297 Y----GYYPPNMPMWSPLPPADRDTSRDLKNLPPVA 320

BLAST of Cp4.1LG09g08850 vs. TrEMBL
Match: A0A0A0LX71_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G065960 PE=4 SV=1)

HSP 1 Score: 297.7 bits (761), Expect = 1.1e-77
Identity = 162/235 (68.94%), Positives = 180/235 (76.60%), Query Frame = 1

Query: 1   MKSPEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDFD 60
           M SPE+TDWVFDY +IE+ PVPGGDLPSLDLP FTL S DFTASFR        + ED  
Sbjct: 1   MGSPELTDWVFDYGVIENIPVPGGDLPSLDLPSFTLPSCDFTASFREDFDEPLGMEEDVK 60

Query: 61  EAHGK------LNDIKESGSRK----------RFLELNSILNHGTSPKIDKSVILGDAVR 120
           E+  +       ++  ES +RK          RFLELNSILNHG  PKIDKS ILGDAVR
Sbjct: 61  ESRSRKRMSSGSSNAFESKARKEKIRRDKLNDRFLELNSILNHGRPPKIDKSAILGDAVR 120

Query: 121 MILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSF 180
           MI+QLRDEAQKLKESNE+ LEKINEMKAEKNELRDEKQRLKEAKD+L++KMK F TQP+F
Sbjct: 121 MIIQLRDEAQKLKESNESSLEKINEMKAEKNELRDEKQRLKEAKDSLEKKMKGFNTQPTF 180

Query: 181 LPHPSA----FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           LPHP A    FS+PN +VG K VPVIGYPGVSMW+ MP GAIDTSQDHVLRPPVA
Sbjct: 181 LPHPPAIPAGFSSPNQIVGGKLVPVIGYPGVSMWQFMPPGAIDTSQDHVLRPPVA 235

BLAST of Cp4.1LG09g08850 vs. TrEMBL
Match: W9QRW3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_019764 PE=4 SV=1)

HSP 1 Score: 242.3 bits (617), Expect = 5.5e-61
Identity = 133/219 (60.73%), Positives = 153/219 (69.86%), Query Frame = 1

Query: 1   MKSPEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDFD 60
           M SPE  +WVFDY+LIED P+PGG LP+LD P F   S  FT       T VS    DFD
Sbjct: 1   MGSPENPNWVFDYSLIEDLPIPGGGLPALDPPSFPWPSHSFTPP-----TTVSA---DFD 60

Query: 61  EAHGKLNDIKESGSRKRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESN 120
           E+    +  KE+G RKRFLEL SI   G  PK+DK+VILGDAVRM+  LR EA+KLKESN
Sbjct: 61  ESLANSDGFKEAGCRKRFLELGSISEPGRPPKMDKAVILGDAVRMVTHLRMEAEKLKESN 120

Query: 121 ENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFLPHPSA----FSAPNH 180
           E   EKINE+KAEKNELRDEKQRLK  KD L +++K   TQPSFL HP A    F  P  
Sbjct: 121 EKLQEKINELKAEKNELRDEKQRLKAEKDVLDKQVKALTTQPSFL-HPPAIPPPFPGPGQ 180

Query: 181 VVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           VVG K +P +GYPG+SMW+ MP  A+DTSQDHVLRPPVA
Sbjct: 181 VVGGKLMPFVGYPGISMWQFMPPAAVDTSQDHVLRPPVA 210

BLAST of Cp4.1LG09g08850 vs. TrEMBL
Match: M5WIN3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010730mg PE=4 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 2.9e-54
Identity = 122/234 (52.14%), Positives = 151/234 (64.53%), Query Frame = 1

Query: 4   PEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTA------SFRSHRTVVSVLRE 63
           P+  +WVFDY ++ED PVPGGDLP LDLP FT  S  F A       F         ++E
Sbjct: 5   PQNHNWVFDYGVLEDIPVPGGDLPPLDLPGFTWPSHSFVAPAAPSVDFDDSFGNSDSIKE 64

Query: 64  DFDEAHGKLNDIKESGSRK------------RFLELNSILNHGTSPKIDKSVILGDAVRM 123
                  +      +GS+             RFLEL+S+L  G  PK DK+ ILGDAVR+
Sbjct: 65  SGFRKRVRSGSCNVTGSKACREKMRRDRLNDRFLELSSMLEPGRPPKTDKAAILGDAVRV 124

Query: 124 ILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFL 183
           + QLR EAQ+LK+SN +  EKINE+KAEKNELRDEKQRLK  K+N+++++K   TQP FL
Sbjct: 125 VNQLRGEAQQLKDSNGDLQEKINELKAEKNELRDEKQRLKTEKENIERQIKALNTQPGFL 184

Query: 184 PHPSA----FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           PHP+A    FSAP  VVG K +P +GYPGVSMW+ MP  A+DTSQDHVLR PVA
Sbjct: 185 PHPAAIPGPFSAPGQVVGGKLMPFVGYPGVSMWQFMPPAAVDTSQDHVLRSPVA 238

BLAST of Cp4.1LG09g08850 vs. TrEMBL
Match: A0A0U2I0R6_9ROSA (BHLH transcription factor OS=Prunus pseudocerasus PE=2 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 2.9e-54
Identity = 122/234 (52.14%), Positives = 151/234 (64.53%), Query Frame = 1

Query: 4   PEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTA------SFRSHRTVVSVLRE 63
           P+  +WVFDY +IED PVPGGDLP LDLP FT  S  F A       F         ++E
Sbjct: 5   PQNHNWVFDYGVIEDIPVPGGDLPPLDLPGFTWPSNSFVAPAAPSVDFDDSFGNSDSIKE 64

Query: 64  DFDEAHGKLNDIKESGSRK------------RFLELNSILNHGTSPKIDKSVILGDAVRM 123
                  +      +GS+             RFLEL+S+L  G  PK DK+ ILGDAVR+
Sbjct: 65  SGFRKRVRSGSCNVTGSKACREKMRRDRLNDRFLELSSMLEPGRPPKTDKAAILGDAVRV 124

Query: 124 ILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFL 183
           + QLR EAQ+LK+SN +  EKINE+KAEKNELR+EKQRLK  K+N+++++K   TQP FL
Sbjct: 125 VNQLRGEAQQLKDSNGHLQEKINELKAEKNELRNEKQRLKTEKENIERQIKALNTQPGFL 184

Query: 184 PHPSA----FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           PHP+A    F AP  VVG K +P +GYPGVSMW+ MP  A+DTSQDHVLRPPVA
Sbjct: 185 PHPAAIPGPFPAPGQVVGGKLMPFVGYPGVSMWQFMPPAAVDTSQDHVLRPPVA 238

BLAST of Cp4.1LG09g08850 vs. TrEMBL
Match: A0A061GF70_THECC (DNA binding protein, putative isoform 2 OS=Theobroma cacao GN=TCM_030082 PE=4 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 2.9e-54
Identity = 125/226 (55.31%), Positives = 149/226 (65.93%), Query Frame = 1

Query: 1   MKSPEITDWVFD-YALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDF 60
           + S   + W+FD Y L+ED PVPGGDLPSLD      SS+  T S    R          
Sbjct: 12  VSSENPSGWIFDDYGLLEDIPVPGGDLPSLDPAAPIWSSQSLTCSTPPLRARSGSCSASG 71

Query: 61  DEA------HGKLNDIKESGSRKRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEA 120
            +A        +LND        RFLEL SIL+ G   K+DK+VIL DAVRM+ QLRDEA
Sbjct: 72  SKACREKMRRDRLND--------RFLELGSILDPGRPLKVDKAVILVDAVRMVTQLRDEA 131

Query: 121 QKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFLPHPSA--- 180
           QKL+ESNE+  EKINE+KAEKNELRDEKQRLK  K+NL+Q++K   TQP FLPHP A   
Sbjct: 132 QKLRESNESLQEKINELKAEKNELRDEKQRLKTEKENLEQQVKALGTQPGFLPHPPAIPT 191

Query: 181 -FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
            FS P  VVG K VP +GYPGVSMW+ +P  ++DTSQDH+LRPPVA
Sbjct: 192 PFSTPGQVVGGKLVPFVGYPGVSMWQFLPPASVDTSQDHILRPPVA 229

BLAST of Cp4.1LG09g08850 vs. TAIR10
Match: AT5G54680.1 (AT5G54680.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 156.0 bits (393), Expect = 2.6e-38
Identity = 85/145 (58.62%), Positives = 104/145 (71.72%), Query Frame = 1

Query: 77  RFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFLEKINEMKAEKNE 136
           +F+EL +IL  G  PK DK+ IL DAVRM+ QLR EAQKLK+SN +  +KI E+K EKNE
Sbjct: 90  KFMELGAILEPGNPPKTDKAAILVDAVRMVTQLRGEAQKLKDSNSSLQDKIKELKTEKNE 149

Query: 137 LRDEKQRLKEAKDNLKQKMKTF-ITQPSFLP----HPSAF-SAPNHVVGAKFVPVIGYPG 196
           LRDEKQRLK  K+ L+Q++K     QPSF P     P+AF SA     G K VP+I YPG
Sbjct: 150 LRDEKQRLKTEKEKLEQQLKAMNAPQPSFFPAPPMMPTAFASAQGQAPGNKMVPIISYPG 209

Query: 197 VSMWKLMPQGAIDTSQDHVLRPPVA 216
           V+MW+ MP  ++DTSQDHVLRPPVA
Sbjct: 210 VAMWQFMPPASVDTSQDHVLRPPVA 234

BLAST of Cp4.1LG09g08850 vs. TAIR10
Match: AT1G51070.2 (AT1G51070.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 147.1 bits (370), Expect = 1.2e-35
Identity = 78/143 (54.55%), Positives = 98/143 (68.53%), Query Frame = 1

Query: 77  RFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFLEKINEMKAEKNE 136
           +F EL+S+L  G +PK DK  I+ DA+RM+ Q RDEAQKLK+ N +  EKI E+K EKNE
Sbjct: 151 KFTELSSVLEPGRTPKTDKVAIINDAIRMVNQARDEAQKLKDLNSSLQEKIKELKDEKNE 210

Query: 137 LRDEKQRLKEAKDNLKQKMKTFITQPS----FLPHPSAFSAPNHVVGAKFVPVIGYPGVS 196
           LRDEKQ+LK  K+ + Q++K   TQP     FLP+P   S      G+K VP   YPG +
Sbjct: 211 LRDEKQKLKVEKERIDQQLKAIKTQPQPQPCFLPNPQTLSQA-QAPGSKLVPFTTYPGFA 270

Query: 197 MWKLMPQGAIDTSQDHVLRPPVA 216
           MW+ MP  A+DTSQDHVLRPPVA
Sbjct: 271 MWQFMPPAAVDTSQDHVLRPPVA 292

BLAST of Cp4.1LG09g08850 vs. TAIR10
Match: AT4G14410.1 (AT4G14410.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 115.9 bits (289), Expect = 3.0e-26
Identity = 72/143 (50.35%), Positives = 98/143 (68.53%), Query Frame = 1

Query: 76  KRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFLEKINEMKAEKN 135
           +RF++L+S+L  G +PK DK  IL DA+R++ QLRDEA KL+E+N+  LE+I  +KAEKN
Sbjct: 148 ERFMDLSSVLEPGRTPKTDKPAILDDAIRILNQLRDEALKLEETNQKLLEEIKSLKAEKN 207

Query: 136 ELRDEKQRLKEAKDNLKQKMKTFITQPS--FLPH-PSAFSAPNHVVGAKFVPVIGYPGVS 195
           ELR+EK  LK  K+  +Q++K+ +T PS  F+PH P+AF   NH   A + P  GY  + 
Sbjct: 208 ELREEKLVLKADKEKTEQQLKS-MTAPSSGFIPHIPAAF---NHNKMAVY-PSYGY--MP 267

Query: 196 MWKLMPQGAIDTSQDHVLRPPVA 216
           MW  MPQ   DTS+D  LRPP A
Sbjct: 268 MWHYMPQSVRDTSRDQELRPPAA 283

BLAST of Cp4.1LG09g08850 vs. TAIR10
Match: AT3G23210.1 (AT3G23210.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 108.2 bits (269), Expect = 6.3e-24
Identity = 66/156 (42.31%), Positives = 97/156 (62.18%), Query Frame = 1

Query: 65  KLNDIKESGSRKRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESNENFL 124
           KLND        +F++L+S+L  G +PK DKS IL DA+R++ QLR EA +L+E+N+  L
Sbjct: 177 KLND--------KFMDLSSVLEPGRTPKTDKSAILDDAIRVVNQLRGEAHELQETNQKLL 236

Query: 125 EKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFIT-QPSFLP--HPSAFSAPNHVVGAK 184
           E+I  +KA+KNELR+EK  LK  K+ ++Q++K+ +   P F+P  HP+AF +    V   
Sbjct: 237 EEIKSLKADKNELREEKLVLKAEKEKMEQQLKSMVVPSPGFMPSQHPAAFHSHKMAVAYP 296

Query: 185 FVPVIGY--PGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           +    GY  P + MW  +P    DTS+D    PPVA
Sbjct: 297 Y----GYYPPNMPMWSPLPPADRDTSRDLKNLPPVA 320

BLAST of Cp4.1LG09g08850 vs. NCBI nr
Match: gi|449454698|ref|XP_004145091.1| (PREDICTED: transcription factor ILR3-like [Cucumis sativus])

HSP 1 Score: 297.7 bits (761), Expect = 1.6e-77
Identity = 162/235 (68.94%), Positives = 180/235 (76.60%), Query Frame = 1

Query: 1   MKSPEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDFD 60
           M SPE+TDWVFDY +IE+ PVPGGDLPSLDLP FTL S DFTASFR        + ED  
Sbjct: 1   MGSPELTDWVFDYGVIENIPVPGGDLPSLDLPSFTLPSCDFTASFREDFDEPLGMEEDVK 60

Query: 61  EAHGK------LNDIKESGSRK----------RFLELNSILNHGTSPKIDKSVILGDAVR 120
           E+  +       ++  ES +RK          RFLELNSILNHG  PKIDKS ILGDAVR
Sbjct: 61  ESRSRKRMSSGSSNAFESKARKEKIRRDKLNDRFLELNSILNHGRPPKIDKSAILGDAVR 120

Query: 121 MILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSF 180
           MI+QLRDEAQKLKESNE+ LEKINEMKAEKNELRDEKQRLKEAKD+L++KMK F TQP+F
Sbjct: 121 MIIQLRDEAQKLKESNESSLEKINEMKAEKNELRDEKQRLKEAKDSLEKKMKGFNTQPTF 180

Query: 181 LPHPSA----FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           LPHP A    FS+PN +VG K VPVIGYPGVSMW+ MP GAIDTSQDHVLRPPVA
Sbjct: 181 LPHPPAIPAGFSSPNQIVGGKLVPVIGYPGVSMWQFMPPGAIDTSQDHVLRPPVA 235

BLAST of Cp4.1LG09g08850 vs. NCBI nr
Match: gi|659067851|ref|XP_008441610.1| (PREDICTED: transcription factor ILR3-like [Cucumis melo])

HSP 1 Score: 297.0 bits (759), Expect = 2.7e-77
Identity = 164/235 (69.79%), Positives = 180/235 (76.60%), Query Frame = 1

Query: 1   MKSPEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDFD 60
           M SPEITDWVFDY +IEDFPVPGGDLPSLDLP FTL   DFTASFR+       + ED  
Sbjct: 1   MGSPEITDWVFDYGVIEDFPVPGGDLPSLDLPSFTLPC-DFTASFRADFDEPLGMAEDVK 60

Query: 61  EAHGK------LNDIKESGSRK----------RFLELNSILNHGTSPKIDKSVILGDAVR 120
           E+  +       ++  ES +RK          RFLELNSILNHG  PK+DKS ILGDAVR
Sbjct: 61  ESGSRKRMSSGSSNAFESKARKEKMRRDKLNDRFLELNSILNHGRPPKLDKSAILGDAVR 120

Query: 121 MILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSF 180
           MI+QLRDEAQKLKESNE+ LEKINEMKAEKNELRDEKQRLKEAKD L++KMK F TQPSF
Sbjct: 121 MIIQLRDEAQKLKESNESSLEKINEMKAEKNELRDEKQRLKEAKDGLEKKMKAFNTQPSF 180

Query: 181 LPHPSA----FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           LPHP A    FS+PN +VG K VPVIGYPGVSMW+ MP GAIDTSQDHVLRPPVA
Sbjct: 181 LPHPPAIPPGFSSPNQIVGGKLVPVIGYPGVSMWQFMPPGAIDTSQDHVLRPPVA 234

BLAST of Cp4.1LG09g08850 vs. NCBI nr
Match: gi|703085262|ref|XP_010092690.1| (hypothetical protein L484_019764 [Morus notabilis])

HSP 1 Score: 242.3 bits (617), Expect = 7.9e-61
Identity = 133/219 (60.73%), Positives = 153/219 (69.86%), Query Frame = 1

Query: 1   MKSPEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDFD 60
           M SPE  +WVFDY+LIED P+PGG LP+LD P F   S  FT       T VS    DFD
Sbjct: 1   MGSPENPNWVFDYSLIEDLPIPGGGLPALDPPSFPWPSHSFTPP-----TTVSA---DFD 60

Query: 61  EAHGKLNDIKESGSRKRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESN 120
           E+    +  KE+G RKRFLEL SI   G  PK+DK+VILGDAVRM+  LR EA+KLKESN
Sbjct: 61  ESLANSDGFKEAGCRKRFLELGSISEPGRPPKMDKAVILGDAVRMVTHLRMEAEKLKESN 120

Query: 121 ENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFLPHPSA----FSAPNH 180
           E   EKINE+KAEKNELRDEKQRLK  KD L +++K   TQPSFL HP A    F  P  
Sbjct: 121 EKLQEKINELKAEKNELRDEKQRLKAEKDVLDKQVKALTTQPSFL-HPPAIPPPFPGPGQ 180

Query: 181 VVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           VVG K +P +GYPG+SMW+ MP  A+DTSQDHVLRPPVA
Sbjct: 181 VVGGKLMPFVGYPGISMWQFMPPAAVDTSQDHVLRPPVA 210

BLAST of Cp4.1LG09g08850 vs. NCBI nr
Match: gi|645236723|ref|XP_008224874.1| (PREDICTED: transcription factor ILR3-like [Prunus mume])

HSP 1 Score: 223.0 bits (567), Expect = 5.0e-55
Identity = 123/234 (52.56%), Positives = 152/234 (64.96%), Query Frame = 1

Query: 4   PEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTA------SFRSHRTVVSVLRE 63
           P+  +WVFDY ++ED PVPGGDLP LDLP FT  S  F A       F         ++E
Sbjct: 5   PQNHNWVFDYGVLEDIPVPGGDLPPLDLPGFTWPSHSFVAPAAPSVDFDDSFGNSDSIKE 64

Query: 64  DFDEAHGKLNDIKESGSRK------------RFLELNSILNHGTSPKIDKSVILGDAVRM 123
                  +      +GS+             RFLEL+S+L  G  PK DK+ ILGDAVR+
Sbjct: 65  SGCRKRVRSGSCNVTGSKACREKMRRDRLNDRFLELSSMLEPGRPPKTDKAAILGDAVRV 124

Query: 124 ILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFL 183
           + QLR EAQ+LK+SN +  EKINE+KAEKNELRDEKQRLK  K+N+++++K   TQP FL
Sbjct: 125 VNQLRGEAQQLKDSNGDLQEKINELKAEKNELRDEKQRLKTEKENIERQIKALNTQPGFL 184

Query: 184 PHPSA----FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           PHP+A    FSAP  VVG K +P +GYPGVSMW+ MP  A+DTSQDHVLRPPVA
Sbjct: 185 PHPAAIPGPFSAPGQVVGGKLMPFVGYPGVSMWQFMPPAAVDTSQDHVLRPPVA 238

BLAST of Cp4.1LG09g08850 vs. NCBI nr
Match: gi|950804812|gb|ALN42134.1| (bHLH transcription factor [Prunus pseudocerasus])

HSP 1 Score: 219.9 bits (559), Expect = 4.2e-54
Identity = 122/234 (52.14%), Positives = 151/234 (64.53%), Query Frame = 1

Query: 4   PEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTA------SFRSHRTVVSVLRE 63
           P+  +WVFDY +IED PVPGGDLP LDLP FT  S  F A       F         ++E
Sbjct: 5   PQNHNWVFDYGVIEDIPVPGGDLPPLDLPGFTWPSNSFVAPAAPSVDFDDSFGNSDSIKE 64

Query: 64  DFDEAHGKLNDIKESGSRK------------RFLELNSILNHGTSPKIDKSVILGDAVRM 123
                  +      +GS+             RFLEL+S+L  G  PK DK+ ILGDAVR+
Sbjct: 65  SGFRKRVRSGSCNVTGSKACREKMRRDRLNDRFLELSSMLEPGRPPKTDKAAILGDAVRV 124

Query: 124 ILQLRDEAQKLKESNENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFL 183
           + QLR EAQ+LK+SN +  EKINE+KAEKNELR+EKQRLK  K+N+++++K   TQP FL
Sbjct: 125 VNQLRGEAQQLKDSNGHLQEKINELKAEKNELRNEKQRLKTEKENIERQIKALNTQPGFL 184

Query: 184 PHPSA----FSAPNHVVGAKFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA 216
           PHP+A    F AP  VVG K +P +GYPGVSMW+ MP  A+DTSQDHVLRPPVA
Sbjct: 185 PHPAAIPGPFPAPGQVVGGKLMPFVGYPGVSMWQFMPPAAVDTSQDHVLRPPVA 238

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value   Identity (%)  Description
BH115_ARATH   2.7e-37   43.72         Transcription factor bHLH115 OS=Arabidopsis thaliana GN=BHLH115 PE=2 SV=1
ILR3_ARATH    4.7e-37   58.62         Transcription factor ILR3 OS=Arabidopsis thaliana GN=ILR3 PE=1 SV=1
BH104_ARATH   5.4e-25   50.35         Transcription factor bHLH104 OS=Arabidopsis thaliana GN=BHLH104 PE=2 SV=1
BH034_ARATH   1.1e-22   42.31         Transcription factor bHLH34 OS=Arabidopsis thaliana GN=BHLH34 PE=2 SV=1

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LX71_CUCSA  1.1e-77   68.94         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G065960 PE=4 SV=1
W9QRW3_9ROSA      5.5e-61   60.73         Uncharacterized protein OS=Morus notabilis GN=L484_019764 PE=4 SV=1
M5WIN3_PRUPE      2.9e-54   52.14         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010730mg PE=4 SV=1
A0A0U2I0R6_9ROSA  2.9e-54   52.14         BHLH transcription factor OS=Prunus pseudocerasus PE=2 SV=1
A0A061GF70_THECC  2.9e-54   55.31         DNA binding protein, putative isoform 2 OS=Theobroma cacao GN=TCM_030082 PE=4 SV...

TAIR10
Match Name   E-value   Identity (%)  Description
AT5G54680.1  2.6e-38   58.62         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G51070.2  1.2e-35   54.55         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G14410.1  3.0e-26   50.35         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G23210.1  6.3e-24   42.31         basic helix-loop-helix (bHLH) DNA-binding superfamily protein

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449454698|ref|XP_004145091.1|  1.6e-77   68.94         PREDICTED: transcription factor ILR3-like [Cucumis sativus]
gi|659067851|ref|XP_008441610.1|  2.7e-77   69.79         PREDICTED: transcription factor ILR3-like [Cucumis melo]
gi|703085262|ref|XP_010092690.1|  7.9e-61   60.73         hypothetical protein L484_019764 [Morus notabilis]
gi|645236723|ref|XP_008224874.1|  5.0e-55   52.56         PREDICTED: transcription factor ILR3-like [Prunus mume]
gi|950804812|gb|ALN42134.1|       4.2e-54   52.14         bHLH transcription factor [Prunus pseudocerasus]

The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity

Vocabulary: INTERPRO
Term       Definition
IPR011598  bHLH_dom

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG09g08850.1  Cp4.1LG09g08850.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02

IPR Term   IPR Description                                 Source   Source Term        Source Description                        Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10                                            coord: 76..131, score: 1.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain  coord: 76..137, score: 9.
None       No IPR available                                unknown  Coil               Coil                                      coord: 106..161, scor
None       No IPR available                                PANTHER  PTHR23042          CIRCADIAN PROTEIN CLOCK/ARNT/BMAL/PAS     coord: 1..215, score: 4.0
None       No IPR available                                PANTHER  PTHR23042:SF67     TRANSCRIPTION FACTOR ILR3-RELATED         coord: 1..215, score: 4.0
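
The `coord` values in the InterPro alignments are positions on the protein sequence. As a hedged sketch, a domain segment can be cut out as shown below, assuming the coordinates are 1-based and inclusive (not stated on this page). `domain_slice` is a hypothetical helper and `PROTEIN` is copied from the protein sequence listed earlier in this record.

```python
PROTEIN = (
    "MKSPEITDWVFDYALIEDFPVPGGDLPSLDLPVFTLSSRDFTASFRSHRTVVSVLREDFD"
    "EAHGKLNDIKESGSRKRFLELNSILNHGTSPKIDKSVILGDAVRMILQLRDEAQKLKESN"
    "ENFLEKINEMKAEKNELRDEKQRLKEAKDNLKQKMKTFITQPSFLPHPSAFSAPNHVVGA"
    "KFVPVIGYPGVSMWKLMPQGAIDTSQDHVLRPPVA"
)

def domain_slice(protein, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]

# GENE3D hit for IPR011598 (bHLH domain), coord: 76..131
print(domain_slice(PROTEIN, 76, 131))
```

The segment begins at the same residue (`KRFLELNSIL...`) where several of the Arabidopsis BLAST alignments above start, at query position 76.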

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG09g08850  Cp4.1LG20g03990  Cucurbita pepo (Zucchini)  cpecpeB048
Cp4.1LG09g08850  Cp4.1LG02g01250  Cucurbita pepo (Zucchini)  cpecpeB050