IVF0026607 (gene) Melon (IVF77) v1

Overview
Name: IVF0026607
Type: gene
Organism: Cucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: transcription factor bHLH67 isoform X1
Location: tig00195367: 782274 .. 784966 (-)
RNA-Seq Expression: IVF0026607
Synteny: IVF0026607
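
The Location field above packs the contig name, the coordinates, and the strand into one string. The following is a minimal Python sketch of how such a string might be split into its parts. The function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its components."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "contig": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("tig00195367: 782274 .. 784966 (-)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2693 bp on the minus strand of tig00195367.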
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTGAACTCTCAATCTATTTATTTACCGCCCTTTGTTTGTCTTTACTGCCTTTTCCACCTCCTCTTCATCACTGTTTAACCTCTCAGCTTCCTCTTCGGAGAACAACATTCTTCTCCTCTAGCCATAGCATTTAATATTATTCTATAAAGAGTATCAACTGCTCCAAAAACGAAGGAATATGTTTTGGGTTGATATGGGGTCTCCTTCTGCTCATTGGGTATTACATTTAATGCTATTGTTTTGTTAGCATCACTGTGAAATGCATGTTGTTACCTCTAGACAGAGAAGTAGAAAGGCTAAATTAACACACTTCAGAACTGCTCTGCCGTTCAAATGGAGAGGCTCCAAGGACCAATTAATCCTTGCGTATGATAAATGCTATCCCTCTCTTTTTCTATTTATTTCCGTTTTCCTTTTCCATGGTCACCCACTTTTAGCATTTTCTTTTCACAACAAGTGCACTGCATTCATTTAATTCTTCAAATGCTTTGCATTGAATGGCTTTGTGTAATTTCTCCAGGATGGATATTAAAATATATACTTTGTACTTCAAATGAACGTGTTTTGTTTCTTATTATATATTTCACTTCTCACTCTGGATTCTATTCTTTTCAGCTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCTCAAACTTGGGATTTGAAGAATCAGAAGAAGTTTGTTTACTAACCTCAAGTTTGGAAGATAAAATACCATTCCTTCAGATGCTGCAGAGTGTGGAATCGCAGTCATTCAAGGAACCTAACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAAACAAACCATGGGAAGAGGGGGTTAGTAAAATTCAGGAGCTTGTAGAGTTGTTTTCTTCACCTATAAACTCAGAAACGAAGGACCAAAATCAACCTCCAAATTCGGACAGAGTGTTCTCAGAGTGTAACCAAAATCAAGGCTTATCCCAGACCCAAATGACAAAGTTTCCTCCGGTCATCAAGGAAAGAAGAAAACGAAAGAGATCCAAACCAACAAAAAACAAGGAAGAAGTAGAGAGCCAAAGAATGACCCACATTGCTGTTGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCTCTCATACCTACGTCCTATGTACAAAGGGTATACTAAAACAAAATATCGAAATTTTAGTTTTCAAAGATCATGTTGTTTTCTTTGGTACTGATTAATTAAGAATGTCACATGCTGATTTGACCCGTTTTTTGTGCACATGACCAGGGTGATCAGGCATCCATAATTGGGGGAGCAATAGACTTCGTGAAGGAATTGGAGCAGCTACTTGAATCTTTGGAAGCACTGAGGAAAGAAAGAAAGGGAGCGGAAGGTGAGTGTAAGGATGAGCAGTCAGAAGTGCGAGTGGCATCAAATAGGAGAATAGGAGAAGGGGTTTGCGCCGAGCTCAGGTCAGAAGTCGCTGAGATTGAGGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAGATGCCGCAAAAGGCAAGGTCAGTTGTTGAAAGTCATTGTTGCTTTAGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATAACCTCACAAACCGCTGCCACCATGCTTTACTCCTTTAATCTAAAGGTATTGCTCTGCTATTATTCATTATTGCCCTTTTCTTTTCTTCTTTAGCGTTTGGATAGAGGGGTGGAAAAGTAGACTACTACTCTGTGTCCTCCATCTGTAGTCAACCTCAACTAGTCTGACTTTGGAACTCAAGCACAGTGCTTCTAGTTTCAACTTCCAAGAGGCCGGCTCTGTTCCTACTTCTTTTGCTTTTTGTTTTTTCTAAAAAAAATATTTAACAGTATCTCGGTGCAAGACTCATCTTTTTCGCACAGAAAAGAGCATGCATTATTTATAATTGCATGTACCATTCTTGTGGCTAAGATGGCTAAGAATATGATTATCTTTGATTCGGTTGGGAGTAAGATTGTTTTAA
CAAACGAGGGTCTTTCGTTGGTTGCATGTGAATCCAGATAGAAGATGAATGTAAGCTAGAATCAGAGGAGCAGATTGCAGCAACAGTTAATCAAATATTCAGTTTTATGAACAATGGCAGACTGGTCAATGAGGCCAAAGGCAAATTTCAGGCAGTACAGTGGCAGTCGCTGACTATGGAAGAGGACCCCCCTTTCCTCCCCTATTTCCCATTCGAAAAAAGGAAAAATAGAAAATCCAAGCTAGGCAAATAAGGAGGAGGGTAAGTTGGAGTGCGTGTGGCTTGTGGGCACCTCGAATACTGTTCTCTAGTGTACCACAGATGGGTTGGTCAAATGGTTATGGCAGGCTTAAAACAAGTGTTTGTCGTCTCCTACTTTACTCAAATTCATAAATCTTTTACAAAATAAATTAAACAACACAGCTTTTTTATATTCGTTTTCTTATTGGGATTTGGGTTTGAGCCATGCGCAAGAGGATGGGAGACGTATTGCGCATAGTCATTCATTCATATTTTTATTTTCATTGATATGAATTTTTCATTGAGTTTGAGCAGCGGAGAACTGATAAAAATATCACATATTTTGGAAGCAAATAATTAGAAATGAGAA

mRNA sequence

CTTGAACTCTCAATCTATTTATTTACCGCCCTTTGTTTGTCTTTACTGCCTTTTCCACCTCCTCTTCATCACTGTTTAACCTCTCAGCTTCCTCTTCGGAGAACAACATTCTTCTCCTCTAGCCATAGCATTTAATATTATTCTATAAAGAGTATCAACTGCTCCAAAAACGAAGGAATATGTTTTGGGTTGATATGGGGTCTCCTTCTGCTCATTGGGTATTACATTTAATGCTATTGTTTTGTTAGCATCACTGTGAAATGCATGTTGTTACCTCTAGACAGAGAAGTAGAAAGGCTAAATTAACACACTTCAGAACTGCTCTGCCGTTCAAATGGAGAGGCTCCAAGGACCAATTAATCCTTGCCTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCTCAAACTTGGGATTTGAAGAATCAGAAGAAGTTTGTTTACTAACCTCAAGTTTGGAAGATAAAATACCATTCCTTCAGATGCTGCAGAGTGTGGAATCGCAGTCATTCAAGGAACCTAACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAAACAAACCATGGGAAGAGGGGGTTAGTAAAATTCAGGAGCTTGTAGAGTTGTTTTCTTCACCTATAAACTCAGAAACGAAGGACCAAAATCAACCTCCAAATTCGGACAGAGTGTTCTCAGAGTGTAACCAAAATCAAGGCTTATCCCAGACCCAAATGACAAAGTTTCCTCCGGTCATCAAGGAAAGAAGAAAACGAAAGAGATCCAAACCAACAAAAAACAAGGAAGAAGTAGAGAGCCAAAGAATGACCCACATTGCTGTTGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCTCTCATACCTACGTCCTATGTACAAAGGGGTGATCAGGCATCCATAATTGGGGGAGCAATAGACTTCGTGAAGGAATTGGAGCAGCTACTTGAATCTTTGGAAGCACTGAGGAAAGAAAGAAAGGGAGCGGAAGGTGAGTGTAAGGATGAGCAGTCAGAAGTGCGAGTGGCATCAAATAGGAGAATAGGAGAAGGGGTTTGCGCCGAGCTCAGGTCAGAAGTCGCTGAGATTGAGGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAGATGCCGCAAAAGGCAAGGTCAGTTGTTGAAAGTCATTGTTGCTTTAGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATAACCTCACAAACCGCTGCCACCATGCTTTACTCCTTTAATCTAAAGATAGAAGATGAATGTAAGCTAGAATCAGAGGAGCAGATTGCAGCAACAGTTAATCAAATATTCAGTTTTATGAACAATGGCAGACTGGTCAATGAGGCCAAAGGCAAATTTCAGGCAGTACAGTGGCAGTCGCTGACTATGGAAGAGGACCCCCCTTTCCTCCCCTATTTCCCATTCGAAAAAAGGAAAAATAGAAAATCCAAGCTAGGCAAATAAGGAGGAGGGTAAGTTGGAGTGCGTGTGGCTTGTGGGCACCTCGAATACTGTTCTCTAGTGTACCACAGATGGGTTGGTCAAATGGTTATGGCAGGCTTAAAACAAGTGTTTGTCGTCTCCTACTTTACTCAAATTCATAAATCTTTTACAAAATAAATTAAACAACACAGCTTTTTTATATTCGTTTTCTTATTGGGATTTGGGTTTGAGCCATGCGCAAGAGGATGGGAGACGTATTGCGCATAGTCATTCATTCATATTTTTATTTTCATTGATATGAATTTTTCATTGAGTTTGAGCAGCGGAGAACTGATAAAAATATCACATATTTTGGAAGCAAATAATTAGAAATGAGAA

Coding sequence (CDS)

ATGGAGAGGCTCCAAGGACCAATTAATCCTTGCCTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCTCAAACTTGGGATTTGAAGAATCAGAAGAAGTTTGTTTACTAACCTCAAGTTTGGAAGATAAAATACCATTCCTTCAGATGCTGCAGAGTGTGGAATCGCAGTCATTCAAGGAACCTAACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAAACAAACCATGGGAAGAGGGGGTTAGTAAAATTCAGGAGCTTGTAGAGTTGTTTTCTTCACCTATAAACTCAGAAACGAAGGACCAAAATCAACCTCCAAATTCGGACAGAGTGTTCTCAGAGTGTAACCAAAATCAAGGCTTATCCCAGACCCAAATGACAAAGTTTCCTCCGGTCATCAAGGAAAGAAGAAAACGAAAGAGATCCAAACCAACAAAAAACAAGGAAGAAGTAGAGAGCCAAAGAATGACCCACATTGCTGTTGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCTCTCATACCTACGTCCTATGTACAAAGGGGTGATCAGGCATCCATAATTGGGGGAGCAATAGACTTCGTGAAGGAATTGGAGCAGCTACTTGAATCTTTGGAAGCACTGAGGAAAGAAAGAAAGGGAGCGGAAGGTGAGTGTAAGGATGAGCAGTCAGAAGTGCGAGTGGCATCAAATAGGAGAATAGGAGAAGGGGTTTGCGCCGAGCTCAGGTCAGAAGTCGCTGAGATTGAGGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAGATGCCGCAAAAGGCAAGGTCAGTTGTTGAAAGTCATTGTTGCTTTAGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATAACCTCACAAACCGCTGCCACCATGCTTTACTCCTTTAATCTAAAGATAGAAGATGAATGTAAGCTAGAATCAGAGGAGCAGATTGCAGCAACAGTTAATCAAATATTCAGTTTTATGAACAATGGCAGACTGGTCAATGAGGCCAAAGGCAAATTTCAGGCAGTACAGTGGCAGTCGCTGACTATGGAAGAGGACCCCCCTTTCCTCCCCTATTTCCCATTCGAAAAAAGGAAAAATAGAAAATCCAAGCTAGGCAAATAA

Protein sequence

MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLTMEEDPPFLPYFPFEKRKNRKSKLGK
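
The protein sequence above should be the conceptual translation of the coding sequence. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAGAGGCTCCAAGGACCAATTAATCCTTGC"  # first 11 codons of the CDS
cds_end = "AAGCTAGGCAAATAA"                      # last 5 codons, including the stop
print(translate(cds_start))  # MERLQGPINPC
print(translate(cds_end))    # KLGK
```

Both fragments agree with the start (MERLQGPINPC) and the end (KLGK) of the protein sequence listed above. Passing the full CDS string to the same function would extend the check to the whole record.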
Homology
BLAST of IVF0026607 vs. ExPASy Swiss-Prot
Match: Q700E4 (Transcription factor bHLH67 OS=Arabidopsis thaliana OX=3702 GN=BHLH67 PE=2 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 3.0e-66
Identity = 162/366 (44.26%), Positives = 233/366 (63.66%), Query Frame = 0

Query: 1   MERLQGPINPCLYGEYSETGCSE----QEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQS 60
           MER QG INPC +    +    E     E  +  F+E EE      SL+D +PFLQMLQS
Sbjct: 1   MERFQGHINPCFFDRKPDVRSLEVQGFAEAQSFAFKEKEE-----ESLQDTVPFLQMLQS 60

Query: 61  VESQSF---KEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSPINSET----KDQNQP 120
            +  SF   KEPNF +LL LQ L +PWE E    +++    F SP+ SET    +  NQ 
Sbjct: 61  EDPSSFFSIKEPNFLTLLSLQTLKEPWELERYLSLED--SQFHSPVQSETNRFMEGANQA 120

Query: 121 PNSDRV-FSECNQNQGLS-----------QTQMTKFPP--VIKERRKRKRSKPTKNKEEV 180
            +S  + FS+ N     S           + ++    P  + +E+RKR+++KP+KN EE+
Sbjct: 121 VSSQEIPFSQANMTLPSSTSSPLSAHSRRKRKINHLLPQEMTREKRKRRKTKPSKNNEEI 180

Query: 181 ESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESL 240
           E+QR+ HIAVERNRRRQMN+H+N +++L+P SY+QRGDQASI+GGAI++VK LEQ+++SL
Sbjct: 181 ENQRINHIAVERNRRRQMNEHINSLRALLPPSYIQRGDQASIVGGAINYVKVLEQIIQSL 240

Query: 241 EALRKERKGAEGECKDEQSEVRVASNRRIGEG-----VCAELRSEVAEIEVTMIQTHVNL 300
           E+ ++ ++ +  E       V  A N   G          E ++ + +IE T+IQ HV+L
Sbjct: 241 ESQKRTQQQSNSEV------VENALNHLSGISSNDLWTTLEDQTCIPKIEATVIQNHVSL 300

Query: 301 KIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAA 336
           K++C K+QGQLLK I++LE L+LTVLHLNIT+ + +++ YSFNLK+EDEC LES ++I A
Sbjct: 301 KVQCEKKQGQLLKGIISLEKLKLTVLHLNITTSSHSSVSYSFNLKMEDECDLESADEITA 353
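
Each HSP summary line reports identical and positive (conservative) matches as a fraction of the alignment length, followed by a percentage. The following is a minimal Python sketch that recomputes those percentages from the fractions. The function name `check_hsp_stats` and the tolerance are hypothetical, and the input is the summary line of the alignment above.

```python
import re

def check_hsp_stats(line, tol=0.01):
    """Recompute each 'label = n/m (p%)' entry and compare with the reported p."""
    results = {}
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    for label, num, den, pct in re.findall(pattern, line):
        recomputed = 100 * int(num) / int(den)
        # Store the rounded value and whether it agrees with the reported one.
        results[label] = (round(recomputed, 2), abs(recomputed - float(pct)) <= tol)
    return results

line = "Identity = 162/366 (44.26%), Positives = 233/366 (63.66%), Query Frame = 0"
stats = check_hsp_stats(line)
print(stats)
```

For this hit the recomputed values, 44.26% identity and 63.66% positives over 366 aligned columns, agree with the reported ones.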

BLAST of IVF0026607 vs. ExPASy Swiss-Prot
Match: O81037 (Transcription factor bHLH70 OS=Arabidopsis thaliana OX=3702 GN=BHLH70 PE=2 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 9.8e-65
Identity = 160/331 (48.34%), Positives = 214/331 (64.65%), Query Frame = 0

Query: 24  QEFSNLGFEESEEVCLLTSSLED-KIPFLQMLQSVESQ----SFKEPNFQSLLKLQHLNK 83
           ++  +   EE E+     S L+D  IPFLQMLQ  E      SFK+P+F +LL LQ L K
Sbjct: 43  EDHQSFALEEEEQQLSTPSLLQDTTIPFLQMLQQSEDPSPFLSFKDPSFLALLSLQTLEK 102

Query: 84  PWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQGLSQTQMTKFPP---- 143
           PWE       E+ E F SPI+SET      P+ + V +E   NQ L    +         
Sbjct: 103 PWELENYLPHEVPE-FHSPIHSETNHYYHNPSLEGV-NEAISNQELPFNPLENARSRRKR 162

Query: 144 --------VIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTS 203
                   + +E+RKR+R+KPTKN EE+ESQRMTHIAVERNRRRQMN HLN ++S+IP+S
Sbjct: 163 KNNNLASLMTREKRKRRRTKPTKNIEEIESQRMTHIAVERNRRRQMNVHLNSLRSIIPSS 222

Query: 204 YVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEG--ECKDEQSEVRVASNRRIG 263
           Y+QRGDQASI+GGAIDFVK LEQ L+SLEA ++ ++  +   +  ++ S   ++SN+   
Sbjct: 223 YIQRGDQASIVGGAIDFVKILEQQLQSLEAQKRSQQSDDNKEQIPEDNSLRNISSNKLRA 282

Query: 264 EGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTA 323
                E +S   +IE T+I++HVNLKI+C ++QGQLL+ I+ LE LR TVLHLNITS T 
Sbjct: 283 SN--KEEQSSKLKIEATVIESHVNLKIQCTRKQGQLLRSIILLEKLRFTVLHLNITSPTN 342

Query: 324 ATMLYSFNLKIEDECKLESEEQIAATVNQIF 336
            ++ YSFNLK+EDEC L S ++I A + QIF
Sbjct: 343 TSVSYSFNLKMEDECNLGSADEITAAIRQIF 369

BLAST of IVF0026607 vs. ExPASy Swiss-Prot
Match: Q9M128 (Transcription factor bHLH57 OS=Arabidopsis thaliana OX=3702 GN=BHLH57 PE=1 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 4.3e-52
Identity = 139/312 (44.55%), Positives = 192/312 (61.54%), Query Frame = 0

Query: 42  SSLEDKIPFLQMLQSVESQ-SFKEPN--FQSLLKLQHLNKPWEEGVSKIQELVELFSSPI 101
           +++E+KIPFLQMLQ +E   +  EPN   QSLL++Q L                   S +
Sbjct: 20  TTMEEKIPFLQMLQCIEHPFTTTEPNQFLQSLLQIQTLES----------------KSCL 79

Query: 102 NSETKDQNQPPNSDRVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQR 161
             ET  +  P  +D    +     G            +KE+RKRKR++  KNK+EVE+QR
Sbjct: 80  TLETNIKRDPGQTDDPEKDPRTENG---------AVTVKEKRKRKRTRAPKNKDEVENQR 139

Query: 162 MTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALR 221
           MTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQLL+SLEA  
Sbjct: 140 MTHIAVERNRRRQMNEHLNSLRSLMPPSFLQRGDQASIVGGAIDFIKELEQLLQSLEA-E 199

Query: 222 KERKGAEGECKD---EQSEVRVASNRRIG-------EGVCAEL-RSEVAEIEVTMIQTHV 281
           K + G +   K      S     +N  I         G  A     +  E+E T+IQ HV
Sbjct: 200 KRKDGTDETPKTASCSSSSSLACTNSSISSVSTTSENGFTARFGGGDTTEVEATVIQNHV 259

Query: 282 NLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQI 340
           +LK+RC++ + Q+LK IV++E+L+L +LHL I+S +   ++YSFNLK+ED CKL S ++I
Sbjct: 260 SLKVRCKRGKRQILKAIVSIEELKLAILHLTISS-SFDFVIYSFNLKMEDGCKLGSADEI 304

BLAST of IVF0026607 vs. ExPASy Swiss-Prot
Match: Q56YJ8 (Transcription factor FAMA OS=Arabidopsis thaliana OX=3702 GN=FAMA PE=1 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 5.8e-41
Identity = 108/225 (48.00%), Positives = 151/225 (67.11%), Query Frame = 0

Query: 139 RRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIG 198
           + KRKR++ +K  EEVESQRMTHIAVERNRR+QMN+HL V++SL+P SYVQRGDQASIIG
Sbjct: 177 KSKRKRARTSKTSEEVESQRMTHIAVERNRRKQMNEHLRVLRSLMPGSYVQRGDQASIIG 236

Query: 199 GAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRI--------------- 258
           GAI+FV+ELEQLL+ LE+  ++R+   GE   + +    +S+  I               
Sbjct: 237 GAIEFVRELEQLLQCLES--QKRRRILGETGRDMTTTTTSSSSPITTVANQAQPLIITGN 296

Query: 259 ------GEGV---CAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTV 318
                 G G+    AE +S +A++EV ++     +KI  R+R GQL+K I ALEDL L++
Sbjct: 297 VTELEGGGGLREETAENKSCLADVEVKLLGFDAMIKILSRRRPGQLIKTIAALEDLHLSI 356

Query: 319 LHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMN 340
           LH NIT+    T+LYSFN+KI  E +  +E+ IA+++ QIFSF++
Sbjct: 357 LHTNITTM-EQTVLYSFNVKITSETRFTAED-IASSIQQIFSFIH 397

BLAST of IVF0026607 vs. ExPASy Swiss-Prot
Match: Q9C7T4 (Transcription factor bHLH96 OS=Arabidopsis thaliana OX=3702 GN=BHLH96 PE=1 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 4.9e-40
Identity = 97/206 (47.09%), Positives = 142/206 (68.93%), Query Frame = 0

Query: 139 RRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIG 198
           RRKR+R++ +KNKEE+E+QRMTHIAVERNRR+QMN++L V++SL+P  Y QRGDQASI+G
Sbjct: 105 RRKRRRTRSSKNKEEIENQRMTHIAVERNRRKQMNEYLAVLRSLMPPYYAQRGDQASIVG 164

Query: 199 GAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRIGE----------GVC 258
           GAI+++KELE  L+S+E   K      G   D+      +S+    +             
Sbjct: 165 GAINYLKELEHHLQSMEPPVKTATEDTGAGHDQTKTTSASSSGPFSDFFAFPQYSNRPTS 224

Query: 259 AELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATML 318
           A     +AEIEVTM+++H +LKI  +KR  QLLK++ +++ LRLT+LHLN+T++   ++L
Sbjct: 225 AAAAEGMAEIEVTMVESHASLKILAKKRPRQLLKLVSSIQSLRLTLLHLNVTTRD-DSVL 284

Query: 319 YSFNLKIEDECKLESEEQIAATVNQI 335
           YS ++K+E+  +L + E IAA VNQI
Sbjct: 285 YSISVKVEEGSQLNTVEDIAAAVNQI 309

BLAST of IVF0026607 vs. ExPASy TrEMBL
Match: A0A1S3AVB5 (transcription factor bHLH67 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103483091 PE=4 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 6.8e-202
Identity = 371/377 (98.41%), Positives = 373/377 (98.94%), Query Frame = 0

Query: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ
Sbjct: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ 120
           SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ
Sbjct: 61  SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ 120

Query: 121 NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK 180
           NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK
Sbjct: 121 NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK 180

Query: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN 240
           SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN
Sbjct: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN 240

Query: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT 300
           RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT
Sbjct: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT 300

Query: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLT 360
           SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLT
Sbjct: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLT 360

Query: 361 MEEDPPFLPYFPFEKRK 378
           MEEDPPF P FP  K++
Sbjct: 361 MEEDPPFPPLFPIRKKE 377

BLAST of IVF0026607 vs. ExPASy TrEMBL
Match: A0A5D3DAW7 (Transcription factor bHLH67 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00040 PE=4 SV=1)

HSP 1 Score: 687.6 bits (1773), Expect = 3.0e-194
Identity = 360/366 (98.36%), Positives = 362/366 (98.91%), Query Frame = 0

Query: 12  LYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLL 71
           LYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLL
Sbjct: 14  LYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLL 73

Query: 72  KLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQGLSQTQMTK 131
           KLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQGLSQTQMTK
Sbjct: 74  KLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQGLSQTQMTK 133

Query: 132 FPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRG 191
           FPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRG
Sbjct: 134 FPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRG 193

Query: 192 DQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRIGEGVCAEL 251
           DQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRIGEGVCAEL
Sbjct: 194 DQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRIGEGVCAEL 253

Query: 252 RSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSF 311
           RSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSF
Sbjct: 254 RSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSF 313

Query: 312 NLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLTMEEDPPFLPYF 371
           NLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLTMEEDPPF P F
Sbjct: 314 NLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLTMEEDPPFPPLF 373

Query: 372 PFEKRK 378
           P  K++
Sbjct: 374 PIRKKE 379

BLAST of IVF0026607 vs. ExPASy TrEMBL
Match: A0A0A0L6W2 (BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G116640 PE=4 SV=1)

HSP 1 Score: 636.7 bits (1641), Expect = 6.2e-179
Identity = 334/352 (94.89%), Positives = 341/352 (96.88%), Query Frame = 0

Query: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQGPINPC YGEYSETGCSEQEF+NLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ
Sbjct: 1   MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ 120
           SFKEPNFQSLLKLQHL KPWE GV+KIQELV+LFSSPINSETKDQNQPP SDRVFSECNQ
Sbjct: 61  SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ 120

Query: 121 NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK 180
           NQG+SQTQMTK PPVIKERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIK
Sbjct: 121 NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK 180

Query: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN 240
           SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECK EQSEVRVASN
Sbjct: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN 240

Query: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT 300
           RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRC KRQ QLLKVIVALEDLRLTVLHLNIT
Sbjct: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT 300

Query: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 353
           SQTAATMLYSFNLKIEDECKLESEEQIAATVN+IFSF+NNGRLVNEAK  F+
Sbjct: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFR 352

BLAST of IVF0026607 vs. ExPASy TrEMBL
Match: A0A1S3AUE4 (transcription factor bHLH70 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103483091 PE=4 SV=1)

HSP 1 Score: 607.1 bits (1564), Expect = 5.2e-170
Identity = 319/325 (98.15%), Positives = 321/325 (98.77%), Query Frame = 0

Query: 53  MLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSD 112
           MLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSD
Sbjct: 1   MLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSD 60

Query: 113 RVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQM 172
           RVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQM
Sbjct: 61  RVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQM 120

Query: 173 NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQ 232
           NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQ
Sbjct: 121 NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQ 180

Query: 233 SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRL 292
           SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRL
Sbjct: 181 SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRL 240

Query: 293 TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 352
           TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ
Sbjct: 241 TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 300

Query: 353 AVQWQSLTMEEDPPFLPYFPFEKRK 378
           AVQWQSLTMEEDPPF P FP  K++
Sbjct: 301 AVQWQSLTMEEDPPFPPLFPIRKKE 325

BLAST of IVF0026607 vs. ExPASy TrEMBL
Match: A0A6J1CV34 (LOW QUALITY PROTEIN: transcription factor bHLH67 OS=Momordica charantia OX=3673 GN=LOC111014634 PE=4 SV=1)

HSP 1 Score: 532.7 bits (1371), Expect = 1.3e-147
Identity = 309/404 (76.49%), Positives = 330/404 (81.68%), Query Frame = 0

Query: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQG I+PC YGEYSE GCSEQ F++L FEESEE   LTS+LEDK+PFLQMLQSVESQ
Sbjct: 1   MERLQGHIHPCFYGEYSERGCSEQGFTSLRFEESEEAYFLTSTLEDKMPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSPINSETKDQNQPPNS----DRVF 120
            FKEPNFQ+LLKLQHLNKPWE E VS+IQELVEL+SSPINSETKDQNQ PNS    D V 
Sbjct: 61  PFKEPNFQNLLKLQHLNKPWELEEVSQIQELVELYSSPINSETKDQNQHPNSASYTDGVS 120

Query: 121 SECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDH 180
           SECNQNQ   +TQM K PPV KERRKRKR++P KNKEEVESQRMTHIAVERNRRRQMNDH
Sbjct: 121 SECNQNQ---RTQMAKAPPVTKERRKRKRTRPAKNKEEVESQRMTHIAVERNRRRQMNDH 180

Query: 181 LNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDE---- 240
           LNVIKSLIPTSY+ RGDQASIIGGAI FVKELEQLLESLEA   +RKG EG CK +    
Sbjct: 181 LNVIKSLIPTSYIHRGDQASIIGGAIHFVKELEQLLESLEA---QRKGEEGGCKAKGELS 240

Query: 241 --------QSEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKV 300
                    S + +ASN RIGEGVCAE +SEVAEIEVTMIQTHVNLKI+C KRQGQLLK 
Sbjct: 241 SVGSRSPTSSAMGMASNGRIGEGVCAEHKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKA 300

Query: 301 IVALEDLRLTVLHLNI-TSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGR 360
           IVALEDLRLTVLHLNI TSQ  ATM YSFNLKIEDECK+ S EQIAATV+QIFSFMN+GR
Sbjct: 301 IVALEDLRLTVLHLNISTSQATATMHYSFNLKIEDECKIGSAEQIAATVHQIFSFMNDGR 360

Query: 361 LVNE-AKGKFQAVQWQSLTMEEDPPFLPYFPFEKRKNRK-SKLG 385
           LV E  KGKFQAVQWQSLTME      P  PF K K RK SK G
Sbjct: 361 LVKEGGKGKFQAVQWQSLTMENGR--TPPLPFPKIKVRKLSKYG 396

BLAST of IVF0026607 vs. NCBI nr
Match: XP_008437748.1 (PREDICTED: transcription factor bHLH67 isoform X1 [Cucumis melo])

HSP 1 Score: 711 bits (1836), Expect = 1.10e-257
Identity = 371/376 (98.67%), Positives = 372/376 (98.94%), Query Frame = 0

Query: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ
Sbjct: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ 120
           SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ
Sbjct: 61  SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ 120

Query: 121 NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK 180
           NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK
Sbjct: 121 NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK 180

Query: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN 240
           SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN
Sbjct: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN 240

Query: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT 300
           RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT
Sbjct: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT 300

Query: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLT 360
           SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLT
Sbjct: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLT 360

Query: 361 MEEDPPFLPYFPFEKR 376
           MEEDPPF P FP  K+
Sbjct: 361 MEEDPPFPPLFPIRKK 376

BLAST of IVF0026607 vs. NCBI nr
Match: KAA0048794.1 (transcription factor bHLH67 isoform X1 [Cucumis melo var. makuwa] >TYK20747.1 transcription factor bHLH67 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 687 bits (1772), Expect = 6.70e-248
Identity = 361/373 (96.78%), Positives = 363/373 (97.32%), Query Frame = 0

Query: 4   LQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFK 63
           +  P    LYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFK
Sbjct: 6   MGSPSAHWLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFK 65

Query: 64  EPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQG 123
           EPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQG
Sbjct: 66  EPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQG 125

Query: 124 LSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLI 183
           LSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLI
Sbjct: 126 LSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLI 185

Query: 184 PTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRI 243
           PTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRI
Sbjct: 186 PTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRI 245

Query: 244 GEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQT 303
           GEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQT
Sbjct: 246 GEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQT 305

Query: 304 AATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLTMEE 363
           AATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLTMEE
Sbjct: 306 AATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQAVQWQSLTMEE 365

Query: 364 DPPFLPYFPFEKR 376
           DPPF P FP  K+
Sbjct: 366 DPPFPPLFPIRKK 378

BLAST of IVF0026607 vs. NCBI nr
Match: XP_004134273.1 (transcription factor bHLH67 isoform X2 [Cucumis sativus] >KGN56327.1 hypothetical protein Csa_011171 [Cucumis sativus])

HSP 1 Score: 635 bits (1637), Expect = 1.09e-227
Identity = 334/352 (94.89%), Positives = 341/352 (96.88%), Query Frame = 0

Query: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQGPINPC YGEYSETGCSEQEF+NLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ
Sbjct: 1   MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ 120
           SFKEPNFQSLLKLQHL KPWE GV+KIQELV+LFSSPINSETKDQNQPP SDRVFSECNQ
Sbjct: 61  SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ 120

Query: 121 NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK 180
           NQG+SQTQMTK PPVIKERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIK
Sbjct: 121 NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK 180

Query: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN 240
           SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECK EQSEVRVASN
Sbjct: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN 240

Query: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT 300
           RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRC KRQ QLLKVIVALEDLRLTVLHLNIT
Sbjct: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT 300

Query: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 352
           SQTAATMLYSFNLKIEDECKLESEEQIAATVN+IFSF+NNGRLVNEAK  F+
Sbjct: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFR 352

BLAST of IVF0026607 vs. NCBI nr
Match: XP_008437749.1 (PREDICTED: transcription factor bHLH70 isoform X2 [Cucumis melo])

HSP 1 Score: 606 bits (1563), Expect = 6.19e-217
Identity = 319/324 (98.46%), Positives = 320/324 (98.77%), Query Frame = 0

Query: 53  MLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSD 112
           MLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSD
Sbjct: 1   MLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSD 60

Query: 113 RVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQM 172
           RVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQM
Sbjct: 61  RVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQM 120

Query: 173 NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQ 232
           NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQ
Sbjct: 121 NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQ 180

Query: 233 SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRL 292
           SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRL
Sbjct: 181 SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRL 240

Query: 293 TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 352
           TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ
Sbjct: 241 TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 300

Query: 353 AVQWQSLTMEEDPPFLPYFPFEKR 376
           AVQWQSLTMEEDPPF P FP  K+
Sbjct: 301 AVQWQSLTMEEDPPFPPLFPIRKK 324

BLAST of IVF0026607 vs. NCBI nr
Match: XP_031738937.1 (transcription factor bHLH70 isoform X1 [Cucumis sativus])

HSP 1 Score: 610 bits (1572), Expect = 1.09e-216
Identity = 323/341 (94.72%), Positives = 330/341 (96.77%), Query Frame = 0

Query: 12  LYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLL 71
            YGEYSETGCSEQEF+NLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLL
Sbjct: 81  FYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLL 140

Query: 72  KLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQGLSQTQMTK 131
           KLQHL KPWE GV+KIQELV+LFSSPINSETKDQNQPP SDRVFSECNQNQG+SQTQMTK
Sbjct: 141 KLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQNQGISQTQMTK 200

Query: 132 FPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRG 191
            PPVIKERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRG
Sbjct: 201 APPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRG 260

Query: 192 DQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRIGEGVCAEL 251
           DQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECK EQSEVRVASNRRIGEGVCAEL
Sbjct: 261 DQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASNRRIGEGVCAEL 320

Query: 252 RSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSF 311
           RSEVAEIEVTMIQTHVNLKIRC KRQ QLLKVIVALEDLRLTVLHLNITSQTAATMLYSF
Sbjct: 321 RSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSF 380

Query: 312 NLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 352
           NLKIEDECKLESEEQIAATVN+IFSF+NNGRLVNEAK  F+
Sbjct: 381 NLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFR 421

BLAST of IVF0026607 vs. TAIR 10
Match: AT3G61950.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 253.8 bits (647), Expect = 2.2e-67
Identity = 162/366 (44.26%), Positives = 233/366 (63.66%), Query Frame = 0

Query: 1   MERLQGPINPCLYGEYSETGCSE----QEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQS 60
           MER QG INPC +    +    E     E  +  F+E EE      SL+D +PFLQMLQS
Sbjct: 1   MERFQGHINPCFFDRKPDVRSLEVQGFAEAQSFAFKEKEE-----ESLQDTVPFLQMLQS 60

Query: 61  VESQSF---KEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSPINSET----KDQNQP 120
            +  SF   KEPNF +LL LQ L +PWE E    +++    F SP+ SET    +  NQ 
Sbjct: 61  EDPSSFFSIKEPNFLTLLSLQTLKEPWELERYLSLED--SQFHSPVQSETNRFMEGANQA 120

Query: 121 PNSDRV-FSECNQNQGLS-----------QTQMTKFPP--VIKERRKRKRSKPTKNKEEV 180
            +S  + FS+ N     S           + ++    P  + +E+RKR+++KP+KN EE+
Sbjct: 121 VSSQEIPFSQANMTLPSSTSSPLSAHSRRKRKINHLLPQEMTREKRKRRKTKPSKNNEEI 180

Query: 181 ESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESL 240
           E+QR+ HIAVERNRRRQMN+H+N +++L+P SY+QRGDQASI+GGAI++VK LEQ+++SL
Sbjct: 181 ENQRINHIAVERNRRRQMNEHINSLRALLPPSYIQRGDQASIVGGAINYVKVLEQIIQSL 240

Query: 241 EALRKERKGAEGECKDEQSEVRVASNRRIGEG-----VCAELRSEVAEIEVTMIQTHVNL 300
           E+ ++ ++ +  E       V  A N   G          E ++ + +IE T+IQ HV+L
Sbjct: 241 ESQKRTQQQSNSEV------VENALNHLSGISSNDLWTTLEDQTCIPKIEATVIQNHVSL 300

Query: 301 KIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAA 336
           K++C K+QGQLLK I++LE L+LTVLHLNIT+ + +++ YSFNLK+EDEC LES ++I A
Sbjct: 301 KVQCEKKQGQLLKGIISLEKLKLTVLHLNITTSSHSSVSYSFNLKMEDECDLESADEITA 353

BLAST of IVF0026607 vs. TAIR 10
Match: AT2G46810.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 248.8 bits (634), Expect = 7.0e-66
Identity = 160/331 (48.34%), Positives = 214/331 (64.65%), Query Frame = 0

Query: 24  QEFSNLGFEESEEVCLLTSSLED-KIPFLQMLQSVESQ----SFKEPNFQSLLKLQHLNK 83
           ++  +   EE E+     S L+D  IPFLQMLQ  E      SFK+P+F +LL LQ L K
Sbjct: 43  EDHQSFALEEEEQQLSTPSLLQDTTIPFLQMLQQSEDPSPFLSFKDPSFLALLSLQTLEK 102

Query: 84  PWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQNQGLSQTQMTKFPP---- 143
           PWE       E+ E F SPI+SET      P+ + V +E   NQ L    +         
Sbjct: 103 PWELENYLPHEVPE-FHSPIHSETNHYYHNPSLEGV-NEAISNQELPFNPLENARSRRKR 162

Query: 144 --------VIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTS 203
                   + +E+RKR+R+KPTKN EE+ESQRMTHIAVERNRRRQMN HLN ++S+IP+S
Sbjct: 163 KNNNLASLMTREKRKRRRTKPTKNIEEIESQRMTHIAVERNRRRQMNVHLNSLRSIIPSS 222

Query: 204 YVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEG--ECKDEQSEVRVASNRRIG 263
           Y+QRGDQASI+GGAIDFVK LEQ L+SLEA ++ ++  +   +  ++ S   ++SN+   
Sbjct: 223 YIQRGDQASIVGGAIDFVKILEQQLQSLEAQKRSQQSDDNKEQIPEDNSLRNISSNKLRA 282

Query: 264 EGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTA 323
                E +S   +IE T+I++HVNLKI+C ++QGQLL+ I+ LE LR TVLHLNITS T 
Sbjct: 283 SN--KEEQSSKLKIEATVIESHVNLKIQCTRKQGQLLRSIILLEKLRFTVLHLNITSPTN 342

Query: 324 ATMLYSFNLKIEDECKLESEEQIAATVNQIF 336
            ++ YSFNLK+EDEC L S ++I A + QIF
Sbjct: 343 TSVSYSFNLKMEDECNLGSADEITAAIRQIF 369

BLAST of IVF0026607 vs. TAIR 10
Match: AT3G61950.2 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 221.9 bits (564), Expect = 9.1e-58
Identity = 140/310 (45.16%), Positives = 205/310 (66.13%), Query Frame = 0

Query: 53  MLQSVESQSF---KEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSPINSET----KD 112
           MLQS +  SF   KEPNF +LL LQ L +PWE E    +++    F SP+ SET    + 
Sbjct: 1   MLQSEDPSSFFSIKEPNFLTLLSLQTLKEPWELERYLSLED--SQFHSPVQSETNRFMEG 60

Query: 113 QNQPPNSDRV-FSECNQNQGLS-----------QTQMTKFPP--VIKERRKRKRSKPTKN 172
            NQ  +S  + FS+ N     S           + ++    P  + +E+RKR+++KP+KN
Sbjct: 61  ANQAVSSQEIPFSQANMTLPSSTSSPLSAHSRRKRKINHLLPQEMTREKRKRRKTKPSKN 120

Query: 173 KEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQL 232
            EE+E+QR+ HIAVERNRRRQMN+H+N +++L+P SY+QRGDQASI+GGAI++VK LEQ+
Sbjct: 121 NEEIENQRINHIAVERNRRRQMNEHINSLRALLPPSYIQRGDQASIVGGAINYVKVLEQI 180

Query: 233 LESLEALRKERKGAEGECKDEQSEVRVASNRRIGEG-----VCAELRSEVAEIEVTMIQT 292
           ++SLE+ ++ ++ +  E       V  A N   G          E ++ + +IE T+IQ 
Sbjct: 181 IQSLESQKRTQQQSNSEV------VENALNHLSGISSNDLWTTLEDQTCIPKIEATVIQN 240

Query: 293 HVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEE 336
           HV+LK++C K+QGQLLK I++LE L+LTVLHLNIT+ + +++ YSFNLK+EDEC LES +
Sbjct: 241 HVSLKVQCEKKQGQLLKGIISLEKLKLTVLHLNITTSSHSSVSYSFNLKMEDECDLESAD 300

BLAST of IVF0026607 vs. TAIR 10
Match: AT4G01460.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 206.8 bits (525), Expect = 3.0e-53
Identity = 139/312 (44.55%), Positives = 192/312 (61.54%), Query Frame = 0

Query: 42  SSLEDKIPFLQMLQSVESQ-SFKEPN--FQSLLKLQHLNKPWEEGVSKIQELVELFSSPI 101
           +++E+KIPFLQMLQ +E   +  EPN   QSLL++Q L                   S +
Sbjct: 20  TTMEEKIPFLQMLQCIEHPFTTTEPNQFLQSLLQIQTLES----------------KSCL 79

Query: 102 NSETKDQNQPPNSDRVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQR 161
             ET  +  P  +D    +     G            +KE+RKRKR++  KNK+EVE+QR
Sbjct: 80  TLETNIKRDPGQTDDPEKDPRTENG---------AVTVKEKRKRKRTRAPKNKDEVENQR 139

Query: 162 MTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALR 221
           MTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQLL+SLEA  
Sbjct: 140 MTHIAVERNRRRQMNEHLNSLRSLMPPSFLQRGDQASIVGGAIDFIKELEQLLQSLEA-E 199

Query: 222 KERKGAEGECKD---EQSEVRVASNRRIG-------EGVCAEL-RSEVAEIEVTMIQTHV 281
           K + G +   K      S     +N  I         G  A     +  E+E T+IQ HV
Sbjct: 200 KRKDGTDETPKTASCSSSSSLACTNSSISSVSTTSENGFTARFGGGDTTEVEATVIQNHV 259

Query: 282 NLKIRCRKRQGQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQI 340
           +LK+RC++ + Q+LK IV++E+L+L +LHL I+S +   ++YSFNLK+ED CKL S ++I
Sbjct: 260 SLKVRCKRGKRQILKAIVSIEELKLAILHLTISS-SFDFVIYSFNLKMEDGCKLGSADEI 304

BLAST of IVF0026607 vs. TAIR 10
Match: AT3G24140.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 169.9 bits (429), Expect = 4.1e-42
Identity = 108/225 (48.00%), Positives = 151/225 (67.11%), Query Frame = 0

Query: 139 RRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIG 198
           + KRKR++ +K  EEVESQRMTHIAVERNRR+QMN+HL V++SL+P SYVQRGDQASIIG
Sbjct: 177 KSKRKRARTSKTSEEVESQRMTHIAVERNRRKQMNEHLRVLRSLMPGSYVQRGDQASIIG 236

Query: 199 GAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASNRRI--------------- 258
           GAI+FV+ELEQLL+ LE+  ++R+   GE   + +    +S+  I               
Sbjct: 237 GAIEFVRELEQLLQCLES--QKRRRILGETGRDMTTTTTSSSSPITTVANQAQPLIITGN 296

Query: 259 ------GEGV---CAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTV 318
                 G G+    AE +S +A++EV ++     +KI  R+R GQL+K I ALEDL L++
Sbjct: 297 VTELEGGGGLREETAENKSCLADVEVKLLGFDAMIKILSRRRPGQLIKTIAALEDLHLSI 356

Query: 319 LHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMN 340
           LH NIT+    T+LYSFN+KI  E +  +E+ IA+++ QIFSF++
Sbjct: 357 LHTNITTM-EQTVLYSFNVKITSETRFTAED-IASSIQQIFSFIH 397

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q700E4      3.0e-66  44.26         Transcription factor bHLH67 OS=Arabidopsis thaliana OX=3702 GN=BHLH67 PE=2 SV=1
O81037      9.8e-65  48.34         Transcription factor bHLH70 OS=Arabidopsis thaliana OX=3702 GN=BHLH70 PE=2 SV=1
Q9M128      4.3e-52  44.55         Transcription factor bHLH57 OS=Arabidopsis thaliana OX=3702 GN=BHLH57 PE=1 SV=1
Q56YJ8      5.8e-41  48.00         Transcription factor FAMA OS=Arabidopsis thaliana OX=3702 GN=FAMA PE=1 SV=1
Q9C7T4      4.9e-40  47.09         Transcription factor bHLH96 OS=Arabidopsis thaliana OX=3702 GN=BHLH96 PE=1 SV=1

Match Name  E-value   Identity (%)  Description
A0A1S3AVB5  6.8e-202  98.41         transcription factor bHLH67 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103483091 P...
A0A5D3DAW7  3.0e-194  98.36         Transcription factor bHLH67 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A0A0L6W2  6.2e-179  94.89         BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G116640 PE=4 S...
A0A1S3AUE4  5.2e-170  98.15         transcription factor bHLH70 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103483091 P...
A0A6J1CV34  1.3e-147  76.49         LOW QUALITY PROTEIN: transcription factor bHLH67 OS=Momordica charantia OX=3673 ...

Match Name      E-value    Identity (%)  Description
XP_008437748.1  1.10e-257  98.67         PREDICTED: transcription factor bHLH67 isoform X1 [Cucumis melo]
KAA0048794.1    6.70e-248  96.78         transcription factor bHLH67 isoform X1 [Cucumis melo var. makuwa] >TYK20747.1 tr...
XP_004134273.1  1.09e-227  94.89         transcription factor bHLH67 isoform X2 [Cucumis sativus] >KGN56327.1 hypothetica...
XP_008437749.1  6.19e-217  98.46         PREDICTED: transcription factor bHLH70 isoform X2 [Cucumis melo]
XP_031738937.1  1.09e-216  94.72         transcription factor bHLH70 isoform X1 [Cucumis sativus]

Match Name   E-value  Identity (%)  Description
AT3G61950.1  2.2e-67  44.26         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT2G46810.1  7.0e-66  48.34         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G61950.2  9.1e-58  45.16         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G01460.1  3.0e-53  44.55         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G24140.1  4.1e-42  48.00         basic helix-loop-helix (bHLH) DNA-binding superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                  Source       Source Term     Source Description                        Alignment
None       No IPR available                                 COILS        Coil            Coil                                      coord: 200..234
None       No IPR available                                 MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 98..158
None       No IPR available                                 MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 98..132
None       No IPR available                                 PANTHER      PTHR11969:SF57  TRANSCRIPTION FACTOR BHLH70               coord: 21..340
None       No IPR available                                 PANTHER      PTHR11969       MAX DIMERIZATION, MAD                     coord: 21..340
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   SMART        SM00353         finulus                                   coord: 162..213; e-value: 2.6E-9; score: 46.9
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   PFAM         PF00010         HLH                                       coord: 157..208; e-value: 1.1E-9; score: 38.1
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   PROSITE      PS50888         BHLH                                      coord: 156..207; score: 14.881353
IPR036638  Helix-loop-helix DNA-binding domain superfamily  GENE3D       4.10.280.10     -                                         coord: 147..238; e-value: 1.0E-11; score: 46.6
IPR036638  Helix-loop-helix DNA-binding domain superfamily  SUPERFAMILY  47459           HLH, helix-loop-helix DNA-binding domain  coord: 147..218

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
IVF0026607.1  IVF0026607.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0005634 nucleus
molecular_function GO:0000981 DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0000978 RNA polymerase II cis-regulatory region sequence-specific DNA binding