Cla97C11G207060 (gene) Watermelon (97103) v2

Name: Cla97C11G207060
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Transcription factor, putative
Location: Cla97Chr11 : 963321 .. 968040 (-)
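
The Location field can be split into chromosome, coordinates, and strand with a short script. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts."""
    pattern = r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cla97Chr11 : 963321 .. 968040 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 4720 positions on the minus strand of Cla97Chr11.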
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGAGGAGTGCAACGAGAGCTCTGTTGCGACATCTAATTCCAATCCATCCAACTGGTGGGATATTAACCATAATCATCATCATCATCACCATCCTTCACTTTCTTATAATTCCCATTGGCTTCTCCAAAACCCTAATTCCAATTCCTCTTGTGAAGAAGATGTTTCCATCTCAACTTCCTCCTTTACCAACAATAATTCCAATCACTCTCCACTCACCTCCAACCACCTTCTTCCTCATCATCCTTCTGATAATAATCACCTTTGGACCCAAGTTTTGCTGTAAGTTCTCTTCTCTTTGTTTTCATTTTAGTTTATCCTCATTGGGCTCGATCGGGATATGATTCTGAGCGTTGAAGATGTGTCAACCTACTAGTTGAGATATTTGGTGTGTCTACTAATCCCTATTTCTCCCATTATGATTTGTTCCAAAATTTTAGTTTTTTAACATGACAAAGTCTAACGATTTGAAAGTATTACAGATGTATCAACCTAATTGAGATATCTTTATGCACCCCTTGATCCCATCATTTTCTAGTATTTTGTTCAAAAGAAAAAATCTAACGCTCTGAATCTCTTAATTTGGTTTTCATTAAGTTACTAAACCAGCCTTTTTAACTTTGGATAACACAAATCTTAGAGGAAAAAAAACCTTTAATTTGAGATAATTGAGTGGTTTGAGAGTCACCATGCATATATCTAGATAATCTTATATATATATATATATATATATATATATATAAGATTATGCTTATTAATTTTGAAGCTATACAGGAACATAGGAAACGGTGTGGAATTGCAAAGCAATGAAGAAGACATAGAAGCGAATTTCCTTGAGACAATATCATCAAGAAGCATGTCCACCACCGGAATCTTCGAACCCACCGCATGTAGCGATTACCTTAAAAAAATGGACACAACTAACAACAACAACAACAACAACTGGGACGACACTTTCCAAAGCTTCACCAACAACAATAACGGACTTCTTACAACGTCCCACCACACTCACATGCTCCAAAACGATAGGTTTTTGAAGCTTTCCAATCTCGTAAACACGTGGTCCATTGCTCTCCCGAGCTCACAACAAGCACATCTACGGCACTTAATCGACGACCAACACGACCATCTCCCAACCTCCATTGTTCCCACTCATGAACCAGATGGCGCTGTGCCTCACTCGGGCCTCGACCCATGTGCTTCTACCATCCTCAGGAGGTCGCTTCATAATCAAAATTATGGAGATTATATTTCCTTTAATGGACGACTGCCTAAACCGGTGGCCGGTATCAACGGTTCCTCTCATAATCCTTGTTTTAAGTCCTCATTGAATTTGTCTGCTGATACTAAGAAGCAGATTCACCAAATTTCTTCGCCGGTAAGCAAATTTTTTGTTTTATATTATATTAAAAGAAAAATTAAATCAAAAGCAAGTGGTTTCAACAATCTTGCTTTTCTTAATATTTCAAATTAAACTAAGATTTCTATATACACTTTTAATTTTTTACAAATATATTGGTTTGGTCTATTTTCTCTTCCTATTATCATATACTAGTGTGTTTGGCACCAAGGATTAAAAATACTTTGTTCTTCATAGTGCTTTTTAAGAATTTAAAAATCAAAAGTGGAATGAAAAAAAATTATATATTTAAATGAAAACTATTCAATAATGTTTTCAAAGAGGGCATTAAACATGTTCATATATAATAACCCCACTTATCTATCATGGATCAAATAAGAAGTTCATTTCATTTTATTCCCTTTTTTTTTTTTTTTTTTTATTCATATTTAACAAATAAAATACGTAAATTTCAGTGGTAAAATCATAGTAGAACTAGACTTGTAAGTTAATTTAATTCAATTGAATAATATTTGACTTTTCTATTTTTTATTTTATTGCAATGTGAATGGAAGTTTTGAAATTGTTTGAATTTTATTGAATGAAATTGTAGACAAAAATTAGTGGACGAGGAAGTGGAGTTTTGAGCGAAGGGAAGA
AGAAAAGATCGGAAGAATCTTCGGAAACTGCCCCCAAAAAGGCTAAGCAAGATAATTCAACACCTCCATCTAATAAGGTCTATATTTAACTTATAAATTTTATGCATTTATATTTCAACAAATTTTTGACCCATCACTAATTAAAACTCACACACTTATATATAAAATATTATCATACAATATACATATTTTATCGATTCATTTATTTTTAATTCTTCTACGTTTCAGCTTAATTTTAACATTTTATCTAAGAATTCTTACATTAATTGGTATAATTTAGTTTTATTAGTAGTTTTTTTTTTTATCAAGTTACAATAGGCCGATTAAGGATCCATCAAGACATCTCAACTTAGGTTACAATATTTAGTATTCATCATAACCTAGACTAAAAATATTTAGTTTAATTAGTAGTTAGAAATTAATTAGCTATTTCTTACATTTTTTAAGACTATAAATTAGCCACTTTATTGATGGTAATATGTAGCTTTTTGAAATCATCTTTGAAATTTGAACCGTGTTCTAAAGAGCTTTCTCTAAACTTTATGGATAAATTTTTTTTTTGTTGACCATGAATTTGATCTAATACTTATTCATCAGCCCAATCAATAGTATAGTTAACTTTCTTTCACTACAATAAAATTATGGAAAAGTCTTGAAGAAAATTATCAAACGTCTTGACTATTGAAAATATCTAATAAATAATTGGAAGAGTAAATTGAAACTCAAGTCAATGAATGAATTCAAAATTAGGGTTGAAGCTGGTTATTTTATCTCATTCTATTAATTTATTTTTACATAATTTGACTATATTATGATTTTTTAAAAAAAAAAAAAAACGTTTAGAATGCTAATTATTTTTGTTATAATTAATAGATGCAACAACCAAAGGTCAAAATTGGAGACAGGATAACGGCCCTTCAGCAAATTGTGTCGCCATTTGGAAAGGTAAATTTGACTTTTGTTTCCTTTTTCAATTTTCTGTTATTTTCCACTATTAGTTTTTTTTTTTCTTGGATTTTTTTCTTTTATTTCATTTTAGATATGAAATTAAATTTGTATTTATCAAATAGCTTAAGTTTTTTTTTTTTTTTTTTTTTAATTAAATCAACGATTTAACAATAACTTCATTTCAGACTGATACTGCGTCAGTTCTAACCGAAACCATTGGATACATAAAGTTCCTACAAGAGCAAGTCCAGGTTTGCATGCTTCTCTCTCCTTTCCAATTATTTCATTTCTATTGCACTAAAAAAAACTGCATTAAAAAATAAAAATAAAAATAAAAATTCAACCAGCTTTGCTTCAATATATTTGATAATTTCTCCTAGGTATTATTAGAGTTCTAGCTTTAGCTTTTGTTTCAGTGCCACTTTCCAAGTTTTGGTCTTTTGTGCTTTTGTATCTGTTCTATGTCAGTACTTTTGGATATATAATTTTAATCATATGATAGTAACTTTAAATGAAGCAATGATTGAAAACTAATGTGATATATATTGTATATCTCTAATGTTATTGGATACATCAAAATCTTACTATATGTCATAAAAATAAAAAAAGACAAAGAAAAATGAAGAAATAATGCAACAGTTGTCCAATAAAAAGAATTGTTATGTGATAATTTTTATTTTAAAAAATACTTTTTATTATTTTTTCTTAAAAGTAAATGAAAGCATAGATTAAAGTTGCACCTAATCTTTGCTTAAGAATAATCATATTAATCATTGAATATTTTTTAATATTATGTAACATTTTTCAATATTTACAATTTGTTTCGTTTTCCATATTTCAGCTACTGAGTAATCCTTACATGAAGACCAATTCCTATAAGGTATACTGTTACATTCTTTTTCTTTTTCCTTTTCTTTTTTTCCCAATTTTTTAGAATATTTTATTACAAATTAAGAAAGAGAATTCAAACCTAAAATAAAATAAAGAGAACTAAAAACTAAAAGTAGAGATTTAGACCTTTGATCTCGAAAAAATAAAGTGTAAAGTTTTAA
CTAATTAAATTATGCTTTTTAATTAAGTTACTAGAATTATGTAAAAGTTGACATTTATTATATTCTGAGTATAATAGATATAACATAACTCAACTGCTTAAAATTAAATAATATATCATTAATCAAAAGGCCATAAATTACTTACTCTCTCACATTTTACTGAAGCGAAAATAGTTAAAAGAACATTTTTTGACCTTAAAATATTTATGAAAGTAAGAAATTAGTTTTAATAATTTAGTTGTGTAATTTCAATTATAAAAGTATTTGGTCCTTAAATTTCATTACCTACCAATTTAGTAACTATACTTAAAATTTTGAAACAATAATTCTAAAAAAAAAGATTGGTGCCAATTTTACATTATCAATCTATGTAGGTAAAAAAAAGTATAATATTTTAATAACATTTTTTTCATTACAACGATTGAATCATTGCAAACTTTAAATTACATAACTTGACTTGTTGCAGAATTGAAAGTAAAAGAGTTAAATCATTATAAACTAAGGTGGTTAGAATAAAAGTAATGTTGTTGTAAAAAATTTTGGCCCTATCTTCAGGATCCATGGCAAAACTTGGAGAGAAAAGAAGTGAAAGGGGATGGAAAAACTGACCTAAGGAGCAGAGGGCTTTGTTTAGTTCCAATTTCATGTACACCACAAGTCTATAGAGAGAACACAGGATCTGACTATTGGACACCTTACCGAGGTTGTTTCTATAGATAG

mRNA sequence

ATGGCAGAGGAGTGCAACGAGAGCTCTGTTGCGACATCTAATTCCAATCCATCCAACTGGTGGGATATTAACCATAATCATCATCATCATCACCATCCTTCACTTTCTTATAATTCCCATTGGCTTCTCCAAAACCCTAATTCCAATTCCTCTTGTGAAGAAGATGTTTCCATCTCAACTTCCTCCTTTACCAACAATAATTCCAATCACTCTCCACTCACCTCCAACCACCTTCTTCCTCATCATCCTTCTGATAATAATCACCTTTGGACCCAAGTTTTGCTGAACATAGGAAACGGTGTGGAATTGCAAAGCAATGAAGAAGACATAGAAGCGAATTTCCTTGAGACAATATCATCAAGAAGCATGTCCACCACCGGAATCTTCGAACCCACCGCATGTAGCGATTACCTTAAAAAAATGGACACAACTAACAACAACAACAACAACAACTGGGACGACACTTTCCAAAGCTTCACCAACAACAATAACGGACTTCTTACAACGTCCCACCACACTCACATGCTCCAAAACGATAGGTTTTTGAAGCTTTCCAATCTCGTAAACACGTGGTCCATTGCTCTCCCGAGCTCACAACAAGCACATCTACGGCACTTAATCGACGACCAACACGACCATCTCCCAACCTCCATTGTTCCCACTCATGAACCAGATGGCGCTGTGCCTCACTCGGGCCTCGACCCATGTGCTTCTACCATCCTCAGGAGGTCGCTTCATAATCAAAATTATGGAGATTATATTTCCTTTAATGGACGACTGCCTAAACCGGTGGCCGGTATCAACGGTTCCTCTCATAATCCTTGTTTTAAGTCCTCATTGAATTTGTCTGCTGATACTAAGAAGCAGATTCACCAAATTTCTTCGCCGACAAAAATTAGTGGACGAGGAAGTGGAGTTTTGAGCGAAGGGAAGAAGAAAAGATCGGAAGAATCTTCGGAAACTGCCCCCAAAAAGGCTAAGCAAGATAATTCAACACCTCCATCTAATAAGATGCAACAACCAAAGGTCAAAATTGGAGACAGGATAACGGCCCTTCAGCAAATTGTGTCGCCATTTGGAAAGACTGATACTGCGTCAGTTCTAACCGAAACCATTGGATACATAAAGTTCCTACAAGAGCAAGTCCAGCTACTGAGTAATCCTTACATGAAGACCAATTCCTATAAGGATCCATGGCAAAACTTGGAGAGAAAAGAAGTGAAAGGGGATGGAAAAACTGACCTAAGGAGCAGAGGGCTTTGTTTAGTTCCAATTTCATGTACACCACAAGTCTATAGAGAGAACACAGGATCTGACTATTGGACACCTTACCGAGGTTGTTTCTATAGATAG

Coding sequence (CDS)

ATGGCAGAGGAGTGCAACGAGAGCTCTGTTGCGACATCTAATTCCAATCCATCCAACTGGTGGGATATTAACCATAATCATCATCATCATCACCATCCTTCACTTTCTTATAATTCCCATTGGCTTCTCCAAAACCCTAATTCCAATTCCTCTTGTGAAGAAGATGTTTCCATCTCAACTTCCTCCTTTACCAACAATAATTCCAATCACTCTCCACTCACCTCCAACCACCTTCTTCCTCATCATCCTTCTGATAATAATCACCTTTGGACCCAAGTTTTGCTGAACATAGGAAACGGTGTGGAATTGCAAAGCAATGAAGAAGACATAGAAGCGAATTTCCTTGAGACAATATCATCAAGAAGCATGTCCACCACCGGAATCTTCGAACCCACCGCATGTAGCGATTACCTTAAAAAAATGGACACAACTAACAACAACAACAACAACAACTGGGACGACACTTTCCAAAGCTTCACCAACAACAATAACGGACTTCTTACAACGTCCCACCACACTCACATGCTCCAAAACGATAGGTTTTTGAAGCTTTCCAATCTCGTAAACACGTGGTCCATTGCTCTCCCGAGCTCACAACAAGCACATCTACGGCACTTAATCGACGACCAACACGACCATCTCCCAACCTCCATTGTTCCCACTCATGAACCAGATGGCGCTGTGCCTCACTCGGGCCTCGACCCATGTGCTTCTACCATCCTCAGGAGGTCGCTTCATAATCAAAATTATGGAGATTATATTTCCTTTAATGGACGACTGCCTAAACCGGTGGCCGGTATCAACGGTTCCTCTCATAATCCTTGTTTTAAGTCCTCATTGAATTTGTCTGCTGATACTAAGAAGCAGATTCACCAAATTTCTTCGCCGACAAAAATTAGTGGACGAGGAAGTGGAGTTTTGAGCGAAGGGAAGAAGAAAAGATCGGAAGAATCTTCGGAAACTGCCCCCAAAAAGGCTAAGCAAGATAATTCAACACCTCCATCTAATAAGATGCAACAACCAAAGGTCAAAATTGGAGACAGGATAACGGCCCTTCAGCAAATTGTGTCGCCATTTGGAAAGACTGATACTGCGTCAGTTCTAACCGAAACCATTGGATACATAAAGTTCCTACAAGAGCAAGTCCAGCTACTGAGTAATCCTTACATGAAGACCAATTCCTATAAGGATCCATGGCAAAACTTGGAGAGAAAAGAAGTGAAAGGGGATGGAAAAACTGACCTAAGGAGCAGAGGGCTTTGTTTAGTTCCAATTTCATGTACACCACAAGTCTATAGAGAGAACACAGGATCTGACTATTGGACACCTTACCGAGGTTGTTTCTATAGATAG

Protein sequence

MAEECNESSVATSNSNPSNWWDINHNHHHHHHPSLSYNSHWLLQNPNSNSSCEEDVSISTSSFTNNNSNHSPLTSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLETISSRSMSTTGIFEPTACSDYLKKMDTTNNNNNNNWDDTFQSFTNNNNGLLTTSHHTHMLQNDRFLKLSNLVNTWSIALPSSQQAHLRHLIDDQHDHLPTSIVPTHEPDGAVPHSGLDPCASTILRRSLHNQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIHQISSPTKISGRGSGVLSEGKKKRSEESSETAPKKAKQDNSTPPSNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEVKGDGKTDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
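
The protein sequence above corresponds to a translation of the CDS. The following minimal sketch shows how the first codons of the CDS map to the first residues of the protein. It assumes the standard nuclear genetic code; the helper name `translate` is hypothetical and is not part of the database's tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First seven codons of the CDS listed above.
print(translate("ATGGCAGAGGAGTGCAACGAG"))  # MAEECNE
```

The result matches the first seven residues of the protein sequence (MAEECNE). Passing the full CDS string would also translate the terminal TAG stop codon, which appears as a trailing `*`.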
BLAST of Cla97C11G207060 vs. NCBI nr
Match: XP_004140441.1 (PREDICTED: transcription factor bHLH111 [Cucumis sativus] >KGN50812.1 hypothetical protein Csa_5G269890 [Cucumis sativus])

HSP 1 Score: 562.8 bits (1449), Expect = 1.1e-156
Identity = 380/457 (83.15%), Positives = 394/457 (86.21%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MAEEC ESSVATSNS PSNWWDI XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX     
Sbjct: 1   MAEECTESSVATSNSTPSNWWDINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEDVSI 60

Query: 61  XXXXXXXXXXSPLTSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLETISS 120
                         SNHLLPHHPSDNNHLWTQVLLNIGN VEL+SNEE+IE NFLETISS
Sbjct: 61  STSSFTN------ASNHLLPHHPSDNNHLWTQVLLNIGNDVELESNEENIEGNFLETISS 120

Query: 121 R-SMSTTGIFEPTACSDYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQND 180
           R SMSTTGIFE TACSDYLKKM   XXXXXXXXXXXXXXXXXXXXXXXX     HMLQN+
Sbjct: 121 RSSMSTTGIFESTACSDYLKKMDTSXXXXXXXXXXXXXXXXXXXXXXXXLTSHTHMLQNE 180

Query: 181 RFLKLSNLVNTWSIALPSSQQAHLRHL-IDDQHDHLPTSIVPTH---EPDGAVPHSGLDP 240
           RFLKLSNLVN WSIALP +   HLRHL +DDQHDHL  S +PTH   EPDG +PH GLDP
Sbjct: 181 RFLKLSNLVNRWSIALP-NPDPHLRHLTMDDQHDHLRASTMPTHEILEPDGTMPHQGLDP 240

Query: 241 CASTILRRSLHNQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIHQISS 300
           C S+ LRRSL NQNYGDYISFNGRL KPV GINGSS+NPCFKSSLNLSAD+KKQIHQI S
Sbjct: 241 CDSSFLRRSLQNQNYGDYISFNGRLAKPVVGINGSSNNPCFKSSLNLSADSKKQIHQICS 300

Query: 301 PTKISGRGS-GVLSEGK-KKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGDRITALQ 360
           PT+ISGRGS GV +EGK     XXXXXXXXXXXXXXXXXXXXNK+QQPKVKIGDRITALQ
Sbjct: 301 PTRISGRGSGGVSNEGKXXXXXXXXXXXXXXXXXXXXXXXXXNKIQQPKVKIGDRITALQ 360

Query: 361 QIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEVKGDGKT 420
           QIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQ+LERKE KGDGK 
Sbjct: 361 QIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKEGKGDGKM 420

Query: 421 DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 451
           DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Sbjct: 421 DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 450
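
The percentages in each HSP header are the match counts divided by the alignment length. As a small illustration using the figures reported for the hit above (XP_004140441.1), the sketch below recomputes them; the function name `percent` is hypothetical.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}%"

# Counts taken from the HSP header of the first NCBI nr hit.
print("Identity  =", percent(380, 457))  # 83.15%
print("Positives =", percent(394, 457))  # 86.21%
```

The alignment length (457) includes gap columns, so it can exceed the length of either sequence.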

BLAST of Cla97C11G207060 vs. NCBI nr
Match: XP_008460218.1 (PREDICTED: transcription factor bHLH111 [Cucumis melo])

HSP 1 Score: 562.4 bits (1448), Expect = 1.4e-156
Identity = 386/458 (84.28%), Positives = 397/458 (86.68%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MAEEC ESSVATSNS PSNWWDI XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MAEECTESSVATSNSTPSNWWDINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXSPLTSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLETISS 120
           XXX           +NHLLPHHPSDNNHLWTQVLLNIGN VELQSNEEDIE NFLETISS
Sbjct: 61  XXXSF---------TNHLLPHHPSDNNHLWTQVLLNIGNDVELQSNEEDIEGNFLETISS 120

Query: 121 R-SMSTTGIFEPTACSDYLKKM-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQN 180
           R SMSTTGIFEPTACSDYLKKM  XXXXXXXXXXXXXXXXXXXXXXXXXXXX   HMLQN
Sbjct: 121 RSSMSTTGIFEPTACSDYLKKMDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQAHMLQN 180

Query: 181 DRFLKLSNLVNTWSIALPSSQQAHLRHLIDDQHDHLPTSIVPTHE----PDGAVPHSGLD 240
           +RFLKLSNLVN WSIALP S   HLRHL DDQHDHL  + VPTHE     DG VPH GLD
Sbjct: 181 ERFLKLSNLVNRWSIALP-SPDPHLRHLTDDQHDHLRATTVPTHEILESADGVVPHQGLD 240

Query: 241 PCASTILRRSLHNQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIHQIS 300
           PC S+ LRR+L NQNYGDYISFNGRL KP+ GIN SS+NPCFKSSLNLSAD+KKQIHQI 
Sbjct: 241 PCDSSFLRRTLQNQNYGDYISFNGRLAKPMVGINSSSNNPCFKSSLNLSADSKKQIHQIC 300

Query: 301 SPTKISGRGS-GVLSEGK-KKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGDRITAL 360
           SPT+ISGRGS GV +EGK     XXXXXXXXXXXXX       NKMQQPKVKIGDRITAL
Sbjct: 301 SPTRISGRGSGGVSNEGKXXXXXXXXXXXXXXXXXXDNSTPYSNKMQQPKVKIGDRITAL 360

Query: 361 QQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEVKGDGK 420
           QQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQ+LERKE KGDGK
Sbjct: 361 QQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKEGKGDGK 420

Query: 421 TDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 451
            DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Sbjct: 421 IDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 448

BLAST of Cla97C11G207060 vs. NCBI nr
Match: XP_022158889.1 (transcription factor bHLH111 isoform X1 [Momordica charantia])

HSP 1 Score: 431.8 bits (1109), Expect = 2.9e-117
Identity = 309/484 (63.84%), Positives = 337/484 (69.63%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           M +EC ESSVATS S P NWWD     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MGDECTESSVATS-STPPNWWD----SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXSPLT---SNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLET 120
           X         S LT   S  L    PSD N LWTQVLLNIGNGVELQS+E++I  NFLE 
Sbjct: 61  XVSLSTSSFHSALTVESSRRLFDPLPSD-NQLWTQVLLNIGNGVELQSDEQEIGENFLEA 120

Query: 121 ISSRSMSTTGIFEPTACSDYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQ 180
           ISS+++STTGIFE  ACSDYLKKM                 XXXXXXXXXXXXXX H  +
Sbjct: 121 ISSKNISTTGIFESAACSDYLKKM-DTNYNNNWNSFPIFDAXXXXXXXXXXXXXXSHAAE 180

Query: 181 NDRFLKLSNLVNTWSIALPSSQQAHLRHLIDDQHDHLPTSIVPT--HEPDGAVPHSG--- 240
           N+R LKLS+LVNTWSIALP          +DD+ DHL  + VP   H      P +G   
Sbjct: 181 NERLLKLSSLVNTWSIALP----------MDDEPDHLRAATVPCFGHGLKAEAPEAGTVA 240

Query: 241 ---LDPCASTILRRSLHN-----------------------QNYGDYISFNGRLPKPVAG 300
              L+P A+ + RRS  +                       +N+ DYISFNGRL KPV  
Sbjct: 241 PIRLEPGAA-LFRRSFGSGGFQNSIGAKQYDHSMQLDNAGTRNFADYISFNGRLGKPVIN 300

Query: 301 INGSSHNPCFKSSLNLSADTKKQIHQISSPTKISGRGSGVLSEGKKKRSXXXXXXXXXXX 360
           ING  +NPCFK SLNLSADTKKQIHQ SSPT+ISGRGSGV SEGKKKRS           
Sbjct: 301 ING-INNPCFK-SLNLSADTKKQIHQTSSPTRISGRGSGVSSEGKKKRSEECSETATKKT 360

Query: 361 XXXXXXXXXNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLS 420
                    NKMQQPKVK+GDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQ+QLLS
Sbjct: 361 KQENATAASNKMQQPKVKLGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQIQLLS 420

Query: 421 NPYMKTNSYKDPWQNLERKEVKGDGKTDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG 451
           NPYMKTN+YKDPW++ ERK+ KGDGK DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG
Sbjct: 421 NPYMKTNAYKDPWKSTERKDPKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG 464

BLAST of Cla97C11G207060 vs. NCBI nr
Match: XP_022158891.1 (transcription factor bHLH111 isoform X2 [Momordica charantia])

HSP 1 Score: 397.1 bits (1019), Expect = 7.8e-107
Identity = 297/484 (61.36%), Positives = 324/484 (66.94%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           M +EC ESSVATS S P NWWD     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MGDECTESSVATS-STPPNWWD----SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXSPLT---SNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLET 120
           X         S LT   S  L    PSD N LWTQVLLNIGNGVELQS+E++I  NFLE 
Sbjct: 61  XVSLSTSSFHSALTVESSRRLFDPLPSD-NQLWTQVLLNIGNGVELQSDEQEIGENFLEA 120

Query: 121 ISSRSMSTTGIFEPTACSDYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQ 180
           ISS+++STTGIFE  ACSDYLKKM                 XXXXXXXXXXXXXX H  +
Sbjct: 121 ISSKNISTTGIFESAACSDYLKKM-DTNYNNNWNSFPIFDAXXXXXXXXXXXXXXSHAAE 180

Query: 181 NDRFLKLSNLVNTWSIALPSSQQAHLRHLIDDQHDHLPTSIVPT--HEPDGAVPHSG--- 240
           N+R LKLS+LVNTWSIALP          +DD+ DHL  + VP   H      P +G   
Sbjct: 181 NERLLKLSSLVNTWSIALP----------MDDEPDHLRAATVPCFGHGLKAEAPEAGTVA 240

Query: 241 ---LDPCASTILRRSLHN-----------------------QNYGDYISFNGRLPKPVAG 300
              L+P A+ + RRS  +                       +N+ DYISFNGRL KPV  
Sbjct: 241 PIRLEPGAA-LFRRSFGSGGFQNSIGAKQYDHSMQLDNAGTRNFADYISFNGRLGKPVIN 300

Query: 301 INGSSHNPCFKSSLNLSADTKKQIHQISSPTKISGRGSGVLSEGKKKRSXXXXXXXXXXX 360
           ING  +NPCFK SLNLSADTKKQIHQ SSPT+ISGRGSGV SEGKKKRS           
Sbjct: 301 ING-INNPCFK-SLNLSADTKKQIHQTSSPTRISGRGSGVSSEGKKKRSEECSETATKKT 360

Query: 361 XXXXXXXXXNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLS 420
                    NKMQQPKVK+GDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQ+Q   
Sbjct: 361 KQENATAASNKMQQPKVKLGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQIQ--- 420

Query: 421 NPYMKTNSYKDPWQNLERKEVKGDGKTDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG 451
                     DPW++ ERK+ KGDGK DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG
Sbjct: 421 ----------DPWKSTERKDPKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG 451

BLAST of Cla97C11G207060 vs. NCBI nr
Match: XP_023549380.1 (transcription factor bHLH111 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 392.5 bits (1007), Expect = 1.9e-105
Identity = 229/394 (58.12%), Positives = 253/394 (64.21%), Query Frame = 0

Query: 74  TSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLETISSRSMSTTGIFEPTA 133
           +S  LLPHH SD +HLWTQVLLNIGNGV      EDI+ NFL              EP A
Sbjct: 60  SSAQLLPHHASD-HHLWTQVLLNIGNGV------EDIQPNFL--------------EPIA 119

Query: 134 CSDYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQNDRFLKLSNLVNTWSI 193
           CSDYLKKM                                  L+N+R LKLSNLVNTWSI
Sbjct: 120 CSDYLKKMDTNTNWDDTFQTFNNNNGLLT------------TLENERLLKLSNLVNTWSI 179

Query: 194 ALPSSQQAHLRHLIDDQ-HDHLPTSIVPTHEPDGAVPHSGLDPCASTILRRSLH------ 253
           ALP S  AHLRHL+D + H   PT+++   +PD A     LDPCAS   RRSLH      
Sbjct: 180 ALP-SPDAHLRHLMDQEPHPLRPTTLL---DPDAA-----LDPCASAFFRRSLHTPMPAK 239

Query: 254 ---------NQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIHQISSPT 313
                     +NYGDYISFN R  KP+ G+N S        S NLS  TKKQI QISSPT
Sbjct: 240 PFYDNHTATTRNYGDYISFNPRFAKPLLGVNPSI------KSFNLSPQTKKQIQQISSPT 299

Query: 314 KISGRGSGVLSEGKKKRSXXXXXXXXXXXXXXXXXXXXN-KMQQPKVKIGDRITALQQIV 373
           + SGRGSGVL+EGKKKRS                    + K+QQPKVKIGDRIT LQQIV
Sbjct: 300 RGSGRGSGVLNEGKKKRSEESSETVTKKAKQDNSTTPASTKIQQPKVKIGDRITTLQQIV 359

Query: 374 SPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEVKGDGKTDLR 433
           SPFGKTDTASVL ETIGYIKFLQEQVQLL+NPY+KTNSYKD WQ+LERKE KG+GK +LR
Sbjct: 360 SPFGKTDTASVLNETIGYIKFLQEQVQLLTNPYLKTNSYKDAWQSLERKESKGEGKMELR 405

Query: 434 SRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 451
           SRGLCLVPISCTPQVYREN+GSDYWTPYRGCFYR
Sbjct: 420 SRGLCLVPISCTPQVYRENSGSDYWTPYRGCFYR 405

BLAST of Cla97C11G207060 vs. TrEMBL
Match: tr|A0A0A0KQW5|A0A0A0KQW5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G269890 PE=4 SV=1)

HSP 1 Score: 562.8 bits (1449), Expect = 7.1e-157
Identity = 380/457 (83.15%), Positives = 394/457 (86.21%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MAEEC ESSVATSNS PSNWWDI XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX     
Sbjct: 1   MAEECTESSVATSNSTPSNWWDINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEDVSI 60

Query: 61  XXXXXXXXXXSPLTSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLETISS 120
                         SNHLLPHHPSDNNHLWTQVLLNIGN VEL+SNEE+IE NFLETISS
Sbjct: 61  STSSFTN------ASNHLLPHHPSDNNHLWTQVLLNIGNDVELESNEENIEGNFLETISS 120

Query: 121 R-SMSTTGIFEPTACSDYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQND 180
           R SMSTTGIFE TACSDYLKKM   XXXXXXXXXXXXXXXXXXXXXXXX     HMLQN+
Sbjct: 121 RSSMSTTGIFESTACSDYLKKMDTSXXXXXXXXXXXXXXXXXXXXXXXXLTSHTHMLQNE 180

Query: 181 RFLKLSNLVNTWSIALPSSQQAHLRHL-IDDQHDHLPTSIVPTH---EPDGAVPHSGLDP 240
           RFLKLSNLVN WSIALP +   HLRHL +DDQHDHL  S +PTH   EPDG +PH GLDP
Sbjct: 181 RFLKLSNLVNRWSIALP-NPDPHLRHLTMDDQHDHLRASTMPTHEILEPDGTMPHQGLDP 240

Query: 241 CASTILRRSLHNQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIHQISS 300
           C S+ LRRSL NQNYGDYISFNGRL KPV GINGSS+NPCFKSSLNLSAD+KKQIHQI S
Sbjct: 241 CDSSFLRRSLQNQNYGDYISFNGRLAKPVVGINGSSNNPCFKSSLNLSADSKKQIHQICS 300

Query: 301 PTKISGRGS-GVLSEGK-KKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGDRITALQ 360
           PT+ISGRGS GV +EGK     XXXXXXXXXXXXXXXXXXXXNK+QQPKVKIGDRITALQ
Sbjct: 301 PTRISGRGSGGVSNEGKXXXXXXXXXXXXXXXXXXXXXXXXXNKIQQPKVKIGDRITALQ 360

Query: 361 QIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEVKGDGKT 420
           QIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQ+LERKE KGDGK 
Sbjct: 361 QIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKEGKGDGKM 420

Query: 421 DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 451
           DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Sbjct: 421 DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 450

BLAST of Cla97C11G207060 vs. TrEMBL
Match: tr|A0A1S3CD90|A0A1S3CD90_CUCME (transcription factor bHLH111 OS=Cucumis melo OX=3656 GN=LOC103499104 PE=4 SV=1)

HSP 1 Score: 562.4 bits (1448), Expect = 9.3e-157
Identity = 386/458 (84.28%), Positives = 397/458 (86.68%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MAEEC ESSVATSNS PSNWWDI XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MAEECTESSVATSNSTPSNWWDINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXSPLTSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLETISS 120
           XXX           +NHLLPHHPSDNNHLWTQVLLNIGN VELQSNEEDIE NFLETISS
Sbjct: 61  XXXSF---------TNHLLPHHPSDNNHLWTQVLLNIGNDVELQSNEEDIEGNFLETISS 120

Query: 121 R-SMSTTGIFEPTACSDYLKKM-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQN 180
           R SMSTTGIFEPTACSDYLKKM  XXXXXXXXXXXXXXXXXXXXXXXXXXXX   HMLQN
Sbjct: 121 RSSMSTTGIFEPTACSDYLKKMDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQAHMLQN 180

Query: 181 DRFLKLSNLVNTWSIALPSSQQAHLRHLIDDQHDHLPTSIVPTHE----PDGAVPHSGLD 240
           +RFLKLSNLVN WSIALP S   HLRHL DDQHDHL  + VPTHE     DG VPH GLD
Sbjct: 181 ERFLKLSNLVNRWSIALP-SPDPHLRHLTDDQHDHLRATTVPTHEILESADGVVPHQGLD 240

Query: 241 PCASTILRRSLHNQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIHQIS 300
           PC S+ LRR+L NQNYGDYISFNGRL KP+ GIN SS+NPCFKSSLNLSAD+KKQIHQI 
Sbjct: 241 PCDSSFLRRTLQNQNYGDYISFNGRLAKPMVGINSSSNNPCFKSSLNLSADSKKQIHQIC 300

Query: 301 SPTKISGRGS-GVLSEGK-KKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGDRITAL 360
           SPT+ISGRGS GV +EGK     XXXXXXXXXXXXX       NKMQQPKVKIGDRITAL
Sbjct: 301 SPTRISGRGSGGVSNEGKXXXXXXXXXXXXXXXXXXDNSTPYSNKMQQPKVKIGDRITAL 360

Query: 361 QQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEVKGDGK 420
           QQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQ+LERKE KGDGK
Sbjct: 361 QQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKEGKGDGK 420

Query: 421 TDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 451
            DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Sbjct: 421 IDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 448

BLAST of Cla97C11G207060 vs. TrEMBL
Match: tr|A0A2P4LB10|A0A2P4LB10_QUESU (Transcription factor OS=Quercus suber OX=58331 GN=CFP56_28058 PE=4 SV=1)

HSP 1 Score: 291.6 bits (745), Expect = 3.1e-75
Identity = 213/525 (40.57%), Positives = 263/525 (50.10%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDI--------XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MAEEC ESSVATS+S P+NWWDI                                     
Sbjct: 1   MAEECIESSVATSSSTPTNWWDIPGASSLPAWNNNPNSWHHQNPNSNSSCEEDVSISTSF 60

Query: 61  XXXXXXXXXXXXXXXXXXSPLTSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEA 120
                              P +SN L+  H SDNN LW+ VLL++G+   LQ+N +D+E 
Sbjct: 61  TNASNHSGLTVESSRRLAEPASSNDLIGEHASDNN-LWSHVLLSVGSNGGLQNN-QDVEE 120

Query: 121 NFLETISSRSMSTTGIFEPTACSDYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           NF++ +SS+SMS TG+F+P AC DYLKK+                               
Sbjct: 121 NFVDALSSKSMS-TGMFQP-AC-DYLKKL--------DNSWEFTNSTSFNNFEKHINGFN 180

Query: 181 XHMLQNDRFLKLSNLVNTWSIALPS--------------SQQAHLRHLIDDQHDHLPTSI 240
             +L N+R  KLSNLV+TWSIA P               S  + + H       H+  + 
Sbjct: 181 DSVLDNERLNKLSNLVSTWSIAPPDPDVNRQFDPQTCNISLSSSMDHYSQPDLCHMKHAF 240

Query: 241 VPTHEPD-GAVPHSGLDPCAS----------------TILRRSLHN-------------- 300
             +   D G   +  L PC S                 +LRRS +N              
Sbjct: 241 SDSTSCDLGTTRNCSLFPCYSHDMKVENERRETEAPVALLRRSFNNNGVGYQIGINSSLM 300

Query: 301 --------------------QNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTK 360
                               +++ D ISF+ R+ KP+  I+  + N CFK SLN S D K
Sbjct: 301 GDNPRSYFYGMPNSSSCRSIKSFADVISFSNRMGKPLMDIH--APNNCFK-SLNTS-DYK 360

Query: 361 KQIHQISSPTKISGRGSGVLSEGKKKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGD 420
           KQ  Q +S T+ +G+G  + +EGK+KRS                     K+Q PKVKIGD
Sbjct: 361 KQGLQ-TSMTRNNGKGQAIANEGKRKRSEDGSESALKKPKQESPTASSVKVQAPKVKIGD 420

Query: 421 RITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEV 451
           RITALQQIVSPFGKTDTASVL E I YIKFLQEQVQLL+NPYMKTNS+KDPW  L+RK+ 
Sbjct: 421 RITALQQIVSPFGKTDTASVLFEAIQYIKFLQEQVQLLTNPYMKTNSHKDPWGGLDRKD- 480

BLAST of Cla97C11G207060 vs. TrEMBL
Match: tr|A0A2P4LAY5|A0A2P4LAY5_QUESU (Transcription factor OS=Quercus suber OX=58331 GN=CFP56_28058 PE=4 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 2.2e-73
Identity = 211/525 (40.19%), Positives = 263/525 (50.10%), Query Frame = 0

Query: 1   MAEECNESSVATSNSNPSNWWDI--------XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MAEEC ESSVATS+S P+NWWDI                                     
Sbjct: 1   MAEECIESSVATSSSTPTNWWDIPGASSLPAWNNNPNSWHHQNPNSNSSCEEDVSISTSF 60

Query: 61  XXXXXXXXXXXXXXXXXXSPLTSNHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEA 120
                              P +SN L+  H SDNN LW+ VLL++G+   LQ+N +D+E 
Sbjct: 61  TNASNHSGLTVESSRRLAEPASSNDLIGEHASDNN-LWSHVLLSVGSNGGLQNN-QDVEE 120

Query: 121 NFLETISSRSMSTTGIFEPTACSDYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           NF++ +SS+SMS TG+F+P AC DYLKK+                               
Sbjct: 121 NFVDALSSKSMS-TGMFQP-AC-DYLKKL--------DNSWEFTNSTSFNNFEKHINGFN 180

Query: 181 XHMLQNDRFLKLSNLVNTWSIALPS--------------SQQAHLRHLIDDQHDHLPTSI 240
             +L N+R  KLSNLV+TWSIA P               S  + + H       H+  + 
Sbjct: 181 DSVLDNERLNKLSNLVSTWSIAPPDPDVNRQFDPQTCNISLSSSMDHYSQPDLCHMKHAF 240

Query: 241 VPTHEPD-GAVPHSGLDPCAS----------------TILRRSLHN-------------- 300
             +   D G   +  L PC S                 +LRRS +N              
Sbjct: 241 SDSTSCDLGTTRNCSLFPCYSHDMKVENERRETEAPVALLRRSFNNNGVGYQIGINSSLM 300

Query: 301 --------------------QNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTK 360
                               +++ D ISF+ R+ KP+  I+  + N CFK SLN S D K
Sbjct: 301 GDNPRSYFYGMPNSSSCRSIKSFADVISFSNRMGKPLMDIH--APNNCFK-SLNTS-DYK 360

Query: 361 KQIHQISSPTKISGRGSGVLSEGKKKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGD 420
           KQ  Q +S T+ +G+G  + +EGK+KRS                    + ++ PKVKIGD
Sbjct: 361 KQGLQ-TSMTRNNGKGQAIANEGKRKRS--EDGSESALKKPKQESPTASSVKAPKVKIGD 420

Query: 421 RITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEV 451
           RITALQQIVSPFGKTDTASVL E I YIKFLQEQVQLL+NPYMKTNS+KDPW  L+RK+ 
Sbjct: 421 RITALQQIVSPFGKTDTASVLFEAIQYIKFLQEQVQLLTNPYMKTNSHKDPWGGLDRKD- 480

BLAST of Cla97C11G207060 vs. TrEMBL
Match: tr|B9S7A5|B9S7A5_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0775400 PE=4 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 3.0e-70
Identity = 182/386 (47.15%), Positives = 222/386 (57.51%), Query Frame = 0

Query: 76  NHLLPHHPSDNNHLWTQVLLNIGNGVELQSNEEDIEANFLETISSRSMSTTGIFEPTACS 135
           N L+  H SD + LW+ +LL +G+  ELQ+N +D+  N L+ +SSRS++++GIFEP AC 
Sbjct: 84  NELIGEHASD-SQLWSHILLGVGSNGELQNN-QDVGENLLDALSSRSINSSGIFEP-AC- 143

Query: 136 DYLKKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHMLQND-RFLKLSNLVNTWSIA 195
           DYLKK+                                 ++++D R  KLSNLV      
Sbjct: 144 DYLKKI-----DHNWEFTSSFNNFEKHINGFSTDHHHQSLIESDQRVTKLSNLVEN---- 203

Query: 196 LPSSQQAHLRHLIDDQHDHLPTSIVPTHEPDGAVPHSGLDPCASTILRRSLH-------- 255
                Q H RH+      ++       +   G   H GL+   S +   S +        
Sbjct: 204 -DHEHQNHHRHVEAPAAGYVLRRSTFNNNGAGVGYHIGLNN-GSVMADNSKYYYGTTENT 263

Query: 256 -NQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIHQISSPTKISGRGSG 315
             + + D ++FNGRL KP+  I G  H PCFK SLNLS D +KQ  Q SS T    RG G
Sbjct: 264 SARTFNDGLTFNGRLNKPLIDIQG--HKPCFK-SLNLS-DCRKQGLQASSQTV---RGQG 323

Query: 316 VLSEGKKKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGDRITALQQIVSPFGKTDTA 375
             SEGKKKR                      K Q PKVK+GDRITALQQIVSPFGKTDTA
Sbjct: 324 NSSEGKKKRYEDTSETIPKKPKHESSTASSVKTQAPKVKLGDRITALQQIVSPFGKTDTA 383

Query: 376 SVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKEVKGDGKTDLRSRGLCLVPI 435
           SVL E I YIKFLQEQVQLLSNPYMK+NS+KDPW  L++K  +GD K DLRSRGLCLVPI
Sbjct: 384 SVLLEAIQYIKFLQEQVQLLSNPYMKSNSHKDPWGGLDKK-AQGDAKVDLRSRGLCLVPI 443

Query: 436 SCTPQVYRENTGSDYWTP-YRGCFYR 451
           SCTPQ+Y ENTGSDYWTP YRGC YR
Sbjct: 444 SCTPQIYHENTGSDYWTPTYRGCLYR 446

BLAST of Cla97C11G207060 vs. Swiss-Prot
Match: sp|Q9FYJ6|BH111_ARATH (Transcription factor bHLH111 OS=Arabidopsis thaliana OX=3702 GN=BHLH111 PE=2 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 8.7e-41
Identity = 118/282 (41.84%), Positives = 152/282 (53.90%), Query Frame = 0

Query: 178 NDRFLKLSNLVNT-WSIALPSS--QQAHLRHLIDDQHDHLPTSIVPTHEPDGAVPHSGLD 237
           + R  KL++LV   WSIA P++     +L H  D  HDH     +  +     V +   D
Sbjct: 47  DQRLSKLTDLVGKHWSIAPPNNPDMNHNLHHHFD--HDHSQNDDISMYRQALEVKNEE-D 106

Query: 238 PC---ASTILRRSLHNQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIH 297
            C    S+      H+         + RL +P+  IN  S  PCFK +LN+S   KK+ H
Sbjct: 107 LCYNNGSSGGGSLFHDPIESSRSFLDIRLSRPLTDIN-PSFKPCFK-ALNVSEFNKKE-H 166

Query: 298 QISSPTKISGRGSGVLSEGKKKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGDRITA 357
           Q +S   ++    G  + GKKKR                      + + PK K+ D+IT 
Sbjct: 167 QTAS---LAAVRLGTTNAGKKKRCEEISDEVSKKAKCSEGSTLSPEKELPKAKLRDKITT 226

Query: 358 LQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKE--VKG 417
           LQQIVSPFGKTDTASVL E I YI F QEQV+LLS PYMK +S KDPW   +R++   +G
Sbjct: 227 LQQIVSPFGKTDTASVLQEAITYINFYQEQVKLLSTPYMKNSSMKDPWGGWDREDHNKRG 286

Query: 418 DGKTDLRSRGLCLVPISCTPQVYRENTGSDYWTP-YRGCFYR 451
               DLRSRGLCLVPIS TP  YR+N+ +DYW P YRG  YR
Sbjct: 287 PKHLDLRSRGLCLVPISYTPIAYRDNSATDYWNPTYRGSLYR 319

BLAST of Cla97C11G207060 vs. Swiss-Prot
Match: sp|Q8GXT3|BH123_ARATH (Transcription factor bHLH123 OS=Arabidopsis thaliana OX=3702 GN=BHLH123 PE=1 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 3.3e-24
Identity = 61/109 (55.96%), Positives = 75/109 (68.81%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQN 401
           K K+GDRI ALQQ+VSPFGKTD ASVL+E I YIKFL +QV  LSNPYMK+ +     Q+
Sbjct: 347 KEKMGDRIAALQQLVSPFGKTDAASVLSEAIEYIKFLHQQVSALSNPYMKSGASLQHQQS 406

Query: 402 LERKEVKGDGKTDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 451
               E++   + DLRSRGLCLVP+S T  V  + T  D+WTP  G  +R
Sbjct: 407 DHSTELEVSEEPDLRSRGLCLVPVSSTFPVTHDTT-VDFWTPTFGGTFR 454

BLAST of Cla97C11G207060 vs. Swiss-Prot
Match: sp|Q8S3D1|BH068_ARATH (Transcription factor bHLH68 OS=Arabidopsis thaliana OX=3702 GN=BHLH68 PE=1 SV=2)

HSP 1 Score: 108.6 bits (270), Expect = 1.8e-22
Identity = 63/138 (45.65%), Positives = 78/138 (56.52%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSY------ 401
           K K+G RI AL Q+VSPFGKTDTASVL+E IGYI+FLQ Q++ LS+PY  T +       
Sbjct: 266 KEKLGGRIAALHQLVSPFGKTDTASVLSEAIGYIRFLQSQIEALSHPYFGTTASGNMRHQ 325

Query: 402 ------------KDPWQ---------------NLERKEVKGDGKTDLRSRGLCLVPISCT 447
                       +DP Q               + + +    + K DLRSRGLCLVPISCT
Sbjct: 326 QHLQGDRSCIFPEDPGQLVNDQCMKRRGASSSSTDNQNASEEPKKDLRSRGLCLVPISCT 385

BLAST of Cla97C11G207060 vs. Swiss-Prot
Match: sp|Q9LT67|BH113_ARATH (Transcription factor bHLH113 OS=Arabidopsis thaliana OX=3702 GN=BHLH113 PE=2 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 3.4e-21
Identity = 56/107 (52.34%), Positives = 73/107 (68.22%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQN 401
           K ++G+RI ALQQ+VSP+GKTD ASVL E +GYIKFLQ+Q+Q+L +PY+  +S       
Sbjct: 157 KERLGERIAALQQLVSPYGKTDAASVLHEAMGYIKFLQDQIQVLCSPYLINHS------- 216

Query: 402 LERKEVKGD-----GKTDLRSRGLCLVPISCTPQVYRENTGSDYWTP 444
           L+   V GD        DLRSRGLCLVP+S T  V   N G+D+W+P
Sbjct: 217 LDGGVVTGDVMAAMKAKDLRSRGLCLVPVSSTVHVENSN-GADFWSP 255

BLAST of Cla97C11G207060 vs. Swiss-Prot
Match: sp|Q9SFZ3|BH110_ARATH (Transcription factor bHLH110 OS=Arabidopsis thaliana OX=3702 GN=BHLH110 PE=2 SV=2)

HSP 1 Score: 101.7 bits (252), Expect = 2.2e-20
Identity = 55/93 (59.14%), Positives = 66/93 (70.97%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYK-DPWQ 401
           K K+GDRI ALQQ+VSPFGKTDTASVL E IGYIKFLQ Q++ LS PYM+ +  +     
Sbjct: 335 KEKLGDRIAALQQLVSPFGKTDTASVLMEAIGYIKFLQSQIETLSVPYMRASRNRPGKAS 394

Query: 402 NLERKEVKGDGK--TDLRSRGLCLVPISCTPQV 432
            L  +  +GD +   DLRSRGLCLVP+SC   V
Sbjct: 395 QLVSQSQEGDEEETRDLRSRGLCLVPLSCMTYV 427

BLAST of Cla97C11G207060 vs. TAIR10
Match: AT1G31050.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 169.5 bits (428), Expect = 4.8e-42
Identity = 118/282 (41.84%), Positives = 152/282 (53.90%), Query Frame = 0

Query: 178 NDRFLKLSNLVNT-WSIALPSS--QQAHLRHLIDDQHDHLPTSIVPTHEPDGAVPHSGLD 237
           + R  KL++LV   WSIA P++     +L H  D  HDH     +  +     V +   D
Sbjct: 162 DQRLSKLTDLVGKHWSIAPPNNPDMNHNLHHHFD--HDHSQNDDISMYRQALEVKNEE-D 221

Query: 238 PC---ASTILRRSLHNQNYGDYISFNGRLPKPVAGINGSSHNPCFKSSLNLSADTKKQIH 297
            C    S+      H+         + RL +P+  IN  S  PCFK +LN+S   KK+ H
Sbjct: 222 LCYNNGSSGGGSLFHDPIESSRSFLDIRLSRPLTDIN-PSFKPCFK-ALNVSEFNKKE-H 281

Query: 298 QISSPTKISGRGSGVLSEGKKKRSXXXXXXXXXXXXXXXXXXXXNKMQQPKVKIGDRITA 357
           Q +S   ++    G  + GKKKR                      + + PK K+ D+IT 
Sbjct: 282 QTAS---LAAVRLGTTNAGKKKRCEEISDEVSKKAKCSEGSTLSPEKELPKAKLRDKITT 341

Query: 358 LQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQNLERKE--VKG 417
           LQQIVSPFGKTDTASVL E I YI F QEQV+LLS PYMK +S KDPW   +R++   +G
Sbjct: 342 LQQIVSPFGKTDTASVLQEAITYINFYQEQVKLLSTPYMKNSSMKDPWGGWDREDHNKRG 401

Query: 418 DGKTDLRSRGLCLVPISCTPQVYRENTGSDYWTP-YRGCFYR 451
               DLRSRGLCLVPIS TP  YR+N+ +DYW P YRG  YR
Sbjct: 402 PKHLDLRSRGLCLVPISYTPIAYRDNSATDYWNPTYRGSLYR 434

BLAST of Cla97C11G207060 vs. TAIR10
Match: AT3G20640.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 114.4 bits (285), Expect = 1.8e-25
Identity = 61/109 (55.96%), Positives = 75/109 (68.81%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQN 401
           K K+GDRI ALQQ+VSPFGKTD ASVL+E I YIKFL +QV  LSNPYMK+ +     Q+
Sbjct: 347 KEKMGDRIAALQQLVSPFGKTDAASVLSEAIEYIKFLHQQVSALSNPYMKSGASLQHQQS 406

Query: 402 LERKEVKGDGKTDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR 451
               E++   + DLRSRGLCLVP+S T  V  + T  D+WTP  G  +R
Sbjct: 407 DHSTELEVSEEPDLRSRGLCLVPVSSTFPVTHDTT-VDFWTPTFGGTFR 454

BLAST of Cla97C11G207060 vs. TAIR10
Match: AT4G29100.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 108.6 bits (270), Expect = 1.0e-23
Identity = 63/138 (45.65%), Positives = 78/138 (56.52%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSY------ 401
           K K+G RI AL Q+VSPFGKTDTASVL+E IGYI+FLQ Q++ LS+PY  T +       
Sbjct: 266 KEKLGGRIAALHQLVSPFGKTDTASVLSEAIGYIRFLQSQIEALSHPYFGTTASGNMRHQ 325

Query: 402 ------------KDPWQ---------------NLERKEVKGDGKTDLRSRGLCLVPISCT 447
                       +DP Q               + + +    + K DLRSRGLCLVPISCT
Sbjct: 326 QHLQGDRSCIFPEDPGQLVNDQCMKRRGASSSSTDNQNASEEPKKDLRSRGLCLVPISCT 385

BLAST of Cla97C11G207060 vs. TAIR10
Match: AT3G19500.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 104.4 bits (259), Expect = 1.9e-22
Identity = 56/107 (52.34%), Positives = 73/107 (68.22%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQN 401
           K ++G+RI ALQQ+VSP+GKTD ASVL E +GYIKFLQ+Q+Q+L +PY+  +S       
Sbjct: 157 KERLGERIAALQQLVSPYGKTDAASVLHEAMGYIKFLQDQIQVLCSPYLINHS------- 216

Query: 402 LERKEVKGD-----GKTDLRSRGLCLVPISCTPQVYRENTGSDYWTP 444
           L+   V GD        DLRSRGLCLVP+S T  V   N G+D+W+P
Sbjct: 217 LDGGVVTGDVMAAMKAKDLRSRGLCLVPVSSTVHVENSN-GADFWSP 255

BLAST of Cla97C11G207060 vs. TAIR10
Match: AT1G27660.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 101.7 bits (252), Expect = 1.2e-21
Identity = 55/93 (59.14%), Positives = 66/93 (70.97%), Query Frame = 0

Query: 342 KVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYK-DPWQ 401
           K K+GDRI ALQQ+VSPFGKTDTASVL E IGYIKFLQ Q++ LS PYM+ +  +     
Sbjct: 335 KEKLGDRIAALQQLVSPFGKTDTASVLMEAIGYIKFLQSQIETLSVPYMRASRNRPGKAS 394

Query: 402 NLERKEVKGDGK--TDLRSRGLCLVPISCTPQV 432
            L  +  +GD +   DLRSRGLCLVP+SC   V
Sbjct: 395 QLVSQSQEGDEEETRDLRSRGLCLVPLSCMTYV 427
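
The percentages in each HSP header are the count of identical (or positively scoring) alignment columns divided by the alignment length. A minimal Python sketch, assuming the header layout used in this report, recomputes them from the raw counts. `parse_hsp_stats` is a hypothetical helper written for this illustration; it is not part of BLAST or of the database's tooling. The pattern accepts both spellings of the "Positives" label.

```python
import re

# Hypothetical helper for illustration only. It parses the line
# "Identity = a/b (p%), Positives = c/d (q%), ..." of an HSP header.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return the raw counts plus recomputed and reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as printed in the header.
        "reported": (float(ident_pct), float(pos_pct)),
    }

# Statistics line of the AT1G31050.1 hit from this report.
stats = parse_hsp_stats(
    "Identity = 118/282 (41.84%), Positives = 152/282 (53.90%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the `reported` pair is a quick consistency check when the report has been copied or re-extracted.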

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004140441.1  1.1e-156  83.15         PREDICTED: transcription factor bHLH111 [Cucumis sativus] >KGN50812.1 hypothetic...
XP_008460218.1  1.4e-156  84.28         PREDICTED: transcription factor bHLH111 [Cucumis melo]
XP_022158889.1  2.9e-117  63.84         transcription factor bHLH111 isoform X1 [Momordica charantia]
XP_022158891.1  7.8e-107  61.36         transcription factor bHLH111 isoform X2 [Momordica charantia]
XP_023549380.1  1.9e-105  58.12         transcription factor bHLH111 [Cucurbita pepo subsp. pepo]

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0KQW5|A0A0A0KQW5_CUCSA  7.1e-157  83.15         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G269890 PE=4 SV=1
tr|A0A1S3CD90|A0A1S3CD90_CUCME  9.3e-157  84.28         transcription factor bHLH111 OS=Cucumis melo OX=3656 GN=LOC103499104 PE=4 SV=1
tr|A0A2P4LB10|A0A2P4LB10_QUESU  3.1e-75   40.57         Transcription factor OS=Quercus suber OX=58331 GN=CFP56_28058 PE=4 SV=1
tr|A0A2P4LAY5|A0A2P4LAY5_QUESU  2.2e-73   40.19         Transcription factor OS=Quercus suber OX=58331 GN=CFP56_28058 PE=4 SV=1
tr|B9S7A5|B9S7A5_RICCO          3.0e-70   47.15         Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0775400 PE=4 SV=1

Match Name             E-value   Identity (%)  Description
sp|Q9FYJ6|BH111_ARATH  8.7e-41   41.84         Transcription factor bHLH111 OS=Arabidopsis thaliana OX=3702 GN=BHLH111 PE=2 SV=...
sp|Q8GXT3|BH123_ARATH  3.3e-24   55.96         Transcription factor bHLH123 OS=Arabidopsis thaliana OX=3702 GN=BHLH123 PE=1 SV=...
sp|Q8S3D1|BH068_ARATH  1.8e-22   45.65         Transcription factor bHLH68 OS=Arabidopsis thaliana OX=3702 GN=BHLH68 PE=1 SV=2
sp|Q9LT67|BH113_ARATH  3.4e-21   52.34         Transcription factor bHLH113 OS=Arabidopsis thaliana OX=3702 GN=BHLH113 PE=2 SV=...
sp|Q9SFZ3|BH110_ARATH  2.2e-20   59.14         Transcription factor bHLH110 OS=Arabidopsis thaliana OX=3702 GN=BHLH110 PE=2 SV=...

Match Name   E-value   Identity (%)  Description
AT1G31050.1  4.8e-42   41.84         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G20640.1  1.8e-25   55.96         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G29100.1  1.0e-23   45.65         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G19500.1  1.9e-22   52.34         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G27660.1  1.2e-21   59.14         basic helix-loop-helix (bHLH) DNA-binding superfamily protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity
Vocabulary: INTERPRO
Term        Definition
IPR011598   bHLH_dom
IPR036638   HLH_DNA-bd_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046983 protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C11G207060.1  Cla97C11G207060.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                  Source       Source Term        Source Description                        Alignment
IPR036638  Helix-loop-helix DNA-binding domain superfamily  GENE3D       G3DSA:4.10.280.10                                            coord: 343..397; e-value: 1.7E-5; score: 26.5
IPR036638  Helix-loop-helix DNA-binding domain superfamily  SUPERFAMILY  SSF47459           HLH, helix-loop-helix DNA-binding domain  coord: 344..395
None       No IPR available                                 MOBIDB_LITE  mobidb-lite        disorder_prediction                       coord: 325..339
None       No IPR available                                 MOBIDB_LITE  mobidb-lite        disorder_prediction                       coord: 287..302
None       No IPR available                                 MOBIDB_LITE  mobidb-lite        disorder_prediction                       coord: 307..324
None       No IPR available                                 MOBIDB_LITE  mobidb-lite        disorder_prediction                       coord: 287..342
None       No IPR available                                 PANTHER      PTHR16223          FAMILY NOT NAMED                          coord: 1..199
None       No IPR available                                 PANTHER      PTHR16223:SF32     SUBFAMILY NOT NAMED                       coord: 246..444
None       No IPR available                                 PANTHER      PTHR16223          FAMILY NOT NAMED                          coord: 246..444
None       No IPR available                                 PANTHER      PTHR16223:SF32     SUBFAMILY NOT NAMED                       coord: 1..199
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   PROSITE      PS50888            BHLH                                      coord: 329..378; score: 10.207
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   CDD          cd00083            HLH                                       coord: 344..383; e-value: 8.71417E-5; score: 39.5047
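
The four bHLH/HLH predictions in the InterPro annotations above (GENE3D, SUPERFAMILY, PROSITE, CDD) report slightly different boundaries. As a rough illustration, the Python sketch below computes the residue range shared by all four. The coordinates are copied from the annotations and are assumed here to be 1-based and inclusive; the dictionary labels and the function `common_interval` are hypothetical names chosen for this example.

```python
# Coordinates copied from the InterPro annotations above
# (assumed to be protein residues, 1-based, inclusive).
bhlh_hits = {
    "GENE3D G3DSA:4.10.280.10": (343, 397),
    "SUPERFAMILY SSF47459": (344, 395),
    "PROSITE PS50888": (329, 378),
    "CDD cd00083": (344, 383),
}

def common_interval(intervals):
    """Return the (start, end) range covered by every interval, or None."""
    start = max(s for s, _ in intervals)  # latest start among all hits
    end = min(e for _, e in intervals)    # earliest end among all hits
    return (start, end) if start <= end else None

core = common_interval(list(bhlh_hits.values()))
print(core)
```

The shared range is only a summary of where the predictions agree; it is not itself an annotation produced by InterProScan.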