CSPI06G00470 (gene) Wild cucumber (PI 183967)

Name: CSPI06G00470
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Myc anthocyanin regulatory protein
Location: Chr6 : 350900 .. 355086 (+)
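
The location above gives the span of the feature on Chr6. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the feature length can be derived from the location string. The function name `parse_location` is hypothetical and is not part of any database API.

```python
import re

# Matches strings such as "Chr6 : 350900 .. 355086 (+)".
LOCATION_RE = re.compile(r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Split a location string into chromosome, start, end, strand and length.

    Assumption: start and end are 1-based and inclusive, so
    length = end - start + 1.
    """
    m = LOCATION_RE.match(text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Chr6 : 350900 .. 355086 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive assumption the span works out to 355086 - 350900 + 1 = 4187 bp; with a half-open convention it would be one base shorter.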
The following sequences are available for this feature:

Gene sequence (with intron)

CTTTTCTTCGTCTCTTTGTTTTTCTGGAAACAAAAACATGGCTAATGGACTTGAAAACTGTGACAGCGAACCTGGGTTTCTCAGAAAGCAGCTTGCTGTTGCTGTCAAGAGCATCCAGTGGAGCTATGCGATTTTCTGGTCACCGTCGTCTAGGCAACATGGGTATGGATTTTTCTTTTTTGATCAGGTCTTGGTGCCATTGTTCTTCATTCTTGGGTTCTATATCTATCTGGGTTTCTTTTTTTCTTGTGTAACTTTTGTTTTTGTTGAGATGGGGATTTGTATTTGCAGCTAAATTCATCAGATGGGTGTTGAACTTCCAATTTGGTACTGTGATTTTGATTGCTTAAATTTGTGGGGTTTGTCTTTGGCAACAATCTAAGAAAATGAAGCTGGTTGTTGATCGTAAACGAGGCTCACTATGATCTTATTTAAGTCCTCTCTTTCTTTTAACCTCAACTTTCCTGAACTTCCAGTTCATAGCTTTACTGAGCATCTGAATACTTTTGAGGAGCTCAAAATCCGAGTTTTATCTAAAATAACATGTACATTCCCAGCTACCCTGGCAATTGAGCTTGGTTGTTTAAAGGAATCCCCCAAAATCTGGATTTCCAGCTCCTTTTTCTTTTTTTGAAAACTTATAGCGGAGAACTTAAAATCGGCCCAGATGGCGAAGCCTCTGGGAGGTTAGTGGGGTTAGAGGCGAGCCGTGATCATAATGCAAGGGTTGGGACCCTTGATACCTTAATTACGGAAAGAGAAGTGTTGAAATATCTCTCTTCGACCTAGGAAGGGGTGAGGGATGATAATGTAATTGCTTGCAACTTCACAGTTTGTCTCTCTGTGCTGCTTGCTCTCTGTTTCGGTTGTTTTGCTCGGTGATGATAATGTAGTTGTTGTACTGTAAAGAAAAGATTTGGTTTTGTACTTTAGGGTGCTGGAATGGTGTGATGGCTACTACAATGGAGACATCAAGACGAGGAAAACAGTTCAAGCTGAGGATGTCCACGTCGATAATATGGGCTTACACAGAAGTGAACAGTTGAGAGAGCTCTACAGATCTCTCTTAGAAGGTGAAAGTGAACAACGAACAAAGAAGCCTCCTGCCTCTTTGTCTCCTGAAGATCTATCTGATGCAGAATGGTATTACTTGGTTTGCATGTCCTTTTTCTTCAATCAAGGCCAAGGGTATTCCCTCTGATCCATTCTAATATTTAACCCTTCACTTTTGCTGTTTAACTTGTGAGATTATGAGCACTAACCATCCATCATGCTTAGGTTGCCTGGAAGAGCGTTAGCTGATGATCGAACTATCTGGTTATGCAATGCTCAATATGCAGAGAGTACTGTATTCTCCCGCTCCTTGCTTGCAAAGGTTGGCCTTCTATTGGTCGAAAAAGATCAATCATTTGATTTTGAATAGTTTGATGCCTATTTCAAGTTCCTTAAGGTCTTATCAGTGAGAAAATTTGACTTTTTCTTCCTTTTCTTTTTCATTACTTTGGACGTCGTTGATCAACTACATACAGAGTGCATCAATTCAGGTATTATTATCCTTGTAAAACATCATCTAGTGCTAATTATGATGATAAAGAAAACCAAAAGTCTTTTAACGGTTCACCTGAAAAGAAACCTCTCGTGATAAAACCAAAAAATGTTGTTTACCTCATACTGATGAAATAATTCTTTTTGGTAGATGTGATTGTAAAGTGTACCAATAGTGTTTCTCTGGCCTTGTCTAACAAGTCAAACAAATGCCCTTCGTTTAAGAACCTTGTTTTCTTCTTCACAATATGATGAAGAAGAAAATGGCAGTATTTAAGATACTATTATGAGAAACTGAGTTCAGTTATTGGCTACATGAATCATATACAGTACTTAGAATGGACTGCTTTTACAATTCTAATTTCAAGGCTGAAAGACCTGATCTTTAAATTGTTTTTGTATTGTAGACTGTGGTTTGCTTTCCTTACCTTGGCGGTGTTATAGAACTAGGTGTA
ACTGAGCAGGTAATATTCATAAAGAAACACACATTACACAATTTTTTTAGTTACAAATTGGCCTTAGATATTTGCGGACCGTGTTTCCTGTTTCATGTTGTCTTTTCAAGGTTTTGGAGGATCCTAGTCTTCTTCAACACGTCAAAGATTTTTTACTGAAGTTCTCAAAGCCAATATGCTCTAAGAAACCTTCTTCCGCTGCTTATAAAGATGATAACGGTAAAGAACCAATGACTGCCAAATCTGACAATGAGATTGTTGAAGTTTTGGCAATGGAGAATCTCTACTGCTCAACAGCCGTGAAATTTGACAGGAAGTCAGTAAATGGGATTCAAAGGAAAAACAATGAGTTTGGCATTGATTCTCTAGATGATTTTTCAAATGGTTGTGAACAATATCACCCAATGGAAGATACTTTAAGACTTGAAGGTGCCGAGGGAGGGGCCTCTCGTTTCCAGAGTTTGCAGTTTCTAGATGATGACTTCAGTTACGGTTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCCGAAGCTTTGGCAAATCAGGAGAAAGTCTCATCTTCTCCAAGATTGAAAGATGCCAACAATTTACCTTTGAAAGAACTTCAAAACCCGAATCACACTCAATCAGGTTCCTTAGATCCCAGTTCTGACGAAGACATGCACTACAAGAGAACTATCTTCACCATTTTGGGAAGTTCAACTCAATTGGTTGGAAGTCCCCTTCTCCATAATTTCTCAAACAGGTCTAATTTCATACCATGGAAGAAAGTAGTGGCTGAGACACATACGCCCCCCATGCAACAAAGAATGTTAAAGAAGATTTTGTTTGCAGTTCCATTATTGTCAGCTGGTTCTCTAAAAGGCCTCAAGGATGAGGAACACTCGATCTTGAAACAGGGTAACAACGACTCCTGCACGAAAAATGCCACACTTGACAAATTAAAAGAAAATGAAAAATTTATGGCCCTTAAGTCGATGTTACCTTCACTTAATGAGGTACTTCCAGCTATTTTCTGTAAAATTTGTTTGAATTTGAAATGGAATTTGCAAGTTCCTAAGCATATACATTGTTTCGATATTGGCTTGAATCGTATTTTCTTCTATCTGAAATTTTAATTCTTTTAAGCAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATCTGAAGATGCTTGAAGCAAGAGTACAGGAGTTAGAAACTTGCATGGATTCGTTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAGCAGACTTCAGATAACTATGACTATGAGAAGATTGAAGGCAGCTTAAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACTGACCTGAAGCTGAAGAATGATTTTCCCAAAGTTGGACGTAAGCTAGATGTGAAAGTCAGCATGGAAGAGCATGAAGTTCTCGTTGACATGCACTGTCCATATCGAGAATATATATTGGTCGATGTCATGGATGCTTTGAATGATTTGCAACTGGATGCCTACTCGGTTCAATCTTCTGATCACAATGGTCTTTTCTCCTTGACCCTCAAATCTAAGGTTTGTAGCTGAGTTTCTGACATTTGTTTTTCATAGATTTCTTATCCCAATTGATTTATACACGCGCGCACAATAGGAAAAAAGTGAAAAAAGCAGTTCTCCAATAGCCATTTAATGATTTAATAGCCATTAGGAGTGCACACTGAAGCCGAGAAAACTGAAAAACTGACTTGAAGTGAACCACTAACATCCTTACTAGCAAACCATGGAGAAATGTCATGTTCGACCTTAAGTATGCACGAGTAGAGCTTATATTACTCTATCCTCTGGTAAAATACATGTTTTTCCTTCTATCTAGGATGTTCTTGGTGCCCAGGCTATACATTCCACAAATATTATCCGAGTGAGCTACTTTATTGAAATTTCGAAATGACTTCCTGTCAACTCCATACAGGAACTGCTGATATGTTTGTCTTCATATC
TTAGTTATAGAACTGCTTGTCATTTAGTATGAAGACTGATTAGGAGTAGGAGTAGGAGTACATTTCTATCTTTACACCAAGACTCAATAATTTACGCTGTTGAATTTCGTTTCAGTTTCGAGGGATGGCGGCTGCATCAGTTGGGATGATCAAACTAGCACTTTTGAAAGTTGTCAACAAGAGCTGA

mRNA sequence

ATGGCTAATGGACTTGAAAACTGTGACAGCGAACCTGGGTTTCTCAGAAAGCAGCTTGCTGTTGCTGTCAAGAGCATCCAGTGGAGCTATGCGATTTTCTGGTCACCGTCGTCTAGGCAACATGGGGTGCTGGAATGGTGTGATGGCTACTACAATGGAGACATCAAGACGAGGAAAACAGTTCAAGCTGAGGATGTCCACGTCGATAATATGGGCTTACACAGAAGTGAACAGTTGAGAGAGCTCTACAGATCTCTCTTAGAAGGTGAAAGTGAACAACGAACAAAGAAGCCTCCTGCCTCTTTGTCTCCTGAAGATCTATCTGATGCAGAATGGTATTACTTGGTTTGCATGTCCTTTTTCTTCAATCAAGGCCAAGGGTTGCCTGGAAGAGCGTTAGCTGATGATCGAACTATCTGGTTATGCAATGCTCAATATGCAGAGAGTACTGTATTCTCCCGCTCCTTGCTTGCAAAGGTTGGCCTTCTATTGACTGTGGTTTGCTTTCCTTACCTTGGCGGTGTTATAGAACTAGGTGTAACTGAGCAGGTTTTGGAGGATCCTAGTCTTCTTCAACACGTCAAAGATTTTTTACTGAAGTTCTCAAAGCCAATATGCTCTAAGAAACCTTCTTCCGCTGCTTATAAAGATGATAACGGTAAAGAACCAATGACTGCCAAATCTGACAATGAGATTGTTGAAGTTTTGGCAATGGAGAATCTCTACTGCTCAACAGCCGTGAAATTTGACAGGAAGTCAGTAAATGGGATTCAAAGGAAAAACAATGAGTTTGGCATTGATTCTCTAGATGATTTTTCAAATGGTTGTGAACAATATCACCCAATGGAAGATACTTTAAGACTTGAAGGTGCCGAGGGAGGGGCCTCTCGTTTCCAGAGTTTGCAGTTTCTAGATGATGACTTCAGTTACGGTTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCCGAAGCTTTGGCAAATCAGGAGAAAGTCTCATCTTCTCCAAGATTGAAAGATGCCAACAATTTACCTTTGAAAGAACTTCAAAACCCGAATCACACTCAATCAGGTTCCTTAGATCCCAGTTCTGACGAAGACATGCACTACAAGAGAACTATCTTCACCATTTTGGGAAGTTCAACTCAATTGGTTGGAAGTCCCCTTCTCCATAATTTCTCAAACAGGTCTAATTTCATACCATGGAAGAAAGTAGTGGCTGAGACACATACGCCCCCCATGCAACAAAGAATGTTAAAGAAGATTTTGTTTGCAGTTCCATTATTGTCAGCTGGTTCTCTAAAAGGCCTCAAGGATGAGGAACACTCGATCTTGAAACAGGGTAACAACGACTCCTGCACGAAAAATGCCACACTTGACAAATTAAAAGAAAATGAAAAATTTATGGCCCTTAAGTCGATGTTACCTTCACTTAATGAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATCTGAAGATGCTTGAAGCAAGAGTACAGGAGTTAGAAACTTGCATGGATTCGTTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAGCAGACTTCAGATAACTATGACTATGAGAAGATTGAAGGCAGCTTAAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACTGACCTGAAGCTGAAGAATGATTTTCCCAAAGTTGGACGTAAGCTAGATGTGAAAGTCAGCATGGAAGAGCATGAAGTTCTCGTTGACATGCACTGTCCATATCGAGAATATATATTGGTCGATGTCATGGATGCTTTGAATGATTTGCAACTGGATGCCTACTCGGTTCAATCTTCTGATCACAATGGTCTTTTCTCCTTGACCCTCAAATCTAAGTTTCGAGGGATGGCGGCTGCATCAGTTGGGATGATCAAACTAGCACTTTTGAAAGTTGTCAACAAGAGCTGA

Coding sequence (CDS)

ATGGCTAATGGACTTGAAAACTGTGACAGCGAACCTGGGTTTCTCAGAAAGCAGCTTGCTGTTGCTGTCAAGAGCATCCAGTGGAGCTATGCGATTTTCTGGTCACCGTCGTCTAGGCAACATGGGGTGCTGGAATGGTGTGATGGCTACTACAATGGAGACATCAAGACGAGGAAAACAGTTCAAGCTGAGGATGTCCACGTCGATAATATGGGCTTACACAGAAGTGAACAGTTGAGAGAGCTCTACAGATCTCTCTTAGAAGGTGAAAGTGAACAACGAACAAAGAAGCCTCCTGCCTCTTTGTCTCCTGAAGATCTATCTGATGCAGAATGGTATTACTTGGTTTGCATGTCCTTTTTCTTCAATCAAGGCCAAGGGTTGCCTGGAAGAGCGTTAGCTGATGATCGAACTATCTGGTTATGCAATGCTCAATATGCAGAGAGTACTGTATTCTCCCGCTCCTTGCTTGCAAAGGTTGGCCTTCTATTGACTGTGGTTTGCTTTCCTTACCTTGGCGGTGTTATAGAACTAGGTGTAACTGAGCAGGTTTTGGAGGATCCTAGTCTTCTTCAACACGTCAAAGATTTTTTACTGAAGTTCTCAAAGCCAATATGCTCTAAGAAACCTTCTTCCGCTGCTTATAAAGATGATAACGGTAAAGAACCAATGACTGCCAAATCTGACAATGAGATTGTTGAAGTTTTGGCAATGGAGAATCTCTACTGCTCAACAGCCGTGAAATTTGACAGGAAGTCAGTAAATGGGATTCAAAGGAAAAACAATGAGTTTGGCATTGATTCTCTAGATGATTTTTCAAATGGTTGTGAACAATATCACCCAATGGAAGATACTTTAAGACTTGAAGGTGCCGAGGGAGGGGCCTCTCGTTTCCAGAGTTTGCAGTTTCTAGATGATGACTTCAGTTACGGTTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCCGAAGCTTTGGCAAATCAGGAGAAAGTCTCATCTTCTCCAAGATTGAAAGATGCCAACAATTTACCTTTGAAAGAACTTCAAAACCCGAATCACACTCAATCAGGTTCCTTAGATCCCAGTTCTGACGAAGACATGCACTACAAGAGAACTATCTTCACCATTTTGGGAAGTTCAACTCAATTGGTTGGAAGTCCCCTTCTCCATAATTTCTCAAACAGGTCTAATTTCATACCATGGAAGAAAGTAGTGGCTGAGACACATACGCCCCCCATGCAACAAAGAATGTTAAAGAAGATTTTGTTTGCAGTTCCATTATTGTCAGCTGGTTCTCTAAAAGGCCTCAAGGATGAGGAACACTCGATCTTGAAACAGGGTAACAACGACTCCTGCACGAAAAATGCCACACTTGACAAATTAAAAGAAAATGAAAAATTTATGGCCCTTAAGTCGATGTTACCTTCACTTAATGAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATCTGAAGATGCTTGAAGCAAGAGTACAGGAGTTAGAAACTTGCATGGATTCGTTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAGCAGACTTCAGATAACTATGACTATGAGAAGATTGAAGGCAGCTTAAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACTGACCTGAAGCTGAAGAATGATTTTCCCAAAGTTGGACGTAAGCTAGATGTGAAAGTCAGCATGGAAGAGCATGAAGTTCTCGTTGACATGCACTGTCCATATCGAGAATATATATTGGTCGATGTCATGGATGCTTTGAATGATTTGCAACTGGATGCCTACTCGGTTCAATCTTCTGATCACAATGGTCTTTTCTCCTTGACCCTCAAATCTAAGTTTCGAGGGATGGCGGCTGCATCAGTTGGGATGATCAAACTAGCACTTTTGAAAGTTGTCAACAAGAGCTGA
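
The query protein in the BLAST alignments below is the translation of this coding sequence. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the function name `translate` is hypothetical, and a short prefix of the CDS is used here in place of the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G are rendered as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS shown above.
print(translate("ATGGCTAATGGACTTGAAAAC"))
```

Passing the full CDS string to `translate` should yield a protein that begins with the same residues as the BLAST query lines (MANGLEN...).
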
BLAST of CSPI06G00470 vs. Swiss-Prot
Match: GL3_ARATH (Transcription factor GLABRA 3 OS=Arabidopsis thaliana GN=GL3 PE=1 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 1.3e-114
Identity = 268/668 (40.12%), Positives = 377/668 (56.44%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MA G +N  + P  L+K LAV+V++IQWSY IFWS S+ Q GVLEW DGYYNGDIKTRKT
Sbjct: 1   MATG-QNRTTVPENLKKHLAVSVRNIQWSYGIFWSVSASQSGVLEWGDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESE--------QRTKKPPAS-LSPEDLSDAE 120
           +QA ++  D +GL RSEQL ELY SL   ES         Q T++  A+ LSPEDL+D E
Sbjct: 61  IQASEIKADQLGLRRSEQLSELYESLSVAESSSSGVAAGSQVTRRASAAALSPEDLADTE 120

Query: 121 WYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPY 180
           WYYLVCMSF FN G+G+PGR  A+   IWLCNA  A+S VFSRSLLAK   + TVVCFP+
Sbjct: 121 WYYLVCMSFVFNIGEGMPGRTFANGEPIWLCNAHTADSKVFSRSLLAKSAAVKTVVCFPF 180

Query: 181 LGGVIELGVTEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNE 240
           LGGV+E+G TE + ED +++Q VK   L+   P  +  P+ + Y  DN  +P     D  
Sbjct: 181 LGGVVEIGTTEHITEDMNVIQCVKTSFLEAPDPYATILPARSDYHIDNVLDPQQILGDEI 240

Query: 241 IVEVLAMENLYCSTAVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGA 300
              + + E    ++      ++ NG  +++ +   D                D+   E  
Sbjct: 241 YAPMFSTEPFPTASP----SRTTNGFDQEHEQVADD---------------HDSFMTERI 300

Query: 301 EGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALAN--QEKVSSSPRLKDANNLPLKEL 360
            GGAS+ QS Q +DD+ S     S+N SDC+S+        +V+   R      L   + 
Sbjct: 301 TGGASQVQSWQLMDDELSNCVHQSLNSSDCVSQTFVEGAAGRVAYGARKSRVQRLGQIQE 360

Query: 361 QNPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAE- 420
           Q  N  ++ S DP +D D+HY+  I TI  ++ QL+  P   N   +S+F  WKK  +  
Sbjct: 361 QQRN-VKTLSFDPRND-DVHYQSVISTIFKTNHQLILGPQFRNCDKQSSFTRWKKSSSSS 420

Query: 421 ----THTPPMQQRMLKKILFAVPLLSAGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKE 480
               T T P  Q MLKKI+F VP +     K + D   +  + GN+    K     + K 
Sbjct: 421 SGTATVTAP-SQGMLKKIIFDVPRVHQKE-KLMLDSPEARDETGNHAVLEKKR---REKL 480

Query: 481 NEKFMALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEER-----FRR 540
           NE+FM L+ ++PS+N+I+KVSIL+DTI+YL+ LE RVQELE+C +S   E R      R+
Sbjct: 481 NERFMTLRKIIPSINKIDKVSILDDTIEYLQELERRVQELESCRESTDTETRGTMTMKRK 540

Query: 541 KYLDMVEQTSDNYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEE 600
           K  D  E+TS N    +     K S N     E  +T           G   ++++    
Sbjct: 541 KPCDAGERTSANCANNETGNGKKVSVNNVGEAEPADTGF--------TGLTDNLRIGSFG 600

Query: 601 HEVLVDMHCPYREYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGM 648
           +EV++++ C +RE +L+++MD ++DL LD++SVQSS  +GL  LT+  K +G   A+ GM
Sbjct: 601 NEVVIELRCAWREGVLLEIMDVISDLHLDSHSVQSSTGDGLLCLTVNCKHKGSKIATPGM 633
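
Each HSP in these reports carries a summary line with identity and positive counts over the alignment length. As a minimal sketch, assuming the summary line keeps the layout shown in this record, the counts can be extracted and the percentages recomputed as a consistency check. The function name `parse_hsp_stats` is hypothetical; the pattern also accepts the misspelling "Postives" in case the raw page text is parsed.

```python
import re

# Accepts both "Positives" and the misspelt "Postives".
HSP_STATS_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return identity/positive counts and recomputed percentages for one HSP."""
    m = HSP_STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identity": (ident, ident_len),
        "positives": (pos, pos_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 268/668 (40.12%), Positives = 377/668 (56.44%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the GL3_ARATH hit, the recomputed values agree with the reported 40.12% identity and 56.44% positives.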

BLAST of CSPI06G00470 vs. Swiss-Prot
Match: EGL1_ARATH (Transcription factor EGL1 OS=Arabidopsis thaliana GN=BHLH2 PE=1 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 6.0e-112
Identity = 253/645 (39.22%), Positives = 371/645 (57.52%), Query Frame = 1

Query: 12  PGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDNM 71
           P  L+KQLAV+V++IQWSY IFWS S+ Q GVLEW DGYYNGDIKTRKT+QA +V +D +
Sbjct: 10  PDNLKKQLAVSVRNIQWSYGIFWSVSASQPGVLEWGDGYYNGDIKTRKTIQAAEVKIDQL 69

Query: 72  GLHRSEQLRELYRSL------LEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQG 131
           GL RSEQLRELY SL        G S+   +   A+LSPEDL+D EWYYLVCMSF FN G
Sbjct: 70  GLERSEQLRELYESLSLAESSASGSSQVTRRASAAALSPEDLTDTEWYYLVCMSFVFNIG 129

Query: 132 QGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVL 191
           +G+PG AL++   IWLCNA+ A+S VF+RSLLAK   L TVVCFP+LGGV+E+G TE + 
Sbjct: 130 EGIPGGALSNGEPIWLCNAETADSKVFTRSLLAKSASLQTVVCFPFLGGVLEIGTTEHIK 189

Query: 192 EDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCST 251
           ED +++Q VK   L+         P +      + +E     SD++   V   E      
Sbjct: 190 EDMNVIQSVKTLFLE-------APPYTTISTRSDYQEIFDPLSDDKYTPVFITE------ 249

Query: 252 AVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQSLQFLD 311
              F   S +G +++  +      D F N                 +GGAS+ QS QF+ 
Sbjct: 250 --AFPTTSTSGFEQEPEDH-----DSFIN-----------------DGGASQVQSWQFVG 309

Query: 312 DDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRLKDANNLPLKELQNPNHTQSGSLDPSS 371
           ++ S     S+N SDC+S+  +    +++  PR           +Q     Q  S   + 
Sbjct: 310 EEISNCIHQSLNSSDCVSQTFVGTTGRLACDPR--------KSRIQRLGQIQEQSNHVNM 369

Query: 372 DEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLKKILF 431
           D+D+HY+  I TI  ++ QL+  P   NF  RS+F  WK+  +        Q+M+KKILF
Sbjct: 370 DDDVHYQGVISTIFKTTHQLILGPQFQNFDKRSSFTRWKRSSSVKTLGEKSQKMIKKILF 429

Query: 432 AVPLLSAGSLKGLKDEE--HSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNEIN 491
            VPL++       K EE      ++  N + ++    +KL  NE+FM L+S++PS+++I+
Sbjct: 430 EVPLMN-------KKEELLPDTPEETGNHALSEKKRREKL--NERFMTLRSIIPSISKID 489

Query: 492 KVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLK 551
           KVSIL+DTI+YL+ L+ RVQELE+C +S   E R     + M+++   + + E+   +  
Sbjct: 490 KVSILDDTIEYLQDLQKRVQELESCRESADTETR-----ITMMKRKKPDDEEERASANCM 549

Query: 552 PSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMDAL 611
            S  K     + E +     D    G   ++++S   +EV++++ C +RE IL+++MD +
Sbjct: 550 NSKRKGSDVNVGEDE---PADIGYAGLTDNLRISSLGNEVVIELRCAWREGILLEIMDVI 592

Query: 612 NDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKV 648
           +DL LD++SVQSS  +GL  LT+  K +G   A+ GMI+ AL +V
Sbjct: 610 SDLNLDSHSVQSSTGDGLLCLTVNCKHKGTKIATTGMIQEALQRV 592

BLAST of CSPI06G00470 vs. Swiss-Prot
Match: BHLHW_PEA (Basic helix-loop-helix protein A OS=Pisum sativum GN=BHLH PE=3 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 6.1e-72
Identity = 211/673 (31.35%), Positives = 331/673 (49.18%), Query Frame = 1

Query: 15  LRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDNMGLH 74
           L+  L  AV+S+QW+Y++FW    +Q  +L W DGYYNG IKTRKTVQ  +V  +   L 
Sbjct: 13  LQNMLQAAVQSVQWTYSLFWQICPQQL-ILVWGDGYYNGAIKTRKTVQPMEVSAEEASLQ 72

Query: 75  RSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA 134
           RS+QLRELY SL  GE+   T++P ASLSPEDL+++EW+YL+C+SF F  G GLPG+A A
Sbjct: 73  RSQQLRELYESLSAGETNPPTRRPCASLSPEDLTESEWFYLMCVSFSFPPGVGLPGKAYA 132

Query: 135 DDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVLEDPSLLQHV 194
             + +WL  A   +S  FSR++LAK   + TVVC P L GV+E+G T+++ ED + ++HV
Sbjct: 133 RRQHVWLTGANEVDSKTFSRAILAKSANIQTVVCIPVLDGVVEIGTTDKIQEDLNFIKHV 192

Query: 195 KDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCSTAVKFDRKSV 254
           + F +         KP+ + +   N        S + I  ++       STA+       
Sbjct: 193 RSFFIDHHS--LPPKPALSEHSTSN-----PTYSTDHIPAIMYTVADPASTAIP------ 252

Query: 255 NGIQRKNNEFGIDSLDDFSNGCE----QYHPMEDTLRLEGAEGGASRFQSLQFLDDDFSY 314
           N      +E   D  D+  +G E    Q H    T  +E AE   S    ++  DD    
Sbjct: 253 NQDDMDEDEEEDDEDDEVESGSEDETNQGHNQHATSIIEAAE--PSELMQIEMPDDIRIG 312

Query: 315 GFQDSMNPSDCISEALANQEKVSSSPRLKDANNL---PLKE-------LQNPNHTQSGSL 374
              D  N  D     LA   + + S ++         P++E       +Q  +      L
Sbjct: 313 SPNDGSNNLDSDFHLLAVSNQGNPSRQIDSYTTERWGPIEEPLDDSLQIQLSSSVLHHPL 372

Query: 375 DPSSDEDMHYKRTIFTILGSSTQLVGSPLLH--NFSNRSNFIPWKKVVAETHTPP---MQ 434
           +  + ED HY +T+ TIL    Q + SP ++  N+S +S+F  W         PP     
Sbjct: 373 EDLTQEDTHYSQTVTTIL--QNQWIDSPSINYINYSTQSSFTTWTNHHFHPPPPPDPATS 432

Query: 435 QRMLKKILFAVPLLSAGSLKGLKDEEHSILKQGNNDSCT----KNATLDKL--------- 494
           Q ++K ILF VP L   +      +        +ND       K    D+L         
Sbjct: 433 QWLVKYILFTVPYLHTKNHDETSPQTRDTAGVNSNDPSARLRGKGTPQDELSANHVLAER 492

Query: 495 ----KENEKFMALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFR 554
               K NE+F+ L+S++P + +++K SIL DTI+YLK L  ++Q+LET    +  E    
Sbjct: 493 RRREKLNERFIILRSLVPFVTKMDKASILGDTIEYLKQLRRKIQDLETRNRQMESE---- 552

Query: 555 RKYLDMVEQTSDNYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSME 614
           +  + ++   ++      +EG+      + KA E+                   V+VS+ 
Sbjct: 553 KSGVTVLVGPTEKKKVRIVEGNGTGGGVRAKAVEV----------------VASVQVSII 612

Query: 615 EHEVLVDMHCPYREYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFR---GMAAA 649
           E + L+++ C  RE +L+DVM  L +L+++   VQSS +NG+F   L++K +        
Sbjct: 613 ESDALLEIECLQREGLLLDVMMMLRELRIEVIGVQSSLNNGVFVAELRAKVKENGNGKKV 647

BLAST of CSPI06G00470 vs. Swiss-Prot
Match: BH012_ARATH (Transcription factor MYC1 OS=Arabidopsis thaliana GN=BHLH12 PE=1 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 8.8e-55
Identity = 121/249 (48.59%), Positives = 159/249 (63.86%), Query Frame = 1

Query: 1   MANGLE----NCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIK 60
           MA+G+E        +   LRKQLA+AV+S+QWSYAIFWS S  Q GVLEW +G YNGD+K
Sbjct: 5   MADGVEAAAGRSKRQNSLLRKQLALAVRSVQWSYAIFWSSSLTQPGVLEWGEGCYNGDMK 64

Query: 61  TRKTVQAEDVHVDNMGLHRSEQLRELYRSLLEGES--------------EQRTKKPPASL 120
            RK  ++ + H    GL +S++LR+LY S+LEG+S              +         L
Sbjct: 65  KRK--KSYESHY-KYGLQKSKELRKLYLSMLEGDSGTTVSTTHDNLNDDDDNCHSTSMML 124

Query: 121 SPEDLSDAEWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGL 180
           SP+DLSD EWYYLV MS+ F+  Q LPGRA A   TIWLCNAQYAE+ +FSRSLLA+   
Sbjct: 125 SPDDLSDEEWYYLVSMSYVFSPSQCLPGRASATGETIWLCNAQYAENKLFSRSLLARSAS 184

Query: 181 LLTVVCFPYLGGVIELGVTEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKE 232
           + TVVCFPYLGGVIELGVTE + ED +LL+++K  L++ S           A++D++ ++
Sbjct: 185 IQTVVCFPYLGGVIELGVTELISEDHNLLRNIKSCLMEIS-----------AHQDNDDEK 239

BLAST of CSPI06G00470 vs. Swiss-Prot
Match: ARRS_MAIZE (Anthocyanin regulatory R-S protein OS=Zea mays GN=R-S PE=2 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 3.1e-52
Identity = 100/206 (48.54%), Positives = 135/206 (65.53%), Query Frame = 1

Query: 10  SEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVD 69
           +E   +R QLA A +SI WSYA+FWS S  Q GVL W DG+YNG++KTRK   + ++  D
Sbjct: 19  AERQLMRSQLAAAARSINWSYALFWSISDTQPGVLTWTDGFYNGEVKTRKISNSVELTSD 78

Query: 70  NMGLHRSEQLRELYRSLLEGESEQRTK--KPPASLSPEDLSDAEWYYLVCMSFFFNQGQG 129
            + + RS+QLRELY +LL GE ++R    +P  SLSPEDL D EWYY+V M++ F  GQG
Sbjct: 79  QLVMQRSDQLRELYEALLSGEGDRRAAPARPAGSLSPEDLGDTEWYYVVSMTYAFRPGQG 138

Query: 130 LPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVLED 189
           LPGR+ A D  +WLCNA  A S  F R+LLAK   + +++C P +GGV+ELG T+ V E 
Sbjct: 139 LPGRSFASDEHVWLCNAHLAGSKAFPRALLAKSASIQSILCIPVMGGVLELGTTDTVPEA 198

Query: 190 PSLLQHVKDFLLKFSKPICSKKPSSA 214
           P L+        +   P  S++PSS+
Sbjct: 199 PDLVSRATAAFWEPQCPTYSEEPSSS 224

BLAST of CSPI06G00470 vs. TrEMBL
Match: I6N8K6_CUCSA (GL3 OS=Cucumis sativus PE=2 SV=1)

HSP 1 Score: 1272.3 bits (3291), Expect = 0.0e+00
Identity = 639/651 (98.16%), Positives = 643/651 (98.77%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MANGLENCDSEPGFLRKQLAVAVKSIQWSYA+FWSPSSRQHGVLEWCDGYYNGDIKTRKT
Sbjct: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYALFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGV 180
           FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240
           TEQV EDPSLLQHVKDFLLKFS+PICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN
Sbjct: 181 TEQVSEDPSLLQHVKDFLLKFSRPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240

Query: 241 LYCSTAVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS 300
           LYCSTAVKFD KSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS
Sbjct: 241 LYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS 300

Query: 301 LQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSGSL 360
           LQFLDDDFSYGFQDSMNPSDCISEALA+QEKVSSSPRLKDANNLPLKE QNPNHTQSGSL
Sbjct: 301 LQFLDDDFSYGFQDSMNPSDCISEALADQEKVSSSPRLKDANNLPLKEHQNPNHTQSGSL 360

Query: 361 DPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLK 420
           DPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLK
Sbjct: 361 DPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLK 420

Query: 421 KILFAVPLLSAGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNE 480
           KILFAVPLLSAGSLKGLKDEE SILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNE
Sbjct: 421 KILFAVPLLSAGSLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNE 480

Query: 481 INKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS 540
           INKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS
Sbjct: 481 INKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS 540

Query: 541 LKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMD 600
           LKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMD
Sbjct: 541 LKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMD 600

Query: 601 ALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 652
           ALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS
Sbjct: 601 ALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 651

BLAST of CSPI06G00470 vs. TrEMBL
Match: A0A0A0KCZ7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G003480 PE=4 SV=1)

HSP 1 Score: 1015.8 bits (2625), Expect = 2.4e-293
Identity = 520/541 (96.12%), Positives = 523/541 (96.67%), Query Frame = 1

Query: 111 EWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFP 170
           +W Y +  S    Q  GLPGRALADDRTIWLCNAQYAESTVFSRSLLAK   + TVVCFP
Sbjct: 20  QWSYAIFWSPSSRQ-HGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFP 79

Query: 171 YLGGVIELGVTEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN 230
           YLGGVIELGVTEQV EDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN
Sbjct: 80  YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN 139

Query: 231 EIVEVLAMENLYCSTAVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG 290
           EIVEVLAMENLYCSTAVKFD KSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG
Sbjct: 140 EIVEVLAMENLYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG 199

Query: 291 AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQ 350
           AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKE Q
Sbjct: 200 AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKEHQ 259

Query: 351 NPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETH 410
           NPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETH
Sbjct: 260 NPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETH 319

Query: 411 TPPMQQRMLKKILFAVPLLSAGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKENEKFMA 470
           TPPMQQRMLKKILFAVPLLSAGSLKGLKDEE SILKQGNNDSCTKNATLDKLKENEKFMA
Sbjct: 320 TPPMQQRMLKKILFAVPLLSAGSLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMA 379

Query: 471 LKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSD 530
           LKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSD
Sbjct: 380 LKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSD 439

Query: 531 NYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPY 590
           NYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPY
Sbjct: 440 NYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPY 499

Query: 591 REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK 650
           REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK
Sbjct: 500 REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK 559

Query: 651 S 652
           S
Sbjct: 560 S 559

BLAST of CSPI06G00470 vs. TrEMBL
Match: A0A075BRK3_9ROSI (Basic helix-loop-helix protein OS=Morella rubra GN=bHLH2 PE=2 SV=1)

HSP 1 Score: 723.4 bits (1866), Expect = 2.5e-205
Identity = 387/653 (59.26%), Positives = 478/653 (73.20%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MANG +  D  P  LRK+LAVAV+SIQWSYAIFWS S+ Q GVLEW DGYYNGDIKTRKT
Sbjct: 1   MANGTQTHDGLPENLRKRLAVAVRSIQWSYAIFWSLSTTQQGVLEWGDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQA ++  D +GL RSEQLRELY+SLLEGE++Q+ K+P A+LSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAVELKADKIGLQRSEQLRELYQSLLEGEADQQAKRPSAALSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGV 180
            F+ G+GLPGRALA+ + IWLCNAQYA+S VFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 VFSPGEGLPGRALANGQAIWLCNAQYADSKVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240
           TE V EDPSLLQH+K  LL+ SKP+CS K S    K D+  +P+ A  + EI++ L +EN
Sbjct: 181 TELVSEDPSLLQHIKASLLELSKPVCSDKSSPTPPKADDDGDPICANVNLEIMDTLPLEN 240

Query: 241 LYCST-AVKFDRKSVNGIQRK-NNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRF 300
           LY  T  ++FDR+ +  +    + E  +DS D+ SNG E  H  ED+  L+G  GGAS+ 
Sbjct: 241 LYSPTEGIEFDREGIVELGGNIHEEINMDSPDECSNGXEHNHQTEDSFMLDGINGGASQV 300

Query: 301 QSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSG 360
           QS   LDDDFS G  DSMN SDCISEA  NQEK  S+ + +D N   LKELQN NHT+ G
Sbjct: 301 QSWHVLDDDFSNGVPDSMNSSDCISEAFVNQEKAISTLKREDVNQ-HLKELQNSNHTKLG 360

Query: 361 SLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPW-KKVVAETHTPPMQQR 420
           SLD  +D+D+HY+R +  I+GSS +L+ +   H   +RSNF+ W K+ + + + P  QQ 
Sbjct: 361 SLDLGADDDLHYRRILSAIVGSSPRLIENLRFHYTDHRSNFLCWTKEALGDAYRPQAQQT 420

Query: 421 MLKKILFAVPLLSAGSLKGLKDE---EHSILKQGNNDSCTKNATLDKLKENEKFMALKSM 480
           MLKKILF VPL+  G    L+ E   +  + K  + D C  +   D  +ENE F+ALKSM
Sbjct: 421 MLKKILFTVPLMYGGCSFRLQRENCGKEWLRKSESGDICLGHVLSDNRRENENFLALKSM 480

Query: 481 LPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDY 540
           +PS++EI+K SIL DTIKYLK LEARV+ELE+CMDS+ YEER RRKYLDMVEQ SDN D 
Sbjct: 481 VPSISEIDKASILRDTIKYLKELEARVEELESCMDSVDYEERARRKYLDMVEQISDNCDK 540

Query: 541 EKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYI 600
           +KI+   K   NKRKACE DETD +L    P+    LDVKVS++E EVL++M CPYREY+
Sbjct: 541 KKIDNGKKSWINKRKACEFDETDPELNRVVPEDSLPLDVKVSIKEQEVLIEMRCPYREYV 600

Query: 601 LVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKV 648
           L+DVMDA+N+L L+A+SVQSS  NG+ +LTLKSKFRG A A VGMIK AL K+
Sbjct: 601 LLDVMDAINNLHLEAHSVQSSAPNGILTLTLKSKFRGAATAPVGMIKQALWKI 652

BLAST of CSPI06G00470 vs. TrEMBL
Match: F6I629_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02560 PE=4 SV=1)

HSP 1 Score: 711.8 bits (1836), Expect = 7.4e-202
Identity = 376/660 (56.97%), Positives = 486/660 (73.64%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MANG++N +  P  L KQLAVAV+SIQWSYAIFWS S+RQ GVLEW  GYYNGDIKTRKT
Sbjct: 1   MANGVQNQEGVPENLSKQLAVAVRSIQWSYAIFWSLSTRQQGVLEWSGGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQ  ++  D MGL RSEQLRELY SLLEGE++Q++K+P A+LSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQEMELKADKMGLQRSEQLRELYESLLEGETDQQSKRPSAALSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGV 180
            FN G+GLPGRALA+ ++IWLC+AQYA+S VFSRSLLAK   + TVVCFP++GGVIELGV
Sbjct: 121 VFNPGEGLPGRALANGQSIWLCDAQYADSKVFSRSLLAKSASIQTVVCFPHMGGVIELGV 180

Query: 181 TEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240
           TE V EDPSL+QH+K  LL+ SKPICS+K S      D+ K+ M AK D++IVE +A+E 
Sbjct: 181 TELVPEDPSLIQHIKACLLELSKPICSEKSSFVPCNTDDDKDRMCAKVDHDIVETMALEK 240

Query: 241 LYCST-AVKFDRKSVNGIQRK-NNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRF 300
           LY +T  +KF+++ ++ +    + E  I S DD SNGCE  H  ED+  LEG  GGAS+ 
Sbjct: 241 LYPATEEIKFEQEGMSELHGNIHEEHNIGSPDDCSNGCEDDHQTEDSFMLEGINGGASQV 300

Query: 301 QSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSG 360
           QS  F+DDDFS G Q SM+ SDCIS+A  NQE++ SSP+ ++ NN+ LK+LQ  N T+  
Sbjct: 301 QSWHFVDDDFSNGVQGSMDSSDCISQAFVNQERIHSSPKGENVNNVRLKDLQECNDTKFS 360

Query: 361 SLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKK-VVAETHTPPMQQR 420
           SLD  +D+D+HY+RTI T+L  S  L+G+     +  +S+FI WKK  + +   P  QQR
Sbjct: 361 SLDLGADDDLHYRRTISTVLRKSHPLIGNSCFRCYDIKSSFITWKKGGMLDAQKPQTQQR 420

Query: 421 MLKKILFAVPLLSAGSLKGLKDEEHS-----ILKQGNNDSCTKNATLDKLKENEKFMALK 480
           +LKKILF VPL+  G   G K ++ +     + K G++  C ++A  DK +E EKF+ L+
Sbjct: 421 ILKKILFTVPLMHGGC--GFKSQKENAGRDGLWKSGSDGICKQHALSDKKREKEKFLVLR 480

Query: 481 SMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMD-SLYYEERFRRKYLDMVEQTSDN 540
           SM+PS+N+I++VSIL DTI+YLK LEARV+ELET MD     E R R+KYLDMVEQTSDN
Sbjct: 481 SMVPSINKIDEVSILGDTIEYLKKLEARVEELETSMDLQTELEARARQKYLDMVEQTSDN 540

Query: 541 YDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVG-RKLDVKVSMEEHEVLVDMHCPY 600
           YD + I+   K   NKRKAC++DETDL++    PK      D+KV + E EVL++M CP+
Sbjct: 541 YDDKMIDDGKKLWINKRKACDIDETDLEINEIIPKDSLPSSDMKVRINEQEVLIEMRCPW 600

Query: 601 REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK 651
           REY+L+D+MDA+N+L LD +SVQSS+H+G  +LTLKSKFRG A AS GMIK AL ++ +K
Sbjct: 601 REYLLLDIMDAINNLHLDCHSVQSSNHDGFLTLTLKSKFRGRAVASAGMIKQALWRITSK 658

BLAST of CSPI06G00470 vs. TrEMBL
Match: A2TEF4_VITVI (Myc anthocyanin regulatory protein OS=Vitis vinifera GN=MYCA1 PE=2 SV=3)

HSP 1 Score: 706.8 bits (1823), Expect = 2.4e-200
Identity = 375/660 (56.82%), Positives = 485/660 (73.48%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MANG++N +  P  L KQLAVAV+SIQWSYAIFWS S+RQ GVLEW  GYYNGDIKTRKT
Sbjct: 1   MANGVQNQEGVPENLSKQLAVAVRSIQWSYAIFWSLSTRQQGVLEWSGGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQ  ++  D MGL RSEQLRELY SLLEGE++Q++K+P A+LSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQEMELKADKMGLQRSEQLRELYESLLEGETDQQSKRPSAALSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGV 180
            FN G+GLPGRALA+ ++IWLC+AQYA+S VFSRSLLAK     TVVCFP++GGVIELGV
Sbjct: 121 VFNPGEGLPGRALANGQSIWLCDAQYADSKVFSRSLLAK-----TVVCFPHMGGVIELGV 180

Query: 181 TEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240
           TE V EDPSL+QH+K  LL+ SKPICS+K S      D+ K+ M AK D++IVE +A+E 
Sbjct: 181 TELVPEDPSLIQHIKACLLELSKPICSEKSSFVPCNTDDDKDRMCAKVDHDIVETMALEK 240

Query: 241 LYCST-AVKFDRKSVNGIQRK-NNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRF 300
           LY +T  +KF+++ ++ +    + E  I S DD SNGCE  H  ED+  LEG  GGAS+ 
Sbjct: 241 LYPATEEIKFEQEGMSELHGNIHEEHNIGSPDDCSNGCEDDHQTEDSFMLEGINGGASQV 300

Query: 301 QSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSG 360
           QS  F+DDDFS G Q SM+ SDCIS+A  NQE++ SSP+ ++ NN+ LK+LQ  N T+  
Sbjct: 301 QSWHFVDDDFSNGVQGSMDSSDCISQAFVNQERIHSSPKGENVNNVRLKDLQECNDTKFS 360

Query: 361 SLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKK-VVAETHTPPMQQR 420
           SLD  +D+D+HY+RTI T+L  S  L+G+     +  +S+FI WKK  + +   P  QQR
Sbjct: 361 SLDLGADDDLHYRRTISTVLRKSHPLIGNSCFRCYDIKSSFITWKKGGMLDAQKPQTQQR 420

Query: 421 MLKKILFAVPLLSAGSLKGLKDEEHS-----ILKQGNNDSCTKNATLDKLKENEKFMALK 480
           +LKKILF VPL+  G   G K ++ +     + K G++  C ++A  DK +E EKF+ L+
Sbjct: 421 ILKKILFTVPLMHGGC--GFKSQKENAGRDGLWKSGSDGICKQHALSDKKREKEKFLVLR 480

Query: 481 SMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMD-SLYYEERFRRKYLDMVEQTSDN 540
           SM+PS+N+I++VSIL DTI+YLK LEARV+ELET MD     + R R+KYLDMVEQTSDN
Sbjct: 481 SMVPSINKIDEVSILGDTIEYLKKLEARVEELETSMDLQTELDARARQKYLDMVEQTSDN 540

Query: 541 YDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVG-RKLDVKVSMEEHEVLVDMHCPY 600
           YD + I+   K   NKRKAC++DETDL++    PK      D+KV + E EVL++M CP+
Sbjct: 541 YDDKMIDDGKKLWINKRKACDIDETDLEINEIIPKDSLPSSDMKVRINEQEVLIEMRCPW 600

Query: 601 REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK 651
           REY+L+D+MDA+N+L LD +SVQSS+H+G  +LTLKSKFRG A AS GMIK AL ++ +K
Sbjct: 601 REYLLLDIMDAINNLHLDCHSVQSSNHDGFLTLTLKSKFRGRAVASAGMIKQALWRITSK 653

BLAST of CSPI06G00470 vs. TAIR10
Match: AT5G41315.1 (AT5G41315.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 415.2 bits (1066), Expect = 7.3e-116
Identity = 268/668 (40.12%), Positives = 377/668 (56.44%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MA G +N  + P  L+K LAV+V++IQWSY IFWS S+ Q GVLEW DGYYNGDIKTRKT
Sbjct: 1   MATG-QNRTTVPENLKKHLAVSVRNIQWSYGIFWSVSASQSGVLEWGDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESE--------QRTKKPPAS-LSPEDLSDAE 120
           +QA ++  D +GL RSEQL ELY SL   ES         Q T++  A+ LSPEDL+D E
Sbjct: 61  IQASEIKADQLGLRRSEQLSELYESLSVAESSSSGVAAGSQVTRRASAAALSPEDLADTE 120

Query: 121 WYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPY 180
           WYYLVCMSF FN G+G+PGR  A+   IWLCNA  A+S VFSRSLLAK   + TVVCFP+
Sbjct: 121 WYYLVCMSFVFNIGEGMPGRTFANGEPIWLCNAHTADSKVFSRSLLAKSAAVKTVVCFPF 180

Query: 181 LGGVIELGVTEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNE 240
           LGGV+E+G TE + ED +++Q VK   L+   P  +  P+ + Y  DN  +P     D  
Sbjct: 181 LGGVVEIGTTEHITEDMNVIQCVKTSFLEAPDPYATILPARSDYHIDNVLDPQQILGDEI 240

Query: 241 IVEVLAMENLYCSTAVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGA 300
              + + E    ++      ++ NG  +++ +   D                D+   E  
Sbjct: 241 YAPMFSTEPFPTASP----SRTTNGFDQEHEQVADD---------------HDSFMTERI 300

Query: 301 EGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALAN--QEKVSSSPRLKDANNLPLKEL 360
            GGAS+ QS Q +DD+ S     S+N SDC+S+        +V+   R      L   + 
Sbjct: 301 TGGASQVQSWQLMDDELSNCVHQSLNSSDCVSQTFVEGAAGRVAYGARKSRVQRLGQIQE 360

Query: 361 QNPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAE- 420
           Q  N  ++ S DP +D D+HY+  I TI  ++ QL+  P   N   +S+F  WKK  +  
Sbjct: 361 QQRN-VKTLSFDPRND-DVHYQSVISTIFKTNHQLILGPQFRNCDKQSSFTRWKKSSSSS 420

Query: 421 ----THTPPMQQRMLKKILFAVPLLSAGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKE 480
               T T P  Q MLKKI+F VP +     K + D   +  + GN+    K     + K 
Sbjct: 421 SGTATVTAP-SQGMLKKIIFDVPRVHQKE-KLMLDSPEARDETGNHAVLEKKR---REKL 480

Query: 481 NEKFMALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEER-----FRR 540
           NE+FM L+ ++PS+N+I+KVSIL+DTI+YL+ LE RVQELE+C +S   E R      R+
Sbjct: 481 NERFMTLRKIIPSINKIDKVSILDDTIEYLQELERRVQELESCRESTDTETRGTMTMKRK 540

Query: 541 KYLDMVEQTSDNYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEE 600
           K  D  E+TS N    +     K S N     E  +T           G   ++++    
Sbjct: 541 KPCDAGERTSANCANNETGNGKKVSVNNVGEAEPADTGF--------TGLTDNLRIGSFG 600

Query: 601 HEVLVDMHCPYREYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGM 648
           +EV++++ C +RE +L+++MD ++DL LD++SVQSS  +GL  LT+  K +G   A+ GM
Sbjct: 601 NEVVIELRCAWREGVLLEIMDVISDLHLDSHSVQSSTGDGLLCLTVNCKHKGSKIATPGM 633
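
The identity and positives percentages in an HSP header are the match counts divided by the alignment length (for the AT5G41315.1 hit above, 268/668 and 377/668). The following is a minimal Python sketch that parses such a statistics line and recomputes both percentages as a sanity check. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical helpers introduced for illustration and are not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header. "Posi?tives" also accepts
# the spelling "Postives" that appears in some exported reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return reported and recomputed percentages for one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_recomputed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_recomputed": round(100 * int(pos) / int(pos_len), 2),
        "alignment_length": int(ident_len),
    }


# Statistics line of the AT5G41315.1 HSP, copied from the report above.
stats = parse_hsp_stats(
    "Identity = 268/668 (40.12%), Postives = 377/668 (56.44%), Query Frame = 1"
)
print(stats)
```

If the reported and recomputed values disagree, the header line was probably damaged during export.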

BLAST of CSPI06G00470 vs. TAIR10
Match: AT1G63650.1 (AT1G63650.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 406.4 bits (1043), Expect = 3.4e-113
Identity = 253/645 (39.22%), Positives = 371/645 (57.52%), Query Frame = 1

Query: 12  PGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDNM 71
           P  L+KQLAV+V++IQWSY IFWS S+ Q GVLEW DGYYNGDIKTRKT+QA +V +D +
Sbjct: 10  PDNLKKQLAVSVRNIQWSYGIFWSVSASQPGVLEWGDGYYNGDIKTRKTIQAAEVKIDQL 69

Query: 72  GLHRSEQLRELYRSL------LEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQG 131
           GL RSEQLRELY SL        G S+   +   A+LSPEDL+D EWYYLVCMSF FN G
Sbjct: 70  GLERSEQLRELYESLSLAESSASGSSQVTRRASAAALSPEDLTDTEWYYLVCMSFVFNIG 129

Query: 132 QGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVL 191
           +G+PG AL++   IWLCNA+ A+S VF+RSLLAK   L TVVCFP+LGGV+E+G TE + 
Sbjct: 130 EGIPGGALSNGEPIWLCNAETADSKVFTRSLLAKSASLQTVVCFPFLGGVLEIGTTEHIK 189

Query: 192 EDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCST 251
           ED +++Q VK   L+         P +      + +E     SD++   V   E      
Sbjct: 190 EDMNVIQSVKTLFLE-------APPYTTISTRSDYQEIFDPLSDDKYTPVFITE------ 249

Query: 252 AVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQSLQFLD 311
              F   S +G +++  +      D F N                 +GGAS+ QS QF+ 
Sbjct: 250 --AFPTTSTSGFEQEPEDH-----DSFIN-----------------DGGASQVQSWQFVG 309

Query: 312 DDFSYGFQDSMNPSDCISEA-LANQEKVSSSPRLKDANNLPLKELQNPNHTQSGSLDPSS 371
           ++ S     S+N SDC+S+  +    +++  PR           +Q     Q  S   + 
Sbjct: 310 EEISNCIHQSLNSSDCVSQTFVGTTGRLACDPR--------KSRIQRLGQIQEQSNHVNM 369

Query: 372 DEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLKKILF 431
           D+D+HY+  I TI  ++ QL+  P   NF  RS+F  WK+  +        Q+M+KKILF
Sbjct: 370 DDDVHYQGVISTIFKTTHQLILGPQFQNFDKRSSFTRWKRSSSVKTLGEKSQKMIKKILF 429

Query: 432 AVPLLSAGSLKGLKDEE--HSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNEIN 491
            VPL++       K EE      ++  N + ++    +KL  NE+FM L+S++PS+++I+
Sbjct: 430 EVPLMN-------KKEELLPDTPEETGNHALSEKKRREKL--NERFMTLRSIIPSISKID 489

Query: 492 KVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLK 551
           KVSIL+DTI+YL+ L+ RVQELE+C +S   E R     + M+++   + + E+   +  
Sbjct: 490 KVSILDDTIEYLQDLQKRVQELESCRESADTETR-----ITMMKRKKPDDEEERASANCM 549

Query: 552 PSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMDAL 611
            S  K     + E +     D    G   ++++S   +EV++++ C +RE IL+++MD +
Sbjct: 550 NSKRKGSDVNVGEDE---PADIGYAGLTDNLRISSLGNEVVIELRCAWREGILLEIMDVI 592

Query: 612 NDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKV 648
           +DL LD++SVQSS  +GL  LT+  K +G   A+ GMI+ AL +V
Sbjct: 610 SDLNLDSHSVQSSTGDGLLCLTVNCKHKGTKIATTGMIQEALQRV 592

BLAST of CSPI06G00470 vs. TAIR10
Match: AT4G00480.2 (AT4G00480.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 216.5 bits (550), Expect = 5.0e-56
Identity = 121/249 (48.59%), Positives = 159/249 (63.86%), Query Frame = 1

Query: 1   MANGLE----NCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIK 60
           MA+G+E        +   LRKQLA+AV+S+QWSYAIFWS S  Q GVLEW +G YNGD+K
Sbjct: 5   MADGVEAAAGRSKRQNSLLRKQLALAVRSVQWSYAIFWSSSLTQPGVLEWGEGCYNGDMK 64

Query: 61  TRKTVQAEDVHVDNMGLHRSEQLRELYRSLLEGES--------------EQRTKKPPASL 120
            RK  ++ + H    GL +S++LR+LY S+LEG+S              +         L
Sbjct: 65  KRK--KSYESHY-KYGLQKSKELRKLYLSMLEGDSGTTVSTTHDNLNDDDDNCHSTSMML 124

Query: 121 SPEDLSDAEWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGL 180
           SP+DLSD EWYYLV MS+ F+  Q LPGRA A   TIWLCNAQYAE+ +FSRSLLA+   
Sbjct: 125 SPDDLSDEEWYYLVSMSYVFSPSQCLPGRASATGETIWLCNAQYAENKLFSRSLLARSAS 184

Query: 181 LLTVVCFPYLGGVIELGVTEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKE 232
           + TVVCFPYLGGVIELGVTE + ED +LL+++K  L++ S           A++D++ ++
Sbjct: 185 IQTVVCFPYLGGVIELGVTELISEDHNLLRNIKSCLMEIS-----------AHQDNDDEK 239

BLAST of CSPI06G00470 vs. TAIR10
Match: AT4G09820.1 (AT4G09820.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 176.8 bits (447), Expect = 4.4e-44
Identity = 94/224 (41.96%), Positives = 131/224 (58.48%), Query Frame = 1

Query: 15  LRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDNMGLH 74
           L+  L  AV+S+ W+Y++FW    +Q  VL W +GYYNG IKTRKT Q  +V  +   L 
Sbjct: 20  LQGLLKTAVQSVDWTYSVFWQFCPQQR-VLVWGNGYYNGAIKTRKTTQPAEVTAEEAALE 79

Query: 75  RSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA 134
           RS+QLRELY +LL GES    +   A LSPEDL++ EW+YL+C+SF F    G+PG+A A
Sbjct: 80  RSQQLRELYETLLAGESTSEARACTA-LSPEDLTETEWFYLMCVSFSFPPPSGMPGKAYA 139

Query: 135 DDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVLEDPSLLQHV 194
             + +WL  A   +S  FSR++LAK   + TVVC P L GV+ELG T++V ED   ++  
Sbjct: 140 RRKHVWLSGANEVDSKTFSRAILAKSAKIQTVVCIPMLDGVVELGTTKKVREDVEFVELT 199

Query: 195 KDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAM 239
           K F        C   P  A  +    +    A+ + E+ E + M
Sbjct: 200 KSFFYDH----CKTNPKPALSEHSTYEVHEEAEDEEEVEEEMTM 237

BLAST of CSPI06G00470 vs. TAIR10
Match: AT1G32640.1 (AT1G32640.1 Basic helix-loop-helix (bHLH) DNA-binding family protein)

HSP 1 Score: 88.2 bits (217), Expect = 2.0e-17
Identity = 55/173 (31.79%), Positives = 82/173 (47.40%), Query Frame = 1

Query: 28  WSYAIFWSPSSRQHG--VLEWCDGYYNGD---IKTRKTVQAEDVHVDNMGLHRSEQLREL 87
           W+YAIFW PS    G  VL W DGYY G+      R+   +          +R + LREL
Sbjct: 83  WTYAIFWQPSYDFSGASVLGWGDGYYKGEEDKANPRRRSSSPPFSTPADQEYRKKVLREL 142

Query: 88  YRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALADDRTIWLC 147
             SL+ G        P      E+++D EW++LV M+  F  G GL G+A A    +W+ 
Sbjct: 143 -NSLISGGVA-----PSDDAVDEEVTDTEWFFLVSMTQSFACGAGLAGKAFATGNAVWVS 202

Query: 148 NAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVLEDPSLLQHVK 196
            +     +   R+    V  + T+ C P   GV+E+G TE + +   L+  V+
Sbjct: 203 GSDQLSGSGCERAKQGGVFGMHTIACIPSANGVVEVGSTEPIRQSSDLINKVR 249

BLAST of CSPI06G00470 vs. NCBI nr
Match: gi|793421610|ref|NP_001292635.1| (transcription factor EGL1 [Cucumis sativus])

HSP 1 Score: 1272.3 bits (3291), Expect = 0.0e+00
Identity = 639/651 (98.16%), Positives = 643/651 (98.77%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MANGLENCDSEPGFLRKQLAVAVKSIQWSYA+FWSPSSRQHGVLEWCDGYYNGDIKTRKT
Sbjct: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYALFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGV 180
           FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240
           TEQV EDPSLLQHVKDFLLKFS+PICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN
Sbjct: 181 TEQVSEDPSLLQHVKDFLLKFSRPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240

Query: 241 LYCSTAVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS 300
           LYCSTAVKFD KSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS
Sbjct: 241 LYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS 300

Query: 301 LQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSGSL 360
           LQFLDDDFSYGFQDSMNPSDCISEALA+QEKVSSSPRLKDANNLPLKE QNPNHTQSGSL
Sbjct: 301 LQFLDDDFSYGFQDSMNPSDCISEALADQEKVSSSPRLKDANNLPLKEHQNPNHTQSGSL 360

Query: 361 DPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLK 420
           DPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLK
Sbjct: 361 DPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLK 420

Query: 421 KILFAVPLLSAGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNE 480
           KILFAVPLLSAGSLKGLKDEE SILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNE
Sbjct: 421 KILFAVPLLSAGSLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNE 480

Query: 481 INKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS 540
           INKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS
Sbjct: 481 INKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGS 540

Query: 541 LKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMD 600
           LKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMD
Sbjct: 541 LKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMD 600

Query: 601 ALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 652
           ALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS
Sbjct: 601 ALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 651

BLAST of CSPI06G00470 vs. NCBI nr
Match: gi|778709077|ref|XP_011656339.1| (PREDICTED: transcription factor EGL1 isoform X2 [Cucumis sativus])

HSP 1 Score: 1254.6 bits (3245), Expect = 0.0e+00
Identity = 632/641 (98.60%), Positives = 633/641 (98.75%), Query Frame = 1

Query: 11  EPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN 70
           EPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN
Sbjct: 4   EPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN 63

Query: 71  MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 130
           MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG
Sbjct: 64  MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 123

Query: 131 RALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVLEDPSL 190
           RALADDRTIWLCNAQYAESTVFSRSLLAK   + TVVCFPYLGGVIELGVTEQV EDPSL
Sbjct: 124 RALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFPYLGGVIELGVTEQVSEDPSL 183

Query: 191 LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCSTAVKFD 250
           LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCSTAVKFD
Sbjct: 184 LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCSTAVKFD 243

Query: 251 RKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQSLQFLDDDFSY 310
            KSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQSLQFLDDDFSY
Sbjct: 244 GKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQSLQFLDDDFSY 303

Query: 311 GFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSGSLDPSSDEDMHY 370
           GFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKE QNPNHTQSGSLDPSSDEDMHY
Sbjct: 304 GFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKEHQNPNHTQSGSLDPSSDEDMHY 363

Query: 371 KRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLKKILFAVPLLS 430
           KRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLKKILFAVPLLS
Sbjct: 364 KRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLKKILFAVPLLS 423

Query: 431 AGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNEINKVSILNDT 490
           AGSLKGLKDEE SILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNEINKVSILNDT
Sbjct: 424 AGSLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNEINKVSILNDT 483

Query: 491 IKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKRKA 550
           IKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKRKA
Sbjct: 484 IKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKRKA 543

Query: 551 CEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMDALNDLQLDAY 610
           CEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMDALNDLQLDAY
Sbjct: 544 CEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMDALNDLQLDAY 603

Query: 611 SVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 652
           SVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS
Sbjct: 604 SVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 644

BLAST of CSPI06G00470 vs. NCBI nr
Match: gi|659116733|ref|XP_008458230.1| (PREDICTED: LOW QUALITY PROTEIN: transcription factor EGL1-like [Cucumis melo])

HSP 1 Score: 1219.1 bits (3153), Expect = 0.0e+00
Identity = 611/641 (95.32%), Positives = 623/641 (97.19%), Query Frame = 1

Query: 11  EPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN 70
           EPGFLRKQLAVAVKSIQWSYAIFWSPS+RQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN
Sbjct: 4   EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN 63

Query: 71  MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 130
           MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG
Sbjct: 64  MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 123

Query: 131 RALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGVTEQVLEDPSL 190
           RALADDRTIWLCNAQYAES+VFSRSLLAK   + TVVCFPYLGGVIELGVTEQV EDP L
Sbjct: 124 RALADDRTIWLCNAQYAESSVFSRSLLAKSASIQTVVCFPYLGGVIELGVTEQVAEDPCL 183

Query: 191 LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCSTAVKFD 250
           LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVE LAMENLYCSTAVKFD
Sbjct: 184 LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEFLAMENLYCSTAVKFD 243

Query: 251 RKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQSLQFLDDDFSY 310
            KSVNGIQR NNEFGIDSLDDFSNGCEQYH MED+LRLEG EGGASRFQSLQFLDDDFSY
Sbjct: 244 GKSVNGIQRXNNEFGIDSLDDFSNGCEQYHQMEDSLRLEGVEGGASRFQSLQFLDDDFSY 303

Query: 311 GFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSGSLDPSSDEDMHY 370
           GFQDSMNPSDCISEALANQ+KVSSSPRLKDANNLPLKELQNPN TQSGSLDPSSDEDMHY
Sbjct: 304 GFQDSMNPSDCISEALANQDKVSSSPRLKDANNLPLKELQNPNQTQSGSLDPSSDEDMHY 363

Query: 371 KRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLKKILFAVPLLS 430
           KRTIFTILGSSTQLVGSPLLHNFSNRSNF PWKKV+AETHTPPMQQRMLKKILFAVPLLS
Sbjct: 364 KRTIFTILGSSTQLVGSPLLHNFSNRSNFTPWKKVMAETHTPPMQQRMLKKILFAVPLLS 423

Query: 431 AGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNEINKVSILNDT 490
           AGSLKGLKD E SILKQGNN+SCTKNATLDKL+ENEKFMALKSMLPSLNEINKVSILNDT
Sbjct: 424 AGSLKGLKDVERSILKQGNNNSCTKNATLDKLRENEKFMALKSMLPSLNEINKVSILNDT 483

Query: 491 IKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKRKA 550
           IKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKRKA
Sbjct: 484 IKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKRKA 543

Query: 551 CEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMDALNDLQLDAY 610
           CEMDETDLKLK+DFPKVG KLDVKVSMEEHEVL+DMHCPYREYILVDV+DALNDLQLDAY
Sbjct: 544 CEMDETDLKLKHDFPKVGHKLDVKVSMEEHEVLIDMHCPYREYILVDVVDALNDLQLDAY 603

Query: 611 SVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 652
           SVQSSDHNG FSLTLKSKFRG+AAASVGMIKLALLKV NKS
Sbjct: 604 SVQSSDHNGFFSLTLKSKFRGIAAASVGMIKLALLKVANKS 644

BLAST of CSPI06G00470 vs. NCBI nr
Match: gi|700190452|gb|KGN45656.1| (hypothetical protein Csa_6G003480 [Cucumis sativus])

HSP 1 Score: 1015.8 bits (2625), Expect = 3.5e-293
Identity = 520/541 (96.12%), Positives = 523/541 (96.67%), Query Frame = 1

Query: 111 EWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFP 170
           +W Y +  S    Q  GLPGRALADDRTIWLCNAQYAESTVFSRSLLAK   + TVVCFP
Sbjct: 20  QWSYAIFWSPSSRQ-HGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFP 79

Query: 171 YLGGVIELGVTEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN 230
           YLGGVIELGVTEQV EDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN
Sbjct: 80  YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN 139

Query: 231 EIVEVLAMENLYCSTAVKFDRKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG 290
           EIVEVLAMENLYCSTAVKFD KSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG
Sbjct: 140 EIVEVLAMENLYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG 199

Query: 291 AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQ 350
           AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKE Q
Sbjct: 200 AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKEHQ 259

Query: 351 NPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETH 410
           NPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETH
Sbjct: 260 NPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETH 319

Query: 411 TPPMQQRMLKKILFAVPLLSAGSLKGLKDEEHSILKQGNNDSCTKNATLDKLKENEKFMA 470
           TPPMQQRMLKKILFAVPLLSAGSLKGLKDEE SILKQGNNDSCTKNATLDKLKENEKFMA
Sbjct: 320 TPPMQQRMLKKILFAVPLLSAGSLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMA 379

Query: 471 LKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSD 530
           LKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSD
Sbjct: 380 LKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSD 439

Query: 531 NYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPY 590
           NYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPY
Sbjct: 440 NYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPY 499

Query: 591 REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK 650
           REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK
Sbjct: 500 REYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNK 559

Query: 651 S 652
           S
Sbjct: 560 S 559

BLAST of CSPI06G00470 vs. NCBI nr
Match: gi|514482123|gb|AGO58373.1| (basic helix-loop-helix protein [Morella rubra])

HSP 1 Score: 723.4 bits (1866), Expect = 3.5e-205
Identity = 387/653 (59.26%), Positives = 478/653 (73.20%), Query Frame = 1

Query: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60
           MANG +  D  P  LRK+LAVAV+SIQWSYAIFWS S+ Q GVLEW DGYYNGDIKTRKT
Sbjct: 1   MANGTQTHDGLPENLRKRLAVAVRSIQWSYAIFWSLSTTQQGVLEWGDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQA ++  D +GL RSEQLRELY+SLLEGE++Q+ K+P A+LSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAVELKADKIGLQRSEQLRELYQSLLEGEADQQAKRPSAALSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKVGLLLTVVCFPYLGGVIELGV 180
            F+ G+GLPGRALA+ + IWLCNAQYA+S VFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 VFSPGEGLPGRALANGQAIWLCNAQYADSKVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVLEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240
           TE V EDPSLLQH+K  LL+ SKP+CS K S    K D+  +P+ A  + EI++ L +EN
Sbjct: 181 TELVSEDPSLLQHIKASLLELSKPVCSDKSSPTPPKADDDGDPICANVNLEIMDTLPLEN 240

Query: 241 LYCST-AVKFDRKSVNGIQRK-NNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRF 300
           LY  T  ++FDR+ +  +    + E  +DS D+ SNG E  H  ED+  L+G  GGAS+ 
Sbjct: 241 LYSPTEGIEFDREGIVELGGNIHEEINMDSPDECSNGXEHNHQTEDSFMLDGINGGASQV 300

Query: 301 QSLQFLDDDFSYGFQDSMNPSDCISEALANQEKVSSSPRLKDANNLPLKELQNPNHTQSG 360
           QS   LDDDFS G  DSMN SDCISEA  NQEK  S+ + +D N   LKELQN NHT+ G
Sbjct: 301 QSWHVLDDDFSNGVPDSMNSSDCISEAFVNQEKAISTLKREDVNQ-HLKELQNSNHTKLG 360

Query: 361 SLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPW-KKVVAETHTPPMQQR 420
           SLD  +D+D+HY+R +  I+GSS +L+ +   H   +RSNF+ W K+ + + + P  QQ 
Sbjct: 361 SLDLGADDDLHYRRILSAIVGSSPRLIENLRFHYTDHRSNFLCWTKEALGDAYRPQAQQT 420

Query: 421 MLKKILFAVPLLSAGSLKGLKDE---EHSILKQGNNDSCTKNATLDKLKENEKFMALKSM 480
           MLKKILF VPL+  G    L+ E   +  + K  + D C  +   D  +ENE F+ALKSM
Sbjct: 421 MLKKILFTVPLMYGGCSFRLQRENCGKEWLRKSESGDICLGHVLSDNRRENENFLALKSM 480

Query: 481 LPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDY 540
           +PS++EI+K SIL DTIKYLK LEARV+ELE+CMDS+ YEER RRKYLDMVEQ SDN D 
Sbjct: 481 VPSISEIDKASILRDTIKYLKELEARVEELESCMDSVDYEERARRKYLDMVEQISDNCDK 540

Query: 541 EKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYI 600
           +KI+   K   NKRKACE DETD +L    P+    LDVKVS++E EVL++M CPYREY+
Sbjct: 541 KKIDNGKKSWINKRKACEFDETDPELNRVVPEDSLPLDVKVSIKEQEVLIEMRCPYREYV 600

Query: 601 LVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKV 648
           L+DVMDA+N+L L+A+SVQSS  NG+ +LTLKSKFRG A A VGMIK AL K+
Sbjct: 601 LLDVMDAINNLHLEAHSVQSSAPNGILTLTLKSKFRGAATAPVGMIKQALWKI 652

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
GL3_ARATH	1.3e-114	40.12	Transcription factor GLABRA 3 OS=Arabidopsis thaliana GN=GL3 PE=1 SV=1
EGL1_ARATH	6.0e-112	39.22	Transcription factor EGL1 OS=Arabidopsis thaliana GN=BHLH2 PE=1 SV=1
BHLHW_PEA	6.1e-72	31.35	Basic helix-loop-helix protein A OS=Pisum sativum GN=BHLH PE=3 SV=1
BH012_ARATH	8.8e-55	48.59	Transcription factor MYC1 OS=Arabidopsis thaliana GN=BHLH12 PE=1 SV=1
ARRS_MAIZE	3.1e-52	48.54	Anthocyanin regulatory R-S protein OS=Zea mays GN=R-S PE=2 SV=1

Match Name	E-value	Identity	Description
I6N8K6_CUCSA	0.0e+00	98.16	GL3 OS=Cucumis sativus PE=2 SV=1
A0A0A0KCZ7_CUCSA	2.4e-293	96.12	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G003480 PE=4 SV=1
A0A075BRK3_9ROSI	2.5e-205	59.26	Basic helix-loop-helix protein OS=Morella rubra GN=bHLH2 PE=2 SV=1
F6I629_VITVI	7.4e-202	56.97	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02560 PE=4 SV=...
A2TEF4_VITVI	2.4e-200	56.82	Myc anthocyanin regulatory protein OS=Vitis vinifera GN=MYCA1 PE=2 SV=3

Match Name	E-value	Identity	Description
AT5G41315.1	7.3e-116	40.12	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G63650.1	3.4e-113	39.22	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G00480.2	5.0e-56	48.59	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G09820.1	4.4e-44	41.96	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G32640.1	2.0e-17	31.79	Basic helix-loop-helix (bHLH) DNA-binding family protein

Match Name	E-value	Identity	Description
gi|793421610|ref|NP_001292635.1|	0.0e+00	98.16	transcription factor EGL1 [Cucumis sativus]
gi|778709077|ref|XP_011656339.1|	0.0e+00	98.60	PREDICTED: transcription factor EGL1 isoform X2 [Cucumis sativus]
gi|659116733|ref|XP_008458230.1|	0.0e+00	95.32	PREDICTED: LOW QUALITY PROTEIN: transcription factor EGL1-like [Cucumis melo]
gi|700190452|gb|KGN45656.1|	3.5e-293	96.12	hypothetical protein Csa_6G003480 [Cucumis sativus]
gi|514482123|gb|AGO58373.1|	3.5e-205	59.26	basic helix-loop-helix protein [Morella rubra]
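
The E-values in the summary tables span several hundred orders of magnitude, and values printed as 0.0e+00 most likely fall below what the report can display rather than being exactly zero. As a rough illustration, the Python sketch below ranks the NCBI nr hits on a -log10(E) scale and uses percent identity to break ties among the 0.0e+00 entries. The helper name `neg_log10_evalue` and the tie-breaking rule are assumptions made for this example and are not part of the database.

```python
import math

# (accession, E-value as printed, percent identity), copied from the
# NCBI nr summary table above.
hits = [
    ("NP_001292635.1", "0.0e+00", 98.16),
    ("XP_011656339.1", "0.0e+00", 98.60),
    ("XP_008458230.1", "0.0e+00", 95.32),
    ("KGN45656.1", "3.5e-293", 96.12),
    ("AGO58373.1", "3.5e-205", 59.26),
]


def neg_log10_evalue(text):
    """Convert a printed E-value to -log10(E); an E-value of 0 maps to infinity."""
    value = float(text)
    return math.inf if value == 0.0 else -math.log10(value)


# Strongest E-value first; identity breaks ties among hits printed as 0.0e+00.
ranked = sorted(hits, key=lambda h: (neg_log10_evalue(h[1]), h[2]), reverse=True)
for accession, evalue, identity in ranked:
    print(f"{accession}\t{evalue}\t{identity:.2f}")
```

Breaking ties by identity is only one possible choice; bit score from the HSP headers would serve equally well.
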
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR011598	bHLH_dom
IPR025610	MYC/MYB_N
Vocabulary: Molecular Function
Term	Definition
GO:0046983	protein dimerization activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0001708 cell fate specification
biological_process GO:0009913 epidermal cell differentiation
biological_process GO:0045165 cell fate commitment
biological_process GO:0009888 tissue development
biological_process GO:0048629 trichome patterning
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046983 protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI06G00470.1	CSPI06G00470.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	GENE3D	G3DSA:4.10.280.10	-	coord: 465..511 score: 4.
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	SMART	SM00353	finulus	coord: 457..503 score: 1.
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	PROFILE	PS50888	BHLH	coord: 448..497 score: 9
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	unknown	SSF47459	HLH, helix-loop-helix DNA-binding domain	coord: 465..519 score: 1.4
IPR025610	Transcription factor MYC/MYB N-terminal	PFAM	PF14215	bHLH-MYC_N	coord: 15..196 score: 7.9
None	No IPR available	PANTHER	PTHR11514	MYC	coord: 243..651 score: 1.6E-265; coord: 15..224 score: 1.6E
None	No IPR available	PANTHER	PTHR11514:SF38	TRANSCRIPTION FACTOR MYC1	coord: 15..224 score: 1.6E-265; coord: 243..651 score: 1.6E
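
The four IPR011598 (bHLH domain) calls in the table above come from different member databases and report slightly different boundaries. The Python sketch below parses the `coord: start..end` fields and reports the region all four calls agree on, together with the widest span any of them covers. It assumes the coordinates are 1-based, inclusive residue positions on the protein. The names `COORD`, `parse_coord`, `core` and `envelope` are hypothetical and introduced only for this example.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

# bHLH-domain (IPR011598) alignments copied from the InterPro table above.
bhlh_calls = {
    "GENE3D G3DSA:4.10.280.10": "coord: 465..511",
    "SMART SM00353": "coord: 457..503",
    "PROFILE PS50888": "coord: 448..497",
    "SSF47459": "coord: 465..519",
}


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' field."""
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(m.group(1)), int(m.group(2))


spans = {name: parse_coord(text) for name, text in bhlh_calls.items()}

# Length of each call, assuming inclusive coordinates.
lengths = {name: end - start + 1 for name, (start, end) in spans.items()}

# Region shared by every call, and the widest region covered by any call.
core = (max(s for s, _ in spans.values()), min(e for _, e in spans.values()))
envelope = (min(s for s, _ in spans.values()), max(e for _, e in spans.values()))

print("lengths:", lengths)
print("core:", core)
print("envelope:", envelope)
```

Under these assumptions, the shared core is a conservative estimate of the bHLH domain, while the envelope is the more permissive one.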

The following gene(s) are paralogous to this gene:

None