Cp4.1LG12g03680 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG12g03680
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myc anthocyanin regulatory protein
Location: Cp4.1LG12 : 2613869 .. 2621177 (+)
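
As a rough illustration of how the Location field can be read, the sketch below splits it into sequence ID, start, end and strand, and derives the span length. It assumes the coordinates are 1-based and inclusive (the common GFF-style convention, which this page does not state). `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Hypothetical helper: parses a location string of the form
# "<seqid> : <start> .. <end> (<strand>)" as shown in the Location field.
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a location string."""
    m = LOCATION_RE.match(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG12 : 2613869 .. 2621177 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 7309 bp on the plus strand.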
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
ATCGTCTCGTCAGTCAAAAAGTAAACCCTAAATTAGAATTTGTCAGCCCCTCGATCAGATTTTAATCATAAACTAATCGATTATAGCCCTCTCTTTTAAACCCAAATTTGTAAAAGGGTTGCATCTGAGAAGATGGTTACCACTTTTTCCAGTAATTACTTTTCTGTTCAGAAAGCTGCTACTCTCTGTCGCACGGCTAAAGCCTAAAAGCCTTTTGGGTTTCGCCATTCTGTGGCGTTTGAACCAAATCTTAAGCTCCTTATGGCTAATGGCTTGGTGGGTTTTCGTTGTTCTTCATCGTTTCTTTTGAGTTTCTCTGTTTTTTCTGGAAAACAATGGCTAATGGAACTGAAATCTGTGATAGCGAACCTGGGTTTCTCCGAAAGCAGCTCGCTGTCGCTGTGAAGAGCATCCAATGGAGCTATGCGATCTTCTGGTCACCGTCGATTAGGCAACATGGGTATTGATTTTTTCTTTTAATCAGTTCTTGATGACATTGTTCTTCATTCTTTGGCTTGTATATCCTTCTGGGGTTTCCTTTTTCATCATATGGGTGTTGAAAATCCAAATGGGTACTGTGATTTTGATTGTCTTATGTTGTGGGGTTTGTTCTTGGCAACAATTTGTGAAAATGAAGTGGTTAATGATCTTAAACTAGGCTTACAGTGATCTGATTTAAGTCCCTTCTTTCTTTTACCTTCAACTTTCCTGGACTTCTGTTTTATATCATTACTGAGCAATGGGATACCTTTAAGGAGCTCAAAATCTGAGTTTCATCTACAATAACATGTAACTTTGAGCTTGGTTGTCCGGAGATATCCTCGGAAATCTGGAGCTCAAGCTTTCGTGTGAGATCCTACATTGGTTGTGGAGGAGAACGAAGCATTCTTTATAAGGGTGTGGAAACCTCTCCCTAACAGATACGTTTTAAAACCTTGAGGAGAAGCCCAGACGGGAAAGCCCAAAGAGGACAATATCTGATAGCAGTGGGGTTGGGTTGTTACAAATGTATGAGAGCTAGACATCGGGCGATATGTCACTGAGGAGGCTGAGCCCGAAGGGAGGTGGACAAGAGGTAGTGTGCTGCAAGGATGCTGGGCCCTGAAGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGTGGATTGGGGGATCCCACATCGATTGGAGAAGGAAATGAATGCTTGTGAGGACGCTGGGCCTTGAAGGCGGTGGATTGTGAGATCCCACCTCGGTTGGGAGGAGAACGAAACATTCTTTATAAAGGTGTGGAAACCTCTCCTTAGTAGACACGTTTTATAACCTTGAGGGGAACCCCGAAAGGGAAAGTCCGAAGAGGACAATATTTGCTAGCGGTGGACTTGGGCTCGAAAAAACTTAAAATCAGCCCTAGATGGCAAAGCCTCTGGGTGATTATTGGGGTTAGGGGCGGGCTATGAGAATAATATTCAGGATTGTTGGGAGGGAGTTTCACGTTGGCTAATTTAGAGGAGGATTATGGGTTTATAAGTAAGGAATACATCTCCATTGGTATGAGGTCTTTTGGGGAAACCAAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTGGACAACATCATACCATTATGGAGATCTGTGATTCCTAACATGGTATCAGAGTCGTGCCCTTAGCTATGTAAATAGAATCCTCAAATGTCGAACAAAGAAGTTGTGAGCCTCGAAGGTGTAGTAAAAAAGTGACTCAAGTGTCAAACAAAGAGTGTACTTTGTTCGATGGCTCCAGAGAAAGGAGTCGAGCCTCAATTAAGGGGAGGCTGTTCGAGGGCTCTATAGGTCTCAGGAGAATCTCTATGGTCTACTTTGTTCGAGGACTCCAGAGAAGGAGTTGAGTCTTG
ATTAAGGGGAGACTGTTCGAGGGTTGCAAAGGCCTCAAGGGAGGCTCTATGGTGTACTTTGTTCGAGGGGAGGATTGTAGTTGCTTACAAATTCACAGTTTGTCTATATAGTTGTTGTACTATAAAGAAACGATTTGGTTTTATACTTTAGGGTACTGGAATGGTGTGATGGCTACTACAATGGAGACATCAAGACGAGGAAGACGGTTCAAGCTGAAGATGTTCATGTCGATATTATGGGCTTACATCGAAGTGAACAATTGAGAGAGCTCTACAAGTCTCTCTTAGACGGTGAAAACGAGCAACGATCGAAAAAGCCTCCCGCCTCTTTGTCTCCTGAAGATCTATCTGATGCTGAATGGTATTACTTGGTTTGCATGTCCTTTTTCTTCAATCAAGGCCAAGGGTATTGTCTCTGATCCATACTTGTGTTTATCTCTTGACTTTTGTTGTTTAACTTGTGATATTACGAGCGCTATCTATCTATCGTGCATAGTTTGCCTGGAAGAGCGTTAGCTGATGATCGAACTATCTGGTTATGCAATGCTCAATATGCAGAGAGTAGCGTATTCTCCCGCTCCTTGCTTGCGAAGGTTCGATTTCTTTTGGTGGGAAAATATCACTCGTTTGATTTTGAATAGTTTGATGCCTATTCAAGTTCCTTATGATCTTATTAGTGAGAAACAAATTGACGTTTTTCTTCTTTCTTTTTCATTGGGGTTCTTATTTTCGACGTCGTTGATCGACTGCTTACAGAGTGCATCGATTCAGGTATGATTATCCGTGTCAAACATCATCTAGTGCTAGTTATGATGATAAAGAAAATCAAAAGTCCTCTAAGAATAGTATGCCTTAAAAAAGGAAAAAGAAACCTCTCGTGATAAAACCATAAAACTTTGTCTATGTCTGCCCCATTGGTTGGAGAGGGCAATAAAGCATTCTTTATAAGGGTGTGGAAACCTCTCTCTAACAGGCGCGTTTTAAAACCGTGAGGTCAACAACGATATGTAACGAGCCAAAGCGGACAATATCTACTACCAGTGGGCTTGTGCTATTACAAATGGAATCAGAGCTAGACATCGGGCGGTGTGCCAGCGAGGACGTTGGGCCCCAAGGAGGGTGGATTGTGAGATCCCACATCGGTTGGGGAGAGGAACAAATCATTCTTTATAAGGGTGTGGAAACCTCTCCCTAACAGACGTGTTTTAAAGCTGTGAGGCCGACGGCGATACGTAAGGGGCCAAAACGGACAATATCTGCTAATGGTGGGCTTGAGATGTTTTAAATGGTATCAGAGCCAGACACCGGGCGGTGTGCCAGCGAGGACGCTGGACCCCAAGGAAGGTGGATTGTGAGATCCCACGTCGGTTGGAGAGGGAACAAAGCATTCTTTATAAGGGTGTAGAAACCTCTCCCTAACAAACGCATTTTAAAACCGTGAGGCCGGCGGTAATACGTAACATGTCAAAACAGACAATATATGCTAACAATGGACTTGAGATGTTACAAATGATATTAGAACCAGACCACCGGGCGGTGTGCTAGCGAGGACGCTGGGCCCCCAAGGAGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACTAAACATTTTTTATAAGGGTGTGGAAACCTCTCCCTAACAGATGTGTTTTAGAACCGTGAGACCGACGGCGACACATAACGGACAATATCTGCTAACGGTGGGCTTGGGCTGTTACAAATGGTATCGAAGCTAGACATTGAGCGGTGTGCCAGCGAGGACGTTGGGCTCCCAAGGAGAGTGGATTGTGAGATCACACATTGGTTGGAGAGGGGAACAAAGCATTCTTTATAAGGGTGTGGAAACCTCTCCCTAATAGATGCGTTTTAAAACCGTGAGACCGACGGCGATGCATAACGGGCCAAAGCAGACAATACGTAACGAGTTCGATTATTGACTACATGAATCAGCTAAAGCTCTGAGAATTGACTGCTGTTTACAATGCTATTTTCAA
GACCTGATCTTAAAATTGTTTTTGTGTTGTAGACTGTGGTGTGCTTTCCTTACCTTGGCGGCGTTATTGAACTAGGTGTAACCGAGCAGGTAAAATTCATAAAGAATCATCTGTTACACAAATTGTTTATTTATGAATCAACATCATATGTTTCGGGACATCATATGTTTCGGTACCATGTTTCGTGTTTCATGCTGTCTTTTTAAGGTTTCGGAGGATCCTAGTCTTCTTCAACATGTCAAAGATTTTTTACTGAAGTTCTCGAAGCCGATATGCTCTAAGAAATCGTCTTCCTCTGCTTATAAAGATGATAATGGTAAAGAACCAATGGTTGCCAAATCTGACAATGAGATTGTTGAAGTTTTGGCTATGGAGAACGTCCACGGTTTGACAGCTGCGAAATTCGACAGGAAGGCAGTAAATGGGATTCAAAGGAAAAACGATGAGTTCGGTATTGATTCTCTTGATCGTTTTTCGAATGGTTGTGAACGATTTCACCAAATGGTCGATCCTTTAAGACTTGAAGGTGTCGAGGGAGGGGCTTCGTGTTTTCGGAGTTTGCAGTTTCTTGATGACGACTTCAGTTACGGTTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCTGAAGCTTTGGTAAATAATCCGGAGAAAGTCTCATCGAAAGGCACGAACGATTTATCTTTGAAAGAGCTTCAAAACTCGAACCGAACTAAATCAGTTTCCTTAGATCCTAGAACTGATGAAGACCTGCACTACAAGAGAACTATCTTCACCATTTTGGGAAGTTCAACTCAATTGGCTGGAAGTCCCCTTCTCCATAGTTTCTCGAGCAGATCGAGTTTCATGCCATGGAAGAAAGGAATGGCCGAGATAAACACGGCCCCGGTGCAACAAAAGATGTTGAAGAAGATTTTGTTCACGGTTCCATTATTATCTGCTGGTTGTTCTCTAAATCGGCTCAATGACGGGGAACGGTCGATCTTGAAACAGGGTAACGACGATTTCTGCACGAAAGATGTCGTGCACGACAAATTAAGAGAAAATGAAAAGTTTATGGCTCTTAAGTCCATGCTACCTTCACTTAATGAGGTACTTCCTGCTTCGATTTGTTACATTTGTGCGGATTTTACATCAAATTTGCAAGTTCTTAAGCATATATATGGTTGCATTTTTGCTTCAATCATATTTTCTTAAAGTGATATTGTAACAGCCCGAGTCCACCGCTAGTAGATGTTGTCCTCTTTGGGCTTTCCTTTTCGGGCTTCCCCTCAAAGTTTTTAAAATGCATCTGTTGGGGAGAAGTTTCCACACCCTTATAAAGAATATACCGACATGGGATCTCACAATCCAACCCCCTTCAGGGCCTAGTGCTAGCACTTGTTCCCTTCTCCAATCAATGTAGGACCCCGCAATCCACCCCCCTTCGGGGCCTAGCGTCCTTGCTGGCACACTGCCTCGTGTCTACCCCTCTTCGGGGCTCAAGCTCCTCACTAGCACATCGCTCAGTATCTGACTCTCATACCATTTGTACAACTCAAGCCCACCACTAGCAGATATTGTCCTCTTTAGGCTTTACCTTTCGAGCTCCCCTCAAGATTTTTAAAATACGTCTATTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTCATTCTCCTTCCCAACCGATGTGGGATCTCAAAATCCACCTCCCTTCGGGGCCCAGCGTCCTTGTTGACACACTACCTCGTGTCCACCCCCCTTCAAGGCTCAGCCTCCTTTGCTAGCACATCGTCCAGTGTCTAGCTCTGATACCATTTGTAACAACTCAAGCCCAACGTTAGTAGATATTGTCCTCTTTGGGCTTTCCCTTTCGGGATTTCCCACGCGTATGCTAGGGAGAGGTTTCCACACCCTTATAAAAAATGTTTCGGTTTTCCCCTCACGAAATTTGCAAGTTCCTAAGCATATACATTGTTGCGTTTTTTGTTCGATCATATCTTCTTGAAGTGATACA
ATCTGAAAATTTTACTTATTTTACGCAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATTTGAAGATGCTCGAAGCGAGAGTACAAGAGTTGGAAACATGCATGGACTCATTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAACAGACATCAGATAACTATGACTATGATAAGATTGAAGGCACCTTAAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACTGACCTTAAGCTGAAGAACAACATTCCCAAAGATGGCCTTAAACTAGATGTGAAAGTCACCATGAACGAGCAAGAGGTTCTCGTCGACATGCACTGTCCTTATCGAGAATATATATTGGTCGACGTCATGGATACTTTAAACGACTTGCAACTCGACGCCCACTCGGTTCAATCTTCTGATCGTAATGGCGTTTTCTCCTTGACCCTCAAGTCTAAGGTTTGTAATTTAGTTTCTGTCATTTGTTTTTCATGCATAGACACCACCGAAAACCCGACTTTTGAACCGTCTACATCCCTAGTAGCAAACCATGGAGAAATGTCGATGAGTCGGAGGGCTTATATCTTTCATAGGCTAAACCATATATTTGCTTTACGTTCCACAAGTTTGATCATAGTCTGCAAATTTATTGAACCATCGACTCGTAATGGAAATGCTGATATATTATCTTTGATGAACGGGATTATGACGATCCTTCGTTTGTTTGACGTTTTAGTCGCCATATCTATTTTAGTTATAGAACTGTGCTGTTTTCGTTTAGAATGACGACTGCGTAGGAGTAGAAGTATGTTTCTGTCTGTAAATCGATACTTGGGTAATTTACGCTGCCTGATTTCGTTTCAGTTTCGAGGGATGGTGGCTGCATCTGTTGGGATGGTCAAACTAGCACTTTTGAAAGTTGCCAACAAGAGTTGAGCCAAGCCGAACCGAACCGAGCCGAGGCGAGCGGAGAAGAAAGCTTAGGATCAATGGTGAAGCATTTTCTGTAAGTCAGTAGAGTAGAAGCAAAGGAGCTGACCTTCTAACATCCTCCCCTTGGGGCTTACGAATTAGGCAAGCTTAGCATCAACTGATTAGATTGCTTAGCTTTACTGTTGTTATTTGCTACCCATAATCAACTTACCATGTTCTAATTATGAGAGTTGTGGTGGTTAATAGCTAGCTAGGTTGCACTTACTTTTTTGCAACCCGTCCCGAGATTTCTACATAGTATCATTTATCTTTATGATTCAATCATAAACTAAATCTCTCTCGAGCCAATGAAAGAGTGAGACC

mRNA sequence

ATCGTCTCGTCAGTCAAAAAGTAAACCCTAAATTAGAATTTGTCAGCCCCTCGATCAGATTTTAATCATAAACTAATCGATTATAGCCCTCTCTTTTAAACCCAAATTTGTAAAAGGGTTGCATCTGAGAAGATGGTTACCACTTTTTCCAGTAATTACTTTTCTGTTCAGAAAGCTGCTACTCTCTGTCGCACGGCTAAAGCCTAAAAGCCTTTTGGGTTTCGCCATTCTGTGGCGTTTGAACCAAATCTTAAGCTCCTTATGGCTAATGGCTTGGTGGGTTTTCGTTGTTCTTCATCGTTTCTTTTGAGTTTCTCTGTTTTTTCTGGAAAACAATGGCTAATGGAACTGAAATCTGTGATAGCGAACCTGGGTTTCTCCGAAAGCAGCTCGCTGTCGCTGTGAAGAGCATCCAATGGAGCTATGCGATCTTCTGGTCACCGTCGATTAGGCAACATGGGGTACTGGAATGGTGTGATGGCTACTACAATGGAGACATCAAGACGAGGAAGACGGTTCAAGCTGAAGATGTTCATGTCGATATTATGGGCTTACATCGAAGTGAACAATTGAGAGAGCTCTACAAGTCTCTCTTAGACGGTGAAAACGAGCAACGATCGAAAAAGCCTCCCGCCTCTTTGTCTCCTGAAGATCTATCTGATGCTGAATGGTATTACTTGGTTTGCATGTCCTTTTTCTTCAATCAAGGCCAAGGTTTGCCTGGAAGAGCGTTAGCTGATGATCGAACTATCTGGTTATGCAATGCTCAATATGCAGAGAGTAGCGTATTCTCCCGCTCCTTGCTTGCGAAGGTTCGATTTCTTTTGACTGTGGTGTGCTTTCCTTACCTTGGCGGCGTTATTGAACTAGGTGTAACCGAGCAGGTTTCGGAGGATCCTAGTCTTCTTCAACATGTCAAAGATTTTTTACTGAAGTTCTCGAAGCCGATATGCTCTAAGAAATCGTCTTCCTCTGCTTATAAAGATGATAATGGTAAAGAACCAATGGTTGCCAAATCTGACAATGAGATTGTTGAAGTTTTGGCTATGGAGAACGTCCACGGTTTGACAGCTGCGAAATTCGACAGGAAGGCAGTAAATGGGATTCAAAGGAAAAACGATGAGTTCGGTATTGATTCTCTTGATCGTTTTTCGAATGGTTGTGAACGATTTCACCAAATGGTCGATCCTTTAAGACTTGAAGGTGTCGAGGGAGGGGCTTCGTGTTTTCGGAGTTTGCAGTTTCTTGATGACGACTTCAGTTACGGTTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCTGAAGCTTTGGTAAATAATCCGGAGAAAGTCTCATCGAAAGGCACGAACGATTTATCTTTGAAAGAGCTTCAAAACTCGAACCGAACTAAATCAGTTTCCTTAGATCCTAGAACTGATGAAGACCTGCACTACAAGAGAACTATCTTCACCATTTTGGGAAGTTCAACTCAATTGGCTGGAAGTCCCCTTCTCCATAGTTTCTCGAGCAGATCGAGTTTCATGCCATGGAAGAAAGGAATGGCCGAGATAAACACGGCCCCGGTGCAACAAAAGATGTTGAAGAAGATTTTGTTCACGGTTCCATTATTATCTGCTGGTTGTTCTCTAAATCGGCTCAATGACGGGGAACGGTCGATCTTGAAACAGGGTAACGACGATTTCTGCACGAAAGATGTCGTGCACGACAAATTAAGAGAAAATGAAAAGTTTATGGCTCTTAAGTCCATGCTACCTTCACTTAATGAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATTTGAAGATGCTCGAAGCGAGAGTACAAGAGTTGGAAACATGCATGGACTCATTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAACAGACATCAGATAACTATGACTATGATAAGATTGAAGGCACCTTAAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACTGAC
CTTAAGCTGAAGAACAACATTCCCAAAGATGGCCTTAAACTAGATGTGAAAGTCACCATGAACGAGCAAGAGGTTCTCGTCGACATGCACTGTCCTTATCGAGAATATATATTGGTCGACGTCATGGATACTTTAAACGACTTGCAACTCGACGCCCACTCGGTTCAATCTTCTGATCGTAATGGCGTTTTCTCCTTGACCCTCAAGTCTAAGTTTCGAGGGATGGTGGCTGCATCTGTTGGGATGGTCAAACTAGCACTTTTGAAAGTTGCCAACAAGAGTTGAGCCAAGCCGAACCGAACCGAGCCGAGGCGAGCGGAGAAGAAAGCTTAGGATCAATGGTGAAGCATTTTCTGTAAGTCAGTAGAGTAGAAGCAAAGGAGCTGACCTTCTAACATCCTCCCCTTGGGGCTTACGAATTAGGCAAGCTTAGCATCAACTGATTAGATTGCTTAGCTTTACTGTTGTTATTTGCTACCCATAATCAACTTACCATGTTCTAATTATGAGAGTTGTGGTGGTTAATAGCTAGCTAGGTTGCACTTACTTTTTTGCAACCCGTCCCGAGATTTCTACATAGTATCATTTATCTTTATGATTCAATCATAAACTAAATCTCTCTCGAGCCAATGAAAGAGTGAGACC

Coding sequence (CDS)

ATGGCTAATGGAACTGAAATCTGTGATAGCGAACCTGGGTTTCTCCGAAAGCAGCTCGCTGTCGCTGTGAAGAGCATCCAATGGAGCTATGCGATCTTCTGGTCACCGTCGATTAGGCAACATGGGGTACTGGAATGGTGTGATGGCTACTACAATGGAGACATCAAGACGAGGAAGACGGTTCAAGCTGAAGATGTTCATGTCGATATTATGGGCTTACATCGAAGTGAACAATTGAGAGAGCTCTACAAGTCTCTCTTAGACGGTGAAAACGAGCAACGATCGAAAAAGCCTCCCGCCTCTTTGTCTCCTGAAGATCTATCTGATGCTGAATGGTATTACTTGGTTTGCATGTCCTTTTTCTTCAATCAAGGCCAAGGTTTGCCTGGAAGAGCGTTAGCTGATGATCGAACTATCTGGTTATGCAATGCTCAATATGCAGAGAGTAGCGTATTCTCCCGCTCCTTGCTTGCGAAGGTTCGATTTCTTTTGACTGTGGTGTGCTTTCCTTACCTTGGCGGCGTTATTGAACTAGGTGTAACCGAGCAGGTTTCGGAGGATCCTAGTCTTCTTCAACATGTCAAAGATTTTTTACTGAAGTTCTCGAAGCCGATATGCTCTAAGAAATCGTCTTCCTCTGCTTATAAAGATGATAATGGTAAAGAACCAATGGTTGCCAAATCTGACAATGAGATTGTTGAAGTTTTGGCTATGGAGAACGTCCACGGTTTGACAGCTGCGAAATTCGACAGGAAGGCAGTAAATGGGATTCAAAGGAAAAACGATGAGTTCGGTATTGATTCTCTTGATCGTTTTTCGAATGGTTGTGAACGATTTCACCAAATGGTCGATCCTTTAAGACTTGAAGGTGTCGAGGGAGGGGCTTCGTGTTTTCGGAGTTTGCAGTTTCTTGATGACGACTTCAGTTACGGTTTTCAAGATTCCATGAATCCTAGTGACTGTATTTCTGAAGCTTTGGTAAATAATCCGGAGAAAGTCTCATCGAAAGGCACGAACGATTTATCTTTGAAAGAGCTTCAAAACTCGAACCGAACTAAATCAGTTTCCTTAGATCCTAGAACTGATGAAGACCTGCACTACAAGAGAACTATCTTCACCATTTTGGGAAGTTCAACTCAATTGGCTGGAAGTCCCCTTCTCCATAGTTTCTCGAGCAGATCGAGTTTCATGCCATGGAAGAAAGGAATGGCCGAGATAAACACGGCCCCGGTGCAACAAAAGATGTTGAAGAAGATTTTGTTCACGGTTCCATTATTATCTGCTGGTTGTTCTCTAAATCGGCTCAATGACGGGGAACGGTCGATCTTGAAACAGGGTAACGACGATTTCTGCACGAAAGATGTCGTGCACGACAAATTAAGAGAAAATGAAAAGTTTATGGCTCTTAAGTCCATGCTACCTTCACTTAATGAGATCAACAAAGTATCGATACTCAACGATACAATCAAATATTTGAAGATGCTCGAAGCGAGAGTACAAGAGTTGGAAACATGCATGGACTCATTATATTATGAAGAAAGATTCAGAAGGAAATATCTTGACATGGTGGAACAGACATCAGATAACTATGACTATGATAAGATTGAAGGCACCTTAAAACCTTCAACGAACAAGAGAAAAGCCTGTGAAATGGATGAAACTGACCTTAAGCTGAAGAACAACATTCCCAAAGATGGCCTTAAACTAGATGTGAAAGTCACCATGAACGAGCAAGAGGTTCTCGTCGACATGCACTGTCCTTATCGAGAATATATATTGGTCGACGTCATGGATACTTTAAACGACTTGCAACTCGACGCCCACTCGGTTCAATCTTCTGATCGTAATGGCGTTTTCTCCTTGACCCTCAAGTCTAAGTTTCGAGGGATGGTGGCTGCATCTGTTGGGATGGTCAAACTAGCACTTTTGAAAGTTGCCAACAAGAGTTGA

Protein sequence

MANGTEICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMENVHGLTAAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFRSLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSSKGTNDLSLKELQNSNRTKSVSLDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKMLKKILFTVPLLSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYDKIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYILVDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVANKS
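
The protein sequence above appears to be a direct translation of the CDS. The sketch below is one way to spot-check that relationship on the first and last few codons. It assumes the standard genetic code (the page does not state which translation table was used), and `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Unknown codons (e.g. containing N) are rendered as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First seven codons and last five codons of the CDS listed above.
print(translate("ATGGCTAATGGAACTGAAATC"))  # start of the CDS
print(translate("GCCAACAAGAGTTGA"))        # end of the CDS, including the stop codon
```

The two fragments translate to `MANGTEI` and `ANKS*`, matching the start and end of the listed protein. Passing the full CDS string to `translate` would extend the same check to the whole sequence.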
BLAST of Cp4.1LG12g03680 vs. Swiss-Prot
Match: GL3_ARATH (Transcription factor GLABRA 3 OS=Arabidopsis thaliana GN=GL3 PE=1 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 1.8e-116
Identity = 267/666 (40.09%), Positives = 372/666 (55.86%), Query Frame = 1

Query: 12  PGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDIM 71
           P  L+K LAV+V++IQWSY IFWS S  Q GVLEW DGYYNGDIKTRKT+QA ++  D +
Sbjct: 11  PENLKKHLAVSVRNIQWSYGIFWSVSASQSGVLEWGDGYYNGDIKTRKTIQASEIKADQL 70

Query: 72  GLHRSEQLRELYKSLLDGENEQRS---------KKPPASLSPEDLSDAEWYYLVCMSFFF 131
           GL RSEQL ELY+SL   E+             +   A+LSPEDL+D EWYYLVCMSF F
Sbjct: 71  GLRRSEQLSELYESLSVAESSSSGVAAGSQVTRRASAAALSPEDLADTEWYYLVCMSFVF 130

Query: 132 NQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTE 191
           N G+G+PGR  A+   IWLCNA  A+S VFSRSLLAK   + TVVCFP+LGGV+E+G TE
Sbjct: 131 NIGEGMPGRTFANGEPIWLCNAHTADSKVFSRSLLAKSAAVKTVVCFPFLGGVVEIGTTE 190

Query: 192 QVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMENVH 251
            ++ED +++Q VK   L+   P  +   + S Y  DN  +P     D     + + E   
Sbjct: 191 HITEDMNVIQCVKTSFLEAPDPYATILPARSDYHIDNVLDPQQILGDEIYAPMFSTE--- 250

Query: 252 GLTAAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMV----DPLRLEGVEGGASCF 311
                                F   S  R +NG ++ H+ V    D    E + GGAS  
Sbjct: 251 --------------------PFPTASPSRTTNGFDQEHEQVADDHDSFMTERITGGASQV 310

Query: 312 RSLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSSKGTNDLSLKEL----QNSNRTKS 371
           +S Q +DD+ S     S+N SDC+S+  V       + G     ++ L    +     K+
Sbjct: 311 QSWQLMDDELSNCVHQSLNSSDCVSQTFVEGAAGRVAYGARKSRVQRLGQIQEQQRNVKT 370

Query: 372 VSLDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEIN-----TA 431
           +S DPR D D+HY+  I TI  ++ QL   P   +   +SSF  WKK  +  +     TA
Sbjct: 371 LSFDPRND-DVHYQSVISTIFKTNHQLILGPQFRNCDKQSSFTRWKKSSSSSSGTATVTA 430

Query: 432 PVQQKMLKKILFTVPLLSAGCSLNRLNDGERSIL--KQGNDDFCTKDVVHDKLRE--NEK 491
           P  Q MLKKI+F VP         R++  E+ +L   +  D+     V+  K RE  NE+
Sbjct: 431 P-SQGMLKKIIFDVP---------RVHQKEKLMLDSPEARDETGNHAVLEKKRREKLNER 490

Query: 492 FMALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEER-----FRRKYL 551
           FM L+ ++PS+N+I+KVSIL+DTI+YL+ LE RVQELE+C +S   E R      R+K  
Sbjct: 491 FMTLRKIIPSINKIDKVSILDDTIEYLQELERRVQELESCRESTDTETRGTMTMKRKKPC 550

Query: 552 DMVEQTSDNYDYDKIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEV 611
           D  E+TS N   ++     K S N     E  +T           GL  ++++     EV
Sbjct: 551 DAGERTSANCANNETGNGKKVSVNNVGEAEPADTGF--------TGLTDNLRIGSFGNEV 610

Query: 612 LVDMHCPYREYILVDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKL 647
           ++++ C +RE +L+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GM+K 
Sbjct: 611 VIELRCAWREGVLLEIMDVISDLHLDSHSVQSSTGDGLLCLTVNCKHKGSKIATPGMIKE 634
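
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes the two percentages as a consistency check. The regular expression is an assumption fitted to the layout shown on this page, not an official BLAST parser, and `parse_hsp_stats` and `percent` are hypothetical helper names.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# The optional "i" also tolerates the "Postives" misspelling seen in some report dumps.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity and positive counts and reported percentages from an HSP line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100.0 * count / total, 2)


stats = parse_hsp_stats(
    "Identity = 267/666 (40.09%), Positives = 372/666 (55.86%), Query Frame = 1"
)
print(percent(stats["identities"], stats["align_len"]))  # recomputed identity %
print(percent(stats["positives"], stats["align_len"]))   # recomputed positives %
```

For the GL3_ARATH hit, the recomputed values (40.09 and 55.86) agree with the percentages printed in the report.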

BLAST of Cp4.1LG12g03680 vs. Swiss-Prot
Match: EGL1_ARATH (Transcription factor EGL1 OS=Arabidopsis thaliana GN=BHLH2 PE=1 SV=1)

HSP 1 Score: 404.8 bits (1039), Expect = 1.7e-111
Identity = 257/651 (39.48%), Positives = 373/651 (57.30%), Query Frame = 1

Query: 12  PGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDIM 71
           P  L+KQLAV+V++IQWSY IFWS S  Q GVLEW DGYYNGDIKTRKT+QA +V +D +
Sbjct: 10  PDNLKKQLAVSVRNIQWSYGIFWSVSASQPGVLEWGDGYYNGDIKTRKTIQAAEVKIDQL 69

Query: 72  GLHRSEQLRELYKSL------LDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQG 131
           GL RSEQLRELY+SL        G ++   +   A+LSPEDL+D EWYYLVCMSF FN G
Sbjct: 70  GLERSEQLRELYESLSLAESSASGSSQVTRRASAAALSPEDLTDTEWYYLVCMSFVFNIG 129

Query: 132 QGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVS 191
           +G+PG AL++   IWLCNA+ A+S VF+RSLLAK   L TVVCFP+LGGV+E+G TE + 
Sbjct: 130 EGIPGGALSNGEPIWLCNAETADSKVFTRSLLAKSASLQTVVCFPFLGGVLEIGTTEHIK 189

Query: 192 EDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMENVHGLT 251
           ED +++Q VK   L+   P  +  S+ S Y     +E     SD++   V   E      
Sbjct: 190 EDMNVIQSVKTLFLE--APPYTTISTRSDY-----QEIFDPLSDDKYTPVFITE------ 249

Query: 252 AAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFRSLQFLD 311
              F   + +G +++ ++      D F N                 +GGAS  +S QF+ 
Sbjct: 250 --AFPTTSTSGFEQEPEDH-----DSFIN-----------------DGGASQVQSWQFVG 309

Query: 312 DDFSYGFQDSMNPSDCISEALVNNPEKVSSKGTNDLSLKELQNSNRTKSVSLDPRTDEDL 371
           ++ S     S+N SDC+S+  V    +++     D     +Q   + +  S     D+D+
Sbjct: 310 EEISNCIHQSLNSSDCVSQTFVGTTGRLAC----DPRKSRIQRLGQIQEQSNHVNMDDDV 369

Query: 372 HYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKMLKKILFTVPL 431
           HY+  I TI  ++ QL   P   +F  RSSF  WK+  +        QKM+KKILF VPL
Sbjct: 370 HYQGVISTIFKTTHQLILGPQFQNFDKRSSFTRWKRSSSVKTLGEKSQKMIKKILFEVPL 429

Query: 432 LSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRE--NEKFMALKSMLPSLNEINKVS 491
           ++           +  +L    ++     +   K RE  NE+FM L+S++PS+++I+KVS
Sbjct: 430 MNK----------KEELLPDTPEETGNHALSEKKRREKLNERFMTLRSIIPSISKIDKVS 489

Query: 492 ILNDTIKYLKMLEARVQELETCMDSLYYEERF----RRKYLDMVEQTSDNYDYDKIEGTL 551
           IL+DTI+YL+ L+ RVQELE+C +S   E R     R+K  D  E+ S N          
Sbjct: 490 ILDDTIEYLQDLQKRVQELESCRESADTETRITMMKRKKPDDEEERASANC--------- 549

Query: 552 KPSTNKRKACEMDETDLKLKNNIPKD----GLKLDVKVTMNEQEVLVDMHCPYREYILVD 611
               +KRK      +D+ +  + P D    GL  +++++    EV++++ C +RE IL++
Sbjct: 550 --MNSKRKG-----SDVNVGEDEPADIGYAGLTDNLRISSLGNEVVIELRCAWREGILLE 593

Query: 612 VMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVA 647
           +MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GM++ AL +VA
Sbjct: 610 IMDVISDLNLDSHSVQSSTGDGLLCLTVNCKHKGTKIATTGMIQEALQRVA 593

BLAST of Cp4.1LG12g03680 vs. Swiss-Prot
Match: BHLHW_PEA (Basic helix-loop-helix protein A OS=Pisum sativum GN=BHLH PE=3 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.1e-70
Identity = 204/649 (31.43%), Positives = 326/649 (50.23%), Query Frame = 1

Query: 15  LRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDIMGLH 74
           L+  L  AV+S+QW+Y++FW    +Q  +L W DGYYNG IKTRKTVQ  +V  +   L 
Sbjct: 13  LQNMLQAAVQSVQWTYSLFWQICPQQL-ILVWGDGYYNGAIKTRKTVQPMEVSAEEASLQ 72

Query: 75  RSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALA 134
           RS+QLRELY+SL  GE    +++P ASLSPEDL+++EW+YL+C+SF F  G GLPG+A A
Sbjct: 73  RSQQLRELYESLSAGETNPPTRRPCASLSPEDLTESEWFYLMCVSFSFPPGVGLPGKAYA 132

Query: 135 DDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVSEDPSLLQHV 194
             + +WL  A   +S  FSR++LAK   + TVVC P L GV+E+G T+++ ED + ++HV
Sbjct: 133 RRQHVWLTGANEVDSKTFSRAILAKSANIQTVVCIPVLDGVVEIGTTDKIQEDLNFIKHV 192

Query: 195 KDFLLKFS----KPICSKKSSSS-AYKDDNGKEPMVAKSDNEIVEVLAMENV-HGLTAAK 254
           + F +       KP  S+ S+S+  Y  D+    M   +D     +   +++        
Sbjct: 193 RSFFIDHHSLPPKPALSEHSTSNPTYSTDHIPAIMYTVADPASTAIPNQDDMDEDEEEDD 252

Query: 255 FDRKAVNGIQRKNDEFGIDSLDRFSNGCERFH----QMVDPLRLEGVEGGASCFRSLQFL 314
            D +  +G + + ++             E       +M D +R+     G++       L
Sbjct: 253 EDDEVESGSEDETNQGHNQHATSIIEAAEPSELMQIEMPDDIRIGSPNDGSNN------L 312

Query: 315 DDDFSY-GFQDSMNPSDCISEALVNNPEKVSSKGTNDLSLKELQNSNRTKSVSLDPRTDE 374
           D DF      +  NPS  I          +     + L   ++Q S+      L+  T E
Sbjct: 313 DSDFHLLAVSNQGNPSRQIDSYTTERWGPIEEPLDDSL---QIQLSSSVLHHPLEDLTQE 372

Query: 375 DLHYKRTIFTILGSSTQLAGSPLLH--SFSSRSSFMPWKKGMAEINTAP---VQQKMLKK 434
           D HY +T+ TIL    Q   SP ++  ++S++SSF  W          P     Q ++K 
Sbjct: 373 DTHYSQTVTTIL--QNQWIDSPSINYINYSTQSSFTTWTNHHFHPPPPPDPATSQWLVKY 432

Query: 435 ILFTVPLL--------------SAGCSLNRLNDGERSILKQG--NDDFCTKDVVHDKLRE 494
           ILFTVP L              +AG +    ND    +  +G   D+     V+ ++ R 
Sbjct: 433 ILFTVPYLHTKNHDETSPQTRDTAGVN---SNDPSARLRGKGTPQDELSANHVLAERRRR 492

Query: 495 ---NEKFMALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKY 554
              NE+F+ L+S++P + +++K SIL DTI+YLK L  ++Q+LET    +  E    +  
Sbjct: 493 EKLNERFIILRSLVPFVTKMDKASILGDTIEYLKQLRRKIQDLETRNRQMESE----KSG 552

Query: 555 LDMVEQTSDNYDYDKIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQE 614
           + ++   ++      +EG       + KA E                +   V+V++ E +
Sbjct: 553 VTVLVGPTEKKKVRIVEGNGTGGGVRAKAVE----------------VVASVQVSIIESD 612

Query: 615 VLVDMHCPYREYILVDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFR 629
            L+++ C  RE +L+DVM  L +L+++   VQSS  NGVF   L++K +
Sbjct: 613 ALLEIECLQREGLLLDVMMMLRELRIEVIGVQSSLNNGVFVAELRAKVK 626

BLAST of Cp4.1LG12g03680 vs. Swiss-Prot
Match: BH012_ARATH (Transcription factor MYC1 OS=Arabidopsis thaliana GN=BHLH12 PE=1 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 3.0e-55
Identity = 120/249 (48.19%), Positives = 160/249 (64.26%), Query Frame = 1

Query: 1   MANGTEICDS----EPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIK 60
           MA+G E        +   LRKQLA+AV+S+QWSYAIFWS S+ Q GVLEW +G YNGD+K
Sbjct: 5   MADGVEAAAGRSKRQNSLLRKQLALAVRSVQWSYAIFWSSSLTQPGVLEWGEGCYNGDMK 64

Query: 61  TRKTVQAEDVHVDIMGLHRSEQLRELYKSLLDG--------------ENEQRSKKPPASL 120
            RK  ++ + H    GL +S++LR+LY S+L+G              +++         L
Sbjct: 65  KRK--KSYESHYKY-GLQKSKELRKLYLSMLEGDSGTTVSTTHDNLNDDDDNCHSTSMML 124

Query: 121 SPEDLSDAEWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRF 180
           SP+DLSD EWYYLV MS+ F+  Q LPGRA A   TIWLCNAQYAE+ +FSRSLLA+   
Sbjct: 125 SPDDLSDEEWYYLVSMSYVFSPSQCLPGRASATGETIWLCNAQYAENKLFSRSLLARSAS 184

Query: 181 LLTVVCFPYLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKE 232
           + TVVCFPYLGGVIELGVTE +SED +LL+++K  L++            SA++D++ ++
Sbjct: 185 IQTVVCFPYLGGVIELGVTELISEDHNLLRNIKSCLMEI-----------SAHQDNDDEK 239

BLAST of Cp4.1LG12g03680 vs. Swiss-Prot
Match: ARLC_MAIZE (Anthocyanin regulatory Lc protein OS=Zea mays GN=LC PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 2.2e-53
Identity = 117/290 (40.34%), Positives = 169/290 (58.28%), Query Frame = 1

Query: 10  SEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVD 69
           +E   +R QLA A +SI WSYA+FWS S  Q GVL W DG+YNG++KTRK   + ++  D
Sbjct: 19  AERQLMRSQLAAAARSINWSYALFWSISDTQPGVLTWTDGFYNGEVKTRKISNSVELTSD 78

Query: 70  IMGLHRSEQLRELYKSLLDGENEQRSK--KPPASLSPEDLSDAEWYYLVCMSFFFNQGQG 129
            + + RS+QLRELY++LL GE ++R+   +P  SLSPEDL D EWYY+V M++ F  GQG
Sbjct: 79  QLVMQRSDQLRELYEALLSGEGDRRAAPARPAGSLSPEDLGDTEWYYVVSMTYAFRPGQG 138

Query: 130 LPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVSED 189
           LPGR+ A D  +WLCNA  A S  F R+LLAK   + +++C P +GGV+ELG T+ V E 
Sbjct: 139 LPGRSFASDEHVWLCNAHLAGSKAFPRALLAKSASIQSILCIPVMGGVLELGTTDTVPEA 198

Query: 190 PSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSD-----NEIVEVLAMENVH 249
           P L+         F +P C   SS S   ++ G+    A  D      E+     M+++ 
Sbjct: 199 PDLVSRA---TAAFWEPQC-PSSSPSGRANETGE---AAADDGTFAFEELDHNNGMDDIE 258

Query: 250 GLTAAKFDRKAVNGIQRKNDEFGID-SLDRFSNGCERFHQMVDPLRLEGV 292
            +TAA    +      R+ +    D SL+  +   E F+ + D + L+ +
Sbjct: 259 AMTAAGGHGQEEELRLREAEALSDDASLEHITKEIEEFYSLCDEMDLQAL 301

BLAST of Cp4.1LG12g03680 vs. TrEMBL
Match: I6N8K6_CUCSA (GL3 OS=Cucumis sativus PE=2 SV=1)

HSP 1 Score: 1090.1 bits (2818), Expect = 0.0e+00
Identity = 555/653 (84.99%), Positives = 595/653 (91.12%), Query Frame = 1

Query: 1   MANGTEICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKT 60
           MANG E CDSEPGFLRKQLAVAVKSIQWSYA+FWSPS RQHGVLEWCDGYYNGDIKTRKT
Sbjct: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYALFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQAEDVHVD MGLHRSEQLRELY+SLL+GE+EQR+KKPPASLSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGV 180
           FFNQGQGLPGRALADDRTIWLCNAQYAES+VFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMEN 240
           TEQVSEDPSLLQHVKDFLLKFS+PICSKK SS+AYKDDNGKEPM AKSDNEIVEVLAMEN
Sbjct: 181 TEQVSEDPSLLQHVKDFLLKFSRPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240

Query: 241 VHGLTAAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFRS 300
           ++  TA KFD K+VNGIQRKN+EFGIDSLD FSNGCE++H M D LRLEG EGGAS F+S
Sbjct: 241 LYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS 300

Query: 301 LQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSS----KGTNDLSLKELQNSNRTKSVS 360
           LQFLDDDFSYGFQDSMNPSDCISEAL +  EKVSS    K  N+L LKE QN N T+S S
Sbjct: 301 LQFLDDDFSYGFQDSMNPSDCISEALADQ-EKVSSSPRLKDANNLPLKEHQNPNHTQSGS 360

Query: 361 LDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKML 420
           LDP +DED+HYKRTIFTILGSSTQL GSPLLH+FS+RS+F+PWKK +AE +T P+QQ+ML
Sbjct: 361 LDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRML 420

Query: 421 KKILFTVPLLSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSMLPSL 480
           KKILF VPLLSAG SL  L D E+SILKQGN+D CTK+   DKL+ENEKFMALKSMLPSL
Sbjct: 421 KKILFAVPLLSAG-SLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSL 480

Query: 481 NEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYDKIE 540
           NEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDY+KIE
Sbjct: 481 NEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIE 540

Query: 541 GTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYILVDV 600
           G+LKPSTNKRKACEMDETDLKLKN+ PK G KLDVKV+M E EVLVDMHCPYREYILVDV
Sbjct: 541 GSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDV 600

Query: 601 MDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVANKS 650
           MD LNDLQLDA+SVQSSD NG+FSLTLKSKFRGM AASVGM+KLALLKV NKS
Sbjct: 601 MDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 651

BLAST of Cp4.1LG12g03680 vs. TrEMBL
Match: A0A0A0KCZ7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G003480 PE=4 SV=1)

HSP 1 Score: 848.6 bits (2191), Expect = 5.1e-243
Identity = 444/543 (81.77%), Positives = 479/543 (88.21%), Query Frame = 1

Query: 111 EWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFP 170
           +W Y +  S    Q  GLPGRALADDRTIWLCNAQYAES+VFSRSLLAK   + TVVCFP
Sbjct: 20  QWSYAIFWSPSSRQ-HGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFP 79

Query: 171 YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDN 230
           YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKK SS+AYKDDNGKEPM AKSDN
Sbjct: 80  YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN 139

Query: 231 EIVEVLAMENVHGLTAAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEG 290
           EIVEVLAMEN++  TA KFD K+VNGIQRKN+EFGIDSLD FSNGCE++H M D LRLEG
Sbjct: 140 EIVEVLAMENLYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG 199

Query: 291 VEGGASCFRSLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSS----KGTNDLSLKEL 350
            EGGAS F+SLQFLDDDFSYGFQDSMNPSDCISEAL N  EKVSS    K  N+L LKE 
Sbjct: 200 AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRLKDANNLPLKEH 259

Query: 351 QNSNRTKSVSLDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEI 410
           QN N T+S SLDP +DED+HYKRTIFTILGSSTQL GSPLLH+FS+RS+F+PWKK +AE 
Sbjct: 260 QNPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAET 319

Query: 411 NTAPVQQKMLKKILFTVPLLSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKF 470
           +T P+QQ+MLKKILF VPLLSAG SL  L D E+SILKQGN+D CTK+   DKL+ENEKF
Sbjct: 320 HTPPMQQRMLKKILFAVPLLSAG-SLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKF 379

Query: 471 MALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQT 530
           MALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQT
Sbjct: 380 MALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQT 439

Query: 531 SDNYDYDKIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHC 590
           SDNYDY+KIEG+LKPSTNKRKACEMDETDLKLKN+ PK G KLDVKV+M E EVLVDMHC
Sbjct: 440 SDNYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHC 499

Query: 591 PYREYILVDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVA 650
           PYREYILVDVMD LNDLQLDA+SVQSSD NG+FSLTLKSKFRGM AASVGM+KLALLKV 
Sbjct: 500 PYREYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVV 559

BLAST of Cp4.1LG12g03680 vs. TrEMBL
Match: A0A075BRK3_9ROSI (Basic helix-loop-helix protein OS=Morella rubra GN=bHLH2 PE=2 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 1.7e-206
Identity = 390/653 (59.72%), Positives = 478/653 (73.20%), Query Frame = 1

Query: 1   MANGTEICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKT 60
           MANGT+  D  P  LRK+LAVAV+SIQWSYAIFWS S  Q GVLEW DGYYNGDIKTRKT
Sbjct: 1   MANGTQTHDGLPENLRKRLAVAVRSIQWSYAIFWSLSTTQQGVLEWGDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQA ++  D +GL RSEQLRELY+SLL+GE +Q++K+P A+LSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAVELKADKIGLQRSEQLRELYQSLLEGEADQQAKRPSAALSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGV 180
            F+ G+GLPGRALA+ + IWLCNAQYA+S VFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 VFSPGEGLPGRALANGQAIWLCNAQYADSKVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMEN 240
           TE VSEDPSLLQH+K  LL+ SKP+CS KSS +  K D+  +P+ A  + EI++ L +EN
Sbjct: 181 TELVSEDPSLLQHIKASLLELSKPVCSDKSSPTPPKADDDGDPICANVNLEIMDTLPLEN 240

Query: 241 VHGLT-AAKFDRKAVNGIQRK-NDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCF 300
           ++  T   +FDR+ +  +    ++E  +DS D  SNG E  HQ  D   L+G+ GGAS  
Sbjct: 241 LYSPTEGIEFDREGIVELGGNIHEEINMDSPDECSNGXEHNHQTEDSFMLDGINGGASQV 300

Query: 301 RSLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSSKGTNDLS--LKELQNSNRTKSVS 360
           +S   LDDDFS G  DSMN SDCISEA VN  + +S+    D++  LKELQNSN TK  S
Sbjct: 301 QSWHVLDDDFSNGVPDSMNSSDCISEAFVNQEKAISTLKREDVNQHLKELQNSNHTKLGS 360

Query: 361 LDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPW-KKGMAEINTAPVQQKM 420
           LD   D+DLHY+R +  I+GSS +L  +   H    RS+F+ W K+ + +      QQ M
Sbjct: 361 LDLGADDDLHYRRILSAIVGSSPRLIENLRFHYTDHRSNFLCWTKEALGDAYRPQAQQTM 420

Query: 421 LKKILFTVPLLSAGCS--LNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSML 480
           LKKILFTVPL+  GCS  L R N G+  + K  + D C   V+ D  RENE F+ALKSM+
Sbjct: 421 LKKILFTVPLMYGGCSFRLQRENCGKEWLRKSESGDICLGHVLSDNRRENENFLALKSMV 480

Query: 481 PSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYD 540
           PS++EI+K SIL DTIKYLK LEARV+ELE+CMDS+ YEER RRKYLDMVEQ SDN D  
Sbjct: 481 PSISEIDKASILRDTIKYLKELEARVEELESCMDSVDYEERARRKYLDMVEQISDNCDKK 540

Query: 541 KIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYIL 600
           KI+   K   NKRKACE DETD +L   +P+D L LDVKV++ EQEVL++M CPYREY+L
Sbjct: 541 KIDNGKKSWINKRKACEFDETDPELNRVVPEDSLPLDVKVSIKEQEVLIEMRCPYREYVL 600

Query: 601 VDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVA 647
           +DVMD +N+L L+AHSVQSS  NG+ +LTLKSKFRG   A VGM+K AL K+A
Sbjct: 601 LDVMDAINNLHLEAHSVQSSAPNGILTLTLKSKFRGAATAPVGMIKQALWKIA 653

BLAST of Cp4.1LG12g03680 vs. TrEMBL
Match: M5XIR9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002645mg PE=4 SV=1)

HSP 1 Score: 705.7 bits (1820), Expect = 5.3e-200
Identity = 373/653 (57.12%), Positives = 471/653 (72.13%), Query Frame = 1

Query: 1   MANGTEICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKT 60
           MANGT+  +  P  LRKQ AVAV+SI+WSYAIFWS S  Q GVLEWC+GYYNGDIKTRKT
Sbjct: 1   MANGTQNHERVPENLRKQFAVAVRSIKWSYAIFWSLSTSQQGVLEWCEGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSF 120
           V+  ++  D MGL R+ QLRELYKSLL+GE E ++K P A+L+PEDLSDAEWYYL+CMSF
Sbjct: 61  VEGVELKTDKMGLERNAQLRELYKSLLEGETEPQAKAPSAALNPEDLSDAEWYYLLCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGV 180
            FN G+GLPGRALA+ +TIWLC+AQYA+S VFSRSLLAK   + TVVCFPYLGGV+ELGV
Sbjct: 121 VFNPGEGLPGRALANGQTIWLCDAQYADSKVFSRSLLAKSASIQTVVCFPYLGGVVELGV 180

Query: 181 TEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMEN 240
           TE V ED SL+QH+K  LL FSKP CS+KSSS+ +K D+  + ++AK D+EIV+ LA+EN
Sbjct: 181 TELVPEDLSLIQHIKASLLDFSKPDCSEKSSSAPHKADDDSDQVLAKVDHEIVDTLALEN 240

Query: 241 VHGLT-AAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFR 300
           ++  +   KFD   +N +    +EF +DS +  SNGCE  HQ  D    EG+  GAS  +
Sbjct: 241 LYSPSEEIKFDPMGINDLHGNYEEFNMDSPEECSNGCEHNHQTEDSFMPEGINDGASQVQ 300

Query: 301 SLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSS---KGTNDLSLKELQNSNRTKSVS 360
           S  F+D+DFS G QDSMN SDCISEA VN     SS   +  N   LKEL+N N TK  S
Sbjct: 301 SWHFMDEDFSIGVQDSMNSSDCISEAFVNKKRAQSSPRHESVNRNHLKELENLNDTKFSS 360

Query: 361 LD-PRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKM 420
           LD    D+ +HY RT+  ILGSST+L  +P       +SSF+ WKKG+ +     V QK+
Sbjct: 361 LDLGPADDHIHYTRTLSNILGSSTRLTENPCSCDGDCKSSFVTWKKGVVDNCRPTVHQKI 420

Query: 421 LKKILFTVPLLSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSMLPS 480
           LKKILFTVPL+    S N + DG   + K  +DD     V+ DKL+ENEK + L+SM+PS
Sbjct: 421 LKKILFTVPLMCGASSQNTIQDG---LSKLQSDDIHKGHVMPDKLKENEKLLVLRSMVPS 480

Query: 481 LNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYDKI 540
           ++E++K S+L+DTIKYLK LEAR +E+E+CMD++  E   RRKYLD  E+TSDNYD  K+
Sbjct: 481 ISEVDKASVLDDTIKYLKELEARAEEMESCMDTV--EAIARRKYLDRAEKTSDNYDKIKM 540

Query: 541 EGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYILVD 600
           +   KP  NKRKAC++DETD  L   +P++ L LDVKV + EQEVL++M CPYREYIL+D
Sbjct: 541 DNVKKPWLNKRKACDIDETDPDLNRLVPRESLPLDVKVILKEQEVLIEMRCPYREYILLD 600

Query: 601 VMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVANK 649
           +MD +N+L LDAHSVQSS  +GV +L+L SKFRG   A VGM+K AL K+A K
Sbjct: 601 IMDAINNLYLDAHSVQSSTLDGVLTLSLTSKFRGAAVAPVGMIKQALWKIAGK 648

BLAST of Cp4.1LG12g03680 vs. TrEMBL
Match: A0A0A7W5E0_PRUAV (BHLH33 OS=Prunus avium GN=bHLH33 PE=2 SV=1)

HSP 1 Score: 703.0 bits (1813), Expect = 3.4e-199
Identity = 373/653 (57.12%), Positives = 470/653 (71.98%), Query Frame = 1

Query: 1   MANGTEICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKT 60
           MANGT+  +  P  LRKQ AVAV+SI+WSYAIFWS S  Q GVLEWC+GYYNGDIKTRKT
Sbjct: 1   MANGTQNHERVPENLRKQFAVAVRSIKWSYAIFWSLSTSQQGVLEWCEGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSF 120
           V+  ++  D MGL R+ QLRELYKSLL+GE E ++K P A+L+PEDLSDAEWYYL+CMSF
Sbjct: 61  VEGVELKTDKMGLERNAQLRELYKSLLEGETEPQAKAPSAALNPEDLSDAEWYYLLCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGV 180
            FN G+GLPGRALA+ +TIWLC+AQYA+S VFSRSLLAK   + TVVCFPYLGGV+ELGV
Sbjct: 121 VFNPGEGLPGRALANGQTIWLCDAQYADSKVFSRSLLAKSASIQTVVCFPYLGGVVELGV 180

Query: 181 TEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMEN 240
           TE V ED SL+QH+K  LL FSKP CS+KSSS+ +K D+  + ++AK D+EIV  LA+EN
Sbjct: 181 TELVPEDLSLIQHIKASLLDFSKPDCSEKSSSAPHKADDDSDQVLAKVDHEIVGTLALEN 240

Query: 241 VHGLT-AAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFR 300
           ++  +   KFD   +N +   ++EF +DS +  SNGCE  HQ  D    EG+  GAS  +
Sbjct: 241 LYSPSEEIKFDPMGINDLHGNHEEFNMDSPEECSNGCEHNHQTEDSFMPEGINDGASQVQ 300

Query: 301 SLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSS---KGTNDLSLKELQNSNRTKSVS 360
           S  F+D+DFS G QDSMN SDCISEA VN     SS   +  N   LKELQN N TK  S
Sbjct: 301 SWHFMDEDFSIGVQDSMNSSDCISEAFVNKKGAHSSPRHESVNRNHLKELQNFNDTKFSS 360

Query: 361 LD-PRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKM 420
           LD    D+ +HY RT+  ILGSS +L  +P       +SSF+ WKKG+ +     V QK+
Sbjct: 361 LDLGPADDHIHYTRTLSNILGSSIRLTKNPCSCDGGCKSSFVTWKKGVVDNCRPTVHQKI 420

Query: 421 LKKILFTVPLLSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSMLPS 480
           LKKILFTVPL+    S N + DG   + K  +DD     V+ DKL+ENEK + L+SM+PS
Sbjct: 421 LKKILFTVPLMCGASSQNTIQDG---LSKLRSDDIHKGHVMPDKLKENEKLLVLRSMVPS 480

Query: 481 LNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYDKI 540
           ++E++K S+L+DTIKYLK LEAR +E+E+CMD++  E   RRKYLD  E+TSDNYD  K+
Sbjct: 481 ISEVDKASVLDDTIKYLKELEARAEEMESCMDTV--EAIARRKYLDRAEKTSDNYDKIKM 540

Query: 541 EGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYILVD 600
           +   KP  NKRKAC++DETD  L   +P++ L LDVKV + EQEVL++M CPYREYIL+D
Sbjct: 541 DNVKKPWLNKRKACDIDETDPDLNRLVPRESLPLDVKVILKEQEVLIEMRCPYREYILLD 600

Query: 601 VMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVANK 649
           +MD +N+L LDAHSVQSS  +GV +L+L SKFRG   A VGM+K AL K+A K
Sbjct: 601 IMDAINNLYLDAHSVQSSTLDGVLTLSLTSKFRGAAVAPVGMIKQALWKIAGK 648

BLAST of Cp4.1LG12g03680 vs. TAIR10
Match: AT5G41315.1 (AT5G41315.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 421.4 bits (1082), Expect = 1.0e-117
Identity = 267/666 (40.09%), Positives = 372/666 (55.86%), Query Frame = 1

Query: 12  PGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDIM 71
           P  L+K LAV+V++IQWSY IFWS S  Q GVLEW DGYYNGDIKTRKT+QA ++  D +
Sbjct: 11  PENLKKHLAVSVRNIQWSYGIFWSVSASQSGVLEWGDGYYNGDIKTRKTIQASEIKADQL 70

Query: 72  GLHRSEQLRELYKSLLDGENEQRS---------KKPPASLSPEDLSDAEWYYLVCMSFFF 131
           GL RSEQL ELY+SL   E+             +   A+LSPEDL+D EWYYLVCMSF F
Sbjct: 71  GLRRSEQLSELYESLSVAESSSSGVAAGSQVTRRASAAALSPEDLADTEWYYLVCMSFVF 130

Query: 132 NQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTE 191
           N G+G+PGR  A+   IWLCNA  A+S VFSRSLLAK   + TVVCFP+LGGV+E+G TE
Sbjct: 131 NIGEGMPGRTFANGEPIWLCNAHTADSKVFSRSLLAKSAAVKTVVCFPFLGGVVEIGTTE 190

Query: 192 QVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMENVH 251
            ++ED +++Q VK   L+   P  +   + S Y  DN  +P     D     + + E   
Sbjct: 191 HITEDMNVIQCVKTSFLEAPDPYATILPARSDYHIDNVLDPQQILGDEIYAPMFSTE--- 250

Query: 252 GLTAAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMV----DPLRLEGVEGGASCF 311
                                F   S  R +NG ++ H+ V    D    E + GGAS  
Sbjct: 251 --------------------PFPTASPSRTTNGFDQEHEQVADDHDSFMTERITGGASQV 310

Query: 312 RSLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSSKGTNDLSLKEL----QNSNRTKS 371
           +S Q +DD+ S     S+N SDC+S+  V       + G     ++ L    +     K+
Sbjct: 311 QSWQLMDDELSNCVHQSLNSSDCVSQTFVEGAAGRVAYGARKSRVQRLGQIQEQQRNVKT 370

Query: 372 VSLDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEIN-----TA 431
           +S DPR D D+HY+  I TI  ++ QL   P   +   +SSF  WKK  +  +     TA
Sbjct: 371 LSFDPRND-DVHYQSVISTIFKTNHQLILGPQFRNCDKQSSFTRWKKSSSSSSGTATVTA 430

Query: 432 PVQQKMLKKILFTVPLLSAGCSLNRLNDGERSIL--KQGNDDFCTKDVVHDKLRE--NEK 491
           P  Q MLKKI+F VP         R++  E+ +L   +  D+     V+  K RE  NE+
Sbjct: 431 P-SQGMLKKIIFDVP---------RVHQKEKLMLDSPEARDETGNHAVLEKKRREKLNER 490

Query: 492 FMALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEER-----FRRKYL 551
           FM L+ ++PS+N+I+KVSIL+DTI+YL+ LE RVQELE+C +S   E R      R+K  
Sbjct: 491 FMTLRKIIPSINKIDKVSILDDTIEYLQELERRVQELESCRESTDTETRGTMTMKRKKPC 550

Query: 552 DMVEQTSDNYDYDKIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEV 611
           D  E+TS N   ++     K S N     E  +T           GL  ++++     EV
Sbjct: 551 DAGERTSANCANNETGNGKKVSVNNVGEAEPADTGF--------TGLTDNLRIGSFGNEV 610

Query: 612 LVDMHCPYREYILVDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKL 647
           ++++ C +RE +L+++MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GM+K 
Sbjct: 611 VIELRCAWREGVLLEIMDVISDLHLDSHSVQSSTGDGLLCLTVNCKHKGSKIATPGMIKE 634

BLAST of Cp4.1LG12g03680 vs. TAIR10
Match: AT1G63650.1 (AT1G63650.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 404.8 bits (1039), Expect = 9.8e-113
Identity = 257/651 (39.48%), Positives = 373/651 (57.30%), Query Frame = 1

Query: 12  PGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDIM 71
           P  L+KQLAV+V++IQWSY IFWS S  Q GVLEW DGYYNGDIKTRKT+QA +V +D +
Sbjct: 10  PDNLKKQLAVSVRNIQWSYGIFWSVSASQPGVLEWGDGYYNGDIKTRKTIQAAEVKIDQL 69

Query: 72  GLHRSEQLRELYKSL------LDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQG 131
           GL RSEQLRELY+SL        G ++   +   A+LSPEDL+D EWYYLVCMSF FN G
Sbjct: 70  GLERSEQLRELYESLSLAESSASGSSQVTRRASAAALSPEDLTDTEWYYLVCMSFVFNIG 129

Query: 132 QGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVS 191
           +G+PG AL++   IWLCNA+ A+S VF+RSLLAK   L TVVCFP+LGGV+E+G TE + 
Sbjct: 130 EGIPGGALSNGEPIWLCNAETADSKVFTRSLLAKSASLQTVVCFPFLGGVLEIGTTEHIK 189

Query: 192 EDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMENVHGLT 251
           ED +++Q VK   L+   P  +  S+ S Y     +E     SD++   V   E      
Sbjct: 190 EDMNVIQSVKTLFLE--APPYTTISTRSDY-----QEIFDPLSDDKYTPVFITE------ 249

Query: 252 AAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFRSLQFLD 311
              F   + +G +++ ++      D F N                 +GGAS  +S QF+ 
Sbjct: 250 --AFPTTSTSGFEQEPEDH-----DSFIN-----------------DGGASQVQSWQFVG 309

Query: 312 DDFSYGFQDSMNPSDCISEALVNNPEKVSSKGTNDLSLKELQNSNRTKSVSLDPRTDEDL 371
           ++ S     S+N SDC+S+  V    +++     D     +Q   + +  S     D+D+
Sbjct: 310 EEISNCIHQSLNSSDCVSQTFVGTTGRLAC----DPRKSRIQRLGQIQEQSNHVNMDDDV 369

Query: 372 HYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKMLKKILFTVPL 431
           HY+  I TI  ++ QL   P   +F  RSSF  WK+  +        QKM+KKILF VPL
Sbjct: 370 HYQGVISTIFKTTHQLILGPQFQNFDKRSSFTRWKRSSSVKTLGEKSQKMIKKILFEVPL 429

Query: 432 LSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRE--NEKFMALKSMLPSLNEINKVS 491
           ++           +  +L    ++     +   K RE  NE+FM L+S++PS+++I+KVS
Sbjct: 430 MNK----------KEELLPDTPEETGNHALSEKKRREKLNERFMTLRSIIPSISKIDKVS 489

Query: 492 ILNDTIKYLKMLEARVQELETCMDSLYYEERF----RRKYLDMVEQTSDNYDYDKIEGTL 551
           IL+DTI+YL+ L+ RVQELE+C +S   E R     R+K  D  E+ S N          
Sbjct: 490 ILDDTIEYLQDLQKRVQELESCRESADTETRITMMKRKKPDDEEERASANC--------- 549

Query: 552 KPSTNKRKACEMDETDLKLKNNIPKD----GLKLDVKVTMNEQEVLVDMHCPYREYILVD 611
               +KRK      +D+ +  + P D    GL  +++++    EV++++ C +RE IL++
Sbjct: 550 --MNSKRKG-----SDVNVGEDEPADIGYAGLTDNLRISSLGNEVVIELRCAWREGILLE 593

Query: 612 VMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVA 647
           +MD ++DL LD+HSVQSS  +G+  LT+  K +G   A+ GM++ AL +VA
Sbjct: 610 IMDVISDLNLDSHSVQSSTGDGLLCLTVNCKHKGTKIATTGMIQEALQRVA 593

BLAST of Cp4.1LG12g03680 vs. TAIR10
Match: AT4G00480.2 (AT4G00480.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 218.0 bits (554), Expect = 1.7e-56
Identity = 120/249 (48.19%), Positives = 160/249 (64.26%), Query Frame = 1

Query: 1   MANGTEICDS----EPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIK 60
           MA+G E        +   LRKQLA+AV+S+QWSYAIFWS S+ Q GVLEW +G YNGD+K
Sbjct: 5   MADGVEAAAGRSKRQNSLLRKQLALAVRSVQWSYAIFWSSSLTQPGVLEWGEGCYNGDMK 64

Query: 61  TRKTVQAEDVHVDIMGLHRSEQLRELYKSLLDG--------------ENEQRSKKPPASL 120
            RK  ++ + H    GL +S++LR+LY S+L+G              +++         L
Sbjct: 65  KRK--KSYESHYKY-GLQKSKELRKLYLSMLEGDSGTTVSTTHDNLNDDDDNCHSTSMML 124

Query: 121 SPEDLSDAEWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRF 180
           SP+DLSD EWYYLV MS+ F+  Q LPGRA A   TIWLCNAQYAE+ +FSRSLLA+   
Sbjct: 125 SPDDLSDEEWYYLVSMSYVFSPSQCLPGRASATGETIWLCNAQYAENKLFSRSLLARSAS 184

Query: 181 LLTVVCFPYLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKE 232
           + TVVCFPYLGGVIELGVTE +SED +LL+++K  L++            SA++D++ ++
Sbjct: 185 IQTVVCFPYLGGVIELGVTELISEDHNLLRNIKSCLMEI-----------SAHQDNDDEK 239

BLAST of Cp4.1LG12g03680 vs. TAIR10
Match: AT4G09820.1 (AT4G09820.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 175.3 bits (443), Expect = 1.3e-43
Identity = 95/237 (40.08%), Positives = 141/237 (59.49%), Query Frame = 1

Query: 6   EICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAED 65
           ++  +E   L+  L  AV+S+ W+Y++FW    +Q  VL W +GYYNG IKTRKT Q  +
Sbjct: 11  KVAGAEKKELQGLLKTAVQSVDWTYSVFWQFCPQQR-VLVWGNGYYNGAIKTRKTTQPAE 70

Query: 66  VHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQG 125
           V  +   L RS+QLRELY++LL GE+   ++   A LSPEDL++ EW+YL+C+SF F   
Sbjct: 71  VTAEEAALERSQQLRELYETLLAGESTSEARACTA-LSPEDLTETEWFYLMCVSFSFPPP 130

Query: 126 QGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVS 185
            G+PG+A A  + +WL  A   +S  FSR++LAK   + TVVC P L GV+ELG T++V 
Sbjct: 131 SGMPGKAYARRKHVWLSGANEVDSKTFSRAILAKSAKIQTVVCIPMLDGVVELGTTKKVR 190

Query: 186 EDPSLLQHVKDFLLKF----SKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAM 239
           ED   ++  K F         KP  S+ S+   +++        A+ + E+ E + M
Sbjct: 191 EDVEFVELTKSFFYDHCKTNPKPALSEHSTYEVHEE--------AEDEEEVEEEMTM 237

BLAST of Cp4.1LG12g03680 vs. TAIR10
Match: AT1G32640.1 (AT1G32640.1 Basic helix-loop-helix (bHLH) DNA-binding family protein)

HSP 1 Score: 94.0 bits (232), Expect = 3.7e-19
Identity = 56/173 (32.37%), Positives = 82/173 (47.40%), Query Frame = 1

Query: 28  WSYAIFWSPSIRQHG--VLEWCDGYYNGD---IKTRKTVQAEDVHVDIMGLHRSEQLREL 87
           W+YAIFW PS    G  VL W DGYY G+      R+   +          +R + LREL
Sbjct: 83  WTYAIFWQPSYDFSGASVLGWGDGYYKGEEDKANPRRRSSSPPFSTPADQEYRKKVLREL 142

Query: 88  YKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPGRALADDRTIWLC 147
             SL+ G        P      E+++D EW++LV M+  F  G GL G+A A    +W+ 
Sbjct: 143 -NSLISG-----GVAPSDDAVDEEVTDTEWFFLVSMTQSFACGAGLAGKAFATGNAVWVS 202

Query: 148 NAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVSEDPSLLQHVK 196
            +     S   R+    V  + T+ C P   GV+E+G TE + +   L+  V+
Sbjct: 203 GSDQLSGSGCERAKQGGVFGMHTIACIPSANGVVEVGSTEPIRQSSDLINKVR 249

BLAST of Cp4.1LG12g03680 vs. NCBI nr
Match: gi|793421610|ref|NP_001292635.1| (transcription factor EGL1 [Cucumis sativus])

HSP 1 Score: 1090.1 bits (2818), Expect = 0.0e+00
Identity = 555/653 (84.99%), Positives = 595/653 (91.12%), Query Frame = 1

Query: 1   MANGTEICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKT 60
           MANG E CDSEPGFLRKQLAVAVKSIQWSYA+FWSPS RQHGVLEWCDGYYNGDIKTRKT
Sbjct: 1   MANGLENCDSEPGFLRKQLAVAVKSIQWSYALFWSPSSRQHGVLEWCDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQAEDVHVD MGLHRSEQLRELY+SLL+GE+EQR+KKPPASLSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAEDVHVDNMGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGV 180
           FFNQGQGLPGRALADDRTIWLCNAQYAES+VFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMEN 240
           TEQVSEDPSLLQHVKDFLLKFS+PICSKK SS+AYKDDNGKEPM AKSDNEIVEVLAMEN
Sbjct: 181 TEQVSEDPSLLQHVKDFLLKFSRPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMEN 240

Query: 241 VHGLTAAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFRS 300
           ++  TA KFD K+VNGIQRKN+EFGIDSLD FSNGCE++H M D LRLEG EGGAS F+S
Sbjct: 241 LYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQS 300

Query: 301 LQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSS----KGTNDLSLKELQNSNRTKSVS 360
           LQFLDDDFSYGFQDSMNPSDCISEAL +  EKVSS    K  N+L LKE QN N T+S S
Sbjct: 301 LQFLDDDFSYGFQDSMNPSDCISEALADQ-EKVSSSPRLKDANNLPLKEHQNPNHTQSGS 360

Query: 361 LDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKML 420
           LDP +DED+HYKRTIFTILGSSTQL GSPLLH+FS+RS+F+PWKK +AE +T P+QQ+ML
Sbjct: 361 LDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRML 420

Query: 421 KKILFTVPLLSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSMLPSL 480
           KKILF VPLLSAG SL  L D E+SILKQGN+D CTK+   DKL+ENEKFMALKSMLPSL
Sbjct: 421 KKILFAVPLLSAG-SLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSL 480

Query: 481 NEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYDKIE 540
           NEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDY+KIE
Sbjct: 481 NEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIE 540

Query: 541 GTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYILVDV 600
           G+LKPSTNKRKACEMDETDLKLKN+ PK G KLDVKV+M E EVLVDMHCPYREYILVDV
Sbjct: 541 GSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDV 600

Query: 601 MDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVANKS 650
           MD LNDLQLDA+SVQSSD NG+FSLTLKSKFRGM AASVGM+KLALLKV NKS
Sbjct: 601 MDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 651

BLAST of Cp4.1LG12g03680 vs. NCBI nr
Match: gi|778709077|ref|XP_011656339.1| (PREDICTED: transcription factor EGL1 isoform X2 [Cucumis sativus])

HSP 1 Score: 1077.8 bits (2786), Expect = 0.0e+00
Identity = 550/643 (85.54%), Positives = 587/643 (91.29%), Query Frame = 1

Query: 11  EPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDI 70
           EPGFLRKQLAVAVKSIQWSYAIFWSPS RQHGVLEWCDGYYNGDIKTRKTVQAEDVHVD 
Sbjct: 4   EPGFLRKQLAVAVKSIQWSYAIFWSPSSRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN 63

Query: 71  MGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 130
           MGLHRSEQLRELY+SLL+GE+EQR+KKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG
Sbjct: 64  MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 123

Query: 131 RALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVSEDPSL 190
           RALADDRTIWLCNAQYAES+VFSRSLLAK   + TVVCFPYLGGVIELGVTEQVSEDPSL
Sbjct: 124 RALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFPYLGGVIELGVTEQVSEDPSL 183

Query: 191 LQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMENVHGLTAAKFD 250
           LQHVKDFLLKFSKPICSKK SS+AYKDDNGKEPM AKSDNEIVEVLAMEN++  TA KFD
Sbjct: 184 LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEVLAMENLYCSTAVKFD 243

Query: 251 RKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFRSLQFLDDDFSY 310
            K+VNGIQRKN+EFGIDSLD FSNGCE++H M D LRLEG EGGAS F+SLQFLDDDFSY
Sbjct: 244 GKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEGAEGGASRFQSLQFLDDDFSY 303

Query: 311 GFQDSMNPSDCISEALVNNPEKVSS----KGTNDLSLKELQNSNRTKSVSLDPRTDEDLH 370
           GFQDSMNPSDCISEAL N  EKVSS    K  N+L LKE QN N T+S SLDP +DED+H
Sbjct: 304 GFQDSMNPSDCISEALANQ-EKVSSSPRLKDANNLPLKEHQNPNHTQSGSLDPSSDEDMH 363

Query: 371 YKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKMLKKILFTVPLL 430
           YKRTIFTILGSSTQL GSPLLH+FS+RS+F+PWKK +AE +T P+QQ+MLKKILF VPLL
Sbjct: 364 YKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAETHTPPMQQRMLKKILFAVPLL 423

Query: 431 SAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSMLPSLNEINKVSILN 490
           SAG SL  L D E+SILKQGN+D CTK+   DKL+ENEKFMALKSMLPSLNEINKVSILN
Sbjct: 424 SAG-SLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKFMALKSMLPSLNEINKVSILN 483

Query: 491 DTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYDKIEGTLKPSTNKR 550
           DTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDY+KIEG+LKPSTNKR
Sbjct: 484 DTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKR 543

Query: 551 KACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYILVDVMDTLNDLQLD 610
           KACEMDETDLKLKN+ PK G KLDVKV+M E EVLVDMHCPYREYILVDVMD LNDLQLD
Sbjct: 544 KACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHCPYREYILVDVMDALNDLQLD 603

Query: 611 AHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVANKS 650
           A+SVQSSD NG+FSLTLKSKFRGM AASVGM+KLALLKV NKS
Sbjct: 604 AYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVVNKS 644

BLAST of Cp4.1LG12g03680 vs. NCBI nr
Match: gi|659116733|ref|XP_008458230.1| (PREDICTED: LOW QUALITY PROTEIN: transcription factor EGL1-like [Cucumis melo])

HSP 1 Score: 1073.9 bits (2776), Expect = 1.0e-310
Identity = 548/643 (85.23%), Positives = 587/643 (91.29%), Query Frame = 1

Query: 11  EPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDI 70
           EPGFLRKQLAVAVKSIQWSYAIFWSPS RQHGVLEWCDGYYNGDIKTRKTVQAEDVHVD 
Sbjct: 4   EPGFLRKQLAVAVKSIQWSYAIFWSPSTRQHGVLEWCDGYYNGDIKTRKTVQAEDVHVDN 63

Query: 71  MGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 130
           MGLHRSEQLRELY+SLL+GE+EQR+KKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG
Sbjct: 64  MGLHRSEQLRELYRSLLEGESEQRTKKPPASLSPEDLSDAEWYYLVCMSFFFNQGQGLPG 123

Query: 131 RALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGVTEQVSEDPSL 190
           RALADDRTIWLCNAQYAESSVFSRSLLAK   + TVVCFPYLGGVIELGVTEQV+EDP L
Sbjct: 124 RALADDRTIWLCNAQYAESSVFSRSLLAKSASIQTVVCFPYLGGVIELGVTEQVAEDPCL 183

Query: 191 LQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMENVHGLTAAKFD 250
           LQHVKDFLLKFSKPICSKK SS+AYKDDNGKEPM AKSDNEIVE LAMEN++  TA KFD
Sbjct: 184 LQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDNEIVEFLAMENLYCSTAVKFD 243

Query: 251 RKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCFRSLQFLDDDFSY 310
            K+VNGIQR N+EFGIDSLD FSNGCE++HQM D LRLEGVEGGAS F+SLQFLDDDFSY
Sbjct: 244 GKSVNGIQRXNNEFGIDSLDDFSNGCEQYHQMEDSLRLEGVEGGASRFQSLQFLDDDFSY 303

Query: 311 GFQDSMNPSDCISEALVNNPEKVSS----KGTNDLSLKELQNSNRTKSVSLDPRTDEDLH 370
           GFQDSMNPSDCISEAL N  +KVSS    K  N+L LKELQN N+T+S SLDP +DED+H
Sbjct: 304 GFQDSMNPSDCISEALANQ-DKVSSSPRLKDANNLPLKELQNPNQTQSGSLDPSSDEDMH 363

Query: 371 YKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEINTAPVQQKMLKKILFTVPLL 430
           YKRTIFTILGSSTQL GSPLLH+FS+RS+F PWKK MAE +T P+QQ+MLKKILF VPLL
Sbjct: 364 YKRTIFTILGSSTQLVGSPLLHNFSNRSNFTPWKKVMAETHTPPMQQRMLKKILFAVPLL 423

Query: 431 SAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSMLPSLNEINKVSILN 490
           SAG SL  L D ERSILKQGN++ CTK+   DKLRENEKFMALKSMLPSLNEINKVSILN
Sbjct: 424 SAG-SLKGLKDVERSILKQGNNNSCTKNATLDKLRENEKFMALKSMLPSLNEINKVSILN 483

Query: 491 DTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYDKIEGTLKPSTNKR 550
           DTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDY+KIEG+LKPSTNKR
Sbjct: 484 DTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYEKIEGSLKPSTNKR 543

Query: 551 KACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYILVDVMDTLNDLQLD 610
           KACEMDETDLKLK++ PK G KLDVKV+M E EVL+DMHCPYREYILVDV+D LNDLQLD
Sbjct: 544 KACEMDETDLKLKHDFPKVGHKLDVKVSMEEHEVLIDMHCPYREYILVDVVDALNDLQLD 603

Query: 611 AHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVANKS 650
           A+SVQSSD NG FSLTLKSKFRG+ AASVGM+KLALLKVANKS
Sbjct: 604 AYSVQSSDHNGFFSLTLKSKFRGIAAASVGMIKLALLKVANKS 644

BLAST of Cp4.1LG12g03680 vs. NCBI nr
Match: gi|700190452|gb|KGN45656.1| (hypothetical protein Csa_6G003480 [Cucumis sativus])

HSP 1 Score: 848.6 bits (2191), Expect = 7.3e-243
Identity = 444/543 (81.77%), Positives = 479/543 (88.21%), Query Frame = 1

Query: 111 EWYYLVCMSFFFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFP 170
           +W Y +  S    Q  GLPGRALADDRTIWLCNAQYAES+VFSRSLLAK   + TVVCFP
Sbjct: 20  QWSYAIFWSPSSRQ-HGLPGRALADDRTIWLCNAQYAESTVFSRSLLAKSASIQTVVCFP 79

Query: 171 YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDN 230
           YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKK SS+AYKDDNGKEPM AKSDN
Sbjct: 80  YLGGVIELGVTEQVSEDPSLLQHVKDFLLKFSKPICSKKPSSAAYKDDNGKEPMTAKSDN 139

Query: 231 EIVEVLAMENVHGLTAAKFDRKAVNGIQRKNDEFGIDSLDRFSNGCERFHQMVDPLRLEG 290
           EIVEVLAMEN++  TA KFD K+VNGIQRKN+EFGIDSLD FSNGCE++H M D LRLEG
Sbjct: 140 EIVEVLAMENLYCSTAVKFDGKSVNGIQRKNNEFGIDSLDDFSNGCEQYHPMEDTLRLEG 199

Query: 291 VEGGASCFRSLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSS----KGTNDLSLKEL 350
            EGGAS F+SLQFLDDDFSYGFQDSMNPSDCISEAL N  EKVSS    K  N+L LKE 
Sbjct: 200 AEGGASRFQSLQFLDDDFSYGFQDSMNPSDCISEALANQ-EKVSSSPRLKDANNLPLKEH 259

Query: 351 QNSNRTKSVSLDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPWKKGMAEI 410
           QN N T+S SLDP +DED+HYKRTIFTILGSSTQL GSPLLH+FS+RS+F+PWKK +AE 
Sbjct: 260 QNPNHTQSGSLDPSSDEDMHYKRTIFTILGSSTQLVGSPLLHNFSNRSNFIPWKKVVAET 319

Query: 411 NTAPVQQKMLKKILFTVPLLSAGCSLNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKF 470
           +T P+QQ+MLKKILF VPLLSAG SL  L D E+SILKQGN+D CTK+   DKL+ENEKF
Sbjct: 320 HTPPMQQRMLKKILFAVPLLSAG-SLKGLKDEEQSILKQGNNDSCTKNATLDKLKENEKF 379

Query: 471 MALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQT 530
           MALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQT
Sbjct: 380 MALKSMLPSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQT 439

Query: 531 SDNYDYDKIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHC 590
           SDNYDY+KIEG+LKPSTNKRKACEMDETDLKLKN+ PK G KLDVKV+M E EVLVDMHC
Sbjct: 440 SDNYDYEKIEGSLKPSTNKRKACEMDETDLKLKNDFPKVGRKLDVKVSMEEHEVLVDMHC 499

Query: 591 PYREYILVDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVA 650
           PYREYILVDVMD LNDLQLDA+SVQSSD NG+FSLTLKSKFRGM AASVGM+KLALLKV 
Sbjct: 500 PYREYILVDVMDALNDLQLDAYSVQSSDHNGLFSLTLKSKFRGMAAASVGMIKLALLKVV 559

BLAST of Cp4.1LG12g03680 vs. NCBI nr
Match: gi|514482123|gb|AGO58373.1| (basic helix-loop-helix protein [Morella rubra])

HSP 1 Score: 727.2 bits (1876), Expect = 2.4e-206
Identity = 390/653 (59.72%), Positives = 478/653 (73.20%), Query Frame = 1

Query: 1   MANGTEICDSEPGFLRKQLAVAVKSIQWSYAIFWSPSIRQHGVLEWCDGYYNGDIKTRKT 60
           MANGT+  D  P  LRK+LAVAV+SIQWSYAIFWS S  Q GVLEW DGYYNGDIKTRKT
Sbjct: 1   MANGTQTHDGLPENLRKRLAVAVRSIQWSYAIFWSLSTTQQGVLEWGDGYYNGDIKTRKT 60

Query: 61  VQAEDVHVDIMGLHRSEQLRELYKSLLDGENEQRSKKPPASLSPEDLSDAEWYYLVCMSF 120
           VQA ++  D +GL RSEQLRELY+SLL+GE +Q++K+P A+LSPEDLSDAEWYYLVCMSF
Sbjct: 61  VQAVELKADKIGLQRSEQLRELYQSLLEGEADQQAKRPSAALSPEDLSDAEWYYLVCMSF 120

Query: 121 FFNQGQGLPGRALADDRTIWLCNAQYAESSVFSRSLLAKVRFLLTVVCFPYLGGVIELGV 180
            F+ G+GLPGRALA+ + IWLCNAQYA+S VFSRSLLAK   + TVVCFPYLGGVIELGV
Sbjct: 121 VFSPGEGLPGRALANGQAIWLCNAQYADSKVFSRSLLAKSASIQTVVCFPYLGGVIELGV 180

Query: 181 TEQVSEDPSLLQHVKDFLLKFSKPICSKKSSSSAYKDDNGKEPMVAKSDNEIVEVLAMEN 240
           TE VSEDPSLLQH+K  LL+ SKP+CS KSS +  K D+  +P+ A  + EI++ L +EN
Sbjct: 181 TELVSEDPSLLQHIKASLLELSKPVCSDKSSPTPPKADDDGDPICANVNLEIMDTLPLEN 240

Query: 241 VHGLT-AAKFDRKAVNGIQRK-NDEFGIDSLDRFSNGCERFHQMVDPLRLEGVEGGASCF 300
           ++  T   +FDR+ +  +    ++E  +DS D  SNG E  HQ  D   L+G+ GGAS  
Sbjct: 241 LYSPTEGIEFDREGIVELGGNIHEEINMDSPDECSNGXEHNHQTEDSFMLDGINGGASQV 300

Query: 301 RSLQFLDDDFSYGFQDSMNPSDCISEALVNNPEKVSSKGTNDLS--LKELQNSNRTKSVS 360
           +S   LDDDFS G  DSMN SDCISEA VN  + +S+    D++  LKELQNSN TK  S
Sbjct: 301 QSWHVLDDDFSNGVPDSMNSSDCISEAFVNQEKAISTLKREDVNQHLKELQNSNHTKLGS 360

Query: 361 LDPRTDEDLHYKRTIFTILGSSTQLAGSPLLHSFSSRSSFMPW-KKGMAEINTAPVQQKM 420
           LD   D+DLHY+R +  I+GSS +L  +   H    RS+F+ W K+ + +      QQ M
Sbjct: 361 LDLGADDDLHYRRILSAIVGSSPRLIENLRFHYTDHRSNFLCWTKEALGDAYRPQAQQTM 420

Query: 421 LKKILFTVPLLSAGCS--LNRLNDGERSILKQGNDDFCTKDVVHDKLRENEKFMALKSML 480
           LKKILFTVPL+  GCS  L R N G+  + K  + D C   V+ D  RENE F+ALKSM+
Sbjct: 421 LKKILFTVPLMYGGCSFRLQRENCGKEWLRKSESGDICLGHVLSDNRRENENFLALKSMV 480

Query: 481 PSLNEINKVSILNDTIKYLKMLEARVQELETCMDSLYYEERFRRKYLDMVEQTSDNYDYD 540
           PS++EI+K SIL DTIKYLK LEARV+ELE+CMDS+ YEER RRKYLDMVEQ SDN D  
Sbjct: 481 PSISEIDKASILRDTIKYLKELEARVEELESCMDSVDYEERARRKYLDMVEQISDNCDKK 540

Query: 541 KIEGTLKPSTNKRKACEMDETDLKLKNNIPKDGLKLDVKVTMNEQEVLVDMHCPYREYIL 600
           KI+   K   NKRKACE DETD +L   +P+D L LDVKV++ EQEVL++M CPYREY+L
Sbjct: 541 KIDNGKKSWINKRKACEFDETDPELNRVVPEDSLPLDVKVSIKEQEVLIEMRCPYREYVL 600

Query: 601 VDVMDTLNDLQLDAHSVQSSDRNGVFSLTLKSKFRGMVAASVGMVKLALLKVA 647
           +DVMD +N+L L+AHSVQSS  NG+ +LTLKSKFRG   A VGM+K AL K+A
Sbjct: 601 LDVMDAINNLHLEAHSVQSSAPNGILTLTLKSKFRGAATAPVGMIKQALWKIA 653
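
The percentages in each HSP header follow directly from the raw counts (identical or positive-scoring positions divided by the alignment length). As a minimal sketch, assuming the header format printed on this page, the values can be recomputed in Python. The helper name `parse_hsp_stats` is hypothetical and not part of any BLAST tool; the pattern accepts both the "Postives" and "Positives" spellings.

```python
import re

# Hypothetical helper (not part of BLAST): parses an HSP statistics line
# in the format printed on this page and recomputes the percentages.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return raw counts plus recomputed and reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": 100.0 * int(ident) / int(ident_len),
        "positives_pct": 100.0 * int(pos) / int(pos_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

# Statistics line of the Morella rubra hit shown above.
stats = parse_hsp_stats(
    "Identity = 390/653 (59.72%), Positives = 478/653 (73.20%), Query Frame = 1"
)
print("%.2f %.2f" % (stats["identity_pct"], stats["positives_pct"]))
```

Comparing the recomputed values with the reported ones is a quick sanity check when HSP lines are copied out of the page by hand.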

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GL3_ARATH    1.8e-116  40.09  Transcription factor GLABRA 3 OS=Arabidopsis thaliana GN=GL3 PE=1 SV=1
EGL1_ARATH   1.7e-111  39.48  Transcription factor EGL1 OS=Arabidopsis thaliana GN=BHLH2 PE=1 SV=1
BHLHW_PEA    1.1e-70   31.43  Basic helix-loop-helix protein A OS=Pisum sativum GN=BHLH PE=3 SV=1
BH012_ARATH  3.0e-55   48.19  Transcription factor MYC1 OS=Arabidopsis thaliana GN=BHLH12 PE=1 SV=1
ARLC_MAIZE   2.2e-53   40.34  Anthocyanin regulatory Lc protein OS=Zea mays GN=LC PE=2 SV=1

Match Name        E-value   Identity (%)  Description
I6N8K6_CUCSA      0.0e+00   84.99         GL3 OS=Cucumis sativus PE=2 SV=1
A0A0A0KCZ7_CUCSA  5.1e-243  81.77         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G003480 PE=4 SV=1
A0A075BRK3_9ROSI  1.7e-206  59.72         Basic helix-loop-helix protein OS=Morella rubra GN=bHLH2 PE=2 SV=1
M5XIR9_PRUPE      5.3e-200  57.12         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002645mg PE=4 SV=1
A0A0A7W5E0_PRUAV  3.4e-199  57.12         BHLH33 OS=Prunus avium GN=bHLH33 PE=2 SV=1

Match Name   E-value   Identity (%)  Description
AT5G41315.1  1.0e-117  40.09         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G63650.1  9.8e-113  39.48         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G00480.2  1.7e-56   48.19         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G09820.1  1.3e-43   40.08         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G32640.1  3.7e-19   32.37         Basic helix-loop-helix (bHLH) DNA-binding family protein

Match Name                        E-value   Identity (%)  Description
gi|793421610|ref|NP_001292635.1|  0.0e+00   84.99         transcription factor EGL1 [Cucumis sativus]
gi|778709077|ref|XP_011656339.1|  0.0e+00   85.54         PREDICTED: transcription factor EGL1 isoform X2 [Cucumis sativus]
gi|659116733|ref|XP_008458230.1|  1.0e-310  85.23         PREDICTED: LOW QUALITY PROTEIN: transcription factor EGL1-like [Cucumis melo]
gi|700190452|gb|KGN45656.1|       7.3e-243  81.77         hypothetical protein Csa_6G003480 [Cucumis sativus]
gi|514482123|gb|AGO58373.1|       2.4e-206  59.72         basic helix-loop-helix protein [Morella rubra]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity

Vocabulary: INTERPRO
Term       Definition
IPR025610  MYC/MYB_N
IPR011598  bHLH_dom

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045165 cell fate commitment
biological_process GO:0009888 tissue development
biological_process GO:0001708 cell fate specification
biological_process GO:0009913 epidermal cell differentiation
biological_process GO:0048629 trichome patterning
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046983 protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g03680.1  Cp4.1LG12g03680.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                 Source   Source Term        Source Description                       Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10  -                                        coord: 463..509, score: 4.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  SMART    SM00353            finulus                                  coord: 455..501, score: 2.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain  coord: 463..517, score: 1.3
IPR025610  Transcription factor MYC/MYB N-terminal         PFAM     PF14215            bHLH-MYC_N                               coord: 15..196, score: 6.2
None       No IPR available                                PANTHER  PTHR11514          MYC                                      coord: 15..224, score: 6.8E-259; coord: 247..649, score: 6.8E
None       No IPR available                                PANTHER  PTHR11514:SF38     TRANSCRIPTION FACTOR MYC1                coord: 247..649, score: 6.8E-259; coord: 15..224, score: 6.8E
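
The `coord: start..end` entries in the Alignment column can be turned into domain lengths with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based, inclusive positions on the protein (the usual InterProScan convention, not stated on this page); the function names `parse_coords` and `span_length` are hypothetical.

```python
import re

def parse_coords(text):
    """Return (start, end) tuples for every 'coord: a..b' found in text."""
    return [(int(a), int(b))
            for a, b in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)]

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Alignment strings copied from the InterPro table above.
hits = {
    "PF14215 (bHLH-MYC_N)": "coord: 15..196",
    "SM00353": "coord: 455..501",
    "PTHR11514 (MYC)": "coord: 15..224 coord: 247..649",
}

for name, text in hits.items():
    for start, end in parse_coords(text):
        print(name, start, end, span_length(start, end))
```

Under that assumption, the N-terminal MYC/MYB domain spans 182 residues and the SMART bHLH hit spans 47 residues.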