ClCG01G021030 (gene) Watermelon (Charleston Gray)

Name: ClCG01G021030
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Myb family transcription factor-related protein
Location: CG_Chr01 : 35013406 .. 35019457 (-)
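
The location string above can be split into sequence ID, start, end and strand, and the span length derived from it. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr01 : 35013406 .. 35019457 (-)")
print(seqid, strand, span_length(start, end))
```

The minus strand means the gene sequence shown on this page would be the reverse complement of the chromosome sequence over that interval.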
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CCTCTCTTTCTCTCTCTCACGCACTTCTTTCTTCTTCTTCTTCTTCTTTTTCTTCTTCTTCACACAACAAACAAACCCAATGAACTCTCAATTTCTCTGTAAATCATTCTTACCCCATCACCAGATTTCATCTTCTTCCTTCTTTCTCTAATCAATTCAATCCATTCTCTCTTCTGTTTTTTCATTCCCACTTCCACCATTCCTTTCTTCTCCCTCTCCCCATTTCACAACAAGGGTTATGTTTCCCAGACTCGTTAATCCCGACGGAGATATCCAGATCCATGGCCCCCGTGGTTCTGTTGCTTCCGACCTAACCCACAGTCATCGAGGAGACCCTTGTCTCGTTTTGACTTCAGATCCCAAACCTCGCCTCCGCTGGACTGCCGATCTTCACGAGCGATTCGTCGATGCCGTCACTCAGCTCGGTGGTGCTGGCAGTCAGTGCTTCTATCTCCATCTACTTAAATTTCTGCGAGTTTTTTTATTTTTGAGCTTGGTGTTCTATTTTTCTTGTTGTTCATATATTTGCTGCCATAGACGCGTTACTGGTATCCGACAAGCTGTTCACTCAACTAGCTTGAATTAGACTTCTCTTCTTTTTCTAAGAAATAGCGGCGATGGTCTGTTTAGTACCTATTTTTGGGACGATTTACAGTAGGATTTCGATTTTCATTTTAAGTACTCTTGAATTAGGAGAAATGTTTGATGCGTTGAAATGAATTGCGTTCCTTGAGGGTCGAGCACTGGAACGCCTACATTTTAGTTCCGCTATTCACTTATGCTTACTATGTGAATGGGAAGTTTTGATCCTGTTGAAAGATTCTTATCGATGACAATCATTAAGGATTTAGCCTGGTAAACAAATAAGTATGATTAGGTACCTTATGGCAACGAGAAAACATGGGTTTAGACTTTTGAGAAATGGGATGGGAGGCCAAGCTCAATTCCTCTAGAATAGTTAAATTCGGCTGATTCTTAATCAAGGTTCAAAATTTCACGACTTTGTCAAAGCCATGGAAGTTCAGTTGGTCCAAGTTTAAATGGGGATTTTGCCCTTGAGTGCTTTGAAGAATGTTCATAAATGATTGTTGCCTGGGAAGGGTATGCTTCTAATTGGCTCACAGATTGAGTAGATTTAAGTAGCGGTTATTTGATCATGCTGTTGTTACATTTGGAGTAGTCAGGAAGTCAAATTTAGTCTCTCTTGCTTTTATATATTATAGGGGTGTCACTTTGTTTTTTTTATTTTTAATGATTGGAATCCCTGCAATTTTTGTTTTGGCTTCCATGGTTCCTTCGACCTGCTTATTGTATGACTTTTGTGTATAATCATTCATTTTTGTTTGTTAAAAAGGACACATTTGTCAACACATGTATAAACCGTTGCTTGCTAGTTGTCATTTCTTCCTTCATTTCTTTAAATTATTCTCATAATCTTTTCTTTTCTTTGTTTACCATTTTCCTGTTTACTACTGAGTAGAAAATTCCAATCGTATCAGTTGTTTTATCTAACCTTAATTGTGATGTTAAATACCAGTCTTTGCATCAACTATCTCTGGGCACGGGAAATTCATAGGAGTAAGTTTTTGGGGAACAATTCATTGAGTGGATTCAGTTGTTTTGTCTATGTTTTCTAGTTGAATTGCTTGGGTTAAATATCTGATAGTTAGCTCAAAAGGCACTCTGGCAGAGAAAACTTTCAAGGGCAAAAACACGGAAAAAGGACAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAGTTCCAGGAAACTATATTATCTTCCTTCCAAATCTTGCAGGATCTCTATTTCTAAAAATTTAAATTGTATCTCAGCTAAGAATTCCATATAGGCTTTTTAACCCAATTCCGAGTTTATAATTAGGATGTCAATGCATCTTTACATTTTCTATCCAACTAGATGTCTACTTCATTTGACATGATGACTCTATTAAACATCATAAACCATTAAAATACAATTCTAAAAGATCTGAAGTTAGACTGAAAAGGTCTATAAACAATTTTGATTTATTCCTTAGTAGTAGTACTGTAGTAGTATGCCTGATCATTGCCTGATAGAAAAGTTAATGGAACCAGACAAGATTTGAATGTTAATGTGTAGACTGTATACAACTTTGAGGTTTTTCAAAATTCATAGTTTGAAAACCTTCGCAAGTTTGCTTGATGGTTTAGGTGGTGAGTTACATATCAAGATTCCTTCATGGTCATATCTCAATTTTCAGGAGCGATAATTAATGCCATTTTCCCAATCACAGAAGTTATCCCGTTTTATCATAATTTATGGGTTCTTTTAGCCTTTGTCCCATAATGAAGTATCAAGTCACTCAGCAAGAAATTCCATCATCAGTTTGAAGATAGTTTTTCTTTTCCCCATGTAGATAATCGAAGGTGAAAAGATTTAAGATCTAACTGAGTTTGTGAAAATAGTCCATCTGTATGCTGGTGAGGGTACAAGAAAGAATCCAAAAGGATAAATAGTTTTCACAATTCAAAGAAATTATATGGATTAAATTGGAACGGTGTTTCCATCTTTACATAATTTATTTATGCGTTGAAGTTACACTTAAACATTCAACTCCTCCAGGCAAAAGATTGGATTGTAAGTACAATATAACCATAAGGGAACTCATGAAAATTTTAACATTATGAAACTAAGGTCCAACCAATCAGTTGCTTATAATCTGAAATGGTTTTAGTTTTTTCTTAATTTTTTTTTATCTAATTCAGCCAGTCAGTAAATATTGTTACACTGTTTAATATCAGATCAGAATCAATGTGAATCTTCTCGTAATATGGTTTGTGCTGTGTGGTAGACATTCATGCCATATGTCATTGTTAAAGAGTTTTACGTAAGGACTTCTTTAAAACCAGAACCTTTTTATTTATTTATTTTAATTAATTAATTAATTAAAAAAACAAATTTTTCCACTTTTTATTTTGTAGTTTTTATTTCTGTTGGCGTAAGAAGTAGACTTTTTCATTATCAAAACACTCAAAAGAACAACTTAGAGGCCAAGGAGACAAGGTGTCCCCTTGCTAGGAGAAAGAGGTTAGACAAAAACACCTCCGATCGAGATGGATAATAAGAGCAGAGTAATTACAAGAGGAATTGTGTGATGCACTCCAATTAAGGTCTTAAATTGTACAATCTGACAAAAATCAATTGGACTTCTATGCTTTTCTTCGAAGATCCTGTTATTTCTTTTCTTCTGGAGAGTGGTTGACATGTTTGACCCCAGAACTTGGATTGTAATCATTAAGTCAAAGGTTCATTTGCTGCCAGCATAGCTCGGCAGTTATGGTGGCTATTCCTTTATTTGAAGTCAAAGGTTCAAATCTCCATCCCCACATTTGTTATACTTAAAAGAATAAGGAAAAAAAAAAAAAAAAGATCAGT
CAGTTTGTTTGCAGCATAGAGAATTTCCTTTGCATTCTTTGATTGATAGGTCTTCTTACCATCTTTCTTTTGATTTGTTGGTGGGGACATGGGTGGGGAGGATATCGGTCATATTTGAATTCTAATCCATGATAAAGGTTTGACATTTGAACACTTGGTGCAGAAGCTACTCCAAAAGCAATAATGCGAACAATGAATGTGAAAGGACTGACTCTCTTCCATCTGAAGAGCCACCTCCAGGTAATCATAATGACATGAAAATGCATCTTTTTATGAAAGAATTCTGATGTTGGAATTCATGGTGAATTTTTATTAATGTATTGCTTGTTTCTTCATTACAGAAATACAGATTAGGTAAGCAATCAGGGAAGGATATGGGTGAGGCATCTAAAGATGGTAAGCTATACATCTCTATTTTTGTCCTTGCATATCATGTGATGCCTCCATGAGTCTGCTGATATTCAACTACATAACTAGTGTTGGGTCATGGAGTTCTTATCCAAAAGAGCTTGCTCTCCTTTCACCACCTGTATTTGCACTAGCTACTAATTTCTGAAAGTTTTAACCTGTCTGGTCCAAATATTTAAATCTTTATTTATCGCTTGGTGCAATGAACATCAGGTGCATATCTTTTAGAAAGTCCAAGTACGAATAATTTCTCTCCTGACTTGCCAATTTCTGAAATGGCCGAGTAAGATAAGAGTTTTTTTTCCCTCCTTTAAGATACTTACGATAATGCCTTGAATGTTGTTGTTATAAGATTTTAAGTATGTTCAAACTGTTTATAACTTAGACGTCTGTTATAGTGGTTATGAAGTCAAGGAGGCATTAAGAGCGCAAATGGAAGTTCAAAGTAAATTACATCTGCAAGTTGAGGTAAAATTTTCATCCTTTAACATTAAAATGGAATTGATAATGAACCTGGACAGACTCCATGAAGATCCAACCTTGTTAATGAATACCAGGGTGGGTACTAAAACATAATTTGAATGGAGAAAGGCATATAAGTTTCTAGTTTTTAGATTGCTTACTTTCTTAGTTGATTGTCCTCGTGGTTGAGTACATTTGGAATTACTGATTTAAAACTAGCAATGGAGATCTTTGTACCTGCTGATAGGATAGGAACTTCCAAGTGGTTAATACTGAGACTTAGGGTCCATTCTTCAGCTAGCTTACCTTACCATCGTTGAGCAGGCAGAGAAGCACCTACGAATTCGTCAGGATGCCGAGCGAAGATATTTGGCCATGCTTGAGAGAGCTTGTAAAATGCTTGCAGATCAATTCATTGGAGGCGCAGTTTCGGACTCAGACAGCAAGAAGTCCCAAGGACAGGATCGTAAGAGCCCAAGAGGTTCCTCCATTGACCCACTTGGTTTCTATGCTTCCCAATCGCAAGAGATGGAAAGAGTGAATGGCACGGAAGAAGTGCAGGCTAATCTCCCTTGCCAAAGGGCTGATTGTTCAACCGAAAGCTGCCTAACCTCCAACGAGAGTCCCGGAGGATTGGCTATGGAAAAATCTCCTGTTGCAAGCAAGAAAAACATGGTTAACTTGGATTCAGCAACTGCATCTTTGATTTGGGGTGAAGCTAAGGAAAGAATACAAGATGCTAACATCATCCAAGTTAACCATCACGGCGTATCTGGATGCGACATGTGGGGATGAAGTTCTGCAGACAAAAACTCATCACGATTGATGAAAAGTATGATCTATTTCCAACTTTGTAAAAAGCTCATTATTGTCCGTTCACAGAATCTAGTATGAACTGCAGCAAGAACATTCCCCCTAGCTTGTAAACCCGAGCAGTTTAGTTCATTGACAACTCGTCCAAATGCAAGTAGTTAGGCATTCTTTTTGCTGTTCTGGACAATTCTATAATTGTCTGTTCTGTTATGATTTCTGTCTGACTTTTGGCAGATTTGAGTCTATTATTAGGGTGTAATCAGTAATGTTATGTTGGAAACACATCCTCTTTCATTTGTTATTAGTGAAATGGACCT
TGTCTTCCAATGATGATGGATAAATGAATGAATATTGTGTCATTTTCTCTTA

mRNA sequence

CCTCTCTTTCTCTCTCTCACGCACTTCTTTCTTCTTCTTCTTCTTCTTTTTCTTCTTCTTCACACAACAAACAAACCCAATGAACTCTCAATTTCTCTGTAAATCATTCTTACCCCATCACCAGATTTCATCTTCTTCCTTCTTTCTCTAATCAATTCAATCCATTCTCTCTTCTGTTTTTTCATTCCCACTTCCACCATTCCTTTCTTCTCCCTCTCCCCATTTCACAACAAGGGTTATGTTTCCCAGACTCGTTAATCCCGACGGAGATATCCAGATCCATGGCCCCCGTGGTTCTGTTGCTTCCGACCTAACCCACAGTCATCGAGGAGACCCTTGTCTCGTTTTGACTTCAGATCCCAAACCTCGCCTCCGCTGGACTGCCGATCTTCACGAGCGATTCGTCGATGCCGTCACTCAGCTCGGTGGTGCTGGCAAAGCTACTCCAAAAGCAATAATGCGAACAATGAATGTGAAAGGACTGACTCTCTTCCATCTGAAGAGCCACCTCCAGAAATACAGATTAGGTAAGCAATCAGGGAAGGATATGGGTGAGGCATCTAAAGATGGTGCATATCTTTTAGAAAGTCCAAGTACGAATAATTTCTCTCCTGACTTGCCAATTTCTGAAATGGCCGATGGTTATGAAGTCAAGGAGGCATTAAGAGCGCAAATGGAAGTTCAAAGTAAATTACATCTGCAAGTTGAGCTAGCTTACCTTACCATCGTTGAGCAGGCAGAGAAGCACCTACGAATTCGTCAGGATGCCGAGCGAAGATATTTGGCCATGCTTGAGAGAGCTTGTAAAATGCTTGCAGATCAATTCATTGGAGGCGCAGTTTCGGACTCAGACAGCAAGAAGTCCCAAGGACAGGATCGTAAGAGCCCAAGAGGTTCCTCCATTGACCCACTTGGTTTCTATGCTTCCCAATCGCAAGAGATGGAAAGAGTGAATGGCACGGAAGAAGTGCAGGCTAATCTCCCTTGCCAAAGGGCTGATTGTTCAACCGAAAGCTGCCTAACCTCCAACGAGAGTCCCGGAGGATTGGCTATGGAAAAATCTCCTGTTGCAAGCAAGAAAAACATGGTTAACTTGGATTCAGCAACTGCATCTTTGATTTGGGGTGAAGCTAAGGAAAGAATACAAGATGCTAACATCATCCAAGTTAACCATCACGGCGTATCTGGATGCGACATGTGGGGATGAAGTTCTGCAGACAAAAACTCATCACGATTGATGAAAAGTATGATCTATTTCCAACTTTGTAAAAAGCTCATTATTGTCCGTTCACAGAATCTAGTATGAACTGCAGCAAGAACATTCCCCCTAGCTTGTAAACCCGAGCAGTTTAGTTCATTGACAACTCGTCCAAATGCAAGTAGTTAGGCATTCTTTTTGCTGTTCTGGACAATTCTATAATTGTCTGTTCTGTTATGATTTCTGTCTGACTTTTGGCAGATTTGAGTCTATTATTAGGGTGTAATCAGTAATGTTATGTTGGAAACACATCCTCTTTCATTTGTTATTAGTGAAATGGACCTTGTCTTCCAATGATGATGGATAAATGAATGAATATTGTGTCATTTTCTCTTA

Coding sequence (CDS)

ATGTTTCCCAGACTCGTTAATCCCGACGGAGATATCCAGATCCATGGCCCCCGTGGTTCTGTTGCTTCCGACCTAACCCACAGTCATCGAGGAGACCCTTGTCTCGTTTTGACTTCAGATCCCAAACCTCGCCTCCGCTGGACTGCCGATCTTCACGAGCGATTCGTCGATGCCGTCACTCAGCTCGGTGGTGCTGGCAAAGCTACTCCAAAAGCAATAATGCGAACAATGAATGTGAAAGGACTGACTCTCTTCCATCTGAAGAGCCACCTCCAGAAATACAGATTAGGTAAGCAATCAGGGAAGGATATGGGTGAGGCATCTAAAGATGGTGCATATCTTTTAGAAAGTCCAAGTACGAATAATTTCTCTCCTGACTTGCCAATTTCTGAAATGGCCGATGGTTATGAAGTCAAGGAGGCATTAAGAGCGCAAATGGAAGTTCAAAGTAAATTACATCTGCAAGTTGAGCTAGCTTACCTTACCATCGTTGAGCAGGCAGAGAAGCACCTACGAATTCGTCAGGATGCCGAGCGAAGATATTTGGCCATGCTTGAGAGAGCTTGTAAAATGCTTGCAGATCAATTCATTGGAGGCGCAGTTTCGGACTCAGACAGCAAGAAGTCCCAAGGACAGGATCGTAAGAGCCCAAGAGGTTCCTCCATTGACCCACTTGGTTTCTATGCTTCCCAATCGCAAGAGATGGAAAGAGTGAATGGCACGGAAGAAGTGCAGGCTAATCTCCCTTGCCAAAGGGCTGATTGTTCAACCGAAAGCTGCCTAACCTCCAACGAGAGTCCCGGAGGATTGGCTATGGAAAAATCTCCTGTTGCAAGCAAGAAAAACATGGTTAACTTGGATTCAGCAACTGCATCTTTGATTTGGGGTGAAGCTAAGGAAAGAATACAAGATGCTAACATCATCCAAGTTAACCATCACGGCGTATCTGGATGCGACATGTGGGGATGA

Protein sequence

MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPSTNNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNGTEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAKERIQDANIIQVNHHGVSGCDMWG
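
The protein sequence follows from the CDS by codon-wise translation. Below is a minimal sketch that assumes the standard genetic code; the helper name `translate` is hypothetical, and only the first and last few codons of the CDS above are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with N etc.
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 16 codons and last 7 codons of the CDS listed above.
print(translate("ATGTTTCCCAGACTCGTTAATCCCGACGGAGATATCCAGATCCATGGC"))
print(translate("GGATGCGACATGTGGGGATGA"))
```

Running the same function over the full CDS should reproduce the protein sequence above, provided the standard code applies to this nuclear gene.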
BLAST of ClCG01G021030 vs. Swiss-Prot
Match: PHL2_ARATH (Protein PHR1-LIKE 2 OS=Arabidopsis thaliana GN=PHL2 PE=1 SV=1)

HSP 1 Score: 188.0 bits (476), Expect = 1.7e-46
Identity = 113/240 (47.08%), Positives = 150/240 (62.50%), Query Frame = 1

Query: 31  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKSH 90
           GD CLVLT+DPKPRLRWT +LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSH
Sbjct: 30  GDACLVLTTDPKPRLRWTTELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSH 89

Query: 91  LQKYRLGKQSGKDMGEASKDGAYLLESPSTNNFSPD---LPISEMADGYEVKEALRAQME 150
           LQK+RLG+Q+GK+  E SKD + + ES  T + S     +   E  +GY+V EALRAQME
Sbjct: 90  LQKFRLGRQAGKESTENSKDASCVGESQDTGSSSTSSMRMAQQEQNEGYQVTEALRAQME 149

Query: 151 VQSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQ---FIGGAVSD 210
           VQ +LH Q+E+         ++ L++R +A+ +YL ++LE+ACK   +Q   F G   + 
Sbjct: 150 VQRRLHDQLEV---------QRRLQLRIEAQGKYLQSILEKACKAFDEQAATFAGLEAAR 209

Query: 211 SDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNGTEEVQANLPCQRADCSTESCLTS 264
            +  +   +   S +G+S+    F A++   M  ++       N      +CS ES LTS
Sbjct: 210 EELSELAIKVSNSSQGTSVP--YFDATKMMMMPSLSELAVAIDNKNNITTNCSVESSLTS 258
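
The identity and positives percentages in each HSP summary are the match counts divided by the alignment length. The following is a minimal sketch of how such a summary line could be parsed and the percentages recomputed; the function names are hypothetical, and the pattern also accepts the "Postives" spelling that appears in some renderings of this page.

```python
import re

def parse_hsp_stats(line):
    """Return (identical, positive, alignment_length) from an HSP summary line."""
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, len_a, pos, len_b = map(int, m.groups())
    if len_a != len_b:
        raise ValueError("identity and positives use different lengths")
    return ident, pos, len_a

def percent(count, length):
    # Rounded to two decimals, matching the precision shown in the report.
    return round(100 * count / length, 2)

line = "Identity = 113/240 (47.08%), Positives = 150/240 (62.50%), Query Frame = 1"
ident, pos, length = parse_hsp_stats(line)
print(percent(ident, length), percent(pos, length))
```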

BLAST of ClCG01G021030 vs. Swiss-Prot
Match: PHL3_ARATH (Protein PHR1-LIKE 3 OS=Arabidopsis thaliana GN=PHL3 PE=1 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 1.3e-43
Identity = 96/173 (55.49%), Positives = 123/173 (71.10%), Query Frame = 1

Query: 32  DPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKSHL 91
           D CLVLT+DPKPRLRWT++LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSHL
Sbjct: 27  DACLVLTTDPKPRLRWTSELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHL 86

Query: 92  QKYRLGKQSGKDMGEASKDGAYLLESPSTNNFSPD---LPISEMADGYEVKEALRAQMEV 151
           QK+RLG+QS K+  + SKD + + ES  T + S     L   E  + Y+V EALRAQMEV
Sbjct: 87  QKFRLGRQSCKESIDNSKDVSCVAESQDTGSSSTSSLRLAAQEQNESYQVTEALRAQMEV 146

Query: 152 QSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGA 201
           Q +LH Q+E+         ++ L++R +A+ +YL ++LE+ACK + +Q +  A
Sbjct: 147 QRRLHEQLEV---------QRRLQLRIEAQGKYLQSILEKACKAIEEQAVAFA 190

BLAST of ClCG01G021030 vs. Swiss-Prot
Match: APL_ARATH (Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=1 SV=2)

HSP 1 Score: 150.6 bits (379), Expect = 2.9e-35
Identity = 84/175 (48.00%), Positives = 113/175 (64.57%), Query Frame = 1

Query: 30  RGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKS 89
           +GD  LVLT+DPKPRLRWT +LHERFVDAV QLGG  KATPK IMR M VKGLTL+HLKS
Sbjct: 22  QGDSGLVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKS 81

Query: 90  HLQKYRLGKQSGKDMGE-ASKDGAYLLESPSTNNFSPDLPISEMADGYEVKEALRAQMEV 149
           HLQK+RLGKQ  K+ G+ ++K+G+         N +        + G   +     QMEV
Sbjct: 82  HLQKFRLGKQPHKEYGDHSTKEGSRASAMDIQRNVA-------SSSGMMSRNMNEMQMEV 141

Query: 150 QSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVS 203
           Q +LH Q+E+         ++HL++R +A+ +Y+ ++LERAC+ LA + +  A +
Sbjct: 142 QRRLHEQLEV---------QRHLQLRIEAQGKYMQSILERACQTLAGENMAAATA 180

BLAST of ClCG01G021030 vs. Swiss-Prot
Match: PHL9_ARATH (Myb-related protein 2 OS=Arabidopsis thaliana GN=MYR2 PE=1 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 1.1e-34
Identity = 102/250 (40.80%), Positives = 143/250 (57.20%), Query Frame = 1

Query: 31  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKSH 90
           GD  L+L++D KPRL+WT DLHERF++AV QLGGA KATPK IM+ M + GLTL+HLKSH
Sbjct: 34  GDSGLILSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTIMKVMGIPGLTLYHLKSH 93

Query: 91  LQKYRLGKQ-SGKDMGEASKDGAYLL---ESPSTNNF-SPDLPISEMAD-GYEVKEALRA 150
           LQKYRL K  +G+     +K G   +   ++P  +   S +L I    +    + EAL+ 
Sbjct: 94  LQKYRLSKNLNGQANNSFNKIGIMTMMEEKTPDADEIQSENLSIGPQPNKNSPIGEALQM 153

Query: 151 QMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGA--- 210
           Q+EVQ +LH Q+E+         ++HL++R +A+ +YL ++LE+A + L  Q +G A   
Sbjct: 154 QIEVQRRLHEQLEV---------QRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGAAGIE 213

Query: 211 -----VSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNGTEEVQANLPCQRADC 266
                +S+  SK S     + P  S ++P       SQ+M         Q N P    DC
Sbjct: 214 AAKVQLSELVSKVS----AEYPNSSFLEPKELQNLCSQQM---------QTNYP---PDC 258

BLAST of ClCG01G021030 vs. Swiss-Prot
Match: PHL8_ARATH (Myb family transcription factor PHL8 OS=Arabidopsis thaliana GN=PHL8 PE=2 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 1.8e-32
Identity = 94/256 (36.72%), Positives = 137/256 (53.52%), Query Frame = 1

Query: 28  SHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHL 87
           +H+    LVL++D KPRL+WT DLH +F++AV QLGG  KATPK +M+ M + GLTL+HL
Sbjct: 20  NHKAKMSLVLSTDAKPRLKWTCDLHHKFIEAVNQLGGPNKATPKGLMKVMEIPGLTLYHL 79

Query: 88  KSHLQKYRLGKQSGKDMGEASKDGAYLLESPSTNNFSPDL---PISE-----MADGYEVK 147
           KSHLQKYRLGK    D  +     A   +   + N S DL    ++E       +G ++ 
Sbjct: 80  KSHLQKYRLGKSMKFDDNKLEVSSASENQEVESKNDSRDLRGCSVTEENSNPAKEGLQIT 139

Query: 148 EALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYLAMLERACKMLADQFIGG 207
           EAL+ QMEVQ KLH Q+E+         ++HL+++ +A+ +YL    ++  M A Q + G
Sbjct: 140 EALQMQMEVQKKLHEQIEV---------QRHLQVKIEAQGKYL----QSVLMKAQQTLAG 199

Query: 208 AVSDSDSKKSQGQD---RKSPRGSSIDPLGFYASQSQEMERVNGTEE------VQANLPC 267
                 S  + G D    +  R +S+   G  ++   E+ +V   EE         N   
Sbjct: 200 Y-----SSSNLGMDFARTELSRLASMVNRGCPSTSFSELTQVEEEEEGFLWYKKPENRGI 257

BLAST of ClCG01G021030 vs. TrEMBL
Match: A0A0A0KL09_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G510300 PE=4 SV=1)

HSP 1 Score: 597.0 bits (1538), Expect = 1.3e-167
Identity = 299/322 (92.86%), Positives = 305/322 (94.72%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           MFPRLVNPDGDIQIHGPRGSVASDLTH+HRGDPCLVLTSDPKPRLRWTADLHERFVDAVT
Sbjct: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST
Sbjct: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
           NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVE         AEKHL+IRQDAERR
Sbjct: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVE---------AEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNG 240
           YLAMLERACKMLADQFI GAVSDSDSKKS+GQDRKSPR +SIDPLGFY +QSQEMERVNG
Sbjct: 181 YLAMLERACKMLADQFIVGAVSDSDSKKSEGQDRKSPRSTSIDPLGFYTTQSQEMERVNG 240

Query: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAKE 300
           TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSP ASKKNMVNL SATASLIW  AKE
Sbjct: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPAASKKNMVNLGSATASLIWSGAKE 300

Query: 301 RIQDANIIQVNHHGVSGCDMWG 323
            IQ+ANIIQVNHHGVSGCDMWG
Sbjct: 301 GIQNANIIQVNHHGVSGCDMWG 313

BLAST of ClCG01G021030 vs. TrEMBL
Match: M5WGF6_PRUPE (Myb family transcription factor APL OS=Prunus persica GN=APL PE=2 SV=1)

HSP 1 Score: 422.5 bits (1085), Expect = 4.5e-115
Identity = 216/317 (68.14%), Positives = 250/317 (78.86%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           MFPRL+    +       G    +    HRGDPCLVLTSDPKPRLRWTADLHERFVDAVT
Sbjct: 1   MFPRLMQAPHE-------GIAGQEDMQGHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGG+ KATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGK+MG+ SKD +YLLESP T
Sbjct: 61  QLGGSSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKEMGDVSKDASYLLESPGT 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
            N SP+LP S++ +GYEVKEALRAQMEVQSKLH+QVE         AEKHL+IRQDAERR
Sbjct: 121 GNSSPNLPTSDLNEGYEVKEALRAQMEVQSKLHVQVE---------AEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNG 240
           Y+AMLERACKMLADQFIGG+V+D+DS K  G   K+ +G S+DPLGFY+ QS ++  V+G
Sbjct: 181 YMAMLERACKMLADQFIGGSVTDTDSHKCHGLGNKNTKGPSLDPLGFYSLQSTDVAAVHG 240

Query: 241 -TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAK 300
             EEV  ++  QRADCSTESCLTS+ESPGGL +E SP   KK M++LDSA ASLIWGEAK
Sbjct: 241 PEEEVPTSIHTQRADCSTESCLTSHESPGGLTLEGSPGGGKKRMLSLDSAAASLIWGEAK 300

Query: 301 ERIQDANIIQVNHHGVS 317
            R Q+ N+  VN HG++
Sbjct: 301 VRTQEINVAAVNPHGIA 301

BLAST of ClCG01G021030 vs. TrEMBL
Match: V7BR79_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_006G134800g PE=4 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 1.4e-113
Identity = 213/313 (68.05%), Positives = 244/313 (77.96%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           M+PRL++P   I       + AS+L+HSH+GDPCLVLT+DPKPRLRWT DLHERFVDAVT
Sbjct: 1   MYPRLIHPHDGIVAQDDMQAAASNLSHSHKGDPCLVLTADPKPRLRWTQDLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGGA KATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKD+GE  KDG+YLLESP T
Sbjct: 61  QLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDLGEGCKDGSYLLESPGT 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
            N SP LP S+  +GYE+KEALRAQMEVQSKLHLQVE         AEKHL+IRQDAERR
Sbjct: 121 ENTSPKLPTSDTNEGYEIKEALRAQMEVQSKLHLQVE---------AEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVN- 240
           Y+AMLERACKMLADQFIG  V D+DS+K Q    K+PRG+ +DPLGFY+  S E+  VN 
Sbjct: 181 YMAMLERACKMLADQFIGATVIDTDSQKFQAIGSKTPRGTLVDPLGFYSLPSAEVAGVNV 240

Query: 241 GTEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAK 300
             EE+  +LP QRADCSTESCLTS+ES GGL +E SP   K+ M+ +DS  A LIW EAK
Sbjct: 241 PDEEIPHSLPPQRADCSTESCLTSHESSGGLTLEGSPGGGKRRMLGMDSMAAPLIWSEAK 300

Query: 301 ERIQDANIIQVNH 313
            R Q  N+ Q +H
Sbjct: 301 MRTQAINVAQGSH 304

BLAST of ClCG01G021030 vs. TrEMBL
Match: I1L075_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_09G017300 PE=4 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 2.4e-113
Identity = 214/314 (68.15%), Positives = 247/314 (78.66%), Query Frame = 1

Query: 1   MFPRLVNP-DGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAV 60
           M+PRL++P DG +     +G  AS+L+H+H+GDPCLVLT+DPKPRLRWT DLHERFVDAV
Sbjct: 1   MYPRLIHPHDGIVTQDELQGGAASNLSHAHKGDPCLVLTADPKPRLRWTQDLHERFVDAV 60

Query: 61  TQLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPS 120
           TQLGGA KATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKD+GE  KDG+YLLESP 
Sbjct: 61  TQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDVGEGCKDGSYLLESPG 120

Query: 121 TNNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAER 180
            +N SP LP  +  +GYE+KEALRAQMEVQSKLHLQVE         AEKHL+IRQDAER
Sbjct: 121 ADNTSPKLPTPDTNEGYEIKEALRAQMEVQSKLHLQVE---------AEKHLQIRQDAER 180

Query: 181 RYLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVN 240
           RY+AMLERACKMLADQFI   V D+DS+K QG   K+PRG+ +DPLGFY+  S E+  VN
Sbjct: 181 RYMAMLERACKMLADQFISATVIDTDSQKFQGIGSKAPRGTLVDPLGFYSLPSTEVAGVN 240

Query: 241 -GTEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEA 300
              EE+  +LP QRADCSTESCLTS+ES GGLA+E SP   K+ M+ +DS  A LIW EA
Sbjct: 241 VPEEEILPSLPPQRADCSTESCLTSHESSGGLALEGSPGEGKRRMLGMDSMAAPLIWSEA 300

Query: 301 KERIQDANIIQVNH 313
           K R Q  N+ Q NH
Sbjct: 301 KMRTQAINVAQGNH 305

BLAST of ClCG01G021030 vs. TrEMBL
Match: A0A0B2S941_GLYSO (Myb family transcription factor APL OS=Glycine soja GN=glysoja_015124 PE=4 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 2.4e-113
Identity = 214/314 (68.15%), Positives = 247/314 (78.66%), Query Frame = 1

Query: 1   MFPRLVNP-DGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAV 60
           M+PRL++P DG +     +G  AS+L+H+H+GDPCLVLT+DPKPRLRWT DLHERFVDAV
Sbjct: 1   MYPRLIHPHDGIVTQDELQGGAASNLSHAHKGDPCLVLTADPKPRLRWTQDLHERFVDAV 60

Query: 61  TQLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPS 120
           TQLGGA KATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKD+GE  KDG+YLLESP 
Sbjct: 61  TQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDVGEGCKDGSYLLESPG 120

Query: 121 TNNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAER 180
            +N SP LP  +  +GYE+KEALRAQMEVQSKLHLQVE         AEKHL+IRQDAER
Sbjct: 121 ADNTSPKLPTPDTNEGYEIKEALRAQMEVQSKLHLQVE---------AEKHLQIRQDAER 180

Query: 181 RYLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVN 240
           RY+AMLERACKMLADQFI   V D+DS+K QG   K+PRG+ +DPLGFY+  S E+  VN
Sbjct: 181 RYMAMLERACKMLADQFISATVIDTDSQKFQGIGSKAPRGTLVDPLGFYSLPSTEVAGVN 240

Query: 241 -GTEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEA 300
              EE+  +LP QRADCSTESCLTS+ES GGLA+E SP   K+ M+ +DS  A LIW EA
Sbjct: 241 VPEEEILPSLPPQRADCSTESCLTSHESSGGLALEGSPGEGKRRMLGMDSMAAPLIWSEA 300

Query: 301 KERIQDANIIQVNH 313
           K R Q  N+ Q NH
Sbjct: 301 KMRTQAINVAQGNH 305

BLAST of ClCG01G021030 vs. TAIR10
Match: AT3G24120.2 (AT3G24120.2 Homeodomain-like superfamily protein)

HSP 1 Score: 190.3 bits (482), Expect = 1.9e-48
Identity = 114/240 (47.50%), Positives = 150/240 (62.50%), Query Frame = 1

Query: 31  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKSH 90
           GD CLVLT+DPKPRLRWT +LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSH
Sbjct: 30  GDACLVLTTDPKPRLRWTTELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSH 89

Query: 91  LQKYRLGKQSGKDMGEASKDGAYLLESPSTNNFSPD---LPISEMADGYEVKEALRAQME 150
           LQK+RLG+Q+GK+  E SKD + + ES  T + S     +   E  +GY+V EALRAQME
Sbjct: 90  LQKFRLGRQAGKESTENSKDASCVGESQDTGSSSTSSMRMAQQEQNEGYQVTEALRAQME 149

Query: 151 VQSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQ---FIGGAVSD 210
           VQ +LH Q+E        Q ++ L++R +A+ +YL ++LE+ACK   +Q   F G   + 
Sbjct: 150 VQRRLHDQLEYG------QVQRRLQLRIEAQGKYLQSILEKACKAFDEQAATFAGLEAAR 209

Query: 211 SDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNGTEEVQANLPCQRADCSTESCLTS 264
            +  +   +   S +G+S+    F A++   M  ++       N      +CS ES LTS
Sbjct: 210 EELSELAIKVSNSSQGTSVP--YFDATKMMMMPSLSELAVAIDNKNNITTNCSVESSLTS 261

BLAST of ClCG01G021030 vs. TAIR10
Match: AT4G13640.2 (AT4G13640.2 Homeodomain-like superfamily protein)

HSP 1 Score: 180.6 bits (457), Expect = 1.5e-45
Identity = 97/173 (56.07%), Positives = 123/173 (71.10%), Query Frame = 1

Query: 32  DPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKSHL 91
           D CLVLT+DPKPRLRWT++LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSHL
Sbjct: 27  DACLVLTTDPKPRLRWTSELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHL 86

Query: 92  QKYRLGKQSGKDMGEASKDGAYLLESPSTNNFSPD---LPISEMADGYEVKEALRAQMEV 151
           QK+RLG+QS K+  + SKD + + ES  T + S     L   E  + Y+V EALRAQMEV
Sbjct: 87  QKFRLGRQSCKESIDNSKDVSCVAESQDTGSSSTSSLRLAAQEQNESYQVTEALRAQMEV 146

Query: 152 QSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGA 201
           Q +LH Q+E        Q ++ L++R +A+ +YL ++LE+ACK + +Q +  A
Sbjct: 147 QRRLHEQLEYT------QVQRRLQLRIEAQGKYLQSILEKACKAIEEQAVAFA 193

BLAST of ClCG01G021030 vs. TAIR10
Match: AT1G79430.2 (AT1G79430.2 Homeodomain-like superfamily protein)

HSP 1 Score: 150.6 bits (379), Expect = 1.7e-36
Identity = 84/175 (48.00%), Positives = 113/175 (64.57%), Query Frame = 1

Query: 30  RGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKS 89
           +GD  LVLT+DPKPRLRWT +LHERFVDAV QLGG  KATPK IMR M VKGLTL+HLKS
Sbjct: 22  QGDSGLVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKS 81

Query: 90  HLQKYRLGKQSGKDMGE-ASKDGAYLLESPSTNNFSPDLPISEMADGYEVKEALRAQMEV 149
           HLQK+RLGKQ  K+ G+ ++K+G+         N +        + G   +     QMEV
Sbjct: 82  HLQKFRLGKQPHKEYGDHSTKEGSRASAMDIQRNVA-------SSSGMMSRNMNEMQMEV 141

Query: 150 QSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVS 203
           Q +LH Q+E+         ++HL++R +A+ +Y+ ++LERAC+ LA + +  A +
Sbjct: 142 QRRLHEQLEV---------QRHLQLRIEAQGKYMQSILERACQTLAGENMAAATA 180

BLAST of ClCG01G021030 vs. TAIR10
Match: AT3G04030.3 (AT3G04030.3 Homeodomain-like superfamily protein)

HSP 1 Score: 148.7 bits (374), Expect = 6.3e-36
Identity = 102/250 (40.80%), Positives = 143/250 (57.20%), Query Frame = 1

Query: 31  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHLKSH 90
           GD  L+L++D KPRL+WT DLHERF++AV QLGGA KATPK IM+ M + GLTL+HLKSH
Sbjct: 34  GDSGLILSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTIMKVMGIPGLTLYHLKSH 93

Query: 91  LQKYRLGKQ-SGKDMGEASKDGAYLL---ESPSTNNF-SPDLPISEMAD-GYEVKEALRA 150
           LQKYRL K  +G+     +K G   +   ++P  +   S +L I    +    + EAL+ 
Sbjct: 94  LQKYRLSKNLNGQANNSFNKIGIMTMMEEKTPDADEIQSENLSIGPQPNKNSPIGEALQM 153

Query: 151 QMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGA--- 210
           Q+EVQ +LH Q+E+         ++HL++R +A+ +YL ++LE+A + L  Q +G A   
Sbjct: 154 QIEVQRRLHEQLEV---------QRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGAAGIE 213

Query: 211 -----VSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNGTEEVQANLPCQRADC 266
                +S+  SK S     + P  S ++P       SQ+M         Q N P    DC
Sbjct: 214 AAKVQLSELVSKVS----AEYPNSSFLEPKELQNLCSQQM---------QTNYP---PDC 258

BLAST of ClCG01G021030 vs. TAIR10
Match: AT1G69580.2 (AT1G69580.2 Homeodomain-like superfamily protein)

HSP 1 Score: 140.6 bits (353), Expect = 1.7e-33
Identity = 93/257 (36.19%), Positives = 135/257 (52.53%), Query Frame = 1

Query: 28  SHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGAGKATPKAIMRTMNVKGLTLFHL 87
           +H+    LVL++D KPRL+WT DLH +F++AV QLGG  KATPK +M+ M + GLTL+HL
Sbjct: 20  NHKAKMSLVLSTDAKPRLKWTCDLHHKFIEAVNQLGGPNKATPKGLMKVMEIPGLTLYHL 79

Query: 88  KSHLQKYRLGKQSGKDMGEASKDGAYLLESPSTNNFSPDLPISEMAD---------GYEV 147
           KSHLQKYRLGK    D  +     A   +   + N S DL    + +         G ++
Sbjct: 80  KSHLQKYRLGKSMKFDDNKLEVSSASENQEVESKNDSRDLRGCSVTEENSNPAKDRGLQI 139

Query: 148 KEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERRYLAMLERACKMLADQFIG 207
            EAL+ QMEVQ KLH Q+E+         ++HL+++ +A+ +YL    ++  M A Q + 
Sbjct: 140 TEALQMQMEVQKKLHEQIEV---------QRHLQVKIEAQGKYL----QSVLMKAQQTLA 199

Query: 208 GAVSDSDSKKSQGQD---RKSPRGSSIDPLGFYASQSQEMERVNGTEE------VQANLP 267
           G      S  + G D    +  R +S+   G  ++   E+ +V   EE         N  
Sbjct: 200 GY-----SSSNLGMDFARTELSRLASMVNRGCPSTSFSELTQVEEEEEGFLWYKKPENRG 258

BLAST of ClCG01G021030 vs. NCBI nr
Match: gi|659080724|ref|XP_008440945.1| (PREDICTED: LOW QUALITY PROTEIN: protein PHR1-LIKE 1-like [Cucumis melo])

HSP 1 Score: 624.0 bits (1608), Expect = 1.4e-175
Identity = 309/322 (95.96%), Positives = 316/322 (98.14%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           MFPRLVNPDGDIQIHGPRGSVASDLTH+HRGDPCLVLTSDPKPRLRWTADLHERFVDAVT
Sbjct: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST
Sbjct: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
           NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIV QAEKHL+IRQDAERR
Sbjct: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVGQAEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNG 240
           YLAMLERACKMLADQFI GAVSDSDSKKS+GQDRKSPR +SIDPLGFY +QSQEMERVNG
Sbjct: 181 YLAMLERACKMLADQFIVGAVSDSDSKKSEGQDRKSPRSTSIDPLGFYTTQSQEMERVNG 240

Query: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAKE 300
           TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIW +AKE
Sbjct: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWSDAKE 300

Query: 301 RIQDANIIQVNHHGVSGCDMWG 323
            IQ+ANIIQVNHHGVSGCDMWG
Sbjct: 301 GIQNANIIQVNHHGVSGCDMWG 322

BLAST of ClCG01G021030 vs. NCBI nr
Match: gi|778720007|ref|XP_011658094.1| (PREDICTED: protein PHR1-LIKE 1-like isoform X1 [Cucumis sativus])

HSP 1 Score: 618.2 bits (1593), Expect = 7.9e-174
Identity = 307/322 (95.34%), Positives = 313/322 (97.20%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           MFPRLVNPDGDIQIHGPRGSVASDLTH+HRGDPCLVLTSDPKPRLRWTADLHERFVDAVT
Sbjct: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST
Sbjct: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
           NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIV QAEKHL+IRQDAERR
Sbjct: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVGQAEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNG 240
           YLAMLERACKMLADQFI GAVSDSDSKKS+GQDRKSPR +SIDPLGFY +QSQEMERVNG
Sbjct: 181 YLAMLERACKMLADQFIVGAVSDSDSKKSEGQDRKSPRSTSIDPLGFYTTQSQEMERVNG 240

Query: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAKE 300
           TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSP ASKKNMVNL SATASLIW  AKE
Sbjct: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPAASKKNMVNLGSATASLIWSGAKE 300

Query: 301 RIQDANIIQVNHHGVSGCDMWG 323
            IQ+ANIIQVNHHGVSGCDMWG
Sbjct: 301 GIQNANIIQVNHHGVSGCDMWG 322

BLAST of ClCG01G021030 vs. NCBI nr
Match: gi|778720011|ref|XP_011658095.1| (PREDICTED: myb family transcription factor APL-like isoform X2 [Cucumis sativus])

HSP 1 Score: 597.0 bits (1538), Expect = 1.9e-167
Identity = 299/322 (92.86%), Positives = 305/322 (94.72%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           MFPRLVNPDGDIQIHGPRGSVASDLTH+HRGDPCLVLTSDPKPRLRWTADLHERFVDAVT
Sbjct: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST
Sbjct: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
           NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVE         AEKHL+IRQDAERR
Sbjct: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVE---------AEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNG 240
           YLAMLERACKMLADQFI GAVSDSDSKKS+GQDRKSPR +SIDPLGFY +QSQEMERVNG
Sbjct: 181 YLAMLERACKMLADQFIVGAVSDSDSKKSEGQDRKSPRSTSIDPLGFYTTQSQEMERVNG 240

Query: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAKE 300
           TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSP ASKKNMVNL SATASLIW  AKE
Sbjct: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPAASKKNMVNLGSATASLIWSGAKE 300

Query: 301 RIQDANIIQVNHHGVSGCDMWG 323
            IQ+ANIIQVNHHGVSGCDMWG
Sbjct: 301 GIQNANIIQVNHHGVSGCDMWG 313

BLAST of ClCG01G021030 vs. NCBI nr
Match: gi|778720014|ref|XP_011658096.1| (PREDICTED: protein PHR1-LIKE 1-like isoform X3 [Cucumis sativus])

HSP 1 Score: 580.9 bits (1496), Expect = 1.4e-162
Identity = 293/322 (90.99%), Positives = 299/322 (92.86%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           MFPRLVNPDGDIQIHGPRGSVASDLTH+HRGDPCLVLTSDPKPRLRWTADLHERFVDAVT
Sbjct: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLG              AYLLESPST
Sbjct: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLG--------------AYLLESPST 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
           NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIV QAEKHL+IRQDAERR
Sbjct: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVGQAEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNG 240
           YLAMLERACKMLADQFI GAVSDSDSKKS+GQDRKSPR +SIDPLGFY +QSQEMERVNG
Sbjct: 181 YLAMLERACKMLADQFIVGAVSDSDSKKSEGQDRKSPRSTSIDPLGFYTTQSQEMERVNG 240

Query: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAKE 300
           TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSP ASKKNMVNL SATASLIW  AKE
Sbjct: 241 TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPAASKKNMVNLGSATASLIWSGAKE 300

Query: 301 RIQDANIIQVNHHGVSGCDMWG 323
            IQ+ANIIQVNHHGVSGCDMWG
Sbjct: 301 GIQNANIIQVNHHGVSGCDMWG 308
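
The unlabelled row between each Query and Sbjct row is the BLAST midline: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those three classes for one alignment segment, using the final segment shown above as input. The function name `midline_counts` is hypothetical, and the code assumes the midline string has already been cut to start at the first alignment column.

```python
def midline_counts(midline: str, width: int) -> dict:
    """Count column classes in a BLAST midline of the given alignment width."""
    # Text dumps often strip trailing spaces, so pad back to the full width.
    padded = midline.ljust(width)
    identities = sum(ch.isalpha() for ch in padded)  # letters: identical residues
    conservative = padded.count("+")                 # "+": conservative substitutions
    return {
        "identities": identities,
        "positives": identities + conservative,      # BLAST counts identities as positives
        "other": width - identities - conservative,  # mismatches and gap columns
    }


# Final segment of the alignment above (query residues 301 onward).
query = "RIQDANIIQVNHHGVSGCDMWG"
midline = " IQ+ANIIQVNHHGVSGCDMWG"
counts = midline_counts(midline, len(query))
print(counts)
```

For this 22-column segment the sketch reports 20 identities, 21 positives and 1 other column. Summing the per-segment counts over a whole HSP should reproduce the totals given in its Identity and Positives fields.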

BLAST of ClCG01G021030 vs. NCBI nr
Match: gi|595863909|ref|XP_007211678.1| (hypothetical protein PRUPE_ppa009148mg [Prunus persica])

HSP 1 Score: 422.5 bits (1085), Expect = 6.4e-115
Identity = 216/317 (68.14%), Positives = 250/317 (78.86%), Query Frame = 1

Query: 1   MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60
           MFPRL+    +       G    +    HRGDPCLVLTSDPKPRLRWTADLHERFVDAVT
Sbjct: 1   MFPRLMQAPHE-------GIAGQEDMQGHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT 60

Query: 61  QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST 120
           QLGG+ KATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGK+MG+ SKD +YLLESP T
Sbjct: 61  QLGGSSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKEMGDVSKDASYLLESPGT 120

Query: 121 NNFSPDLPISEMADGYEVKEALRAQMEVQSKLHLQVELAYLTIVEQAEKHLRIRQDAERR 180
            N SP+LP S++ +GYEVKEALRAQMEVQSKLH+QVE         AEKHL+IRQDAERR
Sbjct: 121 GNSSPNLPTSDLNEGYEVKEALRAQMEVQSKLHVQVE---------AEKHLQIRQDAERR 180

Query: 181 YLAMLERACKMLADQFIGGAVSDSDSKKSQGQDRKSPRGSSIDPLGFYASQSQEMERVNG 240
           Y+AMLERACKMLADQFIGG+V+D+DS K  G   K+ +G S+DPLGFY+ QS ++  V+G
Sbjct: 181 YMAMLERACKMLADQFIGGSVTDTDSHKCHGLGNKNTKGPSLDPLGFYSLQSTDVAAVHG 240

Query: 241 -TEEVQANLPCQRADCSTESCLTSNESPGGLAMEKSPVASKKNMVNLDSATASLIWGEAK 300
             EEV  ++  QRADCSTESCLTS+ESPGGL +E SP   KK M++LDSA ASLIWGEAK
Sbjct: 241 PEEEVPTSIHTQRADCSTESCLTSHESPGGLTLEGSPGGGKKRMLSLDSAAASLIWGEAK 300

Query: 301 ERIQDANIIQVNHHGVS 317
            R Q+ N+  VN HG++
Sbjct: 301 VRTQEINVAAVNPHGIA 301
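
Each HSP summary reports identities and positives as a count over the alignment length, followed by the same ratio as a percentage. A minimal sketch for recomputing the percentages from the counts is given below, which can help catch copy or parsing errors when these lines are extracted by script. The function name `check_hsp_stats` is hypothetical, and the pattern assumes the line layout used in this report; it also tolerates the "Postives" misspelling found in some exports.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)"; the optional "i"
# also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line: str) -> dict:
    """Recompute identity/positives percentages and compare with the reported ones."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    consistent = (
        abs(identity - float(ident_pct)) < 0.01
        and abs(positives - float(pos_pct)) < 0.01
    )
    return {"identity_pct": identity, "positives_pct": positives, "consistent": consistent}


result = check_hsp_stats(
    "Identity = 216/317 (68.14%), Positives = 250/317 (78.86%), Query Frame = 1"
)
print(result)
```

For the Prunus persica hit above, 216/317 and 250/317 recompute to 68.14 and 78.86, in agreement with the reported values.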

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
PHL2_ARATH  1.7e-46  47.08     Protein PHR1-LIKE 2 OS=Arabidopsis thaliana GN=PHL2 PE=1 SV=1
PHL3_ARATH  1.3e-43  55.49     Protein PHR1-LIKE 3 OS=Arabidopsis thaliana GN=PHL3 PE=1 SV=1
APL_ARATH   2.9e-35  48.00     Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=1 SV=2
PHL9_ARATH  1.1e-34  40.80     Myb-related protein 2 OS=Arabidopsis thaliana GN=MYR2 PE=1 SV=1
PHL8_ARATH  1.8e-32  36.72     Myb family transcription factor PHL8 OS=Arabidopsis thaliana GN=PHL8 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0KL09_CUCSA  1.3e-167  92.86     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G510300 PE=4 SV=1
M5WGF6_PRUPE      4.5e-115  68.14     Myb family transcription factor APL OS=Prunus persica GN=APL PE=2 SV=1
V7BR79_PHAVU      1.4e-113  68.05     Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_006G134800g PE=4 SV=1
I1L075_SOYBN      2.4e-113  68.15     Uncharacterized protein OS=Glycine max GN=GLYMA_09G017300 PE=4 SV=1
A0A0B2S941_GLYSO  2.4e-113  68.15     Myb family transcription factor APL OS=Glycine soja GN=glysoja_015124 PE=4 SV=1

Match Name   E-value  Identity  Description
AT3G24120.2  1.9e-48  47.50     Homeodomain-like superfamily protein
AT4G13640.2  1.5e-45  56.07     Homeodomain-like superfamily protein
AT1G79430.2  1.7e-36  48.00     Homeodomain-like superfamily protein
AT3G04030.3  6.3e-36  40.80     Homeodomain-like superfamily protein
AT1G69580.2  1.7e-33  36.19     Homeodomain-like superfamily protein

Match Name                        E-value   Identity  Description
gi|659080724|ref|XP_008440945.1|  1.4e-175  95.96     PREDICTED: LOW QUALITY PROTEIN: protein PHR1-LIKE 1-like [Cucumis melo]
gi|778720007|ref|XP_011658094.1|  7.9e-174  95.34     PREDICTED: protein PHR1-LIKE 1-like isoform X1 [Cucumis sativus]
gi|778720011|ref|XP_011658095.1|  1.9e-167  92.86     PREDICTED: myb family transcription factor APL-like isoform X2 [Cucumis sativus]
gi|778720014|ref|XP_011658096.1|  1.4e-162  90.99     PREDICTED: protein PHR1-LIKE 1-like isoform X3 [Cucumis sativus]
gi|595863909|ref|XP_007211678.1|  6.4e-115  68.14     hypothetical protein PRUPE_ppa009148mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001005  SANT/Myb
IPR006447  Myb_dom_plants
IPR009057  Homeobox-like_sf
IPR025756  Myb_CC_LHEQLE
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0000785 chromatin
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding
molecular_function GO:0003682 chromatin binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G021030.1  ClCG01G021030.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                                             Source    Source Term       Source Description               Alignment
IPR001005  SANT/Myb domain                                             PFAM      PF00249           Myb_DNA-binding                  coord: 44..95; score: 3.
IPR006447  Myb domain, plants                                          TIGRFAMs  TIGR01557         TIGR01557                        coord: 42..97; score: 3.5
IPR009057  Homeodomain-like                                            GENE3D    G3DSA:1.10.10.60                                   coord: 39..97; score: 2.2
IPR009057  Homeodomain-like                                            unknown   SSF46689          Homeodomain-like                 coord: 42..98; score: 1.02
IPR025756  MYB-CC type transcription factor, LHEQLE-containing domain  PFAM      PF14379           Myb_CC_LHEQLE                    coord: 137..191; score: 4.
None       No IPR available                                            PANTHER   PTHR31499         FAMILY NOT NAMED                 coord: 11..321; score: 1.9E
None       No IPR available                                            PANTHER   PTHR31499:SF9     MYB FAMILY TRANSCRIPTION FACTOR  coord: 11..321; score: 1.9E
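
The "coord" values in the Alignment column give the span of each domain on the protein. The sketch below shows how such a span could be cut out of the sequence. It assumes the coordinates are 1-based and inclusive, which is not stated on this page, and the helper name `domain_slice` is hypothetical. The input is the first 120 residues of the query, copied from the Query rows of the BLAST alignments above.

```python
def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating coordinates as 1-based and inclusive."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]


# Query residues 1-120, as shown in the alignment rows (assumed to be the
# protein that the InterPro coordinates refer to).
query_1_120 = (
    "MFPRLVNPDGDIQIHGPRGSVASDLTHSHRGDPCLVLTSDPKPRLRWTADLHERFVDAVT"
    "QLGGAGKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGKDMGEASKDGAYLLESPST"
)

# SANT/Myb domain (PF00249), coord: 44..95
myb = domain_slice(query_1_120, 44, 95)
print(len(myb), myb)
```

Under these assumptions the SANT/Myb span is 52 residues long (95 - 44 + 1), beginning at the arginine in position 44.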