ClCG02G008090 (gene) Watermelon (Charleston Gray)

Name: ClCG02G008090
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: myb domain protein r1 LENGTH=305
Location: CG_Chr02 : 10168617 .. 10170211 (-)
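
The Location field above packs the chromosome, start, end, and strand into one string. As a minimal sketch, the snippet below splits that string and computes the span of the feature. The helper name `parse_location` is hypothetical, and the length assumes 1-based inclusive coordinates (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'CG_Chr02 : 10168617 .. 10170211 (-)' into parts.

    Hypothetical helper for illustration; not part of any database API.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CG_Chr02 : 10168617 .. 10170211 (-)")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 1595 bp on the minus strand of CG_Chr02.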
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAGCGTGGACTTGTGTAAATCAGAGCAGTTAAGATGTGACGCGTGTCCAGTCGCCAGCTGTATCCCAACTCCCCTCTTAACGGCTTCACAGGAACTACCGGTAACTAATCCCCTAACCCCCTCTCCCACCAACGTCCAACCTTTCACCTCCTCTTCCAGTAACCCTTCCCCACCCTTTTATATATCCCCCAAATCTCCAGGCCGCCATTAAAATCCCCCCTCCCAAACCTTCCTATGTTTTCTCCGATTCCAAGCAACTGAGTTTTTGTTTTTTCTTTTCTTCCATGGCTAAAGAAATCGATCGAATCAAAGGTCCCTGGAGTCCTGAAGAAGACGATGCTCTCCAGAGATTGGTCCATAAATACGGCCCTCGGAACTGGTCCTTGATTAGCAAATCAATTCCAGGCCGCTCCGGCAAATCCTGTCGCCTACGGTGGTGCAACCAGCTCTCACCGCAGGTGGAGCACCGAGCCTTCACGCCGGAGGAAGACGAGACAATCATTAACGCCCAAGCTCTGTACGGCAATAAGTGGGCTACCATCGCCAGGCTCCTCTCCGGCCGGACCGATAACGCGATCAAGAACCACTGGAACTCGACCTTGAAACGTAAGTGCTCGTCCATGGCCGACGAGAGCGGGGGTTGTAATACCAGTTCCCCGCCGTTGAAGAGGTCGGTTTCTGCAGGTGTTTATATGCCTCCGAATAGTCCTTCTGGATCCGACGTCAGCGATTCCGGCTTCTTCCCCGCCGTGTCGTCCTCCCATGTATACCGGCCAGTGGCCAGAACCGGCGGCGTTTTGCCTCCTGGCGAGACGGTGTCGTCTTCGAATGATCCACCGACATCATTGTCGTTGTCGCTTCCGGGTGCGGACTCATCGGAGGTTAATTTTGTGGCAAATTCTCTTCCAGCAATGGCTGGAGTTTCTGAGAGACCGAGTACGGCCCTGCCGTCTTCGGCACCGGCAAATGGACAGGAACAAATTTCAGGGGAGAAGGATGAGAGGAATTATAATGGGTTTGGGATTCTTAGCTCGGATTTAATGGCGGTGATGCAGGAGATGATAAGGAAGGAGGTGAGAAACTACATGGCTGGATTAATGGAACAGAACGTTGGTGGCGGTGGGGGTGGGGGAGTTTGTTTTCAGCAAGCTACCGTCGGTGGGTTTAGAAACGTCGTCGTCCAGAGGATTGGGGTCAGCAAGATTGACTGAGGAGAAGTTGCTGAAGTAAAAAAGGAAAAAAAAAAAAAAAAACCCGAAAAAGAGGAAAGAAGATGGAGGTTAATAATGATGGGCAATTATTAGTGTTTTGTTTTGTTATTTTGGTTATTTTTTGTTTATGTTTCTGTCGATGTAGTCTGTGAATCCAAGGAGAATGGGTAAATTATGGTGGGAAAATTTGGGGATTTTGATGAAGGAAGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGCGGTGGGTTTACATGGATGATGAAGAAGAAGATGGTGATGATGAATCTTCCATCAATCAGAGAGTTTCAAAAACCATTTGGAACTGAAACTTTTGAATCTGTTTCTCATTTTCCCTTTTAA

mRNA sequence

ATGGAGAGCGTGGACTTGTGTAAATCAGAGCAGTTAAGATGTGACGCGTGTCCAGTCGCCAGCTGTATCCCAACTCCCCTCTTAACGGCTTCACAGGAACTACCGGCCGCCATTAAAATCCCCCCTCCCAAACCTTCCTATGTTTTCTCCGATTCCAAGCAACTGAGTTTTTGTTTTTTCTTTTCTTCCATGGCTAAAGAAATCGATCGAATCAAAGGTCCCTGGAGTCCTGAAGAAGACGATGCTCTCCAGAGATTGGTCCATAAATACGGCCCTCGGAACTGGTCCTTGATTAGCAAATCAATTCCAGGCCGCTCCGGCAAATCCTGTCGCCTACGGTGGTGCAACCAGCTCTCACCGCAGGTGGAGCACCGAGCCTTCACGCCGGAGGAAGACGAGACAATCATTAACGCCCAAGCTCTGTACGGCAATAAGTGGGCTACCATCGCCAGGCTCCTCTCCGGCCGGACCGATAACGCGATCAAGAACCACTGGAACTCGACCTTGAAACGTAAGTGCTCGTCCATGGCCGACGAGAGCGGGGGTTGTAATACCAGTTCCCCGCCGTTGAAGAGGTCGGTTTCTGCAGGTGTTTATATGCCTCCGAATAGTCCTTCTGGATCCGACGTCAGCGATTCCGGCTTCTTCCCCGCCGTGTCGTCCTCCCATGTATACCGGCCAGTGGCCAGAACCGGCGGCGTTTTGCCTCCTGGCGAGACGGTGTCGTCTTCGAATGATCCACCGACATCATTGTCGTTGTCGCTTCCGGGTGCGGACTCATCGGAGGTTAATTTTGTGGCAAATTCTCTTCCAGCAATGGCTGGAGTTTCTGAGAGACCGAGTACGGCCCTGCCGTCTTCGGCACCGGCAAATGGACAGGAACAAATTTCAGGGGAGAAGGATGAGAGGAATTATAATGGGTTTGGGATTCTTAGCTCGGATTTAATGGCGGTGATGCAGGAGATGATAAGGAAGGAGGTGAGAAACTACATGGCTGGATTAATGGAACAGAACGTTGGTGGCGGTGGGGGTGGGGGAGTTTGTTTTCAGCAAGCTACCGTCGGTGGGTTTAGAAACGTCGTCGTCCAGAGGATTGGGTCTGTGAATCCAAGGAGAATGGGTAAATTATGGTGGGAAAATTTGGGGATTTTGATGAAGGAAGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGCGGTGGGTTTACATGGATGATGAAGAAGAAGATGGTGATGATGAATCTTCCATCAATCAGAGAGTTTCAAAAACCATTTGGAACTGAAACTTTTGAATCTGTTTCTCATTTTCCCTTTTAA

Coding sequence (CDS)

ATGGAGAGCGTGGACTTGTGTAAATCAGAGCAGTTAAGATGTGACGCGTGTCCAGTCGCCAGCTGTATCCCAACTCCCCTCTTAACGGCTTCACAGGAACTACCGGCCGCCATTAAAATCCCCCCTCCCAAACCTTCCTATGTTTTCTCCGATTCCAAGCAACTGAGTTTTTGTTTTTTCTTTTCTTCCATGGCTAAAGAAATCGATCGAATCAAAGGTCCCTGGAGTCCTGAAGAAGACGATGCTCTCCAGAGATTGGTCCATAAATACGGCCCTCGGAACTGGTCCTTGATTAGCAAATCAATTCCAGGCCGCTCCGGCAAATCCTGTCGCCTACGGTGGTGCAACCAGCTCTCACCGCAGGTGGAGCACCGAGCCTTCACGCCGGAGGAAGACGAGACAATCATTAACGCCCAAGCTCTGTACGGCAATAAGTGGGCTACCATCGCCAGGCTCCTCTCCGGCCGGACCGATAACGCGATCAAGAACCACTGGAACTCGACCTTGAAACGTAAGTGCTCGTCCATGGCCGACGAGAGCGGGGGTTGTAATACCAGTTCCCCGCCGTTGAAGAGGTCGGTTTCTGCAGGTGTTTATATGCCTCCGAATAGTCCTTCTGGATCCGACGTCAGCGATTCCGGCTTCTTCCCCGCCGTGTCGTCCTCCCATGTATACCGGCCAGTGGCCAGAACCGGCGGCGTTTTGCCTCCTGGCGAGACGGTGTCGTCTTCGAATGATCCACCGACATCATTGTCGTTGTCGCTTCCGGGTGCGGACTCATCGGAGGTTAATTTTGTGGCAAATTCTCTTCCAGCAATGGCTGGAGTTTCTGAGAGACCGAGTACGGCCCTGCCGTCTTCGGCACCGGCAAATGGACAGGAACAAATTTCAGGGGAGAAGGATGAGAGGAATTATAATGGGTTTGGGATTCTTAGCTCGGATTTAATGGCGGTGATGCAGGAGATGATAAGGAAGGAGGTGAGAAACTACATGGCTGGATTAATGGAACAGAACGTTGGTGGCGGTGGGGGTGGGGGAGTTTGTTTTCAGCAAGCTACCGTCGGTGGGTTTAGAAACGTCGTCGTCCAGAGGATTGGGTCTGTGAATCCAAGGAGAATGGGTAAATTATGGTGGGAAAATTTGGGGATTTTGATGAAGGAAGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGCGGTGGGTTTACATGGATGATGAAGAAGAAGATGGTGATGATGAATCTTCCATCAATCAGAGAGTTTCAAAAACCATTTGGAACTGAAACTTTTGAATCTGTTTCTCATTTTCCCTTTTAA

Protein sequence

MESVDLCKSEQLRCDACPVASCIPTPLLTASQELPAAIKIPPPKPSYVFSDSKQLSFCFFFSSMAKEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNTSSPPLKRSVSAGVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPPGETVSSSNDPPTSLSLSLPGADSSEVNFVANSLPAMAGVSERPSTALPSSAPANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGGGVCFQQATVGGFRNVVVQRIGSVNPRRMGKLWWENLGILMKEDEEEEEEEEEEEEEEEGGGFTWMMKKKMVMMNLPSIREFQKPFGTETFESVSHFPF
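
The protein sequence above corresponds to a translation of the coding sequence. As a rough sketch, the snippet below translates the first eight codons of the CDS with the standard genetic code. The function name `translate` is hypothetical, and the choice of the standard code (NCBI translation table 1) is an assumption, although it is the usual one for plant nuclear genes.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS listed on this page.
print(translate("ATGGAGAGCGTGGACTTGTGTAAA"))  # MESVDLCK
```

The result matches the first eight residues of the protein sequence listed above.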
BLAST of ClCG02G008090 vs. Swiss-Prot
Match: MYB44_ARATH (Transcription factor MYB44 OS=Arabidopsis thaliana GN=MYB44 PE=2 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 6.0e-79
Identity = 164/288 (56.94%), Positives = 195/288 (67.71%), Query Frame = 1

Query: 69  DRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFT 128
           DRIKGPWSPEED+ L+RLV KYGPRNW++ISKSIPGRSGKSCRLRWCNQLSPQVEHR F+
Sbjct: 3   DRIKGPWSPEEDEQLRRLVVKYGPRNWTVISKSIPGRSGKSCRLRWCNQLSPQVEHRPFS 62

Query: 129 PEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNTSSP 188
            EEDETI  A A +GNKWATIARLL+GRTDNA+KNHWNSTLKRKC          +    
Sbjct: 63  AEEDETIARAHAQFGNKWATIARLLNGRTDNAVKNHWNSTLKRKCGGYDHRGYDGSEDHR 122

Query: 189 PLKRSVSA-------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGG-VLP-PGE 248
           P+KRSVSA       G+YM P SP+GSDVSDS   P + S  +++PV R G  VLP P E
Sbjct: 123 PVKRSVSAGSPPVVTGLYMSPGSPTGSDVSDSSTIPILPSVELFKPVPRPGAVVLPLPIE 182

Query: 249 TVSSSNDPPTSLSLSLPGADSSEVNFVANSLPAMAGV--SERPSTALPSSAPANGQEQIS 308
           T SSS+DPPTSLSLSLPGAD SE +  ++    +     S        S  P +G  + +
Sbjct: 183 TSSSSDDPPTSLSLSLPGADVSEESNRSHESTNINNTTSSRHNHNNTVSFMPFSGGFRGA 242

Query: 309 GEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGG 346
            E+  +++ G G    + MAV+QEMI+ EVR+YM  +   N GG  GG
Sbjct: 243 IEEMGKSFPGNG---GEFMAVVQEMIKAEVRSYMTEMQRNNGGGFVGG 287
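
Each HSP summary line reports identities and positives as a count over the alignment length, followed by the percentage. A minimal sketch of how those percentages follow from the counts is given below. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of BLAST itself, and the pattern also accepts the misspelling "Postives" that some report renderers emit.

```python
import re

# Matches e.g. "Identity = 164/288 (56.94%), Positives = 195/288 (67.71%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length = int(m.group(1)), int(m.group(2))
    positives = int(m.group(3))
    return round(100 * identical / length, 2), round(100 * positives / length, 2)

line = "Identity = 164/288 (56.94%), Positives = 195/288 (67.71%), Query Frame = 1"
print(hsp_percentages(line))
```

For the MYB44_ARATH hit above, 164 identical and 195 similar positions over 288 aligned columns reproduce the reported 56.94% and 67.71%.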

BLAST of ClCG02G008090 vs. Swiss-Prot
Match: MYBB_XENLA (Myb-related protein B OS=Xenopus laevis GN=mybl2 PE=2 SV=2)

HSP 1 Score: 141.7 bits (356), Expect = 1.9e-32
Identity = 96/250 (38.40%), Positives = 127/250 (50.80%), Query Frame = 1

Query: 69  DRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFT 128
           D +KGPW+ EED+ +  LV KYG ++W+LI+K + GR GK CR RW N L+P+V+  ++T
Sbjct: 80  DLVKGPWTKEEDEKVIELVKKYGTKHWTLIAKQLRGRMGKQCRERWHNHLNPEVKKSSWT 139

Query: 129 PEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNTSSP 188
            EED  I  A  + GN+WA IA+LL GRTDNA+KNHWNST+KRK      E+GG  T   
Sbjct: 140 EEEDRIICQAHKVLGNRWAEIAKLLPGRTDNAVKNHWNSTIKRKV-----ETGGFLT--- 199

Query: 189 PLKRSVSAGVYMPPNSPSGSDVSDSGFFPAVSSSHVY--RPVARTGGVLPPGETVSSSND 248
                V A       S    +  DSG+  A   +HV    PV R+  +  P E    SN 
Sbjct: 200 -----VKA-------SGQQEEREDSGYQAAEDQNHVLLSEPVERSANI--PEE---PSNI 259

Query: 249 PPTSLSLSLPGADSSEVNFVANSLPAMAGVSERPSTALPSSAPANGQEQISGEKDERNYN 308
               L    PG  S + +          G +   +TA+  SAP         EK    Y 
Sbjct: 260 LSPKLLTKSPGIRSEQES-------GGEGSNSESATAIVDSAP---------EKWMVEYV 288

Query: 309 GFGILSSDLM 317
            F +  SD+M
Sbjct: 320 NFLVPGSDIM 288


HSP 2 Score: 82.0 bits (201), Expect = 1.8e-14
Identity = 77/282 (27.30%), Positives = 112/282 (39.72%), Query Frame = 1

Query: 69  DRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFT 128
           +R+K  W+PEED+ L+ LV K+G   W  I+ ++  R+ + C+ RW   L P +    +T
Sbjct: 28  NRVKVKWTPEEDETLKALVKKHGQGEWKTIASNLNNRTEQQCQHRWLRVLHPDLVKGPWT 87

Query: 129 PEEDETIINAQALYGNK-WATIARLLSGRTDNAIKNHWNSTL--KRKCSSMADESGGC-- 188
            EEDE +I     YG K W  IA+ L GR     +  W++ L  + K SS  +E      
Sbjct: 88  KEEDEKVIELVKKYGTKHWTLIAKQLRGRMGKQCRERWHNHLNPEVKKSSWTEEEDRIIC 147

Query: 189 --------------------------NTSSPPLKRSVSAGVYMPPN-SPSGSDVSDSGFF 248
                                     N  +  +KR V  G ++    S    +  DSG+ 
Sbjct: 148 QAHKVLGNRWAEIAKLLPGRTDNAVKNHWNSTIKRKVETGGFLTVKASGQQEEREDSGYQ 207

Query: 249 PAVSSSHVY--RPVARTGGVLPPGETVSSSNDPPTSLSLSLPGADSSEVNFVANSLPAMA 308
            A   +HV    PV R+  +  P E    SN     L    PG  S + +          
Sbjct: 208 AAEDQNHVLLSEPVERSANI--PEE---PSNILSPKLLTKSPGIRSEQES-------GGE 267

Query: 309 GVSERPSTALPSSAPANGQEQISGEKDERNYNGFGILSSDLM 317
           G +   +TA+  SAP         EK    Y  F +  SD+M
Sbjct: 268 GSNSESATAIVDSAP---------EKWMVEYVNFLVPGSDIM 288

BLAST of ClCG02G008090 vs. Swiss-Prot
Match: MYB_XENLA (Transcriptional activator Myb OS=Xenopus laevis GN=myb PE=2 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 5.5e-32
Identity = 62/102 (60.78%), Positives = 75/102 (73.53%), Query Frame = 1

Query: 71  IKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPE 130
           IKGPW+ EED  +  LVHKYGP+ WS+I+K + GR GK CR RW N L+P+V+  ++T E
Sbjct: 88  IKGPWTKEEDQRVIELVHKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKSSWTEE 147

Query: 131 EDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRK 173
           ED TI  A    GN+WA IA+LL GRTDNAIKNHWNST++RK
Sbjct: 148 EDRTIYEAHKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRK 189


HSP 2 Score: 71.6 bits (174), Expect = 2.4e-11
Identity = 38/111 (34.23%), Positives = 59/111 (53.15%), Query Frame = 1

Query: 72  KGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPEE 131
           K  W+ EED+ L++LV + G   W +I+  +P R+   C+ RW   L+P++    +T EE
Sbjct: 37  KTRWTREEDEKLKKLVEQNGTEEWKVIASFLPNRTDVQCQHRWQKVLNPELIKGPWTKEE 96

Query: 132 DETIINAQALYGNK-WATIARLLSGRTDNAIKNHWNSTL--KRKCSSMADE 180
           D+ +I     YG K W+ IA+ L GR     +  W++ L  + K SS  +E
Sbjct: 97  DQRVIELVHKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKSSWTEE 147


HSP 3 Score: 34.3 bits (77), Expect = 4.2e+00
Identity = 16/63 (25.40%), Positives = 28/63 (44.44%), Query Frame = 1

Query: 72  KGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPEE 131
           K  W+ EED  +    HK     W+ I+K +PGR+  + +  W + +  + E   +    
Sbjct: 141 KSSWTEEEDRTIYE-AHKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRKEEQEGYLQNS 200

Query: 132 DET 135
            +T
Sbjct: 201 SKT 202

BLAST of ClCG02G008090 vs. Swiss-Prot
Match: MYB_BOVIN (Transcriptional activator Myb OS=Bos taurus GN=MYB PE=2 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 9.3e-32
Identity = 74/187 (39.57%), Positives = 105/187 (56.15%), Query Frame = 1

Query: 71  IKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPE 130
           IKGPW+ EED  +  LV KYGP+ WS+I+K + GR GK CR RW N L+P+V+  ++T E
Sbjct: 91  IKGPWTKEEDQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKTSWTEE 150

Query: 131 EDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNTSSPPL 190
           ED  I  A    GN+WA IA+LL GRTDNAIKNHWNST++RK             S P +
Sbjct: 151 EDRIIYQAHKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRKVEQEGYLQESSKASQPAV 210

Query: 191 KRSVSAGVYMP--PNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPPGETVSSSNDPP 250
             S     ++    ++P  + +  +G  P+V+S + Y  ++         + VSS    P
Sbjct: 211 TTSFQKNSHLMGFTHAPPSAQLPPAG-QPSVNSDYPYYHISE-------AQNVSSHVPYP 269

Query: 251 TSLSLSL 256
            +L +++
Sbjct: 271 VALHVNI 269


HSP 2 Score: 71.2 bits (173), Expect = 3.1e-11
Identity = 34/99 (34.34%), Positives = 54/99 (54.55%), Query Frame = 1

Query: 72  KGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPEE 131
           K  W+ EED+ L++LV + G  +W +I+  +P R+   C+ RW   L+P++    +T EE
Sbjct: 40  KTRWTREEDEKLKKLVEQNGTDDWKVIANYLPNRTDVQCQHRWQKVLNPELIKGPWTKEE 99

Query: 132 DETIINAQALYGNK-WATIARLLSGRTDNAIKNHWNSTL 170
           D+ +I     YG K W+ IA+ L GR     +  W++ L
Sbjct: 100 DQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHL 138

BLAST of ClCG02G008090 vs. Swiss-Prot
Match: MYB_MOUSE (Transcriptional activator Myb OS=Mus musculus GN=Myb PE=1 SV=2)

HSP 1 Score: 138.3 bits (347), Expect = 2.1e-31
Identity = 79/174 (45.40%), Positives = 99/174 (56.90%), Query Frame = 1

Query: 71  IKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPE 130
           IKGPW+ EED  +  LV KYGP+ WS+I+K + GR GK CR RW N L+P+V+  ++T E
Sbjct: 91  IKGPWTKEEDQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKTSWTEE 150

Query: 131 EDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNTSSPPL 190
           ED  I  A    GN+WA IA+LL GRTDNAIKNHWNST++RK             S  P+
Sbjct: 151 EDRIIYQAHKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRKVEQEGYLQEPSKASQTPV 210

Query: 191 KRS-------VSAGVYMPPN--SPSG-SDVSDSGFFPAVS-----SSHVYRPVA 230
             S       +  G   PP+  SPSG S V+    +  ++     SSHV  PVA
Sbjct: 211 ATSFQKNNHLMGFGHASPPSQLSPSGQSSVNSEYPYYHIAEAQNISSHVPYPVA 264


HSP 2 Score: 71.2 bits (173), Expect = 3.1e-11
Identity = 34/99 (34.34%), Positives = 54/99 (54.55%), Query Frame = 1

Query: 72  KGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTPEE 131
           K  W+ EED+ L++LV + G  +W +I+  +P R+   C+ RW   L+P++    +T EE
Sbjct: 40  KTRWTREEDEKLKKLVEQNGTDDWKVIANYLPNRTDVQCQHRWQKVLNPELIKGPWTKEE 99

Query: 132 DETIINAQALYGNK-WATIARLLSGRTDNAIKNHWNSTL 170
           D+ +I     YG K W+ IA+ L GR     +  W++ L
Sbjct: 100 DQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHL 138

BLAST of ClCG02G008090 vs. TrEMBL
Match: A0A0A0LTE4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G427310 PE=4 SV=1)

HSP 1 Score: 534.3 bits (1375), Expect = 1.4e-148
Identity = 261/302 (86.42%), Positives = 279/302 (92.38%), Query Frame = 1

Query: 64  MAKEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE 123
           M+K+ DRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE
Sbjct: 1   MSKQTDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE 60

Query: 124 HRAFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGC 183
           HRAFTP+EDE IINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSM D++GGC
Sbjct: 61  HRAFTPDEDEAIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMPDDTGGC 120

Query: 184 NTSSPPLKRSVSAGVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPPGETVSS 243
           + +SPP K+SVSAG+YMPPNSPSGSD+SDSGFFPAVSSSHV+RPV RTG VLPPGETVSS
Sbjct: 121 HATSPPFKKSVSAGLYMPPNSPSGSDLSDSGFFPAVSSSHVFRPVPRTGAVLPPGETVSS 180

Query: 244 SNDPPTSLSLSLPGADSSEVNFVANSLPAMAGVSERPSTALPSSAPANGQEQISGEKDER 303
           S+DPPTSLSLSLPGADSSEVNFVANS+  + GVSER ST L  SA ANG+E+ISGEK+E 
Sbjct: 181 SDDPPTSLSLSLPGADSSEVNFVANSVQGVGGVSERRSTGLACSATANGEERISGEKEES 240

Query: 304 NYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGGGVCFQQATVGGFRNVVVQ 363
           N NGFGI SSDLMAVMQEMIRKEVRNYMAGLMEQ V  GGGGGVC+QQA  GGFRNVVVQ
Sbjct: 241 NSNGFGIFSSDLMAVMQEMIRKEVRNYMAGLMEQKV--GGGGGVCYQQAAAGGFRNVVVQ 300

Query: 364 RI 366
           RI
Sbjct: 301 RI 300

BLAST of ClCG02G008090 vs. TrEMBL
Match: A0A067K7A3_JATCU (MYB family protein OS=Jatropha curcas GN=JCGZ_12458 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 5.4e-103
Identity = 211/324 (65.12%), Positives = 235/324 (72.53%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           K++DRIKGPWSPEEDDALQ+LV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR
Sbjct: 6   KDVDRIKGPWSPEEDDALQKLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 65

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNT 185
           AFTPEEDETII A A +GNKWATIARLL+GRTDNAIKNHWNSTLKRKCSSMA++ G  + 
Sbjct: 66  AFTPEEDETIIRAHARFGNKWATIARLLNGRTDNAIKNHWNSTLKRKCSSMAEDGGAFDG 125

Query: 186 SSPPLKRSVSA--------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPP 245
           +  PLKRSVSA        G+YM P SPSGSDVSDS   P +SSSHVYRPVARTG V+PP
Sbjct: 126 NCQPLKRSVSAGSGMAVSTGLYMSPGSPSGSDVSDSS-VPVLSSSHVYRPVARTGAVIPP 185

Query: 246 GETVSSS--NDPPTSLSLSLPGADSSEV-NFVANS---------LPAMAGVSERPSTALP 305
            ET SSS  NDPPTSL LSLPG DSSEV N VA S         +P M  VS  P    P
Sbjct: 186 VETTSSSNNNDPPTSLCLSLPGVDSSEVSNRVAESTQATNTISLMPPMNNVSSPPPARPP 245

Query: 306 SSAPANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGG 365
            +A    Q+   G  +     GF   +++ MAVMQEMIR EVRNYM     +  GGG GG
Sbjct: 246 PAASVQPQQ---GAVNGGLGTGFVGFTAEFMAVMQEMIRMEVRNYMI----EKAGGGSGG 305

Query: 366 ---GVCFQQATVGGFRNVVVQRIG 367
              G+CFQ A   GFRNV + R+G
Sbjct: 306 ANVGMCFQAAGGEGFRNVAMNRVG 321

BLAST of ClCG02G008090 vs. TrEMBL
Match: F6HSY7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0129g01050 PE=4 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 4.9e-96
Identity = 207/323 (64.09%), Positives = 232/323 (71.83%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           K++DRIKGPWSPEEDDALQ+LV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR
Sbjct: 6   KDVDRIKGPWSPEEDDALQKLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 65

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMA-DESGGCN 185
           AFT EED+TI+ A A +GNKWATIARLLSGRTDNAIKNHWNSTLKRKCS++  D S G +
Sbjct: 66  AFTSEEDDTIMRAHARFGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSAITEDGSFGGD 125

Query: 186 TSSPPLKRSVSA-------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPP 245
               PLKRSVSA       G+Y+ P+SP GSDVSDS   P VSSSHVYRPVARTGG++PP
Sbjct: 126 YPPHPLKRSVSAGAAAPVSGLYLSPSSPCGSDVSDSS-LPVVSSSHVYRPVARTGGIIPP 185

Query: 246 GETVSSSNDPPTSLSLSLPGADSSEVNFVA------------NSLPAMAGVSERP--STA 305
            ET SSSNDPPTSLSLSLPG DS EV+  A              +P MA + + P     
Sbjct: 186 -ETTSSSNDPPTSLSLSLPGVDSCEVSNRAPEPNHAPPANPIQMIPTMAPLQQIPMHQHN 245

Query: 306 LPSSAPANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGG 365
            P++ PA    Q  GEK       F   S++L+AVMQEMIRKEVRNYMAGL EQN     
Sbjct: 246 QPATVPATVLSQ--GEKP------FIPFSAELLAVMQEMIRKEVRNYMAGL-EQN----- 305

Query: 366 GGGVCFQQATVGGFRNVVVQRIG 367
             GVC Q     G RN  V+RIG
Sbjct: 306 --GVCLQ---ADGIRNAAVKRIG 307

BLAST of ClCG02G008090 vs. TrEMBL
Match: B9SG07_RICCO (R2r3-myb transcription factor, putative OS=Ricinus communis GN=RCOM_1153840 PE=4 SV=1)

HSP 1 Score: 357.5 bits (916), Expect = 2.4e-95
Identity = 210/340 (61.76%), Positives = 236/340 (69.41%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           KE+DRIKGPWSPEEDDALQ+LV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR
Sbjct: 6   KEVDRIKGPWSPEEDDALQKLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 65

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGG--- 185
           AFTPEEDETII A A +GNKWATIARLL+GRTDNAIKNHWNSTLKRKC S+ +   G   
Sbjct: 66  AFTPEEDETIIRAHARFGNKWATIARLLNGRTDNAIKNHWNSTLKRKCCSLDEGYDGNLC 125

Query: 186 CNT----------SSPPLKRSVSA--------GVYMPPNSPSGSDVSDSGFFPAVSSS-H 245
           C+           ++ PLKRSVSA        G+YM P SPSGSDVSDS + P  +SS H
Sbjct: 126 CSNNNNNNNDSLINTQPLKRSVSAGSGVPVSTGLYMNPGSPSGSDVSDSSYVPVFTSSPH 185

Query: 246 VYRPVARTGGVLPPGETVSSSN----------DPPTSLSLSLPGADSSEV-NFVANSLPA 305
           V+RPVARTGGV+   E  SSSN          DPPTSLSLSLPG DSSEV N VA S P 
Sbjct: 186 VFRPVARTGGVVSFVEATSSSNDNNNNNNISDDPPTSLSLSLPGVDSSEVSNRVAESTPV 245

Query: 306 MAGVSERPS-----TALPSSAPANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEV 365
               S   S       +PS A A   +Q  G  +   + GF   +++ MAVMQEMIRKEV
Sbjct: 246 RVPDSTTISLMPVMNQVPSPATAAAVQQ-GGSANSGGFMGF---TTEFMAVMQEMIRKEV 305

Query: 366 RNYMAGLMEQNVGGGGGGGVCFQQATVGGFRNVV-VQRIG 367
           RNYM   MEQ+    GGGG+CFQ A   GFRNVV + R+G
Sbjct: 306 RNYM---MEQS----GGGGMCFQAAGGDGFRNVVGMNRVG 334

BLAST of ClCG02G008090 vs. TrEMBL
Match: A0A0A0KJL5_CUCSA (Sucrose responsive element binding protein OS=Cucumis sativus GN=Csa_6G491690 PE=4 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 5.4e-95
Identity = 191/306 (62.42%), Positives = 224/306 (73.20%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           K++DRIKGPWSPEEDDALQRLV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR
Sbjct: 6   KDMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 65

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNT 185
            F+PEEDETII A A +GN+WATIARLL+GRTDNA+KNHWNSTLKRKCS M +E    + 
Sbjct: 66  PFSPEEDETIIRAHANFGNRWATIARLLTGRTDNAVKNHWNSTLKRKCSLMMNEGYEVDP 125

Query: 186 SSPPLKRSVSA--------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPP 245
           +  P+K+SVSA        G+YM P SPSGSD+SDS   P VS + VYRPVARTGGV+PP
Sbjct: 126 NVQPMKKSVSAGAAVNASNGLYMSPGSPSGSDISDSS-VPVVSPT-VYRPVARTGGVIPP 185

Query: 246 GETV-SSSNDPPTSLSLSLPGADSSEVNFVANS--LPAMAGVSERPSTALPSSAPANGQE 305
           GE+  SS+ DPPTSLSLSLPG DSS  +   ++  +P MA  ++  S             
Sbjct: 186 GESAPSSATDPPTSLSLSLPGVDSSRHSGSGSTAQVPLMAAFAQIQSMTTTEQVRTAQPS 245

Query: 306 QISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGGGVCFQQATV 361
             +GEK     NGFG+ S+DLMAVMQEMI+ EV++YM GL EQ       G  CFQ+A  
Sbjct: 246 GGAGEK----INGFGVFSADLMAVMQEMIKSEVKSYMEGLSEQR------GRRCFQEAKA 299

BLAST of ClCG02G008090 vs. TAIR10
Match: AT5G67300.1 (AT5G67300.1 myb domain protein r1)

HSP 1 Score: 296.2 bits (757), Expect = 3.4e-80
Identity = 164/288 (56.94%), Positives = 195/288 (67.71%), Query Frame = 1

Query: 69  DRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFT 128
           DRIKGPWSPEED+ L+RLV KYGPRNW++ISKSIPGRSGKSCRLRWCNQLSPQVEHR F+
Sbjct: 3   DRIKGPWSPEEDEQLRRLVVKYGPRNWTVISKSIPGRSGKSCRLRWCNQLSPQVEHRPFS 62

Query: 129 PEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNTSSP 188
            EEDETI  A A +GNKWATIARLL+GRTDNA+KNHWNSTLKRKC          +    
Sbjct: 63  AEEDETIARAHAQFGNKWATIARLLNGRTDNAVKNHWNSTLKRKCGGYDHRGYDGSEDHR 122

Query: 189 PLKRSVSA-------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGG-VLP-PGE 248
           P+KRSVSA       G+YM P SP+GSDVSDS   P + S  +++PV R G  VLP P E
Sbjct: 123 PVKRSVSAGSPPVVTGLYMSPGSPTGSDVSDSSTIPILPSVELFKPVPRPGAVVLPLPIE 182

Query: 249 TVSSSNDPPTSLSLSLPGADSSEVNFVANSLPAMAGV--SERPSTALPSSAPANGQEQIS 308
           T SSS+DPPTSLSLSLPGAD SE +  ++    +     S        S  P +G  + +
Sbjct: 183 TSSSSDDPPTSLSLSLPGADVSEESNRSHESTNINNTTSSRHNHNNTVSFMPFSGGFRGA 242

Query: 309 GEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGG 346
            E+  +++ G G    + MAV+QEMI+ EVR+YM  +   N GG  GG
Sbjct: 243 IEEMGKSFPGNG---GEFMAVVQEMIKAEVRSYMTEMQRNNGGGFVGG 287

BLAST of ClCG02G008090 vs. TAIR10
Match: AT4G37260.1 (AT4G37260.1 myb domain protein 73)

HSP 1 Score: 293.5 bits (750), Expect = 2.2e-79
Identity = 175/327 (53.52%), Positives = 205/327 (62.69%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           K ++RIKGPWSPEEDD LQRLV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSP+VEHR
Sbjct: 7   KNMERIKGPWSPEEDDLLQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPEVEHR 66

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADE-----S 185
           AF+ EEDETII A A +GNKWATI+RLL+GRTDNAIKNHWNSTLKRKCS          +
Sbjct: 67  AFSQEEDETIIRAHARFGNKWATISRLLNGRTDNAIKNHWNSTLKRKCSVEGQSCDFGGN 126

Query: 186 GGCNTS---SPPLKRS------VSAGVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVART 245
           GG + +     PLKR+      VS G+YM P SPSGSDVS+     +   +HV++P  R+
Sbjct: 127 GGYDGNLGEEQPLKRTASGGGGVSTGLYMSPGSPSGSDVSEQ----SSGGAHVFKPTVRS 186

Query: 246 GGVLPPGETVSSSNDPPTSLSLSLPGADSSEVNFVANSLPAMAG---VSERPSTALPSSA 305
                     SS  DPPT LSLSLP  D +    V  + P       V +   TA     
Sbjct: 187 -----EVTASSSGEDPPTYLSLSLPWTDET----VRVNEPVQLNQNTVMDGGYTA--ELF 246

Query: 306 PANGQEQISGEKDERN--YNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVG----GG 365
           P   +EQ+  E++E      GFG    + M V+QEMIR EVR+YMA L   NVG    GG
Sbjct: 247 PVRKEEQVEVEEEEAKGISGGFG---GEFMTVVQEMIRTEVRSYMADLQRGNVGGSSSGG 306

Query: 366 GGGGVCFQQATVG---GFRNVVVQRIG 367
           GGGG C  Q+      GFR  +V +IG
Sbjct: 307 GGGGSCMPQSVNSRRVGFREFIVNQIG 315

BLAST of ClCG02G008090 vs. TAIR10
Match: AT2G23290.1 (AT2G23290.1 myb domain protein 70)

HSP 1 Score: 283.1 bits (723), Expect = 2.9e-76
Identity = 169/326 (51.84%), Positives = 190/326 (58.28%), Query Frame = 1

Query: 63  SMAKEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQV 122
           S  KE+DRIKGPWSPEEDD LQ LV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSP+V
Sbjct: 4   STRKEMDRIKGPWSPEEDDLLQSLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPEV 63

Query: 123 EHRAFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCS---SMADE 182
           EHR FT EED+TII A A +GNKWATIARLL+GRTDNAIKNHWNSTLKRKCS      +E
Sbjct: 64  EHRGFTAEEDDTIILAHARFGNKWATIARLLNGRTDNAIKNHWNSTLKRKCSGGGGGGEE 123

Query: 183 SGGCN-----------TSSPPLKRSVSAG---VYMPPNSPSGSDVSD-----SGFFPAVS 242
              C+           T   PLKR  S G   V +   SP+GSDVS+         P  S
Sbjct: 124 GQSCDFGGNGGYDGNLTDEKPLKRRASGGGGVVVVTALSPTGSDVSEQSQSSGSVLPVSS 183

Query: 243 SSHVYRPVARTGG-VLPPGETVSSSNDPPTSLSLSLPGADSSEVNFVANSLPAMAGVSER 302
           S HV++P AR GG V+          DP T L LSLP  +                    
Sbjct: 184 SCHVFKPTARAGGVVIESSSPEEEEKDPMTCLRLSLPWVNE------------------- 243

Query: 303 PSTALPSSAPANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNV 362
            ST  P   P   +E+   E+ ER  +G G    D M V+QEMI+ EVR+YMA L   N 
Sbjct: 244 -STTPPELFPVKREEE---EEKEREISGLG---GDFMTVVQEMIKTEVRSYMADLQLGNG 303

BLAST of ClCG02G008090 vs. TAIR10
Match: AT3G50060.1 (AT3G50060.1 myb domain protein 77)

HSP 1 Score: 276.6 bits (706), Expect = 2.8e-74
Identity = 160/311 (51.45%), Positives = 196/311 (63.02%), Query Frame = 1

Query: 69  DRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFT 128
           DR+KGPWS EED+ L+R+V KYGPRNWS ISKSIPGRSGKSCRLRWCNQLSP+VEHR F+
Sbjct: 3   DRVKGPWSQEEDEQLRRMVEKYGPRNWSAISKSIPGRSGKSCRLRWCNQLSPEVEHRPFS 62

Query: 129 PEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNTSS- 188
           PEEDETI+ A+A +GNKWATIARLL+GRTDNA+KNHWNSTLKRKCS     +    T   
Sbjct: 63  PEEDETIVTARAQFGNKWATIARLLNGRTDNAVKNHWNSTLKRKCSGGVAVTTVTETEED 122

Query: 189 ---PPLKRSVS---------AGVYMPPNSPSGSDVSDSGFFPAVSS--SHVYRPVARTGG 248
              P  +RSVS          G+YM P SP+G DVSDS   P+ SS  + +++P+  +GG
Sbjct: 123 QDRPKKRRSVSFDSAFAPVDTGLYMSPESPNGIDVSDSSTIPSPSSPVAQLFKPMPISGG 182

Query: 249 --VLP---PGETVSSSNDPPTSLSLSLPGADSSEVNFVANSLPAMAGVSERPSTALPSSA 308
             V+P   P E  SSS DPPTSLSLSLPGA+++  +   N+   M    E       S  
Sbjct: 183 FTVVPQPLPVEMSSSSEDPPTSLSLSLPGAENTSSSHNNNNNALMFPRFE-------SQM 242

Query: 309 PANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGGGVC 360
             N +E+  G + E             M V+QEMI+ EVR+YMA +  Q   GG   G  
Sbjct: 243 KINVEERGEGRRGE------------FMTVVQEMIKAEVRSYMAEM--QKTSGGFVVGGL 292

BLAST of ClCG02G008090 vs. TAIR10
Match: AT3G55730.1 (AT3G55730.1 myb domain protein 109)

HSP 1 Score: 170.2 bits (430), Expect = 2.8e-42
Identity = 94/199 (47.24%), Positives = 118/199 (59.30%), Query Frame = 1

Query: 70  RIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRAFTP 129
           ++KGPWS EED  L +LV K GPRNWSLI++ IPGRSGKSCRLRWCNQL P ++ + F+ 
Sbjct: 54  KVKGPWSTEEDAVLTKLVRKLGPRNWSLIARGIPGRSGKSCRLRWCNQLDPCLKRKPFSD 113

Query: 130 EEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRK----------------C 189
           EED  II+A A++GNKWA IA+LL+GRTDNAIKNHWNSTL+RK                 
Sbjct: 114 EEDRMIISAHAVHGNKWAVIAKLLTGRTDNAIKNHWNSTLRRKYADLWNNGQWMANSVTT 173

Query: 190 SSMADESGGCNTSSPPLKRSVSAGVY--MPPNSPSGSDV--------------SDSGFFP 233
           +S+ +E+    T+ P  K+ +  G     PP  P  SDV                    P
Sbjct: 174 ASVKNENVDETTNPPSSKQQLPQGDINSSPPKPPQVSDVVMEEAANEPQEPQEQQEQAPP 233

BLAST of ClCG02G008090 vs. NCBI nr
Match: gi|659125758|ref|XP_008462845.1| (PREDICTED: transcription factor MYB44-like [Cucumis melo])

HSP 1 Score: 548.1 bits (1411), Expect = 1.4e-152
Identity = 270/302 (89.40%), Positives = 280/302 (92.72%), Query Frame = 1

Query: 64  MAKEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE 123
           MAK+IDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE
Sbjct: 1   MAKQIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE 60

Query: 124 HRAFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGC 183
           HRAFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGC
Sbjct: 61  HRAFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGC 120

Query: 184 NTSSPPLKRSVSAGVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPPGETVSS 243
           + +SPPLKRS+SAG+YMPPNSPS SDVSDSGFFPAVSSSHVYRPVARTG VLPPGETVSS
Sbjct: 121 DATSPPLKRSLSAGLYMPPNSPSASDVSDSGFFPAVSSSHVYRPVARTGAVLPPGETVSS 180

Query: 244 SNDPPTSLSLSLPGADSSEVNFVANSLPAMAGVSERPSTALPSSAPANGQEQISGEKDER 303
           SNDPPTSLSLSLPGADSSEVNFVANS   M GVSER ST L  SA  NG+E+ISGEK+E 
Sbjct: 181 SNDPPTSLSLSLPGADSSEVNFVANSAQGMGGVSERRSTGLACSAAMNGEERISGEKEES 240

Query: 304 NYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGGGVCFQQATVGGFRNVVVQ 363
           N NGFGI  SDLM VMQEMIRKEVRNYMAGLMEQNV  GGGGGVC+QQA  GGFRNVV+Q
Sbjct: 241 NSNGFGIFGSDLMTVMQEMIRKEVRNYMAGLMEQNV--GGGGGVCYQQAAAGGFRNVVIQ 300

Query: 364 RI 366
           RI
Sbjct: 301 RI 300

BLAST of ClCG02G008090 vs. NCBI nr
Match: gi|778673643|ref|XP_011650034.1| (PREDICTED: transcription factor MYB44-like [Cucumis sativus])

HSP 1 Score: 534.3 bits (1375), Expect = 2.1e-148
Identity = 261/302 (86.42%), Positives = 279/302 (92.38%), Query Frame = 1

Query: 64  MAKEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE 123
           M+K+ DRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE
Sbjct: 1   MSKQTDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVE 60

Query: 124 HRAFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGC 183
           HRAFTP+EDE IINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSM D++GGC
Sbjct: 61  HRAFTPDEDEAIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMPDDTGGC 120

Query: 184 NTSSPPLKRSVSAGVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPPGETVSS 243
           + +SPP K+SVSAG+YMPPNSPSGSD+SDSGFFPAVSSSHV+RPV RTG VLPPGETVSS
Sbjct: 121 HATSPPFKKSVSAGLYMPPNSPSGSDLSDSGFFPAVSSSHVFRPVPRTGAVLPPGETVSS 180

Query: 244 SNDPPTSLSLSLPGADSSEVNFVANSLPAMAGVSERPSTALPSSAPANGQEQISGEKDER 303
           S+DPPTSLSLSLPGADSSEVNFVANS+  + GVSER ST L  SA ANG+E+ISGEK+E 
Sbjct: 181 SDDPPTSLSLSLPGADSSEVNFVANSVQGVGGVSERRSTGLACSATANGEERISGEKEES 240

Query: 304 NYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGGGVCFQQATVGGFRNVVVQ 363
           N NGFGI SSDLMAVMQEMIRKEVRNYMAGLMEQ V  GGGGGVC+QQA  GGFRNVVVQ
Sbjct: 241 NSNGFGIFSSDLMAVMQEMIRKEVRNYMAGLMEQKV--GGGGGVCYQQAAAGGFRNVVVQ 300

Query: 364 RI 366
           RI
Sbjct: 301 RI 300

BLAST of ClCG02G008090 vs. NCBI nr
Match: gi|643722118|gb|KDP31997.1| (hypothetical protein JCGZ_12458 [Jatropha curcas])

HSP 1 Score: 382.9 bits (982), Expect = 7.7e-103
Identity = 211/324 (65.12%), Positives = 235/324 (72.53%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           K++DRIKGPWSPEEDDALQ+LV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR
Sbjct: 6   KDVDRIKGPWSPEEDDALQKLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 65

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNT 185
           AFTPEEDETII A A +GNKWATIARLL+GRTDNAIKNHWNSTLKRKCSSMA++ G  + 
Sbjct: 66  AFTPEEDETIIRAHARFGNKWATIARLLNGRTDNAIKNHWNSTLKRKCSSMAEDGGAFDG 125

Query: 186 SSPPLKRSVSA--------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPP 245
           +  PLKRSVSA        G+YM P SPSGSDVSDS   P +SSSHVYRPVARTG V+PP
Sbjct: 126 NCQPLKRSVSAGSGMAVSTGLYMSPGSPSGSDVSDSS-VPVLSSSHVYRPVARTGAVIPP 185

Query: 246 GETVSSS--NDPPTSLSLSLPGADSSEV-NFVANS---------LPAMAGVSERPSTALP 305
            ET SSS  NDPPTSL LSLPG DSSEV N VA S         +P M  VS  P    P
Sbjct: 186 VETTSSSNNNDPPTSLCLSLPGVDSSEVSNRVAESTQATNTISLMPPMNNVSSPPPARPP 245

Query: 306 SSAPANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGG 365
            +A    Q+   G  +     GF   +++ MAVMQEMIR EVRNYM     +  GGG GG
Sbjct: 246 PAASVQPQQ---GAVNGGLGTGFVGFTAEFMAVMQEMIRMEVRNYMI----EKAGGGSGG 305

Query: 366 ---GVCFQQATVGGFRNVVVQRIG 367
              G+CFQ A   GFRNV + R+G
Sbjct: 306 ANVGMCFQAAGGEGFRNVAMNRVG 321

BLAST of ClCG02G008090 vs. NCBI nr
Match: gi|802644219|ref|XP_012079312.1| (PREDICTED: transcription factor MYB44 [Jatropha curcas])

HSP 1 Score: 382.9 bits (982), Expect = 7.7e-103
Identity = 211/324 (65.12%), Positives = 235/324 (72.53%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           K++DRIKGPWSPEEDDALQ+LV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR
Sbjct: 17  KDVDRIKGPWSPEEDDALQKLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 76

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADESGGCNT 185
           AFTPEEDETII A A +GNKWATIARLL+GRTDNAIKNHWNSTLKRKCSSMA++ G  + 
Sbjct: 77  AFTPEEDETIIRAHARFGNKWATIARLLNGRTDNAIKNHWNSTLKRKCSSMAEDGGAFDG 136

Query: 186 SSPPLKRSVSA--------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRPVARTGGVLPP 245
           +  PLKRSVSA        G+YM P SPSGSDVSDS   P +SSSHVYRPVARTG V+PP
Sbjct: 137 NCQPLKRSVSAGSGMAVSTGLYMSPGSPSGSDVSDSS-VPVLSSSHVYRPVARTGAVIPP 196

Query: 246 GETVSSS--NDPPTSLSLSLPGADSSEV-NFVANS---------LPAMAGVSERPSTALP 305
            ET SSS  NDPPTSL LSLPG DSSEV N VA S         +P M  VS  P    P
Sbjct: 197 VETTSSSNNNDPPTSLCLSLPGVDSSEVSNRVAESTQATNTISLMPPMNNVSSPPPARPP 256

Query: 306 SSAPANGQEQISGEKDERNYNGFGILSSDLMAVMQEMIRKEVRNYMAGLMEQNVGGGGGG 365
            +A    Q+   G  +     GF   +++ MAVMQEMIR EVRNYM     +  GGG GG
Sbjct: 257 PAASVQPQQ---GAVNGGLGTGFVGFTAEFMAVMQEMIRMEVRNYMI----EKAGGGSGG 316

Query: 366 ---GVCFQQATVGGFRNVVVQRIG 367
              G+CFQ A   GFRNV + R+G
Sbjct: 317 ANVGMCFQAAGGEGFRNVAMNRVG 332

BLAST of ClCG02G008090 vs. NCBI nr
Match: gi|1009145307|ref|XP_015890262.1| (PREDICTED: transcription factor MYB44-like [Ziziphus jujuba])

HSP 1 Score: 361.7 bits (927), Expect = 1.8e-96
Identity = 212/337 (62.91%), Positives = 242/337 (71.81%), Query Frame = 1

Query: 66  KEIDRIKGPWSPEEDDALQRLVHKYGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 125
           +E+DRIKGPWSPEEDD+LQ+LV K+GPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR
Sbjct: 6   REMDRIKGPWSPEEDDSLQKLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHR 65

Query: 126 AFTPEEDETIINAQALYGNKWATIARLLSGRTDNAIKNHWNSTLKRKCSSMADE------ 185
           AFTPEEDETII A A +GNKWATIARLL+GRTDNAIKNHWNSTLKRKCS++ D+      
Sbjct: 66  AFTPEEDETIIRAHARFGNKWATIARLLNGRTDNAIKNHWNSTLKRKCSAIIDDCNLHGV 125

Query: 186 --SGGCNTSSP--PLKRSVSA--------GVYMPPNSPSGSDVSDSGFFPAVSSSHVYRP 245
              GG + +SP   LKRSVSA        G+YM P+SPSGSD+S+S   P VSSS V+RP
Sbjct: 126 VPGGGYDGNSPNQRLKRSVSAGSALPVSTGLYMSPSSPSGSDMSESS-VPVVSSSQVFRP 185

Query: 246 VARTGGVLPPGETVSSSNDPPTSLSLSLPGADSSEV-NFVANS----------LPAMAGV 305
           VARTGGVLP  ET SSSNDPPTSLSLSLPG DS EV N V  S          LPA A  
Sbjct: 186 VARTGGVLPLVETTSSSNDPPTSLSLSLPGVDSCEVSNRVVESTQNASNTMTLLPAPATA 245

Query: 306 SERPSTALPSSAP-----ANGQEQISGEKDERNYNG-FGILSSDLMAVMQEMIRKEVRNY 365
           +   +T  P+SAP     A   +++   +   + NG F   S++L+AVMQEMI+KEVRNY
Sbjct: 246 TATATT--PTSAPAVVPMAVPVQKVPTSRGSESENGVFVPFSAELLAVMQEMIKKEVRNY 305

Query: 366 MAGLMEQNVGGGGGGGVCF-QQATVGGFRNVVVQRIG 367
           MAGL EQ        GVC  QQA    FRNV V+RIG
Sbjct: 306 MAGL-EQT-------GVCLPQQAVADAFRNVAVKRIG 331

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
MYB44_ARATH   6.0e-79    56.94       Transcription factor MYB44 OS=Arabidopsis thaliana GN=MYB44 PE=2 SV=1
MYBB_XENLA    1.9e-32    38.40       Myb-related protein B OS=Xenopus laevis GN=mybl2 PE=2 SV=2
MYB_XENLA     5.5e-32    60.78       Transcriptional activator Myb OS=Xenopus laevis GN=myb PE=2 SV=1
MYB_BOVIN     9.3e-32    39.57       Transcriptional activator Myb OS=Bos taurus GN=MYB PE=2 SV=1
MYB_MOUSE     2.1e-31    45.40       Transcriptional activator Myb OS=Mus musculus GN=Myb PE=1 SV=2

Match Name          E-value     Identity    Description
A0A0A0LTE4_CUCSA    1.4e-148    86.42       Uncharacterized protein OS=Cucumis sativus GN=Csa_2G427310 PE=4 SV=1
A0A067K7A3_JATCU    5.4e-103    65.12       MYB family protein OS=Jatropha curcas GN=JCGZ_12458 PE=4 SV=1
F6HSY7_VITVI        4.9e-96     64.09       Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0129g01050 PE=4 SV=...
B9SG07_RICCO        2.4e-95     61.76       R2r3-myb transcription factor, putative OS=Ricinus communis GN=RCOM_1153840 PE=4...
A0A0A0KJL5_CUCSA    5.4e-95     62.42       Sucrose responsive element binding protein OS=Cucumis sativus GN=Csa_6G491690 PE...

Match Name     E-value    Identity    Description
AT5G67300.1    3.4e-80    56.94       myb domain protein r1
AT4G37260.1    2.2e-79    53.52       myb domain protein 73
AT2G23290.1    2.9e-76    51.84       myb domain protein 70
AT3G50060.1    2.8e-74    51.45       myb domain protein 77
AT3G55730.1    2.8e-42    47.24       myb domain protein 109

Match Name                           E-value     Identity    Description
gi|659125758|ref|XP_008462845.1|     1.4e-152    89.40       PREDICTED: transcription factor MYB44-like [Cucumis melo]
gi|778673643|ref|XP_011650034.1|     2.1e-148    86.42       PREDICTED: transcription factor MYB44-like [Cucumis sativus]
gi|643722118|gb|KDP31997.1|          7.7e-103    65.12       hypothetical protein JCGZ_12458 [Jatropha curcas]
gi|802644219|ref|XP_012079312.1|     7.7e-103    65.12       PREDICTED: transcription factor MYB44 [Jatropha curcas]
gi|1009145307|ref|XP_015890262.1|    1.8e-96     62.91       PREDICTED: transcription factor MYB44-like [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR001005    SANT/Myb
IPR009057    Homeobox-like_sf
IPR017930    Myb_dom
Vocabulary: Molecular Function
Term          Definition
GO:0003677    DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0000785        chromatin
molecular_function    GO:0003677        DNA binding
molecular_function    GO:0003682        chromatin binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG02G008090.1    ClCG02G008090.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term     IPR Description     Source     Source Term         Source Description                  Alignment
IPR001005    SANT/Myb domain     PFAM       PF00249             Myb_DNA-binding                     coord: 72..117, score: 5.7E-19; coord: 126..168, score: 9.3
IPR001005    SANT/Myb domain     SMART      SM00717             sant                                coord: 71..120, score: 1.4E-17; coord: 123..171, score: 2.7
IPR009057    Homeodomain-like    GENE3D     G3DSA:1.10.10.60    -                                   coord: 73..125, score: 4.7E-26; coord: 126..172, score: 4.9
IPR009057    Homeodomain-like    unknown    SSF46689            Homeodomain-like                    coord: 69..165, score: 2.52
IPR017930    Myb domain          PROFILE    PS51294             HTH_MYB                             coord: 67..122, score: 26.373; coord: 124..173, score: 21
None         No IPR available    unknown    Coil                Coil                                coord: 381..405, scor
None         No IPR available    PANTHER    PTHR10641           MYB-LIKE DNA-BINDING PROTEIN MYB    coord: 43..426, score: 2.8E
None         No IPR available    PANTHER    PTHR10641:SF551     MYB TRANSCRIPTION FACTOR            coord: 43..426, score: 2.8E