Cp4.1LG03g18020 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g18020
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myb family transcription factor family protein
Location: Cp4.1LG03 : 12298519 .. 12302320 (+)
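
As a small illustration of how the location field can be read, the sketch below parses the coordinate string and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page. The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cp4.1LG03 : 12298519 .. 12302320 (+)")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(chrom, strand, span)
```

Under that assumption the span covers 3802 bases on the plus strand.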
The following sequences are available for this feature:

Gene sequence (with intron)

GTGTCCACTCTCAGTAAAGCTTTATTATTTTTTGGGTTTAAAAAGCAAAAAAGATTCCATGGACGCCATGGATTTCCATGGATTCAAGCTCTAGAGAGAGAAAGAGAAAAAGTGGGTGTTTTCTCTGAGGGATGATTTGGTCAGCAAAATCAAGTTCTTCTTCTTCTTCTTCTTCTTCTTCAAATTCACTTCTTTTTTTTATTAAAAGATTTTAAAAAGAATTTTCAAAGAAAAAAAGAAGCTCTAAAATCAATCACATGGCAGCAGGTTGCCTTTCTGATTTAGCCTTCTACCCACGCTTGCACAGACTCATACTGACAAAGACAGAACACAGAGAAAAAGAAAACAAAAACTTGGGACAGAGACTGCGCACAGACATCGCTGTCTTGTTTCTCTCTCTTTTAAATTCATTTCATATATATATTTCTTTTCTTGATTGCTCGACCCATTTTCGGGGCTCCGACCCAAAAGGGTCGTTCAATTTTATGGAGTGGATTTTTTGGCTCTCACATTTTCGTTTTTTTTTTTTTTTTTGTTTCACTCGCCGGAGCAAAAATGCTCTCTGGGTTTTCACAGGAACCGGCAGGAATGTACTCTGCCATTCCGGCGCTGCCGATGGACGGCGGCGGCGGCGGCGGCAAGTTTCAGGGTTCTCTAGACGGAACGAATCTTCCTGGAGATGCTTGTTTGGTTCTCACGTCGGATCCCAAACCACGCCTCCGATGGACGGCGGAGCTCCATGAGCGATTCGTCGACGCCGTCACTCAGCTCGGCGGCGCCGACAGTGAGTACACCTTTTTTCCCTTCTCTTTTTGCTTTGTAATTGCTTCTCTCTGCAATTTGGTCTCTGTTTTAATGATTGTGTCACTTAATGATGGCTTGAAGTGGGAGGAGGTGTTCTTATATCTATCCTTTAGAGACAAGTACTTGGCTCATGCGGGTCGGTACTCTTATCGGTTAAATTCCAACATTTGTAACGGTTTGATCGTGAGCTGTAGAAATGATTGAGTTTGCTATTAAACTCGTAAAAAGAGTAGTGTAGCGGTTTTTGTTCATATTGATTTCACTACGATCCACTTTTCTATTAAATGCAATAGAAATAGACCGTGTCGACAAGTTAGTTCGACGTCCTACCGACGGGTATGCAAGTAAGTATTGTCCTTTTTTTAGGGTAGAGTGTCCTTGAGGGGAGATGTGGCCTACAACTAGATTTATGTCTATTGATTTGGTGTCCTTCAAGTTTCTTTCATTTAAGACAAGCTGAGTTAAGTTAGGTTTCTCATTGCTCCCTTAAGTGTTGGGCATGGAAAAGGGAGTAAGTGTGCAGTTAAGATGAAAAGAAAAATGTTAAGATGAAAAGAAAAATGTTTCTGGCCCGGGGGTATCCCAACACGGAGAGTTTGCCATATATCGGGGTGGAGGATGGAATCGGGTCGGTGCGGGCTGCGAGAGCGGTACCAAATCGAGGCAGGCGAGTGAATAAGGATCGAAACCCAATATGAAAACACTTCCGCTCGAGTTAGGGGCCTTTTTTTGCAGGATGCCAGGTGTGCAGGAACGCCCAACCGGGTTCGGTTATATAAACATTCTATATGGCTAAGATGCAAGCTTTAGCAATAAGAAACGATTGCGTTACTGAATAGAACGATTTGTATCGGTTTCTTGTTCGATTATTATGTTTTAATACGTGCTAGAGAGTAAACTCGTCGAGTTCGGAGGCATGGTGTCGTCTTGGTTGCTCTTTCTATAGAGTTGTATGATACTTTCTCTTATTTGATTTGTTCCTAATGGTCAAAATGCTGATATTGGTTGCTCTACACTTTTCAGAGGCAACCCCAAAAACAATTATGAGAACAATGGGAGTAAAAGGCCTCACCCTTTATCACTTAAAATCACACCTCCAGGTTTGTTCATATCGCCATTATACCGACCGGGCCGAGCCGAGCCCGACACTCGTCTTGTTTTGTTCTATTTGTGTATTCGGGTCGGGGGTTGGGTTG
AGTGACCGTCTACGCTTGAGCATTGTAGATATTTACATTATATGTTTTCTTGATGCCTTGTGCTAGCAGAAATATCGGCTTGGGAAGCAATCGTTCAAGGAATCGACGGAGAACTCGAAGGACGGTACGAACTTCCTCTTCAAAACAGTTAGGACTTGGCAGTTTCCTTTTCTTGTTTTGTATGGAATGAAGTAGCAAATAGTTCAAACTGTGGGGCATGCTTACTCTTTTTTATTTAGAGAATCATACATCGACGTACCCGAACCACCGGTTTCGGTTTTTTGCTGTATATCGTCTGTGTGAAGCTTAAACAACTCATAGATATGGCCTTTCAGCTTCTTGCATTGCAGAAAGTCAAGAAACAAGTTCGTCATCATCGCCATCATCGAAAATAATAGCTCGAGATTTAAACGAGTATGCTCGACTTTTTCGTTTTACGTAATGGTTTTCCATGCTCTATATATGTCTTGCATTGAGTAGTTTTGTTTTTCGAACAGTAGTTTCCAAGTCACTGAGGCGTTACGAGCACAGATGGAGGTTCAAAGAAGGCTGCACGAGCAGCTTGAGGTTAGTCGCTCGTTCGTTCTTCCTGAAGAAACTCGAATTTGTAACGGGCTTTCTCCATCCTCTTTGGGCGTTCTCTTTCGGGCTTCAGCTCAAGGTTTTTAAAACGTGTTTGCTAGGGAGAGGCTTTCACAACACCCTTCTAAAGGGTGTTTCGTTCTCTTCCCCAACCGATGTGGGATCTCACAATCCACCCCTTCAGGGCCCAACGTCCTCACTGGCACTTGTTCCTTTCTTGAATCAATGTGGGACCCCCACCAACTTCACCCTCCTTCGGGGGCCCAGTGTCCTTGCTGGCACACCACCCGTGTCTACCCCCTTCGGAGAACAACCTCCTTGCTGGCACATTGCTCGGTGTCTGGCTTTGATACCATTTGTAACGGCCCAAGCCCACCGCTAGCAAATATTGTCCTCTTTAGGCTTTCCCCTTCGGGTTTCCCCTCAAGGTTTTTAAAACGCGTCTACTAGGGAGAGGTTTCCACACTCTTATAAAGGGTGTTTCGTTCTCCCCAACCGACGTGGGATCTCACAGAAGATTTATAAAACTCACTGTTTTCCATTTTCAAGTCCATGCTTCTTTTACTTTAAGCGCAGGTGCAGCGTCGACTTCAACTTCGCATTGAAGCACAAGGAAAATACTTGCAGTCGATACTCGAGCGGGCGTGTCGAGCATTGAGCGATCAGGCTGCAGCATCTGCTGGACTCGAAGCTGTACGAGAAGAGCTCTCTGAACTTGCAATGAAAGTAGGCAATGAGTCTAAAGAGATGGCACCCTTGGAGGCTCAAAAAGTGCTTCCCTTTTCAGAACTTGCTGCTGCTCTAGAAAATCCAAAGGCTCCTACGGTTATGCCTCGAGTAGGCGATTGCTCGATGGATAGCTGCTTGACATCAGCTGGAAGCCCGGTTTCCCCGATCGGAGTAGGATCAACTACCACCGGAATGAAGAGACCGAGACCAGTTTTTAGTCATGGAGATTCAATGGCGCTCGAGGGCAATGCTCGACACGATGTCGAATGGATGATGAGATGAAAATTAGGACTTTGCCTTAAAAAGTACAGCCTTTTCTTCTGAGCTACTTGCTCTCTTGTTCACTTCTGTAATGAAAGTATTATACAATCTGTTTTGTGGCTTGTAAATTCTTCGTAATCAGACGCTTTTGGCTTCTTTGATCTTCATTTACATCTCTAACTTAACATGGAGGGCAAAATGGTCAATTTCAAAAGTTATTCTTACATTAA

mRNA sequence

GTGTCCACTCTCAGTAAAGCTTTATTATTTTTTGGGTTTAAAAAGCAAAAAAGATTCCATGGACGCCATGGATTTCCATGGATTCAAGCTCTAGAGAGAGAAAGAGAAAAAGTGGGTGTTTTCTCTGAGGGATGATTTGGTCAGCAAAATCAAGTTCTTCTTCTTCTTCTTCTTCTTCTTCAAATTCACTTCTTTTTTTTATTAAAAGATTTTAAAAAGAATTTTCAAAGAAAAAAAGAAGCTCTAAAATCAATCACATGGCAGCAGGTTGCCTTTCTGATTTAGCCTTCTACCCACGCTTGCACAGACTCATACTGACAAAGACAGAACACAGAGAAAAAGAAAACAAAAACTTGGGACAGAGACTGCGCACAGACATCGCTGTCTTGTTTCTCTCTCTTTTAAATTCATTTCATATATATATTTCTTTTCTTGATTGCTCGACCCATTTTCGGGGCTCCGACCCAAAAGGGTCGTTCAATTTTATGGAGTGGATTTTTTGGCTCTCACATTTTCGTTTTTTTTTTTTTTTTTGTTTCACTCGCCGGAGCAAAAATGCTCTCTGGGTTTTCACAGGAACCGGCAGGAATGTACTCTGCCATTCCGGCGCTGCCGATGGACGGCGGCGGCGGCGGCGGCAAGTTTCAGGGTTCTCTAGACGGAACGAATCTTCCTGGAGATGCTTGTTTGGTTCTCACGTCGGATCCCAAACCACGCCTCCGATGGACGGCGGAGCTCCATGAGCGATTCGTCGACGCCGTCACTCAGCTCGGCGGCGCCGACAAGGCAACCCCAAAAACAATTATGAGAACAATGGGAGTAAAAGGCCTCACCCTTTATCACTTAAAATCACACCTCCAGAAATATCGGCTTGGGAAGCAATCGTTCAAGGAATCGACGGAGAACTCGAAGGACGCTTCTTGCATTGCAGAAAGTCAAGAAACAAGTTCGTCATCATCGCCATCATCGAAAATAATAGCTCGAGATTTAAACGATAGTTTCCAAGTCACTGAGGCGTTACGAGCACAGATGGAGGTTCAAAGAAGGCTGCACGAGCAGCTTGAGGTGCAGCGTCGACTTCAACTTCGCATTGAAGCACAAGGAAAATACTTGCAGTCGATACTCGAGCGGGCGTGTCGAGCATTGAGCGATCAGGCTGCAGCATCTGCTGGACTCGAAGCTGTACGAGAAGAGCTCTCTGAACTTGCAATGAAAGTAGGCAATGAGTCTAAAGAGATGGCACCCTTGGAGGCTCAAAAAGTGCTTCCCTTTTCAGAACTTGCTGCTGCTCTAGAAAATCCAAAGGCTCCTACGGTTATGCCTCGAGTAGGCGATTGCTCGATGGATAGCTGCTTGACATCAGCTGGAAGCCCGGTTTCCCCGATCGGAGTAGGATCAACTACCACCGGAATGAAGAGACCGAGACCAGTTTTTAGTCATGGAGATTCAATGGCGCTCGAGGGCAATGCTCGACACGATGTCGAATGGATGATGAGATGAAAATTAGGACTTTGCCTTAAAAAGTACAGCCTTTTCTTCTGAGCTACTTGCTCTCTTGTTCACTTCTGTAATGAAAGTATTATACAATCTGTTTTGTGGCTTGTAAATTCTTCGTAATCAGACGCTTTTGGCTTCTTTGATCTTCATTTACATCTCTAACTTAACATGGAGGGCAAAATGGTCAATTTCAAAAGTTATTCTTACATTAA

Coding sequence (CDS)

ATGCTCTCTGGGTTTTCACAGGAACCGGCAGGAATGTACTCTGCCATTCCGGCGCTGCCGATGGACGGCGGCGGCGGCGGCGGCAAGTTTCAGGGTTCTCTAGACGGAACGAATCTTCCTGGAGATGCTTGTTTGGTTCTCACGTCGGATCCCAAACCACGCCTCCGATGGACGGCGGAGCTCCATGAGCGATTCGTCGACGCCGTCACTCAGCTCGGCGGCGCCGACAAGGCAACCCCAAAAACAATTATGAGAACAATGGGAGTAAAAGGCCTCACCCTTTATCACTTAAAATCACACCTCCAGAAATATCGGCTTGGGAAGCAATCGTTCAAGGAATCGACGGAGAACTCGAAGGACGCTTCTTGCATTGCAGAAAGTCAAGAAACAAGTTCGTCATCATCGCCATCATCGAAAATAATAGCTCGAGATTTAAACGATAGTTTCCAAGTCACTGAGGCGTTACGAGCACAGATGGAGGTTCAAAGAAGGCTGCACGAGCAGCTTGAGGTGCAGCGTCGACTTCAACTTCGCATTGAAGCACAAGGAAAATACTTGCAGTCGATACTCGAGCGGGCGTGTCGAGCATTGAGCGATCAGGCTGCAGCATCTGCTGGACTCGAAGCTGTACGAGAAGAGCTCTCTGAACTTGCAATGAAAGTAGGCAATGAGTCTAAAGAGATGGCACCCTTGGAGGCTCAAAAAGTGCTTCCCTTTTCAGAACTTGCTGCTGCTCTAGAAAATCCAAAGGCTCCTACGGTTATGCCTCGAGTAGGCGATTGCTCGATGGATAGCTGCTTGACATCAGCTGGAAGCCCGGTTTCCCCGATCGGAGTAGGATCAACTACCACCGGAATGAAGAGACCGAGACCAGTTTTTAGTCATGGAGATTCAATGGCGCTCGAGGGCAATGCTCGACACGATGTCGAATGGATGATGAGATGA

Protein sequence

MLSGFSQEPAGMYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKAPTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGMKRPRPVFSHGDSMALEGNARHDVEWMMR
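
The protein sequence above should be the conceptual translation of the CDS. The following sketch checks this on the first 30 and last 18 nucleotides of the CDS, assuming the standard genetic code (the page does not say which translation table was used). The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCTCTCTGGGTTTTCACAGGAACCGGCA"  # first 30 nt of the CDS
cds_end = "GAATGGATGATGAGATGA"                # last 18 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above. The same function can be applied to the full CDS.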
BLAST of Cp4.1LG03g18020 vs. Swiss-Prot
Match: PHL2_ARATH (Protein PHR1-LIKE 2 OS=Arabidopsis thaliana GN=PHL2 PE=1 SV=1)

HSP 1 Score: 332.0 bits (850), Expect = 6.9e-90
Identity = 187/271 (69.00%), Positives = 219/271 (80.81%), Query Frame = 1

Query: 12  MYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQ 71
           MYSAI +LP+DGG  GG + G LDGTNLPGDACLVLT+DPKPRLRWT ELHERFVDAVTQ
Sbjct: 1   MYSAIRSLPLDGGHVGGDYHGPLDGTNLPGDACLVLTTDPKPRLRWTTELHERFVDAVTQ 60

Query: 72  LGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQETS 131
           LGG DKATPKTIMRTMGVKGLTLYHLKSHLQK+RLG+Q+ KESTENSKDASC+ ESQ+T 
Sbjct: 61  LGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKFRLGRQAGKESTENSKDASCVGESQDTG 120

Query: 132 SSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE 191
           SSS+ S ++  ++ N+ +QVTEALRAQMEVQRRLH+QLEVQRRLQLRIEAQGKYLQSILE
Sbjct: 121 SSSTSSMRMAQQEQNEGYQVTEALRAQMEVQRRLHDQLEVQRRLQLRIEAQGKYLQSILE 180

Query: 192 RACRALSDQAAASAGLEAVREELSELAMKVGNESK--EMAPLEAQKVL---PFSELAAAL 251
           +AC+A  +QAA  AGLEA REELSELA+KV N S+   +   +A K++     SELA A+
Sbjct: 181 KACKAFDEQAATFAGLEAAREELSELAIKVSNSSQGTSVPYFDATKMMMMPSLSELAVAI 240

Query: 252 ENPKAPTVMPRVGDCSMDSCLTSA--GSPVS 276
           +N    T      +CS++S LTS   GS +S
Sbjct: 241 DNKNNITT-----NCSVESSLTSITHGSSIS 266
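
The identity percentage in the HSP summary is the number of identical aligned columns divided by the alignment length. A minimal sketch of that calculation follows, applied to the first 60-column row of the PHL2_ARATH alignment above. The helper names `identity_stats` and `percent` are hypothetical, and the sketch does not compute positives, which would need a substitution matrix.

```python
def identity_stats(query, sbjct):
    """Count identical columns in two equal-length aligned strings ('-' = gap)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    matches = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return matches, len(query)

def percent(matches, length):
    """Percentage rounded to two decimals, as printed in the summary lines."""
    return round(100.0 * matches / length, 2)

# First row of the PHL2_ARATH alignment (query 12-71, subject 1-60).
query_row = "MYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQ"
sbjct_row = "MYSAIRSLPLDGGHVGGDYHGPLDGTNLPGDACLVLTTDPKPRLRWTTELHERFVDAVTQ"
matches, length = identity_stats(query_row, sbjct_row)
print(matches, length, percent(matches, length))

# Whole-HSP counts taken from the summary line: 187/271 and 219/271.
print(percent(187, 271), percent(219, 271))
```

The second print reproduces the 69.00% and 80.81% figures reported for the full HSP.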

BLAST of Cp4.1LG03g18020 vs. Swiss-Prot
Match: PHL3_ARATH (Protein PHR1-LIKE 3 OS=Arabidopsis thaliana GN=PHL3 PE=1 SV=1)

HSP 1 Score: 317.0 bits (811), Expect = 2.3e-85
Identity = 194/310 (62.58%), Positives = 228/310 (73.55%), Query Frame = 1

Query: 12  MYSAI-PALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVT 71
           MYSAI  +LP+DG  G        DGTNLP DACLVLT+DPKPRLRWT+ELHERFVDAVT
Sbjct: 1   MYSAIRSSLPLDGSLGDYS-----DGTNLPIDACLVLTTDPKPRLRWTSELHERFVDAVT 60

Query: 72  QLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQET 131
           QLGG DKATPKTIMRTMGVKGLTLYHLKSHLQK+RLG+QS KES +NSKD SC+AESQ+T
Sbjct: 61  QLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKFRLGRQSCKESIDNSKDVSCVAESQDT 120

Query: 132 SSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL 191
            SSS+ S ++ A++ N+S+QVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL
Sbjct: 121 GSSSTSSLRLAAQEQNESYQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL 180

Query: 192 ERACRALSDQAAASAGLEAVREELSELAMKV----GNESKEMAPLEAQKVLP-FSELAAA 251
           E+AC+A+ +QA A AGLEA REELSELA+K     G +         + ++P  SELA A
Sbjct: 181 EKACKAIEEQAVAFAGLEAAREELSELAIKASITNGCQGTTSTFDTTKMMIPSLSELAVA 240

Query: 252 LENPKAPTVMPRVGDCSMDSCLTSA--GSPVSPIGVGSTTTGMKRPRPVFSHGDSMALEG 311
           +E+           +CS +S LTS+  GSPV      S     KR R VF +GDS+ +  
Sbjct: 241 IEHK---------NNCSAESSLTSSTVGSPV------SAALMKKRQRGVFGNGDSVVV-- 286

Query: 312 NARHDVEWMM 314
              HD  W+M
Sbjct: 301 --GHDAGWVM 286

BLAST of Cp4.1LG03g18020 vs. Swiss-Prot
Match: APL_ARATH (Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=1 SV=2)

HSP 1 Score: 192.2 bits (487), Expect = 8.6e-48
Identity = 107/165 (64.85%), Positives = 127/165 (76.97%), Query Frame = 1

Query: 41  GDACLVLTSDPKPRLRWTAELHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSH 100
           GD+ LVLT+DPKPRLRWT ELHERFVDAV QLGG DKATPKTIMR MGVKGLTLYHLKSH
Sbjct: 23  GDSGLVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKSH 82

Query: 101 LQKYRLGKQSFKESTENSKDASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQME 160
           LQK+RLGKQ  KE  ++S      A + +   + + SS +++R++N+          QME
Sbjct: 83  LQKFRLGKQPHKEYGDHSTKEGSRASAMDIQRNVASSSGMMSRNMNE---------MQME 142

Query: 161 VQRRLHEQLEVQRRLQLRIEAQGKYLQSILERACRALSDQAAASA 206
           VQRRLHEQLEVQR LQLRIEAQGKY+QSILERAC+ L+ +  A+A
Sbjct: 143 VQRRLHEQLEVQRHLQLRIEAQGKYMQSILERACQTLAGENMAAA 178

BLAST of Cp4.1LG03g18020 vs. Swiss-Prot
Match: PHL9_ARATH (Myb-related protein 2 OS=Arabidopsis thaliana GN=MYR2 PE=1 SV=1)

HSP 1 Score: 191.4 bits (485), Expect = 1.5e-47
Identity = 115/236 (48.73%), Positives = 150/236 (63.56%), Query Frame = 1

Query: 38  NLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHL 97
           N PGD+ L+L++D KPRL+WT +LHERF++AV QLGGADKATPKTIM+ MG+ GLTLYHL
Sbjct: 31  NSPGDSGLILSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTIMKVMGIPGLTLYHL 90

Query: 98  KSHLQKYRLGKQSFKESTENSKDASCIAESQETSSSS---SPSSKIIARDLNDSFQVTEA 157
           KSHLQKYRL K    ++  +      +   +E +  +      +  I    N +  + EA
Sbjct: 91  KSHLQKYRLSKNLNGQANNSFNKIGIMTMMEEKTPDADEIQSENLSIGPQPNKNSPIGEA 150

Query: 158 LRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILERACRALSDQAAASAGLEAVREEL 217
           L+ Q+EVQRRLHEQLEVQR LQLRIEAQGKYLQS+LE+A   L  Q   +AG+EA + +L
Sbjct: 151 LQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGAAGIEAAKVQL 210

Query: 218 SELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKAPTVMPRVGDCSMDSCLTSA 271
           SEL  KV  E    + LE +++            P          DCS++SCLTS+
Sbjct: 211 SELVSKVSAEYPNSSFLEPKELQNLCSQQMQTNYPP---------DCSLESCLTSS 257

BLAST of Cp4.1LG03g18020 vs. Swiss-Prot
Match: PHLA_ARATH (Myb-related protein 1 OS=Arabidopsis thaliana GN=MYR1 PE=1 SV=1)

HSP 1 Score: 184.5 bits (467), Expect = 1.8e-45
Identity = 117/236 (49.58%), Positives = 149/236 (63.14%), Query Frame = 1

Query: 38  NLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHL 97
           N  GD+ L+L++D KPRL+WT +LHERFV+AV QLGG DKATPKTIM+ MG+ GLTLYHL
Sbjct: 31  NGTGDSGLILSTDAKPRLKWTPDLHERFVEAVNQLGGGDKATPKTIMKVMGIPGLTLYHL 90

Query: 98  KSHLQKYRLGKQ---SFKESTENSKDASCIAESQETSSSSSPSSKIIARDLNDSFQVTEA 157
           KSHLQKYRL K        S   +   + + E+      S   S  I    + +  +++A
Sbjct: 91  KSHLQKYRLSKNLNGQANSSLNKTSVMTMVEENPPEVDESHSESLSIGPQPSMNLPISDA 150

Query: 158 LRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILERACRALSDQAAASAGLEAVREEL 217
           L+ Q+EVQRRLHEQLEVQR LQLRIEAQGKYLQSILE+A   L  Q   +AG+EA + +L
Sbjct: 151 LQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEKAQETLGRQNLGAAGIEATKAQL 210

Query: 218 SELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKAPTVMPRVGDCSMDSCLTSA 271
           SEL  KV  +  + + LE +      EL          T  P   + S+DSCLTS+
Sbjct: 211 SELVSKVSADYPDSSFLEPK------ELQNLHHQQMQKTYPP---NSSLDSCLTSS 257
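
Each hit above carries a two-line HSP summary (score and E-value, then identity and positives counts). The sketch below shows one way to pull those numbers out with regular expressions. The function name `parse_hsp` and the dictionary keys are assumptions made for illustration, not part of any BLAST tool.

```python
import re

SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)"
)
COUNT_RE = re.compile(r"(?P<label>\w+) = (?P<num>\d+)/(?P<den>\d+)")

def parse_hsp(score_line, stats_line):
    """Return bit score, raw score, E-value and match counts for one HSP."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError(f"unrecognized score line: {score_line!r}")
    record = {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
    }
    for c in COUNT_RE.finditer(stats_line):
        label = c["label"].lower()
        if label.startswith("pos"):
            # Normalizes spelling variants of the label (e.g. a dropped letter).
            label = "positives"
        record[label] = (int(c["num"]), int(c["den"]))
    return record

hsp = parse_hsp(
    "HSP 1 Score: 332.0 bits (850), Expect = 6.9e-90",
    "Identity = 187/271 (69.00%), Positives = 219/271 (80.81%), Query Frame = 1",
)
print(hsp)
```

Keeping the counts as integer pairs rather than percentages avoids rounding differences when hits are compared later.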

BLAST of Cp4.1LG03g18020 vs. TrEMBL
Match: A0A0A0L435_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G130900 PE=4 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 1.3e-151
Identity = 288/309 (93.20%), Positives = 295/309 (95.47%), Query Frame = 1

Query: 1   MLSGFSQEPAGMYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60
           MLSGFSQEPAGMYS I ALPMD  GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE
Sbjct: 1   MLSGFSQEPAGMYSTITALPMD--GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60

Query: 61  LHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120
           LHERFVDAVTQLGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD
Sbjct: 61  LHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120

Query: 121 ASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIE 180
           ASCIAESQETSSSSSPSS+I+A+DLND FQVTEALR QMEVQRRLHEQLEVQR LQLRIE
Sbjct: 121 ASCIAESQETSSSSSPSSRIMAQDLNDGFQVTEALRVQMEVQRRLHEQLEVQRHLQLRIE 180

Query: 181 AQGKYLQSILERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFS 240
           AQGKYLQSILERAC+ALSDQAAASAGLEA REELSELA+KV N+SKEMAPLE QKVLPFS
Sbjct: 181 AQGKYLQSILERACQALSDQAAASAGLEAAREELSELAIKVSNDSKEMAPLETQKVLPFS 240

Query: 241 ELAAALENPKAPTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGMKRPRPVFSHGDSMA 300
           ELAAALEN KAPTVMPR+GDCSMDSCLTSAGSPVSPIGVGST T MKRPRPVFSHGDSMA
Sbjct: 241 ELAAALENRKAPTVMPRIGDCSMDSCLTSAGSPVSPIGVGSTATAMKRPRPVFSHGDSMA 300

Query: 301 LEGNARHDV 310
           LEGNARHDV
Sbjct: 301 LEGNARHDV 307

BLAST of Cp4.1LG03g18020 vs. TrEMBL
Match: F2QKV9_ROSRU (Putative MYB transcription factor OS=Rosa rugosa GN=myb6 PE=2 SV=1)

HSP 1 Score: 435.3 bits (1118), Expect = 6.5e-119
Identity = 232/304 (76.32%), Positives = 266/304 (87.50%), Query Frame = 1

Query: 12  MYSAIPALPMDGGG-GGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVT 71
           MYSA+ +LP+DGG  G G+F GSLDGTNLPGDACLVLT+DPKPRLRWTAELHERFVDAVT
Sbjct: 1   MYSALHSLPLDGGVCGHGEFSGSLDGTNLPGDACLVLTTDPKPRLRWTAELHERFVDAVT 60

Query: 72  QLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQET 131
           QLGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQ+T
Sbjct: 61  QLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQDT 120

Query: 132 SSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL 191
            SS++ SS++IA+DLND +QVTEALR QMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL
Sbjct: 121 GSSAT-SSRVIAQDLNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL 180

Query: 192 ERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFSELAAALENPK 251
           E+AC+AL+DQAA +AGLEA +EELSELA+KV ++ + MAPL+  K+   SE+AAA+EN  
Sbjct: 181 EKACKALNDQAATAAGLEAAKEELSELAIKVSSDCQGMAPLDTIKMQSLSEIAAAIENKS 240

Query: 252 APTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGM-KRPRPVFSHGDSMALEGNARHDV 311
           A  V+ R+G+CS+DSCLTS GSP SP+G+ S    M KR RP FS+GDS+ LEGN R +V
Sbjct: 241 ASNVLARIGNCSVDSCLTSTGSPGSPMGMSSLAAAMKKRQRPFFSNGDSLPLEGNMRQEV 300

Query: 312 EWMM 314
           EWMM
Sbjct: 301 EWMM 303

BLAST of Cp4.1LG03g18020 vs. TrEMBL
Match: B9RWI5_RICCO (Transcription factor, putative OS=Ricinus communis GN=RCOM_1019810 PE=4 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 8.8e-116
Identity = 228/303 (75.25%), Positives = 260/303 (85.81%), Query Frame = 1

Query: 12  MYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQ 71
           MYSAI +LP+DG G    FQGSLDGTNLPGDACLVLT+DPKPRLRWTAELHERFVDAVTQ
Sbjct: 1   MYSAIHSLPLDGHG---DFQGSLDGTNLPGDACLVLTTDPKPRLRWTAELHERFVDAVTQ 60

Query: 72  LGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQETS 131
           LGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLG+QS KES ENSKDAS +AESQ+T 
Sbjct: 61  LGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGRQSCKESNENSKDAS-VAESQDTG 120

Query: 132 SSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE 191
           SS+S SS++IA+D+ND +QVTEALR QMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE
Sbjct: 121 SSTSTSSRMIAQDVNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE 180

Query: 192 RACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKA 251
           +AC+AL+DQAA SAGLEA REELSELA+KV NE + + P +  K+   SELA ALE+   
Sbjct: 181 KACKALNDQAAVSAGLEAAREELSELAIKVSNECQGIVPADNMKMPSLSELAVALESKST 240

Query: 252 PTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGM-KRPRPVFSHGDSMALEGNARHDVE 311
             +  R+GDCS++SCLTS GSPVSP+GVGS T  + KRPRP+F +GDS+ LEG+ R +VE
Sbjct: 241 SNLPARIGDCSVESCLTSTGSPVSPMGVGSHTASIKKRPRPIFGNGDSLPLEGSMRQEVE 299

Query: 312 WMM 314
           WMM
Sbjct: 301 WMM 299

BLAST of Cp4.1LG03g18020 vs. TrEMBL
Match: B9GFX5_POPTR (Myb family transcription factor family protein OS=Populus trichocarpa GN=POPTR_0001s32200g PE=4 SV=2)

HSP 1 Score: 419.5 bits (1077), Expect = 3.7e-114
Identity = 228/303 (75.25%), Positives = 258/303 (85.15%), Query Frame = 1

Query: 12  MYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQ 71
           MYSAI +LP+DG G    FQ +LDGTNLPGDACLVLT+DPKPRLRWTAELHERFVDAV Q
Sbjct: 1   MYSAIHSLPLDGHGD---FQAALDGTNLPGDACLVLTTDPKPRLRWTAELHERFVDAVAQ 60

Query: 72  LGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQETS 131
           LGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQS KEST+NSKDAS +AESQ+T 
Sbjct: 61  LGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSCKESTDNSKDAS-VAESQDTG 120

Query: 132 SSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE 191
           SS+S SS++IA+DLND +QVTEALR QMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE
Sbjct: 121 SSTSASSRMIAQDLNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE 180

Query: 192 RACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKA 251
           +AC+AL+DQA A+AGLEA REELSELA+KV NE   +APL+  K+   SELAAALEN  A
Sbjct: 181 KACKALNDQAVATAGLEAAREELSELAIKVSNERAGIAPLDTMKMPSISELAAALENKHA 240

Query: 252 PTVMPRVGDCSMDSCLTSAGSPVSPIGVGS-TTTGMKRPRPVFSHGDSMALEGNARHDVE 311
             V  RVGDCS++SCLTS GSPVSP+GVG+   +  KR RPVF +GDS+  +GN + +VE
Sbjct: 241 SNVPARVGDCSVESCLTSTGSPVSPMGVGAQVASTKKRSRPVFGNGDSLPFDGNIQQEVE 299

Query: 312 WMM 314
           W M
Sbjct: 301 WTM 299

BLAST of Cp4.1LG03g18020 vs. TrEMBL
Match: F6GWG5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0029g00060 PE=4 SV=1)

HSP 1 Score: 419.1 bits (1076), Expect = 4.8e-114
Identity = 228/303 (75.25%), Positives = 254/303 (83.83%), Query Frame = 1

Query: 12  MYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQ 71
           MYSAI +LP+DGG     FQGSLDGTNLPGDACLVLT+DPKPRLRWTAELHERFVDAVTQ
Sbjct: 1   MYSAIHSLPLDGGVAHADFQGSLDGTNLPGDACLVLTTDPKPRLRWTAELHERFVDAVTQ 60

Query: 72  LGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQETS 131
           LGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQS KE T+NS   SCIAESQ+T 
Sbjct: 61  LGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSCKELTDNS---SCIAESQDTG 120

Query: 132 SSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE 191
           SSS+ SS++I +DLND +QVTEALR QMEVQRRLHEQLEVQR LQLRIEAQGKYLQSILE
Sbjct: 121 SSSTSSSRMIPQDLNDGYQVTEALRVQMEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILE 180

Query: 192 RACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKA 251
           +AC+AL DQAAA+AGLEA REELSEL +KV N+ + M PLE  K+   SE+AAALEN  A
Sbjct: 181 KACKALKDQAAATAGLEAAREELSELQIKVSNDCEGMNPLETIKMPCLSEIAAALENKNA 240

Query: 252 PTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGM-KRPRPVFSHGDSMALEGNARHDVE 311
             V  R+GDCS+DSCLTS+GSP+SP+G  S    M KR RP+F+ G S+ALE N R DVE
Sbjct: 241 VNVPARIGDCSVDSCLTSSGSPISPMGASSRGAVMKKRSRPLFTGGSSLALENNMRQDVE 300

Query: 312 WMM 314
           WMM
Sbjct: 301 WMM 300

BLAST of Cp4.1LG03g18020 vs. TAIR10
Match: AT3G24120.2 (AT3G24120.2 Homeodomain-like superfamily protein)

HSP 1 Score: 326.6 bits (836), Expect = 1.6e-89
Identity = 187/274 (68.25%), Positives = 219/274 (79.93%), Query Frame = 1

Query: 12  MYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQ 71
           MYSAI +LP+DGG  GG + G LDGTNLPGDACLVLT+DPKPRLRWT ELHERFVDAVTQ
Sbjct: 1   MYSAIRSLPLDGGHVGGDYHGPLDGTNLPGDACLVLTTDPKPRLRWTTELHERFVDAVTQ 60

Query: 72  LGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQETS 131
           LGG DKATPKTIMRTMGVKGLTLYHLKSHLQK+RLG+Q+ KESTENSKDASC+ ESQ+T 
Sbjct: 61  LGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKFRLGRQAGKESTENSKDASCVGESQDTG 120

Query: 132 SSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLE---VQRRLQLRIEAQGKYLQS 191
           SSS+ S ++  ++ N+ +QVTEALRAQMEVQRRLH+QLE   VQRRLQLRIEAQGKYLQS
Sbjct: 121 SSSTSSMRMAQQEQNEGYQVTEALRAQMEVQRRLHDQLEYGQVQRRLQLRIEAQGKYLQS 180

Query: 192 ILERACRALSDQAAASAGLEAVREELSELAMKVGNESK--EMAPLEAQKVL---PFSELA 251
           ILE+AC+A  +QAA  AGLEA REELSELA+KV N S+   +   +A K++     SELA
Sbjct: 181 ILEKACKAFDEQAATFAGLEAAREELSELAIKVSNSSQGTSVPYFDATKMMMMPSLSELA 240

Query: 252 AALENPKAPTVMPRVGDCSMDSCLTSA--GSPVS 276
            A++N    T      +CS++S LTS   GS +S
Sbjct: 241 VAIDNKNNITT-----NCSVESSLTSITHGSSIS 269

BLAST of Cp4.1LG03g18020 vs. TAIR10
Match: AT4G13640.2 (AT4G13640.2 Homeodomain-like superfamily protein)

HSP 1 Score: 311.6 bits (797), Expect = 5.5e-85
Identity = 194/313 (61.98%), Positives = 228/313 (72.84%), Query Frame = 1

Query: 12  MYSAI-PALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVT 71
           MYSAI  +LP+DG  G        DGTNLP DACLVLT+DPKPRLRWT+ELHERFVDAVT
Sbjct: 1   MYSAIRSSLPLDGSLGDYS-----DGTNLPIDACLVLTTDPKPRLRWTSELHERFVDAVT 60

Query: 72  QLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQET 131
           QLGG DKATPKTIMRTMGVKGLTLYHLKSHLQK+RLG+QS KES +NSKD SC+AESQ+T
Sbjct: 61  QLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKFRLGRQSCKESIDNSKDVSCVAESQDT 120

Query: 132 SSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLE---VQRRLQLRIEAQGKYLQ 191
            SSS+ S ++ A++ N+S+QVTEALRAQMEVQRRLHEQLE   VQRRLQLRIEAQGKYLQ
Sbjct: 121 GSSSTSSLRLAAQEQNESYQVTEALRAQMEVQRRLHEQLEYTQVQRRLQLRIEAQGKYLQ 180

Query: 192 SILERACRALSDQAAASAGLEAVREELSELAMKV----GNESKEMAPLEAQKVLP-FSEL 251
           SILE+AC+A+ +QA A AGLEA REELSELA+K     G +         + ++P  SEL
Sbjct: 181 SILEKACKAIEEQAVAFAGLEAAREELSELAIKASITNGCQGTTSTFDTTKMMIPSLSEL 240

Query: 252 AAALENPKAPTVMPRVGDCSMDSCLTSA--GSPVSPIGVGSTTTGMKRPRPVFSHGDSMA 311
           A A+E+           +CS +S LTS+  GSPV      S     KR R VF +GDS+ 
Sbjct: 241 AVAIEHK---------NNCSAESSLTSSTVGSPV------SAALMKKRQRGVFGNGDSVV 289

Query: 312 LEGNARHDVEWMM 314
           +     HD  W+M
Sbjct: 301 V----GHDAGWVM 289

BLAST of Cp4.1LG03g18020 vs. TAIR10
Match: AT1G79430.2 (AT1G79430.2 Homeodomain-like superfamily protein)

HSP 1 Score: 192.2 bits (487), Expect = 4.8e-49
Identity = 107/165 (64.85%), Positives = 127/165 (76.97%), Query Frame = 1

Query: 41  GDACLVLTSDPKPRLRWTAELHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSH 100
           GD+ LVLT+DPKPRLRWT ELHERFVDAV QLGG DKATPKTIMR MGVKGLTLYHLKSH
Sbjct: 23  GDSGLVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKSH 82

Query: 101 LQKYRLGKQSFKESTENSKDASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQME 160
           LQK+RLGKQ  KE  ++S      A + +   + + SS +++R++N+          QME
Sbjct: 83  LQKFRLGKQPHKEYGDHSTKEGSRASAMDIQRNVASSSGMMSRNMNE---------MQME 142

Query: 161 VQRRLHEQLEVQRRLQLRIEAQGKYLQSILERACRALSDQAAASA 206
           VQRRLHEQLEVQR LQLRIEAQGKY+QSILERAC+ L+ +  A+A
Sbjct: 143 VQRRLHEQLEVQRHLQLRIEAQGKYMQSILERACQTLAGENMAAA 178

BLAST of Cp4.1LG03g18020 vs. TAIR10
Match: AT3G04030.3 (AT3G04030.3 Homeodomain-like superfamily protein)

HSP 1 Score: 191.4 bits (485), Expect = 8.2e-49
Identity = 115/236 (48.73%), Positives = 150/236 (63.56%), Query Frame = 1

Query: 38  NLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHL 97
           N PGD+ L+L++D KPRL+WT +LHERF++AV QLGGADKATPKTIM+ MG+ GLTLYHL
Sbjct: 31  NSPGDSGLILSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTIMKVMGIPGLTLYHL 90

Query: 98  KSHLQKYRLGKQSFKESTENSKDASCIAESQETSSSS---SPSSKIIARDLNDSFQVTEA 157
           KSHLQKYRL K    ++  +      +   +E +  +      +  I    N +  + EA
Sbjct: 91  KSHLQKYRLSKNLNGQANNSFNKIGIMTMMEEKTPDADEIQSENLSIGPQPNKNSPIGEA 150

Query: 158 LRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILERACRALSDQAAASAGLEAVREEL 217
           L+ Q+EVQRRLHEQLEVQR LQLRIEAQGKYLQS+LE+A   L  Q   +AG+EA + +L
Sbjct: 151 LQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGAAGIEAAKVQL 210

Query: 218 SELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKAPTVMPRVGDCSMDSCLTSA 271
           SEL  KV  E    + LE +++            P          DCS++SCLTS+
Sbjct: 211 SELVSKVSAEYPNSSFLEPKELQNLCSQQMQTNYPP---------DCSLESCLTSS 257

BLAST of Cp4.1LG03g18020 vs. TAIR10
Match: AT5G18240.1 (AT5G18240.1 myb-related protein 1)

HSP 1 Score: 184.5 bits (467), Expect = 1.0e-46
Identity = 117/236 (49.58%), Positives = 149/236 (63.14%), Query Frame = 1

Query: 38  NLPGDACLVLTSDPKPRLRWTAELHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHL 97
           N  GD+ L+L++D KPRL+WT +LHERFV+AV QLGG DKATPKTIM+ MG+ GLTLYHL
Sbjct: 31  NGTGDSGLILSTDAKPRLKWTPDLHERFVEAVNQLGGGDKATPKTIMKVMGIPGLTLYHL 90

Query: 98  KSHLQKYRLGKQ---SFKESTENSKDASCIAESQETSSSSSPSSKIIARDLNDSFQVTEA 157
           KSHLQKYRL K        S   +   + + E+      S   S  I    + +  +++A
Sbjct: 91  KSHLQKYRLSKNLNGQANSSLNKTSVMTMVEENPPEVDESHSESLSIGPQPSMNLPISDA 150

Query: 158 LRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILERACRALSDQAAASAGLEAVREEL 217
           L+ Q+EVQRRLHEQLEVQR LQLRIEAQGKYLQSILE+A   L  Q   +AG+EA + +L
Sbjct: 151 LQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEKAQETLGRQNLGAAGIEATKAQL 210

Query: 218 SELAMKVGNESKEMAPLEAQKVLPFSELAAALENPKAPTVMPRVGDCSMDSCLTSA 271
           SEL  KV  +  + + LE +      EL          T  P   + S+DSCLTS+
Sbjct: 211 SELVSKVSADYPDSSFLEPK------ELQNLHHQQMQKTYPP---NSSLDSCLTSS 257

BLAST of Cp4.1LG03g18020 vs. NCBI nr
Match: gi|659075830|ref|XP_008438354.1| (PREDICTED: myb family transcription factor APL isoform X1 [Cucumis melo])

HSP 1 Score: 558.9 bits (1439), Expect = 5.6e-156
Identity = 294/313 (93.93%), Positives = 301/313 (96.17%), Query Frame = 1

Query: 1   MLSGFSQEPAGMYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60
           MLSGFSQEPAGMYSAIPALPMD  GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE
Sbjct: 1   MLSGFSQEPAGMYSAIPALPMD--GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60

Query: 61  LHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120
           LHERFVDAVTQLGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD
Sbjct: 61  LHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120

Query: 121 ASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIE 180
           ASCIAESQETSSSSSPSS+I+A+DLND FQVTEALR QMEVQRRLHEQLEVQR LQLRIE
Sbjct: 121 ASCIAESQETSSSSSPSSRIMAQDLNDGFQVTEALRVQMEVQRRLHEQLEVQRHLQLRIE 180

Query: 181 AQGKYLQSILERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFS 240
           AQGKYLQSILERAC+ALSDQAAASAGLEA REELSELA+KV N+SKEMAPLE QKVLPFS
Sbjct: 181 AQGKYLQSILERACQALSDQAAASAGLEAAREELSELAIKVSNDSKEMAPLETQKVLPFS 240

Query: 241 ELAAALENPKAPTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGMKRPRPVFSHGDSMA 300
           ELAAALEN KAPTVMPR+GDCSMDSCLTSAGSPVSPIGVGST T MKRPRPVFSHGDSMA
Sbjct: 241 ELAAALENRKAPTVMPRIGDCSMDSCLTSAGSPVSPIGVGSTATAMKRPRPVFSHGDSMA 300

Query: 301 LEGNARHDVEWMM 314
           LEGNARHDVEWMM
Sbjct: 301 LEGNARHDVEWMM 311

BLAST of Cp4.1LG03g18020 vs. NCBI nr
Match: gi|778677863|ref|XP_004133994.2| (PREDICTED: myb family transcription factor APL isoform X1 [Cucumis sativus])

HSP 1 Score: 543.9 bits (1400), Expect = 1.9e-151
Identity = 288/309 (93.20%), Positives = 295/309 (95.47%), Query Frame = 1

Query: 1   MLSGFSQEPAGMYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60
           MLSGFSQEPAGMYS I ALPMD  GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE
Sbjct: 1   MLSGFSQEPAGMYSTITALPMD--GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60

Query: 61  LHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120
           LHERFVDAVTQLGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD
Sbjct: 61  LHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120

Query: 121 ASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIE 180
           ASCIAESQETSSSSSPSS+I+A+DLND FQVTEALR QMEVQRRLHEQLEVQR LQLRIE
Sbjct: 121 ASCIAESQETSSSSSPSSRIMAQDLNDGFQVTEALRVQMEVQRRLHEQLEVQRHLQLRIE 180

Query: 181 AQGKYLQSILERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFS 240
           AQGKYLQSILERAC+ALSDQAAASAGLEA REELSELA+KV N+SKEMAPLE QKVLPFS
Sbjct: 181 AQGKYLQSILERACQALSDQAAASAGLEAAREELSELAIKVSNDSKEMAPLETQKVLPFS 240

Query: 241 ELAAALENPKAPTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGMKRPRPVFSHGDSMA 300
           ELAAALEN KAPTVMPR+GDCSMDSCLTSAGSPVSPIGVGST T MKRPRPVFSHGDSMA
Sbjct: 241 ELAAALENRKAPTVMPRIGDCSMDSCLTSAGSPVSPIGVGSTATAMKRPRPVFSHGDSMA 300

Query: 301 LEGNARHDV 310
           LEGNARHDV
Sbjct: 301 LEGNARHDV 307

BLAST of Cp4.1LG03g18020 vs. NCBI nr
Match: gi|659075832|ref|XP_008438355.1| (PREDICTED: myb family transcription factor APL isoform X2 [Cucumis melo])

HSP 1 Score: 543.5 bits (1399), Expect = 2.4e-151
Identity = 289/313 (92.33%), Positives = 296/313 (94.57%), Query Frame = 1

Query: 1   MLSGFSQEPAGMYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60
           MLSGFSQEPAGMYSAIPALPMD  GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE
Sbjct: 1   MLSGFSQEPAGMYSAIPALPMD--GGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60

Query: 61  LHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120
           LHERFVDAVTQLGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD
Sbjct: 61  LHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120

Query: 121 ASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIE 180
                ESQETSSSSSPSS+I+A+DLND FQVTEALR QMEVQRRLHEQLEVQR LQLRIE
Sbjct: 121 -----ESQETSSSSSPSSRIMAQDLNDGFQVTEALRVQMEVQRRLHEQLEVQRHLQLRIE 180

Query: 181 AQGKYLQSILERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFS 240
           AQGKYLQSILERAC+ALSDQAAASAGLEA REELSELA+KV N+SKEMAPLE QKVLPFS
Sbjct: 181 AQGKYLQSILERACQALSDQAAASAGLEAAREELSELAIKVSNDSKEMAPLETQKVLPFS 240

Query: 241 ELAAALENPKAPTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGMKRPRPVFSHGDSMA 300
           ELAAALEN KAPTVMPR+GDCSMDSCLTSAGSPVSPIGVGST T MKRPRPVFSHGDSMA
Sbjct: 241 ELAAALENRKAPTVMPRIGDCSMDSCLTSAGSPVSPIGVGSTATAMKRPRPVFSHGDSMA 300

Query: 301 LEGNARHDVEWMM 314
           LEGNARHDVEWMM
Sbjct: 301 LEGNARHDVEWMM 306

BLAST of Cp4.1LG03g18020 vs. NCBI nr
Match: gi|778677866|ref|XP_011650877.1| (PREDICTED: myb family transcription factor APL isoform X2 [Cucumis sativus])

HSP 1 Score: 528.5 bits (1360), Expect = 8.1e-147
Identity = 283/309 (91.59%), Positives = 290/309 (93.85%), Query Frame = 1

Query: 1   MLSGFSQEPAGMYSAIPALPMDGGGGGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60
           MLSGFSQEPAGMYS I ALPMDGGGG  KFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE
Sbjct: 1   MLSGFSQEPAGMYSTITALPMDGGGG--KFQGSLDGTNLPGDACLVLTSDPKPRLRWTAE 60

Query: 61  LHERFVDAVTQLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120
           LHERFVDAVTQLGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD
Sbjct: 61  LHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKD 120

Query: 121 ASCIAESQETSSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIE 180
                ESQETSSSSSPSS+I+A+DLND FQVTEALR QMEVQRRLHEQLEVQR LQLRIE
Sbjct: 121 -----ESQETSSSSSPSSRIMAQDLNDGFQVTEALRVQMEVQRRLHEQLEVQRHLQLRIE 180

Query: 181 AQGKYLQSILERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFS 240
           AQGKYLQSILERAC+ALSDQAAASAGLEA REELSELA+KV N+SKEMAPLE QKVLPFS
Sbjct: 181 AQGKYLQSILERACQALSDQAAASAGLEAAREELSELAIKVSNDSKEMAPLETQKVLPFS 240

Query: 241 ELAAALENPKAPTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGMKRPRPVFSHGDSMA 300
           ELAAALEN KAPTVMPR+GDCSMDSCLTSAGSPVSPIGVGST T MKRPRPVFSHGDSMA
Sbjct: 241 ELAAALENRKAPTVMPRIGDCSMDSCLTSAGSPVSPIGVGSTATAMKRPRPVFSHGDSMA 300

Query: 301 LEGNARHDV 310
           LEGNARHDV
Sbjct: 301 LEGNARHDV 302

BLAST of Cp4.1LG03g18020 vs. NCBI nr
Match: gi|327412613|emb|CCA29095.1| (putative MYB transcription factor [Rosa rugosa])

HSP 1 Score: 435.3 bits (1118), Expect = 9.3e-119
Identity = 232/304 (76.32%), Positives = 266/304 (87.50%), Query Frame = 1

Query: 12  MYSAIPALPMDGGG-GGGKFQGSLDGTNLPGDACLVLTSDPKPRLRWTAELHERFVDAVT 71
           MYSA+ +LP+DGG  G G+F GSLDGTNLPGDACLVLT+DPKPRLRWTAELHERFVDAVT
Sbjct: 1   MYSALHSLPLDGGVCGHGEFSGSLDGTNLPGDACLVLTTDPKPRLRWTAELHERFVDAVT 60

Query: 72  QLGGADKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQET 131
           QLGG DKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQ+T
Sbjct: 61  QLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQSFKESTENSKDASCIAESQDT 120

Query: 132 SSSSSPSSKIIARDLNDSFQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL 191
            SS++ SS++IA+DLND +QVTEALR QMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL
Sbjct: 121 GSSAT-SSRVIAQDLNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSIL 180

Query: 192 ERACRALSDQAAASAGLEAVREELSELAMKVGNESKEMAPLEAQKVLPFSELAAALENPK 251
           E+AC+AL+DQAA +AGLEA +EELSELA+KV ++ + MAPL+  K+   SE+AAA+EN  
Sbjct: 181 EKACKALNDQAATAAGLEAAKEELSELAIKVSSDCQGMAPLDTIKMQSLSEIAAAIENKS 240

Query: 252 APTVMPRVGDCSMDSCLTSAGSPVSPIGVGSTTTGM-KRPRPVFSHGDSMALEGNARHDV 311
           A  V+ R+G+CS+DSCLTS GSP SP+G+ S    M KR RP FS+GDS+ LEGN R +V
Sbjct: 241 ASNVLARIGNCSVDSCLTSTGSPGSPMGMSSLAAAMKKRQRPFFSNGDSLPLEGNMRQEV 300

Query: 312 EWMM 314
           EWMM
Sbjct: 301 EWMM 303
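The percentages in each HSP header follow from the residue counts: the number of identical (or positive-scoring) columns divided by the alignment length. The following is a minimal Python sketch, assuming header lines laid out as in the alignments above. The function name `parse_hsp_stats` and the regular expression are illustrative and are not part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 283/309 (91.59%)". The pattern accepts
# both "Positives" and the "Postives" misspelling seen in some report exports.
FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {field: (matches, length, reported_pct, recomputed_pct)}."""
    stats = {}
    for name, hits, length, pct in FIELD.findall(line):
        hits, length = int(hits), int(length)
        # Percentage = matching columns / alignment length, to two decimals.
        recomputed = round(100.0 * hits / length, 2)
        key = "Identity" if name == "Identity" else "Positives"
        stats[key] = (hits, length, float(pct), recomputed)
    return stats


header = "Identity = 283/309 (91.59%), Positives = 290/309 (93.85%), Query Frame = 1"
for field, (hits, length, reported, recomputed) in parse_hsp_stats(header).items():
    print(f"{field}: {hits}/{length} reported {reported}% recomputed {recomputed}%")
```

A mismatch between the reported and recomputed values would point to a transcription error in the header rather than in the alignment itself.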

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PHL2_ARATH	6.9e-90	69.00	Protein PHR1-LIKE 2 OS=Arabidopsis thaliana GN=PHL2 PE=1 SV=1
PHL3_ARATH	2.3e-85	62.58	Protein PHR1-LIKE 3 OS=Arabidopsis thaliana GN=PHL3 PE=1 SV=1
APL_ARATH	8.6e-48	64.85	Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=1 SV=2
PHL9_ARATH	1.5e-47	48.73	Myb-related protein 2 OS=Arabidopsis thaliana GN=MYR2 PE=1 SV=1
PHLA_ARATH	1.8e-45	49.58	Myb-related protein 1 OS=Arabidopsis thaliana GN=MYR1 PE=1 SV=1
Match Name	E-value	Identity	Description
A0A0A0L435_CUCSA	1.3e-151	93.20	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G130900 PE=4 SV=1
F2QKV9_ROSRU	6.5e-119	76.32	Putative MYB transcription factor OS=Rosa rugosa GN=myb6 PE=2 SV=1
B9RWI5_RICCO	8.8e-116	75.25	Transcription factor, putative OS=Ricinus communis GN=RCOM_1019810 PE=4 SV=1
B9GFX5_POPTR	3.7e-114	75.25	Myb family transcription factor family protein OS=Populus trichocarpa GN=POPTR_0...
F6GWG5_VITVI	4.8e-114	75.25	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0029g00060 PE=4 SV=...
Match Name	E-value	Identity	Description
AT3G24120.2	1.6e-89	68.25	Homeodomain-like superfamily protein
AT4G13640.2	5.5e-85	61.98	Homeodomain-like superfamily protein
AT1G79430.2	4.8e-49	64.85	Homeodomain-like superfamily protein
AT3G04030.3	8.2e-49	48.73	Homeodomain-like superfamily protein
AT5G18240.1	1.0e-46	49.58	myb-related protein 1
Match Name	E-value	Identity	Description
gi|659075830|ref|XP_008438354.1|	5.6e-156	93.93	PREDICTED: myb family transcription factor APL isoform X1 [Cucumis melo]
gi|778677863|ref|XP_004133994.2|	1.9e-151	93.20	PREDICTED: myb family transcription factor APL isoform X1 [Cucumis sativus]
gi|659075832|ref|XP_008438355.1|	2.4e-151	92.33	PREDICTED: myb family transcription factor APL isoform X2 [Cucumis melo]
gi|778677866|ref|XP_011650877.1|	8.1e-147	91.59	PREDICTED: myb family transcription factor APL isoform X2 [Cucumis sativus]
gi|327412613|emb|CCA29095.1|	9.3e-119	76.32	putative MYB transcription factor [Rosa rugosa]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0003677	DNA binding
Vocabulary: INTERPRO
Term	Definition
IPR025756	Myb_CC_LHEQLE
IPR017930	Myb_dom
IPR009057	Homeobox-like_sf
IPR006447	Myb_dom_plants
IPR001005	SANT/Myb
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0005634	nucleus
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG03g18020.1	Cp4.1LG03g18020.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001005	SANT/Myb domain	PFAM	PF00249	Myb_DNA-binding	coord: 54..105 score: 1.
IPR006447	Myb domain, plants	TIGRFAMs	TIGR01557	TIGR01557	coord: 52..107 score: 5.3
IPR009057	Homeodomain-like	GENE3D	G3DSA:1.10.10.60		coord: 49..107 score: 3.2
IPR009057	Homeodomain-like	unknown	SSF46689	Homeodomain-like	coord: 52..109 score: 2.15
IPR017930	Myb domain	PROFILE	PS51294	HTH_MYB	coord: 49..109 score: 12
IPR025756	MYB-CC type transcription factor, LHEQLE-containing domain	PFAM	PF14379	Myb_CC_LHEQLE	coord: 149..195 score: 6.8
None	No IPR available	unknown	Coil	Coil	coord: 207..227 score: - / coord: 152..172 scor
None	No IPR available	PANTHER	PTHR31499	FAMILY NOT NAMED	coord: 35..314 score: 3.9E
None	No IPR available	PANTHER	PTHR31499:SF6	MYB FAMILY TRANSCRIPTION FACTOR-RELATED	coord: 35..314 score: 3.9E
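
The `coord: start..end` entries in the Alignment column give residue ranges on the predicted protein. The following is a minimal Python sketch for reading those ranges and comparing them, assuming the ranges are 1-based and inclusive. The helper names `spans` and `length`, and the grouping of the four predictions that overlap in the 49..109 region, are illustrative choices and not part of the InterPro output.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

# Source term -> Alignment cell, copied from the table above (scores omitted).
alignments = {
    "PF00249": "coord: 54..105",
    "TIGR01557": "coord: 52..107",
    "SSF46689": "coord: 52..109",
    "PS51294": "coord: 49..109",
    "PF14379": "coord: 149..195",
}


def spans(cell):
    """Extract every (start, end) pair from an Alignment cell."""
    return [(int(a), int(b)) for a, b in COORD.findall(cell)]


def length(span):
    """Number of residues covered, assuming inclusive coordinates."""
    start, end = span
    return end - start + 1


for term, cell in alignments.items():
    for span in spans(cell):
        print(term, span, length(span))

# Region common to the four predictions that overlap around residues 49..109.
myb = [spans(alignments[t])[0] for t in ("PF00249", "TIGR01557", "SSF46689", "PS51294")]
core = (max(s for s, _ in myb), min(e for _, e in myb))
print("shared region:", core)
```

Taking the largest start and the smallest end gives the interval on which all four predictions agree. Here that is the PFAM Myb_DNA-binding range, since it is the narrowest of the four.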

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG03g18020	Cp4.1LG08g02780	Cucurbita pepo (Zucchini)	cpecpeB482