Cp4.1LG03g09290 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g09290
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myb family transcription factor family protein
Location: Cp4.1LG03 : 6377956 .. 6379805 (-)
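
The location string can be unpacked programmatically. The following is a minimal sketch, not part of the original record: `parse_location` is a hypothetical helper name, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG03 : 6377956 .. 6379805 (-)")

# Span length under the 1-based, inclusive assumption.
span = end - start + 1
print(seqid, strand, span)
```

A strand of `-` means the feature is annotated on the reverse strand of the linkage group.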
The following sequences are available for this feature:

Gene sequence (with intron)

TGATCTCTCTCAACACACCACACCACACAACACAACAACTCTCCAATCCATACCCTTAACTCAGGTAATCCATTCCATAGCTAGTTAGTTTTTAAAAATGGTGCTTATTTCTTTACCAACAATCGAGTTCATACCCAATATGGGTCCATGTTTGTTTATAATGTGATGTTAACGTTGAATTGTATGAGTGCAGAATGTACCATCATCATCAACATAGAGGGAAAAGCATCCACTCCTCTGAGAGGCATTTGTTCTTACAAGGTGGGAATGGGCCTGGAGGAGATTCAGGTCTTGTTCTTTCCACCGACGCTAAACCCAGGCTTAAATGGACTCCTGATTTGCATGACCGTTTTGTTGAAGCAGTCAATCAACTTGGAGGGCCTGACAGTGAGCATTTCATTTCATCTTTAATTCTATTTTACAATTACTATTATTATGATTTTAAGTTCTTCAATTCTATCTTTGTAGAGGCCACTCCTAAAACCGTGATGAAAATTATGGGCATTTCTGGACTTACTTTGTACCATTTGAAGAGCCATCTTCAGGTATAAATCTGTTTAAGCTTTTGAATCGAGTCATGATTTAACGACATCGTGACGTTTTGTATAATGATTGGATGCAGAAATACAGGCTGAGCAAGAACATGCATGGACAAGCAAATGGTGGAAGTGGAACCAATAATAAAACTGGTAATTAATAATTTAGAGCAATTTTAGCATAATTATCTCTTTCCTTTCATCGATTTTGTTTGAATCCTCGTCTTTTTCTAGGAGAAATTGTTATCTTTTTTAATTAAAAAGATTTGTGACCAGTTGGGAGGAATTTTAACAGGACGGTTGTACTTTTATCTTTATAAATAAATGAGATGTTGTGTAACAAAGAAAGAGAATTGAAAAGCATAAATACTTGACTACTTGGAAGTTCATTTGTTTGGACCTGATGGTTATGGTAAAATAGGTATGGGTTGGCGTGTAGGAACGGTGGCTGTTTCTGTAGACCAGAGATTGGGTGAGGCAAATGGAGCAGGTCGCACTAACAACAGCATAGCCGTCGCACCACAGGCCTCCTCCCAGCCAAACAAGTAGGCCACCTTAAGACAAAAATAGAGCAAATTTAGTGCATAAAACATGATTTATAACAATACTCCCTTCTTCTCCAGAAGTCTTCAAATCAGTGAAACAATACAAATGCAAATTGAAGTGCAGAAAAGGCTACACGAACAACTCGAGGTACAAAGGCATTTGCAGCTACGAATAGAAGCACAAGGGAAGTATCTCCAAACAGTGTTGGAGAAAGCACAAGAAACATTAGGAACACAGAATTTAGGGACAGTGGGACTTGAAGCTGCCAAAGTGCAGCTCTCAGAATTGGTCTCCAAAGTGTCCACTCAATGCCTAACTGCAGCCTTTCCAGAGCTCCACCACCAACAACAACAACAACAAAACCAACCCCAAAGGCTATGTCCTCAACAAACCCAGCCCCCTGACTGCTCCATGGACAGTTGCTTGACTTCCTGCGAGGCCTCCAAGGATCAGCAAAACGGCCACTTGGCCCTCCAACCCTATGCCATCGCCACCGACAGAGGCTCCTCCGCTGCGGACCAGCTTCATGGGCTGTCCATGGGCATGGGGCTGGTGCAAGGGGAGAAGGGGGAAGGGTATAATGGGTATTCACCATCCGAGGGACAAAGATTTGGGAGTACAAGAAAGGAGGGAGTGGAAAGGGAGAAAGTGGTGGAGGGGGCGTTTAGATATAGAATGGACTTGAATGCTGGAGAGGATCAGCTTAGTGATAATGATCATGCTGCGTCTAATACTTGCAAGATGTTTGATCTTAATGGTTTTAGCTGA

mRNA sequence

TGATCTCTCTCAACACACCACACCACACAACACAACAACTCTCCAATCCATACCCTTAACTCAGAATGTACCATCATCATCAACATAGAGGGAAAAGCATCCACTCCTCTGAGAGGCATTTGTTCTTACAAGGTGGGAATGGGCCTGGAGGAGATTCAGGTCTTGTTCTTTCCACCGACGCTAAACCCAGGCTTAAATGGACTCCTGATTTGCATGACCGTTTTGTTGAAGCAGTCAATCAACTTGGAGGGCCTGACAAGGCCACTCCTAAAACCGTGATGAAAATTATGGGCATTTCTGGACTTACTTTGTACCATTTGAAGAGCCATCTTCAGAAATACAGGCTGAGCAAGAACATGCATGGACAAGCAAATGGTGGAAGTGGAACCAATAATAAAACTGGTATGGGTTGGCGTGTAGGAACGGTGGCTGTTTCTGTAGACCAGAGATTGGGTGAGGCAAATGGAGCAGGTCGCACTAACAACAGCATAGCCGTCGCACCACAGGCCTCCTCCCAGCCAAACAAAAGTCTTCAAATCAGTGAAACAATACAAATGCAAATTGAAGTGCAGAAAAGGCTACACGAACAACTCGAGGTACAAAGGCATTTGCAGCTACGAATAGAAGCACAAGGGAAGTATCTCCAAACAGTGTTGGAGAAAGCACAAGAAACATTAGGAACACAGAATTTAGGGACAGTGGGACTTGAAGCTGCCAAAGTGCAGCTCTCAGAATTGGTCTCCAAAGTGTCCACTCAATGCCTAACTGCAGCCTTTCCAGAGCTCCACCACCAACAACAACAACAACAAAACCAACCCCAAAGGCTATGTCCTCAACAAACCCAGCCCCCTGACTGCTCCATGGACAGTTGCTTGACTTCCTGCGAGGCCTCCAAGGATCAGCAAAACGGCCACTTGGCCCTCCAACCCTATGCCATCGCCACCGACAGAGGCTCCTCCGCTGCGGACCAGCTTCATGGGCTGTCCATGGGCATGGGGCTGGTGCAAGGGGAGAAGGGGGAAGGGTATAATGGGTATTCACCATCCGAGGGACAAAGATTTGGGAGTACAAGAAAGGAGGGAGTGGAAAGGGAGAAAGTGGTGGAGGGGGCGTTTAGATATAGAATGGACTTGAATGCTGGAGAGGATCAGCTTAGTGATAATGATCATGCTGCGTCTAATACTTGCAAGATGTTTGATCTTAATGGTTTTAGCTGA

Coding sequence (CDS)

ATGTACCATCATCATCAACATAGAGGGAAAAGCATCCACTCCTCTGAGAGGCATTTGTTCTTACAAGGTGGGAATGGGCCTGGAGGAGATTCAGGTCTTGTTCTTTCCACCGACGCTAAACCCAGGCTTAAATGGACTCCTGATTTGCATGACCGTTTTGTTGAAGCAGTCAATCAACTTGGAGGGCCTGACAAGGCCACTCCTAAAACCGTGATGAAAATTATGGGCATTTCTGGACTTACTTTGTACCATTTGAAGAGCCATCTTCAGAAATACAGGCTGAGCAAGAACATGCATGGACAAGCAAATGGTGGAAGTGGAACCAATAATAAAACTGGTATGGGTTGGCGTGTAGGAACGGTGGCTGTTTCTGTAGACCAGAGATTGGGTGAGGCAAATGGAGCAGGTCGCACTAACAACAGCATAGCCGTCGCACCACAGGCCTCCTCCCAGCCAAACAAAAGTCTTCAAATCAGTGAAACAATACAAATGCAAATTGAAGTGCAGAAAAGGCTACACGAACAACTCGAGGTACAAAGGCATTTGCAGCTACGAATAGAAGCACAAGGGAAGTATCTCCAAACAGTGTTGGAGAAAGCACAAGAAACATTAGGAACACAGAATTTAGGGACAGTGGGACTTGAAGCTGCCAAAGTGCAGCTCTCAGAATTGGTCTCCAAAGTGTCCACTCAATGCCTAACTGCAGCCTTTCCAGAGCTCCACCACCAACAACAACAACAACAAAACCAACCCCAAAGGCTATGTCCTCAACAAACCCAGCCCCCTGACTGCTCCATGGACAGTTGCTTGACTTCCTGCGAGGCCTCCAAGGATCAGCAAAACGGCCACTTGGCCCTCCAACCCTATGCCATCGCCACCGACAGAGGCTCCTCCGCTGCGGACCAGCTTCATGGGCTGTCCATGGGCATGGGGCTGGTGCAAGGGGAGAAGGGGGAAGGGTATAATGGGTATTCACCATCCGAGGGACAAAGATTTGGGAGTACAAGAAAGGAGGGAGTGGAAAGGGAGAAAGTGGTGGAGGGGGCGTTTAGATATAGAATGGACTTGAATGCTGGAGAGGATCAGCTTAGTGATAATGATCATGCTGCGTCTAATACTTGCAAGATGTTTGATCTTAATGGTTTTAGCTGA

Protein sequence

MYHHHQHRGKSIHSSERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFPELHHQQQQQQNQPQRLCPQQTQPPDCSMDSCLTSCEASKDQQNGHLALQPYAIATDRGSSAADQLHGLSMGMGLVQGEKGEGYNGYSPSEGQRFGSTRKEGVEREKVVEGAFRYRMDLNAGEDQLSDNDHAASNTCKMFDLNGFS
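
The relationship between the coding sequence and the protein sequence can be checked with a short script. This is a sketch under stated assumptions: it uses the standard genetic code (the record does not name a translation table), `translate` is a hypothetical helper, and only the first and last few codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons of the CDS listed above.
print(translate("ATGTACCATCATCATCAACATAGAGGGAAAAGC"))  # MYHHHQHRGKS
# Last 5 codons of the CDS, ending in the TGA stop codon.
print(translate("AATGGTTTTAGCTGA"))  # NGFS
```

Both outputs agree with the start (`MYHHHQHRGKS...`) and end (`...NGFS`) of the protein sequence shown above.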
BLAST of Cp4.1LG03g09290 vs. Swiss-Prot
Match: PHL9_ARATH (Myb-related protein 2 OS=Arabidopsis thaliana GN=MYR2 PE=1 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.2e-80
Identity = 206/423 (48.70%), Positives = 255/423 (60.28%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MY+ +QH+GK+I SS       ERH FL+G N P GDSGL+LSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYYQNQHQGKNILSSSRMHITSERHPFLRG-NSP-GDSGLILSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           +EAVNQLGG DKATPKT+MK+MGI GLTLYHLKSHLQKYRLSKN++GQAN    + NK G
Sbjct: 61  IEAVNQLGGADKATPKTIMKVMGIPGLTLYHLKSHLQKYRLSKNLNGQAN---NSFNKIG 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLH 180
                  +   ++++  +A+     N SI        QPNK+  I E +QMQIEVQ+RLH
Sbjct: 121 -------IMTMMEEKTPDADEIQSENLSI------GPQPNKNSPIGEALQMQIEVQRRLH 180

Query: 181 EQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCL 240
           EQLEVQRHLQLRIEAQGKYLQ+VLEKAQETLG QNLG  G+EAAKVQLSELVSKVS +  
Sbjct: 181 EQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGAAGIEAAKVQLSELVSKVSAEYP 240

Query: 241 TAAFPELHHQQQQQQNQPQRLCPQQTQ---PPDCSMDSCLTSCEA----SKDQQNGHLAL 300
            ++F E          + Q LC QQ Q   PPDCS++SCLTS E     SK  +N  L L
Sbjct: 241 NSSFLE--------PKELQNLCSQQMQTNYPPDCSLESCLTSSEGTQKNSKMLENNRLGL 300

Query: 301 QPYAIATDRGSSAADQLHGLS------MGMGLVQGEKGEGYNGYSPSEGQR--------- 360
           + Y      G S ++Q   +       M +   +G +G  Y     SE ++         
Sbjct: 301 RTYI-----GDSTSEQKEIMEEPLFQRMELTWTEGLRGNPYLSTMVSEAEQRISYSERSP 360

Query: 361 -----------FGSTRKEGVEREKVVEGAFRYRMDLNAGEDQLSDNDHAASNTCKMFDLN 384
                        S  ++G   +  +E   R  MD     D  +  ++  +   K FDLN
Sbjct: 361 GRLSIGVGLHGHKSQHQQGNNEDHKLETRNRKGMDSTTELDLNTHVENYCTTRTKQFDLN 392
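
The percentages in an HSP summary line follow directly from the counts: identical (or positive-scoring) positions divided by the alignment length. The sketch below recomputes them for the PHL9_ARATH hit above; `hsp_percentages` is a hypothetical helper, and the parsing assumes the line contains the identity fraction first and the positives fraction second.

```python
import re

def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the 'n/len' counts."""
    pairs = [(int(a), int(b)) for a, b in re.findall(r"(\d+)/(\d+)", stats_line)]
    (identical, length), (positive, _) = pairs[:2]
    return round(100 * identical / length, 2), round(100 * positive / length, 2)

line = "Identity = 206/423 (48.70%), Positives = 255/423 (60.28%), Query Frame = 1"
print(hsp_percentages(line))  # (48.7, 60.28)
```

Note that the alignment length (423) exceeds the query length (384) because gap columns are counted in the denominator.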

BLAST of Cp4.1LG03g09290 vs. Swiss-Prot
Match: PHLA_ARATH (Myb-related protein 1 OS=Arabidopsis thaliana GN=MYR1 PE=1 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 9.7e-78
Identity = 177/290 (61.03%), Positives = 208/290 (71.72%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MY+H+QH+GKSI SS       ERH FL+G NG  GDSGL+LSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYYHNQHQGKSILSSSRMPISSERHPFLRG-NGT-GDSGLILSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           VEAVNQLGG DKATPKT+MK+MGI GLTLYHLKSHLQKYRLSKN++GQAN    + NKT 
Sbjct: 61  VEAVNQLGGGDKATPKTIMKVMGIPGLTLYHLKSHLQKYRLSKNLNGQAN---SSLNKTS 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLH 180
           +   V      VD+   E         S+++ P    QP+ +L IS+ +QMQIEVQ+RLH
Sbjct: 121 VMTMVEENPPEVDESHSE---------SLSIGP----QPSMNLPISDALQMQIEVQRRLH 180

Query: 181 EQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCL 240
           EQLEVQRHLQLRIEAQGKYLQ++LEKAQETLG QNLG  G+EA K QLSELVSKVS    
Sbjct: 181 EQLEVQRHLQLRIEAQGKYLQSILEKAQETLGRQNLGAAGIEATKAQLSELVSKVSADYP 240

Query: 241 TAAFPE------LHHQQQQQQNQPQRLCPQQTQPPDCSMDSCLTSCEASK 278
            ++F E      LHHQQ            Q+T PP+ S+DSCLTS E ++
Sbjct: 241 DSSFLEPKELQNLHHQQM-----------QKTYPPNSSLDSCLTSSEGTQ 261

BLAST of Cp4.1LG03g09290 vs. Swiss-Prot
Match: PHL8_ARATH (Myb family transcription factor PHL8 OS=Arabidopsis thaliana GN=PHL8 PE=2 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 6.6e-50
Identity = 120/250 (48.00%), Positives = 165/250 (66.00%), Query Frame = 1

Query: 33  LVLSTDAKPRLKWTPDLHDRFVEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKY 92
           LVLSTDAKPRLKWT DLH +F+EAVNQLGGP+KATPK +MK+M I GLTLYHLKSHLQKY
Sbjct: 27  LVLSTDAKPRLKWTCDLHHKFIEAVNQLGGPNKATPKGLMKVMEIPGLTLYHLKSHLQKY 86

Query: 93  RLSKNMHGQANGGSGTNNKTGMGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQP 152
           RL K+M    N             ++   + S +Q + E+    R     +V  + S+  
Sbjct: 87  RLGKSMKFDDN-------------KLEVSSASENQEV-ESKNDSRDLRGCSVTEENSNPA 146

Query: 153 NKSLQISETIQMQIEVQKRLHEQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTV 212
            + LQI+E +QMQ+EVQK+LHEQ+EVQRHLQ++IEAQGKYLQ+VL KAQ+TL   +   +
Sbjct: 147 KEGLQITEALQMQMEVQKKLHEQIEVQRHLQVKIEAQGKYLQSVLMKAQQTLAGYSSSNL 206

Query: 213 GLEAAKVQLSELVSKVSTQCLTAAFPELHHQQQQQQ-----NQPQRLCPQQTQPPDCSMD 272
           G++ A+ +LS L S V+  C + +F EL   +++++      +P+     Q +   CS++
Sbjct: 207 GMDFARTELSRLASMVNRGCPSTSFSELTQVEEEEEGFLWYKKPENRGISQLR---CSVE 259

Query: 273 SCLTSCEASK 278
           S LTS E S+
Sbjct: 267 SSLTSSETSE 259

BLAST of Cp4.1LG03g09290 vs. Swiss-Prot
Match: PHL3_ARATH (Protein PHR1-LIKE 3 OS=Arabidopsis thaliana GN=PHL3 PE=1 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 4.9e-45
Identity = 110/200 (55.00%), Positives = 142/200 (71.00%), Query Frame = 1

Query: 30  DSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHL 89
           D+ LVL+TD KPRL+WT +LH+RFV+AV QLGGPDKATPKT+M+ MG+ GLTLYHLKSHL
Sbjct: 27  DACLVLTTDPKPRLRWTSELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHL 86

Query: 90  QKYRLSKNMHGQANGGSGTNNKTGMGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQAS 149
           QK+RL +    Q+   S  N+K      V  VA S D       G+  T +S+ +A Q  
Sbjct: 87  QKFRLGR----QSCKESIDNSKD-----VSCVAESQD------TGSSST-SSLRLAAQ-- 146

Query: 150 SQPNKSLQISETIQMQIEVQKRLHEQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNL 209
            + N+S Q++E ++ Q+EVQ+RLHEQLEVQR LQLRIEAQGKYLQ++LEKA + +  Q +
Sbjct: 147 -EQNESYQVTEALRAQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKAIEEQAV 206

Query: 210 GTVGLEAAKVQLSELVSKVS 230
              GLEAA+ +LSEL  K S
Sbjct: 207 AFAGLEAAREELSELAIKAS 207

BLAST of Cp4.1LG03g09290 vs. Swiss-Prot
Match: PHL2_ARATH (Protein PHR1-LIKE 2 OS=Arabidopsis thaliana GN=PHL2 PE=1 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 2.4e-44
Identity = 118/256 (46.09%), Positives = 155/256 (60.55%), Query Frame = 1

Query: 21  LQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLGGPDKATPKTVMKIMGISGL 80
           L G N PG D+ LVL+TD KPRL+WT +LH+RFV+AV QLGGPDKATPKT+M+ MG+ GL
Sbjct: 23  LDGTNLPG-DACLVLTTDPKPRLRWTTELHERFVDAVTQLGGPDKATPKTIMRTMGVKGL 82

Query: 81  TLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTVAVSVDQRLGEANGAGRTNN 140
           TLYHLKSHLQK+RL +    QA   S  N+K                 +GE+   G  ++
Sbjct: 83  TLYHLKSHLQKFRLGR----QAGKESTENSKDA-------------SCVGESQDTG--SS 142

Query: 141 SIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQLEVQRHLQLRIEAQGKYLQTVLEKA 200
           S +    A  + N+  Q++E ++ Q+EVQ+RLH+QLEVQR LQLRIEAQGKYLQ++LEKA
Sbjct: 143 STSSMRMAQQEQNEGYQVTEALRAQMEVQRRLHDQLEVQRRLQLRIEAQGKYLQSILEKA 202

Query: 201 QETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFPELHHQQQQQQNQPQRLC----P 260
            +    Q     GLEAA+ +LSEL  KVS      + P     +         L      
Sbjct: 203 CKAFDEQAATFAGLEAAREELSELAIKVSNSSQGTSVPYFDATKMMMMPSLSELAVAIDN 258

Query: 261 QQTQPPDCSMDSCLTS 273
           +     +CS++S LTS
Sbjct: 263 KNNITTNCSVESSLTS 258

BLAST of Cp4.1LG03g09290 vs. TrEMBL
Match: A0A0A0LG98_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G844850 PE=4 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 1.9e-152
Identity = 322/407 (79.12%), Positives = 337/407 (82.80%), Query Frame = 1

Query: 3   HHHQHRGKSIHSSERHLFLQGG-NGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 62
           +HHQHRGKSIHSSERH+FLQGG NG GGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG
Sbjct: 2   YHHQHRGKSIHSSERHMFLQGGGNGGGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 61

Query: 63  GPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTV 122
           G DKATPKTVMKIMGI GLTLYHLKSHLQKYRLSKN+HGQANGGSGTN KTG GWRVGTV
Sbjct: 62  GADKATPKTVMKIMGIPGLTLYHLKSHLQKYRLSKNLHGQANGGSGTN-KTGTGWRVGTV 121

Query: 123 AVSVDQRLGEANGAG---RTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQLEV 182
           AVSVDQRLGEANGA    RT+N I V PQ +SQ NKSLQISETIQMQIEVQKRLHEQLEV
Sbjct: 122 AVSVDQRLGEANGAAAAARTSN-IVVGPQPTSQSNKSLQISETIQMQIEVQKRLHEQLEV 181

Query: 183 QRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP 242
           QRHLQLRIEAQGKYLQTVLEKAQETLG QNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP
Sbjct: 182 QRHLQLRIEAQGKYLQTVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP 241

Query: 243 ELHHQQQQQQNQPQRLC-PQQTQPPDCSMDSCLTSCE-ASKDQQ----------NGHLAL 302
           ELH+     Q+Q QR+C  QQ+QPPDCSMDSCLTS E  SKDQQ          N HLAL
Sbjct: 242 ELHN-----QSQSQRVCAQQQSQPPDCSMDSCLTSSEGGSKDQQAQQQQHVLLHNSHLAL 301

Query: 303 QPYAIATDRGSSAA--DQLHGLSMGMGLVQGEKG--EGYNGYSPSEGQR-FGSTRKEGVE 362
           +PYA   DR SS A    LHGLSM +GLVQGEK   EGYNGYS SEGQR FGS R +   
Sbjct: 302 RPYA---DRASSGAPDHSLHGLSMSIGLVQGEKAGPEGYNGYSTSEGQRLFGSKRTKDAV 361

Query: 363 REKVVEGAFRYRMDL-NAGEDQL----SDNDHAASNTCKMFDLNGFS 384
            EK  E  FRYRMDL NAGEDQL    ++NDH +S TCKMFDLNGFS
Sbjct: 362 MEK--ETGFRYRMDLNNAGEDQLISSNNNNDHTSSTTCKMFDLNGFS 396

BLAST of Cp4.1LG03g09290 vs. TrEMBL
Match: A0A067JLA4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06855 PE=4 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.0e-98
Identity = 243/444 (54.73%), Positives = 278/444 (62.61%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MYHHHQH+GKS+HSS       ERHLFLQGGNGPG DSGLVLSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYHHHQHQGKSVHSSSRMPIPPERHLFLQGGNGPG-DSGLVLSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           +EAVNQLGG DKATPKTVMK+MGI GLTLYHLKSHLQKYRLSKN+HGQAN GS       
Sbjct: 61  IEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSS------ 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLH 180
              ++G  AV+ D R+ EAN     N SI        Q NKSL ISE +QMQIEVQ+RLH
Sbjct: 121 ---KIGAAAVTSD-RMSEANVTHMNNLSIG------PQTNKSLHISEALQMQIEVQRRLH 180

Query: 181 EQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCL 240
           EQLEVQRHLQLRIEAQGKYLQ VLEKAQETLG QNLGT+GLEAAKVQLSELVSKVSTQCL
Sbjct: 181 EQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTMGLEAAKVQLSELVSKVSTQCL 240

Query: 241 TAAFPELHHQQQQQQNQPQRLCPQQTQ---PPDCSMDSCLTSCEASKDQQNGH---LALQ 300
            +AF EL         + Q LCPQQTQ   P DCS+DSCLTSCE S+ +Q  H   + L+
Sbjct: 241 NSAFSEL--------KELQGLCPQQTQTTPPTDCSVDSCLTSCEGSQKEQEIHNTGMGLR 300

Query: 301 PY---AIATDRGSSAADQLHGLSMGMGLVQGEKGEGYNGYSPSEGQR--FGSTR------ 360
           PY   A+   +  +    LH       L  GE  + +     +  +R  F S R      
Sbjct: 301 PYNGSALLESKDMAEEHMLHQTE----LKWGEDNKMFLSPMGNNAERRIFSSERSSSDLS 360

Query: 361 -KEGVEREK------VVEGAFRYRMDLNAGEDQ--------------------------- 384
            + G++ E         EG ++ R D +   DQ                           
Sbjct: 361 MRVGLQGESRNPCSGFSEGRYKERNDDDKFPDQTKKTADSVKLQNENISPGYRLPYFATK 415

BLAST of Cp4.1LG03g09290 vs. TrEMBL
Match: F6HLA5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g07580 PE=4 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 2.6e-98
Identity = 232/399 (58.15%), Positives = 268/399 (67.17%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MYHHH H+GK+IH S       ER+LFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYHHHHHQGKNIHPSSRTPITPERNLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           +EAVNQLGG DKATPKTVMK+MGI GLTLYHLKSHLQKYRLSKN+HGQAN  +   +KT 
Sbjct: 61  IEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSAT---SKTV 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLH 180
           +G           +R+ EANGA      +  +P   +Q NKSL +SET+QM IE Q+RLH
Sbjct: 121 VG-----------ERMPEANGA------LMSSPNIGNQTNKSLHLSETLQM-IEAQRRLH 180

Query: 181 EQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCL 240
           EQLEVQRHLQLRIEAQGKYLQ VLEKAQETLG QNLG VGLEAAKVQLSELVSKVSTQCL
Sbjct: 181 EQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCL 240

Query: 241 TAAFPELHHQQQQQQNQPQRLCPQ--QTQPPDCSMDSCLTSCEASKDQQNGH---LALQP 300
            +AF EL         + Q LCPQ  QTQP DCSMDSCLTSCE S+ +Q  H   + L+P
Sbjct: 241 HSAFSEL--------KELQSLCPQQTQTQPTDCSMDSCLTSCEGSQREQEIHNCGMGLRP 300

Query: 301 YAIATDRGSSAADQLHGLSMGMGLVQGEKGEGYNGYSPSEGQRFGSTRKEGVEREKVVEG 360
           Y       S +  +  G +     V             + G   G++ K+  E EK+  G
Sbjct: 301 YTNGNGSNSYSEGRFKGRAEADNFVD----------RTNHGADSGNSVKQ--ENEKMSHG 351

Query: 361 ----AFRYRMDLNAGEDQLSDNDHAASNTCKMFDLNGFS 384
                F  ++DLNA +    +ND   S  CK FDLNGFS
Sbjct: 361 YRLPCFGAKLDLNAHD----ENDVTLS--CKQFDLNGFS 351

BLAST of Cp4.1LG03g09290 vs. TrEMBL
Match: B9I5Z8_POPTR (Myb family transcription factor family protein OS=Populus trichocarpa GN=POPTR_0013s05670g PE=4 SV=2)

HSP 1 Score: 366.7 bits (940), Expect = 3.5e-98
Identity = 213/298 (71.48%), Positives = 230/298 (77.18%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MYHHHQH+GKSIHSS       ERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYHHHQHQGKSIHSSSRMAIPPERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           +EAVNQLGG DKATPKTVMK+MGI GLTLYHLKSHLQKYRLSKN+HGQAN GS       
Sbjct: 61  IEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGSS------ 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNK-----SLQISETIQMQIEV 180
              ++GTVAV V  R+ EAN      N++++     SQPNK     SL  SE +QMQIEV
Sbjct: 121 ---KIGTVAV-VGDRMPEANATHININNLSI----GSQPNKILKSRSLHFSEALQMQIEV 180

Query: 181 QKRLHEQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKV 240
           Q+RLHEQLEVQRHLQLRIEAQGKYLQ VLEKAQETLG QNLGTVGLEAAKVQLSELVSKV
Sbjct: 181 QRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKV 240

Query: 241 STQCLTAAFPELHHQQQQQQNQPQRLCPQQ---TQPPDCSMDSCLTSCEASKDQQNGH 284
           STQCL + F EL        N  Q LCPQQ   TQP DCSMDSCLTSCE S+ +Q  H
Sbjct: 241 STQCLNSTFSEL--------NDLQGLCPQQTPPTQPNDCSMDSCLTSCEGSQKEQEIH 275

BLAST of Cp4.1LG03g09290 vs. TrEMBL
Match: A0A067EMH8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013552mg PE=4 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 1.9e-96
Identity = 215/305 (70.49%), Positives = 236/305 (77.38%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MYHHHQ++GKS+HSS       ERHLFLQGG+GPG DSGLVLSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPG-DSGLVLSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           +EAVNQLGG DKATPKTVMK+MGI GLTLYHLKSHLQKYRLSKN+HGQAN G+     TG
Sbjct: 61  IEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTG 120

Query: 121 MGWR------VGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIE 180
           +G        VG V V   +R+ EAN     N SI        QPNKSL ISETIQMQIE
Sbjct: 121 IGGMKFKSSGVGPVTVP-GERMPEANATHMNNLSIG------PQPNKSLHISETIQMQIE 180

Query: 181 VQKRLHEQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSK 240
           VQ+RLHEQLEVQRHLQLRIEAQGKYLQ VLEKAQETLG QNLGT GLEAAKVQLSELVSK
Sbjct: 181 VQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSK 240

Query: 241 VSTQCLTAAFPELHHQQQQQQNQPQRLCPQQTQPPDCSMDSCLTSCEAS-KDQQ--NGHL 290
           VSTQCL + F +L   ++ Q   PQ+  PQ  QP DCSMDSCLTSCE S KDQ+  NG +
Sbjct: 241 VSTQCLNSTFSDL---KELQGFCPQQ--PQANQPTDCSMDSCLTSCEGSQKDQEIHNGGV 292

BLAST of Cp4.1LG03g09290 vs. TAIR10
Match: AT3G04030.3 (AT3G04030.3 Homeodomain-like superfamily protein)

HSP 1 Score: 301.6 bits (771), Expect = 6.9e-82
Identity = 206/423 (48.70%), Positives = 255/423 (60.28%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MY+ +QH+GK+I SS       ERH FL+G N P GDSGL+LSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYYQNQHQGKNILSSSRMHITSERHPFLRG-NSP-GDSGLILSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           +EAVNQLGG DKATPKT+MK+MGI GLTLYHLKSHLQKYRLSKN++GQAN    + NK G
Sbjct: 61  IEAVNQLGGADKATPKTIMKVMGIPGLTLYHLKSHLQKYRLSKNLNGQAN---NSFNKIG 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLH 180
                  +   ++++  +A+     N SI        QPNK+  I E +QMQIEVQ+RLH
Sbjct: 121 -------IMTMMEEKTPDADEIQSENLSI------GPQPNKNSPIGEALQMQIEVQRRLH 180

Query: 181 EQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCL 240
           EQLEVQRHLQLRIEAQGKYLQ+VLEKAQETLG QNLG  G+EAAKVQLSELVSKVS +  
Sbjct: 181 EQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGAAGIEAAKVQLSELVSKVSAEYP 240

Query: 241 TAAFPELHHQQQQQQNQPQRLCPQQTQ---PPDCSMDSCLTSCEA----SKDQQNGHLAL 300
            ++F E          + Q LC QQ Q   PPDCS++SCLTS E     SK  +N  L L
Sbjct: 241 NSSFLE--------PKELQNLCSQQMQTNYPPDCSLESCLTSSEGTQKNSKMLENNRLGL 300

Query: 301 QPYAIATDRGSSAADQLHGLS------MGMGLVQGEKGEGYNGYSPSEGQR--------- 360
           + Y      G S ++Q   +       M +   +G +G  Y     SE ++         
Sbjct: 301 RTYI-----GDSTSEQKEIMEEPLFQRMELTWTEGLRGNPYLSTMVSEAEQRISYSERSP 360

Query: 361 -----------FGSTRKEGVEREKVVEGAFRYRMDLNAGEDQLSDNDHAASNTCKMFDLN 384
                        S  ++G   +  +E   R  MD     D  +  ++  +   K FDLN
Sbjct: 361 GRLSIGVGLHGHKSQHQQGNNEDHKLETRNRKGMDSTTELDLNTHVENYCTTRTKQFDLN 392

BLAST of Cp4.1LG03g09290 vs. TAIR10
Match: AT5G18240.1 (AT5G18240.1 myb-related protein 1)

HSP 1 Score: 292.0 bits (746), Expect = 5.5e-79
Identity = 177/290 (61.03%), Positives = 208/290 (71.72%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MY+H+QH+GKSI SS       ERH FL+G NG  GDSGL+LSTDAKPRLKWTPDLH+RF
Sbjct: 1   MYYHNQHQGKSILSSSRMPISSERHPFLRG-NGT-GDSGLILSTDAKPRLKWTPDLHERF 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           VEAVNQLGG DKATPKT+MK+MGI GLTLYHLKSHLQKYRLSKN++GQAN    + NKT 
Sbjct: 61  VEAVNQLGGGDKATPKTIMKVMGIPGLTLYHLKSHLQKYRLSKNLNGQAN---SSLNKTS 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLH 180
           +   V      VD+   E         S+++ P    QP+ +L IS+ +QMQIEVQ+RLH
Sbjct: 121 VMTMVEENPPEVDESHSE---------SLSIGP----QPSMNLPISDALQMQIEVQRRLH 180

Query: 181 EQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCL 240
           EQLEVQRHLQLRIEAQGKYLQ++LEKAQETLG QNLG  G+EA K QLSELVSKVS    
Sbjct: 181 EQLEVQRHLQLRIEAQGKYLQSILEKAQETLGRQNLGAAGIEATKAQLSELVSKVSADYP 240

Query: 241 TAAFPE------LHHQQQQQQNQPQRLCPQQTQPPDCSMDSCLTSCEASK 278
            ++F E      LHHQQ            Q+T PP+ S+DSCLTS E ++
Sbjct: 241 DSSFLEPKELQNLHHQQM-----------QKTYPPNSSLDSCLTSSEGTQ 261

BLAST of Cp4.1LG03g09290 vs. TAIR10
Match: AT1G69580.2 (AT1G69580.2 Homeodomain-like superfamily protein)

HSP 1 Score: 197.2 bits (500), Expect = 1.8e-50
Identity = 117/250 (46.80%), Positives = 162/250 (64.80%), Query Frame = 1

Query: 33  LVLSTDAKPRLKWTPDLHDRFVEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKY 92
           LVLSTDAKPRLKWT DLH +F+EAVNQLGGP+KATPK +MK+M I GLTLYHLKSHLQKY
Sbjct: 27  LVLSTDAKPRLKWTCDLHHKFIEAVNQLGGPNKATPKGLMKVMEIPGLTLYHLKSHLQKY 86

Query: 93  RLSKNMHGQANGGSGTNNKTGMGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQP 152
           RL K+M    N             ++   + S +Q +   N +            ++   
Sbjct: 87  RLGKSMKFDDN-------------KLEVSSASENQEVESKNDSRDLRGCSVTEENSNPAK 146

Query: 153 NKSLQISETIQMQIEVQKRLHEQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTV 212
           ++ LQI+E +QMQ+EVQK+LHEQ+EVQRHLQ++IEAQGKYLQ+VL KAQ+TL   +   +
Sbjct: 147 DRGLQITEALQMQMEVQKKLHEQIEVQRHLQVKIEAQGKYLQSVLMKAQQTLAGYSSSNL 206

Query: 213 GLEAAKVQLSELVSKVSTQCLTAAFPELHHQQQQQQ-----NQPQRLCPQQTQPPDCSMD 272
           G++ A+ +LS L S V+  C + +F EL   +++++      +P+     Q +   CS++
Sbjct: 207 GMDFARTELSRLASMVNRGCPSTSFSELTQVEEEEEGFLWYKKPENRGISQLR---CSVE 260

Query: 273 SCLTSCEASK 278
           S LTS E S+
Sbjct: 267 SSLTSSETSE 260

BLAST of Cp4.1LG03g09290 vs. TAIR10
Match: AT4G13640.2 (AT4G13640.2 Homeodomain-like superfamily protein)

HSP 1 Score: 177.9 bits (450), Expect = 1.2e-44
Identity = 110/203 (54.19%), Positives = 142/203 (69.95%), Query Frame = 1

Query: 30  DSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHL 89
           D+ LVL+TD KPRL+WT +LH+RFV+AV QLGGPDKATPKT+M+ MG+ GLTLYHLKSHL
Sbjct: 27  DACLVLTTDPKPRLRWTSELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHL 86

Query: 90  QKYRLSKNMHGQANGGSGTNNKTGMGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQAS 149
           QK+RL +    Q+   S  N+K      V  VA S D       G+  T +S+ +A Q  
Sbjct: 87  QKFRLGR----QSCKESIDNSKD-----VSCVAESQD------TGSSST-SSLRLAAQ-- 146

Query: 150 SQPNKSLQISETIQMQIEVQKRLHEQLE---VQRHLQLRIEAQGKYLQTVLEKAQETLGT 209
            + N+S Q++E ++ Q+EVQ+RLHEQLE   VQR LQLRIEAQGKYLQ++LEKA + +  
Sbjct: 147 -EQNESYQVTEALRAQMEVQRRLHEQLEYTQVQRRLQLRIEAQGKYLQSILEKACKAIEE 206

Query: 210 QNLGTVGLEAAKVQLSELVSKVS 230
           Q +   GLEAA+ +LSEL  K S
Sbjct: 207 QAVAFAGLEAAREELSELAIKAS 210

BLAST of Cp4.1LG03g09290 vs. TAIR10
Match: AT3G24120.2 (AT3G24120.2 Homeodomain-like superfamily protein)

HSP 1 Score: 175.6 bits (444), Expect = 5.7e-44
Identity = 118/259 (45.56%), Positives = 155/259 (59.85%), Query Frame = 1

Query: 21  LQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLGGPDKATPKTVMKIMGISGL 80
           L G N PG D+ LVL+TD KPRL+WT +LH+RFV+AV QLGGPDKATPKT+M+ MG+ GL
Sbjct: 23  LDGTNLPG-DACLVLTTDPKPRLRWTTELHERFVDAVTQLGGPDKATPKTIMRTMGVKGL 82

Query: 81  TLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTVAVSVDQRLGEANGAGRTNN 140
           TLYHLKSHLQK+RL +    QA   S  N+K                 +GE+   G  ++
Sbjct: 83  TLYHLKSHLQKFRLGR----QAGKESTENSKDA-------------SCVGESQDTG--SS 142

Query: 141 SIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQLE---VQRHLQLRIEAQGKYLQTVL 200
           S +    A  + N+  Q++E ++ Q+EVQ+RLH+QLE   VQR LQLRIEAQGKYLQ++L
Sbjct: 143 STSSMRMAQQEQNEGYQVTEALRAQMEVQRRLHDQLEYGQVQRRLQLRIEAQGKYLQSIL 202

Query: 201 EKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFPELHHQQQQQQNQPQRLC-- 260
           EKA +    Q     GLEAA+ +LSEL  KVS      + P     +         L   
Sbjct: 203 EKACKAFDEQAATFAGLEAAREELSELAIKVSNSSQGTSVPYFDATKMMMMPSLSELAVA 261

Query: 261 --PQQTQPPDCSMDSCLTS 273
              +     +CS++S LTS
Sbjct: 263 IDNKNNITTNCSVESSLTS 261

BLAST of Cp4.1LG03g09290 vs. NCBI nr
Match: gi|778686066|ref|XP_011652324.1| (PREDICTED: uncharacterized protein LOC101206445 isoform X1 [Cucumis sativus])

HSP 1 Score: 547.0 bits (1408), Expect = 2.7e-152
Identity = 322/407 (79.12%), Positives = 337/407 (82.80%), Query Frame = 1

Query: 3   HHHQHRGKSIHSSERHLFLQGG-NGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 62
           +HHQHRGKSIHSSERH+FLQGG NG GGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG
Sbjct: 2   YHHQHRGKSIHSSERHMFLQGGGNGGGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 61

Query: 63  GPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTV 122
           G DKATPKTVMKIMGI GLTLYHLKSHLQKYRLSKN+HGQANGGSGTN KTG GWRVGTV
Sbjct: 62  GADKATPKTVMKIMGIPGLTLYHLKSHLQKYRLSKNLHGQANGGSGTN-KTGTGWRVGTV 121

Query: 123 AVSVDQRLGEANGAG---RTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQLEV 182
           AVSVDQRLGEANGA    RT+N I V PQ +SQ NKSLQISETIQMQIEVQKRLHEQLEV
Sbjct: 122 AVSVDQRLGEANGAAAAARTSN-IVVGPQPTSQSNKSLQISETIQMQIEVQKRLHEQLEV 181

Query: 183 QRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP 242
           QRHLQLRIEAQGKYLQTVLEKAQETLG QNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP
Sbjct: 182 QRHLQLRIEAQGKYLQTVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP 241

Query: 243 ELHHQQQQQQNQPQRLC-PQQTQPPDCSMDSCLTSCE-ASKDQQ----------NGHLAL 302
           ELH+     Q+Q QR+C  QQ+QPPDCSMDSCLTS E  SKDQQ          N HLAL
Sbjct: 242 ELHN-----QSQSQRVCAQQQSQPPDCSMDSCLTSSEGGSKDQQAQQQQHVLLHNSHLAL 301

Query: 303 QPYAIATDRGSSAA--DQLHGLSMGMGLVQGEKG--EGYNGYSPSEGQR-FGSTRKEGVE 362
           +PYA   DR SS A    LHGLSM +GLVQGEK   EGYNGYS SEGQR FGS R +   
Sbjct: 302 RPYA---DRASSGAPDHSLHGLSMSIGLVQGEKAGPEGYNGYSTSEGQRLFGSKRTKDAV 361

Query: 363 REKVVEGAFRYRMDL-NAGEDQL----SDNDHAASNTCKMFDLNGFS 384
            EK  E  FRYRMDL NAGEDQL    ++NDH +S TCKMFDLNGFS
Sbjct: 362 MEK--ETGFRYRMDLNNAGEDQLISSNNNNDHTSSTTCKMFDLNGFS 396

BLAST of Cp4.1LG03g09290 vs. NCBI nr
Match: gi|659130748|ref|XP_008465330.1| (PREDICTED: uncharacterized protein LOC103502977 isoform X1 [Cucumis melo])

HSP 1 Score: 543.1 bits (1398), Expect = 3.9e-151
Identity = 316/412 (76.70%), Positives = 331/412 (80.34%), Query Frame = 1

Query: 3   HHHQHRGKSIHSSERHLFLQGG-NGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 62
           +HHQHRGKSIHSSERH+FLQGG NG GGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG
Sbjct: 2   YHHQHRGKSIHSSERHMFLQGGGNGGGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 61

Query: 63  GPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTV 122
           G DKATPKTVMKIMGI GLTLYHLKSHLQKYRLSKN+HGQANGGSGTN KTGMGWRVGTV
Sbjct: 62  GADKATPKTVMKIMGIPGLTLYHLKSHLQKYRLSKNLHGQANGGSGTN-KTGMGWRVGTV 121

Query: 123 AVSVDQRLGEANGAGRT-----NNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQL 182
           AVSVDQRLGEANGA         NSI V PQ SSQ NK+LQISETIQMQIEVQKRLHEQL
Sbjct: 122 AVSVDQRLGEANGAAAAAAAARTNSIVVGPQPSSQSNKNLQISETIQMQIEVQKRLHEQL 181

Query: 183 EVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAA 242
           EVQRHLQLRIE QGKYLQTVLEKAQETLG QNLGTVGLEAAKVQLSELVSKVSTQCLTAA
Sbjct: 182 EVQRHLQLRIETQGKYLQTVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLTAA 241

Query: 243 FPELHHQQQQQQNQPQRLC-PQQTQPPDCSMDSCLTSCE-ASKDQQ-------------- 302
           FPELH+     Q+Q QR+C  QQ+QPPDCSMDSCLTS E  SKDQQ              
Sbjct: 242 FPELHN-----QSQSQRVCAQQQSQPPDCSMDSCLTSSEGGSKDQQAAQQQQQQQQVLLH 301

Query: 303 NGHLALQPYAIATDRGSSAA--DQLHGLSMGMGLVQGEKG--EGYNGYSPSEGQRFGSTR 362
           N HLAL+PYA   DR SS A    LHGLSM +GLVQGEK   E YNGYS SEGQR   ++
Sbjct: 302 NSHLALRPYA---DRASSGAPDHSLHGLSMSIGLVQGEKAGTEAYNGYSTSEGQRLFGSK 361

Query: 363 KEGVEREKVVEGAFRYRMDL-NAGEDQL----SDNDHAASNTCKMFDLNGFS 384
           +   E     E  FRYRMDL NAGEDQL    S+NDH +S TCKMFDLNGFS
Sbjct: 362 RTTKEAVLEKETGFRYRMDLNNAGEDQLISSNSNNDHTSSTTCKMFDLNGFS 404

BLAST of Cp4.1LG03g09290 vs. NCBI nr
Match: gi|778686069|ref|XP_011652325.1| (PREDICTED: uncharacterized protein LOC101206445 isoform X2 [Cucumis sativus])

HSP 1 Score: 530.4 bits (1365), Expect = 2.6e-147
Identity = 315/407 (77.40%), Positives = 331/407 (81.33%), Query Frame = 1

Query: 3   HHHQHRGKSIHSSERHLFLQGG-NGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 62
           +HHQHRGKSIHSSERH+FLQGG NG GGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG
Sbjct: 2   YHHQHRGKSIHSSERHMFLQGGGNGGGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 61

Query: 63  GPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTV 122
           G DKATPKTVMKIMGI GLTLYHLKSHLQKYRLSKN+HGQANGGSGTN       + GTV
Sbjct: 62  GADKATPKTVMKIMGIPGLTLYHLKSHLQKYRLSKNLHGQANGGSGTN-------KTGTV 121

Query: 123 AVSVDQRLGEANGAG---RTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQLEV 182
           AVSVDQRLGEANGA    RT+N I V PQ +SQ NKSLQISETIQMQIEVQKRLHEQLEV
Sbjct: 122 AVSVDQRLGEANGAAAAARTSN-IVVGPQPTSQSNKSLQISETIQMQIEVQKRLHEQLEV 181

Query: 183 QRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP 242
           QRHLQLRIEAQGKYLQTVLEKAQETLG QNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP
Sbjct: 182 QRHLQLRIEAQGKYLQTVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLTAAFP 241

Query: 243 ELHHQQQQQQNQPQRLC-PQQTQPPDCSMDSCLTSCE-ASKDQQ----------NGHLAL 302
           ELH+     Q+Q QR+C  QQ+QPPDCSMDSCLTS E  SKDQQ          N HLAL
Sbjct: 242 ELHN-----QSQSQRVCAQQQSQPPDCSMDSCLTSSEGGSKDQQAQQQQHVLLHNSHLAL 301

Query: 303 QPYAIATDRGSSAA--DQLHGLSMGMGLVQGEKG--EGYNGYSPSEGQR-FGSTRKEGVE 362
           +PYA   DR SS A    LHGLSM +GLVQGEK   EGYNGYS SEGQR FGS R +   
Sbjct: 302 RPYA---DRASSGAPDHSLHGLSMSIGLVQGEKAGPEGYNGYSTSEGQRLFGSKRTKDAV 361

Query: 363 REKVVEGAFRYRMDL-NAGEDQL----SDNDHAASNTCKMFDLNGFS 384
            EK  E  FRYRMDL NAGEDQL    ++NDH +S TCKMFDLNGFS
Sbjct: 362 MEK--ETGFRYRMDLNNAGEDQLISSNNNNDHTSSTTCKMFDLNGFS 390

BLAST of Cp4.1LG03g09290 vs. NCBI nr
Match: gi|659130752|ref|XP_008465332.1| (PREDICTED: chromatin modification-related protein EAF1 isoform X2 [Cucumis melo])

HSP 1 Score: 524.2 bits (1349), Expect = 1.9e-145
Identity = 308/412 (74.76%), Positives = 324/412 (78.64%), Query Frame = 1

Query: 3   HHHQHRGKSIHSSERHLFLQGG-NGPGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 62
           +HHQHRGKSIHSSERH+FLQGG NG GGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG
Sbjct: 2   YHHQHRGKSIHSSERHMFLQGGGNGGGGDSGLVLSTDAKPRLKWTPDLHDRFVEAVNQLG 61

Query: 63  GPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTGMGWRVGTV 122
           G DKATPKTVMKIMGI GLTLYHLKSHLQKYRLSKN+HGQANGGSGTN       + GTV
Sbjct: 62  GADKATPKTVMKIMGIPGLTLYHLKSHLQKYRLSKNLHGQANGGSGTN-------KTGTV 121

Query: 123 AVSVDQRLGEANGAGRT-----NNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLHEQL 182
           AVSVDQRLGEANGA         NSI V PQ SSQ NK+LQISETIQMQIEVQKRLHEQL
Sbjct: 122 AVSVDQRLGEANGAAAAAAAARTNSIVVGPQPSSQSNKNLQISETIQMQIEVQKRLHEQL 181

Query: 183 EVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCLTAA 242
           EVQRHLQLRIE QGKYLQTVLEKAQETLG QNLGTVGLEAAKVQLSELVSKVSTQCLTAA
Sbjct: 182 EVQRHLQLRIETQGKYLQTVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLTAA 241

Query: 243 FPELHHQQQQQQNQPQRLC-PQQTQPPDCSMDSCLTSCE-ASKDQQ-------------- 302
           FPELH+     Q+Q QR+C  QQ+QPPDCSMDSCLTS E  SKDQQ              
Sbjct: 242 FPELHN-----QSQSQRVCAQQQSQPPDCSMDSCLTSSEGGSKDQQAAQQQQQQQQVLLH 301

Query: 303 NGHLALQPYAIATDRGSSAA--DQLHGLSMGMGLVQGEKG--EGYNGYSPSEGQRFGSTR 362
           N HLAL+PYA   DR SS A    LHGLSM +GLVQGEK   E YNGYS SEGQR   ++
Sbjct: 302 NSHLALRPYA---DRASSGAPDHSLHGLSMSIGLVQGEKAGTEAYNGYSTSEGQRLFGSK 361

Query: 363 KEGVEREKVVEGAFRYRMDL-NAGEDQL----SDNDHAASNTCKMFDLNGFS 384
           +   E     E  FRYRMDL NAGEDQL    S+NDH +S TCKMFDLNGFS
Sbjct: 362 RTTKEAVLEKETGFRYRMDLNNAGEDQLISSNSNNDHTSSTTCKMFDLNGFS 398

BLAST of Cp4.1LG03g09290 vs. NCBI nr
Match: gi|743873244|ref|XP_011034346.1| (PREDICTED: uncharacterized protein LOC105132500 isoform X1 [Populus euphratica])

HSP 1 Score: 367.9 bits (943), Expect = 2.2e-98
Identity = 211/293 (72.01%), Positives = 228/293 (77.82%), Query Frame = 1

Query: 1   MYHHHQHRGKSIHSS-------ERHLFLQGGNGPGGDSGLVLSTDAKPRLKWTPDLHDRF 60
           MYHHHQH+GKSIHSS       ERHLFLQGGNGPG DSGLVLSTDAKPRLKWTPDLH+R 
Sbjct: 1   MYHHHQHQGKSIHSSSRMAIPPERHLFLQGGNGPG-DSGLVLSTDAKPRLKWTPDLHERV 60

Query: 61  VEAVNQLGGPDKATPKTVMKIMGISGLTLYHLKSHLQKYRLSKNMHGQANGGSGTNNKTG 120
           +EAVNQLGG DKATPKTVMK+MGI GLTLYHLKSHLQKYRLSKN+HGQA  GS       
Sbjct: 61  IEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQATIGSS------ 120

Query: 121 MGWRVGTVAVSVDQRLGEANGAGRTNNSIAVAPQASSQPNKSLQISETIQMQIEVQKRLH 180
              ++GTVAV V  R+ EAN      N++++     SQPNKSL  SE +QMQIEVQ+RLH
Sbjct: 121 ---KIGTVAV-VGDRMPEANATHININNLSIG----SQPNKSLHFSEALQMQIEVQRRLH 180

Query: 181 EQLEVQRHLQLRIEAQGKYLQTVLEKAQETLGTQNLGTVGLEAAKVQLSELVSKVSTQCL 240
           EQLEVQRHLQLRIEAQGKYLQ VLEKAQETLG QNLGTVGLEAAKVQLSELVSKVSTQCL
Sbjct: 181 EQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCL 240

Query: 241 TAAFPELHHQQQQQQNQPQRLCPQQ---TQPPDCSMDSCLTSCEASKDQQNGH 284
            + F EL        N  Q LCPQQ   TQP DCSMDSCLTSCE S+ +Q  H
Sbjct: 241 NSTFSEL--------NDLQGLCPQQTPPTQPNDCSMDSCLTSCEGSQKEQEIH 270
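The HSP summary lines in the alignments above (Score, Expect, Identity, Positives) follow a regular pattern, so the reported percentages can be rechecked from the raw counts. The following is a minimal sketch, assuming the two summary lines are available as plain strings; the function name `parse_hsp` and the regular expressions are illustrative and not part of any BLAST tool.

```python
import re

# Patterns for the two HSP summary lines as they appear on this page.
# "Pos\w*" tolerates both "Positives" and misspelled variants of the label.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),"
    r"\s*Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Return HSP statistics, recomputing percentages from the raw counts."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positives, aligned_pos = int(i.group(4)), int(i.group(5))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned_pos, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 530.4 bits (1365), Expect = 2.6e-147",
    "Identity = 315/407 (77.40%), Positives = 331/407 (81.33%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing from the counts rather than trusting the printed percentage is a cheap consistency check when records like this one are scraped from a web page.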

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
PHL9_ARATH | 1.2e-80 | 48.70 | Myb-related protein 2 OS=Arabidopsis thaliana GN=MYR2 PE=1 SV=1
PHLA_ARATH | 9.7e-78 | 61.03 | Myb-related protein 1 OS=Arabidopsis thaliana GN=MYR1 PE=1 SV=1
PHL8_ARATH | 6.6e-50 | 48.00 | Myb family transcription factor PHL8 OS=Arabidopsis thaliana GN=PHL8 PE=2 SV=1
PHL3_ARATH | 4.9e-45 | 55.00 | Protein PHR1-LIKE 3 OS=Arabidopsis thaliana GN=PHL3 PE=1 SV=1
PHL2_ARATH | 2.4e-44 | 46.09 | Protein PHR1-LIKE 2 OS=Arabidopsis thaliana GN=PHL2 PE=1 SV=1

Match Name | E-value | Identity | Description
A0A0A0LG98_CUCSA | 1.9e-152 | 79.12 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G844850 PE=4 SV=1
A0A067JLA4_JATCU | 2.0e-98 | 54.73 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06855 PE=4 SV=1
F6HLA5_VITVI | 2.6e-98 | 58.15 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g07580 PE=4 SV=...
B9I5Z8_POPTR | 3.5e-98 | 71.48 | Myb family transcription factor family protein OS=Populus trichocarpa GN=POPTR_0...
A0A067EMH8_CITSI | 1.9e-96 | 70.49 | Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013552mg PE=4 SV=1

Match Name | E-value | Identity | Description
AT3G04030.3 | 6.9e-82 | 48.70 | Homeodomain-like superfamily protein
AT5G18240.1 | 5.5e-79 | 61.03 | myb-related protein 1
AT1G69580.2 | 1.8e-50 | 46.80 | Homeodomain-like superfamily protein
AT4G13640.2 | 1.2e-44 | 54.19 | Homeodomain-like superfamily protein
AT3G24120.2 | 5.7e-44 | 45.56 | Homeodomain-like superfamily protein

Match Name | E-value | Identity | Description
gi|778686066|ref|XP_011652324.1| | 2.7e-152 | 79.12 | PREDICTED: uncharacterized protein LOC101206445 isoform X1 [Cucumis sativus]
gi|659130748|ref|XP_008465330.1| | 3.9e-151 | 76.70 | PREDICTED: uncharacterized protein LOC103502977 isoform X1 [Cucumis melo]
gi|778686069|ref|XP_011652325.1| | 2.6e-147 | 77.40 | PREDICTED: uncharacterized protein LOC101206445 isoform X2 [Cucumis sativus]
gi|659130752|ref|XP_008465332.1| | 1.9e-145 | 74.76 | PREDICTED: chromatin modification-related protein EAF1 isoform X2 [Cucumis melo]
gi|743873244|ref|XP_011034346.1| | 2.2e-98 | 72.01 | PREDICTED: uncharacterized protein LOC105132500 isoform X1 [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0003677 | DNA binding

Vocabulary: INTERPRO
Term | Definition
IPR025756 | Myb_CC_LHEQLE
IPR017930 | Myb_dom
IPR009057 | Homeobox-like_sf
IPR006447 | Myb_dom_plants
IPR001005 | SANT/Myb

GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005575 | cellular_component
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0003677 | DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG03g09290.1 | Cp4.1LG03g09290.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001005 | SANT/Myb domain | PFAM | PF00249 | Myb_DNA-binding | coord: 42..93, score: 1.
IPR006447 | Myb domain, plants | TIGRFAMs | TIGR01557 | TIGR01557 | coord: 40..95, score: 3.5
IPR009057 | Homeodomain-like | GENE3D | G3DSA:1.10.10.60 | | coord: 38..95, score: 1.1
IPR009057 | Homeodomain-like | unknown | SSF46689 | Homeodomain-like | coord: 39..95, score: 1.77
IPR017930 | Myb domain | PROFILE | PS51294 | HTH_MYB | coord: 37..97, score: 11
IPR025756 | MYB-CC type transcription factor, LHEQLE-containing domain | PFAM | PF14379 | Myb_CC_LHEQLE | coord: 157..203, score: 7.6
None | No IPR available | PANTHER | PTHR31499 | FAMILY NOT NAMED | coord: 1..281, score: 2.4E
None | No IPR available | PANTHER | PTHR31499:SF4 | MYB FAMILY TRANSCRIPTION FACTOR-RELATED | coord: 1..281, score: 2.4E
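Several signature databases in the InterPro table above place a Myb/homeodomain-like hit at nearly the same position. As a rough illustration of how those hits agree, the sketch below computes the region shared by all five hits and the region covered by any of them. The coordinates are copied from the table; the names `MYB_HITS`, `consensus_span` and `union_span` are hypothetical, and the approach is only a simple interval comparison, not a method used by InterPro.

```python
# (source signature, start, end) in protein coordinates, taken from the table.
MYB_HITS = [
    ("PFAM PF00249", 42, 93),
    ("TIGRFAMs TIGR01557", 40, 95),
    ("GENE3D G3DSA:1.10.10.60", 38, 95),
    ("SSF46689", 39, 95),
    ("PROFILE PS51294", 37, 97),
]


def consensus_span(hits):
    """Return the (start, end) region covered by every hit, or None."""
    start = max(h[1] for h in hits)
    end = min(h[2] for h in hits)
    return (start, end) if start <= end else None


def union_span(hits):
    """Return the (start, end) region covered by at least one hit.

    Assumes the hits overlap one another, as they do here.
    """
    return (min(h[1] for h in hits), max(h[2] for h in hits))


print(consensus_span(MYB_HITS))  # region all five signatures agree on
print(union_span(MYB_HITS))      # outermost boundaries of the domain
```

The Myb_CC_LHEQLE hit (coord: 157..203) lies outside both spans, consistent with it being a separate domain downstream of the DNA-binding region.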

The following gene(s) are paralogous to this gene:
Gene | Paralogue | Organism | Block
Cp4.1LG03g09290 | Cp4.1LG08g06680 | Cucurbita pepo (Zucchini) | cpecpeB490

The following block(s) are covering this gene:
Gene | Organism | Block
Cp4.1LG03g09290 | Cucurbita pepo (Zucchini) | cpecpeB450
Cp4.1LG03g09290 | Cucurbita moschata (Rifu) | cmocpeB245
Cp4.1LG03g09290 | Cucurbita moschata (Rifu) | cmocpeB246
Cp4.1LG03g09290 | Bottle gourd (USVL1VR-Ls) | cpelsiB500
Cp4.1LG03g09290 | Watermelon (Charleston Gray) | cpewcgB544
Cp4.1LG03g09290 | Watermelon (97103) v1 | cpewmB595
Cp4.1LG03g09290 | Melon (DHL92) v3.5.1 | cpemeB566
Cp4.1LG03g09290 | Melon (DHL92) v3.6.1 | cpemedB673
Cp4.1LG03g09290 | Silver-seed gourd | carcpeB0983
Cp4.1LG03g09290 | Silver-seed gourd | carcpeB1147
Cp4.1LG03g09290 | Wax gourd | cpewgoB0742