Cp4.1LG14g05400 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g05400
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: myb-like transcription factor family protein
Location: Cp4.1LG14 : 1052388 .. 1054706 (+)
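The Location field packs a sequence ID, a start, an end, and a strand into one string. A minimal sketch of how it could be parsed is given here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'SEQID : START .. END (STRAND)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG14 : 1052388 .. 1054706 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the interval spans 2319 positions on the plus strand.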
The following sequences are available for this feature:

Gene sequence (with intron)

GAATCGCCGAGAAAATGTTCGCATTGCGGATTAAACGGACATAATTCCCGAACGTGTAGTAATTTTTTGAAGGGGAATAATAATTATAGTAATAGTAGTTATTATTGCGTGAAGTTATTTGGAGTTAATTTAATGGAAAACAGAGACGAATCTATGAGGAAGAGTCTCAGCATGGGGAATTTGAATCTTCACTCATGCAATAATGTTCTTGATCTTAATACCGCCGCCGCCGTGTACAACGTCGCCGATGATAATTCCGCCTCCGCCGACAACGCTGGGTATCTTTCCGATGGGTTTATTCATAACAAGAGACGCAAAGCCGCTCACGAGAGGAAGAAAGGTTCGTAAGCTAAAATTTTATTTTCTTTTATTAATTTTAATTTTTGGTTTAGTTTCTCCCATACTTTTTTTTTTTTTTTTTTNAGAACCCTTAGAAAATACGAAATTATGACAGTTATGAGAAACTTAAGTTGGCTGACAAAAATGGTAACCTTCAGCGGATGGGGTAAGCTTTTTGCTTCCCTTTTTTTGTCTCTTGTCTCTTTTTTTTTTTTTTTTTTTAATGGTAAGGATCCTTAATTATTGATTATCTATGCTCGAGTTTACTATATTTGTTCTTACCATTAGTAGTTTTTTTTTTCGAGTTCGATAATTACGAGGTTGCGGGTGGGGATCAAACCATCGAGATGGTAATTGGTGCCTCGCTCTCAAATTGGTAGGCATATATACACTTCAATAATAAATAAAAAAATTGTCCGAAGAACATAACTCAATTGTGTTAAAGCTCGGGTAACCTTATTTCAGAAAAAAATTATGTTCATCCTGTTAGAAAACCGAGAAAATTTATATTTTTGGATCCTAAAAATTAGATTGTTCGAAAATTTGTATAAGAATTTTAGATTGATGTGACATGTATTATGCAGGAAAGCCATGGACCGAAGAGGAACATAGAACGTTTTTGGCTGGTTTGAAGAAGCTCGGGAAAGGGGATTGGAGAGGGATTTCGAAGAACTTTGTAACCACAAGAACACCGACCCAAGTAGCTAGTCATGCACAAAAGTACTTTTTGAGGAAGATGCATGCAAACGAAAAGAAGAAACGTAGAGCCAGTCTTTTTGACATGCCTGAAATCAAAGTAATATTTGGTTTTTTTCGTGCGCTTATTTCTTAACCGGACCTCGAGTTTTGTCGAATTTTTTTGTGATTATTAATGTTATTTTTAAAAAATACAGAATAATGTTCCCCAAGATTGCCAAGCTTCTGGAGAAACGTCACTGCCAAAGAACAACAGTTCTGAGAATCAACAACAGGTGAAATAGATTAAACATTCTTGGATAATTTTCTTTAAAAATAAATTAAATTTACAACCATTCTTTGTTTTTAATCTATTTTCAGTAATTCTAGTTCTGAAAAAATAAAATGGTTTTTTTTAACTGTTTTTTTTTTTTTTACTTTTTTATTATTTACTGAAAATAAATTTTGTTGCAGGTCAATAATTTACCAGCACAACTTATCAATCGATTCCCTCATCTTTGTTTGGATGCTCCTCATTTTGTTCCTCCAGCAACTGGATCGGCAGCCTCGGCTGCTGCAAATTTGCATGGGATTCCTTATGTGGTAAGAACACTTTCAAAAGCTTATTTATTTATTTTTTTTTTAAATATTCGCTTTTTTTTTTTTACGCCAAGTAATTCTAGCTAATTAAGACGTGTATTATCCGCAAATATGTCAAATCATATGGTTCAAATCTTCACCTACACATATTATTCAATTAGAAGAGACACTTCTAATGGTTCGGATCGAGTCGACCTCATTTATTTAGTAAACCGTACTTGAACAACCGGATTTGATGTGATGTCCCAGGTTGGGGAGGAGAAAAAATCACTCTTTATCAAGGTGTGGAAACTTTCGTGTGAAAATCGGAAAGAAAAAGCCCAAATAAGACAATATCTACCGACGTTGAATCTAGAACAATACATCTAATTTATGGAAATATTTGG
TTTTGGGCAGGTGGGAGTATCCCCAAATGTTCCAATACTGCCATTGGTGAATATTGGAAGAAGAGGAGGAGGAGGAGTTATGGCAATGCCAAAGAGCCCAAGGAGTCCCAACGCACCGATGGCTGGCCATCCATCAGGCATTCCTCCTTCGCCAAGAACCTCCCCAAGCAGGCCCATATTGCAGCAGTTGCAGCCGTCTGCCATAGCAGTGGCAGCCTCTAATTTCGAGGCAGATGCCCTGGAGCTCAAGATTGGGCTGCCTCAGTCTCCACAGACACAAAACTTGTCATCTCAAACTTCTGGAGCAATTAGAGTTATA

mRNA sequence

GAATCGCCGAGAAAATGTTCGCATTGCGGATTAAACGGACATAATTCCCGAACGTGTAGTAATTTTTTGAAGGGGAATAATAATTATAGTAATAGTAGTTATTATTGCGTGAAGTTATTTGGAGTTAATTTAATGGAAAACAGAGACGAATCTATGAGGAAGAGTCTCAGCATGGGGAATTTGAATCTTCACTCATGCAATAATGTTCTTGATCTTAATACCGCCGCCGCCGTGTACAACGTCGCCGATGATAATTCCGCCTCCGCCGACAACGCTGGGTATCTTTCCGATGGGTTTATTCATAACAAGAGACGCAAAGCCGCTCACGAGAGGAAGAAAGGAAAGCCATGGACCGAAGAGGAACATAGAACGTTTTTGGCTGGTTTGAAGAAGCTCGGGAAAGGGGATTGGAGAGGGATTTCGAAGAACTTTGTAACCACAAGAACACCGACCCAAGTAGCTAGTCATGCACAAAAGTACTTTTTGAGGAAGATGCATGCAAACGAAAAGAAGAAACGTAGAGCCAGTCTTTTTGACATGCCTGAAATCAAAAATAATGTTCCCCAAGATTGCCAAGCTTCTGGAGAAACGTCACTGCCAAAGAACAACAGTTCTGAGAATCAACAACAGGTCAATAATTTACCAGCACAACTTATCAATCGATTCCCTCATCTTTGTTTGGATGCTCCTCATTTTGTTCCTCCAGCAACTGGATCGGCAGCCTCGGCTGCTGCAAATTTGCATGGGATTCCTTATGTGAGCCCAAGGAGTCCCAACGCACCGATGGCTGGCCATCCATCAGGCATTCCTCCTTCGCCAAGAACCTCCCCAAGCAGGCCCATATTGCAGCAGTTGCAGCCGTCTGCCATAGCAGTGGCAGCCTCTAATTTCGAGGCAGATGCCCTGGAGCTCAAGATTGGGCTGCCTCAGTCTCCACAGACACAAAACTTGTCATCTCAAACTTCTGGAGCAATTAGAGTTATA

Coding sequence (CDS)

GAATCGCCGAGAAAATGTTCGCATTGCGGATTAAACGGACATAATTCCCGAACGTGTAGTAATTTTTTGAAGGGGAATAATAATTATAGTAATAGTAGTTATTATTGCGTGAAGTTATTTGGAGTTAATTTAATGGAAAACAGAGACGAATCTATGAGGAAGAGTCTCAGCATGGGGAATTTGAATCTTCACTCATGCAATAATGTTCTTGATCTTAATACCGCCGCCGCCGTGTACAACGTCGCCGATGATAATTCCGCCTCCGCCGACAACGCTGGGTATCTTTCCGATGGGTTTATTCATAACAAGAGACGCAAAGCCGCTCACGAGAGGAAGAAAGGAAAGCCATGGACCGAAGAGGAACATAGAACGTTTTTGGCTGGTTTGAAGAAGCTCGGGAAAGGGGATTGGAGAGGGATTTCGAAGAACTTTGTAACCACAAGAACACCGACCCAAGTAGCTAGTCATGCACAAAAGTACTTTTTGAGGAAGATGCATGCAAACGAAAAGAAGAAACGTAGAGCCAGTCTTTTTGACATGCCTGAAATCAAAAATAATGTTCCCCAAGATTGCCAAGCTTCTGGAGAAACGTCACTGCCAAAGAACAACAGTTCTGAGAATCAACAACAGGTCAATAATTTACCAGCACAACTTATCAATCGATTCCCTCATCTTTGTTTGGATGCTCCTCATTTTGTTCCTCCAGCAACTGGATCGGCAGCCTCGGCTGCTGCAAATTTGCATGGGATTCCTTATGTGAGCCCAAGGAGTCCCAACGCACCGATGGCTGGCCATCCATCAGGCATTCCTCCTTCGCCAAGAACCTCCCCAAGCAGGCCCATATTGCAGCAGTTGCAGCCGTCTGCCATAGCAGTGGCAGCCTCTAATTTCGAGGCAGATGCCCTGGAGCTCAAGATTGGGCTGCCTCAGTCTCCACAGACACAAAACTTGTCATCTCAAACTTCTGGAGCAATTAGAGTTATA

Protein sequence

ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGNLNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLFDMPEIKNNVPQDCQASGETSLPKNNSSENQQQVNNLPAQLINRFPHLCLDAPHFVPPATGSAASAAANLHGIPYVSPRSPNAPMAGHPSGIPPSPRTSPSRPILQQLQPSAIAVAASNFEADALELKIGLPQSPQTQNLSSQTSGAIRVI
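
The protein sequence above corresponds to a frame-1 translation of the coding sequence under the standard genetic code. The sketch below shows one way to reproduce that translation; `translate` is a hypothetical helper, not part of the database, and it marks any codon it cannot resolve (for example one containing N) as X.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 30 nucleotides of the CDS listed above.
print(translate("GAATCGCCGAGAAAATGTTCGCATTGCGGA"))  # ESPRKCSHCG
```

The ten residues printed match the start of the protein sequence listed on this page.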
BLAST of Cp4.1LG14g05400 vs. Swiss-Prot
Match: MY1R1_SOLTU (Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 2.8e-25
Identity = 101/281 (35.94%), Positives = 141/281 (50.18%), Query Frame = 1

Query: 37  VKLFGVNLMENRDESMRKSLSMGNLNLHSCNNVLDLNTAAAVYNVADDNSAS--ADNAGY 96
           + LFGV +   + + MRKS+S+ +L+ +   N  + N      N  D+N +S  A + GY
Sbjct: 24  IMLFGVRV---KVDPMRKSVSLNDLSQYEHPNANNNN------NGGDNNESSKVAQDEGY 83

Query: 97  LSDGFIHNKRRKAAHERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVA 156
            S       +  +  ERK+G PWTEEEH+ FL GL+K+GKGDWRGIS+NFV TRTPTQVA
Sbjct: 84  ASADDAVQHQSNSGRERKRGVPWTEEEHKLFLLGLQKVGKGDWRGISRNFVKTRTPTQVA 143

Query: 157 SHAQKYFLRKMHANEKKKRRASLFDMPEIKNNVPQDCQASGETSLPKNNSSENQQQV--- 216
           SHAQKYFLR+ + N +++RR+SLFD+             +   S+      EN+Q++   
Sbjct: 144 SHAQKYFLRRSNLN-RRRRRSSLFDI------------TTDSVSVMPIEEVENKQEIPVV 203

Query: 217 --NNLPAQLINRFPHLCLDAPHFVPPATGSAASAAANL---HG--------IPYVSPRSP 276
               LP    N FP      P   P     +      L   HG        +P  S  +P
Sbjct: 204 APATLPTTKTNAFPVAPTVGPIIFPVQIDKSREYPTLLRHDHGNSSMLVGPVPMFSMPNP 263

Query: 277 NAPM---AGHPSGIPPSPRTSPSRPILQQLQPSAIAVAASN 297
           +  +   A H S I PS  +      L Q Q S+   +A N
Sbjct: 264 STAIDLNANHNSTIEPSSLSLRLSLSLDQGQASSTRHSAYN 282

BLAST of Cp4.1LG14g05400 vs. Swiss-Prot
Match: DIV_ANTMA (Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA PE=2 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 2.2e-22
Identity = 59/117 (50.43%), Positives = 77/117 (65.81%), Query Frame = 1

Query: 94  YLSDGFIHNKRRKAAHERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQV 153
           Y + G   +  R +  ERKKG PWTEEEH+ FL GLKK GKGDWR IS+NFV TRTPTQV
Sbjct: 111 YGTGGRKSSSGRPSEQERKKGVPWTEEEHKLFLMGLKKYGKGDWRNISRNFVITRTPTQV 170

Query: 154 ASHAQKYFLRKMHANEKKKRRASLFDMPEIKNNVPQDCQASGETSLPKNNSSENQQQ 211
           ASHAQKYF+R++ +  K KRRAS+ D+  +  N+  +   S +   P ++   +  Q
Sbjct: 171 ASHAQKYFIRQL-SGGKDKRRASIHDITTV--NLSDNQTPSPDNKKPPSSPDHSMAQ 224

BLAST of Cp4.1LG14g05400 vs. Swiss-Prot
Match: MYBI_DICDI (Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 5.9e-07
Identity = 41/114 (35.96%), Positives = 60/114 (52.63%), Query Frame = 1

Query: 103 KRRKAAHERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFL 162
           ++ K + ++K+ + WT EEH  F+  L K G  D + IS+ +V+TR PTQV +HAQKYFL
Sbjct: 161 EQEKQSEKKKQSRYWTPEEHSRFIEALSKYGHKDVKSISQ-YVSTRNPTQVRTHAQKYFL 220

Query: 163 -------RKMHANEKKKRRASLFD--MPEIKNNVPQDCQASGETSLPKNNSSEN 208
                  RK+ + E     A   D  + E  N+     Q S  ++ P  NS  N
Sbjct: 221 RIDRERGRKLESKESINGGADKDDDWLREEYNDEGSPTQYSSCSNSPTTNSVAN 273

BLAST of Cp4.1LG14g05400 vs. Swiss-Prot
Match: MYBJ_DICDI (Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1)

HSP 1 Score: 52.8 bits (125), Expect = 8.5e-06
Identity = 68/247 (27.53%), Positives = 112/247 (45.34%), Query Frame = 1

Query: 47  NRDESMRKSLSMGNLNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRK 106
           N    +  S+   N N +S +N  + N      N  ++ + +   A   S G  +   +K
Sbjct: 313 NTSTPITVSIINNNNNNNSNSNNNNNNNNNNNNNNTNNTTTTTTTATTTSGGKTNPTGKK 372

Query: 107 AAHERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMH 166
            +   K+G  WT+EEH  FL G++  GKG W+ I++ FV TRTPTQ+ SHAQKY+LR+  
Sbjct: 373 TS--LKQG--WTKEEHIRFLNGIQIHGKGAWKEIAQ-FVGTRTPTQIQSHAQKYYLRQKQ 432

Query: 167 ANEKKK--RRASLFDMPE------IKNNVPQDCQASGETSLPKNNSSENQQQVNNLPAQL 226
             + K+     SL D+ +       KNNV ++ Q   E    K   ++++        ++
Sbjct: 433 ETKNKRSIHDLSLQDLIDDNLNNSNKNNVDKNKQDDKEKKTQKTKKTKSKSSTKG-DEEM 492

Query: 227 INRFPHLCLDAPHFVPPATGSAASAAANLHGIPYVSPRSPNAPMAGHPSGIPPSPRTSPS 286
           I +   L         P      +   N +  P  S  SP +     PS  P S ++S +
Sbjct: 493 ITQQQQLQQQPQQ--QPQQKQPPTIITNFNTTPTSSQSSPKSNSPSSPSS-PQSFQSSQT 550

BLAST of Cp4.1LG14g05400 vs. TrEMBL
Match: A0A0A0KXN0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G025050 PE=4 SV=1)

HSP 1 Score: 466.8 bits (1200), Expect = 2.1e-128
Identity = 258/354 (72.88%), Positives = 280/354 (79.10%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGN 60
           ESPRKCSHCG NGHNSRTC NF KGN   SN+ YYCVKLFGVNLMENRDESMRKSLSMGN
Sbjct: 4   ESPRKCSHCGFNGHNSRTCGNFSKGN---SNNYYYCVKLFGVNLMENRDESMRKSLSMGN 63

Query: 61  LNLHSCNNVLDLNTAAAVYNVADDNSASA---DNAGYLSDGFIHNKRRKAAHERKKGKPW 120
           LNLHSCNNVLDLN    V NV  DN+A+A   D+AGYLSDG IHNKRRKAAHERKKGKPW
Sbjct: 64  LNLHSCNNVLDLNNTTTVNNVTGDNTAAAASTDDAGYLSDGLIHNKRRKAAHERKKGKPW 123

Query: 121 TEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASL 180
           +EEEHRTFL GLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKM+AN+KKKRRASL
Sbjct: 124 SEEEHRTFLIGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL 183

Query: 181 FDMPEIKNNVPQDCQASGETS----LPKNNSSENQQQVNNLPAQLINRFPHLCLDAPHFV 240
           FD+PEIKNN  +DC ASGE      LPKNNS +NQ QVNNL  QLINRFPHLCLD PHF+
Sbjct: 184 FDIPEIKNNFSRDCPASGELPSQILLPKNNSPDNQSQVNNLGTQLINRFPHLCLDTPHFI 243

Query: 241 PPATGSAASAAA-----------NLHGIPYV----SPRSPNAPMAGHPSGIPPSPRTSPS 300
           P  T  ++S ++           N + IP V    S  SPNA MA HPSGIP SPR+SP+
Sbjct: 244 PQQTNGSSSPSSIPFVVGVSPNNNNNNIPLVNIGRSRVSPNAAMAAHPSGIPHSPRSSPT 303

Query: 301 RPILQQLQPSAIAV--AASNFE--ADALELKIGLPQSPQTQNLSSQTSGAIRVI 329
           R +L Q   SA+A+  AAS F+  ADALELKIGLPQSPQ  NLSSQT GAIRVI
Sbjct: 304 RTLLMQPGASALAMAAAASTFDQSADALELKIGLPQSPQPNNLSSQTPGAIRVI 354

BLAST of Cp4.1LG14g05400 vs. TrEMBL
Match: M5X281_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010092mg PE=4 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 1.4e-60
Identity = 160/330 (48.48%), Positives = 194/330 (58.79%), Query Frame = 1

Query: 4   RKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGNL-- 63
           RKCSHCG NGHNSRTC+N     + +  +   C+KLFGVN+ME  D+SM+KS SMGNL  
Sbjct: 2   RKCSHCGHNGHNSRTCNN---NGHGHGQNKGVCLKLFGVNIMEKEDDSMKKSYSMGNLQA 61

Query: 64  --NLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWTE 123
             N    NNV+ ++                 +AGYLSDG IHNK+ KAAHERKKG+PWTE
Sbjct: 62  AGNADHNNNVVTID----------------HDAGYLSDGLIHNKKHKAAHERKKGRPWTE 121

Query: 124 EEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLFD 183
           EEHR FLAGLKKLGKGDWRGI++NFVTTRTPTQVASHAQKYFLR+    +K+KRR+SLFD
Sbjct: 122 EEHRVFLAGLKKLGKGDWRGIARNFVTTRTPTQVASHAQKYFLRQA-TYDKRKRRSSLFD 181

Query: 184 M--PEIKNNVPQDCQASGETSLPKNNSSENQQQVNNLPAQLINRFPHLCLDAPHFVPPAT 243
           M   E+   +P+                       N+P        H    + H++    
Sbjct: 182 MQFKELSMGIPE-----------------------NVP---YTPMMHFARPSYHYMIKTH 241

Query: 244 GSAASAAANLHGIPYVSPRSPNAPMAGHPSGIPPSPRTSPSRPILQQLQPSAIAVAASNF 303
           G+ A+ A      P +S          HPSGI PSPR+ PS P +     +     +S  
Sbjct: 242 GNFATCA------PVIS----------HPSGI-PSPRSLPSSPSM-----AGRIGMSSPA 263

Query: 304 EADALELKIGLPQSPQTQNLSSQTSGAIRV 328
           E DALELKIG PQ  Q  NLSS TSGAIRV
Sbjct: 302 EKDALELKIGQPQPSQGANLSSPTSGAIRV 263

BLAST of Cp4.1LG14g05400 vs. TrEMBL
Match: W9S9L1_9ROSA (Transcription factor OS=Morus notabilis GN=L484_017072 PE=4 SV=1)

HSP 1 Score: 235.0 bits (598), Expect = 1.3e-58
Identity = 164/358 (45.81%), Positives = 205/358 (57.26%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSN---FLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLS 60
           E+ RKCSHCG  GHNSRTC+N    +   N   N + +  KLFGVN++E  ++SM+KS S
Sbjct: 4   EAQRKCSHCGQQGHNSRTCNNRNNVVSPRNRNGNGNGHSFKLFGVNIVEGAEDSMKKSRS 63

Query: 61  MGNLNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPW 120
           MGNL   S     +      V +  DD+      AGYLSDG +HN +RKAA ERK+GKPW
Sbjct: 64  MGNLAALSGGQGKN-----DVVSGGDDHDDHDPEAGYLSDGVLHNVKRKAARERKRGKPW 123

Query: 121 TEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASL 180
           TEEEHRTFLAG+ KLGKGDWRGIS  FVTTRTPTQVASHAQKYF+RK    ++KKRR+SL
Sbjct: 124 TEEEHRTFLAGMNKLGKGDWRGISSKFVTTRTPTQVASHAQKYFIRKQAPKDRKKRRSSL 183

Query: 181 FDMPEIKNN---VPQDCQASG---ETSLPKNNSSENQQQVNNLPAQLINRFPHLCLD--- 240
           FDMP  +++    PQ  Q S     T+   +  S +Q    N P++++NRFP LCLD   
Sbjct: 184 FDMPFTQSDDVVPPQGHQVSSMIKPTAETSSRPSRSQGLAENNPSEILNRFPQLCLDNYP 243

Query: 241 -------APHFVPPATGSAASAAANLHGIPYVSPRSP----NAP-MAGHPSGIPPSPRTS 300
                  AP+++ P          N+   P ++   P    N P + G      P   T 
Sbjct: 244 VPVRPGVAPNYLVP---YGVGFPGNVQPWPLINIGRPSYLYNVPNVYGSVDNGVPVVVTH 303

Query: 301 PS---RPILQQLQPSAIAVAASNFEAD---ALELKIGLPQSPQTQNLSSQ-TSGAIRV 328
           PS    P      PS  A A+ +  AD    LEL IG PQS Q   LSS   SGAI+V
Sbjct: 304 PSGIPPPRSPPTSPSIKAAASKDSSADQTNGLELTIGQPQSKQGSELSSSPASGAIKV 353

BLAST of Cp4.1LG14g05400 vs. TrEMBL
Match: A0A067JSE9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26305 PE=4 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 3.4e-54
Identity = 167/393 (42.49%), Positives = 207/393 (52.67%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGN 60
           E  RKCSHCG NGHNSRTC++   G           VKLFGVN+ E +++ M+KS S+GN
Sbjct: 4   EGTRKCSHCGQNGHNSRTCNHGKGGAG---------VKLFGVNIFEKQEQPMKKSASLGN 63

Query: 61  LNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWTEE 120
           L                  ++ D+N+    + GYLSDG+I+ KR KAA+ERKKGKPWTEE
Sbjct: 64  LE-----------------SLIDNNAHHHVDEGYLSDGYINFKRGKAANERKKGKPWTEE 123

Query: 121 EHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLFDM 180
           EHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKY+LR+  A + KKRR+SLFDM
Sbjct: 124 EHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYYLRQASA-DIKKRRSSLFDM 183

Query: 181 PEIKNNVPQDCQASGETSLPKNNSSENQQQVNN---LP--------------AQLINRFP 240
             +K   P+   +     LP N+SS+ +Q  ++   LP              AQ++NRFP
Sbjct: 184 -TLKE--PKVLTSQERPILPSNSSSQVKQASSSSLALPLRTTTDIPARAITSAQILNRFP 243

Query: 241 HLCLDAPHFVPPATGSAASAAANLHGIPYVSPRSPNAPMAGHPSGIPP--SPRTSPSRPI 300
           HLCLD P   P      A++  N  GIPY         M G P         RT+ + P+
Sbjct: 244 HLCLDTPAIGP---AYLATSVPNYTGIPY---------MLGFPDDGRSFMGARTASAAPL 303

Query: 301 LQQLQPSAIAVA-------ASNFEADA--------------------------------- 329
           LQ +  +   +A          F A                                   
Sbjct: 304 LQMMHYNYSRLAYPFPPNSQGRFAATCVPLTAHPSGIPAPRSYPLGFLQQGSSTSPTKKE 354

BLAST of Cp4.1LG14g05400 vs. TrEMBL
Match: A0A061G076_THECC (Myb-like transcription factor family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_014838 PE=4 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 1.1e-52
Identity = 162/360 (45.00%), Positives = 196/360 (54.44%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNL--MENRDESMRKSLSM 60
           E+ RKCSHCG NGHNSRTC    KG          CVKLFGVN+  +E ++  M+KS SM
Sbjct: 4   ETGRKCSHCGHNGHNSRTCHG--KG----------CVKLFGVNISAVEKQESFMKKSFSM 63

Query: 61  GNLNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWT 120
            +L  H            A YN   +N+ S D+ GYLSDG IH+++  AA ERK+GKPWT
Sbjct: 64  ESLRSHH-----------AEYN---NNAPSVDD-GYLSDGQIHSRKSNAARERKRGKPWT 123

Query: 121 EEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLF 180
           EEEHR FLAGL+KLGKGDWRGISK FVTTRTPTQVASHAQKYFLR+   N+KKKRR SLF
Sbjct: 124 EEEHRIFLAGLRKLGKGDWRGISKKFVTTRTPTQVASHAQKYFLRQA-GNDKKKRRPSLF 183

Query: 181 DM--PEIKNNVPQDCQASGETSLPKNNSSENQQQVNNLPAQLINRFPHLCLDAPHFVPPA 240
           DM   E+++N      AS   S  +  + ++  QV   P+ + NRFPHLCLD      P 
Sbjct: 184 DMAFQELESN------ASPPGSPAEQTTRDSSDQV-KAPSPIANRFPHLCLDD----RPV 243

Query: 241 TGSAASAAANLHGIPYVSPRSPNAPMAGHPSG-IPPSPRTSPSRPILQQLQ--------- 300
           T   AS     H  P    R       G P+G + P  +  PS P L  +          
Sbjct: 244 TSLTAS-----HSFPTYYHRIQPLQAGGAPNGQVFPEAKMMPSLPFLHAMNYAGLHYGYM 303

Query: 301 ---------------PSAIAVAASNF---------EADALELKIGLPQSPQTQNLSSQTS 323
                          PS  +V  S F         E D LELKIG PQS +  ++ SQ S
Sbjct: 304 AKALGCAPAAHPSGIPSPWSVQHSMFRAGPGASPAEKDLLELKIGPPQSSKNTSMLSQAS 319

BLAST of Cp4.1LG14g05400 vs. TAIR10
Match: AT5G61620.1 (AT5G61620.1 myb-like transcription factor family protein)

HSP 1 Score: 191.0 bits (484), Expect = 1.1e-48
Identity = 148/336 (44.05%), Positives = 194/336 (57.74%), Query Frame = 1

Query: 4   RKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMEN-----RDESMRKSLSM 63
           + CSHCG NGHN+RTC N   G N  S      VKLFGVN+  +        ++RKSLS+
Sbjct: 10  KTCSHCGHNGHNARTCLN---GVNKAS------VKLFGVNISSDPIRPPEVTALRKSLSL 69

Query: 64  GNLNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWT 123
           GNL+    N+           N + D  A+ D+ GY SDG IH+K+ K AHE+KKGKPWT
Sbjct: 70  GNLDALLANDES---------NGSGDPIAAVDDTGYHSDGQIHSKKGKTAHEKKKGKPWT 129

Query: 124 EEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLF 183
           EEEHR FL GL KLGKGDWRGI+K+FV+TRTPTQVASHAQKYF+R ++ N+K+KRRASLF
Sbjct: 130 EEEHRNFLIGLNKLGKGDWRGIAKSFVSTRTPTQVASHAQKYFIR-LNVNDKRKRRASLF 189

Query: 184 DMP-EIKNNVPQDCQASGETSLPKNNSSENQQQV--NNLPAQLINRFPHLCLD-APHF-- 243
           D+  E +    ++ Q +   + PK   +  QQ V   +   ++ NRF +L ++  P +  
Sbjct: 190 DISLEDQKEKERNSQDASTKTPPKQPITGIQQPVVQGHTQTEISNRFQNLSMEYMPIYQP 249

Query: 244 VPPATGSAASAAANLHGIPYVSPRSPNAPMAGHPSGIPPSPRTSPSRPILQQLQPSAIAV 303
           +PP            + + Y +P+ P   +  HPSGI P PR  P    L   QPS    
Sbjct: 250 IPPYYNFPPIMYHPNYPMYYANPQVPVRFV--HPSGI-PVPRHIPIG--LPLSQPSE--- 309

Query: 304 AASNFEADALELKIGLPQSPQTQNLSSQTS-GAIRV 328
           A++    D L+L IGLP  PQ    S  T  G I V
Sbjct: 310 ASNMTNKDGLDLHIGLP--PQATGASDLTGHGVIHV 316

BLAST of Cp4.1LG14g05400 vs. TAIR10
Match: AT5G47390.1 (AT5G47390.1 myb-like transcription factor family protein)

HSP 1 Score: 161.4 bits (407), Expect = 9.5e-40
Identity = 116/290 (40.00%), Positives = 157/290 (54.14%), Query Frame = 1

Query: 4   RKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGNLNL 63
           R+CSHC  NGHNSRTC N  +G           VKLFGV L E    S+RKS SMGNL+ 
Sbjct: 3   RRCSHCNHNGHNSRTCPN--RG-----------VKLFGVRLTEG---SIRKSASMGNLSH 62

Query: 64  HSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWTEEEHR 123
           ++ +      T +       D        GY S+ F+      ++ ERKKG PWTEEEHR
Sbjct: 63  YTGSGSGGHGTGSNTPGSPGDVPDHVAGDGYASEDFVAGS--SSSRERKKGTPWTEEEHR 122

Query: 124 TFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLFDM-PE 183
            FL GL+KLGKGDWRGIS+N+VTTRTPTQVASHAQKYF+R+ + + ++KRR+SLFDM P+
Sbjct: 123 MFLLGLQKLGKGDWRGISRNYVTTRTPTQVASHAQKYFIRQSNVS-RRKRRSSLFDMVPD 182

Query: 184 IKNNVPQDCQASGETSLPKNNSSENQQQVNNLPAQLINRFPHL-------CLDAPHFV-- 243
              ++P D Q   E ++P     +    ++   A      P +        +D+ +    
Sbjct: 183 EVGDIPMDLQEPEEDNIPVETEMQGADSIHQTLAPSSLHAPSILEIEECESMDSTNSTTG 242

Query: 244 -PPATGSAASAAANLHGIPYVSPRSPNAPMAGHPSGIPPSPRTSPSRPIL 283
            P AT +AAS+++ L     +             S + P P+   S PIL
Sbjct: 243 EPTATAAAASSSSRLEETTQLQ------------SQLQPQPQLPGSFPIL 261

BLAST of Cp4.1LG14g05400 vs. TAIR10
Match: AT1G70000.1 (AT1G70000.1 myb-like transcription factor family protein)

HSP 1 Score: 153.3 bits (386), Expect = 2.6e-37
Identity = 90/179 (50.28%), Positives = 108/179 (60.34%), Query Frame = 1

Query: 4   RKCSHCGLNGHNSRTCSNFLK--GNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGNL 63
           R CS CG NGHNSRTC   +   G+NN        + LFGV + E      RKS+SM NL
Sbjct: 3   RSCSQCGNNGHNSRTCPTDITTTGDNNDKGGGEKAIMLFGVRVTEASSSCFRKSVSMNNL 62

Query: 64  NLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWTEEE 123
           +                    D N    D+ GY SD  +H   R    ERK+G PWTEEE
Sbjct: 63  SQFD--------------QTPDPNPT--DDGGYASDDVVHASGRN--RERKRGTPWTEEE 122

Query: 124 HRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLFDM 181
           HR FL GL K+GKGDWRGIS+NFV TRTPTQVASHAQKYFLR+ + N +++RR+SLFD+
Sbjct: 123 HRLFLTGLHKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLRRTNQN-RRRRRSSLFDI 162

BLAST of Cp4.1LG14g05400 vs. TAIR10
Match: AT5G56840.1 (AT5G56840.1 myb-like transcription factor family protein)

HSP 1 Score: 141.4 bits (355), Expect = 1.0e-33
Identity = 90/194 (46.39%), Positives = 107/194 (55.15%), Query Frame = 1

Query: 4   RKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDE------------S 63
           R+CSHCG  GHNSRTCS++              V+LFGV+L                  +
Sbjct: 3   RRCSHCGNVGHNSRTCSSY----------QTRVVRLFGVHLDTTSSSPPPPPPPSILAAA 62

Query: 64  MRKSLSMGNLNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHER 123
           ++KS SM  L   S                    S+S+  AGYLSDG  H        +R
Sbjct: 63  IKKSFSMDCLPACS--------------------SSSSSFAGYLSDGLAHK-----TPDR 122

Query: 124 KKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKK 183
           KKG PWT EEHRTFL GL+KLGKGDWRGIS+NFV T++PTQVASHAQKYFLR+      K
Sbjct: 123 KKGVPWTAEEHRTFLIGLEKLGKGDWRGISRNFVVTKSPTQVASHAQKYFLRQTTTLHHK 161

Query: 184 KRRASLFDMPEIKN 186
           +RR SLFDM    N
Sbjct: 183 RRRTSLFDMVSAGN 161

BLAST of Cp4.1LG14g05400 vs. TAIR10
Match: AT3G16350.1 (AT3G16350.1 Homeodomain-like superfamily protein)

HSP 1 Score: 139.0 bits (349), Expect = 5.1e-33
Identity = 101/238 (42.44%), Positives = 136/238 (57.14%), Query Frame = 1

Query: 4   RKCSHCGLNGHNSRTCSNFLKGNNNYS-----------NSSYYCVKLFGVNLMENRDESM 63
           R+CSHC  NGHNSRTC     G    S           + S   VKLFGV L +     +
Sbjct: 3   RRCSHCSNNGHNSRTCPTRGGGTCGGSGGGGGGGGGGGSGSSSAVKLFGVRLTDG--SII 62

Query: 64  RKSLSMGNLNL------HSCNNVLDLNTAAAVYNVAD----DNSASAD---NAGYLSDGF 123
           +KS SMGNL+        + ++ L  ++  A  N+ D    D++  ++   N GYLSD  
Sbjct: 63  KKSASMGNLSALAVAAAAATHHRLSPSSPLATSNLNDSPLSDHARYSNLHHNEGYLSDDP 122

Query: 124 IHNKRRKAAH-ERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQ 183
            H         ERK+G PWTEEEHR FL GL+KLGKGDWRGIS+N+VT+RTPTQVASHAQ
Sbjct: 123 AHGSGSSHRRGERKRGVPWTEEEHRLFLVGLQKLGKGDWRGISRNYVTSRTPTQVASHAQ 182

Query: 184 KYFLRKMHANEKKKRRASLFDMPEIKNNVPQDCQASGETSLPKNNSSENQQQVNNLPA 217
           KYF+R   ++ ++KRR+SLFDM      V        E +L  ++ S+  ++ + LP+
Sbjct: 183 KYFIRHT-SSSRRKRRSSLFDM-VTDEMVTDSSPTQEEQTLNGSSPSKEPEKKSYLPS 236

BLAST of Cp4.1LG14g05400 vs. NCBI nr
Match: gi|778697489|ref|XP_004146936.2| (PREDICTED: myb-like protein I [Cucumis sativus])

HSP 1 Score: 466.8 bits (1200), Expect = 3.0e-128
Identity = 258/354 (72.88%), Positives = 280/354 (79.10%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGN 60
           ESPRKCSHCG NGHNSRTC NF KGN   SN+ YYCVKLFGVNLMENRDESMRKSLSMGN
Sbjct: 4   ESPRKCSHCGFNGHNSRTCGNFSKGN---SNNYYYCVKLFGVNLMENRDESMRKSLSMGN 63

Query: 61  LNLHSCNNVLDLNTAAAVYNVADDNSASA---DNAGYLSDGFIHNKRRKAAHERKKGKPW 120
           LNLHSCNNVLDLN    V NV  DN+A+A   D+AGYLSDG IHNKRRKAAHERKKGKPW
Sbjct: 64  LNLHSCNNVLDLNNTTTVNNVTGDNTAAAASTDDAGYLSDGLIHNKRRKAAHERKKGKPW 123

Query: 121 TEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASL 180
           +EEEHRTFL GLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKM+AN+KKKRRASL
Sbjct: 124 SEEEHRTFLIGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL 183

Query: 181 FDMPEIKNNVPQDCQASGETS----LPKNNSSENQQQVNNLPAQLINRFPHLCLDAPHFV 240
           FD+PEIKNN  +DC ASGE      LPKNNS +NQ QVNNL  QLINRFPHLCLD PHF+
Sbjct: 184 FDIPEIKNNFSRDCPASGELPSQILLPKNNSPDNQSQVNNLGTQLINRFPHLCLDTPHFI 243

Query: 241 PPATGSAASAAA-----------NLHGIPYV----SPRSPNAPMAGHPSGIPPSPRTSPS 300
           P  T  ++S ++           N + IP V    S  SPNA MA HPSGIP SPR+SP+
Sbjct: 244 PQQTNGSSSPSSIPFVVGVSPNNNNNNIPLVNIGRSRVSPNAAMAAHPSGIPHSPRSSPT 303

Query: 301 RPILQQLQPSAIAV--AASNFE--ADALELKIGLPQSPQTQNLSSQTSGAIRVI 329
           R +L Q   SA+A+  AAS F+  ADALELKIGLPQSPQ  NLSSQT GAIRVI
Sbjct: 304 RTLLMQPGASALAMAAAASTFDQSADALELKIGLPQSPQPNNLSSQTPGAIRVI 354

BLAST of Cp4.1LG14g05400 vs. NCBI nr
Match: gi|659107779|ref|XP_008453854.1| (PREDICTED: transcription factor MYB1R1 isoform X1 [Cucumis melo])

HSP 1 Score: 466.8 bits (1200), Expect = 3.0e-128
Identity = 259/356 (72.75%), Positives = 284/356 (79.78%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGN 60
           ESPRKCSHCGLNGHNSRTC NF KGNNNY    YYCVKLFGVNLMENRDESMRKSLSMGN
Sbjct: 4   ESPRKCSHCGLNGHNSRTCGNFSKGNNNY----YYCVKLFGVNLMENRDESMRKSLSMGN 63

Query: 61  LNLHS-CNNVLDLNT-AAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWT 120
           LNLHS CNNVLDLN    AV NV  DN+ASAD++GYLSDG IHNKRRKAAHERKKGKPW+
Sbjct: 64  LNLHSSCNNVLDLNNNTTAVNNVTGDNAASADDSGYLSDGLIHNKRRKAAHERKKGKPWS 123

Query: 121 EEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLF 180
           EEEHRTFL GLKKLGKGDWRGISKN+VTTRTPTQVASHAQKYFLRKM+AN+KKKRRASLF
Sbjct: 124 EEEHRTFLIGLKKLGKGDWRGISKNYVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLF 183

Query: 181 DMPEIKNNVPQDCQASGETS----LPKNNSSENQQQVNNLPAQLINRFPHLCLDAPHFVP 240
           D+PEIKNN  +DCQAS E      L KNNS +NQ QVNNL  QL+NRFPHLCLD PHF+P
Sbjct: 184 DIPEIKNNYSRDCQASSELPSQILLSKNNSLDNQPQVNNLQTQLVNRFPHLCLDTPHFIP 243

Query: 241 PATGSAASAAANLHGIPY---VSPR--------------SPNAPMAGHPSGIPPSPRTSP 300
           P T  ++S ++    IP+   VSP+              SPNA M  HPSGIPPSPR+SP
Sbjct: 244 PQTNGSSSPSS----IPFVVGVSPKNNIPLVNTGRSSRGSPNAAMVAHPSGIPPSPRSSP 303

Query: 301 SRPIL-QQLQPSAIAVAASNFEA----DALELKIGLPQSPQTQNLSSQTSGAIRVI 329
           +R +L QQ   SA+A+AA+   A    DALELKIGLPQSPQ +NLSSQTSGAIRVI
Sbjct: 304 TRTLLMQQPAASALAMAAAASSAFDQSDALELKIGLPQSPQPKNLSSQTSGAIRVI 351

BLAST of Cp4.1LG14g05400 vs. NCBI nr
Match: gi|659107781|ref|XP_008453855.1| (PREDICTED: transcription factor MYB1R1 isoform X2 [Cucumis melo])

HSP 1 Score: 449.9 bits (1156), Expect = 3.8e-123
Identity = 251/352 (71.31%), Positives = 277/352 (78.69%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGN 60
           ESPRKCSHCGLNGHNSRTC NF KGNNNY    YYCVKLFGVNLMENRDESMRKSLSMGN
Sbjct: 4   ESPRKCSHCGLNGHNSRTCGNFSKGNNNY----YYCVKLFGVNLMENRDESMRKSLSMGN 63

Query: 61  LNLHS-CNNVLDLNT-AAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWT 120
           LNLHS CNNVLDLN    AV NV  DN+ASAD++GYLSDG IHNKRRKAAHERKKGKPW+
Sbjct: 64  LNLHSSCNNVLDLNNNTTAVNNVTGDNAASADDSGYLSDGLIHNKRRKAAHERKKGKPWS 123

Query: 121 EEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLF 180
           EEEHRTFL GLKKLGKGDWRGISKN+VTTRTPTQVASHAQKYFLRKM+AN+KKKRRASLF
Sbjct: 124 EEEHRTFLIGLKKLGKGDWRGISKNYVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLF 183

Query: 181 DMPEIKNNVPQDCQASGETSLPKNNSSENQQQVNNLPAQLINRFPHLCLDAPHFVPPATG 240
           D+PEIK       +   +  L KNNS +NQ QVNNL  QL+NRFPHLCLD PHF+PP T 
Sbjct: 184 DIPEIK----ASSELPSQILLSKNNSLDNQPQVNNLQTQLVNRFPHLCLDTPHFIPPQTN 243

Query: 241 SAASAAANLHGIPY---VSPR--------------SPNAPMAGHPSGIPPSPRTSPSRPI 300
            ++S ++    IP+   VSP+              SPNA M  HPSGIPPSPR+SP+R +
Sbjct: 244 GSSSPSS----IPFVVGVSPKNNIPLVNTGRSSRGSPNAAMVAHPSGIPPSPRSSPTRTL 303

Query: 301 L-QQLQPSAIAVAASNFEA----DALELKIGLPQSPQTQNLSSQTSGAIRVI 329
           L QQ   SA+A+AA+   A    DALELKIGLPQSPQ +NLSSQTSGAIRVI
Sbjct: 304 LMQQPAASALAMAAAASSAFDQSDALELKIGLPQSPQPKNLSSQTSGAIRVI 343

BLAST of Cp4.1LG14g05400 vs. NCBI nr
Match: gi|764634901|ref|XP_011470039.1| (PREDICTED: myb-like protein J isoform X2 [Fragaria vesca subsp. vesca])

HSP 1 Score: 284.3 bits (726), Expect = 2.8e-73
Identity = 183/377 (48.54%), Positives = 223/377 (59.15%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGN 60
           ES RKCSHCG NGHNSRTC+    G NN  N+   C+KLFGVN+ME +++S++KS SMGN
Sbjct: 4   ESARKCSHCGHNGHNSRTCNLLGHGVNN--NNKGICLKLFGVNIMERQEDSIKKSYSMGN 63

Query: 61  LNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWTEE 120
           L             AA   +  ++ +  AD+AGYLSDG IHNK+RKAAHERKKG+PWTEE
Sbjct: 64  LQ------------AAGNGDQNNNINVGADDAGYLSDGLIHNKKRKAAHERKKGRPWTEE 123

Query: 121 EHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLFDM 180
           EHR FLAGLKKLGKGDWRGI++NFVTTRTPTQVASHAQKYFLR+   N+K+KRR SLFDM
Sbjct: 124 EHRVFLAGLKKLGKGDWRGIARNFVTTRTPTQVASHAQKYFLRQA-TNDKRKRRTSLFDM 183

Query: 181 -----------------PEIKNNVPQDCQAS--GETS--LPKNNSSENQQQVNNLPAQLI 240
                            P  K      C +S  G TS  LP+  +  N  Q N +P+Q++
Sbjct: 184 HFKELSEQYAHQDSPLSPTAKTAEASSCSSSSPGSTSKVLPQITTITNSPQAN-MPSQVL 243

Query: 241 NRFPHLCLDAPHFVPPATGSAASAAANLHGIPYVSPRSPNA------------------- 300
           NRFPHLCLD+P  V     S+ S       IPY+    P                     
Sbjct: 244 NRFPHLCLDSPPAVAQMPASSCSVPTYAPVIPYMQVGMPGTVPYTPAMHYGGPTFHYMLK 303

Query: 301 ----------PMAGHPSGIPPSPRTSPSRPILQQLQPSAIAVAASNFEADALELKIGLPQ 328
                     P+  HPSGIP S R+ PS P++   + ++     S  + DALELKIG PQ
Sbjct: 304 TQGGNFSTCQPVISHPSGIPSSSRSVPSSPLMGVHRTNS-----SITKKDALELKIGQPQ 359

BLAST of Cp4.1LG14g05400 vs. NCBI nr
Match: gi|470146824|ref|XP_004309020.1| (PREDICTED: myb-like protein J isoform X1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 282.7 bits (722), Expect = 8.1e-73
Identity = 184/376 (48.94%), Positives = 225/376 (59.84%), Query Frame = 1

Query: 1   ESPRKCSHCGLNGHNSRTCSNFLKGNNNYSNSSYYCVKLFGVNLMENRDESMRKSLSMGN 60
           ES RKCSHCG NGHNSRTC+    G NN  N+   C+KLFGVN+ME +++S++KS SMGN
Sbjct: 4   ESARKCSHCGHNGHNSRTCNLLGHGVNN--NNKGICLKLFGVNIMERQEDSIKKSYSMGN 63

Query: 61  LNLHSCNNVLDLNTAAAVYNVADDNSASADNAGYLSDGFIHNKRRKAAHERKKGKPWTEE 120
           L             AA   +  ++ +  AD+AGYLSDG IHNK+RKAAHERKKG+PWTEE
Sbjct: 64  LQ------------AAGNGDQNNNINVGADDAGYLSDGLIHNKKRKAAHERKKGRPWTEE 123

Query: 121 EHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMHANEKKKRRASLFDM 180
           EHR FLAGLKKLGKGDWRGI++NFVTTRTPTQVASHAQKYFLR+   N+K+KRR SLFDM
Sbjct: 124 EHRVFLAGLKKLGKGDWRGIARNFVTTRTPTQVASHAQKYFLRQA-TNDKRKRRTSLFDM 183

Query: 181 -----------------PEIKNNVPQDCQAS--GETS--LPKNNSSENQQQVNNLPAQLI 240
                            P  K      C +S  G TS  LP+  +  N  Q N +P+Q++
Sbjct: 184 HFKELSEQYAHQDSPLSPTAKTAEASSCSSSSPGSTSKVLPQITTITNSPQAN-MPSQVL 243

Query: 241 NRFPHLCLDAPHFVPPATGSAASAAANLHGIPYVS------PRSPNA------------- 300
           NRFPHLCLD+P  V     S+ S       IPY+       P +P               
Sbjct: 244 NRFPHLCLDSPPAVAQMPASSCSVPTYAPVIPYMVGMPGTVPYTPAMHYGGPTFHYMLKT 303

Query: 301 ---------PMAGHPSGIPPSPRTSPSRPILQQLQPSAIAVAASNFEADALELKIGLPQS 328
                    P+  HPSGIP S R+ PS P++   + ++     S  + DALELKIG PQ 
Sbjct: 304 QGGNFSTCQPVISHPSGIPSSSRSVPSSPLMGVHRTNS-----SITKKDALELKIGQPQP 358

The following BLAST results are available for this feature:

BLAST of Cp4.1LG14g05400 vs. Swiss-Prot
Match Name    E-value    Identity (%)    Description
MY1R1_SOLTU    2.8e-25    35.94    Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1
DIV_ANTMA    2.2e-22    50.43    Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA PE=2 SV=1
MYBI_DICDI    5.9e-07    35.96    Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1
MYBJ_DICDI    8.5e-06    27.53    Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1

BLAST of Cp4.1LG14g05400 vs. TrEMBL
Match Name    E-value    Identity (%)    Description
A0A0A0KXN0_CUCSA    2.1e-128    72.88    Uncharacterized protein OS=Cucumis sativus GN=Csa_4G025050 PE=4 SV=1
M5X281_PRUPE    1.4e-60    48.48    Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010092mg PE=4 SV=1
W9S9L1_9ROSA    1.3e-58    45.81    Transcription factor OS=Morus notabilis GN=L484_017072 PE=4 SV=1
A0A067JSE9_JATCU    3.4e-54    42.49    Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26305 PE=4 SV=1
A0A061G076_THECC    1.1e-52    45.00    Myb-like transcription factor family protein, putative isoform 2 OS=Theobroma ca...

BLAST of Cp4.1LG14g05400 vs. TAIR10
Match Name    E-value    Identity (%)    Description
AT5G61620.1    1.1e-48    44.05    myb-like transcription factor family protein
AT5G47390.1    9.5e-40    40.00    myb-like transcription factor family protein
AT1G70000.1    2.6e-37    50.28    myb-like transcription factor family protein
AT5G56840.1    1.0e-33    46.39    myb-like transcription factor family protein
AT3G16350.1    5.1e-33    42.44    Homeodomain-like superfamily protein

BLAST of Cp4.1LG14g05400 vs. NCBI nr
Match Name    E-value    Identity (%)    Description
gi|778697489|ref|XP_004146936.2|    3.0e-128    72.88    PREDICTED: myb-like protein I [Cucumis sativus]
gi|659107779|ref|XP_008453854.1|    3.0e-128    72.75    PREDICTED: transcription factor MYB1R1 isoform X1 [Cucumis melo]
gi|659107781|ref|XP_008453855.1|    3.8e-123    71.31    PREDICTED: transcription factor MYB1R1 isoform X2 [Cucumis melo]
gi|764634901|ref|XP_011470039.1|    2.8e-73    48.54    PREDICTED: myb-like protein J isoform X2 [Fragaria vesca subsp. vesca]
gi|470146824|ref|XP_004309020.1|    8.1e-73    48.94    PREDICTED: myb-like protein J isoform X1 [Fragaria vesca subsp. vesca]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term    Definition
GO:0003677    DNA binding
Vocabulary: INTERPRO
Term    Definition
IPR017930    Myb_dom
IPR009057    Homeobox-like_sf
IPR006447    Myb_dom_plants
IPR001005    SANT/Myb
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cp4.1LG14g05400.1    Cp4.1LG14g05400.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR001005    SANT/Myb domain    PFAM    PF00249    Myb_DNA-binding    coord: 115..160, score: 3.9
IPR001005    SANT/Myb domain    SMART    SM00717    sant    coord: 113..163, score: 6.3
IPR006447    Myb domain, plants    TIGRFAMs    TIGR01557    TIGR01557    coord: 112..164, score: 2.2
IPR009057    Homeodomain-like    GENE3D    G3DSA:1.10.10.60    -    coord: 113..160, score: 1.3
IPR009057    Homeodomain-like    unknown    SSF46689    Homeodomain-like    coord: 111..163, score: 1.04
IPR017930    Myb domain    PROFILE    PS51294    HTH_MYB    coord: 109..165, score: 20
None    No IPR available    PANTHER    PTHR12374    TRANSCRIPTIONAL ADAPTOR 2 ADA2 -RELATED    coord: 4..180, score: 6.0
None    No IPR available    PANTHER    PTHR12374:SF19    SUBFAMILY NOT NAMED    coord: 4..180, score: 6.0
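
The coord ranges in the InterPro table give the stretch of the protein that each signature covers. The sketch below shows how such a range could be cut out of a protein string; `extract_region` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive, which this page does not confirm.

```python
def extract_region(protein, start, end):
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"range {start}..{end} is outside the sequence")
    # Python slices are 0-based and end-exclusive, so shift only the start.
    return protein[start - 1:end]

# Toy sequence for illustration; substitute the protein sequence from this page.
print(extract_region("ABCDEFGHIJ", 3, 5))  # CDE
```

Under the same assumption, a range such as 115..160 would return 46 residues.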