Cp4.1LG01g03470 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g03470
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Descriptionmyb-like transcription factor family protein
LocationCp4.1LG01 : 1848853 .. 1851352 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACGGAGAGAGAGAGAGACGGAGAGAGAGAGAGAGAGAGTTGTGGTTTATTGATGAATCGATGAGCAAATTTCTTCCAATTCAAATTCTCTTTCTCTTTCTCGCAGCAGTTTTTGATTGAGAGAGAGAGGAGGAGAAAAAAAAAAAAAAAAGGAAGAAGGAAACCAGCCTTAGGGTTACCAGTTTTCTCTTTGGTTGTTTGTTTCTCGGGAAAATGGTGAGGGAATCGCCGAGAAAATGTTCGCATTGTGGATTGAATGGACATAATTCGCGAACATGCAGTAATTTTGCGAAGGGGAATAATAATAATAACTATTGCGTCAAATTGTTTGGAGTTAATTTGATGGATAATGGAGATGAATCTATGCGGAAGAGTCTCAGCATGGGGAATTTGAATATTCATGTATGTAATCACAACGCCGCTGCGGCTGGAAACAATGTCGCCTGTTATAATGCCGCCGTCGGTGACGACGGTGGCTATCTTTCCGATGGCCTTATTCATAACACGAAACGCAAAGCCGCTCTTGAGAGGAAGAAAGGTACGTAATCGTTTTTTTTCATTCAAGAAATTGCATCATAAATCCCTATGTATTCTGAGAAATTTGTGGAGAAAAAAGGAAAATCAAAGCGTCTGTATTAAATCTTAAGCCTATTAGTTTGATTAGCGGGAATGAATCTAACTCAATCGTTGATAATACGTATTAGGGAGAATTCGTAGATTGTACAGAGTATAAGTTTTGTGAGATAATGCAGGAAAGCCATGGACTGAGGAGGAACATAGAACCTTTTTGGCTGGTTTGAAGAAGCTGGGGAAAGGGGATTGGAGAGGAATTTCGAAGAACTTTGTGACGACACGAACACCGACGCAAGTAGCTAGTCATGCCCAAAAGTACTTTTTGAGGAAGATGAATGCGAACGATAAGAAGAAACGCAGAGCCAGTCTTTTCGACATCCCAGAAATCAAAGTAATAATTTGTTTTCTCAACTGCTTCTTGACGTGTGTCTCGATCAACTTCCCAAATGATTTTAGCCATATTTGTTTTAAAAACTTATAAACACCACTTCTATTCGGTAAGTTTCTTCGTTTTATTATCTACTTTGTGTCGATACCAATCTATTTTTCGAACGAAAAACGACCCGAAACTTCAAAAACTAAAAGTCGAAACTTAAATGGTTATCAAATCGGAGTTCTGCTGGAAAATTACTCTGAATATGCTTTTAACTGTTGTGTACATTGTGTTATGTTTATATATATATGTGTATAGAACAATGTTTCTCAAGAATGCCAAGCTTCATCATCTTCAATGAGGGCTAATGGAGAAACGTCACAAATTCTGCTGCCAAAGAACAGTTCTGAGAATCAACAACAGGTGAAATTCTCATCTCTTTGATCAATTTCTGTTTATAATCTGTAAGAAATGGGTGAATTCAGCTGTTTTTACTGTATATGAACATGGTTACAGGTCAGTAATTTACCAACACAGCTTATCAATCGATTCCCTCATCTTTGTTTGGATACTCCTCATTTCGTGCCTTCAACAACTGGATCACCAGCTTCTGGTGTTGCAAATTTGCATGGGATTCCTTATGTGGTAATTATTTCGTTCTAATCTCGGATTAAAAACTTTAGCGTTTCTTAGTCATAGTAAAGAAATCGTGAGAAAATAAGCATAATGAATGGCCGCTGGTTGGTAAGGCGAGAGCTTCAAGCTTGATTTCTGGTAGCGCACCTTACACACAAGCACCACCCAACGAAGGGCAGACGAACCTTCTTTCTAAATGGGGAGCCCAGAGCACCTCATGGTGAACCAAATATGAACATAACGAGACACGGACGACTCGGTGTCTCGTACGTAAAACAGAACAAGTAATGTAGGTGATGTAATGTAATGTTTGAATATGGTGATCAGGTGGGAGTTTCCCCTAATAATGTTCCAATAATCCCTTTGGTGAAGCTGGGAAGAAGATCATCAGCTGTTATGGCAATGCCGAAGAGCCCAATAAGGACAAGCAGTCCGAATGCAGCAGCAGCAGCAGCAGCAGCAGCGTTGGTAGCCCATCCATCTGGTATTCCTCCTTCTCCAAGGTCTTCCCCTCAAAGACCGTTGATGGTGCAGTTGCAGGCTTCTGCCATGGCAGCCTCTTTTGAGACAGATGCTCTAGAGCTCAAGATTGGACTGCCCCAATCTCCACAGGCTAAGAACTTGTCTTCTCAAACTTCTGGAGCCATTAGAGTTATATGAACCAACAAGAAAAAAGCAAAAATTGAAAATTAAAAAGAAATGTGAAATTTATCTTCTATAAGAAGGTTAATTAGAGTAAGAAAAAGTGTTATTTTTTTGAATGAGATGTTTGTTTTTGGCCTCCTCTCTATGCATGTAATAGGTAATAATTGATTTGTAAACCATATTCAATTCAATTTGATATTGTTTGCTTCAATCTTGTACGGATGGATTTT

mRNA sequence

AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACGGAGAGAGAGAGAGACGGAGAGAGAGAGAGAGAGAGTTGTGGTTTATTGATGAATCGATGAGCAAATTTCTTCCAATTCAAATTCTCTTTCTCTTTCTCGCAGCAGTTTTTGATTGAGAGAGAGAGGAGGAGAAAAAAAAAAAAAAAAGGAAGAAGGAAACCAGCCTTAGGGTTACCAGTTTTCTCTTTGGTTGTTTGTTTCTCGGGAAAATGGTGAGGGAATCGCCGAGAAAATGTTCGCATTGTGGATTGAATGGACATAATTCGCGAACATGCAGTAATTTTGCGAAGGGGAATAATAATAATAACTATTGCGTCAAATTGTTTGGAGTTAATTTGATGGATAATGGAGATGAATCTATGCGGAAGAGTCTCAGCATGGGGAATTTGAATATTCATGTATGTAATCACAACGCCGCTGCGGCTGGAAACAATGTCGCCTGTTATAATGCCGCCGTCGGTGACGACGGTGGCTATCTTTCCGATGGCCTTATTCATAACACGAAACGCAAAGCCGCTCTTGAGAGGAAGAAAGGAAAGCCATGGACTGAGGAGGAACATAGAACCTTTTTGGCTGGTTTGAAGAAGCTGGGGAAAGGGGATTGGAGAGGAATTTCGAAGAACTTTGTGACGACACGAACACCGACGCAAGTAGCTAGTCATGCCCAAAAGTACTTTTTGAGGAAGATGAATGCGAACGATAAGAAGAAACGCAGAGCCAGTCTTTTCGACATCCCAGAAATCAAAAACAATGTTTCTCAAGAATGCCAAGCTTCATCATCTTCAATGAGGGCTAATGGAGAAACGTCACAAATTCTGCTGCCAAAGAACAGTTCTGAGAATCAACAACAGGTCAGTAATTTACCAACACAGCTTATCAATCGATTCCCTCATCTTTGTTTGGATACTCCTCATTTCGTGCCTTCAACAACTGGATCACCAGCTTCTGGTGTTGCAAATTTGCATGGGATTCCTTATGTGGTGGGAGTTTCCCCTAATAATGTTCCAATAATCCCTTTGGTGAAGCTGGGAAGAAGATCATCAGCTGTTATGGCAATGCCGAAGAGCCCAATAAGGACAAGCAGTCCGAATGCAGCAGCAGCAGCAGCAGCAGCAGCGTTGGTAGCCCATCCATCTGGTATTCCTCCTTCTCCAAGGTCTTCCCCTCAAAGACCGTTGATGGTGCAGTTGCAGGCTTCTGCCATGGCAGCCTCTTTTGAGACAGATGCTCTAGAGCTCAAGATTGGACTGCCCCAATCTCCACAGGCTAAGAACTTGTCTTCTCAAACTTCTGGAGCCATTAGAGTTATATGAACCAACAAGAAAAAAGCAAAAATTGAAAATTAAAAAGAAATGTGAAATTTATCTTCTATAAGAAGGTTAATTAGAGTAAGAAAAAGTGTTATTTTTTTGAATGAGATGTTTGTTTTTGGCCTCCTCTCTATGCATGTAATAGGTAATAATTGATTTGTAAACCATATTCAATTCAATTTGATATTGTTTGCTTCAATCTTGTACGGATGGATTTT

Coding sequence (CDS)

ATGGTGAGGGAATCGCCGAGAAAATGTTCGCATTGTGGATTGAATGGACATAATTCGCGAACATGCAGTAATTTTGCGAAGGGGAATAATAATAATAACTATTGCGTCAAATTGTTTGGAGTTAATTTGATGGATAATGGAGATGAATCTATGCGGAAGAGTCTCAGCATGGGGAATTTGAATATTCATGTATGTAATCACAACGCCGCTGCGGCTGGAAACAATGTCGCCTGTTATAATGCCGCCGTCGGTGACGACGGTGGCTATCTTTCCGATGGCCTTATTCATAACACGAAACGCAAAGCCGCTCTTGAGAGGAAGAAAGGAAAGCCATGGACTGAGGAGGAACATAGAACCTTTTTGGCTGGTTTGAAGAAGCTGGGGAAAGGGGATTGGAGAGGAATTTCGAAGAACTTTGTGACGACACGAACACCGACGCAAGTAGCTAGTCATGCCCAAAAGTACTTTTTGAGGAAGATGAATGCGAACGATAAGAAGAAACGCAGAGCCAGTCTTTTCGACATCCCAGAAATCAAAAACAATGTTTCTCAAGAATGCCAAGCTTCATCATCTTCAATGAGGGCTAATGGAGAAACGTCACAAATTCTGCTGCCAAAGAACAGTTCTGAGAATCAACAACAGGTCAGTAATTTACCAACACAGCTTATCAATCGATTCCCTCATCTTTGTTTGGATACTCCTCATTTCGTGCCTTCAACAACTGGATCACCAGCTTCTGGTGTTGCAAATTTGCATGGGATTCCTTATGTGGTGGGAGTTTCCCCTAATAATGTTCCAATAATCCCTTTGGTGAAGCTGGGAAGAAGATCATCAGCTGTTATGGCAATGCCGAAGAGCCCAATAAGGACAAGCAGTCCGAATGCAGCAGCAGCAGCAGCAGCAGCAGCGTTGGTAGCCCATCCATCTGGTATTCCTCCTTCTCCAAGGTCTTCCCCTCAAAGACCGTTGATGGTGCAGTTGCAGGCTTCTGCCATGGCAGCCTCTTTTGAGACAGATGCTCTAGAGCTCAAGATTGGACTGCCCCAATCTCCACAGGCTAAGAACTTGTCTTCTCAAACTTCTGGAGCCATTAGAGTTATATGA

Protein sequence

MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDESMRKSLSMGNLNIHVCNHNAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFDIPEIKNNVSQECQASSSSMRANGETSQILLPKNSSENQQQVSNLPTQLINRFPHLCLDTPHFVPSTTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPKSPIRTSSPNAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQASAMAASFETDALELKIGLPQSPQAKNLSSQTSGAIRVI
BLAST of Cp4.1LG01g03470 vs. Swiss-Prot
Match: MY1R1_SOLTU (Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 2.3e-28
Identity = 84/200 (42.00%), Postives = 113/200 (56.50%), Query Frame = 1

Query: 36  VKLFGVNLMDNGDESMRKSLSMGNLNIHV---CNHNAAAAGNNVACYNAAVGDDGGYLSD 95
           + LFGV +     + MRKS+S+ +L+ +     N+N     NN +   + V  D GY S 
Sbjct: 24  IMLFGVRVKV---DPMRKSVSLNDLSQYEHPNANNNNNGGDNNES---SKVAQDEGYASA 83

Query: 96  GLIHNTKRKAALERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHA 155
                 +  +  ERK+G PWTEEEH+ FL GL+K+GKGDWRGIS+NFV TRTPTQVASHA
Sbjct: 84  DDAVQHQSNSGRERKRGVPWTEEEHKLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHA 143

Query: 156 QKYFLRKMNANDKKKRRASLFDIPEIKNNVSQECQASSSSMRANGETSQILLPKNSSENQ 215
           QKYFLR+ N N +++RR+SLFDI                        S  ++P    EN+
Sbjct: 144 QKYFLRRSNLN-RRRRRSSLFDIT---------------------TDSVSVMPIEEVENK 195

Query: 216 QQV-----SNLPTQLINRFP 228
           Q++     + LPT   N FP
Sbjct: 204 QEIPVVAPATLPTTKTNAFP 195

BLAST of Cp4.1LG01g03470 vs. Swiss-Prot
Match: DIV_ANTMA (Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA PE=2 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 1.0e-23
Identity = 60/104 (57.69%), Postives = 74/104 (71.15%), Query Frame = 1

Query: 84  GDDG---GYLSDGLIHNTKRKAALERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFV 143
           G DG    Y + G   ++ R +  ERKKG PWTEEEH+ FL GLKK GKGDWR IS+NFV
Sbjct: 103 GFDGFKQSYGTGGRKSSSGRPSEQERKKGVPWTEEEHKLFLMGLKKYGKGDWRNISRNFV 162

Query: 144 TTRTPTQVASHAQKYFLRKMNANDKKKRRASLFDIPEIKNNVSQ 185
            TRTPTQVASHAQKYF+R+++   K KRRAS+ DI  +  + +Q
Sbjct: 163 ITRTPTQVASHAQKYFIRQLSGG-KDKRRASIHDITTVNLSDNQ 205

BLAST of Cp4.1LG01g03470 vs. Swiss-Prot
Match: MYBJ_DICDI (Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 3.9e-07
Identity = 74/254 (29.13%), Postives = 112/254 (44.09%), Query Frame = 1

Query: 18  NSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDESMRKSLSMGNLNIHVCNHNAAAAGNNVA 77
           NS T S  +  +N +        V++++N + +   S +  N N +  N+N     N   
Sbjct: 301 NSPTTSTSSTHSNTSTPIT----VSIINNNNNNNSNSNNNNNNNNNNNNNNT---NNTTT 360

Query: 78  CYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISK 137
               A    GG  +      T +K +L  K+G  WT+EEH  FL G++  GKG W+ I++
Sbjct: 361 TTTTATTTSGGKTNP-----TGKKTSL--KQG--WTKEEHIRFLNGIQIHGKGAWKEIAQ 420

Query: 138 NFVTTRTPTQVASHAQKYFLRKMNANDKKK--RRASLFDIPE------IKNNV------- 197
            FV TRTPTQ+ SHAQKY+LR+      K+     SL D+ +       KNNV       
Sbjct: 421 -FVGTRTPTQIQSHAQKYYLRQKQETKNKRSIHDLSLQDLIDDNLNNSNKNNVDKNKQDD 480

Query: 198 ----SQECQASSSSMRANGETSQILLPKNSSE--NQQQVSNLPTQLINRF---PHLCLDT 247
               +Q+ + + S     G+   I   +   +   QQ     P  +I  F   P     +
Sbjct: 481 KEKKTQKTKKTKSKSSTKGDEEMITQQQQLQQQPQQQPQQKQPPTIITNFNTTPTSSQSS 537

BLAST of Cp4.1LG01g03470 vs. Swiss-Prot
Match: MYBI_DICDI (Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 1.1e-06
Identity = 43/134 (32.09%), Postives = 68/134 (50.75%), Query Frame = 1

Query: 99  KRKAALERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFL- 158
           + K + ++K+ + WT EEH  F+  L K G  D + IS+ +V+TR PTQV +HAQKYFL 
Sbjct: 162 QEKQSEKKKQSRYWTPEEHSRFIEALSKYGHKDVKSISQ-YVSTRNPTQVRTHAQKYFLR 221

Query: 159 ------RKMNANDKKKRRASLFD--IPEIKNNVSQECQASSSSMR------ANGETSQIL 218
                 RK+ + +     A   D  + E  N+     Q SS S        AN  ++ ++
Sbjct: 222 IDRERGRKLESKESINGGADKDDDWLREEYNDEGSPTQYSSCSNSPTTNSVANPFSNSLI 281

BLAST of Cp4.1LG01g03470 vs. TrEMBL
Match: A0A0A0KXN0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G025050 PE=4 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 6.6e-123
Identity = 266/385 (69.09%), Postives = 287/385 (74.55%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNY-CVKLFGVNLMDNGDESMRKSLSMGN 60
           MVRESPRKCSHCG NGHNSRTC NF+KGN+NN Y CVKLFGVNLM+N DESMRKSLSMGN
Sbjct: 1   MVRESPRKCSHCGFNGHNSRTCGNFSKGNSNNYYYCVKLFGVNLMENRDESMRKSLSMGN 60

Query: 61  LNIHVCNH----NAAAAGNNVACYNAAVG---DDGGYLSDGLIHNTKRKAALERKKGKPW 120
           LN+H CN+    N     NNV   N A     DD GYLSDGLIHN +RKAA ERKKGKPW
Sbjct: 61  LNLHSCNNVLDLNNTTTVNNVTGDNTAAAASTDDAGYLSDGLIHNKRRKAAHERKKGKPW 120

Query: 121 TEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL 180
           +EEEHRTFL GLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL
Sbjct: 121 SEEEHRTFLIGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL 180

Query: 181 FDIPEIKNNVSQECQASSSSMRANGE-TSQILLPKNSS-ENQQQVSNLPTQLINRFPHLC 240
           FDIPEIKNN S++C AS       GE  SQILLPKN+S +NQ QV+NL TQLINRFPHLC
Sbjct: 181 FDIPEIKNNFSRDCPAS-------GELPSQILLPKNNSPDNQSQVNNLGTQLINRFPHLC 240

Query: 241 LDTPHFVPSTTGSPASGVANLHGIPYVVGVSP-NNVPIIPLVKLGRRSSAVMAMPKSPIR 300
           LDTPHF+P  T    +G ++   IP+VVGVSP NN   IPLV +GR              
Sbjct: 241 LDTPHFIPQQT----NGSSSPSSIPFVVGVSPNNNNNNIPLVNIGR-------------S 300

Query: 301 TSSPNAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQAS--AMAASFET-----DALE 360
             SPNAA A       AHPSGIP SPRSSP R L++Q  AS  AMAA+  T     DALE
Sbjct: 301 RVSPNAAMA-------AHPSGIPHSPRSSPTRTLLMQPGASALAMAAAASTFDQSADALE 354

Query: 361 LKIGLPQSPQAKNLSSQTSGAIRVI 368
           LKIGLPQSPQ  NLSSQT GAIRVI
Sbjct: 361 LKIGLPQSPQPNNLSSQTPGAIRVI 354

BLAST of Cp4.1LG01g03470 vs. TrEMBL
Match: W9S9L1_9ROSA (Transcription factor OS=Morus notabilis GN=L484_017072 PE=4 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 3.8e-70
Identity = 183/385 (47.53%), Postives = 231/385 (60.00%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFA-------KGNNNNNYCVKLFGVNLMDNGDESMRK 60
           MV+E+ RKCSHCG  GHNSRTC+N         +  N N +  KLFGVN+++  ++SM+K
Sbjct: 1   MVKEAQRKCSHCGQQGHNSRTCNNRNNVVSPRNRNGNGNGHSFKLFGVNIVEGAEDSMKK 60

Query: 61  SLSMGNLNIHVCNHNAAAAGNNVACYNAAVGDD-------GGYLSDGLIHNTKRKAALER 120
           S SMGNL        AA +G        + GDD        GYLSDG++HN KRKAA ER
Sbjct: 61  SRSMGNL--------AALSGGQGKNDVVSGGDDHDDHDPEAGYLSDGVLHNVKRKAARER 120

Query: 121 KKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKK 180
           K+GKPWTEEEHRTFLAG+ KLGKGDWRGIS  FVTTRTPTQVASHAQKYF+RK    D+K
Sbjct: 121 KRGKPWTEEEHRTFLAGMNKLGKGDWRGISSKFVTTRTPTQVASHAQKYFIRKQAPKDRK 180

Query: 181 KRRASLFDIPEIKNN--VSQECQASSSSMRANGETSQILLPKNSSENQQQVSNLPTQLIN 240
           KRR+SLFD+P  +++  V  +    SS ++   ETS        S +Q    N P++++N
Sbjct: 181 KRRSSLFDMPFTQSDDVVPPQGHQVSSMIKPTAETSS-----RPSRSQGLAENNPSEILN 240

Query: 241 RFPHLCLDTPHFVPSTTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMP 300
           RFP LCLD  + VP   G     VA  + +PY VG  P NV   PL+ +GR S       
Sbjct: 241 RFPQLCLDN-YPVPVRPG-----VAPNYLVPYGVGF-PGNVQPWPLINIGRPSYLY---- 300

Query: 301 KSPIRTSSPNAAAAA--AAAALVAHPSGIPPSPRSSPQRPLMVQLQASAMAASFETDALE 360
                 + PN   +       +V HPSGIPP PRS P  P  ++  AS  +++ +T+ LE
Sbjct: 301 ------NVPNVYGSVDNGVPVVVTHPSGIPP-PRSPPTSP-SIKAAASKDSSADQTNGLE 353

Query: 361 LKIGLPQSPQAKNLSSQ-TSGAIRV 367
           L IG PQS Q   LSS   SGAI+V
Sbjct: 361 LTIGQPQSKQGSELSSSPASGAIKV 353

BLAST of Cp4.1LG01g03470 vs. TrEMBL
Match: A0A067JSE9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26305 PE=4 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 3.3e-66
Identity = 182/382 (47.64%), Postives = 226/382 (59.16%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDESMRKSLSMGNL 60
           MV+E  RKCSHCG NGHNSRTC N  KG       VKLFGVN+ +  ++ M+KS S+GNL
Sbjct: 1   MVKEGTRKCSHCGQNGHNSRTC-NHGKGGAG----VKLFGVNIFEKQEQPMKKSASLGNL 60

Query: 61  NIHVCNHNAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTEEEHRTF 120
              + N             NA    D GYLSDG I+  + KAA ERKKGKPWTEEEHRTF
Sbjct: 61  ESLIDN-------------NAHHHVDEGYLSDGYINFKRGKAANERKKGKPWTEEEHRTF 120

Query: 121 LAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD--IPEI 180
           LAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKY+LR+ +A D KKRR+SLFD  + E 
Sbjct: 121 LAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYYLRQASA-DIKKRRSSLFDMTLKEP 180

Query: 181 KNNVSQE---CQASSSSMRANGETSQILLPKNSSENQQQVSNLPTQLINRFPHLCLDTPH 240
           K   SQE     ++SSS      +S + LP  ++ +    +    Q++NRFPHLCLDTP 
Sbjct: 181 KVLTSQERPILPSNSSSQVKQASSSSLALPLRTTTDIPARAITSAQILNRFPHLCLDTPA 240

Query: 241 FVPSTTGSPASGVANLHGIPYVVG--------VSPNNVPIIPLVKLGRRSSAVMAMPKSP 300
             P+     A+ V N  GIPY++G        +        PL+++   + + +A P  P
Sbjct: 241 IGPAYL---ATSVPNYTGIPYMLGFPDDGRSFMGARTASAAPLLQMMHYNYSRLAYPFPP 300

Query: 301 IRTSSPNAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQASAMAASFETDALELKIG- 360
               +     AA    L AHPSGI P+PRS P    + Q  +++         L+LKIG 
Sbjct: 301 ----NSQGRFAATCVPLTAHPSGI-PAPRSYP-LGFLQQGSSTSPTKKENPPELDLKIGP 354

Query: 361 -LPQSPQAKNLSSQTSGAIRVI 368
             PQSPQ  +LS Q SG I VI
Sbjct: 361 PPPQSPQEASLSPQASGPISVI 354

BLAST of Cp4.1LG01g03470 vs. TrEMBL
Match: A0A061G0P9_THECC (Myb-like transcription factor family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_014838 PE=4 SV=1)

HSP 1 Score: 231.5 bits (589), Expect = 1.7e-57
Identity = 171/369 (46.34%), Postives = 206/369 (55.83%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNL--MDNGDESMRKSLSMG 60
           MV+E+ RKCSHCG NGHNSRTC    KG      CVKLFGVN+  ++  +  M+KS SM 
Sbjct: 23  MVKETGRKCSHCGHNGHNSRTCHG--KG------CVKLFGVNISAVEKQESFMKKSFSME 82

Query: 61  NLNIHVCNHNAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTEEEHR 120
           +L  H   +N           N A   D GYLSDG IH+ K  AA ERK+GKPWTEEEHR
Sbjct: 83  SLRSHHAEYN-----------NNAPSVDDGYLSDGQIHSRKSNAARERKRGKPWTEEEHR 142

Query: 121 TFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFDIPEI 180
            FLAGL+KLGKGDWRGISK FVTTRTPTQVASHAQKYFLR+   NDKKKRR SLFD+   
Sbjct: 143 IFLAGLRKLGKGDWRGISKKFVTTRTPTQVASHAQKYFLRQA-GNDKKKRRPSLFDM--- 202

Query: 181 KNNVSQECQASSSSMRANGETSQILLPKNSSENQQQVSNLPTQLINRFPHLCLDTPHFVP 240
                QE ++++S   +  E       + + ++  QV   P+ + NRFPHLCLD    V 
Sbjct: 203 ---AFQELESNASPPGSPAE-------QTTRDSSDQV-KAPSPIANRFPHLCLD-DRPVT 262

Query: 241 STTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPKSPIRTSSPNAA-- 300
           S T S  S     H I  + G +PN   + P  K+         MP  P   +   A   
Sbjct: 263 SLTAS-HSFPTYYHRIQPLAGGAPNG-QVFPEAKM---------MPSLPFLHAMNYAGLH 322

Query: 301 ----AAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQASAMAASFETDALELKIGLPQSPQ 360
               A A   A  AHPSGIP     SP        +A   A+  E D LELKIG PQS +
Sbjct: 323 YGYMAKALGCAPAAHPSGIP-----SPWSVQHSMFRAGPGASPAEKDLLELKIGPPQSSK 340

Query: 361 AKNLSSQTS 362
             ++ SQ S
Sbjct: 383 NTSMLSQAS 340

BLAST of Cp4.1LG01g03470 vs. TrEMBL
Match: A0A061G076_THECC (Myb-like transcription factor family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_014838 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 4.8e-57
Identity = 169/369 (45.80%), Postives = 204/369 (55.28%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNL--MDNGDESMRKSLSMG 60
           MV+E+ RKCSHCG NGHNSRTC    KG      CVKLFGVN+  ++  +  M+KS SM 
Sbjct: 1   MVKETGRKCSHCGHNGHNSRTCHG--KG------CVKLFGVNISAVEKQESFMKKSFSME 60

Query: 61  NLNIHVCNHNAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTEEEHR 120
           +L  H   +N           N A   D GYLSDG IH+ K  AA ERK+GKPWTEEEHR
Sbjct: 61  SLRSHHAEYN-----------NNAPSVDDGYLSDGQIHSRKSNAARERKRGKPWTEEEHR 120

Query: 121 TFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFDIPEI 180
            FLAGL+KLGKGDWRGISK FVTTRTPTQVASHAQKYFLR+   NDKKKRR SLFD+   
Sbjct: 121 IFLAGLRKLGKGDWRGISKKFVTTRTPTQVASHAQKYFLRQA-GNDKKKRRPSLFDM--- 180

Query: 181 KNNVSQECQASSSSMRANGETSQILLPKNSSENQQQVSNLPTQLINRFPHLCLDTPHFVP 240
                QE ++++S   +  E       + + ++  QV   P+ + NRFPHLCLD    V 
Sbjct: 181 ---AFQELESNASPPGSPAE-------QTTRDSSDQV-KAPSPIANRFPHLCLD-DRPVT 240

Query: 241 STTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPKSPIRTSSPNAA-- 300
           S T S +         P   G +PN   + P  K+         MP  P   +   A   
Sbjct: 241 SLTASHSFPTYYHRIQPLQAGGAPNG-QVFPEAKM---------MPSLPFLHAMNYAGLH 300

Query: 301 ----AAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQASAMAASFETDALELKIGLPQSPQ 360
               A A   A  AHPSGIP     SP        +A   A+  E D LELKIG PQS +
Sbjct: 301 YGYMAKALGCAPAAHPSGIP-----SPWSVQHSMFRAGPGASPAEKDLLELKIGPPQSSK 319

Query: 361 AKNLSSQTS 362
             ++ SQ S
Sbjct: 361 NTSMLSQAS 319

BLAST of Cp4.1LG01g03470 vs. TAIR10
Match: AT5G61620.1 (AT5G61620.1 myb-like transcription factor family protein)

HSP 1 Score: 194.5 bits (493), Expect = 1.1e-49
Identity = 156/379 (41.16%), Postives = 199/379 (52.51%), Query Frame = 1

Query: 1   MVRES---PRKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDE-----SMR 60
           MV+E+    + CSHCG NGHN+RTC N       N   VKLFGVN+  +        ++R
Sbjct: 1   MVKETVTVAKTCSHCGHNGHNARTCLNGV-----NKASVKLFGVNISSDPIRPPEVTALR 60

Query: 61  KSLSMGNLNIHVCNHNAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPW 120
           KSLS+GNL+  + N  +  +G+ +A       DD GY SDG IH+ K K A E+KKGKPW
Sbjct: 61  KSLSLGNLDALLANDESNGSGDPIAAV-----DDTGYHSDGQIHSKKGKTAHEKKKGKPW 120

Query: 121 TEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL 180
           TEEEHR FL GL KLGKGDWRGI+K+FV+TRTPTQVASHAQKYF+R +N NDK+KRRASL
Sbjct: 121 TEEEHRNFLIGLNKLGKGDWRGIAKSFVSTRTPTQVASHAQKYFIR-LNVNDKRKRRASL 180

Query: 181 FDIPEIKNNVSQECQASSSSMRANGETSQILLPKNSSENQQQ---VSNLPTQLINRFPHL 240
           FDI     ++  + +   +S  A+ +T     PK      QQ     +  T++ NRF +L
Sbjct: 181 FDI-----SLEDQKEKERNSQDASTKTP----PKQPITGIQQPVVQGHTQTEISNRFQNL 240

Query: 241 CLDTPHFVPSTTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPKSPIR 300
            ++   ++P     P          PY            P +          A P+ P+R
Sbjct: 241 SME---YMPIYQPIP----------PYY---------NFPPIMYHPNYPMYYANPQVPVR 300

Query: 301 TSSPNAAAAAAAAALVAHPSGIPPSPRSSP-QRPLMVQLQASAMAASFETDALELKIGLP 360
                            HPSGI P PR  P   PL    +AS M      D L+L IGLP
Sbjct: 301 F---------------VHPSGI-PVPRHIPIGLPLSQPSEASNMT---NKDGLDLHIGLP 316

Query: 361 QSPQAKNLSSQTS-GAIRV 367
             PQA   S  T  G I V
Sbjct: 361 --PQATGASDLTGHGVIHV 316

BLAST of Cp4.1LG01g03470 vs. TAIR10
Match: AT5G47390.1 (AT5G47390.1 myb-like transcription factor family protein)

HSP 1 Score: 157.1 bits (396), Expect = 2.0e-38
Identity = 94/186 (50.54%), Postives = 121/186 (65.05%), Query Frame = 1

Query: 7   RKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDESMRKSLSMGNLNIHVCN 66
           R+CSHC  NGHNSRTC N           VKLFGV L +    S+RKS SMGNL+ +  +
Sbjct: 3   RRCSHCNHNGHNSRTCPNRG---------VKLFGVRLTEG---SIRKSASMGNLSHYTGS 62

Query: 67  HNAA-AAGNNVACYNAAVGDD---GGYLSDGLIHNTKRKAALERKKGKPWTEEEHRTFLA 126
            +     G+N       V D     GY S+  +  +   ++ ERKKG PWTEEEHR FL 
Sbjct: 63  GSGGHGTGSNTPGSPGDVPDHVAGDGYASEDFVAGSS--SSRERKKGTPWTEEEHRMFLL 122

Query: 127 GLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD-IPEIKNN 186
           GL+KLGKGDWRGIS+N+VTTRTPTQVASHAQKYF+R+ N + ++KRR+SLFD +P+   +
Sbjct: 123 GLQKLGKGDWRGISRNYVTTRTPTQVASHAQKYFIRQSNVS-RRKRRSSLFDMVPDEVGD 173

Query: 187 VSQECQ 188
           +  + Q
Sbjct: 183 IPMDLQ 173

BLAST of Cp4.1LG01g03470 vs. TAIR10
Match: AT1G70000.1 (AT1G70000.1 myb-like transcription factor family protein)

HSP 1 Score: 156.8 bits (395), Expect = 2.6e-38
Identity = 91/175 (52.00%), Postives = 109/175 (62.29%), Query Frame = 1

Query: 7   RKCSHCGLNGHNSRTCSNFAKGNNNNN------YCVKLFGVNLMDNGDESMRKSLSMGNL 66
           R CS CG NGHNSRTC        +NN        + LFGV + +      RKS+SM NL
Sbjct: 3   RSCSQCGNNGHNSRTCPTDITTTGDNNDKGGGEKAIMLFGVRVTEASSSCFRKSVSMNNL 62

Query: 67  NIHVCNHNAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTEEEHRTF 126
           +      +     N          DDGGY SD ++H + R    ERK+G PWTEEEHR F
Sbjct: 63  S----QFDQTPDPNPT--------DDGGYASDDVVHASGRNR--ERKRGTPWTEEEHRLF 122

Query: 127 LAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFDI 176
           L GL K+GKGDWRGIS+NFV TRTPTQVASHAQKYFLR+ N N +++RR+SLFDI
Sbjct: 123 LTGLHKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLRRTNQN-RRRRRSSLFDI 162

BLAST of Cp4.1LG01g03470 vs. TAIR10
Match: AT5G56840.1 (AT5G56840.1 myb-like transcription factor family protein)

HSP 1 Score: 144.8 bits (364), Expect = 1.0e-34
Identity = 88/186 (47.31%), Postives = 109/186 (58.60%), Query Frame = 1

Query: 7   RKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDE------------SMRKS 66
           R+CSHCG  GHNSRTCS++          V+LFGV+L                  +++KS
Sbjct: 3   RRCSHCGNVGHNSRTCSSY------QTRVVRLFGVHLDTTSSSPPPPPPPSILAAAIKKS 62

Query: 67  LSMGNLNIHVCNHNAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTE 126
            SM  L    C+ ++++                GYLSDGL H T      +RKKG PWT 
Sbjct: 63  FSMDCLP--ACSSSSSSFA--------------GYLSDGLAHKTP-----DRKKGVPWTA 122

Query: 127 EEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD 181
           EEHRTFL GL+KLGKGDWRGIS+NFV T++PTQVASHAQKYFLR+      K+RR SLFD
Sbjct: 123 EEHRTFLIGLEKLGKGDWRGISRNFVVTKSPTQVASHAQKYFLRQTTTLHHKRRRTSLFD 161

BLAST of Cp4.1LG01g03470 vs. TAIR10
Match: AT3G16350.1 (AT3G16350.1 Homeodomain-like superfamily protein)

HSP 1 Score: 141.4 bits (355), Expect = 1.1e-33
Identity = 101/258 (39.15%), Postives = 139/258 (53.88%), Query Frame = 1

Query: 7   RKCSHCGLNGHNSRTC---------------SNFAKGNNNNNYCVKLFGVNLMDNGDESM 66
           R+CSHC  NGHNSRTC                    G + ++  VKLFGV L D     +
Sbjct: 3   RRCSHCSNNGHNSRTCPTRGGGTCGGSGGGGGGGGGGGSGSSSAVKLFGVRLTDGS--II 62

Query: 67  RKSLSMGNLNI-----HVCNHNAAAAGNNVACYN------------AAVGDDGGYLSDGL 126
           +KS SMGNL+          H+  +  + +A  N            + +  + GYLSD  
Sbjct: 63  KKSASMGNLSALAVAAAAATHHRLSPSSPLATSNLNDSPLSDHARYSNLHHNEGYLSDDP 122

Query: 127 IHNTKRKAAL-ERKKGKPWTEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQ 186
            H +       ERK+G PWTEEEHR FL GL+KLGKGDWRGIS+N+VT+RTPTQVASHAQ
Sbjct: 123 AHGSGSSHRRGERKRGVPWTEEEHRLFLVGLQKLGKGDWRGISRNYVTSRTPTQVASHAQ 182

Query: 187 KYFLRKMNANDKKKRRASLFDIPE----IKNNVSQECQASSSSMRANGETSQILLP---- 222
           KYF+R   ++ ++KRR+SLFD+        ++ +QE Q  + S  +     +  LP    
Sbjct: 183 KYFIRH-TSSSRRKRRSSLFDMVTDEMVTDSSPTQEEQTLNGSSPSKEPEKKSYLPSLEL 242

BLAST of Cp4.1LG01g03470 vs. NCBI nr
Match: gi|659107779|ref|XP_008453854.1| (PREDICTED: transcription factor MYB1R1 isoform X1 [Cucumis melo])

HSP 1 Score: 464.9 bits (1195), Expect = 1.3e-127
Identity = 265/382 (69.37%), Postives = 291/382 (76.18%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDESMRKSLSMGNL 60
           MVRESPRKCSHCGLNGHNSRTC NF+KGNNN  YCVKLFGVNLM+N DESMRKSLSMGNL
Sbjct: 1   MVRESPRKCSHCGLNGHNSRTCGNFSKGNNNYYYCVKLFGVNLMENRDESMRKSLSMGNL 60

Query: 61  NIHV-CNH-----NAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTE 120
           N+H  CN+     N   A NNV   NAA  DD GYLSDGLIHN +RKAA ERKKGKPW+E
Sbjct: 61  NLHSSCNNVLDLNNNTTAVNNVTGDNAASADDSGYLSDGLIHNKRRKAAHERKKGKPWSE 120

Query: 121 EEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD 180
           EEHRTFL GLKKLGKGDWRGISKN+VTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD
Sbjct: 121 EEHRTFLIGLKKLGKGDWRGISKNYVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD 180

Query: 181 IPEIKNNVSQECQASSSSMRANGETSQILLPKNSS-ENQQQVSNLPTQLINRFPHLCLDT 240
           IPEIKNN S++CQASS         SQILL KN+S +NQ QV+NL TQL+NRFPHLCLDT
Sbjct: 181 IPEIKNNYSRDCQASSEL------PSQILLSKNNSLDNQPQVNNLQTQLVNRFPHLCLDT 240

Query: 241 PHFVPSTTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPKSPIRTSSP 300
           PHF+P  T    +G ++   IP+VVGVSP N   IPLV  GR S              SP
Sbjct: 241 PHFIPPQT----NGSSSPSSIPFVVGVSPKN--NIPLVNTGRSS------------RGSP 300

Query: 301 NAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQ--------LQASAMAASFETDALELKI 360
           N       AA+VAHPSGIPPSPRSSP R L++Q        + A+A +A  ++DALELKI
Sbjct: 301 N-------AAMVAHPSGIPPSPRSSPTRTLLMQQPAASALAMAAAASSAFDQSDALELKI 351

Query: 361 GLPQSPQAKNLSSQTSGAIRVI 368
           GLPQSPQ KNLSSQTSGAIRVI
Sbjct: 361 GLPQSPQPKNLSSQTSGAIRVI 351

BLAST of Cp4.1LG01g03470 vs. NCBI nr
Match: gi|778697489|ref|XP_004146936.2| (PREDICTED: myb-like protein I [Cucumis sativus])

HSP 1 Score: 448.7 bits (1153), Expect = 9.5e-123
Identity = 266/385 (69.09%), Postives = 287/385 (74.55%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNY-CVKLFGVNLMDNGDESMRKSLSMGN 60
           MVRESPRKCSHCG NGHNSRTC NF+KGN+NN Y CVKLFGVNLM+N DESMRKSLSMGN
Sbjct: 1   MVRESPRKCSHCGFNGHNSRTCGNFSKGNSNNYYYCVKLFGVNLMENRDESMRKSLSMGN 60

Query: 61  LNIHVCNH----NAAAAGNNVACYNAAVG---DDGGYLSDGLIHNTKRKAALERKKGKPW 120
           LN+H CN+    N     NNV   N A     DD GYLSDGLIHN +RKAA ERKKGKPW
Sbjct: 61  LNLHSCNNVLDLNNTTTVNNVTGDNTAAAASTDDAGYLSDGLIHNKRRKAAHERKKGKPW 120

Query: 121 TEEEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL 180
           +EEEHRTFL GLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL
Sbjct: 121 SEEEHRTFLIGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASL 180

Query: 181 FDIPEIKNNVSQECQASSSSMRANGE-TSQILLPKNSS-ENQQQVSNLPTQLINRFPHLC 240
           FDIPEIKNN S++C AS       GE  SQILLPKN+S +NQ QV+NL TQLINRFPHLC
Sbjct: 181 FDIPEIKNNFSRDCPAS-------GELPSQILLPKNNSPDNQSQVNNLGTQLINRFPHLC 240

Query: 241 LDTPHFVPSTTGSPASGVANLHGIPYVVGVSP-NNVPIIPLVKLGRRSSAVMAMPKSPIR 300
           LDTPHF+P  T    +G ++   IP+VVGVSP NN   IPLV +GR              
Sbjct: 241 LDTPHFIPQQT----NGSSSPSSIPFVVGVSPNNNNNNIPLVNIGR-------------S 300

Query: 301 TSSPNAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQAS--AMAASFET-----DALE 360
             SPNAA A       AHPSGIP SPRSSP R L++Q  AS  AMAA+  T     DALE
Sbjct: 301 RVSPNAAMA-------AHPSGIPHSPRSSPTRTLLMQPGASALAMAAAASTFDQSADALE 354

Query: 361 LKIGLPQSPQAKNLSSQTSGAIRVI 368
           LKIGLPQSPQ  NLSSQT GAIRVI
Sbjct: 361 LKIGLPQSPQPNNLSSQTPGAIRVI 354

BLAST of Cp4.1LG01g03470 vs. NCBI nr
Match: gi|659107781|ref|XP_008453855.1| (PREDICTED: transcription factor MYB1R1 isoform X2 [Cucumis melo])

HSP 1 Score: 446.8 bits (1148), Expect = 3.6e-122
Identity = 259/382 (67.80%), Postives = 285/382 (74.61%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNNYCVKLFGVNLMDNGDESMRKSLSMGNL 60
           MVRESPRKCSHCGLNGHNSRTC NF+KGNNN  YCVKLFGVNLM+N DESMRKSLSMGNL
Sbjct: 1   MVRESPRKCSHCGLNGHNSRTCGNFSKGNNNYYYCVKLFGVNLMENRDESMRKSLSMGNL 60

Query: 61  NIHV-CNH-----NAAAAGNNVACYNAAVGDDGGYLSDGLIHNTKRKAALERKKGKPWTE 120
           N+H  CN+     N   A NNV   NAA  DD GYLSDGLIHN +RKAA ERKKGKPW+E
Sbjct: 61  NLHSSCNNVLDLNNNTTAVNNVTGDNAASADDSGYLSDGLIHNKRRKAAHERKKGKPWSE 120

Query: 121 EEHRTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD 180
           EEHRTFL GLKKLGKGDWRGISKN+VTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD
Sbjct: 121 EEHRTFLIGLKKLGKGDWRGISKNYVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFD 180

Query: 181 IPEIKNNVSQECQASSSSMRANGETSQILLPKNSS-ENQQQVSNLPTQLINRFPHLCLDT 240
           IPEIK         +SS +      SQILL KN+S +NQ QV+NL TQL+NRFPHLCLDT
Sbjct: 181 IPEIK---------ASSEL-----PSQILLSKNNSLDNQPQVNNLQTQLVNRFPHLCLDT 240

Query: 241 PHFVPSTTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPKSPIRTSSP 300
           PHF+P  T    +G ++   IP+VVGVSP N   IPLV  GR S              SP
Sbjct: 241 PHFIPPQT----NGSSSPSSIPFVVGVSPKN--NIPLVNTGRSS------------RGSP 300

Query: 301 NAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQ--------LQASAMAASFETDALELKI 360
           N       AA+VAHPSGIPPSPRSSP R L++Q        + A+A +A  ++DALELKI
Sbjct: 301 N-------AAMVAHPSGIPPSPRSSPTRTLLMQQPAASALAMAAAASSAFDQSDALELKI 343

Query: 361 GLPQSPQAKNLSSQTSGAIRVI 368
           GLPQSPQ KNLSSQTSGAIRVI
Sbjct: 361 GLPQSPQPKNLSSQTSGAIRVI 343

BLAST of Cp4.1LG01g03470 vs. NCBI nr
Match: gi|470146824|ref|XP_004309020.1| (PREDICTED: myb-like protein J isoform X1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 313.9 bits (803), Expect = 3.7e-82
Identity = 197/381 (51.71%), Postives = 243/381 (63.78%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNN--YCVKLFGVNLMDNGDESMRKSLSMG 60
           MV+ES RKCSHCG NGHNSRTC+    G NNNN   C+KLFGVN+M+  ++S++KS SMG
Sbjct: 1   MVKESARKCSHCGHNGHNSRTCNLLGHGVNNNNKGICLKLFGVNIMERQEDSIKKSYSMG 60

Query: 61  NLNIHVCNHNAAAAGNNVACYNAAVG-DDGGYLSDGLIHNTKRKAALERKKGKPWTEEEH 120
           NL          AAGN     N  VG DD GYLSDGLIHN KRKAA ERKKG+PWTEEEH
Sbjct: 61  NLQ---------AAGNGDQNNNINVGADDAGYLSDGLIHNKKRKAAHERKKGRPWTEEEH 120

Query: 121 RTFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFDI-- 180
           R FLAGLKKLGKGDWRGI++NFVTTRTPTQVASHAQKYFLR+   NDK+KRR SLFD+  
Sbjct: 121 RVFLAGLKKLGKGDWRGIARNFVTTRTPTQVASHAQKYFLRQA-TNDKRKRRTSLFDMHF 180

Query: 181 PEIKNNVSQE----------CQASSSSMRANGETSQILLPKNSSENQQQVSNLPTQLINR 240
            E+    + +           +ASS S  + G TS++ LP+ ++      +N+P+Q++NR
Sbjct: 181 KELSEQYAHQDSPLSPTAKTAEASSCSSSSPGSTSKV-LPQITTITNSPQANMPSQVLNR 240

Query: 241 FPHLCLDTPHFVPSTTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPK 300
           FPHLCLD+P  V     S  S       IPY+VG+ P  VP  P +  G  +   M    
Sbjct: 241 FPHLCLDSPPAVAQMPASSCSVPTYAPVIPYMVGM-PGTVPYTPAMHYGGPTFHYM---- 300

Query: 301 SPIRTSSPNAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQASAMAASFETDALELKI 360
             ++T   N    +    +++HPSGIP S RS P  PLM   + ++     + DALELKI
Sbjct: 301 --LKTQGGN---FSTCQPVISHPSGIPSSSRSVPSSPLMGVHRTNSSIT--KKDALELKI 358

Query: 361 GLPQSPQAKNLSSQTSGAIRV 367
           G PQ  Q  NLSS +SGAIRV
Sbjct: 361 GQPQPSQGANLSSSSSGAIRV 358

BLAST of Cp4.1LG01g03470 vs. NCBI nr
Match: gi|645257522|ref|XP_008234450.1| (PREDICTED: myb-like protein J isoform X2 [Prunus mume])

HSP 1 Score: 312.4 bits (799), Expect = 1.1e-81
Identity = 192/375 (51.20%), Postives = 242/375 (64.53%), Query Frame = 1

Query: 1   MVRESPRKCSHCGLNGHNSRTCSNFAKGNNNNN-YCVKLFGVNLMDNGDESMRKSLSMGN 60
           MV+ES RKCSHCG NGHNSRTC++   G+  N   C+KLFGVN+M+  D++M+KS SMGN
Sbjct: 1   MVKESMRKCSHCGHNGHNSRTCNSNGHGHGQNKGVCLKLFGVNIMEKEDDAMKKSYSMGN 60

Query: 61  LNIHVCNHNAAAAGNNVACYNAAVGD-DGGYLSDGLIHNTKRKAALERKKGKPWTEEEHR 120
           L          AAGN     N    D D GYLSDGLIHN + KAA ERKKG+PWTEEEHR
Sbjct: 61  LQ---------AAGNADHNNNVVTIDHDAGYLSDGLIHNKRHKAAHERKKGRPWTEEEHR 120

Query: 121 TFLAGLKKLGKGDWRGISKNFVTTRTPTQVASHAQKYFLRKMNANDKKKRRASLFDI--P 180
            FLAGLKKLGKGDWRGI++NFVTTRTPTQVASHAQKYFLR+    DK+KRR+SLFD+   
Sbjct: 121 VFLAGLKKLGKGDWRGIARNFVTTRTPTQVASHAQKYFLRQA-TYDKRKRRSSLFDMQFK 180

Query: 181 EIKNNVSQECQAS----SSSMRANGETSQILLPK-NSSENQQQVSNLPTQLINRFPHLCL 240
           E+ +   Q+   S    ++   + G +S++L  K N++ +    +++P+Q++NRFPHLCL
Sbjct: 181 ELSDQGHQDSPISPTRTATETSSEGSSSKVLPQKINTANSSPPKASVPSQILNRFPHLCL 240

Query: 241 DTPHFVPSTTGSPASGVANLHGIPYVVGVSPNNVPIIPLVKLGRRSSAVMAMPKSPIRTS 300
           D+P   P+   SP   V N   +PY++G+ P NVP  P++   R S   M        T 
Sbjct: 241 DSP---PAAPVSPPCNVPNYPAVPYMMGI-PENVPYAPMMHFARPSYHYMIKTHGNFATC 300

Query: 301 SPNAAAAAAAAALVAHPSGIPPSPRSSPQRPLMVQLQASAMAASFETDALELKIGLPQSP 360
           +P          +++HPSGI PSPRS P  P M       M++  E DALELKIG PQ  
Sbjct: 301 AP----------VISHPSGI-PSPRSLPSSPSMA--GRIGMSSPAEKDALELKIGQPQPS 348

Query: 361 QAKNLSSQTSGAIRV 367
           Q  NLSS TSGAIRV
Sbjct: 361 QGANLSSPTSGAIRV 348

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MY1R1_SOLTU2.3e-2842.00Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1[more]
DIV_ANTMA1.0e-2357.69Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA PE=2 SV=1[more]
MYBJ_DICDI3.9e-0729.13Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1[more]
MYBI_DICDI1.1e-0632.09Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KXN0_CUCSA6.6e-12369.09Uncharacterized protein OS=Cucumis sativus GN=Csa_4G025050 PE=4 SV=1[more]
W9S9L1_9ROSA3.8e-7047.53Transcription factor OS=Morus notabilis GN=L484_017072 PE=4 SV=1[more]
A0A067JSE9_JATCU3.3e-6647.64Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26305 PE=4 SV=1[more]
A0A061G0P9_THECC1.7e-5746.34Myb-like transcription factor family protein, putative isoform 1 OS=Theobroma ca... [more]
A0A061G076_THECC4.8e-5745.80Myb-like transcription factor family protein, putative isoform 2 OS=Theobroma ca... [more]
Match NameE-valueIdentityDescription
AT5G61620.11.1e-4941.16 myb-like transcription factor family protein[more]
AT5G47390.12.0e-3850.54 myb-like transcription factor family protein[more]
AT1G70000.12.6e-3852.00 myb-like transcription factor family protein[more]
AT5G56840.11.0e-3447.31 myb-like transcription factor family protein[more]
AT3G16350.11.1e-3339.15 Homeodomain-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659107779|ref|XP_008453854.1|1.3e-12769.37PREDICTED: transcription factor MYB1R1 isoform X1 [Cucumis melo][more]
gi|778697489|ref|XP_004146936.2|9.5e-12369.09PREDICTED: myb-like protein I [Cucumis sativus][more]
gi|659107781|ref|XP_008453855.1|3.6e-12267.80PREDICTED: transcription factor MYB1R1 isoform X2 [Cucumis melo][more]
gi|470146824|ref|XP_004309020.1|3.7e-8251.71PREDICTED: myb-like protein J isoform X1 [Fragaria vesca subsp. vesca][more]
gi|645257522|ref|XP_008234450.1|1.1e-8151.20PREDICTED: myb-like protein J isoform X2 [Prunus mume][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
Vocabulary: INTERPRO
TermDefinition
IPR017930Myb_dom
IPR009057Homeobox-like_sf
IPR006447Myb_dom_plants
IPR001005SANT/Myb
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g03470.1Cp4.1LG01g03470.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001005SANT/Myb domainPFAMPF00249Myb_DNA-bindingcoord: 110..155
score: 4.6
IPR001005SANT/Myb domainSMARTSM00717santcoord: 108..158
score: 6.3
IPR006447Myb domain, plantsTIGRFAMsTIGR01557TIGR01557coord: 107..159
score: 2.6
IPR009057Homeodomain-likeGENE3DG3DSA:1.10.10.60coord: 108..155
score: 1.6
IPR009057Homeodomain-likeunknownSSF46689Homeodomain-likecoord: 106..158
score: 9.66
IPR017930Myb domainPROFILEPS51294HTH_MYBcoord: 104..160
score: 19
NoneNo IPR availablePANTHERPTHR12374TRANSCRIPTIONAL ADAPTOR 2 ADA2 -RELATEDcoord: 7..196
score: 7.3
NoneNo IPR availablePANTHERPTHR12374:SF19SUBFAMILY NOT NAMEDcoord: 7..196
score: 7.3

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG01g03470Cp4.1LG14g05400Cucurbita pepo (Zucchini)cpecpeB234
The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG01g03470Wax gourdcpewgoB0477
Cp4.1LG01g03470Cucurbita pepo (Zucchini)cpecpeB403
Cp4.1LG01g03470Bottle gourd (USVL1VR-Ls)cpelsiB314
Cp4.1LG01g03470Bottle gourd (USVL1VR-Ls)cpelsiB340
Cp4.1LG01g03470Watermelon (Charleston Gray)cpewcgB372
Cp4.1LG01g03470Silver-seed gourdcarcpeB1001
Cp4.1LG01g03470Cucumber (Chinese Long) v3cpecucB0540