Cp4.1LG01g05420 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g05420
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myb transcription factor
Location: Cp4.1LG01 : 698077 .. 698922 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCTAATCGGAATTCCCCTCTCAAATCGAACCAGCAACAGTAACAGTAATCGAATTAAAGGATCGTGGAGTCCTCAAGAGGACGCTACCTTAATCAAATTGGTGGAGGAACACGGCCCGAGAAATTGGTCTTTGATCAGTACAGGAATTCCTGGTCGCTCTGGCAAGTCCTGCCGTTTGCGCTGGTGTAATCAGCTCAGTCCGGCGGTTCAGCACCGTCCGTTCACGCCGGCGGAGGACTCTCTGATTGTTCAGGCGCACGCCGTCCACGGTAACAAGTGGTCCACGATTGCTCGCCTCTTGCCTGGTCGGACCGACAATGCGATTAAGAATCACTGGAACTCGACGCTCAGGCGCCGCCGCGATGCCGATTTTTCTTCCGATTCCACGGCGTTTATGAAGCGGCCGAGTTGTGAAGTTTCCAGATCGGCGTCGGATGATGATGATTCGGAAGGATCGTTCAAACGGAGGTGCTTTGGAATGGCTGTGGAGCGGAACGGCGGCGGAGAAGGAGAAGGACCGGAGACGTCGCTGAGGCTGTCATTGCCCGGGGAGGTGGTGGTGCCGGCGGCGTGTGTCAAGGTGAAAGAGGAGGTGACAGTGGATAGCGAGGAGAACGATGGACGGCGTCATGTGACGGTGGCGGCGACGGCGACGGCGACGGCGGAGGAGAAAGGGAACCGGAAAAGGGAAATGGACGAGAGTTGTATGGCAACGATAATGCAGAGGATGATTGCTCGGGAGGTCCGAAATTACATAGACAGTTTACGGGCCCATGGTGGGCTTTCAATTGGGCCTAATATTGGGCCTGGGCTGGATTCAGGAGTCCAAGATACTGAGTAA

mRNA sequence

ATGGTTCTAATCGGAATTCCCCTCTCAAATCGAACCAGCAACAGTAACAGTAATCGAATTAAAGGATCGTGGAGTCCTCAAGAGGACGCTACCTTAATCAAATTGGTGGAGGAACACGGCCCGAGAAATTGGTCTTTGATCAGTACAGGAATTCCTGGTCGCTCTGGCAAGTCCTGCCGTTTGCGCTGGTGTAATCAGCTCAGTCCGGCGGTTCAGCACCGTCCGTTCACGCCGGCGGAGGACTCTCTGATTGTTCAGGCGCACGCCGTCCACGGTAACAAGTGGTCCACGATTGCTCGCCTCTTGCCTGGTCGGACCGACAATGCGATTAAGAATCACTGGAACTCGACGCTCAGGCGCCGCCGCGATGCCGATTTTTCTTCCGATTCCACGGCGTTTATGAAGCGGCCGAGTTGTGAAGTTTCCAGATCGGCGTCGGATGATGATGATTCGGAAGGATCGTTCAAACGGAGGTGCTTTGGAATGGCTGTGGAGCGGAACGGCGGCGGAGAAGGAGAAGGACCGGAGACGTCGCTGAGGCTGTCATTGCCCGGGGAGGTGGTGGTGCCGGCGGCGTGTGTCAAGGTGAAAGAGGAGGTGACAGTGGATAGCGAGGAGAACGATGGACGGCGTCATGTGACGGTGGCGGCGACGGCGACGGCGACGGCGGAGGAGAAAGGGAACCGGAAAAGGGAAATGGACGAGAGTTGTATGGCAACGATAATGCAGAGGATGATTGCTCGGGAGGTCCGAAATTACATAGACAGTTTACGGGCCCATGGTGGGCTTTCAATTGGGCCTAATATTGGGCCTGGGCTGGATTCAGGAGTCCAAGATACTGAGTAA

Coding sequence (CDS)

ATGGTTCTAATCGGAATTCCCCTCTCAAATCGAACCAGCAACAGTAACAGTAATCGAATTAAAGGATCGTGGAGTCCTCAAGAGGACGCTACCTTAATCAAATTGGTGGAGGAACACGGCCCGAGAAATTGGTCTTTGATCAGTACAGGAATTCCTGGTCGCTCTGGCAAGTCCTGCCGTTTGCGCTGGTGTAATCAGCTCAGTCCGGCGGTTCAGCACCGTCCGTTCACGCCGGCGGAGGACTCTCTGATTGTTCAGGCGCACGCCGTCCACGGTAACAAGTGGTCCACGATTGCTCGCCTCTTGCCTGGTCGGACCGACAATGCGATTAAGAATCACTGGAACTCGACGCTCAGGCGCCGCCGCGATGCCGATTTTTCTTCCGATTCCACGGCGTTTATGAAGCGGCCGAGTTGTGAAGTTTCCAGATCGGCGTCGGATGATGATGATTCGGAAGGATCGTTCAAACGGAGGTGCTTTGGAATGGCTGTGGAGCGGAACGGCGGCGGAGAAGGAGAAGGACCGGAGACGTCGCTGAGGCTGTCATTGCCCGGGGAGGTGGTGGTGCCGGCGGCGTGTGTCAAGGTGAAAGAGGAGGTGACAGTGGATAGCGAGGAGAACGATGGACGGCGTCATGTGACGGTGGCGGCGACGGCGACGGCGACGGCGGAGGAGAAAGGGAACCGGAAAAGGGAAATGGACGAGAGTTGTATGGCAACGATAATGCAGAGGATGATTGCTCGGGAGGTCCGAAATTACATAGACAGTTTACGGGCCCATGGTGGGCTTTCAATTGGGCCTAATATTGGGCCTGGGCTGGATTCAGGAGTCCAAGATACTGAGTAA

Protein sequence

MVLIGIPLSNRTSNSNSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADFSSDSTAFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVERNGGGEGEGPETSLRLSLPGEVVVPAACVKVKEEVTVDSEENDGRRHVTVAATATATAEEKGNRKREMDESCMATIMQRMIAREVRNYIDSLRAHGGLSIGPNIGPGLDSGVQDTE
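The gene, mRNA and CDS sequences above are identical, and the protein is the frame-1 translation of the CDS. The following is a minimal Python sketch of that relationship, assuming the standard nuclear genetic code. The helper names `translate` and `feature_length` are hypothetical and are not part of this database. Only the first 30 and the last 15 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGTTCTAATCGGAATTCCCCTCTCAAAT"))  # MVLIGIPLSN
print(translate("CAAGATACTGAGTAA"))                 # QDTE
print(feature_length(698077, 698922))               # 846
```

The 846 bp span equals 3 x 282 codons, which matches a 281-residue protein plus one stop codon and is consistent with a single-exon gene.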
BLAST of Cp4.1LG01g05420 vs. Swiss-Prot
Match: MYB44_ARATH (Transcription factor MYB44 OS=Arabidopsis thaliana GN=MYB44 PE=2 SV=1)

HSP 1 Score: 169.1 bits (427), Expect = 7.0e-41
Identity = 74/105 (70.48%), Positives = 89/105 (84.76%), Query Frame = 1

Query: 17  SNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPF 76
           ++RIKG WSP+ED  L +LV ++GPRNW++IS  IPGRSGKSCRLRWCNQLSP V+HRPF
Sbjct: 2   ADRIKGPWSPEEDEQLRRLVVKYGPRNWTVISKSIPGRSGKSCRLRWCNQLSPQVEHRPF 61

Query: 77  TPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRR 122
           +  ED  I +AHA  GNKW+TIARLL GRTDNA+KNHWNSTL+R+
Sbjct: 62  SAEEDETIARAHAQFGNKWATIARLLNGRTDNAVKNHWNSTLKRK 106

BLAST of Cp4.1LG01g05420 vs. Swiss-Prot
Match: MB3R1_ARATH (Myb-related protein 3R-1 OS=Arabidopsis thaliana GN=MYB3R-1 PE=2 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 5.5e-30
Identity = 53/113 (46.90%), Positives = 82/113 (72.57%), Query Frame = 1

Query: 16  NSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRP 75
           N   +KG WS +ED T+I LVE++GP+ WS IS  +PGR GK CR RW N L+P +    
Sbjct: 82  NPELVKGPWSKEEDNTIIDLVEKYGPKKWSTISQHLPGRIGKQCRERWHNHLNPGINKNA 141

Query: 76  FTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADFSS 129
           +T  E+  +++AH ++GNKW+ + + LPGR+DN+IKNHWNS+++++ D+ ++S
Sbjct: 142 WTQEEELTLIRAHQIYGNKWAELMKFLPGRSDNSIKNHWNSSVKKKLDSYYAS 194

BLAST of Cp4.1LG01g05420 vs. Swiss-Prot
Match: MYBA_CHICK (Myb-related protein A OS=Gallus gallus GN=MYBL1 PE=2 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 5.5e-30
Identity = 58/106 (54.72%), Positives = 79/106 (74.53%), Query Frame = 1

Query: 16  NSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRP 75
           N   IKG W+ +ED  +I+LV+++GP+ WSLI+  + GR GK CR RW N L+P V+   
Sbjct: 82  NPELIKGPWTKEEDQRVIELVQKYGPKRWSLIAKHLKGRIGKQCRERWHNHLNPEVKKSS 141

Query: 76  FTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRR 122
           +T AED +I +AH   GN+W+ IA+LLPGRTDN+IKNHWNST+RR+
Sbjct: 142 WTEAEDRVIYEAHKRLGNRWAEIAKLLPGRTDNSIKNHWNSTMRRK 187

BLAST of Cp4.1LG01g05420 vs. Swiss-Prot
Match: MYB_MOUSE (Transcriptional activator Myb OS=Mus musculus GN=Myb PE=1 SV=2)

HSP 1 Score: 132.5 bits (332), Expect = 7.2e-30
Identity = 58/106 (54.72%), Positives = 78/106 (73.58%), Query Frame = 1

Query: 16  NSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRP 75
           N   IKG W+ +ED  +I+LV+++GP+ WS+I+  + GR GK CR RW N L+P V+   
Sbjct: 87  NPELIKGPWTKEEDQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKTS 146

Query: 76  FTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRR 122
           +T  ED +I QAH   GN+W+ IA+LLPGRTDNAIKNHWNST+RR+
Sbjct: 147 WTEEEDRIIYQAHKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRK 192

BLAST of Cp4.1LG01g05420 vs. Swiss-Prot
Match: MYB_CHICK (Transcriptional activator Myb OS=Gallus gallus GN=MYB PE=1 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 7.2e-30
Identity = 58/106 (54.72%), Positives = 78/106 (73.58%), Query Frame = 1

Query: 16  NSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRP 75
           N   IKG W+ +ED  +I+LV+++GP+ WS+I+  + GR GK CR RW N L+P V+   
Sbjct: 87  NPELIKGPWTKEEDQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKTS 146

Query: 76  FTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRR 122
           +T  ED +I QAH   GN+W+ IA+LLPGRTDNAIKNHWNST+RR+
Sbjct: 147 WTEEEDRIIYQAHKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRK 192

BLAST of Cp4.1LG01g05420 vs. TrEMBL
Match: A0A0A0KQJ2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G209490 PE=4 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 3.5e-116
Identity = 227/293 (77.47%), Positives = 243/293 (82.94%), Query Frame = 1

Query: 1   MVLIGIPLSNRTSNSNSN-------RIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPG 60
           MVLIGI L N  +N+NSN       R+KGSWSPQEDATLIKLVE+HGPRNWSLISTGIPG
Sbjct: 1   MVLIGIALPNHNNNTNSNNNNNNHNRVKGSWSPQEDATLIKLVEQHGPRNWSLISTGIPG 60

Query: 61  RSGKSCRLRWCNQLSPAVQHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNH 120
           RSGKSCRLRWCNQLSP VQHRPFTPAED+LI+QAHAVHGNKWSTIAR LPGRTDNAIKNH
Sbjct: 61  RSGKSCRLRWCNQLSPTVQHRPFTPAEDALILQAHAVHGNKWSTIARSLPGRTDNAIKNH 120

Query: 121 WNSTLRRRRDADFSSDSTAFMKRPSCEVSRSAS-----DDDDSEGSFKRRCFGMAVERNG 180
           WNSTLRRRRDAD SSDSTAF+KRPS EVSRSAS     DDDDSE S KR CF    +RN 
Sbjct: 121 WNSTLRRRRDADLSSDSTAFLKRPSYEVSRSASDDDDNDDDDSEASLKRTCF----DRNS 180

Query: 181 GGEGEGPETSLRLSLPGEVVVPAAC-VKVKEEVTVDSEENDGRRHVTVAATATATAEEKG 240
            G GE PETSLRLSLPGEVVV A   VKVKEEVTV+SE+NDGRR V  AA      EEKG
Sbjct: 181 VGGGE-PETSLRLSLPGEVVVAAEMDVKVKEEVTVESEKNDGRRRVVAAA-----EEEKG 240

Query: 241 NRKREMDESCMATIMQRMIAREVRNYIDSLRAHGGLSIGPNIGPGLDSGVQDT 281
           NRK+E+DESC+ATIMQRMIA+EVRNYIDSLRA GGL IGP +GPGLD   Q+T
Sbjct: 241 NRKKEVDESCLATIMQRMIAQEVRNYIDSLRARGGLGIGPGVGPGLDPAAQET 283

BLAST of Cp4.1LG01g05420 vs. TrEMBL
Match: V4T705_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032317mg PE=4 SV=1)

HSP 1 Score: 284.6 bits (727), Expect = 1.3e-73
Identity = 158/273 (57.88%), Positives = 191/273 (69.96%), Query Frame = 1

Query: 8   LSNRTSNSNSN---RIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWC 67
           ++N ++NSNSN   RIKGSWSPQEDATLIKLVE+HGPRNWS+IS+GIPGRSGKSCRLRWC
Sbjct: 11  MNNNSTNSNSNNNDRIKGSWSPQEDATLIKLVEQHGPRNWSMISSGIPGRSGKSCRLRWC 70

Query: 68  NQLSPAVQHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDA 127
           NQLSP VQHRPFTPAEDS+I+QAHAVHGNKW+TIAR LPGRTDNAIKNHWNSTLRR+R A
Sbjct: 71  NQLSPTVQHRPFTPAEDSIIIQAHAVHGNKWATIARSLPGRTDNAIKNHWNSTLRRKRMA 130

Query: 128 DFSSDS----TAFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVER---NGGGEGEGPET 187
           + SS S    +  MKRP  E + S+++ D   G  KR+C  ++ E    N      GPET
Sbjct: 131 ELSSASSESNSVMMKRPGFENNISSAESDSDSGGIKRQCLRVSQEHDSYNVEAGIVGPET 190

Query: 188 SLRLSLPGEVVVPAAC----VKVKEEVTVDSE---ENDGRRHVTVAATATATAEEKGNRK 247
            L LS PGE VV        V+ +E++ ++ E   + DG +            E++    
Sbjct: 191 LLTLSPPGESVVLGGSGGEKVEDEEKLKINKECDGDGDGGK------IEKEKGEDRCTVD 250

Query: 248 REMDESCMATIMQRMIAREVRNYIDSLRAHGGL 264
            EM+ESC+ TIM+RMIA EVRN  D LRA  GL
Sbjct: 251 MEMEESCLFTIMRRMIAEEVRNQFDGLRAQAGL 277

BLAST of Cp4.1LG01g05420 vs. TrEMBL
Match: A0A067LKB3_JATCU (MYB family protein OS=Jatropha curcas GN=JCGZ_01234 PE=4 SV=1)

HSP 1 Score: 278.9 bits (712), Expect = 6.9e-72
Identity = 154/263 (58.56%), Positives = 185/263 (70.34%), Query Frame = 1

Query: 13  SNSNSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQ 72
           S   ++RIKGSWSPQEDA LIKLVE+HGPRNWSLISTGIPGRSGKSCRLRWCNQLSP VQ
Sbjct: 5   SGCGADRIKGSWSPQEDANLIKLVEQHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPEVQ 64

Query: 73  HRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADF---SSD 132
           HRPF+PAED++IVQAHA+HGNKW+TIARLLPGRTDNAIKNHWNSTLRR+R A+    SS+
Sbjct: 65  HRPFSPAEDAIIVQAHAIHGNKWATIARLLPGRTDNAIKNHWNSTLRRKRAAELSSASSE 124

Query: 133 STAFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVERNGGGEGEG----PETSLRLSLPG 192
           + + MKRP  +V    S +  SE   KR+C G A   +   +GE     PETSL LS PG
Sbjct: 125 ANSVMKRPCLDV----STESGSESGVKRQCLG-APPEDSSFDGEAGIVEPETSLTLSPPG 184

Query: 193 EVVVPAACVKVKEEVTVDSEENDGRRHVTVAATATATAEEKGNRKREMDESCMATIMQRM 252
           +  V     +  +EV V   E D  R +          EE+     +++E+C+ TIMQRM
Sbjct: 185 DGFVRTEVGEKVKEVAVKGGEQDVDRQM----VGEEEEEEEETCGLQIEETCLLTIMQRM 244

Query: 253 IAREVRNYIDSLRAHGGLSIGPN 269
           IA EVR+YID LR+  G + GPN
Sbjct: 245 IATEVRSYIDRLRSKNGFA-GPN 257

BLAST of Cp4.1LG01g05420 vs. TrEMBL
Match: B9HHT0_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s07100g PE=4 SV=1)

HSP 1 Score: 273.1 bits (697), Expect = 3.8e-70
Identity = 153/259 (59.07%), Positives = 181/259 (69.88%), Query Frame = 1

Query: 14  NSNSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQH 73
           N   +RIKGSWSPQEDATLIKLVE+HGPRNWS+ISTGIPGRSGKSCRLRWCNQLSP VQH
Sbjct: 5   NGGDHRIKGSWSPQEDATLIKLVEQHGPRNWSMISTGIPGRSGKSCRLRWCNQLSPEVQH 64

Query: 74  RPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRD--ADFSSDST 133
           RPFTPAED+ IVQAHA+HGNKW+TIARLLPGRTDNAIKNHWNSTLRR+R   +  SS+S 
Sbjct: 65  RPFTPAEDAKIVQAHAIHGNKWATIARLLPGRTDNAIKNHWNSTLRRKRGSVSSASSESN 124

Query: 134 AFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVERN--GGGEG-EGPETSLRLSLPGE-V 193
           +  KR + EVS  +  + DS GS KR+C   +   N   G  G +GPETSL LS PG+  
Sbjct: 125 SVFKRSTLEVSVVSESESDS-GS-KRQCLHASPGHNSVNGDVGVDGPETSLTLSPPGDGF 184

Query: 194 VVPAACVKVKEEVTVDSEENDGRRHVTVAATATATAEEKGNRKREMDESCMATIMQRMIA 253
           V  A   K+KE V V+  E D    +   A       EK     EMDE C+  ++Q++I 
Sbjct: 185 VSMAVAEKLKEGVAVNGREKDLGESIMKDA-------EKIRCTEEMDEDCVRALIQKIIQ 244

Query: 254 REVRNYIDSLRAHGGLSIG 267
            EVR Y D L+   G++IG
Sbjct: 245 EEVRIYFDRLKTRNGVTIG 254

BLAST of Cp4.1LG01g05420 vs. TrEMBL
Match: A0A067FND1_CITSI (Uncharacterized protein (Fragment) OS=Citrus sinensis GN=CISIN_1g046570mg PE=4 SV=1)

HSP 1 Score: 270.8 bits (691), Expect = 1.9e-69
Identity = 152/259 (58.69%), Positives = 178/259 (68.73%), Query Frame = 1

Query: 13  SNSNSN-RIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAV 72
           SNSN+N RIKGSWSPQEDATLIKLVE+HGPRNWS+IS+GIPGRSGKSCRLRWCNQLSP V
Sbjct: 3   SNSNNNDRIKGSWSPQEDATLIKLVEQHGPRNWSMISSGIPGRSGKSCRLRWCNQLSPTV 62

Query: 73  QHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADFSSDS- 132
           QHRPFTPAEDS+I+QAHAVHGNKW+TIAR LPGRTDNAIKNHWNSTLRR+R A+ SS S 
Sbjct: 63  QHRPFTPAEDSIIIQAHAVHGNKWATIARSLPGRTDNAIKNHWNSTLRRKRMAELSSASS 122

Query: 133 ---TAFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVER---NGGGEGEGPETSLRLSLP 192
              +  MKRP  E + S+++ D   G  KR+C  ++ E    N      GPET       
Sbjct: 123 ESNSVMMKRPGFENNISSAESDSDSGGIKRQCLRVSQEHDSYNVEAGIVGPET-----FG 182

Query: 193 GEVVVPAACVKVKEEVTVDSEENDGRRHVTVAATATATAEEKGNRKREMDESCMATIMQR 252
           GE V     +K+ +E   D    DG +            E++     EM+ESC+ TIM+R
Sbjct: 183 GEKVEDEEKLKINKECDGD---GDGGK------IEKEKGEDRCTVDMEMEESCLFTIMRR 242

Query: 253 MIAREVRNYIDSLRAHGGL 264
           MIA EVRN  D LRA  GL
Sbjct: 243 MIAEEVRNQFDGLRAQAGL 247

BLAST of Cp4.1LG01g05420 vs. TAIR10
Match: AT2G23290.1 (AT2G23290.1 myb domain protein 70)

HSP 1 Score: 173.7 bits (439), Expect = 1.6e-43
Identity = 117/282 (41.49%), Positives = 147/282 (52.13%), Query Frame = 1

Query: 12  TSNSNSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAV 71
           ++    +RIKG WSP+ED  L  LV++HGPRNWSLIS  IPGRSGKSCRLRWCNQLSP V
Sbjct: 4   STRKEMDRIKGPWSPEEDDLLQSLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPEV 63

Query: 72  QHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADFSSDST 131
           +HR FT  ED  I+ AHA  GNKW+TIARLL GRTDNAIKNHWNSTL+R+     S    
Sbjct: 64  EHRGFTAEEDDTIILAHARFGNKWATIARLLNGRTDNAIKNHWNSTLKRK----CSGGGG 123

Query: 132 AFMKRPSCEVSRSASDDDD--SEGSFKRRCFG-------MAVERNGGGEGEGPETS---L 191
              +  SC+   +   D +   E   KRR  G        A+   G    E  ++S   L
Sbjct: 124 GGEEGQSCDFGGNGGYDGNLTDEKPLKRRASGGGGVVVVTALSPTGSDVSEQSQSSGSVL 183

Query: 192 RLSLPGEVVVPAACV--KVKEEVTVDSEENDG-----------RRHVTVAATATATAEEK 251
            +S    V  P A     V E  + + EE D                T         EE+
Sbjct: 184 PVSSSCHVFKPTARAGGVVIESSSPEEEEKDPMTCLRLSLPWVNESTTPPELFPVKREEE 243

Query: 252 GNRKREMD--ESCMATIMQRMIAREVRNYIDSLRAHGGLSIG 267
             ++RE+        T++Q MI  EVR+Y+  L+   G   G
Sbjct: 244 EEKEREISGLGGDFMTVVQEMIKTEVRSYMADLQLGNGGGAG 281

BLAST of Cp4.1LG01g05420 vs. TAIR10
Match: AT4G37260.1 (AT4G37260.1 myb domain protein 73)

HSP 1 Score: 171.4 bits (433), Expect = 7.9e-43
Identity = 100/215 (46.51%), Positives = 126/215 (58.60%), Query Frame = 1

Query: 8   LSNRTSNSNSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQL 67
           +SN T   N  RIKG WSP+ED  L +LV++HGPRNWSLIS  IPGRSGKSCRLRWCNQL
Sbjct: 1   MSNPT-RKNMERIKGPWSPEEDDLLQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQL 60

Query: 68  SPAVQHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADFS 127
           SP V+HR F+  ED  I++AHA  GNKW+TI+RLL GRTDNAIKNHWNSTL+R+      
Sbjct: 61  SPEVEHRAFSQEEDETIIRAHARFGNKWATISRLLNGRTDNAIKNHWNSTLKRK------ 120

Query: 128 SDSTAFMKRPSCEVSRSASDDDD--SEGSFKRRCFGMAVERNG----GGEGEGPETSLRL 187
                 ++  SC+   +   D +   E   KR   G      G     G   G + S + 
Sbjct: 121 ----CSVEGQSCDFGGNGGYDGNLGEEQPLKRTASGGGGVSTGLYMSPGSPSGSDVSEQS 180

Query: 188 SLPGEVVVPAACVKVKEEVTVDSEENDGRRHVTVA 217
           S    V  P     V+ EVT  S   D   +++++
Sbjct: 181 SGGAHVFKPT----VRSEVTASSSGEDPPTYLSLS 200

BLAST of Cp4.1LG01g05420 vs. TAIR10
Match: AT3G55730.1 (AT3G55730.1 myb domain protein 109)

HSP 1 Score: 169.9 bits (429), Expect = 2.3e-42
Identity = 73/104 (70.19%), Positives = 88/104 (84.62%), Query Frame = 1

Query: 18  NRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFT 77
           +++KG WS +EDA L KLV + GPRNWSLI+ GIPGRSGKSCRLRWCNQL P ++ +PF+
Sbjct: 53  SKVKGPWSTEEDAVLTKLVRKLGPRNWSLIARGIPGRSGKSCRLRWCNQLDPCLKRKPFS 112

Query: 78  PAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRR 122
             ED +I+ AHAVHGNKW+ IA+LL GRTDNAIKNHWNSTLRR+
Sbjct: 113 DEEDRMIISAHAVHGNKWAVIAKLLTGRTDNAIKNHWNSTLRRK 156

BLAST of Cp4.1LG01g05420 vs. TAIR10
Match: AT5G67300.1 (AT5G67300.1 myb domain protein r1)

HSP 1 Score: 169.1 bits (427), Expect = 3.9e-42
Identity = 74/105 (70.48%), Positives = 89/105 (84.76%), Query Frame = 1

Query: 17  SNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPF 76
           ++RIKG WSP+ED  L +LV ++GPRNW++IS  IPGRSGKSCRLRWCNQLSP V+HRPF
Sbjct: 2   ADRIKGPWSPEEDEQLRRLVVKYGPRNWTVISKSIPGRSGKSCRLRWCNQLSPQVEHRPF 61

Query: 77  TPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRR 122
           +  ED  I +AHA  GNKW+TIARLL GRTDNA+KNHWNSTL+R+
Sbjct: 62  SAEEDETIARAHAQFGNKWATIARLLNGRTDNAVKNHWNSTLKRK 106

BLAST of Cp4.1LG01g05420 vs. TAIR10
Match: AT3G50060.1 (AT3G50060.1 myb domain protein 77)

HSP 1 Score: 168.7 bits (426), Expect = 5.1e-42
Identity = 74/105 (70.48%), Positives = 88/105 (83.81%), Query Frame = 1

Query: 17  SNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPF 76
           ++R+KG WS +ED  L ++VE++GPRNWS IS  IPGRSGKSCRLRWCNQLSP V+HRPF
Sbjct: 2   ADRVKGPWSQEEDEQLRRMVEKYGPRNWSAISKSIPGRSGKSCRLRWCNQLSPEVEHRPF 61

Query: 77  TPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRR 122
           +P ED  IV A A  GNKW+TIARLL GRTDNA+KNHWNSTL+R+
Sbjct: 62  SPEEDETIVTARAQFGNKWATIARLLNGRTDNAVKNHWNSTLKRK 106

BLAST of Cp4.1LG01g05420 vs. NCBI nr
Match: gi|449460939|ref|XP_004148201.1| (PREDICTED: transcription factor MYB44-like [Cucumis sativus])

HSP 1 Score: 426.0 bits (1094), Expect = 5.0e-116
Identity = 227/293 (77.47%), Positives = 243/293 (82.94%), Query Frame = 1

Query: 1   MVLIGIPLSNRTSNSNSN-------RIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPG 60
           MVLIGI L N  +N+NSN       R+KGSWSPQEDATLIKLVE+HGPRNWSLISTGIPG
Sbjct: 1   MVLIGIALPNHNNNTNSNNNNNNHNRVKGSWSPQEDATLIKLVEQHGPRNWSLISTGIPG 60

Query: 61  RSGKSCRLRWCNQLSPAVQHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNH 120
           RSGKSCRLRWCNQLSP VQHRPFTPAED+LI+QAHAVHGNKWSTIAR LPGRTDNAIKNH
Sbjct: 61  RSGKSCRLRWCNQLSPTVQHRPFTPAEDALILQAHAVHGNKWSTIARSLPGRTDNAIKNH 120

Query: 121 WNSTLRRRRDADFSSDSTAFMKRPSCEVSRSAS-----DDDDSEGSFKRRCFGMAVERNG 180
           WNSTLRRRRDAD SSDSTAF+KRPS EVSRSAS     DDDDSE S KR CF    +RN 
Sbjct: 121 WNSTLRRRRDADLSSDSTAFLKRPSYEVSRSASDDDDNDDDDSEASLKRTCF----DRNS 180

Query: 181 GGEGEGPETSLRLSLPGEVVVPAAC-VKVKEEVTVDSEENDGRRHVTVAATATATAEEKG 240
            G GE PETSLRLSLPGEVVV A   VKVKEEVTV+SE+NDGRR V  AA      EEKG
Sbjct: 181 VGGGE-PETSLRLSLPGEVVVAAEMDVKVKEEVTVESEKNDGRRRVVAAA-----EEEKG 240

Query: 241 NRKREMDESCMATIMQRMIAREVRNYIDSLRAHGGLSIGPNIGPGLDSGVQDT 281
           NRK+E+DESC+ATIMQRMIA+EVRNYIDSLRA GGL IGP +GPGLD   Q+T
Sbjct: 241 NRKKEVDESCLATIMQRMIAQEVRNYIDSLRARGGLGIGPGVGPGLDPAAQET 283

BLAST of Cp4.1LG01g05420 vs. NCBI nr
Match: gi|567886500|ref|XP_006435772.1| (hypothetical protein CICLE_v10032317mg [Citrus clementina])

HSP 1 Score: 284.6 bits (727), Expect = 1.8e-73
Identity = 158/273 (57.88%), Positives = 191/273 (69.96%), Query Frame = 1

Query: 8   LSNRTSNSNSN---RIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWC 67
           ++N ++NSNSN   RIKGSWSPQEDATLIKLVE+HGPRNWS+IS+GIPGRSGKSCRLRWC
Sbjct: 11  MNNNSTNSNSNNNDRIKGSWSPQEDATLIKLVEQHGPRNWSMISSGIPGRSGKSCRLRWC 70

Query: 68  NQLSPAVQHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDA 127
           NQLSP VQHRPFTPAEDS+I+QAHAVHGNKW+TIAR LPGRTDNAIKNHWNSTLRR+R A
Sbjct: 71  NQLSPTVQHRPFTPAEDSIIIQAHAVHGNKWATIARSLPGRTDNAIKNHWNSTLRRKRMA 130

Query: 128 DFSSDS----TAFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVER---NGGGEGEGPET 187
           + SS S    +  MKRP  E + S+++ D   G  KR+C  ++ E    N      GPET
Sbjct: 131 ELSSASSESNSVMMKRPGFENNISSAESDSDSGGIKRQCLRVSQEHDSYNVEAGIVGPET 190

Query: 188 SLRLSLPGEVVVPAAC----VKVKEEVTVDSE---ENDGRRHVTVAATATATAEEKGNRK 247
            L LS PGE VV        V+ +E++ ++ E   + DG +            E++    
Sbjct: 191 LLTLSPPGESVVLGGSGGEKVEDEEKLKINKECDGDGDGGK------IEKEKGEDRCTVD 250

Query: 248 REMDESCMATIMQRMIAREVRNYIDSLRAHGGL 264
            EM+ESC+ TIM+RMIA EVRN  D LRA  GL
Sbjct: 251 MEMEESCLFTIMRRMIAEEVRNQFDGLRAQAGL 277

BLAST of Cp4.1LG01g05420 vs. NCBI nr
Match: gi|802547651|ref|XP_012090668.1| (PREDICTED: transcriptional activator Myb-like [Jatropha curcas])

HSP 1 Score: 278.9 bits (712), Expect = 1.0e-71
Identity = 154/263 (58.56%), Positives = 185/263 (70.34%), Query Frame = 1

Query: 13  SNSNSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQ 72
           S   ++RIKGSWSPQEDA LIKLVE+HGPRNWSLISTGIPGRSGKSCRLRWCNQLSP VQ
Sbjct: 5   SGCGADRIKGSWSPQEDANLIKLVEQHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPEVQ 64

Query: 73  HRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADF---SSD 132
           HRPF+PAED++IVQAHA+HGNKW+TIARLLPGRTDNAIKNHWNSTLRR+R A+    SS+
Sbjct: 65  HRPFSPAEDAIIVQAHAIHGNKWATIARLLPGRTDNAIKNHWNSTLRRKRAAELSSASSE 124

Query: 133 STAFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVERNGGGEGEG----PETSLRLSLPG 192
           + + MKRP  +V    S +  SE   KR+C G A   +   +GE     PETSL LS PG
Sbjct: 125 ANSVMKRPCLDV----STESGSESGVKRQCLG-APPEDSSFDGEAGIVEPETSLTLSPPG 184

Query: 193 EVVVPAACVKVKEEVTVDSEENDGRRHVTVAATATATAEEKGNRKREMDESCMATIMQRM 252
           +  V     +  +EV V   E D  R +          EE+     +++E+C+ TIMQRM
Sbjct: 185 DGFVRTEVGEKVKEVAVKGGEQDVDRQM----VGEEEEEEEETCGLQIEETCLLTIMQRM 244

Query: 253 IAREVRNYIDSLRAHGGLSIGPN 269
           IA EVR+YID LR+  G + GPN
Sbjct: 245 IATEVRSYIDRLRSKNGFA-GPN 257

BLAST of Cp4.1LG01g05420 vs. NCBI nr
Match: gi|224098716|ref|XP_002311241.1| (hypothetical protein POPTR_0008s07100g [Populus trichocarpa])

HSP 1 Score: 273.1 bits (697), Expect = 5.5e-70
Identity = 153/259 (59.07%), Positives = 181/259 (69.88%), Query Frame = 1

Query: 14  NSNSNRIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQH 73
           N   +RIKGSWSPQEDATLIKLVE+HGPRNWS+ISTGIPGRSGKSCRLRWCNQLSP VQH
Sbjct: 5   NGGDHRIKGSWSPQEDATLIKLVEQHGPRNWSMISTGIPGRSGKSCRLRWCNQLSPEVQH 64

Query: 74  RPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRD--ADFSSDST 133
           RPFTPAED+ IVQAHA+HGNKW+TIARLLPGRTDNAIKNHWNSTLRR+R   +  SS+S 
Sbjct: 65  RPFTPAEDAKIVQAHAIHGNKWATIARLLPGRTDNAIKNHWNSTLRRKRGSVSSASSESN 124

Query: 134 AFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVERN--GGGEG-EGPETSLRLSLPGE-V 193
           +  KR + EVS  +  + DS GS KR+C   +   N   G  G +GPETSL LS PG+  
Sbjct: 125 SVFKRSTLEVSVVSESESDS-GS-KRQCLHASPGHNSVNGDVGVDGPETSLTLSPPGDGF 184

Query: 194 VVPAACVKVKEEVTVDSEENDGRRHVTVAATATATAEEKGNRKREMDESCMATIMQRMIA 253
           V  A   K+KE V V+  E D    +   A       EK     EMDE C+  ++Q++I 
Sbjct: 185 VSMAVAEKLKEGVAVNGREKDLGESIMKDA-------EKIRCTEEMDEDCVRALIQKIIQ 244

Query: 254 REVRNYIDSLRAHGGLSIG 267
            EVR Y D L+   G++IG
Sbjct: 245 EEVRIYFDRLKTRNGVTIG 254

BLAST of Cp4.1LG01g05420 vs. NCBI nr
Match: gi|641850046|gb|KDO68919.1| (hypothetical protein CISIN_1g046570mg, partial [Citrus sinensis])

HSP 1 Score: 270.8 bits (691), Expect = 2.7e-69
Identity = 152/259 (58.69%), Positives = 178/259 (68.73%), Query Frame = 1

Query: 13  SNSNSN-RIKGSWSPQEDATLIKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAV 72
           SNSN+N RIKGSWSPQEDATLIKLVE+HGPRNWS+IS+GIPGRSGKSCRLRWCNQLSP V
Sbjct: 3   SNSNNNDRIKGSWSPQEDATLIKLVEQHGPRNWSMISSGIPGRSGKSCRLRWCNQLSPTV 62

Query: 73  QHRPFTPAEDSLIVQAHAVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADFSSDS- 132
           QHRPFTPAEDS+I+QAHAVHGNKW+TIAR LPGRTDNAIKNHWNSTLRR+R A+ SS S 
Sbjct: 63  QHRPFTPAEDSIIIQAHAVHGNKWATIARSLPGRTDNAIKNHWNSTLRRKRMAELSSASS 122

Query: 133 ---TAFMKRPSCEVSRSASDDDDSEGSFKRRCFGMAVER---NGGGEGEGPETSLRLSLP 192
              +  MKRP  E + S+++ D   G  KR+C  ++ E    N      GPET       
Sbjct: 123 ESNSVMMKRPGFENNISSAESDSDSGGIKRQCLRVSQEHDSYNVEAGIVGPET-----FG 182

Query: 193 GEVVVPAACVKVKEEVTVDSEENDGRRHVTVAATATATAEEKGNRKREMDESCMATIMQR 252
           GE V     +K+ +E   D    DG +            E++     EM+ESC+ TIM+R
Sbjct: 183 GEKVEDEEKLKINKECDGD---GDGGK------IEKEKGEDRCTVDMEMEESCLFTIMRR 242

Query: 253 MIAREVRNYIDSLRAHGGL 264
           MIA EVRN  D LRA  GL
Sbjct: 243 MIAEEVRNQFDGLRAQAGL 247

The following BLAST results are available for this feature:
Swiss-Prot
Match Name   E-value  Identity (%)  Description
MYB44_ARATH  7.0e-41  70.48         Transcription factor MYB44 OS=Arabidopsis thaliana GN=MYB44 PE=2 SV=1
MB3R1_ARATH  5.5e-30  46.90         Myb-related protein 3R-1 OS=Arabidopsis thaliana GN=MYB3R-1 PE=2 SV=1
MYBA_CHICK   5.5e-30  54.72         Myb-related protein A OS=Gallus gallus GN=MYBL1 PE=2 SV=1
MYB_MOUSE    7.2e-30  54.72         Transcriptional activator Myb OS=Mus musculus GN=Myb PE=1 SV=2
MYB_CHICK    7.2e-30  54.72         Transcriptional activator Myb OS=Gallus gallus GN=MYB PE=1 SV=1

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KQJ2_CUCSA  3.5e-116  77.47         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G209490 PE=4 SV=1
V4T705_9ROSI      1.3e-73   57.88         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032317mg PE=4 SV=1
A0A067LKB3_JATCU  6.9e-72   58.56         MYB family protein OS=Jatropha curcas GN=JCGZ_01234 PE=4 SV=1
B9HHT0_POPTR      3.8e-70   59.07         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s07100g PE=4 SV=1
A0A067FND1_CITSI  1.9e-69   58.69         Uncharacterized protein (Fragment) OS=Citrus sinensis GN=CISIN_1g046570mg PE=4 S...

TAIR10
Match Name   E-value  Identity (%)  Description
AT2G23290.1  1.6e-43  41.49         myb domain protein 70
AT4G37260.1  7.9e-43  46.51         myb domain protein 73
AT3G55730.1  2.3e-42  70.19         myb domain protein 109
AT5G67300.1  3.9e-42  70.48         myb domain protein r1
AT3G50060.1  5.1e-42  70.48         myb domain protein 77

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449460939|ref|XP_004148201.1|  5.0e-116  77.47         PREDICTED: transcription factor MYB44-like [Cucumis sativus]
gi|567886500|ref|XP_006435772.1|  1.8e-73   57.88         hypothetical protein CICLE_v10032317mg [Citrus clementina]
gi|802547651|ref|XP_012090668.1|  1.0e-71   58.56         PREDICTED: transcriptional activator Myb-like [Jatropha curcas]
gi|224098716|ref|XP_002311241.1|  5.5e-70   59.07         hypothetical protein POPTR_0008s07100g [Populus trichocarpa]
gi|641850046|gb|KDO68919.1|       2.7e-69   58.69         hypothetical protein CISIN_1g046570mg, partial [Citrus sinensis]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
Vocabulary: INTERPRO
Term       Definition
IPR017930  Myb_dom
IPR009057  Homeobox-like_sf
IPR001005  SANT/Myb
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0030154 cell differentiation
biological_process GO:0006357 regulation of transcription from RNA polymerase II promoter
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding
molecular_function GO:0000981 RNA polymerase II transcription factor activity, sequence-specific DNA binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0001135 transcription factor activity, RNA polymerase II transcription factor recruiting
molecular_function GO:0044212 transcription regulatory region DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g05420.1  Cp4.1LG01g05420.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term       Source Description                Alignment
IPR001005  SANT/Myb domain   PFAM     PF00249           Myb_DNA-binding                   coord: 75..117, score: 8.9E-15; coord: 21..66, score: 1.6
IPR001005  SANT/Myb domain   SMART    SM00717           sant                              coord: 72..120, score: 1.2E-14; coord: 20..69, score: 2.8
IPR009057  Homeodomain-like  GENE3D   G3DSA:1.10.10.60                                    coord: 22..74, score: 2.1E-24; coord: 75..121, score: 5.4
IPR009057  Homeodomain-like  unknown  SSF46689          Homeodomain-like                  coord: 18..114, score: 1.16
IPR017930  Myb domain        PROFILE  PS51294           HTH_MYB                           coord: 16..71, score: 26.326; coord: 72..122, score: 21
None       No IPR available  PANTHER  PTHR10641         MYB-LIKE DNA-BINDING PROTEIN MYB  coord: 1..121, score: 1.6
None       No IPR available  PANTHER  PTHR10641:SF553   SUBFAMILY NOT NAMED               coord: 1..121, score: 1.6

The following gene(s) are paralogous to this gene:

None