Cp4.1LG13g06810 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g06810
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: myb-like transcription factor family protein
Location: Cp4.1LG13 : 4437223 .. 4439784 (+)
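The location field can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and its regular expression are hypothetical helpers, not part of any database tool.

```python
import re

# Pattern for strings such as "Cp4.1LG13 : 4437223 .. 4439784 (+)".
# The layout is inferred from this record only.
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into sequence id, start, end, strand and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: both endpoints are included in the feature.
        "length": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("Cp4.1LG13 : 4437223 .. 4439784 (+)"))
```

Under the inclusive-coordinate assumption, the gene spans 2562 bp on the plus strand.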
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AGGTCAAAAAAGAATTTTGTAAATTAAAGTATTTAATAATTTTTTATTAAGTAAAAATAAAAAAAGAAAAAAAGAAAAGGGGGCGTTGGGTTATTGAGAAGGTCCATGAAAGAGTATATATTGGTTACACTACGCGACGGCCCGACCATGATATTTTAGTCTCTGCCTCCTCTGTGTCGCCTCCCATAAATTCTCCGTTGAATTTTCCAATTCTCTCTGTCTCTCTGTCTCTCTGGTTCTTTGTTTTGTTTGTTAGTTTGACGATCATGGAGCTTGAGCACCAAGTTTCTTCCTCCCAATGATTCTCTTTTAGATCCCTTTTTCAAGCCCCATTGCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCGATTTCATCTTCTTTCCCATTTTCCGCCAGAAACAGAGGTGTGACAATTATGGTTTACTCCTACAAAATGCAAGAAATTGCATCCACAATGGGCTTCACGCTCTCCGATTTCGCCGATACTTTGGAACAAGAACGCTCCAAAGTCCTCATGTTTCAGCGCGAGCTCCCTCTCTGTTTGCAGATCGTTTCCCATGGTATAAACCCATTAAACCCCCTTCTCTGTTTCTTCTTCTTCTTCTTCTTCTTCTGTTCTGATTTGGGTTTTTTTGCAGCCATTGATTGCTGCAGGCAGCACCTATCGGAGTCGACGACGGAGAATCGTCAATCCGAGTGCTCCGACCAGACTTCCAGCGACATGGGTCTTGTTCTTGAGGAGTTCATTCCAATTAATGGGGTTTTTGATTCTGAACGACCACGACACCTCCATGAAGCTCAAACAGAGAACAACAAGATGAACGATTCTGATTTAAACAATTTGAATTTGCCTCCCTCCGATTGGCTCAGATCTGCTCAGCTTTGGAATCAGACCTCAGATCCTCCTCCTCTGAGTCAGGTAAAATTTTCCCCCAATTTCGCACGGATTTTAAATTAAGTCTCGCCGGAAATTTATGCGTTATTGGGTCTGGTGGTGCAGGACACGCCGGAGAACACGACGGTTGTTGAGGTAAACAGAAATGGAGGTGCTTTCCAGCCATTTCAGAAGGAGAAAACCGGTAGTGGCGGAGGGATGATGTCGTCGTCTTCAGATCCGGTTCCGGCTGCGGAGACGGGGTTAGGTGGAAGCAGCCGACGGGAAGAGAAGGAAGCACAGAATCAGAGGAAACAGAGACGTTGCTGGTCTCCGGAGCTGCACCGGCGGTTCCTTCATGCGCTTCAGCAGCTCGGAGGCTCCCATGGTAAGGAAGAACATGAAAATTGAGTTTATTTTGAAAAAGGAATTGAATTGAATTAATTTATATGGTTTGAATTAAAATGGCAGTGGCGACGCCGAAGCAAATAAGGGAATTGATGAAGGTGGATGGTCTTACCAACGGCGAAGTCAAAAGTCATCTACAGAAGTATCGTCTACACACCAAATGCCCCACTCCGACAATCCACAACAACGAGACCGTCCAACCGCCCCAGTTCCTGGTGGTCGGCGGCATATGGGTACCGGCGTCCGACTACGCCACCACTTCCTCCAGAGAAGCAATCAGCGCCGCTACCACCAACGGAATATATGCACCGATGGTTGCAGCGGCGGCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCCGCCACCGCCGCCACCGAATGTAATTCTTCTACTACAT
CTTCCTCTACTCGTACCTCAGTTTCACCTGCTTCTTGAGCTTCGAAGGGTTGAAATTTTGTTTTTTTTTTTTTTTTTTTTTTTTACTTTTTTTCTACACTTTTTGATGTGTAAAAGTTTTGTTACACCATTTTTTAGAGGCAACTCAAATAATTTTTATTGGTAAAAAATCTATTTATTTAATTTCAAAGATTTAATTTAGATAATACTTAAAAATTATTATGAAATTAAATATAAATAAATACATTTATATATAAATTGTATTTTTAAAAATTAACAGTTTAATTCAATTAAAAGAATTAAATTATTGGAAATTGAAATTTTATAAATACAAAATAAAAATTAAGTATTTATAAAAGAAATATTCAAATATAAAATCATTACAAAAATCAAACAATAATTAAAAGTTTGATCTGAAAAATGAGAAAAAAAAAAAAGAATTAAAGCATTTTTATTGGATCTTTAAAAGAATTATGGCGTCTACAAAACTCAAGAAAATAATAATTCATATAAAGATAATTGAGTACATGCTTGGAATTACTTGTTTCAATTTGTTTTGTTAG

mRNA sequence

AGGTCAAAAAAGAATTTTGTAAATTAAAGTATTTAATAATTTTTTATTAAGTAAAAATAAAAAAAGAAAAAAAGAAAAGGGGGCGTTGGGTTATTGAGAAGGTCCATGAAAGAGTATATATTGGTTACACTACGCGACGGCCCGACCATGATATTTTAGTCTCTGCCTCCTCTGTGTCGCCTCCCATAAATTCTCCGTTGAATTTTCCAATTCTCTCTGTCTCTCTGTCTCTCTGGTTCTTTGTTTTGTTTGTTAGTTTGACGATCATGGAGCTTGAGCACCAAGTTTCTTCCTCCCAATGATTCTCTTTTAGATCCCTTTTTCAAGCCCCATTGCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCGATTTCATCTTCTTTCCCATTTTCCGCCAGAAACAGAGGTGTGACAATTATGGTTTACTCCTACAAAATGCAAGAAATTGCATCCACAATGGGCTTCACGCTCTCCGATTTCGCCGATACTTTGGAACAAGAACGCTCCAAAGTCCTCATGTTTCAGCGCGAGCTCCCTCTCTGTTTGCAGATCGTTTCCCATGCCATTGATTGCTGCAGGCAGCACCTATCGGAGTCGACGACGGAGAATCGTCAATCCGAGTGCTCCGACCAGACTTCCAGCGACATGGGTCTTGTTCTTGAGGAGTTCATTCCAATTAATGGGGTTTTTGATTCTGAACGACCACGACACCTCCATGAAGCTCAAACAGAGAACAACAAGATGAACGATTCTGATTTAAACAATTTGAATTTGCCTCCCTCCGATTGGCTCAGATCTGCTCAGCTTTGGAATCAGACCTCAGATCCTCCTCCTCTGAGTCAGGACACGCCGGAGAACACGACGGTTGTTGAGGTAAACAGAAATGGAGGTGCTTTCCAGCCATTTCAGAAGGAGAAAACCGGTAGTGGCGGAGGGATGATGTCGTCGTCTTCAGATCCGGTTCCGGCTGCGGAGACGGGGTTAGGTGGAAGCAGCCGACGGGAAGAGAAGGAAGCACAGAATCAGAGGAAACAGAGACGTTGCTGGTCTCCGGAGCTGCACCGGCGGTTCCTTCATGCGCTTCAGCAGCTCGGAGGCTCCCATGTGGCGACGCCGAAGCAAATAAGGGAATTGATGAAGGTGGATGGTCTTACCAACGGCGAAGTCAAAAGTCATCTACAGAAGTATCGTCTACACACCAAATGCCCCACTCCGACAATCCACAACAACGAGACCGTCCAACCGCCCCAGTTCCTGGTGGTCGGCGGCATATGGGTACCGGCGTCCGACTACGCCACCACTTCCTCCAGAGAAGCAATCAGCGCCGCTACCACCAACGGAATATATGCACCGATGGTTGCAGCGGCGGCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCCGCCACCGCCGCCACCGAATGTAATTCTTCTACTACATCTTCCTCTACTCGTACCTCAGTTTCACCTGCTTCTTGAGCTTCGAAGGGTTGAAATTTTGTTTTTTTTTTTTTTTTTTTTTTTTACTTTTTTTCTACACTTTTTGATGTGTAAAAGTTTTGTTACACCATTTTTTAGAGGCAACTCAAATAATTTTTATTGGTAAAAAATCTATTTATTTAATTTCAAAGATTTAATTTAGATAATACTTAAAAATTATTATGAAATTAAATATAAATAAAT
ACATTTATATATAAATTGTATTTTTAAAAATTAACAGTTTAATTCAATTAAAAGAATTAAATTATTGGAAATTGAAATTTTATAAATACAAAATAAAAATTAAGTATTTATAAAAGAAATATTCAAATATAAAATCATTACAAAAATCAAACAATAATTAAAAGTTTGATCTGAAAAATGAGAAAAAAAAAAAAGAATTAAAGCATTTTTATTGGATCTTTAAAAGAATTATGGCGTCTACAAAACTCAAGAAAATAATAATTCATATAAAGATAATTGAGTACATGCTTGGAATTACTTGTTTCAATTTGTTTTGTTAG

Coding sequence (CDS)

ATGGTTTACTCCTACAAAATGCAAGAAATTGCATCCACAATGGGCTTCACGCTCTCCGATTTCGCCGATACTTTGGAACAAGAACGCTCCAAAGTCCTCATGTTTCAGCGCGAGCTCCCTCTCTGTTTGCAGATCGTTTCCCATGCCATTGATTGCTGCAGGCAGCACCTATCGGAGTCGACGACGGAGAATCGTCAATCCGAGTGCTCCGACCAGACTTCCAGCGACATGGGTCTTGTTCTTGAGGAGTTCATTCCAATTAATGGGGTTTTTGATTCTGAACGACCACGACACCTCCATGAAGCTCAAACAGAGAACAACAAGATGAACGATTCTGATTTAAACAATTTGAATTTGCCTCCCTCCGATTGGCTCAGATCTGCTCAGCTTTGGAATCAGACCTCAGATCCTCCTCCTCTGAGTCAGGACACGCCGGAGAACACGACGGTTGTTGAGGTAAACAGAAATGGAGGTGCTTTCCAGCCATTTCAGAAGGAGAAAACCGGTAGTGGCGGAGGGATGATGTCGTCGTCTTCAGATCCGGTTCCGGCTGCGGAGACGGGGTTAGGTGGAAGCAGCCGACGGGAAGAGAAGGAAGCACAGAATCAGAGGAAACAGAGACGTTGCTGGTCTCCGGAGCTGCACCGGCGGTTCCTTCATGCGCTTCAGCAGCTCGGAGGCTCCCATGTGGCGACGCCGAAGCAAATAAGGGAATTGATGAAGGTGGATGGTCTTACCAACGGCGAAGTCAAAAGTCATCTACAGAAGTATCGTCTACACACCAAATGCCCCACTCCGACAATCCACAACAACGAGACCGTCCAACCGCCCCAGTTCCTGGTGGTCGGCGGCATATGGGTACCGGCGTCCGACTACGCCACCACTTCCTCCAGAGAAGCAATCAGCGCCGCTACCACCAACGGAATATATGCACCGATGGTTGCAGCGGCGGCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGATTCCTTCCTCCTCCGCCGCCGCAGCCATTACCCAGTACAGTTCAAAAGCCCAAGCCCAGGCCCATGA

Protein sequence

MVYSYKMQEIASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTENRQSECSDQTSSDMGLVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSAQLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPASDYATTSSREAISAATTNGIYAPMVAAAAPQPLPSTVQKPKPRPMIPSSSAAAAITQYSSKAQAQAHDSFLLRRRSHYPVQFKSPSPGP
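The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The sketch below assumes the standard genetic code and reading frame 1; `translate` is an illustrative helper written for this page, and only the first ten codons of the CDS are used as input.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the coding sequence listed above.
    print(translate("ATGGTTTACTCCTACAAAATGCAAGAAATT"))  # MVYSYKMQEI
```

The output matches the first ten residues of the protein sequence listed above.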
BLAST of Cp4.1LG13g06810 vs. Swiss-Prot
Match: EFM_ARATH (Myb family transcription factor EFM OS=Arabidopsis thaliana GN=EFM PE=1 SV=2)

HSP 1 Score: 153.3 bits (386), Expect = 5.3e-36
Identity = 109/289 (37.72%), Positives = 148/289 (51.21%), Query Frame = 1

Query: 18  LSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTENRQSECSDQTSSDM 77
           L D    LEQER K+  F+RELPLC+Q++++A++  +Q L      +  +  S  T    
Sbjct: 36  LEDLLSRLEQERLKIDAFKRELPLCMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTRP-- 95

Query: 78  GLVLEEFIPINGVFDSERPRHLHE-------AQTENNKMNDSDLNNLNLPPSDWLRSAQL 137
             VLEEFIP+    +    +  +        +Q+E    N     + +LP  +   S +L
Sbjct: 96  --VLEEFIPLRNQPEKTNNKGSNWMTTAQLWSQSETKPKNIDSTTDQSLPKDEINSSPKL 155

Query: 138 WNQTSD---------PPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGG--MMSSSS 197
            +  +          P    Q  PE     EV R      P  +   G  G    M ++ 
Sbjct: 156 GHFDAKQRNGSGAFLPFSKEQSLPELALSTEVKR----VSPTNEHTNGQDGNDESMINND 215

Query: 198 DPVPAAETGLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIREL 257
           +           S+      +Q+ RK RRCWSP+LHRRF+ ALQ LGGS VATPKQIREL
Sbjct: 216 NNYNNNNNNNSNSNGVSSTTSQSNRKARRCWSPDLHRRFVQALQMLGGSQVATPKQIREL 275

Query: 258 MKVDGLTNGEVKSHLQKYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVP 289
           MKVDGLTN EVKSHLQKYRLHT+ P+P+   +     P  +V+GGIWVP
Sbjct: 276 MKVDGLTNDEVKSHLQKYRLHTRRPSPSPQTSGG-PGPHLVVLGGIWVP 315
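The percentages in an HSP summary line are the match counts divided by the alignment length. As a quick check, the sketch below recomputes them for the EFM_ARATH hit; `percent` is a hypothetical helper, and rounding to two decimals is an assumption based on the figures shown.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


if __name__ == "__main__":
    # Counts taken from the EFM_ARATH HSP summary line above.
    print(percent(109, 289))  # identical positions: 37.72
    print(percent(148, 289))  # positive-scoring positions: 51.21
```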

BLAST of Cp4.1LG13g06810 vs. Swiss-Prot
Match: PHR1_ORYSI (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE=3 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 3.1e-12
Identity = 45/103 (43.69%), Positives = 55/103 (53.40%), Query Frame = 1

Query: 156 NGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLGGSSRREEKEAQNQRKQRRCWSPELH 215
           N  A QP   + T S  G +   + P P                  +  KQR  W+PELH
Sbjct: 180 NSAASQPAFNQSTSSHSGDICPVTSPPP-------------NNSNASASKQRMRWTPELH 239

Query: 216 RRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYR 259
             F+HA+ +LGGS  ATPK + +LMKVDGLT   VKSHLQKYR
Sbjct: 240 ESFVHAVNKLGGSEKATPKGVLKLMKVDGLTIYHVKSHLQKYR 269

BLAST of Cp4.1LG13g06810 vs. Swiss-Prot
Match: PHR1_ORYSJ (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 PE=2 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 3.1e-12
Identity = 45/103 (43.69%), Positives = 55/103 (53.40%), Query Frame = 1

Query: 156 NGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLGGSSRREEKEAQNQRKQRRCWSPELH 215
           N  A QP   + T S  G +   + P P                  +  KQR  W+PELH
Sbjct: 180 NSAASQPAFNQSTSSHSGDICPVTSPPP-------------NNSNASASKQRMRWTPELH 239

Query: 216 RRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYR 259
             F+HA+ +LGGS  ATPK + +LMKVDGLT   VKSHLQKYR
Sbjct: 240 ESFVHAVNKLGGSEKATPKGVLKLMKVDGLTIYHVKSHLQKYR 269

BLAST of Cp4.1LG13g06810 vs. Swiss-Prot
Match: PHL4_ARATH (Myb family transcription factor PHL4 OS=Arabidopsis thaliana GN=PHL4 PE=2 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 7.7e-11
Identity = 35/63 (55.56%), Positives = 43/63 (68.25%), Query Frame = 1

Query: 205 KQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCP 264
           K R  W+PELH  F+ A+ QLGGS+ ATPK + + MKV+GLT   VKSHLQKYR     P
Sbjct: 231 KGRMRWTPELHEVFVDAVNQLGGSNEATPKGVLKHMKVEGLTIFHVKSHLQKYRTAKYIP 290

Query: 265 TPT 268
            P+
Sbjct: 291 VPS 293

BLAST of Cp4.1LG13g06810 vs. Swiss-Prot
Match: PCL1_ARATH (Transcription factor LUX OS=Arabidopsis thaliana GN=LUX PE=1 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 2.9e-10
Identity = 52/152 (34.21%), Positives = 76/152 (50.00%), Query Frame = 1

Query: 147 NTTVVEVNRNGGAF--QPFQKEKTGSGGGMMSSSSDP--VPAAETGLGGSSRREEKEAQN 206
           N  V E +R G +      +K+KT +G G      DP    AAE G  G+   E+   + 
Sbjct: 85  NNNVEEEDRVGSSSPGSDSKKQKTSNGDGDDGGGVDPDSAMAAEEGDSGT---EDLSGKT 144

Query: 207 QRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYRLHTK 266
            ++ R  W+P+LH+RF+  +  LG  + A PK I +LM V+GLT   V SHLQKYRL+ K
Sbjct: 145 LKRPRLVWTPQLHKRFVDVVAHLGIKN-AVPKTIMQLMNVEGLTRENVASHLQKYRLYLK 204

Query: 267 ----------CPTPTIHNNETVQPPQFLVVGG 285
                       +  + ++  V P  F  +GG
Sbjct: 205 RMQGLTNEGPSASDKLFSSTPVPPQSFQDIGG 232

BLAST of Cp4.1LG13g06810 vs. TrEMBL
Match: A0A0A0KAT4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G031440 PE=4 SV=1)

HSP 1 Score: 519.2 bits (1336), Expect = 4.1e-144
Identity = 283/356 (79.49%), Positives = 302/356 (84.83%), Query Frame = 1

Query: 1   MVYSYKMQEIASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSES 60
           M+YS KMQ+IA+ MGFTLSDFADTLEQER KVLMFQRELPLCL +VSHAIDCCRQ LS +
Sbjct: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60

Query: 61  TTENRQSECSDQTSSDMGLVLEEFIPIN--GVFDSERPRHLHEAQTENNKMNDSDLNNLN 120
           TTENRQSECS+QTSSDMG VLEEFIPIN  GV D E+         +NNK +DSDLNNLN
Sbjct: 61  TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTE-------KNNKNHDSDLNNLN 120

Query: 121 LPPSDWLRSAQLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGM--MS 180
           L PSDWLRSAQLWNQTSDPPPL+QD PENT VVEVNRNGGAF+PFQKEKTG GGG    S
Sbjct: 121 LAPSDWLRSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGAS 180

Query: 181 SSSDPVPAAET------GLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV 240
           SSS P PAAET      G GGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV
Sbjct: 181 SSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV 240

Query: 241 ATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPAS 300
           ATPKQIRELMKVDGLTN EVKSHLQKYRLHT+ PTPTIHNNE    PQFLVVGGIWVPA+
Sbjct: 241 ATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAA 300

Query: 301 DYA----TTSSREAISAATTNGIYAPMVAAAAPQPLPSTVQKPKPRP---MIPSSS 340
           +YA    TTSS E +SAATTNGIYAP+VAAAAPQPL STVQKPKP+P   +IPSS+
Sbjct: 301 EYAAVSTTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSA 349

BLAST of Cp4.1LG13g06810 vs. TrEMBL
Match: D7T987_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g03110 PE=4 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 4.3e-85
Identity = 186/328 (56.71%), Positives = 221/328 (67.38%), Query Frame = 1

Query: 20  DFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTE--NRQSECSDQTSSDM 79
           D+ + LE+ER K+ +FQRELPLCL++VS AI+ CRQ +S +T E  + QSECS+QTSSD 
Sbjct: 6   DYIEALEEERRKIQVFQRELPLCLELVSQAIESCRQQMSGTTQEYFHGQSECSEQTSSD- 65

Query: 80  GLVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSAQLWNQTSDP 139
           G VLEEFIPI    D E  +  H+     +K ND          SDWLRS QLWNQT DP
Sbjct: 66  GPVLEEFIPIKKTSDDEDEQQSHQPNDNKDKNNDKSGKK-----SDWLRSVQLWNQTPDP 125

Query: 140 PPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLGGSS--RR 199
           P + +DTP+    +EV +NGGAF PF+++K        + S+     AET  G SS  R+
Sbjct: 126 P-VKEDTPKKIPSMEVKKNGGAFHPFKRDKAVGTNPTSAPSAATSSTAETATGCSSGSRK 185

Query: 200 EEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQ 259
           EEKE Q+QRK RRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTN EVKSHLQ
Sbjct: 186 EEKEGQSQRKARRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQ 245

Query: 260 KYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPASDY----ATTSSREAISAATTNGIYA 319
           KYRLHT+ P P I +N   Q PQF+VVGGIWVP  +Y    ATTSS EA    T NGIYA
Sbjct: 246 KYRLHTRRPNPAIQHNGNPQAPQFVVVGGIWVPPPEYTAVAATTSSGEATGVTTANGIYA 305

Query: 320 PMVAAAAPQPLPSTVQKPKPRPMIPSSS 340
           P+ +     P  ST    + +PM P  S
Sbjct: 306 PVASVPPSHPQGST---QRQQPMKPKKS 323

BLAST of Cp4.1LG13g06810 vs. TrEMBL
Match: M5XS96_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007222mg PE=4 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 2.1e-84
Identity = 189/343 (55.10%), Positives = 230/343 (67.06%), Query Frame = 1

Query: 11  ASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTE--NRQSE 70
           AS +GF   D+   LE+ER K+ +FQRELPLCL++V+ AI+ C+Q LS++TT+  + QSE
Sbjct: 3   ASRLGFR--DYVKALEEERHKIQVFQRELPLCLELVTQAIERCKQQLSDTTTDYMHGQSE 62

Query: 71  CSDQTSSDMGLVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSA 130
           CS+QTSS+ G V EEFIP+     S+        +++  K ND D  N +   SDWLRSA
Sbjct: 63  CSEQTSSE-GHVFEEFIPLKRTSSSDSDDD-EVQESQEPKTNDKDKTNGDKIKSDWLRSA 122

Query: 131 QLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKT-GSGGGMMSSSSDPVPAAET 190
           QLWN T DPP L  + P    V+EV RNGGAFQPFQ+EK+ G     ++      PA  +
Sbjct: 123 QLWNTTPDPP-LKDELPRKALVMEVKRNGGAFQPFQREKSVGKTNRPVAKVPASAPATSS 182

Query: 191 -------GLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELM 250
                  G G S ++EEK+ Q QRKQRR WSPELHRRFLHALQQLGGSH ATPKQIRELM
Sbjct: 183 TTDTVSGGSGESHKKEEKDGQGQRKQRRNWSPELHRRFLHALQQLGGSHAATPKQIRELM 242

Query: 251 KVDGLTNGEVKSHLQKYRLHTKCPTPTIHNN----ETVQPPQFLVVGGIWVPASDY---- 310
           KVDGLTN EVKSHLQKYRLHT+ PTPT+HNN       Q PQFLVVGGIWVP  DY    
Sbjct: 243 KVDGLTNDEVKSHLQKYRLHTRRPTPTMHNNNNSDNNTQAPQFLVVGGIWVPPQDYAAVA 302

Query: 311 ATTSSREAISAATTNGIYAPMV---AAAAPQPLPSTVQKPKPR 333
           ATT+S EA   A  NGIYAP+    +   P   PS +Q+P+P+
Sbjct: 303 ATTASGEATRVAAANGIYAPVATSPSTVTPVSPPSLMQRPRPK 340

BLAST of Cp4.1LG13g06810 vs. TrEMBL
Match: B9ST36_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_0353920 PE=4 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 7.6e-82
Identity = 186/367 (50.68%), Positives = 229/367 (62.40%), Query Frame = 1

Query: 1   MVYSYKMQEIASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSES 60
           M Y+ KMQ           ++ + LE+E+ K+ +FQRELPLCL++V+ AI+ C++ LS +
Sbjct: 1   MDYAEKMQRC--------HEYVEALEEEKRKIQVFQRELPLCLELVTQAIEACKRELSGT 60

Query: 61  TTE--NRQSECSDQTSSDMG---------LVLEEFIPINGVFDSERPRHLHEAQTENNKM 120
           TTE  + QSECS+QT+S  G         LVLEEFIPI  +  S    + ++   EN K 
Sbjct: 61  TTEYMHGQSECSEQTTSTDGTANGTGTRSLVLEEFIPIKRINSSSHNDNDNDDDNENEKE 120

Query: 121 NDS------------------DLNNLNLPPSDWLRSAQLWNQTS-DPPPLSQDTPENTTV 180
           ++                   D+NN     SDWLRS QLWNQ+S D  P  +D P    V
Sbjct: 121 DNDDDEEEEDQDSHKPNKSIRDINNDQKKKSDWLRSVQLWNQSSPDSEPPKEDLPRKAAV 180

Query: 181 VEVNRNGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLG------GSSRREEKEAQNQR 240
            EV RNGGAFQPF KEK  +       +S    +AETG G      G++R+E+K+ Q QR
Sbjct: 181 TEVKRNGGAFQPFHKEKGIAKTPPSVPASATSSSAETGTGGGTSGAGNNRKEDKDGQAQR 240

Query: 241 KQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCP 300
           KQRRCWSPELHRRFLHALQQLGGSH ATPKQIRELMKVDGLTN EVKSHLQKYRLHT+ P
Sbjct: 241 KQRRCWSPELHRRFLHALQQLGGSHAATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRP 300

Query: 301 TPTIHNNETVQPPQFLVVGGIWVPASDY----ATTSSREAISAATTNGIYAPMVAAAAPQ 328
           +PTIHNN   Q PQF+VVGGIWVP  +Y    ATT+S E ++ A  NGIYAP+ A     
Sbjct: 301 SPTIHNNSNPQAPQFVVVGGIWVPPPEYAAVAATTASMETVTTAAANGIYAPVAAPLGTI 359

BLAST of Cp4.1LG13g06810 vs. TrEMBL
Match: A0A067KAT1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18291 PE=4 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 7.6e-82
Identity = 189/362 (52.21%), Positives = 233/362 (64.36%), Query Frame = 1

Query: 20  DFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTE--NRQSECSDQTSSDM 79
           ++ + LE+ER K+ +FQRELPLCL++V+ AI+ C++ LS +TTE  + QSECS QTSSD 
Sbjct: 12  EYVEALEEERRKIQVFQRELPLCLELVTQAIEACKRELSGTTTEYMHGQSECSVQTSSD- 71

Query: 80  GL-----VLEEFIPINGVFDS--------------------ERPRHLHEAQTENNKMNDS 139
           G+     VLEEFIPI     S                    ++  H H  + + N + D 
Sbjct: 72  GIATRPPVLEEFIPIKRTHSSSDNDYEEEEEEDDDDDNENDDKEHHSHNKRNDKN-IKDK 131

Query: 140 DLNNL--NLPPSDWLRSAQLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKT-G 199
           D N+   +   SDWLRS QLWNQ++D PPL +D P    V EV RNGGAFQPFQKEK+ G
Sbjct: 132 DKNSSADHKKKSDWLRSVQLWNQSAD-PPLKEDLPRKIAVSEVKRNGGAFQPFQKEKSVG 191

Query: 200 SGGGMMSSSSDPVPAAET-----------------GLGGSSRREEKEAQNQRKQRRCWSP 259
                ++ +  PVPA+ T                 G GGS +++EKE+ +QRKQRRCWSP
Sbjct: 192 KNNQTITKTPSPVPASATSSTEETKTGGTGNGSGNGGGGSGKKDEKES-SQRKQRRCWSP 251

Query: 260 ELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCPTPTIHNNE 319
           ELHRRFLHALQQLGGSH ATPKQIRELMKVDGLTN EVKSHLQKYRLHT+ P+PTIHNN 
Sbjct: 252 ELHRRFLHALQQLGGSHAATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPSPTIHNNN 311

Query: 320 TVQPPQFLVVGGIWVPASDYA------TTSSREAISAATTNGIYAPMVAAAAPQPLPSTV 329
             Q PQF+VVGGIWVP +DYA      TT+S E  + A  NG+YAP+  A+ P      V
Sbjct: 312 NPQAPQFVVVGGIWVPPADYATVAAGTTTTSGETATIAAANGLYAPV--ASRPPASSHLV 367

BLAST of Cp4.1LG13g06810 vs. TAIR10
Match: AT1G25550.1 (AT1G25550.1 myb-like transcription factor family protein)

HSP 1 Score: 257.3 bits (656), Expect = 1.5e-68
Identity = 156/312 (50.00%), Positives = 200/312 (64.10%), Query Frame = 1

Query: 20  DFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTE-NRQSECSDQTSSDMG 79
           ++ + LE+E+ K+ +FQRELPLCL++V+ AI+ CR+ LSES+     QSECS++T+S+ G
Sbjct: 20  EYVEALEEEQKKIQVFQRELPLCLELVTQAIESCRKELSESSEHVGGQSECSERTTSECG 79

Query: 80  -LVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSAQLWNQTSDP 139
             V EEF+PI     S       E + E  +M  ++ N+ +   SDWLRS QLWNQ+ DP
Sbjct: 80  GAVFEEFMPIKWSSASSDETDKDE-EAEKTEMMTNENNDGDKKKSDWLRSVQLWNQSPDP 139

Query: 140 PPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMM-------SSSSDPVPAAETGLG 199
            P ++       V+EV R+ GAFQPFQKEK  +            +S++     AET  G
Sbjct: 140 QPNNK----KPMVIEVKRSAGAFQPFQKEKPKAADSQPLIKAITPTSTTTTSSTAETVGG 199

Query: 200 GSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEV 259
           G    E+K++ + RKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR+LMKVDGLTN EV
Sbjct: 200 GKEFEEQKQSHSNRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRDLMKVDGLTNDEV 259

Query: 260 KSHLQKYRLHTKCP-TPTIH-NNETVQPPQFLVVGGIWVPASDYATTSSREAISAATTNG 319
           KSHLQKYRLHT+ P TP +    E  Q  QF+V+ GIWVP+ D             T N 
Sbjct: 260 KSHLQKYRLHTRRPATPVVRTGGENPQQRQFMVMEGIWVPSHD------------TTNNR 313

Query: 320 IYAPMVAAAAPQ 321
           +YAP VA   PQ
Sbjct: 320 VYAP-VATQPPQ 313

BLAST of Cp4.1LG13g06810 vs. TAIR10
Match: AT1G68670.1 (AT1G68670.1 myb-like transcription factor family protein)

HSP 1 Score: 238.4 bits (607), Expect = 7.1e-63
Identity = 156/342 (45.61%), Positives = 205/342 (59.94%), Query Frame = 1

Query: 1   MVYSYKMQEIASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSES 60
           M Y+ KMQ+          ++ + LE+E+ K+ +FQRELPLCL++V+ AI+ CR+ LS +
Sbjct: 5   MDYAKKMQKC--------HEYVEALEEEQKKIQVFQRELPLCLELVTQAIEACRKELSGT 64

Query: 61  TTENRQSECSDQTSSDMG-LVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNL 120
           TT   + +CS+QT+S  G  V EEFIPI  +  S     + E + E+ + ++S    +N 
Sbjct: 65  TTTTSE-QCSEQTTSVCGGPVFEEFIPIKKI--SSLCEEVQEEEEEDGE-HESSPELVNN 124

Query: 121 PPSDWLRSAQLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMMSSSS 180
             SDWLRS QLWN + D  P  +   +   VVEV    GAFQPFQK    +        +
Sbjct: 125 KKSDWLRSVQLWNHSPDLNPKEERVAKKAKVVEVKPKSGAFQPFQKRVLETDLQPAVKVA 184

Query: 181 DPVPAAETG-----LGGSS----------RREEKEAQNQ--RKQRRCWSPELHRRFLHAL 240
             +PA  T       GG S          R E++++Q+   RKQRRCWSPELHRRFL+AL
Sbjct: 185 SSMPATTTSSTTETCGGKSDLIKAGDEERRIEQQQSQSHTHRKQRRCWSPELHRRFLNAL 244

Query: 241 QQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCPTPT---IHNNETVQPPQF 300
           QQLGGSHVATPKQIR+ MKVDGLTN EVKSHLQKYRLHT+ P  T     +    Q PQF
Sbjct: 245 QQLGGSHVATPKQIRDHMKVDGLTNDEVKSHLQKYRLHTRRPAATSVAAQSTGNQQQPQF 304

Query: 301 LVVGGIWVPAS-DYATTSSREAISAATTNGIYAPMVAAAAPQ 321
           +VVGGIWVP+S D+   S       A   G+YAP+  A +P+
Sbjct: 305 VVVGGIWVPSSQDFPPPS-----DVANKGGVYAPVAVAQSPK 329

BLAST of Cp4.1LG13g06810 vs. TAIR10
Match: AT1G13300.1 (AT1G13300.1 myb-like transcription factor family protein)

HSP 1 Score: 232.6 bits (592), Expect = 3.9e-61
Identity = 139/307 (45.28%), Positives = 177/307 (57.65%), Query Frame = 1

Query: 21  FADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTENR--QSECSDQTSSDMG 80
           + + LE+ER K+ +FQRELPLCL +V+ AI+ C++ L E TTEN   Q ECS+QT+ + G
Sbjct: 20  YIEALEEERRKIHVFQRELPLCLDLVTQAIEACKRELPEMTTENMYGQPECSEQTTGECG 79

Query: 81  LVLEEFIPI--NGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSAQLWNQTSD 140
            VLE+F+ I  +   + E      +    ++  NDS+  N     SDWL+S QLWNQ   
Sbjct: 80  PVLEQFLTIKDSSTSNEEEDEEFDDEHGNHDPDNDSEDKNTK---SDWLKSVQLWNQPDH 139

Query: 141 PPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLGGSSRRE 200
           P    ++  +  T+          +  +K+   +GG                  G  R  
Sbjct: 140 PLLPKEERLQQETMTR-------DESMRKDPMVNGG-----------------EGRKREA 199

Query: 201 EKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQK 260
           EK+    RKQRRCWS +LHRRFL+ALQ LGG HVATPKQIRE MKVDGLTN EVKSHLQK
Sbjct: 200 EKDGGGGRKQRRCWSSQLHRRFLNALQHLGGPHVATPKQIREFMKVDGLTNDEVKSHLQK 259

Query: 261 YRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPASDYA---TTSSREAISAATTNGIYAPM 320
           YRLHT+ P  T+ NN   Q   F+VVGG+WVP SDY+   TT      S  TT GIY  M
Sbjct: 260 YRLHTRRPRQTVPNNGNSQTQHFVVVGGLWVPQSDYSTGKTTGGATTSSTTTTTGIYGTM 299

BLAST of Cp4.1LG13g06810 vs. TAIR10
Match: AT3G25790.1 (AT3G25790.1 myb-like transcription factor family protein)

HSP 1 Score: 222.6 bits (566), Expect = 4.0e-58
Identity = 132/307 (43.00%), Positives = 193/307 (62.87%), Query Frame = 1

Query: 20  DFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTENR--QSECSDQTSSDM 79
           ++ + LE+ER K+ +FQRELPLC+++V+ AI+  ++ +S ++T+N   QSECS+QT+ + 
Sbjct: 20  EYIEALEEERRKINVFQRELPLCVELVTQAIEAYKREISGTSTDNLYGQSECSEQTTGEC 79

Query: 80  GLVLEEFIPI---NGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSAQLWNQT 139
           G +L+ FIPI   +   + E      + +   +   D D ++ N+  S+WL+S QLWNQ+
Sbjct: 80  GRILDLFIPIKHSSTSIEEEVDDKDDDDEEHQSHETDIDFDDKNMK-SEWLKSVQLWNQS 139

Query: 140 -----SDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGL 199
                ++    SQ+  E  T+VE+ +          E       + S    PV  ++ G 
Sbjct: 140 DAVVSNNRQDRSQEKTE--TLVELIK-------INDEAAKKNNNIKS----PVTTSDGGS 199

Query: 200 GGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGE 259
           GG   R     + QRK RRCWS ELHRRFL+AL+QLGG HVATPKQIR++MKVDGLTN E
Sbjct: 200 GGGGGR-----RGQRKNRRCWSQELHRRFLNALKQLGGPHVATPKQIRDIMKVDGLTNDE 259

Query: 260 VKSHLQKYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPASDYATTSSREAISAATTNGI 317
           VKSHLQKYRLH + P+ T  NN   Q   F+VVGGIWVP ++++T ++  A+++  T GI
Sbjct: 260 VKSHLQKYRLHARRPSQTTPNNRNSQTQHFVVVGGIWVPQTNHSTANAVNAVASGETTGI 307

BLAST of Cp4.1LG13g06810 vs. TAIR10
Match: AT2G03500.1 (AT2G03500.1 Homeodomain-like superfamily protein)

HSP 1 Score: 153.3 bits (386), Expect = 3.0e-37
Identity = 109/289 (37.72%), Positives = 148/289 (51.21%), Query Frame = 1

Query: 18  LSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTENRQSECSDQTSSDM 77
           L D    LEQER K+  F+RELPLC+Q++++A++  +Q L      +  +  S  T    
Sbjct: 36  LEDLLSRLEQERLKIDAFKRELPLCMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTRP-- 95

Query: 78  GLVLEEFIPINGVFDSERPRHLHE-------AQTENNKMNDSDLNNLNLPPSDWLRSAQL 137
             VLEEFIP+    +    +  +        +Q+E    N     + +LP  +   S +L
Sbjct: 96  --VLEEFIPLRNQPEKTNNKGSNWMTTAQLWSQSETKPKNIDSTTDQSLPKDEINSSPKL 155

Query: 138 WNQTSD---------PPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGG--MMSSSS 197
            +  +          P    Q  PE     EV R      P  +   G  G    M ++ 
Sbjct: 156 GHFDAKQRNGSGAFLPFSKEQSLPELALSTEVKR----VSPTNEHTNGQDGNDESMINND 215

Query: 198 DPVPAAETGLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIREL 257
           +           S+      +Q+ RK RRCWSP+LHRRF+ ALQ LGGS VATPKQIREL
Sbjct: 216 NNYNNNNNNNSNSNGVSSTTSQSNRKARRCWSPDLHRRFVQALQMLGGSQVATPKQIREL 275

Query: 258 MKVDGLTNGEVKSHLQKYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVP 289
           MKVDGLTN EVKSHLQKYRLHT+ P+P+   +     P  +V+GGIWVP
Sbjct: 276 MKVDGLTNDEVKSHLQKYRLHTRRPSPSPQTSGG-PGPHLVVLGGIWVP 315

BLAST of Cp4.1LG13g06810 vs. NCBI nr
Match: gi|659089682|ref|XP_008445641.1| (PREDICTED: probable transcription factor GLK2 [Cucumis melo])

HSP 1 Score: 526.9 bits (1356), Expect = 2.8e-146
Identity = 289/373 (77.48%), Positives = 310/373 (83.11%), Query Frame = 1

Query: 1   MVYSYKMQEIASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSES 60
           MVYS KMQEIA+ MGFTLSDFADTLEQER KVLMFQRELPLCLQ+VSHAIDCCRQ LS +
Sbjct: 1   MVYSDKMQEIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLQLVSHAIDCCRQQLSGT 60

Query: 61  TTENRQSECSDQTSSDMGLVLEEFIPIN--GVFDSERPRHLHEAQTENNKMNDSDLNNLN 120
           TTENRQSECS+QTSSD+G VLEEFIPIN  GV D E+   +       NK +D DLNNLN
Sbjct: 61  TTENRQSECSEQTSSDIGPVLEEFIPINRNGVSDFEKTEKI-------NKNDDPDLNNLN 120

Query: 121 LPPSDWLRSAQLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGS--GGGMMS 180
           L PSDWLRSAQLWNQTSDPPPL+QD PENT VVEVNRNGGAF+PFQKEKTG   GGG  S
Sbjct: 121 LAPSDWLRSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGCGGGGGAS 180

Query: 181 SSSDPVPAAET------GLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV 240
           SSS P PAAET      G GGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV
Sbjct: 181 SSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV 240

Query: 241 ATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPAS 300
           ATPKQIRELMKVDGLTN EVKSHLQKYRLHT+ PTPTIHNNE+   PQFLVVGGIWVPA+
Sbjct: 241 ATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNESGHTPQFLVVGGIWVPAA 300

Query: 301 DYA----TTSSREAISAATTNGIYAPMVAAAAPQPLPSTVQKPKPRP-MIPSSSAAAAIT 359
           +YA    TTSS E +SAATTNGIYAP+VAAAAPQPL STVQKPKP+P +IPSS+A AA+ 
Sbjct: 301 EYAAVSTTTSSGEVVSAATTNGIYAPVVAAAAPQPLASTVQKPKPKPKIIPSSAAVAAVE 360

BLAST of Cp4.1LG13g06810 vs. NCBI nr
Match: gi|778710070|ref|XP_011656513.1| (PREDICTED: probable transcription factor GLK2 [Cucumis sativus])

HSP 1 Score: 519.2 bits (1336), Expect = 5.9e-144
Identity = 283/356 (79.49%), Positives = 302/356 (84.83%), Query Frame = 1

Query: 1   MVYSYKMQEIASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSES 60
           M+YS KMQ+IA+ MGFTLSDFADTLEQER KVLMFQRELPLCL +VSHAIDCCRQ LS +
Sbjct: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60

Query: 61  TTENRQSECSDQTSSDMGLVLEEFIPIN--GVFDSERPRHLHEAQTENNKMNDSDLNNLN 120
           TTENRQSECS+QTSSDMG VLEEFIPIN  GV D E+         +NNK +DSDLNNLN
Sbjct: 61  TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTE-------KNNKNHDSDLNNLN 120

Query: 121 LPPSDWLRSAQLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGM--MS 180
           L PSDWLRSAQLWNQTSDPPPL+QD PENT VVEVNRNGGAF+PFQKEKTG GGG    S
Sbjct: 121 LAPSDWLRSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGAS 180

Query: 181 SSSDPVPAAET------GLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV 240
           SSS P PAAET      G GGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV
Sbjct: 181 SSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHV 240

Query: 241 ATPKQIRELMKVDGLTNGEVKSHLQKYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPAS 300
           ATPKQIRELMKVDGLTN EVKSHLQKYRLHT+ PTPTIHNNE    PQFLVVGGIWVPA+
Sbjct: 241 ATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAA 300

Query: 301 DYA----TTSSREAISAATTNGIYAPMVAAAAPQPLPSTVQKPKPRP---MIPSSS 340
           +YA    TTSS E +SAATTNGIYAP+VAAAAPQPL STVQKPKP+P   +IPSS+
Sbjct: 301 EYAAVSTTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSA 349

BLAST of Cp4.1LG13g06810 vs. NCBI nr
Match: gi|645276211|ref|XP_008243179.1| (PREDICTED: probable transcription factor GLK2 [Prunus mume])

HSP 1 Score: 324.7 bits (831), Expect = 2.1e-85
Identity = 191/350 (54.57%), Positives = 236/350 (67.43%), Query Frame = 1

Query: 11  ASTMGFTLSDFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTE--NRQSE 70
           AS +GF   D+   LE+ER K+ +FQRELPLCL++V+ AI+ C+Q LS++TT+  + QSE
Sbjct: 9   ASRLGFR--DYVKALEEERHKIQVFQRELPLCLELVTQAIERCKQQLSDTTTDYMHGQSE 68

Query: 71  CSDQTSSDMGLVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSA 130
           CS+QTSS+ GLV EEFIP+     S+        +++  K+ND D  N +   SDWLRSA
Sbjct: 69  CSEQTSSE-GLVFEEFIPLKRTSSSDSDDD-EVQESQEPKINDKDKTNGDKKKSDWLRSA 128

Query: 131 QLWNQTSDPPPLSQDTPENTTVVEVNRNGGAFQPFQKEKT-GSGGGMMSSSSDPVPAAET 190
           QLWN T DPP L ++ P   +V+EV RNGGAFQPFQ+EK+ G     ++      PA  +
Sbjct: 129 QLWNTTPDPP-LKEELPRKASVMEVKRNGGAFQPFQREKSVGKTNRPVAKVPASAPATSS 188

Query: 191 -------GLGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELM 250
                  G G S ++EEK+ Q QRKQRR WSPELHRRFLHALQQLGGSH ATPKQIRELM
Sbjct: 189 TTDTVSGGSGESLKKEEKDGQGQRKQRRNWSPELHRRFLHALQQLGGSHAATPKQIRELM 248

Query: 251 KVDGLTNGEVKSHLQKYRLHTKCPTPTIHN----NETVQPPQFLVVGGIWVPASDYAT-- 310
           KVDGLTN EVKSHLQKYRLHT+ PTPT+HN    N   Q PQFLVVGGIWVP  DYA   
Sbjct: 249 KVDGLTNDEVKSHLQKYRLHTRRPTPTMHNNNNSNSNTQAPQFLVVGGIWVPPQDYAAVA 308

Query: 311 --TSSREAISAATTNGIYAPMV---AAAAPQPLPSTVQKPKPRPMIPSSS 340
             T+S EA   A  NGIYAP+    +   P   PS +Q+P+P+  + S S
Sbjct: 309 APTASGEAARVAAANGIYAPVATSPSTVTPVSPPSLMQRPRPKRPVSSHS 353

BLAST of Cp4.1LG13g06810 vs. NCBI nr
Match: gi|359472981|ref|XP_003631224.1| (PREDICTED: probable transcription factor GLK2 [Vitis vinifera])

HSP 1 Score: 323.2 bits (827), Expect = 6.2e-85
Identity = 186/328 (56.71%), Positives = 221/328 (67.38%), Query Frame = 1

Query: 20  DFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTE--NRQSECSDQTSSDM 79
           D+ + LE+ER K+ +FQRELPLCL++VS AI+ CRQ +S +T E  + QSECS+QTSSD 
Sbjct: 12  DYIEALEEERRKIQVFQRELPLCLELVSQAIESCRQQMSGTTQEYFHGQSECSEQTSSD- 71

Query: 80  GLVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSAQLWNQTSDP 139
           G VLEEFIPI    D E  +  H+     +K ND          SDWLRS QLWNQT DP
Sbjct: 72  GPVLEEFIPIKKTSDDEDEQQSHQPNDNKDKNNDKSGKK-----SDWLRSVQLWNQTPDP 131

Query: 140 PPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLGGSS--RR 199
           P + +DTP+    +EV +NGGAF PF+++K        + S+     AET  G SS  R+
Sbjct: 132 P-VKEDTPKKIPSMEVKKNGGAFHPFKRDKAVGTNPTSAPSAATSSTAETATGCSSGSRK 191

Query: 200 EEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQ 259
           EEKE Q+QRK RRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTN EVKSHLQ
Sbjct: 192 EEKEGQSQRKARRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQ 251

Query: 260 KYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPASDY----ATTSSREAISAATTNGIYA 319
           KYRLHT+ P P I +N   Q PQF+VVGGIWVP  +Y    ATTSS EA    T NGIYA
Sbjct: 252 KYRLHTRRPNPAIQHNGNPQAPQFVVVGGIWVPPPEYTAVAATTSSGEATGVTTANGIYA 311

Query: 320 PMVAAAAPQPLPSTVQKPKPRPMIPSSS 340
           P+ +     P  ST    + +PM P  S
Sbjct: 312 PVASVPPSHPQGST---QRQQPMKPKKS 329

BLAST of Cp4.1LG13g06810 vs. NCBI nr
Match: gi|297737857|emb|CBI27058.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 323.2 bits (827), Expect = 6.2e-85
Identity = 186/328 (56.71%), Positives = 221/328 (67.38%), Query Frame = 1

Query: 20  DFADTLEQERSKVLMFQRELPLCLQIVSHAIDCCRQHLSESTTE--NRQSECSDQTSSDM 79
           D+ + LE+ER K+ +FQRELPLCL++VS AI+ CRQ +S +T E  + QSECS+QTSSD 
Sbjct: 6   DYIEALEEERRKIQVFQRELPLCLELVSQAIESCRQQMSGTTQEYFHGQSECSEQTSSD- 65

Query: 80  GLVLEEFIPINGVFDSERPRHLHEAQTENNKMNDSDLNNLNLPPSDWLRSAQLWNQTSDP 139
           G VLEEFIPI    D E  +  H+     +K ND          SDWLRS QLWNQT DP
Sbjct: 66  GPVLEEFIPIKKTSDDEDEQQSHQPNDNKDKNNDKSGKK-----SDWLRSVQLWNQTPDP 125

Query: 140 PPLSQDTPENTTVVEVNRNGGAFQPFQKEKTGSGGGMMSSSSDPVPAAETGLGGSS--RR 199
           P + +DTP+    +EV +NGGAF PF+++K        + S+     AET  G SS  R+
Sbjct: 126 P-VKEDTPKKIPSMEVKKNGGAFHPFKRDKAVGTNPTSAPSAATSSTAETATGCSSGSRK 185

Query: 200 EEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNGEVKSHLQ 259
           EEKE Q+QRK RRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTN EVKSHLQ
Sbjct: 186 EEKEGQSQRKARRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQ 245

Query: 260 KYRLHTKCPTPTIHNNETVQPPQFLVVGGIWVPASDY----ATTSSREAISAATTNGIYA 319
           KYRLHT+ P P I +N   Q PQF+VVGGIWVP  +Y    ATTSS EA    T NGIYA
Sbjct: 246 KYRLHTRRPNPAIQHNGNPQAPQFVVVGGIWVPPPEYTAVAATTSSGEATGVTTANGIYA 305

Query: 320 PMVAAAAPQPLPSTVQKPKPRPMIPSSS 340
           P+ +     P  ST    + +PM P  S
Sbjct: 306 PVASVPPSHPQGST---QRQQPMKPKKS 323
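
The percentages on each HSP summary line follow directly from the match counts divided by the alignment length. The sketch below recomputes them as a consistency check; the function name `hsp_percentages` is hypothetical and is not part of BLAST or of this database.

```python
import re


def hsp_percentages(stats_line: str) -> dict:
    """Recompute percent identity and percent positives from an HSP summary line."""
    result = {}
    # "Pos\w*tives" matches the label loosely, so spelling variants are accepted.
    for key, label in (("identity", r"Identity"), ("positives", r"Pos\w*tives")):
        m = re.search(label + r"\s*=\s*(\d+)/(\d+)", stats_line)
        if m is None:
            raise ValueError(f"no {key} field in: {stats_line!r}")
        matched, aligned = int(m.group(1)), int(m.group(2))
        # Percentage of aligned columns, formatted to two decimals as in the report.
        result[key] = f"{100 * matched / aligned:.2f}"
    return result


line = "Identity = 186/328 (56.71%), Positives = 221/328 (67.38%), Query Frame = 1"
print(hsp_percentages(line))  # {'identity': '56.71', 'positives': '67.38'}
```

For the Vitis vinifera hits this gives 186/328 = 56.71% identity and 221/328 = 67.38% positives, matching the reported values.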

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
EFM_ARATH   5.3e-36   37.72         Myb family transcription factor EFM OS=Arabidopsis thaliana GN=EFM PE=1 SV=2
PHR1_ORYSI  3.1e-12   43.69         Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE...
PHR1_ORYSJ  3.1e-12   43.69         Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 ...
PHL4_ARATH  7.7e-11   55.56         Myb family transcription factor PHL4 OS=Arabidopsis thaliana GN=PHL4 PE=2 SV=1
PCL1_ARATH  2.9e-10   34.21         Transcription factor LUX OS=Arabidopsis thaliana GN=LUX PE=1 SV=1
Match Name        E-value   Identity (%)  Description
A0A0A0KAT4_CUCSA  4.1e-144  79.49         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G031440 PE=4 SV=1
D7T987_VITVI      4.3e-85   56.71         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g03110 PE=4 SV=...
M5XS96_PRUPE      2.1e-84   55.10         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007222mg PE=4 SV=1
B9ST36_RICCO      7.6e-82   50.68         DNA binding protein, putative OS=Ricinus communis GN=RCOM_0353920 PE=4 SV=1
A0A067KAT1_JATCU  7.6e-82   52.21         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18291 PE=4 SV=1
Match Name   E-value   Identity (%)  Description
AT1G25550.1  1.5e-68   50.00         myb-like transcription factor family protein
AT1G68670.1  7.1e-63   45.61         myb-like transcription factor family protein
AT1G13300.1  3.9e-61   45.28         myb-like transcription factor family protein
AT3G25790.1  4.0e-58   43.00         myb-like transcription factor family protein
AT2G03500.1  3.0e-37   37.72         Homeodomain-like superfamily protein
Match Name                        E-value   Identity (%)  Description
gi|659089682|ref|XP_008445641.1|  2.8e-146  77.48         PREDICTED: probable transcription factor GLK2 [Cucumis melo]
gi|778710070|ref|XP_011656513.1|  5.9e-144  79.49         PREDICTED: probable transcription factor GLK2 [Cucumis sativus]
gi|645276211|ref|XP_008243179.1|  2.1e-85   54.57         PREDICTED: probable transcription factor GLK2 [Prunus mume]
gi|359472981|ref|XP_003631224.1|  6.2e-85   56.71         PREDICTED: probable transcription factor GLK2 [Vitis vinifera]
gi|297737857|emb|CBI27058.3|      6.2e-85   56.71         unnamed protein product [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
Vocabulary: INTERPRO
Term        Definition
IPR017930   Myb_dom
IPR009057   Homeobox-like_sf
IPR006447   Myb_dom_plants
IPR001005   SANT/Myb
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG13g06810.1  Cp4.1LG13g06810.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description     Source    Source Term       Source Description                                  Alignment
IPR001005  SANT/Myb domain     PFAM      PF00249           Myb_DNA-binding                                     coord: 207..258; score: 2.
IPR006447  Myb domain, plants  TIGRFAMs  TIGR01557         TIGR01557                                           coord: 205..260; score: 1.4
IPR009057  Homeodomain-like    GENE3D    G3DSA:1.10.10.60                                                      coord: 202..262; score: 6.1
IPR009057  Homeodomain-like    unknown   SSF46689          Homeodomain-like                                    coord: 203..262; score: 1.31
IPR017930  Myb domain          PROFILE   PS51294           HTH_MYB                                             coord: 202..262; score: 15
None       No IPR available    PANTHER   PTHR31003         MYB FAMILY TRANSCRIPTION FACTOR                     coord: 1..324; score: 1.3E
None       No IPR available    PANTHER   PTHR31003:SF4     GENOMIC DNA, CHROMOSOME 3, TAC CLONE:K13N2-RELATED  coord: 1..324; score: 1.3E
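
The five domain-level hits (PFAM, TIGRFAMs, GENE3D, SSF and PROFILE) report slightly different coordinates for the same Myb-like region. As an illustration only, the sketch below computes the interval shared by all of them; the helper name `domain_core` is hypothetical, and taking the intersection is just one simple way to summarize overlapping predictions, not a method used by InterPro itself.

```python
import re


def domain_core(coords: list[str]) -> tuple[int, int]:
    """Return the (start, end) interval common to all 'start..end' strings."""
    spans = []
    for c in coords:
        m = re.fullmatch(r"(\d+)\.\.(\d+)", c.strip())
        if m is None:
            raise ValueError(f"not a 'start..end' coordinate: {c!r}")
        spans.append((int(m.group(1)), int(m.group(2))))
    # The shared interval starts at the latest start and ends at the earliest end.
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    if start > end:
        raise ValueError("the intervals have no position in common")
    return start, end


# Coordinates copied from the Alignment column of the domain-level rows.
myb_hits = ["207..258", "205..260", "202..262", "203..262", "202..262"]
print(domain_core(myb_hits))  # (207, 258)
```

The shared interval, residues 207..258, coincides with the PFAM Myb_DNA-binding hit, which is the narrowest of the five predictions.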

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG13g06810  Cp4.1LG01g17980  Cucurbita pepo (Zucchini)  cpecpeB209