Cp4.1LG15g01030 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG15g01030
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protodermal factor 2, putative
Location: Cp4.1LG15 : 758765 .. 762109 (-)
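The location field packs the scaffold name, the coordinate range and the strand into one string. A minimal sketch of how it could be unpacked is shown here; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG15 : 758765 .. 762109 (-)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG15 - 3345
```

Under that assumption the feature spans 3345 positions on the minus strand of Cp4.1LG15.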
The following sequences are available for this feature:

Gene sequence (with intron)

TGCTTCGATCAAGATATAGATATTCATAAAAATACATGATATTATCGACGTACAGATTTTTGACACTTCTTGATTATTATGTTATAGCCGTTTTTTCATCAATTATTCTGTTCTGTGAGTAGCATATTTGGATTTTGATCTCTGGAAATATTTCCATCTGTGTTGGATTCGATTCGTTAGAGTGGATTAGTCATCAAACTTTGATATAAGTTAGAGCTCAGAAATGAATCTTGAGACGGATGCTGAGTTACCCCAAGCTGAGGAATCAATTCCCGAAGTGACTATGGAACAGAAACCTAATGAAGGACCGTCTGATTCGATGAAAGATGACCTTGAGGTTTCATCTAGTTGCTCTAATGTATGTTTTACTTCTATTATCAATCTGTTATTATTTGATATTATAAACTCATTGCTAGTTGAACTGGAATTGAAATCAAGTTTTTAATTTCAAAAATCACTCTCACATATGTCTTTAATTATTCAAAATCAATTTAGACTAAACATTGGATTGTTTAAATTGTTTTGAATAAGTAAGACATATTTGAGATTGTTTCAAATCACTGTAAAACGTGTTGTTAGTCTTTGTAGAATCATTAACAGAATCGACAATGCAGGATTCGAGCATCGGGGAAATTTTTTACAATGATGAACCTGACTTCCTAGAAATGCAAAGGGTAAATAATATAATCAAAGCGGCTTACAAGGAATTCATGGCAATAGCCATGAGAGCAAACGCTTTGATTTTCCATCCTCCAGCAGTAGGATCAGTAGAGGATTTACAAGAGATGTTCCACACTACACCACCTCGTGGATATACAGTTGAATGTTCCGTCGAATATGACGTTCTTTCCATTACTCCAGAATCGCTGATCACAATAATGATGAATGGGGTGAGATATCCTTAACTTGACATTGATTTTCTATACATTATTCTTAACATTTTGTACCCTTATTCACACTGTTTATAGGTGATATGGAGTTCGATATTTTCCAACATCATTTGTGATGGATCATTTGAAAATGTTCTCACTCCTTTGAAAGAATTCTGGTCAGCAGATTCAAATTCAAATTCTAAGGATGTTGTATTGGTAACTCTTTTTTTTTCTTTTGTTCCTTCTAAACTTTAGGACACAGAAAAAGAAAATATATGAACTTTTATCATATAATAGGTCAATGCCCAGTTAAGACTGCCGGCTGAATTCCTGCCAAGATGGTGCACTCGCTTTTTGAGATTTAAGATGGACGTCGCCGAAGATACATGGGCTATCTGCGATGTGTCAACTGATTATTTCATGGAAGCAACCTCTGATCCCACCGTGGGAACTCAATACAGGCGGAGGCCTTCAGGAGTGGTAATCCGGCAGTTTGGTGTTCTCTCAGAGGTCTGTTTACAGAAAACCATCATCTATCAGTAATTACTTCACAAATTTACAAATCTAAAACAGAATTTTATTTTACAGGTTCTATGGGTTGAAAACGCAGAGGTACAAGAGATTGATATCCCTAACAACATATCCTCAAAAGTTACTTCAAACTTTCATTTAACTGCAAAGCAATGGATCAGTATCTTATCACAAAATTTGAAACGTAGATGGAGCAAGATAACGACTTCAGAGATGTTCGACGACCCCAATGGTATAGTGTATGCTCTTAATTTCTCTTATGAATGTTAGGAATCACGACTCTCCACAAATACTGTCCACTTTGAGCACAAGCTCTCATGACTTTGCTTTTGGGTTTCCAAAAGGCCTCATACCGATGGAGATGTATTTATTACTTATAAACCCATAATCATTCCCTCAATTAATTAGCTACCGTGGGACTCCCTCCCAACAATTCTCCCCTCGAACAAAGTACACTATAGAGCCTCCCTTGAAGCCTATGGAGCCCTCGAACAGCCTCTCCTTAATCGAGGCTTGACTCATTCTTTGGAGTCCTCGAACAAAGGACACCATTTGTTTGACACTTGAGTCACTTTTGACTACACCTTCGAGGCTCAC
AACTTATTTGTTCAACATTTGAGGATTCTATTGACATGACTAAGTTAAGAGCATGACTCTGATACCATGTTAGGAATCACAACTCTCCACAATGGTATGATATTGTCCACTTTGAGCATAATCTCTCATGGCTTATGGGGTTTCCCAAAATGCCTCATACCAATGGAGATGGAGATGTATTCCTTACTTATAAACCCATGATCATGCCATAAATTAGCCAACGTAGGACTCCGTCCCAATAATCCTCAACCATGAACAGCCTTATAAAGCCTTAAATTCTCAAGCTAATTGGATTTTATGGGATAACAGATGTACCTGATCTGTTGACAATTGGGGAGAACCTGAAGAGATTATATCTAACATCAGTTAATCCCTTTCCACTGGAAAGAAAATGGGATTTATATTGTGATGATAATATAAGGATTTTGAGAGACATGAAAGCACGTAACATTGGCTATCACCATGATTATGTTGCCTCAAGTACCGTATACATTCCAGAAACCCCCATCAGACTCTTAATGTTTCTAGCCACATATAATGCGACATACCAGGTAAACTCCGTCTTAGAAACTAAACTTATAATTTAACCTATATTCTTGCTTGCTTATACTCGCCACATACAGTTGACGAGCATAAAATCACCGGAACAGCTCAAGCTGGTAGCTGCACTTTACACAGAAGATAATAGTTACACCATTGGTTTACGTAGAAAAGTAGAAGAAACAGTAGCTGATGAACTTTTTGCGGTGCTTCTCTTTTACATAAACTTCTTTACCTGATCCTCCTTGCCCAAAACTTATATACTTCAATGATATGATATGCTTGTTTAACGTGATAGGACCATGAATTTTTCTTACAAGAAGCTACAGCGAATGACTATTGCTCCCTCGTCCTATCATCACAATTGTCAGAAGAAGATGTTCATATCTCTCTCATGCCCAAATTTAGTATGAATAGTGTGTTTCTGAGACCTTCTGGCTTTGCCATCATGCCTGCGGAACAGGGAGGTCTACAGTCCAAGGCGTCCCTCGTGACCATATTCATACGTCGGGAGCTTCAATATATCGAAGATGATCATGCAATTGCAGTTATGAGAAGCCACATGAGTGATGTAATTGATCAGATGACTAATATCCAATCTCCAAACGCTACAGCTGAGGACTCAAAACAGAAAACTCCTAAAAAGGTATTCTAAGTTATGAGAAGCACAATTCTTTCCATTTTTTTATCACTATTATTAGCAATATAGATTCTAAAATTCAGCACTGCAAGGTTGAAAGTCACTTTAAGCTTGCAGTGCTGAATTTAAGCTCATGTAATGTGTAAATTTTACCCAACATTTTTA

mRNA sequence

TGCTTCGATCAAGATATAGATATTCATAAAAATACATGATATTATCGACGTACAGATTTTTGACACTTCTTGATTATTATGTTATAGCCGTTTTTTCATCAATTATTCTGTTCTGTGAGTAGCATATTTGGATTTTGATCTCTGGAAATATTTCCATCTGTGTTGGATTCGATTCGTTAGAGTGGATTAGTCATCAAACTTTGATATAAGTTAGAGCTCAGAAATGAATCTTGAGACGGATGCTGAGTTACCCCAAGCTGAGGAATCAATTCCCGAAGTGACTATGGAACAGAAACCTAATGAAGGACCGTCTGATTCGATGAAAGATGACCTTGAGGTTTCATCTAGTTGCTCTAATGATTCGAGCATCGGGGAAATTTTTTACAATGATGAACCTGACTTCCTAGAAATGCAAAGGGTAAATAATATAATCAAAGCGGCTTACAAGGAATTCATGGCAATAGCCATGAGAGCAAACGCTTTGATTTTCCATCCTCCAGCAGTAGGATCAGTAGAGGATTTACAAGAGATGTTCCACACTACACCACCTCGTGGATATACAGTTGAATGTTCCGTCGAATATGACGTTCTTTCCATTACTCCAGAATCGCTGATCACAATAATGATGAATGGGGTGATATGGAGTTCGATATTTTCCAACATCATTTGTGATGGATCATTTGAAAATGTTCTCACTCCTTTGAAAGAATTCTGGTCAGCAGATTCAAATTCAAATTCTAAGGATGTTGTATTGGTCAATGCCCAGTTAAGACTGCCGGCTGAATTCCTGCCAAGATGGTGCACTCGCTTTTTGAGATTTAAGATGGACGTCGCCGAAGATACATGGGCTATCTGCGATGTGTCAACTGATTATTTCATGGAAGCAACCTCTGATCCCACCGTGGGAACTCAATACAGGCGGAGGCCTTCAGGAGTGGTAATCCGGCAGTTTGGTGTTCTCTCAGAGGTTCTATGGGTTGAAAACGCAGAGGTACAAGAGATTGATATCCCTAACAACATATCCTCAAAAGTTACTTCAAACTTTCATTTAACTGCAAAGCAATGGATCAATGTACCTGATCTGTTGACAATTGGGGAGAACCTGAAGAGATTATATCTAACATCAGTTAATCCCTTTCCACTGGAAAGAAAATGGGATTTATATTGTGATGATAATATAAGGATTTTGAGAGACATGAAAGCACGTAACATTGGCTATCACCATGATTATGTTGCCTCAAGTACCGTATACATTCCAGAAACCCCCATCAGACTCTTAATGTTTCTAGCCACATATAATGCGACATACCAGTTGACGAGCATAAAATCACCGGAACAGCTCAAGCTGGTAGCTGCACTTTACACAGAAGATAATAGTTACACCATTGGTTTACGTAGAAAAGTAGAAGAAACAGTAGCTGATGAACTTTTTGCGGACCATGAATTTTTCTTACAAGAAGCTACAGCGAATGACTATTGCTCCCTCGTCCTATCATCACAATTGTCAGAAGAAGATGTTCATATCTCTCTCATGCCCAAATTTAGTATGAATAGTGTGTTTCTGAGACCTTCTGGCTTTGCCATCATGCCTGCGGAACAGGGAGGTCTACAGTCCAAGGCGTCCCTCGTGACCATATTCATACGTCGGGAGCTTCAATATATCGAAGATGATCATGCAATTGCAGTTATGAGAAGCCACATGAGTGATGTAATTGATCAGATGACTAATATCCAATCTCCAAACGCTACAGCTGAGGACTCAAAACAGAAAACTCCTAAAAAGGTATTCTAAGTTATGAGAAGCACAATTCTTTCCATTTTTTTATCACTATTATTAGCAATATAGATTCTAAAATTCAGCACTGCAAGGTTGAAAGTCACTTTAAGCTTGCAGTGCTGAATTTAAGCTCATGTAATGTGTAAATTTTACCCAACATTTTTA

Coding sequence (CDS)

ATGAATCTTGAGACGGATGCTGAGTTACCCCAAGCTGAGGAATCAATTCCCGAAGTGACTATGGAACAGAAACCTAATGAAGGACCGTCTGATTCGATGAAAGATGACCTTGAGGTTTCATCTAGTTGCTCTAATGATTCGAGCATCGGGGAAATTTTTTACAATGATGAACCTGACTTCCTAGAAATGCAAAGGGTAAATAATATAATCAAAGCGGCTTACAAGGAATTCATGGCAATAGCCATGAGAGCAAACGCTTTGATTTTCCATCCTCCAGCAGTAGGATCAGTAGAGGATTTACAAGAGATGTTCCACACTACACCACCTCGTGGATATACAGTTGAATGTTCCGTCGAATATGACGTTCTTTCCATTACTCCAGAATCGCTGATCACAATAATGATGAATGGGGTGATATGGAGTTCGATATTTTCCAACATCATTTGTGATGGATCATTTGAAAATGTTCTCACTCCTTTGAAAGAATTCTGGTCAGCAGATTCAAATTCAAATTCTAAGGATGTTGTATTGGTCAATGCCCAGTTAAGACTGCCGGCTGAATTCCTGCCAAGATGGTGCACTCGCTTTTTGAGATTTAAGATGGACGTCGCCGAAGATACATGGGCTATCTGCGATGTGTCAACTGATTATTTCATGGAAGCAACCTCTGATCCCACCGTGGGAACTCAATACAGGCGGAGGCCTTCAGGAGTGGTAATCCGGCAGTTTGGTGTTCTCTCAGAGGTTCTATGGGTTGAAAACGCAGAGGTACAAGAGATTGATATCCCTAACAACATATCCTCAAAAGTTACTTCAAACTTTCATTTAACTGCAAAGCAATGGATCAATGTACCTGATCTGTTGACAATTGGGGAGAACCTGAAGAGATTATATCTAACATCAGTTAATCCCTTTCCACTGGAAAGAAAATGGGATTTATATTGTGATGATAATATAAGGATTTTGAGAGACATGAAAGCACGTAACATTGGCTATCACCATGATTATGTTGCCTCAAGTACCGTATACATTCCAGAAACCCCCATCAGACTCTTAATGTTTCTAGCCACATATAATGCGACATACCAGTTGACGAGCATAAAATCACCGGAACAGCTCAAGCTGGTAGCTGCACTTTACACAGAAGATAATAGTTACACCATTGGTTTACGTAGAAAAGTAGAAGAAACAGTAGCTGATGAACTTTTTGCGGACCATGAATTTTTCTTACAAGAAGCTACAGCGAATGACTATTGCTCCCTCGTCCTATCATCACAATTGTCAGAAGAAGATGTTCATATCTCTCTCATGCCCAAATTTAGTATGAATAGTGTGTTTCTGAGACCTTCTGGCTTTGCCATCATGCCTGCGGAACAGGGAGGTCTACAGTCCAAGGCGTCCCTCGTGACCATATTCATACGTCGGGAGCTTCAATATATCGAAGATGATCATGCAATTGCAGTTATGAGAAGCCACATGAGTGATGTAATTGATCAGATGACTAATATCCAATCTCCAAACGCTACAGCTGAGGACTCAAAACAGAAAACTCCTAAAAAGGTATTCTAA

Protein sequence

MNLETDAELPQAEESIPEVTMEQKPNEGPSDSMKDDLEVSSSCSNDSSIGEIFYNDEPDFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHTTPPRGYTVECSVEYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVLTPLKEFWSADSNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPSGVVIRQFGVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWINVPDLLTIGENLKRLYLTSVNPFPLERKWDLYCDDNIRILRDMKARNIGYHHDYVASSTVYIPETPIRLLMFLATYNATYQLTSIKSPEQLKLVAALYTEDNSYTIGLRRKVEETVADELFADHEFFLQEATANDYCSLVLSSQLSEEDVHISLMPKFSMNSVFLRPSGFAIMPAEQGGLQSKASLVTIFIRRELQYIEDDHAIAVMRSHMSDVIDQMTNIQSPNATAEDSKQKTPKKVF
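
The protein sequence above should follow from the coding sequence by codon-by-codon translation. The sketch below illustrates this on the first 30 nucleotides of the CDS, assuming the standard nuclear genetic code; the names `CODON_TABLE`, `translate` and `CDS_HEAD` are illustrative and not part of this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the coding sequence listed above.
CDS_HEAD = "ATGAATCTTGAGACGGATGCTGAGTTACCC"
print(translate(CDS_HEAD))  # MNLETDAELP
```

The result matches the first ten residues of the protein sequence, and the final codons of the CDS (AAG GTA TTC TAA) likewise give the terminal KVF followed by a stop.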
BLAST of Cp4.1LG15g01030 vs. Swiss-Prot
Match: ANL2_ARATH (Homeobox-leucine zipper protein ANTHOCYANINLESS 2 OS=Arabidopsis thaliana GN=ANL2 PE=2 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 4.8e-11
Identity = 89/405 (21.98%), Positives = 162/405 (40.00%), Query Frame = 1

Query: 98  EDLQEMFHTTPPRGYTVECSVEYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVL 157
           ++    F +T P G   E S    ++ I   +L+  +M+   W+ +F   +   +  +V+
Sbjct: 358 DEYMRTFSSTKPTGLATEASRTSGMVIINSLALVETLMDSNRWTEMFPCNVARATTTDVI 417

Query: 158 TPLKEFWSADSNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDY 217
           +         + + +  + L+NA+L++ +  +P     FLRF    AE  WA+ DVS D 
Sbjct: 418 S------GGMAGTINGALQLMNAELQVLSPLVPVRNVNFLRFCKQHAEGVWAVVDVSIDP 477

Query: 218 FMEATSDPTVGTQYRRRPSGVVIRQF-GVLSEVLWVENAEVQEIDIPNNISSKVTSNFHL 277
             E +    V    RR PSG V++      S+V WVE+AE  E  I       + S    
Sbjct: 478 VRENSGGAPV---IRRLPSGCVVQDVSNGYSKVTWVEHAEYDENQIHQLYRPLLRSGLGF 537

Query: 278 TAKQWINVPD------LLTIGENLKRLYLTSVNP----------------------FPLE 337
            +++W+           + I  ++     TS+ P                       P  
Sbjct: 538 GSQRWLATLQRQCECLAILISSSVTSHDNTSITPGGRKSMLKLAQRMTFNFCSGISAPSV 597

Query: 338 RKWDLY----CDDNIRILRDMKARNIGYHHDYV--ASSTVYIPETPIRLLMFLATYNATY 397
             W        D ++R++      + G     V  A+++V++P  P RL  FL       
Sbjct: 598 HNWSKLTVGNVDPDVRVMTRKSVDDPGEPPGIVLSAATSVWLPAAPQRLYDFLRNERMRC 657

Query: 398 QLTSIKSPEQLKLVAALYTEDNSYTIGLRRKVEETVADELFADHEFFLQEATANDYCSLV 457
           +   + +   ++ +A + T+     + L R    + A          LQE   +   +LV
Sbjct: 658 EWDILSNGGPMQEMAHI-TKGQDQGVSLLR----SNAMNANQSSMLILQETCIDASGALV 717

Query: 458 LSSQLSEEDVHISLMPKFSMNSVFLRPSGFAIMPAEQGGLQSKAS 468
           + + +    +H+ +M     + V L PSGFA++P   GG+    S
Sbjct: 718 VYAPVDIPAMHV-VMNGGDSSYVALLPSGFAVLP--DGGIDGGGS 745

BLAST of Cp4.1LG15g01030 vs. Swiss-Prot
Match: ROC5_ORYSJ (Homeobox-leucine zipper protein ROC5 OS=Oryza sativa subsp. japonica GN=ROC5 PE=2 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 1.5e-09
Identity = 51/180 (28.33%), Positives = 85/180 (47.22%), Query Frame = 1

Query: 109 PRGYTVECSVEYDVLSITPE-SLITIMMNGVIWSSIFSNIICDGSFENVLTPLKEFWSAD 168
           P GY  E S E  ++ I    +L+  +M+   WS +FS +I         T L+E  +  
Sbjct: 364 PAGYVSEASRESGLVIIDNSLALVETLMDERRWSDMFSCMIAKA------TVLEEVSTGI 423

Query: 169 SNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTV 228
           + S +  ++L+ A+L++ +  +P     FLRF   +AE  WA+ DVS D  +   +  T 
Sbjct: 424 AGSRNGALLLMKAELQVLSPLVPIREVTFLRFCKQLAEGAWAVVDVSIDGLVRDHNSGTA 483

Query: 229 GT----QYRRRPSGVVIRQF-GVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWI 283
            T    + RR PSG V++       +V WVE+ E  E  +       + S     A++W+
Sbjct: 484 PTGGNVKCRRVPSGCVMQDTPNGYCKVTWVEHTEYDEASVHQLYRPLLRSGLAFGARRWL 537

BLAST of Cp4.1LG15g01030 vs. Swiss-Prot
Match: ROC4_ORYSJ (Homeobox-leucine zipper protein ROC4 OS=Oryza sativa subsp. japonica GN=ROC4 PE=2 SV=2)

HSP 1 Score: 64.7 bits (156), Expect = 3.4e-09
Identity = 70/265 (26.42%), Positives = 115/265 (43.40%), Query Frame = 1

Query: 33  MKDDLEVSSSCSNDSSIGEIFYNDEPDFLE--MQRVNNIIKAAYKEFMAIAMRANALIFH 92
           MK + E S+    D S+          FLE  M  ++ ++K A  +         A +  
Sbjct: 296 MKSEAEPSAMAGIDKSL----------FLELAMSAMDELVKMA--QMGDPLWIPGASVPS 355

Query: 93  PPAVGSVEDLQEMFHTTPP------RGYTVECSVEYDVLSITP-ESLITIMMNGVIWSSI 152
            PA  S+ + +E  +T PP       GY  E S E  ++ I    +L+  +M+   WS +
Sbjct: 356 SPAKESL-NFEEYLNTFPPCIGVKPEGYVSEASRESGIVIIDDGAALVETLMDERRWSDM 415

Query: 153 FSNIICDGSF-ENVLTPLKEFWSADSNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMD 212
           FS +I   S  E + T +    +      S +  ++ A+L++ +  +P    +FLRF   
Sbjct: 416 FSCMIAKASTTEEISTGVAGSRNGALLLVSDEHSVMQAELQVLSPLVPIREVKFLRFSKQ 475

Query: 213 VAEDTWAICDVSTDYFME----ATSDPTVGTQYRRRPSGVVIRQF-GVLSEVLWVENAEV 272
           +A+  WA+ DVS D  M      ++  T     RR PSG V++       +V WVE+ E 
Sbjct: 476 LADGVWAVVDVSADELMRDQGITSASSTANMNCRRLPSGCVLQDTPNGFVKVTWVEHTEY 535

Query: 273 QEIDIPNNISSKVTSNFHLTAKQWI 283
            E  +       + S   L A +WI
Sbjct: 536 DEASVHPLYRPLLRSGLALGAGRWI 547

BLAST of Cp4.1LG15g01030 vs. TrEMBL
Match: A0A0A0L8M1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G609280 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 4.5e-141
Identity = 287/548 (52.37%), Positives = 367/548 (66.97%), Query Frame = 1

Query: 1   MNLETDAELPQAEESIPEVTMEQKPNEGPSDSMKDDLEVSSSCSND--SSIGEIFYNDEP 60
           M L T  +LP+ EE  PE ++EQ        + ++D  V+ +  N   S +G+IFYNDE 
Sbjct: 21  MLLATKTKLPEIEELRPEPSIEQTFGHEEVSNTENDAPVAVTSDNSCHSCLGDIFYNDEL 80

Query: 61  DFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHTTPPRGYTVECSV 120
           DFLE+Q VNNI+KAAYKEFMAIA+ A A I  PPAVG++ED QEMF+T PP GYTVE SV
Sbjct: 81  DFLEVQWVNNIMKAAYKEFMAIAIAAKAWISDPPAVGAIEDFQEMFNTPPPHGYTVERSV 140

Query: 121 EYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVLTPLKEFWSADSNSNSKDVVLV 180
           E  +LSI+ +SL++IMM+G  W+S+FS+IIC  S E V  PLK+F    +     + VL+
Sbjct: 141 ETAILSISSQSLMSIMMDGAQWASMFSSIICSASDEVVFYPLKKFLL--TGPCGWEFVLM 200

Query: 181 NAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPSGV 240
           NA+ RLPA FLPRW TRF+RFK  +  +T+AI DVSTDYF   T+DPT    Y+RRPSGV
Sbjct: 201 NAEFRLPAGFLPRWNTRFMRFKKLIVGETYAIFDVSTDYFENMTADPTQKVVYKRRPSGV 260

Query: 241 VIRQFGVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWI---------------- 300
           +IR  G LSEV+W+ENAEVQ+IDIPN++ S  T NFHLTA+QWI                
Sbjct: 261 IIRPCGFLSEVIWIENAEVQKIDIPNHLHSTFTPNFHLTARQWISMISQNLKRRNGEIVT 320

Query: 301 ---------NVPDLLTIGENLKRLYLTSVNPFPLERKWDLYCDDNIRILRDMKARNIGYH 360
                    +VPDLLT+G NL++ +L +VNPFP ERKWDL+ DD IRILRD+KA  IG  
Sbjct: 321 EEMFAVRRMDVPDLLTMGNNLRKYFLQAVNPFPTERKWDLFSDDKIRILRDIKASYIGRR 380

Query: 361 HDYVASSTVYIPETPIRLLMFLATYNATYQLTSIKSPEQLKL-VAALYTEDNSYTIGLRR 420
            D++A  TV + ETP  LL +L T N   Q TS KS  QL + VA L T+++S T+    
Sbjct: 381 DDFIAIRTVCLAETPSTLLTYLDTNNYILQ-TSKKSQAQLSMTVALLATDESSCTV---L 440

Query: 421 KVEETVADELFADHEFFLQEATANDYCSLVLSSQLSEEDVHISLMPKFSMNSVFLRPSGF 480
            V++   DE   D+ FFLQE+T N+YCS +LSSQ+++ DVH+SL+P F  N +FLRPSGF
Sbjct: 441 SVKKETGDEDTKDNYFFLQESTENEYCSFILSSQMTKADVHVSLLPMFCRNCLFLRPSGF 500

Query: 481 AIMPAEQGGLQSKASLVTIFIRRELQYIEDDHAIAVMRSHMSDVIDQMTNIQSPNATAED 521
           AIMPAE GGLQSKAS VTI+IRREL+ +E    I  M   M  VIDQ++NIQ P      
Sbjct: 501 AIMPAEPGGLQSKASFVTIYIRRELKNMEVHQVIEAMSCDMDAVIDQISNIQFPTTINGK 560

BLAST of Cp4.1LG15g01030 vs. TrEMBL
Match: A0A061DNQ3_THECC (Protodermal factor 2, putative OS=Theobroma cacao GN=TCM_003923 PE=4 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 7.7e-32
Identity = 133/480 (27.71%), Positives = 220/480 (45.83%), Query Frame = 1

Query: 59  DFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHTTPPRGYTVECSV 118
           ++ ++Q  NN  KAAY+E M +A  A   +   P +GS++D+   F T    G  +E SV
Sbjct: 58  EYQKLQCANNYFKAAYQEVMRMAEEAGTWVRMKPGLGSLDDIVNEFQTPALPGKKLESSV 117

Query: 119 EYDVL-SITPESLITIMMN-GVIWSSIFSNIICDGSFENVLTPLKEFWSADSNSNSKDVV 178
              V+ ++    ++T+MMN    WS     I+  G        L+   + D+    K VV
Sbjct: 118 ATAVIPNVRASEMVTMMMNVNKKWSKSLFPIVNYGEEYTPRRILQHLRNTDTAI--KGVV 177

Query: 179 LVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPS 238
            V A+L+LP   +P     F R+  ++ +  + + D+S+ Y      D +     R+RPS
Sbjct: 178 QVYAELQLPTTSVPTRYFDFFRYVKEIMKSIYIVVDISSHYL----GDGSANCNSRKRPS 237

Query: 239 GVVIRQFGVLS-EVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWINV----------- 298
           GV+IR+ G L  E++ VEN EV E    N  SSK +SNF      WI+            
Sbjct: 238 GVIIRERGPLDCEIICVENVEVDE-PRENMYSSKTSSNFAFCVNHWISTLLWKLRRDRST 297

Query: 299 -----PDLLTIGENLKRLYLTSVNPFPLERKWDLYCDDNIRILRDMK-------ARNIGY 358
                 DL     +       S+  F +E   +   +D + +L + +        + +  
Sbjct: 298 FIDVKIDLHHSAGDYLLALTRSMKHFFMECFSEHPNEDLLSVLTNAEDPIRLLHNKTLEE 357

Query: 359 HHDYVASSTVYIPETPIRLLMFLATYNATYQLTSIK---SPEQLKLVAALYTEDNSYTIG 418
              YV  ++ +I   P+ +  FL   +   Q  S     + E+ + +    T+D S TI 
Sbjct: 358 FIGYVGLNSFHIQAKPLSVFQFLMKKDLQLQFRSTSNSDTEEEPEELFKFITDDKSNTIS 417

Query: 419 L-RRKVEETVADELFADHEFFLQEATANDYCSLVLSSQLSEEDVHISLMPKFSMNSVFLR 478
           L R+KVEE        +  + LQEAT ++YCS +LS  ++E+ V+ +++    + S++ +
Sbjct: 418 LHRKKVEE--------EARYCLQEATRDEYCSFILSKLINEDHVNFNIVS--GVQSIYQK 477

Query: 479 ---------PSGFAIMPAEQGGLQSKASLVTIFIRRELQYIEDDHAIAVMR-SHMSDVID 499
                     SGFAIMP   GGLQ   SLVT  ++      E    +  +R   +SD+I+
Sbjct: 478 GDDRVLDTVTSGFAIMPDGPGGLQCDGSLVTFLVQLHYDRTEGPVTLDTVREDFLSDLIE 520

BLAST of Cp4.1LG15g01030 vs. TrEMBL
Match: A0A0D2QV56_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G238000 PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 2.4e-17
Identity = 95/312 (30.45%), Positives = 143/312 (45.83%), Query Frame = 1

Query: 180 AQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPSGVV 239
           A+L+LP   +P     FLR+  ++ +  + I DVS+ Y     SDP       RRPSGV+
Sbjct: 168 AELQLPTTLVPTRYFEFLRYGKEIMDGIYIIVDVSSRY-----SDPFAKRNSERRPSGVI 227

Query: 240 IRQFGVLS-EVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWINVPDLLTIGENLKRLY 299
           IR+ G    E++W+EN EV E    N  S+ + SN    A +W+      T+  NLKR  
Sbjct: 228 IREHGPEDCEIIWIENVEVDETR-ENLYSTIIGSNLAYGAHRWVT-----TLLWNLKR-- 287

Query: 300 LTSVNPFPLERKWDLYCDDNIRILRDMKARN------IGYHHDYVASSTVYIPETPIRLL 359
               + F  + K D++      +L   +A        +  H D  A + +   E PIR+L
Sbjct: 288 --DKSSFS-DLKIDVHPGAGSFLLALTQAMKRFFMECVSQHPDEAALTVITSGEDPIRIL 347

Query: 360 --MFLATYNATYQLTSIK-SPEQLKLVAALYTEDNSYTIGLRRKVEETVADELFADHEFF 419
               L  Y +   + S +   + L +   L  +D         ++E  V         + 
Sbjct: 348 HNKKLTEYISFVGVNSFRVQAKPLSVFQFLMKKD--------LQLEGIV---------YG 407

Query: 420 LQEATANDYCSLVLSSQLSEEDV--HI-----SLMPKFSMNSVFLRPSGFAIMPAEQGGL 475
           LQEA+ ++YCS +LS  L+E+ V  HI     S+          + PSGFAIMP   GGL
Sbjct: 408 LQEASMDEYCSFILSKTLTEDTVNAHIVCGNKSIYEDSKSRMANITPSGFAIMPDGPGGL 446

BLAST of Cp4.1LG15g01030 vs. TrEMBL
Match: A0A061DQF7_THECC (Protodermal factor 2, putative OS=Theobroma cacao GN=TCM_003924 PE=4 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 9.4e-14
Identity = 81/271 (29.89%), Positives = 124/271 (45.76%), Query Frame = 1

Query: 22  EQKPNEGPSDSMKD-----DLEVSSSCSNDSSIGEIFYNDEPDFLEMQRVNNIIKAAYKE 81
           +++ +E  SD  +D     D  ++ + S D S   I Y +    L  +   N+ KAAY E
Sbjct: 78  QERTSEVFSDQHRDVEAYSDRSITDAVSLDKSEDSIEYLE----LSAEIAKNVCKAAYHE 137

Query: 82  FMAIAMRANALIFHPPAVGSV---EDLQEMFHTTPPRGYTVECSVEYDVLSITPESLITI 141
           FM IAMR  + +   PA G +    D+ E F T  P G   E S    ++ I    + ++
Sbjct: 138 FMRIAMRERSWLPGNPAEGFLPGDTDISE-FRTRTPAGRKTEVSTASAIIPIPASEMASM 197

Query: 142 MMNGVIWSSIFSNIICDGSFENVLTPLKEFWSADSNSNSKDVVLVNAQLRLPAEFLPRWC 201
           MM+   W+++F  I+              +W A       D   V  ++R     + R  
Sbjct: 198 MMDVNQWANLFVQIV--------------YWGA-----GYDCTHVLQRMREDDNSIQR-- 257

Query: 202 TRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPSGVVIRQFGVLS-EVLWV 261
                  +++   ++AI DVS  Y      DPT  T   R+PSGVVIR+    S E++WV
Sbjct: 258 ------LVEITPSSYAIIDVSIYYI-----DPTSRTDSLRKPSGVVIREHDQDSCEIVWV 310

Query: 262 ENAEVQEIDIPNNISSKVTSNFHLTAKQWIN 284
           EN EV E+   N  SS + SN    A++WI+
Sbjct: 318 ENVEVDEVS-ENIYSSVINSNLAFCAQRWIS 310

BLAST of Cp4.1LG15g01030 vs. TrEMBL
Match: A0A161ZVC4_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_021815 PE=4 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 2.1e-13
Identity = 50/187 (26.74%), Positives = 100/187 (53.48%), Query Frame = 1

Query: 100 LQEMFHTTPPRGYTVECSVEYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVLTP 159
           +Q +F++ P      E +     + +  + L+ +M +  +W+S   +I+      +   P
Sbjct: 61  IQHVFNSLPSTETETENTSIARTVPVAADVLLRMMSDAELWASSLVHIV---HLLSSSVP 120

Query: 160 LKEFWSADSNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFM 219
              +    S+   K+++++NA  +LP+  + R    F RF  +V E+TW + D+STDYF+
Sbjct: 121 PNLYRQMKSDRRIKEILMINADFQLPSPLVQRQNFHFYRFIKEVDENTWIMFDISTDYFL 180

Query: 220 EATSDPTVGTQYRRRPSGVVIRQFGVLSE---VLWVENAEVQEIDIPNNISSKVTSNFHL 279
           + +SD +VG +  RR SGV++++    S+   V WVEN +    ++ +N S+ + S    
Sbjct: 181 D-SSDGSVGQKVWRRRSGVIVQKSNDSSKDCRVHWVENVQSPGTNLQDNHSAVINSKGWF 240

Query: 280 TAKQWIN 284
           +A +W++
Sbjct: 241 SANRWLS 243

BLAST of Cp4.1LG15g01030 vs. TAIR10
Match: AT4G00730.1 (AT4G00730.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)

HSP 1 Score: 70.9 bits (172), Expect = 2.7e-12
Identity = 89/405 (21.98%), Positives = 162/405 (40.00%), Query Frame = 1

Query: 98  EDLQEMFHTTPPRGYTVECSVEYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVL 157
           ++    F +T P G   E S    ++ I   +L+  +M+   W+ +F   +   +  +V+
Sbjct: 358 DEYMRTFSSTKPTGLATEASRTSGMVIINSLALVETLMDSNRWTEMFPCNVARATTTDVI 417

Query: 158 TPLKEFWSADSNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDY 217
           +         + + +  + L+NA+L++ +  +P     FLRF    AE  WA+ DVS D 
Sbjct: 418 S------GGMAGTINGALQLMNAELQVLSPLVPVRNVNFLRFCKQHAEGVWAVVDVSIDP 477

Query: 218 FMEATSDPTVGTQYRRRPSGVVIRQF-GVLSEVLWVENAEVQEIDIPNNISSKVTSNFHL 277
             E +    V    RR PSG V++      S+V WVE+AE  E  I       + S    
Sbjct: 478 VRENSGGAPV---IRRLPSGCVVQDVSNGYSKVTWVEHAEYDENQIHQLYRPLLRSGLGF 537

Query: 278 TAKQWINVPD------LLTIGENLKRLYLTSVNP----------------------FPLE 337
            +++W+           + I  ++     TS+ P                       P  
Sbjct: 538 GSQRWLATLQRQCECLAILISSSVTSHDNTSITPGGRKSMLKLAQRMTFNFCSGISAPSV 597

Query: 338 RKWDLY----CDDNIRILRDMKARNIGYHHDYV--ASSTVYIPETPIRLLMFLATYNATY 397
             W        D ++R++      + G     V  A+++V++P  P RL  FL       
Sbjct: 598 HNWSKLTVGNVDPDVRVMTRKSVDDPGEPPGIVLSAATSVWLPAAPQRLYDFLRNERMRC 657

Query: 398 QLTSIKSPEQLKLVAALYTEDNSYTIGLRRKVEETVADELFADHEFFLQEATANDYCSLV 457
           +   + +   ++ +A + T+     + L R    + A          LQE   +   +LV
Sbjct: 658 EWDILSNGGPMQEMAHI-TKGQDQGVSLLR----SNAMNANQSSMLILQETCIDASGALV 717

Query: 458 LSSQLSEEDVHISLMPKFSMNSVFLRPSGFAIMPAEQGGLQSKAS 468
           + + +    +H+ +M     + V L PSGFA++P   GG+    S
Sbjct: 718 VYAPVDIPAMHV-VMNGGDSSYVALLPSGFAVLP--DGGIDGGGS 745

BLAST of Cp4.1LG15g01030 vs. NCBI nr
Match: gi|659093670|ref|XP_008447653.1| (PREDICTED: homeobox-leucine zipper protein ROC7-like [Cucumis melo])

HSP 1 Score: 532.7 bits (1371), Expect = 7.1e-148
Identity = 294/534 (55.06%), Positives = 368/534 (68.91%), Query Frame = 1

Query: 1   MNLETDAELPQAEESIPEVTMEQKPNEGPSDSMKDD--LEVSSSCSNDSSIGEIFYNDEP 60
           M L T+ +LP+ E  +P+ ++EQ  +     + ++D  +EVS   SN S + +IFYNDE 
Sbjct: 17  MLLATETKLPEIEGLVPKPSVEQINDHEEVSNPENDAPIEVSYDNSNHSCLVDIFYNDEL 76

Query: 61  DFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHTTPPRGYTVECSV 120
           DFLE+QR+NNIIKAAYKEFMAIA+ A A I  PPAVG+VE+ QEMFHT PP GYTVE SV
Sbjct: 77  DFLEVQRINNIIKAAYKEFMAIAIAAKAWILDPPAVGAVEEFQEMFHTPPPHGYTVERSV 136

Query: 121 EYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVLTPLKEFWSADSNSNSKDVVLV 180
           E  +LSI   SL++IMM+G  W+S+FS+IIC  S E VL PLK+FW   +     D VL+
Sbjct: 137 ETAILSIPSRSLMSIMMDGGKWASMFSSIICSESDEIVLVPLKKFWM--TGPCDWDFVLM 196

Query: 181 NAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPSGV 240
           NAQ RLPAEFLPRW TRFLRFK  +  DT+AI DVSTDYF   T+DPT    Y+RRPSG+
Sbjct: 197 NAQFRLPAEFLPRWNTRFLRFKKLIVGDTYAIFDVSTDYFENMTADPTQKIVYKRRPSGL 256

Query: 241 VIRQFGVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWI---------------- 300
           ++R  G LSEV+W+ENAEV++IDIPN++ S  T NFHLTAKQWI                
Sbjct: 257 IVRPIGFLSEVIWIENAEVKKIDIPNHMHSTFTPNFHLTAKQWISMISQNLKRINGRIVT 316

Query: 301 --------NVPDLLTIGENLKRLYLTSVNPFPLERKWDLYCDDNIRILRDMKARNIGYHH 360
                   +VPDLLTIG NL++ +L +VNPFP ER WDL+ DD IRI RD+KA  IGYH 
Sbjct: 317 PDMFDESMDVPDLLTIGNNLRKYFLQTVNPFPTERTWDLFSDDKIRISRDIKASYIGYHD 376

Query: 361 DYVASSTVYIPETPIRLLMFLATYNATYQLTSIKSPEQLKLVAALYTEDNSYTIGLRRKV 420
           D +A  TV I ETP  LL +L   N  +Q TS  S  QL++  AL   D S    L  K 
Sbjct: 377 DCIAIRTVCIAETPTTLLTYLDVNNHIFQ-TSKNSQAQLEMAVALLATDESSCTILSMKK 436

Query: 421 EETVADELFADHEFFLQEATANDYCSLVLSSQLSEEDVHISLMPKFSMNSVFLRPSGFAI 480
           E  + DE   D++FFLQE+T N+YCS +LSSQ+SE DVHISL+P+F  NS+FLRPSGFAI
Sbjct: 437 E--IGDEDSKDNKFFLQESTENEYCSFILSSQMSEADVHISLLPRFCRNSLFLRPSGFAI 496

Query: 481 MPAEQGGLQSKASLVTIFIRRELQYIEDDHAIAVMRSHMSDVIDQMTNIQSPNA 509
           MPA  GGLQSKAS VTI+IRREL+ ++ +  I  M  H++ VID+++NIQ P A
Sbjct: 497 MPAGPGGLQSKASFVTIYIRRELKNMKVEQVIEAMSCHVNAVIDRISNIQFPCA 545

BLAST of Cp4.1LG15g01030 vs. NCBI nr
Match: gi|700203167|gb|KGN58300.1| (hypothetical protein Csa_3G609280 [Cucumis sativus])

HSP 1 Score: 509.6 bits (1311), Expect = 6.4e-141
Identity = 287/548 (52.37%), Positives = 367/548 (66.97%), Query Frame = 1

Query: 1   MNLETDAELPQAEESIPEVTMEQKPNEGPSDSMKDDLEVSSSCSND--SSIGEIFYNDEP 60
           M L T  +LP+ EE  PE ++EQ        + ++D  V+ +  N   S +G+IFYNDE 
Sbjct: 21  MLLATKTKLPEIEELRPEPSIEQTFGHEEVSNTENDAPVAVTSDNSCHSCLGDIFYNDEL 80

Query: 61  DFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHTTPPRGYTVECSV 120
           DFLE+Q VNNI+KAAYKEFMAIA+ A A I  PPAVG++ED QEMF+T PP GYTVE SV
Sbjct: 81  DFLEVQWVNNIMKAAYKEFMAIAIAAKAWISDPPAVGAIEDFQEMFNTPPPHGYTVERSV 140

Query: 121 EYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVLTPLKEFWSADSNSNSKDVVLV 180
           E  +LSI+ +SL++IMM+G  W+S+FS+IIC  S E V  PLK+F    +     + VL+
Sbjct: 141 ETAILSISSQSLMSIMMDGAQWASMFSSIICSASDEVVFYPLKKFLL--TGPCGWEFVLM 200

Query: 181 NAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPSGV 240
           NA+ RLPA FLPRW TRF+RFK  +  +T+AI DVSTDYF   T+DPT    Y+RRPSGV
Sbjct: 201 NAEFRLPAGFLPRWNTRFMRFKKLIVGETYAIFDVSTDYFENMTADPTQKVVYKRRPSGV 260

Query: 241 VIRQFGVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWI---------------- 300
           +IR  G LSEV+W+ENAEVQ+IDIPN++ S  T NFHLTA+QWI                
Sbjct: 261 IIRPCGFLSEVIWIENAEVQKIDIPNHLHSTFTPNFHLTARQWISMISQNLKRRNGEIVT 320

Query: 301 ---------NVPDLLTIGENLKRLYLTSVNPFPLERKWDLYCDDNIRILRDMKARNIGYH 360
                    +VPDLLT+G NL++ +L +VNPFP ERKWDL+ DD IRILRD+KA  IG  
Sbjct: 321 EEMFAVRRMDVPDLLTMGNNLRKYFLQAVNPFPTERKWDLFSDDKIRILRDIKASYIGRR 380

Query: 361 HDYVASSTVYIPETPIRLLMFLATYNATYQLTSIKSPEQLKL-VAALYTEDNSYTIGLRR 420
            D++A  TV + ETP  LL +L T N   Q TS KS  QL + VA L T+++S T+    
Sbjct: 381 DDFIAIRTVCLAETPSTLLTYLDTNNYILQ-TSKKSQAQLSMTVALLATDESSCTV---L 440

Query: 421 KVEETVADELFADHEFFLQEATANDYCSLVLSSQLSEEDVHISLMPKFSMNSVFLRPSGF 480
            V++   DE   D+ FFLQE+T N+YCS +LSSQ+++ DVH+SL+P F  N +FLRPSGF
Sbjct: 441 SVKKETGDEDTKDNYFFLQESTENEYCSFILSSQMTKADVHVSLLPMFCRNCLFLRPSGF 500

Query: 481 AIMPAEQGGLQSKASLVTIFIRRELQYIEDDHAIAVMRSHMSDVIDQMTNIQSPNATAED 521
           AIMPAE GGLQSKAS VTI+IRREL+ +E    I  M   M  VIDQ++NIQ P      
Sbjct: 501 AIMPAEPGGLQSKASFVTIYIRRELKNMEVHQVIEAMSCDMDAVIDQISNIQFPTTINGK 560

BLAST of Cp4.1LG15g01030 vs. NCBI nr
Match: gi|778681940|ref|XP_011651613.1| (PREDICTED: homeobox-leucine zipper protein ROC7-like [Cucumis sativus])

HSP 1 Score: 303.1 bits (775), Expect = 9.2e-79
Identity = 165/298 (55.37%), Positives = 212/298 (71.14%), Query Frame = 1

Query: 1   MNLETDAELPQAEESIPEVTMEQKPNEGPSDSMKDDLEVSSSCSND--SSIGEIFYNDEP 60
           M L T  +LP+ EE  PE ++EQ        + ++D  V+ +  N   S +G+IFYNDE 
Sbjct: 17  MLLATKTKLPEIEELRPEPSIEQTFGHEEVSNTENDAPVAVTSDNSCHSCLGDIFYNDEL 76

Query: 61  DFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHTTPPRGYTVECSV 120
           DFLE+Q VNNI+KAAYKEFMAIA+ A A I  PPAVG++ED QEMF+T PP GYTVE SV
Sbjct: 77  DFLEVQWVNNIMKAAYKEFMAIAIAAKAWISDPPAVGAIEDFQEMFNTPPPHGYTVERSV 136

Query: 121 EYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFENVLTPLKEFWSADSNSNSKDVVLV 180
           E  +LSI+ +SL++IMM+G  W+S+FS+IIC  S E V  PLK+F    +     + VL+
Sbjct: 137 ETAILSISSQSLMSIMMDGAQWASMFSSIICSASDEVVFYPLKKFLL--TGPCGWEFVLM 196

Query: 181 NAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATSDPTVGTQYRRRPSGV 240
           NA+ RLPA FLPRW TRF+RFK  +  +T+AI DVSTDYF   T+DPT    Y+RRPSGV
Sbjct: 197 NAEFRLPAGFLPRWNTRFMRFKKLIVGETYAIFDVSTDYFENMTADPTQKVVYKRRPSGV 256

Query: 241 VIRQFGVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWINVPDLLTIGENLKR 297
           +IR  G LSEV+W+ENAEVQ+IDIPN++ S  T NFHLTA+QWI++     I +NLKR
Sbjct: 257 IIRPCGFLSEVIWIENAEVQKIDIPNHLHSTFTPNFHLTARQWISM-----ISQNLKR 307

BLAST of Cp4.1LG15g01030 vs. NCBI nr
Match: gi|764527256|ref|XP_011458040.1| (PREDICTED: homeobox-leucine zipper protein MERISTEM L1-like isoform X2 [Fragaria vesca subsp. vesca])

HSP 1 Score: 247.3 bits (630), Expect = 6.0e-62
Identity = 161/496 (32.46%), Positives = 253/496 (51.01%), Query Frame = 1

Query: 47  SSIGEIFYNDEPDFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHT 106
           SS   +     P FLEMQ   NI  AAYKE M +A++AN  +   P +GS+E L  MF  
Sbjct: 13  SSSNGVVKEGVPKFLEMQECYNICMAAYKEIMTLALKANTWVVDIPVLGSIEKLHHMFRE 72

Query: 107 TPPRGYTVECSVEYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFE---NVLTPLKEF 166
            P  G  +E S+E  ++ +  ++++T M N   W+S+F NI+ + + E    V++ L   
Sbjct: 73  PPSPGMEIEYSIESGIVPLRLDNVVTTMTNVDEWASVFCNIVHNRTAEVSSGVMSNLAPN 132

Query: 167 WSADSNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATS 226
               +  N K V  +NA+L LP    P     F+RF  +V  D WA+ DVSTDYF   +S
Sbjct: 133 TPPHNRDNFKCVKTINAELLLPMPCAPTRKFSFIRFLREVIPDVWAVVDVSTDYFPHLSS 192

Query: 227 DPTVGTQYRRRPSGVVIRQFGVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWIN 286
             T+    +RRPSGV++R+    SEV+W+EN E+   ++ ++I S V SN    AK WI+
Sbjct: 193 --TLNVNCKRRPSGVIVRRRDDHSEVIWIENMEIHNYNVEDSIYSGVNSNLAFDAKHWIS 252

Query: 287 V---------------------------PDLLTIGENLKRLYLTSVNPFPLERKWDLYCD 346
           +                             LLT+   LK ++L+ +   P E KWDL  D
Sbjct: 253 MLLTKEKRIQSKFVTVKLPIMNHAFSVRHALLTLARRLKMVFLSYIAEHPDENKWDLLSD 312

Query: 347 DNIRILRDMKARNIGYHHDYVASSTVYIPETPIRLLMFLATYNATYQLTSIKSPEQLKLV 406
             I+IL+D  A+     ++Y+A +T  +  TP+ +  FL   N   Q   ++   + + V
Sbjct: 313 TGIKILKDTDAQ----RNNYMAVTTSRVEATPLSVFNFLVKRNRQLQWPCLEEMAEPEEV 372

Query: 407 AALYTEDNSYTIGLRRK-------VEETVADELFADHEFFLQEATANDYCSLVLSSQLSE 466
             L T+D+S  I +  K        E  +  E   + E+ LQEA+ +++CS ++SSQ+ +
Sbjct: 373 INLVTDDHSNCITIHAKRVNQNESTERVMQTESTENFEYVLQEASKDEFCSFIISSQIDQ 432

Query: 467 EDVHISLMPKFSMNSVFLRPSGFAIMPAEQGGLQSKASLVTIFIRRELQY-IEDDHAIAV 505
            +V+ +L   F   S  LRP GF+I+P    G  S AS+VT+  + EL+  +E    I  
Sbjct: 433 SEVNSAL--GFGQLSCSLRPFGFSIIPDGSRGHLSDASIVTMACQLELEADLETGEVIER 492

BLAST of Cp4.1LG15g01030 vs. NCBI nr
Match: gi|764527250|ref|XP_011458039.1| (PREDICTED: homeobox-leucine zipper protein MERISTEM L1-like isoform X1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 245.7 bits (626), Expect = 1.7e-61
Identity = 160/495 (32.32%), Positives = 252/495 (50.91%), Query Frame = 1

Query: 47  SSIGEIFYNDEPDFLEMQRVNNIIKAAYKEFMAIAMRANALIFHPPAVGSVEDLQEMFHT 106
           SS   +     P FLEMQ   NI  AAYKE M +A++AN  +   P +GS+E L  MF  
Sbjct: 13  SSSNGVVKEGVPKFLEMQECYNICMAAYKEIMTLALKANTWVVDIPVLGSIEKLHHMFRE 72

Query: 107 TPPRGYTVECSVEYDVLSITPESLITIMMNGVIWSSIFSNIICDGSFE---NVLTPLKEF 166
            P  G  +E S+E  ++ +  ++++T M N   W+S+F NI+ + + E    V++ L   
Sbjct: 73  PPSPGMEIEYSIESGIVPLRLDNVVTTMTNVDEWASVFCNIVHNRTAEVSSGVMSNLAPN 132

Query: 167 WSADSNSNSKDVVLVNAQLRLPAEFLPRWCTRFLRFKMDVAEDTWAICDVSTDYFMEATS 226
               +  N K V  +NA+L LP    P     F+RF  +V  D WA+ DVSTDYF   +S
Sbjct: 133 TPPHNRDNFKCVKTINAELLLPMPCAPTRKFSFIRFLREVIPDVWAVVDVSTDYFPHLSS 192

Query: 227 DPTVGTQYRRRPSGVVIRQFGVLSEVLWVENAEVQEIDIPNNISSKVTSNFHLTAKQWIN 286
             T+    +RRPSGV++R+    SEV+W+EN E+   ++ ++I S V SN    AK WI+
Sbjct: 193 --TLNVNCKRRPSGVIVRRRDDHSEVIWIENMEIHNYNVEDSIYSGVNSNLAFDAKHWIS 252

Query: 287 V---------------------------PDLLTIGENLKRLYLTSVNPFPLERKWDLYCD 346
           +                             LLT+   LK ++L+ +   P E KWDL  D
Sbjct: 253 MLLTKEKRIQSKFVTVKLPIMNHAFSVRHALLTLARRLKMVFLSYIAEHPDENKWDLLSD 312

Query: 347 DNIRILRDMKARNIGYHHDYVASSTVYIPETPIRLLMFLATYNATYQLTSIKSPEQLKLV 406
             I+IL+D  A+     ++Y+A +T  +  TP+ +  FL   N   Q   ++   + + V
Sbjct: 313 TGIKILKDTDAQ----RNNYMAVTTSRVEATPLSVFNFLVKRNRQLQWPCLEEMAEPEEV 372

Query: 407 AALYTEDNSYTIGLRRK-------VEETVADELFADHEFFLQEATANDYCSLVLSSQLSE 466
             L T+D+S  I +  K        E  +  E   + E+ LQEA+ +++CS ++SSQ+ +
Sbjct: 373 INLVTDDHSNCITIHAKRVNQNESTERVMQTESTENFEYVLQEASKDEFCSFIISSQIDQ 432

Query: 467 EDVHISLMPKFSMNSVFLRPSGFAIMPAEQGGLQSKASLVTIFIRRELQY-IEDDHAIAV 504
            +V+ +L   F   S  LRP GF+I+P    G  S AS+VT+  + EL+  +E    I  
Sbjct: 433 SEVNSAL--GFGQLSCSLRPFGFSIIPDGSRGHLSDASIVTMACQLELEADLETGEVIER 492

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name  E-value  Identity (%)  Description
ANL2_ARATH  4.8e-11  21.98         Homeobox-leucine zipper protein ANTHOCYANINLESS 2 OS=Arabidopsis thaliana GN=ANL...
ROC5_ORYSJ  1.5e-09  28.33         Homeobox-leucine zipper protein ROC5 OS=Oryza sativa subsp. japonica GN=ROC5 PE=...
ROC4_ORYSJ  3.4e-09  26.42         Homeobox-leucine zipper protein ROC4 OS=Oryza sativa subsp. japonica GN=ROC4 PE=...

vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L8M1_CUCSA  4.5e-141  52.37         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G609280 PE=4 SV=1
A0A061DNQ3_THECC  7.7e-32   27.71         Protodermal factor 2, putative OS=Theobroma cacao GN=TCM_003923 PE=4 SV=1
A0A0D2QV56_GOSRA  2.4e-17   30.45         Uncharacterized protein OS=Gossypium raimondii GN=B456_001G238000 PE=4 SV=1
A0A061DQF7_THECC  9.4e-14   29.89         Protodermal factor 2, putative OS=Theobroma cacao GN=TCM_003924 PE=4 SV=1
A0A161ZVC4_DAUCA  2.1e-13   26.74         Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_021815 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT4G00730.1  2.7e-12  21.98         Homeobox-leucine zipper family protein / lipid-binding START domain-...

vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659093670|ref|XP_008447653.1|  7.1e-148  55.06         PREDICTED: homeobox-leucine zipper protein ROC7-like [Cucumis melo]
gi|700203167|gb|KGN58300.1|       6.4e-141  52.37         hypothetical protein Csa_3G609280 [Cucumis sativus]
gi|778681940|ref|XP_011651613.1|  9.2e-79   55.37         PREDICTED: homeobox-leucine zipper protein ROC7-like [Cucumis sativus]
gi|764527256|ref|XP_011458040.1|  6.0e-62   32.46         PREDICTED: homeobox-leucine zipper protein MERISTEM L1-like isoform X2 [Fragaria...
gi|764527250|ref|XP_011458039.1|  1.7e-61   32.32         PREDICTED: homeobox-leucine zipper protein MERISTEM L1-like isoform X1 [Fragaria...

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005488 binding
molecular_function GO:0008289 lipid binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0003677 DNA binding
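
The GO table mixes the three ontology categories in one list. One way to group the terms by category is sketched below; the rows are copied from the table above, the function name `group_go_terms` is hypothetical, and the parser assumes the category and the accession never contain spaces while the term name may.

```python
from collections import defaultdict

GO_ROWS = """\
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005488 binding
molecular_function GO:0008289 lipid binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0003677 DNA binding
"""

def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two runs of whitespace only, so that
        # multi-word term names such as "lipid binding" stay intact.
        category, accession, name = line.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)

go_terms = group_go_terms(GO_ROWS)
for category, terms in go_terms.items():
    print(category, [accession for accession, _ in terms])
```

Grouped this way, the only terms more specific than the ontology roots are the molecular_function entries binding, lipid binding and DNA binding.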

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG15g01030.1  Cp4.1LG15g01030.1  mRNA


The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG15g01030  Cp4.1LG05g00890  Cucurbita pepo (Zucchini)  cpecpeB273