Cp4.1LG02g10010 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g10010
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Homeobox-leucine zipper family protein
Location: Cp4.1LG02 : 10001766 .. 10003872 (+)
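
The location above can be turned into a feature length with a small calculation. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention); the record itself does not state this, and `feature_length` is a hypothetical helper name.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = feature_length(10001766, 10003872)
print(gene_length)  # 2107
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, one base shorter.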
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTAGTTTTGGTTCCCATGAAATGGGATGTTCTTCTCTTGATCTTGATTTGCTTCCTGCAAGTTCTTCAAGTGTTCTTCCAATTCATGTCTCCAATATGGATAAGACCCTGATGAGTGAAGTTGCTACAAATGCCATGGGAGAGTTGCTAAGGCTTTGTCAAACTAATGAACCATTGTGGATGAAATCACGAAGCGATGGACGGGATGTTCTTGATCTCGAAACGTATGAACAGGCTTTTCCAAGAGCCAATGTGCCATTGGAGAATCTCCATTTTCGTACCGAAGTGTCTAGAGATTCCGGGGTGGTGATCGTGAATAGTGCAACGTTGGTTGATATGTTTATGGACTCGGTATGGCTATGTTAATGTCTCTTGCTCTTTCCATTTCAATGTTTCTTTATTGATATTGAATTGAGTTGAGTGTGCAGAACAAATGGAGTGAGTTATTTCCGACGATTGTTTCGAGTGCAAGAACGATTGAAGTTATATCGTCCGGGATGTTGGGAAGTCATCATAATGGCTCATTGCAACTGGTATATAGCTAGCTAGATGCAATCTTCCCGTGTCGTGTTACGTTTTCTGTTCCCTAATTTTTGAGTCAAAAGTGTTGGAATTGCAGATGTACCTAGAATTGCAGCTACTTTCCCCATTGGTTCCGACTCGCCATTTTTACATTCTTAGATATAGTCAACAAATCGAGCAGGGAACGTGGGCTGTAGTCGATGTTTCGTATAACGTCACCGGAGGAAGCCAAATGGTTTCTCATTCTCAATGTCGTCGCTCTCCCTCGGGCTGCTTGATCCAGGACATGCCTAATGGTTATTCCAAGGTTTTCCCATTTGAACTTCATTGTTTAGAACTTTTTCGAGTTTCTTCTCATTCTACAATGCTCACCAAAAACAGGTCACTTGGATTGAGCATGTGGAAGTGGAAGATAAAGGCTCAACTCATTGGCTTTTTAGAGATCTTATTCATAGCGGATTAGCCTTCGGTGCCGAACGGTGGCTCGCAACTCTTCAACGAACGTCTGAAAGATTCGCCTGTTCAATGGTTACTAGTAGTTCGAGTCAGGATCTCGGCGGAGGTAACTTCCCTGGTCGACTTTGAGATATTTTGTTCGTGATTCCTAACAGTTGTTTCGTTCTCTCAGTAATTCCATCATTAGAGGGAAGGAGAAGCATGATGAAACTAGCACAGCGGATGGTGAACAATTTCTGTGCTAGTATCAGCACGTCTCATGGTCACCGTTGGACGACTCTATCTGGGACGGATGAGGTCGGTGTTTATGTCTCGGTTCATAAAAGTATGGATCCTGGTCAGCGTAACGGCGTGGCTCTTAGTGCAGCTACAACCATATGGCTCCCTGTTCCTCCTCAAACCATCTTTAACTTCTTTAAGAATGACAGAACCAGATCTCAGGTAGTTGATACTAAAATCTCAAACAGTTTATAATGACAACGACAGCATTAACTGAAAGTTCATTGCTGCATCAGTGGGATGTCCTGTCGGATGGCAATCCAGTTCAAGAGGTTGCTCATATCACCAACGGATCCCATCCAGGAAACTGCATATCTGTTCTTAGAGTGAGTATTAATGTTATTTTGTTCTTGTTTCAAGCTCAACTAGAAGGCGTATTGATTTAGTGTAAACAATTTTTGTCGTTCGAATAGGCTATGAATTCGACACAGAACAACATGTTGATACTCCAAGAGAGTTGCATAGACTCATCTGGATCACTTGTTGTGTACTGCCCTGTGGATTTACCAGCCATGAATCTTGCAATGAGTGGGGAAGATCCATCAAACATTCCTTTACTACCATCAGGATTCGCGATTCTCCCGGACGGTCGACAGGATCAAGGAGAGGGCGCATCGAGCAGCTCGGACGTGCACAACCGGTCAGGTGGGTCGTTGGTGACGGTAGCGTTTCAGATACTCGTAAGTAGCTTGCCATCCGGGAAGCTGAATTTGGAATCAGTGACAACGGTTAATAACCTC
ATTAGTACCACTGTCCACCAAATCAAAACAGCCTTAAATTGTCATAGCTCCTCCTGAGAAGTGGCATATTCAGCTGAACACTGTTATGTTTTTTTTTTTTTTTTTTT

mRNA sequence

ATGAGTAGTTTTGGTTCCCATGAAATGGGATGTTCTTCTCTTGATCTTGATTTGCTTCCTGCAAGTTCTTCAAGTGTTCTTCCAATTCATGTCTCCAATATGGATAAGACCCTGATGAGTGAAGTTGCTACAAATGCCATGGGAGAGTTGCTAAGGCTTTGTCAAACTAATGAACCATTGTGGATGAAATCACGAAGCGATGGACGGGATGTCACTTGGATTGAGCATGTGGAAGTGGAAGATAAAGGCTCAACTCATTGGCTTTTTAGAGATCTTATTCATAGCGGATTAGCCTTCGGTGCCGAACGGTGGCTCGCAACTCTTCAACGAACGTCTGAAAGATTCGCCTGTTCAATGGTTACTAGTAGTTCGAGTCAGGATCTCGGCGGAGTAATTCCATCATTAGAGGGAAGGAGAAGCATGATGAAACTAGCACAGCGGATGGTGAACAATTTCTGTGCTAGTATCAGCACGTCTCATGGTCACCGTTGGACGACTCTATCTGGGACGGATGAGGTCGGTGTTTATGTCTCGGTTCATAAAAGTATGGATCCTGGTCAGCGTAACGGCGTGGCTCTTAGTGCAGCTACAACCATATGGCTCCCTGTTCCTCCTCAAACCATCTTTAACTTCTTTAAGAATGACAGAACCAGATCTCAGTGGGATGTCCTGTCGGATGGCAATCCAGTTCAAGAGGTTGCTCATATCACCAACGGATCCCATCCAGGAAACTGCATATCTGTTCTTAGAGCTATGAATTCGACACAGAACAACATGTTGATACTCCAAGAGAGTTGCATAGACTCATCTGGATCACTTGTTGTGTACTGCCCTGTGGATTTACCAGCCATGAATCTTGCAATGAGTGGGGAAGATCCATCAAACATTCCTTTACTACCATCAGGATTCGCGATTCTCCCGGACGGTCGACAGGATCAAGGAGAGGGCGCATCGAGCAGCTCGGACGTGCACAACCGGTCAGGTGGGTCGTTGGTGACGGTAGCGTTTCAGATACTCGTAAGTAGCTTGCCATCCGGGAAGCTGAATTTGGAATCAGTGACAACGGTTAATAACCTCATTAGTACCACTGTCCACCAAATCAAAACAGCCTTAAATTGTCATAGCTCCTCCTGAGAAGTGGCATATTCAGCTGAACACTGTTATGTTTTTTTTTTTTTTTTTTT

Coding sequence (CDS)

ATGAGTAGTTTTGGTTCCCATGAAATGGGATGTTCTTCTCTTGATCTTGATTTGCTTCCTGCAAGTTCTTCAAGTGTTCTTCCAATTCATGTCTCCAATATGGATAAGACCCTGATGAGTGAAGTTGCTACAAATGCCATGGGAGAGTTGCTAAGGCTTTGTCAAACTAATGAACCATTGTGGATGAAATCACGAAGCGATGGACGGGATGTCACTTGGATTGAGCATGTGGAAGTGGAAGATAAAGGCTCAACTCATTGGCTTTTTAGAGATCTTATTCATAGCGGATTAGCCTTCGGTGCCGAACGGTGGCTCGCAACTCTTCAACGAACGTCTGAAAGATTCGCCTGTTCAATGGTTACTAGTAGTTCGAGTCAGGATCTCGGCGGAGTAATTCCATCATTAGAGGGAAGGAGAAGCATGATGAAACTAGCACAGCGGATGGTGAACAATTTCTGTGCTAGTATCAGCACGTCTCATGGTCACCGTTGGACGACTCTATCTGGGACGGATGAGGTCGGTGTTTATGTCTCGGTTCATAAAAGTATGGATCCTGGTCAGCGTAACGGCGTGGCTCTTAGTGCAGCTACAACCATATGGCTCCCTGTTCCTCCTCAAACCATCTTTAACTTCTTTAAGAATGACAGAACCAGATCTCAGTGGGATGTCCTGTCGGATGGCAATCCAGTTCAAGAGGTTGCTCATATCACCAACGGATCCCATCCAGGAAACTGCATATCTGTTCTTAGAGCTATGAATTCGACACAGAACAACATGTTGATACTCCAAGAGAGTTGCATAGACTCATCTGGATCACTTGTTGTGTACTGCCCTGTGGATTTACCAGCCATGAATCTTGCAATGAGTGGGGAAGATCCATCAAACATTCCTTTACTACCATCAGGATTCGCGATTCTCCCGGACGGTCGACAGGATCAAGGAGAGGGCGCATCGAGCAGCTCGGACGTGCACAACCGGTCAGGTGGGTCGTTGGTGACGGTAGCGTTTCAGATACTCGTAAGTAGCTTGCCATCCGGGAAGCTGAATTTGGAATCAGTGACAACGGTTAATAACCTCATTAGTACCACTGTCCACCAAATCAAAACAGCCTTAAATTGTCATAGCTCCTCCTGA

Protein sequence

MSSFGSHEMGCSSLDLDLLPASSSSVLPIHVSNMDKTLMSEVATNAMGELLRLCQTNEPLWMKSRSDGRDVTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGGVIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNGVALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLRAMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGRQDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTALNCHSSS
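
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The following is a minimal sketch using the standard genetic code; the function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration, not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGAGTAGTTTTGGTTCCCATGAA"))  # MSSFGSHE
```

The printed peptide matches the first eight residues of the protein sequence shown above; passing the full CDS string would be the natural next check.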
BLAST of Cp4.1LG02g10010 vs. Swiss-Prot
Match: ROC8_ORYSJ (Homeobox-leucine zipper protein ROC8 OS=Oryza sativa subsp. japonica GN=ROC8 PE=2 SV=2)

HSP 1 Score: 386.0 bits (990), Expect = 4.9e-106
Identity = 199/313 (63.58%), Positives = 239/313 (76.36%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EH+EVE+K   + L+RDL+ SG AFGA RWLA LQR  ER+A S+V       + G
Sbjct: 394 VTWVEHMEVEEKSPINVLYRDLVLSGAAFGAHRWLAALQRACERYA-SLVALGVPHHIAG 453

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           V P  EG+RSMMKL+QRMVN+FC+S+  S  H+WTTLSG++EV V V++H+S DPGQ NG
Sbjct: 454 VTP--EGKRSMMKLSQRMVNSFCSSLGASQMHQWTTLSGSNEVSVRVTMHRSTDPGQPNG 513

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAAT+IWLPVP   +F F +++ TRSQWDVLS GN VQEV+ I NGS+PGNCIS+LR
Sbjct: 514 VVLSAATSIWLPVPCDHVFAFVRDENTRSQWDVLSHGNQVQEVSRIPNGSNPGNCISLLR 573

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
            +N++QN+MLILQESC D+SGSLVVY P+D+PA N+ MSGEDPS+IPLLPSGF ILPDGR
Sbjct: 574 GLNASQNSMLILQESCTDASGSLVVYSPIDIPAANVVMSGEDPSSIPLLPSGFTILPDGR 633

Query: 311 QDQGEGASSSS----------DVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLI 370
                GAS+SS                GGS+VTVAFQILVSSLPS KLN ESV TVN LI
Sbjct: 634 PGSAAGASTSSAGPLAAARGGGGGGAGGGSVVTVAFQILVSSLPSSKLNAESVATVNGLI 693

Query: 371 STTVHQIKTALNC 374
           +TTV QIK ALNC
Sbjct: 694 TTTVEQIKAALNC 703
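
The Identity and Positives percentages in each HSP header are the reported counts divided by the alignment length. The sketch below reproduces the figures for the ROC8_ORYSJ hit above; `percent` is a hypothetical helper, and rounding to two decimals is assumed from the way the values are displayed.

```python
def percent(count: int, alignment_length: int) -> float:
    """Share of aligned columns, as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts from the ROC8_ORYSJ HSP: 199 identical and 239 positive
# columns out of 313 aligned columns (gaps included in the length).
print(percent(199, 313))  # 63.58
print(percent(239, 313))  # 76.36
```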

BLAST of Cp4.1LG02g10010 vs. Swiss-Prot
Match: HDG11_ARATH (Homeobox-leucine zipper protein HDG11 OS=Arabidopsis thaliana GN=HDG11 PE=1 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 1.0e-103
Identity = 195/309 (63.11%), Positives = 238/309 (77.02%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EH+E E+K   H L+R++IH G+AFGA+RW+ TLQR  ERFA   V +SSS+DLGG
Sbjct: 414 VTWVEHIETEEKELVHELYREIIHRGIAFGADRWVTTLQRMCERFASLSVPASSSRDLGG 473

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VI S EG+RSMM+LAQRM++N+C S+S S+  R T +S  +EVG+ V+ HKS +P   NG
Sbjct: 474 VILSPEGKRSMMRLAQRMISNYCLSVSRSNNTRSTVVSELNEVGIRVTAHKSPEP---NG 533

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
             L AATT WLP  PQ +FNF K++RTR QWDVLS+GN VQEVAHI+NGSHPGNCISVLR
Sbjct: 534 TVLCAATTFWLPNSPQNVFNFLKDERTRPQWDVLSNGNAVQEVAHISNGSHPGNCISVLR 593

Query: 251 AMNST-QNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDG 310
             N+T  NNMLILQES  DSSG+ VVY PVDL A+N+AMSGEDPS IPLL SGF I PDG
Sbjct: 594 GSNATHSNNMLILQESSTDSSGAFVVYSPVDLAALNIAMSGEDPSYIPLLSSGFTISPDG 653

Query: 311 RQDQGE-GASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIK 370
                E G +S+S     + GSL+TV FQI+VS+LP+ KLN+ESV TVNNLI TTVHQIK
Sbjct: 654 NGSNSEQGGASTSSGRASASGSLITVGFQIMVSNLPTAKLNMESVETVNNLIGTTVHQIK 713

Query: 371 TALNCHSSS 378
           TAL+  ++S
Sbjct: 714 TALSGPTAS 719

BLAST of Cp4.1LG02g10010 vs. Swiss-Prot
Match: HDG12_ARATH (Homeobox-leucine zipper protein HDG12 OS=Arabidopsis thaliana GN=HDG12 PE=2 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 3.3e-102
Identity = 191/310 (61.61%), Positives = 236/310 (76.13%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EH E E++   H +F+D++H GLAFGAERW+ATLQR  ERF   +  ++SS DLGG
Sbjct: 394 VTWVEHGEFEEQEPIHEMFKDIVHKGLAFGAERWIATLQRMCERFTNLLEPATSSLDLGG 453

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS EG+RS+M+LA RMV+NFC S+ TS+  R T +SG DE G+ V+ HKS    + NG
Sbjct: 454 VIPSPEGKRSIMRLAHRMVSNFCLSVGTSNNTRSTVVSGLDEFGIRVTSHKSRH--EPNG 513

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           + L AAT+ WLP+ PQ +FNF K++RTR QWDVLS+GN VQEVAHITNGS+PGNCISVLR
Sbjct: 514 MVLCAATSFWLPISPQNVFNFLKDERTRPQWDVLSNGNSVQEVAHITNGSNPGNCISVLR 573

Query: 251 AMN--STQNNMLILQESCID-SSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILP 310
             N  S+QNNMLILQESCID SS +LV+Y PVDLPA+N+AMSG+D S IP+LPSGFAI P
Sbjct: 574 GFNASSSQNNMLILQESCIDSSSAALVIYTPVDLPALNIAMSGQDTSYIPILPSGFAISP 633

Query: 311 DGRQDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQI 370
           DG               ++ GGSL+TV FQI+VS L   KLN+ES+ TVNNLI+TTVHQI
Sbjct: 634 DG--------------SSKGGGSLITVGFQIMVSGLQPAKLNMESMETVNNLINTTVHQI 687

Query: 371 KTALNCHSSS 378
           KT LNC S++
Sbjct: 694 KTTLNCPSTA 687

BLAST of Cp4.1LG02g10010 vs. Swiss-Prot
Match: HDG2_ARATH (Homeobox-leucine zipper protein HDG2 OS=Arabidopsis thaliana GN=HDG2 PE=2 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 2.3e-87
Identity = 170/335 (50.75%), Positives = 231/335 (68.96%), Query Frame = 1

Query: 55  QTNEPLWMKSRSDG----------RDVTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERW 114
           Q N P   + R+ G            VTW+EHVEV+D+G  H L++ ++ +G AFGA+RW
Sbjct: 397 QPNPPARCRRRASGCLIQELPNGYSKVTWVEHVEVDDRG-VHNLYKHMVSTGHAFGAKRW 456

Query: 115 LATLQRTSERFACSMVTSSSSQDLGGVIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRW 174
           +A L R  ER A  M T+ SS ++G VI + EGRRSM+KLA+RMV +FCA +S S  H W
Sbjct: 457 VAILDRQCERLASVMATNISSGEVG-VITNQEGRRSMLKLAERMVISFCAGVSASTAHTW 516

Query: 175 TTLSGTDEVGVYVSVHKSMD-PGQRNGVALSAATTIWLPVPPQTIFNFFKNDRTRSQWDV 234
           TTLSGT    V V   KS+D PG+  G+ LSAAT+ W+PVPP+ +F+F +++ +R++WD+
Sbjct: 517 TTLSGTGAEDVRVMTRKSVDDPGRPPGIVLSAATSFWIPVPPKRVFDFLRDENSRNEWDI 576

Query: 235 LSDGNPVQEVAHITNGSHPGNCISVLR--AMNSTQNNMLILQESCIDSSGSLVVYCPVDL 294
           LS+G  VQE+AHI NG   GNC+S+LR  + NS+Q+NMLILQESC D + S V+Y PVD+
Sbjct: 577 LSNGGVVQEMAHIANGRDTGNCVSLLRVNSANSSQSNMLILQESCTDPTASFVIYAPVDI 636

Query: 295 PAMNLAMSGEDPSNIPLLPSGFAILPDGRQDQGEGASSSSDVHNRSGGSLVTVAFQILVS 354
            AMN+ ++G DP  + LLPSGFAILPDG  + G             GGSL+TVAFQILV 
Sbjct: 637 VAMNIVLNGGDPDYVALLPSGFAILPDGNANSGAPGG--------DGGSLLTVAFQILVD 696

Query: 355 SLPSGKLNLESVTTVNNLISTTVHQIKTALNCHSS 377
           S+P+ KL+L SV TVNNLI+ TV +IK +++C ++
Sbjct: 697 SVPTAKLSLGSVATVNNLIACTVERIKASMSCETA 721

BLAST of Cp4.1LG02g10010 vs. Swiss-Prot
Match: PDF2_ARATH (Homeobox-leucine zipper protein PROTODERMAL FACTOR 2 OS=Arabidopsis thaliana GN=PDF2 PE=2 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 3.0e-87
Identity = 171/310 (55.16%), Positives = 223/310 (71.94%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTWIEH+EV+D+ S H +++ L+ SGLAFGA+RW+ATL+R  ER A SM  S+   DL  
Sbjct: 431 VTWIEHMEVDDR-SVHNMYKPLVQSGLAFGAKRWVATLERQCERLASSMA-SNIPGDLS- 490

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMD-PGQRN 190
           VI S EGR+SM+KLA+RMV +FC+ +  S  H WTT+S T    V V   KSMD PG+  
Sbjct: 491 VITSPEGRKSMLKLAERMVMSFCSGVGASTAHAWTTMSTTGSDDVRVMTRKSMDDPGRPP 550

Query: 191 GVALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVL 250
           G+ LSAAT+ W+PV P+ +F+F +++ +R +WD+LS+G  VQE+AHI NG  PGNC+S+L
Sbjct: 551 GIVLSAATSFWIPVAPKRVFDFLRDENSRKEWDILSNGGMVQEMAHIANGHEPGNCVSLL 610

Query: 251 RAM--NSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILP 310
           R    NS+Q+NMLILQESC D+SGS V+Y PVD+ AMN+ +SG DP  + LLPSGFAILP
Sbjct: 611 RVNSGNSSQSNMLILQESCTDASGSYVIYAPVDIVAMNVVLSGGDPDYVALLPSGFAILP 670

Query: 311 DGRQDQGEGASSSSDVHNRS----GGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTT 370
           DG    G+G      V   S    GGSL+TVAFQILV S+P+ KL+L SV TVN+LI  T
Sbjct: 671 DGSVGGGDGNQHQEMVSTTSSGSCGGSLLTVAFQILVDSVPTAKLSLGSVATVNSLIKCT 730

Query: 371 VHQIKTALNC 374
           V +IK A++C
Sbjct: 731 VERIKAAVSC 737

BLAST of Cp4.1LG02g10010 vs. TrEMBL
Match: A0A0A0LVB8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G031750 PE=4 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 3.5e-159
Identity = 282/306 (92.16%), Positives = 294/306 (96.08%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTWIEHVEVED+GSTHWLFRDLIHSGLAFGAERWLATLQR SERFAC MVTSSS+QDLGG
Sbjct: 400 VTWIEHVEVEDRGSTHWLFRDLIHSGLAFGAERWLATLQRMSERFACLMVTSSSNQDLGG 459

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPSLEG+RSMMKLAQRMVNNFCASISTSHGHRWTTLSG +EVGV V+VHKS D GQ NG
Sbjct: 460 VIPSLEGKRSMMKLAQRMVNNFCASISTSHGHRWTTLSGMNEVGVRVTVHKSTDSGQPNG 519

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQTIFNFFKNDRTRSQWDVLS+GNPVQEVAHI+NGSHPGNCISVLR
Sbjct: 520 VVLSAATTIWLPVSPQTIFNFFKNDRTRSQWDVLSEGNPVQEVAHISNGSHPGNCISVLR 579

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
             N++QNNMLILQESCIDSSGSLVVYCPVDLPAMN+AMSGEDPS+IPLLPSGF ILPDGR
Sbjct: 580 GFNTSQNNMLILQESCIDSSGSLVVYCPVDLPAMNVAMSGEDPSSIPLLPSGFTILPDGR 639

Query: 311 QDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA 370
           +DQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA
Sbjct: 640 RDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA 699

Query: 371 LNCHSS 377
           LNCHSS
Sbjct: 700 LNCHSS 705

BLAST of Cp4.1LG02g10010 vs. TrEMBL
Match: A5BQ38_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_009450 PE=4 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 2.0e-135
Identity = 244/309 (78.96%), Positives = 269/309 (87.06%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EHVE+EDK  TH L+RDLIH GLAFGAERWLATLQR  ERFAC MV  +S++DLGG
Sbjct: 409 VTWVEHVEIEDKTPTHRLYRDLIHRGLAFGAERWLATLQRMCERFACLMVKGTSTRDLGG 468

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS +G+RSMMKLAQRMVNNFCASISTS+GHRWTTLSG +EVGV V++HK+ DPGQ NG
Sbjct: 469 VIPSPDGKRSMMKLAQRMVNNFCASISTSNGHRWTTLSGLNEVGVRVTIHKNTDPGQPNG 528

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQ +FNFF+++RTR QWDVLS+GN VQEVAHI NG HPGNCISVLR
Sbjct: 529 VVLSAATTIWLPVSPQNVFNFFRDERTRPQWDVLSNGNAVQEVAHIANGPHPGNCISVLR 588

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N++QNNMLILQESCIDSSGSLV+YCPVDLPA+N+AMSGEDPS IPLLPSGF I PDGR
Sbjct: 589 AFNTSQNNMLILQESCIDSSGSLVIYCPVDLPAINIAMSGEDPSYIPLLPSGFTISPDGR 648

Query: 311 QDQGEGASSSSDV---HNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQI 370
            DQG+GASSSS       RSGGSL+TV FQILVSSLPS KLNLESVTTVNNLI  TV QI
Sbjct: 649 LDQGDGASSSSSTTASMGRSGGSLITVVFQILVSSLPSAKLNLESVTTVNNLIGNTVQQI 708

Query: 371 KTALNCHSS 377
           K ALNC SS
Sbjct: 709 KAALNCPSS 717

BLAST of Cp4.1LG02g10010 vs. TrEMBL
Match: M5WD52_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015345m1g PE=4 SV=1)

HSP 1 Score: 483.4 bits (1243), Expect = 2.5e-133
Identity = 241/306 (78.76%), Positives = 269/306 (87.91%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           V+W+EHVE+EDK  TH L+RDLIHSGLAFGAERWLA LQR  ERFAC MV+ +S++DL G
Sbjct: 129 VSWVEHVEIEDKAPTHRLYRDLIHSGLAFGAERWLAALQRMCERFACLMVSGTSTRDLEG 188

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS EG+RSMMKLAQRMVNNFCASISTS+GHRWTTLSG +EVGV V++HKS DPGQ NG
Sbjct: 189 VIPSPEGKRSMMKLAQRMVNNFCASISTSNGHRWTTLSGMNEVGVRVTIHKSTDPGQPNG 248

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQ +FNFFK++RTR QWDVLS+ N VQEVAHI NGSHPGNCISVLR
Sbjct: 249 VVLSAATTIWLPVSPQNVFNFFKDERTRPQWDVLSNNNAVQEVAHIANGSHPGNCISVLR 308

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N++QNNML+LQESCIDSSGSLVVY PVDLP++N+AMSGEDPS IPLLPSGF I PDGR
Sbjct: 309 AFNTSQNNMLMLQESCIDSSGSLVVYSPVDLPSINIAMSGEDPSYIPLLPSGFTISPDGR 368

Query: 311 QDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA 370
            +QG+GAS+SS   + SGGSLVTVAFQILVSSLPS KLNLESV TVN LI TTV QIK A
Sbjct: 369 PEQGDGASTSSCNVHGSGGSLVTVAFQILVSSLPSAKLNLESVNTVNTLIGTTVQQIKAA 428

Query: 371 LNCHSS 377
           LNC+SS
Sbjct: 429 LNCNSS 434

BLAST of Cp4.1LG02g10010 vs. TrEMBL
Match: A0A067LF10_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17405 PE=4 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 2.1e-132
Identity = 240/310 (77.42%), Positives = 270/310 (87.10%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EHVE+EDK  TH L+RDLI+SG+AFGAERWLATLQR  ERFAC MV+ +S++DLGG
Sbjct: 403 VTWVEHVEIEDKTPTHRLYRDLIYSGMAFGAERWLATLQRMCERFACLMVSGTSTRDLGG 462

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS +G+RSMMKLAQRMVN+FCASISTS+ HRWTT+SG++EVGV   VHKS DPGQ NG
Sbjct: 463 VIPSPDGKRSMMKLAQRMVNSFCASISTSNRHRWTTVSGSNEVGV--RVHKSTDPGQPNG 522

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V L+AATT WLPV PQ +FNFFK++RTR+QWDVLS GN VQEVAHI NGSHPGNCISVLR
Sbjct: 523 VVLNAATTFWLPVSPQNVFNFFKDERTRAQWDVLSSGNAVQEVAHIANGSHPGNCISVLR 582

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N+ QNNMLILQESCIDSSGSLVVYCPVDLPA+N+AMSGEDPS IPLLPSGF I PDGR
Sbjct: 583 AFNTGQNNMLILQESCIDSSGSLVVYCPVDLPAINIAMSGEDPSYIPLLPSGFTISPDGR 642

Query: 311 QDQGEGASSSSDVH----NRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQ 370
            D G+GAS+SS  H     RS GSL+TV+FQILVSSLPS KLNLESVTTVNNLISTT+ Q
Sbjct: 643 LDHGDGASTSSSTHVSMGRRSSGSLITVSFQILVSSLPSAKLNLESVTTVNNLISTTIQQ 702

Query: 371 IKTALNCHSS 377
           IK A+NC SS
Sbjct: 703 IKAAMNCPSS 710

BLAST of Cp4.1LG02g10010 vs. TrEMBL
Match: W9RIV5_9ROSA (Homeobox-leucine zipper protein HDG11 OS=Morus notabilis GN=L484_025110 PE=4 SV=1)

HSP 1 Score: 478.4 bits (1230), Expect = 8.0e-132
Identity = 242/315 (76.83%), Positives = 272/315 (86.35%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EHVE+EDK  TH L+RDLIHSGLAFGA RWLATLQR  ERFAC MV+ +S++DLGG
Sbjct: 407 VTWVEHVEIEDKTPTHRLYRDLIHSGLAFGAGRWLATLQRMCERFACLMVSGASTRDLGG 466

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS +G++SMMKLAQRMVNNFCASISTS+GHRWTTLSG +EVGV V+VHKS DPGQ NG
Sbjct: 467 VIPSPDGKKSMMKLAQRMVNNFCASISTSNGHRWTTLSGLNEVGVRVTVHKSSDPGQPNG 526

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLP+ PQ +FNFFK++RTRSQWDVLS+GN VQEVAHI NGSHPGNCISVLR
Sbjct: 527 VVLSAATTIWLPISPQDVFNFFKDERTRSQWDVLSNGNAVQEVAHIANGSHPGNCISVLR 586

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N++QNNMLILQES ID+SGS+VVYCPVDLPA+N+AMSGEDPS IPLLPSGF I PDGR
Sbjct: 587 AFNTSQNNMLILQESSIDASGSVVVYCPVDLPAINIAMSGEDPSYIPLLPSGFTISPDGR 646

Query: 311 QDQ---GEGASSSS------DVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLIS 370
            +    G+GAS+SS      +   RS GSL+TVAFQILVSSLPS KLNLESVTTVNNLI 
Sbjct: 647 TEPGGGGDGASTSSAAVPPGNAMGRSSGSLITVAFQILVSSLPSAKLNLESVTTVNNLIG 706

Query: 371 TTVHQIKTALNCHSS 377
           TTV QIK ALNC +S
Sbjct: 707 TTVQQIKAALNCTNS 721

BLAST of Cp4.1LG02g10010 vs. TAIR10
Match: AT1G73360.1 (AT1G73360.1 homeodomain GLABROUS 11)

HSP 1 Score: 378.3 bits (970), Expect = 5.7e-105
Identity = 195/309 (63.11%), Positives = 238/309 (77.02%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EH+E E+K   H L+R++IH G+AFGA+RW+ TLQR  ERFA   V +SSS+DLGG
Sbjct: 414 VTWVEHIETEEKELVHELYREIIHRGIAFGADRWVTTLQRMCERFASLSVPASSSRDLGG 473

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VI S EG+RSMM+LAQRM++N+C S+S S+  R T +S  +EVG+ V+ HKS +P   NG
Sbjct: 474 VILSPEGKRSMMRLAQRMISNYCLSVSRSNNTRSTVVSELNEVGIRVTAHKSPEP---NG 533

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
             L AATT WLP  PQ +FNF K++RTR QWDVLS+GN VQEVAHI+NGSHPGNCISVLR
Sbjct: 534 TVLCAATTFWLPNSPQNVFNFLKDERTRPQWDVLSNGNAVQEVAHISNGSHPGNCISVLR 593

Query: 251 AMNST-QNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDG 310
             N+T  NNMLILQES  DSSG+ VVY PVDL A+N+AMSGEDPS IPLL SGF I PDG
Sbjct: 594 GSNATHSNNMLILQESSTDSSGAFVVYSPVDLAALNIAMSGEDPSYIPLLSSGFTISPDG 653

Query: 311 RQDQGE-GASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIK 370
                E G +S+S     + GSL+TV FQI+VS+LP+ KLN+ESV TVNNLI TTVHQIK
Sbjct: 654 NGSNSEQGGASTSSGRASASGSLITVGFQIMVSNLPTAKLNMESVETVNNLIGTTVHQIK 713

Query: 371 TALNCHSSS 378
           TAL+  ++S
Sbjct: 714 TALSGPTAS 719

BLAST of Cp4.1LG02g10010 vs. TAIR10
Match: AT1G17920.1 (AT1G17920.1 homeodomain GLABROUS 12)

HSP 1 Score: 373.2 bits (957), Expect = 1.8e-103
Identity = 191/310 (61.61%), Positives = 236/310 (76.13%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EH E E++   H +F+D++H GLAFGAERW+ATLQR  ERF   +  ++SS DLGG
Sbjct: 394 VTWVEHGEFEEQEPIHEMFKDIVHKGLAFGAERWIATLQRMCERFTNLLEPATSSLDLGG 453

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS EG+RS+M+LA RMV+NFC S+ TS+  R T +SG DE G+ V+ HKS    + NG
Sbjct: 454 VIPSPEGKRSIMRLAHRMVSNFCLSVGTSNNTRSTVVSGLDEFGIRVTSHKSRH--EPNG 513

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           + L AAT+ WLP+ PQ +FNF K++RTR QWDVLS+GN VQEVAHITNGS+PGNCISVLR
Sbjct: 514 MVLCAATSFWLPISPQNVFNFLKDERTRPQWDVLSNGNSVQEVAHITNGSNPGNCISVLR 573

Query: 251 AMN--STQNNMLILQESCID-SSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILP 310
             N  S+QNNMLILQESCID SS +LV+Y PVDLPA+N+AMSG+D S IP+LPSGFAI P
Sbjct: 574 GFNASSSQNNMLILQESCIDSSSAALVIYTPVDLPALNIAMSGQDTSYIPILPSGFAISP 633

Query: 311 DGRQDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQI 370
           DG               ++ GGSL+TV FQI+VS L   KLN+ES+ TVNNLI+TTVHQI
Sbjct: 634 DG--------------SSKGGGSLITVGFQIMVSGLQPAKLNMESMETVNNLINTTVHQI 687

Query: 371 KTALNCHSSS 378
           KT LNC S++
Sbjct: 694 KTTLNCPSTA 687

BLAST of Cp4.1LG02g10010 vs. TAIR10
Match: AT1G05230.1 (AT1G05230.1 homeodomain GLABROUS 2)

HSP 1 Score: 323.9 bits (829), Expect = 1.3e-88
Identity = 170/335 (50.75%), Positives = 231/335 (68.96%), Query Frame = 1

Query: 55  QTNEPLWMKSRSDG----------RDVTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERW 114
           Q N P   + R+ G            VTW+EHVEV+D+G  H L++ ++ +G AFGA+RW
Sbjct: 397 QPNPPARCRRRASGCLIQELPNGYSKVTWVEHVEVDDRG-VHNLYKHMVSTGHAFGAKRW 456

Query: 115 LATLQRTSERFACSMVTSSSSQDLGGVIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRW 174
           +A L R  ER A  M T+ SS ++G VI + EGRRSM+KLA+RMV +FCA +S S  H W
Sbjct: 457 VAILDRQCERLASVMATNISSGEVG-VITNQEGRRSMLKLAERMVISFCAGVSASTAHTW 516

Query: 175 TTLSGTDEVGVYVSVHKSMD-PGQRNGVALSAATTIWLPVPPQTIFNFFKNDRTRSQWDV 234
           TTLSGT    V V   KS+D PG+  G+ LSAAT+ W+PVPP+ +F+F +++ +R++WD+
Sbjct: 517 TTLSGTGAEDVRVMTRKSVDDPGRPPGIVLSAATSFWIPVPPKRVFDFLRDENSRNEWDI 576

Query: 235 LSDGNPVQEVAHITNGSHPGNCISVLR--AMNSTQNNMLILQESCIDSSGSLVVYCPVDL 294
           LS+G  VQE+AHI NG   GNC+S+LR  + NS+Q+NMLILQESC D + S V+Y PVD+
Sbjct: 577 LSNGGVVQEMAHIANGRDTGNCVSLLRVNSANSSQSNMLILQESCTDPTASFVIYAPVDI 636

Query: 295 PAMNLAMSGEDPSNIPLLPSGFAILPDGRQDQGEGASSSSDVHNRSGGSLVTVAFQILVS 354
            AMN+ ++G DP  + LLPSGFAILPDG  + G             GGSL+TVAFQILV 
Sbjct: 637 VAMNIVLNGGDPDYVALLPSGFAILPDGNANSGAPGG--------DGGSLLTVAFQILVD 696

Query: 355 SLPSGKLNLESVTTVNNLISTTVHQIKTALNCHSS 377
           S+P+ KL+L SV TVNNLI+ TV +IK +++C ++
Sbjct: 697 SVPTAKLSLGSVATVNNLIACTVERIKASMSCETA 721

BLAST of Cp4.1LG02g10010 vs. TAIR10
Match: AT4G04890.1 (AT4G04890.1 protodermal factor 2)

HSP 1 Score: 323.6 bits (828), Expect = 1.7e-88
Identity = 171/310 (55.16%), Positives = 223/310 (71.94%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTWIEH+EV+D+ S H +++ L+ SGLAFGA+RW+ATL+R  ER A SM  S+   DL  
Sbjct: 431 VTWIEHMEVDDR-SVHNMYKPLVQSGLAFGAKRWVATLERQCERLASSMA-SNIPGDLS- 490

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMD-PGQRN 190
           VI S EGR+SM+KLA+RMV +FC+ +  S  H WTT+S T    V V   KSMD PG+  
Sbjct: 491 VITSPEGRKSMLKLAERMVMSFCSGVGASTAHAWTTMSTTGSDDVRVMTRKSMDDPGRPP 550

Query: 191 GVALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVL 250
           G+ LSAAT+ W+PV P+ +F+F +++ +R +WD+LS+G  VQE+AHI NG  PGNC+S+L
Sbjct: 551 GIVLSAATSFWIPVAPKRVFDFLRDENSRKEWDILSNGGMVQEMAHIANGHEPGNCVSLL 610

Query: 251 RAM--NSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILP 310
           R    NS+Q+NMLILQESC D+SGS V+Y PVD+ AMN+ +SG DP  + LLPSGFAILP
Sbjct: 611 RVNSGNSSQSNMLILQESCTDASGSYVIYAPVDIVAMNVVLSGGDPDYVALLPSGFAILP 670

Query: 311 DGRQDQGEGASSSSDVHNRS----GGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTT 370
           DG    G+G      V   S    GGSL+TVAFQILV S+P+ KL+L SV TVN+LI  T
Sbjct: 671 DGSVGGGDGNQHQEMVSTTSSGSCGGSLLTVAFQILVDSVPTAKLSLGSVATVNSLIKCT 730

Query: 371 VHQIKTALNC 374
           V +IK A++C
Sbjct: 731 VERIKAAVSC 737

BLAST of Cp4.1LG02g10010 vs. TAIR10
Match: AT4G21750.1 (AT4G21750.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)

HSP 1 Score: 315.8 bits (808), Expect = 3.5e-86
Identity = 169/323 (52.32%), Positives = 223/323 (69.04%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EH+EV+D+ S H +++ L+++GLAFGA+RW+ATL R  ER A SM ++  + DL  
Sbjct: 439 VTWVEHIEVDDR-SVHNMYKPLVNTGLAFGAKRWVATLDRQCERLASSMASNIPACDLS- 498

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMD-PGQRN 190
           VI S EGR+SM+KLA+RMV +FC  +  S  H WTTLS T    V V   KSMD PG+  
Sbjct: 499 VITSPEGRKSMLKLAERMVMSFCTGVGASTAHAWTTLSTTGSDDVRVMTRKSMDDPGRPP 558

Query: 191 GVALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVL 250
           G+ LSAAT+ W+PV P+ +F+F +++ +RS+WD+LS+G  VQE+AHI NG  PGN +S+L
Sbjct: 559 GIVLSAATSFWIPVAPKRVFDFLRDENSRSEWDILSNGGLVQEMAHIANGRDPGNSVSLL 618

Query: 251 RAM--NSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILP 310
           R    NS Q+NMLILQESC D+SGS V+Y PVD+ AMN+ +SG DP  + LLPSGFAILP
Sbjct: 619 RVNSGNSGQSNMLILQESCTDASGSYVIYAPVDIIAMNVVLSGGDPDYVALLPSGFAILP 678

Query: 311 DGRQDQGEGASSSS-----------------DVHNRSGGSLVTVAFQILVSSLPSGKLNL 370
           DG    G G++++S                       GGSL+TVAFQILV S+P+ KL+L
Sbjct: 679 DGSARGGGGSANASAGAGVEGGGEGNNLEVVTTTGSCGGSLLTVAFQILVDSVPTAKLSL 738

Query: 371 ESVTTVNNLISTTVHQIKTALNC 374
            SV TVN+LI  TV +IK AL C
Sbjct: 739 GSVATVNSLIKCTVERIKAALAC 759

BLAST of Cp4.1LG02g10010 vs. NCBI nr
Match: gi|659068729|ref|XP_008445879.1| (PREDICTED: homeobox-leucine zipper protein HDG11-like [Cucumis melo])

HSP 1 Score: 571.2 bits (1471), Expect = 1.3e-159
Identity = 283/307 (92.18%), Positives = 295/307 (96.09%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTWIEHVEVED+GSTHWLFRDLIHSGLAFGAERWLATLQR SERFAC MVT SS+QDLGG
Sbjct: 400 VTWIEHVEVEDRGSTHWLFRDLIHSGLAFGAERWLATLQRMSERFACLMVTGSSNQDLGG 459

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPSLEG+RSMMKLAQRMVNNFCASISTSHGHRWTTLSG +EVGV V+VHKS D GQ NG
Sbjct: 460 VIPSLEGKRSMMKLAQRMVNNFCASISTSHGHRWTTLSGMNEVGVRVTVHKSTDSGQPNG 519

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHI+NGSHPGNCISVLR
Sbjct: 520 VVLSAATTIWLPVSPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHISNGSHPGNCISVLR 579

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N++QNNMLILQESCIDSSGSLVVYCPVDLPAMN+AMSGEDPS+IPLLPSGF ILPDGR
Sbjct: 580 AFNTSQNNMLILQESCIDSSGSLVVYCPVDLPAMNVAMSGEDPSSIPLLPSGFTILPDGR 639

Query: 311 QDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA 370
           +DQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA
Sbjct: 640 RDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA 699

Query: 371 LNCHSSS 378
           LNCHSS+
Sbjct: 700 LNCHSST 706

BLAST of Cp4.1LG02g10010 vs. NCBI nr
Match: gi|449439589|ref|XP_004137568.1| (PREDICTED: homeobox-leucine zipper protein HDG11-like isoform X1 [Cucumis sativus])

HSP 1 Score: 569.3 bits (1466), Expect = 5.0e-159
Identity = 282/306 (92.16%), Positives = 294/306 (96.08%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTWIEHVEVED+GSTHWLFRDLIHSGLAFGAERWLATLQR SERFAC MVTSSS+QDLGG
Sbjct: 400 VTWIEHVEVEDRGSTHWLFRDLIHSGLAFGAERWLATLQRMSERFACLMVTSSSNQDLGG 459

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPSLEG+RSMMKLAQRMVNNFCASISTSHGHRWTTLSG +EVGV V+VHKS D GQ NG
Sbjct: 460 VIPSLEGKRSMMKLAQRMVNNFCASISTSHGHRWTTLSGMNEVGVRVTVHKSTDSGQPNG 519

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQTIFNFFKNDRTRSQWDVLS+GNPVQEVAHI+NGSHPGNCISVLR
Sbjct: 520 VVLSAATTIWLPVSPQTIFNFFKNDRTRSQWDVLSEGNPVQEVAHISNGSHPGNCISVLR 579

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
             N++QNNMLILQESCIDSSGSLVVYCPVDLPAMN+AMSGEDPS+IPLLPSGF ILPDGR
Sbjct: 580 GFNTSQNNMLILQESCIDSSGSLVVYCPVDLPAMNVAMSGEDPSSIPLLPSGFTILPDGR 639

Query: 311 QDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA 370
           +DQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA
Sbjct: 640 RDQGEGASSSSDVHNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQIKTA 699

Query: 371 LNCHSS 377
           LNCHSS
Sbjct: 700 LNCHSS 705

BLAST of Cp4.1LG02g10010 vs. NCBI nr
Match: gi|147856728|emb|CAN83483.1| (hypothetical protein VITISV_009450 [Vitis vinifera])

HSP 1 Score: 490.3 bits (1261), Expect = 2.9e-135
Identity = 244/309 (78.96%), Positives = 269/309 (87.06%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EHVE+EDK  TH L+RDLIH GLAFGAERWLATLQR  ERFAC MV  +S++DLGG
Sbjct: 409 VTWVEHVEIEDKTPTHRLYRDLIHRGLAFGAERWLATLQRMCERFACLMVKGTSTRDLGG 468

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS +G+RSMMKLAQRMVNNFCASISTS+GHRWTTLSG +EVGV V++HK+ DPGQ NG
Sbjct: 469 VIPSPDGKRSMMKLAQRMVNNFCASISTSNGHRWTTLSGLNEVGVRVTIHKNTDPGQPNG 528

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQ +FNFF+++RTR QWDVLS+GN VQEVAHI NG HPGNCISVLR
Sbjct: 529 VVLSAATTIWLPVSPQNVFNFFRDERTRPQWDVLSNGNAVQEVAHIANGPHPGNCISVLR 588

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N++QNNMLILQESCIDSSGSLV+YCPVDLPA+N+AMSGEDPS IPLLPSGF I PDGR
Sbjct: 589 AFNTSQNNMLILQESCIDSSGSLVIYCPVDLPAINIAMSGEDPSYIPLLPSGFTISPDGR 648

Query: 311 QDQGEGASSSSDV---HNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQI 370
            DQG+GASSSS       RSGGSL+TV FQILVSSLPS KLNLESVTTVNNLI  TV QI
Sbjct: 649 LDQGDGASSSSSTTASMGRSGGSLITVVFQILVSSLPSAKLNLESVTTVNNLIGNTVQQI 708

Query: 371 KTALNCHSS 377
           K ALNC SS
Sbjct: 709 KAALNCPSS 717

BLAST of Cp4.1LG02g10010 vs. NCBI nr
Match: gi|225464265|ref|XP_002271012.1| (PREDICTED: homeobox-leucine zipper protein HDG11-like isoform X2 [Vitis vinifera])

HSP 1 Score: 490.3 bits (1261), Expect = 2.9e-135
Identity = 244/309 (78.96%), Positives = 269/309 (87.06%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EHVE+EDK  TH L+RDLIH GLAFGAERWLATLQR  ERFAC MV  +S++DLGG
Sbjct: 407 VTWVEHVEIEDKTPTHRLYRDLIHRGLAFGAERWLATLQRMCERFACLMVKGTSTRDLGG 466

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS +G+RSMMKLAQRMVNNFCASISTS+GHRWTTLSG +EVGV V++HK+ DPGQ NG
Sbjct: 467 VIPSPDGKRSMMKLAQRMVNNFCASISTSNGHRWTTLSGLNEVGVRVTIHKNTDPGQPNG 526

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQ +FNFF+++RTR QWDVLS+GN VQEVAHI NG HPGNCISVLR
Sbjct: 527 VVLSAATTIWLPVSPQNVFNFFRDERTRPQWDVLSNGNAVQEVAHIANGPHPGNCISVLR 586

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N++QNNMLILQESCIDSSGSLV+YCPVDLPA+N+AMSGEDPS IPLLPSGF I PDGR
Sbjct: 587 AFNTSQNNMLILQESCIDSSGSLVIYCPVDLPAINIAMSGEDPSYIPLLPSGFTISPDGR 646

Query: 311 QDQGEGASSSSDV---HNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQI 370
            DQG+GASSSS       RSGGSL+TV FQILVSSLPS KLNLESVTTVNNLI  TV QI
Sbjct: 647 LDQGDGASSSSSTTASMGRSGGSLITVVFQILVSSLPSAKLNLESVTTVNNLIGNTVQQI 706

Query: 371 KTALNCHSS 377
           K ALNC SS
Sbjct: 707 KAALNCPSS 715
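
Each HSP header pairs a bit score with an Expect value. Under the standard Karlin-Altschul statistics the two are related by E = N * 2^(-S), where S is the bit score and N is the effective search space (query length times database length, after length corrections). The page does not state the search space used for this analysis, so the sketch below only shows the relation and back-calculates an approximate N from the reported numbers. The function names are hypothetical, and the printed bit score is rounded, so the result is a rough estimate.

```python
import math


def evalue_from_bits(bit_score: float, search_space: float) -> float:
    """Expected number of chance hits: E = N * 2**(-S)."""
    return search_space * 2.0 ** (-bit_score)


def implied_search_space(bit_score: float, evalue: float) -> float:
    """Invert the relation to estimate N from a reported (S, E) pair."""
    return evalue * 2.0 ** bit_score


# Values taken from the HSP header above; N is an estimate only.
n_est = implied_search_space(490.3, 2.9e-135)
print(f"implied effective search space ~ 10^{math.log10(n_est):.1f}")

# For a fixed search space, one extra bit of score halves the E-value.
print(evalue_from_bits(491.3, n_est) / evalue_from_bits(490.3, n_est))
```

Because the search space is the same for every hit from one database run, hits with identical bit scores (such as the three Vitis vinifera entries here) necessarily report identical Expect values.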

BLAST of Cp4.1LG02g10010 vs. NCBI nr
Match: gi|731427324|ref|XP_010663934.1| (PREDICTED: homeobox-leucine zipper protein HDG11-like isoform X1 [Vitis vinifera])

HSP 1 Score: 490.3 bits (1261), Expect = 2.9e-135
Identity = 244/309 (78.96%), Positives = 269/309 (87.06%), Query Frame = 1

Query: 71  VTWIEHVEVEDKGSTHWLFRDLIHSGLAFGAERWLATLQRTSERFACSMVTSSSSQDLGG 130
           VTW+EHVE+EDK  TH L+RDLIH GLAFGAERWLATLQR  ERFAC MV  +S++DLGG
Sbjct: 411 VTWVEHVEIEDKTPTHRLYRDLIHRGLAFGAERWLATLQRMCERFACLMVKGTSTRDLGG 470

Query: 131 VIPSLEGRRSMMKLAQRMVNNFCASISTSHGHRWTTLSGTDEVGVYVSVHKSMDPGQRNG 190
           VIPS +G+RSMMKLAQRMVNNFCASISTS+GHRWTTLSG +EVGV V++HK+ DPGQ NG
Sbjct: 471 VIPSPDGKRSMMKLAQRMVNNFCASISTSNGHRWTTLSGLNEVGVRVTIHKNTDPGQPNG 530

Query: 191 VALSAATTIWLPVPPQTIFNFFKNDRTRSQWDVLSDGNPVQEVAHITNGSHPGNCISVLR 250
           V LSAATTIWLPV PQ +FNFF+++RTR QWDVLS+GN VQEVAHI NG HPGNCISVLR
Sbjct: 531 VVLSAATTIWLPVSPQNVFNFFRDERTRPQWDVLSNGNAVQEVAHIANGPHPGNCISVLR 590

Query: 251 AMNSTQNNMLILQESCIDSSGSLVVYCPVDLPAMNLAMSGEDPSNIPLLPSGFAILPDGR 310
           A N++QNNMLILQESCIDSSGSLV+YCPVDLPA+N+AMSGEDPS IPLLPSGF I PDGR
Sbjct: 591 AFNTSQNNMLILQESCIDSSGSLVIYCPVDLPAINIAMSGEDPSYIPLLPSGFTISPDGR 650

Query: 311 QDQGEGASSSSDV---HNRSGGSLVTVAFQILVSSLPSGKLNLESVTTVNNLISTTVHQI 370
            DQG+GASSSS       RSGGSL+TV FQILVSSLPS KLNLESVTTVNNLI  TV QI
Sbjct: 651 LDQGDGASSSSSTTASMGRSGGSLITVVFQILVSSLPSAKLNLESVTTVNNLIGNTVQQI 710

Query: 371 KTALNCHSS 377
           K ALNC SS
Sbjct: 711 KAALNCPSS 719

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
ROC8_ORYSJ    4.9e-106   63.58     Homeobox-leucine zipper protein ROC8 OS=Oryza sativa subsp. japonica GN=ROC8 PE=...
HDG11_ARATH   1.0e-103   63.11     Homeobox-leucine zipper protein HDG11 OS=Arabidopsis thaliana GN=HDG11 PE=1 SV=1
HDG12_ARATH   3.3e-102   61.61     Homeobox-leucine zipper protein HDG12 OS=Arabidopsis thaliana GN=HDG12 PE=2 SV=1
HDG2_ARATH    2.3e-87    50.75     Homeobox-leucine zipper protein HDG2 OS=Arabidopsis thaliana GN=HDG2 PE=2 SV=1
PDF2_ARATH    3.0e-87    55.16     Homeobox-leucine zipper protein PROTODERMAL FACTOR 2 OS=Arabidopsis thaliana GN=...
Match Name         E-value    Identity  Description
A0A0A0LVB8_CUCSA   3.5e-159   92.16     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G031750 PE=4 SV=1
A5BQ38_VITVI       2.0e-135   78.96     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_009450 PE=4 SV=1
M5WD52_PRUPE       2.5e-133   78.76     Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015345m1g PE=4 ...
A0A067LF10_JATCU   2.1e-132   77.42     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17405 PE=4 SV=1
W9RIV5_9ROSA       8.0e-132   76.83     Homeobox-leucine zipper protein HDG11 OS=Morus notabilis GN=L484_025110 PE=4 SV=...
Match Name    E-value    Identity  Description
AT1G73360.1   5.7e-105   63.11     homeodomain GLABROUS 11
AT1G17920.1   1.8e-103   61.61     homeodomain GLABROUS 12
AT1G05230.1   1.3e-88    50.75     homeodomain GLABROUS 2
AT4G04890.1   1.7e-88    55.16     protodermal factor 2
AT4G21750.1   3.5e-86    52.32     Homeobox-leucine zipper family protein / lipid-binding START domain-...
Match Name                         E-value    Identity  Description
gi|659068729|ref|XP_008445879.1|   1.3e-159   92.18     PREDICTED: homeobox-leucine zipper protein HDG11-like [Cucumis melo]
gi|449439589|ref|XP_004137568.1|   5.0e-159   92.16     PREDICTED: homeobox-leucine zipper protein HDG11-like isoform X1 [Cucumis sativu...
gi|147856728|emb|CAN83483.1|       2.9e-135   78.96     hypothetical protein VITISV_009450 [Vitis vinifera]
gi|225464265|ref|XP_002271012.1|   2.9e-135   78.96     PREDICTED: homeobox-leucine zipper protein HDG11-like isoform X2 [Vitis vinifera...
gi|731427324|ref|XP_010663934.1|   2.9e-135   78.96     PREDICTED: homeobox-leucine zipper protein HDG11-like isoform X1 [Vitis vinifera...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008289  lipid binding
Vocabulary: INTERPRO
Term        Definition
IPR002913   START_lipid-bd_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005634      nucleus
molecular_function  GO:0008289      lipid binding
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003677      DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g10010.1  Cp4.1LG02g10010.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term      Source Description                     Alignment
IPR002913  START domain      PFAM     PF01852          START                                  coord: 71..113; score: 2.
IPR002913  START domain      PROFILE  PS50848          START                                  coord: 32..236; score: 1
None       No IPR available  PANTHER  PTHR24326        FAMILY NOT NAMED                       coord: 90..372; score: 3.3E
None       No IPR available  PANTHER  PTHR24326:SF236  HOMEOBOX-LEUCINE ZIPPER PROTEIN HDG12  coord: 90..372; score: 3.3E
None       No IPR available  unknown  SSF55961         Bet v1-like                            coord: 136..368; score: 2.2