Cp4.1LG03g15880 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g15880
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Homeobox leucine zipper family protein
Location: Cp4.1LG03 : 13535819 .. 13541714 (-)
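
The location string above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which the page itself does not state.

```python
import re


def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumption: START and END are 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span on the reference sequence.
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG03 : 13535819 .. 13541714 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the feature spans 5896 bp on the minus strand of Cp4.1LG03.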
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGAAGTTGTTAAATAATAAAACTGCCAATTTAAATATTATGTTTGTTTAATAATAATAATATTATAAAAAAAATGGGATAACCGGGTCTGGCAGAGTCCTAGTTTGGTTCACAATTTTTTTTCTCAACAACATCCTAACTATTTCCCCCTCAATTTAATATAAAATGTCAATTGTGACTTTGAATTTCAATTTGGTCGACTTAAATAAATGTAGCAATGTCTACCTTTGTTAAATTTCTTATAGCTTTTCACAACAATTTTTTATTTTTAAACAAAAGAAGTAAGAATATCACAAATTGTACAACTGTTTTTAAAAAATAAAAAAATAAAAAAAAAATAGTTGAGTAATTTTGTTACTAATTCAATACAATATACACTCTTTAGGTCTGTCATCTCTAAAATCACACATAAAAATATTTTGTTCCAGTAAATTACACAAATTTCAAGATTTCTAGGAGTCAAAAAAACGTTTTTAAAATTTTAAATACAAAAACTAAATTACTTATCAAATGAACTTTCCTTTTTTAAAAAAACAGAAAATTAAAACCTAAACTAAAAGAAAAACTAATTAGGTATTAAAGGGGCTCTAAATGAACCCGTAAAACAAAGATTAAGCTATGAAGTATCAAGTACCTCATTATTTTCCCCCTGCCTCAAGTTGTCTCAAGGTCTGGTCTACGGCTTGGCTTCAACTGTATAGTCAAATAACGTAAAAAGATTGGTAAATACTTTTAATTAAAAGCTATGCACAATATATTCTGAAACAAATTAGAAAAACAAAATACTTGTAATATACATCTAATTACATTCACAAGGCTACTCAGATCCTATTACAAAAAAGAAAAACAAATCATAAAATACGATACTTTTTCTTTTTTCCTGAATACACACACACACACACACAAATTAAAAATAAAACAAAACAAAAATCCGAGGAAGCAAAGCAAATGCCAATAGAATTATCCGAAACAGAGGAGAAAATAATAACCTTCCCTAGCCAAGAAAGAAATTCTCTCTTTCTATCAAGGACCCCACCAGACTTGGATATAAATGATGCATAAAATAAGTGAGCAAATATATTAGAGATATTGCAAATCATTCACTCTCCCAACACCTGGTTTTGGAATAAAGCCTTGTGGTGTCCAAATTGAATCTTAAGCAGCATGCTAAATGATGATGATGCCGAATATTCTCCTCCAGAATCCATGGCAGAGGCTTTTGGCATGAGAAAAAAAAGCATGAACAGAAGAAGGTTCAGTGAAGAGCAGATCAAATCATTGGAGTCAATTTTCGAGTCTGAGTCAAGGCTTGAGCCCAGAAAGAAGTTGCAGCTAGCGGGAGAGCTGGGGTTGCATCCACGCCAGGTTGCAATATGGTTTCAAAACAAGAGAGCTAGATGGAAGTCAAAGCAGCTTGAACGAGACTACAGCGTTTTACGAGCTAATTACAACACTCTAGTTTCCCGGTTTGAAGCTCTTAAGAAGGAAAAGCAAGCTTTGGCTATACAGGTATTTGTTCTTTGTTTCTATACAAAAACACCGTCCGTTTCTAGTTTTTGCTAATCATTGTTATTTCTTTACTTCGGCCAGTTGCAGAAGCTAAATGATCTCGTGCAGAGGTCCATGGAGGAAACTGAGAGCTGCAGGGGAGGTCTCTCCTTAGAAACTATTGATGGCAAGTCTGAAAATGGCCACAGAACAAAGTATGAGTCTGAAGTAAAACCTTGTGTGTCAGCTGAGGAGAAATCAGAACATGAACTAGAGGTTCTTTCCAACTATGGTAGTGGCACAAAGGAAGCCTATCTTGGATTAGACGAGCCCCAGTTCAGGGAACCTACTCAGGGCTCCTTGATATCAACACCAAATTGGAGCAATTTAGAGTCTGAAGGGCTTTTCAGTCAGTCCAATACCAATGGCCAGTGGTGGGACTTCTGGTCGTGAAAANAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAC
CATAAAGGATCTGTAAACAAATAAAATTCCAATCTTTTTTTTTAAATAGATCATCCTAGCAATGATTTCGGCTCAAAGGACGAGGAACAGTTCAGAGCATAGCGTCTTCTATCAAGGATAGCAATGATTGTAATGTGGTTGTTTCAAGCACTGATGCTCCAAAATTCACCATAACTTCAGATGGAAGTCAGAAAATATTCTGTGTATAACCAAAAAATTCGGTGAGCAACGCTAGGAAATACCACCGGCAAAGAGTGCTGCCTTGGAAAACTGAAACAATATGCTCATATGCAGACAGGGGTGCAGTCTGGAGTGCTTGCAGTGTGTGAGAGTTCCTTTCTTNTTTTTTTTTTTTTTTTTTTCTCTTCCCTTTTCGTTGTTTGGTTGGTTGGGGATAGGGATGGGGGGTGACTATGCAACTCTATATTCATATGCTCAGTTATGAACTAGAAGTAGTGCTCACTTTACACAAAGATTCTCAGTTTTGTGGTGCTTCTTTGGTATTAGTGTCATCTCACGAAAGATTTAACCCAGCACATAAGGAAACCAAAGGAATAATGTCAGTAAACTCTCGGATTTTAATGGTCGAATTTGATATCATGGAGATTATTCCTATGCTGAAAATAGCTAGAAGATGTCCGTCTTTTCCATCGTAGGTTCTTCAGAAGATCTTTTAATCATGTAAGATATGCTAAACATCATCTAAAACGTAAACAGTAATAGCAAGTCAATGTAACTTCATATTCGGCCACCTAAAAACTAAAAGGCTTAGATAAACTAACCCTCAAAAAATCCTAGTCGGAGAAGGCTTGAGAATGCTAATCAGCGAATGGTTGATATAAAACTTTACATCATGAATTTGAAGTCTTGTTTTCTATCAACAAAATGCAAAAATGAGCCCTGTACTTCATCATTTTGCTAGTACACACTTCATCCAATTTCTGAATCCTCTTCTAGCCCTCAAATTTAATGACCAGTCCCACTTCATCCTTCCATCTACGTCAACTATTATTTATGAGCAGACAAGATTTATCCACTGGCAACATTTTTCTTAAATTAAAAGCACATTCGTCCCCTTTCTTCAATTTCAACTAGCTAAGTTCATGTTAACTACAGATAAGAAGGTAAGTTTTACAGAATTACTTGATAAGAACATAATAACTCTTGTTCTTGAAAGCCAATGGCAGAAGTAAGATTTTGAAACAAATTTTTTATGACCCACCTTGAAAATTAAGTTTCATTTTCTTTTTAATAACATTAACCGAGTGGAAAAGAACTCGCGCTTGAAAACAAGTGGTCTCCCCCCTTAACATTAAGTGCTCTGCTCACCGCCTGTACTAGATCAAACTTTTATGAACGCTCAGTGCTCACCATAATTTCTAGGGTTCACGGAGAAACTCAATGAAGGCAGCATTTATCGCTATAGTGAGTTCAATTCTAGGACGAATTAAATGCCACTTTCACGGCCTAGATTCGAGTCTCCCAATTCTATATGTAGAATCCCAACCTACATCGTTATTTGTTTACTTAAATCATACTCTTATCTTTCTAAGTTAGAAAGAAAATGCAAGTAAAACCAACGCAATAACAACAGTTAAGCACAACTTAATCGGTATTCCTTAAAGAAATTTCATGATAATAAAACTTGAACTGAATTTGAAAATGTCGGAAGAAGAGGATCAAACCTTTAGCATTGGGAGCTTCTGTTCTTTCGATCTTAGAAACAGGAGGCCGAAGCTCCACGATTTCCCGCTGGAATTGGCAATATAAACGCGCGGTTCTGAACCTAATGCAGACGATCACTGCCAGATCCCAAATCGTCAATTAAAAAAAAAAAAAAATCAGAAATCCTCAATTATTTGGCGGTACTTATTTTACTTTTTAAAATGAATTTTGAGAGAATTTGAAAATAATAATAATAATATGTTTTTTTCAGAAATTGTAAAGTTTTTCCATATTTTCATAGACAATTATGTAAGTTGCCTCTAAGACCAAAAAGG
TAATTTAATGAAATAAATAAATAAATAAATAGTTATTTAAAAAAAATGAAAAAAAGAAAAAAAAAAAAAAAAAACAAAAACAAAAACAATGAAGCCGTTTGACTATGGTACCAATTAACCGCACTGCCCCTCGCTTCGGCTGATTCTCCTGCACAGAGACGACAGACGAGCGAGTAATGAGCGACCTCTCCATTGCAGCTCCCACCGTAAAACACCACCATTACACCGCTTGCAACCCCATTCCCTCCGCCGCTCTCTACATTCTCATACCCCTTTTCATTTTGGGATTTTCCGTCTCCATTTTCGTCCTCGCCGTGGTTCACAACGCCTTCTTCTTCGTCTCTATCCTCCTTCTAGCCATCTTTCTCTCTGCTTTCGCCCTCTGGAATAGCCTCTACTTCTCCTCCAAAGCTGCAATGCTCTCATTTCTTCACTCTTTTCCCGACTCCGACCTCACACTTGCCTGCGAAGGTCAACTCGTTAAGATTACAGGGGTATTCCTTCTCTAATTTCTTCTCCCTTTTCAGCTTAATGGGTTATCAGATTTTCGTGAGTAGGTTGTATTTTCTCTTTTCAATAGCCACCGCTCATGGGTGTGATTTGTTTCTGCATATTTTTGTTTTCCCTTTCAGCAATTCGTATGAGAGTTTCTGAAAATGTTTCCGTTGAGTTGATGATTGTAAAACGCTTGCTTGTGACATTAACATTTTGGAGACTCAAACCATTGACATAGCTGATGTTAGCTCTTTGGGATTTCATTTTATTGTTGTTCTTAAGTACCCTGCAGATGAATTGGAGGATTAATGTTGTTTGCTGTTTCATATTCAGTTTGCTTCGTGCGGGAGTGTCTCCCTTGAATCATCCTATGAAAAGGCTAGTGGGTGTATTTATGCTTCTACTTCTCTATATGAATATAGAGGAATGTCTCTGGTTTTTCAAAAGGTCACCCAACCTTACTGTGGTTGGAAGTTAGCATACAGTGAGGTATGCAAATTCCACTCCCAACAAACGTTTCGTTTCTTTGTTTCTATTGTTTTCTTGCAACAATGTCACTTTATGTTGTTTGGAAGCTACTGCATGAAGTGAAAGAAATGTCATTTAGCTCCATTTTTTCTTTATGGTTCAAATGTCTCCAGTTGTATTCATTATCTTGACATCCTTATCTGCTCATTTTGGGATTTTTATTTGACCTTGATATCGGTTTTTGTTATGATGGGTATGTGTTTAAGATTATTAATACCGCAGTGCTGGTATTTCTACCTGGCAGAGGTTCTCCACGGATTTCTACATAACTGATAGAAAAAGTGGTATTAGAGCTATGGTTCAGGCTGGCCCAGGTAGTAAACTGGTTCCTCTTATTATTGAGAGCAAGCTTGTTAATACTACCAGACACCGTAAGATTCTGTCCTCTTCCTTGAGAAAATGGCTGAGAGACCGAAACCTTTCTACCGAAGCTCGAGTGCTTCGACTTGAAGAGGGGTAACCGCTTGTCGTGATAGCACCTGTTTGTTTGACTATGTTTGGCCGTAATTTCAGTAGCTAATTCCATGTGATTTTAGCAGATATGTTCAGGAAGGGAGCTTGGTGTCTGTAATGGGAATGTTGCATAAGAGTAACGGTCATTTAACAGTAGTTCAACCTCCAGATGCAATCTCTACCGGATGTTCATGGCGGAAACTTCTTCTTCCCATTGATATTGATGGACTAGTCCTTGGGGTCTCACAGATGTCTGGCCCTTCGCTCGTTCGGGGATCGTTATGTCAGCAAGAACAGTTGGCTGATATATGAGACTTGAACAGTATGTTGTAGATGGAAATTTTAATCTTCCTAACCTTTTGCTTATCATTAGTAATTTTGATCTTCTTTCAGTTCATCTATCTCTGCGCCTTTCGTTTC

mRNA sequence

ATGAACATGCTAAATGATGATGATGCCGAATATTCTCCTCCAGAATCCATGGCAGAGGCTTTTGGCATGAGAAAAAAAAGCATGAACAGAAGAAGGTTCAGTGAAGAGCAGATCAAATCATTGGAGTCAATTTTCGAGTCTGAGTCAAGGCTTGAGCCCAGAAAGAAGTTGCAGCTAGCGGGAGAGCTGGGGTTGCATCCACGCCAGGTTGCAATATGGTTTCAAAACAAGAGAGCTAGATGGAAGTCAAAGCAGCTTGAACGAGACTACAGCGTTTTACGAGCTAATTACAACACTCTAGTTTCCCGGTTTGAAGCTCTTAAGAAGGAAAAGCAAGCTTTGGCTATACAGTTGCAGAAGCTAAATGATCTCGTGCAGAGGTCCATGGAGGAAACTGAGAGCTGCAGGGGAGGTCTCTCCTTAGAAACTATTGATGGCAAGTCTGAAAATGGCCACAGAACAAAGTATGAGTCTGAAGTAAAACCTTGTGTGTCAGCTGAGGAGAAATCAGAACATGAACTAGAGGTTCTTTCCAACTATGGTAGTGGCACAAAGGAAGCCTATCTTGGATTAGACGAGCCCCATTATGAACTAGAAGTAGTGCTCACTTTACACAAAGATTCTCAGTTTTGTGGTGCTTCTTTGAAAGAAAATGCAAGTAAAACCAACGCAATAACAACAGTTAAGCACAACTTAATCGAGACGACAGACGAGCGAGTAATGAGCGACCTCTCCATTGCAGCTCCCACCGTAAAACACCACCATTACACCGCTTGCAACCCCATTCCCTCCGCCGCTCTCTACATTCTCATACCCCTTTTCATTTTGGGATTTTCCGTCTCCATTTTCGTCCTCGCCGTGGTTCACAACGCCTTCTTCTTCGTCTCTATCCTCCTTCTAGCCATCTTTCTCTCTGCTTTCGCCCTCTGGAATAGCCTCTACTTCTCCTCCAAAGCTGCAATGCTCTCATTTCTTCACTCTTTTCCCGACTCCGACCTCACACTTGCCTGCGAAGGTCAACTCGTTAAGATTACAGGGTTTGCTTCGTGCGGGAGTGTCTCCCTTGAATCATCCTATGAAAAGGCTAGTGGGTGTATTTATGCTTCTACTTCTCTATATGAATATAGAGGAATGTCTCTGGTTTTTCAAAAGGTCACCCAACCTTACTGTGGTTGGAAGTTAGCATACAGTGAGAGGTTCTCCACGGATTTCTACATAACTGATAGAAAAAGTGGTATTAGAGCTATGGTTCAGGCTGGCCCAGGTAGTAAACTGGTTCCTCTTATTATTGAGAGCAAGCTTGTTAATACTACCAGACACCGTAAGATTCTGTCCTCTTCCTTGAGAAAATGGCTGAGAGACCGAAACCTTTCTACCGAAGCTCGAGTGCTTCGACTTGAAGAGGGATATGTTCAGGAAGGGAGCTTGGTGTCTGTAATGGGAATGTTGCATAAGAGTAACGGTCATTTAACAGTAGTTCAACCTCCAGATGCAATCTCTACCGGATGTTCATGGCGGAAACTTCTTCTTCCCATTGATATTGATGGACTAGTCCTTGGGGTCTCACAGATGTCTGGCCCTTCGCTCGTTCGGGGATCGTTATGTCAGCAAGAACAGTTGGCTGATATATGAGACTTGAACAGTATGTTGTAGATGGAAATTTTAATCTTCCTAACCTTTTGCTTATCATTAGTAATTTTGATCTTCTTTCAGTTCATCTATCTCTGCGCCTTTCGTTTC

Coding sequence (CDS)

ATGAACATGCTAAATGATGATGATGCCGAATATTCTCCTCCAGAATCCATGGCAGAGGCTTTTGGCATGAGAAAAAAAAGCATGAACAGAAGAAGGTTCAGTGAAGAGCAGATCAAATCATTGGAGTCAATTTTCGAGTCTGAGTCAAGGCTTGAGCCCAGAAAGAAGTTGCAGCTAGCGGGAGAGCTGGGGTTGCATCCACGCCAGGTTGCAATATGGTTTCAAAACAAGAGAGCTAGATGGAAGTCAAAGCAGCTTGAACGAGACTACAGCGTTTTACGAGCTAATTACAACACTCTAGTTTCCCGGTTTGAAGCTCTTAAGAAGGAAAAGCAAGCTTTGGCTATACAGTTGCAGAAGCTAAATGATCTCGTGCAGAGGTCCATGGAGGAAACTGAGAGCTGCAGGGGAGGTCTCTCCTTAGAAACTATTGATGGCAAGTCTGAAAATGGCCACAGAACAAAGTATGAGTCTGAAGTAAAACCTTGTGTGTCAGCTGAGGAGAAATCAGAACATGAACTAGAGGTTCTTTCCAACTATGGTAGTGGCACAAAGGAAGCCTATCTTGGATTAGACGAGCCCCATTATGAACTAGAAGTAGTGCTCACTTTACACAAAGATTCTCAGTTTTGTGGTGCTTCTTTGAAAGAAAATGCAAGTAAAACCAACGCAATAACAACAGTTAAGCACAACTTAATCGAGACGACAGACGAGCGAGTAATGAGCGACCTCTCCATTGCAGCTCCCACCGTAAAACACCACCATTACACCGCTTGCAACCCCATTCCCTCCGCCGCTCTCTACATTCTCATACCCCTTTTCATTTTGGGATTTTCCGTCTCCATTTTCGTCCTCGCCGTGGTTCACAACGCCTTCTTCTTCGTCTCTATCCTCCTTCTAGCCATCTTTCTCTCTGCTTTCGCCCTCTGGAATAGCCTCTACTTCTCCTCCAAAGCTGCAATGCTCTCATTTCTTCACTCTTTTCCCGACTCCGACCTCACACTTGCCTGCGAAGGTCAACTCGTTAAGATTACAGGGTTTGCTTCGTGCGGGAGTGTCTCCCTTGAATCATCCTATGAAAAGGCTAGTGGGTGTATTTATGCTTCTACTTCTCTATATGAATATAGAGGAATGTCTCTGGTTTTTCAAAAGGTCACCCAACCTTACTGTGGTTGGAAGTTAGCATACAGTGAGAGGTTCTCCACGGATTTCTACATAACTGATAGAAAAAGTGGTATTAGAGCTATGGTTCAGGCTGGCCCAGGTAGTAAACTGGTTCCTCTTATTATTGAGAGCAAGCTTGTTAATACTACCAGACACCGTAAGATTCTGTCCTCTTCCTTGAGAAAATGGCTGAGAGACCGAAACCTTTCTACCGAAGCTCGAGTGCTTCGACTTGAAGAGGGATATGTTCAGGAAGGGAGCTTGGTGTCTGTAATGGGAATGTTGCATAAGAGTAACGGTCATTTAACAGTAGTTCAACCTCCAGATGCAATCTCTACCGGATGTTCATGGCGGAAACTTCTTCTTCCCATTGATATTGATGGACTAGTCCTTGGGGTCTCACAGATGTCTGGCCCTTCGCTCGTTCGGGGATCGTTATGTCAGCAAGAACAGTTGGCTGATATATGA

Protein sequence

MNMLNDDDAEYSPPESMAEAFGMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSNYGSGTKEAYLGLDEPHYELEVVLTLHKDSQFCGASLKENASKTNAITTVKHNLIETTDERVMSDLSIAAPTVKHHHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSILLLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESSYEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQAGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVSVMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGPSLVRGSLCQQEQLADI
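
The relationship between the coding sequence and the protein sequence listed above can be checked by translating the CDS with the standard genetic code. The following is a rough sketch under stated assumptions: the names `CODON_TABLE` and `translate` are illustrative, the standard nuclear code is assumed, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) are rendered as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGAACATGCTAAATGATGATGATGCC"))
# Last four codons of the CDS, including the TGA stop codon.
print(translate("GCTGATATATGA"))
```

The two fragments translate to `MNMLNDDDA` and `ADI`, which agree with the start and end of the protein sequence above.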
BLAST of Cp4.1LG03g15880 vs. Swiss-Prot
Match: Y1686_ARATH (Uncharacterized membrane protein At1g16860 OS=Arabidopsis thaliana GN=At1g16860 PE=1 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.8e-45
Identity = 97/281 (34.52%), Positives = 165/281 (58.72%), Query Frame = 1

Query: 249 PTVKHHH----------YTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSIL 308
           PTV H+           ++     P   L++++ +FI+GF    F+L  VHN    V + 
Sbjct: 183 PTVVHNQAVTTLGPEDDFSCLKSFPKPVLWLVVLIFIMGFLAGGFILGAVHNPILLVVVA 242

Query: 309 LLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESS 368
           +L   ++A  +WN  +   +  +  F+  +PD+DL  A  GQ VK+TG  +CG+V LESS
Sbjct: 243 ILFTVVAALFIWNICW--GRRGITDFIARYPDADLRTAKNGQHVKVTGVVTCGNVPLESS 302

Query: 369 YEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQ 428
           + +   C+Y ST LYEYRG        +  +  W L  SER   DFYI+D +SG+RA+V+
Sbjct: 303 FHRVPRCVYTSTCLYEYRGWGSKPANSSHRHFTWGLRSSERHVVDFYISDFQSGLRALVK 362

Query: 429 AGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVS 488
            G G+K+ PL+ +S +++  +  + +S    +WL  +NL+++ R++RL+EGY++EGS VS
Sbjct: 363 TGSGAKVTPLVDDSVVIDFKQGSEQVSPDFVRWLGKKNLTSDDRIMRLKEGYIKEGSTVS 422

Query: 489 VMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVL 520
           V+G++ +++  L +V   + ++ G  WR+   P  ++G+VL
Sbjct: 423 VIGVVQRNDNVLMIVPSSEPLAAGWQWRRCTFPTSLEGIVL 461
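
The percentages on each HSP summary line are the matched counts divided by the alignment length. A small sketch of how they can be recomputed is given below; the function name `parse_hsp_stats` is hypothetical, and the parser simply looks for `label = n/m` pairs, accepting any label that begins with "Pos" as the positives count.

```python
import re


def parse_hsp_stats(line):
    """Return percentages recomputed from 'label = n/m' pairs on an HSP line."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        key = label.lower()
        # Normalise spelling variants of the positives label.
        if key.startswith("pos"):
            key = "positives"
        stats[key] = round(100 * int(num) / int(den), 2)
    return stats


line = "Identity = 97/281 (34.52%), Positives = 165/281 (58.72%), Query Frame = 1"
print(parse_hsp_stats(line))
```

For the Y1686_ARATH hit above, 97/281 and 165/281 reproduce the reported 34.52% identity and 58.72% positives.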

BLAST of Cp4.1LG03g15880 vs. Swiss-Prot
Match: ATHB7_ARATH (Homeobox-leucine zipper protein ATHB-7 OS=Arabidopsis thaliana GN=ATHB-7 PE=1 SV=2)

HSP 1 Score: 161.0 bits (406), Expect = 3.7e-38
Identity = 103/218 (47.25%), Positives = 136/218 (62.39%), Query Frame = 1

Query: 7   DDAEYSPPESMAEAFGMRKKSM-------NRRRFSEEQIKSLESIFESESRLEPRKKLQL 66
           +  EYSP    AE F   KK         N+RRFS+EQIKSLE +FESE+RLEPRKK+QL
Sbjct: 3   EGGEYSPAMMSAEPFLTMKKMKKSNHNKNNQRRFSDEQIKSLEMMFESETRLEPRKKVQL 62

Query: 67  AGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQ 126
           A ELGL PRQVAIWFQNKRARWKSKQLE +Y++LR NY+ L S+FE+LKKEKQAL  +LQ
Sbjct: 63  ARELGLQPRQVAIWFQNKRARWKSKQLETEYNILRQNYDNLASQFESLKKEKQALVSELQ 122

Query: 127 KLNDLVQ-RSMEETESCRGG---LSLETIDGKSEN-GHRTKYESEVKPCVSAEEK----- 186
           +L +  Q ++ EE   C G    ++L +   +SEN  +R +   EV+P +  ++      
Sbjct: 123 RLKEATQKKTQEEERQCSGDQAVVALSSTHHESENEENRRRKPEEVRPEMEMKDDKGHHG 182

Query: 187 ---SEHELEVLSN-YGSGTKEAYLG--LDEPHYELEVV 202
                H+ E   N Y +  K  Y G   +EP + + +V
Sbjct: 183 VMCDHHDYEDDDNGYSNNIKREYFGGFEEEPDHLMNIV 220

BLAST of Cp4.1LG03g15880 vs. Swiss-Prot
Match: ATB12_ARATH (Homeobox-leucine zipper protein ATHB-12 OS=Arabidopsis thaliana GN=ATHB-12 PE=1 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 3.1e-37
Identity = 96/168 (57.14%), Positives = 120/168 (71.43%), Query Frame = 1

Query: 25  KKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSK 84
           KKS N++RFSEEQIKSLE IFESE+RLEPRKK+Q+A ELGL PRQVAIWFQNKRARWK+K
Sbjct: 26  KKSNNQKRFSEEQIKSLELIFESETRLEPRKKVQVARELGLQPRQVAIWFQNKRARWKTK 85

Query: 85  QLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEET-ESCRG--GLSL 144
           QLE++Y+ LRANYN L S+FE +KKEKQ+L  +LQ+LN+ +QR  EE    C G  GL+L
Sbjct: 86  QLEKEYNTLRANYNNLASQFEIMKKEKQSLVSELQRLNEEMQRPKEEKHHECCGDQGLAL 145

Query: 145 ----ETIDGKSENGHRT----------KYESEVK-PCVSAEEKSEHEL 175
               E+ +GKSE   R            Y + +K      EE+++HEL
Sbjct: 146 SSSTESHNGKSEPEGRLDQGSVLCNDGDYNNNIKTEYFGFEEETDHEL 193

BLAST of Cp4.1LG03g15880 vs. Swiss-Prot
Match: HOX6_ORYSI (Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. indica GN=HOX6 PE=2 SV=2)

HSP 1 Score: 134.0 bits (336), Expect = 4.8e-30
Identity = 67/97 (69.07%), Positives = 85/97 (87.63%), Query Frame = 1

Query: 30  RRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQLERD 89
           ++RFSEEQIKSLES+F ++++LEPR+KLQLA ELGL PRQVAIWFQNKRARWKSKQLER+
Sbjct: 31  KKRFSEEQIKSLESMFATQTKLEPRQKLQLARELGLQPRQVAIWFQNKRARWKSKQLERE 90

Query: 90  YSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQ 127
           YS LR +Y+ L+  +E+LKKEK AL  QL+KL +++Q
Sbjct: 91  YSALRDDYDALLCSYESLKKEKLALIKQLEKLAEMLQ 127

BLAST of Cp4.1LG03g15880 vs. Swiss-Prot
Match: HOX6_ORYSJ (Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. japonica GN=HOX6 PE=2 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 4.8e-30
Identity = 67/97 (69.07%), Positives = 85/97 (87.63%), Query Frame = 1

Query: 30  RRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQLERD 89
           ++RFSEEQIKSLES+F ++++LEPR+KLQLA ELGL PRQVAIWFQNKRARWKSKQLER+
Sbjct: 31  KKRFSEEQIKSLESMFATQTKLEPRQKLQLARELGLQPRQVAIWFQNKRARWKSKQLERE 90

Query: 90  YSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQ 127
           YS LR +Y+ L+  +E+LKKEK AL  QL+KL +++Q
Sbjct: 91  YSALRDDYDALLCSYESLKKEKLALIKQLEKLAEMLQ 127

BLAST of Cp4.1LG03g15880 vs. TrEMBL
Match: A0A0A0L8G1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119250 PE=4 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 3.8e-135
Identity = 251/305 (82.30%), Positives = 276/305 (90.49%), Query Frame = 1

Query: 241 MSDLS-IAAPTVKH-HHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSIL 300
           MS+LS I APT KH HHY+ACNPIPS ALYILIPLFILGFSVSIFVL VVHNAFFF+S+L
Sbjct: 1   MSNLSSIEAPTEKHYHHYSACNPIPSPALYILIPLFILGFSVSIFVLVVVHNAFFFISLL 60

Query: 301 LLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESS 360
            L+IFLSAFALWNSL FSSK A+LSFLHS PDSDLTLA EGQLVKI+GFASCG+VSLESS
Sbjct: 61  FLSIFLSAFALWNSLNFSSKTAILSFLHSLPDSDLTLAQEGQLVKISGFASCGTVSLESS 120

Query: 361 YEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQ 420
           YEKA+GC+YASTSLYEYRGM ++FQK+TQPYCGW+L YSERFSTDFYITDRK+GIRAMV+
Sbjct: 121 YEKATGCVYASTSLYEYRGMPMIFQKITQPYCGWRLVYSERFSTDFYITDRKTGIRAMVR 180

Query: 421 AGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVS 480
           AGPGSKLVPLIIESKLVNTTRHRKILS SLRKWLR++N+STEAR+LRLEEGYVQEGS VS
Sbjct: 181 AGPGSKLVPLIIESKLVNTTRHRKILSPSLRKWLREKNISTEARILRLEEGYVQEGSFVS 240

Query: 481 VMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGPSLVRGSLCQQE 540
           V GMLH++NG +T+VQPPD ISTGC WRK LLPI IDGLVLGVSQ +GP L  GSL   E
Sbjct: 241 VFGMLHRNNGQITIVQPPDVISTGCVWRKFLLPIYIDGLVLGVSQATGPLLGPGSLYHHE 300

Query: 541 QLADI 544
           Q ADI
Sbjct: 301 QFADI 305

BLAST of Cp4.1LG03g15880 vs. TrEMBL
Match: F6I2W2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0048g02880 PE=4 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 4.0e-92
Identity = 179/300 (59.67%), Positives = 227/300 (75.67%), Query Frame = 1

Query: 238 ERVMSDLSIAAPTVKHHHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSI 297
           ERVM+DLS A+ +++ H Y+ C PIPS  LY+L+PLF  G +VSIF+L  VHNAF FVS+
Sbjct: 27  ERVMNDLSNAS-SIQSHCYS-CKPIPSQVLYVLVPLFFTGLAVSIFILIAVHNAFLFVSL 86

Query: 298 LLLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLES 357
           L L+  ++AF +WN++ +    A+  +L SFPDSDL LA  GQLVKITG  SCG++SLES
Sbjct: 87  LCLSALVAAFLIWNTVNWRRSRALFCYLRSFPDSDLRLARHGQLVKITGLVSCGNISLES 146

Query: 358 SYEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMV 417
           SYEKA+ CIY ST LYEY G+ L       P  GW LAY ERFSTDFYITD KSGIRA+V
Sbjct: 147 SYEKATRCIYTSTLLYEYPGLGLKLADAKVPCFGWGLAYCERFSTDFYITDSKSGIRALV 206

Query: 418 QAGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLV 477
           +AG GS++ PLI+ES+LVNTTR  + LSS  +KWL +RN+S +AR+LRLEEGYV+EGS +
Sbjct: 207 KAGSGSRVTPLIVESRLVNTTRKCRFLSSHFKKWLAERNISGQARLLRLEEGYVKEGSSM 266

Query: 478 SVMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGPSLVRGSLCQQ 537
           +V+GMLH+ N  L +VQPP+ +STGC WRKLLLP+DIDG++LGV +M GP     S  QQ
Sbjct: 267 AVIGMLHRDNDALMIVQPPELLSTGCLWRKLLLPVDIDGVILGVPEMVGPVANPPSSMQQ 324

BLAST of Cp4.1LG03g15880 vs. TrEMBL
Match: M5X591_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa021193mg PE=4 SV=1)

HSP 1 Score: 337.4 bits (864), Expect = 3.2e-89
Identity = 178/280 (63.57%), Positives = 219/280 (78.21%), Query Frame = 1

Query: 240 VMSDLSIAAPTVKHHHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSILL 299
           VM+DLS AA  ++ H  +A  PIPS  LYIL+P+F LG SVSIF+L VVHNA FFVS L+
Sbjct: 5   VMNDLSNAA--LRDHQSSAFKPIPSLVLYILVPIFFLGLSVSIFILIVVHNALFFVSFLV 64

Query: 300 LAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESSY 359
           L+  + AF +WN  +++ KAA   FL+S P+SDL LA  GQLVKITG ASC S+SLESSY
Sbjct: 65  LSALVFAFVVWNKRHWAKKAAFFLFLNSLPESDLRLAQHGQLVKITGIASCESLSLESSY 124

Query: 360 EKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQA 419
           EKA+GCIYAST LYEYRG++     +      W LAY ERFSTDFY+TD+KSG+RA V+A
Sbjct: 125 EKATGCIYASTLLYEYRGLTRQPVNINSSCFQWHLAYCERFSTDFYLTDQKSGLRATVKA 184

Query: 420 GPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVSV 479
           G G K++PL++ESKLVNT R R +LS  LRKWL +RNLS+E+R+LRLEEGYVQEGS V+V
Sbjct: 185 GSGCKVIPLVVESKLVNTKRCR-LLSPHLRKWLSERNLSSESRLLRLEEGYVQEGSSVTV 244

Query: 480 MGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVL 520
            GMLH++N   T+VQPP+ ISTGC WRKLLLP+DIDGL+L
Sbjct: 245 FGMLHRNNEITTIVQPPEVISTGCLWRKLLLPVDIDGLIL 281

BLAST of Cp4.1LG03g15880 vs. TrEMBL
Match: A0A0A0L317_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119240 PE=4 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 7.1e-89
Identity = 175/192 (91.15%), Positives = 183/192 (95.31%), Query Frame = 1

Query: 3   MLNDDDAEYSPPESMAEAFGMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 62
           MLN+D AEYSPP SMAEAF MRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE
Sbjct: 1   MLNED-AEYSPPASMAEAFAMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 60

Query: 63  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLN 122
           LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTL SRFEALKKEKQAL +QLQKLN
Sbjct: 61  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLASRFEALKKEKQALTMQLQKLN 120

Query: 123 DLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSNYGS 182
           +LVQRSMEETESCRG LS+ETIDGKSE  HRTKYESEVKPC+SAEEKSEHELEVLSNYGS
Sbjct: 121 NLVQRSMEETESCRGVLSIETIDGKSEIDHRTKYESEVKPCLSAEEKSEHELEVLSNYGS 180

Query: 183 GTKEAYLGLDEP 195
           G KEAY+GL++P
Sbjct: 181 GVKEAYIGLEDP 191

BLAST of Cp4.1LG03g15880 vs. TrEMBL
Match: A0A067JX28_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14208 PE=4 SV=1)

HSP 1 Score: 334.0 bits (855), Expect = 3.5e-88
Identity = 175/287 (60.98%), Positives = 219/287 (76.31%), Query Frame = 1

Query: 241 MSDLSIAAPTVKHHHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSILLL 300
           M+DLS A   ++  +   C PIP  A++IL+ LF +G SVSIF+L VVHNA FF+S LL+
Sbjct: 1   MNDLSSAV--LREQNSENCKPIPRVAVFILVALFAIGLSVSIFILIVVHNAIFFLSFLLI 60

Query: 301 AIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESSYE 360
           +  + +F  WN + +  KAA+  FL SFPDSDL  A +GQLVKITG ASCGSVSLESSYE
Sbjct: 61  SGLVISFIAWNRVNWRHKAAVFRFLRSFPDSDLASARDGQLVKITGLASCGSVSLESSYE 120

Query: 361 KASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQAG 420
           +A+ CIYAST LYEY G  L  +        W L Y ER+STDFYITDRKSGIRA+V+AG
Sbjct: 121 RATRCIYASTLLYEYGGFGLKPKDANTSCFQWSLTYCERYSTDFYITDRKSGIRALVKAG 180

Query: 421 PGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVSVM 480
           PG K+VPLI+ESKLV TTR  +ILS  LRKWL+DRNLS EAR+LRLEEGY+Q GS+V+V+
Sbjct: 181 PGCKVVPLIVESKLVATTRQCRILSPHLRKWLQDRNLSVEARLLRLEEGYIQGGSIVTVI 240

Query: 481 GMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGP 528
           G+LH+++  L +VQP + +STGC W KL+LPIDIDGLV+G+S +SGP
Sbjct: 241 GVLHRNDDILMIVQPQELLSTGCLWTKLILPIDIDGLVVGLSNLSGP 285

BLAST of Cp4.1LG03g15880 vs. TAIR10
Match: AT4G22290.1 (AT4G22290.1 Ubiquitin-specific protease family C19-related protein)

HSP 1 Score: 189.9 bits (481), Expect = 4.1e-48
Identity = 99/258 (38.37%), Positives = 155/258 (60.08%), Query Frame = 1

Query: 262 IPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSILLLAIFLSAFALWNSLYFSSKAAM 321
           +P A ++ ++ +  +G  V  F+   V       ++L          +WN ++   +  +
Sbjct: 177 VPKAMVWAVLIVAAMGLLVGAFLTVAVKKPVVIAAVLAAVCPAIVVLVWNCVW--RRKGL 236

Query: 322 LSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESSYEKASGCIYASTSLYEYRGMSLV 381
           LSF+  +PD++L  A +GQ VK+TG  +CGS+ LESS+++   C+Y ST LYEY+G    
Sbjct: 237 LSFIKKYPDAELRGAIDGQFVKVTGVVTCGSIPLESSFQRTPRCVYVSTELYEYKGFGGK 296

Query: 382 FQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQAGPGSKLVPLIIESKLVNTTRHR 441
                     W   ++E++ +DFYI+D +SG+RA+V+AG GSK+ P +  + + N T   
Sbjct: 297 SANPKHRCFSWGSRHAEKYVSDFYISDFQSGLRALVKAGYGSKVSPFVKPATVANVTTQN 356

Query: 442 KILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVSVMGMLHKSNGHLTVVQPPDAIST 501
           K LS S  KWL DRNLS + RV+RL+EGY++EGS VSVMGM+ + +  L +V P +A+S+
Sbjct: 357 KDLSPSFLKWLSDRNLSADDRVMRLKEGYIKEGSTVSVMGMVRRHDNVLMIVPPAEAVSS 416

Query: 502 GCSWRKLLLPIDIDGLVL 520
           GC W   L P   DGL++
Sbjct: 417 GCRWWHCLFPTYADGLII 432

BLAST of Cp4.1LG03g15880 vs. TAIR10
Match: AT1G16860.1 (AT1G16860.1 Ubiquitin-specific protease family C19-related protein)

HSP 1 Score: 185.3 bits (469), Expect = 1.0e-46
Identity = 97/281 (34.52%), Positives = 165/281 (58.72%), Query Frame = 1

Query: 249 PTVKHHH----------YTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSIL 308
           PTV H+           ++     P   L++++ +FI+GF    F+L  VHN    V + 
Sbjct: 183 PTVVHNQAVTTLGPEDDFSCLKSFPKPVLWLVVLIFIMGFLAGGFILGAVHNPILLVVVA 242

Query: 309 LLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESS 368
           +L   ++A  +WN  +   +  +  F+  +PD+DL  A  GQ VK+TG  +CG+V LESS
Sbjct: 243 ILFTVVAALFIWNICW--GRRGITDFIARYPDADLRTAKNGQHVKVTGVVTCGNVPLESS 302

Query: 369 YEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQ 428
           + +   C+Y ST LYEYRG        +  +  W L  SER   DFYI+D +SG+RA+V+
Sbjct: 303 FHRVPRCVYTSTCLYEYRGWGSKPANSSHRHFTWGLRSSERHVVDFYISDFQSGLRALVK 362

Query: 429 AGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVS 488
            G G+K+ PL+ +S +++  +  + +S    +WL  +NL+++ R++RL+EGY++EGS VS
Sbjct: 363 TGSGAKVTPLVDDSVVIDFKQGSEQVSPDFVRWLGKKNLTSDDRIMRLKEGYIKEGSTVS 422

Query: 489 VMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVL 520
           V+G++ +++  L +V   + ++ G  WR+   P  ++G+VL
Sbjct: 423 VIGVVQRNDNVLMIVPSSEPLAAGWQWRRCTFPTSLEGIVL 461

BLAST of Cp4.1LG03g15880 vs. TAIR10
Match: AT1G78880.1 (AT1G78880.1 Ubiquitin-specific protease family C19-related protein)

HSP 1 Score: 179.9 bits (455), Expect = 4.3e-45
Identity = 92/264 (34.85%), Positives = 155/264 (58.71%), Query Frame = 1

Query: 256 YTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSILLLAIFLSAFALWNSLYF 315
           ++     P   L+++I +F++GF    F+L  VHNA   + + +L   ++A  +WN    
Sbjct: 194 FSCMKSFPKPVLWLVILIFVMGFLAGGFILGAVHNAILLIVVAVLFTVVAALFIWN--IS 253

Query: 316 SSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESSYEKASGCIYASTSLYEY 375
             +  +  F+  +PD+DL  A  GQ VK+TG  +CG+V LESS+ +   C+Y ST LYEY
Sbjct: 254 CERRGITDFIARYPDADLRTAKNGQYVKVTGVVTCGNVPLESSFHRVPRCVYTSTCLYEY 313

Query: 376 RGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQAGPGSKLVPLIIESKLV 435
           RG        +     W L  +ER   DFYI+D +SG+RA+V+ G G+K+ PL+ +S ++
Sbjct: 314 RGWGSKPANASHRRFTWGLRSAERHVVDFYISDFQSGLRALVKTGNGAKVTPLVDDSVVI 373

Query: 436 NTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVSVMGMLHKSNGHLTVVQP 495
           +     +  S    +WL  +NL+ + R++RL+EGY++EGS VSV+G++ +++  L +V  
Sbjct: 374 DFKPGNEQASPDFVRWLGKKNLTNDDRIMRLKEGYIKEGSTVSVIGVVQRNDNVLMIVPT 433

Query: 496 PDAISTGCSWRKLLLPIDIDGLVL 520
            + ++ G  W K   P  ++G+VL
Sbjct: 434 TEPLAAGWQWSKCTFPASLEGIVL 455

BLAST of Cp4.1LG03g15880 vs. TAIR10
Match: AT2G46680.1 (AT2G46680.1 homeobox 7)

HSP 1 Score: 161.0 bits (406), Expect = 2.1e-39
Identity = 103/218 (47.25%), Positives = 136/218 (62.39%), Query Frame = 1

Query: 7   DDAEYSPPESMAEAFGMRKKSM-------NRRRFSEEQIKSLESIFESESRLEPRKKLQL 66
           +  EYSP    AE F   KK         N+RRFS+EQIKSLE +FESE+RLEPRKK+QL
Sbjct: 3   EGGEYSPAMMSAEPFLTMKKMKKSNHNKNNQRRFSDEQIKSLEMMFESETRLEPRKKVQL 62

Query: 67  AGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQ 126
           A ELGL PRQVAIWFQNKRARWKSKQLE +Y++LR NY+ L S+FE+LKKEKQAL  +LQ
Sbjct: 63  ARELGLQPRQVAIWFQNKRARWKSKQLETEYNILRQNYDNLASQFESLKKEKQALVSELQ 122

Query: 127 KLNDLVQ-RSMEETESCRGG---LSLETIDGKSEN-GHRTKYESEVKPCVSAEEK----- 186
           +L +  Q ++ EE   C G    ++L +   +SEN  +R +   EV+P +  ++      
Sbjct: 123 RLKEATQKKTQEEERQCSGDQAVVALSSTHHESENEENRRRKPEEVRPEMEMKDDKGHHG 182

Query: 187 ---SEHELEVLSN-YGSGTKEAYLG--LDEPHYELEVV 202
                H+ E   N Y +  K  Y G   +EP + + +V
Sbjct: 183 VMCDHHDYEDDDNGYSNNIKREYFGGFEEEPDHLMNIV 220

BLAST of Cp4.1LG03g15880 vs. TAIR10
Match: AT3G61890.1 (AT3G61890.1 homeobox 12)

HSP 1 Score: 157.9 bits (398), Expect = 1.7e-38
Identity = 96/168 (57.14%), Positives = 120/168 (71.43%), Query Frame = 1

Query: 25  KKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSK 84
           KKS N++RFSEEQIKSLE IFESE+RLEPRKK+Q+A ELGL PRQVAIWFQNKRARWK+K
Sbjct: 26  KKSNNQKRFSEEQIKSLELIFESETRLEPRKKVQVARELGLQPRQVAIWFQNKRARWKTK 85

Query: 85  QLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEET-ESCRG--GLSL 144
           QLE++Y+ LRANYN L S+FE +KKEKQ+L  +LQ+LN+ +QR  EE    C G  GL+L
Sbjct: 86  QLEKEYNTLRANYNNLASQFEIMKKEKQSLVSELQRLNEEMQRPKEEKHHECCGDQGLAL 145

Query: 145 ----ETIDGKSENGHRT----------KYESEVK-PCVSAEEKSEHEL 175
               E+ +GKSE   R            Y + +K      EE+++HEL
Sbjct: 146 SSSTESHNGKSEPEGRLDQGSVLCNDGDYNNNIKTEYFGFEEETDHEL 193

BLAST of Cp4.1LG03g15880 vs. NCBI nr
Match: gi|659074877|ref|XP_008437844.1| (PREDICTED: uncharacterized membrane protein At1g16860-like [Cucumis melo])

HSP 1 Score: 497.7 bits (1280), Expect = 2.6e-137
Identity = 253/305 (82.95%), Positives = 279/305 (91.48%), Query Frame = 1

Query: 241 MSDLS-IAAPTVKH-HHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSIL 300
           MSDLS I APT KH HHY+ACNPIPS ALYILIPLFILGFSVSIFVL VVHNAFFF+S+L
Sbjct: 1   MSDLSSIEAPTEKHYHHYSACNPIPSPALYILIPLFILGFSVSIFVLVVVHNAFFFISLL 60

Query: 301 LLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESS 360
            L+IFLS FALWNSL FSSK A+LSFLHS PDSDLT+A EGQ VKI+GFASCG+VSLESS
Sbjct: 61  FLSIFLSTFALWNSLNFSSKTAILSFLHSLPDSDLTIAREGQFVKISGFASCGTVSLESS 120

Query: 361 YEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQ 420
           YEKA+GC+YASTSLYEYRGM L+FQK+TQPYCGW+L YSERFSTDFYITDRK+GIRAMV+
Sbjct: 121 YEKAAGCVYASTSLYEYRGMPLIFQKITQPYCGWRLVYSERFSTDFYITDRKTGIRAMVR 180

Query: 421 AGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVS 480
           AGPGSKLVPLIIESKLVNTTRHRKILS SLRKWLR++N+STEAR+LRLEEGYVQEGS VS
Sbjct: 181 AGPGSKLVPLIIESKLVNTTRHRKILSPSLRKWLREKNISTEARMLRLEEGYVQEGSFVS 240

Query: 481 VMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGPSLVRGSLCQQE 540
           V+GMLH++NG +T+VQPPD ISTGC WRKLLLPI IDG+VLGVSQ +GPSL  GSLC QE
Sbjct: 241 VLGMLHRNNGQITIVQPPDVISTGCVWRKLLLPIYIDGVVLGVSQTTGPSLGPGSLCHQE 300

Query: 541 QLADI 544
           Q ADI
Sbjct: 301 QFADI 305

BLAST of Cp4.1LG03g15880 vs. NCBI nr
Match: gi|449432010|ref|XP_004133793.1| (PREDICTED: uncharacterized membrane protein At1g16860 [Cucumis sativus])

HSP 1 Score: 490.0 bits (1260), Expect = 5.5e-135
Identity = 251/305 (82.30%), Positives = 276/305 (90.49%), Query Frame = 1

Query: 241 MSDLS-IAAPTVKH-HHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSIL 300
           MS+LS I APT KH HHY+ACNPIPS ALYILIPLFILGFSVSIFVL VVHNAFFF+S+L
Sbjct: 1   MSNLSSIEAPTEKHYHHYSACNPIPSPALYILIPLFILGFSVSIFVLVVVHNAFFFISLL 60

Query: 301 LLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESS 360
            L+IFLSAFALWNSL FSSK A+LSFLHS PDSDLTLA EGQLVKI+GFASCG+VSLESS
Sbjct: 61  FLSIFLSAFALWNSLNFSSKTAILSFLHSLPDSDLTLAQEGQLVKISGFASCGTVSLESS 120

Query: 361 YEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQ 420
           YEKA+GC+YASTSLYEYRGM ++FQK+TQPYCGW+L YSERFSTDFYITDRK+GIRAMV+
Sbjct: 121 YEKATGCVYASTSLYEYRGMPMIFQKITQPYCGWRLVYSERFSTDFYITDRKTGIRAMVR 180

Query: 421 AGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVS 480
           AGPGSKLVPLIIESKLVNTTRHRKILS SLRKWLR++N+STEAR+LRLEEGYVQEGS VS
Sbjct: 181 AGPGSKLVPLIIESKLVNTTRHRKILSPSLRKWLREKNISTEARILRLEEGYVQEGSFVS 240

Query: 481 VMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGPSLVRGSLCQQE 540
           V GMLH++NG +T+VQPPD ISTGC WRK LLPI IDGLVLGVSQ +GP L  GSL   E
Sbjct: 241 VFGMLHRNNGQITIVQPPDVISTGCVWRKFLLPIYIDGLVLGVSQATGPLLGPGSLYHHE 300

Query: 541 QLADI 544
           Q ADI
Sbjct: 301 QFADI 305

BLAST of Cp4.1LG03g15880 vs. NCBI nr
Match: gi|1009142553|ref|XP_015888783.1| (PREDICTED: uncharacterized membrane protein At1g16860-like [Ziziphus jujuba])

HSP 1 Score: 348.6 bits (893), Expect = 2.0e-92
Identity = 183/293 (62.46%), Positives = 226/293 (77.13%), Query Frame = 1

Query: 234 ETTDERVMSDLSIAAPTVKHHHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFF 293
           E   E VM+DLS AA  +  HH + C PIPS ALYIL  LF++G SVSIF+L VVHNAFF
Sbjct: 127 EREREEVMNDLSNAA--LLDHHCSTCKPIPSLALYILTSLFVIGLSVSIFILIVVHNAFF 186

Query: 294 FVSILLLAIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSV 353
           FVS LL++  + AF LWN+  +  + AM  FL S P++DL LA EGQLVKITG  +CGS+
Sbjct: 187 FVSFLLVSAIVLAFILWNTRSWRRRGAMFFFLSSLPETDLRLAQEGQLVKITGLTTCGSI 246

Query: 354 SLESSYEKASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGI 413
           SLESSYE+A+ C+YAST LYEY G++L      +    W LAY ERFSTDFYITD+KSG+
Sbjct: 247 SLESSYERATRCLYASTLLYEYGGLALNLVSFNKSCFQWSLAYCERFSTDFYITDKKSGL 306

Query: 414 RAMVQAGPGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQE 473
           RAMV+AG G K+VPLI+ES+L+NTTR  +ILS  L KWLR+RNLS EAR+LRLEEGYVQE
Sbjct: 307 RAMVKAGSGCKVVPLILESRLINTTRECRILSPYLTKWLRERNLSVEARLLRLEEGYVQE 366

Query: 474 GSLVSVMGMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSG 527
           GS V+V+G+L ++N    +VQPP+ ISTGC W+KLLLP+DIDGL+L VSQM+G
Sbjct: 367 GSSVTVVGLLRRNNDTPMIVQPPEIISTGCLWQKLLLPVDIDGLILSVSQMAG 417

BLAST of Cp4.1LG03g15880 vs. NCBI nr
Match: gi|731420719|ref|XP_010661482.1| (PREDICTED: uncharacterized membrane protein At1g16860-like [Vitis vinifera])

HSP 1 Score: 341.7 bits (875), Expect = 2.4e-90
Identity = 176/297 (59.26%), Positives = 224/297 (75.42%), Query Frame = 1

Query: 241 MSDLSIAAPTVKHHHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSILLL 300
           M+DLS A+ +++ H Y+ C PIPS  LY+L+PLF  G +VSIF+L  VHNAF FVS+L L
Sbjct: 1   MNDLSNAS-SIQSHCYS-CKPIPSQVLYVLVPLFFTGLAVSIFILIAVHNAFLFVSLLCL 60

Query: 301 AIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESSYE 360
           +  ++AF +WN++ +    A+  +L SFPDSDL LA  GQLVKITG  SCG++SLESSYE
Sbjct: 61  SALVAAFLIWNTVNWRRSRALFCYLRSFPDSDLRLARHGQLVKITGLVSCGNISLESSYE 120

Query: 361 KASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQAG 420
           KA+ CIY ST LYEY G+ L       P  GW LAY ERFSTDFYITD KSGIRA+V+AG
Sbjct: 121 KATRCIYTSTLLYEYPGLGLKLADAKVPCFGWGLAYCERFSTDFYITDSKSGIRALVKAG 180

Query: 421 PGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVSVM 480
            GS++ PLI+ES+LVNTTR  + LSS  +KWL +RN+S +AR+LRLEEGYV+EGS ++V+
Sbjct: 181 SGSRVTPLIVESRLVNTTRKCRFLSSHFKKWLAERNISGQARLLRLEEGYVKEGSSMAVI 240

Query: 481 GMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGPSLVRGSLCQQ 538
           GMLH+ N  L +VQPP+ +STGC WRKLLLP+DIDG++LGV +M GP     S  QQ
Sbjct: 241 GMLHRDNDALMIVQPPELLSTGCLWRKLLLPVDIDGVILGVPEMVGPVANPPSSMQQ 295

BLAST of Cp4.1LG03g15880 vs. NCBI nr
Match: gi|470145312|ref|XP_004308285.1| (PREDICTED: uncharacterized membrane protein At1g16860-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 340.9 bits (873), Expect = 4.1e-90
Identity = 182/299 (60.87%), Positives = 229/299 (76.59%), Query Frame = 1

Query: 241 MSDLSIAAPTVKHHHYTACNPIPSAALYILIPLFILGFSVSIFVLAVVHNAFFFVSILLL 300
           M+DLS A     HH  T   PIPS  LYILIP+F+LG SVSIF+L  VHNA FFV  LLL
Sbjct: 1   MNDLSNAVLR-GHHTSTPFKPIPSLVLYILIPIFLLGLSVSIFILIAVHNALFFVFFLLL 60

Query: 301 AIFLSAFALWNSLYFSSKAAMLSFLHSFPDSDLTLACEGQLVKITGFASCGSVSLESSYE 360
           +  + AF LWN+ ++++++A+L FL+S PDSDL +A  G LVKITG ASCGS+SLESSYE
Sbjct: 61  SALVLAFVLWNTRHWATQSAVLFFLNSLPDSDLRVAQHGDLVKITGLASCGSLSLESSYE 120

Query: 361 KASGCIYASTSLYEYRGMSLVFQKVTQPYCGWKLAYSERFSTDFYITDRKSGIRAMVQAG 420
           KA+ C+YAST LYEY+G++L  +   +    W L Y ERFSTDFY+TDRKSG+RA+V+AG
Sbjct: 121 KATRCVYASTLLYEYKGLTLHPRNAKRSCFQWHLEYCERFSTDFYLTDRKSGLRAIVKAG 180

Query: 421 PGSKLVPLIIESKLVNTTRHRKILSSSLRKWLRDRNLSTEARVLRLEEGYVQEGSLVSVM 480
            G  L+PL+ ESKLVNT + R ILS  L KWLR+RNLS E+R+LRLEEGYVQEGS V+V 
Sbjct: 181 SGCNLIPLVFESKLVNTRKSR-ILSPHLTKWLRERNLSAESRLLRLEEGYVQEGSTVTVF 240

Query: 481 GMLHKSNGHLTVVQPPDAISTGCSWRKLLLPIDIDGLVLGVSQMSGPSLVRGSLCQQEQ 540
           GMLHK+N   T+VQPP+ IS+GC WRKLLLP+DIDGL+L +SQM+G S+ + S+   E+
Sbjct: 241 GMLHKNNEMTTIVQPPEVISSGCQWRKLLLPVDIDGLIL-ISQMAGQSVNQNSIQHPER 296
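
The percentages on each HSP statistics line follow directly from the raw counts (for example, 183/293 gives 62.46%), and the counts themselves can be read off the alignment midline, where a letter marks an identical residue and "+" marks a conservative substitution. The following is a minimal Python sketch under the assumption that the statistics line has the layout shown in this report; the function names hsp_percentages and midline_counts are illustrative and not part of any BLAST tool.

```python
import re

# Matches the HSP statistics line as laid out in this report. The label for
# the positives field is matched loosely (any word starting with "Pos").
STAT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(stat_line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = STAT_RE.search(stat_line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))


def midline_counts(midline):
    """Count identities and positives in one alignment midline segment.

    Letters are identical residues; '+' marks a conservative substitution.
    Positives include the identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


if __name__ == "__main__":
    line = "Identity = 183/293 (62.46%), Positives = 226/293 (77.13%), Query Frame = 1"
    print(hsp_percentages(line))
    print(midline_counts("Q ADI"))
```

Summing midline_counts over every midline segment of an HSP should reproduce the numerators on its statistics line, which gives a quick consistency check for a copied alignment.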

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
Y1686_ARATH  1.8e-45  34.52     Uncharacterized membrane protein At1g16860 OS=Arabidopsis thaliana GN=At1g16860 ...
ATHB7_ARATH  3.7e-38  47.25     Homeobox-leucine zipper protein ATHB-7 OS=Arabidopsis thaliana GN=ATHB-7 PE=1 SV...
ATB12_ARATH  3.1e-37  57.14     Homeobox-leucine zipper protein ATHB-12 OS=Arabidopsis thaliana GN=ATHB-12 PE=1 ...
HOX6_ORYSI   4.8e-30  69.07     Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. indica GN=HOX6 PE=2 ...
HOX6_ORYSJ   4.8e-30  69.07     Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. japonica GN=HOX6 PE=...
Match Name        E-value   Identity  Description
A0A0A0L8G1_CUCSA  3.8e-135  82.30     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119250 PE=4 SV=1
F6I2W2_VITVI      4.0e-92   59.67     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0048g02880 PE=4 SV=...
M5X591_PRUPE      3.2e-89   63.57     Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa021193mg PE=4 S...
A0A0A0L317_CUCSA  7.1e-89   91.15     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119240 PE=4 SV=1
A0A067JX28_JATCU  3.5e-88   60.98     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14208 PE=4 SV=1
Match Name   E-value  Identity  Description
AT4G22290.1  4.1e-48  38.37     Ubiquitin-specific protease family C19-related protein
AT1G16860.1  1.0e-46  34.52     Ubiquitin-specific protease family C19-related protein
AT1G78880.1  4.3e-45  34.85     Ubiquitin-specific protease family C19-related protein
AT2G46680.1  2.1e-39  47.25     homeobox 7
AT3G61890.1  1.7e-38  57.14     homeobox 12
Match Name                         E-value   Identity  Description
gi|659074877|ref|XP_008437844.1|   2.6e-137  82.95     PREDICTED: uncharacterized membrane protein At1g16860-like [Cucumis melo]
gi|449432010|ref|XP_004133793.1|   5.5e-135  82.30     PREDICTED: uncharacterized membrane protein At1g16860 [Cucumis sativus]
gi|1009142553|ref|XP_015888783.1|  2.0e-92   62.46     PREDICTED: uncharacterized membrane protein At1g16860-like [Ziziphus jujuba]
gi|731420719|ref|XP_010661482.1|   2.4e-90   59.26     PREDICTED: uncharacterized membrane protein At1g16860-like [Vitis vinifera]
gi|470145312|ref|XP_004308285.1|   4.1e-90   60.87     PREDICTED: uncharacterized membrane protein At1g16860-like [Fragaria vesca subsp...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0043565  sequence-specific DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0003677  DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR017970   Homeobox_CS
IPR009057   Homeobox-like_sf
IPR003106   Leu_zip_homeo
IPR001356   Homeobox_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG03g15880.1  Cp4.1LG03g15880.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                      Source   Source Term       Source Description   Alignment
IPR001356  Homeobox domain                      PFAM     PF00046           Homeobox             coord: 29..82  score: 1.2
IPR001356  Homeobox domain                      SMART    SM00389           HOX_1                coord: 26..88  score: 1.7
IPR001356  Homeobox domain                      PROFILE  PS50071           HOMEOBOX_2           coord: 24..84  score: 17
IPR003106  Leucine zipper, homeobox-associated  PFAM     PF02183           HALZ                 coord: 84..125  score: 1.3
IPR009057  Homeodomain-like                     GENE3D   G3DSA:1.10.10.60                       coord: 29..91  score: 4.6
IPR009057  Homeodomain-like                     unknown  SSF46689          Homeodomain-like     coord: 14..86  score: 1.84
IPR017970  Homeobox, conserved site             PROSITE  PS00027           HOMEOBOX_1           coord: 59..82  scor
None       No IPR available                     unknown  Coil              Coil                 coord: 104..131  scor
None       No IPR available                     PANTHER  PTHR33709         FAMILY NOT NAMED     coord: 234..532  score: 2.1E
None       No IPR available                     PANTHER  PTHR33709:SF1     SUBFAMILY NOT NAMED  coord: 234..532  score: 2.1E

The following gene(s) are paralogous to this gene:

None