Sed0004043 (gene) Chayote v1

Overview
Name: Sed0004043
Type: gene
Organism: Sechium edule (Chayote v1)
Description: homeobox-leucine zipper protein GLABRA 2-like
Location: LG09: 39336227 .. 39344206 (+)
RNA-Seq Expression: Sed0004043
Synteny: Sed0004043
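
The Location field in the overview gives a linkage group, start and end coordinates, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state), the span of the gene can be derived as shown here. The function names parse_location and span_length are hypothetical and used for illustration only.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG09: 39336227 .. 39344206 (+)'."""
    m = re.match(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", text.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("LG09: 39336227 .. 39344206 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature covers 7980 positions on the plus strand of LG09.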
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTGGGAATTCAATAAACACAGAAAAAAACAATTTAATGAAATAAAACCTCAAAATAGTAAACTCATTTGAATACCGTGAACTTTAATTTGTCTATTTTGTACGGACACTGTTCCCATTTCTGAACGTATATCTCTATAAGCTCCCTTTTTCCCTTAGCCCTTGTTTGAACTCCACCGCCATTAATGGCCGTCGCCATGTCCGGCAACCACCAACCTCGCCGTGTTATCATGAACGTTCCTTCTTCTCCGGCTCTCTCTCTCACGCTTGTACTCATCTTCTTCCCTCCTGAATCTCTCTCTGCAAAAAAAAACGCAGCTCTAATGGAGTTTTTTTTTCGTCTCAGGCTGGAGGTTTTGGCAATGCGGCGGCGGCGAGGAGAGACGATAGCTGCAGTGACAACTCCGATCCGGCGGGGTCGAGATCGGCGGAGGATTTGTGCATCGATCTCGACGATGAGGATGAAGATAAACTACAAGGAAATAGGAAAAAGAGAAAGAATCGGCATAGTTCAGAGCAAATCAGAGAAATGGAGAAGTATATATCTCTATATATGTATATATGTTCTCAACATTTTTGTAATGATTTCAACATTTTTGTTCGATGAAATTTGCAGAGTGTTTAAAGAATCGCCTCATCCAGATGAGAAACAGAGGCAGTTACTCAGTGAGAAACTCGGTCTTCATTCCAAGCAAATCAAATTCTGGTTTCAGAATCGTAGAACTCAAATCAAGGTTTCCAGTTCTCTCAAGAGTTTCTTCTTCTTCATCTTCTTTCTTTTGCTTGATTTGTTTCTGTTTTCTAGGCCAGTCTCGAGCGACAGGAAATGGAGAAATTGCGCAAGGAAAATCAATCCATGAAGGAAATGCTCGACAAATCCTCTTGTCCTAAATGTTGCTCTTCAGCTACATCAATCTTCAGTTCTGAACAACAACAATCGCAATTAGTTACTGAAATTGTTCGACTCAAAGCCGAGGTAATAATATCTGGTTTCGATTGCAAGTTCTTGAAATTGATGGAGTTAACCTTCGATTTTCCGGTTGGTTCCGATTGCAAGTCCGTGAAAAAACTGAAATCACCTTCGATTTTTAGGTCGATTGCTTTAATATCTGGCTCCGGTTGAAATTTCTTGATAATTTATGAACTAACCTTCGATTTCCTGGTCGAGATGTATAATATTCGGTTTCGATTGAAAGTTCTTGAAAATTGATGAACTAACCTTCGATTTCATATTTTTGACATGGATCAACTAGCCTTCGATTTCCAGGACGAGATGTATTTTTTTCGGTTTCGATTGCAAGTTCTTGAAAAAACTGAACCAACCTTCGATTTCCAGGTCCGATAGGTATTATATTTGGTTCCGATTGCGAGTTCTTGACATTTGATGAACCAATCTTCGATTTCCAGGTCAAGAAGTATAATATTAAATTTCGATAACAAGTTCTTGAAATCGATGAACTAGCCTTCGATTTCCATGTTCTAGATGTAGAATATTAGATTCCGATAGCGAGTTCTTGAAACTGATGAACTAACCCATGTCGAGATGCATAATGTTAAGTTTCGATTGCAAGTTCTTGAAATTGATAGAACTAACATTCGATTTCCAGCTCGAGAGATTACGAGCGGCTCTGGAGGAATACGGTCCGGCTAGGGCGTCCACGTCGCGATCGAACGAGGAGGGGATTTTCGGAGCGGAGAAAGCTAGGGTTATGGAGATGGCGAATCGAGCGGTTGAGGAGGTTGTGAAAATGGCGGATTCCGGCGAGCCGCTGTGGGTTCGGAGCTTCGACGCCGGTCGAGAACTGCTGAATTACGACGAGTATGTGAAGGAATTCGGCGCCGGCGACGAGCGGCCGGGAAGAAAAATCGAAGCGTCGAGAGATTCCTGCGTGTTGTTCGTCGATCTTCAGCAGTTGGTTCAGAGCTTCATGGATGTGGTATATATGATTTTTAAGAACTAATTTCAGTAAAAACGTTGGGATTATTGTTGCATTAATCGGAG
TTTTTTTTTTTTTTTTTTAATTATTATTATTATGATTATGATTTTTTTGTGAAGGTTCAGTGGAAAGAAATGTTCCCGTCGATAATCTCGAAGGCGTCGACTGTGGAGGTCGTCTGCAATGGCGACGGGGCCACCTGTAATGGCGCAGTTCAACTGGTAAGTCTTCTCTTTTACATTATTTTAATTTTTTTATTTTATTTTGCATTACTGCCTTTTCTTTTTATAATTAATGATTGAATGAATACATGTTTTTATTTTTTTTTCCTTTTAGGATAATTTTTTGCATTTTCAATTTAATTCTTTTTTTAAAAAAAAAACAAACGTGAGGATATTTACCACAGAGAAATATCATTGATACCGCATATGTGTTTAAATTTGTGCCACATGTTTTTTTCTAGCTACACATGGGATTGTTTGTTTGAGATATTTAGGTAGGGTTTAGTTTAAAGAGTTATATTTAAAAGTTATGTTGAGTAGTTAAAAATTGAAAATTGTAACAAAAAATTCGTTTGGATGTACTATTTATGTAGAGAAGTTAGAGTCAAAATTTTGTTTGGATGTAAATTTAAAATGAGTATTTAAAAAAGAAGTAATTAAAAACAAGTGGTTAAAATGAGTTATTTTGATTGATTTGAAAGAAATGATTAAAAATGAGTGGTTAGAAATGGTTGAGTCATTATAAATTTATGATTAAAAATGAGAGATATTTTTAATAATAAAAATGATGAGAAGTTAATAAAAACTCGTTTGAATGTAGATTTTTTGGGAGGAATTATTAAATCATAAAACTCAATTCCAGCTAAATGTGCCAAACAAACAGCCCCATACACTGTAGAATAATAACATAAAGGATAAAATATTAGTTTAGTATAGGTTAAAATTTAGTTCATATAATTTTAAAAATTAAAAATTAGTTTTGGTCGTTTGTTTTTAATAATGAATAGATTAAGGATTTGAAAATTAGTTATTCATAATTTAATAATGACATTATGAATTAAAGTAATGTATATAACAAAAAATTCATATCATTTATACAATGAACACAGTGTGTTTCTGTGTTTTTTTTTTTTGAGAAATGTGTTTCTGTGTAATTAATATCCATACTATAAAGACTATTTTTTAACTTCTATAAAACCATGAGTACGGTAATCCTTTGATATTTCTTCTGGCATAAAATATAAAGATTTCTTCCTTTTTTTTGTTTTTCTTCATTGTGTTTAGTAATCTCTATTCTTTGCCATTTGCTTGCATAAAATATCGAAGATTTTTTTGTTTTTTGTTTTCCAGTTGTTTTCTGTATGTTTAGGGTTTTCCTTTTTTTTTTTTTAGCATGCATAGTGTGTCAATAGATTGAGTTGAATTAACATATGAAAATTTCCCATTTAAAATTTTTCTTAACATAAAGTTATAATCGGAAACATATTAAATATGCAAGCTACTAAATTGGAAATTTAAAACAATTGATCCCAAATTTGAAAAACACCCTAAAACAATTTAATGGGGTTTATGGTAGAAGCTGAATTTTGTGCCAGGGTTGCAGATGTATGCCGAGCTTCAAATGCTAACTCCCAGCATCCCTCCCCGAGAGGTGCATTTCATTCGAAGTTGCAAGCAAATAAGCCCTCAAAAGTGGGTCATTGCCGATGTTTCCGTCTCGAAACTCGGCGATGGTATCGACTCATCATCATCCTCTTCCTCCCTTTGCGCCAAACGCCCATCCGGTTGCATCATTCACGACACTTCTAACGGCCGTTGTAAGGTAACCAACCTTTGATTTCATCTTAAGTTCGTTCAGTTATTTAAGAATATGAATACAACATTTTAGTGTTTGAAATTTCAACTTAATTTTTACTTGGTCTCTAAATTTTTAATATTATGATCATTTTATTTTTTGGTTTATACATATTTGATAATTTTATTTTTTTAGGTTTATTATTATTATACATGTTTGATAATTATTTATTAGTGACACTTATTCTTTGTGGTTACTCTCCATTCTA
ATACGAGATGCTACAATATGTATCCTATTATAGACAAATTGATTAATGAAATAGAGTTTTGATCAAAAAAAGAAGAAGCAAAACATGTTTAATAATACTTTTTTTTATTTTTTACAAAATCACAACTACGTTTGATTGTTGCTTTTAGCTTTTATTTTCCAATTAAAATAAATTTGAATTATTTTTACTATTTTTAGTCAGATTTTTTTTAATTGCATGTTAAAATAATAACAATGTTTTATATGCATGTCCCGTGTCACTAAACAAATAAAGTAAAAATAACAAAAATAAACCTGCCCAAAAAATAAATATTATCTTATAAATTAGCATATATTAATTATACTACGCCACCATTTTATCTGGCCAATTCACAACAACTTTATTAGAAACCATTTAGAAACTAATTTTAAATTTTAGAGACTAATTACAAATCAAAATTTGAGCTTTAAGCTAATAGACTTGATAACGAGTTTACGGATTCGAAACTAGTTACCTCCAATTGTTGTTCGGTTAAATAAAAAACAAATTTTAGAGACCCAAACATTGTTAGGGAACCTCTAGGATCAAATACCAACAAAATTGGAATCTAAAAAAAATTGGTTTAAAAACTTAAAAGAATCAAATGGAAACTAAGTCCAAATCTCAAGCACAAAAAATATATTGTCCCGAATATAGAATTACATTTTTAAATTTAGTGATAGTTTAATATGGTATCAAAGTTGAACATTTTTTGTCATGTCAGGTCACTTGGATAGAGCATTGGAAATGCCCAAAGATCGGGCTTCGCACGATATATCGTACCGTTATTGACAACGGTTCGATGTTCGGGGCGAGGCATTGGATAGCGACGCTACAAATCCATTGTGAGCGGCAAGTTTTCTTCATGGCTACCAACGTTCCTATGAAGGATTCCACTGGTAAAACTCCTCACCCATTGTTAATTTTCAAGTCTTTACCAGATCTCAGTTTCTTAAAAACTTTTAGGAATTACCACAACTGGTGGAAGAAAAAGCGTTTTTCAATTGGCACAAAGAATGACGTCTGGTATTTATCAATCCATTGGGGCTTCGAGCAGCCATACGTGGGCCAAGGTCCAGAGCAAAATAGGCGAAAACATTAGGGTTGCTTCAAGGAAGAACTTGAATGATCCACGAGAACCTCTTGGTTTGATCTTGTGTGCAGCTGCTTCTGTCTGGCTACCTGTTTCACCTAAAGTCTTGTTTGAATTCTTGATCGACGAGTCTCGACGAATCGAGGTATGCCACCATCTTTTCTTTGTGTTTGGCTGAATACAAATAAAGTAGCAGTCTTACGTTGGTTTAAATAAAGAGTATCATGTCCCCAAAGGACAGTCAAGCGGTTGACGTTAAAGACTTCAAGACATGTTTCCTTGAAAGATTACTGTTTCGAGGCCGTTGGAAAGCGCTTAAACTAGAATACTCTTCGAAGTATACCGATGCCACAGTCTAGGGACGGGCCAGTGGTTTTCCGGGCATTGAGCAAAACTACAATTTTCCGGTTTTCAAAAAAAAAAAAAAAAAGAGTAAATCATGCTTATACGCCTAAAGTAAATTTATAAAGATCTTTAAAAAAAGGTTCAATAGAGAAAAGAAAGAGAGAATATAAGTCTAATTTTTTTTTACACTAAAAATTTCAAGATTTTAATTTTTCCCTTTAATTTGCTCCCTTTCAAAAACATATTTGGCTATATGGTACAATGAAGGGTCTTGATGTTGACAGCTAAGAGACTTTTTTTCTTCTTCTCATTTTCTACCTCTCTCTTTTAGCTCTCAACATTTCAATAAACTTGTAGTCTAAGTATTGTTCTACACATGAATGATTAAAATGCAGAGTTTAGTTCTATTTTTTAGATTTATATTCATTTGGTTTTTTAACTTTAAAAAATATCTAATAAAATTTGGAAGATTTTGATTGATTGGATATAAAATTGAATCTTCTGTCTAACAACTTTTTAACTTTCAATTTTGTATTTAATAAAT
TTGAAAATTTAAATAGACCTTGACTAAGTCAAGAATCTATTGTACTAAAAAAAAAAAGAAAGTCAAGAATCTATTTGATAACATTCAGAATTCAAAAACCTATTAGACACAAAATTGAAAGCTCAAAATCTGTTTATAGACCAAATAAACATGATTTTGAATATCTAAGAACAAGATTTCTGTTTAAGTTTAGAGACGAGAAACACAAACTTGAAAACTTATAATTAAGGTCTGGGTCGATAATCATTTGGTTTTCTTTTATTATTTTTAATATAAGTTTATTTGATAACCAGTTTTGATTTTTGTTTTTAAAATTTTATAAACGTTTTAAAAATATTAAAGAAATTTTAAAAGCTAAAAAAGTAGTTTTTAAAAACAGGTTTTTGTTTTAATTTTTTAAAAAACATGAAAAAATTAAAAATAAATAGTAAAAAATTAAAAAAAATTTAAACATGTTTGAAAAAAAACAATTATCACACATGTCTTTTTATTTTAAATTTTTAAAAACATAAAATTAAAATCAGTTATCATATACAAATTTTATTTTTTAAAACAAAAAACCAAAAACTAAAAACGAAAAATTAAAAACAAAAAACAAAAGACGAAATGGTTATCAAAATTTAGGTCCGATAAATATAAGTTTAGAATGACACCATTTGCCCCCTTCGATGAAAATTTATGAGCTACTTTTCTTGTTGATGATGAATAAATGATTGGATTGTGAAAACTTTCAGTGGGATGTGATGTTAAGCAGTCCAGTGGAAACAGTAGCAACCTTTGCCAAAGGACAAAATCGAGGCAATGCTGTAACCATCCAAGTGAGTATCCTTTCGCTGTGCGACAGTTTTCTTGACAACCGAATATTATAATAATCAAATAGTTTGTCTCATGAGAAATGTTAAAGTATCTATAAGTTGATTCGAACATTCACGGATATAAAAAAAAAATAATGTAATTTAGAGATAATTCTTTTGCAATCATTTGTTTCAGGCAACAAAATCAGATGAAACCAACATGTGGATTCTACAAGACAGTCTAACAAATGAATATGAATCGACAGTGGTCTATGCTCAAGTTGACATTACGAGCATGCGGACGGTGATGGCAGGGTCCGACCCGAGCAGCATCACAATGTTACCGATGGGGTTTTCGATCCTCCCGGATGGACACTCGCCGAGGCCATCGATTATCAGCTTGAACACGGAGGAGAATGGAAGCGAAGGAGGTTCGTTGCTAACAATAGCGACTCAAATCTCAGTAAGTTCCTCCCCCACTGCTGAAACTTCGTCGCAGTCTGTTGAGTACGTTAAAAATTATATATCACATACGTTACAAAATATTAAAGCAAAGCTACAAGGCGAGGATGATTAAATATCACCATTTCTCATTCTAAAATATTCATTTCAACATTTTTTTTTGCCAACCTTTGCAGTGATTTAGAATGTTGACTTTTTAGCCAAAAGGGGTTGAACTCGGCCAAAACGAGTTGAACTCGGCCGTAATTGACATGTATCATTAACTTAGAGGTTTGCTTGCTCGAATTCTCTAACTTTAAATATTATTTTTGTACTAAAAATAAAATTAGTCCATCGATTATAATAATTAGTAAGTTTACCGTTGTTGTTTCACATAATTCATTCTTTCAAGTTTTTCTATATTTTTTTTGGCTATCCCTACACAATCGAATCAAATGGGTTAGTGATGCAATGATTTTATTGGTTATAAATCAGTTCAATACTATATGTCGTCGCAGGTTTGAGCCCGAGAATCGGTATTTATCCCACCTCCCCTGATATCTACTCAAAAAAAAAAAATCCGTACACTGTCGAAGTTATTTGTAGTTGAAGAAACTTGAATCTTGTATTTTGCTAATATTTTTGTTATATAAATTTTTGTTTTTGCTTTATTTAAGCACTGCCACAACCTCGCAATCCCCATTGAAAAAACAATTGGACCGAGACATATAGAATATTGTTTAAT

mRNA sequence

GTGGGAATTCAATAAACACAGAAAAAAACAATTTAATGAAATAAAACCTCAAAATAGTAAACTCATTTGAATACCGTGAACTTTAATTTGTCTATTTTGTACGGACACTGTTCCCATTTCTGAACGTATATCTCTATAAGCTCCCTTTTTCCCTTAGCCCTTGTTTGAACTCCACCGCCATTAATGGCCGTCGCCATGTCCGGCAACCACCAACCTCGCCGTGTTATCATGAACGTTCCTTCTTCTCCGGCTCTCTCTCTCACGCTTGCTGGAGGTTTTGGCAATGCGGCGGCGGCGAGGAGAGACGATAGCTGCAGTGACAACTCCGATCCGGCGGGGTCGAGATCGGCGGAGGATTTGTGCATCGATCTCGACGATGAGGATGAAGATAAACTACAAGGAAATAGGAAAAAGAGAAAGAATCGGCATAGTTCAGAGCAAATCAGAGAAATGGAGAAAGTGTTTAAAGAATCGCCTCATCCAGATGAGAAACAGAGGCAGTTACTCAGTGAGAAACTCGGTCTTCATTCCAAGCAAATCAAATTCTGGTTTCAGAATCGTAGAACTCAAATCAAGGCCAGTCTCGAGCGACAGGAAATGGAGAAATTGCGCAAGGAAAATCAATCCATGAAGGAAATGCTCGACAAATCCTCTTGTCCTAAATGTTGCTCTTCAGCTACATCAATCTTCAGTTCTGAACAACAACAATCGCAATTAGTTACTGAAATTGTTCGACTCAAAGCCGAGCTCGAGAGATTACGAGCGGCTCTGGAGGAATACGGTCCGGCTAGGGCGTCCACGTCGCGATCGAACGAGGAGGGGATTTTCGGAGCGGAGAAAGCTAGGGTTATGGAGATGGCGAATCGAGCGGTTGAGGAGGTTGTGAAAATGGCGGATTCCGGCGAGCCGCTGTGGGTTCGGAGCTTCGACGCCGGTCGAGAACTGCTGAATTACGACGAGTATGTGAAGGAATTCGGCGCCGGCGACGAGCGGCCGGGAAGAAAAATCGAAGCGTCGAGAGATTCCTGCGTGTTGTTCGTCGATCTTCAGCAGTTGGTTCAGAGCTTCATGGATGTGGTTCAGTGGAAAGAAATGTTCCCGTCGATAATCTCGAAGGCGTCGACTGTGGAGGTCGTCTGCAATGGCGACGGGGCCACCTGTAATGGCGCAGTTCAACTGATGTATGCCGAGCTTCAAATGCTAACTCCCAGCATCCCTCCCCGAGAGGTGCATTTCATTCGAAGTTGCAAGCAAATAAGCCCTCAAAAGTGGGTCATTGCCGATGTTTCCGTCTCGAAACTCGGCGATGGTATCGACTCATCATCATCCTCTTCCTCCCTTTGCGCCAAACGCCCATCCGGTTGCATCATTCACGACACTTCTAACGGCCGTTGTAAGGTCACTTGGATAGAGCATTGGAAATGCCCAAAGATCGGGCTTCGCACGATATATCGTACCGTTATTGACAACGGTTCGATGTTCGGGGCGAGGCATTGGATAGCGACGCTACAAATCCATTGTGAGCGGCAAGTTTTCTTCATGGCTACCAACGTTCCTATGAAGGATTCCACTGGAATTACCACAACTGGTGGAAGAAAAAGCGTTTTTCAATTGGCACAAAGAATGACGTCTGGTATTTATCAATCCATTGGGGCTTCGAGCAGCCATACGTGGGCCAAGGTCCAGAGCAAAATAGGCGAAAACATTAGGGTTGCTTCAAGGAAGAACTTGAATGATCCACGAGAACCTCTTGGTTTGATCTTGTGTGCAGCTGCTTCTGTCTGGCTACCTGTTTCACCTAAAGTCTTGTTTGAATTCTTGATCGACGAGTCTCGACGAATCGAGTGGGATGTGATGTTAAGCAGTCCAGTGGAAACAGTAGCAACCTTTGCCAAAGGACAAAATCGAGGCAATGCTGTAACCATCCAAGCAACAAAATCAGATGAAACCAACATGTGGATTCTACAAGACAGTCTAACAAATGAATATGAATCGACAGT
GGTCTATGCTCAAGTTGACATTACGAGCATGCGGACGGTGATGGCAGGGTCCGACCCGAGCAGCATCACAATGTTACCGATGGGGTTTTCGATCCTCCCGGATGGACACTCGCCGAGGCCATCGATTATCAGCTTGAACACGGAGGAGAATGGAAGCGAAGGAGGTTCGTTGCTAACAATAGCGACTCAAATCTCAGTAAGTTCCTCCCCCACTGCTGAAACTTCGTCGCAGTCTGTTGAGTACGTTAAAAATTATATATCACATACGTTACAAAATATTAAAGCAAAGCTACAAGGCGAGGATGATTAAATATCACCATTTCTCATTCTAAAATATTCATTTCAACATTTTTTTTTGCCAACCTTTGCAGTGATTTAGAATGTTGACTTTTTAGCCAAAAGGGGTTGAACTCGGCCAAAACGAGTTGAACTCGGCCGTAATTGACATGTATCATTAACTTAGAGGTTTGCTTGCTCGAATTCTCTAACTTTAAATATTATTTTTGTACTAAAAATAAAATTAGTCCATCGATTATAATAATTAGTAAGTTTACCGTTGTTGTTTCACATAATTCATTCTTTCAAGTTTTTCTATATTTTTTTTGGCTATCCCTACACAATCGAATCAAATGGGTTAGTGATGCAATGATTTTATTGGTTATAAATCAGTTCAATACTATATGTCGTCGCAGGTTTGAGCCCGAGAATCGGTATTTATCCCACCTCCCCTGATATCTACTCAAAAAAAAAAAATCCGTACACTGTCGAAGTTATTTGTAGTTGAAGAAACTTGAATCTTGTATTTTGCTAATATTTTTGTTATATAAATTTTTGTTTTTGCTTTATTTAAGCACTGCCACAACCTCGCAATCCCCATTGAAAAAACAATTGGACCGAGACATATAGAATATTGTTTAAT

Coding sequence (CDS)

ATGGCCGTCGCCATGTCCGGCAACCACCAACCTCGCCGTGTTATCATGAACGTTCCTTCTTCTCCGGCTCTCTCTCTCACGCTTGCTGGAGGTTTTGGCAATGCGGCGGCGGCGAGGAGAGACGATAGCTGCAGTGACAACTCCGATCCGGCGGGGTCGAGATCGGCGGAGGATTTGTGCATCGATCTCGACGATGAGGATGAAGATAAACTACAAGGAAATAGGAAAAAGAGAAAGAATCGGCATAGTTCAGAGCAAATCAGAGAAATGGAGAAAGTGTTTAAAGAATCGCCTCATCCAGATGAGAAACAGAGGCAGTTACTCAGTGAGAAACTCGGTCTTCATTCCAAGCAAATCAAATTCTGGTTTCAGAATCGTAGAACTCAAATCAAGGCCAGTCTCGAGCGACAGGAAATGGAGAAATTGCGCAAGGAAAATCAATCCATGAAGGAAATGCTCGACAAATCCTCTTGTCCTAAATGTTGCTCTTCAGCTACATCAATCTTCAGTTCTGAACAACAACAATCGCAATTAGTTACTGAAATTGTTCGACTCAAAGCCGAGCTCGAGAGATTACGAGCGGCTCTGGAGGAATACGGTCCGGCTAGGGCGTCCACGTCGCGATCGAACGAGGAGGGGATTTTCGGAGCGGAGAAAGCTAGGGTTATGGAGATGGCGAATCGAGCGGTTGAGGAGGTTGTGAAAATGGCGGATTCCGGCGAGCCGCTGTGGGTTCGGAGCTTCGACGCCGGTCGAGAACTGCTGAATTACGACGAGTATGTGAAGGAATTCGGCGCCGGCGACGAGCGGCCGGGAAGAAAAATCGAAGCGTCGAGAGATTCCTGCGTGTTGTTCGTCGATCTTCAGCAGTTGGTTCAGAGCTTCATGGATGTGGTTCAGTGGAAAGAAATGTTCCCGTCGATAATCTCGAAGGCGTCGACTGTGGAGGTCGTCTGCAATGGCGACGGGGCCACCTGTAATGGCGCAGTTCAACTGATGTATGCCGAGCTTCAAATGCTAACTCCCAGCATCCCTCCCCGAGAGGTGCATTTCATTCGAAGTTGCAAGCAAATAAGCCCTCAAAAGTGGGTCATTGCCGATGTTTCCGTCTCGAAACTCGGCGATGGTATCGACTCATCATCATCCTCTTCCTCCCTTTGCGCCAAACGCCCATCCGGTTGCATCATTCACGACACTTCTAACGGCCGTTGTAAGGTCACTTGGATAGAGCATTGGAAATGCCCAAAGATCGGGCTTCGCACGATATATCGTACCGTTATTGACAACGGTTCGATGTTCGGGGCGAGGCATTGGATAGCGACGCTACAAATCCATTGTGAGCGGCAAGTTTTCTTCATGGCTACCAACGTTCCTATGAAGGATTCCACTGGAATTACCACAACTGGTGGAAGAAAAAGCGTTTTTCAATTGGCACAAAGAATGACGTCTGGTATTTATCAATCCATTGGGGCTTCGAGCAGCCATACGTGGGCCAAGGTCCAGAGCAAAATAGGCGAAAACATTAGGGTTGCTTCAAGGAAGAACTTGAATGATCCACGAGAACCTCTTGGTTTGATCTTGTGTGCAGCTGCTTCTGTCTGGCTACCTGTTTCACCTAAAGTCTTGTTTGAATTCTTGATCGACGAGTCTCGACGAATCGAGTGGGATGTGATGTTAAGCAGTCCAGTGGAAACAGTAGCAACCTTTGCCAAAGGACAAAATCGAGGCAATGCTGTAACCATCCAAGCAACAAAATCAGATGAAACCAACATGTGGATTCTACAAGACAGTCTAACAAATGAATATGAATCGACAGTGGTCTATGCTCAAGTTGACATTACGAGCATGCGGACGGTGATGGCAGGGTCCGACCCGAGCAGCATCACAATGTTACCGATGGGGTTTTCGATCCTCCCGGATGGACACTCGCCGAGGCCATCGATTATCAGCTTGAACACGGAGGAGAATGGAAGCGAAGGAGGTTCGTTGCTAACAATAGC
GACTCAAATCTCAGTAAGTTCCTCCCCCACTGCTGAAACTTCGTCGCAGTCTGTTGAGTACGTTAAAAATTATATATCACATACGTTACAAAATATTAAAGCAAAGCTACAAGGCGAGGATGATTAA

Protein sequence

MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNAAAARRDDSCSDNSDPAGSRSAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGLHSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSIFSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNEEGIFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKGQNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYISHTLQNIKAKLQGEDD
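
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that step, assuming the standard nuclear genetic code (NCBI translation table 1); the function name translate is hypothetical, and the input is only the first few codons and the last three codons of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; stop at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCGTCGCCATGTCCGGCAACCAC"))  # first nine codons of the CDS
print(translate("GATGATTAA"))                    # last three codons of the CDS
```

The two calls give MAVAMSGNH and DD, which agree with the start and end of the protein sequence listed here.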
Homology
BLAST of Sed0004043 vs. NCBI nr
Match: XP_023000325.1 (homeobox-leucine zipper protein GLABRA 2-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1032.7 bits (2669), Expect = 1.5e-297
Identity = 547/736 (74.32%), Positives = 621/736 (84.38%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDMGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGL 120
           SAEDL  + DDEDEDKL GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE LGL
Sbjct: 61  SAEDLGAEPDDEDEDKLLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENLGL 120

Query: 121 HSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI------ 180
           HSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSC KCCSSA SI      
Sbjct: 121 HSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCLKCCSSANSIAISMDS 180

Query: 181 --FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG-------- 240
              SS+Q+Q QLVTEIVRLKAE+E LR ALE+Y PA  S SRS E     EG        
Sbjct: 181 MFTSSDQKQQQLVTEIVRLKAEVEGLRTALEQYAPAGTSRSRSGENEDAIEGRRNLEKSK 240

Query: 241 -IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPG 300
            IFG EKARVME+A +A+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F    E+P 
Sbjct: 241 RIFGLEKARVMEIAMKAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEHEQPE 300

Query: 301 RKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQL 360
            +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGA+QL
Sbjct: 301 GEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGAIQL 360

Query: 361 MYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPS 420
           M+AELQMLTP  PPREV+FIR+CK++S +KWV+ADVS++ +GD ID   SSSS C KRPS
Sbjct: 361 MFAELQMLTPVFPPREVYFIRTCKRLSLEKWVVADVSINNVGDSID---SSSSFCRKRPS 420

Query: 421 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 480
           GCII DTSNG CKV  +EHW+C K  LRT+Y+T++++G +FGARHW+AT+Q HCE QVF+
Sbjct: 421 GCIIEDTSNGHCKVIVLEHWECQKTKLRTMYQTMVNSGLIFGARHWMATMQTHCEWQVFY 480

Query: 481 MATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 540
           MATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ++GASSSHTW KVQSKIGE IRVAS
Sbjct: 481 MATNVPMKDSTGITTLGGRKSVLRLAQRMTSSIYQAMGASSSHTWTKVQSKIGEIIRVAS 540

Query: 541 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKG 600
           RKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWD MLS PVET+A FAKG
Sbjct: 541 RKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDGMLSGPVETLAVFAKG 600

Query: 601 QNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLP 660
           QNRGNAVTIQA KSDE N WI+QDSLTN+YES+V+YAQ+DITSM++VMAG DPS+IT LP
Sbjct: 601 QNRGNAVTIQAIKSDEPNKWIIQDSLTNDYESSVIYAQIDITSMQSVMAGCDPSTITTLP 660

Query: 661 MGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYI 709
           MGFSILPDGH PR S+IS + EE  +EGGSLLTIATQI VSSS TAE +SQS EYV ++I
Sbjct: 661 MGFSILPDGHPPRASVISKSKEERVTEGGSLLTIATQIPVSSSGTAEKTSQSAEYVNDFI 720
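
Each HSP summary line reports identical and similar positions as counts over the alignment length, followed by the percentage. As a rough sketch, the counts can be pulled out of such a line and the percentages recomputed. The function names hsp_fractions and percent are hypothetical, and the pattern simply collects every "n/m" pair, so it does not depend on how the labels are spelled.

```python
import re

def hsp_fractions(line):
    """Return every 'n/m' pair on an HSP summary line as integer tuples."""
    return [(int(n), int(m)) for n, m in re.findall(r"(\d+)/(\d+)", line)]

def percent(count, length):
    """Percentage of alignment columns counted (identical or positive)."""
    return 100.0 * count / length

line = "Identity = 547/736 (74.32%), Positives = 621/736 (84.38%), Query Frame = 0"
for count, length in hsp_fractions(line):
    print(f"{count}/{length} = {percent(count, length):.2f}%")
```

The denominator is the alignment length including gap columns, which is why it (736) exceeds the length of the query segment shown (709 residues).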

BLAST of Sed0004043 vs. NCBI nr
Match: KAG7025809.1 (Homeobox-leucine zipper protein GLABRA 2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1031.6 bits (2666), Expect = 3.3e-297
Identity = 544/736 (73.91%), Positives = 618/736 (83.97%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDIGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGL 120
           SAEDL ++ DDEDEDK  GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE LGL
Sbjct: 61  SAEDLGVEPDDEDEDKGLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENLGL 120

Query: 121 HSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI------ 180
           HSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSCPKCCSSA +I      
Sbjct: 121 HSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCPKCCSSANAIAISMDS 180

Query: 181 --FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG-------- 240
              SS+Q+Q QLVTEIVRLKAE+E LR ALE+Y PA  S SRS E     EG        
Sbjct: 181 MFTSSDQKQQQLVTEIVRLKAEVEGLRTALEQYAPAGTSRSRSGENEDAIEGRRNLEKSK 240

Query: 241 -IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPG 300
            IFG EKARVME+A +A+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F    E+P 
Sbjct: 241 RIFGLEKARVMEIAKKAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEHEQPK 300

Query: 301 RKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQL 360
            +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGAVQL
Sbjct: 301 GEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGAVQL 360

Query: 361 MYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPS 420
           M+AELQMLTP  PPREV+FIR+CK++SPQKWV+ADVS++ +GD ID   SS S C KRPS
Sbjct: 361 MFAELQMLTPVFPPREVYFIRTCKRLSPQKWVVADVSINNVGDSID---SSPSFCRKRPS 420

Query: 421 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 480
           GCII DTSNG CKV  +EHW+C K  LRT+YRT++++G +FGARHW+AT+Q HCE QVF+
Sbjct: 421 GCIIEDTSNGHCKVIVLEHWECQKTKLRTMYRTIVNSGLIFGARHWMATMQTHCEWQVFY 480

Query: 481 MATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 540
           MATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ++GASSSHTW KVQSKIGENIRVAS
Sbjct: 481 MATNVPMKDSTGITTLGGRKSVLRLAQRMTSSIYQAMGASSSHTWTKVQSKIGENIRVAS 540

Query: 541 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKG 600
           RKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWDVMLS PVET+A FAKG
Sbjct: 541 RKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDVMLSGPVETLAVFAKG 600

Query: 601 QNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLP 660
           QNRGNAVTIQA KSDE+N WI+QDSLTN+YESTV+YAQ+DITSM++VMAG DPS+IT LP
Sbjct: 601 QNRGNAVTIQAIKSDESNKWIIQDSLTNDYESTVIYAQIDITSMQSVMAGCDPSTITTLP 660

Query: 661 MGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYI 709
           MGFSILPDGH+PR S+IS + EE  +EGGSLLTIATQ           +SQS EYV ++I
Sbjct: 661 MGFSILPDGHTPRASVISKSKEERVTEGGSLLTIATQ-----------TSQSAEYVNDFI 720

BLAST of Sed0004043 vs. NCBI nr
Match: XP_022963997.1 (homeobox-leucine zipper protein GLABRA 2-like [Cucurbita moschata])

HSP 1 Score: 1030.0 bits (2662), Expect = 9.5e-297
Identity = 543/736 (73.78%), Positives = 616/736 (83.70%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDMGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGL 120
           SAEDL ++ DDEDEDK  GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE LGL
Sbjct: 61  SAEDLGVEPDDEDEDKGLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENLGL 120

Query: 121 HSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI------ 180
           HSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSCPKCCSSA SI      
Sbjct: 121 HSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCPKCCSSANSIAISMDS 180

Query: 181 --FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG-------- 240
              SS+Q Q QLVTEIVRLKAE+E LR ALE+Y PA  S SRS E     EG        
Sbjct: 181 MFTSSDQNQQQLVTEIVRLKAEVEGLRTALEQYAPAGTSRSRSGENEDAIEGRRNLEKSK 240

Query: 241 -IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPG 300
            IFG EKARVME+A RA+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F    E+P 
Sbjct: 241 RIFGLEKARVMEIAKRAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEHEQPK 300

Query: 301 RKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQL 360
            +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGAVQL
Sbjct: 301 GEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGAVQL 360

Query: 361 MYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPS 420
           M+AELQMLTP  PPREV+FIR+CK++SP+KWV+ADVS++ +GD ID   SS S C KRPS
Sbjct: 361 MFAELQMLTPVFPPREVYFIRTCKRLSPEKWVVADVSINNVGDSID---SSPSFCRKRPS 420

Query: 421 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 480
           GCII DTSNG CKV  +EHW+C K  LRT+YRT++++G +FGARHW+AT+Q HCE QVF+
Sbjct: 421 GCIIEDTSNGHCKVIVLEHWECQKTKLRTMYRTIVNSGLIFGARHWMATMQTHCEWQVFY 480

Query: 481 MATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 540
           MATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ++GASSSHTW KVQSK+GENIRVAS
Sbjct: 481 MATNVPMKDSTGITTLGGRKSVLRLAQRMTSSIYQAMGASSSHTWTKVQSKMGENIRVAS 540

Query: 541 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKG 600
           RKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWDVMLS PVET+A FAKG
Sbjct: 541 RKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDVMLSGPVETLAVFAKG 600

Query: 601 QNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLP 660
           QNRGNAVTIQA KSDE+N WI+QDSLTN+YESTV+YAQ+DITSM++VMAG DPS+IT LP
Sbjct: 601 QNRGNAVTIQAIKSDESNKWIIQDSLTNDYESTVIYAQIDITSMQSVMAGCDPSTITTLP 660

Query: 661 MGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYI 709
           MGFSILPDGH PR S+IS + EE  +EGGSLLTIATQ           +SQS EYV +++
Sbjct: 661 MGFSILPDGHPPRASVISKSKEERVTEGGSLLTIATQ-----------TSQSAEYVNDFV 720

BLAST of Sed0004043 vs. NCBI nr
Match: XP_023000326.1 (homeobox-leucine zipper protein GLABRA 2-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 1014.6 bits (2622), Expect = 4.1e-292
Identity = 539/736 (73.23%), Positives = 613/736 (83.29%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDMGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGL 120
           SAEDL  + DDEDEDKL GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE LGL
Sbjct: 61  SAEDLGAEPDDEDEDKLLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENLGL 120

Query: 121 HSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI------ 180
           HSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSC KCCSSA SI      
Sbjct: 121 HSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCLKCCSSANSIAISMDS 180

Query: 181 --FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG-------- 240
              SS+Q+Q QLVTEIVRLKAE+E LR ALE+Y PA  S SRS E     EG        
Sbjct: 181 MFTSSDQKQQQLVTEIVRLKAEVEGLRTALEQYAPAGTSRSRSGENEDAIEGRRNLEKSK 240

Query: 241 -IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPG 300
            IFG EKARVME+A +A+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F    E+P 
Sbjct: 241 RIFGLEKARVMEIAMKAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEHEQPE 300

Query: 301 RKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQL 360
            +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGA+QL
Sbjct: 301 GEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGAIQL 360

Query: 361 MYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPS 420
           M+AELQMLTP  PPREV+FIR+CK++S +KWV+ADVS++ +GD ID   SSSS C KRPS
Sbjct: 361 MFAELQMLTPVFPPREVYFIRTCKRLSLEKWVVADVSINNVGDSID---SSSSFCRKRPS 420

Query: 421 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 480
           GCII DTSNG CKV  +EHW+C K  LRT+Y+T++++G +FGARHW+AT+Q HCE QVF+
Sbjct: 421 GCIIEDTSNGHCKVIVLEHWECQKTKLRTMYQTMVNSGLIFGARHWMATMQTHCEWQVFY 480

Query: 481 MATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 540
           MATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ++GASSSHTW KVQSKIGE IRVAS
Sbjct: 481 MATNVPMKDSTGITTLGGRKSVLRLAQRMTSSIYQAMGASSSHTWTKVQSKIGEIIRVAS 540

Query: 541 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKG 600
           RKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWD MLS PVET+A FAKG
Sbjct: 541 RKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDGMLSGPVETLAVFAKG 600

Query: 601 QNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLP 660
           QNRGNAVTIQA KSDE N WI+QDSLTN+YES+V+YAQ+DITSM++VMAG DPS+IT LP
Sbjct: 601 QNRGNAVTIQAIKSDEPNKWIIQDSLTNDYESSVIYAQIDITSMQSVMAGCDPSTITTLP 660

Query: 661 MGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYI 709
           MGFSILPDGH PR S+IS + EE  +EGGSLLTIATQ           +SQS EYV ++I
Sbjct: 661 MGFSILPDGHPPRASVISKSKEERVTEGGSLLTIATQ-----------TSQSAEYVNDFI 720

BLAST of Sed0004043 vs. NCBI nr
Match: XP_023513617.1 (homeobox-leucine zipper protein GLABRA 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1009.6 bits (2609), Expect = 1.3e-290
Identity = 536/739 (72.53%), Positives = 615/739 (83.22%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDMGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLD--DEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKL 120
           SAEDL ++ D  DEDEDK  GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE L
Sbjct: 61  SAEDLGVEPDDEDEDEDKGLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENL 120

Query: 121 GLHSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI---- 180
           GLHSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSC KCCSSA SI    
Sbjct: 121 GLHSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCLKCCSSANSIAISM 180

Query: 181 ----FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG------ 240
                SS+Q+Q QLVTEIVRLKAE+E L+ ALE+Y PA  S SR  E     EG      
Sbjct: 181 DSMFTSSDQKQQQLVTEIVRLKAEVEGLKTALEQYAPAGTSRSRLGESEDAIEGRRNLEK 240

Query: 241 ---IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEF-GAGDE 300
              IFG EKARVME+A +A+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F G  ++
Sbjct: 241 SKRIFGLEKARVMEIAKKAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEYEQ 300

Query: 301 RPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGA 360
           +P  +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGA
Sbjct: 301 QPKGEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGA 360

Query: 361 VQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAK 420
           VQLM+AELQMLTP  PPREV+FIR+CK++SP+KWV+ADVS++ +GD ID   SS S C K
Sbjct: 361 VQLMFAELQMLTPVFPPREVYFIRTCKRLSPEKWVVADVSINNVGDSID---SSPSFCRK 420

Query: 421 RPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQ 480
           RPSGCII DTSNG CKV  +EHW+C K  LRT+YRT++++G +FGARHW+AT+Q HCE Q
Sbjct: 421 RPSGCIIEDTSNGHCKVIVLEHWECQKTKLRTMYRTIVNSGLIFGARHWMATMQTHCEWQ 480

Query: 481 VFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIR 540
           VF+MATNVPMKDSTGITT GGRKSV +LAQRMTS +YQ++GASSSHTW KVQSKIGENIR
Sbjct: 481 VFYMATNVPMKDSTGITTLGGRKSVLRLAQRMTSSVYQAMGASSSHTWTKVQSKIGENIR 540

Query: 541 VASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATF 600
           VASRKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWDVMLS PVET+A F
Sbjct: 541 VASRKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDVMLSGPVETLAVF 600

Query: 601 AKGQNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSIT 660
           AKGQNRGNAVTIQA KSDE+N WI+QD+LTN+YESTV+YAQ+DITSM++VM G DPS+IT
Sbjct: 601 AKGQNRGNAVTIQAIKSDESNKWIIQDTLTNDYESTVIYAQIDITSMQSVMVGCDPSTIT 660

Query: 661 MLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVK 709
            LPMGFSILPDGH PR S+IS + EE  +EGGSLLTIATQ           +SQS EYV 
Sbjct: 661 TLPMGFSILPDGHPPRASVISKSKEERVTEGGSLLTIATQ-----------TSQSAEYVN 720

BLAST of Sed0004043 vs. ExPASy Swiss-Prot
Match: P46607 (Homeobox-leucine zipper protein GLABRA 2 OS=Arabidopsis thaliana OX=3702 GN=GL2 PE=1 SV=3)

HSP 1 Score: 674.1 bits (1738), Expect = 1.7e-192
Identity = 388/759 (51.12%), Positives = 516/759 (67.98%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNAAAA------------------RRDD 60
           MAV MS + QP +      SSPALSL+LAG F NA++                   R  +
Sbjct: 3   MAVDMS-SKQPTKDFF---SSPALSLSLAGIFRNASSGSTNPEEDFLGRRVVDDEDRTVE 62

Query: 61  SCSDNSDPAGSRSAEDL-CIDLDDEDEDKLQG-------NRKKRK--NRHSSEQIREMEK 120
             S+NS P  SRS EDL   D DDE+E++  G       N++KRK  +RH+++QIR ME 
Sbjct: 63  MSSENSGPTRSRSEEDLEGEDHDDEEEEEEDGAAGNKGTNKRKRKKYHRHTTDQIRHMEA 122

Query: 121 VFKESPHPDEKQRQLLSEKLGLHSKQIKFWFQNRRTQIKA-------SLERQEMEKLRKE 180
           +FKE+PHPDEKQRQ LS++LGL  +Q+KFWFQNRRTQIKA       SL + E+EKLR+E
Sbjct: 123 LFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIKAIQERHENSLLKAELEKLREE 182

Query: 181 NQSMKEMLDK--SSCPKCCSSATSIFSSEQQQSQLVTEIVRLKAELERLRAALEEYG-PA 240
           N++M+E   K  SSCP C                L  E  +LKAEL++LRAAL     P 
Sbjct: 183 NKAMRESFSKANSSCPNCGGG----------PDDLHLENSKLKAELDKLRAALGRTPYPL 242

Query: 241 RASTSRSNEE---------GIFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRE 300
           +AS S   E          G+F  EK+R+ E++NRA  E+ KMA SGEP+W+RS + GRE
Sbjct: 243 QASCSDDQEHRLGSLDFYTGVFALEKSRIAEISNRATLELQKMATSGEPMWLRSVETGRE 302

Query: 301 LLNYDEYVKEF--GAGDERPGRK-IEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIIS 360
           +LNYDEY+KEF        PGRK IEASRD+ ++F+D  +L QSFMDV QWKE F  +IS
Sbjct: 303 ILNYDEYLKEFPQAQASSFPGRKTIEASRDAGIVFMDAHKLAQSFMDVGQWKETFACLIS 362

Query: 361 KASTVEVVCNGDG-ATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVS 420
           KA+TV+V+  G+G +  +GA+QLM+ E+Q+LTP +P REV+F+RSC+Q+SP+KW I DVS
Sbjct: 363 KAATVDVIRQGEGPSRIDGAIQLMFGEMQLLTPVVPTREVYFVRSCRQLSPEKWAIVDVS 422

Query: 421 VSKLGDGIDSSSSSSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDN 480
           VS + D      +S   C K PSGCII DTSNG  KVTW+EH       ++ ++R++++ 
Sbjct: 423 VS-VEDSNTEKEASLLKCRKLPSGCIIEDTSNGHSKVTWVEHLDVSASTVQPLFRSLVNT 482

Query: 481 GSMFGARHWIATLQIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSI 540
           G  FGARHW+ATLQ+HCER VFFMATNVP KDS G+TT  GRKSV ++AQRMT   Y++I
Sbjct: 483 GLAFGARHWVATLQLHCERLVFFMATNVPTKDSLGVTTLAGRKSVLKMAQRMTQSFYRAI 542

Query: 541 GASSSHTWAKVQSKIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDE 600
            ASS H W K+ +K G+++RV+SRKNL+DP EP G+I+CA++S+WLPVSP +LF+F  DE
Sbjct: 543 AASSYHQWTKITTKTGQDMRVSSRKNLHDPGEPTGVIVCASSSLWLPVSPALLFDFFRDE 602

Query: 601 SRRIEWDVMLS-SPVETVATFAKGQNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVY 660
           +RR EWD + + + V+++A  +KGQ+RGN+V IQ  KS E ++W+LQDS TN YES VVY
Sbjct: 603 ARRHEWDALSNGAHVQSIANLSKGQDRGNSVAIQTVKSREKSIWVLQDSSTNSYESVVVY 662

Query: 661 AQVDITSMRTVMAGSDPSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIAT 708
           A VDI + + V+AG DPS+I +LP GFSI+PDG   RP +I+   ++  S+GGSLLT+A 
Sbjct: 663 APVDINTTQLVLAGHDPSNIQILPSGFSIIPDGVESRPLVITSTQDDRNSQGGSLLTLAL 722

BLAST of Sed0004043 vs. ExPASy Swiss-Prot
Match: Q5JMF3 (Homeobox-leucine zipper protein ROC9 OS=Oryza sativa subsp. japonica OX=39947 GN=ROC9 PE=2 SV=1)

HSP 1 Score: 515.4 bits (1326), Expect = 1.0e-144
Identity = 330/809 (40.79%), Positives = 474/809 (58.59%), Query Frame = 0

Query: 7   GNHQPRRVIMNVPSSPALSLTLAGGFG--NAAAARRDDS------------------CSD 66
           G ++PR    +  ++PALSLTLAG FG  N  AA   D                    S+
Sbjct: 2   GTNRPRPRTKDFFAAPALSLTLAGVFGRKNGPAASGGDGVEEGDEEVQAAGEAAVEISSE 61

Query: 67  NSDP------AGSRSAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHP 126
           N+ P      +G  S ED   D DD+ E   +  R+K  +RH++EQIR ME +FKESPHP
Sbjct: 62  NAGPGCRQSQSGGGSGEDGGHD-DDDGEGSNKKRRRKNYHRHTAEQIRIMEALFKESPHP 121

Query: 127 DEKQRQLLSEKLGLHSKQIKFWFQNRRTQIKA-------SLERQEMEKLRKENQSMKEML 186
           DE+QRQ +S++LGL ++Q+KFWFQNRRTQIKA       SL + E+EKL+ E+++M+E+ 
Sbjct: 122 DERQRQQVSKQLGLSARQVKFWFQNRRTQIKAVQERHENSLLKSELEKLQDEHRAMRELA 181

Query: 187 DK-SSCPKCCSSATS------IFSSEQQQSQLVTEIVRLKAE------------------ 246
            K S C  C   ATS        +++ ++ +L  E  +LKAE                  
Sbjct: 182 KKPSRCLNCGVVATSSDAAAAATAADTREQRLRLEKAKLKAEVCMPPPRSRARPFRCATL 241

Query: 247 ---------------LERLRAA-------------LEEYGPARASTSRS----NEEGIF- 306
                          +ERLR                     A  + SRS    + +G F 
Sbjct: 242 QDTDSGELAMLNLFQIERLRGTPGKSAADGIASPPCSASAGAMQTNSRSPPLHDHDGGFL 301

Query: 307 --GAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEF-----GAGD 366
               +K R++E+A RA++E+V M  SGEP+WVR  + GR++LNYDEYV+ F     G+GD
Sbjct: 302 RHDDDKPRILELATRALDELVGMCSSGEPVWVRGVETGRDILNYDEYVRLFRRDHGGSGD 361

Query: 367 ERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNG 426
           +  G  +EASR+  ++++D   LV +FMDV +WK++FP++ISKA+T+E++ N +    +G
Sbjct: 362 QMAGWTVEASRECGLVYLDTMHLVHTFMDVDKWKDLFPTMISKAATLEMISNREDDGRDG 421

Query: 427 AVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCA 486
            +QLMYAELQ LTP +P RE++F R CK+++ ++W I DVS  +   G+ +SS+    C 
Sbjct: 422 VLQLMYAELQTLTPMVPTRELYFARYCKKLAAERWAIVDVSFDESETGVHASSAVR--CW 481

Query: 487 KRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCER 546
           K PSGC+I + +NGRCK+TW+EH +C +  +  +YR V  +G  FGAR W+A LQ+ CER
Sbjct: 482 KNPSGCLIEEQNNGRCKMTWVEHTRCRRCTVAPLYRAVTASGVAFGARRWVAALQLQCER 541

Query: 547 QVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIG--- 606
            VF +ATNVP +DSTG++T  GR+SV +LA RMTS + ++ G S    W +         
Sbjct: 542 MVFAVATNVPTRDSTGVSTLAGRRSVLKLAHRMTSSLCRTTGGSCDMAWRRAPKGGSGGG 601

Query: 607 --ENIRVASRKNL-NDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVML-SS 666
             ++I + SR+N  +DP EP GLI CAAAS WLPV+P  L + L DESRR EWDVML   
Sbjct: 602 GDDDIWLTSRENAGDDPGEPQGLIACAAASTWLPVNPTALLDLLRDESRRPEWDVMLPGK 661

Query: 667 PVETVATFAKGQNRGNAVTIQATKSDET----NMWILQDSLTNEYESTVVYAQVDITSMR 704
            V++    AKG++R N VT  A + +E       W+LQD  TN  EST+ YA +D  +++
Sbjct: 662 SVQSRVNLAKGKDRTNCVTAYAARPEEEEERGGKWVLQDVCTNPCESTIAYAAIDAAALQ 721

BLAST of Sed0004043 vs. ExPASy Swiss-Prot
Match: A2YR02 (Homeobox-leucine zipper protein ROC7 OS=Oryza sativa subsp. indica OX=39946 GN=ROC7 PE=3 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 3.8e-123
Identity = 277/680 (40.74%), Positives = 397/680 (58.38%), Query Frame = 0

Query: 66  EDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGLHSKQIKFWFQN 125
           +D+D  Q  RKKR +RH+  QI+E+E  FKE PHPD+KQR+ LS +LGL   Q+KFWFQN
Sbjct: 79  DDQDPNQRPRKKRYHRHTQHQIQELEAFFKECPHPDDKQRKELSRELGLEPLQVKFWFQN 138

Query: 126 RRTQIKASLERQ-------EMEKLRKENQSMKEMLDKSSCPKCCSSATSIFSSEQQQSQL 185
           +RTQ+K   ER        E EKLR EN   KE L  +SCP C   A +I      +  L
Sbjct: 139 KRTQMKTQHERHENNALRAENEKLRAENMRYKEALANASCPNCGGPA-AIGEMSFDEHHL 198

Query: 186 VTEIVRLKAELERLRAALEEY--GPARASTSR------SNEE------------GIFGA- 245
             E  RL+ E++R+ A   +Y   PA A ++       SN               +FGA 
Sbjct: 199 RLENARLRDEIDRISAIAAKYVGKPAAAVSAAYPPLPPSNRSPLDHMGIPGAGADVFGAD 258

Query: 246 -EKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEF--GAGDERPGRK 305
            +K  V+E+A  A+EE+V+MA  GEPLW  +   G E L  +EY + F  G G + P  +
Sbjct: 259 FDKPLVIELAVAAMEELVRMAQLGEPLWAPAL--GGEALGEEEYARTFPRGLGPKSPELR 318

Query: 306 IEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQLMY 365
            EASR++ V+ ++   LV+  MDV QW  +F SI+S+A+T+EV+  G     NGA+QLM 
Sbjct: 319 SEASRETAVVIMNHVSLVEMLMDVGQWTALFSSIVSRAATLEVLSTGVAGNHNGALQLMS 378

Query: 366 AELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKL--GDGIDSSSSSSSLCAKRPS 425
           AE QM +P +P RE  F+R CKQ     W + DVS+  L  G G     +++    +RPS
Sbjct: 379 AEFQMPSPLVPTRETQFLRYCKQHPDGTWAVVDVSLDGLRAGAGGGCQPAAARGHRRRPS 438

Query: 426 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 485
           GC+I +  NG  KVTW+EH +     +  +Y+ V+++G  FGAR W+ATL+  CER    
Sbjct: 439 GCLIQEMPNGYSKVTWVEHVEADDQMVHNLYKPVVNSGMAFGARRWVATLERQCERLASA 498

Query: 486 MATNVPMKDSTG-ITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVA 545
           MA+NV      G ITT+ GR+S+ +LA+RM +     + AS++H W  +     E++RV 
Sbjct: 499 MASNVASSGDAGVITTSEGRRSMLKLAERMVASFCGGVTASTTHQWTTLSGSGAEDVRVM 558

Query: 546 SRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLS-SPVETVATFA 605
           +RK+++DP  P G++L AA S WLPV P  +F+FL D+S R EWD++ +   V+ +A  A
Sbjct: 559 TRKSVDDPGRPPGIVLNAATSFWLPVPPSRVFDFLRDDSTRSEWDILSNGGVVQEMAHIA 618

Query: 606 KGQNRGNAVT---IQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSS 665
            G++ GNAV+   +    S+++NM ILQ+  T+   S V+YA VD+ +M  V+ G DP  
Sbjct: 619 NGRDHGNAVSLLRVNNANSNQSNMLILQECCTDATGSYVIYAPVDVVAMNVVLNGGDPDY 678

Query: 666 ITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEY 708
           + +LP GF+ILPDG                  GGSLLT+A QI V S PTA+ S  SV  
Sbjct: 679 VALLPSGFAILPDGPD--------------GGGGSLLTVAFQILVDSVPTAKLSLGSVAT 738

BLAST of Sed0004043 vs. ExPASy Swiss-Prot
Match: A3BPF2 (Homeobox-leucine zipper protein ROC7 OS=Oryza sativa subsp. japonica OX=39947 GN=ROC7 PE=2 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 3.8e-123
Identity = 278/679 (40.94%), Positives = 396/679 (58.32%), Query Frame = 0

Query: 67  DEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGLHSKQIKFWFQNR 126
           D+D  Q  RKKR +RH+  QI+E+E  FKE PHPD+KQR+ LS +LGL   Q+KFWFQN+
Sbjct: 80  DQDPNQRPRKKRYHRHTQHQIQELEAFFKECPHPDDKQRKELSRELGLEPLQVKFWFQNK 139

Query: 127 RTQIKASLERQ-------EMEKLRKENQSMKEMLDKSSCPKCCSSATSIFSSEQQQSQLV 186
           RTQ+K   ER        E EKLR EN   KE L  +SCP C   A +I      +  L 
Sbjct: 140 RTQMKTQHERHENNALRAENEKLRAENMRYKEALANASCPNCGGPA-AIGEMSFDEHHLR 199

Query: 187 TEIVRLKAELERLRAALEEY--GPARASTSR------SNEE------------GIFGA-- 246
            E  RL+ E++R+ A   +Y   PA A ++       SN               +FGA  
Sbjct: 200 LENARLRDEIDRISAIAAKYVGKPAAAVSAAYPPLPPSNRSPLDHMGIPGAGADVFGADF 259

Query: 247 EKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEF--GAGDERPGRKI 306
           +K  V+E+A  A+EE+V+MA  GEPLW  +   G E L  +EY + F  G G + P  + 
Sbjct: 260 DKPLVIELAVAAMEELVRMAQLGEPLWAPAL--GGEALGEEEYARTFPRGLGPKSPELRS 319

Query: 307 EASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQLMYA 366
           EASR++ V+ ++   LV+  MDV QW  +F SI+S+A+T+EV+  G     NGA+QLM A
Sbjct: 320 EASRETAVVIMNHVSLVEMLMDVGQWTALFSSIVSRAATLEVLSTGVAGNHNGALQLMSA 379

Query: 367 ELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKL--GDGIDSSSSSSSLCAKRPSG 426
           E QM +P +P RE  F+R CKQ     W + DVS+  L  G G     +++    +RPSG
Sbjct: 380 EFQMPSPLVPTRETQFLRYCKQHPDGTWAVVDVSLDGLRAGAGGGCQPAAARGHRRRPSG 439

Query: 427 CIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFFM 486
           C+I +  NG  KVTW+EH +     +  +Y+ V+++G  FGAR W+ATL+  CER    M
Sbjct: 440 CLIQEMPNGYSKVTWVEHVEADDQMVHNLYKPVVNSGMAFGARRWVATLERQCERLASAM 499

Query: 487 ATNVPMKDSTG-ITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 546
           A+NV      G ITT+ GR+S+ +LA+RM +     + AS++H W  +     E++RV +
Sbjct: 500 ASNVASSGDAGVITTSEGRRSMLKLAERMVASFCGGVTASTTHQWTTLSGSGAEDVRVMT 559

Query: 547 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLS-SPVETVATFAK 606
           RK+++DP  P G+IL AA S WLPV P  +F+FL D+S R EWD++ +   V+ +A  A 
Sbjct: 560 RKSVDDPGRPPGIILNAATSFWLPVPPSRVFDFLRDDSTRSEWDILSNGGVVQEMAHIAN 619

Query: 607 GQNRGNAVT---IQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSI 666
           G++ GNAV+   +    S+++NM ILQ+  T+   S V+YA VD+ +M  V+ G DP  +
Sbjct: 620 GRDHGNAVSLLRVNNANSNQSNMLILQECCTDATGSYVIYAPVDVVAMNVVLNGGDPDYV 679

Query: 667 TMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYV 708
            +LP GF+ILPDG                  GGSLLT+A QI V S PTA+ S  SV  V
Sbjct: 680 ALLPSGFAILPDGPD--------------GGGGSLLTVAFQILVDSVPTAKLSLGSVATV 739

BLAST of Sed0004043 vs. ExPASy Swiss-Prot
Match: Q0WV12 (Homeobox-leucine zipper protein ANTHOCYANINLESS 2 OS=Arabidopsis thaliana OX=3702 GN=ANL2 PE=2 SV=1)

HSP 1 Score: 439.9 bits (1130), Expect = 5.5e-122
Identity = 273/700 (39.00%), Positives = 414/700 (59.14%), Query Frame = 0

Query: 53  SRSAEDLCIDLDDEDEDKL-QGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEK 112
           SRS  D    +  ED+D   +  RKKR +RH+ +QI+E+E +FKE PHPDEKQR  LS++
Sbjct: 111 SRSGSDNVEGISGEDQDAADKPPRKKRYHRHTPQQIQELESMFKECPHPDEKQRLELSKR 170

Query: 113 LGLHSKQIKFWFQNRRTQIKASLE-------RQEMEKLRKENQSMKEMLDKSSCPKCCSS 172
           L L ++Q+KFWFQNRRTQ+K  LE       RQE +KLR EN S++E +    C  C   
Sbjct: 171 LCLETRQVKFWFQNRRTQMKTQLERHENALLRQENDKLRAENMSIREAMRNPICTNCGGP 230

Query: 173 ATSIFSSEQQQSQLVTEIVRLKAELERLRAALEEY----------GPARASTSRSNEEGI 232
           A  +     ++  L  E  RLK EL+R+     ++               +   +N  G 
Sbjct: 231 A-MLGDVSLEEHHLRIENARLKDELDRVCNLTGKFLGHHHNHHYNSSLELAVGTNNNGGH 290

Query: 233 FG-------------------------AEKARVMEMANRAVEEVVKMADSGEPLWVRSFD 292
           F                           +K+ ++E+A  A++E+VK+A S EPLWV+S D
Sbjct: 291 FAFPPDFGGGGGCLPPQQQQSTVINGIDQKSVLLELALTAMDELVKLAQSEEPLWVKSLD 350

Query: 293 AGRELLNYDEYVKEFGAGDERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSII 352
             R+ LN DEY++ F +  +  G   EASR S ++ ++   LV++ MD  +W EMFP  +
Sbjct: 351 GERDELNQDEYMRTF-SSTKPTGLATEASRTSGMVIINSLALVETLMDSNRWTEMFPCNV 410

Query: 353 SKASTVEVVCNGDGATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVS 412
           ++A+T +V+  G   T NGA+QLM AELQ+L+P +P R V+F+R CKQ +   W + DVS
Sbjct: 411 ARATTTDVISGGMAGTINGALQLMNAELQVLSPLVPVRNVNFLRFCKQHAEGVWAVVDVS 470

Query: 413 VSKLGDGIDSSSSSSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDN 472
           +    D +  +S  + +  + PSGC++ D SNG  KVTW+EH +  +  +  +YR ++ +
Sbjct: 471 I----DPVRENSGGAPVIRRLPSGCVVQDVSNGYSKVTWVEHAEYDENQIHQLYRPLLRS 530

Query: 473 GSMFGARHWIATLQIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSI 532
           G  FG++ W+ATLQ  CE     ++++V   D+T I T GGRKS+ +LAQRMT      I
Sbjct: 531 GLGFGSQRWLATLQRQCECLAILISSSVTSHDNTSI-TPGGRKSMLKLAQRMTFNFCSGI 590

Query: 533 GASSSHTWAKVQ-SKIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLID 592
            A S H W+K+    +  ++RV +RK+++DP EP G++L AA SVWLP +P+ L++FL +
Sbjct: 591 SAPSVHNWSKLTVGNVDPDVRVMTRKSVDDPGEPPGIVLSAATSVWLPAAPQRLYDFLRN 650

Query: 593 ESRRIEWDVMLS-SPVETVATFAKGQNRG-NAVTIQATKSDETNMWILQDSLTNEYESTV 652
           E  R EWD++ +  P++ +A   KGQ++G + +   A  +++++M ILQ++  +   + V
Sbjct: 651 ERMRCEWDILSNGGPMQEMAHITKGQDQGVSLLRSNAMNANQSSMLILQETCIDASGALV 710

Query: 653 VYAQVDITSMRTVMAGSDPSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTI 707
           VYA VDI +M  VM G D S + +LP GF++LPDG        S + ++    GGSLLT+
Sbjct: 711 VYAPVDIPAMHVVMNGGDSSYVALLPSGFAVLPDGGIDGGG--SGDGDQRPVGGGSLLTV 770

BLAST of Sed0004043 vs. ExPASy TrEMBL
Match: A0A6J1KI04 (homeobox-leucine zipper protein GLABRA 2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111494591 PE=3 SV=1)

HSP 1 Score: 1032.7 bits (2669), Expect = 7.1e-298
Identity = 547/736 (74.32%), Positives = 621/736 (84.38%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDMGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGL 120
           SAEDL  + DDEDEDKL GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE LGL
Sbjct: 61  SAEDLGAEPDDEDEDKLLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENLGL 120

Query: 121 HSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI------ 180
           HSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSC KCCSSA SI      
Sbjct: 121 HSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCLKCCSSANSIAISMDS 180

Query: 181 --FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG-------- 240
              SS+Q+Q QLVTEIVRLKAE+E LR ALE+Y PA  S SRS E     EG        
Sbjct: 181 MFTSSDQKQQQLVTEIVRLKAEVEGLRTALEQYAPAGTSRSRSGENEDAIEGRRNLEKSK 240

Query: 241 -IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPG 300
            IFG EKARVME+A +A+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F    E+P 
Sbjct: 241 RIFGLEKARVMEIAMKAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEHEQPE 300

Query: 301 RKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQL 360
            +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGA+QL
Sbjct: 301 GEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGAIQL 360

Query: 361 MYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPS 420
           M+AELQMLTP  PPREV+FIR+CK++S +KWV+ADVS++ +GD ID   SSSS C KRPS
Sbjct: 361 MFAELQMLTPVFPPREVYFIRTCKRLSLEKWVVADVSINNVGDSID---SSSSFCRKRPS 420

Query: 421 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 480
           GCII DTSNG CKV  +EHW+C K  LRT+Y+T++++G +FGARHW+AT+Q HCE QVF+
Sbjct: 421 GCIIEDTSNGHCKVIVLEHWECQKTKLRTMYQTMVNSGLIFGARHWMATMQTHCEWQVFY 480

Query: 481 MATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 540
           MATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ++GASSSHTW KVQSKIGE IRVAS
Sbjct: 481 MATNVPMKDSTGITTLGGRKSVLRLAQRMTSSIYQAMGASSSHTWTKVQSKIGEIIRVAS 540

Query: 541 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKG 600
           RKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWD MLS PVET+A FAKG
Sbjct: 541 RKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDGMLSGPVETLAVFAKG 600

Query: 601 QNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLP 660
           QNRGNAVTIQA KSDE N WI+QDSLTN+YES+V+YAQ+DITSM++VMAG DPS+IT LP
Sbjct: 601 QNRGNAVTIQAIKSDEPNKWIIQDSLTNDYESSVIYAQIDITSMQSVMAGCDPSTITTLP 660

Query: 661 MGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYI 709
           MGFSILPDGH PR S+IS + EE  +EGGSLLTIATQI VSSS TAE +SQS EYV ++I
Sbjct: 661 MGFSILPDGHPPRASVISKSKEERVTEGGSLLTIATQIPVSSSGTAEKTSQSAEYVNDFI 720

BLAST of Sed0004043 vs. ExPASy TrEMBL
Match: A0A6J1HGN1 (homeobox-leucine zipper protein GLABRA 2-like OS=Cucurbita moschata OX=3662 GN=LOC111464150 PE=3 SV=1)

HSP 1 Score: 1030.0 bits (2662), Expect = 4.6e-297
Identity = 543/736 (73.78%), Positives = 616/736 (83.70%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDMGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGL 120
           SAEDL ++ DDEDEDK  GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE LGL
Sbjct: 61  SAEDLGVEPDDEDEDKGLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENLGL 120

Query: 121 HSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI------ 180
           HSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSCPKCCSSA SI      
Sbjct: 121 HSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCPKCCSSANSIAISMDS 180

Query: 181 --FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG-------- 240
              SS+Q Q QLVTEIVRLKAE+E LR ALE+Y PA  S SRS E     EG        
Sbjct: 181 MFTSSDQNQQQLVTEIVRLKAEVEGLRTALEQYAPAGTSRSRSGENEDAIEGRRNLEKSK 240

Query: 241 -IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPG 300
            IFG EKARVME+A RA+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F    E+P 
Sbjct: 241 RIFGLEKARVMEIAKRAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEHEQPK 300

Query: 301 RKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQL 360
            +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGAVQL
Sbjct: 301 GEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGAVQL 360

Query: 361 MYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPS 420
           M+AELQMLTP  PPREV+FIR+CK++SP+KWV+ADVS++ +GD ID   SS S C KRPS
Sbjct: 361 MFAELQMLTPVFPPREVYFIRTCKRLSPEKWVVADVSINNVGDSID---SSPSFCRKRPS 420

Query: 421 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 480
           GCII DTSNG CKV  +EHW+C K  LRT+YRT++++G +FGARHW+AT+Q HCE QVF+
Sbjct: 421 GCIIEDTSNGHCKVIVLEHWECQKTKLRTMYRTIVNSGLIFGARHWMATMQTHCEWQVFY 480

Query: 481 MATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 540
           MATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ++GASSSHTW KVQSK+GENIRVAS
Sbjct: 481 MATNVPMKDSTGITTLGGRKSVLRLAQRMTSSIYQAMGASSSHTWTKVQSKMGENIRVAS 540

Query: 541 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKG 600
           RKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWDVMLS PVET+A FAKG
Sbjct: 541 RKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDVMLSGPVETLAVFAKG 600

Query: 601 QNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLP 660
           QNRGNAVTIQA KSDE+N WI+QDSLTN+YESTV+YAQ+DITSM++VMAG DPS+IT LP
Sbjct: 601 QNRGNAVTIQAIKSDESNKWIIQDSLTNDYESTVIYAQIDITSMQSVMAGCDPSTITTLP 660

Query: 661 MGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYI 709
           MGFSILPDGH PR S+IS + EE  +EGGSLLTIATQ           +SQS EYV +++
Sbjct: 661 MGFSILPDGHPPRASVISKSKEERVTEGGSLLTIATQ-----------TSQSAEYVNDFV 720

BLAST of Sed0004043 vs. ExPASy TrEMBL
Match: A0A6J1KFK3 (homeobox-leucine zipper protein GLABRA 2-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111494591 PE=3 SV=1)

HSP 1 Score: 1014.6 bits (2622), Expect = 2.0e-292
Identity = 539/736 (73.23%), Positives = 613/736 (83.29%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNA------AAARRDDSCSDNSDPAGSR 60
           MAV MS N +P R +MNVPSSPALSLTLAG F NA      A  RR+DSCSDNS+PAGSR
Sbjct: 1   MAVVMSDN-RPSRRVMNVPSSPALSLTLAGVFQNAVDMGADAPVRREDSCSDNSEPAGSR 60

Query: 61  SAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGL 120
           SAEDL  + DDEDEDKL GNRKKRKNRH+SEQIREMEKVFKESPHPDEKQRQ LSE LGL
Sbjct: 61  SAEDLGAEPDDEDEDKLLGNRKKRKNRHTSEQIREMEKVFKESPHPDEKQRQKLSENLGL 120

Query: 121 HSKQIKFWFQNRRTQIKASLERQEMEKLRKENQSMKEMLDKSSCPKCCSSATSI------ 180
           HSKQIKFWFQNRRTQIK SLERQEMEKLR+ENQ++KEM++KSSC KCCSSA SI      
Sbjct: 121 HSKQIKFWFQNRRTQIKVSLERQEMEKLREENQALKEMINKSSCLKCCSSANSIAISMDS 180

Query: 181 --FSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNE-----EG-------- 240
              SS+Q+Q QLVTEIVRLKAE+E LR ALE+Y PA  S SRS E     EG        
Sbjct: 181 MFTSSDQKQQQLVTEIVRLKAEVEGLRTALEQYAPAGTSRSRSGENEDAIEGRRNLEKSK 240

Query: 241 -IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFGAGDERPG 300
            IFG EKARVME+A +A+EEVVKMADSGEPLW+RSF+ GRELLNYDEY+K+F    E+P 
Sbjct: 241 RIFGLEKARVMEIAMKAIEEVVKMADSGEPLWIRSFETGRELLNYDEYMKQFAGEHEQPE 300

Query: 301 RKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNGAVQL 360
            +IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKAST+EVVCNGDG+T NGA+QL
Sbjct: 301 GEIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKASTIEVVCNGDGSTRNGAIQL 360

Query: 361 MYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCAKRPS 420
           M+AELQMLTP  PPREV+FIR+CK++S +KWV+ADVS++ +GD ID   SSSS C KRPS
Sbjct: 361 MFAELQMLTPVFPPREVYFIRTCKRLSLEKWVVADVSINNVGDSID---SSSSFCRKRPS 420

Query: 421 GCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCERQVFF 480
           GCII DTSNG CKV  +EHW+C K  LRT+Y+T++++G +FGARHW+AT+Q HCE QVF+
Sbjct: 421 GCIIEDTSNGHCKVIVLEHWECQKTKLRTMYQTMVNSGLIFGARHWMATMQTHCEWQVFY 480

Query: 481 MATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENIRVAS 540
           MATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ++GASSSHTW KVQSKIGE IRVAS
Sbjct: 481 MATNVPMKDSTGITTLGGRKSVLRLAQRMTSSIYQAMGASSSHTWTKVQSKIGEIIRVAS 540

Query: 541 RKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSSPVETVATFAKG 600
           RKNLNDPREPLGLILCA ASVWLP+SPK+LFEFLI+ SRRIEWD MLS PVET+A FAKG
Sbjct: 541 RKNLNDPREPLGLILCAVASVWLPISPKLLFEFLINGSRRIEWDGMLSGPVETLAVFAKG 600

Query: 601 QNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSDPSSITMLP 660
           QNRGNAVTIQA KSDE N WI+QDSLTN+YES+V+YAQ+DITSM++VMAG DPS+IT LP
Sbjct: 601 QNRGNAVTIQAIKSDEPNKWIIQDSLTNDYESSVIYAQIDITSMQSVMAGCDPSTITTLP 660

Query: 661 MGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQSVEYVKNYI 709
           MGFSILPDGH PR S+IS + EE  +EGGSLLTIATQ           +SQS EYV ++I
Sbjct: 661 MGFSILPDGHPPRASVISKSKEERVTEGGSLLTIATQ-----------TSQSAEYVNDFI 720

BLAST of Sed0004043 vs. ExPASy TrEMBL
Match: A0A6J1DQD4 (homeobox-leucine zipper protein GLABRA 2 OS=Momordica charantia OX=3673 GN=LOC111022137 PE=3 SV=1)

HSP 1 Score: 1003.8 bits (2594), Expect = 3.5e-289
Identity = 540/747 (72.29%), Positives = 608/747 (81.39%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNAAA--------ARRDDSCSDNSDPAG 60
           MAV MSGNH PRR + N PSSPALSLTLAG FGNAAA        ARRDDSCSDNS+PAG
Sbjct: 1   MAVVMSGNHPPRR-LNNAPSSPALSLTLAGVFGNAAAEDLEVDAPARRDDSCSDNSEPAG 60

Query: 61  SRSAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKL 120
           SRSAEDL  D DDEDEDK QGNRKKRKNRH+SEQIREME +FKESPHPDEKQR  LSEKL
Sbjct: 61  SRSAEDLGADQDDEDEDK-QGNRKKRKNRHTSEQIREMEMLFKESPHPDEKQRLQLSEKL 120

Query: 121 GLHSKQIKFWFQNRRTQIKASLERQ-------EMEKLRKENQSMKEMLDKSSCPKCCSSA 180
           GL SKQIKFWFQNRRTQIKA  ER        EMEK+R ENQ+M+E++ K SCPKC SS+
Sbjct: 121 GLSSKQIKFWFQNRRTQIKAIHERHENALLKGEMEKMRDENQAMREIISKGSCPKCGSSS 180

Query: 181 T--------SIFSSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRS--NEEG-- 240
                    +IF++  +Q QL TEI RLKAE+E LR AL +Y P    TSRS  NEEG  
Sbjct: 181 ANSSAVSRETIFTTISEQQQLRTEITRLKAEVETLRVALAKYAPPGTCTSRSTENEEGIL 240

Query: 241 -----------IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVK 300
                      IFG EKARVME+A RA +E+VKMADSGEPLWVRS + GRELLNYD Y+K
Sbjct: 241 ERRRSLEQSKTIFGLEKARVMEIAKRATDELVKMADSGEPLWVRSVETGRELLNYDVYMK 300

Query: 301 EFGAGDERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGD 360
           EF A +ERP R+IEASR+S V+FVDL +LVQSFMDVVQWKEMFPSIISKA+T+EVV NGD
Sbjct: 301 EFAADNERPKREIEASRESGVVFVDLHRLVQSFMDVVQWKEMFPSIISKATTMEVVSNGD 360

Query: 361 GATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSS 420
           GA  NGAVQLM+AELQMLTP++P REV+FIRSC Q+SP KWV+ADVSV K+GDGIDSSSS
Sbjct: 361 GAARNGAVQLMFAELQMLTPALPSREVYFIRSCTQVSPDKWVVADVSVDKVGDGIDSSSS 420

Query: 421 SSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATL 480
            S    KRPSGCII DTS+G CK+TW+EHW+C K+GLRTIYRT+I++G +FGA+HW+A+L
Sbjct: 421 VS---RKRPSGCIIQDTSDGHCKITWVEHWECQKMGLRTIYRTIINSGLIFGAKHWMASL 480

Query: 481 QIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQS 540
           Q HCE QVFFMATNVPMKDSTGITT GGRKSV +LAQRMTS  YQ+ GAS SH+W KV +
Sbjct: 481 QTHCEWQVFFMATNVPMKDSTGITTLGGRKSVLRLAQRMTSSFYQAFGASISHSWPKVPT 540

Query: 541 KIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVML-SS 600
           K GENIRVASRKNLNDPREPLGLILCA ASVWLPVSPKVLFEFLIDE+RR+EWDVM    
Sbjct: 541 KTGENIRVASRKNLNDPREPLGLILCAVASVWLPVSPKVLFEFLIDEARRLEWDVMSGGG 600

Query: 601 PVETVATFAKGQNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMA 660
             ET+  FAKGQNRGNAVTIQA KSDETN+W+LQDSLTNEYES VVYAQVDITSM++VMA
Sbjct: 601 SAETITNFAKGQNRGNAVTIQAIKSDETNVWVLQDSLTNEYESMVVYAQVDITSMKSVMA 660

Query: 661 GSDPSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETS 709
           G DP +IT LP GFSILPDGH  RP +IS + EE G+EGGSLLT+A+QI  S+S TAE +
Sbjct: 661 GCDPGNITTLPTGFSILPDGHQSRPMVISSSKEEKGAEGGSLLTMASQILASASQTAEMT 720

BLAST of Sed0004043 vs. ExPASy TrEMBL
Match: A0A0A0K6N0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447000 PE=3 SV=1)

HSP 1 Score: 982.2 bits (2538), Expect = 1.1e-282
Identity = 527/744 (70.83%), Positives = 601/744 (80.78%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNAA--------AARRDDSCSDNSDPAG 60
           MAV MS NH  RR +  +PSSPALSLTLAG FGNAA          RR+DSCSDNS+PAG
Sbjct: 1   MAVVMSDNHASRR-LKTLPSSPALSLTLAGVFGNAAPMDVEADTTGRREDSCSDNSEPAG 60

Query: 61  SRSAEDLCIDLDDEDEDKLQGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKL 120
           SRSAEDL +D DDEDEDKLQGN KKRKNRH+SEQIREME +FKESPHPDEKQRQ LSEKL
Sbjct: 61  SRSAEDLGVDPDDEDEDKLQGNTKKRKNRHTSEQIREMEMLFKESPHPDEKQRQQLSEKL 120

Query: 121 GLHSKQIKFWFQNRRTQIKASLERQ-------EMEKLRKENQSMKEMLDKSSCPK-CCSS 180
           GL  KQIKFWFQNRRTQIKA  ER        EMEKLR+ENQ+M+EM+ KSSC K CCS+
Sbjct: 121 GLSCKQIKFWFQNRRTQIKAIHERHENALLKGEMEKLREENQAMREMISKSSCTKGCCSA 180

Query: 181 AT----SIF-SSEQQQSQLVTEIVRLKAELERLRAALEEYGPARASTSRSNEEG------ 240
           +T    +IF +S+QQQ QLVTEI RLKAE+ERLR AL++Y P  A T  + EEG      
Sbjct: 181 STNSLDAIFTTSDQQQQQLVTEIARLKAEVERLRTALDKYAP--AGTENNKEEGGIERPG 240

Query: 241 --------IFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEFG 300
                   IFG EK RVM +  RA+EEVVKM DS EPLWVRS + GRELLNYD Y+KE  
Sbjct: 241 RNLEKSKSIFGLEKGRVMLIGKRAIEEVVKMGDSDEPLWVRSVETGRELLNYDVYMKELA 300

Query: 301 AGDERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGAT 360
            G+ER  R++EASR++ V+F DL +LVQSFMDVVQWKEMFPS+ISKAST+EVV NGDG  
Sbjct: 301 VGNERGKREVEASRETGVVFADLHRLVQSFMDVVQWKEMFPSMISKASTMEVVFNGDGNN 360

Query: 361 CNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSS 420
            +GAVQLM+AELQMLTP+IPPRE+ FIRSCKQ+SP KWV+ADVS+ K+GD +DSSSS   
Sbjct: 361 RDGAVQLMFAELQMLTPTIPPREIFFIRSCKQLSPGKWVVADVSIDKVGDHVDSSSSR-- 420

Query: 421 LCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIH 480
            C KRPSGCII D S+G CKVTW+EHW+C KIGL TIYRT++++G +FGA HW++TLQ+H
Sbjct: 421 -CRKRPSGCIIQDQSDGHCKVTWVEHWECHKIGLHTIYRTIVNSGLIFGATHWMSTLQMH 480

Query: 481 CERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIG 540
           CE QVFFMATNVPMKDSTGITT GGRKSV +LAQRMTS IYQ+IGAS+SHTW KVQSKIG
Sbjct: 481 CEWQVFFMATNVPMKDSTGITTVGGRKSVLRLAQRMTSSIYQAIGASNSHTWTKVQSKIG 540

Query: 541 ENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLSS-PVE 600
           E IR+ASRKNL +P EP GLILCA AS+WLPVSPK+LFEFLIDE+RR EWDVMLSS   E
Sbjct: 541 ETIRIASRKNLKNPHEPTGLILCAVASIWLPVSPKLLFEFLIDEARRPEWDVMLSSGQAE 600

Query: 601 TVATFAKGQNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSD 660
            +A FAKGQNRGNAVTIQA KSDETN WILQDSLTNEYESTVVYAQVD+  M++VMAG D
Sbjct: 601 MLANFAKGQNRGNAVTIQAVKSDETNKWILQDSLTNEYESTVVYAQVDMNGMKSVMAGFD 660

Query: 661 PSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTAETSSQS 709
             +IT LP GFSILPDGH  RP +IS + EE  + GGSLLT+A+QI VS SPTAET+SQS
Sbjct: 661 SGNITTLPTGFSILPDGHPTRPLVISSSKEERETRGGSLLTVASQILVSPSPTAETTSQS 720

BLAST of Sed0004043 vs. TAIR 10
Match: AT1G79840.1 (HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START domain)

HSP 1 Score: 674.1 bits (1738), Expect = 1.2e-193
Identity = 388/759 (51.12%), Positives = 516/759 (67.98%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNAAAA------------------RRDD 60
           MAV MS + QP +      SSPALSL+LAG F NA++                   R  +
Sbjct: 3   MAVDMS-SKQPTKDFF---SSPALSLSLAGIFRNASSGSTNPEEDFLGRRVVDDEDRTVE 62

Query: 61  SCSDNSDPAGSRSAEDL-CIDLDDEDEDKLQG-------NRKKRK--NRHSSEQIREMEK 120
             S+NS P  SRS EDL   D DDE+E++  G       N++KRK  +RH+++QIR ME 
Sbjct: 63  MSSENSGPTRSRSEEDLEGEDHDDEEEEEEDGAAGNKGTNKRKRKKYHRHTTDQIRHMEA 122

Query: 121 VFKESPHPDEKQRQLLSEKLGLHSKQIKFWFQNRRTQIKA-------SLERQEMEKLRKE 180
           +FKE+PHPDEKQRQ LS++LGL  +Q+KFWFQNRRTQIKA       SL + E+EKLR+E
Sbjct: 123 LFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIKAIQERHENSLLKAELEKLREE 182

Query: 181 NQSMKEMLDK--SSCPKCCSSATSIFSSEQQQSQLVTEIVRLKAELERLRAALEEYG-PA 240
           N++M+E   K  SSCP C                L  E  +LKAEL++LRAAL     P 
Sbjct: 183 NKAMRESFSKANSSCPNCGGG----------PDDLHLENSKLKAELDKLRAALGRTPYPL 242

Query: 241 RASTSRSNEE---------GIFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRE 300
           +AS S   E          G+F  EK+R+ E++NRA  E+ KMA SGEP+W+RS + GRE
Sbjct: 243 QASCSDDQEHRLGSLDFYTGVFALEKSRIAEISNRATLELQKMATSGEPMWLRSVETGRE 302

Query: 301 LLNYDEYVKEF--GAGDERPGRK-IEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIIS 360
           +LNYDEY+KEF        PGRK IEASRD+ ++F+D  +L QSFMDV QWKE F  +IS
Sbjct: 303 ILNYDEYLKEFPQAQASSFPGRKTIEASRDAGIVFMDAHKLAQSFMDVGQWKETFACLIS 362

Query: 361 KASTVEVVCNGDG-ATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVS 420
           KA+TV+V+  G+G +  +GA+QLM+ E+Q+LTP +P REV+F+RSC+Q+SP+KW I DVS
Sbjct: 363 KAATVDVIRQGEGPSRIDGAIQLMFGEMQLLTPVVPTREVYFVRSCRQLSPEKWAIVDVS 422

Query: 421 VSKLGDGIDSSSSSSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDN 480
           VS + D      +S   C K PSGCII DTSNG  KVTW+EH       ++ ++R++++ 
Sbjct: 423 VS-VEDSNTEKEASLLKCRKLPSGCIIEDTSNGHSKVTWVEHLDVSASTVQPLFRSLVNT 482

Query: 481 GSMFGARHWIATLQIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSI 540
           G  FGARHW+ATLQ+HCER VFFMATNVP KDS G+TT  GRKSV ++AQRMT   Y++I
Sbjct: 483 GLAFGARHWVATLQLHCERLVFFMATNVPTKDSLGVTTLAGRKSVLKMAQRMTQSFYRAI 542

Query: 541 GASSSHTWAKVQSKIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDE 600
            ASS H W K+ +K G+++RV+SRKNL+DP EP G+I+CA++S+WLPVSP +LF+F  DE
Sbjct: 543 AASSYHQWTKITTKTGQDMRVSSRKNLHDPGEPTGVIVCASSSLWLPVSPALLFDFFRDE 602

Query: 601 SRRIEWDVMLS-SPVETVATFAKGQNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVY 660
           +RR EWD + + + V+++A  +KGQ+RGN+V IQ  KS E ++W+LQDS TN YES VVY
Sbjct: 603 ARRHEWDALSNGAHVQSIANLSKGQDRGNSVAIQTVKSREKSIWVLQDSSTNSYESVVVY 662

Query: 661 AQVDITSMRTVMAGSDPSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIAT 708
           A VDI + + V+AG DPS+I +LP GFSI+PDG   RP +I+   ++  S+GGSLLT+A 
Sbjct: 663 APVDINTTQLVLAGHDPSNIQILPSGFSIIPDGVESRPLVITSTQDDRNSQGGSLLTLAL 722

BLAST of Sed0004043 vs. TAIR 10
Match: AT1G79840.2 (HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START domain)

HSP 1 Score: 674.1 bits (1738), Expect = 1.2e-193
Identity = 388/759 (51.12%), Positives = 516/759 (67.98%), Query Frame = 0

Query: 1   MAVAMSGNHQPRRVIMNVPSSPALSLTLAGGFGNAAAA------------------RRDD 60
           MAV MS + QP +      SSPALSL+LAG F NA++                   R  +
Sbjct: 32  MAVDMS-SKQPTKDFF---SSPALSLSLAGIFRNASSGSTNPEEDFLGRRVVDDEDRTVE 91

Query: 61  SCSDNSDPAGSRSAEDL-CIDLDDEDEDKLQG-------NRKKRK--NRHSSEQIREMEK 120
             S+NS P  SRS EDL   D DDE+E++  G       N++KRK  +RH+++QIR ME 
Sbjct: 92  MSSENSGPTRSRSEEDLEGEDHDDEEEEEEDGAAGNKGTNKRKRKKYHRHTTDQIRHMEA 151

Query: 121 VFKESPHPDEKQRQLLSEKLGLHSKQIKFWFQNRRTQIKA-------SLERQEMEKLRKE 180
           +FKE+PHPDEKQRQ LS++LGL  +Q+KFWFQNRRTQIKA       SL + E+EKLR+E
Sbjct: 152 LFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIKAIQERHENSLLKAELEKLREE 211

Query: 181 NQSMKEMLDK--SSCPKCCSSATSIFSSEQQQSQLVTEIVRLKAELERLRAALEEYG-PA 240
           N++M+E   K  SSCP C                L  E  +LKAEL++LRAAL     P 
Sbjct: 212 NKAMRESFSKANSSCPNCGGG----------PDDLHLENSKLKAELDKLRAALGRTPYPL 271

Query: 241 RASTSRSNEE---------GIFGAEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRE 300
           +AS S   E          G+F  EK+R+ E++NRA  E+ KMA SGEP+W+RS + GRE
Sbjct: 272 QASCSDDQEHRLGSLDFYTGVFALEKSRIAEISNRATLELQKMATSGEPMWLRSVETGRE 331

Query: 301 LLNYDEYVKEF--GAGDERPGRK-IEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIIS 360
           +LNYDEY+KEF        PGRK IEASRD+ ++F+D  +L QSFMDV QWKE F  +IS
Sbjct: 332 ILNYDEYLKEFPQAQASSFPGRKTIEASRDAGIVFMDAHKLAQSFMDVGQWKETFACLIS 391

Query: 361 KASTVEVVCNGDG-ATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVS 420
           KA+TV+V+  G+G +  +GA+QLM+ E+Q+LTP +P REV+F+RSC+Q+SP+KW I DVS
Sbjct: 392 KAATVDVIRQGEGPSRIDGAIQLMFGEMQLLTPVVPTREVYFVRSCRQLSPEKWAIVDVS 451

Query: 421 VSKLGDGIDSSSSSSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDN 480
           VS + D      +S   C K PSGCII DTSNG  KVTW+EH       ++ ++R++++ 
Sbjct: 452 VS-VEDSNTEKEASLLKCRKLPSGCIIEDTSNGHSKVTWVEHLDVSASTVQPLFRSLVNT 511

Query: 481 GSMFGARHWIATLQIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSI 540
           G  FGARHW+ATLQ+HCER VFFMATNVP KDS G+TT  GRKSV ++AQRMT   Y++I
Sbjct: 512 GLAFGARHWVATLQLHCERLVFFMATNVPTKDSLGVTTLAGRKSVLKMAQRMTQSFYRAI 571

Query: 541 GASSSHTWAKVQSKIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDE 600
            ASS H W K+ +K G+++RV+SRKNL+DP EP G+I+CA++S+WLPVSP +LF+F  DE
Sbjct: 572 AASSYHQWTKITTKTGQDMRVSSRKNLHDPGEPTGVIVCASSSLWLPVSPALLFDFFRDE 631

Query: 601 SRRIEWDVMLS-SPVETVATFAKGQNRGNAVTIQATKSDETNMWILQDSLTNEYESTVVY 660
           +RR EWD + + + V+++A  +KGQ+RGN+V IQ  KS E ++W+LQDS TN YES VVY
Sbjct: 632 ARRHEWDALSNGAHVQSIANLSKGQDRGNSVAIQTVKSREKSIWVLQDSSTNSYESVVVY 691

Query: 661 AQVDITSMRTVMAGSDPSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTIAT 708
           A VDI + + V+AG DPS+I +LP GFSI+PDG   RP +I+   ++  S+GGSLLT+A 
Sbjct: 692 APVDINTTQLVLAGHDPSNIQILPSGFSIIPDGVESRPLVITSTQDDRNSQGGSLLTLAL 751

BLAST of Sed0004043 vs. TAIR 10
Match: AT4G00730.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 439.9 bits (1130), Expect = 3.9e-123
Identity = 273/700 (39.00%), Positives = 414/700 (59.14%), Query Frame = 0

Query: 53  SRSAEDLCIDLDDEDEDKL-QGNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEK 112
           SRS  D    +  ED+D   +  RKKR +RH+ +QI+E+E +FKE PHPDEKQR  LS++
Sbjct: 111 SRSGSDNVEGISGEDQDAADKPPRKKRYHRHTPQQIQELESMFKECPHPDEKQRLELSKR 170

Query: 113 LGLHSKQIKFWFQNRRTQIKASLE-------RQEMEKLRKENQSMKEMLDKSSCPKCCSS 172
           L L ++Q+KFWFQNRRTQ+K  LE       RQE +KLR EN S++E +    C  C   
Sbjct: 171 LCLETRQVKFWFQNRRTQMKTQLERHENALLRQENDKLRAENMSIREAMRNPICTNCGGP 230

Query: 173 ATSIFSSEQQQSQLVTEIVRLKAELERLRAALEEY----------GPARASTSRSNEEGI 232
           A  +     ++  L  E  RLK EL+R+     ++               +   +N  G 
Sbjct: 231 A-MLGDVSLEEHHLRIENARLKDELDRVCNLTGKFLGHHHNHHYNSSLELAVGTNNNGGH 290

Query: 233 FG-------------------------AEKARVMEMANRAVEEVVKMADSGEPLWVRSFD 292
           F                           +K+ ++E+A  A++E+VK+A S EPLWV+S D
Sbjct: 291 FAFPPDFGGGGGCLPPQQQQSTVINGIDQKSVLLELALTAMDELVKLAQSEEPLWVKSLD 350

Query: 293 AGRELLNYDEYVKEFGAGDERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSII 352
             R+ LN DEY++ F +  +  G   EASR S ++ ++   LV++ MD  +W EMFP  +
Sbjct: 351 GERDELNQDEYMRTF-SSTKPTGLATEASRTSGMVIINSLALVETLMDSNRWTEMFPCNV 410

Query: 353 SKASTVEVVCNGDGATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVS 412
           ++A+T +V+  G   T NGA+QLM AELQ+L+P +P R V+F+R CKQ +   W + DVS
Sbjct: 411 ARATTTDVISGGMAGTINGALQLMNAELQVLSPLVPVRNVNFLRFCKQHAEGVWAVVDVS 470

Query: 413 VSKLGDGIDSSSSSSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDN 472
           +    D +  +S  + +  + PSGC++ D SNG  KVTW+EH +  +  +  +YR ++ +
Sbjct: 471 I----DPVRENSGGAPVIRRLPSGCVVQDVSNGYSKVTWVEHAEYDENQIHQLYRPLLRS 530

Query: 473 GSMFGARHWIATLQIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSI 532
           G  FG++ W+ATLQ  CE     ++++V   D+T I T GGRKS+ +LAQRMT      I
Sbjct: 531 GLGFGSQRWLATLQRQCECLAILISSSVTSHDNTSI-TPGGRKSMLKLAQRMTFNFCSGI 590

Query: 533 GASSSHTWAKVQ-SKIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLID 592
            A S H W+K+    +  ++RV +RK+++DP EP G++L AA SVWLP +P+ L++FL +
Sbjct: 591 SAPSVHNWSKLTVGNVDPDVRVMTRKSVDDPGEPPGIVLSAATSVWLPAAPQRLYDFLRN 650

Query: 593 ESRRIEWDVMLS-SPVETVATFAKGQNRG-NAVTIQATKSDETNMWILQDSLTNEYESTV 652
           E  R EWD++ +  P++ +A   KGQ++G + +   A  +++++M ILQ++  +   + V
Sbjct: 651 ERMRCEWDILSNGGPMQEMAHITKGQDQGVSLLRSNAMNANQSSMLILQETCIDASGALV 710

Query: 653 VYAQVDITSMRTVMAGSDPSSITMLPMGFSILPDGHSPRPSIISLNTEENGSEGGSLLTI 707
           VYA VDI +M  VM G D S + +LP GF++LPDG        S + ++    GGSLLT+
Sbjct: 711 VYAPVDIPAMHVVMNGGDSSYVALLPSGFAVLPDGGIDGGG--SGDGDQRPVGGGSLLTV 770

BLAST of Sed0004043 vs. TAIR 10
Match: AT4G04890.1 (protodermal factor 2 )

HSP 1 Score: 436.8 bits (1122), Expect = 3.3e-122
Identity = 270/683 (39.53%), Positives = 395/683 (57.83%), Query Frame = 0

Query: 74  NRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGLHSKQIKFWFQNRRTQIKAS 133
           N+KKR +RH+  QI+E+E  FKE PHPD+KQR+ LS  L L   Q+KFWFQN+RTQ+KA 
Sbjct: 61  NKKKRYHRHTQRQIQELESFFKECPHPDDKQRKELSRDLNLEPLQVKFWFQNKRTQMKAQ 120

Query: 134 LERQE-------MEKLRKENQSMKEMLDKSSCPKCCSSATSIFSSEQQQSQLVTEIVRLK 193
            ER E        +KLR EN   KE L  ++CP C   A +I      +  L  E  RL+
Sbjct: 121 SERHENQILKSDNDKLRAENNRYKEALSNATCPNCGGPA-AIGEMSFDEQHLRIENARLR 180

Query: 194 AELERLRAALEEY---------------GPARAST----SRSNEEGIFG----------- 253
            E++R+ A   +Y                P+R+      +  N+ G  G           
Sbjct: 181 EEIDRISAIAAKYVGKPLGSSFAPLAIHAPSRSLDLEVGNFGNQTGFVGEMYGTGDILRS 240

Query: 254 ------AEKARVMEMANRAVEEVVKMADSGEPLWVRSFDAGRELLNYDEYVKEF--GAGD 313
                  +K  ++E+A  A+EE+V+MA +G+PLW+ S D   E+LN +EY + F  G G 
Sbjct: 241 VSIPSETDKPIIVELAVAAMEELVRMAQTGDPLWL-STDNSVEILNEEEYFRTFPRGIGP 300

Query: 314 ERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPSIISKASTVEVVCNGDGATCNG 373
           +  G + EASR S V+ ++   LV+  MDV QW  +F  I+S+A T+EV+  G     NG
Sbjct: 301 KPLGLRSEASRQSAVVIMNHINLVEILMDVNQWSCVFSGIVSRALTLEVLSTGVAGNYNG 360

Query: 374 AVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIADVSVSKLGDGIDSSSSSSSLCA 433
           A+Q+M AE Q+ +P +P RE +F+R CKQ S   W + DVS+  L       S+      
Sbjct: 361 ALQVMTAEFQVPSPLVPTRENYFVRYCKQHSDGSWAVVDVSLDSL-----RPSTPILRTR 420

Query: 434 KRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVIDNGSMFGARHWIATLQIHCER 493
           +RPSGC+I +  NG  KVTWIEH +     +  +Y+ ++ +G  FGA+ W+ATL+  CER
Sbjct: 421 RRPSGCLIQELPNGYSKVTWIEHMEVDDRSVHNMYKPLVQSGLAFGAKRWVATLERQCER 480

Query: 494 QVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQSIGASSSHTWAKVQSKIGENI 553
               MA+N+P  D + IT+  GRKS+ +LA+RM       +GAS++H W  + +   +++
Sbjct: 481 LASSMASNIP-GDLSVITSPEGRKSMLKLAERMVMSFCSGVGASTAHAWTTMSTTGSDDV 540

Query: 554 RVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLIDESRRIEWDVMLS-SPVETVA 613
           RV +RK+++DP  P G++L AA S W+PV+PK +F+FL DE+ R EWD++ +   V+ +A
Sbjct: 541 RVMTRKSMDDPGRPPGIVLSAATSFWIPVAPKRVFDFLRDENSRKEWDILSNGGMVQEMA 600

Query: 614 TFAKGQNRGNAVT---IQATKSDETNMWILQDSLTNEYESTVVYAQVDITSMRTVMAGSD 673
             A G   GN V+   + +  S ++NM ILQ+S T+   S V+YA VDI +M  V++G D
Sbjct: 601 HIANGHEPGNCVSLLRVNSGNSSQSNMLILQESCTDASGSYVIYAPVDIVAMNVVLSGGD 660

Query: 674 PSSITMLPMGFSILPDGH------SPRPSIISLNTEENGSEGGSLLTIATQISVSSSPTA 702
           P  + +LP GF+ILPDG       +    ++S  T  +GS GGSLLT+A QI V S PTA
Sbjct: 661 PDYVALLPSGFAILPDGSVGGGDGNQHQEMVS--TTSSGSCGGSLLTVAFQILVDSVPTA 720

BLAST of Sed0004043 vs. TAIR 10
Match: AT4G21750.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 433.3 bits (1113), Expect = 3.7e-121
Identity = 279/718 (38.86%), Positives = 405/718 (56.41%), Query Frame = 0

Query: 63  LDDEDEDKLQ-GNRKKRKNRHSSEQIREMEKVFKESPHPDEKQRQLLSEKLGLHSKQIKF 122
           L++E +D  Q  N+KKR +RH+  QI+E+E  FKE PHPD+KQR+ LS +L L   Q+KF
Sbjct: 49  LEEELQDPNQRPNKKKRYHRHTQRQIQELESFFKECPHPDDKQRKELSRELSLEPLQVKF 108

Query: 123 WFQNRRTQIKASLER-------QEMEKLRKENQSMKEMLDKSSCPKCCSSATSIFSSEQQ 182
           WFQN+RTQ+KA  ER        E +KLR EN   K+ L  ++CP C   A +I      
Sbjct: 109 WFQNKRTQMKAQHERHENQILKSENDKLRAENNRYKDALSNATCPNCGGPA-AIGEMSFD 168

Query: 183 QSQLVTEIVRLKAELERLRAALEEY--GPARA---------------STSRSNEEGIFG- 242
           +  L  E  RL+ E++R+ A   +Y   P  A               S S   E G FG 
Sbjct: 169 EQHLRIENARLREEIDRISAIAAKYVGKPLMANSSSFPQLSSSHHIPSRSLDLEVGNFGN 228

Query: 243 ---------------------------AEKARVMEMANRAVEEVVKMADSGEPLWVRSFD 302
                                      A+K  ++E+A  A+EE+V+MA +G+PLWV S D
Sbjct: 229 NNNSHTGFVGEMFGSSDILRSVSIPSEADKPMIVELAVAAMEELVRMAQTGDPLWVSS-D 288

Query: 303 AGRELLNYDEYVKEF--GAGDERPGRKIEASRDSCVLFVDLQQLVQSFMDVVQWKEMFPS 362
              E+LN +EY + F  G G +  G + EASR+S V+ ++   L++  MDV QW  +F  
Sbjct: 289 NSVEILNEEEYFRTFPRGIGPKPIGLRSEASRESTVVIMNHINLIEILMDVNQWSSVFCG 348

Query: 363 IISKASTVEVVCNGDGATCNGAVQLMYAELQMLTPSIPPREVHFIRSCKQISPQKWVIAD 422
           I+S+A T+EV+  G     NGA+Q+M AE Q+ +P +P RE +F+R CKQ S   W + D
Sbjct: 349 IVSRALTLEVLSTGVAGNYNGALQVMTAEFQVPSPLVPTRENYFVRYCKQHSDGIWAVVD 408

Query: 423 VSVSKLGDGIDSSSSSSSLCAKRPSGCIIHDTSNGRCKVTWIEHWKCPKIGLRTIYRTVI 482
           VS+  L        S  +   +RPSGC+I +  NG  KVTW+EH +     +  +Y+ ++
Sbjct: 409 VSLDSL------RPSPITRSRRRPSGCLIQELQNGYSKVTWVEHIEVDDRSVHNMYKPLV 468

Query: 483 DNGSMFGARHWIATLQIHCERQVFFMATNVPMKDSTGITTTGGRKSVFQLAQRMTSGIYQ 542
           + G  FGA+ W+ATL   CER    MA+N+P  D + IT+  GRKS+ +LA+RM      
Sbjct: 469 NTGLAFGAKRWVATLDRQCERLASSMASNIPACDLSVITSPEGRKSMLKLAERMVMSFCT 528

Query: 543 SIGASSSHTWAKVQSKIGENIRVASRKNLNDPREPLGLILCAAASVWLPVSPKVLFEFLI 602
            +GAS++H W  + +   +++RV +RK+++DP  P G++L AA S W+PV+PK +F+FL 
Sbjct: 529 GVGASTAHAWTTLSTTGSDDVRVMTRKSMDDPGRPPGIVLSAATSFWIPVAPKRVFDFLR 588

Query: 603 DESRRIEWDVMLSSP-VETVATFAKGQNRGNAVT---IQATKSDETNMWILQDSLTNEYE 662
           DE+ R EWD++ +   V+ +A  A G++ GN+V+   + +  S ++NM ILQ+S T+   
Sbjct: 589 DENSRSEWDILSNGGLVQEMAHIANGRDPGNSVSLLRVNSGNSGQSNMLILQESCTDASG 648

Query: 663 STVVYAQVDITSMRTVMAGSDPSSITMLPMGFSILPDGHSPRPSIISLNTE--------- 704
           S V+YA VDI +M  V++G DP  + +LP GF+ILPDG S R    S N           
Sbjct: 649 SYVIYAPVDIIAMNVVLSGGDPDYVALLPSGFAILPDG-SARGGGGSANASAGAGVEGGG 708

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_023000325.1	1.5e-297	74.32	homeobox-leucine zipper protein GLABRA 2-like isoform X1 [Cucurbita maxima]
KAG7025809.1	3.3e-297	73.91	Homeobox-leucine zipper protein GLABRA 2 [Cucurbita argyrosperma subsp. argyrosp...
XP_022963997.1	9.5e-297	73.78	homeobox-leucine zipper protein GLABRA 2-like [Cucurbita moschata]
XP_023000326.1	4.1e-292	73.23	homeobox-leucine zipper protein GLABRA 2-like isoform X2 [Cucurbita maxima]
XP_023513617.1	1.3e-290	72.53	homeobox-leucine zipper protein GLABRA 2-like [Cucurbita pepo subsp. pepo]

Match Name	E-value	Identity	Description
P46607	1.7e-192	51.12	Homeobox-leucine zipper protein GLABRA 2 OS=Arabidopsis thaliana OX=3702 GN=GL2 ...
Q5JMF3	1.0e-144	40.79	Homeobox-leucine zipper protein ROC9 OS=Oryza sativa subsp. japonica OX=39947 GN...
A2YR02	3.8e-123	40.74	Homeobox-leucine zipper protein ROC7 OS=Oryza sativa subsp. indica OX=39946 GN=R...
A3BPF2	3.8e-123	40.94	Homeobox-leucine zipper protein ROC7 OS=Oryza sativa subsp. japonica OX=39947 GN...
Q0WV12	5.5e-122	39.00	Homeobox-leucine zipper protein ANTHOCYANINLESS 2 OS=Arabidopsis thaliana OX=370...

Match Name	E-value	Identity	Description
A0A6J1KI04	7.1e-298	74.32	homeobox-leucine zipper protein GLABRA 2-like isoform X1 OS=Cucurbita maxima OX=...
A0A6J1HGN1	4.6e-297	73.78	homeobox-leucine zipper protein GLABRA 2-like OS=Cucurbita moschata OX=3662 GN=L...
A0A6J1KFK3	2.0e-292	73.23	homeobox-leucine zipper protein GLABRA 2-like isoform X2 OS=Cucurbita maxima OX=...
A0A6J1DQD4	3.5e-289	72.29	homeobox-leucine zipper protein GLABRA 2 OS=Momordica charantia OX=3673 GN=LOC11...
A0A0A0K6N0	1.1e-282	70.83	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447000 PE=3 SV=1

Match Name	E-value	Identity	Description
AT1G79840.1	1.2e-193	51.12	HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START dom...
AT1G79840.2	1.2e-193	51.12	HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START dom...
AT4G00730.1	3.9e-123	39.00	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT4G04890.1	3.3e-122	39.53	protodermal factor 2
AT4G21750.1	3.7e-121	38.86	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 175..195
None	No IPR available	COILS	Coil	Coil	coord: 125..152
None	No IPR available	GENE3D	1.10.10.60		coord: 46..133; e-value: 3.4E-18; score: 67.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 35..86
None	No IPR available	PANTHER	PTHR45654:SF24	HOMEOBOX-LEUCINE ZIPPER PROTEIN GLABRA 2	coord: 23..465
None	No IPR available	CDD	cd08875	START_ArGLABRA2_like	coord: 219..447; e-value: 1.94097E-88; score: 269.911
None	No IPR available	SUPERFAMILY	55961	Bet v1-like	coord: 223..448
IPR002913	START domain	SMART	SM00234	START_1	coord: 224..448; e-value: 3.5E-47; score: 172.7
IPR002913	START domain	PFAM	PF01852	START	coord: 225..448; e-value: 7.1E-44; score: 149.7
IPR002913	START domain	PROSITE	PS50848	START	coord: 215..451; score: 33.694111
IPR001356	Homeobox domain	SMART	SM00389	HOX_1	coord: 75..137; e-value: 6.5E-16; score: 68.9
IPR001356	Homeobox domain	PFAM	PF00046	Homeodomain	coord: 76..131; e-value: 6.2E-17; score: 61.2
IPR001356	Homeobox domain	PROSITE	PS50071	HOMEOBOX_2	coord: 73..133; score: 17.993402
IPR001356	Homeobox domain	CDD	cd00086	homeodomain	coord: 76..131; e-value: 3.17331E-16; score: 71.1204
IPR023393	START-like domain superfamily	GENE3D	3.30.530.20		coord: 224..430; e-value: 1.4E-5; score: 26.6
IPR042160	Homeobox-leucine zipper protein GLABRA2/ANL2/PDF2/ATML1-like	PANTHER	PTHR45654	HOMEOBOX-LEUCINE ZIPPER PROTEIN MERISTEM L1	coord: 23..465
IPR017970	Homeobox, conserved site	PROSITE	PS00027	HOMEOBOX_1	coord: 108..131
IPR009057	Homeobox-like domain superfamily	SUPERFAMILY	46689	Homeodomain-like	coord: 67..131
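
The `coord: start..end` values can be converted to domain lengths with a few lines of Python. The sketch below does this for the two PFAM rows. It assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of annotation but is not stated on the page. The helper names `parse_coord` and `span_length` are hypothetical.

```python
import re

# Alignment strings copied from the PFAM rows of the InterPro table.
PFAM_HITS = {
    "PF00046 Homeodomain": "coord: 76..131",
    "PF01852 START": "coord: 225..448",
}

COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' string (hypothetical helper)."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinate found in: " + text)
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    return end - start + 1


spans = {name: parse_coord(text) for name, text in PFAM_HITS.items()}
for name, (start, end) in spans.items():
    print(name, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, a span `a..b` covers `b - a + 1` residues.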

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sed0004043.1	Sed0004043.1	mRNA
Sed0004043.3	Sed0004043.3	mRNA
Sed0004043.2	Sed0004043.2	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006357	regulation of transcription by RNA polymerase II
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005634	nucleus
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0000981	DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function	GO:0008289	lipid binding