CmaCh14G013050 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh14G013050
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: homeobox-leucine zipper protein GLABRA 2-like
Location: Cma_Chr14: 10283301 .. 10293260 (+)
RNA-Seq Expression: CmaCh14G013050
Synteny: CmaCh14G013050
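
The Location field in the overview gives a sequence ID, a start, an end, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this kind of record, but not stated on this page), the span of the gene can be derived as follows. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(text):
    """Split 'SeqID: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr14: 10283301 .. 10293260 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 9960 bases on the forward strand of Cma_Chr14.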
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGCGCCGACATGTCGGACAATAATAATCGCCTCGCTTTCACCAAAGACTTCTTCTCCTCTCCGGCTCTTTCTCTTACTCTTGTATGTGTAATTCTATTTTCTGTTTTCTATCAAAATTTCCAGCTGCCTCTCTGATGGGTTTGTTTTTAATTTTTGAGGCGGGGATTTTTCGCCGGGGCGAGGCGGGGGAGAAAGGGGATGTTGAGATGGAGGAGGTGGATGACGGGAGAGATGATATGACGACGGCGGTGGAGGTTAGTAGCGAGAATTCGGGACCGGTGAGGTCGAGATCGGATGACGATTACGACGGTGGAGGAGTGCATGAAGAGAATGAAGATGGGTGTCATGGAAAGAGAAGGAAGAAGTATCATCGGCATACCAACGAGCAGATCAGAGAAATGGAGGCGTAATTCATCCATCGCATTGCTCTACTTTTTTTTGTTATGGCCATTGGAATCTTTTGATATGCTTTTTCTTTGGGTTCCATTTTTTAGGCTGTTTAAAGAGTCGCCACATCCGGATGAGAAGCAAAGGCAGCAGCTTGCTAAGCTACTAGGGCTTTCATCAAGGCAGGTCAAGTTTTGGTTCCAAAATCGTCGAACCCAAATCAAAGTAAGTCTAACCATCTTGTTAGGAATCACGAACCCCACAATCATATGGTATTGTTTACTTTGAACATAATCTATCGTGAGCTTACTTTTCATTTTCTCAAAAAACAGTTGTATTTGTTGTTTATAAACTTGTGATCAATCTTTTAAATAGTCAATGTGAGACTTCTCTTCCAACCATCTTCTCCTCTGAACAAAATACATCATAGAGCCTCAGAGCCCTATGGAGCGCTTGAATAGCCTCCTCTTAATCGAGGCTCGACTTCTTCTCTGGAGCCCTCCAACAAAATACACCATTTGTTCAATACTGAGTCAATTTTGACTACACTTTCGAGAGTCCTAATTTCTTTGTTCAACATTTTATTGACATGACTAAGTTAAAGACATAACTCTAATACCATGTTAGAAACGAATTTCCATAATGATATGATATTGTTTATTTTGAACATAAGCTTTCATGTCTTTACTTTTGGTTTCTCCAAAACGTCTCATATCAATGTAAATATATTCCTCTTAATTTGTTGAAGTGAGACTCGTCTCCCAACAATTCTCAACATACTCTTTATTTATGCAGGCAATTCAAGAGCGTCATGAGAACACATTATTTTCGAAATGGAGAAACTTCGAGATGAAAATAAAGGAATGAGAGAAATTTTCAAGCGAAAACTCGGTTGTCCAAACTGTGGCACTACCGACGTCGCAGCAACAGCAACAACCACAGAGCAATTGCGTATCAAGAATGCCAAGCTCAAAGTCGAGGTAACATTTAAATAGATCGAGCTTAAACCCTTGCTCTGATTTAAAGTGTTCTAATTCACAGATTGAGAAACTACGAGCGGCGCTGGGAAAACACCGACATGCAGCGGTGTCTCTGTCATACTCATCGGAGAACGAGCAAGAAACGAACCAAAGCTGCTTAGATTATTACACAGGAATTTTCCGGCTTGAAAAGTCAAGGATCATGGAGAAGGTTCATCAAGCTCTAGAAGAGCTCAAAACAATGGCTGCAGCCGGCGACAGTCTTTGGGTCCTCAGCGTCGAGACCGGAAGAGAGATATTAAACTACGACGAGTACTTAAAAACCTTCCATCTCAACAATGATTCTAACCGTAGTTGGCTCAAAACCCACATCGAGGCCTCTCGTGAGACGGCCCTTGTCTTCATGGAGCCCTCAAGGTTGGTTCAAAGCTTCATGGATGAGGTAAATAAACAAATTTCGTGTTCAACACAACTTTTTATGGAAATTACTAAGTAGCACAAACACTCTGTGCTAGAATGAAAACCTAGATATGATAGTGGAGCGTGAGGGATATTTATAATAAACCTTAGACAATAAATATTAAAATGAATACAGGCTCGTATCATTATTAGACTCGACTCTATTGAACT
AATTATGATATTTTGACATTTCGAACATGGACGAAGTAAATAAACAAATTTTGTGTTGAACACGACTCTTTGTGAAAAACAATCACATTCTTTATTAACACCAATCGAGAAGAGATACAAGACATTCTAAACACTTTATACCAGTCATTTATACTTCTTGAGATAAAAATTTAAGTACAGACTAACCATCTATTCCTAAAATAACTAATTCATCTTCCTAAAATGAATATAAACTAGTTGCTTGATCTTATTGACTCAGTAATGATGTTTTAACATTTCGAACATAGACTTTTGTCGTGTTTTTGTTTCCTACGATGTCTTTCTTCCATTCTCATTTATGGTTTGTTGGTAGAATCAATGGAAGGAGATGTTTCCTTTCATGATCTCGAAGGCAGCTACGGTTGATGTTATTTGTAATGGAGGGATTGCCAATTGGGACGGCGCAGTGCAATTGGTGAGCCTTAATTTTGAGTACGAGGCCTATAAATGAGAGCTAGATGAGTTAAAAATGGCTCATTCATTGTGTACTACAGATGTTTGCAGAGGTACAAATGCTTACACCATTAGTGTCCACAAGAGAGATGTATTTCATCCGGCATTGCAAGCAGCTCGACGCTGAACGATGGGCGATTGTCGATGTTTCAATCGAAAATGTCGAAGATAATAATATCGACGTATCATTGGTGAAATATAGAAAACGTCCTTCTGGCTGCATCATTAAGGATGAATTTAATGGTCATTGTAAGGTATGCTTTTACTTCGAATCGAAAGATTCAAATCTCCAATCGATTCATAAAAGTGGAAGGTTTTTCTGGTAACAATGGTGGAGCATTTGGAATGTCAAAAGAACAAAGTTCATAACTTGTTCAGAAACTTAATCAACAATGGCGGTGGCTTTGGGGCAAAACGTTGGATGGCAACTCTCCAACTCCAATGCGAACGCCTTGCTTTCTTCATGGCAACCAACATCCCCATGAAAGACTCAACTGGTTAGCCCTCATCTTCCTTCTAACCCAACTTCATAACTAGAACTAACAATAAAGAATATACCATACAAATTTTTTTAATCTACATTCAACGATTATTAAGAAACTAATACGTTATAACAGTCTCACCAAATAATTTTTTCCAGTAAAAGGTGCCATTATTGCTTAGTTACATTAATTTTTTAAATTCAAAAAATTGAAAATTAATTAAATAAAAACTTTACGTTTATTTTCTAACCATCCTAGTATATGAATTTTGAGAGTAATGTTTTTGAACCATTTAATAGAAAAATATGTAATAAATTCTTGGAAACTTAGATTTTATGTATTGTATAGTTATATAGAGAAAAAAAATAAAAATTGTATAAAATAATAAACCAATTTTCTTGAAATTTTTTGAAATCAGTAGGATTAGAATTCTTTGATTCTAATAAATGTAATAACATTTCATAAAAAATAAGAATATGACGATTAAAAAAAAATATTTATAAATTAGTCTTATGTCTTACAATATATATATACACATATGCATGCATACTACATGTGTATATATATATATATATATATATATAGATAGATAGATATGTATGTGTAGCTCCACGTAGGCAGTTTATTGAAACTGGAAGTTTGAAACGTTACGTATTCAGGAGTTGCAACACTAGCGGGTAGAAAAAGCACATTAAAGTTGGCACAGAGAATGAGTTCAAGCTTCTCCCAAGCAATAGCAGCTTCAAGTTATCAGACATGGACCAAGGTTGTGGGCAAAACAGGGGAAGACATTAGGGTTTGTTCCAGGAAGAATCCTAGTGACCTTGGTGAACCCATTGGAGCTATTTTGTGTTCAGTTTCTTCTCTATGGCTGCCTGTTTCTCCTCACCTTCTCTTTCATTTCCTGAGAGACCAAGCTTGTCGACATCAGGTAAAACATTCGAGTATTCATACGTGAACCTATCAGTTCTCCTACCACCCACCAACTTAAGATTGGTTGGAAGAATAATTTCATTAGTTCATAAC
TCTCATTTATAACAATGATTTACTGCTCTTATTTATTAGTTCATAATCTATGTAACTCTTATTTATTTAGTTTATAACCGTTCATGATTCTCATTTATTTTGCTCATAACTCTGTGTTTAATTCATCTCTCATTTATTTCATTTATCTTTCCATTTCGATTCATCTATGATATTAATGCTTCAAGGGTTATCTAGGATGATTGAGTTTGATTAAGTTTGAGTATCATTTTGTATTTGTTAGGTGATATTTTGAGTAAATTTATCAACAGACTTTGAAAAGTGAGATATACACTTTATAAACACTTTGGTTATTGAGAGTGATGGCTACTTGTGGATTAAGGAAATTTATTCTCTCCAACCGGTCTATATTTTTAAATTTGGGTTTGTGGGTGAGTAAATGTGATTCACTTGCTTTCTCTTAGGTACAACAAAAGGTTTTTTATTTTATTTTATTTTATTTTATTTTAGTGGGACGTTATGTTTGGTGGAGATGAAGCTAAGTCGATTGCAAATTTAGCTAAAGGACAAGATCGAGGCAACTCAGTTACCATTCAAGTAACTAATGATTGGTTCATGAAATTTTAGATTTTTTTCATTTCAGAATGGCTAATGGAAAGGCTAAAAGTTGCAGACAATTGGATCAAAAGAGAGCAGCAGCAGCATGTGGATCCTCCAAGACAGCTCCACAAACTCGTCGGAATCCATGGTGGTTTACTCCGGAGTAGACGTTACCGGCATGCAGTCGGTGATGACAGGCTGCGATTCCAGCAGCCTCACCATTCTCCCTTCTGGCTTTTCAATTCTCCCCGACGGCGCTGTGTCCAGGCCGCCCCTCCTCATCACCCGACAGAAAGACGACAAGACCGCCGACACCAATGGCGGCGTTCTGCTGACTGCCGCCGTTCAAATCCTCACCGACGCCTCTCCCTCTGCAAAACACACCATGGAATCTGTTGAGTACGTTAAAAACATCATTAGTTGTACGCTAAAAAATATCAAAACTAGCATGAGCTGTGAGGAAGATTGATACATAACCTATAATTGATCCATATAGCCTATATTATCCTTTTTTCCATGTGGGTTTTGCTTTAATTTGTTTGAGTGGAGTTAGTTTTTATGTTACAAATTTGATTTTAACGAATAATCATCAACAGTGCAAAACAAAATCATGAGAATTTATGTCGAAAATGGACAAGATCATATTATTGTTTAGACATGGAGAGTTTATTGTTCTTAATGAATGATATTTGATTTGTTGGATTTGACTCAGGATCGAAATATGTTCAAACAAGTAAAATAAAAAAAGGGTAAACTTGAGGGGATTTTCATTTTTGTGTTGTTGTAATTGTGAAAAGAGTATACATCTCATGAGTATTTACTATAGAGAAAGCTCTTACTCGCTTCCTAAATGTTCTTACTAATTAAAAGCACCAAACTGATACTACTAGTTCCACAATTTAATAACTACTATCAATTCCACAATTTAACAACTACTATCAATTTTAGTATCAATTTAAGTGCGTGATCCATAAGTTAGTAATTTATTCTATAAATCAAGTTTATTAGTTCACAAAACTTACCAACAACGAGTTAGTCAATGATTCATGTTAGTATTTAAAATTGCCATTGAAGTTTTACCGGAGAAGCATCATAATTTACTTGGATACACACAAATCTCCTCTAACAATGACAACGAAAGCCCAAGGCAAGGTCCAAAATCTTCCACCATATCAACAAGGATAGGCTTATCCACCTCCTTCCAACGTGGCACTTCCTAAGTGACACTCCAAATTGTCTTCCAAAATCCCATTTCGCCACGTGTCAAAAGAAAATCCAATCAAAGGATCAAAACCCCATTTGAAGCCTCACCTCCTCCCTCAGCCTGCAACTCCAAGTTTGTTCCTTTTCAATACAAAACGCAGACAACAATGGCTTCCTTAGCAACCTTAGCCGCCGTTCAGCCGGTCACCGTAAAGGGCCTTGGTGGAAGCTCCCTTGCC
GGAACTAAGCTCCCTCTCAGGCCCTCTCGCCAGAGCTTCAGACCAAAAAGCTTCAAGTACCACAAATTTCTCTAATTTCTTCTTTATTTTGGTGCATATTGATTTGAAACTTCATGAACTTGTGGGGTTTTTCCCCTTTTTGTTACTCTGTTTTCAGGGCTGGTGCTGTGGTGGCTAAGTACGGTGACAAAAGTGTTTACTTCGATTTGGAGGATTTGGGCAACACTACTGGACAGTGGGATTTGTATGGATCTGATGCTCCTTCACCATACAATTCTCTTCAGGTTTGTAGCTCATTCTCTAGCTTTAATAAAATTAGAATGATAATGAGATCTCGTTTGAGAGGGGAACGAAACATTTCTTGTAAGAGTGTGAAAACCTCTCTCTAATAGACGCATTTTAAAATCGTGAGGTTGTTGACGATATGTAACGGGCAAAAGTAGACAATATCAGTTAGCATTGGACTTGAGCTGTTACAAACAATATCAAAGTTAGGCACCGAGTGATGCGCCAACGAGGATGCTAGGCCCCCAAGAGGGGTGGATTATGAGATCTCACATTGGATGGAGAGAGTAACAAAGCATTCCTTATAAGGGTGTGGAAACCTCTTCCTAATCGGCGTGTTTTAAAACCGTGAGGCTGATGACAATATGTAACGAGCCAAAGCGGACAATATTCGCTAGTGGTGAACTTGGGCTATTGCAAATGGTATTAGAGGACACTAGGCCCTCAAGTGGGGTGGATTATGAGATCTCACATTGATTTTAAAGAGGGAGACGGAACATTTTTTATAAGGGGTGCCTGTGTAAGTCTATTTTTAACCATGTAACGGGGCAAGCATTCCGTCTTTTTGGTGTATTTATAGAGCAAATTCTTTGAGACGTTTGCCGCTCCATTCACCAAGAGAGGATTGTTGCTCAAGTTCTTGCTTCTAGGCGGTGGAGCCACTTTAGCTTATTACAGTGCCACTGCCCCAGATGATGTTCTTCCCATCAAGAAAGGACCTCAACTTCCACCAAAGCTTGGGCCTCGTGGCAAGATCTAATTCGCTCTCAAATCCTTTTGCAGTATGTAAATTTTCTCTCTTATCCGCCCAGTTATGTTTCAGTTGAGAAATTGTTATTATGTAATGAATGTACTTTTACAAAGTGTTCTTGCCAGTTTCTTTGAAAAACTGATGAAGATATGGGTTAAAAAGACTTATTCTTTCTAACAAAATTGACTAAAACTTAGCCCTGTAGAAGAACCATATTAAAAAAAAGACTCCAAAACCAAATTACTTGTGCTCTTCCTTCATATGTACAACAGGAATGAATTGAATGGACTTGGTAATTACAGCTCTAAAGAAAACTCAGACAAAAGTAGGTTGAGCTATTAACTGTGTTGTGTTCTTCTGTTATGTGTCGTGTCAATGTCGTGCCGTAAGGGCTTTTCGTCCCCCCTATGTTCTCGAGGTAAGAAACCAACTTGTTGTTGGTCACTATGTTTGAAGCTTTTGCACAGGCCTTGGTTCAGTTTCTGTGAACATATTTTCTGCATTCTCTGTTGGTGATTCCTTCTTTACCTCATTATCAGCAATATCCGGACCCCAATTCATGAAATCCAATAGTGTCATTGAAGTAGGTGTCTCTGCAGCACTTTGTGGCTCTTCTCCTGAGCTCACTGCGGTTTCCCGTGTCGAGTTCTTCTTACTCGATACTTGTGATGGATTTTGAGCGATGTAATCCGCATAGAGTACGTCCTCAATCCTTGACATCACTGTGAAGGCCAAGCTTTCAAGAATTCTTGAGTAACTCTCTAGAACAGCCTGTCCAACATCCTATAAAACCAAACCACTCTCCCATTAGACCAGATTTAAGAAGTCATATAATGTCTTAAACACAAGTACTTTCCAAAAAACATGTTTTGCACCCGGTTGTATTGGATTTTGCTGATGTCTAATGCTGATTGAGGAGTGCCTGGGAAACGATGTTTGAGGATGAGTAGAATCGTTTCCGCTC
GCTCCTCGAAAAGCTCTCGTTTCTCCAAGCTTACAGCTGAACCCCAAGCAGATTTTCCATCTTTCTGATTCATTTTCCTCTTCCATATCACAATGGAAGCTTCAATTCTATCCTTGATATCTAAGATCTTGTGCTCTGAAGATAAGTCCATGGTAGACAGAAATTGATCCGGGTCGAAGTACTCGACCGTTATGTTTCTGTACACGGAGTCACCAAGGCTTTCCCTCCCATTCTGCAAGTTTACATATTCAATCACAAAATCAATGAAAGATAGAGAATTTATGACATTGTCCTCTTTGGACTTTCTCATACGGGTTTCCCCTCAAGATTTTAAAACGTGTCTGATAGGGAGAGGTTTTCACAACCTTATAAGGAATGTTTCGTTCTCCTCTCCAACCGACGTGGAGTCAAAATTTACCTTAGGCAATAACTCTATGTAGTTTTCTGGGATCTCCATTTCTGATAGAACTTGTGCATTGATAGCCATGGCTGCTTTAAGGACTTGGTTTACACAATCCTTTTGGTACTGCATAAATTTCCTTGAGTTTTCAGATAAACCATTCTCAGGAACTTTTGGAGTAGGTAGCCACCATTTGTCATCTTTTCTCTTTGAAGCATTGTCTTTATTTGAATCACTTGAATCTCTGGATAAGTAGTAGAACTCGGATTGGTCCTTAAAATTATCCAAACAATCCTATACATCATGTGAAACCATTAGAAAGATTCATTAAATCTTATTAAACAGGACAAAAAGTAGAGCAAGATTAGCAATCTTTTCCTACAATAAGCATCGCATCGAGCTTCCGCAATGCCGGGATGTTCATGTGAAGATCGTTGCGTTGTCGGGTAACCATAATCTGTTAATTCAAAGAACATGAACTGAATTATGATCTATAAATGCATTTTGAAGCTCAGAAAGAGAGAGGGAGAGAAAAAATTGCCTCCATGTTTGTTCCATCCTTGGATTTCTGTTGGGAAGGAACAAATTCAACAATGTAATCAGTGACTGATAATAGCAAGTCAATCTCTTTTCTCCACCGTGTTTTTCTCCCTACCGACATCGGCTCGAGACGCCATTGTTCCCCGAAAACAGAAGCTGCAAAAACCATAGGAGAAAGATGAAAACTTGACTCCCTAGATTCAACTACAAAACTTATCTAGATTTGCTTCCTCTAGCAGCATCTAACCGAGCACAATTCAACTGATATCGTCGATATTATCGACACAAAACAGAAGACCGATACACGAACGACATACCTGCAAGATTCGTAATTGCATTCGACAATGCTAAAGCCGACGAAACACCCTTTCCTCCACCGGACATATCTTCGCCCAAAAGCAACTTAGCAAACCTCTCCTTCATCTGTTCTATTTCTGATCATATCCAAGAAGAACACACCAATTTTAAACCAACCTCAATAATGGCCTTTTAGTTCATACAAGACAATAGGATTAGACATCTATGCAATGAAAATGTCACGAACATCAGAATGTCAACCATATAATATAAATAATCATAATGTTACAACTCATTTTGAGAAGTCATAAACTATAGTTTATAATTTCAGAGAACAGAAAATAAAACAGACAAAAAGATGTCTGAACCTGTTGGTGGCTGATCTTTTGGCTGTTTAACTGCAGCATCATCTCTGGAGGGCTTGGATTTAGGGCCTTTGTCCGGTCCATTAGCGGGGGGTTTGTCCAACCCGCCGTCGTCTGGGGGAGCTAATCCCTGGTTTCTGTTGGAAAACAAAACATTATTATTATTAAACTCTCCAAGCTCAAGCTTTTGGAGTGTTTTATGTTTGAATTGAAGAATTTGGACCTGTAATTCTCTTCCTCCTGCTCCATTTCTCGTACCATTTTCAGCTTTTTGATGAATCTTTTGATAGAATAAACGAGAGAGAATAAACCAGCAATCTTGACAGACCTGAAAAAACAAAGTTCATAGAAGAAGAAC

mRNA sequence

ATGGGCGCCGACATGTCGGACAATAATAATCGCCTCGCTTTCACCAAAGACTTCTTCTCCTCTCCGGCTCTTTCTCTTACTCTTGCGGGGATTTTTCGCCGGGGCGAGGCGGGGGAGAAAGGGGATGTTGAGATGGAGGAGGTGGATGACGGGAGAGATGATATGACGACGGCGGTGGAGGTTAGTAGCGAGAATTCGGGACCGGTGAGGTCGAGATCGGATGACGATTACGACGGTGGAGGAGTGCATGAAGAGAATGAAGATGGGTGTCATGGAAAGAGAAGGAAGAAGTATCATCGGCATACCAACGAGCAGATCAGAGAAATGGAGGCGCTGTTTAAAGAGTCGCCACATCCGGATGAGAAGCAAAGGCAGCAGCTTGCTAAGCTACTAGGGCTTTCATCAAGGCAGGTCAAGTTTTGGTTCCAAAATCGTCGAACCCAAATCAAAATTGAGAAACTACGAGCGGCGCTGGGAAAACACCGACATGCAGCGGTGTCTCTGTCATACTCATCGGAGAACGAGCAAGAAACGAACCAAAGCTGCTTAGATTATTACACAGGAATTTTCCGGCTTGAAAAGTCAAGGATCATGGAGAAGGTTCATCAAGCTCTAGAAGAGCTCAAAACAATGGCTGCAGCCGGCGACAGTCTTTGGGTCCTCAGCGTCGAGACCGGAAGAGAGATATTAAACTACGACGAGTACTTAAAAACCTTCCATCTCAACAATGATTCTAACCGTAGTTGGCTCAAAACCCACATCGAGGCCTCTCGTGAGACGGCCCTTGTCTTCATGGAGCCCTCAAGGTTGGTTCAAAGCTTCATGGATGAGAATCAATGGAAGGAGATGTTTCCTTTCATGATCTCGAAGGCAGCTACGGTTGATGTTATTTGTAATGGAGGGATTGCCAATTGGGACGGCGCAGTGCAATTGATGTTTGCAGAGGTACAAATGCTTACACCATTAGTGTCCACAAGAGAGATGTATTTCATCCGGCATTGCAAGCAGCTCGACGCTGAACGATGGGCGATTGTCGATGTTTCAATCGAAAATGTCGAAGATAATAATATCGACGTATCATTGGTGAAATATAGAAAACGTCCTTCTGGCTGCATCATTAAGGATGAATTTAATGGTCATTGTAAGAACAAAGTTCATAACTTGTTCAGAAACTTAATCAACAATGGCGGTGGCTTTGGGGCAAAACGTTGGATGGCAACTCTCCAACTCCAATGCGAACGCCTTGCTTTCTTCATGGCAACCAACATCCCCATGAAAGACTCAACTGGAGTTGCAACACTAGCGGGTAGAAAAAGCACATTAAAGTTGGCACAGAGAATGAGTTCAAGCTTCTCCCAAGCAATAGCAGCTTCAAGTTATCAGACATGGACCAAGGTTGTGGGCAAAACAGGGGAAGACATTAGGGTTTGTTCCAGGAAGAATCCTAGTGACCTTGGTGAACCCATTGGAGCTATTTTGTGTTCAGTTTCTTCTCTATGGCTGCCTGTTTCTCCTCACCTTCTCTTTCATTTCCTGAGAGACCAAGCTTGTCGACATCAGTGGGACGTTATGTTTGGTGGAGATGAAGCTAAGTCGATTGCAAATTTAGCTAAAGGACAAGATCGAGGCAACTCAGTTACCATTCAAACAATTGGATCAAAAGAGAGCAGCAGCAGCATGTGGATCCTCCAAGACAGCTCCACAAACTCGTCGGAATCCATGGTGGTTTACTCCGGAGTAGACGTTACCGGCATGCAGTCGGTGATGACAGGCTGCGATTCCAGCAGCCTCACCATTCTCCCTTCTGGCTTTTCAATTCTCCCCGACGGCGCTGTGTCCAGGCCGCCCCTCCTCATCACCCGACAGAAAGACGACAAGACCGCCGACACCAATGGCGGCGTTCTGCTGACTGCCGCCGTTCAAATCCTCACCGACGCCTCTCCCTCTGCAAAACACACCATGGAATCTGTTGAGATCAAAACCCCATTTGAAGCCTCACC
TCCTCCCTCAGCCTGCAACTCCAAGTTTGTTCCTTTTCAATACAAAACGCAGACAACAATGGCTTCCTTAGCAACCTTAGCCGCCGTTCAGCCGGTCACCGTAAAGGGCCTTGGTGGAAGCTCCCTTGCCGGAACTAAGCTCCCTCTCAGGCCCTCTCGCCAGAGCTTCAGACCAAAAAGCTTCAAGGCTGGTGCTGTGGTGGCTAAGTACGGTGACAAAAGTGTTTACTTCGATTTGGAGGATTTGGGCAACACTACTGGACAGTGGGATTTGTATGGATCTGATGCTCCTTCACCATACAATTCTCTTCAGAGCAAATTCTTTGAGACGTTTGCCGCTCCATTCACCAAGAGAGGATTGTTGCTCAAGTTCTTGCTTCTAGGCGGTGGAGCCACTTTAGCTTATTACAGTGCCACTGCCCCAGATGATGTTCTTCCCATCAAGAAAGGACCTCAACTTCCACCAAAGCTTGGGCCTCGTGGCAAGATCTAATTCGCTCTCAAATCCTTTTGCACATCATCTCTGGAGGGCTTGGATTTAGGGCCTTTGTCCGGTCCATTAGCGGGGGGTTTGTCCAACCCGCCGTCGTCTGGGGGAGCTAATCCCTGGTTTCTGTTGGAAAACAAAACATTATTATTATTAAACTCTCCAAGCTCAAGCTTTTGGAGTGTTTTATGTTTGAATTGAAGAATTTGGACCTGTAATTCTCTTCCTCCTGCTCCATTTCTCGTACCATTTTCAGCTTTTTGATGAATCTTTTGATAGAATAAACGAGAGAGAATAAACCAGCAATCTTGACAGACCTGAAAAAACAAAGTTCATAGAAGAAGAAC

Coding sequence (CDS)

ATGGGCGCCGACATGTCGGACAATAATAATCGCCTCGCTTTCACCAAAGACTTCTTCTCCTCTCCGGCTCTTTCTCTTACTCTTGCGGGGATTTTTCGCCGGGGCGAGGCGGGGGAGAAAGGGGATGTTGAGATGGAGGAGGTGGATGACGGGAGAGATGATATGACGACGGCGGTGGAGGTTAGTAGCGAGAATTCGGGACCGGTGAGGTCGAGATCGGATGACGATTACGACGGTGGAGGAGTGCATGAAGAGAATGAAGATGGGTGTCATGGAAAGAGAAGGAAGAAGTATCATCGGCATACCAACGAGCAGATCAGAGAAATGGAGGCGCTGTTTAAAGAGTCGCCACATCCGGATGAGAAGCAAAGGCAGCAGCTTGCTAAGCTACTAGGGCTTTCATCAAGGCAGGTCAAGTTTTGGTTCCAAAATCGTCGAACCCAAATCAAAATTGAGAAACTACGAGCGGCGCTGGGAAAACACCGACATGCAGCGGTGTCTCTGTCATACTCATCGGAGAACGAGCAAGAAACGAACCAAAGCTGCTTAGATTATTACACAGGAATTTTCCGGCTTGAAAAGTCAAGGATCATGGAGAAGGTTCATCAAGCTCTAGAAGAGCTCAAAACAATGGCTGCAGCCGGCGACAGTCTTTGGGTCCTCAGCGTCGAGACCGGAAGAGAGATATTAAACTACGACGAGTACTTAAAAACCTTCCATCTCAACAATGATTCTAACCGTAGTTGGCTCAAAACCCACATCGAGGCCTCTCGTGAGACGGCCCTTGTCTTCATGGAGCCCTCAAGGTTGGTTCAAAGCTTCATGGATGAGAATCAATGGAAGGAGATGTTTCCTTTCATGATCTCGAAGGCAGCTACGGTTGATGTTATTTGTAATGGAGGGATTGCCAATTGGGACGGCGCAGTGCAATTGATGTTTGCAGAGGTACAAATGCTTACACCATTAGTGTCCACAAGAGAGATGTATTTCATCCGGCATTGCAAGCAGCTCGACGCTGAACGATGGGCGATTGTCGATGTTTCAATCGAAAATGTCGAAGATAATAATATCGACGTATCATTGGTGAAATATAGAAAACGTCCTTCTGGCTGCATCATTAAGGATGAATTTAATGGTCATTGTAAGAACAAAGTTCATAACTTGTTCAGAAACTTAATCAACAATGGCGGTGGCTTTGGGGCAAAACGTTGGATGGCAACTCTCCAACTCCAATGCGAACGCCTTGCTTTCTTCATGGCAACCAACATCCCCATGAAAGACTCAACTGGAGTTGCAACACTAGCGGGTAGAAAAAGCACATTAAAGTTGGCACAGAGAATGAGTTCAAGCTTCTCCCAAGCAATAGCAGCTTCAAGTTATCAGACATGGACCAAGGTTGTGGGCAAAACAGGGGAAGACATTAGGGTTTGTTCCAGGAAGAATCCTAGTGACCTTGGTGAACCCATTGGAGCTATTTTGTGTTCAGTTTCTTCTCTATGGCTGCCTGTTTCTCCTCACCTTCTCTTTCATTTCCTGAGAGACCAAGCTTGTCGACATCAGTGGGACGTTATGTTTGGTGGAGATGAAGCTAAGTCGATTGCAAATTTAGCTAAAGGACAAGATCGAGGCAACTCAGTTACCATTCAAACAATTGGATCAAAAGAGAGCAGCAGCAGCATGTGGATCCTCCAAGACAGCTCCACAAACTCGTCGGAATCCATGGTGGTTTACTCCGGAGTAGACGTTACCGGCATGCAGTCGGTGATGACAGGCTGCGATTCCAGCAGCCTCACCATTCTCCCTTCTGGCTTTTCAATTCTCCCCGACGGCGCTGTGTCCAGGCCGCCCCTCCTCATCACCCGACAGAAAGACGACAAGACCGCCGACACCAATGGCGGCGTTCTGCTGACTGCCGCCGTTCAAATCCTCACCGACGCCTCTCCCTCTGCAAAACACACCATGGAATCTGTTGAGATCAAAACCCCATTTGAAGCCTCACC
TCCTCCCTCAGCCTGCAACTCCAAGTTTGTTCCTTTTCAATACAAAACGCAGACAACAATGGCTTCCTTAGCAACCTTAGCCGCCGTTCAGCCGGTCACCGTAAAGGGCCTTGGTGGAAGCTCCCTTGCCGGAACTAAGCTCCCTCTCAGGCCCTCTCGCCAGAGCTTCAGACCAAAAAGCTTCAAGGCTGGTGCTGTGGTGGCTAAGTACGGTGACAAAAGTGTTTACTTCGATTTGGAGGATTTGGGCAACACTACTGGACAGTGGGATTTGTATGGATCTGATGCTCCTTCACCATACAATTCTCTTCAGAGCAAATTCTTTGAGACGTTTGCCGCTCCATTCACCAAGAGAGGATTGTTGCTCAAGTTCTTGCTTCTAGGCGGTGGAGCCACTTTAGCTTATTACAGTGCCACTGCCCCAGATGATGTTCTTCCCATCAAGAAAGGACCTCAACTTCCACCAAAGCTTGGGCCTCGTGGCAAGATCTAA

Protein sequence

MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVEVSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIKIEKLRAALGKHRHAAVSLSYSSENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHCKNKVHNLFRNLINNGGGFGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTAAVQILTDASPSAKHTMESVEIKTPFEASPPPSACNSKFVPFQYKTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
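
The protein sequence above is what the CDS encodes. As a minimal sketch, assuming the standard nuclear genetic code (the page does not say which translation table was used), the relationship can be checked on the first and last codons of the CDS. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are reported as 'X'.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last four codons of the CDS shown on this page.
print(translate("ATGGGCGCCGACATGTCG"))
print(translate("GGCAAGATCTAA"))
```

The first call yields MGADMS and the second yields GKI, matching the start and end of the protein sequence listed here.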
Homology
BLAST of CmaCh14G013050 vs. ExPASy Swiss-Prot
Match: P46607 (Homeobox-leucine zipper protein GLABRA 2 OS=Arabidopsis thaliana OX=3702 GN=GL2 PE=1 SV=3)

HSP 1 Score: 680.2 bits (1754), Expect = 2.8e-194
Identity = 403/743 (54.24%), Positives = 488/743 (65.68%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGR---DDMTT 60
           M  DMS        TKDFFSSPALSL+LAGIFR   +   G    EE   GR   DD   
Sbjct: 3   MAVDMSSKQP----TKDFFSSPALSLSLAGIFRNASS---GSTNPEEDFLGRRVVDDEDR 62

Query: 61  AVEVSSENSGPVRSRSDDDYDG---GGVHEENEDGCHG------KRRKKYHRHTNEQIRE 120
            VE+SSENSGP RSRS++D +G       EE EDG  G      ++RKKYHRHT +QIR 
Sbjct: 63  TVEMSSENSGPTRSRSEEDLEGEDHDDEEEEEEDGAAGNKGTNKRKRKKYHRHTTDQIRH 122

Query: 121 MEALFKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK------------------ 180
           MEALFKE+PHPDEKQRQQL+K LGL+ RQVKFWFQNRRTQIK                  
Sbjct: 123 MEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIKAIQERHENSLLKAELEKL 182

Query: 181 --------------------------------------IEKLRAALGKHRHAAVSLSYSS 240
                                                 ++KLRAALG+       L  S 
Sbjct: 183 REENKAMRESFSKANSSCPNCGGGPDDLHLENSKLKAELDKLRAALGR---TPYPLQASC 242

Query: 241 ENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNY 300
            ++QE     LD+YTG+F LEKSRI E  ++A  EL+ MA +G+ +W+ SVETGREILNY
Sbjct: 243 SDDQEHRLGSLDFYTGVFALEKSRIAEISNRATLELQKMATSGEPMWLRSVETGREILNY 302

Query: 301 DEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAA 360
           DEYLK F     S+    KT IEASR+  +VFM+  +L QSFMD  QWKE F  +ISKAA
Sbjct: 303 DEYLKEFPQAQASSFPGRKT-IEASRDAGIVFMDAHKLAQSFMDVGQWKETFACLISKAA 362

Query: 361 TVDVICNG-GIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIEN 420
           TVDVI  G G +  DGA+QLMF E+Q+LTP+V TRE+YF+R C+QL  E+WAIVDVS+ +
Sbjct: 363 TVDVIRQGEGPSRIDGAIQLMFGEMQLLTPVVPTREVYFVRSCRQLSPEKWAIVDVSV-S 422

Query: 421 VEDNNI--DVSLVKYRKRPSGCIIKDEFNGHCK-----------NKVHNLFRNLINNGGG 480
           VED+N   + SL+K RK PSGCII+D  NGH K           + V  LFR+L+N G  
Sbjct: 423 VEDSNTEKEASLLKCRKLPSGCIIEDTSNGHSKVTWVEHLDVSASTVQPLFRSLVNTGLA 482

Query: 481 FGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAAS 540
           FGA+ W+ATLQL CERL FFMATN+P KDS GV TLAGRKS LK+AQRM+ SF +AIAAS
Sbjct: 483 FGARHWVATLQLHCERLVFFMATNVPTKDSLGVTTLAGRKSVLKMAQRMTQSFYRAIAAS 542

Query: 541 SYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACR 600
           SY  WTK+  KTG+D+RV SRKN  D GEP G I+C+ SSLWLPVSP LLF F RD+A R
Sbjct: 543 SYHQWTKITTKTGQDMRVSSRKNLHDPGEPTGVIVCASSSLWLPVSPALLFDFFRDEARR 602

Query: 601 HQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYS 660
           H+WD +  G   +SIANL+KGQDRGNSV IQT+ S+E   S+W+LQDSSTNS ES+VVY+
Sbjct: 603 HEWDALSNGAHVQSIANLSKGQDRGNSVAIQTVKSRE--KSIWVLQDSSTNSYESVVVYA 662

Query: 661 GVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTA 662
            VD+   Q V+ G D S++ ILPSGFSI+PDG  SR PL+IT  +DD+  ++ GG LLT 
Sbjct: 663 PVDINTTQLVLAGHDPSNIQILPSGFSIIPDGVESR-PLVITSTQDDR--NSQGGSLLTL 722
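
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes both values. The regular expression and the function name `parse_hsp_stats` are illustrative assumptions, not part of any BLAST tooling. The pattern accepts any word beginning with "Pos" as the second label so that spelling variants in the report do not matter.

```python
import re

# Matches e.g. "Identity = 403/743 (54.24%), Positives = 488/743 (65.68%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return recomputed identity and positives percentages for one HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "identity_pct": round(100.0 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100.0 * int(pos) / int(pos_len), 2),
        "alignment_length": int(ident_len),
    }

stats = parse_hsp_stats(
    "Identity = 403/743 (54.24%), Positives = 488/743 (65.68%), Query Frame = 0"
)
print(stats)
```

For the GLABRA 2 hit (P46607), 403 identical positions over an alignment of 743 columns gives 54.24% identity, and 488 conservative-or-identical positions gives 65.68% positives, in agreement with the reported figures.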

BLAST of CmaCh14G013050 vs. ExPASy Swiss-Prot
Match: Q5JMF3 (Homeobox-leucine zipper protein ROC9 OS=Oryza sativa subsp. japonica OX=39947 GN=ROC9 PE=2 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 6.2e-133
Identity = 311/777 (40.03%), Positives = 415/777 (53.41%), Query Frame = 0

Query: 15  TKDFFSSPALSLTLAGIFRR--GEAGEKGDVEMEEVDDGRDDMTTAVEVSSENSGP---- 74
           TKDFF++PALSLTLAG+F R  G A   GD   E  ++ +     AVE+SSEN+GP    
Sbjct: 10  TKDFFAAPALSLTLAGVFGRKNGPAASGGDGVEEGDEEVQAAGEAAVEISSENAGPGCRQ 69

Query: 75  VRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPDEKQRQQLA 134
            +S      DGG   ++ E     +RRK YHRHT EQIR MEALFKESPHPDE+QRQQ++
Sbjct: 70  SQSGGGSGEDGGHDDDDGEGSNKKRRRKNYHRHTAEQIRIMEALFKESPHPDERQRQQVS 129

Query: 135 KLLGLSSRQVKFWFQNRRTQIK-------------------------------------- 194
           K LGLS+RQVKFWFQNRRTQIK                                      
Sbjct: 130 KQLGLSARQVKFWFQNRRTQIKAVQERHENSLLKSELEKLQDEHRAMRELAKKPSRCLNC 189

Query: 195 ------------------------------------------------------------ 254
                                                                       
Sbjct: 190 GVVATSSDAAAAATAADTREQRLRLEKAKLKAEVCMPPPRSRARPFRCATLQDTDSGELA 249

Query: 255 ------IEKLRAALGKHRHAAV-----SLSYSSENEQETNQSCLDYYTGIFRL--EKSRI 314
                 IE+LR   GK     +     S S  +      +    D+  G  R   +K RI
Sbjct: 250 MLNLFQIERLRGTPGKSAADGIASPPCSASAGAMQTNSRSPPLHDHDGGFLRHDDDKPRI 309

Query: 315 MEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKTFHLNN----DSNRSWLKTH 374
           +E   +AL+EL  M ++G+ +WV  VETGR+ILNYDEY++ F  ++    D    W    
Sbjct: 310 LELATRALDELVGMCSSGEPVWVRGVETGRDILNYDEYVRLFRRDHGGSGDQMAGWT--- 369

Query: 375 IEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVICNGGIANWDGAVQLMF 434
           +EASRE  LV+++   LV +FMD ++WK++FP MISKAAT+++I N      DG +QLM+
Sbjct: 370 VEASRECGLVYLDTMHLVHTFMDVDKWKDLFPTMISKAATLEMISNREDDGRDGVLQLMY 429

Query: 435 AEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNIDVSLVKYRKRPSGCII 494
           AE+Q LTP+V TRE+YF R+CK+L AERWAIVDVS +  E      S V+  K PSGC+I
Sbjct: 430 AELQTLTPMVPTRELYFARYCKKLAAERWAIVDVSFDESETGVHASSAVRCWKNPSGCLI 489

Query: 495 KDEFNGHCKN-----------KVHNLFRNLINNGGGFGAKRWMATLQLQCERLAFFMATN 554
           +++ NG CK             V  L+R +  +G  FGA+RW+A LQLQCER+ F +ATN
Sbjct: 490 EEQNNGRCKMTWVEHTRCRRCTVAPLYRAVTASGVAFGARRWVAALQLQCERMVFAVATN 549

Query: 555 IPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWTKVV-----GKTGEDIRVC 614
           +P +DSTGV+TLAGR+S LKLA RM+SS  +    S    W +       G   +DI + 
Sbjct: 550 VPTRDSTGVSTLAGRRSVLKLAHRMTSSLCRTTGGSCDMAWRRAPKGGSGGGGDDDIWLT 609

Query: 615 SRKNP-SDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVMFGGDEAKSIANL 651
           SR+N   D GEP G I C+ +S WLPV+P  L   LRD++ R +WDVM  G   +S  NL
Sbjct: 610 SRENAGDDPGEPQGLIACAAASTWLPVNPTALLDLLRDESRRPEWDVMLPGKSVQSRVNL 669

BLAST of CmaCh14G013050 vs. ExPASy Swiss-Prot
Match: Q0WV12 (Homeobox-leucine zipper protein ANTHOCYANINLESS 2 OS=Arabidopsis thaliana OX=3702 GN=ANL2 PE=2 SV=1)

HSP 1 Score: 402.1 bits (1032), Expect = 1.5e-110
Identity = 267/741 (36.03%), Positives = 393/741 (53.04%), Query Frame = 0

Query: 15  TKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVEVSSENSGPVRSRSD 74
           TK  ++S  LSL L    R    GE        V  G D    +V   S       SRS 
Sbjct: 56  TKSVYASSGLSLALEQPERGTNRGEASMRNNNNVGGGGDTFDGSVNRRSREE-EHESRSG 115

Query: 75  DDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPDEKQRQQLAKLLGLS 134
            D   G   E+ +      R+K+YHRHT +QI+E+E++FKE PHPDEKQR +L+K L L 
Sbjct: 116 SDNVEGISGEDQDAADKPPRKKRYHRHTPQQIQELESMFKECPHPDEKQRLELSKRLCLE 175

Query: 135 SRQVKFWFQNRRTQIK--IEKLRAALGKHRHAAVSLSYSSENEQETNQSCLD-------- 194
           +RQVKFWFQNRRTQ+K  +E+   AL +  +  +     S  E   N  C +        
Sbjct: 176 TRQVKFWFQNRRTQMKTQLERHENALLRQENDKLRAENMSIREAMRNPICTNCGGPAMLG 235

Query: 195 -------------------------------------YYTGIFRL--------------- 254
                                                +Y     L               
Sbjct: 236 DVSLEEHHLRIENARLKDELDRVCNLTGKFLGHHHNHHYNSSLELAVGTNNNGGHFAFPP 295

Query: 255 -----------------------EKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREI 314
                                  +KS ++E    A++EL  +A + + LWV S++  R+ 
Sbjct: 296 DFGGGGGCLPPQQQQSTVINGIDQKSVLLELALTAMDELVKLAQSEEPLWVKSLDGERDE 355

Query: 315 LNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMIS 374
           LN DEY++TF   + +  + L T  EASR + +V +    LV++ MD N+W EMFP  ++
Sbjct: 356 LNQDEYMRTF---SSTKPTGLAT--EASRTSGMVIINSLALVETLMDSNRWTEMFPCNVA 415

Query: 375 KAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSI 434
           +A T DVI  G     +GA+QLM AE+Q+L+PLV  R + F+R CKQ     WA+VDVSI
Sbjct: 416 RATTTDVISGGMAGTINGALQLMNAELQVLSPLVPVRNVNFLRFCKQHAEGVWAVVDVSI 475

Query: 435 ENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHCK-----------NKVHNLFRNLINNGGG 494
           + V +N+    ++  R+ PSGC+++D  NG+ K           N++H L+R L+ +G G
Sbjct: 476 DPVRENSGGAPVI--RRLPSGCVVQDVSNGYSKVTWVEHAEYDENQIHQLYRPLLRSGLG 535

Query: 495 FGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAAS 554
           FG++RW+ATLQ QCE LA  +++++   D+T + T  GRKS LKLAQRM+ +F   I+A 
Sbjct: 536 FGSQRWLATLQRQCECLAILISSSVTSHDNTSI-TPGGRKSMLKLAQRMTFNFCSGISAP 595

Query: 555 SYQTWTKV-VGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQAC 614
           S   W+K+ VG    D+RV +RK+  D GEP G +L + +S+WLP +P  L+ FLR++  
Sbjct: 596 SVHNWSKLTVGNVDPDVRVMTRKSVDDPGEPPGIVLSAATSVWLPAAPQRLYDFLRNERM 655

Query: 615 RHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVY 659
           R +WD++  G   + +A++ KGQD+G S+ +++     + SSM ILQ++  ++S ++VVY
Sbjct: 656 RCEWDILSNGGPMQEMAHITKGQDQGVSL-LRSNAMNANQSSMLILQETCIDASGALVVY 715

BLAST of CmaCh14G013050 vs. ExPASy Swiss-Prot
Match: Q6ZAR0 (Homeobox-leucine zipper protein ROC1 OS=Oryza sativa subsp. japonica OX=39947 GN=ROC1 PE=2 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 9.1e-108
Identity = 250/682 (36.66%), Positives = 367/682 (53.81%), Query Frame = 0

Query: 76  DYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPDEKQRQQLAKLLGLSS 135
           D  G G+  +++D     R+K+YHRHT  QI+EMEA FKE PHPD+KQR++L++ LGL  
Sbjct: 88  DGAGDGLSGDDQDPNQRPRKKRYHRHTQHQIQEMEAFFKECPHPDDKQRKELSRELGLEP 147

Query: 136 RQVKFWFQNRRTQIK--------------IEKLRAALGKHRHAAVS-------------- 195
            QVKFWFQN+RTQ+K               +KLRA   +++ A  S              
Sbjct: 148 LQVKFWFQNKRTQMKNQHERHENAQLRAENDKLRAENMRYKEALSSASCPNCGGPAALGE 207

Query: 196 LSYSSENEQETNQSC--------------------------------------------- 255
           +S+   + +  N                                                
Sbjct: 208 MSFDEHHLRVENARLRDEIDRISGIAAKHVGKPPIVSFPVLSSPLAVAAARSPLDLAGAY 267

Query: 256 ------LDYYTGIFRL---------EKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGR 315
                 LD + G   L         +K  I+E    A++EL  MA   + LW  S E   
Sbjct: 268 GVVTPGLDMFGGAGDLLRGVHPLDADKPMIVELAVAAMDELVQMAQLDEPLWSSSSEPAA 327

Query: 316 EILNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFM 375
            +L+ +EY + F       +  LK+  EASR  A+V M  S LV+  MD NQ+  +F  +
Sbjct: 328 ALLDEEEYARMFPRGLGPKQYGLKS--EASRHGAVVIMTHSNLVEILMDVNQFATVFSSI 387

Query: 376 ISKAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDV 435
           +S+A+T +V+  G   N++GA+Q+M  E Q+ +PLV TRE YF+R+CK      WA+VDV
Sbjct: 388 VSRASTHEVLSTGVAGNYNGALQVMSMEFQVPSPLVPTRESYFVRYCKNNSDGTWAVVDV 447

Query: 436 SIENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHCK-----------NKVHNLFRNLINNG 495
           S++++  + +     K R+RPSGC+I++  NG+ K           + VHN+++ L+N+G
Sbjct: 448 SLDSLRPSPVQ----KCRRRPSGCLIQEMPNGYSKVTWVEHVEVDDSSVHNIYKPLVNSG 507

Query: 496 GGFGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIA 555
             FGAKRW+ TL  QCERLA  MA+NIP  D   + ++ GRKS LKLA+RM +SF   + 
Sbjct: 508 LAFGAKRWVGTLDRQCERLASAMASNIPNGDLGVITSVEGRKSMLKLAERMVASFCGGVT 567

Query: 556 ASSYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQA 615
           AS    WT + G   ED+RV +RK+  D G P G +L + +S WLPV P  +F FLRD+ 
Sbjct: 568 ASVAHQWTTLSGSGAEDVRVMTRKSVDDPGRPPGIVLNAATSFWLPVPPAAVFDFLRDET 627

Query: 616 CRHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESS-SSMWILQDSSTNSSESMV 658
            R +WD++  G   + +A++A G+D GNSV++  + S  S+ S+M ILQ+S T++S S V
Sbjct: 628 SRSEWDILSNGGAVQEMAHIANGRDHGNSVSLLRVNSANSNQSNMLILQESCTDASGSYV 687

BLAST of CmaCh14G013050 vs. ExPASy Swiss-Prot
Match: A2YR02 (Homeobox-leucine zipper protein ROC7 OS=Oryza sativa subsp. indica OX=39946 GN=ROC7 PE=3 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 1.5e-107
Identity = 255/687 (37.12%), Positives = 371/687 (54.00%), Query Frame = 0

Query: 60  EVSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHP 119
           E+    SG   +       GGG   +++D     R+K+YHRHT  QI+E+EA FKE PHP
Sbjct: 54  ELEMSKSGGSDNLESGGGGGGGGSGDDQDPNQRPRKKRYHRHTQHQIQELEAFFKECPHP 113

Query: 120 DEKQRQQLAKLLGLSSRQVKFWFQNRRTQIKI--------------EKLRAALGKHRHAA 179
           D+KQR++L++ LGL   QVKFWFQN+RTQ+K               EKLRA   +++ A 
Sbjct: 114 DDKQRKELSRELGLEPLQVKFWFQNKRTQMKTQHERHENNALRAENEKLRAENMRYKEAL 173

Query: 180 VSLSYSS-------------------ENEQ------------------------------ 239
            + S  +                   EN +                              
Sbjct: 174 ANASCPNCGGPAAIGEMSFDEHHLRLENARLRDEIDRISAIAAKYVGKPAAAVSAAYPPL 233

Query: 240 -ETNQSCLDYY------TGIF--RLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGR 299
             +N+S LD+         +F    +K  ++E    A+EEL  MA  G+ LW  ++  G 
Sbjct: 234 PPSNRSPLDHMGIPGAGADVFGADFDKPLVIELAVAAMEELVRMAQLGEPLWAPAL--GG 293

Query: 300 EILNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFM 359
           E L  +EY +TF          L++  EASRETA+V M    LV+  MD  QW  +F  +
Sbjct: 294 EALGEEEYARTFPRGLGPKSPELRS--EASRETAVVIMNHVSLVEMLMDVGQWTALFSSI 353

Query: 360 ISKAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDV 419
           +S+AAT++V+  G   N +GA+QLM AE QM +PLV TRE  F+R+CKQ     WA+VDV
Sbjct: 354 VSRAATLEVLSTGVAGNHNGALQLMSAEFQMPSPLVPTRETQFLRYCKQHPDGTWAVVDV 413

Query: 420 SIENVE----DNNIDVSLVKYRKRPSGCIIKDEFNGHCK-----------NKVHNLFRNL 479
           S++ +           +   +R+RPSGC+I++  NG+ K             VHNL++ +
Sbjct: 414 SLDGLRAGAGGGCQPAAARGHRRRPSGCLIQEMPNGYSKVTWVEHVEADDQMVHNLYKPV 473

Query: 480 INNGGGFGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLA-GRKSTLKLAQRMSSSF 539
           +N+G  FGA+RW+ATL+ QCERLA  MA+N+      GV T + GR+S LKLA+RM +SF
Sbjct: 474 VNSGMAFGARRWVATLERQCERLASAMASNVASSGDAGVITTSEGRRSMLKLAERMVASF 533

Query: 540 SQAIAASSYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHF 599
              + AS+   WT + G   ED+RV +RK+  D G P G +L + +S WLPV P  +F F
Sbjct: 534 CGGVTASTTHQWTTLSGSGAEDVRVMTRKSVDDPGRPPGIVLNAATSFWLPVPPSRVFDF 593

Query: 600 LRDQACRHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESS-SSMWILQDSSTNS 658
           LRD + R +WD++  G   + +A++A G+D GN+V++  + +  S+ S+M ILQ+  T++
Sbjct: 594 LRDDSTRSEWDILSNGGVVQEMAHIANGRDHGNAVSLLRVNNANSNQSNMLILQECCTDA 653

BLAST of CmaCh14G013050 vs. ExPASy TrEMBL
Match: A0A6J1IST9 (LOW QUALITY PROTEIN: homeobox-leucine zipper protein GLABRA 2-like OS=Cucurbita maxima OX=3661 GN=LOC111479058 PE=3 SV=1)

HSP 1 Score: 1261.9 bits (3264), Expect = 0.0e+00
Identity = 658/734 (89.65%), Positives = 658/734 (89.65%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE 60
           MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE
Sbjct: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE 60

Query: 61  VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD 120
           VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD
Sbjct: 61  VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD 120

Query: 121 EKQRQQLAKLLGLSSRQVKFWFQNRRTQIK------------------------------ 180
           EKQRQQLAKLLGLSSRQVKFWFQNRRTQIK                              
Sbjct: 121 EKQRQQLAKLLGLSSRQVKFWFQNRRTQIKAIQERHENTLFXEMEKLRDENKGMREIFKR 180

Query: 181 --------------------------------IEKLRAALGKHRHAAVSLSYSSENEQET 240
                                           IEKLRAALGKHRHAAVSLSYSSENEQET
Sbjct: 181 KLGCPNCGTTDVAATATTTEQLRIKNAKLKVEIEKLRAALGKHRHAAVSLSYSSENEQET 240

Query: 241 NQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKT 300
           NQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKT
Sbjct: 241 NQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKT 300

Query: 301 FHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVIC 360
           FHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVIC
Sbjct: 301 FHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVIC 360

Query: 361 NGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNID 420
           NGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNID
Sbjct: 361 NGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNID 420

Query: 421 VSLVKYRKRPSGCIIKDEFNGHC--------------KNKVHNLFRNLINNGGGFGAKRW 480
           VSLVKYRKRPSGCIIKDEFNGHC              KNKVHNLFRNLINNGGGFGAKRW
Sbjct: 421 VSLVKYRKRPSGCIIKDEFNGHCKVFLVTMVEHLECQKNKVHNLFRNLINNGGGFGAKRW 480

Query: 481 MATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWT 540
           MATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWT
Sbjct: 481 MATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWT 540

Query: 541 KVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVM 600
           KVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVM
Sbjct: 541 KVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVM 600

Query: 601 FGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTG 659
           FGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTG
Sbjct: 601 FGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTG 660
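
The percentages on each HSP statistics line are the matched residue counts divided by the alignment length (for example, 658/734 gives 89.65%). The following is a minimal sketch, not part of the database output, of how such a line could be parsed and the percentages recomputed. The function name `parse_hsp_stats` and the returned field names are hypothetical, chosen only for illustration.

```python
import re

# Matches an HSP statistics line of the form used in this report.
# Accepts both "Positives" and the variant spelling "Postives" that
# appears in some BLAST report exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Recomputed from the raw counts, rounded to two decimals as in the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed on the line, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 625/737 (84.80%), Positives = 637/737 (86.43%), Query Frame = 0"
    )
    print(stats)
```

Comparing the recomputed and reported values is a quick sanity check when copying hit statistics out of the page by hand.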

BLAST of CmaCh14G013050 vs. ExPASy TrEMBL
Match: A0A6J1GVV3 (homeobox-leucine zipper protein GLABRA 2-like OS=Cucurbita moschata OX=3662 GN=LOC111457637 PE=3 SV=1)

HSP 1 Score: 1184.9 bits (3064), Expect = 0.0e+00
Identity = 625/737 (84.80%), Positives = 637/737 (86.43%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE 60
           M ADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGE GEKGDVEMEEVDDGRDDMTTAVE
Sbjct: 1   MRADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGETGEKGDVEMEEVDDGRDDMTTAVE 60

Query: 61  VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD 120
           VSSENSGPVRSRSD+DYDGGGVHEEN+DGCHGKRRKKYHRHTNEQIREMEALFKESPH D
Sbjct: 61  VSSENSGPVRSRSDEDYDGGGVHEENDDGCHGKRRKKYHRHTNEQIREMEALFKESPHLD 120

Query: 121 EKQRQQLAKLLGLSSRQVKFWFQNRRTQIK------------------------------ 180
           EKQRQQL+K LGLSSRQVKFWFQNRRTQIK                              
Sbjct: 121 EKQRQQLSKRLGLSSRQVKFWFQNRRTQIKAIQERHENTLLKAEMEKLRDENKGMREISK 180

Query: 181 ------------------------------------IEKLRAALGKHRHAAVSLSYSSEN 240
                                               IEKLRAALGKHRHAA S SYSSEN
Sbjct: 181 RKLGCPNCGTTDAGEDDVASTTTEQLRIKNAKLKVEIEKLRAALGKHRHAAASPSYSSEN 240

Query: 241 EQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDE 300
           EQETNQSCLDYYTGIF LEKSRIMEKVHQALEELKTMAAAGD LWV SVETGREILNYDE
Sbjct: 241 EQETNQSCLDYYTGIFGLEKSRIMEKVHQALEELKTMAAAGDPLWVRSVETGREILNYDE 300

Query: 301 YLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATV 360
           YLKTFHLNN+SN  WLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATV
Sbjct: 301 YLKTFHLNNNSNHRWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATV 360

Query: 361 DVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVED 420
           DVICNGG+ANWDGAVQLMFAEVQMLTPLVSTREMYFIR+CKQLDAERWAI+DVSIENVED
Sbjct: 361 DVICNGGVANWDGAVQLMFAEVQMLTPLVSTREMYFIRYCKQLDAERWAILDVSIENVED 420

Query: 421 NNIDVSLVKYRKRPSGCIIKDEFNGHC-----------KNKVHNLFRNLINNGGGFGAKR 480
           NNIDVSLVKYRKRPSGCIIKDEFNGHC           KNKVHNLFR+L+NNGGGFGAKR
Sbjct: 421 NNIDVSLVKYRKRPSGCIIKDEFNGHCKVTMVEHLECQKNKVHNLFRSLVNNGGGFGAKR 480

Query: 481 WMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTW 540
           WMATLQLQCERLAFFMATNIPMKDS GVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTW
Sbjct: 481 WMATLQLQCERLAFFMATNIPMKDSNGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTW 540

Query: 541 TKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDV 600
           TKVVGKTGEDIRVCSRKN SDLGEPIGAILC+VSSLWLPVSP LLFHFL DQA   +WD 
Sbjct: 541 TKVVGKTGEDIRVCSRKNLSDLGEPIGAILCAVSSLWLPVSPDLLFHFLLDQARAIRWDA 600

Query: 601 MFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSS--MWILQDSSTNSSESMVVYSGVD 659
           MFGGDEAKSIANLAKGQDRGNSVTIQTIGSKES+SS  MWILQDSSTNSSESMVVYSGVD
Sbjct: 601 MFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESNSSSGMWILQDSSTNSSESMVVYSGVD 660

BLAST of CmaCh14G013050 vs. ExPASy TrEMBL
Match: A0A6J1KCZ9 (homeobox-leucine zipper protein GLABRA 2 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111493199 PE=3 SV=1)

HSP 1 Score: 1027.7 bits (2656), Expect = 2.7e-296
Identity = 549/744 (73.79%), Positives = 598/744 (80.38%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDG-----RDDM 60
           M ADMS+NNNRLAFTKDFFSSPALSLTLAGIFRRGE  EKGDVEMEEVDDG     RDD 
Sbjct: 1   MLADMSNNNNRLAFTKDFFSSPALSLTLAGIFRRGEMAEKGDVEMEEVDDGSGEVRRDD- 60

Query: 61  TTAVEVSSENSGPVRSRSDDDYDGGGVHEENED---GCHGKRRKKYHRHTNEQIREMEAL 120
             A EVSSEN GPVRSRSDDD++GGGVHEENE+   GC  KRRKKYHRHT EQIREMEAL
Sbjct: 61  --AAEVSSENLGPVRSRSDDDFEGGGVHEENEEGDGGCLVKRRKKYHRHTTEQIREMEAL 120

Query: 121 FKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK---------------------- 180
           FKESPHPDEKQRQQL+K LGLS RQVKFWFQNRRTQIK                      
Sbjct: 121 FKESPHPDEKQRQQLSKRLGLSPRQVKFWFQNRRTQIKAIQERHENTLLKAEMEKLREEN 180

Query: 181 --------------------------------------------IEKLRAALGKHRHAAV 240
                                                       +EKLRAALGKH  +++
Sbjct: 181 KAMRELTKKKVGCPNCGTANAAEDHTAFTTTEQLRIKNAKLKAEVEKLRAALGKHSQSSI 240

Query: 241 SLSYSSENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETG 300
           S SYSS N+QETN++CLD+YTGIF LEKSRIME+VHQALEELKTMAA  + LW+ S+ETG
Sbjct: 241 SPSYSSGNDQETNRNCLDFYTGIFGLEKSRIMERVHQALEELKTMAATNEPLWIRSIETG 300

Query: 301 REILNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPF 360
           REILNYDEYLKTFHL N SN  WLK HIEASR+T +VFMEPSRL+QSFMDENQWKEMFP 
Sbjct: 301 REILNYDEYLKTFHLKN-SNTCWLKRHIEASRDTTVVFMEPSRLIQSFMDENQWKEMFPS 360

Query: 361 MISKAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVD 420
           MISKAAT+DVICNG +ANW+GAVQLMF EVQ+LTPLV TRE+YFIRHCKQLD E+WAIVD
Sbjct: 361 MISKAATIDVICNGEVANWNGAVQLMFVEVQLLTPLVPTREIYFIRHCKQLDTEQWAIVD 420

Query: 421 VSIENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHC-----------KNKVHNLFRNLINN 480
           VSI+NV+D NID SL+KYRKRPSGCIIKDE NGHC           K +VHNL+R ++NN
Sbjct: 421 VSIDNVDDINIDASLMKYRKRPSGCIIKDESNGHCKVTMVEHLECEKTQVHNLYRTIVNN 480

Query: 481 GGGFGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAI 540
           G  FGA+ WMATLQLQCERLAFFMATNIP+KDSTGVATLAGRKSTLKLAQRMSSSFSQA+
Sbjct: 481 GTAFGARHWMATLQLQCERLAFFMATNIPIKDSTGVATLAGRKSTLKLAQRMSSSFSQAV 540

Query: 541 AASSYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQ 600
           AASSYQTWTKVVGKTGEDIRVCSRKN SD GEPIG ILC+V SLWLPV PH LFHFLRD 
Sbjct: 541 AASSYQTWTKVVGKTGEDIRVCSRKNLSDPGEPIGVILCAVFSLWLPVPPHTLFHFLRDD 600

Query: 601 ACRHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKE-SSSSMWILQDSSTNSSESM 659
           A R++WD MFGGD+  SIANLAKGQDRGNSV+IQT GSKE SSSSMWILQD+STNSSESM
Sbjct: 601 ARRNEWDAMFGGDKVDSIANLAKGQDRGNSVSIQTRGSKESSSSSMWILQDTSTNSSESM 660

BLAST of CmaCh14G013050 vs. ExPASy TrEMBL
Match: A0A6J1G8W0 (homeobox-leucine zipper protein GLABRA 2 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111452011 PE=3 SV=1)

HSP 1 Score: 1018.8 bits (2633), Expect = 1.2e-293
Identity = 542/739 (73.34%), Positives = 593/739 (80.24%), Query Frame = 0

Query: 5   MSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDG-----RDDMTTAV 64
           MS+NNNRLAFTKDFFSSPALSLTLAGIFRRGE  EKGDVEMEEVDDG     RDD   A 
Sbjct: 1   MSNNNNRLAFTKDFFSSPALSLTLAGIFRRGEMAEKGDVEMEEVDDGSGEVRRDD---AA 60

Query: 65  EVSSENSGPVRSRSDDDYDGGGVHEENED---GCHGKRRKKYHRHTNEQIREMEALFKES 124
           EVSSEN GPVRSRSDDD++GGGVHEENE+   GC  KRRKKYHRHT EQIREMEALFKES
Sbjct: 61  EVSSENLGPVRSRSDDDFEGGGVHEENEEGDGGCLVKRRKKYHRHTTEQIREMEALFKES 120

Query: 125 PHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK-------------------------- 184
           PHPDEKQRQQL+K LGLS RQVKFWFQNRRTQIK                          
Sbjct: 121 PHPDEKQRQQLSKRLGLSPRQVKFWFQNRRTQIKAIQERHENTLLKAEMEKLREENKAMR 180

Query: 185 ----------------------------------------IEKLRAALGKHRHAAVSLSY 244
                                                   +EKLRAALGKH  +++S SY
Sbjct: 181 ELTKKKVGCPNCGTANAAEDHTAFTTTEQLRIKNAKLKAEVEKLRAALGKHSQSSISPSY 240

Query: 245 SSENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREIL 304
           SS N+QE N++CLD+YTGIF LEKSRIME+VHQALEELKTMA   + LW+ S+ETGREIL
Sbjct: 241 SSGNDQEANRNCLDFYTGIFGLEKSRIMERVHQALEELKTMAVTNEPLWIRSIETGREIL 300

Query: 305 NYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISK 364
           NYDEYLKTFH+ N SN  WLK HIEASR+T +VFMEPSRL+QSFMDENQWKEMFP MISK
Sbjct: 301 NYDEYLKTFHVKN-SNACWLKRHIEASRDTTVVFMEPSRLIQSFMDENQWKEMFPSMISK 360

Query: 365 AATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIE 424
           AAT+DVICNG +ANW+GAVQLMF EVQ+LTPLV TRE+YFIRHCKQLDAE WAIVDVSI+
Sbjct: 361 AATIDVICNGEVANWNGAVQLMFVEVQLLTPLVPTREIYFIRHCKQLDAELWAIVDVSID 420

Query: 425 NVEDNNIDVSLVKYRKRPSGCIIKDEFNGHC-----------KNKVHNLFRNLINNGGGF 484
           NV+D NID SL+KYRKRPSGCIIKDE NGHC           K KVHNL+R ++NNG  F
Sbjct: 421 NVDDTNIDASLMKYRKRPSGCIIKDESNGHCKVTMVEHLECEKTKVHNLYRTIVNNGTAF 480

Query: 485 GAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASS 544
           GA+ WMATLQLQCERLAFFMATNIP+KDSTGVATLAGRKS LKLA+RMSSSFSQA+AASS
Sbjct: 481 GARHWMATLQLQCERLAFFMATNIPIKDSTGVATLAGRKSILKLAERMSSSFSQAVAASS 540

Query: 545 YQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRH 604
           YQTWTKVVGKTGEDIRVCSRKN SD GEPIG ILC+V SLWLPV PH LFHFLRD+A R+
Sbjct: 541 YQTWTKVVGKTGEDIRVCSRKNLSDPGEPIGVILCAVFSLWLPVPPHTLFHFLRDEARRN 600

Query: 605 QWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSG 659
           +WD MFGGD+ +SIANLAKGQDRGNSV+IQT GSKE SSSMWILQD+STNSSESMVVYSG
Sbjct: 601 EWDAMFGGDKVESIANLAKGQDRGNSVSIQTRGSKE-SSSMWILQDASTNSSESMVVYSG 660

BLAST of CmaCh14G013050 vs. ExPASy TrEMBL
Match: A0A6J1KAL9 (homeobox-leucine zipper protein GLABRA 2 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111493199 PE=3 SV=1)

HSP 1 Score: 1016.5 bits (2627), Expect = 6.2e-293
Identity = 546/744 (73.39%), Positives = 594/744 (79.84%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDG-----RDDM 60
           M ADMS+NNNRLAFTKDFFSSPALSLTLAGIFRRGE  EKGDVEMEEVDDG     RDD 
Sbjct: 1   MLADMSNNNNRLAFTKDFFSSPALSLTLAGIFRRGEMAEKGDVEMEEVDDGSGEVRRDD- 60

Query: 61  TTAVEVSSENSGPVRSRSDDDYDGGGVHEENED---GCHGKRRKKYHRHTNEQIREMEAL 120
             A EVSSEN GPVRSRSDDD++GGGVHEENE+   GC  KRRKKYHRHT EQIREMEAL
Sbjct: 61  --AAEVSSENLGPVRSRSDDDFEGGGVHEENEEGDGGCLVKRRKKYHRHTTEQIREMEAL 120

Query: 121 FKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK---------------------- 180
           FKESPHPDEKQRQQL+K LGLS RQVKFWFQNRRTQIK                      
Sbjct: 121 FKESPHPDEKQRQQLSKRLGLSPRQVKFWFQNRRTQIKAIQERHENTLLKAEMEKLREEN 180

Query: 181 --------------------------------------------IEKLRAALGKHRHAAV 240
                                                       +EKLRAALGKH  +++
Sbjct: 181 KAMRELTKKKVGCPNCGTANAAEDHTAFTTTEQLRIKNAKLKAEVEKLRAALGKHSQSSI 240

Query: 241 SLSYSSENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETG 300
           S SYSS N+QETN++CLD+YTGIF LEKSRIME+VHQALEELKTMAA  + LW+ S+ETG
Sbjct: 241 SPSYSSGNDQETNRNCLDFYTGIFGLEKSRIMERVHQALEELKTMAATNEPLWIRSIETG 300

Query: 301 REILNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPF 360
           REILNYDEYLKTFHL N SN  WLK HIEASR+T +VFMEPSRL+QSFMDENQWKEMFP 
Sbjct: 301 REILNYDEYLKTFHLKN-SNTCWLKRHIEASRDTTVVFMEPSRLIQSFMDENQWKEMFPS 360

Query: 361 MISKAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVD 420
           MISKAAT+DVICNG +ANW+GAVQLMF EVQ+LTPLV TRE+YFIRHCKQLD E+WAIVD
Sbjct: 361 MISKAATIDVICNGEVANWNGAVQLMFVEVQLLTPLVPTREIYFIRHCKQLDTEQWAIVD 420

Query: 421 VSIENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHC-----------KNKVHNLFRNLINN 480
           VSI+NV+D NID SL+KYRKRPSGCIIKDE NGHC           K +VHNL+R ++NN
Sbjct: 421 VSIDNVDDINIDASLMKYRKRPSGCIIKDESNGHCKVTMVEHLECEKTQVHNLYRTIVNN 480

Query: 481 GGGFGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAI 540
           G  FGA+ WMATLQLQCERLAFFMATNIP+KDSTGVATLAGRKSTLKLAQRMSSSFSQA+
Sbjct: 481 GTAFGARHWMATLQLQCERLAFFMATNIPIKDSTGVATLAGRKSTLKLAQRMSSSFSQAV 540

Query: 541 AASSYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQ 600
           AASSYQTWTKVVGKTGEDIRVCSRKN SD GEPIG ILC+V SLWLPV PH LFHFLRD 
Sbjct: 541 AASSYQTWTKVVGKTGEDIRVCSRKNLSDPGEPIGVILCAVFSLWLPVPPHTLFHFLRDD 600

Query: 601 ACRHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKE-SSSSMWILQDSSTNSSESM 659
           A R++WD MFGGD+  SIANLAKGQDRGNS    T GSKE SSSSMWILQD+STNSSESM
Sbjct: 601 ARRNEWDAMFGGDKVDSIANLAKGQDRGNS----TRGSKESSSSSMWILQDTSTNSSESM 660

BLAST of CmaCh14G013050 vs. NCBI nr
Match: XP_022979290.1 (LOW QUALITY PROTEIN: homeobox-leucine zipper protein GLABRA 2-like [Cucurbita maxima])

HSP 1 Score: 1261.9 bits (3264), Expect = 0.0e+00
Identity = 658/734 (89.65%), Positives = 658/734 (89.65%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE 60
           MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE
Sbjct: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE 60

Query: 61  VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD 120
           VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD
Sbjct: 61  VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD 120

Query: 121 EKQRQQLAKLLGLSSRQVKFWFQNRRTQIK------------------------------ 180
           EKQRQQLAKLLGLSSRQVKFWFQNRRTQIK                              
Sbjct: 121 EKQRQQLAKLLGLSSRQVKFWFQNRRTQIKAIQERHENTLFXEMEKLRDENKGMREIFKR 180

Query: 181 --------------------------------IEKLRAALGKHRHAAVSLSYSSENEQET 240
                                           IEKLRAALGKHRHAAVSLSYSSENEQET
Sbjct: 181 KLGCPNCGTTDVAATATTTEQLRIKNAKLKVEIEKLRAALGKHRHAAVSLSYSSENEQET 240

Query: 241 NQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKT 300
           NQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKT
Sbjct: 241 NQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKT 300

Query: 301 FHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVIC 360
           FHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVIC
Sbjct: 301 FHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVIC 360

Query: 361 NGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNID 420
           NGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNID
Sbjct: 361 NGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNID 420

Query: 421 VSLVKYRKRPSGCIIKDEFNGHC--------------KNKVHNLFRNLINNGGGFGAKRW 480
           VSLVKYRKRPSGCIIKDEFNGHC              KNKVHNLFRNLINNGGGFGAKRW
Sbjct: 421 VSLVKYRKRPSGCIIKDEFNGHCKVFLVTMVEHLECQKNKVHNLFRNLINNGGGFGAKRW 480

Query: 481 MATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWT 540
           MATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWT
Sbjct: 481 MATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWT 540

Query: 541 KVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVM 600
           KVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVM
Sbjct: 541 KVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVM 600

Query: 601 FGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTG 659
           FGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTG
Sbjct: 601 FGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTG 660

BLAST of CmaCh14G013050 vs. NCBI nr
Match: XP_022955725.1 (homeobox-leucine zipper protein GLABRA 2-like [Cucurbita moschata])

HSP 1 Score: 1184.9 bits (3064), Expect = 0.0e+00
Identity = 625/737 (84.80%), Positives = 637/737 (86.43%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE 60
           M ADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGE GEKGDVEMEEVDDGRDDMTTAVE
Sbjct: 1   MRADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGETGEKGDVEMEEVDDGRDDMTTAVE 60

Query: 61  VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD 120
           VSSENSGPVRSRSD+DYDGGGVHEEN+DGCHGKRRKKYHRHTNEQIREMEALFKESPH D
Sbjct: 61  VSSENSGPVRSRSDEDYDGGGVHEENDDGCHGKRRKKYHRHTNEQIREMEALFKESPHLD 120

Query: 121 EKQRQQLAKLLGLSSRQVKFWFQNRRTQIK------------------------------ 180
           EKQRQQL+K LGLSSRQVKFWFQNRRTQIK                              
Sbjct: 121 EKQRQQLSKRLGLSSRQVKFWFQNRRTQIKAIQERHENTLLKAEMEKLRDENKGMREISK 180

Query: 181 ------------------------------------IEKLRAALGKHRHAAVSLSYSSEN 240
                                               IEKLRAALGKHRHAA S SYSSEN
Sbjct: 181 RKLGCPNCGTTDAGEDDVASTTTEQLRIKNAKLKVEIEKLRAALGKHRHAAASPSYSSEN 240

Query: 241 EQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDE 300
           EQETNQSCLDYYTGIF LEKSRIMEKVHQALEELKTMAAAGD LWV SVETGREILNYDE
Sbjct: 241 EQETNQSCLDYYTGIFGLEKSRIMEKVHQALEELKTMAAAGDPLWVRSVETGREILNYDE 300

Query: 301 YLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATV 360
           YLKTFHLNN+SN  WLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATV
Sbjct: 301 YLKTFHLNNNSNHRWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATV 360

Query: 361 DVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVED 420
           DVICNGG+ANWDGAVQLMFAEVQMLTPLVSTREMYFIR+CKQLDAERWAI+DVSIENVED
Sbjct: 361 DVICNGGVANWDGAVQLMFAEVQMLTPLVSTREMYFIRYCKQLDAERWAILDVSIENVED 420

Query: 421 NNIDVSLVKYRKRPSGCIIKDEFNGHC-----------KNKVHNLFRNLINNGGGFGAKR 480
           NNIDVSLVKYRKRPSGCIIKDEFNGHC           KNKVHNLFR+L+NNGGGFGAKR
Sbjct: 421 NNIDVSLVKYRKRPSGCIIKDEFNGHCKVTMVEHLECQKNKVHNLFRSLVNNGGGFGAKR 480

Query: 481 WMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTW 540
           WMATLQLQCERLAFFMATNIPMKDS GVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTW
Sbjct: 481 WMATLQLQCERLAFFMATNIPMKDSNGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTW 540

Query: 541 TKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDV 600
           TKVVGKTGEDIRVCSRKN SDLGEPIGAILC+VSSLWLPVSP LLFHFL DQA   +WD 
Sbjct: 541 TKVVGKTGEDIRVCSRKNLSDLGEPIGAILCAVSSLWLPVSPDLLFHFLLDQARAIRWDA 600

Query: 601 MFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSS--MWILQDSSTNSSESMVVYSGVD 659
           MFGGDEAKSIANLAKGQDRGNSVTIQTIGSKES+SS  MWILQDSSTNSSESMVVYSGVD
Sbjct: 601 MFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESNSSSGMWILQDSSTNSSESMVVYSGVD 660

BLAST of CmaCh14G013050 vs. NCBI nr
Match: XP_038904725.1 (homeobox-leucine zipper protein GLABRA 2-like [Benincasa hispida])

HSP 1 Score: 1037.7 bits (2682), Expect = 5.3e-299
Identity = 563/746 (75.47%), Positives = 602/746 (80.70%), Query Frame = 0

Query: 1   MGADMSDNN---NRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTT 60
           MGADMS+NN   NRLAFTKDFFSSPALSLTLAGIFRRGE  EKGDVEMEEVDDG     T
Sbjct: 1   MGADMSNNNNNTNRLAFTKDFFSSPALSLTLAGIFRRGEVTEKGDVEMEEVDDGARRDDT 60

Query: 61  AVEVSSENSGPVRSRSDDD-YDGGGVHEENED----GCHGKRRKKYHRHTNEQIREMEAL 120
             E+SSENSGP+RSRSDD+  DGGG   ENED    GCH KRRKKYHRHT EQIREMEAL
Sbjct: 61  TAELSSENSGPMRSRSDDEGLDGGG--GENEDGVDHGCHVKRRKKYHRHTTEQIREMEAL 120

Query: 121 FKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK---------------------- 180
           FKESPHPDEKQRQQL+K LGLS RQVKFWFQNRRTQIK                      
Sbjct: 121 FKESPHPDEKQRQQLSKRLGLSPRQVKFWFQNRRTQIKAIQERHENTLLKAEMEKLREEN 180

Query: 181 --------------------------------------------IEKLRAALGKHRHAAV 240
                                                       +EKLRAALGK+  AA 
Sbjct: 181 KAMREISKKKVGCANCGISDAGEDHVALTTTEQLRIKNAKLKAEVEKLRAALGKYPQAAA 240

Query: 241 SLSYSSENEQE-TNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVET 300
           S SYSS NE E TN+SCLD+YTGIF LEKSRIMEKVHQALEELKTMAAAGD LWV SVET
Sbjct: 241 SPSYSSGNEPETTNRSCLDFYTGIFGLEKSRIMEKVHQALEELKTMAAAGDPLWVRSVET 300

Query: 301 GREILNYDEYLKTF-HLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMF 360
           GR+ILNYDEYLKTF H NN+SN  WLKTHIEASRETALVFMEPSRLVQSFMDEN+WKEMF
Sbjct: 301 GRQILNYDEYLKTFHHNNNNSNTRWLKTHIEASRETALVFMEPSRLVQSFMDENKWKEMF 360

Query: 361 PFMISKAATVDVICNGGIANW-DGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWA 420
           PFMISKAATVDVICNG  ANW +GAVQLMFAEVQMLTPLV TREMYFIRHCKQLD E+WA
Sbjct: 361 PFMISKAATVDVICNGEAANWNNGAVQLMFAEVQMLTPLVPTREMYFIRHCKQLDIEQWA 420

Query: 421 IVDVSIENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHC-----------KNKVHNLFRNL 480
           IVDVSIEN+EDNNID SLVKY+KRPSGCIIKDE NGHC           K+KVHNL+R++
Sbjct: 421 IVDVSIENIEDNNIDASLVKYKKRPSGCIIKDESNGHCKVTMVEHLECEKSKVHNLYRSI 480

Query: 481 INNGGGFGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFS 540
           +NNG  FGA+ WMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMS SFS
Sbjct: 481 VNNGTAFGARHWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSCSFS 540

Query: 541 QAIAASSYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFL 600
           Q +AASSYQTWTKVVGKTGEDIRVCSRKN SD GEPIG ILC+VSSLWLP+SPHLLF F 
Sbjct: 541 QVVAASSYQTWTKVVGKTGEDIRVCSRKNVSDPGEPIGVILCAVSSLWLPLSPHLLFDFF 600

Query: 601 RDQACRHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSE 659
           RD++ R+QWD MFGGD+AKSIANLAKGQDRGNSVTIQTIGSKE S++MWILQDSSTN SE
Sbjct: 601 RDESRRNQWDAMFGGDKAKSIANLAKGQDRGNSVTIQTIGSKE-SNNMWILQDSSTNLSE 660

BLAST of CmaCh14G013050 vs. NCBI nr
Match: KAG6581890.1 (Photosystem I reaction center subunit VI, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1029.6 bits (2661), Expect = 1.5e-296
Identity = 584/831 (70.28%), Positives = 592/831 (71.24%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVE 60
           M ADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGE GEKGDVEMEEVDDGRDDMTTAVE
Sbjct: 1   MRADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGETGEKGDVEMEEVDDGRDDMTTAVE 60

Query: 61  VSSENSGPVRSRSDDDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPD 120
           VSSEN GP                                                    
Sbjct: 61  VSSENLGP---------------------------------------------------- 120

Query: 121 EKQRQQLAKLLGLSSRQVKFWFQNRRTQIKIEKLRAALGKHRHAAVSLSYSSENEQETNQ 180
                                         IEKLRAALGKHRHAA S SYSSENEQETNQ
Sbjct: 121 ------------------------------IEKLRAALGKHRHAAASPSYSSENEQETNQ 180

Query: 181 SCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKTFH 240
           SCLDYYTGIF LEKSRIMEKVHQALEELKTMAAA D LW                     
Sbjct: 181 SCLDYYTGIFGLEKSRIMEKVHQALEELKTMAAASDPLW--------------------- 240

Query: 241 LNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVICNG 300
                                                NQWKEMFPFMISKAATVDVICNG
Sbjct: 241 -------------------------------------NQWKEMFPFMISKAATVDVICNG 300

Query: 301 GIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIV-DVSIENVEDNNIDV 360
           G+ANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAI+ DVSIENVE+NNIDV
Sbjct: 301 GVANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIILDVSIENVENNNIDV 360

Query: 361 SLVKYRKRPSGCIIKDEFNGHCKNKVHNLFRNLINNGGGFGAKRWMATLQLQCERLAFFM 420
           SLVKYRKRPSGCIIKDE                                           
Sbjct: 361 SLVKYRKRPSGCIIKDE------------------------------------------- 420

Query: 421 ATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWTKVVGKTGEDIRVCSR 480
                   S GVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWTKVVGKTGEDIRVCSR
Sbjct: 421 --------SNGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWTKVVGKTGEDIRVCSR 480

Query: 481 KNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVMFGGDEAKSIANLAKG 540
           KN SDLGEPIGAILC+VSSLWLPVSPHLLFHFL DQA RH                    
Sbjct: 481 KNLSDLGEPIGAILCAVSSLWLPVSPHLLFHFLLDQARRH-------------------- 540

Query: 541 QDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTI 600
                    QTIGSKES+SSMWILQDSSTNS ESMVVYSGVDVTGMQSVMTGCDSSSLTI
Sbjct: 541 ---------QTIGSKESNSSMWILQDSSTNSLESMVVYSGVDVTGMQSVMTGCDSSSLTI 600

Query: 601 LPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTAAVQILTDASPSAKHTMESVEI 660
           LPSGFSILPDGAVSRPPLLITRQKDDKTA+TNGGVLLTAAVQILTDASPSAK TMESVEI
Sbjct: 601 LPSGFSILPDGAVSRPPLLITRQKDDKTANTNGGVLLTAAVQILTDASPSAKPTMESVEI 611

Query: 661 KTPFEASPPPSACNSKFVPFQYKTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRPS 720
           + PFEAS PPSACNSKFVP Q KTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRP+
Sbjct: 661 EIPFEASSPPSACNSKFVPSQSKTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRPT 611

Query: 721 RQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFA 780
           RQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFA
Sbjct: 721 RQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFA 611

Query: 781 APFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI 831
           APFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Sbjct: 781 APFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI 611

BLAST of CmaCh14G013050 vs. NCBI nr
Match: XP_022998610.1 (homeobox-leucine zipper protein GLABRA 2 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1027.7 bits (2656), Expect = 5.5e-296
Identity = 549/744 (73.79%), Positives = 598/744 (80.38%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDG-----RDDM 60
           M ADMS+NNNRLAFTKDFFSSPALSLTLAGIFRRGE  EKGDVEMEEVDDG     RDD 
Sbjct: 1   MLADMSNNNNRLAFTKDFFSSPALSLTLAGIFRRGEMAEKGDVEMEEVDDGSGEVRRDD- 60

Query: 61  TTAVEVSSENSGPVRSRSDDDYDGGGVHEENED---GCHGKRRKKYHRHTNEQIREMEAL 120
             A EVSSEN GPVRSRSDDD++GGGVHEENE+   GC  KRRKKYHRHT EQIREMEAL
Sbjct: 61  --AAEVSSENLGPVRSRSDDDFEGGGVHEENEEGDGGCLVKRRKKYHRHTTEQIREMEAL 120

Query: 121 FKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK---------------------- 180
           FKESPHPDEKQRQQL+K LGLS RQVKFWFQNRRTQIK                      
Sbjct: 121 FKESPHPDEKQRQQLSKRLGLSPRQVKFWFQNRRTQIKAIQERHENTLLKAEMEKLREEN 180

Query: 181 --------------------------------------------IEKLRAALGKHRHAAV 240
                                                       +EKLRAALGKH  +++
Sbjct: 181 KAMRELTKKKVGCPNCGTANAAEDHTAFTTTEQLRIKNAKLKAEVEKLRAALGKHSQSSI 240

Query: 241 SLSYSSENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETG 300
           S SYSS N+QETN++CLD+YTGIF LEKSRIME+VHQALEELKTMAA  + LW+ S+ETG
Sbjct: 241 SPSYSSGNDQETNRNCLDFYTGIFGLEKSRIMERVHQALEELKTMAATNEPLWIRSIETG 300

Query: 301 REILNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPF 360
           REILNYDEYLKTFHL N SN  WLK HIEASR+T +VFMEPSRL+QSFMDENQWKEMFP 
Sbjct: 301 REILNYDEYLKTFHLKN-SNTCWLKRHIEASRDTTVVFMEPSRLIQSFMDENQWKEMFPS 360

Query: 361 MISKAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVD 420
           MISKAAT+DVICNG +ANW+GAVQLMF EVQ+LTPLV TRE+YFIRHCKQLD E+WAIVD
Sbjct: 361 MISKAATIDVICNGEVANWNGAVQLMFVEVQLLTPLVPTREIYFIRHCKQLDTEQWAIVD 420

Query: 421 VSIENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHC-----------KNKVHNLFRNLINN 480
           VSI+NV+D NID SL+KYRKRPSGCIIKDE NGHC           K +VHNL+R ++NN
Sbjct: 421 VSIDNVDDINIDASLMKYRKRPSGCIIKDESNGHCKVTMVEHLECEKTQVHNLYRTIVNN 480

Query: 481 GGGFGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAI 540
           G  FGA+ WMATLQLQCERLAFFMATNIP+KDSTGVATLAGRKSTLKLAQRMSSSFSQA+
Sbjct: 481 GTAFGARHWMATLQLQCERLAFFMATNIPIKDSTGVATLAGRKSTLKLAQRMSSSFSQAV 540

Query: 541 AASSYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQ 600
           AASSYQTWTKVVGKTGEDIRVCSRKN SD GEPIG ILC+V SLWLPV PH LFHFLRD 
Sbjct: 541 AASSYQTWTKVVGKTGEDIRVCSRKNLSDPGEPIGVILCAVFSLWLPVPPHTLFHFLRDD 600

Query: 601 ACRHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKE-SSSSMWILQDSSTNSSESM 659
           A R++WD MFGGD+  SIANLAKGQDRGNSV+IQT GSKE SSSSMWILQD+STNSSESM
Sbjct: 601 ARRNEWDAMFGGDKVDSIANLAKGQDRGNSVSIQTRGSKESSSSSMWILQDTSTNSSESM 660

BLAST of CmaCh14G013050 vs. TAIR 10
Match: AT1G79840.1 (HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START domain)

HSP 1 Score: 680.2 bits (1754), Expect = 2.0e-195
Identity = 403/743 (54.24%), Positives = 488/743 (65.68%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGR---DDMTT 60
           M  DMS        TKDFFSSPALSL+LAGIFR   +   G    EE   GR   DD   
Sbjct: 3   MAVDMSSKQP----TKDFFSSPALSLSLAGIFRNASS---GSTNPEEDFLGRRVVDDEDR 62

Query: 61  AVEVSSENSGPVRSRSDDDYDG---GGVHEENEDGCHG------KRRKKYHRHTNEQIRE 120
            VE+SSENSGP RSRS++D +G       EE EDG  G      ++RKKYHRHT +QIR 
Sbjct: 63  TVEMSSENSGPTRSRSEEDLEGEDHDDEEEEEEDGAAGNKGTNKRKRKKYHRHTTDQIRH 122

Query: 121 MEALFKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK------------------ 180
           MEALFKE+PHPDEKQRQQL+K LGL+ RQVKFWFQNRRTQIK                  
Sbjct: 123 MEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIKAIQERHENSLLKAELEKL 182

Query: 181 --------------------------------------IEKLRAALGKHRHAAVSLSYSS 240
                                                 ++KLRAALG+       L  S 
Sbjct: 183 REENKAMRESFSKANSSCPNCGGGPDDLHLENSKLKAELDKLRAALGR---TPYPLQASC 242

Query: 241 ENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNY 300
            ++QE     LD+YTG+F LEKSRI E  ++A  EL+ MA +G+ +W+ SVETGREILNY
Sbjct: 243 SDDQEHRLGSLDFYTGVFALEKSRIAEISNRATLELQKMATSGEPMWLRSVETGREILNY 302

Query: 301 DEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAA 360
           DEYLK F     S+    KT IEASR+  +VFM+  +L QSFMD  QWKE F  +ISKAA
Sbjct: 303 DEYLKEFPQAQASSFPGRKT-IEASRDAGIVFMDAHKLAQSFMDVGQWKETFACLISKAA 362

Query: 361 TVDVICNG-GIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIEN 420
           TVDVI  G G +  DGA+QLMF E+Q+LTP+V TRE+YF+R C+QL  E+WAIVDVS+ +
Sbjct: 363 TVDVIRQGEGPSRIDGAIQLMFGEMQLLTPVVPTREVYFVRSCRQLSPEKWAIVDVSV-S 422

Query: 421 VEDNNI--DVSLVKYRKRPSGCIIKDEFNGHCK-----------NKVHNLFRNLINNGGG 480
           VED+N   + SL+K RK PSGCII+D  NGH K           + V  LFR+L+N G  
Sbjct: 423 VEDSNTEKEASLLKCRKLPSGCIIEDTSNGHSKVTWVEHLDVSASTVQPLFRSLVNTGLA 482

Query: 481 FGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAAS 540
           FGA+ W+ATLQL CERL FFMATN+P KDS GV TLAGRKS LK+AQRM+ SF +AIAAS
Sbjct: 483 FGARHWVATLQLHCERLVFFMATNVPTKDSLGVTTLAGRKSVLKMAQRMTQSFYRAIAAS 542

Query: 541 SYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACR 600
           SY  WTK+  KTG+D+RV SRKN  D GEP G I+C+ SSLWLPVSP LLF F RD+A R
Sbjct: 543 SYHQWTKITTKTGQDMRVSSRKNLHDPGEPTGVIVCASSSLWLPVSPALLFDFFRDEARR 602

Query: 601 HQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYS 660
           H+WD +  G   +SIANL+KGQDRGNSV IQT+ S+E   S+W+LQDSSTNS ES+VVY+
Sbjct: 603 HEWDALSNGAHVQSIANLSKGQDRGNSVAIQTVKSRE--KSIWVLQDSSTNSYESVVVYA 662

Query: 661 GVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTA 662
            VD+   Q V+ G D S++ ILPSGFSI+PDG  SR PL+IT  +DD+  ++ GG LLT 
Sbjct: 663 PVDINTTQLVLAGHDPSNIQILPSGFSIIPDGVESR-PLVITSTQDDR--NSQGGSLLTL 722

BLAST of CmaCh14G013050 vs. TAIR 10
Match: AT1G79840.2 (HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START domain)

HSP 1 Score: 680.2 bits (1754), Expect = 2.0e-195
Identity = 403/743 (54.24%), Positives = 488/743 (65.68%), Query Frame = 0

Query: 1   MGADMSDNNNRLAFTKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGR---DDMTT 60
           M  DMS        TKDFFSSPALSL+LAGIFR   +   G    EE   GR   DD   
Sbjct: 32  MAVDMSSKQP----TKDFFSSPALSLSLAGIFRNASS---GSTNPEEDFLGRRVVDDEDR 91

Query: 61  AVEVSSENSGPVRSRSDDDYDG---GGVHEENEDGCHG------KRRKKYHRHTNEQIRE 120
            VE+SSENSGP RSRS++D +G       EE EDG  G      ++RKKYHRHT +QIR 
Sbjct: 92  TVEMSSENSGPTRSRSEEDLEGEDHDDEEEEEEDGAAGNKGTNKRKRKKYHRHTTDQIRH 151

Query: 121 MEALFKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIK------------------ 180
           MEALFKE+PHPDEKQRQQL+K LGL+ RQVKFWFQNRRTQIK                  
Sbjct: 152 MEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIKAIQERHENSLLKAELEKL 211

Query: 181 --------------------------------------IEKLRAALGKHRHAAVSLSYSS 240
                                                 ++KLRAALG+       L  S 
Sbjct: 212 REENKAMRESFSKANSSCPNCGGGPDDLHLENSKLKAELDKLRAALGR---TPYPLQASC 271

Query: 241 ENEQETNQSCLDYYTGIFRLEKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREILNY 300
            ++QE     LD+YTG+F LEKSRI E  ++A  EL+ MA +G+ +W+ SVETGREILNY
Sbjct: 272 SDDQEHRLGSLDFYTGVFALEKSRIAEISNRATLELQKMATSGEPMWLRSVETGREILNY 331

Query: 301 DEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAA 360
           DEYLK F     S+    KT IEASR+  +VFM+  +L QSFMD  QWKE F  +ISKAA
Sbjct: 332 DEYLKEFPQAQASSFPGRKT-IEASRDAGIVFMDAHKLAQSFMDVGQWKETFACLISKAA 391

Query: 361 TVDVICNG-GIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIEN 420
           TVDVI  G G +  DGA+QLMF E+Q+LTP+V TRE+YF+R C+QL  E+WAIVDVS+ +
Sbjct: 392 TVDVIRQGEGPSRIDGAIQLMFGEMQLLTPVVPTREVYFVRSCRQLSPEKWAIVDVSV-S 451

Query: 421 VEDNNI--DVSLVKYRKRPSGCIIKDEFNGHCK-----------NKVHNLFRNLINNGGG 480
           VED+N   + SL+K RK PSGCII+D  NGH K           + V  LFR+L+N G  
Sbjct: 452 VEDSNTEKEASLLKCRKLPSGCIIEDTSNGHSKVTWVEHLDVSASTVQPLFRSLVNTGLA 511

Query: 481 FGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAAS 540
           FGA+ W+ATLQL CERL FFMATN+P KDS GV TLAGRKS LK+AQRM+ SF +AIAAS
Sbjct: 512 FGARHWVATLQLHCERLVFFMATNVPTKDSLGVTTLAGRKSVLKMAQRMTQSFYRAIAAS 571

Query: 541 SYQTWTKVVGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQACR 600
           SY  WTK+  KTG+D+RV SRKN  D GEP G I+C+ SSLWLPVSP LLF F RD+A R
Sbjct: 572 SYHQWTKITTKTGQDMRVSSRKNLHDPGEPTGVIVCASSSLWLPVSPALLFDFFRDEARR 631

Query: 601 HQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVYS 660
           H+WD +  G   +SIANL+KGQDRGNSV IQT+ S+E   S+W+LQDSSTNS ES+VVY+
Sbjct: 632 HEWDALSNGAHVQSIANLSKGQDRGNSVAIQTVKSRE--KSIWVLQDSSTNSYESVVVYA 691

Query: 661 GVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTA 662
            VD+   Q V+ G D S++ ILPSGFSI+PDG  SR PL+IT  +DD+  ++ GG LLT 
Sbjct: 692 PVDINTTQLVLAGHDPSNIQILPSGFSIIPDGVESR-PLVITSTQDDR--NSQGGSLLTL 751

BLAST of CmaCh14G013050 vs. TAIR 10
Match: AT4G00730.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 402.1 bits (1032), Expect = 1.1e-111
Identity = 267/741 (36.03%), Positives = 393/741 (53.04%), Query Frame = 0

Query: 15  TKDFFSSPALSLTLAGIFRRGEAGEKGDVEMEEVDDGRDDMTTAVEVSSENSGPVRSRSD 74
           TK  ++S  LSL L    R    GE        V  G D    +V   S       SRS 
Sbjct: 56  TKSVYASSGLSLALEQPERGTNRGEASMRNNNNVGGGGDTFDGSVNRRSREE-EHESRSG 115

Query: 75  DDYDGGGVHEENEDGCHGKRRKKYHRHTNEQIREMEALFKESPHPDEKQRQQLAKLLGLS 134
            D   G   E+ +      R+K+YHRHT +QI+E+E++FKE PHPDEKQR +L+K L L 
Sbjct: 116 SDNVEGISGEDQDAADKPPRKKRYHRHTPQQIQELESMFKECPHPDEKQRLELSKRLCLE 175

Query: 135 SRQVKFWFQNRRTQIK--IEKLRAALGKHRHAAVSLSYSSENEQETNQSCLD-------- 194
           +RQVKFWFQNRRTQ+K  +E+   AL +  +  +     S  E   N  C +        
Sbjct: 176 TRQVKFWFQNRRTQMKTQLERHENALLRQENDKLRAENMSIREAMRNPICTNCGGPAMLG 235

Query: 195 -------------------------------------YYTGIFRL--------------- 254
                                                +Y     L               
Sbjct: 236 DVSLEEHHLRIENARLKDELDRVCNLTGKFLGHHHNHHYNSSLELAVGTNNNGGHFAFPP 295

Query: 255 -----------------------EKSRIMEKVHQALEELKTMAAAGDSLWVLSVETGREI 314
                                  +KS ++E    A++EL  +A + + LWV S++  R+ 
Sbjct: 296 DFGGGGGCLPPQQQQSTVINGIDQKSVLLELALTAMDELVKLAQSEEPLWVKSLDGERDE 355

Query: 315 LNYDEYLKTFHLNNDSNRSWLKTHIEASRETALVFMEPSRLVQSFMDENQWKEMFPFMIS 374
           LN DEY++TF   + +  + L T  EASR + +V +    LV++ MD N+W EMFP  ++
Sbjct: 356 LNQDEYMRTF---SSTKPTGLAT--EASRTSGMVIINSLALVETLMDSNRWTEMFPCNVA 415

Query: 375 KAATVDVICNGGIANWDGAVQLMFAEVQMLTPLVSTREMYFIRHCKQLDAERWAIVDVSI 434
           +A T DVI  G     +GA+QLM AE+Q+L+PLV  R + F+R CKQ     WA+VDVSI
Sbjct: 416 RATTTDVISGGMAGTINGALQLMNAELQVLSPLVPVRNVNFLRFCKQHAEGVWAVVDVSI 475

Query: 435 ENVEDNNIDVSLVKYRKRPSGCIIKDEFNGHCK-----------NKVHNLFRNLINNGGG 494
           + V +N+    ++  R+ PSGC+++D  NG+ K           N++H L+R L+ +G G
Sbjct: 476 DPVRENSGGAPVI--RRLPSGCVVQDVSNGYSKVTWVEHAEYDENQIHQLYRPLLRSGLG 535

Query: 495 FGAKRWMATLQLQCERLAFFMATNIPMKDSTGVATLAGRKSTLKLAQRMSSSFSQAIAAS 554
           FG++RW+ATLQ QCE LA  +++++   D+T + T  GRKS LKLAQRM+ +F   I+A 
Sbjct: 536 FGSQRWLATLQRQCECLAILISSSVTSHDNTSI-TPGGRKSMLKLAQRMTFNFCSGISAP 595

Query: 555 SYQTWTKV-VGKTGEDIRVCSRKNPSDLGEPIGAILCSVSSLWLPVSPHLLFHFLRDQAC 614
           S   W+K+ VG    D+RV +RK+  D GEP G +L + +S+WLP +P  L+ FLR++  
Sbjct: 596 SVHNWSKLTVGNVDPDVRVMTRKSVDDPGEPPGIVLSAATSVWLPAAPQRLYDFLRNERM 655

Query: 615 RHQWDVMFGGDEAKSIANLAKGQDRGNSVTIQTIGSKESSSSMWILQDSSTNSSESMVVY 659
           R +WD++  G   + +A++ KGQD+G S+ +++     + SSM ILQ++  ++S ++VVY
Sbjct: 656 RCEWDILSNGGPMQEMAHITKGQDQGVSL-LRSNAMNANQSSMLILQETCIDASGALVVY 715

BLAST of CmaCh14G013050 vs. TAIR 10
Match: AT4G04890.1 (protodermal factor 2 )

HSP 1 Score: 389.0 bits (998), Expect = 9.3e-108
Identity = 258/716 (36.03%), Positives = 386/716 (53.91%), Query Frame = 0

Query: 54  DMTTAVEVSSENSGPVRSRSDDDYD---GGGVHEENEDG-------CHGKRRKKYHRHTN 113
           DMT   + +S+N   +    +DD++   G  V  EN  G           ++K+YHRHT 
Sbjct: 14  DMTP--KSTSDNDLGITGSREDDFETKSGTEVTTENPSGEELQDPSQRPNKKKRYHRHTQ 73

Query: 114 EQIREMEALFKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIKI------------ 173
            QI+E+E+ FKE PHPD+KQR++L++ L L   QVKFWFQN+RTQ+K             
Sbjct: 74  RQIQELESFFKECPHPDDKQRKELSRDLNLEPLQVKFWFQNKRTQMKAQSERHENQILKS 133

Query: 174 --EKLRAALGKHRHAAVS--------------LSYSSENEQETN---------------- 233
             +KLRA   +++ A  +              +S+  ++ +  N                
Sbjct: 134 DNDKLRAENNRYKEALSNATCPNCGGPAAIGEMSFDEQHLRIENARLREEIDRISAIAAK 193

Query: 234 ------------------QSCLDYYTGIF------------------------RLEKSRI 293
                                LD   G F                          +K  I
Sbjct: 194 YVGKPLGSSFAPLAIHAPSRSLDLEVGNFGNQTGFVGEMYGTGDILRSVSIPSETDKPII 253

Query: 294 MEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKTFHLNNDSNRSWLKTHIEAS 353
           +E    A+EEL  MA  GD LW LS +   EILN +EY +TF          L++  EAS
Sbjct: 254 VELAVAAMEELVRMAQTGDPLW-LSTDNSVEILNEEEYFRTFPRGIGPKPLGLRS--EAS 313

Query: 354 RETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVICNGGIANWDGAVQLMFAEVQ 413
           R++A+V M    LV+  MD NQW  +F  ++S+A T++V+  G   N++GA+Q+M AE Q
Sbjct: 314 RQSAVVIMNHINLVEILMDVNQWSCVFSGIVSRALTLEVLSTGVAGNYNGALQVMTAEFQ 373

Query: 414 MLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNIDVSLVKYRKRPSGCIIKDEF 473
           + +PLV TRE YF+R+CKQ     WA+VDVS++++  +     +++ R+RPSGC+I++  
Sbjct: 374 VPSPLVPTRENYFVRYCKQHSDGSWAVVDVSLDSLRPS---TPILRTRRRPSGCLIQELP 433

Query: 474 NGHCK-----------NKVHNLFRNLINNGGGFGAKRWMATLQLQCERLAFFMATNIPMK 533
           NG+ K             VHN+++ L+ +G  FGAKRW+ATL+ QCERLA  MA+NIP  
Sbjct: 434 NGYSKVTWIEHMEVDDRSVHNMYKPLVQSGLAFGAKRWVATLERQCERLASSMASNIP-G 493

Query: 534 DSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWTKVVGKTGEDIRVCSRKNPSDLG 593
           D + + +  GRKS LKLA+RM  SF   + AS+   WT +     +D+RV +RK+  D G
Sbjct: 494 DLSVITSPEGRKSMLKLAERMVMSFCSGVGASTAHAWTTMSTTGSDDVRVMTRKSMDDPG 553

Query: 594 EPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVMFGGDEAKSIANLAKGQDRGNSV 653
            P G +L + +S W+PV+P  +F FLRD+  R +WD++  G   + +A++A G + GN V
Sbjct: 554 RPPGIVLSAATSFWIPVAPKRVFDFLRDENSRKEWDILSNGGMVQEMAHIANGHEPGNCV 613

Query: 654 TIQTIGSKESS-SSMWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFS 658
           ++  + S  SS S+M ILQ+S T++S S V+Y+ VD+  M  V++G D   + +LPSGF+
Sbjct: 614 SLLRVNSGNSSQSNMLILQESCTDASGSYVIYAPVDIVAMNVVLSGGDPDYVALLPSGFA 673

BLAST of CmaCh14G013050 vs. TAIR 10
Match: AT4G21750.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 381.3 bits (978), Expect = 1.9e-105
Identity = 255/727 (35.08%), Positives = 375/727 (51.58%), Query Frame = 0

Query: 62  SSENSGPVRSRSDDDYD---GGGVHEEN-------EDGCHGKRRKKYHRHTNEQIREMEA 121
           +SEN   +    ++D++   G  V  EN       +      ++K+YHRHT  QI+E+E+
Sbjct: 20  NSENDLGITGSHEEDFETKSGAEVTMENPLEEELQDPNQRPNKKKRYHRHTQRQIQELES 79

Query: 122 LFKESPHPDEKQRQQLAKLLGLSSRQVKFWFQNRRTQIKI--------------EKLRAA 181
            FKE PHPD+KQR++L++ L L   QVKFWFQN+RTQ+K               +KLRA 
Sbjct: 80  FFKECPHPDDKQRKELSRELSLEPLQVKFWFQNKRTQMKAQHERHENQILKSENDKLRAE 139

Query: 182 LGKHRHA----------------------------------------AVSLSYSSENEQE 241
             +++ A                                        A++  Y  +    
Sbjct: 140 NNRYKDALSNATCPNCGGPAAIGEMSFDEQHLRIENARLREEIDRISAIAAKYVGKPLMA 199

Query: 242 TNQS-------------CLDYYTGIF----------------------------RLEKSR 301
            + S              LD   G F                              +K  
Sbjct: 200 NSSSFPQLSSSHHIPSRSLDLEVGNFGNNNNSHTGFVGEMFGSSDILRSVSIPSEADKPM 259

Query: 302 IMEKVHQALEELKTMAAAGDSLWVLSVETGREILNYDEYLKTFHLNNDSNRSWLKTHIEA 361
           I+E    A+EEL  MA  GD LWV S +   EILN +EY +TF          L++  EA
Sbjct: 260 IVELAVAAMEELVRMAQTGDPLWV-SSDNSVEILNEEEYFRTFPRGIGPKPIGLRS--EA 319

Query: 362 SRETALVFMEPSRLVQSFMDENQWKEMFPFMISKAATVDVICNGGIANWDGAVQLMFAEV 421
           SRE+ +V M    L++  MD NQW  +F  ++S+A T++V+  G   N++GA+Q+M AE 
Sbjct: 320 SRESTVVIMNHINLIEILMDVNQWSSVFCGIVSRALTLEVLSTGVAGNYNGALQVMTAEF 379

Query: 422 QMLTPLVSTREMYFIRHCKQLDAERWAIVDVSIENVEDNNIDVSLVKYRKRPSGCIIKDE 481
           Q+ +PLV TRE YF+R+CKQ     WA+VDVS++++  + I     + R+RPSGC+I++ 
Sbjct: 380 QVPSPLVPTRENYFVRYCKQHSDGIWAVVDVSLDSLRPSPI----TRSRRRPSGCLIQEL 439

Query: 482 FNGHCK-----------NKVHNLFRNLINNGGGFGAKRWMATLQLQCERLAFFMATNIPM 541
            NG+ K             VHN+++ L+N G  FGAKRW+ATL  QCERLA  MA+NIP 
Sbjct: 440 QNGYSKVTWVEHIEVDDRSVHNMYKPLVNTGLAFGAKRWVATLDRQCERLASSMASNIPA 499

Query: 542 KDSTGVATLAGRKSTLKLAQRMSSSFSQAIAASSYQTWTKVVGKTGEDIRVCSRKNPSDL 601
            D + + +  GRKS LKLA+RM  SF   + AS+   WT +     +D+RV +RK+  D 
Sbjct: 500 CDLSVITSPEGRKSMLKLAERMVMSFCTGVGASTAHAWTTLSTTGSDDVRVMTRKSMDDP 559

Query: 602 GEPIGAILCSVSSLWLPVSPHLLFHFLRDQACRHQWDVMFGGDEAKSIANLAKGQDRGNS 658
           G P G +L + +S W+PV+P  +F FLRD+  R +WD++  G   + +A++A G+D GNS
Sbjct: 560 GRPPGIVLSAATSFWIPVAPKRVFDFLRDENSRSEWDILSNGGLVQEMAHIANGRDPGNS 619
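The percentages in each HSP header follow directly from the reported counts (matched positions divided by alignment length). As a minimal sketch, assuming the header layout shown in the alignments above, the two fields can be parsed and recomputed as follows. The function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool.

```python
import re

def parse_hsp_stats(header: str) -> dict:
    """Recompute identity/positive percentages from a BLAST HSP header line."""
    # "Pos\w*" tolerates spelling variants of the Positives label.
    pattern = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")
    stats = {}
    for label, matched, length in pattern.findall(header):
        key = "identity" if label == "Identity" else "positives"
        # Percentage = matched positions / alignment length, rounded to 2 places
        stats[key] = round(100 * int(matched) / int(length), 2)
    return stats

# Header values copied from the AT4G00730.1 HSP above
header = "Identity = 267/741 (36.03%), Positives = 393/741 (53.04%), Query Frame = 0"
print(parse_hsp_stats(header))  # {'identity': 36.03, 'positives': 53.04}
```

The recomputed values agree with the percentages printed in the header, which is a quick consistency check when headers are copied by hand.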

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
P46607	2.8e-194	54.24	Homeobox-leucine zipper protein GLABRA 2 OS=Arabidopsis thaliana OX=3702 GN=GL2 ...
Q5JMF3	6.2e-133	40.03	Homeobox-leucine zipper protein ROC9 OS=Oryza sativa subsp. japonica OX=39947 GN...
Q0WV12	1.5e-110	36.03	Homeobox-leucine zipper protein ANTHOCYANINLESS 2 OS=Arabidopsis thaliana OX=370...
Q6ZAR0	9.1e-108	36.66	Homeobox-leucine zipper protein ROC1 OS=Oryza sativa subsp. japonica OX=39947 GN...
A2YR02	1.5e-107	37.12	Homeobox-leucine zipper protein ROC7 OS=Oryza sativa subsp. indica OX=39946 GN=R...
Match Name	E-value	Identity	Description
A0A6J1IST9	0.0e+00	89.65	LOW QUALITY PROTEIN: homeobox-leucine zipper protein GLABRA 2-like OS=Cucurbita ...
A0A6J1GVV3	0.0e+00	84.80	homeobox-leucine zipper protein GLABRA 2-like OS=Cucurbita moschata OX=3662 GN=L...
A0A6J1KCZ9	2.7e-296	73.79	homeobox-leucine zipper protein GLABRA 2 isoform X1 OS=Cucurbita maxima OX=3661 ...
A0A6J1G8W0	1.2e-293	73.34	homeobox-leucine zipper protein GLABRA 2 isoform X2 OS=Cucurbita moschata OX=366...
A0A6J1KAL9	6.2e-293	73.39	homeobox-leucine zipper protein GLABRA 2 isoform X2 OS=Cucurbita maxima OX=3661 ...
Match Name	E-value	Identity	Description
XP_022979290.1	0.0e+00	89.65	LOW QUALITY PROTEIN: homeobox-leucine zipper protein GLABRA 2-like [Cucurbita ma...
XP_022955725.1	0.0e+00	84.80	homeobox-leucine zipper protein GLABRA 2-like [Cucurbita moschata]
XP_038904725.1	5.3e-299	75.47	homeobox-leucine zipper protein GLABRA 2-like [Benincasa hispida]
KAG6581890.1	1.5e-296	70.28	Photosystem I reaction center subunit VI, chloroplastic, partial [Cucurbita argy...
XP_022998610.1	5.5e-296	73.79	homeobox-leucine zipper protein GLABRA 2 isoform X1 [Cucurbita maxima]
Match Name	E-value	Identity	Description
AT1G79840.1	2.0e-195	54.24	HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START dom...
AT1G79840.2	2.0e-195	54.24	HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START dom...
AT4G00730.1	1.1e-111	36.03	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT4G04890.1	9.3e-108	36.03	protodermal factor 2
AT4G21750.1	1.9e-105	35.08	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
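Hits in these summary tables are typically compared by E-value, where a smaller value means a more significant match. The following is a small sketch that ranks the TAIR 10 hits listed above; the helper `strong_hits` and the cutoff of 1e-150 are illustrative assumptions, not values used by this database.

```python
# (accession, E-value, percent identity) copied from the TAIR 10 hits above
tair_hits = [
    ("AT1G79840.1", 2.0e-195, 54.24),
    ("AT1G79840.2", 2.0e-195, 54.24),
    ("AT4G00730.1", 1.1e-111, 36.03),
    ("AT4G04890.1", 9.3e-108, 36.03),
    ("AT4G21750.1", 1.9e-105, 35.08),
]

def strong_hits(hits, max_evalue):
    """Return hits at or below max_evalue, most significant first."""
    kept = [h for h in hits if h[1] <= max_evalue]
    # Sort by ascending E-value, breaking ties by descending identity
    return sorted(kept, key=lambda h: (h[1], -h[2]))

for accession, evalue, identity in strong_hits(tair_hits, max_evalue=1e-150):
    print(f"{accession}\t{evalue:.1e}\t{identity:.2f}")
```

With this hypothetical cutoff, only the two AT1G79840 (GLABRA 2) entries remain, matching the best Swiss-Prot hit for GL2.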
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001356	Homeobox domain	SMART	SM00389	HOX_1	coord: 94..156; e-value: 2.4E-18; score: 76.9
IPR001356	Homeobox domain	PFAM	PF00046	Homeodomain	coord: 95..150; e-value: 1.0E-18; score: 66.9
IPR001356	Homeobox domain	PROSITE	PS50071	HOMEOBOX_2	coord: 92..152; score: 18.479261
IPR001356	Homeobox domain	CDD	cd00086	homeodomain	coord: 95..153; e-value: 7.60033E-20; score: 81.906
IPR002913	START domain	SMART	SM00234	START_1	coord: 199..414; e-value: 8.8E-44; score: 161.4
IPR002913	START domain	PFAM	PF01852	START	coord: 203..414; e-value: 6.0E-39; score: 133.6
IPR002913	START domain	PROSITE	PS50848	START	coord: 190..417; score: 26.441162
None	No IPR available	GENE3D	1.20.5.220	-	coord: 784..806; e-value: 2.4E-9; score: 38.7
None	No IPR available	GENE3D	1.10.10.60	-	coord: 82..164; e-value: 8.8E-21; score: 75.4
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 36..100
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 69..92
None	No IPR available	PANTHER	PTHR45654:SF24	HOMEOBOX-LEUCINE ZIPPER PROTEIN GLABRA 2	coord: 151..658; coord: 17..150
None	No IPR available	CDD	cd08875	START_ArGLABRA2_like	coord: 194..413; e-value: 1.02003E-90; score: 283.393
None	No IPR available	SUPERFAMILY	55961	Bet v1-like	coord: 193..415
None	No IPR available	SUPERFAMILY	55961	Bet v1-like	coord: 458..642
IPR004928	Photosystem I PsaH, reaction centre subunit VI	PFAM	PF03244	PSI_PsaH	coord: 693..830; e-value: 1.0E-68; score: 229.2
IPR042160	Homeobox-leucine zipper protein GLABRA2/ANL2/PDF2/ATML1-like	PANTHER	PTHR45654	HOMEOBOX-LEUCINE ZIPPER PROTEIN MERISTEM L1	coord: 151..658; coord: 17..150
IPR017970	Homeobox, conserved site	PROSITE	PS00027	HOMEOBOX_1	coord: 127..150
IPR009057	Homeobox-like domain superfamily	SUPERFAMILY	46689	Homeodomain-like	coord: 83..150
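
The `coord: start..end` entries give the span of each match on the protein. As a minimal sketch, assuming the coordinates are 1-based and inclusive residue positions (an assumption not stated on this page), the length of each match can be derived as below; `span_length` is a hypothetical helper name.

```python
import re

def span_length(coord: str) -> int:
    """Length in residues of an inclusive 'coord: start..end' range."""
    match = re.search(r"(\d+)\.\.(\d+)", coord)
    if match is None:
        raise ValueError(f"no start..end range in {coord!r}")
    start, end = map(int, match.groups())
    # Inclusive range, so both endpoints count (assumed convention)
    return end - start + 1

# Coordinates copied from the PFAM rows of the InterPro annotations above
pfam_hits = {
    "PF00046 Homeodomain": "coord: 95..150",
    "PF01852 START": "coord: 203..414",
    "PF03244 PSI_PsaH": "coord: 693..830",
}
for name, coord in pfam_hits.items():
    print(name, span_length(coord))
```

Under that assumption the three PFAM matches cover 56, 212 and 138 residues respectively.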

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh14G013050.1	CmaCh14G013050.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0015979	photosynthesis
biological_process	GO:0006357	regulation of transcription by RNA polymerase II
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005634	nucleus
cellular_component	GO:0009538	photosystem I reaction center
cellular_component	GO:0009522	photosystem I
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0000981	DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function	GO:0008289	lipid binding