Cp4.1LG08g04800 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG08g04800
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: protein ALWAYS EARLY 3-like isoform X1
Location: Cp4.1LG08: 1324815 .. 1337416 (-)
RNA-Seq Expression: Cp4.1LG08g04800
Synteny: Cp4.1LG08g04800
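
The Location field above packs the sequence ID, start, end, and strand into one string. A minimal sketch of splitting it apart follows. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

# Location string copied from the Overview above.
seqid, start, end, strand = parse_location("Cp4.1LG08: 1324815 .. 1337416 (-)")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, strand, span)
```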
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptide, exon, CDS, three_prime_UTR
AAGAGGAAATTTGCTGACTTGTTAGGGCCTCAATGGAGCAGAGATGAGGTTGAGCAGTTCTATGAAGCATATCGTAAATATGGAAAAGATTGGAAGAAGGTTAGTATCAATTTATGGAATGGTCATGACAATGTTCTAGTGCTGTGAGCCAAGTGAATCTCATTGCCACAGAAAAATTGAGCCGTGAAACTTTAAGTATCTTGTTTCAAAATCATGCAATACTTTCTTCAAATTTTTTTCAGAAAACAAGCTTATACTTGTAATTTTCTTGTAGATGTCAATTAAATTTGTTACTGTAGTTGCACCTAATTCTTCAGTCTTAAACAGATACTGGTACTGTATTTGGGTTGCAACATTTTTGTCTGGTACTTTTGTGCAGGTAGCTGCTGCAGTAAGGAACCGTTCTACCGAAATGGTCGAGGCTCTTTTCACCATGAATAGGGTAAATCCCTTTTTCCTAATGATCTCAGCCCTCCCACTACTTATCTTAAATATTTGTATCGCCATTAATCTTTCTATACAAATGACTTTTCTGATTAAGTTTCNCCTAATGATCTCAGCCCTCCCACTACTTATCTTAAATATTTGTATTGCCATTAATCTTTCTATACAAATGACTTTTCTGATTAAGTTTCCAGTAAACTTGTCTGTTGTATGTCGTATTCCTAGTATCGGGGACCTATACTGTGTATTTGTTGCAAACAACCCCATTTTCTTCCCTATACTTATTATCTGATCATAATGTCTCTGCTGCATGTTTTATGTGTTGCTTGTGTTCTCTGTGATCTCTATATCGTGTACTAAAGGTCATGAAAATTTTGAGTTGTCATTTAATATTCAGTTTTCTTTTATCTGATCAAAAAACCTTTTTAGTCCGTGGCCACTTTGATTGTTGGCGCTCTTTTGGAGTTTCACTTTCACGATTTAAATTGGTACTGTGATTTATCTTGTTACCTTTTCCTGCATCGTTATATTTTTTGGAATTCTGGTCGTGGCTTGTAAAATTATTGGAAAACATATATACTTAATATTTGGAAATGTCAACACTGTGACACCATCTTGTTATTGTGGTTTTTCTGAAATGCGTATGCACATTCTGTTGATTTCAGGCCTATTTATCACTTCCAGAGGGCACTGCTTCAGTTGTTGGGTTAATTGCAATGATGACTGATCATTACAGCGTACTGGTAAGATTTTTTCTCTATCTGTGTGTTATTTTCTCCCTTCTTTCTTTTCTTTACCCCTCACCCCCTCTTGTTTTGAAATAGAACTGTGTATATACTCAATTTAGATATTTTCTTTCATCAATATAAGTTTTTGTCTCCCCAGAGTTTATTTTTGTCAATATCAAGCTAATTTACCCTGGTTCATCGTCCTTTGATCTAGAAGGGATGCAAGTTGCGGGTGAATTTTTTAGTTTTTAGTTTGTTTAAAGATAAGATATTGTATGCGTTGAAAAAAGGTGTACAAAGCTTTGAAGATTATATAAATGCTCACTACGATAAGCAAACAATGGACCATATAAGGATGCTCTTCAAATAGGATCTTTATGATATAAAATCATCGGCAATGATTGGTCTGCACTTCTAGATTCTTCCTCCTTTAATGGGCACACGAAGCTCTCATGACCCTACAGATGAACTGAAAGTAATTCTTCTTTTAGCCCGTCATAATAATCAACCAAAATGCATAAGGAAGTCTGAAAGTGGAAGGGACATGAATAGTGCTTTAACTTTAGAAGGGCTTTTGCCTTTTATTGTTTGGAGGATATTTTGGGTGTTTCCCTCCTTCATACGCTTGAAGGCTTCTTTCACAGCCTCAATCATGGCAACAGAGTATTGACGTTTGGAAATTGTGGAAAGTTTTCTTGCCAGGCCCAGTTTCATTATATCTTTTGACAATATGAAGTCCATCCCAAAATTATTTTGATAAAAAATTGTGGAAACTCAAAGTGAAAAGGGAAGTAAAGTTCTTCATGTGGACTCTCCATGCTAGAATCA
ACACTTTGATGAATTGATGAAATTTAGTGAGAGAACATTAGCATCTCCCTCTGCCCTCCCTCTACTGGTTTGTGCTGCAAAATCACAATGAAAGTCAGAATTTTATCACCCTTAAGTACTTTGTGCTCCATTGCTGTATTGGGCTCTGAAAGTATTTATTAATCTCTTCTAAGAGCTGAAAACATTGAAGCATCTTTCCATTTGAGGGTTATGTATGAAACGTAGAAATAGGAATGTAGATGTAGCATATATTTCTTCTGCAATATTTTGTTCAATGTATTTTGTTTCTATATTAGCATATTGGATTTGCATGGGAGTTACAGAGTATTGTAAAGCTATTGTTGAGTTCATATGGCTCTTCTGAGGTCTAAATTGATAGATGCGTTGTCAGAGAGACAGTGAAAGTGAGCAAGAAAGTAATGAGGATTCAGGAGCAACAAGAAAGCCTCAAAAGCGACTGCGTGGAAAGTCACGAAATAACAACTCAAAGGGATTGGATGCACATTTTGGAGATGCTTCACAATCACAGTCACTTCCAACAAACTATGGCTGCCTGTCATTGTTGAAGAAAAGGCGTTCTGGTAGGAAATGTCTCAAATTTCTTTCTTAGGTGAATAAGGTTTTCTGTACTTGTACTNTTGAGCTAGCAATGTCTCTTTTCGTTACATTGTCTGTTTTTTAACTTTTAAAATCATTTACGGAAAATTGCTGTCACGTTTTCGGATTTCATTTATTGGTGTTAATGTGGATGGTGTTAGGAATTAAACCTCATGCTGTTGGAAAACGGACTCCGCGCGTCCCTGTATCATATTCATACGATAAAGATAGTAGAGAGAGGATTTTTTCACCTTCCAGGCATACTAGTAAGCTAAAGGTTGATGATCCAAATGATGATGATGTTGCTCATGAGATAGCTTTAGTTTTAACAGAGGCTTCGCAACGCGATGGTTCTCCTCAACTTTCACAAACACCAAATCCGAAAATAGAAGGTCATGTACTTTCACCTATCCGAAATGACAGGATGGTAATTATTCATAGCTTTTAGAAACACTTTACCACTATTTTGAACATGTAAAGGGATTGACTTCAAGTAATTTTGATATTCAGCGAAGTGAATCAGACATGATGAGTACAAAGTTTCGATGTAGTGAAATGGATGAAGGTGGTTGTGAATTAANTCAAGTAATTTTGATATTCAGCGAAGTGAATCAGACATGATGAGTACGAAGTTTCGATGTAGTGAAATGGATGAAGGTGGTTGTGAATTAAGCTTAGGAAGCACTGGAGCTGATAATGCAGACTATGATCAGGGGAAAAATACTCGTGAGATTCAAAGAAAGGGTAAAAGGTACTACGGAAAGAAGGCAGAAGTTGAAGAAAGTATGTATAACCACTTGGATGACATCAAGGAAGCTTGTAGTGGGACTGAAGAGGGGCAAAAGTCAGGCAGTTTGAGGGGAAAACTTGAAACTGAGGATTTTGATGTAAAATCTGTGAGAACCTCTTTTAAAGGCCCAAGGAAGAGAAGTAAGAAAGCTCTATTTGGAGGTAAGCTGAAAAATAAATTTGAATTGGGTTGTCTACCCTTCTTTTGTATTTTGTATTTATCTGTTCTATTAATTTTTTAAGGACAGATTATTAGCTTAGTTTTCCTTCTTGACTAAAGTTTTGTCTATTTTTCTTCCTTTGTTAATAAAATTTCAATGGTGCATATTTTCTTATGTTTGGGATGGTTTTGTGTCTTGTGAAATGGACGAGAGTGTGAAACGGACTAGAGATTATAACTGGATTTTGAATTCATTGTTCAAGTTTTAAAGTAGTTTTTGTTATGGTTTGGTAGTCAGTAAATTCCTGTATTCATATGTTTGATTAGAGATGGCAAGTTTTGGAAGCTTTAACAAGTTATGGATCAGTTGTGGCATTAGGTCTTTTAGATTCGGAGTACAATTTATAATCACTGTACTTATCACATATCACTAGAAGTGGCCATTTCTTTCTTT
TGGATGCTGCCTCTATATTATAATCAATCAATCAATTATTGTTTAAAGTTTGGTAAAAACTTGAAGGGTGGAAAAGTAATGTGTTGTGGTTCTTCTATTTTTAATTTGAAAATGCGAATTTGTATGTATGTATTTTAAGATGTTGCCTCTAGCCACTATTTCTATCATTAGATATTCAAATGCATGCGTGATTCTTTTTTTTTTCTTTTTTTTCTTTTTTCTTTTTGATGTTTTCATAACCGAATAAAGGTCATGTTCTTATCTTTAATTCTTTTGGTTGTTCTCTGTTCAGATGAATGCTCGGCATTTGATGCACTGCAAACTTTAGCTGATTTGTCTTTGATGATGCCAGACACTACTGCTGACACTGGTAAAAATTATTCTTCAATNGGTTCTTCTATTTTTAATTTGAAAATGTGAATTTGTATGTATGTATTTTTAGATGTTGCCTCTAGCCACTATTTCTATCATTAGATATTCAAANGAGTACAATTTATAATCACTGTACTTATCACATATCACTAGAAGTGGCCATTTCTTTCTTTTGGATGCTGTCTCTATATTATAATCAATCAATCAATCAATTATTGTTTAAAGTTTGGTAAAAACTTGAAGGGTGGAAAAGTAATGTGTTGTAGTTCTTCTATTTTTAATTTGAAAATGCGAATTTGTATGTATGTATTTTAAGATGTTGCCTCTAGCCACTATTTCTATCATTAGATATTCAAATGCATGCGTGATTCTTTTTTTTTTCTTTTTTCTTTTTGATGTTCTCATAACCGAATAAAGGTCATGTTCTTATCTTTAATTCTTTTGGTTGTTCTCTGTTCAGATGAATGCTCGGCATTTGATGCACTGCAAACTTTAGCTGATTTGTCTTTGATGATGCCAGACACTACTGCTGACACTGGTAAAAATTATTCTTCAATTAAAATTTATTTTTGAGGCTTGTCTCATGAAAGCTACTTATACATTGTGTTTTGGTTCGTTTTTTTTTTTTAATTCTTTTGGTTGCCAACTATTCTTCATCGCTTCATGCATCTCTAGGCTGTTTCCTCTTTCTTCATAATACTATTGGGGAAAGCTTACAGAAAGGTGGCGTATGAACTGATTGTTTCAAAATATGGAGTAAATGTGATATTGGTTGCCATAAAAATTCAGTAAATAGAATGCCAACTAGAAAAAAAGAAGAAAAAAAGTTAAGTCAGTATAAAATGCCAGCAATTGCTAGAGGTGAAGTTGTAATCCTGATTTAACTGAAGCTGGGGCTAGCCATAAGTGCGTAGGATGTTGAGTAAATGGAAAATTGTAGTTTGATTGACACTCAAGTTACTCAACTTGCATTCATCTAATTTTTATTCCAGCAAAGAAGTTTCCAACATATGGGCATGTTGCAATCTAATATCTAGGATGGCTGCACACATTTGATTTAATGAGCTAATTTTGGTCATATCCTCTATGTTCAAACTCTCCCAGCTTGTTACTAAATCAATGAACTCTTGCAATTCAATTGGATATGCTTGAATTTATTAGATGAAATCAAATTTGGTGGGTTTGAAATAAATTCTCTGTTTTGCGACTATCATTATCAATTCCTCCCCCCTCCCCTTCCTTCCTCAACCATTAGAGTGATGATGAACATGATCATTTTGTTTCTCTATATTAGAGCCTTCTGCAAAGGTTAAAGAAGAAAATCTTGATGTCATGGACAAGTCTAAAATGAAAGGGAATCATTCAGTTGTGGGAGCTGGAATCTCTGCTTCCAAGACATCTAAGACGGGAAAGGCTTCAGGCAACAATGTTGGTCCTATTCCTGAAGCAGAAGGAATTCAAGGATCCAATAATGGAAATCGTAAAAGGAAACAGAAGTCCTCTCCATTTAAAGTAAGGATGGATTGGAATCATAATTTTGCTGGTTTTAAATTGCTGCTGTCTTTTTTTTTATCATTTGTCTGTTTTGGAATGCATTTTGACAATATGGAAATTTATGGTCACAGATT
TCATCTAAAGATGAAGATAGCAATGATTCTCGTGTCAATGACACTCCAAAAACCAAGGTGTGGGTCTTATTCGATCCTTGTTTTAGAAGAGTCCTTCAATTCATCTTAATTCTTTATTTTTTAATATGTTATATAGCTAAGANTAAGGTGTGGGTCTTATTTGATCCTTGTTTTAGAAGAGTCCTTCAGTTCATCTTAATTCTTTATTTTTTAATATGTTATATAGCTAAGAATTTAAAATGAACTCACATTTTGACTTAATGTATTTTGTTATCCTATTATTGTTCTTTTTTCTTTTTCAAACAAGAATCAAACTTTCTATTGATGGATGATAAGATAGAAAAATGTTCAGGGATTCAAACACTTGTATGGAGTGAAAAGAAAACCAAACAACCAAAAAACTTTTCATCTATAAGAAGCTTCCACTGTGACCTGATGGCAATATTCCAAAGAACATTAAGCTTGCCATTGAGTTGAGAGCCAAAAAACTTCTATTGATTTTTTTCTTATAATAAACTAACAGTGCATATATATTTTTTTTGATAATAAACATTTTTTCATGAAAAGAGAAGAAAGCATACATAAGTAAGGTGAGAAGATATCANCTTGAGTTGAGAGCCAAAAAACTTCTATTGATTTTTTTTCTTATAATAAACTAATAGGAAAAGAGAAGAAAGCATACATAAGTAAGGTGAGAAGATATCAGCCCAACTAAAACCGATTTGATTACGAAAAGTTCTTCCATATGGCGATAATTAAATAATTGTCGTAATTTAGAGAAAAGTGCCTAGGTGGGACTCTATACGGAGCCATATCCTTGCCATCCAAATTCAAAAGCATAACTTTTAATAGTATAGGTCCATAGTGNGCATAACTTTTAACAGCATAGGTCCATAGTGGTCAGATTTTAAATCTTGCTTAGAGTTTCNACATAACTTTTAATAGTATAGGTCCATAGTGGTCAGATTTTAAATCTTGCTTAGAGTTTCCTCAAGGGTAAACTTAGATAATCGAAAGGCTGGGCCTCAGCTAAGCATTAGAATGCCTGTGCTTTAAAAACCATTTCCTATGGGAGACAAAACAAAGTACCAACCTTCTGCTCTGAACTTGCAACGGAGGAACAAATGAGCTAGTGCTACATTTTTTTTTTTGAAGGAAAAGGCAGGAAACAGAGGGAGATGATGCCTTAATTGTAAATCTCCACTGAAGCCTATCTTGGTATTGATACTTCTGTGAGCAATAGCGCCTAGGAAAAAATTAATTGCCTTAGGAACTCTACCTATCCTGACTTCCTTCCTGTTAATACTTATCTCTAACAGTTGTGCCTTTGAAGGTTTACATGGTGAACAACAACAATTAACCAGTACTTTCATTTCTCCTTCGTGTTAACTTCATAGATTTTGGTTTTCCATTGCCCTGTATGTTTCTAAGAAAGGCTTTATGCTCGTGATTTTGTTTACAATTGGCTCTTGCTCCAGAAAACTTCGTTTCCCCCCCTAATTTCATACCTTCTGTTTTTGTTTTAGGCTACAGATGATGGAAAGAGTTCTTTTGGTAAAGTTAAACGGTCCCCTCATAATGCTGGACTGGCAAAATCTAGCAAAATATCAAAACCTCTAGATCATCATTCATCTTCTAGTACTGACCATAAAAGAGAAGATGGTGACTATGCTTTATCCACAACACAAGTTCCATCAATTAACCCAATCAGCTTGCCTACCAAAATGAGGAGCAGGCGGAAGATGGACCTGCTGAAGTCGCAAAGAGATTCAAAGATTGCTGATAACATTTTGATTGATCAACTTAATGTAACTGCTCATTCATTAGATGATAGACCACATGATCTCAAGGTTACTATTTCTGGTCATGACATAAATTTTCTTGGATACTGGATCCTGAATTGTTTACTGTATGTTTGTAATTGCAGGAGCAGCATTCTAATTGCCTATCTTGGCATAAATTGCGTAGATGGTGTGTTTTTGAGTGGCTCTACAGTGCAA
TTGATTTCCCATGGTTTGCAAAATGTGAGTTTGTAGAGTACTTAAATCACGTTGGATTAGGTCACATTCCTAGATTGACACGTGTTGAATGGGGTGTTATTCGAAGGTATTTTTTTTCAACTTGTCCATTTGTATCCTGTTGACTCATTTCGAGACACTATCTATAGAAGACCAATATTGTAGTTTTGTCTTGATTCAGTTCCCTTGGGAGACCGCGGAGATTCTCTGCACAATTCTTGAAGGAAGAAAAACAGAAGCTAAATCAATATAGGGAATCTGTTAGGAAACACTATGCTGAACTTCGTGCAGGCACAAGGGAAGGACTTCCAACCGATTTAGCTCGGCCATTATCTGTTGGACAGCGTGTTATTGCCATTCATCCAAAAACAAGAGAGATCCACGATGGAAGTGTACTTACTGTTGACTATAGCAGGTGTCGTGTTCAGTTTGACCGACCTGAACTTGGGGTTGAATTTGTCATGGTAATTATGTTTTCATGATTTCTTTATTTTTATGGTTTGTTCATTTGATAGAATTGAGATTGCCGTAGATCCTTACAGTTTCATGTGGGCGATCTCATCCCCAAGTATTAAGGATTCCCAAATAAGACAACGGTTTAAACTACATGTCAGAAACAAGAGACTGTTTTCTGACAGGTTTCTTAATTTTAGATGATGATCATAGGCACAACACTTTGCTTAACGTCGGTATTTTCCACAGAAAAAGAAGCTCACTTATGGATTGTTTTATTTTTAGTTATGTATGTACGATTTATATTTGAGTGGCATGGTTTGACTCCTCCATCACAGGATATCGAGTGTATGCCTTTAAATCCAGTTGAAAACATGCCTGCAAATTTGTCAAGACATGGCGTGACTCTCGATAAGATATTTGGTAATCTCAATGAGGTTAAGATTAATGGCTTACTGAAGGAAGCGAAGATTGAAGACTATATAAAATCAACCAGTAACGATAAACTTGAAAGCACCGATGGTTCTGTGTTTATTTCCCCTTCCACTCATCATATCAACAAATTAATTAAACAGGCAAAGGTTAGTGTTCTGCTACTTTTTTTTTTTGAATTAGATGATATGCATTAGTAGCTTATTTGAGCATATTTCTCCACGTCATCACATTGGCATGCTACCATGTAGCTAAGGTCATATTATTATCAATATAGAATTATAGATTACGTAATTGGATTGTATTCTTCTACATTTCAGACATGCATAATTGGATTCCAAAATCTGCATTCTTTCAACTGAAAACAATGAATGTCACTATGATAGACCTCTAACAAACACAGGTAATTCAGCCTTATCTAGAGCTTTTAGAGGAAGTCAATAGAATCAAGTATTCTTACAGACGATATCCAATAACCAATGCTCAAGTCTCCTCCCATCTGGCAGCTTTGTCCTTCAAAAACTGTAATTCCTTTTCGACCAATCCTTCCAAAAAGAGCCTTAATTGCACTCATTTAAAAAACCCTAGCTTTATTTTACCTTTTGTTTTTCTAAATACTTTTTGAATACTTCTTATTCCTCTATAGGTGTTCTGTCATTCTTCTGTCTTATACACTTAGTTTCATAGAATAACAAAGACTCGTCTTAATTCTGGGGTTTTGAGATAACTGAGTTTGGCTTGAAATGCTTACTCTATAGTTAGGCCTACACATTATTTTTGACCTGCCGGTAAGTGATAAAGGAATTGTTCGTGGCCAGGTCGACCTTGGGTGCTCTAATCTACAAACTAAATTTGGGCTTAATGAGACTGTCGGTATCCAACAGGAGGCAAGTTCCCAACTTTCTGTCCTTGCTCAAATTCAAGCTAAAGAAGCTGATGTTCATGCTCTGTCTGAATTGTCACGTGCACTTGACAAGAAGGTAGTTCATCATATTAGCTTCATATTATCAACTTTATGTCAAGATTTCAATTATAAGAAAACAAGTGTTATATTGCAGTATCAAAAAGAAATTACCATTTTTGAGGATTCTCACTAT
CTCAAGTCCCACAATGGACCAACTCCCATTCCTCCAATCTTCCTAACTCCCTGCATTTATAATCCAACAAAACCCTCCTAAGTAATTACCATTATATCTTTAATAGTATCCTAATGACATTCCCTATACATTTTCATTAATACTTGTAACATTTTATTTGTCCTACATTTTCATTAATACTTGTAACATTTTATTTGTCCTACGCTGGGAGAGACTAGTTTGAAAGGTAGAAATCTATTTTGCTTTAACCTTTACATCAAATTGAATGTCTTTTAACCTGACTGATGCGTGCCTAAATCTAGGAGGTGGTGGTGTCTGAATTGAAGCGCTTGAATGATGAGGTGTTGGAAAACCAAATAAATGGAGACAACTTGCTCAAGGATTCAGAGAACTTTAAGAAGCAATATGCTGCTGTGCTATTACAGTTGAATGAAGTTAATGAACAGGCATGCGTTTTAATGATCACATTTCCAGCTTAATCCCATTTTTTTTTATTCAAGTGCTTATTTTTCTTAGTATATATAGGTCTCCTCTGCTCTGTATAGCTTGAGGCAGCGCAATACGTATCAAGGGACTTCACCATTAATGTTCCTCAAGCCAGTGCATGATTTGGGTGACCCTTGCTCTCATGCTCAAGAACCCGGTTCCCATGTGGCTGAAATTGTGGGAAGTTCCAGAGCAAAGGCTCAAACAATGATCGATGAAGCAATGCAGGTTTAACCTTTACTATTATCTAACCATTTTGATCATCAGTTGGTCGGTGTTTAGGTATTATGAGTAGTCTATCTTCCGCTGGATGTTCATGTAAATAATGGACATACTTGTCTAGTCAATTGTTAACTAATATGAATTTTCACAAAACTATTAATTACTAAATTACTTCGTAGGACTAATTTTTTCTTATAATTTCAACTAATTGACTGCATAAATCAAATTTCAACTAGAACAATCTGTAGAATAAAATACACAATTTCATTTCCCAAACATTAATTCTCTTGCATTTCCATATATGTATCTTTAAGATTGTGTACCTTTTAAATCCCATATTTTAAACCCTCAGATATTTAATATCTATATCCATTGAACAAAATTCACTATCCAATCAAACCATTAGAGAGACGTAATGTACTTCATGATATTTAGATTTGCTTAAAAGTAACTGTGATTATCATGCATGTACTTCAAATCTTTACACTGTAAAATTTCATCAAGTTATTGAATCATCCAAGTCCTAAATGCTTCCTTCAGAGTCCATGTTGGAGATCTCTTTTCCTGTAGATTTTGTGCTTCCTCTGACTTCATTTGCTGTTAACAATCATTTCAGGCAATTCTTGCTCTGAAAAAAAGAGAAAGTAATTTGGAGAACATTGAAGAAGCCATTGATTTTGTGAGTAATAAACTCTCAGTAGATGATTTGGCCTTGCCAACTGTGAAATCAACATCTGCAGACACTAGTAATGCCACTCAAGTATCTCAGAATCATTTCAATGCGGGTGCATCAAACCCATCANTACTAGTAATGCCACTCCAGTATCTCAGAATCATTTCAATGCGGGTGCATCAAACCCATCAGCTGCTAATTATGTAGTTGGTTCCAAGTCCAATAGCCCATCCGACAAGCCTGAAGTGGAGATCCCTTCAGAACTTATTGCGCACTGTGTAGCCACTTTACTCATGATTCAGGTAAAGGGAGCTTCATTCGAGTCTTCCTCTCAATTATGTATTATTTGTCCGGAATGATTTTGAATCAATGATCTATGACCCCCAATTTGTTTCTGATTTTCTGTTTTGCATAAATAAGAAGATTTGGTTGTTTATTCATCACAAACTGATTCATACAGTTCAAGGGGATATTAGAAGAGGGACTTCTTACACTGTAACCGTTGACAAATTGGTGTTGCAATTTTTTCTTGTTTTGTGGAAAGGCAGGGAAATCGTCTCATTAGACTTGTCGAGTATTATTGTGATCTTGCATATTGTTGGTTGCGAATCATGTACAT
AATAGTCAAGTACTTTTAGCATGCTTGGTTCTCGTTTCTCTTGCGTATGAGGATAATAGTTTGATTGTTGCTATAAACATGCANTTGATTTCCAAATCCTGATTGTTGCTATAAACATGCAGAAATGCACAGAACGACAGTTTCCGCCAGCTGATGTTGCTCAGGTACTGGATTCTGCTGTCAATAGTTTGCAGCCATGTTGTCCTCAGAACCTTCCACTATATGCAGAAATACAGAAATGCATGGGAATTATAAGGAGCCAGATACTTGCGCTTATACCTACATAGGTTCAAATCCGCTTCCCATGTGTTAAGACAAATTGAAAAGGTGAGTAAATACGTCTTAGATATAATATTTTGGCCTCACTTTTTGAATTATAAATGTGTATCTCCTTGGCTATCAGTTTCTTTGTCTGACTGTACCATAGACTGGAATTATGTATAAATCCTTGTAGTTTGAGCAAATTGTAGTTTCATTGAATGAAAATCAATCTTCTACCTNCTTTGTCTGACTGTACCATAGACTGGAATTATGTATAAATCCTTGTAAAACGAGCAAATTGTAGTTTCATTGAATGAAAATCAATCTTCTACCTGCTTCTG

mRNA sequence

AAGAGGAAATTTGCTGACTTGTTAGGGCCTCAATGGAGCAGAGATGAGGTTGAGCAGTTCTATGAAGCATATCGTAAATATGGAAAAGATTGGAAGAAGGTAGCTGCTGCAGTAAGGAACCGTTCTACCGAAATGGTCGAGGCTCTTTTCACCATGAATAGGGCCTATTTATCACTTCCAGAGGGCACTGCTTCAGTTGTTGGGTTAATTGCAATGATGACTGATCATTACAGCGTACTGAGAGACAGTGAAAGTGAGCAAGAAAGTAATGAGGATTCAGGAGCAACAAGAAAGCCTCAAAAGCGACTGCGTGGAAAGTCACGAAATAACAACTCAAAGGGATTGGATGCACATTTTGGAGATGCTTCACAATCACAGTCACTTCCAACAAACTATGGCTGCCTGTCATTGTTGAAGAAAAGGCGTTCTGGAATTAAACCTCATGCTGTTGGAAAACGGACTCCGCGCGTCCCTGTATCATATTCATACGATAAAGATAGTAGAGAGAGGATTTTTTCACCTTCCAGGCATACTAGTAAGCTAAAGGTTGATGATCCAAATGATGATGATGTTGCTCATGAGATAGCTTTAGTTTTAACAGAGGCTTCGCAACGCGATGGTTCTCCTCAACTTTCACAAACACCAAATCCGAAAATAGAAGGTCATGTACTTTCACCTATCCGAAATGACAGGATGCGAAGTGAATCAGACATGATGAGTACGAAGTTTCGATGTAGTGAAATGGATGAAGGTGGTTGTGAATTAAGCTTAGGAAGCACTGGAGCTGATAATGCAGACTATGATCAGGGGAAAAATACTCGTGAGATTCAAAGAAAGGGTAAAAGGTACTACGGAAAGAAGGCAGAAGTTGAAGAAAGTATGTATAACCACTTGGATGACATCAAGGAAGCTTGTAGTGGGACTGAAGAGGGGCAAAAGTCAGGCAGTTTGAGGGGAAAACTTGAAACTGAGGATTTTGATGTAAAATCTGTGAGAACCTCTTTTAAAGGCCCAAGGAAGAGAAGTAAGAAAGCTCTATTTGGAGATGAATGCTCGGCATTTGATGCACTGCAAACTTTAGCTGATTTGTCTTTGATGATGCCAGACACTACTGCTGACACTGATGAATGCTCGGCATTTGATGCACTGCAAACTTTAGCTGATTTGTCTTTGATGATGCCAGACACTACTGCTGACACTGAGCCTTCTGCAAAGGTTAAAGAAGAAAATCTTGATGTCATGGACAAGTCTAAAATGAAAGGGAATCATTCAGTTGTGGGAGCTGGAATCTCTGCTTCCAAGACATCTAAGACGGGAAAGGCTTCAGGCAACAATGTTGGTCCTATTCCTGAAGCAGAAGGAATTCAAGGATCCAATAATGGAAATCGTAAAAGGAAACAGAAGTCCTCTCCATTTAAAATTTCATCTAAAGATGAAGATAGCAATGATTCTCGTGTCAATGACACTCCAAAAACCAAGGAGGTGGTGGTGTCTGAATTGAAGCGCTTGAATGATGAGGTGTTGGAAAACCAAATAAATGGAGACAACTTGCTCAAGGATTCAGAGAACTTTAAGAAGCAATATGCTGCTGTGCTATTACAGTTGAATGAAGTTAATGAACAGGCATGCGTCTCCTCTGCTCTGTATAGCTTGAGGCAGCGCAATACGTATCAAGGGACTTCACCATTAATGTTCCTCAAGCCAGTGCATGATTTGGGTGACCCTTGCTCTCATGCTCAAGAACCCGGTTCCCATGTGGCTGAAATTGTGGGAAGTTCCAGAGCAAAGGCTCAAACAATGATCGATGAAGCAATGCAGGCAATTCTTGCTCTGAAAAAAAGAGAAAGTAATTTGGAGAACATTGAAGAAGCCATTGATTTTGTGAGTAATAAACTCTCAGTAGATGATTTGGCCTTGCCAACTGTGAAATCAACATCTGCAGACACTAGTAATGCCACTCAAGTATCTCAGAATCATTTCAATGCGGTATCTCAGAATCA
TTTCAATGCGGGTGCATCAAACCCATCAGCTGCTAATTATGTAGTTGGTTCCAAGTCCAATAGCCCATCCGACAAGCCTGAAGTGGAGATCCCTTCAGAACTTATTGCGCACTGTGTAGCCACTTTACTCATGATTCAGAAATGCACAGAACGACAGTTTCCGCCAGCTGATGTTGCTCAGGTACTGGATTCTGCTGTCAATAGTTTGCAGCCATGTTGTCCTCAGAACCTTCCACTATATGCAGAAATACAGAAATGCATGGGAATTATAAGGAGCCAGATACTTGCGCTTATACCTACATAGGTTCAAATCCGCTTCCCATGTGTTAAGACAAATTGAAAAGGTGAGTAAATACGTCTTAGATATAATATTTTGGCCTCACTTTTTGAATTATAAATGTGTATCTCCTTGGCTATCAGTTTCTTTGTCTGACTGTACCATAGACTGGAATTATGTATAAATCCTTGTAGTTTGAGCAAATTGTAGTTTCATTGAATGAAAATCAATCTTCTACCTNCTTTGTCTGACTGTACCATAGACTGGAATTATGTATAAATCCTTGTAAAACGAGCAAATTGTAGTTTCATTGAATGAAAATCAATCTTCTACCTGCTTCTG

Coding sequence (CDS)

AAGAGGAAATTTGCTGACTTGTTAGGGCCTCAATGGAGCAGAGATGAGGTTGAGCAGTTCTATGAAGCATATCGTAAATATGGAAAAGATTGGAAGAAGGTAGCTGCTGCAGTAAGGAACCGTTCTACCGAAATGGTCGAGGCTCTTTTCACCATGAATAGGGCCTATTTATCACTTCCAGAGGGCACTGCTTCAGTTGTTGGGTTAATTGCAATGATGACTGATCATTACAGCGTACTGAGAGACAGTGAAAGTGAGCAAGAAAGTAATGAGGATTCAGGAGCAACAAGAAAGCCTCAAAAGCGACTGCGTGGAAAGTCACGAAATAACAACTCAAAGGGATTGGATGCACATTTTGGAGATGCTTCACAATCACAGTCACTTCCAACAAACTATGGCTGCCTGTCATTGTTGAAGAAAAGGCGTTCTGGAATTAAACCTCATGCTGTTGGAAAACGGACTCCGCGCGTCCCTGTATCATATTCATACGATAAAGATAGTAGAGAGAGGATTTTTTCACCTTCCAGGCATACTAGTAAGCTAAAGGTTGATGATCCAAATGATGATGATGTTGCTCATGAGATAGCTTTAGTTTTAACAGAGGCTTCGCAACGCGATGGTTCTCCTCAACTTTCACAAACACCAAATCCGAAAATAGAAGGTCATGTACTTTCACCTATCCGAAATGACAGGATGCGAAGTGAATCAGACATGATGAGTACGAAGTTTCGATGTAGTGAAATGGATGAAGGTGGTTGTGAATTAAGCTTAGGAAGCACTGGAGCTGATAATGCAGACTATGATCAGGGGAAAAATACTCGTGAGATTCAAAGAAAGGGTAAAAGGTACTACGGAAAGAAGGCAGAAGTTGAAGAAAGTATGTATAACCACTTGGATGACATCAAGGAAGCTTGTAGTGGGACTGAAGAGGGGCAAAAGTCAGGCAGTTTGAGGGGAAAACTTGAAACTGAGGATTTTGATGTAAAATCTGTGAGAACCTCTTTTAAAGGCCCAAGGAAGAGAAGTAAGAAAGCTCTATTTGGAGATGAATGCTCGGCATTTGATGCACTGCAAACTTTAGCTGATTTGTCTTTGATGATGCCAGACACTACTGCTGACACTGATGAATGCTCGGCATTTGATGCACTGCAAACTTTAGCTGATTTGTCTTTGATGATGCCAGACACTACTGCTGACACTGAGCCTTCTGCAAAGGTTAAAGAAGAAAATCTTGATGTCATGGACAAGTCTAAAATGAAAGGGAATCATTCAGTTGTGGGAGCTGGAATCTCTGCTTCCAAGACATCTAAGACGGGAAAGGCTTCAGGCAACAATGTTGGTCCTATTCCTGAAGCAGAAGGAATTCAAGGATCCAATAATGGAAATCGTAAAAGGAAACAGAAGTCCTCTCCATTTAAAATTTCATCTAAAGATGAAGATAGCAATGATTCTCGTGTCAATGACACTCCAAAAACCAAGGAGGTGGTGGTGTCTGAATTGAAGCGCTTGAATGATGAGGTGTTGGAAAACCAAATAAATGGAGACAACTTGCTCAAGGATTCAGAGAACTTTAAGAAGCAATATGCTGCTGTGCTATTACAGTTGAATGAAGTTAATGAACAGGCATGCGTCTCCTCTGCTCTGTATAGCTTGAGGCAGCGCAATACGTATCAAGGGACTTCACCATTAATGTTCCTCAAGCCAGTGCATGATTTGGGTGACCCTTGCTCTCATGCTCAAGAACCCGGTTCCCATGTGGCTGAAATTGTGGGAAGTTCCAGAGCAAAGGCTCAAACAATGATCGATGAAGCAATGCAGGCAATTCTTGCTCTGAAAAAAAGAGAAAGTAATTTGGAGAACATTGAAGAAGCCATTGATTTTGTGAGTAATAAACTCTCAGTAGATGATTTGGCCTTGCCAACTGTGAAATCAACATCTGCAGACACTAGTAATGCCACTCAAGTATCTCAGAATCATTTCAATGCGGTATCTCAGAATCA
TTTCAATGCGGGTGCATCAAACCCATCAGCTGCTAATTATGTAGTTGGTTCCAAGTCCAATAGCCCATCCGACAAGCCTGAAGTGGAGATCCCTTCAGAACTTATTGCGCACTGTGTAGCCACTTTACTCATGATTCAGAAATGCACAGAACGACAGTTTCCGCCAGCTGATGTTGCTCAGGTACTGGATTCTGCTGTCAATAGTTTGCAGCCATGTTGTCCTCAGAACCTTCCACTATATGCAGAAATACAGAAATGCATGGGAATTATAAGGAGCCAGATACTTGCGCTTATACCTACATAG

Protein sequence

KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLPEGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDKSKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDEDSNDSRVNDTPKTKEVVVSELKRLNDEVLENQINGDNLLKDSENFKKQYAAVLLQLNEVNEQACVSSALYSLRQRNTYQGTSPLMFLKPVHDLGDPCSHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILALKKRESNLENIEEAIDFVSNKLSVDDLALPTVKSTSADTSNATQVSQNHFNAVSQNHFNAGASNPSAANYVVGSKSNSPSDKPEVEIPSELIAHCVATLLMIQKCTERQFPPADVAQVLDSAVNSLQPCCPQNLPLYAEIQKCMGIIRSQILALIPT
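
The protein sequence above should correspond to the CDS read in frame 0. The sketch below is one way to spot-check that, assuming the standard nuclear genetic code (the page does not name the table used). The helper `translate` is hypothetical, and only short fragments copied from the start and end of the CDS are translated here.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )

# First 39 nt and last 12 nt of the CDS shown above.
cds_start = translate("AAGAGGAAATTTGCTGACTTGTTAGGGCCTCAATGGAGC")
cds_end = translate("ATACCTACATAG")
print(cds_start, cds_end)
```

Codons containing `N` are reported as `X` rather than guessed; the terminal `*` marks the stop codon, which the protein listing omits.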
Homology
BLAST of Cp4.1LG08g04800 vs. ExPASy Swiss-Prot
Match: Q6A332 (Protein ALWAYS EARLY 3 OS=Arabidopsis thaliana OX=3702 GN=ALY3 PE=1 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 1.4e-118
Identity = 381/1129 (33.75%), Positives = 510/1129 (45.17%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRK +D+LGPQWS++E+E+FYE YRK+GK+WKKVA  V +RS EMVEAL+TMN+AYLSLP
Sbjct: 36   KRKLSDMLGPQWSKEELERFYEGYRKFGKEWKKVAGFVHSRSAEMVEALYTMNKAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVLR-DSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
            EGTASVVGL AMMTDHYSVL   S+SEQE+NE     R   KR R KS ++ S GL+   
Sbjct: 96   EGTASVVGLTAMMTDHYSVLHGGSDSEQENNEGIETPRSAPKRSRVKSSDHPSIGLEG-L 155

Query: 121  GDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTS 180
             D  Q +S   + G +  LKKRR+   P AVGKRTPR+P+SY+ +KD+RER  SP +   
Sbjct: 156  SDRLQFRS---SSGFMPSLKKRRTETMPRAVGKRTPRIPISYTLEKDTRERYLSPVKRGL 215

Query: 181  KLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMM 240
              K DD  DDD+ HEIAL L EASQR GS + S TPN K + +     + +RMR++ D+ 
Sbjct: 216  NQKGDD-TDDDMEHEIALALAEASQRGGSTKNSHTPNRKAKMYPPDK-KGERMRADIDLA 275

Query: 241  STKFRCSEMDEGGCELSLGSTGADNADYDQGKN---------TREIQRKGKRYYGKKAEV 300
              K   ++M++  CE SLGST ADNADY  G+N           E Q+KG+ YY ++  +
Sbjct: 276  IAKLHATDMEDVRCEPSLGSTEADNADYSGGRNDLTHGEGSSAVEKQQKGRTYYRRRVGI 335

Query: 301  EESMYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALF-GD 360
            +E      +D KEACSGT+E    G+   K E E  + K+++ ++K  R++SKK+LF  D
Sbjct: 336  KE------EDAKEACSGTDEAPSLGAPDEKFEQER-EGKALKFTYKVSRRKSKKSLFTAD 395

Query: 361  ECSAFDALQTLADLSLMMPDTTADT------DECSAFDAL-------------------- 420
            E +A DAL TLADLSLMMP+T  DT      +E  A +A                     
Sbjct: 396  EDTACDALHTLADLSLMMPETATDTESSVQAEEKKAGEAYVSDFKGTDPASMSKSSSLRN 455

Query: 421  ---QTLADLSLMMPDTTADTEPS------------AKVKEENL--------DVMDKSKMK 480
               +      L  P+    +  S            AKV+E  L         V++    K
Sbjct: 456  SKQRRYGSNDLCNPELERKSPSSSLIQKRRQKALPAKVRENVLKDELAASSQVIEPCNSK 515

Query: 481  G---NHSVVGAG-----ISASKTSKTGK-----ASGNNV--------------------- 540
            G    +  VG G     I  S   K+ K     +S NN+                     
Sbjct: 516  GIGEEYKPVGRGKRSASIRNSHEKKSAKSHDHTSSSNNIVEEDESAPSNAVIKKQVNLPT 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  KVRSRRKIVTEKPLTIDDGKISETIEKFSHCISSFRARRWCIFEWFYSAIDYPWFARQEF 635

Query: 601  ---------GPIPEAE----GIQGSNNGNRKR---------KQK---------------- 660
                     G +P       G+  S+ G  +R         K+K                
Sbjct: 636  VEYLDHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEKEKLYLYRDSVRKHYDELN 695

Query: 661  -----------SSPFKISSK---------------------------------------- 720
                       + P  +S +                                        
Sbjct: 696  TGMREGLPMDLARPLNVSQRVICLHPKSREIHDGNVLTVDHCRYRIQFDNPELGVEFVKD 755

Query: 721  -----------------------------------DEDSNDSRVNDTPK----------- 768
                                                E + +S +   PK           
Sbjct: 756  TECMPLNPLENMPASLARHYAFSNYHIQNPIEEKMHERAKESMLEGYPKLSCETGHLLSS 815
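
Each hit in this Homology section carries a summary line giving identical and positive residue counts over the alignment length. A minimal sketch of extracting those counts and recomputing the percentages follows. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the pattern is an assumption based only on the lines shown on this page; it also accepts the misspelled label "Postives".

```python
import re

# Matches e.g. "Identity = 381/1129 (33.75%), Positives = 510/1129 (45.17%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, align_len, positives, _ = (int(g) for g in m.groups())
    return {
        "identical": identical,
        "positives": positives,
        "align_len": align_len,
        "identity_pct": round(100 * identical / align_len, 2),
        "positive_pct": round(100 * positives / align_len, 2),
    }

# Summary line of the Q6A332 (ALY3) hit above.
stats = parse_hsp_stats(
    "Identity = 381/1129 (33.75%), Positives = 510/1129 (45.17%), Query Frame = 0"
)
print(stats)
```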

BLAST of Cp4.1LG08g04800 vs. ExPASy Swiss-Prot
Match: Q6A333 (Protein ALWAYS EARLY 2 OS=Arabidopsis thaliana OX=3702 GN=ALY2 PE=1 SV=1)

HSP 1 Score: 273.9 bits (699), Expect = 5.7e-72
Identity = 314/1055 (29.76%), Positives = 437/1055 (41.42%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRN-RSTEMVEALFTMNRAYLSL 60
            ++K +D LGPQW+R E+E+FY+AYRK+G++W++VAAA+RN RS +MVEALF MNRAYLSL
Sbjct: 33   RKKLSDKLGPQWTRLELERFYDAYRKHGQEWRRVAAAIRNSRSVDMVEALFNMNRAYLSL 92

Query: 61   PEGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
            PEGTASV GLIAMMTDHYSV+  S SE E ++ S   RK QKR R K + ++S       
Sbjct: 93   PEGTASVAGLIAMMTDHYSVMEGSGSEGEGHDASEVPRKQQKRKRAKPQRSDSP------ 152

Query: 121  GDASQSQSLPTNYGCLSLLKK-RRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHT 180
             +    QS+ +  GCL+ LK+ R +G + HA GKRTPRVPV  S+ +D RE    P++  
Sbjct: 153  EEVDIQQSIGSPDGCLTFLKQARANGTQRHATGKRTPRVPVQTSFMRDDREGSTPPNKRA 212

Query: 181  SKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRN----DRMRS 240
             K       +DDVAH +AL LT+AS+R GSP++S++PN + E    SPI++     R R 
Sbjct: 213  RK---QFDANDDVAHFLALALTDASRRGGSPKVSESPNRRTELSDSSPIKSWGKMSRTRK 272

Query: 241  ESDMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGK-NTREIQRKGKRYYGKKAEVEES 300
                        E  E   E  L S        D  +    E  RKGKR Y K+ +VEE+
Sbjct: 273  SQSKHCGSSIFEEWMESSRERKLDSDKDTTLLMDMERAGEMEAPRKGKRVYKKRVKVEEA 332

Query: 301  MYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSA 360
              N  DD  EACS T +G +S S R K   E       + S + P+KR  K   G    A
Sbjct: 333  ECNDSDDNGEACSAT-QGLRSKSQRRKAAIE---ASREKYSPRSPKKRDDKHTSG----A 392

Query: 361  FDALQTLADLSLMM----------------PDTTADTDECSAF-DALQTLA--------- 420
            FDALQ LA+LS  M                  T  D DE S+  +A  T +         
Sbjct: 393  FDALQALAELSASMLPANLMESELSAQLKEERTEYDMDEKSSTPEATSTSSHGEKANVEP 452

Query: 421  DLSLM------------------MPDTTADTEPSAKVKEENLDVMDKSKMK--------- 480
            D SL+                  +  T  D  P+ K++ +    + K K K         
Sbjct: 453  DDSLLHAISSVENANKRKSKPSRLVSTDCDDVPTGKLQPQTSGSLRKRKPKVLGDEAPAE 512

Query: 481  --GNHSV--------------------VGAGISASKTSKTGKA------------SGNNV 540
               N S+                     G   + SK  KT KA             G ++
Sbjct: 513  FSQNKSINKKELPQDENNMKSLVKTKRAGQVPAQSKQMKTVKALEESAITSDKKRPGMDI 572

Query: 541  GPIPEAEGIQGSN-------NGNRKRKQKSSPFKISSKDEDSNDSRVNDTPKTKEVVVSE 600
               P+     G         N  +K  QKS   K  S +     +R + +   +E+++ +
Sbjct: 573  VASPKQVSDSGPTSLSQKPPNRRKKSLQKSLQEKAKSSETTHKAARSSRSLSEQELLLKD 632

Query: 601  ------------------------------------------------LKRLNDEVLENQ 660
                                                            L RL   V+++ 
Sbjct: 633  KLATSLSFPFARRRCIFEWFYSAIDHPWFSKMEFVDYLNHVGLGHIPRLTRLEWSVIKSS 692

Query: 661  IN-----GDNLLKDSENFKKQY---------------------------------AAVLL 720
            +       +  L +     KQY                                  A+  
Sbjct: 693  LGRPRRFSERFLHEEREKLKQYRESVRKHYTELRTGAREGLPTDLARPLAVGNRVIAIHP 752

Query: 721  QLNEVNE---------------------------------------------QACVSSAL 768
            +  E+++                                               C+S   
Sbjct: 753  KTREIHDGKILTVDHNKCNVLFDDLGVELVMDIDCMPLNPLEYMPEGLRRQIDKCLSMKK 812

BLAST of Cp4.1LG08g04800 vs. ExPASy Swiss-Prot
Match: Q6A331 (Protein ALWAYS EARLY 1 OS=Arabidopsis thaliana OX=3702 GN=ALY1 PE=2 SV=2)

HSP 1 Score: 186.8 bits (473), Expect = 9.1e-46
Identity = 253/970 (26.08%), Positives = 403/970 (41.55%), Query Frame = 0

Query: 1   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVR-NRSTEMVEALFTMNRAYLSL 60
           K+K AD LGPQW++ E+ +FY+AYRKY  DWKKVAAAVR NRS EMVE LF MNRAYLSL
Sbjct: 34  KKKLADKLGPQWTKRELVRFYDAYRKYVGDWKKVAAAVRNNRSVEMVETLFCMNRAYLSL 93

Query: 61  PEGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
           PEGTASV GLIAMMTDHYSV+  SESE E ++ S  TRK  KR R +   ++ +      
Sbjct: 94  PEGTASVAGLIAMMTDHYSVMEGSESEGEDHDASEVTRKHLKRKRPQVLPSDFR------ 153

Query: 121 GDASQSQSLPTNYGCLSLLKKRRSGIK-PHAVGKRTPRVPVSYSYDKDSRERIFSPSRHT 180
            +     S+ +  GCLS LK+ ++  K   A GKRTPR  V+ ++++D  E  FSP    
Sbjct: 154 EEVVPPHSVASVEGCLSFLKQTQAYEKRQRATGKRTPRFLVAITHERDDIED-FSPPNKR 213

Query: 181 SKLKVDDPNDDDVAH----------EIALVLTEASQRDGSPQLSQTPNPKI--------- 240
           +K ++D   DDD +           E++ +     ++    Q +Q  +P           
Sbjct: 214 AKKQLD--ADDDASRRGGGSPYRRKELSEITPTRLRKTSQAQEAQFKHPDSSMFENGVRD 273

Query: 241 ----------EGHVLSPIRNDRMRSES--DMMSTKFRCSEMDEG-GCELSLGSTGADNA- 300
                     +G +L  +     + E    +   +   S+ D+G G   +L    A  A 
Sbjct: 274 RWHKKGAADRDGALLMDMEGLVTQKEKIVRVEEAEGNYSDDDDGLGALKTLAEMSASLAP 333

Query: 301 ----------DYDQGKNTREIQRKGK----------RYYGKKAEVEESMYNHLD------ 360
                      +++ + T  + +K            R   K+A +E+++ + +       
Sbjct: 334 AGLLESESSPHWEEERKTNNVDKKSNTLETVSTSHHREKAKQAGLEDNLLHAISAPDKRK 393

Query: 361 --DIKEACSG---TEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAF 420
              + E+  G   + E  ++ S + K + +  DV + + S         K+L+  E +  
Sbjct: 394 PKSVPESVDGNVVSIEELRTSSRKRKPKFQVLDVVAPKES------TQDKSLYTKESAEV 453

Query: 421 DALQT--------------LADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADT 480
           D+L+T              L      +  ++A   + +  DA+     +S   P+T    
Sbjct: 454 DSLKTPVKARRSSQGPAKQLKTAKTTVESSSASDKKITGPDAVVPATQVSASGPETLPQK 513

Query: 481 EPS-------------AKVKEENLDVMDKSKMKGNHSVVGAGIS---------------- 540
            P+             AK  E   D     K    H ++   +S                
Sbjct: 514 PPNRRKISLKKSLQERAKSLETTHDKPRSFKKLSEHELLQEKLSNCLSYPLVRRWCIYEW 573

Query: 541 ---------ASKTSKTGKASGNNVGPIP-----EAEGIQGSNNGNRKRKQKSSPFKISSK 600
                     +K   T   +   +G  P     E   I+ S    R+  Q+    +    
Sbjct: 574 FYSAIDYPWFAKMEFTDYLNHVGLGHAPRLTRVEWSVIKSSLGRPRRLSQRFLQDERDKL 633

Query: 601 DEDSNDSRVNDT----------------------------PKTKEVVVSELKRLND---E 660
            E     R + T                            PKT+E+   ++  ++     
Sbjct: 634 QEYRESVRKHYTELRGCATGVLHTDLARPLSVGNRVIAIHPKTREIRDGKILTVDHNKCN 693

Query: 661 VLENQINGDNLLKD------------SENFKKQYAAVLLQLNEVNEQACVSSALYSLRQR 720
           VL +++ G  L+ D             E  ++Q    L    E       SS    L   
Sbjct: 694 VLFDEL-GVELVMDIDCMPLNPLEYMPEGLRRQIDKCLAICKEARLNRHPSSDASVLFSP 753

Query: 721 NTYQGTSPLMFLKPV--HDLGDP-----------------------------------CS 768
           +  +  +  M   P    D+ +P                                    S
Sbjct: 754 SVLENVNFSMNPPPAKQDDIREPVLYGKVIATNTTDQSIVINSKVTGTEIQRTLALQHTS 813

BLAST of Cp4.1LG08g04800 vs. NCBI nr
Match: XP_022936778.1 (protein ALWAYS EARLY 3-like isoform X4 [Cucurbita moschata])

HSP 1 Score: 1229 bits (3180), Expect = 0.0
Identity = 725/1114 (65.08%), Positives = 726/1114 (65.17%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVL----RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 120
            EGTASVVGLIAMMTDHYSVL    RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD
Sbjct: 96   EGTASVVGLIAMMTDHYSVLMRCQRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 155

Query: 121  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 180
            AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR
Sbjct: 156  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 215

Query: 181  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 240
            HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES
Sbjct: 216  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 275

Query: 241  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 300
            DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN
Sbjct: 276  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 335

Query: 301  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 360
            HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA
Sbjct: 336  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 395

Query: 361  LQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDK 420
            LQTLADLSLMMPDTTADT                          EPSAKVKEENLDVMDK
Sbjct: 396  LQTLADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDK 455

Query: 421  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 480
            SKMKGNHSVVGAG SASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS
Sbjct: 456  SKMKGNHSVVGAGTSASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 515

Query: 481  KDEDSNDSRVNDTPKTK------------------------------------------- 540
            KDEDSNDSRVNDTPKTK                                           
Sbjct: 516  KDEDSNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKR 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  EDGDYALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDR 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  PHDLKEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEW 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  GVIRSSLGRPRRFSAQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQR 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  VIAIHPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHG 815

BLAST of Cp4.1LG08g04800 vs. NCBI nr
Match: XP_023539796.1 (protein ALWAYS EARLY 3-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1224 bits (3166), Expect = 0.0
Identity = 730/1164 (62.71%), Positives = 730/1164 (62.71%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG 120
            EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG
Sbjct: 96   EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG 155

Query: 121  DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK 180
            DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK
Sbjct: 156  DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK 215

Query: 181  LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS 240
            LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS
Sbjct: 216  LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS 275

Query: 241  TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD 300
            TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD
Sbjct: 276  TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD 335

Query: 301  IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL 360
            IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL
Sbjct: 336  IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL 395

Query: 361  ADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDKSKMK 420
            ADLSLMMPDTTADT                          EPSAKVKEENLDVMDKSKMK
Sbjct: 396  ADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDKSKMK 455

Query: 421  GNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED 480
            GNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED
Sbjct: 456  GNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED 515

Query: 481  SNDSRVNDTPKTK----------------------------------------------- 540
            SNDSRVNDTPKTK                                               
Sbjct: 516  SNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKREDGD 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  YALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDRPHDL 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  KEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEWGVIR 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  SSLGRPRRFSAQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQRVIAI 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  HPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHGVTLD 815

BLAST of Cp4.1LG08g04800 vs. NCBI nr
Match: XP_022973712.1 (protein ALWAYS EARLY 3-like isoform X4 [Cucurbita maxima])

HSP 1 Score: 1221 bits (3158), Expect = 0.0
Identity = 723/1114 (64.90%), Positives = 726/1114 (65.17%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVL----RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 120
            EGTASVVGLIAMMTDHYSVL    RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD
Sbjct: 96   EGTASVVGLIAMMTDHYSVLMRFQRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 155

Query: 121  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 180
            AH   +SQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR
Sbjct: 156  AH---SSQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 215

Query: 181  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 240
            HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES
Sbjct: 216  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 275

Query: 241  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 300
            DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN
Sbjct: 276  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 335

Query: 301  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 360
            HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA
Sbjct: 336  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 395

Query: 361  LQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDK 420
            LQTLADLSLMMPDTTADT                          EPSAKVKEENLDVMDK
Sbjct: 396  LQTLADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDK 455

Query: 421  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 480
            SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPE EGIQGSNNGNRKRKQKSSPFKISS
Sbjct: 456  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEGEGIQGSNNGNRKRKQKSSPFKISS 515

Query: 481  KDEDSNDSRVNDTPKTK------------------------------------------- 540
            KDEDSNDSRVNDTPKTK                                           
Sbjct: 516  KDEDSNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKR 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  EDGDYALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDR 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  PHDLKEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEW 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  GVIRSSLGRPRRFSAQFLKEEKHKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQR 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  VIAIHPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHG 815

BLAST of Cp4.1LG08g04800 vs. NCBI nr
Match: XP_023539795.1 (protein ALWAYS EARLY 3-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1218 bits (3151), Expect = 0.0
Identity = 730/1168 (62.50%), Positives = 730/1168 (62.50%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVL----RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 120
            EGTASVVGLIAMMTDHYSVL    RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD
Sbjct: 96   EGTASVVGLIAMMTDHYSVLMRCQRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 155

Query: 121  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 180
            AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR
Sbjct: 156  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 215

Query: 181  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 240
            HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES
Sbjct: 216  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 275

Query: 241  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 300
            DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN
Sbjct: 276  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 335

Query: 301  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 360
            HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA
Sbjct: 336  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 395

Query: 361  LQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDK 420
            LQTLADLSLMMPDTTADT                          EPSAKVKEENLDVMDK
Sbjct: 396  LQTLADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDK 455

Query: 421  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 480
            SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS
Sbjct: 456  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 515

Query: 481  KDEDSNDSRVNDTPKTK------------------------------------------- 540
            KDEDSNDSRVNDTPKTK                                           
Sbjct: 516  KDEDSNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKR 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  EDGDYALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDR 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  PHDLKEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEW 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  GVIRSSLGRPRRFSAQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQR 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  VIAIHPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHG 815

BLAST of Cp4.1LG08g04800 vs. NCBI nr
Match: KAG7028966.1 (Protein ALWAYS EARLY 3, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1216 bits (3147), Expect = 0.0
Identity = 728/1185 (61.43%), Positives = 728/1185 (61.43%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 2    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 61

Query: 61   EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG 120
            EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG
Sbjct: 62   EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG 121

Query: 121  DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK 180
            DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK
Sbjct: 122  DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK 181

Query: 181  LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS 240
            LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS
Sbjct: 182  LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS 241

Query: 241  TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD 300
            TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD
Sbjct: 242  TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD 301

Query: 301  IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL 360
            IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL
Sbjct: 302  IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL 361

Query: 361  ADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDKSKMK 420
            ADLSLMMPDTTADT                          EPSAKVKEENLDVMDKSKMK
Sbjct: 362  ADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDKSKMK 421

Query: 421  GNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED 480
            GNHSVVGAG SASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED
Sbjct: 422  GNHSVVGAGTSASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED 481

Query: 481  SNDSRVNDTPKTK----------------------------------------------- 540
            SNDSRVNDTPKTK                                               
Sbjct: 482  SNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKREDGD 541

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 542  YALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDRPHDL 601

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 602  KEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEWGVIR 661

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 662  SSLGRPRRFSAQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQRVIAI 721

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 722  HPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHGVTLD 781

BLAST of Cp4.1LG08g04800 vs. ExPASy TrEMBL
Match: A0A6J1FE66 (protein ALWAYS EARLY 3-like isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111443249 PE=4 SV=1)

HSP 1 Score: 1229 bits (3180), Expect = 0.0
Identity = 725/1114 (65.08%), Positives = 726/1114 (65.17%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVL----RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 120
            EGTASVVGLIAMMTDHYSVL    RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD
Sbjct: 96   EGTASVVGLIAMMTDHYSVLMRCQRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 155

Query: 121  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 180
            AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR
Sbjct: 156  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 215

Query: 181  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 240
            HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES
Sbjct: 216  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 275

Query: 241  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 300
            DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN
Sbjct: 276  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 335

Query: 301  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 360
            HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA
Sbjct: 336  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 395

Query: 361  LQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDK 420
            LQTLADLSLMMPDTTADT                          EPSAKVKEENLDVMDK
Sbjct: 396  LQTLADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDK 455

Query: 421  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 480
            SKMKGNHSVVGAG SASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS
Sbjct: 456  SKMKGNHSVVGAGTSASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 515

Query: 481  KDEDSNDSRVNDTPKTK------------------------------------------- 540
            KDEDSNDSRVNDTPKTK                                           
Sbjct: 516  KDEDSNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKR 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  EDGDYALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDR 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  PHDLKEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEW 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  GVIRSSLGRPRRFSAQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQR 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  VIAIHPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHG 815

BLAST of Cp4.1LG08g04800 vs. ExPASy TrEMBL
Match: A0A6J1I8A2 (protein ALWAYS EARLY 3-like isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111472290 PE=4 SV=1)

HSP 1 Score: 1221 bits (3158), Expect = 0.0
Identity = 723/1114 (64.90%), Positives = 726/1114 (65.17%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVL----RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 120
            EGTASVVGLIAMMTDHYSVL    RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD
Sbjct: 96   EGTASVVGLIAMMTDHYSVLMRFQRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 155

Query: 121  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 180
            AH   +SQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR
Sbjct: 156  AH---SSQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 215

Query: 181  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 240
            HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES
Sbjct: 216  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 275

Query: 241  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 300
            DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN
Sbjct: 276  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 335

Query: 301  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 360
            HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA
Sbjct: 336  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 395

Query: 361  LQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDK 420
            LQTLADLSLMMPDTTADT                          EPSAKVKEENLDVMDK
Sbjct: 396  LQTLADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDK 455

Query: 421  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 480
            SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPE EGIQGSNNGNRKRKQKSSPFKISS
Sbjct: 456  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEGEGIQGSNNGNRKRKQKSSPFKISS 515

Query: 481  KDEDSNDSRVNDTPKTK------------------------------------------- 540
            KDEDSNDSRVNDTPKTK                                           
Sbjct: 516  KDEDSNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKR 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  EDGDYALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDR 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  PHDLKEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEW 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  GVIRSSLGRPRRFSAQFLKEEKHKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQR 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  VIAIHPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHG 815

BLAST of Cp4.1LG08g04800 vs. ExPASy TrEMBL
Match: A0A6J1FEN0 (protein ALWAYS EARLY 3-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111443249 PE=4 SV=1)

HSP 1 Score: 1214 bits (3141), Expect = 0.0
Identity = 725/1164 (62.29%), Positives = 726/1164 (62.37%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG 120
            EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG
Sbjct: 96   EGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHFG 155

Query: 121  DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK 180
            DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK
Sbjct: 156  DASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTSK 215

Query: 181  LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS 240
            LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS
Sbjct: 216  LKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMMS 275

Query: 241  TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD 300
            TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD
Sbjct: 276  TKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYNHLDD 335

Query: 301  IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL 360
            IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL
Sbjct: 336  IKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDALQTL 395

Query: 361  ADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDKSKMK 420
            ADLSLMMPDTTADT                          EPSAKVKEENLDVMDKSKMK
Sbjct: 396  ADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDKSKMK 455

Query: 421  GNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED 480
            GNHSVVGAG SASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED
Sbjct: 456  GNHSVVGAGTSASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISSKDED 515

Query: 481  SNDSRVNDTPKTK----------------------------------------------- 540
            SNDSRVNDTPKTK                                               
Sbjct: 516  SNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKREDGD 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  YALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDRPHDL 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  KEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEWGVIR 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  SSLGRPRRFSAQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQRVIAI 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  HPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHGVTLD 815

BLAST of Cp4.1LG08g04800 vs. ExPASy TrEMBL
Match: A0A6J1FE57 (protein ALWAYS EARLY 3-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111443249 PE=4 SV=1)

HSP 1 Score: 1214 bits (3140), Expect = 0.0
Identity = 725/1154 (62.82%), Positives = 726/1154 (62.91%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVL----RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 120
            EGTASVVGLIAMMTDHYSVL    RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD
Sbjct: 96   EGTASVVGLIAMMTDHYSVLMRCQRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 155

Query: 121  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 180
            AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR
Sbjct: 156  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 215

Query: 181  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 240
            HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES
Sbjct: 216  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 275

Query: 241  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 300
            DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN
Sbjct: 276  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 335

Query: 301  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 360
            HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA
Sbjct: 336  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 395

Query: 361  LQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDK 420
            LQTLADLSLMMPDTTADT                          EPSAKVKEENLDVMDK
Sbjct: 396  LQTLADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDK 455

Query: 421  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 480
            SKMKGNHSVVGAG SASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS
Sbjct: 456  SKMKGNHSVVGAGTSASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 515

Query: 481  KDEDSNDSRVNDTPKTK------------------------------------------- 540
            KDEDSNDSRVNDTPKTK                                           
Sbjct: 516  KDEDSNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKR 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  EDGDYALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNEQHSNCLSW 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  HKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEWGVIRSSLGRPRRFS 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  AQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQRVIAIHPKTREIHDG 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  SVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHGVTLDKIFGNLNEVK 815

BLAST of Cp4.1LG08g04800 vs. ExPASy TrEMBL
Match: A0A6J1FEM0 (protein ALWAYS EARLY 3-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443249 PE=4 SV=1)

HSP 1 Score: 1208 bits (3126), Expect = 0.0
Identity = 725/1168 (62.07%), Positives = 726/1168 (62.16%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP
Sbjct: 36   KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVL----RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 120
            EGTASVVGLIAMMTDHYSVL    RDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD
Sbjct: 96   EGTASVVGLIAMMTDHYSVLMRCQRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLD 155

Query: 121  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 180
            AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR
Sbjct: 156  AHFGDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSR 215

Query: 181  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 240
            HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES
Sbjct: 216  HTSKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSES 275

Query: 241  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 300
            DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN
Sbjct: 276  DMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGKNTREIQRKGKRYYGKKAEVEESMYN 335

Query: 301  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 360
            HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA
Sbjct: 336  HLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSAFDA 395

Query: 361  LQTLADLSLMMPDTTADTDECSAFDALQTLADLSLMMPDTTADTEPSAKVKEENLDVMDK 420
            LQTLADLSLMMPDTTADT                          EPSAKVKEENLDVMDK
Sbjct: 396  LQTLADLSLMMPDTTADT--------------------------EPSAKVKEENLDVMDK 455

Query: 421  SKMKGNHSVVGAGISASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 480
            SKMKGNHSVVGAG SASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS
Sbjct: 456  SKMKGNHSVVGAGTSASKTSKTGKASGNNVGPIPEAEGIQGSNNGNRKRKQKSSPFKISS 515

Query: 481  KDEDSNDSRVNDTPKTK------------------------------------------- 540
            KDEDSNDSRVNDTPKTK                                           
Sbjct: 516  KDEDSNDSRVNDTPKTKATDDGKSSFGKVKRSPHNAGLAKSSKISKPLDHHSSSSTDHKR 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  EDGDYALSTTQVPSINPISLPTKMRSRRKMDLLKSQRDSKIADNILIDQLNVTAHSLDDR 635

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 636  PHDLKEQHSNCLSWHKLRRWCVFEWLYSAIDFPWFAKCEFVEYLNHVGLGHIPRLTRVEW 695

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 696  GVIRSSLGRPRRFSAQFLKEEKQKLNQYRESVRKHYAELRAGTREGLPTDLARPLSVGQR 755

Query: 721  ------------------------------------------------------------ 767
                                                                        
Sbjct: 756  VIAIHPKTREIHDGSVLTVDYSRCRVQFDRPELGVEFVMDIECMPLNPVENMPANLSRHG 815

BLAST of Cp4.1LG08g04800 vs. TAIR 10
Match: AT3G21430.2 (DNA binding)

HSP 1 Score: 428.7 bits (1101), Expect = 9.8e-120
Identity = 381/1129 (33.75%), Positives = 510/1129 (45.17%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRNRSTEMVEALFTMNRAYLSLP 60
            KRK +D+LGPQWS++E+E+FYE YRK+GK+WKKVA  V +RS EMVEAL+TMN+AYLSLP
Sbjct: 36   KRKLSDMLGPQWSKEELERFYEGYRKFGKEWKKVAGFVHSRSAEMVEALYTMNKAYLSLP 95

Query: 61   EGTASVVGLIAMMTDHYSVLR-DSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
            EGTASVVGL AMMTDHYSVL   S+SEQE+NE     R   KR R KS ++ S GL+   
Sbjct: 96   EGTASVVGLTAMMTDHYSVLHGGSDSEQENNEGIETPRSAPKRSRVKSSDHPSIGLEG-L 155

Query: 121  GDASQSQSLPTNYGCLSLLKKRRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHTS 180
             D  Q +S   + G +  LKKRR+   P AVGKRTPR+P+SY+ +KD+RER  SP +   
Sbjct: 156  SDRLQFRS---SSGFMPSLKKRRTETMPRAVGKRTPRIPISYTLEKDTRERYLSPVKRGL 215

Query: 181  KLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRNDRMRSESDMM 240
              K DD  DDD+ HEIAL L EASQR GS + S TPN K + +     + +RMR++ D+ 
Sbjct: 216  NQKGDD-TDDDMEHEIALALAEASQRGGSTKNSHTPNRKAKMYPPDK-KGERMRADIDLA 275

Query: 241  STKFRCSEMDEGGCELSLGSTGADNADYDQGKN---------TREIQRKGKRYYGKKAEV 300
              K   ++M++  CE SLGST ADNADY  G+N           E Q+KG+ YY ++  +
Sbjct: 276  IAKLHATDMEDVRCEPSLGSTEADNADYSGGRNDLTHGEGSSAVEKQQKGRTYYRRRVGI 335

Query: 301  EESMYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALF-GD 360
            +E      +D KEACSGT+E    G+   K E E  + K+++ ++K  R++SKK+LF  D
Sbjct: 336  KE------EDAKEACSGTDEAPSLGAPDEKFEQER-EGKALKFTYKVSRRKSKKSLFTAD 395

Query: 361  ECSAFDALQTLADLSLMMPDTTADT------DECSAFDAL-------------------- 420
            E +A DAL TLADLSLMMP+T  DT      +E  A +A                     
Sbjct: 396  EDTACDALHTLADLSLMMPETATDTESSVQAEEKKAGEAYVSDFKGTDPASMSKSSSLRN 455

Query: 421  ---QTLADLSLMMPDTTADTEPS------------AKVKEENL--------DVMDKSKMK 480
               +      L  P+    +  S            AKV+E  L         V++    K
Sbjct: 456  SKQRRYGSNDLCNPELERKSPSSSLIQKRRQKALPAKVRENVLKDELAASSQVIEPCNSK 515

Query: 481  G---NHSVVGAG-----ISASKTSKTGK-----ASGNNV--------------------- 540
            G    +  VG G     I  S   K+ K     +S NN+                     
Sbjct: 516  GIGEEYKPVGRGKRSASIRNSHEKKSAKSHDHTSSSNNIVEEDESAPSNAVIKKQVNLPT 575

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 576  KVRSRRKIVTEKPLTIDDGKISETIEKFSHCISSFRARRWCIFEWFYSAIDYPWFARQEF 635

Query: 601  ---------GPIPEAE----GIQGSNNGNRKR---------KQK---------------- 660
                     G +P       G+  S+ G  +R         K+K                
Sbjct: 636  VEYLDHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEKEKLYLYRDSVRKHYDELN 695

Query: 661  -----------SSPFKISSK---------------------------------------- 720
                       + P  +S +                                        
Sbjct: 696  TGMREGLPMDLARPLNVSQRVICLHPKSREIHDGNVLTVDHCRYRIQFDNPELGVEFVKD 755

Query: 721  -----------------------------------DEDSNDSRVNDTPK----------- 768
                                                E + +S +   PK           
Sbjct: 756  TECMPLNPLENMPASLARHYAFSNYHIQNPIEEKMHERAKESMLEGYPKLSCETGHLLSS 815

BLAST of Cp4.1LG08g04800 vs. TAIR 10
Match: AT3G05380.2 (DIRP; Myb-like DNA-binding domain)

HSP 1 Score: 275.0 bits (702), Expect = 1.8e-73
Identity = 315/1055 (29.86%), Positives = 437/1055 (41.42%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRN-RSTEMVEALFTMNRAYLSL 60
            K+K +D LGPQW+R E+E+FY+AYRK+G++W++VAAA+RN RS +MVEALF MNRAYLSL
Sbjct: 34   KKKLSDKLGPQWTRLELERFYDAYRKHGQEWRRVAAAIRNSRSVDMVEALFNMNRAYLSL 93

Query: 61   PEGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
            PEGTASV GLIAMMTDHYSV+  S SE E ++ S   RK QKR R K + ++S       
Sbjct: 94   PEGTASVAGLIAMMTDHYSVMEGSGSEGEGHDASEVPRKQQKRKRAKPQRSDSP------ 153

Query: 121  GDASQSQSLPTNYGCLSLLKK-RRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHT 180
             +    QS+ +  GCL+ LK+ R +G + HA GKRTPRVPV  S+ +D RE    P++  
Sbjct: 154  EEVDIQQSIGSPDGCLTFLKQARANGTQRHATGKRTPRVPVQTSFMRDDREGSTPPNKRA 213

Query: 181  SKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRN----DRMRS 240
             K       +DDVAH +AL LT+AS+R GSP++S++PN + E    SPI++     R R 
Sbjct: 214  RK---QFDANDDVAHFLALALTDASRRGGSPKVSESPNRRTELSDSSPIKSWGKMSRTRK 273

Query: 241  ESDMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGK-NTREIQRKGKRYYGKKAEVEES 300
                        E  E   E  L S        D  +    E  RKGKR Y K+ +VEE+
Sbjct: 274  SQSKHCGSSIFEEWMESSRERKLDSDKDTTLLMDMERAGEMEAPRKGKRVYKKRVKVEEA 333

Query: 301  MYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSA 360
              N  DD  EACS T +G +S S R K   E       + S + P+KR  K   G    A
Sbjct: 334  ECNDSDDNGEACSAT-QGLRSKSQRRKAAIE---ASREKYSPRSPKKRDDKHTSG----A 393

Query: 361  FDALQTLADLSLMM----------------PDTTADTDECSAF-DALQTLA--------- 420
            FDALQ LA+LS  M                  T  D DE S+  +A  T +         
Sbjct: 394  FDALQALAELSASMLPANLMESELSAQLKEERTEYDMDEKSSTPEATSTSSHGEKANVEP 453

Query: 421  DLSLM------------------MPDTTADTEPSAKVKEENLDVMDKSKMK--------- 480
            D SL+                  +  T  D  P+ K++ +    + K K K         
Sbjct: 454  DDSLLHAISSVENANKRKSKPSRLVSTDCDDVPTGKLQPQTSGSLRKRKPKVLGDEAPAE 513

Query: 481  --GNHSV--------------------VGAGISASKTSKTGKA------------SGNNV 540
               N S+                     G   + SK  KT KA             G ++
Sbjct: 514  FSQNKSINKKELPQDENNMKSLVKTKRAGQVPAQSKQMKTVKALEESAITSDKKRPGMDI 573

Query: 541  GPIPEAEGIQGSN-------NGNRKRKQKSSPFKISSKDEDSNDSRVNDTPKTKEVVVSE 600
               P+     G         N  +K  QKS   K  S +     +R + +   +E+++ +
Sbjct: 574  VASPKQVSDSGPTSLSQKPPNRRKKSLQKSLQEKAKSSETTHKAARSSRSLSEQELLLKD 633

Query: 601  ------------------------------------------------LKRLNDEVLENQ 660
                                                            L RL   V+++ 
Sbjct: 634  KLATSLSFPFARRRCIFEWFYSAIDHPWFSKMEFVDYLNHVGLGHIPRLTRLEWSVIKSS 693

Query: 661  IN-----GDNLLKDSENFKKQY---------------------------------AAVLL 720
            +       +  L +     KQY                                  A+  
Sbjct: 694  LGRPRRFSERFLHEEREKLKQYRESVRKHYTELRTGAREGLPTDLARPLAVGNRVIAIHP 753

Query: 721  QLNEVNE---------------------------------------------QACVSSAL 768
            +  E+++                                               C+S   
Sbjct: 754  KTREIHDGKILTVDHNKCNVLFDDLGVELVMDIDCMPLNPLEYMPEGLRRQIDKCLSMKK 813
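
The identity and positives percentages in an HSP header are the match counts divided by the alignment length (here 315/1055 and 437/1055). A minimal Python sketch of that arithmetic follows; the function name `parse_hsp_header` and its regular expressions are illustrative assumptions, not part of BLAST or of this database.

```python
import re

# Header text copied from the AT3G05380.2 hit in this record.
HSP_HEADER = (
    "HSP 1 Score: 275.0 bits (702), Expect = 1.8e-73\n"
    "Identity = 315/1055 (29.86%), Positives = 437/1055 (41.42%), Query Frame = 0"
)


def parse_hsp_header(text):
    """Extract score, E-value and identity/positive counts from an HSP header.

    Hypothetical helper: the patterns only target the header layout shown above.
    """
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text)
    expect = re.search(r"Expect\s*=\s*([\d.eE+-]+)", text)
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", text)
    # "Pos\w*" tolerates spelling variants of the "Positives" label.
    posit = re.search(r"Pos\w*\s*=\s*(\d+)/(\d+)", text)

    identical, length = int(ident.group(1)), int(ident.group(2))
    positives = int(posit.group(1))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        "alignment_length": length,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


result = parse_hsp_header(HSP_HEADER)
print(result["identity_pct"], result["positives_pct"])  # 29.86 41.42
```

Recomputing the percentages from the counts gives a quick consistency check on the reported values.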

BLAST of Cp4.1LG08g04800 vs. TAIR 10
Match: AT3G05380.5 (DIRP ;Myb-like DNA-binding domain )

HSP 1 Score: 275.0 bits (702), Expect = 1.8e-73
Identity = 315/1055 (29.86%), Positives = 437/1055 (41.42%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRN-RSTEMVEALFTMNRAYLSL 60
            K+K +D LGPQW+R E+E+FY+AYRK+G++W++VAAA+RN RS +MVEALF MNRAYLSL
Sbjct: 34   KKKLSDKLGPQWTRLELERFYDAYRKHGQEWRRVAAAIRNSRSVDMVEALFNMNRAYLSL 93

Query: 61   PEGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
            PEGTASV GLIAMMTDHYSV+  S SE E ++ S   RK QKR R K + ++S       
Sbjct: 94   PEGTASVAGLIAMMTDHYSVMEGSGSEGEGHDASEVPRKQQKRKRAKPQRSDSP------ 153

Query: 121  GDASQSQSLPTNYGCLSLLKK-RRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHT 180
             +    QS+ +  GCL+ LK+ R +G + HA GKRTPRVPV  S+ +D RE    P++  
Sbjct: 154  EEVDIQQSIGSPDGCLTFLKQARANGTQRHATGKRTPRVPVQTSFMRDDREGSTPPNKRA 213

Query: 181  SKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRN----DRMRS 240
             K       +DDVAH +AL LT+AS+R GSP++S++PN + E    SPI++     R R 
Sbjct: 214  RK---QFDANDDVAHFLALALTDASRRGGSPKVSESPNRRTELSDSSPIKSWGKMSRTRK 273

Query: 241  ESDMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGK-NTREIQRKGKRYYGKKAEVEES 300
                        E  E   E  L S        D  +    E  RKGKR Y K+ +VEE+
Sbjct: 274  SQSKHCGSSIFEEWMESSRERKLDSDKDTTLLMDMERAGEMEAPRKGKRVYKKRVKVEEA 333

Query: 301  MYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSA 360
              N  DD  EACS T +G +S S R K   E       + S + P+KR  K   G    A
Sbjct: 334  ECNDSDDNGEACSAT-QGLRSKSQRRKAAIE---ASREKYSPRSPKKRDDKHTSG----A 393

Query: 361  FDALQTLADLSLMM----------------PDTTADTDECSAF-DALQTLA--------- 420
            FDALQ LA+LS  M                  T  D DE S+  +A  T +         
Sbjct: 394  FDALQALAELSASMLPANLMESELSAQLKEERTEYDMDEKSSTPEATSTSSHGEKANVEP 453

Query: 421  DLSLM------------------MPDTTADTEPSAKVKEENLDVMDKSKMK--------- 480
            D SL+                  +  T  D  P+ K++ +    + K K K         
Sbjct: 454  DDSLLHAISSVENANKRKSKPSRLVSTDCDDVPTGKLQPQTSGSLRKRKPKVLGDEAPAE 513

Query: 481  --GNHSV--------------------VGAGISASKTSKTGKA------------SGNNV 540
               N S+                     G   + SK  KT KA             G ++
Sbjct: 514  FSQNKSINKKELPQDENNMKSLVKTKRAGQVPAQSKQMKTVKALEESAITSDKKRPGMDI 573

Query: 541  GPIPEAEGIQGSN-------NGNRKRKQKSSPFKISSKDEDSNDSRVNDTPKTKEVVVSE 600
               P+     G         N  +K  QKS   K  S +     +R + +   +E+++ +
Sbjct: 574  VASPKQVSDSGPTSLSQKPPNRRKKSLQKSLQEKAKSSETTHKAARSSRSLSEQELLLKD 633

Query: 601  ------------------------------------------------LKRLNDEVLENQ 660
                                                            L RL   V+++ 
Sbjct: 634  KLATSLSFPFARRRCIFEWFYSAIDHPWFSKMEFVDYLNHVGLGHIPRLTRLEWSVIKSS 693

Query: 661  IN-----GDNLLKDSENFKKQY---------------------------------AAVLL 720
            +       +  L +     KQY                                  A+  
Sbjct: 694  LGRPRRFSERFLHEEREKLKQYRESVRKHYTELRTGAREGLPTDLARPLAVGNRVIAIHP 753

Query: 721  QLNEVNE---------------------------------------------QACVSSAL 768
            +  E+++                                               C+S   
Sbjct: 754  KTREIHDGKILTVDHNKCNVLFDDLGVELVMDIDCMPLNPLEYMPEGLRRQIDKCLSMKK 813

BLAST of Cp4.1LG08g04800 vs. TAIR 10
Match: AT3G05380.4 (DIRP ;Myb-like DNA-binding domain )

HSP 1 Score: 275.0 bits (702), Expect = 1.8e-73
Identity = 315/1055 (29.86%), Positives = 437/1055 (41.42%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRN-RSTEMVEALFTMNRAYLSL 60
            K+K +D LGPQW+R E+E+FY+AYRK+G++W++VAAA+RN RS +MVEALF MNRAYLSL
Sbjct: 34   KKKLSDKLGPQWTRLELERFYDAYRKHGQEWRRVAAAIRNSRSVDMVEALFNMNRAYLSL 93

Query: 61   PEGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
            PEGTASV GLIAMMTDHYSV+  S SE E ++ S   RK QKR R K + ++S       
Sbjct: 94   PEGTASVAGLIAMMTDHYSVMEGSGSEGEGHDASEVPRKQQKRKRAKPQRSDSP------ 153

Query: 121  GDASQSQSLPTNYGCLSLLKK-RRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHT 180
             +    QS+ +  GCL+ LK+ R +G + HA GKRTPRVPV  S+ +D RE    P++  
Sbjct: 154  EEVDIQQSIGSPDGCLTFLKQARANGTQRHATGKRTPRVPVQTSFMRDDREGSTPPNKRA 213

Query: 181  SKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRN----DRMRS 240
             K       +DDVAH +AL LT+AS+R GSP++S++PN + E    SPI++     R R 
Sbjct: 214  RK---QFDANDDVAHFLALALTDASRRGGSPKVSESPNRRTELSDSSPIKSWGKMSRTRK 273

Query: 241  ESDMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGK-NTREIQRKGKRYYGKKAEVEES 300
                        E  E   E  L S        D  +    E  RKGKR Y K+ +VEE+
Sbjct: 274  SQSKHCGSSIFEEWMESSRERKLDSDKDTTLLMDMERAGEMEAPRKGKRVYKKRVKVEEA 333

Query: 301  MYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSA 360
              N  DD  EACS T +G +S S R K   E       + S + P+KR  K   G    A
Sbjct: 334  ECNDSDDNGEACSAT-QGLRSKSQRRKAAIE---ASREKYSPRSPKKRDDKHTSG----A 393

Query: 361  FDALQTLADLSLMM----------------PDTTADTDECSAF-DALQTLA--------- 420
            FDALQ LA+LS  M                  T  D DE S+  +A  T +         
Sbjct: 394  FDALQALAELSASMLPANLMESELSAQLKEERTEYDMDEKSSTPEATSTSSHGEKANVEP 453

Query: 421  DLSLM------------------MPDTTADTEPSAKVKEENLDVMDKSKMK--------- 480
            D SL+                  +  T  D  P+ K++ +    + K K K         
Sbjct: 454  DDSLLHAISSVENANKRKSKPSRLVSTDCDDVPTGKLQPQTSGSLRKRKPKVLGDEAPAE 513

Query: 481  --GNHSV--------------------VGAGISASKTSKTGKA------------SGNNV 540
               N S+                     G   + SK  KT KA             G ++
Sbjct: 514  FSQNKSINKKELPQDENNMKSLVKTKRAGQVPAQSKQMKTVKALEESAITSDKKRPGMDI 573

Query: 541  GPIPEAEGIQGSN-------NGNRKRKQKSSPFKISSKDEDSNDSRVNDTPKTKEVVVSE 600
               P+     G         N  +K  QKS   K  S +     +R + +   +E+++ +
Sbjct: 574  VASPKQVSDSGPTSLSQKPPNRRKKSLQKSLQEKAKSSETTHKAARSSRSLSEQELLLKD 633

Query: 601  ------------------------------------------------LKRLNDEVLENQ 660
                                                            L RL   V+++ 
Sbjct: 634  KLATSLSFPFARRRCIFEWFYSAIDHPWFSKMEFVDYLNHVGLGHIPRLTRLEWSVIKSS 693

Query: 661  IN-----GDNLLKDSENFKKQY---------------------------------AAVLL 720
            +       +  L +     KQY                                  A+  
Sbjct: 694  LGRPRRFSERFLHEEREKLKQYRESVRKHYTELRTGAREGLPTDLARPLAVGNRVIAIHP 753

Query: 721  QLNEVNE---------------------------------------------QACVSSAL 768
            +  E+++                                               C+S   
Sbjct: 754  KTREIHDGKILTVDHNKCNVLFDDLGVELVMDIDCMPLNPLEYMPEGLRRQIDKCLSMKK 813

BLAST of Cp4.1LG08g04800 vs. TAIR 10
Match: AT3G05380.1 (DIRP ;Myb-like DNA-binding domain )

HSP 1 Score: 273.9 bits (699), Expect = 4.0e-73
Identity = 314/1055 (29.76%), Positives = 437/1055 (41.42%), Query Frame = 0

Query: 1    KRKFADLLGPQWSRDEVEQFYEAYRKYGKDWKKVAAAVRN-RSTEMVEALFTMNRAYLSL 60
            ++K +D LGPQW+R E+E+FY+AYRK+G++W++VAAA+RN RS +MVEALF MNRAYLSL
Sbjct: 33   RKKLSDKLGPQWTRLELERFYDAYRKHGQEWRRVAAAIRNSRSVDMVEALFNMNRAYLSL 92

Query: 61   PEGTASVVGLIAMMTDHYSVLRDSESEQESNEDSGATRKPQKRLRGKSRNNNSKGLDAHF 120
            PEGTASV GLIAMMTDHYSV+  S SE E ++ S   RK QKR R K + ++S       
Sbjct: 93   PEGTASVAGLIAMMTDHYSVMEGSGSEGEGHDASEVPRKQQKRKRAKPQRSDSP------ 152

Query: 121  GDASQSQSLPTNYGCLSLLKK-RRSGIKPHAVGKRTPRVPVSYSYDKDSRERIFSPSRHT 180
             +    QS+ +  GCL+ LK+ R +G + HA GKRTPRVPV  S+ +D RE    P++  
Sbjct: 153  EEVDIQQSIGSPDGCLTFLKQARANGTQRHATGKRTPRVPVQTSFMRDDREGSTPPNKRA 212

Query: 181  SKLKVDDPNDDDVAHEIALVLTEASQRDGSPQLSQTPNPKIEGHVLSPIRN----DRMRS 240
             K       +DDVAH +AL LT+AS+R GSP++S++PN + E    SPI++     R R 
Sbjct: 213  RK---QFDANDDVAHFLALALTDASRRGGSPKVSESPNRRTELSDSSPIKSWGKMSRTRK 272

Query: 241  ESDMMSTKFRCSEMDEGGCELSLGSTGADNADYDQGK-NTREIQRKGKRYYGKKAEVEES 300
                        E  E   E  L S        D  +    E  RKGKR Y K+ +VEE+
Sbjct: 273  SQSKHCGSSIFEEWMESSRERKLDSDKDTTLLMDMERAGEMEAPRKGKRVYKKRVKVEEA 332

Query: 301  MYNHLDDIKEACSGTEEGQKSGSLRGKLETEDFDVKSVRTSFKGPRKRSKKALFGDECSA 360
              N  DD  EACS T +G +S S R K   E       + S + P+KR  K   G    A
Sbjct: 333  ECNDSDDNGEACSAT-QGLRSKSQRRKAAIE---ASREKYSPRSPKKRDDKHTSG----A 392

Query: 361  FDALQTLADLSLMM----------------PDTTADTDECSAF-DALQTLA--------- 420
            FDALQ LA+LS  M                  T  D DE S+  +A  T +         
Sbjct: 393  FDALQALAELSASMLPANLMESELSAQLKEERTEYDMDEKSSTPEATSTSSHGEKANVEP 452

Query: 421  DLSLM------------------MPDTTADTEPSAKVKEENLDVMDKSKMK--------- 480
            D SL+                  +  T  D  P+ K++ +    + K K K         
Sbjct: 453  DDSLLHAISSVENANKRKSKPSRLVSTDCDDVPTGKLQPQTSGSLRKRKPKVLGDEAPAE 512

Query: 481  --GNHSV--------------------VGAGISASKTSKTGKA------------SGNNV 540
               N S+                     G   + SK  KT KA             G ++
Sbjct: 513  FSQNKSINKKELPQDENNMKSLVKTKRAGQVPAQSKQMKTVKALEESAITSDKKRPGMDI 572

Query: 541  GPIPEAEGIQGSN-------NGNRKRKQKSSPFKISSKDEDSNDSRVNDTPKTKEVVVSE 600
               P+     G         N  +K  QKS   K  S +     +R + +   +E+++ +
Sbjct: 573  VASPKQVSDSGPTSLSQKPPNRRKKSLQKSLQEKAKSSETTHKAARSSRSLSEQELLLKD 632

Query: 601  ------------------------------------------------LKRLNDEVLENQ 660
                                                            L RL   V+++ 
Sbjct: 633  KLATSLSFPFARRRCIFEWFYSAIDHPWFSKMEFVDYLNHVGLGHIPRLTRLEWSVIKSS 692

Query: 661  IN-----GDNLLKDSENFKKQY---------------------------------AAVLL 720
            +       +  L +     KQY                                  A+  
Sbjct: 693  LGRPRRFSERFLHEEREKLKQYRESVRKHYTELRTGAREGLPTDLARPLAVGNRVIAIHP 752

Query: 721  QLNEVNE---------------------------------------------QACVSSAL 768
            +  E+++                                               C+S   
Sbjct: 753  KTREIHDGKILTVDHNKCNVLFDDLGVELVMDIDCMPLNPLEYMPEGLRRQIDKCLSMKK 812

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q6A332 | 1.4e-118 | 33.75 | Protein ALWAYS EARLY 3 OS=Arabidopsis thaliana OX=3702 GN=ALY3 PE=1 SV=1
Q6A333 | 5.7e-72 | 29.76 | Protein ALWAYS EARLY 2 OS=Arabidopsis thaliana OX=3702 GN=ALY2 PE=1 SV=1
Q6A331 | 9.1e-46 | 26.08 | Protein ALWAYS EARLY 1 OS=Arabidopsis thaliana OX=3702 GN=ALY1 PE=2 SV=2

Match Name | E-value | Identity (%) | Description
XP_022936778.1 | 0.0 | 65.08 | protein ALWAYS EARLY 3-like isoform X4 [Cucurbita moschata]
XP_023539796.1 | 0.0 | 62.71 | protein ALWAYS EARLY 3-like isoform X2 [Cucurbita pepo subsp. pepo]
XP_022973712.1 | 0.0 | 64.90 | protein ALWAYS EARLY 3-like isoform X4 [Cucurbita maxima]
XP_023539795.1 | 0.0 | 62.50 | protein ALWAYS EARLY 3-like isoform X1 [Cucurbita pepo subsp. pepo]
KAG7028966.1 | 0.0 | 61.43 | Protein ALWAYS EARLY 3, partial [Cucurbita argyrosperma subsp. argyrosperma]

Match Name | E-value | Identity (%) | Description
A0A6J1FE66 | 0.0 | 65.08 | protein ALWAYS EARLY 3-like isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1I8A2 | 0.0 | 64.90 | protein ALWAYS EARLY 3-like isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC1114722...
A0A6J1FEN0 | 0.0 | 62.29 | protein ALWAYS EARLY 3-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1FE57 | 0.0 | 62.82 | protein ALWAYS EARLY 3-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1FEM0 | 0.0 | 62.07 | protein ALWAYS EARLY 3-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11144...

Match Name | E-value | Identity (%) | Description
AT3G21430.2 | 9.8e-120 | 33.75 | DNA binding
AT3G05380.2 | 1.8e-73 | 29.86 | DIRP; Myb-like DNA-binding domain
AT3G05380.5 | 1.8e-73 | 29.86 | DIRP; Myb-like DNA-binding domain
AT3G05380.4 | 1.8e-73 | 29.86 | DIRP; Myb-like DNA-binding domain
AT3G05380.1 | 4.0e-73 | 29.76 | DIRP; Myb-like DNA-binding domain

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 594..631
None | No IPR available | GENE3D | 1.20.58.1880 | - | coord: 4..64; e-value: 4.2E-8; score: 35.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 203..219
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 471..493
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 81..126
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 203..222
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 81..99
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 110..126
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 395..493
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 401..419
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 428..443
IPR001005 | SANT/Myb domain | SMART | SM00717 | sant | coord: 8..56; e-value: 3.7E-4; score: 29.8
IPR001005 | SANT/Myb domain | CDD | cd00167 | SANT | coord: 12..46; e-value: 8.68956E-7; score: 44.1034
IPR017930 | Myb domain | PFAM | PF00249 | Myb_DNA-binding | coord: 11..46; e-value: 3.3E-7; score: 30.4
IPR017930 | Myb domain | PROSITE | PS51294 | HTH_MYB | coord: 1..46; score: 9.30805
IPR010561 | Protein LIN-9/Protein ALWAYS EARLY | PANTHER | PTHR21689 | LIN-9 | coord: 1..376; coord: 374..484; coord: 493..767
IPR028306 | Protein ALWAYS EARLY, plant | PANTHER | PTHR21689:SF5 | PROTEIN ALWAYS EARLY 1-RELATED | coord: 1..376; coord: 374..484; coord: 493..767
IPR017884 | SANT domain | PROSITE | PS51293 | SANT | coord: 7..44; score: 10.783667
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | 46689 | Homeodomain-like | coord: 8..54
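
Several of the MobiDB-lite disorder predictions listed above overlap (for example, 81..99 and 110..126 both fall inside 81..126). A minimal Python sketch, assuming the coordinates are 1-based inclusive residue positions, collapses them into non-overlapping regions; `parse_coord` and `merge_intervals` are hypothetical helper names, not part of InterProScan.

```python
import re

# MobiDB-lite coordinate strings copied from the InterPro annotations of this gene.
MOBIDB_COORDS = [
    "coord: 203..219", "coord: 471..493", "coord: 81..126",
    "coord: 203..222", "coord: 81..99", "coord: 110..126",
    "coord: 395..493", "coord: 401..419", "coord: 428..443",
]


def parse_coord(text):
    """Turn 'coord: 203..219' into the tuple (203, 219)."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    return int(match.group(1)), int(match.group(2))


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


regions = merge_intervals(parse_coord(c) for c in MOBIDB_COORDS)
disordered_residues = sum(end - start + 1 for start, end in regions)

print(regions)              # [(81, 126), (203, 222), (395, 493)]
print(disordered_residues)  # 165
```

The nine predictions reduce to three distinct disordered regions covering 165 residues in total.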

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG08g04800.1 | Cp4.1LG08g04800.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006351 | transcription, DNA-templated
cellular_component | GO:0017053 | transcription repressor complex