Cp4.1LG08g07180 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g07180
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Polycomb group protein VERNALIZATION 2
Location: Cp4.1LG08 : 5652994 .. 5672824 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

CATTCTTATTTTGTTTTATCTGCAGTGATTTTGAAAGTTTTTATTTGGCGACGCGTCGTCGATCAGCTACCGGAAGAACATAAATTTTGGAGGGAGAATCGTTTCCAATCTCCCCTTCTGGTTTCGACATAACAACCAATTTCTTCTACTCTTCAACCACTCTGCAATTCTGAAACCCATACTTTTTTCCTCGGAATTTCCAACTAGTTTGCCCGCCGGTTCCATTGTTTCTGGTAAGGTTAATAGGCTAGCCTTCTTTTATCGCTTTCTCTGTTTGGTTCTCCTGTCGTTGTTCATAATTTGTTCCTCTCCAACCTCACTGGGATAGAAGTTTATTCATCTTTTGTGTTTTGCTATGGAATTCTGAGTACAAGTTTTATTTGGTGGGTGTTAGTTTTCGTCTCGTTCTCTTGATTGGAGATGTTGAAGTTGGGCATAATCTAGTCTCAATCGGTGTAGAAGTGATTAAATCTGGTTGACATTTGCTAGTTTTGAATAATTACTTGTTTTGGTATGTGTTTCGTGTAAACGTTTCTACTAAAATCAGGATTAAGAACTTTCGGAGTTATCTTTAAGATCGTTCTTCTCCTTGTTGCGGGGCTCAGTAAATGGTTCATTCATTGGCTTAGGGTTGTTTTTGATTTTGATCCCTCTTTATAAGCTTGGCACAGACGCTGATTTCTTCTTTAGAAGTTGCGGATCAACTGGGTCAACGTTGTTAGTACTGATCTTCTACACATTGTATTGATCTGGATTCCTTTGAAACGCTGACCGATAGGCCTGATCTAAATTAACCAGTGCCTATGAATTAAGTTATTGGAGGAATATCATTGAAGGGAAGCAACTTTTGGGTTTTTATTTATTTTTGTAATTTAGGAGATGAATACCTCATGAAAGCCTTGGAAATTGATTATCTTATCCTCGATTGAAGTAGAATTCCACCATCTATGCAAAAGGAGTCTTGAAGCAAGGGAGTTGAAGAATCTATTTTTCTATCATATATCGTGAGAAAAGCTGTGAGAATATTTATGAAGGATCCTCTTGTGATATGAAGGTTCTTGGTTGGGAAGTCAATGGACAAAGTCTAGCAGGAAATATTGGTGAAAAAGAAACGACATAGAATTTTTGGCCGAGTGTTGCATTGTGATGGATAAATGTAACAGTCCAAGCCCACCGTTAGTAAATATTGTCCTCTTTGGGTTTTCCCTCAATATTTTTGAAATGTGTCTGCTAGGGGAGGTTTCCACACTCTTATAAAGAATGCCTCGTTCTCCTCCCCAACCGATGTGGGATCTCACAATCTACCCCCCTGTGAGGCTCAGTGTCCTCACTGGTACTCGTTCCCTTCTCTAATCAATGTGGGACCCTTAGTCTACCCCCTTTGGGGCCCAGTGTCCTTGTTGGCACACTGCCTGTTGTCCACTCTCCCTTCGGGTTTCAGCCTCCTTGTTGGCACATTGCCTGATGTCTGGCTCTGATACCATTTATAACAACCCAAGCCCACCGCTAGCAGATATTGCCCTTTTTGGGCTTTCCCTTTCGGGTTCCCTTCAAGGTTTTTTAAAACGCGTCTGCTAGGAAGAGGTTTCCACACTCTTATAAAGAATGCTTTGTTCTCCTTCCCAACTGATATGGGACCTCACAGTAAATGAAGGTATATAGATCAATCGTGAAATTAGTGTGAGGTTCTGGTTGGTCAAAAGGCGAGTCTAGGCTTACAATGTTGTGGGATGAAGGGAAGATTAAAGTGCTAGAGGTAAGTGCACATTCTCTCTCTCAGTAAAGTTACAATTGAGAGATGAGCTGGATACATGGATTAATCAACATCTATAGTTTGTTTAACCATTGAGATATGGGAGAATTATGAGAAGCATATACGGCAATAGTCAATAGGAGGTGGTGTTGAGTTGGTAATTGTAATGAAGCTATAGTCCGATTAAAAATCAAATGTGGAAGATAAACAAGAAGCATGCAACTATTTTATAATTTGATTGAAGATT
TGTAGTTGCACAAAAGTCCTATTGATCAAGGGATATTTATGCGATTAAACTTAAGAGAATTTGGCGCAACCACTTACTGACACATAGTATCCTGGAATAAGGTAGCTAGATTTCTGGATGCCAGAGGGGTGCAGAAGGCGAGAATAGTTCTTGACCATTCACATTCTTTTTAGAAATTGGATCTTAAACTTGGGGGGATCTTTTTCGTATACGATGCAACACTTATGGACATTAAATGGAGAAGAGTGCAAGGATAAATCTTTGATTATTCTCAAAATTTGTATACAAATTGGATCTTTAGATCGCAGAACTTTTTCTTTCAAGATACAAAAATTTTGGACTTTGCATCCTGGTTTACAAAAAATTGAGGAGGAATATTCGGTAGCATATAACTCAGAAGAAAGACTTAGTTGAGAAAAAAATCTTTGTTTTTTTGGCCATGACCGGAGGTGGAACCAAATATTTCCACTGGATAGATGAATGGAATAGATTATGAAGTTTTAAGGACCACTTCCATAGACTTTTTGCTCAATCTTTGCAAAACAGTCTAAAACTAAGAGCGTGTGGGGTCAACGAGCTCACAATTGGCACCTTTTATTTTAGTTACCCTTTCTTGATAGAGAATTGGAAGATTGAGAGATCCTTGTGAACCTAGTTAGCAATAAGCATCTCTTTCACGTGATTGACCCAAGATTTTGGTCGATCACAGAAGCAATTCTTCTTTGATAATTACTTATTTTCCACTCTAATTGCGGTTGCCCCTCTTTCTTTACATTTTAAATGCTATTAGGAAGAAACATTTGGCCAAGAAATGCAAGGTTTGTTTAATATGGCACAAAATTTAGAACCTAGTTTTGTTTTTAGAATGATTGATCTTCTAATAATTTCTTTTTTTCTTTTTATTCAAGATTTTATAGGGACATATATACACCATTGGAAGTCTAAATAGTTGAAATAATATAGAAGTAAAGGTAAATACTTTGAATAATATAGAAGTAAGGACATAAGAGGGTCATTGTTCACCTCGAGGTATGGGCAAGCTATGTGGATTTGTCTTCCAAATGTAAGAGACAGACACGTCCTCCCACTTTAGCTAAGTTGTAATACCTAACATCTTTTTCCTATCCGAACAACTATTCTTGAAGAAAATACACATCGTGAATGTTGTCTCCTTTATTCAAGCCTATTGCATGCTAAAGACCTGACCATTGACTTTTGCAACTGCTTTGACCTTCCACTTTGATGACCCCACTCACCTAGACGTCTTCATCAATTCATTTTAGCGTCTGACAATTGAGGTTCATGAAAATCATCAACTTTCTCTCTCAAACTTCTAAATTGCTTTTGAATTGATGGATCTTGATTTGACTGATCGAAATTAGTTGTCTTTTGTGAGTTCTCACATCGTTGAAGAAGGGAACGAAACATTCCTTACAAGCGTATGAAAACCTCTTCCTAGTAGACGCATTTTAAAACCGTGAGGCTGATGGTGATATGTAACGGGCCAAAGCGGACAATATTTGCTAGTAGTGAGCTTGTGCTATTACAAATGGTATTAGAGCCAGTCACCGTGCAGTGTGCCAGCAAGGACGGTGGGCCCCCAAGGGGGGCTGGATTGTGAGATCCCACATTGGTTGGAGAGGGGAGCGAAGCATTCCTTATAAGGGTGTGGAAACCTCTTCCTAGTAGACACATTTTAAAACCGTGAGGCGAACGACTATACATAACAGGCCAAACCGGACAATATTTGCTAGCGGTCGGCTTGAGCTGTTACATCTTTGTTGTTGGCTCAACTATACCTCTACCACTACCATTTGCAGCTTTAACAAGCATCCACCATCTATTTGAATCTACGACAGCATTCCATAGACGTTCGTACGATGTAGATATTTTGTCACCATCTGCCATTCCTTAGGAACCTAAACTGTGCTTACACTAGCGTCTACCAGACTCACACCTACAACCTTCATTAGCTAACTTGCTAATAAAGATAAGATTCA
TCTTAATTATCTCCACACTCACTCTGTCTTCAGATTGACATTCCCAATCCCAGTGGGGTTAGAGGTTCTACCATTCCCCATCCTCACTAGGCCATGATGTCCTTCTGTGAAAGATGTTAACAAACTCCTATCTAAAACTATGTGTACAGTAGCTGCACTGTAATCTTCATTCTAATAAGTGGCTACAAGTATTTGTGTCACTATCCATGCAGACTAAGGTGGTTTGACACATCTTCACTGCTAGATTAGACCTCATCATCGTTAAGCATTCTAGGTTCTTCTTTATCCCTTGTATTGCCACTTTGATCTCAACAATATCCTTCTCCGTACCTTCCACTCTGTCCTTGAGCTGTTTGTGCGCCATCCTTTGTGTTCTCCCTGGATGAATGTGCTTTGTTTCCAAATTGTTAGGGATTTAAAGCAACACTGCTCTATTTAGAGTTATTCAATGAATAATGGGTTAATGTAGTTGATTGTATTGGTAATCAAAAACCAAAAATAGAGAAACATAGCCAAACACCCCAGCTATTCGAAAGAGCTGGGATTCTCTCCTATGGCACAATTGTCCACAAAATCTCACACCACAACCAACAACTTAAAAGCCCTAAAAAACCCCTCCTACCAAGTCCGTACAGCAGATGCACGACCCCAAACCAGTTCCCACAAAATCTTTACCCTCATTTCCCCATTTTACCCGTTGATAAGTTTGCATAATAGGAGGTCTCCTACCAACTTCATGGGAATTCGTAATCTCGGTTGATGGCCCATTTCTCTTAATGGATATTTTATATTTTTGGAACAATAACAAAAAATCTATACTTAGTTTCTTATAAAATTTAAAATCACAAAAATATAACATGCACTTTAGTGAGTACGCCTTGCTTCAGGTAGTGTTAAAAATGGGATAGGATGAGGTTGGATATCAATTTCCATCCTAAACTTTGCAAAAAAGACTTCAATCCATGTCTTTGTTACACACCTTGATTTGAGGAATAGGGTTAGGGACAGGGAATCCTTGCTTATGGATTGGCATTCTCTCTAGGGAAAAATATCCATCTATTATATATACATAACGTAACTACTTTAAGTAAAAAATAATTAAATATGATTGCTTAAAAGCTTTAAATTGCTTTTTATGAATTTGATTTGGGTGATTCGAGCCTGTAGTTGATGAAATCTCTTGTTACCAAGCTTGGTGGTACTCTTATAGCCATATTGTGTATCTTCTTGGGTGGTTTGGAAGGATGAAATCCCACAAAGGTGGTAAGTTCTTCAGAGGAACCCTTTTCCTTGGTGGTGTTAGTTCTTCAGAGGAAATCTGGAAGAAAGCGATCCCCCATAGGTGGTAAGTTTCTGCAAAAAGGGCTTAACAAAGATGTTTAGTAACTTCACTTGTTTTTACGTTCTCCTATCCCCTTTTGTGGTGGACATTCTTTTATTTTTTTTTTGACTGTGATGGAGCTTTTCTTGGCTTACCAAGTAAGCCGCCTCTCTTCTTATTTGGCTGCCTGTTTAAAAAGATGGATAGAATCCTTTAGTGTAACACCTTTCTAGCCAACCTCTGTGCTTTTGGCTGGCATTAAATCATAGGATTTTTAGGATGATGAAAGACCTTTAAAGAAAATTTGGGACCATCTTTCTTTAAGTTGGTCTAGCCCTCCTAATTTTATATATATATATACTTGTTATTATCGCATTTCTTCCATTGTAACCAATTATAAGGTCTTCTGGTAATTCTTTGGCTTGGGTTGTGGTCCCTTTTGTACATTCATCCTCTCAATGAAATTGATGTATAAAAGGAGTTGTTTAATGGGGGCGTGGTAAGAAAGAAATAGAAGAGTTTTTGTAGCTGAGGAAAAAGTTTTGGATGTTATTCGGGACAAATTGTGTTTTTGTTTCTTCTTCTTGGAATGCTACCCCAAGGTGTTCTAATCGGGAAGGTTTCTGTTGTTATTTTTTACTTTTGTAGGCATCGTAATATTCCATTCTTACGTTGAAAT
TTGTTATATATATATAATGATACAGCCGCTATGCGATCAGTATTGATTTGATATTTCAATATTTAATTCTGTTTAGAAACTTCAGTTTATATGATGTTTTAACACTTGTTTATTGCGGTTGTAACAGTAATAGAGATCTGGTAGATTTTTTCATCATTCTTAACAAATCTCTTGCAAAGTTCAAGATGTGCCATGATAACTTCCACGTTCATTCCTTAGAAGAGGCAACTACAGCTGAAGATAGTCTCCTAATATATTGCAAACCAGTTGAACTATATAACATTCTTCATCTTCGTTCCCTTAACAATGTAATGTGATGCATCTCCTTGTGTCAAATCGCTTTTTACTCCATACTCTATTTTCTTGCGGATCAGCTTCCTTGATGCTTTTTAATGGACAAAAATTTCTTTAGTTATTCGCAATGAATTGAAAATGTTTTTTTCTTTCAAAGAAATATAATGAATTGAATGCTTTTGTAAGTAAAATATTCACGTGCTTTAAATCATGAGGCCCTTTAGTTCGCAATGTACTTCAAAGTTCATTTTTTTCCAAAACATAATTTTACTTACTCATTTGTAACATTAACTCAATTTTATAAGTTCTGATGAAATTAATTCCTCTGTTGCAGCCTTCTTTTCTTCGACGTTGTTTGCATTACAAACTACAAGCTAGGCGAAAAAAGAGGTCAGTTATTCATTTTCATATTAAATAAAAGGAGCAACCTCTGGCAGATGGCTGTCATCTTTTTAAGTTTAGTCTGGATTCGATACTTTTTAGCCATTATTCTTTAGCTATAGGAGACCTCTAAATTTGATGACAAAGATGTTTATTTTAGTTTGAGTCCGCCTATATTACCGATGTATAGGATATTAAACATGAACTTTGTCATATTTTATATTTCATTATGTTATTTACTCAGGAGGAGATATGTTATTGACTCCGGAGGAAATATATGAAATGACTCCGTTTGGTAGTCATGTTAAGAAAAAATTTTCTATTCCAGAAAAAAAAAAAAAAATTTGATGAATAAGACTTATTTTGCCGGAAATTTTCTTAAAGCGCTTTTCTAAATGAAAGTAGCATTTTTCTGTTTTTGTGTTTCTGTTCAACTGGTTGTTCATATATGATCATTTTCTTTTGTTTTTTTTTTGAAGCATATCTGACTTAAATATGTATAATTGTACAATAATATAGAGATTAAATAGATAAACTCATAAGAACAAAGATACTTTTCCAAATAGTTTTTAGAATTTAAACAATAGAAAATAGTTTTAAGTAGTTAATCTATTAAACATTTTTTCTAAAAAGTATACTCGAATGGTCTTTTATGTTTCATCAAATGGTAAGAACAACATGTTCATGAAACATCAAAGTGGTTTCACAATAACGCCATTGGCTCATGTTTTGCATTAGGCTTACAAATGACAGCTGCAGATCTATTTGCCAACTAAATTAAATGAGAATGAAATTGTTCTTTTGTGCCCTTATCCTTTCTGGTATTTCAATGTGGGTTTCATTTAAATCCAATGGGTGCGTGATGGATATTTTTGGCCTTATGTACAGGGTGAGCACTGGAGTTGTAATTTTCAACTATAGGGACTACAACAACATAGTACGAAAAACTGAAGGTCTCTCCCTGCCTTTCTTTGTAGTGATATATTGAATTTTTTCATCTTTTAAATGCCTTTGATAGAGCTAATTTCTTGCAATAGGTTTAAACCCTATTGACATGCCATGCCTTTAGAAAGACAACTTTTTATTCATTTATGGTTTTATGTGCTGAAATAGTCGCCATAAATCTGTTGATTGTATGTTCTATGCAAGCATATAATCAATTATATTGCTATTAATTGTGTTTTTTATGTCTAATTTGTTAAAACCATTTTTTATTATTTTTGTTTAAATTTAAACTTTGGATAGATATTTTTTTTACTTTATTTATTTTATCCTCATTAATTTAAAAATATGAAATATAATGCACAAATCAAATTATACTTAAA
AAAAGAATTTAAAAAAAAATTGTATATGAATGGTACATTCAACAAAAGATGAAAAATAAAATACATGGAGACAATAAAATATTGTTTTGTGAAGTCTAAATATTTTTGTAGTTACTATTTTTCTTACATACAAACCAATTGGAGAGTATCCTGACCCCCTGAACTTTTCAACTTGGAAGATGCTTGTTTCTTATTAAAATAAATAATCTTTGTTATTTGTTTTTCCTTTCCAGAAAACCATTTTTTAAAATTGTTCTATTAAGTGAGTTTTTTTTTGTTTTTTTTTTTTATTTTAATTAAAAAACTATTTAATTTTTGAAGCAGAACACCATTTTAAACATGATTTTCTAATATATCCTTGTTCTATGACCACAGCTTTCTGATGATTATTAATCATTGGAAGGTTCTCCTTTGTTAGTTTTATGGGGAGGGTTATCTCTACCCTTTTTCCTTAGACTATTCTTTGATTCTTTTGTGGAATGTACGTCTCCCCATTTCTTATGAAATTAAAAATACTAAGTCAGGACAGAGTAGTATGCAAAGAAAAGCAAGTTGCCAGCTCCACAATTTTTTACTAATTTTAAAGGTGATGCTTAATGTAATTTACATGCGTGCATGAGAATATGTGCTTACTGTAAAAATCAAGGTTTTTATTCCAAAATGGGCATTTATATTGAAATTTCCCCTTTTGCTTACGTGTGTAGCATTCATTTATTATATTTCTTCAATTGAGTTGTAAAATAAGTAATAGCTTTGAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAGGAAGAAAGAAGGTAGTTAACTATACATCCTTCCATTAACTTGATGCGGCTGTTGGAACTAAAAGGACCTGTATAATTAGAGCTTACTACGTGCGTTGTATTGATTTATTAGATGTCTACAATTGAGCTATACAATAAGTAGTAGTTAGTTATAGTTTTATAGGTCGTTCCATCCACATTGATGCTGACTGTTGTAACTAAGAATAATTATAGCTAATAATATCTATAGACAATAAGTTCCATTACTTCTTGGAGAAGTATGTGTATATATATATATATGTATATTTATCTGTAAAGACTTGTAGTTTTCCTTTTAAATATAAGATAATCTAACAAAAATGTACAAGTATGTGTTGTAAATCATTTGCTGTACTGATATCTATGTAACTCCCATATTTTGATTAAGAGATGGAAGAGGTATTTATTCTTTTCATTTTTTTCGGTCACTCCTATTTTCCCAAATTCCTCTTCTCCATGGGCATATGTGAGGATGTTTCGTGATTGGATATATCATATGTTACTCATTATGATATTTTAATTGTCATTCCAAGATGATTGCAATGTTTTTGACCTTGAACCACTAATATCTACTTACAAGTCTGTCTAACACTTTTCTTTTTTGTAAAGATCTCTATCTTGTTCTTTGAACGTATAGTTTTGATCCCAATTCTCTAATCATGTAGTGACTGAAGACTTCTCATGTCCATTTTGTTTGATGCTATGCGCAAGCTTTAAGGTAGGGATTCAGACTCCATGATATTCCTTAAATGTGAAGTTAAGGTTTTTGTTTCTTGACCATTTTCTGTTTTTTTTTTTGAAGTTATTGGGTTGAAATTCATATGGAAATTTCTAAAGAAATCATTTTCATAGCAAATATGCCTTTATCTAATTTTTTTCCTTTTATTTATATGCTATTACGATTGTCTTCTTTCAATCATAATGCTAAGGGAATTTATTGGGTTGATGGCCTGGATCTATCTTGTTTCGAAAATTTAGGTGGAAAGCTTAATAATATAAATTCCTTGATATCTCTGAGAGTTTGCATTTGAGCACGCTTATTTTCCATGGAATAGTGGAATATATGTATCAAGAATGATGCCATCTGCATACGGGCATTCGGCATGAGATGGGAGGGTTTAAGAATATGGGTTTTCGGTATCTTATTCTTATGGCTTTAGATA
TCTGGGTATCCCTATCCCTCAGAATATCCTAAATCGGGGGGGTAGTTCATGTTCCTCAATTATGTTGGTTTCCTGAGTTACCATAACCCAGGAGTGATTGAGGTGATATTTTAACCATTCTAGAAACCTTTTACGTGCAGGTGCCGTATTTTTAACCCTTGTTTTTAAATTCTAAAGCTTCCTTTGATTACAAATGTTTTGTTTTAAAAGTTAATTTTTTAAAAAAAATTTAGCCACATAAGTTTTTAATACTAATTTTTTCATTTAAAAATTTGGTCTAGATTTCACCAACATATAAAATTTACATGATTCATTTTAATTGTATAACATAAATAGTTTTTTAAAAATTTAATAAAACTAATAAGGTAAATAATGGCCAACTGCCATTATTTGGGATACGTTTGGAATTCAAAAGTTTATAGAGGACAAATAGGTCTAAAAATGATTCTCAACTACCTCACCTTTTAATATAATAAGATATAGATAAAGATAGCCCTGAAGGGGTGACGCCATTGGCGAAGGCTTTCGGTTTTGAGTAACATCTACCCAATGGTAGCATGTTTGAATCTTTAAGTGGGTTTAATGAGGAAAATTCTTCATGTTTCTTCTTAATTTTCAAGGAGGTACATGTGTCTTTTATGATAAAGAAAGGCTAGCTAAATACCTATATATTAGGTAACAATATAATTACCTAGATATGAGGTAACAATACAGAAACCTTAGATATTAGGTAACAATGTAGATACAATTAAGGTTATGATTTACATAGATATAATTATAGTCATTCCTTCAACACTCTCCCTCGAGCTATGGAATAGATAGGATTATGTGCAATAGTGATCATACGACTTAAATGTCAAGACACGGCTTCAACGGTAGCTATGAAGGCTTTGAATAAATTTGGTCAAGGTGGAAGATGTGCTATGAAGGCTTCAAATAAGTTTTGTCAAGGTGAAAGATCAGTAGCTATGAAGACTCTGAATAAATTTAGTCAAGGTGGAAGATCGGTAGCTTGGTGGAAGATCGGTAGCTTGGTGGAAGATTGGTAACTACAAAGGCTCCAAATAAATTTGGTCAAAGTGGAAGATTGGTAGCTATGAAGGCTACCTCGATTAATCACACTTTCTTGAATAGAATGACTTTGATACCATGTTACAAGTGTAAATTATCTTACTCAAGTTTCTTCCTAATTTGTAGGGAGGTACATGTGCCTTTTATAATAAAGAAACTTAGATACTTAGATGTTAGGGAACAATACAAAATCTCCAGATATAGGTAACAATATAGATAAAATTAAGGCTAGAATTTACATAGATATAATTAAGATCATTCTTTCAATAGTTTCTACTTGTAGAATAATTTTCAAGATAAAGATAGAGATTACTTTTTTTTATCTTCTTACAATGTTGGTAAATTGATTTTGAATTTACTCAAAACAGTGATTTACGCATTATAGGACAATTACTTTGGCCCTTCAACTACCCATTTAAAATTTTAAAAGTTCTATTTGAGAATTTAAACTCTCATCTAAAATTTCTTTTATTTCACTCAGTTCAAACATTCACCAAATAATCAGAAAGCAGCATGCTATAATAGCCCTTCCCTTGTCCCAAAAGGGCGTGTAAAGTATCATTTCCTACATCAGGAGAGTCAAGATCTGTTACAAGCTAGAAACATACCACACGTTATTAAAGATAAATCTGAACTGAAACTTGAAACTTCTACTCCATTGCGTACAATCCGAATCCTCAAAATTACATTTATTCGTGGACAATATCTGGACATTGTTATCCTAATGGGCGTTTCTATAGATTACCTTCAGTCACATTTCAATTTAAAAGGAGCAGCTACCTTTATACATACATAGTTTTAGGTACTCGTGTTTCTAACTACTGTGGTTAATGAATAACTCTCCAATAATTTTGTTCTTACGATAACTTTAACTGCAAACTGACCTAACTGAATGAGCTCATTAACTGAAAATTTACAGAAATTATGCT
TTACCACCAGGCATTGTGTTGTACAAATGTAGATATTCTATATTCGTACTGCTCATTTTGACATTTTTCATGAAGTTTGTTGCTGAGTTTTTTAATCAAATTATTATTATTTTGTTATATTTCCTTTGTTGTACTTGTTTGTGTTGATAATATGTATTTGTATAGCATAATTTTAGCTGGGTCATATAATTTGAAGATTTTGTTGCTTGTTTGGTGGCATACTTGATTGATCCTTTGTGCTTATCTTTTAGGGTTTGCGGTACCACTTAGGCTCTTCCCATGATATGTTCAACTTCGAATACTGGGTTGGTAATCTCCAATGTCTAGATGTTTGCTGCTTATCGAGCCATTATGCGGTTTGTGAAGAAACATTTAACGTGTAGGTTACTGAAGAATATCAGGCAGTGAATGTCTCTGTGAAAATTGATGTCTTTAGACCTGAGGTAAGAGTTTGGAATTCTTATTTTGTGGACAAATCTCTATGGCAACTTTGCTTTTCTTATTGCTATCATAACAAGGCAACTACGTCATGTTCTTTCTTTCCTCTCATCTTTTGCTCGTTTTGTCTCTTCACTCCATCTTTTAGCTCATTTCATGGCATTTTATGCTAGTTAATTAGCAATTTCATTGGATTATCGTGCCAAACCCTCCTCTTGAAGGCTGGTAGTATGTTTTTCTTTGCAGACTGTAGCAGATGGGGTGGACCCACAACTTCAAACTTTCTTCTTGTGGTATGTTTTTTAAACATCATCGTATGAATTCATCTTTTTAGTGCTTTTCCTTATTCTATTGAAATATTTGGCACTCTAATACAGCACAAGACCACGAAGGCGTAAGTTGAAGAACTCTGTTCAGAATGGGAAGTATGTGCAGTTCTTGGAGATGGACTCAGCTAGACCTTCCACAGAAGGCATGCCTAAGGGATTTATTGGGCATAACGCTGGTAATTTTTTTCCGATCCAATGTAAGAATTTTATTTTTCCTTTTTCAATCCTTTAAAGTTTACCTTTAAGATAGAGGAAAAGGGAATGATATAATTCTTTCTTATGTCCTACTAACAGAAGTGGCCAACTACTGTTATGCAATATGTTATCCGGACGTAAACTTGGCTGATCATTTGTGCTTACCTGAGGTTACATGGTGTTTTATATCTAGGAATATTTGGCTGGAGAAATAGAAGCCTCATGAAATGTGGTTCTAAAATTCTTAACAATTTTCCCCCTCGTTTAGTTAAAAAGGGATGGCAAGTATGTTTCGTATGATGTTAAAAAAAAGAAAAGAAAAGAAAAGAAAGAATACGAAGGATACTCCATCATGTTTCATTTATGTAATTATAAATTACACAAAAAGTTCATGGTCAGTTTTAACAGCAGATATTATTCTACAAATGGGAGTTGATTAAATCTTAAATTTTACTAGGGGGTTGAGGATTTTACTGTTCACTGTCCATCCAATTAAAATGAATATTAATTAATTGCATAAATTCATTGATTCCATGCTATATATCATAATTTTGAAAAGCTTATGCCATATCAATTGTAGTAAGCTATTCATTATCTTATTTCTTCGTAGATGGATTTTCTCTTCTTTCCCTTTACATTTTGTGGAAATTGAACTACACGATCACAATAATTTGCTTGGCACTTTAACTGAAATTAAGTTTGTATTTAGATGGTGTATCCTGTGAAAAAGAGGGCAGCCATTCATTCCCCATTGAGGCTTATTTGCAGAATGCACAACAAGGAGAAAATATTGGTCTTGAATGTCCTTCCTCTATGGAGTGCATTGAACATGTTGCATCCAGCTCAAATATTCCAGGTGTCTCAATGGCTATCAGTCAATCTTGTACGGGTCCTGAATGTTATAAAGTACTATCTGGAAGTGATCATTTACATCCAGCCAAAGCAAGGAAGTTAACGGTAGAACGAGACCCAAAAAAGTATGTTTACCATCTTTTCTGGCTTTATTCTTCAGATATCAAAGCTTCATTCACAAGTAGT
CTAGGGCTTAGTTCTGTTCTGGTTGTTTACCTTTTTAGTTTGATTTTTTTTGTGTTTTCTGTTTTTGTTTAACTGGTTACTTATATATTAGAACCAGTTGGAAGTTGCCAACATGAGCCTAGCTCAATGGGCCAAGGCATCTATATTGCCCTTGAGAGGTTATAAGACCCCACCCCACGTCATATAACGTATTTCAAAAAATGTATTGAGCTTTAACCATGTAATTGATTTTGGATGGTGACTTCACTTAGGCTTCCTGTATTTTCTTTGACAAGAAATAAAACTTCGGGGTCTCTAATTTATACCTTTCTGTGATGGGATTTGATTGTCTTAGTAGGGACCGACAGAATTCTAGGATCCCTGCAGTTGCTTGAAAAAAAAAAAAAAAAAAAAAANTTTGATATTATTAGAGGGAAATATTTATCTATTCCATTTTAGTTGGATTGTTACAATTTAGCTTGTAGATTATATCAAGAATTATTATAATTTAGCTTAGATTATGTCAAGAACTATTATTGCCTTACTTGGCTCACCTTGTGCTTTAAAAAGTATTAATTAGAATTTCAGTACAGTGTTTAAAAATCGAATAGATTTTGGATGGGACCATGACTTAGTTCTTTGGTTTAGCAAGAGTCATGATTATTACAAATAAATTTCCTGTCGTTTGGCTTTTGACGTCTTTACACATACACTATTTCTAGTAGAGTGACTTCAGATAACTTACTCGTTTTGGATTTGTTTAGAATGCTTTAATTGGAGTGTTTGATTTCTTTTATCACTTTTTTGAATACTAAACACACAAATGACATGCTTAGGACTCACGAGACATATTGGTGACTCCAAGGACATGTTGAAAACATGTTTTGTAGAGAAAATGAGAAAATTGGTTTATATCATTTTAAATAAAAAAATTGAAACATCTATATCGAGGCTTTAAATATTTAAATTTCCATAAATTGAATAGATGATAATACAAGTCTATACCTTTAAATCTTTACCTCATATAATTCATTATATTCGAAAACATTGAAATATGTGATGTATGTAGAAGTAATTCTCAAAATATTTTTGTTATTGCATGTGTCTTTGTTGTGTATGCCTTTATATTTAGAAAATGGTGCATTGGCATTTTCATATCATATCCATGTTTGCTCTTCAACGAAATCCAAACTTCAAGTTCATCCTTGAATTCTAGAGCTCTCTATTTTTATCATCTTTGAGTCTGCCAGTTAATAAGGGTAGCCAATTTATTATGTTATTTCAGCCGCATGCTTTTGCAGAAACGGCAGTTTTACCACTCTCACAGGGTTCAGGTATGAATCTTGATTTACATTTTACTTTTTTCTTTTGTCACAACCAGTTCATGACCGACAACATTGATGTCTTTCAATGATTAAAGTTTACTTAATTAAGAATGAATTATTACATTTGATGTGGGGTGAGATGGAGGTTGGTCCTGTGCTTTTGTACTGTGTTGAGCAGAAGTGGCGGTGGCCTGGGCTGGAAATTTAAATATGCACTTTTACCGTTTATGTGATATTCCACGTTTGTGGGAAGGAGAACGAAACACCCTTTATAAGTGCGTGGAAACCTCTCTCTAGCAAATGCGTTTTAAAAACCTTGAGGAAAAGCCCGAAAGAAAAAGTTCAAAGAGGACAGTAACTGCTAGCGGTGGGCTTGGACTGTTACAGTTTAAATTAACGATCAAGTTACAGAGATGAGATCTGTAAGAAATGAAAGTGAACTTACTCTTAGAAAAATAGTCCAATCAACTTGTTAGTAAAAGGTGAAAAGTGCCTGCTCATTGCCGAGTTTGGATCATTAGTATTTTTATTTCTTTGTTTTCAGTCCTACGGTTGTATAATTTTTTAACATTTAGTAGTCAAAACAAATACAAGATGTTTATATTAGGCATATTATCAACGTGGGATTAATTTGTTCTAAATAGTTTTGGACAAGTAGTTATAGGCTCAGTACATGAACTTTCCAGGTTGCTGTTC
TGTTAATTAAAAATTTGAAGTGTAGCTAGAATATTTTAGCTACATCTAAATCTAGTTTGCTTCCATCTTACCCAGCACGCTACCATGCTATTGTCAGCCTTTTTGCCTGGACTGAAATCTGTTGCTTGCTCATCTCTATCTTTTGTTTTGATATTGGAAGCCTATGGCACTAGATAAAGTATTATCCGACAAAGATAGTGAGGATGAAGTGGATGATGACATTGCAGATTTTGAAGACCGAAGGGTATGGTTTTCTTTGAACTACTAAATATTTGACTTATTCCTGCTGGGAATCTTTTAGAAGGAATTTTTTTTCGTACGAATTTTTCATCGATCGGTCGTGACCATATCGTTTCCTATACCTAATTTATCGTATCACAATATATCATATCATAAACATGTCTATTTATCACACATTTCGTACCATAACATATATCATACGGTAAACACTTCGTAGCCATATCGTTTCCGTATGCATACCCTAAATCATATCGTTTCATGAACGTATCCTAGTCATACCGAATTGATTTTTCGAACTAGCAATATTGAATATTTTGGTTCGGTATGTGATTACAATCCTATGTGCTACTCTGTTTGGACCGGTTAAATTCACTTTTTTGGAATTTGTGCAAACTTAAAATACTAACTCGAGCTGTTAATTTTCAGATGCTCGATGATTTCGTGGATGTGACCAAAGATGAAAAACGGCTTATGCATCTATGGAACTCCTTTGTTAGAAAGCAAAGGTTGTTCCTTGCCAACCTTAATTTTCTTATTTTTCATTACCTTCTTTCTCTATAATGTTTAATTCTAATTTAACATTGCACCGAACTGAGCTGCACCCTAAATAATACTTGCCTGGCGGTCTCACAATCCCTATTATAAAAAATCACATTATTTTCTGTGTACCAGGGTGCTAGCTGATGGTCACGTTCCTTGGGCATGTGAAGCATTCTCAAAACTTCACGGCAAAGAACTTGTCTCATCTCCGGCTCTCTTTTGGTAATGTTCTCGCACAATTTTAGCATATTTCCGTGGCTGGTTTAGATTCCGGCGGCATTGCATTTTGATATAGGTTGTGATGTTATTGATTATTAGTTTCTTTATTTTTGGATAAGGTGCTAAACTTACATCGAGTAAAAATGAAAAAAAAAAAAAAAAAAAAAAAAGGTATAAAAAACAACCCCATACGAAAGGAGAAAAACTAATACATAAATGGGTTTTAGTCTAGCAAAATGAGACGTAAAGGGTAACTACAAAACCGATCTCAACAACTAAAGGCCAAAAAGTGATATAAAACCTAACAAACCCAAAGATTGCCAAACTATTAAGTAATGTTGCAAGTCTAGAGATAATCTAAATACTAGGTTAACTGTTTATCTAAAGATACTAAATAACCAAAATTTAAATAATGATGTTTAGATACTATGTAACTAAATATAAATAAATATCTAGGTAACTAATTGAAAATAAATCCAATTAGGGGTAAATTAGGGAGGTCAACTATTTCTCTTATTTAATGGAATTACACACAAATATCCTTCTATCTTCTATTGTAAGCAATGGAGGTTTTTTAAATTAATCGGTTCTCCAAGAGGAGTGGGCACGAAAGAGGTGTGAAATGCCGATTGAAGTGCTCCAAGGGGATGGTATACTTTCATGGCTTCAACTAACGAATGTGCTAACAAAGATATCAATAATTTATTCAGAGCCTTCATAGCTATAATTGAAAATTTTATTCGAAGCCTTCATAGCTATCGTTGACAAATTTATTTGGAACCTTCATAAATACCATTGACAAATTTATTCGGAGCCTTCATAGCTACCATGTGCCAATTTATTTGGAGCCTTCATAGCTACCGTTTGCAAATTTATTTGGAGCCTTCATAGCTACCGATGAAGTCGTGCAATCAATATTGCACATAAACCTATCCAGTCCTATTTAGAAGTAGTTGACCTCCCTAATTTATATGTGGATTTATTTAGATTTAGTTACCTAT
ATCTGGGATTATTATTAACATATGACTATTCAGATTTAGTTATCTAGTATCTAGGAATTTTTATTTAGATGTATTGATATTTAGATTATTTAGATTAGATTAGATTGAGATCCAGTTATCATTTCTCAGTTCTCTAAAAATTAATAAGAAAAAGAGTCATCTAATGTATACTTGCAACAATAATAATGTTCCTTCATTATTAGTTGTGTTATTGGCTCTAACAAGCCGAAAGAGAGCCTAATAATAATGTTTGCTCATTATTAGTTGTTTTGTTGGCCGATGCACTATCTTTGGTGTATAGTGTGTATATGTATGCATATAATTCAAACTTCGTTAGATGATCAACTTCCCGAAACAAGGTTACAATATCCAGTGGTTCATTTTCCATCCATTTTAAAGCGAAGCCAATCAGAATAGGAGTAGATTGTTGTTTTCTTCAAGTCCAAATCTCTTTTATGTCAATTTTAGGTATCTTAGTATCTTATAGACAACGTGCTTATTTTCTTCTATAAATTATGGTGGGAGTGGAAAGTTGTAGCCTTTGAGTAAAGAGTCAGACCCCTTGAAAGACAAGAATTTTGCAACTTCTCCTTTGCATACAAATACACTATAGTGATAGGAGTCTGGTAGATAAGTTCATAACTAATTTACAAAGCTTTCTGCAAAGCAATATTTTTTCTTTTTCCCTAACCAGGAACAATGCTTTCATTAAGAAGGAGTTAGGTTTTATGTAAGTGAGATAAATAAGCTTTCCAATTAATCTTTGACGCTGCTCTTTATTAATATTTATGTTTTTATTATTTGGATTCGGACGTGGGATGTGGGTCAACAATTGTAGGGGTAGGATTGAACCATCGACCTTTAAAATAGTAATTAGTATCTTATTTATTGAGCTGTGCTCGGATTCGCAAGGTAAGTGACCCAGAGAGTAATTTTGGGAACTTTTGATCCCTTTCCTGCTGGAATTTGAAAGCAAGTTTAATTAGAGAACTTATTGGCCTTAATGTCAGTTTAAAGCTTGAAAATTGTGTTTACCTGAAAAACCTGGAACCACAGGTCCTATTATCAGTTTATTGATTTTGCAGCCATTTGGAAGTTTGATAAGGTGCTGTGTTAGATGATTTATAAGTTCAAGACTAAACTTATTTCTCCATATATCTTCTGATATGGACATGTAGCGTATGGGTTCTTTCCTCTTTAATTTGAATCATTTGAAAATGCACTAGAAATATTTCCATTGTATGAGTATCAGTTCATCCCACTCATGATTAGAAGAGTCGCTGAACGAACCGATTTCGATTCGAGTTTTCAAATTCGGCACTCTTTAGCTAAAAAAATTCTCCTCTTGAATCACTTGGACTCAAACAATTTATATGCTTGAACATCTTTGGCTGTTGACCTCACTTAAATTTGGCAGGTGTTGGAGGTTTTTCATGATCAAACTCTGGAATCATGGCCTTCTTGATCCTTCTACCATGAACAACTGTAATTTAACTCTCGAAGTATTCAAAGATGAGAGTTTTGATGCAACGGAGAATGGGAGAAGAGAGGATGATTAACAGGCTACTTGAGTTAACATTCTTGATCTGCTTCATTAACTAATCATTTATAGGTTGGAGTTAGCTCCTTTTACATTCATTATTCATATAGGTTCCCTTCTTCACTAACCTCTATTGAATGAAGAAAGAGCCTCCCCATGTTCTTTTTCTAAATTTTTGCCACTTCCTCTACTTTATAACTCTTATTCTCTTCTACTTTTCTTTCTTAATTGAAAGATATTGCATATATTATAGGATGAAATATAAAAAATATACACCAACTTTGCTTCTAA

mRNA sequence

CATTCTTATTTTGTTTTATCTGCAGTGATTTTGAAAGTTTTTATTTGGCGACGCGTCGTCGATCAGCTACCGGAAGAACATAAATTTTGGAGGGAGAATCGTTTCCAATCTCCCCTTCTGGTTTCGACATAACAACCAATTTCTTCTACTCTTCAACCACTCTGCAATTCTGAAACCCATACTTTTTTCCTCGGAATTTCCAACTAGTTTGCCCGCCGGTTCCATTGTTTCTGTAATAGAGATCTGGTAGATTTTTTCATCATTCTTAACAAATCTCTTGCAAAGTTCAAGATGTGCCATGATAACTTCCACGTTCATTCCTTAGAAGAGGCAACTACAGCTGAAGATAGTCTCCTAATATATTGCAAACCAGTTGAACTATATAACATTCTTCATCTTCGTTCCCTTAACAATCCTTCTTTTCTTCGACGTTGTTTGCATTACAAACTACAAGCTAGGCGAAAAAAGAGGGTGAGCACTGGAGTTGTAATTTTCAACTATAGGGACTACAACAACATAGTACGAAAAACTGAAGTGACTGAAGACTTCTCATGTCCATTTTGTTTGATGCTATGCGCAAGCTTTAAGGGTTTGCGGTACCACTTAGGCTCTTCCCATGATATGTTCAACTTCGAATACTGGGTTACTGAAGAATATCAGGCAGTGAATGTCTCTGTGAAAATTGATGTCTTTAGACCTGAGACTGTAGCAGATGGGGTGGACCCACAACTTCAAACTTTCTTCTTGTGCACAAGACCACGAAGGCGTAAGTTGAAGAACTCTGTTCAGAATGGGAAGTATGTGCAGTTCTTGGAGATGGACTCAGCTAGACCTTCCACAGAAGGCATGCCTAAGGGATTTATTGGGCATAACGCTGATGGTGTATCCTGTGAAAAAGAGGGCAGCCATTCATTCCCCATTGAGGCTTATTTGCAGAATGCACAACAAGGAGAAAATATTGGTCTTGAATGTCCTTCCTCTATGGAGTGCATTGAACATGTTGCATCCAGCTCAAATATTCCAGGTGTCTCAATGGCTATCAGTCAATCTTGTACGGGTCCTGAATGTTATAAAGTACTATCTGGAAGTGATCATTTACATCCAGCCAAAGCAAGGAAGTTAACGGTAGAACGAGACCCAAAAAACCGCATGCTTTTGCAGAAACGGCAGTTTTACCACTCTCACAGGGTTCAGCCTATGGCACTAGATAAAGTATTATCCGACAAAGATAGTGAGGATGAAGTGGATGATGACATTGCAGATTTTGAAGACCGAAGGATGCTCGATGATTTCGTGGATGTGACCAAAGATGAAAAACGGCTTATGCATCTATGGAACTCCTTTGTTAGAAAGCAAAGGGTGCTAGCTGATGGTCACGTTCCTTGGGCATGTGAAGCATTCTCAAAACTTCACGGCAAAGAACTTGTCTCATCTCCGGCTCTCTTTTGGTGTTGGAGGTTTTTCATGATCAAACTCTGGAATCATGGCCTTCTTGATCCTTCTACCATGAACAACTGTAATTTAACTCTCGAAGTATTCAAAGATGAGAGTTTTGATGCAACGGAGAATGGGAGAAGAGAGGATGATTAACAGGCTACTTGAGTTAACATTCTTGATCTGCTTCATTAACTAATCATTTATAGGTTGGAGTTAGCTCCTTTTACATTCATTATTCATATAGGTTCCCTTCTTCACTAACCTCTATTGAATGAAGAAAGAGCCTCCCCATGTTCTTTTTCTAAATTTTTGCCACTTCCTCTACTTTATAACTCTTATTCTCTTCTACTTTTCTTTCTTAATTGAAAGATATTGCATATATTATAGGATGAAATATAAAAAATATACACCAACTTTGCTTCTAA

Coding sequence (CDS)

ATGTGCCATGATAACTTCCACGTTCATTCCTTAGAAGAGGCAACTACAGCTGAAGATAGTCTCCTAATATATTGCAAACCAGTTGAACTATATAACATTCTTCATCTTCGTTCCCTTAACAATCCTTCTTTTCTTCGACGTTGTTTGCATTACAAACTACAAGCTAGGCGAAAAAAGAGGGTGAGCACTGGAGTTGTAATTTTCAACTATAGGGACTACAACAACATAGTACGAAAAACTGAAGTGACTGAAGACTTCTCATGTCCATTTTGTTTGATGCTATGCGCAAGCTTTAAGGGTTTGCGGTACCACTTAGGCTCTTCCCATGATATGTTCAACTTCGAATACTGGGTTACTGAAGAATATCAGGCAGTGAATGTCTCTGTGAAAATTGATGTCTTTAGACCTGAGACTGTAGCAGATGGGGTGGACCCACAACTTCAAACTTTCTTCTTGTGCACAAGACCACGAAGGCGTAAGTTGAAGAACTCTGTTCAGAATGGGAAGTATGTGCAGTTCTTGGAGATGGACTCAGCTAGACCTTCCACAGAAGGCATGCCTAAGGGATTTATTGGGCATAACGCTGATGGTGTATCCTGTGAAAAAGAGGGCAGCCATTCATTCCCCATTGAGGCTTATTTGCAGAATGCACAACAAGGAGAAAATATTGGTCTTGAATGTCCTTCCTCTATGGAGTGCATTGAACATGTTGCATCCAGCTCAAATATTCCAGGTGTCTCAATGGCTATCAGTCAATCTTGTACGGGTCCTGAATGTTATAAAGTACTATCTGGAAGTGATCATTTACATCCAGCCAAAGCAAGGAAGTTAACGGTAGAACGAGACCCAAAAAACCGCATGCTTTTGCAGAAACGGCAGTTTTACCACTCTCACAGGGTTCAGCCTATGGCACTAGATAAAGTATTATCCGACAAAGATAGTGAGGATGAAGTGGATGATGACATTGCAGATTTTGAAGACCGAAGGATGCTCGATGATTTCGTGGATGTGACCAAAGATGAAAAACGGCTTATGCATCTATGGAACTCCTTTGTTAGAAAGCAAAGGGTGCTAGCTGATGGTCACGTTCCTTGGGCATGTGAAGCATTCTCAAAACTTCACGGCAAAGAACTTGTCTCATCTCCGGCTCTCTTTTGGTGTTGGAGGTTTTTCATGATCAAACTCTGGAATCATGGCCTTCTTGATCCTTCTACCATGAACAACTGTAATTTAACTCTCGAAGTATTCAAAGATGAGAGTTTTGATGCAACGGAGAATGGGAGAAGAGAGGATGATTAA

Protein sequence

MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKKRVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVTEEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYVQFLEMDSARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQGENIGLECPSSMECIEHVASSSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVERDPKNRMLLQKRQFYHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEVFKDESFDATENGRREDD
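The protein sequence above is consistent with a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code and using only the first seven codons of the CDS listed above. The `translate` helper is a hypothetical illustration, not part of the database's own tooling, and it does not handle ambiguous bases such as N.

```python
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
# (each codon position cycles through the bases in T, C, A, G order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        a, b, c = (BASES.index(base) for base in cds[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First seven codons of the CDS shown above.
print(translate("ATGTGCCATGATAACTTCCAC"))  # MCHDNFH
```

Passing the full CDS string to the same function would be expected to yield the protein followed by a trailing `*` for the terminal TAA stop codon.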
BLAST of Cp4.1LG08g07180 vs. Swiss-Prot
Match: VRN2_ARATH (Polycomb group protein VERNALIZATION 2 OS=Arabidopsis thaliana GN=VRN2 PE=1 SV=2)

HSP 1 Score: 410.2 bits (1053), Expect = 2.8e-113
Identity = 219/431 (50.81%), Positives = 285/431 (66.13%), Query Frame = 1

Query: 1   MCHDNFHVHSL-EEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKK 60
           MC  N    S  EE  + +++LLIYCKPV LYNI HLRSL NPSFL RCL+YK+ A+RK+
Sbjct: 1   MCRQNCRAKSSPEEVISTDENLLIYCKPVRLYNIFHLRSLGNPSFLPRCLNYKIGAKRKR 60

Query: 61  RV-STGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWV 120
           +  STG+V+FNY+D NN +++TEV ED SCPFC MLC SFKGL++HL SSHD+F FE+ +
Sbjct: 61  KSRSTGMVVFNYKDCNNTLQRTEVREDCSCPFCSMLCGSFKGLQFHLNSSHDLFEFEFKL 120

Query: 121 TEEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYVQ--FLEM 180
            EEYQ VNVSVK++ F  E      D + + F LC++PR+R+ +    N + ++  FL +
Sbjct: 121 LEEYQTVNVSVKLNSFIFEEEGSD-DDKFEPFSLCSKPRKRRQRGGRNNTRRLKVCFLPL 180

Query: 181 DSARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQGENIGLECPSSMECIEH 240
           DS  PS             +G++   +G                 N GL  P + E    
Sbjct: 181 DS--PS-------LANGTENGIALLNDG-----------------NRGLGYPEATELAGQ 240

Query: 241 VASSSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVER-DPKNRMLLQKRQFY 300
              +SNIP    AI+ S        +L+    +   K RKL+ ER + ++ +LLQKRQFY
Sbjct: 241 FEMTSNIP---PAIAHSSLDAGAKVILTTEAVVPATKTRKLSAERSEARSHLLLQKRQFY 300

Query: 301 HSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQ 360
           HSHRVQPMAL++V+SD+DSEDEVDDD+ADFEDR+MLDDFVDV KDEK+ MHLWNSFVRKQ
Sbjct: 301 HSHRVQPMALEQVMSDRDSEDEVDDDVADFEDRQMLDDFVDVNKDEKQFMHLWNSFVRKQ 360

Query: 361 RVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEV 420
           RV+ADGH+ WACE FS+ + KEL    +LFWCWR F+IKLWNHGL+D +T+NNCN  LE 
Sbjct: 361 RVIADGHISWACEVFSRFYEKELHCYSSLFWCWRLFLIKLWNHGLVDSATINNCNTILEN 401

Query: 421 FKDESFDATEN 427
            ++ S     N
Sbjct: 421 CRNTSVTNNNN 401
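The Identity and Positives percentages in each hit header are the match counts divided by the alignment length. A short sketch of that arithmetic, using the counts reported for the VRN2_ARATH hit above; the `percent` helper is a hypothetical name introduced only for illustration.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns, as a percentage rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts from the VRN2_ARATH hit: 219 identical and 285 positive
# columns out of an alignment length of 431.
print(percent(219, 431))  # 50.81
print(percent(285, 431))  # 66.13
```

The alignment length (431 here) counts gap columns as well, which is why it can differ from the number of query residues covered.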

BLAST of Cp4.1LG08g07180 vs. Swiss-Prot
Match: EMF2_ARATH (Polycomb group protein EMBRYONIC FLOWER 2 OS=Arabidopsis thaliana GN=EMF2 PE=1 SV=2)

HSP 1 Score: 387.9 bits (995), Expect = 1.5e-106
Identity = 200/359 (55.71%), Positives = 248/359 (69.08%), Query Frame = 1

Query: 60  RVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVT 119
           R+ TG V+FNYR YNN ++KTEVTEDFSCPFCL+ CASFKGLRYHL S+HD+ NFE+WVT
Sbjct: 298 RLRTGNVVFNYRYYNNKLQKTEVTEDFSCPFCLVKCASFKGLRYHLPSTHDLLNFEFWVT 357

Query: 120 EEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRP-RRRKLKNSVQNGKYVQFLEMDS 179
           EE+QAVNVS+K +    +   D VDP+ QTFF  ++  RRR+ K+ V++ +    L    
Sbjct: 358 EEFQAVNVSLKTETMISKVNEDDVDPKQQTFFFSSKKFRRRRQKSQVRSSRQGPHL---- 417

Query: 180 ARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAY--LQNAQQGENIGLECPSSMECIEH 239
                 G+    +    D  S   E S   P + Y  +  A+ G+ +             
Sbjct: 418 ------GLGCEVLDKTDDAHSVRSEKSRIPPGKHYERIGGAESGQRVP------------ 477

Query: 240 VASSSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVER-DPKNRMLLQKRQFY 299
                  PG S A  QSC  P+  + ++GS  L  AK RK+++ER D +NR LLQKRQF+
Sbjct: 478 -------PGTSPADVQSCGDPDYVQSIAGSTMLQFAKTRKISIERSDLRNRSLLQKRQFF 537

Query: 300 HSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQ 359
           HSHR QPMAL++VLSD+DSEDEVDDD+ADFEDRRMLDDFVDVTKDEK++MH+WNSFVRKQ
Sbjct: 538 HSHRAQPMALEQVLSDRDSEDEVDDDVADFEDRRMLDDFVDVTKDEKQMMHMWNSFVRKQ 597

Query: 360 RVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLE 415
           RVLADGH+PWACEAFS+LHG  +V +P L WCWR FM+KLWNHGLLD  TMNNCN  LE
Sbjct: 598 RVLADGHIPWACEAFSRLHGPIMVRTPHLIWCWRVFMVKLWNHGLLDARTMNNCNTFLE 627

BLAST of Cp4.1LG08g07180 vs. Swiss-Prot
Match: FIS2L_ARATH (Polycomb group protein FERTILIZATION-INDEPENDENT SEED 2 OS=Arabidopsis thaliana GN=FIS2 PE=1 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 2.3e-35
Identity = 74/150 (49.33%), Positives = 102/150 (68.00%), Query Frame = 1

Query: 272 AKARKLTVERDPKNRM-LLQKRQFYHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRM 331
           ++ ++L  ER    R+  L+ RQFYHS  +QPM  ++V+S++DSE+E DD   D  +R  
Sbjct: 643 SRRKELHAERCEAKRLERLKGRQFYHSQTMQPMTFEQVMSNEDSENETDDYALDISERLR 702

Query: 332 LDDFVDVTKDEKRLMHLWNSFVRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRF 391
           L+  V V+K+EKR M+LWN FVRKQRV+ADGHVPWACE F+KLH +E+ +S +  W WR 
Sbjct: 703 LERLVGVSKEEKRYMYLWNIFVRKQRVIADGHVPWACEEFAKLHKEEMKNSSSFDWWWRM 762

Query: 392 FMIKLWNHGLLDPSTMNNCNLTLEVFKDES 421
           F IKLWN+GL+   T + C   L    DE+
Sbjct: 763 FRIKLWNNGLICAKTFHKCTTILLSNSDEA 792

BLAST of Cp4.1LG08g07180 vs. Swiss-Prot
Match: FIS2C_ARATH (Polycomb group protein FERTILIZATION-INDEPENDENT SEED 2 OS=Arabidopsis thaliana GN=FIS2 PE=2 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 6.7e-35
Identity = 79/181 (43.65%), Positives = 110/181 (60.77%), Query Frame = 1

Query: 240 SSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVERDPKNRMLLQKRQFYHSHR 299
           +SNI   +       + P+  +V S    LH  +     +ER       L+ RQFYHS  
Sbjct: 562 TSNILATTQPAKAEPSEPKVTRV-SRRKELHAERCEAKRLER-------LKGRQFYHSQT 621

Query: 300 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 359
           +QP+  ++V+S++DSE+E DD   D  +R  L+  V V+K+EKR M+LWN FVRKQRV+A
Sbjct: 622 MQPITFEQVMSNEDSENETDDYALDISERLRLERLVGVSKEEKRYMYLWNIFVRKQRVIA 681

Query: 360 DGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEVFKDE 419
           DGHVPWACE F+KLH +E+ +S +  W WR F IKLWN+GL+   T + C   L    DE
Sbjct: 682 DGHVPWACEEFAKLHKEEMKNSSSFDWWWRMFRIKLWNNGLICAKTFHKCTTILLSNSDE 734

Query: 420 S 421
           +
Sbjct: 742 A 734

BLAST of Cp4.1LG08g07180 vs. Swiss-Prot
Match: SZ12A_DANRE (Polycomb protein suz12-A OS=Danio rerio GN=suz12a PE=2 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 3.8e-06
Identity = 37/129 (28.68%), Positives = 62/129 (48.06%), Query Frame = 1

Query: 292 RQFYHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSF 351
           R ++HS    P+   ++  + DSEDE D D    +    +++F DV + EK +M LWN  
Sbjct: 528 RLYFHSDSCTPLRPQEM--EVDSEDERDPDWLREKTAMQIEEFTDVNEGEKEIMKLWNLL 587

Query: 352 VRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNL 411
           V K   +AD  +  AC +F + HG  +V    L       +I + + GL+  +T++    
Sbjct: 588 VMKHGFIADNQMNQACMSFVEQHGTIMVEK-NLCRNALLHLINMHDFGLITTATIDKAMT 647

Query: 412 TLEVFKDES 421
            L     +S
Sbjct: 648 HLRDLTQQS 653

BLAST of Cp4.1LG08g07180 vs. TrEMBL
Match: A0A0A0L461_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G017280 PE=4 SV=1)

HSP 1 Score: 818.9 bits (2114), Expect = 2.9e-234
Identity = 397/433 (91.69%), Positives = 411/433 (94.92%), Query Frame = 1

Query: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKKR 60
           MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRK+R
Sbjct: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKER 60

Query: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVTE 120
           VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHL SSHDMFNFEYWVTE
Sbjct: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLCSSHDMFNFEYWVTE 120

Query: 121 EYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYVQFLEMDSAR 180
           EYQAVNVSVK+DVFRPE VADGVDPQLQTFF CTRPR+RKLKNS+QNGKYVQFLEMDS  
Sbjct: 121 EYQAVNVSVKVDVFRPENVADGVDPQLQTFFFCTRPRKRKLKNSIQNGKYVQFLEMDSPG 180

Query: 181 PSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQ-GENIGLECPSSMECIEHVAS 240
           P+TEGM KGF+GHNADGVSCEKEGSHSFPIE YLQNAQQ GENIG E PSSMECIE VAS
Sbjct: 181 PATEGMHKGFVGHNADGVSCEKEGSHSFPIETYLQNAQQDGENIGPEGPSSMECIERVAS 240

Query: 241 SSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVERDPKNRMLLQKRQFYHSHR 300
           SSNIPG S+AI+QS TGPECYKVLSG+DHL PAKARKLTVERDP+NRMLLQKRQFYHSHR
Sbjct: 241 SSNIPGFSVAINQSSTGPECYKVLSGNDHLQPAKARKLTVERDPRNRMLLQKRQFYHSHR 300

Query: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 360
           VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA
Sbjct: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 360

Query: 361 DGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEVFKDE 420
           DGHVPWACEAFSKLHGKEL+SSP LFWCWR FMIKLWNHGLLD STMNNCNLTLE FKDE
Sbjct: 361 DGHVPWACEAFSKLHGKELISSPPLFWCWRLFMIKLWNHGLLDASTMNNCNLTLEGFKDE 420

Query: 421 SFDATENGRREDD 433
           S +AT+N   +DD
Sbjct: 421 SSNATKNCGGDDD 433

BLAST of Cp4.1LG08g07180 vs. TrEMBL
Match: B9T283_RICCO (Polycomb protein embryonic flower, putative OS=Ricinus communis GN=RCOM_0191350 PE=4 SV=1)

HSP 1 Score: 571.2 bits (1471), Expect = 1.0e-159
Identity = 294/440 (66.82%), Positives = 338/440 (76.82%), Query Frame = 1

Query: 1   MCHDNFHVH-SLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKK 60
           MCH N  VH S+EEA  A++SLLIYCKPVELYNILH R+  NPSFLRRCL YK++A RK+
Sbjct: 37  MCHQNSCVHLSVEEAIAADESLLIYCKPVELYNILHRRAQFNPSFLRRCLRYKMKASRKR 96

Query: 61  RVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVT 120
           R++ G+VIFNYRDYNN ++KTEVTEDF CPFC M+C SFKGLRYHL SSHD+FNFE+WV 
Sbjct: 97  RLAAGIVIFNYRDYNNKLQKTEVTEDFLCPFCSMMCMSFKGLRYHLCSSHDLFNFEFWVN 156

Query: 121 EEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYV--QFLEMD 180
           EEYQAVNVSVK+D F  ETVADGV+ + QTFF C+RPRRRK +N   N K V  QFLE+D
Sbjct: 157 EEYQAVNVSVKLDRFLSETVADGVEQRQQTFFFCSRPRRRKSRNHDHNEKQVCVQFLELD 216

Query: 181 SARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQGENIGLECPSSMECIEHV 240
           S +   EG+ +GF+                          +  EN G+E P+ +E IE V
Sbjct: 217 SPKLPLEGINEGFL-------------------------RKDDENYGVEYPN-VELIERV 276

Query: 241 ASSSNIPGVSMAISQSCTGPECYKVLSGSDH-----LHPAKARKLTVER-DPKNRMLLQK 300
           ASSSNI GVS+A +QS    EC K L GSDH     +H AKARKLTVER DP+NR+LLQK
Sbjct: 277 ASSSNILGVSIAKAQSSVDSECVKSLCGSDHSLPAGIHVAKARKLTVERSDPRNRVLLQK 336

Query: 301 RQFYHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSF 360
           RQFYHSHRVQPMAL++V+SD+DSEDEVDDDIADFEDRRMLDDFVDV+KDEK+LMH WNSF
Sbjct: 337 RQFYHSHRVQPMALEQVMSDRDSEDEVDDDIADFEDRRMLDDFVDVSKDEKQLMHFWNSF 396

Query: 361 VRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNL 420
           VRKQRVLADGHVPWACEAFSKLHG+ELV SPALFWCWR FMIKLWN GLLD  TMNNCNL
Sbjct: 397 VRKQRVLADGHVPWACEAFSKLHGQELVGSPALFWCWRLFMIKLWNQGLLDACTMNNCNL 450

Query: 421 TLEVFKDESFDATENGRRED 432
            LE  +DE  DA +  +  D
Sbjct: 457 ILERSRDEGSDAMKTEKGND 450

BLAST of Cp4.1LG08g07180 vs. TrEMBL
Match: B9GYN4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s07930g PE=4 SV=2)

HSP 1 Score: 569.7 bits (1467), Expect = 3.0e-159
Identity = 297/442 (67.19%), Positives = 344/442 (77.83%), Query Frame = 1

Query: 1   MCHDNFHVH--SLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRK 60
           MCH N  V   S+EEA  A++SLLIYCKPVELYNIL  R+ +NPSFLRRCL YK++ RRK
Sbjct: 1   MCHQNCGVEHLSVEEAIAADESLLIYCKPVELYNILRRRAQDNPSFLRRCLRYKIKERRK 60

Query: 61  KRVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWV 120
           KR+  G+VIFNY+DY N++RKTE TEDFSCPFCLM C SFKGLRYHL SSHD+FNF++WV
Sbjct: 61  KRLRDGIVIFNYKDYKNMLRKTEATEDFSCPFCLMQCLSFKGLRYHLCSSHDLFNFDFWV 120

Query: 121 TEEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYV--QFLEM 180
           TEEYQAV VSV ID F  ETVADG++ + QTFF C++PR RK  N  QN K V  +FLE+
Sbjct: 121 TEEYQAVTVSVNIDRFISETVADGIEQRQQTFFFCSKPRTRKSINLDQNVKKVSIKFLEL 180

Query: 181 DSARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQG-ENIGLECPSSMECIE 240
           +S+    EG   GF+G   +G +  K  S     E  L N + G EN G ECP++ E +E
Sbjct: 181 NSS----EGTNNGFLGKE-EGENASKSSSS----EKDLLNMRDGTENYGSECPTATELME 240

Query: 241 HVASSSNIPGVSMAISQSCTGPECYKVLSGSDH-----LHPAKARKLTVER-DPKNRMLL 300
            VASS +IPGVS+A +QS   PEC K  SGS+      LH AKARKLTVER DPK R LL
Sbjct: 241 RVASSFSIPGVSIAQAQSSVDPECVKSQSGSEPSLPAALHVAKARKLTVERSDPKYRALL 300

Query: 301 QKRQFYHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWN 360
           QKRQFYHSHRVQPMAL++V+SD+DSEDEVDDDIADFEDRRMLDDFVDV+KDEK++MHLWN
Sbjct: 301 QKRQFYHSHRVQPMALEQVMSDRDSEDEVDDDIADFEDRRMLDDFVDVSKDEKQVMHLWN 360

Query: 361 SFVRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNC 420
           SFVRKQRVLADGHVPWACEAFSKLHG+ELV SPALFWCWR FMIKLWNHGLLD STMNNC
Sbjct: 361 SFVRKQRVLADGHVPWACEAFSKLHGQELVISPALFWCWRLFMIKLWNHGLLDASTMNNC 420

Query: 421 NLTLEVFKDESFDATENGRRED 432
           N+ LE  +DE   A ++ R ED
Sbjct: 421 NMILERCRDEGSGAAKSERLED 433

BLAST of Cp4.1LG08g07180 vs. TrEMBL
Match: A0A0B2SCY3_GLYSO (Polycomb group protein VERNALIZATION 2 OS=Glycine soja GN=glysoja_018911 PE=4 SV=1)

HSP 1 Score: 564.3 bits (1453), Expect = 1.3e-157
Identity = 284/432 (65.74%), Positives = 341/432 (78.94%), Query Frame = 1

Query: 1   MCHDNFHVHSL-EEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKK 60
           MC  N  VH   EE   A++SLLIYCKPVELYNIL+ R+L NPSFLRRCL YK++A RK+
Sbjct: 1   MCRQNSPVHHAGEEEIAADESLLIYCKPVELYNILYRRALQNPSFLRRCLRYKIRASRKR 60

Query: 61  RVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVT 120
           R+  G+VIFNYRD  NI+RKTEVTEDFSCPFCLM C +FKGLR+HL SSHD+FNFE+WVT
Sbjct: 61  RLRAGIVIFNYRDRYNILRKTEVTEDFSCPFCLMQCGNFKGLRFHLCSSHDLFNFEFWVT 120

Query: 121 EEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKY--VQFLEMD 180
           E+YQAVNVSVKID+ R E VADGV PQ QTFF C+RPR+R+ K+SVQ  K   V+FLE+D
Sbjct: 121 EDYQAVNVSVKIDILRSENVADGVIPQSQTFFFCSRPRKRRRKDSVQIEKRTNVKFLELD 180

Query: 181 SARPSTEGMPKGFIGHNADGVSCEKEG-SHSFPIEAYLQNAQQ-GENIGLECPSSMECIE 240
           S     EG+  GF+  + D +SC+ E  S +   E  L + +  G   G + P +M+ +E
Sbjct: 181 SP----EGIHNGFLQKDDDILSCKGENVSRTSRSEKILPSGRNDGGKFGPDHPGTMDNLE 240

Query: 241 HVASSSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVER-DPKNRMLLQKRQF 300
           HV SS NIPGVS+A+ QS   PEC K +  SD   PAK +KL+++R D +NRMLLQKR F
Sbjct: 241 HVESSFNIPGVSIAMPQSSVDPECSKSICKSDPALPAKTKKLSMDRSDSRNRMLLQKRLF 300

Query: 301 YHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRK 360
           +HSHRVQPMAL++VLSD+DSEDEVDDDIAD EDRRMLDDFVDV+KDEK+LMHLWNSF+RK
Sbjct: 301 FHSHRVQPMALEQVLSDRDSEDEVDDDIADLEDRRMLDDFVDVSKDEKQLMHLWNSFMRK 360

Query: 361 QRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLE 420
           QRVLADGHVPWACEAFSKLHGKEL+SSPALFWCWR FMIKLWNHGLLD  TMNNC++ L+
Sbjct: 361 QRVLADGHVPWACEAFSKLHGKELISSPALFWCWRLFMIKLWNHGLLDACTMNNCSIVLD 420

Query: 421 VFKDESFDATEN 427
            +++E  D  +N
Sbjct: 421 SYRNEGSDTRKN 428

BLAST of Cp4.1LG08g07180 vs. TrEMBL
Match: I1J9S4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_01G206000 PE=4 SV=1)

HSP 1 Score: 560.5 bits (1443), Expect = 1.8e-156
Identity = 281/432 (65.05%), Positives = 337/432 (78.01%), Query Frame = 1

Query: 1   MCHDNFHVHSL-EEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKK 60
           MC  N  VH   EE   A++SLLIYCKPVELYNIL+ R+L NPSFLRRCL YK++A RK+
Sbjct: 1   MCRQNSPVHHAGEEEIAADESLLIYCKPVELYNILYRRALQNPSFLRRCLRYKIRASRKR 60

Query: 61  RVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVT 120
           R+  G+VIFNYRD  NI+RKTEVTEDFSCPFCLM C +FKGLR+HL SSHD+FNFE+WVT
Sbjct: 61  RLRAGIVIFNYRDRYNILRKTEVTEDFSCPFCLMQCGNFKGLRFHLCSSHDLFNFEFWVT 120

Query: 121 EEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKY--VQFLEMD 180
           E+YQAVNVSVKID+ R E VADGV PQ QTFF C+RPR+R+ K+SVQ  K   V+FLE+D
Sbjct: 121 EDYQAVNVSVKIDILRSENVADGVIPQSQTFFFCSRPRKRRRKDSVQIEKRTNVKFLELD 180

Query: 181 SARPSTEGMPKGFIGHNADGVSCEKEGSH--SFPIEAYLQNAQQGENIGLECPSSMECIE 240
           S     EG+  GF+  + D +SC+ E     S   + +      G   G + P +M+ +E
Sbjct: 181 SP----EGIHNGFLQKDDDILSCKGENVSRTSRSEKIFPSGRNDGGKFGPDHPGTMDNLE 240

Query: 241 HVASSSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVER-DPKNRMLLQKRQF 300
           HV SS NIPGVS+A+ QS   PEC K +  SD   PAK +KL+++R D +NRMLLQKR F
Sbjct: 241 HVESSFNIPGVSIAMPQSSVDPECSKSICKSDPALPAKTKKLSMDRSDSRNRMLLQKRLF 300

Query: 301 YHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRK 360
           +HSHRVQPMAL++VLSD+DSEDEVDDDIAD EDRRMLDDFVDV+KDEK+LMHLWNSF+RK
Sbjct: 301 FHSHRVQPMALEQVLSDRDSEDEVDDDIADLEDRRMLDDFVDVSKDEKQLMHLWNSFMRK 360

Query: 361 QRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLE 420
           QRVLADGHVPWACEAFSKLHGKEL+SSPALFWCWR FMIKLWNHGLLD  TMNNC++ L+
Sbjct: 361 QRVLADGHVPWACEAFSKLHGKELISSPALFWCWRLFMIKLWNHGLLDACTMNNCSIVLD 420

Query: 421 VFKDESFDATEN 427
            +++E     +N
Sbjct: 421 SYRNEGSGTRKN 428

BLAST of Cp4.1LG08g07180 vs. TAIR10
Match: AT4G16845.1 (AT4G16845.1 VEFS-Box of polycomb protein)

HSP 1 Score: 410.2 bits (1053), Expect = 1.6e-114
Identity = 219/431 (50.81%), Positives = 285/431 (66.13%), Query Frame = 1

Query: 1   MCHDNFHVHSL-EEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKK 60
           MC  N    S  EE  + +++LLIYCKPV LYNI HLRSL NPSFL RCL+YK+ A+RK+
Sbjct: 1   MCRQNCRAKSSPEEVISTDENLLIYCKPVRLYNIFHLRSLGNPSFLPRCLNYKIGAKRKR 60

Query: 61  RV-STGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWV 120
           +  STG+V+FNY+D NN +++TEV ED SCPFC MLC SFKGL++HL SSHD+F FE+ +
Sbjct: 61  KSRSTGMVVFNYKDCNNTLQRTEVREDCSCPFCSMLCGSFKGLQFHLNSSHDLFEFEFKL 120

Query: 121 TEEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYVQ--FLEM 180
            EEYQ VNVSVK++ F  E      D + + F LC++PR+R+ +    N + ++  FL +
Sbjct: 121 LEEYQTVNVSVKLNSFIFEEEGSD-DDKFEPFSLCSKPRKRRQRGGRNNTRRLKVCFLPL 180

Query: 181 DSARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQGENIGLECPSSMECIEH 240
           DS  PS             +G++   +G                 N GL  P + E    
Sbjct: 181 DS--PS-------LANGTENGIALLNDG-----------------NRGLGYPEATELAGQ 240

Query: 241 VASSSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVER-DPKNRMLLQKRQFY 300
              +SNIP    AI+ S        +L+    +   K RKL+ ER + ++ +LLQKRQFY
Sbjct: 241 FEMTSNIP---PAIAHSSLDAGAKVILTTEAVVPATKTRKLSAERSEARSHLLLQKRQFY 300

Query: 301 HSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQ 360
           HSHRVQPMAL++V+SD+DSEDEVDDD+ADFEDR+MLDDFVDV KDEK+ MHLWNSFVRKQ
Sbjct: 301 HSHRVQPMALEQVMSDRDSEDEVDDDVADFEDRQMLDDFVDVNKDEKQFMHLWNSFVRKQ 360

Query: 361 RVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEV 420
           RV+ADGH+ WACE FS+ + KEL    +LFWCWR F+IKLWNHGL+D +T+NNCN  LE 
Sbjct: 361 RVIADGHISWACEVFSRFYEKELHCYSSLFWCWRLFLIKLWNHGLVDSATINNCNTILEN 401

Query: 421 FKDESFDATEN 427
            ++ S     N
Sbjct: 421 CRNTSVTNNNN 401

BLAST of Cp4.1LG08g07180 vs. TAIR10
Match: AT5G51230.1 (AT5G51230.1 VEFS-Box of polycomb protein)

HSP 1 Score: 387.9 bits (995), Expect = 8.3e-108
Identity = 200/359 (55.71%), Positives = 248/359 (69.08%), Query Frame = 1

Query: 60  RVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVT 119
           R+ TG V+FNYR YNN ++KTEVTEDFSCPFCL+ CASFKGLRYHL S+HD+ NFE+WVT
Sbjct: 298 RLRTGNVVFNYRYYNNKLQKTEVTEDFSCPFCLVKCASFKGLRYHLPSTHDLLNFEFWVT 357

Query: 120 EEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRP-RRRKLKNSVQNGKYVQFLEMDS 179
           EE+QAVNVS+K +    +   D VDP+ QTFF  ++  RRR+ K+ V++ +    L    
Sbjct: 358 EEFQAVNVSLKTETMISKVNEDDVDPKQQTFFFSSKKFRRRRQKSQVRSSRQGPHL---- 417

Query: 180 ARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAY--LQNAQQGENIGLECPSSMECIEH 239
                 G+    +    D  S   E S   P + Y  +  A+ G+ +             
Sbjct: 418 ------GLGCEVLDKTDDAHSVRSEKSRIPPGKHYERIGGAESGQRVP------------ 477

Query: 240 VASSSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVER-DPKNRMLLQKRQFY 299
                  PG S A  QSC  P+  + ++GS  L  AK RK+++ER D +NR LLQKRQF+
Sbjct: 478 -------PGTSPADVQSCGDPDYVQSIAGSTMLQFAKTRKISIERSDLRNRSLLQKRQFF 537

Query: 300 HSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQ 359
           HSHR QPMAL++VLSD+DSEDEVDDD+ADFEDRRMLDDFVDVTKDEK++MH+WNSFVRKQ
Sbjct: 538 HSHRAQPMALEQVLSDRDSEDEVDDDVADFEDRRMLDDFVDVTKDEKQMMHMWNSFVRKQ 597

Query: 360 RVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLE 415
           RVLADGH+PWACEAFS+LHG  +V +P L WCWR FM+KLWNHGLLD  TMNNCN  LE
Sbjct: 598 RVLADGHIPWACEAFSRLHGPIMVRTPHLIWCWRVFMVKLWNHGLLDARTMNNCNTFLE 627

BLAST of Cp4.1LG08g07180 vs. TAIR10
Match: AT2G35670.1 (AT2G35670.1 VEFS-Box of polycomb protein)

HSP 1 Score: 149.8 bits (377), Expect = 3.8e-36
Identity = 79/181 (43.65%), Positives = 110/181 (60.77%), Query Frame = 1

Query: 240 SSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVERDPKNRMLLQKRQFYHSHR 299
           +SNI   +       + P+  +V S    LH  +     +ER       L+ RQFYHS  
Sbjct: 562 TSNILATTQPAKAEPSEPKVTRV-SRRKELHAERCEAKRLER-------LKGRQFYHSQT 621

Query: 300 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 359
           +QP+  ++V+S++DSE+E DD   D  +R  L+  V V+K+EKR M+LWN FVRKQRV+A
Sbjct: 622 MQPITFEQVMSNEDSENETDDYALDISERLRLERLVGVSKEEKRYMYLWNIFVRKQRVIA 681

Query: 360 DGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEVFKDE 419
           DGHVPWACE F+KLH +E+ +S +  W WR F IKLWN+GL+   T + C   L    DE
Sbjct: 682 DGHVPWACEEFAKLHKEEMKNSSSFDWWWRMFRIKLWNNGLICAKTFHKCTTILLSNSDE 734

Query: 420 S 421
           +
Sbjct: 742 A 734

BLAST of Cp4.1LG08g07180 vs. TAIR10
Match: AT4G16810.1 (AT4G16810.1 VEFS-Box of polycomb protein)

HSP 1 Score: 142.9 bits (359), Expect = 4.6e-34
Identity = 75/143 (52.45%), Positives = 101/143 (70.63%), Query Frame = 1

Query: 271 PAKARKLTVERDPKNRMLLQKRQFYHSHRVQPMALDKVLSDKDSEDEVD--DDIADFEDR 330
           PAK  K T    P     L KRQFYHS   QP++L++V+SD+DSE++VD  DD A  E+ 
Sbjct: 13  PAKRSKATSHYLP-----LHKRQFYHSRTGQPLSLEQVMSDRDSENDVDKNDDAAHLEES 72

Query: 331 RMLDDFVDVTKD-EKRLMHLWNSFVRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWC 390
           +ML+  +D  +   +R + LWNSFV++QR++AD H+PWACEAFS+LH +EL S+ +L  C
Sbjct: 73  QMLNGSMDENEIVAERFIKLWNSFVKQQRIVADAHIPWACEAFSRLHLQELRSNLSLDLC 132

Query: 391 WRFFMIKLWNHGLLDPSTMNNCN 411
           WR FMIK W++GLLD  TMN CN
Sbjct: 133 WRQFMIKQWDYGLLDRVTMNKCN 150

BLAST of Cp4.1LG08g07180 vs. NCBI nr
Match: gi|659095839|ref|XP_008448794.1| (PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 [Cucumis melo])

HSP 1 Score: 823.2 bits (2125), Expect = 2.2e-235
Identity = 400/433 (92.38%), Positives = 413/433 (95.38%), Query Frame = 1

Query: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKKR 60
           MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCL+YKLQARRK+R
Sbjct: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLYYKLQARRKER 60

Query: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVTE 120
           VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHL SSHDMFNFEYWVTE
Sbjct: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLCSSHDMFNFEYWVTE 120

Query: 121 EYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYVQFLEMDSAR 180
           EYQAVNVSVKIDVFRPETVADGVDPQLQTFF CTRPR+RKLKNS+QNGKYVQFLEMDS  
Sbjct: 121 EYQAVNVSVKIDVFRPETVADGVDPQLQTFFFCTRPRKRKLKNSIQNGKYVQFLEMDSPG 180

Query: 181 PSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQ-GENIGLECPSSMECIEHVAS 240
           P+TEGM KGF+GHNADGVSCEKEGSHSFPIE YLQNAQQ GENIG E PSSMECIE VAS
Sbjct: 181 PATEGMHKGFVGHNADGVSCEKEGSHSFPIETYLQNAQQDGENIGPEGPSSMECIERVAS 240

Query: 241 SSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVERDPKNRMLLQKRQFYHSHR 300
           SSNIPG S+AI+QS TGPECYKVLSGSDHL PAKARKLTVERDP+NRMLLQKRQFYHSHR
Sbjct: 241 SSNIPGFSVAINQSSTGPECYKVLSGSDHLQPAKARKLTVERDPRNRMLLQKRQFYHSHR 300

Query: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 360
           VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA
Sbjct: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 360

Query: 361 DGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEVFKDE 420
           DGHVPWACEAFSKLHGKEL+SSP LFWCWR FMIKLWNHGLLD STMNNCNLTLE FKDE
Sbjct: 361 DGHVPWACEAFSKLHGKELISSPPLFWCWRLFMIKLWNHGLLDASTMNNCNLTLEGFKDE 420

Query: 421 SFDATENGRREDD 433
           S +A +N RR+DD
Sbjct: 421 SSNAMKNCRRDDD 433

BLAST of Cp4.1LG08g07180 vs. NCBI nr
Match: gi|449458988|ref|XP_004147228.1| (PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 818.9 bits (2114), Expect = 4.1e-234
Identity = 397/433 (91.69%), Positives = 411/433 (94.92%), Query Frame = 1

Query: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKKR 60
           MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRK+R
Sbjct: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKER 60

Query: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVTE 120
           VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHL SSHDMFNFEYWVTE
Sbjct: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLCSSHDMFNFEYWVTE 120

Query: 121 EYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYVQFLEMDSAR 180
           EYQAVNVSVK+DVFRPE VADGVDPQLQTFF CTRPR+RKLKNS+QNGKYVQFLEMDS  
Sbjct: 121 EYQAVNVSVKVDVFRPENVADGVDPQLQTFFFCTRPRKRKLKNSIQNGKYVQFLEMDSPG 180

Query: 181 PSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQ-GENIGLECPSSMECIEHVAS 240
           P+TEGM KGF+GHNADGVSCEKEGSHSFPIE YLQNAQQ GENIG E PSSMECIE VAS
Sbjct: 181 PATEGMHKGFVGHNADGVSCEKEGSHSFPIETYLQNAQQDGENIGPEGPSSMECIERVAS 240

Query: 241 SSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVERDPKNRMLLQKRQFYHSHR 300
           SSNIPG S+AI+QS TGPECYKVLSG+DHL PAKARKLTVERDP+NRMLLQKRQFYHSHR
Sbjct: 241 SSNIPGFSVAINQSSTGPECYKVLSGNDHLQPAKARKLTVERDPRNRMLLQKRQFYHSHR 300

Query: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 360
           VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA
Sbjct: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRVLA 360

Query: 361 DGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNLTLEVFKDE 420
           DGHVPWACEAFSKLHGKEL+SSP LFWCWR FMIKLWNHGLLD STMNNCNLTLE FKDE
Sbjct: 361 DGHVPWACEAFSKLHGKELISSPPLFWCWRLFMIKLWNHGLLDASTMNNCNLTLEGFKDE 420

Query: 421 SFDATENGRREDD 433
           S +AT+N   +DD
Sbjct: 421 SSNATKNCGGDDD 433

BLAST of Cp4.1LG08g07180 vs. NCBI nr
Match: gi|778675274|ref|XP_011650380.1| (PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 680.6 bits (1755), Expect = 1.7e-192
Identity = 333/358 (93.02%), Positives = 344/358 (96.09%), Query Frame = 1

Query: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKKR 60
           MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRK+R
Sbjct: 1   MCHDNFHVHSLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKER 60

Query: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVTE 120
           VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHL SSHDMFNFEYWVTE
Sbjct: 61  VSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLCSSHDMFNFEYWVTE 120

Query: 121 EYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYVQFLEMDSAR 180
           EYQAVNVSVK+DVFRPE VADGVDPQLQTFF CTRPR+RKLKNS+QNGKYVQFLEMDS  
Sbjct: 121 EYQAVNVSVKVDVFRPENVADGVDPQLQTFFFCTRPRKRKLKNSIQNGKYVQFLEMDSPG 180

Query: 181 PSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQ-GENIGLECPSSMECIEHVAS 240
           P+TEGM KGF+GHNADGVSCEKEGSHSFPIE YLQNAQQ GENIG E PSSMECIE VAS
Sbjct: 181 PATEGMHKGFVGHNADGVSCEKEGSHSFPIETYLQNAQQDGENIGPEGPSSMECIERVAS 240

Query: 241 SSNIPGVSMAISQSCTGPECYKVLSGSDHLHPAKARKLTVERDPKNRMLLQKRQFYHSHR 300
           SSNIPG S+AI+QS TGPECYKVLSG+DHL PAKARKLTVERDP+NRMLLQKRQFYHSHR
Sbjct: 241 SSNIPGFSVAINQSSTGPECYKVLSGNDHLQPAKARKLTVERDPRNRMLLQKRQFYHSHR 300

Query: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRV 358
           VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQR+
Sbjct: 301 VQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSFVRKQRL 358

BLAST of Cp4.1LG08g07180 vs. NCBI nr
Match: gi|1000940943|ref|XP_015582811.1| (PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 [Ricinus communis])

HSP 1 Score: 583.9 bits (1504), Expect = 2.2e-163
Identity = 302/441 (68.48%), Positives = 348/441 (78.91%), Query Frame = 1

Query: 1   MCHDNFHVH-SLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKK 60
           MCH N  VH S+EEA  A++SLLIYCKPVELYNILH R+  NPSFLRRCL YK++A RK+
Sbjct: 1   MCHQNSCVHLSVEEAIAADESLLIYCKPVELYNILHRRAQFNPSFLRRCLRYKMKASRKR 60

Query: 61  RVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVT 120
           R++ G+VIFNYRDYNN ++KTEVTEDF CPFC M+C SFKGLRYHL SSHD+FNFE+WV 
Sbjct: 61  RLAAGIVIFNYRDYNNKLQKTEVTEDFLCPFCSMMCMSFKGLRYHLCSSHDLFNFEFWVN 120

Query: 121 EEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYV--QFLEMD 180
           EEYQAVNVSVK+D F  ETVADGV+ + QTFF C+RPRRRK +N   N K V  QFLE+D
Sbjct: 121 EEYQAVNVSVKLDRFLSETVADGVEQRQQTFFFCSRPRRRKSRNHDHNEKQVCVQFLELD 180

Query: 181 SARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQG-ENIGLECPSSMECIEH 240
           S +   EG+ +GF+    DG +  K  S     E  L N++ G EN G+E P ++E IE 
Sbjct: 181 SPKLPLEGINEGFL-RKDDGENASKSSSS----EKDLHNSRHGAENYGVEYP-NVELIER 240

Query: 241 VASSSNIPGVSMAISQSCTGPECYKVLSGSDH-----LHPAKARKLTVER-DPKNRMLLQ 300
           VASSSNI GVS+A +QS    EC K L GSDH     +H AKARKLTVER DP+NR+LLQ
Sbjct: 241 VASSSNILGVSIAKAQSSVDSECVKSLCGSDHSLPAGIHVAKARKLTVERSDPRNRVLLQ 300

Query: 301 KRQFYHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNS 360
           KRQFYHSHRVQPMAL++V+SD+DSEDEVDDDIADFEDRRMLDDFVDV+KDEK+LMH WNS
Sbjct: 301 KRQFYHSHRVQPMALEQVMSDRDSEDEVDDDIADFEDRRMLDDFVDVSKDEKQLMHFWNS 360

Query: 361 FVRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCN 420
           FVRKQRVLADGHVPWACEAFSKLHG+ELV SPALFWCWR FMIKLWN GLLD  TMNNCN
Sbjct: 361 FVRKQRVLADGHVPWACEAFSKLHGQELVGSPALFWCWRLFMIKLWNQGLLDACTMNNCN 420

Query: 421 LTLEVFKDESFDATENGRRED 432
           L LE  +DE  DA +  +  D
Sbjct: 421 LILERSRDEGSDAMKTEKGND 435

BLAST of Cp4.1LG08g07180 vs. NCBI nr
Match: gi|223527939|gb|EEF30025.1| (polycomb protein embryonic flower, putative [Ricinus communis])

HSP 1 Score: 571.2 bits (1471), Expect = 1.5e-159
Identity = 294/440 (66.82%), Positives = 338/440 (76.82%), Query Frame = 1

Query: 1   MCHDNFHVH-SLEEATTAEDSLLIYCKPVELYNILHLRSLNNPSFLRRCLHYKLQARRKK 60
           MCH N  VH S+EEA  A++SLLIYCKPVELYNILH R+  NPSFLRRCL YK++A RK+
Sbjct: 37  MCHQNSCVHLSVEEAIAADESLLIYCKPVELYNILHRRAQFNPSFLRRCLRYKMKASRKR 96

Query: 61  RVSTGVVIFNYRDYNNIVRKTEVTEDFSCPFCLMLCASFKGLRYHLGSSHDMFNFEYWVT 120
           R++ G+VIFNYRDYNN ++KTEVTEDF CPFC M+C SFKGLRYHL SSHD+FNFE+WV 
Sbjct: 97  RLAAGIVIFNYRDYNNKLQKTEVTEDFLCPFCSMMCMSFKGLRYHLCSSHDLFNFEFWVN 156

Query: 121 EEYQAVNVSVKIDVFRPETVADGVDPQLQTFFLCTRPRRRKLKNSVQNGKYV--QFLEMD 180
           EEYQAVNVSVK+D F  ETVADGV+ + QTFF C+RPRRRK +N   N K V  QFLE+D
Sbjct: 157 EEYQAVNVSVKLDRFLSETVADGVEQRQQTFFFCSRPRRRKSRNHDHNEKQVCVQFLELD 216

Query: 181 SARPSTEGMPKGFIGHNADGVSCEKEGSHSFPIEAYLQNAQQGENIGLECPSSMECIEHV 240
           S +   EG+ +GF+                          +  EN G+E P+ +E IE V
Sbjct: 217 SPKLPLEGINEGFL-------------------------RKDDENYGVEYPN-VELIERV 276

Query: 241 ASSSNIPGVSMAISQSCTGPECYKVLSGSDH-----LHPAKARKLTVER-DPKNRMLLQK 300
           ASSSNI GVS+A +QS    EC K L GSDH     +H AKARKLTVER DP+NR+LLQK
Sbjct: 277 ASSSNILGVSIAKAQSSVDSECVKSLCGSDHSLPAGIHVAKARKLTVERSDPRNRVLLQK 336

Query: 301 RQFYHSHRVQPMALDKVLSDKDSEDEVDDDIADFEDRRMLDDFVDVTKDEKRLMHLWNSF 360
           RQFYHSHRVQPMAL++V+SD+DSEDEVDDDIADFEDRRMLDDFVDV+KDEK+LMH WNSF
Sbjct: 337 RQFYHSHRVQPMALEQVMSDRDSEDEVDDDIADFEDRRMLDDFVDVSKDEKQLMHFWNSF 396

Query: 361 VRKQRVLADGHVPWACEAFSKLHGKELVSSPALFWCWRFFMIKLWNHGLLDPSTMNNCNL 420
           VRKQRVLADGHVPWACEAFSKLHG+ELV SPALFWCWR FMIKLWN GLLD  TMNNCNL
Sbjct: 397 VRKQRVLADGHVPWACEAFSKLHGQELVGSPALFWCWRLFMIKLWNQGLLDACTMNNCNL 450

Query: 421 TLEVFKDESFDATENGRRED 432
            LE  +DE  DA +  +  D
Sbjct: 457 ILERSRDEGSDAMKTEKGND 450

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
VRN2_ARATH   2.8e-113  50.81     Polycomb group protein VERNALIZATION 2 OS=Arabidopsis thaliana GN=VRN2 PE=1 SV=2
EMF2_ARATH   1.5e-106  55.71     Polycomb group protein EMBRYONIC FLOWER 2 OS=Arabidopsis thaliana GN=EMF2 PE=1 S...
FIS2L_ARATH  2.3e-35   49.33     Polycomb group protein FERTILIZATION-INDEPENDENT SEED 2 OS=Arabidopsis thaliana ...
FIS2C_ARATH  6.7e-35   43.65     Polycomb group protein FERTILIZATION-INDEPENDENT SEED 2 OS=Arabidopsis thaliana ...
SZ12A_DANRE  3.8e-06   28.68     Polycomb protein suz12-A OS=Danio rerio GN=suz12a PE=2 SV=1
Match Name        E-value   Identity  Description
A0A0A0L461_CUCSA  2.9e-234  91.69     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G017280 PE=4 SV=1
B9T283_RICCO      1.0e-159  66.82     Polycomb protein embryonic flower, putative OS=Ricinus communis GN=RCOM_0191350 ...
B9GYN4_POPTR      3.0e-159  67.19     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s07930g PE=4 SV=2
A0A0B2SCY3_GLYSO  1.3e-157  65.74     Polycomb group protein VERNALIZATION 2 OS=Glycine soja GN=glysoja_018911 PE=4 SV...
I1J9S4_SOYBN      1.8e-156  65.05     Uncharacterized protein OS=Glycine max GN=GLYMA_01G206000 PE=4 SV=1
Match Name   E-value   Identity  Description
AT4G16845.1  1.6e-114  50.81     VEFS-Box of polycomb protein
AT5G51230.1  8.3e-108  55.71     VEFS-Box of polycomb protein
AT2G35670.1  3.8e-36   43.65     VEFS-Box of polycomb protein
AT4G16810.1  4.6e-34   52.45     VEFS-Box of polycomb protein
Match Name                         E-value   Identity  Description
gi|659095839|ref|XP_008448794.1|   2.2e-235  92.38     PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 [Cucumis melo]
gi|449458988|ref|XP_004147228.1|   4.1e-234  91.69     PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 isoform X1 [Cucumis sativus...
gi|778675274|ref|XP_011650380.1|   1.7e-192  93.02     PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 isoform X2 [Cucumis sativus...
gi|1000940943|ref|XP_015582811.1|  2.2e-163  68.48     PREDICTED: polycomb group protein EMBRYONIC FLOWER 2 [Ricinus communis]
gi|223527939|gb|EEF30025.1|        1.5e-159  66.82     polycomb protein embryonic flower, putative [Ricinus communis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR019135  Polycomb_protein_VEFS-Box
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g07180.1  Cp4.1LG08g07180.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description             Source   Source Term  Source Description      Alignment
IPR019135  Polycomb protein, VEFS-Box  PFAM     PF09733      VEFS-Box                coord: 281..410; score: 2.9
None       No IPR available            PANTHER  PTHR22597    POLYCOMB GROUP PROTEIN  coord: 41..432; score: 4.5E