Cp4.1LG02g04430 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g04430
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (Zucchini))
Description: SPOC domain/transcription elongation factor S-II, putative
Location: Cp4.1LG02 : 2426637 .. 2436049 (-)
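The location field packs the sequence ID, start, end, and strand into one string. A minimal sketch of pulling those parts out is given here; the function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome / linkage group, e.g. "Cp4.1LG02"
    start: int   # leftmost coordinate (assumed 1-based, inclusive)
    end: int     # rightmost coordinate (assumed inclusive)
    strand: str  # "+" or "-"


# Matches strings of the form "SEQID : START .. END (STRAND)".
_LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text: str) -> Location:
    """Split a location string into its four components."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the genomic span under the inclusive-coordinate assumption."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG02 : 2426637 .. 2436049 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

The `(-)` strand means the gene sequence shown on this page would be the reverse complement of the reference at those coordinates.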
The following sequences are available for this feature:

Gene sequence (with intron)

GGATTGCATTTCGAAGCGCGACCCTTTTCGCTCTAAATCTTCGAGCCTTTCTTGGCCACTTTCTCTCTCTCTCCTCCCGATTCCATTTCCATTATCTTTTCAACTCTGCTTAGCTTCTATCACCATCGACCTCTCTTCTCTCTGCAATTTTTGATCTGGTGGTTACTTTCTCTCGCTGGCACTGCGCTCGCCTTCTACGGATCGTGCTGTATCGGATGATTTATGTATGTTCATTTCTTGCATTTTTGGACAGTTTTTTTCTTTTTTGTGGATTTGGATGATGGGGAGGTGTTCGGTTAGGGTTTTAGGTTTTCGAATGGTTGTTGTTATGCTTTTGGCGGGTGGCTTTTGATGGATTATTTTGAATTGGTTGTGATGATATGGAATTCTTGATGTTTTCTGATTTCGTATGGTGTACCTGTCCGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTACTTGGAGTGAATTAGTTCTTTTTTTTTTTGGCTGATTTTACTTTGAGGTATTTTGCTTTGAAATTGAGTTTTGAGAATTGGTTGATGAGATTGGGTAGTGGAGGATATTTTTTTTTTGGTTCGAAGGTTATGCGTGGCCTTAATGGGAGGTTGAATTAATCTTTACGAATGATTATTGGTGTATTATGGGTCTTTTTCAATTGTTTAATTCAACTGTGATGGTCGTGAAATTCTTGATGTTTGCTGGTTTTGCGCCTTATTGCTACTTTCTATGCTCGAGTGCAATTTTTCCTTTTCCGTTGCCTCATTTTTTCTTCAATGTCTTTTGCGAAAAGCAAAATTGATGAGATTGATAGTGAAAGGTACTTTTTTGTGCGAAGCACCTGCGATGTGAGGTTGAATTGATCTATTCATTTTGATTCTGGAAGGTGTGTTACTTTTCCTAGAAATCCCGATACGTATTTACTCGGTGGTGTGGGAACGCGTATTTACATAGGTTTTTTGAAAGTAAAAAAAGGATTGCATGCCTAATCTTTACACTTCACAATGTACAGCTATGACATAAAAACATTATGATACCTGATCGACAAAATGCAAACTTAGGCTAATGGGGATTTTAATGGATGGAGGACAAATCTTCTTTTCGAAGTTGGAAGTTCCATATGATTTCTTTTACTTTTATATGATTTCTTCTGTTTATGCTTATTTATTAGACAACAGTAGGGTTTGAGGTTGTCTCGAGTGTATGCCATACTGCATTCCTTGAGGCAATACTGTCAGTCTCACCTCGAGGATGTGCCCCAAATGCGTTTTGGAGCATGGAGTTTGGTAAATTGTACTCGCTTACACCCTGGTTTGTCATTTTTATGCTACTCTAATGTTCATACCCCCTGCACTTAACAATTTGTGTGTAACCATATAAAGTGTAAATGGACTTGAATTGTTGGGGTTGGATAACAGCATGTAGTAAACTGCGATATAGTTTCTTGATTTTGAAGATGTACTTTACTCTAAATGCAGATGCTTAAGTTTCTTCTTAAGAATGATGAAGCACAACGTGTGGACACTATTTGATGCTCACATACCTAGCGAAAAATTTATTATGTGTAATGTTAGTGCCGTAGTCTGGTCAACAATCTTGTTTCTGAGAAATATTCAATGCGAGGAATGCTAAGTAGTCAATTGGACTCTGTGATAAGCAAAGTGGAATTGTCATTATCTAGACTGGTTGTAGTTTCTTCCAATG
ACCCTTCATTAGTTCAATTTCTGGTAGCTTTTATTGTCCGAATCTTCTCATCCGCCTAAGAAAGCTACAATGACACACGAAGAATTTGTGAAAAGTAAGTGATCCTCCAAAAGTTATGCAAATGTGACTAGAAAAAATCCACACCACTTTCAAAGTCATGACTTTTTGAAAGGAGTGCCCCTTCAGGCTTCTTCTTTTTGGGTGAGAAAAGAGAAGGAAGAAATGAATTTGAATCTCAATTCCTTGGCTGTTTCTAGGATGCTTGCTCACTACTCTTGGAAAGAGGTGATAGTCACTTTAGAAGGTTATTTTGTGTCCAAAAGTTTATTAAATCCATTCATGGATGAAAAGGCTGTTTTTTTGAGAATAAGTTTAGTTGATGGTTTGTTTGGTTGAAAAATGGGAACGCGTATTTACATAGGTTTTTTGAAAGTAAAAAAAGGATTGCATGCCTAATCTTTACACTTCACAATGTACAGCTATGACATAAAAACATTATGATACCTGATCGACAAAATGCAAACTTAGGCTAATGGGGATTTTAATGGATGGAGGACAAATCTTCTTTTCGAAGTTGGAAGTTCCATATGATTTCTTTTACTTTTATATGATTTCTTCTGTTTATGCTTATTTATTAGACAACAGTAGGGTTTGAGGTTGTCTCGAGTGTATGCCATACTGCATTCCTTGAGGCAATACTGTCAGTCTCACCTCGAGGATGTGCCCCAAATGCGTTTTGGAGCATGGAGTTTGGTAAATTGTACTCGCTTACACCCTGGTTTGTCATTTTTATGCTACTCTAATGTTCATACCCCCTGCACTTAACAATTTGTGTGTAACCATATAAAGTGTAAATGGACTTGAATTGTTGGGGTTGGATAACAGCATGTAGTAAACTGCGATATAGTTTCTTGATTTTGAAGATGTACTTTACTCTAAATGCAGATGCTTAAGTTTCTTCTTAAGAATGATGAAGCACAACGTGTGGACACTATTTGATGCTCACATACCTAGCGAAAAATTTATTATGTGTAATGTTAGTGCCGTAGTCTGGTCAACAATCTTGTTTCTGAGAAATATTCAATGCGAGGAATGCTAAGTAGTCAATTGGACTCTGTGATAAGCAAAGTGGAATTGTCATTATCTAGACTGGTTGTAGTTTCTTCCAATGACCCTTCATTAGTTCAATTTCTGGTAGCTTTTATTGTCCGAATCTTCTCATCCGCCTAAGAAAGCTACAATGACACACGAAGAATTTGTGAAAAGTAAGTGATCCTCCAAAAGTTATGCAAATGTGACTAGAAAAAATCCACACCACTTTCAAAGTCATGACTTTTTGAAAGGAGTGCCCCTTCAGGCTTCTTCTTTTTGGGTGAGAAAAGAGAAGGAAGAAATGAATTTGAATCTCAATTCCTTGGCTGTTTCTAGGATGCTTGCTCACTACTCTTGGAAAGAGGTGATAGTCACTTTAGAAGGTTATTTTGTGTCCAAAAGTTTATTAAATCCATTCATGGATGAAAAGGCTGTTTTTTTGAGAATAAGTTTAGTTGATGGTTTGTTTGGTTGAAAATTGAGTATTGGTCCAACAAGAAACATTCCTATTTGGAGTAACATAAAGAGCTACGGAGGATGGATTCCTTTAAAGAATCTACCTTTCCCATATTAGAAGCTTTCTTTCTTTGAAGCAATAGGGCAAAACTTTTGTGGTATGTTAAGCATTTCCTCCCAAAGAATTAATCTCTTTGATAGTTAGTTGCGGTAATAAAAGTAGAAGAATATATTTGTGGATTAATCCAAACAAAAGTTGAAGTTGGTAATGTTTGAAAATTTTCCCACAGTTTTGGTGATTTGTTGCATTTGACTGTCTTAGCATGAAATGTCGGTTTCTTGCATGATCATAATCAACTACCATCTTCATAAGGTGTTATTTCTCGTGGTTTTATTAGACCAACTTCTGATGGAAGTGATTTAAACTAAATAATGAAGTGAATGCCCAACGA
TCAAATATTAAATACTCTTTTATTGCCACATATTCAGAATTTTATACCAGAAAAAACTCATGATGAATCGTTGTCATGGCTTCAGAATCTTTCATTTCTATTAAGTTCATTCAATTTTCGATGCCTCTGTTATATCCTCTGGTTTCATTAAATGCAAGATAGGTCTTGAAAAAGTAAGTTGGCTAATCTAGATTTTATTGAAAATGCTTGCCTACACATTGTAGTTCCCCCAATCCAATCAGCTTCCCTTCTAGAAAAGGAGTCTTCGAAGCAAACATTGCAAGGCCTCTCATAATTTTGTTAATTTGTCATTGTCATTGTCATTTTCCATTTTAGGTTCAGACTTAAATCTCAAAAAAATTATACTGCTTTCTCCTGAATTTTCAGATGTTCCAACATCTTCCTTACCATTTATTGACTCTGAGGCTGAATCATTAACCAATGTTTTGAGAAATTGGAGATTTCCCCGATAATCTTGAACAGTCATTGACTCTCCTTTTTTGGACGAATATGATGTACTTTCTTTAAAGAAGATTTTTGAGTGGAGTCTACCCTCTTGATTCCTAAAAAGATTTCATTTCTTGTTAGATTTGCAGTCTTTGGATGTGTGAAATTCTGCCTTATTCATCTCAGATAAAAGTTCAGTTCATTTCGTTTAAACCCTCATTGAAGATTCTTTTATGGCTTTTAGAGGTCTTGGAAAGTTTCGAAACAGTCGGCTCTTAAGTTTCTTTTTCTGTCCAATATTCTTTTTAGAATAAGCAGGTTTGTTTGTATAACGATTGATGATAGTTGAGATCAAGTTTGATGAAATTTTGTTCTCGGCTTTCTTCTTAGGGATGATCCAAGAGTGTAGTGCCAGAAATTTTTATTTTCTGTGAATAGTTTCTTAAGAGATCCTCATTGAAGGTTATTGCTTGGCATTTTAGAGGGTTTTGGAAGGTTTCGGGCTTGCATTAGTTGAATGACTCTTTCAAGGTAAGAAGTGTATCTTCTGAAATATTTTTTCTTAGCTTCATTTTCTCGAGTTGCTTTTTTGAGCCTCAAGCACTTAGGTCTTTGTTGTACTTCCTTCCTAAGAAGTTTCTGCCTTTTAAACCGGGGTGGGAGCTCTTTATGGCCGATTTTTGGCTGTACTTTTGTTTGCTTTGCTAGTTATTTTTCATCTTTTTATCTTGTATTTTTACTACCTTTTCCTTCCTTTTTATACCTCGAAAGTGGTTTCTTTTATAAAAAAAATACCTGACAATCAATTTGGACAATCTATTTGGAGGGACTGGAAACATGCTGAGAACTGCAGAAGGATTGCTGTCACTTCCTGTGAAGCGCAAGGCATCGAGTGAGCCTTTTAACTCTCTCTCACAGCAGGCTTCATTGCATAATAAGCGAGTTGCACAGATGGAACCTCGATCATGGTTGCAACAAGTATCTGGATTAGCCAAAAGACCTCCTTTACAAATACCAAAGAATGTCCCAGCTCCCACATCGATGCATTTCCCTGCAGGAACTAAGAGAAAGGTACAGCAAATAGAATCACATCCAACTAAAGTTGTACATCAACGTTCCACTGCTCCCAAATGCCAGAGTGCTCCACTGACTCCAACTTCCAAAATGCAAAATGAGCCGACTGGGTCTGTGAGATCAAAGATGAGGGAATCCTTGGCTGCTGCATTAGCCTTGGTATCGCAGCAGCAAAACAAATCTTCTAATGATGAAAAAAATCCTTTAACTGAGGCTGAGAAGTCTGCAACTCAAATGCAGGAAAATGCTTTAGCGTCTGATCCAGCTATTATTGTTCATGTATCTGATGACTCAAAGAAAATCTTTTCCGAGAAGTTAGATTCTGTTGGTCTTGAAGATAATGTAGGAAGGATGTTAGATAAGAATTTGCTGTGTGTAAATGATAGCGATTTAGAGTCATTAGGATATGATGGACGAGTCTTTCAACCAAATAATATTTTGTCTTATGAAGATATTTCTTTTGGGGATAACTTTTTTAT
TAAAGACGATCTTTTACAAGAAAATTCTCTCTCTTGGGTACTGGAAGCTGATGTAGGGCTAGCTGATAAAAAGGAAATCAGAACTGATGAACTTCAGAAGATTGATGTTGGTATAGCAAATCAAAACCAAGGATCAAAACCAGTACAGAGTCCTGAGTCTTTGGCATTTAAAATGGAAGAAGAATTATTTAAATTATTTAGTGGTGTTAATAAAAAGTACAAGGAGAAAGGAAGATCCCTTTTGTTCAACCTGAAAGACAGAAATAATCCCGAGCTGAGAGAAAGGGTTATGAATGGGGAAATTACCCCAGAAAGATTATGTTCCATGACTGCCGAGGAACTTGCATCTAAGGAGCTTTCTGAGTGGAGAATGGCCAAGGCTGAAGAACTTGCACAAATGGTAGTTTTACCCAACTCAGAAGTTGATATCAGACGTTTGGTAAGGAAGACGCATAAAGGTGAGTTTCAAGTAGAAGTTGAAGAATACGATAATGCCTCTATCGATGTTTCATCTGGGGTTTCTACGTTTTCTCAGAGTCAACGTAATAAGAATGAGACTGTAGGTGGATCACCTGATGAACCTGATACAATTATGGACGAGTGGAATATTTCTGGCCAGAAAAATGGTGCATCTGACAAGGATGAGTACACCTTTACAATTGCATCGACTGAAGGGTCTGAATTATTGTCACTGCCTCCGATCTCCTCCATAGATGAGTTTATGGAATCCTTTGATACAGAGCCACCTTTCAATATTTTATCTGAAGATACTGGTAAATCGTCTCCTATTTTGGAGAAGGGTGAGCCAGAGCCTGGCTCTCAGTTGAAGGCTGCAGCTCATTCTATGGAAGGCGCAACTGATGTTAGTATAGACAAAAATGAAAATATTGAGTCTTATACAAAAGCAGACATTGGCTCGTCTTCTATTAGCCACATGGATTTGACATCTAGTGATTGTAAAACTGATGAGGACTTGAATGAAAATCAAGCTGGGTTAAGAACATCTGACAGGAATGATGGTACAGTATCTGGTGATAGTAATGCAAAATCTGGGACAGAATCTTTGGCCAGCACATTTAGTTTAGAATATTTATGGGATGGCATCCTCCAGTATAATATTTCGACAATGACTCCGGTCGTGGGTACCTACATAAGGTGTGTATTTTCTCTGATTAATCGTGGATTTTACTTCTTTTGTAATTGCATTCACAATAATTATATATTTTTTGTTGGATATTTCTCTTTCATGATTTTACTTGAAAACCTCAGTAAGCTTAGTTTTTCCTTGCATTAATCCAGCTCCAGCATTTACATGTATCCGCATGGTTTCCACATTTCTTCATTTGGGCATTCCCATGTGACCTTTAAGTTTTAAGGGACAACTTTTGATTTAGTGAAAAAAATTGCCAAGTTTAGTTTTAGATATATCTTTGTGTTGAATGTTAGACATAGTTTTGTTGCATTCATAACGTGTTAACGTTCTTGGTTTCCATGATTAACGTGATCGAGAAAGGATTTGTCTTTACAGTGTAATCTAAATTTTACTTGCTGCCTTTGTGGGTTTCCTGGTTGGTTTTAATGTTCTTGATGCGTTCTTTTTAATAATTGCCTTGCTTTGTTGCAGTGGTGAAAGAACATCAGCGAAAGATTGGCCTAGCACTCTTGAGATCAAAGGAAGAGTTAGATTGGATGCATTTGAGAAGTTCCTTCAAGAGCTTCCATTATCTCGTAGTCGAGCCGTTATGGTATCTCTAGTTTATTTTTCCTGTACACATGCCTCTGACCAATTTCCCCCCCCCCTAAAATTCACTCCACGGGACAAAAAAGAACAAATTAAAAGATGAAAGTTGCCAAGAAAATACAAATGCATTTGTTCTTATTTTCGTGCGCTAGCATTCTGCCTCTTGGATAAATCATACGTTCAAAGTTGTCGATCATTATTCCGTTCAAAGCACGTATGGAAAGTTGCTTGGCATATCAGAAAAGTGTCGAATCGTATA
AATAAACCCAACAATCTTTGGACTAATAGTACATATAAACTCAACAATCTTTGCACATGGAAAGTTACTTGGCATAGCAGACGGCGTTCTGTTTTCCAATTGTCATTCAATTTTGAGTTCTACTTTTTGAACCAAAAGAATTACCATTGTTCTTATGATTACGTATTACCATTGTTCTTGTTTCTGGGTTTTGTTGAAGATATTAATTTCCATATTCTTGCAGGTTCTTCATTTGGATTTAAAGAAAGGTTGCCCGGAAAGCGACCGAGCAAATCTTCAAGAGGTAATGTATTAGTAGTAGTCTCTCTTGAGGTTTCTCCAAGTAGGTTTCAGCCATTCTGCTGTTGTGTTTATACTCGGAATTCTCAAATCTTGTAGGTGGCGGAGTCGTATGTCGCCGATGAGCGAGTTGGTATAGCGGAGCCTGGTTCTGGGGTGGAATTTTATTTTTGCTCTCCACACGGACGGATTCTTGAAATGGTTGGCAGGATCCTTCTAAAGGAAAATAATGAGTTACTTAATGCAATTGAAAATGGCCTAATAGGCGTCGTTGTATGGAGAAAACCTCAATTAACTTTAATGTCACCAAACTCAACGTCACTCCACAAACGCAGTTCAAAAAAGCAACATTTTAGCTCTAGAAGACTGCAGGAGACACCAAACTTGAAAGCTAATGATGTTTCCCCTATGCCTCGAGGCTATTTTCCCGTCGCTAGCGATTATCCTCTGACTGAGGAGGATGATGCTGATGGCGACGATGATGTCCCGCCTGGCTTTGGCCCGTCAACTACTCGGGATGACGACGATCTTCCTGAGTTTAACTTCTCTGGTTCTGCAAACCCTCCTCCCCAAGGACTGTCTAGGCTGCCCTCATTTCAGCCAATATCCCGAACTTGGTCTCGGCCTGTAGAGAAAATGCGAGAGATTGTGCAAAAATATGGGCAAAGCGAAAATGCCTCTAGCGGAAACTGGCAAGAAAGGGGCTTCAGTTCAGTACCCATCCAGTCGTGGAATGATGACGATGACGACGACGACGACGACATCCCGGAATGGCAACCACAAGCAGCAGCAGCAGCCTCAAGGCATCGAATGCCTCCTCCCTCGCACTTGCAGCAGCCTGTGCGCAGGCTCGGGCAGCCGTCGCTGAGGCCTCATTATGTGGTGAACCAACAGCAGCAGCATCTGGGGCAGTTGTCCCAGTTGGGTGCTAACCAGCAGACCGTGGGGGGCCGCCTCCCCTTAAATGCAAATCAACAAGGGACATGGTGGGTTCCTCAGCAAGGCCACAACAACAACCCCATCAATATACATTCTTTTAGCAATTTAGGTGGTAGTCATAGTGGTAGTGGTCAGTTTTATGGAGCATTTGGGCGATCAACGCCTTCCAACCCTTCAAATAACAGAGGGTTTTGA

mRNA sequence

GGATTGCATTTCGAAGCGCGACCCTTTTCGCTCTAAATCTTCGAGCCTTTCTTGGCCACTTTCTCTCTCTCTCCTCCCGATTCCATTTCCATTATCTTTTCAACTCTGCTTAGCTTCTATCACCATCGACCTCTCTTCTCTCTGCAATTTTTGATCTGGTGGTTACTTTCTCTCGCTGGCACTGCGCTCGCCTTCTACGGATCGTGCTGTATCGGATGATTTATGGACTGGAAACATGCTGAGAACTGCAGAAGGATTGCTGTCACTTCCTGTGAAGCGCAAGGCATCGAGTGAGCCTTTTAACTCTCTCTCACAGCAGGCTTCATTGCATAATAAGCGAGTTGCACAGATGGAACCTCGATCATGGTTGCAACAAGTATCTGGATTAGCCAAAAGACCTCCTTTACAAATACCAAAGAATGTCCCAGCTCCCACATCGATGCATTTCCCTGCAGGAACTAAGAGAAAGGTACAGCAAATAGAATCACATCCAACTAAAGTTGTACATCAACGTTCCACTGCTCCCAAATGCCAGAGTGCTCCACTGACTCCAACTTCCAAAATGCAAAATGAGCCGACTGGGTCTGTGAGATCAAAGATGAGGGAATCCTTGGCTGCTGCATTAGCCTTGGTATCGCAGCAGCAAAACAAATCTTCTAATGATGAAAAAAATCCTTTAACTGAGGCTGAGAAGTCTGCAACTCAAATGCAGGAAAATGCTTTAGCGTCTGATCCAGCTATTATTGTTCATGTATCTGATGACTCAAAGAAAATCTTTTCCGAGAAGTTAGATTCTGTTGGTCTTGAAGATAATGTAGGAAGGATGTTAGATAAGAATTTGCTGTGTGTAAATGATAGCGATTTAGAGTCATTAGGATATGATGGACGAGTCTTTCAACCAAATAATATTTTGTCTTATGAAGATATTTCTTTTGGGGATAACTTTTTTATTAAAGACGATCTTTTACAAGAAAATTCTCTCTCTTGGGTACTGGAAGCTGATGTAGGGCTAGCTGATAAAAAGGAAATCAGAACTGATGAACTTCAGAAGATTGATGTTGGTATAGCAAATCAAAACCAAGGATCAAAACCAGTACAGAGTCCTGAGTCTTTGGCATTTAAAATGGAAGAAGAATTATTTAAATTATTTAGTGGTGTTAATAAAAAGTACAAGGAGAAAGGAAGATCCCTTTTGTTCAACCTGAAAGACAGAAATAATCCCGAGCTGAGAGAAAGGGTTATGAATGGGGAAATTACCCCAGAAAGATTATGTTCCATGACTGCCGAGGAACTTGCATCTAAGGAGCTTTCTGAGTGGAGAATGGCCAAGGCTGAAGAACTTGCACAAATGGTAGTTTTACCCAACTCAGAAGTTGATATCAGACGTTTGGTAAGGAAGACGCATAAAGGTGAGTTTCAAGTAGAAGTTGAAGAATACGATAATGCCTCTATCGATGTTTCATCTGGGGTTTCTACGTTTTCTCAGAGTCAACGTAATAAGAATGAGACTGTAGGTGGATCACCTGATGAACCTGATACAATTATGGACGAGTGGAATATTTCTGGCCAGAAAAATGGTGCATCTGACAAGGATGAGTACACCTTTACAATTGCATCGACTGAAGGGTCTGAATTATTGTCACTGCCTCCGATCTCCTCCATAGATGAGTTTATGGAATCCTTTGATACAGAGCCACCTTTCAATATTTTATCTGAAGATACTGGTAAATCGTCTCCTATTTTGGAGAAGGGTGAGCCAGAGCCTGGCTCTCAGTTGAAGGCTGCAGCTCATTCTATGGAAGGCGCAACTGATGTTAGTATAGACAAAAATGAAAATATTGAGTCTTATACAAAAGCAGACATTGGCTCGTCTTCTATTAGCCACATGGATTTGACATCTAGTGATTGTAAAACTGATGAGGACTTGAATGAAAATCAAGCTGGGTTAAGAACATCTGACAGGAATGATGGTACAGTATCTGGTGATAGTAATGCAAAAT
CTGGGACAGAATCTTTGGCCAGCACATTTAGTTTAGAATATTTATGGGATGGCATCCTCCAGTATAATATTTCGACAATGACTCCGGTCGTGGGTACCTACATAAGTGGTGAAAGAACATCAGCGAAAGATTGGCCTAGCACTCTTGAGATCAAAGGAAGAGTTAGATTGGATGCATTTGAGAAGTTCCTTCAAGAGCTTCCATTATCTCGTAGTCGAGCCGTTATGGTTCTTCATTTGGATTTAAAGAAAGGTTGCCCGGAAAGCGACCGAGCAAATCTTCAAGAGGTGGCGGAGTCGTATGTCGCCGATGAGCGAGTTGGTATAGCGGAGCCTGGTTCTGGGGTGGAATTTTATTTTTGCTCTCCACACGGACGGATTCTTGAAATGGTTGGCAGGATCCTTCTAAAGGAAAATAATGAGTTACTTAATGCAATTGAAAATGGCCTAATAGGCGTCGTTGTATGGAGAAAACCTCAATTAACTTTAATGTCACCAAACTCAACGTCACTCCACAAACGCAGTTCAAAAAAGCAACATTTTAGCTCTAGAAGACTGCAGGAGACACCAAACTTGAAAGCTAATGATGTTTCCCCTATGCCTCGAGGCTATTTTCCCGTCGCTAGCGATTATCCTCTGACTGAGGAGGATGATGCTGATGGCGACGATGATGTCCCGCCTGGCTTTGGCCCGTCAACTACTCGGGATGACGACGATCTTCCTGAGTTTAACTTCTCTGGTTCTGCAAACCCTCCTCCCCAAGGACTGTCTAGGCTGCCCTCATTTCAGCCAATATCCCGAACTTGGTCTCGGCCTCAGCCTGTGCGCAGGCTCGGGCAGCCGTCGCTGAGGCCTCATTATGTGGTGAACCAACAGCAGCAGCATCTGGGGCAGTTGTCCCAGTTGGGTGCTAACCAGCAGACCGTGGGGGGCCGCCTCCCCTTAAATGCAAATCAACAAGGGACATGGTGGGTTCCTCAGCAAGGCCACAACAACAACCCCATCAATATACATTCTTTTAGCAATTTAGGTGGTAGTCATAGTGGTAGTGGTCAGTTTTATGGAGCATTTGGGCGATCAACGCCTTCCAACCCTTCAAATAACAGAGGGTTTTGA

Coding sequence (CDS)

ATGCTGAGAACTGCAGAAGGATTGCTGTCACTTCCTGTGAAGCGCAAGGCATCGAGTGAGCCTTTTAACTCTCTCTCACAGCAGGCTTCATTGCATAATAAGCGAGTTGCACAGATGGAACCTCGATCATGGTTGCAACAAGTATCTGGATTAGCCAAAAGACCTCCTTTACAAATACCAAAGAATGTCCCAGCTCCCACATCGATGCATTTCCCTGCAGGAACTAAGAGAAAGGTACAGCAAATAGAATCACATCCAACTAAAGTTGTACATCAACGTTCCACTGCTCCCAAATGCCAGAGTGCTCCACTGACTCCAACTTCCAAAATGCAAAATGAGCCGACTGGGTCTGTGAGATCAAAGATGAGGGAATCCTTGGCTGCTGCATTAGCCTTGGTATCGCAGCAGCAAAACAAATCTTCTAATGATGAAAAAAATCCTTTAACTGAGGCTGAGAAGTCTGCAACTCAAATGCAGGAAAATGCTTTAGCGTCTGATCCAGCTATTATTGTTCATGTATCTGATGACTCAAAGAAAATCTTTTCCGAGAAGTTAGATTCTGTTGGTCTTGAAGATAATGTAGGAAGGATGTTAGATAAGAATTTGCTGTGTGTAAATGATAGCGATTTAGAGTCATTAGGATATGATGGACGAGTCTTTCAACCAAATAATATTTTGTCTTATGAAGATATTTCTTTTGGGGATAACTTTTTTATTAAAGACGATCTTTTACAAGAAAATTCTCTCTCTTGGGTACTGGAAGCTGATGTAGGGCTAGCTGATAAAAAGGAAATCAGAACTGATGAACTTCAGAAGATTGATGTTGGTATAGCAAATCAAAACCAAGGATCAAAACCAGTACAGAGTCCTGAGTCTTTGGCATTTAAAATGGAAGAAGAATTATTTAAATTATTTAGTGGTGTTAATAAAAAGTACAAGGAGAAAGGAAGATCCCTTTTGTTCAACCTGAAAGACAGAAATAATCCCGAGCTGAGAGAAAGGGTTATGAATGGGGAAATTACCCCAGAAAGATTATGTTCCATGACTGCCGAGGAACTTGCATCTAAGGAGCTTTCTGAGTGGAGAATGGCCAAGGCTGAAGAACTTGCACAAATGGTAGTTTTACCCAACTCAGAAGTTGATATCAGACGTTTGGTAAGGAAGACGCATAAAGGTGAGTTTCAAGTAGAAGTTGAAGAATACGATAATGCCTCTATCGATGTTTCATCTGGGGTTTCTACGTTTTCTCAGAGTCAACGTAATAAGAATGAGACTGTAGGTGGATCACCTGATGAACCTGATACAATTATGGACGAGTGGAATATTTCTGGCCAGAAAAATGGTGCATCTGACAAGGATGAGTACACCTTTACAATTGCATCGACTGAAGGGTCTGAATTATTGTCACTGCCTCCGATCTCCTCCATAGATGAGTTTATGGAATCCTTTGATACAGAGCCACCTTTCAATATTTTATCTGAAGATACTGGTAAATCGTCTCCTATTTTGGAGAAGGGTGAGCCAGAGCCTGGCTCTCAGTTGAAGGCTGCAGCTCATTCTATGGAAGGCGCAACTGATGTTAGTATAGACAAAAATGAAAATATTGAGTCTTATACAAAAGCAGACATTGGCTCGTCTTCTATTAGCCACATGGATTTGACATCTAGTGATTGTAAAACTGATGAGGACTTGAATGAAAATCAAGCTGGGTTAAGAACATCTGACAGGAATGATGGTACAGTATCTGGTGATAGTAATGCAAAATCTGGGACAGAATCTTTGGCCAGCACATTTAGTTTAGAATATTTATGGGATGGCATCCTCCAGTATAATATTTCGACAATGACTCCGGTCGTGGGTACCTACATAAGTGGTGAAAGAACATCAGCGAAAGATTGGCCTAGCACTCTTGAGATCAAAGGAAGAGTTAGATTGGATGCATTTGAGAAGTTCCTTCAAGAGCTTCCATTATCTCGTAGTCGAGCCGTTATGGTTCTTCA
TTTGGATTTAAAGAAAGGTTGCCCGGAAAGCGACCGAGCAAATCTTCAAGAGGTGGCGGAGTCGTATGTCGCCGATGAGCGAGTTGGTATAGCGGAGCCTGGTTCTGGGGTGGAATTTTATTTTTGCTCTCCACACGGACGGATTCTTGAAATGGTTGGCAGGATCCTTCTAAAGGAAAATAATGAGTTACTTAATGCAATTGAAAATGGCCTAATAGGCGTCGTTGTATGGAGAAAACCTCAATTAACTTTAATGTCACCAAACTCAACGTCACTCCACAAACGCAGTTCAAAAAAGCAACATTTTAGCTCTAGAAGACTGCAGGAGACACCAAACTTGAAAGCTAATGATGTTTCCCCTATGCCTCGAGGCTATTTTCCCGTCGCTAGCGATTATCCTCTGACTGAGGAGGATGATGCTGATGGCGACGATGATGTCCCGCCTGGCTTTGGCCCGTCAACTACTCGGGATGACGACGATCTTCCTGAGTTTAACTTCTCTGGTTCTGCAAACCCTCCTCCCCAAGGACTGTCTAGGCTGCCCTCATTTCAGCCAATATCCCGAACTTGGTCTCGGCCTCAGCCTGTGCGCAGGCTCGGGCAGCCGTCGCTGAGGCCTCATTATGTGGTGAACCAACAGCAGCAGCATCTGGGGCAGTTGTCCCAGTTGGGTGCTAACCAGCAGACCGTGGGGGGCCGCCTCCCCTTAAATGCAAATCAACAAGGGACATGGTGGGTTCCTCAGCAAGGCCACAACAACAACCCCATCAATATACATTCTTTTAGCAATTTAGGTGGTAGTCATAGTGGTAGTGGTCAGTTTTATGGAGCATTTGGGCGATCAACGCCTTCCAACCCTTCAAATAACAGAGGGTTTTGA

Protein sequence

MLRTAEGLLSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAPTSMHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRESLAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKIFSEKLDSVGLEDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIKDDLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEWRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQRNKNETVGGSPDEPDTIMDEWNISGQKNGASDKDEYTFTIASTEGSELLSLPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSIDKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAKSGTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDAFEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFYFCSPHGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSKKQHFSSRRLQETPNLKANDVSPMPRGYFPVASDYPLTEEDDADGDDDVPPGFGPSTTRDDDDLPEFNFSGSANPPPQGLSRLPSFQPISRTWSRPQPVRRLGQPSLRPHYVVNQQQQHLGQLSQLGANQQTVGGRLPLNANQQGTWWVPQQGHNNNPINIHSFSNLGGSHSGSGQFYGAFGRSTPSNPSNNRGF
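The protein sequence corresponds to the CDS read codon by codon. As an illustrative sketch only (not the annotation pipeline used for this record), the following translates a CDS with the standard genetic code; `translate` and `CODON_TABLE` are hypothetical names, and the choice of the standard nuclear code is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T become "X".
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown above.
print(translate("ATGCTGAGAACTGCAGAAGGATTG"))  # prints MLRTAEGL
```

Applied to the last codons of the CDS (`AATAACAGAGGGTTTTGA`), the same function yields `NNRGF`, matching the end of the protein sequence.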
BLAST of Cp4.1LG02g04430 vs. Swiss-Prot
Match: PHF3_HUMAN (PHD finger protein 3 OS=Homo sapiens GN=PHF3 PE=1 SV=3)

HSP 1 Score: 94.4 bits (233), Expect = 7.4e-18
Identity = 49/112 (43.75%), Positives = 70/112 (62.50%), Query Frame = 1

Query: 286  PVQSPESLAFKMEEELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERL 345
            P +    +A K+E+ELF  F   + KYK K RSL+FNLKD  N  L ++V+ GE+TP+ L
Sbjct: 951  PEEKAAKVATKIEKELFSFFRDTDAKYKNKYRSLMFNLKDPKNNILFKKVLKGEVTPDHL 1010

Query: 346  CSMTAEELASKELSEWRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVE 398
              M+ EELASKEL+ WR  +     +M+     EV+ R + + THKGE ++E
Sbjct: 1011 IRMSPEELASKELAAWRRRENRHTIEMIEKEQREVERRPITKITHKGEIEIE 1062
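Each HSP summary line reports identical and positive (similar) positions as a count over the alignment length. The sketch below parses such a line and recomputes the percentages; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption fitted to the line layout shown on this page (it also tolerates the "Postives" spelling found in some report renderings).

```python
import re

# Captures "Identity = a/n (p%), Positives = b/n (q%)".
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return counts and recomputed percentages from an HSP summary line."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, positives, length2 = map(int, match.groups())
    if length != length2:
        raise ValueError("identity and positives report different lengths")
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 49/112 (43.75%), Positives = 70/112 (62.50%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])  # prints 43.75 62.5
```

Recomputing the percentages from the counts is a cheap consistency check when collecting hits from many such reports.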

BLAST of Cp4.1LG02g04430 vs. Swiss-Prot
Match: BYE1_YARLI (Transcription factor BYE1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=BYE1 PE=3 SV=1)

HSP 1 Score: 70.5 bits (171), Expect = 1.2e-10
Identity = 49/125 (39.20%), Positives = 74/125 (59.20%), Query Frame = 1

Query: 283 GSKPVQSPESLAFKMEEELFKLFSGVNKK----YKEKGRSLLFNLKDRNNPELRERVMNG 342
           G  P Q  E+LA  +E+EL+  +  V  +    Y++K R+L FNL+D  N  LR RVM G
Sbjct: 227 GVSPEQFCETLALTIEQELYDAYGTVEPEIGSNYRDKFRTLSFNLRDSKNETLRIRVMTG 286

Query: 343 EITPERLCSMTAEELASKELSEW-RMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQV- 402
           ++TP+ L +M++EE+ + EL +     +AE +   V++    VD    +R+THKGE  V 
Sbjct: 287 QVTPQTLVAMSSEEMMNPELQKLAEEVRAEAIRDTVLV----VDEAPRLRRTHKGEEIVG 346

BLAST of Cp4.1LG02g04430 vs. Swiss-Prot
Match: SPOC1_HUMAN (SPOC domain-containing protein 1 OS=Homo sapiens GN=SPOCD1 PE=2 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 1.7e-09
Identity = 41/109 (37.61%), Positives = 66/109 (60.55%), Query Frame = 1

Query: 291 ESLAFKMEEELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTA 350
           E +A  +E  L+ L  G N +YK K RSLLFNL+D  N +L  +V++G++TP  L  M++
Sbjct: 638 EGIAAGIEAALWDLTQGTNGRYKTKYRSLLFNLRDPRNLDLFLKVVHGDVTPYDLVRMSS 697

Query: 351 EELASKELSEWRMAKAEELAQMVVLPNSEVDIRRL--VRKTHKGEFQVE 398
            +LA +EL+ WR    EE   + ++   + +  RL   + THKGE +++
Sbjct: 698 MQLAPQELARWR--DQEEKRGLNIIEQQQKEPCRLPASKMTHKGEVEIQ 744

BLAST of Cp4.1LG02g04430 vs. Swiss-Prot
Match: TCEA2_BOVIN (Transcription elongation factor A protein 2 OS=Bos taurus GN=TCEA2 PE=2 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 3.7e-09
Identity = 37/78 (47.44%), Positives = 51/78 (65.38%), Query Frame = 1

Query: 291 ESLAFKMEEELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTA 350
           E LA ++EE +F+     + KYK + RS L NLKD  NP LR +V+ G ITP+++  MT+
Sbjct: 165 ECLAGQIEECIFRDVGNTDMKYKNRVRSRLSNLKDAKNPGLRRKVLCGAITPQQIAVMTS 224

Query: 351 EELASKELSEWRMAKAEE 369
           EE+AS EL E R A  +E
Sbjct: 225 EEMASDELKEIRKAMTKE 242

BLAST of Cp4.1LG02g04430 vs. Swiss-Prot
Match: TCEA2_HUMAN (Transcription elongation factor A protein 2 OS=Homo sapiens GN=TCEA2 PE=1 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 6.3e-09
Identity = 35/78 (44.87%), Positives = 51/78 (65.38%), Query Frame = 1

Query: 291 ESLAFKMEEELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTA 350
           E L+ ++EE +F+     + KYK + RS + NLKD  NP+LR  V+ G ITP+++  MT+
Sbjct: 164 ERLSAQIEECIFRDVGNTDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQIAVMTS 223

Query: 351 EELASKELSEWRMAKAEE 369
           EE+AS EL E R A  +E
Sbjct: 224 EEMASDELKEIRKAMTKE 241

BLAST of Cp4.1LG02g04430 vs. TrEMBL
Match: A0A0A0LDR1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G878940 PE=4 SV=1)

HSP 1 Score: 1265.4 bits (3273), Expect = 0.0e+00
Identity = 714/979 (72.93%), Positives = 789/979 (80.59%), Query Frame = 1

Query: 1    MLRTAEGLLSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIP 60
            M+RTAEG+LSLPVKRKAS+EP NSL+QQ+ LHNKRVA ME R WLQ  SG+AKRP LQIP
Sbjct: 93   MVRTAEGMLSLPVKRKASNEPLNSLAQQSPLHNKRVAPMEHRPWLQPASGIAKRPHLQIP 152

Query: 61   KNVPAPTSMHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRS 120
             N PAP  M+ PAGTKRKVQQ+ESHPTKV HQRS + K Q+AP TPTSK+QNEPTGSVRS
Sbjct: 153  NNSPAPAPMYSPAGTKRKVQQMESHPTKVGHQRSNSSKGQTAPPTPTSKIQNEPTGSVRS 212

Query: 121  KMRESLAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKI 180
            KMRESL AALALVSQQ++KSSNDEK+  TEAEK +T  QEN+L+S PAI  HVSDDS+KI
Sbjct: 213  KMRESLTAALALVSQQEDKSSNDEKSSPTEAEKFSTPKQENSLSSGPAI-GHVSDDSRKI 272

Query: 181  FSEKLDSVGLEDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIK 240
            FSEKLDSVGLEDNVG+MLDK+ LCVN SDL++L YDGRVFQPNN+LSYEDISFGDNFFIK
Sbjct: 273  FSEKLDSVGLEDNVGKMLDKSSLCVNVSDLDALRYDGRVFQPNNVLSYEDISFGDNFFIK 332

Query: 241  DDLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEE 300
            DDLLQEN LSWVLEAD+G+ADKKEI TDELQKIDVGI NQNQ +KPVQ+PESLA K+EEE
Sbjct: 333  DDLLQENGLSWVLEADLGVADKKEILTDELQKIDVGIGNQNQVAKPVQTPESLALKIEEE 392

Query: 301  LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSE 360
            LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVM+GEITPERLCSMTAEELASKELSE
Sbjct: 393  LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMSGEITPERLCSMTAEELASKELSE 452

Query: 361  WRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYD-NASIDVSSGVSTFSQSQ 420
            WRMAKAEE AQMVVLP++EVDIRRLV+KTHKGEFQVEVEEYD NAS DVSSG STFSQSQ
Sbjct: 453  WRMAKAEEFAQMVVLPDTEVDIRRLVKKTHKGEFQVEVEEYDNNASADVSSGASTFSQSQ 512

Query: 421  --RNKNETVGGSPDEPDTIMDEWNISGQKNGASDKDEYTFTIASTEGSELLS-------- 480
              RN NE+  GSPDEP+ + DE NISGQKN AS+KD YTFTIAS EGS+L+         
Sbjct: 513  SLRNNNESEDGSPDEPEAVKDEQNISGQKNAASNKDNYTFTIASNEGSDLMQGLMVDDGL 572

Query: 481  -----LPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGA 540
                 LPPI S+DEFMES DTEPPF+IL+E  GK SP+LEKGE EP S+LK AAH  +GA
Sbjct: 573  KDTELLPPIVSLDEFMESLDTEPPFDILAEGAGKLSPVLEKGESEPNSRLKTAAHPPKGA 632

Query: 541  TDVSIDKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGD 600
            TDVS +KN N ES+TKADIGSSSI H+DL  S  K D D N+NQAGLRTSDRND   S D
Sbjct: 633  TDVSTEKN-NEESHTKADIGSSSIGHVDLQPSPTKLDVDSNDNQAGLRTSDRNDVAKSND 692

Query: 601  S-NAKSGTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRV 660
            S NAKS TES AS   LE+LWDGILQYNISTMT VVGTYISGERTSAKDWP  LEIKGRV
Sbjct: 693  SNNAKSETESPASAVKLEHLWDGILQYNISTMTSVVGTYISGERTSAKDWPGILEIKGRV 752

Query: 661  RLDAFEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSG 720
            RLDAFEKFLQELPLSRSRAVMVLHLDLK+G PES++A+L+EVAESYV DERVGIA+PGSG
Sbjct: 753  RLDAFEKFLQELPLSRSRAVMVLHLDLKEGRPESEQADLREVAESYVVDERVGIADPGSG 812

Query: 721  VEFYFCSPHGRILEMVGRILLKE-NNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKR 780
            VEFYFC PHGRILEM+GRILLKE +NE LNAIENGLIGVVVWRK QLT MSPNSTS HKR
Sbjct: 813  VEFYFCPPHGRILEMLGRILLKETSNEALNAIENGLIGVVVWRKTQLTSMSPNSTSHHKR 872

Query: 781  SSKKQHFSSRRLQETPNLKANDVSP---MPR-GYFPVASDYPLTEEDDADGDDDVPPGFG 840
            SSKKQHFSSRR QET N KAN++SP   +PR  YFP+A+ +P  EEDDADG+DDVPPGFG
Sbjct: 873  SSKKQHFSSRRPQETSNFKANNISPKQTIPRSSYFPIATAHPPPEEDDADGEDDVPPGFG 932

Query: 841  PSTTRDDDDLPEFNFSGSANPP-----------PQG-LSRLPSFQPISRTWSRPQPVRRL 900
            PST RDDDDLPEFNFSGSANPP           P+G  SR PSFQP+S+T SRP    R 
Sbjct: 933  PSTARDDDDLPEFNFSGSANPPGFSSQNKHPLTPRGQSSRPPSFQPVSQTGSRPVEQMR- 992

Query: 901  GQPSLRPHYVVNQQQQHLGQLSQLGANQQTVGGRLPLNANQQGTW---------WVPQQG 934
                     +V++  Q+LG+ +   AN    G R   ++     W         W PQ G
Sbjct: 993  --------ELVHKYGQNLGKNTPSTANW---GERSGFSSVAIQPWNDDDDDIPEWQPQAG 1052

BLAST of Cp4.1LG02g04430 vs. TrEMBL
Match: M5VSR9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000459mg PE=4 SV=1)

HSP 1 Score: 646.0 bits (1665), Expect = 7.4e-182
Identity = 427/903 (47.29%), Positives = 545/903 (60.35%), Query Frame = 1

Query: 13  VKRKASSEPF--NSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAPTSMH 72
           VKRKA SE    N  + Q S+ NKRVA ME R WLQQ    A R  +Q+     AP S H
Sbjct: 122 VKRKAPSELMSDNPATHQLSMLNKRVAHMEHRPWLQQAPA-ANRRSVQMESVHNAPLSPH 181

Query: 73  FPAGTKR------------------------KVQQIESHPTKVVHQRSTAPKCQSAPLTP 132
            PA  KR                        K+ ++ES   + V QRS++ K Q     P
Sbjct: 182 LPAPNKRMVKIESGGSVHNAPGSPHLLAPNKKMVKMESFSGRSVSQRSSSQKTQMLQSQP 241

Query: 133 TSKMQNEPTGSVRSKMRESLAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASD 192
           + K+Q E   SVRSKMRESLAAALALV+QQQ+K  +       EA       QEN   + 
Sbjct: 242 SPKLQKESFESVRSKMRESLAAALALVNQQQDKCVDSGSKSQGEAGGIQGSTQENPQPAA 301

Query: 193 PAIIVHVSDDSKKIFSEKLDSVGLEDNVGRMLDKNLLCVNDSDLESL--GYDGRVFQPNN 252
            A+     +  +   S +  S+   D+ G    + +L    +   +L    DG+ FQ +N
Sbjct: 302 DAVYTDSKEPKENFTSSETCSIRKSDD-GEGAGQIILADATTSASALIPTCDGKEFQSSN 361

Query: 253 ILSYEDISFGDNFFIKDDLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGS 312
           IL YED+SF DN F+KD+LLQ N LSWVL++++ + ++K+I+  E QK+D    ++    
Sbjct: 362 ILRYEDVSFNDNLFVKDELLQGNGLSWVLDSEMEMTERKDIQPAEKQKLDHEEMDRRPEE 421

Query: 313 KPVQSPESLAFKMEEELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPER 372
           + VQSPE LA ++E ELFKLF GVNKKYKEKGRSLLFNLKDRNNPELRERVM+GEI PER
Sbjct: 422 QAVQSPEELASRIEAELFKLFGGVNKKYKEKGRSLLFNLKDRNNPELRERVMSGEIPPER 481

Query: 373 LCSMTAEELASKELSEWRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNA 432
           LCSMTAEELASKELSEWRMAKAEELAQMVVLP+SEVD+RRLV+KTHKGE  VEVE+YD+A
Sbjct: 482 LCSMTAEELASKELSEWRMAKAEELAQMVVLPDSEVDMRRLVKKTHKGE--VEVEQYDSA 541

Query: 433 SIDVSSGVSTFSQSQRNKNETVGGSPDEPDTIMDEWNISGQKNGASDK-DEYTFTIASTE 492
           S++V    ++ +QS     E    +P +PD   +E N SG+K+   DK  + TFTI STE
Sbjct: 542 SVEVPVDTTSHAQSLPRSKEMEVSTPLKPDKPKEEGNASGEKSTIEDKTTQCTFTIPSTE 601

Query: 493 GSE----------LLSLPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQ 552
            ++          L  LPPI S+DEFMES DTEPPF IL E   K +PI +K + E GS+
Sbjct: 602 ATDFMQGLMVDDGLKDLPPIVSLDEFMESLDTEPPFEILPE---KVTPISDKDDSETGSE 661

Query: 553 LKAAAHSMEGATDV---SIDKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAG 612
            K +  S +   D     +D+ +  +S + AD+ +S  SH  + +SD   D       A 
Sbjct: 662 SKHSVLSPKNTVDAPPQKLDEIDTTDSKSDADLKTSG-SHAVIKTSD-HADTKSRNVCAD 721

Query: 613 LRTSDRNDGTVSGDSNAKSGTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSA 672
           +++S   + +VS       G          E +W+G LQ N+S M  V+G Y SGE+TSA
Sbjct: 722 VKSSGSPEKSVSRPLGTPKG----------ERVWNGSLQLNLSPMASVIGIYKSGEKTSA 781

Query: 673 KDWPSTLEIKGRVRLDAFEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYV 732
           K+WP  L+IKGRVRLDAFEKFLQELP SRSRAVMV+H   K+G  E++ A+L+EV ESY+
Sbjct: 782 KEWPGFLDIKGRVRLDAFEKFLQELPQSRSRAVMVVHFVPKEGSSEAECASLREVGESYI 841

Query: 733 ADERVGIAEPGSGVEFYFCSPHGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLT 792
            DERVG +EP  GVE YFC PH +  +M+ +I+ KE+ E LN I+NGL+GV+VWRK    
Sbjct: 842 VDERVGFSEPCFGVEIYFCPPHNKTFDMLSKIIQKEHIEALNTIDNGLVGVIVWRK---- 901

Query: 793 LMSPNSTSLHKRSSKKQHFSSRRL----QETPNLKANDVSPMPRGYFPVASDYPLTEEDD 852
           L SP S+S HK  SKKQH+SS       +   NL  N  S   +      +  P      
Sbjct: 902 LTSPKSSSHHKHISKKQHYSSSTTTSSRRHDTNLNTNYTSKPAQ----ARTVTPTNTRSA 961

Query: 853 ADGDDDVPPGFGPSTTRDDDDLPEFNFSGSANPP-PQGLSRLPS--------FQPISRTW 861
            D DDDVPPGFGP   RD+DDLPEFNFSG ANP  PQ  ++ PS          P S T 
Sbjct: 962 HDDDDDVPPGFGPGAPRDEDDLPEFNFSGGANPSLPQYSAQRPSRGPGVAAPVYPKSHTP 997

BLAST of Cp4.1LG02g04430 vs. TrEMBL
Match: W9S1A1_9ROSA (PHD finger protein 3 OS=Morus notabilis GN=L484_007377 PE=4 SV=1)

HSP 1 Score: 643.7 bits (1659), Expect = 3.7e-181
Identity = 405/854 (47.42%), Positives = 543/854 (63.58%), Query Frame = 1

Query: 9   LSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAPTS 68
           +S P KRK   EP +   +  S+  KRVA+M+ R WLQQ+S   KR  +Q+   + +P S
Sbjct: 90  MSAPFKRKTPMEPISQNHENMSMLQKRVAEMQHRPWLQQMSAPNKRN-VQLESMLNSPGS 149

Query: 69  MHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRESLAA 128
            + P   K+ V+  +S   K   QR ++ K Q+A + P +K  +E + SVRSKMRE L A
Sbjct: 150 QNSPTPNKKMVKA-DSFSNKSGSQRMSSQKNQTARVQPPAKASSESSESVRSKMREQLTA 209

Query: 129 ALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKIFSEKLDSV 188
           A +LV+QQ+NK S D +NP      S T+       S  A  V   D + K+ +    + 
Sbjct: 210 AFSLVTQQENKPS-DMQNPGQAVNCSGTEENNEPAGSIAADAV---DRAAKVSNNFARNF 269

Query: 189 GLEDNVGRMLDKNLLC----VNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIKDDLL 248
             ++N G   +   +        S L S+  DGR F  +N+LSYED+ F +NFF+KD+LL
Sbjct: 270 STQENHGGEGESRKILGDARTGGSTLSSM-CDGREFHSSNVLSYEDVPFSENFFVKDELL 329

Query: 249 QENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEELFKL 308
           Q N LSWVL+ D+ +A+KKE +     K D      ++  +  QSP++LAF++E ELFKL
Sbjct: 330 QGNGLSWVLDPDLDMAEKKESQNAGEPKSDHEEVGGDRVEQAYQSPQNLAFEIELELFKL 389

Query: 309 FSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEWRMA 368
           F GVNKKYKEKGRSLLFNLKDRNNPEL ERVM GEI+PERLCSMTAE+LASKELS+WRMA
Sbjct: 390 FGGVNKKYKEKGRSLLFNLKDRNNPELIERVMAGEISPERLCSMTAEDLASKELSQWRMA 449

Query: 369 KAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQ-RNKN 428
           KAEELAQMVVLP+S+VDIRRLV+KTHKGEF VEVE+ D+  +D+S G S+ + S+ +NK 
Sbjct: 450 KAEELAQMVVLPDSDVDIRRLVKKTHKGEFHVEVEQDDSNPVDISGGSSSLAHSEPKNKE 509

Query: 429 ETVGGSPDEPDTIMDEWNISGQK-NGASDKDEYTFTIASTEGSELLS------------- 488
             +  S  +P    D+ N  G+  N    +      +   E S+L+              
Sbjct: 510 MEIPNS--KPVVKKDKVNAQGENSNLEGHRTSCPLMLHPNEESDLMHGLIVDDGFKYVEF 569

Query: 489 LPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSI 548
           LPPI S+DEFMES D+EPPF IL  D+ + +P+  K + E GS  K++  + +   D S 
Sbjct: 570 LPPIVSLDEFMESLDSEPPFEILPLDSERMTPVSGKDDSEVGSGTKSSNPTSKDVVDASS 629

Query: 549 DKNENIE-SYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAK 608
           +K++N++ ++TK D         D+ S D   D  L++     ++ D + G    DS  K
Sbjct: 630 EKHDNVDVTHTKIDA--------DVKSDDSPVDAKLDDGSTDAKSRDNHVGVQPNDSPLK 689

Query: 609 SGTE-SLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDA 668
           + T  +L+ T   E++W G LQ NIS+    V  + SGE+TSA +WP  +EIKGRVRL+A
Sbjct: 690 TETTLALSGTPMGEHVWGGSLQLNISSTANFVCIFKSGEKTSANEWPGFIEIKGRVRLEA 749

Query: 669 FEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFY 728
           FEKFLQELPLSRSRAVMV+H  LK+   E++RA LQEV+ESY+ DERVG AEP SGVE Y
Sbjct: 750 FEKFLQELPLSRSRAVMVVHFVLKESS-ETERAALQEVSESYILDERVGFAEPASGVELY 809

Query: 729 FCSPHGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSKKQ 788
           FC PH + LE +G+I+ +E+ E LNAI+NGLIGV+VWRK  L+ +SP S+S HK + KKQ
Sbjct: 810 FCPPHNKTLETLGKIVHEEHIEALNAIDNGLIGVIVWRK--LSSISPKSSSHHKHALKKQ 869

Query: 789 HFSSRRLQETP-NLKANDVSPMPRGYFPVASDYPLTEEDDADGDDDVPPGFGPSTTRDDD 841
           HF+SRR QE+P N      S  PRG  P A+  P  ++D+    DD+PPGFGP   RD+D
Sbjct: 870 HFTSRRQQESPLNSNFAPKSAAPRGLAP-ANSRPSHDDDE----DDIPPGFGPPVARDED 918

BLAST of Cp4.1LG02g04430 vs. TrEMBL
Match: A0A061GPM9_THECC (SPOC domain / Transcription elongation factor S-II protein, putative isoform 1 OS=Theobroma cacao GN=TCM_038305 PE=4 SV=1)

HSP 1 Score: 642.9 bits (1657), Expect = 6.2e-181
Identity = 456/1023 (44.57%), Positives = 584/1023 (57.09%), Query Frame = 1

Query: 14   KRKASSEPFN--SLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAPTSMHF 73
            KRKA  EP +  S+ Q+  + NKRVA ME R WLQ +S  +KR  +Q+      P S   
Sbjct: 119  KRKAPMEPISTDSVPQRLPVPNKRVAHMEHRPWLQPISASSKRT-VQMQSVSVMPGSQPS 178

Query: 74   PAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRESLAAALA 133
            PA  KR V       T     R+   + +SAP     K+Q E   SVRSKMRESLAAALA
Sbjct: 179  PASIKRSVPSKTGSSTS----RNQPVQMRSAP-----KVQTESFESVRSKMRESLAAALA 238

Query: 134  LVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPA-----IIVHVSDDSKKIFSEKLD 193
            LVSQQQ ++S  EKN   EA  S  + QE++   D        +  +S + + I     D
Sbjct: 239  LVSQQQGENSKVEKNSNGEAVSSPGKTQESSNPVDSNSGNADAVGSMSAEPRGILLSNQD 298

Query: 194  SVGLEDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIKDDLLQE 253
              G     G + D           ++L  DG+ FQ +N+L  ED+ F DN F +D+LLQ 
Sbjct: 299  GAG----GGNISDTT---------QTLKCDGQQFQSSNLLPDEDVPFSDNIFARDELLQG 358

Query: 254  NSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEELFKLFS 313
            N LSWVLE  + +A+ KEI T   Q        +N   K VQSP+ LA+++E ELFKLF 
Sbjct: 359  NGLSWVLEPAIDVAENKEIETVGKQNPVNEKIGENAVEKSVQSPQVLAYQIEAELFKLFG 418

Query: 314  GVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEWRMAKA 373
            GVNKKYKEKGRSLLFNLKDRNNPELRERV++GEI+PERLCSM+AEELASKELS+WR AKA
Sbjct: 419  GVNKKYKEKGRSLLFNLKDRNNPELRERVVSGEISPERLCSMSAEELASKELSQWRQAKA 478

Query: 374  EELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQRNKNETV 433
            EELAQMVVLP++EVDIRRLVRKTHKGEFQVEVE+ D+AS++VS+  S    S+R K E  
Sbjct: 479  EELAQMVVLPDTEVDIRRLVRKTHKGEFQVEVEQTDSASVEVSAATSI---SRRPKTEAK 538

Query: 434  GGSPDEPDTI--MDEWNISGQKNGASDKDEYTFTIASTEGSELLS-------------LP 493
               P    T+   D    +G+K+   D D  T TI S+EG + +              LP
Sbjct: 539  Q-DPTTGKTVGKKDGAGTAGEKSNIEDPD-LTITIPSSEGPDPMQGLMGEDELKDADFLP 598

Query: 494  PISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSIDK 553
            PI S+DEFM+S D+EPPF  L  D  K++ I  K + E GS  K++  + +   D + DK
Sbjct: 599  PIVSLDEFMQSLDSEPPFENLPSDARKAASISNKDDSEAGSDSKSSGRASQDPVDTTPDK 658

Query: 554  NENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAKSGT 613
             E I+                  +S+ K+D D+  N   ++T    + TVS         
Sbjct: 659  LETID------------------ASNVKSDADVKPNDIPVKT----ETTVS--------- 718

Query: 614  ESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDAFEKF 673
                +T   E++W+G+LQ NI+ MT V+GT+ SGE+T  K+WPS LEIKGRVRLDAFEKF
Sbjct: 719  ---VATLKGEHVWEGLLQLNITAMTSVIGTFKSGEKTCTKEWPSLLEIKGRVRLDAFEKF 778

Query: 674  LQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFYFCSP 733
            LQELP+SRSRAVMV+H   K+G  ES+R +L E A+SY+ D RVG AEP SGVE YFC P
Sbjct: 779  LQELPMSRSRAVMVVHFLCKEGSAESERGSLVEAADSYILDGRVGFAEPASGVELYFCPP 838

Query: 734  HGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSKKQHFSS 793
            H R  EM+ +IL K++ E LNAI+NGLIGVVVWRK QL  +SPNSTS HK +SKKQHF+S
Sbjct: 839  HARTHEMLSKILPKDHLEALNAIDNGLIGVVVWRKAQL--ISPNSTSHHKHTSKKQHFTS 898

Query: 794  RRLQETP-NLKANDVSPMPRGYF--PVASDYPLTEEDDADGDDDVPPGFGPSTTRDDDDL 853
            RR Q+   N+ +N  S     +   PV S   L + +D    DDVPPGFGP+T+RD+DDL
Sbjct: 899  RRHQDKDANMNSNFPSKPTFSHSGPPVYSKPSLDDNED----DDVPPGFGPATSRDEDDL 958

Query: 854  PEFNFSGSANPP----PQGLSR----LPSFQPISRTWSRP-----QPVRRLGQPSLRPHY 913
            PEFNFSG +NP     P G       + S    S+T SRP     + V++ GQP+     
Sbjct: 959  PEFNFSGGSNPSGPQYPTGYQSQRVGIASAHLHSQTSSRPVDQMRELVQKYGQPNTNASL 1018

Query: 914  VVNQQ------------QQHLGQLSQLGANQQTVGGRLPLNANQQ--------------- 960
             V+ Q            Q  + Q  Q     Q    + P++  QQ               
Sbjct: 1019 GVSMQPWNDDDDDIPEWQPQISQQQQPQPPTQVHRFQQPMHVPQQLPHQALSTMHVQGLQ 1061

BLAST of Cp4.1LG02g04430 vs. TrEMBL
Match: A0A0B0P3A3_GOSAR (PHD finger 3 OS=Gossypium arboreum GN=F383_26367 PE=4 SV=1)

HSP 1 Score: 634.8 bits (1636), Expect = 1.7e-178
Identity = 452/1020 (44.31%), Positives = 585/1020 (57.35%), Query Frame = 1

Query: 9    LSLPVKRKASSEPF--NSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAP 68
            LS   KRKA  EP   NS+ Q+ SL NKRVAQ E R WLQ +S    + P+Q+     +P
Sbjct: 182  LSTLNKRKAPMEPISPNSIPQKLSLPNKRVAQTEHRPWLQPMSA-PSQSPVQMQSVSNSP 241

Query: 69   TSMHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRESL 128
             S   PA  KR V      P+K     S+AP+ Q A   P+ ++Q E + SVRSKMRESL
Sbjct: 242  GSQLSPASNKRLV------PSK---SGSSAPRNQPAQTRPSPRVQAESSESVRSKMRESL 301

Query: 129  AAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKIFSEKLD 188
            A ALALVSQQQ +++  EKN   EA  S  + +E +   D       S +S  + S   +
Sbjct: 302  AGALALVSQQQGENATPEKNSNGEAMGSPLKREEGSHPVDSG-----SGNSDAVHSISAE 361

Query: 189  SVGL-EDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIKDDLLQ 248
              G+   N G   D N    N    ++L YD +  Q +N+L  ED+ F DN F +D+LLQ
Sbjct: 362  PQGIMRSNQGSSTDGN----NSDTTQTLQYDRQQLQSSNLLPDEDVPFSDNIFARDELLQ 421

Query: 249  ENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEELFKLF 308
             N LSWVLE ++ +A KKE+  D  Q  D     +N+  + + SPE LA+++E ELFKLF
Sbjct: 422  GNGLSWVLEPEIDMARKKELEMDGKQIPDNENVEKNELEQLLPSPEELAYQIEAELFKLF 481

Query: 309  SGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEWRMAK 368
             GVNKKYKEKGRSLLFNLKDRNNPELRERV++GEI PERLCSM+AEELASKELS+WR AK
Sbjct: 482  GGVNKKYKEKGRSLLFNLKDRNNPELRERVVSGEIPPERLCSMSAEELASKELSQWRQAK 541

Query: 369  AEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQRNKNET 428
            AEELAQMV+LP+ EVDIRRLVRKTHKGEFQVEVE+ D++S++VS+G S   + + +  + 
Sbjct: 542  AEELAQMVILPDVEVDIRRLVRKTHKGEFQVEVEQTDSSSVEVSAGTSVTRRPKTDAKQ- 601

Query: 429  VGGSPDEPDTIMDEW--NISGQKNGASDKDEYTFTIASTEGSELLS-------------L 488
               +P    T+  E   N  G+KN   D +  T TI S+EG + +              L
Sbjct: 602  ---APRNNKTVAKEHESNTVGEKNKLEDPN-LTITIPSSEGPDPMQGLMGEDELKDADFL 661

Query: 489  PPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSID 548
            PPI S+DEFM+S D+EPPF  L  D GK++   +K + E G   K++  + +   +   D
Sbjct: 662  PPIVSLDEFMQSLDSEPPFENLPGDAGKATSTSDKDDSEAGYDSKSSGRASQDPPETVPD 721

Query: 549  KNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAKSG 608
            K  N                    SS+ K+D D+  N                D+  K+ 
Sbjct: 722  KPVN------------------TGSSNLKSDSDVKPN----------------DTTTKTE 781

Query: 609  TESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDAFEK 668
            T    +T   E +W+G+LQ N+S+MT VV  + SGE+TS KDWPS +EIKGRVRL+AFE+
Sbjct: 782  TVDSVATLKGERVWEGMLQLNVSSMTSVVCLFKSGEKTSTKDWPSLVEIKGRVRLEAFER 841

Query: 669  FLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFYFCS 728
            FLQELP+SRSRAVMV H+  K+G  ESD A+L E A+SY+ DERVG AEPG+GVE YFC 
Sbjct: 842  FLQELPMSRSRAVMVAHVVCKEGATESDHASLVEAADSYILDERVGFAEPGAGVEIYFCP 901

Query: 729  PHGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSKK-QHF 788
            P+ + LEMV RIL K+  + LNAI+NGLIGVVVWR+ Q  L+SPNSTS HK ++KK QHF
Sbjct: 902  PYTKTLEMVTRILPKDQPQPLNAIDNGLIGVVVWRRAQ--LISPNSTSHHKHNTKKQQHF 961

Query: 789  SSRRLQETPNLKANDVSPMPRGYFPVASDYP----LTEEDDADGDDDVPPGFGPSTTRDD 848
            +S      P+ K + +S +   +       P    L   DD D DDDVPPGFGP+ +RD+
Sbjct: 962  TSS--SRKPHDKDDAISNVNSNFLSKTHVGPPLHSLPPPDDDDDDDDVPPGFGPAASRDE 1021

Query: 849  DDLPEFNFSGSANP-----PPQGLSRLPSFQP--ISRTWSRP-----QPVRRLGQP-SLR 908
            DDLPEFNFSG +NP     P    S+     P   S+T SRP     + +++ GQP S  
Sbjct: 1022 DDLPEFNFSGGSNPSGPKYPAGYQSQRVGMAPHLHSQTPSRPVDQMRELIQKYGQPNSNA 1081

Query: 909  PHYVVNQQ-------------QQHLGQLSQLGANQQTVGG-RLPLNA------------- 960
            P  V  QQ             Q    Q   L      V   + P++A             
Sbjct: 1082 PVGVPIQQWNDDDDDDDIPEWQPQTSQQQHLQPPPSKVRRFQQPMHAPQQLPHQALPAMH 1129

BLAST of Cp4.1LG02g04430 vs. TAIR10
Match: AT5G25520.2 (AT5G25520.2 SPOC domain / Transcription elongation factor S-II protein)

HSP 1 Score: 452.6 bits (1163), Expect = 6.1e-127
Identity = 336/856 (39.25%), Positives = 474/856 (55.37%), Query Frame = 1

Query: 10  SLPVKRKASSEPF---NSLSQQASLHNKRVAQMEPRSWLQQV-SGLAKRPPLQIPKNVPA 69
           S+  KRK+  E     ++ S++    NKRV  +  R WL+Q  S   +R  +  P  +  
Sbjct: 102 SVTGKRKSPPESTLSGSATSEKLDASNKRVEPVHHRPWLEQFYSECIQRGHMPPPATLST 161

Query: 70  PTSMHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRES 129
            T  H P   K KV+Q+E  P      +    K Q+     + K  N+   S+RSKM+ES
Sbjct: 162 KTE-HLPTPAK-KVRQME--PASQKSGKQVMNKKQAGLSQGSVKTLNDGNESLRSKMKES 221

Query: 130 LAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKIFSEKL 189
           LAAALALV + + +S  ++KN  TE E S      N  AS     V V +D     S + 
Sbjct: 222 LAAALALVHEHE-ESPKEKKNSETE-EASVPVADSNEPASACGTSVTVGEDITPAMSTRD 281

Query: 190 DSVGLEDNVGRML------DKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFI 249
           +S   ++  GR L      D  +  VN SD++   +D        +   +D+ F D+ F 
Sbjct: 282 ESFEQKNGNGRTLSQESSKDTKMNYVNQSDVQKTQFD-------EVFPCDDVRFSDSIFT 341

Query: 250 KDDLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEE 309
            D+LLQ N LSWVLE                    V    +N+  K  + PE LA K+E 
Sbjct: 342 GDELLQGNGLSWVLEP-------------------VSDFGENETQKSFEDPELLASKIEL 401

Query: 310 ELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELS 369
           ELFKLF GVNKKYKEKGRSLLFNLKD+NNPELRE VM+G+I+PERLC+MTAEELASKELS
Sbjct: 402 ELFKLFGGVNKKYKEKGRSLLFNLKDKNNPELRESVMSGKISPERLCNMTAEELASKELS 461

Query: 370 EWRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQ 429
           +WR AKAEE+A+MVVL ++++D+R LVRKTHKGEFQVE++  D+ ++DVS+ +++ S+ +
Sbjct: 462 QWRQAKAEEMAEMVVLRDTDIDVRNLVRKTHKGEFQVEIDPVDSGTVDVSAEITSNSKPR 521

Query: 430 -RNKNETVGGSPDEPDTIMDEWNISGQKNGAS--------DKDEYTFTIASTEGSELLSL 489
            + K+              ++ NI   +  +S        + D         E  ++  L
Sbjct: 522 AKAKSSKSSTKATLKKNDSNDKNIKSNQGTSSAVTLPPTEEIDPMQGLSMDDEMKDVGFL 581

Query: 490 PPISSIDEFMESFDTEPPFNILSEDT-GKSSPILEKGEPEPGSQLKAAAHSMEGATDVSI 549
           PPI S+DEFMES ++EPPF    E   GK  P  EK + + GS  K+ + S + +     
Sbjct: 582 PPIVSLDEFMESLNSEPPFGSPHEHPPGKEDPASEKSDSKDGSHSKSPSRSPKQSPK--- 641

Query: 550 DKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAKS 609
           + +E++ S T+ +                KT+    +  AG    D+ DG VS   N   
Sbjct: 642 EPSESVSSKTELE----------------KTNVISPKPDAG----DQLDGDVSKPENT-- 701

Query: 610 GTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDAFE 669
              SL  +   + +WDGILQ + +++  V G + SGE+    +WP+ +E+KGRVRL AF 
Sbjct: 702 ---SLVDSIKEDRIWDGILQLSSASVVSVTGIFKSGEKAKTSEWPTMVEVKGRVRLSAFG 761

Query: 670 KFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFYFC 729
           KF++ELPLSRSR +MV+++  K G  +S R +L EVA+SYVAD+RVG AEP SGVE Y C
Sbjct: 762 KFVKELPLSRSRVLMVMNVVCKNGISQSQRDSLIEVAKSYVADQRVGYAEPTSGVELYLC 821

Query: 730 SPHGRILEMVGRILLKENNELLNAIEN-GLIGVVVWRKPQLTLMSPNSTSLHKRSSKKQH 789
              G  L+++ +I+ K+  + +   E+ GLIGVVVWR+    + SP S   HK   K+QH
Sbjct: 822 PTLGETLDLLSKIISKDYLDEVKCSEDIGLIGVVVWRRA--VVASPGSR--HKPGFKRQH 881

Query: 790 FSS---RRLQETPNLKANDVSPMPRGYFPVAS--DYPLTEEDDADGDDDVPPGFGPSTTR 840
            S+   R +    N K+  VS        V S  ++ L   DD   D+D+PPGFGP   +
Sbjct: 882 SSTGTKRSVLAPENQKSRSVSVTNPSVVNVESMRNHGLVGCDD--DDEDMPPGFGPVAAK 891

BLAST of Cp4.1LG02g04430 vs. TAIR10
Match: AT5G11430.1 (AT5G11430.1 SPOC domain / Transcription elongation factor S-II protein)

HSP 1 Score: 427.2 bits (1097), Expect = 2.7e-119
Identity = 317/847 (37.43%), Positives = 446/847 (52.66%), Query Frame = 1

Query: 7   GLLSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAP 66
           G + L  K K+  +        +   NK+V     R WLQQ+S  A    L IP  + + 
Sbjct: 8   GSMQLVGKHKSLPQTTLGGGSASEAPNKQV-----RPWLQQLSP-ASNGILHIPTKILSQ 67

Query: 67  TSMHFPAGTKRKVQQIESHPTK----VVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKM 126
            ++H     K K  Q ES P K    VV+++   P     P   + K   E   SVRSKM
Sbjct: 68  ETLHSLMHGK-KATQTESAPQKPAKPVVNKKQHVP-----PPQRSVKAMEEVNESVRSKM 127

Query: 127 RESLAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAII-VHVSDDSKKIF 186
           RESLA+ALALV +  +     E     E      +  ++   + PA I V V + +    
Sbjct: 128 RESLASALALVKKDDDSPKGKENIGTVETPVITQENTQSFQPASPASISVPVGEGTMSEM 187

Query: 187 SEKLDSVGLEDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIKD 246
              ++S   +D+         + V+    + + ++    Q + +   +++ F D  F  D
Sbjct: 188 PTSVESSVQKDSE--------IPVDIMMEDVIKFNVLKSQYDEVFPRDNVPFTDIIFPND 247

Query: 247 DLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEEL 306
           DLL  N LSW LE    L + K+  T               G K  Q P+ LA K+E EL
Sbjct: 248 DLLHGNELSWDLEVS-DLGETKDYGTG--------------GEKSFQDPKLLASKIEMEL 307

Query: 307 FKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEW 366
           +KLF GVNKKY+E+GRSLLFNLKD+NNPELRERVM+ EI+ ERLCSMTAEELASKELS+W
Sbjct: 308 YKLFGGVNKKYRERGRSLLFNLKDKNNPELRERVMSEEISAERLCSMTAEELASKELSQW 367

Query: 367 RMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQ-R 426
           R AKAEE+A+MVVL ++++D+R LVRKTHKGEFQVE+E  D  ++DVS G+ + S+ + R
Sbjct: 368 RQAKAEEMAKMVVLQDTDIDVRSLVRKTHKGEFQVEIEPVDRGTVDVSGGIMSRSKRRPR 427

Query: 427 NKNETVGGSPDEPDTIMDEWNISGQKNGASDKDEYTFTIASTEGSELLSLPPISSIDEFM 486
            K+ +V  +  +     D            + D         E  ++  LPPI S+DEFM
Sbjct: 428 AKSHSVKTALKDEAAKADNEKSRSTPPSTEEIDPMQGLGIDDELKDVEFLPPIVSLDEFM 487

Query: 487 ESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSID--KNENIESY 546
           ES D+EPPF     ++       EK + E GS  K+   S +  +D S+   K E I+  
Sbjct: 488 ESLDSEPPFESPHGNSEMQVSPSEKSDSEAGSDSKSPKGSPKELSDKSLPEAKPEKIDEV 547

Query: 547 TKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAKSGTESLASTF 606
           T                ++ K D+D++  +     SD                       
Sbjct: 548 TPE------------FDANVKVDDDISRVEKAAALSDDKG-------------------- 607

Query: 607 SLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDAFEKFLQELPLS 666
             E  WDGILQ ++S++ PV G + SGE+    +WP+ +E+KGRVRL  F KF+QELP S
Sbjct: 608 --ERAWDGILQLSMSSVVPVAGIFKSGEKAETSEWPAMVEVKGRVRLSGFGKFIQELPKS 667

Query: 667 RSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFYFCSPHGRILEM 726
           R+RA+MV++L  K G  ES R +L EV +SYVAD+RVG AEP SGVE Y C   G  L++
Sbjct: 668 RTRALMVMYLAYKDGISESQRGSLIEVIDSYVADQRVGYAEPASGVELYLCPTRGETLDL 727

Query: 727 VGRILLKENNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSKKQH-FSSRRLQET 786
           + +++ +E  + + +++ GL+GVVVWR+    +  P S       SK+QH FSS    +T
Sbjct: 728 LNKVISQEQLDEVKSLDIGLVGVVVWRR--AVVPKPGS------GSKRQHSFSSSIGSKT 777

Query: 787 PNLKAN-----DVSPMPRGYFPVASDYPLTEEDDADGDDDVPPGFGPSTTRDDDDLPEFN 840
             L  N      V+  P     + + +    + D   DDDVPPGFGP  +RD+DDLPEFN
Sbjct: 788 SVLPVNKKQRVHVTEKPLVVASMRNHHHGYVKHDTAADDDVPPGFGPVASRDEDDLPEFN 777

BLAST of Cp4.1LG02g04430 vs. TAIR10
Match: AT2G25640.1 (AT2G25640.1 SPOC domain / Transcription elongation factor S-II protein)

HSP 1 Score: 409.1 bits (1050), Expect = 7.7e-114
Identity = 300/742 (40.43%), Positives = 417/742 (56.20%), Query Frame = 1

Query: 11  LPVKRKASSEPFNSLSQQASLHNKRVA-QMEPRSWLQQVSGLAKRPPLQIPKNVPAPTSM 70
           LP KRK+   P        S+ NKR+A  ME R W           P+ +  +  +P + 
Sbjct: 69  LPGKRKSPLHP--------SVQNKRMALPMEGRPWASA--------PMPVQLSSVSPRTQ 128

Query: 71  HFPAGTKRKVQQIE-SHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRESLAA 130
           + PA    K   +  S P K    R   P  Q   L    K Q+E +GSVRSKMRESLA 
Sbjct: 129 YLPASFVSKNSFVSFSKPGKQAAARK--PTLQKPMLL---KPQSESSGSVRSKMRESLAG 188

Query: 131 ALALVSQQQNKSSNDEKNPLTEAEKSATQMQENA---LASDPAIIVHVSDDSKKIFSEKL 190
           ALA+V Q Q    N+ K  + ++E  A  ++ +    +++   + V VS+ S ++ +   
Sbjct: 189 ALAMV-QCQMDVPNESK--MLDSETVANPLEGHVSGPVSAASGVDVMVSNGSTEMLTLSD 248

Query: 191 DSVGLEDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIKDDLLQ 250
            S     +V  +L + L     SD +       V +       +++S+ DN F KDDLLQ
Sbjct: 249 PSPVAGISVQTVLPEILSIAKTSDAQ-------VPEAVKPFVQDNVSYSDNVFSKDDLLQ 308

Query: 251 ENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEELFKLF 310
            N LSW LE+D+      E   +   ++   +AN     K +  P+ LAF++E ELFKLF
Sbjct: 309 GNDLSWALESDI------EFTVNCQNEMIGAMANDGSLEKLLLDPQVLAFEIETELFKLF 368

Query: 311 SGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEWRMAK 370
            GVNKKYKEKGRSLLFNLKD++NP+LRE+VM GEI  ERLCSM+AEELASKEL+EWR AK
Sbjct: 369 GGVNKKYKEKGRSLLFNLKDKSNPKLREKVMYGEIAAERLCSMSAEELASKELAEWRQAK 428

Query: 371 AEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQRNKNET 430
           AEE+AQMVVL ++EVDIR LVRKTHKGEFQVEVE  D+ S++VS G+S+ + S+    + 
Sbjct: 429 AEEMAQMVVLQDTEVDIRSLVRKTHKGEFQVEVEPMDSGSVEVSVGMSSINWSRTKNFKK 488

Query: 431 VGGSPDEPDTIMDEWNISGQKNGASDKDEYTFTIASTEGSELLSLPPISSIDEFMESFDT 490
              S  +   + +E N S +  G  +      TI     +   SLPPI S+DEFM S D+
Sbjct: 489 KTPSITKTLGVKNELNSSNESTGPIN----GVTIDDEMQAATGSLPPIVSLDEFMSSIDS 548

Query: 491 EPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSIDKNENIESYTKADIGS 550
           E P   LS DT K   + +  +             +E     S  ++ NI      D+ +
Sbjct: 549 ESPSGFLSSDTEKKPSVSDNND-------------VEEVLVSSPKESANI------DLCT 608

Query: 551 SSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAKSGTESLASTFSLEYLWD 610
           S +    L+    K    +N   A + +S          S+ KS T S+      E LW+
Sbjct: 609 SPVKAEALSPLTAKASSPVNAEDADIVSS-------KPSSDLKSKTTSVFIPDG-ERLWE 668

Query: 611 GILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDAFEKFLQELPLSRSRAVMV 670
           G+LQ + ST++ V+G   SGE+T+ K+WP  LEIKGRVRLDAFEKF++ELP SRSRAVMV
Sbjct: 669 GVLQLSPSTVSSVIGILRSGEKTTTKEWPILLEIKGRVRLDAFEKFVRELPNSRSRAVMV 728

Query: 671 LHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFYFCSPHGRILEMVGRILLK 730
           +    K+ C ++++ N+ EV +SY  D RVG AEP SGVE Y C   GR +E++ +I+ +
Sbjct: 729 MCFVCKEECSKTEQENISEVVDSYAKDGRVGYAEPASGVELYLCPTRGRTVEILNKIVPR 742

Query: 731 ENNELLNAI-ENGLIGVVVWRK 747
              + L +I ++GLIGVVVWR+
Sbjct: 789 NQLDFLKSINDDGLIGVVVWRR 742

BLAST of Cp4.1LG02g04430 vs. TAIR10
Match: AT3G29639.1 (AT3G29639.1 BEST Arabidopsis thaliana protein match is: SPOC domain / Transcription elongation factor S-II protein (TAIR:AT5G11430.1))

HSP 1 Score: 69.3 bits (168), Expect = 1.4e-11
Identity = 30/59 (50.85%), Positives = 44/59 (74.58%), Query Frame = 1

Query: 607 ILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDAFEKFLQELPLSRSRAVMV 666
           +LQ ++S++ PV G + SGE+    +WP+ +E+K RVRL  F KF+QELP SR+RA+MV
Sbjct: 4   LLQLSMSSVVPVAGIFKSGEKAETSEWPAMVEVKRRVRLSGFGKFIQELPKSRTRALMV 62

BLAST of Cp4.1LG02g04430 vs. TAIR10
Match: AT2G38560.1 (AT2G38560.1 transcript elongation factor IIS)

HSP 1 Score: 60.1 bits (144), Expect = 8.8e-09
Identity = 36/108 (33.33%), Positives = 64/108 (59.26%), Query Frame = 1

Query: 249 LSWVLEADVGLADK-KEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEELFKLFSG 308
           L+ +L+ +  + DK +E+  + L ++     +  + S     P  +A  +E  +F+    
Sbjct: 200 LTAMLKCNDPVRDKIRELLVEALCRVAGEADDYERESVNASDPLRVAVSVESLMFEKLGR 259

Query: 309 VNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELAS 356
                K K RS++FNL+D NNP+LR RV+ GEI+PE+L +++AE++AS
Sbjct: 260 STGAQKLKYRSIMFNLRDSNNPDLRRRVLTGEISPEKLITLSAEDMAS 307

BLAST of Cp4.1LG02g04430 vs. NCBI nr
Match: gi|659132763|ref|XP_008466371.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503799 [Cucumis melo])

HSP 1 Score: 1280.4 bits (3312), Expect = 0.0e+00
Identity = 697/890 (78.31%), Positives = 756/890 (84.94%), Query Frame = 1

Query: 1   MLRTAEGLLSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIP 60
           M+RTAEG+LSLPVKRKAS+EP NSL+QQ+ LHNKRVA ME R WLQ  SG+AKRP LQIP
Sbjct: 93  MVRTAEGMLSLPVKRKASNEPLNSLAQQSPLHNKRVAPMEHRPWLQPASGIAKRPHLQIP 152

Query: 61  KNVPAPTSMHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRS 120
            N PAP  MH PAGTKRKVQQ+ESHPTKV HQRS + K Q+AP TPTSK+QNEPTGSVRS
Sbjct: 153 NNSPAPAPMHSPAGTKRKVQQMESHPTKVGHQRSNSSKGQTAPPTPTSKIQNEPTGSVRS 212

Query: 121 KMRESLAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKI 180
           KMRESL AALALVSQQ++KSSNDEK+P TEAEKSA   QE +L+S PAI  HVSDDSKKI
Sbjct: 213 KMRESLTAALALVSQQEDKSSNDEKSPRTEAEKSAAPKQEKSLSSGPAI-GHVSDDSKKI 272

Query: 181 FSEKLDSVGLEDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIK 240
           FSEKLDSVGLEDNVG+MLDK+ LCVN SDL++L YDGRVFQPNN+LSYEDISFGDNFFIK
Sbjct: 273 FSEKLDSVGLEDNVGKMLDKSSLCVNVSDLDALRYDGRVFQPNNVLSYEDISFGDNFFIK 332

Query: 241 DDLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEE 300
           DDLLQEN LSWVLEAD+G+ADKKE  TDELQKIDVGI NQNQG+KPVQ+PESLA K+EEE
Sbjct: 333 DDLLQENGLSWVLEADLGVADKKETLTDELQKIDVGIGNQNQGAKPVQTPESLAVKIEEE 392

Query: 301 LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSE 360
           LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVM+GEITPERLCSMTAEELASKELSE
Sbjct: 393 LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMSGEITPERLCSMTAEELASKELSE 452

Query: 361 WRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQR 420
           WRMAKAEE AQMVVLP++EVDIRRLV+KTHKGEFQVEVEEYDNAS DVSSG STFSQSQR
Sbjct: 453 WRMAKAEEFAQMVVLPDTEVDIRRLVKKTHKGEFQVEVEEYDNASADVSSGASTFSQSQR 512

Query: 421 NKNETVGGSPDEPDTIMDEWNISGQKNGASDKDEYTFTIASTEGSELLS----------- 480
           NKNE+  GSPDEP+T+ DE NISGQKN AS+KD YTFTIAS EGS+L+            
Sbjct: 513 NKNESEDGSPDEPETVKDEQNISGQKNAASNKDNYTFTIASNEGSDLMQGLMVDDGLKDT 572

Query: 481 --LPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDV 540
             LPPI S+DEFMES DTEPPF+IL+E  GK SP+ EKGE EP S+LK AAH  +GATDV
Sbjct: 573 ELLPPIVSLDEFMESLDTEPPFDILAEGAGKLSPVSEKGESEPNSRLKTAAHPTKGATDV 632

Query: 541 SIDKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDS-N 600
           S +KN N E +TKADI SSSI H+DL  S  K D D N+NQ GLRTSDRND   S DS N
Sbjct: 633 STEKN-NEEFHTKADIASSSIGHVDLQPSPTKPDVDSNDNQVGLRTSDRNDVAKSNDSNN 692

Query: 601 AKSGTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLD 660
           AKS TES A+   LE+LWDGILQYNISTMT VVGTYISGERTSAKDWP  LEIKGRVRLD
Sbjct: 693 AKSETESPATAVKLEHLWDGILQYNISTMTSVVGTYISGERTSAKDWPGILEIKGRVRLD 752

Query: 661 AFEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEF 720
           AFEKFLQELPLSRSRAVMVLHLDLK+G PES+RA+L+EVAESYV DERVGIAEPGSGVEF
Sbjct: 753 AFEKFLQELPLSRSRAVMVLHLDLKEGRPESERADLREVAESYVVDERVGIAEPGSGVEF 812

Query: 721 YFCSPHGRILEMVGRILLKE-NNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSK 780
           YFC PH RILEM+GRILLKE +NE LNAIENGLIGVVVWRK QLT MSPNSTS HKRSSK
Sbjct: 813 YFCPPHRRILEMLGRILLKETSNEALNAIENGLIGVVVWRKTQLTSMSPNSTSHHKRSSK 872

Query: 781 KQHFSSRRLQETPNLKANDVSP---MPRGYFPVASDYPLTEEDDADGDDDVPPGFGPSTT 840
           KQHFSSRR QET N KAN++SP   MP GYFP+A+  P  EEDDADGDDDVPPGFGPST 
Sbjct: 873 KQHFSSRRPQETSNFKANNISPKQTMPHGYFPIATARPPPEEDDADGDDDVPPGFGPSTA 932

Query: 841 RDDDDLPEFNFSGSANPP-----------PQG-LSRLPSFQPISRTWSRP 861
           RDDDDLPEFNFSGSANPP           P+G  SR PSFQP S+T SRP
Sbjct: 933 RDDDDLPEFNFSGSANPPGFSSQNKHPLTPRGQSSRPPSFQP-SQTGSRP 979

BLAST of Cp4.1LG02g04430 vs. NCBI nr
Match: gi|778687053|ref|XP_004136468.2| (PREDICTED: death-inducer obliterator 1 [Cucumis sativus])

HSP 1 Score: 1265.4 bits (3273), Expect = 0.0e+00
Identity = 714/979 (72.93%), Positives = 789/979 (80.59%), Query Frame = 1

Query: 1    MLRTAEGLLSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIP 60
            M+RTAEG+LSLPVKRKAS+EP NSL+QQ+ LHNKRVA ME R WLQ  SG+AKRP LQIP
Sbjct: 93   MVRTAEGMLSLPVKRKASNEPLNSLAQQSPLHNKRVAPMEHRPWLQPASGIAKRPHLQIP 152

Query: 61   KNVPAPTSMHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRS 120
             N PAP  M+ PAGTKRKVQQ+ESHPTKV HQRS + K Q+AP TPTSK+QNEPTGSVRS
Sbjct: 153  NNSPAPAPMYSPAGTKRKVQQMESHPTKVGHQRSNSSKGQTAPPTPTSKIQNEPTGSVRS 212

Query: 121  KMRESLAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKI 180
            KMRESL AALALVSQQ++KSSNDEK+  TEAEK +T  QEN+L+S PAI  HVSDDS+KI
Sbjct: 213  KMRESLTAALALVSQQEDKSSNDEKSSPTEAEKFSTPKQENSLSSGPAI-GHVSDDSRKI 272

Query: 181  FSEKLDSVGLEDNVGRMLDKNLLCVNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIK 240
            FSEKLDSVGLEDNVG+MLDK+ LCVN SDL++L YDGRVFQPNN+LSYEDISFGDNFFIK
Sbjct: 273  FSEKLDSVGLEDNVGKMLDKSSLCVNVSDLDALRYDGRVFQPNNVLSYEDISFGDNFFIK 332

Query: 241  DDLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEE 300
            DDLLQEN LSWVLEAD+G+ADKKEI TDELQKIDVGI NQNQ +KPVQ+PESLA K+EEE
Sbjct: 333  DDLLQENGLSWVLEADLGVADKKEILTDELQKIDVGIGNQNQVAKPVQTPESLALKIEEE 392

Query: 301  LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSE 360
            LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVM+GEITPERLCSMTAEELASKELSE
Sbjct: 393  LFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMSGEITPERLCSMTAEELASKELSE 452

Query: 361  WRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYD-NASIDVSSGVSTFSQSQ 420
            WRMAKAEE AQMVVLP++EVDIRRLV+KTHKGEFQVEVEEYD NAS DVSSG STFSQSQ
Sbjct: 453  WRMAKAEEFAQMVVLPDTEVDIRRLVKKTHKGEFQVEVEEYDNNASADVSSGASTFSQSQ 512

Query: 421  --RNKNETVGGSPDEPDTIMDEWNISGQKNGASDKDEYTFTIASTEGSELLS-------- 480
              RN NE+  GSPDEP+ + DE NISGQKN AS+KD YTFTIAS EGS+L+         
Sbjct: 513  SLRNNNESEDGSPDEPEAVKDEQNISGQKNAASNKDNYTFTIASNEGSDLMQGLMVDDGL 572

Query: 481  -----LPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGA 540
                 LPPI S+DEFMES DTEPPF+IL+E  GK SP+LEKGE EP S+LK AAH  +GA
Sbjct: 573  KDTELLPPIVSLDEFMESLDTEPPFDILAEGAGKLSPVLEKGESEPNSRLKTAAHPPKGA 632

Query: 541  TDVSIDKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGD 600
            TDVS +KN N ES+TKADIGSSSI H+DL  S  K D D N+NQAGLRTSDRND   S D
Sbjct: 633  TDVSTEKN-NEESHTKADIGSSSIGHVDLQPSPTKLDVDSNDNQAGLRTSDRNDVAKSND 692

Query: 601  S-NAKSGTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRV 660
            S NAKS TES AS   LE+LWDGILQYNISTMT VVGTYISGERTSAKDWP  LEIKGRV
Sbjct: 693  SNNAKSETESPASAVKLEHLWDGILQYNISTMTSVVGTYISGERTSAKDWPGILEIKGRV 752

Query: 661  RLDAFEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSG 720
            RLDAFEKFLQELPLSRSRAVMVLHLDLK+G PES++A+L+EVAESYV DERVGIA+PGSG
Sbjct: 753  RLDAFEKFLQELPLSRSRAVMVLHLDLKEGRPESEQADLREVAESYVVDERVGIADPGSG 812

Query: 721  VEFYFCSPHGRILEMVGRILLKE-NNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKR 780
            VEFYFC PHGRILEM+GRILLKE +NE LNAIENGLIGVVVWRK QLT MSPNSTS HKR
Sbjct: 813  VEFYFCPPHGRILEMLGRILLKETSNEALNAIENGLIGVVVWRKTQLTSMSPNSTSHHKR 872

Query: 781  SSKKQHFSSRRLQETPNLKANDVSP---MPR-GYFPVASDYPLTEEDDADGDDDVPPGFG 840
            SSKKQHFSSRR QET N KAN++SP   +PR  YFP+A+ +P  EEDDADG+DDVPPGFG
Sbjct: 873  SSKKQHFSSRRPQETSNFKANNISPKQTIPRSSYFPIATAHPPPEEDDADGEDDVPPGFG 932

Query: 841  PSTTRDDDDLPEFNFSGSANPP-----------PQG-LSRLPSFQPISRTWSRPQPVRRL 900
            PST RDDDDLPEFNFSGSANPP           P+G  SR PSFQP+S+T SRP    R 
Sbjct: 933  PSTARDDDDLPEFNFSGSANPPGFSSQNKHPLTPRGQSSRPPSFQPVSQTGSRPVEQMR- 992

Query: 901  GQPSLRPHYVVNQQQQHLGQLSQLGANQQTVGGRLPLNANQQGTW---------WVPQQG 934
                     +V++  Q+LG+ +   AN    G R   ++     W         W PQ G
Sbjct: 993  --------ELVHKYGQNLGKNTPSTANW---GERSGFSSVAIQPWNDDDDDIPEWQPQAG 1052

BLAST of Cp4.1LG02g04430 vs. NCBI nr
Match: gi|1009177572|ref|XP_015870045.1| (PREDICTED: uncharacterized protein LOC107407297 [Ziziphus jujuba])

HSP 1 Score: 681.8 bits (1758), Expect = 1.7e-192
Identity = 435/901 (48.28%), Positives = 567/901 (62.93%), Query Frame = 1

Query: 9   LSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAPTS 68
           LS   KRKA  EP    +   S+ +KR+AQME R WLQQVSG  KR  +Q+     AP S
Sbjct: 120 LSTNFKRKAPMEP----NPHNSMSHKRMAQMEHRPWLQQVSGSNKRV-VQLDSVPNAPAS 179

Query: 69  MHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRESLAA 128
            H P+  K+ V+ IES   K   QRS++ K Q+  + P+SK   E + SVRSKMRESLAA
Sbjct: 180 PHLPSPNKKTVK-IESFSNKSALQRSSSQKNQNVQMQPSSKASTESSESVRSKMRESLAA 239

Query: 129 ALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKIFSEKLDSV 188
           AL+LV Q +NK SN +        ++   +Q    A +      ++++SK        S 
Sbjct: 240 ALSLVDQLKNKPSNSQSEAGNSQVRTEENLQPGGSAFEAGNAESITEESKDTLHSIGSSG 299

Query: 189 GLEDNVGRMLDKNLLCVNDSDLESLG-YDGRVFQPNNILSYEDISFGDNFFIKDDLLQEN 248
              ++VG    +    V   D      +D R FQ  N+L Y D+SF +N F+KD+LLQ N
Sbjct: 300 QKSNDVGGGSLRGFADVRTDDFSKTSVHDEREFQSCNVLPY-DVSFSENLFVKDELLQLN 359

Query: 249 SLSWVLEADVGLADKKEIRTDELQKIDVG----IANQNQGSKPVQSPESLAFKMEEELFK 308
            LSWVL++D+ L + KEI+    + +D G    +  +   S   +SP+ LA K+E ELFK
Sbjct: 360 GLSWVLDSDMQLTETKEIQASGKRNLDCGGVGGVMAEQATSNLQRSPQHLASKIEAELFK 419

Query: 309 LFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEWRM 368
           LF GVNKKYKEKGRSLLFNLKDRNNPELRERVM+GEI PERLCSM+AEELASKELSEWRM
Sbjct: 420 LFGGVNKKYKEKGRSLLFNLKDRNNPELRERVMSGEIPPERLCSMSAEELASKELSEWRM 479

Query: 369 AKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQRNKN 428
           AKAEELAQMVVLP+SEVDIRRLV+KTHKGEFQVEVE+ +    +VS G S+  QSQ    
Sbjct: 480 AKAEELAQMVVLPDSEVDIRRLVKKTHKGEFQVEVEQDNIVPAEVSVGTSSLGQSQTKSK 539

Query: 429 ETVGGSPDEPDTIMDEWNISGQKNGASDKD-EYTFTIASTEGSELLS------------- 488
           ++   +P +P+    + N SG+ + + +++  YT TI S+EG++ +              
Sbjct: 540 DSTR-TPKKPEGGKGQQNASGENSSSGEQNGSYTLTIPSSEGTDPMEGLMVDDGLKDAEF 599

Query: 489 LPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSI 548
           LPPI S+DEFMES D+EPPF  +  D  K +P  +K + E GS+LK+   + +   D S 
Sbjct: 600 LPPIVSLDEFMESLDSEPPFENMPVDAAKMTPTSDKDDSEVGSELKSMDPTPKDTADASP 659

Query: 549 DKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRND--GTVSGDSNA 608
              +N++        + +ISH +L +     D DL  N      + R    G  S DS+ 
Sbjct: 660 RNLDNVDD-------NVAISHANLDADIKSNDSDLKSNDGDSDVNSRAGLAGKKSNDSSV 719

Query: 609 KSGTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDA 668
           +S T +L+ST   E +W G+LQ NIST   V+G + SGE+TSAK+WP  LEIKGRV+LDA
Sbjct: 720 ESET-ALSSTQKGEQVWGGLLQLNISTTASVIGIFKSGEKTSAKEWPGFLEIKGRVKLDA 779

Query: 669 FEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFY 728
           FEKFLQELPLSRSRAVMV+H  LK G PES++A+L+EVA+SY+ DERVG AEP  GVE Y
Sbjct: 780 FEKFLQELPLSRSRAVMVVHFVLKVGSPESEQASLKEVADSYIVDERVGFAEPAPGVELY 839

Query: 729 FCSPHGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSKKQ 788
           FC P+ + LEM+G+I+ KE+ E +NAI+NGLIGV+VWRK  LT  SP S+S HK  SKK 
Sbjct: 840 FCPPY-KTLEMLGKIIQKEHIEAVNAIDNGLIGVIVWRK--LTTTSPKSSSQHKHVSKKN 899

Query: 789 HFSSRRLQETPNLKANDVSPMPRGYFPVASDYPLTEEDDADGDDDVPPGFGPSTTRDDDD 848
           HFSSRR Q+T NL A   +P         +  P    DD   DDD+PPGFGP  +RD+DD
Sbjct: 900 HFSSRRHQDT-NLNAKYTTPKSTASHGQDTTIPRPSPDD---DDDIPPGFGPPASRDEDD 959

Query: 849 LPEFNFSG---------SANPPPQGLSRLPSFQPISRTWSRP-----QPVRRLGQPSLRP 875
           LPEFNFSG         SA  P +GL  + S++P S+T SRP     + V+R GQP+  P
Sbjct: 960 LPEFNFSGGSNSSVPPFSAQNPSRGLG-IASYRPPSQTSSRPVDQMRELVQRYGQPNTSP 996

BLAST of Cp4.1LG02g04430 vs. NCBI nr
Match: gi|595811283|ref|XP_007203213.1| (hypothetical protein PRUPE_ppa000459mg [Prunus persica])

HSP 1 Score: 646.0 bits (1665), Expect = 1.1e-181
Identity = 427/903 (47.29%), Positives = 545/903 (60.35%), Query Frame = 1

Query: 13  VKRKASSEPF--NSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAPTSMH 72
           VKRKA SE    N  + Q S+ NKRVA ME R WLQQ    A R  +Q+     AP S H
Sbjct: 122 VKRKAPSELMSDNPATHQLSMLNKRVAHMEHRPWLQQAPA-ANRRSVQMESVHNAPLSPH 181

Query: 73  FPAGTKR------------------------KVQQIESHPTKVVHQRSTAPKCQSAPLTP 132
            PA  KR                        K+ ++ES   + V QRS++ K Q     P
Sbjct: 182 LPAPNKRMVKIESGGSVHNAPGSPHLLAPNKKMVKMESFSGRSVSQRSSSQKTQMLQSQP 241

Query: 133 TSKMQNEPTGSVRSKMRESLAAALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASD 192
           + K+Q E   SVRSKMRESLAAALALV+QQQ+K  +       EA       QEN   + 
Sbjct: 242 SPKLQKESFESVRSKMRESLAAALALVNQQQDKCVDSGSKSQGEAGGIQGSTQENPQPAA 301

Query: 193 PAIIVHVSDDSKKIFSEKLDSVGLEDNVGRMLDKNLLCVNDSDLESL--GYDGRVFQPNN 252
            A+     +  +   S +  S+   D+ G    + +L    +   +L    DG+ FQ +N
Sbjct: 302 DAVYTDSKEPKENFTSSETCSIRKSDD-GEGAGQIILADATTSASALIPTCDGKEFQSSN 361

Query: 253 ILSYEDISFGDNFFIKDDLLQENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGS 312
           IL YED+SF DN F+KD+LLQ N LSWVL++++ + ++K+I+  E QK+D    ++    
Sbjct: 362 ILRYEDVSFNDNLFVKDELLQGNGLSWVLDSEMEMTERKDIQPAEKQKLDHEEMDRRPEE 421

Query: 313 KPVQSPESLAFKMEEELFKLFSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPER 372
           + VQSPE LA ++E ELFKLF GVNKKYKEKGRSLLFNLKDRNNPELRERVM+GEI PER
Sbjct: 422 QAVQSPEELASRIEAELFKLFGGVNKKYKEKGRSLLFNLKDRNNPELRERVMSGEIPPER 481

Query: 373 LCSMTAEELASKELSEWRMAKAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNA 432
           LCSMTAEELASKELSEWRMAKAEELAQMVVLP+SEVD+RRLV+KTHKGE  VEVE+YD+A
Sbjct: 482 LCSMTAEELASKELSEWRMAKAEELAQMVVLPDSEVDMRRLVKKTHKGE--VEVEQYDSA 541

Query: 433 SIDVSSGVSTFSQSQRNKNETVGGSPDEPDTIMDEWNISGQKNGASDK-DEYTFTIASTE 492
           S++V    ++ +QS     E    +P +PD   +E N SG+K+   DK  + TFTI STE
Sbjct: 542 SVEVPVDTTSHAQSLPRSKEMEVSTPLKPDKPKEEGNASGEKSTIEDKTTQCTFTIPSTE 601

Query: 493 GSE----------LLSLPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQ 552
            ++          L  LPPI S+DEFMES DTEPPF IL E   K +PI +K + E GS+
Sbjct: 602 ATDFMQGLMVDDGLKDLPPIVSLDEFMESLDTEPPFEILPE---KVTPISDKDDSETGSE 661

Query: 553 LKAAAHSMEGATDV---SIDKNENIESYTKADIGSSSISHMDLTSSDCKTDEDLNENQAG 612
            K +  S +   D     +D+ +  +S + AD+ +S  SH  + +SD   D       A 
Sbjct: 662 SKHSVLSPKNTVDAPPQKLDEIDTTDSKSDADLKTSG-SHAVIKTSD-HADTKSRNVCAD 721

Query: 613 LRTSDRNDGTVSGDSNAKSGTESLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSA 672
           +++S   + +VS       G          E +W+G LQ N+S M  V+G Y SGE+TSA
Sbjct: 722 VKSSGSPEKSVSRPLGTPKG----------ERVWNGSLQLNLSPMASVIGIYKSGEKTSA 781

Query: 673 KDWPSTLEIKGRVRLDAFEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYV 732
           K+WP  L+IKGRVRLDAFEKFLQELP SRSRAVMV+H   K+G  E++ A+L+EV ESY+
Sbjct: 782 KEWPGFLDIKGRVRLDAFEKFLQELPQSRSRAVMVVHFVPKEGSSEAECASLREVGESYI 841

Query: 733 ADERVGIAEPGSGVEFYFCSPHGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLT 792
            DERVG +EP  GVE YFC PH +  +M+ +I+ KE+ E LN I+NGL+GV+VWRK    
Sbjct: 842 VDERVGFSEPCFGVEIYFCPPHNKTFDMLSKIIQKEHIEALNTIDNGLVGVIVWRK---- 901

Query: 793 LMSPNSTSLHKRSSKKQHFSSRRL----QETPNLKANDVSPMPRGYFPVASDYPLTEEDD 852
           L SP S+S HK  SKKQH+SS       +   NL  N  S   +      +  P      
Sbjct: 902 LTSPKSSSHHKHISKKQHYSSSTTTSSRRHDTNLNTNYTSKPAQ----ARTVTPTNTRSA 961

Query: 853 ADGDDDVPPGFGPSTTRDDDDLPEFNFSGSANPP-PQGLSRLPS--------FQPISRTW 861
            D DDDVPPGFGP   RD+DDLPEFNFSG ANP  PQ  ++ PS          P S T 
Sbjct: 962 HDDDDDVPPGFGPGAPRDEDDLPEFNFSGGANPSLPQYSAQRPSRGPGVAAPVYPKSHTP 997

BLAST of Cp4.1LG02g04430 vs. NCBI nr
Match: gi|703147406|ref|XP_010109043.1| (PHD finger protein 3 [Morus notabilis])

HSP 1 Score: 643.7 bits (1659), Expect = 5.3e-181
Identity = 405/854 (47.42%), Positives = 543/854 (63.58%), Query Frame = 1

Query: 9   LSLPVKRKASSEPFNSLSQQASLHNKRVAQMEPRSWLQQVSGLAKRPPLQIPKNVPAPTS 68
           +S P KRK   EP +   +  S+  KRVA+M+ R WLQQ+S   KR  +Q+   + +P S
Sbjct: 90  MSAPFKRKTPMEPISQNHENMSMLQKRVAEMQHRPWLQQMSAPNKRN-VQLESMLNSPGS 149

Query: 69  MHFPAGTKRKVQQIESHPTKVVHQRSTAPKCQSAPLTPTSKMQNEPTGSVRSKMRESLAA 128
            + P   K+ V+  +S   K   QR ++ K Q+A + P +K  +E + SVRSKMRE L A
Sbjct: 150 QNSPTPNKKMVKA-DSFSNKSGSQRMSSQKNQTARVQPPAKASSESSESVRSKMREQLTA 209

Query: 129 ALALVSQQQNKSSNDEKNPLTEAEKSATQMQENALASDPAIIVHVSDDSKKIFSEKLDSV 188
           A +LV+QQ+NK S D +NP      S T+       S  A  V   D + K+ +    + 
Sbjct: 210 AFSLVTQQENKPS-DMQNPGQAVNCSGTEENNEPAGSIAADAV---DRAAKVSNNFARNF 269

Query: 189 GLEDNVGRMLDKNLLC----VNDSDLESLGYDGRVFQPNNILSYEDISFGDNFFIKDDLL 248
             ++N G   +   +        S L S+  DGR F  +N+LSYED+ F +NFF+KD+LL
Sbjct: 270 STQENHGGEGESRKILGDARTGGSTLSSM-CDGREFHSSNVLSYEDVPFSENFFVKDELL 329

Query: 249 QENSLSWVLEADVGLADKKEIRTDELQKIDVGIANQNQGSKPVQSPESLAFKMEEELFKL 308
           Q N LSWVL+ D+ +A+KKE +     K D      ++  +  QSP++LAF++E ELFKL
Sbjct: 330 QGNGLSWVLDPDLDMAEKKESQNAGEPKSDHEEVGGDRVEQAYQSPQNLAFEIELELFKL 389

Query: 309 FSGVNKKYKEKGRSLLFNLKDRNNPELRERVMNGEITPERLCSMTAEELASKELSEWRMA 368
           F GVNKKYKEKGRSLLFNLKDRNNPEL ERVM GEI+PERLCSMTAE+LASKELS+WRMA
Sbjct: 390 FGGVNKKYKEKGRSLLFNLKDRNNPELIERVMAGEISPERLCSMTAEDLASKELSQWRMA 449

Query: 369 KAEELAQMVVLPNSEVDIRRLVRKTHKGEFQVEVEEYDNASIDVSSGVSTFSQSQ-RNKN 428
           KAEELAQMVVLP+S+VDIRRLV+KTHKGEF VEVE+ D+  +D+S G S+ + S+ +NK 
Sbjct: 450 KAEELAQMVVLPDSDVDIRRLVKKTHKGEFHVEVEQDDSNPVDISGGSSSLAHSEPKNKE 509

Query: 429 ETVGGSPDEPDTIMDEWNISGQK-NGASDKDEYTFTIASTEGSELLS------------- 488
             +  S  +P    D+ N  G+  N    +      +   E S+L+              
Sbjct: 510 MEIPNS--KPVVKKDKVNAQGENSNLEGHRTSCPLMLHPNEESDLMHGLIVDDGFKYVEF 569

Query: 489 LPPISSIDEFMESFDTEPPFNILSEDTGKSSPILEKGEPEPGSQLKAAAHSMEGATDVSI 548
           LPPI S+DEFMES D+EPPF IL  D+ + +P+  K + E GS  K++  + +   D S 
Sbjct: 570 LPPIVSLDEFMESLDSEPPFEILPLDSERMTPVSGKDDSEVGSGTKSSNPTSKDVVDASS 629

Query: 549 DKNENIE-SYTKADIGSSSISHMDLTSSDCKTDEDLNENQAGLRTSDRNDGTVSGDSNAK 608
           +K++N++ ++TK D         D+ S D   D  L++     ++ D + G    DS  K
Sbjct: 630 EKHDNVDVTHTKIDA--------DVKSDDSPVDAKLDDGSTDAKSRDNHVGVQPNDSPLK 689

Query: 609 SGTE-SLASTFSLEYLWDGILQYNISTMTPVVGTYISGERTSAKDWPSTLEIKGRVRLDA 668
           + T  +L+ T   E++W G LQ NIS+    V  + SGE+TSA +WP  +EIKGRVRL+A
Sbjct: 690 TETTLALSGTPMGEHVWGGSLQLNISSTANFVCIFKSGEKTSANEWPGFIEIKGRVRLEA 749

Query: 669 FEKFLQELPLSRSRAVMVLHLDLKKGCPESDRANLQEVAESYVADERVGIAEPGSGVEFY 728
           FEKFLQELPLSRSRAVMV+H  LK+   E++RA LQEV+ESY+ DERVG AEP SGVE Y
Sbjct: 750 FEKFLQELPLSRSRAVMVVHFVLKESS-ETERAALQEVSESYILDERVGFAEPASGVELY 809

Query: 729 FCSPHGRILEMVGRILLKENNELLNAIENGLIGVVVWRKPQLTLMSPNSTSLHKRSSKKQ 788
           FC PH + LE +G+I+ +E+ E LNAI+NGLIGV+VWRK  L+ +SP S+S HK + KKQ
Sbjct: 810 FCPPHNKTLETLGKIVHEEHIEALNAIDNGLIGVIVWRK--LSSISPKSSSHHKHALKKQ 869

Query: 789 HFSSRRLQETP-NLKANDVSPMPRGYFPVASDYPLTEEDDADGDDDVPPGFGPSTTRDDD 841
           HF+SRR QE+P N      S  PRG  P A+  P  ++D+    DD+PPGFGP   RD+D
Sbjct: 870 HFTSRRQQESPLNSNFAPKSAAPRGLAP-ANSRPSHDDDE----DDIPPGFGPPVARDED 918

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PHF3_HUMAN   7.4e-18   43.75     PHD finger protein 3 OS=Homo sapiens GN=PHF3 PE=1 SV=3
BYE1_YARLI   1.2e-10   39.20     Transcription factor BYE1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=BY...
SPOC1_HUMAN  1.7e-09   37.61     SPOC domain-containing protein 1 OS=Homo sapiens GN=SPOCD1 PE=2 SV=1
TCEA2_BOVIN  3.7e-09   47.44     Transcription elongation factor A protein 2 OS=Bos taurus GN=TCEA2 PE=2 SV=1
TCEA2_HUMAN  6.3e-09   44.87     Transcription elongation factor A protein 2 OS=Homo sapiens GN=TCEA2 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0LDR1_CUCSA  0.0e+00   72.93     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G878940 PE=4 SV=1
M5VSR9_PRUPE      7.4e-182  47.29     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000459mg PE=4 SV=1
W9S1A1_9ROSA      3.7e-181  47.42     PHD finger protein 3 OS=Morus notabilis GN=L484_007377 PE=4 SV=1
A0A061GPM9_THECC  6.2e-181  44.57     SPOC domain / Transcription elongation factor S-II protein, putative isoform 1 O...
A0A0B0P3A3_GOSAR  1.7e-178  44.31     PHD finger 3 OS=Gossypium arboreum GN=F383_26367 PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G25520.2  6.1e-127  39.25     SPOC domain / Transcription elongation factor S-II protein
AT5G11430.1  2.7e-119  37.43     SPOC domain / Transcription elongation factor S-II protein
AT2G25640.1  7.7e-114  40.43     SPOC domain / Transcription elongation factor S-II protein
AT3G29639.1  1.4e-11   50.85     BEST Arabidopsis thaliana protein match is: SPOC domain / Transcript...
AT2G38560.1  8.8e-09   33.33     transcript elongation factor IIS

Match Name                         E-value   Identity  Description
gi|659132763|ref|XP_008466371.1|   0.0e+00   78.31     PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503799 [Cucumis me...
gi|778687053|ref|XP_004136468.2|   0.0e+00   72.93     PREDICTED: death-inducer obliterator 1 [Cucumis sativus]
gi|1009177572|ref|XP_015870045.1|  1.7e-192  48.28     PREDICTED: uncharacterized protein LOC107407297 [Ziziphus jujuba]
gi|595811283|ref|XP_007203213.1|   1.1e-181  47.29     hypothetical protein PRUPE_ppa000459mg [Prunus persica]
gi|703147406|ref|XP_010109043.1|   5.3e-181  47.42     PHD finger protein 3 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006351  transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR012921   SPOC_C
IPR003618   TFIIS_cen_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0034645 cellular macromolecule biosynthetic process
biological_process GO:0044238 primary metabolic process
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0005488 binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g04430.1  Cp4.1LG02g04430.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                       Source   Source Term        Source Description                    Alignment
IPR003618  Transcription elongation factor S-II, central domain  GENE3D   G3DSA:1.10.472.30  -                                     coord: 111..134, score: 1.1E-35; coord: 284..357, score: 1.1
IPR003618  Transcription elongation factor S-II, central domain  PFAM     PF07500            TFIIS_M                               coord: 274..368, score: 5.6
IPR003618  Transcription elongation factor S-II, central domain  SMART    SM00510            mid_6                                 coord: 242..364, score: 6.1
IPR003618  Transcription elongation factor S-II, central domain  PROFILE  PS51321            TFIIS_CENTRAL                         coord: 265..381, score: 33
IPR003618  Transcription elongation factor S-II, central domain  unknown  SSF46942           Elongation factor TFIIS domain 2      coord: 286..357, score: 6.02
IPR012921  Spen paralogue and orthologue SPOC, C-terminal        PFAM     PF07744            SPOC                                  coord: 604..709, score: 5.6
None       No IPR available                                      PANTHER  PTHR11477          TRANSCRIPTION ELONGATION FACTOR S-II  coord: 263..565, score: 7.1E-136; coord: 584..882, score: 7.1E
None       No IPR available                                      PANTHER  PTHR11477:SF17     PROTEIN PARTNER OF SNF                coord: 263..565, score: 7.1E-136; coord: 584..882, score: 7.1E
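
The alignment coordinates in the InterPro annotations can be converted to domain lengths with a few lines of Python. This is a minimal sketch that assumes the `coord: start..end` values are inclusive, 1-based residue positions on the predicted protein. The row list is copied by hand from the annotations above, and the helper names `spans` and `span_length` are hypothetical.

```python
import re

# (source, source term, alignment text) copied from the InterPro annotations.
ROWS = [
    ("PFAM", "PF07500", "coord: 274..368"),
    ("SMART", "SM00510", "coord: 242..364"),
    ("PROFILE", "PS51321", "coord: 265..381"),
    ("PFAM", "PF07744", "coord: 604..709"),
    ("PANTHER", "PTHR11477", "coord: 263..565 coord: 584..882"),
]

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def spans(text):
    """Extract every (start, end) pair from an alignment string."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length in residues, assuming inclusive 1-based coordinates."""
    return end - start + 1


for source, term, text in ROWS:
    for start, end in spans(text):
        print(f"{source} {term}: {start}..{end} ({span_length(start, end)} residues)")
```

Under the inclusive-coordinate assumption, the SPOC domain match (PF07744, 604..709) covers 106 residues and the TFIIS_M match (PF07500, 274..368) covers 95.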

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG02g04430  Cp4.1LG06g03570  Cucurbita pepo (Zucchini)  cpecpeB467