Cp4.1LG15g00030 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG15g00030
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionMediator of RNA polymerase II transcription subunit 33A
LocationCp4.1LG15 : 70610 .. 83069 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGAACCAGGGTTCAAATCTAATTTGATTTCTTCAATTACTTAAACCAGGGTTTTCTTCTTCCATCATTTTACGCGCCGCGAACAATTTCATGTACCTCTAAATTTTGGACATACACGAGAATGAACCCCAAATGTTTTCTCAAATAACTTCAATCTACTCACCCTGAATCTCTTTTCCATCCATTATTCCCTTTCCCTGAGCAAGAAATTCTGTGGGTCAGATTCTCCCATGGCGGTTTCCGCTCAACCGCCCGGTCAACTGCAGGGGATCGCTGGTGTTTGGGACACTGTGTTGGAGCTTACCAAGTCGGCGCAGGAGAAGAACAGCGATCCACTGCTTTGGGCGGTTCAGCTCAGCTCCAGCCTCAATTCGGCTAGTGTTTCCTTGCCGTCTGTCGAGCTCGCCCACCTCTTGGTCTCTCATATTTGTTGGGACAATCACGTTCCGATCATGTGGAAATTCCTTGAGAAGGCAATGACCGCCAGAATCGTTCCTCCCCTGCTGGTTATTGCTCTTCTTTCTACCAGGTCTCATTTGTTCTAGCATTTTGATTAAAGCTTCTTCGCTTCCTCGAATTGTTATTTAAATGTATCATATTGAGTGCTTCATTAGTAATACTTTGAATTTGCTTATGGTTTCTTTATGTATCCGCGTGGACATTTGTGAAAATAAGTTACCGTATGCTTAGATTTCGTGCGTTGTTGGGTAACTGCCTTTCTGCTTGCACATCATCCATAATCAGCTATAATTTCGTTGTGCTGTAGTTGTCTACTGTTGATATATGTTCTAGTCTGTATCTGTTGCCTCTTGTCCAATAATTCACTTCAATTTGCATCAGATACTGGCTTTGAGTGTGGAAGATCATTTTATTTTTCTTCCTCTTTTGTTATACTTCGATTTTTCTCATCCACTATTCACCGTCCAAAGCTGACATATACCACATTTCAGGGCAATTCCATACAGAAAGCTTCGACCTGCAGCATACAGGCTTTACCTGGAACTTCTAAGCAGACACGTCTTTTCATCAACATTGGAAGTCAATGGACCCAATTACCCAAGGTAGATTTCATTGCAGTTTACTCGTGTCATTATAGTGGTTAGCCACATTCTCTTTTCATTCTGTTTAACCATTTTTCTTTCTCGATGGTAAACCTAGAGGAAAGCTTTTGCCTTGAGAGGCATTTGGCTGTATAACTCTAAGCAAAGCCAACCACGTCCACATAATGTTCCTTCCTGGAGAAAGACACTAGCTAGGAAGTCTATTTTGAGTTTTATCATGAGTATTGATACCCGCGTGGGGCAAGCTCCAATTAAAAGAAGAAGTTCACTTTGTTGAGGTTTTTAGAGTGCTGGATCACCTTATAAAGGCTAGGGTTCAACAAGTTTCTTTTATTGACAATATCCTTTAGCACGGAATTCCTAGATAGAAATCCTGTTAACATCAAGCTTATCTCTTCAATCTTTCTAATTATCTTCCTTTATGGTTGTTTTGAATTTTCGATTCCTTTTTTTTTTATATATAAAAAAATACGTATTAAAAAATAATGGGAGAAATGCCTCAACCTTATTGCACCATCATTTGTTTCATCACCCACCCCCCAGATTTTTATTTTATGCAACTGGAGGAAAAGGCTTCATGAATGATGTTATGCCATTTCAGGATCATGCAAACCATCGATGACGTCCTTCATCTGTCCCAGATATTTGGTCTCCAGACATGTGAACCTGGGCTACTTATGGTTGAATTATTCTTTTCAATTGTATGGCATTTGCTTGATGCATCATTGGATGATGAAGGATTGCTGGAACTTCCTGCAGAAGAAAGATCAGTGTGGCTAATCAGGCCACAACCACATGATATGGAACTAGATGTTCATGATTCTTTTGGTGAGAAGAAAACTGAGAACAGTGAAAATCTGCTTAAAGTGAACACTGCAAAAGCTATTGAGATTATTGGGCAGTTCCTGCAAAATAAGAAAACTGCAAGGATTTTGTGCTTGGCCCATCAAAATATGTAAGAAGTATAATTCTTCCTAACCTTGTTTGTCTTCTCTCTCCGTCATCTTTGATTTCTGAAGGAAAGAAAAGCGTATGGATTTTAAGAATGGCCACATTTCATTGAGACTACCACTAGTTATTGGAATAACGTATGCTGCTTTAATGTTAAATTAATACAAAATTTTTAGAGCTCAATGAAAATGATTAACATAATTCTGCTTTCTGGGTCTTAGGCCATTGCACTGGGCAGGTTTTGCCCAGCGGTTACAACTACTTGCAGCAAACTCAGTAGTTTTGAGGAACACAAAACTAATAACTCCAGAGGTCCTTCTGCACTGGACATCCGATAAACATAGGTTTTTATCACAAGAAGGAAAAACAACATCTCAGTTAGAGTTCCATGATGTAATGGCTTCTGGATCGCTCTTTTCTTCCGCTGGTCAATCTCATGGCGTTAATTGGTCTGCATTGTGGCTTCCCATTGATTTGTTCCTGGAGGATGCCATGGATGGATCACAAGTTCTGGCAACTAGTGCTGTTGAGCGTCTGATTTGTATGAGCTTACAATTTTTATAGTGCTTGAGTGACTCTTCAATTAAATTGAAATTCTAAGTTTCTATAAGAGTTCTATTGATCCAGAGTTATCTGCAGGCTTGATAAAATCTTTGCGGGCAGTTAATGATGCCTCCTGGCACAATACATTTTTGGGTTTGTGGATTGCAGCGTTGCGACTTATTCAAAGGGTAGGATCAATTAGCGTTTTGATAAATGAATCATGTTTATCTGTTTATATCCTCTTTATGTTCAGATATTAACTTTGTCCAAAATGATTTCATCCGCACAATTTACTTAATTAGCAGGTCCATTGCCTTGAATTGAGAGATTTAAAGTGATTTCTGTTCTATGATAATTGCATGGTAACTTTATGCTGGCTTATAAAGTTTATACTATGTTCTTCTTTCTCCTTTATATTTTAACTCCAAGATTGGGGGAGATGGATAAAAAAAGCAGCATTGAAGTTCAATCTTTTAATAAGCCTGACTCATGTGGCAGTTTTACTCTGTTTGTGACAATAATCTACTTAAAATACGGTCTCCTTCTGTTGAATTGGGGAAACGTACCTTGCAAGAAAATTAAGCATCCATCCCTCTTTTCACTTATACTCCTCTTTATTATTATTTGAGTATTTATTACCTTTAGGTAAAAAAAACTCAGGTTTAATCTGCCTACATCTGGGGGGCATTTAACTTATTCCAACCTGCTTATATTTGCTAATTCAAATTTCCATTTTAGGAAAGGGATCCAAGTGAGGGTCCTGTGCCTCGTTTGGATACATGCTTGTGCATGTTATTGTCAATTACAACCCTTGCAGTCACCATTATTATTGAAGAAGAGGAAGGTGAACTAAAGGAGGAGGATGAATGCAGCCCAAGTAAAAGTAGAGATGAGAAGCAGTCTTCAGGAAAGCGCCGACAAGGTTTGATTACGAGCTTGCAGATGTTGGGTGAATATGAGAGCTTGCTGACTCCTCCTCAATCCGTTATTGTAGTAGCCAATCAGGCTGCCGCAAAAGCAGTAATGTTCATATCAGGAGTTGCAGTTGGTAATGAGTACTATGACTGTGTTAGTATGAATGACACACCTATTAATTGTTGTAAGTACTTGTTCTTATTTCAGAATTTAACATTGGTTGTATGGATTCTAATGAGCAATGGTCTCTAGAATATTTAAATTGGTGAATATTTGCGTTCTTTTGGAACTTCTAAACATAGTCAATTGACCTGTTCTATCTCCATTCATCTGCAAATGCTAACCCTTAGATTCATCTGCAAATGATAACCTTTAGATTCATGCTCAATTTCTCAAGATTTCCTGGGAGTACAATTCTGTTAGAACAGTCAAATCTTTTATAAAAGGACATTATTGACTTCATGCTAAAAAATTGCTTGGTTCAGTGGTTGACATTTATTCATCAACAAGTGCAAAATTTTGGATTGACGTTCTTCAAGAATCATTATTACTGGAATTCTGCTTGAACAATTTATCTTCAATGAAGCACATTATTGAACGTATCCCCAAAATTATTTTATTCTGTGGACAAACTAATTCTAGATGTGGCATAGGATTCTGGCGTGCAAATCTACATAGTATAGTAGTCTTTGTTATCCTGTACATACATCTTAAATTTTAATTTACAATGGTTCTTTTGATCCTTGAAGATTTCTCTCTCACTCTCTCTCTCTCTCTCTCACACACACACACACACATTGTTGCATACATATGCACATCCGAAATGTATTCTCTTGTGTGGCTAATTTGTTGCTTTTTCTTGTTGCAGCTGGAAATATGCGGCATCTGATTGTTGAGGCTTGTATTTCTAGGAATCTTCTAGATACATCGGCATATTTTTGGCCAGGCTATGTAAATACACGCAGCAGTCAAGTGCCTCGTAGTGCATCTAGTCAGGTGGTTGGTTGGTCATCATTCATGAAAGGGTCGTCCCTAACTCCGTCAATGGTGAATGCTTTAGAGGCAACTCCAGCTTCTAGGTATGCTCCCACAAGTATCTTTTTGTATCATGTAATGCCATATTGAGGGAAGATGATTAATTAAAGGACCAAGTTCAGTACCTAGGTTCTGTATTTTGTTTCCCAAAAAAGTGAATACAGATGCAGCGACTATGTGTTGCAGACAGGGAAGCTGGAAATAGATTTGGATAGCTTCTTGAAGTTATGAAAATTAGGAAGGTGGGGAAAGCTGGGTATTCTAAGCAGGTTTAATAGAAGGGGTTGGAGAGAGGTGCTAAATTTTGTTCCCTATTATCCTTCTTTTTGTCAGTTAGGGGGCAAAATATAAGGGGTCACTGAGAGGAATGACAAGAGGGTGAAAATGGGTAATTCATGGTCCAATCAACGGTTGGGTAGACGAAATATTAGTGATTCAGGACGAAATATTAGTGATTCAGGAAGGTGGCTGACGGGGGGAAACTAGAGATGAGATGAACATAGTAAGTAAGATGGATTTATAAACGATTAGATTGATTTCTAACAAGGAGTTTTTTTCCCGGATGAGAAACAAGAGAATATCATTCAAGAGGCAAAAGCCAAAACAAAACAACTTAAGGGACAAGGGAATAAGGCATCCCCTCTTCCGAAGGAAAGAAGTTGATGGTTTCTATCGAGTAGTTACAAGACAACCTGTGTTGCGCACTCCAAGTAGAGGCCATAAACTGTACGAGGTCACAAAAAGAAAGAAGAGAGCTCGACTTATCTGTAAACACTCTAGAATTTCTCTCTTTCCAAAGAAGCCAAGGAATAGCCCTATCAAGATTCATCCAAAGCACTCCAGCTTTGTATTTCAGCCACCACTCAGAGAATTCCAACCAACCACAAAACTATACCACGCAGCTTACACCCCTGCAACCCAAACATTTGACCGAATAAATTCCACACTTTTCCAGCGAACTCACGGTGCATCAGCAGATGGGAAGAAGACTCCTCATTTTTACAGCACATAACACACATATTGGGGGAGATACACAGGTTGGGGTACCTCCGTTAAACTTTGTCTAGAGTATTTGAGGTCCGTAGTCCACAAAAAAAATGAACTTAGGAATCATCCGTTCCCAAAGGTGAGTACACTATGCTTCATCTAACACTCTTCTCTTTTCTTGAAGAGCAACCAAGGCCAAAGGAACTGTAAATATACCAGACTTATCCAGCTTCGAAAGCAAAGTATCATCCCTTTGAGTGGATTATCTAGTTTCCAATAGCAAGCCTAGCTCACCACCCCTCCAACTCTCTGTCAGAAACCACTTCTCAAACCAAGATCCCAATCGTTACTGTACACAGTATTGAGAGAATAAAGGACAACTATATACAATAACATATATCATAACCATCCCTCTCAAGCGGGAGCAAATATGTCGATCATGCTCAGCTTGTTGGAGTGATGATTTTCTTGAACCGTTTAAGGCTTTAGTGAGAATATCCCTCAGTTATTCTCCTGTCTTCACATATCAGGTGGACAGCAACCCTGGTTGTATTTTCTCACAAATAAAATGACATTCAATATGTTTAGTTCACTCATGTAACAATGGAATAGATGCAAATTGAAGTGCAGCTTGATTATCTCACCACAATTTTGCTGGTATTGTATACTAAAGCCTATCTCAGTTAACAATTGATATATCCACACTATTTCACATATAGTCTGTGCCACAACTCAATATTATGACTCAACACTTGATTATGAAACCACGTTCTGCTTCTTACTCTTCCACGACACTAAATTTCCTCCTACGAAGACAATATTATGACTCAAAGCTTGTGATTTTCTTGGGAGCTTATATCTTTGAGCATTAGTCTCTTTTCATCTCTTCAATAAAAGTTTGGTTCTTGTTAGGAAAGGAAAAAAAAAAAAAAAAAGAATACAAGGGAAGGAAGAAAAATTAAGGGAACAAATATGTGAACAAAATAAAAGAGAGAATAACTATGGAATATTTCCACAAGGGTCAATAATCAACAAAATTATTTGTTGAACAAAATCTTTGGGAAACGTACATGTTGTCACCGTAATACATTTATGGAGTTTTATCACATTCATAAGCGTACATCAGATGCGTTGCAATTTTACCCAGTAGTATGCCAATTAATTTTAACTTAAAGCTCATGGATTTGATGTAAGCTTAGCGGAGATTGAGAAGATCTATGAGATTGCCATAAATGGTTCAGGTGACGAGAAGATATCTGCAGCTTCCATTTTGTGTGGGGCATCACTTGTTCGAGGCTGGAATCTACAGGTGAATGACAGTTGATGCCTCTGATTTGTATGGCACTGTGTGAAGGTTTTTTCCCCAATATTTTTGTAAAAGATTAATTTTGCATCTGTTCTCCACTATTAACATCTGGGATTATTTTAACCCATTTTCTTCTAGGAACACACTGTTCTATTTATATCCAGATTATTGTCGCCACCAATTCCTGCAGATTACCCTGGAAGTGATAGCTATTTGATCGACTATGCCCCATTTCTGAATGTTCTACTGGTTGGAATATCATCAGTTGATTGCGTGCAGATTTTTTCCTTGCATGGCATGGTAAGAATGTTAACCTTGCTTATTTGACCCTGCTGATAAGCATCTGAATCTGAAGTTACCTTTTTATATAATAGATTCTGTGTGCAAGCTTGGTAGAGGTTTTATATATGAATTTACTCTTCATTTATAGCCATTATAAATTAAGTATGAGAGCTGATTTGTTTCGGAGAAGTTTTGAAAGGGTTTGATGCCTGGCTTTTGATAGGTGGGACGGAGGCCTATTGACTAAGAGAAAATTCCTTATAAGCCCATTCAAATAATCATTAGATTCAATTTTTGTTTAATAATCTATATCCTTCGGCCGACCTGAAAATAGTTCAAATGACATGAAGTATATACTTTTTGACTAGGAGACTAGAGGTTTGAATCCCACCCTATTGTTGAACTCAAAAAANCCTGCTTCAATGATTCTGAACATACTTGATTTTTGACATGGAAAAGAATTCGGTTTGAACAGAGTAATTGAGACCTTATTTTCATTGTACTATGAGCTTATTTTTCCCCCCTTTTTTTTGCTGTTTGATGCCAGGTTCCTCTACTTGCAGGTCAATTAATGCCAATCTGCGAAGCTTTTGGATCAAGTACCCCCAAGTCATGGATCCTTACATCTGGGGAAGAACTTACTTGTCATGCAGTGTTCTCCTTGGCTTTTACACTTCTATTGAGGTTGTGGCGGTTTCATCACCCACCTATTGAAAATGTGAAGGGAGATGCACGGCCTGTGGGATCTCAGCTAACTCCTGAATATCTGCTATTGGTTCGAAATTCTCAGTTAGCATCTTTTGGAAAGTCACCCAAGGATCGACTTAAAGTCAGACGGCTGTCAAAATTGTTGAAATTTTCTTTAGAACCTACATTCATGGATTCCTTTCCAAAATTGAAAGGGTGGTACCGGCAACATCAAGAATGCATTGCTTCCATTCCCCCTGGTCTTGTACCTGGGGCCCCTGTTCATCAAACTGTTGATGCTCTCTTGACCATGATGTTCAAGAAGATAAATCGTGGTGGTCAGTCTTTGACTTCAACTACTTCAGCAAGCAGCAACTCATCTGGATCTGCAAATGAAGAGGCCTCCATTAAGCTTAAAGTGCCTGCATGGGACATCCTTGAAGCAACTCCCTTCGTTCTCGATGCTGCTCTTACTGCCTGTGCTCATGGACGATTGTCTCCCCGTGATTTGGCTACAGGCAAGTCTGAAACACTGTTTTAATGAGAAACAAGAAAGGTTAATCTATTACTTAGAATATAAAATTCTGCATGTGGTTTATTTTTTTGAGAACTTATGAATTCTTTTTGGACTAATACCATTCTTGATGTTTAAATATAGAACTTACCTTCATTTATTGTGATTTGTCATTAAATTAAATGTTTAAATATTACATGTCTAAGTAGATTATTAATAACAAAAAATGAAAAATGTTAGGAACCTGCTCTCTCTCTCTCTCTCTCCCTTCTGTTAGAATATTACAAAGAACCCTTTCACTTAATTGGATTATTAATACCTTCAATTCCCGTTGCTCTCTTCCCCTCCTTCCCCACCCTTGCTTCATGCAAATCTACCTTATTAATACAAAAAGTCTCTCTTATTATTTGTGGATTATAAATCTAATTTGCTTCCCCTTTTGTTTTGCAATATTAATAGAGGTTTGGTCTAACGTAGAAATAGGTAGCTCATCTAAAGGAGGAGCTAAGACCTTGGTTACAGTGGGGATAAGTGATGTGATACAATTAGACTCAACTTCCAGTAGCTCAATTTCCCTCCCATTCTCCCCCAAATTTGGGACTTGATGAAAATGGACTAATTTTAAAAAAAGTAAACATCGATAGATGTTTACATTTTTCTAGTGGGTGGGGAGTAACATTTTTATTCTTTTTGGTGTGATGTAACCCGGAAAAATGCACCTGATGAATTTTGGATCAAGTTTGGGCTGATGAGGAGGATTGTGAACTGAAGAAAAATACACAAATTCTCGAAGAGGTAGTTTTAAGGAAAATTTTTGGAGTTGGGGATGGATTTGGAGGAAGACATTTCTAGGAGATTGGAATTTGAGTACTCGTCATGACATTCGATATAAAGGAAGGTTGCAGCGAGAATGAATTCTCCCCAAAAATGATTGGAACATTACTGGGAAGTATAAGAGAACGAGTTAATCAAGAAGTTGTCCTTTTTTTTTTCCCCAGTAATTCCATTTTGTTAAGGAATGCCCACCCATGAACAATATAAACAATACCATGAGATGAAAATGAGCCAAGAGTTGAATTGACATCGTCTCGAGCATTGTCTCTTCTAAGAACTTTGTGAATTTGTGTACTGCGGGCTTCTTGATGAATCCATATTTGTACCCTTGTGACGGCTTATAGAAAATATATTTCTTCTCAGGATCAATATAGGAACAATCTGTTGTTGGTTTGCTTGGTAAGTTCTTTACCTCATTGTTTCAGGACTCAAAGACCTTGCTGATTTTCTACCGGCATCCTTTGCTACTATTGTGTGCTACTTTTCAGCTGAAGTTACACGTGGTATATGGAAGCCAGCATTCATGAACGGAACTGATTGGCCTAGTCCTGCTGCAACTTTGTCCGTTGTTGAGCAACAGATTAAAAAGATTCTTGCTGCAACTGGTGTTGATGTCCCTAGTCTTGCTCTAGGTAAATTCTTGATGCGCACTAATGCCGTTTTATATATATATATATATATATATATATATTTTATTAGAACATCTGGAAACTATTTTCCTACTTTTATCAATTTCATTGAGTTAGTATTTGGTTTCTATACTTCTTTTTTAGAAATTAAAACGAAGGTCAAGATAAGAAAGCCCTCCCAACCATTAGAATGGAGATTGAAGAAATGCTCTCCAATTGATTTGAATAAAGGAGGATGGTGATAAAACTTTCATAAAGAAAATAAATGTTACGAAGAGCACCTCTATATTCTCTCTACCTGGTTAGGTAGGTGCATTTTTCATCTTCCCCACCCCATATATTCTCTCTACTTGGTTAGGTAGGTGCCAAAGCACAGCCTGCTGTTGGTCCGCAAAATCTGCCTTCTACACTTTATGAGTTAGCCGAACAACAATCAGAAAAAAAAGTTTATCGTCCTGGTGGGGAAAAGATATTTAGTAAGCCAAATATCTCCATTATCAACTCTCTAGAAAACTCAAGAGTCTCAGCCGAAGTAATAGAAGCAATTTCCTAAGCGTCTAGGATATATTCTTCATACAAATATATTAATCTGAATACGTCTCACGGAAGACGATAGAAATGGTAAGAATGATAGCGACTTCATATCAAAGAATTTTTCAAGACAAGTCATTAGAGTAACGTCGACGCTTTTTGTACAAATATTCAGTTTTTATCTTCTTAGTGGTGCCATAGCAACAACAGTTTACTTCCTAACTGCCTCGTATCTTTGATTCTTCTTGATAGGAAAGCTTTTTGTAAATAGTCTTTTGGATTGGTGGGTGGGGGAGGGGGGATCTCCTGCCCTTAGATTGTTGCTGTTTTTTGGTCTTTGCAACATATTCACAGGTTTCTTATCAAGAGGAAAAAAAAGAAAGAAAAGAATGACAGTGACTTTGCACTGTTTGAAATGTGATGAAGCTGCAACACATTCACTTGATCTCTCACCAGAGTAACTCTTGAAAAACATAAAAAATTGAATTGGTATTGCATTTTTCATGTGTTTTCTGTTCAACCTAAAATTGAGCTTGAATTGCCTATTTGCAGGAGGAAGTTTTCCAGCTATGCTTCCTTTACCCTTGGCTGCCCTAATAAGCCTCACAATAACCTATAAACTGGATAAAGCCTCTGAACGCCTTCTTGCCCTCGTTGGCCCCGCGCTAAATTCGCTCGTTGCTGGTTGTTCGTGGCCTTGCACACCTATTATAGCCTCATTGTGGGCTCAGAAAGTGAAGCGATGGAATGACTTCCTTGTGTTCTCTGCTTCTCGCACTGTTTTTCACCATAATAGTGATGCTGTTGTCCAGCTGCTTAAGAGTTGTTTCACTTCGACTCTGGGTTTAGGCAACTCCAACGTAAACAGCGGTGGAGGTGTAGGCGCACTCCTCGGTCACGGTTTCGGTTCTCACGTTTTAGGAGGGATGTCTCCAGTGGCTCCTGGGATTCTCTATCTCCGTGTGCATCGATGTGTTAGAGATGCTTTGTTCTTGGTGGAGGAGATTGTCTCTCTTTTAATGCTCTCTGTCAAAGACATTGCAGTTACTGGGCTACCAAAGGAGAAGGCTGAAAAACTAAAGAAGTCCAAGCATGGAATGAGATGTGAACAGGTTTCTTTTGCTTCTGCAATGGCACGCGTTAAACTTGCAGCTTCTCTTGGAGCTTCATTAGTTTGGATATCCGGTGGATCAGGCTTGGTCCAATCTTTGTATAAAGAAACCTTGCCGTCTTGGTTTTTATCAGTCCATTCAGTAGACCGTGAAGGTGTAGAATATGGAGGTATGGTTCCTGTGCTTAGGGGCTATGCACTTGCATTCTTTTCAGTACTTTGTGGGACGTTCTCCTGGGGCATAGACTCGATATCATCAGCGTCGAAGAGGCGTGCAAAGCTTCTCGACGCCCACCTCGAATTTCTTGCGAGCGCATTGGATGGAAAATTCTCCATTGGGTGTGATTGGGCTACATGGCGGGCTTATGTTTCTGGGTTCGTGAGCTTGCTGGTGCGTTGTGCACCGAAGTGGTTGCTCGAGGTGGATTTGAAGGTGTTGGAGAGGTTGGGCAAAGGATTAAGGCAGTTGAACGAGGAGGAATTGGCTCTTGCATTGCTGGAAAGTGGTGGGTTGACTGCAATGGGTGCAGCAGCGGAACTTATTATTGAAGGAGGATTTTAAATGTTACAGACTTAACATGAAAAGTGAACCATTTTTGTTTAGAAAAACTGGAACCATCAACTGACGTAGGTTTCCCTTAGCATCATATCCCCAGTAATGCAACGTTCATGTACACAAATGGTTCTTACCGCTGCTCTGCATAGGCATATTAGCCGCCGTTTTTTTTTTGGCTAAAAATCTGAAGTAATGATATTCAAAATTTTAGCATGTAATTCATGTATCATTAAATTTGTGAAAAATATGTAACAATTTAGGCCACATCCTATTAAATTTGTATATTTTAGTTTAACAGTTTTGACCTCGAATAGATCAAGAATGTAAGTTCAAAAAAACTTAATAGCAATTAGATTTAATCATATAAAATAGAAAGTTTGACCATTGAATTGTACACGAGGTTGATTGAATTGTACACGAGGTTGAGGTGTTGTGATGTCCCATATTGGTTGGGGAGGAGATACGAGGTTGAGGTGTTGTGATGTCCCATATTGGTTGGAGAGGAGATACGAGGTTGAG

mRNA sequence

TGAACCAGGGTTCAAATCTAATTTGATTTCTTCAATTACTTAAACCAGGGTTTTCTTCTTCCATCATTTTACGCGCCGCGAACAATTTCATGTACCTCTAAATTTTGGACATACACGAGAATGAACCCCAAATGTTTTCTCAAATAACTTCAATCTACTCACCCTGAATCTCTTTTCCATCCATTATTCCCTTTCCCTGAGCAAGAAATTCTGTGGGTCAGATTCTCCCATGGCGGTTTCCGCTCAACCGCCCGGTCAACTGCAGGGGATCGCTGGTGTTTGGGACACTGTGTTGGAGCTTACCAAGTCGGCGCAGGAGAAGAACAGCGATCCACTGCTTTGGGCGGTTCAGCTCAGCTCCAGCCTCAATTCGGCTAGTGTTTCCTTGCCGTCTGTCGAGCTCGCCCACCTCTTGGTCTCTCATATTTGTTGGGACAATCACGTTCCGATCATGTGGAAATTCCTTGAGAAGGCAATGACCGCCAGAATCGTTCCTCCCCTGCTGGTTATTGCTCTTCTTTCTACCAGGGCAATTCCATACAGAAAGCTTCGACCTGCAGCATACAGGCTTTACCTGGAACTTCTAAGCAGACACGTCTTTTCATCAACATTGGAAGTCAATGGACCCAATTACCCAAGGATCATGCAAACCATCGATGACGTCCTTCATCTGTCCCAGATATTTGGTCTCCAGACATGTGAACCTGGGCTACTTATGGTTGAATTATTCTTTTCAATTGTATGGCATTTGCTTGATGCATCATTGGATGATGAAGGATTGCTGGAACTTCCTGCAGAAGAAAGATCAGTGTGGCTAATCAGGCCACAACCACATGATATGGAACTAGATGTTCATGATTCTTTTGGTGAGAAGAAAACTGAGAACAGTGAAAATCTGCTTAAAGTGAACACTGCAAAAGCTATTGAGATTATTGGGCAGTTCCTGCAAAATAAGAAAACTGCAAGGATTTTGTGCTTGGCCCATCAAAATATGCCATTGCACTGGGCAGGTTTTGCCCAGCGGTTACAACTACTTGCAGCAAACTCAGTAGTTTTGAGGAACACAAAACTAATAACTCCAGAGGTCCTTCTGCACTGGACATCCGATAAACATAGGTTTTTATCACAAGAAGGAAAAACAACATCTCAGTTAGAGTTCCATGATGTAATGGCTTCTGGATCGCTCTTTTCTTCCGCTGGTCAATCTCATGGCGTTAATTGGTCTGCATTGTGGCTTCCCATTGATTTGTTCCTGGAGGATGCCATGGATGGATCACAAGTTCTGGCAACTAGTGCTGTTGAGCGTCTGATTTGCTTGATAAAATCTTTGCGGGCAGTTAATGATGCCTCCTGGCACAATACATTTTTGGGTTTGTGGATTGCAGCGTTGCGACTTATTCAAAGGGAAAGGGATCCAAGTGAGGGTCCTGTGCCTCGTTTGGATACATGCTTGTGCATGTTATTGTCAATTACAACCCTTGCAGTCACCATTATTATTGAAGAAGAGGAAGGTGAACTAAAGGAGGAGGATGAATGCAGCCCAAGTAAAAGTAGAGATGAGAAGCAGTCTTCAGGAAAGCGCCGACAAGGTTTGATTACGAGCTTGCAGATGTTGGGTGAATATGAGAGCTTGCTGACTCCTCCTCAATCCGTTATTGTAGTAGCCAATCAGGCTGCCGCAAAAGCAGTAATGTTCATATCAGGAGTTGCAGTTGGTAATGAGTACTATGACTGTGTTAGTATGAATGACACACCTATTAATTGTTGCTATGTAAATACACGCAGCAGTCAAGTGCCTCGTAGTGCATCTAGTCAGGTGGTTGGTTGGTCATCATTCATGAAAGGGTCGTCCCTAACTCCGTCAATGGTGAATGCTTTAGAGGCAACTCCAGCTTCTAGCTTAGCGGAGATTGAGAAGATCTATGAGATTGCCATAAATGGTTCAGGTGACGAGAAGATATCTGCAGCTTCCATTTTGTGTGGGGCATCACTTGTTCGAGGCTGGAATCTACAGGAACACACTGTTCTATTTATATCCAGATTATTGTCGCCACCAATTCCTGCAGATTACCCTGGAAGTGATAGCTATTTGATCGACTATGCCCCATTTCTGAATGTTCTACTGGTTGGAATATCATCAGTTGATTGCGTGCAGATTTTTTCCTTGCATGGCATGGTTCCTCTACTTGCAGGTCAATTAATGCCAATCTGCGAAGCTTTTGGATCAAGTACCCCCAAGTCATGGATCCTTACATCTGGGGAAGAACTTACTTGTCATGCAGTGTTCTCCTTGGCTTTTACACTTCTATTGAGGTTGTGGCGGTTTCATCACCCACCTATTGAAAATGTGAAGGGAGATGCACGGCCTGTGGGATCTCAGCTAACTCCTGAATATCTGCTATTGGTTCGAAATTCTCAGTTAGCATCTTTTGGAAAGTCACCCAAGGATCGACTTAAAGTCAGACGGCTGTCAAAATTGTTGAAATTTTCTTTAGAACCTACATTCATGGATTCCTTTCCAAAATTGAAAGGGTGGTACCGGCAACATCAAGAATGCATTGCTTCCATTCCCCCTGGTCTTGTACCTGGGGCCCCTGTTCATCAAACTGTTGATGCTCTCTTGACCATGATGTTCAAGAAGATAAATCGTGGTGGTCAGTCTTTGACTTCAACTACTTCAGCAAGCAGCAACTCATCTGGATCTGCAAATGAAGAGGCCTCCATTAAGCTTAAAGTGCCTGCATGGGACATCCTTGAAGCAACTCCCTTCGTTCTCGATGCTGCTCTTACTGCCTGTGCTCATGGACGATTGTCTCCCCGACTCAAAGACCTTGCTGATTTTCTACCGGCATCCTTTGCTACTATTGTGTGCTACTTTTCAGCTGAAGTTACACGTGGTATATGGAAGCCAGCATTCATGAACGGAACTGATTGGCCTAGTCCTGCTGCAACTTTGTCCGTTGTTGAGCAACAGATTAAAAAGATTCTTGCTGCAACTGGTGTTGATGTCCCTAGTCTTGCTCTAGGAGGAAGTTTTCCAGCTATGCTTCCTTTACCCTTGGCTGCCCTAATAAGCCTCACAATAACCTATAAACTGGATAAAGCCTCTGAACGCCTTCTTGCCCTCGTTGGCCCCGCGCTAAATTCGCTCGTTGCTGGTTGTTCGTGGCCTTGCACACCTATTATAGCCTCATTGTGGGCTCAGAAAGTGAAGCGATGGAATGACTTCCTTGTGTTCTCTGCTTCTCGCACTGTTTTTCACCATAATAGTGATGCTGTTGTCCAGCTGCTTAAGAGTTGTTTCACTTCGACTCTGGGTTTAGGCAACTCCAACGTAAACAGCGGTGGAGGTGTAGGCGCACTCCTCGGTCACGGTTTCGGTTCTCACGTTTTAGGAGGGATGTCTCCAGTGGCTCCTGGGATTCTCTATCTCCGTGTGCATCGATGTGTTAGAGATGCTTTGTTCTTGGTGGAGGAGATTGTCTCTCTTTTAATGCTCTCTGTCAAAGACATTGCAGTTACTGGGCTACCAAAGGAGAAGGCTGAAAAACTAAAGAAGTCCAAGCATGGAATGAGATGTGAACAGGTTTCTTTTGCTTCTGCAATGGCACGCGTTAAACTTGCAGCTTCTCTTGGAGCTTCATTAGTTTGGATATCCGGTGGATCAGGCTTGGTCCAATCTTTGTATAAAGAAACCTTGCCGTCTTGGTTTTTATCAGTCCATTCAGTAGACCGTGAAGGTGTAGAATATGGAGGTATGGTTCCTGTGCTTAGGGGCTATGCACTTGCATTCTTTTCAGTACTTTGTGGGACGTTCTCCTGGGGCATAGACTCGATATCATCAGCGTCGAAGAGGCGTGCAAAGCTTCTCGACGCCCACCTCGAATTTCTTGCGAGCGCATTGGATGGAAAATTCTCCATTGGGTGTGATTGGGCTACATGGCGGGCTTATGTTTCTGGGTTCGTGAGCTTGCTGGTGCGTTGTGCACCGAAGTGGTTGCTCGAGGTGGATTTGAAGGTGTTGGAGAGGTTGGGCAAAGGATTAAGGCAGTTGAACGAGGAGGAATTGGCTCTTGCATTGCTGGAAAGTGGTGGGTTGACTGCAATGGGTGCAGCAGCGGAACTTATTATTGAAGGAGGATTTTAAATGTTACAGACTTAACATGAAAAGTGAACCATTTTTGTTTAGAAAAACTGGAACCATCAACTGACGTAGGTTTCCCTTAGCATCATATCCCCAGTAATGCAACGTTCATGTACACAAATGGTTCTTACCGCTGCTCTGCATAGGCATATTAGCCGCCGTTTTTTTTTTGGCTAAAAATCTGAAGTAATGATATTCAAAATTTTAGCATGTAATTCATGTATCATTAAATTTGTGAAAAATATGTAACAATTTAGGCCACATCCTATTAAATTTGTATATTTTAGTTTAACAGTTTTGACCTCGAATAGATCAAGAATGTAAGTTCAAAAAAACTTAATAGCAATTAGATTTAATCATATAAAATAGAAAGTTTGACCATTGAATTGTACACGAGGTTGAGGTGTTGTGATGTCCCATATTGGTTGGAGAGGAGATACGAGGTTGAG

Coding sequence (CDS)

ATGGCGGTTTCCGCTCAACCGCCCGGTCAACTGCAGGGGATCGCTGGTGTTTGGGACACTGTGTTGGAGCTTACCAAGTCGGCGCAGGAGAAGAACAGCGATCCACTGCTTTGGGCGGTTCAGCTCAGCTCCAGCCTCAATTCGGCTAGTGTTTCCTTGCCGTCTGTCGAGCTCGCCCACCTCTTGGTCTCTCATATTTGTTGGGACAATCACGTTCCGATCATGTGGAAATTCCTTGAGAAGGCAATGACCGCCAGAATCGTTCCTCCCCTGCTGGTTATTGCTCTTCTTTCTACCAGGGCAATTCCATACAGAAAGCTTCGACCTGCAGCATACAGGCTTTACCTGGAACTTCTAAGCAGACACGTCTTTTCATCAACATTGGAAGTCAATGGACCCAATTACCCAAGGATCATGCAAACCATCGATGACGTCCTTCATCTGTCCCAGATATTTGGTCTCCAGACATGTGAACCTGGGCTACTTATGGTTGAATTATTCTTTTCAATTGTATGGCATTTGCTTGATGCATCATTGGATGATGAAGGATTGCTGGAACTTCCTGCAGAAGAAAGATCAGTGTGGCTAATCAGGCCACAACCACATGATATGGAACTAGATGTTCATGATTCTTTTGGTGAGAAGAAAACTGAGAACAGTGAAAATCTGCTTAAAGTGAACACTGCAAAAGCTATTGAGATTATTGGGCAGTTCCTGCAAAATAAGAAAACTGCAAGGATTTTGTGCTTGGCCCATCAAAATATGCCATTGCACTGGGCAGGTTTTGCCCAGCGGTTACAACTACTTGCAGCAAACTCAGTAGTTTTGAGGAACACAAAACTAATAACTCCAGAGGTCCTTCTGCACTGGACATCCGATAAACATAGGTTTTTATCACAAGAAGGAAAAACAACATCTCAGTTAGAGTTCCATGATGTAATGGCTTCTGGATCGCTCTTTTCTTCCGCTGGTCAATCTCATGGCGTTAATTGGTCTGCATTGTGGCTTCCCATTGATTTGTTCCTGGAGGATGCCATGGATGGATCACAAGTTCTGGCAACTAGTGCTGTTGAGCGTCTGATTTGCTTGATAAAATCTTTGCGGGCAGTTAATGATGCCTCCTGGCACAATACATTTTTGGGTTTGTGGATTGCAGCGTTGCGACTTATTCAAAGGGAAAGGGATCCAAGTGAGGGTCCTGTGCCTCGTTTGGATACATGCTTGTGCATGTTATTGTCAATTACAACCCTTGCAGTCACCATTATTATTGAAGAAGAGGAAGGTGAACTAAAGGAGGAGGATGAATGCAGCCCAAGTAAAAGTAGAGATGAGAAGCAGTCTTCAGGAAAGCGCCGACAAGGTTTGATTACGAGCTTGCAGATGTTGGGTGAATATGAGAGCTTGCTGACTCCTCCTCAATCCGTTATTGTAGTAGCCAATCAGGCTGCCGCAAAAGCAGTAATGTTCATATCAGGAGTTGCAGTTGGTAATGAGTACTATGACTGTGTTAGTATGAATGACACACCTATTAATTGTTGCTATGTAAATACACGCAGCAGTCAAGTGCCTCGTAGTGCATCTAGTCAGGTGGTTGGTTGGTCATCATTCATGAAAGGGTCGTCCCTAACTCCGTCAATGGTGAATGCTTTAGAGGCAACTCCAGCTTCTAGCTTAGCGGAGATTGAGAAGATCTATGAGATTGCCATAAATGGTTCAGGTGACGAGAAGATATCTGCAGCTTCCATTTTGTGTGGGGCATCACTTGTTCGAGGCTGGAATCTACAGGAACACACTGTTCTATTTATATCCAGATTATTGTCGCCACCAATTCCTGCAGATTACCCTGGAAGTGATAGCTATTTGATCGACTATGCCCCATTTCTGAATGTTCTACTGGTTGGAATATCATCAGTTGATTGCGTGCAGATTTTTTCCTTGCATGGCATGGTTCCTCTACTTGCAGGTCAATTAATGCCAATCTGCGAAGCTTTTGGATCAAGTACCCCCAAGTCATGGATCCTTACATCTGGGGAAGAACTTACTTGTCATGCAGTGTTCTCCTTGGCTTTTACACTTCTATTGAGGTTGTGGCGGTTTCATCACCCACCTATTGAAAATGTGAAGGGAGATGCACGGCCTGTGGGATCTCAGCTAACTCCTGAATATCTGCTATTGGTTCGAAATTCTCAGTTAGCATCTTTTGGAAAGTCACCCAAGGATCGACTTAAAGTCAGACGGCTGTCAAAATTGTTGAAATTTTCTTTAGAACCTACATTCATGGATTCCTTTCCAAAATTGAAAGGGTGGTACCGGCAACATCAAGAATGCATTGCTTCCATTCCCCCTGGTCTTGTACCTGGGGCCCCTGTTCATCAAACTGTTGATGCTCTCTTGACCATGATGTTCAAGAAGATAAATCGTGGTGGTCAGTCTTTGACTTCAACTACTTCAGCAAGCAGCAACTCATCTGGATCTGCAAATGAAGAGGCCTCCATTAAGCTTAAAGTGCCTGCATGGGACATCCTTGAAGCAACTCCCTTCGTTCTCGATGCTGCTCTTACTGCCTGTGCTCATGGACGATTGTCTCCCCGACTCAAAGACCTTGCTGATTTTCTACCGGCATCCTTTGCTACTATTGTGTGCTACTTTTCAGCTGAAGTTACACGTGGTATATGGAAGCCAGCATTCATGAACGGAACTGATTGGCCTAGTCCTGCTGCAACTTTGTCCGTTGTTGAGCAACAGATTAAAAAGATTCTTGCTGCAACTGGTGTTGATGTCCCTAGTCTTGCTCTAGGAGGAAGTTTTCCAGCTATGCTTCCTTTACCCTTGGCTGCCCTAATAAGCCTCACAATAACCTATAAACTGGATAAAGCCTCTGAACGCCTTCTTGCCCTCGTTGGCCCCGCGCTAAATTCGCTCGTTGCTGGTTGTTCGTGGCCTTGCACACCTATTATAGCCTCATTGTGGGCTCAGAAAGTGAAGCGATGGAATGACTTCCTTGTGTTCTCTGCTTCTCGCACTGTTTTTCACCATAATAGTGATGCTGTTGTCCAGCTGCTTAAGAGTTGTTTCACTTCGACTCTGGGTTTAGGCAACTCCAACGTAAACAGCGGTGGAGGTGTAGGCGCACTCCTCGGTCACGGTTTCGGTTCTCACGTTTTAGGAGGGATGTCTCCAGTGGCTCCTGGGATTCTCTATCTCCGTGTGCATCGATGTGTTAGAGATGCTTTGTTCTTGGTGGAGGAGATTGTCTCTCTTTTAATGCTCTCTGTCAAAGACATTGCAGTTACTGGGCTACCAAAGGAGAAGGCTGAAAAACTAAAGAAGTCCAAGCATGGAATGAGATGTGAACAGGTTTCTTTTGCTTCTGCAATGGCACGCGTTAAACTTGCAGCTTCTCTTGGAGCTTCATTAGTTTGGATATCCGGTGGATCAGGCTTGGTCCAATCTTTGTATAAAGAAACCTTGCCGTCTTGGTTTTTATCAGTCCATTCAGTAGACCGTGAAGGTGTAGAATATGGAGGTATGGTTCCTGTGCTTAGGGGCTATGCACTTGCATTCTTTTCAGTACTTTGTGGGACGTTCTCCTGGGGCATAGACTCGATATCATCAGCGTCGAAGAGGCGTGCAAAGCTTCTCGACGCCCACCTCGAATTTCTTGCGAGCGCATTGGATGGAAAATTCTCCATTGGGTGTGATTGGGCTACATGGCGGGCTTATGTTTCTGGGTTCGTGAGCTTGCTGGTGCGTTGTGCACCGAAGTGGTTGCTCGAGGTGGATTTGAAGGTGTTGGAGAGGTTGGGCAAAGGATTAAGGCAGTTGAACGAGGAGGAATTGGCTCTTGCATTGCTGGAAAGTGGTGGGTTGACTGCAATGGGTGCAGCAGCGGAACTTATTATTGAAGGAGGATTTTAA

Protein sequence

MAVSAQPPGQLQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEVNGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAAKAVMFISGVAVGNEYYDCVSMNDTPINCCYVNTRSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSTPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEPTFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRLKDLADFLPASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGSHVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIGCDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELIIEGGF
BLAST of Cp4.1LG15g00030 vs. Swiss-Prot
Match: MD33B_ARATH (Mediator of RNA polymerase II transcription subunit 33B OS=Arabidopsis thaliana GN=MED33B PE=1 SV=1)

HSP 1 Score: 1429.8 bits (3700), Expect = 0.0e+00
Identity = 782/1320 (59.24%), Postives = 935/1320 (70.83%), Query Frame = 1

Query: 17   VWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDNHVPIMW 76
            +W++V  L +SAQEKN DPL WA+QL  +L SA +SLPS +LA  LV+HI W+NH P+ W
Sbjct: 10   LWESVTSLIRSAQEKNVDPLHWALQLRLTLASAGISLPSPDLAQFLVTHIFWENHSPLSW 69

Query: 77   KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEVNGPNYP 136
            K LEKA++  IVPPLLV+ALLS R IP RKL PAAYRLY+ELL RH FS    +  P Y 
Sbjct: 70   KLLEKAISVNIVPPLLVLALLSPRVIPNRKLHPAAYRLYMELLKRHAFSFMPLIRAPGYH 129

Query: 137  RIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVWL 196
            + M +IDD+LHLS+ FG+Q  EPG +++   FSIVW LLDASLD+EGLLEL + +RS W 
Sbjct: 130  KTMNSIDDILHLSETFGVQDQEPGSILLAFVFSIVWELLDASLDEEGLLELTSNKRSKW- 189

Query: 197  IRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNMP 256
                PHDM+LD             EN +K N               +    L  A+  M 
Sbjct: 190  -PSSPHDMDLD-----------GLENSVKRN---------------ENHDALEKANTEMA 249

Query: 257  LHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMAS 316
            +      + +Q    N V  R         +LH  S      + E KT  + EFH +++S
Sbjct: 250  I------ELIQEFLQNKVTSR---------ILHLASQ-----NMESKTIPRGEFHAIVSS 309

Query: 317  GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASWH 376
            GS  +    S      ALWLPIDLF ED MDG+Q  A SAVE L  L+K+L+A N  SWH
Sbjct: 310  GSKLALTSDS------ALWLPIDLFFEDIMDGTQAAAASAVENLTGLVKALQAANSTSWH 369

Query: 377  NTFLGLWIAALRLIQRE-------------------RDPSEGPVPRLDTCLCMLLSITTL 436
            + FL LW+AALRL+QRE                   RDP EGPVPR DT LC+LLS+T L
Sbjct: 370  DAFLALWLAALRLVQRENLCLRYCFFMHMLEILSEERDPIEGPVPRTDTFLCVLLSVTPL 429

Query: 437  AVTIIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIV 496
            AV  IIEEEE +  ++   SPS    EK+  GK RQGLI SLQ LG+YESLLTPP+SV  
Sbjct: 430  AVANIIEEEESQWIDQTSSSPSNQWKEKK--GKCRQGLINSLQQLGDYESLLTPPRSVQS 489

Query: 497  VANQAAAKAVMFISGVAVGNEYYDCVSMNDTPINCCYVNTRSSQVPRSASSQVV------ 556
            VANQAAAKA+MFISG+   N  Y+  SM+++   CC V      +       V       
Sbjct: 490  VANQAAAKAIMFISGITNSNGSYENTSMSESASGCCKVRFSLFTLKMFVVMGVYLLCNIS 549

Query: 557  GWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRG 616
             WS  MKGS LTPS+ N+L  TPASSLAEIEK+YE+A  GS DEKI+ ASILCGASL RG
Sbjct: 550  CWSLVMKGSPLTPSLTNSLITTPASSLAEIEKMYEVATTGSEDEKIAVASILCGASLFRG 609

Query: 617  WNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVP 676
            W++QEH ++FI  LLSPP PAD  GS S+LI+ APFLNVLLVGIS +DCV IFSLHG+VP
Sbjct: 610  WSIQEHVIIFIVTLLSPPAPADLSGSYSHLINSAPFLNVLLVGISPIDCVHIFSLHGVVP 669

Query: 677  LLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVKGD 736
            LLAG LMPICEAFGS  P  +W L +GE ++ HAVFS AFTLLLRLWRF HPP++ V GD
Sbjct: 670  LLAGALMPICEAFGSGVPNITWTLPTGELISSHAVFSTAFTLLLRLWRFDHPPLDYVLGD 729

Query: 737  ARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEPTFMDSFPKLKGWY 796
              PVG Q +PEYLLLVRN +L  FGKSPKDR+  RR SK++  S++P FMDSFP+LK WY
Sbjct: 730  VPPVGPQPSPEYLLLVRNCRLECFGKSPKDRMARRRFSKVIDISVDPIFMDSFPRLKQWY 789

Query: 797  RQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSANEEA 856
            RQHQEC+ASI   L  G+PVH  VD+LL+MMFKK N+GG    + +S SS+ S S  +++
Sbjct: 790  RQHQECMASILSELKTGSPVHHIVDSLLSMMFKKANKGGSQSLTPSSGSSSLSTSGGDDS 849

Query: 857  SIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFATIVCYFSA 916
            S +LK+PAWDILEA PFVLDAALTACAHG LSPR     LK LADFLPA+  T+V YFS+
Sbjct: 850  SDQLKLPAWDILEAAPFVLDAALTACAHGSLSPRELATGLKILADFLPATLGTMVSYFSS 909

Query: 917  EVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPLPLA 976
            EVTRG+WKP  MNGTDWPSPAA L+ VEQQI+KILAATGVDVP L   G   A LPLPLA
Sbjct: 910  EVTRGLWKPVSMNGTDWPSPAANLASVEQQIEKILAATGVDVPRLPADGISAATLPLPLA 969

Query: 977  ALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLVFSA 1036
            AL+SLTITYKLDKA+ER L LVGPAL+SL A C WPC PI+ SLW QKVKRW+DFL+FSA
Sbjct: 970  ALVSLTITYKLDKATERFLVLVGPALDSLAAACPWPCMPIVTSLWTQKVKRWSDFLIFSA 1029

Query: 1037 SRTVFHHNSDAVVQLLKSCFTSTLGL-GNSNVNSGGGVGALLGHGFGSHVLGGMSPVAPG 1096
            SRTVFHHN DAV+QLL+SCFT TLGL   S + S GGVGALLGHGFGS   GG+S  APG
Sbjct: 1030 SRTVFHHNRDAVIQLLRSCFTCTLGLTPTSQLCSYGGVGALLGHGFGSRYSGGISTAAPG 1089

Query: 1097 ILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMR--CEQVSF 1156
            ILY++VHR +RD +FL EEI+SLLM SVK IA   LP  +AEKLKK+K G R    QVS 
Sbjct: 1090 ILYIKVHRSIRDVMFLTEEILSLLMFSVKSIATRELPAGQAEKLKKTKDGSRYGIGQVSL 1149

Query: 1157 ASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLR 1216
            + AM RVKLAASLGASLVWISGG  LVQ+L KETLPSWF+SVH    E  E GGMVP+LR
Sbjct: 1150 SLAMRRVKLAASLGASLVWISGGLNLVQALIKETLPSWFISVHG---EEDELGGMVPMLR 1209

Query: 1217 GYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIGCDWATWRAYV 1276
            GYALA+F++L   F+WG+DS   ASKRR ++L  HLEF+ SAL+GK S+GCDWATW+AYV
Sbjct: 1210 GYALAYFAILSSAFAWGVDSSYPASKRRPRVLWLHLEFMVSALEGKISLGCDWATWQAYV 1269

Query: 1277 SGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELIIE 1303
            +GFVSL+V+C P W+LEVD++V++RL K LRQ NE++LALALL +GGL  MGAA ELI+E
Sbjct: 1270 TGFVSLMVQCTPAWVLEVDVEVIKRLSKSLRQWNEQDLALALLCAGGLGTMGAATELIVE 1270

BLAST of Cp4.1LG15g00030 vs. Swiss-Prot
Match: MD33A_ARATH (Mediator of RNA polymerase II transcription subunit 33A OS=Arabidopsis thaliana GN=MED33A PE=1 SV=1)

HSP 1 Score: 1406.7 bits (3640), Expect = 0.0e+00
Identity = 744/1319 (56.41%), Postives = 950/1319 (72.02%), Query Frame = 1

Query: 17   VWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDNHVPIMW 76
            VWD V+ELTK AQE   DP LWA QLSS+L   +V LPS ELA ++VS+ICWDN+VPI+W
Sbjct: 9    VWDCVIELTKMAQENCVDPRLWASQLSSNLKFFAVELPSTELAEVIVSYICWDNNVPIVW 68

Query: 77   KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEVNGPNYP 136
            KFLE+AM  ++V PL+V+ALL+ R +P R  + AAYR+YLELL R++F+    ++GP+Y 
Sbjct: 69   KFLERAMALKLVSPLVVLALLADRVVPTRSTQQAAYRIYLELLKRNMFTIKDHISGPHYQ 128

Query: 137  RIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVWL 196
            ++M ++ ++L LS++F L T +PG+L+VE  F +V  LLDA+L DEGLLEL  +  S WL
Sbjct: 129  KVMISVSNILRLSELFDLDTSKPGVLLVEFVFKMVSQLLDAALSDEGLLELSQDSSSQWL 188

Query: 197  IRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNMP 256
            ++ Q  DME+D  + + EK T + E L  +NT  AIE+I +FL+N   AR+L L   N  
Sbjct: 189  VKSQ--DMEIDAPERYNEK-TGSLEKLQSLNTIMAIELIAEFLRNTVIARLLYLVSSNRA 248

Query: 257  LHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMAS 316
              W  F Q++QLL  NS  L+++K++    LL   S++    S + K TS  + + ++  
Sbjct: 249  SKWHEFVQKVQLLGENSSALKHSKVLNSGDLLQLISNRRFGYSYDSKVTSARKSNAIVDF 308

Query: 317  GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASWH 376
            GSL S AG  HG + S+LWLP+DL  EDAMDG QV  TSA+E +  L K+L+ +N ++WH
Sbjct: 309  GSLSSYAGLCHGASLSSLWLPLDLVFEDAMDGYQVNPTSAIEIITGLAKTLKEINGSTWH 368

Query: 377  NTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIEEEEGELKEEDEC 436
            +TFLGLWIAALRL+QRERDP EGP+PRLDT LCM L I  L V  +IEE + E   E   
Sbjct: 369  DTFLGLWIAALRLVQRERDPIEGPIPRLDTRLCMSLCIVPLVVANLIEEGKYESVME--- 428

Query: 437  SPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAAKAVMFISGVAVG 496
                         K R  L+TSLQ+LG++  LL PP+ V+  AN+AA KA++F+SG  VG
Sbjct: 429  -------------KLRDDLVTSLQVLGDFPGLLAPPKCVVSAANKAATKAILFLSGGNVG 488

Query: 497  NEYYDCVSMNDTPINCC--------------------------YVNTRSSQVPRSASSQV 556
               +D ++M D P+NC                           YVN R +Q+P+S  ++V
Sbjct: 489  KSCFDVINMKDMPVNCSGNMRHLIVEACIARNILDMSAYSWPGYVNGRINQIPQSLPNEV 548

Query: 557  VGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVR 616
              WSSF+KG+ L  +MVN L + PASSLAE+EK++E+A+ GS DEKISAA++LCGASL R
Sbjct: 549  PCWSSFVKGAPLNAAMVNTLVSVPASSLAELEKLFEVAVKGSDDEKISAATVLCGASLTR 608

Query: 617  GWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMV 676
            GWN+QEHTV +++RLLSPP+PADY  ++++LI YA  LNV++VGI SVD +QIFSLHGMV
Sbjct: 609  GWNIQEHTVEYLTRLLSPPVPADYSRAENHLIGYACMLNVVIVGIGSVDSIQIFSLHGMV 668

Query: 677  PLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVKG 736
            P LA  LMPICE FGS TP  SW L SGE ++ ++VFS AFTLLL+LWRF+HPPIE+  G
Sbjct: 669  PQLACSLMPICEEFGSYTPSVSWTLPSGEAISAYSVFSNAFTLLLKLWRFNHPPIEHGVG 728

Query: 737  DARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKF-SLEPTFMDSFPKLKG 796
            D   VGSQLTPE+LL VRNS L S     +DR + +RLS++ +  S +P F+DSFPKLK 
Sbjct: 729  DVPTVGSQLTPEHLLSVRNSYLVSSEILDRDRNR-KRLSEVARAASCQPVFVDSFPKLKV 788

Query: 797  WYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSANE 856
            WYRQHQ CIA+   GL  G+PVHQTV+ALL M F K+ RG Q+L    S +S+SSG+A+E
Sbjct: 789  WYRQHQRCIAATLSGLTHGSPVHQTVEALLNMTFGKV-RGSQTLNPVNSGTSSSSGAASE 848

Query: 857  EASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFATIVCYF 916
            +++I+ + PAWDIL+A P+V+DAALTAC HGRLSPR     LKDLADFLPAS ATIV YF
Sbjct: 849  DSNIRPEFPAWDILKAVPYVVDAALTACTHGRLSPRQLATGLKDLADFLPASLATIVSYF 908

Query: 917  SAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPLP 976
            SAEV+RG+WKP FMNG DWPSPA  LS VE+ I KILA TGVD+PSLA GGS PA LPLP
Sbjct: 909  SAEVSRGVWKPVFMNGVDWPSPATNLSTVEEYITKILATTGVDIPSLAPGGSSPATLPLP 968

Query: 977  LAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLVF 1036
            LAA +SLTITYK+DKASER L L GPAL  L AGC WPC PI+ASLW QK KRW DFLVF
Sbjct: 969  LAAFVSLTITYKIDKASERFLNLAGPALECLAAGCPWPCMPIVASLWTQKAKRWFDFLVF 1028

Query: 1037 SASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGSHVLGGMSPVAP 1096
            SASRTVF HN DAV+QLL++CF++TLGL  + +++ GGVGALLGHGFGSH  GG+SPVAP
Sbjct: 1029 SASRTVFLHNQDAVIQLLRNCFSATLGLNAAPMSNDGGVGALLGHGFGSHFYGGISPVAP 1088

Query: 1097 GILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMRCEQVSFA 1156
            GILYLR++R +RD + + EEI+SLL+ SV+DIA   L KEK EKLK  K+G R  Q S A
Sbjct: 1089 GILYLRMYRALRDTVSVSEEILSLLIHSVEDIAQNRLSKEKLEKLKTVKNGSRYGQSSLA 1148

Query: 1157 SAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLRG 1216
            +AM +VKLAASL ASLVW++GG G+V  L KET+PSWFLS    DRE      +V  LRG
Sbjct: 1149 TAMTQVKLAASLSASLVWLTGGLGVVHVLIKETIPSWFLSTDKSDREQGP-SDLVAELRG 1208

Query: 1217 YALAFFSVLCGTFSWGIDSISSASKRRAK-LLDAHLEFLASALDGKFSIGCDWATWRAYV 1276
            +ALA+F VLCG  +WG+DS SSASKRR + +L +HLEF+ASALDGK S+GC+ ATWR Y+
Sbjct: 1209 HALAYFVVLCGALTWGVDSRSSASKRRRQAILGSHLEFIASALDGKISVGCETATWRTYI 1268

Query: 1277 SGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELII 1302
            SG VSL+V C P W+ E+D +VL+ L  GLR+  ++ELA+ LL  GGL  M  AA+ II
Sbjct: 1269 SGLVSLMVSCLPLWVTEIDTEVLKSLSNGLRKWGKDELAIVLLSLGGLKTMDYAADFII 1305

BLAST of Cp4.1LG15g00030 vs. TrEMBL
Match: A0A061DRW3_THECC (Reduced epidermal fluorescence 4, putative isoform 1 OS=Theobroma cacao GN=TCM_004452 PE=4 SV=1)

HSP 1 Score: 1824.7 bits (4725), Expect = 0.0e+00
Identity = 948/1332 (71.17%), Postives = 1080/1332 (81.08%), Query Frame = 1

Query: 11   LQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDN 70
            +Q  + VW++VLE TKSAQ KNSDPLLWAVQLSSSLNSA VSLPS++LAHLLVSHICWDN
Sbjct: 5    IQAASPVWESVLEQTKSAQGKNSDPLLWAVQLSSSLNSAGVSLPSIDLAHLLVSHICWDN 64

Query: 71   HVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEV 130
            HVPI WK+LEKAMT + VPP+LV+ALLSTR IP RK  PAAYRLY+ELL RH FS   ++
Sbjct: 65   HVPITWKYLEKAMTVKFVPPILVLALLSTRVIPNRKFHPAAYRLYMELLRRHTFSLKCQI 124

Query: 131  NGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAE 190
            N PNY + M++IDDVLHLSQIFGLQ  EPGLL+VE  FSIVW LLDASLDDEGLLEL  E
Sbjct: 125  NKPNYQKTMKSIDDVLHLSQIFGLQVSEPGLLLVEFVFSIVWQLLDASLDDEGLLELTPE 184

Query: 191  ERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCL 250
            +RS+W    Q  DME+D  D+F EK+ E  + + K NT  AIEIIG+FLQNK T+RIL L
Sbjct: 185  KRSIWPTITQ--DMEIDTPDNFNEKRNEQLDVMYKGNTTMAIEIIGEFLQNKVTSRILFL 244

Query: 251  AHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEF 310
            A +NMP HW  F Q+L +LAA SV LRN+K +TP+ LL  TSD  + LS+E K  S  EF
Sbjct: 245  ARRNMPSHWGAFIQQLSILAAQSVALRNSKHLTPDALLQLTSDSRKVLSRECKIKSHEEF 304

Query: 311  HDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLIC------LI 370
            H ++ SG L SS+GQ +G + SA WLPIDLFLEDAMDGSQV AT AVERL        L+
Sbjct: 305  HAIIGSGCLTSSSGQCYGTSSSAHWLPIDLFLEDAMDGSQVAATGAVERLTGRVNLAGLV 364

Query: 371  KSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIE 430
            K+L+AVN  +WH+TFLGLWIAALRL+QRERD SEGPVPRLDTCLCMLLSIT L V  I+E
Sbjct: 365  KALQAVNGTTWHDTFLGLWIAALRLVQRERDISEGPVPRLDTCLCMLLSITPLVVANIVE 424

Query: 431  EEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAA 490
            EEE EL +E +CSP+    EKQ+ G+ R+ LI+SLQML +YE+LLTPPQSV  VANQAAA
Sbjct: 425  EEESELIDESDCSPTNQTKEKQAPGRCRKDLISSLQMLSDYEALLTPPQSVRSVANQAAA 484

Query: 491  KAVMFISGVAVGNEYYDCVSMNDTPINCC--------------------------YVNTR 550
            KA+MFISG+ VGN YY+C+S+ND P+NC                           YVN R
Sbjct: 485  KAIMFISGLTVGNGYYECMSINDMPMNCSGNMRHLIVEACIARNLLDTSAYIWPGYVNAR 544

Query: 551  SSQVPRSASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKIS 610
            ++ +P S  SQV GWSS MKGS LTP+++NAL ATPASSLAEIEKIYEIA  GS +EKIS
Sbjct: 545  AN-IPCSVPSQVPGWSSLMKGSPLTPTLINALIATPASSLAEIEKIYEIATKGSDEEKIS 604

Query: 611  AASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSV 670
            AASILCGASLVRGWN+QEH +LFI+ LLSPP+PADY GSDS+LI+YAP LNVLLVGISSV
Sbjct: 605  AASILCGASLVRGWNIQEHNILFITSLLSPPVPADYSGSDSHLINYAPLLNVLLVGISSV 664

Query: 671  DCVQIFSLHGMVPLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLW 730
            DCVQIFSLHGMVPLLAG LMP+CE FGS+ P  SW L +GEELT HAVF+ AFTLLLRLW
Sbjct: 665  DCVQIFSLHGMVPLLAGTLMPLCEVFGSTAPTVSWTLPTGEELTSHAVFTNAFTLLLRLW 724

Query: 731  RFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEP 790
            RF HPP+E V GDA PVGSQL+P+YLLLVRNS+L +FGKSPKDRLK++RLSK L FSL+ 
Sbjct: 725  RFDHPPLERVMGDATPVGSQLSPDYLLLVRNSKLLAFGKSPKDRLKIKRLSKNLNFSLDI 784

Query: 791  TFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTS 850
             FMDSFPKLK WYRQHQECIAS   GLV G  VHQ VDALL MMF+KI+RGGQS TSTTS
Sbjct: 785  IFMDSFPKLKCWYRQHQECIASTLSGLVQGTTVHQIVDALLNMMFRKISRGGQSFTSTTS 844

Query: 851  ASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFL 910
             SS+SS S  E+A  +LKVPAWDILE TP+VLDAALTACAHGRLSPR     LKDLADFL
Sbjct: 845  GSSSSSASGAEDAHTRLKVPAWDILEGTPYVLDAALTACAHGRLSPRELATGLKDLADFL 904

Query: 911  PASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLAL 970
            PA+  TIV YFSAEVTRGIWKPAFMNGTDWPSPAA LS+VEQ IKKILAATGVDVPSLA+
Sbjct: 905  PATLGTIVSYFSAEVTRGIWKPAFMNGTDWPSPAANLSMVEQHIKKILAATGVDVPSLAV 964

Query: 971  GGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQ 1030
            GGS P  LPLPLAAL+SLTITYKLDK SER L L+GPALNSL  GC WPC PIIASLWAQ
Sbjct: 965  GGSSPTTLPLPLAALVSLTITYKLDKGSERFLILIGPALNSLAEGCPWPCMPIIASLWAQ 1024

Query: 1031 KVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGS 1090
            KVKRWNDFLVFSASRTVFHH+SDAVVQLL+SCFTSTLGL  S + S GGVGALLGHGFGS
Sbjct: 1025 KVKRWNDFLVFSASRTVFHHSSDAVVQLLRSCFTSTLGLSPSIIYSNGGVGALLGHGFGS 1084

Query: 1091 HVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSK 1150
            H  GGMSPVAPGILYLRVHR VRD +F+ EEIVSLLM SV++IA +GL +EK+EKLKK+K
Sbjct: 1085 HFSGGMSPVAPGILYLRVHRSVRDIMFMTEEIVSLLMSSVREIASSGLSQEKSEKLKKTK 1144

Query: 1151 HGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGV 1210
             G+R  QVS  +AM RVKLAASLGASLVW+SGG  LVQSL KETLPSWF+S H+ +++G 
Sbjct: 1145 FGLRYGQVSLGAAMTRVKLAASLGASLVWLSGGLSLVQSLIKETLPSWFISAHAPEKDGG 1204

Query: 1211 EYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIG 1270
            E GG+V +L GYALA+F+VLCGTF+WG+DS S ASKRR K+L AHLEFLASALDGK S+G
Sbjct: 1205 EPGGVVAMLGGYALAYFAVLCGTFAWGVDSASPASKRRPKVLGAHLEFLASALDGKISLG 1264

Query: 1271 CDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTA 1305
            CD ATWRAYV+GFVSL+V C  KW+L+VD+ VL+RL  GLRQ NEEELA+ALL  GG+ A
Sbjct: 1265 CDSATWRAYVTGFVSLMVACTQKWVLDVDVYVLKRLSNGLRQWNEEELAMALLGLGGVGA 1324

BLAST of Cp4.1LG15g00030 vs. TrEMBL
Match: F6HQ63_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g00390 PE=4 SV=1)

HSP 1 Score: 1806.6 bits (4678), Expect = 0.0e+00
Identity = 937/1322 (70.88%), Postives = 1069/1322 (80.86%), Query Frame = 1

Query: 16   GVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDNHVPIM 75
            G+WD+VLELTKSAQE+NSDPLLWAVQLSS LNSA  SLPS ELAHLLVSHICW N+VPI 
Sbjct: 18   GLWDSVLELTKSAQERNSDPLLWAVQLSSILNSAGTSLPSPELAHLLVSHICWANNVPIT 77

Query: 76   WKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEVNGPNY 135
            WKFLEKA++ RI PP+LV+ALLS+R IP R+L PAAYRLY+ELL RH FS T E+NGPNY
Sbjct: 78   WKFLEKAVSGRIAPPMLVLALLSSRVIPNRRLYPAAYRLYMELLKRHTFSFTSEINGPNY 137

Query: 136  PRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVW 195
             +IM++IDDVLHLSQIFGLQ CEPG L+VE  FSIVW LLDASLDDEGLLEL  E++S W
Sbjct: 138  QKIMKSIDDVLHLSQIFGLQVCEPGALVVEFIFSIVWQLLDASLDDEGLLELAPEKKSKW 197

Query: 196  LIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNM 255
              R Q  DM++D  DSF EK+T+  E L KVNT  AIEIIG F QNK T++IL LA +NM
Sbjct: 198  PTRSQ--DMDIDGQDSFNEKRTDRQEGLCKVNTVMAIEIIGDFFQNKVTSKILYLARRNM 257

Query: 256  PLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMA 315
              HW  F QRL++LAANS  LRN+K I+P+ LL  TSD    L++E KT+ Q +FH V+A
Sbjct: 258  FSHWGSFIQRLRVLAANSTALRNSKHISPDALLQLTSDARIVLTRECKTSLQKQFHAVVA 317

Query: 316  SGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASW 375
            SGSL SSAGQ HGV+WSALWLPID+FLED MD SQV+ATSAVE L  L+K+L+AVN  SW
Sbjct: 318  SGSLISSAGQCHGVSWSALWLPIDMFLEDTMDDSQVVATSAVETLTGLVKALQAVNGTSW 377

Query: 376  HNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIEEEEGELKEEDE 435
            HNTFLG+WIAALRL+QRERDPSEGPVPRLDTCLCMLLSIT LA+  IIEEEE  L +E  
Sbjct: 378  HNTFLGVWIAALRLVQRERDPSEGPVPRLDTCLCMLLSITPLAIVNIIEEEESTLIDEAG 437

Query: 436  CSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAAKAVMFISGVAV 495
             SP+  R EKQ S K R+ LI+SLQ+LG+YE LLT PQS+ +VANQA AKA+MF+SGV  
Sbjct: 438  RSPTNLRKEKQISVKHRKDLISSLQLLGDYEGLLTVPQSLSLVANQAVAKAMMFVSGVTS 497

Query: 496  GNEYYDCVSMNDTPINCC--------------------------YVNTRSSQVPRSASSQ 555
            G+ Y DC+SMND P+NC                           YVN RS+Q+PRS    
Sbjct: 498  GSGYLDCMSMNDLPMNCSGNMRHLIVEACIARNLLDTSAYLWPGYVNGRSNQLPRSVPGP 557

Query: 556  VVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLV 615
            + GWSS MKGS LTP M+N L +TPASSLAEIEKIYEIA+NGS DEKISAA+ILCGASLV
Sbjct: 558  MPGWSSLMKGSPLTPPMINVLVSTPASSLAEIEKIYEIAVNGSDDEKISAAAILCGASLV 617

Query: 616  RGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGM 675
            RGWN+QEHTV FI++LLSPP+PADY G+DS+LI YAPFLNVLLVGISSVDCVQI+SLHG+
Sbjct: 618  RGWNIQEHTVFFITKLLSPPVPADYSGTDSHLIGYAPFLNVLLVGISSVDCVQIYSLHGL 677

Query: 676  VPLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVK 735
            VP LAG LMPICE FGS  PK S  LT+GEE++ H VFS AF LLLRLWRF+HPP+E+V 
Sbjct: 678  VPQLAGALMPICEVFGSCAPKVSCTLTTGEEISSHQVFSNAFNLLLRLWRFNHPPLEHVM 737

Query: 736  G-DARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEPTFMDSFPKLK 795
            G D  PVGSQLTPEYLLLVRNSQLA+ G + K   K RR S++   S EP FMDSFPKLK
Sbjct: 738  GGDIPPVGSQLTPEYLLLVRNSQLANSGNTTKGPFKYRRPSRISSPSPEPIFMDSFPKLK 797

Query: 796  GWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSAN 855
             WYRQHQ CIAS   GLV G PVHQ VDA+L MMF+K+ RGGQ LT T S SSNSSGS  
Sbjct: 798  LWYRQHQACIASALSGLVHGTPVHQVVDAVLNMMFRKMGRGGQPLTPTASGSSNSSGSGP 857

Query: 856  EEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFATIVCY 915
            E+AS++LK+PAWDILEA PFVLDAALTACAHGRLSPR     LKDL+DFLPAS ATI  Y
Sbjct: 858  EDASLRLKLPAWDILEAIPFVLDAALTACAHGRLSPRELATGLKDLSDFLPASLATIASY 917

Query: 916  FSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPL 975
            FSAEVTRGIWKPAFMNGTDWPSPAA LS+VEQQIKK+LAATGVDVPSLA  GS PA LPL
Sbjct: 918  FSAEVTRGIWKPAFMNGTDWPSPAANLSMVEQQIKKVLAATGVDVPSLAAVGSSPATLPL 977

Query: 976  PLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLV 1035
            PLAAL+SLTITYKLD+A+ERLL +VGPALNSL AGC WPC PIIASLWAQKVKRW D+L+
Sbjct: 978  PLAALVSLTITYKLDRATERLLTVVGPALNSLAAGCPWPCMPIIASLWAQKVKRWADYLI 1037

Query: 1036 FSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGSHVLGGMSPVA 1095
            FSASRTVFHH SDAVVQLLKSCFTSTLGL +S V+S GGVGALLGHGFGSH  GGMSPVA
Sbjct: 1038 FSASRTVFHHKSDAVVQLLKSCFTSTLGLNSSPVSSNGGVGALLGHGFGSHYSGGMSPVA 1097

Query: 1096 PGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMRCEQVSF 1155
            PGILYLRVHR VRD +F+ E ++SLLM SV+DIA  GLPKE+ EKLKK+K+GMR  QVS 
Sbjct: 1098 PGILYLRVHRDVRDVMFMTEVVLSLLMQSVRDIASCGLPKERLEKLKKTKYGMRYGQVSL 1157

Query: 1156 ASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLR 1215
            A+AM RVKLAASLGAS+VWISGG  LVQSL KETLPSWF+SVH  + EG E   M  +L 
Sbjct: 1158 AAAMTRVKLAASLGASIVWISGGLTLVQSLIKETLPSWFISVHGSEHEGTESEEMAAMLG 1217

Query: 1216 GYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIGCDWATWRAYV 1275
            GYALA+F+VLCGTF+WG+D +S ASKRR K+L AHLEFLA+ALDGK S+GC W TWRAYV
Sbjct: 1218 GYALAYFAVLCGTFAWGVDLMSPASKRRPKVLGAHLEFLANALDGKISLGCVWGTWRAYV 1277

Query: 1276 SGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELIIE 1305
               V+L+V C P W+ EVD++VL+R+ KGLRQ NEEELA+ALL  GG+ AMGAAAE+I+E
Sbjct: 1278 PALVTLMVGCTPAWIPEVDVEVLKRVSKGLRQWNEEELAIALLGLGGIHAMGAAAEVIVE 1337

BLAST of Cp4.1LG15g00030 vs. TrEMBL
Match: A0A067JPN7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21970 PE=4 SV=1)

HSP 1 Score: 1801.2 bits (4664), Expect = 0.0e+00
Identity = 927/1338 (69.28%), Postives = 1078/1338 (80.57%), Query Frame = 1

Query: 1    MAVSAQPPGQLQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAH 60
            MAVS Q P Q      +WD VLELTKSAQ+KNSDPLLWA+QLSS LNSA V+LPS ELAH
Sbjct: 1    MAVSLQQPSQ----QSLWDWVLELTKSAQDKNSDPLLWAIQLSSRLNSAGVTLPSTELAH 60

Query: 61   LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLS 120
            LLVSH C++NHVPIMWKFLEK++  +I PP LV+ALLSTR IP RKL PAAYRLYLEL+ 
Sbjct: 61   LLVSHTCFENHVPIMWKFLEKSLALKIAPPTLVLALLSTRVIPNRKLHPAAYRLYLELVK 120

Query: 121  RHVFSSTLEVNGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLD 180
            RH FS + ++NGPNY +IM+  DD LHLS IFGLQ C+PGL++VE  FS+VW LLDASLD
Sbjct: 121  RHAFSFSSQINGPNYAKIMKLTDDALHLSHIFGLQVCKPGLILVEFVFSMVWQLLDASLD 180

Query: 181  DEGLLELPAEERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQ 240
            DEGLL+L +E++S WL   Q  DM++D H+SFGEK+ E++E L   NT  AIE+IG+FLQ
Sbjct: 181  DEGLLDLTSEKKSRWLTSLQ--DMDIDGHESFGEKRNEHNEGLKSANTTMAIELIGEFLQ 240

Query: 241  NKKTARILCLAHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQ 300
            NK T+RIL LA +NM  HW GF QR++LLAA+S  LRN+K IT E+LL  TSD  + LS 
Sbjct: 241  NKVTSRILYLARKNMSSHWRGFIQRVRLLAAHSAALRNSKHITAEILLQLTSDTRQLLSG 300

Query: 301  EGKTTSQLEFHDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
            + K  SQ +FH VM+S SL SSAGQ HG +WSALWLPIDLFLEDAMDGSQV A SAVE L
Sbjct: 301  DTKRISQQDFHAVMSSRSLISSAGQCHGASWSALWLPIDLFLEDAMDGSQVAAISAVENL 360

Query: 361  ICLIKSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVT 420
              L+K+L+AVN  +WH+TFLGLW+A+LRL+QRER+PSEGPVPRLDTCLCMLL  TTLAV 
Sbjct: 361  TGLVKALQAVNGTTWHDTFLGLWMASLRLVQREREPSEGPVPRLDTCLCMLLCTTTLAVA 420

Query: 421  IIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVAN 480
             IIEEEEGEL  E E +P+    EKQ  GK R+GLIT+LQ+LG+Y+SLLTPPQSV +VAN
Sbjct: 421  NIIEEEEGELIMETESNPTDQMKEKQGLGKCREGLITALQLLGDYDSLLTPPQSVSLVAN 480

Query: 481  QAAAKAVMFISGVAVGNEYYDCVSMNDTPINCC--------------------------- 540
            QAAAKA++FISGV  GN YY+ +SMND P++C                            
Sbjct: 481  QAAAKAMLFISGVTSGNSYYESMSMNDMPMSCSAGNMRHLIVEACIARFLLDTSAYVWPG 540

Query: 541  YVNTRSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSG 600
            YVN  S+Q+PR    Q+ GWS+ MKGS LTPSM+N L ATPASSL EIEK+YEIA+NGS 
Sbjct: 541  YVNAHSNQIPRGVLGQMPGWSALMKGSPLTPSMINTLVATPASSLPEIEKVYEIALNGSN 600

Query: 601  DEKISAASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLV 660
            DEK+SAA+ILCGASL RGWN+QEHT+LFI +LLSPP PADY G+DS+LIDYAP LN+LLV
Sbjct: 601  DEKLSAATILCGASLSRGWNIQEHTILFIIKLLSPPFPADYSGNDSHLIDYAPLLNILLV 660

Query: 661  GISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTL 720
            G+SSVDCVQI SLHG+VPLLAG L+PICE FGSS PK SW L SGEE+TCHAVFS AF+L
Sbjct: 661  GLSSVDCVQILSLHGLVPLLAGALIPICEVFGSSVPKVSWTLPSGEEITCHAVFSNAFSL 720

Query: 721  LLRLWRFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLK 780
            L+RLWRFHHPP+ENV GD  PV S+L PEYLL+VRNSQLASFG SP+D +K RR SK+L 
Sbjct: 721  LVRLWRFHHPPLENVTGDKTPVASKLGPEYLLVVRNSQLASFGSSPRDFIKSRRFSKILN 780

Query: 781  FSLEPTFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGG-QS 840
             SLEP FMDSFPKL+ WYRQH ECIAS   GLV G PVHQ VDALL MMFK+INRGG QS
Sbjct: 781  ISLEPIFMDSFPKLQLWYRQHLECIASTFSGLVNGTPVHQLVDALLNMMFKRINRGGVQS 840

Query: 841  LTSTTSASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LK 900
            LTSTTS SS SSGS  EEA ++L+VPAWDILEATPF LDAALTACAHG+LSPR     LK
Sbjct: 841  LTSTTSGSS-SSGSGAEEAYVRLQVPAWDILEATPFALDAALTACAHGKLSPRELATGLK 900

Query: 901  DLADFLPASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVD 960
            DLADFLPAS ATIV Y SAEVTRGIWKPA+MNGTDWPSPAA LS VEQQIKKIL+ATGVD
Sbjct: 901  DLADFLPASLATIVSYLSAEVTRGIWKPAYMNGTDWPSPAANLSTVEQQIKKILSATGVD 960

Query: 961  VPSLALGGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPII 1020
            VPSL +GG+ PA LPLPLAAL+SLTITY+LDK SER L L+GPALN+L + C WPC PII
Sbjct: 961  VPSLPVGGNSPATLPLPLAALVSLTITYRLDKVSERQLVLIGPALNALASSCPWPCMPII 1020

Query: 1021 ASLWAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALL 1080
            A+LWAQKVKRW+DFLVFSAS TVFHH  DAVVQLLKSCFT+TLG    +++S GGVGALL
Sbjct: 1021 AALWAQKVKRWSDFLVFSASSTVFHHKGDAVVQLLKSCFTATLGFSAPHISSNGGVGALL 1080

Query: 1081 GHGFGSHVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAE 1140
            GHGFGSH +GG+SPVAPG LY+RVHR +RD +FL E+IVS+LM SVK+IA +GLP+E  E
Sbjct: 1081 GHGFGSHFVGGISPVAPGFLYVRVHRSLRDFVFLTEKIVSILMHSVKEIANSGLPRETVE 1140

Query: 1141 KLKKSKHGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHS 1200
            KL+K+K+GMR  QVS  +AM RVKLAASLGASLVWISGGS LVQSL KETLPSWF+S HS
Sbjct: 1141 KLRKTKYGMRYGQVSLTAAMMRVKLAASLGASLVWISGGSSLVQSLVKETLPSWFISGHS 1200

Query: 1201 VDREGVEYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALD 1260
            +++E  E GG+V +LRGYALA+  VL GTF+WGIDS S ASKRR  +L  HL+F+ASALD
Sbjct: 1201 LEQEDAESGGLVAMLRGYALAYLVVLSGTFAWGIDSESPASKRRPMVLSCHLKFIASALD 1260

Query: 1261 GKFSIGCDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLE 1305
            GK S+G DWAT RA+ +G +SL+V C PKWL EV++ +L+RL KGLRQ NEE+LAL LL 
Sbjct: 1261 GKISLGSDWATTRAFATGVLSLMVACTPKWLTEVNVDLLKRLSKGLRQWNEEQLALDLLG 1320

BLAST of Cp4.1LG15g00030 vs. TrEMBL
Match: M5VJA7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000301mg PE=4 SV=1)

HSP 1 Score: 1775.8 bits (4598), Expect = 0.0e+00
Identity = 922/1338 (68.91%), Postives = 1064/1338 (79.52%), Query Frame = 1

Query: 1    MAVSAQPPGQLQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAH 60
            MAVS Q P     ++ +WDTVL+LTKSAQ+KNSDPLLWAVQLS+SLNSA V+LPSVELAH
Sbjct: 2    MAVSVQQP-----LSHLWDTVLQLTKSAQDKNSDPLLWAVQLSNSLNSAGVALPSVELAH 61

Query: 61   LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLS 120
            LLVSHICW NHVPI WKFLEKA+T +IVPP+LV+ALLST+                    
Sbjct: 62   LLVSHICWANHVPITWKFLEKALTVKIVPPMLVLALLSTK-------------------- 121

Query: 121  RHVFSSTLEVNGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLD 180
             H F    ++NGPN  +I+++IDDVLHLSQ++GLQ CEPG+++VE  FSI+W LLDASLD
Sbjct: 122  -HTFLFASQLNGPNCQKIIKSIDDVLHLSQLYGLQVCEPGVVIVEFVFSIIWQLLDASLD 181

Query: 181  DEGLLELPAEERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQ 240
            DEGLLEL  +++S W  RP+  DME+D HD F EK++E +E L K NTA AIEII +FLQ
Sbjct: 182  DEGLLELTPDKKSRWSTRPE--DMEIDGHDCFNEKRSEQNEGLQKANTAMAIEIIVEFLQ 241

Query: 241  NKKTARILCLAHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQ 300
            NK T+RILCL  +N+P HW GF Q++Q+LAANS VLR  K ITPE LL  TSD  R L++
Sbjct: 242  NKVTSRILCLTRRNLPSHWGGFIQKMQVLAANSSVLRTLKHITPESLLQLTSDTRRLLTR 301

Query: 301  EGKTTSQLEFHDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
            E K  S+  FH V++SGSL SSA QSHGV+ SA WLPIDLFLEDAMDGS+V   SAVE L
Sbjct: 302  ECKIISRQGFHAVLSSGSLRSSASQSHGVSSSAFWLPIDLFLEDAMDGSEVAIISAVETL 361

Query: 361  ICLIKSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVT 420
              L+K+L+AVN  +WHN FLGLWIAALRL+QRERDP EGPVPRLDTCLCMLLSITTLAVT
Sbjct: 362  TGLVKALQAVNSTTWHNAFLGLWIAALRLVQRERDPREGPVPRLDTCLCMLLSITTLAVT 421

Query: 421  IIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVAN 480
             IIEEEE +L EE E   +  R E Q+ GKRR+GLI  LQ LG+YE+LLTPPQSV  V N
Sbjct: 422  NIIEEEEAQLMEEIEGDRTNQRKE-QAPGKRREGLIMCLQRLGDYETLLTPPQSVCSVVN 481

Query: 481  QAAAKAVMFISGVAVGNEYYDCVSMNDTPINCC--------------------------Y 540
            QAAAKA+M+ SG+ V N YY+ VS+ND PINC                           Y
Sbjct: 482  QAAAKAIMYRSGLTVSNGYYESVSVNDVPINCMGNLRHLIVEACIARNLLDTSAYFWPGY 541

Query: 541  VNTRSSQV----PRSASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAIN 600
            V+ RSSQV    P     QV GWSS MKGS LTPS+VNAL ATPASSLAEIEKIYEIA+N
Sbjct: 542  VSARSSQVLRNVPGQVPGQVPGWSSIMKGSPLTPSLVNALVATPASSLAEIEKIYEIAVN 601

Query: 601  GSGDEKISAASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNV 660
            GS +EKISA +ILCGASL+RGWN+QEH  LFI  LLSPP+PADY GSDS+LI YA F NV
Sbjct: 602  GSDEEKISAVTILCGASLIRGWNIQEHICLFIINLLSPPVPADYSGSDSHLISYASFFNV 661

Query: 661  LLVGISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLA 720
            LLVG+SS+D VQIFSL G+VPLLA  LMPICE FGSS P  SW  T+GEEL+CHAVFS A
Sbjct: 662  LLVGVSSIDTVQIFSLLGLVPLLAAALMPICEVFGSSVPNISWTPTTGEELSCHAVFSNA 721

Query: 721  FTLLLRLWRFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSK 780
            FTLLLRLWRF HPP+E+V GD   VG+Q+ PEYLL++RNS+LAS G SP DR+K RR+SK
Sbjct: 722  FTLLLRLWRFDHPPLEHVMGDLPTVGNQVGPEYLLVLRNSRLASLGNSPMDRIKSRRVSK 781

Query: 781  LLKFSLEPTFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGG 840
             + F  E   MD FPKLK WY+QHQ+CI S   GLVPG PVHQ VDALLTMMF+KINR  
Sbjct: 782  FITFPSETITMDRFPKLKLWYQQHQKCICSTLSGLVPGTPVHQIVDALLTMMFRKINRSS 841

Query: 841  QSLTSTTSASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR----- 900
            Q LT  TS SSNSS S  +E+SI+LKVPAWDILEATPFVLDAALTACAHGRLSPR     
Sbjct: 842  QPLTPATSGSSNSSASGTDESSIRLKVPAWDILEATPFVLDAALTACAHGRLSPRELATG 901

Query: 901  LKDLADFLPASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATG 960
            LK+LAD+LPA+ AT+V YFSAEVTRGIWKPA MNGTDWPSPAA LS+VEQQIKKILAATG
Sbjct: 902  LKELADYLPATLATMVSYFSAEVTRGIWKPACMNGTDWPSPAANLSIVEQQIKKILAATG 961

Query: 961  VDVPSLALGGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTP 1020
            VDVPSLA+GGS PAMLPLP AAL+SLTITYKLD+ASER L L+GPALNSL AGC WPC P
Sbjct: 962  VDVPSLAVGGSAPAMLPLPFAALVSLTITYKLDRASERALTLIGPALNSLAAGCPWPCMP 1021

Query: 1021 IIASLWAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGA 1080
            I+ASLWAQKVKRW+D+LVF+AS+TVFHHN DAVVQLLKSCFTSTLGL +S   S G VGA
Sbjct: 1022 ILASLWAQKVKRWSDYLVFTASQTVFHHNGDAVVQLLKSCFTSTLGLSSSRAYSNGSVGA 1081

Query: 1081 LLGHGFGSHVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEK 1140
            LLGHGFGSH  GG+SPVAPGILYLRVHR VRD +F+ EE++SLLM SV+DIA  GLP+++
Sbjct: 1082 LLGHGFGSHFSGGISPVAPGILYLRVHRSVRDVMFMTEEVLSLLMFSVRDIASCGLPRDR 1141

Query: 1141 AEKLKKSKHGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSV 1200
             ++LKK KHGMR  QVS A+AMARV++AA+LGA+LVWISGGS LVQSL KETLPSWF+S 
Sbjct: 1142 VDRLKKIKHGMRYGQVSLAAAMARVRIAATLGATLVWISGGSNLVQSLIKETLPSWFIST 1201

Query: 1201 HSVDREGVEYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASA 1260
            H +D+EG E GGMV +L GYALA+F+VLCGTF+WG+DS++ ASKRR K+L AHLEFLASA
Sbjct: 1202 HGLDQEGRESGGMVAMLGGYALAYFAVLCGTFAWGVDSVTPASKRRPKVLGAHLEFLASA 1261

Query: 1261 LDGKFSIGCDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALAL 1303
            LDGK S+GCDWA WRAYVSGFVSL+V C  KW+LEVD++VL+RL K LR+LNEEELALAL
Sbjct: 1262 LDGKISLGCDWAMWRAYVSGFVSLMVACTQKWVLEVDVEVLKRLSKELRRLNEEELALAL 1310

BLAST of Cp4.1LG15g00030 vs. TrEMBL
Match: A0A0D2TEF0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G134500 PE=4 SV=1)

HSP 1 Score: 1774.2 bits (4594), Expect = 0.0e+00
Identity = 917/1326 (69.16%), Postives = 1062/1326 (80.09%), Query Frame = 1

Query: 11   LQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDN 70
            +Q  + VW++VLELTKSAQ KN DPLLWAVQLSSSLNSA +SLPS++LAHLLVSHICWDN
Sbjct: 5    VQAASPVWESVLELTKSAQYKNRDPLLWAVQLSSSLNSAGISLPSIDLAHLLVSHICWDN 64

Query: 71   HVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEV 130
            HVPI WKFLEKA+T + VP +LV+ALLSTR IP RK  PAAYRLY+ELL RH FS   ++
Sbjct: 65   HVPITWKFLEKALTVKFVPSMLVLALLSTRVIPNRKFHPAAYRLYMELLRRHAFSLQCQI 124

Query: 131  NGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAE 190
            NGPNY +IM+++ DVLHLSQIFG+Q  EPGLL+VE  FSIV  LLDASLDDEGLLEL  E
Sbjct: 125  NGPNYQKIMKSVGDVLHLSQIFGVQVSEPGLLLVEFVFSIVLQLLDASLDDEGLLELTPE 184

Query: 191  ERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCL 250
            +RS+W    Q  DME+D  D+F EK++E+ + L K NT  AIEIIG+FLQNK T+RIL L
Sbjct: 185  KRSIWPTVTQ--DMEIDSVDNFNEKRSEHHDVLCKGNTTMAIEIIGEFLQNKVTSRILFL 244

Query: 251  AHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEF 310
            A +NM  HW  F Q+L++LAA SV LRN K ITPE  L+ TSD H+ +S++ KT SQ E 
Sbjct: 245  ARRNMSSHWGTFLQQLRVLAAKSVALRNAKHITPEAFLNLTSDMHKVVSRKCKTISQQEL 304

Query: 311  HDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAV 370
            + V+ SGSL +S+GQ HG + SA WLPIDLFLEDAMDGSQV AT AVE L  L+K+L+AV
Sbjct: 305  NAVIGSGSLTTSSGQYHGTSPSAHWLPIDLFLEDAMDGSQVAATGAVESLTGLVKALQAV 364

Query: 371  NDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIEEEEGEL 430
            N  +WH+TFLGLWIAALRL+QRERD SEGPVPRLDTCLCMLLSIT L V  I+EEEE EL
Sbjct: 365  NGTTWHDTFLGLWIAALRLVQRERDISEGPVPRLDTCLCMLLSITPLVVANIVEEEESEL 424

Query: 431  KEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAAKAVMFI 490
             +  +CSP+    EKQS G+ RQ LI SLQMLG+YE+LLTPPQ V  VANQAAAKA+MF+
Sbjct: 425  IDGCDCSPTNQIKEKQSPGRCRQDLICSLQMLGDYEALLTPPQPVRSVANQAAAKAIMFV 484

Query: 491  SGVAVGNEYYDCVSMNDTPINCC--------------------------YVNTRSSQVPR 550
            SG+ VGN Y++C+S+ND P+NC                           YVN R++ +PR
Sbjct: 485  SGLTVGNGYHECMSINDMPMNCSGNMRHLIVEACIARNLLDTSAYVWPGYVNARAN-IPR 544

Query: 551  SASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILC 610
            +   QV+GWSS MKGS LTP+++NAL ATPASSLAEIEKIYEIA  GS DEKISAASILC
Sbjct: 545  NVPVQVIGWSSLMKGSPLTPTLINALIATPASSLAEIEKIYEIATKGSDDEKISAASILC 604

Query: 611  GASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIF 670
            GASLVRGWN+QEHT+LFI+ LLSPP+PADY GS+S+LI Y PF N+LLVGIS VDCVQIF
Sbjct: 605  GASLVRGWNIQEHTILFITSLLSPPVPADYSGSESHLISYGPFFNILLVGISPVDCVQIF 664

Query: 671  SLHGMVPLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPP 730
            SLHG+VPLLAG LMP+CE FGS++P  SW L +GEELT HAVFS AFTLLLRLWRF+HPP
Sbjct: 665  SLHGLVPLLAGTLMPLCEVFGSTSPNVSWTLPTGEELTSHAVFSNAFTLLLRLWRFNHPP 724

Query: 731  IENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEPTFMDSF 790
             EN  GDA PVGSQLTPEYLLLVRNS+L+ FGKSPKD +K++R+SK L  SLE  FMDSF
Sbjct: 725  FENAMGDATPVGSQLTPEYLLLVRNSKLSDFGKSPKDHMKLKRMSKNLNISLELIFMDSF 784

Query: 791  PKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSS 850
            PKLK WYRQHQECIAS   GLV G  VHQ VDALL MMF+KI+RGGQSLTSTTS SS+S 
Sbjct: 785  PKLKSWYRQHQECIASTLTGLVQGTTVHQIVDALLNMMFRKISRGGQSLTSTTSGSSSSP 844

Query: 851  GSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFAT 910
             S  E+ S++LKVPAWDILE  P+VLDAALTACAHGRL PR     LKDLAD LPA+ AT
Sbjct: 845  ASGAEDVSMRLKVPAWDILEGAPYVLDAALTACAHGRLLPRDLATGLKDLADLLPATLAT 904

Query: 911  IVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPA 970
            IV Y SAEVTRGIWKP FMNGTDWPSP+A L +VEQQIKKI+AATGVD+PSLA+GGS PA
Sbjct: 905  IVSYLSAEVTRGIWKPVFMNGTDWPSPSANLFMVEQQIKKIIAATGVDIPSLAIGGSSPA 964

Query: 971  MLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWN 1030
            MLPLPLAAL+SLTITYKLDKASER + L+GPAL+SL   C WPC PIIASLWAQKVKRW+
Sbjct: 965  MLPLPLAALVSLTITYKLDKASERFIVLIGPALSSLAESCPWPCMPIIASLWAQKVKRWS 1024

Query: 1031 DFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGSHVLGGM 1090
            DFLVFSASRTVFHHN DAVVQLL+SCFTS  GL  S + S GGVGALLGHGFGSH  GGM
Sbjct: 1025 DFLVFSASRTVFHHNIDAVVQLLRSCFTSIPGLSPSTIYSNGGVGALLGHGFGSHFSGGM 1084

Query: 1091 SPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMRCE 1150
            S VAPGILYLRVHR VR+ +F+ EEIVSLLM SV+DIA +GL +E  EKLKK+K G+R  
Sbjct: 1085 SAVAPGILYLRVHRSVRNIMFMTEEIVSLLMSSVRDIANSGLFRENLEKLKKAKFGLRYR 1144

Query: 1151 QVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMV 1210
            QVS  +A  RV+ AASLGASLVW+SGG  LVQ L +ETLPSWFLS H+ D++G E GG+V
Sbjct: 1145 QVSLGAATTRVRFAASLGASLVWLSGGLSLVQLLIQETLPSWFLSAHTPDQDGGEPGGLV 1204

Query: 1211 PVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIGCDWATW 1270
             +L GYALA+F VLCGT +WG+DS S  SKRR K+L AHLEFLASA+DGK S+GC++ATW
Sbjct: 1205 AMLSGYALAYFVVLCGTLAWGVDSSSPTSKRRPKVLGAHLEFLASAIDGKISLGCEYATW 1264

Query: 1271 RAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAE 1305
             AYV+GFVSL+V C  KW+L+V++ VL RL  GLRQ NEEELA+ALL  GG  A  +AAE
Sbjct: 1265 HAYVTGFVSLMVGCTKKWVLDVNVDVLRRLSNGLRQWNEEELAIALLGLGGAEATASAAE 1324

BLAST of Cp4.1LG15g00030 vs. TAIR10
Match: AT2G48110.1 (AT2G48110.1 reduced epidermal fluorescence 4)

HSP 1 Score: 1429.8 bits (3700), Expect = 0.0e+00
Identity = 782/1320 (59.24%), Postives = 935/1320 (70.83%), Query Frame = 1

Query: 17   VWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDNHVPIMW 76
            +W++V  L +SAQEKN DPL WA+QL  +L SA +SLPS +LA  LV+HI W+NH P+ W
Sbjct: 10   LWESVTSLIRSAQEKNVDPLHWALQLRLTLASAGISLPSPDLAQFLVTHIFWENHSPLSW 69

Query: 77   KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEVNGPNYP 136
            K LEKA++  IVPPLLV+ALLS R IP RKL PAAYRLY+ELL RH FS    +  P Y 
Sbjct: 70   KLLEKAISVNIVPPLLVLALLSPRVIPNRKLHPAAYRLYMELLKRHAFSFMPLIRAPGYH 129

Query: 137  RIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVWL 196
            + M +IDD+LHLS+ FG+Q  EPG +++   FSIVW LLDASLD+EGLLEL + +RS W 
Sbjct: 130  KTMNSIDDILHLSETFGVQDQEPGSILLAFVFSIVWELLDASLDEEGLLELTSNKRSKW- 189

Query: 197  IRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNMP 256
                PHDM+LD             EN +K N               +    L  A+  M 
Sbjct: 190  -PSSPHDMDLD-----------GLENSVKRN---------------ENHDALEKANTEMA 249

Query: 257  LHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMAS 316
            +      + +Q    N V  R         +LH  S      + E KT  + EFH +++S
Sbjct: 250  I------ELIQEFLQNKVTSR---------ILHLASQ-----NMESKTIPRGEFHAIVSS 309

Query: 317  GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASWH 376
            GS  +    S      ALWLPIDLF ED MDG+Q  A SAVE L  L+K+L+A N  SWH
Sbjct: 310  GSKLALTSDS------ALWLPIDLFFEDIMDGTQAAAASAVENLTGLVKALQAANSTSWH 369

Query: 377  NTFLGLWIAALRLIQRE-------------------RDPSEGPVPRLDTCLCMLLSITTL 436
            + FL LW+AALRL+QRE                   RDP EGPVPR DT LC+LLS+T L
Sbjct: 370  DAFLALWLAALRLVQRENLCLRYCFFMHMLEILSEERDPIEGPVPRTDTFLCVLLSVTPL 429

Query: 437  AVTIIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIV 496
            AV  IIEEEE +  ++   SPS    EK+  GK RQGLI SLQ LG+YESLLTPP+SV  
Sbjct: 430  AVANIIEEEESQWIDQTSSSPSNQWKEKK--GKCRQGLINSLQQLGDYESLLTPPRSVQS 489

Query: 497  VANQAAAKAVMFISGVAVGNEYYDCVSMNDTPINCCYVNTRSSQVPRSASSQVV------ 556
            VANQAAAKA+MFISG+   N  Y+  SM+++   CC V      +       V       
Sbjct: 490  VANQAAAKAIMFISGITNSNGSYENTSMSESASGCCKVRFSLFTLKMFVVMGVYLLCNIS 549

Query: 557  GWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRG 616
             WS  MKGS LTPS+ N+L  TPASSLAEIEK+YE+A  GS DEKI+ ASILCGASL RG
Sbjct: 550  CWSLVMKGSPLTPSLTNSLITTPASSLAEIEKMYEVATTGSEDEKIAVASILCGASLFRG 609

Query: 617  WNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVP 676
            W++QEH ++FI  LLSPP PAD  GS S+LI+ APFLNVLLVGIS +DCV IFSLHG+VP
Sbjct: 610  WSIQEHVIIFIVTLLSPPAPADLSGSYSHLINSAPFLNVLLVGISPIDCVHIFSLHGVVP 669

Query: 677  LLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVKGD 736
            LLAG LMPICEAFGS  P  +W L +GE ++ HAVFS AFTLLLRLWRF HPP++ V GD
Sbjct: 670  LLAGALMPICEAFGSGVPNITWTLPTGELISSHAVFSTAFTLLLRLWRFDHPPLDYVLGD 729

Query: 737  ARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEPTFMDSFPKLKGWY 796
              PVG Q +PEYLLLVRN +L  FGKSPKDR+  RR SK++  S++P FMDSFP+LK WY
Sbjct: 730  VPPVGPQPSPEYLLLVRNCRLECFGKSPKDRMARRRFSKVIDISVDPIFMDSFPRLKQWY 789

Query: 797  RQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSANEEA 856
            RQHQEC+ASI   L  G+PVH  VD+LL+MMFKK N+GG    + +S SS+ S S  +++
Sbjct: 790  RQHQECMASILSELKTGSPVHHIVDSLLSMMFKKANKGGSQSLTPSSGSSSLSTSGGDDS 849

Query: 857  SIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFATIVCYFSA 916
            S +LK+PAWDILEA PFVLDAALTACAHG LSPR     LK LADFLPA+  T+V YFS+
Sbjct: 850  SDQLKLPAWDILEAAPFVLDAALTACAHGSLSPRELATGLKILADFLPATLGTMVSYFSS 909

Query: 917  EVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPLPLA 976
            EVTRG+WKP  MNGTDWPSPAA L+ VEQQI+KILAATGVDVP L   G   A LPLPLA
Sbjct: 910  EVTRGLWKPVSMNGTDWPSPAANLASVEQQIEKILAATGVDVPRLPADGISAATLPLPLA 969

Query: 977  ALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLVFSA 1036
            AL+SLTITYKLDKA+ER L LVGPAL+SL A C WPC PI+ SLW QKVKRW+DFL+FSA
Sbjct: 970  ALVSLTITYKLDKATERFLVLVGPALDSLAAACPWPCMPIVTSLWTQKVKRWSDFLIFSA 1029

Query: 1037 SRTVFHHNSDAVVQLLKSCFTSTLGL-GNSNVNSGGGVGALLGHGFGSHVLGGMSPVAPG 1096
            SRTVFHHN DAV+QLL+SCFT TLGL   S + S GGVGALLGHGFGS   GG+S  APG
Sbjct: 1030 SRTVFHHNRDAVIQLLRSCFTCTLGLTPTSQLCSYGGVGALLGHGFGSRYSGGISTAAPG 1089

Query: 1097 ILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMR--CEQVSF 1156
            ILY++VHR +RD +FL EEI+SLLM SVK IA   LP  +AEKLKK+K G R    QVS 
Sbjct: 1090 ILYIKVHRSIRDVMFLTEEILSLLMFSVKSIATRELPAGQAEKLKKTKDGSRYGIGQVSL 1149

Query: 1157 ASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLR 1216
            + AM RVKLAASLGASLVWISGG  LVQ+L KETLPSWF+SVH    E  E GGMVP+LR
Sbjct: 1150 SLAMRRVKLAASLGASLVWISGGLNLVQALIKETLPSWFISVHG---EEDELGGMVPMLR 1209

Query: 1217 GYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIGCDWATWRAYV 1276
            GYALA+F++L   F+WG+DS   ASKRR ++L  HLEF+ SAL+GK S+GCDWATW+AYV
Sbjct: 1210 GYALAYFAILSSAFAWGVDSSYPASKRRPRVLWLHLEFMVSALEGKISLGCDWATWQAYV 1269

Query: 1277 SGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELIIE 1303
            +GFVSL+V+C P W+LEVD++V++RL K LRQ NE++LALALL +GGL  MGAA ELI+E
Sbjct: 1270 TGFVSLMVQCTPAWVLEVDVEVIKRLSKSLRQWNEQDLALALLCAGGLGTMGAATELIVE 1270

BLAST of Cp4.1LG15g00030 vs. TAIR10
Match: AT3G23590.1 (AT3G23590.1 REF4-related 1)

HSP 1 Score: 1406.7 bits (3640), Expect = 0.0e+00
Identity = 744/1319 (56.41%), Postives = 950/1319 (72.02%), Query Frame = 1

Query: 17   VWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDNHVPIMW 76
            VWD V+ELTK AQE   DP LWA QLSS+L   +V LPS ELA ++VS+ICWDN+VPI+W
Sbjct: 9    VWDCVIELTKMAQENCVDPRLWASQLSSNLKFFAVELPSTELAEVIVSYICWDNNVPIVW 68

Query: 77   KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEVNGPNYP 136
            KFLE+AM  ++V PL+V+ALL+ R +P R  + AAYR+YLELL R++F+    ++GP+Y 
Sbjct: 69   KFLERAMALKLVSPLVVLALLADRVVPTRSTQQAAYRIYLELLKRNMFTIKDHISGPHYQ 128

Query: 137  RIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVWL 196
            ++M ++ ++L LS++F L T +PG+L+VE  F +V  LLDA+L DEGLLEL  +  S WL
Sbjct: 129  KVMISVSNILRLSELFDLDTSKPGVLLVEFVFKMVSQLLDAALSDEGLLELSQDSSSQWL 188

Query: 197  IRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNMP 256
            ++ Q  DME+D  + + EK T + E L  +NT  AIE+I +FL+N   AR+L L   N  
Sbjct: 189  VKSQ--DMEIDAPERYNEK-TGSLEKLQSLNTIMAIELIAEFLRNTVIARLLYLVSSNRA 248

Query: 257  LHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMAS 316
              W  F Q++QLL  NS  L+++K++    LL   S++    S + K TS  + + ++  
Sbjct: 249  SKWHEFVQKVQLLGENSSALKHSKVLNSGDLLQLISNRRFGYSYDSKVTSARKSNAIVDF 308

Query: 317  GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASWH 376
            GSL S AG  HG + S+LWLP+DL  EDAMDG QV  TSA+E +  L K+L+ +N ++WH
Sbjct: 309  GSLSSYAGLCHGASLSSLWLPLDLVFEDAMDGYQVNPTSAIEIITGLAKTLKEINGSTWH 368

Query: 377  NTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIEEEEGELKEEDEC 436
            +TFLGLWIAALRL+QRERDP EGP+PRLDT LCM L I  L V  +IEE + E   E   
Sbjct: 369  DTFLGLWIAALRLVQRERDPIEGPIPRLDTRLCMSLCIVPLVVANLIEEGKYESVME--- 428

Query: 437  SPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAAKAVMFISGVAVG 496
                         K R  L+TSLQ+LG++  LL PP+ V+  AN+AA KA++F+SG  VG
Sbjct: 429  -------------KLRDDLVTSLQVLGDFPGLLAPPKCVVSAANKAATKAILFLSGGNVG 488

Query: 497  NEYYDCVSMNDTPINCC--------------------------YVNTRSSQVPRSASSQV 556
               +D ++M D P+NC                           YVN R +Q+P+S  ++V
Sbjct: 489  KSCFDVINMKDMPVNCSGNMRHLIVEACIARNILDMSAYSWPGYVNGRINQIPQSLPNEV 548

Query: 557  VGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVR 616
              WSSF+KG+ L  +MVN L + PASSLAE+EK++E+A+ GS DEKISAA++LCGASL R
Sbjct: 549  PCWSSFVKGAPLNAAMVNTLVSVPASSLAELEKLFEVAVKGSDDEKISAATVLCGASLTR 608

Query: 617  GWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMV 676
            GWN+QEHTV +++RLLSPP+PADY  ++++LI YA  LNV++VGI SVD +QIFSLHGMV
Sbjct: 609  GWNIQEHTVEYLTRLLSPPVPADYSRAENHLIGYACMLNVVIVGIGSVDSIQIFSLHGMV 668

Query: 677  PLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVKG 736
            P LA  LMPICE FGS TP  SW L SGE ++ ++VFS AFTLLL+LWRF+HPPIE+  G
Sbjct: 669  PQLACSLMPICEEFGSYTPSVSWTLPSGEAISAYSVFSNAFTLLLKLWRFNHPPIEHGVG 728

Query: 737  DARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKF-SLEPTFMDSFPKLKG 796
            D   VGSQLTPE+LL VRNS L S     +DR + +RLS++ +  S +P F+DSFPKLK 
Sbjct: 729  DVPTVGSQLTPEHLLSVRNSYLVSSEILDRDRNR-KRLSEVARAASCQPVFVDSFPKLKV 788

Query: 797  WYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSANE 856
            WYRQHQ CIA+   GL  G+PVHQTV+ALL M F K+ RG Q+L    S +S+SSG+A+E
Sbjct: 789  WYRQHQRCIAATLSGLTHGSPVHQTVEALLNMTFGKV-RGSQTLNPVNSGTSSSSGAASE 848

Query: 857  EASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFATIVCYF 916
            +++I+ + PAWDIL+A P+V+DAALTAC HGRLSPR     LKDLADFLPAS ATIV YF
Sbjct: 849  DSNIRPEFPAWDILKAVPYVVDAALTACTHGRLSPRQLATGLKDLADFLPASLATIVSYF 908

Query: 917  SAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPLP 976
            SAEV+RG+WKP FMNG DWPSPA  LS VE+ I KILA TGVD+PSLA GGS PA LPLP
Sbjct: 909  SAEVSRGVWKPVFMNGVDWPSPATNLSTVEEYITKILATTGVDIPSLAPGGSSPATLPLP 968

Query: 977  LAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLVF 1036
            LAA +SLTITYK+DKASER L L GPAL  L AGC WPC PI+ASLW QK KRW DFLVF
Sbjct: 969  LAAFVSLTITYKIDKASERFLNLAGPALECLAAGCPWPCMPIVASLWTQKAKRWFDFLVF 1028

Query: 1037 SASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGSHVLGGMSPVAP 1096
            SASRTVF HN DAV+QLL++CF++TLGL  + +++ GGVGALLGHGFGSH  GG+SPVAP
Sbjct: 1029 SASRTVFLHNQDAVIQLLRNCFSATLGLNAAPMSNDGGVGALLGHGFGSHFYGGISPVAP 1088

Query: 1097 GILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMRCEQVSFA 1156
            GILYLR++R +RD + + EEI+SLL+ SV+DIA   L KEK EKLK  K+G R  Q S A
Sbjct: 1089 GILYLRMYRALRDTVSVSEEILSLLIHSVEDIAQNRLSKEKLEKLKTVKNGSRYGQSSLA 1148

Query: 1157 SAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLRG 1216
            +AM +VKLAASL ASLVW++GG G+V  L KET+PSWFLS    DRE      +V  LRG
Sbjct: 1149 TAMTQVKLAASLSASLVWLTGGLGVVHVLIKETIPSWFLSTDKSDREQGP-SDLVAELRG 1208

Query: 1217 YALAFFSVLCGTFSWGIDSISSASKRRAK-LLDAHLEFLASALDGKFSIGCDWATWRAYV 1276
            +ALA+F VLCG  +WG+DS SSASKRR + +L +HLEF+ASALDGK S+GC+ ATWR Y+
Sbjct: 1209 HALAYFVVLCGALTWGVDSRSSASKRRRQAILGSHLEFIASALDGKISVGCETATWRTYI 1268

Query: 1277 SGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELII 1302
            SG VSL+V C P W+ E+D +VL+ L  GLR+  ++ELA+ LL  GGL  M  AA+ II
Sbjct: 1269 SGLVSLMVSCLPLWVTEIDTEVLKSLSNGLRKWGKDELAIVLLSLGGLKTMDYAADFII 1305

BLAST of Cp4.1LG15g00030 vs. NCBI nr
Match: gi|659096980|ref|XP_008449381.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 33B-like isoform X1 [Cucumis melo])

HSP 1 Score: 2312.7 bits (5992), Expect = 0.0e+00
Identity = 1196/1336 (89.52%), Postives = 1237/1336 (92.59%), Query Frame = 1

Query: 1    MAVSAQPPGQLQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAH 60
            MAVSAQPPGQLQGIAG+WDTVLE+TKSAQ+KN DPLLWAVQLSS+LNSA VSLPSVELA 
Sbjct: 1    MAVSAQPPGQLQGIAGLWDTVLEVTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ 60

Query: 61   LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLS 120
            LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKL+PAAYRLYLELLS
Sbjct: 61   LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLQPAAYRLYLELLS 120

Query: 121  RHVFSSTLEVNGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLD 180
            RHVFSST ++ GPNY RIMQTIDDVLHL+QIFGLQTCEPG+LMVELFFSIVW LLDASLD
Sbjct: 121  RHVFSSTSQIYGPNYQRIMQTIDDVLHLTQIFGLQTCEPGVLMVELFFSIVWQLLDASLD 180

Query: 181  DEGLLELPAEERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQ 240
            DEGLL LP EE+S WLIRPQ HDMELDVHDSFGEKKTENSE+LLKVNTAKAIEIIGQFLQ
Sbjct: 181  DEGLLALPGEEKSAWLIRPQLHDMELDVHDSFGEKKTENSESLLKVNTAKAIEIIGQFLQ 240

Query: 241  NKKTARILCLAHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQ 300
            NKKT RILCLA +NMPL WAGFAQRLQLL ANSVVL N KLITPEVLLHWTSDK++ LSQ
Sbjct: 241  NKKTERILCLALRNMPLQWAGFAQRLQLLGANSVVLGNAKLITPEVLLHWTSDKNKLLSQ 300

Query: 301  EGKTTSQLEFHDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
            +GKT SQLEF DVM+SGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL
Sbjct: 301  KGKT-SQLEFRDVMSSGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360

Query: 361  ICLIKSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVT 420
            ICLIKSLRAVND SWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSI+TLAVT
Sbjct: 361  ICLIKSLRAVNDTSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVT 420

Query: 421  IIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVAN 480
            IIIEEEE E KE+D CSPSKSRDEKQSSG  R+GLITSLQMLGEYESLLTPPQS+I VAN
Sbjct: 421  IIIEEEEVEPKEDD-CSPSKSRDEKQSSGMCRKGLITSLQMLGEYESLLTPPQSIIAVAN 480

Query: 481  QAAAKAVMFISGVAVGNEYYDCVSMNDTPINCC--------------------------Y 540
            QAAAKAVMFISGVAVGNEYYDC SMND PINC                           Y
Sbjct: 481  QAAAKAVMFISGVAVGNEYYDCASMNDAPINCSGNMRHLIVEACISRNLLDTSAYFWPGY 540

Query: 541  VNTRSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGD 600
            VN  SSQVPRSAS+QVVGWSSFMKGS LTPSMVNAL ATPASSLAEIEKIYEIAINGSGD
Sbjct: 541  VNALSSQVPRSASNQVVGWSSFMKGSPLTPSMVNALVATPASSLAEIEKIYEIAINGSGD 600

Query: 601  EKISAASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVG 660
            EKISAASILCGASLVRGW LQEHT LFISRLL PPIP DY GSDSYLIDYAPFLNVLLVG
Sbjct: 601  EKISAASILCGASLVRGWYLQEHTALFISRLLLPPIPTDYSGSDSYLIDYAPFLNVLLVG 660

Query: 661  ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSTPKSWILTSGEELTCHAVFSLAFTLLL 720
            ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSS PKSWILTSGEELTCHAVFSLAFTLLL
Sbjct: 661  ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSPPKSWILTSGEELTCHAVFSLAFTLLL 720

Query: 721  RLWRFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFS 780
            RLWRFHHPP+ENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSP DRLK RRLSKLLKFS
Sbjct: 721  RLWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPNDRLKARRLSKLLKFS 780

Query: 781  LEPTFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTS 840
            L+P FMDSFPKLKGWYRQHQECIASI  GLVPGAPVHQ VDALLTMMF+KINRGGQSLTS
Sbjct: 781  LQPIFMDSFPKLKGWYRQHQECIASILSGLVPGAPVHQIVDALLTMMFRKINRGGQSLTS 840

Query: 841  TTSASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLA 900
            TTS SSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR     LKDLA
Sbjct: 841  TTSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA 900

Query: 901  DFLPASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPS 960
            DFLPASFATIV YFSAEVTRGIWKPAFMNGTDWPSPAATLS+VEQQIKKILAATGVDVP 
Sbjct: 901  DFLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPC 960

Query: 961  LALGGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASL 1020
            LA+GGS PAMLPLPLAALISLTITYKLDKASERLLALVGPALNSL A CSWPCTPIIASL
Sbjct: 961  LAVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASL 1020

Query: 1021 WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHG 1080
            WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSN N+ GGVG LLGHG
Sbjct: 1021 WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNGNNSGGVGTLLGHG 1080

Query: 1081 FGSHVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLK 1140
            FGSHVLGGMSPVAPGILYLRVHR VRD LF+VEEIVSLLMLSV+DIAV+GLPKEKAEKLK
Sbjct: 1081 FGSHVLGGMSPVAPGILYLRVHRSVRDVLFVVEEIVSLLMLSVRDIAVSGLPKEKAEKLK 1140

Query: 1141 KSKHGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDR 1200
            K+K+GMR EQVSFASAMARVKLAASLGASLVWISGGSGLVQSL+KETLPSWFLSVHSV+R
Sbjct: 1141 KTKYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVHSVER 1200

Query: 1201 EGVEYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKF 1260
            EGV YGGMV VLRG+ALAFFSVLCGTFSWGIDS SSASKRRAK+LD++LEFLASALDGKF
Sbjct: 1201 EGVNYGGMVAVLRGHALAFFSVLCGTFSWGIDSSSSASKRRAKILDSYLEFLASALDGKF 1260

Query: 1261 SIGCDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGG 1306
            SIGCDWATWRAYVSGFVSL+VRCAP+WLLEVDL VL RL  GLRQLNEEEL L LLESGG
Sbjct: 1261 SIGCDWATWRAYVSGFVSLIVRCAPRWLLEVDLNVLTRLSNGLRQLNEEELGLELLESGG 1320

BLAST of Cp4.1LG15g00030 vs. NCBI nr
Match: gi|778716139|ref|XP_011657513.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 33B isoform X1 [Cucumis sativus])

HSP 1 Score: 2297.3 bits (5952), Expect = 0.0e+00
Identity = 1191/1336 (89.15%), Postives = 1231/1336 (92.14%), Query Frame = 1

Query: 1    MAVSAQPPGQLQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAH 60
            MAVS  PPGQLQGIAG+WDTVLELTKSAQ+KN DPLLWAVQLSS+LNSA VSLPSVELA 
Sbjct: 1    MAVSTLPPGQLQGIAGLWDTVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ 60

Query: 61   LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLS 120
            LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKL+PAAYRLYLELLS
Sbjct: 61   LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLQPAAYRLYLELLS 120

Query: 121  RHVFSSTLEVNGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLD 180
            RHVFSST ++ GPNY RIMQTIDDVLHL+QIFGLQTCEPG+LMVELFFSIVW LLDASLD
Sbjct: 121  RHVFSSTCQIYGPNYQRIMQTIDDVLHLTQIFGLQTCEPGVLMVELFFSIVWQLLDASLD 180

Query: 181  DEGLLELPAEERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQ 240
            DEGLL L  EE+S WLIRPQ HDMELDVHDSFGEK+TENSE+LLKVNTAKAIEIIGQFLQ
Sbjct: 181  DEGLLALHGEEKSAWLIRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQ 240

Query: 241  NKKTARILCLAHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQ 300
            NKKTARILCLA +NMPL WA FAQRLQLL ANSVVL N KLITPEVLLHWTSDK++ LS+
Sbjct: 241  NKKTARILCLALRNMPLQWAAFAQRLQLLGANSVVLGNAKLITPEVLLHWTSDKNKLLSR 300

Query: 301  EGKTTSQLEFHDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
            +GKT SQLEF DVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL
Sbjct: 301  KGKT-SQLEFRDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360

Query: 361  ICLIKSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVT 420
            ICLIKSLRAVND SWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSI+TLAVT
Sbjct: 361  ICLIKSLRAVNDTSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVT 420

Query: 421  IIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVAN 480
            IIIEEEE E KE+D CSPSKSRDEKQSSG  R+GLITSLQMLGEYESLLTPPQS+I VAN
Sbjct: 421  IIIEEEEVEPKEDD-CSPSKSRDEKQSSGMCRKGLITSLQMLGEYESLLTPPQSIIAVAN 480

Query: 481  QAAAKAVMFISGVAVGNEYYDCVSMNDTPINCC--------------------------Y 540
            QAAAKAVMFISGVAVGNEYYDC SMNDTPINC                           Y
Sbjct: 481  QAAAKAVMFISGVAVGNEYYDCASMNDTPINCSGNMRHLIVEACISRNLLDTSVYFWPGY 540

Query: 541  VNTRSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGD 600
            VN  SSQVP SAS+QVVGWSSFMKGS LTPSMVNAL ATPASSLAEIEKIYEIAINGSGD
Sbjct: 541  VNALSSQVPHSASNQVVGWSSFMKGSPLTPSMVNALVATPASSLAEIEKIYEIAINGSGD 600

Query: 601  EKISAASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVG 660
            EKISAASILCGASLVRGW LQEH  LFISRLL PPIP DY GSDSYLIDYAPFLNVLLVG
Sbjct: 601  EKISAASILCGASLVRGWYLQEHAALFISRLLLPPIPTDYSGSDSYLIDYAPFLNVLLVG 660

Query: 661  ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSTPKSWILTSGEELTCHAVFSLAFTLLL 720
            ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSS PKSWILTSGEELTCHAVFSLAFTLLL
Sbjct: 661  ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSPPKSWILTSGEELTCHAVFSLAFTLLL 720

Query: 721  RLWRFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFS 780
            RLWRFHHPP+ENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSP DRLK RRLSKLLKFS
Sbjct: 721  RLWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPNDRLKARRLSKLLKFS 780

Query: 781  LEPTFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTS 840
            L+P FMDSFPKLKGWYRQHQECIASI  GLVPGAPV Q VDALLTMMF+KINRGGQSLTS
Sbjct: 781  LQPIFMDSFPKLKGWYRQHQECIASILSGLVPGAPVLQIVDALLTMMFRKINRGGQSLTS 840

Query: 841  TTSASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLA 900
            TTS SSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR     LKDLA
Sbjct: 841  TTSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA 900

Query: 901  DFLPASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPS 960
            DFLPASFATIV YFSAEVTRGIWKPAFMNGTDWPSPAATLS+VEQQIKKILAATGVDVP 
Sbjct: 901  DFLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPC 960

Query: 961  LALGGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASL 1020
            LA+GGS PAMLPLPLAALISLTITYKLDKASERLLALVGPAL+SL A CSWPCTPIIASL
Sbjct: 961  LAVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALSSLAASCSWPCTPIIASL 1020

Query: 1021 WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHG 1080
            WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSN NS GGVG LLGHG
Sbjct: 1021 WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNGNSSGGVGTLLGHG 1080

Query: 1081 FGSHVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLK 1140
            FGSHVLGGMSPVAPGILYLRVHR VRD LF+VEEIVSLLMLSV+DIAV+GLPKEKAEKLK
Sbjct: 1081 FGSHVLGGMSPVAPGILYLRVHRSVRDVLFMVEEIVSLLMLSVRDIAVSGLPKEKAEKLK 1140

Query: 1141 KSKHGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDR 1200
            K+K+GMR EQVSFASAMARVKLAASL ASLVWISGGSGLVQSL+KE LPSWFLSVHSV+R
Sbjct: 1141 KTKYGMRYEQVSFASAMARVKLAASLAASLVWISGGSGLVQSLFKEILPSWFLSVHSVER 1200

Query: 1201 EGVEYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKF 1260
            EGV YGGMV VLRGYALAFFSVLCGTFSWGIDS SSASKRRAK+LD++LEFLASALDGKF
Sbjct: 1201 EGVNYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSYLEFLASALDGKF 1260

Query: 1261 SIGCDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGG 1306
            SIGCDWATWRAYVSGFVSL+VRCAP+WLLEVDL VL RL  GLRQLNEEEL LALLESGG
Sbjct: 1261 SIGCDWATWRAYVSGFVSLIVRCAPRWLLEVDLNVLTRLSNGLRQLNEEELGLALLESGG 1320

BLAST of Cp4.1LG15g00030 vs. NCBI nr
Match: gi|659096982|ref|XP_008449382.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 33A-like isoform X2 [Cucumis melo])

HSP 1 Score: 2067.4 bits (5355), Expect = 0.0e+00
Identity = 1071/1198 (89.40%), Postives = 1105/1198 (92.24%), Query Frame = 1

Query: 139  MQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVWLIR 198
            MQTIDDVLHL+QIFGLQTCEPG+LMVELFFSIVW LLDASLDDEGLL LP EE+S WLIR
Sbjct: 1    MQTIDDVLHLTQIFGLQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSAWLIR 60

Query: 199  PQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNMPLH 258
            PQ HDMELDVHDSFGEKKTENSE+LLKVNTAKAIEIIGQFLQNKKT RILCLA +NMPL 
Sbjct: 61   PQLHDMELDVHDSFGEKKTENSESLLKVNTAKAIEIIGQFLQNKKTERILCLALRNMPLQ 120

Query: 259  WAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMASGS 318
            WAGFAQRLQLL ANSVVL N KLITPEVLLHWTSDK++ LSQ+GKT SQLEF DVM+SGS
Sbjct: 121  WAGFAQRLQLLGANSVVLGNAKLITPEVLLHWTSDKNKLLSQKGKT-SQLEFRDVMSSGS 180

Query: 319  LFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASWHNT 378
            LFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVND SWHNT
Sbjct: 181  LFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDTSWHNT 240

Query: 379  FLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIEEEEGELKEEDECSP 438
            FLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSI+TLAVTIIIEEEE E KE+D CSP
Sbjct: 241  FLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEEVEPKEDD-CSP 300

Query: 439  SKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAAKAVMFISGVAVGNE 498
            SKSRDEKQSSG  R+GLITSLQMLGEYESLLTPPQS+I VANQAAAKAVMFISGVAVGNE
Sbjct: 301  SKSRDEKQSSGMCRKGLITSLQMLGEYESLLTPPQSIIAVANQAAAKAVMFISGVAVGNE 360

Query: 499  YYDCVSMNDTPINCC--------------------------YVNTRSSQVPRSASSQVVG 558
            YYDC SMND PINC                           YVN  SSQVPRSAS+QVVG
Sbjct: 361  YYDCASMNDAPINCSGNMRHLIVEACISRNLLDTSAYFWPGYVNALSSQVPRSASNQVVG 420

Query: 559  WSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGW 618
            WSSFMKGS LTPSMVNAL ATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGW
Sbjct: 421  WSSFMKGSPLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGW 480

Query: 619  NLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPL 678
             LQEHT LFISRLL PPIP DY GSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPL
Sbjct: 481  YLQEHTALFISRLLLPPIPTDYSGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPL 540

Query: 679  LAGQLMPICEAFGSSTPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVKGDAR 738
            LAGQLMPICEAFGSS PKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPP+ENVKGDAR
Sbjct: 541  LAGQLMPICEAFGSSPPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGDAR 600

Query: 739  PVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEPTFMDSFPKLKGWYRQ 798
            PVGSQLTPEYLLLVRNSQLASFGKSP DRLK RRLSKLLKFSL+P FMDSFPKLKGWYRQ
Sbjct: 601  PVGSQLTPEYLLLVRNSQLASFGKSPNDRLKARRLSKLLKFSLQPIFMDSFPKLKGWYRQ 660

Query: 799  HQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSANEEASI 858
            HQECIASI  GLVPGAPVHQ VDALLTMMF+KINRGGQSLTSTTS SSNSSGSANEEASI
Sbjct: 661  HQECIASILSGLVPGAPVHQIVDALLTMMFRKINRGGQSLTSTTSGSSNSSGSANEEASI 720

Query: 859  KLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFATIVCYFSAEV 918
            KLKVPAWDILEATPFVLDAALTACAHGRLSPR     LKDLADFLPASFATIV YFSAEV
Sbjct: 721  KLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSAEV 780

Query: 919  TRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPLPLAAL 978
            TRGIWKPAFMNGTDWPSPAATLS+VEQQIKKILAATGVDVP LA+GGS PAMLPLPLAAL
Sbjct: 781  TRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPCLAVGGSSPAMLPLPLAAL 840

Query: 979  ISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLVFSASR 1038
            ISLTITYKLDKASERLLALVGPALNSL A CSWPCTPIIASLWAQKVKRWNDFLVFSASR
Sbjct: 841  ISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSASR 900

Query: 1039 TVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGSHVLGGMSPVAPGILY 1098
            TVFHHNSDAVVQLLKSCFTSTLGLGNSN N+ GGVG LLGHGFGSHVLGGMSPVAPGILY
Sbjct: 901  TVFHHNSDAVVQLLKSCFTSTLGLGNSNGNNSGGVGTLLGHGFGSHVLGGMSPVAPGILY 960

Query: 1099 LRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMRCEQVSFASAMA 1158
            LRVHR VRD LF+VEEIVSLLMLSV+DIAV+GLPKEKAEKLKK+K+GMR EQVSFASAMA
Sbjct: 961  LRVHRSVRDVLFVVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASAMA 1020

Query: 1159 RVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLRGYALA 1218
            RVKLAASLGASLVWISGGSGLVQSL+KETLPSWFLSVHSV+REGV YGGMV VLRG+ALA
Sbjct: 1021 RVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVHSVEREGVNYGGMVAVLRGHALA 1080

Query: 1219 FFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIGCDWATWRAYVSGFVS 1278
            FFSVLCGTFSWGIDS SSASKRRAK+LD++LEFLASALDGKFSIGCDWATWRAYVSGFVS
Sbjct: 1081 FFSVLCGTFSWGIDSSSSASKRRAKILDSYLEFLASALDGKFSIGCDWATWRAYVSGFVS 1140

Query: 1279 LLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELIIEGGF 1306
            L+VRCAP+WLLEVDL VL RL  GLRQLNEEEL L LLESGG+ AMGAAAELIIEGGF
Sbjct: 1141 LIVRCAPRWLLEVDLNVLTRLSNGLRQLNEEELGLELLESGGVNAMGAAAELIIEGGF 1196

BLAST of Cp4.1LG15g00030 vs. NCBI nr
Match: gi|778716142|ref|XP_011657514.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 33A isoform X2 [Cucumis sativus])

HSP 1 Score: 2054.6 bits (5322), Expect = 0.0e+00
Identity = 1067/1198 (89.07%), Postives = 1101/1198 (91.90%), Query Frame = 1

Query: 139  MQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAEERSVWLIR 198
            MQTIDDVLHL+QIFGLQTCEPG+LMVELFFSIVW LLDASLDDEGLL L  EE+S WLIR
Sbjct: 1    MQTIDDVLHLTQIFGLQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALHGEEKSAWLIR 60

Query: 199  PQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCLAHQNMPLH 258
            PQ HDMELDVHDSFGEK+TENSE+LLKVNTAKAIEIIGQFLQNKKTARILCLA +NMPL 
Sbjct: 61   PQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQNKKTARILCLALRNMPLQ 120

Query: 259  WAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEFHDVMASGS 318
            WA FAQRLQLL ANSVVL N KLITPEVLLHWTSDK++ LS++GKT SQLEF DVMASGS
Sbjct: 121  WAAFAQRLQLLGANSVVLGNAKLITPEVLLHWTSDKNKLLSRKGKT-SQLEFRDVMASGS 180

Query: 319  LFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDASWHNT 378
            LFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVND SWHNT
Sbjct: 181  LFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDTSWHNT 240

Query: 379  FLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIEEEEGELKEEDECSP 438
            FLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSI+TLAVTIIIEEEE E KE+D CSP
Sbjct: 241  FLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEEVEPKEDD-CSP 300

Query: 439  SKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAAKAVMFISGVAVGNE 498
            SKSRDEKQSSG  R+GLITSLQMLGEYESLLTPPQS+I VANQAAAKAVMFISGVAVGNE
Sbjct: 301  SKSRDEKQSSGMCRKGLITSLQMLGEYESLLTPPQSIIAVANQAAAKAVMFISGVAVGNE 360

Query: 499  YYDCVSMNDTPINCC--------------------------YVNTRSSQVPRSASSQVVG 558
            YYDC SMNDTPINC                           YVN  SSQVP SAS+QVVG
Sbjct: 361  YYDCASMNDTPINCSGNMRHLIVEACISRNLLDTSVYFWPGYVNALSSQVPHSASNQVVG 420

Query: 559  WSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGW 618
            WSSFMKGS LTPSMVNAL ATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGW
Sbjct: 421  WSSFMKGSPLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGW 480

Query: 619  NLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPL 678
             LQEH  LFISRLL PPIP DY GSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPL
Sbjct: 481  YLQEHAALFISRLLLPPIPTDYSGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPL 540

Query: 679  LAGQLMPICEAFGSSTPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPIENVKGDAR 738
            LAGQLMPICEAFGSS PKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPP+ENVKGDAR
Sbjct: 541  LAGQLMPICEAFGSSPPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGDAR 600

Query: 739  PVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEPTFMDSFPKLKGWYRQ 798
            PVGSQLTPEYLLLVRNSQLASFGKSP DRLK RRLSKLLKFSL+P FMDSFPKLKGWYRQ
Sbjct: 601  PVGSQLTPEYLLLVRNSQLASFGKSPNDRLKARRLSKLLKFSLQPIFMDSFPKLKGWYRQ 660

Query: 799  HQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTSASSNSSGSANEEASI 858
            HQECIASI  GLVPGAPV Q VDALLTMMF+KINRGGQSLTSTTS SSNSSGSANEEASI
Sbjct: 661  HQECIASILSGLVPGAPVLQIVDALLTMMFRKINRGGQSLTSTTSGSSNSSGSANEEASI 720

Query: 859  KLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFLPASFATIVCYFSAEV 918
            KLKVPAWDILEATPFVLDAALTACAHGRLSPR     LKDLADFLPASFATIV YFSAEV
Sbjct: 721  KLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSAEV 780

Query: 919  TRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLALGGSFPAMLPLPLAAL 978
            TRGIWKPAFMNGTDWPSPAATLS+VEQQIKKILAATGVDVP LA+GGS PAMLPLPLAAL
Sbjct: 781  TRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPCLAVGGSSPAMLPLPLAAL 840

Query: 979  ISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQKVKRWNDFLVFSASR 1038
            ISLTITYKLDKASERLLALVGPAL+SL A CSWPCTPIIASLWAQKVKRWNDFLVFSASR
Sbjct: 841  ISLTITYKLDKASERLLALVGPALSSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSASR 900

Query: 1039 TVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGSHVLGGMSPVAPGILY 1098
            TVFHHNSDAVVQLLKSCFTSTLGLGNSN NS GGVG LLGHGFGSHVLGGMSPVAPGILY
Sbjct: 901  TVFHHNSDAVVQLLKSCFTSTLGLGNSNGNSSGGVGTLLGHGFGSHVLGGMSPVAPGILY 960

Query: 1099 LRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSKHGMRCEQVSFASAMA 1158
            LRVHR VRD LF+VEEIVSLLMLSV+DIAV+GLPKEKAEKLKK+K+GMR EQVSFASAMA
Sbjct: 961  LRVHRSVRDVLFMVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASAMA 1020

Query: 1159 RVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGVEYGGMVPVLRGYALA 1218
            RVKLAASL ASLVWISGGSGLVQSL+KE LPSWFLSVHSV+REGV YGGMV VLRGYALA
Sbjct: 1021 RVKLAASLAASLVWISGGSGLVQSLFKEILPSWFLSVHSVEREGVNYGGMVAVLRGYALA 1080

Query: 1219 FFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIGCDWATWRAYVSGFVS 1278
            FFSVLCGTFSWGIDS SSASKRRAK+LD++LEFLASALDGKFSIGCDWATWRAYVSGFVS
Sbjct: 1081 FFSVLCGTFSWGIDSSSSASKRRAKILDSYLEFLASALDGKFSIGCDWATWRAYVSGFVS 1140

Query: 1279 LLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTAMGAAAELIIEGGF 1306
            L+VRCAP+WLLEVDL VL RL  GLRQLNEEEL LALLESGG+ AMGAAAELIIEGGF
Sbjct: 1141 LIVRCAPRWLLEVDLNVLTRLSNGLRQLNEEELGLALLESGGVNAMGAAAELIIEGGF 1196

BLAST of Cp4.1LG15g00030 vs. NCBI nr
Match: gi|590717753|ref|XP_007050679.1| (Reduced epidermal fluorescence 4, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 1824.7 bits (4725), Expect = 0.0e+00
Identity = 948/1332 (71.17%), Postives = 1080/1332 (81.08%), Query Frame = 1

Query: 11   LQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAHLLVSHICWDN 70
            +Q  + VW++VLE TKSAQ KNSDPLLWAVQLSSSLNSA VSLPS++LAHLLVSHICWDN
Sbjct: 5    IQAASPVWESVLEQTKSAQGKNSDPLLWAVQLSSSLNSAGVSLPSIDLAHLLVSHICWDN 64

Query: 71   HVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLSRHVFSSTLEV 130
            HVPI WK+LEKAMT + VPP+LV+ALLSTR IP RK  PAAYRLY+ELL RH FS   ++
Sbjct: 65   HVPITWKYLEKAMTVKFVPPILVLALLSTRVIPNRKFHPAAYRLYMELLRRHTFSLKCQI 124

Query: 131  NGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLDDEGLLELPAE 190
            N PNY + M++IDDVLHLSQIFGLQ  EPGLL+VE  FSIVW LLDASLDDEGLLEL  E
Sbjct: 125  NKPNYQKTMKSIDDVLHLSQIFGLQVSEPGLLLVEFVFSIVWQLLDASLDDEGLLELTPE 184

Query: 191  ERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQNKKTARILCL 250
            +RS+W    Q  DME+D  D+F EK+ E  + + K NT  AIEIIG+FLQNK T+RIL L
Sbjct: 185  KRSIWPTITQ--DMEIDTPDNFNEKRNEQLDVMYKGNTTMAIEIIGEFLQNKVTSRILFL 244

Query: 251  AHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQEGKTTSQLEF 310
            A +NMP HW  F Q+L +LAA SV LRN+K +TP+ LL  TSD  + LS+E K  S  EF
Sbjct: 245  ARRNMPSHWGAFIQQLSILAAQSVALRNSKHLTPDALLQLTSDSRKVLSRECKIKSHEEF 304

Query: 311  HDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLIC------LI 370
            H ++ SG L SS+GQ +G + SA WLPIDLFLEDAMDGSQV AT AVERL        L+
Sbjct: 305  HAIIGSGCLTSSSGQCYGTSSSAHWLPIDLFLEDAMDGSQVAATGAVERLTGRVNLAGLV 364

Query: 371  KSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTIIIE 430
            K+L+AVN  +WH+TFLGLWIAALRL+QRERD SEGPVPRLDTCLCMLLSIT L V  I+E
Sbjct: 365  KALQAVNGTTWHDTFLGLWIAALRLVQRERDISEGPVPRLDTCLCMLLSITPLVVANIVE 424

Query: 431  EEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVANQAAA 490
            EEE EL +E +CSP+    EKQ+ G+ R+ LI+SLQML +YE+LLTPPQSV  VANQAAA
Sbjct: 425  EEESELIDESDCSPTNQTKEKQAPGRCRKDLISSLQMLSDYEALLTPPQSVRSVANQAAA 484

Query: 491  KAVMFISGVAVGNEYYDCVSMNDTPINCC--------------------------YVNTR 550
            KA+MFISG+ VGN YY+C+S+ND P+NC                           YVN R
Sbjct: 485  KAIMFISGLTVGNGYYECMSINDMPMNCSGNMRHLIVEACIARNLLDTSAYIWPGYVNAR 544

Query: 551  SSQVPRSASSQVVGWSSFMKGSSLTPSMVNALEATPASSLAEIEKIYEIAINGSGDEKIS 610
            ++ +P S  SQV GWSS MKGS LTP+++NAL ATPASSLAEIEKIYEIA  GS +EKIS
Sbjct: 545  AN-IPCSVPSQVPGWSSLMKGSPLTPTLINALIATPASSLAEIEKIYEIATKGSDEEKIS 604

Query: 611  AASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVGISSV 670
            AASILCGASLVRGWN+QEH +LFI+ LLSPP+PADY GSDS+LI+YAP LNVLLVGISSV
Sbjct: 605  AASILCGASLVRGWNIQEHNILFITSLLSPPVPADYSGSDSHLINYAPLLNVLLVGISSV 664

Query: 671  DCVQIFSLHGMVPLLAGQLMPICEAFGSSTPK-SWILTSGEELTCHAVFSLAFTLLLRLW 730
            DCVQIFSLHGMVPLLAG LMP+CE FGS+ P  SW L +GEELT HAVF+ AFTLLLRLW
Sbjct: 665  DCVQIFSLHGMVPLLAGTLMPLCEVFGSTAPTVSWTLPTGEELTSHAVFTNAFTLLLRLW 724

Query: 731  RFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFSLEP 790
            RF HPP+E V GDA PVGSQL+P+YLLLVRNS+L +FGKSPKDRLK++RLSK L FSL+ 
Sbjct: 725  RFDHPPLERVMGDATPVGSQLSPDYLLLVRNSKLLAFGKSPKDRLKIKRLSKNLNFSLDI 784

Query: 791  TFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTSTTS 850
             FMDSFPKLK WYRQHQECIAS   GLV G  VHQ VDALL MMF+KI+RGGQS TSTTS
Sbjct: 785  IFMDSFPKLKCWYRQHQECIASTLSGLVQGTTVHQIVDALLNMMFRKISRGGQSFTSTTS 844

Query: 851  ASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPR-----LKDLADFL 910
             SS+SS S  E+A  +LKVPAWDILE TP+VLDAALTACAHGRLSPR     LKDLADFL
Sbjct: 845  GSSSSSASGAEDAHTRLKVPAWDILEGTPYVLDAALTACAHGRLSPRELATGLKDLADFL 904

Query: 911  PASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPSLAL 970
            PA+  TIV YFSAEVTRGIWKPAFMNGTDWPSPAA LS+VEQ IKKILAATGVDVPSLA+
Sbjct: 905  PATLGTIVSYFSAEVTRGIWKPAFMNGTDWPSPAANLSMVEQHIKKILAATGVDVPSLAV 964

Query: 971  GGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASLWAQ 1030
            GGS P  LPLPLAAL+SLTITYKLDK SER L L+GPALNSL  GC WPC PIIASLWAQ
Sbjct: 965  GGSSPTTLPLPLAALVSLTITYKLDKGSERFLILIGPALNSLAEGCPWPCMPIIASLWAQ 1024

Query: 1031 KVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHGFGS 1090
            KVKRWNDFLVFSASRTVFHH+SDAVVQLL+SCFTSTLGL  S + S GGVGALLGHGFGS
Sbjct: 1025 KVKRWNDFLVFSASRTVFHHSSDAVVQLLRSCFTSTLGLSPSIIYSNGGVGALLGHGFGS 1084

Query: 1091 HVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLKKSK 1150
            H  GGMSPVAPGILYLRVHR VRD +F+ EEIVSLLM SV++IA +GL +EK+EKLKK+K
Sbjct: 1085 HFSGGMSPVAPGILYLRVHRSVRDIMFMTEEIVSLLMSSVREIASSGLSQEKSEKLKKTK 1144

Query: 1151 HGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDREGV 1210
             G+R  QVS  +AM RVKLAASLGASLVW+SGG  LVQSL KETLPSWF+S H+ +++G 
Sbjct: 1145 FGLRYGQVSLGAAMTRVKLAASLGASLVWLSGGLSLVQSLIKETLPSWFISAHAPEKDGG 1204

Query: 1211 EYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDAHLEFLASALDGKFSIG 1270
            E GG+V +L GYALA+F+VLCGTF+WG+DS S ASKRR K+L AHLEFLASALDGK S+G
Sbjct: 1205 EPGGVVAMLGGYALAYFAVLCGTFAWGVDSASPASKRRPKVLGAHLEFLASALDGKISLG 1264

Query: 1271 CDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLERLGKGLRQLNEEELALALLESGGLTA 1305
            CD ATWRAYV+GFVSL+V C  KW+L+VD+ VL+RL  GLRQ NEEELA+ALL  GG+ A
Sbjct: 1265 CDSATWRAYVTGFVSLMVACTQKWVLDVDVYVLKRLSNGLRQWNEEELAMALLGLGGVGA 1324

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MD33B_ARATH0.0e+0059.24Mediator of RNA polymerase II transcription subunit 33B OS=Arabidopsis thaliana ... [more]
MD33A_ARATH0.0e+0056.41Mediator of RNA polymerase II transcription subunit 33A OS=Arabidopsis thaliana ... [more]
Match NameE-valueIdentityDescription
A0A061DRW3_THECC0.0e+0071.17Reduced epidermal fluorescence 4, putative isoform 1 OS=Theobroma cacao GN=TCM_0... [more]
F6HQ63_VITVI0.0e+0070.88Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g00390 PE=4 SV=... [more]
A0A067JPN7_JATCU0.0e+0069.28Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21970 PE=4 SV=1[more]
M5VJA7_PRUPE0.0e+0068.91Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000301mg PE=4 SV=1[more]
A0A0D2TEF0_GOSRA0.0e+0069.16Uncharacterized protein OS=Gossypium raimondii GN=B456_007G134500 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G48110.10.0e+0059.24 reduced epidermal fluorescence 4[more]
AT3G23590.10.0e+0056.41 REF4-related 1[more]
Match NameE-valueIdentityDescription
gi|659096980|ref|XP_008449381.1|0.0e+0089.52PREDICTED: mediator of RNA polymerase II transcription subunit 33B-like isoform ... [more]
gi|778716139|ref|XP_011657513.1|0.0e+0089.15PREDICTED: mediator of RNA polymerase II transcription subunit 33B isoform X1 [C... [more]
gi|659096982|ref|XP_008449382.1|0.0e+0089.40PREDICTED: mediator of RNA polymerase II transcription subunit 33A-like isoform ... [more]
gi|778716142|ref|XP_011657514.1|0.0e+0089.07PREDICTED: mediator of RNA polymerase II transcription subunit 33A isoform X2 [C... [more]
gi|590717753|ref|XP_007050679.1|0.0e+0071.17Reduced epidermal fluorescence 4, putative isoform 1 [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:2000762 regulation of phenylpropanoid metabolic process
biological_process GO:0009698 phenylpropanoid metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0016592 mediator complex
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g00030.1Cp4.1LG15g00030.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33739FAMILY NOT NAMEDcoord: 4..1305
score:
NoneNo IPR availablePANTHERPTHR33739:SF4MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 33Bcoord: 4..1305
score:

The following gene(s) are paralogous to this gene:

None