CmoCh19G003080.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh19G003080.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: mediator of RNA polymerase II transcription subunit 15a-like isoform X1
Location: Cmo_Chr19: 2284147 .. 2299274 (+)
Sequence length: 4686
RNA-Seq Expression: CmoCh19G003080.1
Synteny: CmoCh19G003080.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
AGAGAAACATTGAACAAGGGTTCACGAAGAAGACAAGAACAGGGTATTTATTTCTTGCTTTTGTCTTCCATTCTCCACGAGAGGACGCTCCTCGATCCACATAGAACACACACAAAAGCTTCCTCTTGAATGGCGAGTGAATAGCAAGCCTTCGGTCGTTATCTTCTTTGAAATCGCCTCGATTTTCTCAATTTTCCTATACATATCTGAATTAAACTCCTGGTTTCCTGTAATCAGTGTCCAAACAATTTCCGAGTCGATTCTGCTCGATTTCTGCTCGATTTCTGGCATTTATTGAGCACACCCTCACTGTTTGTTAGTGGGATCTCGACAAATTCTTTTATCGTAGGTTGTTTGGAAACTTCTGAATGGATACTAATAATTGGAGACCTACTCAAGGTGGAGAACCCGGAATCGAAGCCGGGGATTGGAGGTCTCAATTGCAGCCCGATTCTCGGCAACGAATTGTCAACAAAATGTACGGTTTTTCTTAAATGTTTATTGCATCGTGCCCTACTCTTATTATTTCTTATTTGAGGAATGCGTTTGAGTTTTTTTTATAGCTTTTTTGTTTCGTGTTCATTGGTTAGGTTTAGTTACTGTGAGCAGCAAAACTTTGGAAATATAGAACATTTTCTCGGATTTTGTGTATAACTTATAGATGCGAACTGTTCCCAGAACGTAGTTCGTTAAAATTGCCCCCCATAGATTTTCCAATCTCTGAATTGAAGATATAGTTGGTTAGGTTTAGTGGAGGAGGTTCCTATTTGTTTCTTCAGGTAATATCCCCTTCTTTAAGCATAAGAAGCATAGGCTATTTGTCGCAGAATGGATCTTGTTGAATTGTTCTTGACCGCTCTAAATACATTCCCTTTGAAGTATCTAATATTTTGAGTCATGGACATGGATTGCTACGGTAGTTTTCCTTTCAAGAACAGCTTCCCATGCTTTTGTGTTTGATAAATTAGTTAGACATTATGTTTTCCACTTTTGTGTTAGAATGGAGACATTGAAGAGGCACCTTCCTGTTTCTGGTCATGAGGGATTGAGTGAGCTGAAAAAAATTGCTGTAAGGTTTGAGGAAAAGATTTATACTGCCGCTACCAGCCAGGTATTTCTTACTCACTTTGTTGTAGTTTAAATCAAAGATTGTATGAATAAACATATTCTCATTTACGTTTAGCCCATATGCTAATATCAATAAGATTCCAACTTGATTTGGTACCTAAGATTTAGCTATGTATTCTTGCCCTAAACACAGCAGTGTTTTAGCGATCTCTTGAGTACTCTCACCGATTCTATTTTGAAACTCTGTTTCAGTCAGATTACCTAAGGAAAATATCTCTGAAGATGCTTACGATGGAAACAAAATCTCAGACTCCTATGGGAACTTCATTACCATCCAATCCTATGGTGCCTAGCAATAAGCCTCTGGATTCTGGTAAGTGATCATCAGACAATCTGCCACTATGTTGTATACGTGATGAAAATTTTAATAACATGTTGAAGCCAATTTGTGAATGAATTTCTTAAAGAGTCTCTGTTCTTGGATATTGTAACAATTTCTCAATTAACATATGGCGTTTTATTTGACTTGATGTTGTGTATTTTATCTTAAACAGCATCACAAAGCATGCAGCCCCAAGTTCTAAATCAAGGGCAATCAATTTCTGTTCCTCAGTCTTCTAATCAGCCTCAGTCACGCCAGCAACTACTCTCCCAGAATATCCAAAATAATATTGTTTCTCAGAGTTCATCCAGCTTACCTTCTGCAGTACCTCCTGTCTCTGGATTAGCTTCATCTTCAATGCCAAACATGGTTGGCCAGAATCCAAGCATGCAAAATGTATCTGGAATTCCACAGAATTCTGTTGGAAATGCTATGGGTCAAGGGGTTCCTTCCAATGTATTTACCAACTCTCAGAGGCCAGTACAAGGAAGACAAGTAGTTTCCCAACAGCAACAGCAGCAGGCACAAACCCAGCAACAGCAGTTTCTTT
TTCATCAACAACAGTTACAGCAGCAGATGATGAATAAGAAGCTTCAGCAGGGAAGTATACCACAACAACGTATGCAATCTCACATTCCACAACAACAACAGAATCTAATGCAACCAAATCAACTGCAATCATCTCAGCAATCTGCTATGCAACCTTCTATGATGCAACCTTCTCTGTCTAATCTTCAACAAAACCAACAATCTTCCATTCAACAGCCCACTCAATCCATGCTTCAGCAACCTCAACAGCCAGTTCTTAGGCAGCAGCCACAGTCCCAGCAACATGCTGTCATGCATTCACAGTCTACAATGTCACAGCAGACTAGCTTGCCTTCACAGCAGCAACAACAGCTAATTAGTCAGCAACCAAATTCTTCAAACATGCAGCAAAATCCATTGATCGGGCAGCAAAACAGTGTTGGGGACATGCAGCAACACCTGCCTCAGCAATCTAGGTCGCATGGGCAGCAGAGCAATCTATCAAATATGCAGTCTCCACCATCGCAGCAGCAGCAGTTAATGGCTCAGCAAAACAACCTTTCAAATTTGCAGCAGCAGCAGTTAGGACCTCAAAGTAATGTTTCTGGACTACAGCAACAACAAATGCATGGAACTCAGTCAGGTAACTCAAACATGCAATCAGATCAGCACCCAATGCATATGCTGCAACAGAACAAGGTTCATATGCAGCAGCAACCTCCACAAAATGCATCAAATTTATTGTCAGCACAAGGTCCGCAAGGGCAACTTCAGTCATCCCAACAGTTGATGTCACAGATTCCGTTGCAGTCCACCCAAGTCCAACAACAGGTACCTCTACATCAGCAGCAGCAACAGCAGCCAAATGCCATGTCACATGATCTCCAACAAAGGCTTCAAGTTGGAGGTCAAGCACCAAGCTCCCTGCTTCAATCACAAAATGTAATGGATCAGCAAAAGCAGTTGTATCATTCACAAAGAGCTCTTCCAGAGACATCATCGAGTATGTTTTCTTGCTAACTTTAGAACCTTGCATTTTGAATTTTTGATGCTAACATTATCATTCTTCGTCAATTTTTTTGTGATAAGTGTGGATTGCAGCATGCCTCTTTCTATTATATTCGTTTGGTCTACCTTCAGCATTTTTGTGCATGAAGCATGGACGTAAACATGTGACACAAATACGATAAAACATGATTTTCTTGAAAAATTAGGAGAGACACAACAGTGAGACAATCAATACAGTTCTAAAAAAATATTATGCAGCTCAGAATTTCATGTTTATTATGTTCTTTATTTCCATCTCTTTTCAGCTGAAAACTTATTTCTGTATCTAGGAATATTTTACTGAGTTTTGTTTTTCCTTCTATTACCATTCTTTTCATAATAAGAAATTGTAAATCAACCTTGAGGTTGGCGGGAATGGATTCAGAGGAGCTATTCAAGAAAACCTTCCAAATCTAAATGGGGGAGAAAACAGTGATTATTGTTATTGTTGTATCCATCAATATTTGTGCTCGCTTGTGCAAGCCTTAATTGCCTGACCCTGATTTTGTTTGCCAACGAAACTTATAAGATTTTAAACTACAAGGATGTGGCCACCGTGGTTGAAACTTATTCTCTCTTAGCCATGTATTGTTTTTTCACTGGCCCCAGTGTCCACTATCAAGGTGGTTTAGATAAAGAATAAATGTGATTATTAGACAATGATTTGTGAAAGGCACCGTAATGCGAGGGCATGTACTGGATGTAGTTCAGAAAGCATCTGTAGATTTCTTTTGTTGTGAAAAATTTCTTGGTTTCTGTCTAAAAAAGCTATCAGGAGCTAACTTCACACTTCTAAATAATTCCACTTTTCCTTTCAACCACCAACTTATAAGCACTTCACATAACTAGTCTTCTATATAATCGGGAAGGCACAAGGATAAGTTAAATAATCCAAATGTATAAGACTAACCCTCAGTAATAATCCTACAGTTGAGGAAGACGTGATCAATGGCTTTTCATTCCCCCCTCCCCCCT
CCCCCCGTCTGTTGGGCCTTCTACTGTGGCCCCAATTCTCTATTTTCTTTTACTGTTATCAACGAAGGCTGAGTTTCTTATTACTAGGGAGCTCTCTATCTCTATGCTGCAGTTCAGTTACATCCCCTCGGCTCTTATTTTCCATATCCACTGACCTTTATTTCATTTATGTAGCATCTTTAGATTCCACGGCTCAGACAGGACAAGCCAATGGTGGAGATTGGCAAGAGGAGATTTATCAGAAGGTGCTTAGCTAACACTTCATCTCTCTTCAATGAGTTTTTCTTTACTCTTTAGAGATTCAAAAGGAAGTTCATTCAATTTGCAATATTAGTAGAGGAGGTACTATTTGCAATATTAGTAGAGGAGGTACTATTTGCAATCTTCAATAATTCAGTTTTAAGTTGTATAAATTATAACTTTTTATGATAAGATTTAACAAATCATCAAGTTAACTTCGTATATTTTTAACATCTAAATTTTTTAGAAACAGGTATTAACTTATGTTCATTCATTTTGTAAAAAGTTAATTAATCTGCATATTGTTCTTTTTCAATTCATCCAAATATATATATTTTGTTGTGTTGTAGTCATGTATATTTAAATATCACATTTTTACAAAATATAATTAATGGGCTGTGAGGTGGGGCTATTCACTTCATCTCACTTGAGTCTCCCCCTTGGTCATAACCTTAGAAGTGCTATTTTATGGACCCAGTGGTGGAAGAGATTCAGACGATATTCCTCTTAGGAAAAGATTTTCTCTAAGCGAGGGAGACTCACCTTAATCTAGTCTGTCTTAAGTGGAATCCCCACTTGCATTTTATCCTTATTTAGGGTCCTTATCAGTACTCTGGAGAGGCCATATGTAACTTTCTCTGGGAAGGTGTGGAGGTAAGGAAGAGTCCTTGCATGGTTGGGGGATTGGGTATTGGGAATTTGTGGGATGTAGCAAGACCTTGCTAGCTAAGTGGTTGTCGCACTTTTCTGTGAGACTGACAGTTTGTGGTATAAGGTCATTTCGAGCAAGTACAGCCCCCATCCTTTTACTTTGCTCACGAAAGGGGCAATAAGTACAACCAAAAATCCTTGGAAAGCCATTTTAGACGGTCTCCCTTTCTTTTCTCAATTCATTAATTCTATTATGGGTGATGGGTCTAGTACTTACTTTTAGGAACACAAGTGGTTGGGGATAACCCTCTCTGAATGTGTTCCCTCGTATTTATCATTTGTTGAATTCGAAGTTCTCTTCTGTGGTTTATATCCTTCCTTCTTCTAGGAACTCTTCATCGATCTCCCTTGGTTTTCGTGACTCTCTCTCTAATGGTGTTATAGCCCCTTTTGGCTTTAATTAGTGATATCACTTTATTGGAGGAGAAGTGACTTTTGGCTTTGGAACCCTAGTCCTTCAAACAAGTTTTCCCGTCAATCGTTATTTCTTTGTCGAGTGGGTTTGTATGGTCTAAGGACTCTCCCATCTTCTCAATTCTTTGGAAGGGGAAAATTTCGTAGAAGGTTAAATATTTTGTATGGCAGGTTCTCCTTAGCAAGGTGAATACTCTTTATCATGTTAAGAGGTTCTTGCCCAAGTTCATCGGGTAGGGAAGCTGTCGAGGACCTCGATCACCTTCTACAGACATGTTAGTTTGCTAGATCTCTTTGGTGCAGATTCTTCAAGGCGTTGTCAGCCACGGAGCTTCAGTTCCATGTTTGAGGAGTTTCTTACCCATCAACCTTTGAGGAAAAGGGTAGCAGGCTTTGGTCTCTTTCTAGATTTCATGCGTCTCTTTGAGTTTCGGTCTATAAAGAATTTTGTAATTATCCCTAAGGTCTTATTTTGCTTGACTGAAGCCCTCATGAAGTTTATTTTTTTTATTATTATTATTATTTTAATATTTTTGGTTTATTCTTTTCTTTTTCTCGATGAAAGCTTGACTTTTAATTTTTTTAAAAAAGGAGAATAGTAAGATATGATAAGATATAGGCATTCGGGATATTCA
TTGGCATGGTGAAAAGTTACCAACAAGTTGTAGTTCTTCCGTGAGGAGTTTATAAAAGATACTTCAATTTGAGTCAATTATAAAGGGAGTAAAAAATAGTCTTTGAGTGAATACCAAGAACATGATTTTTGTCAATATCTAAGTGTTTTGGCCTTGCTGTGCATCTCAACTAGTTTAGTCCTAGGACATCAACATTATCTCTGCTAGTCTAGTCCTAGAATGTGCATTTGATTCTACAACATTTGGTTTATAGAAAATTGTTAGAATATCAATGTCCTAAGTTTCTTTACTTTTCCACATGCTCCAATGCTATTGGCCACCAAAGTTGGTTAGTAGAATTGTGGGCTTGTTGCTTCTTTCAAGCCTCAAAGTTTGAAGTAGAGCTCTGATTGCATTCATTCACCACACTTTTCCTTTTCTTCTTTGCTAATGATCACACAGCAATACAACGTATGGATTAGGTTTAGCTTACTCAAGAAACAAAAAATCTGAATGAGAAAATGCAAACATTAGGAAATGGACAGGGCAATAAAAGATGGGTCGGATATTCCTGATCCTTCCTTCAAAGTATGCACCAACTAGATATTAAAGTCATGTTTGGCTATCTTTTTTGAAACCTTTCAGGTGCATTCACCCCTTCATAGAAAAAATTGACTTTTTCGATAACATTCCTTTACATGGTGTAAAAGAAACTTCCTTGAGCCTCGAGAACAATCAAAGCAAAATCAACTAAATAAACTTGGCAAATGGAGATCCCAAGGTCATCTGTATTCCTCTCTTCTAAAGCATAAAAGCTTAAAAGCGAAGAGTTCTTGTTCCATCTCTTCATTCTCTGGGCACTTCTCTTTTGCTTTGAAGGTTTGGAATTTTGATTACCGAAGGTTTGGTTTAGTCTGGTGTGTGTCAATTTTCGAAGATTGTAGACAAGTGGCTCTTTGAAGTTATTTCTGGCTGTTGATTTAAAGTCAAGGCTAGAGTGTTCTATGCAGCTATAACTCATTTTTGCTTTTTTAGGAGAGAGAGATCTTTACCAGCAATAAAATTCCTTTGAATTCCTTTGAATTGGGTTGCCTTATGCAAATCCTTATGTAGAACTCTACTTCTAAAAACCTAGACTGGAGACATTATGTTTGATCTCCCACCTTTGGCACGAGCAAGTCTCATCTCATGCCTTTAGGCTATTTCTGTTGGTTGGTACTAACAAGGCCTTTCCAATAAAAATGAGAGAAAAGGGGGTTTAATCTTTAAAATGTTAATGAACTTTGAGAAGGAACAAAAGGATGGATTGGGATTTCCCCCTTGCTGTTCTGGACAGTAAAGGCGTAGGTTCTAAATTGAGGTATTGTGATAGTGGCCAGTTGCTTTAGTTGCCTCCTTGAGAAAGGGTTCAGTAACATTCATATCGTTGATGACGATACTCTTGTTTTCTCAATGTTTGAGTAGGGAAGATCCACAATCTCTTCAATTCATTTATATCTTTGAAAAGATATTCGGGCCTAAGTGAATGGGACTAGATCCATGTTTGTGGGTATTAATTGATTGCATTGGTGAAGATTGTATTGATTAAAAAGTCAGCAGTGTGGCTGAGGACTTTGGTTTTAAAGTAGTTGTTCGGGCTTTGGCCGTTGATTATTTAGGCATCCAGCTGCTTTGAAGTCCAAGTGTGATGTTTTAGCTGCATCTGGAGAGAATTGCATCTAAAGCCTTTAGAAGCATTTCTCTAAGGAGGTGGGATTCTACTGCTTTCTTTGTTACTCCCTAACTTTCCTAATTACTTATTTATGTATTTTTGAGGCGCCCAAGTGTTAGAGGATTTAGTTTCTTAGGTATCTTTTGTCAAAAGGTCTGATATAGGAGGGTGGCTCCTGTTTGATTAATTGGAACATAGAGTCTCTACATAAGGCACCTTAACGTAGGATGCAACTTTCTTTTTCTTTTATTTTTTGTCAAGGAGGATTCTTTGTTCTACTTCTTTAGCAAATGTCTTTTTCTCTTTGTAG
AATTTAGAATCCGTTTAAAATTTTTGTTTAACTGCTTTCTCCTCCTCCTTTTTGTAGATTAAAGCTATGAAGGAGTTGTACTTTTTTGAACTGAAAGAAATGTACCAGAAAATTCTTCCAAAAGTGAATCAGGTATGTGATTCTGCATCTATATCCTAGGGCATTTGACTTTTTGATATTATTAATTGCGGAAGAAGCTGAATCCTCATTGAGTGAACCCATTGATCTATCACCTTTTGTTCATATCTTCGAGCATTGCAATTTTTTTTACCACTTCAGCTGTTTAATGCCCTTTCCTTCACCCGCATTGTAGTTTTTCTCATTCCCCTTATTCCCCTTATTGTTATGCACTGATGTTGTCTAATTTGATATTTTATTGTGCAGTTGGAAAGTCTTCCACAGCAACCGAAGTCAGAGCAGCTAAATAAGCTAAAGACATTTAAGTTAATCTTGGAACGCCTTATAGCATTCTTACAGATTTCAAAAAGTAATATCGTAATTGGATTGAAAGATAAGATAGGCCACTATGAGAAGCAGATTGTTAGTTTTTTAAATTCTAATAGGCCAAGGAATCCAGTATCTACACTGCAGCCAGGACAGCTTCCTGCCTCTCACATGCAATCTATACAGCAGGCACAATCACAGATGACTCCATTACAGTCTCCTGACAATCAAATCAATCCCCAACTACATTCGGCGAACATGCAAGGTTCTGTGGCTCCGGTGCAGCAGAACAATATGAACAATATGAACAATATGCAGCATAATTCTCTTCCAACTTTTTCAGGATCAGCAGCACAACAAAACATGACGATCCCAATGCAGCCTGGTTCGAGTTTGGAATCAGGACAAGGAAATTCACTGAGCTCGTTTCAGCAGGTTGCTTCTGGGTCTTTGCAACAGAATCCTTCCAACAGTTCCCAAAGGGCAAACAATAGTTCTTTGCCATCACAAAATGGGGTGAACACTCTGCAGCCAAACATCGGTTCTCTTCAAACAAATCATAACATGCTTCAACATCAACATCTAAAACAGGATCCGCAACAACAGCTGAAACAACAAATGCAGCAGAGACAGATGCAGCAGCTAAAGCAGCAGCAGATGCTGCAGCACCAGCAACAGCAACAACAACCACAATTACATCAGCAACAGTCGCAGCTACACCAGCAAGGAAAGCCGCAGTTACCTGCACAAATGCAGGCACACCAATTGTCACACCTTAATCAAATCGAGATGAGACAGGGGCTTGCTACAAAGCCAGGGATGTTTCAACATCTCCCAGCGGCTCACCGCTCAGGTTATACCCATCAGCAGCAGATGAAACCAGGAACTTCATTACCAATTTCACCCCAAATCTTTCAGACTGCATCCCCTCAAGTTGCTCAAAATTCTTCTCCACAGGTCGACCAACAAAATCTACTTTCATCCATTACCAAAGTTCCACCTTTGCAATCTGCAAGCTCACCTTTAGTTGTACTATCCCCTTCAACGCCTGTGGCTCCATCTCCAATGCCGGGTGATTCAGAAAAACCCACTTCTGGTGTCTCAACACTTACAAATGCTGGGAATACTGGACAACAAACTAGTGTGTCTGGGACACAAGTCCAGTCTCTTGCCATTGGTACCCCCGGGATATCTGCCTCCCCATTACTTGCTGAGTTTAGTGGTACAGATGGCGCTTATGCCAATGCATTACCAACTGTTTCCGGAAAATCAAGTGCTACAGAGCAGCCTCTTGAGCGCCTAATTAAAGCTGTTAGTCAAAACCTTGAGTGCTTTTAGTTGTTTCTGGCTTAAGTTGTTACCTATTCTTCTTGATTGTTTATAACATTGGTAACCTTATTATGGTTTTCAGGTTAAATCAATGTCGCCTAAAGCTTTGAGTGCCTCTGTCAATGGCATTGGATCAGTTGTTAGTATGATTGATAGGGTAGCAGGCTCGGCCCCTGGCAATGGGTCAAGAGCTGCAGTCGGGGAGGATCTGGTTGCCATGACAAAAT
GTCGGCTGCAAGCAAGAAATTTTGTTTCACATGATGGATCAAATGGAACAAAAAAGATGAGACGCTACACAAGTGCAATGCCCTTAAATGTTGTATCATCAGCTGGAAGCATAAATGACGTTTTTAAACCGTTTACTGGTGCAGAGACATCCGATCTCGAGTCAACTGCAACATCTAGGGCCAAAAGGTCCAGGGTTGAGGTAATAACAACTTTTACATATTGTATTTTCTATGTGGGTAAAATAGTTTTCTTTTTAAGTATAGATAGATATTGTTAAGAGATGCCACAAGTTGAAGAACAGATGACCTTCATCTTTAGGTATCTAACTACGTTTAGAGATCACACATATGCTTCACCGTTTTACATATAATTCCAATTTCATTAGGAATTTATATTGTTTTATTGGATGTCGTTCAATATTCTAACTAACTTCATAACCTACTTGGACATTTTGAACTTGATATACTTTTTTTTCGTTCTTCGATTTTTGCTTTTTTTGCTTCTGGTGGGGGGTTAGGGTGGAGGGTGGTATTAGTCGTCACTTACATTGTTGTTAAATATTAGCATATTTAAAGAAGTGGTTTTTATTTTTTTAAAAAAGGCATCAGCATTATTTTAGTCCCCCTGAATTCAAATTTCAACTTTTTCCCTTTTTATTTATTTATTTGATGGTCCAATGTCTTGGAGATGGAGAAAAATTAATTTCAAGCTATAGTATTCGTGTTCTTAGATTGTCCGTGTTATTGGCTTGATACATTATATTTGTACGATAGAGACTTGAATATGAAAGCAACTAAACAAATTTATAATGTATTATAGATGATAGCAGATAGAATAGGCAGTGTGATTAAAATGTCATATTCAAGTCTCTATCGTACATTATATTTGTACGATAGAGACTTGAATATGAAAGCAACTAAACAAATTTATAATGTATTATAGATGATAGCAGATAGAATAGGCAGTGTGATTAAAATGTCCACCTAGTTTAAAATTTAAAGTTTCCTTTTGGCTTGCAATTTTCTTATCAATCTTTATGTTAGAATATCAACTCAATATTGTCTGTTTCGAACTTGTGTAGGCCCACTTTAGTTTTATCAATCACTTGCTTTGCTATTTGAATAATATATATATATATATATATATATATATATATATTTACCTGTTGCAGGCCAATCATGTTCTACTGGAAGAAATAAGGGAAATAAACCAACGTCTTATCGACACGGTGGTTGTAATCAGTGATGAAGTAGTTGATCCAAGTGCTTTAGCAGCTGCTGCTGATGGGAGCGAAGGAACAATTGTCAAGTGTTCTTTTAGTGCTGTGGCTCTCAGTCCCAGCTTAAAATCCCAGTACATGTCCGCACAAATGGTGGGCAGATTTCTTAATTCTAGAATTTTTTTGATTGAATTATGATCCATCCAAAGAGCAAAAAGAATCGACTAATTATTCTATGATTCATAAATTTTGACTCGATTTCTGTATGGGATAAAGAGTTGTTGGTGAAAAATCGAAAACTGTAAAAGTCGAATTTTTATAAAAAAAGGAATCATAGTTTAAAAAACTAAATAGAATCATGAGCTAAAAGTATCCAATTAAACATAGTTAAAAAAAATTCTTTTTATAGTTAGAATTGGTTGGACGTGCCTATCAAAAAAAAAAATATCAGGTTCTTCATTGTCTGCGTGATTATCTGTTTTCTTACCCGTGTATTATTATTTTGCAGTCTCCAATTCAGCCTCTACGGTTACTTGTTCCTACAAATTATCCAAATTGTTCTCCAATACTCCTAGACAAGTTTCCTGTTGAAGTCAGGTTAGTTTACATGGGTTCTGTTTTTTTTCTTCTTCTTTTTTTTTTTTTATCAGCTTTTTTTCTTTTTGTCTTCCATGAACCCTTCTCTACGTTCAAAGCATATTAAATATATATGTGCAGCAGGCTATTTGCATCAAATAAATAAATAAATAAAACGATATCATAAATTTGTACAAAATTTAA
AATGTATGTGCCATTTTCAAATCTAAACTAACTAGAAAGGATGAACAAATTTCTAGTTGAACAGATTAATGGAATAAATTGAAAATGACCCTTGATTTTAGGACCGAAGTTCTAGTTAATCAGATTGATACTTCTCGAGTTATTCTTAGGTATTTGTGCATCCTCACACCTTAGTTTTGGTGGAAGCATAGGCTAGGTAGGCTATTAGATTGCAGATTGATTCTTTGAAAAGGTCCACATGACAATTTGATCTAGCGACGACCTTCCTCATGAAGTGCTTTATCAATCACTGCCTTTTGTGTATAGAAACTAAATGTTTATTGCTTCTTGTATGCATACATGTATGCATATCTGATGGTTTTTTTTTTTAAATGGGAGACAATTTCATTGGCACTTTTTCTATGTCAAAGATTTTTCTGTTTCTCTACATCCTAAGATATATATTGTTTAAATAAGAAACGGGAAAACTTCATTCATCAAAAAATGGAATGATTGGTGTGTAAATCTACAATTGGTGAATTATGTTTGTTGTCTCCATTGGTGGTGGCCCTGGGATATATTTTCCTTTCGTACAAGCAATACATACTCATTGCGGTTACTAGGTCCTTTGGCTGGTAGAGCTCCGCCTCAACTGATACAGACTCTTGCAATCTGCTGAGATATAACTGAATCTTTTGATTTTGAGTTAACATTCTAGCTCTTAAAACCAACACTTCCAATTTGTTTTAGCTTGGTGAGCTCCCCTAATTTATTGCTTTAGATGGAAAGTTCAAACCGTAAGTCGCATTGGTGGACAAACGCTATCTAGGTGGGATCCGGAATATCTTGTACCATTTGTAAATACCATAGCTGAGCAATTCCCTCTAAGTGATAAGAGGCTAAGCTCACCTTTCTACCTTAGGTGTCTGATTGATGTCGGAAAAAAGTGTTTAGGCTAATCCAACCTAAAGGGTCTTCTAGCCCATTGAAGCAAGGAAAGTCAAAATTTTAGTACAATCAAGTTTCCTCCAAAACTCCAAAATTTTTAGAAATGGTTGTCCTAGAAATCCCTTCTCCTTGTTTTTGCTTGACTTTGAGCACTTGAAGAAGTCAGACCTTCTTTGTGTATTCCAACTCGAGAGCATGGCATCCAATTTGACATGAAGTTGCTCAGCCTTCTCATTATAGGCCAATTGCCTACTCTTCGGACAAGAGTAAATTGAAGAGTTTGGTAAGTTGTTCTTGCACAAATTCACCCCTAACTGATGGCTTTAATACCAAATTGTTATGGGCCCAAATCTGCTATCGAGTTATGGGTCTCTGGTAAGTGGAGGTAGAAGGTGTTTGTATGAGCGAAGAGAGAGTAGAATTCTTCTTTCGGCCTATTTTGAATAATTGAATCCCAAATAAAGGGAGCCTTAATACCTCTGAGTTAAAGTACTCAAAGACTAATTGATAATTTTAGGTACTGTATTCATTGGTGTTTTTGACTATTTGTACAAGTTGAATAACAACCATTGTTACGTCAAGAGACATGACATCCAATTTTGCATGAAGCTGCTCAACCTTCTCATTATAGACCAATCGCCTGCTGTTCAGATAAGAGTAAATTGGTAAGCTGTTCTTGCACAAATTCATTCATAACTATTGGCTATGTTACGAAATTGTGACTGGCCCAAATTTGCAATCGAGTTATGGGTCTCTAGTAAGTGGGAGATAGAAGGTGTTTGTGTGGGTGAAGAGAGAAGTAGTACACTTGTCAGGGCGGATCATTCTCCTAAAATGAAAGTGGAGAACTCCTTAAAGATACTGAAAAAAAAAAATGCATAAAGAAAAATTCAACCAACAACGAAAAATAATTTAGAATTGGTTAGAAATAGAAATGAATGTGAAGACTCCTTAAGATACAATAAAAAAATGCTAAAGAAAAATTCAACCAGCAACTAAAAATAATTTGAATTGGTTAACCTTCAATTATTTTGTTTTCTTTGTAATCGTCAGTGGGTTCGGTACGTGACTGC
TTCTTTCTGCCATGAGTATAACGTGCGTACATTGTAGTAGCATTGTATGAAAAGCTCCTTCTCATCTTCGTTTAGGAAGAAAGAAAATGTTTTGTGACAAGCGGGTATATGTATACTTTCTTGGGCTTTGGAGTGAAAGTGACAATAGAACTTTTAGAGGATGGAACGTCATTCTAACAACGTTTAGCTTTGTTGTTAGGTTCCATGTTTCCTATGGGCTTCAGTAACAAAGCTTTTTTATAATTATTTGCCTATGCCCTTATTTTTTTTCTCAATGAGAGTTTGATAATTAAAAATAGACTAAAAAATAGGCCAAACTGACTTCTCTTGCACCCTTTTTCATTTTTTTTTTATCTTATTTTTTCTCTTCCACGAAACATTGGGCGATTTGATCATCTTCTGCCTGTCTTCTCTTTGGTGAATGCCTTGTTTTTCTCTGTACTACCTGTGTCTATGTAAGGAAATAGAGAATAAGGTAGAATATTAGGTTGGCATATTTGTCTTTATTTAAGAAGTTTGTGGGAGGGAGAGTCCATGCTTCTTGCTTAAAATCCTGAAATTTTCTGTTAAATGTCATTACTATATTTGTTAATTTCGTTTCGGTTTTGGTTTCTTTGGTCCCTAGCAGTCTATTATCCTTGTTATCATCATTTTCTTTTTGCAGGAAGGAATACGAAGACCTTTCAATAAAGGCCAAGTCAAGATTTAGCATATCCTTAAGGAACCTTTCACAACCTATGTCACTTGGGGACATAGCGAGAACTTGGGATGTCTGTGCACGTACTGTTGTTTCCGAGTATGCTCAGCAGAGCGGCGGTGGCAGCTTCTGTTCAAGGTATGGAGCTTGGGAGAACTGTTTGAGCGCTGCATGATTCAAAGAATCGATCGAAAAGGATTTGGCCGACTGACCAGCTCGGTATGTTTAGGGAAGACGGGCTGTTTATTCAAGTAACGTTTGCGTATCATAAGGTATGCCTCACTAAAGCTGAAAGTACACGCTGCATGTTGTTTAATCAGGGCTAGGCTAATTCAGTATTTATATTACCACGGTGGATCTGCTTTGCTTGTGAACACTGATTCCTTTTTGGTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAT

mRNA sequence

AGAGAAACATTGAACAAGGGTTCACGAAGAAGACAAGAACAGGGTATTTATTTCTTGCTTTTGTCTTCCATTCTCCACGAGAGGACGCTCCTCGATCCACATAGAACACACACAAAAGCTTCCTCTTGAATGGCGAGTGAATAGCAAGCCTTCGGTCGTTATCTTCTTTGAAATCGCCTCGATTTTCTCAATTTTCCTATACATATCTGAATTAAACTCCTGGTTTCCTGTAATCAGTGTCCAAACAATTTCCGAGTCGATTCTGCTCGATTTCTGCTCGATTTCTGGCATTTATTGAGCACACCCTCACTGTTTGTTAGTGGGATCTCGACAAATTCTTTTATCGTAGGTTGTTTGGAAACTTCTGAATGGATACTAATAATTGGAGACCTACTCAAGGTGGAGAACCCGGAATCGAAGCCGGGGATTGGAGGTCTCAATTGCAGCCCGATTCTCGGCAACGAATTGTCAACAAAATAATGGAGACATTGAAGAGGCACCTTCCTGTTTCTGGTCATGAGGGATTGAGTGAGCTGAAAAAAATTGCTGTAAGGTTTGAGGAAAAGATTTATACTGCCGCTACCAGCCAGTCAGATTACCTAAGGAAAATATCTCTGAAGATGCTTACGATGGAAACAAAATCTCAGACTCCTATGGGAACTTCATTACCATCCAATCCTATGGTGCCTAGCAATAAGCCTCTGGATTCTGCATCACAAAGCATGCAGCCCCAAGTTCTAAATCAAGGGCAATCAATTTCTGTTCCTCAGTCTTCTAATCAGCCTCAGTCACGCCAGCAACTACTCTCCCAGAATATCCAAAATAATATTGTTTCTCAGAGTTCATCCAGCTTACCTTCTGCAGTACCTCCTGTCTCTGGATTAGCTTCATCTTCAATGCCAAACATGGTTGGCCAGAATCCAAGCATGCAAAATGTATCTGGAATTCCACAGAATTCTGTTGGAAATGCTATGGGTCAAGGGGTTCCTTCCAATGTATTTACCAACTCTCAGAGGCCAGTACAAGGAAGACAAGTAGTTTCCCAACAGCAACAGCAGCAGGCACAAACCCAGCAACAGCAGTTTCTTTTTCATCAACAACAGTTACAGCAGCAGATGATGAATAAGAAGCTTCAGCAGGGAAGTATACCACAACAACGTATGCAATCTCACATTCCACAACAACAACAGAATCTAATGCAACCAAATCAACTGCAATCATCTCAGCAATCTGCTATGCAACCTTCTATGATGCAACCTTCTCTGTCTAATCTTCAACAAAACCAACAATCTTCCATTCAACAGCCCACTCAATCCATGCTTCAGCAACCTCAACAGCCAGTTCTTAGGCAGCAGCCACAGTCCCAGCAACATGCTGTCATGCATTCACAGTCTACAATGTCACAGCAGACTAGCTTGCCTTCACAGCAGCAACAACAGCTAATTAGTCAGCAACCAAATTCTTCAAACATGCAGCAAAATCCATTGATCGGGCAGCAAAACAGTGTTGGGGACATGCAGCAACACCTGCCTCAGCAATCTAGGTCGCATGGGCAGCAGAGCAATCTATCAAATATGCAGTCTCCACCATCGCAGCAGCAGCAGTTAATGGCTCAGCAAAACAACCTTTCAAATTTGCAGCAGCAGCAGTTAGGACCTCAAAGTAATGTTTCTGGACTACAGCAACAACAAATGCATGGAACTCAGTCAGGTAACTCAAACATGCAATCAGATCAGCACCCAATGCATATGCTGCAACAGAACAAGGTTCATATGCAGCAGCAACCTCCACAAAATGCATCAAATTTATTGTCAGCACAAGGTCCGCAAGGGCAACTTCAGTCATCCCAACAGTTGATGTCACAGATTCCGTTGCAGTCCACCCAAGTCCAACAACAGGTACCTCTACATCAGCAGCAGCAACAGCAGCCAAATGCCATGTCACATGATCTCCAACAAAGGCTTCAAGTTGGAGGTCAAGCACCAAGCTCCCTGCTTCAA
TCACAAAATGTAATGGATCAGCAAAAGCAGTTGTATCATTCACAAAGAGCTCTTCCAGAGACATCATCGACATCTTTAGATTCCACGGCTCAGACAGGACAAGCCAATGGTGGAGATTGGCAAGAGGAGATTTATCAGAAGATTAAAGCTATGAAGGAGTTGTACTTTTTTGAACTGAAAGAAATGTACCAGAAAATTCTTCCAAAAGTGAATCAGTTGGAAAGTCTTCCACAGCAACCGAAGTCAGAGCAGCTAAATAAGCTAAAGACATTTAAGTTAATCTTGGAACGCCTTATAGCATTCTTACAGATTTCAAAAAGTAATATCGTAATTGGATTGAAAGATAAGATAGGCCACTATGAGAAGCAGATTGTTAGTTTTTTAAATTCTAATAGGCCAAGGAATCCAGTATCTACACTGCAGCCAGGACAGCTTCCTGCCTCTCACATGCAATCTATACAGCAGGCACAATCACAGATGACTCCATTACAGTCTCCTGACAATCAAATCAATCCCCAACTACATTCGGCGAACATGCAAGGTTCTGTGGCTCCGGTGCAGCAGAACAATATGAACAATATGAACAATATGCAGCATAATTCTCTTCCAACTTTTTCAGGATCAGCAGCACAACAAAACATGACGATCCCAATGCAGCCTGGTTCGAGTTTGGAATCAGGACAAGGAAATTCACTGAGCTCGTTTCAGCAGGTTGCTTCTGGGTCTTTGCAACAGAATCCTTCCAACAGTTCCCAAAGGGCAAACAATAGTTCTTTGCCATCACAAAATGGGGTGAACACTCTGCAGCCAAACATCGGTTCTCTTCAAACAAATCATAACATGCTTCAACATCAACATCTAAAACAGGATCCGCAACAACAGCTGAAACAACAAATGCAGCAGAGACAGATGCAGCAGCTAAAGCAGCAGCAGATGCTGCAGCACCAGCAACAGCAACAACAACCACAATTACATCAGCAACAGTCGCAGCTACACCAGCAAGGAAAGCCGCAGTTACCTGCACAAATGCAGGCACACCAATTGTCACACCTTAATCAAATCGAGATGAGACAGGGGCTTGCTACAAAGCCAGGGATGTTTCAACATCTCCCAGCGGCTCACCGCTCAGGTTATACCCATCAGCAGCAGATGAAACCAGGAACTTCATTACCAATTTCACCCCAAATCTTTCAGACTGCATCCCCTCAAGTTGCTCAAAATTCTTCTCCACAGGTCGACCAACAAAATCTACTTTCATCCATTACCAAAGTTCCACCTTTGCAATCTGCAAGCTCACCTTTAGTTGTACTATCCCCTTCAACGCCTGTGGCTCCATCTCCAATGCCGGGTGATTCAGAAAAACCCACTTCTGGTGTCTCAACACTTACAAATGCTGGGAATACTGGACAACAAACTAGTGTGTCTGGGACACAAGTCCAGTCTCTTGCCATTGGTACCCCCGGGATATCTGCCTCCCCATTACTTGCTGAGTTTAGTGGTACAGATGGCGCTTATGCCAATGCATTACCAACTGTTTCCGGAAAATCAAGTGCTACAGAGCAGCCTCTTGAGCGCCTAATTAAAGCTGTTAAATCAATGTCGCCTAAAGCTTTGAGTGCCTCTGTCAATGGCATTGGATCAGTTGTTAGTATGATTGATAGGGTAGCAGGCTCGGCCCCTGGCAATGGGTCAAGAGCTGCAGTCGGGGAGGATCTGGTTGCCATGACAAAATGTCGGCTGCAAGCAAGAAATTTTGTTTCACATGATGGATCAAATGGAACAAAAAAGATGAGACGCTACACAAGTGCAATGCCCTTAAATGTTGTATCATCAGCTGGAAGCATAAATGACGTTTTTAAACCGTTTACTGGTGCAGAGACATCCGATCTCGAGTCAACTGCAACATCTAGGGCCAAAAGGTCCAGGGTTGAGGCCAATCATGTTCTACTGGAAGAAATAAGGGAAATAAACCAACGTCTTATCGACACGGTGGTTGTAAT
CAGTGATGAAGTAGTTGATCCAAGTGCTTTAGCAGCTGCTGCTGATGGGAGCGAAGGAACAATTGTCAAGTGTTCTTTTAGTGCTGTGGCTCTCAGTCCCAGCTTAAAATCCCAGTACATGTCCGCACAAATGTCTCCAATTCAGCCTCTACGGTTACTTGTTCCTACAAATTATCCAAATTGTTCTCCAATACTCCTAGACAAGTTTCCTGTTGAAGTCAGGAAGGAATACGAAGACCTTTCAATAAAGGCCAAGTCAAGATTTAGCATATCCTTAAGGAACCTTTCACAACCTATGTCACTTGGGGACATAGCGAGAACTTGGGATGTCTGTGCACGTACTGTTGTTTCCGAGTATGCTCAGCAGAGCGGCGGTGGCAGCTTCTGTTCAAGGTATGGAGCTTGGGAGAACTGTTTGAGCGCTGCATGATTCAAAGAATCGATCGAAAAGGATTTGGCCGACTGACCAGCTCGGTATGTTTAGGGAAGACGGGCTGTTTATTCAAGTAACGTTTGCGTATCATAAGGTATGCCTCACTAAAGCTGAAAGTACACGCTGCATGTTGTTTAATCAGGGCTAGGCTAATTCAGTATTTATATTACCACGGTGGATCTGCTTTGCTTGTGAACACTGATTCCTTTTTGGTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAT

Coding sequence (CDS)

ATGGATACTAATAATTGGAGACCTACTCAAGGTGGAGAACCCGGAATCGAAGCCGGGGATTGGAGGTCTCAATTGCAGCCCGATTCTCGGCAACGAATTGTCAACAAAATAATGGAGACATTGAAGAGGCACCTTCCTGTTTCTGGTCATGAGGGATTGAGTGAGCTGAAAAAAATTGCTGTAAGGTTTGAGGAAAAGATTTATACTGCCGCTACCAGCCAGTCAGATTACCTAAGGAAAATATCTCTGAAGATGCTTACGATGGAAACAAAATCTCAGACTCCTATGGGAACTTCATTACCATCCAATCCTATGGTGCCTAGCAATAAGCCTCTGGATTCTGCATCACAAAGCATGCAGCCCCAAGTTCTAAATCAAGGGCAATCAATTTCTGTTCCTCAGTCTTCTAATCAGCCTCAGTCACGCCAGCAACTACTCTCCCAGAATATCCAAAATAATATTGTTTCTCAGAGTTCATCCAGCTTACCTTCTGCAGTACCTCCTGTCTCTGGATTAGCTTCATCTTCAATGCCAAACATGGTTGGCCAGAATCCAAGCATGCAAAATGTATCTGGAATTCCACAGAATTCTGTTGGAAATGCTATGGGTCAAGGGGTTCCTTCCAATGTATTTACCAACTCTCAGAGGCCAGTACAAGGAAGACAAGTAGTTTCCCAACAGCAACAGCAGCAGGCACAAACCCAGCAACAGCAGTTTCTTTTTCATCAACAACAGTTACAGCAGCAGATGATGAATAAGAAGCTTCAGCAGGGAAGTATACCACAACAACGTATGCAATCTCACATTCCACAACAACAACAGAATCTAATGCAACCAAATCAACTGCAATCATCTCAGCAATCTGCTATGCAACCTTCTATGATGCAACCTTCTCTGTCTAATCTTCAACAAAACCAACAATCTTCCATTCAACAGCCCACTCAATCCATGCTTCAGCAACCTCAACAGCCAGTTCTTAGGCAGCAGCCACAGTCCCAGCAACATGCTGTCATGCATTCACAGTCTACAATGTCACAGCAGACTAGCTTGCCTTCACAGCAGCAACAACAGCTAATTAGTCAGCAACCAAATTCTTCAAACATGCAGCAAAATCCATTGATCGGGCAGCAAAACAGTGTTGGGGACATGCAGCAACACCTGCCTCAGCAATCTAGGTCGCATGGGCAGCAGAGCAATCTATCAAATATGCAGTCTCCACCATCGCAGCAGCAGCAGTTAATGGCTCAGCAAAACAACCTTTCAAATTTGCAGCAGCAGCAGTTAGGACCTCAAAGTAATGTTTCTGGACTACAGCAACAACAAATGCATGGAACTCAGTCAGGTAACTCAAACATGCAATCAGATCAGCACCCAATGCATATGCTGCAACAGAACAAGGTTCATATGCAGCAGCAACCTCCACAAAATGCATCAAATTTATTGTCAGCACAAGGTCCGCAAGGGCAACTTCAGTCATCCCAACAGTTGATGTCACAGATTCCGTTGCAGTCCACCCAAGTCCAACAACAGGTACCTCTACATCAGCAGCAGCAACAGCAGCCAAATGCCATGTCACATGATCTCCAACAAAGGCTTCAAGTTGGAGGTCAAGCACCAAGCTCCCTGCTTCAATCACAAAATGTAATGGATCAGCAAAAGCAGTTGTATCATTCACAAAGAGCTCTTCCAGAGACATCATCGACATCTTTAGATTCCACGGCTCAGACAGGACAAGCCAATGGTGGAGATTGGCAAGAGGAGATTTATCAGAAGATTAAAGCTATGAAGGAGTTGTACTTTTTTGAACTGAAAGAAATGTACCAGAAAATTCTTCCAAAAGTGAATCAGTTGGAAAGTCTTCCACAGCAACCGAAGTCAGAGCAGCTAAATAAGCTAAAGACATTTAAGTTAATCTTGGAACGCCTTATAGCATTCTTACAGATTTCAAAAAGTAATATCGTAATTGGATTGAAAGATAAGATAGGCCACTATGAGAAGCA
GATTGTTAGTTTTTTAAATTCTAATAGGCCAAGGAATCCAGTATCTACACTGCAGCCAGGACAGCTTCCTGCCTCTCACATGCAATCTATACAGCAGGCACAATCACAGATGACTCCATTACAGTCTCCTGACAATCAAATCAATCCCCAACTACATTCGGCGAACATGCAAGGTTCTGTGGCTCCGGTGCAGCAGAACAATATGAACAATATGAACAATATGCAGCATAATTCTCTTCCAACTTTTTCAGGATCAGCAGCACAACAAAACATGACGATCCCAATGCAGCCTGGTTCGAGTTTGGAATCAGGACAAGGAAATTCACTGAGCTCGTTTCAGCAGGTTGCTTCTGGGTCTTTGCAACAGAATCCTTCCAACAGTTCCCAAAGGGCAAACAATAGTTCTTTGCCATCACAAAATGGGGTGAACACTCTGCAGCCAAACATCGGTTCTCTTCAAACAAATCATAACATGCTTCAACATCAACATCTAAAACAGGATCCGCAACAACAGCTGAAACAACAAATGCAGCAGAGACAGATGCAGCAGCTAAAGCAGCAGCAGATGCTGCAGCACCAGCAACAGCAACAACAACCACAATTACATCAGCAACAGTCGCAGCTACACCAGCAAGGAAAGCCGCAGTTACCTGCACAAATGCAGGCACACCAATTGTCACACCTTAATCAAATCGAGATGAGACAGGGGCTTGCTACAAAGCCAGGGATGTTTCAACATCTCCCAGCGGCTCACCGCTCAGGTTATACCCATCAGCAGCAGATGAAACCAGGAACTTCATTACCAATTTCACCCCAAATCTTTCAGACTGCATCCCCTCAAGTTGCTCAAAATTCTTCTCCACAGGTCGACCAACAAAATCTACTTTCATCCATTACCAAAGTTCCACCTTTGCAATCTGCAAGCTCACCTTTAGTTGTACTATCCCCTTCAACGCCTGTGGCTCCATCTCCAATGCCGGGTGATTCAGAAAAACCCACTTCTGGTGTCTCAACACTTACAAATGCTGGGAATACTGGACAACAAACTAGTGTGTCTGGGACACAAGTCCAGTCTCTTGCCATTGGTACCCCCGGGATATCTGCCTCCCCATTACTTGCTGAGTTTAGTGGTACAGATGGCGCTTATGCCAATGCATTACCAACTGTTTCCGGAAAATCAAGTGCTACAGAGCAGCCTCTTGAGCGCCTAATTAAAGCTGTTAAATCAATGTCGCCTAAAGCTTTGAGTGCCTCTGTCAATGGCATTGGATCAGTTGTTAGTATGATTGATAGGGTAGCAGGCTCGGCCCCTGGCAATGGGTCAAGAGCTGCAGTCGGGGAGGATCTGGTTGCCATGACAAAATGTCGGCTGCAAGCAAGAAATTTTGTTTCACATGATGGATCAAATGGAACAAAAAAGATGAGACGCTACACAAGTGCAATGCCCTTAAATGTTGTATCATCAGCTGGAAGCATAAATGACGTTTTTAAACCGTTTACTGGTGCAGAGACATCCGATCTCGAGTCAACTGCAACATCTAGGGCCAAAAGGTCCAGGGTTGAGGCCAATCATGTTCTACTGGAAGAAATAAGGGAAATAAACCAACGTCTTATCGACACGGTGGTTGTAATCAGTGATGAAGTAGTTGATCCAAGTGCTTTAGCAGCTGCTGCTGATGGGAGCGAAGGAACAATTGTCAAGTGTTCTTTTAGTGCTGTGGCTCTCAGTCCCAGCTTAAAATCCCAGTACATGTCCGCACAAATGTCTCCAATTCAGCCTCTACGGTTACTTGTTCCTACAAATTATCCAAATTGTTCTCCAATACTCCTAGACAAGTTTCCTGTTGAAGTCAGGAAGGAATACGAAGACCTTTCAATAAAGGCCAAGTCAAGATTTAGCATATCCTTAAGGAACCTTTCACAACCTATGTCACTTGGGGACATAGCGAGAACTTGGGATGTCTGTGCACGTACTGTTGTTTCCGAGTATGCTCAGCAGA
GCGGCGGTGGCAGCTTCTGTTCAAGGTATGGAGCTTGGGAGAACTGTTTGAGCGCTGCATGA

Protein sequence

MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIAVRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA
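
The protein sequence above can be checked against the coding sequence by conceptual translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the helper name `translate` and the variables `cds_start` and `cds_end` are illustrative and not part of the database record. Only the first and last 18 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases are reported as X.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 18 nucleotides of the CDS listed in this record.
cds_start = "ATGGATACTAATAATTGG"
cds_end = "TGTTTGAGCGCTGCATGA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues, stop codon excluded
```

The two fragments translate to the first six and last five residues of the listed protein (MDTNNW and CLSAA); passing the full CDS to `translate` should reproduce the complete sequence under the same assumptions.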
Homology
BLAST of CmoCh19G003080.1 vs. ExPASy Swiss-Prot
Match: F4I171 (Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana OX=3702 GN=MED15A PE=1 SV=1)

HSP 1 Score: 1068.5 bits (2762), Expect = 6.0e-311
Identity = 757/1399 (54.11%), Positives = 952/1399 (68.05%), Query Frame = 0

Query: 1    MDTNNWRPT-QGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKI 60
            MD NNWRP+   GEP ++ GDWR+QL PDSRQ+IVNKIMETLK+HLP SG EG++EL++I
Sbjct: 1    MDNNNWRPSLPNGEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRI 60

Query: 61   AVRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSM 120
            A RFEEKI++ A +Q+DYLRKIS+KMLTMETKSQ   G+S  + P   +   +DS     
Sbjct: 61   AARFEEKIFSGALNQTDYLRKISMKMLTMETKSQNAAGSS-AAIPAANNGTSIDSIP--- 120

Query: 121  QPQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVS--QSSSSLPSAVPPVSGLASSSM 180
                 NQGQ +    S+NQ Q+ Q LLSQ +QNN  S    S++LPS++PPVS + +++ 
Sbjct: 121  ----TNQGQLLPGSLSTNQSQAPQPLLSQTMQNNTASGMTGSTALPSSMPPVSSITNNNT 180

Query: 181  PNMVGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQ 240
             ++V QN +MQNV+G+ Q+S G     G+ SN+F+  QR + GR      QQ     QQQ
Sbjct: 181  TSVVNQNANMQNVAGMLQDSSGQ---HGLSSNMFSGPQRQMLGRPHAMSSQQ-----QQQ 240

Query: 241  QFLFHQQQLQQQMMNKKLQQGSIPQQR--MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMM 300
             +L+ QQQLQQQ++ +  Q G++P     + SHI QQQQN++QPNQL SSQQ  +  S  
Sbjct: 241  PYLY-QQQLQQQLLKQNFQSGNVPNPNSLLPSHIQQQQQNVLQPNQLHSSQQPGVPTSAT 300

Query: 301  QPS------LSNLQQNQQS----SIQQPTQSMLQQPQQPVLRQQPQSQQHAVMH-SQSTM 360
            QPS      L  L  NQQS    S QQ TQSML+Q Q  +LRQ PQSQQ + +H  QS++
Sbjct: 301  QPSTVNSAPLQGLHTNQQSSPQLSSQQTTQSMLRQHQSSMLRQHPQSQQASGIHQQQSSL 360

Query: 361  SQQTSLPSQQQ-QQLISQQ-PNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSN 420
             QQ+  P QQQ  QL+ QQ  NSS +QQ  ++G Q+ VGDMQQ    Q R   QQ+N+ N
Sbjct: 361  PQQSISPLQQQPTQLMRQQAANSSGIQQKQMMG-QHVVGDMQQQ--HQQRLLNQQNNVMN 420

Query: 421  MQSPPSQ-----------------QQQLMAQQNNLSNLQQQQLGPQSNVSGLQ--QQQMH 480
            +Q   SQ                 QQQLM+QQN+L    Q  LG QSNV+GLQ  QQQM 
Sbjct: 421  IQQQQSQQQPLQQPQQQQKQQPPAQQQLMSQQNSLQATHQNPLGTQSNVAGLQQPQQQML 480

Query: 481  GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 540
             +Q GNS++Q++QH +HML Q  V + Q+  Q    L S+QG Q Q Q SQQ        
Sbjct: 481  NSQVGNSSLQNNQHSVHMLSQPTVGL-QRTHQAGHGLYSSQGQQSQNQPSQQ-------- 540

Query: 541  STQVQQQVPLHQQQ---QQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQR 600
              Q+  Q+  H QQ   QQQPN +  D+QQRLQ  GQ   SLL  QNV+DQQ+QLY SQR
Sbjct: 541  --QMMPQLQSHHQQLGLQQQPNLLQQDVQQRLQASGQVTGSLLPPQNVVDQQRQLYQSQR 600

Query: 601  ALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESL 660
             LPE  S+SLDSTAQT  ANGGDWQEE+YQKIK+MKE Y  +L E+YQ++  K+ Q +S+
Sbjct: 601  TLPEMPSSSLDSTAQTESANGGDWQEEVYQKIKSMKETYLPDLNEIYQRVAAKLQQ-DSM 660

Query: 661  PQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNP 720
            PQQ +S+QL KL+ FK +LER+I FL +SKSNI+  LKDK+ +YEKQI+ FLN +RPR P
Sbjct: 661  PQQQRSDQLEKLRQFKTMLERMIQFLSVSKSNIMPALKDKVAYYEKQIIGFLNMHRPRKP 720

Query: 721  VSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNN 780
            V   Q GQLP S MQ +QQ QSQ    QS DNQ NPQ+ S +MQG+    QQ++M NM +
Sbjct: 721  V---QQGQLPQSQMQPMQQPQSQTVQDQSHDNQTNPQMQSMSMQGAGPRAQQSSMTNMQS 780

Query: 781  MQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANN 840
               +S P    SA QQN+   + P SSLESGQGN+L++ QQVA GS+QQ   N+SQ  NN
Sbjct: 781  NVLSSRP--GVSAPQQNIPSSI-PASSLESGQGNTLNNGQQVAMGSMQQ---NTSQLVNN 840

Query: 841  SSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLK--QDPQQQLKQQMQQRQMQQLKQQQMLQ 900
            SS  +Q+G++TLQ N+   Q + ++LQHQHLK  QD Q QLKQQ QQRQMQQ  QQ   +
Sbjct: 841  SSASAQSGLSTLQSNVNQPQLSSSLLQHQHLKQQQDQQMQLKQQFQQRQMQQ--QQLQAR 900

Query: 901  HQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMF-QHLPAA 960
             QQQQQQ Q  QQ +QL               Q++ +N +  RQG+    GMF QH    
Sbjct: 901  QQQQQQQLQARQQAAQL--------------QQMNDMNDLTSRQGMNVSRGMFQQHSMQG 960

Query: 961  HRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSP 1020
             R+ Y   QQ+KPG     SPQ+ Q ASPQ++Q+ SPQVDQ+N ++ +    PLQ A+SP
Sbjct: 961  QRANYP-LQQLKPGA--VSSPQLLQGASPQMSQHLSPQVDQKNTVNKMG--TPLQPANSP 1020

Query: 1021 LVVLSP-STPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISAS 1080
             VV SP STP+APSPM  DSEKP  G S+L+      QQ +     VQSLAIGTPGISAS
Sbjct: 1021 FVVPSPSSTPLAPSPMQVDSEKP--GSSSLSMGNIARQQATGMQGVVQSLAIGTPGISAS 1080

Query: 1081 PLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMI 1140
            PLL EF+  DG   N+    SGK SATE P+ERLI+AVKS+SP+ALS++V+ IGSVVSM+
Sbjct: 1081 PLLQEFTSPDGNILNSSTITSGKPSATELPIERLIRAVKSISPQALSSAVSDIGSVVSMV 1140

Query: 1141 DRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSA 1200
            DR+AGSAPGNGSRA+VGEDLVAMTKCRLQARNF++ +G   TKKM+R+T+AMPL+V S  
Sbjct: 1141 DRIAGSAPGNGSRASVGEDLVAMTKCRLQARNFMTQEGMMATKKMKRHTTAMPLSVASLG 1200

Query: 1201 GSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISD--E 1260
            GS+ D +K F G+ETSDLESTATS  K++R E  H LLEEI+EINQRLIDTVV ISD  +
Sbjct: 1201 GSVGDNYKQFAGSETSDLESTATSDGKKARTETEHALLEEIKEINQRLIDTVVEISDDED 1260

Query: 1261 VVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSP 1320
              DPS +A ++ G EGT V+ SF AV+LSP+LK+   S QMSPIQPLRLLVP +YPN SP
Sbjct: 1261 AADPSEVAISSIGCEGTTVRFSFIAVSLSPALKAHLSSTQMSPIQPLRLLVPCSYPNGSP 1320

Query: 1321 ILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQS 1354
             LLDK PVE  KE EDLS KA +RF+I LR+LSQPMSL DIA+TWD CAR V+ EYAQQ 
Sbjct: 1321 SLLDKLPVETSKENEDLSSKAMARFNILLRSLSQPMSLKDIAKTWDACARAVICEYAQQF 1335
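
The percentages in the HSP header of the alignment above follow directly from the reported counts: each is the number of matching columns divided by the alignment length. A minimal sketch of that arithmetic, with the counts copied from the header; the function name `percent` and the dictionary `hsp1` are illustrative only.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts from the HSP header for the F4I171 (MED15A) match.
hsp1 = {"identity": (757, 1399), "positives": (952, 1399)}

for label, (matches, length) in hsp1.items():
    print(f"{label}: {matches}/{length} = {percent(matches, length):.2f}%")
```

Note that the denominator is the alignment length including gap columns (1399), not the length of either sequence.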

BLAST of CmoCh19G003080.1 vs. ExPASy Swiss-Prot
Match: Q9SHV7 (Probable mediator of RNA polymerase II transcription subunit 15c OS=Arabidopsis thaliana OX=3702 GN=MED15C PE=3 SV=1)

HSP 1 Score: 261.5 bits (667), Expect = 5.1e-68
Identity = 308/980 (31.43%), Positives = 454/980 (46.33%), Query Frame = 0

Query: 396  QQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSD 455
            Q+S     +    +Q+QL+ Q  NL         P S  +   QQ   G    +S+ Q++
Sbjct: 147  QKSVFDTTEQKRQEQEQLINQLTNL---------PTSRPNNRDQQ---GAFQVSSSQQNN 206

Query: 456  QHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQ 515
               +H + Q K ++Q                   +   QQ+    P+ S Q +QQ P+ Q
Sbjct: 207  NVTLHAMSQQKNNLQ------------------SMTRGQQVGQSQPMMSQQYRQQYPM-Q 266

Query: 516  QQQQQPNAMSH-DLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP-----ETSSTS 575
            Q  Q  N   H D  Q      QA SSL Q+QN+ DQQ Q    +RA P          S
Sbjct: 267  QDPQNRNLQKHLDFVQNNTNQFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVAS 326

Query: 576  LDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQPKSEQ- 635
             DST +T   N G+WQEE YQKIK +KE+    L  M+Q++  K+ + ESLP QP   Q 
Sbjct: 327  QDSTGKTVNVNAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQW 386

Query: 636  LNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQ 695
            + KLK  KL +E L+ FL + +S++    +DK   YE  I+ F  S       +  Q GQ
Sbjct: 387  IEKLKAGKLSMEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQQQQGQ 446

Query: 696  LPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHNSLPT 755
             P S  Q+  Q        QSP   ++  L+    +  + P  QN  +++  ++    P 
Sbjct: 447  FPPS--QTAMQT-------QSPQVHVSQSLYKEQRRSRLMPSSQNEASSLLQIRPKLDP- 506

Query: 756  FSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNG 815
                   +N+                 ++S   V   S++QNP                 
Sbjct: 507  -----RDENII----------------MASSGNVMLPSVKQNP---------------RA 566

Query: 816  VNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQL 875
            VNT   NI S+Q+    LQ                        KQ++    Q QQQQPQ 
Sbjct: 567  VNT---NISSVQS----LQ------------------------KQKRFHHRQMQQQQPQQ 626

Query: 876  HQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTHQQQM 935
               Q Q+               Q + +N + MR+ +  K  +               +Q 
Sbjct: 627  GNHQHQM---------------QTNEMNDVRMRERVNIKARLL--------------EQQ 686

Query: 936  KPGTSLPISPQIFQTASPQVAQNSSPQ-VDQQNLLSSITKV-PPLQSASSPLVVLSPSTP 995
               +   +  Q    +S Q+  +SSPQ VDQ  L ++I K   PL S+ S  V       
Sbjct: 687  VSSSQRQVPKQESNVSSSQIQNHSSPQLVDQHILPATINKTGTPLNSSGSAFVA------ 746

Query: 996  VAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTD 1055
             APSP+PGDSE P S  S ++          ++ T   S  +GT     +PLL       
Sbjct: 747  PAPSPVPGDSEMPISVESPVSGV------DEINSTLDSSSKLGT---QETPLL------- 806

Query: 1056 GAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGN 1115
                     V      TE+P++RLIKA ++ SPK+L+ SV+ I SV+SM+D + GS P +
Sbjct: 807  --------FVPPPEPITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSS 866

Query: 1116 -GSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKP 1175
             GSRA +GEDL   T      RNF +H+ +N +K+M+R  + +P ++ S      D ++ 
Sbjct: 867  GGSRAGLGEDLSERT------RNFTTHEETNLSKRMKRSINIVPPDMSSQI----DSYEQ 926

Query: 1176 FTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAA 1235
             +  E S++ ST +S  K + +   + LL+EI+E N RL++TVV I DE           
Sbjct: 927  LSSLE-SEVVSTTSSGLKVNNIAPGYALLQEIKETNGRLVETVVEICDE----------- 935

Query: 1236 DGSEGTIVKCSFSAVALSPSLKSQYMSAQM----------SPIQPLRLLVPTNYPNCSPI 1295
              S GTIV C+++ VALS + K  Y S ++          + IQPLRLL P +YP  SPI
Sbjct: 987  -DSLGTIVTCTYAPVALSATFKDHYKSGKIIFYVSKCLMQAQIQPLRLLFPMDYPYSSPI 935

Query: 1296 LLDK--FPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQ 1354
            +L++  F   V K YEDLS + +SRFS+S++  S+P     IA+TW+ CAR  + EYA++
Sbjct: 1047 VLEEISFDTSVHK-YEDLSARTRSRFSLSMKEFSEPGFSKGIAQTWNDCARATMVEYAER 935

BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match: A0A6J1HI15 (mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463831 PE=4 SV=1)

HSP 1 Score: 2419.4 bits (6269), Expect = 0.0e+00
Identity = 1353/1353 (100.00%), Positives = 1353/1353 (100.00%), Query Frame = 0

Query: 1    MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
            MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA
Sbjct: 1    MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60

Query: 61   VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
            VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ
Sbjct: 61   VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120

Query: 121  PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
            PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM
Sbjct: 121  PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180

Query: 181  VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240
            VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL
Sbjct: 181  VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240

Query: 241  FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS 300
            FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS
Sbjct: 241  FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS 300

Query: 301  NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS 360
            NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS
Sbjct: 301  NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS 360

Query: 361  QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNL 420
            QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNL
Sbjct: 361  QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNL 420

Query: 421  SNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNL 480
            SNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNL
Sbjct: 421  SNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNL 480

Query: 481  LSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPS 540
            LSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPS
Sbjct: 481  LSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPS 540

Query: 541  SLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYF 600
            SLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYF
Sbjct: 541  SLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYF 600

Query: 601  FELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK 660
            FELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK
Sbjct: 601  FELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK 660

Query: 661  IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS 720
            IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS
Sbjct: 661  IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS 720

Query: 721  ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQ 780
            ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQ
Sbjct: 721  ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQ 780

Query: 781  QVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLK 840
            QVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLK
Sbjct: 781  QVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLK 840

Query: 841  QQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEM 900
            QQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEM
Sbjct: 841  QQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEM 900

Query: 901  RQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQN 960
            RQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQN
Sbjct: 901  RQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQN 960

Query: 961  LLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSG 1020
            LLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSG
Sbjct: 961  LLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSG 1020

Query: 1021 TQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPK 1080
            TQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPK
Sbjct: 1021 TQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPK 1080

Query: 1081 ALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKK 1140
            ALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKK
Sbjct: 1081 ALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKK 1140

Query: 1141 MRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREI 1200
            MRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREI
Sbjct: 1141 MRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREI 1200

Query: 1201 NQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQP 1260
            NQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQP
Sbjct: 1201 NQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQP 1260

Query: 1261 LRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWD 1320
            LRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWD
Sbjct: 1261 LRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWD 1320

Query: 1321 VCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
            VCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA
Sbjct: 1321 VCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1353

BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match: A0A6J1HR60 (mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467027 PE=4 SV=1)

HSP 1 Score: 2368.6 bits (6137), Expect = 0.0e+00
Identity = 1334/1356 (98.38%), Positives = 1341/1356 (98.89%), Query Frame = 0

Query: 1    MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
            MDTNNWRPTQGGEPGIE GDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA
Sbjct: 1    MDTNNWRPTQGGEPGIEDGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60

Query: 61   VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
            VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ
Sbjct: 61   VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120

Query: 121  PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
            PQVLNQGQSISVPQ SNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM
Sbjct: 121  PQVLNQGQSISVPQPSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180

Query: 181  VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240
            VGQNPSMQNVSGIPQNSVGN+MGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL
Sbjct: 181  VGQNPSMQNVSGIPQNSVGNSMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240

Query: 241  FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS 300
            FHQQQLQQQMMNKK QQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQ AMQPSMMQ SLS
Sbjct: 241  FHQQQLQQQMMNKKFQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQPAMQPSMMQSSLS 300

Query: 301  NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS 360
            NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMH QSTMSQQTSLPSQQQQQLIS
Sbjct: 301  NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHPQSTMSQQTSLPSQQQQQLIS 360

Query: 361  QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPP-SQQQQLMAQQNN 420
            QQPNSS+MQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPP  QQQQLMAQQNN
Sbjct: 361  QQPNSSSMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPLQQQQQLMAQQNN 420

Query: 421  LSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASN 480
            LSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASN
Sbjct: 421  LSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASN 480

Query: 481  LLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAP 540
            LLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAP
Sbjct: 481  LLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAP 540

Query: 541  SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY 600
            SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY
Sbjct: 541  SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY 600

Query: 601  FFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKD 660
            FFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKD
Sbjct: 601  FFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKD 660

Query: 661  KIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLH 720
            KIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQ+QSQMTPLQSP+NQINPQLH
Sbjct: 661  KIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQSQSQMTPLQSPENQINPQLH 720

Query: 721  SANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSF 780
            SANMQGSVAPVQQ   NNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSF
Sbjct: 721  SANMQGSVAPVQQ---NNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSF 780

Query: 781  QQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQL 840
            QQVASGSLQQN +NSSQRANN+SLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQL
Sbjct: 781  QQVASGSLQQNSANSSQRANNNSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQL 840

Query: 841  KQQMQQRQMQQLKQQQMLQH--QQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQ 900
            KQQMQQRQMQQLKQQQMLQH  QQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQ
Sbjct: 841  KQQMQQRQMQQLKQQQMLQHQQQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQ 900

Query: 901  IEMRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVD 960
            IEMRQGLATK GMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVD
Sbjct: 901  IEMRQGLATKSGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVD 960

Query: 961  QQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTS 1020
            QQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTS
Sbjct: 961  QQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTS 1020

Query: 1021 VSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSM 1080
            VSGTQVQSLAIGTPGISASPLLAEFSGTDGAYA+ALPTVSGKSSATEQPLERLIKAVKSM
Sbjct: 1021 VSGTQVQSLAIGTPGISASPLLAEFSGTDGAYASALPTVSGKSSATEQPLERLIKAVKSM 1080

Query: 1081 SPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNG 1140
            SPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNG
Sbjct: 1081 SPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNG 1140

Query: 1141 TKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEI 1200
            TKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEI
Sbjct: 1141 TKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEI 1200

Query: 1201 REINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSP 1260
            REINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSP
Sbjct: 1201 REINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSP 1260

Query: 1261 IQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIAR 1320
            IQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIAR
Sbjct: 1261 IQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIAR 1320

Query: 1321 TWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
            TWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA
Sbjct: 1321 TWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1353

BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match: A0A6J1HI99 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463831 PE=4 SV=1)

HSP 1 Score: 2252.6 bits (5836), Expect = 0.0e+00
Identity = 1269/1269 (100.00%), Positives = 1269/1269 (100.00%), Query Frame = 0

Query: 85   MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ 144
            MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ
Sbjct: 1    MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ 60

Query: 145  LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ 204
            LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ
Sbjct: 61   LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ 120

Query: 205  GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR 264
            GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR
Sbjct: 121  GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR 180

Query: 265  MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP 324
            MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP
Sbjct: 181  MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP 240

Query: 325  VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ 384
            VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ
Sbjct: 241  VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ 300

Query: 385  QHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHG 444
            QHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHG
Sbjct: 301  QHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHG 360

Query: 445  TQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQS 504
            TQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQS
Sbjct: 361  TQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQS 420

Query: 505  TQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPE 564
            TQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPE
Sbjct: 421  TQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPE 480

Query: 565  TSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQP 624
            TSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQP
Sbjct: 481  TSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQP 540

Query: 625  KSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTL 684
            KSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTL
Sbjct: 541  KSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTL 600

Query: 685  QPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHN 744
            QPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHN
Sbjct: 601  QPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHN 660

Query: 745  SLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLP 804
            SLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLP
Sbjct: 661  SLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLP 720

Query: 805  SQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQ 864
            SQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQ
Sbjct: 721  SQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQ 780

Query: 865  QPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTH 924
            QPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTH
Sbjct: 781  QPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTH 840

Query: 925  QQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPS 984
            QQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPS
Sbjct: 841  QQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPS 900

Query: 985  TPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSG 1044
            TPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSG
Sbjct: 901  TPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSG 960

Query: 1045 TDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAP 1104
            TDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAP
Sbjct: 961  TDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAP 1020

Query: 1105 GNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFK 1164
            GNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFK
Sbjct: 1021 GNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFK 1080

Query: 1165 PFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAA 1224
            PFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAA
Sbjct: 1081 PFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAA 1140

Query: 1225 ADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV 1284
            ADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV
Sbjct: 1141 ADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV 1200

Query: 1285 RKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYG 1344
            RKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYG
Sbjct: 1201 RKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYG 1260

Query: 1345 AWENCLSAA 1354
            AWENCLSAA
Sbjct: 1261 AWENCLSAA 1269

BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match: A0A6J1HVE4 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467027 PE=4 SV=1)

HSP 1 Score: 2203.7 bits (5709), Expect = 0.0e+00
Identity = 1251/1272 (98.35%), Positives = 1258/1272 (98.90%), Query Frame = 0

Query: 85   MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ 144
            MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQ SNQPQSRQQ
Sbjct: 1    MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQPSNQPQSRQQ 60

Query: 145  LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ 204
            LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGN+MGQ
Sbjct: 61   LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNSMGQ 120

Query: 205  GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR 264
            GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKK QQGSIPQQR
Sbjct: 121  GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKFQQGSIPQQR 180

Query: 265  MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP 324
            MQSHIPQQQQNLMQPNQLQSSQQ AMQPSMMQ SLSNLQQNQQSSIQQPTQSMLQQPQQP
Sbjct: 181  MQSHIPQQQQNLMQPNQLQSSQQPAMQPSMMQSSLSNLQQNQQSSIQQPTQSMLQQPQQP 240

Query: 325  VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ 384
            VLRQQPQSQQHAVMH QSTMSQQTSLPSQQQQQLISQQPNSS+MQQNPLIGQQNSVGDMQ
Sbjct: 241  VLRQQPQSQQHAVMHPQSTMSQQTSLPSQQQQQLISQQPNSSSMQQNPLIGQQNSVGDMQ 300

Query: 385  QHLPQQSRSHGQQSNLSNMQSPP-SQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMH 444
            QHLPQQSRSHGQQSNLSNMQSPP  QQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMH
Sbjct: 301  QHLPQQSRSHGQQSNLSNMQSPPLQQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMH 360

Query: 445  GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 504
            GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ
Sbjct: 361  GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 420

Query: 505  STQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP 564
            STQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP
Sbjct: 421  STQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP 480

Query: 565  ETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQ 624
            ETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQ
Sbjct: 481  ETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQ 540

Query: 625  PKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVST 684
            PKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVST
Sbjct: 541  PKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVST 600

Query: 685  LQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQH 744
            LQPGQLPASHMQSIQQ+QSQMTPLQSP+NQINPQLHSANMQGSVAPVQQ   NNMNNMQH
Sbjct: 601  LQPGQLPASHMQSIQQSQSQMTPLQSPENQINPQLHSANMQGSVAPVQQ---NNMNNMQH 660

Query: 745  NSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSL 804
            NSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQN +NSSQRANN+SL
Sbjct: 661  NSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNSANSSQRANNNSL 720

Query: 805  PSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQH--QQ 864
            PSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQH  QQ
Sbjct: 721  PSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQ 780

Query: 865  QQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSG 924
            QQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATK GMFQHLPAAHRSG
Sbjct: 781  QQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKSGMFQHLPAAHRSG 840

Query: 925  YTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVL 984
            YTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVL
Sbjct: 841  YTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVL 900

Query: 985  SPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAE 1044
            SPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAE
Sbjct: 901  SPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAE 960

Query: 1045 FSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAG 1104
            FSGTDGAYA+ALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAG
Sbjct: 961  FSGTDGAYASALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAG 1020

Query: 1105 SAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSIND 1164
            SAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSIND
Sbjct: 1021 SAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSIND 1080

Query: 1165 VFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSAL 1224
            VFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSAL
Sbjct: 1081 VFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSAL 1140

Query: 1225 AAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFP 1284
            AAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFP
Sbjct: 1141 AAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFP 1200

Query: 1285 VEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCS 1344
            VEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCS
Sbjct: 1201 VEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCS 1260

Query: 1345 RYGAWENCLSAA 1354
            RYGAWENCLSAA
Sbjct: 1261 RYGAWENCLSAA 1269

BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match: A0A6J1ELX0 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111434564 PE=4 SV=1)

HSP 1 Score: 2004.9 bits (5193), Expect = 0.0e+00
Identity = 1166/1375 (84.80%), Positives = 1235/1375 (89.82%), Query Frame = 0

Query: 1    MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
            MD++NWRP QGGE G++AGDWRSQLQPDSR RIVNKIMETLKRHLPVSGHEGLSEL+KIA
Sbjct: 1    MDSSNWRPAQGGESGVDAGDWRSQLQPDSRHRIVNKIMETLKRHLPVSGHEGLSELRKIA 60

Query: 61   VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
            VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQT   T+LPSN MVP+NKPLDS SQSMQ
Sbjct: 61   VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQT---TALPSNSMVPTNKPLDSTSQSMQ 120

Query: 121  PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
             QVLNQG S+S P SSNQPQ RQQLLSQNIQNNI SQSSSSLPS+VPPV+GLAS+ M N+
Sbjct: 121  SQVLNQGPSMSGPMSSNQPQPRQQLLSQNIQNNIASQSSSSLPSSVPPVAGLASAPMANI 180

Query: 181  VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVS--QQQQQQAQTQQQQ 240
            VGQNPSMQNVSG+PQ+SVGNAMGQGV SNVFTNSQRP+QGRQVVS  QQQQQQ+Q+QQQQ
Sbjct: 181  VGQNPSMQNVSGVPQSSVGNAMGQGVSSNVFTNSQRPIQGRQVVSQQQQQQQQSQSQQQQ 240

Query: 241  FLFHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPS 300
              F QQ LQQQ+M +K QQGS+P Q MQSHIPQQQ NLM PNQL SSQQ     S+MQPS
Sbjct: 241  LFFQQQHLQQQIMKQKYQQGSMPHQLMQSHIPQQQTNLMAPNQLPSSQQ-----SVMQPS 300

Query: 301  LSNLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQL 360
            LSNLQQNQQSSIQQPTQSMLQQP QPVLRQQ QSQQH+V+H Q TMSQQ SL SQQQQQL
Sbjct: 301  LSNLQQNQQSSIQQPTQSMLQQPPQPVLRQQQQSQQHSVLHQQPTMSQQASLSSQQQQQL 360

Query: 361  ISQQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPS--QQQQLMAQ 420
            I+QQ NSSNMQQN LI  QNSVGDMQQ LPQQSRSHGQQSNLSNMQ+PPS  QQQQLM Q
Sbjct: 361  INQQSNSSNMQQNSLI--QNSVGDMQQQLPQQSRSHGQQSNLSNMQTPPSQQQQQQLMNQ 420

Query: 421  QNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQN 480
            Q++LSNLQQ QLGPQSNVSGLQQQQMHGTQSGNSNMQS+QH +HM+QQNKV MQQQPPQN
Sbjct: 421  QSSLSNLQQPQLGPQSNVSGLQQQQMHGTQSGNSNMQSNQHGVHMMQQNKVQMQQQPPQN 480

Query: 481  ASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQP--NAMSHDLQQRLQV 540
             SNLLS QG QGQLQSSQQLMSQIPLQS QVQQQV L QQQQQQP  N +SH+LQQRLQ 
Sbjct: 481  PSNLLSTQGQQGQLQSSQQLMSQIPLQSAQVQQQVSLQQQQQQQPQSNTLSHELQQRLQA 540

Query: 541  GGQAPSSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKA 600
            GGQAP  LLQSQNVMDQQKQLYH QR LPETSSTSLDSTAQTGQANGGDWQEEIYQKIK+
Sbjct: 541  GGQAPGPLLQSQNVMDQQKQLYHPQRVLPETSSTSLDSTAQTGQANGGDWQEEIYQKIKS 600

Query: 601  MKELYFFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIV 660
            MKELY FELKEMYQKILPKV+Q +SLPQQPKSEQLNKL+ F++ILERLIAFLQ+ K+NIV
Sbjct: 601  MKELYLFELKEMYQKILPKVHQFDSLPQQPKSEQLNKLRAFRVILERLIAFLQVPKNNIV 660

Query: 661  IGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQI 720
            IG KDKI HYEKQIVSFLNSNRPRNPVSTLQ GQLPASHMQS+QQ+QSQMTPLQSP+NQI
Sbjct: 661  IGFKDKISHYEKQIVSFLNSNRPRNPVSTLQQGQLPASHMQSMQQSQSQMTPLQSPENQI 720

Query: 721  NPQLHSANMQGSVAPVQQ---NNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESG 780
            NPQLHSANMQGSVA VQQ   NNMNNMNNMQHNSLPTFSGSA QQNMTIPMQPGSSLESG
Sbjct: 721  NPQLHSANMQGSVALVQQNNMNNMNNMNNMQHNSLPTFSGSAPQQNMTIPMQPGSSLESG 780

Query: 781  QGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHL 840
            QGNSLSS QQV + SLQQNP+N SQRANNSSL SQNGVN LQPNI SLQ+N N+LQHQH+
Sbjct: 781  QGNSLSSLQQVGAVSLQQNPANGSQRANNSSLASQNGVNALQPNISSLQSNTNILQHQHM 840

Query: 841  K-QDP------QQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLP 900
            K QDP      QQQLKQQMQQR MQ LKQQ +   QQQQQQPQLHQQQSQL QQGK QLP
Sbjct: 841  KQQDPQQLLQSQQQLKQQMQQRHMQHLKQQMLQHQQQQQQQPQLHQQQSQLQQQGKQQLP 900

Query: 901  AQMQAHQLSHLNQIE-----MRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQ 960
             QMQAHQ+SHLNQIE     MRQG+A KPGMFQH P   RS YTH  QMKPGTS PISP 
Sbjct: 901  TQMQAHQMSHLNQIEMNDLKMRQGVAAKPGMFQH-PGTQRSAYTH-PQMKPGTSFPISPP 960

Query: 961  IFQTASPQVAQNSSPQVDQQNLLSSITKV-PPLQSASSPLVVLSPSTPVAPSPMPGDSEK 1020
            IFQ  SPQV QNSSPQVDQQN+ SS+ ++  PLQSASSP VV SPSTP+APSPMPGDSEK
Sbjct: 961  IFQATSPQVTQNSSPQVDQQNMFSSMNRIGTPLQSASSPFVVPSPSTPLAPSPMPGDSEK 1020

Query: 1021 PTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSG 1080
            PTS VS+L NAGNTGQQ +VSG Q  SLAIGTPGISASPLLAEFSGTDGAYA ALPTVSG
Sbjct: 1021 PTSAVSSLPNAGNTGQQMNVSGAQAPSLAIGTPGISASPLLAEFSGTDGAYAIALPTVSG 1080

Query: 1081 KSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVA 1140
            KSS TEQPLERLIKAVKSMSP+AL+ASV+GIGSVVSMIDR+AGSAPGNGSRAAVGEDLVA
Sbjct: 1081 KSSVTEQPLERLIKAVKSMSPRALNASVSGIGSVVSMIDRIAGSAPGNGSRAAVGEDLVA 1140

Query: 1141 MTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTA 1200
            MTKCRLQARNFVSHDGSNGTK+MRR+TSAMPLNVVSSAGS+NDVFKP TGAETSDLESTA
Sbjct: 1141 MTKCRLQARNFVSHDGSNGTKRMRRHTSAMPLNVVSSAGSVNDVFKPLTGAETSDLESTA 1200

Query: 1201 TSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFS 1260
            TS  KRSR+EA+HVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGS+GTIVKCSFS
Sbjct: 1201 TSSVKRSRIEASHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGSDGTIVKCSFS 1260

Query: 1261 AVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSR 1320
            AVALSPSLKSQY SAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV KEYEDLSIKAKSR
Sbjct: 1261 AVALSPSLKSQYTSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEVSKEYEDLSIKAKSR 1320

Query: 1321 FSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
            FSISLRNLSQPMSLGDIARTWDVCAR VVSEYAQQSGGGSFCS+YGAWENCLSAA
Sbjct: 1321 FSISLRNLSQPMSLGDIARTWDVCARAVVSEYAQQSGGGSFCSKYGAWENCLSAA 1363

BLAST of CmoCh19G003080.1 vs. TAIR 10
Match: AT1G15780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G10440.1); Has 103701 Blast hits to 43153 proteins in 1828 species: Archae - 30; Bacteria - 7385; Metazoa - 38639; Fungi - 11531; Plants - 7727; Viruses - 307; Other Eukaryotes - 38082 (source: NCBI BLink).)

HSP 1 Score: 1068.5 bits (2762), Expect = 4.3e-312
Identity = 757/1399 (54.11%), Positives = 952/1399 (68.05%), Query Frame = 0

Query: 1    MDTNNWRPT-QGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKI 60
            MD NNWRP+   GEP ++ GDWR+QL PDSRQ+IVNKIMETLK+HLP SG EG++EL++I
Sbjct: 1    MDNNNWRPSLPNGEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRI 60

Query: 61   AVRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSM 120
            A RFEEKI++ A +Q+DYLRKIS+KMLTMETKSQ   G+S  + P   +   +DS     
Sbjct: 61   AARFEEKIFSGALNQTDYLRKISMKMLTMETKSQNAAGSS-AAIPAANNGTSIDSIP--- 120

Query: 121  QPQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVS--QSSSSLPSAVPPVSGLASSSM 180
                 NQGQ +    S+NQ Q+ Q LLSQ +QNN  S    S++LPS++PPVS + +++ 
Sbjct: 121  ----TNQGQLLPGSLSTNQSQAPQPLLSQTMQNNTASGMTGSTALPSSMPPVSSITNNNT 180

Query: 181  PNMVGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQ 240
             ++V QN +MQNV+G+ Q+S G     G+ SN+F+  QR + GR      QQ     QQQ
Sbjct: 181  TSVVNQNANMQNVAGMLQDSSGQ---HGLSSNMFSGPQRQMLGRPHAMSSQQ-----QQQ 240

Query: 241  QFLFHQQQLQQQMMNKKLQQGSIPQQR--MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMM 300
             +L+ QQQLQQQ++ +  Q G++P     + SHI QQQQN++QPNQL SSQQ  +  S  
Sbjct: 241  PYLY-QQQLQQQLLKQNFQSGNVPNPNSLLPSHIQQQQQNVLQPNQLHSSQQPGVPTSAT 300

Query: 301  QPS------LSNLQQNQQS----SIQQPTQSMLQQPQQPVLRQQPQSQQHAVMH-SQSTM 360
            QPS      L  L  NQQS    S QQ TQSML+Q Q  +LRQ PQSQQ + +H  QS++
Sbjct: 301  QPSTVNSAPLQGLHTNQQSSPQLSSQQTTQSMLRQHQSSMLRQHPQSQQASGIHQQQSSL 360

Query: 361  SQQTSLPSQQQ-QQLISQQ-PNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSN 420
             QQ+  P QQQ  QL+ QQ  NSS +QQ  ++G Q+ VGDMQQ    Q R   QQ+N+ N
Sbjct: 361  PQQSISPLQQQPTQLMRQQAANSSGIQQKQMMG-QHVVGDMQQQ--HQQRLLNQQNNVMN 420

Query: 421  MQSPPSQ-----------------QQQLMAQQNNLSNLQQQQLGPQSNVSGLQ--QQQMH 480
            +Q   SQ                 QQQLM+QQN+L    Q  LG QSNV+GLQ  QQQM 
Sbjct: 421  IQQQQSQQQPLQQPQQQQKQQPPAQQQLMSQQNSLQATHQNPLGTQSNVAGLQQPQQQML 480

Query: 481  GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 540
             +Q GNS++Q++QH +HML Q  V + Q+  Q    L S+QG Q Q Q SQQ        
Sbjct: 481  NSQVGNSSLQNNQHSVHMLSQPTVGL-QRTHQAGHGLYSSQGQQSQNQPSQQ-------- 540

Query: 541  STQVQQQVPLHQQQ---QQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQR 600
              Q+  Q+  H QQ   QQQPN +  D+QQRLQ  GQ   SLL  QNV+DQQ+QLY SQR
Sbjct: 541  --QMMPQLQSHHQQLGLQQQPNLLQQDVQQRLQASGQVTGSLLPPQNVVDQQRQLYQSQR 600

Query: 601  ALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESL 660
             LPE  S+SLDSTAQT  ANGGDWQEE+YQKIK+MKE Y  +L E+YQ++  K+ Q +S+
Sbjct: 601  TLPEMPSSSLDSTAQTESANGGDWQEEVYQKIKSMKETYLPDLNEIYQRVAAKLQQ-DSM 660

Query: 661  PQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNP 720
            PQQ +S+QL KL+ FK +LER+I FL +SKSNI+  LKDK+ +YEKQI+ FLN +RPR P
Sbjct: 661  PQQQRSDQLEKLRQFKTMLERMIQFLSVSKSNIMPALKDKVAYYEKQIIGFLNMHRPRKP 720

Query: 721  VSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNN 780
            V   Q GQLP S MQ +QQ QSQ    QS DNQ NPQ+ S +MQG+    QQ++M NM +
Sbjct: 721  V---QQGQLPQSQMQPMQQPQSQTVQDQSHDNQTNPQMQSMSMQGAGPRAQQSSMTNMQS 780

Query: 781  MQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANN 840
               +S P    SA QQN+   + P SSLESGQGN+L++ QQVA GS+QQ   N+SQ  NN
Sbjct: 781  NVLSSRP--GVSAPQQNIPSSI-PASSLESGQGNTLNNGQQVAMGSMQQ---NTSQLVNN 840

Query: 841  SSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLK--QDPQQQLKQQMQQRQMQQLKQQQMLQ 900
            SS  +Q+G++TLQ N+   Q + ++LQHQHLK  QD Q QLKQQ QQRQMQQ  QQ   +
Sbjct: 841  SSASAQSGLSTLQSNVNQPQLSSSLLQHQHLKQQQDQQMQLKQQFQQRQMQQ--QQLQAR 900

Query: 901  HQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMF-QHLPAA 960
             QQQQQQ Q  QQ +QL               Q++ +N +  RQG+    GMF QH    
Sbjct: 901  QQQQQQQLQARQQAAQL--------------QQMNDMNDLTSRQGMNVSRGMFQQHSMQG 960

Query: 961  HRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSP 1020
             R+ Y   QQ+KPG     SPQ+ Q ASPQ++Q+ SPQVDQ+N ++ +    PLQ A+SP
Sbjct: 961  QRANYP-LQQLKPGA--VSSPQLLQGASPQMSQHLSPQVDQKNTVNKMG--TPLQPANSP 1020

Query: 1021 LVVLSP-STPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISAS 1080
             VV SP STP+APSPM  DSEKP  G S+L+      QQ +     VQSLAIGTPGISAS
Sbjct: 1021 FVVPSPSSTPLAPSPMQVDSEKP--GSSSLSMGNIARQQATGMQGVVQSLAIGTPGISAS 1080

Query: 1081 PLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMI 1140
            PLL EF+  DG   N+    SGK SATE P+ERLI+AVKS+SP+ALS++V+ IGSVVSM+
Sbjct: 1081 PLLQEFTSPDGNILNSSTITSGKPSATELPIERLIRAVKSISPQALSSAVSDIGSVVSMV 1140

Query: 1141 DRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSA 1200
            DR+AGSAPGNGSRA+VGEDLVAMTKCRLQARNF++ +G   TKKM+R+T+AMPL+V S  
Sbjct: 1141 DRIAGSAPGNGSRASVGEDLVAMTKCRLQARNFMTQEGMMATKKMKRHTTAMPLSVASLG 1200

Query: 1201 GSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISD--E 1260
            GS+ D +K F G+ETSDLESTATS  K++R E  H LLEEI+EINQRLIDTVV ISD  +
Sbjct: 1201 GSVGDNYKQFAGSETSDLESTATSDGKKARTETEHALLEEIKEINQRLIDTVVEISDDED 1260

Query: 1261 VVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSP 1320
              DPS +A ++ G EGT V+ SF AV+LSP+LK+   S QMSPIQPLRLLVP +YPN SP
Sbjct: 1261 AADPSEVAISSIGCEGTTVRFSFIAVSLSPALKAHLSSTQMSPIQPLRLLVPCSYPNGSP 1320

Query: 1321 ILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQS 1354
             LLDK PVE  KE EDLS KA +RF+I LR+LSQPMSL DIA+TWD CAR V+ EYAQQ 
Sbjct: 1321 SLLDKLPVETSKENEDLSSKAMARFNILLRSLSQPMSLKDIAKTWDACARAVICEYAQQF 1335

BLAST of CmoCh19G003080.1 vs. TAIR 10
Match: AT2G10440.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 1628 Blast hits to 1350 proteins in 149 species: Archae - 0; Bacteria - 39; Metazoa - 480; Fungi - 159; Plants - 187; Viruses - 2; Other Eukaryotes - 761 (source: NCBI BLink). )

HSP 1 Score: 263.1 bits (671), Expect = 1.3e-69
Identity = 263/820 (32.07%), Positives = 396/820 (48.29%), Query Frame = 0

Query: 540  SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY 599
            SS+  +++ +  QK ++ +   +      S DST +T   N G+WQEE YQKIK +KE+ 
Sbjct: 186  SSIKLTKHSITDQKSVFDTTVLIMNIIVASQDSTGKTVNVNAGNWQEETYQKIKKLKEMC 245

Query: 600  FFELKEMYQKILPKVNQLESLPQQPKSEQ-LNKLKTFKLILERLIAFLQISKSNIVIGLK 659
               L  M+Q++  K+ + ESLP QP   Q + KLK  KL +E L+ FL + +S++    +
Sbjct: 246  LPVLSLMHQRVAEKLRETESLPPQPMQAQWIEKLKAGKLSMEHLMFFLNVHRSSVSEKHR 305

Query: 660  DKIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQL 719
            DK   YE  I+ F  S       +  Q GQ P S  Q+  Q        QSP   ++  L
Sbjct: 306  DKFSQYEYHILKFTKSQTMVLRPTQQQQGQFPPS--QTAMQT-------QSPQVHVSQSL 365

Query: 720  HSANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSS 779
            +    +  + P  QN  +++  ++    P        +N+                 ++S
Sbjct: 366  YKEQRRSRLMPSSQNEASSLLQIRPKLDP------RDENII----------------MAS 425

Query: 780  FQQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQ 839
               V   S++QNP                 VNT   NI S+Q+    LQ           
Sbjct: 426  SGNVMLPSVKQNP---------------RAVNT---NISSVQS----LQ----------- 485

Query: 840  LKQQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQI 899
                         KQ++    Q QQQQPQ    Q Q+               Q + +N +
Sbjct: 486  -------------KQKRFHHRQMQQQQPQQGNHQHQM---------------QTNEMNDV 545

Query: 900  EMRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQ-VD 959
             MR+ +  K  +               +Q    +   +  Q    +S Q+  +SSPQ VD
Sbjct: 546  RMRERVNIKARLL--------------EQQVSSSQRQVPKQESNVSSSQIQNHSSPQLVD 605

Query: 960  QQNLLSSITKV-PPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQT 1019
            Q  L ++I K   PL S+ S  V        APSP+PGDSE P S  S ++         
Sbjct: 606  QHILPATINKTGTPLNSSGSAFVA------PAPSPVPGDSEMPISVESPVSGV------D 665

Query: 1020 SVSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKS 1079
             ++ T   S  +GT     +PLL                V      TE+P++RLIKA ++
Sbjct: 666  EINSTLDSSSKLGT---QETPLL---------------FVPPPEPITERPIDRLIKAFQA 725

Query: 1080 MSPKALSASVNGIGSVVSMIDRVAGSAPGN-GSRAAVGEDLVAMTKCRLQARNFVSHDGS 1139
             SPK+L+ SV+ I SV+SM+D + GS P + GSRA +GEDL   T      RNF +H+ +
Sbjct: 726  ASPKSLAESVSEISSVISMVDMIGGSFPSSGGSRAGLGEDLSERT------RNFTTHEET 785

Query: 1140 NGTKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLE 1199
            N +K+M+R  + +P ++ S      D ++  +  E S++ ST +S  K + +   + LL+
Sbjct: 786  NLSKRMKRSINIVPPDMSSQI----DSYEQLSSLE-SEVVSTTSSGLKVNNIAPGYALLQ 845

Query: 1200 EIREINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQM 1259
            EI+E N RL++TVV I DE             S GTIV C+++ VALS + K  Y S ++
Sbjct: 846  EIKETNGRLVETVVEICDE------------DSLGTIVTCTYAPVALSATFKDHYKSGKI 845

Query: 1260 SPIQPLRLLVPTNYPNCSPILLDK--FPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLG 1319
            + IQPLRLL P +YP  SPI+L++  F   V K YEDLS + +SRFS+S++  S+P    
Sbjct: 906  AQIQPLRLLFPMDYPYSSPIVLEEISFDTSVHK-YEDLSARTRSRFSLSMKEFSEPGFSK 845

Query: 1320 DIARTWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
             IA+TW+ CAR  + EYA++ GGG+F S+YGAWE  L A+
Sbjct: 966  GIAQTWNDCARATMVEYAERHGGGTFSSKYGAWETVLRAS 845

BLAST of CmoCh19G003080.1 vs. TAIR 10
Match: AT2G10440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 8319 Blast hits to 5104 proteins in 317 species: Archae - 0; Bacteria - 285; Metazoa - 1706; Fungi - 535; Plants - 320; Viruses - 18; Other Eukaryotes - 5455 (source: NCBI BLink). )

HSP 1 Score: 261.5 bits (667), Expect = 3.6e-69
Identity = 308/980 (31.43%), Positives = 454/980 (46.33%), Query Frame = 0

Query: 396  QQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSD 455
            Q+S     +    +Q+QL+ Q  NL         P S  +   QQ   G    +S+ Q++
Sbjct: 147  QKSVFDTTEQKRQEQEQLINQLTNL---------PTSRPNNRDQQ---GAFQVSSSQQNN 206

Query: 456  QHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQ 515
               +H + Q K ++Q                   +   QQ+    P+ S Q +QQ P+ Q
Sbjct: 207  NVTLHAMSQQKNNLQ------------------SMTRGQQVGQSQPMMSQQYRQQYPM-Q 266

Query: 516  QQQQQPNAMSH-DLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP-----ETSSTS 575
            Q  Q  N   H D  Q      QA SSL Q+QN+ DQQ Q    +RA P          S
Sbjct: 267  QDPQNRNLQKHLDFVQNNTNQFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVAS 326

Query: 576  LDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQPKSEQ- 635
             DST +T   N G+WQEE YQKIK +KE+    L  M+Q++  K+ + ESLP QP   Q 
Sbjct: 327  QDSTGKTVNVNAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQW 386

Query: 636  LNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQ 695
            + KLK  KL +E L+ FL + +S++    +DK   YE  I+ F  S       +  Q GQ
Sbjct: 387  IEKLKAGKLSMEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQQQQGQ 446

Query: 696  LPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHNSLPT 755
             P S  Q+  Q        QSP   ++  L+    +  + P  QN  +++  ++    P 
Sbjct: 447  FPPS--QTAMQT-------QSPQVHVSQSLYKEQRRSRLMPSSQNEASSLLQIRPKLDP- 506

Query: 756  FSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNG 815
                   +N+                 ++S   V   S++QNP                 
Sbjct: 507  -----RDENII----------------MASSGNVMLPSVKQNP---------------RA 566

Query: 816  VNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQL 875
            VNT   NI S+Q+    LQ                        KQ++    Q QQQQPQ 
Sbjct: 567  VNT---NISSVQS----LQ------------------------KQKRFHHRQMQQQQPQQ 626

Query: 876  HQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTHQQQM 935
               Q Q+               Q + +N + MR+ +  K  +               +Q 
Sbjct: 627  GNHQHQM---------------QTNEMNDVRMRERVNIKARLL--------------EQQ 686

Query: 936  KPGTSLPISPQIFQTASPQVAQNSSPQ-VDQQNLLSSITKV-PPLQSASSPLVVLSPSTP 995
               +   +  Q    +S Q+  +SSPQ VDQ  L ++I K   PL S+ S  V       
Sbjct: 687  VSSSQRQVPKQESNVSSSQIQNHSSPQLVDQHILPATINKTGTPLNSSGSAFVA------ 746

Query: 996  VAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTD 1055
             APSP+PGDSE P S  S ++          ++ T   S  +GT     +PLL       
Sbjct: 747  PAPSPVPGDSEMPISVESPVSGV------DEINSTLDSSSKLGT---QETPLL------- 806

Query: 1056 GAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGN 1115
                     V      TE+P++RLIKA ++ SPK+L+ SV+ I SV+SM+D + GS P +
Sbjct: 807  --------FVPPPEPITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSS 866

Query: 1116 -GSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKP 1175
             GSRA +GEDL   T      RNF +H+ +N +K+M+R  + +P ++ S      D ++ 
Sbjct: 867  GGSRAGLGEDLSERT------RNFTTHEETNLSKRMKRSINIVPPDMSSQI----DSYEQ 926

Query: 1176 FTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAA 1235
             +  E S++ ST +S  K + +   + LL+EI+E N RL++TVV I DE           
Sbjct: 927  LSSLE-SEVVSTTSSGLKVNNIAPGYALLQEIKETNGRLVETVVEICDE----------- 935

Query: 1236 DGSEGTIVKCSFSAVALSPSLKSQYMSAQM----------SPIQPLRLLVPTNYPNCSPI 1295
              S GTIV C+++ VALS + K  Y S ++          + IQPLRLL P +YP  SPI
Sbjct: 987  -DSLGTIVTCTYAPVALSATFKDHYKSGKIIFYVSKCLMQAQIQPLRLLFPMDYPYSSPI 935

Query: 1296 LLDK--FPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQ 1354
            +L++  F   V K YEDLS + +SRFS+S++  S+P     IA+TW+ CAR  + EYA++
Sbjct: 1047 VLEEISFDTSVHK-YEDLSARTRSRFSLSMKEFSEPGFSKGIAQTWNDCARATMVEYAER 935

BLAST of CmoCh19G003080.1 vs. TAIR 10
Match: AT1G15770.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 117.9 bits (294), Expect = 6.5e-26
Identity = 101/273 (37.00%), Positives = 150/273 (54.95%), Query Frame = 0

Query: 867  QLHQQQ-SQLHQQGKPQLPAQ-MQAHQLSHLNQIE-MRQGLATKPGMFQHLPAAHRSGYT 926
            Q+HQ+  ++++Q+   +L  +   +HQ    +Q E +++G     GM + L  +      
Sbjct: 10   QIHQRDLNEIYQRVAAKLQQEDSLSHQKQRSDQFEKLKRGKTVLEGMLRFLSLS------ 69

Query: 927  HQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSP 986
             +  +KP           + +      N    ++ Q+L  ++ K+   +S   P+    P
Sbjct: 70   -KSNIKPD---------LKDSMDYRKNNIMNFLNMQSLRKTVQKLQLTKSEIQPM--QQP 129

Query: 987  STPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFS 1046
             +         D         ++  AG+  QQ  +    +QSL IGTPGISASPLL E +
Sbjct: 130  LSQTVQDQSHDDQTTLQMQSMSMQGAGSRVQQ--IRQGVLQSLEIGTPGISASPLLPELT 189

Query: 1047 GTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSA 1106
              DG   N L +  GKSSATE P+ERLI+A+KS+SP+ALS++V  I SVVSM+DR+AGS 
Sbjct: 190  SPDGNIINPLTSTCGKSSATELPIERLIRAMKSISPQALSSAVCDIRSVVSMVDRIAGSV 249

Query: 1107 PGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSN 1137
            PG GSRA+ G DLVAMTKC LQ RNF++ DG +
Sbjct: 250  PGKGSRASFGVDLVAMTKCHLQERNFMTQDGDH 262


HSP 2 Score: 74.7 bits (182), Expect = 6.3e-13
Identity = 59/161 (36.65%), Positives = 91/161 (56.52%), Query Frame = 0

Query: 602 ELKEMYQKILPKVNQLESLP-QQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK 661
           +L E+YQ++  K+ Q +SL  Q+ +S+Q  KLK  K +LE ++ FL +SKSNI   LKD 
Sbjct: 15  DLNEIYQRVAAKLQQEDSLSHQKQRSDQFEKLKRGKTVLEGMLRFLSLSKSNIKPDLKDS 74

Query: 662 IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS 721
           + + +  I++FLN    R    T+Q  QL  S +Q +QQ  SQ    QS D+Q   Q+ S
Sbjct: 75  MDYRKNNIMNFLNMQSLR---KTVQKLQLTKSEIQPMQQPLSQTVQDQSHDDQTTLQMQS 134

Query: 722 ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIP 762
            +MQG+ + VQQ     + +++  + P  S S     +T P
Sbjct: 135 MSMQGAGSRVQQIRQGVLQSLEIGT-PGISASPLLPELTSP 171
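The HSP header lines in the alignments above (score, expect value, identity and positives counts) follow a regular pattern, so the reported percentages can be rechecked from the raw counts. The following is a minimal sketch, not part of the database's own tooling: the function name `parse_hsp` and the regular expressions are assumptions written against the line layout shown in this report only.

```python
import re

# Patterns for the two HSP header lines as they appear in this report.
# `Pos\w+` tolerates spelling variants of the "Positives" label.
SCORE_RE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s+=\s+(\S+)"
)
IDENT_RE = re.compile(
    r"Identity\s+=\s+(\d+)/(\d+)\s+\([\d.]+%\),\s+"
    r"Pos\w+\s+=\s+(\d+)/(\d+)\s+\([\d.]+%\)"
)


def parse_hsp(score_line, ident_line):
    """Parse one HSP header and recompute the percentages from the raw counts."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("line does not look like an HSP header")
    ident, aln_len, pos, pos_len = (int(g) for g in i.groups())
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "expect": float(s.group(4)),
        "identity_pct": round(100 * ident / aln_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


if __name__ == "__main__":
    # Header of the AT1G15780.1 hit, copied from the report above.
    hsp = parse_hsp(
        "HSP 1 Score: 1068.5 bits (2762), Expect = 4.3e-312",
        "Identity = 757/1399 (54.11%), Positives = 952/1399 (68.05%), Query Frame = 0",
    )
    print(hsp)
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when these headers are scraped from the page.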

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
F4I171 | 6.0e-311 | 54.11 | Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana ...
Q9SHV7 | 5.1e-68 | 31.43 | Probable mediator of RNA polymerase II transcription subunit 15c OS=Arabidopsis ...

Match Name | E-value | Identity | Description
A0A6J1HI15 | 0.0e+00 | 100.00 | mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucur...
A0A6J1HR60 | 0.0e+00 | 98.38 | mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucur...
A0A6J1HI99 | 0.0e+00 | 100.00 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucur...
A0A6J1HVE4 | 0.0e+00 | 98.35 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucur...
A0A6J1ELX0 | 0.0e+00 | 84.80 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucur...

Match Name | E-value | Identity | Description
AT1G15780.1 | 4.3e-312 | 54.11 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G10440.2 | 1.3e-69 | 32.07 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G10440.1 | 3.6e-69 | 31.43 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G15770.1 | 6.5e-26 | 37.00 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR036529 | Coactivator CBP, KIX domain superfamily | GENE3D | 1.10.246.20 | Coactivator CBP, KIX domain | coord: 20..97; e-value: 1.1E-23; score: 85.1
IPR036529 | Coactivator CBP, KIX domain superfamily | SUPERFAMILY | 47040 | Kix domain of CBP (creb binding protein) | coord: 19..93
IPR036546 | Mediator complex subunit 15, KIX domain | PFAM | PF16987 | KIX_2 | coord: 18..97; e-value: 3.0E-37; score: 126.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 758..887
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 997..1020
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 92..226
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 547..581
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..21
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 250..527
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 976..1020
None | No IPR available | PANTHER | PTHR33137:SF27 | OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 15A, PUTATIVE-RELATED | coord: 896..1352; coord: 1..784
IPR044661 | Mediator of RNA polymerase II transcription subunit 15a/b/c-like | PANTHER | PTHR33137 | MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 15A-RELATED | coord: 1..784; coord: 896..1352
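The MOBIDB_LITE rows in the InterPro table above list predicted disordered segments, two of which overlap (976..1020 contains 997..1020). A short sketch of how the total disordered fraction could be estimated from those coordinates follows. The helper name `merged_length` is hypothetical, and the protein length of 1354 residues is an assumption taken from the last query coordinate in the BLAST alignments, not a value stated in the InterPro section.

```python
def merged_length(intervals):
    """Total residues covered by 1-based inclusive intervals, counting overlaps once."""
    total = 0
    cur_start = cur_end = None
    for start, end in sorted(intervals):
        if cur_end is None or start > cur_end + 1:
            # Close the previous merged run before starting a new one.
            if cur_end is not None:
                total += cur_end - cur_start + 1
            cur_start, cur_end = start, end
        else:
            # Overlapping or adjacent interval: extend the current run.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start + 1
    return total


# mobidb-lite coordinates copied from the InterPro table.
DISORDER = [
    (758, 887), (997, 1020), (92, 226), (547, 581),
    (1, 21), (250, 527), (976, 1020),
]

# Assumption: protein length inferred from the final query position (1354).
PROTEIN_LENGTH = 1354

if __name__ == "__main__":
    covered = merged_length(DISORDER)
    print(f"{covered} of {PROTEIN_LENGTH} residues "
          f"({100 * covered / PROTEIN_LENGTH:.1f}%) predicted disordered")
```

Merging before summing matters here because a plain sum of segment lengths would count residues 997..1020 twice.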

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CmoCh19G003080 | CmoCh19G003080 | gene


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh19G003080.1:exon:6245 | CmoCh19G003080.1:exon:6245 | exon
CmoCh19G003080.1:exon:6246 | CmoCh19G003080.1:exon:6246 | exon
CmoCh19G003080.1:exon:6247 | CmoCh19G003080.1:exon:6247 | exon
CmoCh19G003080.1:exon:6248 | CmoCh19G003080.1:exon:6248 | exon
CmoCh19G003080.1:exon:6249 | CmoCh19G003080.1:exon:6249 | exon
CmoCh19G003080.1:exon:6250 | CmoCh19G003080.1:exon:6250 | exon
CmoCh19G003080.1:exon:6251 | CmoCh19G003080.1:exon:6251 | exon
CmoCh19G003080.1:exon:6252 | CmoCh19G003080.1:exon:6252 | exon
CmoCh19G003080.1:exon:6253 | CmoCh19G003080.1:exon:6253 | exon
CmoCh19G003080.1:exon:6254 | CmoCh19G003080.1:exon:6254 | exon
CmoCh19G003080.1:exon:6255 | CmoCh19G003080.1:exon:6255 | exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh19G003080.1:five_prime_utr | CmoCh19G003080.1:five_prime_utr | five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_2 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_3 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_4 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_5 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_6 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_7 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_8 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_9 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_10 | CDS
CmoCh19G003080.1:cds | CmoCh19G003080.1:cds_11 | CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh19G003080.1:three_prime_utr | CmoCh19G003080.1:three_prime_utr | three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
CmoCh19G003080.1 | CmoCh19G003080.1-protein | polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
biological_process | GO:0045893 | positive regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0031490 | chromatin DNA binding
molecular_function | GO:0003713 | transcription coactivator activity
molecular_function | GO:0003712 | transcription coregulator activity