Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAGAAACATTGAACAAGGGTTCACGAAGAAGACAAGAACAGGGTATTTATTTCTTGCTTTTGTCTTCCATTCTCCACGAGAGGACGCTCCTCGATCCACATAGAACACACACAAAAGCTTCCTCTTGAATGGCGAGTGAATAGCAAGCCTTCGGTCGTTATCTTCTTTGAAATCGCCTCGATTTTCTCAATTTTCCTATACATATCTGAATTAAACTCCTGGTTTCCTGTAATCAGTGTCCAAACAATTTCCGAGTCGATTCTGCTCGATTTCTGCTCGATTTCTGGCATTTATTGAGCACACCCTCACTGTTTGTTAGTGGGATCTCGACAAATTCTTTTATCGTAGGTTGTTTGGAAACTTCTGAATGGATACTAATAATTGGAGACCTACTCAAGGTGGAGAACCCGGAATCGAAGCCGGGGATTGGAGGTCTCAATTGCAGCCCGATTCTCGGCAACGAATTGTCAACAAAATGTACGGTTTTTCTTAAATGTTTATTGCATCGTGCCCTACTCTTATTATTTCTTATTTGAGGAATGCGTTTGAGTTTTTTTTATAGCTTTTTTGTTTCGTGTTCATTGGTTAGGTTTAGTTACTGTGAGCAGCAAAACTTTGGAAATATAGAACATTTTCTCGGATTTTGTGTATAACTTATAGATGCGAACTGTTCCCAGAACGTAGTTCGTTAAAATTGCCCCCCATAGATTTTCCAATCTCTGAATTGAAGATATAGTTGGTTAGGTTTAGTGGAGGAGGTTCCTATTTGTTTCTTCAGGTAATATCCCCTTCTTTAAGCATAAGAAGCATAGGCTATTTGTCGCAGAATGGATCTTGTTGAATTGTTCTTGACCGCTCTAAATACATTCCCTTTGAAGTATCTAATATTTTGAGTCATGGACATGGATTGCTACGGTAGTTTTCCTTTCAAGAACAGCTTCCCATGCTTTTGTGTTTGATAAATTAGTTAGACATTATGTTTTCCACTTTTGTGTTAGAATGGAGACATTGAAGAGGCACCTTCCTGTTTCTGGTCATGAGGGATTGAGTGAGCTGAAAAAAATTGCTGTAAGGTTTGAGGAAAAGATTTATACTGCCGCTACCAGCCAGGTATTTCTTACTCACTTTGTTGTAGTTTAAATCAAAGATTGTATGAATAAACATATTCTCATTTACGTTTAGCCCATATGCTAATATCAATAAGATTCCAACTTGATTTGGTACCTAAGATTTAGCTATGTATTCTTGCCCTAAACACAGCAGTGTTTTAGCGATCTCTTGAGTACTCTCACCGATTCTATTTTGAAACTCTGTTTCAGTCAGATTACCTAAGGAAAATATCTCTGAAGATGCTTACGATGGAAACAAAATCTCAGACTCCTATGGGAACTTCATTACCATCCAATCCTATGGTGCCTAGCAATAAGCCTCTGGATTCTGGTAAGTGATCATCAGACAATCTGCCACTATGTTGTATACGTGATGAAAATTTTAATAACATGTTGAAGCCAATTTGTGAATGAATTTCTTAAAGAGTCTCTGTTCTTGGATATTGTAACAATTTCTCAATTAACATATGGCGTTTTATTTGACTTGATGTTGTGTATTTTATCTTAAACAGCATCACAAAGCATGCAGCCCCAAGTTCTAAATCAAGGGCAATCAATTTCTGTTCCTCAGTCTTCTAATCAGCCTCAGTCACGCCAGCAACTACTCTCCCAGAATATCCAAAATAATATTGTTTCTCAGAGTTCATCCAGCTTACCTTCTGCAGTACCTCCTGTCTCTGGATTAGCTTCATCTTCAATGCCAAACATGGTTGGCCAGAATCCAAGCATGCAAAATGTATCTGGAATTCCACAGAATTCTGTTGGAAATGCTATGGGTCAAGGGGTTCCTTCCAATGTATTTACCAACTCTCAGAGGCCAGTACAAGGAAGACAAGTAGTTTCCCAACAGCAACAGCAGCAGGCACAAACCCAGCAACAGCAGTTTCTTTTTCATCAACAACAGTTACAGCAGCAGATGATGAATAAGAAGCTTCAGCAGGGAAGTATACCACAACAACGTATGCAATCTCACATTCCACAACAACAACAGAATCTAATGCAACCAAATCAACTGCAATCATCTCAGCAATCTGCTATGCAACCTTCTATGATGCAACCTTCTCTGTCTAATCTTCAACAAAACCAACAATCTTCCATTCAACAGCCCACTCAATCCATGCTTCAGCAACCTCAACAGCCAGTTCTTAGGCAGCAGCCACAGTCCCAGCAACATGCTGTCATGCATTCACAGTCTACAATGTCACAGCAGACTAGCTTGCCTTCACAGCAGCAACAACAGCTAATTAGTCAGCAACCAAATTCTTCAAACATGCAGCAAAATCCATTGATCGGGCAGCAAAACAGTGTTGGGGACATGCAGCAACACCTGCCTCAGCAATCTAGGTCGCATGGGCAGCAGAGCAATCTATCAAATATGCAGTCTCCACCATCGCAGCAGCAGCAGTTAATGGCTCAGCAAAACAACCTTTCAAATTTGCAGCAGCAGCAGTTAGGACCTCAAAGTAATGTTTCTGGACTACAGCAACAACAAATGCATGGAACTCAGTCAGGTAACTCAAACATGCAATCAGATCAGCACCCAATGCATATGCTGCAACAGAACAAGGTTCATATGCAGCAGCAACCTCCACAAAATGCATCAAATTTATTGTCAGCACAAGGTCCGCAAGGGCAACTTCAGTCATCCCAACAGTTGATGTCACAGATTCCGTTGCAGTCCACCCAAGTCCAACAACAGGTACCTCTACATCAGCAGCAGCAACAGCAGCCAAATGCCATGTCACATGATCTCCAACAAAGGCTTCAAGTTGGAGGTCAAGCACCAAGCTCCCTGCTTCAATCACAAAATGTAATGGATCAGCAAAAGCAGTTGTATCATTCACAAAGAGCTCTTCCAGAGACATCATCGAGTATGTTTTCTTGCTAACTTTAGAACCTTGCATTTTGAATTTTTGATGCTAACATTATCATTCTTCGTCAATTTTTTTGTGATAAGTGTGGATTGCAGCATGCCTCTTTCTATTATATTCGTTTGGTCTACCTTCAGCATTTTTGTGCATGAAGCATGGACGTAAACATGTGACACAAATACGATAAAACATGATTTTCTTGAAAAATTAGGAGAGACACAACAGTGAGACAATCAATACAGTTCTAAAAAAATATTATGCAGCTCAGAATTTCATGTTTATTATGTTCTTTATTTCCATCTCTTTTCAGCTGAAAACTTATTTCTGTATCTAGGAATATTTTACTGAGTTTTGTTTTTCCTTCTATTACCATTCTTTTCATAATAAGAAATTGTAAATCAACCTTGAGGTTGGCGGGAATGGATTCAGAGGAGCTATTCAAGAAAACCTTCCAAATCTAAATGGGGGAGAAAACAGTGATTATTGTTATTGTTGTATCCATCAATATTTGTGCTCGCTTGTGCAAGCCTTAATTGCCTGACCCTGATTTTGTTTGCCAACGAAACTTATAAGATTTTAAACTACAAGGATGTGGCCACCGTGGTTGAAACTTATTCTCTCTTAGCCATGTATTGTTTTTTCACTGGCCCCAGTGTCCACTATCAAGGTGGTTTAGATAAAGAATAAATGTGATTATTAGACAATGATTTGTGAAAGGCACCGTAATGCGAGGGCATGTACTGGATGTAGTTCAGAAAGCATCTGTAGATTTCTTTTGTTGTGAAAAATTTCTTGGTTTCTGTCTAAAAAAGCTATCAGGAGCTAACTTCACACTTCTAAATAATTCCACTTTTCCTTTCAACCACCAACTTATAAGCACTTCACATAACTAGTCTTCTATATAATCGGGAAGGCACAAGGATAAGTTAAATAATCCAAATGTATAAGACTAACCCTCAGTAATAATCCTACAGTTGAGGAAGACGTGATCAATGGCTTTTCATTCCCCCCTCCCCCCTCCCCCCGTCTGTTGGGCCTTCTACTGTGGCCCCAATTCTCTATTTTCTTTTACTGTTATCAACGAAGGCTGAGTTTCTTATTACTAGGGAGCTCTCTATCTCTATGCTGCAGTTCAGTTACATCCCCTCGGCTCTTATTTTCCATATCCACTGACCTTTATTTCATTTATGTAGCATCTTTAGATTCCACGGCTCAGACAGGACAAGCCAATGGTGGAGATTGGCAAGAGGAGATTTATCAGAAGGTGCTTAGCTAACACTTCATCTCTCTTCAATGAGTTTTTCTTTACTCTTTAGAGATTCAAAAGGAAGTTCATTCAATTTGCAATATTAGTAGAGGAGGTACTATTTGCAATATTAGTAGAGGAGGTACTATTTGCAATCTTCAATAATTCAGTTTTAAGTTGTATAAATTATAACTTTTTATGATAAGATTTAACAAATCATCAAGTTAACTTCGTATATTTTTAACATCTAAATTTTTTAGAAACAGGTATTAACTTATGTTCATTCATTTTGTAAAAAGTTAATTAATCTGCATATTGTTCTTTTTCAATTCATCCAAATATATATATTTTGTTGTGTTGTAGTCATGTATATTTAAATATCACATTTTTACAAAATATAATTAATGGGCTGTGAGGTGGGGCTATTCACTTCATCTCACTTGAGTCTCCCCCTTGGTCATAACCTTAGAAGTGCTATTTTATGGACCCAGTGGTGGAAGAGATTCAGACGATATTCCTCTTAGGAAAAGATTTTCTCTAAGCGAGGGAGACTCACCTTAATCTAGTCTGTCTTAAGTGGAATCCCCACTTGCATTTTATCCTTATTTAGGGTCCTTATCAGTACTCTGGAGAGGCCATATGTAACTTTCTCTGGGAAGGTGTGGAGGTAAGGAAGAGTCCTTGCATGGTTGGGGGATTGGGTATTGGGAATTTGTGGGATGTAGCAAGACCTTGCTAGCTAAGTGGTTGTCGCACTTTTCTGTGAGACTGACAGTTTGTGGTATAAGGTCATTTCGAGCAAGTACAGCCCCCATCCTTTTACTTTGCTCACGAAAGGGGCAATAAGTACAACCAAAAATCCTTGGAAAGCCATTTTAGACGGTCTCCCTTTCTTTTCTCAATTCATTAATTCTATTATGGGTGATGGGTCTAGTACTTACTTTTAGGAACACAAGTGGTTGGGGATAACCCTCTCTGAATGTGTTCCCTCGTATTTATCATTTGTTGAATTCGAAGTTCTCTTCTGTGGTTTATATCCTTCCTTCTTCTAGGAACTCTTCATCGATCTCCCTTGGTTTTCGTGACTCTCTCTCTAATGGTGTTATAGCCCCTTTTGGCTTTAATTAGTGATATCACTTTATTGGAGGAGAAGTGACTTTTGGCTTTGGAACCCTAGTCCTTCAAACAAGTTTTCCCGTCAATCGTTATTTCTTTGTCGAGTGGGTTTGTATGGTCTAAGGACTCTCCCATCTTCTCAATTCTTTGGAAGGGGAAAATTTCGTAGAAGGTTAAATATTTTGTATGGCAGGTTCTCCTTAGCAAGGTGAATACTCTTTATCATGTTAAGAGGTTCTTGCCCAAGTTCATCGGGTAGGGAAGCTGTCGAGGACCTCGATCACCTTCTACAGACATGTTAGTTTGCTAGATCTCTTTGGTGCAGATTCTTCAAGGCGTTGTCAGCCACGGAGCTTCAGTTCCATGTTTGAGGAGTTTCTTACCCATCAACCTTTGAGGAAAAGGGTAGCAGGCTTTGGTCTCTTTCTAGATTTCATGCGTCTCTTTGAGTTTCGGTCTATAAAGAATTTTGTAATTATCCCTAAGGTCTTATTTTGCTTGACTGAAGCCCTCATGAAGTTTATTTTTTTTATTATTATTATTATTTTAATATTTTTGGTTTATTCTTTTCTTTTTCTCGATGAAAGCTTGACTTTTAATTTTTTTAAAAAAGGAGAATAGTAAGATATGATAAGATATAGGCATTCGGGATATTCATTGGCATGGTGAAAAGTTACCAACAAGTTGTAGTTCTTCCGTGAGGAGTTTATAAAAGATACTTCAATTTGAGTCAATTATAAAGGGAGTAAAAAATAGTCTTTGAGTGAATACCAAGAACATGATTTTTGTCAATATCTAAGTGTTTTGGCCTTGCTGTGCATCTCAACTAGTTTAGTCCTAGGACATCAACATTATCTCTGCTAGTCTAGTCCTAGAATGTGCATTTGATTCTACAACATTTGGTTTATAGAAAATTGTTAGAATATCAATGTCCTAAGTTTCTTTACTTTTCCACATGCTCCAATGCTATTGGCCACCAAAGTTGGTTAGTAGAATTGTGGGCTTGTTGCTTCTTTCAAGCCTCAAAGTTTGAAGTAGAGCTCTGATTGCATTCATTCACCACACTTTTCCTTTTCTTCTTTGCTAATGATCACACAGCAATACAACGTATGGATTAGGTTTAGCTTACTCAAGAAACAAAAAATCTGAATGAGAAAATGCAAACATTAGGAAATGGACAGGGCAATAAAAGATGGGTCGGATATTCCTGATCCTTCCTTCAAAGTATGCACCAACTAGATATTAAAGTCATGTTTGGCTATCTTTTTTGAAACCTTTCAGGTGCATTCACCCCTTCATAGAAAAAATTGACTTTTTCGATAACATTCCTTTACATGGTGTAAAAGAAACTTCCTTGAGCCTCGAGAACAATCAAAGCAAAATCAACTAAATAAACTTGGCAAATGGAGATCCCAAGGTCATCTGTATTCCTCTCTTCTAAAGCATAAAAGCTTAAAAGCGAAGAGTTCTTGTTCCATCTCTTCATTCTCTGGGCACTTCTCTTTTGCTTTGAAGGTTTGGAATTTTGATTACCGAAGGTTTGGTTTAGTCTGGTGTGTGTCAATTTTCGAAGATTGTAGACAAGTGGCTCTTTGAAGTTATTTCTGGCTGTTGATTTAAAGTCAAGGCTAGAGTGTTCTATGCAGCTATAACTCATTTTTGCTTTTTTAGGAGAGAGAGATCTTTACCAGCAATAAAATTCCTTTGAATTCCTTTGAATTGGGTTGCCTTATGCAAATCCTTATGTAGAACTCTACTTCTAAAAACCTAGACTGGAGACATTATGTTTGATCTCCCACCTTTGGCACGAGCAAGTCTCATCTCATGCCTTTAGGCTATTTCTGTTGGTTGGTACTAACAAGGCCTTTCCAATAAAAATGAGAGAAAAGGGGGTTTAATCTTTAAAATGTTAATGAACTTTGAGAAGGAACAAAAGGATGGATTGGGATTTCCCCCTTGCTGTTCTGGACAGTAAAGGCGTAGGTTCTAAATTGAGGTATTGTGATAGTGGCCAGTTGCTTTAGTTGCCTCCTTGAGAAAGGGTTCAGTAACATTCATATCGTTGATGACGATACTCTTGTTTTCTCAATGTTTGAGTAGGGAAGATCCACAATCTCTTCAATTCATTTATATCTTTGAAAAGATATTCGGGCCTAAGTGAATGGGACTAGATCCATGTTTGTGGGTATTAATTGATTGCATTGGTGAAGATTGTATTGATTAAAAAGTCAGCAGTGTGGCTGAGGACTTTGGTTTTAAAGTAGTTGTTCGGGCTTTGGCCGTTGATTATTTAGGCATCCAGCTGCTTTGAAGTCCAAGTGTGATGTTTTAGCTGCATCTGGAGAGAATTGCATCTAAAGCCTTTAGAAGCATTTCTCTAAGGAGGTGGGATTCTACTGCTTTCTTTGTTACTCCCTAACTTTCCTAATTACTTATTTATGTATTTTTGAGGCGCCCAAGTGTTAGAGGATTTAGTTTCTTAGGTATCTTTTGTCAAAAGGTCTGATATAGGAGGGTGGCTCCTGTTTGATTAATTGGAACATAGAGTCTCTACATAAGGCACCTTAACGTAGGATGCAACTTTCTTTTTCTTTTATTTTTTGTCAAGGAGGATTCTTTGTTCTACTTCTTTAGCAAATGTCTTTTTCTCTTTGTAGAATTTAGAATCCGTTTAAAATTTTTGTTTAACTGCTTTCTCCTCCTCCTTTTTGTAGATTAAAGCTATGAAGGAGTTGTACTTTTTTGAACTGAAAGAAATGTACCAGAAAATTCTTCCAAAAGTGAATCAGGTATGTGATTCTGCATCTATATCCTAGGGCATTTGACTTTTTGATATTATTAATTGCGGAAGAAGCTGAATCCTCATTGAGTGAACCCATTGATCTATCACCTTTTGTTCATATCTTCGAGCATTGCAATTTTTTTTACCACTTCAGCTGTTTAATGCCCTTTCCTTCACCCGCATTGTAGTTTTTCTCATTCCCCTTATTCCCCTTATTGTTATGCACTGATGTTGTCTAATTTGATATTTTATTGTGCAGTTGGAAAGTCTTCCACAGCAACCGAAGTCAGAGCAGCTAAATAAGCTAAAGACATTTAAGTTAATCTTGGAACGCCTTATAGCATTCTTACAGATTTCAAAAAGTAATATCGTAATTGGATTGAAAGATAAGATAGGCCACTATGAGAAGCAGATTGTTAGTTTTTTAAATTCTAATAGGCCAAGGAATCCAGTATCTACACTGCAGCCAGGACAGCTTCCTGCCTCTCACATGCAATCTATACAGCAGGCACAATCACAGATGACTCCATTACAGTCTCCTGACAATCAAATCAATCCCCAACTACATTCGGCGAACATGCAAGGTTCTGTGGCTCCGGTGCAGCAGAACAATATGAACAATATGAACAATATGCAGCATAATTCTCTTCCAACTTTTTCAGGATCAGCAGCACAACAAAACATGACGATCCCAATGCAGCCTGGTTCGAGTTTGGAATCAGGACAAGGAAATTCACTGAGCTCGTTTCAGCAGGTTGCTTCTGGGTCTTTGCAACAGAATCCTTCCAACAGTTCCCAAAGGGCAAACAATAGTTCTTTGCCATCACAAAATGGGGTGAACACTCTGCAGCCAAACATCGGTTCTCTTCAAACAAATCATAACATGCTTCAACATCAACATCTAAAACAGGATCCGCAACAACAGCTGAAACAACAAATGCAGCAGAGACAGATGCAGCAGCTAAAGCAGCAGCAGATGCTGCAGCACCAGCAACAGCAACAACAACCACAATTACATCAGCAACAGTCGCAGCTACACCAGCAAGGAAAGCCGCAGTTACCTGCACAAATGCAGGCACACCAATTGTCACACCTTAATCAAATCGAGATGAGACAGGGGCTTGCTACAAAGCCAGGGATGTTTCAACATCTCCCAGCGGCTCACCGCTCAGGTTATACCCATCAGCAGCAGATGAAACCAGGAACTTCATTACCAATTTCACCCCAAATCTTTCAGACTGCATCCCCTCAAGTTGCTCAAAATTCTTCTCCACAGGTCGACCAACAAAATCTACTTTCATCCATTACCAAAGTTCCACCTTTGCAATCTGCAAGCTCACCTTTAGTTGTACTATCCCCTTCAACGCCTGTGGCTCCATCTCCAATGCCGGGTGATTCAGAAAAACCCACTTCTGGTGTCTCAACACTTACAAATGCTGGGAATACTGGACAACAAACTAGTGTGTCTGGGACACAAGTCCAGTCTCTTGCCATTGGTACCCCCGGGATATCTGCCTCCCCATTACTTGCTGAGTTTAGTGGTACAGATGGCGCTTATGCCAATGCATTACCAACTGTTTCCGGAAAATCAAGTGCTACAGAGCAGCCTCTTGAGCGCCTAATTAAAGCTGTTAGTCAAAACCTTGAGTGCTTTTAGTTGTTTCTGGCTTAAGTTGTTACCTATTCTTCTTGATTGTTTATAACATTGGTAACCTTATTATGGTTTTCAGGTTAAATCAATGTCGCCTAAAGCTTTGAGTGCCTCTGTCAATGGCATTGGATCAGTTGTTAGTATGATTGATAGGGTAGCAGGCTCGGCCCCTGGCAATGGGTCAAGAGCTGCAGTCGGGGAGGATCTGGTTGCCATGACAAAATGTCGGCTGCAAGCAAGAAATTTTGTTTCACATGATGGATCAAATGGAACAAAAAAGATGAGACGCTACACAAGTGCAATGCCCTTAAATGTTGTATCATCAGCTGGAAGCATAAATGACGTTTTTAAACCGTTTACTGGTGCAGAGACATCCGATCTCGAGTCAACTGCAACATCTAGGGCCAAAAGGTCCAGGGTTGAGGTAATAACAACTTTTACATATTGTATTTTCTATGTGGGTAAAATAGTTTTCTTTTTAAGTATAGATAGATATTGTTAAGAGATGCCACAAGTTGAAGAACAGATGACCTTCATCTTTAGGTATCTAACTACGTTTAGAGATCACACATATGCTTCACCGTTTTACATATAATTCCAATTTCATTAGGAATTTATATTGTTTTATTGGATGTCGTTCAATATTCTAACTAACTTCATAACCTACTTGGACATTTTGAACTTGATATACTTTTTTTTCGTTCTTCGATTTTTGCTTTTTTTGCTTCTGGTGGGGGGTTAGGGTGGAGGGTGGTATTAGTCGTCACTTACATTGTTGTTAAATATTAGCATATTTAAAGAAGTGGTTTTTATTTTTTTAAAAAAGGCATCAGCATTATTTTAGTCCCCCTGAATTCAAATTTCAACTTTTTCCCTTTTTATTTATTTATTTGATGGTCCAATGTCTTGGAGATGGAGAAAAATTAATTTCAAGCTATAGTATTCGTGTTCTTAGATTGTCCGTGTTATTGGCTTGATACATTATATTTGTACGATAGAGACTTGAATATGAAAGCAACTAAACAAATTTATAATGTATTATAGATGATAGCAGATAGAATAGGCAGTGTGATTAAAATGTCATATTCAAGTCTCTATCGTACATTATATTTGTACGATAGAGACTTGAATATGAAAGCAACTAAACAAATTTATAATGTATTATAGATGATAGCAGATAGAATAGGCAGTGTGATTAAAATGTCCACCTAGTTTAAAATTTAAAGTTTCCTTTTGGCTTGCAATTTTCTTATCAATCTTTATGTTAGAATATCAACTCAATATTGTCTGTTTCGAACTTGTGTAGGCCCACTTTAGTTTTATCAATCACTTGCTTTGCTATTTGAATAATATATATATATATATATATATATATATATATATTTACCTGTTGCAGGCCAATCATGTTCTACTGGAAGAAATAAGGGAAATAAACCAACGTCTTATCGACACGGTGGTTGTAATCAGTGATGAAGTAGTTGATCCAAGTGCTTTAGCAGCTGCTGCTGATGGGAGCGAAGGAACAATTGTCAAGTGTTCTTTTAGTGCTGTGGCTCTCAGTCCCAGCTTAAAATCCCAGTACATGTCCGCACAAATGGTGGGCAGATTTCTTAATTCTAGAATTTTTTTGATTGAATTATGATCCATCCAAAGAGCAAAAAGAATCGACTAATTATTCTATGATTCATAAATTTTGACTCGATTTCTGTATGGGATAAAGAGTTGTTGGTGAAAAATCGAAAACTGTAAAAGTCGAATTTTTATAAAAAAAGGAATCATAGTTTAAAAAACTAAATAGAATCATGAGCTAAAAGTATCCAATTAAACATAGTTAAAAAAAATTCTTTTTATAGTTAGAATTGGTTGGACGTGCCTATCAAAAAAAAAAATATCAGGTTCTTCATTGTCTGCGTGATTATCTGTTTTCTTACCCGTGTATTATTATTTTGCAGTCTCCAATTCAGCCTCTACGGTTACTTGTTCCTACAAATTATCCAAATTGTTCTCCAATACTCCTAGACAAGTTTCCTGTTGAAGTCAGGTTAGTTTACATGGGTTCTGTTTTTTTTCTTCTTCTTTTTTTTTTTTTATCAGCTTTTTTTCTTTTTGTCTTCCATGAACCCTTCTCTACGTTCAAAGCATATTAAATATATATGTGCAGCAGGCTATTTGCATCAAATAAATAAATAAATAAAACGATATCATAAATTTGTACAAAATTTAAAATGTATGTGCCATTTTCAAATCTAAACTAACTAGAAAGGATGAACAAATTTCTAGTTGAACAGATTAATGGAATAAATTGAAAATGACCCTTGATTTTAGGACCGAAGTTCTAGTTAATCAGATTGATACTTCTCGAGTTATTCTTAGGTATTTGTGCATCCTCACACCTTAGTTTTGGTGGAAGCATAGGCTAGGTAGGCTATTAGATTGCAGATTGATTCTTTGAAAAGGTCCACATGACAATTTGATCTAGCGACGACCTTCCTCATGAAGTGCTTTATCAATCACTGCCTTTTGTGTATAGAAACTAAATGTTTATTGCTTCTTGTATGCATACATGTATGCATATCTGATGGTTTTTTTTTTTAAATGGGAGACAATTTCATTGGCACTTTTTCTATGTCAAAGATTTTTCTGTTTCTCTACATCCTAAGATATATATTGTTTAAATAAGAAACGGGAAAACTTCATTCATCAAAAAATGGAATGATTGGTGTGTAAATCTACAATTGGTGAATTATGTTTGTTGTCTCCATTGGTGGTGGCCCTGGGATATATTTTCCTTTCGTACAAGCAATACATACTCATTGCGGTTACTAGGTCCTTTGGCTGGTAGAGCTCCGCCTCAACTGATACAGACTCTTGCAATCTGCTGAGATATAACTGAATCTTTTGATTTTGAGTTAACATTCTAGCTCTTAAAACCAACACTTCCAATTTGTTTTAGCTTGGTGAGCTCCCCTAATTTATTGCTTTAGATGGAAAGTTCAAACCGTAAGTCGCATTGGTGGACAAACGCTATCTAGGTGGGATCCGGAATATCTTGTACCATTTGTAAATACCATAGCTGAGCAATTCCCTCTAAGTGATAAGAGGCTAAGCTCACCTTTCTACCTTAGGTGTCTGATTGATGTCGGAAAAAAGTGTTTAGGCTAATCCAACCTAAAGGGTCTTCTAGCCCATTGAAGCAAGGAAAGTCAAAATTTTAGTACAATCAAGTTTCCTCCAAAACTCCAAAATTTTTAGAAATGGTTGTCCTAGAAATCCCTTCTCCTTGTTTTTGCTTGACTTTGAGCACTTGAAGAAGTCAGACCTTCTTTGTGTATTCCAACTCGAGAGCATGGCATCCAATTTGACATGAAGTTGCTCAGCCTTCTCATTATAGGCCAATTGCCTACTCTTCGGACAAGAGTAAATTGAAGAGTTTGGTAAGTTGTTCTTGCACAAATTCACCCCTAACTGATGGCTTTAATACCAAATTGTTATGGGCCCAAATCTGCTATCGAGTTATGGGTCTCTGGTAAGTGGAGGTAGAAGGTGTTTGTATGAGCGAAGAGAGAGTAGAATTCTTCTTTCGGCCTATTTTGAATAATTGAATCCCAAATAAAGGGAGCCTTAATACCTCTGAGTTAAAGTACTCAAAGACTAATTGATAATTTTAGGTACTGTATTCATTGGTGTTTTTGACTATTTGTACAAGTTGAATAACAACCATTGTTACGTCAAGAGACATGACATCCAATTTTGCATGAAGCTGCTCAACCTTCTCATTATAGACCAATCGCCTGCTGTTCAGATAAGAGTAAATTGGTAAGCTGTTCTTGCACAAATTCATTCATAACTATTGGCTATGTTACGAAATTGTGACTGGCCCAAATTTGCAATCGAGTTATGGGTCTCTAGTAAGTGGGAGATAGAAGGTGTTTGTGTGGGTGAAGAGAGAAGTAGTACACTTGTCAGGGCGGATCATTCTCCTAAAATGAAAGTGGAGAACTCCTTAAAGATACTGAAAAAAAAAAATGCATAAAGAAAAATTCAACCAACAACGAAAAATAATTTAGAATTGGTTAGAAATAGAAATGAATGTGAAGACTCCTTAAGATACAATAAAAAAATGCTAAAGAAAAATTCAACCAGCAACTAAAAATAATTTGAATTGGTTAACCTTCAATTATTTTGTTTTCTTTGTAATCGTCAGTGGGTTCGGTACGTGACTGCTTCTTTCTGCCATGAGTATAACGTGCGTACATTGTAGTAGCATTGTATGAAAAGCTCCTTCTCATCTTCGTTTAGGAAGAAAGAAAATGTTTTGTGACAAGCGGGTATATGTATACTTTCTTGGGCTTTGGAGTGAAAGTGACAATAGAACTTTTAGAGGATGGAACGTCATTCTAACAACGTTTAGCTTTGTTGTTAGGTTCCATGTTTCCTATGGGCTTCAGTAACAAAGCTTTTTTATAATTATTTGCCTATGCCCTTATTTTTTTTCTCAATGAGAGTTTGATAATTAAAAATAGACTAAAAAATAGGCCAAACTGACTTCTCTTGCACCCTTTTTCATTTTTTTTTTATCTTATTTTTTCTCTTCCACGAAACATTGGGCGATTTGATCATCTTCTGCCTGTCTTCTCTTTGGTGAATGCCTTGTTTTTCTCTGTACTACCTGTGTCTATGTAAGGAAATAGAGAATAAGGTAGAATATTAGGTTGGCATATTTGTCTTTATTTAAGAAGTTTGTGGGAGGGAGAGTCCATGCTTCTTGCTTAAAATCCTGAAATTTTCTGTTAAATGTCATTACTATATTTGTTAATTTCGTTTCGGTTTTGGTTTCTTTGGTCCCTAGCAGTCTATTATCCTTGTTATCATCATTTTCTTTTTGCAGGAAGGAATACGAAGACCTTTCAATAAAGGCCAAGTCAAGATTTAGCATATCCTTAAGGAACCTTTCACAACCTATGTCACTTGGGGACATAGCGAGAACTTGGGATGTCTGTGCACGTACTGTTGTTTCCGAGTATGCTCAGCAGAGCGGCGGTGGCAGCTTCTGTTCAAGGTATGGAGCTTGGGAGAACTGTTTGAGCGCTGCATGATTCAAAGAATCGATCGAAAAGGATTTGGCCGACTGACCAGCTCGGTATGTTTAGGGAAGACGGGCTGTTTATTCAAGTAACGTTTGCGTATCATAAGGTATGCCTCACTAAAGCTGAAAGTACACGCTGCATGTTGTTTAATCAGGGCTAGGCTAATTCAGTATTTATATTACCACGGTGGATCTGCTTTGCTTGTGAACACTGATTCCTTTTTGGTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAT
mRNA sequence
AGAGAAACATTGAACAAGGGTTCACGAAGAAGACAAGAACAGGGTATTTATTTCTTGCTTTTGTCTTCCATTCTCCACGAGAGGACGCTCCTCGATCCACATAGAACACACACAAAAGCTTCCTCTTGAATGGCGAGTGAATAGCAAGCCTTCGGTCGTTATCTTCTTTGAAATCGCCTCGATTTTCTCAATTTTCCTATACATATCTGAATTAAACTCCTGGTTTCCTGTAATCAGTGTCCAAACAATTTCCGAGTCGATTCTGCTCGATTTCTGCTCGATTTCTGGCATTTATTGAGCACACCCTCACTGTTTGTTAGTGGGATCTCGACAAATTCTTTTATCGTAGGTTGTTTGGAAACTTCTGAATGGATACTAATAATTGGAGACCTACTCAAGGTGGAGAACCCGGAATCGAAGCCGGGGATTGGAGGTCTCAATTGCAGCCCGATTCTCGGCAACGAATTGTCAACAAAATAATGGAGACATTGAAGAGGCACCTTCCTGTTTCTGGTCATGAGGGATTGAGTGAGCTGAAAAAAATTGCTGTAAGGTTTGAGGAAAAGATTTATACTGCCGCTACCAGCCAGTCAGATTACCTAAGGAAAATATCTCTGAAGATGCTTACGATGGAAACAAAATCTCAGACTCCTATGGGAACTTCATTACCATCCAATCCTATGGTGCCTAGCAATAAGCCTCTGGATTCTGCATCACAAAGCATGCAGCCCCAAGTTCTAAATCAAGGGCAATCAATTTCTGTTCCTCAGTCTTCTAATCAGCCTCAGTCACGCCAGCAACTACTCTCCCAGAATATCCAAAATAATATTGTTTCTCAGAGTTCATCCAGCTTACCTTCTGCAGTACCTCCTGTCTCTGGATTAGCTTCATCTTCAATGCCAAACATGGTTGGCCAGAATCCAAGCATGCAAAATGTATCTGGAATTCCACAGAATTCTGTTGGAAATGCTATGGGTCAAGGGGTTCCTTCCAATGTATTTACCAACTCTCAGAGGCCAGTACAAGGAAGACAAGTAGTTTCCCAACAGCAACAGCAGCAGGCACAAACCCAGCAACAGCAGTTTCTTTTTCATCAACAACAGTTACAGCAGCAGATGATGAATAAGAAGCTTCAGCAGGGAAGTATACCACAACAACGTATGCAATCTCACATTCCACAACAACAACAGAATCTAATGCAACCAAATCAACTGCAATCATCTCAGCAATCTGCTATGCAACCTTCTATGATGCAACCTTCTCTGTCTAATCTTCAACAAAACCAACAATCTTCCATTCAACAGCCCACTCAATCCATGCTTCAGCAACCTCAACAGCCAGTTCTTAGGCAGCAGCCACAGTCCCAGCAACATGCTGTCATGCATTCACAGTCTACAATGTCACAGCAGACTAGCTTGCCTTCACAGCAGCAACAACAGCTAATTAGTCAGCAACCAAATTCTTCAAACATGCAGCAAAATCCATTGATCGGGCAGCAAAACAGTGTTGGGGACATGCAGCAACACCTGCCTCAGCAATCTAGGTCGCATGGGCAGCAGAGCAATCTATCAAATATGCAGTCTCCACCATCGCAGCAGCAGCAGTTAATGGCTCAGCAAAACAACCTTTCAAATTTGCAGCAGCAGCAGTTAGGACCTCAAAGTAATGTTTCTGGACTACAGCAACAACAAATGCATGGAACTCAGTCAGGTAACTCAAACATGCAATCAGATCAGCACCCAATGCATATGCTGCAACAGAACAAGGTTCATATGCAGCAGCAACCTCCACAAAATGCATCAAATTTATTGTCAGCACAAGGTCCGCAAGGGCAACTTCAGTCATCCCAACAGTTGATGTCACAGATTCCGTTGCAGTCCACCCAAGTCCAACAACAGGTACCTCTACATCAGCAGCAGCAACAGCAGCCAAATGCCATGTCACATGATCTCCAACAAAGGCTTCAAGTTGGAGGTCAAGCACCAAGCTCCCTGCTTCAATCACAAAATGTAATGGATCAGCAAAAGCAGTTGTATCATTCACAAAGAGCTCTTCCAGAGACATCATCGACATCTTTAGATTCCACGGCTCAGACAGGACAAGCCAATGGTGGAGATTGGCAAGAGGAGATTTATCAGAAGATTAAAGCTATGAAGGAGTTGTACTTTTTTGAACTGAAAGAAATGTACCAGAAAATTCTTCCAAAAGTGAATCAGTTGGAAAGTCTTCCACAGCAACCGAAGTCAGAGCAGCTAAATAAGCTAAAGACATTTAAGTTAATCTTGGAACGCCTTATAGCATTCTTACAGATTTCAAAAAGTAATATCGTAATTGGATTGAAAGATAAGATAGGCCACTATGAGAAGCAGATTGTTAGTTTTTTAAATTCTAATAGGCCAAGGAATCCAGTATCTACACTGCAGCCAGGACAGCTTCCTGCCTCTCACATGCAATCTATACAGCAGGCACAATCACAGATGACTCCATTACAGTCTCCTGACAATCAAATCAATCCCCAACTACATTCGGCGAACATGCAAGGTTCTGTGGCTCCGGTGCAGCAGAACAATATGAACAATATGAACAATATGCAGCATAATTCTCTTCCAACTTTTTCAGGATCAGCAGCACAACAAAACATGACGATCCCAATGCAGCCTGGTTCGAGTTTGGAATCAGGACAAGGAAATTCACTGAGCTCGTTTCAGCAGGTTGCTTCTGGGTCTTTGCAACAGAATCCTTCCAACAGTTCCCAAAGGGCAAACAATAGTTCTTTGCCATCACAAAATGGGGTGAACACTCTGCAGCCAAACATCGGTTCTCTTCAAACAAATCATAACATGCTTCAACATCAACATCTAAAACAGGATCCGCAACAACAGCTGAAACAACAAATGCAGCAGAGACAGATGCAGCAGCTAAAGCAGCAGCAGATGCTGCAGCACCAGCAACAGCAACAACAACCACAATTACATCAGCAACAGTCGCAGCTACACCAGCAAGGAAAGCCGCAGTTACCTGCACAAATGCAGGCACACCAATTGTCACACCTTAATCAAATCGAGATGAGACAGGGGCTTGCTACAAAGCCAGGGATGTTTCAACATCTCCCAGCGGCTCACCGCTCAGGTTATACCCATCAGCAGCAGATGAAACCAGGAACTTCATTACCAATTTCACCCCAAATCTTTCAGACTGCATCCCCTCAAGTTGCTCAAAATTCTTCTCCACAGGTCGACCAACAAAATCTACTTTCATCCATTACCAAAGTTCCACCTTTGCAATCTGCAAGCTCACCTTTAGTTGTACTATCCCCTTCAACGCCTGTGGCTCCATCTCCAATGCCGGGTGATTCAGAAAAACCCACTTCTGGTGTCTCAACACTTACAAATGCTGGGAATACTGGACAACAAACTAGTGTGTCTGGGACACAAGTCCAGTCTCTTGCCATTGGTACCCCCGGGATATCTGCCTCCCCATTACTTGCTGAGTTTAGTGGTACAGATGGCGCTTATGCCAATGCATTACCAACTGTTTCCGGAAAATCAAGTGCTACAGAGCAGCCTCTTGAGCGCCTAATTAAAGCTGTTAAATCAATGTCGCCTAAAGCTTTGAGTGCCTCTGTCAATGGCATTGGATCAGTTGTTAGTATGATTGATAGGGTAGCAGGCTCGGCCCCTGGCAATGGGTCAAGAGCTGCAGTCGGGGAGGATCTGGTTGCCATGACAAAATGTCGGCTGCAAGCAAGAAATTTTGTTTCACATGATGGATCAAATGGAACAAAAAAGATGAGACGCTACACAAGTGCAATGCCCTTAAATGTTGTATCATCAGCTGGAAGCATAAATGACGTTTTTAAACCGTTTACTGGTGCAGAGACATCCGATCTCGAGTCAACTGCAACATCTAGGGCCAAAAGGTCCAGGGTTGAGGCCAATCATGTTCTACTGGAAGAAATAAGGGAAATAAACCAACGTCTTATCGACACGGTGGTTGTAATCAGTGATGAAGTAGTTGATCCAAGTGCTTTAGCAGCTGCTGCTGATGGGAGCGAAGGAACAATTGTCAAGTGTTCTTTTAGTGCTGTGGCTCTCAGTCCCAGCTTAAAATCCCAGTACATGTCCGCACAAATGTCTCCAATTCAGCCTCTACGGTTACTTGTTCCTACAAATTATCCAAATTGTTCTCCAATACTCCTAGACAAGTTTCCTGTTGAAGTCAGGAAGGAATACGAAGACCTTTCAATAAAGGCCAAGTCAAGATTTAGCATATCCTTAAGGAACCTTTCACAACCTATGTCACTTGGGGACATAGCGAGAACTTGGGATGTCTGTGCACGTACTGTTGTTTCCGAGTATGCTCAGCAGAGCGGCGGTGGCAGCTTCTGTTCAAGGTATGGAGCTTGGGAGAACTGTTTGAGCGCTGCATGATTCAAAGAATCGATCGAAAAGGATTTGGCCGACTGACCAGCTCGGTATGTTTAGGGAAGACGGGCTGTTTATTCAAGTAACGTTTGCGTATCATAAGGTATGCCTCACTAAAGCTGAAAGTACACGCTGCATGTTGTTTAATCAGGGCTAGGCTAATTCAGTATTTATATTACCACGGTGGATCTGCTTTGCTTGTGAACACTGATTCCTTTTTGGTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAT
Coding sequence (CDS)
ATGGATACTAATAATTGGAGACCTACTCAAGGTGGAGAACCCGGAATCGAAGCCGGGGATTGGAGGTCTCAATTGCAGCCCGATTCTCGGCAACGAATTGTCAACAAAATAATGGAGACATTGAAGAGGCACCTTCCTGTTTCTGGTCATGAGGGATTGAGTGAGCTGAAAAAAATTGCTGTAAGGTTTGAGGAAAAGATTTATACTGCCGCTACCAGCCAGTCAGATTACCTAAGGAAAATATCTCTGAAGATGCTTACGATGGAAACAAAATCTCAGACTCCTATGGGAACTTCATTACCATCCAATCCTATGGTGCCTAGCAATAAGCCTCTGGATTCTGCATCACAAAGCATGCAGCCCCAAGTTCTAAATCAAGGGCAATCAATTTCTGTTCCTCAGTCTTCTAATCAGCCTCAGTCACGCCAGCAACTACTCTCCCAGAATATCCAAAATAATATTGTTTCTCAGAGTTCATCCAGCTTACCTTCTGCAGTACCTCCTGTCTCTGGATTAGCTTCATCTTCAATGCCAAACATGGTTGGCCAGAATCCAAGCATGCAAAATGTATCTGGAATTCCACAGAATTCTGTTGGAAATGCTATGGGTCAAGGGGTTCCTTCCAATGTATTTACCAACTCTCAGAGGCCAGTACAAGGAAGACAAGTAGTTTCCCAACAGCAACAGCAGCAGGCACAAACCCAGCAACAGCAGTTTCTTTTTCATCAACAACAGTTACAGCAGCAGATGATGAATAAGAAGCTTCAGCAGGGAAGTATACCACAACAACGTATGCAATCTCACATTCCACAACAACAACAGAATCTAATGCAACCAAATCAACTGCAATCATCTCAGCAATCTGCTATGCAACCTTCTATGATGCAACCTTCTCTGTCTAATCTTCAACAAAACCAACAATCTTCCATTCAACAGCCCACTCAATCCATGCTTCAGCAACCTCAACAGCCAGTTCTTAGGCAGCAGCCACAGTCCCAGCAACATGCTGTCATGCATTCACAGTCTACAATGTCACAGCAGACTAGCTTGCCTTCACAGCAGCAACAACAGCTAATTAGTCAGCAACCAAATTCTTCAAACATGCAGCAAAATCCATTGATCGGGCAGCAAAACAGTGTTGGGGACATGCAGCAACACCTGCCTCAGCAATCTAGGTCGCATGGGCAGCAGAGCAATCTATCAAATATGCAGTCTCCACCATCGCAGCAGCAGCAGTTAATGGCTCAGCAAAACAACCTTTCAAATTTGCAGCAGCAGCAGTTAGGACCTCAAAGTAATGTTTCTGGACTACAGCAACAACAAATGCATGGAACTCAGTCAGGTAACTCAAACATGCAATCAGATCAGCACCCAATGCATATGCTGCAACAGAACAAGGTTCATATGCAGCAGCAACCTCCACAAAATGCATCAAATTTATTGTCAGCACAAGGTCCGCAAGGGCAACTTCAGTCATCCCAACAGTTGATGTCACAGATTCCGTTGCAGTCCACCCAAGTCCAACAACAGGTACCTCTACATCAGCAGCAGCAACAGCAGCCAAATGCCATGTCACATGATCTCCAACAAAGGCTTCAAGTTGGAGGTCAAGCACCAAGCTCCCTGCTTCAATCACAAAATGTAATGGATCAGCAAAAGCAGTTGTATCATTCACAAAGAGCTCTTCCAGAGACATCATCGACATCTTTAGATTCCACGGCTCAGACAGGACAAGCCAATGGTGGAGATTGGCAAGAGGAGATTTATCAGAAGATTAAAGCTATGAAGGAGTTGTACTTTTTTGAACTGAAAGAAATGTACCAGAAAATTCTTCCAAAAGTGAATCAGTTGGAAAGTCTTCCACAGCAACCGAAGTCAGAGCAGCTAAATAAGCTAAAGACATTTAAGTTAATCTTGGAACGCCTTATAGCATTCTTACAGATTTCAAAAAGTAATATCGTAATTGGATTGAAAGATAAGATAGGCCACTATGAGAAGCAGATTGTTAGTTTTTTAAATTCTAATAGGCCAAGGAATCCAGTATCTACACTGCAGCCAGGACAGCTTCCTGCCTCTCACATGCAATCTATACAGCAGGCACAATCACAGATGACTCCATTACAGTCTCCTGACAATCAAATCAATCCCCAACTACATTCGGCGAACATGCAAGGTTCTGTGGCTCCGGTGCAGCAGAACAATATGAACAATATGAACAATATGCAGCATAATTCTCTTCCAACTTTTTCAGGATCAGCAGCACAACAAAACATGACGATCCCAATGCAGCCTGGTTCGAGTTTGGAATCAGGACAAGGAAATTCACTGAGCTCGTTTCAGCAGGTTGCTTCTGGGTCTTTGCAACAGAATCCTTCCAACAGTTCCCAAAGGGCAAACAATAGTTCTTTGCCATCACAAAATGGGGTGAACACTCTGCAGCCAAACATCGGTTCTCTTCAAACAAATCATAACATGCTTCAACATCAACATCTAAAACAGGATCCGCAACAACAGCTGAAACAACAAATGCAGCAGAGACAGATGCAGCAGCTAAAGCAGCAGCAGATGCTGCAGCACCAGCAACAGCAACAACAACCACAATTACATCAGCAACAGTCGCAGCTACACCAGCAAGGAAAGCCGCAGTTACCTGCACAAATGCAGGCACACCAATTGTCACACCTTAATCAAATCGAGATGAGACAGGGGCTTGCTACAAAGCCAGGGATGTTTCAACATCTCCCAGCGGCTCACCGCTCAGGTTATACCCATCAGCAGCAGATGAAACCAGGAACTTCATTACCAATTTCACCCCAAATCTTTCAGACTGCATCCCCTCAAGTTGCTCAAAATTCTTCTCCACAGGTCGACCAACAAAATCTACTTTCATCCATTACCAAAGTTCCACCTTTGCAATCTGCAAGCTCACCTTTAGTTGTACTATCCCCTTCAACGCCTGTGGCTCCATCTCCAATGCCGGGTGATTCAGAAAAACCCACTTCTGGTGTCTCAACACTTACAAATGCTGGGAATACTGGACAACAAACTAGTGTGTCTGGGACACAAGTCCAGTCTCTTGCCATTGGTACCCCCGGGATATCTGCCTCCCCATTACTTGCTGAGTTTAGTGGTACAGATGGCGCTTATGCCAATGCATTACCAACTGTTTCCGGAAAATCAAGTGCTACAGAGCAGCCTCTTGAGCGCCTAATTAAAGCTGTTAAATCAATGTCGCCTAAAGCTTTGAGTGCCTCTGTCAATGGCATTGGATCAGTTGTTAGTATGATTGATAGGGTAGCAGGCTCGGCCCCTGGCAATGGGTCAAGAGCTGCAGTCGGGGAGGATCTGGTTGCCATGACAAAATGTCGGCTGCAAGCAAGAAATTTTGTTTCACATGATGGATCAAATGGAACAAAAAAGATGAGACGCTACACAAGTGCAATGCCCTTAAATGTTGTATCATCAGCTGGAAGCATAAATGACGTTTTTAAACCGTTTACTGGTGCAGAGACATCCGATCTCGAGTCAACTGCAACATCTAGGGCCAAAAGGTCCAGGGTTGAGGCCAATCATGTTCTACTGGAAGAAATAAGGGAAATAAACCAACGTCTTATCGACACGGTGGTTGTAATCAGTGATGAAGTAGTTGATCCAAGTGCTTTAGCAGCTGCTGCTGATGGGAGCGAAGGAACAATTGTCAAGTGTTCTTTTAGTGCTGTGGCTCTCAGTCCCAGCTTAAAATCCCAGTACATGTCCGCACAAATGTCTCCAATTCAGCCTCTACGGTTACTTGTTCCTACAAATTATCCAAATTGTTCTCCAATACTCCTAGACAAGTTTCCTGTTGAAGTCAGGAAGGAATACGAAGACCTTTCAATAAAGGCCAAGTCAAGATTTAGCATATCCTTAAGGAACCTTTCACAACCTATGTCACTTGGGGACATAGCGAGAACTTGGGATGTCTGTGCACGTACTGTTGTTTCCGAGTATGCTCAGCAGAGCGGCGGTGGCAGCTTCTGTTCAAGGTATGGAGCTTGGGAGAACTGTTTGAGCGCTGCATGA
Protein sequence
MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIAVRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA
Homology
BLAST of CmoCh19G003080.1 vs. ExPASy Swiss-Prot
Match:
F4I171 (Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana OX=3702 GN=MED15A PE=1 SV=1)
HSP 1 Score: 1068.5 bits (2762), Expect = 6.0e-311
Identity = 757/1399 (54.11%), Postives = 952/1399 (68.05%), Query Frame = 0
Query: 1 MDTNNWRPT-QGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKI 60
MD NNWRP+ GEP ++ GDWR+QL PDSRQ+IVNKIMETLK+HLP SG EG++EL++I
Sbjct: 1 MDNNNWRPSLPNGEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRI 60
Query: 61 AVRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSM 120
A RFEEKI++ A +Q+DYLRKIS+KMLTMETKSQ G+S + P + +DS
Sbjct: 61 AARFEEKIFSGALNQTDYLRKISMKMLTMETKSQNAAGSS-AAIPAANNGTSIDSIP--- 120
Query: 121 QPQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVS--QSSSSLPSAVPPVSGLASSSM 180
NQGQ + S+NQ Q+ Q LLSQ +QNN S S++LPS++PPVS + +++
Sbjct: 121 ----TNQGQLLPGSLSTNQSQAPQPLLSQTMQNNTASGMTGSTALPSSMPPVSSITNNNT 180
Query: 181 PNMVGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQ 240
++V QN +MQNV+G+ Q+S G G+ SN+F+ QR + GR QQ QQQ
Sbjct: 181 TSVVNQNANMQNVAGMLQDSSGQ---HGLSSNMFSGPQRQMLGRPHAMSSQQ-----QQQ 240
Query: 241 QFLFHQQQLQQQMMNKKLQQGSIPQQR--MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMM 300
+L+ QQQLQQQ++ + Q G++P + SHI QQQQN++QPNQL SSQQ + S
Sbjct: 241 PYLY-QQQLQQQLLKQNFQSGNVPNPNSLLPSHIQQQQQNVLQPNQLHSSQQPGVPTSAT 300
Query: 301 QPS------LSNLQQNQQS----SIQQPTQSMLQQPQQPVLRQQPQSQQHAVMH-SQSTM 360
QPS L L NQQS S QQ TQSML+Q Q +LRQ PQSQQ + +H QS++
Sbjct: 301 QPSTVNSAPLQGLHTNQQSSPQLSSQQTTQSMLRQHQSSMLRQHPQSQQASGIHQQQSSL 360
Query: 361 SQQTSLPSQQQ-QQLISQQ-PNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSN 420
QQ+ P QQQ QL+ QQ NSS +QQ ++G Q+ VGDMQQ Q R QQ+N+ N
Sbjct: 361 PQQSISPLQQQPTQLMRQQAANSSGIQQKQMMG-QHVVGDMQQQ--HQQRLLNQQNNVMN 420
Query: 421 MQSPPSQ-----------------QQQLMAQQNNLSNLQQQQLGPQSNVSGLQ--QQQMH 480
+Q SQ QQQLM+QQN+L Q LG QSNV+GLQ QQQM
Sbjct: 421 IQQQQSQQQPLQQPQQQQKQQPPAQQQLMSQQNSLQATHQNPLGTQSNVAGLQQPQQQML 480
Query: 481 GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 540
+Q GNS++Q++QH +HML Q V + Q+ Q L S+QG Q Q Q SQQ
Sbjct: 481 NSQVGNSSLQNNQHSVHMLSQPTVGL-QRTHQAGHGLYSSQGQQSQNQPSQQ-------- 540
Query: 541 STQVQQQVPLHQQQ---QQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQR 600
Q+ Q+ H QQ QQQPN + D+QQRLQ GQ SLL QNV+DQQ+QLY SQR
Sbjct: 541 --QMMPQLQSHHQQLGLQQQPNLLQQDVQQRLQASGQVTGSLLPPQNVVDQQRQLYQSQR 600
Query: 601 ALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESL 660
LPE S+SLDSTAQT ANGGDWQEE+YQKIK+MKE Y +L E+YQ++ K+ Q +S+
Sbjct: 601 TLPEMPSSSLDSTAQTESANGGDWQEEVYQKIKSMKETYLPDLNEIYQRVAAKLQQ-DSM 660
Query: 661 PQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNP 720
PQQ +S+QL KL+ FK +LER+I FL +SKSNI+ LKDK+ +YEKQI+ FLN +RPR P
Sbjct: 661 PQQQRSDQLEKLRQFKTMLERMIQFLSVSKSNIMPALKDKVAYYEKQIIGFLNMHRPRKP 720
Query: 721 VSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNN 780
V Q GQLP S MQ +QQ QSQ QS DNQ NPQ+ S +MQG+ QQ++M NM +
Sbjct: 721 V---QQGQLPQSQMQPMQQPQSQTVQDQSHDNQTNPQMQSMSMQGAGPRAQQSSMTNMQS 780
Query: 781 MQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANN 840
+S P SA QQN+ + P SSLESGQGN+L++ QQVA GS+QQ N+SQ NN
Sbjct: 781 NVLSSRP--GVSAPQQNIPSSI-PASSLESGQGNTLNNGQQVAMGSMQQ---NTSQLVNN 840
Query: 841 SSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLK--QDPQQQLKQQMQQRQMQQLKQQQMLQ 900
SS +Q+G++TLQ N+ Q + ++LQHQHLK QD Q QLKQQ QQRQMQQ QQ +
Sbjct: 841 SSASAQSGLSTLQSNVNQPQLSSSLLQHQHLKQQQDQQMQLKQQFQQRQMQQ--QQLQAR 900
Query: 901 HQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMF-QHLPAA 960
QQQQQQ Q QQ +QL Q++ +N + RQG+ GMF QH
Sbjct: 901 QQQQQQQLQARQQAAQL--------------QQMNDMNDLTSRQGMNVSRGMFQQHSMQG 960
Query: 961 HRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSP 1020
R+ Y QQ+KPG SPQ+ Q ASPQ++Q+ SPQVDQ+N ++ + PLQ A+SP
Sbjct: 961 QRANYP-LQQLKPGA--VSSPQLLQGASPQMSQHLSPQVDQKNTVNKMG--TPLQPANSP 1020
Query: 1021 LVVLSP-STPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISAS 1080
VV SP STP+APSPM DSEKP G S+L+ QQ + VQSLAIGTPGISAS
Sbjct: 1021 FVVPSPSSTPLAPSPMQVDSEKP--GSSSLSMGNIARQQATGMQGVVQSLAIGTPGISAS 1080
Query: 1081 PLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMI 1140
PLL EF+ DG N+ SGK SATE P+ERLI+AVKS+SP+ALS++V+ IGSVVSM+
Sbjct: 1081 PLLQEFTSPDGNILNSSTITSGKPSATELPIERLIRAVKSISPQALSSAVSDIGSVVSMV 1140
Query: 1141 DRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSA 1200
DR+AGSAPGNGSRA+VGEDLVAMTKCRLQARNF++ +G TKKM+R+T+AMPL+V S
Sbjct: 1141 DRIAGSAPGNGSRASVGEDLVAMTKCRLQARNFMTQEGMMATKKMKRHTTAMPLSVASLG 1200
Query: 1201 GSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISD--E 1260
GS+ D +K F G+ETSDLESTATS K++R E H LLEEI+EINQRLIDTVV ISD +
Sbjct: 1201 GSVGDNYKQFAGSETSDLESTATSDGKKARTETEHALLEEIKEINQRLIDTVVEISDDED 1260
Query: 1261 VVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSP 1320
DPS +A ++ G EGT V+ SF AV+LSP+LK+ S QMSPIQPLRLLVP +YPN SP
Sbjct: 1261 AADPSEVAISSIGCEGTTVRFSFIAVSLSPALKAHLSSTQMSPIQPLRLLVPCSYPNGSP 1320
Query: 1321 ILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQS 1354
LLDK PVE KE EDLS KA +RF+I LR+LSQPMSL DIA+TWD CAR V+ EYAQQ
Sbjct: 1321 SLLDKLPVETSKENEDLSSKAMARFNILLRSLSQPMSLKDIAKTWDACARAVICEYAQQF 1335
BLAST of CmoCh19G003080.1 vs. ExPASy Swiss-Prot
Match:
Q9SHV7 (Probable mediator of RNA polymerase II transcription subunit 15c OS=Arabidopsis thaliana OX=3702 GN=MED15C PE=3 SV=1)
HSP 1 Score: 261.5 bits (667), Expect = 5.1e-68
Identity = 308/980 (31.43%), Postives = 454/980 (46.33%), Query Frame = 0
Query: 396 QQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSD 455
Q+S + +Q+QL+ Q NL P S + QQ G +S+ Q++
Sbjct: 147 QKSVFDTTEQKRQEQEQLINQLTNL---------PTSRPNNRDQQ---GAFQVSSSQQNN 206
Query: 456 QHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQ 515
+H + Q K ++Q + QQ+ P+ S Q +QQ P+ Q
Sbjct: 207 NVTLHAMSQQKNNLQ------------------SMTRGQQVGQSQPMMSQQYRQQYPM-Q 266
Query: 516 QQQQQPNAMSH-DLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP-----ETSSTS 575
Q Q N H D Q QA SSL Q+QN+ DQQ Q +RA P S
Sbjct: 267 QDPQNRNLQKHLDFVQNNTNQFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVAS 326
Query: 576 LDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQPKSEQ- 635
DST +T N G+WQEE YQKIK +KE+ L M+Q++ K+ + ESLP QP Q
Sbjct: 327 QDSTGKTVNVNAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQW 386
Query: 636 LNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQ 695
+ KLK KL +E L+ FL + +S++ +DK YE I+ F S + Q GQ
Sbjct: 387 IEKLKAGKLSMEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQQQQGQ 446
Query: 696 LPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHNSLPT 755
P S Q+ Q QSP ++ L+ + + P QN +++ ++ P
Sbjct: 447 FPPS--QTAMQT-------QSPQVHVSQSLYKEQRRSRLMPSSQNEASSLLQIRPKLDP- 506
Query: 756 FSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNG 815
+N+ ++S V S++QNP
Sbjct: 507 -----RDENII----------------MASSGNVMLPSVKQNP---------------RA 566
Query: 816 VNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQL 875
VNT NI S+Q+ LQ KQ++ Q QQQQPQ
Sbjct: 567 VNT---NISSVQS----LQ------------------------KQKRFHHRQMQQQQPQQ 626
Query: 876 HQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTHQQQM 935
Q Q+ Q + +N + MR+ + K + +Q
Sbjct: 627 GNHQHQM---------------QTNEMNDVRMRERVNIKARLL--------------EQQ 686
Query: 936 KPGTSLPISPQIFQTASPQVAQNSSPQ-VDQQNLLSSITKV-PPLQSASSPLVVLSPSTP 995
+ + Q +S Q+ +SSPQ VDQ L ++I K PL S+ S V
Sbjct: 687 VSSSQRQVPKQESNVSSSQIQNHSSPQLVDQHILPATINKTGTPLNSSGSAFVA------ 746
Query: 996 VAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTD 1055
APSP+PGDSE P S S ++ ++ T S +GT +PLL
Sbjct: 747 PAPSPVPGDSEMPISVESPVSGV------DEINSTLDSSSKLGT---QETPLL------- 806
Query: 1056 GAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGN 1115
V TE+P++RLIKA ++ SPK+L+ SV+ I SV+SM+D + GS P +
Sbjct: 807 --------FVPPPEPITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSS 866
Query: 1116 -GSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKP 1175
GSRA +GEDL T RNF +H+ +N +K+M+R + +P ++ S D ++
Sbjct: 867 GGSRAGLGEDLSERT------RNFTTHEETNLSKRMKRSINIVPPDMSSQI----DSYEQ 926
Query: 1176 FTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAA 1235
+ E S++ ST +S K + + + LL+EI+E N RL++TVV I DE
Sbjct: 927 LSSLE-SEVVSTTSSGLKVNNIAPGYALLQEIKETNGRLVETVVEICDE----------- 935
Query: 1236 DGSEGTIVKCSFSAVALSPSLKSQYMSAQM----------SPIQPLRLLVPTNYPNCSPI 1295
S GTIV C+++ VALS + K Y S ++ + IQPLRLL P +YP SPI
Sbjct: 987 -DSLGTIVTCTYAPVALSATFKDHYKSGKIIFYVSKCLMQAQIQPLRLLFPMDYPYSSPI 935
Query: 1296 LLDK--FPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQ 1354
+L++ F V K YEDLS + +SRFS+S++ S+P IA+TW+ CAR + EYA++
Sbjct: 1047 VLEEISFDTSVHK-YEDLSARTRSRFSLSMKEFSEPGFSKGIAQTWNDCARATMVEYAER 935
BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match:
A0A6J1HI15 (mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463831 PE=4 SV=1)
HSP 1 Score: 2419.4 bits (6269), Expect = 0.0e+00
Identity = 1353/1353 (100.00%), Postives = 1353/1353 (100.00%), Query Frame = 0
Query: 1 MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA
Sbjct: 1 MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
Query: 61 VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ
Sbjct: 61 VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
Query: 121 PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM
Sbjct: 121 PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
Query: 181 VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240
VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL
Sbjct: 181 VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240
Query: 241 FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS 300
FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS
Sbjct: 241 FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS 300
Query: 301 NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS 360
NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS
Sbjct: 301 NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS 360
Query: 361 QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNL 420
QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNL
Sbjct: 361 QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNL 420
Query: 421 SNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNL 480
SNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNL
Sbjct: 421 SNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNL 480
Query: 481 LSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPS 540
LSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPS
Sbjct: 481 LSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPS 540
Query: 541 SLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYF 600
SLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYF
Sbjct: 541 SLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYF 600
Query: 601 FELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK 660
FELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK
Sbjct: 601 FELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK 660
Query: 661 IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS 720
IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS
Sbjct: 661 IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS 720
Query: 721 ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQ 780
ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQ
Sbjct: 721 ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQ 780
Query: 781 QVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLK 840
QVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLK
Sbjct: 781 QVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLK 840
Query: 841 QQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEM 900
QQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEM
Sbjct: 841 QQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEM 900
Query: 901 RQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQN 960
RQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQN
Sbjct: 901 RQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQN 960
Query: 961 LLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSG 1020
LLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSG
Sbjct: 961 LLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSG 1020
Query: 1021 TQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPK 1080
TQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPK
Sbjct: 1021 TQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPK 1080
Query: 1081 ALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKK 1140
ALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKK
Sbjct: 1081 ALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKK 1140
Query: 1141 MRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREI 1200
MRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREI
Sbjct: 1141 MRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREI 1200
Query: 1201 NQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQP 1260
NQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQP
Sbjct: 1201 NQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQP 1260
Query: 1261 LRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWD 1320
LRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWD
Sbjct: 1261 LRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWD 1320
Query: 1321 VCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
VCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA
Sbjct: 1321 VCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1353
BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match:
A0A6J1HR60 (mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467027 PE=4 SV=1)
HSP 1 Score: 2368.6 bits (6137), Expect = 0.0e+00
Identity = 1334/1356 (98.38%), Postives = 1341/1356 (98.89%), Query Frame = 0
Query: 1 MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
MDTNNWRPTQGGEPGIE GDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA
Sbjct: 1 MDTNNWRPTQGGEPGIEDGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
Query: 61 VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ
Sbjct: 61 VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
Query: 121 PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
PQVLNQGQSISVPQ SNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM
Sbjct: 121 PQVLNQGQSISVPQPSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
Query: 181 VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240
VGQNPSMQNVSGIPQNSVGN+MGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL
Sbjct: 181 VGQNPSMQNVSGIPQNSVGNSMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFL 240
Query: 241 FHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLS 300
FHQQQLQQQMMNKK QQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQ AMQPSMMQ SLS
Sbjct: 241 FHQQQLQQQMMNKKFQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQPAMQPSMMQSSLS 300
Query: 301 NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLIS 360
NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMH QSTMSQQTSLPSQQQQQLIS
Sbjct: 301 NLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHPQSTMSQQTSLPSQQQQQLIS 360
Query: 361 QQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPP-SQQQQLMAQQNN 420
QQPNSS+MQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPP QQQQLMAQQNN
Sbjct: 361 QQPNSSSMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPLQQQQQLMAQQNN 420
Query: 421 LSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASN 480
LSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASN
Sbjct: 421 LSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASN 480
Query: 481 LLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAP 540
LLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAP
Sbjct: 481 LLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAP 540
Query: 541 SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY 600
SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY
Sbjct: 541 SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY 600
Query: 601 FFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKD 660
FFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKD
Sbjct: 601 FFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKD 660
Query: 661 KIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLH 720
KIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQ+QSQMTPLQSP+NQINPQLH
Sbjct: 661 KIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQSQSQMTPLQSPENQINPQLH 720
Query: 721 SANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSF 780
SANMQGSVAPVQQ NNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSF
Sbjct: 721 SANMQGSVAPVQQ---NNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSF 780
Query: 781 QQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQL 840
QQVASGSLQQN +NSSQRANN+SLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQL
Sbjct: 781 QQVASGSLQQNSANSSQRANNNSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQL 840
Query: 841 KQQMQQRQMQQLKQQQMLQH--QQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQ 900
KQQMQQRQMQQLKQQQMLQH QQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQ
Sbjct: 841 KQQMQQRQMQQLKQQQMLQHQQQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQ 900
Query: 901 IEMRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVD 960
IEMRQGLATK GMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVD
Sbjct: 901 IEMRQGLATKSGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVD 960
Query: 961 QQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTS 1020
QQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTS
Sbjct: 961 QQNLLSSITKVPPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTS 1020
Query: 1021 VSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSM 1080
VSGTQVQSLAIGTPGISASPLLAEFSGTDGAYA+ALPTVSGKSSATEQPLERLIKAVKSM
Sbjct: 1021 VSGTQVQSLAIGTPGISASPLLAEFSGTDGAYASALPTVSGKSSATEQPLERLIKAVKSM 1080
Query: 1081 SPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNG 1140
SPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNG
Sbjct: 1081 SPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNG 1140
Query: 1141 TKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEI 1200
TKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEI
Sbjct: 1141 TKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEI 1200
Query: 1201 REINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSP 1260
REINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSP
Sbjct: 1201 REINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSP 1260
Query: 1261 IQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIAR 1320
IQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIAR
Sbjct: 1261 IQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIAR 1320
Query: 1321 TWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
TWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA
Sbjct: 1321 TWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1353
BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match:
A0A6J1HI99 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463831 PE=4 SV=1)
HSP 1 Score: 2252.6 bits (5836), Expect = 0.0e+00
Identity = 1269/1269 (100.00%), Postives = 1269/1269 (100.00%), Query Frame = 0
Query: 85 MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ 144
MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ
Sbjct: 1 MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ 60
Query: 145 LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ 204
LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ
Sbjct: 61 LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ 120
Query: 205 GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR 264
GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR
Sbjct: 121 GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR 180
Query: 265 MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP 324
MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP
Sbjct: 181 MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP 240
Query: 325 VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ 384
VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ
Sbjct: 241 VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ 300
Query: 385 QHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHG 444
QHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHG
Sbjct: 301 QHLPQQSRSHGQQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHG 360
Query: 445 TQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQS 504
TQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQS
Sbjct: 361 TQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQS 420
Query: 505 TQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPE 564
TQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPE
Sbjct: 421 TQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALPE 480
Query: 565 TSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQP 624
TSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQP
Sbjct: 481 TSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQP 540
Query: 625 KSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTL 684
KSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTL
Sbjct: 541 KSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTL 600
Query: 685 QPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHN 744
QPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHN
Sbjct: 601 QPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHN 660
Query: 745 SLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLP 804
SLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLP
Sbjct: 661 SLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLP 720
Query: 805 SQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQ 864
SQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQ
Sbjct: 721 SQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQ 780
Query: 865 QPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTH 924
QPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTH
Sbjct: 781 QPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTH 840
Query: 925 QQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPS 984
QQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPS
Sbjct: 841 QQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSPS 900
Query: 985 TPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSG 1044
TPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSG
Sbjct: 901 TPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSG 960
Query: 1045 TDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAP 1104
TDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAP
Sbjct: 961 TDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAP 1020
Query: 1105 GNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFK 1164
GNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFK
Sbjct: 1021 GNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFK 1080
Query: 1165 PFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAA 1224
PFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAA
Sbjct: 1081 PFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAA 1140
Query: 1225 ADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV 1284
ADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV
Sbjct: 1141 ADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV 1200
Query: 1285 RKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYG 1344
RKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYG
Sbjct: 1201 RKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYG 1260
Query: 1345 AWENCLSAA 1354
AWENCLSAA
Sbjct: 1261 AWENCLSAA 1269
BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match:
A0A6J1HVE4 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467027 PE=4 SV=1)
HSP 1 Score: 2203.7 bits (5709), Expect = 0.0e+00
Identity = 1251/1272 (98.35%), Postives = 1258/1272 (98.90%), Query Frame = 0
Query: 85 MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQSSNQPQSRQQ 144
MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQ SNQPQSRQQ
Sbjct: 1 MLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQPQVLNQGQSISVPQPSNQPQSRQQ 60
Query: 145 LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNAMGQ 204
LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGN+MGQ
Sbjct: 61 LLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNMVGQNPSMQNVSGIPQNSVGNSMGQ 120
Query: 205 GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKLQQGSIPQQR 264
GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKK QQGSIPQQR
Sbjct: 121 GVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQQFLFHQQQLQQQMMNKKFQQGSIPQQR 180
Query: 265 MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPSLSNLQQNQQSSIQQPTQSMLQQPQQP 324
MQSHIPQQQQNLMQPNQLQSSQQ AMQPSMMQ SLSNLQQNQQSSIQQPTQSMLQQPQQP
Sbjct: 181 MQSHIPQQQQNLMQPNQLQSSQQPAMQPSMMQSSLSNLQQNQQSSIQQPTQSMLQQPQQP 240
Query: 325 VLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQLISQQPNSSNMQQNPLIGQQNSVGDMQ 384
VLRQQPQSQQHAVMH QSTMSQQTSLPSQQQQQLISQQPNSS+MQQNPLIGQQNSVGDMQ
Sbjct: 241 VLRQQPQSQQHAVMHPQSTMSQQTSLPSQQQQQLISQQPNSSSMQQNPLIGQQNSVGDMQ 300
Query: 385 QHLPQQSRSHGQQSNLSNMQSPP-SQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMH 444
QHLPQQSRSHGQQSNLSNMQSPP QQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMH
Sbjct: 301 QHLPQQSRSHGQQSNLSNMQSPPLQQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMH 360
Query: 445 GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 504
GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ
Sbjct: 361 GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 420
Query: 505 STQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP 564
STQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP
Sbjct: 421 STQVQQQVPLHQQQQQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP 480
Query: 565 ETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQ 624
ETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQ
Sbjct: 481 ETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQ 540
Query: 625 PKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVST 684
PKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVST
Sbjct: 541 PKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVST 600
Query: 685 LQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQH 744
LQPGQLPASHMQSIQQ+QSQMTPLQSP+NQINPQLHSANMQGSVAPVQQ NNMNNMQH
Sbjct: 601 LQPGQLPASHMQSIQQSQSQMTPLQSPENQINPQLHSANMQGSVAPVQQ---NNMNNMQH 660
Query: 745 NSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSL 804
NSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQN +NSSQRANN+SL
Sbjct: 661 NSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNSANSSQRANNNSL 720
Query: 805 PSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQH--QQ 864
PSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQH QQ
Sbjct: 721 PSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQ 780
Query: 865 QQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSG 924
QQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATK GMFQHLPAAHRSG
Sbjct: 781 QQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKSGMFQHLPAAHRSG 840
Query: 925 YTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVL 984
YTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVL
Sbjct: 841 YTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVL 900
Query: 985 SPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAE 1044
SPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAE
Sbjct: 901 SPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAE 960
Query: 1045 FSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAG 1104
FSGTDGAYA+ALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAG
Sbjct: 961 FSGTDGAYASALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAG 1020
Query: 1105 SAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSIND 1164
SAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSIND
Sbjct: 1021 SAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSIND 1080
Query: 1165 VFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSAL 1224
VFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSAL
Sbjct: 1081 VFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSAL 1140
Query: 1225 AAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFP 1284
AAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFP
Sbjct: 1141 AAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFP 1200
Query: 1285 VEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCS 1344
VEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCS
Sbjct: 1201 VEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCS 1260
Query: 1345 RYGAWENCLSAA 1354
RYGAWENCLSAA
Sbjct: 1261 RYGAWENCLSAA 1269
BLAST of CmoCh19G003080.1 vs. ExPASy TrEMBL
Match:
A0A6J1ELX0 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111434564 PE=4 SV=1)
HSP 1 Score: 2004.9 bits (5193), Expect = 0.0e+00
Identity = 1166/1375 (84.80%), Postives = 1235/1375 (89.82%), Query Frame = 0
Query: 1 MDTNNWRPTQGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKIA 60
MD++NWRP QGGE G++AGDWRSQLQPDSR RIVNKIMETLKRHLPVSGHEGLSEL+KIA
Sbjct: 1 MDSSNWRPAQGGESGVDAGDWRSQLQPDSRHRIVNKIMETLKRHLPVSGHEGLSELRKIA 60
Query: 61 VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSMQ 120
VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQT T+LPSN MVP+NKPLDS SQSMQ
Sbjct: 61 VRFEEKIYTAATSQSDYLRKISLKMLTMETKSQT---TALPSNSMVPTNKPLDSTSQSMQ 120
Query: 121 PQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVSQSSSSLPSAVPPVSGLASSSMPNM 180
QVLNQG S+S P SSNQPQ RQQLLSQNIQNNI SQSSSSLPS+VPPV+GLAS+ M N+
Sbjct: 121 SQVLNQGPSMSGPMSSNQPQPRQQLLSQNIQNNIASQSSSSLPSSVPPVAGLASAPMANI 180
Query: 181 VGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVS--QQQQQQAQTQQQQ 240
VGQNPSMQNVSG+PQ+SVGNAMGQGV SNVFTNSQRP+QGRQVVS QQQQQQ+Q+QQQQ
Sbjct: 181 VGQNPSMQNVSGVPQSSVGNAMGQGVSSNVFTNSQRPIQGRQVVSQQQQQQQQSQSQQQQ 240
Query: 241 FLFHQQQLQQQMMNKKLQQGSIPQQRMQSHIPQQQQNLMQPNQLQSSQQSAMQPSMMQPS 300
F QQ LQQQ+M +K QQGS+P Q MQSHIPQQQ NLM PNQL SSQQ S+MQPS
Sbjct: 241 LFFQQQHLQQQIMKQKYQQGSMPHQLMQSHIPQQQTNLMAPNQLPSSQQ-----SVMQPS 300
Query: 301 LSNLQQNQQSSIQQPTQSMLQQPQQPVLRQQPQSQQHAVMHSQSTMSQQTSLPSQQQQQL 360
LSNLQQNQQSSIQQPTQSMLQQP QPVLRQQ QSQQH+V+H Q TMSQQ SL SQQQQQL
Sbjct: 301 LSNLQQNQQSSIQQPTQSMLQQPPQPVLRQQQQSQQHSVLHQQPTMSQQASLSSQQQQQL 360
Query: 361 ISQQPNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSNMQSPPS--QQQQLMAQ 420
I+QQ NSSNMQQN LI QNSVGDMQQ LPQQSRSHGQQSNLSNMQ+PPS QQQQLM Q
Sbjct: 361 INQQSNSSNMQQNSLI--QNSVGDMQQQLPQQSRSHGQQSNLSNMQTPPSQQQQQQLMNQ 420
Query: 421 QNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQN 480
Q++LSNLQQ QLGPQSNVSGLQQQQMHGTQSGNSNMQS+QH +HM+QQNKV MQQQPPQN
Sbjct: 421 QSSLSNLQQPQLGPQSNVSGLQQQQMHGTQSGNSNMQSNQHGVHMMQQNKVQMQQQPPQN 480
Query: 481 ASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQQQQQQP--NAMSHDLQQRLQV 540
SNLLS QG QGQLQSSQQLMSQIPLQS QVQQQV L QQQQQQP N +SH+LQQRLQ
Sbjct: 481 PSNLLSTQGQQGQLQSSQQLMSQIPLQSAQVQQQVSLQQQQQQQPQSNTLSHELQQRLQA 540
Query: 541 GGQAPSSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKA 600
GGQAP LLQSQNVMDQQKQLYH QR LPETSSTSLDSTAQTGQANGGDWQEEIYQKIK+
Sbjct: 541 GGQAPGPLLQSQNVMDQQKQLYHPQRVLPETSSTSLDSTAQTGQANGGDWQEEIYQKIKS 600
Query: 601 MKELYFFELKEMYQKILPKVNQLESLPQQPKSEQLNKLKTFKLILERLIAFLQISKSNIV 660
MKELY FELKEMYQKILPKV+Q +SLPQQPKSEQLNKL+ F++ILERLIAFLQ+ K+NIV
Sbjct: 601 MKELYLFELKEMYQKILPKVHQFDSLPQQPKSEQLNKLRAFRVILERLIAFLQVPKNNIV 660
Query: 661 IGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQI 720
IG KDKI HYEKQIVSFLNSNRPRNPVSTLQ GQLPASHMQS+QQ+QSQMTPLQSP+NQI
Sbjct: 661 IGFKDKISHYEKQIVSFLNSNRPRNPVSTLQQGQLPASHMQSMQQSQSQMTPLQSPENQI 720
Query: 721 NPQLHSANMQGSVAPVQQ---NNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESG 780
NPQLHSANMQGSVA VQQ NNMNNMNNMQHNSLPTFSGSA QQNMTIPMQPGSSLESG
Sbjct: 721 NPQLHSANMQGSVALVQQNNMNNMNNMNNMQHNSLPTFSGSAPQQNMTIPMQPGSSLESG 780
Query: 781 QGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHL 840
QGNSLSS QQV + SLQQNP+N SQRANNSSL SQNGVN LQPNI SLQ+N N+LQHQH+
Sbjct: 781 QGNSLSSLQQVGAVSLQQNPANGSQRANNSSLASQNGVNALQPNISSLQSNTNILQHQHM 840
Query: 841 K-QDP------QQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLP 900
K QDP QQQLKQQMQQR MQ LKQQ + QQQQQQPQLHQQQSQL QQGK QLP
Sbjct: 841 KQQDPQQLLQSQQQLKQQMQQRHMQHLKQQMLQHQQQQQQQPQLHQQQSQLQQQGKQQLP 900
Query: 901 AQMQAHQLSHLNQIE-----MRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQ 960
QMQAHQ+SHLNQIE MRQG+A KPGMFQH P RS YTH QMKPGTS PISP
Sbjct: 901 TQMQAHQMSHLNQIEMNDLKMRQGVAAKPGMFQH-PGTQRSAYTH-PQMKPGTSFPISPP 960
Query: 961 IFQTASPQVAQNSSPQVDQQNLLSSITKV-PPLQSASSPLVVLSPSTPVAPSPMPGDSEK 1020
IFQ SPQV QNSSPQVDQQN+ SS+ ++ PLQSASSP VV SPSTP+APSPMPGDSEK
Sbjct: 961 IFQATSPQVTQNSSPQVDQQNMFSSMNRIGTPLQSASSPFVVPSPSTPLAPSPMPGDSEK 1020
Query: 1021 PTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSG 1080
PTS VS+L NAGNTGQQ +VSG Q SLAIGTPGISASPLLAEFSGTDGAYA ALPTVSG
Sbjct: 1021 PTSAVSSLPNAGNTGQQMNVSGAQAPSLAIGTPGISASPLLAEFSGTDGAYAIALPTVSG 1080
Query: 1081 KSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGNGSRAAVGEDLVA 1140
KSS TEQPLERLIKAVKSMSP+AL+ASV+GIGSVVSMIDR+AGSAPGNGSRAAVGEDLVA
Sbjct: 1081 KSSVTEQPLERLIKAVKSMSPRALNASVSGIGSVVSMIDRIAGSAPGNGSRAAVGEDLVA 1140
Query: 1141 MTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTA 1200
MTKCRLQARNFVSHDGSNGTK+MRR+TSAMPLNVVSSAGS+NDVFKP TGAETSDLESTA
Sbjct: 1141 MTKCRLQARNFVSHDGSNGTKRMRRHTSAMPLNVVSSAGSVNDVFKPLTGAETSDLESTA 1200
Query: 1201 TSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFS 1260
TS KRSR+EA+HVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGS+GTIVKCSFS
Sbjct: 1201 TSSVKRSRIEASHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAADGSDGTIVKCSFS 1260
Query: 1261 AVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEVRKEYEDLSIKAKSR 1320
AVALSPSLKSQY SAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEV KEYEDLSIKAKSR
Sbjct: 1261 AVALSPSLKSQYTSAQMSPIQPLRLLVPTNYPNCSPILLDKFPVEVSKEYEDLSIKAKSR 1320
Query: 1321 FSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
FSISLRNLSQPMSLGDIARTWDVCAR VVSEYAQQSGGGSFCS+YGAWENCLSAA
Sbjct: 1321 FSISLRNLSQPMSLGDIARTWDVCARAVVSEYAQQSGGGSFCSKYGAWENCLSAA 1363
BLAST of CmoCh19G003080.1 vs. TAIR 10
Match:
AT1G15780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G10440.1); Has 103701 Blast hits to 43153 proteins in 1828 species: Archae - 30; Bacteria - 7385; Metazoa - 38639; Fungi - 11531; Plants - 7727; Viruses - 307; Other Eukaryotes - 38082 (source: NCBI BLink). )
HSP 1 Score: 1068.5 bits (2762), Expect = 4.3e-312
Identity = 757/1399 (54.11%), Postives = 952/1399 (68.05%), Query Frame = 0
Query: 1 MDTNNWRPT-QGGEPGIEAGDWRSQLQPDSRQRIVNKIMETLKRHLPVSGHEGLSELKKI 60
MD NNWRP+ GEP ++ GDWR+QL PDSRQ+IVNKIMETLK+HLP SG EG++EL++I
Sbjct: 1 MDNNNWRPSLPNGEPAMDTGDWRTQLPPDSRQKIVNKIMETLKKHLPFSGPEGINELRRI 60
Query: 61 AVRFEEKIYTAATSQSDYLRKISLKMLTMETKSQTPMGTSLPSNPMVPSNKPLDSASQSM 120
A RFEEKI++ A +Q+DYLRKIS+KMLTMETKSQ G+S + P + +DS
Sbjct: 61 AARFEEKIFSGALNQTDYLRKISMKMLTMETKSQNAAGSS-AAIPAANNGTSIDSIP--- 120
Query: 121 QPQVLNQGQSISVPQSSNQPQSRQQLLSQNIQNNIVS--QSSSSLPSAVPPVSGLASSSM 180
NQGQ + S+NQ Q+ Q LLSQ +QNN S S++LPS++PPVS + +++
Sbjct: 121 ----TNQGQLLPGSLSTNQSQAPQPLLSQTMQNNTASGMTGSTALPSSMPPVSSITNNNT 180
Query: 181 PNMVGQNPSMQNVSGIPQNSVGNAMGQGVPSNVFTNSQRPVQGRQVVSQQQQQQAQTQQQ 240
++V QN +MQNV+G+ Q+S G G+ SN+F+ QR + GR QQ QQQ
Sbjct: 181 TSVVNQNANMQNVAGMLQDSSGQ---HGLSSNMFSGPQRQMLGRPHAMSSQQ-----QQQ 240
Query: 241 QFLFHQQQLQQQMMNKKLQQGSIPQQR--MQSHIPQQQQNLMQPNQLQSSQQSAMQPSMM 300
+L+ QQQLQQQ++ + Q G++P + SHI QQQQN++QPNQL SSQQ + S
Sbjct: 241 PYLY-QQQLQQQLLKQNFQSGNVPNPNSLLPSHIQQQQQNVLQPNQLHSSQQPGVPTSAT 300
Query: 301 QPS------LSNLQQNQQS----SIQQPTQSMLQQPQQPVLRQQPQSQQHAVMH-SQSTM 360
QPS L L NQQS S QQ TQSML+Q Q +LRQ PQSQQ + +H QS++
Sbjct: 301 QPSTVNSAPLQGLHTNQQSSPQLSSQQTTQSMLRQHQSSMLRQHPQSQQASGIHQQQSSL 360
Query: 361 SQQTSLPSQQQ-QQLISQQ-PNSSNMQQNPLIGQQNSVGDMQQHLPQQSRSHGQQSNLSN 420
QQ+ P QQQ QL+ QQ NSS +QQ ++G Q+ VGDMQQ Q R QQ+N+ N
Sbjct: 361 PQQSISPLQQQPTQLMRQQAANSSGIQQKQMMG-QHVVGDMQQQ--HQQRLLNQQNNVMN 420
Query: 421 MQSPPSQ-----------------QQQLMAQQNNLSNLQQQQLGPQSNVSGLQ--QQQMH 480
+Q SQ QQQLM+QQN+L Q LG QSNV+GLQ QQQM
Sbjct: 421 IQQQQSQQQPLQQPQQQQKQQPPAQQQLMSQQNSLQATHQNPLGTQSNVAGLQQPQQQML 480
Query: 481 GTQSGNSNMQSDQHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQ 540
+Q GNS++Q++QH +HML Q V + Q+ Q L S+QG Q Q Q SQQ
Sbjct: 481 NSQVGNSSLQNNQHSVHMLSQPTVGL-QRTHQAGHGLYSSQGQQSQNQPSQQ-------- 540
Query: 541 STQVQQQVPLHQQQ---QQQPNAMSHDLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQR 600
Q+ Q+ H QQ QQQPN + D+QQRLQ GQ SLL QNV+DQQ+QLY SQR
Sbjct: 541 --QMMPQLQSHHQQLGLQQQPNLLQQDVQQRLQASGQVTGSLLPPQNVVDQQRQLYQSQR 600
Query: 601 ALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESL 660
LPE S+SLDSTAQT ANGGDWQEE+YQKIK+MKE Y +L E+YQ++ K+ Q +S+
Sbjct: 601 TLPEMPSSSLDSTAQTESANGGDWQEEVYQKIKSMKETYLPDLNEIYQRVAAKLQQ-DSM 660
Query: 661 PQQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNP 720
PQQ +S+QL KL+ FK +LER+I FL +SKSNI+ LKDK+ +YEKQI+ FLN +RPR P
Sbjct: 661 PQQQRSDQLEKLRQFKTMLERMIQFLSVSKSNIMPALKDKVAYYEKQIIGFLNMHRPRKP 720
Query: 721 VSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNN 780
V Q GQLP S MQ +QQ QSQ QS DNQ NPQ+ S +MQG+ QQ++M NM +
Sbjct: 721 V---QQGQLPQSQMQPMQQPQSQTVQDQSHDNQTNPQMQSMSMQGAGPRAQQSSMTNMQS 780
Query: 781 MQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANN 840
+S P SA QQN+ + P SSLESGQGN+L++ QQVA GS+QQ N+SQ NN
Sbjct: 781 NVLSSRP--GVSAPQQNIPSSI-PASSLESGQGNTLNNGQQVAMGSMQQ---NTSQLVNN 840
Query: 841 SSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLK--QDPQQQLKQQMQQRQMQQLKQQQMLQ 900
SS +Q+G++TLQ N+ Q + ++LQHQHLK QD Q QLKQQ QQRQMQQ QQ +
Sbjct: 841 SSASAQSGLSTLQSNVNQPQLSSSLLQHQHLKQQQDQQMQLKQQFQQRQMQQ--QQLQAR 900
Query: 901 HQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMF-QHLPAA 960
QQQQQQ Q QQ +QL Q++ +N + RQG+ GMF QH
Sbjct: 901 QQQQQQQLQARQQAAQL--------------QQMNDMNDLTSRQGMNVSRGMFQQHSMQG 960
Query: 961 HRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSP 1020
R+ Y QQ+KPG SPQ+ Q ASPQ++Q+ SPQVDQ+N ++ + PLQ A+SP
Sbjct: 961 QRANYP-LQQLKPGA--VSSPQLLQGASPQMSQHLSPQVDQKNTVNKMG--TPLQPANSP 1020
Query: 1021 LVVLSP-STPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISAS 1080
VV SP STP+APSPM DSEKP G S+L+ QQ + VQSLAIGTPGISAS
Sbjct: 1021 FVVPSPSSTPLAPSPMQVDSEKP--GSSSLSMGNIARQQATGMQGVVQSLAIGTPGISAS 1080
Query: 1081 PLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMI 1140
PLL EF+ DG N+ SGK SATE P+ERLI+AVKS+SP+ALS++V+ IGSVVSM+
Sbjct: 1081 PLLQEFTSPDGNILNSSTITSGKPSATELPIERLIRAVKSISPQALSSAVSDIGSVVSMV 1140
Query: 1141 DRVAGSAPGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSA 1200
DR+AGSAPGNGSRA+VGEDLVAMTKCRLQARNF++ +G TKKM+R+T+AMPL+V S
Sbjct: 1141 DRIAGSAPGNGSRASVGEDLVAMTKCRLQARNFMTQEGMMATKKMKRHTTAMPLSVASLG 1200
Query: 1201 GSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISD--E 1260
GS+ D +K F G+ETSDLESTATS K++R E H LLEEI+EINQRLIDTVV ISD +
Sbjct: 1201 GSVGDNYKQFAGSETSDLESTATSDGKKARTETEHALLEEIKEINQRLIDTVVEISDDED 1260
Query: 1261 VVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQMSPIQPLRLLVPTNYPNCSP 1320
DPS +A ++ G EGT V+ SF AV+LSP+LK+ S QMSPIQPLRLLVP +YPN SP
Sbjct: 1261 AADPSEVAISSIGCEGTTVRFSFIAVSLSPALKAHLSSTQMSPIQPLRLLVPCSYPNGSP 1320
Query: 1321 ILLDKFPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQS 1354
LLDK PVE KE EDLS KA +RF+I LR+LSQPMSL DIA+TWD CAR V+ EYAQQ
Sbjct: 1321 SLLDKLPVETSKENEDLSSKAMARFNILLRSLSQPMSLKDIAKTWDACARAVICEYAQQF 1335
BLAST of CmoCh19G003080.1 vs. TAIR 10
Match:
AT2G10440.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 1628 Blast hits to 1350 proteins in 149 species: Archae - 0; Bacteria - 39; Metazoa - 480; Fungi - 159; Plants - 187; Viruses - 2; Other Eukaryotes - 761 (source: NCBI BLink). )
HSP 1 Score: 263.1 bits (671), Expect = 1.3e-69
Identity = 263/820 (32.07%), Postives = 396/820 (48.29%), Query Frame = 0
Query: 540 SSLLQSQNVMDQQKQLYHSQRALPETSSTSLDSTAQTGQANGGDWQEEIYQKIKAMKELY 599
SS+ +++ + QK ++ + + S DST +T N G+WQEE YQKIK +KE+
Sbjct: 186 SSIKLTKHSITDQKSVFDTTVLIMNIIVASQDSTGKTVNVNAGNWQEETYQKIKKLKEMC 245
Query: 600 FFELKEMYQKILPKVNQLESLPQQPKSEQ-LNKLKTFKLILERLIAFLQISKSNIVIGLK 659
L M+Q++ K+ + ESLP QP Q + KLK KL +E L+ FL + +S++ +
Sbjct: 246 LPVLSLMHQRVAEKLRETESLPPQPMQAQWIEKLKAGKLSMEHLMFFLNVHRSSVSEKHR 305
Query: 660 DKIGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQL 719
DK YE I+ F S + Q GQ P S Q+ Q QSP ++ L
Sbjct: 306 DKFSQYEYHILKFTKSQTMVLRPTQQQQGQFPPS--QTAMQT-------QSPQVHVSQSL 365
Query: 720 HSANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIPMQPGSSLESGQGNSLSS 779
+ + + P QN +++ ++ P +N+ ++S
Sbjct: 366 YKEQRRSRLMPSSQNEASSLLQIRPKLDP------RDENII----------------MAS 425
Query: 780 FQQVASGSLQQNPSNSSQRANNSSLPSQNGVNTLQPNIGSLQTNHNMLQHQHLKQDPQQQ 839
V S++QNP VNT NI S+Q+ LQ
Sbjct: 426 SGNVMLPSVKQNP---------------RAVNT---NISSVQS----LQ----------- 485
Query: 840 LKQQMQQRQMQQLKQQQMLQHQQQQQQPQLHQQQSQLHQQGKPQLPAQMQAHQLSHLNQI 899
KQ++ Q QQQQPQ Q Q+ Q + +N +
Sbjct: 486 -------------KQKRFHHRQMQQQQPQQGNHQHQM---------------QTNEMNDV 545
Query: 900 EMRQGLATKPGMFQHLPAAHRSGYTHQQQMKPGTSLPISPQIFQTASPQVAQNSSPQ-VD 959
MR+ + K + +Q + + Q +S Q+ +SSPQ VD
Sbjct: 546 RMRERVNIKARLL--------------EQQVSSSQRQVPKQESNVSSSQIQNHSSPQLVD 605
Query: 960 QQNLLSSITKV-PPLQSASSPLVVLSPSTPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQT 1019
Q L ++I K PL S+ S V APSP+PGDSE P S S ++
Sbjct: 606 QHILPATINKTGTPLNSSGSAFVA------PAPSPVPGDSEMPISVESPVSGV------D 665
Query: 1020 SVSGTQVQSLAIGTPGISASPLLAEFSGTDGAYANALPTVSGKSSATEQPLERLIKAVKS 1079
++ T S +GT +PLL V TE+P++RLIKA ++
Sbjct: 666 EINSTLDSSSKLGT---QETPLL---------------FVPPPEPITERPIDRLIKAFQA 725
Query: 1080 MSPKALSASVNGIGSVVSMIDRVAGSAPGN-GSRAAVGEDLVAMTKCRLQARNFVSHDGS 1139
SPK+L+ SV+ I SV+SM+D + GS P + GSRA +GEDL T RNF +H+ +
Sbjct: 726 ASPKSLAESVSEISSVISMVDMIGGSFPSSGGSRAGLGEDLSERT------RNFTTHEET 785
Query: 1140 NGTKKMRRYTSAMPLNVVSSAGSINDVFKPFTGAETSDLESTATSRAKRSRVEANHVLLE 1199
N +K+M+R + +P ++ S D ++ + E S++ ST +S K + + + LL+
Sbjct: 786 NLSKRMKRSINIVPPDMSSQI----DSYEQLSSLE-SEVVSTTSSGLKVNNIAPGYALLQ 845
Query: 1200 EIREINQRLIDTVVVISDEVVDPSALAAAADGSEGTIVKCSFSAVALSPSLKSQYMSAQM 1259
EI+E N RL++TVV I DE S GTIV C+++ VALS + K Y S ++
Sbjct: 846 EIKETNGRLVETVVEICDE------------DSLGTIVTCTYAPVALSATFKDHYKSGKI 845
Query: 1260 SPIQPLRLLVPTNYPNCSPILLDK--FPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLG 1319
+ IQPLRLL P +YP SPI+L++ F V K YEDLS + +SRFS+S++ S+P
Sbjct: 906 AQIQPLRLLFPMDYPYSSPIVLEEISFDTSVHK-YEDLSARTRSRFSLSMKEFSEPGFSK 845
Query: 1320 DIARTWDVCARTVVSEYAQQSGGGSFCSRYGAWENCLSAA 1354
IA+TW+ CAR + EYA++ GGG+F S+YGAWE L A+
Sbjct: 966 GIAQTWNDCARATMVEYAERHGGGTFSSKYGAWETVLRAS 845
BLAST of CmoCh19G003080.1 vs. TAIR 10
Match:
AT2G10440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 8319 Blast hits to 5104 proteins in 317 species: Archae - 0; Bacteria - 285; Metazoa - 1706; Fungi - 535; Plants - 320; Viruses - 18; Other Eukaryotes - 5455 (source: NCBI BLink). )
HSP 1 Score: 261.5 bits (667), Expect = 3.6e-69
Identity = 308/980 (31.43%), Postives = 454/980 (46.33%), Query Frame = 0
Query: 396 QQSNLSNMQSPPSQQQQLMAQQNNLSNLQQQQLGPQSNVSGLQQQQMHGTQSGNSNMQSD 455
Q+S + +Q+QL+ Q NL P S + QQ G +S+ Q++
Sbjct: 147 QKSVFDTTEQKRQEQEQLINQLTNL---------PTSRPNNRDQQ---GAFQVSSSQQNN 206
Query: 456 QHPMHMLQQNKVHMQQQPPQNASNLLSAQGPQGQLQSSQQLMSQIPLQSTQVQQQVPLHQ 515
+H + Q K ++Q + QQ+ P+ S Q +QQ P+ Q
Sbjct: 207 NVTLHAMSQQKNNLQ------------------SMTRGQQVGQSQPMMSQQYRQQYPM-Q 266
Query: 516 QQQQQPNAMSH-DLQQRLQVGGQAPSSLLQSQNVMDQQKQLYHSQRALP-----ETSSTS 575
Q Q N H D Q QA SSL Q+QN+ DQQ Q +RA P S
Sbjct: 267 QDPQNRNLQKHLDFVQNNTNQFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVAS 326
Query: 576 LDSTAQTGQANGGDWQEEIYQKIKAMKELYFFELKEMYQKILPKVNQLESLPQQPKSEQ- 635
DST +T N G+WQEE YQKIK +KE+ L M+Q++ K+ + ESLP QP Q
Sbjct: 327 QDSTGKTVNVNAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQW 386
Query: 636 LNKLKTFKLILERLIAFLQISKSNIVIGLKDKIGHYEKQIVSFLNSNRPRNPVSTLQPGQ 695
+ KLK KL +E L+ FL + +S++ +DK YE I+ F S + Q GQ
Sbjct: 387 IEKLKAGKLSMEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQQQQGQ 446
Query: 696 LPASHMQSIQQAQSQMTPLQSPDNQINPQLHSANMQGSVAPVQQNNMNNMNNMQHNSLPT 755
P S Q+ Q QSP ++ L+ + + P QN +++ ++ P
Sbjct: 447 FPPS--QTAMQT-------QSPQVHVSQSLYKEQRRSRLMPSSQNEASSLLQIRPKLDP- 506
Query: 756 FSGSAAQQNMTIPMQPGSSLESGQGNSLSSFQQVASGSLQQNPSNSSQRANNSSLPSQNG 815
+N+ ++S V S++QNP
Sbjct: 507 -----RDENII----------------MASSGNVMLPSVKQNP---------------RA 566
Query: 816 VNTLQPNIGSLQTNHNMLQHQHLKQDPQQQLKQQMQQRQMQQLKQQQMLQHQQQQQQPQL 875
VNT NI S+Q+ LQ KQ++ Q QQQQPQ
Sbjct: 567 VNT---NISSVQS----LQ------------------------KQKRFHHRQMQQQQPQQ 626
Query: 876 HQQQSQLHQQGKPQLPAQMQAHQLSHLNQIEMRQGLATKPGMFQHLPAAHRSGYTHQQQM 935
Q Q+ Q + +N + MR+ + K + +Q
Sbjct: 627 GNHQHQM---------------QTNEMNDVRMRERVNIKARLL--------------EQQ 686
Query: 936 KPGTSLPISPQIFQTASPQVAQNSSPQ-VDQQNLLSSITKV-PPLQSASSPLVVLSPSTP 995
+ + Q +S Q+ +SSPQ VDQ L ++I K PL S+ S V
Sbjct: 687 VSSSQRQVPKQESNVSSSQIQNHSSPQLVDQHILPATINKTGTPLNSSGSAFVA------ 746
Query: 996 VAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFSGTD 1055
APSP+PGDSE P S S ++ ++ T S +GT +PLL
Sbjct: 747 PAPSPVPGDSEMPISVESPVSGV------DEINSTLDSSSKLGT---QETPLL------- 806
Query: 1056 GAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSAPGN 1115
V TE+P++RLIKA ++ SPK+L+ SV+ I SV+SM+D + GS P +
Sbjct: 807 --------FVPPPEPITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSS 866
Query: 1116 -GSRAAVGEDLVAMTKCRLQARNFVSHDGSNGTKKMRRYTSAMPLNVVSSAGSINDVFKP 1175
GSRA +GEDL T RNF +H+ +N +K+M+R + +P ++ S D ++
Sbjct: 867 GGSRAGLGEDLSERT------RNFTTHEETNLSKRMKRSINIVPPDMSSQI----DSYEQ 926
Query: 1176 FTGAETSDLESTATSRAKRSRVEANHVLLEEIREINQRLIDTVVVISDEVVDPSALAAAA 1235
+ E S++ ST +S K + + + LL+EI+E N RL++TVV I DE
Sbjct: 927 LSSLE-SEVVSTTSSGLKVNNIAPGYALLQEIKETNGRLVETVVEICDE----------- 935
Query: 1236 DGSEGTIVKCSFSAVALSPSLKSQYMSAQM----------SPIQPLRLLVPTNYPNCSPI 1295
S GTIV C+++ VALS + K Y S ++ + IQPLRLL P +YP SPI
Sbjct: 987 -DSLGTIVTCTYAPVALSATFKDHYKSGKIIFYVSKCLMQAQIQPLRLLFPMDYPYSSPI 935
Query: 1296 LLDK--FPVEVRKEYEDLSIKAKSRFSISLRNLSQPMSLGDIARTWDVCARTVVSEYAQQ 1354
+L++ F V K YEDLS + +SRFS+S++ S+P IA+TW+ CAR + EYA++
Sbjct: 1047 VLEEISFDTSVHK-YEDLSARTRSRFSLSMKEFSEPGFSKGIAQTWNDCARATMVEYAER 935
BLAST of CmoCh19G003080.1 vs. TAIR 10
Match:
AT1G15770.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 117.9 bits (294), Expect = 6.5e-26
Identity = 101/273 (37.00%), Postives = 150/273 (54.95%), Query Frame = 0
Query: 867 QLHQQQ-SQLHQQGKPQLPAQ-MQAHQLSHLNQIE-MRQGLATKPGMFQHLPAAHRSGYT 926
Q+HQ+ ++++Q+ +L + +HQ +Q E +++G GM + L +
Sbjct: 10 QIHQRDLNEIYQRVAAKLQQEDSLSHQKQRSDQFEKLKRGKTVLEGMLRFLSLS------ 69
Query: 927 HQQQMKPGTSLPISPQIFQTASPQVAQNSSPQVDQQNLLSSITKVPPLQSASSPLVVLSP 986
+ +KP + + N ++ Q+L ++ K+ +S P+ P
Sbjct: 70 -KSNIKPD---------LKDSMDYRKNNIMNFLNMQSLRKTVQKLQLTKSEIQPM--QQP 129
Query: 987 STPVAPSPMPGDSEKPTSGVSTLTNAGNTGQQTSVSGTQVQSLAIGTPGISASPLLAEFS 1046
+ D ++ AG+ QQ + +QSL IGTPGISASPLL E +
Sbjct: 130 LSQTVQDQSHDDQTTLQMQSMSMQGAGSRVQQ--IRQGVLQSLEIGTPGISASPLLPELT 189
Query: 1047 GTDGAYANALPTVSGKSSATEQPLERLIKAVKSMSPKALSASVNGIGSVVSMIDRVAGSA 1106
DG N L + GKSSATE P+ERLI+A+KS+SP+ALS++V I SVVSM+DR+AGS
Sbjct: 190 SPDGNIINPLTSTCGKSSATELPIERLIRAMKSISPQALSSAVCDIRSVVSMVDRIAGSV 249
Query: 1107 PGNGSRAAVGEDLVAMTKCRLQARNFVSHDGSN 1137
PG GSRA+ G DLVAMTKC LQ RNF++ DG +
Sbjct: 250 PGKGSRASFGVDLVAMTKCHLQERNFMTQDGDH 262
HSP 2 Score: 74.7 bits (182), Expect = 6.3e-13
Identity = 59/161 (36.65%), Postives = 91/161 (56.52%), Query Frame = 0
Query: 602 ELKEMYQKILPKVNQLESLP-QQPKSEQLNKLKTFKLILERLIAFLQISKSNIVIGLKDK 661
+L E+YQ++ K+ Q +SL Q+ +S+Q KLK K +LE ++ FL +SKSNI LKD
Sbjct: 15 DLNEIYQRVAAKLQQEDSLSHQKQRSDQFEKLKRGKTVLEGMLRFLSLSKSNIKPDLKDS 74
Query: 662 IGHYEKQIVSFLNSNRPRNPVSTLQPGQLPASHMQSIQQAQSQMTPLQSPDNQINPQLHS 721
+ + + I++FLN R T+Q QL S +Q +QQ SQ QS D+Q Q+ S
Sbjct: 75 MDYRKNNIMNFLNMQSLR---KTVQKLQLTKSEIQPMQQPLSQTVQDQSHDDQTTLQMQS 134
Query: 722 ANMQGSVAPVQQNNMNNMNNMQHNSLPTFSGSAAQQNMTIP 762
+MQG+ + VQQ + +++ + P S S +T P
Sbjct: 135 MSMQGAGSRVQQIRQGVLQSLEIGT-PGISASPLLPELTSP 171
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
F4I171 | 6.0e-311 | 54.11 | Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana ... | [more] |
Q9SHV7 | 5.1e-68 | 31.43 | Probable mediator of RNA polymerase II transcription subunit 15c OS=Arabidopsis ... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1HI15 | 0.0e+00 | 100.00 | mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucur... | [more] |
A0A6J1HR60 | 0.0e+00 | 98.38 | mediator of RNA polymerase II transcription subunit 15a-like isoform X1 OS=Cucur... | [more] |
A0A6J1HI99 | 0.0e+00 | 100.00 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucur... | [more] |
A0A6J1HVE4 | 0.0e+00 | 98.35 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucur... | [more] |
A0A6J1ELX0 | 0.0e+00 | 84.80 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Cucur... | [more] |
Match Name | E-value | Identity | Description | |
AT1G15780.1 | 4.3e-312 | 54.11 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT2G10440.2 | 1.3e-69 | 32.07 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT2G10440.1 | 3.6e-69 | 31.43 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT1G15770.1 | 6.5e-26 | 37.00 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |