Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGCCGAGCTAGGCCAACAAACGGTCGAGTTCTCTGCCCTGGTTTCCCGTGCTGCTGAGGACTCTTTCCTCTCCCTCAAAGAACTTGTTGACAAGTCCAAATCCTCCGACCAATCCGATTCCGATAAGAAGGTTAATATCCTCAAGTATGTCTTCAAGACCCAGCAGAGGATACTCCGCCTTTATGCCCTTGCCAAGTGGTGTCAACAGGTTGGTTGCTTGCTTTCCCTCTGTCTTCTTTTACTTCGAATTTTTAAATTATTGTCTTCCCAATTCTCTATGCTCGCGACTTACTTCGAAATTGAGTATACTGTGATTTTTTCTTCGCCATGAATATGGAAGACTGGGTACTTCTTTAGCTGTTTGTGTCTCATGGAGGTAGTAATGCTAGGTGAGGTCGTTGGGAATTAGACTTACCACATTCAGGAGGTTATTCTTCTTAAAATTGACTTTGAAATGCTCATGATCATGTTGATTAAGCTTTGGGGAATGAAGACAGGGTTAGGTATAACTGTATGAGTGGAAGGTATGTGTTGAACTTTTCCAAGATTCACAATCTTCTCTATCTTTTTTTTGGTTCGAGCCAAGGAGGAGGCTTTTCTCTGTGCCGTCTAGGAAAGGAACCACGTTTCTCTGTTATGGTTTATCTTCGTTGCCCAAGCAGAATGATCGATAGAGGGGTGGAAAAAGGGTATTTTGAAGGTTGTTAGTAGGGAGGGTGTTCACTTCATATAACTACCGTCTGCGGGCGACATCATGGTTAACGAGGAGTCTTTTTATAATCCTCAGAAGGCTTTCGATGCCCTTGAGGAATTATCTTGTTGGGGTAGCCTGTGGCGATCAAACCAACAATATCAAGATTCGAGATGAGTGGGCTTTGGCTAGTTTGTTTACATTGGTTTATTTAAGTTGATAAGAAACCATTCCATTCACCTAGGATCATGGAGAAAGAGGAAGATTGGCTTTCTCTAATTAGCTAAAGAGTTTACAAGAAAGCCCCAGTTGACAGACAACAAAAGGATTATAATTACCACCTTTAAAAAATGCCTACGGGAAAGTTGAATCATTTGCATTGTGCTTACAATCTATACGAAAATACCAATCAACCCAAGGGCATCCACAGCTAGTACTTAACGAGTAGAATGAGATCACACTGCTTGAACCTCTAGACAAATTTTCTACCCCCCCAAAAAAAAAAAAAAAAAAGAAAAAAAGAAAAAAAAAGAAAAAGAAAAAGAAAAATCCTCTAGACTCCCTTGAGCCAACTTCCAAATAATGCAATGTGAACGTTGAGATGCCTCATATTGGAAATCAAACTGGAAAGATTGCCCATTCTTGAATTTCATTTTCCATGATATTTCTTCTGAAGTTTATCCTCCATTTCCACCAACCATCCGTTTTTTGGAAAAAAATGAGGAATGTTGAATCCAAAAATGATCAAACCTAAGTTGGATGCCTCTTCGAACTTCTCATTTCTGGAAAAATCTGGAAGTGTTCTGGAGTATCTCTACACCCTTTAGAAACCATATAATTCTAAAGTTCACATGCCAGGAAAAAACAACTGTCCAGACGAGAGTAGATGTTGACTATTGACTTCCTATGTTCAATGCTGATTTTTCTAAAATAAGAACTGATGCCTATAAGTTGTGGACTTTGATTGTTTTTGTTTTGCTTTGTTTTTTGTTTTTGGTAAGAAACTGAGCTTTTATTGGGGAAAAAATGAAAGAATATACAAGGGCATTAAAAAAAGTCCAATAGATGGGAGTCCTGGAACTAACAACAAAAGAACTATAATTCAACAACGTCAAACCGAGATCATTTTTTAGAAAAAGAAACATTTCATTGATTGATGAAATAAGGGAAAACCCCAACACCTAGAGGTGATTAGAAGAAAGCTCTCCAATTGGAAATTAAATAAGACACGCTATAATGAGTAAAAGGATACTTGGATTTGCACCCAATTAATGCTGAAGATAGAACGAAATCCATAAAACAATCAAAAGAATATTTGAAATGACGTACATTGTACTCACCCCAAAGCAGCCAAAAGAAAGCTCGCAAAACAGTGAACCAAATTGTCCTTTTAGTATAACCAAAAGGATGACTCACTAAAAAGGAAGCAAGATCATAAATGAAATTAGGACACGACATGTACCGACCAAAAGCCTCCAAAATAATACGCTAGAAGCGAGTTGCAAAAGAACAATAAAAAAATAGGTGAGCTAGAGGTTCAACATGTTATCGACACATGGGGCACCAAGAAGGGGAAAGATGCATATTAGAGATGTCCAATTTCCCCACGGGGACGGGGATTCCCCCATTAGGCGGGAATGGGGGAGGTAGTGGGGAAAAAAAATTCCCCGCGAGCTAAACGGGGACGGGGAACGCATTCCCCGTCCCTACCCCCGCCCCCGCCTCTGCCCTGTTACCTGTCCCTGCCCCTTAGCTTCTACATATTTATTTAGTATAATTATCTAATGTTATACTATTATTATTATTATATAAATATTACATTTAAATTTCAAATTTGATTATTTATTGAGAAAGATAATGAAAATGTTTAAATTGAATTATTTAAATTTAGATTGTATATGTGAATAATTTGATTTATATTTATTTCTTTCTACTAAAAAAATTAATTAGTTTTTTTAGGCCAAAATTTTAGTAATTTATCTTTAAATTCAAATTGTCATGTAAAACTAACCATAATAAACAATTTAGTGATAAAGTTAATTATTTAATTAAAATTTGTTCACATTAGTGATTTAAATTAATTGGTTTTTTACAAAAAAAAAGGTAACGGGGGAAAATTCCCCGCGGGGAACCTGATCCCAGCCAATTTCCCACGAGGAATCCCCGCCTCGATCCCCGTGGGGAATTTCACGGGGATGGGGAATGAAATGGGGAGCGGGGACGGGGATGGGGAATGGCATCCCCGGCCTCGTCCCGCCTCGTGGACATCTCTAATGCATATAATGCATTCGTCGTTGTAACCTATCAACCATATTAATAGCTTCAAGTCTCAGCTCTCAAAGGAAAATTTTAATTTTCTTTGGATAATGATCTTTTGGAATAACAAAATAAAAATCCTTCAAAAGAGGATCGACCATACCCACTGAATCCGTTATGAGAGATTTAGCAATGAAAATCGAAGAGCAATCAAGAGGCCAGGACCAAGTATTAGGAGAAAGATGAATTCTGATTGATGACAAAAGATGAGATATCCTTGTTGGTTGGGATAAAATTCTCTCTCCTCTCAGTCACGGCGGTTTGGGGATTACTAATCTTAAGCAGAGGAATGAAGCTCTCCTTGCCAAAGGGATTTGAAGATATAGGGTGGAAAAACGTGCTCTGTGGAGAAGGGTCATCACATCCAAATTTGGCTCTACTAAATGTGATAACAAAGCTGGTTCCCATTTGTTGTCCACTGCCAAAGGTCCACATGTTGCAGTGGACATTAGGCAATGGGTGTTTTGTTTCTTTTTGGTACGATTCTTGGGTGAGCCCCATCCCCTTGCACACGACCTTTGCCAACTTATTTGCTCTCGCCCTTGCTAAGGAGACTTTGGTATTCGATTCATGGAAAGAGGTTGAGGGGTGTTGGGACCTCAACCTCCGTAGGAACCTCAACGAGCTGGAAATATGTGAATGGGCTGATTTATTGCGTGTTAACCAATATCCAACCTCAACAAATGACAAATGTTTAGGGAAGCATGAATCATCTAGGATCTTCTCCACCAAATCCTTGATTGCTCTCGAGCACAAGATCAATACTAATGTTTTTGCTTGCCTTTCCAAATCTATATGGTCTGACCTTTACCCTAAAAAGATTAAATTCTTTTGTGTGAGCTTTCCATAAGGCTGTTCATACTTATGACAAGCTTCAACGCTGATTGTCCTTTTTGGTGGTTTCCTCCGGTTGGTGCACACTTTGCAAGAACGACAATGAGACTCAAAGCCACCTTTGCATTTTCTGCCCTTTTCTAAACAGATTTTGGAACCACTTTCTCTATGCTTTTAATTGGAATGTGGTCTTTCCATCAGATGTCACCAATTTTCTATCTCTATTACTTGAGGGACATCCTTTTAAAGGGAGAAAGAGCCTCCTTTGGATGCAACTCATAAGAGTCTTCTTGTGGTCGGTGTGGATTGAACGAAATAGCCGTCTTTTCAACGACAAAGCTCAACAATTCAAATTTTTTTAGAGCATGTTATTTTCATCGCTCTTGGTGTAAATTGTCGAAGCTATTCCATTCATATAGTTTTGATTCTCTTTTGACCAATTGGAAATGTCTTTTGTAACTCCTTTGGCTCGGGGCTTCTTTCCCCTTCCCTTTTGTAATTTCACACATCAATGAAATTGTTTCTTATTAAAAAAAAAAAACACAAAAGATGAGATAAAGTAGCCCATTCAGAAATCTCCAAATCATTAAGATGACGACGAAGACGCAAATCCCAAGCATTCGAATCAGCAATCCACACATCTGACACACTATCATTAGGATTGAGAGCAAGATGGTAAAGACGAGGGAAAATAGAAAAATTGACCACTATACCGAGATTATAATCACAAAAAAACCTAGTGACGGGCGCCCGCAAGGAACCTTAAACATCACCACCTTTCGAACCTCGTTACACCACCTCTTTACCCTCTTTAAAAATTCGATTATTTCTTTTGAGCCAAATTATGCATATAATGTTAGAGAAACTAGCATGCACAAAACTTGGCCCTTCTCTAGAAAAGGCGAACATAGAAGCACCTCCTCCATCATACCATGACTTTTCTAATCTAAACATAAGCCAAATGAACTCATCCTTTGATTCCAAAGTGAACGAGCAAATTGAGAACCCCATAACAAGTGATGGAGGTCACTTTCCTAACGCCTACAAAGAATGCAACATTTTAGGTATTACACAAAGGAAGTGTGTCGGTAAACACAATCCATAATGTTGACTATCCCATATAAAATTTGTCACACAAATTCTTTTAGCTTTCTAGGGATGCTAACCTTCCAAAGAGGAGAAAGCCAAGGTGGCTTGGGGTAGGGAAAGGATAAAAGTAATAATAAAAATAAAAAGATATAAAATAATAAATGATGACTATTATTTTAATCTTCGTTGTAATTGTTGTGTTGCATCAACGGGATATCTTTGGTGCAAATGACATGAAGATTGATCTTATTTATTTATCTTGTGTTATCTCTCTTTTTTTTTTTTTTTTTTTTTTTGAAACGAAAACGTAACTTTTCATTAATATAATGAAGGAGGCTAATGCTCAAAATACAATGCGACATAATGATCATAGAGGACATACTAAAAGTAGGGTCAGTGGGTGCACCTGGCCATCTCAACGGTTGACACAACCTTAGCACTCATCATTTCGAATGCAAAATATATCCAGAGAACAAAAGGTAATACATCATAGAGTTCTATGGCAATACATTAGTAAAATGTGAATGTAAATTAAGTTGGAGAAATGAAAACATTCCAATTGAGACATATGTCTTGGATTGAATAATGAGCAAAAAGTTTGGATTGTGAACCCCAAGAAGAAGCAAGGAGTCGAGTAGAACCAAAGCGATCCGACCACTCAATCGACTTGTCGTGAGAGACCCGGGGGTAGTGTTTTCAAAGCGCAAGGCGCACCTTAAGGCGACAAGTCTCCCAATTGCCTCAAGGCGAGAGGCGATAAAAGGGCGTAGCCCGAGTTAAGCAAGGTGCATCTTAATAAAATAATAAATAAATATTATCTAAAACTATAATAGGTAAAATATAAGAGTATTTAATTTTCAAAATACAAAAAATATCAATCAAGCTATTTTATATTTAAATAAAAATTAATTTTTGTGATAATTACAAAAAAAAGCATATGTAATTTAGAAAACAAATAGTTCTATGAGAGTGAGTTTATGCAATTAAATTGAATATGTTTATAATGTTATTTGTTTAGTAGTTGGCTTTTCTTCTTAAATGGAAAATGATACTAGCAAGATACTGATTAGAAAAGAGAACTAAACGCCATCTGGCAGTCATGGAAAGTGGACGAGAGAAATAGATGGCCATCTGGCATTCATGGAGGAGATATACTACATGCAACATGCAACCAACTCTCTTTATTGAAAAATGAAGGCGTGCATTTTCATAACTAGACTTTTCTTCTTCTTTTTCTTCTTTTATCTTCCTTGACACATCCTATTTTTCTTCTTGTATGGTAATTTTTCACTTTTCTACCTTTTCTTCCACTTTTACTTCTTCTCCTCTTCTGTATCTTCTTTCTTGTTCTTCTTCTTCATTTTTTTATCTTTCTAATCATCTAACTCTTCTAGTCTTTATATAATTGTTGAAAAAAGAGGAAGAAAAGAACCCAAGATTTACGTGGAAAATCCTAGATCAGGGAGAAAAAACCACGATAGAGACTGATTTTTATTATTAATTCTGATACACAATAATACAAGAGAATAGGCCAAATAAATAGACTAGGAGAAGCCTTAGGGTACAAATAATTACTAAACTACCCCTAAGGTAACAAGCTGATTTTCGACACTCCCCCTCAAGTTGGGGCGTAAATATCATTAAGGCCCAACTTGCTGACACATTAGTCAAAATTTTATCTCAGTAACCCCTTTTGTGAGCACATCTGCAATTTGCTGATTTGAGGGAACATAAGGGATGCGTATACTTTCGTTGTCAAGTCTTTCCTTGATGAAGTGCCGGTCGATCTCAACGTGTTTTGTTTTGTCATGCTGAACAGGATTATTTGCAATACTTATTGTTGCCTTGTTGTCACAAAACAGTTTCATTGGGAGCTCATTGTCTTGATGAAGATCTGACAACACTTTCATCAACCAGATTTCTTCACATATCCCTAAACTCATGGCCCGATACTCAGCTTCCGCACTATTTCTGGCCACAACTCCCTGTTTCTTGCTCCTCCAGGTCACTAGATTACCCCAAACAAATGTGCAATACCCAGATGTTGATTTTCTATCAACAATAGACCCTGCCCAACCAGAATCAGTGTAGGCTTCCATGCAGCGTTTATCAGACTTCTTGAACATTAACCCCTTACCTGGTGTACCTTTCAAATATCATAGGATGCGTTCAACGGCTGTCATGTGTTCCTCATAAGGAGCCTGCATAAATTAACTGACAACACTCACTGCATAGGAAATATCAGGTCTCGTATGAGATAAGTATATCAATTTCCCAACTAACCGCTGATACCTTTCTTTATTGATCGGGACACTGTTACTGGTATTGCTGAGCTTTGAATTATATTCAACTGGGGTGTCAGCAGGTTTGCACCCAGTCATTCCTGTTTCCGTTAGAAGATCAATCGTGTATTTCCGTTGGGACACTGAGATCCCGTCCCTTGACCGGGCTACTTCCATTCCGAGAAAGTATCTTAACTTCCCAAGGTCTTTAATTTCAAACTCTTTGGCCATCTTTGCTTTGAGTTTCACAATTTCATCATTATCATCACCGGAGAGCACAATATCGTCAACATACACAATGAGAATGGAAACCTTCCCTGAAGTTGACCTTTTTGTAAATAGTGTGTGGTCTGACTGCCCTTGGTTATACCCTTGTCTTTTGACAAAGGTGGCAAATCTGTCAAACCAGGCTCTCGGAGATTGTTTCAACCCATATCGCGACTTTCTCAGTCTGCAGACTCGCTGTTTGAACTGATCTCCAAATCCGGGGGAGGATTCATGTAGACTTCCTCCTCAAGTTCTCCATTGAGAAAGGCATTCTTTACATCCAGTTGATGCAAAGGCCAATCTCTGTTAGCTGCAACTGATAGCAAAACTCGAATAGTATTAAGTTTAGCTACGGGAGAGAATGTCTCGGAGTAATCTACCCCAAAGGTTTGGGTAAACCCTTTGGCAACTAGCCTGGCCTTGTACTGATCAACAGTCCTGTCTGACTTGTACTTTATAGTGAAAACCCATTTGCATCCAACTGTTTTGTGCCCTTTGGGTAGGATCACCTGGTCCCATGTTCCATTTTTCTCAAGAACCCTCATCTCCTCCATAACAACAGCCTTCCAAATTTCAGTTTTCATTGCTTCCTGTATGTTACTCGGTATCACCCCTGTATCTAAACTGGTGGTAAACGCCCTAAATTCAGGTGACAAGTTACTGTAGGTCATGTAACTATGCATAGGGTACTTTGTACATGACCAAATACCTTTCCTCAGGGCTATTGGAAGATCAAGAGAAGCATCATGCTCCTTCAGTATGTCAGAATTTCAACTGTGATCACTCATCTCCTCTGCTCGGGTTTCATCTTCAGACATCACCTTTTCACCATTATCTCTAGTATCACTCACATTTTCTAGAGCTGTGTCCGAATCATTATTTTCGTAGTCTTCCCATCCGCTATAGTGATGCTTTCAGCACCCTTTTCTCCTTTTTCCTCTCTACATTCGTTAGTCTAAGCACAAGTATCAATCTCATTAGTTTCAGGAATACACGTACCTGATGTCGGGTTTGACTCATGGACCGGAGTCGAAGGTGCAACAGGTGACGTTACTTCCTTCCTGAGATTCTTCCTATAGTATGTTATCCATGGGACTTGATTGGTAGGGAAGATAGGACCGACATGTTCGGGAATGGGAGATGACTCCAAGGGAACTGTATAAGTGGACCAATTAGGCTCTTCACTAGATATCTCCCCCTGAAGATGGCTAACAGGGAAGAAAGGTTGATCCTCAAGGAAGGTGATATCCATGGAAACAAAATATTTGCGAGAGGATGGGTGGTAACATTTATACCCACGTTGATGAAGTGGATACCCAACAAACACGCATTTTTGAGCACGGGGGTAAATTTCGTGCGGTTTGGACCATGGGAGTGAACAAAAGCAACACAGCCAAAGACTCGAAGTGGTACATCGGAGATTAAACGAGTAGTGGGGTAGGACTCTTTCAAACAGTCTAATGGATTTTGAAAGTTGAGGACTCGGGAAGACATTCGGTTGATTAAGTGAGCATCATTAAGGATGACATCACCCCATAGGTAAGACGGGAGTGATGTGGATAGCATTAAGGACCTGGCAATTTCCACCAAATGACGATTTTTTCTTTCGGCTACCCCGTTTTGCTGGGGAGTGTAGGCACACGAACTTTGGTGGACAATCCCTTTGGATGCAAGGAATTCCTGGAAGGAAGTGTTAAAGAATTCNACACGAACTTTGGTGGACAATCCCTTTGGATGCAAGGAATTCCTGGAAGGAAGTGTTAAAGAATTCCCGCCCATTGTCACTTCTCAAAATACCAATCCTGGCACTAAACTGGGTTTCCACAGTAGCATAGAATTGCTGGAATACAGATAACACTTCGGATTTATCAGTGAGAAGAAATACCCAAGTAAGCCGAGTGTGGTCATCTATAAAGGTGACAAACTAGCGTTTCCCTGTGGAGGTAGTGGTTGGTGAGGGATCCCAAACATCACTGTGGACTAAGGTGAAGGATTTGGATGGTTTATAGGGTTGGGAATGGAAGGAGACACGACTCTGTTTGGCACGAATACACACGTCACAATTTAAAGAAAACATATTAACATTGCGAAATAAATGCGGAAATAAGAATTTCATGTATTGAAAATTTGGATGTCCTAAGCGGTAATGCCATAACAGAAAATCATTCTCAGAATTAGAAAAATTTTAAAGAGACAAGGCTAGTCCTATAATCATTCCTAGAGGAAGCATCATCAGTCAGGAAGTAGAGTCCCCTATCGTGCCGGACAGTGCCAATCATCGTCCCCGAGCTCAGATCCTAAAATAGAACAGAGTCGGGTGAAAAAACTGCCTGATAATTTAGATCTTTTGTAATCTTACTGATGGATAACAAATTATATGAAATTTTAGGAACATGCAAAACATCTCGCAAAACCAAACCCTTAAATGGAGATAAATGTCCTTTTCCTGCAACAGATGCGAACGACCCATCTACAATCCTAATTTGTTCATTACTGGCACTTAGACAATATGAGAGGAACTGATCAAAGGTACCAGTAAGATGATCTGTCGCCCCTGAGTCAACAATCCAAGGTTTTTTCCCATTAATACTAATAAGACCAACAGACTGTATATTACCTGACTGGGCGATGGCACTGACTCCAATTGTACTTAGGCTGGTCATACTATCTGTTGAGGATTACTGAGGATTCTCATTTACCGAGTCACTCACAAGGGCACGTCCAGATGTTGATTTGTCATGTGACTGACGACGTTTACAGTTTGGGGGTCGTCCGTGCAGCTTCCAACACTGGTCCTTTGCATGCCACGACTTTTTACAATGCTCACAGACAGGAGTAGGTCTTTTATCGCCGTTAGTATTGGAGGTCTTAGTAGCAAAAGCTGCAGCATCAGTAGAGGTAATGATAGGGTTATTCATTGCAGTTGATCTATCTTCCTCCAGACGTATTTCTGAGCATACTTCTATGAGAGAGGGAGTAGGGCGTTGCTCCAGAATCCGACTCCGAACGGCATCAAACTTCGAATTTAGACCGGCCAGGAAGTCATAGACACGATCTGCTTCTTCAATTTTTGTGTACTGTGCTCCGTCATGAGCACATTTCCAGATTGTCTCCCTACAAAGATCCATCTCCTGCCACAAAAGAGTCATTTTATTAAAATAGGACGTTACGTCCATAGTGCCTTGCTTACATACATGTGCCTGTTTTCGTAACGTATAGAGCCGGGATGCATTCTGCCTTTGTGAATAAAGATGTTGAACAACATCCCAAATATCCTTGGCAGTTGCAGCATAGAGTAACGGTCTGCCAATCTGGGGTTCCATACTATTAATCAAAACGGATCTCAACAACGAGTCTTCTCCTTTCCAGATCTGCTCCTGAGGGTCTCCTTGAACGGGCTTAGGTATCTCTCCAGTCAAGTATCCAAACTTATGACGTCCTTCCAAGGCCATTCTGATGGATTGAGACTAGGAAAAGTAGTTTTGGGCATTCAATTTCTCTCCAGTAATAAACCCTGCAGTATTTCCCATAGATCAAGACAAGATATTTGTAGTGGGTAAAATAGGGAAAGAATTTATCGGGTTTTTAGGGTAAACTGGTATGTTGGCCGAAAAATCAGGTTGAGAGTTTGTCCCAAGAACCGTCCCGAAATCTGCAATCTGCTGTTGTAGATGGGCAAAACGCCGTTGGAGATCACTTGTAGACCCCCGGCAAGGCTAATCTGCAAATCCGACCGCTGGTGCCATCGACCCTCTTGGGCAGAAGACCGACGCATAAGGACTTACTGGCTTCTCCTGGAATGGTTGGACCACAATTGCTGTCCGGAAACACGTCGAATCTGACCGCTGATAAGAAGTTGCGATCGGCAAGATCTGCTCCGGCTGGCTCGTCGCACCGGCAGGGTCGACCGATGAAGAGGTGCGGCTGGGGTTGAGGTGAGGCTGGAAGCACTGCGATCTGGTCGGTGGGGGACCGAGCGGCTGAGGAAGGACCACTTCGAGCGGCTGCGATCTGTTGAACGGCTCGAGTGGTTGCGATCTGTCGAACGGCTAGAGAACACCGGGATCGAGTGGCTGGAATGGATCGAGCGACTGGGCTGTCTGCTGTTCGGCTGGGGAAGAACGGCTGGAACTTGGCCGATTAGGGTTTGGGTCTGAAACAAATTTAATTATTTTGGGCAGTGCGGCTTGGAGGTATTGTTCCACGGCTTCCCGAACGGCTTTGGTGATATCGGCTTCCCGACTAGTTCCAATGAAGGTTGCTGATGGAGAATTTGGTCTATTTTCATCCATGATGGCTAGGGTTTCGTCGCCATGCTCTGATACCATGTTGAAAAAAGAGGAAGAAAAGAATCCAAGATTTACGTGGAAAACCCTAGATCAGGGAGAAAAAACCACGATAGAGACTGATTTTTATTATTAATTCTGATACACAAAAATACAAGAGAATAGGCCAAATAAATAGACTAGGGGAAGCCCTAGGGTACAAACAATTACTAAACTACCCCTAGGGTTACAAGCTGATTTTCGACAATAATCAATCTTCTTCTCGTGATTTTTCATTTCTCACTTGTTACAAAAAATTTGAATGAAAAAATGAAGAGAAAGGAAGACAGGCGCTCACCTTCATGTATGATGAAGGCGCAACAAGGCGACAAAAGGCAACCAGGCGAGCGCATGGTTGAAGGTGCTCGCCCTATTTCTCTTCAAGGTGCAATGGCTCAAGGTCGCCTCACCTCTCGCCTTAGGCGCGGGTCGCCTCGAAAACACTGCCCGTGGGTCGCGTTCAAACTAAATTTTTGACAATAAGGCCTTGATTGCGACCATAATATTTGAGCTTTAGGTGATAGTGAAGGACCTCTCAACACTAGGAAACATTATCTCGGAATGAAGGCCCAAAAACCTAAAGTGCATTAAAGTATGCGAAGAGCAACTTCCACCAACATTTACTAGAGAAGGGGCATTCAAAAAACAGATGGTGCAAGTCTTCACCATTGGCTAAGCAAAAAGAACAAATAGATGGAGATCAATAATGAGCTTGAAACTTGTGTTGCATTGTGGATGAACAGTTGAGGGAGCCGAACATCATAATCCAAATAAGAATGTTAATTCTTTTTGGACTCTTGGATTTCCACGGGGCTTGGAAGATTTCTTTATCAAGTGGAGATGATGCATTCAAATATCTTGATAAAGACTTAACTGTGAACTTACTAGAGGATTCCAAGGACCAACGCCTTAAATTGGGGGATAAAGAGTCATTGGCCTTCTAAATTCCATAGCAGCTACTGGCATTCTGATATTTCCTCATCTTTTAGTTGTCTTCTATGCAATAAGTTCCATGAAGTCGTCACCGAATCCCAATGGTCATTGATGCTGCCATTTGGTGCAGAAGCAATTCGGAAAAGATGAGAGAAGGTCAGCTTAAAAGGCTGGTCTGAGCACCATGGGTCAAGCCAAAAGTAGATGTTTTTTCCGGTCCCTATCTTGAAGACTGCAAGAGATTCAAAACGACTCCAAATCTTGAAATATTAATCCATGGGCTGCGTAAACTCATATCAAGCTTCCTAGAAGTGTGCCACTGATGTGGATCAACTCCATGTATGCTCTTCACAATTTTACACCAAAATGCATAGGTTCTTTCAAAAACCTCCAGCTACATTTGAATAACAAGGCTAAGTTTCTATTTTTCAGACTGCGATTCCTATCCCTCCTTGCAGAGCTTTTGCGGCTACCTCCCATCTCACAAGGTGGTTCATCTTAGAACCTTTCTGTCTTTCCCAGAAGAAATTACTCATTATCCTTTCTAATTTTGCTGCTGCTGTATTTGGCATTTGAAAAATGGACATGTAGTGTGTGGGAAGATTTGCCAAAACTGCATTGCAAAGGGTTAACTGGCCTCCTCTGGACAGATTGTATCTCTTCCATCTGTCAAGTTTCAGGTGCACCTTGTCTAGTATTGGTTGCCAAAAAGAGTGTTTTTTTGGGTAACCACCTAGTGGCAGCCCCAAGTATGCGAAAGGCAAAGTTCCAGCCTTGCAATTAATTCGAGAAGCTGTTGATGGCAGAGTGTTTTCATCAATGTTTACCCCATACAATGCCCACTTCTCCCAGTTCACTTTCTGCCCGAACACCATTAAAAAAAGCTGATTGTTTTTGTAAGATTATCCAACATAGATTGGTCATGCTTGCAAAATAAGAGTGTACCATCCGCTAATTGTAATATCGAAAGATGATCCTTTTCTTTTCCTATCAAGAATCCTTCAAACAATCCATTCTCATGAATTTGTGAAATAAGGGTACTTAAGACTTTACTAATTATGAGAAATTAGAAGGTTGATAAAGGATCACCTTGGTGGGTTCCTCTCTACATGGTCAAAGGCCTTTTCCATGTCTAGTTTCAGAATCCAACCCTTCTGTTTCTCTGCATGATAATCCCCGACTGCTTCTTCGGCAATTAGCATTGGGTCTAAGGTCTTCCTCCTATACAGGCACTTTGGGTATCTGCAATTAAGGATGGCATGACTCTTTCTAATCTTTCAGCAAGTATTTTAGTGATTGTTTTGTAAGAAGATGCAGTGAGACTTATTGGTCTAAAGTCTTTGACCGACAAAGACTCCTCTTTCTTAGGAACGAGGCAAATGAAATTCTCATTGATACAAGCATTCAGCTTCCCAGTTAAATGGAATTCATTGAACATTTTCTGGAAAATGTTGTGTTATTTCACATAACCTTTCCTCTCACTTTATACACACGTGTGCACGTGCACACGCACACATGTAGTAGTATTTGTATTTGACTTGACTTGGGAGAAACAAGAGGATCCTGGATAGTGGTAGTATTTTTTTCTGTTCAACTGAGCCCCTTCCCAAGGGCTACTGTGTCCAGTTTTTAAGGGTTTCCTCGTTTTTATTTTTATTGATATTGCATATTAGGGGTGTTTACTTGGGATTAGCTCCAGATATTTGGCCCTTACGTTTTGTATGGTCATGCTTTTATTTCTTCCTTCTTCTTAATAAGGTGGTTCTTACAAAAAGATAAAGAAAAGATCAATTTACACCGTTGTGAGGTGGAGCAGTTATGTTCCCTAAAGAATTGTATGTATAACGTCGACATGGGTTGATCCTCTTGGCCTTTTACATTTTGATATTTCCCTTTTTCACCTTGATCAATTGTATGTTAAATTTCAATGTTGATAATATGGATAATGGATGCATTTCTCGTCATTCTTAAATCATATTTTCTCTTGTTTTACTCTCTACCTCCTGGCTTGCAGGTTCCTTTGATTCAATACTGTCAACAACTTGCATCGACTTTGTCGAGTCATGATTCTTGTTTTACACAAGCTGCAGATTCTTTATTCTTCATGCATGAGGGGCTACAGCAAGCTCGTGCACCTATTTATGATGTTCCATCTGCTACTGAAATTCTTCTTACGGGAACCTATGAGCGTCTGCCTAAATGTGTAGAAGATATCAGTATTCAGGGAACACTGACCGAAGACCAACAAAAGAATGCATTAAAAAAGTTGGAGATATTAGTACGGTCTAAGTTACTGGAAGTTTCGCTTCCGAAAGAAATTTCCGAAGTAAAGGTCACTGATGGTACAGCACTGCTTCGTGTAGATGGGGAATTTAAGGTTTTAGTTACTCTTGGCTACAGAGGACACTTATCCTTGTGGAGAATACTGCATTTGGAGCTGCTAGTTGGAGAGAGAAGGGGACCTGTGAAACTGGAAGAAGTGCACCGTCATGCTCTTGGAGATGATTTGGAACGCAGAATGGCTGCAGCTGAACATCCATTCACTACATTATATTCAATTTTGCATGAACTTTGCATCTCGCTTGTTATGGACACTGTCTTAAAGCAAGTACATTCTCTTAGACAAGGAAGATGGAGAGATGCTATTCGGTTTGATGTCATATCTGATGGTATTACTGGTGGTTCCACACAATTGAACCATGATGGAGAATCGGACTTATCTGGTCTTCGAACCCCAGGGTTGAAAATCATGTATTGGTTGGATTTTGATAAAAACACTGGCAGTTCTGATCCAGGATCATGTCCTTTCATAAAAATTGAACCTGGACCAGATATGCAGATAAAGTGTATCCACAGTACATTCGTCATAGATCCGTTAACCAACAAGGAAGCAGAATTTTCTCTTGATCAAAGTTGCATTGATGTTGAAAAGTTGCTGTTGAGAGCTATATGTTGTAACAAATATACGCGGCTTCTTGAAATTCAAAAAGAATTGAAGAAAAATATTCAAATTTGTCGAACGGCAGATGATGTTGTTCTTCAGCACCAAGTTGACGAGCCTGATGTTGACCATAAAAAGGTTGTGTGTTTATGATGATTCATTTTCTTCATACTCCCATAAAATTTACTGTGGCTAGGGCAGGAAATATTGAAAAATAAAGTTGATGATCTGTCTAGTCTAGTTACTACTCTCTTCCCCCACTGATTTATACATGAAGGGTCATTTGTTTTGAATACTACTCCCATCCCACATCCCAACCATCTGTATGGGATGGTATTTACTATTTTGTATCTTTGTGGCTTTTGGGCTCAGGACTAGTGATCATTCGATGAAAAATTTAATTTGTTAAAGTGAAAATTTACAGGTTTGAGGAAGAAATGGTGAAGGGTTTGCATTCGGTTGTGTATAGGAGTTTATGTTTCTGGTTCCCGTTTTTTTACCATTGTCAGAATTTTTGTATGGTTTTCTTTTTCCTTCCCATCTTTCATTGAAGCATAGTCTCCAGAGAAACATGATTTCTTGATTGTTGGCAAAATAAATTGCTTCATGAGACTTTCATTTTGTGGTATCATTTTGTCCCCTATAATTTCCGGATCCTTTATTTTATTCTGTTATTTTGCAAGTGAATGGTTTGGGTATCTGCAAATCATGGTTATTTGCAATTAGGGGAGTGGGTATGCTTTCCATTCTTTATTCTCTTTCCCGTCGAGGTAGGACGTTGTTATTCATTTATTTTTGCCACATTAGTAACATACAATGACACTAACTAGAGTTCCATTTTCTTTACTTTTGTTATAACGTCTACTATTCTTTGCAGAAGAATAAAGTTCATGACCCCACTGCATATGAGGGAGAAGAAATATTGCGGGTGCGTGCTTATGGCTCATCATTTTTCACCCTTGGAATAAATACAAGGTACCGTTCTTGAGATGATATTAGGAAAAAGAATAGTTACTACACTATTTGGAAAAAGAATGGTTACTACACTATATTTGCTGTCGGATGGGTTTTCAATTATTTATGACTAAAACTTCACGTCAAGCCAACTTCAAGTAAATTGAGTATGGTAAAGCAAACTTAACTAATGATCCATATGATCATTTGATAAATATTAACTGTGAAATTTAAAATTAAATCTATACAATTTATCCTTCTTTAAAATAAAAAACAAGAAGGTACAAAATGCTAGAATTTGAACTTTTCCATCAATTAGAGTTATTTCCCATGTATCCGTGCTTATATTTTTGACAAATTCTAAAAATTCTCTTTATAATTATTTTTATGAATTGCATTTCCTTAAATTTTATTGTTTGTTTTTAATGGAATTAATGCTACAGTGCTTTAAAAAAAAAAAAAAAAAAAAGAATTCATGCTACATTCAACATATCTCAGTGCTAAATAAAAGTTGCATGGATTGAGGATAGATGTAGATTTACAATGGAACGGGTAGAACTGATGGTAAATGGAAATTTGTTTGACAATTGGGGTTATTCAATGTGAAGATGTCAATTGTATTTTCATATCATTTTTAATCTATACAATTTAAGGATTATACAACTATGATTTCATATTTTATTTAACATTCTTTCTATGTTGATGGTTATGACAATCAATATAGTTAAAGTAGTGAAAGAGATTTTTTTTTTTTTTTTTTTTTTGGTTTCTTTATAGAACTTCTTTTCTATGAATACGGTTATGATTTTATAAATTTGTTTCATTTTATTAATTGGTTGTTGACATTGTTTGAGAATTTGATTAACATTAAATGAATAGTACCATATTATTTATGGGAATGAAACCTAAGTTCGAGTGCTAACTAATTTTCGAGAGAAAAATTATAAATGTCAATTAAATGTTGCAAAGTGGGAAATGCAAGTGATCATCATAAAAACATAATAGTCCAAAAAATGTTTCAGTTGATTTATTTTATCATAGATATATTTGAGCTTCTTTGTCTCATACCTATGATTTTGTTTGTAGGAATGGTCGTTTTCTTCTCCAGTCTTCACACAATAAACTTGCAACCGCATCACTGACAGAGTATGAAGAAGCTTTAAATCAAGGAAGTATGAATGCAGCTGATGTTTTTATAAGATTGAGAAGCCGAAGTATTCTGCATTTGTTTGCATCTATTAGTAGGTTTTTGGGCCTTGAGGTATGGTTGCGTATCTTATATTTTTGTAAAGAAGGAAATGACGTGATACATTGTGATTGCTATTCTTTGTTTATTCTAATTGTATATCAAACATGGGTTTTATATTAAATAAAAATCTAATACTAAATGTTTGCAGGTATATGAAAATGGGTTTTCTGCGGTTCGATTGCCAAAGAACATTTCAAATGGTTCAACCATGTTGCTGATGGGATTTCCAGATTGTGGGAATTCATACTTCTTGTTAATGCAGCTTGACAAGGATTTCAAACCCCAGTTTAAATTGCTGGAGACAAAACCAGATTCTTCTGGTAAAGCCCATCGTCTCAGTGATCTAAACAATGTGATACGCATGAAGAAAATTGACATTGATCAGGCTCAGATACTTGAAGATGAGCTGAACTTAAGTCTGCTTGACTGGGAAAGGCTATTTCCCTCTCTGCCAAGTTCTGTCAGTAATCAAACTTCCGAAAATGGTCTTCTTCCTGATGTTAGCATTGATGGTGCTCTGCAGATTGCTGGATATATTCCGTCCAGTTTCTCATCTGTTGTTGATGAAGTGTTTGAGTTGGAGAAGGGGCCTCCCCCTGTACCTGCTTTTTCTGTTTCAAATCTGTCTCAATCTTTCAATTCATCTGCACCTCATTATAGTTCTCTCTCTAATATTCATAATGTGAAGGGAGTTCCTTCTCCCAAGTGGGAAGTGAGTATGCAGCCATCCCAGGGTAATAATGTTGCAAAACTTTCAAATATCCCTTCCCACAGCAATGGTTCCTTGTATTCAACTAGCAATTCAAAGGGTCCAGTGCATTCCACATCCCTGGGTTCTATTTCTTCTGGTCCTGTTAGGGGTGCTACTACAAGACGACTTTCAAATTCAAAATCTGAACAGGATTTAACTTCCCTTAGATACCCAAATCCTGCTGAGGTTGGTTCTTATACTGCATTGGATGATGATCATATAAGTATGCCGAATGATACGTCAAAGGATGGGGTGTATGCAAATAGGTCTTCCCGGCTATTGTCTCCATCTCAACATGGTGGCTCTCGAATTTCTGCAAGTATAAAACCTAATGGATCCAGAAGTTCACCAACTGCAGCTCCAACAGGGTCTTTAAGGCCTTCTGGATCTTGCTCGTCTGTTTCAACTCCCGTATGTAAGATACTTACTCTAAGTTTCTATGTTATTGGTTGACTGCGTGTAATTTAACAATTGAAGCACTGAACTTATAAAAAAAAGGGATAGATGTTTTACTTTCACTAACTTGAGAGATGTAATTTTTCAGCCCAGAATCAAGATTCTTGCTCTAGTCCCTTCTATGAAAGTGGTTTAAAAAATGATAGTTCTCGGAAGCGTACTGCTTCAGATATGCTGAACTTAATTCCTTCACTTAAAGGTATTGATGCATATAATGGAGTTTCTAAGAGAAGGAAGGTTTCAGAATCACCTAGATTTAGCAAACACTCATCACAGTTGCTTATTTCAAAAGAAATGGTTTCCAAAACTGAATACAGTTATGGTAACCTTATTGCTGAAGCTAACAAAGGCAGTGCACCTTCGAGTACATATGTCTCTGCTCTGCTTCATGTAATCAGGCACTGTTCACTATGTATCAAACATGCCAGGCTTACTAGCCAGATGGATGCACTGGATATTCCATATGTTGAAGAAGTTGGTTTAAGAAATGCATCAACAAATATATGGTTTCGACTTCCATTTGCCAGAGATGATTCATGGCAACACATATGCTTGAGACTTGGAAGGCCTGGAACCATGTGTTGGGATGTCAAGATACATGATCAGCACTTCAGAGATTTGTGGGAGCTTCAGAAGAAAAGCTGTACGGCTCCATGGGGTCCTGATGTTCGAATAGCAAATACATCCGACAAAGACTCTCACATTCGTTACGATCCCGAAGGTGTTGTTCTCAGTTATCAATCAGTAGAGGCAGATAGCATAGACAAGCTAGTGGCAGATATACGAAGGCTTTCCAATGCAAGAATGTTTGCCATTGGGATGCGTAAACTGCTTGGGGTTGGAACAGATGAGAGGCTAGAAGAAAGTAGTATGACCTCAGATGTAAAGACACTAGTTATGAAAGGTGCACCCGACACTGTGGATAAGTTATCTGAACAGATGAGGAGGGCATTTAGAATTGAGGCAGTTGGGTTGATGAGCTTGTGGTTTAGTTTTGGTTCTGGCGTGCTGGCACGATTTGTCGTAGAGTGGGAATCAGGTAAAGAGGGTTGCACTATGCACGTTTCACCTGATCAACTTTGGCCTCATACAAAGGTTTGTTTAATGCCCCTTATTGTTATTTATTTTATCATAGTGTAATTTTTTTTCATTCTCCGTTTAGGATGTCGTTCATTGTTTACAAAACAAATGAAAAGCATTCCTCAAACATTTGGCCATCTAAGAAGATATTAATAAAGTTTCCTACTAATCTAGTTTTAAATTTCCTTGATTGTTTCTGTTTTAATTAGTATTTTGTTTCTGGCTGTGGATAACGAGAGTATTATAATTTTTTTCCTTTTTATTTATTTTCTTCTTCTTCGTCTATTTATTTATTATTATTATTTTTGCCTAAAGATGGGAAACAATGTATTGCTTCCCAGGCAAAATAAAAAATCGAAAAATCAAAATTTAAAAAAAAAAAATCACAAAATTTAAAGAATTTTCATTTTCAAAATGCTCCAACAGATTTAAATCCATCATGAATTTGAATACTGATGATATGTGCCTTGGCCATGTTTTCTCAAATCACTAGGACATGAGTTCAAAGTTTAGGCGAAGGACTTGCGGGTAATGAATGGGAAGATCGAAAAGTTGGAAGAAGAAAAGATTTTTTGGTTAGACGTATGGAATTAGCTAAGCATTTTATTCGAATAAATATAGAACCCGAATGGATGGTTTTATGTCTATTACCTGTTCTTCCTCCCTAGCTGAGACCGATCATTCAAATAGATGGAGTTTACTTCCCATCTTTAGGCAAAATAAAAAGTCAAAAAATCAAAAATTAAAAAATAATAAAATCACAAATTAATTTTCATTTTCATAATGCTCCAAAAGATTTAAATCCATCATACATTCGAATACTGATGATATGTGCCGTGGCCATGTTTTCTCAAGTCATTAGGACATGATTTCAAAGTTTTGCTTAGATATATTCTGTTTTCATAAATATGTTTATGTGCTTATATTTTCTGCTGTGTACAGTTTTTAGAAGATTTTATAAATGGAGCTGAAGTTGCATCACTTTTGGATTGCATTCGTCTCACTGCTGGACCACTACATGCTCTTGCAGCAGCAACCCGACCTGCTCGAGCTGGTCCTGTTTCGACACTTCCTGGCATAGCTGCAGCTCTTTCTTCCCTTCCAAAACATGGAGGATACACACCTACCCAGGGTGTTTTACCTAGCAGTTCAGCCACGAACACTGGCCAAATTACCAATGGCCCAGTTGGTAACGCTGTTTCTGCAAATGTTTCTGGCCCTCTTGCAAATCATAGCCTTCATGGGGCTGCAATGTTAGCTGCTGCTGCTGGCCGTGGTGGGCCTGGCATTGCTCCTAGTTCCTTGTTGCCAATAGATGTTTCTGTTGTGTTGCGTGGTCCGTATTGGATAAGAATAATATATCGAAAACAATTTGCAGTTGATATGCGCTGCTTTGCAGGAGATCAGGTATGGTTGCAACCAGCAACGCCTGCCAAGGTCAACCCTTCAGTTGGAGGGTCATTACCATGCCCACAGTTCCGGCCATTTATTATGGAGCATGTTGCCCAAGAATTAAATGGTGTAGAGCCAAACTTCCCAGGTGTTCAACAAACTGTTGGATTGTCAGCTCCAAACAATCAGAATCCAAATTCAAGTTCACAGATGACTGCTGCAAATGGAAATAGACTTAGTCTTCCTGGTTCTCCTGCATTGACTAGGGCAGGAAATCAGGTGGCTAATATAAATCGTGTGGGAAGTGCTCTGTCTGGATCTTCAAATTTGGCTTCTGTGGGCTCAGGATTGCCATTACGGAGATCACCAGGAACAGGTGTCCCTGCACACGTGAGAGGTGAACTGAATACAGCTATTATTGGACTTGGGGATGATGGGGGGTATGGAGGAGGTTGGGTTCCTCTTCTTGCTCTTAAGAAAGTTTTGAGAGGTATTCTCAAATACCTTGGAGTTCTTTGGCTGTTTGCTCAGCTTCCAGATCTTCTGAAAGAGATCCTAGGTTCAATTTTGAGGGACAACGAAGGTGCATTGCTGAATTTGGATCCTGAGCAGCCTGCCTTACGTTTCTTTGTGGGGTAAGTGTGTTACAGTTTTTAATCTTATTTGTTTGGTTTTGCTATTGTTGTGAGGAAAGTTACCAATCTTATTCACTTGCATCATCTGTTTATATTTCTTATTTGACTTCATTATTAGTGGTATATATTTGAAACAAGCTGTGAATTATGACATGGAAATCTCATTTTCCCCCTTTTTGCCATTGACATCACAAAACTAGATGTTGTGTTGTGTGTGTGTTTTTTTTTTTTTTGGTACATGTGATTTTCTACGTGTTGGATTTAGCCATCCACAACAAAACGTGTAGGGGAAAATTCCTTTTGGGACACAGAGTCTTTTGTTGAAACCCACACTTGTGGGTGCCAACTTGATGATGGAAAGGAGACGATGGAAGTTGGCTTTTATTAGAGAAATTCGACCCTCCACTCTTCAAATCTGTCCATGTGGAAGGTCTCTTGTAATTCAGTTTTCCCCCACTTTTGTAAATTTCATATATCAATGAAATCGTTTTTCTATTAATACAAAAACAACCACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAACAAACAAACAAAATCCTTCCATGTAGAAGAAGCAAGATTTCATTGTCAACAACTATTTTCTCCCACCAATCATTTTTCACATTTTCTTCTATATTTCTTCAAAGGTTTTTTAAACATAATTTCTTTTCAAACAAATTCCCCCAAAACGGTGGCCTTAATCTTATTTATCCATAGCAATTTTGCTTTTCTTTTGAACTGTCCTCCAAAAAGCTGATTCGATTCTTCTTTACTTTTGCACAAAAACCCGAGAAATTCTGAATTTCTCCCAGACTTTCTGCTAAAGATGCAATGATGAAGACATGTTCATTACTTTCAACATGTGTTGTGCCACAGAACTTTGTTTTTATGCCACTTTCTGAAGGAATTTGCAACTTGTGTTGTATGTTACAACGTGAATACGAGGGAGTTGAACCATTGAACATTGGGGCATGACTTAATTTTGAATTTCCAATAATTTCAAAGCCGGTCTGTTTAGTCTACATTTTGAGTCATTCATTTCATCCCATTACATATTTTATGTAGGGGATATGTATTTGCTGTAAGCGTTCATAGAGTCCAACTGCTTCTCCAAGTGCTTAGTGTGAAGCGTTTCCATCATCAACAGCAACAACAGCAGCAGCAAAACTCCACTACAGCACAAGAGGAATTGACACAGTCAGAAATTGGTGAAATATGTGATTATTTTAGCCGTCGTGTTGCATCAGAGCCGTATGATGCTTCTCGTGTTGCATCCTTCATTACTCTCCTCACCTTGCCAATATCAGTTTTAAGGGAATTTTTGAAATTGATAGCATGGAAAAAGGGAGTGGCTCAGGCTCAGGGTGGAGATATTGCTCCGGCCCAAAAACCCCGCATTGAATTGTGTCTTGAGAATCATTCTGGGTTGAGTATAGATGAAAACTCTGAAAGATCCACATCCAAAAGCAATATCCATTATGATAGGCAACACAACTCTGTTGATTTCGCCCTGACAGTTGTACTCGATCCTGCTCATATACCTCACATGAATGCAGCAGGTGGCGCTGCCTGGTTGCCCTATTGTGTCTCAGTTAAGTTGAGATATTCCTTTGGTGAAAGCCCTGTTGTCTCTTTTCTTGGTATGGAAGGAAGCCATGGGGGCCGAGCATGCTGGTTACGCATTGATGACTGGGAAAAATGTAAACAGAGGGTGGCTCGAACTGTTGAGGTGAGTGGAAGTTCAACCGGAGATGTTAGCCAAGGAAGGTTGAGAATTGTAGCAGATAGTGTCCAGAGGACATTACATATGTGCCTTCAAGGATTGAGAGAGGGCAGTGAAATAACTGCAATTACCGGTTCAACGTCGTGA
mRNA sequence
ATGGCGGCCGAGCTAGGCCAACAAACGGTCGAGTTCTCTGCCCTGGTTTCCCGTGCTGCTGAGGACTCTTTCCTCTCCCTCAAAGAACTTGTTGACAAGTCCAAATCCTCCGACCAATCCGATTCCGATAAGAAGGTTAATATCCTCAAGTATGTCTTCAAGACCCAGCAGAGGATACTCCGCCTTTATGCCCTTGCCAAGTGGTGTCAACAGGTTCCTTTGATTCAATACTGTCAACAACTTGCATCGACTTTGTCGAGTCATGATTCTTGTTTTACACAAGCTGCAGATTCTTTATTCTTCATGCATGAGGGGCTACAGCAAGCTCGTGCACCTATTTATGATGTTCCATCTGCTACTGAAATTCTTCTTACGGGAACCTATGAGCGTCTGCCTAAATGTGTAGAAGATATCAGTATTCAGGGAACACTGACCGAAGACCAACAAAAGAATGCATTAAAAAAGTTGGAGATATTAGTACGGTCTAAGTTACTGGAAGTTTCGCTTCCGAAAGAAATTTCCGAAGTAAAGGTCACTGATGGTACAGCACTGCTTCGTGTAGATGGGGAATTTAAGGTTTTAGTTACTCTTGGCTACAGAGGACACTTATCCTTGTGGAGAATACTGCATTTGGAGCTGCTAGTTGGAGAGAGAAGGGGACCTGTGAAACTGGAAGAAGTGCACCGTCATGCTCTTGGAGATGATTTGGAACGCAGAATGGCTGCAGCTGAACATCCATTCACTACATTATATTCAATTTTGCATGAACTTTGCATCTCGCTTGTTATGGACACTGTCTTAAAGCAAGTACATTCTCTTAGACAAGGAAGATGGAGAGATGCTATTCGGTTTGATGTCATATCTGATGGTATTACTGGTGGTTCCACACAATTGAACCATGATGGAGAATCGGACTTATCTGGTCTTCGAACCCCAGGGTTGAAAATCATGTATTGGTTGGATTTTGATAAAAACACTGGCAGTTCTGATCCAGGATCATGTCCTTTCATAAAAATTGAACCTGGACCAGATATGCAGATAAAGTGTATCCACAGTACATTCGTCATAGATCCGTTAACCAACAAGGAAGCAGAATTTTCTCTTGATCAAAGTTGCATTGATGTTGAAAAGTTGCTGTTGAGAGCTATATGTTGTAACAAATATACGCGGCTTCTTGAAATTCAAAAAGAATTGAAGAAAAATATTCAAATTTGTCGAACGGCAGATGATGTTGTTCTTCAGCACCAAGTTGACGAGCCTGATGTTGACCATAAAAAGAAGAATAAAGTTCATGACCCCACTGCATATGAGGGAGAAGAAATATTGCGGGTGCGTGCTTATGGCTCATCATTTTTCACCCTTGGAATAAATACAAGGAATGGTCGTTTTCTTCTCCAGTCTTCACACAATAAACTTGCAACCGCATCACTGACAGAGTATGAAGAAGCTTTAAATCAAGGAAGTATGAATGCAGCTGATGTTTTTATAAGATTGAGAAGCCGAAGTATTCTGCATTTGTTTGCATCTATTAGTAGGTTTTTGGGCCTTGAGGTATATGAAAATGGGTTTTCTGCGGTTCGATTGCCAAAGAACATTTCAAATGGTTCAACCATGTTGCTGATGGGATTTCCAGATTGTGGGAATTCATACTTCTTGTTAATGCAGCTTGACAAGGATTTCAAACCCCAGTTTAAATTGCTGGAGACAAAACCAGATTCTTCTGGTAAAGCCCATCGTCTCAGTGATCTAAACAATGTGATACGCATGAAGAAAATTGACATTGATCAGGCTCAGATACTTGAAGATGAGCTGAACTTAAGTCTGCTTGACTGGGAAAGGCTATTTCCCTCTCTGCCAAGTTCTGTCAGTAATCAAACTTCCGAAAATGGTCTTCTTCCTGATGTTAGCATTGATGGTGCTCTGCAGATTGCTGGATATATTCCGTCCAGTTTCTCATCTGTTGTTGATGAAGTGTTTGAGTTGGAGAAGGGGCCTCCCCCTGTACCTGCTTTTTCTGTTTCAAATCTGTCTCAATCTTTCAATTCATCTGCACCTCATTATAGTTCTCTCTCTAATATTCATAATGTGAAGGGAGTTCCTTCTCCCAAGTGGGAAGTGAGTATGCAGCCATCCCAGGGTAATAATGTTGCAAAACTTTCAAATATCCCTTCCCACAGCAATGGTTCCTTGTATTCAACTAGCAATTCAAAGGGTCCAGTGCATTCCACATCCCTGGGTTCTATTTCTTCTGGTCCTGTTAGGGGTGCTACTACAAGACGACTTTCAAATTCAAAATCTGAACAGGATTTAACTTCCCTTAGATACCCAAATCCTGCTGAGGTTGGTTCTTATACTGCATTGGATGATGATCATATAAGTATGCCGAATGATACGTCAAAGGATGGGGTGTATGCAAATAGGTCTTCCCGGCTATTGTCTCCATCTCAACATGGTGGCTCTCGAATTTCTGCAAGTATAAAACCTAATGGATCCAGAAGTTCACCAACTGCAGCTCCAACAGGGTCTTTAAGGCCTTCTGGATCTTGCTCGTCTGTTTCAACTCCCGTATCCCAGAATCAAGATTCTTGCTCTAGTCCCTTCTATGAAAGTGGTTTAAAAAATGATAGTTCTCGGAAGCGTACTGCTTCAGATATGCTGAACTTAATTCCTTCACTTAAAGGTATTGATGCATATAATGGAGTTTCTAAGAGAAGGAAGGTTTCAGAATCACCTAGATTTAGCAAACACTCATCACAGTTGCTTATTTCAAAAGAAATGGTTTCCAAAACTGAATACAGTTATGGTAACCTTATTGCTGAAGCTAACAAAGGCAGTGCACCTTCGAGTACATATGTCTCTGCTCTGCTTCATGTAATCAGGCACTGTTCACTATGTATCAAACATGCCAGGCTTACTAGCCAGATGGATGCACTGGATATTCCATATGTTGAAGAAGTTGGTTTAAGAAATGCATCAACAAATATATGGTTTCGACTTCCATTTGCCAGAGATGATTCATGGCAACACATATGCTTGAGACTTGGAAGGCCTGGAACCATGTGTTGGGATGTCAAGATACATGATCAGCACTTCAGAGATTTGTGGGAGCTTCAGAAGAAAAGCTGTACGGCTCCATGGGGTCCTGATGTTCGAATAGCAAATACATCCGACAAAGACTCTCACATTCGTTACGATCCCGAAGGTGTTGTTCTCAGTTATCAATCAGTAGAGGCAGATAGCATAGACAAGCTAGTGGCAGATATACGAAGGCTTTCCAATGCAAGAATGTTTGCCATTGGGATGCGTAAACTGCTTGGGGTTGGAACAGATGAGAGGCTAGAAGAAAGTAGTATGACCTCAGATGTAAAGACACTAGTTATGAAAGGTGCACCCGACACTGTGGATAAGTTATCTGAACAGATGAGGAGGGCATTTAGAATTGAGGCAGTTGGGTTGATGAGCTTGTGGTTTAGTTTTGGTTCTGGCGTGCTGGCACGATTTGTCGTAGAGTGGGAATCAGGTAAAGAGGGTTGCACTATGCACGTTTCACCTGATCAACTTTGGCCTCATACAAAGTTTTTAGAAGATTTTATAAATGGAGCTGAAGTTGCATCACTTTTGGATTGCATTCGTCTCACTGCTGGACCACTACATGCTCTTGCAGCAGCAACCCGACCTGCTCGAGCTGGTCCTGTTTCGACACTTCCTGGCATAGCTGCAGCTCTTTCTTCCCTTCCAAAACATGGAGGATACACACCTACCCAGGGTGTTTTACCTAGCAGTTCAGCCACGAACACTGGCCAAATTACCAATGGCCCAGTTGGTAACGCTGTTTCTGCAAATGTTTCTGGCCCTCTTGCAAATCATAGCCTTCATGGGGCTGCAATGTTAGCTGCTGCTGCTGGCCGTGGTGGGCCTGGCATTGCTCCTAGTTCCTTGTTGCCAATAGATGTTTCTGTTGTGTTGCGTGGTCCGTATTGGATAAGAATAATATATCGAAAACAATTTGCAGTTGATATGCGCTGCTTTGCAGGAGATCAGGTATGGTTGCAACCAGCAACGCCTGCCAAGGTCAACCCTTCAGTTGGAGGGTCATTACCATGCCCACAGTTCCGGCCATTTATTATGGAGCATGTTGCCCAAGAATTAAATGGTGTAGAGCCAAACTTCCCAGGTGTTCAACAAACTGTTGGATTGTCAGCTCCAAACAATCAGAATCCAAATTCAAGTTCACAGATGACTGCTGCAAATGGAAATAGACTTAGTCTTCCTGGTTCTCCTGCATTGACTAGGGCAGGAAATCAGGTGGCTAATATAAATCGTGTGGGAAGTGCTCTGTCTGGATCTTCAAATTTGGCTTCTGTGGGCTCAGGATTGCCATTACGGAGATCACCAGGAACAGGTGTCCCTGCACACGTGAGAGGTGAACTGAATACAGCTATTATTGGACTTGGGGATGATGGGGGGTATGGAGGAGGTTGGGTTCCTCTTCTTGCTCTTAAGAAAGTTTTGAGAGGTATTCTCAAATACCTTGGAGTTCTTTGGCTGTTTGCTCAGCTTCCAGATCTTCTGAAAGAGATCCTAGGTTCAATTTTGAGGGACAACGAAGGTGCATTGCTGAATTTGGATCCTGAGCAGCCTGCCTTACGTTTCTTTGTGGGGGGATATGTATTTGCTGTAAGCGTTCATAGAGTCCAACTGCTTCTCCAAGTGCTTAGTGTGAAGCGTTTCCATCATCAACAGCAACAACAGCAGCAGCAAAACTCCACTACAGCACAAGAGGAATTGACACAGTCAGAAATTGGTGAAATATGTGATTATTTTAGCCGTCGTGTTGCATCAGAGCCGTATGATGCTTCTCGTGTTGCATCCTTCATTACTCTCCTCACCTTGCCAATATCAGTTTTAAGGGAATTTTTGAAATTGATAGCATGGAAAAAGGGAGTGGCTCAGGCTCAGGGTGGAGATATTGCTCCGGCCCAAAAACCCCGCATTGAATTGTGTCTTGAGAATCATTCTGGGTTGAGTATAGATGAAAACTCTGAAAGATCCACATCCAAAAGCAATATCCATTATGATAGGCAACACAACTCTGTTGATTTCGCCCTGACAGTTGTACTCGATCCTGCTCATATACCTCACATGAATGCAGCAGGTGGCGCTGCCTGGTTGCCCTATTGTGTCTCAGTTAAGTTGAGATATTCCTTTGGTGAAAGCCCTGTTGTCTCTTTTCTTGGTATGGAAGGAAGCCATGGGGGCCGAGCATGCTGGTTACGCATTGATGACTGGGAAAAATGTAAACAGAGGGTGGCTCGAACTGTTGAGGTGAGTGGAAGTTCAACCGGAGATGTTAGCCAAGGAAGGTTGAGAATTGTAGCAGATAGTGTCCAGAGGACATTACATATGTGCCTTCAAGGATTGAGAGAGGGCAGTGAAATAACTGCAATTACCGGTTCAACGTCGTGA
Coding sequence (CDS)
ATGGCGGCCGAGCTAGGCCAACAAACGGTCGAGTTCTCTGCCCTGGTTTCCCGTGCTGCTGAGGACTCTTTCCTCTCCCTCAAAGAACTTGTTGACAAGTCCAAATCCTCCGACCAATCCGATTCCGATAAGAAGGTTAATATCCTCAAGTATGTCTTCAAGACCCAGCAGAGGATACTCCGCCTTTATGCCCTTGCCAAGTGGTGTCAACAGGTTCCTTTGATTCAATACTGTCAACAACTTGCATCGACTTTGTCGAGTCATGATTCTTGTTTTACACAAGCTGCAGATTCTTTATTCTTCATGCATGAGGGGCTACAGCAAGCTCGTGCACCTATTTATGATGTTCCATCTGCTACTGAAATTCTTCTTACGGGAACCTATGAGCGTCTGCCTAAATGTGTAGAAGATATCAGTATTCAGGGAACACTGACCGAAGACCAACAAAAGAATGCATTAAAAAAGTTGGAGATATTAGTACGGTCTAAGTTACTGGAAGTTTCGCTTCCGAAAGAAATTTCCGAAGTAAAGGTCACTGATGGTACAGCACTGCTTCGTGTAGATGGGGAATTTAAGGTTTTAGTTACTCTTGGCTACAGAGGACACTTATCCTTGTGGAGAATACTGCATTTGGAGCTGCTAGTTGGAGAGAGAAGGGGACCTGTGAAACTGGAAGAAGTGCACCGTCATGCTCTTGGAGATGATTTGGAACGCAGAATGGCTGCAGCTGAACATCCATTCACTACATTATATTCAATTTTGCATGAACTTTGCATCTCGCTTGTTATGGACACTGTCTTAAAGCAAGTACATTCTCTTAGACAAGGAAGATGGAGAGATGCTATTCGGTTTGATGTCATATCTGATGGTATTACTGGTGGTTCCACACAATTGAACCATGATGGAGAATCGGACTTATCTGGTCTTCGAACCCCAGGGTTGAAAATCATGTATTGGTTGGATTTTGATAAAAACACTGGCAGTTCTGATCCAGGATCATGTCCTTTCATAAAAATTGAACCTGGACCAGATATGCAGATAAAGTGTATCCACAGTACATTCGTCATAGATCCGTTAACCAACAAGGAAGCAGAATTTTCTCTTGATCAAAGTTGCATTGATGTTGAAAAGTTGCTGTTGAGAGCTATATGTTGTAACAAATATACGCGGCTTCTTGAAATTCAAAAAGAATTGAAGAAAAATATTCAAATTTGTCGAACGGCAGATGATGTTGTTCTTCAGCACCAAGTTGACGAGCCTGATGTTGACCATAAAAAGAAGAATAAAGTTCATGACCCCACTGCATATGAGGGAGAAGAAATATTGCGGGTGCGTGCTTATGGCTCATCATTTTTCACCCTTGGAATAAATACAAGGAATGGTCGTTTTCTTCTCCAGTCTTCACACAATAAACTTGCAACCGCATCACTGACAGAGTATGAAGAAGCTTTAAATCAAGGAAGTATGAATGCAGCTGATGTTTTTATAAGATTGAGAAGCCGAAGTATTCTGCATTTGTTTGCATCTATTAGTAGGTTTTTGGGCCTTGAGGTATATGAAAATGGGTTTTCTGCGGTTCGATTGCCAAAGAACATTTCAAATGGTTCAACCATGTTGCTGATGGGATTTCCAGATTGTGGGAATTCATACTTCTTGTTAATGCAGCTTGACAAGGATTTCAAACCCCAGTTTAAATTGCTGGAGACAAAACCAGATTCTTCTGGTAAAGCCCATCGTCTCAGTGATCTAAACAATGTGATACGCATGAAGAAAATTGACATTGATCAGGCTCAGATACTTGAAGATGAGCTGAACTTAAGTCTGCTTGACTGGGAAAGGCTATTTCCCTCTCTGCCAAGTTCTGTCAGTAATCAAACTTCCGAAAATGGTCTTCTTCCTGATGTTAGCATTGATGGTGCTCTGCAGATTGCTGGATATATTCCGTCCAGTTTCTCATCTGTTGTTGATGAAGTGTTTGAGTTGGAGAAGGGGCCTCCCCCTGTACCTGCTTTTTCTGTTTCAAATCTGTCTCAATCTTTCAATTCATCTGCACCTCATTATAGTTCTCTCTCTAATATTCATAATGTGAAGGGAGTTCCTTCTCCCAAGTGGGAAGTGAGTATGCAGCCATCCCAGGGTAATAATGTTGCAAAACTTTCAAATATCCCTTCCCACAGCAATGGTTCCTTGTATTCAACTAGCAATTCAAAGGGTCCAGTGCATTCCACATCCCTGGGTTCTATTTCTTCTGGTCCTGTTAGGGGTGCTACTACAAGACGACTTTCAAATTCAAAATCTGAACAGGATTTAACTTCCCTTAGATACCCAAATCCTGCTGAGGTTGGTTCTTATACTGCATTGGATGATGATCATATAAGTATGCCGAATGATACGTCAAAGGATGGGGTGTATGCAAATAGGTCTTCCCGGCTATTGTCTCCATCTCAACATGGTGGCTCTCGAATTTCTGCAAGTATAAAACCTAATGGATCCAGAAGTTCACCAACTGCAGCTCCAACAGGGTCTTTAAGGCCTTCTGGATCTTGCTCGTCTGTTTCAACTCCCGTATCCCAGAATCAAGATTCTTGCTCTAGTCCCTTCTATGAAAGTGGTTTAAAAAATGATAGTTCTCGGAAGCGTACTGCTTCAGATATGCTGAACTTAATTCCTTCACTTAAAGGTATTGATGCATATAATGGAGTTTCTAAGAGAAGGAAGGTTTCAGAATCACCTAGATTTAGCAAACACTCATCACAGTTGCTTATTTCAAAAGAAATGGTTTCCAAAACTGAATACAGTTATGGTAACCTTATTGCTGAAGCTAACAAAGGCAGTGCACCTTCGAGTACATATGTCTCTGCTCTGCTTCATGTAATCAGGCACTGTTCACTATGTATCAAACATGCCAGGCTTACTAGCCAGATGGATGCACTGGATATTCCATATGTTGAAGAAGTTGGTTTAAGAAATGCATCAACAAATATATGGTTTCGACTTCCATTTGCCAGAGATGATTCATGGCAACACATATGCTTGAGACTTGGAAGGCCTGGAACCATGTGTTGGGATGTCAAGATACATGATCAGCACTTCAGAGATTTGTGGGAGCTTCAGAAGAAAAGCTGTACGGCTCCATGGGGTCCTGATGTTCGAATAGCAAATACATCCGACAAAGACTCTCACATTCGTTACGATCCCGAAGGTGTTGTTCTCAGTTATCAATCAGTAGAGGCAGATAGCATAGACAAGCTAGTGGCAGATATACGAAGGCTTTCCAATGCAAGAATGTTTGCCATTGGGATGCGTAAACTGCTTGGGGTTGGAACAGATGAGAGGCTAGAAGAAAGTAGTATGACCTCAGATGTAAAGACACTAGTTATGAAAGGTGCACCCGACACTGTGGATAAGTTATCTGAACAGATGAGGAGGGCATTTAGAATTGAGGCAGTTGGGTTGATGAGCTTGTGGTTTAGTTTTGGTTCTGGCGTGCTGGCACGATTTGTCGTAGAGTGGGAATCAGGTAAAGAGGGTTGCACTATGCACGTTTCACCTGATCAACTTTGGCCTCATACAAAGTTTTTAGAAGATTTTATAAATGGAGCTGAAGTTGCATCACTTTTGGATTGCATTCGTCTCACTGCTGGACCACTACATGCTCTTGCAGCAGCAACCCGACCTGCTCGAGCTGGTCCTGTTTCGACACTTCCTGGCATAGCTGCAGCTCTTTCTTCCCTTCCAAAACATGGAGGATACACACCTACCCAGGGTGTTTTACCTAGCAGTTCAGCCACGAACACTGGCCAAATTACCAATGGCCCAGTTGGTAACGCTGTTTCTGCAAATGTTTCTGGCCCTCTTGCAAATCATAGCCTTCATGGGGCTGCAATGTTAGCTGCTGCTGCTGGCCGTGGTGGGCCTGGCATTGCTCCTAGTTCCTTGTTGCCAATAGATGTTTCTGTTGTGTTGCGTGGTCCGTATTGGATAAGAATAATATATCGAAAACAATTTGCAGTTGATATGCGCTGCTTTGCAGGAGATCAGGTATGGTTGCAACCAGCAACGCCTGCCAAGGTCAACCCTTCAGTTGGAGGGTCATTACCATGCCCACAGTTCCGGCCATTTATTATGGAGCATGTTGCCCAAGAATTAAATGGTGTAGAGCCAAACTTCCCAGGTGTTCAACAAACTGTTGGATTGTCAGCTCCAAACAATCAGAATCCAAATTCAAGTTCACAGATGACTGCTGCAAATGGAAATAGACTTAGTCTTCCTGGTTCTCCTGCATTGACTAGGGCAGGAAATCAGGTGGCTAATATAAATCGTGTGGGAAGTGCTCTGTCTGGATCTTCAAATTTGGCTTCTGTGGGCTCAGGATTGCCATTACGGAGATCACCAGGAACAGGTGTCCCTGCACACGTGAGAGGTGAACTGAATACAGCTATTATTGGACTTGGGGATGATGGGGGGTATGGAGGAGGTTGGGTTCCTCTTCTTGCTCTTAAGAAAGTTTTGAGAGGTATTCTCAAATACCTTGGAGTTCTTTGGCTGTTTGCTCAGCTTCCAGATCTTCTGAAAGAGATCCTAGGTTCAATTTTGAGGGACAACGAAGGTGCATTGCTGAATTTGGATCCTGAGCAGCCTGCCTTACGTTTCTTTGTGGGGGGATATGTATTTGCTGTAAGCGTTCATAGAGTCCAACTGCTTCTCCAAGTGCTTAGTGTGAAGCGTTTCCATCATCAACAGCAACAACAGCAGCAGCAAAACTCCACTACAGCACAAGAGGAATTGACACAGTCAGAAATTGGTGAAATATGTGATTATTTTAGCCGTCGTGTTGCATCAGAGCCGTATGATGCTTCTCGTGTTGCATCCTTCATTACTCTCCTCACCTTGCCAATATCAGTTTTAAGGGAATTTTTGAAATTGATAGCATGGAAAAAGGGAGTGGCTCAGGCTCAGGGTGGAGATATTGCTCCGGCCCAAAAACCCCGCATTGAATTGTGTCTTGAGAATCATTCTGGGTTGAGTATAGATGAAAACTCTGAAAGATCCACATCCAAAAGCAATATCCATTATGATAGGCAACACAACTCTGTTGATTTCGCCCTGACAGTTGTACTCGATCCTGCTCATATACCTCACATGAATGCAGCAGGTGGCGCTGCCTGGTTGCCCTATTGTGTCTCAGTTAAGTTGAGATATTCCTTTGGTGAAAGCCCTGTTGTCTCTTTTCTTGGTATGGAAGGAAGCCATGGGGGCCGAGCATGCTGGTTACGCATTGATGACTGGGAAAAATGTAAACAGAGGGTGGCTCGAACTGTTGAGGTGAGTGGAAGTTCAACCGGAGATGTTAGCCAAGGAAGGTTGAGAATTGTAGCAGATAGTGTCCAGAGGACATTACATATGTGCCTTCAAGGATTGAGAGAGGGCAGTGAAATAACTGCAATTACCGGTTCAACGTCGTGA
Protein sequence
MAAELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRILRLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSATEILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTDGTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRMAAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNHDGESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLTNKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEPDVDHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEYEEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLLMGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQILEDELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFELEKGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLSNIPSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAEVGSYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPTGSLRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNGVSKRRKVSESPRFSKHSSQLLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIRHCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTMCWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADSIDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVKTLVMKGAPDTVDKLSEQMRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFINGAEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPTQGVLPSSSATNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPSSLLPIDVSVVLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVAQELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTRAGNQVANINRVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLLALKKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAVSVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRRVASEPYDASRVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDENSERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESPVVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADSVQRTLHMCLQGLREGSEITAITGSTS
Homology
BLAST of Bhi06G001519 vs. TAIR 10
Match:
AT3G04740.1 (RNA polymerase II transcription mediators )
HSP 1 Score: 2110.5 bits (5467), Expect = 0.0e+00
Identity = 1161/1820 (63.79%), Postives = 1366/1820 (75.05%), Query Frame = 0
Query: 3 AELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRILRL 62
AELGQQTV+FSALV RAAE+SFLS KELVDKSKS++ SD++KKV++LKYV KTQQR+LRL
Sbjct: 2 AELGQQTVDFSALVGRAAEESFLSFKELVDKSKSTELSDTEKKVSLLKYVAKTQQRMLRL 61
Query: 63 YALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSATEI 122
ALAKWC+QVPLI Y Q L STLS+HD CFTQAADSLFFMHEGLQQARAP+YDVPSA EI
Sbjct: 62 NALAKWCKQVPLINYFQDLGSTLSAHDICFTQAADSLFFMHEGLQQARAPVYDVPSAVEI 121
Query: 123 LLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTDGT 182
LLTG+Y+RLPKC++D+ +Q +L E QQK AL+KLE+LVRSKLLE++LPKEI+EVK++ GT
Sbjct: 122 LLTGSYQRLPKCLDDVGMQSSLDEHQQKPALRKLEVLVRSKLLEITLPKEITEVKISKGT 181
Query: 183 ALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRMAA 242
L VDGEFKVLVTLGYRGHLS+WRILHL+LLVGER GP+KLE RH LGDDLERRM+
Sbjct: 182 VTLSVDGEFKVLVTLGYRGHLSMWRILHLDLLVGERSGPIKLEVTRRHILGDDLERRMSV 241
Query: 243 AEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNHDG 302
AE+PFT LY++LHELC+++VMDTV++QV +L QGRW+DAIRFD+ISD G+T N +G
Sbjct: 242 AENPFTILYAVLHELCVAIVMDTVIRQVRALLQGRWKDAIRFDLISD---TGTTPANQEG 301
Query: 303 ESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLTNK 362
E+D LRTPG+K+ YW D DKN+G PFIKIEPG D+QIKC HSTFVIDPLT K
Sbjct: 302 EADSVSLRTPGMKLFYWSDSDKNSG-------PFIKIEPGSDLQIKCSHSTFVIDPLTGK 361
Query: 363 EAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEPDV 422
EAEFSLDQSCIDVEKLLL+AICCN+YTRLLEIQKEL +N +ICRT DV+LQ +DEP +
Sbjct: 362 EAEFSLDQSCIDVEKLLLKAICCNRYTRLLEIQKELLRNTRICRTPSDVILQALLDEPGI 421
Query: 423 DHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEYEE 482
+ N V E E+LRVRAYGSSFFTLGIN R GRFLLQSS + L ++ L E+E+
Sbjct: 422 E--GDNMVDSKERVE-PEVLRVRAYGSSFFTLGINIRTGRFLLQSSKSILTSSILEEFED 481
Query: 483 ALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLLMG 542
ALNQGS++A D FI LRS+SILH FA+I +FLGLEVYE+GF ++PK++ +GS++L +G
Sbjct: 482 ALNQGSISAVDAFINLRSKSILHFFAAIGKFLGLEVYEHGFGINKVPKSLLDGSSILTLG 541
Query: 543 FPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQILED 602
FPDC +S+ LLM+L+KDF P FKLLET+ D SGK +D +N++R KKIDI Q +ILED
Sbjct: 542 FPDCESSHLLLMELEKDFTPLFKLLETQMDGSGKPQSFNDPSNILRAKKIDIGQIRILED 601
Query: 603 ELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFELE 662
+LNL D + S + + P + +D AL SFSSVVD VF L+
Sbjct: 602 DLNLITSDVVKFVSSFSDAEGINQASGHRQPGL-VDEALTEMSGSQLSFSSVVDGVFGLQ 661
Query: 663 KGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLSNI 722
K ++ + S H N+ V G K +
Sbjct: 662 K------------VTSALMSIDGHGLVPKNLSAVTG-----------------HGKAPML 721
Query: 723 PSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAEVG 782
S+ + SLY N +GP+ S+S +SS P +G+ ++++ S S+Q+L
Sbjct: 722 TSYHSDSLY---NRQGPLQSSSYNMLSSPPGKGSAMKKIAISNSDQEL------------ 781
Query: 783 SYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPTGS 842
S +LSPS G+ +S S GSR S
Sbjct: 782 --------------------------SLILSPSLSTGNGVSES----GSR----LVTESS 841
Query: 843 LRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNGVS 902
L P P+SQ D +S K+ RKR+ASD+L LIPSL+ ++ +
Sbjct: 842 LSP--------LPLSQTADLATSSAGPLLRKDQKPRKRSASDLLRLIPSLQVVEGVASPN 901
Query: 903 KRRKVSESPR------FSKHSSQLLISKEMVSKT-EYSYGNLIAEANKGSAPSSTYVSAL 962
KRRK SE + +S S L + +KT SYGNLIAEANKG+APSS +V AL
Sbjct: 902 KRRKTSELVQSELVKSWSPASQTLSTAVSTSTKTIGCSYGNLIAEANKGNAPSSVFVYAL 961
Query: 963 LHVIRHCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLG 1022
LHV+RH SL IKHA+LTSQM+ALDI YVEE+GLR+A ++IWFRLPFA++DSWQHICL+LG
Sbjct: 962 LHVVRHSSLSIKHAKLTSQMEALDIQYVEEMGLRDAFSDIWFRLPFAQNDSWQHICLQLG 1021
Query: 1023 RPGTMCWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQS 1082
RPG+MCWDVKI+DQHFRDLWELQK S T PWG V IAN+SD DSHIRYDPEGVVLSYQS
Sbjct: 1022 RPGSMCWDVKINDQHFRDLWELQKGSKTTPWGSGVHIANSSDVDSHIRYDPEGVVLSYQS 1081
Query: 1083 VEADSIDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVK-TLVMKGAPDTV 1142
VEADSI KLVADI+RLSNARMF++GMRKLLG+ DE+ EE S S +K + KG+ + V
Sbjct: 1082 VEADSIKKLVADIQRLSNARMFSLGMRKLLGIKPDEKTEECSANSTMKGSTGGKGSGEPV 1141
Query: 1143 DKLSEQMRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFL 1202
D+ RAF+IEAVGL SLWFSFGSGVLARFVVEWESGK+GCTMHVSPDQLWPHTKFL
Sbjct: 1142 DRW-----RAFKIEAVGLTSLWFSFGSGVLARFVVEWESGKDGCTMHVSPDQLWPHTKFL 1201
Query: 1203 EDFINGAEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPT 1262
EDFINGAEV SLLDCIRLTAGPLHALAAATRPARA + +P + A SS + T
Sbjct: 1202 EDFINGAEVESLLDCIRLTAGPLHALAAATRPARASTATGMPVVPATASS-RQSNQIQQT 1261
Query: 1263 QGVL-PSSSA--TNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPS 1322
QG++ PS+ A TGQ + GN V+++ PL HG AML AAAGR GPGI PS
Sbjct: 1262 QGIIAPSTLAAPNATGQSASATSGNTVASSAPSPLGG-GFHGVAML-AAAGRSGPGIVPS 1321
Query: 1323 SLLPIDVSVVLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQF 1382
SLLPIDVSVVLRGPYWIRIIYRK+FAVDMRCFAGDQVWLQPATP K S+GGSLPCPQF
Sbjct: 1322 SLLPIDVSVVLRGPYWIRIIYRKRFAVDMRCFAGDQVWLQPATPPKGGASIGGSLPCPQF 1381
Query: 1383 RPFIMEHVAQELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTR 1442
RPFIMEHVAQELNG+EPN G Q G + PN+ NP T NR++ SP+ R
Sbjct: 1382 RPFIMEHVAQELNGLEPNLTGSQ---GATNPNSGNP------TVNGVNRVNF--SPSSAR 1441
Query: 1443 AGNQVANINRVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYG 1502
A +NRV S SGS V SGLP+RR+PGT VPAHVRGELNTAIIGLGDDGGYG
Sbjct: 1442 AA-----MNRVASVASGS---LVVSSGLPVRRTPGTAVPAHVRGELNTAIIGLGDDGGYG 1501
Query: 1503 GGWVPLLALKKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRF 1562
GGWVPL+ALKKVLRGILKYLGVLWLFAQLPDLL+EILGSIL+DNEGALLNLD EQPALRF
Sbjct: 1502 GGWVPLVALKKVLRGILKYLGVLWLFAQLPDLLREILGSILKDNEGALLNLDQEQPALRF 1561
Query: 1563 FVGGYVFAVSVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRR 1622
FVGGYVFAVSVHRVQLLLQVLSV+RFHH Q QQ +S AQEELTQSEIGEICDYFSRR
Sbjct: 1562 FVGGYVFAVSVHRVQLLLQVLSVRRFHH--QAQQNGSSAAAQEELTQSEIGEICDYFSRR 1621
Query: 1623 VASEPYDASRVASFITLLTLPISVLREFLKLIAWKKGVAQA-QGGDIAPAQKPRIELCLE 1682
VASEPYDASRVASFITLLTLPISVLREFLKLIAWKKG++Q+ Q G+IAPAQ+PRIELCLE
Sbjct: 1622 VASEPYDASRVASFITLLTLPISVLREFLKLIAWKKGLSQSQQAGEIAPAQRPRIELCLE 1681
Query: 1683 NHSGLSIDENSERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSV 1742
NHSG +D N +KSNIHYDR HN+VDFALTVVLDP HIPH+NAAGGAAWLPYCVSV
Sbjct: 1682 NHSGTDLDNN---CAAKSNIHYDRPHNTVDFALTVVLDPVHIPHINAAGGAAWLPYCVSV 1689
Query: 1743 KLRYSFGESPVVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLR 1802
+LRY+FGE+P V+FLGMEGSHGGRACW R+DDWEKCKQRV+RTVEV+GS+ GD++QG+L+
Sbjct: 1742 RLRYTFGENPSVTFLGMEGSHGGRACWQRVDDWEKCKQRVSRTVEVNGSAAGDLTQGKLK 1689
Query: 1803 IVADSVQRTLHMCLQGLREG 1811
+VADSVQRTLH+CLQGLREG
Sbjct: 1802 LVADSVQRTLHLCLQGLREG 1689
BLAST of Bhi06G001519 vs. ExPASy Swiss-Prot
Match:
Q9SR02 (Mediator of RNA polymerase II transcription subunit 14 OS=Arabidopsis thaliana OX=3702 GN=MED14 PE=1 SV=1)
HSP 1 Score: 2110.5 bits (5467), Expect = 0.0e+00
Identity = 1161/1820 (63.79%), Postives = 1366/1820 (75.05%), Query Frame = 0
Query: 3 AELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRILRL 62
AELGQQTV+FSALV RAAE+SFLS KELVDKSKS++ SD++KKV++LKYV KTQQR+LRL
Sbjct: 2 AELGQQTVDFSALVGRAAEESFLSFKELVDKSKSTELSDTEKKVSLLKYVAKTQQRMLRL 61
Query: 63 YALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSATEI 122
ALAKWC+QVPLI Y Q L STLS+HD CFTQAADSLFFMHEGLQQARAP+YDVPSA EI
Sbjct: 62 NALAKWCKQVPLINYFQDLGSTLSAHDICFTQAADSLFFMHEGLQQARAPVYDVPSAVEI 121
Query: 123 LLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTDGT 182
LLTG+Y+RLPKC++D+ +Q +L E QQK AL+KLE+LVRSKLLE++LPKEI+EVK++ GT
Sbjct: 122 LLTGSYQRLPKCLDDVGMQSSLDEHQQKPALRKLEVLVRSKLLEITLPKEITEVKISKGT 181
Query: 183 ALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRMAA 242
L VDGEFKVLVTLGYRGHLS+WRILHL+LLVGER GP+KLE RH LGDDLERRM+
Sbjct: 182 VTLSVDGEFKVLVTLGYRGHLSMWRILHLDLLVGERSGPIKLEVTRRHILGDDLERRMSV 241
Query: 243 AEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNHDG 302
AE+PFT LY++LHELC+++VMDTV++QV +L QGRW+DAIRFD+ISD G+T N +G
Sbjct: 242 AENPFTILYAVLHELCVAIVMDTVIRQVRALLQGRWKDAIRFDLISD---TGTTPANQEG 301
Query: 303 ESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLTNK 362
E+D LRTPG+K+ YW D DKN+G PFIKIEPG D+QIKC HSTFVIDPLT K
Sbjct: 302 EADSVSLRTPGMKLFYWSDSDKNSG-------PFIKIEPGSDLQIKCSHSTFVIDPLTGK 361
Query: 363 EAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEPDV 422
EAEFSLDQSCIDVEKLLL+AICCN+YTRLLEIQKEL +N +ICRT DV+LQ +DEP +
Sbjct: 362 EAEFSLDQSCIDVEKLLLKAICCNRYTRLLEIQKELLRNTRICRTPSDVILQALLDEPGI 421
Query: 423 DHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEYEE 482
+ N V E E+LRVRAYGSSFFTLGIN R GRFLLQSS + L ++ L E+E+
Sbjct: 422 E--GDNMVDSKERVE-PEVLRVRAYGSSFFTLGINIRTGRFLLQSSKSILTSSILEEFED 481
Query: 483 ALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLLMG 542
ALNQGS++A D FI LRS+SILH FA+I +FLGLEVYE+GF ++PK++ +GS++L +G
Sbjct: 482 ALNQGSISAVDAFINLRSKSILHFFAAIGKFLGLEVYEHGFGINKVPKSLLDGSSILTLG 541
Query: 543 FPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQILED 602
FPDC +S+ LLM+L+KDF P FKLLET+ D SGK +D +N++R KKIDI Q +ILED
Sbjct: 542 FPDCESSHLLLMELEKDFTPLFKLLETQMDGSGKPQSFNDPSNILRAKKIDIGQIRILED 601
Query: 603 ELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFELE 662
+LNL D + S + + P + +D AL SFSSVVD VF L+
Sbjct: 602 DLNLITSDVVKFVSSFSDAEGINQASGHRQPGL-VDEALTEMSGSQLSFSSVVDGVFGLQ 661
Query: 663 KGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLSNI 722
K ++ + S H N+ V G K +
Sbjct: 662 K------------VTSALMSIDGHGLVPKNLSAVTG-----------------HGKAPML 721
Query: 723 PSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAEVG 782
S+ + SLY N +GP+ S+S +SS P +G+ ++++ S S+Q+L
Sbjct: 722 TSYHSDSLY---NRQGPLQSSSYNMLSSPPGKGSAMKKIAISNSDQEL------------ 781
Query: 783 SYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPTGS 842
S +LSPS G+ +S S GSR S
Sbjct: 782 --------------------------SLILSPSLSTGNGVSES----GSR----LVTESS 841
Query: 843 LRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNGVS 902
L P P+SQ D +S K+ RKR+ASD+L LIPSL+ ++ +
Sbjct: 842 LSP--------LPLSQTADLATSSAGPLLRKDQKPRKRSASDLLRLIPSLQVVEGVASPN 901
Query: 903 KRRKVSESPR------FSKHSSQLLISKEMVSKT-EYSYGNLIAEANKGSAPSSTYVSAL 962
KRRK SE + +S S L + +KT SYGNLIAEANKG+APSS +V AL
Sbjct: 902 KRRKTSELVQSELVKSWSPASQTLSTAVSTSTKTIGCSYGNLIAEANKGNAPSSVFVYAL 961
Query: 963 LHVIRHCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLG 1022
LHV+RH SL IKHA+LTSQM+ALDI YVEE+GLR+A ++IWFRLPFA++DSWQHICL+LG
Sbjct: 962 LHVVRHSSLSIKHAKLTSQMEALDIQYVEEMGLRDAFSDIWFRLPFAQNDSWQHICLQLG 1021
Query: 1023 RPGTMCWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQS 1082
RPG+MCWDVKI+DQHFRDLWELQK S T PWG V IAN+SD DSHIRYDPEGVVLSYQS
Sbjct: 1022 RPGSMCWDVKINDQHFRDLWELQKGSKTTPWGSGVHIANSSDVDSHIRYDPEGVVLSYQS 1081
Query: 1083 VEADSIDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVK-TLVMKGAPDTV 1142
VEADSI KLVADI+RLSNARMF++GMRKLLG+ DE+ EE S S +K + KG+ + V
Sbjct: 1082 VEADSIKKLVADIQRLSNARMFSLGMRKLLGIKPDEKTEECSANSTMKGSTGGKGSGEPV 1141
Query: 1143 DKLSEQMRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFL 1202
D+ RAF+IEAVGL SLWFSFGSGVLARFVVEWESGK+GCTMHVSPDQLWPHTKFL
Sbjct: 1142 DRW-----RAFKIEAVGLTSLWFSFGSGVLARFVVEWESGKDGCTMHVSPDQLWPHTKFL 1201
Query: 1203 EDFINGAEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPT 1262
EDFINGAEV SLLDCIRLTAGPLHALAAATRPARA + +P + A SS + T
Sbjct: 1202 EDFINGAEVESLLDCIRLTAGPLHALAAATRPARASTATGMPVVPATASS-RQSNQIQQT 1261
Query: 1263 QGVL-PSSSA--TNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPS 1322
QG++ PS+ A TGQ + GN V+++ PL HG AML AAAGR GPGI PS
Sbjct: 1262 QGIIAPSTLAAPNATGQSASATSGNTVASSAPSPLGG-GFHGVAML-AAAGRSGPGIVPS 1321
Query: 1323 SLLPIDVSVVLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQF 1382
SLLPIDVSVVLRGPYWIRIIYRK+FAVDMRCFAGDQVWLQPATP K S+GGSLPCPQF
Sbjct: 1322 SLLPIDVSVVLRGPYWIRIIYRKRFAVDMRCFAGDQVWLQPATPPKGGASIGGSLPCPQF 1381
Query: 1383 RPFIMEHVAQELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTR 1442
RPFIMEHVAQELNG+EPN G Q G + PN+ NP T NR++ SP+ R
Sbjct: 1382 RPFIMEHVAQELNGLEPNLTGSQ---GATNPNSGNP------TVNGVNRVNF--SPSSAR 1441
Query: 1443 AGNQVANINRVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYG 1502
A +NRV S SGS V SGLP+RR+PGT VPAHVRGELNTAIIGLGDDGGYG
Sbjct: 1442 AA-----MNRVASVASGS---LVVSSGLPVRRTPGTAVPAHVRGELNTAIIGLGDDGGYG 1501
Query: 1503 GGWVPLLALKKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRF 1562
GGWVPL+ALKKVLRGILKYLGVLWLFAQLPDLL+EILGSIL+DNEGALLNLD EQPALRF
Sbjct: 1502 GGWVPLVALKKVLRGILKYLGVLWLFAQLPDLLREILGSILKDNEGALLNLDQEQPALRF 1561
Query: 1563 FVGGYVFAVSVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRR 1622
FVGGYVFAVSVHRVQLLLQVLSV+RFHH Q QQ +S AQEELTQSEIGEICDYFSRR
Sbjct: 1562 FVGGYVFAVSVHRVQLLLQVLSVRRFHH--QAQQNGSSAAAQEELTQSEIGEICDYFSRR 1621
Query: 1623 VASEPYDASRVASFITLLTLPISVLREFLKLIAWKKGVAQA-QGGDIAPAQKPRIELCLE 1682
VASEPYDASRVASFITLLTLPISVLREFLKLIAWKKG++Q+ Q G+IAPAQ+PRIELCLE
Sbjct: 1622 VASEPYDASRVASFITLLTLPISVLREFLKLIAWKKGLSQSQQAGEIAPAQRPRIELCLE 1681
Query: 1683 NHSGLSIDENSERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSV 1742
NHSG +D N +KSNIHYDR HN+VDFALTVVLDP HIPH+NAAGGAAWLPYCVSV
Sbjct: 1682 NHSGTDLDNN---CAAKSNIHYDRPHNTVDFALTVVLDPVHIPHINAAGGAAWLPYCVSV 1689
Query: 1743 KLRYSFGESPVVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLR 1802
+LRY+FGE+P V+FLGMEGSHGGRACW R+DDWEKCKQRV+RTVEV+GS+ GD++QG+L+
Sbjct: 1742 RLRYTFGENPSVTFLGMEGSHGGRACWQRVDDWEKCKQRVSRTVEVNGSAAGDLTQGKLK 1689
Query: 1803 IVADSVQRTLHMCLQGLREG 1811
+VADSVQRTLH+CLQGLREG
Sbjct: 1802 LVADSVQRTLHLCLQGLREG 1689
BLAST of Bhi06G001519 vs. ExPASy Swiss-Prot
Match:
P0CB66 (Putative mediator of RNA polymerase II transcription subunit 14 OS=Dictyostelium discoideum OX=44689 GN=med14 PE=3 SV=1)
HSP 1 Score: 167.5 bits (423), Expect = 1.4e-39
Identity = 198/843 (23.49%), Postives = 346/843 (41.04%), Query Frame = 0
Query: 8 QTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRILRLYALAK 67
+ + S ++ R E S+ SL L + + +D ++K I+ Y+ T+++ LRL L K
Sbjct: 64 RNISLSLVIHRLVEQSYNSLLGLTEGLPKA--NDLERKKAIVDYLDGTREKFLRLMVLIK 123
Query: 68 WCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSATEILLTGT 127
W + VP + + L+ DS +AAD L L ARAPIYDVP+A ++L TGT
Sbjct: 124 WSEHVPTLTKANNIIDILNLEDSYLREAADLLINTQFSLVNARAPIYDVPTAIDVLTTGT 183
Query: 128 YERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTDGTALLRV 187
Y+R+P ++ + L Q ++AL++L +++ KL +PKE + V+DG A + V
Sbjct: 184 YQRMPTNIKRVIPPPPLKPTQIESALERLNDIIKYKLFISDVPKEFQPITVSDGKAHIFV 243
Query: 188 DGEFKVLVTLGYRGHLSLWRILHLELLVGERR-----GPVKL--EEVHRHALGDDLERRM 247
D E++ +T+ S W IL L L V +R GP+K+ + ++ L D ++ R+
Sbjct: 244 DDEYEAYLTIDGGSEKSNWVILSLNLFVYSKRNLNGEGPIKVAYDNKMKYVL-DRVQNRI 303
Query: 248 AAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 307
++ P L++I+H LCIS MD + QV +L++ ++ IR D
Sbjct: 304 ISSAQPLFELHNIVHYLCISSQMDILASQVENLKKTILKNNIRCVFGKD----------- 363
Query: 308 DGESDLSGLRTPGLKIMYWLDFDKN--------TGSSDPGSCPFIKIEPGPDMQIKCIHS 367
+ + YWL D N G+ P KI +IK H
Sbjct: 364 -----------QSITVFYWLPEDFNLVGVTQHTLGNLMPNKHTNFKIYIDEHQKIKISH- 423
Query: 368 TFVIDPLTNKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADD-- 427
P+T+ + E + +++E +LL+AI N Y ++ + L N T
Sbjct: 424 ---YPPITHPKNENYFKIASLNLETILLQAIELNAYDKVYLLNSLLLDNRITANTTSSSS 483
Query: 428 ---------------------------------------------VVLQHQVDEPDVDHK 487
+++ + + + +
Sbjct: 484 SSSSNNNNTASPIINRNNNNGKPNLLSTKQSNNPLSRSFHLNDIKLIMSSRFSDENQNDS 543
Query: 488 KKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEYEEALN 547
N H PT +LRV YGS F + +N +NG+F L S N + + E+ LN
Sbjct: 544 NGNNDHLPT------VLRVMLYGSKFLDITVNFQNGKFSLIKSSNYIEFTN--HLEQRLN 603
Query: 548 QGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLLMGFPD 607
+ + + +S+L F S FLGLE F+ + L N SN S +
Sbjct: 604 KDPNEIESIVNVFKLKSLLTCFEEASLFLGLEC----FNKIPLQMNSSNNSESNQLANEL 663
Query: 608 CGNSYFLL--MQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQILE-D 667
+S F+ + L K+ P + ++ K + L + + + +D LE D
Sbjct: 664 FSDSNFICVSISLAKENNPYYLVISIKATCFTPSFHLLFCKMLPKSTIMTLDSIIKLESD 723
Query: 668 ELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFELE 727
+LN L + P S SN T+ NG G S S++++++ E
Sbjct: 724 QLNKLLKE----CPIGSISSSNNTNSNG-------------NGPFQSYISTLLEKIVEAS 783
Query: 728 KGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLSNI 786
+ + ++ N P + N +N + + + NN +N
Sbjct: 784 NQKINLLSIQSFLKKENINYYQPSQQDIENNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 843
BLAST of Bhi06G001519 vs. ExPASy Swiss-Prot
Match:
A2ABV5 (Mediator of RNA polymerase II transcription subunit 14 OS=Mus musculus OX=10090 GN=Med14 PE=1 SV=1)
HSP 1 Score: 128.3 bits (321), Expect = 9.1e-28
Identity = 94/368 (25.54%), Postives = 175/368 (47.55%), Query Frame = 0
Query: 39 QSDSDKKVNILKYVFKTQQRILRLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADS 98
+SD ++K+ I+++ +T+Q +RL AL KW ++ C ++S L F AD
Sbjct: 82 KSDVERKIEIVQFASRTRQLFVRLLALVKWANDAGKVEKCAMISSFLDQQAILFVDTADR 141
Query: 99 LFFM-HEGLQQARAPIYDVPSATEILLTGTYERLPKCVED-ISIQGTLTEDQQKNALKKL 158
L + + L AR P + +P A ++L TG+Y RLP C+ D I +T+ +++ L +L
Sbjct: 142 LASLARDALVHARLPSFAIPYAIDVLTTGSYPRLPTCIRDKIIPPDPITKIEKQATLHQL 201
Query: 159 EILVRSKLLEVSLPKEISEVKVTDGTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVG 218
++R +L+ LP +++ + V +G RV+GEF+ +T+ WR+L LE+LV
Sbjct: 202 NQILRHRLVTTDLPPQLANLTVANGRVKFRVEGEFEATLTVMGDDPEVPWRLLKLEILVE 261
Query: 219 ERR---GPVKLEEVHRHALGDDLERRMAAAEHPFTTLYSILHELCISLVMDTVLKQVHSL 278
++ G + + + ++ R+ A E P +Y+ LH C+SL ++ + Q L
Sbjct: 262 DKETGDGRALVHSMQIDFIHQLVQSRLFADEKPLQDMYNCLHCFCLSLQLEVLHSQTLML 321
Query: 279 RQGRWRDAIRFDVISDGITGGSTQLNHDGESDLSGLRTPGLKIMYWLD--FDKNTGSSDP 338
+ RW D ++ + H G+S L + W + TG++
Sbjct: 322 IRERWGDLVQ------------VERYHAGKS---------LSLSVWNQQVLGRKTGTASV 381
Query: 339 GSCPFIKIEPGPDMQIKCIHSTFVIDPLTNKEAEFSLDQSCIDVEKLLLRAICCNKYTRL 398
IKI+ + I + +K E ++ + +EKLL+ ++ + RL
Sbjct: 382 HKVT-IKIDENDVSKPLQIFHDPPLPASDSKLVERAMKIDHLSIEKLLIDSVHARAHQRL 427
Query: 399 LEIQKELK 400
E++ L+
Sbjct: 442 QELKAILR 427
BLAST of Bhi06G001519 vs. ExPASy Swiss-Prot
Match:
O60244 (Mediator of RNA polymerase II transcription subunit 14 OS=Homo sapiens OX=9606 GN=MED14 PE=1 SV=2)
HSP 1 Score: 126.7 bits (317), Expect = 2.7e-27
Identity = 73/252 (28.97%), Postives = 134/252 (53.17%), Query Frame = 0
Query: 39 QSDSDKKVNILKYVFKTQQRILRLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADS 98
+SD ++K+ I+++ +T+Q +RL AL KW ++ C ++S L F AD
Sbjct: 76 KSDVERKIEIVQFASRTRQLFVRLLALVKWANNAGKVEKCAMISSFLDQQAILFVDTADR 135
Query: 99 LFFM-HEGLQQARAPIYDVPSATEILLTGTYERLPKCVED-ISIQGTLTEDQQKNALKKL 158
L + + L AR P + +P A ++L TG+Y RLP C+ D I +T+ +++ L +L
Sbjct: 136 LASLARDALVHARLPSFAIPYAIDVLTTGSYPRLPTCIRDKIIPPDPITKIEKQATLHQL 195
Query: 159 EILVRSKLLEVSLPKEISEVKVTDGTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVG 218
++R +L+ LP +++ + V +G RV+GEF+ +T+ WR+L LE+LV
Sbjct: 196 NQILRHRLVTTDLPPQLANLTVANGRVKFRVEGEFEATLTVMGDDPDVPWRLLKLEILVE 255
Query: 219 ERR---GPVKLEEVHRHALGDDLERRMAAAEHPFTTLYSILHELCISLVMDTVLKQVHSL 278
++ G + + + ++ R+ A E P +Y+ LH C+SL ++ + Q L
Sbjct: 256 DKETGDGRALVHSMQISFIHQLVQSRLFADEKPLQDMYNCLHSFCLSLQLEVLHSQTLML 315
Query: 279 RQGRWRDAIRFD 286
+ RW D ++ +
Sbjct: 316 IRERWGDLVQVE 327
BLAST of Bhi06G001519 vs. ExPASy Swiss-Prot
Match:
Q03570 (Mediator of RNA polymerase II transcription subunit 14 OS=Caenorhabditis elegans OX=6239 GN=rgr-1 PE=3 SV=6)
HSP 1 Score: 125.6 bits (314), Expect = 5.9e-27
Identity = 84/292 (28.77%), Postives = 148/292 (50.68%), Query Frame = 0
Query: 3 AELGQQTVEFSALVSRAAEDSFLSLKELVD--KSKSSDQSDSDKKVNILKYVFKTQQRIL 62
A G T+ + L+ A + + + L + + K++DQ + ++K++++ + T+ + L
Sbjct: 98 ANCGPPTIPLNVLLDFAIQHVYHEITVLAELMQRKTNDQGEQERKMSLVHFAHATRSQFL 157
Query: 63 RLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEG-LQQARAPIYDVPSA 122
+L AL KW + + C + L F AD L M G L+ AR P Y + A
Sbjct: 158 KLVALVKWIRISKRMDVCYSIDYLLDLQSQYFIDTADRLVAMTRGDLELARLPEYHIAPA 217
Query: 123 TEILLTGTYERLPKCVEDISI-QGTLTEDQQKNALKKLEILVRSKLLEVS--LPKEISEV 182
++L+ GTY R+P +++ I +T +QK +L L+ S+L +S +P I E+
Sbjct: 218 IDVLVLGTYNRMPSKIKEAFIPPAKITPREQKLVTSRLNQLIESRLSRLSSGIPPNIKEI 277
Query: 183 KVTDGTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVH---RHALG 242
+ +G A L V GEF++ +TL ++ W +L++++LV + + L VH + L
Sbjct: 278 HINNGLATLLVPGEFEIKITLLGETEMTKWTLLNIKILVEDYELGMGLPLVHPLQLNQLH 337
Query: 243 DDLERRMAAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFD 286
L+ RM + +P +S LH C+SL +D + Q L GR RD I +
Sbjct: 338 GVLQSRMNVSLNPIKEAFSFLHSFCVSLQLDVLFCQTSRLAAGRLRDNITIE 389
BLAST of Bhi06G001519 vs. ExPASy TrEMBL
Match:
A0A5A7V7Q3 (Mediator of RNA polymerase II transcription subunit 14 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold456G001440 PE=3 SV=1)
HSP 1 Score: 3388.6 bits (8785), Expect = 0.0e+00
Identity = 1727/1821 (94.84%), Postives = 1771/1821 (97.25%), Query Frame = 0
Query: 1 MAAELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRIL 60
MAA+LGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDS+KKVNILKYVFKTQQRIL
Sbjct: 1 MAADLGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSEKKVNILKYVFKTQQRIL 60
Query: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
RLYALAKWCQQVPLIQYCQQLASTLSSHD+CFTQAADSLFFMHEGLQQARAPIYDVPSAT
Sbjct: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDACFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
Query: 121 EILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTD 180
EILLTGTYE LPKCVEDISIQGTLT+DQQK+ALKKLEILVRSKLLEVSLPKEISEVKVTD
Sbjct: 121 EILLTGTYEHLPKCVEDISIQGTLTDDQQKSALKKLEILVRSKLLEVSLPKEISEVKVTD 180
Query: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRM 240
GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRG VKLE+VHRHALGDDLERRM
Sbjct: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGLVKLEQVHRHALGDDLERRM 240
Query: 241 AAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
AA+E+PFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH
Sbjct: 241 AASENPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
Query: 301 DGESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLT 360
DGE+DLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKC+HSTFVIDPLT
Sbjct: 301 DGETDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCVHSTFVIDPLT 360
Query: 361 NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEP 420
NKEAEF LDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKN+QICRTADDVVLQHQVDEP
Sbjct: 361 NKEAEFFLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNVQICRTADDVVLQHQVDEP 420
Query: 421 DVDHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEY 480
DVD KKK+ +HDPTA+EGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKL T+SLTE
Sbjct: 421 DVDPKKKDIIHDPTAFEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLVTSSLTEC 480
Query: 481 EEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLL 540
EEALNQGSM+AADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGS+MLL
Sbjct: 481 EEALNQGSMSAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSSMLL 540
Query: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQIL 600
MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPD SGKA LSDL+NVIR+KKID+DQ QIL
Sbjct: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDPSGKARGLSDLSNVIRVKKIDVDQTQIL 600
Query: 601 EDELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFE 660
EDELNLSLLDW +LFPSLP+S NQT ENGLLPD+SI GALQIAGY PSSFSSVVDEVFE
Sbjct: 601 EDELNLSLLDWGKLFPSLPNSAGNQTPENGLLPDISIGGALQIAGYPPSSFSSVVDEVFE 660
Query: 661 LEKGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLS 720
LEKGPPPVP+FSVSN+SQSFNS+A HY SLSNIHNVKGVPSPKWEV +QPSQGNNVAKLS
Sbjct: 661 LEKGPPPVPSFSVSNMSQSFNSTASHYGSLSNIHNVKGVPSPKWEVGIQPSQGNNVAKLS 720
Query: 721 NIPSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAE 780
NIPSHSNGSLYS SN KGPV STS+GSISSGP RGA TRRLSNSKSEQDLTSLRYPNP E
Sbjct: 721 NIPSHSNGSLYSGSNLKGPVPSTSMGSISSGPGRGAATRRLSNSKSEQDLTSLRYPNPVE 780
Query: 781 VGSYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPT 840
GSYTALDDDHISMP+DTSKDGVYANRSSRLLSPS HGG RIS SIKPNGSRSSPTAAPT
Sbjct: 781 GGSYTALDDDHISMPSDTSKDGVYANRSSRLLSPSPHGGPRISGSIKPNGSRSSPTAAPT 840
Query: 841 GSLRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG 900
GSLRPSGSCSSVSTPVSQNQD+CSSP YESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG
Sbjct: 841 GSLRPSGSCSSVSTPVSQNQDTCSSPVYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG 900
Query: 901 VSKRRKVSESPRFSKHSSQLLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
+SKRRKVSES RFSK SSQLLISKEMVS+TEYSYGNLIAEANKGSAPSSTYVSALLHVIR
Sbjct: 901 LSKRRKVSESARFSKTSSQLLISKEMVSRTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
Query: 961 HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
HCSLCIKHARLTSQMDALDIP+VEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM
Sbjct: 961 HCSLCIKHARLTSQMDALDIPFVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
Query: 1021 CWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
CWDVKIHDQHFRDLWELQKKS TAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS
Sbjct: 1021 CWDVKIHDQHFRDLWELQKKSTTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
Query: 1081 IDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVKTLVMKGAPDTVDKLSEQ 1140
I+KLVADIRRLSNARMFAIGMRKLLGVGTDE+LEESSMTSD+K V KGA DTVDKLSEQ
Sbjct: 1081 IEKLVADIRRLSNARMFAIGMRKLLGVGTDEKLEESSMTSDIKAPVTKGASDTVDKLSEQ 1140
Query: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING
Sbjct: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
Query: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPTQGVLPS 1260
AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGI A LSSLPKHGGYTPTQ VLPS
Sbjct: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIVATLSSLPKHGGYTPTQSVLPS 1260
Query: 1261 SSATNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPSSLLPIDVSV 1320
SSATNTGQ+TNGPVGNAVS NVSGPLANHSLHGAAML AAAGRGGPGIAPSSLLPIDVSV
Sbjct: 1261 SSATNTGQVTNGPVGNAVSTNVSGPLANHSLHGAAML-AAAGRGGPGIAPSSLLPIDVSV 1320
Query: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVA 1380
VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPS GGSLPCPQFRPFIMEHVA
Sbjct: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSFGGSLPCPQFRPFIMEHVA 1380
Query: 1381 QELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTRAGNQVANIN 1440
QELNG+EPNFPGVQQTVGLSAPNNQNPNSSSQ+TAANGNRLSLPGSPA+ R GNQVA+IN
Sbjct: 1381 QELNGLEPNFPGVQQTVGLSAPNNQNPNSSSQITAANGNRLSLPGSPAMPRTGNQVASIN 1440
Query: 1441 RVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLLAL 1500
RVG+ALSGSSNLASV SGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPL+AL
Sbjct: 1441 RVGNALSGSSNLASVSSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLVAL 1500
Query: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV
Sbjct: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
Query: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
SVHRVQLLLQVLSVKRFHHQQQQQQQQNS TAQEELTQSEIGEICDYFSRRVASEPYDAS
Sbjct: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQQNSATAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
Query: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN
Sbjct: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
Query: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP
Sbjct: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
Query: 1741 VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADSVQRTL 1800
VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVAD+VQRTL
Sbjct: 1741 VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADNVQRTL 1800
Query: 1801 HMCLQGLREGSEITAITGSTS 1822
HMCLQGLREGSEIT IT STS
Sbjct: 1801 HMCLQGLREGSEITTITSSTS 1820
BLAST of Bhi06G001519 vs. ExPASy TrEMBL
Match:
A0A1S3C281 (Mediator of RNA polymerase II transcription subunit 14 OS=Cucumis melo OX=3656 GN=LOC103496018 PE=3 SV=1)
HSP 1 Score: 3388.6 bits (8785), Expect = 0.0e+00
Identity = 1727/1821 (94.84%), Postives = 1771/1821 (97.25%), Query Frame = 0
Query: 1 MAAELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRIL 60
MAA+LGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDS+KKVNILKYVFKTQQRIL
Sbjct: 1 MAADLGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSEKKVNILKYVFKTQQRIL 60
Query: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
RLYALAKWCQQVPLIQYCQQLASTLSSHD+CFTQAADSLFFMHEGLQQARAPIYDVPSAT
Sbjct: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDACFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
Query: 121 EILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTD 180
EILLTGTYE LPKCVEDISIQGTLT+DQQK+ALKKLEILVRSKLLEVSLPKEISEVKVTD
Sbjct: 121 EILLTGTYEHLPKCVEDISIQGTLTDDQQKSALKKLEILVRSKLLEVSLPKEISEVKVTD 180
Query: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRM 240
GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRG VKLE+VHRHALGDDLERRM
Sbjct: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGLVKLEQVHRHALGDDLERRM 240
Query: 241 AAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
AA+E+PFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH
Sbjct: 241 AASENPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
Query: 301 DGESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLT 360
DGE+DLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKC+HSTFVIDPLT
Sbjct: 301 DGETDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCVHSTFVIDPLT 360
Query: 361 NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEP 420
NKEAEF LDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKN+QICRTADDVVLQHQVDEP
Sbjct: 361 NKEAEFFLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNVQICRTADDVVLQHQVDEP 420
Query: 421 DVDHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEY 480
DVD KKK+ +HDPTA+EGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKL T+SLTE
Sbjct: 421 DVDPKKKDIIHDPTAFEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLVTSSLTEC 480
Query: 481 EEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLL 540
EEALNQGSM+AADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGS+MLL
Sbjct: 481 EEALNQGSMSAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSSMLL 540
Query: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQIL 600
MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPD SGKA LSDL+NVIR+KKID+DQ QIL
Sbjct: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDPSGKARGLSDLSNVIRVKKIDVDQTQIL 600
Query: 601 EDELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFE 660
EDELNLSLLDW +LFPSLP+S NQT ENGLLPD+SI GALQIAGY PSSFSSVVDEVFE
Sbjct: 601 EDELNLSLLDWGKLFPSLPNSAGNQTPENGLLPDISIGGALQIAGYPPSSFSSVVDEVFE 660
Query: 661 LEKGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLS 720
LEKGPPPVP+FSVSN+SQSFNS+A HY SLSNIHNVKGVPSPKWEV +QPSQGNNVAKLS
Sbjct: 661 LEKGPPPVPSFSVSNMSQSFNSTASHYGSLSNIHNVKGVPSPKWEVGIQPSQGNNVAKLS 720
Query: 721 NIPSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAE 780
NIPSHSNGSLYS SN KGPV STS+GSISSGP RGA TRRLSNSKSEQDLTSLRYPNP E
Sbjct: 721 NIPSHSNGSLYSGSNLKGPVPSTSMGSISSGPGRGAATRRLSNSKSEQDLTSLRYPNPVE 780
Query: 781 VGSYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPT 840
GSYTALDDDHISMP+DTSKDGVYANRSSRLLSPS HGG RIS SIKPNGSRSSPTAAPT
Sbjct: 781 GGSYTALDDDHISMPSDTSKDGVYANRSSRLLSPSPHGGPRISGSIKPNGSRSSPTAAPT 840
Query: 841 GSLRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG 900
GSLRPSGSCSSVSTPVSQNQD+CSSP YESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG
Sbjct: 841 GSLRPSGSCSSVSTPVSQNQDTCSSPVYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG 900
Query: 901 VSKRRKVSESPRFSKHSSQLLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
+SKRRKVSES RFSK SSQLLISKEMVS+TEYSYGNLIAEANKGSAPSSTYVSALLHVIR
Sbjct: 901 LSKRRKVSESARFSKTSSQLLISKEMVSRTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
Query: 961 HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
HCSLCIKHARLTSQMDALDIP+VEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM
Sbjct: 961 HCSLCIKHARLTSQMDALDIPFVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
Query: 1021 CWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
CWDVKIHDQHFRDLWELQKKS TAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS
Sbjct: 1021 CWDVKIHDQHFRDLWELQKKSTTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
Query: 1081 IDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVKTLVMKGAPDTVDKLSEQ 1140
I+KLVADIRRLSNARMFAIGMRKLLGVGTDE+LEESSMTSD+K V KGA DTVDKLSEQ
Sbjct: 1081 IEKLVADIRRLSNARMFAIGMRKLLGVGTDEKLEESSMTSDIKAPVTKGASDTVDKLSEQ 1140
Query: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING
Sbjct: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
Query: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPTQGVLPS 1260
AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGI A LSSLPKHGGYTPTQ VLPS
Sbjct: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIVATLSSLPKHGGYTPTQSVLPS 1260
Query: 1261 SSATNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPSSLLPIDVSV 1320
SSATNTGQ+TNGPVGNAVS NVSGPLANHSLHGAAML AAAGRGGPGIAPSSLLPIDVSV
Sbjct: 1261 SSATNTGQVTNGPVGNAVSTNVSGPLANHSLHGAAML-AAAGRGGPGIAPSSLLPIDVSV 1320
Query: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVA 1380
VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPS GGSLPCPQFRPFIMEHVA
Sbjct: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSFGGSLPCPQFRPFIMEHVA 1380
Query: 1381 QELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTRAGNQVANIN 1440
QELNG+EPNFPGVQQTVGLSAPNNQNPNSSSQ+TAANGNRLSLPGSPA+ R GNQVA+IN
Sbjct: 1381 QELNGLEPNFPGVQQTVGLSAPNNQNPNSSSQITAANGNRLSLPGSPAMPRTGNQVASIN 1440
Query: 1441 RVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLLAL 1500
RVG+ALSGSSNLASV SGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPL+AL
Sbjct: 1441 RVGNALSGSSNLASVSSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLVAL 1500
Query: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV
Sbjct: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
Query: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
SVHRVQLLLQVLSVKRFHHQQQQQQQQNS TAQEELTQSEIGEICDYFSRRVASEPYDAS
Sbjct: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQQNSATAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
Query: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN
Sbjct: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
Query: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP
Sbjct: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
Query: 1741 VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADSVQRTL 1800
VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVAD+VQRTL
Sbjct: 1741 VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADNVQRTL 1800
Query: 1801 HMCLQGLREGSEITAITGSTS 1822
HMCLQGLREGSEIT IT STS
Sbjct: 1801 HMCLQGLREGSEITTITSSTS 1820
BLAST of Bhi06G001519 vs. ExPASy TrEMBL
Match:
A0A0A0LFI5 (Mediator of RNA polymerase II transcription subunit 14 OS=Cucumis sativus OX=3659 GN=Csa_2G011430 PE=3 SV=1)
HSP 1 Score: 3373.9 bits (8747), Expect = 0.0e+00
Identity = 1720/1821 (94.45%), Postives = 1763/1821 (96.81%), Query Frame = 0
Query: 1 MAAELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRIL 60
MAA+LGQQTVEFSALVSRAA+DSFLSLKELVDKSKSSDQSDS+KKVNILKYVFKTQQRIL
Sbjct: 1 MAADLGQQTVEFSALVSRAADDSFLSLKELVDKSKSSDQSDSEKKVNILKYVFKTQQRIL 60
Query: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
RLYALAKWCQQVPLIQYCQQLASTLSSHD+CFTQAADSLFFMHEGLQQARAPIYDVPSAT
Sbjct: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDACFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
Query: 121 EILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTD 180
EILLTGTYERLPKCVEDISIQGTLT+DQQK+ALKKLEILVRSKLLEVSLPKEISEVKVTD
Sbjct: 121 EILLTGTYERLPKCVEDISIQGTLTDDQQKSALKKLEILVRSKLLEVSLPKEISEVKVTD 180
Query: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRM 240
GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRG VKLE+VHRHALGDDLERRM
Sbjct: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGLVKLEQVHRHALGDDLERRM 240
Query: 241 AAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
AAAE+PFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH
Sbjct: 241 AAAENPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
Query: 301 DGESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLT 360
DGE+DLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKC+HSTFVIDPLT
Sbjct: 301 DGETDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCVHSTFVIDPLT 360
Query: 361 NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEP 420
NKEAEF LDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKN+QICRTADDVVL+HQVDEP
Sbjct: 361 NKEAEFFLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNVQICRTADDVVLEHQVDEP 420
Query: 421 DVDHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEY 480
DVD KKK+K+HDP A+EGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKL T+SLTE
Sbjct: 421 DVDPKKKDKIHDPIAFEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLVTSSLTEC 480
Query: 481 EEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLL 540
EEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGS+MLL
Sbjct: 481 EEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSSMLL 540
Query: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQIL 600
MGFPDCGN YFLLMQLDKDFKPQFKLLETKPD SGKA LSDLNNVIR+KKID+DQ QIL
Sbjct: 541 MGFPDCGNLYFLLMQLDKDFKPQFKLLETKPDPSGKARGLSDLNNVIRVKKIDVDQTQIL 600
Query: 601 EDELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFE 660
EDELNLSLLDW +LFP LP+S NQT ENGLLPD+ IDGALQIAGY PSSFSSVVDEVFE
Sbjct: 601 EDELNLSLLDWGKLFPLLPNSAGNQTPENGLLPDIGIDGALQIAGYPPSSFSSVVDEVFE 660
Query: 661 LEKGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLS 720
LEKGPPPVP+FSVSNLSQSFNS+A HY SLSNIHNVKGVPSPKWEV MQPSQGNNVAKLS
Sbjct: 661 LEKGPPPVPSFSVSNLSQSFNSTASHYGSLSNIHNVKGVPSPKWEVGMQPSQGNNVAKLS 720
Query: 721 NIPSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAE 780
NIPSHSNGSLYS SN KGPV STS+GSISSGP RGA TRRLSNSKSEQDLTSLRY NP E
Sbjct: 721 NIPSHSNGSLYSASNLKGPVPSTSMGSISSGPGRGAATRRLSNSKSEQDLTSLRYTNPVE 780
Query: 781 VGSYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPT 840
GSYTALDDDHISMP+DTSKDGVYANRSSRLLSP+ HGG RIS SIKPNGSRSSPTAAPT
Sbjct: 781 GGSYTALDDDHISMPSDTSKDGVYANRSSRLLSPTPHGGPRISGSIKPNGSRSSPTAAPT 840
Query: 841 GSLRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG 900
GSLRPSGSCSSVSTPVSQNQD+CSSP YESGLK+D SRKRTASDMLNLIPSLKGIDAYNG
Sbjct: 841 GSLRPSGSCSSVSTPVSQNQDTCSSPVYESGLKSDCSRKRTASDMLNLIPSLKGIDAYNG 900
Query: 901 VSKRRKVSESPRFSKHSSQLLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
+SKRRKVSES RFSK SSQLLISKEMVS+TEYSYGNLIAEANKG+APSSTYVSALLHVIR
Sbjct: 901 LSKRRKVSESARFSKPSSQLLISKEMVSRTEYSYGNLIAEANKGAAPSSTYVSALLHVIR 960
Query: 961 HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
HCSLCIKHARLTSQMDALDIP+VEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM
Sbjct: 961 HCSLCIKHARLTSQMDALDIPFVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
Query: 1021 CWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
CWDVKIHDQHFRDLWELQKKS TAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS
Sbjct: 1021 CWDVKIHDQHFRDLWELQKKSTTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
Query: 1081 IDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVKTLVMKGAPDTVDKLSEQ 1140
IDKLVADIRRLSNARMFAIGMRKLLGVGTDE+LEESS TSD K V KGA DTVDKLSEQ
Sbjct: 1081 IDKLVADIRRLSNARMFAIGMRKLLGVGTDEKLEESSTTSD-KAPVTKGASDTVDKLSEQ 1140
Query: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING
Sbjct: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
Query: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPTQGVLPS 1260
AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGI A LSSLPKHGGYTPTQ VLPS
Sbjct: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIVATLSSLPKHGGYTPTQSVLPS 1260
Query: 1261 SSATNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPSSLLPIDVSV 1320
SSATNTGQ+TNGPVGNAVS NVSGPLANHSLHGAAMLAA AGRGGPGIAPSSLLPIDVSV
Sbjct: 1261 SSATNTGQVTNGPVGNAVSTNVSGPLANHSLHGAAMLAATAGRGGPGIAPSSLLPIDVSV 1320
Query: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVA 1380
VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPS+GGSLPCPQFRPFIMEHVA
Sbjct: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSMGGSLPCPQFRPFIMEHVA 1380
Query: 1381 QELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTRAGNQVANIN 1440
QELNG+EPNFPGVQQTVGLSAPNNQNPNSSSQ+ AANGNRLSLPGSPA+ RAGNQVANIN
Sbjct: 1381 QELNGLEPNFPGVQQTVGLSAPNNQNPNSSSQIAAANGNRLSLPGSPAMPRAGNQVANIN 1440
Query: 1441 RVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLLAL 1500
RVG+ALSGSSNLASV SGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPL+AL
Sbjct: 1441 RVGNALSGSSNLASVSSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLVAL 1500
Query: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV
Sbjct: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
Query: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
SVHRVQLLLQVLSVKRFHHQQQQQQQ NS TAQEELTQSEIGEICDYFSRRVASEPYDAS
Sbjct: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQPNSATAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
Query: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLS DEN
Sbjct: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSTDEN 1680
Query: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGES
Sbjct: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESL 1740
Query: 1741 VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADSVQRTL 1800
VVSFLGMEGSHGGRACWLR+DDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVAD+VQRTL
Sbjct: 1741 VVSFLGMEGSHGGRACWLRVDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADNVQRTL 1800
Query: 1801 HMCLQGLREGSEITAITGSTS 1822
HMCLQGLREGSEI IT STS
Sbjct: 1801 HMCLQGLREGSEIATITSSTS 1820
BLAST of Bhi06G001519 vs. ExPASy TrEMBL
Match:
A0A6J1KAS6 (Mediator of RNA polymerase II transcription subunit 14 OS=Cucurbita maxima OX=3661 GN=LOC111493784 PE=3 SV=1)
HSP 1 Score: 3286.9 bits (8521), Expect = 0.0e+00
Identity = 1682/1821 (92.37%), Postives = 1741/1821 (95.61%), Query Frame = 0
Query: 1 MAAELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRIL 60
MAAELGQQTVEFSALVSRAAEDSFLSLKELVD SKSSDQSDS+KK+NILKYV+KTQQR+L
Sbjct: 1 MAAELGQQTVEFSALVSRAAEDSFLSLKELVDNSKSSDQSDSEKKINILKYVYKTQQRVL 60
Query: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
RLYALAKWCQQVPLIQYCQQLASTLSSHD+CFTQ ADSLFFMHEGLQQARAPIYDVPSAT
Sbjct: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDTCFTQTADSLFFMHEGLQQARAPIYDVPSAT 120
Query: 121 EILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTD 180
EILL+GTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLL+VSLPKEISEVKV+D
Sbjct: 121 EILLSGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLDVSLPKEISEVKVSD 180
Query: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRM 240
GTALLRVDGEFKVLVTLGYRGHLS+WRILHLELLVGERRG VKLEEVHRH LGDDLERRM
Sbjct: 181 GTALLRVDGEFKVLVTLGYRGHLSMWRILHLELLVGERRGLVKLEEVHRHVLGDDLERRM 240
Query: 241 AAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
AAAE+PFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFD+ISDG+TGGS+Q NH
Sbjct: 241 AAAENPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDIISDGMTGGSSQFNH 300
Query: 301 DGESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLT 360
DGE+DLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDP+T
Sbjct: 301 DGETDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPIT 360
Query: 361 NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEP 420
NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRT DDV+LQH V+EP
Sbjct: 361 NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTEDDVLLQHHVEEP 420
Query: 421 DVDHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEY 480
+VDHKKK+K+HDP+AYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASL +
Sbjct: 421 NVDHKKKDKIHDPSAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLADC 480
Query: 481 EEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLL 540
EEALNQGSMNA DVFIRLRSRSILHLFASISRF+GLEVYENG SAVRLPKNISNGS MLL
Sbjct: 481 EEALNQGSMNATDVFIRLRSRSILHLFASISRFMGLEVYENGSSAVRLPKNISNGSAMLL 540
Query: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQIL 600
MGFPDCGNSYFL MQLDKDFKPQFKLLETK D +GKA LSDL+NVI MKKID+DQ QIL
Sbjct: 541 MGFPDCGNSYFLFMQLDKDFKPQFKLLETKSDPTGKARGLSDLSNVIHMKKIDVDQIQIL 600
Query: 601 EDELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFE 660
ED+L SLLDW +L PSLP+SV+NQTSEN LL D+S+ GALQIAGY PSSFSSVVDEVF
Sbjct: 601 EDDLTFSLLDWGKLLPSLPNSVTNQTSENSLLSDMSLHGALQIAGYPPSSFSSVVDEVFG 660
Query: 661 LEKGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLS 720
LEKGPP VP FSVSN SQSFNS++ Y SLS+IHNVKGVPSPKWEV MQPSQGNNVAKLS
Sbjct: 661 LEKGPPTVPNFSVSNPSQSFNSASSPYGSLSSIHNVKGVPSPKWEVGMQPSQGNNVAKLS 720
Query: 721 NIPSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAE 780
NIPSHSNGSLYSTSN KG VHSTSLGSISSGP RGA RRLSNSKSEQDLTSLR+PNP E
Sbjct: 721 NIPSHSNGSLYSTSNLKGSVHSTSLGSISSGPGRGAAMRRLSNSKSEQDLTSLRFPNPVE 780
Query: 781 VGSYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPT 840
VGSYTALDDDHISMPNDTSKDG+YANRSSRLLSPSQHGGSRISASI PNGSRSSPTAAPT
Sbjct: 781 VGSYTALDDDHISMPNDTSKDGLYANRSSRLLSPSQHGGSRISASINPNGSRSSPTAAPT 840
Query: 841 GSLRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG 900
GSL+PSGSCS VSTPVSQNQDSCSSP YESGLK+DS KRTA ++L+LIPSLKGIDA NG
Sbjct: 841 GSLKPSGSCSLVSTPVSQNQDSCSSPVYESGLKSDSFPKRTALNVLSLIPSLKGIDAPNG 900
Query: 901 VSKRRKVSESPRFSKHSSQLLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
+SKRRK+ ES RF+K SS LLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIR
Sbjct: 901 LSKRRKLLESARFTKPSSHLLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
Query: 961 HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM
Sbjct: 961 HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
Query: 1021 CWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
CWDVKI DQHFRDLWELQKKS +PWGPDVRIANTSDKDSHIRYDPEGV+LSYQSVEADS
Sbjct: 1021 CWDVKIRDQHFRDLWELQKKSSKSPWGPDVRIANTSDKDSHIRYDPEGVILSYQSVEADS 1080
Query: 1081 IDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVKTLVMKGAPDTVDKLSEQ 1140
IDKLVADIRRLSNARMFAIGMRKLLGVGTD +LEESS+TSDVK V KGAPDTVDKL+EQ
Sbjct: 1081 IDKLVADIRRLSNARMFAIGMRKLLGVGTDAKLEESSLTSDVKAPVTKGAPDTVDKLTEQ 1140
Query: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING
Sbjct: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
Query: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPTQGVLPS 1260
AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSS PKHGGYTPTQ VLP
Sbjct: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSFPKHGGYTPTQSVLPG 1260
Query: 1261 SSATNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPSSLLPIDVSV 1320
SSA NTGQ+TNGPVGN VSANVSGPLANHSLHGAAML AAAGRGGPGIAPSSLLPIDVSV
Sbjct: 1261 SSAANTGQVTNGPVGNTVSANVSGPLANHSLHGAAML-AAAGRGGPGIAPSSLLPIDVSV 1320
Query: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVA 1380
VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVA
Sbjct: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVA 1380
Query: 1381 QELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTRAGNQVANIN 1440
QELNG+EPNFPGVQQTVGLSA NNQNPNSSS +TAANGNR SLPGSPA+ RAGNQVANIN
Sbjct: 1381 QELNGLEPNFPGVQQTVGLSASNNQNPNSSS-ITAANGNRPSLPGSPAMPRAGNQVANIN 1440
Query: 1441 RVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLLAL 1500
RVG+ALSGSSNL SV SGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPL+AL
Sbjct: 1441 RVGNALSGSSNLVSVSSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLVAL 1500
Query: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV
Sbjct: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
Query: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
SVHRVQLLLQVLSVKRFHH QQQQQNSTTAQEELTQ+EIGEICDYFSRRVASEPYDAS
Sbjct: 1561 SVHRVQLLLQVLSVKRFHH---QQQQQNSTTAQEELTQTEIGEICDYFSRRVASEPYDAS 1620
Query: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDE
Sbjct: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEK 1680
Query: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP
Sbjct: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
Query: 1741 VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADSVQRTL 1800
VVSFLGMEGSHG RACWLR+DDWEK KQRVARTVEVS +STGDVSQGRLRIVAD VQRTL
Sbjct: 1741 VVSFLGMEGSHGVRACWLRVDDWEKSKQRVARTVEVS-NSTGDVSQGRLRIVADGVQRTL 1800
Query: 1801 HMCLQGLREGSEITAITGSTS 1822
HMCLQGLREGSEITAI GSTS
Sbjct: 1801 HMCLQGLREGSEITAIVGSTS 1815
BLAST of Bhi06G001519 vs. ExPASy TrEMBL
Match:
A0A6J1CPN6 (Mediator of RNA polymerase II transcription subunit 14 OS=Momordica charantia OX=3673 GN=LOC111013156 PE=3 SV=1)
HSP 1 Score: 3276.9 bits (8495), Expect = 0.0e+00
Identity = 1669/1821 (91.65%), Postives = 1731/1821 (95.06%), Query Frame = 0
Query: 1 MAAELGQQTVEFSALVSRAAEDSFLSLKELVDKSKSSDQSDSDKKVNILKYVFKTQQRIL 60
MAAELGQQTVEFSALVSRAAEDSFLSLK+LV SKSSD SDS+KK+NILKYV KTQQR+L
Sbjct: 1 MAAELGQQTVEFSALVSRAAEDSFLSLKDLVHNSKSSDLSDSEKKINILKYVVKTQQRML 60
Query: 61 RLYALAKWCQQVPLIQYCQQLASTLSSHDSCFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
RL ALAKWCQQVPLIQYCQQLASTLSSHD+CFTQAADSLFFMHEGLQQARAPIYDVPSAT
Sbjct: 61 RLNALAKWCQQVPLIQYCQQLASTLSSHDTCFTQAADSLFFMHEGLQQARAPIYDVPSAT 120
Query: 121 EILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVSLPKEISEVKVTD 180
EILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEV+LPKEISEVKVTD
Sbjct: 121 EILLTGTYERLPKCVEDISIQGTLTEDQQKNALKKLEILVRSKLLEVTLPKEISEVKVTD 180
Query: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEEVHRHALGDDLERRM 240
GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEE+HRHALGDDLERRM
Sbjct: 181 GTALLRVDGEFKVLVTLGYRGHLSLWRILHLELLVGERRGPVKLEELHRHALGDDLERRM 240
Query: 241 AAAEHPFTTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFDVISDGITGGSTQLNH 300
A+AE+PF+TLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRF++IS+GITG S QLN
Sbjct: 241 ASAENPFSTLYSILHELCISLVMDTVLKQVHSLRQGRWRDAIRFELISEGITGSSAQLNQ 300
Query: 301 DGESDLSGLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHSTFVIDPLT 360
DGE+DLS LRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIH FVIDPLT
Sbjct: 301 DGETDLSSLRTPGLKIMYWLDFDKNTGSSDPGSCPFIKIEPGPDMQIKCIHIAFVIDPLT 360
Query: 361 NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNIQICRTADDVVLQHQVDEP 420
NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKN+QICR ADDVVLQH VDEP
Sbjct: 361 NKEAEFSLDQSCIDVEKLLLRAICCNKYTRLLEIQKELKKNVQICRAADDVVLQHNVDEP 420
Query: 421 DVDHKKKNKVHDPTAYEGEEILRVRAYGSSFFTLGINTRNGRFLLQSSHNKLATASLTEY 480
DVDHKKK+K+HDPT YEG+EILRVRAYGSSFFTLGINTRNGRFLLQSSHN LA ASL +
Sbjct: 421 DVDHKKKDKIHDPTVYEGQEILRVRAYGSSFFTLGINTRNGRFLLQSSHNILAPASLKDC 480
Query: 481 EEALNQGSMNAADVFIRLRSRSILHLFASISRFLGLEVYENGFSAVRLPKNISNGSTMLL 540
EEALNQGSM AADVFIRLRS+SIL+LFASISRF GLEVYENGFSAVRLPKNISNGS+MLL
Sbjct: 481 EEALNQGSMTAADVFIRLRSKSILYLFASISRFWGLEVYENGFSAVRLPKNISNGSSMLL 540
Query: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETKPDSSGKAHRLSDLNNVIRMKKIDIDQAQIL 600
MGFPDCGNSYFLLMQLDKDFKPQFKLLET+PD SGKAH L D+NNVIRMK IDID+ QIL
Sbjct: 541 MGFPDCGNSYFLLMQLDKDFKPQFKLLETRPDPSGKAHGLGDINNVIRMKIIDIDRIQIL 600
Query: 601 EDELNLSLLDWERLFPSLPSSVSNQTSENGLLPDVSIDGALQIAGYIPSSFSSVVDEVFE 660
EDELNLSLLDW +L P LP+S +NQTSEN LL D+S+DGALQIAGY SSFSSVVD+VFE
Sbjct: 601 EDELNLSLLDWGKLLPLLPNSGNNQTSENSLLSDISLDGALQIAGYPSSSFSSVVDDVFE 660
Query: 661 LEKGPPPVPAFSVSNLSQSFNSSAPHYSSLSNIHNVKGVPSPKWEVSMQPSQGNNVAKLS 720
LEKGPPPVPAFSVS+LSQSFNSSA HYSSLSNIHN+KGVPSPKWEV +QPSQGNNVAKLS
Sbjct: 661 LEKGPPPVPAFSVSSLSQSFNSSASHYSSLSNIHNIKGVPSPKWEVGIQPSQGNNVAKLS 720
Query: 721 NIPSHSNGSLYSTSNSKGPVHSTSLGSISSGPVRGATTRRLSNSKSEQDLTSLRYPNPAE 780
NIPSHSNGSLYS+SN KG VHSTSLGS++SGP RGA TRRLSNSKSEQDLTSLR+PNP E
Sbjct: 721 NIPSHSNGSLYSSSNLKGAVHSTSLGSLASGPGRGAATRRLSNSKSEQDLTSLRFPNPVE 780
Query: 781 VGSYTALDDDHISMPNDTSKDGVYANRSSRLLSPSQHGGSRISASIKPNGSRSSPTAAPT 840
V SYT LDDD+ISMPNDTSKDGVYANRSSRLLSPSQH G RISASIKPNGSRSSPTAAPT
Sbjct: 781 VSSYTTLDDDNISMPNDTSKDGVYANRSSRLLSPSQHAGPRISASIKPNGSRSSPTAAPT 840
Query: 841 GSLRPSGSCSSVSTPVSQNQDSCSSPFYESGLKNDSSRKRTASDMLNLIPSLKGIDAYNG 900
GSLRPSGSCSS STPVSQNQDSCSSP Y+S LKND+SRKRTASDMLNLIPSLKGID YNG
Sbjct: 841 GSLRPSGSCSSASTPVSQNQDSCSSPVYDSSLKNDNSRKRTASDMLNLIPSLKGIDVYNG 900
Query: 901 VSKRRKVSESPRFSKHSSQLLISKEMVSKTEYSYGNLIAEANKGSAPSSTYVSALLHVIR 960
+ KRRKVSES FS+ SSQLL SKEMV +T Y YGNLIAEANKG APSSTYVSALLHVIR
Sbjct: 901 IPKRRKVSESAIFSQPSSQLLTSKEMVPRTLYCYGNLIAEANKGIAPSSTYVSALLHVIR 960
Query: 961 HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFRLPFARDDSWQHICLRLGRPGTM 1020
HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWF LPFAR DSWQHICLRLGRPGTM
Sbjct: 961 HCSLCIKHARLTSQMDALDIPYVEEVGLRNASTNIWFGLPFARGDSWQHICLRLGRPGTM 1020
Query: 1021 CWDVKIHDQHFRDLWELQKKSCTAPWGPDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
CWDVKIHDQHFRDLWELQKKS TAPWG DVRIANTSDKDSHIRYDPEGVVLSYQSVEADS
Sbjct: 1021 CWDVKIHDQHFRDLWELQKKSSTAPWGSDVRIANTSDKDSHIRYDPEGVVLSYQSVEADS 1080
Query: 1081 IDKLVADIRRLSNARMFAIGMRKLLGVGTDERLEESSMTSDVKTLVMKGAPDTVDKLSEQ 1140
I++LVADIRRLSNARMFAIGMR+LLGV TDE+ EES MTSDVK V KGAPD VDKLSEQ
Sbjct: 1081 IERLVADIRRLSNARMFAIGMRRLLGVRTDEKPEESGMTSDVKAPVAKGAPDAVDKLSEQ 1140
Query: 1141 MRRAFRIEAVGLMSLWFSFGSGVLARFVVEWESGKEGCTMHVSPDQLWPHTKFLEDFING 1200
MRR FRIEAVG MSLWFSFGS VLARFVVEWE+GKEGCTMHVSPDQLWPHTKFLEDFING
Sbjct: 1141 MRRVFRIEAVGFMSLWFSFGSSVLARFVVEWEAGKEGCTMHVSPDQLWPHTKFLEDFING 1200
Query: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVSTLPGIAAALSSLPKHGGYTPTQGVLPS 1260
AEVASLLDCIRLTAGPLHALAAATRPARAGPV TLPGIAAA SSLPKHGGYTPTQ VLPS
Sbjct: 1201 AEVASLLDCIRLTAGPLHALAAATRPARAGPVPTLPGIAAAPSSLPKHGGYTPTQSVLPS 1260
Query: 1261 SSATNTGQITNGPVGNAVSANVSGPLANHSLHGAAMLAAAAGRGGPGIAPSSLLPIDVSV 1320
S+ TN GQITNGPVGN VS+NV+GP ANHS HGAAML AAAGRGGPGIAPSSLLPIDVSV
Sbjct: 1261 STLTNAGQITNGPVGNTVSSNVAGPPANHSFHGAAML-AAAGRGGPGIAPSSLLPIDVSV 1320
Query: 1321 VLRGPYWIRIIYRKQFAVDMRCFAGDQVWLQPATPAKVNPSVGGSLPCPQFRPFIMEHVA 1380
VLRGPYWIRIIYRK FAVDMRCFAGDQVWLQPATPAKVNPS+GGSLPCPQFRPFIMEHVA
Sbjct: 1321 VLRGPYWIRIIYRKHFAVDMRCFAGDQVWLQPATPAKVNPSIGGSLPCPQFRPFIMEHVA 1380
Query: 1381 QELNGVEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPALTRAGNQVANIN 1440
QELNG+EPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPA++R GNQVANIN
Sbjct: 1381 QELNGLEPNFPGVQQTVGLSAPNNQNPNSSSQMTAANGNRLSLPGSPAMSRGGNQVANIN 1440
Query: 1441 RVGSALSGSSNLASVGSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLLAL 1500
RVG+ALSGS NLASV SGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPL+AL
Sbjct: 1441 RVGNALSGSPNLASVSSGLPLRRSPGTGVPAHVRGELNTAIIGLGDDGGYGGGWVPLVAL 1500
Query: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV
Sbjct: 1501 KKVLRGILKYLGVLWLFAQLPDLLKEILGSILRDNEGALLNLDPEQPALRFFVGGYVFAV 1560
Query: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQQNSTTAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
SVHRVQLLLQVLSVKRFHHQQQQQQQ NSTTAQEELTQSEIGEICDYFSRRVASEPYDAS
Sbjct: 1561 SVHRVQLLLQVLSVKRFHHQQQQQQQPNSTTAQEELTQSEIGEICDYFSRRVASEPYDAS 1620
Query: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN
Sbjct: 1621 RVASFITLLTLPISVLREFLKLIAWKKGVAQAQGGDIAPAQKPRIELCLENHSGLSIDEN 1680
Query: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLRYSFGESP 1740
SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKL+YSFGE+P
Sbjct: 1681 SERSTSKSNIHYDRQHNSVDFALTVVLDPAHIPHMNAAGGAAWLPYCVSVKLKYSFGENP 1740
Query: 1741 VVSFLGMEGSHGGRACWLRIDDWEKCKQRVARTVEVSGSSTGDVSQGRLRIVADSVQRTL 1800
VV FLGMEGSHGGRACWLRIDDWEKCKQRV RTVEVSG+STGDVSQGRLRIVADSVQRTL
Sbjct: 1741 VVRFLGMEGSHGGRACWLRIDDWEKCKQRVVRTVEVSGNSTGDVSQGRLRIVADSVQRTL 1800
Query: 1801 HMCLQGLREGSEITAITGSTS 1822
H+CLQGLREGSEI AI G TS
Sbjct: 1801 HLCLQGLREGSEIAAIAGLTS 1820
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
AT3G04740.1 | 0.0e+00 | 63.79 | RNA polymerase II transcription mediators | [more] |
Match Name | E-value | Identity | Description | |
Q9SR02 | 0.0e+00 | 63.79 | Mediator of RNA polymerase II transcription subunit 14 OS=Arabidopsis thaliana O... | [more] |
P0CB66 | 1.4e-39 | 23.49 | Putative mediator of RNA polymerase II transcription subunit 14 OS=Dictyostelium... | [more] |
A2ABV5 | 9.1e-28 | 25.54 | Mediator of RNA polymerase II transcription subunit 14 OS=Mus musculus OX=10090 ... | [more] |
O60244 | 2.7e-27 | 28.97 | Mediator of RNA polymerase II transcription subunit 14 OS=Homo sapiens OX=9606 G... | [more] |
Q03570 | 5.9e-27 | 28.77 | Mediator of RNA polymerase II transcription subunit 14 OS=Caenorhabditis elegans... | [more] |
Match Name | E-value | Identity | Description | |
A0A5A7V7Q3 | 0.0e+00 | 94.84 | Mediator of RNA polymerase II transcription subunit 14 OS=Cucumis melo var. maku... | [more] |
A0A1S3C281 | 0.0e+00 | 94.84 | Mediator of RNA polymerase II transcription subunit 14 OS=Cucumis melo OX=3656 G... | [more] |
A0A0A0LFI5 | 0.0e+00 | 94.45 | Mediator of RNA polymerase II transcription subunit 14 OS=Cucumis sativus OX=365... | [more] |
A0A6J1KAS6 | 0.0e+00 | 92.37 | Mediator of RNA polymerase II transcription subunit 14 OS=Cucurbita maxima OX=36... | [more] |
A0A6J1CPN6 | 0.0e+00 | 91.65 | Mediator of RNA polymerase II transcription subunit 14 OS=Momordica charantia OX... | [more] |