Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GGGGAGAGTCATTAGTCTAATTTTAGTTTCACAGATTCAATTACTTGTTCGAGGGTTAAAATCTAATTTAATCGATTTCAATTGCTTGAACCAGGGTTTTCTTCTTTGATTGTTTCACGCGCCGAAGAACAATTCCATCAGCTTAAAACTTTGTGAGCGACTCAAGAATTGGGAGTGACCTTCAAAACGCACTTCGCAAAACCCCAAATTTCAATCCTCTAACTCTTCTTTTGATAATCTTCTTGCGATTCAGCATTCTGTTTCTCTCGCTGAGGGAAGAATTTCTGTGGCTTTGGATCTCTCATGGCGGTTTCCACTCAACCGCCTGGTCAACTGCAGGAGATTTCTGGTTTATGGGACAGTGTGTTGGAGCTTACGAAGTCGGCACAGGACAAGAACTGTGATCCACTGCTTTGGGCGGTTCAATTGAGCTCCACCCTCAATTCCGCAGGCGTTTCCTTGCCGTCGGTTGAGCTCGCCCAGCTCTTGGTCTCTCATATTTGTTGGGATAATCATGTTCCGATCATGTGGAAATTTCTTGAGAAAGCAATGACCGCGAGAATCGTTCCTCCTCTGCTGGTTATTGCTCTTCTTTCTACCAGGTCTTGATTACTCTCTCGCTTCCTCGAGTTTTTATTTGAATCGTGTTCAGGGTGCATTTTTAATGCAGAAGGTACTGTATTGAGTGCTTCATTACGTTTTTTTGCCGGTATCGTTCAAAGTAATACTTTGAATTTGGTAATGGTTGTTTATGTACCCGCGTGGACCATTCTGAAACTAAGTTACGGAGAATAAAAAGAAAATGAAACTGAAGATATACGGATGGATTGCGACATTCATACGTCTTTATTCATGCATAGATTTCGTGCGTTGTTGCGTAATTGTCTTTCTGCTTGTATATCATCCATAATTTTGTCGTGCTGTAGTTGTCTACTGTTGATAGTAAGTTCAAGTCTGTATCTGATTCCCCTTATAGAATTTAATTTGCGTCAGACACTGAAGTGAGTTGGGAAGATGCCTAGAACTTATTGGCGTTTTGTTTTTCTTCCTATTTGTTTATTCTTTTTGTTTTGTTTTGTTTTTGTCATCTATTTACTGTCCAAAGTTGACATTTATGGCATTTTTTCTCTGATTTACTTCAGGGCAATTCCATATAGAAAGCTTCGACCTGCAGCGTACAGGCTTTACCTGGAAGTTCTAAGCAGACATATCTTTTCATTAACATCCCAAATCAATGGACCCAATTACCAAAGGTAGATTTGATTGCATTTCACTCGTGTGCAGTGTAGTGGTTAGCCTCACTCTCCCTTTCATACTTTTTAATCATTCTTCTTAGTGGTACACCTAGTGGTAAAATTTTACCTTGAAAGGTATTCGGCTGTATAACTCTGAGCAAAGCCAAGAAGTTTTTGACCACTTCCACATAATGCACCGTTCTGGAGAGAGAGTTATCATGAGTATTGATACCCTCATAGGGGCAAGCTTCCAATAAAAGAAAGAGAACTTCACTTTGTTGAGATTTTTAGAGTTCTGGATTACCTTATAAAGGCTAGGGGTAGGGTTCAACAAGTTTCCTTTGATGGAAATATCCTTAAACAAGGAAAGAATTGCTAGATAGAATTCCTGTAACATCAAGCCCATCTCTGCAATCTTCTTCTTCTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAATATAGAATCAACAAATTTTATTGAGAAAAAAAATGAAAGAAGCTCATCTATGCAATCTTTTGATTTATCTTCCTTTATGGAAGTTTTGCATTATCAATTATTTTTTTTTAATTTTTTATTAATTATGCACTAAAATATCATGGGAGGAATGGCTCAACCTTTGCACCATGATTTGTTTCATCACCTACCCGCCAGATGTTTATTTATGCAACTGGAGGGCAACGATTCATGAATGATGTTCTGCCATTTCAGGATCATGCAAACCATCGATGATGTCCTTCA
TCTGACCCAGATATTTGGTGTCCAAACGTGTGAACCTGGGGTACTTATGGTTGAATTATTCTTTTCCATTGTATGGCAGCTGCTTGATGCATCATTGGATGATGAAGGATTGCTGGCACTTCCTGGAGAAGAAAAATCAGTGTGGCTGATCAGGCCACAACTGCATGATATGGAACTAGATGTTCATGATTCTTTTGGTGAGAAGAGAACTGAGAACAGTGAAAGTCTGCTTAAAGTAAACACTGCAAAAGCTATTGAGATAATTGGGCAGTTCCTGCAAAATAAGAAAACTGCAAGGATTTTGTGCTTGGCCCTTCGAAATATGTAAGAAGTGTAGTTCTTTCTAACTTTGTTTACTTTCTCTATCTCCTTTGATTTTAAGAAAGGCAATATTTCATTGAGTCTACCACTACTCCACTAGGAGTTGAAATAATTTATACTGCTTTAATGTTAAATGTATATTTGTGTATTGTATAGTACTATAGTTTCATTTGGATGAAAATATTAGTCATTTATCTTTTAATTATTTTTTTAGATGTCCTTTCTTTTTATAATTAGTTCCTTTAATTAGAATCTCTCTTGTATTATATTTACCCATTATTGGGTCTTTTATAAAAAATATGAAAAAAATATTATTTCTCCAAAAATAACATTAAATGAATGCAAAATTTTGACGCCTCAATGATAAAGATGTAACATAATAATGTTTTTGGTTTTTAGGCCATTGCACTGGGCAGGTTTTGCCCAGCGGTTAAAACTACTTGCAGCAAACTCTGTAGTTTTGAGGAACACAAAGCTAATAACTCCAGAGGTCCTTCTGCACTGGACATCTGATAAAAGTAAGCTTTTATCACAAGAAGGAAAAACATGTCAGCTAGAGTTTCGTGATGTAATGGCTTCTGGATCACTATTTTCTTCTGCTGGTCAATCTCATGGCGTTAATTGGTCTGCATTGTGGCTTCCCATCGATTTGTTCCTGGAGGATGCCATGGATGGATCGCAAGTTCTGGCAACTAGTGCTGTTGAACGTCTGATATGTATGAGCTAACAAATTTTATATTGCTCGAGTGATTCTTTAACAATTTTTACAGTGCTCAGTGTTTCTTGAATTAAATTGAAATTCTAAGTTTCCCCAAGAGTTCTATTGATCCAGAGTCATCCGCAGGTTTGATAAAATCTTTGCGGGCAGTTAATGATTCCTCCTGGCACAATACATTTTTAGGTTTGTGGATTGCAGCATTGCGACTTATCCAAAGGGTAGGATCATATAGCACCTTGATAAATGAATGTTTTGCCGTTCATATCCTCTTTATGTTCAGATATTAATTTGTCTGAAATGATTTCACGCTTTCAATTTACTTAATTAGCAGATCCACTGTCTTGAATTGAGAGAACTTAAATGATTTCTGTTCTATGATAATTGTATAGTAACTTTGTGTTGGCTTGGAAAGGTTTTTCTATTTTTCACCTTTTCCTTTTTTTTTTGGCAAATAAATTCAAAGACTAAGGGAGATGGATAAAAAATAACCATTGAAAATCAATCTTTTAAGAAGTTTGATGCATGTGGCAGTTTTACTCCATTTGTGATAACCAACTTAAAATACGGTCTCCTTTTGTTGATTTTGGAGAAACATATCTTGCAAGAATATTGAACGTTATTCCCTCTTATGCTCTACTTTACTTATTTGAGTGTATATTTATCTTTAGGTAACAAAAAACCCATATTTAATCTGCCACCATCCAGGAGGTATTTAACGCATTCCCACCTGGATATATTTGCTAATTCAAATTTTCCATTTCAGGAAAGGGATCCGAGTGAGGGTCCTGTACCTCGTTTGGATACATGCTTGTGCATGTTGTTGTCTATTTCAACCCTTGCAGTCACCATTATTATTGAAGAAGAGGAAGGTGAAGTAAAGGAGGATGAATGCAGCGCAAGTAAAAGCAGAGATGAAAAGCAGTCTTCAGGAATGTGCCGCAAAGGTTTGATTACGAGCTTG
CAGATGTTGGGTGAATATGAGAGCTTGCTGATTCCTCCTCAATCCGTTATTGCAGTAGCCAATCAGGCTGCTGCAAAAGCTGTAATGTTCATATCAGGGGTTGCAGTTGGTAATGAGTACCATGACTGTGTTAGTATGAGCGATACACCTATTAATTGTTGTAAGTACTTATTCTTATTTAATAATTTAACATTGGTTGATCCTATTAAAACAGTAGTCTCTGAAAGATTTAAATTGATAAAATATTTGGATTCTTTTCAAACTGATAAACATAGTCAAATGAGCTATTTTATATTGAAGAAAAGTTCTATCTCCTCTGCAGAAAAAAATGGGATGGAGGATTATTGTATTTATGGACCTCAAGTGGTAGGTTTAAGGGCGTTTCTTACAAGAGCCCTTGGTCTACTATTGCTTTGTTTTGGGATATCCTTTATTCTCTCAGTTCGTTATATGTTCTATTGGGGGTGGGCACGTAACATTTGCATTCCTATTTCTGGGAGGATTGTCCGTTGGGAATGAAACCTTTGTGTTCCTTGTTCCCTCATCTTTACCACCTGTAGGAGAAGAGGTTACACTCAATGGCCTCGGTCTTGCCTTTGAAAGTTTTCTTCCCTATCTCTGTATTCTTGTCGTGTGCTTTCTGATCATGAGGCTTGTGAGGTTGTTAGTCTCCTCTCTATAATTTTAAACCGAGTTATGCATTTAGGGAGAAAGTATTCTAGGTGTTGGCTTCCATCCAGAGCTCTTTGAGGGATCCTCCAGTAGTGTTTTTTTTTCTTTCCTGGATTTTTTGTGCCCCTTCTCCCTCTCATGGCCCCTTCGTCTTCCTCTCTTTGGTAAGTTATAATTCCTAAGAAAGTGAAACTTTTTGCTTGGCCAGTTTCATTGGAGAGTGAATACTCAAGATTGTATCCTGGAGCTTTCTTTAATGGTGTTGCATACACAATAGTGCGGCCTTTGGAGGAGAGAGGAAGAGGATCTTAACCATTTACTTTGGAACTATGTGTTTATTACCTCCTATGGAAACAAGCTCTTTAGGATCTTTGAGACTGTGCTTGATCGAAATAGCGGTTGTGATGTTTGAGGAGGTGCTCTTGAATCCTCCTTTTTGCAACAAAGGAAAGGTTCTTTGGCAGTCTTAGTTTTTTGCTTTGTTATAGGCATTTGGCTTGAGAGGAATAGTAGAATTTTTTGTGGCTCAAGCAGTCTAGGGAGGAATTTTGGGATGTGATTAGGTTTAACACATCCTTGTAGGTGTCTGTTAATAGGCGTTTTCTTAATTATCAGCTTCGTTTAATTCTTTTGAATTGGAGCCCCTTTCTGAATTTGAGCGTCAGGGGGATATTGTTTTTGGGGCTTGTTTTTTATATTGCCCTTGTATATCCTTTTATCTTTCTCAATGAAAATTTAATTTCTTTAAAAAAAGAAAATTGACGTTCAATGTGTTAAGAATTTAAAACTCTTATTAATGGCATTGTGCTTGAACACTTCATCTTCAATGAGAAGAACATTATTAAATATATGTTCGAAACTGTTCTAATCTGTCATTTGAACTTCTGGTTTTCTATAAACAATTTATCTTTCATAAAGGACATTTTAATTCTTGATATGTCATAGGATTTTGGCGTGCGAATCTTCATGTTATAGTGGTTTTTGTCATCCTTTACATGCATCTTAAATTTTAGTTTTCACTCCTTTTGATCCTAGAAGGATCCCCCTCTCTCTCACATTGTTGCGTGTGTATGCATTCGAAATGTGTTCTCTGTGGGGCTAATTTGTTGCTTTTTTCTTGCTGCAGCTGGAAATATGCGGCATCTGATTGTTGAGGCTTGTATTTCTAGGAACCTTCTAGACACATCGGCATATTTTTGGCCAGGTTATGCAAATGCACGCAGTAGTCAAGTGCCTCGTAGTGCATCTAGTCAGGTGGTCGGTTGGTCATCATTCATGAAAGGGTCATCCCTAACTCCGTCGATGGTGAATGCTTTAGTGGCAACCCC
AGCTTCTAGGTATGCTCTTGTAGTATCTGTTTGTATCATGTGAGGTCATATTGAGGGAAGATGATTGGTTAAAGGACCAAGTTCAGGTCCTAGGTTCTGTATTTTCATTTTGTTTCCCAAAAATGTGAGTACAGGTTCCGCAACTATGTGGTGCAGATAAAGAATCTGACAAATAGATTTGGGGAGGTTCTTGAAGTTATGAAAATTGGGAAGGTGATGCGCACAGAGTCTAAAAATTCCTTAGAAGTTAGTTAGCCTCAAAATGCTTTGCTGCCTTCATATTTATAGTAATGAAGGCAGTGAACTAACTTACAAACAGTTAACTTGGCTGTAAAAACTAAAGCTGTCTTAAAACAGAAACTAAAACAAAGAAAGGGCGTTTGTCTTTTCAGCTTAATCTATGTTTTAGGATGATGTTCTTGTACTTTTGTTGTTCTATGTCTTCTTTTGAATATTTTATTAACAAAACAAAACAAAGAAGGAAAGGTAAATGGTTGGGAAATAGCAACACCACTAACTAAGCAAATGAAACAAGTCAGCCAAAACCAGAGAAAATGCAAGAATGTGCAAGAAAGAACAAGATTGGCCTACAAAAGCAAGAAAAGCCCAAGAACAATACAAATGCCAAGAAAAGGCCAATAACGATACAAATGGCCAAGAGCAAAGCATGTTCCTACATCATTCACAAGTTTAAACTTCTAAGGAGAACTCATTCGGAGCATGGTAAAGAGAAAGATTCAAGACATTGAAAATAGGAATAATATGTTAAAGGAGGGAGATCAATGAAAACTCAGTTTCTTTTCATTTCTGTCTTTCCCGTAGAGCTTGTATCTTTAAGCATTGGTCTCTTCTCATCTCTTCAATGAAAACTTGGTTTGTTGTTAGGAAGGCAGAATAGTTAAGTGAACAGAATAGAAGGGAGAATAATTCTAGAATGTTTTCATGAGGGTCGCTGATCAACAAGAATATTTGATGAACAACTCTTTGGGAAATGTACATATTATTATCATCTTAATTCATTTATGATATTTATGCCTTGTAGTTTTACCCATATGTATCAGTATGACAATTAAATTTAACTTTAGCTCATGGATTTGATGGAAGCTTAGCAGAGATTGAGAAGATCTATGAGATTGCAATAAATGGTTCAGGCGACGAGAAGATATCTGCGGCTTCCATTTTGTGTGGGGCTTCGCTTGTTCGAGGCTGGAATCTACAAGTGAGTGATGGTGGCACCCCCCCTCCTGTTTCAAAATTTAATTTTGCATCCCCTCTCATCTGGGATTATCTTTACCCATTTTCTTCTAGGAACACACTGCTCTATTTATATCCAGATTATTGTCACCACCAATTCCTACAGATTACTCTGGGAGTGAGAGCTATTTGATCGATTATGCCCCATTTCTGAATGTTCTACTGGTTGGAATATCATCAGTTGATTGCGTGCAGATTTTTTCCTTGCATGGCATGGTAAGAATGTTAACTTGTCTGCCTATGTGACCCTACTGATAAGCTCCTGAATTTGAAGTTATGTTTTTATATAAGATTGCATGCAAGCTTGGGAGAGGTTTTATATGTGAATTTATGCCCCATTAATTGGCATTATAAGTCAAGCATGAGAGCCTAATTATTTCAGAGAAGTTTTGGTGGGATTTTGAGGCCCATTGACTGAGGGGAAAAAAAAAAAACCTCATATGAGCATTCAAACAATCATTTGATTGAGTTTTTGTTTAATAATGTATATCCTTCTTCAATGATTCTGAAAATATTTGATTTTTGATACGGAGGACTGATATAGATGATTTAACAAACTAATTGAGAAACTCTGGTTTTCGTTGTGTTGTGAGCTTATTCCTATTTTTCTTTTCTTTTCTTTTTGCTGTTTGATGCCAGGTTCCTCTACTTGCAGGTCAATTAATGCCAATCTGCGAAGCTTTTGGATCGTGTTCTCCCAAGTCATGGATCCTTACGTCTGGGGAAGAGCTTACTTGTCATGCAGT
ATTTTCCTTGGCATTTACACTTCTATTGAGGTTGTGGCGGTTTCATCACCCACCTGTTGAAAATGTGAAGGGTGATGCACGACCAGTGGGATCTCAACTAACTCCCGAATATCTGTTATTGGTTCGAAATTCTCAGTTAGCATCTTTTGGGAAGTCACCCAAGGATCGACTTAAAGCGAGACGATTGTCAAAATTATTGAAATTTTCTTTAGAACCTATATTCATGGATTCCTTTCCAAAATTGAAAGGCTGGTACCGGCAACATCAAGAATGCATTGCTTCCATTCTCTCTGGTCTTATACCTGGGGCCCCAGTTCATCAAATTGTTGATGCTCTCTTGACTATGATGTTCAGGAAGATAAATCGTGCTGGTCAGTCTTTGACTTCAACAACTTCAGGAAGCAGCAACTCGTCTGGATCTGCAAATGAAGAGGCCTCCATTAAGCTTAAAGTGCCTGCATGGGACATCCTTGAAGCAACTCCCTTCGTTCTTGATGCTGCTCTTACCGCCTGTGCTCATGGACGATTGTCCCCCCGTGATTTGGCTACAGGCAAGTCCAAAACATTTTTTAATAGTAACTAGAGAGACCATCTACTACTCAGCGTATTCTACATATGAAATTTGTTTTGTTTTGTTTGTTTGTTTTTCTTTTCTTTTGAACTTGTGAATCTTTTTCAACTATACCATTCTTGATTTTAAATATGAAAGAAAGTGACGTTCATTTATTGTGATTTGTAATTAAATTGGTTGCTATTTCATTAATAATAAAGTAGATCATTAATACTATCCTATCCAAAAAATAAAAAGAAAAATTATGGATTCTCGGCCCAACTTTGTTGCTGTGCATAGTCGTTTCTCCTCAAAAAGCTATTGCAGTCTCCAAGCAATCCAATTTTCCCGCCCATTATTCCTCTCACAACAGTTGCCTCATCACATCATTCTCCATCTTTGACAGGAGTCCAACTGGTCCCATCAGCCCCATTCTCCCTCTCAACGGCAAGAAAAAAATTAGGAATGTTGTTGATGGTACTCGAAGAAAAGAAAATCAGAAAGAAGATAGGAAAAAAAAAATGAAGAAAGAAAAGAAAATGAAGACGGAAGTGAGGGAGAGAGGACAAGATGGAGGAGAGAAGAATAGAAAATGAAAAATTTAAAAATAATAATATTATAATAGGTAATGATTACCTATAGAAACAATGATTGGTTTTCAAAAAGCCAGTAAATGGTTTTTCAACTGGCGTGTGGTTTTGGCTGTCAGAATTGTTTTTTCCAATATAGAAAACCGACGACCAATGTTAAAGTCAAAAACTGAACGAACGGAGTTGGTTTGGTCGGTTTTTGTCGGTCGGCTCGGTTTTCCAGTCTGATTTGCTCACAATGCATACTTTTGCTTTTGTTTCCATTGGGATTGGCTGATCTTTGTAGATTGTGAATTAACGGTCTTTTATTGGTTTACTTGGTAAGTTCTTTACCTCTATGTTTCAGGACTCAAAGACCTTGCTGATTTTTTACCTGCATCCTTCGCAACTATTGTGAGCTACTTTTCAGCTGAAGTTACACGGGGTATATGGAAGCCAGCATTTATGAACGGAACTGACTGGCCTAGTCCTGCTGCAACTTTGTCCATTGTTGAGCAACAGATAAAAAAGATTCTTGCTGCAACTGGTGTTGATGTCCCTAGTCTTGCTGTAGGTAAATTCTGATGTGCATTAATGATGTTTTGTATTTTTCTTTTCGTTATCACGTCTGGCAATTGCTTTCGAGTTTTTTTGTATGGATGACCCATTCCCAAGCATATTTATGATTCATTACCTGTCCTCACAAATGTATAAAATCTATCATTCTACTTACGTTAGAGTGCACTACTGCATTTTATAACGTATCGCGTTAGGGATGGTCATATCTGTTGTAGCCCTAACGCGATGGCCATCTCTATCGTGATACCTCCATCAACACTCTAGAAAACTCCAAGTCTAAGACGAAGTGTTTTTGATTTCTT
GATATGACGACCTGTGAGAGAGCTAAATTGGGATGCTTTCTTTCAAATTTATCATGGTTCTTAATCTACCCCGGCCAAGAACCAGATAAAAACTGAACTTTGTCTGCTGCAAGGATCTTATACAATCATGTCAAGAGACTGTTGATGAAGCATAGCAACAGCAATTTCCAAAGTGTGGAAGATAATGTTCATTTAAATATATTAATCTAAATACCTTTCATGAAAGATGATGGTAATGGTAAGAATGATGGTGACTTTATATCAAAAAGATTTTTCAAGATAAGTCCTCTACTAAGTAAACACTTTTTTGTACAAATTGAATTAGTATTGCATTTCATGGATTTTCTTTTCAATCGGAAATTGAGCTTGAGTTGTCTATTTGCAGGTGGAAGTTCTCCTGCTATGCTTCCTTTACCCTTGGCTGCCCTAATAAGCCTCACAATAACTTACAAACTAGATAAAGCCTCCGAACGCCTTCTTGCCCTCGTTGGCCCAGCATTAAATTCACTAGCTGCTAGTTGTTCGTGGCCTTGCACTCCTATTATAGCTTCATTATGGGCTCAAAAGGTGAAGCGATGGAATGATTTCCTTGTGTTCTCTGCTTCTCGCACTGTTTTTCACCATACCAGTGATGCTGTTGTCCAGCTGCTTAAGAGTTGTTTCACTTCAACTCTCGGTTTAGGCAACTCCAATGTAAACAGCAGTGGAGGTGTAGGCACACTCCTCGGTCATGGTTTTGGCTCTCACGTTTTAGGAGGGATGTCCCCAGTGGCTCCTGGGATTCTCTATCTGCGAGTCCATCGATCTGTGAGAGATGTTTTGTTCTTGGTGGAGGAGATTGTCTCTCTCTTAATGCTCTCTGTCAGAGATATTGCAGTTAGTGGGCTACCAAAGGAGAAGGCTGAAAAACTAAAAAAGACCAAGTATGGAATGAGATATGAACAGGTTTCCTTCGCTTCCGCAATGGCACGTGTTAAACTTGCAGCTTCCCTAGGAGCTTCGTTAGTTTGGATTTCGGGTGGATCGGGTTTGGTCCAATCTTTGTTTAAAGAAACCTTGCCGTCTTGGTTTTTATCAGTCCAGTCAGTAGAACGTGAAGGTGTAGAATATGGAGGTATGGTTGCTGTGCTTAGGGGCTATGCACTTGCATTCTTTTCAGTACTATGTGGAACGTTTTCATGGGGCATAGACTCATCATCGTCAGCATCGAAGAGGCGGGCAAAGATTCTCGACTCCTTCCTTGAATTTCTAGCAAGCGCACTGGACGGAAAATTTTCAATTGGGTGTGATTGGGCTACTTGGCGAGCTTATGTTTCTGGGTTTGTGAGTTTGATGGTGCGTTGTGCACCGAGGTGGTTGCTTGAGGTGGATTTGAACCTCTTGAAGAGGTTGAGCAATGGATTAAGGCAGTTGAATGAGGAACAATTGGCTCTTGCATTACTGGGAAGTGGTGGGGTAACTGCAATGGGTGCAGCAGCTGAGCTCATTATCGAAGGTGGATTTTAAATTCTGGTAATTGATTGACATAAAAATATGAGCAGCCAATTAAATTAGTGAGGCATTTTTGTTTAGGAAAACTCGAACCATCAACTAACGCAGGCTTCCATTAGCATCATTAACCCACTGTAATGTAACTTTCATACACGCCAGTGGTTCGTACTGCTGTTGTGCATAAGCATATTGGCTGCCGTTTTGGCTAAAAATGGAAGGGTTTATTTGTATCCCTTTTTGTTTCATGTGATGTAAGTTACGGTAGTTAAGGTAAGAGGCTGAACTCATTAAGCAGATGGCAAAAAAGTATGGAGGATCCACAGCTGAATTCATGCTAATCATAATTAATTGGTTAAAAACACTTGAGTAGTGATTTGGATTCTAATTTGTATTAATATAGATGTATTTCATAAACTTTAATGTCTATGAGCTATTATGTTTGTACCAATTTAGTTTTTCAATTTAATTTGTGATACTGTAATTTAAGATGTTGAGTAGATAAAAAATCT
AATGAAC
mRNA sequence
GGGGAGAGTCATTAGTCTAATTTTAGTTTCACAGATTCAATTACTTGTTCGAGGGTTAAAATCTAATTTAATCGATTTCAATTGCTTGAACCAGGGTTTTCTTCTTTGATTGTTTCACGCGCCGAAGAACAATTCCATCAGCTTAAAACTTTGTGAGCGACTCAAGAATTGGGAGTGACCTTCAAAACGCACTTCGCAAAACCCCAAATTTCAATCCTCTAACTCTTCTTTTGATAATCTTCTTGCGATTCAGCATTCTGTTTCTCTCGCTGAGGGAAGAATTTCTGTGGCTTTGGATCTCTCATGGCGGTTTCCACTCAACCGCCTGGTCAACTGCAGGAGATTTCTGGTTTATGGGACAGTGTGTTGGAGCTTACGAAGTCGGCACAGGACAAGAACTGTGATCCACTGCTTTGGGCGGTTCAATTGAGCTCCACCCTCAATTCCGCAGGCGTTTCCTTGCCGTCGGTTGAGCTCGCCCAGCTCTTGGTCTCTCATATTTGTTGGGATAATCATGTTCCGATCATGTGGAAATTTCTTGAGAAAGCAATGACCGCGAGAATCGTTCCTCCTCTGCTGGTTATTGCTCTTCTTTCTACCAGGGCAATTCCATATAGAAAGCTTCGACCTGCAGCGTACAGGCTTTACCTGGAAGTTCTAAGCAGACATATCTTTTCATTAACATCCCAAATCAATGGACCCAATTACCAAAGGATCATGCAAACCATCGATGATGTCCTTCATCTGACCCAGATATTTGGTGTCCAAACGTGTGAACCTGGGGTACTTATGGTTGAATTATTCTTTTCCATTGTATGGCAGCTGCTTGATGCATCATTGGATGATGAAGGATTGCTGGCACTTCCTGGAGAAGAAAAATCAGTGTGGCTGATCAGGCCACAACTGCATGATATGGAACTAGATGTTCATGATTCTTTTGGTGAGAAGAGAACTGAGAACAGTGAAAGTCTGCTTAAAGTAAACACTGCAAAAGCTATTGAGATAATTGGGCAGTTCCTGCAAAATAAGAAAACTGCAAGGATTTTGTGCTTGGCCCTTCGAAATATGCCATTGCACTGGGCAGGTTTTGCCCAGCGGTTAAAACTACTTGCAGCAAACTCTGTAGTTTTGAGGAACACAAAGCTAATAACTCCAGAGGTCCTTCTGCACTGGACATCTGATAAAAGTAAGCTTTTATCACAAGAAGGAAAAACATGTCAGCTAGAGTTTCGTGATGTAATGGCTTCTGGATCACTATTTTCTTCTGCTGGTCAATCTCATGGCGTTAATTGGTCTGCATTGTGGCTTCCCATCGATTTGTTCCTGGAGGATGCCATGGATGGATCGCAAGTTCTGGCAACTAGTGCTGTTGAACGTCTGATATGTTTGATAAAATCTTTGCGGGCAGTTAATGATTCCTCCTGGCACAATACATTTTTAGGTTTGTGGATTGCAGCATTGCGACTTATCCAAAGGGAAAGGGATCCGAGTGAGGGTCCTGTACCTCGTTTGGATACATGCTTGTGCATGTTGTTGTCTATTTCAACCCTTGCAGTCACCATTATTATTGAAGAAGAGGAAGGTGAAGTAAAGGAGGATGAATGCAGCGCAAGTAAAAGCAGAGATGAAAAGCAGTCTTCAGGAATGTGCCGCAAAGGTTTGATTACGAGCTTGCAGATGTTGGGTGAATATGAGAGCTTGCTGATTCCTCCTCAATCCGTTATTGCAGTAGCCAATCAGGCTGCTGCAAAAGCTGTAATGTTCATATCAGGGGTTGCAGTTGGTAATGAGTACCATGACTGTGTTAGTATGAGCGATACACCTATTAATTGTTCTGGAAATATGCGGCATCTGATTGTTGAGGCTTGTATTTCTAGGAACCTTCTAGACACATCGGCATATTTTTGGCCAGGTTATGCAAATGCACGCAGTAGTCAAGTGCCTCGTAGTGCATCTAGTCAGGTGGTCGGTTGGTCATCATTCATGAAAGGGTCATCCCT
AACTCCGTCGATGGTGAATGCTTTAGTGGCAACCCCAGCTTCTAGCTTAGCAGAGATTGAGAAGATCTATGAGATTGCAATAAATGGTTCAGGCGACGAGAAGATATCTGCGGCTTCCATTTTGTGTGGGGCTTCGCTTGTTCGAGGCTGGAATCTACAAGAACACACTGCTCTATTTATATCCAGATTATTGTCACCACCAATTCCTACAGATTACTCTGGGAGTGAGAGCTATTTGATCGATTATGCCCCATTTCTGAATGTTCTACTGGTTGGAATATCATCAGTTGATTGCGTGCAGATTTTTTCCTTGCATGGCATGGTTCCTCTACTTGCAGGTCAATTAATGCCAATCTGCGAAGCTTTTGGATCGTGTTCTCCCAAGTCATGGATCCTTACGTCTGGGGAAGAGCTTACTTGTCATGCAGTATTTTCCTTGGCATTTACACTTCTATTGAGGTTGTGGCGGTTTCATCACCCACCTGTTGAAAATGTGAAGGGTGATGCACGACCAGTGGGATCTCAACTAACTCCCGAATATCTGTTATTGGTTCGAAATTCTCAGTTAGCATCTTTTGGGAAGTCACCCAAGGATCGACTTAAAGCGAGACGATTGTCAAAATTATTGAAATTTTCTTTAGAACCTATATTCATGGATTCCTTTCCAAAATTGAAAGGCTGGTACCGGCAACATCAAGAATGCATTGCTTCCATTCTCTCTGGTCTTATACCTGGGGCCCCAGTTCATCAAATTGTTGATGCTCTCTTGACTATGATGTTCAGGAAGATAAATCGTGCTGGTCAGTCTTTGACTTCAACAACTTCAGGAAGCAGCAACTCGTCTGGATCTGCAAATGAAGAGGCCTCCATTAAGCTTAAAGTGCCTGCATGGGACATCCTTGAAGCAACTCCCTTCGTTCTTGATGCTGCTCTTACCGCCTGTGCTCATGGACGATTGTCCCCCCGTGATTTGGCTACAGGACTCAAAGACCTTGCTGATTTTTTACCTGCATCCTTCGCAACTATTGTGAGCTACTTTTCAGCTGAAGTTACACGGGGTATATGGAAGCCAGCATTTATGAACGGAACTGACTGGCCTAGTCCTGCTGCAACTTTGTCCATTGTTGAGCAACAGATAAAAAAGATTCTTGCTGCAACTGGTGTTGATGTCCCTAGTCTTGCTGTAGGTGGAAGTTCTCCTGCTATGCTTCCTTTACCCTTGGCTGCCCTAATAAGCCTCACAATAACTTACAAACTAGATAAAGCCTCCGAACGCCTTCTTGCCCTCGTTGGCCCAGCATTAAATTCACTAGCTGCTAGTTGTTCGTGGCCTTGCACTCCTATTATAGCTTCATTATGGGCTCAAAAGGTGAAGCGATGGAATGATTTCCTTGTGTTCTCTGCTTCTCGCACTGTTTTTCACCATACCAGTGATGCTGTTGTCCAGCTGCTTAAGAGTTGTTTCACTTCAACTCTCGGTTTAGGCAACTCCAATGTAAACAGCAGTGGAGGTGTAGGCACACTCCTCGGTCATGGTTTTGGCTCTCACGTTTTAGGAGGGATGTCCCCAGTGGCTCCTGGGATTCTCTATCTGCGAGTCCATCGATCTGTGAGAGATGTTTTGTTCTTGGTGGAGGAGATTGTCTCTCTCTTAATGCTCTCTGTCAGAGATATTGCAGTTAGTGGGCTACCAAAGGAGAAGGCTGAAAAACTAAAAAAGACCAAGTATGGAATGAGATATGAACAGGTTTCCTTCGCTTCCGCAATGGCACGTGTTAAACTTGCAGCTTCCCTAGGAGCTTCGTTAGTTTGGATTTCGGGTGGATCGGGTTTGGTCCAATCTTTGTTTAAAGAAACCTTGCCGTCTTGGTTTTTATCAGTCCAGTCAGTAGAACGTGAAGGTGTAGAATATGGAGGTATGGTTGCTGTGCTTAGGGGCTATGCACTTGCATTCTTTTCAGTACTATGTGGAACGTTTTCATGGGGCATAGACTCATCAT
CGTCAGCATCGAAGAGGCGGGCAAAGATTCTCGACTCCTTCCTTGAATTTCTAGCAAGCGCACTGGACGGAAAATTTTCAATTGGGTGTGATTGGGCTACTTGGCGAGCTTATGTTTCTGGGTTTGTGAGTTTGATGGTGCGTTGTGCACCGAGGTGGTTGCTTGAGGTGGATTTGAACCTCTTGAAGAGGTTGAGCAATGGATTAAGGCAGTTGAATGAGGAACAATTGGCTCTTGCATTACTGGGAAGTGGTGGGGTAACTGCAATGGGTGCAGCAGCTGAGCTCATTATCGAAGGTGGATTTTAAATTCTGGTAATTGATTGACATAAAAATATGAGCAGCCAATTAAATTAGTGAGGCATTTTTGTTTAGGAAAACTCGAACCATCAACTAACGCAGGCTTCCATTAGCATCATTAACCCACTGTAATGTAACTTTCATACACGCCAGTGGTTCGTACTGCTGTTGTGCATAAGCATATTGGCTGCCGTTTTGGCTAAAAATGGAAGGGTTTATTTGTATCCCTTTTTGTTTCATGTGATGTAAGTTACGGTAGTTAAGGTAAGAGGCTGAACTCATTAAGCAGATGGCAAAAAAGTATGGAGGATCCACAGCTGAATTCATGCTAATCATAATTAATTGGTTAAAAACACTTGAGTAGTGATTTGGATTCTAATTTGTATTAATATAGATGTATTTCATAAACTTTAATGTCTATGAGCTATTATGTTTGTACCAATTTAGTTTTTCAATTTAATTTGTGATACTGTAATTTAAGATGTTGAGTAGATAAAAAATCTAATGAAC
Coding sequence (CDS)
ATGGCGGTTTCCACTCAACCGCCTGGTCAACTGCAGGAGATTTCTGGTTTATGGGACAGTGTGTTGGAGCTTACGAAGTCGGCACAGGACAAGAACTGTGATCCACTGCTTTGGGCGGTTCAATTGAGCTCCACCCTCAATTCCGCAGGCGTTTCCTTGCCGTCGGTTGAGCTCGCCCAGCTCTTGGTCTCTCATATTTGTTGGGATAATCATGTTCCGATCATGTGGAAATTTCTTGAGAAAGCAATGACCGCGAGAATCGTTCCTCCTCTGCTGGTTATTGCTCTTCTTTCTACCAGGGCAATTCCATATAGAAAGCTTCGACCTGCAGCGTACAGGCTTTACCTGGAAGTTCTAAGCAGACATATCTTTTCATTAACATCCCAAATCAATGGACCCAATTACCAAAGGATCATGCAAACCATCGATGATGTCCTTCATCTGACCCAGATATTTGGTGTCCAAACGTGTGAACCTGGGGTACTTATGGTTGAATTATTCTTTTCCATTGTATGGCAGCTGCTTGATGCATCATTGGATGATGAAGGATTGCTGGCACTTCCTGGAGAAGAAAAATCAGTGTGGCTGATCAGGCCACAACTGCATGATATGGAACTAGATGTTCATGATTCTTTTGGTGAGAAGAGAACTGAGAACAGTGAAAGTCTGCTTAAAGTAAACACTGCAAAAGCTATTGAGATAATTGGGCAGTTCCTGCAAAATAAGAAAACTGCAAGGATTTTGTGCTTGGCCCTTCGAAATATGCCATTGCACTGGGCAGGTTTTGCCCAGCGGTTAAAACTACTTGCAGCAAACTCTGTAGTTTTGAGGAACACAAAGCTAATAACTCCAGAGGTCCTTCTGCACTGGACATCTGATAAAAGTAAGCTTTTATCACAAGAAGGAAAAACATGTCAGCTAGAGTTTCGTGATGTAATGGCTTCTGGATCACTATTTTCTTCTGCTGGTCAATCTCATGGCGTTAATTGGTCTGCATTGTGGCTTCCCATCGATTTGTTCCTGGAGGATGCCATGGATGGATCGCAAGTTCTGGCAACTAGTGCTGTTGAACGTCTGATATGTTTGATAAAATCTTTGCGGGCAGTTAATGATTCCTCCTGGCACAATACATTTTTAGGTTTGTGGATTGCAGCATTGCGACTTATCCAAAGGGAAAGGGATCCGAGTGAGGGTCCTGTACCTCGTTTGGATACATGCTTGTGCATGTTGTTGTCTATTTCAACCCTTGCAGTCACCATTATTATTGAAGAAGAGGAAGGTGAAGTAAAGGAGGATGAATGCAGCGCAAGTAAAAGCAGAGATGAAAAGCAGTCTTCAGGAATGTGCCGCAAAGGTTTGATTACGAGCTTGCAGATGTTGGGTGAATATGAGAGCTTGCTGATTCCTCCTCAATCCGTTATTGCAGTAGCCAATCAGGCTGCTGCAAAAGCTGTAATGTTCATATCAGGGGTTGCAGTTGGTAATGAGTACCATGACTGTGTTAGTATGAGCGATACACCTATTAATTGTTCTGGAAATATGCGGCATCTGATTGTTGAGGCTTGTATTTCTAGGAACCTTCTAGACACATCGGCATATTTTTGGCCAGGTTATGCAAATGCACGCAGTAGTCAAGTGCCTCGTAGTGCATCTAGTCAGGTGGTCGGTTGGTCATCATTCATGAAAGGGTCATCCCTAACTCCGTCGATGGTGAATGCTTTAGTGGCAACCCCAGCTTCTAGCTTAGCAGAGATTGAGAAGATCTATGAGATTGCAATAAATGGTTCAGGCGACGAGAAGATATCTGCGGCTTCCATTTTGTGTGGGGCTTCGCTTGTTCGAGGCTGGAATCTACAAGAACACACTGCTCTATTTATATCCAGATTATTGTCACCACCAATTCCTACAGATTACTCTGGGAGTGAGAGCTATTTGATCGATTATGCCCCATTTCTGAATGTTCTACTGGTTGGAATATCATCAGTTGATTGCGTGCAGAT
TTTTTCCTTGCATGGCATGGTTCCTCTACTTGCAGGTCAATTAATGCCAATCTGCGAAGCTTTTGGATCGTGTTCTCCCAAGTCATGGATCCTTACGTCTGGGGAAGAGCTTACTTGTCATGCAGTATTTTCCTTGGCATTTACACTTCTATTGAGGTTGTGGCGGTTTCATCACCCACCTGTTGAAAATGTGAAGGGTGATGCACGACCAGTGGGATCTCAACTAACTCCCGAATATCTGTTATTGGTTCGAAATTCTCAGTTAGCATCTTTTGGGAAGTCACCCAAGGATCGACTTAAAGCGAGACGATTGTCAAAATTATTGAAATTTTCTTTAGAACCTATATTCATGGATTCCTTTCCAAAATTGAAAGGCTGGTACCGGCAACATCAAGAATGCATTGCTTCCATTCTCTCTGGTCTTATACCTGGGGCCCCAGTTCATCAAATTGTTGATGCTCTCTTGACTATGATGTTCAGGAAGATAAATCGTGCTGGTCAGTCTTTGACTTCAACAACTTCAGGAAGCAGCAACTCGTCTGGATCTGCAAATGAAGAGGCCTCCATTAAGCTTAAAGTGCCTGCATGGGACATCCTTGAAGCAACTCCCTTCGTTCTTGATGCTGCTCTTACCGCCTGTGCTCATGGACGATTGTCCCCCCGTGATTTGGCTACAGGACTCAAAGACCTTGCTGATTTTTTACCTGCATCCTTCGCAACTATTGTGAGCTACTTTTCAGCTGAAGTTACACGGGGTATATGGAAGCCAGCATTTATGAACGGAACTGACTGGCCTAGTCCTGCTGCAACTTTGTCCATTGTTGAGCAACAGATAAAAAAGATTCTTGCTGCAACTGGTGTTGATGTCCCTAGTCTTGCTGTAGGTGGAAGTTCTCCTGCTATGCTTCCTTTACCCTTGGCTGCCCTAATAAGCCTCACAATAACTTACAAACTAGATAAAGCCTCCGAACGCCTTCTTGCCCTCGTTGGCCCAGCATTAAATTCACTAGCTGCTAGTTGTTCGTGGCCTTGCACTCCTATTATAGCTTCATTATGGGCTCAAAAGGTGAAGCGATGGAATGATTTCCTTGTGTTCTCTGCTTCTCGCACTGTTTTTCACCATACCAGTGATGCTGTTGTCCAGCTGCTTAAGAGTTGTTTCACTTCAACTCTCGGTTTAGGCAACTCCAATGTAAACAGCAGTGGAGGTGTAGGCACACTCCTCGGTCATGGTTTTGGCTCTCACGTTTTAGGAGGGATGTCCCCAGTGGCTCCTGGGATTCTCTATCTGCGAGTCCATCGATCTGTGAGAGATGTTTTGTTCTTGGTGGAGGAGATTGTCTCTCTCTTAATGCTCTCTGTCAGAGATATTGCAGTTAGTGGGCTACCAAAGGAGAAGGCTGAAAAACTAAAAAAGACCAAGTATGGAATGAGATATGAACAGGTTTCCTTCGCTTCCGCAATGGCACGTGTTAAACTTGCAGCTTCCCTAGGAGCTTCGTTAGTTTGGATTTCGGGTGGATCGGGTTTGGTCCAATCTTTGTTTAAAGAAACCTTGCCGTCTTGGTTTTTATCAGTCCAGTCAGTAGAACGTGAAGGTGTAGAATATGGAGGTATGGTTGCTGTGCTTAGGGGCTATGCACTTGCATTCTTTTCAGTACTATGTGGAACGTTTTCATGGGGCATAGACTCATCATCGTCAGCATCGAAGAGGCGGGCAAAGATTCTCGACTCCTTCCTTGAATTTCTAGCAAGCGCACTGGACGGAAAATTTTCAATTGGGTGTGATTGGGCTACTTGGCGAGCTTATGTTTCTGGGTTTGTGAGTTTGATGGTGCGTTGTGCACCGAGGTGGTTGCTTGAGGTGGATTTGAACCTCTTGAAGAGGTTGAGCAATGGATTAAGGCAGTTGAATGAGGAACAATTGGCTCTTGCATTACTGGGAAGTGGTGGGGTAACTGCAATGGGTGCAGCAGCTGAGCTCATTATCGAAGGTGGAT
TTTAA
Protein sequence
MAVSTQPPGQLQEISGLWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQLLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLSRHIFSLTSQINGPNYQRIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSVWLIRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQNKKTARILCLALRNMPLHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQEGKTCQLEFRDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDSSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEEGEVKEDECSASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVANQAAAKAVMFISGVAVGNEYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGYANARSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGWNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSCSPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFSLEPIFMDSFPKLKGWYRQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTSTTSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSLAVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSASRTVFHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHGFGSHVLGGMSPVAPGILYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVEREGVEYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSFLEFLASALDGKFSIGCDWATWRAYVSGFVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGGVTAMGAAAELIIEGGF
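The protein sequence above should correspond to a translation of the coding sequence. The sketch below shows one way to check that, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate_cds` is hypothetical, and only the first and last codons of the CDS listed above are used as input to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this table applies to the CDS shown in this section.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks left by page extraction
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last four codons of the CDS listed above.
cds_start = "ATGGCGGTTTCCACTCAACCGCCTGGT"
cds_end = "GGTGGATTTTAA"

print(translate_cds(cds_start))  # MAVSTQPPG, the start of the protein sequence
print(translate_cds(cds_end))    # GGF, the end of the protein sequence
```

Passing the full CDS (with its line breaks) to the same function would allow a comparison against the entire protein sequence; the whitespace stripping is there because the sequence is wrapped across several lines.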
Homology
BLAST of Bhi05G000263 vs. TAIR 10
Match:
AT3G23590.1 (REF4-related 1)
HSP 1 Score: 1479.9 bits (3830), Expect = 0.0e+00
Identity = 768/1317 (58.31%), Positives = 976/1317 (74.11%), Query Frame = 0
Query: 17 LWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQLLVSHICWDNHVPIMW 76
+WD V+ELTK AQ+ DP LWA QLSS L V LPS ELA+++VS+ICWDN+VPI+W
Sbjct: 9 VWDCVIELTKMAQENCVDPRLWASQLSSNLKFFAVELPSTELAEVIVSYICWDNNVPIVW 68
Query: 77 KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLSRHIFSLTSQINGPNYQ 136
KFLE+AM ++V PL+V+ALL+ R +P R + AAYR+YLE+L R++F++ I+GP+YQ
Sbjct: 69 KFLERAMALKLVSPLVVLALLADRVVPTRSTQQAAYRIYLELLKRNMFTIKDHISGPHYQ 128
Query: 137 RIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSVWL 196
++M ++ ++L L+++F + T +PGVL+VE F +V QLLDA+L DEGLL L + S WL
Sbjct: 129 KVMISVSNILRLSELFDLDTSKPGVLLVEFVFKMVSQLLDAALSDEGLLELSQDSSSQWL 188
Query: 197 IRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQNKKTARILCLALRNMP 256
++ Q DME+D + + EK T + E L +NT AIE+I +FL+N AR+L L N
Sbjct: 189 VKSQ--DMEIDAPERYNEK-TGSLEKLQSLNTIMAIELIAEFLRNTVIARLLYLVSSNRA 248
Query: 257 LHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQEGK-TCQLEFRDVMAS 316
W F Q+++LL NS L+++K++ LL S++ S + K T + ++
Sbjct: 249 SKWHEFVQKVQLLGENSSALKHSKVLNSGDLLQLISNRRFGYSYDSKVTSARKSNAIVDF 308
Query: 317 GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDSSWH 376
GSL S AG HG + S+LWLP+DL EDAMDG QV TSA+E + L K+L+ +N S+WH
Sbjct: 309 GSLSSYAGLCHGASLSSLWLPLDLVFEDAMDGYQVNPTSAIEIITGLAKTLKEINGSTWH 368
Query: 377 NTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEEGEVKEDECS 436
+TFLGLWIAALRL+QRERDP EGP+PRLDT LCM L I L V +IEE + E
Sbjct: 369 DTFLGLWIAALRLVQRERDPIEGPIPRLDTRLCMSLCIVPLVVANLIEE-----GKYESV 428
Query: 437 ASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVANQAAAKAVMFISGVAVGN 496
K RD+ L+TSLQ+LG++ LL PP+ V++ AN+AA KA++F+SG VG
Sbjct: 429 MEKLRDD----------LVTSLQVLGDFPGLLAPPKCVVSAANKAATKAILFLSGGNVGK 488
Query: 497 EYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGYANARSSQVPRSASSQVV 556
D ++M D P+NCSGNMRHLIVEACI+RN+LD SAY WPGY N R +Q+P+S ++V
Sbjct: 489 SCFDVINMKDMPVNCSGNMRHLIVEACIARNILDMSAYSWPGYVNGRINQIPQSLPNEVP 548
Query: 557 GWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRG 616
WSSF+KG+ L +MVN LV+ PASSLAE+EK++E+A+ GS DEKISAA++LCGASL RG
Sbjct: 549 CWSSFVKGAPLNAAMVNTLVSVPASSLAELEKLFEVAVKGSDDEKISAATVLCGASLTRG 608
Query: 617 WNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVP 676
WN+QEHT +++RLLSPP+P DYS +E++LI YA LNV++VGI SVD +QIFSLHGMVP
Sbjct: 609 WNIQEHTVEYLTRLLSPPVPADYSRAENHLIGYACMLNVVIVGIGSVDSIQIFSLHGMVP 668
Query: 677 LLAGQLMPICEAFGSCSPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGD 736
LA LMPICE FGS +P SW L SGE ++ ++VFS AFTLLL+LWRF+HPP+E+ GD
Sbjct: 669 QLACSLMPICEEFGSYTPSVSWTLPSGEAISAYSVFSNAFTLLLKLWRFNHPPIEHGVGD 728
Query: 737 ARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFSLEPIFMDSFPKLKGWY 796
VGSQLTPE+LL VRNS L S +DR + R S +P+F+DSFPKLK WY
Sbjct: 729 VPTVGSQLTPEHLLSVRNSYLVSSEILDRDRNRKRLSEVARAASCQPVFVDSFPKLKVWY 788
Query: 797 RQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTSTTSGSSNSSGSANEEA 856
RQHQ CIA+ LSGL G+PVHQ V+ALL M F K+ R Q+L SG+S+SSG+A+E++
Sbjct: 789 RQHQRCIAATLSGLTHGSPVHQTVEALLNMTFGKV-RGSQTLNPVNSGTSSSSGAASEDS 848
Query: 857 SIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSA 916
+I+ + PAWDIL+A P+V+DAALTAC HGRLSPR LATGLKDLADFLPAS ATIVSYFSA
Sbjct: 849 NIRPEFPAWDILKAVPYVVDAALTACTHGRLSPRQLATGLKDLADFLPASLATIVSYFSA 908
Query: 917 EVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSLAVGGSSPAMLPLPLA 976
EV+RG+WKP FMNG DWPSPA LS VE+ I KILA TGVD+PSLA GGSSPA LPLPLA
Sbjct: 909 EVSRGVWKPVFMNGVDWPSPATNLSTVEEYITKILATTGVDIPSLAPGGSSPATLPLPLA 968
Query: 977 ALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSA 1036
A +SLTITYK+DKASER L L GPAL LAA C WPC PI+ASLW QK KRW DFLVFSA
Sbjct: 969 AFVSLTITYKIDKASERFLNLAGPALECLAAGCPWPCMPIVASLWTQKAKRWFDFLVFSA 1028
Query: 1037 SRTVFHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHGFGSHVLGGMSPVAPGI 1096
SRTVF H DAV+QLL++CF++TLGL + +++ GGVG LLGHGFGSH GG+SPVAPGI
Sbjct: 1029 SRTVFLHNQDAVIQLLRNCFSATLGLNAAPMSNDGGVGALLGHGFGSHFYGGISPVAPGI 1088
Query: 1097 LYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASA 1156
LYLR++R++RD + + EEI+SLL+ SV DIA + L KEK EKLK K G RY Q S A+A
Sbjct: 1089 LYLRMYRALRDTVSVSEEILSLLIHSVEDIAQNRLSKEKLEKLKTVKNGSRYGQSSLATA 1148
Query: 1157 MARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVEREGVEYGGMVAVLRGYA 1216
M +VKLAASL ASLVW++GG G+V L KET+PSWFLS +RE +VA LRG+A
Sbjct: 1149 MTQVKLAASLSASLVWLTGGLGVVHVLIKETIPSWFLSTDKSDREQGP-SDLVAELRGHA 1208
Query: 1217 LAFFSVLCGTFSWGIDSSSSASKRRAK-ILDSFLEFLASALDGKFSIGCDWATWRAYVSG 1276
LA+F VLCG +WG+DS SSASKRR + IL S LEF+ASALDGK S+GC+ ATWR Y+SG
Sbjct: 1209 LAYFVVLCGALTWGVDSRSSASKRRRQAILGSHLEFIASALDGKISVGCETATWRTYISG 1268
Query: 1277 FVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGGVTAMGAAAELII 1331
VSLMV C P W+ E+D +LK LSNGLR+ +++LA+ LL GG+ M AA+ II
Sbjct: 1269 LVSLMVSCLPLWVTEIDTEVLKSLSNGLRKWGKDELAIVLLSLGGLKTMDYAADFII 1305
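The identity and positives percentages in each BLAST summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic follows, using the counts reported for the two TAIR 10 hits in this section; the `percent` helper and the `hits` dictionary layout are illustrative assumptions, not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the summary lines of the two TAIR 10 hits.
hits = {
    "AT3G23590.1": {"identical": 768, "positive": 976, "length": 1317},
    "AT2G48110.1": {"identical": 802, "positive": 960, "length": 1339},
}

for name, counts in hits.items():
    identity = percent(counts["identical"], counts["length"])
    positives = percent(counts["positive"], counts["length"])
    print(f"{name}: identity {identity:.2f}%, positives {positives:.2f}%")
```

Note that the denominator is the alignment length, including gap columns, rather than the length of the query protein, which is why the two hits use different denominators (1317 and 1339).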
BLAST of Bhi05G000263 vs. TAIR 10
Match:
AT2G48110.1 (reduced epidermal fluorescence 4)
HSP 1 Score: 1473.0 bits (3812), Expect = 0.0e+00
Identity = 802/1339 (59.90%), Positives = 960/1339 (71.70%), Query Frame = 0
Query: 17 LWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQLLVSHICWDNHVPIMW 76
LW+SV L +SAQ+KN DPL WA+QL TL SAG+SLPS +LAQ LV+HI W+NH P+ W
Sbjct: 10 LWESVTSLIRSAQEKNVDPLHWALQLRLTLASAGISLPSPDLAQFLVTHIFWENHSPLSW 69
Query: 77 KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLSRHIFSLTSQINGPNYQ 136
K LEKA++ IVPPLLV+ALLS R IP RKL PAAYRLY+E+L RH FS I P Y
Sbjct: 70 KLLEKAISVNIVPPLLVLALLSPRVIPNRKLHPAAYRLYMELLKRHAFSFMPLIRAPGYH 129
Query: 137 RIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSVWL 196
+ M +IDD+LHL++ FGVQ EPG +++ FSIVW+LLDASLD+EGLL L ++S W
Sbjct: 130 KTMNSIDDILHLSETFGVQDQEPGSILLAFVFSIVWELLDASLDEEGLLELTSNKRSKWP 189
Query: 197 IRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQNKKTARILCLALRNMP 256
P HDM+LD ++ KR EN ++L K NT AIE+I +FLQNK T+RIL LA +NM
Sbjct: 190 SSP--HDMDLDGLEN-SVKRNENHDALEKANTEMAIELIQEFLQNKVTSRILHLASQNM- 249
Query: 257 LHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQEGKTC-QLEFRDVMAS 316
E KT + EF +++S
Sbjct: 250 --------------------------------------------ESKTIPRGEFHAIVSS 309
Query: 317 GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDSSWH 376
GS + SALWLPIDLF ED MDG+Q A SAVE L L+K+L+A N +SWH
Sbjct: 310 GSKLALTSD------SALWLPIDLFFEDIMDGTQAAAASAVENLTGLVKALQAANSTSWH 369
Query: 377 NTFLGLWIAALRLIQR-------------------ERDPSEGPVPRLDTCLCMLLSISTL 436
+ FL LW+AALRL+QR ERDP EGPVPR DT LC+LLS++ L
Sbjct: 370 DAFLALWLAALRLVQRENLCLRYCFFMHMLEILSEERDPIEGPVPRTDTFLCVLLSVTPL 429
Query: 437 AVTIIIEEEEGEVKEDECSASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAV 496
AV IIEEEE + D+ S+S S K+ G CR+GLI SLQ LG+YESLL PP+SV +V
Sbjct: 430 AVANIIEEEESQ-WIDQTSSSPSNQWKEKKGKCRQGLINSLQQLGDYESLLTPPRSVQSV 489
Query: 497 ANQAAAKAVMFISGVAVGNEYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWP 556
ANQAAAKA+MFISG+ N ++ SMS++ C C R L T F
Sbjct: 490 ANQAAAKAIMFISGITNSNGSYENTSMSESASGC-----------CKVRFSLFTLKMF-- 549
Query: 557 GYANARSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGS 616
+ + WS MKGS LTPS+ N+L+ TPASSLAEIEK+YE+A GS
Sbjct: 550 -------VVMGVYLLCNISCWSLVMKGSPLTPSLTNSLITTPASSLAEIEKMYEVATTGS 609
Query: 617 GDEKISAASILCGASLVRGWNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLL 676
DEKI+ ASILCGASL RGW++QEH +FI LLSPP P D SGS S+LI+ APFLNVLL
Sbjct: 610 EDEKIAVASILCGASLFRGWSIQEHVIIFIVTLLSPPAPADLSGSYSHLINSAPFLNVLL 669
Query: 677 VGISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSCSPK-SWILTSGEELTCHAVFSLAFT 736
VGIS +DCV IFSLHG+VPLLAG LMPICEAFGS P +W L +GE ++ HAVFS AFT
Sbjct: 670 VGISPIDCVHIFSLHGVVPLLAGALMPICEAFGSGVPNITWTLPTGELISSHAVFSTAFT 729
Query: 737 LLLRLWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLL 796
LLLRLWRF HPP++ V GD PVG Q +PEYLLLVRN +L FGKSPKDR+ RR SK++
Sbjct: 730 LLLRLWRFDHPPLDYVLGDVPPVGPQPSPEYLLLVRNCRLECFGKSPKDRMARRRFSKVI 789
Query: 797 KFSLEPIFMDSFPKLKGWYRQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQS 856
S++PIFMDSFP+LK WYRQHQEC+ASILS L G+PVH IVD+LL+MMF+K N+ G
Sbjct: 790 DISVDPIFMDSFPRLKQWYRQHQECMASILSELKTGSPVHHIVDSLLSMMFKKANKGGSQ 849
Query: 857 LTSTTSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLK 916
+ +SGSS+ S S +++S +LK+PAWDILEA PFVLDAALTACAHG LSPR+LATGLK
Sbjct: 850 SLTPSSGSSSLSTSGGDDSSDQLKLPAWDILEAAPFVLDAALTACAHGSLSPRELATGLK 909
Query: 917 DLADFLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVD 976
LADFLPA+ T+VSYFS+EVTRG+WKP MNGTDWPSPAA L+ VEQQI+KILAATGVD
Sbjct: 910 ILADFLPATLGTMVSYFSSEVTRGLWKPVSMNGTDWPSPAANLASVEQQIEKILAATGVD 969
Query: 977 VPSLAVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPII 1036
VP L G S A LPLPLAAL+SLTITYKLDKA+ER L LVGPAL+SLAA+C WPC PI+
Sbjct: 970 VPRLPADGISAATLPLPLAALVSLTITYKLDKATERFLVLVGPALDSLAAACPWPCMPIV 1029
Query: 1037 ASLWAQKVKRWNDFLVFSASRTVFHHTSDAVVQLLKSCFTSTLGL-GNSNVNSSGGVGTL 1096
SLW QKVKRW+DFL+FSASRTVFHH DAV+QLL+SCFT TLGL S + S GGVG L
Sbjct: 1030 TSLWTQKVKRWSDFLIFSASRTVFHHNRDAVIQLLRSCFTCTLGLTPTSQLCSYGGVGAL 1089
Query: 1097 LGHGFGSHVLGGMSPVAPGILYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKA 1156
LGHGFGS GG+S APGILY++VHRS+RDV+FL EEI+SLLM SV+ IA LP +A
Sbjct: 1090 LGHGFGSRYSGGISTAAPGILYIKVHRSIRDVMFLTEEILSLLMFSVKSIATRELPAGQA 1149
Query: 1157 EKLKKTKYGMRY--EQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLS 1216
EKLKKTK G RY QVS + AM RVKLAASLGASLVWISGG LVQ+L KETLPSWF+S
Sbjct: 1150 EKLKKTKDGSRYGIGQVSLSLAMRRVKLAASLGASLVWISGGLNLVQALIKETLPSWFIS 1209
Query: 1217 VQSVEREGVEYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSFLEFLAS 1276
V E E GGMV +LRGYALA+F++L F+WG+DSS ASKRR ++L LEF+ S
Sbjct: 1210 VHGEED---ELGGMVPMLRGYALAYFAILSSAFAWGVDSSYPASKRRPRVLWLHLEFMVS 1269
Query: 1277 ALDGKFSIGCDWATWRAYVSGFVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALA 1332
AL+GK S+GCDWATW+AYV+GFVSLMV+C P W+LEVD+ ++KRLS LRQ NE+ LALA
Sbjct: 1270 ALEGKISLGCDWATWQAYVTGFVSLMVQCTPAWVLEVDVEVIKRLSKSLRQWNEQDLALA 1270
BLAST of Bhi05G000263 vs. ExPASy Swiss-Prot
Match:
Q9LUG9 (Mediator of RNA polymerase II transcription subunit 33A OS=Arabidopsis thaliana OX=3702 GN=MED33A PE=1 SV=1)
HSP 1 Score: 1479.9 bits (3830), Expect = 0.0e+00
Identity = 768/1317 (58.31%), Positives = 976/1317 (74.11%), Query Frame = 0
Query: 17 LWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQLLVSHICWDNHVPIMW 76
+WD V+ELTK AQ+ DP LWA QLSS L V LPS ELA+++VS+ICWDN+VPI+W
Sbjct: 9 VWDCVIELTKMAQENCVDPRLWASQLSSNLKFFAVELPSTELAEVIVSYICWDNNVPIVW 68
Query: 77 KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLSRHIFSLTSQINGPNYQ 136
KFLE+AM ++V PL+V+ALL+ R +P R + AAYR+YLE+L R++F++ I+GP+YQ
Sbjct: 69 KFLERAMALKLVSPLVVLALLADRVVPTRSTQQAAYRIYLELLKRNMFTIKDHISGPHYQ 128
Query: 137 RIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSVWL 196
++M ++ ++L L+++F + T +PGVL+VE F +V QLLDA+L DEGLL L + S WL
Sbjct: 129 KVMISVSNILRLSELFDLDTSKPGVLLVEFVFKMVSQLLDAALSDEGLLELSQDSSSQWL 188
Query: 197 IRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQNKKTARILCLALRNMP 256
++ Q DME+D + + EK T + E L +NT AIE+I +FL+N AR+L L N
Sbjct: 189 VKSQ--DMEIDAPERYNEK-TGSLEKLQSLNTIMAIELIAEFLRNTVIARLLYLVSSNRA 248
Query: 257 LHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQEGK-TCQLEFRDVMAS 316
W F Q+++LL NS L+++K++ LL S++ S + K T + ++
Sbjct: 249 SKWHEFVQKVQLLGENSSALKHSKVLNSGDLLQLISNRRFGYSYDSKVTSARKSNAIVDF 308
Query: 317 GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDSSWH 376
GSL S AG HG + S+LWLP+DL EDAMDG QV TSA+E + L K+L+ +N S+WH
Sbjct: 309 GSLSSYAGLCHGASLSSLWLPLDLVFEDAMDGYQVNPTSAIEIITGLAKTLKEINGSTWH 368
Query: 377 NTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEEGEVKEDECS 436
+TFLGLWIAALRL+QRERDP EGP+PRLDT LCM L I L V +IEE + E
Sbjct: 369 DTFLGLWIAALRLVQRERDPIEGPIPRLDTRLCMSLCIVPLVVANLIEE-----GKYESV 428
Query: 437 ASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVANQAAAKAVMFISGVAVGN 496
K RD+ L+TSLQ+LG++ LL PP+ V++ AN+AA KA++F+SG VG
Sbjct: 429 MEKLRDD----------LVTSLQVLGDFPGLLAPPKCVVSAANKAATKAILFLSGGNVGK 488
Query: 497 EYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGYANARSSQVPRSASSQVV 556
D ++M D P+NCSGNMRHLIVEACI+RN+LD SAY WPGY N R +Q+P+S ++V
Sbjct: 489 SCFDVINMKDMPVNCSGNMRHLIVEACIARNILDMSAYSWPGYVNGRINQIPQSLPNEVP 548
Query: 557 GWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRG 616
WSSF+KG+ L +MVN LV+ PASSLAE+EK++E+A+ GS DEKISAA++LCGASL RG
Sbjct: 549 CWSSFVKGAPLNAAMVNTLVSVPASSLAELEKLFEVAVKGSDDEKISAATVLCGASLTRG 608
Query: 617 WNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVP 676
WN+QEHT +++RLLSPP+P DYS +E++LI YA LNV++VGI SVD +QIFSLHGMVP
Sbjct: 609 WNIQEHTVEYLTRLLSPPVPADYSRAENHLIGYACMLNVVIVGIGSVDSIQIFSLHGMVP 668
Query: 677 LLAGQLMPICEAFGSCSPK-SWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGD 736
LA LMPICE FGS +P SW L SGE ++ ++VFS AFTLLL+LWRF+HPP+E+ GD
Sbjct: 669 QLACSLMPICEEFGSYTPSVSWTLPSGEAISAYSVFSNAFTLLLKLWRFNHPPIEHGVGD 728
Query: 737 ARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFSLEPIFMDSFPKLKGWY 796
VGSQLTPE+LL VRNS L S +DR + R S +P+F+DSFPKLK WY
Sbjct: 729 VPTVGSQLTPEHLLSVRNSYLVSSEILDRDRNRKRLSEVARAASCQPVFVDSFPKLKVWY 788
Query: 797 RQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTSTTSGSSNSSGSANEEA 856
RQHQ CIA+ LSGL G+PVHQ V+ALL M F K+ R Q+L SG+S+SSG+A+E++
Sbjct: 789 RQHQRCIAATLSGLTHGSPVHQTVEALLNMTFGKV-RGSQTLNPVNSGTSSSSGAASEDS 848
Query: 857 SIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSA 916
+I+ + PAWDIL+A P+V+DAALTAC HGRLSPR LATGLKDLADFLPAS ATIVSYFSA
Sbjct: 849 NIRPEFPAWDILKAVPYVVDAALTACTHGRLSPRQLATGLKDLADFLPASLATIVSYFSA 908
Query: 917 EVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSLAVGGSSPAMLPLPLA 976
EV+RG+WKP FMNG DWPSPA LS VE+ I KILA TGVD+PSLA GGSSPA LPLPLA
Sbjct: 909 EVSRGVWKPVFMNGVDWPSPATNLSTVEEYITKILATTGVDIPSLAPGGSSPATLPLPLA 968
Query: 977 ALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSA 1036
A +SLTITYK+DKASER L L GPAL LAA C WPC PI+ASLW QK KRW DFLVFSA
Sbjct: 969 AFVSLTITYKIDKASERFLNLAGPALECLAAGCPWPCMPIVASLWTQKAKRWFDFLVFSA 1028
Query: 1037 SRTVFHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHGFGSHVLGGMSPVAPGI 1096
SRTVF H DAV+QLL++CF++TLGL + +++ GGVG LLGHGFGSH GG+SPVAPGI
Sbjct: 1029 SRTVFLHNQDAVIQLLRNCFSATLGLNAAPMSNDGGVGALLGHGFGSHFYGGISPVAPGI 1088
Query: 1097 LYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASA 1156
LYLR++R++RD + + EEI+SLL+ SV DIA + L KEK EKLK K G RY Q S A+A
Sbjct: 1089 LYLRMYRALRDTVSVSEEILSLLIHSVEDIAQNRLSKEKLEKLKTVKNGSRYGQSSLATA 1148
Query: 1157 MARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVEREGVEYGGMVAVLRGYA 1216
M +VKLAASL ASLVW++GG G+V L KET+PSWFLS +RE +VA LRG+A
Sbjct: 1149 MTQVKLAASLSASLVWLTGGLGVVHVLIKETIPSWFLSTDKSDREQGP-SDLVAELRGHA 1208
Query: 1217 LAFFSVLCGTFSWGIDSSSSASKRRAK-ILDSFLEFLASALDGKFSIGCDWATWRAYVSG 1276
LA+F VLCG +WG+DS SSASKRR + IL S LEF+ASALDGK S+GC+ ATWR Y+SG
Sbjct: 1209 LAYFVVLCGALTWGVDSRSSASKRRRQAILGSHLEFIASALDGKISVGCETATWRTYISG 1268
Query: 1277 FVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGGVTAMGAAAELII 1331
VSLMV C P W+ E+D +LK LSNGLR+ +++LA+ LL GG+ M AA+ II
Sbjct: 1269 LVSLMVSCLPLWVTEIDTEVLKSLSNGLRKWGKDELAIVLLSLGGLKTMDYAADFII 1305
BLAST of Bhi05G000263 vs. ExPASy Swiss-Prot
Match:
F4IN69 (Mediator of RNA polymerase II transcription subunit 33B OS=Arabidopsis thaliana OX=3702 GN=MED33B PE=1 SV=1)
HSP 1 Score: 1473.0 bits (3812), Expect = 0.0e+00
Identity = 802/1339 (59.90%), Positives = 960/1339 (71.70%), Query Frame = 0
Query: 17 LWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQLLVSHICWDNHVPIMW 76
LW+SV L +SAQ+KN DPL WA+QL TL SAG+SLPS +LAQ LV+HI W+NH P+ W
Sbjct: 10 LWESVTSLIRSAQEKNVDPLHWALQLRLTLASAGISLPSPDLAQFLVTHIFWENHSPLSW 69
Query: 77 KFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLSRHIFSLTSQINGPNYQ 136
K LEKA++ IVPPLLV+ALLS R IP RKL PAAYRLY+E+L RH FS I P Y
Sbjct: 70 KLLEKAISVNIVPPLLVLALLSPRVIPNRKLHPAAYRLYMELLKRHAFSFMPLIRAPGYH 129
Query: 137 RIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSVWL 196
+ M +IDD+LHL++ FGVQ EPG +++ FSIVW+LLDASLD+EGLL L ++S W
Sbjct: 130 KTMNSIDDILHLSETFGVQDQEPGSILLAFVFSIVWELLDASLDEEGLLELTSNKRSKWP 189
Query: 197 IRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQNKKTARILCLALRNMP 256
P HDM+LD ++ KR EN ++L K NT AIE+I +FLQNK T+RIL LA +NM
Sbjct: 190 SSP--HDMDLDGLEN-SVKRNENHDALEKANTEMAIELIQEFLQNKVTSRILHLASQNM- 249
Query: 257 LHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQEGKTC-QLEFRDVMAS 316
E KT + EF +++S
Sbjct: 250 --------------------------------------------ESKTIPRGEFHAIVSS 309
Query: 317 GSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDSSWH 376
GS + SALWLPIDLF ED MDG+Q A SAVE L L+K+L+A N +SWH
Sbjct: 310 GSKLALTSD------SALWLPIDLFFEDIMDGTQAAAASAVENLTGLVKALQAANSTSWH 369
Query: 377 NTFLGLWIAALRLIQR-------------------ERDPSEGPVPRLDTCLCMLLSISTL 436
+ FL LW+AALRL+QR ERDP EGPVPR DT LC+LLS++ L
Sbjct: 370 DAFLALWLAALRLVQRENLCLRYCFFMHMLEILSEERDPIEGPVPRTDTFLCVLLSVTPL 429
Query: 437 AVTIIIEEEEGEVKEDECSASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAV 496
AV IIEEEE + D+ S+S S K+ G CR+GLI SLQ LG+YESLL PP+SV +V
Sbjct: 430 AVANIIEEEESQ-WIDQTSSSPSNQWKEKKGKCRQGLINSLQQLGDYESLLTPPRSVQSV 489
Query: 497 ANQAAAKAVMFISGVAVGNEYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWP 556
ANQAAAKA+MFISG+ N ++ SMS++ C C R L T F
Sbjct: 490 ANQAAAKAIMFISGITNSNGSYENTSMSESASGC-----------CKVRFSLFTLKMF-- 549
Query: 557 GYANARSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGS 616
+ + WS MKGS LTPS+ N+L+ TPASSLAEIEK+YE+A GS
Sbjct: 550 -------VVMGVYLLCNISCWSLVMKGSPLTPSLTNSLITTPASSLAEIEKMYEVATTGS 609
Query: 617 GDEKISAASILCGASLVRGWNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLL 676
DEKI+ ASILCGASL RGW++QEH +FI LLSPP P D SGS S+LI+ APFLNVLL
Sbjct: 610 EDEKIAVASILCGASLFRGWSIQEHVIIFIVTLLSPPAPADLSGSYSHLINSAPFLNVLL 669
Query: 677 VGISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSCSPK-SWILTSGEELTCHAVFSLAFT 736
VGIS +DCV IFSLHG+VPLLAG LMPICEAFGS P +W L +GE ++ HAVFS AFT
Sbjct: 670 VGISPIDCVHIFSLHGVVPLLAGALMPICEAFGSGVPNITWTLPTGELISSHAVFSTAFT 729
Query: 737 LLLRLWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLL 796
LLLRLWRF HPP++ V GD PVG Q +PEYLLLVRN +L FGKSPKDR+ RR SK++
Sbjct: 730 LLLRLWRFDHPPLDYVLGDVPPVGPQPSPEYLLLVRNCRLECFGKSPKDRMARRRFSKVI 789
Query: 797 KFSLEPIFMDSFPKLKGWYRQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQS 856
S++PIFMDSFP+LK WYRQHQEC+ASILS L G+PVH IVD+LL+MMF+K N+ G
Sbjct: 790 DISVDPIFMDSFPRLKQWYRQHQECMASILSELKTGSPVHHIVDSLLSMMFKKANKGGSQ 849
Query: 857 LTSTTSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLK 916
+ +SGSS+ S S +++S +LK+PAWDILEA PFVLDAALTACAHG LSPR+LATGLK
Sbjct: 850 SLTPSSGSSSLSTSGGDDSSDQLKLPAWDILEAAPFVLDAALTACAHGSLSPRELATGLK 909
Query: 917 DLADFLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVD 976
LADFLPA+ T+VSYFS+EVTRG+WKP MNGTDWPSPAA L+ VEQQI+KILAATGVD
Sbjct: 910 ILADFLPATLGTMVSYFSSEVTRGLWKPVSMNGTDWPSPAANLASVEQQIEKILAATGVD 969
Query: 977 VPSLAVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPII 1036
VP L G S A LPLPLAAL+SLTITYKLDKA+ER L LVGPAL+SLAA+C WPC PI+
Sbjct: 970 VPRLPADGISAATLPLPLAALVSLTITYKLDKATERFLVLVGPALDSLAAACPWPCMPIV 1029
Query: 1037 ASLWAQKVKRWNDFLVFSASRTVFHHTSDAVVQLLKSCFTSTLGL-GNSNVNSSGGVGTL 1096
SLW QKVKRW+DFL+FSASRTVFHH DAV+QLL+SCFT TLGL S + S GGVG L
Sbjct: 1030 TSLWTQKVKRWSDFLIFSASRTVFHHNRDAVIQLLRSCFTCTLGLTPTSQLCSYGGVGAL 1089
Query: 1097 LGHGFGSHVLGGMSPVAPGILYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKA 1156
LGHGFGS GG+S APGILY++VHRS+RDV+FL EEI+SLLM SV+ IA LP +A
Sbjct: 1090 LGHGFGSRYSGGISTAAPGILYIKVHRSIRDVMFLTEEILSLLMFSVKSIATRELPAGQA 1149
Query: 1157 EKLKKTKYGMRY--EQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLS 1216
EKLKKTK G RY QVS + AM RVKLAASLGASLVWISGG LVQ+L KETLPSWF+S
Sbjct: 1150 EKLKKTKDGSRYGIGQVSLSLAMRRVKLAASLGASLVWISGGLNLVQALIKETLPSWFIS 1209
Query: 1217 VQSVEREGVEYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSFLEFLAS 1276
V E E GGMV +LRGYALA+F++L F+WG+DSS ASKRR ++L LEF+ S
Sbjct: 1210 VHGEED---ELGGMVPMLRGYALAYFAILSSAFAWGVDSSYPASKRRPRVLWLHLEFMVS 1269
Query: 1277 ALDGKFSIGCDWATWRAYVSGFVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALA 1332
AL+GK S+GCDWATW+AYV+GFVSLMV+C P W+LEVD+ ++KRLS LRQ NE+ LALA
Sbjct: 1270 ALEGKISLGCDWATWQAYVTGFVSLMVQCTPAWVLEVDVEVIKRLSKSLRQWNEQDLALA 1270
BLAST of Bhi05G000263 vs. ExPASy TrEMBL
Match:
A0A1S3BMJ1 (mediator of RNA polymerase II transcription subunit 33B-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491278 PE=4 SV=1)
HSP 1 Score: 2481.4 bits (6430), Expect = 0.0e+00
Identity = 1270/1334 (95.20%), Positives = 1298/1334 (97.30%), Query Frame = 0
Query: 1 MAVSTQPPGQLQEISGLWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ 60
MAVS QPPGQLQ I+GLWD+VLE+TKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ
Sbjct: 1 MAVSAQPPGQLQGIAGLWDTVLEVTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ 60
Query: 61 LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLS 120
LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKL+PAAYRLYLE+LS
Sbjct: 61 LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLQPAAYRLYLELLS 120
Query: 121 RHIFSLTSQINGPNYQRIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLD 180
RH+FS TSQI GPNYQRIMQTIDDVLHLTQIFG+QTCEPGVLMVELFFSIVWQLLDASLD
Sbjct: 121 RHVFSSTSQIYGPNYQRIMQTIDDVLHLTQIFGLQTCEPGVLMVELFFSIVWQLLDASLD 180
Query: 181 DEGLLALPGEEKSVWLIRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQ 240
DEGLLALPGEEKS WLIRPQLHDMELDVHDSFGEK+TENSESLLKVNTAKAIEIIGQFLQ
Sbjct: 181 DEGLLALPGEEKSAWLIRPQLHDMELDVHDSFGEKKTENSESLLKVNTAKAIEIIGQFLQ 240
Query: 241 NKKTARILCLALRNMPLHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQ 300
NKKT RILCLALRNMPL WAGFAQRL+LL ANSVVL N KLITPEVLLHWTSDK+KLLSQ
Sbjct: 241 NKKTERILCLALRNMPLQWAGFAQRLQLLGANSVVLGNAKLITPEVLLHWTSDKNKLLSQ 300
Query: 301 EGKTCQLEFRDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLI 360
+GKT QLEFRDVM+SGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLI
Sbjct: 301 KGKTSQLEFRDVMSSGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLI 360
Query: 361 CLIKSLRAVNDSSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTI 420
CLIKSLRAVND+SWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTI
Sbjct: 361 CLIKSLRAVNDTSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTI 420
Query: 421 IIEEEEGEVKEDECSASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVANQA 480
IIEEEE E KED+CS SKSRDEKQSSGMCRKGLITSLQMLGEYESLL PPQS+IAVANQA
Sbjct: 421 IIEEEEVEPKEDDCSPSKSRDEKQSSGMCRKGLITSLQMLGEYESLLTPPQSIIAVANQA 480
Query: 481 AAKAVMFISGVAVGNEYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGYAN 540
AAKAVMFISGVAVGNEY+DC SM+D PINCSGNMRHLIVEACISRNLLDTSAYFWPGY N
Sbjct: 481 AAKAVMFISGVAVGNEYYDCASMNDAPINCSGNMRHLIVEACISRNLLDTSAYFWPGYVN 540
Query: 541 ARSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEK 600
A SSQVPRSAS+QVVGWSSFMKGS LTPSMVNALVATPASSLAEIEKIYEIAINGSGDEK
Sbjct: 541 ALSSQVPRSASNQVVGWSSFMKGSPLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEK 600
Query: 601 ISAASILCGASLVRGWNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVGIS 660
ISAASILCGASLVRGW LQEHTALFISRLL PPIPTDYSGS+SYLIDYAPFLNVLLVGIS
Sbjct: 601 ISAASILCGASLVRGWYLQEHTALFISRLLLPPIPTDYSGSDSYLIDYAPFLNVLLVGIS 660
Query: 661 SVDCVQIFSLHGMVPLLAGQLMPICEAFGSCSPKSWILTSGEELTCHAVFSLAFTLLLRL 720
SVDCVQIFSLHGMVPLLAGQLMPICEAFGS PKSWILTSGEELTCHAVFSLAFTLLLRL
Sbjct: 661 SVDCVQIFSLHGMVPLLAGQLMPICEAFGSSPPKSWILTSGEELTCHAVFSLAFTLLLRL 720
Query: 721 WRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFSLE 780
WRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSP DRLKARRLSKLLKFSL+
Sbjct: 721 WRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPNDRLKARRLSKLLKFSLQ 780
Query: 781 PIFMDSFPKLKGWYRQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTSTT 840
PIFMDSFPKLKGWYRQHQECIASILSGL+PGAPVHQIVDALLTMMFRKINR GQSLTSTT
Sbjct: 781 PIFMDSFPKLKGWYRQHQECIASILSGLVPGAPVHQIVDALLTMMFRKINRGGQSLTSTT 840
Query: 841 SGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADF 900
SGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADF
Sbjct: 841 SGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADF 900
Query: 901 LPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSLA 960
LPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVP LA
Sbjct: 901 LPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPCLA 960
Query: 961 VGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWA 1020
VGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWA
Sbjct: 961 VGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWA 1020
Query: 1021 QKVKRWNDFLVFSASRTVFHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHGFG 1080
QKVKRWNDFLVFSASRTVFHH SDAVVQLLKSCFTSTLGLGNSN N+SGGVGTLLGHGFG
Sbjct: 1021 QKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNGNNSGGVGTLLGHGFG 1080
Query: 1081 SHVLGGMSPVAPGILYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKT 1140
SHVLGGMSPVAPGILYLRVHRSVRDVLF+VEEIVSLLMLSVRDIAVSGLPKEKAEKLKKT
Sbjct: 1081 SHVLGGMSPVAPGILYLRVHRSVRDVLFVVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKT 1140
Query: 1141 KYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVEREG 1200
KYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSV SVEREG
Sbjct: 1141 KYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVHSVEREG 1200
Query: 1201 VEYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSFLEFLASALDGKFSI 1260
V YGGMVAVLRG+ALAFFSVLCGTFSWGIDSSSSASKRRAKILDS+LEFLASALDGKFSI
Sbjct: 1201 VNYGGMVAVLRGHALAFFSVLCGTFSWGIDSSSSASKRRAKILDSYLEFLASALDGKFSI 1260
Query: 1261 GCDWATWRAYVSGFVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGGVT 1320
GCDWATWRAYVSGFVSL+VRCAPRWLLEVDLN+L RLSNGLRQLNEE+L L LL SGGV
Sbjct: 1261 GCDWATWRAYVSGFVSLIVRCAPRWLLEVDLNVLTRLSNGLRQLNEEELGLELLESGGVN 1320
Query: 1321 AMGAAAELIIEGGF 1335
AMGAAAELIIEGGF
Sbjct: 1321 AMGAAAELIIEGGF 1334
BLAST of Bhi05G000263 vs. ExPASy TrEMBL
Match:
A0A6J1DPP9 (mediator of RNA polymerase II transcription subunit 33B-like OS=Momordica charantia OX=3673 GN=LOC111022676 PE=4 SV=1)
HSP 1 Score: 2431.8 bits (6301), Expect = 0.0e+00
Identity = 1237/1335 (92.66%), Positives = 1286/1335 (96.33%), Query Frame = 0
Query: 1 MAVSTQPPGQLQEISGLWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ 60
M VS QPP QLQ ++GLWDSVLELTKSAQDKNCDPLLWAVQLSS+LNSAGVSLPS+ELAQ
Sbjct: 1 MVVSVQPPSQLQGMAGLWDSVLELTKSAQDKNCDPLLWAVQLSSSLNSAGVSLPSIELAQ 60
Query: 61 LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLS 120
LLVSHICWDNHVPIMWKFLEKAMTA+IVPPLLV+ALLSTRAIPYRKLRPAAYRLYLE+LS
Sbjct: 61 LLVSHICWDNHVPIMWKFLEKAMTAKIVPPLLVVALLSTRAIPYRKLRPAAYRLYLELLS 120
Query: 121 RHIFSLTSQINGPNYQRIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLD 180
RH+FS TSQINGPNYQRIMQTIDDVLHL+QIF +Q CEPG+LMVELFFSIVWQLLDASLD
Sbjct: 121 RHVFSSTSQINGPNYQRIMQTIDDVLHLSQIFSLQACEPGLLMVELFFSIVWQLLDASLD 180
Query: 181 DEGLLALPGEEKSVWLIRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQ 240
DEGLL LP EE+S WLIRPQ HDMELDVHDSF EKRTENSESLLKVNTAKAIEIIGQFLQ
Sbjct: 181 DEGLLVLPAEERSAWLIRPQPHDMELDVHDSFSEKRTENSESLLKVNTAKAIEIIGQFLQ 240
Query: 241 NKKTARILCLALRNMPLHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQ 300
NKKTARIL LA RNMPLHWAGFAQRL+LLAANS VLRNTKLITPEVLLHWTSDK +LLS+
Sbjct: 241 NKKTARILYLAHRNMPLHWAGFAQRLQLLAANSAVLRNTKLITPEVLLHWTSDKHRLLSR 300
Query: 301 EGKTCQLEFRDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLI 360
EGKT Q EFR+VMASGSLFSSAGQSHGVNWS LWLPIDLFLEDAMDGSQVLATSAVERLI
Sbjct: 301 EGKTSQQEFRNVMASGSLFSSAGQSHGVNWSTLWLPIDLFLEDAMDGSQVLATSAVERLI 360
Query: 361 CLIKSLRAVNDSSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTI 420
CLIKSL+AVND+SWHNTF+GLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSI+TLAVTI
Sbjct: 361 CLIKSLQAVNDTSWHNTFMGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVTI 420
Query: 421 IIEEEEGEVK-EDECSASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVANQ 480
IIEE+EGE+K EDECS SK RDEK+ SG CRKGLITSLQMLGEYE LL PPQSV A+ANQ
Sbjct: 421 IIEEDEGELKEEDECSPSKGRDEKKCSGKCRKGLITSLQMLGEYEGLLTPPQSVTAIANQ 480
Query: 481 AAAKAVMFISGVAVGNEYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGYA 540
AAAKAVMFISGVAVGNEY+DCVSM+DTP+NCSGNMRHLIVEACISRNLLDTS YFWPGY
Sbjct: 481 AAAKAVMFISGVAVGNEYYDCVSMNDTPVNCSGNMRHLIVEACISRNLLDTSVYFWPGYV 540
Query: 541 NARSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGDE 600
NARSSQVPRSAS QVVGWSSFMKGSSLT SMV+ALVATPASSLAEIEKIYEIA+NGSGDE
Sbjct: 541 NARSSQVPRSASGQVVGWSSFMKGSSLTLSMVDALVATPASSLAEIEKIYEIAVNGSGDE 600
Query: 601 KISAASILCGASLVRGWNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVGI 660
KISAASILCG SLVRGWNLQEHT LFI+RLLSPPIP DYSGS+SYLIDYAPFLNVLLVGI
Sbjct: 601 KISAASILCGESLVRGWNLQEHTVLFIARLLSPPIPADYSGSDSYLIDYAPFLNVLLVGI 660
Query: 661 SSVDCVQIFSLHGMVPLLAGQLMPICEAFGSCSPKSWILTSGEELTCHAVFSLAFTLLLR 720
SSVDCVQIFSLHGMVPLLAGQLMPICEAFG PKSW+LTSGEELTCHAVFSLAFTLLLR
Sbjct: 661 SSVDCVQIFSLHGMVPLLAGQLMPICEAFGLSPPKSWVLTSGEELTCHAVFSLAFTLLLR 720
Query: 721 LWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFSL 780
LWRFHHPPVENVK DARPVGSQLTPEYLLLVRNSQLASFGKSPKDR K RRLSKLLKFSL
Sbjct: 721 LWRFHHPPVENVKRDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRFKVRRLSKLLKFSL 780
Query: 781 EPIFMDSFPKLKGWYRQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTST 840
EPIFMDSFPKLKGWYRQHQECIASILSGL+PGAPVHQIVDALLTMMFRKINR G SLTST
Sbjct: 781 EPIFMDSFPKLKGWYRQHQECIASILSGLVPGAPVHQIVDALLTMMFRKINRGGHSLTST 840
Query: 841 TSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLAD 900
TSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLAD
Sbjct: 841 TSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLAD 900
Query: 901 FLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSL 960
FLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSL
Sbjct: 901 FLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSL 960
Query: 961 AVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLW 1020
AVGG+SPAMLPLPLAALISLTITYKLDKASERLLALVGPALN+LAASCSWPCTPIIASLW
Sbjct: 961 AVGGNSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNTLAASCSWPCTPIIASLW 1020
Query: 1021 AQKVKRWNDFLVFSASRTVFHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHGF 1080
AQKVKRWNDFLVFSASRTVFHH SDAVVQLLKSCFTSTLGLGNSN+NS+GGVGTLLGHGF
Sbjct: 1021 AQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNLNSNGGVGTLLGHGF 1080
Query: 1081 GSHVLGGMSPVAPGILYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLKK 1140
GSHVLGGMSPVAPGILYLRVHRSVRD LF+VEEIVSLLMLSVRDIAVSGLP+EKAEKLKK
Sbjct: 1081 GSHVLGGMSPVAPGILYLRVHRSVRDALFMVEEIVSLLMLSVRDIAVSGLPREKAEKLKK 1140
Query: 1141 TKYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVERE 1200
TK+GMRYEQVSFASAM+RVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSV S+ERE
Sbjct: 1141 TKHGMRYEQVSFASAMSRVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVHSLERE 1200
Query: 1201 GVEYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSFLEFLASALDGKFS 1260
GVEYGGMVAVL GYALAFFSVLCGTFSWGIDS SSASKRRAKILDS LEFLASALDGKFS
Sbjct: 1201 GVEYGGMVAVLGGYALAFFSVLCGTFSWGIDSVSSASKRRAKILDSHLEFLASALDGKFS 1260
Query: 1261 IGCDWATWRAYVSGFVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGGV 1320
IGCDWATWRAYVSGFVSLMVRCAP+W++EVD+N+LKRLSNGLRQL+EE+LALALL SGGV
Sbjct: 1261 IGCDWATWRAYVSGFVSLMVRCAPKWVVEVDVNILKRLSNGLRQLSEEELALALLESGGV 1320
Query: 1321 TAMGAAAELIIEGGF 1335
TAMGAAAELIIEGGF
Sbjct: 1321 TAMGAAAELIIEGGF 1335
BLAST of Bhi05G000263 vs. ExPASy TrEMBL
Match:
A0A6J1HFH4 (mediator of RNA polymerase II transcription subunit 33B-like OS=Cucurbita moschata OX=3662 GN=LOC111463830 PE=4 SV=1)
HSP 1 Score: 2413.3 bits (6253), Expect = 0.0e+00
Identity = 1235/1336 (92.44%), Positives = 1278/1336 (95.66%), Query Frame = 0
Query: 1 MAVSTQPPGQLQEISGLWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ 60
MAVS QPPGQLQ I+G+WD+VLELTKSAQ+KN DPLLWAVQLSS+LNSA VSLPSVELA
Sbjct: 1 MAVSAQPPGQLQGIAGVWDTVLELTKSAQEKNSDPLLWAVQLSSSLNSASVSLPSVELAH 60
Query: 61 LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLS 120
LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLE+LS
Sbjct: 61 LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLS 120
Query: 121 RHIFSLTSQINGPNYQRIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLD 180
RH+FS T ++NGPNY RIMQTIDDVLHL+QIFG+QTCEPG+LMVELFFSIVW LLDASLD
Sbjct: 121 RHVFSSTLEVNGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLMVELFFSIVWHLLDASLD 180
Query: 181 DEGLLALPGEEKSVWLIRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQ 240
DEGLL LP EE+SVWLIRPQ HDMELDVHDSFGEK+TENSE+LLKVNTAKAIEIIGQFLQ
Sbjct: 181 DEGLLELPAEERSVWLIRPQPHDMELDVHDSFGEKKTENSENLLKVNTAKAIEIIGQFLQ 240
Query: 241 NKKTARILCLALRNMPLHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQ 300
NKKTARILCLA +NMPLHWAGFAQRL+LLAANSVVLRNTKLITPEVLL WTSDK + LSQ
Sbjct: 241 NKKTARILCLAHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLQWTSDKHRFLSQ 300
Query: 301 EGKT-CQLEFRDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
EGKT QLEF DVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL
Sbjct: 301 EGKTKSQLEFHDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
Query: 361 ICLIKSLRAVNDSSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVT 420
ICLIKSLRAVND+SWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSI+TLAVT
Sbjct: 361 ICLIKSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVT 420
Query: 421 IIIEEEEGEVK-EDECSASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVAN 480
IIIEEEEGE+K EDECS SKSRDEKQSSG R+GLITSLQMLGEYESLL PPQSVI VAN
Sbjct: 421 IIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITSLQMLGEYESLLTPPQSVIVVAN 480
Query: 481 QAAAKAVMFISGVAVGNEYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGY 540
QAAAKAVMFISGVAVGNEY+DCVSM+DTPINCSGNMRHLIVEACISRNLLDTSAYFWPGY
Sbjct: 481 QAAAKAVMFISGVAVGNEYYDCVSMNDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGY 540
Query: 541 ANARSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGD 600
N RSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGD
Sbjct: 541 VNTRSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGD 600
Query: 601 EKISAASILCGASLVRGWNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVG 660
EKISAASILCGASLVRGWNLQEHT LFISRLLSPPIP DY GS+SYLIDYAPFLNVLLVG
Sbjct: 601 EKISAASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNVLLVG 660
Query: 661 ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSCSPKSWILTSGEELTCHAVFSLAFTLLL 720
ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGS +PKSWILTSGEELTCHAVFSLAFTLLL
Sbjct: 661 ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSTPKSWILTSGEELTCHAVFSLAFTLLL 720
Query: 721 RLWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFS 780
RLWRFHHPP+ENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLK RRLSKLLKFS
Sbjct: 721 RLWRFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFS 780
Query: 781 LEPIFMDSFPKLKGWYRQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTS 840
LEP FMDSFPKLKGWYRQHQECIASI GL+PGAPVHQ VDALLTMMF+KINR GQSLTS
Sbjct: 781 LEPTFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTS 840
Query: 841 TTSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA 900
TTS SSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA
Sbjct: 841 TTSASSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA 900
Query: 901 DFLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPS 960
DFLPASFATIV YFSAEVTRGIWKPAFMNGTDWPSPAATLS+VEQQIKKILAATGVDVPS
Sbjct: 901 DFLPASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPS 960
Query: 961 LAVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASL 1020
LA+GGS PAMLPLPLAALISLTITYKLDKASERLLALVGPALNSL A CSWPCTPIIASL
Sbjct: 961 LALGGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASL 1020
Query: 1021 WAQKVKRWNDFLVFSASRTVFHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHG 1080
WAQKVKRWNDFLVFSASRTVFHH SDAVVQLLKSCFTSTLGLGNSNVNS GGVG LLGHG
Sbjct: 1021 WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHG 1080
Query: 1081 FGSHVLGGMSPVAPGILYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLK 1140
FGSHVLGGMSP APGILYLRVHR VRD LFLVEEIVSLLMLSV+DIAV+GLPKEKAEKLK
Sbjct: 1081 FGSHVLGGMSPAAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGLPKEKAEKLK 1140
Query: 1141 KTKYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVER 1200
K+K+GMR EQVSFASAMARVKLAASLGASLVWISGGSGLVQSL+KETLPSWFLSV SV+R
Sbjct: 1141 KSKHGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDR 1200
Query: 1201 EGVEYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSFLEFLASALDGKF 1260
EGVEYGGMV VLRGYALAFFSVLCGTFSWGIDS SSASKRRAK+LDS LEFLASALDGKF
Sbjct: 1201 EGVEYGGMVPVLRGYALAFFSVLCGTFSWGIDSISSASKRRAKLLDSHLEFLASALDGKF 1260
Query: 1261 SIGCDWATWRAYVSGFVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGG 1320
SIGCDWATWRAYVSGFVSL+VRCAP+WLLEVDL +LKRL GLRQLNEE+LALALL SGG
Sbjct: 1261 SIGCDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLKRLGKGLRQLNEEELALALLESGG 1320
Query: 1321 VTAMGAAAELIIEGGF 1335
+TAMGAAAELII GGF
Sbjct: 1321 LTAMGAAAELIIGGGF 1336
BLAST of Bhi05G000263 vs. ExPASy TrEMBL
Match:
A0A6J1HUY1 (mediator of RNA polymerase II transcription subunit 33B-like OS=Cucurbita maxima OX=3661 GN=LOC111466930 PE=4 SV=1)
HSP 1 Score: 2406.7 bits (6236), Expect = 0.0e+00
Identity = 1229/1336 (91.99%), Positives = 1279/1336 (95.73%), Query Frame = 0
Query: 1 MAVSTQPPGQLQEISGLWDSVLELTKSAQDKNCDPLLWAVQLSSTLNSAGVSLPSVELAQ 60
MAVS QPPGQLQ I+G+WD+VLELTKSAQ+KN DPLLWAV LSS+LNSA VSLPSVELAQ
Sbjct: 1 MAVSAQPPGQLQGIAGVWDTVLELTKSAQEKNSDPLLWAVHLSSSLNSASVSLPSVELAQ 60
Query: 61 LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLEVLS 120
LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLE+LS
Sbjct: 61 LLVSHICWDNHVPIMWKFLEKAMTARIVPPLLVIALLSTRAIPYRKLRPAAYRLYLELLS 120
Query: 121 RHIFSLTSQINGPNYQRIMQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLD 180
RH+FS T ++NGPNY RIMQTIDDVLHL+QIFG+QTCEPG+L+VELFFSIVW LLDASLD
Sbjct: 121 RHVFSSTLEVNGPNYPRIMQTIDDVLHLSQIFGLQTCEPGLLVVELFFSIVWHLLDASLD 180
Query: 181 DEGLLALPGEEKSVWLIRPQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQ 240
DEGLL LP EE+SVWLIRPQ H+MELDVH+SFGEK+TENSE+LLKVNTAKAIEIIGQFLQ
Sbjct: 181 DEGLLELPAEERSVWLIRPQPHNMELDVHNSFGEKKTENSENLLKVNTAKAIEIIGQFLQ 240
Query: 241 NKKTARILCLALRNMPLHWAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQ 300
NKKTARILCLA +NMPLHWAGFAQRL+LLAANSVVLRNTKLITPEVLLHWTSDK + LSQ
Sbjct: 241 NKKTARILCLAHQNMPLHWAGFAQRLQLLAANSVVLRNTKLITPEVLLHWTSDKHRFLSQ 300
Query: 301 EGKTC-QLEFRDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
EGKT QLEF DVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL
Sbjct: 301 EGKTASQLEFHDVMASGSLFSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERL 360
Query: 361 ICLIKSLRAVNDSSWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVT 420
ICLIKSLRAVND+SWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSI+TLAVT
Sbjct: 361 ICLIKSLRAVNDASWHNTFLGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSITTLAVT 420
Query: 421 IIIEEEEGEVK-EDECSASKSRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVAN 480
IIIEEEEGE+K EDECS SKSRDEKQSSG R+GLIT LQMLGEYESLL PPQSVI VAN
Sbjct: 421 IIIEEEEGELKEEDECSPSKSRDEKQSSGKRRQGLITCLQMLGEYESLLTPPQSVIEVAN 480
Query: 481 QAAAKAVMFISGVAVGNEYHDCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGY 540
QAAAKAVMFISGVAVGNE +DCVSM+DTPINCSGNMRHLIVEACISRNLLDTSAYFWPGY
Sbjct: 481 QAAAKAVMFISGVAVGNECYDCVSMNDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGY 540
Query: 541 ANARSSQVPRSASSQVVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGD 600
N RSSQVPRSASSQ+VGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGD
Sbjct: 541 VNTRSSQVPRSASSQIVGWSSFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGD 600
Query: 601 EKISAASILCGASLVRGWNLQEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVG 660
EKISAASILCGASLVRGWNLQEHT LFISRLLSPPIP DY GS+SYLIDYAPFLN+LLVG
Sbjct: 601 EKISAASILCGASLVRGWNLQEHTVLFISRLLSPPIPADYPGSDSYLIDYAPFLNILLVG 660
Query: 661 ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSCSPKSWILTSGEELTCHAVFSLAFTLLL 720
ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGS +PKSWIL SGEELTCHAVFSLAFTLLL
Sbjct: 661 ISSVDCVQIFSLHGMVPLLAGQLMPICEAFGSSTPKSWILASGEELTCHAVFSLAFTLLL 720
Query: 721 RLWRFHHPPVENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFS 780
RLWRFHHPP+ENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLK RRLSKLLKFS
Sbjct: 721 RLWRFHHPPIENVKGDARPVGSQLTPEYLLLVRNSQLASFGKSPKDRLKVRRLSKLLKFS 780
Query: 781 LEPIFMDSFPKLKGWYRQHQECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTS 840
LEP FMDSFPKLKGWYRQHQECIASI GL+PGAPVHQ VDALLTMMF+KINR GQSLTS
Sbjct: 781 LEPTFMDSFPKLKGWYRQHQECIASIPPGLVPGAPVHQTVDALLTMMFKKINRGGQSLTS 840
Query: 841 TTSGSSNSSGSANEEASIKLKVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA 900
TTSGSSNSSGSANEEASIKLKVP+WDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA
Sbjct: 841 TTSGSSNSSGSANEEASIKLKVPSWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLA 900
Query: 901 DFLPASFATIVSYFSAEVTRGIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPS 960
DFLPASFATIV YFSAEVTRGIWKPAFMNGTDWPSPAATLS+VEQQIKKILAATGVDVPS
Sbjct: 901 DFLPASFATIVCYFSAEVTRGIWKPAFMNGTDWPSPAATLSVVEQQIKKILAATGVDVPS 960
Query: 961 LAVGGSSPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASL 1020
LA+GGS PAMLPLPLAALISLTITYKLDKASERLLALVGPALNSL A CSWPCTPIIASL
Sbjct: 961 LALGGSFPAMLPLPLAALISLTITYKLDKASERLLALVGPALNSLVAGCSWPCTPIIASL 1020
Query: 1021 WAQKVKRWNDFLVFSASRTVFHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHG 1080
WAQKVKRWNDFLVFSASRTVFHH SDAVVQLLKSCFTSTLGLGNSNVNS GGVG LLGHG
Sbjct: 1021 WAQKVKRWNDFLVFSASRTVFHHNSDAVVQLLKSCFTSTLGLGNSNVNSGGGVGALLGHG 1080
Query: 1081 FGSHVLGGMSPVAPGILYLRVHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLK 1140
FGSHVLGGMSPVAPGILYLRVHR VRD LFLVEEIVSLLMLSV+DIAV+G+PKEKAEKLK
Sbjct: 1081 FGSHVLGGMSPVAPGILYLRVHRCVRDALFLVEEIVSLLMLSVKDIAVTGVPKEKAEKLK 1140
Query: 1141 KTKYGMRYEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVER 1200
K+K+GMR EQVSFASAMARVKLAASLGASLVWISGGSGLVQSL+KETLPSWFLSV SV+R
Sbjct: 1141 KSKHGMRCEQVSFASAMARVKLAASLGASLVWISGGSGLVQSLYKETLPSWFLSVHSVDR 1200
Query: 1201 EGVEYGGMVAVLRGYALAFFSVLCGTFSWGIDSSSSASKRRAKILDSFLEFLASALDGKF 1260
EGVEYGGMV VLRGYALAFFSVLCG FSWGIDS+SSASKRRAKILDS LEFLASALDGKF
Sbjct: 1201 EGVEYGGMVPVLRGYALAFFSVLCGMFSWGIDSTSSASKRRAKILDSHLEFLASALDGKF 1260
Query: 1261 SIGCDWATWRAYVSGFVSLMVRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGG 1320
SIGCDWATWRAYVSGFVSL+VRCAP+WLLEVDL +LKRL GLRQLNEE+LALALL SGG
Sbjct: 1261 SIGCDWATWRAYVSGFVSLLVRCAPKWLLEVDLKVLKRLGKGLRQLNEEELALALLESGG 1320
Query: 1321 VTAMGAAAELIIEGGF 1335
+TAMGAAAELIIEGGF
Sbjct: 1321 LTAMGAAAELIIEGGF 1336
BLAST of Bhi05G000263 vs. ExPASy TrEMBL
Match:
A0A1S4DXP9 (mediator of RNA polymerase II transcription subunit 33A-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491278 PE=4 SV=1)
HSP 1 Score: 2227.6 bits (5771), Expect = 0.0e+00
Identity = 1142/1196 (95.48%), Positives = 1164/1196 (97.32%), Query Frame = 0
Query: 139 MQTIDDVLHLTQIFGVQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSVWLIR 198
MQTIDDVLHLTQIFG+QTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKS WLIR
Sbjct: 1 MQTIDDVLHLTQIFGLQTCEPGVLMVELFFSIVWQLLDASLDDEGLLALPGEEKSAWLIR 60
Query: 199 PQLHDMELDVHDSFGEKRTENSESLLKVNTAKAIEIIGQFLQNKKTARILCLALRNMPLH 258
PQLHDMELDVHDSFGEK+TENSESLLKVNTAKAIEIIGQFLQNKKT RILCLALRNMPL
Sbjct: 61 PQLHDMELDVHDSFGEKKTENSESLLKVNTAKAIEIIGQFLQNKKTERILCLALRNMPLQ 120
Query: 259 WAGFAQRLKLLAANSVVLRNTKLITPEVLLHWTSDKSKLLSQEGKTCQLEFRDVMASGSL 318
WAGFAQRL+LL ANSVVL N KLITPEVLLHWTSDK+KLLSQ+GKT QLEFRDVM+SGSL
Sbjct: 121 WAGFAQRLQLLGANSVVLGNAKLITPEVLLHWTSDKNKLLSQKGKTSQLEFRDVMSSGSL 180
Query: 319 FSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDSSWHNTF 378
FSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVND+SWHNTF
Sbjct: 181 FSSAGQSHGVNWSALWLPIDLFLEDAMDGSQVLATSAVERLICLIKSLRAVNDTSWHNTF 240
Query: 379 LGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEEGEVKEDECSASK 438
LGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEE E KED+CS SK
Sbjct: 241 LGLWIAALRLIQRERDPSEGPVPRLDTCLCMLLSISTLAVTIIIEEEEVEPKEDDCSPSK 300
Query: 439 SRDEKQSSGMCRKGLITSLQMLGEYESLLIPPQSVIAVANQAAAKAVMFISGVAVGNEYH 498
SRDEKQSSGMCRKGLITSLQMLGEYESLL PPQS+IAVANQAAAKAVMFISGVAVGNEY+
Sbjct: 301 SRDEKQSSGMCRKGLITSLQMLGEYESLLTPPQSIIAVANQAAAKAVMFISGVAVGNEYY 360
Query: 499 DCVSMSDTPINCSGNMRHLIVEACISRNLLDTSAYFWPGYANARSSQVPRSASSQVVGWS 558
DC SM+D PINCSGNMRHLIVEACISRNLLDTSAYFWPGY NA SSQVPRSAS+QVVGWS
Sbjct: 361 DCASMNDAPINCSGNMRHLIVEACISRNLLDTSAYFWPGYVNALSSQVPRSASNQVVGWS 420
Query: 559 SFMKGSSLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGWNL 618
SFMKGS LTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGW L
Sbjct: 421 SFMKGSPLTPSMVNALVATPASSLAEIEKIYEIAINGSGDEKISAASILCGASLVRGWYL 480
Query: 619 QEHTALFISRLLSPPIPTDYSGSESYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPLLA 678
QEHTALFISRLL PPIPTDYSGS+SYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPLLA
Sbjct: 481 QEHTALFISRLLLPPIPTDYSGSDSYLIDYAPFLNVLLVGISSVDCVQIFSLHGMVPLLA 540
Query: 679 GQLMPICEAFGSCSPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGDARPV 738
GQLMPICEAFGS PKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGDARPV
Sbjct: 541 GQLMPICEAFGSSPPKSWILTSGEELTCHAVFSLAFTLLLRLWRFHHPPVENVKGDARPV 600
Query: 739 GSQLTPEYLLLVRNSQLASFGKSPKDRLKARRLSKLLKFSLEPIFMDSFPKLKGWYRQHQ 798
GSQLTPEYLLLVRNSQLASFGKSP DRLKARRLSKLLKFSL+PIFMDSFPKLKGWYRQHQ
Sbjct: 601 GSQLTPEYLLLVRNSQLASFGKSPNDRLKARRLSKLLKFSLQPIFMDSFPKLKGWYRQHQ 660
Query: 799 ECIASILSGLIPGAPVHQIVDALLTMMFRKINRAGQSLTSTTSGSSNSSGSANEEASIKL 858
ECIASILSGL+PGAPVHQIVDALLTMMFRKINR GQSLTSTTSGSSNSSGSANEEASIKL
Sbjct: 661 ECIASILSGLVPGAPVHQIVDALLTMMFRKINRGGQSLTSTTSGSSNSSGSANEEASIKL 720
Query: 859 KVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSAEVTR 918
KVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSAEVTR
Sbjct: 721 KVPAWDILEATPFVLDAALTACAHGRLSPRDLATGLKDLADFLPASFATIVSYFSAEVTR 780
Query: 919 GIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPSLAVGGSSPAMLPLPLAALIS 978
GIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVP LAVGGSSPAMLPLPLAALIS
Sbjct: 781 GIWKPAFMNGTDWPSPAATLSIVEQQIKKILAATGVDVPCLAVGGSSPAMLPLPLAALIS 840
Query: 979 LTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSASRTV 1038
LTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSASRTV
Sbjct: 841 LTITYKLDKASERLLALVGPALNSLAASCSWPCTPIIASLWAQKVKRWNDFLVFSASRTV 900
Query: 1039 FHHTSDAVVQLLKSCFTSTLGLGNSNVNSSGGVGTLLGHGFGSHVLGGMSPVAPGILYLR 1098
FHH SDAVVQLLKSCFTSTLGLGNSN N+SGGVGTLLGHGFGSHVLGGMSPVAPGILYLR
Sbjct: 901 FHHNSDAVVQLLKSCFTSTLGLGNSNGNNSGGVGTLLGHGFGSHVLGGMSPVAPGILYLR 960
Query: 1099 VHRSVRDVLFLVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASAMARV 1158
VHRSVRDVLF+VEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASAMARV
Sbjct: 961 VHRSVRDVLFVVEEIVSLLMLSVRDIAVSGLPKEKAEKLKKTKYGMRYEQVSFASAMARV 1020
Query: 1159 KLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVQSVEREGVEYGGMVAVLRGYALAFF 1218
KLAASLGASLVWISGGSGLVQSLFKETLPSWFLSV SVEREGV YGGMVAVLRG+ALAFF
Sbjct: 1021 KLAASLGASLVWISGGSGLVQSLFKETLPSWFLSVHSVEREGVNYGGMVAVLRGHALAFF 1080
Query: 1219 SVLCGTFSWGIDSSSSASKRRAKILDSFLEFLASALDGKFSIGCDWATWRAYVSGFVSLM 1278
SVLCGTFSWGIDSSSSASKRRAKILDS+LEFLASALDGKFSIGCDWATWRAYVSGFVSL+
Sbjct: 1081 SVLCGTFSWGIDSSSSASKRRAKILDSYLEFLASALDGKFSIGCDWATWRAYVSGFVSLI 1140
Query: 1279 VRCAPRWLLEVDLNLLKRLSNGLRQLNEEQLALALLGSGGVTAMGAAAELIIEGGF 1335
VRCAPRWLLEVDLN+L RLSNGLRQLNEE+L L LL SGGV AMGAAAELIIEGGF
Sbjct: 1141 VRCAPRWLLEVDLNVLTRLSNGLRQLNEEELGLELLESGGVNAMGAAAELIIEGGF 1196
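The percentages in the HSP header above are match counts divided by the alignment length (for example, 1142 identical positions out of 1196 aligned positions). As a minimal sketch, assuming the header keeps the `Label = matches/length (percent%)` layout shown here, the figures can be recomputed as follows; the function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool.

```python
import re

def parse_hsp_stats(line):
    """Recompute percentages for every 'Label = matches/length' field in an HSP header line.

    Returns a dict mapping each label to (matches, length, percent).
    Fields without a 'matches/length' fraction (e.g. 'Query Frame = 0') are ignored.
    """
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats

# Header line taken from the alignment above (label spelled "Positives" here).
line = "Identity = 1142/1196 (95.48%), Positives = 1164/1196 (97.32%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["Identity"])   # (1142, 1196, 95.48)
print(stats["Positives"])  # (1164, 1196, 97.32)
```

The recomputed values agree with the 95.48% identity and 97.32% positives reported for this hit.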
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9LUG9 | 0.0e+00 | 58.31 | Mediator of RNA polymerase II transcription subunit 33A OS=Arabidopsis thaliana ...
F4IN69 | 0.0e+00 | 59.90 | Mediator of RNA polymerase II transcription subunit 33B OS=Arabidopsis thaliana ...

Match Name | E-value | Identity (%) | Description
A0A1S3BMJ1 | 0.0e+00 | 95.20 | mediator of RNA polymerase II transcription subunit 33B-like isoform X1 OS=Cucum...
A0A6J1DPP9 | 0.0e+00 | 92.66 | mediator of RNA polymerase II transcription subunit 33B-like OS=Momordica charan...
A0A6J1HFH4 | 0.0e+00 | 92.44 | mediator of RNA polymerase II transcription subunit 33B-like OS=Cucurbita moscha...
A0A6J1HUY1 | 0.0e+00 | 91.99 | mediator of RNA polymerase II transcription subunit 33B-like OS=Cucurbita maxima...
A0A1S4DXP9 | 0.0e+00 | 95.48 | mediator of RNA polymerase II transcription subunit 33A-like isoform X2 OS=Cucum...