Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGAAAAACGTCGAAAAATCAGCGCAAGTTAGAGCAAACTAAAATGCGCCATTTACCATTTTTTCTTTCCCTCCTTTCATTTGCTTTTTGTCTCCAATTGCTTTCGTTCATCACTGCCTTTTTTCCTCACCGCTCATTTGTTTCTGTGATTCTCTTCCGCCGCGCTTTTTCTTTGATCTGCCGTGAAAGCCTAGGGCTCAACTTGATCGTCTCGTGACTGGGAAAACTGAAATTCTTCTTCACTTTGATGCATTCATTTCTCAGATTGATTCGGTAATCAACAACAGTTTCGTCTTTGTTTGTGCACGGATGTGCTCCACCGGCGTTCCATGGCGTCTCCGGTGGCTACTCATCTCAACATCTCCACCTCTTCCTCTCTCTGCCGTTCCAAGGTTCTCTTCTTTTCTAATCTTATTGCGCATCGTTTTCTAGTTACAGTTTGTTTCACTGTTTTTTTCAAGTCCGGGAAATTGGATGTTATTTTCGGCTGGAGTTTATTGGCTTGGATTTTTCTCGAGTTTTTCGGATCTGTTGGTTTTTTCGACATTGAGAAGTCAGTAGAGATGTTATTTGGAGAGAGATTAACTTCTTGGACCGATAAAGATTGAAGTATATTATAGAACTCATTTTTTAGTTCTATCTTCTTATCATGTTATTACATTTTGATTTCGGTAGTAGATTGTTGTCGAATGATTTGGATCAATATTAACTTCTTGGACCGATAAAGATTTGTAGTATAATTTAGAACTCATTTTTTACTTATATCTTCACATTTTGATTTCGGTAGTAGATTGTTGTCGAATGATTTGGATCGATATTAACTTCTTGGACCGATAAAGATTTGAAGTATATTTTAGAACTCATTTTTTACTTGTATCTTCACACTTTGATTTCGGCAGTAGATTGTTTCCGAATGACTTGGATCGATATTAACTTCTTGGACCGATAAAGAATTGAAGTATATTTTAGAACTCATTTTAGGTAAATATTTTGTCGTTGTTGTTTTATTACATTTTGGTACTTTATTGCTGTTGAATGATTGATGGGCGCATTTCTCTCCCTCTTACTTTTACAATTTACACTGTTGTAGGGATCGTTTATGTACTTACCAATCCTAGTTTGATTTATTATAGAGCCTAATGTGCTTTCTTTTTTTTTTTTTTGCCTATCGTTCCTCTTCTCTTAGCTGTAGGAATCTTTTTCTTTCATCCTGCTCATATCTCATTTGATCACTTTATATCTCTTGGATTTAATTTGAACTAGATAGTAATGGTTTTCTCTATTATCTCACTGGAATTTTCTGAAAGTATTGTTCACTGTTGCTTTTGGTTTCCTGTGTATAGAGGTTGTCTCTGAGGCTCAGTAGGAATCGAACAAAATTTATTACTAGCACAACTCAGAAGAGGAGATCTCATTCCCTGAAGGTTGTCCAGTCAGTCCTGAATAACTGCAAGTCAAATTTAAATGACAATGGAGCAAGTGAAGAAGCAAAACTGCTTCTTGAGAGATTGTATGCCCAGACACAGAGATTGGAAGAACATGTAAACAAAGATCCTCACTCCCCCCAAGATGTTTGGCTAGGGCTCAGTCTTGAAAATCTCGAGTCTGATCTCCAGGCTGCATTGGCAGTATTGAAAAAGAAGGAAGAAGATCTACAAGATGCAGAAAGAACGATTCTATTAGAACGAAGCCAATTAAATAATGCGAGGGAGAAGTTGGAGAAGCAGGAGGAAGAATTAACTGCTGCTTATCATAAACAGCAAAAATTAGAAGATGAGCTTAAGCAGGCAAATCTAGACTTAGCTTCTCAAGCTAGACAGATTGATGAATTAAAACTTCAAATTAGGGAGAAAGACGGGGGGATTGCTGCAGTTGAATCTGCCCTTACTTTGAAAGAAGACGAGTTGAAGAGAATGAGAGATGATTTGGCTAAGAAGAGTGAGGAAGCTGTTAAAACAGATTTTGAACTTAAATCTAAGTCCCAGCTTCTGACTGAAGCCATCGAAGTAGTGAAAAGACAAGAAGTTGAGCTGCAAATGCTTAAAAAGGCTGTGCTAGAGAAAGAAAAAGAACTGGAACTTTCTGTAAAGCTGCAGAAACTTGAAGAGGAAAGACTGGAAGTTGTAGAGAAAAATTTGGAGAACAGGACCATGGAATGGTTGTTAGCACAGGAAGAATTGAAAAAAATGAGAAAGGAAGGATCTAAGAAAGCAGTAGAGATGAACAAAACAGTGAATGACTTCAACCGAGTTAAGAAGCTTCTTGCCGATGTAAAAAGTGAGTTAGTTTCCTCTCAAAAGTCCCTTGTGTCCTCCAGAAAGAAAATAGAAGAACAAGAAGATATTCTTGGGAGGCAAATGGCAGAACTTGAAGAACAAAAGAAGGGTATCAATGCATATATGTCAAGTTTAAAAGATGCTCAAATTGAAGTAGAGAGTGAAAGAGTGAAGCTTAGAGTTGCAGAGGCTCATAACAAGGAGCTTGAATGCGACTTGTTGATGGAAAAGGAGCTCACTGATGAGTTACAGCAACAACTGAAGAAAGAGAAATCCTATCTCCAGCAGGAAACTGAGGAGAAATCACTTCTACAGAAGGAGCTAGAGCATAAACATATCGAGTTCGAGAAAACTCACACTCTCCTTCAAGATAAAGCGTCGGAGTTGGTCGAAGCCAAGTTAGAAATCCAGCATTTGAAATCCGAACAGGTTTCTCTTCAACTTCTTTTGGAAGAGAAAGACTTGGAGATACTCGATGCACAAAAGAAAATTGAGGAATTAAATCAGGAAATTATTGAGCTACAAACGCTTATGAGTATTAAAGAAGCTCAGCTCAATCAGACAACTGCTATGCTGAAAGAGAAAGATGAATGCGTTCAGATAATGCAAAATGAACTAAACGATACAAAGCTGAAAATTTCCGAAGCCGAAGCAGCAGTAGAACACATTGTGGATCTCACAAATAAATTGGTGATTTCCATCAAAGGCGGCGACGACAATGATGTTTTGAATTTGAACAACGATCTTAGCCTCAATCTACAGCAGCAGTTATTCATGAAACCTACTGATAACATGAAACTACAGAAGAAACAGCTCGAAACGGAGTTAGAGCTCACCAAGGAAAGCTTGAGACAAAAAGAAATGGAAATTCTAGCTGCAGAAAGAGCACTGACCGTTAAAGACGAGGAACTAAAAACGGTTCAGGAAAGATTAGATGCAAAGGAGAAAGAATTCGAGAAGATGAAGGAAGAGATGGATGAAGAAGCTAAAGATACAAGGAAGCTATACACATTGGCACAAGACAATGTTGGAGGAGAGGACAACAACATTGAGGACTTTGCAATTGGAAGGCTTCAAGTTGAAGCTGCTCAGTTAGAGGTAGAAGCAGCTACTAGCGCTCTTCAGAAACTCACAGACATGAGCCGAGAGCTTCTGAATAAAGCTAGCCATAGCCTCGAGGTTGATATCGGTTCAAGAAGCATTCACATTCAGCAGCACAATGACGACGAAGACGATGTCGATGCCGGAACAGGCGGTTGGGTTGATAATAACAACACAAGATTTAATGAAGTGAAGGTGGAAGTTTCCCGTCTTTCATCTCTAACTGAACAGCTCTTGAAGGAAGCTGGTATATTTGTTGATGCGAATTAGTGGAGATTTATTGGGATGAATAAAGGGTAGCTTGTGTAAATATTGGTTGCTTTTGGCATTATTTTTCCTTCCCAAGGCACTTCAAAGTTAACAAATTCGATTTTCTCTCTTATTTACTACGGCGACGAATCCGTAGCAATCAATAGAAAAGGTAAATCCTATATGATAGCACTCCATTTTTTCGATTCGGCAGTTTTTCATTCCGTCAAAAATCGATTTTACGGTATAGACACTTATGACCTCTCCCTCTATGCCCTAATTTGCCGCCACAGTCAAAAGGTCGGATGTTTGAGAGTAATTTTGAAATGGTTAAAATCATTTTTATCACTTCCAAAATTATGTTAAACATATTTTTTTAATCATTCAAAATCAATTTTAATGATACGAAAATTACATTTAAAAGTTTAAAATTAAATACTAATTTTGAGCGATTAAAAATGTGTTTTGGAGTGATTTTGAACTTGGCAAAAATGATTTCAATCATTTCAAAATCATTACTAAATATACATTAAAATTGTTGCATAAAGTCATTTGATTGCTATGCTTGGTACGTTGCTTAGTCAAAATAATCAAAATGGTATACCTTCAAGTTCAATTGCTCTTCACTATATGTTGGCTGACATCTCATGGATTTAAATATGAGTAAACATCAAGATTTTCCTCTTCCATCGGATTTGAACTTATGATCTTTAGTAGGAAGTAGAAATGTCTTAATATTGATTTAAGTTTGTACAGGCAACATATAATTTGCTTGAAAGGTAATTCTAAAGTATTTTAGTCAAATTACAAAGATAGTCTTTTTTTAACTTTTAAGCTTTTTGATATCATGATAGTTCAACAATAATTGATATTATACTCGTTTCTTTAAAGTTGAAGGTTCAAATCTTAATTTTTTTCATATTTGTTGTACTCAAAAAGAAAAGAAAAACTTACGAAGTTATCGCTAGCATGCAAATGTATTGAAATATAAAAGATAACGAAGAGTTCTAATATACCCTTCCCTTTTATGTATTTGAAATGCCAGTGGAGCCTAAATAACAAAACAATTGCCTACCTTTTTCTTTACACTGTCTATGTATCCATATACCTCAAACCACTTTGCAAGCTAACTACTTATATTCCTTAACAACGGCCTGACTTTCAATTCTATGTTTAAAAAAATCCACAAAAATGAAGAAAGCAATTGAGTTGGAGAATGAAGAGAGATACATAAGAAACTTACGAAACCTATTAAACATTTTCAAATCTTAGAAATCAACAAGACACAAAATTTAAGGTTCTAAAAACCATTAAATTGTGGTACTTACTGCTTGCTATTAAAAAAAAAAATCCTAGTTTTAAGATTTGAAATTTAATAACTTGCTATATGCTAGTATACAGTTCCAATGTCCAAGGCCTGAACCATCCACCAAACAGCATAGAAAATCTACAACCACTTTTAAGACATGAGCCTTATATAAACACGACCATATTTATAAAACACATTATTCTTCTATACATACACATTCTTAAGCAAAGCTGGCTTGCTTGTTTGTTTCACAACTGTACAGGAGCATTTACATAGTACTGGGAGTATAAACCAGTCAGTAAAGTTAATCTATACAGATAACTTGGCACAACTTTTTGCTGATGGAACGAACAAACTGGTCGTAGCATCATCGTGCGACCGTGCTAACGCCATCTCCGAAAAGAATGTTTTTCACAGGCGGAGGAATCCAAGCTGGATTGTTGGCGTTCTGTCCTTCGATATGTTGAGATTCACAATACACAAGGTTGCTTTGTCCTTTCTTCACCTTTGATCTCCCCACCTTCTTCCCAAATGCAATTCCCAGCTTCATTGCTGTAAACATCCCAAATGCAATCAGAACTGGTCTAACTGACCAGTGAAAGTCCTCAAGGTGTTGGTACTTTTTCACCATCCCTGGACAAGTGCTAAAGTTGACTGCCTCGACCTCGGCTGGATCAGCCATGTATGAATCACATTCTTCTTTGAAAGAGAGAATTCCAGGGAACTTGACCATGAGACAACCATTGGGTAGGATCCACCAAATCCTCCCTGTGGCCCAAACATTTCCTTTTTTACGAGGCCATTCGAATCGTGGTCTCAGGATGCTGGCTTTTATCCGTACAAATTGGCCAACGCAAAACGATTCTGCCATTTGAAACTGGGAAGAGTTACCATTCCAGAGAGTTTCTACACCAATAAAAGCAACAGCTACATTACCAACACGATCGATTGAATGAAGAATTCCCACTGGAGAGTGTTTCTTTTCTGCTTCCTTCAAACGTATCCAGTCACCAACTGCTAACCCGAAACTCACCCGCTCAAGCGTTGAAGCATACACTCTTACGGGATCATGGATTCCACGAACTCTCACCAAAACAAAAGCATCCTTTTCAGTTTCGCGTTCTAAACCCACTATCTTTCCTTCAGGGATGTTCATATTATCTGACTTGCATGAATTCAATGGCTTTCTAGAACGCACCAAGTCATTCACTTGCAAGTGATCCTTTGAGAGAAACCATTCAGTATGTCCAGTGGCATTTGATTTGTTCAACACTTTTGAACTCCCAATGGCTTGCCAGTCGCTATTAACATGTTGAGAGCTACAAAGCAGAATTAACTCCATCAATTAATTATATATATATATACCAAAAAGAGAGAGAGAGAGAGAGAGAGATGGCATTCCTTATAATTACTTGTATCTTCTCTCTTACATTTAATTTTCTCTTTAAAACAGACTTCTCATGTCCCTTATTGTCTACAAATTCTGTTAATGGTCAAAACTTGAAGTCCAAAGATTTAGTTGGATAGTACAGAAAGAAAGAAACTAGTTACTAATGCTAGCTAAATGCATAAATGAATCAAGTAGGAAATTCAAACGAAACCAAAAGTACAAATTATACACCTCAATCAGAGACCAAGAAAAATAATAGACAGAAAACCATACAACTCTTACAATCAAAAGAAGGAAGCATTTTTTGATTGATAAGGAAATCAATGTCATGGGAAAACAGAGCAGATATGAAGGAGAAGGCATACCTCTGGAACACATTTAGTATGTCTGTCAATAAAGGACGACTCCTTAAGTCATATTCAAAGCAACCAAGAAGAACATTCTCGATTAGAGGAGGAAGTCCACTTGGAATACATGGTTTTTCTTGTTTCCTGACAACAGAATGAAATATTTCATCAACTGATTTCCCACGCCATGGCTGTACACCAGTCAACATCTCAATTATGCAGCAGGCAAACCCCCAAGAATCAGTTTCATACGATACCGGACCTCTAACTTCTGGCTGCCATTGTTCTGGTGCCATATAGTTAGGTGTCCCAAGTCTCTGAATTATATCCATATTTGGTATCGGTACACCGTGTAGTAGAAAAGGAATTCCAACATCTCCCAAAATTGCTTGATCCTTGGTGGTAAGGAGCATGTTTGAAGGTTTCAGGTTAAGAACCAAGATCTCTTTTGAGTGCAATTCTAGAATCCCCTGGGCTAGATTGATTCCATATCTGAAACGAGTAGCATATGTGGAGTATTAGTAGCCCCCACAAAAGTTATTAGTTCAATCAATAGATAGCACTTAGAAGCACTAATCTAAAAGATACAGATAACAACTTCATTGTCAACCTAAACATTCAAACCCATATCAAAATATCACCTCAATACATCAGGGAGAGAAAGCTTTCCATCTTTAAGGCGAGCCATTTTATCAGCAACTGATCCTTCATAGAATTTCATGATGATGCATAACTGAAAGAACACACAAAGGACAGTAACATCTTAGCAACCTGTAAGGTACCTAGTTACGGATATTAGTATGGTGGTATTAGTATGGGGCATAAGGGTAATTAGGTAGGGAGTTAGTGTATCTTGATATAAAATTGTTAGGATACCTCCTAACTAGAATAAGACCTAAAACATATTGAGATATAAAAGATGAAACTACAAGATAATCCAAGTATAAGCCACTTGAGACCTCCCTCTTGAGTATTCTAATCCAAACCCTAGATCACTCAACAATTACCTACCTACACATATATTTCGAACAAAATAGCCCTACAAATGCTGAAGCTACAGATACCAGAATTACCTTCCCACCAATTATAGACACTCCATACAGCCTGCAGACACCCTTTGCACCTTGACACTTATAGAAATGATCCTCAAGTTTATCCAAAACAACCCTCAAATGGTCATCTTTAACAGGATTCAACATCTTGACTGCAACTTCATGATACTCATCATAATCCTTAGTGGACTGATGGTGAGTTGCTAACCAAACATCACCGAATACGCCTCGCCCAATTCGATGCCTAAGCTTCACTTTTGAAGGTTCAATCCTAGGGCTTGGATAGTTTGATGAAGCAACCACTGTCCTAACATGATCAGGATCACCATCAAGAAGCTCATACTCAAAAGAAGGAACCGGTTGAGAAGCAGCTTCTTGTGTAGACATTCTTCAAATCAAA
mRNA sequence
CGAAAAACGTCGAAAAATCAGCGCAAGTTAGAGCAAACTAAAATGCGCCATTTACCATTTTTTCTTTCCCTCCTTTCATTTGCTTTTTGTCTCCAATTGCTTTCGTTCATCACTGCCTTTTTTCCTCACCGCTCATTTGTTTCTGTGATTCTCTTCCGCCGCGCTTTTTCTTTGATCTGCCGTGAAAGCCTAGGGCTCAACTTGATCGTCTCGTGACTGGGAAAACTGAAATTCTTCTTCACTTTGATGCATTCATTTCTCAGATTGATTCGGTAATCAACAACAGTTTCGTCTTTGTTTGTGCACGGATGTGCTCCACCGGCGTTCCATGGCGTCTCCGGTGGCTACTCATCTCAACATCTCCACCTCTTCCTCTCTCTGCCGTTCCAAGAGGTTGTCTCTGAGGCTCAGTAGGAATCGAACAAAATTTATTACTAGCACAACTCAGAAGAGGAGATCTCATTCCCTGAAGGTTGTCCAGTCAGTCCTGAATAACTGCAAGTCAAATTTAAATGACAATGGAGCAAGTGAAGAAGCAAAACTGCTTCTTGAGAGATTGTATGCCCAGACACAGAGATTGGAAGAACATGTAAACAAAGATCCTCACTCCCCCCAAGATGTTTGGCTAGGGCTCAGTCTTGAAAATCTCGAGTCTGATCTCCAGGCTGCATTGGCAGTATTGAAAAAGAAGGAAGAAGATCTACAAGATGCAGAAAGAACGATTCTATTAGAACGAAGCCAATTAAATAATGCGAGGGAGAAGTTGGAGAAGCAGGAGGAAGAATTAACTGCTGCTTATCATAAACAGCAAAAATTAGAAGATGAGCTTAAGCAGGCAAATCTAGACTTAGCTTCTCAAGCTAGACAGATTGATGAATTAAAACTTCAAATTAGGGAGAAAGACGGGGGGATTGCTGCAGTTGAATCTGCCCTTACTTTGAAAGAAGACGAGTTGAAGAGAATGAGAGATGATTTGGCTAAGAAGAGTGAGGAAGCTGTTAAAACAGATTTTGAACTTAAATCTAAGTCCCAGCTTCTGACTGAAGCCATCGAAGTAGTGAAAAGACAAGAAGTTGAGCTGCAAATGCTTAAAAAGGCTGTGCTAGAGAAAGAAAAAGAACTGGAACTTTCTGTAAAGCTGCAGAAACTTGAAGAGGAAAGACTGGAAGTTGTAGAGAAAAATTTGGAGAACAGGACCATGGAATGGTTGTTAGCACAGGAAGAATTGAAAAAAATGAGAAAGGAAGGATCTAAGAAAGCAGTAGAGATGAACAAAACAGTGAATGACTTCAACCGAGTTAAGAAGCTTCTTGCCGATGTAAAAAGTGAGTTAGTTTCCTCTCAAAAGTCCCTTGTGTCCTCCAGAAAGAAAATAGAAGAACAAGAAGATATTCTTGGGAGGCAAATGGCAGAACTTGAAGAACAAAAGAAGGGTATCAATGCATATATGTCAAGTTTAAAAGATGCTCAAATTGAAGTAGAGAGTGAAAGAGTGAAGCTTAGAGTTGCAGAGGCTCATAACAAGGAGCTTGAATGCGACTTGTTGATGGAAAAGGAGCTCACTGATGAGTTACAGCAACAACTGAAGAAAGAGAAATCCTATCTCCAGCAGGAAACTGAGGAGAAATCACTTCTACAGAAGGAGCTAGAGCATAAACATATCGAGTTCGAGAAAACTCACACTCTCCTTCAAGATAAAGCGTCGGAGTTGGTCGAAGCCAAGTTAGAAATCCAGCATTTGAAATCCGAACAGGTTTCTCTTCAACTTCTTTTGGAAGAGAAAGACTTGGAGATACTCGATGCACAAAAGAAAATTGAGGAATTAAATCAGGAAATTATTGAGCTACAAACGCTTATGAGTATTAAAGAAGCTCAGCTCAATCAGACAACTGCTATGCTGAAAGAGAAAGATGAATGCGTTCAGATAATGCAAAATGAACTAAACGATACAAAGCTGAAAATTTCCGAAGCCGAAGCAGCAGTAGAACACATTGTGGATCTCACAAATAAATTGGTGATTTCCATCAAAGGCGGCGACGACAATGATGTTTTGAATTTGAACAACGATCTTAGCCTCAATCTACAGCAGCAGTTATTCATGAAACCTACTGATAACATGAAACTACAGAAGAAACAGCTCGAAACGGAGTTAGAGCTCACCAAGGAAAGCTTGAGACAAAAAGAAATGGAAATTCTAGCTGCAGAAAGAGCACTGACCGTTAAAGACGAGGAACTAAAAACGGTTCAGGAAAGATTAGATGCAAAGGAGAAAGAATTCGAGAAGATGAAGGAAGAGATGGATGAAGAAGCTAAAGATACAAGGAAGCTATACACATTGGCACAAGACAATGTTGGAGGAGAGGACAACAACATTGAGGACTTTGCAATTGGAAGGCTTCAAGTTGAAGCTGCTCAGTTAGAGGTAGAAGCAGCTACTAGCGCTCTTCAGAAACTCACAGACATGAGCCGAGAGCTTCTGAATAAAGCTAGCCATAGCCTCGAGGTTGATATCGGTTCAAGAAGCATTCACATTCAGCAGCACAATGACGACGAAGACGATGTCGATGCCGGAACAGGCGGTTGGGTTGATAATAACAACACAAGATTTAATGAAGTGAAGGTGGAAGTTTCCCGTCTTTCATCTCTAACTGAACAGCTCTTGAAGGAAGCTGGTATATTTGTTGATGCGAATTAGTGGAGATTTATTGGGATGAATAAAGGGTAGCTTGTGTAAATATTGGTTGCTTTTGGCATTATTTTTCCTTCCCAAGGCACTTCAAAGTTAACAAATTCGATTTTCTCTCTTATTTACTACGGCGACGAATCCGTAGCAATCAATAGAAAAGGAGCATTTACATAGTACTGGGAGTATAAACCAGTCAGTAAAGTTAATCTATACAGATAACTTGGCACAACTTTTTGCTGATGGAACGAACAAACTGGTCGTAGCATCATCGTGCGACCGTGCTAACGCCATCTCCGAAAAGAATGTTTTTCACAGGCGGAGGAATCCAAGCTGGATTGTTGGCGTTCTGTCCTTCGATATGTTGAGATTCACAATACACAAGGTTGCTTTGTCCTTTCTTCACCTTTGATCTCCCCACCTTCTTCCCAAATGCAATTCCCAGCTTCATTGCTGTAAACATCCCAAATGCAATCAGAACTGGTCTAACTGACCAGTGAAAGTCCTCAAGGTGTTGGTACTTTTTCACCATCCCTGGACAAGTGCTAAAGTTGACTGCCTCGACCTCGGCTGGATCAGCCATGTATGAATCACATTCTTCTTTGAAAGAGAGAATTCCAGGGAACTTGACCATGAGACAACCATTGGGTAGGATCCACCAAATCCTCCCTGTGGCCCAAACATTTCCTTTTTTACGAGGCCATTCGAATCGTGGTCTCAGGATGCTGGCTTTTATCCGTACAAATTGGCCAACGCAAAACGATTCTGCCATTTGAAACTGGGAAGAGTTACCATTCCAGAGAGTTTCTACACCAATAAAAGCAACAGCTACATTACCAACACGATCGATTGAATGAAGAATTCCCACTGGAGAGTGTTTCTTTTCTGCTTCCTTCAAACGTATCCAGTCACCAACTGCTAACCCGAAACTCACCCGCTCAAGCGTTGAAGCATACACTCTTACGGGATCATGGATTCCACGAACTCTCACCAAAACAAAAGCATCCTTTTCAGTTTCGCGTTCTAAACCCACTATCTTTCCTTCAGGGATGTTCATATTATCTGACTTGCATGAATTCAATGGCTTTCTAGAACGCACCAAGTCATTCACTTGCAAGTGATCCTTTGAGAGAAACCATTCAGTATGTCCAGTGGCATTTGATTTGTTCAACACTTTTGAACTCCCAATGGCTTGCCAGTCGCTATTAACATGTTGAGAGCTACAAAGCAGAATTAACTCCATCAATTAATTATATATATATATACCAAAAAGAGAGAGAGAGAGAGAGAGAGATGGCATTCCTTATAATTACTTGTATCTTCTCTCTTACATTTAATTTTCTCTTTAAAACAGACTTCTCATGTCCCTTATTGTCTACAAATTCTGTTAATGGTCAAAACTTGAAGTCCAAAGATTTAGTTGGATAGTACAGAAAGAAAGAAACTAGTTACTAATGCTAGCTAAATGCATAAATGAATCAAGTAGGAAATTCAAACGAAACCAAAAGTACAAATTATACACCTCAATCAGAGACCAAGAAAAATAATAGACAGAAAACCATACAACTCTTACAATCAAAAGAAGGAAGCATTTTTTGATTGATAAGGAAATCAATGTCATGGGAAAACAGAGCAGATATGAAGGAGAAGGCATACCTCTGGAACACATTTAGTATGTCTGTCAATAAAGGACGACTCCTTAAGTCATATTCAAAGCAACCAAGAAGAACATTCTCGATTAGAGGAGGAAGTCCACTTGGAATACATGGTTTTTCTTGTTTCCTGACAACAGAATGAAATATTTCATCAACTGATTTCCCACGCCATGGCTGTACACCAGTCAACATCTCAATTATGCAGCAGGCAAACCCCCAAGAATCAGTTTCATACGATACCGGACCTCTAACTTCTGGCTGCCATTGTTCTGGTGCCATATAGTTAGGTGTCCCAAGTCTCTGAATTATATCCATATTTGGTATCGGTACACCGTGTAGTAGAAAAGGAATTCCAACATCTCCCAAAATTGCTTGATCCTTGGTGGTAAGGAGCATGTTTGAAGGTTTCAGGTTAAGAACCAAGATCTCTTTTGAGTGCAATTCTAGAATCCCCTGGGCTAGATTGATTCCATATCTGAAACGAGTAGCATATGTGGAGTATTAGTAGCCCCCACAAAAGTTATTAGTTCAATCAATAGATAGCACTTAGAAGCACTAATCTAAAAGATACAGATAACAACTTCATTGTCAACCTAAACATTCAAACCCATATCAAAATATCACCTCAATACATCAGGGAGAGAAAGCTTTCCATCTTTAAGGCGAGCCATTTTATCAGCAACTGATCCTTCATAGAATTTCATGATGATGCATAACTGAAAGAACACACAAAGGACAGTAACATCTTAGCAACCTGTAAGGTACCTAGTTACGGATATTAGTATGGTGGTATTAGTATGGGGCATAAGGGTAATTAGGTAGGGAGTTAGTGTATCTTGATATAAAATTGTTAGGATACCTCCTAACTAGAATAAGACCTAAAACATATTGAGATATAAAAGATGAAACTACAAGATAATCCAAGTATAAGCCACTTGAGACCTCCCTCTTGAGTATTCTAATCCAAACCCTAGATCACTCAACAATTACCTACCTACACATATATTTCGAACAAAATAGCCCTACAAATGCTGAAGCTACAGATACCAGAATTACCTTCCCACCAATTATAGACACTCCATACAGCCTGCAGACACCCTTTGCACCTTGACACTTATAGAAATGATCCTCAAGTTTATCCAAAACAACCCTCAAATGGTCATCTTTAACAGGATTCAACATCTTGACTGCAACTTCATGATACTCATCATAATCCTTAGTGGACTGATGGTGAGTTGCTAACCAAACATCACCGAATACGCCTCGCCCAATTCGATGCCTAAGCTTCACTTTTGAAGGTTCAATCCTAGGGCTTGGATAGTTTGATGAAGCAACCACTGTCCTAACATGATCAGGATCACCATCAAGAAGCTCATACTCAAAAGAAGGAACCGGTTGAGAAGCAGCTTCTTGTGTAGACATTCTTCAAATCAAA
Coding sequence (CDS)
ATGGCGTCTCCGGTGGCTACTCATCTCAACATCTCCACCTCTTCCTCTCTCTGCCGTTCCAAGAGGTTGTCTCTGAGGCTCAGTAGGAATCGAACAAAATTTATTACTAGCACAACTCAGAAGAGGAGATCTCATTCCCTGAAGGTTGTCCAGTCAGTCCTGAATAACTGCAAGTCAAATTTAAATGACAATGGAGCAAGTGAAGAAGCAAAACTGCTTCTTGAGAGATTGTATGCCCAGACACAGAGATTGGAAGAACATGTAAACAAAGATCCTCACTCCCCCCAAGATGTTTGGCTAGGGCTCAGTCTTGAAAATCTCGAGTCTGATCTCCAGGCTGCATTGGCAGTATTGAAAAAGAAGGAAGAAGATCTACAAGATGCAGAAAGAACGATTCTATTAGAACGAAGCCAATTAAATAATGCGAGGGAGAAGTTGGAGAAGCAGGAGGAAGAATTAACTGCTGCTTATCATAAACAGCAAAAATTAGAAGATGAGCTTAAGCAGGCAAATCTAGACTTAGCTTCTCAAGCTAGACAGATTGATGAATTAAAACTTCAAATTAGGGAGAAAGACGGGGGGATTGCTGCAGTTGAATCTGCCCTTACTTTGAAAGAAGACGAGTTGAAGAGAATGAGAGATGATTTGGCTAAGAAGAGTGAGGAAGCTGTTAAAACAGATTTTGAACTTAAATCTAAGTCCCAGCTTCTGACTGAAGCCATCGAAGTAGTGAAAAGACAAGAAGTTGAGCTGCAAATGCTTAAAAAGGCTGTGCTAGAGAAAGAAAAAGAACTGGAACTTTCTGTAAAGCTGCAGAAACTTGAAGAGGAAAGACTGGAAGTTGTAGAGAAAAATTTGGAGAACAGGACCATGGAATGGTTGTTAGCACAGGAAGAATTGAAAAAAATGAGAAAGGAAGGATCTAAGAAAGCAGTAGAGATGAACAAAACAGTGAATGACTTCAACCGAGTTAAGAAGCTTCTTGCCGATGTAAAAAGTGAGTTAGTTTCCTCTCAAAAGTCCCTTGTGTCCTCCAGAAAGAAAATAGAAGAACAAGAAGATATTCTTGGGAGGCAAATGGCAGAACTTGAAGAACAAAAGAAGGGTATCAATGCATATATGTCAAGTTTAAAAGATGCTCAAATTGAAGTAGAGAGTGAAAGAGTGAAGCTTAGAGTTGCAGAGGCTCATAACAAGGAGCTTGAATGCGACTTGTTGATGGAAAAGGAGCTCACTGATGAGTTACAGCAACAACTGAAGAAAGAGAAATCCTATCTCCAGCAGGAAACTGAGGAGAAATCACTTCTACAGAAGGAGCTAGAGCATAAACATATCGAGTTCGAGAAAACTCACACTCTCCTTCAAGATAAAGCGTCGGAGTTGGTCGAAGCCAAGTTAGAAATCCAGCATTTGAAATCCGAACAGGTTTCTCTTCAACTTCTTTTGGAAGAGAAAGACTTGGAGATACTCGATGCACAAAAGAAAATTGAGGAATTAAATCAGGAAATTATTGAGCTACAAACGCTTATGAGTATTAAAGAAGCTCAGCTCAATCAGACAACTGCTATGCTGAAAGAGAAAGATGAATGCGTTCAGATAATGCAAAATGAACTAAACGATACAAAGCTGAAAATTTCCGAAGCCGAAGCAGCAGTAGAACACATTGTGGATCTCACAAATAAATTGGTGATTTCCATCAAAGGCGGCGACGACAATGATGTTTTGAATTTGAACAACGATCTTAGCCTCAATCTACAGCAGCAGTTATTCATGAAACCTACTGATAACATGAAACTACAGAAGAAACAGCTCGAAACGGAGTTAGAGCTCACCAAGGAAAGCTTGAGACAAAAAGAAATGGAAATTCTAGCTGCAGAAAGAGCACTGACCGTTAAAGACGAGGAACTAAAAACGGTTCAGGAAAGATTAGATGCAAAGGAGAAAGAATTCGAGAAGATGAAGGAAGAGATGGATGAAGAAGCTAAAGATACAAGGAAGCTATACACATTGGCACAAGACAATGTTGGAGGAGAGGACAACAACATTGAGGACTTTGCAATTGGAAGGCTTCAAGTTGAAGCTGCTCAGTTAGAGGTAGAAGCAGCTACTAGCGCTCTTCAGAAACTCACAGACATGAGCCGAGAGCTTCTGAATAAAGCTAGCCATAGCCTCGAGGTTGATATCGGTTCAAGAAGCATTCACATTCAGCAGCACAATGACGACGAAGACGATGTCGATGCCGGAACAGGCGGTTGGGTTGATAATAACAACACAAGATTTAATGAAGTGAAGGTGGAAGTTTCCCGTCTTTCATCTCTAACTGAACAGCTCTTGAAGGAAGCTGGTATATTTGTTGATGCGAATTAG
Protein sequence
MASPVATHLNISTSSSLCRSKRLSLRLSRNRTKFITSTTQKRRSHSLKVVQSVLNNCKSNLNDNGASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESDLQAALAVLKKKEEDLQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQANLDLASQARQIDELKLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEEAVKTDFELKSKSQLLTEAIEVVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEERLEVVEKNLENRTMEWLLAQEELKKMRKEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILGRQMAELEEQKKGINAYMSSLKDAQIEVESERVKLRVAEAHNKELECDLLMEKELTDELQQQLKKEKSYLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAKLEIQHLKSEQVSLQLLLEEKDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTAMLKEKDECVQIMQNELNDTKLKISEAEAAVEHIVDLTNKLVISIKGGDDNDVLNLNNDLSLNLQQQLFMKPTDNMKLQKKQLETELELTKESLRQKEMEILAAERALTVKDEELKTVQERLDAKEKEFEKMKEEMDEEAKDTRKLYTLAQDNVGGEDNNIEDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELLNKASHSLEVDIGSRSIHIQQHNDDEDDVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQLLKEAGIFVDAN
Homology
BLAST of Bhi09G001921 vs. TAIR 10
Match:
AT4G32190.1 (Myosin heavy chain-related protein )
HSP 1 Score: 573.2 bits (1476), Expect = 3.3e-163
Identity = 373/781 (47.76%), Postives = 548/781 (70.17%), Query Frame = 0
Query: 6 ATHLNISTSSSLCRSKRLSLRLSRNRTKFITS--TTQKRRSHSLKVVQSVLNNCKSNLND 65
A LN+++ SS R+ ++ K + + + +R+ H L VQSVL+N + N+ND
Sbjct: 6 AIRLNLASFSSPSPCDYCLTRVVNHKQKSLVAFPSITRRKRHLLLSVQSVLHNTRPNIND 65
Query: 66 NGASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLS-LENLESDLQAALAVLKKKE 125
NG++E A +L ++L+A+T RLE N+ P D L S L LESDL+AAL L K+E
Sbjct: 66 NGSAESANVLFDKLFARTHRLERQTNQHSVYPDDDDLPYSNLGVLESDLEAALVALLKRE 125
Query: 126 EDLQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQANLDLASQARQID 185
EDL DAER +L ++++LN A+E+LEK+E+ ++ A K + L++ELK+AN++LASQAR+I+
Sbjct: 126 EDLHDAERKLLSDKNKLNRAKEELEKREKTISEASLKHESLQEELKRANVELASQAREIE 185
Query: 186 ELKLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEEAVKTDFELKSKSQLLTEAIE 245
ELK ++RE+D AA++S+LTLKE+EL++MR ++A +S+E E +SKSQLL++A E
Sbjct: 186 ELKHKLRERDEERAALQSSLTLKEEELEKMRQEIANRSKEVSMAISEFESKSQLLSKANE 245
Query: 246 VVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEERLEVVEKNLENRTMEWLLAQEELKK 305
VVKRQE E+ L++A+ EKE+ELE+S +KLE+E+L E NL+ +T EWL+AQ+E+ K
Sbjct: 246 VVKRQEGEIYALQRALEEKEEELEISKATKKLEQEKLRETEANLKKQTEEWLIAQDEVNK 305
Query: 306 MRKEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILGRQMAE 365
+++E K+ E N+T+ DF +VKKLL DV+ EL+SS+++LV SR+++EE+E +L +Q+ E
Sbjct: 306 LKEETVKRLGEANETMEDFMKVKKLLTDVRFELISSREALVFSREQMEEKELLLEKQLEE 365
Query: 366 LEEQKKGINAYMSSLKDAQIEVESERVKLRVAEAHNKELECDLLMEKELTDELQQQLKKE 425
LEEQ+K + +YM SL+DA EVESERVKLRV EA N LE ++ ++KEL ++L+++L+KE
Sbjct: 366 LEEQRKSVLSYMQSLRDAHTEVESERVKLRVVEAKNFALEREISVQKELLEDLREELQKE 425
Query: 426 KSYLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAKLEIQHLKSEQVSLQLLL 485
K L+ + S++Q EL K F+ + LLQ+K S LVEAKLEIQHLKSEQ SL+LLL
Sbjct: 426 KPLLELAMHDISVIQDELYKKANAFQVSQNLLQEKESSLVEAKLEIQHLKSEQASLELLL 485
Query: 486 EEKDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTAMLKEKDECVQIMQNELNDTK 545
+EKD E+ +A+ K+ E+NQE+ EL+ LM +E QL + T MLKEKD + ++ EL +K
Sbjct: 486 QEKDEELAEARNKLGEVNQEVTELKALMISREDQLMEATEMLKEKDVHLHRIEGELGSSK 545
Query: 546 LKISEAEAAVEHIVDLTNKLVISIKGGDDNDVLNLNNDLSLNLQQQLFMKPTDNMKLQKK 605
LK++EAE VE I +LTN+L++S G + + + +NN++S++ QQ KP D+ ++ K
Sbjct: 546 LKVTEAEMVVERIAELTNRLLMSTTNGQNQNAMRINNEISIDSMQQPLEKPHDDYGMENK 605
Query: 606 QLETELELTKESLRQKEMEILAAERALTVKDEELKTVQERLDAKEKEFEKMKEEMDEEAK 665
+L EL T+E+LR KEME+LA +RALT KDEE+ V RL+AKE+E +K+KEE +++
Sbjct: 606 RLVMELSFTRENLRMKEMEVLAVQRALTFKDEEINVVMGRLEAKEQELKKLKEETINDSE 665
Query: 666 DTRKLYTLAQDNVGGEDNNIEDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELLNKASH 725
D + LY LAQ+ VG + + D AI LQ+EAA LEVEAATSALQKL MS ELL +A
Sbjct: 666 DLKVLYALAQERVG--EKTMGDLAIEMLQLEAANLEVEAATSALQKLAKMSTELLTQADM 725
Query: 726 SLEVDIGSRSIHIQQHNDDEDDVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQLLKEAG 784
S+E D + ++ ++ G+ + +N EVK EV RL SLTE+LL+ AG
Sbjct: 726 SIEADT-THTVMPER-------------GYSEGSNECLGEVKTEVVRLWSLTEKLLENAG 770
BLAST of Bhi09G001921 vs. ExPASy Swiss-Prot
Match:
Q54G05 (Putative leucine-rich repeat-containing protein DDB_G0290503 OS=Dictyostelium discoideum OX=44689 GN=DDB_G0290503 PE=4 SV=1)
HSP 1 Score: 51.6 bits (122), Expect = 4.7e-05
Identity = 166/772 (21.50%), Postives = 335/772 (43.39%), Query Frame = 0
Query: 51 QSVLNNCKSNLNDNGASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESD 110
QS++N +SNLN+N + +N+ + Q S + L S
Sbjct: 629 QSIINELQSNLNEN--------------------QNKINELIENNQS-----SSDELNSK 688
Query: 111 LQAALAVLKKKEEDLQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQA 170
L LK K E+++ E +I+ + +L+ + + EL Q KL ++
Sbjct: 689 LIKLSDELKDKNENVRSLETSIIENQDKLDQLIQSNQVTVNEL------QSKLNEKEINI 748
Query: 171 NLDLASQARQIDELKLQIREKDGGI-----------AAVESALTLKEDELKRMRDDLAKK 230
N + + +DEL+ ++ EK I ++S L K E+ ++ L +
Sbjct: 749 NQLIENNQSSLDELQSKLNEKQNEINQLIENNQSSSDELQSKLNEKHQEISELQSKLNEL 808
Query: 231 SEEAVKTDFELKSKSQLLTEAIEVVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEERL 290
E + EL+SK L + + +K ++ +L+ L ++E +++L V+L K ++ L
Sbjct: 809 IENNESSSDELQSK---LIQLSDELKEKDEKLKSLDSIIIENQEKL---VQLTKSNQDSL 868
Query: 291 EVVEKNLENRTMEWLLAQEELKKMRKEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQ 350
+ ++ L + Q E+ ++ + + E+ +N+ LL + ++Q
Sbjct: 869 DELQSKLNEK-------QNEINELIENNQSSSNELQSKLNEKQNEINLLIE------NNQ 928
Query: 351 KSLVSSRKKIEEQEDILGRQMAELEEQKKGINAYMSSLKDAQIEVESERVKL--RVAEAH 410
S + K+ E+ + ++L E++ IN + + + + E++S+ ++L ++ E
Sbjct: 929 SSSDELQSKLNEKHQEINELQSKLNEKQNKINELVENNESSSDELQSKLIQLSDQLQEKE 988
Query: 411 N--KELECDLLMEKELTDELQQQLKKEKSYLQQETE--EKSL--LQKELEHKHIEFEKTH 470
N K E ++ E ++LQ +L ++++ + Q TE + SL LQ L K E +
Sbjct: 989 NQLKSFESSIIERDEKLNQLQSKLNEKQNEIDQITENNQSSLDELQSNLNEKQNEINQ-- 1048
Query: 471 TLLQDKASEL--VEAKL--EIQHLKSEQVSLQLLLEEKDLEILDAQKKIEELNQEIIELQ 530
L+++ S L +++KL ++ + + + L++ + D Q K E L QE+ E
Sbjct: 1049 -LIENNQSSLDELQSKLNEKLNEINEKDNKINELIQTNESLSKDQQSKFENLEQELEEKN 1108
Query: 531 TLMSIKEAQLNQTTAMLKEKDECVQIMQNELNDTKLKISEAEAAVEH----IVDLTNKLV 590
+ +Q+ EK +NELN +LK+ E + +E+ I+D+ N+L
Sbjct: 1109 NKILDLNSQIIDVNHQFSEK-------ENELNQLQLKLIEKDQEIENQNNKIIDINNQL- 1168
Query: 591 ISIKGGDDNDVLNLNNDLSLNLQQQLFMKPTDNMKLQKKQLETELELTKESLRQKEMEIL 650
+ +N+NND N ++ + + + +K + + LE EL L K+++ +K +I
Sbjct: 1169 -----NEKEKEININNDNDNNNEENIQL--IEELKEKLQDLENELNLEKDTVNEKNDDIN 1228
Query: 651 AAERALTVKDEELKTVQERLDAKEKEFEKMKEEMDEE----------AKDTRKLYTLAQD 710
+ EE+K + E+L KE+E +M + DE K + T A
Sbjct: 1229 ELK-------EEIKLISEKLSEKEQELNEMINDYDESLNEINDQKDLVKSLNERLTNAHL 1288
Query: 711 NVGGEDNNI-----EDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELLNKASHSLEVDI 770
+ +DN I E F + Q+ ++ + L + + + +L + S +
Sbjct: 1289 KINEKDNEIHSLSKEGFNEIQSQLNLITNQLSEKDNLLIEKSQIISDLELQLRESYKERS 1324
Query: 771 GSRSIHIQQHNDDEDDVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQLLKE 781
S S+H QQ D+ + NE+K + +L + L ++
Sbjct: 1349 SSSSLH-QQQQMISPDLSNSNDELIVEKEEIINELKEKNQQLEQQLQDLCQQ 1324
BLAST of Bhi09G001921 vs. ExPASy Swiss-Prot
Match:
Q02224 (Centromere-associated protein E OS=Homo sapiens OX=9606 GN=CENPE PE=1 SV=2)
HSP 1 Score: 49.3 bits (116), Expect = 2.3e-04
Identity = 177/763 (23.20%), Postives = 357/763 (46.79%), Query Frame = 0
Query: 14 SSSLCRSKRLSLRLS---RNRTKFITSTTQKRRSHSLKVVQSVLNNCKSNLNDNGASEEA 73
S++L R + LRL+ + + I S T++R +LK ++ L + + E
Sbjct: 1320 STTLARIEMERLRLNEKFQESQEEIKSLTKER--DNLKTIKEAL-----EVKHDQLKEHI 1379
Query: 74 KLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESDLQAALAV------LKKKEED 133
+ L ++ + E+ +N + + +E + A L + L K+ ++
Sbjct: 1380 RETLAKIQESQSKQEQSLNMKEKDNETTKIVSEMEQFKPKDSALLRIEIEMLGLSKRLQE 1439
Query: 134 LQDAERTILLERSQLNNAREKLEKQ----EEELTAAYHKQQKLEDELKQANLDLASQARQ 193
D +++ E+ L +E L+ + +E + K + E+ELK A+ L Q
Sbjct: 1440 SHDEMKSVAKEKDDLQRLQEVLQSESDQLKENIKEIVAKHLETEEELKVAHCCLKEQEET 1499
Query: 194 IDELKLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEE-AVKTDFELKSKSQLLTE 253
I+EL++ + EK+ I+ ++ L D+L+ ++ +K E+ +K E++ K L +
Sbjct: 1500 INELRVNLSEKETEISTIQKQLEAINDKLQNKIQEIYEKEEQFNIKQISEVQEKVNELKQ 1559
Query: 254 AIEVVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEE-RLEVVEKNLENRTMEWL-LAQ 313
E K ++ LQ + + K LEL+ +LQ+ +EE ++ + EK R E L + +
Sbjct: 1560 FKEHRKAKDSALQSI------ESKMLELTNRLQESQEEIQIMIKEKEEMKRVQEALQIER 1619
Query: 314 EELKKMRKEGSKKAVEMNKTVNDF------NRVKKLLADVK--SELVSSQKSLVSSRKKI 373
++LK+ KE K E + F N ++ + +++ E +QK + + I
Sbjct: 1620 DQLKENTKEIVAKMKESQEKEYQFLKMTAVNETQEKMCEIEHLKEQFETQK---LNLENI 1679
Query: 374 EEQEDILGRQMAELEEQKKGINAYMSSLKDAQ--IEVESERVKLRVAEAHNKELEC--DL 433
E + L + + E E+ + + L+ + ++VE +++K + E ++LE +L
Sbjct: 1680 ETENIRLTQILHENLEEMRSVTKERDDLRSVEETLKVERDQLKENLRETITRDLEKQEEL 1739
Query: 434 LMEKELTDELQQQLKKEKSYLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAK 493
+ E Q+ + K + + ++T E S +QK+LEH + + +Q+ E +
Sbjct: 1740 KIVHMHLKEHQETIDKLRGIVSEKTNEISNMQKDLEHSNDALKAQDLKIQE------ELR 1799
Query: 494 LEIQHLKSEQVSLQLL---LEEKDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTA 553
+ HLK +Q ++ L + EK ++ + QK +E N ++ E I+E + N+
Sbjct: 1800 IAHMHLKEQQETIDKLRGIVSEKTDKLSNMQKDLENSNAKLQE-----KIQELKANEHQL 1859
Query: 554 MLKEKDECVQIMQNELNDTKLKISEAEAAVEHIVDLT---NKLVISIKGGDDNDVLNLNN 613
+ +KD +N+T+ K+SE E + I D + +KL I + LNL
Sbjct: 1860 ITLKKD---------VNETQKKVSEMEQLKKQIKDQSLTLSKLEI--------ENLNLAQ 1919
Query: 614 DLSLNLQQ-QLFMKPTDNMKLQKKQLETELELTKESLRQ---KEMEI---LAAERALTVK 673
L NL++ + MK DN++ ++ L+ E + KESL++ +++EI L R L+ +
Sbjct: 1920 KLHENLEEMKSVMKERDNLRRVEETLKLERDQLKESLQETKARDLEIQQELKTARMLSKE 1979
Query: 674 DEE-LKTVQERLDAK-------EKEFEKMKEEMDEEAKDTRK--LYTLAQDNVGGEDNNI 726
+E + ++E++ K +K+ +K K+E+ ++ ++ +K L L ED N+
Sbjct: 1980 HKETVDKLREKISEKTIQISDIQKDLDKSKDELQKKIQELQKKELQLLRVK----EDVNM 2032
BLAST of Bhi09G001921 vs. ExPASy TrEMBL
Match:
A0A0A0LDN9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G877610 PE=4 SV=1)
HSP 1 Score: 1152.1 bits (2979), Expect = 0.0e+00
Identity = 678/788 (86.04%), Postives = 712/788 (90.36%), Query Frame = 0
Query: 1 MASPVATHLNISTSSSLCRSKRLSLRLSRNRTKFITSTTQKRRSHSLKVVQSVLNNCKSN 60
MASP ATHLN STSSS+ +S+R SLR RNRT F ST QKRRSHSLKVVQSVLNNCKSN
Sbjct: 1 MASPAATHLNFSTSSSISQSQRSSLRFGRNRTNFFYSTNQKRRSHSLKVVQSVLNNCKSN 60
Query: 61 LNDNGASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESDLQAALAVLKK 120
LNDNGA+EEAKLLLERLYAQTQRLEEHV+KDPH PQDVWLGLSLENLESDLQAALAVLKK
Sbjct: 61 LNDNGANEEAKLLLERLYAQTQRLEEHVSKDPHFPQDVWLGLSLENLESDLQAALAVLKK 120
Query: 121 KEEDLQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQANLDLASQARQ 180
KEEDLQDAERTILLERSQLNNAREKLEKQEEE+T AY KQQ+LEDELK+ANL+L SQ R
Sbjct: 121 KEEDLQDAERTILLERSQLNNAREKLEKQEEEITVAYRKQQELEDELKEANLNLVSQTRL 180
Query: 181 IDELKLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEEAVKTDFELKSKSQLLTEA 240
IDELKLQI EKD GIAAVESAL LKEDELKRMR DLA KSEEA KT+ ELKSKSQLLTEA
Sbjct: 181 IDELKLQIMEKDEGIAAVESALALKEDELKRMRADLAMKSEEAFKTNCELKSKSQLLTEA 240
Query: 241 IEVVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEERLEVVEKNLENRTMEWLLAQEEL 300
EVVKRQEVELQMLKK V+EKEKE ELSVKLQKLE ERLEVVEKNLE RTMEWLLAQEEL
Sbjct: 241 NEVVKRQEVELQMLKKTVVEKEKEFELSVKLQKLEVERLEVVEKNLEKRTMEWLLAQEEL 300
Query: 301 KKMRKEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILGRQM 360
KK +KE SKK VEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDIL RQM
Sbjct: 301 KKTKKEASKKTVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILERQM 360
Query: 361 AELEEQKKGINAYMSSLKDAQIEVESERVKLRVAEAHNKELECDLLMEKELTDELQQQLK 420
AELEEQKKGINAYMSSLKDAQIEVESERVKLR EAHNKELE DL+ EKELTDELQQQL+
Sbjct: 361 AELEEQKKGINAYMSSLKDAQIEVESERVKLRFIEAHNKELEGDLVKEKELTDELQQQLE 420
Query: 421 KEKSYLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAKLEIQHLKSEQVSLQL 480
+EKS+LQQ TEEKSLLQ ELEHK IEFEKTH LLQDKAS LVEAKLEIQHLKS+QVSLQL
Sbjct: 421 REKSFLQQATEEKSLLQNELEHKRIEFEKTHKLLQDKASALVEAKLEIQHLKSKQVSLQL 480
Query: 481 LLEEKDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTAMLKEKDECVQIMQNELND 540
LLEEKDLEILDAQKKI+ LNQEIIELQTLMS KEAQL+QTTAMLKEKDE V+ MQNELND
Sbjct: 481 LLEEKDLEILDAQKKIQNLNQEIIELQTLMSSKEAQLDQTTAMLKEKDERVETMQNELND 540
Query: 541 TKLKISEAEAAVEHIVDLTNKLVISIKGGDDNDVLNLNNDLSLNLQQQLFMKPTDNMKLQ 600
TKLKISEAEAAVEHIVDLTNKLVISIK GD+ DVL LN +LSLNLQQQLF KPTDN++LQ
Sbjct: 541 TKLKISEAEAAVEHIVDLTNKLVISIKDGDEYDVLKLNENLSLNLQQQLFKKPTDNIRLQ 600
Query: 601 KKQLETELELTKESLRQKEMEILAAERALTVKDEELKTVQERLDAKEKEFEKMKEEMDEE 660
KKQLETELELTKESLR+KEMEILAAERALTVKDEELKTVQERLD KEKEFEKMKEEMDEE
Sbjct: 601 KKQLETELELTKESLRRKEMEILAAERALTVKDEELKTVQERLDGKEKEFEKMKEEMDEE 660
Query: 661 AKDTRKLYTLAQDNVGGEDNNIEDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELLNKA 720
K R+ YTLAQDNVGG D AI RLQ EAAQLEVEAATSALQKLTDMSR+LLNKA
Sbjct: 661 GKHLREQYTLAQDNVGG------DLAIERLQFEAAQLEVEAATSALQKLTDMSRDLLNKA 720
Query: 721 SHSLEVDIGSRSIHIQQHNDDEDDVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQLLKE 780
SLE DIGSRSI IQQH+DD + V+ +DNNN+RFNEVKVEVSRLSSLTEQLLKE
Sbjct: 721 GRSLEADIGSRSIRIQQHDDDNNGVNG-----IDNNNSRFNEVKVEVSRLSSLTEQLLKE 777
Query: 781 AGIFVDAN 789
AGIF+DA+
Sbjct: 781 AGIFLDAD 777
BLAST of Bhi09G001921 vs. ExPASy TrEMBL
Match:
A0A1S3CSI3 (putative leucine-rich repeat-containing protein DDB_G0290503 OS=Cucumis melo OX=3656 GN=LOC103503836 PE=4 SV=1)
HSP 1 Score: 1143.6 bits (2957), Expect = 0.0e+00
Identity = 683/791 (86.35%), Postives = 716/791 (90.52%), Query Frame = 0
Query: 1 MASP-VATHLNISTSSSLCRSKRLSLRLSRNRTKFITSTTQKRRSHSLKVVQSVLNNCKS 60
MASP THLN S+SSSL +S+R SLR SRNR KF ST QKRRSH LKVVQSVLNN KS
Sbjct: 1 MASPATTTHLNFSSSSSLPQSQRSSLRFSRNRIKFCYSTNQKRRSHPLKVVQSVLNNFKS 60
Query: 61 NLNDNGASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESDLQAALAVLK 120
NLNDNGASEEAKLLLERLYAQTQRLEEHV+KDPH P+DVWLGLSLENLESDLQAALAVLK
Sbjct: 61 NLNDNGASEEAKLLLERLYAQTQRLEEHVSKDPHFPRDVWLGLSLENLESDLQAALAVLK 120
Query: 121 KKEEDLQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQANLDLASQAR 180
KKEEDLQDAERTILLERSQLNNAREKLEKQEEE+T AY KQQ+LEDELKQANL+LASQAR
Sbjct: 121 KKEEDLQDAERTILLERSQLNNAREKLEKQEEEITVAYRKQQELEDELKQANLNLASQAR 180
Query: 181 QIDELKLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEEAVKTDFELKSKSQLLTE 240
IDELKLQIREKD GIAAVESAL LKEDELKRM DLA KSEEA+KT+ ELKSKSQLLTE
Sbjct: 181 LIDELKLQIREKDEGIAAVESALALKEDELKRMGADLAMKSEEALKTNCELKSKSQLLTE 240
Query: 241 AIEVVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEERLEVVEKNLENRTMEWLLAQEE 300
A +VVKRQEVELQMLKKAV+EKEKELELSVKLQKLE ER+EVVEKNLE RTMEWLLAQEE
Sbjct: 241 ANKVVKRQEVELQMLKKAVVEKEKELELSVKLQKLEMERVEVVEKNLEKRTMEWLLAQEE 300
Query: 301 LKKMRKEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILGRQ 360
LKKM+KE SK+ VEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDIL RQ
Sbjct: 301 LKKMKKEASKQTVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILKRQ 360
Query: 361 MAELEEQKKGINAYMSSLKDAQIEVESERVKLRVAEAHNKELECDLLMEKELTDELQQQL 420
MAELEEQKKGIN YMSSLKDAQIEVESERVKLR EAHNKELE DLL EKELTDELQQQL
Sbjct: 361 MAELEEQKKGINTYMSSLKDAQIEVESERVKLRFVEAHNKELERDLLKEKELTDELQQQL 420
Query: 421 KKEKSYLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAKLEIQHLKSEQVSLQ 480
K+EKS LQQ TEEKSLLQ ELEHKHIEFEKTH LLQDKASELVEAKLEIQHLKS+QVSLQ
Sbjct: 421 KREKSDLQQATEEKSLLQNELEHKHIEFEKTHKLLQDKASELVEAKLEIQHLKSQQVSLQ 480
Query: 481 LLLEEKDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTAMLKEKDECVQIMQNELN 540
LLLEEKDLEILDAQKKIE LNQEIIELQTLMS KEAQL+QTTAMLKEKDE V+IMQNELN
Sbjct: 481 LLLEEKDLEILDAQKKIENLNQEIIELQTLMSSKEAQLDQTTAMLKEKDERVEIMQNELN 540
Query: 541 DTKLKISEAEAAVEHIVDLTNKLVISIKGGDDND-VLNLNNDLSLNLQQQLFMKPTDNMK 600
DTKLKISEAEAAVEHIVDLTNKLVISIK GDD+D VL LN +LSLNLQQQLF KPTDN++
Sbjct: 541 DTKLKISEAEAAVEHIVDLTNKLVISIKDGDDDDSVLKLNENLSLNLQQQLFKKPTDNLR 600
Query: 601 LQKKQLETELELTKESLRQKEMEILAAERALTVKDEELKTVQERLDAKEKEFEKMKEEMD 660
LQKKQLETELELTKESLR KEMEILAAER+LTVKDEELKTVQERL+AKEKEFEKMKEEMD
Sbjct: 601 LQKKQLETELELTKESLRHKEMEILAAERSLTVKDEELKTVQERLEAKEKEFEKMKEEMD 660
Query: 661 EEAKDTRKLYTLAQDNVGGEDNNI-EDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELL 720
EE K +TL QDN GGEDN I D I RLQ EAAQLE+EAATSALQKLTDMSR+LL
Sbjct: 661 EETKH----HTLPQDNAGGEDNKIGGDLEIERLQFEAAQLEIEAATSALQKLTDMSRDLL 720
Query: 721 NKASHSLEVDIGSRSIHIQQHNDDEDDVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQL 780
NKA SLE DI SRSIHIQQH+D D +D +DNNN+RFNEVKVEVSRLSSLTEQL
Sbjct: 721 NKAGRSLEADIESRSIHIQQHDDYNDGIDD-----IDNNNSRFNEVKVEVSRLSSLTEQL 780
Query: 781 LKEAGIFVDAN 789
LKEAGIFVDA+
Sbjct: 781 LKEAGIFVDAD 782
BLAST of Bhi09G001921 vs. ExPASy TrEMBL
Match:
A0A5A7T7H9 (Putative leucine-rich repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold92G003770 PE=4 SV=1)
HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 671/786 (85.37%), Postives = 709/786 (90.20%), Query Frame = 0
Query: 5 VATHLNISTSSSLCRSKRLSLRLSRNRTKFITSTTQKRRSHSLKVVQSVLNNCKSNLNDN 64
+ H+++ + ++ + R SLR SRNR KF ST QKRRSH LKVVQSVLNNCKSNLNDN
Sbjct: 2 IDVHISLPLTFTI-NTDRSSLRFSRNRIKFCYSTNQKRRSHPLKVVQSVLNNCKSNLNDN 61
Query: 65 GASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESDLQAALAVLKKKEED 124
GASEEAKLLLERLYAQTQRLEEHV+KDPH P+DVWLGLSLENLESDLQAALAVLKKKEED
Sbjct: 62 GASEEAKLLLERLYAQTQRLEEHVSKDPHFPRDVWLGLSLENLESDLQAALAVLKKKEED 121
Query: 125 LQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQANLDLASQARQIDEL 184
LQDA+RTILLERSQLNNAREKLEKQEEE+T AY KQQ+LEDELKQANL+LASQAR IDEL
Sbjct: 122 LQDAKRTILLERSQLNNAREKLEKQEEEITVAYRKQQELEDELKQANLNLASQARLIDEL 181
Query: 185 KLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEEAVKTDFELKSKSQLLTEAIEVV 244
KLQIREKD GIAAVESAL LKEDELKRM DLA KSEEA+KT+ ELKSKSQLLTEA +VV
Sbjct: 182 KLQIREKDEGIAAVESALALKEDELKRMGADLAMKSEEALKTNCELKSKSQLLTEANKVV 241
Query: 245 KRQEVELQMLKKAVLEKEKELELSVKLQKLEEERLEVVEKNLENRTMEWLLAQEELKKMR 304
KRQEVELQMLKKAV+EKEKELELSVKLQKLE ER+EVVEKNLE RTMEWLLAQEELKKM+
Sbjct: 242 KRQEVELQMLKKAVVEKEKELELSVKLQKLEMERVEVVEKNLEKRTMEWLLAQEELKKMK 301
Query: 305 KEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILGRQMAELE 364
KE SK+ VEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQED L RQMAELE
Sbjct: 302 KEASKQTVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDFLKRQMAELE 361
Query: 365 EQKKGINAYMSSLKDAQIEVESERVKLRVAEAHNKELECDLLMEKELTDELQQQLKKEKS 424
EQKKGIN YMSSLKDAQIEVESERVKLR EAHNKELE DLL EKELTDELQQQLK+EKS
Sbjct: 362 EQKKGINTYMSSLKDAQIEVESERVKLRFVEAHNKELERDLLKEKELTDELQQQLKREKS 421
Query: 425 YLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAKLEIQHLKSEQVSLQLLLEE 484
YLQQ TEEKSLLQ ELEHKHIEFEKTH LLQDKASELVEAKLEIQHLKS+QVSLQLLLEE
Sbjct: 422 YLQQATEEKSLLQNELEHKHIEFEKTHKLLQDKASELVEAKLEIQHLKSQQVSLQLLLEE 481
Query: 485 KDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTAMLKEKDECVQIMQNELNDTKLK 544
KDLEILDAQKKIE LNQEIIELQTLMS KEAQL+QTTAMLKEKDE V+IMQNELNDTKLK
Sbjct: 482 KDLEILDAQKKIENLNQEIIELQTLMSSKEAQLDQTTAMLKEKDERVEIMQNELNDTKLK 541
Query: 545 ISEAEAAVEHIVDLTNKLVISIK-GGDDNDVLNLNNDLSLNLQQQLFMKPTDNMKLQKKQ 604
ISEAEAAVEHIVDLTNKLVISIK G DDN VL LN +LSLNLQQQLF KPTDN++LQKKQ
Sbjct: 542 ISEAEAAVEHIVDLTNKLVISIKDGDDDNSVLKLNENLSLNLQQQLFKKPTDNLRLQKKQ 601
Query: 605 LETELELTKESLRQKEMEILAAERALTVKDEELKTVQERLDAKEKEFEKMKEEMDEEAKD 664
LETELELTKESLR+KEMEILAAER+LTVKDEELKTVQERL+AKEKEFEKMKEEMDEE K
Sbjct: 602 LETELELTKESLRRKEMEILAAERSLTVKDEELKTVQERLEAKEKEFEKMKEEMDEETKH 661
Query: 665 TRKLYTLAQDNVGGEDNNI-EDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELLNKASH 724
+TL QDN GGEDN I D I RLQ EAAQLE+EAATSALQKLTDMSR+LLNKA
Sbjct: 662 ----HTLPQDNAGGEDNKIGGDLEIERLQFEAAQLEIEAATSALQKLTDMSRDLLNKAGR 721
Query: 725 SLEVDIGSRSIHIQQHNDDEDDVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQLLKEAG 784
SLE DI SRSIHIQQH+D D VD +DNNN+RFNEVKVEVSRLSSLTEQLLKEAG
Sbjct: 722 SLEADIESRSIHIQQHDDYNDGVDD-----IDNNNSRFNEVKVEVSRLSSLTEQLLKEAG 777
Query: 785 IFVDAN 789
IFVDA+
Sbjct: 782 IFVDAD 777
BLAST of Bhi09G001921 vs. ExPASy TrEMBL
Match:
A0A6J1IMM0 (myosin-11 OS=Cucurbita maxima OX=3661 GN=LOC111476606 PE=4 SV=1)
HSP 1 Score: 1119.8 bits (2895), Expect = 0.0e+00
Identity = 664/789 (84.16%), Postives = 709/789 (89.86%), Query Frame = 0
Query: 1 MASPVATHLNISTSSSLCRSKRLSLRLSRNRTKFITSTTQKRRSHSLKVVQSVLNNCKSN 60
MAS A HL+IS SSSL SKR SLRL+RN+TKF +STT++RRSHSLKVVQSVLN KSN
Sbjct: 1 MASSAAIHLSISASSSLRFSKRSSLRLTRNQTKFTSSTTRERRSHSLKVVQSVLNTSKSN 60
Query: 61 LNDNGASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESDLQAALAVLKK 120
LNDNGASEEAKLLLERLYAQTQRLEEHV+KD HSPQDVWLGLSLENLES LQAALAVLKK
Sbjct: 61 LNDNGASEEAKLLLERLYAQTQRLEEHVSKDSHSPQDVWLGLSLENLESGLQAALAVLKK 120
Query: 121 KEEDLQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQANLDLASQARQ 180
KEEDLQDAERTIL ERSQLNNAREKLEKQEEE+TAAYHKQQ+LEDELKQANL+LASQARQ
Sbjct: 121 KEEDLQDAERTILSERSQLNNAREKLEKQEEEITAAYHKQQELEDELKQANLNLASQARQ 180
Query: 181 IDELKLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEEAVKTDFELKSKSQLLTEA 240
IDELKLQIREKD IAAVES LTLKEDELK+MR DLAKKSEEA+KTD ELKSKS+LL EA
Sbjct: 181 IDELKLQIREKDERIAAVESTLTLKEDELKKMRADLAKKSEEAIKTDSELKSKSKLLNEA 240
Query: 241 IEVVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEERLEVVEKNLENRTMEWLLAQEEL 300
EVVKRQE EL+MLKKAVLEKE+ELE SVKLQKLEEE+L+VVEKNLE RT EWLL QEEL
Sbjct: 241 NEVVKRQEFELEMLKKAVLEKEQELEASVKLQKLEEEKLKVVEKNLEKRTTEWLLVQEEL 300
Query: 301 KKMRKEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILGRQM 360
KK+RKE SKKAV MNKTVNDFNRVKKLLAD KSELVSSQKSLVS+RKKIEEQEDILG+QM
Sbjct: 301 KKLRKEASKKAVGMNKTVNDFNRVKKLLADAKSELVSSQKSLVSARKKIEEQEDILGKQM 360
Query: 361 AELEEQKKGINAYMSSLKDAQIEVESERVKLRVAEAHNKELECDLLMEKELTDELQQQLK 420
ELEEQKKGINAYMSSL+DAQIE+ESERVKLRVAEA NKELE L MEKELTDEL+QQLK
Sbjct: 361 TELEEQKKGINAYMSSLEDAQIEIESERVKLRVAEAQNKELERVLFMEKELTDELRQQLK 420
Query: 421 KEKSYLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAKLEIQHLKSEQVSLQL 480
KEKS LQQ TEEKSLLQKEL+HKHIEFEKTH LLQDK+SELVEA LEIQ LKS+QVSLQL
Sbjct: 421 KEKSNLQQVTEEKSLLQKELKHKHIEFEKTHNLLQDKSSELVEANLEIQRLKSQQVSLQL 480
Query: 481 LLEEKDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTAMLKEKDECVQIMQNELND 540
LLEEKDLEI DAQKKIE LNQEIIELQT+MS KEAQL+QTT MLKEKDECVQIMQNELND
Sbjct: 481 LLEEKDLEIHDAQKKIEILNQEIIELQTIMSSKEAQLSQTTVMLKEKDECVQIMQNELND 540
Query: 541 TKLKISEAEAAVEHIVDLTNKLVISIKGGDDNDVLNLNNDLSLNLQQQLFMKPT-DNMKL 600
TKLKISEAEA V HIVDLTNKLVISI G DNDV LN+DLS+NLQQQ F +PT DNM L
Sbjct: 541 TKLKISEAEAVVGHIVDLTNKLVISINDG-DNDVSELNDDLSINLQQQFFKQPTDDNMGL 600
Query: 601 QKKQLETELELTKESLRQKEMEILAAERALTVKDEELKTVQERLDAKEKEFEKMKEEMDE 660
QKKQ+ETELELTKESLRQKEMEI AAERALTVKDEELKTV+ERLD KEKEFE M+EEM E
Sbjct: 601 QKKQIETELELTKESLRQKEMEIQAAERALTVKDEELKTVRERLDTKEKEFENMREEMGE 660
Query: 661 EAKDTRKLYTLAQDNVGGEDNNIEDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELLNK 720
EA D RKLY LA+D+VG + D AI RLQ+EAAQLEVEAATSALQKLTD+SRELLNK
Sbjct: 661 EANDLRKLYALAEDSVG-----VGDLAIERLQIEAAQLEVEAATSALQKLTDISRELLNK 720
Query: 721 ASHSLEVDIGSRSIHIQQHNDDEDDVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQLLK 780
ASHSLE DI +RSIH Q H+DD D D GG +DNN+ RFNEVK+EVSRLSSLTEQLLK
Sbjct: 721 ASHSLEGDIDTRSIHFQLHDDDNDVDDTRIGGGIDNNSQRFNEVKLEVSRLSSLTEQLLK 780
Query: 781 EAGIFVDAN 789
EAGIFVDA+
Sbjct: 781 EAGIFVDAD 783
BLAST of Bhi09G001921 vs. ExPASy TrEMBL
Match:
A0A6J1FCM9 (myosin-11-like OS=Cucurbita moschata OX=3662 GN=LOC111442800 PE=4 SV=1)
HSP 1 Score: 1119.8 bits (2895), Expect = 0.0e+00
Identity = 664/790 (84.05%), Postives = 709/790 (89.75%), Query Frame = 0
Query: 1 MASPVATHLNISTSSSLCRSKRLSLRLSRNRTKFITSTTQKRRSHSLKVVQSVLNNCKSN 60
MAS A HLNIS SSSL SKR SLRL+RN+TKF +STT++RRSHSLKVVQSVLN KSN
Sbjct: 1 MASSAAIHLNISASSSLRLSKRSSLRLTRNQTKFTSSTTRERRSHSLKVVQSVLNTSKSN 60
Query: 61 LNDNGASEEAKLLLERLYAQTQRLEEHVNKDPHSPQDVWLGLSLENLESDLQAALAVLKK 120
LNDNGASEEAKLLLERLYAQTQRLEEHV+KD HSPQDVWLGLSLENLES LQAAL VLKK
Sbjct: 61 LNDNGASEEAKLLLERLYAQTQRLEEHVSKDSHSPQDVWLGLSLENLESGLQAALTVLKK 120
Query: 121 KEEDLQDAERTILLERSQLNNAREKLEKQEEELTAAYHKQQKLEDELKQANLDLASQARQ 180
KEEDLQDAERTIL ERSQLNNAREKLEKQEEE+TAAYHKQQ+LEDELKQANL+LASQARQ
Sbjct: 121 KEEDLQDAERTILSERSQLNNAREKLEKQEEEITAAYHKQQELEDELKQANLNLASQARQ 180
Query: 181 IDELKLQIREKDGGIAAVESALTLKEDELKRMRDDLAKKSEEAVKTDFELKSKSQLLTEA 240
IDELKLQIREKD IAAVES LTLKEDEL +MR DLAKKSEEA+KTD ELKSKS+LL EA
Sbjct: 181 IDELKLQIREKDERIAAVESTLTLKEDELTKMRADLAKKSEEAIKTDSELKSKSKLLNEA 240
Query: 241 IEVVKRQEVELQMLKKAVLEKEKELELSVKLQKLEEERLEVVEKNLENRTMEWLLAQEEL 300
EVVKRQE ELQMLK AVLEKE+ELE+SVKLQKLEEE+LEVVEKNLE RT EWLL QE+L
Sbjct: 241 NEVVKRQEFELQMLKNAVLEKEQELEVSVKLQKLEEEKLEVVEKNLEKRTTEWLLVQEDL 300
Query: 301 KKMRKEGSKKAVEMNKTVNDFNRVKKLLADVKSELVSSQKSLVSSRKKIEEQEDILGRQM 360
KK+RKE SKKAVEMNKTVNDFNRVKKLLAD KSELVSSQKSLVS+RKKIEEQEDILG+QM
Sbjct: 301 KKLRKESSKKAVEMNKTVNDFNRVKKLLADAKSELVSSQKSLVSARKKIEEQEDILGKQM 360
Query: 361 AELEEQKKGINAYMSSLKDAQIEVESERVKLRVAEAHNKELECDLLMEKELTDELQQQLK 420
ELEEQKKGINAYMSSL+DAQIE+ESERVKLRVA+A NKELE L MEKELTDELQQQLK
Sbjct: 361 TELEEQKKGINAYMSSLEDAQIEIESERVKLRVAQAQNKELERVLFMEKELTDELQQQLK 420
Query: 421 KEKSYLQQETEEKSLLQKELEHKHIEFEKTHTLLQDKASELVEAKLEIQHLKSEQVSLQL 480
KEKS LQQ TEEKS+LQKELEHKHIEFEKTH LLQ KASELVEA LEIQ LKS+QVSLQL
Sbjct: 421 KEKSNLQQVTEEKSILQKELEHKHIEFEKTHNLLQGKASELVEANLEIQRLKSQQVSLQL 480
Query: 481 LLEEKDLEILDAQKKIEELNQEIIELQTLMSIKEAQLNQTTAMLKEKDECVQIMQNELND 540
LLEEKDLEI DAQKKIE LN+EIIELQTLMS KEAQL+QTT MLKEKDECVQIMQNELND
Sbjct: 481 LLEEKDLEIHDAQKKIEILNEEIIELQTLMSSKEAQLSQTTVMLKEKDECVQIMQNELND 540
Query: 541 TKLKISEAEAAVEHIVDLTNKLVISIKGGDDNDVLNLNNDLSLNLQQQLFMKPT-DNMKL 600
TKLKISEAEA V HIVDLTNKLV+SI GDD D LN+DLS+NLQQQ F +PT DNM L
Sbjct: 541 TKLKISEAEAVVGHIVDLTNKLVMSINDGDD-DASELNDDLSINLQQQFFKQPTDDNMGL 600
Query: 601 QKKQLETELELTKESLRQKEMEILAAERALTVKDEELKTVQERLDAKEKEFEKMKEEMDE 660
QKKQLETELELTKESLRQKEMEI AAERALTVKDEELKTV+ERLD KEK+ E MKEEMDE
Sbjct: 601 QKKQLETELELTKESLRQKEMEIQAAERALTVKDEELKTVRERLDTKEKDLENMKEEMDE 660
Query: 661 EAKDTRKLYTLAQDNVGGEDNNIEDFAIGRLQVEAAQLEVEAATSALQKLTDMSRELLNK 720
EAKD RKLY LA+D+VG + D AI +LQ+EAAQLEVEAATSALQKLTD+SRELLNK
Sbjct: 661 EAKDLRKLYALAEDSVG-----VGDLAIEKLQIEAAQLEVEAATSALQKLTDISRELLNK 720
Query: 721 ASHSLEVDIGSRSIHIQQHNDDED-DVDAGTGGWVDNNNTRFNEVKVEVSRLSSLTEQLL 780
ASHSL+ DI +RSIH Q H+DD D DVD GG +DNN+ RFNEVK+EVSRLSSLTEQLL
Sbjct: 721 ASHSLKGDIDTRSIHFQLHDDDSDNDVDTRIGGGIDNNSQRFNEVKLEVSRLSSLTEQLL 780
Query: 781 KEAGIFVDAN 789
KEAGIFVDA+
Sbjct: 781 KEAGIFVDAD 784
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
AT4G32190.1 | 3.3e-163 | 47.76 | Myosin heavy chain-related protein | [more] |
Match Name | E-value | Identity | Description | |
Q54G05 | 4.7e-05 | 21.50 | Putative leucine-rich repeat-containing protein DDB_G0290503 OS=Dictyostelium di... | [more] |
Q02224 | 2.3e-04 | 23.20 | Centromere-associated protein E OS=Homo sapiens OX=9606 GN=CENPE PE=1 SV=2 | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LDN9 | 0.0e+00 | 86.04 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G877610 PE=4 SV=1 | [more] |
A0A1S3CSI3 | 0.0e+00 | 86.35 | putative leucine-rich repeat-containing protein DDB_G0290503 OS=Cucumis melo OX=... | [more] |
A0A5A7T7H9 | 0.0e+00 | 85.37 | Putative leucine-rich repeat-containing protein OS=Cucumis melo var. makuwa OX=1... | [more] |
A0A6J1IMM0 | 0.0e+00 | 84.16 | myosin-11 OS=Cucurbita maxima OX=3661 GN=LOC111476606 PE=4 SV=1 | [more] |
A0A6J1FCM9 | 0.0e+00 | 84.05 | myosin-11-like OS=Cucurbita moschata OX=3662 GN=LOC111442800 PE=4 SV=1 | [more] |