Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
AAGAAAATCCCTTTCTCCTCAGAACTCCATTGGAGGCTGTCTTGTAGCTTTCAAAGCTCCGAGGGGTTTTCTCTGCTGTGGCCTCGATGCCATTCCCATCCCAGTAGCTCTCTCGGTACTGGGACGATGCCGACCTCAGCTTCTTCTGCGATTCCGAAGCCCTCCTCCGAGCTCGTCAGCCGATTGACTGCCGCCGACGCCGCCGTGAAGCTGAAGGCATTGAGAGACATCAAGAACCAGATCATCGGTAACCGCACCAAGAAGCTGTCCTTCATCAAGCTCGGCGTTGTTCCTCACGTTGCTGCAATTCTCTCCTCCACTTCCGACCCCAACATTCTCGTCCAGTCCGCTGCTGTTCTTGGCAGCTTTGCCTGCGGCGTCGACGCCGGTGTCTCCGCCGTTCTCGATGCCGGAGCCTTCCCTCGACTGCTTCGATTGCTTGCTGATCCCGATCCCAAGGTGCTTCTCCTCTCCTCTCGTCTCTTGTTCTTTCTATCCGTTTTGCTCCAATTTGGGAAGTGCATTTCTGGGGAAATTGAGGAAGAGTATCTAGTATTGTTCTATTGAAATTTGCATTTATGCGCTTAATTTTAAGTTACAAATGCAAATTCCTTTGTTTGATTTTAAATTGCATTTCTGTGGCGTCGACCCTTCTTCTAATCTTCTCTATTATGACCTCTTCGAACTACAAGAAATGCTCTGTTAAAAGTGAAGAAGAATGTCTCCTTGGAACTGGATAATAGCTCCTTTTTCTACTTATAATCGTCTTTTTTGGATTAGTTATAGCATATTCCTTAAGTACTCTTTTTGAACTACCAATCAGAGTCTAGAGTAGATGGCTACATCTTTTGAAAGGAAAAACAAATTTAAGCAATAATGGAGTTCTTCCTTTCAAAACCCGTACATCCAATTTGACCGTCCTTTTTATACACCACGCCTTAGGTTTTTACAAGAATTTGTTATGTACCTGCGTCCACAACCTTTTGCTTTTGTCTTCTTTATTTTCGGTATGCAATTTTTTCAACTTTCTTTTCAATCCTAATGTGAACTTTGATGGGTATATATATATATGTTCTGTTTTTAAGGTTGTGGACGCAGGAGCTCGTTCATTAAGGATGGTTTATCAATCAAAGTTGGCTCCCAAGTATGATTTCCTTCAGCAGGAAAACACTAAATTTCTTCTTTCATTATTGAATAGTCAAAATGAAAATGTGACTGGACTTGGGGCAAGCATCATTATTCATTCCTGTGACACAATTGCGGAGCAGAAGGCATTGTGTAATGGTGGAGTTCTAGAGAAGCTGATTGATCTTCTTGATGGTTCTCTAAGTCAAAGGGATGCTAGTTTAGAGTCTATTGCCACAATATTTAAAAATAATGTCGAAGCAATAGCAAAGTTCATGCAACCTGGTAGAGAAGATTGTTTGAGTTATATAATTGAATTGATGAAGGATAGAAATCCCAAGACAAGATTGCTAGCTTGCGTCTGCTTGATTGTTATGAGGAATTCATCACCTTGTTATCTACAAGATATAGGAATCAAAATGAAATTGATACATAGTTTGCTTGAGCTTCTTGATAATCCAGGTCAAGTTGGAGATGAAGCTTCCTTCGTTTTTTCGACTTTAATTGCTGAGAAGGAGGAGCTACAGAAACTAGCTTTTGAGGCAAATGCAATTGATAAGTTGTACAATCACTTGCAAAAGGATCAGTTGAGTCCTAGACGTTTCCAAGGAATATTGTTGGCCTTTTCTCATCTATGCTCAAAGTTGGAGAGCTGTAGGTCTAGATTTCTCTCTTTGCAGGTTTATTTCTTATTCTGATTGCATCAACTTTTTTGTTAAGTCAACTTATTGGTTTTTTTTTCTTTGTAACATTTATCAATTACAATGCATTTAACTTGTGCAGTTTTCCTAGTCCAGAGGTTAGTCATGTTTTTCACTAAGTCAAATTGGCTAGGTACCTAAATTGATGTCTAAAATCGTTTGTTTGTCTATTTG
AATTTTTGTCTATGGTTATTTCTGCCCTAAGTTGATAGTTATCTTTATTATTAAAGATTTTTAAAAAGGTATCTGTTTTGCAAATGTGTCTGTCATCTGTTTTTAAGTCCTCAAAACTTTCCAAGGAAAAAATGGTTAAACTATATCTGCTGGTCCATTCCAATTGAATCAAAGGAAAAAAATGGAAAAAAAGAAAAGGAGGAAGAAGAAGTAACACATTTGTGGTGGTAACAATTATTGTGGCGGGTTCTCAAAATTTTCTCTTTATTTTACCTTTTGATATTTAACGAAGCTACTAACTGATGGAGGTTAACAAATCTGAATTTTGTTGGAATTTAACTTGCTTAGGTTTATGAATTTTTATTAATTTGCTTATAATCATGACATAGGTAATGAACATAGTGATTGATGCCCTACAACATGAAAGTAGTGACATACGTATTGCAGCTTGCACTTGCTTGAGAAGTGTCTCTAGATCAATCAAGGTATGTAATGCTATATTATTATTAGCTATTCTGATATTCATAGTCTTTCCTTTCCCCTGTTTTCTAATGGCATCATACTCAGAATTTGAGTGCAGGTTACTTTATGAACGAAACAGTTGTCCTTCCCGTTGTTCAGCTTTTACACAGTCCTTCTAATGCTGTTCAGGTAAACCTATTTATCTTGATATATCAAGTTCTCAAATTGTAGGAATCTTCCTACTTTTTAAGCCATTGAATGGTCAGGTAGAAAGAGGGGTTATTTGGTTGTTCAGTTATTTTTGGTTTTGCGCAATAATTATCATTTCATACTATTTAAACAGGTTGCAGCACTTGGTGCTCTTAGCAACATAGTTGTCGAATTTTCAACAAAGAGATCAATATTTATAGAATGTGGAGGTGTTAAAGAGCTGGTTCGATTATCAAAGTCTATGGACTTGGACATAAGGCTAAATGCATTGTGGGCTTTAAGGAATTTAATGTTCCTTGCAAACAGCATGTATAAATCAGGGATCTTCAGGGAGTTAACAGCTTCCTTGTTAGCCAGCCTTGTCTGTGGTACTGTTGAACTCTGTTGTCCATCTTTATGGTTGATTTCATCTTGTGTTTAATCAATACTGCTTTGTATGTTTATGTTTCAGATCCAGAGCCTTCCATACAAGAGCATGCTATGGCCCTTGTGCGCAATCTTATCAATGGATGTGAGGACTCAATTGAGTACGCGTTTGCTGAAGATGGTATCATATTGAATACTATTTGCCGGCAATTGAAAAGTATTTCAAGGGATGAAATTGGGGTCCAGGTTAGTTTTGTTTTGTTTTCAACTTTTGAATGTGATGTATGCTCAAAAGGAAAGGTTTATTTTGGAGAGGAAAAAGCTAAGAAATGGTTGCAAAGCAGTCTATTGAAAGGGCCGATTTTACAAAGTGGTGTTAGTTACTAGGAAACTCTGAGCTCAGCTTAAAATTGGCCTACAACAACCATGTTGAATAACCATACAGTAACGATGTTACGTAAATGCATAATACACACAAATTGATTAAACATGCTACGCCGAAGTATTTATGTTTTCCCTCCTGAAAGATCTCATAAGTAGGTCTGACACTCTGCGTTACTATTTTACATAGATACCAACAAACTAGAGAGATATTTCTATTTTTAAGGTCATGAGGACCCTTAACCTGGTCTCTATGATTCTGTTGAGGTTTTTTCTTTTGTATATAATATCCTTCATCAATAATATTATTTTCCTTTCGTTTTTTCTTTAAAAAGGAGCAAACAAAGATACAAATGGAAAATTGTCAGTTACAATTCAAAGTTTCACGTTAACTGAAGAAAGAATAAGGAAAATGACATGATTGCTTATGCTTTAAGCCTGTGATCTATATCTTACTTGGCTGTCTCTATAGATAGAAGCCGAAAATTTTAAAATATCTGGAAATTGGTTGAGCAAGTGACCTTGGGAAAACTTGCCAAAGGATGAACTCTCTCCTTCCGTAGGAGTTTCAATACCTTTCC
TCACTTTGTTTTGTGGAATTTGTCTAGAATACCTATCTTTTACATGTCTACCTGACTCTTATCAAGAAAGCTGATATTAAATGTGGAAGGAAGAATCCAAAATTTATTAAGGGAGCCAATATTGATGGGAAGACTGAGAATGATGGTGGATTTCTTTGATAGGATGATAGGAATAAAAAAAGCGAAGTTCTTTAGGTGCTTATATTTGACTGAAATGTTAAAGTGCATTTTAAAGAAACCATGATTTCATAAATGACATGTGTTCTCGATCATCTGATGGATAATGTCCATGTTGTCTGGCTATTAAACAATTAAGTTTCAGCAGATTCATTTTCAATGATACTTGAGGAATTTTATACGACTATAAAATCTGTTGTTATGGTATTGAGTATATCAATTAACTATTAAGTAAACCATATCACAATGCAATTATTAAAGCATGACTGATGCAGGGAATGTATGTACTTTGTAATGTTGCAAGTGGAAATGAGTTCCATAAGGAAAGGCTAATGAAACAACTATTTCCACATGGAGATGACGTGATCCAGTCATTTGTGGTGAAGTTTTTGCAGAGTGACAACAGTCAACTTCGAATAGCTGCTGTCTGGGTAATAATAAATCTTTCTCTGCCTTCAAGTCCACGTGCACTAGATAGGGTTACAAAGCTACAGAATGCTGGTATTGTTTCTCAAATGAAGAATATGGTCAATGATCCGTGCCTGGATGTTAAGGTCATCTATTTCCTCATTTAAGTCCCACTTCCCCCTTCCCCCTTCTCCCCTTTCACCTCTATATTATCTTAATTCCTCGTCTCTTCTCCTTTACAGCTCCGTGTGAGAACCGTGCTTGGACAACTAATGGCTTTTGGTGATGGTATTACATTATAAAATTTCCATCATCAAAACCCAAGAGAAGTAGAGCGCTGCTAGCAGCTTTGTTTTTCATCCACCTTTTCTGTAGTATATTGTTGAGTTTTTGATCGGCGGCTTTTGATCCATCTGGTATGTTTGAACTTTGTTAGTTTATTTCTATTTTTCCCTTCCCAGATGTCCAAGCGAACTTAGGTTATTTATTATTTCATTGAGCAGCTGTGACAGATAATGTACTTGTGAAATATAACCAAAATGTCACACTTTGCTTATGTTCAAAAGTCGCAGGAAAAATTTTCTCTTAAAAAAAAAGTAATTTTGAGTGCTCGTATGTTACTTTATAAGTGAATCTTTTTGTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTGATGGATAACTTTTACGGTTCTCTCTGTGTCTCTCCAAAGAGATTTGTGAAACTTGAGCAAATCTTGATTTGCTGTGAAATATTTTATGAATAAAGAAATGTAACGTGAGCTCTAACCTACCAAGTGTTTGCTTCTTGTCATTGGCCTTTTGCAGGAAAACATGAAAGCTCCAGTATATTCCACGGGCGGCTCTTGATCTCTAGGTTACTCGTACAAATCTTTTTTGGATGCATACCAGCCACTTTTGGTGGCAATTTTGATCTCTCTCTCTCAAATTCTTCTTTTCCTTCTTTTTGAAATCAAATGGCCGTATGATTAGTTTGGCTTGAAGTGTAATATAAGACTCGTAAATATTGCCTATCAAAAGTTCCATTATTCATCAGGTTATCTCAACTGCTGTTCCGTTTCGAGTGTTTAGTATCTAGGCAACAAATTGTCGCTTTCATACATTGTTCCAGCGGTTACCAAAACCCAGTACCATGCGTCTTTACGGGAAATATATCCTGAAGCCTTGTTAAGGAAAAACACCGCAATCCAAATGAATGACTGTTTAGCTAACATGTACCTTTTCAGGGATAGCTTATATTATTCAAGCATCCAAGGAGAGTAATGCAATTTTGGCTGATGTCTCCTTTCCTTCGAACAGATGCAAACTTCCAAGTAATTGTTGTTATAACATGGTCCAGATACTATACTTAATCCCCCCTTGTTTCCAATTTAATTTGTGTTGATTTGGAATAGTTGATGAATATGATTTAAGTTCTATCAAGGCCATTTATATAAACAAATTGAGAAAATCATATGAATCAGTCTTTTGAGCTTAAATTTGCAACTCTTATGTTAGAACATGGGAAAAGCTTGGAGGGTCTGTTGTGGTGTTGATCCATATTTATTCCACATAAATGTGGCTTTTGGGAGAAGAAGCATCATTATTCTATGATCCAGGAGAAAGGGATGCACTTGAGTCAACTAGATTGCTTATATTCCAACTTGTGTTAATAGTAATGGTCCAGTTCATTGTTTTCCAACTTTTTGTTTATCAATAGTAAAAATGAAACGTTGTTGAAGCAGCAGAATTTCAGATGGTTAAATTAGAGTATTGTTGTTGATTAATAGAACATAATCTCTAATGATTCGTCATAATTGTGGACAGGCCCCTTCTATGGTTGAATCCTCCAAGTGGCTTGTTTGAAGTGCTGGATAGTTGATGGAATAACGCACTGGGCTCTGTTTTACAGTGTACAAAGATGCAATGTTCGAGGTACATAGTACATTTTACAGTTGAATTGACTTCAATGATATATTTACACAAAATTAGTGTTTGCTTTGATCACAATTCATTAATTTCATTGCCCTGTCTTCTTTCGTTCTCATTTCAAGTTTTGCTCTATGATG
TTCAAAGTAGGTCTAACATGATTTCCTACTTCTATTGACAACTATTTAGCAAAGTACTTGAGTAAATGCATCTTTTATCTGTTGATCTATTTGGTGTATGTGTGTATGATATGTTTTAGGGAAGAAAACACTTTGGTTTGGAGGAGGTTTGAGCAATTTGTGGCTCAATAATTTCTTTTGAGAACTACTGGCCAAGTTAAAAAAATATCTTTGGAAAAGTGTGACTGATTTATAGACTTATAATCTGTTCTCTTTCTCTTGCTTGAATAATCAATTAGTTCTGTTCTGCATATTACTAAAGAAAACTTCTGTTATTCTCAAAGTCATACTGAATAACTCACTGAAGAATCTCAAAACTCAGGCAAAATCAAGCACAATCAAGTTAAAGAAAACATCTTAATGGCCATTAGGCTAAACAAGCTGTGCAAACAAAGAAACAACCAGTAAATTGCACAATTATTGCTCATTCTGGTACCATCTCATCCCAATATGCATGATTTATAGAAGCACAAGTTCTGAGAAAATCCCAGCCCTACACTATACCATCAGAATAACCGAAATGGAGTCTGAATATATAAACTTTTCTACATAATATAAAGATTAATTGCTTACATATCTTCAACTAAAAAAAGACTGCCTCACTGAACTAAGAATACAATCGGCAACTCAACAATCAAACTAATCACGATTGAAGGATGCAATACAAGGAGAATTGACAAGGGAGAGAGAACTCAAAATTACCCATACTTCCTTAGCCAATCTTCTCTGTGTTCGTGTCATCTGAAGGCCATATAACAGTCTGCATAAGTCGTTCACGCATTAATTCTACGTTTTCGTCCGAGATCTCTTGGTATATTTCAGCATGGTTACATTTCTACACCACAGAAGAAATAAATGTTTGCAAACTTTGTTGCATGATCTTCCCAAAAAAAAAAGAAAAAAAAAAAGATTCTGTATTGAAGGAACATTACAATGATAGCAATACCTTCACCCATTTTCCATATAGGTGGAGACGTGTGATCATTACTCTTTCAGCAAGATCTTGTTTTTCCTATCAATTAGAAAAATGATGATCCAGGTTTAGTAAAACGCTATTTCACAAGGTGTGCCTCGAGACCGTAAATGACCTTAAATCTTTGTGTCCTAACATCAAACTGGAATGTTTGGATTCAGAAGTAAAAGGAATCATTTTAGCTAACTTTGTGTCAAGCAAAAGGACCACACTTGGACACAAGCCTTTGTGTAACTTGGCAAGTTATGGAGCATTGAATACACTTAGAACAACCATTTTTCGAGTGTATAACTCAAGAATGTGTAGATATGTGGTGCATTTCATATTACTTCGTAGCAGGTTTGATCAAAAGATACAGACTATAATGAAACTTATAAAAGTAAACTAAACTAGAAGTTAAAGCATGGAAACTAAATACACAATTTAACCATACTTTGTAACATCTGTGTGCTTATATTAGGCCTTCCCTTTCTACTGAGATAGCAAAATGCTAAAAACTAAAGCCGGTGTTAAACATGAGATTGTTAGAACAAGTCAGTCATAGTAAGAATTCAATGGAACGTTCGGGAGTGATTTTGAAATGGTTAAAATCTTCCGTATCATATTCAAAATCACTAAAAAACATGGATTTAATCCTTCAAAATCATTCCAAAATATACTCTAAAACCAATAGTGAAAAACATGTACCAAGCATAGATTGGTACAAGTCATGAAGTAGGTATGAACTTTTAAGAAGTACCTTTACAAGAGTGCGCATGAAATGTTTTCCATCTGATGGCTTGTTAGATAACAGAAAACTGTTGGAGAAGAATAGAACAGGATCAAAATTCAAGAACACGACCAAACTAGCATGAACAATTTGACAGCTTCAACTTGAGCTACCATGACAAAAAATACTGCAACTATTTCTAAACTGACTCCAGTTCCTGTAACACAGAAAAATGGCCAGTCTCAATGTGTAAAAATCGATGGCACTTACTCATAAAGC
CACCTGTACTGTGGGGGATTCATTTCATAGAGCTGGTTCATAACAGTCCTCACAGCCTTGTATGTAAAATAATTAAGTATTTGCTGCAGCCAGAGAAATCTCAGTCAGTTTAGTAATTAATTTGTAGGCAAATGAGTATGCAAAGGATGGGAGAAGGATAAAGCTCAAGAAACTGGAAAAACAACAAAATCATATTAAACTTGACCAGAACAAAATCAATGGAAAACTCAAGTCTCAAGCATATCCCATGTTATTTAAAAGAACATTGCCATGCATCAACCTGATGGCCGTTGTATTTCATGTGTATCTTACTGAGTTACGAGTGTGGTTATGGCAGCTGCCTATGATACTAGGACCAATAGGTCAAAAATGAGTTTGTGTTGCAGGTGTTGAACGATCTACTCTACACAAGTGAAGACGTTTGGTTCTCCATGGGAGCCTTCCCCCCTCTCTGCACATGGTTCACGAACCCACTTTCTGACTAAGCCTCCTAATCATAGACGCAGCAAAAAAAGAAAAAAAAGGGTCAAAGGTTTCTCACCTGACCCCATCCAACGACTTTAGCGTCAAAATCTCGCAGTTGAGCTGTCTGCGGTCGATGGTGCATCCGGTAAGCTCTTTCGGGCCCTCTTTTCAGTAGATGAAAATGGAATTGAAAGAACCCCATGCCAACATTTCAT
mRNA sequence
AAGAAAATCCCTTTCTCCTCAGAACTCCATTGGAGGCTGTCTTGTAGCTTTCAAAGCTCCGAGGGGTTTTCTCTGCTGTGGCCTCGATGCCATTCCCATCCCAGTAGCTCTCTCGGTACTGGGACGATGCCGACCTCAGCTTCTTCTGCGATTCCGAAGCCCTCCTCCGAGCTCGTCAGCCGATTGACTGCCGCCGACGCCGCCGTGAAGCTGAAGGCATTGAGAGACATCAAGAACCAGATCATCGGTAACCGCACCAAGAAGCTGTCCTTCATCAAGCTCGGCGTTGTTCCTCACGTTGCTGCAATTCTCTCCTCCACTTCCGACCCCAACATTCTCGTCCAGTCCGCTGCTGTTCTTGGCAGCTTTGCCTGCGGCGTCGACGCCGGTGTCTCCGCCGTTCTCGATGCCGGAGCCTTCCCTCGACTGCTTCGATTGCTTGCTGATCCCGATCCCAAGGTTGTGGACGCAGGAGCTCGTTCATTAAGGATGGTTTATCAATCAAAGTTGGCTCCCAAGTATGATTTCCTTCAGCAGGAAAACACTAAATTTCTTCTTTCATTATTGAATAGTCAAAATGAAAATGTGACTGGACTTGGGGCAAGCATCATTATTCATTCCTGTGACACAATTGCGGAGCAGAAGGCATTGTGTAATGGTGGAGTTCTAGAGAAGCTGATTGATCTTCTTGATGGTTCTCTAAGTCAAAGGGATGCTAGTTTAGAGTCTATTGCCACAATATTTAAAAATAATGTCGAAGCAATAGCAAAGTTCATGCAACCTGGTAGAGAAGATTGTTTGAGTTATATAATTGAATTGATGAAGGATAGAAATCCCAAGACAAGATTGCTAGCTTGCGTCTGCTTGATTGTTATGAGGAATTCATCACCTTGTTATCTACAAGATATAGGAATCAAAATGAAATTGATACATAGTTTGCTTGAGCTTCTTGATAATCCAGGTCAAGTTGGAGATGAAGCTTCCTTCGTTTTTTCGACTTTAATTGCTGAGAAGGAGGAGCTACAGAAACTAGCTTTTGAGGCAAATGCAATTGATAAGTTGTACAATCACTTGCAAAAGGATCAGTTGAGTCCTAGACGTTTCCAAGGAATATTGTTGGCCTTTTCTCATCTATGCTCAAAGTTGGAGAGCTGTAGGTCTAGATTTCTCTCTTTGCAGGTAATGAACATAGTGATTGATGCCCTACAACATGAAAGTAGTGACATACGTATTGCAGCTTGCACTTGCTTGAGAAGTGTCTCTAGATCAATCAAGAATTTGAGTGCAGGTTACTTTATGAACGAAACAGTTGTCCTTCCCGTTGTTCAGCTTTTACACAGTCCTTCTAATGCTGTTCAGGTTGCAGCACTTGGTGCTCTTAGCAACATAGTTGTCGAATTTTCAACAAAGAGATCAATATTTATAGAATGTGGAGGTGTTAAAGAGCTGGTTCGATTATCAAAGTCTATGGACTTGGACATAAGGCTAAATGCATTGTGGGCTTTAAGGAATTTAATGTTCCTTGCAAACAGCATGTATAAATCAGGGATCTTCAGGGAGTTAACAGCTTCCTTGTTAGCCAGCCTTGTCTGTGATCCAGAGCCTTCCATACAAGAGCATGCTATGGCCCTTGTGCGCAATCTTATCAATGGATGTGAGGACTCAATTGAGTACGCGTTTGCTGAAGATGGTATCATATTGAATACTATTTGCCGGCAATTGAAAAGTATTTCAAGGGATGAAATTGGGGTCCAGGGAATGTATGTACTTTGTAATGTTGCAAGTGGAAATGAGTTCCATAAGGAAAGGCTAATGAAACAACTATTTCCACATGGAGATGACGTGATCCAGTCATTTGTGGTGAAGTTTTTGCAGAGTGACAACAGTCAACTTCGAATAGCTGCTGTCTGGGTAATAATAAATCTTTCTCTGCCTTCAAGTCCACGTGCACTAGATAGGGTTACAAAGCTACAGAATGCTGGTATTGTTTCTCAAATGAA
GAATATGGTCAATGATCCGTGCCTGGATGTTAAGCTCCGTGTGAGAACCGTGCTTGGACAACTAATGGCTTTTGGTGATGTTTTTGATCGGCGGCTTTTGATCCATCTGGAAAACATGAAAGCTCCAGTATATTCCACGGGCGGCTCTTGATCTCTAGGTTACTCGTACAAATCTTTTTTGGATGCATACCAGCCACTTTTGGTGGCAATTTTGATCTCTCTCTCTCAAATTCTTCTTTTCCTTCTTTTTGAAATCAAATGGCCGTATGATTAGTTTGGCTTGAAGTGTAATATAAGACTCGTAAATATTGCCTATCAAAAGTTCCATTATTCATCAGGTTATCTCAACTGCTGTTCCGTTTCGAGTGTTTAGTATCTAGGCAACAAATTGTCGCTTTCATACATTGTTCCAGCGGTTACCAAAACCCAGTACCATGCGTCTTTACGGGAAATATATCCTGAAGCCTTGTTAAGGAAAAACACCGCAATCCAAATGAATGACTGTTTAGCTAACATGTACCTTTTCAGGGATAGCTTATATTATTCAAGCATCCAAGGAGAGTAATGCAATTTTGGCTGATGTCTCCTTTCCTTCGAACAGATGCAAACTTCCAAGTAATTGTTGTTATAACATGGTCCAGATACTATACTTAATCCCCCCTTGTTTCCAATTTAATTTGTGTTGATTTGGAATAGTTGATGAATATGATTTAAGTTCTATCAAGGCCATTTATATAAACAAATTGAGAAAATCATATGAATCAGTCTTTTGAGCTTAAATTTGCAACTCTTATGTTAGAACATGGGAAAAGCTTGGAGGGTCTGTTGTGGTGTTGATCCATATTTATTCCACATAAATGTGGCTTTTGGGAGAAGAAGCATCATTATTCTATGATCCAGGAGAAAGGGATGCACTTGAGTCAACTAGATTGCTTATATTCCAACTTGTGTTAATAGTAATGGTCCAGTTCATTGTTTTCCAACTTTTTGTTTATCAATAGTAAAAATGAAACGTTGTTGAAGCAGCAGAATTTCAGATGGTTAAATTAGAGTATTGTTGTTGATTAATAGAACATAATCTCTAATGATTCGTCATAATTGTGGACAGGCCCCTTCTATGGTTGAATCCTCCAAGTGGCTTGTTTGAAGTGCTGGATAGTTGATGGAATAACGCACTGGGCTCTGTTTTACAGTGTACAAAGATGCAATGTTCGAGGTGTTGAACGATCTACTCTACACAAGTGAAGACGTTTGGTTCTCCATGGGAGCCTTCCCCCCTCTCTGCACATGGTTCACGAACCCACTTTCTGACTAAGCCTCCTAATCATAGACGCAGCAAAAAAAGAAAAAAAAGGGTCAAAGGTTTCTCACCTGACCCCATCCAACGACTTTAGCGTCAAAATCTCGCAGTTGAGCTGTCTGCGGTCGATGGTGCATCCGGTAAGCTCTTTCGGGCCCTCTTTTCAGTAGATGAAAATGGAATTGAAAGAACCCCATGCCAACATTTCAT
Coding sequence (CDS)
ATGCCGACCTCAGCTTCTTCTGCGATTCCGAAGCCCTCCTCCGAGCTCGTCAGCCGATTGACTGCCGCCGACGCCGCCGTGAAGCTGAAGGCATTGAGAGACATCAAGAACCAGATCATCGGTAACCGCACCAAGAAGCTGTCCTTCATCAAGCTCGGCGTTGTTCCTCACGTTGCTGCAATTCTCTCCTCCACTTCCGACCCCAACATTCTCGTCCAGTCCGCTGCTGTTCTTGGCAGCTTTGCCTGCGGCGTCGACGCCGGTGTCTCCGCCGTTCTCGATGCCGGAGCCTTCCCTCGACTGCTTCGATTGCTTGCTGATCCCGATCCCAAGGTTGTGGACGCAGGAGCTCGTTCATTAAGGATGGTTTATCAATCAAAGTTGGCTCCCAAGTATGATTTCCTTCAGCAGGAAAACACTAAATTTCTTCTTTCATTATTGAATAGTCAAAATGAAAATGTGACTGGACTTGGGGCAAGCATCATTATTCATTCCTGTGACACAATTGCGGAGCAGAAGGCATTGTGTAATGGTGGAGTTCTAGAGAAGCTGATTGATCTTCTTGATGGTTCTCTAAGTCAAAGGGATGCTAGTTTAGAGTCTATTGCCACAATATTTAAAAATAATGTCGAAGCAATAGCAAAGTTCATGCAACCTGGTAGAGAAGATTGTTTGAGTTATATAATTGAATTGATGAAGGATAGAAATCCCAAGACAAGATTGCTAGCTTGCGTCTGCTTGATTGTTATGAGGAATTCATCACCTTGTTATCTACAAGATATAGGAATCAAAATGAAATTGATACATAGTTTGCTTGAGCTTCTTGATAATCCAGGTCAAGTTGGAGATGAAGCTTCCTTCGTTTTTTCGACTTTAATTGCTGAGAAGGAGGAGCTACAGAAACTAGCTTTTGAGGCAAATGCAATTGATAAGTTGTACAATCACTTGCAAAAGGATCAGTTGAGTCCTAGACGTTTCCAAGGAATATTGTTGGCCTTTTCTCATCTATGCTCAAAGTTGGAGAGCTGTAGGTCTAGATTTCTCTCTTTGCAGGTAATGAACATAGTGATTGATGCCCTACAACATGAAAGTAGTGACATACGTATTGCAGCTTGCACTTGCTTGAGAAGTGTCTCTAGATCAATCAAGAATTTGAGTGCAGGTTACTTTATGAACGAAACAGTTGTCCTTCCCGTTGTTCAGCTTTTACACAGTCCTTCTAATGCTGTTCAGGTTGCAGCACTTGGTGCTCTTAGCAACATAGTTGTCGAATTTTCAACAAAGAGATCAATATTTATAGAATGTGGAGGTGTTAAAGAGCTGGTTCGATTATCAAAGTCTATGGACTTGGACATAAGGCTAAATGCATTGTGGGCTTTAAGGAATTTAATGTTCCTTGCAAACAGCATGTATAAATCAGGGATCTTCAGGGAGTTAACAGCTTCCTTGTTAGCCAGCCTTGTCTGTGATCCAGAGCCTTCCATACAAGAGCATGCTATGGCCCTTGTGCGCAATCTTATCAATGGATGTGAGGACTCAATTGAGTACGCGTTTGCTGAAGATGGTATCATATTGAATACTATTTGCCGGCAATTGAAAAGTATTTCAAGGGATGAAATTGGGGTCCAGGGAATGTATGTACTTTGTAATGTTGCAAGTGGAAATGAGTTCCATAAGGAAAGGCTAATGAAACAACTATTTCCACATGGAGATGACGTGATCCAGTCATTTGTGGTGAAGTTTTTGCAGAGTGACAACAGTCAACTTCGAATAGCTGCTGTCTGGGTAATAATAAATCTTTCTCTGCCTTCAAGTCCACGTGCACTAGATAGGGTTACAAAGCTACAGAATGCTGGTATTGTTTCTCAAATGAAGAATATGGTCAATGATCCGTGCCTGGATGTTAAGCTCCGTGTGAGAACCGTGCTTGGACAACTAATGGCTTTTGGTGATGTTTTTGATCGGCGGCTTTTGATCCATCTGGAAAACATGAAAGCTCC
AGTATATTCCACGGGCGGCTCTTGA
Protein sequence
MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAAILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSLRMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGVLEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTRLLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQKLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDALQHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPSNAVQVAALGALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIFRELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSISRDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWVIINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGDVFDRRLLIHLENMKAPVYSTGGS
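The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the short test fragments are illustrative; they are copied from the start and end of the CDS above rather than taken from any tool used by this database.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: NCBI translation table 1 applies to this nuclear gene.

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated with bases in T, C, A, G order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCCGACCTCAGCTTCTTCTGCG"))
# Last codons of the CDS, ending in the TGA stop codon.
print(translate("GGCGGCTCTTGA"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would be a quick consistency check on the annotation.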
Homology
BLAST of CmoCh10G009760 vs. ExPASy Swiss-Prot
Match:
Q2KI54 (Armadillo repeat-containing protein 8 OS=Bos taurus OX=9913 GN=ARMC8 PE=2 SV=1)
HSP 1 Score: 184.1 bits (466), Expect = 5.2e-45
Identity = 172/674 (25.52%), Positives = 306/674 (45.40%), Query Frame = 0
Query: 4 SASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAAIL- 63
S S + S V RL D L+ + D+KN +IGN +K + I LG VP + +L
Sbjct: 12 SVLSEVTASSRHYVDRLFDPDPQKVLQGVIDMKNAVIGNNKQKANLIVLGAVPRLLYLLQ 71
Query: 64 SSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSLRM 123
TS + + A VLGS A G + V ++LD P LL+ L PD K ++A R LR
Sbjct: 72 QETSSTELKTECAVVLGSLAMGTENNVKSLLDCHIIPALLQGLLSPDLKFIEACLRCLRT 131
Query: 124 VYQSKLAPKYDFLQQENT--KFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 183
++ S + P+ + L + T L++LL S++ I H C Q L N G
Sbjct: 132 IFTSPVTPE-ELLYTDATVIPHLMALL-SRSRYTQEYICQIFSHCCKGPDHQTILFNHGA 191
Query: 184 LEKLIDLLDG-SLSQRDASLESIATIFKNNVE---AIAKFMQPGREDCLSYIIELMKDRN 243
++ + LL S R +L+ + + N + + + G ++ L +D+
Sbjct: 192 VQNIAHLLTSVSYKVRMQALKCFSVLAFENPQVSMTLVNVLVDGELLPQIFVKMLQRDKP 251
Query: 244 PKTRLLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGD--EASFVFSTLIA 303
+ +L + CL M + D I +K + L+ + + + E + + LI
Sbjct: 252 IEMQLTSAKCLTYMCRAGAIRTDDNCIVLKTLPCLVRMCSKERLLEERVEGAETLAYLIE 311
Query: 304 EKEELQKLA-----------------FEANAID--KLYNHLQKDQLSPRRFQGILLAFSH 363
ELQ++A +AI K +H K R Q ++
Sbjct: 312 PDVELQRIASITDHLIAMLADYFKYPSSVSAITDIKRLDHDLKHAHELR--QAAFKLYAS 371
Query: 364 LCSKLESCRSRFLSLQ-VMNIVIDALQHESSDIRIAACTCLRSVSRSIKNLSAGYFMNET 423
L + E R + + + +M+ ++ L S +R+AA CL S+SRS++ L F +
Sbjct: 372 LGANDEDIRKKIIDTENMMDRIVTGLSESSVKVRLAAVRCLHSLSRSVQQLRTS-FQDHA 431
Query: 424 VVLPVVQLLHSPSNAVQVAALGALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRL 483
V P++++L + + + V A L N+++EFS + +E G V+ L L++S + +R+
Sbjct: 432 VWKPLMKVLQNAPDEILVVASSMLCNLLLEFSPSKEPILESGAVELLCGLTQSENPALRV 491
Query: 484 NALWALRNLMFLANSMYKSGIFRELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSI 543
N +WAL N+ F A K+ I R L+ L L+ D + ++ + L+RNL++
Sbjct: 492 NGIWALMNMAFQAEQKIKADILRSLSTEQLFRLLSDSDLNVLMKTLGLLRNLLSTRPHID 551
Query: 544 EYAFAEDGIILNTICRQLKSISRDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVI 603
+ I+ + L+ E+ Q + +L N+A G K+L DD++
Sbjct: 552 KIMSTHGKQIMQAVTLILEGEHNIEVKEQTLCILANIADGT------TAKELIMTNDDIL 611
Query: 604 QSFVVKFLQSDNSQLRIAAVWVIINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLD 649
Q + ++ + +L++AA++ I NL + +R KL++ GIV + + P +
Sbjct: 612 QK-IKYYMGHSHVKLQLAAMFCISNLVWNEEEGSQERQDKLRDMGIVDILHKLSQSPDSN 671
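The percentages in each HSP header are the match counts divided by the alignment length, which in BLAST output includes gap columns. A minimal sketch of that arithmetic, using the counts reported for the Q2KI54 hit above; the helper name `percent` is hypothetical.

```python
# Minimal sketch: reproduce the percentages shown in a BLAST HSP header.
# The counts below are copied from the Q2KI54 hit; percent() is an illustrative helper.

def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


identical = 172         # identical residue pairs
positives = 306         # identical plus conservatively substituted pairs
alignment_length = 674  # aligned columns, gaps included

print(percent(identical, alignment_length))  # 25.52
print(percent(positives, alignment_length))  # 45.40
```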
BLAST of CmoCh10G009760 vs. ExPASy Swiss-Prot
Match:
Q05AL1 (Armadillo repeat-containing protein 8 OS=Danio rerio OX=7955 GN=armc8 PE=2 SV=1)
HSP 1 Score: 181.8 bits (460), Expect = 2.6e-44
Identity = 169/672 (25.15%), Positives = 301/672 (44.79%), Query Frame = 0
Query: 4 SASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAAIL- 63
S S + S V RL D L+ + D+KN +IGN +K + I LG VP + +L
Sbjct: 12 SVLSEVTATSRHYVDRLFDPDPQNVLQGVIDMKNAVIGNNKQKANLIVLGAVPRLLYLLQ 71
Query: 64 SSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSLRM 123
S+S + + A VLGS + G + + +++D P LL+ L D ++A R LR
Sbjct: 72 QSSSTLELRTECAVVLGSLSMGTENNIKSLVDCHIIPALLQGLLCSDLIFIEACLRCLRT 131
Query: 124 VYQSKLAPKYDFLQQENT--KFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 183
V+ S + P L + T L+SLL S++++ I H C T Q L N G
Sbjct: 132 VFISPVTP-VQLLYTDPTVIPHLMSLL-SRSQHTQEYITQIFAHCCKTPEHQTVLFNHGA 191
Query: 184 LEKLIDLL-DGSLSQRDASLESIATIFKNNVE---AIAKFMQPGREDCLSYIIELMKDRN 243
++ + LL S R +L+ + + N + + + G + ++ + +D+
Sbjct: 192 IQNIAPLLISPSYKVRMQALKCFSVLAYENAQVSMTLVNVLVDGEQLSQVFVRMMQRDKP 251
Query: 244 PKTRLLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGD--EASFVFSTLIA 303
+ +L A CL M + +D I +K + L+ + + + E + + L+
Sbjct: 252 IEMQLTAAKCLTYMCRAGAIRTEDNCIVLKTLPCLVRMCSKERLLEERVEGAETLAYLME 311
Query: 304 EKEELQKLAFEANAIDKL----------------YNHLQKDQLSPRRF-QGILLAFSHLC 363
ELQ++A + + + L D Q ++ L
Sbjct: 312 PDIELQRIASVTDHLVSMLADYFKYPSSVSAITDIKRLDHDLKHAHELRQAAFKLYASLG 371
Query: 364 SKLESCRSRFLSLQ-VMNIVIDALQHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVV 423
S E R + + +M+ ++ L S +R+AA CL S+SRS++ L F + V
Sbjct: 372 SNDEDIRKKITETENMMDRIVSGLSESSIKVRLAAVRCLHSLSRSVQQLRTS-FHDHAVW 431
Query: 424 LPVVQLLHSPSNAVQVAALGALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNA 483
P+++LL + + V V A L N+++EFS + +E G ++ L L++S +R+N
Sbjct: 432 KPLMKLLQNAPDEVLVMASSTLCNLLLEFSPSKEPILESGVIELLCSLTQSDSSALRVNG 491
Query: 484 LWALRNLMFLANSMYKSGIFRELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEY 543
+WAL N+ F A+ K I R L L L+ DP+ ++ + L+RNL++ +
Sbjct: 492 IWALMNMAFQADQKVKVEIVRALGTEQLFRLLSDPDTNVLMKTLGLLRNLLSTRPHIDQI 551
Query: 544 AFAEDGIILNTICRQLKSISRDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQS 603
+ I+ + L+ E+ Q + +L N+A GN K+L DD++Q
Sbjct: 552 MSSHGKQIMQAVTLILEGEHSIEVKEQTLCILANIADGN------TAKELIMTDDDMLQK 611
Query: 604 FVVKFLQSDNSQLRIAAVWVIINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVK 649
+ ++ N +L++AA + I NL + +R KL+ G V + + D+
Sbjct: 612 -IKYYMGHSNVKLQLAATFCISNLIWNEEDGSQERQDKLREMGFVDILHKLTQASDPDLC 671
BLAST of CmoCh10G009760 vs. ExPASy Swiss-Prot
Match:
Q8IUR7 (Armadillo repeat-containing protein 8 OS=Homo sapiens OX=9606 GN=ARMC8 PE=1 SV=2)
HSP 1 Score: 181.0 bits (458), Expect = 4.4e-44
Identity = 171/674 (25.37%), Positives = 304/674 (45.10%), Query Frame = 0
Query: 4 SASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAAIL- 63
S S + S V RL D L+ + D+KN +IGN +K + I LG VP + +L
Sbjct: 12 SVLSEVTASSRHYVDRLFDPDPQKVLQGVIDMKNAVIGNNKQKANLIVLGAVPRLLYLLQ 71
Query: 64 SSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSLRM 123
TS + + A VLGS A G + V ++LD P LL+ L PD K ++A R LR
Sbjct: 72 QETSSTELKTECAVVLGSLAMGTENNVKSLLDCHIIPALLQGLLSPDLKFIEACLRCLRT 131
Query: 124 VYQSKLAPKYDFLQQENT--KFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 183
++ S + P+ + L + T L++LL S++ I H C Q L N G
Sbjct: 132 IFTSPVTPE-ELLYTDATVIPHLMALL-SRSRYTQEYICQIFSHCCKGPDHQTILFNHGA 191
Query: 184 LEKLIDLLDG-SLSQRDASLESIATIFKNNVE---AIAKFMQPGREDCLSYIIELMKDRN 243
++ + LL S R +L+ + + N + + + G ++ L +D+
Sbjct: 192 VQNIAHLLTSLSYKVRMQALKCFSVLAFENPQVSMTLVNVLVDGELLPQIFVKMLQRDKP 251
Query: 244 PKTRLLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGD--EASFVFSTLIA 303
+ +L + CL M + D I +K + L+ + + + E + + LI
Sbjct: 252 IEMQLTSAKCLTYMCRAGAIRTDDNCIVLKTLPCLVRMCSKERLLEERVEGAETLAYLIE 311
Query: 304 EKEELQKLA-----------------FEANAID--KLYNHLQKDQLSPRRFQGILLAFSH 363
ELQ++A +AI K +H K R Q ++
Sbjct: 312 PDVELQRIASITDHLIAMLADYFKYPSSVSAITDIKRLDHDLKHAHELR--QAAFKLYAS 371
Query: 364 LCSKLESCRSRFLSLQ-VMNIVIDALQHESSDIRIAACTCLRSVSRSIKNLSAGYFMNET 423
L + E R + + + +M+ ++ L S +R+AA CL S+SRS++ L F +
Sbjct: 372 LGANDEDIRKKIIETENMMDRIVTGLSESSVKVRLAAVRCLHSLSRSVQQLRTS-FQDHA 431
Query: 424 VVLPVVQLLHSPSNAVQVAALGALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRL 483
V P++++L + + + V A L N+++EFS + +E G V+ L L++S + +R+
Sbjct: 432 VWKPLMKVLQNAPDEILVVASSMLCNLLLEFSPSKEPILESGAVELLCGLTQSENPALRV 491
Query: 484 NALWALRNLMFLANSMYKSGIFRELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSI 543
N +WAL N+ F A K+ I R L+ L L+ D + ++ + L+RNL++
Sbjct: 492 NGIWALMNMAFQAEQKIKADILRSLSTEQLFRLLSDSDLNVLMKTLGLLRNLLSTRPHID 551
Query: 544 EYAFAEDGIILNTICRQLKSISRDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVI 603
+ I+ + L+ E+ Q + +L N+A G K L DD++
Sbjct: 552 KIMSTHGKQIMQAVTLILEGEHNIEVKEQTLCILANIADGT------TAKDLIMTNDDIL 611
Query: 604 QSFVVKFLQSDNSQLRIAAVWVIINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLD 649
Q + ++ + +L++AA++ I NL + +R KL++ GIV + + P +
Sbjct: 612 QK-IKYYMGHSHVKLQLAAMFCISNLIWNEEEGSQERQDKLRDMGIVDILHKLSQSPDSN 671
BLAST of CmoCh10G009760 vs. ExPASy Swiss-Prot
Match:
Q9DBR3 (Armadillo repeat-containing protein 8 OS=Mus musculus OX=10090 GN=Armc8 PE=1 SV=2)
HSP 1 Score: 180.3 bits (456), Expect = 7.5e-44
Identity = 170/674 (25.22%), Positives = 305/674 (45.25%), Query Frame = 0
Query: 4 SASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAAIL- 63
S S + S V RL D L+ + D+KN +IGN +K + I LG VP + +L
Sbjct: 12 SVLSEVTASSRHYVDRLFDPDPQKVLQGVIDMKNAVIGNNKQKANLIVLGAVPRLLYLLQ 71
Query: 64 SSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSLRM 123
TS + + A VLGS A G + V ++LD P LL+ L PD K ++A R LR
Sbjct: 72 QETSSTELKTECAVVLGSLAMGTENNVKSLLDCHIIPALLQGLLSPDLKFIEACLRCLRT 131
Query: 124 VYQSKLAPKYDFLQQENT--KFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 183
++ S + P+ + L + T L++LL S++ I H C Q L N G
Sbjct: 132 IFTSPVTPE-ELLYTDATVIPHLMALL-SRSRYTQEYICQIFSHCCKGPDHQTILFNHGA 191
Query: 184 LEKLIDLLDG-SLSQRDASLESIATIFKNNVE---AIAKFMQPGREDCLSYIIELMKDRN 243
++ + LL S R +L+ + + N + + + G ++ L +D+
Sbjct: 192 VQNIAHLLTSPSYKVRMQALKCFSVLAFENPQVSMTLVNVLVDGELLPQIFVKMLQRDKP 251
Query: 244 PKTRLLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGD--EASFVFSTLIA 303
+ +L + CL M + D I +K + L+ + + + E + + LI
Sbjct: 252 IEMQLTSAKCLTYMCRAGAIRTDDSCIVLKTLPCLVRMCSKERLLEERVEGAETLAYLIE 311
Query: 304 EKEELQKLA-----------------FEANAID--KLYNHLQKDQLSPRRFQGILLAFSH 363
ELQ++A +AI K +H K R Q ++
Sbjct: 312 PDVELQRIASITDHLIAMLADYFKYPSSVSAITDIKRLDHDLKHAHELR--QAAFKLYAS 371
Query: 364 LCSKLESCRSRFLSLQ-VMNIVIDALQHESSDIRIAACTCLRSVSRSIKNLSAGYFMNET 423
L + E R + + + +M+ ++ S +R+AA CL S+SRS++ L F +
Sbjct: 372 LGANDEDIRKKIIETETMMDRIVTGSSESSVKVRLAAVRCLHSLSRSVQQLRTS-FQDHA 431
Query: 424 VVLPVVQLLHSPSNAVQVAALGALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRL 483
V P++++L + + + V A L N+++EFS + +E G V+ L L++S + +R+
Sbjct: 432 VWKPLMKVLQNAPDEILVVASSMLCNLLLEFSPSKEPILESGAVELLCGLTQSENPALRV 491
Query: 484 NALWALRNLMFLANSMYKSGIFRELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSI 543
N +WAL N+ F A K+ I R L+ L L+ D + ++ + L+RNL++
Sbjct: 492 NGIWALMNMAFQAEQKIKADILRSLSTEQLFRLLSDSDMNVLMKTLGLLRNLLSTRPHID 551
Query: 544 EYAFAEDGIILNTICRQLKSISRDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVI 603
+ I+ + L+ E+ Q + +L N+A G K+L DD++
Sbjct: 552 KIMSTHGKQIMQAVTLILEGEHSIEVKEQTLCILANIADGT------TAKELIMTNDDIL 611
Query: 604 QSFVVKFLQSDNSQLRIAAVWVIINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLD 649
Q + ++ + +L++AA++ I NL+ + +R KL++ GIV + + +
Sbjct: 612 QK-IKYYMGHSHVKLQLAAMFCISNLTWNEEEGSQERQDKLRDMGIVDILHKLSQSADSN 671
BLAST of CmoCh10G009760 vs. ExPASy Swiss-Prot
Match:
Q5R6S3 (Armadillo repeat-containing protein 8 OS=Pongo abelii OX=9601 GN=ARMC8 PE=2 SV=1)
HSP 1 Score: 177.2 bits (448), Expect = 6.3e-43
Identity = 170/674 (25.22%), Positives = 302/674 (44.81%), Query Frame = 0
Query: 4 SASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAAIL- 63
S S + S V RL D L+ + D+KN +IGN +K + I LG VP + +L
Sbjct: 12 SVLSEVTASSRHYVDRLFDPDPQKVLQGVIDMKNAVIGNNKQKANLIVLGAVPRLLYLLQ 71
Query: 64 SSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSLRM 123
TS + + A VLGS A G + V ++LD P LL+ L PD K ++A R R
Sbjct: 72 QETSSTELKTECAVVLGSLAMGTENNVKSLLDCHIIPALLQGLLSPDLKFIEACLRCPRT 131
Query: 124 VYQSKLAPKYDFLQQENT--KFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 183
++ S + P+ + L + T L++LL S++ I H C Q L N G
Sbjct: 132 IFTSPVTPE-ELLYTDATVIPHLMALL-SRSRYTQEYICQIFSHCCKGPDHQTILFNHGA 191
Query: 184 LEKLIDLLDG-SLSQRDASLESIATIFKNNVE---AIAKFMQPGREDCLSYIIELMKDRN 243
++ + LL S R +L+ + + N + + + G ++ L +D+
Sbjct: 192 VQNIAHLLTSLSYKVRMQALKCFSVLAFENPQVSMTLVNVLVDGELLPQIFVKMLQRDKP 251
Query: 244 PKTRLLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGD--EASFVFSTLIA 303
+ +L + CL M + D I +K + L+ + + + E + + LI
Sbjct: 252 IEMQLTSAKCLTYMCRAGAIRTDDNCIVLKTLPCLVRMCSKERLLEERVEGAETLAYLIE 311
Query: 304 EKEELQKLA-----------------FEANAID--KLYNHLQKDQLSPRRFQGILLAFSH 363
ELQ++A +AI K +H K R Q ++
Sbjct: 312 PDVELQRIASITDHLIAMLADYFKYPSSVSAITDIKRLDHDLKHAHELR--QAAFKLYAS 371
Query: 364 LCSKLESCRSRFLSLQ-VMNIVIDALQHESSDIRIAACTCLRSVSRSIKNLSAGYFMNET 423
L + E R + + + +M+ ++ L S +R+AA CL S+SRS++ L F +
Sbjct: 372 LGANDEDIRKKIIETENMMDRIVTGLSESSVKVRLAAVRCLHSLSRSVQQLRTS-FQDHA 431
Query: 424 VVLPVVQLLHSPSNAVQVAALGALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRL 483
V P++++L + + + V A L N+++EFS + +E G V+ L L++S + +R+
Sbjct: 432 VWKPLMKVLQNAPDEILVVASSMLCNLLLEFSPSKEPILESGAVELLCGLTQSENPALRV 491
Query: 484 NALWALRNLMFLANSMYKSGIFRELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSI 543
N +WAL N F A K+ I R L+ L L+ D + ++ + L+RNL++
Sbjct: 492 NGIWALMNTAFQAEQKIKADILRSLSTEQLFRLLSDSDLNVLMKTLGLLRNLLSTRPHID 551
Query: 544 EYAFAEDGIILNTICRQLKSISRDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVI 603
+ I+ + L+ E+ Q + +L N+A G K L DD++
Sbjct: 552 KIMSTHGKQIMQAVTLILEGEHNIEVKEQTLCILANIADGT------TAKDLIMTNDDIL 611
Query: 604 QSFVVKFLQSDNSQLRIAAVWVIINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLD 649
Q + ++ + +L++AA++ I NL + +R KL++ GIV + + P +
Sbjct: 612 QK-IKYYMGHSHVKLQLAAMFCISNLIWNEEEGSQERQDKLRDMGIVDILHKLSQSPDSN 671
BLAST of CmoCh10G009760 vs. ExPASy TrEMBL
Match:
A0A6J1H8W8 (Armadillo repeat-containing protein 8 OS=Cucurbita moschata OX=3662 GN=LOC111461619 PE=4 SV=1)
HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00
Identity = 651/651 (100.00%), Positives = 651/651 (100.00%), Query Frame = 0
Query: 1 MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA
Sbjct: 1 MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
Query: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL 120
ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL
Sbjct: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL 120
Query: 121 RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 180
RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV
Sbjct: 121 RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 180
Query: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR 240
LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR
Sbjct: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR 240
Query: 241 LLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ 300
LLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ
Sbjct: 241 LLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ 300
Query: 301 KLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL 360
KLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL
Sbjct: 301 KLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL 360
Query: 361 QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPSNAVQVAALGALSN 420
QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPSNAVQVAALGALSN
Sbjct: 361 QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPSNAVQVAALGALSN 420
Query: 421 IVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIFRELT 480
IVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIFRELT
Sbjct: 421 IVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIFRELT 480
Query: 481 ASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSISRDEI 540
ASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSISRDEI
Sbjct: 481 ASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSISRDEI 540
Query: 541 GVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWVIINL 600
GVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWVIINL
Sbjct: 541 GVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWVIINL 600
Query: 601 SLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD 652
SLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD
Sbjct: 601 SLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD 651
BLAST of CmoCh10G009760 vs. ExPASy TrEMBL
Match:
A0A6J1JC02 (Armadillo repeat-containing protein 8 OS=Cucurbita maxima OX=3661 GN=LOC111485402 PE=4 SV=1)
HSP 1 Score: 1218.0 bits (3150), Expect = 0.0e+00
Identity = 635/651 (97.54%), Positives = 646/651 (99.23%), Query Frame = 0
Query: 1 MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSF+KLGVVPHVAA
Sbjct: 1 MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFVKLGVVPHVAA 60
Query: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL 120
ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL
Sbjct: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL 120
Query: 121 RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 180
RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASII+HSCDTIAEQKAL NGGV
Sbjct: 121 RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIVHSCDTIAEQKALYNGGV 180
Query: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR 240
LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR
Sbjct: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR 240
Query: 241 LLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ 300
LLACVCLIV+RNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ
Sbjct: 241 LLACVCLIVIRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ 300
Query: 301 KLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL 360
KLAFEANAID LYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL
Sbjct: 301 KLAFEANAIDTLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL 360
Query: 361 QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPSNAVQVAALGALSN 420
QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVV+LLH+PSNAVQVAALGA+SN
Sbjct: 361 QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVRLLHNPSNAVQVAALGAISN 420
Query: 421 IVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIFRELT 480
IVV+FSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLAN+MYKSGIF ELT
Sbjct: 421 IVVDFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANNMYKSGIFMELT 480
Query: 481 ASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSISRDEI 540
ASLLASLVCDPEPSIQEHAMALVRNLI+GCEDSIEYAFAEDGIILNTICRQL+SISRDEI
Sbjct: 481 ASLLASLVCDPEPSIQEHAMALVRNLIDGCEDSIEYAFAEDGIILNTICRQLQSISRDEI 540
Query: 541 GVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWVIINL 600
GVQGM+VLCNVASGNEFHKERLMKQLFP GDDVIQSFVVKFLQSDNSQLRIAAVWVIINL
Sbjct: 541 GVQGMHVLCNVASGNEFHKERLMKQLFPQGDDVIQSFVVKFLQSDNSQLRIAAVWVIINL 600
Query: 601 SLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD 652
SLPSSPRA DRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD
Sbjct: 601 SLPSSPRAPDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD 651
BLAST of CmoCh10G009760 vs. ExPASy TrEMBL
Match:
A0A5A7UMI4 (Armadillo repeat-containing protein 8 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold2116G00160 PE=4 SV=1)
HSP 1 Score: 1151.3 bits (2977), Expect = 0.0e+00
Identity = 596/651 (91.55%), Positives = 628/651 (96.47%), Query Frame = 0
Query: 1 MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
MPTSASSAIPKPSS+L++RLT ADAAV LKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA
Sbjct: 1 MPTSASSAIPKPSSDLIARLTVADAAVNLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
Query: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL 120
ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFP+LLRLLADPDPKVVDAGARSL
Sbjct: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPKLLRLLADPDPKVVDAGARSL 120
Query: 121 RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 180
RM+YQSKLAPK+DFLQQENTKFLLSLLNSQNENVTGLGASII+HSC+TIAEQKAL +GGV
Sbjct: 121 RMIYQSKLAPKHDFLQQENTKFLLSLLNSQNENVTGLGASIIVHSCETIAEQKALHDGGV 180
Query: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR 240
LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIA+FMQPGRE+CL+YIIELMKDRNPKTR
Sbjct: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIARFMQPGRENCLNYIIELMKDRNPKTR 240
Query: 241 LLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ 300
LLACVCLIV+RNSSPCYLQDIGIKMKLIHSLLELLD+P QVGDEA FVFSTLIAEKEELQ
Sbjct: 241 LLACVCLIVIRNSSPCYLQDIGIKMKLIHSLLELLDSPDQVGDEAPFVFSTLIAEKEELQ 300
Query: 301 KLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL 360
KLAFEANAIDKLYN+LQKDQLSPRRFQG+LLAFSHLCSKLESCRSRFLSLQVMNIVIDA+
Sbjct: 301 KLAFEANAIDKLYNYLQKDQLSPRRFQGMLLAFSHLCSKLESCRSRFLSLQVMNIVIDAI 360
Query: 361 QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPSNAVQVAALGALSN 420
+HESSDIRIAACTCLRSVSRSIKNLSAGYFMNE VVLPVV+LLH P N VQ+AALGA+SN
Sbjct: 361 KHESSDIRIAACTCLRSVSRSIKNLSAGYFMNEAVVLPVVRLLHDPCNDVQLAALGAISN 420
Query: 421 IVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIFRELT 480
IVVEFSTKRSIFI CGGVKELVRLSKSMDL+IRLNALWALRNLMFL N M K IF ELT
Sbjct: 421 IVVEFSTKRSIFIGCGGVKELVRLSKSMDLEIRLNALWALRNLMFLTNIMCKEAIFMELT 480
Query: 481 ASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSISRDEI 540
ASLLASLVCDPEPSIQEHAMALVRNLI+GCEDSIEYAFAED I+LNTI RQL++IS DEI
Sbjct: 481 ASLLASLVCDPEPSIQEHAMALVRNLIDGCEDSIEYAFAEDAIVLNTIGRQLQNISTDEI 540
Query: 541 GVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWVIINL 600
GVQGMYVLCNVASGNEFHKE +MKQLFP GDDVI SFVVKFLQSDNSQLRIAA+W IINL
Sbjct: 541 GVQGMYVLCNVASGNEFHKEGVMKQLFPRGDDVIHSFVVKFLQSDNSQLRIAAIWAIINL 600
Query: 601 SLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD 652
+LPSSPRALDRVTKL+NAGI+SQ+KNMVNDPCLDVKLRVRTVLGQLMAFGD
Sbjct: 601 TLPSSPRALDRVTKLRNAGIISQIKNMVNDPCLDVKLRVRTVLGQLMAFGD 651
BLAST of CmoCh10G009760 vs. ExPASy TrEMBL
Match:
A0A1S3BQ72 (Armadillo repeat-containing protein 8 OS=Cucumis melo OX=3656 GN=LOC103492030 PE=4 SV=1)
HSP 1 Score: 1151.3 bits (2977), Expect = 0.0e+00
Identity = 596/651 (91.55%), Positives = 628/651 (96.47%), Query Frame = 0
Query: 1 MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
MPTSASSAIPKPSS+L++RLT ADAAV LKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA
Sbjct: 1 MPTSASSAIPKPSSDLIARLTVADAAVNLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
Query: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL 120
ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFP+LLRLLADPDPKVVDAGARSL
Sbjct: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPKLLRLLADPDPKVVDAGARSL 120
Query: 121 RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 180
RM+YQSKLAPK+DFLQQENTKFLLSLLNSQNENVTGLGASII+HSC+TIAEQKAL +GGV
Sbjct: 121 RMIYQSKLAPKHDFLQQENTKFLLSLLNSQNENVTGLGASIIVHSCETIAEQKALHDGGV 180
Query: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR 240
LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIA+FMQPGRE+CL+YIIELMKDRNPKTR
Sbjct: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIARFMQPGRENCLNYIIELMKDRNPKTR 240
Query: 241 LLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ 300
LLACVCLIV+RNSSPCYLQDIGIKMKLIHSLLELLD+P QVGDEA FVFSTLIAEKEELQ
Sbjct: 241 LLACVCLIVIRNSSPCYLQDIGIKMKLIHSLLELLDSPDQVGDEAPFVFSTLIAEKEELQ 300
Query: 301 KLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL 360
KLAFEANAIDKLYN+LQKDQLSPRRFQG+LLAFSHLCSKLESCRSRFLSLQVMNIVIDA+
Sbjct: 301 KLAFEANAIDKLYNYLQKDQLSPRRFQGMLLAFSHLCSKLESCRSRFLSLQVMNIVIDAI 360
Query: 361 QHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPSNAVQVAALGALSN 420
+HESSDIRIAACTCLRSVSRSIKNLSAGYFMNE VVLPVV+LLH P N VQ+AALGA+SN
Sbjct: 361 KHESSDIRIAACTCLRSVSRSIKNLSAGYFMNEAVVLPVVRLLHDPCNDVQLAALGAISN 420
Query: 421 IVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIFRELT 480
IVVEFSTKRSIFI CGGVKELVRLSKSMDL+IRLNALWALRNLMFL N M K IF ELT
Sbjct: 421 IVVEFSTKRSIFIGCGGVKELVRLSKSMDLEIRLNALWALRNLMFLTNIMCKEAIFMELT 480
Query: 481 ASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSISRDEI 540
ASLLASLVCDPEPSIQEHAMALVRNLI+GCEDSIEYAFAED I+LNTI RQL++IS DEI
Sbjct: 481 ASLLASLVCDPEPSIQEHAMALVRNLIDGCEDSIEYAFAEDAIVLNTIGRQLQNISTDEI 540
Query: 541 GVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWVIINL 600
GVQGMYVLCNVASGNEFHKE +MKQLFP GDDVI SFVVKFLQSDNSQLRIAA+W IINL
Sbjct: 541 GVQGMYVLCNVASGNEFHKEGVMKQLFPRGDDVIHSFVVKFLQSDNSQLRIAAIWAIINL 600
Query: 601 SLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD 652
+LPSSPRALDRVTKL+NAGI+SQ+KNMVNDPCLDVKLRVRTVLGQLMAFGD
Sbjct: 601 TLPSSPRALDRVTKLRNAGIISQIKNMVNDPCLDVKLRVRTVLGQLMAFGD 651
BLAST of CmoCh10G009760 vs. ExPASy TrEMBL
Match:
A0A5D3D5L9 (Armadillo repeat-containing protein 8 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold204G00510 PE=4 SV=1)
HSP 1 Score: 1145.6 bits (2962), Expect = 0.0e+00
Identity = 596/655 (90.99%), Positives = 628/655 (95.88%), Query Frame = 0
Query: 1 MPTSASSAIPKPSSELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
MPTSASSAIPKPSS+L++RLT ADAAV LKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA
Sbjct: 1 MPTSASSAIPKPSSDLIARLTVADAAVNLKALRDIKNQIIGNRTKKLSFIKLGVVPHVAA 60
Query: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVDAGARSL 120
ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFP+LLRLLADPDPKVVDAGARSL
Sbjct: 61 ILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPKLLRLLADPDPKVVDAGARSL 120
Query: 121 RMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKALCNGGV 180
RM+YQSKLAPK+DFLQQENTKFLLSLLNSQNENVTGLGASII+HSC+TIAEQKAL +GGV
Sbjct: 121 RMIYQSKLAPKHDFLQQENTKFLLSLLNSQNENVTGLGASIIVHSCETIAEQKALHDGGV 180
Query: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYIIELMKDRNPKTR 240
LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIA+FMQPGRE+CL+YIIELMKDRNPKTR
Sbjct: 181 LEKLIDLLDGSLSQRDASLESIATIFKNNVEAIARFMQPGRENCLNYIIELMKDRNPKTR 240
Query: 241 LLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFVFSTLIAEKEELQ 300
LLACVCLIV+RNSSPCYLQDIGIKMKLIHSLLELLD+P QVGDEA FVFSTLIAEKEELQ
Sbjct: 241 LLACVCLIVIRNSSPCYLQDIGIKMKLIHSLLELLDSPDQVGDEAPFVFSTLIAEKEELQ 300
Query: 301 KLAFEANAIDKLYNHLQKDQLSPRRFQGILLAFSHLCSKLESCRSRFLSLQVMNIVIDAL 360
KLAFEANAIDKLYN+LQKDQLSPRRFQG+LLAFSHLCSKLESCRSRFLSLQVMNIVIDA+
Sbjct: 301 KLAFEANAIDKLYNYLQKDQLSPRRFQGMLLAFSHLCSKLESCRSRFLSLQVMNIVIDAI 360
Query: 361 QHESSDIRIAACTCLRSVSRSIK----NLSAGYFMNETVVLPVVQLLHSPSNAVQVAALG 420
+HESSDIRIAACTCLRSVSRSIK NLSAGYFMNE VVLPVV+LLH P N VQ+AALG
Sbjct: 361 KHESSDIRIAACTCLRSVSRSIKCPIQNLSAGYFMNEAVVLPVVRLLHDPCNDVQLAALG 420
Query: 421 ALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLANSMYKSGIF 480
A+SNIVVEFSTKRSIFI CGGVKELVRLSKSMDL+IRLNALWALRNLMFL N M K IF
Sbjct: 421 AISNIVVEFSTKRSIFIGCGGVKELVRLSKSMDLEIRLNALWALRNLMFLTNIMCKEAIF 480
Query: 481 RELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNTICRQLKSIS 540
ELTASLLASLVCDPEPSIQEHAMALVRNLI+GCEDSIEYAFAED I+LNTI RQL++IS
Sbjct: 481 MELTASLLASLVCDPEPSIQEHAMALVRNLIDGCEDSIEYAFAEDAIVLNTIGRQLQNIS 540
Query: 541 RDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNSQLRIAAVWV 600
DEIGVQGMYVLCNVASGNEFHKE +MKQLFP GDDVI SFVVKFLQSDNSQLRIAA+W
Sbjct: 541 TDEIGVQGMYVLCNVASGNEFHKEGVMKQLFPRGDDVIHSFVVKFLQSDNSQLRIAAIWA 600
Query: 601 IINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLMAFGD 652
IINL+LPSSPRALDRVTKL+NAGI+SQ+KNMVNDPCLDVKLRVRTVLGQLMAFGD
Sbjct: 601 IINLTLPSSPRALDRVTKLRNAGIISQIKNMVNDPCLDVKLRVRTVLGQLMAFGD 655
BLAST of CmoCh10G009760 vs. TAIR 10
Match:
AT1G51350.1 (ARM repeat superfamily protein)
HSP 1 Score: 743.4 bits (1918), Expect = 1.6e-214
Identity = 388/664 (58.43%), Positives = 505/664 (76.05%), Query Frame = 0
Query: 1 MPTSASSAIPKPS--------SELVSRLTAADAAVKLKALRDIKNQIIGNRTKKLSFIKL 60
MPT+ +S+ S S++ SRL ++D VKLKALR++KNQIIGNRTKKLSF+KL
Sbjct: 1 MPTTTTSSASSSSSASGNNRQSDVFSRLASSDPEVKLKALREVKNQIIGNRTKKLSFLKL 60
Query: 61 GVVPHVAAILSSTSDP----NILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADP 120
G +P +A++L+ D NILVQSAA LGSFACG +AGV AVLDAG FP LLRLL +
Sbjct: 61 GAIPAIASVLADADDSDECNNILVQSAAALGSFACGFEAGVQAVLDAGVFPHLLRLLTNT 120
Query: 121 DPKVVDAGARSLRMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDT 180
D KVVDAGARSLRM++QS APKYDFLQ++N +FL SLLNS+NENV+GLGASII H+C T
Sbjct: 121 DEKVVDAGARSLRMIFQSNQAPKYDFLQEKNMEFLFSLLNSENENVSGLGASIIAHACGT 180
Query: 181 IAEQKALCNGGVLEKLIDLLDGSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCLSYI 240
EQ+ LC GVLEKL+ LLDGSLSQR+A LES+AT+ KNN EA++ F+ + +
Sbjct: 181 SVEQRVLCEAGVLEKLVILLDGSLSQREACLESLATVLKNNPEAVSDFVGLESGKYFNSV 240
Query: 241 IELMKDRNPKTRLLACVCLIVMRNSSPCYLQDIGIKMKLIHSLLELLDNPGQVGDEASFV 300
EL KDR P+TRLL+C+CL+V+ N+SP Y ++G K LI +LLELL++PGQ GD+A+
Sbjct: 241 TELTKDRYPRTRLLSCLCLVVIYNTSPSYFINMGTKSSLITTLLELLNDPGQSGDDAALG 300
Query: 301 FSTLIAEKEELQKLAFEANAIDKLYNHLQK-DQLSPRRFQGILLAFSHLCSKLESCRSRF 360
S LIAEKE+LQ+LA+EA+AI + L+ +L +R QG+ L+ + LCSKLE CR F
Sbjct: 301 LSCLIAEKEDLQQLAYEADAIKNIVEILKTGSELQSKRLQGLFLSLAELCSKLEDCRCSF 360
Query: 361 LSLQVMNIVIDALQHESSDIRIAACTCLRSVSRSIKNLSAGYFMNETVVLPVVQLLHSPS 420
LSLQV++++ DAL+H+ +D+R AAC C R+ +RS+K+LSAG F ++ V+LP+VQLLH PS
Sbjct: 361 LSLQVLDLLTDALRHKDADVRAAACICFRNAARSVKSLSAGRFNSDHVMLPLVQLLHDPS 420
Query: 421 NAVQVAALGALSNIVVEFSTKRSIFIECGGVKELVRLSKSMDLDIRLNALWALRNLMFLA 480
++VQVA LGAL+NIV++FS+ +S FIE GG+K+L LSKSMD + R +AL ALRNLMFLA
Sbjct: 421 SSVQVAVLGALNNIVMDFSSPKSSFIEYGGIKQLTELSKSMDPNTRCSALRALRNLMFLA 480
Query: 481 NSMYKSGIFRELTASLLASLVCDPEPSIQEHAMALVRNLINGCEDSIEYAFAEDGIILNT 540
+ K + ++ A LA L+ DPEP +QE A+AL+RNL++GC SIE+ F EDG+IL+T
Sbjct: 481 DIKRKELFYSDVKAQGLACLISDPEPPVQEQALALLRNLVDGCISSIEFVFDEDGLILDT 540
Query: 541 ICRQLKSISRDEIGVQGMYVLCNVASGNEFHKERLMKQLFPHGDDVIQSFVVKFLQSDNS 600
+ RQL+ + ++ +QGMYVL NVASG E HKE +M+QLFP ++F++KFLQSD S
Sbjct: 541 VGRQLRKAPQAQMAIQGMYVLTNVASGTELHKEAVMEQLFPQAKAGSENFMLKFLQSDES 600
Query: 601 QLRIAAVWVIINLSLPSSPRALDRVTKLQNAGIVSQMKNMVNDPCLDVKLRVRTVLGQLM 652
QLR AAVW IINL PSSP A DR KL+NAGI+ Q+KNMVND CLDVK+R+RTVLGQ M
Sbjct: 601 QLRSAAVWTIINLISPSSPGAFDRHVKLRNAGIIPQLKNMVNDACLDVKIRIRTVLGQSM 660
BLAST of CmoCh10G009760 vs. TAIR 10
Match:
AT1G02690.1 (importin alpha isoform 6)
HSP 1 Score: 43.9 bits (102), Expect = 6.0e-04
Identity = 44/193 (22.80%), Positives = 85/193 (44.04%), Query Frame = 0
Query: 55 VPHVAAILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVD 114
+P + +L ST D +L ++ L + G + + V+DAG PRL++LLA P P V+
Sbjct: 248 LPALERLLHST-DEEVLTDASWALSYLSDGTNEKIQTVIDAGVIPRLVQLLAHPSPSVLI 307
Query: 115 AGARSLRMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKA 174
R++ + + + LL+LL + + SI +C TI+ A
Sbjct: 308 PALRTIGNIVTGDDIQTQAVISSQALPGLLNLLKNTYKK------SIKKEACWTISNITA 367
Query: 175 --------LCNGGVLEKLIDLLD-GSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCL 234
+ G++ LI+LL+ G + ++ +I+ KF+ + C+
Sbjct: 368 GNTSQIQEVFQAGIIRPLINLLEIGEFEIKKEAVWAISNATSGGNHDQIKFLV--SQGCI 427
Query: 235 SYIIELMKDRNPK 239
+ +L+ +P+
Sbjct: 428 RPLCDLLPCPDPR 431
BLAST of CmoCh10G009760 vs. TAIR 10
Match:
AT1G02690.2 (importin alpha isoform 6)
HSP 1 Score: 43.9 bits (102), Expect = 6.0e-04
Identity = 44/193 (22.80%), Positives = 85/193 (44.04%), Query Frame = 0
Query: 55 VPHVAAILSSTSDPNILVQSAAVLGSFACGVDAGVSAVLDAGAFPRLLRLLADPDPKVVD 114
+P + +L ST D +L ++ L + G + + V+DAG PRL++LLA P P V+
Sbjct: 249 LPALERLLHST-DEEVLTDASWALSYLSDGTNEKIQTVIDAGVIPRLVQLLAHPSPSVLI 308
Query: 115 AGARSLRMVYQSKLAPKYDFLQQENTKFLLSLLNSQNENVTGLGASIIIHSCDTIAEQKA 174
R++ + + + LL+LL + + SI +C TI+ A
Sbjct: 309 PALRTIGNIVTGDDIQTQAVISSQALPGLLNLLKNTYKK------SIKKEACWTISNITA 368
Query: 175 --------LCNGGVLEKLIDLLD-GSLSQRDASLESIATIFKNNVEAIAKFMQPGREDCL 234
+ G++ LI+LL+ G + ++ +I+ KF+ + C+
Sbjct: 369 GNTSQIQEVFQAGIIRPLINLLEIGEFEIKKEAVWAISNATSGGNHDQIKFLV--SQGCI 428
Query: 235 SYIIELMKDRNPK 239
+ +L+ +P+
Sbjct: 429 RPLCDLLPCPDPR 432
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q2KI54 | 5.2e-45 | 25.52 | Armadillo repeat-containing protein 8 OS=Bos taurus OX=9913 GN=ARMC8 PE=2 SV=1
Q05AL1 | 2.6e-44 | 25.15 | Armadillo repeat-containing protein 8 OS=Danio rerio OX=7955 GN=armc8 PE=2 SV=1
Q8IUR7 | 4.4e-44 | 25.37 | Armadillo repeat-containing protein 8 OS=Homo sapiens OX=9606 GN=ARMC8 PE=1 SV=2
Q9DBR3 | 7.5e-44 | 25.22 | Armadillo repeat-containing protein 8 OS=Mus musculus OX=10090 GN=Armc8 PE=1 SV=...
Q5R6S3 | 6.3e-43 | 25.22 | Armadillo repeat-containing protein 8 OS=Pongo abelii OX=9601 GN=ARMC8 PE=2 SV=1
Match Name | E-value | Identity (%) | Description
A0A6J1H8W8 | 0.0e+00 | 100.00 | Armadillo repeat-containing protein 8 OS=Cucurbita moschata OX=3662 GN=LOC111461...
A0A6J1JC02 | 0.0e+00 | 97.54 | Armadillo repeat-containing protein 8 OS=Cucurbita maxima OX=3661 GN=LOC11148540...
A0A5A7UMI4 | 0.0e+00 | 91.55 | Armadillo repeat-containing protein 8 OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A1S3BQ72 | 0.0e+00 | 91.55 | Armadillo repeat-containing protein 8 OS=Cucumis melo OX=3656 GN=LOC103492030 PE...
A0A5D3D5L9 | 0.0e+00 | 90.99 | Armadillo repeat-containing protein 8 OS=Cucumis melo var. makuwa OX=1194695 GN=...