Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAACCCTTCTAATCCCCACGGGAGCGTCTCCCGGTTTTACACAATTGCATTTCTTCCACTGCCGGACTTGTCCTGCTCCGGAAACTCAGCGGCGTCTGCAGTCCCGTGTAATTCGAACCCACTTCACCATCTCCTAATTTTTCCCTTATACAAAATCACCTTTCTTCTTCTAGGCAGTAATGAGCAGCTGGAAGAGTCTTCTTCTCCGAATTGGCGACAAGTCCCCGGAATACGGCACCTCCTCCGATTTCAAAGACCACATTGTTCGGCACTCTCTCACACTCATTCCAGTTATACTCCACTATTCGATTACTTGTCGCTCATTTTTATCTGGGATTTGCAGGAAACTTGCTTCGCAGCGATTCGGCGGGAGCTGGATCACTATGGAGATGAAATTTTGCCTGTAATCACTTGCCTCTGCACGTTTTTTTCTCTATAATTTATCTGGGTGTTGATAAAATTTGCAGTTCTTGTGTTGGTTGAAATTATACTAGTTGAATCACTATGTTAGTAGTACGTGTGAACTGAGTCTTTTTTGGCTGGTTATCTCTTAAGTCGTATCCCTTTGTGGAAAATGTGAGTTTTGTCTAATAACTACTCTATGCTGATGGTATTTACACTGTTTTTTTTCCTCGTTTGACAGTTCCTTCTGCAATGTGTTGAACAATTGCCTCATAAGACTCCTTTGTACGGGACATTGGTAAAGTACCTTACATCCTTCAGTTCTCCAGTGAATGTATTTTCTTTTCATTTATGTATTTCGTTGATCAGCTTATGTTGTTAGGTCGTGCGATCTAGCTTTAACTTCTCAATTATTTGTTACATGTATTCTAGAGTTGTACAATTCTTATTTAACGATCTCTCTATATGTTTCCCATTAAAAGACCAAAATTGAGGTTTAAATTTATAAAATGCAATAGCCTTCCTTTTGCTGTTGTCTTTCCGTTCTTTTTAGCTTGTCGTTTGTGCCGCAATTGGAGAATGTCTTTAGAGCTACATAGGTCAGACATTTCCCTTCTTCATTTATTCATTCTTTTGATTTAACAAAATTGTATGTTCCGTATAGAAAAAGACCACCTTATAAAAATTGTAGTTTTCTTAGTTCTTGGAGCCCTTTTTCCCCTTTTCCTAGTCTGGTTTTTGTTGCCTTGGCTCCCTCTGCTCTGATTTTTTTCTCAATTAAATATTCTCTCTGTATATTCTTTCATTTCATCTATGAAAATACCCTCCGACGAGATTCTTTTCTTTCATTATTTAGCTGCCCCATGTCGGCATCAATTGTGTTTTATTTATCTTAAAAAAAGAAATAAGTTGTCTGGCTTCTTTTATGGGGCTATCTTTTTGTATGTCCCTTGTTTATTTTATATATCCTTTGTTATTCTTTCATTCTCTATGAAAATGTGGTTTTTTATCTTTTAAAAAAACGAAGGTGTTTACTCACACTTAATATTTCATGGCACAACACTTATAATTCTCAATGATGGTATTAGTTTGCGTTAGTGACGTATGTCTATCTTGTTGTGCCTACAGATCGGATTGATGAATTTGGAAAATGAAGACTTCGTGAAGAAAATTGTGGAGCAAACTCACCAAAGTTTTCAGGTTTGCAATTTTGCAGTTGGCATTGTCATTGTCTTAGTGAAGTGTAGTAAGAACTTTTTTATATTAAGACACTTTTAAATTCTCTAATCTTTACGCCCTTGTGTGTCCCTAGTCCCTTCCTAGAAGGTTTAGACTGGTATCCCATTTTAGAGAGGGAAAGGGAGCAGTTTGATGCCCCGTTTTTATTGTGGAGGTTATTCACAAGGTGGTCTGGTGAAATAAGTCCCCTAGGCATGACGATTGTACCATGGCCTTTTTTTTCAGGATTATTGGAATTGCATCAAAGAGAATACGTGGAAAGTTTTGAGTGATTTTTTTTGAGAGGTACTATTGACACAGGCATGAGCGAGACCTATGTGTGCCTTATTTCCAAGAAAGAACACTAACAGGGTAAAAGACTTTAGGTCCATTAGCCTGGTCACTAGTTTTTATAAGATGATTGCTAAGACTCTTGAGAACAGGTTGAGGAAAGTCCTTCCTAGTACTATTTCTTATTGTTTATCCAAGTGTGGGTGTGTTGAGAAGATTGGCGTTGCTAGGCGTGCAAAAAGGGGTGGTTGAGGGGTTCCAAATGGGGAAGGAGTCAATTAGTTTGTCTTTCCTTCAGTTCACAGATGATTCAATCTTCTTATGTTTGGGGCAGAAAAATTTCTTGACGAATTTCAACGGTTTTTTTAATCCCTTTTTGAAGTGACTTTGAAACTTAAGATTAATAGGATGGAAATTTATACTTGGCCTTAGCCTTTTTCATTCTAAGCTAAGTGTTTGGGCCTCATTGGTTGGGTGTGAGGTGGGTCAACTTTCGTCCTCTTATTTTGGGCTTCCATTGGGAGGTAATCCGTGAAGGCAAATGTTTCGGGATTCAAAATTCTATATGATTCAGAAACAGTTGTCGTCTTGGAGGAAGCTCTTCTTCTCAAAATGGAGAAAACTCATTCTTATCCAATCTGTGTTAAGTAGGGTTCTCACTTGTTGCTTATCGCTCTTTAAGATTCTCGTTTCTGTGAGCATGAATATTGAAGGAGAGATGAAGAGTTTCTTTGAAGGTGTGGATAATGGGGGTGGGTCACATTTGGTTAATTGGGAGGTTGTGGTGAAGCCGGTGGAATTAGGTGATCTAGGTAATGGGAGCTTGATATTACATAATGAGGCTCTGCTAGTGAAATGGTTGTGGCGATGGTTCTTCATGAAGTTTGACTCTTTGCGGCACTAAGTTATTGTGAGTCAGTATGACCTGCATCTGCCCCTTTTCAAGTGGGTTGCAGATGGTGGTTTCATAGTGCCTAATAGAAATCTTTGGAAAGCAATTGCTTTGGGCTTCCTGTCCTTTTCCTCGTTTGTTCAATGCTTCATTGGGGAGGGCTCAAATGTTTACTTTTGGGAAGATGGGTGGATGGATGATAAACCACTTCATGAGTGTTTCCATGCCTATGCCATCTTTCGAGTAAGAAATTGCATTAAGTGGCTTCTATTTTATCGGGTTCCAACCTTTTGCTTCTTTGTTGTTAGGCTTCAAGAGATGTTGAGCAGATTTGGGAAGTCATGCTTAATAGTGGAAGAAGCTCCAATCCATTTGTCTTTCCAAAATTTGATTATTCTACCATTCCCCTTTCTAACATTCATAAGTTTTATTTTATAAAAACAACAGTATTCTTTGCTACATCAAACCAAGGCCGGGTGTTGCTTCGTGGCTCTGCCTATAGGATAAGAGGTGACCCAATCGTTTCGTTCTAGGCCCTAAATACTAGCAATTGCCTTCTTCCGTAAAAAGTGTGTAACGAGTTTTTGAACTTTAAAAAGTGTGTACCTAGTCTTTAAACTTTCGATTGTGTGTATAATAAATTCTTGAATTTTCGATTTTGTGTCTAATAGATTTTTGAACCTTCAATTTTCTGTCTAAATTTTAACCTATTCAACATTTTTTTAAATATTTACGGAGAACACAAAATTGAAATATTAGGGTCAAGTTTCAATAGGTTCATGTCTTTAAAAAAAATTATTGAATAGATCAAGGAACTATTTAGATACAATTGAAAATTCTGGGATCTATTATACATTTTTAAAGCGTAGAGACTTAGTTGATATAACCTTGAAAGTTTAGGGGCTAAACGTGTAAGTTTACAAAAAAAAAAAAGAAAAGAAAAGTATGTAGTTTAAATTTATAAAATAAATAATAGCAATAGAATGATGCCTTAAAGATTAATATTTATCACACCTGTGTTGAAAACATACTATATCCAAGAATAACATAATCATAGTGCAATTTAGGACGAGTACTAGTTGTTGTATGATTACTCTTATGTTAGTGCTAACAATTGAAGAGCAATAAATTAAATGTTTCTGGTCTGTCTGACTTCCATTCATACTATACAAATTTAAATTATTTTATCAGAGATATTTTGTTGTTCTTACTCGGTTCTTTCTCAGGATGCCTTAAACTCTGGACATTGCCACAAGATCCGGATTTTGTTACGCTTTCTAACTGCATTGGTATGCTTGATTTTGCTATCACAGCATATTTTTTTATGACTGATTATTATCATTTTTCAAAATTAGAAAGCCTAACATCTTAGAGTTTTGATTTTGTCTTCTCGAGAGAATATTCTGATCTTCTAAGCATTTTTGTTGTCTTACGTAGATGAGCAGTAAAGTCCTGACGTCCACATCTCTAGTAGTTGTTTTTGAGACATTATTATCTTCAGCTGCCACAATAGTGGATGATGAGAAGGGAAACCCTGCATGGCAGGCTTGTGCTGACTTTTACATAACTTGCATTCTATCTTGTTTCCCCTGGGGAGGCGCTGAACTGGTTGAGGTATTGTTATTTCAACTCTGAATAATGTCTTTACTTAATGGACCATCTTTCCCAACACTGTCCTTTTCTCTCATTCTCATTTCTTGTTATGAATTGAGTTATATCCTAGGTTAGAGATTGGGTGGTGAGCATCTTTCTCGACACTGTTCTCTCATTCTTGTTACAAATTGGATTAATATCCTACGATAGAACATTGGGCCTTGTTTTGATATACTTTCCTATTTGATTGCTCTAAAAGAATTGTTATAGGAGCTTTACTATTTTTAGCATATTATAACACACATTGGATGAAGATTTGAACCTACGAATTATTAGAGAAGATAGAGATACCTTAACTACTGAGCTATGTCCATGTTGGCCAAAGAGCCCCTACTATTGATAATTTGCTTTTATATGCAGCAAGTTCCTGAAGAGCTTGAGAGGGTTATGGTAGGAGTAGAAGCTTACTTAAGCATTCGAAGGCATACGTTGGACACCGGCTTATCCTTTTTTGAGGATGATGGTGAAGTTGAGAAAACTCTCAACGAGAAGGTATAATTTGACTAGTTTTGATTATTCATTATTGTCATGCTAAGCTTACTACCGGTGAAAGTTTGACTAGTTATGAGTAGAAGTGCCCTTTTATATAGCTAGGTTCGACTTCTCTTTTCTGGAACTAGTTTTTTGTTTGCCCTTGTATATACTTCCATTTCTCTAAAAAAGTGCGGTTTATTACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCATTCTAAGCATTCTACTCAATTTCTAATCTGTTTCAGGTCGGCTGGATTTTTATTTATTATTATTTTTTTAATTTTTACAATATATGTACATATATATACGGCAACCCCTTTTAGTCATATCTGTCAACTCAGTCTAGCCAGGTTTGTGAATCCCAGAATTACCCGGCTTGTCAATTATGACTGTTTTGTCAACGATGGCGATGCTCTTGGCCCAGATCAAACTCTGTCTTTTCAATATGCTAACACTTGCCCTTACCCTCTTTTTTGTATTTTCTGTAACTTGTTTTTCTCAACTTAACCAGCCTTTGGACAAACAGCTCATGTGGACCAGAAGAATTCTCTATGATCATTTAGTTTCTAGGAATTTCATCTTCTGGTTGATATATGAATCCATATTCTTCTTACATTATATCTAAAAAAAAGAGAGAAAAAGAAAAAGAAAATGGGAAACCTGTATTATTTCAACCTGGAGATTTTACTTTATGATTGGAACTGAAGATATATTTTTTTTTTTTTTGAACGTTAAACTTATTTATCAAAGAATTCTTTGTTTTCTCAGCCATATTTTCTAAAAACGCTCTTAAAATCATAAACACATTTTAAAAATAAAAAATGAGTATCTAAAAATTTATTTATTTTTCGATTTTGCTAAGATTTAAAAATAAATATCAGAAGGAGAAAAATAAAACGAAAGGATTGTTCAGAAGCCAGCTTAATGTTTCTAAACCCTTTTGGTAAGTGAGGATATCTCATCTCCCCTATCTGTTTAGTTAGGCTTTAGATTAATACAAGGTTTTCTGAGTTTCCTCTTAGAACAAAGCATAGTATTTAGAATCAGAGAATTAAAATGTTGAAACATTTGGGTACAGGATCGAAGTTATTCTATGGCAGTTAGGAAGTCCTAATTCAGGATTAACAACCCCCAACTAAATTCTAGGTGACTATAAAGTAAAAACCAGTGGCCTTTAGAAACTGTAAAAAAAAAAAGAGAAAATTGCGTAAGCAACAGTAAAAGAATTTTAATTACAAACCATACCATGGAGAAAAACCTTCTCCATGTCTTCCCTCCCGTCTGTACCCTCCTCCCCATGTCCCACCAGGGCTTAAAATAGATAAATTTAGTTTCTTCACATTGTTGTTACGAAATTAGCAACATTTATTTCACGAATCCTTTTCTTTCTGTGGTTGTTTCAGGATTTTTTAGAAGATTTATGGAGTCGCATACAAACGTTATCTACTGATGGATGGAAAGTGGATAGTGGTGATTTTCTTCAACTTTGGGCTTCAATTTGGTTATTTCCTCTGGTCCTTTTATGTTTCAATGTTATCCTGTCGTACAAGGAAAAAAAGGGAAATTATTCCATGCATAGAAGTGCTTCAAGTGTACGAGATGATGGATTGGTATACTAAAATTTTGAGAACCCCACGTACATCAAGAATCCATATATTTTCTTTCATATTTGATATTCCCGTTAGATGCTTTGGTACTTCTGGGAGTACTCTCAGTGGGACTCAAGATAACTGAGTTGCAAGGGTATAATTACCTTTGATGATGTATAGCAGTTCAATATCTTCCTGTAATGAAACCCAGCTAGCTAGTTTTGTCTCTGTCTGCTCAGTTTTTTGAATTTGGGTTTTCTAGAATCAATGAAATATTACTAGCAGTAGGGTAATCATCAACTATGTAATTAATCATCTGCTTTGCGATAATCCTTTTTTGTATCCCTATAGCCCCAAAGAGAGACCTAAATGAGAGATGTGTGTATCAAGGTTTTACAGTTGGGCAGCGAGGGATTGAGAAATAGGAGAGAGGGTCAATGGGAAAGTGAAATAGTTTAGTGACGAGAATGGCCGTGGCTTTATTTCTCCTATCCAGGGAGGCAAGTATATTTTTGTTCGGCTTCCTCCATTAGATTTGAGGGATCTAGAAAAGATAGCTCTTATTACTTGGGTCAATGCTGTTAAATCTATCATTTTGATCGTTTGAGGTGAAAGAAATAAATGAACGTTATATGAAGTGGCTAGGAATAGAGATAGTATATGGGATTCTGTGGTCTAGTATACCTCTTCAGAGTTTGATTACCCCCACATTTGGTTTTTTTTAACAGATGTGGGGCGAGAGATTAAGTCTTTGACAGAAGGTAACATGGACCTTCTCCTCTGGGCTATATTTCGGTTGGAACCTCCATGCATCTCACTAGTTATAGGAATTCCTGCTTATGATGATAATAATAACAATTGTGTGCCTTTGTGAAAAGTAAAAAGTTTGCCAACTCTTCTCTCTCAGCCAAGTAGTGATGAATATGCCGGGAAAAGGCTCTTGTTGAAGAATTCATGCCCCTTAACAAGATCTTGATGACGTACTATCCTAGCTTGATGACTCTGAGGACTGGCTTTCTATGTCTACTTTAGTGGGACATGATTTCAACTTTTGTTCTTGTGGCACATGATAGCAACTCTTGTTCTAACAGGAGTTTTCTTGTACTCAGGATAACAAGGTGTGCAAGGATGTCCTAAGATGTTAATTGGAGGTGTTTTTTGAAATTAATTATAGGATTTCTTTTGTATATCTATTTCAAGTTAATGAAAGGGTTTCACTTTCCTATAAAAAAAGGGAAATTGGACGACCTTGTAAAAGGTGGTTTCTTTGAAGAAGCCAGAAGGCAGTGAAGTTTTATTGGAGATCTCCCGGGCATGAGTGTCCAAAGAGACTTTTGCTGCCCTTTGGAAGTGTATTAAGCGGAAGGTTTGGTTGAGGCTGTTCAAGGATGATTCAAAGTAAGAGGAAACATGGATGTCCAAAAGAGCACAATGGCTTATGCACAATTTTCTTGAAGGCAAATATGGCAAGGAGCACAACTTGGTATTTTCAAATTTAGTAGAACACTAGGATCCATTGATTGTGCTTGGCTTTTCTAGCTATCTATGGTATCAACAATGCAGGGAAGAAAAACTTGGTCTTAATCTCCAAATAGCTTAGGAGATCCTAAAGAGTGAGGTTCTATTGTAGATTCCCTATGGGGGCTCAACGGTTTATATCTAAATAACCATAAAAGCAAATATGGGTCATTGGTGTATTGTTAAGTTTGGTAAAGTTCCATGGTAGTTAATTCAATTTTGAACCAAGACCTTTTGAAGCTTGCAAGTTGCTAGCCACTTGGAAGTGACACCTTTTTTCTCTTTTCCTTCTTAATGGGTGGTTGAGCAAAGGAAGCAATGAATGATAAGTGTTGAATTAGTACATTTTACTTTTAAGAAGTTGATGATCATTTCTTTTGCTTCTTCCATTATCAAGTAGATTTGTATTTTCAGTAATTATTGATCTTTTCTTTATGTGATAGTTCCAAGACCTCACCTTCTATTTGAAGCTCAGCTAGTTGCTGGGAAGTCTCATGAATTTGGAACCATCAGCTGTCCGGAGCAACCTGATCCACCTTTAACACTTTCTGACATTACTTATGGTAAACAGAAGTATGCTGCAGAGTTGAATTATCCTCAAAGGATACGTCGACTTAATATATTTCCATCAAGTAAATTTGAGGTACTATTGTATTTTGTGGCATTAATATCTTTCGTTCCGTCTTGACTTGTGAATATTATCCACAAATATGTTAAGAAGACTTTGGGTAGAGAGTTATGTTAGATTTGACTAAATTTGTTTTATTGATCTCAATCAAAAAAACTTTTTGGGGACTATTGTAAATTAACATTACACTATGAGTGAGTACTCGATACTATTCCTTTTACTAGATGGATATAGTAACATATATGAGAGCTACAAACTACTAGTACATCACATCATGATAAAGGGGACAGATGAAAGTAACTTATGGAGTCCTATATATAGGATATTTACATTTATATACAATTCTGTATGTGTTGTAATACTTCTCGTCAAGAACTATTATTAAAAAAAACTTCCCTTCTTGAATCGTGTCCAACATATCCATGCTTTATTTAATCCTTTTGATACTAAGTTTTGTTATATTTTCTTCCAAACATAATAACAACCAACTTCAACATTTGCGGGTTGCAATGCTAGATCATCTGCAAGGTCCTTTTGTACTGTCTTCTAAAGTCCCTGTTCATACTTAGATTTGTTGGGCTTGTTGTTTTATGTCATTGTACATTCTTTCATTTTTCTCGATGAAAGTTCAGTTTCTTACCAAAAAAAAAAGCAGATTACCAGACTTGTACGTGGTCCTCAATTGAATCGTGTTTCTTGGACCTTTGTTGATGCAGGATGTGCAACCTATTGATCGCTTTGTCGTGGAGGAGTATCTTCTTGATGTGCTTCTTTTCTTCAATGGCTGGTTCGTTAATTTCTCCAAATCTATAATTTAAGAATTAATACAAGAGCATGAAACTTTCCCTCTTCAAGTTATGCTGATTAATCAGTTTGGTTCATGGAAGGATATTTTTAAGAACGTTGATTTGAATCTGTAATTTAAGAATGAATAATGGTATAAGAGCATGGAACTTTCCCTTTTCAAGTTATGGCCGATTAATCAGGTTTGAGAACTTTGTTTTTGATAACATGATCAGGTGGCCGGTATCATTATGCACTTAAAATCACTAGATATTAGCATGCTTTTCAAAGAGAAGCTGTCAGTTCTTCTCAAAAAAATTGAAGGAAAATATGGGAAATTCCTTTCAAGTTACTGATGATATACCGTCAAAATTCTTGGTCATATGTTATTGGAGATGCGAAGGATTTATAGGGAAAGACTTTGTTGATTGCTCTTTCGTGGTTTAGTTGGAAAGAAACAAACAGTAGGTTCCTTTAAGATAAATTAGTTTTTCTGTTATTTATGTAATTGTCAAAGGAGGTTCAAAATGTTCCTAACTATGGGTTCCACAGATTGTATGTCTTGTTCTCATAGTATTTACTGCTTCATTCTGATTGTGCACTCTCCTCTACACTGTATATTCAAACATTCTCTTATTTCTTATATTAGAAACGTCAATGATATCACTTGTCCTTTTAATTCCAAGCCACCTGGGGGAATGGACAGCTCCTCCAAGTTTAACTTAGAAACCAAATGTGAATGCTCTTATGCTCTCTGGGGACTGGTCTGCTTGTAAATTCTTTTACCATTCAATTAGCTCTATTTTAAGAACAAAGTGTAGAGTTCTAGTGTTTTTATGGAATGAAATATTATTTATATAATATAACATTTTTACATTTCAATTAGCTGAAATGCAACATATTACAATGGTTTTTTGTTTTCTTGTTTCATGCAAAGCTAGTCGAAAGGAATGTGCATCTTTCATGGTTGGCCTTCCTGTACCTTTTAGATATGAGTATCTTATGGCAGAGACAATTTTCTCGCAGGTATGCATTTATCTTTCTTGTTAATTATTGAAGATTTTGTTCTCTCCCTTTTTTCAATTTTATGAAATAAGGAAATGTATAAAAGGCAAGAACTTAAAAGGCAAAATAGTAACATAAGGATCTCTTATTGGCATTAAGGGGAAGGGATGATTATAGAAGTGTACAGATTAAGTGGACTGAGGGGAAGAAGCAAATGTAACTAGATCACGAGTTTCCCCGAGCTTCTTTCTGTGTCTTTGAACATCTCCTAATTTTTTCTATTCCAAATTCTTTTATGGAATTACACAAAAACTATTTCCCAAAGACTTCAGTTCTCCCAAGGCCGTGGATGAAGAATTCTTTAAGGCTTCATTATTTTAAAGGTGAAAGGAAATCTGAAACCTAAGAAAAGCAAAGAGGTACTACATACAAATTTTAGTGATTTATTTACGGTTTCCTGGTTGTTTGCCCCGTATAAACATTAAGGCTTTTTGGAAGAATTTTTTCAGGTCCCTTACAGTTAATTCGTCTATGGCTGATTAAGCATTGCTGAATGGGGAGAAGTTTATAGATCATCAGTTTGGGGATGGAAAATGGAGACTTATTGGGGATTTACATCTTAAAATTGAGAAAAAGTCGAGGGTCAAGCATTCCCGATCAGATTTGATTGAAGGTTATGGGGGAAAAATTGCAATAAAGAACTTACCTTTTCCATGTTGGGAGCGCAATACTTCTGAGGCGATTGGGAAATGTTTTGGGAGTCTTGTTAGCATTTCTTCCCAAATTGTGAAACTCTTAAGATCGTTCTGCAGCGGATATTGAAGTGAAGAAAAATCTGTGTGGTTTCTACCGGCGACAATTTTGATTTTAAGGATCAAAATAGGGCTGAACCTTCCTTAGAACTCCTAAATTCTAAGACGACTCACTCCAAAATTAGATTGATCAAGGCTAATCCAATGAATCAATTACACTTTAAACCTAAAGCAAGATTTGAAGAAGAACCAACACCAAGATGGATTTGAGAAACACAAGTTTCTTTCATTGAATTGCCAAAATGTCTTCCAAAATGCAAAGCATTACATACATGACCTTAAAAAGTCCACAAAGTCGTTCAAGTACCCAAGAACAAATAAATGAACTTGGAGAAAACAAGACCTGTTTGGCTTGATGAGAACGTTTGACAAGTTTCAACTGTCAGAAACCCCCAAAACTTGCAATCTAAAAAGGCACCTTTTCACATATTCTGCATGATTTTTTTGTCCTTCGTGCCAACTTTCCAACGACATAATTATCTACTTGACTTATTGAAATGACATTATAAATAAATTGGGGGTTCATAAACTGTCTTTCCTGGTATATTAGCTAGTTTTTAAAGATGTTATATAAAAGCCTTTTGAATTTACTTTGCAAAGCTCCTTTGGGTACAAGTAATGGATCAAGGGTTGGACTCACATTATCATGGTGATCCTGGCATGTCTCTTGGAATTCTTTGAAAAGGATGAACATTTATTAGGTGCCCTGTATGTAGATTTGTTGGTGGTTAAAGAAACAATCAATGCTGATTCCTTTCATGTATCTTCAATTTACTACATCAGATTGTAGGTGGAAAACTATTTGAGGATGGTTTTAATTTGTCAATGTTAAAGAATCCCACTTTGGTGTCAGATGGTGATTTAAATATGGTTAATCGATTCATGCGATACCCGTGTTGGAAGTCAAGTGAAGAGCATAACAAGGTCTAATAAACGTTTTGAAGATCTTGGTATGCTGGTACCTTTGTCAAATGGTCTATTGACATGGTCAAGGGTGGGGGATGAAATTTCTCGCTTATTGTATAACAGGTTTTTTTGTCTCTAAACATGAGATAATCATTTTGATAATTCGTGCATTTCAAGACTAGCTGGTATTTTTTCTTATATCAGCCCAAGTTGGTTGATTGTGTCAGTTCTATTGAAGAATTTTGTTAAGAACTACATTAATGGGTGCAAGATTGTTCTTTTGATATGTTTTTTTTTTTATCTTGTTATCTATGCTATTTCTTGGTTTAAATTGTCCAATCTCCTCTATACATGTGTTCGTGTTTCCATTTTAGCCAATTAAGAACCTGCTTATAGTGGTCATGGATTTTTCAACCTTTTTGTAAACTTCACAAAATGTTTCTTATAAAAACAAAAGATCTTTCGGATTCTAAAGCTTCTTGCTTGAGTTTGATTGAGGCAAATTTCTTGATTGGTAGATTCTCATAAGCAAAGTTCAAGCTCTTCTAATTCAAGAGTTTTCTTTCATTTGTTAAAATTTTATTTTCTTATTTTTCTCTTTATATTTTCTTATGAGCTTTTTGAGTCACTCAAGTTGCTTCTCACCTTCTCTACCCAAAAATATTGAATTATTTGCAGTTACTCATGTTACCACAACCACCGTTCAAGCCTATTTATTATACGCTGGTCATTATTGACCTTTGCAAGGTACGTGCTTTTGTTATTGTAGATGTGATCTTCTACCTTGTTTCCTTCCTTTTTCTTTTTTCTAGTATAATTATCGTTTGTTCCTTTTCCTTTTTGACCAAAAATTCTTATTTGTGAAATATATGACTATGAGGTATATATAGGCTCTTCCTGGGGCATTTCCTGCCGTTGTAGCTGGTGCAGTTCGTGCCTTATTTGAGAAAATAGCTGATTTAGACATGGAGTGTCGTATACGGTTGATACTTTGGTTTTCACACCATTTGTAAGTTTGTACTTTGCGAACACTAAAGAATTCACTATCATATCTGACTGTTAACTTTCTTGCGATAAATATGTTATGGGCCAGTTTTGGAGTTCTCGTATTATTATCTCCTTTCCTTTTCCTTAAATTTGGAGTTCAATTTATTCTCACCTTCTTTCATGTATGCTGTCTTTTTCGGAATTAATGGATCACTCGCCTTTTAAATTTATATAAAACAACTCAAACGACACTACCGAAACAAATTACAGTTTCAGTTCTGTCTGTAACTTCTATTTCTTGTACATCTAATACTGACATGTTTTTAATGAGTTTAAACTTCCCATTCTTTGGCATGGCATATAAGTTTTGTGTCTTTATCACTTGTGAGAGGTTTTAGGATTACCCTCAGTCATGCAATCTCTTTTGAAGTGCCATTGTTGTGTTCAAGATGTACTATTGTGATGCTTGCATCATACTGAAAAAATTATGCTCTCCTTTTAAAGTTAGATTACCTGGATATTTACATTGTTTCTGCCACCGTTTACGCAGATCAAATTTTCAATTTATATGGCCATGGGATGAATGGGCTCACGTATTAGAACTTCCAAAATGGGCTCCCCAGCGAGTGTTTGTGAAAGAGGTTCTGGATAGGGAGGTCCGTCTGTCATATTGGGATAAAATAAAGCAGGTATTTATCATGCTTGAAACTGTTGACTTTTTTTTCTTTGTTCTCACATATACTATTATTCTGATATTTGTACTCAGCATGTTTTGTAAAACAAAGTAGTTGTAACGACCCAGATCCACCGCTAGTAGATATTGTCCTCTTTGGGCTTTCCTTTTCGGGCTTCCCCTCAAGGCTTTAAAATGTGTCTGCTAGGGGAAGGTTTCCACACCCTTATAAATGGTGGTTTGTTCTCTTCCCCAACCAATGTGGGACATCACAAAAGGTATCAGAGCCAGACATCGAACGATGTGCCAGCCTTCTCGCTGTTTCCCGAAGGGGGTCGACACGAGGCGGTGTGCTAGTAAGGACGCTGGGCACCAAAAGGGGTGGATTTGGGGGTGGTTCCACATCGGTTGGAGGAAGGAAAGAGTGCCAGCGAGGACGCTGGGCCCCGAAGGGGGTGGATTGTGATGTCCCACATTGGTTGGGGAGGAGAACAAACTACCATTTATTAGGGTGTGGAAACTTTCCCCTAGCAGACGCATTTTAAAGCCTTGAGGGGAAGCCCAAAAAGGAAATCCCAAAGAGGATAATATCTGCTAGCGGTGGATCTGGGCCGCTACAGTAGTCCTTCGTAGTAAATTCATAATGTATTCATCGGTCAACATGAGAAAAACAATGAGTAACAGAGTATCCACCCACCAAACGGTGACTATAATCCTCTAGGATATTCCAACATAATATTTGTATATGCTTGAAAAATCAGAATAATGGAGTAAATATGAGCTTATTCTTCCTGTTTTTTTACTTTGTTTTCTGACATGTCAAGTATATTGAGCGTATGTAGAGCATTGAGACTGCACCTGGTTTAGAAGAGTTGCTACCTCCCAAGGGTGGACCAAACTTCAAATTTGCTACCGAAGACGGAGAAAAAAGTGAGCAACACGCACTTTCTTCTGAATTGTGCAATATGGTGAAGGGACGGGCTGCAGCACGTGACGTAATTTCATGGTTGGATGAAAATGTTATTCCCAAGCATGGTTCAGATATTTCTCTCGTAGTAGTTGTGAAAACTCTGCTAGATATTGGGTCGAAGAGTTTCACTCATTTGATAACAGTCTTGGAGAGATATGGACAAGTTATTTCAAGAATATGCGATGATCAGGATAAGCAGGTCTTGCTTATATCTGAAGTGGGTTCTTACTGGGAGAATAATACTCAAATGACGGCAATAACAATTGATAGAATGATGGGTTATAGGTTAATATCCAATTTATCCATCATTAAATGGATCTTTTCTCCAGAAAACGTTGAGCAATATCATACATCAGATCGTCCATGGGAGGTATATATTGCCATTTTTTTTGGCTACTTCACTGCAGATGGAAATAAGTTGCTTAACTATTTTTTTCCTTATTATCTCTTGTACAGATACTGAGAAATGCGTTATGCAAGACGTATAATCGTATTTCTGATCTTAAAAAAGAAATATCCTCCTTGAAGAAAGATATTGTTGCAGCTGAAGAAGCTGTTGCTAAGACACAGGAGGAATTGAATGCTGCTGAATCAAAGCTCACACTTGTGGATGGTGAACCTGTTATGGGAGAGAATCCCGTGAGAACGAAGCGATTGAAAGCTTATGCTGGAAAAGCAAAAGAGCAGGAGACGTCGATACGAAACTCTTTAGAGGCCAAAGAAGCTCTTCTTACCCGAGCTCTTGAGGAGAACGAGGTAGTAATTTCAAGACTAAAATAATTTGCACCTCTTTCGTTAAATCTGGACACATTAATGTCAGTATTCTCGATTCTCTGTTGCATCTGCCTCAACAACGTAACACCGTATCACTTTCCCGTTTTAACTAATCGTTTGTGTGCAGACATTATTTCTATCTTTGTACAAAGGGTTTTCCAGTATATTGACAGAACGCCTTCCAGCTGCATCTAGCGCACAAACCCTGCAGGATTTGAAGTCTATTAATCCTGCTGGTGCGAATGCTATGGACCTTGAAGAACCATCAGCCATGGAGATGGAGATGGACAATGAGGATTCAAAACCTGAAAAAAGGTGTTCTCGTCAATACTACAATGATCCAATATCTTTTTGCCAAAATCCATCCATGATAGTACAGAACATGAAATCACAATGTTTTCTTTATTTTGCAGTCATTTGAATGGTAAGACAGAGCATTCCTACACTGTAGGTGAAAATGAACAGTGGTGTTTAACAACCTTGGGATATGTCAAGGCCTTCTCAAGGCAATACGCTTCCGAGGTATTTTTCGTTTCATACAGTGCCATGCTTTCGGGACTATTAATCATTGTTATGGCCTAAACCCACCACCGCTAGCAGATATTGTCCTATGTGGGCTTTCCTTTTTGGCTTCTCCTCAAGGTTTTTTAAAATGCGTCTGCTAGAGAGAGGTTTCCACACCCTTATAGAATATTTAGTTCTCCTCTTCAATCGATGTGGGATCTCACAATCATGCAAAAAGTTCTGATTATTCATTTATAAAACTATCAGAATAAACTGTTTTTTACTCGAACCATTTTCATTGTGATGTGCACAGATATGGCCACACATTGAGAAGTTGGATGCAGAAGTCTTGTCCGAAGATTCACACCCACTTTTCAGGAAAGCAGTCTACAGTGGCCTTCGTCGATCAATGGACTCGATCTAACAACGTAAACATATTTGTTATAACATTAAACATTTTTCTATTTTTCCGCTTAATTCGTTTGTTTCTTTTTCTCTTCATAGATTAGGTGATAGATCCCTCGGTAATGTTATGTAACATTTGGCTAAACGAAAAATACATAATAGGCTAGCCAATCATTGTCGATACCGTTTTAATTGTATTTGTTGTCGAGTTATGTTCATGGAACATTTTTCTATTATAGCTAAACGTAAACATATTTGTTATAACATTAAACATTTTTCTATTATAGCATCGATCTTAATCTCGGGTGTCACGAC
mRNA sequence
AAACCCTTCTAATCCCCACGGGAGCGTCTCCCGGTTTTACACAATTGCATTTCTTCCACTGCCGGACTTGTCCTGCTCCGGAAACTCAGCGGCGTCTGCAGTCCCGTGTAATTCGAACCCACTTCACCATCTCCTAATTTTTCCCTTATACAAAATCACCTTTCTTCTTCTAGGCAGTAATGAGCAGCTGGAAGAGTCTTCTTCTCCGAATTGGCGACAAGTCCCCGGAATACGGCACCTCCTCCGATTTCAAAGACCACATTGAAACTTGCTTCGCAGCGATTCGGCGGGAGCTGGATCACTATGGAGATGAAATTTTGCCTTTCCTTCTGCAATGTGTTGAACAATTGCCTCATAAGACTCCTTTGTACGGGACATTGATCGGATTGATGAATTTGGAAAATGAAGACTTCGTGAAGAAAATTGTGGAGCAAACTCACCAAAGTTTTCAGGATGCCTTAAACTCTGGACATTGCCACAAGATCCGGATTTTGTTACGCTTTCTAACTGCATTGATGAGCAGTAAAGTCCTGACGTCCACATCTCTAGTAGTTGTTTTTGAGACATTATTATCTTCAGCTGCCACAATAGTGGATGATGAGAAGGGAAACCCTGCATGGCAGGCTTGTGCTGACTTTTACATAACTTGCATTCTATCTTGTTTCCCCTGGGGAGGCGCTGAACTGGTTGAGCAAGTTCCTGAAGAGCTTGAGAGGGTTATGGTAGGAGTAGAAGCTTACTTAAGCATTCGAAGGCATACGTTGGACACCGGCTTATCCTTTTTTGAGGATGATGGTGAAGTTGAGAAAACTCTCAACGAGAAGGATTTTTTAGAAGATTTATGGAGTCGCATACAAACGTTATCTACTGATGGATGGAAAGTGGATAGTGTTCCAAGACCTCACCTTCTATTTGAAGCTCAGCTAGTTGCTGGGAAGTCTCATGAATTTGGAACCATCAGCTGTCCGGAGCAACCTGATCCACCTTTAACACTTTCTGACATTACTTATGGTAAACAGAAGTATGCTGCAGAGTTGAATTATCCTCAAAGGATACGTCGACTTAATATATTTCCATCAAGTAAATTTGAGGATGTGCAACCTATTGATCGCTTTGTCGTGGAGGAGTATCTTCTTGATGTGCTTCTTTTCTTCAATGGCTGTCGAAAGGAATGTGCATCTTTCATGGTTGGCCTTCCTGTACCTTTTAGATATGAGTATCTTATGGCAGAGACAATTTTCTCGCAGTTACTCATGTTACCACAACCACCGTTCAAGCCTATTTATTATACGCTGGTCATTATTGACCTTTGCAAGGCTCTTCCTGGGGCATTTCCTGCCGTTGTAGCTGGTGCAGTTCGTGCCTTATTTGAGAAAATAGCTGATTTAGACATGGAGTGTCGTATACGGTTGATACTTTGGTTTTCACACCATTTATCAAATTTTCAATTTATATGGCCATGGGATGAATGGGCTCACGTATTAGAACTTCCAAAATGGGCTCCCCAGCGAGTGTTTGTGAAAGAGGTTCTGGATAGGGAGGTCCGTCTGTCATATTGGGATAAAATAAAGCAGAGCATTGAGACTGCACCTGGTTTAGAAGAGTTGCTACCTCCCAAGGGTGGACCAAACTTCAAATTTGCTACCGAAGACGGAGAAAAAAGTGAGCAACACGCACTTTCTTCTGAATTGTGCAATATGGTGAAGGGACGGGCTGCAGCACGTGACGTAATTTCATGGTTGGATGAAAATGTTATTCCCAAGCATGGTTCAGATATTTCTCTCGTAGTAGTTGTGAAAACTCTGCTAGATATTGGGTCGAAGAGTTTCACTCATTTGATAACAGTCTTGGAGAGATATGGACAAGTTATTTCAAGAATATGCGATGATCAGGATAAGCAGGTCTTGCTTATATCTGAAGTGGGTTCTTACTGGGAGAATAATACTCAAATGACGGCAATAACAATTGATAGAATGATGGGTTATAGGTTAATATCCAATTTATCCATCATTAAATGGATCTTTTCTCCAGAAAACGTTGAGCAATATCATACATCAGATCGTCCATGGGAGATACTGAGAAATGCGTTATGCAAGACGTATAATCGTATTTCTGATCTTAAAAAAGAAATATCCTCCTTGAAGAAAGATATTGTTGCAGCTGAAGAAGCTGTTGCTAAGACACAGGAGGAATTGAATGCTGCTGAATCAAAGCTCACACTTGTGGATGGTGAACCTGTTATGGGAGAGAATCCCGTGAGAACGAAGCGATTGAAAGCTTATGCTGGAAAAGCAAAAGAGCAGGAGACGTCGATACGAAACTCTTTAGAGGCCAAAGAAGCTCTTCTTACCCGAGCTCTTGAGGAGAACGAGACATTATTTCTATCTTTGTACAAAGGGTTTTCCAGTATATTGACAGAACGCCTTCCAGCTGCATCTAGCGCACAAACCCTGCAGGATTTGAAGTCTATTAATCCTGCTGGTGCGAATGCTATGGACCTTGAAGAACCATCAGCCATGGAGATGGAGATGGACAATGAGGATTCAAAACCTGAAAAAAGTCATTTGAATGGTAAGACAGAGCATTCCTACACTGTAGGTGAAAATGAACAGTGGTGTTTAACAACCTTGGGATATGTCAAGGCCTTCTCAAGGCAATACGCTTCCGAGATATGGCCACACATTGAGAAGTTGGATGCAGAAGTCTTGTCCGAAGATTCACACCCACTTTTCAGGAAAGCAGTCTACAGTGGCCTTCGTCGATCAATGGACTCGATCTAACAACGTAAACATATTTGTTATAACATTAAACATTTTTCTATTTTTCCGCTTAATTCGTTTGTTTCTTTTTCTCTTCATAGATTAGGTGATAGATCCCTCGGTAATGTTATGTAACATTTGGCTAAACGAAAAATACATAATAGGCTAGCCAATCATTGTCGATACCGTTTTAATTGTATTTGTTGTCGAGTTATGTTCATGGAACATTTTTCTATTATAGCTAAACGTAAACATATTTGTTATAACATTAAACATTTTTCTATTATAGCATCGATCTTAATCTCGGGTGTCACGAC
Coding sequence (CDS)
ATGAGCAGCTGGAAGAGTCTTCTTCTCCGAATTGGCGACAAGTCCCCGGAATACGGCACCTCCTCCGATTTCAAAGACCACATTGAAACTTGCTTCGCAGCGATTCGGCGGGAGCTGGATCACTATGGAGATGAAATTTTGCCTTTCCTTCTGCAATGTGTTGAACAATTGCCTCATAAGACTCCTTTGTACGGGACATTGATCGGATTGATGAATTTGGAAAATGAAGACTTCGTGAAGAAAATTGTGGAGCAAACTCACCAAAGTTTTCAGGATGCCTTAAACTCTGGACATTGCCACAAGATCCGGATTTTGTTACGCTTTCTAACTGCATTGATGAGCAGTAAAGTCCTGACGTCCACATCTCTAGTAGTTGTTTTTGAGACATTATTATCTTCAGCTGCCACAATAGTGGATGATGAGAAGGGAAACCCTGCATGGCAGGCTTGTGCTGACTTTTACATAACTTGCATTCTATCTTGTTTCCCCTGGGGAGGCGCTGAACTGGTTGAGCAAGTTCCTGAAGAGCTTGAGAGGGTTATGGTAGGAGTAGAAGCTTACTTAAGCATTCGAAGGCATACGTTGGACACCGGCTTATCCTTTTTTGAGGATGATGGTGAAGTTGAGAAAACTCTCAACGAGAAGGATTTTTTAGAAGATTTATGGAGTCGCATACAAACGTTATCTACTGATGGATGGAAAGTGGATAGTGTTCCAAGACCTCACCTTCTATTTGAAGCTCAGCTAGTTGCTGGGAAGTCTCATGAATTTGGAACCATCAGCTGTCCGGAGCAACCTGATCCACCTTTAACACTTTCTGACATTACTTATGGTAAACAGAAGTATGCTGCAGAGTTGAATTATCCTCAAAGGATACGTCGACTTAATATATTTCCATCAAGTAAATTTGAGGATGTGCAACCTATTGATCGCTTTGTCGTGGAGGAGTATCTTCTTGATGTGCTTCTTTTCTTCAATGGCTGTCGAAAGGAATGTGCATCTTTCATGGTTGGCCTTCCTGTACCTTTTAGATATGAGTATCTTATGGCAGAGACAATTTTCTCGCAGTTACTCATGTTACCACAACCACCGTTCAAGCCTATTTATTATACGCTGGTCATTATTGACCTTTGCAAGGCTCTTCCTGGGGCATTTCCTGCCGTTGTAGCTGGTGCAGTTCGTGCCTTATTTGAGAAAATAGCTGATTTAGACATGGAGTGTCGTATACGGTTGATACTTTGGTTTTCACACCATTTATCAAATTTTCAATTTATATGGCCATGGGATGAATGGGCTCACGTATTAGAACTTCCAAAATGGGCTCCCCAGCGAGTGTTTGTGAAAGAGGTTCTGGATAGGGAGGTCCGTCTGTCATATTGGGATAAAATAAAGCAGAGCATTGAGACTGCACCTGGTTTAGAAGAGTTGCTACCTCCCAAGGGTGGACCAAACTTCAAATTTGCTACCGAAGACGGAGAAAAAAGTGAGCAACACGCACTTTCTTCTGAATTGTGCAATATGGTGAAGGGACGGGCTGCAGCACGTGACGTAATTTCATGGTTGGATGAAAATGTTATTCCCAAGCATGGTTCAGATATTTCTCTCGTAGTAGTTGTGAAAACTCTGCTAGATATTGGGTCGAAGAGTTTCACTCATTTGATAACAGTCTTGGAGAGATATGGACAAGTTATTTCAAGAATATGCGATGATCAGGATAAGCAGGTCTTGCTTATATCTGAAGTGGGTTCTTACTGGGAGAATAATACTCAAATGACGGCAATAACAATTGATAGAATGATGGGTTATAGGTTAATATCCAATTTATCCATCATTAAATGGATCTTTTCTCCAGAAAACGTTGAGCAATATCATACATCAGATCGTCCATGGGAGATACTGAGAAATGCGTTATGCAAGACGTATAATCGTATTTCTGATCTTAAAAAAGAAATATCCTCCTTGAAGAAAGATATTGTTGCAGCTGAAGAAGCTGTTGCTAAGACACAGGAGGAATTGAATGCTGCTGAATCAAAGCTCACACTTGTGGATGGTGAACCTGTTATGGGAGAGAATCCCGTGAGAACGAAGCGATTGAAAGCTTATGCTGGAAAAGCAAAAGAGCAGGAGACGTCGATACGAAACTCTTTAGAGGCCAAAGAAGCTCTTCTTACCCGAGCTCTTGAGGAGAACGAGACATTATTTCTATCTTTGTACAAAGGGTTTTCCAGTATATTGACAGAACGCCTTCCAGCTGCATCTAGCGCACAAACCCTGCAGGATTTGAAGTCTATTAATCCTGCTGGTGCGAATGCTATGGACCTTGAAGAACCATCAGCCATGGAGATGGAGATGGACAATGAGGATTCAAAACCTGAAAAAAGTCATTTGAATGGTAAGACAGAGCATTCCTACACTGTAGGTGAAAATGAACAGTGGTGTTTAACAACCTTGGGATATGTCAAGGCCTTCTCAAGGCAATACGCTTCCGAGATATGGCCACACATTGAGAAGTTGGATGCAGAAGTCTTGTCCGAAGATTCACACCCACTTTTCAGGAAAGCAGTCTACAGTGGCCTTCGTCGATCAATGGACTCGATCTAA
Protein sequence
MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHKTPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTSTSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERVMVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPRPHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPSSKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLMLPQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLSNFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPKGGPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVVKTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRMMGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVAAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAKEALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSAMEMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLDAEVLSEDSHPLFRKAVYSGLRRSMDSI
Homology
BLAST of CmoCh10G009520 vs. ExPASy Swiss-Prot
Match:
Q9SIU2 (Nuclear cap-binding protein subunit 1 OS=Arabidopsis thaliana OX=3702 GN=ABH1 PE=1 SV=2)
HSP 1 Score: 1176.4 bits (3042), Expect = 0.0e+00
Identity = 575/862 (66.71%), Postives = 706/862 (81.90%), Query Frame = 0
Query: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
MS+WK+LLLRIG+K PEYGTSSD+KDHIETCF IRRE++ GD++LPFLLQC EQLPHK
Sbjct: 1 MSNWKTLLLRIGEKGPEYGTSSDYKDHIETCFGVIRREIERSGDQVLPFLLQCAEQLPHK 60
Query: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
PLYGTLIGL+NLENEDFV+K+VE H +FQ AL+SG+C+ IRILLRF+T+L+ SKV+
Sbjct: 61 IPLYGTLIGLLNLENEDFVQKLVESVHANFQVALDSGNCNSIRILLRFMTSLLCSKVIQP 120
Query: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
SL+VVFETLLSSAAT VD+EKGNP+WQ ADFY+ CILS PWGG+EL EQVP+E+ERV
Sbjct: 121 ASLIVVFETLLSSAATTVDEEKGNPSWQPQADFYVICILSSLPWGGSELAEQVPDEIERV 180
Query: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
+VG++AYLSIR+++ +GL+FF +GE E +L EKDF+EDL RIQ+L+++GWK++SVPR
Sbjct: 181 LVGIQAYLSIRKNSSTSGLNFFH-NGEFESSLAEKDFVEDLLDRIQSLASNGWKLESVPR 240
Query: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
PHL FEAQLVAGK HE I C EQP PP S GKQK+ A YPQRIRRLNIFP+
Sbjct: 241 PHLSFEAQLVAGKFHELRPIKCMEQPSPPSDHSRAYSGKQKHDALTRYPQRIRRLNIFPA 300
Query: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
+K EDVQPIDRFVVEEYLLDVL + NGCRKECAS+M LPV FRYEYLMAET+FSQ+L+L
Sbjct: 301 NKMEDVQPIDRFVVEEYLLDVLFYLNGCRKECASYMANLPVTFRYEYLMAETLFSQILLL 360
Query: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
PQPPFK +YYTLVI+DLCKALPGAFPAVVAGAVRALFEKI+DLDME R RLILWFSHHLS
Sbjct: 361 PQPPFKTLYYTLVIMDLCKALPGAFPAVVAGAVRALFEKISDLDMESRTRLILWFSHHLS 420
Query: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
NFQFIWPW+EWA VL+LPKWAP+RVFV+E+L REVRLSYWDKIKQSIE A LEELLPPK
Sbjct: 421 NFQFIWPWEEWAFVLDLPKWAPKRVFVQEILQREVRLSYWDKIKQSIENATALEELLPPK 480
Query: 481 GGPNFKFATEDG-EKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVV 540
GPNF ++ E+G EK+E+ LS+EL VK + ARD+I W++E + P HG +++L +VV
Sbjct: 481 AGPNFMYSLEEGKEKTEEQQLSAELSRKVKEKQTARDMIVWIEETIYPVHGFEVTLTIVV 540
Query: 541 KTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRM 600
+TLLDIGSKSFTHL+TVLERYGQV S++C D DKQV+L+S+V +YW+NN QMTA+ IDRM
Sbjct: 541 QTLLDIGSKSFTHLVTVLERYGQVFSKLCPDNDKQVMLLSQVSTYWKNNVQMTAVAIDRM 600
Query: 601 MGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIV 660
MGYRL+SN +I++W+FSPENV+Q+H SD+PWEIL NAL KTYNRISDL+K+IS++ K+++
Sbjct: 601 MGYRLVSNQAIVRWVFSPENVDQFHVSDQPWEILGNALNKTYNRISDLRKDISNITKNVL 660
Query: 661 AAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAK 720
AE+A A + EL AAESKL+LV+GEPV+GENP + KRLK+ K E E S+R SLEAK
Sbjct: 661 VAEKASANARVELEAAESKLSLVEGEPVLGENPAKMKRLKSTVEKTGEAELSLRESLEAK 720
Query: 721 EALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSA 780
EALL RAL E E L L L++ F +L ERLP + +++QDLKSI + ++PSA
Sbjct: 721 EALLNRALSETEVLLLLLFQSFLGVLKERLPDPTKVRSVQDLKSI------GAEDDKPSA 780
Query: 781 MEMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
M++D+E+ P+K S VGE EQWCL+TLGY+ AF+RQYASEIWPH+EKL+
Sbjct: 781 --MDVDSENGNPKK---------SCEVGEREQWCLSTLGYLTAFTRQYASEIWPHMEKLE 840
Query: 841 AEVLS-EDSHPLFRKAVYSGLR 861
+EV S ED HPLF +A+ S L+
Sbjct: 841 SEVFSGEDVHPLFLQAISSALQ 844
BLAST of CmoCh10G009520 vs. ExPASy Swiss-Prot
Match:
Q10LJ0 (Nuclear cap-binding protein subunit 1 OS=Oryza sativa subsp. japonica OX=39947 GN=ABH1 PE=2 SV=1)
HSP 1 Score: 1085.9 bits (2807), Expect = 0.0e+00
Identity = 537/863 (62.22%), Postives = 674/863 (78.10%), Query Frame = 0
Query: 2 SSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHKT 61
+ W++LLLRIGD+ PEYG S+D K+HIETC+ + RE +H D + FLLQC +QLPHK
Sbjct: 3 AGWRTLLLRIGDRCPEYGGSADHKEHIETCYGVLCREYEHSKDAMFEFLLQCADQLPHKI 62
Query: 62 PLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTST 121
P +G LIGL+NLENEDF K IV+ TH + QDAL++ + +IRILLRFL LM SKV+
Sbjct: 63 PFFGVLIGLINLENEDFSKGIVDTTHANLQDALHNENRDRIRILLRFLCGLMCSKVVLPN 122
Query: 122 SLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERVM 181
S++ FE LLSSAATI+D+E GNP+WQ ADFY+ CIL+ PWGG+EL EQVP+E ERV+
Sbjct: 123 SIIETFEALLSSAATILDEETGNPSWQPRADFYVYCILASLPWGGSELFEQVPDEFERVL 182
Query: 182 VGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPRP 241
VG+++Y+SIRRH D S FE D + N+KDF+EDLW RIQ LS +GWKV SVP+P
Sbjct: 183 VGIQSYISIRRHFDDIAFSVFETD--EGNSPNKKDFIEDLWERIQVLSRNGWKVKSVPKP 242
Query: 242 HLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPSS 301
HL FEAQLVAG SH F ISCP P + S+I G++K+ A+L YPQR+RRL+IFP++
Sbjct: 243 HLSFEAQLVAGVSHRFSPISCP-PPTISQSSSEIVKGQEKHEADLKYPQRLRRLHIFPTN 302
Query: 302 KFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLMLP 361
K E++QP+DRFVVEE +LDVLLFFNGCRKECA ++V LPVPFRYEYLMAETIFSQLL+LP
Sbjct: 303 KAENMQPVDRFVVEECILDVLLFFNGCRKECAFYLVSLPVPFRYEYLMAETIFSQLLLLP 362
Query: 362 QPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLSN 421
PPF+PIYYTLVIIDLCKALPGAFP+VV GAV ALF++I+++DMECR RLILWFSHHLSN
Sbjct: 363 NPPFRPIYYTLVIIDLCKALPGAFPSVVVGAVHALFDRISNMDMECRTRLILWFSHHLSN 422
Query: 422 FQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPKG 481
FQFIWPW EWA+V +LPKWAPQRVFV+EVL+RE+RLSY+DKIKQSIE A LEELLPPK
Sbjct: 423 FQFIWPWQEWAYVKDLPKWAPQRVFVQEVLEREIRLSYFDKIKQSIEDAVELEELLPPKA 482
Query: 482 GPNFKFATEDG-EKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVVK 541
GPNF++ +++G E ++ H LS EL MV+GR D+ISW+DE +IP +G+ +L VV +
Sbjct: 483 GPNFRYHSDEGKESTDGHRLSKELVAMVRGRKTQGDIISWVDEKIIPVNGAKFALDVVSQ 542
Query: 542 TLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRMM 601
TLLDIGSKSFTHLITVLERYGQ+IS++C +++ Q+LL+ EV +YW+N+TQM AI IDRMM
Sbjct: 543 TLLDIGSKSFTHLITVLERYGQIISKLCPNEEMQLLLMDEVSAYWKNSTQMIAIAIDRMM 602
Query: 602 GYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVA 661
GYRL+SNL+I+KW+FSP NV+Q+H SDRPWEILRNA+ KTYNRI DL+KEI +L+K + A
Sbjct: 603 GYRLLSNLAIVKWVFSPANVDQFHVSDRPWEILRNAVSKTYNRIFDLRKEIQTLRKGLQA 662
Query: 662 AEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAKE 721
A+EA K EL A+S + +VDG+PV ENP R +RL+A A KAKE E + SLEAKE
Sbjct: 663 AKEASEKAARELEEAKSIIEIVDGQPVPSENPGRLRRLQARADKAKEGEVTTEESLEAKE 722
Query: 722 ALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINP-AGANAMDLEEPSA 781
ALL R LEE++ L L+K F +LTERLP S+ + +L++ +P ++A D P A
Sbjct: 723 ALLARGLEESKELLRLLFKSFVEVLTERLPPISADGDVPNLRAGDPNVNSSARD---PEA 782
Query: 782 MEMEMDNEDSKPEKSHLNGKTEH-SYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKL 841
ME+DNE+ S LNG+ + S+ VGE EQWCL TLGY+K+FSRQYA+EIW HI L
Sbjct: 783 TTMEIDNENGGDNDSQLNGQNKKISHNVGELEQWCLCTLGYLKSFSRQYATEIWSHIAML 842
Query: 842 DAEVLSEDSHPLFRKAVYSGLRR 862
D E+ + HPL RKA +SGL R
Sbjct: 843 DQEIFVGNIHPLIRKAAFSGLCR 859
BLAST of CmoCh10G009520 vs. ExPASy Swiss-Prot
Match:
Q16UN6 (Nuclear cap-binding protein subunit 1 OS=Aedes aegypti OX=7159 GN=Cbp80 PE=3 SV=1)
HSP 1 Score: 321.6 bits (823), Expect = 2.7e-86
Identity = 210/761 (27.60%), Postives = 377/761 (49.54%), Query Frame = 0
Query: 5 KSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHKTPLY 64
++L+LR+G+ S SS + ++E + + +L ++ +IL L +C ++P K +Y
Sbjct: 35 ETLILRVGENS-----SSSLESNLEGLVSVLESDLGNFRSKILRILSECPIKMPEKCTIY 94
Query: 65 GTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTSTSLV 124
T++GLMN +N +F + V+ ++F+++L R LRFL+ L++ V+++ SL+
Sbjct: 95 STMVGLMNAKNYNFGGEFVDHMVKTFKESLKQCRWDAARYALRFLSDLVNCHVISTNSLL 154
Query: 125 VVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERVMVGV 184
+ + ++ +A +E P Q D+Y+ +LS PW G EL E+ LE ++V +
Sbjct: 155 QLLDNMVDAA-----NEDSVP--QVRRDWYVFAVLSTLPWVGRELYEKKESALENLLVRI 214
Query: 185 EAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPRPHLL 244
E +L+ R L + V+ ++++L+ LW++I+ L D W +PRP+L
Sbjct: 215 EVFLNKRTKKHHNSLRVW----SVDAPHPQEEYLDCLWAQIRKLRQDNWAEKHIPRPYLA 274
Query: 245 FEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPSSKFE 304
F++ L H I P D + E P + R+ +
Sbjct: 275 FDSVLCEALQHNLPLIHPPPHQD---------------SFEYPMPWVVYRMFDYTDCPAG 334
Query: 305 DVQP----IDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFR--YEYLMAETIFSQLL 364
+ P I+RF++EE+L ++ + RK+CA+ ++ L + EY + E IF++L
Sbjct: 335 PILPGAHSIERFLIEEHLHSIIEAHHWERKDCAANLLNLSYKDKIPLEYCIVEVIFAELF 394
Query: 365 MLPQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHH 424
+P P + I Y ++I+LCK P P V+A A LF +I ++ C R WFS+H
Sbjct: 395 KMPTPRYLDICYGSILIELCKLQPSKMPQVLAQATEILFMRIDSMNTSCFDRFANWFSYH 454
Query: 425 LSNFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLP 484
LSNFQF W WD+W L L P+ F++EVL + +RLSY D+ K+ + +L+P
Sbjct: 455 LSNFQFRWSWDDWDSCLLLEPEHPRPKFIEEVLLKCLRLSYHDRFKEMMPET--YSKLIP 514
Query: 485 PKGGPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLV-- 544
P +K+ E A + +L ++ + DV++ L + P+ S+ +V
Sbjct: 515 KPPMPTYKYTMEGAASLPGTATAHKLVVAIRQKCTPEDVLNELKDLPNPRETSENDMVES 574
Query: 545 --------VVVKTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENN 604
V V+TLL++GSKSF+H + ++ V + + ++ Q+ ++ V W N+
Sbjct: 575 TFNPLKIDVFVQTLLNLGSKSFSHTFAAISKFHLVFKTLAETEEAQICILHNVFELWVNH 634
Query: 605 TQMTAITIDRMMGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLK 664
QM + ID+++ +++ ++ W+FS E V ++ T WEIL + K ++ L
Sbjct: 635 QQMMVVIIDKLLKTQIVECSAVATWVFSKEMVGEF-TKMYLWEILHLTIKKMNQHVTKLS 694
Query: 665 KEISSLKKDIVAAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQ 724
KE+S A+E + + E ++ + T G + R K + A K E+
Sbjct: 695 KELSD-------AKERLDRNAESSSSESEEETAPAGTDAVTPQRRRKKPIGDNADKPTEE 748
Query: 725 ETSIRNSLEAKEALLTRALEENETLFLSLYKGFSSILTERL 750
+ +E E L A + + LFL +++ F IL+E L
Sbjct: 755 Q------VERMEEKLEAAYVDQKRLFLIIFQRFIMILSEHL 748
BLAST of CmoCh10G009520 vs. ExPASy Swiss-Prot
Match:
Q7PX35 (Nuclear cap-binding protein subunit 1 OS=Anopheles gambiae OX=7165 GN=Cbp80 PE=3 SV=4)
HSP 1 Score: 321.6 bits (823), Expect = 2.7e-86
Identity = 207/761 (27.20%), Postives = 374/761 (49.15%), Query Frame = 0
Query: 5 KSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHKTPLY 64
++L+LR+G+ S +S + ++E + + +L ++ +IL L C ++P K +Y
Sbjct: 35 ETLILRVGENS-----TSSLESNLEGLVSVLESDLGNFRSKILRILSDCPIKMPEKCTIY 94
Query: 65 GTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTSTSLV 124
T++GLMN +N +F + VE ++F+D+L R LRFL L++ V+++ SL+
Sbjct: 95 STMVGLMNAKNYNFGGEFVEYMVKTFKDSLKQCQWDAARYALRFLADLVNCHVISTNSLL 154
Query: 125 VVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERVMVGV 184
+ ++++ +A +E P Q D+Y+ +LS PW G EL E+ LE ++V +
Sbjct: 155 QLLDSMVDAA-----NEDNVP--QVRRDWYVFAVLSTLPWVGRELYEKKESALENLLVRI 214
Query: 185 EAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPRPHLL 244
E +L+ R L + V+ ++++L+ LW++I+ L D W +PRP+L
Sbjct: 215 EVFLNKRTKKHHNALRVW----SVDAPHPQEEYLDCLWAQIRKLRQDNWTEKHIPRPYLA 274
Query: 245 FEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIF----PS 304
F++ L H I P D + E P + R+ + P
Sbjct: 275 FDSVLCEALQHNIPVIHPPPHQD---------------SFEYPMPWVVYRMFDYTDCPPG 334
Query: 305 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFR--YEYLMAETIFSQLL 364
I+RF++EE+L ++ RK+CA ++ LP + EY + E IF++L
Sbjct: 335 PILPGAHSIERFLIEEHLHSIIEMHRWERKDCAIHLLMLPYKDKIPLEYCIVEVIFAELF 394
Query: 365 MLPQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHH 424
+P P + I Y ++I+LCK P P V+A A LF +I ++ C R + WFS+H
Sbjct: 395 HMPTPRYLEICYGSILIELCKQQPSKMPQVLAQATEILFMRIDSMNTSCFDRFVNWFSYH 454
Query: 425 LSNFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLP 484
LSNFQF W WD+W L L P+ F++EVL + +R SY D+ K+ + G +L+P
Sbjct: 455 LSNFQFRWSWDDWDSCLLLENEHPRPKFIQEVLLKCLRFSYHDRFKEMM--PEGYAKLIP 514
Query: 485 PKGGPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSD------ 544
P++K++ E A + +L ++ + A DV++ L++ + SD
Sbjct: 515 KPPVPHYKYSMEGAASLPGTATAHKLVVAIRQKCNAEDVLNELNDLPNSRDASDTDMAEA 574
Query: 545 ----ISLVVVVKTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENN 604
+ + V V+TLL++GSKSF+H + ++ V + + ++ Q+ ++ + W ++
Sbjct: 575 PFNPLKIDVFVQTLLNLGSKSFSHSFAAISKFHAVFKALAETEEAQICILHNMFELWVDH 634
Query: 605 TQMTAITIDRMMGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLK 664
QM + +D+++ +++ ++ W+FS E V ++ T WEIL + K ++ L
Sbjct: 635 QQMMVVIVDKLLKVQIVECSAVATWVFSKEMVGEF-TKMYLWEILHLTIKKMNQHVTKLS 694
Query: 665 KEISSLKKDIVAAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQ 724
+E++ K+ + E+ + E+ A +P KR K G
Sbjct: 695 REMNEAKEKLARTVESSSSESEDEAA----------------SPNAQKRRKNTEGSG--- 742
Query: 725 ETSIRNSLEAKEALLTRALEENETLFLSLYKGFSSILTERL 750
E +E E L A + + LFL +++ F IL+E L
Sbjct: 755 EKPTEEQVERMEEKLEAAYVDQKRLFLIIFQRFIMILSEHL 742
BLAST of CmoCh10G009520 vs. ExPASy Swiss-Prot
Match:
B4GW22 (Nuclear cap-binding protein subunit 1 OS=Drosophila persimilis OX=7234 GN=Cbp80 PE=3 SV=1)
HSP 1 Score: 318.9 bits (816), Expect = 1.7e-85
Identity = 206/759 (27.14%), Postives = 375/759 (49.41%), Query Frame = 0
Query: 5 KSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHKTPLY 64
+SL+LR+G++S +S + ++E + + +L + +IL L C ++P K +Y
Sbjct: 35 ESLILRVGERS-----TSSVESNLEGLVSVLEADLGTFRLKILRILSDCAVRMPEKCTVY 94
Query: 65 GTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTSTSLV 124
TL+GL+N +N F + V+ ++F+++L R LRFL L++ V+++TSL+
Sbjct: 95 TTLVGLLNAKNYKFGGEFVDHMVKTFKESLKMCRWDAARYSLRFLADLVNCHVISATSLL 154
Query: 125 VVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERVMVGV 184
+ +T++ V +E P Q D+++ +LS PW G +L E+ LE +++ +
Sbjct: 155 QLLDTMID-----VSNEDTVP--QVRRDWFVFAVLSTLPWVGRDLYEKKESALESLLLRI 214
Query: 185 EAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPRPHLL 244
E YL+ R L + D ++++L+ LW++I+ L D W +PRP+L
Sbjct: 215 EVYLNKRSKKHHNALRVWSSDAPHP----QEEYLDCLWAQIRKLRQDNWAEKHIPRPYLT 274
Query: 245 FEAQLVAGKSHEFGTISCPEQPDP-PLTLSDITYGKQKYAAELNYPQRIRRLNIFPSSKF 304
F+ L H I P D + + Y Y + P
Sbjct: 275 FDTILCEALQHNLPQIIPPPHNDAFVYPMPWVVYRMFDYTDCPDGP------------NL 334
Query: 305 EDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFR--YEYLMAETIFSQLLMLP 364
I+RF++EE+L ++ ++ RK+CA+ ++ P + EY + E IF++L +P
Sbjct: 335 PGAHSIERFLIEEHLHHIIETYHHERKDCAAQLLSFPFKHKIPLEYCIVEVIFAELFHMP 394
Query: 365 QPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLSN 424
P + I Y ++I+LCK P P V+A A LF +I ++ C R + WFS+HLSN
Sbjct: 395 TPRYLDICYGSILIELCKLQPATLPQVLAQATEILFMRIDSMNTSCFDRFVNWFSYHLSN 454
Query: 425 FQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPKG 484
F+F W WDEW L L P+ F++EVL + +RLSY +I + + T G +L+P
Sbjct: 455 FKFTWSWDEWDSCLLLDGEHPRPKFIQEVLQKCLRLSYHQRITEMMPTTYG--KLIPQVP 514
Query: 485 GPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHG----------- 544
PNFK+A+E+ A++ +L ++ + + +V++ L E IP G
Sbjct: 515 VPNFKYASEEAASLPGTAVAHQLVVAIRQKCSPEEVVNILKE--IPNSGYSGEEMSDGTF 574
Query: 545 SDISLVVVVKTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQ 604
+ + + V V+TLL++GSKSF+H + ++ V + + ++ Q+ ++ + W ++ Q
Sbjct: 575 NALKIDVFVQTLLNLGSKSFSHSFAAISKFHSVFRALAETEEAQICVLHNIYELWSSHQQ 634
Query: 605 MTAITIDRMMGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKE 664
M + +D+++ +++ ++ WIFS E ++ T WEIL + K + L E
Sbjct: 635 MMVVLVDKLLKLQIVDCSAVATWIFSKEMTSEF-TKMYLWEILHLTIKKMNKHVIKLNTE 694
Query: 665 ISSLKKDIVAAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQET 724
+S K + A+ + +++ E+ P + K+ +A K E+
Sbjct: 695 LSVAKDKLSKADSSSSESDEDA-------------------PTKRKKPITHADKPSEE-- 735
Query: 725 SIRNSLEAKEALLTRALEENETLFLSLYKGFSSILTERL 750
++E E L A + LFL +++ F IL+E +
Sbjct: 755 ----AVERMEEKLEAANVNQKRLFLIVFQRFIMILSEHM 735
BLAST of CmoCh10G009520 vs. ExPASy TrEMBL
Match:
A0A6J1H9M5 (nuclear cap-binding protein subunit 1-like OS=Cucurbita moschata OX=3662 GN=LOC111461363 PE=3 SV=1)
HSP 1 Score: 1734.9 bits (4492), Expect = 0.0e+00
Identity = 866/866 (100.00%), Postives = 866/866 (100.00%), Query Frame = 0
Query: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK
Sbjct: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
Query: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS
Sbjct: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
Query: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV
Sbjct: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
Query: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR
Sbjct: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
Query: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS
Sbjct: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
Query: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML
Sbjct: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
Query: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS
Sbjct: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
Query: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK
Sbjct: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
Query: 481 GGPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVVK 540
GGPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVVK
Sbjct: 481 GGPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVVK 540
Query: 541 TLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRMM 600
TLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRMM
Sbjct: 541 TLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRMM 600
Query: 601 GYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVA 660
GYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVA
Sbjct: 601 GYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVA 660
Query: 661 AEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAKE 720
AEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAKE
Sbjct: 661 AEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAKE 720
Query: 721 ALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSAM 780
ALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSAM
Sbjct: 721 ALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSAM 780
Query: 781 EMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLDA 840
EMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLDA
Sbjct: 781 EMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLDA 840
Query: 841 EVLSEDSHPLFRKAVYSGLRRSMDSI 867
EVLSEDSHPLFRKAVYSGLRRSMDSI
Sbjct: 841 EVLSEDSHPLFRKAVYSGLRRSMDSI 866
BLAST of CmoCh10G009520 vs. ExPASy TrEMBL
Match:
A0A6J1JG20 (nuclear cap-binding protein subunit 1-like OS=Cucurbita maxima OX=3661 GN=LOC111485390 PE=3 SV=1)
HSP 1 Score: 1710.3 bits (4428), Expect = 0.0e+00
Identity = 851/866 (98.27%), Postives = 859/866 (99.19%), Query Frame = 0
Query: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDE+LPFLLQCVEQLPHK
Sbjct: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEVLPFLLQCVEQLPHK 60
Query: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS
Sbjct: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
Query: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
TSLVVVFETLLSSA TIVDDEKGNPAWQACADFYI CILSCFPWGGAELVEQVPEELERV
Sbjct: 121 TSLVVVFETLLSSATTIVDDEKGNPAWQACADFYIACILSCFPWGGAELVEQVPEELERV 180
Query: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
MVGVEAYL+IRR+TLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR
Sbjct: 181 MVGVEAYLNIRRNTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
Query: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
PHLLFEAQLVAGKSH+FGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS
Sbjct: 241 PHLLFEAQLVAGKSHDFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
Query: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML
Sbjct: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
Query: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS
Sbjct: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
Query: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK
Sbjct: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
Query: 481 GGPNFKFATEDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVVK 540
GGPNFKF TEDGEKSEQHALS+ELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVV+
Sbjct: 481 GGPNFKFTTEDGEKSEQHALSAELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVVQ 540
Query: 541 TLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRMM 600
TLLDIGSKSFTHLITVLERYGQVISRIC DQDKQVLLISEVGSYWENNTQMTAITIDRMM
Sbjct: 541 TLLDIGSKSFTHLITVLERYGQVISRICHDQDKQVLLISEVGSYWENNTQMTAITIDRMM 600
Query: 601 GYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVA 660
GYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVA
Sbjct: 601 GYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIVA 660
Query: 661 AEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAKE 720
AEEAVAKTQEELNAAESKLTLVDGEPVMGENPVR KRLKAYAGKAKEQE SIRNSLEAKE
Sbjct: 661 AEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRMKRLKAYAGKAKEQEMSIRNSLEAKE 720
Query: 721 ALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSAM 780
ALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSAM
Sbjct: 721 ALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSAM 780
Query: 781 EMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLDA 840
EMEMDNEDS+PEKSHLNG TEH+YTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLDA
Sbjct: 781 EMEMDNEDSRPEKSHLNGSTEHAYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLDA 840
Query: 841 EVLSEDSHPLFRKAVYSGLRRSMDSI 867
EVLSEDSHPLFRKAVYSGLRRSMDSI
Sbjct: 841 EVLSEDSHPLFRKAVYSGLRRSMDSI 866
BLAST of CmoCh10G009520 vs. ExPASy TrEMBL
Match:
A0A1S3BQ48 (nuclear cap-binding protein subunit 1 OS=Cucumis melo OX=3656 GN=LOC103492007 PE=3 SV=1)
HSP 1 Score: 1587.0 bits (4108), Expect = 0.0e+00
Identity = 792/868 (91.24%), Postives = 830/868 (95.62%), Query Frame = 0
Query: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCF AIRRELDHYGDEILPFLLQCVEQLPHK
Sbjct: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFGAIRRELDHYGDEILPFLLQCVEQLPHK 60
Query: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
TPLYGTLIGLMNLENEDFVKK+VE+TH+SFQDALNSGHCHKIRILLRFLTALMSSKVL S
Sbjct: 61 TPLYGTLIGLMNLENEDFVKKVVEKTHKSFQDALNSGHCHKIRILLRFLTALMSSKVLLS 120
Query: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
TSLVVVFETLLSSAAT VDDEKGNPAWQA ADFYITCILSCFPWGGAELVEQVPEELERV
Sbjct: 121 TSLVVVFETLLSSAATTVDDEKGNPAWQARADFYITCILSCFPWGGAELVEQVPEELERV 180
Query: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
MVGVEAYLSIRR TLDTGLSFFE+DGEVEKTLNEKDFLEDLW RIQ L+T GWKVDSVPR
Sbjct: 181 MVGVEAYLSIRRQTLDTGLSFFEEDGEVEKTLNEKDFLEDLWGRIQMLATGGWKVDSVPR 240
Query: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
PHLLFEAQLVAGKSHEFG I CPEQP+PP TLS +TYGKQKY AELNYPQRIRRLNIFPS
Sbjct: 241 PHLLFEAQLVAGKSHEFGAIKCPEQPNPPPTLSGVTYGKQKYDAELNYPQRIRRLNIFPS 300
Query: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLL+L
Sbjct: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLLL 360
Query: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS
Sbjct: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
Query: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
NFQFIWPW+EWA+VLELPKWAPQRVFVKEVLDREVRLSYWDK+KQSIE APGLEELLPPK
Sbjct: 421 NFQFIWPWEEWAYVLELPKWAPQRVFVKEVLDREVRLSYWDKVKQSIENAPGLEELLPPK 480
Query: 481 GGPNFKFATE-DGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVV 540
GGP+FKF+ E DGEKSEQHALS+EL NMVKGRA AR++ISWLDE+VIPKHG D+SLVVVV
Sbjct: 481 GGPSFKFSAEDDGEKSEQHALSAELYNMVKGRAPARELISWLDESVIPKHGLDVSLVVVV 540
Query: 541 KTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRM 600
+TLLDIGSKSFTHLITVLERYGQVISRIC DQDKQVLLISEVGSYW+NNTQMTAI IDRM
Sbjct: 541 QTLLDIGSKSFTHLITVLERYGQVISRICHDQDKQVLLISEVGSYWKNNTQMTAIAIDRM 600
Query: 601 MGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIV 660
MGYRLISNLSI+KWIFSPEN++QYHTSDRPWEILRNALCKTYNRISDL+KEISSLKKD+V
Sbjct: 601 MGYRLISNLSIVKWIFSPENLQQYHTSDRPWEILRNALCKTYNRISDLRKEISSLKKDVV 660
Query: 661 AAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAK 720
AAEEAVA+TQEEL AAESKL+LVDGEPV+GENPVR KRLK+YAG+AKEQE SIR+SLEAK
Sbjct: 661 AAEEAVARTQEELGAAESKLSLVDGEPVLGENPVRLKRLKSYAGRAKEQEISIRDSLEAK 720
Query: 721 EALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSA 780
EALL RALEENE LFLSLYK FSSILTERLPA SAQTLQDLKS NPA NAMD+EEPSA
Sbjct: 721 EALLARALEENEILFLSLYKSFSSILTERLPA--SAQTLQDLKSTNPADTNAMDVEEPSA 780
Query: 781 MEMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
MEMDN +S+PEKSHLNG+TEH+YTV ENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD
Sbjct: 781 --MEMDNVESRPEKSHLNGRTEHAYTVCENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
Query: 841 AEV-LSEDSHPLFRKAVYSGLRRSMDSI 867
AE+ LSEDSHPLFRKAVYSGLRRS+D I
Sbjct: 841 AEILLSEDSHPLFRKAVYSGLRRSLDLI 864
BLAST of CmoCh10G009520 vs. ExPASy TrEMBL
Match:
A0A6J1DLR3 (nuclear cap-binding protein subunit 1 OS=Momordica charantia OX=3673 GN=LOC111021617 PE=3 SV=1)
HSP 1 Score: 1586.2 bits (4106), Expect = 0.0e+00
Identity = 785/868 (90.44%), Postives = 834/868 (96.08%), Query Frame = 0
Query: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK
Sbjct: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
Query: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
PLYGTLIGLMNLENEDFVKKIVEQTH +FQDALNSG+CH+IRILLRFLTA+MSSKVL S
Sbjct: 61 IPLYGTLIGLMNLENEDFVKKIVEQTHTNFQDALNSGNCHRIRILLRFLTAMMSSKVLLS 120
Query: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
TSLVVVFETLLSSAAT VD+EKGNPAWQA ADFYITCILSCFPWGGAEL+EQVPEELERV
Sbjct: 121 TSLVVVFETLLSSAATTVDEEKGNPAWQARADFYITCILSCFPWGGAELIEQVPEELERV 180
Query: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
MVGVEAYLSIRRHT DTGLSFFEDDGEVEKTLNEKDFLEDLW RIQ LS+DGWKVDSVPR
Sbjct: 181 MVGVEAYLSIRRHTFDTGLSFFEDDGEVEKTLNEKDFLEDLWGRIQVLSSDGWKVDSVPR 240
Query: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLS ITYGKQK+ AEL YPQRIRRLNIFPS
Sbjct: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSGITYGKQKFTAELTYPQRIRRLNIFPS 300
Query: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
SKFED+QPIDRFV+EEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML
Sbjct: 301 SKFEDLQPIDRFVMEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
Query: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
PQPPFKPIYYTLVI+DLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS
Sbjct: 361 PQPPFKPIYYTLVIMDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
Query: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
NFQFIWPWDEWA+VLELPKWAPQRVFV+EVL+REVRLSYWDKIKQSIE APGLEELLPPK
Sbjct: 421 NFQFIWPWDEWAYVLELPKWAPQRVFVQEVLNREVRLSYWDKIKQSIENAPGLEELLPPK 480
Query: 481 GGPNFKFAT-EDGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVV 540
GGPNFKF+T +DGEKSEQH +S+EL N+VKGRA AR+VISWLDE+VIPKH D+SLVVVV
Sbjct: 481 GGPNFKFSTDDDGEKSEQHVVSAELSNLVKGRAXAREVISWLDESVIPKHSLDVSLVVVV 540
Query: 541 KTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRM 600
+TLLDIGSKSFTHLITVLERYGQVI R+C+DQDKQVLLISEVGSYW+NNTQMTAI IDRM
Sbjct: 541 QTLLDIGSKSFTHLITVLERYGQVILRMCNDQDKQVLLISEVGSYWKNNTQMTAIAIDRM 600
Query: 601 MGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIV 660
MGYRLISNL+I++WIFSPEN++Q+HTSDRPWEILRNALCKTYNRISDL+KEISSLKKDIV
Sbjct: 601 MGYRLISNLAIVRWIFSPENIQQFHTSDRPWEILRNALCKTYNRISDLRKEISSLKKDIV 660
Query: 661 AAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAK 720
AAEEAVAKTQEELNAAESKL LVDGEPV+GENPVR KRLK+YA KAKE E S R++LEAK
Sbjct: 661 AAEEAVAKTQEELNAAESKLALVDGEPVLGENPVRLKRLKSYAEKAKEHEVSTRDNLEAK 720
Query: 721 EALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSA 780
EALL+RALEENETLFLSLY+ FS+ LTERLPAASSAQTLQDLKS+N A ANAMDLEEPSA
Sbjct: 721 EALLSRALEENETLFLSLYRNFSNTLTERLPAASSAQTLQDLKSVNAADANAMDLEEPSA 780
Query: 781 MEMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
MEMDNED++PEKS LNGKTEH+YT+GENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD
Sbjct: 781 --MEMDNEDARPEKSQLNGKTEHTYTIGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
Query: 841 AEV-LSEDSHPLFRKAVYSGLRRSMDSI 867
AEV LSE++HPLFRKAVYSGLRRS+D+I
Sbjct: 841 AEVLLSEEAHPLFRKAVYSGLRRSIDAI 866
BLAST of CmoCh10G009520 vs. ExPASy TrEMBL
Match:
A0A6J1EU52 (nuclear cap-binding protein subunit 1-like OS=Cucurbita moschata OX=3662 GN=LOC111437775 PE=3 SV=1)
HSP 1 Score: 1583.2 bits (4098), Expect = 0.0e+00
Identity = 792/868 (91.24%), Postives = 827/868 (95.28%), Query Frame = 0
Query: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCF AIRRELDHYGDE+LPFLLQCVEQLPHK
Sbjct: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFGAIRRELDHYGDEVLPFLLQCVEQLPHK 60
Query: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
TPLYGTLIGL+NLENEDFVKKIV+QTH+SFQDALN+G+CH IRILLRFLT LMSSKVL S
Sbjct: 61 TPLYGTLIGLINLENEDFVKKIVDQTHKSFQDALNTGNCHGIRILLRFLTTLMSSKVLLS 120
Query: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
TSLVVVFETLLSSAAT VD+EKGNPAWQA ADFYI+CILSCFPWGGAELVEQVPEELERV
Sbjct: 121 TSLVVVFETLLSSAATTVDEEKGNPAWQARADFYISCILSCFPWGGAELVEQVPEELERV 180
Query: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
MVGVEAYLSIRRHT+DTGLSFFED GEVEKTLNEKDFLEDLW RIQ LS+DGWKVDSVPR
Sbjct: 181 MVGVEAYLSIRRHTVDTGLSFFEDAGEVEKTLNEKDFLEDLWGRIQALSSDGWKVDSVPR 240
Query: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
PHLLFEAQLVAGKSHEFG+ISCPEQPD P T S ITYGKQKY AEL+YPQRIRRLNIFPS
Sbjct: 241 PHLLFEAQLVAGKSHEFGSISCPEQPDSPSTPSGITYGKQKYDAELSYPQRIRRLNIFPS 300
Query: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
SKFED+QPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML
Sbjct: 301 SKFEDLQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
Query: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS
Sbjct: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
Query: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
NFQFIWPW+EWA+VLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIE APGLEELLPPK
Sbjct: 421 NFQFIWPWEEWAYVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIENAPGLEELLPPK 480
Query: 481 GGPNFKFATE-DGEKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVV 540
GGPNFKF+TE DGEK+EQHALS+EL NMVKGRAAAR+VISWLDE VIPKHG D+SLVV+V
Sbjct: 481 GGPNFKFSTEDDGEKNEQHALSAELYNMVKGRAAAREVISWLDETVIPKHGFDVSLVVIV 540
Query: 541 KTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRM 600
+TLLDIGSKSFTHLITVLERYGQVISRIC DQDKQVLLISEV SYW+NNTQMTAI IDRM
Sbjct: 541 QTLLDIGSKSFTHLITVLERYGQVISRICPDQDKQVLLISEVSSYWKNNTQMTAIAIDRM 600
Query: 601 MGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIV 660
MGYRLISNL+II+WIFSPEN+EQYHTSDRPWEILRNALCKTYNRISDL+KEISSLKKDIV
Sbjct: 601 MGYRLISNLAIIRWIFSPENIEQYHTSDRPWEILRNALCKTYNRISDLRKEISSLKKDIV 660
Query: 661 AAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAK 720
AAEEAVAKTQ+ELNAAESKL LVDGEPVMGENPVR KRLK+YA KAKEQE SIR+SLEAK
Sbjct: 661 AAEEAVAKTQDELNAAESKLALVDGEPVMGENPVRLKRLKSYAEKAKEQEESIRDSLEAK 720
Query: 721 EALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSA 780
EALL RALEENETLFLSLYK FSSILTERLPAASS QTLQDLKSINP NAMDLEE A
Sbjct: 721 EALLARALEENETLFLSLYKSFSSILTERLPAASSVQTLQDLKSINPTDTNAMDLEEEPA 780
Query: 781 MEMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
M+MDNEDS+PEKS +NG TEH+YTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD
Sbjct: 781 -AMDMDNEDSRPEKSQVNGGTEHAYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
Query: 841 AEV-LSEDSHPLFRKAVYSGLRRSMDSI 867
AEV LSED+HPLFRKAVY LRRSMDSI
Sbjct: 841 AEVLLSEDAHPLFRKAVYCSLRRSMDSI 867
BLAST of CmoCh10G009520 vs. TAIR 10
Match:
AT2G13540.1 (ARM repeat superfamily protein )
HSP 1 Score: 1176.4 bits (3042), Expect = 0.0e+00
Identity = 575/862 (66.71%), Postives = 706/862 (81.90%), Query Frame = 0
Query: 1 MSSWKSLLLRIGDKSPEYGTSSDFKDHIETCFAAIRRELDHYGDEILPFLLQCVEQLPHK 60
MS+WK+LLLRIG+K PEYGTSSD+KDHIETCF IRRE++ GD++LPFLLQC EQLPHK
Sbjct: 1 MSNWKTLLLRIGEKGPEYGTSSDYKDHIETCFGVIRREIERSGDQVLPFLLQCAEQLPHK 60
Query: 61 TPLYGTLIGLMNLENEDFVKKIVEQTHQSFQDALNSGHCHKIRILLRFLTALMSSKVLTS 120
PLYGTLIGL+NLENEDFV+K+VE H +FQ AL+SG+C+ IRILLRF+T+L+ SKV+
Sbjct: 61 IPLYGTLIGLLNLENEDFVQKLVESVHANFQVALDSGNCNSIRILLRFMTSLLCSKVIQP 120
Query: 121 TSLVVVFETLLSSAATIVDDEKGNPAWQACADFYITCILSCFPWGGAELVEQVPEELERV 180
SL+VVFETLLSSAAT VD+EKGNP+WQ ADFY+ CILS PWGG+EL EQVP+E+ERV
Sbjct: 121 ASLIVVFETLLSSAATTVDEEKGNPSWQPQADFYVICILSSLPWGGSELAEQVPDEIERV 180
Query: 181 MVGVEAYLSIRRHTLDTGLSFFEDDGEVEKTLNEKDFLEDLWSRIQTLSTDGWKVDSVPR 240
+VG++AYLSIR+++ +GL+FF +GE E +L EKDF+EDL RIQ+L+++GWK++SVPR
Sbjct: 181 LVGIQAYLSIRKNSSTSGLNFFH-NGEFESSLAEKDFVEDLLDRIQSLASNGWKLESVPR 240
Query: 241 PHLLFEAQLVAGKSHEFGTISCPEQPDPPLTLSDITYGKQKYAAELNYPQRIRRLNIFPS 300
PHL FEAQLVAGK HE I C EQP PP S GKQK+ A YPQRIRRLNIFP+
Sbjct: 241 PHLSFEAQLVAGKFHELRPIKCMEQPSPPSDHSRAYSGKQKHDALTRYPQRIRRLNIFPA 300
Query: 301 SKFEDVQPIDRFVVEEYLLDVLLFFNGCRKECASFMVGLPVPFRYEYLMAETIFSQLLML 360
+K EDVQPIDRFVVEEYLLDVL + NGCRKECAS+M LPV FRYEYLMAET+FSQ+L+L
Sbjct: 301 NKMEDVQPIDRFVVEEYLLDVLFYLNGCRKECASYMANLPVTFRYEYLMAETLFSQILLL 360
Query: 361 PQPPFKPIYYTLVIIDLCKALPGAFPAVVAGAVRALFEKIADLDMECRIRLILWFSHHLS 420
PQPPFK +YYTLVI+DLCKALPGAFPAVVAGAVRALFEKI+DLDME R RLILWFSHHLS
Sbjct: 361 PQPPFKTLYYTLVIMDLCKALPGAFPAVVAGAVRALFEKISDLDMESRTRLILWFSHHLS 420
Query: 421 NFQFIWPWDEWAHVLELPKWAPQRVFVKEVLDREVRLSYWDKIKQSIETAPGLEELLPPK 480
NFQFIWPW+EWA VL+LPKWAP+RVFV+E+L REVRLSYWDKIKQSIE A LEELLPPK
Sbjct: 421 NFQFIWPWEEWAFVLDLPKWAPKRVFVQEILQREVRLSYWDKIKQSIENATALEELLPPK 480
Query: 481 GGPNFKFATEDG-EKSEQHALSSELCNMVKGRAAARDVISWLDENVIPKHGSDISLVVVV 540
GPNF ++ E+G EK+E+ LS+EL VK + ARD+I W++E + P HG +++L +VV
Sbjct: 481 AGPNFMYSLEEGKEKTEEQQLSAELSRKVKEKQTARDMIVWIEETIYPVHGFEVTLTIVV 540
Query: 541 KTLLDIGSKSFTHLITVLERYGQVISRICDDQDKQVLLISEVGSYWENNTQMTAITIDRM 600
+TLLDIGSKSFTHL+TVLERYGQV S++C D DKQV+L+S+V +YW+NN QMTA+ IDRM
Sbjct: 541 QTLLDIGSKSFTHLVTVLERYGQVFSKLCPDNDKQVMLLSQVSTYWKNNVQMTAVAIDRM 600
Query: 601 MGYRLISNLSIIKWIFSPENVEQYHTSDRPWEILRNALCKTYNRISDLKKEISSLKKDIV 660
MGYRL+SN +I++W+FSPENV+Q+H SD+PWEIL NAL KTYNRISDL+K+IS++ K+++
Sbjct: 601 MGYRLVSNQAIVRWVFSPENVDQFHVSDQPWEILGNALNKTYNRISDLRKDISNITKNVL 660
Query: 661 AAEEAVAKTQEELNAAESKLTLVDGEPVMGENPVRTKRLKAYAGKAKEQETSIRNSLEAK 720
AE+A A + EL AAESKL+LV+GEPV+GENP + KRLK+ K E E S+R SLEAK
Sbjct: 661 VAEKASANARVELEAAESKLSLVEGEPVLGENPAKMKRLKSTVEKTGEAELSLRESLEAK 720
Query: 721 EALLTRALEENETLFLSLYKGFSSILTERLPAASSAQTLQDLKSINPAGANAMDLEEPSA 780
EALL RAL E E L L L++ F +L ERLP + +++QDLKSI + ++PSA
Sbjct: 721 EALLNRALSETEVLLLLLFQSFLGVLKERLPDPTKVRSVQDLKSI------GAEDDKPSA 780
Query: 781 MEMEMDNEDSKPEKSHLNGKTEHSYTVGENEQWCLTTLGYVKAFSRQYASEIWPHIEKLD 840
M++D+E+ P+K S VGE EQWCL+TLGY+ AF+RQYASEIWPH+EKL+
Sbjct: 781 --MDVDSENGNPKK---------SCEVGEREQWCLSTLGYLTAFTRQYASEIWPHMEKLE 840
Query: 841 AEVLS-EDSHPLFRKAVYSGLR 861
+EV S ED HPLF +A+ S L+
Sbjct: 841 SEVFSGEDVHPLFLQAISSALQ 844
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9SIU2 | 0.0e+00 | 66.71 | Nuclear cap-binding protein subunit 1 OS=Arabidopsis thaliana OX=3702 GN=ABH1 PE... | [more] |
Q10LJ0 | 0.0e+00 | 62.22 | Nuclear cap-binding protein subunit 1 OS=Oryza sativa subsp. japonica OX=39947 G... | [more] |
Q16UN6 | 2.7e-86 | 27.60 | Nuclear cap-binding protein subunit 1 OS=Aedes aegypti OX=7159 GN=Cbp80 PE=3 SV=... | [more] |
Q7PX35 | 2.7e-86 | 27.20 | Nuclear cap-binding protein subunit 1 OS=Anopheles gambiae OX=7165 GN=Cbp80 PE=3... | [more] |
B4GW22 | 1.7e-85 | 27.14 | Nuclear cap-binding protein subunit 1 OS=Drosophila persimilis OX=7234 GN=Cbp80 ... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1H9M5 | 0.0e+00 | 100.00 | nuclear cap-binding protein subunit 1-like OS=Cucurbita moschata OX=3662 GN=LOC1... | [more] |
A0A6J1JG20 | 0.0e+00 | 98.27 | nuclear cap-binding protein subunit 1-like OS=Cucurbita maxima OX=3661 GN=LOC111... | [more] |
A0A1S3BQ48 | 0.0e+00 | 91.24 | nuclear cap-binding protein subunit 1 OS=Cucumis melo OX=3656 GN=LOC103492007 PE... | [more] |
A0A6J1DLR3 | 0.0e+00 | 90.44 | nuclear cap-binding protein subunit 1 OS=Momordica charantia OX=3673 GN=LOC11102... | [more] |
A0A6J1EU52 | 0.0e+00 | 91.24 | nuclear cap-binding protein subunit 1-like OS=Cucurbita moschata OX=3662 GN=LOC1... | [more] |
Match Name | E-value | Identity | Description | |
AT2G13540.1 | 0.0e+00 | 66.71 | ARM repeat superfamily protein | [more] |