Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCGCGCAAAGGAAGAAGCTTGAATGCATAACTTGCTAATTCAACTGTCCCAGTGATGAGCTTCCGTCGCCGATCACTGTAACCCCTCCATTTATTCCGATCAGGCCCAATTCCATGGCGGGAACGGCCTCGGAGATCGATCCTATCTATCGTCTCCCTCTCCCTCTTTTAGGATCCGAGCCGATCCCTTCCGCCCCGAATAGATTCGCTGGTTCCTCCATCGACTGGATTCCAGATTTTGCTGGCTATGCATGGGTCGCTTATGGAGCCTCATCCGTTTTGGTGATTTCTCACTTCCCTTCTCCTCTGTCTCCCCAAGAAACCACAATTGGACCCATATTTCGCCAGGTACTGGAGCTCTCTGGCGACGACTTATCTGTTGTCAATGCCGTATCTTGGTCTCCCGTATTACCGTCGGAGGGCGAGCTCGCTGCAGCTGTGGGCAACCGGATATGGGTGTTTTCTCATGATTTGGGTGCTTCTCGAGGTACGCGACAGTTTACAGTTATTATGATTTCATTACTGTATGTTCGTGGGAAAATGGGCGAAGGAACTGATTGGGAAGTGACATATTGGATTTAGTGCCAATTAGTTCAGCGTGTCTTTACTAGGATAGTTATGAAGTTAATTTCCACTTGGTTTCGCCAACAATGTGGGAAATTGAGGTAATGGTGATGAATACATGTGTTCTAACGAACTTTGGTGAGGAAATAGTTGGGTTCGCTCATTTAAGAGCGAGGCTGATAACATTGAATCGTTTCCGTTCAATTTTGGACACCTTTAAATTATATTGGTTTACTCATATTCAGGTTCTTTTTGTTGGAGGCAGAATTCCGTGCTTCTGCAATCTTTGAAGGTTGAGGCCATTCAGTGGACGGCTGCAGGAGATGGAATTATTGCTGGTGGAGTAGAGGTTGTTTTGTGGAAGAACACTAACAGGTCCTGGGAAATTGCTTGGAAGTTCAAACCGGATGTACCCCAAACTCTTGTTTCCGCAAGCTGGTCTACCGAGGGACCATTTGCAACGGCATCTCAAGCAAGGATATCGAAGATGGATAACACGTTTATTGATAAAGCTTGCCGATCTGTGCTGGTCTGTCAGAGTGAAGGGGAGTACGGACATGTGAAAAGCGAGTTATGCCATCCTCTACCTATAACTATGATTCAATGGAGGACTTCAATTAAAGAGAAAGGAAGTTCCAAGAATACACCAAGACATGTGCTTCTGACATGCTGCTTGGATGGAACTGTGAGATTGTGGAGTGAGACTGAAAATGGAAAAGTAAAAAAATTTAGTAAGGATGTAAATAGTAAAAAGTCATTGAGAAGGCGTTTTTCTGTTGCTGCTGTCATTGAGGTAAATCAGGCATTGAATGGAACTCTTGGCACGGATTTATTTGTAACATGGGCAACCGAGATTAGAGGTATGTGCAAACCTTTTGAATTAACTAAGAAAGTTTTCTCCGAAGGATTCGAACACAACAGGGCTGGAAGCTGTGAGTGGCTAATAAGTTTAGGTCCGGGATCATTGGTTACTTTTTGGGCTGTACATTGTCTTGATGACGTATCCCCATTAAGATTCCCTCGAGTCACATTGTGGAAGAAGCAAGAGCTCAAAGGACTTGAAGTTGGACGGCATTACATTGATGGTTGTACAAACTTGAGTAACAAGTTTCTCCTCAAGAAAGTTGTAATCTCAAGGATTTATCCGTCTGGTTCCCCAAGTATGTGTTCTTTGATTCAGTTGTTGCCTTGTAATTCCTTGGTGTGGTCAATTTTATCTTCTCAGACATCAACTGACGTAGGGGATGTATCCTTTGACAAAAAAAGGTCAGACAATTTCTTCTCTCGTTCAGTTACTACCCAATTGAATTTAAGTGGTCATGCTGGAAAAATCTTACACGTTGCTGTACATCCATACAACTGTGAGGTCAAAGTAGCTGCTTCTTTGGACTCTAATGGCTTGCTTCTTTTCTGGTCACTATCTAGTATCTCCACCTGTGTATTAGGCCCTCCAACACTTAATCATACCTGGGAACTTTGTGGAAAACTCGTAACTCAAGATTCATGCTCAAAATATACAAGTGTGCAGTGGGCACCATCAATATTGGATGAAGAACTGATTCTTCTGATGGGACATGCGAGAGGAATTGATTTTTTTGCTGTTAGGATTAGCCAAAGTGATGGAGAAAATACTGAATGTCACTACTTATGTACCATACCTTTCACTGGTCATGGTCCTTTTGAGAATGGTCCAACTGATATATTTTCTATTTCTTTGCCTTCTGATTGTAATACAACATATAGATTCAATAAATTTATGTTAATAGGGCTATGGATGGAAGGATTTCAGGCACTATCATGGGAAATCACCTTACATACTTATGATATCTTTGGGACTGGAGTACATTGCAATTGTGGTATTGATGATGAAAATATAGCTGAGCTCAGTATATTGACATTTGAAAGTTCTTTTGGAAGCAAAAAGTATTGTGTTAGTATAATCCCTTGCTCATCACAGTTCCCAAATTCTCAAATTCATGAGCAGATTACAAGTTTTGCCGTGGTGCACCAAGGAACTTTTGTTCCTGTGCAGCAAAAATTAACTTCTTCAGGTGAACCATATACCCCTGCATATATTATGGCTACTGGCTCTGCTGATGGAAGTTTGAAACTTTGGAGAAGCAATGTAGGCAAACCATCGATCTTTCACGTGTCTTGGGAGCTTGTTTGTGTTGTTGTCACTCATCAAGGCCCCATTACTGCATTGTGTTTGACCGATTGTGGACGGAAAATTGCAACAATTAGCAAGAACAGCCATAAACCTAATATCAGCAACGTTCGCTTATGGGAGCTTGCATGCCTTGGTGCAGGGACCCTTTTGTTTGAAGATGAACTGTCCTTTGAAAGCAGTATCATTGCAGTAGACTGGTTAACTTTAGGAAATGGTCAATTTTTACTTGGAATTTGTTTGCAAAATGAATTGCGTGTGTACTCCCTGAAGCATTTTGGTCAGACCTTGTCAGGCATCACAAAATCTTTGGATACTGAGACTTGGATCTGCATTGGGTTTGCTCGTACTTTACCATCTAATTGTGGCTTCCTCTGGGGGCCGAAGAGCACAGCAATAGTTATACATGATCATTACTTTTGTATAGTTAGTCCATGGCTGTTCCTTGGGGATAAAAACCATGATGCTATGTGCAGCCCTTATTACATTGGAGAAACTAAAAATCACCATGTCAATGGGACTGATGTCGCTGTTTTTGCTGATGAATGTTGCAGTATCAGAAAATTATCAGATGATAATTATGACAGCAAACGTAGACCAAGATCTCTCACAAACATTCATGCTGAAACCAATATTCTATCGAATAGTTCGTATCCACGGGGTGCACAGATGAAAAGTACAACTTCGCTTGGTCTTATAAGCATGCCTGATATAGCTGATAAACTGTGTGGATCACTGTCCTCTTTTCATCCTCATGCTCTCCTCGTTAATGCCTGATATAGCTGATAAACTGTGTGGATCACTGTCCTCTTTTCATCCTCATGCTCTCCTCGTTAATATATATTCAGGTGATTATATAAATTACTGCCTATCGGCAAGTTTGGTTGTTTTTTTTTTTAAGGATTTTCCCTTAAAAGCTTCCAACAATAATTCATTACGATCCATCTTATTATANTCAGACATCAACTGACGTAGGGGATGTATCCTTTGACAAAAAAAGGTCAGACAATTTCTTCTCTCGTTCAGTTACTACCCAATTGAATTTAAGTGGTCATGCTGGAAAAATCTTACACGTTGCTGTACATCCATACAACTGTGAGGTCAAAGTAGCTGCTTCTTTGGACTCTAATGGCTTGCTTCTTTTCTGGTCACTATCTAGTATCTCCACCTGTGTATTAGGCCCTCCAACACTTAATCATACCTGGGAACTTTGTGGAAAACTCGTAACTCAAGATTCATGCTCAAAATATACAAGTGTGCAGTGGGCACCATCAATATTGGATGAAGAACTGATTCTTCTGATGGGACATGCGAGAGGAATTGATTTTTTTGCTGTTAGGATTAGCCAAAGTGATGGAGAAAATACTGAATGTCACTACTTATGTACCATACCTTTCACTGGTCATGGTCCTTTTGAGAATGGTCCAACTGATATATTTTCTATTTCTTTGCCTTCTGATTGTAATACAACATATAGATTCAATAAATTTATGTTAATAGGGCTATGGATGGAAGGATTTCAGGCACTATCATGGGAAATCACCTTACATACTTATGATATCTTTGGGACTGGAGTACATTGCAATTGTGGTATTGATGATGAAAATATAGCTGAGCTCAGTATATTGACATTTGAAAGTTCTTTTGGAAGCAAAAAGTATTGTGTTAGTATAATCCCTTGCTCATCACAGTTCCCAAATTCTCAAATTCATGAGCAGATTACAAGTTTTGCCGTGGTGCACCAAGGAACTTTTGTTCCTGTGCAGCAAAAATTAACTTCTTCAGGTGAACCATATACCCCTGCATATATTATGGCTACTGGCTCTGCTGATGGAAGTTTGAAACTTTGGAGAAGCAATGTAGGCAAACCATCGATCTTTCACGTGTCTTGGGAGCTTGTTTGTGTTGTTGTCACTCATCAAGGCCCCATTACTGCATTGTGTTTGACCGATTGTGGACGGAAAATTGCAACAATTAGCAAGAACAGCCATAAACCTAATATCAGCAACGTTCGCTTATGGGAGCTTGCATGCCTTGGTGCAGGGACCCTTTTGTTTGAAGATGAACTGTCCTTTGAAAGCAGTATCATTGCAGTAGACTGGTTAACTTTAGGAAATGGTCAATTTTTACTTGGAATTTGTTTGCAAAATGAATTGCGTGTGTACTCCCTGAAGCATTTTGGTCAGACCTTGTCAGGCATCACAAAATCTTTGGATACTGAGACTTGGATCTGCATTGGGTTTGCTCGTACTTTACCATCTAATTGTGGCTTCCTCTGGGGGCCGAAGAGCACAGCAATAGTTATACATGATCATTACTTTTGTATAGTTAGTCCATGGCTGTTCCTTGGGGATAAAAACCATGATGCTATGTGCAGCCCTTATTACATTGGAGAAACTAAAAATCACCATGTCAATGGGACTGATGTCGCTGTTTTTGCTGATGAATGTTGCAGTATCAGAAAATTATCAGATGATAATTATGACAGCAAACGTAGACCAAGATCTCTCACAAACATTCATGCTGAAACCAATATTCTATCGAATAGTTCGTATCCACGGGGTGCACAGATGAAAAGTACAACTTCGCTTGGTCTTATAAGCATGCCTGATATAGCTGATAAACTGTGTGGATCACTGTCCTCTTTTCATCCTCATGCTCTCCTCGTTAATATATATTCAGGTGATTATATAAATTACTGCCTATCGGCAAGTTTGGTTGTTTTTTTTTTTAAGGATTTTCCCTTAAAAGCTTCCAACAATAATTCATTACGATCCATCTTATTATATGCTTTTATGAGCACAGAGAATATTTTATCCATTTATCCATTTGTTTTATTTTTATATCTGATAAATTTTACATTTTTAGTCTCGTTAATTTTATTACTTTTGTTGACTTGCCACCAGCCACAGCTAACTAAACATTTATTTTCCTACTTTCTGGTTAAAGCATGAAACTGTTAATTTCCATGTTTTCTCGAAGAAAACTATAATTCGTCTTTAAAATGGTCAGATAAAATGCACCATGAGGCTGCAAAATAATCAACCATTAGCAGTTTAATTTTAGGTAAATTTCTGTAGAATTTTTATCATCTCTCTGTTCGGTACAGTTAAATTTAGTGGTTCATTCGGGCTTAAGATTTGCAATCCTCAATTTGAATAGGTAAGTGGAAACGTGCATATTCAGCTTTGAGTCATCTTATTGAGCATCTTTCTTCTGATAAAAAGAGCTCTGCAAACCCAACCAATACTATTCCGGAGATCCTTTTGTCAGATTATTTTGAAGGAGTTGCGAAAACTTCTACTGATAAAGAAGTTCAGTGGAGTATGAATGGCTTAGCATCCCAATTTAAGGAAGGTGTTTCACCATGGACCTTCAATTGGGATTCTATTAGTAACGACAGCTCATTCATTCCGTCCTCTACGAAATCTGAATTCAGCACCTTTATTGAGCCTCTTGAGAAGTTCTATGAATCAGCAGGGCTAACCAGCATGGAGAAGACAGAAACTCTAGCAATTATAGATCTTCTTGATGAAATCAGTAATAAATCTTCTGCATCTGCCTACGAAAGTCTTGATGAGCCCGGGAGAAGGTACTTTACAATAATGCTTTTGTAACTTCTCAATTACACTTTTTGTTAAACGGCGTGTGGAAACCCTTGCCTCATTTTCACTATTTGTGTTTTCTTATATATGCCTCACTGTCACTATGCCCTGCAGATATATATATGTGTGTGTGTATATACATATATTCTTATTGTGAAACCGTGGATTTTCTTTTTCCAGTTTTCAATGTTTCAATTGATTATAAGTTCATGTACAGTTTATTTCATTAATTTTTGAACATTTGTTACTTTTCTTTTCTATATTTTTCATTTGATCGTTCAAGTAGAGTATGGTACCCTTGTAATCGTTAGTGTATAGTCTATACACCTTAGATTCCATGTAAAGAGATCCCTGATTTGCTAGTTTGACGTGTGAAATTTTATTAGCACTTTGAGAATTGATTGTGGATCAAACGAAGGATATTGCCCTTCAAGTGTTGTGCCTTCCACAGCTCTATGCCGTGTTACCAAAAAAATATTGCTGTATAAGAGGTGTGGGAGGAAAATGTTTGTTATTAAAAGATTTTGACTAGAAGAACTAAACAGAACTCAAATTTGCTAGTTTAGGGAGCATATCCAATTGTTGAAATGGAGTACAACTTTGCATAGAGGAAGAGAGTCTTAAATTTTGTAATTTCAGTGGCCGAGTTTAATTTTAGTTAAACCCATTTTCTTCAGATAATTCTTTGACCAAGAACTTCACCAAGAAATTACTGGACCTCCAGCTGTTGAAGTCTTGTATCTAGAATATTTCCTCTCAACGACCACAAAAACCGCACGAAGATTCAATTATAGCTCTTCCATTTGCTCTTCAAGGGCGTGTTAACTATTAACATGACTGAAAAAAATTCACTGTAAATGTCCAACCATGCTTCCAAGTAACAGTTATGTGTTCTATGTGAAAGCAAAGGGTGACCAAGCACTTCTACTGTTTCGTTGCTCTTAAGCAAGAATGTACATTTCTGGTTGTATATTCATTTGCAACCCAGGAGTATACAAAACACTGAAAAAGAAAAGTATAAATATATATATATATATATATATATATATATTAATCCGGAAAAAGGTTATGAAATTGATTGAAGGGATGGAAAATCAATAAGAATTACAAAAAGCTTTCCCGAGTGATTACAATCTGGGTCAATATGTGGTCCCATACTTAAATATAAGGTTATTGTATTGGGGAAAAATCCAGTTAGAACAACAATTTATATATAAAAATAAACCGTAGGCTCTTCAGTGACTTGGATATGGAGGGAAAAGCTCCTGAATTCCTAATTTGACGAAGTTTTGTTACTCATCCTGGCGTACTTTATCTACCTTTTTCTTTTTTGTAATTTTACTCCCTTCTAACTAGTTCAAAATGGGGTAGTATTCTGTAGTATTCAATCATGTAATTTTCTTGTTCCATGTTGTAGTGATTATTTATTTTCATTCTCTAGATATTGGATTGCTTTGAGGTTTCAACAACTGCGATTTCTTCGGCGTGATGGTAGATCAGCATCTCTGGAAGAGCTGACCATTGATTCAAGATTGATTGGATGGGCCTATCACTCTGATTGTCAGCAAAATCTATTGGATTCGGTTATCTCTAAAGAACCAACATGGCAGGAAATGCGAAGTTTGGGTGTTGGAATTTGGTTTACTAATACAACACAGTTGCGTGCAAGGGTAATACTTGGAGATCTTGTATATTAGTATAAATTCTTTAGCTCGACTTAATCATAGAAATCAAATTGATTCTTCAATACTTAGGTATTTGTCTTTCTTTTTGTGAAAACTCTTTATATAAACATCAGACTGGGTACTGGTTAATGTAGATCAAGGCAGTTGTCCTAATTCATATAAAAAAAATACATTTCTTTTCCTGCAAAAATGAATGCAACCTAAAAATCTTTTAGCCATCCACTTGAAGTAGATTATGATTTTCTTGTCTACTTAAATGAGGTTTGGCCATGGGTTTTAGAACACCATTGGATAAATGTTTTCAAATCCGACTGTATTGAAAAAAGTATTTACCATTAGTTTATCATCCTGTTGATAAGTTCATTTGAGTGGAAACATCATGTCTTTATGAGGAAAATTGAAAAGGCTTTATCTACAATTGATTTAACTCCCTTTCTTGTTTTTCTTTTTAATTGGTCCATGTTATATGCAGATGGAGAAACTGGCGAGATCCCAATATCTGAAGAAAAAAGATCCCAAGGATTGTATGCTTCTGTATGTCACACTGAATAGAATCCAAGTTTTAGCTGGCCTTTTAAAAATTAGCAGGGATGAGAAAGATAAACCCTTGGTGGGATTTCTTTCACGCAATTTTCAGGTGCTCCTTTGACATGCCATCTGGAAAATGTCAAATTTATGTGCACGATCTTTCAAGAGAACAGTTTTGGAGGGTCGTTTTCCCCTTTAAATCATGTTAATATGTGGTTTCATTTAATTTTCACTTTTCAGGAGGAGAGAAATAAGGCAGCAGCTTTGAAGAATGCCTATGTTTTAATGGGGAAACATCAACTGGAATTGGCTGTTGCTTTTTTTTTGCTTGGTGGTGATACTTCTTCTGCTATCAGAGTCTGTGCGAAAAATCTTGGGGATGAGCAGCTTGCACTGGTAATTAGTCTTTTAGTTGAAGGGCGTGGTGGACCTCTTCAGCAACACCTAATAACAAAGTTCATGCTTCCATCTGCCATTGAGAAGGGTGATACCTGGCTTGCAAGCATTCTTGAGGTGCATCATTCATTTTAAGAAATAAATGCAAAATACATTCCCATAGATGCGAATACAAATTTTTCCTATCATTTAAAAATGATTACTTCTTCACCATATGATTTAATTATTGCATAACACACCCCTCTTACCGCCTTGACATCTTAATAAACGTCATGTCACTCCTCTTCTTTCTTACTTTTTCTTAAAAAAACCTATGATTTAATATAAGTTGGAAAGGCACCCCCTCAAGTCATTTCCACGTTAGCACGAGCGATTGAATCGGTAGATTATCGAACCACATAAATAATAGTTAATGTTTAAACTGTATGTGTTTTATTTGCATCTACCACTTCAAACAAATGATGAATATGAAAATAGGACAGCAGAAGTGAAATATATGTGTTTTCCTTGGATTTGGTATACCAAGAACTTAAAATTTCCTCTCTTTCACTTGCAGTGGGAATTAGGAAACTACTCTCAATCTTTCCTGAACGCGCTTGGTTTGGAGTCAGAGTCAAATTCTGTTACTGGGATACCGTTTCTTTCAAGTAGACATATTTCTTTACAAGACCCAAGCGTTGGTTTGTATTGTCTATTGTTAGCAACCAAAAATAGCATGAAGAAAGCAGTTGGAGAGCAATCTGCTGAGGTCCTTTGTCGGATTGCAACCTTGATGACGGCTACTGCTTTAAACAGATGTGGTCTTCCTGTAAGTTTTTATATTTGGTTAATGATGTTATTCGGATGGTTTATCTGCTTAAAGAAAAGGTCTATAACAATTTTCAAGCAATTTTTAAGCCAGTCACGAGTAGAAATGACAATTTCTAAGCAATAATTGCATATTTTATCAAGCTGTTACATAAAAAGAATATTTTTGTTAGAACATAGTGTCCATTGAAATTCATAGGGAAGAAGTTCTAAAAATGATTTAAAATCCATAAACCATACCCAATTCATGAGAAATTATTAAAGCTTGATAACAATCTTCATAACTCTTTGCTGAAATCAGCAACATATTAATGGAAAACCCAAAATCCAATTGAGCAATCCAATAGATGTAACAATATGTGTTAAATTGAAGACTTCAAAATGGGTAAGGAGACCAAGGAATAGGATGATTCAGTATCATAGATATTGTCTTGGTCCTAACTGTAGCATGTATCTGTACTAAAAATGATTTTCACTTCTAAATATGCACATGACGAACTTATTATTATTATTATTATTATTGAAAAGATAACCAAGAGAAAGCTTTATTTATGTTAAATCACCGATCAATAAAAAATTTTAAGGTGATGGGTTATAGTAAATTTAATTATATCAACACTCCAACACTCCCTTTCACTTGTGAGCTTGGAAATTTATAAAAGACTCAACAAATCAAAATCAATATTGATTAGAGAAGAAATGACATAGGTTTGAACTTGGGATCTTCCACTCTAATACCAACTTGAATTTAGGGAAGAAATAAAGATATTGTATTTCTTCTATATTCAAATCAAATACAAGGTCTTCTATATAGAAGAAAGACCATCTATCGATATAGAAAGAATATTACAAATATAGACTCATAATGGAAAAATAATAATACAAAAGAATAAAGGAAAATCCAACAACTTTCTTCTGAAAAGTTCCATACCATACCTCATGTACTTATTGTGTTTGCTTTTCTGGCGTTGTCATTTGATTTTAAGCTTGAAGCTTTGGAACAAATGTCAACCTGTGGAAGCATTACTGAAGTTTCAGATGGGACTAATGGAGTCGATATTCTGTGTTTTGAGACTATAAGAAAAATCTGTAAGCAATCGCCCAGAGATTCCTCCAGTTGGCTCTCTGTTGAATTTGCAGTCCATCTGGAGTATCGAGCAAAATTGGATTTGGCAGTTCAATACTTCTCAAAATTGATAAGGAAGCATCCAAGCTTTCCAACTATAAATTTAGAATCTGTTGGATGCATGGGTTGCTTGAAGGAATATGAGATGGATTATGAGAAATCTCTTGAAAGGTTTCAGCGTAAGTTAAATGTAGGGTTTGCACAGTTTGAAATGAAGTTCTCGTTGCTTCCTGCCTCTCTTGTTAGTATGGTAAGTGGATTTCTGAGTGTGATATTCTCTCTTTAATTTATTGTTTCATATGTCAGATACTTCCACTCTTTTCTCCAAATGATATTCATTCCCTGTCCATTTTTCTTATCAGATGTTAGTCTTTCTGTGTAATGTTGGGCTACAATTTATTGGATATGATATATTTCATGGATTTGCTTCTCAAGAATGCCCGGATGACAAGAACCAGAAGATTTATACTTTCCTCTTGCATCCTCTCGTGCACAAGTCACTACTCAAAACAGCACAAGAAATTTTATTTTCAGCTTCACGGTACACTATTGCTTGCAGTTTATCTTTTCACAAAGGCGAAACAGGGTCAAAATGTTTCGATACTTGGTGGTACTACCTTCAAGGTCTCTTACTATCCTTACAGGGTTTAAGAGCTGCTTTGAGAATTACACATGGTTCTCTCAAAGACGATCTTGTTTCCAAGCTCCTGACCATTCTTGATTTGGTTGAATATAACTTATATTTCACGTCTGCTTGGCTATTGAGAGACTCAAAATGTCTCCTTAAGATGCTGCAACCACTCTTGGCAAATGCACGATCTCCTCATGACATTGACGTGGAACATCTGAAGCAACTCCTCCCTCAGATTGGAGAGTTGATAGCTCAAAATTTATTGACCGATGTAGATTATAACCATCAGATTTTGGAAGGCATGCCTAATGCACAAAGTGATGACATTGTGCATTCAATTCCAGGAGATGAAAGATGGCACATTATTGGGGCTGTTTTGTGGCATCACATGTCCAAATTCATGAAACATAAGTTGATTACTTTAACTAATGCATCTAAGGAGGGTAGCTTGAGCAGTATCATTCTTGGGAACCTTGATACATGGGCTCAAAGTCTTTCAACCATTAAATCTGATTGGAAAGCCATTTCAAAGGATGTGATTGAATTGGTATCAGTGAGTCTTACTGCTCTACTGACTATCGTCCTTGCTCAGGTTTCTTCTTATCAACTAAAACAATTGGTATCATCTCTGCAATATAAATTAGATCAGAAGCTGTACGTGGCAACTGCTGTTTGGTTTGAACAGATTTGCCAGTCTCTATCTAGTCATGACAAGGGCCATACTGATGAGATTTACGATATGGATATGTGTATCAGAGGTGAATTTGAAACATTATGGAACGTTACTTCCAATCCCAATCTAATATCAGACTGCTTTACACATGAAAAGGTTCATATGTTGCATTGTTTTGATCGTAAACTCTCTGAAAGATGGAGTGATATTTACAATGGCATCACAAGGAAAGAGCACAATTGTACTCATGAAGCTGCACATATTAGTAGGTCTGTCAGCGATGCCACTGGATCACCTGGTAAATTACTTCGTAATGGGAAAACTCTTGTCAGATCTGACAAGGAATTGGCCACCCTTGATGATGCCATGCCTTTTCAGAAACCTAAAGAGATATATAGGAGGAATGGAGAGCTTTTAGAGGTAATTTTCAATCCAATCGTATGTTGTGGCCACTAACCCTTTTCTAGGATTGGAAACTTCTGCTTCTTTTTACTTAAAATTCTCTTTGATAATATATCTTTCTTATTTAAGACGGCAGAAATTGATTTTGTTTTGAAAAGTAATTTGCATGTTCCTTATAGTAAAAAGATATTCAAAACTTTGATATAACGTTCTGAGTGTATTTCTTTATTAAAGCAGGCATTGTGTATCAACTCTGTTGATCAAAGACAAGCTGCAGTTGCTAGCAATAAAAAGGTCAGTTCTCATCCTCATATTATGCTTTTTTCCTTACTACAAATCTCCCCGATGTTTAGTGTGGAGTATTTGAGTTCATCTTCAATTTTCTCAATTATTAGAAAATGTGCTACCTTACACAATCGTTTTGGCTTCAATCATTTAGCCATTTTGAAGTTGATACTTTCTTTGTTTGCTAATTACATAAACTATGTGCACAATGTGGCTATTATGGCGAGGTTATATTGAAGTATGCTTGTAAAGTATGCTAACTTCCTATTTTTTTTTTTTTCAAAAAAAAAAAGAAAAGAAAAAAAAAAAGAAAGGTTTGAATCATCGAAATTTGAAATTTGCTAAAGATCATCAAACTTCTGGCAGACCCTCAAATTAGCCTTCATAGTGTATTACTCTAATTTCACGAGAGGGATGACATAGCAAGTCGAACCTCTTATACTTAATTATTATACTGGGAGCTCATACCTAAAGCTACTTAACACTTCTGGGAAGAAAAAAAAAACTGTGCGTGATGGAAAGGAGTTTATTTGTTAGCTTAAAACTAGGTTGTGTATTCATCTGTTTTTGTTCTCTCTACTTTCATCTGTTTTTTTCCAAATCTGTACAGTTCTTCTGTTTTGTTATCTTAAAAACTTATTGACGTGGAACCTTACTTCGTGGCAAAGGTTTATATGCTGTGTGATTGAAATCTTTGGAAGACTCAATAAACGTATTAAGAGAATAACTTTTGATGACCTTTAAAAGATTCTTTAGTGAGTATATGGGTTCACATAATCGGTTTGTGAAGATTTGGTAAAATTGATCTTTTGTCAAATGCAGGAAGCATATGATTGCATTAACTTTTTTTCCTTTTTTCTAAGAAGTAATATTTTAGATATGGTGCTTAAAGGGCAATGTTTGCAGATGTACGTAACCGAGTATTTATCCAAGTTGATGACCAACTGTTCTCTGGCTTGTATCATATGTATGCAATCATGTCTATTAAAGGTGTTTGATTCCAACTTGGTTGCATTCAGGGTATAATTTTTGTTAGCTGGGAAGATGGGATGGCCTCCAGAGATGATGAGGATTATATCTGGTCAAACTCTGAGTGGCCTCTAAATCTAAATGGGTGGGCAGCCTCTGAATCAACACCAGCTCCAACGTGTGTATTTCCTGGTGTTGGTCTTGGGAGCAGCAAAGGGGCACACCTTGGGTTAGGTGGTGCTACTCTGGGTGTAGGATCATCTGTAAGGCCTGGGAGAGATTTGACTGGAGGTGGAGCATTTGGTATTTCGGGTTATGCAGGTGTTGGTGCTTCTGGCTTAGGTTGGGAAACACAAGAGGATTTCGAGGAATTTGTAGACCCTCCAGCTACAGCAGAACACACAAATTCGAGGGCTTTCTCCAGTCATCCTTCTAGACCTCTTTTCTTGGTTGGCTCTACCAATACACATGTATACTTGTGGGAGGTATGAACTAATAATATCTCTTGCTCTAGAGTAGTGGATTTGTGCAGTAGAACATACGAGTAACTCCAACTTTATATTTTAAAATTTATTTAATCATTTACCGTCAACTGTTCTTTGGATGGTGTAAATTTAAACTAATGCAGTTCGGGAAGAACAGAGCTACTGCAACTTATGGTGTCTTGCCTGCAGCAAATGTGCCTCCACCATACGCTCTAGCCTCAATATCATCTGTGCAGTTTGACCAATGTGGACACAGATTTGCTACTGCTGCATTAGATGGGACCGTATGCTCATGGCAGTTGGAGGTTGGAGGAAGAAGCAATGTCCGTCCAACGGAATCTTCTTTATGCTTTAATGGCCATGCATCGTACGTAGTTTTACTTTTGAGCCTCAAAGAAATACCTATTTAGTATCTACTCTTACACATTTACTAAAAATATTTGGTTTTACATATAAAATTGAATCTTCAAGCATGTACATTTCATACTCAGCTTAAGTTTGCACGCTGTTACCCAGGAGTCAATGTAATTTGTGGAATTAAAATTTACTGAACTGGATATCCATGGCCAAATGATCTTTATTTGGTGATATTGCTTCTGTTTCTTTCTTCCTGTATAAGCGTTTGAGTTCACTCTAACATAGTTTTTCACTCTTAGGTTTCAGATTGGAAGTTAATTATTTAGAACTTCTAATATCTTGGTATATTTTCATTGTTCAACTTGTTATTTGTCAATTTGCAGGGATGTCACTTATGTTACTTCCAGTGGATCAATAATAGCTGTGGCTGGATATAGCTCTACTGCAGTTAATGTGGTCATATGGGATACACTGGCTCCACCTAAAACTTCCCAAGCAGCCATTATGTGTCACGAAGGTATCTTATAAGTCATATTTAACTTCATAAACGAGACATTTTACCTTAGTTATCTTTCAAATTCTTTATCTTCTAATGAATGAAGCAGAATAGTCATTGTGTAAAACGTATGACTTGCCTGGTTGATTAAAAAGAGCATAAAGCCAAACCTGTCCACTTCAGGGAGATAGATTTAGTATTATTTCAAATCGGAGAATTTCTTTATTGGTCGGCTTATAGGACGAAGTTTTATAAGACCAAGAAATTATATATTTTCCCCAATACTTTATCAGGGCCCTAGATTGTTTGTGGACTTGTACGACACACTTTAACGTTTAGATCTTCTGCTGATGGTATACATACTCTGTTGTCAAAGCTGGTATTTCTTTTTTCTGTTAAAATTTTGATTTAGATATGGGCCTGTTCATAGTACTAAGCATCTTCCTTCTTTTAATGCGAAGGTGGTGCTCGCTCTATTTCTGTGTTTGATAATGAAATAGGGAGCGGTTCTGTTTCTCCCCTCATAGTTACTGGTGGCAAAGGTGGAGACGTGGCAATTCATGATTTCCGATACGTAGTCACTGGTAGGACTAAGAAACAAAAAAACTGTTCAAAAGATGAAATGATTAGCAATGCTTCGAATTCTGACATGCCGAGTACCGTTGGTGAACAAAACTTAAATGGAATGCTTTGGTACATACCGAAGGCTCACTCTGGGAGTGTCACTAAAATATCTTCTATCCCAAATACAAGTTTGTTCTTGACCGGAAGTAAAGATGGAGACGTAAAACTTTGGGATGCAAAAAGAGCTAAATTAGTGCATCATTGGCCAAAGTTGCATGATAGACACACTTTTCTGCAACCAAGTTCCCGAGGTTTCGGTGAAGTAGTTCGGGTAACTTCTTTTCTTGTCGTTCTGTGAAGTACTAGTAAAAATTCAGTTCTTACCATGTATGAAATTTATATTTTATTAGTATATTTGCTGGTAGACATCAAAAGATATACATTTATAGAAGCAAACTCTTTCACAATGCGAAAGCTCATTGAATTATGTAATGCTACAGGCCGCTGTTACTGATATACAAGTTATCTCAAGTGGATTTCTTACCTGTGGTGGAGATGGCTTGGTAAAGCTGGTTCAGTTTGGATGATCTAATACATAACAGGCACAATAGTTTGGTTACTGACTGTAAAAACGTGGGAATCTGAGATGTGCCAAAGAGGTCTGGCACAAGAACTGTGTTGTGTGTAGGACTCAGCATGTTTAAGAAGACATTCGGTGCATAAGCTTAAATCAGTCCTGTAATCTTCAAAGTCGAGTTCGACATGCTGTGACCGTATGTAGTAATAGTTACTTATGAAGTGAAGAAGCTAGTTTTCCTCGCTGAGGGATTACAGCAAACTTGAGTCTATTATCTTGTTATGAGGTAAAGGAAAGGCTCCTTGGCTATGATTGTTACAAACCCAAACAGAATAGAAGTAAGGATGGACTACAGGCATCATCCAGACATACATGGAGTGAACAAAGAATTTTCTGGTTAGTGCTATAATTGGTGTTCTTCTCTCTTTCAATTTTTAGCTTTTCTTTTCGTTGAGGTTAAGGATTCTGCTATATACCATAATGTTGGATGTTTGAGTAAATTTTTTGAGATGTGAATAAACTTTATTTGTCCTGGTTTCCCAGTCATAAAGGAAAGTAAAACTGTTCTTTTGTGTGTCATTCCTTTCACTGTACATTCTGTAAGAACTTTTGCAGAGTTGAAATTGAATGTGAATTATCATTTTTGGCTTCCTCTTTTTTGTCTAATCATATATTACTGCTTGATCGACGAACGACAGAATATGATCGATTGGAACGACTGGGCCTCTCGTCAAGCCGACTCATTCCTCTTTCTCAAGAGAGACTTTGCTTGAGCCTCAAGCTTCTCTAACTTAAAGCTCTATTCTTTGTTTCATGATATCCACAAATGTTTCGAAACTGTCGTTGGAGTAAATCTAATAGAATGTTAGATGAAAAAAATTTATGTGACGATGAGTTGAAATGATTTAGTGACAATCGTGATACTTGTTTGGATTAAGGCTCCATTTGGG
mRNA sequence
TTCGCGCAAAGGAAGAAGCTTGAATGCATAACTTGCTAATTCAACTGTCCCAGTGATGAGCTTCCGTCGCCGATCACTGTAACCCCTCCATTTATTCCGATCAGGCCCAATTCCATGGCGGGAACGGCCTCGGAGATCGATCCTATCTATCGTCTCCCTCTCCCTCTTTTAGGATCCGAGCCGATCCCTTCCGCCCCGAATAGATTCGCTGGTTCCTCCATCGACTGGATTCCAGATTTTGCTGGCTATGCATGGGTCGCTTATGGAGCCTCATCCGTTTTGGTGATTTCTCACTTCCCTTCTCCTCTGTCTCCCCAAGAAACCACAATTGGACCCATATTTCGCCAGGTACTGGAGCTCTCTGGCGACGACTTATCTGTTGTCAATGCCGTATCTTGGTCTCCCGTATTACCGTCGGAGGGCGAGCTCGCTGCAGCTGTGGGCAACCGGATATGGGTGTTTTCTCATGATTTGGGTGCTTCTCGAGGTTCTTTTTGTTGGAGGCAGAATTCCGTGCTTCTGCAATCTTTGAAGGTTGAGGCCATTCAGTGGACGGCTGCAGGAGATGGAATTATTGCTGGTGGAGTAGAGGTTGTTTTGTGGAAGAACACTAACAGGTCCTGGGAAATTGCTTGGAAGTTCAAACCGGATGTACCCCAAACTCTTGTTTCCGCAAGCTGGTCTACCGAGGGACCATTTGCAACGGCATCTCAAGCAAGGATATCGAAGATGGATAACACGTTTATTGATAAAGCTTGCCGATCTGTGCTGGTCTGTCAGAGTGAAGGGGAGTACGGACATGTGAAAAGCGAGTTATGCCATCCTCTACCTATAACTATGATTCAATGGAGGACTTCAATTAAAGAGAAAGGAAGTTCCAAGAATACACCAAGACATGTGCTTCTGACATGCTGCTTGGATGGAACTGTGAGATTGTGGAGTGAGACTGAAAATGGAAAAGTAAAAAAATTTAGTAAGGATGTAAATAGTAAAAAGTCATTGAGAAGGCGTTTTTCTGTTGCTGCTGTCATTGAGGTAAATCAGGCATTGAATGGAACTCTTGGCACGGATTTATTTGTAACATGGGCAACCGAGATTAGAGGTATGTGCAAACCTTTTGAATTAACTAAGAAAGTTTTCTCCGAAGGATTCGAACACAACAGGGCTGGAAGCTGTGAGTGGCTAATAAGTTTAGGTCCGGGATCATTGGTTACTTTTTGGGCTGTACATTGTCTTGATGACGTATCCCCATTAAGATTCCCTCGAGTCACATTGTGGAAGAAGCAAGAGCTCAAAGGACTTGAAGTTGGACGGCATTACATTGATGGTTGTACAAACTTGAGTAACAAGTTTCTCCTCAAGAAAGTTGTAATCTCAAGGATTTATCCGTCTGGTTCCCCAAGTATGTGTTCTTTGATTCAGTTGTTGCCTTGTAATTCCTTGGTGTGGTCAATTTTATCTTCTCAGACATCAACTGACGTAGGGGATGTATCCTTTGACAAAAAAAGGTCAGACAATTTCTTCTCTCGTTCAGTTACTACCCAATTGAATTTAAGTGGTCATGCTGGAAAAATCTTACACGTTGCTGTACATCCATACAACTGTGAGGTCAAAGTAGCTGCTTCTTTGGACTCTAATGGCTTGCTTCTTTTCTGGTCACTATCTAGTATCTCCACCTGTGTATTAGGCCCTCCAACACTTAATCATACCTGGGAACTTTGTGGAAAACTCGTAACTCAAGATTCATGCTCAAAATATACAAGTGTGCAGTGGGCACCATCAATATTGGATGAAGAACTGATTCTTCTGATGGGACATGCGAGAGGAATTGATTTTTTTGCTGTTAGGATTAGCCAAAGTGATGGAGAAAATACTGAATGTCACTACTTATGTACCATACCTTTCACTGGTCATGGTCCTTTTGAGAATGGTCCAACTGATATATTTTCTATTTCTTTGCCTTCTGATTGTAATACAACATATAGATTCAATAAATTTATGTTAATAGGGCTATGGATGGAAGGATTTCAGGCACTATCATGGGAAATCACCTTACATACTTATGATATCTTTGGGACTGGAGTACATTGCAATTGTGGTATTGATGATGAAAATATAGCTGAGCTCAGTATATTGACATTTGAAAGTTCTTTTGGAAGCAAAAAGTATTGTGTTAGTATAATCCCTTGCTCATCACAGTTCCCAAATTCTCAAATTCATGAGCAGATTACAAGTTTTGCCGTGGTGCACCAAGGAACTTTTGTTCCTGTGCAGCAAAAATTAACTTCTTCAGGTGAACCATATACCCCTGCATATATTATGGCTACTGGCTCTGCTGATGGAAGTTTGAAACTTTGGAGAAGCAATGTAGGCAAACCATCGATCTTTCACGTGTCTTGGGAGCTTGTTTGTGTTGTTGTCACTCATCAAGGCCCCATTACTGCATTGTGTTTGACCGATTGTGGACGGAAAATTGCAACAATTAGCAAGAACAGCCATAAACCTAATATCAGCAACGTTCGCTTATGGGAGCTTGCATGCCTTGGTGCAGGGACCCTTTTGTTTGAAGATGAACTGTCCTTTGAAAGCAGTATCATTGCAGTAGACTGGTTAACTTTAGGAAATGGTCAATTTTTACTTGGAATTTGTTTGCAAAATGAATTGCGTGTGTACTCCCTGAAGCATTTTGGTCAGACCTTGTCAGGCATCACAAAATCTTTGGATACTGAGACTTGGATCTGCATTGGGTTTGCTCGTACTTTACCATCTAATTGTGGCTTCCTCTGGGGGCCGAAGAGCACAGCAATAGTTATACATGATCATTACTTTTGTATAGTTAGTCCATGGCTGTTCCTTGGGGATAAAAACCATGATGCTATGTGCAGCCCTTATTACATTGGAGAAACTAAAAATCACCATGTCAATGGGACTGATGTCGCTGTTTTTGCTGATGAATGTTGCAGTATCAGAAAATTATCAGATGATAATTATGACAGCAAACGTAGACCAAGATCTCTCACAAACATTCATGCTGAAACCAATATTCTATCGAATAGTTCGTATCCACGGGGTGCACAGATGAAAAGTACAACTTCGCTTGGTCTTATAAGCATGCCTGATATAGCTGATAAACTGTGTGGATCACTGTCCTCTTTTCATCCTCATGCTCTCCTCGTTAATATATATTCAGGTAAGTGGAAACGTGCATATTCAGCTTTGAGTCATCTTATTGAGCATCTTTCTTCTGATAAAAAGAGCTCTGCAAACCCAACCAATACTATTCCGGAGATCCTTTTGTCAGATTATTTTGAAGGAGTTGCGAAAACTTCTACTGATAAAGAAGTTCAGTGGAGTATGAATGGCTTAGCATCCCAATTTAAGGAAGGTGTTTCACCATGGACCTTCAATTGGGATTCTATTAGTAACGACAGCTCATTCATTCCGTCCTCTACGAAATCTGAATTCAGCACCTTTATTGAGCCTCTTGAGAAGTTCTATGAATCAGCAGGGCTAACCAGCATGGAGAAGACAGAAACTCTAGCAATTATAGATCTTCTTGATGAAATCAGTAATAAATCTTCTGCATCTGCCTACGAAAGTCTTGATGAGCCCGGGAGAAGATATTGGATTGCTTTGAGGTTTCAACAACTGCGATTTCTTCGGCGTGATGGTAGATCAGCATCTCTGGAAGAGCTGACCATTGATTCAAGATTGATTGGATGGGCCTATCACTCTGATTGTCAGCAAAATCTATTGGATTCGGTTATCTCTAAAGAACCAACATGGCAGGAAATGCGAAGTTTGGGTGTTGGAATTTGGTTTACTAATACAACACAGTTGCGTGCAAGGATGGAGAAACTGGCGAGATCCCAATATCTGAAGAAAAAAGATCCCAAGGATTGTATGCTTCTGTATGTCACACTGAATAGAATCCAAGTTTTAGCTGGCCTTTTAAAAATTAGCAGGGATGAGAAAGATAAACCCTTGGTGGGATTTCTTTCACGCAATTTTCAGGAGGAGAGAAATAAGGCAGCAGCTTTGAAGAATGCCTATGTTTTAATGGGGAAACATCAACTGGAATTGGCTGTTGCTTTTTTTTTGCTTGGTGGTGATACTTCTTCTGCTATCAGAGTCTGTGCGAAAAATCTTGGGGATGAGCAGCTTGCACTGGTAATTAGTCTTTTAGTTGAAGGGCGTGGTGGACCTCTTCAGCAACACCTAATAACAAAGTTCATGCTTCCATCTGCCATTGAGAAGGGTGATACCTGGCTTGCAAGCATTCTTGAGTGGGAATTAGGAAACTACTCTCAATCTTTCCTGAACGCGCTTGGTTTGGAGTCAGAGTCAAATTCTGTTACTGGGATACCGTTTCTTTCAAGTAGACATATTTCTTTACAAGACCCAAGCGTTGGTTTGTATTGTCTATTGTTAGCAACCAAAAATAGCATGAAGAAAGCAGTTGGAGAGCAATCTGCTGAGGTCCTTTGTCGGATTGCAACCTTGATGACGGCTACTGCTTTAAACAGATGTGGTCTTCCTCTTGAAGCTTTGGAACAAATGTCAACCTGTGGAAGCATTACTGAAGTTTCAGATGGGACTAATGGAGTCGATATTCTGTGTTTTGAGACTATAAGAAAAATCTGTAAGCAATCGCCCAGAGATTCCTCCAGTTGGCTCTCTGTTGAATTTGCAGTCCATCTGGAGTATCGAGCAAAATTGGATTTGGCAGTTCAATACTTCTCAAAATTGATAAGGAAGCATCCAAGCTTTCCAACTATAAATTTAGAATCTGTTGGATGCATGGGTTGCTTGAAGGAATATGAGATGGATTATGAGAAATCTCTTGAAAGGTTTCAGCGTAAGTTAAATGTAGGGTTTGCACAGTTTGAAATGAAGTTCTCGTTGCTTCCTGCCTCTCTTGTTAGTATGATGTTAGTCTTTCTGTGTAATGTTGGGCTACAATTTATTGGATATGATATATTTCATGGATTTGCTTCTCAAGAATGCCCGGATGACAAGAACCAGAAGATTTATACTTTCCTCTTGCATCCTCTCGTGCACAAGTCACTACTCAAAACAGCACAAGAAATTTTATTTTCAGCTTCACGGTACACTATTGCTTGCAGTTTATCTTTTCACAAAGGCGAAACAGGGTCAAAATGTTTCGATACTTGGTGGTACTACCTTCAAGGTCTCTTACTATCCTTACAGGGTTTAAGAGCTGCTTTGAGAATTACACATGGTTCTCTCAAAGACGATCTTGTTTCCAAGCTCCTGACCATTCTTGATTTGGTTGAATATAACTTATATTTCACGTCTGCTTGGCTATTGAGAGACTCAAAATGTCTCCTTAAGATGCTGCAACCACTCTTGGCAAATGCACGATCTCCTCATGACATTGACGTGGAACATCTGAAGCAACTCCTCCCTCAGATTGGAGAGTTGATAGCTCAAAATTTATTGACCGATGTAGATTATAACCATCAGATTTTGGAAGGCATGCCTAATGCACAAAGTGATGACATTGTGCATTCAATTCCAGGAGATGAAAGATGGCACATTATTGGGGCTGTTTTGTGGCATCACATGTCCAAATTCATGAAACATAAGTTGATTACTTTAACTAATGCATCTAAGGAGGGTAGCTTGAGCAGTATCATTCTTGGGAACCTTGATACATGGGCTCAAAGTCTTTCAACCATTAAATCTGATTGGAAAGCCATTTCAAAGGATGTGATTGAATTGGTATCAGTGAGTCTTACTGCTCTACTGACTATCGTCCTTGCTCAGGTTTCTTCTTATCAACTAAAACAATTGGTATCATCTCTGCAATATAAATTAGATCAGAAGCTGTACGTGGCAACTGCTGTTTGGTTTGAACAGATTTGCCAGTCTCTATCTAGTCATGACAAGGGCCATACTGATGAGATTTACGATATGGATATGTGTATCAGAGGTGAATTTGAAACATTATGGAACGTTACTTCCAATCCCAATCTAATATCAGACTGCTTTACACATGAAAAGGTTCATATGTTGCATTGTTTTGATCGTAAACTCTCTGAAAGATGGAGTGATATTTACAATGGCATCACAAGGAAAGAGCACAATTGTACTCATGAAGCTGCACATATTAGTAGGTCTGTCAGCGATGCCACTGGATCACCTGGTAAATTACTTCGTAATGGGAAAACTCTTGTCAGATCTGACAAGGAATTGGCCACCCTTGATGATGCCATGCCTTTTCAGAAACCTAAAGAGATATATAGGAGGAATGGAGAGCTTTTAGAGGTAATTTTCAATCCAATCGCATTGTGTATCAACTCTGTTGATCAAAGACAAGCTGCAGTTGCTAGCAATAAAAAGATGTACGTAACCGAGTATTTATCCAACTGGGAAGATGGGATGGCCTCCAGAGATGATGAGGATTATATCTGGTCAAACTCTGAGTGGCCTCTAAATCTAAATGGGTGGGCAGCCTCTGAATCAACACCAGCTCCAACGTGTGTATTTCCTGGTGTTGGTCTTGGGAGCAGCAAAGGGGCACACCTTGGGTTAGGTGGTGCTACTCTGGGTGTAGGATCATCTGTAAGGCCTGGGAGAGATTTGACTGGAGGTGGAGCATTTGGTATTTCGGGTTATGCAGGTGTTGGTGCTTCTGGCTTAGGTTGGGAAACACAAGAGGATTTCGAGGAATTTGTAGACCCTCCAGCTACAGCAGAACACACAAATTCGAGGGCTTTCTCCAGTCATCCTTCTAGACCTCTTTTCTTGGTTGGCTCTACCAATACACATGTATACTTGTGGGAGTTCGGGAAGAACAGAGCTACTGCAACTTATGGTGTCTTGCCTGCAGCAAATGTGCCTCCACCATACGCTCTAGCCTCAATATCATCTGTGCAGTTTGACCAATGTGGACACAGATTTGCTACTGCTGCATTAGATGGGACCGTATGCTCATGGCAGTTGGAGGTTGGAGGAAGAAGCAATGTCCGTCCAACGGAATCTTCTTTATGCTTTAATGGCCATGCATCGGATGTCACTTATGTTACTTCCAGTGGATCAATAATAGCTGTGGCTGGATATAGCTCTACTGCAGTTAATGTGGTCATATGGGATACACTGGCTCCACCTAAAACTTCCCAAGCAGCCATTATGTGTCACGAAGGTGGTGCTCGCTCTATTTCTGTGTTTGATAATGAAATAGGGAGCGGTTCTGTTTCTCCCCTCATAGTTACTGGTGGCAAAGGTGGAGACGTGGCAATTCATGATTTCCGATACGTAGTCACTGGTAGGACTAAGAAACAAAAAAACTGTTCAAAAGATGAAATGATTAGCAATGCTTCGAATTCTGACATGCCGAGTACCGTTGGTGAACAAAACTTAAATGGAATGCTTTGGTACATACCGAAGGCTCACTCTGGGAGTGTCACTAAAATATCTTCTATCCCAAATACAAGTTTGTTCTTGACCGGAAGTAAAGATGGAGACGTAAAACTTTGGGATGCAAAAAGAGCTAAATTAGTGCATCATTGGCCAAAGTTGCATGATAGACACACTTTTCTGCAACCAAGTTCCCGAGGTTTCGGTGAAGTAGTTCGGGCCGCTGTTACTGATATACAAGTTATCTCAAGTGGATTTCTTACCTGTGGTGGAGATGGCTTGGTAAAGCTGGTTCAGTTTGGATGATCTAATACATAACAGGCACAATAGTTTGGTTACTGACTGTAAAAACGTGGGAATCTGAGATGTGCCAAAGAGGTCTGGCACAAGAACTGTGTTGTGTGTAGGACTCAGCATGTTTAAGAAGACATTCGGTGCATAAGCTTAAATCAGTCCTGTAATCTTCAAAGTCGAGTTCGACATGCTGTGACCGTATGTAGTAATAGTTACTTATGAAGTGAAGAAGCTAGTTTTCCTCGCTGAGGGATTACAGCAAACTTGAGTCTATTATCTTGTTATGAGGTAAAGGAAAGGCTCCTTGGCTATGATTGTTACAAACCCAAACAGAATAGAAGTAAGGATGGACTACAGGCATCATCCAGACATACATGGAGTGAACAAAGAATTTTCTGAATATGATCGATTGGAACGACTGGGCCTCTCGTCAAGCCGACTCATTCCTCTTTCTCAAGAGAGACTTTGCTTGAGCCTCAAGCTTCTCTAACTTAAAGCTCTATTCTTTGTTTCATGATATCCACAAATGTTTCGAAACTGTCGTTGGAGTAAATCTAATAGAATGTTAGATGAAAAAAATTTATGTGACGATGAGTTGAAATGATTTAGTGACAATCGTGATACTTGTTTGGATTAAGGCTCCATTTGGG
Coding sequence (CDS)
ATGGCGGGAACGGCCTCGGAGATCGATCCTATCTATCGTCTCCCTCTCCCTCTTTTAGGATCCGAGCCGATCCCTTCCGCCCCGAATAGATTCGCTGGTTCCTCCATCGACTGGATTCCAGATTTTGCTGGCTATGCATGGGTCGCTTATGGAGCCTCATCCGTTTTGGTGATTTCTCACTTCCCTTCTCCTCTGTCTCCCCAAGAAACCACAATTGGACCCATATTTCGCCAGGTACTGGAGCTCTCTGGCGACGACTTATCTGTTGTCAATGCCGTATCTTGGTCTCCCGTATTACCGTCGGAGGGCGAGCTCGCTGCAGCTGTGGGCAACCGGATATGGGTGTTTTCTCATGATTTGGGTGCTTCTCGAGGTTCTTTTTGTTGGAGGCAGAATTCCGTGCTTCTGCAATCTTTGAAGGTTGAGGCCATTCAGTGGACGGCTGCAGGAGATGGAATTATTGCTGGTGGAGTAGAGGTTGTTTTGTGGAAGAACACTAACAGGTCCTGGGAAATTGCTTGGAAGTTCAAACCGGATGTACCCCAAACTCTTGTTTCCGCAAGCTGGTCTACCGAGGGACCATTTGCAACGGCATCTCAAGCAAGGATATCGAAGATGGATAACACGTTTATTGATAAAGCTTGCCGATCTGTGCTGGTCTGTCAGAGTGAAGGGGAGTACGGACATGTGAAAAGCGAGTTATGCCATCCTCTACCTATAACTATGATTCAATGGAGGACTTCAATTAAAGAGAAAGGAAGTTCCAAGAATACACCAAGACATGTGCTTCTGACATGCTGCTTGGATGGAACTGTGAGATTGTGGAGTGAGACTGAAAATGGAAAAGTAAAAAAATTTAGTAAGGATGTAAATAGTAAAAAGTCATTGAGAAGGCGTTTTTCTGTTGCTGCTGTCATTGAGGTAAATCAGGCATTGAATGGAACTCTTGGCACGGATTTATTTGTAACATGGGCAACCGAGATTAGAGGTATGTGCAAACCTTTTGAATTAACTAAGAAAGTTTTCTCCGAAGGATTCGAACACAACAGGGCTGGAAGCTGTGAGTGGCTAATAAGTTTAGGTCCGGGATCATTGGTTACTTTTTGGGCTGTACATTGTCTTGATGACGTATCCCCATTAAGATTCCCTCGAGTCACATTGTGGAAGAAGCAAGAGCTCAAAGGACTTGAAGTTGGACGGCATTACATTGATGGTTGTACAAACTTGAGTAACAAGTTTCTCCTCAAGAAAGTTGTAATCTCAAGGATTTATCCGTCTGGTTCCCCAAGTATGTGTTCTTTGATTCAGTTGTTGCCTTGTAATTCCTTGGTGTGGTCAATTTTATCTTCTCAGACATCAACTGACGTAGGGGATGTATCCTTTGACAAAAAAAGGTCAGACAATTTCTTCTCTCGTTCAGTTACTACCCAATTGAATTTAAGTGGTCATGCTGGAAAAATCTTACACGTTGCTGTACATCCATACAACTGTGAGGTCAAAGTAGCTGCTTCTTTGGACTCTAATGGCTTGCTTCTTTTCTGGTCACTATCTAGTATCTCCACCTGTGTATTAGGCCCTCCAACACTTAATCATACCTGGGAACTTTGTGGAAAACTCGTAACTCAAGATTCATGCTCAAAATATACAAGTGTGCAGTGGGCACCATCAATATTGGATGAAGAACTGATTCTTCTGATGGGACATGCGAGAGGAATTGATTTTTTTGCTGTTAGGATTAGCCAAAGTGATGGAGAAAATACTGAATGTCACTACTTATGTACCATACCTTTCACTGGTCATGGTCCTTTTGAGAATGGTCCAACTGATATATTTTCTATTTCTTTGCCTTCTGATTGTAATACAACATATAGATTCAATAAATTTATGTTAATAGGGCTATGGATGGAAGGATTTCAGGCACTATCATGGGAAATCACCTTACATACTTATGATATCTTTGGGACTGGAGTACATTGCAATTGTGGTATTGATGATGAAAATATAGCTGAGCTCAGTATATTGACATTTGAAAGTTCTTTTGGAAGCAAAAAGTATTGTGTTAGTATAATCCCTTGCTCATCACAGTTCCCAAATTCTCAAATTCATGAGCAGATTACAAGTTTTGCCGTGGTGCACCAAGGAACTTTTGTTCCTGTGCAGCAAAAATTAACTTCTTCAGGTGAACCATATACCCCTGCATATATTATGGCTACTGGCTCTGCTGATGGAAGTTTGAAACTTTGGAGAAGCAATGTAGGCAAACCATCGATCTTTCACGTGTCTTGGGAGCTTGTTTGTGTTGTTGTCACTCATCAAGGCCCCATTACTGCATTGTGTTTGACCGATTGTGGACGGAAAATTGCAACAATTAGCAAGAACAGCCATAAACCTAATATCAGCAACGTTCGCTTATGGGAGCTTGCATGCCTTGGTGCAGGGACCCTTTTGTTTGAAGATGAACTGTCCTTTGAAAGCAGTATCATTGCAGTAGACTGGTTAACTTTAGGAAATGGTCAATTTTTACTTGGAATTTGTTTGCAAAATGAATTGCGTGTGTACTCCCTGAAGCATTTTGGTCAGACCTTGTCAGGCATCACAAAATCTTTGGATACTGAGACTTGGATCTGCATTGGGTTTGCTCGTACTTTACCATCTAATTGTGGCTTCCTCTGGGGGCCGAAGAGCACAGCAATAGTTATACATGATCATTACTTTTGTATAGTTAGTCCATGGCTGTTCCTTGGGGATAAAAACCATGATGCTATGTGCAGCCCTTATTACATTGGAGAAACTAAAAATCACCATGTCAATGGGACTGATGTCGCTGTTTTTGCTGATGAATGTTGCAGTATCAGAAAATTATCAGATGATAATTATGACAGCAAACGTAGACCAAGATCTCTCACAAACATTCATGCTGAAACCAATATTCTATCGAATAGTTCGTATCCACGGGGTGCACAGATGAAAAGTACAACTTCGCTTGGTCTTATAAGCATGCCTGATATAGCTGATAAACTGTGTGGATCACTGTCCTCTTTTCATCCTCATGCTCTCCTCGTTAATATATATTCAGGTAAGTGGAAACGTGCATATTCAGCTTTGAGTCATCTTATTGAGCATCTTTCTTCTGATAAAAAGAGCTCTGCAAACCCAACCAATACTATTCCGGAGATCCTTTTGTCAGATTATTTTGAAGGAGTTGCGAAAACTTCTACTGATAAAGAAGTTCAGTGGAGTATGAATGGCTTAGCATCCCAATTTAAGGAAGGTGTTTCACCATGGACCTTCAATTGGGATTCTATTAGTAACGACAGCTCATTCATTCCGTCCTCTACGAAATCTGAATTCAGCACCTTTATTGAGCCTCTTGAGAAGTTCTATGAATCAGCAGGGCTAACCAGCATGGAGAAGACAGAAACTCTAGCAATTATAGATCTTCTTGATGAAATCAGTAATAAATCTTCTGCATCTGCCTACGAAAGTCTTGATGAGCCCGGGAGAAGATATTGGATTGCTTTGAGGTTTCAACAACTGCGATTTCTTCGGCGTGATGGTAGATCAGCATCTCTGGAAGAGCTGACCATTGATTCAAGATTGATTGGATGGGCCTATCACTCTGATTGTCAGCAAAATCTATTGGATTCGGTTATCTCTAAAGAACCAACATGGCAGGAAATGCGAAGTTTGGGTGTTGGAATTTGGTTTACTAATACAACACAGTTGCGTGCAAGGATGGAGAAACTGGCGAGATCCCAATATCTGAAGAAAAAAGATCCCAAGGATTGTATGCTTCTGTATGTCACACTGAATAGAATCCAAGTTTTAGCTGGCCTTTTAAAAATTAGCAGGGATGAGAAAGATAAACCCTTGGTGGGATTTCTTTCACGCAATTTTCAGGAGGAGAGAAATAAGGCAGCAGCTTTGAAGAATGCCTATGTTTTAATGGGGAAACATCAACTGGAATTGGCTGTTGCTTTTTTTTTGCTTGGTGGTGATACTTCTTCTGCTATCAGAGTCTGTGCGAAAAATCTTGGGGATGAGCAGCTTGCACTGGTAATTAGTCTTTTAGTTGAAGGGCGTGGTGGACCTCTTCAGCAACACCTAATAACAAAGTTCATGCTTCCATCTGCCATTGAGAAGGGTGATACCTGGCTTGCAAGCATTCTTGAGTGGGAATTAGGAAACTACTCTCAATCTTTCCTGAACGCGCTTGGTTTGGAGTCAGAGTCAAATTCTGTTACTGGGATACCGTTTCTTTCAAGTAGACATATTTCTTTACAAGACCCAAGCGTTGGTTTGTATTGTCTATTGTTAGCAACCAAAAATAGCATGAAGAAAGCAGTTGGAGAGCAATCTGCTGAGGTCCTTTGTCGGATTGCAACCTTGATGACGGCTACTGCTTTAAACAGATGTGGTCTTCCTCTTGAAGCTTTGGAACAAATGTCAACCTGTGGAAGCATTACTGAAGTTTCAGATGGGACTAATGGAGTCGATATTCTGTGTTTTGAGACTATAAGAAAAATCTGTAAGCAATCGCCCAGAGATTCCTCCAGTTGGCTCTCTGTTGAATTTGCAGTCCATCTGGAGTATCGAGCAAAATTGGATTTGGCAGTTCAATACTTCTCAAAATTGATAAGGAAGCATCCAAGCTTTCCAACTATAAATTTAGAATCTGTTGGATGCATGGGTTGCTTGAAGGAATATGAGATGGATTATGAGAAATCTCTTGAAAGGTTTCAGCGTAAGTTAAATGTAGGGTTTGCACAGTTTGAAATGAAGTTCTCGTTGCTTCCTGCCTCTCTTGTTAGTATGATGTTAGTCTTTCTGTGTAATGTTGGGCTACAATTTATTGGATATGATATATTTCATGGATTTGCTTCTCAAGAATGCCCGGATGACAAGAACCAGAAGATTTATACTTTCCTCTTGCATCCTCTCGTGCACAAGTCACTACTCAAAACAGCACAAGAAATTTTATTTTCAGCTTCACGGTACACTATTGCTTGCAGTTTATCTTTTCACAAAGGCGAAACAGGGTCAAAATGTTTCGATACTTGGTGGTACTACCTTCAAGGTCTCTTACTATCCTTACAGGGTTTAAGAGCTGCTTTGAGAATTACACATGGTTCTCTCAAAGACGATCTTGTTTCCAAGCTCCTGACCATTCTTGATTTGGTTGAATATAACTTATATTTCACGTCTGCTTGGCTATTGAGAGACTCAAAATGTCTCCTTAAGATGCTGCAACCACTCTTGGCAAATGCACGATCTCCTCATGACATTGACGTGGAACATCTGAAGCAACTCCTCCCTCAGATTGGAGAGTTGATAGCTCAAAATTTATTGACCGATGTAGATTATAACCATCAGATTTTGGAAGGCATGCCTAATGCACAAAGTGATGACATTGTGCATTCAATTCCAGGAGATGAAAGATGGCACATTATTGGGGCTGTTTTGTGGCATCACATGTCCAAATTCATGAAACATAAGTTGATTACTTTAACTAATGCATCTAAGGAGGGTAGCTTGAGCAGTATCATTCTTGGGAACCTTGATACATGGGCTCAAAGTCTTTCAACCATTAAATCTGATTGGAAAGCCATTTCAAAGGATGTGATTGAATTGGTATCAGTGAGTCTTACTGCTCTACTGACTATCGTCCTTGCTCAGGTTTCTTCTTATCAACTAAAACAATTGGTATCATCTCTGCAATATAAATTAGATCAGAAGCTGTACGTGGCAACTGCTGTTTGGTTTGAACAGATTTGCCAGTCTCTATCTAGTCATGACAAGGGCCATACTGATGAGATTTACGATATGGATATGTGTATCAGAGGTGAATTTGAAACATTATGGAACGTTACTTCCAATCCCAATCTAATATCAGACTGCTTTACACATGAAAAGGTTCATATGTTGCATTGTTTTGATCGTAAACTCTCTGAAAGATGGAGTGATATTTACAATGGCATCACAAGGAAAGAGCACAATTGTACTCATGAAGCTGCACATATTAGTAGGTCTGTCAGCGATGCCACTGGATCACCTGGTAAATTACTTCGTAATGGGAAAACTCTTGTCAGATCTGACAAGGAATTGGCCACCCTTGATGATGCCATGCCTTTTCAGAAACCTAAAGAGATATATAGGAGGAATGGAGAGCTTTTAGAGGTAATTTTCAATCCAATCGCATTGTGTATCAACTCTGTTGATCAAAGACAAGCTGCAGTTGCTAGCAATAAAAAGATGTACGTAACCGAGTATTTATCCAACTGGGAAGATGGGATGGCCTCCAGAGATGATGAGGATTATATCTGGTCAAACTCTGAGTGGCCTCTAAATCTAAATGGGTGGGCAGCCTCTGAATCAACACCAGCTCCAACGTGTGTATTTCCTGGTGTTGGTCTTGGGAGCAGCAAAGGGGCACACCTTGGGTTAGGTGGTGCTACTCTGGGTGTAGGATCATCTGTAAGGCCTGGGAGAGATTTGACTGGAGGTGGAGCATTTGGTATTTCGGGTTATGCAGGTGTTGGTGCTTCTGGCTTAGGTTGGGAAACACAAGAGGATTTCGAGGAATTTGTAGACCCTCCAGCTACAGCAGAACACACAAATTCGAGGGCTTTCTCCAGTCATCCTTCTAGACCTCTTTTCTTGGTTGGCTCTACCAATACACATGTATACTTGTGGGAGTTCGGGAAGAACAGAGCTACTGCAACTTATGGTGTCTTGCCTGCAGCAAATGTGCCTCCACCATACGCTCTAGCCTCAATATCATCTGTGCAGTTTGACCAATGTGGACACAGATTTGCTACTGCTGCATTAGATGGGACCGTATGCTCATGGCAGTTGGAGGTTGGAGGAAGAAGCAATGTCCGTCCAACGGAATCTTCTTTATGCTTTAATGGCCATGCATCGGATGTCACTTATGTTACTTCCAGTGGATCAATAATAGCTGTGGCTGGATATAGCTCTACTGCAGTTAATGTGGTCATATGGGATACACTGGCTCCACCTAAAACTTCCCAAGCAGCCATTATGTGTCACGAAGGTGGTGCTCGCTCTATTTCTGTGTTTGATAATGAAATAGGGAGCGGTTCTGTTTCTCCCCTCATAGTTACTGGTGGCAAAGGTGGAGACGTGGCAATTCATGATTTCCGATACGTAGTCACTGGTAGGACTAAGAAACAAAAAAACTGTTCAAAAGATGAAATGATTAGCAATGCTTCGAATTCTGACATGCCGAGTACCGTTGGTGAACAAAACTTAAATGGAATGCTTTGGTACATACCGAAGGCTCACTCTGGGAGTGTCACTAAAATATCTTCTATCCCAAATACAAGTTTGTTCTTGACCGGAAGTAAAGATGGAGACGTAAAACTTTGGGATGCAAAAAGAGCTAAATTAGTGCATCATTGGCCAAAGTTGCATGATAGACACACTTTTCTGCAACCAAGTTCCCGAGGTTTCGGTGAAGTAGTTCGGGCCGCTGTTACTGATATACAAGTTATCTCAAGTGGATTTCTTACCTGTGGTGGAGATGGCTTGGTAAAGCTGGTTCAGTTTGGATGA
Protein sequence
MAGTASEIDPIYRLPLPLLGSEPIPSAPNRFAGSSIDWIPDFAGYAWVAYGASSVLVISHFPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDLGASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDVPQTLVSASWSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPITMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRFSVAAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISLGPGSLVTFWAVHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVISRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNLSGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGHGPFENGPTDIFSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGVHCNCGIDDENIAELSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQQKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCLTDCGRKIATISKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGNGQFLLGICLQNELRVYSLKHFGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIVIHDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNYDSKRRPRSLTNIHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFHPHALLVNIYSGKWKRAYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTSTDKEVQWSMNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLEELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDPKDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITKFMLPSAIEKGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSVGLYCLLLATKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVSDGTNGVDILCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSFPTINLESVGCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCNVGLQFIGYDIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSLSFHKGETGSKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLYFTSAWLLRDSKCLLKMLQPLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQILEGMPNAQSDDIVHSIPGDERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGNLDTWAQSLSTIKSDWKAISKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQKLYVATAVWFEQICQSLSSHDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDRKLSERWSDIYNGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDDAMPFQKPKEIYRRNGELLEVIFNPIALCINSVDQRQAAVASNKKMYVTEYLSNWEDGMASRDDEDYIWSNSEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFLVGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTVCSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKNCSKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTCGGDGLVKLVQFG
Homology
BLAST of CmoCh06G010300 vs. ExPASy Swiss-Prot
Match:
Q6PNC0 (DmX-like protein 1 OS=Mus musculus OX=10090 GN=Dmxl1 PE=1 SV=1)
HSP 1 Score: 148.7 bits (374), Expect = 9.0e-34
Identity = 105/316 (33.23%), Postives = 157/316 (49.68%), Query Frame = 0
Query: 1132 GLTSMEKTETLAIIDLLDEIS-----NKSSASAYESLDEPGRRYWIALRFQQLRFLRRDG 1191
GLT ME+ +A+ D + S ++ E+LDE G ++ +A+R
Sbjct: 1498 GLTRMEQMSLMALADTIATTSTDIGESRDRNQGGETLDECGLKFLLAVRLHTFLTTSLPA 1557
Query: 1192 RSASLEELTIDSRLIGWAYHSDCQQ---NLLDSVISKEPTWQEMRSLGVGIWFTNTTQLR 1251
A L + + WA+HS ++ N+L ++ +PTW E+R++GVG W N LR
Sbjct: 1558 YRAQLLHQGLSTGHFAWAFHSVAEEELLNMLPAMQKDDPTWSELRAMGVGWWVRNARILR 1617
Query: 1252 ARMEKLARSQYLKKKDPKDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEER 1311
+EK+A++ + + DP D + Y+ + + V+ GL R +KD + F NF+EER
Sbjct: 1618 RCIEKVAKAAFHRNNDPLDAAIFYLAMKKKAVIWGLY---RSQKDTKMTQFFGHNFEEER 1677
Query: 1312 NKAAALKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGG 1371
+ AALKNA+ L+GK + E + AFFLLGG AI VC + L D QLALVI+ L E
Sbjct: 1678 WRKAALKNAFSLLGKQRFEHSAAFFLLGGCLKDAIEVCLEKLNDIQLALVIARLFE---- 1737
Query: 1372 PLQQHLITKFMLPSAIEKGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFL-SS 1431
S +K T+ + + + LG S A L S S + PFL S
Sbjct: 1738 -------------SEFDKSATYKSILRKKVLGIGSP----ASELSSSSINAHHDPFLRSM 1789
Query: 1432 RHISLQDPSVGLYCLL 1439
H L+D S L L+
Sbjct: 1798 AHWILEDYSAALETLI 1789
BLAST of CmoCh06G010300 vs. ExPASy Swiss-Prot
Match:
Q9Y485 (DmX-like protein 1 OS=Homo sapiens OX=9606 GN=DMXL1 PE=1 SV=3)
HSP 1 Score: 145.6 bits (366), Expect = 7.7e-33
Identity = 84/236 (35.59%), Postives = 131/236 (55.51%), Query Frame = 0
Query: 1132 GLTSMEKTETLAIIDLLDEIS-----NKSSASAYESLDEPGRRYWIALRFQQLRFLRRDG 1191
GL+ ME+ +A+ D + S ++ + E+LDE G ++ +A+R
Sbjct: 1500 GLSRMEQMSLMALADTIATTSTDIGESRDRSQGGETLDECGLKFLLAVRLHTFLTTSLPA 1559
Query: 1192 RSASLEELTIDSRLIGWAYHSDCQQ---NLLDSVISKEPTWQEMRSLGVGIWFTNTTQLR 1251
A L + + WA+HS ++ N+L ++ +PTW E+R++GVG W NT LR
Sbjct: 1560 YRAQLLHQGLSTSHFAWAFHSVAEEELLNMLPAMQKDDPTWSELRAMGVGWWVRNTRILR 1619
Query: 1252 ARMEKLARSQYLKKKDPKDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEER 1311
+EK+A++ + +K DP D + Y+ + + V+ GL R EK+ + F NF++ER
Sbjct: 1620 KCIEKVAKAAFYRKNDPLDAAIFYLAMKKKAVIWGLY---RAEKNTRMTQFFGHNFEDER 1679
Query: 1312 NKAAALKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVE 1360
+ AALKNA+ L+GK + E + AFFLL G AI VC + L D QLALVI+ L E
Sbjct: 1680 WRKAALKNAFSLLGKQRFEHSAAFFLLAGCLRDAIEVCLEKLNDIQLALVIARLYE 1732
BLAST of CmoCh06G010300 vs. ExPASy Swiss-Prot
Match:
Q8TDJ6 (DmX-like protein 2 OS=Homo sapiens OX=9606 GN=DMXL2 PE=1 SV=2)
HSP 1 Score: 136.3 bits (342), Expect = 4.6e-30
Identity = 91/274 (33.21%), Postives = 143/274 (52.19%), Query Frame = 0
Query: 1098 NWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTETLAIIDL-------LDE 1157
N + N S + P+ E + + GLT +E+ +A+ D LDE
Sbjct: 1491 NKSKVINLSQYGPAYFGQEHARVLSSHLMHSSLPGLTRLEQMFLVALADTVATTSTELDE 1550
Query: 1158 ISNKSSASAYESLDEPGRRYWIALRFQQ--LRFLRRDGRSASLEELTIDSRLIGWAYHSD 1217
S S S ++LDE G RY +A+R L L R L + + + WA+HS+
Sbjct: 1551 -SRDKSCSGRDTLDECGLRYLLAMRLHTCLLTSLPPLYRVQLLHQ-GVSTCHFAWAFHSE 1610
Query: 1218 CQQ---NLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDPKDCML 1277
++ N++ ++ +P W E+R++G+G W N LR +EK+A++ + + D D L
Sbjct: 1611 AEEELINMIPAIQRGDPQWSELRAMGIGWWVRNINTLRRCIEKVAKASFQRNNDALDAAL 1670
Query: 1278 LYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQLELAV 1337
Y+++ + V+ GL + DEK + F S NF E+R + AALKNA+ L+GK + E +
Sbjct: 1671 FYLSMKKKAVVWGLFRSQHDEK---MTTFFSHNFNEDRWRKAALKNAFSLLGKQRFEQSA 1730
Query: 1338 AFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVE 1360
AFFLL G AI VC + + D QLA+VI+ L E
Sbjct: 1731 AFFLLAGSLKDAIEVCLEKMEDIQLAMVIARLYE 1759
BLAST of CmoCh06G010300 vs. ExPASy Swiss-Prot
Match:
Q8BPN8 (DmX-like protein 2 OS=Mus musculus OX=10090 GN=Dmxl2 PE=1 SV=3)
HSP 1 Score: 133.3 bits (334), Expect = 3.9e-29
Identity = 89/274 (32.48%), Postives = 144/274 (52.55%), Query Frame = 0
Query: 1098 NWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTETLAIIDL-------LDE 1157
N + N S + P+ E + + GLT +E+ +A+ D LDE
Sbjct: 1490 NKSKVINLSQYGPACFGQEHARVLSSHLMHSSLPGLTRLEQMFLVALADTVATTSTELDE 1549
Query: 1158 ISNKSSASAYESLDEPGRRYWIALRFQQ--LRFLRRDGRSASLEELTIDSRLIGWAYHSD 1217
+K + S ++LDE G RY +A+R L L R L + + + WA+HS+
Sbjct: 1550 NRDK-NYSGRDTLDECGLRYLLAMRLHTCLLTSLPPLYRVQLLHQ-GVSTCHFAWAFHSE 1609
Query: 1218 CQQ---NLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDPKDCML 1277
++ N++ ++ +P W E+R++G+G W N LR +EK+A++ + + + D L
Sbjct: 1610 AEEELINMIPAIQRGDPQWSELRAMGIGWWVRNVNTLRRCIEKVAKAAFQRNNEALDAAL 1669
Query: 1278 LYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQLELAV 1337
Y+++ + V+ GL + DEK + F S NF E+R + AALKNA+ L+GK + E +
Sbjct: 1670 FYLSMKKKAVVWGLFRSQHDEK---MTTFFSHNFNEDRWRKAALKNAFSLLGKQRFEQSA 1729
Query: 1338 AFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVE 1360
AFFLL G AI VC + + D QLA+VI+ L E
Sbjct: 1730 AFFLLAGSLKDAIEVCLEKMEDIQLAMVIARLFE 1758
BLAST of CmoCh06G010300 vs. ExPASy Swiss-Prot
Match:
P47104 (Regulator of V-ATPase in vacuolar membrane protein 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=RAV1 PE=1 SV=1)
HSP 1 Score: 127.5 bits (319), Expect = 2.2e-27
Identity = 97/343 (28.28%), Postives = 162/343 (47.23%), Query Frame = 0
Query: 1133 LTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLE 1192
LT ++ + +I+ +DE++ + +D G R+ + ++ FL S
Sbjct: 852 LTRHQQITLITVIEAVDEVTKNENI-----VDYNGVRFLLGVKL----FLSHKNIQKS-- 911
Query: 1193 ELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARS 1252
I R + WA HSD ++ LL S+ +W R + W L + E +A+
Sbjct: 912 ---ILMRDVSWALHSDNKEILLSSIDRHITSWNRAREYRIAYWI-KEQDLVKKFEDIAKY 971
Query: 1253 QYLK--KKDPKDCMLLYVTLNRIQVLAGLLK--ISRDEKDKPLVGFLSRNFQEERNKAAA 1312
++ K K+DP C + Y+ L + Q+L L K I E+ K +V F+S +F R + AA
Sbjct: 972 EFSKDDKRDPSRCAIFYLALKKKQILLSLWKMAIGHPEQQK-MVRFISNDFTVPRWRTAA 1031
Query: 1313 LKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQH 1372
LKNA+VL+ KH+ A FFLL + V K + D LA+ + + EG GP+
Sbjct: 1032 LKNAFVLLSKHRYMDAAVFFLLTDSLKDCVNVLCKQVHDMDLAIGVCRVYEGDNGPVLGE 1091
Query: 1373 LITKFMLPSAIEKGDTWLASILEWELGNYS---QSFLNA-LGLESESNSVTGIPFLSSRH 1432
L+T MLP I++ D W AS + W+L ++ L A + LE+ S S+ +R
Sbjct: 1092 LLTAQMLPETIKENDRWKASFIYWKLRKQEVAIKALLTAPIDLENNS-SIVDKEVCVNRS 1151
Query: 1433 ISLQDPSVGLYCLLLATKNSMKKAVGEQSAEVLCRIATLMTAT 1468
++DP++ LY ++K +G + E ++ T
Sbjct: 1152 FLVEDPAL-LYLYNHLRNRNLKYFIGSLNVEAKIECTLILRVT 1176
BLAST of CmoCh06G010300 vs. ExPASy TrEMBL
Match:
A0A6J1GHN3 (uncharacterized protein LOC111454255 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454255 PE=4 SV=1)
HSP 1 Score: 5082.7 bits (13183), Expect = 0.0e+00
Identity = 2509/2524 (99.41%), Postives = 2511/2524 (99.48%), Query Frame = 0
Query: 1 MAGTASEIDPIYRLPLPLLGSEPIPSAPNRFAGSSIDWIPDFAGYAWVAYGASSVLVISH 60
MAGTASEIDPIYRLPLPLLGSEPIPSAPNRFAGSSIDWIPDFAGYAWVAYGASSVLVISH
Sbjct: 1 MAGTASEIDPIYRLPLPLLGSEPIPSAPNRFAGSSIDWIPDFAGYAWVAYGASSVLVISH 60
Query: 61 FPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDL 120
FPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDL
Sbjct: 61 FPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDL 120
Query: 121 GASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDV 180
GASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDV
Sbjct: 121 GASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDV 180
Query: 181 PQTLVSASWSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPI 240
PQTLVSASWSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPI
Sbjct: 181 PQTLVSASWSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPI 240
Query: 241 TMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRF 300
TMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRF
Sbjct: 241 TMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRF 300
Query: 301 SVAAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISL 360
SVAAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISL
Sbjct: 301 SVAAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISL 360
Query: 361 GPGSLVTFWAVHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVI 420
GPGSLVTFWAVHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVI
Sbjct: 361 GPGSLVTFWAVHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVI 420
Query: 421 SRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNL 480
SRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNL
Sbjct: 421 SRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNL 480
Query: 481 SGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLV 540
SGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLV
Sbjct: 481 SGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLV 540
Query: 541 TQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGH 600
TQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGH
Sbjct: 541 TQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGH 600
Query: 601 GPFENGPTDIFSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGVHCN 660
GPFENGPTDIFSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGVHCN
Sbjct: 601 GPFENGPTDIFSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGVHCN 660
Query: 661 CGIDDENIAELSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQ 720
CGIDDENIAELSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQ
Sbjct: 661 CGIDDENIAELSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQ 720
Query: 721 QKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCL 780
QKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCL
Sbjct: 721 QKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCL 780
Query: 781 TDCGRKIATISKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGNGQF 840
TDCGRKIATISKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGNGQF
Sbjct: 781 TDCGRKIATISKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGNGQF 840
Query: 841 LLGICLQNELRVYSLKHFGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIVI 900
LLGICLQNELRVYSLKHFGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIVI
Sbjct: 841 LLGICLQNELRVYSLKHFGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIVI 900
Query: 901 HDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNYD 960
HDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNYD
Sbjct: 901 HDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNYD 960
Query: 961 SKRRPRSLTNIHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFHPHALL 1020
SKRRPRSLTNIHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFHPHALL
Sbjct: 961 SKRRPRSLTNIHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFHPHALL 1020
Query: 1021 VNIYSGKWKRAYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTSTDKEVQWS 1080
VNIYSGKWKRAYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTSTDKEVQWS
Sbjct: 1021 VNIYSGKWKRAYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTSTDKEVQWS 1080
Query: 1081 MNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTE 1140
MNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTE
Sbjct: 1081 MNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTE 1140
Query: 1141 TLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLEELTIDSRL 1200
TLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLEELTIDSRL
Sbjct: 1141 TLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLEELTIDSRL 1200
Query: 1201 IGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDP 1260
IGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDP
Sbjct: 1201 IGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDP 1260
Query: 1261 KDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQ 1320
KDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQ
Sbjct: 1261 KDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQ 1320
Query: 1321 LELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITKFMLPSAIE 1380
LELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITKFMLPSAIE
Sbjct: 1321 LELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITKFMLPSAIE 1380
Query: 1381 KGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSVGLYCLLLA 1440
KGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSVGLYCLLLA
Sbjct: 1381 KGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSVGLYCLLLA 1440
Query: 1441 TKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVSDGTNGVDI 1500
TKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVSDGTNGVDI
Sbjct: 1441 TKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVSDGTNGVDI 1500
Query: 1501 LCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSFPTINLESV 1560
LCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSFPTINLESV
Sbjct: 1501 LCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSFPTINLESV 1560
Query: 1561 GCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCNVGLQFIGY 1620
GCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCNVGLQFIGY
Sbjct: 1561 GCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCNVGLQFIGY 1620
Query: 1621 DIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSLSFHKGETG 1680
DIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSLSFHKGETG
Sbjct: 1621 DIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSLSFHKGETG 1680
Query: 1681 SKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLYFTSAWLLR 1740
SKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLYFTSAWLLR
Sbjct: 1681 SKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLYFTSAWLLR 1740
Query: 1741 DSKCLLKMLQPLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQILEGMPNAQ 1800
DSKCLLKMLQPLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQILEGMPNAQ
Sbjct: 1741 DSKCLLKMLQPLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQILEGMPNAQ 1800
Query: 1801 SDDIVHSIPGDERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGNLDTWAQSL 1860
SDDIVHSIPGDERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGNLDTWAQSL
Sbjct: 1801 SDDIVHSIPGDERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGNLDTWAQSL 1860
Query: 1861 STIKSDWKAISKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQKLYVATAVW 1920
STIKSDWKAISKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQKLYVATAVW
Sbjct: 1861 STIKSDWKAISKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQKLYVATAVW 1920
Query: 1921 FEQICQSLSSHDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDR 1980
FEQICQSLSSHDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDR
Sbjct: 1921 FEQICQSLSSHDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDR 1980
Query: 1981 KLSERWSDIYNGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDD 2040
KLSERWSDIYNGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDD
Sbjct: 1981 KLSERWSDIYNGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDD 2040
Query: 2041 AMPFQKPKEIYRRNGELLEVIFNPIALCINSVDQRQAAVASNKKMYVTEYLSNWEDGMAS 2100
AMPFQKPKEIYRRNGELLE ALCINSVDQRQAAVASNKK + +WEDGMAS
Sbjct: 2041 AMPFQKPKEIYRRNGELLE------ALCINSVDQRQAAVASNKKGII---FVSWEDGMAS 2100
Query: 2101 RDDEDYIWSNSEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVR 2160
RDDEDYIWSNSEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVR
Sbjct: 2101 RDDEDYIWSNSEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVR 2160
Query: 2161 PGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFL 2220
PGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFL
Sbjct: 2161 PGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFL 2220
Query: 2221 VGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTV 2280
VGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTV
Sbjct: 2221 VGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTV 2280
Query: 2281 CSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPP 2340
CSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPP
Sbjct: 2281 CSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPP 2340
Query: 2341 KTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKN 2400
KTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKN
Sbjct: 2341 KTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKN 2400
Query: 2401 CSKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDV 2460
CSKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDV
Sbjct: 2401 CSKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDV 2460
Query: 2461 KLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTCGGDGLVKL 2520
KLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTCGGDGLVKL
Sbjct: 2461 KLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTCGGDGLVKL 2515
Query: 2521 VQFG 2525
VQFG
Sbjct: 2521 VQFG 2515
BLAST of CmoCh06G010300 vs. ExPASy TrEMBL
Match:
A0A6J1IFR3 (LOW QUALITY PROTEIN: uncharacterized protein LOC111474058 OS=Cucurbita maxima OX=3661 GN=LOC111474058 PE=4 SV=1)
HSP 1 Score: 4879.3 bits (12655), Expect = 0.0e+00
Identity = 2424/2547 (95.17%), Postives = 2457/2547 (96.47%), Query Frame = 0
Query: 1 MAGTASEIDPIYRLPLPLLGSEPIPSAPNRFAGSSIDWIPDFAGYAWVAYGASSVLVISH 60
MAGTASEID IYRLPLPLLGSEPIPSAPNRFAGS IDWIPDFAGYAWVAYGASSVLVISH
Sbjct: 1 MAGTASEIDSIYRLPLPLLGSEPIPSAPNRFAGSPIDWIPDFAGYAWVAYGASSVLVISH 60
Query: 61 FPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDL 120
FPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDL
Sbjct: 61 FPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDL 120
Query: 121 GASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDV 180
GASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDV
Sbjct: 121 GASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDV 180
Query: 181 PQTLVSASWSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPI 240
PQTLVSASWSTEGPFATASQARISKMDN FIDKAC+SVLV QSEGEYGHVKSELCHPLPI
Sbjct: 181 PQTLVSASWSTEGPFATASQARISKMDNMFIDKACQSVLVSQSEGEYGHVKSELCHPLPI 240
Query: 241 TMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRF 300
TMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVN KKSLRRRF
Sbjct: 241 TMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVN-KKSLRRRF 300
Query: 301 SVAAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISL 360
SV AVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISL
Sbjct: 301 SVTAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISL 360
Query: 361 GPGSLVTFWA----------------------VHCLDDVSPLRFPRVTLWKKQELKGLEV 420
GPGSLVTFWA VHCLDDVSPLRFPRVTLWKKQELKGLEV
Sbjct: 361 GPGSLVTFWAVHCLDDVSPLRFPRVTLWKKQEVHCLDDVSPLRFPRVTLWKKQELKGLEV 420
Query: 421 GRHYIDGCTNLSNKFLLKKVVISRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGD 480
GRHYIDGCTNLSNKFLLKKVVISRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGD
Sbjct: 421 GRHYIDGCTNLSNKFLLKKVVISRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGD 480
Query: 481 VSFDKKRSDNFFSRSVTTQLNLSGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSS 540
VSFDKKRSDNFFSRSVTTQLNLSGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSS
Sbjct: 481 VSFDKKRSDNFFSRSVTTQLNLSGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSS 540
Query: 541 ISTCVLGPPTLNHTWELCGKLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVR 600
ISTCVLGPPTLNHTWELCGKLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVR
Sbjct: 541 ISTCVLGPPTLNHTWELCGKLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVR 600
Query: 601 ISQSDGENTECHYLCTIPFTGHGPFENGPTDIFSISLPSDCNTTYRFNKFMLIGLW-MEG 660
ISQ+DGENTECHYLCTIPFTGHGPFENGPTDIFSISLPSDCNTTY+FNKFMLIG+W MEG
Sbjct: 601 ISQNDGENTECHYLCTIPFTGHGPFENGPTDIFSISLPSDCNTTYKFNKFMLIGVWXMEG 660
Query: 661 FQALSWEITLHTYDIFGTGVHCNCGIDDENIAELSILTFESSFGSKKYCVSIIPCSSQFP 720
FQALSWEITLHTYDIFGTGVHCNC ID+ENIAELSILTFESSFGSKKYCVSIIPCSSQFP
Sbjct: 661 FQALSWEITLHTYDIFGTGVHCNCDIDNENIAELSILTFESSFGSKKYCVSIIPCSSQFP 720
Query: 721 NSQIHEQITSFAVVHQGTFVPVQQKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSI 780
NSQI++QITSF VVHQGTF PVQQKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSI
Sbjct: 721 NSQINDQITSFGVVHQGTFAPVQQKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSI 780
Query: 781 FHVSWELVCVVVTHQGPITALCLTDCGRKIATISKNSHKPNISNVRLWELACLGAGTLLF 840
FHVSWELVCVVVTHQGPITALCLTDCGRKIATISKN+HKPNISNV LWELACLGAGTLLF
Sbjct: 781 FHVSWELVCVVVTHQGPITALCLTDCGRKIATISKNNHKPNISNVHLWELACLGAGTLLF 840
Query: 841 EDELSFESSIIAVDWLTLGNGQFLLGICLQNELRVYSLKHFGQTLSGITKSLDTETWICI 900
EDELSFES+IIAVDWLTLGNGQFLLGICLQNELRVYSLKHFGQTLSGITKSLDTETWICI
Sbjct: 841 EDELSFESNIIAVDWLTLGNGQFLLGICLQNELRVYSLKHFGQTLSGITKSLDTETWICI 900
Query: 901 GFARTLPSNCGFLWGPKSTAIVIHDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVN 960
GFARTLPSNCGFLWGPKSTAIVIHDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVN
Sbjct: 901 GFARTLPSNCGFLWGPKSTAIVIHDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVN 960
Query: 961 GTDVAVFADECCSIRKLSDDNYDSKRRPRSLTNIHAETNILSNSSYPRGAQMKSTTSLGL 1020
GTDVAVFADECCSIRKLSDDNYDSKRRPRSLTNIHAETNILSNSSYPR AQMKSTTSLGL
Sbjct: 961 GTDVAVFADECCSIRKLSDDNYDSKRRPRSLTNIHAETNILSNSSYPRVAQMKSTTSLGL 1020
Query: 1021 ISMPDIADKLCGSLSSFHPHALLVNIYSGKWKRAYSALSHLIEHLSSDKKSSANPTNTIP 1080
ISMPDIADKLCGSLSSFHPHALL++IYSGKWKRAYSALSHLIEHLSS+KKSSANPTNTIP
Sbjct: 1021 ISMPDIADKLCGSLSSFHPHALLIDIYSGKWKRAYSALSHLIEHLSSNKKSSANPTNTIP 1080
Query: 1081 EILLSDYFEGVAKTSTDKEVQWSMNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEF 1140
EILLSDYFEGVAKTSTDKEVQWSMNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEF
Sbjct: 1081 EILLSDYFEGVAKTSTDKEVQWSMNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEF 1140
Query: 1141 STFIEPLEKFYESAGLTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQ 1200
S+FIEPLEKFYESAGLTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQ
Sbjct: 1141 SSFIEPLEKFYESAGLTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQ 1200
Query: 1201 QLRFLRRDGRSASLEELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFT 1260
QLRFLR DGRSASLEELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFT
Sbjct: 1201 QLRFLRCDGRSASLEELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFT 1260
Query: 1261 NTTQLRARMEKLARSQYLKKKDPKDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSR 1320
NTTQLRARMEKLARSQYLKKKDPKDCMLLYVTLNRIQVLAGL KISRDEKDKPLVGFLSR
Sbjct: 1261 NTTQLRARMEKLARSQYLKKKDPKDCMLLYVTLNRIQVLAGLFKISRDEKDKPLVGFLSR 1320
Query: 1321 NFQEERNKAAALKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLL 1380
NFQEE+NKAAALKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVIS L
Sbjct: 1321 NFQEEKNKAAALKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISRL 1380
Query: 1381 VEGRGGPLQQHLITKFMLPSAIEKGDTWLASILEWELGNYSQSFLNALGLESESNSVTGI 1440
VEGRGGPLQQHLIT+FMLPSAIEKGDTWLASILEWELGNYSQSFLN LGLESESNSVTGI
Sbjct: 1381 VEGRGGPLQQHLITQFMLPSAIEKGDTWLASILEWELGNYSQSFLNVLGLESESNSVTGI 1440
Query: 1441 PFLSSRHISLQDPSVGLYCLLLATKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLE 1500
PFLSSRHISLQDPSVGLYCLLLATKNSMKKAVGEQSAEVLCR+ATLMTATALNRCGLPLE
Sbjct: 1441 PFLSSRHISLQDPSVGLYCLLLATKNSMKKAVGEQSAEVLCRLATLMTATALNRCGLPLE 1500
Query: 1501 ALEQMSTCGSITEVSDGTNGVDILCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDL 1560
ALEQMSTCGSITEV DGTNGVDILCFETIRKICKQSPRDSSSW+SVEFAVHLEYRAKLDL
Sbjct: 1501 ALEQMSTCGSITEVPDGTNGVDILCFETIRKICKQSPRDSSSWVSVEFAVHLEYRAKLDL 1560
Query: 1561 AVQYFSKLIRKHPSFPTINLESVGCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSL 1620
AVQYFSKLIRKHPS+PTINLESVGCMGCLKEYEMDYEKSLE FQRKLNVGFAQFEMKFSL
Sbjct: 1561 AVQYFSKLIRKHPSWPTINLESVGCMGCLKEYEMDYEKSLESFQRKLNVGFAQFEMKFSL 1620
Query: 1621 LPASLVSMMLVFLCNVGLQFIGYDIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQ 1680
LPASLVSMMLVFLCNVGLQFIGYDIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQ
Sbjct: 1621 LPASLVSMMLVFLCNVGLQFIGYDIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQ 1680
Query: 1681 EILFSASRYTIACSLSFHKGETGSKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLV 1740
EILFSASRYTIACSLSFHKGETGSKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLV
Sbjct: 1681 EILFSASRYTIACSLSFHKGETGSKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLV 1740
Query: 1741 SKLLTILDLVEYNLYFTSAWLLRDSKCLLKMLQPLLANARSPHDIDVEHLKQLLPQIGEL 1800
SKL +LKMLQPLL NARSPHDIDVEHLK LLPQIGEL
Sbjct: 1741 SKL-----------------------HVLKMLQPLLENARSPHDIDVEHLKLLLPQIGEL 1800
Query: 1801 IAQNLLTDVDYNHQILEGMPNAQSDDIVHSIPGDERWHIIGAVLWHHMSKFMKHKLITLT 1860
IAQNLLTDVDYNHQILEGMPN QSDDIVH IPGDERWHIIGAVLWHHMSKFMKHKLITLT
Sbjct: 1801 IAQNLLTDVDYNHQILEGMPNEQSDDIVHLIPGDERWHIIGAVLWHHMSKFMKHKLITLT 1860
Query: 1861 NASKEGSLSSIILGNLDTWAQSLSTIKSDWKAISKDVIELVSVSLTALLTIVLAQVSSYQ 1920
N SKEGSLSS+ILGNLDTWAQSLSTIKSDWKAISKDVIELVS+SLTALLTIVLAQVSSYQ
Sbjct: 1861 NTSKEGSLSSMILGNLDTWAQSLSTIKSDWKAISKDVIELVSMSLTALLTIVLAQVSSYQ 1920
Query: 1921 LKQLVSSLQYKLDQKLYVATAVWFEQICQSLSSHDKGHTDEIYDMDMCIRGEFETLWNVT 1980
LKQLVSSLQYKLDQKLYVATAVWFEQICQSLSSHDKGHTDE+Y+MDMCI+GEFETLWN+T
Sbjct: 1921 LKQLVSSLQYKLDQKLYVATAVWFEQICQSLSSHDKGHTDEMYNMDMCIKGEFETLWNIT 1980
Query: 1981 SNPNLISDCFTHEKVHMLHCFDRKLSERWSDIYNGITRKEHNCTHEAAHISRSVSDATGS 2040
SNPNLISDCF HEKVHMLHCFDRKLSERWS+IYNGITR E NCTHEAAHISRSVSDATGS
Sbjct: 1981 SNPNLISDCFAHEKVHMLHCFDRKLSERWSEIYNGITRAERNCTHEAAHISRSVSDATGS 2040
Query: 2041 PGKLLRNGKTLVRSDKELATLDDAMPFQKPKEIYRRNGELLEVIFNPIALCINSVDQRQA 2100
PGKLLRNGKTLVRSDKELATLDDAMPFQKPKEI RRNGELLE ALCINSVDQRQA
Sbjct: 2041 PGKLLRNGKTLVRSDKELATLDDAMPFQKPKEICRRNGELLE------ALCINSVDQRQA 2100
Query: 2101 AVASNKKMYVTEYLSNWEDGMASRDDEDYIWSNSEWPLNLNGWAASESTPAPTCVFPGVG 2160
AVASNKK + +WEDGMASRD+EDYIWSNSEWPLN G AASESTPAPTCVFPGVG
Sbjct: 2101 AVASNKKGII---FVSWEDGMASRDEEDYIWSNSEWPLNKWG-AASESTPAPTCVFPGVG 2160
Query: 2161 LGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVD 2220
LGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVD
Sbjct: 2161 LGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVD 2220
Query: 2221 PPATAEHTNSRAFSSHPSRPLFLVGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALA 2280
PPATAEHT++RAFSSHPSRPLFLVGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALA
Sbjct: 2221 PPATAEHTSTRAFSSHPSRPLFLVGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALA 2280
Query: 2281 SISSVQFDQCGHRFATAALDGTVCSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGS 2340
SISSVQFDQCGHRFATAALDGTVCSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGS
Sbjct: 2281 SISSVQFDQCGHRFATAALDGTVCSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGS 2340
Query: 2341 IIAVAGYSSTAVNVVIWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGG 2400
IIAVAGYSSTAVNVVIWDTLAPPKTSQAAIMCHEGGARSISVFDNEIG+GSVSPLIVTGG
Sbjct: 2341 IIAVAGYSSTAVNVVIWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGTGSVSPLIVTGG 2400
Query: 2401 KGGDVAIHDFRYVVTGRTKKQKNCSKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSG 2460
KGGDVAIHDFRYVVTGRTKKQKNCSKDE IS+ASNSDMPSTVGEQNLNGMLWYIPKAHSG
Sbjct: 2401 KGGDVAIHDFRYVVTGRTKKQKNCSKDERISDASNSDMPSTVGEQNLNGMLWYIPKAHSG 2460
Query: 2461 SVTKISSIPNTSLFLTGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRA 2520
SVTKISSIPNTSLFLTGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRA
Sbjct: 2461 SVTKISSIPNTSLFLTGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRA 2513
Query: 2521 AVTDIQVISSGFLTCGGDGLVKLVQFG 2525
AVTDIQVISSGFLTCGGDGLVKLVQFG
Sbjct: 2521 AVTDIQVISSGFLTCGGDGLVKLVQFG 2513
BLAST of CmoCh06G010300 vs. ExPASy TrEMBL
Match:
A0A6J1GIT5 (uncharacterized protein LOC111454255 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111454255 PE=4 SV=1)
HSP 1 Score: 4816.1 bits (12491), Expect = 0.0e+00
Identity = 2378/2394 (99.33%), Postives = 2381/2394 (99.46%), Query Frame = 0
Query: 131 QNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDVPQTLVSASWS 190
+NSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDVPQTLVSASWS
Sbjct: 34 ENSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDVPQTLVSASWS 93
Query: 191 TEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPITMIQWRTSIK 250
TEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPITMIQWRTSIK
Sbjct: 94 TEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPITMIQWRTSIK 153
Query: 251 EKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRFSVAAVIEVNQ 310
EKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRFSVAAVIEVNQ
Sbjct: 154 EKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRFSVAAVIEVNQ 213
Query: 311 ALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISLGPGSLVTFWA 370
ALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISLGPGSLVTFWA
Sbjct: 214 ALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSEGFEHNRAGSCEWLISLGPGSLVTFWA 273
Query: 371 VHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVISRIYPSGSPS 430
VHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVISRIYPSGSPS
Sbjct: 274 VHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVISRIYPSGSPS 333
Query: 431 MCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNLSGHAGKILHV 490
MCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNLSGHAGKILHV
Sbjct: 334 MCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNLSGHAGKILHV 393
Query: 491 AVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLVTQDSCSKYTS 550
AVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLVTQDSCSKYTS
Sbjct: 394 AVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLVTQDSCSKYTS 453
Query: 551 VQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGHGPFENGPTDI 610
VQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGHGPFENGPTDI
Sbjct: 454 VQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGHGPFENGPTDI 513
Query: 611 FSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGVHCNCGIDDENIAE 670
FSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGVHCNCGIDDENIAE
Sbjct: 514 FSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGVHCNCGIDDENIAE 573
Query: 671 LSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQQKLTSSGEPY 730
LSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQQKLTSSGEPY
Sbjct: 574 LSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQQKLTSSGEPY 633
Query: 731 TPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCLTDCGRKIATI 790
TPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCLTDCGRKIATI
Sbjct: 634 TPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCLTDCGRKIATI 693
Query: 791 SKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGNGQFLLGICLQNEL 850
SKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGNGQFLLGICLQNEL
Sbjct: 694 SKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGNGQFLLGICLQNEL 753
Query: 851 RVYSLKHFGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIVIHDHYFCIVSP 910
RVYSLKHFGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIVIHDHYFCIVSP
Sbjct: 754 RVYSLKHFGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIVIHDHYFCIVSP 813
Query: 911 WLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNYDSKRRPRSLTN 970
WLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNYDSKRRPRSLTN
Sbjct: 814 WLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNYDSKRRPRSLTN 873
Query: 971 IHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFHPHALLVNIYSGKWKR 1030
IHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFHPHALLVNIYSGKWKR
Sbjct: 874 IHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFHPHALLVNIYSGKWKR 933
Query: 1031 AYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTSTDKEVQWSMNGLASQFKE 1090
AYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTSTDKEVQWSMNGLASQFKE
Sbjct: 934 AYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTSTDKEVQWSMNGLASQFKE 993
Query: 1091 GVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTETLAIIDLLDE 1150
GVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTETLAIIDLLDE
Sbjct: 994 GVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAGLTSMEKTETLAIIDLLDE 1053
Query: 1151 ISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLEELTIDSRLIGWAYHSDCQ 1210
ISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLEELTIDSRLIGWAYHSDCQ
Sbjct: 1054 ISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLEELTIDSRLIGWAYHSDCQ 1113
Query: 1211 QNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDPKDCMLLYVTL 1270
QNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDPKDCMLLYVTL
Sbjct: 1114 QNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARSQYLKKKDPKDCMLLYVTL 1173
Query: 1271 NRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQLELAVAFFLL 1330
NRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQLELAVAFFLL
Sbjct: 1174 NRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNAYVLMGKHQLELAVAFFLL 1233
Query: 1331 GGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITKFMLPSAIEKGDTWLASIL 1390
GGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITKFMLPSAIEKGDTWLASIL
Sbjct: 1234 GGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITKFMLPSAIEKGDTWLASIL 1293
Query: 1391 EWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSVGLYCLLLATKNSMKKAVG 1450
EWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSVGLYCLLLATKNSMKKAVG
Sbjct: 1294 EWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSVGLYCLLLATKNSMKKAVG 1353
Query: 1451 EQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVSDGTNGVDILCFETIRKIC 1510
EQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVSDGTNGVDILCFETIRKIC
Sbjct: 1354 EQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVSDGTNGVDILCFETIRKIC 1413
Query: 1511 KQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSFPTINLESVGCMGCLKEYE 1570
KQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSFPTINLESVGCMGCLKEYE
Sbjct: 1414 KQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSFPTINLESVGCMGCLKEYE 1473
Query: 1571 MDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCNVGLQFIGYDIFHGFASQE 1630
MDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCNVGLQFIGYDIFHGFASQE
Sbjct: 1474 MDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCNVGLQFIGYDIFHGFASQE 1533
Query: 1631 CPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSLSFHKGETGSKCFDTWWYY 1690
CPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSLSFHKGETGSKCFDTWWYY
Sbjct: 1534 CPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSLSFHKGETGSKCFDTWWYY 1593
Query: 1691 LQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLYFTSAWLLRDSKCLLKMLQ 1750
LQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLYFTSAWLLRDSKCLLKMLQ
Sbjct: 1594 LQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLYFTSAWLLRDSKCLLKMLQ 1653
Query: 1751 PLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQILEGMPNAQSDDIVHSIPG 1810
PLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQILEGMPNAQSDDIVHSIPG
Sbjct: 1654 PLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQILEGMPNAQSDDIVHSIPG 1713
Query: 1811 DERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGNLDTWAQSLSTIKSDWKAI 1870
DERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGNLDTWAQSLSTIKSDWKAI
Sbjct: 1714 DERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGNLDTWAQSLSTIKSDWKAI 1773
Query: 1871 SKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQKLYVATAVWFEQICQSLSS 1930
SKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQKLYVATAVWFEQICQSLSS
Sbjct: 1774 SKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQKLYVATAVWFEQICQSLSS 1833
Query: 1931 HDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDRKLSERWSDIY 1990
HDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDRKLSERWSDIY
Sbjct: 1834 HDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDRKLSERWSDIY 1893
Query: 1991 NGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDDAMPFQKPKEI 2050
NGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDDAMPFQKPKEI
Sbjct: 1894 NGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDDAMPFQKPKEI 1953
Query: 2051 YRRNGELLEVIFNPIALCINSVDQRQAAVASNKKMYVTEYLSNWEDGMASRDDEDYIWSN 2110
YRRNGELLE ALCINSVDQRQAAVASNKK + +WEDGMASRDDEDYIWSN
Sbjct: 1954 YRRNGELLE------ALCINSVDQRQAAVASNKKGII---FVSWEDGMASRDDEDYIWSN 2013
Query: 2111 SEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGA 2170
SEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGA
Sbjct: 2014 SEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGA 2073
Query: 2171 FGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFLVGSTNTHVYL 2230
FGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFLVGSTNTHVYL
Sbjct: 2074 FGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFLVGSTNTHVYL 2133
Query: 2231 WEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTVCSWQLEVGGR 2290
WEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTVCSWQLEVGGR
Sbjct: 2134 WEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTVCSWQLEVGGR 2193
Query: 2291 SNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPPKTSQAAIMCH 2350
SNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPPKTSQAAIMCH
Sbjct: 2194 SNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPPKTSQAAIMCH 2253
Query: 2351 EGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKNCSKDEMISNA 2410
EGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKNCSKDEMISNA
Sbjct: 2254 EGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKNCSKDEMISNA 2313
Query: 2411 SNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDVKLWDAKRAKL 2470
SNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDVKLWDAKRAKL
Sbjct: 2314 SNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDVKLWDAKRAKL 2373
Query: 2471 VHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTCGGDGLVKLVQFG 2525
VHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTCGGDGLVKLVQFG
Sbjct: 2374 VHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTCGGDGLVKLVQFG 2418
BLAST of CmoCh06G010300 vs. ExPASy TrEMBL
Match:
A0A0A0L3T8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G119460 PE=4 SV=1)
HSP 1 Score: 4245.3 bits (11009), Expect = 0.0e+00
Identity = 2099/2530 (82.96%), Postives = 2261/2530 (89.37%), Query Frame = 0
Query: 1 MAGTASEIDPIYRLPLPLLGSEPIPSAPNRF--AGSSIDWIPDFAGYAWVAYGASSVLVI 60
MAGTAS++DPI RLPLPLLGSEPIP APNR GSSIDWIPDFAGYAWVAYGASS+LVI
Sbjct: 1 MAGTASKMDPISRLPLPLLGSEPIPPAPNRLDPLGSSIDWIPDFAGYAWVAYGASSLLVI 60
Query: 61 SHFPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSH 120
SHFPSPLSP ET GPIFRQVLELSGD LS VNAVSWSPVLPSEGELAAA GNRIWVFSH
Sbjct: 61 SHFPSPLSPHETKFGPIFRQVLELSGDHLSAVNAVSWSPVLPSEGELAAAAGNRIWVFSH 120
Query: 121 DLGASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKP 180
DLGASRGSFCWRQNSVL+QSLKVEAIQWT +GDGIIA GVEVVLWKNTN+SWEIAWKFKP
Sbjct: 121 DLGASRGSFCWRQNSVLVQSLKVEAIQWTGSGDGIIACGVEVVLWKNTNKSWEIAWKFKP 180
Query: 181 DVPQTLVSASWSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPL 240
DVPQTLVSASWSTEGPFATA ARISK +N ++ACRSVLV QSEGEYGHVK ELCHPL
Sbjct: 181 DVPQTLVSASWSTEGPFATAPHARISKTENMLTERACRSVLVSQSEGEYGHVKIELCHPL 240
Query: 241 PITMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRR 300
PIT+IQWR S+ K++PR+VLLTCCLDGTVRLWSETENGKV+KFSKDVN+KKS+RR
Sbjct: 241 PITVIQWRPSVNGPEIGKHSPRNVLLTCCLDGTVRLWSETENGKVRKFSKDVNNKKSMRR 300
Query: 301 RFSVAAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSE-GFEHNRAGSCEWL 360
FSVAAV+E+NQAL GTLG DLFVTWATEIRGMC+PFE+TKKV S GFE N+AG+CEWL
Sbjct: 301 HFSVAAVVEINQALKGTLGMDLFVTWATEIRGMCQPFEVTKKVQSSVGFEQNKAGNCEWL 360
Query: 361 ISLGPGSLVTFWAVHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKK 420
ISLGPGSLVTFWAVHCLDDVSPLRFP+VTLWKKQELKG EVGRHY DGCTNLSNKFLLKK
Sbjct: 361 ISLGPGSLVTFWAVHCLDDVSPLRFPQVTLWKKQELKGFEVGRHYTDGCTNLSNKFLLKK 420
Query: 421 VVISRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQ 480
VVISRI+ SGSPS+CSLIQLLPCNSLVWS+LS+ T TDVGD SFD+KR ++ S S ++Q
Sbjct: 421 VVISRIHQSGSPSICSLIQLLPCNSLVWSLLSAHTLTDVGDASFDQKRLESLSSCSFSSQ 480
Query: 481 LNLSGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCG 540
LNLSGHAGKILHVAVHPYNCEVK+AASLDSNGLLLFWSLSSIS C LG PTL TWELCG
Sbjct: 481 LNLSGHAGKILHVAVHPYNCEVKIAASLDSNGLLLFWSLSSISNCALGSPTLTPTWELCG 540
Query: 541 KLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPF 600
KLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSD ENTECHYLCTIPF
Sbjct: 541 KLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDEENTECHYLCTIPF 600
Query: 601 TGHGPFENGPTDIFSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGV 660
TGHGPFENGPT+IFSI LPSDCN TY+FNKFML+G+WM+GFQALSWEITLH YDI GTG+
Sbjct: 601 TGHGPFENGPTNIFSILLPSDCNITYKFNKFMLLGIWMKGFQALSWEITLHAYDISGTGL 660
Query: 661 HCNCGIDDENIAELSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFV 720
HC C ID+EN AELSILTFES+FGSKKYCVSIIPCSSQ PNSQIH+QITSFAVVHQGTFV
Sbjct: 661 HCKCDIDNENRAELSILTFESAFGSKKYCVSIIPCSSQLPNSQIHDQITSFAVVHQGTFV 720
Query: 721 PVQQKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITA 780
PVQQKL SSGEP TPAYIMATGSADG LKLW+SNVGKPSIFHV WELVCVVV HQGPITA
Sbjct: 721 PVQQKLASSGEPSTPAYIMATGSADGCLKLWKSNVGKPSIFHVPWELVCVVVAHQGPITA 780
Query: 781 LCLTDCGRKIATISKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGN 840
L LTDCGRKIATISK++ + S+V LWELA LGAG LLFEDELSFES+IIAVDWLTLGN
Sbjct: 781 LSLTDCGRKIATISKDNLECKTSSVHLWELAYLGAGILLFEDELSFESNIIAVDWLTLGN 840
Query: 841 GQFLLGICLQNELRVYSLKHFG-QTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKST 900
GQFLLGICLQNEL VYSLK FG TL TKSLDT+TWICIG +RTLPSNCGFLWGP++T
Sbjct: 841 GQFLLGICLQNELCVYSLKRFGCHTLLETTKSLDTKTWICIGISRTLPSNCGFLWGPRTT 900
Query: 901 AIVIHDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVNGTD----VAVFADECCSIR 960
AIV+HD YFCIVSPWLFLG NHDAMC+ +YIGETK HHVNGT+ VAVFAD+CC I+
Sbjct: 901 AIVLHDRYFCIVSPWLFLGVTNHDAMCNTHYIGETKTHHVNGTNTNISVAVFADKCCGIK 960
Query: 961 KLSDDNYDSKRRPRSLTNIHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLS 1020
L DD Y+ K RP SLGLISMPD+ DKLCGSLS
Sbjct: 961 TLPDDIYERKYRP---------------------------GSLGLISMPDVVDKLCGSLS 1020
Query: 1021 SFHPHALLVNIYSGKWKRAYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTS 1080
SFHP ALL NIYSGKWKRAYSALSHLIEHLSSDKKSSAN T TIPEI LSDYFEGV KTS
Sbjct: 1021 SFHPQALLFNIYSGKWKRAYSALSHLIEHLSSDKKSSANSTYTIPEIPLSDYFEGVIKTS 1080
Query: 1081 TDKEVQWSMNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAG 1140
TDK VQWS N L+SQFKEGVS W FNWDSISND+SF+PSSTKSEFS+FIEPLEK YE AG
Sbjct: 1081 TDKGVQWSTNSLSSQFKEGVSQWAFNWDSISNDNSFVPSSTKSEFSSFIEPLEKLYELAG 1140
Query: 1141 LTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLE 1200
LTSMEKT+TLAI+DLL EISNKSS+SAYESLDEPGRRYWIA RFQQL+FLRR+ RSAS+E
Sbjct: 1141 LTSMEKTQTLAIVDLLGEISNKSSSSAYESLDEPGRRYWIAWRFQQLQFLRRESRSASME 1200
Query: 1201 ELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARS 1260
EL IDS+LIGWAYHSDCQ+ LL+SV S EPTWQEMRSLGVGIWFTNTTQLR RMEKLARS
Sbjct: 1201 ELAIDSKLIGWAYHSDCQEILLNSVSSNEPTWQEMRSLGVGIWFTNTTQLRTRMEKLARS 1260
Query: 1261 QYLKKKDPKDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNA 1320
QYLKKKDPKDCMLLYVTLNRIQVLAGL KISRDEKDKPLVGFLSRNFQEE+NKAAALKNA
Sbjct: 1261 QYLKKKDPKDCMLLYVTLNRIQVLAGLFKISRDEKDKPLVGFLSRNFQEEKNKAAALKNA 1320
Query: 1321 YVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITK 1380
YVL+G+HQLELAVAFFLLGGD+ SA+ VCAKNLGDEQLALVI LVEGRGGPLQQHLITK
Sbjct: 1321 YVLLGRHQLELAVAFFLLGGDSYSAVSVCAKNLGDEQLALVICHLVEGRGGPLQQHLITK 1380
Query: 1381 FMLPSAIEKGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSV 1440
FMLPSAIEKGDTWLASILEWELGNY++SFLN L L +SNSVTG PFLSS+HI+L DPSV
Sbjct: 1381 FMLPSAIEKGDTWLASILEWELGNYTRSFLNMLRL--DSNSVTGPPFLSSKHIALLDPSV 1440
Query: 1441 GLYCLLLATKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVS 1500
G+YCLLLATKNSMKKAVG QSAE+LC++ATLM AT+LNR GLPLEALE +STCGSIT+VS
Sbjct: 1441 GMYCLLLATKNSMKKAVGVQSAEILCQLATLMMATSLNRRGLPLEALEHVSTCGSITDVS 1500
Query: 1501 DGTNGVDILCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSF 1560
DGTN VDI CF+TI IC++SP DSSSWLSVEFAVHLE++ KLDLA QYFSKLIRKHPS+
Sbjct: 1501 DGTNKVDIQCFDTISNICQKSPGDSSSWLSVEFAVHLEHQVKLDLAAQYFSKLIRKHPSW 1560
Query: 1561 PTINLESVGCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCN 1620
PTIN ESVGCM C KEYEMDYEKSLE +Q KL+VGFAQFEMKFSLLPASLVSMML+FLCN
Sbjct: 1561 PTINFESVGCMSCSKEYEMDYEKSLESYQHKLSVGFAQFEMKFSLLPASLVSMMLLFLCN 1620
Query: 1621 VGLQFIGYDIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSL 1680
+GLQFIG DI GF SQECPDDKN Y+FL+H L+HK+LLKTA+EI FSASRYTIACSL
Sbjct: 1621 LGLQFIGNDIVRGFTSQECPDDKNLTTYSFLVHRLLHKALLKTAREISFSASRYTIACSL 1680
Query: 1681 SFHKGETGSKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLY 1740
SFH GE SKC DTWWYYLQGLLLSLQG+RAALR TH SL DD VSKLLTILDLVEYNLY
Sbjct: 1681 SFHGGEIRSKCLDTWWYYLQGLLLSLQGVRAALRTTHDSLNDDRVSKLLTILDLVEYNLY 1740
Query: 1741 FTSAWLLRDSKCLLKMLQPLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQI 1800
FTSAWLLRDS+CLLKM+Q LLAN +SPHD+++E LKQLL Q GELIAQNL +DVD+NH+I
Sbjct: 1741 FTSAWLLRDSRCLLKMVQLLLANEQSPHDVEIERLKQLLSQFGELIAQNLSSDVDHNHEI 1800
Query: 1801 LEGMPNAQSDDIVHSIPGDERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGN 1860
LEGM N + DDIVHSIPGDERWHIIGA LWHHMSKF+KHKL TLTN SKEGS S I LGN
Sbjct: 1801 LEGMANEEYDDIVHSIPGDERWHIIGACLWHHMSKFIKHKLTTLTNKSKEGSFSGITLGN 1860
Query: 1861 LDTWAQSLSTIKSDWKAISKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQK 1920
L++W LST+KSD I K++IEL+S + T+LLTIVLAQ SSYQLKQLVS LQYKLDQ+
Sbjct: 1861 LNSWVPCLSTVKSDQNDILKNMIELISKNFTSLLTIVLAQASSYQLKQLVSFLQYKLDQR 1920
Query: 1921 LYVATAVWFEQICQSLSSHDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKV 1980
L VAT VWFEQ +S S H K H DE+Y++DMC +GEFETLWN+TSNPNL+S+CF HEKV
Sbjct: 1921 LCVATVVWFEQFSKS-SEHKKHHADEMYNIDMCNKGEFETLWNITSNPNLVSECFAHEKV 1980
Query: 1981 HMLHCFDRKLSERWSDIYNGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSD 2040
H+LHCFDRKLS+RW+DIYNG TR E C+ E A I+ S SD GSPGKLLR+G+TLV S+
Sbjct: 1981 HLLHCFDRKLSKRWTDIYNGTTRPEETCSREGALINSSASDTIGSPGKLLRSGRTLVSSE 2040
Query: 2041 KELATLDDAMPFQKPKEIYRRNGELLEVIFNPIALCINSVDQRQAAVASNKKMYVTEYLS 2100
KELATLDD MPFQKPKEIYRRNGELLE ALCINSVD RQAA+ASNKK +
Sbjct: 2041 KELATLDDVMPFQKPKEIYRRNGELLE------ALCINSVDGRQAALASNKKGII---FF 2100
Query: 2101 NWEDGMASRDDEDYIWSNSEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGAT 2160
+WEDGMASRD+EDYIWSNSEWPLNLNGWA SESTPAPTCVFPGVGLG++KGAHLGLGGAT
Sbjct: 2101 SWEDGMASRDEEDYIWSNSEWPLNLNGWAGSESTPAPTCVFPGVGLGTNKGAHLGLGGAT 2160
Query: 2161 LGVGSSVRPGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSS 2220
+GVGS RPGRDLTGGGAFGISGYAG+GASGLGWETQEDFEEFVDPPATAEHT++RAFSS
Sbjct: 2161 VGVGSPARPGRDLTGGGAFGISGYAGMGASGLGWETQEDFEEFVDPPATAEHTSTRAFSS 2220
Query: 2221 HPSRPLFLVGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFA 2280
HPSRPLFLVGSTNTHVYLWEFGK+RATATYGVLPAANVPPPYALASISSVQFDQCGHRFA
Sbjct: 2221 HPSRPLFLVGSTNTHVYLWEFGKDRATATYGVLPAANVPPPYALASISSVQFDQCGHRFA 2280
Query: 2281 TAALDGTVCSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVV 2340
TAALDGTVCSWQLEVGGRSNV PTESSLCFNGHASDVTYVTSSGSIIAVAGYSS+AVNVV
Sbjct: 2281 TAALDGTVCSWQLEVGGRSNVCPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSSAVNVV 2340
Query: 2341 IWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVT 2400
IWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDV +HDFRYVVT
Sbjct: 2341 IWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVGLHDFRYVVT 2400
Query: 2401 GRTKKQKNCSKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFL 2460
GR K K+ K E IS+ASN++M TVGEQNLNGMLWYIPKAHSGSVTKI+SIPNTSLFL
Sbjct: 2401 GRNK--KHSPKGERISDASNTNMLGTVGEQNLNGMLWYIPKAHSGSVTKITSIPNTSLFL 2460
Query: 2461 TGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTC 2520
TGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVI+SGFLTC
Sbjct: 2461 TGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVIASGFLTC 2489
Query: 2521 GGDGLVKLVQ 2523
GGDGLVKLVQ
Sbjct: 2521 GGDGLVKLVQ 2489
BLAST of CmoCh06G010300 vs. ExPASy TrEMBL
Match:
A0A1S3AVL8 (uncharacterized protein LOC103483174 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103483174 PE=4 SV=1)
HSP 1 Score: 4239.5 bits (10994), Expect = 0.0e+00
Identity = 2099/2530 (82.96%), Postives = 2261/2530 (89.37%), Query Frame = 0
Query: 1 MAGTASEIDPIYRLPLPLLGSEPIPSAPNRF--AGSSIDWIPDFAGYAWVAYGASSVLVI 60
MAGTAS++DPI RLPLPLLGSEPIPSAPNR GSSIDWIPDFAGYAWVAYGASS+LVI
Sbjct: 1 MAGTASKMDPISRLPLPLLGSEPIPSAPNRLDPPGSSIDWIPDFAGYAWVAYGASSLLVI 60
Query: 61 SHFPSPLSPQETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSH 120
SHFPSPLSP ET GPIFRQVLELSGD LS VNAVSWSPVLPSEGELAAA GNRIWVFSH
Sbjct: 61 SHFPSPLSPNETKFGPIFRQVLELSGDHLSAVNAVSWSPVLPSEGELAAAAGNRIWVFSH 120
Query: 121 DLGASRGSFCWRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKP 180
DLGASRGSFCWRQNSVL+QSLKVEAIQWT AGDGIIA GVEVVLWKNTN+SWEIAWKFKP
Sbjct: 121 DLGASRGSFCWRQNSVLVQSLKVEAIQWTGAGDGIIACGVEVVLWKNTNKSWEIAWKFKP 180
Query: 181 DVPQTLVSASWSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPL 240
DV QTLVSASWSTEGPFATA ARISK +NT +KACRSVLV QSEGEYGHVK ELCHPL
Sbjct: 181 DVLQTLVSASWSTEGPFATAPHARISKTENTLTEKACRSVLVSQSEGEYGHVKIELCHPL 240
Query: 241 PITMIQWRTSIKEKGSSKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRR 300
PIT+IQWR S+ +K++PRHVLLTCCLDGTVRLWSETENGKV+KFSKDVN++KS RR
Sbjct: 241 PITVIQWRPSVNGPEFAKHSPRHVLLTCCLDGTVRLWSETENGKVRKFSKDVNNRKSTRR 300
Query: 301 RFSVAAVIEVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFSE-GFEHNRAGSCEWL 360
FSVAAV+E+NQAL GTLG DLFVTWATEIRGMC+PF++TKKV S GFE N+AG+CEWL
Sbjct: 301 HFSVAAVVEINQALKGTLGMDLFVTWATEIRGMCQPFDVTKKVQSSVGFEQNKAGNCEWL 360
Query: 361 ISLGPGSLVTFWAVHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKK 420
ISLGPGSLVTFWAVHCLD+VSPLRFPR+TLWKKQELKG EVGRHY DGCTNLSNKFLLKK
Sbjct: 361 ISLGPGSLVTFWAVHCLDEVSPLRFPRITLWKKQELKGFEVGRHYTDGCTNLSNKFLLKK 420
Query: 421 VVISRIYPSGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQ 480
VVISRI+ SGSPS+CSLIQLLPCNSLVWS+LS+ T TDVGD SFD+KR ++ FS S ++Q
Sbjct: 421 VVISRIHQSGSPSICSLIQLLPCNSLVWSLLSAHTLTDVGDASFDQKRLESLFSCSSSSQ 480
Query: 481 LNLSGHAGKILHVAVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCG 540
LNLSGHAGKILHVAVHPYNCEVK+AASLDSNGLLLFWSLSSIS CVLGPPTL TWELCG
Sbjct: 481 LNLSGHAGKILHVAVHPYNCEVKIAASLDSNGLLLFWSLSSISNCVLGPPTLTPTWELCG 540
Query: 541 KLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPF 600
KLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSD ENTECHYLCTIPF
Sbjct: 541 KLVTQDSCSKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDEENTECHYLCTIPF 600
Query: 601 TGHGPFENGPTDIFSISLPSDCNTTYRFNKFMLIGLWMEGFQALSWEITLHTYDIFGTGV 660
TGHGPFENGPT+IFSI LPSD N TY+FNKFML+G+WM+GFQALSWEITLH YDI GTG+
Sbjct: 601 TGHGPFENGPTNIFSILLPSDINITYKFNKFMLLGVWMKGFQALSWEITLHAYDISGTGI 660
Query: 661 HCNCGIDDENIAELSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFV 720
HC C ID+EN AELSIL FES+FG+KKYCVSIIPCSSQ PNSQIH+QITSFAVVHQGTFV
Sbjct: 661 HCKCDIDNENRAELSILRFESAFGTKKYCVSIIPCSSQLPNSQIHDQITSFAVVHQGTFV 720
Query: 721 PVQQKLTSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITA 780
PVQQKL+SSGEP TPAYIMATGSADGSLKLW+SNVGKPSIFHV WELVCVVVTHQGPITA
Sbjct: 721 PVQQKLSSSGEPSTPAYIMATGSADGSLKLWKSNVGKPSIFHVPWELVCVVVTHQGPITA 780
Query: 781 LCLTDCGRKIATISKNSHKPNISNVRLWELACLGAGTLLFEDELSFESSIIAVDWLTLGN 840
L LTDCGRKIATISK++ + SNV LWELA LGAGTLLFEDELSFES+IIAVDWLTLGN
Sbjct: 781 LSLTDCGRKIATISKDNLECKTSNVHLWELAYLGAGTLLFEDELSFESNIIAVDWLTLGN 840
Query: 841 GQFLLGICLQNELRVYSLKHFG-QTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKST 900
GQFLLGICLQNEL VYSLK FG TLS TKSLD +TWICIG +RTLPSNCGF WGP++T
Sbjct: 841 GQFLLGICLQNELCVYSLKRFGCHTLSETTKSLDAKTWICIGISRTLPSNCGFRWGPRTT 900
Query: 901 AIVIHDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVNGT----DVAVFADECCSIR 960
AIV+HD YFCIVSPWLFLG N DAMC+ +YIGETK HHVNGT AVFAD+CC I+
Sbjct: 901 AIVLHDRYFCIVSPWLFLGVTNPDAMCNTHYIGETKTHHVNGTTTNISAAVFADKCCGIK 960
Query: 961 KLSDDNYDSKRRPRSLTNIHAETNILSNSSYPRGAQMKSTTSLGLISMPDIADKLCGSLS 1020
L DD Y+SK RP SLGLISMPD+ DKLCGSLS
Sbjct: 961 TLPDDIYESKYRP---------------------------GSLGLISMPDVVDKLCGSLS 1020
Query: 1021 SFHPHALLVNIYSGKWKRAYSALSHLIEHLSSDKKSSANPTNTIPEILLSDYFEGVAKTS 1080
SFHP ALL NIYSGKWKRAYSALSHLIEHLSSDKKSSAN T TIPEI LSDYFEGV KTS
Sbjct: 1021 SFHPQALLFNIYSGKWKRAYSALSHLIEHLSSDKKSSANSTYTIPEIPLSDYFEGVIKTS 1080
Query: 1081 TDKEVQWSMNGLASQFKEGVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKFYESAG 1140
TDK VQWS N L+SQFKEGVS W FNWDSISND+SFIPSSTKSEFS+F+EPLEK YE AG
Sbjct: 1081 TDKGVQWSTNSLSSQFKEGVSQWAFNWDSISNDNSFIPSSTKSEFSSFVEPLEKLYELAG 1140
Query: 1141 LTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGRSASLE 1200
LTSMEKT+TLAI+DLL EISNKSS+SAYESLDEPGRRYWIALRFQQL+FLRR+ RSAS+E
Sbjct: 1141 LTSMEKTQTLAIVDLLGEISNKSSSSAYESLDEPGRRYWIALRFQQLQFLRRESRSASVE 1200
Query: 1201 ELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARMEKLARS 1260
EL IDS+LIGWAYHSDCQ+ LL+SV S EPTWQEMRSLGVGIWFTNTTQLR RMEKLARS
Sbjct: 1201 ELAIDSKLIGWAYHSDCQEILLNSVSSNEPTWQEMRSLGVGIWFTNTTQLRIRMEKLARS 1260
Query: 1261 QYLKKKDPKDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAAALKNA 1320
QYLKKKDPKDCMLLYVTLNRIQVLAGL KISRDEKDKPLVGFLSRNFQEE+NKAAALKNA
Sbjct: 1261 QYLKKKDPKDCMLLYVTLNRIQVLAGLFKISRDEKDKPLVGFLSRNFQEEKNKAAALKNA 1320
Query: 1321 YVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQHLITK 1380
YVL+GKHQLELAVAFFLLGGDTSSA+ VCAK LGDEQLALVI LVEGRGGPLQQHLITK
Sbjct: 1321 YVLLGKHQLELAVAFFLLGGDTSSAVSVCAKTLGDEQLALVICHLVEGRGGPLQQHLITK 1380
Query: 1381 FMLPSAIEKGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISLQDPSV 1440
FMLPSAIEKGDTWLASILEWELGNY+QSFLN L L ESNSVTG PFLSS+HI+L DPSV
Sbjct: 1381 FMLPSAIEKGDTWLASILEWELGNYTQSFLNVLRL--ESNSVTGPPFLSSKHIALLDPSV 1440
Query: 1441 GLYCLLLATKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLEALEQMSTCGSITEVS 1500
G+YC LLA KNSMKKAVG QSAE+LC++ATLM ATALNR GLPLEALE +STCGSIT+VS
Sbjct: 1441 GMYCRLLANKNSMKKAVGVQSAEILCQLATLMMATALNRSGLPLEALEHVSTCGSITDVS 1500
Query: 1501 DGTNGVDILCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFSKLIRKHPSF 1560
DGTN VDI CF+TI KIC++ PRDSSSWLSVEFAVHLE++AK DLA QYFS LIRKHPS+
Sbjct: 1501 DGTNKVDIQCFDTISKICQKYPRDSSSWLSVEFAVHLEHQAKTDLAAQYFSNLIRKHPSW 1560
Query: 1561 PTINLESVGCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSLLPASLVSMMLVFLCN 1620
PT+N ESVGCM KEYEMDYEKSLE +Q KL+VGFAQFEMKFSLLPASLVSMML+FLCN
Sbjct: 1561 PTVNFESVGCMLFSKEYEMDYEKSLESYQHKLSVGFAQFEMKFSLLPASLVSMMLLFLCN 1620
Query: 1621 VGLQFIGYDIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSASRYTIACSL 1680
+GLQFIG DI GF SQECPDDKN Y+FL+H L+HK+LLKTAQEI SASRYTIACSL
Sbjct: 1621 LGLQFIGNDIVQGFTSQECPDDKNLTTYSFLVHRLLHKALLKTAQEISLSASRYTIACSL 1680
Query: 1681 SFHKGETGSKCFDTWWYYLQGLLLSLQGLRAALRITHGSLKDDLVSKLLTILDLVEYNLY 1740
SFH+GE SKC DTWWYYLQGLLLSLQG+RAALR TH SL DD V KLLTILDLVEY+LY
Sbjct: 1681 SFHRGEIRSKCLDTWWYYLQGLLLSLQGVRAALRSTHDSLNDDCVFKLLTILDLVEYDLY 1740
Query: 1741 FTSAWLLRDSKCLLKMLQPLLANARSPHDIDVEHLKQLLPQIGELIAQNLLTDVDYNHQI 1800
FTSAWLLRDS+CLLKM+Q LLAN +SP D+++E LKQLL Q GELIAQNLL+DVDYNH+I
Sbjct: 1741 FTSAWLLRDSRCLLKMVQLLLANEQSPLDVEMERLKQLLSQFGELIAQNLLSDVDYNHEI 1800
Query: 1801 LEGMPNAQSDDIVHSIPGDERWHIIGAVLWHHMSKFMKHKLITLTNASKEGSLSSIILGN 1860
LEG+PN + DDIVHSIPGDERWHIIGA LWHH+SKF++HKL TLTN SKEGS S + L N
Sbjct: 1801 LEGVPNEEYDDIVHSIPGDERWHIIGACLWHHVSKFIRHKLTTLTNKSKEGSFSGLTLRN 1860
Query: 1861 LDTWAQSLSTIKSDWKAISKDVIELVSVSLTALLTIVLAQVSSYQLKQLVSSLQYKLDQK 1920
L++W LSTIKSD I K++IEL+S + T+LLTIVLAQ SSYQLKQLVS LQYKLD++
Sbjct: 1861 LNSWVPGLSTIKSDQNDILKNMIELISTNFTSLLTIVLAQASSYQLKQLVSFLQYKLDKR 1920
Query: 1921 LYVATAVWFEQICQSLSSHDKGHTDEIYDMDMCIRGEFETLWNVTSNPNLISDCFTHEKV 1980
L VAT VWFEQ +S S H K H DE+Y++DMC +GEFETLW++TSNPNL+S+CF HEKV
Sbjct: 1921 LCVATVVWFEQFSKS-SEHKKHHADEMYNIDMCNKGEFETLWSITSNPNLVSECFAHEKV 1980
Query: 1981 HMLHCFDRKLSERWSDIYNGITRKEHNCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSD 2040
H+LHCFDRKLS+RW+DIYNG TR E C E A I+ S SD TGSPGKLLR+G+TLV S+
Sbjct: 1981 HLLHCFDRKLSKRWTDIYNGTTRPEETCGRECALINSSASDTTGSPGKLLRSGRTLVSSE 2040
Query: 2041 KELATLDDAMPFQKPKEIYRRNGELLEVIFNPIALCINSVDQRQAAVASNKKMYVTEYLS 2100
KELATLDD MPFQKPKEIYRRNGELLE ALCINSVD RQAA+ASNKK +
Sbjct: 2041 KELATLDDVMPFQKPKEIYRRNGELLE------ALCINSVDGRQAALASNKKGII---FF 2100
Query: 2101 NWEDGMASRDDEDYIWSNSEWPLNLNGWAASESTPAPTCVFPGVGLGSSKGAHLGLGGAT 2160
+WEDGMASRD+EDYIWSNSEWPLNLNGWA SESTPAPTCVFPGVGLGS+KGAHLGLGGAT
Sbjct: 2101 SWEDGMASRDEEDYIWSNSEWPLNLNGWAGSESTPAPTCVFPGVGLGSNKGAHLGLGGAT 2160
Query: 2161 LGVGSSVRPGRDLTGGGAFGISGYAGVGASGLGWETQEDFEEFVDPPATAEHTNSRAFSS 2220
+G+GS RP RDLTGGGAFGISGYAG+GASGLGWETQEDFEEFVDPPATAEHT++RAFSS
Sbjct: 2161 VGIGSPARPARDLTGGGAFGISGYAGMGASGLGWETQEDFEEFVDPPATAEHTSTRAFSS 2220
Query: 2221 HPSRPLFLVGSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFA 2280
HPSRPLFLVGSTNTHVYLWEFGK+RATATYGVLPAANVPPPYALASISSVQFDQCGHRFA
Sbjct: 2221 HPSRPLFLVGSTNTHVYLWEFGKDRATATYGVLPAANVPPPYALASISSVQFDQCGHRFA 2280
Query: 2281 TAALDGTVCSWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVV 2340
TAALDGTVCSWQLEVGGRSNV PTESSLCFNGHASDVTYVTSSGSIIAVAGYSS+AVNVV
Sbjct: 2281 TAALDGTVCSWQLEVGGRSNVCPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSSAVNVV 2340
Query: 2341 IWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVT 2400
IWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDV +HDFRYVVT
Sbjct: 2341 IWDTLAPPKTSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVGLHDFRYVVT 2400
Query: 2401 GRTKKQKNCSKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFL 2460
GRTK K+ K E IS+ASN++M TVGEQNLNGMLWYIPKAHSGSVTKI+SIPNTSLFL
Sbjct: 2401 GRTK--KHSPKGERISDASNTNMLGTVGEQNLNGMLWYIPKAHSGSVTKITSIPNTSLFL 2460
Query: 2461 TGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVISSGFLTC 2520
TGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVI+SGFLTC
Sbjct: 2461 TGSKDGDVKLWDAKRAKLVHHWPKLHDRHTFLQPSSRGFGEVVRAAVTDIQVIASGFLTC 2489
Query: 2521 GGDGLVKLVQ 2523
GGDGLVKLVQ
Sbjct: 2521 GGDGLVKLVQ 2489
BLAST of CmoCh06G010300 vs. TAIR 10
Match:
AT2G46560.1 (transducin family protein / WD-40 repeat family protein )
HSP 1 Score: 2122.4 bits (5498), Expect = 0.0e+00
Identity = 1193/2580 (46.24%), Postives = 1584/2580 (61.40%), Query Frame = 0
Query: 9 DPIYRLPLPLLGSEPIPSAPNRFAGSSIDWIPDFAGYAWVAYGASSVLVISHFPSPLSPQ 68
D I LPL L SE +P AP R + SSIDW+PDFA Y+W+AYGAS+++VISH PSPL +
Sbjct: 22 DRIDHLPLRQLRSEIVPPAPTR-SQSSIDWLPDFANYSWLAYGASTLVVISHLPSPLRGE 81
Query: 69 ETTIGPIFRQVLELSGDDLSVVNAVSWSPVLPSEGELAAAVGNRIWVFSHDLGASRGSFC 128
++T GP FRQ+LE+SG+ V AV WSPV PS GELA GN I++F+ DL +GSFC
Sbjct: 82 DSTNGPFFRQILEVSGEP---VTAVCWSPVTPSVGELAVGSGNYIFLFARDL---KGSFC 141
Query: 129 WRQNSVLLQSLKVEAIQWTAAGDGIIAGGVEVVLWKNTNRSWEIAWKFKPDVPQTLVSAS 188
W QN++L+Q VEAI+WT +GDGII GG ++VLWK N+SWEIAWKF D Q LVS++
Sbjct: 142 WSQNAILVQETIVEAIEWTGSGDGIIVGGTDIVLWKRRNQSWEIAWKFSGDHLQDLVSST 201
Query: 189 WSTEGPFATASQARISKMDNTFIDKACRSVLVCQSEGEYGHVKSELCHPLPITMIQWRTS 248
WS EGPFATA+ R + D A +SVL S+GE H EL HP I+MIQWR
Sbjct: 202 WSFEGPFATATSWRKFPAE---CDDAGKSVLAYYSDGESYH-NFELPHPQRISMIQWRPM 261
Query: 249 IKEKGS--SKNTPRHVLLTCCLDGTVRLWSETENGKVKKFSKDVNSKKSLRRRFSVAAVI 308
E+ + + R+VL+TCCLDG VRLW E + GK KK KDV K + F VAAVI
Sbjct: 262 AAEQSAIGIGKSMRNVLMTCCLDGAVRLWCEVDGGKTKKGMKDVPDHK---KSFCVAAVI 321
Query: 309 EVNQALNGTLGTDLFVTWATEIRGMCKPFELTKKVFS-EGFEHNRAGSCEWLISLGPGSL 368
E+NQ L+G LG DLF+ W T G+ K E T +VFS E +++ G CEWL+ GPG+
Sbjct: 322 EINQVLDGCLGRDLFLFWGTRTGGIFKTIEGTNQVFSMEKYDNENVGKCEWLVGYGPGNF 381
Query: 369 VTFWAVHCLDDVSPLRFPRVTLWKKQELKGLEVGRHYIDGCTNLSNKFLLKKVVISRIYP 428
T WAVHCLDD+SP+RFPRVTLW KQE + G + T S++ LKKV + R
Sbjct: 382 ATLWAVHCLDDISPMRFPRVTLWAKQESNEIGAGSLSLASATGSSDRLPLKKVSVLRNNL 441
Query: 429 SGSPSMCSLIQLLPCNSLVWSILSSQTSTDVGDVSFDKKRSDNFFSRSVTTQLNLSGHAG 488
G+P +CS I L P N++ WS L + S D D S +K V L L GH G
Sbjct: 442 YGTPLICSSIYLSPQNTVYWSSLHTIKSHDSEDSSPNKSSLLKCIDGKV---LYLDGHGG 501
Query: 489 KILHVAVHPYNCEVKVAASLDSNGLLLFWSLSSISTCVLGPPTLNHTWELCGKLVTQDSC 548
KIL VA P+ CE ASLDSNGL++ S S + P +W+ CG+L Q+
Sbjct: 502 KILQVASDPFVCEAGYTASLDSNGLIIICSSSVYLNRTIEHPISVASWKPCGRLQNQEFR 561
Query: 549 SKYTSVQWAPSILDEELILLMGHARGIDFFAVRISQSDGENTECHYLCTIPFTGHGPFEN 608
KYTS+ WAPS L +E LL+GH G+D F+VR + HY+CTIPFT + P ++
Sbjct: 562 LKYTSLCWAPSSLKDERFLLVGHVGGVDCFSVRNCGKGDDGYLTHYICTIPFTVNSPLQS 621
Query: 609 GPTDIFSISLPSDCNTTYRFNKFMLIGLWM--EGFQALSWEITLHTYDIFGTGVHCNCGI 668
GPT IF+ L + C T++ N+F+L+ +WM + F ALSW +TLH +D G+ C+C
Sbjct: 622 GPTSIFAKPLSNSCGKTFKSNRFLLLSVWMKEKRFDALSWSVTLHHFDTAGS--TCDCHF 681
Query: 669 DDENIAELSILTFESSFGSKKYCVSIIPCSSQFPNSQIHEQITSFAVVHQGTFVPVQQKL 728
D + L FE +F K C++I CSS+ P S +++TSFAVV+ P + L
Sbjct: 682 HDFDSIGLGKWLFEDTFAGKTNCLAIRSCSSEIPESHREDEVTSFAVVN-----PSGRDL 741
Query: 729 TSSGEPYTPAYIMATGSADGSLKLWRSNVGKPSIFHVSWELVCVVVTHQGPITALCLTDC 788
+ + AY +ATG ADGSLKLWRS+ + S WELV ++ Q P++A+ LTD
Sbjct: 742 ENGVNSESQAYTIATGQADGSLKLWRSSFQESSTPSGLWELVGMLTVGQNPVSAISLTDS 801
Query: 789 GRKIATISKNSHKPNISNVRLWELA-CLGAGTLLFEDELSFESSIIAVDWLTLGNGQFLL 848
G KIA + SH V +WE+ + +G + ED++ ++ ++AV W T GN Q LL
Sbjct: 802 GHKIAALCTESHSKAARAVSIWEIVHLIDSGVFILEDKVHVDAEVVAVRWSTTGNDQLLL 861
Query: 849 GICLQNELRVYSLKH---FGQTLSGITKSLDTETWICIGFARTLPSNCGFLWGPKSTAIV 908
G+C Q E+RVY + + + S + + W C RT + WGPK+ +
Sbjct: 862 GVCTQIEMRVYGIARQPCKSTSFAAYDYSSEAQIWQCFAVTRTFSAIHDLWWGPKAMTCL 921
Query: 909 IHDHYFCIVSPWLFLGDKNHDAMCSPYYIGETKNHHVNGTDVAVFADECCSIRKLSDDNY 968
+H+ Y + WL + DK P + VN T+ ++ +
Sbjct: 922 VHNDYISLHGQWLAVVDKKQKIDNYPEIFASNLPNLVNATEEGRDSEFLSDSGTNDINEA 981
Query: 969 DSKRRPRSLTNIHAETNILS----NSSYPRGAQMKSTTSLGLISMPDIADKLCGSLSSFH 1028
D+ R + + +N + NS G S T ++SM + +KL G+L +H
Sbjct: 982 DTTSTSRGCIPLPSTSNAIDDGQVNSMSLIGTAYGSNTIDDIMSMGHMVEKLGGALPLYH 1041
Query: 1029 PHALLVNIYSGKWKRAYSALSHLIEHLSS---DKKSSANPTNTIPEILLSDYFEG-VAKT 1088
PHALLV I SG WKRA +AL HL E+++S +K A + P+ILLS Y+EG ++
Sbjct: 1042 PHALLVAIRSGNWKRASAALRHLAEYITSSDTSEKGYAVKSVLCPDILLSKYYEGSLSNG 1101
Query: 1089 STDKEVQWSMNGLA----SQFKEGVSPWTFNWDSISNDSSFIPSSTKSEFSTFIEPLEKF 1148
K+ QW + SQF+ G+ FN +S S +S +T EFS F E L+K
Sbjct: 1102 PNPKDFQWGGTSGSMLQYSQFQSGLQS-KFNMESYSPNS----PATDLEFSGFCEQLKKL 1161
Query: 1149 YESAGLTSMEKTETLAIIDLLDEISNKSSASAYESLDEPGRRYWIALRFQQLRFLRRDGR 1208
+ ++ +E + AI+DLL EISN S S Y SLDEPGRR+W+ LRF+QL R G+
Sbjct: 1162 SDEGNISRIEILQYFAIVDLLCEISNPHSTSVYASLDEPGRRFWVTLRFKQLFLARSSGK 1221
Query: 1209 SASLEELTIDSRLIGWAYHSDCQQNLLDSVISKEPTWQEMRSLGVGIWFTNTTQLRARME 1268
+ASLEEL IDS +IGWA+HS+ Q+NL S++ E +WQ+MRS G G W++N QLR+RME
Sbjct: 1222 TASLEELDIDSSMIGWAFHSESQENLSGSLLPNESSWQQMRSQGFGFWYSNAAQLRSRME 1281
Query: 1269 KLARSQYLKKKDPKDCMLLYVTLNRIQVLAGLLKISRDEKDKPLVGFLSRNFQEERNKAA 1328
KLAR QYLK K+PKDC LLY+ LNR+QVLAGL K+S+DEKDKPLV FLSRNFQEE+NKAA
Sbjct: 1282 KLARQQYLKNKNPKDCALLYIALNRVQVLAGLFKLSKDEKDKPLVVFLSRNFQEEKNKAA 1341
Query: 1329 ALKNAYVLMGKHQLELAVAFFLLGGDTSSAIRVCAKNLGDEQLALVISLLVEGRGGPLQQ 1388
ALKNAYVLMGKHQLELA+ FFLLGG+ SSAI VC KNL DEQLALVI L++G+GG L+
Sbjct: 1342 ALKNAYVLMGKHQLELAIGFFLLGGEASSAINVCVKNLQDEQLALVICRLIDGQGGALES 1401
Query: 1389 HLITKFMLPSAIEKGDTWLASILEWELGNYSQSFLNALGLESESNSVTGIPFLSSRHISL 1448
+LI K++LPSA+++GD WLAS+L+WELG Y +S L G N T +SS H+S
Sbjct: 1402 NLIKKYILPSAVQRGDFWLASLLKWELGEYHRSILAMAG--CLENPATESSTVSSNHVSF 1461
Query: 1449 QDPSVGLYCLLLATKNSMKKAVGEQSAEVLCRIATLMTATALNRCGLPLEALEQMSTC-- 1508
DPS+GLYCL+LATKNS+K A+GE++A L R A+LM ATA +RCGLPLEALE +S
Sbjct: 1462 VDPSIGLYCLMLATKNSVKNALGERTASTLSRWASLMAATAFSRCGLPLEALECLSPSAS 1521
Query: 1509 --GSITEVSDGTNGVDILCFETIRKICKQSPRDSSSWLSVEFAVHLEYRAKLDLAVQYFS 1568
G + S +NG T + + S SS+W+S + ++ +L LAVQ+ S
Sbjct: 1522 GHGGTHQTSVPSNGQ----LHTTQGVFDHSVPHSSNWVSSGVSSTVDTHFRLGLAVQFLS 1581
Query: 1569 KLIRKHPSFPTINLESVGCMGCLKEYEMDYEKSLERFQRKLNVGFAQFEMKFSLLPASLV 1628
++R+ + P +N E V C + RFQ KL QF +FSL + L
Sbjct: 1582 MILRE-ATAPLMNSEVVSC------------EKFSRFQHKLQTALEQFHQRFSLSASYLR 1641
Query: 1629 SMMLVFLCNVGLQFIGYDIFHGFASQECPDDKNQKIYTFLLHPLVHKSLLKTAQEILFSA 1688
+MM++ N GL +G++IF +S DDK+ L + + K +LK E
Sbjct: 1642 NMMILSAYNRGLLSMGHNIFQENSSSGLSDDKSHTDEDLLQYSALSKLILKATDEKSLVL 1701
Query: 1689 SRYTIACSLS-------FHKGETGSKCFDTW----WYYLQGLLLSLQGLRAALRITHGSL 1748
SR ACS++ F + + S W +Y QG+L S LR ++R+ GS
Sbjct: 1702 SRIIAACSVTCLHSVPCFEENKVSSGPDPKWSNALRFYFQGILESFSNLRTSIRLCLGSS 1761
Query: 1749 KDDLVSKLLTILDLVEYNLYFTSAWLLRDSKCLLKMLQPLLA---NARSPHDIDVEHLKQ 1808
+DL +KL +LDLVEY L AW+L D CL +M+QPL+ N P+++D+E +K+
Sbjct: 1762 VEDLKTKLAVVLDLVEYCLRLAMAWVLGDVHCLFRMVQPLVISYFNGHMPYEVDLESVKR 1821
Query: 1809 LLPQIGELIAQNLLTDVDYNHQILEGMPNAQSDDIVHSIPGDERWHIIGAVLWHHMSKFM 1868
+ Q + + +DV N + + N V+SIP DER + A W H+S F+
Sbjct: 1822 VYHQEASVSVPD-ASDVGVNSKFSSVVENHGVGYPVYSIPEDERCLVTQACFWKHVSDFV 1881
Query: 1869 KHKLITLTNASKEGSLSSIILGNLDTWAQSLSTIKSDWKAISKDVIELVSVSLTALLTIV 1928
K KL++++ +G +S N D AQ+ D +++ ++ ++ +L +
Sbjct: 1882 KLKLVSISINLDDGISNSGSAENFD--AQTSLDSSDDIVCVTEKIMSVLGKTLIS----T 1941
Query: 1929 LAQVSSYQLKQLVSSLQYKLDQKLYVATAVWFEQICQSLSSH-------DKG-HTDEIYD 1988
LAQ+SSY +KQLV L+ KL+++L V T +W + CQ ++ D G T++ D
Sbjct: 1942 LAQLSSYHVKQLVLVLKQKLEKRLQVPTLLWLLE-CQGSQANFLNRDIPDAGVETEKNGD 2001
Query: 1989 MDMCIRGEFETLWNVTSNPNLISDCFTHEKVHMLHCFDRKLSERWSDIYNGITRKEH--- 2048
+ +R W + +P+L+ + F E + K E WSD+Y + RK
Sbjct: 2002 PVVSVR-----FWKLCVDPHLLHEAFLLENFDIFEWSKSKPLEDWSDMYREVIRKNELYV 2061
Query: 2049 NCTHEAAHISRSVSDATGSPGKLLRNGKTLVRSDKELATLDDAMPFQKPKEIYRRNGELL 2108
C + + S A + S K T ++ FQ PKEI++R GEL+
Sbjct: 2062 PCNQDGRSSNEVASLANHASNS----------SPKAAVTANENSAFQNPKEIHKRTGELI 2121
Query: 2109 EVIFNPIALCINSVDQRQAAVASNKKMYVTEYLSNWEDGMASRDDEDYIWSNSEWPLNLN 2168
E ALCIN+++ RQAA+ASN+K + N EDG +S++ DYIWS+++WP N
Sbjct: 2122 E------ALCINAINHRQAALASNRKGII---FFNLEDGDSSQNQSDYIWSDADWP--HN 2181
Query: 2169 GWAASESTPAPTCVFPGVGLGSSKGAHLGLGGATLGVGSSVRPGRDLTGGGAFGISGYAG 2228
GWA SESTP PTCV GVGLG KGAHLGLGGAT+GV S +PG+ A + GY+G
Sbjct: 2182 GWANSESTPVPTCVSLGVGLGDKKGAHLGLGGATVGVVSLSKPGK------ADRVPGYSG 2241
Query: 2229 VGA-----------------SGLGWETQEDFEEFVDPPATAEHTNSRAFSSHPSRPLFLV 2288
+GA SGLGWETQE+FEEFVDPP T E +RAFS+HP+ PLFLV
Sbjct: 2242 LGAIADPGSFFTQIRRWLGVSGLGWETQEEFEEFVDPPPTVESVITRAFSNHPTMPLFLV 2301
Query: 2289 GSTNTHVYLWEFGKNRATATYGVLPAANVPPPYALASISSVQFDQCGHRFATAALDGTVC 2348
GS+NTH+YLWEFG RATATYGVLPAANV PPYALASIS+VQF GHRFA+AALDGTVC
Sbjct: 2302 GSSNTHIYLWEFGNERATATYGVLPAANVSPPYALASISAVQFGPFGHRFASAALDGTVC 2361
Query: 2349 SWQLEVGGRSNVRPTESSLCFNGHASDVTYVTSSGSIIAVAGYSSTAVNVVIWDTLAPPK 2408
+WQ EVGGRSN+ P ESSLCFNGHASDV Y++SSGSI+A +GYSS+ NVV+WDTLAPP
Sbjct: 2362 TWQSEVGGRSNIHPVESSLCFNGHASDVGYISSSGSIVAASGYSSSGANVVVWDTLAPPS 2421
Query: 2409 TSQAAIMCHEGGARSISVFDNEIGSGSVSPLIVTGGKGGDVAIHDFRYVVTGRTKKQKNC 2468
TSQA+I CHEGGARSISVFDN+IGSGS+SP+IVTGGK GDV +HDFR++ TG+ KKQ+N
Sbjct: 2422 TSQASINCHEGGARSISVFDNDIGSGSISPMIVTGGKNGDVGLHDFRFIATGKMKKQRNP 2481
Query: 2469 SKDEMISNASNSDMPSTVGEQNLNGMLWYIPKAHSGSVTKISSIPNTSLFLTGSKDGDVK 2522
ST G+QN NGMLWYIPKAH GSVTKI++IP TSLFLTGSKDG+VK
Sbjct: 2482 DGGS-----------STDGDQNKNGMLWYIPKAHLGSVTKIATIPRTSLFLTGSKDGEVK 2502
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q6PNC0 | 9.0e-34 | 33.23 | DmX-like protein 1 OS=Mus musculus OX=10090 GN=Dmxl1 PE=1 SV=1 | [more] |
Q9Y485 | 7.7e-33 | 35.59 | DmX-like protein 1 OS=Homo sapiens OX=9606 GN=DMXL1 PE=1 SV=3 | [more] |
Q8TDJ6 | 4.6e-30 | 33.21 | DmX-like protein 2 OS=Homo sapiens OX=9606 GN=DMXL2 PE=1 SV=2 | [more] |
Q8BPN8 | 3.9e-29 | 32.48 | DmX-like protein 2 OS=Mus musculus OX=10090 GN=Dmxl2 PE=1 SV=3 | [more] |
P47104 | 2.2e-27 | 28.28 | Regulator of V-ATPase in vacuolar membrane protein 1 OS=Saccharomyces cerevisiae... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1GHN3 | 0.0e+00 | 99.41 | uncharacterized protein LOC111454255 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1IFR3 | 0.0e+00 | 95.17 | LOW QUALITY PROTEIN: uncharacterized protein LOC111474058 OS=Cucurbita maxima OX... | [more] |
A0A6J1GIT5 | 0.0e+00 | 99.33 | uncharacterized protein LOC111454255 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A0A0L3T8 | 0.0e+00 | 82.96 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G119460 PE=4 SV=1 | [more] |
A0A1S3AVL8 | 0.0e+00 | 82.96 | uncharacterized protein LOC103483174 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT2G46560.1 | 0.0e+00 | 46.24 | transducin family protein / WD-40 repeat family protein | [more] |