Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTATTTTATTTTTTTTAGAAATTGTTTTATATTTTCGTCTTCCATTTTCATTCATTCCTCTTTCATTCCTGAAACTCTCTCACCGGCAGCGAAGCTTCCGACAGTTTTTTTTTTTTTTTTCTGATTCTGTTTCAAATGTAGAAGGAAGCACTTCCGACCTCCAGCATCCAAGAACTTCAATTCACTCCTTCTTCATCTTCTTCTTCTTTGCTTTTTCTCTGTGGAATTTTGTGTTCTTATGGCTTCCGCCTTCAGTCCTTCCCGGAGCCCAGGGAGTTCTCGGCTCCAACACTTGGGACCGGTGTCTGGTGTCTCTCGTTTGAGATCTTCATCGCTCAAGAAGCCTCCTGAACCGCTACGAAGAGCCATCGCCGATTGCCTCTCTTCCTCTGCTGCTAATTCGCATCATGGAGGTCCCTCTGCTTCAGTCGTCGTTGCCGAAGCTTCCAGGACTCTTCGGGTGAGTAATTACGAGGTTTCCTGTGTTCATTTTGTTTGGATCTTGCAATTCTGCTTGGATTACTCCAGAATCCGTCTTTTCGGATTCATTTTTTTTTTAATTGGAATTTTCCGTTTGTTTGACTTAAGCTGCGTGTTCTGTGGGAATTTCGATGGGTCTGAGGTTCATTTGTTGCCTGGTTGAGGGCTGGGATTTCTAACATTGGCTGGCTTCTTGTGAACGTGAATGGTTGTTTCGTGTTTTCCGATTGAATTGTGTCGATATACATCCGTTGTTTTTTTTTTCTTCTTTCGTTTTCTAATGTCGTATGATTCGGAATTGGATGTTGGAAGTATCGTCTGTGGTTTTTTGTAGATTATTGGTAATGTAAGAATACAACTATAGCGTTCAAGTGGAAGTATAATTATGGCAATTTACTTCTCTTAGGTGAATGCGTTCTCGTTAATCAATTACTTTCGAAGAAATAAATGAAAACTAAAATAGAACGTTCAAATACTCGTGTCGAGTTGAACTATATAAATCTATTCCGTTGGGGTTTATTTGCATCATGGGATAGTTCTCATATTTAATTTTTTACCAGTTATAATGAGACGGTATCATCTTCATTCTCATTCATTTGCTTTCTTATTTGTTTAATTGATGTATTTTTATATCTGCACAAGGTCGTGCTTTTTCTCTAATAATGGCGTAAAAATTAGAACAGTAATGAATATAAGGATGGTTCGGCAGTACAAAGCTTTTGAATCTAGTTAAGAGGCAAGATCAGGGGATTAGACTCAATTAAGCTAACGCTACTTAGTATTTTAATAATGCTGCTCAGTTAGGCATGCCCGGAAATTCTTTACTACCAAATTCTGTACCAATAGCTCTCATTTCTGTACCAATAGCTCATCATCGCAGCATGGTAGCCTGCATCGGACTACCCATTGCTAATCTGTCATTGAATGCACAAATGATTTGCATTGCTCGCATCCTATCTTTTTACCGATTTCTTGGATACGCTTGAGTCAAGTAGAGCTAAGGTGTAAAACATTCTTTATATAAAGATTAGCTATTTCACCTGAGTTGAAAATTGAAATTATTTGAGAATTCATCCAGTCACAACAACCTAAGTATGATATTCCTGCCCCACTTCATTATAAATCAGCTAGCCTCGAGCTTAGTAACAGGTCAGTAATAGCATGGATTTTAGCTATCTAATCTTCTCTGAAAATTTAAAACTTCATGAGGATGACTGCATGCTTTGTAGCTTTATGTTTTAGCACTTCTGAGAAGCCAAAGCTTTGAAGCTTTTTTGTTGGTACAAAACCTGGACTGAAAGAAATTATAGCCGAAGATGGAAGAGAGCACTAAGATACCTAGCATATAACGAGTGGCTTGGTGTTGACCAATCCTTCCTGGGATGGTTGTGTGGTTTGCCTTCATTAATCCAAGTGTTGTCAGTGAGTTCTTAGAAGTGTTTCAGCCGGAGAACTACGGAGGACCTATGTTGAAAACGAAAACACATCCTTAGTGAGCTAGCTTACGTCGTCACCTTCCAAATACCAGAAAAGGATAAGTGAAAATGTGTGAGTACTTGGCTACATGCACAAATGAAGGGCATCATCACTCTTATTAGGAGGGATAATCATTCTAAAGATTGATACAACATTAAATTTAACATAACCCATTAACTTAAATTTTTGAGTTGGTTGATGATTTAACTTGGTATCAGAGTAGGGGGTCATGTGGTTTGAAGCCCATTAATTTCATTTCTCCTCCCCAATTAATATCCATTTCCACTTGTTGACCTTCCACAATTTCTAAGGTGGGCAAGTGTTAAGGTATTAATATACATTAAATATATCGTAGCTTATTTGATAAACTTTTGGGTTGATTGGTGATTTAACCATTTTGGTTGGCTTTGATATAAATACCTACTAATTTTGTGTAAGTTATCAACAAATCCGGACTTGGCAAGGCTTCCATACCTCAATGCCGGCCCCCAAAAACACCTTTGAAGATTTGAATATTATACTGAAAGGCAATGAAATTGTCCAGCCATCATCAACCTAATAGACAAAGTTCACAGGAAGAAGAAGATCCTCAATTGAGGCTGGAACCTAAACTTTTAATGAAGACATAATTCTCTTGCCACGACTGTTCTCTAAAAAAATTGATGTGATTCTGTTATTTCATCCAACTCAGACTGGTGCAGTTGTTTATCTTCAAAATATTCTAAATGTACCTTCCTCTTCAAGAATAAGTCTTCTCAGCGTCTCTGAGCTTACTCATGGAAATTTTCTCAGCCTATCAGAGCTTACTCATGAGAATTCTGTCACCCACTGAATCACTTTGACGAGAGAAAGCTAGAGAAGGTGTAATTGGAATAGGGGTCTGATGGAGCGCTCCATCAATTTAAATTGACCCTTGTCAGTCAGTTACACTTGTCTCATGCCTGTTGTGTTCATTTTTTTTTTTATAAGATGACGAAAATAAAATGCATGTCAGCAAGGTATGGTCATGGATCTAGCTGTCTTAGTTTAGATTTTTGCAAGGCTTTCTTGAGTAAATTCCATGGCTTACCTTTTTCTTTTGCAAAATCGTACACAAATCCAAATTGATGCACTCGACATTTCGGGGCGTGCTTCTGTACTTTCTATTGTTGGATTTCACTATTACGCCTGTTTAGTTGATGATTACACTAAACTATCATGATTCAATCCTCTTTAAAATAAGAGTGGGCCTTCATCCATACAATTCAAAAACATCATAGATAACAAGTTTAACTATGTACCATCATATGTCTATAATCCTTTATTCAATGCTTTTGAAGATTGTTTGGACGTTCAGTATTAGTTTGAACCCAAATAAAGGTGCAGAATATGGAGTAGGTGGTGGTTCATCCTCCTTCCCTTCCACAATAATGGGTTAGATTCTTTAAGTAGGCAGCTCTCTTAAGTTCAGTTCAATACTCGATGACTCTTGACATTTAGAACACTTTTCAGTGTCTACTGTCTTCTGTGAGATTGGAGTAGCTTGTTACTCTAAGGCTCAATTAGTGAAAATTGCATATATATTACCAAGGTATCATCTCATTAGGACTTAATTTTAACATCACAAGTTTTATCAAATGACAGGACTATTTGGCAACACCTGCAACGACAGACCTGGCATATTGTGTGATCTTGGAACACACAATTGCAGAGAGGGAACGAAGGTAACCAAACTTGTTGATCATGTTGGTTTCAAGGAGACTCATGAATTTTCTTTTTTGCATAAGATGGTGAATTCTAATGCTTGCTTTAATTTCTTCACATTCTTTGAAGGAAGATGGTTCTGTACCATATAGCATATTCTTTTTATCCTTCTTAATTATCAGCAAATGGTTCATAGAATCTGTATCTTTTTTTTATAAAACCATCTTTTGACAATTACAACTGTTGATAGTAATGCCCCATGACTCAGAGCCAACTTTCAACTATTTCTTAGAGACATTATTGGTTCTGTGGATAAAGAGTATTATGATACATGTATACATGTATTTTCAACCTTCTTGTATGAGCGTATGTTGAAACCACGATATGTTCTTACTCAAAATGAGGGAGGGAGAGTTATTTGATTAGCTGAAGGGAAAGAACCCTAAAGAATTGCCTCAACCCTAGAGAATCCAAGAGCTATTGTATCTCTCTCATTACAACTAGCTGATACCCAGATCTTTGTGAAGAAGATTTAGTTTAACTGTATTCACTAGAGGGAAAATGTTTTTCATAAAAGAATAAAAGTAGAAGGCAAATCATTTGTATTCTGCTATTATTATTATTTACTCGTCAACAGGAGCAATTATAATATTTGGAAATTCGAGACAGGAAATACTTTTGATTTATGCTTGAATTCTGAGAAGGAATTACATCAATATCCTATATTCTTGTTTTGAATAATCGTTGGTGCTAATGCTAACTAACGCATTGCACATTTGACTAGTCTCACTGAACAGTTGTATGATTCCTGTAACCCTTGACATTAGAAACTCGTGTGATATGATATGTTAAGCAGGTGCCTATAAGGATGGAGCTATGACCTCAGAAACATTTTGGTTTGTAAATTGTCTTTGATTGTTCACGAACCCATTATCCTATATGCTTGATTACCCGCTAGTAATTGTTGGGTAGAAAAAGTTAAATCCTTATTATTCACAATGTGTTAATTTTTCTTGCATTTCCTGTGTTGTCGTTATACAACTCATTTTTCTTTTTGTGCATCCAGCCCAGCTGTAGTTGCAAGGTCGGTGGCACTTTTGAAACGCTACCTTCTTAGGTATGTTTGTCAAATCATCTTATATTTCTGTTGTACCCATATAATTGTTGAACTGGGTTGAATGTCTTCGTGATGGACTTCATTCTCCTTCAATATGGACCATTCGATCTTTTAGTTCAATATTTACTTCTCATGTCACAGGGGTTGTTCTATTGAACCGGAATACGTTCACTACAGTTACTCACTGTGCACATAATGCAATATAACTTCTTATTAACTCAAATCATCATGTTCTTCTTCTATAACATGAATTTACTTTTTTCTCCACTCCTTTTGATTCCCATGTCACAGGCTATTGATTCCCATGTTACAGGCTACAGTTACTCTCTCTCTCTCTCTCTCTCTCTCTCTCACACACACACACACACACACAGAGGCTACAGTTACCGGTCTCCTTGTAAAGATCAAATGATTTTACTGCTCTTGGATGCTCTTTTCTGGAATCCTCACATGCCTTTTCCCGACTACAAATTCGTTAGATGGCTGTTTTTGCTGTTCCCTGATACAACTTTGTGTTATGCTTTGTTTTTGCAGATACAAACCCTCTGAAGAAACATTAATGCAGATAGACCGGTTTTGCTTAAACACGATTGGCGAGTGTAGTTTTAGTCCAAACCGAAGGTCGTCACCATGGACTCACTCTTTAAGTCAAGCATCTGCTGCGACTACAACCCCTTCTACTTTTTCACCATTGCCTGTATCAAGTATTGCCTCTGGAGCACTTCTAAAATCACTGAAATATGTTCGCTCCTTGGTGGCACAACACATACCAAGGAGATCGTTTCAACCAGCTGCTTTTGCTGGTGCACCTTCTATGTCAAGACAGCCGCTTCCTGCACTGTCATCTATGCTGAGTAGATCCTTCAATTCACAATTAAATGCTGCAAGTACTGGAGAATCTTCAGAACATAAAGACTCTACTGTTTTATCCATATCAAATTTATCTAACATTGAAGAAGTTGATGGTATGGTCGATCTTGAATACATTGCACATGATGCACTGAAATGGCGATGGCTTGGGGAACTACGGTCATCTTTTTTGCAAAGAGAGGGGTAATTAACTTTTCTAACAACCTTCATTTTAGTTTTTCCTTTTATGCATGCAGTAGTACAAAGTAGTTTGCTTATTTTGCATTAGTTAAGTTAAATGCTGGCTCTTTTTGATGGTACAGTGATAGTTTTGTTAATACTCAAGACTTGAGAACACGTAATCTTCTAGAGGTAGGTGCAGCAGCACTTTTAGTGGGAGATACAGAAGCCAAAGTGAAGGATCAACCTTGGAAATCTTTGGGAACAACTGATATGCCATATGTTGATCAACTATTGCAGCCTTCGCCAGTAGCGACTATAACCAATTCTTCCTCTGCTCGTCTCCACCTGAGGGCTATAACTGCATCAAAGCGCACAAAACCAGACTTGCATCAGATTTGGTACAGCCTTTCATTTACCAATTCTTATCTTCTTTATGCTTGCTAGTCTCTACTTAAAAGATAATCTAGAGAAAGGACTGGCTAAACTTTCTCTCTGCTTCCCCTTTTGGATCTTAGTTTCCTTCCTAATTACTATTTTGTTGCATTTGGGTCCTTGCTGCCATGTCATTTTGCAACAAAAAATTGTGGTATTTAATAGAAATGACATAGTTATTGTGACTAAGTGTAATACTGAGGATTTATTGTTAAAATGTTGGGTGAAAAATCTAACTTAGACAAAAAATGGTTATATGGTCAAAAAATGGTTATATGGTCAAAAAATGGTTATATGGTTACATATGCTGTAGTTTAGGAACCTAAGATATCATTAACTAAATTATTAGAATATACTGTCTTTTTCACCAAATGATAGGGAACCTGGTAGGGTGATTAATTCAAAGAACCCTGAACAAGTGTTTTGATGAATTTTTGTTTTACAAACCAAATGTTGGTCTTTGTTGAGATGCAAGAGCTGTGTCCAGAAACTAAATGGTCAGGTGGGGAATACAATGCATGTCAACCGGAAGGATTTCCATTCTGTGGGTTTTGCTTTCTGTTGTGCTCCATATTAAGGTTGTTGGATAGTATACAATATTGAATAGCAAAGTTGTTAAAAGGAATACAAACCGAAATACTTTAGACTGCCAAACCCCCCATCCCTTACCCAAAAATAAATAAATAAATAAAATAAAATAAAATAAAAAAACAATATATATAAGGCATAAATGTCCTTGTTCTTGCAGAAAAGTCCAGGCTTAAGCTGCATCCATCATTAGGATCTGTAAACAAAAATCAAAATTCATATTTATTAACTTTATATCAGATGTCCAATAATAGCTACCAACCAAATTATTTCTCCAAATCATGCTAAAATAAAGGTTAACTTTTATAAATTTGCTCTTGATGATTTCATAATGTCTTTTAGGGAAGATTCTCCTGGGAGTACATTTCGACCAAAGGCCCGACCACTTTTCCAATATCGTTACTACAGGTACAAAAAATAGATTCTGATATGGAACTCTGTAAGTTCATAACGCATAGATATTGATAAAGCTAGTACTTGTTTATCAAGTGAAGAGTAATTCCCATTTAAAATTATGAAATTGATCATCATAGCTTAAATGGATTTATTAATATATGGTGGTCTCGCTGTTATGAAAAATTACTGGTCTCGTTTTCATGCAATATGGGAAACTTGCATTTTGAATTATTACTTTAAGGTTTTTCCTTTCTGTTGGATACTTAACAACTTATAGGCTTTTGATGCATGATCTATGATGTTATTGAACTGGTAGGCTTGGAGTTATCATATCGTAGTGACCATTTCACACAATTTCTTTTCATTTTTTGAAGGTATAGACTTTGGCATGTAATATTGATTTCAGATACTGGTTAAATTGAGGTGCTGAGTAAACATGTATGCCTGTAATATTTTTTCTTTTCTTAAGTACAAATTAACATGCGGCTTTGACATGCAATAAAATAAAAAAATTGTAAGGTAACCTTGCTCTTAAAAAGTTAAAAGACTAGAACTCAACAAGAAGTTTTTATTAATCAAGTTCATATTATCTTGTGGTTGCATCTGTATCTCACAAAATTAACTTCATGCCCATCAGTGAACAGCAGCCTTTGAGACTGAATCCTGCTGAGGTGTGCGAGGTTATTGCTGCAGTTTGCTCTGAAATGTCTTCACCCATCGCTAATCCCCTTACAGTAACTTCTAGGTTAAGTACAAATAGTGGGAAGCCATCGATGGATGTGGCCGTGAGCGTTCTCGTAAAACTCATTATTGACATGTAAGTTTGTCCCTTTAGATGCTTGACTTACACTGAATGTTTCAAATTTCATCTACTATTCATCACTCTCTTTCATTTCAACCTGTTTGCGTTTATGTTTTTGTTACTCTCTCATACAATTTACTTGTGTGAGGTTGTTAGGTTTAATGCTTTGGTTGGCTTGTTTCTATGCCCTTTTTTGGGTATATAGCTTGAAAGCAATTATATGCTTTTTAGAGGAGCTGACACATTTGAGAGCGAGTTATGGGAGATTTTAGGTTTAATGCTTCGCTGTGGGCCTTGGAGACTAACATGTTCGTAATTGTCCACTAAGTTTCATTTTTTTTTTTTTGGATTAATGTCCTTTCTTGTAGTTATTTGCACTCCTTTTTCTTGGGTTTTTTCCCCTTGATGAAAGCTTGGTTTCTTATACCTAATTCAATGCACTTGTGTGCAAAACACATTTTGGCATTATGCACCAAACAAACTTGTGTTATAGATGTTAAAATTGAGAACTAGTTGTTGTTCCAAAGTTGATGCTGAAATGCCTTTTTCAACTAGGGCACTGATATGGTTTACTCATGTTAAGTAAGATAAACACTATGAATGTGATAACTTTTTCGATATCATGCCTCTTTGGGTCTCCTTTCTTAAAGACTTATGTAATTATCTTTTAGTTTTTTTTTTTTCCTTTTTTATTTTGCTTCACTGGAGCCCTTCCTTAGGTTAATTGGGCTTTTCTTGGAGTGTTTTTTATTCCCCTTGTTTCCTTTATCATTTTTAATGAAATTTGGATTTTTTAAACAAAAAATGGACATCTGAATTTATTTTGCCATCTTTTTCATTTGTTCTGATCCAAATTCTTAGCTATAGCAATGTGCGCCATTTTCTTATATATTGAGATGATGTTATTCAAGATATAAACAATCTGATAGATCGGACTATTATCTGTCTTTGAGGAATGTAATGATAATCTTAATCCAAGTTTATCTGTATTTCTTAAGTTCAAGGCTTTGAAAATAATTTTTATTGAGTCAATTGATGTTACAATATATTTTCTGATCTTGAAGGATTACTGAATGTATTATGATGTCTTTATTTATTTATTTTGTTCAATGTTTTCTATATGTCTTTTCTATGCTTTCATTTTTTGAATTTTTCATGCATGAATGAAGTTATTCAATAATTTCTCTTACAGGTATGTTTTGGATTCTGGGATTGCTGCACCTCTCACTTTATCCATGCTTGAGGTGATGTTCTGGATTGATTTATCATTCTAAATTATCCCAATAGCATATTTTCCTACGTTTAAGGTTTTTTTTAACATTATTATTGTTTTTTATTTTTCCTTAAAAGGAAATGCTTAGTTCTCCAAGATCAACCTGCAGAGTTCGTGCATTTGATTTAATCTTGAACCTTGGCGTTCATGCTCACTTGTTAGAACCAATCGCACTGGATGACAGTTCTGCAATTGAAGAAGAGTATTCTCAAGAATCATATCTTGCGGAAGAAACCCAATTTAATTCACAGGGGAAGAAAAATCCTGATTCTCCTAACAATATCAGTGCAACTTCATCTATTAACAAATTCGAATGTTGGATTCTTAACATCTTGTATGAGATACTGCTTCTTCTTGTCCAAGTATGTATGATATTTCATTCTTTTGCGAAAATTTTCCTTCTTCTAATCAGTAGTGCTCAGTCAAGAAGTACATAAAATTATTTTCATTTGGAAAATATTGCCAGAGGCTCCAAGAAAAAAAAGTAGGTGCTTTCTCAGATGCAATACTATCCATTCTTTTGGGAAATTTGGCTTGAATGAAAGAAACAGAATTTTTAATGGGGAGAATAGGAGCGCGGACACTTTATGGGAGAAATGCTTATTGATTCTTCCTCTTGGAGTTCTTTATGTAAACTATATTTATAATTACAATTTACAATGTCTTCCATTACGGCCAACTCAAGATTATTTTATAACGCCCATGGCATTTTGTTTCTATGTCTATTTCAATGAAATAATACATAAACATATAAGTTTTTATCATAATGTGTACAACTTATTAAATGAGTTCTACCAAGCTGACCGACAGTTCATGTTGAATTCTTACTCCCATGAATTATCTTGTTAGAAAAATGCATCCTGGTTGTTGCACTACGGTGTTGCATGGGTGATCCATATATGTAGCTTCCTGTTATGGGCTAGAGCACAGCTTGATTGAGATTACATTTTAGCAGATGATTAGTCATGAGTAATTGAATCAGTGACTGAATAAGGTAGGTTATGAACTTAACGTTAACTCAGCTCATGACTTAATTGGCAGGGAATGGTTTGGGCAATTTGTGTGAAGGTTATTTCTTAGAAAGAACTGTGATTGTATGAAGAACACAAATAATGGATGATACAAAATCTTAATGGAAGATGTGTAACTATAATAGTTCAATGGATGTTTACTATTTTGTCAATTTCTAAAACATCTATTTTATGATTGCAGATTGAAGAGAAGGAAGAATCTGTCTGGACATCTGCTTTGAGCTGTTTACTTTATTTTGTTTGTGACAGAGGCAGACTCAGGAGAAGCCGACTAAAGGGTCTTGACATAAGGGTGAGTTGTTTGCTAACTTGCCTCAAAGATATTGCAGTCAAACAAAAAACATTGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGTTGGGGTTACATCTGAATTCCTGAGTCACTTAAAGATACAAATCGTTATACTATGTGAACAGTAAATATTTTATCTTGGATCTCTATGGTTAATTTTATGTTACCACAAATTCTTTGTTTTTTAATTGTTGTTTTTGTTTGAGTTGGGGGGGTTATATCAAGGAACTCAACCTAACAGATCTGCTAAATGAAGAGCCATATGCAAAAAAAACTAAATAAGGCTTCTATGCAATTCATTCCAAATATATTTGTGTGCCCCTTTTCTGTTCTTGACAAAAAGACTTAGAAGATGTGTAGGCCATGTAGTTGTCATGTAATTTGTATCTTCTTTCCTGTGAAAAGTACATTGGTCGCTCCCCCTACTTGCAATAATGCTTTTGTTAATTTGACCTGTTGGCCGTAAGGAAATATACATTGTAATAATGCTTTTGTTCATTTAGCTGCTTCATCAATAAGAGGGTATGATGCTTTATTCTCTCTCTCTTTTTTTTTTTTTCTCTTTCCTTTCAGGTTATTAAGGCATTTCTAGAAACTAGCAGAAGAAATTCTTGGGCTGAAATAGTTCATTGCAGGCTCATTTGCTTGTTAACAAATATGTTTTATCAAGTCCCGGAGGAGTCCACAGATGTTGCTCCAAGCCCTATATTTCTTGTAGATCAGGTGGATCTGGTTGGAGGAACTAAGTTTATTTTCTTTGAGGTATGCTAATTAATCTACAGTTCAGCAGCATGGAGTAATGAGTCTGACTCTGTTAGAAGAACTAAATTTATTATCACGATTACATCCATTTTTGTTCTTGAAGGCTTATAATTTCATACCTGAGAGATATTTCCCATGTTAGATCCTTAAGTTGCTAGTTTTAACTTTCAAGGTTACGTTACCAGTTTGTGGTAAGTTGATTTTTTTTTTTCTTAGTTTACTTTGAATTTCTCATGTGATTAAGGTTAAGTTGATTTATATACTAGATCATATATCAATGTTTGAAATGTTCAGAATGATGATAACAGAAAAAATAATATGATCTATACATCTTACTTTTCTAATATTTTCAGTGCTTCTCTACTGCAGTATTCTCTAGCAAACTCAAGAGAAGAACGGCGAAACCTCTTTCTGGTGCTTTTTGATTATGTTTTGCATCAAATAAATGAGTCTTGCATCACAACTGGAGGTATGGAGTATAGTGATGATGAGATACATCCCCTTGCAGCCCTGTTCAGCCTCGCCAATGCACCTGAGGCTTTTTACATCTCAGTTAAGCTTGGAGTGGAAGGTGTTGGAGAGATCTTGAAAGCGTCTATCTCATCAGCATTGTGTAGATATCCTAATAGCGAGCGACTAAATTTGGTATGCACTCATCTAAATTTGTTTTTGTAGGCTATCTACTTGCTTGGTCGAAAAGCAGTTTCGAATGGCGTGGAGTTATGATTTTTTTCTGAAATCGGGGTAGCAGCTTGACATAATATATCCTTGGGAATTTCCAGGCCCCCTCAAATATCTCTCATTCTACGCACAAATCATGGAATTATTTACACTAAATGAATATATTGTATTGAATGTTTATTCTAACCCTTTCTATCTACTTTTTACTTCTAACTCTTCCTGTCAACCCACCACCTATTTATCAACATCGGCCCCCTCTCTCCCTTCTCAATGATATGAGATGGTTGACCATTCATAGTGTGACTGGGATTGTGGATGGGACATTTTAGAAATTTTCGCATCTTCTTTAAGTTATTTATATTTACGATTATGGTGGGGAGGTGGGTGAAGGTGGAAGCCAAAATCCAAAGGAAGTCTTTATTAACTTTGTTTAGTAGTAACCATTTTGAAAGAAAGGAATCTCTTATATGCAACGTGCACAATTAAGCTAGATTGAACTGCTCAAGTGGCATTTGTGATTATGTGAGGCCTCCTATTGTAAATTCTAGTAGGAGGGTTAAAATGGGAACGTAAGGGACTGAAAGAGTTAGTTAGTGATGCACATTCCTTTTATGTATTAAAGAAGGGAGAAAGGGAGTGCATCGGTTATTTTGAAAGGAGTTACGTCTTTGGGAGAGTTCCTGTTCTGTAGGAGGATTTTCACCCCTTCACTCTTTTCGACAAAAATAATACAATCTGATGCCGATGAGATTACTTTTGTCTTATTATTGCCAACAGGATACCTTCTTGTAAATTTCTTGGCTTTAAGTTGGTATAAGTCTATGAAACATTGATACTTTAGTTTAGGTAACTTGTCTATGTTCGACACAGATCGAACTCTTATTAGCATTATAGAAAAATACGTTAGACTCAAGTTGTACAAAGTCAATATAGGATAACATTTGTTAAATATGTATTAAATACTTGCTAAATAAACTGAATAGTCACGTAATAGTAGAGGACAAAAATAATAAATTTTGAGAATGAAATAGATTAAACTCGTATTTTTAAGCATATACTATGCATAAACTTATTGACTTAATTTCCTTGTATAACAATGATATATATTTTAATAAAAGTGTCCTGTGTCTATGTCCTAGATTTTTTAAAATGACATGTCATCATATCTTTATTATGTCATATTTGTGTCTTGTGTCTGTATCCTGCTTGTTAGGTACAAGTTATTTTTGTAAGACTTTTGAGATGAAATTTATTTTTCTCTCAATTTTATCCAACTTAAACTTGATAATGGTTTTAACTTCTGATTGTTCTTCTTTTGCAGCTCTTGGAGAACGTAATGGAGAAATTTAATACAATAATCAAATCGTTTACACATTTGGACAATGAGTTCTCTTATATGATACAGATAACCAAATCTCTCAAACTTTTTGAAAGCATTCAAGGTTCTTTATTAAGAAATGGTGTTAGCATGAAATCCAAGCTATCATGGGCCACTCTGCATTCCCTGATTCATTCAGAGAGAATTGCTTATCGTCAAAATGGTTATGTCTGGCTAGGGGATCTCCTTTTTGAAGAAATAACTGGTGAAAGGGACGAAAGTATGTGGACAAATGTGAAAAGATTACAGCAGAGAATTGCGTACGCTGGTTTGAATGATTATTCAACAACTTCAGACGTGCCACTTTCAATCTGGCTCATGTGCGGACTTTTGAAGTCAAAACACAACTTCATTAGATGGGGCTTTTTATTTGTTGTAGAAAGACTTCTTATGCGATGCAAATTTTTGTTGCATGAGAATGAATTGCGAAATTCTGGCAGCATCGACATTCGCCAGGCATCAAAAGATAGCCGTCTAGAGAAAGCTAATGCAGTGATAGACATAATGTGCAGTTCTCTTTTTTTGGTTTTTCAGATAAATGAAACTGATCGCACCAATATTTTAAAGGTAATTCTTTAATGATATGAACATATTATCAGCATAAGCTCTCCTTGTTCTACATATTGAAGTCATGGATTGAAAAACTGAAAGCACTAGTGAATTCACAACCATGGAACGTCAGACATCTATTACGTTAGGCTTATATATTCATGACTTATTGGAAGATCTAGTTAACTTTTTAAATGTAATTTTGGATTAGAAAAGTTCTTTTAAGGAAACTTTGAACTCACTACACGTTTATTCAGCATCCTTTAGTGTCGCGCATGATCTTATTTCTGGTTTTGGTGTTATTATAGATGTAATATGCTTTTAAAATTTGCAGTTGGATATGTTCTAGTAATCCAAGTTTTTATCTATACCATATCTGATCCTGTTGGAGCATCAGAAAGTTTAAAAGAAGTGCTATTTCTACTATACAATCTGATGTCTTCTTGATTTTGATATTTCTGATCGTATGACTTTTATATAACATTTCTTGTTTATGTTGCTCACGAATCCTAATATATCCTTTTATGAGCTATGCTCACTGATCGAATCTCCTTCCCACCCTTGTATCTGGAATTTAATTCGCTGCTATTCCTGAACAGATGTGTGACATACTCTTCTCTCAATTATGCTTGAGAGTACCACAAGTTTCTGACTTACCAATTGGAGATGATATGCCACGTGGCAGAGTTATGGATTACTCAGGTGAAAGTAAAACGATAGGGGTTACTGAATCTGAAGCTAAACTTGAGGAGAAAAGCAGATTTATTAAAACTTATAATAATCCTCTTGATCATGAGACAGCCTCCATGGCTGCATTACTGCTTCAGGGACAGACAATTGTCCCGATGCAGTTGATCTCGCATGTTCCTGCTGCTCTGTTCTACTGGCCATTGATTCAACTTGCTGGAGCAGCAACAGACAACATTGCTTTGGGTGTTGCTGTTGGAAGCCAAGCAAGAGGGAACCACCCAGGTGCCGCCTCTGACATACGGTCTGCACTGCTCTTACTCCTGATTGCTAAGTGCAGTTCTGATTCATCTGCTTTCCAAGAAGTGGATGGGGAACAATTTTTCAGGTGATTCCACATAGTTCTACCTCCCCCCCCCCACCCCCCCTCAGACCCTGATATGTTAACTTTGTTGGGGTTTTGCAGAGAGCTTCTAGATGATACGGATTCAAGGGTGGCTTATTACTCTTCAGCGTTTCTTTTGAAGGCAAGGAAATTCCTCTCCACCGCGAAAATGTATTTGTATGAAATTTCAAATTTCTTTTTCCTTTCGTAACTGAATGATTTTTTTTTTATTTTCAGCGTATGATGACAGAAAAACCTGAAAAGTACCAATACATGCTTCAGAATCTTGTAATTAAAGCTCAGCAGGTGATGTTTTCATATTTGTTGTCAAATACAGATTCTGATTTGGTTAATATATCCTGAACACCATTGATCTATTTGTCTTGGCAATCACATCATCTCTAGTATTGTTACTGGCCATAAACGAAGTTCAGTTTGATTTACCAAATCAGTGAGGGGAAAAAGAGAAGTTGGGGTTGGTTATACACTAGACATGCCCTTTGGTCAGCCTTGTTTTAATGGGAAGCAAATAATAATAATGAATTAAAAAGATCAAATAAACCCCGTAATGATTGGTGTACATTGGAAACTTTAGGGTTCATATACTTTAAATTTGTTTTCCCATGCACTGAACATTATTACTTGGAGGCTGTGGTGCCTTTTATAAGCAGATATTTGATTTGGCTGCTGGCTTACTATCTCTTTTTCATGTACGGTTTTTATTTTTAAGAACAAGCAAGGGTTGTATTGAAATAGAAAGAGCATATAAGAAAAAGAGGAGGGAAAGAAGAAACCCAACGGTCACACTCACAGCTTGGGAGATTGAGGTTTTAGTGATTTACAATACAGTGGCAGTACTCTCTGATAATGAAAGTCCCCAAAGAAATGAAAAGACAAGAAAAAACATTAAAAAGAAGACTAGAGCCCTATTATGCTTCCACGTTTCTTTTTTCTTTTATTTTCTCCTAATGCTAAAAGTTCCTGCTCATCCTAAAATAGCATAAATGATTGATATACAAATTAATTACTTGTTCTAATTTCTTTTCTTTTAATTATAAATGACTAATTTTAAATTCGTTTGAAGTAGTTGACTGATGAATTTTGATCTAACTTTATACTATTAGTTATTTTATAGAAAAGTATAATTTGATTTTAAATAATAAATCAGTGGAAATCTACATCATATTTTATAAAATTCGTTGCGCCCCTTCAGGTCTCTTGTGATATTGCTCGAATAACTTTTAACCGAGATGTTCTCTTCATTTGATTGTTGAGCTCAAATTGCTAACATGCTTGCTTGCTTTCTTCTCCCACCATGCAGAGCAATAATGAGAAGCTATTGGAAAATCCATACCTTCAGATGCGTGGTATACTTAAGCTAGCAAATGATATGGGCATTGAGTTGTGA
mRNA sequence
TTTATTTTATTTTTTTTAGAAATTGTTTTATATTTTCGTCTTCCATTTTCATTCATTCCTCTTTCATTCCTGAAACTCTCTCACCGGCAGCGAAGCTTCCGACAGTTTTTTTTTTTTTTTTCTGATTCTGTTTCAAATGTAGAAGGAAGCACTTCCGACCTCCAGCATCCAAGAACTTCAATTCACTCCTTCTTCATCTTCTTCTTCTTTGCTTTTTCTCTGTGGAATTTTGTGTTCTTATGGCTTCCGCCTTCAGTCCTTCCCGGAGCCCAGGGAGTTCTCGGCTCCAACACTTGGGACCGGTGTCTGGTGTCTCTCGTTTGAGATCTTCATCGCTCAAGAAGCCTCCTGAACCGCTACGAAGAGCCATCGCCGATTGCCTCTCTTCCTCTGCTGCTAATTCGCATCATGGAGGTCCCTCTGCTTCAGTCGTCGTTGCCGAAGCTTCCAGGACTCTTCGGGACTATTTGGCAACACCTGCAACGACAGACCTGGCATATTGTGTGATCTTGGAACACACAATTGCAGAGAGGGAACGAAGCCCAGCTGTAGTTGCAAGGTCGGTGGCACTTTTGAAACGCTACCTTCTTAGATACAAACCCTCTGAAGAAACATTAATGCAGATAGACCGGTTTTGCTTAAACACGATTGGCGAGTGTAGTTTTAGTCCAAACCGAAGGTCGTCACCATGGACTCACTCTTTAAGTCAAGCATCTGCTGCGACTACAACCCCTTCTACTTTTTCACCATTGCCTGTATCAAGTATTGCCTCTGGAGCACTTCTAAAATCACTGAAATATGTTCGCTCCTTGGTGGCACAACACATACCAAGGAGATCGTTTCAACCAGCTGCTTTTGCTGGTGCACCTTCTATGTCAAGACAGCCGCTTCCTGCACTGTCATCTATGCTGAGTAGATCCTTCAATTCACAATTAAATGCTGCAAGTACTGGAGAATCTTCAGAACATAAAGACTCTACTGTTTTATCCATATCAAATTTATCTAACATTGAAGAAGTTGATGGTATGGTCGATCTTGAATACATTGCACATGATGCACTGAAATGGCGATGGCTTGGGGAACTACGGTCATCTTTTTTGCAAAGAGAGGGTGATAGTTTTGTTAATACTCAAGACTTGAGAACACGTAATCTTCTAGAGGTAGGTGCAGCAGCACTTTTAGTGGGAGATACAGAAGCCAAAGTGAAGGATCAACCTTGGAAATCTTTGGGAACAACTGATATGCCATATGTTGATCAACTATTGCAGCCTTCGCCAGTAGCGACTATAACCAATTCTTCCTCTGCTCGTCTCCACCTGAGGGCTATAACTGCATCAAAGCGCACAAAACCAGACTTGCATCAGATTTGGGAAGATTCTCCTGGGAGTACATTTCGACCAAAGGCCCGACCACTTTTCCAATATCGTTACTACAGTGAACAGCAGCCTTTGAGACTGAATCCTGCTGAGGTGTGCGAGGTTATTGCTGCAGTTTGCTCTGAAATGTCTTCACCCATCGCTAATCCCCTTACAGTAACTTCTAGGTTAAGTACAAATAGTGGGAAGCCATCGATGGATGTGGCCGTGAGCGTTCTCGTAAAACTCATTATTGACATGTATGTTTTGGATTCTGGGATTGCTGCACCTCTCACTTTATCCATGCTTGAGGAAATGCTTAGTTCTCCAAGATCAACCTGCAGAGTTCGTGCATTTGATTTAATCTTGAACCTTGGCGTTCATGCTCACTTGTTAGAACCAATCGCACTGGATGACAGTTCTGCAATTGAAGAAGAGTATTCTCAAGAATCATATCTTGCGGAAGAAACCCAATTTAATTCACAGGGGAAGAAAAATCCTGATTCTCCTAACAATATCAGTGCAACTTCATCTATTAACAAATTCGAATGTTGGATTCTTAACATCTTGTATGAGATACTGCTTCTTCTTGTCCAAATTGAAGAGAAGGAAGAATCTGTCTGGACATCTGCTTTGAGCTGTTTACTTTATTTTGTTTGTGACAGAGGCAGACTCAGGAGAAGCCGACTAAAGGGTCTTGACATAAGGGTTATTAAGGCATTTCTAGAAACTAGCAGAAGAAATTCTTGGGCTGAAATAGTTCATTGCAGGCTCATTTGCTTGTTAACAAATATGTTTTATCAAGTCCCGGAGGAGTCCACAGATGTTGCTCCAAGCCCTATATTTCTTGTAGATCAGGTGGATCTGGTTGGAGGAACTAAGTTTATTTTCTTTGAGTATTCTCTAGCAAACTCAAGAGAAGAACGGCGAAACCTCTTTCTGGTGCTTTTTGATTATGTTTTGCATCAAATAAATGAGTCTTGCATCACAACTGGAGGTATGGAGTATAGTGATGATGAGATACATCCCCTTGCAGCCCTGTTCAGCCTCGCCAATGCACCTGAGGCTTTTTACATCTCAGTTAAGCTTGGAGTGGAAGGTGTTGGAGAGATCTTGAAAGCGTCTATCTCATCAGCATTGTGTAGATATCCTAATAGCGAGCGACTAAATTTGCTCTTGGAGAACGTAATGGAGAAATTTAATACAATAATCAAATCGTTTACACATTTGGACAATGAGTTCTCTTATATGATACAGATAACCAAATCTCTCAAACTTTTTGAAAGCATTCAAGGTTCTTTATTAAGAAATGGTGTTAGCATGAAATCCAAGCTATCATGGGCCACTCTGCATTCCCTGATTCATTCAGAGAGAATTGCTTATCGTCAAAATGGTTATGTCTGGCTAGGGGATCTCCTTTTTGAAGAAATAACTGGTGAAAGGGACGAAAGTATGTGGACAAATGTGAAAAGATTACAGCAGAGAATTGCGTACGCTGGTTTGAATGATTATTCAACAACTTCAGACGTGCCACTTTCAATCTGGCTCATGTGCGGACTTTTGAAGTCAAAACACAACTTCATTAGATGGGGCTTTTTATTTGTTGTAGAAAGACTTCTTATGCGATGCAAATTTTTGTTGCATGAGAATGAATTGCGAAATTCTGGCAGCATCGACATTCGCCAGGCATCAAAAGATAGCCGTCTAGAGAAAGCTAATGCAGTGATAGACATAATGTGCAGTTCTCTTTTTTTGGTTTTTCAGATAAATGAAACTGATCGCACCAATATTTTAAAGATGTGTGACATACTCTTCTCTCAATTATGCTTGAGAGTACCACAAGTTTCTGACTTACCAATTGGAGATGATATGCCACGTGGCAGAGTTATGGATTACTCAGGTGAAAGTAAAACGATAGGGGTTACTGAATCTGAAGCTAAACTTGAGGAGAAAAGCAGATTTATTAAAACTTATAATAATCCTCTTGATCATGAGACAGCCTCCATGGCTGCATTACTGCTTCAGGGACAGACAATTGTCCCGATGCAGTTGATCTCGCATGTTCCTGCTGCTCTGTTCTACTGGCCATTGATTCAACTTGCTGGAGCAGCAACAGACAACATTGCTTTGGGTGTTGCTGTTGGAAGCCAAGCAAGAGGGAACCACCCAGGTGCCGCCTCTGACATACGGTCTGCACTGCTCTTACTCCTGATTGCTAAGTGCAGTTCTGATTCATCTGCTTTCCAAGAAGTGGATGGGGAACAATTTTTCAGAGAGCTTCTAGATGATACGGATTCAAGGGTGGCTTATTACTCTTCAGCGTTTCTTTTGAAGCGTATGATGACAGAAAAACCTGAAAAGTACCAATACATGCTTCAGAATCTTGTAATTAAAGCTCAGCAGAGCAATAATGAGAAGCTATTGGAAAATCCATACCTTCAGATGCGTGGTATACTTAAGCTAGCAAATGATATGGGCATTGAGTTGTGA
Coding sequence (CDS)
ATGGCTTCCGCCTTCAGTCCTTCCCGGAGCCCAGGGAGTTCTCGGCTCCAACACTTGGGACCGGTGTCTGGTGTCTCTCGTTTGAGATCTTCATCGCTCAAGAAGCCTCCTGAACCGCTACGAAGAGCCATCGCCGATTGCCTCTCTTCCTCTGCTGCTAATTCGCATCATGGAGGTCCCTCTGCTTCAGTCGTCGTTGCCGAAGCTTCCAGGACTCTTCGGGACTATTTGGCAACACCTGCAACGACAGACCTGGCATATTGTGTGATCTTGGAACACACAATTGCAGAGAGGGAACGAAGCCCAGCTGTAGTTGCAAGGTCGGTGGCACTTTTGAAACGCTACCTTCTTAGATACAAACCCTCTGAAGAAACATTAATGCAGATAGACCGGTTTTGCTTAAACACGATTGGCGAGTGTAGTTTTAGTCCAAACCGAAGGTCGTCACCATGGACTCACTCTTTAAGTCAAGCATCTGCTGCGACTACAACCCCTTCTACTTTTTCACCATTGCCTGTATCAAGTATTGCCTCTGGAGCACTTCTAAAATCACTGAAATATGTTCGCTCCTTGGTGGCACAACACATACCAAGGAGATCGTTTCAACCAGCTGCTTTTGCTGGTGCACCTTCTATGTCAAGACAGCCGCTTCCTGCACTGTCATCTATGCTGAGTAGATCCTTCAATTCACAATTAAATGCTGCAAGTACTGGAGAATCTTCAGAACATAAAGACTCTACTGTTTTATCCATATCAAATTTATCTAACATTGAAGAAGTTGATGGTATGGTCGATCTTGAATACATTGCACATGATGCACTGAAATGGCGATGGCTTGGGGAACTACGGTCATCTTTTTTGCAAAGAGAGGGTGATAGTTTTGTTAATACTCAAGACTTGAGAACACGTAATCTTCTAGAGGTAGGTGCAGCAGCACTTTTAGTGGGAGATACAGAAGCCAAAGTGAAGGATCAACCTTGGAAATCTTTGGGAACAACTGATATGCCATATGTTGATCAACTATTGCAGCCTTCGCCAGTAGCGACTATAACCAATTCTTCCTCTGCTCGTCTCCACCTGAGGGCTATAACTGCATCAAAGCGCACAAAACCAGACTTGCATCAGATTTGGGAAGATTCTCCTGGGAGTACATTTCGACCAAAGGCCCGACCACTTTTCCAATATCGTTACTACAGTGAACAGCAGCCTTTGAGACTGAATCCTGCTGAGGTGTGCGAGGTTATTGCTGCAGTTTGCTCTGAAATGTCTTCACCCATCGCTAATCCCCTTACAGTAACTTCTAGGTTAAGTACAAATAGTGGGAAGCCATCGATGGATGTGGCCGTGAGCGTTCTCGTAAAACTCATTATTGACATGTATGTTTTGGATTCTGGGATTGCTGCACCTCTCACTTTATCCATGCTTGAGGAAATGCTTAGTTCTCCAAGATCAACCTGCAGAGTTCGTGCATTTGATTTAATCTTGAACCTTGGCGTTCATGCTCACTTGTTAGAACCAATCGCACTGGATGACAGTTCTGCAATTGAAGAAGAGTATTCTCAAGAATCATATCTTGCGGAAGAAACCCAATTTAATTCACAGGGGAAGAAAAATCCTGATTCTCCTAACAATATCAGTGCAACTTCATCTATTAACAAATTCGAATGTTGGATTCTTAACATCTTGTATGAGATACTGCTTCTTCTTGTCCAAATTGAAGAGAAGGAAGAATCTGTCTGGACATCTGCTTTGAGCTGTTTACTTTATTTTGTTTGTGACAGAGGCAGACTCAGGAGAAGCCGACTAAAGGGTCTTGACATAAGGGTTATTAAGGCATTTCTAGAAACTAGCAGAAGAAATTCTTGGGCTGAAATAGTTCATTGCAGGCTCATTTGCTTGTTAACAAATATGTTTTATCAAGTCCCGGAGGAGTCCACAGATGTTGCTCCAAGCCCTATATTTCTTGTAGATCAGGTGGATCTGGTTGGAGGAACTAAGTTTATTTTCTTTGAGTATTCTCTAGCAAACTCAAGAGAAGAACGGCGAAACCTCTTTCTGGTGCTTTTTGATTATGTTTTGCATCAAATAAATGAGTCTTGCATCACAACTGGAGGTATGGAGTATAGTGATGATGAGATACATCCCCTTGCAGCCCTGTTCAGCCTCGCCAATGCACCTGAGGCTTTTTACATCTCAGTTAAGCTTGGAGTGGAAGGTGTTGGAGAGATCTTGAAAGCGTCTATCTCATCAGCATTGTGTAGATATCCTAATAGCGAGCGACTAAATTTGCTCTTGGAGAACGTAATGGAGAAATTTAATACAATAATCAAATCGTTTACACATTTGGACAATGAGTTCTCTTATATGATACAGATAACCAAATCTCTCAAACTTTTTGAAAGCATTCAAGGTTCTTTATTAAGAAATGGTGTTAGCATGAAATCCAAGCTATCATGGGCCACTCTGCATTCCCTGATTCATTCAGAGAGAATTGCTTATCGTCAAAATGGTTATGTCTGGCTAGGGGATCTCCTTTTTGAAGAAATAACTGGTGAAAGGGACGAAAGTATGTGGACAAATGTGAAAAGATTACAGCAGAGAATTGCGTACGCTGGTTTGAATGATTATTCAACAACTTCAGACGTGCCACTTTCAATCTGGCTCATGTGCGGACTTTTGAAGTCAAAACACAACTTCATTAGATGGGGCTTTTTATTTGTTGTAGAAAGACTTCTTATGCGATGCAAATTTTTGTTGCATGAGAATGAATTGCGAAATTCTGGCAGCATCGACATTCGCCAGGCATCAAAAGATAGCCGTCTAGAGAAAGCTAATGCAGTGATAGACATAATGTGCAGTTCTCTTTTTTTGGTTTTTCAGATAAATGAAACTGATCGCACCAATATTTTAAAGATGTGTGACATACTCTTCTCTCAATTATGCTTGAGAGTACCACAAGTTTCTGACTTACCAATTGGAGATGATATGCCACGTGGCAGAGTTATGGATTACTCAGGTGAAAGTAAAACGATAGGGGTTACTGAATCTGAAGCTAAACTTGAGGAGAAAAGCAGATTTATTAAAACTTATAATAATCCTCTTGATCATGAGACAGCCTCCATGGCTGCATTACTGCTTCAGGGACAGACAATTGTCCCGATGCAGTTGATCTCGCATGTTCCTGCTGCTCTGTTCTACTGGCCATTGATTCAACTTGCTGGAGCAGCAACAGACAACATTGCTTTGGGTGTTGCTGTTGGAAGCCAAGCAAGAGGGAACCACCCAGGTGCCGCCTCTGACATACGGTCTGCACTGCTCTTACTCCTGATTGCTAAGTGCAGTTCTGATTCATCTGCTTTCCAAGAAGTGGATGGGGAACAATTTTTCAGAGAGCTTCTAGATGATACGGATTCAAGGGTGGCTTATTACTCTTCAGCGTTTCTTTTGAAGCGTATGATGACAGAAAAACCTGAAAAGTACCAATACATGCTTCAGAATCTTGTAATTAAAGCTCAGCAGAGCAATAATGAGAAGCTATTGGAAAATCCATACCTTCAGATGCGTGGTATACTTAAGCTAGCAAATGATATGGGCATTGAGTTGTGA
Protein sequence
MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGPSASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYKPSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGALLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGESSEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDLRTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHLRAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCSEMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLSSPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPDSPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRSRLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVDLVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALFSLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFTHLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVWLGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFIRWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQINETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKLEEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIEL
Homology
BLAST of CmoCh04G002160 vs. ExPASy TrEMBL
Match:
A0A6J1FQQ7 (uncharacterized protein LOC111447349 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447349 PE=4 SV=1)
HSP 1 Score: 2314.7 bits (5997), Expect = 0.0e+00
Identity = 1201/1201 (100.00%), Postives = 1201/1201 (100.00%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP
Sbjct: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES
Sbjct: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD
Sbjct: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF
Sbjct: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT
Sbjct: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI
Sbjct: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL
Sbjct: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
Query: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI
Sbjct: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
Query: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV
Sbjct: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
Query: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE
Sbjct: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
Query: 1201 L 1202
L
Sbjct: 1201 L 1201
BLAST of CmoCh04G002160 vs. ExPASy TrEMBL
Match:
A0A6J1ILW0 (uncharacterized protein LOC111476453 OS=Cucurbita maxima OX=3661 GN=LOC111476453 PE=4 SV=1)
HSP 1 Score: 2262.3 bits (5861), Expect = 0.0e+00
Identity = 1177/1201 (98.00%), Postives = 1184/1201 (98.58%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
M+SAFSPSRSPGSSRL HLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP
Sbjct: 1 MSSAFSPSRSPGSSRLHHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNS LNAAS+GE
Sbjct: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSHLNAASSGEP 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSS LQREGDSFVNTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSLLQREGDSFVNTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAK+KDQPWK+LGT DMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKMKDQPWKALGTADMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSS IEEEYSQESYLAEETQFNSQGKKNPD
Sbjct: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSTIEEEYSQESYLAEETQFNSQGKKNPD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTD APSPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDGAPSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGG KFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF
Sbjct: 661 LVGGAKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKS T
Sbjct: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSIT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI
Sbjct: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLLHENELRNSGSI+I QASKDSRLEKANAVIDIMCSSLFLVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSINIGQASKDSRLEKANAVIDIMCSSLFLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRG+VMDYSGESKTIGVTESEAKL
Sbjct: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGKVMDYSGESKTIGVTESEAKL 1020
Query: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI
Sbjct: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
Query: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAF + F RELLDDTDSRV
Sbjct: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFXPLG---FCRELLDDTDSRV 1140
Query: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE
Sbjct: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1198
Query: 1201 L 1202
L
Sbjct: 1201 L 1198
BLAST of CmoCh04G002160 vs. ExPASy TrEMBL
Match:
A0A1S4E3E3 (uncharacterized protein LOC103500216 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103500216 PE=4 SV=1)
HSP 1 Score: 2148.2 bits (5565), Expect = 0.0e+00
Identity = 1114/1210 (92.07%), Postives = 1150/1210 (95.04%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
M+S FSPSRSPGSSRLQ LGPVSGVSRLRSSSLKKPPEPLRRA+ DCLSSSAANSHHGGP
Sbjct: 1 MSSTFSPSRSPGSSRLQQLGPVSGVSRLRSSSLKKPPEPLRRAVTDCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASV+VAEASRTLRDYLA PATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVLVAEASRTLRDYLAAPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTI ECSFSPNRRSSPW+ SLSQ SAA TT STFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTISECSFSPNRRSSPWSQSLSQPSAAPTTSSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
L+KSLKYVRSLV QHIPRRSFQPAAFAGAPSMSRQ LPALSSMLSRSFNSQLNAAS+ ES
Sbjct: 181 LIKSLKYVRSLVGQHIPRRSFQPAAFAGAPSMSRQSLPALSSMLSRSFNSQLNAASSAES 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDG VDLEYI+ DALKWRWLGE RSS QRE D+F NTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGTVDLEYISLDALKWRWLGEQRSSLFQRESDNFANTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAK+KDQPWKS GT DMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKMKDQPWKSFGTADMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKP LHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPGLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTC+VRAFDLILNLGVHAHLLEPI LD++S IEEEYSQESYLAEE Q NSQGKKN D
Sbjct: 481 SPRSTCKVRAFDLILNLGVHAHLLEPITLDENSTIEEEYSQESYLAEEAQLNSQGKKNLD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SP+NISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPDNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQV E+ T+ A SPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVSEDPTEGASSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGGTKFIF EYSLANSREERRNLFLVLFDYVLHQINESCITTG MEY DDEI PLA LF
Sbjct: 661 LVGGTKFIFLEYSLANSREERRNLFLVLFDYVLHQINESCITTGVMEYGDDEIQPLANLF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
+LANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLN+LL+N+MEKFNTIIKSFT
Sbjct: 721 TLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNMLLDNIMEKFNTIIKSFT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGS+LRNGVSMKSKLSWATLHSL+HSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSMLRNGVSMKSKLSWATLHSLLHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEIT ERDE+MWTNVK+LQQRI YAG+NDYSTTSD+PLSIWLMCGLLKSKH I
Sbjct: 841 LGDLLFEEITSERDENMWTNVKKLQQRITYAGVNDYSTTSDIPLSIWLMCGLLKSKHPII 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLL+ENE+RNSGS D+ Q SKD+RLEKANAVIDIMCS+L+LVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLNENEMRNSGSNDLGQVSKDTRLEKANAVIDIMCSALYLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDR NILKMCDILFSQLCLRVPQ SDLPIGDD+P GRV+DYSGESKT GV ESEAKL
Sbjct: 961 INETDRINILKMCDILFSQLCLRVPQASDLPIGDDLPHGRVIDYSGESKTTGVFESEAKL 1020
Query: 1021 ---------EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ 1080
EEK R+ KTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ
Sbjct: 1021 DGNFFGELKEEKGRYSKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ 1080
Query: 1081 LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRE 1140
LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDS AFQEVDGEQFFRE
Sbjct: 1081 LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSCAFQEVDGEQFFRE 1140
Query: 1141 LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGIL 1200
LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQ+MLQNLVIKAQQSNNEKLLENPYLQMRGIL
Sbjct: 1141 LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQHMLQNLVIKAQQSNNEKLLENPYLQMRGIL 1200
Query: 1201 KLANDMGIEL 1202
KLANDMG+EL
Sbjct: 1201 KLANDMGVEL 1210
BLAST of CmoCh04G002160 vs. ExPASy TrEMBL
Match:
A0A0A0KS77 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G182070 PE=4 SV=1)
HSP 1 Score: 2143.6 bits (5553), Expect = 0.0e+00
Identity = 1112/1210 (91.90%), Postives = 1148/1210 (94.88%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
M+S FSPSRSPGSSRLQ LGPVSGVSRLRSSSLKKPPEPLRRA+ DCLSSSAANSHHGGP
Sbjct: 1 MSSTFSPSRSPGSSRLQQLGPVSGVSRLRSSSLKKPPEPLRRAVTDCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASV+VAEASRTLRDYLA PATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVLVAEASRTLRDYLAAPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTI ECSFSPNRRSSPW+ SLSQ SAA TT STFSPLPVSSIASG+
Sbjct: 121 PSEETLMQIDRFCLNTISECSFSPNRRSSPWSQSLSQPSAAPTTSSTFSPLPVSSIASGS 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
L+KSLKYVRSLV QHIPRRSFQPAAFAGAPSMSRQ LPALSSMLSRSFNSQLNAAS+ ES
Sbjct: 181 LIKSLKYVRSLVGQHIPRRSFQPAAFAGAPSMSRQSLPALSSMLSRSFNSQLNAASSAES 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDG VDLEYI+ DALKWRWLGE R S QRE D+F NTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGTVDLEYISLDALKWRWLGEQRLSLFQRESDNFANTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAK+KDQPWKS GT DMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKMKDQPWKSFGTADMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKP LHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPGLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTC+VRAFDLILNLGVHAHLLEPI LD++S IEEEYSQESYLAEE Q NS GK N D
Sbjct: 481 SPRSTCKVRAFDLILNLGVHAHLLEPITLDENSTIEEEYSQESYLAEEAQLNSHGKNNLD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SPNNI+ATSSIN FECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPNNINATSSINNFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQV E+ T+ A SPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVSEDPTEGASSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGGTKFIF EYSLANSREERRNLFLVLFDYVLHQINESCITTG MEY DDEI PLA LF
Sbjct: 661 LVGGTKFIFLEYSLANSREERRNLFLVLFDYVLHQINESCITTGVMEYGDDEIQPLANLF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
+LANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLN+LLEN+MEKFNTIIKSFT
Sbjct: 721 TLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNMLLENIMEKFNTIIKSFT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGS+LRNGVSMKSKLSWATLHSL+HSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSMLRNGVSMKSKLSWATLHSLLHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEIT ERDE+MWTNVK+LQQRI YAG+NDYSTTSD+PLSIWLMCGLLKSKH I
Sbjct: 841 LGDLLFEEITSERDENMWTNVKKLQQRITYAGVNDYSTTSDIPLSIWLMCGLLKSKHPII 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLL+ENE+RNSGS D+ QASKD+RLEKANAVIDIMCS+LFLVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLNENEMRNSGSNDLGQASKDTRLEKANAVIDIMCSALFLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDR NILKMCDILFSQLCLRVPQ SDLPIGDD+P GRV+DYSGESKT G+ ESEAKL
Sbjct: 961 INETDRINILKMCDILFSQLCLRVPQSSDLPIGDDLPHGRVIDYSGESKTTGLFESEAKL 1020
Query: 1021 ---------EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ 1080
EEK R+ KTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ
Sbjct: 1021 DGNFFGELKEEKGRYSKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ 1080
Query: 1081 LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRE 1140
LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRE
Sbjct: 1081 LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRE 1140
Query: 1141 LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGIL 1200
LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQ+MLQNLVIKAQQSNNEKLLENPYLQMRGIL
Sbjct: 1141 LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQHMLQNLVIKAQQSNNEKLLENPYLQMRGIL 1200
Query: 1201 KLANDMGIEL 1202
KLANDMG+EL
Sbjct: 1201 KLANDMGVEL 1210
BLAST of CmoCh04G002160 vs. ExPASy TrEMBL
Match:
A0A6J1FPP3 (uncharacterized protein LOC111447349 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111447349 PE=4 SV=1)
HSP 1 Score: 2127.4 bits (5511), Expect = 0.0e+00
Identity = 1100/1100 (100.00%), Postives = 1100/1100 (100.00%), Query Frame = 0
Query: 102 PAVVARSVALLKRYLLRYKPSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAA 161
PAVVARSVALLKRYLLRYKPSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAA
Sbjct: 19 PAVVARSVALLKRYLLRYKPSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAA 78
Query: 162 TTTPSTFSPLPVSSIASGALLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALS 221
TTTPSTFSPLPVSSIASGALLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALS
Sbjct: 79 TTTPSTFSPLPVSSIASGALLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALS 138
Query: 222 SMLSRSFNSQLNAASTGESSEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGE 281
SMLSRSFNSQLNAASTGESSEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGE
Sbjct: 139 SMLSRSFNSQLNAASTGESSEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGE 198
Query: 282 LRSSFLQREGDSFVNTQDLRTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQL 341
LRSSFLQREGDSFVNTQDLRTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQL
Sbjct: 199 LRSSFLQREGDSFVNTQDLRTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQL 258
Query: 342 LQPSPVATITNSSSARLHLRAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQ 401
LQPSPVATITNSSSARLHLRAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQ
Sbjct: 259 LQPSPVATITNSSSARLHLRAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQ 318
Query: 402 QPLRLNPAEVCEVIAAVCSEMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYV 461
QPLRLNPAEVCEVIAAVCSEMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYV
Sbjct: 319 QPLRLNPAEVCEVIAAVCSEMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYV 378
Query: 462 LDSGIAAPLTLSMLEEMLSSPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQ 521
LDSGIAAPLTLSMLEEMLSSPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQ
Sbjct: 379 LDSGIAAPLTLSMLEEMLSSPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQ 438
Query: 522 ESYLAEETQFNSQGKKNPDSPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWT 581
ESYLAEETQFNSQGKKNPDSPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWT
Sbjct: 439 ESYLAEETQFNSQGKKNPDSPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWT 498
Query: 582 SALSCLLYFVCDRGRLRRSRLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQV 641
SALSCLLYFVCDRGRLRRSRLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQV
Sbjct: 499 SALSCLLYFVCDRGRLRRSRLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQV 558
Query: 642 PEESTDVAPSPIFLVDQVDLVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCI 701
PEESTDVAPSPIFLVDQVDLVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCI
Sbjct: 559 PEESTDVAPSPIFLVDQVDLVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCI 618
Query: 702 TTGGMEYSDDEIHPLAALFSLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERL 761
TTGGMEYSDDEIHPLAALFSLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERL
Sbjct: 619 TTGGMEYSDDEIHPLAALFSLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERL 678
Query: 762 NLLLENVMEKFNTIIKSFTHLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWAT 821
NLLLENVMEKFNTIIKSFTHLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWAT
Sbjct: 679 NLLLENVMEKFNTIIKSFTHLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWAT 738
Query: 822 LHSLIHSERIAYRQNGYVWLGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSD 881
LHSLIHSERIAYRQNGYVWLGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSD
Sbjct: 739 LHSLIHSERIAYRQNGYVWLGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSD 798
Query: 882 VPLSIWLMCGLLKSKHNFIRWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRL 941
VPLSIWLMCGLLKSKHNFIRWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRL
Sbjct: 799 VPLSIWLMCGLLKSKHNFIRWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRL 858
Query: 942 EKANAVIDIMCSSLFLVFQINETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRV 1001
EKANAVIDIMCSSLFLVFQINETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRV
Sbjct: 859 EKANAVIDIMCSSLFLVFQINETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRV 918
Query: 1002 MDYSGESKTIGVTESEAKLEEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVP 1061
MDYSGESKTIGVTESEAKLEEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVP
Sbjct: 919 MDYSGESKTIGVTESEAKLEEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVP 978
Query: 1062 AALFYWPLIQLAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQ 1121
AALFYWPLIQLAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQ
Sbjct: 979 AALFYWPLIQLAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQ 1038
Query: 1122 EVDGEQFFRELLDDTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLE 1181
EVDGEQFFRELLDDTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLE
Sbjct: 1039 EVDGEQFFRELLDDTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLE 1098
Query: 1182 NPYLQMRGILKLANDMGIEL 1202
NPYLQMRGILKLANDMGIEL
Sbjct: 1099 NPYLQMRGILKLANDMGIEL 1118
BLAST of CmoCh04G002160 vs. NCBI nr
Match:
XP_022942239.1 (uncharacterized protein LOC111447349 isoform X1 [Cucurbita moschata] >XP_022942241.1 uncharacterized protein LOC111447349 isoform X1 [Cucurbita moschata])
HSP 1 Score: 2314.7 bits (5997), Expect = 0.0e+00
Identity = 1201/1201 (100.00%), Postives = 1201/1201 (100.00%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP
Sbjct: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES
Sbjct: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD
Sbjct: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF
Sbjct: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT
Sbjct: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI
Sbjct: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL
Sbjct: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
Query: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI
Sbjct: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
Query: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV
Sbjct: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
Query: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE
Sbjct: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
Query: 1201 L 1202
L
Sbjct: 1201 L 1201
BLAST of CmoCh04G002160 vs. NCBI nr
Match:
KAG6600050.1 (hypothetical protein SDJN03_05283, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2303.9 bits (5969), Expect = 0.0e+00
Identity = 1195/1201 (99.50%), Postives = 1199/1201 (99.83%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP
Sbjct: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAAS+GES
Sbjct: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASSGES 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLS+NSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSSNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSS IEEEYSQESYLAEETQFNSQGKKNP+
Sbjct: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSTIEEEYSQESYLAEETQFNSQGKKNPE 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTD APSPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDGAPSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGGTKFIFFEYSLA+SREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF
Sbjct: 661 LVGGTKFIFFEYSLASSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT
Sbjct: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI
Sbjct: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL
Sbjct: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
Query: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI
Sbjct: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
Query: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV
Sbjct: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
Query: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE
Sbjct: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
Query: 1201 L 1202
L
Sbjct: 1201 L 1201
BLAST of CmoCh04G002160 vs. NCBI nr
Match:
XP_023532081.1 (uncharacterized protein LOC111794351 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023532090.1 uncharacterized protein LOC111794351 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2290.8 bits (5935), Expect = 0.0e+00
Identity = 1188/1201 (98.92%), Postives = 1194/1201 (99.42%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
M+SAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP
Sbjct: 1 MSSAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAAS+G+S
Sbjct: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASSGQS 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
+EHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSS LQREGDSFVNTQDL
Sbjct: 241 AEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSLLQREGDSFVNTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAKVKDQPWK+LGT DMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKALGTADMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSS IEEEYSQESYLAEETQFNSQGKKNPD
Sbjct: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSTIEEEYSQESYLAEETQFNSQGKKNPD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTD APSPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDGAPSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF
Sbjct: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT
Sbjct: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI
Sbjct: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLLHENELRNSGSI+I QASKDSRLEKANAVIDIMCSSLFLVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSINIGQASKDSRLEKANAVIDIMCSSLFLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDRTNILKMCDILFSQLCLRVPQVSDL IGDDMPRGRVMDYSGESKTIGVTESEAKL
Sbjct: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLSIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
Query: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
EEK RFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI
Sbjct: 1021 EEKGRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
Query: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV
Sbjct: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
Query: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE
Sbjct: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
Query: 1201 L 1202
L
Sbjct: 1201 L 1201
BLAST of CmoCh04G002160 vs. NCBI nr
Match:
XP_022975884.1 (uncharacterized protein LOC111476453 [Cucurbita maxima])
HSP 1 Score: 2262.3 bits (5861), Expect = 0.0e+00
Identity = 1177/1201 (98.00%), Postives = 1184/1201 (98.58%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
M+SAFSPSRSPGSSRL HLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP
Sbjct: 1 MSSAFSPSRSPGSSRLHHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNS LNAAS+GE
Sbjct: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSHLNAASSGEP 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSS LQREGDSFVNTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSLLQREGDSFVNTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAK+KDQPWK+LGT DMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKMKDQPWKALGTADMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSS IEEEYSQESYLAEETQFNSQGKKNPD
Sbjct: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSTIEEEYSQESYLAEETQFNSQGKKNPD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTD APSPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDGAPSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGG KFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF
Sbjct: 661 LVGGAKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKS T
Sbjct: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSIT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI
Sbjct: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLLHENELRNSGSI+I QASKDSRLEKANAVIDIMCSSLFLVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSINIGQASKDSRLEKANAVIDIMCSSLFLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRG+VMDYSGESKTIGVTESEAKL
Sbjct: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGKVMDYSGESKTIGVTESEAKL 1020
Query: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI
Sbjct: 1021 EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAGAATDNI 1080
Query: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLDDTDSRV 1140
ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAF + F RELLDDTDSRV
Sbjct: 1081 ALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFXPLG---FCRELLDDTDSRV 1140
Query: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1200
AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE
Sbjct: 1141 AYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLANDMGIE 1198
Query: 1201 L 1202
L
Sbjct: 1201 L 1198
BLAST of CmoCh04G002160 vs. NCBI nr
Match:
XP_016902743.1 (PREDICTED: uncharacterized protein LOC103500216 isoform X1 [Cucumis melo])
HSP 1 Score: 2148.2 bits (5565), Expect = 0.0e+00
Identity = 1114/1210 (92.07%), Postives = 1150/1210 (95.04%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSSAANSHHGGP 60
M+S FSPSRSPGSSRLQ LGPVSGVSRLRSSSLKKPPEPLRRA+ DCLSSSAANSHHGGP
Sbjct: 1 MSSTFSPSRSPGSSRLQQLGPVSGVSRLRSSSLKKPPEPLRRAVTDCLSSSAANSHHGGP 60
Query: 61 SASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
SASV+VAEASRTLRDYLA PATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK
Sbjct: 61 SASVLVAEASRTLRDYLAAPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLRYK 120
Query: 121 PSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIASGA 180
PSEETLMQIDRFCLNTI ECSFSPNRRSSPW+ SLSQ SAA TT STFSPLPVSSIASGA
Sbjct: 121 PSEETLMQIDRFCLNTISECSFSPNRRSSPWSQSLSQPSAAPTTSSTFSPLPVSSIASGA 180
Query: 181 LLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTGES 240
L+KSLKYVRSLV QHIPRRSFQPAAFAGAPSMSRQ LPALSSMLSRSFNSQLNAAS+ ES
Sbjct: 181 LIKSLKYVRSLVGQHIPRRSFQPAAFAGAPSMSRQSLPALSSMLSRSFNSQLNAASSAES 240
Query: 241 SEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQDL 300
SEHKDSTVLSISNLSNIEEVDG VDLEYI+ DALKWRWLGE RSS QRE D+F NTQDL
Sbjct: 241 SEHKDSTVLSISNLSNIEEVDGTVDLEYISLDALKWRWLGEQRSSLFQRESDNFANTQDL 300
Query: 301 RTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARLHL 360
RTRNLLEVGAAALLVGDTEAK+KDQPWKS GT DMPYVDQLLQPSPVATITNSSSARLHL
Sbjct: 301 RTRNLLEVGAAALLVGDTEAKMKDQPWKSFGTADMPYVDQLLQPSPVATITNSSSARLHL 360
Query: 361 RAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
RAITASKRTKP LHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS
Sbjct: 361 RAITASKRTKPGLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAVCS 420
Query: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS
Sbjct: 421 EMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEMLS 480
Query: 481 SPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKNPD 540
SPRSTC+VRAFDLILNLGVHAHLLEPI LD++S IEEEYSQESYLAEE Q NSQGKKN D
Sbjct: 481 SPRSTCKVRAFDLILNLGVHAHLLEPITLDENSTIEEEYSQESYLAEEAQLNSQGKKNLD 540
Query: 541 SPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
SP+NISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS
Sbjct: 541 SPDNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLRRS 600
Query: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPEESTDVAPSPIFLVDQVD 660
RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQV E+ T+ A SPIFLVDQVD
Sbjct: 601 RLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVSEDPTEGASSPIFLVDQVD 660
Query: 661 LVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHPLAALF 720
LVGGTKFIF EYSLANSREERRNLFLVLFDYVLHQINESCITTG MEY DDEI PLA LF
Sbjct: 661 LVGGTKFIFLEYSLANSREERRNLFLVLFDYVLHQINESCITTGVMEYGDDEIQPLANLF 720
Query: 721 SLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTIIKSFT 780
+LANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLN+LL+N+MEKFNTIIKSFT
Sbjct: 721 TLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNMLLDNIMEKFNTIIKSFT 780
Query: 781 HLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQNGYVW 840
HLDNEFSYMIQITKSLKLFESIQGS+LRNGVSMKSKLSWATLHSL+HSERIAYRQNGYVW
Sbjct: 781 HLDNEFSYMIQITKSLKLFESIQGSMLRNGVSMKSKLSWATLHSLLHSERIAYRQNGYVW 840
Query: 841 LGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKSKHNFI 900
LGDLLFEEIT ERDE+MWTNVK+LQQRI YAG+NDYSTTSD+PLSIWLMCGLLKSKH I
Sbjct: 841 LGDLLFEEITSERDENMWTNVKKLQQRITYAGVNDYSTTSDIPLSIWLMCGLLKSKHPII 900
Query: 901 RWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSLFLVFQ 960
RWGFLFVVERLLMRCKFLL+ENE+RNSGS D+ Q SKD+RLEKANAVIDIMCS+L+LVFQ
Sbjct: 901 RWGFLFVVERLLMRCKFLLNENEMRNSGSNDLGQVSKDTRLEKANAVIDIMCSALYLVFQ 960
Query: 961 INETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYSGESKTIGVTESEAKL 1020
INETDR NILKMCDILFSQLCLRVPQ SDLPIGDD+P GRV+DYSGESKT GV ESEAKL
Sbjct: 961 INETDRINILKMCDILFSQLCLRVPQASDLPIGDDLPHGRVIDYSGESKTTGVFESEAKL 1020
Query: 1021 ---------EEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ 1080
EEK R+ KTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ
Sbjct: 1021 DGNFFGELKEEKGRYSKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQ 1080
Query: 1081 LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRE 1140
LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDS AFQEVDGEQFFRE
Sbjct: 1081 LAGAATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSCAFQEVDGEQFFRE 1140
Query: 1141 LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGIL 1200
LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQ+MLQNLVIKAQQSNNEKLLENPYLQMRGIL
Sbjct: 1141 LLDDTDSRVAYYSSAFLLKRMMTEKPEKYQHMLQNLVIKAQQSNNEKLLENPYLQMRGIL 1200
Query: 1201 KLANDMGIEL 1202
KLANDMG+EL
Sbjct: 1201 KLANDMGVEL 1210
BLAST of CmoCh04G002160 vs. TAIR 10
Match:
AT3G12590.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; Has 50 Blast hits to 41 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 43; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink). )
HSP 1 Score: 1456.8 bits (3770), Expect = 0.0e+00
Identity = 782/1203 (65.00%), Postives = 934/1203 (77.64%), Query Frame = 0
Query: 1 MASAFSPSRSPGSSRLQHLGPVSGVSRLRSSSLKKPPEPLRRAIADCLSSS--AANSHHG 60
M+S +SP +SPGSSRL LG SRLRSSS KKPPEPLRRA+ADCLSSS NSHHG
Sbjct: 1 MSSTYSPGQSPGSSRLLQLGAAGSASRLRSSSSKKPPEPLRRAVADCLSSSPPPVNSHHG 60
Query: 61 GPSASVVVAEASRTLRDYLATPATTDLAYCVILEHTIAERERSPAVVARSVALLKRYLLR 120
S+ +EA R LRDYL+ ATTDLAY ++LEHTIAER+RSPAVV R VALLKRY+LR
Sbjct: 61 A-IPSMAPSEALRNLRDYLSASATTDLAYNMLLEHTIAERDRSPAVVTRCVALLKRYILR 120
Query: 121 YKPSEETLMQIDRFCLNTIGECSFSPNRRSSPWTHSLSQASAATTTPSTFSPLPVSSIAS 180
YKP EETL+Q+D+FC+N I EC S ++S P + P+ SPLPVSS AS
Sbjct: 121 YKPGEETLLQVDKFCVNLIAECDASLKQKSLP----------VLSAPAGASPLPVSSFAS 180
Query: 181 GALLKSLKYVRSLVAQHIPRRSFQPAAFAGAPSMSRQPLPALSSMLSRSFNSQLNAASTG 240
AL+KSL YVRSLVA HIPRRSFQPAAFAGA SRQ LP+LSS+LS+SFNSQL+ A+
Sbjct: 181 AALVKSLHYVRSLVALHIPRRSFQPAAFAGATLASRQLLPSLSSLLSKSFNSQLSPANAA 240
Query: 241 ESSEHKDSTVLSISNLSNIEEVDGMVDLEYIAHDALKWRWLGELRSSFLQREGDSFVNTQ 300
ES + KD+ LS+SNLSNI+E++ M D EYI+ D L WRW+GEL+ S E + VN Q
Sbjct: 241 ESPQKKDAANLSVSNLSNIQEINAMEDTEYISSDLLNWRWVGELQLSSASSESERPVNLQ 300
Query: 301 DLRTRNLLEVGAAALLVGDTEAKVKDQPWKSLGTTDMPYVDQLLQPSPVATITNSSSARL 360
D+ NLLEVGAA LLVGD EAK+K Q WK GT +MPY++QLLQP+ V ITNS+SAR
Sbjct: 301 DMNNCNLLEVGAAGLLVGDMEAKMKGQHWKYFGTAEMPYLEQLLQPASVTMITNSASARS 360
Query: 361 HLRAITASKRTKPDLHQIWEDSPGSTFRPKARPLFQYRYYSEQQPLRLNPAEVCEVIAAV 420
HLRAITASKRT+ QIW+DS +TFRP+ARPLFQYR+YSEQQPLRLNPAEV EVIAAV
Sbjct: 361 HLRAITASKRTRAGPQQIWDDSTVNTFRPRARPLFQYRHYSEQQPLRLNPAEVGEVIAAV 420
Query: 421 CSEMSSPIANPLTVTSRLSTNSGKPSMDVAVSVLVKLIIDMYVLDSGIAAPLTLSMLEEM 480
CSE SS +N +TV+ +L++ +GKPSMDVAVSVL+KL+IDMYVLD+ IAAPLTLSMLEEM
Sbjct: 421 CSEASSTPSNQMTVSPQLTSKTGKPSMDVAVSVLIKLVIDMYVLDARIAAPLTLSMLEEM 480
Query: 481 LSSPRSTCRVRAFDLILNLGVHAHLLEPIALDDSSAIEEEYSQESYLAEETQFNSQGKKN 540
L S ++ CR+R FDLILNLGVHA LLEP+ D+++ IEE+Y+QE+Y+ E + QG +
Sbjct: 481 LCSTKAPCRIRVFDLILNLGVHAQLLEPMISDNATTIEEDYAQETYIDNENRLLLQGTRT 540
Query: 541 PDSPNNISATSSINKFECWILNILYEILLLLVQIEEKEESVWTSALSCLLYFVCDRGRLR 600
D P S +S+I FE WIL IL+EILLLLVQ+EEKEE VW SALSCLLYF+CDRG++R
Sbjct: 541 KDLPKMSSTSSAIENFESWILKILFEILLLLVQVEEKEECVWASALSCLLYFICDRGKIR 600
Query: 601 RSRLKGLDIRVIKAFLETSRRNSWAEIVHCRLICLLTNMFYQVPE---ESTDVAPSPIFL 660
R++L GLDIRVIKA L TS+RNSW+E+VH +LIC++TNMFYQ PE + ++ + FL
Sbjct: 601 RNQLNGLDIRVIKALLGTSKRNSWSEVVHSKLICIMTNMFYQSPEPEGSNKAISSASNFL 660
Query: 661 VDQVDLVGGTKFIFFEYSLANSREERRNLFLVLFDYVLHQINESCITTGGMEYSDDEIHP 720
+DQVDL+GG ++IFFEYSLA +REERRNL+ VLFDYVLHQINE+C + G EY+DDEI P
Sbjct: 661 IDQVDLIGGVEYIFFEYSLATTREERRNLYSVLFDYVLHQINEACSSAGLSEYTDDEIQP 720
Query: 721 LAALFSLANAPEAFYISVKLGVEGVGEILKASISSALCRYPNSERLNLLLENVMEKFNTI 780
LA +LA+APEAFYISVKLGVEG+GEIL+ SI++AL + NSERLN LL N+ EKF+TI
Sbjct: 721 LAVRLALADAPEAFYISVKLGVEGIGEILRRSIAAALSGFSNSERLNQLLANITEKFDTI 780
Query: 781 IKSFTHLDNEFSYMIQITKSLKLFESIQGSLLRNGVSMKSKLSWATLHSLIHSERIAYRQ 840
I SFTHLD EF ++ QITKS K ESI LRN +SM L+WATLHSL+HSER YRQ
Sbjct: 781 IGSFTHLDKEFLHLKQITKSSKFMESILD--LRNDISMSVNLAWATLHSLLHSERTTYRQ 840
Query: 841 NGYVWLGDLLFEEITGERDESMWTNVKRLQQRIAYAGLNDYSTTSDVPLSIWLMCGLLKS 900
NGY+WLGDLL EI+ E S+W ++K LQQ+IA+ G +D TSDVP+SI L+CGLLKS
Sbjct: 841 NGYIWLGDLLIAEISEESGGSIWLSIKDLQQKIAHCGTSDSLVTSDVPISIHLLCGLLKS 900
Query: 901 KHNFIRWGFLFVVERLLMRCKFLLHENELRNSGSIDIRQASKDSRLEKANAVIDIMCSSL 960
+++ IRWGFLF++ERLLMR KFLL ENE + S Q KD RLEKANAVIDIM S+L
Sbjct: 901 RNSVIRWGFLFILERLLMRSKFLLDENETQRSTGGVATQDHKDKRLEKANAVIDIMSSAL 960
Query: 961 FLVFQINETDRTNILKMCDILFSQLCLRVPQVSDLPIGDDMPRGRVMDYS-GESKTIGVT 1020
L+ QINETDR NILKMCDILFSQLCL+V + + + R D S S V
Sbjct: 961 SLMAQINETDRINILKMCDILFSQLCLKVLSTDEDAVPNSADRNSKFDTSHRNSYKESVD 1020
Query: 1021 ESEAKLEEKSRFIKTYNNPLDHETASMAALLLQGQTIVPMQLISHVPAALFYWPLIQLAG 1080
E + K + + T ETASMAA+LL+GQ IVPMQL++ VPAALFYWPLIQLAG
Sbjct: 1021 EGDTKPRYNNVSVSTC------ETASMAAMLLRGQAIVPMQLVARVPAALFYWPLIQLAG 1080
Query: 1081 AATDNIALGVAVGSQARGNHPGAASDIRSALLLLLIAKCSSDSSAFQEVDGEQFFRELLD 1140
AATDNIALGVAVGS+ RGN PGA SDIR+ LLLLLI KC++D+ AFQEV GE+FFRELLD
Sbjct: 1081 AATDNIALGVAVGSKGRGNIPGATSDIRATLLLLLIGKCTADTVAFQEVGGEEFFRELLD 1140
Query: 1141 DTDSRVAYYSSAFLLKRMMTEKPEKYQYMLQNLVIKAQQSNNEKLLENPYLQMRGILKLA 1198
DTDSRVAYYSSAFLLKRMMTE+PEKYQ MLQ LV KAQQSNNEKLLENPYLQM GIL+L+
Sbjct: 1141 DTDSRVAYYSSAFLLKRMMTEEPEKYQNMLQKLVFKAQQSNNEKLLENPYLQMCGILQLS 1184
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FQQ7 | 0.0e+00 | 100.00 | uncharacterized protein LOC111447349 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1ILW0 | 0.0e+00 | 98.00 | uncharacterized protein LOC111476453 OS=Cucurbita maxima OX=3661 GN=LOC111476453... | [more] |
A0A1S4E3E3 | 0.0e+00 | 92.07 | uncharacterized protein LOC103500216 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A0A0KS77 | 0.0e+00 | 91.90 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G182070 PE=4 SV=1 | [more] |
A0A6J1FPP3 | 0.0e+00 | 100.00 | uncharacterized protein LOC111447349 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
XP_022942239.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111447349 isoform X1 [Cucurbita moschata] >XP_0229422... | [more] |
KAG6600050.1 | 0.0e+00 | 99.50 | hypothetical protein SDJN03_05283, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023532081.1 | 0.0e+00 | 98.92 | uncharacterized protein LOC111794351 isoform X1 [Cucurbita pepo subsp. pepo] >XP... | [more] |
XP_022975884.1 | 0.0e+00 | 98.00 | uncharacterized protein LOC111476453 [Cucurbita maxima] | [more] |
XP_016902743.1 | 0.0e+00 | 92.07 | PREDICTED: uncharacterized protein LOC103500216 isoform X1 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
AT3G12590.1 | 0.0e+00 | 65.00 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas... | [more] |