Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5, polypeptide, CDS, utr3
AAGAAAGGTTACAACGGAAAATATTATTCTTACCTATAAAAATTATATATATATATATTCAGAATATTAATTATTAATTAATAATAAAAAGTTGATATCTTTTTGTAGGCTTGGGTGGCGAGCGAGGGATCGAACTGCTTCTTTATTCTTACGCTTTGGATGTAACTGGGAGGGAACTTGGAATAGAATCTCATCTGTTGCTCCGTGAAAGGATCAGATACAGAAGCGTTTCATTGGCCGCCTCTGATAGGGTTCGAGCCACCATTTTCCACGCTCTTCTGCTCTGATTTCGTCCAAAAACCCTAATACTCTCTCCACTTTCAATACCTCAATCTTGGGTTTTTGTTATAATCTCAGGGCCTTCGAATTTCCTTAACTGGAGGTTCAAATTTGAATCCCAATTGATGCGGTGAGTGGCTTGAAACTCGTTATTTGCCATTGGGTTTTTTGTTTTTTTTTTGTTTTTTTTCTATTGCTTCCCTCGCGTGTCGTTTTGCGGGATTGTAGTGGGGAAGTCGGTATCTTAACTTTGCAATTCTTACTGTATTGGTTGATGGGATTGTTAGAATCGGTCTGTTCTTCTGTTTTTGTTTGTGTATTGTGGAAAAAAGGACAGTGTGCGAGCGAAGATATTATTCTGTTTCCGGTCGGAGGTAATGCTGTGCGTGTATCGGGCTACAAAATGTACCCGGAACATGGCATTTGATGAAAATTGTTGTTTAGTTATCATTGTTTTACTCAACCGGGTAATGAGTAGTTATGGTTAAAAGTGATGATACTTGGGAGGTTGAAAGCGAAGAATAGAGATATTTCCCTCGATTTTCATTATTTTTAAGCTTCTGCTACCATCAATGAATCCCATGTTTGGGTTTCCGAAGTTACGTTTCCTCACCGATTGCATGCATGATGCCTGGGCTTAAAATTTGTTTTGGATGAGTTTACGACGTGTATGTATATTCGAAGCACAACTTATGTACTGTTGGAACTGCTTGTTGATCACACCCAATCTGCCTTTTTGCTTGACAGGATGATTTATCAGTGGCTTGAGCTGAAGAATGTGCCCGTGCCTTTGTGGTTCTTTTAATACAATTTACCATTGTGTAAGTATAAAATTCTAATTTTATAGGATTTTAGATGTATGGTCCAGCAAATTATGCTTCTCAGTTCGGGCAGGGTCCTCAGAAACCATGGCCACCTGCATACCAACAGCGTGCAGTTGCACCTCCTCCGCCTCCTCCTCCTACCTCATATATGCAACCAGGCCCTCCAATCCCATCACGTCCTATAACTCAACAAGCACCTGCCCCCCCACCTCAGGCTGGTCAACCTCTTCATCTTTCTCAGTCGGGTCCTCACGTTCCACCACCTCCACTCTGTCAGGGCCCATCCGTTCAAGTGCTACCTGGTGGGATCCCAAACATCCGTCAAACTTATTTTCACACTTTTCCACCAGTCCATGGAAGCACACAAGGTTTTCAATTTAACTCAAGTACTCAGCAGAATGTACAACTTTCACAGTCAGGAGTTCAGAACATGCATCATATTCTACCTCCACCACCCCCGCTACCACCACCACCACCACCACCTCCTCCTCATGCTCCTAATCCACCACCACCCCCTCCTCATGCTCCTAATCCAGATTTATTACGGCCTCCACAACCCTCTACAGTAGTACCTGTTCATCCTCCTTCACAAGGACAAACATTGTATGGAGCTCGAGTTCATCCACCATTGCAACAAGGTGGTTTGCAGGTCTTTCCCTCTATACCACAACATCCAACAACGTCCAACTTCCCTACTCCTCCTTTTGGAGGAGTTATGCAATCAAATCTTGGAGAGTCTCATTTGTCTCCAATGGCTCCTCCACCACCACCATCCTCTCCGCCACCTATTCCACCTTCTCCCCCTCCTCCCACGTCGCCCTCCTTTTATTCAATTCCAAGCTCGGGTTCTTCCAACTTGTTGTGCCAGTCTGAATTTGATCCTAGTTCTACCATTAA
TTCTAGTAAAGAATTGAAAGCCTTTGAGTCTAATCAAGGAGGCACACCTACCAGACATTTAGGGGATAACGGACCCAAGCATAAGCATAGAAATTTGGATGGCAGTATTGGCCTTATGATGGGTTCTAAAGTGGACAATGAGATATTGTCAGATAAAGGTAATGTGCAGGATCTTCCTCCATCCCCACCTAAGCCAAAAGATGATAAAATTACCAGGAAGATAGGAGTATTATGCAAGTACATTGCTAATAATGGTTCCAGTTTTGAAGACACTACTCGCCAAAAGGAATTTGGGAATCCAGAGTTTGAATTTTTATATGGTGGTGAACCGGGAAGTGAAGCTGCAATTGGTCATGAATATTTTCTGTGGATGAAGAAGAAATATAGTTTGGATTGCAAAAATAAAGAAATGGAAGAAAAATCTCCAGTGAGATCTTTAAGAATTGGGCCACAATCTGAGTCTTTGACAGTGTCAGCAGCATCCATTTCACCTGAAAATTCCGACATGGAGATGGAAGGTTAGTCTTAGTGTTAATAAATCTCTTCCTATACGGTAACGTTTGACATTGGTGCTGAGTTTATAAGTCTTTTTTCTGGGTTGATTGTTTTCCCTTAACAGATGTTGCGTGATCTATAACCATGAAAGCTTATTACTTGAGACAATCAGAGTGAAATCTATGAAGCCAAGGCAAATTCTCTGCAGATTGCAGGGGTTTTGAGAAACTTACTATTATATTCATCACATGAAAATATTGTGATACTTTTGTTTCATGTGTATACAGCTCTGCTGCTTCTACTAAATTTCTGGTTATAAACGCATCATTCATTTAATTAAAGAGCACATTTAAAAAAATATATATTTGGGCTAGGAATTTAAAGGAGTTATATTTCTTGGCTATTTGGATCTTAGGCATTATTGAAATCCATTTTGTATCCTGTTAAATAGAATGTCTCGTATGTTTCTGGTAATTGTGTTTCTTTATATCTGTTTATCTGTACTTCTGTTTTAAGAGCTTCAATTTTGCTGTGAAGAAAATTTCTCCAATTTAAGTTGATAGGTCAGTTTTCTAATCTAATTTTGATCGAAAATTTTGTCCAGTTGGGAGATTAGAATAAGAAGAACTCAAGGGATATGGTTGTGACGTTGTTTATTTGTAGCATGAGCTCTGTAAATGCTTAATGCAAGTACCTTGTAGTAACTGGAGTACTGATTTTTTTTCCTCCTTGTTTTGTGTGGTTTCTTACTGGTGGCTTCAAGATGATATTACCCCAGATGGTATAGGGGAGGAAACTAGTCACTCCTTTAAAATTCAAAGCTACGAGTGCAAATCAAGAAAAGAAGAGCATGATGCAAAGGATCAGTTACAAGGACCTAAAGATTTGCAAAGAAGCAGCCCAGTGAAGGGGAAAGTAGCTGAAGGTATGCATTTCTATCTTTTTGCGTTTCTTGTTGAATGGGTTCATGCATGGCACAGATTTTGCTTTGTGATATTGTGGTGCTTTGTGTCTTACTGTCAACAAAGTTGAGAGAAAATATTGTCTTTTCACTCAATGAATTCGAAGTAGTTAAATAAGAGTTATATTTCTCTTGCTGCTGCATCTTTTAAAAAAATGAATATGGCAAGACAGAATCTTTTCTTCTTCTTAGAAGGGTTCCACTGGGGTAATAGATAAGAGATTCTGACATCACTAGGAAGAACTCTGCTTATTAAATTATACTTGTGTTTTTCTTTCTGCTGAAAAATTTGAACAATGTAAAGCTATGGTTTGGATATCAAGCATAGTATTCTTCAGTTTTTGCTTTTACTTCTGTTCAAATATGCAGTTCCACAGTTCCTTTCAATTCAACCTAGTTGTGCCATGCAGAGAGGATTTTGGACGAACCTGGGTGAGAGTTAAATTAATTCCGCAGGAACTTTGTAGTTTGTCAGATTAAGTCATGTGAATAAAACCAATGACTCTGATCTGCTATCCTGGTTCTGAAAAGTTGCATTGATTTG
ACAACACATAAAAGGTTTCCTTCAAGGTTTTTGATTTGTGAATTTCTTATGAAGTTTGTTTCTTTTATCGTATTCCTCATGTTGAGATCGCAATCATGGGATTACTGCTTTTTCATCCTCATTTTTGTATGTCTAATAAGCATCTACACTTCCTATTGAAGGTCTTTTTTTTATTTTTATTTTTTAAAGAAAAAGATGGAGAGTCAAAGCTTCTGCTCGAGCATGAAAAATCTGTCAGCCTGGAAGCTTGCCAAGTTCACAGCCCTGTCATAAATACTGCTGGAGTTGTTGAACAGCCTTTAGGAAGCAATTTTGAGATTTCTGTTACCTGCATACAAAATGAAAAAAGTCTAGCAGCTTCTGAAGCTGTTAATTCTAGTCTGTCTACTGAACTTATTATTGGTGGCAGCCCATTCAGACTTATACAGGACTACGCTTCTGATGAAAATTCAGAAACCGATGAGGAATCACACCTTAAAGATGTCAGTTTTGCTATCTCACCTTCAACTCCAGCTTCTTCCAAGACTTCCGGAAAAGACAGTGATAATTTGACTATTCTGGGATCAGAAGGTTCTTGCCAGGTTCAACGGAGTAATGTTCCACCTTGTGAAGCCTCGATGCCTGATTTTGGTTCTCAGTTCCTCTCGGAATCACCAAAACTGATTTTTGATGCCAATGAGGCAAATGTGAGAAGGGCAGGGAATGAACGGAACTATAAAATTCATCAGAATCAAGTTGGCACCCGCACCAGTTCTAAGTCTTTGGATGCAGATGCAGTGAAAGGTCGTAGTGTTGATGTTCTCCATGATAGTCACAAGTTACAAAAAGAGAATGATGAGGAAAAGCAGAAGTTTGGATCATCGCCTGTAAAAATAGATGAGTTTGGAAGATTAGTCAGAGAAGGTGGTAGCGATAGCGATTCAGATGATTCACACTATACAAGGAGACATAAGAAAAGAAGAACTAGAAATAGTAGTGAAAGTCATTCTCCTGTTGACAGGAGGAGGGGGCGAAGGAGTCCGTGGAGAAGAAGGCAGAGGCGAAGTCGCTCACGCAGGTAACAATACTTTTCGCATGTATATATTGTTACTTACAGTGTGAGTTAATTCACTCAATGTTATTAAGGAACAATGTAGCCTAAGCAATTATGTCAGATTAGATATGCACTTGTTTCGTTAACTTTTGAGATTATTGTTAATAGAAAATTTTCTCTTCCATTGTTTCAGTTGGTCTCCTCGTAATCAAAGAGGCAGAAGCAGAAGCAGGTCTCCTGTCAGCAGGCGTACAAGTCAGTTTAATAATGAGAATATGAAACGAGATAAGGGTATGATACGAAAATGCTTTGACTTTCAGCGAGGTAGGTGCTATAGAGGAGCATCTTGTCGCTATGTGCACCATGAACCCAGCAAGAATGATGGATCAAGACACCATAGGAGCAAACATCATGATGTTCATCCAACTTCAGAGAATATAAAGGGTAGAGAGGACACTGTGAACATGTCTAGGGAAGTATCAGATCCTGGGCATATTAAAGTTGAGAATCAGGGGTGCATTCAGCATAATGTGTCCCCAAAAGATGATACCCATGATTGGAAGAAAGGTAGTCCCACTGGTGATCCAGATTTAGATGTAACTAAATGTCAAAGTTCTAGAGATAGAGCTGGCTTAGTTCAAGAAGAATTAATTTATTCCAAAGCAGCAGAGGCTGTCCACATTCACGTTAACGAGAATATTCAAGAAGCAGGGAAGTCTTATGAGCAACTTTCAGTTACGGCTGCCTCACAATGCATGAGCAATGCTGATACTGAGAAATTATCGGGTGATATTTCCATGAGCATGCTGACTTCTGTAGAGAAGTCTTTGGCTCATGCTCAGCAATCCAACATGTTTGCTTCAGAGTTTGAAGCTGCCAATAGTGTCTCACACCAAATGGATGGTTCATTTGTCTCCCATTTGTTACCCGATCAAGTAACTGCTGTCAGTACCAATAAAGCTCCTGA
ATGTGAACATTTTCCTGATAAAAATTCATTAATTAAGCTCCAGTTTGATACCAGTTCTGCTGGTCAGCAGCCTTCGACCTTACAATTTTTATCTGAGTCTCCAGTACCAAAATCATTATCTGCTACTGCTCCAGGTTGTGCTATGGATGATGCCCATCCTCTAAGAGAGCTGCCCCCTCCGCCTCCTCTCCCAACTTCATGTGTCACTAGTGCTGATGTTCTAATGCCTACCCCCTATAACTTTGTGTCACAAAATGTGTCCTTTCCTTCTAAGCCTTCTCTACCGGGAGGTTTTCAACCTCATCAGGATATCGTATCCATCCAATCATCTCATTACCACAGCACCACTTTCCCGCCTTCAAGACCATTATATGATCCAACAATGGCTCATGTAACTACCAAAGATGGTACGCCAATGCAATTTCATCAGAGTCATTTGTCTCAAGGAAGTGATCGGGGTTCTCAGTCTGTTATGAAATCCCAGCCATTGGTGACAAATTCTCATTCTATGCTTGGTGAGTCTCCAGTTCGGGAGCCTTATAGAGCTCCATTGCACATGGATGAAATTAGATCAACCGCCCCAGTTGCAAATAATCGACCTATTCAGCCCTTTGGATTCCCCAGCTTTCAGAAAGAAGAAAACTTTGGGCGGACTTCTGTGGAGATGAGTTCTTCTAGTTTTTTTCCCCATCGGAACTTTAATGATCAATCTATGCCCTTCACAAATGCAAATAGAATGCAATCTTCTGGTGACAATTTTCCTCCCAGTGAATTTCGAAGTTCATTTTCACAGTTTCATTCTTATTCACGGTTCCAACAGCCATTATATGCCTCGCAATCTGCACATGATAGTTTTTTACATGGCCCAAGTCAGATTGGTACTATATCTCGACATTATCCTGATCCTCTAAGCAGGAACCATTCGTCTTTGCTTCCTGATTTTGGGGGTTTGGGTATTACCACTTATCATAATCCTTATGCGTCTACTTTTGACAAGCCACTTAGCTCCAACTTCAGATCTAACATTTTGAACTTTGGAAATGATGCACCTAGTGGTGATATACGTGATTCTACTTTCAATTTGAGCAATGTTCGAGTTGATGGGCAAGGTGCTAATTATTTTGGATCAGGGCTGACAACTACTTCGCCAAAATCCACCAAACCTTCGGGGAAACACTTGCCCAGCTCAGGTGGTGATCAGTATGATCCACTCTTTGACAGCATTGAGCCATCACCACCTATTACCAAGAAATCTGATCGCATTCGAAAGCTGGAAAAAGCAAGAGAATCTCATATGATGACAAGACTTGGTGGTTCCCATAAATTACCAGATGTGGAGGAGAACAACAAGCATAAGGAGGTTGCTGCTGTGGCTTCAACTACTTCTCTGGAGAATGATGAATTTGGGGAGACAGCAGATGCAGAAGCTGGTGCTGTTGAGAATGACCTTGATGACGAAGAAAACTTAACCGGAGAGATTGAAATTGATCAGGTTAAGTCCTCAGAGAAGAGCAAGAAATCCAAAGGTTCCAGGTCGCTGAGGCTTTTCAGGATTGCTATTGCCGATTTTGTGAAGGAAGTTCTAAAACCATCATGGCGACAGGGCAATATGAGCAAGGAAGCTTTTAAGACAATTGTCAAGAAGACTGTTGACAAAGTATCTGGAGCTATGAAGAGTCACCAAATACCCAAGTCTCAAGCAAAGATAAATAGATACATTGATTCGTCACAACGAAAACTGACAAAGCTTGTTATGGTACGTTATCTTTCTCTCTCAAAAGAATTGGATTATTTTGGTACTTATCTTAGAACATTAGAGAGTATACTCCCAATTTGTAATTCCATTTGCTTGAAGCTATGAAAAACTTGTCGATAAAATAGAAAAATTATAGATAATGATTTGTTGGTTTACTTTTGAAATTGAAAAAGAAAAACAGTAACAAACAACTACTGAAGTAACTTCTTGGTAAACTTTTGATCTTATGAAGTATATTTTTGTT
AGAACATTTTCTTTAATTTTATAGCAGAAAGACAAACGATGTAGCTACTGGAACTCATTCCCCTTCGGCTACAATTATTTAACGAGGAGAGCAAATGTAGATGTATGAATGTAACAGGTTTTAGAAGAAAGGGAATTTTTAAGGTGGTATTCATAGATTATTTCCACAAAAGGTGATTGATATACTATTAAGGTTTAATTTTTAAGAATGTTGCATTAACATGGCACTCAATCTTATGGCCTTTTTAGGATTTGTGCCTTTTTGAGAGGGGTTTTGTTGCTCTGCTTGATTTTGTAGATGATTAAGGTTTAATTTTTAAGAATGTTGCATTAACATGGCATGGAGAGGGGTTTTGTTGCTCCGCTTGATTTTGTATATCTCATTCTTTTGATGGAATCTTGTTTCTATTTTTATAAAAGTTATATTCCATTTGTTTATTCTTTGCTGTGTGATTCCTTTTTTTATAATTCAAGCATTAATACTGTCCTATATGTTATCCAGGGTTACGTTGACAAGTATGTTAAGACATAGAAAATGAACGAAAGCTGCAATGCCAATGTGGTGGACTGATGAGAGCAACATTATTATGGATGCTTCTCCCCTTGTGGTGGTTATCAGGAACTATGCATTGTTGTTTGACATCTGCAGGGGATGGATAGGTGATATATCCATAGTGTTGGTTGAAATGTCGGTCAGAGTTAGTAATGGTCCTGTGGATACTGGAGAAATGATGAAGCTTATGATATGTTGAGCATCGTGACACAAGTTCTTTTAACATATGATTCTCATTAGAGTCCCGAGGCCGCAGCAAAATAACGTGTATGTAATATGTTTTCCTGAAAAATATTTTGTAGCCTTTTTTATACTTTCTAGATATCTAATTGATGCAGTAAATGTTTGCAAATGGCCGAGTGTATTTTTCTCTTCTTCTCAACATTGTAGTTGTATTGAGTTAGTAGAAAAGTATTGTAGTTTGCAAACATATTAGGAAAATATGTTAATTTTGCTGAGGTAGTGAGGAACTTGTCTCTTCAAGAGAGAAATTGATATTATGAACCGTGAACTTGTCTTTTTAAGAGATGATGAAGGGAGAAATTGATATTATGAACCGTAAACTTGTCTTTTTAAGAGATGATCTTGCACATGTCTCAATCCAACGCGTTTCTTGAAGAGAACAGTTTTGGTTTGGATGATAACTACCATACTTTATTATTATTTCTACTATATTTTATAAATTTTCTTAAAAACTATTATATTTTCCCAAGAAATCCGTTCAATCTGTGAAACATTGCTTATTGTTTTTAATCCCTACCAAATGTTTATGTAATAATTTCCAGTGACAGTAGAATGCATGTTTTTACTTCAAGGGCGTAATATAAGGTAGATATAAGAAGCCTTGAAAGTTTGCTTTGGATTTTGATCGTGTTAAATGAATACTACTTATCCTTCCTCCTTGGGTTTACTGCAGGAAACAACTAAGTGATGTGATAATACTTTGATCTTTATGAAAAATCTGTTCATGCTTCTGGTTTAAGGACTTATTGAGGAGCATTCCATCATCCTTCAACCATTTAGTGACCATTGACTTCCTTAGAGTTGCACTTTGCGGTCTTATTCTCAGTTTATTGGTATAAGGTATGTTTAGGAGGAACAAATTGTATTAGCATCAAACGCATTTACTTTGTACCTTTGCAGATTATTACTTAACTAGGATGAGTATATATGTTCAAAATTTTGGATTAGTTTAGGTCATGTCTGTATATCCCTCGTGTGTTTATTGACAAGAAAATGTAACAGTTGGGCTTCTAAATTACAAGGAATTGTTTTACAATTTGAATTATATGTTGGGACGAATACTGTAAAATAGATGAGTTTGTGCTGACTGCTTGAGAAATAGATTTTGGCCTTGTTTTCAGTAGCCTTTAGACCCTTATTTTTTATCTGAACACTTTGACTATGTAATGAAGGGAAGAAGTCTGTTGGGTTAAGGTTAGACCAGC
TTTTGAGATTGTTATTCTTCCGACTTTGTTGCTTATGTTATTGGAGAATCCTTAAAGAGAAATATGGGAATTTAGAGAATGGTATCTATGGACGAGACTTGTCTACCATGATGAAGTATGTTTTCCTTGGTTCAACATGTGGAGTAGTTTGTAATTTCAGCTTAGCCATGATGGTCCAAGATTACTTTGTATTAGTTTTTGGGAGGGGTGCCCACATCTCCATCCTGCTGTTTGGAATTCGAATCTCCAACCTTTTGATTGAGTATATACCGCAACCAGATACAGACTGGCGGGATAAGCTTCCTTTTGGCTTGAATTGATGGATGATCTTGTTCTTAGAGCTACAGAAATTATGAAATGATTCCATTCAATCTCATCAAGATGCTCAAGTTACGACATGACACTAGTATTTCAGACCCAATCAAGTCAAGTCATGAAGTGATATAGTATTATTCCTCTGATCAGAACCAGTTTTCTGATGTCAAAGACTCTAGAACTAGAAGCTTGCTTCCTTTTTATGTTTCAGTTGGGGAGGAGACAGATGTGACTCCAAAACTGTTCACAATTCAACCGTTCCAAGGACGGGATCTGCCAGCAAAGGGTGTTTGTAATTTTGGTCGGAGAGGACCGATTGTGTTATGTTACGAACTACTTAAGATCTTACTCTTTGTCAATCTGAATATATTTCACTTGAAGAATTGTAGATTTGTTATCTGATTTGTGAGGTCATATTTTGAAAAGAAAATTTATGAATAGATGAAGATATATCTGTTAGAGGATCCAAAATTTAAGGGGATTTTTTTTATTAATTTGTTTTTAGAATTGTTTTATAATACTATGCTTAAAGTAGAATGGTTTTGTCTGCAATAATTCAATTGATGATCTCCTTTTAATATCTCTACTTGTATTTTTTGACACCATTTTCAAATGGTGCTGATGGTTTTTCC
mRNA sequence
AAGAAAGGTTACAACGGAAAATATTATTCTTACCTATAAAAATTATATATATATATATTCAGAATATTAATTATTAATTAATAATAAAAAGTTGATATCTTTTTGTAGGCTTGGGTGGCGAGCGAGGGATCGAACTGCTTCTTTATTCTTACGCTTTGGATGTAACTGGGAGGGAACTTGGAATAGAATCTCATCTGTTGCTCCGTGAAAGGATCAGATACAGAAGCGTTTCATTGGCCGCCTCTGATAGGGTTCGAGCCACCATTTTCCACGCTCTTCTGCTCTGATTTCGTCCAAAAACCCTAATACTCTCTCCACTTTCAATACCTCAATCTTGGGTTTTTGTTATAATCTCAGGGCCTTCGAATTTCCTTAACTGGAGGTTCAAATTTGAATCCCAATTGATGCGGATGATTTATCAGTGGCTTGAGCTGAAGAATGTGCCCGTGCCTTTGTGGTTCTTTTAATACAATTTACCATTGTGTAAGTATAAAATTCTAATTTTATAGGATTTTAGATGTATGGTCCAGCAAATTATGCTTCTCAGTTCGGGCAGGGTCCTCAGAAACCATGGCCACCTGCATACCAACAGCGTGCAGTTGCACCTCCTCCGCCTCCTCCTCCTACCTCATATATGCAACCAGGCCCTCCAATCCCATCACGTCCTATAACTCAACAAGCACCTGCCCCCCCACCTCAGGCTGGTCAACCTCTTCATCTTTCTCAGTCGGGTCCTCACGTTCCACCACCTCCACTCTGTCAGGGCCCATCCGTTCAAGTGCTACCTGGTGGGATCCCAAACATCCGTCAAACTTATTTTCACACTTTTCCACCAGTCCATGGAAGCACACAAGGTTTTCAATTTAACTCAAGTACTCAGCAGAATGTACAACTTTCACAGTCAGGAGTTCAGAACATGCATCATATTCTACCTCCACCACCCCCGCTACCACCACCACCACCACCACCTCCTCCTCATGCTCCTAATCCACCACCACCCCCTCCTCATGCTCCTAATCCAGATTTATTACGGCCTCCACAACCCTCTACAGTAGTACCTGTTCATCCTCCTTCACAAGGACAAACATTGTATGGAGCTCGAGTTCATCCACCATTGCAACAAGGTGGTTTGCAGGTCTTTCCCTCTATACCACAACATCCAACAACGTCCAACTTCCCTACTCCTCCTTTTGGAGGAGTTATGCAATCAAATCTTGGAGAGTCTCATTTGTCTCCAATGGCTCCTCCACCACCACCATCCTCTCCGCCACCTATTCCACCTTCTCCCCCTCCTCCCACGTCGCCCTCCTTTTATTCAATTCCAAGCTCGGGTTCTTCCAACTTGTTGTGCCAGTCTGAATTTGATCCTAGTTCTACCATTAATTCTAGTAAAGAATTGAAAGCCTTTGAGTCTAATCAAGGAGGCACACCTACCAGACATTTAGGGGATAACGGACCCAAGCATAAGCATAGAAATTTGGATGGCAGTATTGGCCTTATGATGGGTTCTAAAGTGGACAATGAGATATTGTCAGATAAAGGTAATGTGCAGGATCTTCCTCCATCCCCACCTAAGCCAAAAGATGATAAAATTACCAGGAAGATAGGAGTATTATGCAAGTACATTGCTAATAATGGTTCCAGTTTTGAAGACACTACTCGCCAAAAGGAATTTGGGAATCCAGAGTTTGAATTTTTATATGGTGGTGAACCGGGAAGTGAAGCTGCAATTGGTCATGAATATTTTCTGTGGATGAAGAAGAAATATAGTTTGGATTGCAAAAATAAAGAAATGGAAGAAAAATCTCCAGTGAGATCTTTAAGAATTGGGCCACAATCTGAGTCTTTGACAGTGTCAGCAGCATCCATTTCACCTGAAAATTCCGACATGGAGATGGAAGATGATATTACCCCAGATGGTATAGGGGAGGAAACTAGTCACTCCTTTAAAATTCAAAGCTACGAGTGCAAATCAAGAAAAGAAGAGCATGATGCAAAGG
ATCAGTTACAAGGACCTAAAGATTTGCAAAGAAGCAGCCCAGTGAAGGGGAAAGTAGCTGAAGAAAAAGATGGAGAGTCAAAGCTTCTGCTCGAGCATGAAAAATCTGTCAGCCTGGAAGCTTGCCAAGTTCACAGCCCTGTCATAAATACTGCTGGAGTTGTTGAACAGCCTTTAGGAAGCAATTTTGAGATTTCTGTTACCTGCATACAAAATGAAAAAAGTCTAGCAGCTTCTGAAGCTGTTAATTCTAGTCTGTCTACTGAACTTATTATTGGTGGCAGCCCATTCAGACTTATACAGGACTACGCTTCTGATGAAAATTCAGAAACCGATGAGGAATCACACCTTAAAGATGTCAGTTTTGCTATCTCACCTTCAACTCCAGCTTCTTCCAAGACTTCCGGAAAAGACAGTGATAATTTGACTATTCTGGGATCAGAAGGTTCTTGCCAGGTTCAACGGAGTAATGTTCCACCTTGTGAAGCCTCGATGCCTGATTTTGGTTCTCAGTTCCTCTCGGAATCACCAAAACTGATTTTTGATGCCAATGAGGCAAATGTGAGAAGGGCAGGGAATGAACGGAACTATAAAATTCATCAGAATCAAGTTGGCACCCGCACCAGTTCTAAGTCTTTGGATGCAGATGCAGTGAAAGGTCGTAGTGTTGATGTTCTCCATGATAGTCACAAGTTACAAAAAGAGAATGATGAGGAAAAGCAGAAGTTTGGATCATCGCCTGTAAAAATAGATGAGTTTGGAAGATTAGTCAGAGAAGGTGGTAGCGATAGCGATTCAGATGATTCACACTATACAAGGAGACATAAGAAAAGAAGAACTAGAAATAGTAGTGAAAGTCATTCTCCTGTTGACAGGAGGAGGGGGCGAAGGAGTCCGTGGAGAAGAAGGCAGAGGCGAAGTCGCTCACGCAGTTGGTCTCCTCGTAATCAAAGAGGCAGAAGCAGAAGCAGGTCTCCTGTCAGCAGGCGTACAAGTCAGTTTAATAATGAGAATATGAAACGAGATAAGGGTATGATACGAAAATGCTTTGACTTTCAGCGAGGTAGGTGCTATAGAGGAGCATCTTGTCGCTATGTGCACCATGAACCCAGCAAGAATGATGGATCAAGACACCATAGGAGCAAACATCATGATGTTCATCCAACTTCAGAGAATATAAAGGGTAGAGAGGACACTGTGAACATGTCTAGGGAAGTATCAGATCCTGGGCATATTAAAGTTGAGAATCAGGGGTGCATTCAGCATAATGTGTCCCCAAAAGATGATACCCATGATTGGAAGAAAGGTAGTCCCACTGGTGATCCAGATTTAGATGTAACTAAATGTCAAAGTTCTAGAGATAGAGCTGGCTTAGTTCAAGAAGAATTAATTTATTCCAAAGCAGCAGAGGCTGTCCACATTCACGTTAACGAGAATATTCAAGAAGCAGGGAAGTCTTATGAGCAACTTTCAGTTACGGCTGCCTCACAATGCATGAGCAATGCTGATACTGAGAAATTATCGGGTGATATTTCCATGAGCATGCTGACTTCTGTAGAGAAGTCTTTGGCTCATGCTCAGCAATCCAACATGTTTGCTTCAGAGTTTGAAGCTGCCAATAGTGTCTCACACCAAATGGATGGTTCATTTGTCTCCCATTTGTTACCCGATCAAGTAACTGCTGTCAGTACCAATAAAGCTCCTGAATGTGAACATTTTCCTGATAAAAATTCATTAATTAAGCTCCAGTTTGATACCAGTTCTGCTGGTCAGCAGCCTTCGACCTTACAATTTTTATCTGAGTCTCCAGTACCAAAATCATTATCTGCTACTGCTCCAGGTTGTGCTATGGATGATGCCCATCCTCTAAGAGAGCTGCCCCCTCCGCCTCCTCTCCCAACTTCATGTGTCACTAGTGCTGATGTTCTAATGCCTACCCCCTATAACTTTGTGTCACAAAATGTGTCCTTTCCTTCTAAGCCTTCTCTACCGGGAGGTTTT
CAACCTCATCAGGATATCGTATCCATCCAATCATCTCATTACCACAGCACCACTTTCCCGCCTTCAAGACCATTATATGATCCAACAATGGCTCATGTAACTACCAAAGATGGTACGCCAATGCAATTTCATCAGAGTCATTTGTCTCAAGGAAGTGATCGGGGTTCTCAGTCTGTTATGAAATCCCAGCCATTGGTGACAAATTCTCATTCTATGCTTGGTGAGTCTCCAGTTCGGGAGCCTTATAGAGCTCCATTGCACATGGATGAAATTAGATCAACCGCCCCAGTTGCAAATAATCGACCTATTCAGCCCTTTGGATTCCCCAGCTTTCAGAAAGAAGAAAACTTTGGGCGGACTTCTGTGGAGATGAGTTCTTCTAGTTTTTTTCCCCATCGGAACTTTAATGATCAATCTATGCCCTTCACAAATGCAAATAGAATGCAATCTTCTGGTGACAATTTTCCTCCCAGTGAATTTCGAAGTTCATTTTCACAGTTTCATTCTTATTCACGGTTCCAACAGCCATTATATGCCTCGCAATCTGCACATGATAGTTTTTTACATGGCCCAAGTCAGATTGGTACTATATCTCGACATTATCCTGATCCTCTAAGCAGGAACCATTCGTCTTTGCTTCCTGATTTTGGGGGTTTGGGTATTACCACTTATCATAATCCTTATGCGTCTACTTTTGACAAGCCACTTAGCTCCAACTTCAGATCTAACATTTTGAACTTTGGAAATGATGCACCTAGTGGTGATATACGTGATTCTACTTTCAATTTGAGCAATGTTCGAGTTGATGGGCAAGGTGCTAATTATTTTGGATCAGGGCTGACAACTACTTCGCCAAAATCCACCAAACCTTCGGGGAAACACTTGCCCAGCTCAGGTGGTGATCAGTATGATCCACTCTTTGACAGCATTGAGCCATCACCACCTATTACCAAGAAATCTGATCGCATTCGAAAGCTGGAAAAAGCAAGAGAATCTCATATGATGACAAGACTTGGTGGTTCCCATAAATTACCAGATGTGGAGGAGAACAACAAGCATAAGGAGGTTGCTGCTGTGGCTTCAACTACTTCTCTGGAGAATGATGAATTTGGGGAGACAGCAGATGCAGAAGCTGGTGCTGTTGAGAATGACCTTGATGACGAAGAAAACTTAACCGGAGAGATTGAAATTGATCAGGTTAAGTCCTCAGAGAAGAGCAAGAAATCCAAAGGTTCCAGGTCGCTGAGGCTTTTCAGGATTGCTATTGCCGATTTTGTGAAGGAAGTTCTAAAACCATCATGGCGACAGGGCAATATGAGCAAGGAAGCTTTTAAGACAATTGTCAAGAAGACTGTTGACAAAGTATCTGGAGCTATGAAGAGTCACCAAATACCCAAGTCTCAAGCAAAGATAAATAGATACATTGATTCGTCACAACGAAAACTGACAAAGCTTGTTATGGGTTACGTTGACAAGTATGTTAAGACATAGAAAATGAACGAAAGCTGCAATGCCAATGTGGTGGACTGATGAGAGCAACATTATTATGGATGCTTCTCCCCTTGTGGTGGTTATCAGGAACTATGCATTGTTGTTTGACATCTGCAGGGGATGGATAGGTGATATATCCATAGTGTTGGTTGAAATGTCGGTCAGAGTTAGTAATGGTCCTGTGGATACTGGAGAAATGATGAAGCTTATGATATGTTGAGCATCGTGACACAAGTTCTTTTAACATATGATTCTCATTAGAGTCCCGAGGCCGCAGCAAAATAACGTGAAACAACTAAGTGATGTGATAATACTTTGATCTTTATGAAAAATCTGTTCATGCTTCTGGTTTAAGGACTTATTGAGGAGCATTCCATCATCCTTCAACCATTTAGTGACCATTGACTTCCTTAGAGTTGCACTTTGCGGTCTTATTCTCAGTTTATTGGTATAAGGTATGTTTAGGAGGAACAAATTGTATTAGCATCAAACGCATTTACTTTGTACCT
TTGCAGATTATTACTTAACTAGGATGAGTATATATGTTCAAAATTTTGGATTAGTTTAGGTCATGTCTGTATATCCCTCGTGTGTTTATTGACAAGAAAATGTAACAGTTGGGCTTCTAAATTACAAGGAATTGTTTTACAATTTGAATTATATGTTGGGACGAATACTGTAAAATAGATGAGTTTGTGCTGACTGCTTGAGAAATAGATTTTGGCCTTGTTTTCAGTAGCCTTTAGACCCTTATTTTTTATCTGAACACTTTGACTATGTAATGAAGGGAAGAAGTCTGTTGGGTTAAGGTTAGACCAGCTTTTGAGATTGTTATTCTTCCGACTTTGTTGCTTATGTTATTGGAGAATCCTTAAAGAGAAATATGGGAATTTAGAGAATGGTATCTATGGACGAGACTTGTCTACCATGATGAAGTATGTTTTCCTTGGTTCAACATGTGGAGTAGTTTGTAATTTCAGCTTAGCCATGATGGTCCAAGATTACTTTGTATTAGTTTTTGGGAGGGGTGCCCACATCTCCATCCTGCTGTTTGGAATTCGAATCTCCAACCTTTTGATTGAGTATATACCGCAACCAGATACAGACTGGCGGGATAAGCTTCCTTTTGGCTTGAATTGATGGATGATCTTGTTCTTAGAGCTACAGAAATTATGAAATGATTCCATTCAATCTCATCAAGATGCTCAAGTTACGACATGACACTAGTATTTCAGACCCAATCAAGTCAAGTCATGAAGTGATATAGTATTATTCCTCTGATCAGAACCAGTTTTCTGATGTCAAAGACTCTAGAACTAGAAGCTTGCTTCCTTTTTATGTTTCAGTTGGGGAGGAGACAGATGTGACTCCAAAACTGTTCACAATTCAACCGTTCCAAGGACGGGATCTGCCAGCAAAGGGTGTTTGTAATTTTGGTCGGAGAGGACCGATTGTGTTATGTTACGAACTACTTAAGATCTTACTCTTTGTCAATCTGAATATATTTCACTTGAAGAATTGTAGATTTGTTATCTGATTTGTGAGGTCATATTTTGAAAAGAAAATTTATGAATAGATGAAGATATATCTGTTAGAGGATCCAAAATTTAAGGGGATTTTTTTTATTAATTTGTTTTTAGAATTGTTTTATAATACTATGCTTAAAGTAGAATGGTTTTGTCTGCAATAATTCAATTGATGATCTCCTTTTAATATCTCTACTTGTATTTTTTGACACCATTTTCAAATGGTGCTGATGGTTTTTCC
Coding sequence (CDS)
ATGTATGGTCCAGCAAATTATGCTTCTCAGTTCGGGCAGGGTCCTCAGAAACCATGGCCACCTGCATACCAACAGCGTGCAGTTGCACCTCCTCCGCCTCCTCCTCCTACCTCATATATGCAACCAGGCCCTCCAATCCCATCACGTCCTATAACTCAACAAGCACCTGCCCCCCCACCTCAGGCTGGTCAACCTCTTCATCTTTCTCAGTCGGGTCCTCACGTTCCACCACCTCCACTCTGTCAGGGCCCATCCGTTCAAGTGCTACCTGGTGGGATCCCAAACATCCGTCAAACTTATTTTCACACTTTTCCACCAGTCCATGGAAGCACACAAGGTTTTCAATTTAACTCAAGTACTCAGCAGAATGTACAACTTTCACAGTCAGGAGTTCAGAACATGCATCATATTCTACCTCCACCACCCCCGCTACCACCACCACCACCACCACCTCCTCCTCATGCTCCTAATCCACCACCACCCCCTCCTCATGCTCCTAATCCAGATTTATTACGGCCTCCACAACCCTCTACAGTAGTACCTGTTCATCCTCCTTCACAAGGACAAACATTGTATGGAGCTCGAGTTCATCCACCATTGCAACAAGGTGGTTTGCAGGTCTTTCCCTCTATACCACAACATCCAACAACGTCCAACTTCCCTACTCCTCCTTTTGGAGGAGTTATGCAATCAAATCTTGGAGAGTCTCATTTGTCTCCAATGGCTCCTCCACCACCACCATCCTCTCCGCCACCTATTCCACCTTCTCCCCCTCCTCCCACGTCGCCCTCCTTTTATTCAATTCCAAGCTCGGGTTCTTCCAACTTGTTGTGCCAGTCTGAATTTGATCCTAGTTCTACCATTAATTCTAGTAAAGAATTGAAAGCCTTTGAGTCTAATCAAGGAGGCACACCTACCAGACATTTAGGGGATAACGGACCCAAGCATAAGCATAGAAATTTGGATGGCAGTATTGGCCTTATGATGGGTTCTAAAGTGGACAATGAGATATTGTCAGATAAAGGTAATGTGCAGGATCTTCCTCCATCCCCACCTAAGCCAAAAGATGATAAAATTACCAGGAAGATAGGAGTATTATGCAAGTACATTGCTAATAATGGTTCCAGTTTTGAAGACACTACTCGCCAAAAGGAATTTGGGAATCCAGAGTTTGAATTTTTATATGGTGGTGAACCGGGAAGTGAAGCTGCAATTGGTCATGAATATTTTCTGTGGATGAAGAAGAAATATAGTTTGGATTGCAAAAATAAAGAAATGGAAGAAAAATCTCCAGTGAGATCTTTAAGAATTGGGCCACAATCTGAGTCTTTGACAGTGTCAGCAGCATCCATTTCACCTGAAAATTCCGACATGGAGATGGAAGATGATATTACCCCAGATGGTATAGGGGAGGAAACTAGTCACTCCTTTAAAATTCAAAGCTACGAGTGCAAATCAAGAAAAGAAGAGCATGATGCAAAGGATCAGTTACAAGGACCTAAAGATTTGCAAAGAAGCAGCCCAGTGAAGGGGAAAGTAGCTGAAGAAAAAGATGGAGAGTCAAAGCTTCTGCTCGAGCATGAAAAATCTGTCAGCCTGGAAGCTTGCCAAGTTCACAGCCCTGTCATAAATACTGCTGGAGTTGTTGAACAGCCTTTAGGAAGCAATTTTGAGATTTCTGTTACCTGCATACAAAATGAAAAAAGTCTAGCAGCTTCTGAAGCTGTTAATTCTAGTCTGTCTACTGAACTTATTATTGGTGGCAGCCCATTCAGACTTATACAGGACTACGCTTCTGATGAAAATTCAGAAACCGATGAGGAATCACACCTTAAAGATGTCAGTTTTGCTATCTCACCTTCAACTCCAGCTTCTTCCAAGACTTCCGGAAAAGACAGTGATAATTTGACTATTCTGGGATCAGAAGGTTCTTGCCAGGTTCAACGGAGTAATGTTCCACCTTGTGAAGCCTCGATGCCTGATTTTGGTTCTCAGTTCCT
CTCGGAATCACCAAAACTGATTTTTGATGCCAATGAGGCAAATGTGAGAAGGGCAGGGAATGAACGGAACTATAAAATTCATCAGAATCAAGTTGGCACCCGCACCAGTTCTAAGTCTTTGGATGCAGATGCAGTGAAAGGTCGTAGTGTTGATGTTCTCCATGATAGTCACAAGTTACAAAAAGAGAATGATGAGGAAAAGCAGAAGTTTGGATCATCGCCTGTAAAAATAGATGAGTTTGGAAGATTAGTCAGAGAAGGTGGTAGCGATAGCGATTCAGATGATTCACACTATACAAGGAGACATAAGAAAAGAAGAACTAGAAATAGTAGTGAAAGTCATTCTCCTGTTGACAGGAGGAGGGGGCGAAGGAGTCCGTGGAGAAGAAGGCAGAGGCGAAGTCGCTCACGCAGTTGGTCTCCTCGTAATCAAAGAGGCAGAAGCAGAAGCAGGTCTCCTGTCAGCAGGCGTACAAGTCAGTTTAATAATGAGAATATGAAACGAGATAAGGGTATGATACGAAAATGCTTTGACTTTCAGCGAGGTAGGTGCTATAGAGGAGCATCTTGTCGCTATGTGCACCATGAACCCAGCAAGAATGATGGATCAAGACACCATAGGAGCAAACATCATGATGTTCATCCAACTTCAGAGAATATAAAGGGTAGAGAGGACACTGTGAACATGTCTAGGGAAGTATCAGATCCTGGGCATATTAAAGTTGAGAATCAGGGGTGCATTCAGCATAATGTGTCCCCAAAAGATGATACCCATGATTGGAAGAAAGGTAGTCCCACTGGTGATCCAGATTTAGATGTAACTAAATGTCAAAGTTCTAGAGATAGAGCTGGCTTAGTTCAAGAAGAATTAATTTATTCCAAAGCAGCAGAGGCTGTCCACATTCACGTTAACGAGAATATTCAAGAAGCAGGGAAGTCTTATGAGCAACTTTCAGTTACGGCTGCCTCACAATGCATGAGCAATGCTGATACTGAGAAATTATCGGGTGATATTTCCATGAGCATGCTGACTTCTGTAGAGAAGTCTTTGGCTCATGCTCAGCAATCCAACATGTTTGCTTCAGAGTTTGAAGCTGCCAATAGTGTCTCACACCAAATGGATGGTTCATTTGTCTCCCATTTGTTACCCGATCAAGTAACTGCTGTCAGTACCAATAAAGCTCCTGAATGTGAACATTTTCCTGATAAAAATTCATTAATTAAGCTCCAGTTTGATACCAGTTCTGCTGGTCAGCAGCCTTCGACCTTACAATTTTTATCTGAGTCTCCAGTACCAAAATCATTATCTGCTACTGCTCCAGGTTGTGCTATGGATGATGCCCATCCTCTAAGAGAGCTGCCCCCTCCGCCTCCTCTCCCAACTTCATGTGTCACTAGTGCTGATGTTCTAATGCCTACCCCCTATAACTTTGTGTCACAAAATGTGTCCTTTCCTTCTAAGCCTTCTCTACCGGGAGGTTTTCAACCTCATCAGGATATCGTATCCATCCAATCATCTCATTACCACAGCACCACTTTCCCGCCTTCAAGACCATTATATGATCCAACAATGGCTCATGTAACTACCAAAGATGGTACGCCAATGCAATTTCATCAGAGTCATTTGTCTCAAGGAAGTGATCGGGGTTCTCAGTCTGTTATGAAATCCCAGCCATTGGTGACAAATTCTCATTCTATGCTTGGTGAGTCTCCAGTTCGGGAGCCTTATAGAGCTCCATTGCACATGGATGAAATTAGATCAACCGCCCCAGTTGCAAATAATCGACCTATTCAGCCCTTTGGATTCCCCAGCTTTCAGAAAGAAGAAAACTTTGGGCGGACTTCTGTGGAGATGAGTTCTTCTAGTTTTTTTCCCCATCGGAACTTTAATGATCAATCTATGCCCTTCACAAATGCAAATAGAATGCAATCTTCTGGTGACAATTTTCCTCCCAGTGAATTTCGAAGTTCATTTTCACAGTTTCATTCTTATTCACGGT
TCCAACAGCCATTATATGCCTCGCAATCTGCACATGATAGTTTTTTACATGGCCCAAGTCAGATTGGTACTATATCTCGACATTATCCTGATCCTCTAAGCAGGAACCATTCGTCTTTGCTTCCTGATTTTGGGGGTTTGGGTATTACCACTTATCATAATCCTTATGCGTCTACTTTTGACAAGCCACTTAGCTCCAACTTCAGATCTAACATTTTGAACTTTGGAAATGATGCACCTAGTGGTGATATACGTGATTCTACTTTCAATTTGAGCAATGTTCGAGTTGATGGGCAAGGTGCTAATTATTTTGGATCAGGGCTGACAACTACTTCGCCAAAATCCACCAAACCTTCGGGGAAACACTTGCCCAGCTCAGGTGGTGATCAGTATGATCCACTCTTTGACAGCATTGAGCCATCACCACCTATTACCAAGAAATCTGATCGCATTCGAAAGCTGGAAAAAGCAAGAGAATCTCATATGATGACAAGACTTGGTGGTTCCCATAAATTACCAGATGTGGAGGAGAACAACAAGCATAAGGAGGTTGCTGCTGTGGCTTCAACTACTTCTCTGGAGAATGATGAATTTGGGGAGACAGCAGATGCAGAAGCTGGTGCTGTTGAGAATGACCTTGATGACGAAGAAAACTTAACCGGAGAGATTGAAATTGATCAGGTTAAGTCCTCAGAGAAGAGCAAGAAATCCAAAGGTTCCAGGTCGCTGAGGCTTTTCAGGATTGCTATTGCCGATTTTGTGAAGGAAGTTCTAAAACCATCATGGCGACAGGGCAATATGAGCAAGGAAGCTTTTAAGACAATTGTCAAGAAGACTGTTGACAAAGTATCTGGAGCTATGAAGAGTCACCAAATACCCAAGTCTCAAGCAAAGATAAATAGATACATTGATTCGTCACAACGAAAACTGACAAAGCTTGTTATGGGTTACGTTGACAAGTATGTTAAGACATAG
Protein sequence
MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPPQAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSSTQQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVVPVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESNQGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQSYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSSLSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTILGSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRSPVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLDVTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTNKAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPVREPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT
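The protein sequence above should correspond to a codon-by-codon translation of the coding sequence. The following is a minimal Python sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database. Only the first and last few codons of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: the CDS on this page uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS shown above.
print(translate("ATGTATGGTCCAGCAAATTATGCTTCTCAG"))  # MYGPANYASQ
# Last three codons of the CDS, ending in the TAG stop codon.
print(translate("AAGACATAG"))  # KT
```

Running `translate` on the full CDS and comparing the result with the protein sequence is a quick consistency check when the sequences are copied out of the page.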
Homology
BLAST of MC06g0242 vs. NCBI nr
Match:
XP_022135323.1 (uncharacterized protein LOC111007314 [Momordica charantia])
HSP 1 Score: 3200 bits (8297), Expect = 0.0
Identity = 1657/1678 (98.75%), Positives = 1657/1678 (98.75%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP
Sbjct: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST
Sbjct: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
Query: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV
Sbjct: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
Query: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP
Sbjct: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
Query: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN
Sbjct: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
Query: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT
Sbjct: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
Query: 361 RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD 420
RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD
Sbjct: 361 RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD 420
Query: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ 480
CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ
Sbjct: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ 480
Query: 481 SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAE---------------------EKDG 540
SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAE EKDG
Sbjct: 481 SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEVPQFLSIQPSCAMQRGFWTNLEKDG 540
Query: 541 ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS 600
ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS
Sbjct: 541 ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS 600
Query: 601 LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL
Sbjct: 601 LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
Query: 661 GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG 720
GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG
Sbjct: 661 GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG 720
Query: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD 780
TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD
Sbjct: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD 780
Query: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS
Sbjct: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
Query: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD
Sbjct: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
Query: 901 VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD 960
VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD
Sbjct: 901 VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD 960
Query: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE
Sbjct: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
Query: 1021 KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN 1080
KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN
Sbjct: 1021 KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN 1080
Query: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE
Sbjct: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
Query: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT
Sbjct: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
Query: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV 1260
FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV
Sbjct: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV 1260
Query: 1261 REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ
Sbjct: 1261 REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
Query: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS
Sbjct: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
Query: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD
Sbjct: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
Query: 1441 STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK
Sbjct: 1441 STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
Query: 1501 KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA
Sbjct: 1501 KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
Query: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN
Sbjct: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
Query: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1657
MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT
Sbjct: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1678
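The percentages in the HSP summary line are the matched counts divided by the alignment length. The Python sketch below recomputes them from the counts; `parse_hsp_stats` is a hypothetical helper written for this illustration and does not belong to any BLAST tool.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Extract 'Name = matched/total' fields and recompute each percentage."""
    stats = {}
    for name, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matched, total = int(matched), int(total)
        stats[name] = (matched, total, round(100 * matched / total, 2))
    return stats


line = "Identity = 1657/1678 (98.75%), Positives = 1657/1678 (98.75%), Query Frame = 0"
print(parse_hsp_stats(line)["Identity"])  # (1657, 1678, 98.75)
```

The denominator is the alignment length, not the query length. In the alignment above it appears to be the 1657 query residues plus the 21 gap columns in the row starting at query position 481, which gives 1678.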
BLAST of MC06g0242 vs. NCBI nr
Match:
XP_023531277.1 (uncharacterized protein LOC111793568 [Cucurbita pepo subsp. pepo] >XP_023531278.1 uncharacterized protein LOC111793568 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2370 bits (6142), Expect = 0.0
Identity = 1295/1676 (77.27%), Positives = 1393/1676 (83.11%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPP YQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPTYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-----VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQ 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPVHGSTQ Q
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVHGSTQVSQ 120
Query: 121 FNSSTQQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQ 180
FNS+ QQNVQLS SGVQN HH+LPPPP LPPPPP P HAP+PDLLRPPQ
Sbjct: 121 FNSNAQQNVQLSHSGVQNTHHVLPPPPRLPPPPPRPL-----------HAPSPDLLRPPQ 180
Query: 181 PSTVVPVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLG 240
ST+VP+HP SQGQTLYGAR++PPLQQGGLQ+FPSIPQHPTTSNFPTPP FGG+MQSNLG
Sbjct: 181 FSTIVPLHPRSQGQTLYGARINPPLQQGGLQIFPSIPQHPTTSNFPTPPSFGGLMQSNLG 240
Query: 241 ESHLSPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKEL 300
ESHL P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ EFDPSSTI+ SK L
Sbjct: 241 ESHLLPVAPPPPPSSPPPIPPSPPPPTSPSS-SIPNSDSSNLLCQIEFDPSSTIHCSKRL 300
Query: 301 KAFESNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPK 360
KAFE++ HLG+N PKH KHRNL+G IGL+MGSKVDNEILSDK VQ LPPSPPK
Sbjct: 301 KAFENDPVVASPSHLGNNRPKHDKHRNLEGGIGLVMGSKVDNEILSDKDYVQVLPPSPPK 360
Query: 361 PKDDKITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWM 420
PKDD+I RKI VLC+ IA+N SSFED TR KEFGNPEF+FL+GGEPGSE+AIGHEYFLWM
Sbjct: 361 PKDDRIVRKIEVLCQLIASNDSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWM 420
Query: 421 KKKYSLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEET 480
KKKYSL CKNKEM+EK P RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 KKKYSLACKNKEMKEKFPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEET 480
Query: 481 SHSFKIQSYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSL 540
+IQSY+ KSRKEEHD KDQLQGP+DLQR S K K AE DG KLLL HEKSVS+
Sbjct: 481 GRLVQIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSV 540
Query: 541 EACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSL----AASEAVNSSLSTELIIGGS 600
ACQVH PV +AG+ E PLG+NFE SVTC QN+K+L AA EA NSS S L+ GGS
Sbjct: 541 AACQVHIPVRISAGLSEPPLGNNFESSVTCSQNDKNLSGEVAAFEATNSSQSAALVAGGS 600
Query: 601 PFRLIQDYASDENSETDEESHLKDVSFA-ISPSTPASSKTSGKDSDNLTILGSEGSCQVQ 660
PFRLIQDY+SDENSE+DEESHLKDV F +SPSTP SSKTS KD+D LT LGS+GSCQV+
Sbjct: 601 PFRLIQDYSSDENSESDEESHLKDVRFVPVSPSTPVSSKTSDKDTDQLTNLGSKGSCQVE 660
Query: 661 RSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLD 720
S P CE SMP+ G+ FLSE PKL+FDANEANVR+ GNE++ +NQ+GT TS KSLD
Sbjct: 661 LSYAPTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSLD 720
Query: 721 ADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRR 780
A + GRSVDV+ D+ KL+KENDEEK K GSSPVKIDEFGRLVREGGSDSDSDDS Y RR
Sbjct: 721 A--LNGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRR 780
Query: 781 HKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGR----SRSRSPVSRR 840
HK RR R+SSESHSPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRGR SRSRSPVSRR
Sbjct: 781 HKNRRARSSSESHSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSPVSRR 840
Query: 841 TSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTS 900
T+QFNNENM+RDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHPTS
Sbjct: 841 TNQFNNENMRRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHPTS 900
Query: 901 ENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLDVTKCQ 960
+NI REDT+N SR++SD GHIKVENQ CIQHNVSPK D H W SPT DV +CQ
Sbjct: 901 KNIGSREDTMNTSRDISDLGHIKVENQECIQHNVSPKHDAHAWNTDSPT----RDVNRCQ 960
Query: 961 SSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGD 1020
SSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK SGD
Sbjct: 961 SSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFSGD 1020
Query: 1021 ISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTNKAPEC 1080
IS SMLTS E S+A QQSNM SE + ANS S MDGSFVS+LLPDQVT V+TNKAPEC
Sbjct: 1021 ISTSMLTSAENSVA--QQSNMHVSELQTANSHSRPMDGSFVSNLLPDQVTVVTTNKAPEC 1080
Query: 1081 EHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPP 1140
E FPDK S I QFD SSA Q P+T QFLSESPVPK SATAPGCA DDAH LR LPPPP
Sbjct: 1081 ELFPDKTSSISEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPPPP 1140
Query: 1141 PL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFP 1200
PL S VTSA+V + PY+FVSQN SFPSK SLPGGF PHQD VSIQ S+ HST
Sbjct: 1141 PLLPHMISHVTSAEVPISAPYSFVSQNASFPSKSSLPGGFHPHQDFVSIQPSNDHSTPLL 1200
Query: 1201 PSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPVRE 1260
P R LYD +A TTKDG PMQFHQS+LSQGSD GSQSVMKSQPL +SHS +GESP++E
Sbjct: 1201 PPRRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPLQE 1260
Query: 1261 PYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSM 1320
P RAP+HMDEIRS PVA +RP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQSM
Sbjct: 1261 PCRAPMHMDEIRSITPVATDRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQSM 1320
Query: 1321 PFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRH 1380
PFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQIGT+SRH
Sbjct: 1321 PFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQIGTMSRH 1380
Query: 1381 YPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDST 1440
Y DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + S ILNFGNDAPSGDIRDST
Sbjct: 1381 YLDPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSKILNFGNDAPSGDIRDST 1440
Query: 1441 FNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKS 1500
FN SN RVDGQGANY GS LTT SP STKP GK LPS+GGDQYDPLFDS+EPS PI KKS
Sbjct: 1441 FNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSAGGDQYDPLFDSMEPSSPIIKKS 1500
Query: 1501 DRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGA 1560
DR +KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEAGA
Sbjct: 1501 DRGQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEAGA 1560
Query: 1561 VENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMS 1620
VE+D DDE NL+GEIEIDQVKSSEKSK SKGSRSLRLFRIAIADFVKE+LKPSWRQGNMS
Sbjct: 1561 VEDDFDDEANLSGEIEIDQVKSSEKSKNSKGSRSLRLFRIAIADFVKEILKPSWRQGNMS 1620
Query: 1621 KEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1657
KEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVMGYVDKYVK+
Sbjct: 1621 KEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVMGYVDKYVKS 1654
BLAST of MC06g0242 vs. NCBI nr
Match:
XP_022931323.1 (uncharacterized protein LOC111437543 [Cucurbita moschata] >XP_022931325.1 uncharacterized protein LOC111437543 [Cucurbita moschata])
HSP 1 Score: 2363 bits (6123), Expect = 0.0
Identity = 1295/1674 (77.36%), Positives = 1389/1674 (82.97%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSS 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPV GSTQ QFNS+
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVRGSTQVSQFNSN 120
Query: 121 TQQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTV 180
QQNVQLS SGVQN HH+LPPPP LPPPPP P HAP+PDL+RPPQ ST
Sbjct: 121 AQQNVQLSHSGVQNTHHVLPPPPRLPPPPPRPL-----------HAPSPDLIRPPQFSTT 180
Query: 181 VPVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLGESHL 240
VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHP+TSNFPTPP FGG+MQSNLGESHL
Sbjct: 181 VPLHPRSQGQTLYGVRINPPLQQGGLQIFPSIPQHPSTSNFPTPPSFGGLMQSNLGESHL 240
Query: 241 SPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ E DPSSTI+ SK LKAFE
Sbjct: 241 LPVAPPPPPSSPPPIPPSPPPPTSPSS-SIPNSDSSNLLCQIEVDPSSTIHCSKRLKAFE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
++ HLGDN PKH KHRNL+G IGLMMGSKVDNEILSDK VQ LPPSPPKPKDD
Sbjct: 301 NDPVVPSPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEILSDKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKY 420
+I RKI VLC+ IA+NGSSFED TR KEFGNPEF+FL+GGEPGSE+AIGHEYFLWMKKKY
Sbjct: 361 RIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWMKKKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
SL CKNKEM+EKSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 SLACKNKEMKEKSPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEETGRLV 480
Query: 481 KIQSYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSLEACQ 540
+IQSY+ KSRKEEHD KDQLQGP+DLQR S K K AE DG KLLL HEKSVS+ ACQ
Sbjct: 481 QIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSVAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCIQNEKSL----AASEAVNSSLSTELIIGGSPFRL 600
VH PV +AG+ E PLG+NFE SVTC QN K+L AA EA NSS S L+ GGSPFRL
Sbjct: 541 VHIPVRISAGLSEPPLGNNFESSVTCSQNGKNLSGEVAAFEATNSSQSAALVAGGSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSEGSCQVQRSNV 660
IQDY+SDENSE+DEESHLKDV F A SPSTP SSKTS KD+D LT LGS+GSCQV+ S
Sbjct: 601 IQDYSSDENSESDEESHLKDVRFVAASPSTPVSSKTSDKDTDQLTNLGSKGSCQVELSYA 660
Query: 661 PPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
P CE SMP+ G+ FLSE PKL+FDANEANVR+ GNE++ +NQ+GT TS KSLDA +
Sbjct: 661 PTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSLDA--L 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
GRSVDV+ D+ KL+KENDEEK K GSSPVKIDEFGRLVREGGSDSDSDDS Y RRHK R
Sbjct: 721 NGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRRHKNR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGR------SRSRSPVSRRTS 840
R R+SSES SPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRGR SRSRSPVSRRT+
Sbjct: 781 RARSSSESRSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSRSPVSRRTN 840
Query: 841 QFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSEN 900
QFNNENM+RDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHPTS+N
Sbjct: 841 QFNNENMRRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHPTSKN 900
Query: 901 IKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLDVTKCQSS 960
I REDT+N SR++SD GHIKVE Q CIQH+VSPK D H W SPT DV +CQSS
Sbjct: 901 IGSREDTMNASRDISDLGHIKVEIQECIQHSVSPKHDAHAWNTDSPT----RDVNRCQSS 960
Query: 961 RDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDIS 1020
RD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK SGDIS
Sbjct: 961 RDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFSGDIS 1020
Query: 1021 MSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTNKAPECEH 1080
SMLTS E S+A QQSNM SE ANS S MDGSFVS+LLPDQVT ++TNKAPECE
Sbjct: 1021 TSMLTSAENSVA--QQSNMLVSELLTANSYSRPMDGSFVSNLLPDQVTVLTTNKAPECEL 1080
Query: 1081 FPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPL 1140
FPDK S I QFD SSA Q P+T QFLSESPVPK SATAPGCA DDAH LR LPPPPPL
Sbjct: 1081 FPDKTSSINEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPPPPPL 1140
Query: 1141 ---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPS 1200
S VTSA+V + PY+FVSQN SFPSK SLPG F PHQD VSIQ S+ HST P
Sbjct: 1141 LPHMISHVTSAEVPISAPYSFVSQNASFPSKSSLPGDFHPHQDFVSIQPSNDHSTPLLPP 1200
Query: 1201 RPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPVREPY 1260
R LYD +A TTKDG PMQFHQS+LSQGSD GSQSVMKSQPL +SHS +GESP++EP
Sbjct: 1201 RRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPLQEPC 1260
Query: 1261 RAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPF 1320
RAP+HMDEIRS PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQSMPF
Sbjct: 1261 RAPMHMDEIRSITPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQSMPF 1320
Query: 1321 TNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYP 1380
T+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD FL SQIGT+SRHY
Sbjct: 1321 TDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGFLRDSSQIGTMSRHYL 1380
Query: 1381 DPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFN 1440
DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPSGDIRDSTFN
Sbjct: 1381 DPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPSGDIRDSTFN 1440
Query: 1441 LSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDR 1500
SN RVDGQGANY GS LTT SP STKP GK LPS GGDQYDPLFDS+EPS PI KKSDR
Sbjct: 1441 ASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPSSPIIKKSDR 1500
Query: 1501 IRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVE 1560
+KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEAGAVE
Sbjct: 1501 GQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEAGAVE 1560
Query: 1561 NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKE 1620
+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPSWRQGNMSKE
Sbjct: 1561 DDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPSWRQGNMSKE 1620
Query: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1657
AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVMGYVDKYVK+
Sbjct: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVMGYVDKYVKS 1652
BLAST of MC06g0242 vs. NCBI nr
Match:
KAG6587592.1 (Zinc finger CCCH domain-containing protein 55, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2352 bits (6095), Expect = 0.0
Identity = 1292/1678 (77.00%), Positives = 1386/1678 (82.60%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSS 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPV GSTQ QFNS+
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVRGSTQVSQFNSN 120
Query: 121 TQQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTV 180
QQNVQLS SGVQN HH+LPPPP LPPPPP P HAP+PDLLRPPQ ST
Sbjct: 121 AQQNVQLSHSGVQNTHHVLPPPPRLPPPPPRPL-----------HAPSPDLLRPPQFSTT 180
Query: 181 VPVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLGESHL 240
VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHP+TSNFPTPP FGG+MQSNLGESHL
Sbjct: 181 VPLHPRSQGQTLYGVRINPPLQQGGLQIFPSIPQHPSTSNFPTPPSFGGLMQSNLGESHL 240
Query: 241 SPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ E DPSSTI+ SK LKAFE
Sbjct: 241 LPVAPPPPPSSPPPIPPSPPPPTSPSS-SIPNSDSSNLLCQIEVDPSSTIHCSKRLKAFE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
++ HLGDN PKH KHRNL+G IGLMMGSKVDNEILSDK VQ LPPSPPKPKDD
Sbjct: 301 NDPVVPSPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEILSDKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKY 420
+I RKI VLC+ IA+NGSSFED TR KEFGNPEF+FL+GGEPGSE+AIGHEYFLWMKKKY
Sbjct: 361 RIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWMKKKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
SL CKNKEM+EKSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 SLACKNKEMKEKSPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEETGRLV 480
Query: 481 KIQSYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSLEACQ 540
+IQSY+ KSRKEEHD KDQLQGP+DLQR S K K AE DG KLLL HEKSVS+ ACQ
Sbjct: 481 QIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSVAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCIQNEKSL----AASEAVNSSLSTELIIGGSPFRL 600
VH PV +AG+ E PLG+NFE SVT QN K+L AA EA NSS S L+ GGSPFRL
Sbjct: 541 VHIPVRISAGLSEPPLGNNFESSVTRSQNGKNLSGEVAAFEATNSSQSAALVAGGSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSEGSCQVQRSNV 660
IQDY+SDENSE+DEESHLKDV F A SPSTP SSKTS KD+D LT LGS+GSCQV+ S
Sbjct: 601 IQDYSSDENSESDEESHLKDVRFVAASPSTPVSSKTSDKDTDQLTNLGSKGSCQVELSYA 660
Query: 661 PPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
P CE SMP+ G+ FLSE PKL+FDANEANVR+ GNE++ +NQ+GT TS KSLDA +
Sbjct: 661 PTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSLDA--L 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
GRSVDV+ D+ KL+KENDEEK K GSSPVKIDEFGRLVREGGSDSDSDDS Y RRHK R
Sbjct: 721 NGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRRHKNR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGR----------SRSRSPVS 840
R R+SSES SPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRGR SRSRSPVS
Sbjct: 781 RARSSSESRSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRGRGRSRSRSRSRSPVS 840
Query: 841 RRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHP 900
RRT+QFNNENM+RDKGM+RKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHP
Sbjct: 841 RRTNQFNNENMRRDKGMMRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHP 900
Query: 901 TSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLDVTK 960
TS+NI REDT+N SR++SD GHIKVE Q CIQH+VSPK D H W SPT DV +
Sbjct: 901 TSKNIGSREDTMNASRDISDLGHIKVEIQECIQHSVSPKHDAHAWNTDSPT----RDVNR 960
Query: 961 CQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLS 1020
CQSSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK S
Sbjct: 961 CQSSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFS 1020
Query: 1021 GDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTNKAP 1080
GDIS SMLTS E S+A QQSNM SE ANS S MDGSFVS+LLPDQVT ++TNKAP
Sbjct: 1021 GDISTSMLTSAENSVA--QQSNMLVSELLTANSYSRPMDGSFVSNLLPDQVTVLTTNKAP 1080
Query: 1081 ECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRELPP 1140
ECE FPDK S I QFD SSA Q P+T QFLSESPVPK SATAPGCA DDAH LR LPP
Sbjct: 1081 ECELFPDKTSSINEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPP 1140
Query: 1141 PPPL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
PPPL S VTSA+V + PY+FV QN SFPSK SLPG F PHQD VSIQ S+ HST
Sbjct: 1141 PPPLLPHMISHVTSAEVPISAPYSFVPQNASFPSKSSLPGDFHPHQDFVSIQPSNDHSTP 1200
Query: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV 1260
P R LYD +A TTKDG PMQFHQS+LSQGSD GSQSVMKSQPL +SHS +GESP+
Sbjct: 1201 LLPPRRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPL 1260
Query: 1261 REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
+EP RAP+HMDEIRS PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQ
Sbjct: 1261 QEPCRAPMHMDEIRSITPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQ 1320
Query: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
SMPFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQIGT+S
Sbjct: 1321 SMPFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQIGTMS 1380
Query: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
RHY DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPSGDIRD
Sbjct: 1381 RHYLDPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPSGDIRD 1440
Query: 1441 STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
STFN SN RVDGQGANY GS LTT SP STKP GK LPS GGDQYDPLFDS+EPS PI K
Sbjct: 1441 STFNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPSSPIIK 1500
Query: 1501 KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
KSDR +KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEA
Sbjct: 1501 KSDRGQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEA 1560
Query: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
GAVE+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPSWRQGN
Sbjct: 1561 GAVEDDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPSWRQGN 1620
Query: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1657
MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVMGYVDKYVK+
Sbjct: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVMGYVDKYVKS 1656
BLAST of MC06g0242 vs. NCBI nr
Match:
XP_023001661.1 (serine/arginine repetitive matrix protein 2-like [Cucurbita maxima] >XP_023001669.1 serine/arginine repetitive matrix protein 2-like [Cucurbita maxima])
HSP 1 Score: 2334 bits (6048), Expect = 0.0
Identity = 1283/1683 (76.23%), Positives = 1385/1683 (82.29%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-----VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQ 120
QAGQPL++SQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPVHGSTQ Q
Sbjct: 61 QAGQPLYMSQSGSHGPPPPPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVHGSTQVSQ 120
Query: 121 FNSSTQQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQ 180
FNS+ +QSGVQN HH+LPPPP LPPPPP P HAP+PDLLRPPQ
Sbjct: 121 FNSN-------AQSGVQNTHHVLPPPPLLPPPPPRPL-----------HAPSPDLLRPPQ 180
Query: 181 PSTVVPVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLG 240
ST+VP+HP SQGQTLYGAR++PPLQQGGLQ+FPSIPQHPTTSNFPTPP FGG+MQSNLG
Sbjct: 181 FSTIVPLHPRSQGQTLYGARINPPLQQGGLQIFPSIPQHPTTSNFPTPPSFGGLMQSNLG 240
Query: 241 ESHLSPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKEL 300
E HL P+APPPPPS PPPIPPSPPPPTSPS SIP+S SSNLLCQ EFDPSSTI+ SK L
Sbjct: 241 EPHLLPVAPPPPPSYPPPIPPSPPPPTSPSS-SIPNSDSSNLLCQIEFDPSSTIHCSKRL 300
Query: 301 KAFESNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPK 360
KAFE++ HLGDN PKH KHRNL+G IGLMMGSKVDNEI SDK VQ LPPSPPK
Sbjct: 301 KAFENDPVVASPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEIFSDKDYVQVLPPSPPK 360
Query: 361 PKDDKITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWM 420
PKDD+I RKI VLC+ IA+NGSSFED TR KEFGNPEF+FL+GGEPGSE+AIGHEYFLWM
Sbjct: 361 PKDDRIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWM 420
Query: 421 KKKYSLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEET 480
KKKYSL CKNKEM+ KSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 KKKYSLACKNKEMKAKSPSRSLGIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEET 480
Query: 481 SHSFKIQSYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSL 540
H +IQSY+ KSRKEE+D KDQLQGP+D QR S + K E +DG KLLL HEKSVS
Sbjct: 481 GHLVQIQSYKRKSRKEEYDVKDQLQGPEDSQRCS--REKEIEAEDGGPKLLLGHEKSVSA 540
Query: 541 EACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSL----AASEAVNSSLSTELIIGGS 600
ACQVH P +AG+ E LG+NFE SVTC QN+K+L AA EA NSS S L+ GGS
Sbjct: 541 AACQVHIPDRISAGLSEPALGNNFESSVTCSQNDKNLSGEVAAFEATNSSQSAALVAGGS 600
Query: 601 PFRLIQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSEGSCQVQ 660
PFRLIQDY+SDENSE+DEESHLKDV F A+SPSTP SSKTS K +D LT LGS+GSCQV+
Sbjct: 601 PFRLIQDYSSDENSESDEESHLKDVRFVAVSPSTPVSSKTSDKYTDQLTNLGSKGSCQVE 660
Query: 661 RSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLD 720
S P CE SMP+ G+ FLS PKL+FDANEANVR+ GNE++ +NQ+GT TS KSLD
Sbjct: 661 LSYAPTCEHSMPESGAHFLSGPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSLD 720
Query: 721 ADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRR 780
A + GRSVDV+ D+ KL+KENDEEK K GSSPVKIDEFGRLVREGGSDSDSDDS Y RR
Sbjct: 721 A--LNGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRR 780
Query: 781 HKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGR----------SRSR 840
HK RR R+SSESHSPVDRR GRRSPWRRR+RRSRSRSWSPRNQRGR SRSR
Sbjct: 781 HKNRRARSSSESHSPVDRR-GRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSRSRSR 840
Query: 841 SPVSRRTSQFNNENMKRDKG-MIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKH 900
SPVSRRT+QFNNENM+RDKG MIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR HRSKH
Sbjct: 841 SPVSRRTNQFNNENMRRDKGIMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLHRSKH 900
Query: 901 HDVHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPD 960
HDVHPTS+NIK REDT+N SR++SD GHIKVENQ CIQHNVSPK D H W SPT
Sbjct: 901 HDVHPTSKNIKSREDTMNTSRDISDLGHIKVENQECIQHNVSPKHDAHAWNTDSPT---- 960
Query: 961 LDVTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNAD 1020
DV +CQSSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNAD
Sbjct: 961 RDVHRCQSSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNAD 1020
Query: 1021 TEKLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVS 1080
TEK SGDIS SMLTS E S+A QQSNM SE + ANS S MDGSF+S+LLPDQVT V+
Sbjct: 1021 TEKFSGDISTSMLTSAENSVA--QQSNMLVSELQTANSYSRPMDGSFISNLLPDQVTVVT 1080
Query: 1081 TNKAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPL 1140
TNKAPECE FPDK S I QFD SSA Q P T QFLSESP+PK SATAPGCA DDAH L
Sbjct: 1081 TNKAPECELFPDKTSSINEQFDASSASQPPMTSQFLSESPIPKQFSATAPGCANDDAHSL 1140
Query: 1141 RELPPPPPL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSH 1200
R LPPPPPL TS V A+V + PY+FVSQN SFPSK SLPGGF PHQD VSIQ S+
Sbjct: 1141 RALPPPPPLLPHMTSHVNGAEVPISAPYSFVSQNASFPSKSSLPGGFHPHQDFVSIQPSN 1200
Query: 1201 YHSTTFPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSML 1260
HST P R LYD T+A TTKDGTPMQFHQS+LSQGSD GSQSVMKSQPL +S S +
Sbjct: 1201 DHSTPLLPPRRLYDSTLAPTTTKDGTPMQFHQSNLSQGSDLGSQSVMKSQPLELHSRSKI 1260
Query: 1261 GESPVREPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHR 1320
GESP++EP R P+HMDEIRS+ PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP R
Sbjct: 1261 GESPLQEPCRGPMHMDEIRSSTPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRR 1320
Query: 1321 NFNDQSMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQ 1380
NFNDQSMPFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQ
Sbjct: 1321 NFNDQSMPFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQ 1380
Query: 1381 IGTISRHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPS 1440
IGT+SRHYPDP RNHSSL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPS
Sbjct: 1381 IGTMSRHYPDPSIRNHSSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPS 1440
Query: 1441 GDIRDSTFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPS 1500
GDIRDSTFN SN RVDGQGANY GS LTT SP STKP GK LPS GGDQYDPLFDS+EPS
Sbjct: 1441 GDIRDSTFNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPS 1500
Query: 1501 PPITKKSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGET 1560
PI +KSDR +KLEK RE HM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGET
Sbjct: 1501 SPIIRKSDRGQKLEKTREYHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGET 1560
Query: 1561 ADAEAGAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPS 1620
ADAEAGAVE+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPS
Sbjct: 1561 ADAEAGAVEDDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPS 1620
Query: 1621 WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKY 1657
WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVMGYVDKY
Sbjct: 1621 WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVMGYVDKY 1653
BLAST of MC06g0242 vs. ExPASy TrEMBL
Match:
A0A6J1C4H9 (uncharacterized protein LOC111007314 OS=Momordica charantia OX=3673 GN=LOC111007314 PE=4 SV=1)
HSP 1 Score: 3200 bits (8297), Expect = 0.0
Identity = 1657/1678 (98.75%), Positives = 1657/1678 (98.75%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP
Sbjct: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST
Sbjct: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
Query: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV
Sbjct: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
Query: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP
Sbjct: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
Query: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN
Sbjct: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
Query: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT
Sbjct: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
Query: 361 RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD 420
RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD
Sbjct: 361 RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD 420
Query: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ 480
CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ
Sbjct: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ 480
Query: 481 SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAE---------------------EKDG 540
SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAE EKDG
Sbjct: 481 SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEVPQFLSIQPSCAMQRGFWTNLEKDG 540
Query: 541 ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS 600
ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS
Sbjct: 541 ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS 600
Query: 601 LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL
Sbjct: 601 LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
Query: 661 GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG 720
GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG
Sbjct: 661 GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG 720
Query: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD 780
TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD
Sbjct: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD 780
Query: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS
Sbjct: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
Query: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD
Sbjct: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
Query: 901 VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD 960
VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD
Sbjct: 901 VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD 960
Query: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE
Sbjct: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
Query: 1021 KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN 1080
KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN
Sbjct: 1021 KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN 1080
Query: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE
Sbjct: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
Query: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT
Sbjct: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
Query: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV 1260
FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV
Sbjct: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV 1260
Query: 1261 REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ
Sbjct: 1261 REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
Query: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS
Sbjct: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
Query: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD
Sbjct: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
Query: 1441 STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK
Sbjct: 1441 STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
Query: 1501 KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA
Sbjct: 1501 KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
Query: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN
Sbjct: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
Query: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1657
MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT
Sbjct: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1678
BLAST of MC06g0242 vs. ExPASy TrEMBL
Match:
A0A6J1EZ49 (uncharacterized protein LOC111437543 OS=Cucurbita moschata OX=3662 GN=LOC111437543 PE=4 SV=1)
HSP 1 Score: 2363 bits (6123), Expect = 0.0
Identity = 1295/1674 (77.36%), Positives = 1389/1674 (82.97%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSS 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPV GSTQ QFNS+
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVRGSTQVSQFNSN 120
Query: 121 TQQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTV 180
QQNVQLS SGVQN HH+LPPPP LPPPPP P HAP+PDL+RPPQ ST
Sbjct: 121 AQQNVQLSHSGVQNTHHVLPPPPRLPPPPPRPL-----------HAPSPDLIRPPQFSTT 180
Query: 181 VPVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLGESHL 240
VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHP+TSNFPTPP FGG+MQSNLGESHL
Sbjct: 181 VPLHPRSQGQTLYGVRINPPLQQGGLQIFPSIPQHPSTSNFPTPPSFGGLMQSNLGESHL 240
Query: 241 SPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ E DPSSTI+ SK LKAFE
Sbjct: 241 LPVAPPPPPSSPPPIPPSPPPPTSPSS-SIPNSDSSNLLCQIEVDPSSTIHCSKRLKAFE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
++ HLGDN PKH KHRNL+G IGLMMGSKVDNEILSDK VQ LPPSPPKPKDD
Sbjct: 301 NDPVVPSPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEILSDKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKY 420
+I RKI VLC+ IA+NGSSFED TR KEFGNPEF+FL+GGEPGSE+AIGHEYFLWMKKKY
Sbjct: 361 RIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWMKKKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
SL CKNKEM+EKSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 SLACKNKEMKEKSPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEETGRLV 480
Query: 481 KIQSYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSLEACQ 540
+IQSY+ KSRKEEHD KDQLQGP+DLQR S K K AE DG KLLL HEKSVS+ ACQ
Sbjct: 481 QIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSVAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCIQNEKSL----AASEAVNSSLSTELIIGGSPFRL 600
VH PV +AG+ E PLG+NFE SVTC QN K+L AA EA NSS S L+ GGSPFRL
Sbjct: 541 VHIPVRISAGLSEPPLGNNFESSVTCSQNGKNLSGEVAAFEATNSSQSAALVAGGSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSEGSCQVQRSNV 660
IQDY+SDENSE+DEESHLKDV F A SPSTP SSKTS KD+D LT LGS+GSCQV+ S
Sbjct: 601 IQDYSSDENSESDEESHLKDVRFVAASPSTPVSSKTSDKDTDQLTNLGSKGSCQVELSYA 660
Query: 661 PPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
P CE SMP+ G+ FLSE PKL+FDANEANVR+ GNE++ +NQ+GT TS KSLDA +
Sbjct: 661 PTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSLDA--L 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
GRSVDV+ D+ KL+KENDEEK K GSSPVKIDEFGRLVREGGSDSDSDDS Y RRHK R
Sbjct: 721 NGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRRHKNR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGR------SRSRSPVSRRTS 840
R R+SSES SPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRGR SRSRSPVSRRT+
Sbjct: 781 RARSSSESRSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSRSPVSRRTN 840
Query: 841 QFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSEN 900
QFNNENM+RDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHPTS+N
Sbjct: 841 QFNNENMRRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHPTSKN 900
Query: 901 IKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLDVTKCQSS 960
I REDT+N SR++SD GHIKVE Q CIQH+VSPK D H W SPT DV +CQSS
Sbjct: 901 IGSREDTMNASRDISDLGHIKVEIQECIQHSVSPKHDAHAWNTDSPT----RDVNRCQSS 960
Query: 961 RDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDIS 1020
RD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK SGDIS
Sbjct: 961 RDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFSGDIS 1020
Query: 1021 MSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTNKAPECEH 1080
SMLTS E S+A QQSNM SE ANS S MDGSFVS+LLPDQVT ++TNKAPECE
Sbjct: 1021 TSMLTSAENSVA--QQSNMLVSELLTANSYSRPMDGSFVSNLLPDQVTVLTTNKAPECEL 1080
Query: 1081 FPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPL 1140
FPDK S I QFD SSA Q P+T QFLSESPVPK SATAPGCA DDAH LR LPPPPPL
Sbjct: 1081 FPDKTSSINEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPPPPPL 1140
Query: 1141 ---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPS 1200
S VTSA+V + PY+FVSQN SFPSK SLPG F PHQD VSIQ S+ HST P
Sbjct: 1141 LPHMISHVTSAEVPISAPYSFVSQNASFPSKSSLPGDFHPHQDFVSIQPSNDHSTPLLPP 1200
Query: 1201 RPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPVREPY 1260
R LYD +A TTKDG PMQFHQS+LSQGSD GSQSVMKSQPL +SHS +GESP++EP
Sbjct: 1201 RRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPLQEPC 1260
Query: 1261 RAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPF 1320
RAP+HMDEIRS PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQSMPF
Sbjct: 1261 RAPMHMDEIRSITPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQSMPF 1320
Query: 1321 TNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYP 1380
T+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD FL SQIGT+SRHY
Sbjct: 1321 TDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGFLRDSSQIGTMSRHYL 1380
Query: 1381 DPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFN 1440
DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPSGDIRDSTFN
Sbjct: 1381 DPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPSGDIRDSTFN 1440
Query: 1441 LSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDR 1500
SN RVDGQGANY GS LTT SP STKP GK LPS GGDQYDPLFDS+EPS PI KKSDR
Sbjct: 1441 ASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPSSPIIKKSDR 1500
Query: 1501 IRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVE 1560
+KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEAGAVE
Sbjct: 1501 GQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEAGAVE 1560
Query: 1561 NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKE 1620
+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPSWRQGNMSKE
Sbjct: 1561 DDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPSWRQGNMSKE 1620
Query: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1657
AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVMGYVDKYVK+
Sbjct: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVMGYVDKYVKS 1652
BLAST of MC06g0242 vs. ExPASy TrEMBL
Match:
A0A6J1KND4 (serine/arginine repetitive matrix protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC111495732 PE=4 SV=1)
HSP 1 Score: 2334 bits (6048), Expect = 0.0
Identity = 1283/1683 (76.23%), Positives = 1385/1683 (82.29%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-----VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQ 120
QAGQPL++SQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPVHGSTQ Q
Sbjct: 61 QAGQPLYMSQSGSHGPPPPPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVHGSTQVSQ 120
Query: 121 FNSSTQQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQ 180
FNS+ +QSGVQN HH+LPPPP LPPPPP P HAP+PDLLRPPQ
Sbjct: 121 FNSN-------AQSGVQNTHHVLPPPPLLPPPPPRPL-----------HAPSPDLLRPPQ 180
Query: 181 PSTVVPVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLG 240
ST+VP+HP SQGQTLYGAR++PPLQQGGLQ+FPSIPQHPTTSNFPTPP FGG+MQSNLG
Sbjct: 181 FSTIVPLHPRSQGQTLYGARINPPLQQGGLQIFPSIPQHPTTSNFPTPPSFGGLMQSNLG 240
Query: 241 ESHLSPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKEL 300
E HL P+APPPPPS PPPIPPSPPPPTSPS SIP+S SSNLLCQ EFDPSSTI+ SK L
Sbjct: 241 EPHLLPVAPPPPPSYPPPIPPSPPPPTSPSS-SIPNSDSSNLLCQIEFDPSSTIHCSKRL 300
Query: 301 KAFESNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPK 360
KAFE++ HLGDN PKH KHRNL+G IGLMMGSKVDNEI SDK VQ LPPSPPK
Sbjct: 301 KAFENDPVVASPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEIFSDKDYVQVLPPSPPK 360
Query: 361 PKDDKITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWM 420
PKDD+I RKI VLC+ IA+NGSSFED TR KEFGNPEF+FL+GGEPGSE+AIGHEYFLWM
Sbjct: 361 PKDDRIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWM 420
Query: 421 KKKYSLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEET 480
KKKYSL CKNKEM+ KSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 KKKYSLACKNKEMKAKSPSRSLGIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEET 480
Query: 481 SHSFKIQSYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSL 540
H +IQSY+ KSRKEE+D KDQLQGP+D QR S + K E +DG KLLL HEKSVS
Sbjct: 481 GHLVQIQSYKRKSRKEEYDVKDQLQGPEDSQRCS--REKEIEAEDGGPKLLLGHEKSVSA 540
Query: 541 EACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSL----AASEAVNSSLSTELIIGGS 600
ACQVH P +AG+ E LG+NFE SVTC QN+K+L AA EA NSS S L+ GGS
Sbjct: 541 AACQVHIPDRISAGLSEPALGNNFESSVTCSQNDKNLSGEVAAFEATNSSQSAALVAGGS 600
Query: 601 PFRLIQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSEGSCQVQ 660
PFRLIQDY+SDENSE+DEESHLKDV F A+SPSTP SSKTS K +D LT LGS+GSCQV+
Sbjct: 601 PFRLIQDYSSDENSESDEESHLKDVRFVAVSPSTPVSSKTSDKYTDQLTNLGSKGSCQVE 660
Query: 661 RSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLD 720
S P CE SMP+ G+ FLS PKL+FDANEANVR+ GNE++ +NQ+GT TS KSLD
Sbjct: 661 LSYAPTCEHSMPESGAHFLSGPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSLD 720
Query: 721 ADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRR 780
A + GRSVDV+ D+ KL+KENDEEK K GSSPVKIDEFGRLVREGGSDSDSDDS Y RR
Sbjct: 721 A--LNGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRR 780
Query: 781 HKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGR----------SRSR 840
HK RR R+SSESHSPVDRR GRRSPWRRR+RRSRSRSWSPRNQRGR SRSR
Sbjct: 781 HKNRRARSSSESHSPVDRR-GRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSRSRSR 840
Query: 841 SPVSRRTSQFNNENMKRDKG-MIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKH 900
SPVSRRT+QFNNENM+RDKG MIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR HRSKH
Sbjct: 841 SPVSRRTNQFNNENMRRDKGIMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLHRSKH 900
Query: 901 HDVHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPD 960
HDVHPTS+NIK REDT+N SR++SD GHIKVENQ CIQHNVSPK D H W SPT
Sbjct: 901 HDVHPTSKNIKSREDTMNTSRDISDLGHIKVENQECIQHNVSPKHDAHAWNTDSPT---- 960
Query: 961 LDVTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNAD 1020
DV +CQSSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNAD
Sbjct: 961 RDVHRCQSSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNAD 1020
Query: 1021 TEKLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVS 1080
TEK SGDIS SMLTS E S+A QQSNM SE + ANS S MDGSF+S+LLPDQVT V+
Sbjct: 1021 TEKFSGDISTSMLTSAENSVA--QQSNMLVSELQTANSYSRPMDGSFISNLLPDQVTVVT 1080
Query: 1081 TNKAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPL 1140
TNKAPECE FPDK S I QFD SSA Q P T QFLSESP+PK SATAPGCA DDAH L
Sbjct: 1081 TNKAPECELFPDKTSSINEQFDASSASQPPMTSQFLSESPIPKQFSATAPGCANDDAHSL 1140
Query: 1141 RELPPPPPL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSH 1200
R LPPPPPL TS V A+V + PY+FVSQN SFPSK SLPGGF PHQD VSIQ S+
Sbjct: 1141 RALPPPPPLLPHMTSHVNGAEVPISAPYSFVSQNASFPSKSSLPGGFHPHQDFVSIQPSN 1200
Query: 1201 YHSTTFPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSML 1260
HST P R LYD T+A TTKDGTPMQFHQS+LSQGSD GSQSVMKSQPL +S S +
Sbjct: 1201 DHSTPLLPPRRLYDSTLAPTTTKDGTPMQFHQSNLSQGSDLGSQSVMKSQPLELHSRSKI 1260
Query: 1261 GESPVREPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHR 1320
GESP++EP R P+HMDEIRS+ PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP R
Sbjct: 1261 GESPLQEPCRGPMHMDEIRSSTPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRR 1320
Query: 1321 NFNDQSMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQ 1380
NFNDQSMPFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQ
Sbjct: 1321 NFNDQSMPFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQ 1380
Query: 1381 IGTISRHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPS 1440
IGT+SRHYPDP RNHSSL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPS
Sbjct: 1381 IGTMSRHYPDPSIRNHSSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPS 1440
Query: 1441 GDIRDSTFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPS 1500
GDIRDSTFN SN RVDGQGANY GS LTT SP STKP GK LPS GGDQYDPLFDS+EPS
Sbjct: 1441 GDIRDSTFNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPS 1500
Query: 1501 PPITKKSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGET 1560
PI +KSDR +KLEK RE HM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGET
Sbjct: 1501 SPIIRKSDRGQKLEKTREYHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGET 1560
Query: 1561 ADAEAGAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPS 1620
ADAEAGAVE+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPS
Sbjct: 1561 ADAEAGAVEDDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPS 1620
Query: 1621 WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKY 1657
WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVMGYVDKY
Sbjct: 1621 WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVMGYVDKY 1653
BLAST of MC06g0242 vs. ExPASy TrEMBL
Match:
A0A5A7UQ65 (Serine/arginine repetitive matrix protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G003060 PE=4 SV=1)
HSP 1 Score: 2281 bits (5911), Expect = 0.0
Identity = 1259/1663 (75.71%), Positives = 1370/1663 (82.38%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG ANYASQFGQGPQKPWPPAYQQRA APPPPPPPTSY+QPGPPIPS P+TQQAPAPPP
Sbjct: 1 MYGQANYASQFGQGPQKPWPPAYQQRAGAPPPPPPPTSYVQPGPPIPSHPVTQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QA QPLHLSQ G H PPPP CQGPS+QVLPGGI NIR YFHTFPP HG+TQ FNS+
Sbjct: 61 QA-QPLHLSQPGSHGPPPPFCQGPSIQVLPGGITNIRP-YFHTFPPAHGNTQVSVFNSNA 120
Query: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLS SG QNMHH+LPPPPPLPPPPPPPPP + APNPDLLRPPQPSTV
Sbjct: 121 QQNVQLSHSGAQNMHHVLPPPPPLPPPPPPPPPPS--------QAPNPDLLRPPQPSTVG 180
Query: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSN-LGESHLS 240
+HPPSQGQ YGA H PLQQGGLQVFPSIP HPTTS FPTP SN LG+SHL
Sbjct: 181 SLHPPSQGQAFYGALTHQPLQQGGLQVFPSIPPHPTTSTFPTP------SSNFLGDSHLL 240
Query: 241 PMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFES 300
PMAPPPPPSSPPPIPPSPPPPTSPS SIP SSNL S PSST++ SK+LK E
Sbjct: 241 PMAPPPPPSSPPPIPPSPPPPTSPS-PSIPHPDSSNLSHGSHLGPSSTVHYSKDLKPSEI 300
Query: 301 NQGGTPTRHLGDNGPKHK-HRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDK 360
+QGG P HLGDNGPKH+ H NL+ GLM+ SKVDNEILSDK VQ LPPSPPKPKDD+
Sbjct: 301 DQGGAPPSHLGDNGPKHEEHGNLEVGSGLMV-SKVDNEILSDKDYVQVLPPSPPKPKDDR 360
Query: 361 ITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYS 420
I +KI VLC+ IA+NG SFEDTTRQKEFGNPEF+FL+GGEPGSE+AI HEYFL MK KYS
Sbjct: 361 IVKKIEVLCQLIADNGPSFEDTTRQKEFGNPEFDFLFGGEPGSESAIAHEYFLRMKMKYS 420
Query: 421 LDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFK 480
L KN E+ EKSP+R LRI PQSE+LT SAAS+SP NSDMEMEDDIT I E TSH F
Sbjct: 421 LASKNIEITEKSPLRYLRIEPQSENLTASAASLSPANSDMEMEDDITVADIEEGTSHLFG 480
Query: 481 IQSYECKSRKEEHDAKD--QLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSLEAC 540
IQSYECK RKEEHDA+D QLQ P+ L+ SP K KVAE DG KLLL HEKS S+ AC
Sbjct: 481 IQSYECKPRKEEHDARDLVQLQKPEVLRSCSPEKEKVAE--DGGPKLLLNHEKSGSIAAC 540
Query: 541 QVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLA----ASEAVNSSLSTELIIGGSPFR 600
QVHSPV +TAGV P G++FE S+ +QN+K LA +S A SS ST LI GGSPFR
Sbjct: 541 QVHSPVRSTAGVAGHPPGNDFENSLISLQNDKGLAGEVASSAATISSQSTALITGGSPFR 600
Query: 601 LIQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSEGSCQVQRSN 660
LIQDYASDENSE+DE+SH DV F AISPSTPA SKTSGKD+ +LT LGS+GSCQVQ S
Sbjct: 601 LIQDYASDENSESDEDSHHTDVHFVAISPSTPAYSKTSGKDTGDLTTLGSKGSCQVQWSY 660
Query: 661 VPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADA 720
VPPCE SMP+ G+QF SESPK + DA EANV++ GNE++Y NQ+ T T +KSLDA
Sbjct: 661 VPPCEFSMPEPGAQFHSESPKQVIDATEANVQKTGNEQSYNDQHNQIDTVTGTKSLDAMN 720
Query: 721 VKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKK 780
V RSVDV D+ KLQKEND EK + GSSP+KIDEFGRLVREGGSDSDSDD HY RRHK
Sbjct: 721 V--RSVDVPQDTDKLQKENDAEKGRLGSSPIKIDEFGRLVREGGSDSDSDDLHYRRRHKS 780
Query: 781 RRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRSPVSRRTSQFNNE 840
RR+RNSSES SPVDRRRGRRSP RRR+RRSRSRSWSPRNQR RSRS PV RRTSQF+NE
Sbjct: 781 RRSRNSSESRSPVDRRRGRRSPRRRRERRSRSRSWSPRNQRDRSRS--PVGRRTSQFSNE 840
Query: 841 NMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSENIKGRE 900
N +RDKGM+RKCFDFQRGRCYRGASCRYVHHEP+KNDG R HRSKHHDVHPTS+NIK RE
Sbjct: 841 NKRRDKGMVRKCFDFQRGRCYRGASCRYVHHEPNKNDGPRFHRSKHHDVHPTSKNIKIRE 900
Query: 901 DTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLDVTKCQSSRDRAG 960
DT+NMSREVSD GH KVENQ I HNVSPK DTHDWK SPTGDPD VTKCQSS DR G
Sbjct: 901 DTMNMSREVSDLGHTKVENQESILHNVSPKKDTHDWKTDSPTGDPDSFVTKCQSSSDRTG 960
Query: 961 LVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDISMSMLT 1020
LVQ+ LI S+ AEA+H+H N++ QEA K YEQ SVTA+SQCM NADTEKLSGDISMS LT
Sbjct: 961 LVQDALICSEPAEAIHVHANDDGQEAKKCYEQPSVTASSQCMGNADTEKLSGDISMSTLT 1020
Query: 1021 SVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTNKAPECEHFPDKN 1080
SVE S+A QQSN F +E +++N +SHQMDGSFVS+LLPDQVTAV++NKAPECEHF D+
Sbjct: 1021 SVENSVA--QQSNTFVAELQSSNDLSHQMDGSFVSNLLPDQVTAVTSNKAPECEHFTDRT 1080
Query: 1081 SLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPLPTSCV 1140
S IK QFDTSSA Q P T Q LSESPVPK SATAP A DDAH L ELPPPPPL S V
Sbjct: 1081 SSIKPQFDTSSAIQLPLTSQILSESPVPKPYSATAPVSATDDAHSLTELPPPPPLIISHV 1140
Query: 1141 TSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPSRPLYDPTM 1200
+SA++ MP PYNFVSQN+SFP SLP GF PH +VSIQ SHY ST+ P +PLY+ ++
Sbjct: 1141 SSAEISMPAPYNFVSQNLSFPPNSSLPIGFHPHHGMVSIQPSHYQSTSLLPPKPLYN-SL 1200
Query: 1201 AHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPVREPYRAP-LHMD 1260
A VTT G PMQFHQSHLSQG D GSQS M SQPL +SHS LGESPV+EPYRAP +H+D
Sbjct: 1201 APVTTNAGMPMQFHQSHLSQGRDLGSQSAMSSQPLELHSHSKLGESPVQEPYRAPPMHLD 1260
Query: 1261 EIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPFTNANRMQ 1320
EIRS APVANNRP QPFGFPSFQ EEN GRTSVEM+SSSFFP RNF+D SMP TNANRMQ
Sbjct: 1261 EIRSIAPVANNRPTQPFGFPSFQNEENHGRTSVEMNSSSFFPQRNFSDHSMPATNANRMQ 1320
Query: 1321 SSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYPDPLSRNH 1380
SGDNFPP+EFRSSFSQF YSRFQQPLY SQ AHDS PSQIG+ISRHYPDPLSR+H
Sbjct: 1321 PSGDNFPPTEFRSSFSQFQPYSRFQQPLYTSQPAHDSLFRDPSQIGSISRHYPDPLSRSH 1380
Query: 1381 SSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFNLSNVRVD 1440
SLLP++GGLGITTYHNPYASTF+KPLSS+FRSN LNFGNDAPSGDI STFN+S+V +D
Sbjct: 1381 PSLLPEYGGLGITTYHNPYASTFEKPLSSSFRSNFLNFGNDAPSGDICSSTFNMSSVHID 1440
Query: 1441 GQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDRIRKLEKA 1500
GQG NY GS T SP STKP GK L + GDQYDPLFDSIEPS PITKKSDR +KL+KA
Sbjct: 1441 GQGTNYVGSRQTVASPNSTKPLGKLLSGTDGDQYDPLFDSIEPSSPITKKSDRGQKLKKA 1500
Query: 1501 RESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVENDLDDEE 1560
RES M RLGGSHKL DVEENNKHKEVAAV STTSLENDEFGET DAEAGAVENDLDDE
Sbjct: 1501 RESDTMARLGGSHKLLDVEENNKHKEVAAVTSTTSLENDEFGETGDAEAGAVENDLDDEA 1560
Query: 1561 NLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVK 1620
NL+GEIEIDQVKSSEKSKKSKGSRSL+LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVK
Sbjct: 1561 NLSGEIEIDQVKSSEKSKKSKGSRSLKLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVK 1620
Query: 1621 KTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDK 1653
KTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDK
Sbjct: 1621 KTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDK 1636
BLAST of MC06g0242 vs. ExPASy TrEMBL
Match:
A0A0A0LRV0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G014360 PE=4 SV=1)
HSP 1 Score: 2234 bits (5788), Expect = 0.0
Identity = 1242/1668 (74.46%), Positives = 1371/1668 (82.19%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG ANYASQFGQGP KPWPPAYQQRA APPPPPPPTSY+QPGPPIPS PITQQAPAPPP
Sbjct: 1 MYGQANYASQFGQGPPKPWPPAYQQRAGAPPPPPPPTSYVQPGPPIPSHPITQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QA QPLHLSQ G H P PP CQGPS+QVLPGGI NIR YFHTFPPVHG+TQ FNS+
Sbjct: 61 QA-QPLHLSQPGSHGPLPPFCQGPSIQVLPGGITNIRP-YFHTFPPVHGNTQVSVFNSNA 120
Query: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLS SGVQNMHH+LPPPPPLP PPPPPPP P+ APNPDLLRPPQPSTV
Sbjct: 121 QQNVQLSHSGVQNMHHVLPPPPPLPLPPPPPPPPPPS------QAPNPDLLRPPQPSTVG 180
Query: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSN-LGESHLS 240
+HPPSQGQ LYGAR H PLQQGGLQVFPSIP HPTTS FPTP SN LG+SHL
Sbjct: 181 SLHPPSQGQALYGARTHQPLQQGGLQVFPSIPPHPTTSTFPTP------SSNFLGDSHLL 240
Query: 241 PMAPPPPP-SSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
PMAPPPPP SSPPPIPPSPPPPTSPS SIP SSNLL S+ PSST++ SK+LK E
Sbjct: 241 PMAPPPPPPSSPPPIPPSPPPPTSPS-PSIPHPDSSNLLHGSDLGPSSTVHYSKDLKPSE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
+QGGTP HLGDNGP + +H NL+ GLM+ S VDNE L+DK VQ LPPSPPKPKDD
Sbjct: 301 IDQGGTPPSHLGDNGPGNDEHGNLEVDSGLMV-SNVDNEKLADKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKY 420
+I +KI VLC+ IA+NG +FEDT RQKE GNPEFEFL GGEPGSE+AIGH+YFLWMK KY
Sbjct: 361 RIVKKIEVLCQLIADNGPNFEDTIRQKESGNPEFEFLLGGEPGSESAIGHKYFLWMKMKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
L KN E+ E+ +R LRI PQSE+LTV AAS+SP NSDMEMEDDIT + + TSHSF
Sbjct: 421 CLASKNIEITERCSLRYLRIEPQSENLTVLAASLSPANSDMEMEDDIT---VEQGTSHSF 480
Query: 481 KIQSYECKSRKEEHDAKD--QLQGPKDLQRSSPVKGKVAEEKDGESKLLLEHEKSVSLEA 540
+IQSYEC++RKEEHDA+D QLQ P+ L+ SP K KVAEE G K LL HEK S+ +
Sbjct: 481 EIQSYECEARKEEHDARDLVQLQEPEVLRSCSPEKEKVAEE--GGPKHLLNHEKFGSIAS 540
Query: 541 CQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKS----LAASEAVNSSLSTELIIGGSPF 600
CQVHSPV +TAGV P G++FE S++ +QN+K +A+S SS ST LI GGSPF
Sbjct: 541 CQVHSPVRSTAGVAGHPSGNDFENSLSYLQNDKGQAGEVASSAGTISSQSTALITGGSPF 600
Query: 601 RLIQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSEGSCQVQRS 660
RLIQDYASDENSE+DE+SH DV F AISPSTPA SKTS KD+ +LT LGS+GSCQV+ S
Sbjct: 601 RLIQDYASDENSESDEDSHRTDVHFVAISPSTPAYSKTSDKDTGDLTTLGSKGSCQVRWS 660
Query: 661 NVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDAD 720
VPPCE SMP+ G+QF SESPK + DA EANVR+ GNE +Y NQ+ T T +KSLDA
Sbjct: 661 YVPPCEFSMPEPGAQFHSESPKQVIDATEANVRKTGNELSYNDQHNQIDTVTGTKSLDA- 720
Query: 721 AVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHK 780
+ G SVDV D+ KLQKE D EK + G SPVKIDEFGRLVREGGSDSDSDDSHY RRH+
Sbjct: 721 -MNGCSVDVPQDTGKLQKETDAEKGRLGPSPVKIDEFGRLVREGGSDSDSDDSHYRRRHR 780
Query: 781 KRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRSPVSRRTSQFNN 840
RR+RNSSES SPVDRRRGRRSP RRR+RRSRSRSWSPRNQR RSRS PVSRRTSQF+N
Sbjct: 781 SRRSRNSSESRSPVDRRRGRRSPRRRRERRSRSRSWSPRNQRDRSRS--PVSRRTSQFSN 840
Query: 841 ENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSENIKGR 900
EN +RDKGM+RKCFDFQRGRCYRGASCRYVHHEP+KNDGSR HRSKH DVH TS+NIK R
Sbjct: 841 ENKRRDKGMVRKCFDFQRGRCYRGASCRYVHHEPNKNDGSRFHRSKHQDVHSTSKNIKIR 900
Query: 901 EDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLDVTKCQSSRDRA 960
EDT+NMSREVSD GH KVE Q I HNVSPK+DTHDWK +PTGDPD V+KC+SS +R
Sbjct: 901 EDTMNMSREVSDLGHTKVEIQESILHNVSPKEDTHDWKTDNPTGDPDSFVSKCRSSSERT 960
Query: 961 GLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDISMSML 1020
GLVQ+ LI + AEAVH+ N++ QE KSYEQ SVTA+SQCMSNADTEKLSGDISMS+L
Sbjct: 961 GLVQDALICLEPAEAVHVRANDDGQEPKKSYEQPSVTASSQCMSNADTEKLSGDISMSVL 1020
Query: 1021 TSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTNKAPECEHFPDK 1080
TSVE S+A QQSN F +E +++ +SHQMDGSFVS+LLPDQVTAV++NKAPE EHFPD+
Sbjct: 1021 TSVENSVA--QQSNTFVAELQSSTDLSHQMDGSFVSNLLPDQVTAVTSNKAPEWEHFPDR 1080
Query: 1081 NSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPLPTSC 1140
S IK QFDTSSA Q P T Q LSESPVPK LSATAP A DD H L ELPPPPPL S
Sbjct: 1081 TSSIKPQFDTSSAIQLPLTSQILSESPVPKPLSATAPVSATDDDHSLTELPPPPPLIISH 1140
Query: 1141 VTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPSRPLYDPT 1200
V+SA++ MP PYNFVSQN+SFPS SLP GF PH +VSIQ SH+ ST+ P +PLY+ +
Sbjct: 1141 VSSAEISMPAPYNFVSQNLSFPSNSSLPIGFHPHHGMVSIQPSHFQSTSLLPPKPLYN-S 1200
Query: 1201 MAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPVREPYRAP-LHM 1260
+A V T G PMQFH SHLSQG D GSQS M SQPL +SHS LGESP++EPYRAP +HM
Sbjct: 1201 LAPVATNAGMPMQFHHSHLSQGRDLGSQSAMSSQPLELHSHSKLGESPLQEPYRAPPMHM 1260
Query: 1261 DEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPFTNANRM 1320
DEIRS APVANNRP QPFGFPSFQ EEN GRTSVEM+SSSFFP RNF+DQSM TNANRM
Sbjct: 1261 DEIRSIAPVANNRPTQPFGFPSFQNEENLGRTSVEMNSSSFFPQRNFSDQSMLATNANRM 1320
Query: 1321 QSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYPDPLSRN 1380
Q SGDNFPPSEFRSSFSQF YSRFQQPLY SQ AHD+ H PSQIG+ISRHYPDPLSR+
Sbjct: 1321 QPSGDNFPPSEFRSSFSQFQPYSRFQQPLYTSQPAHDTLFHDPSQIGSISRHYPDPLSRS 1380
Query: 1381 HSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFNLSNVRV 1440
H SLLP+FGGLGITT+HNPYASTF+KPLSS+FRSN LNFGNDAPSGDIR STFNL++V V
Sbjct: 1381 HPSLLPEFGGLGITTHHNPYASTFEKPLSSSFRSNFLNFGNDAPSGDIRGSTFNLNSVHV 1440
Query: 1441 DGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDRIRKLEK 1500
DGQG NY GS T SP STKP GK L + DQYDPLFDSIEPS PITKKSDR +KL+K
Sbjct: 1441 DGQGTNYVGSRQTVASPNSTKPLGKLLSGTDDDQYDPLFDSIEPSSPITKKSDRGQKLKK 1500
Query: 1501 ARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVENDLDDE 1560
ARESHM+ RLGGSHKL DVEENNKHKEVAAV STTSLENDEFGET DAEAGAVENDLDD+
Sbjct: 1501 ARESHMIARLGGSHKLLDVEENNKHKEVAAVTSTTSLENDEFGETGDAEAGAVENDLDDD 1560
Query: 1561 ENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIV 1620
NL+GEIEIDQVKSSEKSKKSKGSRSL+LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIV
Sbjct: 1561 ANLSGEIEIDQVKSSEKSKKSKGSRSLKLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIV 1620
Query: 1621 KKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1657
KKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT
Sbjct: 1621 KKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMGYVDKYVKT 1640
BLAST of MC06g0242 vs. TAIR 10
Match:
AT3G26850.1 (histone-lysine N-methyltransferases)
HSP 1 Score: 160.6 bits (405), Expect = 1.1e-38
Identity = 117/259 (45.17%), Positives = 150/259 (57.92%), Query Frame = 0
Query: 1438 GSGLTTTSPKSTKPSGKHLPSSG--GDQYDPLFDSIEPS-------------------PP 1497
GS ++SP S K GK +P G GD YDP DS EP+ P
Sbjct: 11 GSRQASSSPYSGK--GKIVPECGLVGDMYDPFVDSFEPASVKLDCVQEHEPDNDLCIVPK 70
Query: 1498 ITKKSDRIRKLEKAR---------ESHMMTRLG-GSHKLPDVEENNKHKEVAAVASTTSL 1557
+ S+R +E+ ES M R+ S+K DVEEN E+ V S
Sbjct: 71 ASISSNRPLSMEENNQAVDKEPLCESEMTARVSVSSNKPADVEENTAGIEIGEVVSG--- 130
Query: 1558 ENDEFGETAD--AEAGAVE-------NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLR 1617
E+DEFG+ D E + E N+ EN E + + KS EKSK+ SRS++
Sbjct: 131 EDDEFGKNVDDGRECNSHETLTPNSDNENPKVENNVHEGDNTRKKSREKSKERDSSRSMK 190
Query: 1618 LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYID 1657
LF++ + FVK++LKPSWRQGNMSKEAFKTIVK+ VDKVS +M+ +IPKS+AKI++YID
Sbjct: 191 LFKVVLTKFVKDLLKPSWRQGNMSKEAFKTIVKRVVDKVSNSMEGRRIPKSRAKIDKYID 250
BLAST of MC06g0242 vs. TAIR 10
Match:
AT3G26850.2 (histone-lysine N-methyltransferases)
HSP 1 Score: 160.6 bits (405), Expect = 1.1e-38
Identity = 117/259 (45.17%), Positives = 150/259 (57.92%), Query Frame = 0
Query: 1438 GSGLTTTSPKSTKPSGKHLPSSG--GDQYDPLFDSIEPS-------------------PP 1497
GS ++SP S K GK +P G GD YDP DS EP+ P
Sbjct: 11 GSRQASSSPYSGK--GKIVPECGLVGDMYDPFVDSFEPASVKLDCVQEHEPDNDLCIVPK 70
Query: 1498 ITKKSDRIRKLEKAR---------ESHMMTRLG-GSHKLPDVEENNKHKEVAAVASTTSL 1557
+ S+R +E+ ES M R+ S+K DVEEN E+ V S
Sbjct: 71 ASISSNRPLSMEENNQAVDKEPLCESEMTARVSVSSNKPADVEENTAGIEIGEVVSG--- 130
Query: 1558 ENDEFGETAD--AEAGAVE-------NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLR 1617
E+DEFG+ D E + E N+ EN E + + KS EKSK+ SRS++
Sbjct: 131 EDDEFGKNVDDGRECNSHETLTPNSDNENPKVENNVHEGDNTRKKSREKSKERDSSRSMK 190
Query: 1618 LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYID 1657
LF++ + FVK++LKPSWRQGNMSKEAFKTIVK+ VDKVS +M+ +IPKS+AKI++YID
Sbjct: 191 LFKVVLTKFVKDLLKPSWRQGNMSKEAFKTIVKRVVDKVSNSMEGRRIPKSRAKIDKYID 250
BLAST of MC06g0242 vs. TAIR 10
Match:
AT3G18640.1 (Zinc finger C-x8-C-x5-C-x3-H type family protein)
HSP 1 Score: 99.0 bits (245), Expect = 3.8e-20
Identity = 58/140 (41.43%), Positives = 87/140 (62.14%), Query Frame = 0
Query: 1525 SLENDEFGETADAEAGAVE------NDLDDEENLTGEIEI-DQVKSSEKSKKSKGSRSLR 1584
SL+ E G+ EA E D +D EN+ E E D S E++KK K + +R
Sbjct: 537 SLDPKENGDKKTDEASKEEEGKKTGEDTNDAENVVDEDEDGDDDGSDEENKKEKDPKGMR 596
Query: 1585 LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYID 1644
F+ A+ + VKE+LKP+W++G ++K+ +K IVKK +KV+G M+S +P++Q KI+ Y+
Sbjct: 597 AFKFALVEVVKELLKPAWKEGKLNKDGYKNIVKKVAEKVTGTMQSGNVPQTQEKIDHYLS 656
Query: 1645 SSQRKLTKLVMGYVDKYVKT 1658
+S+ KLTKLV YV K KT
Sbjct: 657 ASKPKLTKLVQAYVGKIKKT 676
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022135323.1 | 0.0 | 98.75 | uncharacterized protein LOC111007314 [Momordica charantia]
XP_023531277.1 | 0.0 | 77.27 | uncharacterized protein LOC111793568 [Cucurbita pepo subsp. pepo] >XP_023531278....
XP_022931323.1 | 0.0 | 77.36 | uncharacterized protein LOC111437543 [Cucurbita moschata] >XP_022931325.1 unchar...
KAG6587592.1 | 0.0 | 77.00 | Zinc finger CCCH domain-containing protein 55, partial [Cucurbita argyrosperma s...
XP_023001661.1 | 0.0 | 76.23 | serine/arginine repetitive matrix protein 2-like [Cucurbita maxima] >XP_02300166...
Match Name | E-value | Identity (%) | Description
A0A6J1C4H9 | 0.0 | 98.75 | uncharacterized protein LOC111007314 OS=Momordica charantia OX=3673 GN=LOC111007...
A0A6J1EZ49 | 0.0 | 77.36 | uncharacterized protein LOC111437543 OS=Cucurbita moschata OX=3662 GN=LOC1114375...
A0A6J1KND4 | 0.0 | 76.23 | serine/arginine repetitive matrix protein 2-like OS=Cucurbita maxima OX=3661 GN=...
A0A5A7UQ65 | 0.0 | 75.71 | Serine/arginine repetitive matrix protein 2 OS=Cucumis melo var. makuwa OX=11946...
A0A0A0LRV0 | 0.0 | 74.46 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G014360 PE=4 SV=1
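
The identity percentages in the summary tables are the ratio of identical positions to the aligned length given in each HSP's statistics line. As a minimal sketch, assuming the statistics line keeps the layout seen in this report, the ratios can be recomputed as follows. The function name parse_hsp_stats and the dictionary keys are illustrative and not part of any BLAST tool; the pattern also accepts the variant spelling "Postives".

```python
import re

# Matches the per-HSP statistics line as laid out in this report.
# "Posi?tives" accepts both "Positives" and the variant "Postives".
STATS_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if no match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the raw counts rather than trusted from the text.
        "identity_pct": 100.0 * int(ident) / int(ident_len),
        "positives_pct": 100.0 * int(pos) / int(pos_len),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Statistics line of the Cucurbita moschata hit (A0A6J1EZ49) above.
example = (
    "Identity = 1295/1674 (77.36%), Positives = 1389/1674 (82.97%), "
    "Query Frame = 0"
)
stats = parse_hsp_stats(example)
print(round(stats["identity_pct"], 2), round(stats["positives_pct"], 2))
```

Comparing the recomputed values with the reported ones is a quick consistency check when these lines are copied between reports.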