Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTATGGTCCAGCAAATTATGCTTCTCAGTTCGGGCAGGGTCCTCAGAAACCATGGCCACCTGCATACCAACAGCGTGCAGTTGCACCTCCTCCGCCTCCTCCTCCTACCTCATATATGCAACCAGGCCCTCCAATCCCATCACGTCCTATAACTCAACAAGCACCTGCCCCCCCACCTCAGGCTGGTCAACCTCTTCATCTTTCTCAGTCGGGTCCTCACGTTCCACCACCTCCACTCTGTCAGGGCCCATCCGTTCAAGTGCTACCTGGTGGGATACCAAACATCCGTCAAACTTATTTTCACACTTTTCCACCAGTCCATGGAAGCACACAAGGTTTTCAATTTAACTCAAGTACTCAGCAGAATGTACAACTTTCACAGTCAGGAGTTCAGAACATGCATCATGTTCTACCTCCACCACCCCCGCTACCACCACCACCACCACCACCTCCTCCTCATGCTCCTAATCCACCACCACCCCCTCCTCATGCTCCTAATCCAGATTTATTACGGCCTCCACAACCCTCTACAGTAGTACCTGTTCATCCTCCTTCACAAGGACAAACATTGTATGGAACTCGAGTTCATCCACCATTGCAACAAGGTGGTTTGCAGGTCTTTCCCTCTATACCACAACATCCAACAACGTCCAACTTCCCTACTCCTCCTTTTGGAGGAGTTATGCAATCAAATCTTGGAGAGTCTCATTTGTCTCCAATGGCTCCTCCACCACCACCATCCTCTCCGCCACCTATTCCACCTTCTCCCCCTCCTCCCACGTCGCCCTCCTTTTATTCAATTCCAAGCTCGGGTTCTTCCAACTTGTTGTGCCAGTCTGAATTTGATCCTAGTTCTACCATTAATTCTAGTAAAGAATTGAAAGCCTTTGAGTCTAATCAAGGAGGCACACCTACCAGACATTTAGGGGATAACGGACCCAAGCATAAGCATAGAAATTTGGATGGCAGTATTGGCCTTATGATGGGTTCTAAAGTGGACAATGAGATATTGTCAGATAAAGGTAATGTGCAGGATCTTCCTCCATCTCCACCTAAGCCAAAAGATGATAAAATTACCAGGAAGATAGAAGTATTATGCAAGTACATTGCTAATAATGGTTCCAGTTTTGAAGACACTACTCGCCGAAAGGAATTTGGGAATCCAGAGTTTGAATTTTTATATGGTGGTAAACCGGGAAGTGAAGCTGCAATTGGTCATGAATATTTTCTGTGGATGAAGAAGAAATATAGTTTGGATTGCAAAAATAAAGAAATGGAAGAAAAATCTCCAGTGAGATCTTTAAGAATTGGGCCACAATCTGAGTCTTTGACAGTGTCAGCAGCATCCATTTCACCTGAAAATTCCGACATGGAGATGGAAGGTTAGTCTTAGTGTTAACAAATCTCTTCCTATACGGTAACGTTTGACATTGGTGCTGAGTTTATAAGTCTTTTTTCTGGGTTGATTGTTTTCCCTTAACAGATGTAGCGTGATCTATAACCATGAAAGCTTATTACTTGAGACAATCAGAGTGAAATCTATGAAGCCAAGGCAAATTCTCTGCAGATTGCAGGGATTTTGAGAAACTTACTATTATATTCATCACATGAAAATATTGTGATACTTTTGTTTCATGTGTATACAGCTCAGCTGCTTCTACTAAATTTCTGGTTATAAACGCATCATTCATTTAATTAAAGAGCACATTTAAAAAAATATATATTTGGGCTAGGAATTTAAAGGAGTTATATTTCTTGGCTATTTGGATCTTAGGCATTATTGAAATCCATTTTGTATCCTGTTAAATAGAATGTCTCGTATGTTTCTGGTAATTGTGTTTCTTTATATCTGTTTATCTGTACTTCTGTTTTAAGAGCTTCAATTTTGCTGTGAAGAAAATTTCTCCAATTTAAGTTGATAGGTCAGTTTTCTAATCTAATTTTGATCGAAAATTTTGTCCAGTTGGGAGATAAGAATAAGAAAAACTCAAGGAATAT
GGTTGTGACGTTGTTTATTTGTAGCATGAGCTCTGTAAATGCTTAATGCAAGTACCTTGTAGTAACTGGAGTACTGATTTTTTTTCCTCCTTGTTTTGTGTGGTTTCTTACTGGTGGCTTCAAGATGATATTACCCCAGATGGTATAGGGGAGGAAACTAGTCACTCCTTTGAAATTCAAAGCTACGAGTGCAAATCAAGAAAAGAAGAGCATGATGCAAAGGATCAGTTACAAGGACCTGAAGATTTGCAAAGAAGCAGCCCAGTGAAGGGGAAAGTAGCTGAAGGTATGCATTTCTATCTTTTTGCGTTTCTTGTTGAATGGGTTCATGCATGGCACAGATTTTGCTTTGTGATATTGTGGTGCTTTGTGTCTTACTGTCAACAAAGTTGAGATAAAATATTGTCTTTTCACTTAATGAATTCGAAGTAGTTAAATAAGAGTTATATTTCTCTTGCTGCTGCATCTTTTAAAAAAATGAATATGGCAAGACAGAATCTTTTCTTCTTCTTAGAAGGGTTCCACTGGGGTAATAGATAAGAGATTCTGACATCACTAGGAAGAACTCTGCTTATTAAATTATACTTGTGTTTTTCTTTCTGCTGAAAAATTTGGACAATGTAAAGCTATGGTTTGGATATCAAGCATAGTATTCTTCAGTTTTTGCTTTTACTTCTGTTCAAATATGCAGTTCCACAGTTCCTTTCAATTCAACCTAGTTGTGCCATGCAGAGAGGATTTTGGACGAACCTGGGTGAGAGTTAAATTAATTCCGCAGGAACTTTGTAGTTTGTCAGATTAAGTCACGTGAATAAAACCAATGACTCTGATCTGCTATCCTGGTTCTGAAAAGATGCATTGATTTGACAACACATAAAAGTTTTCCTTCAAGGTTTTTGATTTGTGAATTTCTTATGAAGTTTGTTTCTTTTATCGTATTCCTCATGTTGAGATCGCAATCATGGGATTACTGCTTTTTCATCCTCATTTTTGTATGTCTAATAAGCATCTACACTTCCTATTGAAGGATTTTTTTTTTTTTTTTTTTTAAAGAAAAAGATGGAGAGTCAAAGCTTCTGCTCGACCATGAAAAATCTGTCAGCCTGGAAGCTTGCCAAGTTCACAGCCCTGTCATAAATACTGCTGGAGTTGTTGAACAGCCTTTAGGAAGCAATTTTGAGATTTCTGTTACCTGCCTACAAAATGAAAAAAGTCTAGCAGCTTCTGAAGCTGTTAATTCTAGTCCGTCTACTGAACTTATTATTGGTGGCAGCCCATTCAGACTTATACAGGACTATGCTTCTGATGAAAATTCAGAAACCGATGAGGAATCACACCTTAAAGATGTCAGTTTTGCTATCTCACCTTCAACTCCAGCTTCTTCCAAGACTTCCGGAAAAGACAGTGATAATTTGACTATTCTGGGATCAAAAGGTTCTTGCCAGGTTCAACGGAGTAATGTTCCACCTTGTGAAGCCTCGATGCCTGATTTTGGTTCTCAGTTCCTCTCGGAATCACCAAAACAGATTTTTGATGCCAATGAGGCAAATGTGAGAAGGGCAGGGAATGAACGGAACTATAAAATTCATCAGAATCAAGTTGGCACCCGCACCAGTTCTAAGTCTTTGGATGCAGATGCAGTGAAAGGTCGTAGTGTTGATGTTCTCCATGATAGTCACAAGTTACAAAAAGAGAATGATGAGGAAAAGCTGAAGTTTGGATCATCGCCTGTAAAAATAGATGAGTTTGGAAGATTAGTCAGAGAAGGTGGTAGCGATAGCGATTCAGATGATTCACACTATACAAGGAGACATAAGAAAAGAAGAACTAGAAATAGTAGTGAAAGTCATTCTCCTGTTGACAGGAGGAGGGGGCGAAGGAGTCCGTGGAGAAGAAGGCAGAGGCGAAGTCGCTCACGCAGGTAACAATACTTTTCGCATGTATATATTGTTACTTAAGTGTGAGTTAATTCACTCAATGTTATTAAGGAACAATGTAGC
CTAAGCAATTATGTCAGATTAGATATGCACTTGTTTCGTTAACTTTTGAGATTATTGTTAATAGAAAATTTTCTCTTCTATTGTTTCAGTTGGTCTCCTCGTAATCAAAGAGGCAGAAGCAGAAGCAGGTCTCCTGTCAGCAGGCGTACAAGTCAGTTTAATAATGAGAATATGAAACGAGATAAGGGTATGATACGAAAATGCTTTGACTTTCAGCGAGGTAGGTGCTATAGAGGAGCATCTTGTCGCTATGTGCACCATGAACCCAGCAAGAATGATGGATCAAGACACCATAGGAGCAAACATCATGATGTTCATCCAACTTCAGAGAATATAAAGGGTAGAGAGGACACTGTGAACATGTCTAGGGAAGTATCAGATCTTGGGCATATTAAAGTTGAGAATCAGGGGTGCATCCAGCATAATGTGTCCCCAAAAGATGATACCCATGATTGGAAGAAAGGTAGTCCCACTGGTGATCCAGGTTTAGATGTAACTAAATGTCAAAGTTCTAGAGATAGAGCTGGCTTAGTTCAAGAAGAATTAATTTATTCCAAAGCAGCAGAGGCTGTCCACATTCACGTTAACGAGAATATTCAAGAAGCAGGGAAGTCTTATGAGCAACTTTCAGTTACGGCTGCCTCACAATGCATGAGCAATGCTGATACTGAGAAATTATCGGGTGATATTTCCATGAGCATGCTGACTTCTGTAGAGAGGTCTTTGGCTCATGCTCAGCAATCCAACATGTTTGCTTCAGAGTTTGAAGCTGCCAATAGTGTCTCACACCAAATGGAAGGTTCATTTGTCTCCCATTTGTTACCCGATCAAGTAACTGCTGTCAGTACCAATAAGGCTCCTGAATGTGAACATTTTCCTGATAAAAATTCATTAATTAAGCTCCAGTTTGATACCAGTTCTGCTGGTCAGCAGCCTTCGACCTCACAATTTTTATCTGAGTCTCCAGTACCAAAATCATTATCTGCTACTGCTCCTGGTTGTGCTATGGATGATGCCCATCCTCTAAGAGAGCTGCCCCCTCCGCCTCCTCTCCCAACTTCATGTGTCACTAGTGCTGATGTTCTAATGCCTACCCCCTATAACTTTGTGTCACAAAATGTGTCCTTTCCTTCTAAGCCTTCTCTACCGGGAGGTTTTCAACCTCATCAGGATATCGTATCCATCCAATCATCTCATTACCACAGCACCACTTTCCCGCCTTCTAGACCATTATATGATCCAACAATGGCTCATGTAACCACCAAAGATGGTACGCCAATGCAATTTCATCAGAGTCATTTGTCTCAAGGAAGTGATCTGGGTTCTCAGTCTGTTATGAAATCCCAGCCATTGGTGTCAAATTCTCATTCTATGCTTGGTGAGTCTCCAGTTCGGGAGCCTTATAGAGCTCCATTGCACATGGATGAAATTAGATCAATCGCCCCAGTTGCAAATAATCGACCTATTCAGCCCTTTGGATTCCCCAGCTTTCAGAAAGAAGAAAACTTTGGGCGGACTTCTGTGGAGATGAGTTCTTCTAGTTTTTTTCCCCATCGGAACTTTAATGATCAATCTATGCCCTTCACAAATGCAAATAGAATGCAATCTTCTGGTGACAATTTTCCTCCCAGTGAATTTCGAAGTTCATTTTCACAGTTTCATTCTTATTCACGGTTCCAACAGCCATTATATGCCTCACAATCTGCACATGATAGTTTTTTACATGGCCCAAGTCAGATTGGTACTATATCTCGACATTATCCTGATCCTCTAAGCAGGAACCATTCGTCTTTGCTTCCTGATTTTGGGGGTTTGGGTATTACCACTTATCATAATCCTTATGCGTCTACTTTTGACAAGCCACTTAGCTCCAACTTCAGATCTAACATTTTGAACTTTGGAAATGATGCACCTAGTGGTGATATACGTGATTCTACTTTCAATTTGAGCAATGTTCGAGTTGATGGGCAAGGTGCTAATTATGTTGGATCAGGGCTGAC
AACTACTTCGCCAAAATCCACCAAACCTTCGGGGAAACACTTGCCCAGCTCAGGTGGTGATCAGTATGATCCACTCTTTGACAGCATTGAGCCATCACCACCTATTACCAAGAAATCTGATCGTGTTCGAAAGCTGGAAAAAGCAAGAGAATCTCATATGATGACAAGACTTGGTGGTTCCCATAAATTACCAGATGTGGAGGAGAACAACAAGCATAAGGAGGTTGCTGCTGTGGCTTCAACTACTTCTCTGGAGAATGATGAATTTGGGGAGACAGCAGATGCAGAAGCTGGTGCTGTTGAGAATGACCTTGATGACGAAGAAAACTTAACCGGAGAGATTGAAATTGATCAGGTTAAGTCCTCAGAGAAGAGCAAGAAATCCAAAGGTTCCAGGTCGCTGAGGCTTTTCAGGATTGCTATTGCCGATTTTGTGAAGGAAGTTCTAAAACCATCATGGCGACAGGGCAATATGAGCAAGGAAGCTTTTAAGACAATTGTCAAGAAGACTGTTGACAAAGTATCTGGAGCTATGAAGAGTCACCAAATACCCAAGTCTCAAGCAAAGATAAATAGATACATTGATTCGTCACAACGAAAACTGACAAAGCTTGTTATGGTACGTTATCTTTCTCTC
mRNA sequence
ATGTATGGTCCAGCAAATTATGCTTCTCAGTTCGGGCAGGGTCCTCAGAAACCATGGCCACCTGCATACCAACAGCGTGCAGTTGCACCTCCTCCGCCTCCTCCTCCTACCTCATATATGCAACCAGGCCCTCCAATCCCATCACGTCCTATAACTCAACAAGCACCTGCCCCCCCACCTCAGGCTGGTCAACCTCTTCATCTTTCTCAGTCGGGTCCTCACGTTCCACCACCTCCACTCTGTCAGGGCCCATCCGTTCAAGTGCTACCTGGTGGGATACCAAACATCCGTCAAACTTATTTTCACACTTTTCCACCAGTCCATGGAAGCACACAAGGTTTTCAATTTAACTCAAGTACTCAGCAGAATGTACAACTTTCACAGTCAGGAGTTCAGAACATGCATCATGTTCTACCTCCACCACCCCCGCTACCACCACCACCACCACCACCTCCTCCTCATGCTCCTAATCCACCACCACCCCCTCCTCATGCTCCTAATCCAGATTTATTACGGCCTCCACAACCCTCTACAGTAGTACCTGTTCATCCTCCTTCACAAGGACAAACATTGTATGGAACTCGAGTTCATCCACCATTGCAACAAGGTGGTTTGCAGGTCTTTCCCTCTATACCACAACATCCAACAACGTCCAACTTCCCTACTCCTCCTTTTGGAGGAGTTATGCAATCAAATCTTGGAGAGTCTCATTTGTCTCCAATGGCTCCTCCACCACCACCATCCTCTCCGCCACCTATTCCACCTTCTCCCCCTCCTCCCACGTCGCCCTCCTTTTATTCAATTCCAAGCTCGGGTTCTTCCAACTTGTTGTGCCAGTCTGAATTTGATCCTAGTTCTACCATTAATTCTAGTAAAGAATTGAAAGCCTTTGAGTCTAATCAAGGAGGCACACCTACCAGACATTTAGGGGATAACGGACCCAAGCATAAGCATAGAAATTTGGATGGCAGTATTGGCCTTATGATGGGTTCTAAAGTGGACAATGAGATATTGTCAGATAAAGGTAATGTGCAGGATCTTCCTCCATCTCCACCTAAGCCAAAAGATGATAAAATTACCAGGAAGATAGAAGTATTATGCAAGTACATTGCTAATAATGGTTCCAGTTTTGAAGACACTACTCGCCGAAAGGAATTTGGGAATCCAGAGTTTGAATTTTTATATGGTGGTAAACCGGGAAGTGAAGCTGCAATTGGTCATGAATATTTTCTGTGGATGAAGAAGAAATATAGTTTGGATTGCAAAAATAAAGAAATGGAAGAAAAATCTCCAGTGAGATCTTTAAGAATTGGGCCACAATCTGAGTCTTTGACAGTGTCAGCAGCATCCATTTCACCTGAAAATTCCGACATGGAGATGGAAGATGATATTACCCCAGATGGTATAGGGGAGGAAACTAGTCACTCCTTTGAAATTCAAAGCTACGAGTGCAAATCAAGAAAAGAAGAGCATGATGCAAAGGATCAGTTACAAGGACCTGAAGATTTGCAAAGAAGCAGCCCAGTGAAGGGGAAAGTAGCTGAAGAAAAAGATGGAGAGTCAAAGCTTCTGCTCGACCATGAAAAATCTGTCAGCCTGGAAGCTTGCCAAGTTCACAGCCCTGTCATAAATACTGCTGGAGTTGTTGAACAGCCTTTAGGAAGCAATTTTGAGATTTCTGTTACCTGCCTACAAAATGAAAAAAGTCTAGCAGCTTCTGAAGCTGTTAATTCTAGTCCGTCTACTGAACTTATTATTGGTGGCAGCCCATTCAGACTTATACAGGACTATGCTTCTGATGAAAATTCAGAAACCGATGAGGAATCACACCTTAAAGATGTCAGTTTTGCTATCTCACCTTCAACTCCAGCTTCTTCCAAGACTTCCGGAAAAGACAGTGATAATTTGACTATTCTGGGATCAAAAGGTTCTTGCCAGGTTCAACGGAGTAATGTTCCACCTTGTGAAGCCTCGATGCCTGATTTTGGTTCTCAGTTCCT
CTCGGAATCACCAAAACAGATTTTTGATGCCAATGAGGCAAATGTGAGAAGGGCAGGGAATGAACGGAACTATAAAATTCATCAGAATCAAGTTGGCACCCGCACCAGTTCTAAGTCTTTGGATGCAGATGCAGTGAAAGGTCGTAGTGTTGATGTTCTCCATGATAGTCACAAGTTACAAAAAGAGAATGATGAGGAAAAGCTGAAGTTTGGATCATCGCCTGTAAAAATAGATGAGTTTGGAAGATTAGTCAGAGAAGGTGGTAGCGATAGCGATTCAGATGATTCACACTATACAAGGAGACATAAGAAAAGAAGAACTAGAAATAGTAGTGAAAGTCATTCTCCTGTTGACAGGAGGAGGGGGCGAAGGAGTCCGTGGAGAAGAAGGCAGAGGCGAAGTCGCTCACGCAGTTGGTCTCCTCGTAATCAAAGAGGCAGAAGCAGAAGCAGGTCTCCTGTCAGCAGGCGTACAAGTCAGTTTAATAATGAGAATATGAAACGAGATAAGGGTATGATACGAAAATGCTTTGACTTTCAGCGAGGTAGGTGCTATAGAGGAGCATCTTGTCGCTATGTGCACCATGAACCCAGCAAGAATGATGGATCAAGACACCATAGGAGCAAACATCATGATGTTCATCCAACTTCAGAGAATATAAAGGGTAGAGAGGACACTGTGAACATGTCTAGGGAAGTATCAGATCTTGGGCATATTAAAGTTGAGAATCAGGGGTGCATCCAGCATAATGTGTCCCCAAAAGATGATACCCATGATTGGAAGAAAGGTAGTCCCACTGGTGATCCAGGTTTAGATGTAACTAAATGTCAAAGTTCTAGAGATAGAGCTGGCTTAGTTCAAGAAGAATTAATTTATTCCAAAGCAGCAGAGGCTGTCCACATTCACGTTAACGAGAATATTCAAGAAGCAGGGAAGTCTTATGAGCAACTTTCAGTTACGGCTGCCTCACAATGCATGAGCAATGCTGATACTGAGAAATTATCGGGTGATATTTCCATGAGCATGCTGACTTCTGTAGAGAGGTCTTTGGCTCATGCTCAGCAATCCAACATGTTTGCTTCAGAGTTTGAAGCTGCCAATAGTGTCTCACACCAAATGGAAGGTTCATTTGTCTCCCATTTGTTACCCGATCAAGTAACTGCTGTCAGTACCAATAAGGCTCCTGAATGTGAACATTTTCCTGATAAAAATTCATTAATTAAGCTCCAGTTTGATACCAGTTCTGCTGGTCAGCAGCCTTCGACCTCACAATTTTTATCTGAGTCTCCAGTACCAAAATCATTATCTGCTACTGCTCCTGGTTGTGCTATGGATGATGCCCATCCTCTAAGAGAGCTGCCCCCTCCGCCTCCTCTCCCAACTTCATGTGTCACTAGTGCTGATGTTCTAATGCCTACCCCCTATAACTTTGTGTCACAAAATGTGTCCTTTCCTTCTAAGCCTTCTCTACCGGGAGGTTTTCAACCTCATCAGGATATCGTATCCATCCAATCATCTCATTACCACAGCACCACTTTCCCGCCTTCTAGACCATTATATGATCCAACAATGGCTCATGTAACCACCAAAGATGGTACGCCAATGCAATTTCATCAGAGTCATTTGTCTCAAGGAAGTGATCTGGGTTCTCAGTCTGTTATGAAATCCCAGCCATTGGTGTCAAATTCTCATTCTATGCTTGGTGAGTCTCCAGTTCGGGAGCCTTATAGAGCTCCATTGCACATGGATGAAATTAGATCAATCGCCCCAGTTGCAAATAATCGACCTATTCAGCCCTTTGGATTCCCCAGCTTTCAGAAAGAAGAAAACTTTGGGCGGACTTCTGTGGAGATGAGTTCTTCTAGTTTTTTTCCCCATCGGAACTTTAATGATCAATCTATGCCCTTCACAAATGCAAATAGAATGCAATCTTCTGGTGACAATTTTCCTCCCAGTGAATTTCGAAGTTCATTTTCACAGTTTCATTCTTATTCACGGT
TCCAACAGCCATTATATGCCTCACAATCTGCACATGATAGTTTTTTACATGGCCCAAGTCAGATTGGTACTATATCTCGACATTATCCTGATCCTCTAAGCAGGAACCATTCGTCTTTGCTTCCTGATTTTGGGGGTTTGGGTATTACCACTTATCATAATCCTTATGCGTCTACTTTTGACAAGCCACTTAGCTCCAACTTCAGATCTAACATTTTGAACTTTGGAAATGATGCACCTAGTGGTGATATACGTGATTCTACTTTCAATTTGAGCAATGTTCGAGTTGATGGGCAAGGTGCTAATTATGTTGGATCAGGGCTGACAACTACTTCGCCAAAATCCACCAAACCTTCGGGGAAACACTTGCCCAGCTCAGGTGGTGATCAGTATGATCCACTCTTTGACAGCATTGAGCCATCACCACCTATTACCAAGAAATCTGATCGTGTTCGAAAGCTGGAAAAAGCAAGAGAATCTCATATGATGACAAGACTTGGTGGTTCCCATAAATTACCAGATGTGGAGGAGAACAACAAGCATAAGGAGGTTGCTGCTGTGGCTTCAACTACTTCTCTGGAGAATGATGAATTTGGGGAGACAGCAGATGCAGAAGCTGGTGCTGTTGAGAATGACCTTGATGACGAAGAAAACTTAACCGGAGAGATTGAAATTGATCAGGTTAAGTCCTCAGAGAAGAGCAAGAAATCCAAAGGTTCCAGGTCGCTGAGGCTTTTCAGGATTGCTATTGCCGATTTTGTGAAGGAAGTTCTAAAACCATCATGGCGACAGGGCAATATGAGCAAGGAAGCTTTTAAGACAATTGTCAAGAAGACTGTTGACAAAGTATCTGGAGCTATGAAGAGTCACCAAATACCCAAGTCTCAAGCAAAGATAAATAGATACATTGATTCGTCACAACGAAAACTGACAAAGCTTGTTATGGTACGTTATCTTTCTCTC
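The gene sequence and the mRNA sequence above share their first stretch and then diverge where the first intron begins. As a rough illustration only, the Python sketch below locates a candidate first intron by comparing the two strings. The function name `first_intron_candidate`, the `probe_len` parameter, and the toy sequences are assumptions made for this example and are not part of the database record. The approach is a heuristic: a short probe can match by chance inside an intron, so real exon coordinates should come from the gene model annotation.

```python
def first_intron_candidate(gene, mrna, probe_len=4):
    """Return (start, end) of a candidate first intron in `gene`.

    Coordinates are 0-based and half-open. Returns None when the mRNA
    never diverges from the gene or the probe cannot be found.
    Heuristic only: it assumes the bases following the shared prefix
    in the mRNA first reappear in the gene at the start of exon 2.
    """
    # Length of the prefix shared by both sequences (end of exon 1).
    start = 0
    limit = min(len(gene), len(mrna))
    while start < limit and gene[start] == mrna[start]:
        start += 1
    if start >= len(mrna):
        return None  # no divergence, so no intron detected

    # Next few mRNA bases, searched for downstream in the gene.
    probe = mrna[start:start + probe_len]
    end = gene.find(probe, start)
    if end == -1:
        return None
    return start, end


# Toy sequences (hypothetical, not taken from the record above).
toy_gene = "ATGGAAGGTTAGTCTTCAAGATGA"
toy_mrna = "ATGGAAGATGA"
span = first_intron_candidate(toy_gene, toy_mrna)
print(span, toy_gene[span[0]:span[1]])
```

In the toy case the excised segment starts with GT and ends with AG, the dinucleotides that commonly flank introns, which gives a quick sanity check on the result.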
Coding sequence (CDS)
ATGTATGGTCCAGCAAATTATGCTTCTCAGTTCGGGCAGGGTCCTCAGAAACCATGGCCACCTGCATACCAACAGCGTGCAGTTGCACCTCCTCCGCCTCCTCCTCCTACCTCATATATGCAACCAGGCCCTCCAATCCCATCACGTCCTATAACTCAACAAGCACCTGCCCCCCCACCTCAGGCTGGTCAACCTCTTCATCTTTCTCAGTCGGGTCCTCACGTTCCACCACCTCCACTCTGTCAGGGCCCATCCGTTCAAGTGCTACCTGGTGGGATACCAAACATCCGTCAAACTTATTTTCACACTTTTCCACCAGTCCATGGAAGCACACAAGGTTTTCAATTTAACTCAAGTACTCAGCAGAATGTACAACTTTCACAGTCAGGAGTTCAGAACATGCATCATGTTCTACCTCCACCACCCCCGCTACCACCACCACCACCACCACCTCCTCCTCATGCTCCTAATCCACCACCACCCCCTCCTCATGCTCCTAATCCAGATTTATTACGGCCTCCACAACCCTCTACAGTAGTACCTGTTCATCCTCCTTCACAAGGACAAACATTGTATGGAACTCGAGTTCATCCACCATTGCAACAAGGTGGTTTGCAGGTCTTTCCCTCTATACCACAACATCCAACAACGTCCAACTTCCCTACTCCTCCTTTTGGAGGAGTTATGCAATCAAATCTTGGAGAGTCTCATTTGTCTCCAATGGCTCCTCCACCACCACCATCCTCTCCGCCACCTATTCCACCTTCTCCCCCTCCTCCCACGTCGCCCTCCTTTTATTCAATTCCAAGCTCGGGTTCTTCCAACTTGTTGTGCCAGTCTGAATTTGATCCTAGTTCTACCATTAATTCTAGTAAAGAATTGAAAGCCTTTGAGTCTAATCAAGGAGGCACACCTACCAGACATTTAGGGGATAACGGACCCAAGCATAAGCATAGAAATTTGGATGGCAGTATTGGCCTTATGATGGGTTCTAAAGTGGACAATGAGATATTGTCAGATAAAGGTAATGTGCAGGATCTTCCTCCATCTCCACCTAAGCCAAAAGATGATAAAATTACCAGGAAGATAGAAGTATTATGCAAGTACATTGCTAATAATGGTTCCAGTTTTGAAGACACTACTCGCCGAAAGGAATTTGGGAATCCAGAGTTTGAATTTTTATATGGTGGTAAACCGGGAAGTGAAGCTGCAATTGGTCATGAATATTTTCTGTGGATGAAGAAGAAATATAGTTTGGATTGCAAAAATAAAGAAATGGAAGAAAAATCTCCAGTGAGATCTTTAAGAATTGGGCCACAATCTGAGTCTTTGACAGTGTCAGCAGCATCCATTTCACCTGAAAATTCCGACATGGAGATGGAAGATGATATTACCCCAGATGGTATAGGGGAGGAAACTAGTCACTCCTTTGAAATTCAAAGCTACGAGTGCAAATCAAGAAAAGAAGAGCATGATGCAAAGGATCAGTTACAAGGACCTGAAGATTTGCAAAGAAGCAGCCCAGTGAAGGGGAAAGTAGCTGAAGAAAAAGATGGAGAGTCAAAGCTTCTGCTCGACCATGAAAAATCTGTCAGCCTGGAAGCTTGCCAAGTTCACAGCCCTGTCATAAATACTGCTGGAGTTGTTGAACAGCCTTTAGGAAGCAATTTTGAGATTTCTGTTACCTGCCTACAAAATGAAAAAAGTCTAGCAGCTTCTGAAGCTGTTAATTCTAGTCCGTCTACTGAACTTATTATTGGTGGCAGCCCATTCAGACTTATACAGGACTATGCTTCTGATGAAAATTCAGAAACCGATGAGGAATCACACCTTAAAGATGTCAGTTTTGCTATCTCACCTTCAACTCCAGCTTCTTCCAAGACTTCCGGAAAAGACAGTGATAATTTGACTATTCTGGGATCAAAAGGTTCTTGCCAGGTTCAACGGAGTAATGTTCCACCTTGTGAAGCCTCGATGCCTGATTTTGGTTCTCAGTTCCT
CTCGGAATCACCAAAACAGATTTTTGATGCCAATGAGGCAAATGTGAGAAGGGCAGGGAATGAACGGAACTATAAAATTCATCAGAATCAAGTTGGCACCCGCACCAGTTCTAAGTCTTTGGATGCAGATGCAGTGAAAGGTCGTAGTGTTGATGTTCTCCATGATAGTCACAAGTTACAAAAAGAGAATGATGAGGAAAAGCTGAAGTTTGGATCATCGCCTGTAAAAATAGATGAGTTTGGAAGATTAGTCAGAGAAGGTGGTAGCGATAGCGATTCAGATGATTCACACTATACAAGGAGACATAAGAAAAGAAGAACTAGAAATAGTAGTGAAAGTCATTCTCCTGTTGACAGGAGGAGGGGGCGAAGGAGTCCGTGGAGAAGAAGGCAGAGGCGAAGTCGCTCACGCAGTTGGTCTCCTCGTAATCAAAGAGGCAGAAGCAGAAGCAGGTCTCCTGTCAGCAGGCGTACAAGTCAGTTTAATAATGAGAATATGAAACGAGATAAGGGTATGATACGAAAATGCTTTGACTTTCAGCGAGGTAGGTGCTATAGAGGAGCATCTTGTCGCTATGTGCACCATGAACCCAGCAAGAATGATGGATCAAGACACCATAGGAGCAAACATCATGATGTTCATCCAACTTCAGAGAATATAAAGGGTAGAGAGGACACTGTGAACATGTCTAGGGAAGTATCAGATCTTGGGCATATTAAAGTTGAGAATCAGGGGTGCATCCAGCATAATGTGTCCCCAAAAGATGATACCCATGATTGGAAGAAAGGTAGTCCCACTGGTGATCCAGGTTTAGATGTAACTAAATGTCAAAGTTCTAGAGATAGAGCTGGCTTAGTTCAAGAAGAATTAATTTATTCCAAAGCAGCAGAGGCTGTCCACATTCACGTTAACGAGAATATTCAAGAAGCAGGGAAGTCTTATGAGCAACTTTCAGTTACGGCTGCCTCACAATGCATGAGCAATGCTGATACTGAGAAATTATCGGGTGATATTTCCATGAGCATGCTGACTTCTGTAGAGAGGTCTTTGGCTCATGCTCAGCAATCCAACATGTTTGCTTCAGAGTTTGAAGCTGCCAATAGTGTCTCACACCAAATGGAAGGTTCATTTGTCTCCCATTTGTTACCCGATCAAGTAACTGCTGTCAGTACCAATAAGGCTCCTGAATGTGAACATTTTCCTGATAAAAATTCATTAATTAAGCTCCAGTTTGATACCAGTTCTGCTGGTCAGCAGCCTTCGACCTCACAATTTTTATCTGAGTCTCCAGTACCAAAATCATTATCTGCTACTGCTCCTGGTTGTGCTATGGATGATGCCCATCCTCTAAGAGAGCTGCCCCCTCCGCCTCCTCTCCCAACTTCATGTGTCACTAGTGCTGATGTTCTAATGCCTACCCCCTATAACTTTGTGTCACAAAATGTGTCCTTTCCTTCTAAGCCTTCTCTACCGGGAGGTTTTCAACCTCATCAGGATATCGTATCCATCCAATCATCTCATTACCACAGCACCACTTTCCCGCCTTCTAGACCATTATATGATCCAACAATGGCTCATGTAACCACCAAAGATGGTACGCCAATGCAATTTCATCAGAGTCATTTGTCTCAAGGAAGTGATCTGGGTTCTCAGTCTGTTATGAAATCCCAGCCATTGGTGTCAAATTCTCATTCTATGCTTGGTGAGTCTCCAGTTCGGGAGCCTTATAGAGCTCCATTGCACATGGATGAAATTAGATCAATCGCCCCAGTTGCAAATAATCGACCTATTCAGCCCTTTGGATTCCCCAGCTTTCAGAAAGAAGAAAACTTTGGGCGGACTTCTGTGGAGATGAGTTCTTCTAGTTTTTTTCCCCATCGGAACTTTAATGATCAATCTATGCCCTTCACAAATGCAAATAGAATGCAATCTTCTGGTGACAATTTTCCTCCCAGTGAATTTCGAAGTTCATTTTCACAGTTTCATTCTTATTCACGGT
TCCAACAGCCATTATATGCCTCACAATCTGCACATGATAGTTTTTTACATGGCCCAAGTCAGATTGGTACTATATCTCGACATTATCCTGATCCTCTAAGCAGGAACCATTCGTCTTTGCTTCCTGATTTTGGGGGTTTGGGTATTACCACTTATCATAATCCTTATGCGTCTACTTTTGACAAGCCACTTAGCTCCAACTTCAGATCTAACATTTTGAACTTTGGAAATGATGCACCTAGTGGTGATATACGTGATTCTACTTTCAATTTGAGCAATGTTCGAGTTGATGGGCAAGGTGCTAATTATGTTGGATCAGGGCTGACAACTACTTCGCCAAAATCCACCAAACCTTCGGGGAAACACTTGCCCAGCTCAGGTGGTGATCAGTATGATCCACTCTTTGACAGCATTGAGCCATCACCACCTATTACCAAGAAATCTGATCGTGTTCGAAAGCTGGAAAAAGCAAGAGAATCTCATATGATGACAAGACTTGGTGGTTCCCATAAATTACCAGATGTGGAGGAGAACAACAAGCATAAGGAGGTTGCTGCTGTGGCTTCAACTACTTCTCTGGAGAATGATGAATTTGGGGAGACAGCAGATGCAGAAGCTGGTGCTGTTGAGAATGACCTTGATGACGAAGAAAACTTAACCGGAGAGATTGAAATTGATCAGGTTAAGTCCTCAGAGAAGAGCAAGAAATCCAAAGGTTCCAGGTCGCTGAGGCTTTTCAGGATTGCTATTGCCGATTTTGTGAAGGAAGTTCTAAAACCATCATGGCGACAGGGCAATATGAGCAAGGAAGCTTTTAAGACAATTGTCAAGAAGACTGTTGACAAAGTATCTGGAGCTATGAAGAGTCACCAAATACCCAAGTCTCAAGCAAAGATAAATAGATACATTGATTCGTCACAACGAAAACTGACAAAGCTTGTTATGGTACGTTATCTTTCTCTC
Protein sequence
MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPPQAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSSTQQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVVPVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESNQGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKYSLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFEIQSYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCLQNEKSLAASEAVNSSPSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTILGSKGSCQVQRSNVPPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRSPVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSENIKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDISMSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAPECEHFPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPVREPYRAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFNLSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDRVRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVMVRYLSL
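The protein sequence corresponds to a codon-by-codon translation of the CDS. The following Python sketch shows that relationship on the first 42 bases of the CDS listed above, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are introduced here for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases of the CDS shown above.
cds_start = "ATGTATGGTCCAGCAAATTATGCTTCTCAGTTCGGGCAGGGT"
print(translate(cds_start))  # MYGPANYASQFGQG
```

The output matches the first 14 residues of the protein sequence above. Passing the full CDS to the same function would be one way to check the entire protein record.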
Homology
BLAST of MS002044 vs. NCBI nr
Match:
XP_022135323.1 (uncharacterized protein LOC111007314 [Momordica charantia])
HSP 1 Score: 3132.4 bits (8120), Expect = 0.0e+00
Identity = 1625/1669 (97.36%), Positives = 1637/1669 (98.08%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP
Sbjct: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST
Sbjct: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
Query: 121 QQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLSQSGVQNMHH+LPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV
Sbjct: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
Query: 181 PVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
PVHPPSQGQTLYG RVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP
Sbjct: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
Query: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN
Sbjct: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
Query: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT
Sbjct: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
Query: 361 RKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKYSLD 420
RKI VLCKYIANNGSSFEDTTR+KEFGNPEFEFLYGG+PGSEAAIGHEYFLWMKKKYSLD
Sbjct: 361 RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD 420
Query: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFEIQ 480
CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF+IQ
Sbjct: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ 480
Query: 481 SYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAE---------------------EKDG 540
SYECKSRKEEHDAKDQLQGP+DLQRSSPVKGKVAE EKDG
Sbjct: 481 SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEVPQFLSIQPSCAMQRGFWTNLEKDG 540
Query: 541 ESKLLLDHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCLQNEKSLAASEAVNSS 600
ESKLLL+HEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTC+QNEKSLAASEAVNSS
Sbjct: 541 ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS 600
Query: 601 PSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
STELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL
Sbjct: 601 LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
Query: 661 GSKGSCQVQRSNVPPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVG 720
GS+GSCQVQRSNVPPCEASMPDFGSQFLSESPK IFDANEANVRRAGNERNYKIHQNQVG
Sbjct: 661 GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG 720
Query: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSD 780
TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEK KFGSSPVKIDEFGRLVREGGSDSD
Sbjct: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD 780
Query: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS
Sbjct: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
Query: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD
Sbjct: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
Query: 901 VHPTSENIKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLD 960
VHPTSENIKGREDTVNMSREVSD GHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDP LD
Sbjct: 901 VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD 960
Query: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE
Sbjct: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
Query: 1021 KLSGDISMSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTN 1080
KLSGDISMSMLTSVE+SLAHAQQSNMFASEFEAANSVSHQM+GSFVSHLLPDQVTAVSTN
Sbjct: 1021 KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN 1080
Query: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
KAPECEHFPDKNSLIKLQFDTSSAGQQPST QFLSESPVPKSLSATAPGCAMDDAHPLRE
Sbjct: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
Query: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT
Sbjct: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
Query: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPV 1260
FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSD GSQSVMKSQPLV+NSHSMLGESPV
Sbjct: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV 1260
Query: 1261 REPYRAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
REPYRAPLHMDEIRS APVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ
Sbjct: 1261 REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
Query: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS
Sbjct: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
Query: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD
Sbjct: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
Query: 1441 STFNLSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
STFNLSNVRVDGQGANY GSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK
Sbjct: 1441 STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
Query: 1501 KSDRVRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
KSDR+RKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA
Sbjct: 1501 KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
Query: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN
Sbjct: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
Query: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM
Sbjct: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1669
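The identity and positives percentages in each HSP summary are the listed counts divided by the alignment length. The Python sketch below parses one summary line and recomputes both figures. The regular expression and the function name `parse_hsp_stats` are assumptions written for this page's text layout, not an official BLAST parser. The pattern also accepts the spelling "Postives", which appears in some renderings of this report.

```python
import re

# Matches e.g. "Identity = 1625/1669 (97.36%), Positives = 1637/1669 (98.08%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positive,
        "alignment_length": ident_len,
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


line = "Identity = 1625/1669 (97.36%), Positives = 1637/1669 (98.08%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positives_pct"])  # 97.36 98.08
```

Recomputing the percentages from the raw counts gives a simple consistency check when such lines are collected from many hits.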
BLAST of MS002044 vs. NCBI nr
Match:
XP_023531277.1 (uncharacterized protein LOC111793568 [Cucurbita pepo subsp. pepo] >XP_023531278.1 uncharacterized protein LOC111793568 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2359.3 bits (6113), Expect = 0.0e+00
Identity = 1292/1667 (77.50%), Positives = 1389/1667 (83.32%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPP YQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPTYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-----VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQ 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPVHGSTQ Q
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVHGSTQVSQ 120
Query: 121 FNSSTQQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQ 180
FNS+ QQNVQLS SGVQN HHVLPPPP LPP PPP P HAP+PDLLRPPQ
Sbjct: 121 FNSNAQQNVQLSHSGVQNTHHVLPPPPRLPP-----------PPPRPLHAPSPDLLRPPQ 180
Query: 181 PSTVVPVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLG 240
ST+VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHPTTSNFPTPP FGG+MQSNLG
Sbjct: 181 FSTIVPLHPRSQGQTLYGARINPPLQQGGLQIFPSIPQHPTTSNFPTPPSFGGLMQSNLG 240
Query: 241 ESHLSPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKEL 300
ESHL P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ EFDPSSTI+ SK L
Sbjct: 241 ESHLLPVAPPPPPSSPPPIPPSPPPPTSPS-SSIPNSDSSNLLCQIEFDPSSTIHCSKRL 300
Query: 301 KAFESNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPK 360
KAFE++ HLG+N PKH KHRNL+G IGL+MGSKVDNEILSDK VQ LPPSPPK
Sbjct: 301 KAFENDPVVASPSHLGNNRPKHDKHRNLEGGIGLVMGSKVDNEILSDKDYVQVLPPSPPK 360
Query: 361 PKDDKITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWM 420
PKDD+I RKIEVLC+ IA+N SSFED TR KEFGNPEF+FL+GG+PGSE+AIGHEYFLWM
Sbjct: 361 PKDDRIVRKIEVLCQLIASNDSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWM 420
Query: 421 KKKYSLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEET 480
KKKYSL CKNKEM+EK P RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 KKKYSLACKNKEMKEKFPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEET 480
Query: 481 SHSFEIQSYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSL 540
+IQSY+ KSRKEEHD KDQLQGPEDLQR S K K AE DG KLLL HEKSVS+
Sbjct: 481 GRLVQIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSV 540
Query: 541 EACQVHSPVINTAGVVEQPLGSNFEISVTCLQNEKSL----AASEAVNSSPSTELIIGGS 600
ACQVH PV +AG+ E PLG+NFE SVTC QN+K+L AA EA NSS S L+ GGS
Sbjct: 541 AACQVHIPVRISAGLSEPPLGNNFESSVTCSQNDKNLSGEVAAFEATNSSQSAALVAGGS 600
Query: 601 PFRLIQDYASDENSETDEESHLKDVSFA-ISPSTPASSKTSGKDSDNLTILGSKGSCQVQ 660
PFRLIQDY+SDENSE+DEESHLKDV F +SPSTP SSKTS KD+D LT LGSKGSCQV+
Sbjct: 601 PFRLIQDYSSDENSESDEESHLKDVRFVPVSPSTPVSSKTSDKDTDQLTNLGSKGSCQVE 660
Query: 661 RSNVPPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLD 720
S P CE SMP+ G+ FLSE PK +FDANEANVR+ GNE++ +NQ+GT TS KSL
Sbjct: 661 LSYAPTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSL- 720
Query: 721 ADAVKGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRR 780
DA+ GRSVDV+ D+ KL+KENDEEK+K GSSPVKIDEFGRLVREGGSDSDSDDS Y RR
Sbjct: 721 -DALNGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRR 780
Query: 781 HKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRG----RSRSRSPVSRR 840
HK RR R+SSESHSPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRG RSRSRSPVSRR
Sbjct: 781 HKNRRARSSSESHSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSPVSRR 840
Query: 841 TSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTS 900
T+QFNNENM+RDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHPTS
Sbjct: 841 TNQFNNENMRRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHPTS 900
Query: 901 ENIKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTKCQ 960
+NI REDT+N SR++SDLGHIKVENQ CIQHNVSPK D H W SPT DV +CQ
Sbjct: 901 KNIGSREDTMNTSRDISDLGHIKVENQECIQHNVSPKHDAHAWNTDSPT----RDVNRCQ 960
Query: 961 SSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGD 1020
SSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK SGD
Sbjct: 961 SSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFSGD 1020
Query: 1021 ISMSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAPEC 1080
IS SMLTS E S+ AQQSNM SE + ANS S M+GSFVS+LLPDQVT V+TNKAPEC
Sbjct: 1021 ISTSMLTSAENSV--AQQSNMHVSELQTANSHSRPMDGSFVSNLLPDQVTVVTTNKAPEC 1080
Query: 1081 EHFPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPP 1140
E FPDK S I QFD SSA Q P+TSQFLSESPVPK SATAPGCA DDAH LR LPPPP
Sbjct: 1081 ELFPDKTSSISEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPPPP 1140
Query: 1141 PL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFP 1200
PL S VTSA+V + PY+FVSQN SFPSK SLPGGF PHQD VSIQ S+ HST
Sbjct: 1141 PLLPHMISHVTSAEVPISAPYSFVSQNASFPSKSSLPGGFHPHQDFVSIQPSNDHSTPLL 1200
Query: 1201 PSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPVRE 1260
P R LYD +A TTKDG PMQFHQS+LSQGSDLGSQSVMKSQPL +SHS +GESP++E
Sbjct: 1201 PPRRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPLQE 1260
Query: 1261 PYRAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSM 1320
P RAP+HMDEIRSI PVA +RP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQSM
Sbjct: 1261 PCRAPMHMDEIRSITPVATDRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQSM 1320
Query: 1321 PFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRH 1380
PFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQIGT+SRH
Sbjct: 1321 PFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQIGTMSRH 1380
Query: 1381 YPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDST 1440
Y DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + S ILNFGNDAPSGDIRDST
Sbjct: 1381 YLDPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSKILNFGNDAPSGDIRDST 1440
Query: 1441 FNLSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKS 1500
FN SN RVDGQGANYVGS LTT SP STKP GK LPS+GGDQYDPLFDS+EPS PI KKS
Sbjct: 1441 FNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSAGGDQYDPLFDSMEPSSPIIKKS 1500
Query: 1501 DRVRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGA 1560
DR +KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEAGA
Sbjct: 1501 DRGQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEAGA 1560
Query: 1561 VENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMS 1620
VE+D DDE NL+GEIEIDQVKSSEKSK SKGSRSLRLFRIAIADFVKE+LKPSWRQGNMS
Sbjct: 1561 VEDDFDDEANLSGEIEIDQVKSSEKSKNSKGSRSLRLFRIAIADFVKEILKPSWRQGNMS 1620
Query: 1621 KEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
KEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVM
Sbjct: 1621 KEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVM 1645
BLAST of MS002044 vs. NCBI nr
Match:
XP_022931323.1 (uncharacterized protein LOC111437543 [Cucurbita moschata] >XP_022931325.1 uncharacterized protein LOC111437543 [Cucurbita moschata])
HSP 1 Score: 2353.6 bits (6098), Expect = 0.0e+00
Identity = 1293/1665 (77.66%), Positives = 1386/1665 (83.24%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSS 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPV GSTQ QFNS+
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVRGSTQVSQFNSN 120
Query: 121 TQQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTV 180
QQNVQLS SGVQN HHVLPPPP LPP PPP P HAP+PDL+RPPQ ST
Sbjct: 121 AQQNVQLSHSGVQNTHHVLPPPPRLPP-----------PPPRPLHAPSPDLIRPPQFSTT 180
Query: 181 VPVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLGESHL 240
VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHP+TSNFPTPP FGG+MQSNLGESHL
Sbjct: 181 VPLHPRSQGQTLYGVRINPPLQQGGLQIFPSIPQHPSTSNFPTPPSFGGLMQSNLGESHL 240
Query: 241 SPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ E DPSSTI+ SK LKAFE
Sbjct: 241 LPVAPPPPPSSPPPIPPSPPPPTSPS-SSIPNSDSSNLLCQIEVDPSSTIHCSKRLKAFE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
++ HLGDN PKH KHRNL+G IGLMMGSKVDNEILSDK VQ LPPSPPKPKDD
Sbjct: 301 NDPVVPSPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEILSDKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKY 420
+I RKIEVLC+ IA+NGSSFED TR KEFGNPEF+FL+GG+PGSE+AIGHEYFLWMKKKY
Sbjct: 361 RIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWMKKKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
SL CKNKEM+EKSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 SLACKNKEMKEKSPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEETGRLV 480
Query: 481 EIQSYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSLEACQ 540
+IQSY+ KSRKEEHD KDQLQGPEDLQR S K K AE DG KLLL HEKSVS+ ACQ
Sbjct: 481 QIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSVAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCLQNEKSL----AASEAVNSSPSTELIIGGSPFRL 600
VH PV +AG+ E PLG+NFE SVTC QN K+L AA EA NSS S L+ GGSPFRL
Sbjct: 541 VHIPVRISAGLSEPPLGNNFESSVTCSQNGKNLSGEVAAFEATNSSQSAALVAGGSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSKGSCQVQRSNV 660
IQDY+SDENSE+DEESHLKDV F A SPSTP SSKTS KD+D LT LGSKGSCQV+ S
Sbjct: 601 IQDYSSDENSESDEESHLKDVRFVAASPSTPVSSKTSDKDTDQLTNLGSKGSCQVELSYA 660
Query: 661 PPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
P CE SMP+ G+ FLSE PK +FDANEANVR+ GNE++ +NQ+GT TS KSL DA+
Sbjct: 661 PTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSL--DAL 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
GRSVDV+ D+ KL+KENDEEK+K GSSPVKIDEFGRLVREGGSDSDSDDS Y RRHK R
Sbjct: 721 NGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRRHKNR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRG------RSRSRSPVSRRTS 840
R R+SSES SPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRG RSRSRSPVSRRT+
Sbjct: 781 RARSSSESRSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSRSPVSRRTN 840
Query: 841 QFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSEN 900
QFNNENM+RDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHPTS+N
Sbjct: 841 QFNNENMRRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHPTSKN 900
Query: 901 IKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTKCQSS 960
I REDT+N SR++SDLGHIKVE Q CIQH+VSPK D H W SPT DV +CQSS
Sbjct: 901 IGSREDTMNASRDISDLGHIKVEIQECIQHSVSPKHDAHAWNTDSPT----RDVNRCQSS 960
Query: 961 RDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDIS 1020
RD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK SGDIS
Sbjct: 961 RDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFSGDIS 1020
Query: 1021 MSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAPECEH 1080
SMLTS E S+ AQQSNM SE ANS S M+GSFVS+LLPDQVT ++TNKAPECE
Sbjct: 1021 TSMLTSAENSV--AQQSNMLVSELLTANSYSRPMDGSFVSNLLPDQVTVLTTNKAPECEL 1080
Query: 1081 FPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPL 1140
FPDK S I QFD SSA Q P+TSQFLSESPVPK SATAPGCA DDAH LR LPPPPPL
Sbjct: 1081 FPDKTSSINEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPPPPPL 1140
Query: 1141 ---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPS 1200
S VTSA+V + PY+FVSQN SFPSK SLPG F PHQD VSIQ S+ HST P
Sbjct: 1141 LPHMISHVTSAEVPISAPYSFVSQNASFPSKSSLPGDFHPHQDFVSIQPSNDHSTPLLPP 1200
Query: 1201 RPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPVREPY 1260
R LYD +A TTKDG PMQFHQS+LSQGSDLGSQSVMKSQPL +SHS +GESP++EP
Sbjct: 1201 RRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPLQEPC 1260
Query: 1261 RAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPF 1320
RAP+HMDEIRSI PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQSMPF
Sbjct: 1261 RAPMHMDEIRSITPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQSMPF 1320
Query: 1321 TNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYP 1380
T+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD FL SQIGT+SRHY
Sbjct: 1321 TDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGFLRDSSQIGTMSRHYL 1380
Query: 1381 DPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFN 1440
DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPSGDIRDSTFN
Sbjct: 1381 DPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPSGDIRDSTFN 1440
Query: 1441 LSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDR 1500
SN RVDGQGANYVGS LTT SP STKP GK LPS GGDQYDPLFDS+EPS PI KKSDR
Sbjct: 1441 ASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPSSPIIKKSDR 1500
Query: 1501 VRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVE 1560
+KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEAGAVE
Sbjct: 1501 GQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEAGAVE 1560
Query: 1561 NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKE 1620
+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPSWRQGNMSKE
Sbjct: 1561 DDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPSWRQGNMSKE 1620
Query: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVM
Sbjct: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVM 1643
BLAST of MS002044 vs. NCBI nr
Match:
KAG6587592.1 (Zinc finger CCCH domain-containing protein 55, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2342.8 bits (6070), Expect = 0.0e+00
Identity = 1290/1669 (77.29%), Positives = 1383/1669 (82.86%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSS 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPV GSTQ QFNS+
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVRGSTQVSQFNSN 120
Query: 121 TQQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTV 180
QQNVQLS SGVQN HHVLPPPP LPP PPP P HAP+PDLLRPPQ ST
Sbjct: 121 AQQNVQLSHSGVQNTHHVLPPPPRLPP-----------PPPRPLHAPSPDLLRPPQFSTT 180
Query: 181 VPVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLGESHL 240
VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHP+TSNFPTPP FGG+MQSNLGESHL
Sbjct: 181 VPLHPRSQGQTLYGVRINPPLQQGGLQIFPSIPQHPSTSNFPTPPSFGGLMQSNLGESHL 240
Query: 241 SPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ E DPSSTI+ SK LKAFE
Sbjct: 241 LPVAPPPPPSSPPPIPPSPPPPTSPS-SSIPNSDSSNLLCQIEVDPSSTIHCSKRLKAFE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
++ HLGDN PKH KHRNL+G IGLMMGSKVDNEILSDK VQ LPPSPPKPKDD
Sbjct: 301 NDPVVPSPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEILSDKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKY 420
+I RKIEVLC+ IA+NGSSFED TR KEFGNPEF+FL+GG+PGSE+AIGHEYFLWMKKKY
Sbjct: 361 RIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWMKKKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
SL CKNKEM+EKSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 SLACKNKEMKEKSPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEETGRLV 480
Query: 481 EIQSYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSLEACQ 540
+IQSY+ KSRKEEHD KDQLQGPEDLQR S K K AE DG KLLL HEKSVS+ ACQ
Sbjct: 481 QIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSVAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCLQNEKSL----AASEAVNSSPSTELIIGGSPFRL 600
VH PV +AG+ E PLG+NFE SVT QN K+L AA EA NSS S L+ GGSPFRL
Sbjct: 541 VHIPVRISAGLSEPPLGNNFESSVTRSQNGKNLSGEVAAFEATNSSQSAALVAGGSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSKGSCQVQRSNV 660
IQDY+SDENSE+DEESHLKDV F A SPSTP SSKTS KD+D LT LGSKGSCQV+ S
Sbjct: 601 IQDYSSDENSESDEESHLKDVRFVAASPSTPVSSKTSDKDTDQLTNLGSKGSCQVELSYA 660
Query: 661 PPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
P CE SMP+ G+ FLSE PK +FDANEANVR+ GNE++ +NQ+GT TS KSL DA+
Sbjct: 661 PTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSL--DAL 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
GRSVDV+ D+ KL+KENDEEK+K GSSPVKIDEFGRLVREGGSDSDSDDS Y RRHK R
Sbjct: 721 NGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRRHKNR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRG----------RSRSRSPVS 840
R R+SSES SPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRG RSRSRSPVS
Sbjct: 781 RARSSSESRSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRGRGRSRSRSRSRSPVS 840
Query: 841 RRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHP 900
RRT+QFNNENM+RDKGM+RKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHP
Sbjct: 841 RRTNQFNNENMRRDKGMMRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHP 900
Query: 901 TSENIKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTK 960
TS+NI REDT+N SR++SDLGHIKVE Q CIQH+VSPK D H W SPT DV +
Sbjct: 901 TSKNIGSREDTMNASRDISDLGHIKVEIQECIQHSVSPKHDAHAWNTDSPT----RDVNR 960
Query: 961 CQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLS 1020
CQSSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK S
Sbjct: 961 CQSSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFS 1020
Query: 1021 GDISMSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAP 1080
GDIS SMLTS E S+ AQQSNM SE ANS S M+GSFVS+LLPDQVT ++TNKAP
Sbjct: 1021 GDISTSMLTSAENSV--AQQSNMLVSELLTANSYSRPMDGSFVSNLLPDQVTVLTTNKAP 1080
Query: 1081 ECEHFPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPP 1140
ECE FPDK S I QFD SSA Q P+TSQFLSESPVPK SATAPGCA DDAH LR LPP
Sbjct: 1081 ECELFPDKTSSINEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPP 1140
Query: 1141 PPPL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
PPPL S VTSA+V + PY+FV QN SFPSK SLPG F PHQD VSIQ S+ HST
Sbjct: 1141 PPPLLPHMISHVTSAEVPISAPYSFVPQNASFPSKSSLPGDFHPHQDFVSIQPSNDHSTP 1200
Query: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPV 1260
P R LYD +A TTKDG PMQFHQS+LSQGSDLGSQSVMKSQPL +SHS +GESP+
Sbjct: 1201 LLPPRRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPL 1260
Query: 1261 REPYRAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
+EP RAP+HMDEIRSI PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQ
Sbjct: 1261 QEPCRAPMHMDEIRSITPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQ 1320
Query: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
SMPFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQIGT+S
Sbjct: 1321 SMPFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQIGTMS 1380
Query: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
RHY DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPSGDIRD
Sbjct: 1381 RHYLDPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPSGDIRD 1440
Query: 1441 STFNLSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
STFN SN RVDGQGANYVGS LTT SP STKP GK LPS GGDQYDPLFDS+EPS PI K
Sbjct: 1441 STFNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPSSPIIK 1500
Query: 1501 KSDRVRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
KSDR +KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEA
Sbjct: 1501 KSDRGQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEA 1560
Query: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
GAVE+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPSWRQGN
Sbjct: 1561 GAVEDDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPSWRQGN 1620
Query: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVM
Sbjct: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVM 1647
BLAST of MS002044 vs. NCBI nr
Match:
KAG7021558.1 (Zinc finger CCCH domain-containing protein 55 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2323.9 bits (6021), Expect = 0.0e+00
Identity = 1283/1669 (76.87%), Positives = 1374/1669 (82.32%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSS 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPV GSTQ QFNS+
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVRGSTQVSQFNSN 120
Query: 121 TQQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTV 180
QQNVQLS SGVQN HHVLPPPP LPP PPP P HAP+PDLLRPPQ ST
Sbjct: 121 AQQNVQLSHSGVQNTHHVLPPPPRLPP-----------PPPRPLHAPSPDLLRPPQFSTT 180
Query: 181 VPVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLGESHL 240
VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHP+TSNFPTPP FGG+MQSNLGESHL
Sbjct: 181 VPLHPRSQGQTLYGVRINPPLQQGGLQIFPSIPQHPSTSNFPTPPSFGGLMQSNLGESHL 240
Query: 241 SPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ E DPSSTI+ SK LKAFE
Sbjct: 241 LPVAPPPPPSSPPPIPPSPPPPTSPS-SSIPNSDSSNLLCQIEVDPSSTIHCSKRLKAFE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
++ HLGDN PKH KHRNL+G IGLMMGSKVDNEILSDK VQ LPPSPPKPKDD
Sbjct: 301 NDPVVPSPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEILSDKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKY 420
+I RKIEVLC+ IA+NGSSFED TR KEFGNPEF+FL+GG+PGSE+AIGHEYFLWMKKKY
Sbjct: 361 RIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWMKKKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
SL CKNKEM+EKSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 SLACKNKEMKEKSPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEETGRLV 480
Query: 481 EIQSYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSLEACQ 540
+IQSY+ KSRKEEHD KDQLQGPEDLQR G KLLL HEKSV + ACQ
Sbjct: 481 QIQSYKRKSRKEEHDVKDQLQGPEDLQRY------------GGPKLLLGHEKSVCVAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCLQNEKSL----AASEAVNSSPSTELIIGGSPFRL 600
VH PV +AG+ E PLG+NFE SVTC QN K+L AA EA NSS GSPFRL
Sbjct: 541 VHIPVRISAGLSEPPLGNNFESSVTCSQNGKNLSGEVAAFEATNSS--------GSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSKGSCQVQRSNV 660
IQDY+SDENSE+DEESHLKDV F A SPSTP SSKTS KD+D LT LGSKGSCQV+ S
Sbjct: 601 IQDYSSDENSESDEESHLKDVRFVAASPSTPVSSKTSDKDTDQLTNLGSKGSCQVELSYA 660
Query: 661 PPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
P CE SMP+ G+ FLSE PK +FDANEANVR+ GNE++ +NQ+GT TS KSL DA+
Sbjct: 661 PTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSL--DAL 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
GRSVDV+ D+ KL+KENDEEK+K GSSPVKIDEFGRLVREGGSDSDSDDS Y RRHK R
Sbjct: 721 NGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRRHKNR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRG----------RSRSRSPVS 840
R R+SSES SPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRG RSRSRSPVS
Sbjct: 781 RARSSSESRSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRGRGRSRSRSRSRSPVS 840
Query: 841 RRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHP 900
RRT+QFNNENM+RDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHP
Sbjct: 841 RRTNQFNNENMRRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHP 900
Query: 901 TSENIKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTK 960
TS+NI REDT+N SR++SDLGHIKVE Q CIQH+VSPK D H W SPT DV +
Sbjct: 901 TSKNIGSREDTMNASRDISDLGHIKVEIQECIQHSVSPKHDAHAWNTDSPT----RDVNR 960
Query: 961 CQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLS 1020
CQSSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK S
Sbjct: 961 CQSSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFS 1020
Query: 1021 GDISMSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAP 1080
GDIS SMLTS E S+ AQQSNM SE ANS S M+GSFVS+LLPDQVT ++TNKAP
Sbjct: 1021 GDISTSMLTSAENSV--AQQSNMLVSELLTANSYSRPMDGSFVSNLLPDQVTVLTTNKAP 1080
Query: 1081 ECEHFPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPP 1140
ECE FPDK S I QFD SSA Q P+TSQFLSESPVPK SATAPGCA DDAH LR LPP
Sbjct: 1081 ECELFPDKTSSINEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPP 1140
Query: 1141 PPPL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
PPPL S VTSA+V + PY+FVSQN SFPSK SLPG F PHQD VSIQ S+ HST
Sbjct: 1141 PPPLLPHMISHVTSAEVPISAPYSFVSQNASFPSKSSLPGDFHPHQDFVSIQPSNDHSTP 1200
Query: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPV 1260
P R LYD +A TTKDG PMQFHQS+LSQGSDLGSQSVMKSQPL +SHS +GESP+
Sbjct: 1201 LLPPRRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPL 1260
Query: 1261 REPYRAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
+EP RAP+HMDEIRSI PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQ
Sbjct: 1261 QEPCRAPMHMDEIRSITPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQ 1320
Query: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
SMPFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQIGT+S
Sbjct: 1321 SMPFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQIGTMS 1380
Query: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
RHY DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPSGDIRD
Sbjct: 1381 RHYLDPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPSGDIRD 1440
Query: 1441 STFNLSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
STFN SN RVDGQGANYVGS LTT SP STKP GK LPS GGDQYDPLFDS+EPS PI K
Sbjct: 1441 STFNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPSSPIIK 1500
Query: 1501 KSDRVRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
KSDR +KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEA
Sbjct: 1501 KSDRGQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEA 1560
Query: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
GAVE+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPSWRQGN
Sbjct: 1561 GAVEDDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPSWRQGN 1620
Query: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVM
Sbjct: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVM 1629
BLAST of MS002044 vs. ExPASy TrEMBL
Match:
A0A6J1C4H9 (uncharacterized protein LOC111007314 OS=Momordica charantia OX=3673 GN=LOC111007314 PE=4 SV=1)
HSP 1 Score: 3132.4 bits (8120), Expect = 0.0e+00
Identity = 1625/1669 (97.36%), Positives = 1637/1669 (98.08%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP
Sbjct: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST
Sbjct: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
Query: 121 QQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLSQSGVQNMHH+LPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV
Sbjct: 121 QQNVQLSQSGVQNMHHILPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
Query: 181 PVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
PVHPPSQGQTLYG RVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP
Sbjct: 181 PVHPPSQGQTLYGARVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
Query: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN
Sbjct: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
Query: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT
Sbjct: 301 QGGTPTRHLGDNGPKHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKIT 360
Query: 361 RKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKYSLD 420
RKI VLCKYIANNGSSFEDTTR+KEFGNPEFEFLYGG+PGSEAAIGHEYFLWMKKKYSLD
Sbjct: 361 RKIGVLCKYIANNGSSFEDTTRQKEFGNPEFEFLYGGEPGSEAAIGHEYFLWMKKKYSLD 420
Query: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFEIQ 480
CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF+IQ
Sbjct: 421 CKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFKIQ 480
Query: 481 SYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAE---------------------EKDG 540
SYECKSRKEEHDAKDQLQGP+DLQRSSPVKGKVAE EKDG
Sbjct: 481 SYECKSRKEEHDAKDQLQGPKDLQRSSPVKGKVAEVPQFLSIQPSCAMQRGFWTNLEKDG 540
Query: 541 ESKLLLDHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCLQNEKSLAASEAVNSS 600
ESKLLL+HEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTC+QNEKSLAASEAVNSS
Sbjct: 541 ESKLLLEHEKSVSLEACQVHSPVINTAGVVEQPLGSNFEISVTCIQNEKSLAASEAVNSS 600
Query: 601 PSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
STELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL
Sbjct: 601 LSTELIIGGSPFRLIQDYASDENSETDEESHLKDVSFAISPSTPASSKTSGKDSDNLTIL 660
Query: 661 GSKGSCQVQRSNVPPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVG 720
GS+GSCQVQRSNVPPCEASMPDFGSQFLSESPK IFDANEANVRRAGNERNYKIHQNQVG
Sbjct: 661 GSEGSCQVQRSNVPPCEASMPDFGSQFLSESPKLIFDANEANVRRAGNERNYKIHQNQVG 720
Query: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSD 780
TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEK KFGSSPVKIDEFGRLVREGGSDSD
Sbjct: 721 TRTSSKSLDADAVKGRSVDVLHDSHKLQKENDEEKQKFGSSPVKIDEFGRLVREGGSDSD 780
Query: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS
Sbjct: 781 SDDSHYTRRHKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRS 840
Query: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD
Sbjct: 841 PVSRRTSQFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHD 900
Query: 901 VHPTSENIKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLD 960
VHPTSENIKGREDTVNMSREVSD GHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDP LD
Sbjct: 901 VHPTSENIKGREDTVNMSREVSDPGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPDLD 960
Query: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE
Sbjct: 961 VTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTE 1020
Query: 1021 KLSGDISMSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTN 1080
KLSGDISMSMLTSVE+SLAHAQQSNMFASEFEAANSVSHQM+GSFVSHLLPDQVTAVSTN
Sbjct: 1021 KLSGDISMSMLTSVEKSLAHAQQSNMFASEFEAANSVSHQMDGSFVSHLLPDQVTAVSTN 1080
Query: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
KAPECEHFPDKNSLIKLQFDTSSAGQQPST QFLSESPVPKSLSATAPGCAMDDAHPLRE
Sbjct: 1081 KAPECEHFPDKNSLIKLQFDTSSAGQQPSTLQFLSESPVPKSLSATAPGCAMDDAHPLRE 1140
Query: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT
Sbjct: 1141 LPPPPPLPTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTT 1200
Query: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPV 1260
FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSD GSQSVMKSQPLV+NSHSMLGESPV
Sbjct: 1201 FPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDRGSQSVMKSQPLVTNSHSMLGESPV 1260
Query: 1261 REPYRAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
REPYRAPLHMDEIRS APVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ
Sbjct: 1261 REPYRAPLHMDEIRSTAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQ 1320
Query: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS
Sbjct: 1321 SMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTIS 1380
Query: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD
Sbjct: 1381 RHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRD 1440
Query: 1441 STFNLSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
STFNLSNVRVDGQGANY GSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK
Sbjct: 1441 STFNLSNVRVDGQGANYFGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITK 1500
Query: 1501 KSDRVRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
KSDR+RKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA
Sbjct: 1501 KSDRIRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEA 1560
Query: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN
Sbjct: 1561 GAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGN 1620
Query: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM
Sbjct: 1621 MSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1669
BLAST of MS002044 vs. ExPASy TrEMBL
Match:
A0A6J1EZ49 (uncharacterized protein LOC111437543 OS=Cucurbita moschata OX=3662 GN=LOC111437543 PE=4 SV=1)
HSP 1 Score: 2353.6 bits (6098), Expect = 0.0e+00
Identity = 1293/1665 (77.66%), Positives = 1386/1665 (83.24%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSS 120
QAGQPLHLSQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPV GSTQ QFNS+
Sbjct: 61 QAGQPLHLSQSGSHGPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVRGSTQVSQFNSN 120
Query: 121 TQQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTV 180
QQNVQLS SGVQN HHVLPPPP LPP PPP P HAP+PDL+RPPQ ST
Sbjct: 121 AQQNVQLSHSGVQNTHHVLPPPPRLPP-----------PPPRPLHAPSPDLIRPPQFSTT 180
Query: 181 VPVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLGESHL 240
VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHP+TSNFPTPP FGG+MQSNLGESHL
Sbjct: 181 VPLHPRSQGQTLYGVRINPPLQQGGLQIFPSIPQHPSTSNFPTPPSFGGLMQSNLGESHL 240
Query: 241 SPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFE 300
P+APPPPPSSPPPIPPSPPPPTSPS SIP+S SSNLLCQ E DPSSTI+ SK LKAFE
Sbjct: 241 LPVAPPPPPSSPPPIPPSPPPPTSPS-SSIPNSDSSNLLCQIEVDPSSTIHCSKRLKAFE 300
Query: 301 SNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDD 360
++ HLGDN PKH KHRNL+G IGLMMGSKVDNEILSDK VQ LPPSPPKPKDD
Sbjct: 301 NDPVVPSPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEILSDKDYVQVLPPSPPKPKDD 360
Query: 361 KITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKY 420
+I RKIEVLC+ IA+NGSSFED TR KEFGNPEF+FL+GG+PGSE+AIGHEYFLWMKKKY
Sbjct: 361 RIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWMKKKY 420
Query: 421 SLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSF 480
SL CKNKEM+EKSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 SLACKNKEMKEKSPSRSLSIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEETGRLV 480
Query: 481 EIQSYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSLEACQ 540
+IQSY+ KSRKEEHD KDQLQGPEDLQR S K K AE DG KLLL HEKSVS+ ACQ
Sbjct: 481 QIQSYKRKSRKEEHDVKDQLQGPEDLQRCSREKEKEAE--DGGPKLLLGHEKSVSVAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCLQNEKSL----AASEAVNSSPSTELIIGGSPFRL 600
VH PV +AG+ E PLG+NFE SVTC QN K+L AA EA NSS S L+ GGSPFRL
Sbjct: 541 VHIPVRISAGLSEPPLGNNFESSVTCSQNGKNLSGEVAAFEATNSSQSAALVAGGSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSKGSCQVQRSNV 660
IQDY+SDENSE+DEESHLKDV F A SPSTP SSKTS KD+D LT LGSKGSCQV+ S
Sbjct: 601 IQDYSSDENSESDEESHLKDVRFVAASPSTPVSSKTSDKDTDQLTNLGSKGSCQVELSYA 660
Query: 661 PPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
P CE SMP+ G+ FLSE PK +FDANEANVR+ GNE++ +NQ+GT TS KSL DA+
Sbjct: 661 PTCEYSMPESGAHFLSEPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSL--DAL 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
GRSVDV+ D+ KL+KENDEEK+K GSSPVKIDEFGRLVREGGSDSDSDDS Y RRHK R
Sbjct: 721 NGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRRHKNR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRG------RSRSRSPVSRRTS 840
R R+SSES SPVDRRRGRRSPWRRR+RRSRSRSWSPRNQRG RSRSRSPVSRRT+
Sbjct: 781 RARSSSESRSPVDRRRGRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSRSPVSRRTN 840
Query: 841 QFNNENMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSEN 900
QFNNENM+RDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR RSKHHDVHPTS+N
Sbjct: 841 QFNNENMRRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLQRSKHHDVHPTSKN 900
Query: 901 IKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTKCQSS 960
I REDT+N SR++SDLGHIKVE Q CIQH+VSPK D H W SPT DV +CQSS
Sbjct: 901 IGSREDTMNASRDISDLGHIKVEIQECIQHSVSPKHDAHAWNTDSPT----RDVNRCQSS 960
Query: 961 RDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDIS 1020
RD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNADTEK SGDIS
Sbjct: 961 RDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNADTEKFSGDIS 1020
Query: 1021 MSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAPECEH 1080
SMLTS E S+ AQQSNM SE ANS S M+GSFVS+LLPDQVT ++TNKAPECE
Sbjct: 1021 TSMLTSAENSV--AQQSNMLVSELLTANSYSRPMDGSFVSNLLPDQVTVLTTNKAPECEL 1080
Query: 1081 FPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPL 1140
FPDK S I QFD SSA Q P+TSQFLSESPVPK SATAPGCA DDAH LR LPPPPPL
Sbjct: 1081 FPDKTSSINEQFDASSASQPPTTSQFLSESPVPKQFSATAPGCANDDAHSLRALPPPPPL 1140
Query: 1141 ---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPS 1200
S VTSA+V + PY+FVSQN SFPSK SLPG F PHQD VSIQ S+ HST P
Sbjct: 1141 LPHMISHVTSAEVPISAPYSFVSQNASFPSKSSLPGDFHPHQDFVSIQPSNDHSTPLLPP 1200
Query: 1201 RPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPVREPY 1260
R LYD +A TTKDG PMQFHQS+LSQGSDLGSQSVMKSQPL +SHS +GESP++EP
Sbjct: 1201 RRLYDSALAPTTTKDGMPMQFHQSNLSQGSDLGSQSVMKSQPLELHSHSKIGESPLQEPC 1260
Query: 1261 RAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPF 1320
RAP+HMDEIRSI PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP RNFNDQSMPF
Sbjct: 1261 RAPMHMDEIRSITPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRRNFNDQSMPF 1320
Query: 1321 TNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYP 1380
T+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD FL SQIGT+SRHY
Sbjct: 1321 TDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGFLRDSSQIGTMSRHYL 1380
Query: 1381 DPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFN 1440
DP RNH SL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPSGDIRDSTFN
Sbjct: 1381 DPSIRNHPSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPSGDIRDSTFN 1440
Query: 1441 LSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDR 1500
SN RVDGQGANYVGS LTT SP STKP GK LPS GGDQYDPLFDS+EPS PI KKSDR
Sbjct: 1441 ASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPSSPIIKKSDR 1500
Query: 1501 VRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVE 1560
+KLEK RESHM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGETADAEAGAVE
Sbjct: 1501 GQKLEKTRESHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGETADAEAGAVE 1560
Query: 1561 NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKE 1620
+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPSWRQGNMSKE
Sbjct: 1561 DDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPSWRQGNMSKE 1620
Query: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVM
Sbjct: 1621 AFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVM 1643
BLAST of MS002044 vs. ExPASy TrEMBL
Match:
A0A6J1KND4 (serine/arginine repetitive matrix protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC111495732 PE=4 SV=1)
HSP 1 Score: 2321.2 bits (6014), Expect = 0.0e+00
Identity = 1279/1674 (76.40%), Positives = 1379/1674 (82.38%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG NY SQFGQGPQKPWPPAYQQRAVAPPPPPPPTSY+QPGPPIPSRPITQQ PAP
Sbjct: 1 MYGQPNYGSQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYIQPGPPIPSRPITQQPPAPQL 60
Query: 61 QAGQPLHLSQSGPH-----VPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQ 120
QAGQPL++SQSG H PPPPLCQ PS+QVL GGI NI QTYFHTFPPVHGSTQ Q
Sbjct: 61 QAGQPLYMSQSGSHGPPPPPPPPPLCQRPSIQVLSGGITNIHQTYFHTFPPVHGSTQVSQ 120
Query: 121 FNSSTQQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQ 180
FNS+ +QSGVQN HHVLPPPP LPP PPP P HAP+PDLLRPPQ
Sbjct: 121 FNSN-------AQSGVQNTHHVLPPPPLLPP-----------PPPRPLHAPSPDLLRPPQ 180
Query: 181 PSTVVPVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPP-FGGVMQSNLG 240
ST+VP+HP SQGQTLYG R++PPLQQGGLQ+FPSIPQHPTTSNFPTPP FGG+MQSNLG
Sbjct: 181 FSTIVPLHPRSQGQTLYGARINPPLQQGGLQIFPSIPQHPTTSNFPTPPSFGGLMQSNLG 240
Query: 241 ESHLSPMAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKEL 300
E HL P+APPPPPS PPPIPPSPPPPTSPS SIP+S SSNLLCQ EFDPSSTI+ SK L
Sbjct: 241 EPHLLPVAPPPPPSYPPPIPPSPPPPTSPS-SSIPNSDSSNLLCQIEFDPSSTIHCSKRL 300
Query: 301 KAFESNQGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPK 360
KAFE++ HLGDN PKH KHRNL+G IGLMMGSKVDNEI SDK VQ LPPSPPK
Sbjct: 301 KAFENDPVVASPSHLGDNRPKHDKHRNLEGGIGLMMGSKVDNEIFSDKDYVQVLPPSPPK 360
Query: 361 PKDDKITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWM 420
PKDD+I RKIEVLC+ IA+NGSSFED TR KEFGNPEF+FL+GG+PGSE+AIGHEYFLWM
Sbjct: 361 PKDDRIVRKIEVLCQLIASNGSSFEDATRHKEFGNPEFQFLFGGEPGSESAIGHEYFLWM 420
Query: 421 KKKYSLDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEET 480
KKKYSL CKNKEM+ KSP RSL I PQSE LTVSAASISP NSDMEM DDITP GEET
Sbjct: 421 KKKYSLACKNKEMKAKSPSRSLGIEPQSEYLTVSAASISPANSDMEMGDDITPAARGEET 480
Query: 481 SHSFEIQSYECKSRKEEHDAKDQLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSL 540
H +IQSY+ KSRKEE+D KDQLQGPED QR S + K E +DG KLLL HEKSVS
Sbjct: 481 GHLVQIQSYKRKSRKEEYDVKDQLQGPEDSQRCS--REKEIEAEDGGPKLLLGHEKSVSA 540
Query: 541 EACQVHSPVINTAGVVEQPLGSNFEISVTCLQNEKSL----AASEAVNSSPSTELIIGGS 600
ACQVH P +AG+ E LG+NFE SVTC QN+K+L AA EA NSS S L+ GGS
Sbjct: 541 AACQVHIPDRISAGLSEPALGNNFESSVTCSQNDKNLSGEVAAFEATNSSQSAALVAGGS 600
Query: 601 PFRLIQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSKGSCQVQ 660
PFRLIQDY+SDENSE+DEESHLKDV F A+SPSTP SSKTS K +D LT LGSKGSCQV+
Sbjct: 601 PFRLIQDYSSDENSESDEESHLKDVRFVAVSPSTPVSSKTSDKYTDQLTNLGSKGSCQVE 660
Query: 661 RSNVPPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLD 720
S P CE SMP+ G+ FLS PK +FDANEANVR+ GNE++ +NQ+GT TS KSL
Sbjct: 661 LSYAPTCEHSMPESGAHFLSGPPKLVFDANEANVRKTGNEQSCNNQRNQIGTSTSPKSL- 720
Query: 721 ADAVKGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRR 780
DA+ GRSVDV+ D+ KL+KENDEEK+K GSSPVKIDEFGRLVREGGSDSDSDDS Y RR
Sbjct: 721 -DALNGRSVDVVQDTDKLRKENDEEKVKLGSSPVKIDEFGRLVREGGSDSDSDDSLYIRR 780
Query: 781 HKKRRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRG----------RSRSR 840
HK RR R+SSESHSPVD RRGRRSPWRRR+RRSRSRSWSPRNQRG RSRSR
Sbjct: 781 HKNRRARSSSESHSPVD-RRGRRSPWRRRERRSRSRSWSPRNQRGRGRSRSRSRSRSRSR 840
Query: 841 SPVSRRTSQFNNENMKRDKG-MIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKH 900
SPVSRRT+QFNNENM+RDKG MIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSR HRSKH
Sbjct: 841 SPVSRRTNQFNNENMRRDKGIMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRLHRSKH 900
Query: 901 HDVHPTSENIKGREDTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPG 960
HDVHPTS+NIK REDT+N SR++SDLGHIKVENQ CIQHNVSPK D H W SPT
Sbjct: 901 HDVHPTSKNIKSREDTMNTSRDISDLGHIKVENQECIQHNVSPKHDAHAWNTDSPT---- 960
Query: 961 LDVTKCQSSRDRAGLVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNAD 1020
DV +CQSSRD LV+E+LI SK A AVHIHVN N QE KSYEQ SV A+SQCMSNAD
Sbjct: 961 RDVHRCQSSRDGTSLVEEDLINSKPAGAVHIHVNNNGQETEKSYEQCSVVASSQCMSNAD 1020
Query: 1021 TEKLSGDISMSMLTSVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVS 1080
TEK SGDIS SMLTS E S+ AQQSNM SE + ANS S M+GSF+S+LLPDQVT V+
Sbjct: 1021 TEKFSGDISTSMLTSAENSV--AQQSNMLVSELQTANSYSRPMDGSFISNLLPDQVTVVT 1080
Query: 1081 TNKAPECEHFPDKNSLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPL 1140
TNKAPECE FPDK S I QFD SSA Q P TSQFLSESP+PK SATAPGCA DDAH L
Sbjct: 1081 TNKAPECELFPDKTSSINEQFDASSASQPPMTSQFLSESPIPKQFSATAPGCANDDAHSL 1140
Query: 1141 RELPPPPPL---PTSCVTSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSH 1200
R LPPPPPL TS V A+V + PY+FVSQN SFPSK SLPGGF PHQD VSIQ S+
Sbjct: 1141 RALPPPPPLLPHMTSHVNGAEVPISAPYSFVSQNASFPSKSSLPGGFHPHQDFVSIQPSN 1200
Query: 1201 YHSTTFPPSRPLYDPTMAHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSML 1260
HST P R LYD T+A TTKDGTPMQFHQS+LSQGSDLGSQSVMKSQPL +S S +
Sbjct: 1201 DHSTPLLPPRRLYDSTLAPTTTKDGTPMQFHQSNLSQGSDLGSQSVMKSQPLELHSRSKI 1260
Query: 1261 GESPVREPYRAPLHMDEIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHR 1320
GESP++EP R P+HMDEIRS PVA NRP PFGFPSF EENFGRTSVEM+SSSFFP R
Sbjct: 1261 GESPLQEPCRGPMHMDEIRSSTPVATNRPSLPFGFPSFSNEENFGRTSVEMNSSSFFPRR 1320
Query: 1321 NFNDQSMPFTNANRMQSSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQ 1380
NFNDQSMPFT+ANRMQ S DNFPPSEFRSSFSQFH YSRFQQP YASQ AHD L SQ
Sbjct: 1321 NFNDQSMPFTDANRMQFSDDNFPPSEFRSSFSQFHPYSRFQQPFYASQPAHDGLLRDSSQ 1380
Query: 1381 IGTISRHYPDPLSRNHSSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPS 1440
IGT+SRHYPDP RNHSSL PDF GLG+TTYHNPYASTF+KPLSS + SNILNFGNDAPS
Sbjct: 1381 IGTMSRHYPDPSIRNHSSLPPDFRGLGVTTYHNPYASTFEKPLSSTYSSNILNFGNDAPS 1440
Query: 1441 GDIRDSTFNLSNVRVDGQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPS 1500
GDIRDSTFN SN RVDGQGANYVGS LTT SP STKP GK LPS GGDQYDPLFDS+EPS
Sbjct: 1441 GDIRDSTFNASNARVDGQGANYVGSRLTTASPNSTKPLGKLLPSPGGDQYDPLFDSMEPS 1500
Query: 1501 PPITKKSDRVRKLEKARESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGET 1560
PI +KSDR +KLEK RE HM TRLG SHKL DVEENNKHKEV AVASTTSL+NDEFGET
Sbjct: 1501 SPIIRKSDRGQKLEKTREYHMTTRLGSSHKLLDVEENNKHKEVVAVASTTSLDNDEFGET 1560
Query: 1561 ADAEAGAVENDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPS 1620
ADAEAGAVE+D DDE NL+GEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKE+LKPS
Sbjct: 1561 ADAEAGAVEDDFDDEANLSGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEILKPS 1620
Query: 1621 WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQ+KLTKLVM
Sbjct: 1621 WRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYIDSSQQKLTKLVM 1644
BLAST of MS002044 vs. ExPASy TrEMBL
Match:
A0A5A7UQ65 (Serine/arginine repetitive matrix protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G003060 PE=4 SV=1)
HSP 1 Score: 2275.4 bits (5895), Expect = 0.0e+00
Identity = 1259/1657 (75.98%), Positives = 1370/1657 (82.68%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG ANYASQFGQGPQKPWPPAYQQRA APPPPPPPTSY+QPGPPIPS P+TQQAPAPPP
Sbjct: 1 MYGQANYASQFGQGPQKPWPPAYQQRAGAPPPPPPPTSYVQPGPPIPSHPVTQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QA QPLHLSQ G H PPPP CQGPS+QVLPGGI NIR YFHTFPP HG+TQ FNS+
Sbjct: 61 QA-QPLHLSQPGSHGPPPPFCQGPSIQVLPGGITNIR-PYFHTFPPAHGNTQVSVFNSNA 120
Query: 121 QQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLS SG QNMHHVLPPPPPLPPPPP PPPPP APNPDLLRPPQPSTV
Sbjct: 121 QQNVQLSHSGAQNMHHVLPPPPPLPPPPP--------PPPPPSQAPNPDLLRPPQPSTVG 180
Query: 181 PVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
+HPPSQGQ YG H PLQQGGLQVFPSIP HPTTS FPTP + LG+SHL P
Sbjct: 181 SLHPPSQGQAFYGALTHQPLQQGGLQVFPSIPPHPTTSTFPTP-----SSNFLGDSHLLP 240
Query: 241 MAPPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFESN 300
MAPPPPPSSPPPIPPSPPPPTSPS SIP SSNL S PSST++ SK+LK E +
Sbjct: 241 MAPPPPPSSPPPIPPSPPPPTSPS-PSIPHPDSSNLSHGSHLGPSSTVHYSKDLKPSEID 300
Query: 301 QGGTPTRHLGDNGPKH-KHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDKI 360
QGG P HLGDNGPKH +H NL+ GLM+ SKVDNEILSDK VQ LPPSPPKPKDD+I
Sbjct: 301 QGGAPPSHLGDNGPKHEEHGNLEVGSGLMV-SKVDNEILSDKDYVQVLPPSPPKPKDDRI 360
Query: 361 TRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKYSL 420
+KIEVLC+ IA+NG SFEDTTR+KEFGNPEF+FL+GG+PGSE+AI HEYFL MK KYSL
Sbjct: 361 VKKIEVLCQLIADNGPSFEDTTRQKEFGNPEFDFLFGGEPGSESAIAHEYFLRMKMKYSL 420
Query: 421 DCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFEI 480
KN E+ EKSP+R LRI PQSE+LT SAAS+SP NSDMEMEDDIT I E TSH F I
Sbjct: 421 ASKNIEITEKSPLRYLRIEPQSENLTASAASLSPANSDMEMEDDITVADIEEGTSHLFGI 480
Query: 481 QSYECKSRKEEHDAKD--QLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSLEACQ 540
QSYECK RKEEHDA+D QLQ PE L+ SP K KVAE DG KLLL+HEKS S+ ACQ
Sbjct: 481 QSYECKPRKEEHDARDLVQLQKPEVLRSCSPEKEKVAE--DGGPKLLLNHEKSGSIAACQ 540
Query: 541 VHSPVINTAGVVEQPLGSNFEISVTCLQNEKSL----AASEAVNSSPSTELIIGGSPFRL 600
VHSPV +TAGV P G++FE S+ LQN+K L A+S A SS ST LI GGSPFRL
Sbjct: 541 VHSPVRSTAGVAGHPPGNDFENSLISLQNDKGLAGEVASSAATISSQSTALITGGSPFRL 600
Query: 601 IQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSKGSCQVQRSNV 660
IQDYASDENSE+DE+SH DV F AISPSTPA SKTSGKD+ +LT LGSKGSCQVQ S V
Sbjct: 601 IQDYASDENSESDEDSHHTDVHFVAISPSTPAYSKTSGKDTGDLTTLGSKGSCQVQWSYV 660
Query: 661 PPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADAV 720
PPCE SMP+ G+QF SESPKQ+ DA EANV++ GNE++Y NQ+ T T +KSLDA V
Sbjct: 661 PPCEFSMPEPGAQFHSESPKQVIDATEANVQKTGNEQSYNDQHNQIDTVTGTKSLDAMNV 720
Query: 721 KGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKKR 780
RSVDV D+ KLQKEND EK + GSSP+KIDEFGRLVREGGSDSDSDD HY RRHK R
Sbjct: 721 --RSVDVPQDTDKLQKENDAEKGRLGSSPIKIDEFGRLVREGGSDSDSDDLHYRRRHKSR 780
Query: 781 RTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRSPVSRRTSQFNNEN 840
R+RNSSES SPVDRRRGRRSP RRR+RRSRSRSWSPRNQ R RSRSPV RRTSQF+NEN
Sbjct: 781 RSRNSSESRSPVDRRRGRRSPRRRRERRSRSRSWSPRNQ--RDRSRSPVGRRTSQFSNEN 840
Query: 841 MKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSENIKGRED 900
+RDKGM+RKCFDFQRGRCYRGASCRYVHHEP+KNDG R HRSKHHDVHPTS+NIK RED
Sbjct: 841 KRRDKGMVRKCFDFQRGRCYRGASCRYVHHEPNKNDGPRFHRSKHHDVHPTSKNIKIRED 900
Query: 901 TVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTKCQSSRDRAGL 960
T+NMSREVSDLGH KVENQ I HNVSPK DTHDWK SPTGDP VTKCQSS DR GL
Sbjct: 901 TMNMSREVSDLGHTKVENQESILHNVSPKKDTHDWKTDSPTGDPDSFVTKCQSSSDRTGL 960
Query: 961 VQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDISMSMLTS 1020
VQ+ LI S+ AEA+H+H N++ QEA K YEQ SVTA+SQCM NADTEKLSGDISMS LTS
Sbjct: 961 VQDALICSEPAEAIHVHANDDGQEAKKCYEQPSVTASSQCMGNADTEKLSGDISMSTLTS 1020
Query: 1021 VERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAPECEHFPDKNS 1080
VE S+ AQQSN F +E +++N +SHQM+GSFVS+LLPDQVTAV++NKAPECEHF D+ S
Sbjct: 1021 VENSV--AQQSNTFVAELQSSNDLSHQMDGSFVSNLLPDQVTAVTSNKAPECEHFTDRTS 1080
Query: 1081 LIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPLPTSCVT 1140
IK QFDTSSA Q P TSQ LSESPVPK SATAP A DDAH L ELPPPPPL S V+
Sbjct: 1081 SIKPQFDTSSAIQLPLTSQILSESPVPKPYSATAPVSATDDAHSLTELPPPPPLIISHVS 1140
Query: 1141 SADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPSRPLYDPTMA 1200
SA++ MP PYNFVSQN+SFP SLP GF PH +VSIQ SHY ST+ P +PLY+ ++A
Sbjct: 1141 SAEISMPAPYNFVSQNLSFPPNSSLPIGFHPHHGMVSIQPSHYQSTSLLPPKPLYN-SLA 1200
Query: 1201 HVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPVREPYRA-PLHMDE 1260
VTT G PMQFHQSHLSQG DLGSQS M SQPL +SHS LGESPV+EPYRA P+H+DE
Sbjct: 1201 PVTTNAGMPMQFHQSHLSQGRDLGSQSAMSSQPLELHSHSKLGESPVQEPYRAPPMHLDE 1260
Query: 1261 IRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPFTNANRMQS 1320
IRSIAPVANNRP QPFGFPSFQ EEN GRTSVEM+SSSFFP RNF+D SMP TNANRMQ
Sbjct: 1261 IRSIAPVANNRPTQPFGFPSFQNEENHGRTSVEMNSSSFFPQRNFSDHSMPATNANRMQP 1320
Query: 1321 SGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYPDPLSRNHS 1380
SGDNFPP+EFRSSFSQF YSRFQQPLY SQ AHDS PSQIG+ISRHYPDPLSR+H
Sbjct: 1321 SGDNFPPTEFRSSFSQFQPYSRFQQPLYTSQPAHDSLFRDPSQIGSISRHYPDPLSRSHP 1380
Query: 1381 SLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFNLSNVRVDG 1440
SLLP++GGLGITTYHNPYASTF+KPLSS+FRSN LNFGNDAPSGDI STFN+S+V +DG
Sbjct: 1381 SLLPEYGGLGITTYHNPYASTFEKPLSSSFRSNFLNFGNDAPSGDICSSTFNMSSVHIDG 1440
Query: 1441 QGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDRVRKLEKAR 1500
QG NYVGS T SP STKP GK L + GDQYDPLFDSIEPS PITKKSDR +KL+KAR
Sbjct: 1441 QGTNYVGSRQTVASPNSTKPLGKLLSGTDGDQYDPLFDSIEPSSPITKKSDRGQKLKKAR 1500
Query: 1501 ESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVENDLDDEEN 1560
ES M RLGGSHKL DVEENNKHKEVAAV STTSLENDEFGET DAEAGAVENDLDDE N
Sbjct: 1501 ESDTMARLGGSHKLLDVEENNKHKEVAAVTSTTSLENDEFGETGDAEAGAVENDLDDEAN 1560
Query: 1561 LTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKK 1620
L+GEIEIDQVKSSEKSKKSKGSRSL+LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKK
Sbjct: 1561 LSGEIEIDQVKSSEKSKKSKGSRSLKLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKK 1620
Query: 1621 TVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
TVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM
Sbjct: 1621 TVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1631
BLAST of MS002044 vs. ExPASy TrEMBL
Match:
A0A0A0LRV0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G014360 PE=4 SV=1)
HSP 1 Score: 2222.6 bits (5758), Expect = 0.0e+00
Identity = 1240/1658 (74.79%), Positives = 1367/1658 (82.45%), Query Frame = 0
Query: 1 MYGPANYASQFGQGPQKPWPPAYQQRAVAPPPPPPPTSYMQPGPPIPSRPITQQAPAPPP 60
MYG ANYASQFGQGP KPWPPAYQQRA APPPPPPPTSY+QPGPPIPS PITQQAPAPPP
Sbjct: 1 MYGQANYASQFGQGPPKPWPPAYQQRAGAPPPPPPPTSYVQPGPPIPSHPITQQAPAPPP 60
Query: 61 QAGQPLHLSQSGPHVPPPPLCQGPSVQVLPGGIPNIRQTYFHTFPPVHGSTQGFQFNSST 120
QA QPLHLSQ G H P PP CQGPS+QVLPGGI NIR YFHTFPPVHG+TQ FNS+
Sbjct: 61 QA-QPLHLSQPGSHGPLPPFCQGPSIQVLPGGITNIR-PYFHTFPPVHGNTQVSVFNSNA 120
Query: 121 QQNVQLSQSGVQNMHHVLPPPPPLPPPPPPPPPHAPNPPPPPPHAPNPDLLRPPQPSTVV 180
QQNVQLS SGVQNMHHVLPPPPPLP PPPPP PPPPP APNPDLLRPPQPSTV
Sbjct: 121 QQNVQLSHSGVQNMHHVLPPPPPLPLPPPPP------PPPPPSQAPNPDLLRPPQPSTVG 180
Query: 181 PVHPPSQGQTLYGTRVHPPLQQGGLQVFPSIPQHPTTSNFPTPPFGGVMQSNLGESHLSP 240
+HPPSQGQ LYG R H PLQQGGLQVFPSIP HPTTS FPTP + LG+SHL P
Sbjct: 181 SLHPPSQGQALYGARTHQPLQQGGLQVFPSIPPHPTTSTFPTP-----SSNFLGDSHLLP 240
Query: 241 MA-PPPPPSSPPPIPPSPPPPTSPSFYSIPSSGSSNLLCQSEFDPSSTINSSKELKAFES 300
MA PPPPPSSPPPIPPSPPPPTSPS SIP SSNLL S+ PSST++ SK+LK E
Sbjct: 241 MAPPPPPPSSPPPIPPSPPPPTSPS-PSIPHPDSSNLLHGSDLGPSSTVHYSKDLKPSEI 300
Query: 301 NQGGTPTRHLGDNGP-KHKHRNLDGSIGLMMGSKVDNEILSDKGNVQDLPPSPPKPKDDK 360
+QGGTP HLGDNGP +H NL+ GLM+ S VDNE L+DK VQ LPPSPPKPKDD+
Sbjct: 301 DQGGTPPSHLGDNGPGNDEHGNLEVDSGLMV-SNVDNEKLADKDYVQVLPPSPPKPKDDR 360
Query: 361 ITRKIEVLCKYIANNGSSFEDTTRRKEFGNPEFEFLYGGKPGSEAAIGHEYFLWMKKKYS 420
I +KIEVLC+ IA+NG +FEDT R+KE GNPEFEFL GG+PGSE+AIGH+YFLWMK KY
Sbjct: 361 IVKKIEVLCQLIADNGPNFEDTIRQKESGNPEFEFLLGGEPGSESAIGHKYFLWMKMKYC 420
Query: 421 LDCKNKEMEEKSPVRSLRIGPQSESLTVSAASISPENSDMEMEDDITPDGIGEETSHSFE 480
L KN E+ E+ +R LRI PQSE+LTV AAS+SP NSDMEMEDDIT + + TSHSFE
Sbjct: 421 LASKNIEITERCSLRYLRIEPQSENLTVLAASLSPANSDMEMEDDIT---VEQGTSHSFE 480
Query: 481 IQSYECKSRKEEHDAKD--QLQGPEDLQRSSPVKGKVAEEKDGESKLLLDHEKSVSLEAC 540
IQSYEC++RKEEHDA+D QLQ PE L+ SP K KVAEE G K LL+HEK S+ +C
Sbjct: 481 IQSYECEARKEEHDARDLVQLQEPEVLRSCSPEKEKVAEE--GGPKHLLNHEKFGSIASC 540
Query: 541 QVHSPVINTAGVVEQPLGSNFEISVTCLQNEK----SLAASEAVNSSPSTELIIGGSPFR 600
QVHSPV +TAGV P G++FE S++ LQN+K +A+S SS ST LI GGSPFR
Sbjct: 541 QVHSPVRSTAGVAGHPSGNDFENSLSYLQNDKGQAGEVASSAGTISSQSTALITGGSPFR 600
Query: 601 LIQDYASDENSETDEESHLKDVSF-AISPSTPASSKTSGKDSDNLTILGSKGSCQVQRSN 660
LIQDYASDENSE+DE+SH DV F AISPSTPA SKTS KD+ +LT LGSKGSCQV+ S
Sbjct: 601 LIQDYASDENSESDEDSHRTDVHFVAISPSTPAYSKTSDKDTGDLTTLGSKGSCQVRWSY 660
Query: 661 VPPCEASMPDFGSQFLSESPKQIFDANEANVRRAGNERNYKIHQNQVGTRTSSKSLDADA 720
VPPCE SMP+ G+QF SESPKQ+ DA EANVR+ GNE +Y NQ+ T T +KSL DA
Sbjct: 661 VPPCEFSMPEPGAQFHSESPKQVIDATEANVRKTGNELSYNDQHNQIDTVTGTKSL--DA 720
Query: 721 VKGRSVDVLHDSHKLQKENDEEKLKFGSSPVKIDEFGRLVREGGSDSDSDDSHYTRRHKK 780
+ G SVDV D+ KLQKE D EK + G SPVKIDEFGRLVREGGSDSDSDDSHY RRH+
Sbjct: 721 MNGCSVDVPQDTGKLQKETDAEKGRLGPSPVKIDEFGRLVREGGSDSDSDDSHYRRRHRS 780
Query: 781 RRTRNSSESHSPVDRRRGRRSPWRRRQRRSRSRSWSPRNQRGRSRSRSPVSRRTSQFNNE 840
RR+RNSSES SPVDRRRGRRSP RRR+RRSRSRSWSPRNQ R RSRSPVSRRTSQF+NE
Sbjct: 781 RRSRNSSESRSPVDRRRGRRSPRRRRERRSRSRSWSPRNQ--RDRSRSPVSRRTSQFSNE 840
Query: 841 NMKRDKGMIRKCFDFQRGRCYRGASCRYVHHEPSKNDGSRHHRSKHHDVHPTSENIKGRE 900
N +RDKGM+RKCFDFQRGRCYRGASCRYVHHEP+KNDGSR HRSKH DVH TS+NIK RE
Sbjct: 841 NKRRDKGMVRKCFDFQRGRCYRGASCRYVHHEPNKNDGSRFHRSKHQDVHSTSKNIKIRE 900
Query: 901 DTVNMSREVSDLGHIKVENQGCIQHNVSPKDDTHDWKKGSPTGDPGLDVTKCQSSRDRAG 960
DT+NMSREVSDLGH KVE Q I HNVSPK+DTHDWK +PTGDP V+KC+SS +R G
Sbjct: 901 DTMNMSREVSDLGHTKVEIQESILHNVSPKEDTHDWKTDNPTGDPDSFVSKCRSSSERTG 960
Query: 961 LVQEELIYSKAAEAVHIHVNENIQEAGKSYEQLSVTAASQCMSNADTEKLSGDISMSMLT 1020
LVQ+ LI + AEAVH+ N++ QE KSYEQ SVTA+SQCMSNADTEKLSGDISMS+LT
Sbjct: 961 LVQDALICLEPAEAVHVRANDDGQEPKKSYEQPSVTASSQCMSNADTEKLSGDISMSVLT 1020
Query: 1021 SVERSLAHAQQSNMFASEFEAANSVSHQMEGSFVSHLLPDQVTAVSTNKAPECEHFPDKN 1080
SVE S+ AQQSN F +E +++ +SHQM+GSFVS+LLPDQVTAV++NKAPE EHFPD+
Sbjct: 1021 SVENSV--AQQSNTFVAELQSSTDLSHQMDGSFVSNLLPDQVTAVTSNKAPEWEHFPDRT 1080
Query: 1081 SLIKLQFDTSSAGQQPSTSQFLSESPVPKSLSATAPGCAMDDAHPLRELPPPPPLPTSCV 1140
S IK QFDTSSA Q P TSQ LSESPVPK LSATAP A DD H L ELPPPPPL S V
Sbjct: 1081 SSIKPQFDTSSAIQLPLTSQILSESPVPKPLSATAPVSATDDDHSLTELPPPPPLIISHV 1140
Query: 1141 TSADVLMPTPYNFVSQNVSFPSKPSLPGGFQPHQDIVSIQSSHYHSTTFPPSRPLYDPTM 1200
+SA++ MP PYNFVSQN+SFPS SLP GF PH +VSIQ SH+ ST+ P +PLY+ ++
Sbjct: 1141 SSAEISMPAPYNFVSQNLSFPSNSSLPIGFHPHHGMVSIQPSHFQSTSLLPPKPLYN-SL 1200
Query: 1201 AHVTTKDGTPMQFHQSHLSQGSDLGSQSVMKSQPLVSNSHSMLGESPVREPYRA-PLHMD 1260
A V T G PMQFH SHLSQG DLGSQS M SQPL +SHS LGESP++EPYRA P+HMD
Sbjct: 1201 APVATNAGMPMQFHHSHLSQGRDLGSQSAMSSQPLELHSHSKLGESPLQEPYRAPPMHMD 1260
Query: 1261 EIRSIAPVANNRPIQPFGFPSFQKEENFGRTSVEMSSSSFFPHRNFNDQSMPFTNANRMQ 1320
EIRSIAPVANNRP QPFGFPSFQ EEN GRTSVEM+SSSFFP RNF+DQSM TNANRMQ
Sbjct: 1261 EIRSIAPVANNRPTQPFGFPSFQNEENLGRTSVEMNSSSFFPQRNFSDQSMLATNANRMQ 1320
Query: 1321 SSGDNFPPSEFRSSFSQFHSYSRFQQPLYASQSAHDSFLHGPSQIGTISRHYPDPLSRNH 1380
SGDNFPPSEFRSSFSQF YSRFQQPLY SQ AHD+ H PSQIG+ISRHYPDPLSR+H
Sbjct: 1321 PSGDNFPPSEFRSSFSQFQPYSRFQQPLYTSQPAHDTLFHDPSQIGSISRHYPDPLSRSH 1380
Query: 1381 SSLLPDFGGLGITTYHNPYASTFDKPLSSNFRSNILNFGNDAPSGDIRDSTFNLSNVRVD 1440
SLLP+FGGLGITT+HNPYASTF+KPLSS+FRSN LNFGNDAPSGDIR STFNL++V VD
Sbjct: 1381 PSLLPEFGGLGITTHHNPYASTFEKPLSSSFRSNFLNFGNDAPSGDIRGSTFNLNSVHVD 1440
Query: 1441 GQGANYVGSGLTTTSPKSTKPSGKHLPSSGGDQYDPLFDSIEPSPPITKKSDRVRKLEKA 1500
GQG NYVGS T SP STKP GK L + DQYDPLFDSIEPS PITKKSDR +KL+KA
Sbjct: 1441 GQGTNYVGSRQTVASPNSTKPLGKLLSGTDDDQYDPLFDSIEPSSPITKKSDRGQKLKKA 1500
Query: 1501 RESHMMTRLGGSHKLPDVEENNKHKEVAAVASTTSLENDEFGETADAEAGAVENDLDDEE 1560
RESHM+ RLGGSHKL DVEENNKHKEVAAV STTSLENDEFGET DAEAGAVENDLDD+
Sbjct: 1501 RESHMIARLGGSHKLLDVEENNKHKEVAAVTSTTSLENDEFGETGDAEAGAVENDLDDDA 1560
Query: 1561 NLTGEIEIDQVKSSEKSKKSKGSRSLRLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVK 1620
NL+GEIEIDQVKSSEKSKKSKGSRSL+LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVK
Sbjct: 1561 NLSGEIEIDQVKSSEKSKKSKGSRSLKLFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVK 1620
Query: 1621 KTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1649
KTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM
Sbjct: 1621 KTVDKVSGAMKSHQIPKSQAKINRYIDSSQRKLTKLVM 1631
BLAST of MS002044 vs. TAIR 10
Match:
AT3G26850.1 (histone-lysine N-methyltransferases)
HSP 1 Score: 143.3 bits (360), Expect = 1.8e-33
Identity = 109/251 (43.43%), Positives = 142/251 (56.57%), Query Frame = 0
Query: 1438 GSGLTTTSPKSTKPSGKHLPSSG--GDQYDPLFDSIEPS-------------------PP 1497
GS ++SP S K GK +P G GD YDP DS EP+ P
Sbjct: 11 GSRQASSSPYSGK--GKIVPECGLVGDMYDPFVDSFEPASVKLDCVQEHEPDNDLCIVPK 70
Query: 1498 ITKKSDRVRKLEKAR---------ESHMMTRLG-GSHKLPDVEENNKHKEVAAVASTTSL 1557
+ S+R +E+ ES M R+ S+K DVEEN E+ V S
Sbjct: 71 ASISSNRPLSMEENNQAVDKEPLCESEMTARVSVSSNKPADVEENTAGIEIGEVVSG--- 130
Query: 1558 ENDEFGETAD--AEAGAVE-------NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLR 1617
E+DEFG+ D E + E N+ EN E + + KS EKSK+ SRS++
Sbjct: 131 EDDEFGKNVDDGRECNSHETLTPNSDNENPKVENNVHEGDNTRKKSREKSKERDSSRSMK 190
Query: 1618 LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYID 1649
LF++ + FVK++LKPSWRQGNMSKEAFKTIVK+ VDKVS +M+ +IPKS+AKI++YID
Sbjct: 191 LFKVVLTKFVKDLLKPSWRQGNMSKEAFKTIVKRVVDKVSNSMEGRRIPKSRAKIDKYID 250
BLAST of MS002044 vs. TAIR 10
Match:
AT3G26850.2 (histone-lysine N-methyltransferases)
HSP 1 Score: 143.3 bits (360), Expect = 1.8e-33
Identity = 109/251 (43.43%), Positives = 142/251 (56.57%), Query Frame = 0
Query: 1438 GSGLTTTSPKSTKPSGKHLPSSG--GDQYDPLFDSIEPS-------------------PP 1497
GS ++SP S K GK +P G GD YDP DS EP+ P
Sbjct: 11 GSRQASSSPYSGK--GKIVPECGLVGDMYDPFVDSFEPASVKLDCVQEHEPDNDLCIVPK 70
Query: 1498 ITKKSDRVRKLEKAR---------ESHMMTRLG-GSHKLPDVEENNKHKEVAAVASTTSL 1557
+ S+R +E+ ES M R+ S+K DVEEN E+ V S
Sbjct: 71 ASISSNRPLSMEENNQAVDKEPLCESEMTARVSVSSNKPADVEENTAGIEIGEVVSG--- 130
Query: 1558 ENDEFGETAD--AEAGAVE-------NDLDDEENLTGEIEIDQVKSSEKSKKSKGSRSLR 1617
E+DEFG+ D E + E N+ EN E + + KS EKSK+ SRS++
Sbjct: 131 EDDEFGKNVDDGRECNSHETLTPNSDNENPKVENNVHEGDNTRKKSREKSKERDSSRSMK 190
Query: 1618 LFRIAIADFVKEVLKPSWRQGNMSKEAFKTIVKKTVDKVSGAMKSHQIPKSQAKINRYID 1649
LF++ + FVK++LKPSWRQGNMSKEAFKTIVK+ VDKVS +M+ +IPKS+AKI++YID
Sbjct: 191 LFKVVLTKFVKDLLKPSWRQGNMSKEAFKTIVKRVVDKVSNSMEGRRIPKSRAKIDKYID 250
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022135323.1 | 0.0e+00 | 97.36 | uncharacterized protein LOC111007314 [Momordica charantia]
XP_023531277.1 | 0.0e+00 | 77.50 | uncharacterized protein LOC111793568 [Cucurbita pepo subsp. pepo] >XP_023531278....
XP_022931323.1 | 0.0e+00 | 77.66 | uncharacterized protein LOC111437543 [Cucurbita moschata] >XP_022931325.1 unchar...
KAG6587592.1 | 0.0e+00 | 77.29 | Zinc finger CCCH domain-containing protein 55, partial [Cucurbita argyrosperma s...
KAG7021558.1 | 0.0e+00 | 76.87 | Zinc finger CCCH domain-containing protein 55 [Cucurbita argyrosperma subsp. arg...
Match Name | E-value | Identity (%) | Description
A0A6J1C4H9 | 0.0e+00 | 97.36 | uncharacterized protein LOC111007314 OS=Momordica charantia OX=3673 GN=LOC111007...
A0A6J1EZ49 | 0.0e+00 | 77.66 | uncharacterized protein LOC111437543 OS=Cucurbita moschata OX=3662 GN=LOC1114375...
A0A6J1KND4 | 0.0e+00 | 76.40 | serine/arginine repetitive matrix protein 2-like OS=Cucurbita maxima OX=3661 GN=...
A0A5A7UQ65 | 0.0e+00 | 75.98 | Serine/arginine repetitive matrix protein 2 OS=Cucumis melo var. makuwa OX=11946...
A0A0A0LRV0 | 0.0e+00 | 74.79 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G014360 PE=4 SV=1
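The identity percentages in the tables above appear to correspond to the "Identity = matches/length" figures on each HSP summary line (for example, 1259/1657 gives 75.98 for A0A5A7UQ65). The following is a minimal sketch of how such a line could be parsed and the percentages recomputed. The function name parse_hsp_stats and the returned dictionary keys are hypothetical, chosen for illustration only, and are not part of any BLAST tool or of this database.

```python
import re

# Matches HSP summary lines such as:
#   Identity = 1259/1657 (75.98%), Positives = 1370/1657 (82.68%), Query Frame = 0
# The optional "i" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line.

    Hypothetical helper: returns None when the line does not look like
    an HSP summary line.
    """
    match = HSP_STATS.search(line)
    if match is None:
        return None
    identical, id_length, positive, pos_length = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": id_length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identical / id_length, 2),
        "positives_pct": round(100 * positive / pos_length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 109/251 (43.43%), Positives = 142/251 (56.57%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a quick consistency check between the alignment records and the summary tables.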