Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
GTCAGAACCTAACAGAACCGATTTATAAACAAAGGATCGTGGTACGGTACGGCTTTATCTTGCTTTTGCTGTCAAAACCCTAAACAATAAATAGGAAGAAGATTCATTGCATAGAGGTGGAAGAAATCATTGGCTCCCGCCATGGAAGAAATCGATGGATTGAGAAACCAATTAGCAACCACACCGCTGCATTTCAAAGAATGGAACGATTGATGAAGGGTTTTACACTTTAGTGAAGCTAATGCTTAAATTTTCTGATCAATTTTGCTTGCTCCTTCTATAGAATACCCGCAGAGAACAATTTGAACCGACTTCAATGGACCGCCTTTTGGAGGAGGGGTCATAAGTTGATCACAGCAGTTCCCCGAAGAGACAGAGAGTTTTCGTTCAAAAACTTGAAAACTATTATGTTTGAAGAACTTGTTTCACAAGTTGATGATTACTGAACTCTTGCTCTTGAGTCATTCCCGGCGACTGATTGAAGTTGTCCATTTTCGATTCTCTTGCACGTTAACCGAGCAATTCCCCGGAACTTGGGAGATTTTAGACTATGGGTTTCGTAAGAATTTGAGTTCAGTAATCTGGCTGTTCTTATTTAACCCTTTAATTTTGCGGGGATTTGAAAATGGCGGAGGATGCTGTGCCGAAGCCTGAATCGTCTAATTCCTGCTGTAAAGCGGTAATGCTTTCTTTCTTTCTTTCTTCTTCTTTCTCTTCTTCTTCTTTTTCGTCATCTTCTGCATTTTTGTGCTTCGTTGTTTGTACTTCGTTTCATTTCGTTTCGTTATGGTTGTTATGAGTGCAGTGGAAGGATAAGTACTCAAAGCTAGAAGAGAAAAGAAATGCTTTACGGCAAGCAGTCAAGCTACTTGAGCAGCAAATCAATAAGATTCAGGCCGAGAATCTCAATCTTAAAAAAGGTAGAATCTCAATCTTGATGATTTTCCTCTGCCTCTTTTTTATGTGTTGTATGAACGAGACTTCTTTTAGCAGGTTGTCTGGGTGTTGCAGGGATGATATTTATTCATTTTAGAAATTGTGATACATGCCAACACTAAAATGGAGAAGGACATATGGTTCATTGAAAAAAAAAGTTAACAACATAACGAACTTTTTATAACTTTCTTTAATTTGTGCCAACCTAGAGGCGCAAATACACGACAGGTGACACTTCATTTTCTAAAAATCTAAGACACAGACACGGACATGGTGACACTTCATCGTATAAAAATCTAAGACGCAGACACGACAAAGATACACTTATTAAAATATAATTTTTTTTAAACATAAAAATGCATAAGATTATTGACTTTGAATATCTTTCATGTATAAAAGTGATATATACTTTTTATTTTATTTTATTTATGTTTAATAAAATGAGTTTAATGTATTTTACTTTTAAAATTTATTATTTTATCTTATATTACCATTTATGTCTATTTCTTATGCTTAACAAGTGTTTGTGCCTAACAAGTACCGAATCCATATTCATTTTGTACAACTAGTGTGTAACACAAATCTATTAAGCTAACAAGTGTCAGATATGCGTCTAACAAGTATCAAATCCATACTAATTTTGTATAACTAGTGTGAAACACAAATCTATTAAGCTAACAAGTGTTAGATATGCGTCTGATGAGTGCCAGAGTGTCACACGTGTCTGGACACGTTAACTAAAATGAAGTGTCCGTGCTTTTTAGGTGCCAATTGATGGAAGTTTGTTTCATCCAGTCAGCTATATTTTTTCTTCCCAGTGACTTTTTTCTTATTTGTATGATTAACAAAAAAATGAAGCCAAATGTATAATTCCTGTCCGTTATTTCTTTTCTTTTGAGTGGTTCTACGTAAGAAATGGAATTGGTTGTTTTCAGCCTTTATACTCTGAAAGAGTTTTCTTCTCGCTATCTGCATTAAATTTGGTTTCTGTGTCACCTTGCACCTCGCCCTAGAACACTGGGGATATAAAAGGATCTTTTAGTTGAGCTGTTGACTTATGTGGGTTGTAAGAAGGGCCACACTACAAGTCCTCAAGAGGCAGCCTAATTGGCAAGGGCTTGAGTCTCTTGGTTATATTAGCTTAGAGATCTCAAGTTTGAACCTTTCGGTGGACTTAATACCAAAAATCATGTTTCCCGAGTCTAGGCCTTGGGACAAACGTGGGTGCCCTGGGTATAGGGGAGCAAAGCTCGGACTCCCGGTTATCATGGAAAAAAAAAGAATGGCTACACTACACTTGGTCGATGTAACTTTATACCTTATTCTTCTTCTTATACTTTTTTTCTTTTTTGTGATTATTAATGGTTGTGTAGGTTATGAGGAGGAGAAGGCTCGAGCTTCAATTGAGAGAGAGGGCAAAGACAAAGAATCTACTATTAGAGTGTCTTTAGAGCGGGAAATTTTGGACTTGAAATCTCAAATTTCTTCTTTGGAACAAAATAATGTAAATGCTGTTAATGATCATGGGGAAGTGGAGCATCTTAATGCTCTTGTTGCTGAGTGTAAGAAGGAAATTAGCCAACTAAAAGAACTTCTCGAGATAGAGAAGAGAAGGACAGATGCTGAAAAGAAAAATGCTGAAGTGAGGAAGGAGGAGGCTGCTCAAGCTTGGAAAACAGGGAAGGTTGAAAGGAGTAAGACTAGTGACTCTAGGAAGTTTCACAAAACTGAAATGGATAAGGCTAATGAAGGCAGACAACAGCTAGGGATGTTGAAAAAGGAATGTGAAGAAACAAAGTTAAAGTTGGCTAGCGAAACATCTAAATTAATGGAACTTATAAAAGAGTTGGAGGTAGAAAAGCAAAATACTTTCAAAGAGAAAGAGCGTGCAGATTCTGAAATGTCTAAAGCAGAGGATTCAAGGATGCGAGCTGAAGCCATCAGGAAGCAGGCTGAGGGAGAAAAATCTAGGGCTGAAAAGTTATTGAAGCAATTGGGAAGAAAGAATAGCAAGATTGAGGAATTGCAGAAGCAGGTCAAAGAACTTCAGGCCTCGAAAATTAGTATTGAAGCTCGTTGTAGGCAACTTGACAAGAAGACTGATAGCAAGGCTGTGGAACAGAATGATAAGCCTTTGTCGGAAACGATACAGAGAGATGCAAATGAAATTAAGTTGGTTTACGATTTATTGAAAGCTAAGGAGGTTAACATGAGGCATAAGATGGTTGGAGATCTGGCAATTATGAAGGAAAAACCAATGGATTTCAAGCAAATGACATCAGAAGATCTGAAAAACCGTATAGGGATTTATCATAAGAAGGCCATGGATGAGCAATGCCGTGCTGATAAATTGTCTTTTGAATTAGAAGAAAATAAAAGGAAAGTTGAGGAATTGCAAAAGAAGTTACGTGAGTTCAAGTCTTGTAGGAAATTAGTTGATGCATCTGTTTCTTCTGAACATGCTACGAGTTCTGAACGTGCAGAAATGAAGCTTTTGAAAAAAAAGCTAAAGTTTGAGAAGGAGCGAGTAGAACATGCAAGACAAGTGGCTAACTTGGAAAAAACTCATCGTTCCATGATTCAACAAGATCTGGGTCGTTTTAAGCTAGAGTTTGTCCAGCTGTCAAACCACTTGGATACTATACATAAACTTGCCTCTACTGGTGCTAAGGATAATCATGACTTGGAAAAGGTTTGGTTTTCCTTTTTAGTTACTTTTTTTTTTCTCTAAATCCTGTATATACAGTAGAACTACTGGCTTTATTGACAATTATATTGTTTCTTACTTTTAAAAGTACAATGACACGACATTTATTTGCACTCCTTCTGCTGGATTAACAGTTGAAACTAAAAATCTTCATCAGTTTTCTGGTAACTCTAATTTCTACTAGCCTTGCAAGTTAAGAAGAAGTTGCTAATCATATTGACTTTAGATTCATAGTTTTGCTATACTTTGCATGAAACTGTCTTCCCTTTTCATATGTTTTATCAATAATCTTAAAGATTCTACATCCCTTTTTTTTCCTTATGCCAAAATTCACCTTCATCATATGCTTTTCCTTTATTTCCTTTCTCCTATATTCTTGATTGTTAGTCGTGATAGTAAGTGTTTATGATTTTGCTTCTTCACTTGTAATTGAGTTGTTATAATTCTCTTTTCTTCAACTGAAACAACGAATTTTTATTACATATGAATATGTTATTAGATCAGTGATCATCTTGTTGAATAGTAGGGCTACTAATGTGGACTCTTTACTCTTCAAGGATTGATGTTACTATGCTCTACCTACTTTTGTTGAAACTCATTTTTTGTCGAATGGTTCACAGACTAGGAATGCTGAGAACTTGCACAGGTTGTACTCTAAGAAGCATATACAGGCTATAGAGCCATTCCAGACATGGATGCCTGAAAGTAATTTTAGGCGGATGATCCCACAACACGGTGCTCCATTGCTTTCTTCAGAAGGGAATCATGTCACATCTGTATCAGGTATCGAATCTAGGTTGGAGTCTCTTCCTGGAGGCTCTAACAGAAAAATGTTGCAAAGTTGTGCAGTCAATTCCAGTACAGCATCTTTTTCTGACGGTCAGTTGGTCGGCTCACAGGATAAGGCTGGCCTTTGTTTGACAGCAGCAAAATTGGTTGGAGAGAATTTGAACGTGCAACCAAAAGTATCCAACTTAACTGGTGAACTTAGTAAGATGAAAAGCAATGAAAATCTTGCTATGGTGGCTGAAAATAGTGTCAGAAGTCCTATAAAAGACCATGTTGGAAGAGCTAATGAAAAACAAAAAAAGAGAAAAAGGACCATTGAAACTGTTGAATCCATTGAATATTTGTATCATAAAAGTAAGAAAATGCATTCTCAGATTGAAGAGAAGTTGTCTCTTTTGCATACTTTAAACCGCCCTACTGAGAAGCCCTCAAGAAAGAGTGAAGATGTAATATCAAACCCGCCTCAAGATTCTTCCGCTGATAAGAAATTTCGGAAGAAGACAAATTCTTTGTATCAGAAGAAAATAAAGATGCAAGGTTTACTTGATAATGATGAGATAAAGTTGAATAAAGTTGATACTGAAGTTTGTGTGCCTGAAAGTGTTAGTAGACAACCCTCTCAACCCGTCAGCAAACTTACAGACAGTTGTCAGCCATGCCTAGAGGAACTCGATAATTCTGTCATAAGTGAACTTCAAATCTTGGAAACTTTTGGGAACATAGCAGGTGGGGACTATATGAAATTGCTAGATTTGGATAGTGCTGCTGACGAGAAATGCTACAGGAGAGCAATGGAAATGCCACTTTCCCCTTCACTTCCAGATATTTGTGTTCCTGGCACTGAAACATCTGCTTTGAATGATTTTGAGCCTTTAGTAGACGAGTTCCATAAAAAATTGCCAAAAGAAAAAGGTCAGCCACATTCGCATAGCTATGATGTTATCGATGTTGAGATTAAGTCCAATTATACCCAATCCTGCAACTCTGGCTTGTTAAGAGATATTCATAGCAGTAAACGCCAAGTTGATCCGTGTTTAATGCTAGGGAGCCATGGGAGTGATCTTTGTGATATTGTACAGGCAGAAAAGATCTGTCTGGATAAGGTTGGGGTCATTGTAGAGATGCCTGGGACAGAATTCTCTCTTTCTGGTTGTGAAGGGGTGGAAACGTCAGAAATTAAATCTGGATCCCAGGACAATTTTGTCCCTGATTTTTGTGTTCTTTTCTCTAATGCAGAAGACTGCCACAGCATTTCCAGGATTTTTTCAGCAACTAGGGCTTGTATAAAGAGGATCTCTTTGACTAGTCAAAAAGAGTGGATGGTGCAAGAGATTTTGGCAGCCCTTAACATGGAGCATAGACTTTTACCAAAGTAAGTGTTTCATTTTTGCATCTTTGTTATCTGTATCAGTGCAAATAGTTATCGGTTTTAGAAAATATTTTATATTTTATGATCCCCTCCTACCCCAAAACTAAAAGAGATCCTTAAAATTTCAGAAAGAGCTTGCTCTGTAATAGAATTGAAGTACTAATAATGGATTTTCAGGGAGAAGACTTGTGTATTCGTTTCCTTGGTGCTGCTCAACTTCACCGTTGTTGCTATGCATAGGTATGGGAACTTTCTGAACTGCAAGACCTGCATGGATTCTTTTTCGGGGCACATAAGTGAAGGTTTACTCTTTTCCTCTTAGTAATTTATGGATTAATGAATTTACTTTATTTGTTTTTTTTTGGTTTCATCTTAATGATACTTTCCATATTTCAGTTTTTTTATTTAATTTTAATTATATTTTTTTTCCTATTGGCAGCAATGCTTAATGTGGAAACAAGAAGCTTGTTTGCTGAATTGTGCTGTTTGGACGAGTTACTTGCCCTCATAGAAGACTTCATAATAGATGGACGGGTCCTGTCATGTACTGATGCCTCTTTCGAGACATTGACTGAAGGTGGTGTGAGGGTTAACATCCCTATTGATGGTGTAGACAGAACATTGTCGTTTGTACCAGCATCAGCAGACTATTTGATTGCAGGGAGTTCCATATTAGCATCAATTTCTAAAGCAGTTGATCGTACTGGCCTTCTTTGGGAGGTGTCATATAGTATTTTAAGGATCTGCAGGTATGAATCATCGCAGATGTTAACTATGCTTCATATTTTTGCACATATTGGTGGAGATCAGTTTTTCAGTTTGGAAGGTTACTCTATTTTGATGGCTGTCTTGAAATCAATAATCATGCACCTTGAGTTGGTTGGAACATCAGATGATGCTACTTTCACCCTACCCAAAAGAAATTGCAGAACAGAGTTTGTTCAATGTGCTAATTGCCCTTTCTCAGAGGAAGTAATGTCAATGCCCATGGTTGTGTCTTTTCTACTGCAATTAATCCTGAAAAATATGTCAAATGGGATTATGGCTGAAGATTTAGAAAATGCTACCAGTTCATCCAATCTGAAATCCTTGTTTGAAAGAAATATTCAGATTCCAAGTGAAAATTCTAGTTGTAAAGAGGTCCATCCAGCACTGTATTTGGACTGTGATGCATCTTGTTGTTTGAAGAAGTATAAGGTGTCTGATGATCAACCATGCTTTTTCTTCAACCCAACATTGTGTGATGTTAGCGACGCGATCTCATTGGTTGAACTGATAGCATGCTACATGGTACTGGTTCTCACCTTTTCATATCACTATTAGTAATTTTACATTAGTCTTTCACTGATATGAGGACCGGCTTCCTGGTTCAGCAGGTCACTTTGAATATGTTGGAATATTGTCGAGTGTATTTAGGAAGTGAACAGAAAAAAAGGACAATGGAATTTTGCCAAATAAGAAACGTCTTCCTGATCTCTATCTTTGTCAATTGTCATCGTATGTCCTATTGTTTCTTTTTCCTAACCAAATAGCCAATAGAAATGCTTTGACAACTTTCATATAGATGATTTTAGGTTTTACTTTTGGACCATTTCCACATAGCATCTGAACCAAGTTATTCTTCAACGAGGGATCATATACCCAGGAGGCTTGGAATTCCTCTAATGAAATCCAGAAGTTTCTGCTATATTCACAAAAGAAGAACGAATGACTTCTACTCTTCTAGTGTCTTCCAATTTCTTGCATAAAACACCTCAATTTGGCATTAAATTCAGATTAGGACTCCTTTGAAACCTTTGTACATCAATGCTCTGTAAACAAAGACCAATAGAAAATTTATCTTATTTGGAACCTGCTAGATTGTTCAGGAAATTTTCTTGTGGCTTCCTTAAAAATTGTTAGTTTAGAAAGAATAGGTTTAGATGAAGAATCCCCAGATGAATCAAGCTTCCATATTGTTGTATTGTTGTATTTTTATTTTTTATTTTTGGGGTTCAATGAACCAAAGAACCCAAAAGGGAGAGACAACCTAAGGGCTGAGGGTTGAGATCACCCTTCCCAGAAACTAAACAATGGCGGCTTGCCAGTTTCTAACTATCGAAGTCTAACTATCGAAGTGTAGGCTATAATTAAAAAAGAATTTCTGGTTGTATCCTCTGTCCATGAAATACAACTTTTCGGCTGTATTATATGACATAAATGACTATTCACAGTGTATTTGATTCTCGGGTTCAAAATTATTGAGAGGTATTTGATTTGACTTGCTAGTTGCTCCTGTTGTATGCTTATCCTCAATGATATTGTTGTTTGATCTCTCTCTCTCTCTCTATAGTGGTATGGTACACACGGACACACATACACATATGGGGAGCGTCTTTTTAAGAGTTATTCTTTTAGTTGTTATTGTCTGTTAGGTATATGGACGTTATATTTTCAATTTGTAATTGTAAAAGTCACATTTAAGGTTTATTTACAACGGTAGCAACCTTAAAATAATATCCCTGGATCTTAAAGTGGATTACTTTCCTCTACACTGCAGTTCTTATACATGTCCATTTCAATATATGATAATTGATATTTAATCTCGAGAGTTCTGAACTTGATGTTGCTTATCTTGAGAGAACCGAACCTGAGATTGCTTCTGTTTTGCTTTTCAATTGCAGAGCTGGAATTGGACATGCGTTAACATTATCTCCCAGCTGCTGGAATTATTGAAGTCATCTGTAAAGGAGAGTCTTGCTATTGTGATTCTTCTAGGTCAACTTGGGAGGTAAATGATGTGCATTTTGGTTCATTTTGTCTTGTCAATAGTACCAAATATTTTGTGCTTGTTTTTGTTCTTGTGGCTATTCACCTTTTAATCGAGTGGAGCTTCTCTTGAAGCTTGAATGCAGGACAGGTTCTAGTCTCATTTGTAATGTTAAATCTTGTTCTAAATACATATTAGACATGAAACTTGTGTGCCAAAACCAAAAGAAAGCTTATTACCTAACATGATATGGGACTCTGACAGTGTGTGGTAAAATTTAACTGTGTCAGATGAATTACTGATATGAAAACTAGCTAGCTTACACTGACATTTTTAGATCATAGACGCTTTGCCATCCAAATAAGTCAACAGTGGACTGCTAATCATAAATACGTATGATGAACCTGTTTCCTTCCTACCTAAAACTGTTAGCATTTCCTCCTTGCTAAAATCTTAAATAGGGCCCTGCAGCATTTTCTCCTTACCAACATTGTACTATCCCCCATTATGTATATCTTCTTGAGACTTGGCTCCTCCCATTATGGAACTTTGTTTTTGTCTTGTTTACTTTTGTTTTTTATTTTGCTATTTGAAATGCACTGGTATATGGATGCACAATTGTACTACAGGGGACACAGTTTCACTGTTTGACAAAGCTGGCATTTGAATGGAAATTTTATTTGTGACCAATTATCTAAAATTTCAATTTTTAATTGTAGGTTTGGCGTGGATTCTGGAGGCTTTGAAGATGGAGGAGTTAAAATCTTGAGATCTAATCTATCAGAATTTCTTTGCCTGGACACTACCATTAAATCTGGTCTTTGTGTTCAAATTGCTATTGTTTCTGCCCTGTTAGGCCTTCTCCCTTTTGATTTTGAAACAATCGTTCAAGATAAAGGAAGCTATCCACCCTCTTTGAGTCAATATGTTGAGGTTAACTTGATAAAGACGTGGTATTCTTTATTGAGCTCGAAACAGAAAGAGTTGTCATGTAACATTTTACAAATTTCTGTTGCGATGTAAGCTGAAATTTGATTTCCTTTCTTGACGCAAGCTTAACCCAACTGACGTTGCAACCTGTTTTGGAGGCGTGGACAAGTAAGAGTTATCACAACCAATTTTTTAGGATTCAACTACCTGTTGATAGTTAGGAAATTGAGGATTTGTGCATACCTGTAGCTTAGTATATTTTTCTTACATTTTAAAGTGGAGGATACAATACAATATAGATTTTGTACTGTTTTCACAGGTGCAATATCTGAGTAAAGTTTTGTAAATGATCCCAGTAATTTCCAGTTTCTTCCCTTCAACCATATACATATTGATTCGCCAGATTTTCCTTGGTTTTGCAGTGTCCTGCTGACTTTAACGACAAAGTTGATTTGTACATCTGTGGTCTTCTGTCGGATTTGTGGTTATATACTCTTACTATTCAAATAACAAAGAAACAAGGAGTTCAAGAGTGCCTTAATTGTGATTAATTTGTGTCTACAGTTATTATTACTGTCACCGACAAATCATAGTGCAGAAGTGCAATGTGTTGTCGCTTTCTGATCAGCCCAGTTTGCACGTTGTTCTTGGAATTTGTGAACAGTAGCACTTATGGAGGGGACACAAATGACCAGCCATCTAGATCTTCCTTCGGTTACCAGCCTAAAGCTTGCAGACTGGCCCCATCTATTGGCCGGCCTCAGAAGAGAAGAGGTTAGGTTGCAGTGATGATTTCTAAGTATTGGTAGTTGTCTTGAATATATAAATCAAATAATTCGGCAGATGTACTATTCTTGTTCTGACTAGTTAATTCAATTTAGTTGAGTGCGTGGGGGAAAGGCTTGGAATTACGAGCAGATAGGATGTAGGGAAAGTTGAAGATGACTCATTAATCTTTTAAGTGGAGACTGTTGGGTTTGTAGAGCTAAATCACTCTAAAGGAGTCCAAAGTTATCATGGGGCACTGCAGCGTAAATCAGCATATCTAAGAAAACATTAAGTCATACAATTGTAGCATTCCATGTTGGGACAGTGCCAAATTGATGTCGTGTCATGATGCTCGTCTCTTAAAAGGTAAGAGTTTGTGAGATTCTCTTATGTTATCTCTTGTAATGTGTAATATTTATAGACTGTAGAAGTTAGGATGTAGTACAAGATGAGATGACTATATAAGTTTTGGTATTATTGTTTTCCAATCACTTTTATTCTTTTGCTATGAATTTATTTGTCCAACATAGCTTCCTTCCTGGACTAAAATTCTAAACCCACTCCATAGGATTCAAAATCCACTCTTTCGTATTATATAGAAAGATAAGCAGTCAGAAAAGAAAATTCCTCTTCAGTAGACATAGCAAGTAACAAACCACACACATCCTCACCCTCAGTTTAACAGAAGGTAAAAAAGTGAAAAACTATTGCTCATACTTCTACAGAAGTTAGTATTTGACCTCATATCTCTGCCACATAACCTAGACAGCTCTGATATTTGTAAGAAGGAAGATAATTCAGTTAACTGGAGGTTTATTGTTTTGGGTAGACTACTAGTTATAACCTCTTCTTCTGTATTTTGTAGTTCTTTTAAATGATGCAGTGCCTGCCAAAGGTGGAATGCTCTGTTCATGCCTGAAAAAATTCTCAGGATCAACTTTTGTCTTTACCTTCACAAGCCTATTGAAGTTGTCCTTGAAATATTTATTCCCCCACTCAGTGGCCTGAACCAAGCTTGTATTGTCCTTCTTGTTCATACCCAAATCAAGATCTCTGTAGTTGACATATGCTGCTCTTGGAGATGTGGACACATATGGAGCCATGTAATTATAAAGTTCTCTGATCCACTCTATGTGTTTTGCAGCATCTGTATCTCCACCCTGCCATGTGCTTAAATACTGAATTTTGAATAAGTTTCCTTTTCTGTGTGGGAATGGGATCTCATTCTCTGAGATCTTGCTCATCATTCCTCCATAAGGGTTCCAGATCGTTAATGGCTTGTCTTCTACTAGCAGCCTTTTCCACAGACCTTCCAGTCCAGCTTCTGGGATTGGGACTTGAACAAAATCTGATTTAGCTTTGAAGTAGTTCTTGAACTGTGGCTTTCCTTGCAGAAGAACTTCAGGAGGTGTTCCGGGGGGTTCTCCAGCTATATAAAGAACAGATTTAATCCAGCTGGTTTCAATACAATCTTTTGGTGTCAAACCCAATTCAGGGAAGCTTTTTCCCATAACTTGGAGGAGCCTGTTGGAATCTCCAAGAAAAAGTGCATTGTAAGCAGTTGTAATGGTTCTTTTTTTTGTTGCTTTATCACTTGTTGGTTGTATTATGACTCTGATGAACAGGTTTTCATCTAACTTATGTGCAACTTGCTGCCATTTATATAGGACTTGAGTGGCACCTTGCTCCAAGGTCTTGGGAACTGTGAAGACTGTCACAGTTTCTGGAACAGGAACCAACTGCAACTTCCACCACAGAATAATCCCAAAGCTGCCTCCAGCTCCCCCTCTTATGGCCCAAAAATGGTCCTCTCCCATGGCCGCTCGGTCAAGAATTCTGCCATTAATGTCGACGATCCGAGCATCAATGACATTATCAGCACCGAGGCCATATTTTCTCATCATAGAGCCATATGCACCGCCTGTTATGTGCCCGCCAATGCCTAAGCTAGTGCAAAGGCCAGCAGGGAAGCCATGGACTTCGCTTTTCTCTGAGATTCTGTAGTAAACTTCACCAACCGTAGCACCAGCCTGAACCCAAGCACTGTTATCTTCAATATTCACCTCGACCGACCGGAGTTTGGTGAGGTCTACAACAATGAAAGGGGTTTCGATTTCCGAAGCATAAGAGAGACCCTCGTAGTCATGGCCACCGCTACGCACTCTGAGATGTATTCGGAGCTTCTTCGAGCAAATTACAGCTGCTTGGACATGGCTGTCAAGGAGTGGAATGAAGATAAACTCCGGCTTCGGAACTGAAGCATCCAAGTATCTGAGGTTTTGCGCAGTGGACTGAAGAAGAGTGGTGAATGAAGCATTGTTTGGATCACAAACCGAGAAGGGAGGGGCAGAAAGTTGGGAATTGACAGAGAGACATTGGAAAAAAGCTTGCTCCAATGGAGCCGAGTTTGAAAGTGAAAATGAGAGAAGAAACGCAAGCACTAAATGGAAAATGGGAGAGCCTGAAGAAGGCAGCATAATTTGGTATATTTTATTTTTTTTTCTATAAAGCAGTGAAGTTTGGGGCCGAGGATTGAGATGGTCTAAAGAACTGAACTGCTATATATAGCTGTGAATGTGATGAGTTGAGAATGGTAAACGTGTACCTGAGCTAAGGCAAACGTCATGAAAATTTTGAAATCAATTTATCAGACGGCCTCATTTCAATTGTGAGATTACTCTCTAGTTTTACTCTCTGCATGGATACGGACACGAAACACAGATATGATATAACACAGATATTTTGACGTATTTTGTACTTAAAATTTATTATTACCTTATATTATTACTTATGTCTATTGTTTCTTATGCTGAGAAAGAAGTGTTTGATGTGTGTTTAACAAGTGTTTGATATGCGTCCAAAGTGTTGGGTGTGTTGAAGTGTTCGACACGACTTTTTAAACTCAAATTGTAATGTCTATGGTTCTTAGCTCATCCTCCTCTTTTTT
mRNA sequence
GTCAGAACCTAACAGAACCGATTTATAAACAAAGGATCGTGGTACGGTACGGCTTTATCTTGCTTTTGCTGTCAAAACCCTAAACAATAAATAGGAAGAAGATTCATTGCATAGAGGTGGAAGAAATCATTGGCTCCCGCCATGGAAGAAATCGATGGATTGAGAAACCAATTAGCAACCACACCGCTGCATTTCAAAGAATGGAACGATTGATGAAGGGTTTTACACTTTAGTGAAGCTAATGCTTAAATTTTCTGATCAATTTTGCTTGCTCCTTCTATAGAATACCCGCAGAGAACAATTTGAACCGACTTCAATGGACCGCCTTTTGGAGGAGGGGTCATAAGTTGATCACAGCAGTTCCCCGAAGAGACAGAGAGTTTTCGTTCAAAAACTTGAAAACTATTATGTTTGAAGAACTTGTTTCACAAGTTGATGATTACTGAACTCTTGCTCTTGAGTCATTCCCGGCGACTGATTGAAGTTGTCCATTTTCGATTCTCTTGCACGTTAACCGAGCAATTCCCCGGAACTTGGGAGATTTTAGACTATGGGTTTCGTAAGAATTTGAGTTCAGTAATCTGGCTGTTCTTATTTAACCCTTTAATTTTGCGGGGATTTGAAAATGGCGGAGGATGCTGTGCCGAAGCCTGAATCGTCTAATTCCTGCTGTAAAGCGTGGAAGGATAAGTACTCAAAGCTAGAAGAGAAAAGAAATGCTTTACGGCAAGCAGTCAAGCTACTTGAGCAGCAAATCAATAAGATTCAGGCCGAGAATCTCAATCTTAAAAAAGGTTATGAGGAGGAGAAGGCTCGAGCTTCAATTGAGAGAGAGGGCAAAGACAAAGAATCTACTATTAGAGTGTCTTTAGAGCGGGAAATTTTGGACTTGAAATCTCAAATTTCTTCTTTGGAACAAAATAATGTAAATGCTGTTAATGATCATGGGGAAGTGGAGCATCTTAATGCTCTTGTTGCTGAGTGTAAGAAGGAAATTAGCCAACTAAAAGAACTTCTCGAGATAGAGAAGAGAAGGACAGATGCTGAAAAGAAAAATGCTGAAGTGAGGAAGGAGGAGGCTGCTCAAGCTTGGAAAACAGGGAAGGTTGAAAGGAGTAAGACTAGTGACTCTAGGAAGTTTCACAAAACTGAAATGGATAAGGCTAATGAAGGCAGACAACAGCTAGGGATGTTGAAAAAGGAATGTGAAGAAACAAAGTTAAAGTTGGCTAGCGAAACATCTAAATTAATGGAACTTATAAAAGAGTTGGAGGTAGAAAAGCAAAATACTTTCAAAGAGAAAGAGCGTGCAGATTCTGAAATGTCTAAAGCAGAGGATTCAAGGATGCGAGCTGAAGCCATCAGGAAGCAGGCTGAGGGAGAAAAATCTAGGGCTGAAAAGTTATTGAAGCAATTGGGAAGAAAGAATAGCAAGATTGAGGAATTGCAGAAGCAGGTCAAAGAACTTCAGGCCTCGAAAATTAGTATTGAAGCTCGTTGTAGGCAACTTGACAAGAAGACTGATAGCAAGGCTGTGGAACAGAATGATAAGCCTTTGTCGGAAACGATACAGAGAGATGCAAATGAAATTAAGTTGGTTTACGATTTATTGAAAGCTAAGGAGGTTAACATGAGGCATAAGATGGTTGGAGATCTGGCAATTATGAAGGAAAAACCAATGGATTTCAAGCAAATGACATCAGAAGATCTGAAAAACCGTATAGGGATTTATCATAAGAAGGCCATGGATGAGCAATGCCGTGCTGATAAATTGTCTTTTGAATTAGAAGAAAATAAAAGGAAAGTTGAGGAATTGCAAAAGAAGTTACGTGAGTTCAAGTCTTGTAGGAAATTAGTTGATGCATCTGTTTCTTCTGAACATGCTACGAGTTCTGAACGTGCAGAAATGAAGCTTTTGAAAAAAAAGCTAAAGTTTGAGAAGGAGCGAGTAGAACATGCAAGACAAGTGGCTAACTTGGAAAAAACTCATCGTTCCATGATTCAACAAGATCTGGGTCGTTTTAAGCTAGAGTTTGTCCAGCTGTCAAACCACTTGGATACTATACATAAACTTGCCTCTACTGGTGCTAAGGATAATCATGACTTGGAAAAGGTTTGGAATGCTGAGAACTTGCACAGGTTGTACTCTAAGAAGCATATACAGGCTATAGAGCCATTCCAGACATGGATGCCTGAAAGTAATTTTAGGCGGATGATCCCACAACACGGTGCTCCATTGCTTTCTTCAGAAGGGAATCATGTCACATCTGTATCAGGTATCGAATCTAGGTTGGAGTCTCTTCCTGGAGGCTCTAACAGAAAAATGTTGCAAAGTTGTGCAGTCAATTCCAGTACAGCATCTTTTTCTGACGGTCAGTTGGTCGGCTCACAGGATAAGGCTGGCCTTTGTTTGACAGCAGCAAAATTGGTTGGAGAGAATTTGAACGTGCAACCAAAAGTATCCAACTTAACTGGTGAACTTAGTAAGATGAAAAGCAATGAAAATCTTGCTATGGTGGCTGAAAATAGTGTCAGAAGTCCTATAAAAGACCATGTTGGAAGAGCTAATGAAAAACAAAAAAAGAGAAAAAGGACCATTGAAACTGTTGAATCCATTGAATATTTGTATCATAAAAGTAAGAAAATGCATTCTCAGATTGAAGAGAAGTTGTCTCTTTTGCATACTTTAAACCGCCCTACTGAGAAGCCCTCAAGAAAGAGTGAAGATGTAATATCAAACCCGCCTCAAGATTCTTCCGCTGATAAGAAATTTCGGAAGAAGACAAATTCTTTGTATCAGAAGAAAATAAAGATGCAAGGTTTACTTGATAATGATGAGATAAAGTTGAATAAAGTTGATACTGAAGTTTGTGTGCCTGAAAGTGTTAGTAGACAACCCTCTCAACCCGTCAGCAAACTTACAGACAGTTGTCAGCCATGCCTAGAGGAACTCGATAATTCTGTCATAAGTGAACTTCAAATCTTGGAAACTTTTGGGAACATAGCAGGTGGGGACTATATGAAATTGCTAGATTTGGATAGTGCTGCTGACGAGAAATGCTACAGGAGAGCAATGGAAATGCCACTTTCCCCTTCACTTCCAGATATTTGTGTTCCTGGCACTGAAACATCTGCTTTGAATGATTTTGAGCCTTTAGTAGACGAGTTCCATAAAAAATTGCCAAAAGAAAAAGGTCAGCCACATTCGCATAGCTATGATGTTATCGATGTTGAGATTAAGTCCAATTATACCCAATCCTGCAACTCTGGCTTGTTAAGAGATATTCATAGCAGTAAACGCCAAGTTGATCCGTGTTTAATGCTAGGGAGCCATGGGAGTGATCTTTGTGATATTGTACAGGCAGAAAAGATCTGTCTGGATAAGGTTGGGGTCATTGTAGAGATGCCTGGGACAGAATTCTCTCTTTCTGGTTGTGAAGGGGTGGAAACGTCAGAAATTAAATCTGGATCCCAGGACAATTTTGTCCCTGATTTTTGTGTTCTTTTCTCTAATGCAGAAGACTGCCACAGCATTTCCAGGATTTTTTCAGCAACTAGGGCTTGTATAAAGAGGATCTCTTTGACTAGTCAAAAAGAGTGGATGGTGCAAGAGATTTTGGCAGCCCTTAACATGGAGCATAGACTTTTACCAAAGGAGAAGACTTGTGTATTCGTTTCCTTGGTGCTGCTCAACTTCACCGTTGTTGCTATGCATAGGTATGGGAACTTTCTGAACTGCAAGACCTGCATGGATTCTTTTTCGGGGCACATAAGTGAAGCAATGCTTAATGTGGAAACAAGAAGCTTGTTTGCTGAATTGTGCTGTTTGGACGAGTTACTTGCCCTCATAGAAGACTTCATAATAGATGGACGGGTCCTGTCATGTACTGATGCCTCTTTCGAGACATTGACTGAAGGTGGTGTGAGGGTTAACATCCCTATTGATGGTGTAGACAGAACATTGTCGTTTGTACCAGCATCAGCAGACTATTTGATTGCAGGGAGTTCCATATTAGCATCAATTTCTAAAGCAGTTGATCGTACTGGCCTTCTTTGGGAGGTGTCATATAGTATTTTAAGGATCTGCAGGTATGAATCATCGCAGATGTTAACTATGCTTCATATTTTTGCACATATTGGTGGAGATCAGTTTTTCAGTTTGGAAGGTTACTCTATTTTGATGGCTGTCTTGAAATCAATAATCATGCACCTTGAGTTGGTTGGAACATCAGATGATGCTACTTTCACCCTACCCAAAAGAAATTGCAGAACAGAGTTTGTTCAATGTGCTAATTGCCCTTTCTCAGAGGAAGTAATGTCAATGCCCATGGTTGTGTCTTTTCTACTGCAATTAATCCTGAAAAATATGTCAAATGGGATTATGGCTGAAGATTTAGAAAATGCTACCAGTTCATCCAATCTGAAATCCTTGTTTGAAAGAAATATTCAGATTCCAAGTGAAAATTCTAGTTGTAAAGAGGTCCATCCAGCACTGTATTTGGACTGTGATGCATCTTGTTGTTTGAAGAAGTATAAGGTGTCTGATGATCAACCATGCTTTTTCTTCAACCCAACATTGTGTGATGTTAGCGACGCGATCTCATTGGTTGAACTGATAGCATGCTACATGAGCTGGAATTGGACATGCGTTAACATTATCTCCCAGCTGCTGGAATTATTGAAGTCATCTGTAAAGGAGAGTCTTGCTATTGTGATTCTTCTAGGTCAACTTGGGAGGTTTGGCGTGGATTCTGGAGGCTTTGAAGATGGAGGAGTTAAAATCTTGAGATCTAATCTATCAGAATTTCTTTGCCTGGACACTACCATTAAATCTGGTCTTTGTGTTCAAATTGCTATTGTTTCTGCCCTGTTAGGCCTTCTCCCTTTTGATTTTGAAACAATCGTTCAAGATAAAGGAAGCTATCCACCCTCTTTGAGTCAATATGTTGAGGTTAACTTGATAAAGACGTGGTATTCTTTATTGAGCTCGAAACAGAAAGAGTTGTCATGTAACATTTTACAAATTTCTGTTGCGATGTAAGCTGAAATTTGATTTCCTTTCTTGACGCAAGCTTAACCCAACTGACGTTGCAACCTGTTTTGGAGGCGTGGACAAGTAAGAGTTATCACAACCAATTTTTTAGGATTCAACTACCTGTTGATAGTTAGGAAATTGAGGATTTGTGCATACCTGTAGCTTAGTATATTTTTCTTACATTTTAAAGTGGAGGATACAATACAATATAGATTTTGTACTGTTTTCACAGGTGCAATATCTGAGTAAAGTTTTGTAAATGATCCCAGTAATTTCCAGTTTCTTCCCTTCAACCATATACATATTGATTCGCCAGATTTTCCTTGGTTTTGCAGTGTCCTGCTGACTTTAACGACAAAGTTGATTTGTACATCTGTGGTCTTCTGTCGGATTTGTGGTTATATACTCTTACTATTCAAATAACAAAGAAACAAGGAGTTCAAGAGTGCCTTAATTGTGATTAATTTGTGTCTACAGTTATTATTACTGTCACCGACAAATCATAGTGCAGAAGTGCAATGTGTTGTCGCTTTCTGATCAGCCCAGTTTGCACGTTGTTCTTGGAATTTGTGAACAGTAGCACTTATGGAGGGGACACAAATGACCAGCCATCTAGATCTTCCTTCGGTTACCAGCCTAAAGCTTGCAGACTGGCCCCATCTATTGGCCGGCCTCAGAAGAGAAGAGTTGAGTGCGTGGGGGAAAGGCTTGGAATTACGAGCAGATAGGATGTAGGGAAAGTTGAAGATGACTCATTAATCTTTTAAGTGGAGACTGTTGGGTTTGTAGAGCTAAATCACTCTAAAGGAGTCCAAAGTTATCATGGGGCACTGCAGCGTAAATCAGCATATCTAAGAAAACATTAAGTCATACAATTGTAGCATTCCATGTTGGGACAGTGCCAAATTGATGTCGTGTCATGATGCTCGTCTCTTAAAAGTTCTTTTAAATGATGCAGTGCCTGCCAAAGGTGGAATGCTCTGTTCATGCCTGAAAAAATTCTCAGGATCAACTTTTGTCTTTACCTTCACAAGCCTATTGAAGTTGTCCTTGAAATATTTATTCCCCCACTCAGTGGCCTGAACCAAGCTTGTATTGTCCTTCTTGTTCATACCCAAATCAAGATCTCTGTAGTTGACATATGCTGCTCTTGGAGATGTGGACACATATGGAGCCATGTAATTATAAAGTTCTCTGATCCACTCTATGTGTTTTGCAGCATCTGTATCTCCACCCTGCCATGTGCTTAAATACTGAATTTTGAATAAGTTTCCTTTTCTGTGTGGGAATGGGATCTCATTCTCTGAGATCTTGCTCATCATTCCTCCATAAGGGTTCCAGATCGTTAATGGCTTGTCTTCTACTAGCAGCCTTTTCCACAGACCTTCCAGTCCAGCTTCTGGGATTGGGACTTGAACAAAATCTGATTTAGCTTTGAAGTAGTTCTTGAACTGTGGCTTTCCTTGCAGAAGAACTTCAGGAGGTGTTCCGGGGGGTTCTCCAGCTATATAAAGAACAGATTTAATCCAGCTGGTTTCAATACAATCTTTTGGTGTCAAACCCAATTCAGGGAAGCTTTTTCCCATAACTTGGAGGAGCCTGTTGGAATCTCCAAGAAAAAGTGCATTGACTTGAGTGGCACCTTGCTCCAAGGTCTTGGGAACTGTGAAGACTGTCACAGTTTCTGGAACAGGAACCAACTGCAACTTCCACCACAGAATAATCCCAAAGCTGCCTCCAGCTCCCCCTCTTATGGCCCAAAAATGGTCCTCTCCCATGGCCGCTCGGTCAAGAATTCTGCCATTAATGTCGACGATCCGAGCATCAATGACATTATCAGCACCGAGGCCATATTTTCTCATCATAGAGCCATATGCACCGCCTGTTATGTGCCCGCCAATGCCTAAGCTAGTGCAAAGGCCAGCAGGGAAGCCATGGACTTCGCTTTTCTCTGAGATTCTGTAGTAAACTTCACCAACCGTAGCACCAGCCTGAACCCAAGCACTGTTATCTTCAATATTCACCTCGACCGACCGGAGTTTGGTGAGGTCTACAACAATGAAAGGGGTTTCGATTTCCGAAGCATAAGAGAGACCCTCGTAGTCATGGCCACCGCTACGCACTCTGAGATGTATTCGGAGCTTCTTCGAGCAAATTACAGCTGCTTGGACATGGCTGTCAAGGAGTGGAATGAAGATAAACTCCGGCTTCGGAACTGAAGCATCCAAGTATCTGAGGTTTTGCGCAGTGGACTGAAGAAGAGTGGTGAATGAAGCATTGTTTGGATCACAAACCGAGAAGGGAGGGGCAGAAAGTTGGGAATTGACAGAGAGACATTGGAAAAAAGCTTGCTCCAATGGAGCCGAGTTTGAAAGTGAAAATGAGAGAAGAAACGCAAGCACTAAATGGAAAATGGGAGAGCCTGAAGAAGGCAGCATAATTTGGTATATTTTATTTTTTTTTCTATAAAGCAGTGAAGTTTGGGGCCGAGGATTGAGATGGTCTAAAGAACTGAACTGCTATATATAGCTGTGAATGTGATGAGTTGAGAATGGTAAACGTGTACCTGAGCTAAGGCAAACGTCATGAAAATTTTGAAATCAATTTATCAGACGGCCTCATTTCAATTGTGAGATTACTCTCTAGTTTTACTCTCTGCATGGATACGGACACGAAACACAGATATGATATAACACAGATATTTTGACGTATTTTGTACTTAAAATTTATTATTACCTTATATTATTACTTATGTCTATTGTTTCTTATGCTGAGAAAGAAGTGTTTGATGTGTGTTTAACAAGTGTTTGATATGCGTCCAAAGTGTTGGGTGTGTTGAAGTGTTCGACACGACTTTTTAAACTCAAATTGTAATGTCTATGGTTCTTAGCTCATCCTCCTCTTTTTT
Coding sequence (CDS)
ATGGCGGAGGATGCTGTGCCGAAGCCTGAATCGTCTAATTCCTGCTGTAAAGCGTGGAAGGATAAGTACTCAAAGCTAGAAGAGAAAAGAAATGCTTTACGGCAAGCAGTCAAGCTACTTGAGCAGCAAATCAATAAGATTCAGGCCGAGAATCTCAATCTTAAAAAAGGTTATGAGGAGGAGAAGGCTCGAGCTTCAATTGAGAGAGAGGGCAAAGACAAAGAATCTACTATTAGAGTGTCTTTAGAGCGGGAAATTTTGGACTTGAAATCTCAAATTTCTTCTTTGGAACAAAATAATGTAAATGCTGTTAATGATCATGGGGAAGTGGAGCATCTTAATGCTCTTGTTGCTGAGTGTAAGAAGGAAATTAGCCAACTAAAAGAACTTCTCGAGATAGAGAAGAGAAGGACAGATGCTGAAAAGAAAAATGCTGAAGTGAGGAAGGAGGAGGCTGCTCAAGCTTGGAAAACAGGGAAGGTTGAAAGGAGTAAGACTAGTGACTCTAGGAAGTTTCACAAAACTGAAATGGATAAGGCTAATGAAGGCAGACAACAGCTAGGGATGTTGAAAAAGGAATGTGAAGAAACAAAGTTAAAGTTGGCTAGCGAAACATCTAAATTAATGGAACTTATAAAAGAGTTGGAGGTAGAAAAGCAAAATACTTTCAAAGAGAAAGAGCGTGCAGATTCTGAAATGTCTAAAGCAGAGGATTCAAGGATGCGAGCTGAAGCCATCAGGAAGCAGGCTGAGGGAGAAAAATCTAGGGCTGAAAAGTTATTGAAGCAATTGGGAAGAAAGAATAGCAAGATTGAGGAATTGCAGAAGCAGGTCAAAGAACTTCAGGCCTCGAAAATTAGTATTGAAGCTCGTTGTAGGCAACTTGACAAGAAGACTGATAGCAAGGCTGTGGAACAGAATGATAAGCCTTTGTCGGAAACGATACAGAGAGATGCAAATGAAATTAAGTTGGTTTACGATTTATTGAAAGCTAAGGAGGTTAACATGAGGCATAAGATGGTTGGAGATCTGGCAATTATGAAGGAAAAACCAATGGATTTCAAGCAAATGACATCAGAAGATCTGAAAAACCGTATAGGGATTTATCATAAGAAGGCCATGGATGAGCAATGCCGTGCTGATAAATTGTCTTTTGAATTAGAAGAAAATAAAAGGAAAGTTGAGGAATTGCAAAAGAAGTTACGTGAGTTCAAGTCTTGTAGGAAATTAGTTGATGCATCTGTTTCTTCTGAACATGCTACGAGTTCTGAACGTGCAGAAATGAAGCTTTTGAAAAAAAAGCTAAAGTTTGAGAAGGAGCGAGTAGAACATGCAAGACAAGTGGCTAACTTGGAAAAAACTCATCGTTCCATGATTCAACAAGATCTGGGTCGTTTTAAGCTAGAGTTTGTCCAGCTGTCAAACCACTTGGATACTATACATAAACTTGCCTCTACTGGTGCTAAGGATAATCATGACTTGGAAAAGGTTTGGAATGCTGAGAACTTGCACAGGTTGTACTCTAAGAAGCATATACAGGCTATAGAGCCATTCCAGACATGGATGCCTGAAAGTAATTTTAGGCGGATGATCCCACAACACGGTGCTCCATTGCTTTCTTCAGAAGGGAATCATGTCACATCTGTATCAGGTATCGAATCTAGGTTGGAGTCTCTTCCTGGAGGCTCTAACAGAAAAATGTTGCAAAGTTGTGCAGTCAATTCCAGTACAGCATCTTTTTCTGACGGTCAGTTGGTCGGCTCACAGGATAAGGCTGGCCTTTGTTTGACAGCAGCAAAATTGGTTGGAGAGAATTTGAACGTGCAACCAAAAGTATCCAACTTAACTGGTGAACTTAGTAAGATGAAAAGCAATGAAAATCTTGCTATGGTGGCTGAAAATAGTGTCAGAAGTCCTATAAAAGACCATGTTGGAAGAGCTAATGAAAAACAAAAAAAGAGAAAAAGGACCATTGAAACTGTTGAATCCATTGAATATTTGTATCATAAAAGTAAGAAAATGCATTCTCAGATTGAAGAGAAGTTGTCTCTTTTGCATACTTTAAACCGCCCTACTGAGAAGCCCTCAAGAAAGAGTGAAGATGTAATATCAAACCCGCCTCAAGATTCTTCCGCTGATAAGAAATTTCGGAAGAAGACAAATTCTTTGTATCAGAAGAAAATAAAGATGCAAGGTTTACTTGATAATGATGAGATAAAGTTGAATAAAGTTGATACTGAAGTTTGTGTGCCTGAAAGTGTTAGTAGACAACCCTCTCAACCCGTCAGCAAACTTACAGACAGTTGTCAGCCATGCCTAGAGGAACTCGATAATTCTGTCATAAGTGAACTTCAAATCTTGGAAACTTTTGGGAACATAGCAGGTGGGGACTATATGAAATTGCTAGATTTGGATAGTGCTGCTGACGAGAAATGCTACAGGAGAGCAATGGAAATGCCACTTTCCCCTTCACTTCCAGATATTTGTGTTCCTGGCACTGAAACATCTGCTTTGAATGATTTTGAGCCTTTAGTAGACGAGTTCCATAAAAAATTGCCAAAAGAAAAAGGTCAGCCACATTCGCATAGCTATGATGTTATCGATGTTGAGATTAAGTCCAATTATACCCAATCCTGCAACTCTGGCTTGTTAAGAGATATTCATAGCAGTAAACGCCAAGTTGATCCGTGTTTAATGCTAGGGAGCCATGGGAGTGATCTTTGTGATATTGTACAGGCAGAAAAGATCTGTCTGGATAAGGTTGGGGTCATTGTAGAGATGCCTGGGACAGAATTCTCTCTTTCTGGTTGTGAAGGGGTGGAAACGTCAGAAATTAAATCTGGATCCCAGGACAATTTTGTCCCTGATTTTTGTGTTCTTTTCTCTAATGCAGAAGACTGCCACAGCATTTCCAGGATTTTTTCAGCAACTAGGGCTTGTATAAAGAGGATCTCTTTGACTAGTCAAAAAGAGTGGATGGTGCAAGAGATTTTGGCAGCCCTTAACATGGAGCATAGACTTTTACCAAAGGAGAAGACTTGTGTATTCGTTTCCTTGGTGCTGCTCAACTTCACCGTTGTTGCTATGCATAGGTATGGGAACTTTCTGAACTGCAAGACCTGCATGGATTCTTTTTCGGGGCACATAAGTGAAGCAATGCTTAATGTGGAAACAAGAAGCTTGTTTGCTGAATTGTGCTGTTTGGACGAGTTACTTGCCCTCATAGAAGACTTCATAATAGATGGACGGGTCCTGTCATGTACTGATGCCTCTTTCGAGACATTGACTGAAGGTGGTGTGAGGGTTAACATCCCTATTGATGGTGTAGACAGAACATTGTCGTTTGTACCAGCATCAGCAGACTATTTGATTGCAGGGAGTTCCATATTAGCATCAATTTCTAAAGCAGTTGATCGTACTGGCCTTCTTTGGGAGGTGTCATATAGTATTTTAAGGATCTGCAGGTATGAATCATCGCAGATGTTAACTATGCTTCATATTTTTGCACATATTGGTGGAGATCAGTTTTTCAGTTTGGAAGGTTACTCTATTTTGATGGCTGTCTTGAAATCAATAATCATGCACCTTGAGTTGGTTGGAACATCAGATGATGCTACTTTCACCCTACCCAAAAGAAATTGCAGAACAGAGTTTGTTCAATGTGCTAATTGCCCTTTCTCAGAGGAAGTAATGTCAATGCCCATGGTTGTGTCTTTTCTACTGCAATTAATCCTGAAAAATATGTCAAATGGGATTATGGCTGAAGATTTAGAAAATGCTACCAGTTCATCCAATCTGAAATCCTTGTTTGAAAGAAATATTCAGATTCCAAGTGAAAATTCTAGTTGTAAAGAGGTCCATCCAGCACTGTATTTGGACTGTGATGCATCTTGTTGTTTGAAGAAGTATAAGGTGTCTGATGATCAACCATGCTTTTTCTTCAACCCAACATTGTGTGATGTTAGCGACGCGATCTCATTGGTTGAACTGATAGCATGCTACATGAGCTGGAATTGGACATGCGTTAACATTATCTCCCAGCTGCTGGAATTATTGAAGTCATCTGTAAAGGAGAGTCTTGCTATTGTGATTCTTCTAGGTCAACTTGGGAGGTTTGGCGTGGATTCTGGAGGCTTTGAAGATGGAGGAGTTAAAATCTTGAGATCTAATCTATCAGAATTTCTTTGCCTGGACACTACCATTAAATCTGGTCTTTGTGTTCAAATTGCTATTGTTTCTGCCCTGTTAGGCCTTCTCCCTTTTGATTTTGAAACAATCGTTCAAGATAAAGGAAGCTATCCACCCTCTTTGAGTCAATATGTTGAGGTTAACTTGATAAAGACGTGGTATTCTTTATTGAGCTCGAAACAGAAAGAGTTGTCATGTAACATTTTACAAATTTCTGTTGCGATGTAA
Protein sequence
MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEEEKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAECKKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKANEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSRMRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTDSKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSEDLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDASVSSEHATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDTIHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLLSSEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTAAKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTIETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRKKTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDNSVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSALNDFEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCLMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFCVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVSLVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDFIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVDRTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLELVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENATSSSNLKSLFERNIQIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDSGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPSLSQYVEVNLIKTWYSLLSSKQKELSCNILQISVAM
Homology
BLAST of MC03g0609 vs. NCBI nr
Match:
XP_022140057.1 (uncharacterized protein LOC111010806 [Momordica charantia] >XP_022140058.1 uncharacterized protein LOC111010806 [Momordica charantia] >XP_022140059.1 uncharacterized protein LOC111010806 [Momordica charantia] >XP_022140060.1 uncharacterized protein LOC111010806 [Momordica charantia])
HSP 1 Score: 2793 bits (7240), Expect = 0.0
Identity = 1469/1471 (99.86%), Postives = 1469/1471 (99.86%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE
Sbjct: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC
Sbjct: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA
Sbjct: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR
Sbjct: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD
Sbjct: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE
Sbjct: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDASVSSEHA 420
DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDASVSSEHA
Sbjct: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDASVSSEHA 420
Query: 421 TSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDTI 480
TSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDTI
Sbjct: 421 TSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDTI 480
Query: 481 HKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLLS 540
HKLASTGAKDNHDLEK NAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLLS
Sbjct: 481 HKLASTGAKDNHDLEKTRNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLLS 540
Query: 541 SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTAAK 600
SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTAAK
Sbjct: 541 SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTAAK 600
Query: 601 LVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTIET 660
LVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTIET
Sbjct: 601 LVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTIET 660
Query: 661 VESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRKKT 720
VESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRKKT
Sbjct: 661 VESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRKKT 720
Query: 721 NSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDNSV 780
NSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDNSV
Sbjct: 721 NSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDNSV 780
Query: 781 ISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSALND 840
ISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSALND
Sbjct: 781 ISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSALND 840
Query: 841 FEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCLML 900
FEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCLML
Sbjct: 841 FEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCLML 900
Query: 901 GSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFCVL 960
GSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFCVL
Sbjct: 901 GSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFCVL 960
Query: 961 FSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVSLV 1020
FSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVSLV
Sbjct: 961 FSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVSLV 1020
Query: 1021 LLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDFII 1080
LLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDFII
Sbjct: 1021 LLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDFII 1080
Query: 1081 DGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVDRT 1140
DGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVDRT
Sbjct: 1081 DGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVDRT 1140
Query: 1141 GLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLELVG 1200
GLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLELVG
Sbjct: 1141 GLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLELVG 1200
Query: 1201 TSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENA 1260
TSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENA
Sbjct: 1201 TSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENA 1260
Query: 1261 TSSSNLKSLFERNIQIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCD 1320
TSSSNLKSLFERNIQIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCD
Sbjct: 1261 TSSSNLKSLFERNIQIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCD 1320
Query: 1321 VSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDSGGFE 1380
VSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDSGGFE
Sbjct: 1321 VSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDSGGFE 1380
Query: 1381 DGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPSLSQY 1440
DGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPSLSQY
Sbjct: 1381 DGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPSLSQY 1440
Query: 1441 VEVNLIKTWYSLLSSKQKELSCNILQISVAM 1471
VEVNLIKTWYSLLSSKQKELSCNILQISVAM
Sbjct: 1441 VEVNLIKTWYSLLSSKQKELSCNILQISVAM 1471
BLAST of MC03g0609 vs. NCBI nr
Match:
XP_023519446.1 (uncharacterized protein LOC111782859 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023519447.1 uncharacterized protein LOC111782859 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023519448.1 uncharacterized protein LOC111782859 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023519449.1 uncharacterized protein LOC111782859 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023519450.1 uncharacterized protein LOC111782859 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023519451.1 uncharacterized protein LOC111782859 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2055 bits (5324), Expect = 0.0
Identity = 1113/1473 (75.56%), Postives = 1247/1473 (84.66%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M D V KPESSNSCCK WKD Y+KLEEKR ALRQAVKLLE+QI KIQAENLNLK+GYE+
Sbjct: 1 MVGDVVSKPESSNSCCKVWKDMYTKLEEKRIALRQAVKLLEEQIRKIQAENLNLKEGYEK 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARAS+ERE KDKES IRVSLEREI DLKSQISSL QN+V AVN GEV+HLN LVAE
Sbjct: 61 EKARASVERESKDKESAIRVSLEREISDLKSQISSLRQNDVGAVNVRGEVDHLNVLVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KK+ISQLKELLE EKRRTDAE+KNAE RKEEAAQA KT K+ERSK SD +K HKTEMDK
Sbjct: 121 KKKISQLKELLETEKRRTDAERKNAEARKEEAAQALKTMKIERSKASDLKKLHKTEMDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NE RQQLG+L+KE EET LKLASETSKL E++K+LE+EKQ TFKEK+RADSEMSKA+ R
Sbjct: 181 NECRQQLGVLEKEYEETNLKLASETSKLTEVMKDLEIEKQRTFKEKKRADSEMSKAQALR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
M+ E KQ EKSRAE L +Q RK KI++LQKQVKEL+ K IE+ C Q K+T+
Sbjct: 241 MQTEVTMKQVGEEKSRAENLFQQSERKTCKIKKLQKQVKELKTLKKFIESCCGQPVKRTN 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SK V++NDKP E IQR+ NE+KL ++ +KAKEVN++HKM DLAIMKEK + M S
Sbjct: 301 SKDVKKNDKPWLEMIQRNENELKLAFEFVKAKEVNIKHKMDEDLAIMKEKTVSSNMMKSS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LKN + IY +KAMDEQCRADKLS ELEE RK+EELQK LREFKS RKL DAS VS EH
Sbjct: 361 ELKNHLEIYRRKAMDEQCRADKLSLELEEKNRKIEELQKNLREFKSSRKLADASTVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HARQVANLEK HRS+IQQ+LGRFKLEFVQLSNHLD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHARQVANLEKNHRSVIQQELGRFKLEFVQLSNHLDD 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK +STG KDN D EK NAE L YSKK+++AIE FQ WMP++ FR+ P HGAPLL
Sbjct: 481 LHKFSSTGTKDNDDSEKTMNAEKLQSSYSKKNLRAIEAFQAWMPDT-FRQATPHHGAPLL 540
Query: 541 -SSEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
SS GNH+TS+SGIESRLES PG SNRKMLQSCAVNSSTASFSDGQLVGSQ+KAGL LTA
Sbjct: 541 PSSVGNHITSLSGIESRLESFPGDSNRKMLQSCAVNSSTASFSDGQLVGSQEKAGLRLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
KL GEN N+QP++SNL+ E+ KMKSNENLAM+A NSVRS IK++VGRANEKQ KRKRTI
Sbjct: 601 TKLAGENFNMQPRISNLSSEVGKMKSNENLAMMAGNSVRSHIKNNVGRANEKQGKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
ETVESI+YLYH+SKKMHSQIEEKLSLLH LN P EKP KSE VISN QDS ADKK RK
Sbjct: 661 ETVESIDYLYHESKKMHSQIEEKLSLLHALNSPAEKPLDKSEHVISNVLQDSCADKKIRK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L QKK+K+Q LLDN E+KLNKVDTEVC P+S+ +PSQPVSKL D+CQPC+EEL+
Sbjct: 721 KRKALCQKKLKVQHLLDNSEMKLNKVDTEVCAPKSIGIKPSQPVSKLMDNCQPCVEELNT 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
VISELQ LETFGNIA DYMKLLDLDSAADE+CYRRA+EMPLSPSLP+I + G ETSAL
Sbjct: 781 RVISELQSLETFGNIANVDYMKLLDLDSAADEECYRRAIEMPLSPSLPNIYISGAETSAL 840
Query: 841 NDFEPLVDEFHKKLPKEK-GQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPC 900
N+FEPLVDE HK+LP E+ GQP +HSY+VIDVEIKSNYTQSC LL DIHSSKRQ+DPC
Sbjct: 841 NEFEPLVDELHKELPDEREGQPKTHSYNVIDVEIKSNYTQSCEFDLLGDIHSSKRQLDPC 900
Query: 901 LMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDF 960
L+ G +DL D+VQA CLD+VGVIV MPGT SLSGCEGV SEIKSG+ N PDF
Sbjct: 901 LIQGRQENDLFDVVQAGNNCLDQVGVIVGMPGTNVSLSGCEGVGASEIKSGTLGNSNPDF 960
Query: 961 CVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFV 1020
CVLFSN++DCHSI +IFSATRAC+KR S+ +QKEWMVQEILA+LNMEH L+PKEKTCVF
Sbjct: 961 CVLFSNSKDCHSILKIFSATRACVKRSSIITQKEWMVQEILASLNMEHELVPKEKTCVFF 1020
Query: 1021 SLVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIED 1080
SL+LLNFTVVA+H+YGNFLNC TC+DSFSGHI EAML+V RSLF +L CLD LLAL+ED
Sbjct: 1021 SLLLLNFTVVAVHKYGNFLNCHTCLDSFSGHICEAMLDVAIRSLFTKLLCLDALLALMED 1080
Query: 1081 FIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAV 1140
F+IDGRVLSCTDASFETLT+G +RVNIPID V+RTLS PAS DYLIAGSSILASISKAV
Sbjct: 1081 FLIDGRVLSCTDASFETLTQGVLRVNIPIDSVNRTLSLTPASTDYLIAGSSILASISKAV 1140
Query: 1141 DRTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLE 1200
RTGLLWE+SY ILR CRYESS MLT+LHIFAHIGGD FFSLE YS L AVLKSII HLE
Sbjct: 1141 HRTGLLWEISYRILRSCRYESSLMLTILHIFAHIGGDHFFSLEVYSNLRAVLKSIITHLE 1200
Query: 1201 LVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDL 1260
VG+S+DATFT KRNCR EFVQCANCPFSEE MSMPMVVSFLL+L+ KN+SN IM EDL
Sbjct: 1201 TVGSSNDATFTPLKRNCRAEFVQCANCPFSEEGMSMPMVVSFLLRLLQKNISNEIMDEDL 1260
Query: 1261 ENATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFN 1320
EN TSS NL+SLF+RN+ QIP +NSS KEVHP++YLDCDASCCLKK+KVSDD+P F FN
Sbjct: 1261 ENPTSSLNLESLFKRNLANQIPCKNSSGKEVHPSVYLDCDASCCLKKFKVSDDEPRFLFN 1320
Query: 1321 PTLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVD 1380
PTLCDV+DAISLVEL+A YM WNWT NII QL+ELLKSSVK+ AIVILLGQLGRFGVD
Sbjct: 1321 PTLCDVTDAISLVELLAWYMGWNWTFANIIPQLMELLKSSVKKGFAIVILLGQLGRFGVD 1380
Query: 1381 SGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPP 1440
+GGFEDGGVKILRSNLS FLCLDTTIKSGL VQIA VS+LLGLLPFDFETIVQDK Y
Sbjct: 1381 AGGFEDGGVKILRSNLSSFLCLDTTIKSGLPVQIATVSSLLGLLPFDFETIVQDKVRYRA 1440
Query: 1441 SLSQYVEVNLIKTWYSLLSSKQKELSCNILQIS 1468
S QY EVNLIK W+SLLS KQKELSCNILQ++
Sbjct: 1441 SSYQYAEVNLIKMWFSLLSPKQKELSCNILQVA 1472
BLAST of MC03g0609 vs. NCBI nr
Match:
KAG7019455.1 (hypothetical protein SDJN02_18416 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2047 bits (5304), Expect = 0.0
Identity = 1112/1473 (75.49%), Postives = 1250/1473 (84.86%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M D V KPESSNSCCK WKD Y+KLEEKR ALRQAVKLLE+QI KIQAENLNLK+GYE+
Sbjct: 1 MVGDVVSKPESSNSCCKVWKDMYTKLEEKRIALRQAVKLLEEQIRKIQAENLNLKEGYEK 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARASIERE KDKES IRVSLEREI DLKSQISSL QN+V AVN GEV+HLN LVAE
Sbjct: 61 EKARASIERESKDKESAIRVSLEREISDLKSQISSLRQNDVEAVNVRGEVDHLNVLVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KK+ISQLKELLE EKRRTDAE+KNAE RKEEAAQA KT K+ERSK SD +K HKTEMDK
Sbjct: 121 KKKISQLKELLETEKRRTDAERKNAEARKEEAAQALKTMKIERSKASDLKKLHKTEMDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NE RQQLGML+KE EETKLKLASETSKL E++K+LE+EKQ TFKEK+RADSEMSKA+ SR
Sbjct: 181 NECRQQLGMLEKEYEETKLKLASETSKLTEVMKDLEIEKQRTFKEKKRADSEMSKAQASR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
M+ E KQ EKSRAE L +QL RK KI++L+KQVKEL+ K IE+ C Q K+T+
Sbjct: 241 MQTEVTVKQVGEEKSRAENLFQQLERKTCKIKKLRKQVKELKTLKKFIESCCGQPVKRTN 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SK V++NDKP E IQR+ NE+KL ++ +KAKEVN+ +KM DLAIMKEK ++ M +
Sbjct: 301 SKDVKKNDKPWLEMIQRNENELKLAFECVKAKEVNINYKMDEDLAIMKEKTVNSNMMKAS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LKN + IY +KAMDEQCRADKLS ELEE RK+EELQK LREFKS RKL DAS VS EH
Sbjct: 361 ELKNHLEIYRRKAMDEQCRADKLSLELEEKNRKIEELQKNLREFKSSRKLADASAVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HARQVANLEK HRS+IQQ+LGRFKLEFVQLSNHLD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHARQVANLEKNHRSVIQQELGRFKLEFVQLSNHLDD 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK +STG KDN D EK NAE L YSKK+++AIE FQ WMP+ NFR+ P HGAPLL
Sbjct: 481 LHKFSSTGTKDNDDSEKTMNAEKLQSSYSKKNLRAIEAFQAWMPD-NFRQATPHHGAPLL 540
Query: 541 -SSEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
SS GNH+TS+SGIESRLES PG SNRKMLQSCAVNSSTASFSDGQLVGSQ+KAGL LTA
Sbjct: 541 PSSVGNHITSLSGIESRLESFPGDSNRKMLQSCAVNSSTASFSDGQLVGSQEKAGLRLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
KL GEN N+QP++SNL+ E+SKMKSNENLAM+A NSVRS IK+ VGRANEKQ KRKRTI
Sbjct: 601 TKLAGENFNMQPRISNLSSEVSKMKSNENLAMMAGNSVRSHIKNSVGRANEKQGKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
ETVESI+YLYH+SKKMHSQIEEKLSLLH LN PTEK KSE VISN QDS ADKK RK
Sbjct: 661 ETVESIDYLYHESKKMHSQIEEKLSLLHALNSPTEKALDKSEHVISNVLQDSCADKKIRK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L QKK+K+Q LLDN E+KLNKVDTEVC P+S+ +PSQPVSKL D+CQPC+EEL+
Sbjct: 721 KRKALCQKKLKVQHLLDNSEMKLNKVDTEVCAPKSIGIKPSQPVSKLMDNCQPCVEELNT 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
VISELQ LETFGNIA DYMKLLDLDSAADE+CYRRA+EMPLSPSLP+I + G ETSAL
Sbjct: 781 HVISELQSLETFGNIANVDYMKLLDLDSAADEECYRRAIEMPLSPSLPNIYISGAETSAL 840
Query: 841 NDFEPLVDEFHKKLPKEK-GQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPC 900
N+FEPLVDE HK+LP E+ GQP +HSY+VIDVEIKSNYTQSC+ LL DIHSSK Q+DPC
Sbjct: 841 NEFEPLVDELHKELPDEREGQPKTHSYNVIDVEIKSNYTQSCDFDLLADIHSSKCQLDPC 900
Query: 901 LMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDF 960
L+ G +DL D+VQA CLD+VGVIV MPGT SLSGCE V SEIKSG+ N PDF
Sbjct: 901 LIQGRQENDLFDVVQAGNNCLDQVGVIVGMPGTNVSLSGCEEVGASEIKSGTLGNSNPDF 960
Query: 961 CVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFV 1020
CVLFSN++DCHSI +IFSATRAC+KR S+ +QKEWMVQEILA+LNMEH L+PKEKTCVF
Sbjct: 961 CVLFSNSKDCHSILKIFSATRACVKRSSIITQKEWMVQEILASLNMEHELVPKEKTCVFF 1020
Query: 1021 SLVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIED 1080
SL+LLNFTVVA+H+YGNFLNC TC+DSFSGHI EAML+V RSLF +L CLD LLAL+ED
Sbjct: 1021 SLLLLNFTVVAVHKYGNFLNCHTCLDSFSGHICEAMLDVAIRSLFTKLLCLDALLALMED 1080
Query: 1081 FIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAV 1140
F+IDGRVLS TDASFETLT+G +RVNIPID V+RTLS PAS DYLIAGSSILASISKAV
Sbjct: 1081 FLIDGRVLSFTDASFETLTQGVLRVNIPIDSVNRTLSLTPASTDYLIAGSSILASISKAV 1140
Query: 1141 DRTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLE 1200
RTG+LWE+SY ILR CRYESS MLT+LHIFAHIGGDQFFSLE YS L AVLKSII HLE
Sbjct: 1141 HRTGILWEISYRILRSCRYESSLMLTILHIFAHIGGDQFFSLEVYSNLRAVLKSIITHLE 1200
Query: 1201 LVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDL 1260
VG+S+DATFT KRNCR EFVQCANCPFSEE M MPMVVSFLL+L+ KN+SN IM EDL
Sbjct: 1201 TVGSSNDATFTPLKRNCRAEFVQCANCPFSEEGMPMPMVVSFLLRLLQKNISNEIMDEDL 1260
Query: 1261 ENATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFN 1320
EN+TSS NL+SLF+RN+ QIP +NSS KEVHP++YLDCDASCCLKK+KVSDD+P F FN
Sbjct: 1261 ENSTSSLNLESLFKRNLANQIPCKNSSGKEVHPSVYLDCDASCCLKKFKVSDDEPRFLFN 1320
Query: 1321 PTLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVD 1380
PTLCDV+DAISLVEL+A YM WNWT NII QL+ELLKSSVK+ AIVILLGQLGRFGV
Sbjct: 1321 PTLCDVTDAISLVELLAWYMGWNWTFANIIPQLMELLKSSVKKGFAIVILLGQLGRFGVV 1380
Query: 1381 SGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPP 1440
+GGF+DGGVKILRSNLS FLCLDTTIKSGL VQIA VS+LLGLLPFDFETIVQDK Y
Sbjct: 1381 AGGFDDGGVKILRSNLSSFLCLDTTIKSGLPVQIATVSSLLGLLPFDFETIVQDKVRYRA 1440
Query: 1441 SLSQYVEVNLIKTWYSLLSSKQKELSCNILQIS 1468
S +QY EVNLIKTW+SLLS KQKELSCNILQ++
Sbjct: 1441 SPNQYAEVNLIKTWFSLLSPKQKELSCNILQVA 1472
BLAST of MC03g0609 vs. NCBI nr
Match:
XP_022927023.1 (myosin heavy chain, non-muscle-like [Cucurbita moschata] >XP_022927024.1 myosin heavy chain, non-muscle-like [Cucurbita moschata] >XP_022927025.1 myosin heavy chain, non-muscle-like [Cucurbita moschata] >XP_022927026.1 myosin heavy chain, non-muscle-like [Cucurbita moschata] >XP_022927027.1 myosin heavy chain, non-muscle-like [Cucurbita moschata])
HSP 1 Score: 2045 bits (5298), Expect = 0.0
Identity = 1111/1473 (75.42%), Postives = 1249/1473 (84.79%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M D V KPESSNSCCK WKD Y+KLEEKR ALRQAVKLLE+QI KIQAENLNLK+GYE+
Sbjct: 1 MVGDVVSKPESSNSCCKVWKDMYTKLEEKRIALRQAVKLLEEQIRKIQAENLNLKEGYEK 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARASIERE KDKES IRVSLEREI DLKSQISSL QN+V AVN GEV+HLN LVAE
Sbjct: 61 EKARASIERESKDKESAIRVSLEREISDLKSQISSLRQNDVEAVNVRGEVDHLNVLVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KK+ISQLKELLE EKRRTDAE+KNAE RKEEAAQA KT K+ERSK SD +K HKTEMDK
Sbjct: 121 KKKISQLKELLETEKRRTDAERKNAEARKEEAAQALKTMKIERSKASDLKKLHKTEMDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NE RQQLGML+KE EETKLKLASETSKL E++K+LE+EKQ TFKEK+RADSEMSKA+ SR
Sbjct: 181 NECRQQLGMLEKEYEETKLKLASETSKLTEVMKDLEIEKQRTFKEKKRADSEMSKAQASR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
M+ E KQ EKSRAE L +QL RK KI++L+KQVKEL+ K IE+ C Q K+T+
Sbjct: 241 MQTEVTVKQVGEEKSRAENLFQQLERKTCKIKKLRKQVKELKTLKKFIESCCGQPVKRTN 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SK V++NDKP E IQR+ NE+KL ++ +KAKEVN+ +KM DLAIMKEK ++ M +
Sbjct: 301 SKDVKKNDKPWLEMIQRNENELKLAFECVKAKEVNINYKMDEDLAIMKEKTVNSNMMKAS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LKN + IY +KAMDEQCRADKLS ELEE RK+EELQK LREFKS RKL DAS VS EH
Sbjct: 361 ELKNHLEIYRRKAMDEQCRADKLSLELEEKNRKIEELQKNLREFKSSRKLADASAVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HARQVANLEK HRS+IQQ+LGRFKLEFVQLSNHLD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHARQVANLEKNHRSVIQQELGRFKLEFVQLSNHLDD 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK +STG KDN D EK NAE L YSKK+++AIE FQ WMP+ NFR+ P HGAPLL
Sbjct: 481 LHKFSSTGTKDNDDSEKTMNAEKLQSSYSKKNLRAIEAFQAWMPD-NFRQATPHHGAPLL 540
Query: 541 -SSEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
SS GNH+TS+SGIESRLES PG SNRKMLQSCAVNSSTASFSDGQLVGSQ+KAGL LTA
Sbjct: 541 PSSVGNHITSLSGIESRLESFPGDSNRKMLQSCAVNSSTASFSDGQLVGSQEKAGLRLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
KL GEN N+QP++SNL+ E+SKMKSNENLAM+A NSVRS IK+ VGRANEKQ KRKRTI
Sbjct: 601 TKLAGENFNMQPRISNLSSEVSKMKSNENLAMMAGNSVRSHIKNSVGRANEKQGKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
ETVESI+YLYH+SKKMHSQIEEKLSLLH LN PTEK KSE VISN QDS ADKK RK
Sbjct: 661 ETVESIDYLYHESKKMHSQIEEKLSLLHALNSPTEKALDKSEHVISNVLQDSCADKKIRK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L QKK+K+Q LLDN E+KLNKVDTEVC P+S+ +PSQPVSKL D+CQPC+EEL+
Sbjct: 721 KRKALCQKKLKVQHLLDNSEMKLNKVDTEVCAPKSIGIKPSQPVSKLMDNCQPCVEELNT 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
VISELQ LETFGNIA DYMKLLDLDSAADE+CYRRA+EMPLSPSLP+I + G ETSA
Sbjct: 781 HVISELQSLETFGNIANVDYMKLLDLDSAADEECYRRAIEMPLSPSLPNIYISGAETSAS 840
Query: 841 NDFEPLVDEFHKKLPKEK-GQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPC 900
N+FEPLVDE HK+LP E+ GQP +HSY+VIDVEIKSNYTQSC+ LL DIHSSK Q+DPC
Sbjct: 841 NEFEPLVDELHKELPDEREGQPKTHSYNVIDVEIKSNYTQSCDFDLLADIHSSKCQLDPC 900
Query: 901 LMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDF 960
L+ G +DL D+VQA CLD+VGVIV MPGT SLSGCE V SEIKSG+ N PDF
Sbjct: 901 LIQGRQENDLFDVVQAGNNCLDQVGVIVGMPGTNVSLSGCEEVGASEIKSGTLGNSNPDF 960
Query: 961 CVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFV 1020
CVLFSN++DCHSI +IFSATRAC+KR S+ +QKEWMVQEILA+LNMEH L+PKEKTCVF
Sbjct: 961 CVLFSNSKDCHSILKIFSATRACVKRSSIITQKEWMVQEILASLNMEHELVPKEKTCVFF 1020
Query: 1021 SLVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIED 1080
SL+LLNFTVVA+H+YGNFLNC TC+DSFSGHI EAML+V RSLF +L CLD LLAL+ED
Sbjct: 1021 SLLLLNFTVVAVHKYGNFLNCHTCLDSFSGHICEAMLDVAIRSLFTKLLCLDALLALMED 1080
Query: 1081 FIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAV 1140
F+IDGRVLS TDASFETLT+G +RVNIPID V+RTLS PAS DYLIAGSSILASISKAV
Sbjct: 1081 FLIDGRVLSFTDASFETLTQGVLRVNIPIDSVNRTLSLTPASTDYLIAGSSILASISKAV 1140
Query: 1141 DRTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLE 1200
RTG+LWE+SY ILR CRYESS MLT+LHIFAHIGGDQFFSLE YS L AVLKSII HLE
Sbjct: 1141 HRTGILWEISYRILRSCRYESSLMLTILHIFAHIGGDQFFSLEVYSNLRAVLKSIITHLE 1200
Query: 1201 LVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDL 1260
VG+S+DATFT KRNCR EFVQCANCPFSEE M MPMVVSFLL+L+ KN+SN IM EDL
Sbjct: 1201 TVGSSNDATFTPLKRNCRAEFVQCANCPFSEEGMPMPMVVSFLLRLLQKNISNEIMDEDL 1260
Query: 1261 ENATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFN 1320
EN+TSS NL+SLF+RN+ QIP +NSS KEVHP++YLDCDASCCLKK+KVSDD+P F FN
Sbjct: 1261 ENSTSSLNLESLFKRNLANQIPCKNSSGKEVHPSVYLDCDASCCLKKFKVSDDEPRFLFN 1320
Query: 1321 PTLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVD 1380
PTLCDV+DAISLVEL+A YM WNWT NII QL+ELLKSSVK+ AIVILLGQLGRFGV
Sbjct: 1321 PTLCDVTDAISLVELLAWYMGWNWTFANIIPQLMELLKSSVKKGFAIVILLGQLGRFGVV 1380
Query: 1381 SGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPP 1440
+GGF+DGGVKILRSNLS FLCLDTTIKSGL VQIA VS+LLGLLPFDFETIVQDK Y
Sbjct: 1381 AGGFDDGGVKILRSNLSSFLCLDTTIKSGLPVQIATVSSLLGLLPFDFETIVQDKVRYRA 1440
Query: 1441 SLSQYVEVNLIKTWYSLLSSKQKELSCNILQIS 1468
S +QY EVNLIKTW+SLLS KQKELSCNILQ++
Sbjct: 1441 SPNQYAEVNLIKTWFSLLSPKQKELSCNILQVA 1472
BLAST of MC03g0609 vs. NCBI nr
Match:
XP_023000919.1 (uncharacterized protein LOC111495215 [Cucurbita maxima] >XP_023000920.1 uncharacterized protein LOC111495215 [Cucurbita maxima] >XP_023000921.1 uncharacterized protein LOC111495215 [Cucurbita maxima])
HSP 1 Score: 2039 bits (5283), Expect = 0.0
Identity = 1110/1473 (75.36%), Postives = 1247/1473 (84.66%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M D V KPESSNSCCK WKD Y+KLEEKR ALRQAVKLLE+QI KIQAENLNLK+GYE+
Sbjct: 1 MVGDVVSKPESSNSCCKVWKDMYTKLEEKRIALRQAVKLLEEQIRKIQAENLNLKEGYEK 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARASIERE KDKES IRVSLEREI DLKSQISSL QN+V AVN HGEV+HLN LVAE
Sbjct: 61 EKARASIERESKDKESAIRVSLEREISDLKSQISSLRQNDVEAVNVHGEVDHLNVLVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KK+ISQLKELLE EKRRTDAE+KNAE RKEEAAQA KT K+ERSK SD +K HKTEMDK
Sbjct: 121 KKKISQLKELLETEKRRTDAERKNAEARKEEAAQALKTMKIERSKASDLKKLHKTEMDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NE RQQLGML+KE EETKLKLAS+TSKL E++K+LE+EKQ TFKEK+RADSEMSKA+ SR
Sbjct: 181 NEFRQQLGMLEKEYEETKLKLASKTSKLTEVMKDLEIEKQRTFKEKKRADSEMSKAQASR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
M+ E KQ EKS+AE L +QL RK KI++LQKQVKE + K IE+ C Q K+T+
Sbjct: 241 MQTEVTMKQVGEEKSKAENLFQQLERKTCKIKKLQKQVKEFKTLKKFIESCCGQPIKRTN 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SK V++NDKP E IQR+ NE+KL ++ +KAKEVN++HKM DLAIMKEK ++ M S
Sbjct: 301 SKDVKKNDKPWLEMIQRNENELKLAFEYVKAKEVNIKHKMDEDLAIMKEKTVNSNMMKSS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LKN + IY +KAMDEQCRADKLS ELEE RK+EELQK LR FKS RKL DAS VS EH
Sbjct: 361 ELKNHLEIYRRKAMDEQCRADKLSLELEEKNRKIEELQKNLRGFKSSRKLADASAVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HARQVANLEK HRS+IQQ+LGRFKLEFVQLSNHLD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHARQVANLEKNHRSVIQQELGRFKLEFVQLSNHLDD 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK +STG KDN D EK NAE L R Y KK+++AIE FQ WMP++ FR+ P HGAPLL
Sbjct: 481 LHKFSSTGTKDNDDSEKTMNAEKLQRSYPKKNLRAIEAFQAWMPDT-FRQATPHHGAPLL 540
Query: 541 -SSEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
SS GNH+TS+SGIESRLES PG SNRKMLQSCAVNSSTASFSDGQLVGSQ+ G LTA
Sbjct: 541 PSSVGNHITSLSGIESRLESFPGDSNRKMLQSCAVNSSTASFSDGQLVGSQEN-GFRLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
KL GEN N+QP++SNL+ E+SKMKSNENLAM+A NSVRS IK+++GRANEKQ KRKRTI
Sbjct: 601 TKLAGENFNMQPRISNLSSEVSKMKSNENLAMMAGNSVRSHIKNNIGRANEKQGKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
ETVESI+YLYH+SKKMHSQIEEKLSLLH LN PTEKP KSE VISN QDS ADKK RK
Sbjct: 661 ETVESIDYLYHESKKMHSQIEEKLSLLHALNSPTEKPLDKSEHVISNVLQDSCADKKIRK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L QKK+K+Q LLDN E+KLNKVDTEVC P+S+ +PSQPVSKL D+CQPC+EEL+
Sbjct: 721 KRKALCQKKLKVQHLLDNSEMKLNKVDTEVCAPKSIGIKPSQPVSKLMDNCQPCVEELNT 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
V SELQ LETFGNIA DYMKLLDLDSAADE+CYRRA+EMPLSP LP+I + G ETSAL
Sbjct: 781 YVRSELQTLETFGNIANVDYMKLLDLDSAADEECYRRAIEMPLSP-LPNIYIYGAETSAL 840
Query: 841 NDFEPLVDEFHKKLPKEK-GQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPC 900
N+FEPLVDE HK+LP E+ GQP +HSY VIDVEIKSNYTQSC+ LL DIHSSKRQ+DPC
Sbjct: 841 NEFEPLVDELHKELPDEREGQPKTHSYTVIDVEIKSNYTQSCDFDLLGDIHSSKRQLDPC 900
Query: 901 LMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDF 960
L+ G +DL DIVQA CLD+VGVIV MPGT SLSGCEGV SEIKSG+ N PDF
Sbjct: 901 LIQGRQENDLFDIVQAGNNCLDQVGVIVGMPGTNVSLSGCEGVGASEIKSGTLGNSNPDF 960
Query: 961 CVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFV 1020
CV+FSN+ DCHSI +IFSATRAC+KR S+ +QKEWMVQEILA+LNMEH L+PKEKTCVF
Sbjct: 961 CVIFSNSNDCHSILKIFSATRACVKRSSIITQKEWMVQEILASLNMEHELVPKEKTCVFF 1020
Query: 1021 SLVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIED 1080
SL+LLNFTVVA+H+YGNFLNC TC+DSFSGHI EAML+V RSLF +L CLD LLAL+ED
Sbjct: 1021 SLLLLNFTVVAVHKYGNFLNCHTCLDSFSGHICEAMLDVAIRSLFTKLLCLDALLALMED 1080
Query: 1081 FIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAV 1140
F+IDG+VLSCTDASFETLT+G +RVNIPID V+RTLS PAS DYLIAGSSILASISKAV
Sbjct: 1081 FLIDGQVLSCTDASFETLTQGVLRVNIPIDSVNRTLSLTPASTDYLIAGSSILASISKAV 1140
Query: 1141 DRTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLE 1200
RTGLLWE+SY ILR CRYESS MLT+LHIFAHIGGDQFFSLE YS L AVLKSII HLE
Sbjct: 1141 HRTGLLWEISYRILRSCRYESSLMLTILHIFAHIGGDQFFSLEVYSNLRAVLKSIITHLE 1200
Query: 1201 LVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDL 1260
VG+S+DATFT KRNCR EFVQCANCPFSEE MSMPMVVSFLLQL+ KN+SN IM EDL
Sbjct: 1201 TVGSSNDATFTPLKRNCRAEFVQCANCPFSEEGMSMPMVVSFLLQLLPKNISNEIMDEDL 1260
Query: 1261 ENATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFN 1320
EN TSS NL+SLF+RN+ QIP +NSS KEVHP++YLDCDASCCLKK+KVSDD+P F FN
Sbjct: 1261 ENPTSSLNLESLFKRNLANQIPCKNSSGKEVHPSVYLDCDASCCLKKFKVSDDEPRFLFN 1320
Query: 1321 PTLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVD 1380
PTLCDV+DAISLVEL+A YM WNWT NII QL+ELLKSSV + AIVILLGQLGRFGVD
Sbjct: 1321 PTLCDVTDAISLVELLAWYMGWNWTFANIIPQLMELLKSSVTKGFAIVILLGQLGRFGVD 1380
Query: 1381 SGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPP 1440
+GGFE+GGVKILRSNLS FLCLDTTIKSGL VQIA VS+LLGLLPFDFETIVQDK
Sbjct: 1381 AGGFENGGVKILRSNLSSFLCLDTTIKSGLPVQIATVSSLLGLLPFDFETIVQDKVRCRA 1440
Query: 1441 SLSQYVEVNLIKTWYSLLSSKQKELSCNILQIS 1468
S +QYVEVNLIK W+SLLS KQKELSCNILQ++
Sbjct: 1441 SSNQYVEVNLIKMWFSLLSPKQKELSCNILQVA 1470
BLAST of MC03g0609 vs. ExPASy TrEMBL
Match:
A0A6J1CE16 (uncharacterized protein LOC111010806 OS=Momordica charantia OX=3673 GN=LOC111010806 PE=4 SV=1)
HSP 1 Score: 2793 bits (7240), Expect = 0.0
Identity = 1469/1471 (99.86%), Postives = 1469/1471 (99.86%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE
Sbjct: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC
Sbjct: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA
Sbjct: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR
Sbjct: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD
Sbjct: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE
Sbjct: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDASVSSEHA 420
DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDASVSSEHA
Sbjct: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDASVSSEHA 420
Query: 421 TSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDTI 480
TSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDTI
Sbjct: 421 TSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDTI 480
Query: 481 HKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLLS 540
HKLASTGAKDNHDLEK NAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLLS
Sbjct: 481 HKLASTGAKDNHDLEKTRNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLLS 540
Query: 541 SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTAAK 600
SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTAAK
Sbjct: 541 SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTAAK 600
Query: 601 LVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTIET 660
LVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTIET
Sbjct: 601 LVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTIET 660
Query: 661 VESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRKKT 720
VESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRKKT
Sbjct: 661 VESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRKKT 720
Query: 721 NSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDNSV 780
NSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDNSV
Sbjct: 721 NSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDNSV 780
Query: 781 ISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSALND 840
ISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSALND
Sbjct: 781 ISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSALND 840
Query: 841 FEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCLML 900
FEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCLML
Sbjct: 841 FEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCLML 900
Query: 901 GSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFCVL 960
GSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFCVL
Sbjct: 901 GSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFCVL 960
Query: 961 FSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVSLV 1020
FSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVSLV
Sbjct: 961 FSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVSLV 1020
Query: 1021 LLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDFII 1080
LLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDFII
Sbjct: 1021 LLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDFII 1080
Query: 1081 DGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVDRT 1140
DGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVDRT
Sbjct: 1081 DGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVDRT 1140
Query: 1141 GLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLELVG 1200
GLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLELVG
Sbjct: 1141 GLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLELVG 1200
Query: 1201 TSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENA 1260
TSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENA
Sbjct: 1201 TSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENA 1260
Query: 1261 TSSSNLKSLFERNIQIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCD 1320
TSSSNLKSLFERNIQIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCD
Sbjct: 1261 TSSSNLKSLFERNIQIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCD 1320
Query: 1321 VSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDSGGFE 1380
VSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDSGGFE
Sbjct: 1321 VSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDSGGFE 1380
Query: 1381 DGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPSLSQY 1440
DGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPSLSQY
Sbjct: 1381 DGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPSLSQY 1440
Query: 1441 VEVNLIKTWYSLLSSKQKELSCNILQISVAM 1471
VEVNLIKTWYSLLSSKQKELSCNILQISVAM
Sbjct: 1441 VEVNLIKTWYSLLSSKQKELSCNILQISVAM 1471
BLAST of MC03g0609 vs. ExPASy TrEMBL
Match:
A0A6J1EFZ6 (myosin heavy chain, non-muscle-like OS=Cucurbita moschata OX=3662 GN=LOC111433978 PE=4 SV=1)
HSP 1 Score: 2045 bits (5298), Expect = 0.0
Identity = 1111/1473 (75.42%), Postives = 1249/1473 (84.79%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M D V KPESSNSCCK WKD Y+KLEEKR ALRQAVKLLE+QI KIQAENLNLK+GYE+
Sbjct: 1 MVGDVVSKPESSNSCCKVWKDMYTKLEEKRIALRQAVKLLEEQIRKIQAENLNLKEGYEK 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARASIERE KDKES IRVSLEREI DLKSQISSL QN+V AVN GEV+HLN LVAE
Sbjct: 61 EKARASIERESKDKESAIRVSLEREISDLKSQISSLRQNDVEAVNVRGEVDHLNVLVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KK+ISQLKELLE EKRRTDAE+KNAE RKEEAAQA KT K+ERSK SD +K HKTEMDK
Sbjct: 121 KKKISQLKELLETEKRRTDAERKNAEARKEEAAQALKTMKIERSKASDLKKLHKTEMDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NE RQQLGML+KE EETKLKLASETSKL E++K+LE+EKQ TFKEK+RADSEMSKA+ SR
Sbjct: 181 NECRQQLGMLEKEYEETKLKLASETSKLTEVMKDLEIEKQRTFKEKKRADSEMSKAQASR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
M+ E KQ EKSRAE L +QL RK KI++L+KQVKEL+ K IE+ C Q K+T+
Sbjct: 241 MQTEVTVKQVGEEKSRAENLFQQLERKTCKIKKLRKQVKELKTLKKFIESCCGQPVKRTN 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SK V++NDKP E IQR+ NE+KL ++ +KAKEVN+ +KM DLAIMKEK ++ M +
Sbjct: 301 SKDVKKNDKPWLEMIQRNENELKLAFECVKAKEVNINYKMDEDLAIMKEKTVNSNMMKAS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LKN + IY +KAMDEQCRADKLS ELEE RK+EELQK LREFKS RKL DAS VS EH
Sbjct: 361 ELKNHLEIYRRKAMDEQCRADKLSLELEEKNRKIEELQKNLREFKSSRKLADASAVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HARQVANLEK HRS+IQQ+LGRFKLEFVQLSNHLD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHARQVANLEKNHRSVIQQELGRFKLEFVQLSNHLDD 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK +STG KDN D EK NAE L YSKK+++AIE FQ WMP+ NFR+ P HGAPLL
Sbjct: 481 LHKFSSTGTKDNDDSEKTMNAEKLQSSYSKKNLRAIEAFQAWMPD-NFRQATPHHGAPLL 540
Query: 541 -SSEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
SS GNH+TS+SGIESRLES PG SNRKMLQSCAVNSSTASFSDGQLVGSQ+KAGL LTA
Sbjct: 541 PSSVGNHITSLSGIESRLESFPGDSNRKMLQSCAVNSSTASFSDGQLVGSQEKAGLRLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
KL GEN N+QP++SNL+ E+SKMKSNENLAM+A NSVRS IK+ VGRANEKQ KRKRTI
Sbjct: 601 TKLAGENFNMQPRISNLSSEVSKMKSNENLAMMAGNSVRSHIKNSVGRANEKQGKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
ETVESI+YLYH+SKKMHSQIEEKLSLLH LN PTEK KSE VISN QDS ADKK RK
Sbjct: 661 ETVESIDYLYHESKKMHSQIEEKLSLLHALNSPTEKALDKSEHVISNVLQDSCADKKIRK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L QKK+K+Q LLDN E+KLNKVDTEVC P+S+ +PSQPVSKL D+CQPC+EEL+
Sbjct: 721 KRKALCQKKLKVQHLLDNSEMKLNKVDTEVCAPKSIGIKPSQPVSKLMDNCQPCVEELNT 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
VISELQ LETFGNIA DYMKLLDLDSAADE+CYRRA+EMPLSPSLP+I + G ETSA
Sbjct: 781 HVISELQSLETFGNIANVDYMKLLDLDSAADEECYRRAIEMPLSPSLPNIYISGAETSAS 840
Query: 841 NDFEPLVDEFHKKLPKEK-GQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPC 900
N+FEPLVDE HK+LP E+ GQP +HSY+VIDVEIKSNYTQSC+ LL DIHSSK Q+DPC
Sbjct: 841 NEFEPLVDELHKELPDEREGQPKTHSYNVIDVEIKSNYTQSCDFDLLADIHSSKCQLDPC 900
Query: 901 LMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDF 960
L+ G +DL D+VQA CLD+VGVIV MPGT SLSGCE V SEIKSG+ N PDF
Sbjct: 901 LIQGRQENDLFDVVQAGNNCLDQVGVIVGMPGTNVSLSGCEEVGASEIKSGTLGNSNPDF 960
Query: 961 CVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFV 1020
CVLFSN++DCHSI +IFSATRAC+KR S+ +QKEWMVQEILA+LNMEH L+PKEKTCVF
Sbjct: 961 CVLFSNSKDCHSILKIFSATRACVKRSSIITQKEWMVQEILASLNMEHELVPKEKTCVFF 1020
Query: 1021 SLVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIED 1080
SL+LLNFTVVA+H+YGNFLNC TC+DSFSGHI EAML+V RSLF +L CLD LLAL+ED
Sbjct: 1021 SLLLLNFTVVAVHKYGNFLNCHTCLDSFSGHICEAMLDVAIRSLFTKLLCLDALLALMED 1080
Query: 1081 FIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAV 1140
F+IDGRVLS TDASFETLT+G +RVNIPID V+RTLS PAS DYLIAGSSILASISKAV
Sbjct: 1081 FLIDGRVLSFTDASFETLTQGVLRVNIPIDSVNRTLSLTPASTDYLIAGSSILASISKAV 1140
Query: 1141 DRTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLE 1200
RTG+LWE+SY ILR CRYESS MLT+LHIFAHIGGDQFFSLE YS L AVLKSII HLE
Sbjct: 1141 HRTGILWEISYRILRSCRYESSLMLTILHIFAHIGGDQFFSLEVYSNLRAVLKSIITHLE 1200
Query: 1201 LVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDL 1260
VG+S+DATFT KRNCR EFVQCANCPFSEE M MPMVVSFLL+L+ KN+SN IM EDL
Sbjct: 1201 TVGSSNDATFTPLKRNCRAEFVQCANCPFSEEGMPMPMVVSFLLRLLQKNISNEIMDEDL 1260
Query: 1261 ENATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFN 1320
EN+TSS NL+SLF+RN+ QIP +NSS KEVHP++YLDCDASCCLKK+KVSDD+P F FN
Sbjct: 1261 ENSTSSLNLESLFKRNLANQIPCKNSSGKEVHPSVYLDCDASCCLKKFKVSDDEPRFLFN 1320
Query: 1321 PTLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVD 1380
PTLCDV+DAISLVEL+A YM WNWT NII QL+ELLKSSVK+ AIVILLGQLGRFGV
Sbjct: 1321 PTLCDVTDAISLVELLAWYMGWNWTFANIIPQLMELLKSSVKKGFAIVILLGQLGRFGVV 1380
Query: 1381 SGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPP 1440
+GGF+DGGVKILRSNLS FLCLDTTIKSGL VQIA VS+LLGLLPFDFETIVQDK Y
Sbjct: 1381 AGGFDDGGVKILRSNLSSFLCLDTTIKSGLPVQIATVSSLLGLLPFDFETIVQDKVRYRA 1440
Query: 1441 SLSQYVEVNLIKTWYSLLSSKQKELSCNILQIS 1468
S +QY EVNLIKTW+SLLS KQKELSCNILQ++
Sbjct: 1441 SPNQYAEVNLIKTWFSLLSPKQKELSCNILQVA 1472
BLAST of MC03g0609 vs. ExPASy TrEMBL
Match:
A0A6J1KH58 (uncharacterized protein LOC111495215 OS=Cucurbita maxima OX=3661 GN=LOC111495215 PE=4 SV=1)
HSP 1 Score: 2039 bits (5283), Expect = 0.0
Identity = 1110/1473 (75.36%), Postives = 1247/1473 (84.66%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M D V KPESSNSCCK WKD Y+KLEEKR ALRQAVKLLE+QI KIQAENLNLK+GYE+
Sbjct: 1 MVGDVVSKPESSNSCCKVWKDMYTKLEEKRIALRQAVKLLEEQIRKIQAENLNLKEGYEK 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKARASIERE KDKES IRVSLEREI DLKSQISSL QN+V AVN HGEV+HLN LVAE
Sbjct: 61 EKARASIERESKDKESAIRVSLEREISDLKSQISSLRQNDVEAVNVHGEVDHLNVLVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KK+ISQLKELLE EKRRTDAE+KNAE RKEEAAQA KT K+ERSK SD +K HKTEMDK
Sbjct: 121 KKKISQLKELLETEKRRTDAERKNAEARKEEAAQALKTMKIERSKASDLKKLHKTEMDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
NE RQQLGML+KE EETKLKLAS+TSKL E++K+LE+EKQ TFKEK+RADSEMSKA+ SR
Sbjct: 181 NEFRQQLGMLEKEYEETKLKLASKTSKLTEVMKDLEIEKQRTFKEKKRADSEMSKAQASR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
M+ E KQ EKS+AE L +QL RK KI++LQKQVKE + K IE+ C Q K+T+
Sbjct: 241 MQTEVTMKQVGEEKSKAENLFQQLERKTCKIKKLQKQVKEFKTLKKFIESCCGQPIKRTN 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
SK V++NDKP E IQR+ NE+KL ++ +KAKEVN++HKM DLAIMKEK ++ M S
Sbjct: 301 SKDVKKNDKPWLEMIQRNENELKLAFEYVKAKEVNIKHKMDEDLAIMKEKTVNSNMMKSS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LKN + IY +KAMDEQCRADKLS ELEE RK+EELQK LR FKS RKL DAS VS EH
Sbjct: 361 ELKNHLEIYRRKAMDEQCRADKLSLELEEKNRKIEELQKNLRGFKSSRKLADASAVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HARQVANLEK HRS+IQQ+LGRFKLEFVQLSNHLD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHARQVANLEKNHRSVIQQELGRFKLEFVQLSNHLDD 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK +STG KDN D EK NAE L R Y KK+++AIE FQ WMP++ FR+ P HGAPLL
Sbjct: 481 LHKFSSTGTKDNDDSEKTMNAEKLQRSYPKKNLRAIEAFQAWMPDT-FRQATPHHGAPLL 540
Query: 541 -SSEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
SS GNH+TS+SGIESRLES PG SNRKMLQSCAVNSSTASFSDGQLVGSQ+ G LTA
Sbjct: 541 PSSVGNHITSLSGIESRLESFPGDSNRKMLQSCAVNSSTASFSDGQLVGSQEN-GFRLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
KL GEN N+QP++SNL+ E+SKMKSNENLAM+A NSVRS IK+++GRANEKQ KRKRTI
Sbjct: 601 TKLAGENFNMQPRISNLSSEVSKMKSNENLAMMAGNSVRSHIKNNIGRANEKQGKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
ETVESI+YLYH+SKKMHSQIEEKLSLLH LN PTEKP KSE VISN QDS ADKK RK
Sbjct: 661 ETVESIDYLYHESKKMHSQIEEKLSLLHALNSPTEKPLDKSEHVISNVLQDSCADKKIRK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L QKK+K+Q LLDN E+KLNKVDTEVC P+S+ +PSQPVSKL D+CQPC+EEL+
Sbjct: 721 KRKALCQKKLKVQHLLDNSEMKLNKVDTEVCAPKSIGIKPSQPVSKLMDNCQPCVEELNT 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
V SELQ LETFGNIA DYMKLLDLDSAADE+CYRRA+EMPLSP LP+I + G ETSAL
Sbjct: 781 YVRSELQTLETFGNIANVDYMKLLDLDSAADEECYRRAIEMPLSP-LPNIYIYGAETSAL 840
Query: 841 NDFEPLVDEFHKKLPKEK-GQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPC 900
N+FEPLVDE HK+LP E+ GQP +HSY VIDVEIKSNYTQSC+ LL DIHSSKRQ+DPC
Sbjct: 841 NEFEPLVDELHKELPDEREGQPKTHSYTVIDVEIKSNYTQSCDFDLLGDIHSSKRQLDPC 900
Query: 901 LMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDF 960
L+ G +DL DIVQA CLD+VGVIV MPGT SLSGCEGV SEIKSG+ N PDF
Sbjct: 901 LIQGRQENDLFDIVQAGNNCLDQVGVIVGMPGTNVSLSGCEGVGASEIKSGTLGNSNPDF 960
Query: 961 CVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFV 1020
CV+FSN+ DCHSI +IFSATRAC+KR S+ +QKEWMVQEILA+LNMEH L+PKEKTCVF
Sbjct: 961 CVIFSNSNDCHSILKIFSATRACVKRSSIITQKEWMVQEILASLNMEHELVPKEKTCVFF 1020
Query: 1021 SLVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIED 1080
SL+LLNFTVVA+H+YGNFLNC TC+DSFSGHI EAML+V RSLF +L CLD LLAL+ED
Sbjct: 1021 SLLLLNFTVVAVHKYGNFLNCHTCLDSFSGHICEAMLDVAIRSLFTKLLCLDALLALMED 1080
Query: 1081 FIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAV 1140
F+IDG+VLSCTDASFETLT+G +RVNIPID V+RTLS PAS DYLIAGSSILASISKAV
Sbjct: 1081 FLIDGQVLSCTDASFETLTQGVLRVNIPIDSVNRTLSLTPASTDYLIAGSSILASISKAV 1140
Query: 1141 DRTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLE 1200
RTGLLWE+SY ILR CRYESS MLT+LHIFAHIGGDQFFSLE YS L AVLKSII HLE
Sbjct: 1141 HRTGLLWEISYRILRSCRYESSLMLTILHIFAHIGGDQFFSLEVYSNLRAVLKSIITHLE 1200
Query: 1201 LVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDL 1260
VG+S+DATFT KRNCR EFVQCANCPFSEE MSMPMVVSFLLQL+ KN+SN IM EDL
Sbjct: 1201 TVGSSNDATFTPLKRNCRAEFVQCANCPFSEEGMSMPMVVSFLLQLLPKNISNEIMDEDL 1260
Query: 1261 ENATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFN 1320
EN TSS NL+SLF+RN+ QIP +NSS KEVHP++YLDCDASCCLKK+KVSDD+P F FN
Sbjct: 1261 ENPTSSLNLESLFKRNLANQIPCKNSSGKEVHPSVYLDCDASCCLKKFKVSDDEPRFLFN 1320
Query: 1321 PTLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVD 1380
PTLCDV+DAISLVEL+A YM WNWT NII QL+ELLKSSV + AIVILLGQLGRFGVD
Sbjct: 1321 PTLCDVTDAISLVELLAWYMGWNWTFANIIPQLMELLKSSVTKGFAIVILLGQLGRFGVD 1380
Query: 1381 SGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPP 1440
+GGFE+GGVKILRSNLS FLCLDTTIKSGL VQIA VS+LLGLLPFDFETIVQDK
Sbjct: 1381 AGGFENGGVKILRSNLSSFLCLDTTIKSGLPVQIATVSSLLGLLPFDFETIVQDKVRCRA 1440
Query: 1441 SLSQYVEVNLIKTWYSLLSSKQKELSCNILQIS 1468
S +QYVEVNLIK W+SLLS KQKELSCNILQ++
Sbjct: 1441 SSNQYVEVNLIKMWFSLLSPKQKELSCNILQVA 1470
BLAST of MC03g0609 vs. ExPASy TrEMBL
Match:
A0A6J1GRR8 (restin homolog isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456908 PE=4 SV=1)
HSP 1 Score: 2009 bits (5206), Expect = 0.0
Identity = 1098/1473 (74.54%), Postives = 1242/1473 (84.32%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M +D VP PES+NSCCK WKDK +++EEKRNALRQAVKLL+QQINKIQAENLNLKKGYEE
Sbjct: 1 MLDDFVPNPESANSCCKRWKDKCTEVEEKRNALRQAVKLLQQQINKIQAENLNLKKGYEE 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
+KA ASIEREGK+KES IRVSLEREILDLKS ISSL QN+V+AV EVE LNALVAE
Sbjct: 61 DKAGASIEREGKEKESAIRVSLEREILDLKSHISSLRQNDVDAVEVCREVEQLNALVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KKEIS L ELLE EKR+TDAE+KNAEVRKEEAAQA KT ++ERSK SD RK HK E+DK
Sbjct: 121 KKEISHLNELLETEKRKTDAERKNAEVRKEEAAQALKTVRIERSKASDLRKLHKNELDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
E RQQL MLKKE EETKLKLASETSKL E+ K+LE+EK+ T KE+ERA+SEMSKA SR
Sbjct: 181 KECRQQLEMLKKEYEETKLKLASETSKLNEVKKDLEIEKRRTSKERERANSEMSKAHVSR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
++AEA RKQAE E+S+AE LL+QL RK KIEELQKQVKELQ K IE+ C Q D+KTD
Sbjct: 241 IQAEANRKQAEEEQSKAENLLQQLDRKTCKIEELQKQVKELQTLKTFIESCCGQHDEKTD 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
KAVE+N KP E IQ++ANE KL ++ LK KEVN+ HKM GDLAIMKEKP+D M S
Sbjct: 301 GKAVEKNVKPWLEVIQKNANEFKLAFEFLKDKEVNIMHKMDGDLAIMKEKPLDSNMMKSS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LK + IY KKAMDEQ RADKL+ ELEE KRKVE+LQK LRE KS RKLVDAS VS EH
Sbjct: 361 ELKKHLEIYRKKAMDEQYRADKLALELEEKKRKVEKLQKNLRELKSSRKLVDASAVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HAR+VANLE THRS+IQ +LG FKL+FVQLSN+LD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHAREVANLENTHRSIIQHELGSFKLKFVQLSNYLDN 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK ASTGAK N DLEK NAENL LY++K++ AIEPF+TW+P++ FR+ PQH APLL
Sbjct: 481 LHKFASTGAKGNDDLEKTKNAENLRSLYAEKNLHAIEPFKTWLPDT-FRQTTPQHDAPLL 540
Query: 541 S-SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
S GNHVTS+SGIESRLE+ P S+RKM QSCAVNSSTASFSDGQL GS++KAGLCLTA
Sbjct: 541 PLSGGNHVTSLSGIESRLEAHPVNSDRKMFQSCAVNSSTASFSDGQLAGSREKAGLCLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
AKLVGENL ++PK+SN++GE+S+MK NEN A +AENSVRSPIK+HVGRANEKQ+KRKRTI
Sbjct: 601 AKLVGENLIMKPKISNVSGEVSEMKDNEN-ARMAENSVRSPIKNHVGRANEKQQKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
E VE+IE LYH+S+K+HSQIEEKLSLLH LN PTEKP KS VISN QD SADKK RK
Sbjct: 661 EAVENIECLYHESRKIHSQIEEKLSLLHALNSPTEKPLDKSGHVISNVFQDPSADKKARK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L QKK + Q LLD++E++L+KV+ EVC ES RQPSQPVSKLTD+ QPCLEEL+N
Sbjct: 721 KRKTLCQKKTREQ-LLDDNEMELSKVNIEVCALESFGRQPSQPVSKLTDNLQPCLEELNN 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
S ISELQ L T GNI+ GDYMKLLDLDSAADE+CYRRAMEMPLSPSLPDI +PG ETS L
Sbjct: 781 SAISELQTLGTLGNISDGDYMKLLDLDSAADEECYRRAMEMPLSPSLPDIYIPGAETSTL 840
Query: 841 NDFEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCL 900
N+FEPL+DE H E+GQP HSYDV+DVEIKSN+TQ CNSGLL DIHSSK +DPC
Sbjct: 841 NEFEPLLDERH-----EEGQPQLHSYDVMDVEIKSNHTQYCNSGLLGDIHSSKHHLDPCF 900
Query: 901 MLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFC 960
M GSHGSDLCDIVQA++ L+++GV VEMPGT LSGCEGV SEIKSG+ DN +PDFC
Sbjct: 901 MQGSHGSDLCDIVQAKENYLNQIGVTVEMPGTNVPLSGCEGVGASEIKSGTLDNSIPDFC 960
Query: 961 VLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVS 1020
VLFSN +DC SISRIFSATRAC KR SLT++KEWMVQEILA+LNMEH+LLP EK CVF S
Sbjct: 961 VLFSNIKDCRSISRIFSATRACSKRSSLTNKKEWMVQEILASLNMEHKLLPMEKACVFFS 1020
Query: 1021 LVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDF 1080
L+LLNF VVA+H+YGNFLNC TC+DSFSGHI EAML+VE RSLF E CLDELL+LIEDF
Sbjct: 1021 LLLLNFNVVAVHKYGNFLNCNTCLDSFSGHICEAMLDVEIRSLFTESLCLDELLSLIEDF 1080
Query: 1081 IIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVD 1140
IIDGR+LSC DAS ET EG +RVNI +DGV+R LS PAS +YLIAGSSILASISKAVD
Sbjct: 1081 IIDGRILSCIDASLETSIEGVLRVNISVDGVNRALSLTPASTNYLIAGSSILASISKAVD 1140
Query: 1141 RTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLEL 1200
RTG LWEVSYSILRICRYESS +LTMLHIFAHIGGDQFFSLEGYS LMAVLKSII HLE+
Sbjct: 1141 RTGFLWEVSYSILRICRYESSLVLTMLHIFAHIGGDQFFSLEGYSTLMAVLKSIITHLEV 1200
Query: 1201 VGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLE 1260
VG+SDDA+FT PK NCRTEFVQCA+CPFS +MSMPM VSFLL+L+ KN +AEDLE
Sbjct: 1201 VGSSDDASFTPPKGNCRTEFVQCAHCPFSANIMSMPMAVSFLLRLVHKN----ALAEDLE 1260
Query: 1261 NATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNP 1320
N T S N +SL E+NI QIP +N S +EVHPALYLDCDASCCLKKY+VSDD+ FNP
Sbjct: 1261 NPTGSLNPESLSEKNIAYQIPCKNLSGQEVHPALYLDCDASCCLKKYRVSDDESWSLFNP 1320
Query: 1321 TLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDS 1380
+LC+++DAISLVEL+ACYM WNWT NIISQLLE LKSSVKESL VILLGQLGRFGV +
Sbjct: 1321 SLCEITDAISLVELLACYMGWNWTFANIISQLLEFLKSSVKESLPFVILLGQLGRFGVAA 1380
Query: 1381 GGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPS 1440
GGFEDGGVKILRSNLS FL LDTTIKSGLCVQIAIVSALLGLLPF+FETI+QDK SYP S
Sbjct: 1381 GGFEDGGVKILRSNLSAFLYLDTTIKSGLCVQIAIVSALLGLLPFEFETIIQDKVSYPAS 1440
Query: 1441 LSQYVEVNLIKTWYSLLSSKQKELSCNILQISV 1469
+ YVEVNLIKTW+S LS KQKELSCN LQ++V
Sbjct: 1441 SNPYVEVNLIKTWFSFLSPKQKELSCNTLQVAV 1461
BLAST of MC03g0609 vs. ExPASy TrEMBL
Match:
A0A6J1JWM7 (protein MLP1-like OS=Cucurbita maxima OX=3661 GN=LOC111490360 PE=4 SV=1)
HSP 1 Score: 1998 bits (5175), Expect = 0.0
Identity = 1094/1473 (74.27%), Postives = 1231/1473 (83.57%), Query Frame = 0
Query: 1 MAEDAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEE 60
M +D VP PES+NSCCK WKDK +++EEKRNALRQAVKLL+QQIN+IQAENLNLKKGYEE
Sbjct: 1 MLDDFVPNPESANSCCKRWKDKCTEVEEKRNALRQAVKLLQQQINRIQAENLNLKKGYEE 60
Query: 61 EKARASIEREGKDKESTIRVSLEREILDLKSQISSLEQNNVNAVNDHGEVEHLNALVAEC 120
EKA ASIEREGK+KES IRVSLEREILDLKS ISSL QN+V+AV EVEHLNALVAE
Sbjct: 61 EKAGASIEREGKEKESAIRVSLEREILDLKSHISSLRQNDVDAVKVCREVEHLNALVAEG 120
Query: 121 KKEISQLKELLEIEKRRTDAEKKNAEVRKEEAAQAWKTGKVERSKTSDSRKFHKTEMDKA 180
KKEIS L ELLE EKR+TDAE+KNAEVRKEEAAQ K+ ++ERSK SD RK HKTE+DK
Sbjct: 121 KKEISHLNELLETEKRKTDAERKNAEVRKEEAAQTLKSVRIERSKGSDLRKLHKTELDKV 180
Query: 181 NEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEKERADSEMSKAEDSR 240
E RQQL MLKKE EETKLKL SETSKL+E+ K LE+EK+ T KE+ERA+SEMSKA SR
Sbjct: 181 KECRQQLEMLKKEYEETKLKLESETSKLIEVKKGLEIEKRRTSKERERANSEMSKAHASR 240
Query: 241 MRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQASKISIEARCRQLDKKTD 300
++AEA RKQAE E+S+AE L +QL RK KIEELQKQVKELQ K IE+ C Q D+KTD
Sbjct: 241 VQAEANRKQAEEEQSKAENLFQQLERKTCKIEELQKQVKELQTLKTFIESCCGQHDEKTD 300
Query: 301 SKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLAIMKEKPMDFKQMTSE 360
KAVE+N KP E IQ++ANE+KL ++ LK KEVN+ HKM GDL IMKEKP+D M S
Sbjct: 301 GKAVEKNVKPWLEVIQKNANELKLAFEFLKDKEVNIMHKMDGDLVIMKEKPVDSNIMKSS 360
Query: 361 DLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFKSCRKLVDAS-VSSEH 420
+LK + IY KKAMDEQCRADKL+ ELEE KRKVE+LQK LRE KS RKLVDAS VS EH
Sbjct: 361 ELKKHLEIYRKKAMDEQCRADKLALELEEKKRKVEKLQKNLRELKSSRKLVDASAVSFEH 420
Query: 421 ATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLGRFKLEFVQLSNHLDT 480
A SSERAEMKLLKKKLKFEK R++HAR+VANLE TH S+IQ +LGRFKL+FVQLSN+LD
Sbjct: 421 AMSSERAEMKLLKKKLKFEKTRLKHAREVANLENTHHSIIQHELGRFKLKFVQLSNYLDN 480
Query: 481 IHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQTWMPESNFRRMIPQHGAPLL 540
+HK ASTGAK N DLEK NAENL LY+ K++ AIEPF+TW+P++ FR+ PQH APLL
Sbjct: 481 LHKFASTGAKGNDDLEKTKNAENLRSLYADKNLHAIEPFKTWLPDT-FRQTTPQHDAPLL 540
Query: 541 S-SEGNHVTSVSGIESRLESLPGGSNRKMLQSCAVNSSTASFSDGQLVGSQDKAGLCLTA 600
S GNHVTS+SGIESRLE+ P S+RKM QSCAVNSSTASFSDGQL GSQ+KAGLCLTA
Sbjct: 541 PLSGGNHVTSLSGIESRLEAHPVNSDRKMFQSCAVNSSTASFSDGQLAGSQEKAGLCLTA 600
Query: 601 AKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSPIKDHVGRANEKQKKRKRTI 660
AKLVGENL ++PK+SN++GE+S+MK NE +A +AENSVRSPIK+HVGRANEKQ+KRKRTI
Sbjct: 601 AKLVGENLIMKPKISNVSGEVSEMKDNE-IARMAENSVRSPIKNHVGRANEKQQKRKRTI 660
Query: 661 ETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPTEKPSRKSEDVISNPPQDSSADKKFRK 720
E VE+IE LYH+S+K+HSQIEEKLSLLH LN PTEKP K VISN QD SADKK RK
Sbjct: 661 EAVENIECLYHESRKIHSQIEEKLSLLHALNSPTEKPLEKRGHVISNVFQDPSADKKARK 720
Query: 721 KTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSRQPSQPVSKLTDSCQPCLEELDN 780
K +L KK + Q LLD++E++ +V+ EV ES RQPSQPVSKLTD+ QPC EEL+N
Sbjct: 721 KRKTLCPKKTREQ-LLDDNEME-TQVNIEVRALESFGRQPSQPVSKLTDNLQPCSEELNN 780
Query: 781 SVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRAMEMPLSPSLPDICVPGTETSAL 840
S ISELQ L T GNI+ GDYMKLLDLDSAADE+CYRRAMEMPLSPSLPDI +PG ETS L
Sbjct: 781 SAISELQTLGTLGNISDGDYMKLLDLDSAADEECYRRAMEMPLSPSLPDIYIPGAETSTL 840
Query: 841 NDFEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYTQSCNSGLLRDIHSSKRQVDPCL 900
N+FEPL+DE H E+GQP HSYDV+DVEIKSNYTQ CNSGLL DIHSSKR +DPC
Sbjct: 841 NEFEPLLDERH-----EEGQPQLHSYDVMDVEIKSNYTQHCNSGLLGDIHSSKRHLDPCF 900
Query: 901 MLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSGCEGVETSEIKSGSQDNFVPDFC 960
M G HGSDLCDIV AE+ CLD++G+ VE PGT LSGCEGV S IKSG+ DN +PDFC
Sbjct: 901 MQGRHGSDLCDIVHAEENCLDQIGITVEKPGTNVPLSGCEGVGASAIKSGTLDNSIPDFC 960
Query: 961 VLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQEILAALNMEHRLLPKEKTCVFVS 1020
VLFSN +DC SISRIFSAT+AC KR SLTS+KEWMVQEILA+LNMEH+LLP EK CVF S
Sbjct: 961 VLFSNTKDCRSISRIFSATKACSKRSSLTSKKEWMVQEILASLNMEHKLLPMEKACVFFS 1020
Query: 1021 LVLLNFTVVAMHRYGNFLNCKTCMDSFSGHISEAMLNVETRSLFAELCCLDELLALIEDF 1080
L+LLNF VVA+H+YGNFLNC T +DSFSGHI EAML+VE RSLF EL CLDELLALIEDF
Sbjct: 1021 LLLLNFNVVAVHKYGNFLNCNTYLDSFSGHICEAMLDVEIRSLFTELLCLDELLALIEDF 1080
Query: 1081 IIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLSFVPASADYLIAGSSILASISKAVD 1140
IIDGR+LSC DAS ET EG +RVNI +DGV+R LS PAS +YLIAGSSILASISKAV
Sbjct: 1081 IIDGRILSCIDASLETSIEGVLRVNISVDGVNRALSLTPASTNYLIAGSSILASISKAVH 1140
Query: 1141 RTGLLWEVSYSILRICRYESSQMLTMLHIFAHIGGDQFFSLEGYSILMAVLKSIIMHLEL 1200
RTG LWEVSYSILRICRYESS +LTMLHIFAHIGGDQFFSLEGYS LMAVLKSII HLE+
Sbjct: 1141 RTGFLWEVSYSILRICRYESSLVLTMLHIFAHIGGDQFFSLEGYSTLMAVLKSIITHLEV 1200
Query: 1201 VGTSDDATFTLPKRNCRTEFVQCANCPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLE 1260
VG+SDDA+FT PK NCRTEFVQCA CPFS +MSMPM VSFLL+LI KN +AEDLE
Sbjct: 1201 VGSSDDASFTPPKGNCRTEFVQCAYCPFSANIMSMPMAVSFLLRLIHKN----ALAEDLE 1260
Query: 1261 NATSSSNLKSLFERNI--QIPSENSSCKEVHPALYLDCDASCCLKKYKVSDDQPCFFFNP 1320
N T S N +SL E+NI QIP EN S +EVHPALYLDCDASCCLKKY+VSDD FNP
Sbjct: 1261 NPTGSINPESLSEKNIANQIPCENLSGQEVHPALYLDCDASCCLKKYRVSDDDSWSLFNP 1320
Query: 1321 TLCDVSDAISLVELIACYMSWNWTCVNIISQLLELLKSSVKESLAIVILLGQLGRFGVDS 1380
TLCD++DAISLVEL+ACYM WNWT NIISQLLE LKSSVKESL VILLGQLGRFGV +
Sbjct: 1321 TLCDITDAISLVELLACYMGWNWTFANIISQLLEFLKSSVKESLPFVILLGQLGRFGVAT 1380
Query: 1381 GGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQIAIVSALLGLLPFDFETIVQDKGSYPPS 1440
GGFEDGGVKILRSNLS FLCLDTTIKSGLCVQIA+VSALLGLLPFDFETI+QD+ YP S
Sbjct: 1381 GGFEDGGVKILRSNLSAFLCLDTTIKSGLCVQIAVVSALLGLLPFDFETIIQDEVGYPAS 1440
Query: 1441 LSQYVEVNLIKTWYSLLSSKQKELSCNILQISV 1469
L+QY EVNLIKTW+S LS KQKELSCN LQ++V
Sbjct: 1441 LNQYGEVNLIKTWFSFLSPKQKELSCNTLQVAV 1460
BLAST of MC03g0609 vs. TAIR 10
Match:
AT2G34780.1 (maternal effect embryo arrest 22 )
HSP 1 Score: 457.6 bits (1176), Expect = 3.8e-128
Identity = 466/1513 (30.80%), Postives = 717/1513 (47.39%), Query Frame = 0
Query: 4 DAVPKPESSNSCCKAWKDKYSKLEEKRNALRQAVKLLEQQINKIQAENLNLKKGYEEEKA 63
+A P+ S N CC AW+ KY ++++R+A ++ V LL++ I + AE NL++ + E
Sbjct: 5 NAPPELASGNPCCLAWQGKYIGMKKRRDAFKEGVTLLQKAIENVNAEKSNLERKFGE--- 64
Query: 64 RASIEREGKDKESTIRVSLEREILDLKSQISSLEQN-NVNAVNDHGEVEHLNALVAECKK 123
+ + + K+ ST++ SLE+EI LK +I SL+Q N E + L + +K
Sbjct: 65 -MATDGDTKENGSTVKASLEKEISRLKFEIVSLQQKLERNLKEKSEETKLLQDQASGREK 124
Query: 124 EISQLKELLEIEKRRTDAEKKNAE-------------VRKEEAAQAWKTGK--------- 183
EI++L++LL+ E R D+ ++ E V+ EE Q K
Sbjct: 125 EINELRDLLKKETLRADSSEEEREHAFKELNKAKALIVKDEEIEQDIPEVKREISLVKNL 184
Query: 184 --VERSKTSDSRKFHKTEMDKANEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVE 243
ER KT RK ++E KA++ +L +L+ +T L + TS L + K+LE+E
Sbjct: 185 LASERQKTESERKKAESEKKKADKYLSELEVLRNSAHKTSSDLLTLTSNLETVKKQLELE 244
Query: 244 KQNTFKEKERADSEMSKAEDSRMRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQV 303
KQ T KEK+RAD E +KA D AE + K+ E ++R E+L K++ + + QV
Sbjct: 245 KQKTLKEKKRADMESAKARDQMKLAEDVSKKFEIVRARNEELKKEM-----ESQTASSQV 304
Query: 304 KELQAS-KISIEARCRQLDKKTDSKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMR 363
K + S K+ + R +++KKT
Sbjct: 305 KFAENSEKLEEKIRLLEMNKKT-------------------------------------- 364
Query: 364 HKMVGDLAIMKEKPMDFKQMTSEDLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEEL 423
AMD + R D L+ +L+E + E L
Sbjct: 365 -----------------------------------AMDWKSRTDDLTQQLQEAQLVAEGL 424
Query: 424 QKKLREFKSCRKLVDA-SVSSEHATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHR 483
+K++ E +K + S+S + E+AEM+LLKKK+KFE+ +H++ VA EK R
Sbjct: 425 KKQVHELSLSQKSIKTHSISPQKVRDLEKAEMRLLKKKMKFERNCAKHSQTVAKFEKFRR 484
Query: 484 SMIQQDLGRFKLEFVQLSNHLDTIHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIE 543
++LGR KLEF L+N ++ + + ST + L K L L S+K+ +
Sbjct: 485 EFQCEELGRLKLEFGSLTNRMNLLDEYFSTDVEGTAGLGKATGCRKLLTLNSQKNRNGEK 544
Query: 544 PFQT---WMPESNFRRMIPQHGAPLLSSEGNHVT-SVSGIESRLESLPGGSNRKMLQSCA 603
+ S ++ + A L+S G V+ SVSG S+LES GGS + L S
Sbjct: 545 HSDARCKLVASSGYQEQACKLSAHLISKSGRGVSESVSGTISQLESPTGGSRK--LPSSG 604
Query: 604 VNSSTASFSDGQLVGSQDKAGLCL-TAAKLVGENLNVQPKVSNLTGELSKMKSNENLAMV 663
V SS SFSDGQL+ SQ + + T+A++ + N+QP S++ ++S N NL +V
Sbjct: 605 VISSATSFSDGQLLASQGREQFSVTTSAEIAKDKPNIQPTKSSMLQKISDTSKNGNLCLV 664
Query: 664 AENSVRSPIKDHVGRANEKQKKRKRTIETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRP 723
AEN ++ +D +E +KRKR +E V S ++L KK + I EK+ L ++
Sbjct: 665 AENYLQRCQRD----IHENSRKRKRMLEAVVSHKHLASGDKKKNLPIGEKMGTLQSMIVG 724
Query: 724 T-EKPSRKSEDVISNPPQ--DSSADKKFRKKTNSLYQKKIKMQGLLDNDEIKLNKVDTEV 783
T +PS K E ++ Q S+ D KK +KKI +Q L+ ++
Sbjct: 725 TGSRPSEKEETLVPPDRQGGSSAIDITVSKKRRVSCKKKIIVQNSLEFNQ---------- 784
Query: 784 CVPESVSRQPSQPVSKLTDSCQPCLEELDNSVISELQILETFGNIAGGDYMKLLDLDSAA 843
+ P K T CL + L E F A DYMKLL+LD+
Sbjct: 785 -----SGKTPGNIAGKTT-----CLSTATGHDVKTL-FSEDF---AATDYMKLLELDNLE 844
Query: 844 DEKCYRRAMEMPLSPSLPDICVPGTETSALNDFEPLVDEFHKKLPKEKGQPHSHSYDVID 903
+E Y+ A E LSP LP + G E + E P ID
Sbjct: 845 EENYYQMARESLLSPDLPQVDFLGCE-----------------IMNEDKNPAR----AID 904
Query: 904 VEIKSNYTQSCNSGLLRDIHSSKRQVDPCLMLGSHGSDLCDIVQAEKICLDKVGVIVEMP 963
+ + NS LR+ S ++ + + V VEMP
Sbjct: 905 L-------AASNSMYLRETILSS--------------------ESPSLNTQNISVTVEMP 964
Query: 964 GTEFSLSGCEGVETSEIKSGSQDNFVPDFCVLFSNAEDCHSISRIFSATRACIKRISLTS 1023
L G + + F ++FSN ED +SI I AT C++R +
Sbjct: 965 PMLKPLHG---------------HLLKHF-IVFSNIEDQNSIIIIIHATNNCLQRCPSVT 1024
Query: 1024 QKEWMVQEILAALNMEHRLLPKEKTCVFVSLVLLNFTVVAMHRYGNFLNCK--TCMDSFS 1083
+++W V IL++L ME LL +E+ CVF+SL+L NF++V + GN LN +C+DSFS
Sbjct: 1025 KEQWAVPAILSSLKMEENLLAQERACVFLSLLLHNFSMVHTTKTGNTLNVDSFSCLDSFS 1084
Query: 1084 GHISEAMLNVETRSLFAELCCLDELLALIEDFIIDGRVLSCTDASFETLTEGGVRVNIPI 1143
HI M + E + + +ELL L++D + RVL +S ET E + + + +
Sbjct: 1085 KHIRGVMADTEAGVMLSGFS--EELLCLLQDLLSGQRVLFSVKSS-ET-CESDLSIPVTL 1144
Query: 1144 DGVDRTLSFVPASADYLIAGSSILASISKAVDRTGLLWEVSYSILRICRYE-SSQMLTML 1203
+G + L A D L+AGS+ILA+I A+DR G + E S+ IL +E +S +LT+L
Sbjct: 1145 NGENVALVNKIALTDQLVAGSAILAAICTALDRIGYICEASFEILHKYSHEKTSVLLTIL 1204
Query: 1204 HIFAHIGGDQFFSLEGYSILMAVLKSIIMHLE--LVGTSDDATFTLPKRNCRTEFVQCAN 1263
H+FA+I G++ + I +AVLK I+M LE GT + ++ P +N
Sbjct: 1205 HVFAYIAGEKMVLSSEHGISIAVLKYIVMFLENKHFGTVEGSSRLHPGKN---------K 1264
Query: 1264 CPFSEEVMSMPMVVSFLLQLILKNMSNGIMAEDLENATSSSNLKSLFERNIQIPSENSSC 1323
CPFS+ S+ + S L++++ + + + + L + SS+L E +
Sbjct: 1265 CPFSDRSSSLEAMASKLMEILQEFTESNTLHKSLTGSLGSSHL------------EKTEF 1295
Query: 1324 KEVHPALYLDCDASCCLKKYKVSDDQPCFFFNPTLCDVSDAISLVELIACYMSWNWTCVN 1383
+ H D C L + DQ + LCD+ +SLVELIACY +W+WT N
Sbjct: 1325 RPAHK------DFQCVLTR-----DQ-----SINLCDI---LSLVELIACYTAWDWTSAN 1295
Query: 1384 IISQLLELL--KSSVKESLAIVILLGQLGRFGVDSGGFEDGGVKILRSNLSEFLCLDTTI 1443
I++ LL++L + S+AIV LLGQL GVD+GG+E+ G+ LR LS FL +TT+
Sbjct: 1385 IVAPLLKMLGMPLPMNLSVAIVSLLGQLSSIGVDAGGYENEGISNLRVKLSAFLQCETTL 1295
Query: 1444 KSGLCVQIAIVSALLGLLPFDFETIVQDK-----GSYPPSLSQYVEVNLIKTWYSLLSSK 1470
K+G VQIA VS+LL L F QDK GS SLS VN++ W SLLS +
Sbjct: 1445 KAGFAVQIATVSSLLKTLQLKFPIDFQDKTTMIPGSGDQSLSG--SVNVVTKWLSLLSKE 1295
BLAST of MC03g0609 vs. TAIR 10
Match:
AT2G34780.2 (maternal effect embryo arrest 22 )
HSP 1 Score: 425.2 bits (1092), Expect = 2.1e-118
Identity = 445/1445 (30.80%), Postives = 677/1445 (46.85%), Query Frame = 0
Query: 72 KDKESTIRVSLEREILDLKSQISSLEQN-NVNAVNDHGEVEHLNALVAECKKEISQLKEL 131
K+ ST++ SLE+EI LK +I SL+Q N E + L + +KEI++L++L
Sbjct: 8 KENGSTVKASLEKEISRLKFEIVSLQQKLERNLKEKSEETKLLQDQASGREKEINELRDL 67
Query: 132 LEIEKRRTDAEKKNAE-------------VRKEEAAQAWKTGK-----------VERSKT 191
L+ E R D+ ++ E V+ EE Q K ER KT
Sbjct: 68 LKKETLRADSSEEEREHAFKELNKAKALIVKDEEIEQDIPEVKREISLVKNLLASERQKT 127
Query: 192 SDSRKFHKTEMDKANEGRQQLGMLKKECEETKLKLASETSKLMELIKELEVEKQNTFKEK 251
RK ++E KA++ +L +L+ +T L + TS L + K+LE+EKQ T KEK
Sbjct: 128 ESERKKAESEKKKADKYLSELEVLRNSAHKTSSDLLTLTSNLETVKKQLELEKQKTLKEK 187
Query: 252 ERADSEMSKAEDSRMRAEAIRKQAEGEKSRAEKLLKQLGRKNSKIEELQKQVKELQAS-K 311
+RAD E +KA D AE + K+ E ++R E+L K++ + + QVK + S K
Sbjct: 188 KRADMESAKARDQMKLAEDVSKKFEIVRARNEELKKEM-----ESQTASSQVKFAENSEK 247
Query: 312 ISIEARCRQLDKKTDSKAVEQNDKPLSETIQRDANEIKLVYDLLKAKEVNMRHKMVGDLA 371
+ + R +++KKT
Sbjct: 248 LEEKIRLLEMNKKT---------------------------------------------- 307
Query: 372 IMKEKPMDFKQMTSEDLKNRIGIYHKKAMDEQCRADKLSFELEENKRKVEELQKKLREFK 431
AMD + R D L+ +L+E + E L+K++ E
Sbjct: 308 ---------------------------AMDWKSRTDDLTQQLQEAQLVAEGLKKQVHELS 367
Query: 432 SCRKLVDA-SVSSEHATSSERAEMKLLKKKLKFEKERVEHARQVANLEKTHRSMIQQDLG 491
+K + S+S + E+AEM+LLKKK+KFE+ +H++ VA EK R ++LG
Sbjct: 368 LSQKSIKTHSISPQKVRDLEKAEMRLLKKKMKFERNCAKHSQTVAKFEKFRREFQCEELG 427
Query: 492 RFKLEFVQLSNHLDTIHKLASTGAKDNHDLEKVWNAENLHRLYSKKHIQAIEPFQT---W 551
R KLEF L+N ++ + + ST + L K L L S+K+ +
Sbjct: 428 RLKLEFGSLTNRMNLLDEYFSTDVEGTAGLGKATGCRKLLTLNSQKNRNGEKHSDARCKL 487
Query: 552 MPESNFRRMIPQHGAPLLSSEGNHVT-SVSGIESRLESLPGGSNRKMLQSCAVNSSTASF 611
+ S ++ + A L+S G V+ SVSG S+LES GGS + L S V SS SF
Sbjct: 488 VASSGYQEQACKLSAHLISKSGRGVSESVSGTISQLESPTGGSRK--LPSSGVISSATSF 547
Query: 612 SDGQLVGSQDKAGLCL-TAAKLVGENLNVQPKVSNLTGELSKMKSNENLAMVAENSVRSP 671
SDGQL+ SQ + + T+A++ + N+QP S++ ++S N NL +VAEN ++
Sbjct: 548 SDGQLLASQGREQFSVTTSAEIAKDKPNIQPTKSSMLQKISDTSKNGNLCLVAENYLQRC 607
Query: 672 IKDHVGRANEKQKKRKRTIETVESIEYLYHKSKKMHSQIEEKLSLLHTLNRPT-EKPSRK 731
+D +E +KRKR +E V S ++L KK + I EK+ L ++ T +PS K
Sbjct: 608 QRD----IHENSRKRKRMLEAVVSHKHLASGDKKKNLPIGEKMGTLQSMIVGTGSRPSEK 667
Query: 732 SEDVISNPPQ--DSSADKKFRKKTNSLYQKKIKMQGLLDNDEIKLNKVDTEVCVPESVSR 791
E ++ Q S+ D KK +KKI +Q L+ ++ +
Sbjct: 668 EETLVPPDRQGGSSAIDITVSKKRRVSCKKKIIVQNSLEFNQ---------------SGK 727
Query: 792 QPSQPVSKLTDSCQPCLEELDNSVISELQILETFGNIAGGDYMKLLDLDSAADEKCYRRA 851
P K T CL + L E F A DYMKLL+LD+ +E Y+ A
Sbjct: 728 TPGNIAGKTT-----CLSTATGHDVKTL-FSEDF---AATDYMKLLELDNLEEENYYQMA 787
Query: 852 MEMPLSPSLPDICVPGTETSALNDFEPLVDEFHKKLPKEKGQPHSHSYDVIDVEIKSNYT 911
E LSP LP + G E + E P ID+
Sbjct: 788 RESLLSPDLPQVDFLGCE-----------------IMNEDKNPAR----AIDL------- 847
Query: 912 QSCNSGLLRDIHSSKRQVDPCLMLGSHGSDLCDIVQAEKICLDKVGVIVEMPGTEFSLSG 971
+ NS LR+ S ++ + + V VEMP L G
Sbjct: 848 AASNSMYLRETILSS--------------------ESPSLNTQNISVTVEMPPMLKPLHG 907
Query: 972 CEGVETSEIKSGSQDNFVPDFCVLFSNAEDCHSISRIFSATRACIKRISLTSQKEWMVQE 1031
+ + F ++FSN ED +SI I AT C++R ++++W V
Sbjct: 908 ---------------HLLKHF-IVFSNIEDQNSIIIIIHATNNCLQRCPSVTKEQWAVPA 967
Query: 1032 ILAALNMEHRLLPKEKTCVFVSLVLLNFTVVAMHRYGNFLNCK--TCMDSFSGHISEAML 1091
IL++L ME LL +E+ CVF+SL+L NF++V + GN LN +C+DSFS HI M
Sbjct: 968 ILSSLKMEENLLAQERACVFLSLLLHNFSMVHTTKTGNTLNVDSFSCLDSFSKHIRGVMA 1027
Query: 1092 NVETRSLFAELCCLDELLALIEDFIIDGRVLSCTDASFETLTEGGVRVNIPIDGVDRTLS 1151
+ E + + +ELL L++D + RVL +S ET E + + + ++G + L
Sbjct: 1028 DTEAGVMLSGFS--EELLCLLQDLLSGQRVLFSVKSS-ET-CESDLSIPVTLNGENVALV 1087
Query: 1152 FVPASADYLIAGSSILASISKAVDRTGLLWEVSYSILRICRYE-SSQMLTMLHIFAHIGG 1211
A D L+AGS+ILA+I A+DR G + E S+ IL +E +S +LT+LH+FA+I G
Sbjct: 1088 NKIALTDQLVAGSAILAAICTALDRIGYICEASFEILHKYSHEKTSVLLTILHVFAYIAG 1147
Query: 1212 DQFFSLEGYSILMAVLKSIIMHLE--LVGTSDDATFTLPKRNCRTEFVQCANCPFSEEVM 1271
++ + I +AVLK I+M LE GT + ++ P +N CPFS+
Sbjct: 1148 EKMVLSSEHGISIAVLKYIVMFLENKHFGTVEGSSRLHPGKN---------KCPFSDRSS 1207
Query: 1272 SMPMVVSFLLQLILKNMSNGIMAEDLENATSSSNLKSLFERNIQIPSENSSCKEVHPALY 1331
S+ + S L++++ + + + + L + SS+L E + + H
Sbjct: 1208 SLEAMASKLMEILQEFTESNTLHKSLTGSLGSSHL------------EKTEFRPAHK--- 1234
Query: 1332 LDCDASCCLKKYKVSDDQPCFFFNPTLCDVSDAISLVELIACYMSWNWTCVNIISQLLEL 1391
D C L + DQ + LCD+ +SLVELIACY +W+WT NI++ LL++
Sbjct: 1268 ---DFQCVLTR-----DQ-----SINLCDI---LSLVELIACYTAWDWTSANIVAPLLKM 1234
Query: 1392 L--KSSVKESLAIVILLGQLGRFGVDSGGFEDGGVKILRSNLSEFLCLDTTIKSGLCVQI 1451
L + S+AIV LLGQL GVD+GG+E+ G+ LR LS FL +TT+K+G VQI
Sbjct: 1328 LGMPLPMNLSVAIVSLLGQLSSIGVDAGGYENEGISNLRVKLSAFLQCETTLKAGFAVQI 1234
Query: 1452 AIVSALLGLLPFDFETIVQDK-----GSYPPSLSQYVEVNLIKTWYSLLSSKQKELSCNI 1470
A VS+LL L F QDK GS SLS VN++ W SLLS +Q+ +
Sbjct: 1388 ATVSSLLKTLQLKFPIDFQDKTTMIPGSGDQSLSG--SVNVVTKWLSLLSKEQRVFAFEF 1234
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022140057.1 | 0.0 | 99.86 | uncharacterized protein LOC111010806 [Momordica charantia] >XP_022140058.1 uncha... | [more] |
XP_023519446.1 | 0.0 | 75.56 | uncharacterized protein LOC111782859 isoform X1 [Cucurbita pepo subsp. pepo] >XP... | [more] |
KAG7019455.1 | 0.0 | 75.49 | hypothetical protein SDJN02_18416 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022927023.1 | 0.0 | 75.42 | myosin heavy chain, non-muscle-like [Cucurbita moschata] >XP_022927024.1 myosin ... | [more] |
XP_023000919.1 | 0.0 | 75.36 | uncharacterized protein LOC111495215 [Cucurbita maxima] >XP_023000920.1 uncharac... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1CE16 | 0.0 | 99.86 | uncharacterized protein LOC111010806 OS=Momordica charantia OX=3673 GN=LOC111010... | [more] |
A0A6J1EFZ6 | 0.0 | 75.42 | myosin heavy chain, non-muscle-like OS=Cucurbita moschata OX=3662 GN=LOC11143397... | [more] |
A0A6J1KH58 | 0.0 | 75.36 | uncharacterized protein LOC111495215 OS=Cucurbita maxima OX=3661 GN=LOC111495215... | [more] |
A0A6J1GRR8 | 0.0 | 74.54 | restin homolog isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456908 PE=4 SV=... | [more] |
A0A6J1JWM7 | 0.0 | 74.27 | protein MLP1-like OS=Cucurbita maxima OX=3661 GN=LOC111490360 PE=4 SV=1 | [more] |